kaggle-global-wheat-detection: 9th place solution for Kaggle Global Wheat Detection

118 files in total

py: 96

sh: 10

md: 1

computer-vision

deep-learning

kaggle

object-detection

mmdetection

Uploaded 2021-05-04 18:43:11, 119KB ZIP


kaggle-global-wheat-detection: 9th place solution for Kaggle Global Wheat Detection (118 files)

setup.cfg 200B

Dockerfile 1KB

.dockerignore 1KB

.gitignore 1KB

.gitkeep 0B

requirements.in 219B

LICENSE 10KB

Makefile 1KB

README.md 11KB

gfl_head.py 29KB

resnet.py 18KB

transforms.py 11KB

collect_images.py 8KB

resnext.py 8KB

albumentations.py 8KB

run.py 7KB

test.py 7KB

atss_assigner.py 7KB

upgrade_model_version.py 6KB

train.py 6KB

atss.py 6KB

cascade_rcnn_r50_fpn.py 5KB

wheat_detection.py 5KB

sepc.py 5KB

net.py 5KB

script_template.py 4KB

evaluation.py 4KB

models.py 4KB

wheat_detection_mstrain.py 4KB

wheat_detection_mstrain_light.py 4KB

test_evaluation.py 4KB

pipeline.py 4KB

wbf.py 4KB

rfp.py 3KB

sepc_dconv.py 3KB

saconv.py 3KB

detectors_r50_ga_mstrain_local_pseudo.py 3KB

coco.py 3KB

calculate_distance.py 3KB

kaggle2coco.py 3KB

submit.py 3KB

split_folds.py 3KB

visualization.py 2KB

function.py 2KB

cross_entropy_loss.py 2KB

conv_aws.py 2KB

universe_r101_gfl.py 2KB

spike2kaggle.py 2KB

wheat_detection_mstrain_hard.py 2KB

crop.py 2KB

test_aug.py 2KB

generate.py 2KB

collect_bboxes.py 2KB

coco2crop.py 2KB

res2net.py 2KB

prepare_weights.py 1KB

submission.py 1KB

logging.py 1KB

source_balanced_dataset.py 1KB

images2coco.py 1KB

__init__.py 1KB

prepare_pseudo.py 1KB

loading.py 1KB

kmeans.py 1KB

patches.py 1KB

select_anchors.py 1KB

detectors_r50_ga.py 833B

sources.py 807B

build.py 624B

rm_optimizer.py 604B

universe_r101_gfl_mstrain_private_pseudo.py 527B

detectors_r50_ga_mstrain_private_pseudo.py 526B

universe_r101_gfl_mstrain_stage1.py 484B

universe_r101_gfl_mstrain_stage2.py 478B

universe_r101_gfl_mstrain_stage0.py 467B

universe_r101_gfl_mstrain_public_pseudo.py 457B

wheat_detection_mstrain_pseudo.py 449B

detectors_r50_ga_mstrain_stage1.py 371B

detectors_r50_ga_mstrain_stage2.py 365B

utils.py 354B

universe_r101_gfl_mstrain_local_pseudo.py 349B

detectors_r50_ga_mstrain_stage0.py 312B

schedule_1x.py 262B

detectors_r50_ga_mstrain_public_pseudo.py 250B

default_runtime.py 250B

schedule_pseudo.py 161B

schedule_4x.py 95B

setup.py 66B

__init__.py 0B

118 entries in total

# :ear_of_rice: 9th Place Solution of [Global Wheat Detection](https://www.kaggle.com/c/global-wheat-detection)

![](https://www.googleapis.com/download/storage/v1/b/kaggle-forum-message-attachments/o/inbox%2F413252%2F8115a3b84299209abd11cd8e7167e31e%2FSelection_149.png?generation=1598252059641875&alt=media)

- Our team: [Miras Amir](https://www.linkedin.com/in/amirassov/), [Or Katz](https://www.linkedin.com/in/or-katz-9ba885114/), [Shlomo Kashani](https://www.linkedin.com/in/quantscientist)
- [Kaggle post](https://www.kaggle.com/c/global-wheat-detection/discussion/172569)
- Submission kernel: [pseudo ensemble: detectors (3 st)+universenet r10](https://www.kaggle.com/amiras/pseudo-ensemble-detectors-3-st-universenet-r10)

# Solution

## Summary

Our solution is based on the excellent [MMDetection framework](https://github.com/open-mmlab/mmdetection). We trained an ensemble of the following models:

- [DetectoRS with the ResNet50 backbone](https://github.com/joe-siyuan-qiao/DetectoRS)
- [UniverseNet+GFL with the Res2Net101 backbone](https://github.com/shinya7y/UniverseNet)

To increase the score, a single round of pseudo labelling was applied to each model. Additionally, for a much better generalization of our models, we used heavy augmentations.

![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2Fc2f34231c3dbffc0e8335b8d9cb15898%2FSelection_120.png?generation=1596619828315578&alt=media)

## Jigsaw puzzles

In the original corpus provided by the organizers, the training images were cropped from an original set of larger images. Therefore, we collected and assembled the original puzzles, resulting in a corpus of 1330 puzzle images. The puzzle collection algorithm we adopted was based on [this code](https://github.com/lRomul/argus-tgs-salt/blob/master/mosaic/create_mosaic.py). However, we were unable to collect the bounding boxes for the puzzles, mainly because some bounding boxes lie on or near the image border. For this reason, we generated crops of the puzzles offline, in addition to the training images, and generated boxes for them using pseudo labelling.

## Validation approach

We used MultilabelStratifiedKFold with 5 folds from [iterative stratification](https://github.com/trent-b/iterative-stratification), stratified by the number of boxes, the median box area and the image source. We guaranteed that there was no leak between the folds: all images of one puzzle were placed in the same fold.

The [paper](https://arxiv.org/abs/2005.02162) shows wheat heads from the different sources. We assumed that the wheat heads of the `usask_1, ethz_1` sources are very different from those of the test sources (`UTokyo_1, UTokyo_2, UQ_1, NAU_1`). Therefore, we did not use these sources for validation.

However, our validation scores did not correlate well with the Kaggle LB. We only noticed global improvements (for example, DetectoRS is better than UniverseNet).
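
As an illustration of the leak-free split described in this section, the sketch below assigns folds at the puzzle level and then propagates them to the crops, so every crop of a puzzle shares one fold. It is a simplified stand-in, not the team's code: it does not call `MultilabelStratifiedKFold`, and it balances only the box count by dealing sorted puzzles out round-robin. The function name `assign_folds` and the two input dictionaries are hypothetical.

```python
from collections import defaultdict


def assign_folds(image_to_puzzle, image_to_num_boxes, n_folds=5):
    """Map each image id to a fold index; all images of a puzzle share a fold.

    Hypothetical helper: a crude substitute for iterative stratification
    that balances only the total number of boxes per puzzle.
    """
    # Sum the box counts of all crops that belong to the same puzzle.
    boxes_per_puzzle = defaultdict(int)
    for image_id, puzzle_id in image_to_puzzle.items():
        boxes_per_puzzle[puzzle_id] += image_to_num_boxes[image_id]

    # Sort puzzles by box count (ties broken by id) and deal them out
    # round-robin, so each fold receives both sparse and dense puzzles.
    ordered = sorted(boxes_per_puzzle, key=lambda p: (boxes_per_puzzle[p], p))
    puzzle_to_fold = {p: i % n_folds for i, p in enumerate(ordered)}

    # Propagate the puzzle-level fold to every image of that puzzle.
    return {img: puzzle_to_fold[p] for img, p in image_to_puzzle.items()}


# Toy example (made-up ids): 6 puzzles with 2 crops each.
image_to_puzzle = {f"img{i}": f"puzzle{i // 2}" for i in range(12)}
image_to_num_boxes = {f"img{i}": 10 + i for i in range(12)}
folds = assign_folds(image_to_puzzle, image_to_num_boxes, n_folds=5)
```

Stratifying by median box area and image source as well would require a multilabel method such as the iterative-stratification package; the point of the sketch is only that the split is made per puzzle, not per crop.
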
Local improvements, such as changes to augmentation parameters or WBF parameters, did not correlate with the LB. We therefore relied mainly on the LB scores. We trained our models only on the first fold.

## Augmentations

Due to the relatively small size of our training set and the different distribution of the test set, our approach relied heavily on data augmentation. During training, we used an extensive data augmentation protocol:

- Various augmentations from [albumentations](https://albumentations.ai):
    - HorizontalFlip, ShiftScaleRotate, RandomRotate90
    - RandomBrightnessContrast, HueSaturationValue, RGBShift
    - RandomGamma
    - CLAHE
    - Blur, MotionBlur
    - GaussNoise
    - ImageCompression
    - CoarseDropout
- RandomBBoxesSafeCrop. Randomly select N boxes in the image and find their union. Then crop the image so that this union is kept.
- [Image colorization](https://www.kaggle.com/orkatz2/pytorch-pix-2-pix-for-image-colorization)
  ![](https://www.googleapis.com/download/storage/v1/b/kaggle-forum-message-attachments/o/inbox%2F413252%2Fe469741f86a27bad9e9f23c22fb758f1%2Fcolored.jpg?generation=1598080876141360&alt=media)
- [Style transfer](https://github.com/bethgelab/stylize-datasets). A random image from a small test set (10 images) was used as the style.
  ![](https://www.googleapis.com/download/storage/v1/b/kaggle-forum-message-attachments/o/inbox%2F413252%2F66df1a63023a5636e960f0a9b04c4850%2FBeFunky-collage.jpg?generation=1597411461053544&alt=media)
- Mosaic augmentation. `a, b, c, d` are randomly selected images. Then we just do the following:
  ```
  import numpy as np

  # a, b, c, d: image arrays, assumed here to share the same shape (H, W, C)
  top = np.concatenate([a, b], axis=1)            # a and b side by side
  bottom = np.concatenate([c, d], axis=1)         # c and d side by side
  result = np.concatenate([top, bottom], axis=0)  # stack the two rows
  ```
- Mixup augmentation. `a, b` are randomly selected images.
  Then: `result = (a + b) / 2`
- Multi-scale Training. In each iteration, the image scale is randomly sampled from `[(768 + 32 * i, 768 + 32 * i) for i in range(25)]`.
- All augmentations except colorization and style transfer were applied online.

Examples of augmented images:

![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2F54b1605a052bc34520cce5f7ca81f86f%2F0.jpg?generation=1596619282971124&alt=media) | ![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2Fe7bad4c91782a5614ee5731f3610e376%2F6.jpg?generation=1596619343754665&alt=media)
:-------------------------:|:-------------------------:
![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2F4dfab25fb6ed84e7ef545c42361382c5%2F27.jpg?generation=1596619635969807&alt=media) | ![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2Fcb53ccda22bf16bef104caf9e25459df%2F47.jpg?generation=1596619666130434&alt=media)

## External data

[SPIKE dataset](https://www.kaggle.com/c/global-wheat-detection/discussion/164346)

## Models

We used DetectoRS with ResNet50 and UniverseNet+GFL with Res2Net101 as our main models.
DetectoRS was slightly more accurate but much slower to train than UniverseNet:

- Single DetectoRS Public LB score without pseudo labelling: 0.7592
- Single UniverseNet Public LB score without pseudo labelling: 0.7567

For DetectoRS we used:

- LabelSmoothCrossEntropyLoss with parameter `0.1`
- [Empirical Attention](https://github.com/open-mmlab/mmdetection/tree/master/configs/empirical_attention)

## Training pipeline

In general, we used a multi-stage training pipeline:

![](https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F413252%2F6b28b3ab6458d9a0763d34c78abf15a0%2FSelection_125.png?generation=1596629231468089&alt=media)

## Model inference

We used TTA6 (Test Time Augmentation with 2 scales and 3 flips) for all our models:

- Multi-scale Testing with scales `[(1408, 1408), (1536, 1536)]`
- Flips: `[original, horizontal, vertical]`

For TTA, we used the standard MMDet algorithm with NMS, which looks like this for two-stage detectors (DetectoRS):

![](https://www.googleapis.com/download/storage/v1/b/kaggle-forum-message-attachments/o/inbox%2F413252%2F6fff06749b880222e0c06bea777b8e84%2FSelection_126.png?generation=1597392868005836&alt=media)

For one-stage detectors (UniverseNet), the algorithm is similar, but without the RoiAlign, Head, etc. stages.

## Pseudo labelling

- Sampling positive examples. We ran prediction on each test image and obtained its scores and bounding boxes. Then we calculated `confidence = np.mean(scores > 0.75)`, i.e. the fraction of predicted boxes with a score above 0.75. If the confidence was greater than 0.6, we accepted the image and used it for pseudo labelling.
- Sources `[usask_1, ethz_1]` and augmentations such as mosaic, mixup, colorization and style transfer weren't used for pseudo labelling.
- 1 epoch, 1 round, 1 stage.
- Data: original data + pseudo test data :heavy_multiplication_x: 3

## Ensemble

We used [WBF](https://github.com/ZFTurbo/Weighted-Boxes-Fusion) for the ensemble. The distribution of DetectoRS and UniverseNet scores is different. So we applied scali