# yoloe_data_engine **Repository Path**: mirrors_ultralytics/yoloe_data_engine ## Basic Information - **Project Name**: yoloe_data_engine - **Description**: yoloe_data_engine - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-11-16 - **Last Updated**: 2026-02-08 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # pipline of the data engine ### read the grounding data from json file for each sample, per-store the others samples sharing the same image. add the ### model predict and save the jons files - visual to check the json files - found that some boxes are overlapped heavily, with different text - how to deal with these boxes? ### merge model prediction to label, - discard the bbox with higher iou ( > 0.8, higher iou , no consider the class or text) - generate the visual prompt embedding for each instance (bbox) - merge bboxes within the same image ( consider the vpe distance and text similarity ,bbox iou<0.8 ) - transfer to grounding format cache for training - to do: write a tools to visual the bbox ious. within the same images