Annotation methodology

Cogite applies a structured annotation methodology built on four pillars: precise guidelines, double annotation, statistical quality control, and continuous team calibration. This page documents our standards for clients who want to understand in depth how we operate.

1. Annotation guidelines

Every project starts with the joint drafting (with your ML team) of a guidelines document. This reference document covers:

2. Systematic double annotation

On all our Production-plan projects, every sample is annotated independently by two annotators. Divergent annotations are arbitrated by a third senior annotator. This redundancy is essential for final quality and allows measuring inter-annotator agreement.

3. Quality metrics

We systematically measure:

These metrics are communicated in a weekly report sent to your ML team.

4. Continuous calibration

Every week, our annotators participate in calibration sessions where they discuss difficult cases encountered. These sessions are led by the AI project manager and allow refining collective understanding of the guidelines. It's also an opportunity to flag any ambiguity requiring client clarification.

Tools used

We work with the leading annotation tools on the market:

Delivery formats

We deliver in your preferred format: JSON, COCO, Pascal VOC, JSONL, YOLO, MS COCO Keypoints, CSV, or any proprietary format specified in the brief. Deliveries include a dataset card documenting composition, quality statistics and conventions used.