
Count people in images with AI-powered detection and segmentation. Monitor crowd density, occupancy limits, and capacity for safety compliance.
The Bynn People Counting model accurately detects and counts individual people in images while providing precise segmentation masks for each person. This model is ideal for crowd analysis, occupancy monitoring, and retail analytics.
Accurate people counting enables critical decisions across industries. Retail stores optimize layouts and staffing based on foot traffic. Venues ensure compliance with occupancy limits for fire safety. Security operations monitor crowd density to prevent dangerous overcrowding. Yet manual counting is slow, error-prone, and impossible at scale.
Traditional counting methods fail in real-world conditions. Overlapping people in crowds are counted as one. Partial visibility at frame edges causes undercounting. Varying poses, clothing, and lighting confuse simple detectors. Accurate counting requires sophisticated detection that identifies individuals even in challenging scenarios—dense crowds, partial occlusion, and varying perspectives.
Connected to CCTV systems, people counting becomes a powerful security tool. Detect unauthorized gatherings in restricted areas. Monitor for groups forming where they shouldn't be—after-hours in office buildings, in secure zones, or in areas with access limits. Real-time alerts when thresholds are exceeded enable rapid security response before situations escalate.
When provided with an image, the detector analyzes the visual content to identify and count distinct individuals. The model uses advanced object detection with instance segmentation, providing both the location of each person and pixel-level masks that precisely outline each individual.
Achieving 97.1% accuracy, the model can reliably count people in various scenarios from single portraits to crowded scenes, providing accurate counts and individual segmentation for each person detected.
The model employs open-vocabulary object detection technology optimized for person detection. For each detected individual, the system returns:
The API returns a structured response containing:
The model detects:
| Metric | Value |
|---|---|
| Detection Accuracy | 97.1% |
| Average Response Time | 5000ms |
| Max File Size | 20MB |
| Supported Formats | GIF, JPEG, JPG, PNG, WebP |
Important Considerations:
This model provides probability scores and counts, not definitive census data.
Best Practice: Use the segmentation masks to visually verify counts and combine with human review for high-stakes counting applications.
Open-vocabulary object detection with segmentation. Detects objects using text prompts.
image_urlstringRequiredURL of the image for object detection
https://example.com/image.jpgObject detection results with bounding boxes and segmentation masks
num_detectionsintegerNumber of objects detected
3detectionsarrayArray of detected objects
bboxobjectBounding box {x1, y1, x2, y2}
{"x1":100,"y1":150,"x2":300,"y2":400}scorefloatDetection confidence (0.0-1.0)
0.95class_idintegerClass index
0class_namestringDetected object class name
personmask_pngstringBase64-encoded PNG segmentation mask
data:image/png;base64,...image_sizeobjectOriginal image dimensions
{
"width": 1920,
"height": 1080
}{
"model": "people-counting",
"image_url": "https://example.com/image.jpg"
}{
"success": true,
"data": {
"num_detections": 2,
"detections": [
{
"bbox": {
"x1": 100,
"y1": 150,
"x2": 300,
"y2": 400
},
"score": 0.95,
"class_id": 0,
"class_name": "person",
"mask_png": "data:image/png;base64,iVBORw0KGgo..."
},
{
"bbox": {
"x1": 400,
"y1": 200,
"x2": 600,
"y2": 450
},
"score": 0.87,
"class_id": 1,
"class_name": "car",
"mask_png": "data:image/png;base64,iVBORw0KGgo..."
}
],
"image_size": {
"width": 1920,
"height": 1080
}
}
}429 HTTP error code along with an error message. You should then retry with an exponential back-off strategy, meaning that you should retry after 4 seconds, then 8 seconds, then 16 seconds, etc.Integrate People Counting into your application today with our easy-to-use API.