Project Title: "Depth-Guard" – 3D Spatial Occupancy Monitor
1. The Problem
In a smart warehouse, a robot needs to know if a loading zone is clear or occupied. A 2D camera alone can’t tell the difference between a "flat picture of a box" on the floor and an "actual 3D box."
The Goal: Build a Python-based system that uses Computer Vision and Depth Perception (AI 3D) to identify objects and determine their 3D volume (Size) and Distance from the camera.
2. Intern Tasks
Object Detection: Use a pre-trained model (like YOLOv8) to draw 2D boxes around objects.
Depth Mapping: Use a depth estimation model (like MiDaS or a simulated Stereo-depth feed) to calculate how far each object is.
Occupancy Logic: If an object is closer than 1 meter and larger than a specific volume, mark the zone as "BLOCKED."
Alert System: Print a warning if the 3D space is too crowded.
3. Sample Datasets (Simulation)
Since interns may not have 3D cameras (LiDAR/RGB-D), provide these two JSON-style datasets to simulate the camera feed.
Dataset A: Vision Feed (Object & Depth)
This dataset simulates what the AI "sees" in a single frame.
| Object ID | Label | 2D Bounding Box (x,y,w,h) | Avg. Depth Value (Meters) |
| Obj_101 | "Cardboard Box" | [100, 200, 50, 50] | 0.8m |
| Obj_102 | "Human" | [400, 150, 60, 180] | 2.5m |
| Obj_103 | "Pallet" | [600, 500, 100, 30] | 1.2m |
| Obj_104 | "Small Tool" | [250, 300, 10, 10] | 0.5m |
Dataset B: Logic Constraints
The "Rules" the AI must follow to make decisions.
| Parameter | Value | Rule |
| CRITICAL_DISTANCE | 1.0m | Any object closer than this is a "Hazard." |
| MIN_VOLUME_SIZE | 20cm³ | Ignore small objects (like dust or tiny tools). |
| ZONES | Zone A (0-2m) | Categorize objects based on depth ranges. |
4. Success Metrics for Interns
The interns must write a script that processes the data and outputs a Status Report:
Frame Analysis:
Obj_101 (Box): 0.8m away. STATUS: HAZARD (Inside 1.0m limit).
Obj_104 (Tool): 0.5m away. STATUS: IGNORE (Below minimum volume size).
Obj_102 (Human): 2.5m away. STATUS: SAFE (Zone B).
Final Decision: 🚩 Loading Zone Blocked
Simple Tech Recommendation
Language: Python.
Libraries:
OpenCV(for image processing),NumPy(for depth math), andMatplotlib(to plot the objects in a 3D scatter plot).AI Model (Optional): If they want to go beyond the sample data, suggest using MediaPipe or YOLOv8 for real-time webcam testing.
Comments
Post a Comment