Two common approaches are employed in estimating the poses of individuals in a given image.
The processing is done from low to high resolutions, follow the detection of the individual instances in the image first using a bounding box object detector and then focus on determining their poses.
- If the detection of individuals fails, there is no possibility of recovering.
- It is vulnerable when multiple individuals are nearby.
- The computational cost depends on the number of people in the image.
The processing is done from high to low resolutions. It starts by localizing identity free semantic entities and then grouping them into person instance.
💡 Bottom-up approaches overcame 1) early commitment and showed detached run-time complexity from the number of people in the image.
🔥 But, bottom-up approaches face challenges in grouping body parts when there is a large overlap between people.
I implemented some tasks...
- Draw pose landmarks and connect lines(image)
- Draw pose landmarks and connect lines on webcam
- Squat counting using pose landmarks