Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can it be used for detection task? #26

Open
chenhao10807 opened this issue May 23, 2018 · 1 comment
Open

Can it be used for detection task? #26

chenhao10807 opened this issue May 23, 2018 · 1 comment

Comments

@chenhao10807
Copy link

Thank you for sharing the code.but can it be used for detection task?

@MarcelGabriel1993
Copy link

I know this issue is really old, but I guess some random guy reading this might be interested as well ...

For each image, the 5 best guesses for classification are considered. For each of those 5 guesses (referred to as top 1, top 2 ... etc.) bounding boxes are generated. The ILSVRC challenge allows 5 guesses, because there are sometimes multiple instances of one object in the image, and it is hard to label all instances correctly by hand (imagine, there are thousands of images. See the paper "ImageNet Large Scale Visual Recognition Challenge, Chapter 4: Evaluation at large scale" for more information). So you might not hit the "right" object with your first guess.

Well, you could try to run the CAM algorithm on an image with multiple classes/objects. But what now? Take the first guess for the first object, the guess with second highest confidence for the second object and so on, hoping for some good results? What about images without objects to localize/classify? Those cases do exist as well. The main problem is that you have no clue how many objects there are in the image. I think this algorithm is not meant to be used for detection.

You could try to use some kind of combination with selective search. Get region proposals by selective search, crop those regions from the image, run this algorithm, create bounding boxes and try to remap them on to the original image. Something like that might work...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants