-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Precomputed visual features pytorch FasterRcnn VG #34
Comments
Hi, does it work? |
It is the same model as the one the repo uses to extract but rather than dealing with caffe (can be painful) it is on pytorch. |
I have run it by myself, and the feature is different with the author's. In fact, it cannot converge when training. Have you try it? |
@LetsGoFir Yes I have tried. The training seems to converge but the results are not as good as the ones with the original caffe feats. What I got: 63.6 | 86.4 | 92.1 47.6 | 75 | 83.3 If you find some other good repo to use as feature extractor let me know. Good luck! Probably there is another repo that implements this Faster R CNN on VG in a better manner. |
Sign, my training cannot converge, can you give me your f30k_test.json and f30k_test.npy? I want to know where it goes wrong. |
Well atm I don't have the npy file with the features. Nonetheless, in literature the most common split used in Flickr30K is the Karpathy split (https://cs.stanford.edu/people/karpathy/deepimagesent/). Good luck |
I have tried Anderson's caffe feature and it is good. |
hello , Do you have the image bboxs of the features of Flickr30k |
I hope the bbox is be set in the faster-rcnn Caffe |
Hello @kuanghuei ,
First congrats for your paper and repo.
Since your repo uses the precomputed visual features from the Bottom-up model, but sadly if somebody wants to use your model in another dataset they have to install caffe (sometimes painful).
I have found a repo:
https://github.com/shilrley6/Faster-R-CNN-with-model-pretrained-on-Visual-Genome
That offers the Faster RCNN trained on VG as well as the data preprocessing toolkit (tsv and numpy) to extract features on another dataset on pytorch :) .
Maybe this info is helpful for any other that runs into the same problem. You can either add this info in your Readme as an optional way of extracting the visual features.
I am testing it out myself atm.
Cheers.
The text was updated successfully, but these errors were encountered: