Precomputed visual features pytorch FasterRcnn VG #34

AndresPMD · 2020-05-05T22:25:37Z

First congrats for your paper and repo.
Since your repo uses the precomputed visual features from the Bottom-up model, but sadly if somebody wants to use your model in another dataset they have to install caffe (sometimes painful).
I have found a repo:
https://github.com/shilrley6/Faster-R-CNN-with-model-pretrained-on-Visual-Genome

That offers the Faster RCNN trained on VG as well as the data preprocessing toolkit (tsv and numpy) to extract features on another dataset on pytorch :) .

Maybe this info is helpful for any other that runs into the same problem. You can either add this info in your Readme as an optional way of extracting the visual features.

I am testing it out myself atm.

Cheers.

LetsGoFir · 2020-05-27T07:50:03Z

Hi, does it work?

AndresPMD · 2020-05-27T12:04:34Z

It is the same model as the one the repo uses to extract but rather than dealing with caffe (can be painful) it is on pytorch.
Check that repo and read the code :)

LetsGoFir · 2020-05-29T07:39:45Z

It is the same model as the one the repo uses to extract but rather than dealing with caffe (can be painful) it is on pytorch.
Check that repo and read the code :)

I have run it by myself, and the feature is different with the author's. In fact, it cannot converge when training. Have you try it?

AndresPMD · 2020-05-29T12:53:55Z

@LetsGoFir Yes I have tried. The training seems to converge but the results are not as good as the ones with the original caffe feats.
Performance on Flickr30K (Paper)
(R@1, R@5, R10, Caption Retrieval and Image Retrieval)
VSRN (ours)71.3 90.6 96.0 54.7 81.8 88.2

What I got:

63.6 | 86.4 | 92.1 47.6 | 75 | 83.3

If you find some other good repo to use as feature extractor let me know.

Good luck!

Probably there is another repo that implements this Faster R CNN on VG in a better manner.

LetsGoFir · 2020-05-30T02:42:01Z

@LetsGoFir Yes I have tried. The training seems to converge but the results are not as good as the ones with the original caffe feats.
Performance on Flickr30K (Paper)
(R@1, R@5, R10, Caption Retrieval and Image Retrieval)
VSRN (ours)71.3 90.6 96.0 54.7 81.8 88.2

What I got:

63.6 | 86.4 | 92.1 47.6 | 75 | 83.3

If you find some other good repo to use as feature extractor let me know.

Good luck!

Probably there is another repo that implements this Faster R CNN on VG in a better manner.

Sign, my training cannot converge, can you give me your f30k_test.json and f30k_test.npy? I want to know where it goes wrong.
And I am extracting the feature by bottom-up-attention caffe, but with python3. The feature seems like different with the author's, again.
But the shapes match well.
:(

AndresPMD · 2020-05-31T02:29:54Z

Well atm I don't have the npy file with the features. Nonetheless, in literature the most common split used in Flickr30K is the Karpathy split (https://cs.stanford.edu/people/karpathy/deepimagesent/).
You can donwload the anns files from the link,

Good luck

LetsGoFir · 2020-06-02T08:12:54Z

Well atm I don't have the npy file with the features. Nonetheless, in literature the most common split used in Flickr30K is the Karpathy split (https://cs.stanford.edu/people/karpathy/deepimagesent/).
You can donwload the anns files from the link,

Good luck

I have tried Anderson's caffe feature and it is good.

184446223 · 2022-11-20T05:06:02Z

你好，有效果吗？

hello , Do you have the image bboxs of the features of Flickr30k

184446223 · 2022-11-20T05:07:04Z

好吧，atm 我没有具有这些功能的 npy 文件。尽管如此，在文献中，Flickr30K 中最常用的拆分是 Karpathy 拆分 ( https://cs.stanford.edu/people/karpathy/deepimagesent/ )。
您可以从链接下载 anns 文件，
祝你好运

我已经尝试过 Anderson 的咖啡功能，它很好。

I hope the bbox is be set in the faster-rcnn Caffe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Precomputed visual features pytorch FasterRcnn VG #34

Precomputed visual features pytorch FasterRcnn VG #34

AndresPMD commented May 5, 2020

LetsGoFir commented May 27, 2020

AndresPMD commented May 27, 2020

LetsGoFir commented May 29, 2020 •

edited

Loading

AndresPMD commented May 29, 2020

LetsGoFir commented May 30, 2020

AndresPMD commented May 31, 2020

LetsGoFir commented Jun 2, 2020 •

edited

Loading

184446223 commented Nov 20, 2022

184446223 commented Nov 20, 2022

Precomputed visual features pytorch FasterRcnn VG #34

Precomputed visual features pytorch FasterRcnn VG #34

Comments

AndresPMD commented May 5, 2020

LetsGoFir commented May 27, 2020

AndresPMD commented May 27, 2020

LetsGoFir commented May 29, 2020 • edited Loading

AndresPMD commented May 29, 2020

LetsGoFir commented May 30, 2020

AndresPMD commented May 31, 2020

LetsGoFir commented Jun 2, 2020 • edited Loading

184446223 commented Nov 20, 2022

184446223 commented Nov 20, 2022

LetsGoFir commented May 29, 2020 •

edited

Loading

LetsGoFir commented Jun 2, 2020 •

edited

Loading