-
Notifications
You must be signed in to change notification settings - Fork 6
/
2020.06.30.txt
1473 lines (1207 loc) · 113 KB
/
2020.06.30.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
==========New Papers==========
1, TITLE: A Tool for Automatic Estimation of Patient Position in Spinal CT Data
http://arxiv.org/abs/2006.15330
AUTHORS: Roman Jakubicek ; Tomas Vicar ; Jiri Chmelik
COMMENTS: 6 pages, 3 figures, submitted on EMBEC 2020; the paper has not been reviewed yet
HIGHLIGHT: This paper presents a tool for automatic rotation of CT data into a standardized (HFS) patient position.
2, TITLE: Compositional Video Synthesis with Action Graphs
http://arxiv.org/abs/2006.15327
AUTHORS: Amir Bar ; Roei Herzig ; Xiaolong Wang ; Gal Chechik ; Trevor Darrell ; Amir Globerson
HIGHLIGHT: Towards this end, we introduce a generative model (AG2Vid) based on Action Graphs, a natural and convenient structure that represents the dynamics of actions between objects over time.
3, TITLE: Chroma Intra Prediction with attention-based CNN architectures
http://arxiv.org/abs/2006.15349
AUTHORS: Marc Górriz ; Saverio Blasi ; Alan F. Smeaton ; Noel E. O'Connor ; Marta Mrak
COMMENTS: 27th IEEE International Conference on Image Processing, 25-28 Oct 2020, Abu Dhabi, United Arab Emirates
HIGHLIGHT: This paper proposes a new neural network architecture for cross-component intra-prediction.
4, TITLE: End-Effect Exploration Drive for Effective Motor Learning
http://arxiv.org/abs/2006.15960
AUTHORS: Emmanuel Daucé
COMMENTS: 6 pages, 3 figures, submitted to IWAI 2020 (1st International Workshop on Active Inference)
HIGHLIGHT: End-Effect Exploration Drive for Effective Motor Learning
5, TITLE: Forgery Detection in a Questioned Hyperspectral Document Image using K-means Clustering
http://arxiv.org/abs/2006.16057
AUTHORS: Maria Yaseen ; Rammal Aftab Ahmed ; Rimsha Mahrukh
COMMENTS: 5 pages,6 figures
HIGHLIGHT: In this paper an extensive ink mismatch detection technique is presented which uses KMean Clustering to identify different inks on the basis of their unique spectral response and separates them into different clusters.
6, TITLE: The Evolutionary Dynamics of Independent Learning Agents in Population Games
http://arxiv.org/abs/2006.16068
AUTHORS: Shuyue Hu ; Chin-Wing Leung ; Ho-fung Leung ; Harold Soh
HIGHLIGHT: This paper presents a formal relation between stochastic processes and the dynamics of independent learning agents who reason based on the reward signals.
7, TITLE: Patch SVDD: Patch-level SVDD for Anomaly Detection and Segmentation
http://arxiv.org/abs/2006.16067
AUTHORS: Jihun Yi ; Sungroh Yoon
HIGHLIGHT: In this paper, we tackle the problem of image anomaly detection and segmentation.
8, TITLE: AerialMPTNet: Multi-Pedestrian Tracking in Aerial Imagery Using Temporal and Graphical Features
http://arxiv.org/abs/2006.15457
AUTHORS: Maximilian Kraus ; Seyed Majid Azimi ; Emec Ercelik ; Reza Bahmanyar ; Peter Reinartz ; Alois Knoll
COMMENTS: ICPR 2020
HIGHLIGHT: In this paper, we propose AerialMPTNet, a novel approach for multi-pedestrian tracking in geo-referenced aerial imagery by fusing appearance features from a Siamese Neural Network, movement predictions from a Long Short-Term Memory, and pedestrian interconnections from a GraphCNN.
9, TITLE: Measuring Memorization Effect in Word-Level Neural Networks Probing
http://arxiv.org/abs/2006.16082
AUTHORS: Rudolf Rosa ; Tomáš Musil ; David Mareček
COMMENTS: Accepted to TSD 2020. Will be published in Springer LNCS
HIGHLIGHT: In our work, we propose a simple general method for measuring the memorization effect, based on a symmetric selection of comparable sets of test words seen versus unseen in training.
10, TITLE: A Deep Reinforced Model for Zero-Shot Cross-Lingual Summarization with Bilingual Semantic Similarity Rewards
http://arxiv.org/abs/2006.15454
AUTHORS: Zi-Yi Dou ; Sachin Kumar ; Yulia Tsvetkov
HIGHLIGHT: In this work, we propose an end-to-end cross-lingual text summarization model.
11, TITLE: Layered Stereo by Cooperative Grouping with Occlusion
http://arxiv.org/abs/2006.16094
AUTHORS: Jialiang Wang ; Todd Zickler
HIGHLIGHT: In this work, we introduce a layered approach to stereo that explicitly incorporates occlusions.
12, TITLE: Frequency learning for image classification
http://arxiv.org/abs/2006.15476
AUTHORS: José Augusto Stuchi ; Levy Boccato ; Romis Attux
HIGHLIGHT: In this context, this paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum.
13, TITLE: MvMM-RegNet: A new image registration framework based on multivariate mixture model and neural network estimation
http://arxiv.org/abs/2006.15573
AUTHORS: Xinzhe Luo ; Xiahai Zhuang
COMMENTS: Accepted for publication at MICCAI 2020
HIGHLIGHT: In this paper, we propose a new image registration framework based on multivariate mixture model (MvMM) and neural network estimation.
14, TITLE: Multi-Person Pose Regression via Pose Filtering and Scoring
http://arxiv.org/abs/2006.15576
AUTHORS: Huixin Miao ; Junqi Lin ; Junjie Cao
COMMENTS: 8 pages, 4 figures
HIGHLIGHT: In this work, we propose an end-to-end network framework for multi-person pose regression to predict the instance-aware keypoints directly.
15, TITLE: Fabric Image Representation Encoding Networks for Large-scale 3D Medical Image Analysis
http://arxiv.org/abs/2006.15578
AUTHORS: Siyu Liu ; Wei Dai ; Craig Engstrom ; Jurgen Fripp ; Peter B. Greer ; Stuart Crozier ; Jason A. Dowling ; Shekhar S Chandra
COMMENTS: 12 pages, 10 figures
HIGHLIGHT: In this work, a novel 3D segmentation network, Fabric Image Representation Networks (FIRENet), is proposed to extract and encode generalisable feature representations from multiple medical image datasets in a large-scale manner.
16, TITLE: Self-Attention Networks for Intent Detection
http://arxiv.org/abs/2006.15585
AUTHORS: Sevinj Yolchuyeva ; Géza Németh ; Bálint Gyires-Tóth
COMMENTS: Proceedings of the International Conference on Recent Advances in Natural Language Processing (RANLP 2019)
HIGHLIGHT: In this paper, we present a novel intent detection system which is based on a self-attention network and a Bi-LSTM.
17, TITLE: BERTology Meets Biology: Interpreting Attention in Protein Language Models
http://arxiv.org/abs/2006.15222
AUTHORS: Jesse Vig ; Ali Madani ; Lav R. Varshney ; Caiming Xiong ; Richard Socher ; Nazneen Fatema Rajani
HIGHLIGHT: Through the lens of attention, we analyze the inner workings of the Transformer and explore how the model discerns structural and functional properties of proteins.
18, TITLE: Perception-Prediction-Reaction Agents for Deep Reinforcement Learning
http://arxiv.org/abs/2006.15223
AUTHORS: Adam Stooke ; Valentin Dalibard ; Siddhant M. Jayakumar ; Wojciech M. Czarnecki ; Max Jaderberg
HIGHLIGHT: We introduce a new recurrent agent architecture and associated auxiliary losses which improve reinforcement learning in partially observable tasks requiring long-term memory.
19, TITLE: A lateral semicircular canal segmentation based geometric calibration for human temporal bone CT Image
http://arxiv.org/abs/2006.15588
AUTHORS: Xiaoguang Li ; Peng Fu ; Hongxia Yin ; ZhenChang Wang ; Li Zhuo ; Hui Zhang
HIGHLIGHT: We propose an automatic calibration algorithm for temporal bone CT images.
20, TITLE: On the Relationship Between Probabilistic Circuits and Determinantal Point Processes
http://arxiv.org/abs/2006.15233
AUTHORS: Honghua Zhang ; Steven Holtzen ; Guy Van den Broeck
HIGHLIGHT: We propose a unified analysis and shared language for discussing DPPs and PCs.
21, TITLE: Computation Offloading in Multi-Access Edge Computing Networks: A Multi-Task Learning Approach
http://arxiv.org/abs/2006.16104
AUTHORS: Bo Yang ; Xuelin Cao ; Joshua Bassey ; Xiangfang Li ; Timothy Kroecker ; Lijun Qian
HIGHLIGHT: In this paper, we propose a dynamic offloading framework for the MEC network, in which the uplink non-orthogonal multiple access (NOMA) is used to enable multiple devices to upload their tasks via the same frequency band.
22, TITLE: Iris Recognition: Inherent Binomial Degrees of Freedom
http://arxiv.org/abs/2006.16107
AUTHORS: J. Michael Rozmus
COMMENTS: 4 pages, 6 figures
HIGHLIGHT: This paper shows by direct pixel-by-pixel comparison of high-quality iris images that the inherent number of degrees of freedom embodied in the human iris, independent of any encoding, is at least 536.
23, TITLE: GramGAN: Deep 3D Texture Synthesis From 2D Exemplars
http://arxiv.org/abs/2006.16112
AUTHORS: Tiziano Portenier ; Siavash Bigdeli ; Orçun Göksel
HIGHLIGHT: We present a novel texture synthesis framework, enabling the generation of infinite, high-quality 3D textures given a 2D exemplar image.
24, TITLE: Attention-Guided Generative Adversarial Network to Address Atypical Anatomy in Modality Transfer
http://arxiv.org/abs/2006.15264
AUTHORS: Hajar Emami ; Ming Dong ; Carri K. Glide-Hurst
HIGHLIGHT: In this paper, we propose a novel spatial attention-guided generative adversarial network (attention-GAN) model to generate accurate synCTs using T1-weighted MRI images as the input to address atypical anatomy.
25, TITLE: PCLNet: A Practical Way for Unsupervised Deep PolSAR Representations and Few-Shot Classification
http://arxiv.org/abs/2006.15351
AUTHORS: Lamei Zhang ; Siyu Zhang ; Bin Zou ; Hongwei Dong
COMMENTS: 16 pages, 16 figures
HIGHLIGHT: To handle this problem, in this paper, learning transferrable representations from unlabeled PolSAR data through convolutional architectures is explored for the first time.
26, TITLE: MiniNet: An extremely lightweight convolutional neural network for real-time unsupervised monocular depth estimation
http://arxiv.org/abs/2006.15350
AUTHORS: Jun Liu ; Qing Li ; Rui Cao ; Wenming Tang ; Guoping Qiu
HIGHLIGHT: To address this issue, we propose a new powerful network with a recurrent module to achieve the capability of a deep network while at the same time maintaining an extremely lightweight size for real-time high performance unsupervised monocular depth prediction from video sequences.
27, TITLE: Coloured noise time series as appropriate models for environmental variation in artificial evolutionary systems
http://arxiv.org/abs/2006.16204
AUTHORS: Matt Grove ; James M. Borg ; Fiona Polack
COMMENTS: 8 pages, 4 figures, 2020 conference on Artificial Life
HIGHLIGHT: Ultimately we argue that Artificial Life as a field should embrace the use of coloured noise to produce models of environmental variability.
28, TITLE: An Evoked Potential-Guided Deep Learning Brain Representation For Visual Classification
http://arxiv.org/abs/2006.15357
AUTHORS: Xianglin Zheng ; Zehong Cao ; Quan Bai
COMMENTS: This paper is submitting to ICONIP 2020
HIGHLIGHT: In this study, we proposed a deep learning framework guided by the visual evoked potentials, called the Event-Related Potential (ERP)-Long short-term memory (LSTM) framework, extracted by EEG signals for visual classification.
29, TITLE: Towards the Study of Morphological Processing of the Tangkhul Language
http://arxiv.org/abs/2006.16212
AUTHORS: Mirinso Shadang ; Navanath Saharia ; Thoudam Doren Singh
COMMENTS: In proceeding of Regional International Conference on Natural Language Processing (regICON) 2017, 3rd and 4th November 2017, Imphal, India
HIGHLIGHT: We use a small corpus collected from different sources of text books, short stories and articles of other topics.
30, TITLE: ReMarNet: Conjoint Relation and Margin Learning for Small-Sample Image Classification
http://arxiv.org/abs/2006.15366
AUTHORS: Xiaoxu Li ; Liyun Yu ; Xiaochen Yang ; Zhanyu Ma ; Jing-Hao Xue ; Jie Cao ; Jun Guo
COMMENTS: IEEE TCSVT 2020
HIGHLIGHT: In this paper, we propose to enhance the discriminative power of features from a new perspective by introducing a novel neural network termed Relation-and-Margin learning Network (ReMarNet).
31, TITLE: MTStereo 2.0: improved accuracy of stereo depth estimation withMax-trees
http://arxiv.org/abs/2006.15373
AUTHORS: Rafael Brandt ; Nicola Strisciuglio ; Nicolai Petkov
HIGHLIGHT: In this paper, we propose a stereo matching method, called MTStereo 2.0, for limited-resource systems that require efficient and accurate depth estimation.
32, TITLE: Liquid Resource Types
http://arxiv.org/abs/2006.16233
AUTHORS: Tristan Knoth ; Di Wang ; Adam Reynolds ; Jan Hoffmann ; Nadia Polikarpova
HIGHLIGHT: This article presents liquid resource types, a technique for automatically verifying the resource consumption of functional programs.
33, TITLE: The Generalized Independent and Dominating Set Problems on Unit Disk Graphs
http://arxiv.org/abs/2006.15381
AUTHORS: Sangram K. Jena ; Ramesh K. Jallu ; Gautam K. Das ; Subhas C. Nandy
HIGHLIGHT: In this article, we study a generalized version of the maximum independent set and minimum dominating set problems, namely, the maximum $d$-distance independent set problem and the minimum $d$-distance dominating set problem on unit disk graphs for a positive integer $d>0$.
34, TITLE: Self-Supervised MultiModal Versatile Networks
http://arxiv.org/abs/2006.16228
AUTHORS: Jean-Baptiste Alayrac ; Adrià Recasens ; Rosalia Schneider ; Relja Arandjelović ; Jason Ramapuram ; Jeffrey De Fauw ; Lucas Smaira ; Sander Dieleman ; Andrew Zisserman
HIGHLIGHT: In this work, we learn representations using self-supervision by leveraging three modalities naturally present in videos: vision, audio and language.
35, TITLE: Light Pose Calibration for Camera-light Vision Systems
http://arxiv.org/abs/2006.15389
AUTHORS: Yifan Song ; Furkan Elibol ; Mengkun She ; David Nakath ; Kevin Köser
HIGHLIGHT: This paper presents a novel light calibration approach by taking multi-view and -distance images of a reference plane in order to provide pose information of the employed light sources to the computer vision system.
36, TITLE: Automated Stitching of Coral Reef Images and Extraction of Features for Damselfish Shoaling Behavior Analysis
http://arxiv.org/abs/2006.15478
AUTHORS: Riza Rae Pineda ; Kristofer delas Peñas ; Dana Manogan
HIGHLIGHT: To effectively analyze shoaling behavior given the issues posed by capturing data in the wild, we propose a pre-processing system that utilizes color correction and image stitching techniques and extracts behavior features for manual analysis.
37, TITLE: Interpretable Deepfake Detection via Dynamic Prototypes
http://arxiv.org/abs/2006.15473
AUTHORS: Loc Trinh ; Michael Tsang ; Sirisha Rambhatla ; Yan Liu
HIGHLIGHT: To this end, we propose Dynamic Prototype Network (DPNet) - a simple, interpretable, yet effective solution that leverages dynamic representations (i.e., prototypes) to explain deepfake visual dynamics.
38, TITLE: Deep Sea Robotic Imaging Simulator for UUV Development
http://arxiv.org/abs/2006.15398
AUTHORS: Yifan Song ; David Nakath ; Mengkun She ; Furkan Elibol ; Kevin Köser
HIGHLIGHT: Thus this paper presents a physical model-based image simulation solution which uses in-air and depth information as inputs to generate underwater images for robotics in deep ocean scenarios.
39, TITLE: Laplacian Regularized Few-Shot Learning
http://arxiv.org/abs/2006.15486
AUTHORS: Imtiaz Masud Ziko ; Jose Dolz ; Eric Granger ; Ismail Ben Ayed
COMMENTS: ICML 2020 paper
HIGHLIGHT: We propose a transductive Laplacian-regularized inference for few-shot tasks.
40, TITLE: Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates
http://arxiv.org/abs/2006.15480
AUTHORS: Ke Sun ; Zigang Geng ; Depu Meng ; Bin Xiao ; Dong Liu ; Zhaoxiang Zhang ; Jingdong Wang
HIGHLIGHT: The typical bottom-up human pose estimation framework includes two stages, keypoint detection and grouping.
41, TITLE: Video Representation Learning with Visual Tempo Consistency
http://arxiv.org/abs/2006.15489
AUTHORS: Ceyuan Yang ; Yinghao Xu ; Bo Dai ; Bolei Zhou
COMMENTS: Technical report. Models are available at https://github.com/decisionforce/VTHCL
HIGHLIGHT: In this work, we demonstrate that visual tempo can also serve as a self-supervision signal for video representation learning.
42, TITLE: Rethinking the Positional Encoding in Language Pre-training
http://arxiv.org/abs/2006.15595
AUTHORS: Guolin Ke ; Di He ; Tie-Yan Liu
HIGHLIGHT: In this work, we investigate the problems in the previous formulations and propose a new positional encoding method for BERT called Transformer with Untied Positional Encoding (TUPE).
43, TITLE: A Framework for Pre-processing of Social Media Feeds based on Integrated Local Knowledge Base
http://arxiv.org/abs/2006.15854
AUTHORS: Taiwo Kolajo ; Olawande Daramola ; Ayodele Adebiyi ; Seth Aaditeshwar
COMMENTS: 38 pages, 5 figures, 6 tables
HIGHLIGHT: This paper proposes an improved framework for pre-processing of social media feeds for better performance.
44, TITLE: MoNet3D: Towards Accurate Monocular 3D Object Localization in Real Time
http://arxiv.org/abs/2006.16007
AUTHORS: Xichuan Zhou ; Yicong Peng ; Chunqiao Long ; Fengbo Ren ; Cong Shi
HIGHLIGHT: The MoNet3D method incorporates prior knowledge of the spatial geometric correlation of neighbouring objects into the deep neural network training process to improve the accuracy of 3D object localization.
45, TITLE: OpenDVC: An Open Source Implementation of the DVC Video Compression Method
http://arxiv.org/abs/2006.15862
AUTHORS: Ren Yang ; Luc Van Gool ; Radu Timofte
COMMENTS: Technical report of OpenDVC; the project page is at https://github.com/RenYang-home/OpenDVC
HIGHLIGHT: We introduce an open source Tensorflow implementation of the Deep Video Compression (DVC) method in this technical report.
46, TITLE: Abnormal activity capture from passenger flow of elevator based on unsupervised learning and fine-grained multi-label recognition
http://arxiv.org/abs/2006.15873
AUTHORS: Chunhua Jia ; Wenhai Yi ; Yu Wu ; Hui Huang ; Lei Zhang ; Leilei Wu
COMMENTS: 9 pages, 8 figures, submitted to 34th Conference on Neural Information Processing System(NeurIPS 2020)
HIGHLIGHT: More specifically in our implementation we propose GraftNet, a solution for fine-grained multi-label recognition task, to recognize human attributes, e.g. gender, age, appearance, and occupation.
47, TITLE: Deep Involutive Generative Models for Neural MCMC
http://arxiv.org/abs/2006.15167
AUTHORS: Span Spanbauer ; Cameron Freer ; Vikash Mansinghka
COMMENTS: 13 pages, 6 figures
HIGHLIGHT: We introduce deep involutive generative models, a new architecture for deep generative modeling, and use them to define Involutive Neural MCMC, a new approach to fast neural MCMC.
48, TITLE: Propagation for Dynamic Continuous Time Chain Event Graphs
http://arxiv.org/abs/2006.15865
AUTHORS: Aditi Shenvi ; Jim Q. Smith
HIGHLIGHT: We present a tractable exact inferential scheme analogous to the scheme in Kj{\ae}rulff (1992) for discrete Dynamic Bayesian Networks (DBNs) which employs standard junction tree inference by "unrolling" the DBN.
49, TITLE: Intrinsic Autoencoders for Joint Neural Rendering and Intrinsic Image Decomposition
http://arxiv.org/abs/2006.16011
AUTHORS: Hassan Abu Alhaija ; Siva Karthik Mustikovela ; Justus Thies ; Matthias Nießner ; Andreas Geiger ; Carsten Rother
HIGHLIGHT: The main contribution of this work is to lift this restriction by training a neural rendering algorithm from unpaired data.
50, TITLE: Compressive MR Fingerprinting reconstruction with Neural Proximal Gradient iterations
http://arxiv.org/abs/2006.15271
AUTHORS: Dongdong Chen ; Mike E. Davies ; Mohammad Golbabaee
COMMENTS: To appear in MICCAI 2020
HIGHLIGHT: To address this, we propose ProxNet, a learned proximal gradient descent framework that directly incorporates the forward acquisition and Bloch dynamic models within a recurrent learning mechanism.
51, TITLE: Explainable 3D Convolutional Neural Networks by Learning Temporal Transformations
http://arxiv.org/abs/2006.15983
AUTHORS: Gabriëlle Ras ; Luca Ambrogioni ; Pim Haselager ; Marcel A. J. van Gerven ; Umut Güçlü
COMMENTS: 10 pages, 5 figures, 4 tables
HIGHLIGHT: In this paper we introduce the temporally factorized 3D convolution (3TConv) as an interpretable alternative to the regular 3D convolution (3DConv).
52, TITLE: 1st Place Solution for Waymo Open Dataset Challenge -- 3D Detection and Domain Adaptation
http://arxiv.org/abs/2006.15505
AUTHORS: Zhuangzhuang Ding ; Yihan Hu ; Runzhou Ge ; Li Huang ; Sijia Chen ; Yu Wang ; Jie Liao
HIGHLIGHT: In this technical report, we introduce our winning solution "HorizonLiDAR3D" for the 3D detection track and the domain adaptation track in Waymo Open Dataset Challenge at CVPR 2020.
53, TITLE: 1st Place Solutions for Waymo Open Dataset Challenges -- 2D and 3D Tracking
http://arxiv.org/abs/2006.15506
AUTHORS: Yu Wang ; Sijia Chen ; Li Huang ; Runzhou Ge ; Yihan Hu ; Zhuangzhuang Ding ; Jie Liao
HIGHLIGHT: This technical report presents the online and real-time 2D and 3D multi-object tracking (MOT) algorithms that reached the 1st places on both Waymo Open Dataset 2D tracking and 3D tracking challenges.
54, TITLE: 2nd Place Solution for Waymo Open Dataset Challenge -- 2D Object Detection
http://arxiv.org/abs/2006.15507
AUTHORS: Sijia Chen ; Yu Wang ; Li Huang ; Runzhou Ge ; Yihan Hu ; Zhuangzhuang Ding ; Jie Liao
HIGHLIGHT: In this report, we introduce a state-of-the-art 2D object detection system for autonomous driving scenarios.
55, TITLE: Shape from Projections via Differentiable Forward Projector
http://arxiv.org/abs/2006.16120
AUTHORS: Jakeoung Koo ; Qiongyang Chen ; J. Andreas Bærentzen ; Anders B. Dahl ; Vedrana A. Dahl
HIGHLIGHT: In this paper, we propose a differentiable forward projector for 3D meshes, to bridge the gap between the forward model for 3D surfaces and optimization.
56, TITLE: Human Activity Recognition based on Dynamic Spatio-Temporal Relations
http://arxiv.org/abs/2006.16132
AUTHORS: Zhenyu Liu ; Yaqiang Yao ; Yan Liu ; Yuening Zhu ; Zhenchao Tao ; Lei Wang ; Yuhong Feng
HIGHLIGHT: In this paper, we develop a method for human activity recognition that tackles these two issues.
57, TITLE: Layer Sparsity in Neural Networks
http://arxiv.org/abs/2006.15604
AUTHORS: Mohamed Hebiri ; Johannes Lederer
HIGHLIGHT: In this paper, we discuss sparsity in the framework of neural networks.
58, TITLE: Localization Uncertainty Estimation for Anchor-Free Object Detection
http://arxiv.org/abs/2006.15607
AUTHORS: Youngwan Lee ; Joong-won Hwang ; Hyung-Il Kim ; Kimin Yun ; Joungyoul Park
HIGHLIGHT: Therefore, we propose a new object detector called Gaussian-FCOS that estimates the localization uncertainty based on an anchor-free detector that captures the uncertainty of similar property with four directions of box offsets (left, right, top, bottom) and avoids the anchor tuning.
59, TITLE: Shadow Removal by a Lightness-Guided Network with Training on Unpaired Data
http://arxiv.org/abs/2006.15617
AUTHORS: Zhihao Liu ; Hui Yin ; Yang Mi ; Mengyang Pu ; Song Wang
COMMENTS: Submitted to IEEE TIP
HIGHLIGHT: In this paper, we present a new Lightness-Guided Shadow Removal Network (LG-ShadowNet) for shadow removal by training on unpaired data.
60, TITLE: Analogical Image Translation for Fog Generation
http://arxiv.org/abs/2006.15618
AUTHORS: Rui Gong ; Dengxin Dai ; Yuhua Chen ; Wen Li ; Luc Van Gool
COMMENTS: 18 pages, 9 figures, 7 tables
HIGHLIGHT: In this work, we are interested in adding adverse weather effects, more specifically fog effects, to images taken in clear weather.
61, TITLE: Offline Handwritten Chinese Text Recognition with Convolutional Neural Networks
http://arxiv.org/abs/2006.15619
AUTHORS: Brian Liu ; Xianchao Xu ; Yu Zhang
COMMENTS: 6 pages, 5 figures, and 3 tables
HIGHLIGHT: In this paper, we build the models using only the convolutional neural networks and use CTC as the loss function.
62, TITLE: Active Finite Reward Automaton Inference and Reinforcement Learning Using Queries and Counterexamples
http://arxiv.org/abs/2006.15714
AUTHORS: Zhe Xu ; Bo Wu ; Daniel Neider ; Ufuk Topcu
HIGHLIGHT: We propose an active learning approach that iteratively infers finite reward automata and performs RL (specifically, q-learning) based on the inferred finite reward automata.
63, TITLE: Improving VQA and its Explanations \\ by Comparing Competing Explanations
http://arxiv.org/abs/2006.15631
AUTHORS: Jialin Wu ; Liyan Chen ; Raymond J. Mooney
HIGHLIGHT: To address this issue, we present a novel framework that uses explanations for competing answers to help VQA systems select the correct answer.
64, TITLE: Motion Pyramid Networks for Accurate and Efficient Cardiac Motion Estimation
http://arxiv.org/abs/2006.15710
AUTHORS: Hanchao Yu ; Xiao Chen ; Humphrey Shi ; Terrence Chen ; Thomas S. Huang ; Shanhui Sun
COMMENTS: Accepted by MICCAI2020
HIGHLIGHT: In this paper, we propose Motion Pyramid Networks, a novel deep learning-based approach for accurate and efficient cardiac motion estimation.
65, TITLE: Want to Identify, Extract and Normalize Adverse Drug Reactions in Tweets? Use RoBERTa
http://arxiv.org/abs/2006.16146
AUTHORS: Katikapalli Subramanyam Kalyan ; S. Sangeetha
COMMENTS: 4 pages
HIGHLIGHT: This paper presents our approach for task 2 and task 3 of Social Media Mining for Health (SMM4H) 2020 shared tasks.
66, TITLE: Large Deformation Diffeomorphic Image Registration with Laplacian Pyramid Networks
http://arxiv.org/abs/2006.16148
AUTHORS: Tony C. W. Mok ; Albert C. S. Chung
COMMENTS: Paper accepted by MICCAI 2020
HIGHLIGHT: In this paper, we propose a deep Laplacian Pyramid Image Registration Network, which can solve the image registration optimization problem in a coarse-to-fine fashion within the space of diffeomorphic maps.
67, TITLE: Simulation and Optimisation of Air Conditioning Systems using Machine Learning
http://arxiv.org/abs/2006.15296
AUTHORS: Rakshitha Godahewa ; Chang Deng ; Arnaud Prouzeau ; Christoph Bergmeir
HIGHLIGHT: We introduce a deep-learning model based on Recurrent Neural Networks (RNN) that can predict the temperatures of a future period directly where a particular room is unoccupied and by using these predicted temperatures, we define the optimal thermal setpoints to be used inside the room during the unoccupied period.
68, TITLE: The Heterogeneity Hypothesis: Finding Layer-Wise Dissimilated Network Architecture
http://arxiv.org/abs/2006.16242
AUTHORS: Yawei Li ; Wen Li ; Martin Danelljan ; Kai Zhang ; Shuhang Gu ; Luc Van Gool ; Radu Timofte
COMMENTS: Code will be available at https://github.com/ofsoundof/Heterogeneity_Hypothesis
HIGHLIGHT: In this paper, we tackle the problem of convolutional neural network design.
69, TITLE: The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization
http://arxiv.org/abs/2006.16241
AUTHORS: Dan Hendrycks ; Steven Basart ; Norman Mu ; Saurav Kadavath ; Frank Wang ; Evan Dorundo ; Rahul Desai ; Tyler Zhu ; Samyak Parajuli ; Mike Guo ; Dawn Song ; Jacob Steinhardt ; Justin Gilmer
COMMENTS: Datasets, code, and models available at https://github.com/hendrycks/imagenet-r
HIGHLIGHT: We introduce three new robustness benchmarks consisting of naturally occurring distribution changes in image style, geographic location, camera operation, and more.
70, TITLE: Leveraging Subword Embeddings for Multinational Address Parsing
http://arxiv.org/abs/2006.16152
AUTHORS: Marouane Yassine ; David Beauchemin ; François Laviolette ; Luc Lamontagne
COMMENTS: submitted to IEEE CiSt'20
HIGHLIGHT: We propose an approach in which we employ subword embeddings and a Recurrent Neural Network architecture to build a single model capable of learning to parse addresses from multiple countries at the same time while taking into account the difference in languages and address formatting systems.
71, TITLE: Progressive Generation of Long Text
http://arxiv.org/abs/2006.15720
AUTHORS: Bowen Tan ; Zichao Yang ; Maruan AI-Shedivat ; Eric P. Xing ; Zhiting Hu
COMMENTS: Code available at https://github.com/tanyuqian/progressive-generation
HIGHLIGHT: To overcome the limitation, we propose a simple but effective method of generating text in a progressive manner, inspired by generating images from low to high resolution.
72, TITLE: Unsupervised Learning of Video Representations via Dense Trajectory Clustering
http://arxiv.org/abs/2006.15731
AUTHORS: Pavel Tokmakov ; Martial Hebert ; Cordelia Schmid
HIGHLIGHT: This paper addresses the task of unsupervised learning of representations for action recognition in videos.
73, TITLE: Roweisposes, Including Eigenposes, Supervised Eigenposes, and Fisherposes, for 3D Action Recognition
http://arxiv.org/abs/2006.15736
AUTHORS: Benyamin Ghojogh ; Fakhri Karray ; Mark Crowley
COMMENTS: key-words: Roweisposes, Roweis discriminant analysis, Fisherposes, eigenposes, supervised eigenposes, action recognition
HIGHLIGHT: In this paper, we propose Roweisposes which uses Roweis discriminant analysis for generalized subspace learning.
74, TITLE: Mapping Topic Evolution Across Poetic Traditions
http://arxiv.org/abs/2006.15732
AUTHORS: Petr Plechac ; Thomas N. Haider
HIGHLIGHT: Poetic traditions across languages evolved differently, but we find that certain semantic topics occur in several of them, albeit sometimes with temporal delay, or with diverging trajectories over time.
75, TITLE: EmotionNet Nano: An Efficient Deep Convolutional Neural Network Design for Real-time Facial Expression Recognition
http://arxiv.org/abs/2006.15759
AUTHORS: James Ren Hou Lee ; Linda Wang ; Alexander Wong
COMMENTS: 9 pages
HIGHLIGHT: Motivated by this need for a compact, low latency, yet accurate system capable of performing FEC in real-time on low-cost embedded devices, this study proposes EmotionNet Nano, an efficient deep convolutional neural network created through a human-machine collaborative design strategy, where human experience is combined with machine meticulousness and speed in order to craft a deep neural network design catered towards real-time embedded usage.
76, TITLE: Exploring Optimal Control With Observations at a Cost
http://arxiv.org/abs/2006.15757
AUTHORS: Rui Aguiar ; Nikka Mofid ; Hyunji Alex Nam
COMMENTS: 8 pages, 10 figures
HIGHLIGHT: Our approach models this problem using OpenAI Gym's Mountain Car and aims to address when to observe the patient's physiological state and partly how to intervene, as we have assumed we can only act after following an observation.
77, TITLE: Confidence-rich grid mapping
http://arxiv.org/abs/2006.15754
AUTHORS: Ali-akbar Agha-mohammadi ; Eric Heiden ; Karol Hausman ; Gaurav S. Sukhatme
COMMENTS: Published at International Journal of Robotics Research (IJRR) 2019 (https://journals.sagepub.com/doi/10.1177/0278364919839762)
HIGHLIGHT: In this work, we present confidence-rich mapping (CRM), a new algorithm for spatial grid-based mapping of the 3D environment.
78, TITLE: Empirically Verifying Hypotheses Using Reinforcement Learning
http://arxiv.org/abs/2006.15762
AUTHORS: Kenneth Marino ; Rob Fergus ; Arthur Szlam ; Abhinav Gupta
HIGHLIGHT: Specifically, we aim to build an agent that, given a hypothesis about the dynamics of the world, can take actions to generate observations which can help predict whether the hypothesis is true or false.
79, TITLE: Active Ensemble Deep Learning for Polarimetric Synthetic Aperture Radar Image Classification
http://arxiv.org/abs/2006.15771
AUTHORS: Sheng-Jie Liu ; Haowen Luo ; Qian Shi
COMMENTS: Accepted by GRSL
HIGHLIGHT: In this letter, we take the advantage of active learning and propose active ensemble deep learning (AEDL) for PolSAR image classification.
80, TITLE: Mining Persistent Activity in Continually Evolving Networks
http://arxiv.org/abs/2006.15410
AUTHORS: Caleb Belth ; Xinyi Zheng ; Danai Koutra
COMMENTS: 9 pages, plus 2 pages of supplementary material. Accepted at KDD 2020
HIGHLIGHT: In this work, we propose the problem of mining activity that persists through time in continually evolving networks---i.e., activity that repeatedly and consistently occurs.
81, TITLE: BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision
http://arxiv.org/abs/2006.15509
AUTHORS: Chen Liang ; Yue Yu ; Haoming Jiang ; Siawpeng Er ; Ruijia Wang ; Tuo Zhao ; Chao Zhang
COMMENTS: Proceedings of the 26th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '20)
HIGHLIGHT: To address this challenge, we propose a new computational framework -- BOND, which leverages the power of pre-trained language models (e.g., BERT and RoBERTa) to improve the prediction performance of NER models.
82, TITLE: Improving Sequence Tagging for Vietnamese Text Using Transformer-based Neural Models
http://arxiv.org/abs/2006.15994
AUTHORS: Viet Bui The ; Oanh Tran Thi ; Phuong Le-Hong
HIGHLIGHT: This paper describes our study on using mutilingual BERT embeddings and some new neural models for improving sequence tagging tasks for the Vietnamese language.
83, TITLE: Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
http://arxiv.org/abs/2006.15418
AUTHORS: Debidatta Dwibedi ; Yusuf Aytar ; Jonathan Tompson ; Pierre Sermanet ; Andrew Zisserman
COMMENTS: Accepted at CVPR 2020. Project webpage: https://sites.google.com/view/repnet
HIGHLIGHT: We present an approach for estimating the period with which an action is repeated in a video. We also collect a new challenging dataset called Countix (~90 times larger than existing datasets) which captures the challenges of repetition counting in real-world videos.
84, TITLE: Improving Interpretability of CNN Models Using Non-Negative Concept Activation Vectors
http://arxiv.org/abs/2006.15417
AUTHORS: Ruihan Zhang ; Prashan Madumal ; Tim Miller ; Kris Ehinger ; Benjamin Rubinstein
HIGHLIGHT: In this work, we rethink the ACE algorithm of Ghorbani et~al., proposing an alternative concept-based explanation framework.
85, TITLE: String-based methods for tonal harmony: A corpus study of Haydn's string quartets
http://arxiv.org/abs/2006.15411
AUTHORS: David R. W. Sears
COMMENTS: This is an original manuscript / preprint of a book chapter: Sears, David R. W (in press). String-based methods for tonal harmony: A corpus study of Haydn's string quartets." In D. Shanahan, A. Burgoyne, & I. Quinn (Eds.), Oxford Handbook of Music and Corpus Studies. New York: Oxford University Press. The manuscript contains 2 musical examples, 3 figures, and 4 tables
HIGHLIGHT: This chapter considers how string-based methods might be adapted to address music-analytic questions related to the discovery of musical organization, with particular attention devoted to the analysis of tonal harmony.
86, TITLE: Enhancement of a CNN-Based Denoiser Based on Spatial and Spectral Analysis
http://arxiv.org/abs/2006.15517
AUTHORS: Rui Zhao ; Kin-Man Lam ; Daniel P. K. Lun
COMMENTS: ICIP 2019
HIGHLIGHT: In this paper, we propose a discrete wavelet denoising CNN (WDnCNN), which restores images corrupted by various noise with a single model.
87, TITLE: On the generalization of learning-based 3D reconstruction
http://arxiv.org/abs/2006.15427
AUTHORS: Miguel Angel Bautista ; Walter Talbott ; Shuangfei Zhai ; Nitish Srivastava ; Joshua M Susskind
HIGHLIGHT: In this paper we study the inductive biases encoded in the model architecture that impact the generalization of learning-based 3D reconstruction methods.
88, TITLE: Predictive and Generative Neural Networks for Object Functionality
http://arxiv.org/abs/2006.15520
AUTHORS: Ruizhen Hu ; Zihao Yan ; Jingwen Zhang ; Oliver van Kaick ; Ariel Shamir ; Hao Zhang ; Hui Huang
COMMENTS: Accepted to SIGGRAPH 2018, project page at https://vcc.tech/research/2018/ICON4
HIGHLIGHT: We develop predictive and generative deep convolutional neural networks to replicate this feat.
89, TITLE: DeepACC:Automate Chromosome Classification based on Metaphase Images using Deep Learning Framework Fused with Prior Knowledge
http://arxiv.org/abs/2006.15528
AUTHORS: Chunlong Luo ; Tianqi Yu ; Yufan Luo ; Manqing Wang ; Fuhai Yu ; Yinhao Li ; Chan Tian ; Jie Qiao ; Li Xiao
HIGHLIGHT: In this work, we propose a detection based method, DeepACC, to locate and fine classify chromosomes simultaneously based on the whole metaphase image.
90, TITLE: Few-Shot Class-Incremental Learning via Feature Space Composition
http://arxiv.org/abs/2006.15524
AUTHORS: Hanbin Zhao ; Yongjian Fu ; Xuewei Li ; Songyuan Li ; Bourahla Omar ; Xi Li
HIGHLIGHT: With this motivation, we propose a novel few-shot class-incremental learning pipeline based on a composite representation space, which makes old-knowledge preserving and new-knowledge adaptation mutually compatible by feature space composition (enlarging the embedding capacity).
91, TITLE: Compositional Convolutional Neural Networks: A Robust and Interpretable Model for Object Recognition under Occlusion
http://arxiv.org/abs/2006.15538
AUTHORS: Adam Kortylewski ; Qing Liu ; Angtian Wang ; Yihong Sun ; Alan Yuille
HIGHLIGHT: In this work, we show that black-box deep convolutional neural networks (DCNNs) have only limited robustness to partial occlusion.
92, TITLE: Mind The Facts: Knowledge-Boosted Coherent Abstractive Text Summarization
http://arxiv.org/abs/2006.15435
AUTHORS: Beliz Gunel ; Chenguang Zhu ; Michael Zeng ; Xuedong Huang
COMMENTS: NeurIPS 2019, Knowledge Representation & Reasoning Meets Machine Learning (KR2ML workshop)
HIGHLIGHT: In this work, we propose a novel architecture that extends Transformer encoder-decoder architecture in order to improve on these shortcomings.
93, TITLE: DHARI Report to EPIC-Kitchens 2020 Object Detection Challenge
http://arxiv.org/abs/2006.15553
AUTHORS: Kaide Li ; Bingyan Liao ; Laifeng Hu ; Yaonong Wang
COMMENTS: Challenge Winner in the EPIC-Kitchens 2020 Object Detection Challenge(EPIC@CVPR2020)
HIGHLIGHT: Experimental results demonstrate that our approach can significantly improve the mean Average Precision (mAP) of object detection on both the seen and unseen test sets of EPICKitchens.
94, TITLE: SAR Image Despeckling by Deep Neural Networks: from a pre-trained model to an end-to-end training strategy
http://arxiv.org/abs/2006.15559
AUTHORS: Emanuele Dalsasso ; Loïc Denis ; Florence Tupin
COMMENTS: Notebook with Colab compatibility is available at https://github.com/emanueledalsasso/SAR-CNN
HIGHLIGHT: To handle this problem, this paper analyzes different strategies one can adopt, depending on the speckle removal task one wishes to perform and the availability of multitemporal stacks of SAR data.
95, TITLE: When and How Can Deep Generative Models be Inverted?
http://arxiv.org/abs/2006.15555
AUTHORS: Aviad Aberdam ; Dror Simon ; Michael Elad
HIGHLIGHT: Deep generative models (e.g. GANs and VAEs) have been developed quite extensively in recent years.
96, TITLE: I can attend a meeting too! Towards a human-like telepresence avatar robot to attend meeting on your behalf
http://arxiv.org/abs/2006.15647
AUTHORS: Hrishav Bakul Barua ; Chayan Sarkar ; Achanna Anil Kumar ; Arpan Pal ; Balamuralidhar P
HIGHLIGHT: In this work, we focus on a telepresence robot that can be used to attend a meeting remotely with a group of people.
97, TITLE: Video Representations of Goals Emerge from Watching Failure
http://arxiv.org/abs/2006.15657
AUTHORS: Dave Epstein ; Carl Vondrick
HIGHLIGHT: We introduce a video representation learning framework that models the latent goals behind observable human action.
98, TITLE: Geometry-Inspired Top-k Adversarial Perturbations
http://arxiv.org/abs/2006.15669
AUTHORS: Nurislam Tursynbek ; Aleksandr Petiushko ; Ivan Oseledets
HIGHLIGHT: We propose an effective geometry-inspired method of computing Top-k adversarial examples for any k.
99, TITLE: Differential Privacy of Hierarchical Census Data: An Optimization Approach
http://arxiv.org/abs/2006.15673
AUTHORS: Ferdinando Fioretto ; Pascal Van Hentenryck ; Keyu Zhu
HIGHLIGHT: The key technical contribution of the paper shows that this optimization problem can be solved in polynomial time by exploiting the structure of its cost functions.
100, TITLE: A Retinex based GAN Pipeline to Utilize Paired and Unpaired Datasets for Enhancing Low Light Images
http://arxiv.org/abs/2006.15304
AUTHORS: Harshana Weligampola ; Gihan Jayatilaka ; Suren Sritharan ; Roshan Godaliyadda ; Parakrama Ekanayaka ; Roshan Ragel ; Vijitha Herath
HIGHLIGHT: This paper presents a novel deep learning pipeline that can learn from both paired and unpaired datasets.
101, TITLE: A Benchmark dataset for both underwater image enhancement and underwater object detection
http://arxiv.org/abs/2006.15789
AUTHORS: Long Chen ; Lei Tong ; Feixiang Zhou ; Zheheng Jiang ; Zhenyang Li ; Jialin Lv ; Junyu Dong ; Huiyu Zhou
HIGHLIGHT: To investigate how the underwater image enhancement methods influence the following underwater object detection tasks, in this paper, we provide a large-scale underwater object detection dataset with both bounding box annotations and high quality reference images, namely OUC dataset.
102, TITLE: Seasonal Averaged One-Dependence Estimators: A Novel Algorithm to Address Seasonal Concept Drift in High-Dimensional Stream Classification
http://arxiv.org/abs/2006.15311
AUTHORS: Rakshitha Godahewa ; Trevor Yann ; Christoph Bergmeir ; Francois Petitjean
HIGHLIGHT: We introduce a novel classifier named Seasonal Averaged One-Dependence Estimators (SAODE), which extends the AODE classifier to handle seasonal drift by including time as a super parent.
103, TITLE: Combine Convolution with Recurrent Networks for Text Classification
http://arxiv.org/abs/2006.15795
AUTHORS: Shengfei Lyu ; Jiaqi Liu
HIGHLIGHT: In this paper, we propose a novel method to keep the strengths of the two networks to a great extent.
104, TITLE: Uncertainty-aware Self-training for Text Classification with Few Labels
http://arxiv.org/abs/2006.15315
AUTHORS: Subhabrata Mukherjee ; Ahmed Hassan Awadallah
HIGHLIGHT: In this work, we study self-training as one of the earliest semi-supervised learning approaches to reduce the annotation bottleneck by making use of large-scale unlabeled data for the target task.
105, TITLE: Video-Grounded Dialogues with Pretrained Generation Language Models
http://arxiv.org/abs/2006.15319
AUTHORS: Hung Le ; Steven C. H. Hoi
COMMENTS: Accepted at ACL 2020 (Short Paper)
HIGHLIGHT: In this paper, we leverage the power of pre-trained language models for improving video-grounded dialogue, which is very challenging and involves complex features of different dynamics: (1) Video features which can extend across both spatial and temporal dimensions; and (2) Dialogue features which involve semantic dependencies over multiple dialogue turns.
106, TITLE: Interactive Deep Refinement Network for Medical Image Segmentation
http://arxiv.org/abs/2006.15320
AUTHORS: Titinunt Kitrungrotsakul ; Iwamoto Yutaro ; Lanfen Lin ; Ruofeng Tong ; Jingsong Li ; Yen-Wei Chen
COMMENTS: 10 pages, 4 figures
HIGHLIGHT: In this paper, we propose an interactive deep refinement framework to improve the traditional semantic segmentation networks such as U-Net and fully convolutional network.
107, TITLE: Dynamic Sampling Networks for Efficient Action Recognition in Videos
http://arxiv.org/abs/2006.15560
AUTHORS: Yin-Dong Zheng ; Zhaoyang Liu ; Tong Lu ; Limin Wang
COMMENTS: To appear in IEEE Transaction on Image Processing
HIGHLIGHT: To address these issues, we propose a new framework for action recognition in videos, called {\em Dynamic Sampling Networks} (DSN), by designing a dynamic sampling module to improve the discriminative power of learned clip-level classifiers and as well increase the inference efficiency during testing.
108, TITLE: Joint Hand-object 3D Reconstruction from a Single Image with Cross-branch Feature Fusion
http://arxiv.org/abs/2006.15561
AUTHORS: Yujin Chen ; Zhigang Tu ; Di Kang ; Ruizhi Chen ; Linchao Bao ; Zhengyou Zhang ; Junsong Yuan
HIGHLIGHT: In this work, we propose to consider hand and object jointly in feature space and explore the reciprocity of the two branches.
109, TITLE: Data augmentation versus noise compensation for x- vector speaker recognition systems in noisy environments
http://arxiv.org/abs/2006.15903
AUTHORS: Mohammad Mohammadamini ; Driss Matrouf
HIGHLIGHT: In this work, we want to know if explicit noise compensation techniques continue to be effective despite the general noise robustness of these systems.
110, TITLE: Using Reinforcement Learning to Herd a Robotic Swarm to a Target Distribution
http://arxiv.org/abs/2006.15807
AUTHORS: Zahi M. Kakish ; Karthik Elamvazhuthi ; Spring Berman
COMMENTS: Paper was submitted to Conference on Robot Learning 2019 and IEEE Robotics and Automation Letters 2020
HIGHLIGHT: In this paper, we present a reinforcement learning approach to designing a control policy for a "leader'' agent that herds a swarm of "follower'' agents, via repulsive interactions, as quickly as possible to a target probability distribution over a strongly connected graph.
111, TITLE: Revision by Conditionals: From Hook to Arrow
http://arxiv.org/abs/2006.15811
AUTHORS: Jake Chandler ; Richard Booth
COMMENTS: Extended version of a paper accepted to KR 2020
HIGHLIGHT: We introduce a 'plug and play' method for uniquely extending any iterated belief revision operator to the conditional case.
112, TITLE: Improving Few-Shot Learning using Composite Rotation based Auxiliary Task
http://arxiv.org/abs/2006.15919
AUTHORS: Pratik Mazumder ; Pravendra Singh ; Vinay P. Namboodiri
HIGHLIGHT: In this paper, we propose an approach to improve few-shot classification performance using a composite rotation based auxiliary task.
113, TITLE: Interpreting and Disentangling Feature Components of Various Complexity from DNNs
http://arxiv.org/abs/2006.15920
AUTHORS: Jie Ren ; Mingjie Li ; Zexu Liu ; Quanshi Zhang
HIGHLIGHT: This paper aims to define, quantify, and analyze the feature complexity that is learned by a DNN.
114, TITLE: Retro*: Learning Retrosynthetic Planning with Neural Guided A* Search
http://arxiv.org/abs/2006.15820
AUTHORS: Binghong Chen ; Chengtao Li ; Hanjun Dai ; Le Song
COMMENTS: Presented at ICML 2020
HIGHLIGHT: In this paper, we propose Retro*, a neural-based A*-like algorithm that finds high-quality synthetic routes efficiently.
115, TITLE: Towards Learning-automation IoT Attack Detection through Reinforcement Learning
http://arxiv.org/abs/2006.15826
AUTHORS: Tianbo Gu ; Allaukik Abhishek ; Hao Fu ; Huanle Zhang ; Debraj Basu ; Prasant Mohapatra
COMMENTS: 11 pages, 8 figures, 2 tables, to appear in the 21st IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks (IEEE WoWMoM 2020)
HIGHLIGHT: In order to adapt to the new characteristics in IoT attacks, we propose a reinforcement learning-based attack detection model that can automatically learn and recognize the transformation of the attack pattern.
116, TITLE: Solving MKP Applied to IoT in Smart Grid Using Meta-heuristics Algorithms: A Parallel Processing Perspective
http://arxiv.org/abs/2006.15927
AUTHORS: Jandre Albertyn ; Ling Cheng ; Adnan M. Abu-Mahfouz
HIGHLIGHT: Solving MKP Applied to IoT in Smart Grid Using Meta-heuristics Algorithms: A Parallel Processing Perspective
117, TITLE: Answering Questions on COVID-19 in Real-Time
http://arxiv.org/abs/2006.15830
AUTHORS: Jinhyuk Lee ; Sean S. Yi ; Minbyul Jeong ; Mujeen Sung ; Wonjin Yoon ; Yonghwa Choi ; Miyoung Ko ; Jaewoo Kang
COMMENTS: 10 pages
HIGHLIGHT: In this work, we outline our effort to contribute to shrinking this knowledge vacuum by creating covidAsk, a question answering (QA) system that combines biomedical text mining and QA techniques to provide answers to questions in real-time.
118, TITLE: Is Japanese gendered language used on Twitter ? A large scale study
http://arxiv.org/abs/2006.15935
AUTHORS: Tiziana Carpi ; Stefano Maria Iacus
HIGHLIGHT: This study analyzes the usage of Japanese gendered language on Twitter.
119, TITLE: Hybrid Tensor Decomposition in Neural Network Compression
http://arxiv.org/abs/2006.15938
AUTHORS: Bijiao Wu ; Dingheng Wang ; Guangshe Zhao ; Lei Deng ; Guoqi Li
COMMENTS: submitted to <<Neural Networks>> on Apr.18,2020; received first review comments on June.02,2020; the revised manuscipt submitted on June.28,2020
HIGHLIGHT: In this work, we introduce the hierarchical Tucker (HT), a classical but rarely-used tensor decomposition method, to investigate its capability in neural network compression.
120, TITLE: End-to-End Differentiable Learning to HDR Image Synthesis for Multi-exposure Images
http://arxiv.org/abs/2006.15833
AUTHORS: Jung Hee Kim ; Siyeong Lee ; Soyeon Jo ; Suk-Ju Kang
COMMENTS: 10 pages, 5 figures
HIGHLIGHT: Therefore, we tackle the major challenge in stack reconstruction-based methods by proposing a novel framework with the fully differentiable HDRI process.
121, TITLE: Adversarial Multi-Source Transfer Learning in Healthcare: Application to Glucose Prediction for Diabetic People
http://arxiv.org/abs/2006.15940
AUTHORS: Maxime De Bois ; Mounîm A. El Yacoubi ; Mehdi Ammi
HIGHLIGHT: To improve the quality of the transfer between multiple sources of data, we propose a multi-source adversarial transfer learning framework that enables the learning of a feature representation that is similar across the sources, and thus more general and more easily transferable.
122, TITLE: Hinting Semantic Parsing with Statistical Word Sense Disambiguation
http://arxiv.org/abs/2006.15942
AUTHORS: Ritwik Bose ; Siddharth Vashishtha ; James Allen
COMMENTS: 7 pages, 3 figures
HIGHLIGHT: In this work, we provide hints from a statistical WSD system to guide a logical semantic parser to produce better semantic type assignments while maintaining the soundness of the resulting logical forms.
123, TITLE: Towards hybrid primary intersubjectivity: a neural robotics library for human science
http://arxiv.org/abs/2006.15948
AUTHORS: Hendry F. Chame ; Ahmadreza Ahmadi ; Jun Tani
HIGHLIGHT: Hence, in this work we pursue three main objectives.
124, TITLE: A Transformer-based joint-encoding for Emotion Recognition and Sentiment Analysis
http://arxiv.org/abs/2006.15955
AUTHORS: Jean-Benoit Delbrouck ; Noé Tits ; Mathilde Brousmiche ; Stéphane Dupont
COMMENTS: Winner of the ACL20: Second Grand-Challenge on Multimodal Language
HIGHLIGHT: This paper describes a Transformer-based joint-encoding (TBJE) for the task of Emotion Recognition and Sentiment Analysis.
125, TITLE: Multi-level colonoscopy malignant tissue detection with adversarial CAC-UNet
http://arxiv.org/abs/2006.15954
AUTHORS: Chuang Zhu ; Ke Mei ; Ting Peng ; Yihao Luo ; Jun Liu ; Ying Wang ; Mulan Jin
COMMENTS: accepted by Neurocomputing; winner of the MICCAI DigestPath 2019 challenge on colonoscopy tissue segmentation and classification task
HIGHLIGHT: In this paper, we propose a highly efficient multi-level malignant tissue detection through the designed adversarial CAC-UNet.
126, TITLE: Application of Neuroevolution in Autonomous Cars
http://arxiv.org/abs/2006.15175
AUTHORS: Sainath G ; Vignesh S ; Siddarth S ; G Suganya
COMMENTS: 13 pages, 9 figures, 1 table
HIGHLIGHT: We propose a system that requires no data for its training.
127, TITLE: Bookworm continual learning: beyond zero-shot learning and continual learning
http://arxiv.org/abs/2006.15176
AUTHORS: Kai Wang ; Luis Herranz ; Anjan Dutta ; Joost van de Weijer
HIGHLIGHT: We propose bookworm continual learning(BCL), a flexible setting where unseen classes can be inferred via a semantic model, and the visual model can be updated continually.
128, TITLE: Creating Artificial Modalities to Solve RGB Liveness
http://arxiv.org/abs/2006.16028
AUTHORS: Aleksandr Parkin ; Oleg Grinchuk
COMMENTS: CVPRW2020
HIGHLIGHT: In this work we propose a method to utilize the difference in dynamic appearance between bona fide and spoof samples by creating artificial modalities from RGB videos.
129, TITLE: Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper
http://arxiv.org/abs/2006.16035
AUTHORS: Kallil M. C. Zielinski ; Marcelo Teixeira ; Richardson Ribeiro ; Dalcimar Casanova
HIGHLIGHT: In this way, this work presents the concept and the implementation of a tool that allows us to convert any dynamic system modeled as an FSM to the open-source Gym wrapper.
130, TITLE: Visual Kinship Recognition: A Decade in the Making
http://arxiv.org/abs/2006.16033
AUTHORS: Joseph P Robinson ; Ming Shao ; Yun Fu
HIGHLIGHT: We list and review the public resources and data challenges that enabled and inspired many to hone-in on one or more views of automatic kinship recognition in the visual domain.
131, TITLE: Region-of-interest guided Supervoxel Inpainting for Self-supervision
http://arxiv.org/abs/2006.15186
AUTHORS: Subhradeep Kayal ; Shuai Chen ; Marleen de Bruijne
COMMENTS: Accepted at MICCAI 2020
HIGHLIGHT: In this work, we focus on image inpainting as the self-supervised proxy task, and propose two novel structural changes to further enhance the performance of a deep neural network.
132, TITLE: Making DensePose fast and light
http://arxiv.org/abs/2006.15190
AUTHORS: Ruslan Rakhimov ; Emil Bogomolov ; Alexandr Notchenko ; Fung Mao ; Alexey Artemov ; Denis Zorin ; Evgeny Burnaev
HIGHLIGHT: In this work, we target the problem of redesigning the DensePose R-CNN model's architecture so that the final network retains most of its accuracy but becomes more light-weight and fast.
133, TITLE: Automatic Operating Room Surgical Activity Recognition for Robot-Assisted Surgery
http://arxiv.org/abs/2006.16166
AUTHORS: Aidean Sharghi ; Helene Haugerud ; Daniel Oh ; Omid Mohareri
COMMENTS: International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI'20)
HIGHLIGHT: In this paper, we investigate automatic surgical activity recognition in robot-assisted operations. We collect the first large-scale dataset including 400 full-length multi-perspective videos from a variety of robotic surgery cases captured using Time-of-Flight cameras.
134, TITLE: Building Rule Hierarchies for Efficient Logical Rule Learning from Knowledge Graphs
http://arxiv.org/abs/2006.16171
AUTHORS: Yulong Gu ; Yu Guan ; Paolo Missior
HIGHLIGHT: In this work, we address such scalability issues by proposing new methods for pruning unpromising rules using rule hierarchies.
135, TITLE: Multichannel CNN with Attention for Text Classification
http://arxiv.org/abs/2006.16174
AUTHORS: Zhenyu Liu ; Haiwei Huang ; Chaohong Lu ; Shengfei Lyu
HIGHLIGHT: In order to combine the strengths of the two kinds of networks and alleviate their shortcomings, this paper proposes Attention-based Multichannel Convolutional Neural Network (AMCNN) for text classification.
136, TITLE: Natural Backdoor Attack on Text Data
http://arxiv.org/abs/2006.16176
AUTHORS: Lichao Sun
COMMENTS: Paper submitted to a conference, due to the double-blind policy, we only preprint it with all random name and school information
HIGHLIGHT: In this paper, we systematically study the backdoor attack against models on text data.
137, TITLE: Unsupervised Learning Consensus Model for Dynamic Texture Videos Segmentation
http://arxiv.org/abs/2006.16177
AUTHORS: Lazhar Khelifi ; Max Mignotte
HIGHLIGHT: We present an effective unsupervised learning consensus model for the segmentation of dynamic texture (ULCM).
138, TITLE: Harvesting, Detecting, and Characterizing Liver Lesions from Large-scale Multi-phase CT Data via Deep Dynamic Texture Learning
http://arxiv.org/abs/2006.15691
AUTHORS: Yuankai Huo ; Jinzheng Cai ; Chi-Tung Cheng ; Ashwin Raju ; Ke Yan ; Bennett A. Landman ; Jing Xiao ; Le Lu ; Chien-Hung Liao ; Adam Harrison
HIGHLIGHT: In this work, we curate a patient cohort of 1305 dynamic contrast CT studies (i.e., 5220 multi-phase 3D volumes) with pathology confirmed ground truth.
139, TITLE: Simulation of Brain Resection for Cavity Segmentation Using Self-Supervised and Semi-Supervised Learning
http://arxiv.org/abs/2006.15693
AUTHORS: Fernando Pérez-García ; Roman Rodionov ; Ali Alim-Marvasti ; Rachel Sparks ; John S. Duncan ; Sébastien Ourselin
COMMENTS: 13 pages, 6 figures, accepted at the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2020
HIGHLIGHT: We developed an algorithm to simulate resections on preoperative MR images.
140, TITLE: Traditional and accelerated gradient descent for neural architecture search
http://arxiv.org/abs/2006.15218
AUTHORS: Nicolas Garcia Trillos ; Felix Morales ; Javier Morales
HIGHLIGHT: In this paper, we introduce two algorithms for neural architecture search (NASGD and NASAGD) following the theoretical work by two of the authors [3].
141, TITLE: MIMC-VINS: A Versatile and Resilient Multi-IMU Multi-Camera Visual-Inertial Navigation System
http://arxiv.org/abs/2006.15699
AUTHORS: Kevin Eckenhoff ; Patrick Geneva ; Guoquan Huang
COMMENTS: 20 pages, 10 figures, 13 tables
HIGHLIGHT: To this end, rather than the standard VINS paradigm using a minimal sensing suite of a single camera and IMU, in this paper we design a real-time consistent multi-IMU multi-camera (MIMC)-VINS estimator that is able to seamlessly fuse multi-modal information from an arbitrary number of uncalibrated cameras and IMUs.
==========Updates to Previous Papers==========
1, TITLE: Channel Pruning via Automatic Structure Search
http://arxiv.org/abs/2001.08565
AUTHORS: Mingbao Lin ; Rongrong Ji ; Yuxin Zhang ; Baochang Zhang ; Yongjian Wu ; Yonghong Tian
COMMENTS: Accepted by IJCAI2020. SOLO copyright holder is IJCAI (International Joint Conferences on Artificial Intelligence)
HIGHLIGHT: In this paper, we propose a new channel pruning method based on artificial bee colony algorithm (ABC), dubbed as ABCPruner, which aims to efficiently find optimal pruned structure, i.e., channel number in each layer, rather than selecting "important" channels as previous works did.
2, TITLE: Cross-Supervised Object Detection
http://arxiv.org/abs/2006.15056
AUTHORS: Zitian Chen ; Zhiqiang Shen ; Jiahui Yu ; Erik Learned-Miller
HIGHLIGHT: In this work, we show how to build better object detectors from weakly labeled images of new categories by leveraging knowledge learned from fully labeled base categories.
3, TITLE: A Novel Nudity Detection Algorithm for Web and Mobile Application Development
http://arxiv.org/abs/2006.01780
AUTHORS: Rahat Yeasin Emon
COMMENTS: 5 pages
HIGHLIGHT: This paper presents a runtime nudity detection method for web and mobile application development.
4, TITLE: FastReID: A Pytorch Toolbox for General Instance Re-identification
http://arxiv.org/abs/2006.02631
AUTHORS: Lingxiao He ; Xingyu Liao ; Wu Liu ; Xinchen Liu ; Peng Cheng ; Tao Mei
HIGHLIGHT: To meet the increasing application demand for general instance re-identification, we present FastReID as a widely used software system in JD AI Research.
5, TITLE: POMDP Modelling for Assessing Hierarchies
http://arxiv.org/abs/1908.07031
AUTHORS: Weipeng Huang ; Guangyuan Piao ; Raul Moreno ; Neil J. Hurley
HIGHLIGHT: This motivates us to propose a framework for assessing the quality of hierarchical clustering allocations which covers the case of no ground-truth information.
6, TITLE: Exploring the Capacity of an Orderless Box Discretization Network for Multi-orientation Scene Text Detection
http://arxiv.org/abs/1912.09629
AUTHORS: Yuliang Liu ; Tong He ; Hao Chen ; Xinyu Wang ; Canjie Luo ; Shuaitao Zhang ; Chunhua Shen ; Lianwen Jin
HIGHLIGHT: Here we solve this problem by proposing a novel method, termed Orderless Box Discretization (OBD), which first discretizes the quadrilateral box into several key edges containing all potential horizontal and vertical positions.
7, TITLE: Universal Self-Attention Network for Graph Classification
http://arxiv.org/abs/1909.11855
AUTHORS: Dai Quoc Nguyen ; Tu Dinh Nguyen ; Dinh Phung
COMMENTS: We have updated the Pytorch and Tensorflow implementation at: https://github.com/daiquocnguyen/Graph-Transformer
HIGHLIGHT: To this end, we present U2GNN -- a novel embedding model leveraging the transformer self-attention network -- to learn plausible node and graph embeddings.
8, TITLE: Efficient Second-Order TreeCRF for Neural Dependency Parsing
http://arxiv.org/abs/2005.00975
AUTHORS: Yu Zhang ; Zhenghua Li ; Min Zhang
COMMENTS: ACL 2020
HIGHLIGHT: To address this issue, we propose an effective way to batchify the inside and Viterbi algorithms for direct large matrix operation on GPUs, and to avoid the complex outside algorithm via efficient back-propagation.
9, TITLE: Symbolic Querying of Vector Spaces: Probabilistic Databases Meets Relational Embeddings
http://arxiv.org/abs/2002.10029
AUTHORS: Tal Friedman ; Guy Van den Broeck
HIGHLIGHT: We propose unifying techniques from probabilistic databases and relational embedding models with the goal of performing complex queries on incomplete and uncertain data.
10, TITLE: "Good Robot!": Efficient Reinforcement Learning for Multi-Step Visual Tasks with Sim to Real Transfer
http://arxiv.org/abs/1909.11730
AUTHORS: Andrew Hundt ; Benjamin Killeen ; Nicholas Greene ; Hongtao Wu ; Heeyeon Kwon ; Chris Paxton ; Gregory D. Hager
COMMENTS: This is a major update to v2 with 100% success in real robot tests, clearer equations, a comprehensive ablation study, and generalization to a grid world environment. 8 pages, 6 figures, 3 tables, 1 algorithm. Code is available at https://github.com/jhu-lcsr/good_robot and a video overview is at https://youtu.be/MbCuEZadkIw
HIGHLIGHT: The SPOT framework successfully completes simulated trials of a variety of tasks, improving a baseline trial success rate from 13% to 100% when stacking 4 cubes, from 13% to 99% when creating rows of 4 cubes, and from 84% to 95% when clearing toys arranged in adversarial patterns.
11, TITLE: It's Not What Machines Can Learn, It's What We Cannot Teach
http://arxiv.org/abs/2002.09398
AUTHORS: Gal Yehuda ; Moshe Gabel ; Assaf Schuster
COMMENTS: Accepted to ICML 2020
HIGHLIGHT: In this work we offer a different perspective on this question.
12, TITLE: Joint Spatial and Angular Super-Resolution from a Single Image
http://arxiv.org/abs/1911.11619
AUTHORS: Andre Ivan ; Williem ; In Kyu Park
COMMENTS: arXiv admin note: substantial text overlap with arXiv:1903.12364
HIGHLIGHT: In this paper, we show that both super-resolution problems can be solved jointly from a single image by proposing a single end-to-end deep neural network that does not require a physical-based approach.
13, TITLE: 2D Image Relighting with Image-to-Image Translation
http://arxiv.org/abs/2006.07816
AUTHORS: Paul Gafton ; Erick Maraz
COMMENTS: 12 pages, 52 Postscript figures, uses cvpr_eso.sty eso-pic.sty ruler.sty
HIGHLIGHT: Here we provide our attempt to solve this problem using GANs.
14, TITLE: A Novel Approach for Correcting Multiple Discrete Rigid In-Plane Motions Artefacts in MRI Scans
http://arxiv.org/abs/2006.13804
AUTHORS: Michael Rotman ; Rafi Brada ; Israel Beniaminy ; Sangtae Ahn ; Christopher J. Hardy ; Lior Wolf
HIGHLIGHT: In this paper we propose a novel method for removing motion artefacts using a deep neural network with two input branches that discriminates between patient poses using the motion's timing.
15, TITLE: G2D: Generate to Detect Anomaly
http://arxiv.org/abs/2006.11629
AUTHORS: Masoud Pourreza ; Bahram Mohammadi ; Mostafa Khaki ; Samir Bouindour ; Hichem Snoussi ; Mohammad Sabokrou
HIGHLIGHT: In this paper, we propose a novel method for irregularity detection.
16, TITLE: Failure of Normalization in Impredicative Type Theory with Proof-Irrelevant Propositional Equality
http://arxiv.org/abs/1911.08174
AUTHORS: Andreas Abel ; Thierry Coquand
HIGHLIGHT: Failure of Normalization in Impredicative Type Theory with Proof-Irrelevant Propositional Equality
17, TITLE: On the Relationship Between Active Inference and Control as Inference
http://arxiv.org/abs/2006.12964
AUTHORS: Beren Millidge ; Alexander Tschantz ; Anil K Seth ; Christopher L Buckley
COMMENTS: final workshop version
HIGHLIGHT: In the context of this comparison, we highlight several ways in which these frameworks can inform one another.
18, TITLE: Context-Sensitive Generation Network for Handing Unknown Slot Values in Dialogue State Tracking
http://arxiv.org/abs/2005.03923
AUTHORS: Puhai Yang ; Heyan Huang ; Xian-Ling Mao
HIGHLIGHT: To tackle the problem, in this paper, we propose a novel Context-Sensitive Generation network (CSG) which can facilitate the representation of out-of-vocabulary words when generating the unknown slot value.
19, TITLE: Self-supervised Robust Object Detectors from Partially Labelled Datasets
http://arxiv.org/abs/2005.11549
AUTHORS: Mahdieh Abbasi ; Denis Laurendeau ; Christian Gagne
HIGHLIGHT: With the goal of training \emph{one integrated robust object detector with high generalization performance}, we propose a training framework to overcome missing-label challenge of the merged datasets.
20, TITLE: Dynamic Task Weighting Methods for Multi-task Networks in Autonomous Driving Systems
http://arxiv.org/abs/2001.02223
AUTHORS: Isabelle Leang ; Ganesh Sistu ; Fabian Burger ; Andrei Bursuc ; Senthil Yogamani
COMMENTS: Accepted for Oral Presentation at IEEE Intelligent Transportation Systems Conference (ITSC) 2020
HIGHLIGHT: In this work, we review and systematically evaluate nine task weighting strategies on common grounds on three automotive datasets (KITTI, Cityscapes and WoodScape).
21, TITLE: Two-phase protein folding optimization on a three-dimensional AB off-lattice model
http://arxiv.org/abs/1903.01456
AUTHORS: Borko Bošković ; Janez Brest
COMMENTS: 20 pages, 13 tables, 6 figures. arXiv admin note: text overlap with arXiv:1710.07031
HIGHLIGHT: This paper presents a two-phase protein folding optimization on a three-dimensional AB off-lattice model.
22, TITLE: A novel approach for multi-agent cooperative pursuit to capture grouped evaders
http://arxiv.org/abs/2006.01022
AUTHORS: Muhammad Zuhair Qadir ; Songhao Piao ; Haiyang Jiang ; Mohammed El Habib Souidi
COMMENTS: published paper's draft version
HIGHLIGHT: An approach of mobile multi-agent pursuit based on application of self-organizing feature map (SOFM) and along with that reinforcement learning based on agent group role membership function (AGRMF) model is proposed.
23, TITLE: Transformer based Grapheme-to-Phoneme Conversion
http://arxiv.org/abs/2004.06338
AUTHORS: Sevinj Yolchuyeva ; Géza Németh ; Bálint Gyires-Tóth
COMMENTS: INTERSPEECH 2019
HIGHLIGHT: In this paper, we investigate the application of transformer architecture to G2P conversion and compare its performance with recurrent and convolutional neural network based approaches.
24, TITLE: Adaptive Online Planning for Continual Lifelong Learning
http://arxiv.org/abs/1912.01188
AUTHORS: Kevin Lu ; Igor Mordatch ; Pieter Abbeel
COMMENTS: Originally published in NeurIPS Deep RL 2019
HIGHLIGHT: We present a new algorithm, Adaptive Online Planning (AOP), that achieves strong performance in this setting by combining model-based planning with model-free learning.
25, TITLE: Natural Language Processing Advancements By Deep Learning: A Survey
http://arxiv.org/abs/2003.01200
AUTHORS: Amirsina Torfi ; Rouzbeh A. Shirvani ; Yaser Keneshloo ; Nader Tavaf ; Edward A. Fox
HIGHLIGHT: It covers core NLP tasks and applications and describes how deep learning methods and models advance these areas.
26, TITLE: Rapid trial-and-error learning with simulation supports flexible tool use and physical reasoning
http://arxiv.org/abs/1907.09620
AUTHORS: Kelsey R. Allen ; Kevin A. Smith ; Joshua B. Tenenbaum
COMMENTS: This manuscript is in press at PNAS. It is an extended version of a paper "Rapid Trial-and-Error Learning in Physical Problem Solving" accepted for oral presentation at the 41st Annual Meeting of the Cognitive Science Society (2019). It represents ongoing work on the part of the authors
HIGHLIGHT: To study this type of general physical problem solving, we introduce the Virtual Tools game.
27, TITLE: Symbolic Network: Generalized Neural Policies for Relational MDPs
http://arxiv.org/abs/2002.07375
AUTHORS: Sankalp Garg ; Aniket Bajpai ; Mausam
COMMENTS: In Proceeding of ICML 2020. Code can be found at https://github.com/dair-iitd/symnet
HIGHLIGHT: We present SymNet, the first neural approach for solving RMDPs that are expressed in the probabilistic planning language of RDDL.
28, TITLE: Specular- and Diffuse-reflection-based Face Spoofing Detection for Mobile Devices
http://arxiv.org/abs/1907.12400
AUTHORS: Akinori F. Ebihara ; Kazuyuki Sakurai ; Hitoshi Imaoka
COMMENTS: Accepted to IJCB2020
HIGHLIGHT: Here, we propose an efficient face presentation attack detection (PAD) algorithm that requires minimal hardware and only a small database, making it suitable for resource-constrained devices such as mobile phones.
29, TITLE: Similarity Learning Networks for Animal Individual Re-Identification -- Beyond the Capabilities of a Human Observer
http://arxiv.org/abs/1902.09324
AUTHORS: Stefan Schneider ; Graham W. Taylor ; Stefan Linquist ; Stefan C. Kremer
COMMENTS: 9 pages, 4 figures, 3 table. WACV 2020 - Deep Learning for Animal Re-ID Workshop
HIGHLIGHT: The ability for researchers to re-identify an animal individual upon re-encounter is fundamental for addressing a broad range of questions in the study of population dynamics and community/behavioural ecology.
30, TITLE: Learning to Detect 3D Objects from Point Clouds in Real Time
http://arxiv.org/abs/2006.01250
AUTHORS: Abhinav Sagar
COMMENTS: 11 pages
HIGHLIGHT: In this work, we address the problem of 3D object detection from point cloud data in real time.
31, TITLE: AraBERT: Transformer-based Model for Arabic Language Understanding
http://arxiv.org/abs/2003.00104
AUTHORS: Wissam Antoun ; Fady Baly ; Hazem Hajj
COMMENTS: Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020), Marseille, France (2020)
HIGHLIGHT: In this paper, we pre-trained BERT specifically for the Arabic language in the pursuit of achieving the same success that BERT did for the English language.
32, TITLE: PnPNet: End-to-End Perception and Prediction with Tracking in the Loop
http://arxiv.org/abs/2005.14711
AUTHORS: Ming Liang ; Bin Yang ; Wenyuan Zeng ; Yun Chen ; Rui Hu ; Sergio Casas ; Raquel Urtasun
COMMENTS: CVPR2020
HIGHLIGHT: Towards this goal we propose PnPNet, an end-to-end model that takes as input sequential sensor data, and outputs at each time step object tracks and their future trajectories.
33, TITLE: Motion Planning Networks: Bridging the Gap Between Learning-based and Classical Motion Planners
http://arxiv.org/abs/1907.06013
AUTHORS: Ahmed H. Qureshi ; Yinglong Miao ; Anthony Simeonov ; Michael C. Yip
COMMENTS: Supplementary material including implementation parameters and project videos are available at https://sites.google.com/view/mpnet/home. This work has been accepted for publication at IEEE Transactions on Robotics
HIGHLIGHT: To train the MPNet models, we present an active continual learning approach that enables MPNet to learn from streaming data and actively ask for expert demonstrations when needed, drastically reducing data for training.
34, TITLE: Pseudo-Labeling and Confirmation Bias in Deep Semi-Supervised Learning
http://arxiv.org/abs/1908.02983
AUTHORS: Eric Arazo ; Diego Ortego ; Paul Albert ; Noel E. O'Connor ; Kevin McGuinness
HIGHLIGHT: We, conversely, propose to learn from unlabeled data by generating soft pseudo-labels using the network predictions.
35, TITLE: A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
http://arxiv.org/abs/2006.14171
AUTHORS: Shengyi Huang ; Santiago Ontañón
COMMENTS: Preprint. Corrected a major issue of the withdrawn version submitted to NeurIPS 2020
HIGHLIGHT: In this paper, we show that the standard working mechanism of invalid action masking corresponds to valid policy gradient updates.
36, TITLE: Derivative Manipulation for General Example Weighting
http://arxiv.org/abs/1905.11233
AUTHORS: Xinshao Wang ; Elyor Kodirov ; Yang Hua ; Neil M. Robertson
COMMENTS: Is it the right time to make a change from the design of loss function to the design of derivative directly?
HIGHLIGHT: Therefore, we propose derivative manipulation (DM), a novel and general example weighting approach for training robust deep models under these adverse conditions.
37, TITLE: Reliable Fidelity and Diversity Metrics for Generative Models
http://arxiv.org/abs/2002.09797
AUTHORS: Muhammad Ferjad Naeem ; Seong Joon Oh ; Youngjung Uh ; Yunjey Choi ; Jaejun Yoo
COMMENTS: First two authors have contributed equally; ICML 2020 accepted
HIGHLIGHT: In this paper, we show that even the latest version of the precision and recall metrics are not reliable yet.
38, TITLE: DoubleU-Net: A Deep Convolutional Neural Network for Medical Image Segmentation
http://arxiv.org/abs/2006.04868
AUTHORS: Debesh Jha ; Michael A. Riegler ; Dag Johansen ; Pål Halvorsen ; Håvard D. Johansen
HIGHLIGHT: To improve the performance of U-Net on various segmentation tasks, we propose a novel architecture called DoubleU-Net, which is a combination of two U-Net architectures stacked on top of each other.
39, TITLE: Neural Machine Translation for Multilingual Grapheme-to-Phoneme Conversion
http://arxiv.org/abs/2006.14194
AUTHORS: Alex Sokolov ; Tracy Rohlin ; Ariya Rastrow
COMMENTS: Published in INTERSPEECH (2019)
HIGHLIGHT: As an alternative, we present a single end-to-end trained neural G2P model that shares same encoder and decoder across multiple languages.
40, TITLE: Learning Near Optimal Policies with Low Inherent Bellman Error
http://arxiv.org/abs/2003.00153