🏠
Working from home
PhD student at @showlab NUS |
Vision+Language, Video-Understanding, AI-Human Interaction | @microsoft, ex-@facebookresearch @Tencent
-
National University of Singapore
- Singapore
- qhlin.me
- @KevinQHLin
Pinned Loading
-
showlab/ShowUI
showlab/ShowUI PublicOpen-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
-
showlab/EgoVLP
showlab/EgoVLP Public[NeurIPS2022] Egocentric Video-Language Pretraining
-
showlab/computer_use_ootb
showlab/computer_use_ootb PublicOut-of-the-box (OOTB) GUI Agent for Windows and macOS
-
showlab/UniVTG
showlab/UniVTG Public[ICCV2023] UniVTG: Towards Unified Video-Language Temporal Grounding
-
showlab/Awesome-GUI-Agent
showlab/Awesome-GUI-Agent Public💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
-
showlab/VLog
showlab/VLog PublicTransform Video as a Document with ChatGPT, CLIP, BLIP2, GRIT, Whisper, LangChain.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.