Skip to content
View duj12's full-sized avatar

Block or report duj12

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
duj12/README.md
  • 👋 Hi, I’m @duj12. I graduated from Tsinghua University, Department of Engineering Physics.
  • 👀 I’m interested in Speech and Spoken Language Processing and Understanding, and Voice Generation.
  • 🌱 I mainly focus on Automatic Speech Recognition, Voice Activity Detection, Key Word Spotting, Language Modeling and related fields.
  • ⏳ Now I'm working on Text to Speech, Zero-Shot Speech Synthesis, and Voice Cloning.
  • 💞️ Hoping to communicate with you in the fields of deep learning, generative artificial intelligence, and large language models, and so on.
  • 📫 How to reach me: [email protected].

Pinned Loading

  1. duj12 duj12 Public

    Config files for my GitHub profile.

  2. ASR-2Pass ASR-2Pass Public

    ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).

    HTML 52 7

  3. vad_asr vad_asr Public

    Python 1

  4. kws_demo kws_demo Public

    KWS demo based on CTC prefix beam search.

    Python 12 2

  5. CosyVoice CosyVoice Public

    Forked from FunAudioLLM/CosyVoice

    LLM based TTS model, providing inference/training/deployment full-stack ability.

    Python

  6. FunASR FunASR Public

    Forked from modelscope/FunASR

    A Fundamental End-to-End Speech Recognition Toolkit

    Python 1