Skip to content
View naumnaum's full-sized avatar

Block or report naumnaum

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
naumnaum/README.md

Aleksei Naumov


Feel free to reach out!

📫 [email protected]
assets/Google_Scholar_logo.svg.png Linkedin
assets/Google_Scholar_logo.svg.png Telegram
assets/Google_Scholar_logo.svg.png Twitter
assets/Google_Scholar_logo.svg.png My Papers On Google Scholar
📄 My CV

About me

I’m an Lead AI Product Engineer, working on building AI voice-assistant in Terra Quantum, Germany.
And I’m in love with creating AI products from scratch!

  • Training, deploying and serving LLMs, SD and audio models.

  • Architecting, building and deploying the whole backend of product: from APIs and databases to RAG pipelines with vector stores.

  • Working with both mobile and web frontends.

Prior to that I was leading an AI research team in Terra Quantum, we published multiple works on On-Device AI in IEEE Conferences (you can read more info below)


🛠️ My stack

Programming Languages

  • Python
  • Javascript
  • Typescript

Backend Development

API Development: FastAPI, Express
Databases: PostgreSQL, Redis, Vector Databases
Cloud Services: Google Cloud Platform (Compute, Storage, Cloud Run), Firebase \

Machine Learning & AI

Model Training: LLMs, Stable Diffusion (SD)
Model Deployment: LLMs, TTS (Text-to-Speech), STT (Speech-to-Text), SD on Google Cloud and other GPU providers
RAG Pipelines: Retrieval-Augmented Generation for enhanced LLM performance \

DevOps & Infrastructure

Containerization & Orchestration: Docker, Kubernetes, Helm
CI/CD: GitHub Actions, Google Cloud CI/CD pipelines
Scalable Inference Systems: Cloud-based model inference on GCP and other GPU providers \


📝 Papers and articles


🎤 Talks


🎓 Education

assets/msu_logo_new-white-modified.png Lomonosov’s Moscow State University (QS #37 WORLD RANKINGS IN PHYSICS)

BSc. of Physics With Applied Mathematics Specialization

⚛️ Relevant Coursework

Mathematics and statistics: Linear algebra, Probability & Statistics, Mathematical Analysis, Robotics.
Physics: Fundamental Physics, Theoretical Mechanics, Quantum Theory, Physical Chemistry

🏆 Awards

assets/msu_logo_new-white-modified.png Winner of Lomonosov’s Physics Olympiad

assets/msu_logo_new-white-modified.png Best Bachelor’s Thesis Lomonosov’s MSU 2021

Pinned Loading

  1. efficient-dl-systems efficient-dl-systems Public

    Forked from mryab/efficient-dl-systems

    Efficient Deep Learning Systems course materials (HSE, YSDA)

    Jupyter Notebook

  2. GPU-Puzzles GPU-Puzzles Public

    Forked from srush/GPU-Puzzles

    Solve puzzles. Learn CUDA.

    Jupyter Notebook

  3. nn_zero_to_hero nn_zero_to_hero Public

    Exercises from Andrej Karpathy's course

    Jupyter Notebook

  4. python-mastery python-mastery Public

    Forked from dabeaz-course/python-mastery

    Advanced Python Mastery (course by @dabeaz)

    Python

  5. fastapi-opentelemetry-tracing fastapi-opentelemetry-tracing Public

    Simple FastAPI app with implementation of tracing using Opentelemetry and Jaeger

    Python 3

  6. TQCompressedGPT2 TQCompressedGPT2 Public

    Forked from terra-quantum-public/TQCompressedGPT2

    Python