tonywu71/README.md

Hi, I'm Tony 👋🏼

I'm a Research Engineer based in Paris 🇫🇷. I grew up in a small town near Disneyland Paris 🏰, and I'm lucky to have traveled around the world during my academic years (love you 🇧🇷🇭🇰🇬🇧).

  • 🎓 I studied for an MEng at CentraleSupélec (Paris-Saclay) 🇫🇷 and for the MPhil in Machine Learning and Machine Intelligence (MLMI) at the University of Cambridge (Sidney Sussex College) 🇬🇧.
  • 💼 I am currently working at H Company on building state-of-the-art web agents.
  • 🔬 Research interests: LLMs, multimodal models, agents, information retrieval, RAG, speech.

💬 Feel free to reach out to discuss research ideas (mostly active on X)!

Contact: tonywu.ai@outlook.com


GitHub · X · Hugging Face · Scholar · LinkedIn

Pinned

  1. illuin-tech/colpali

    The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

    Python · 2.2k stars · 205 forks

  2. illuin-tech/vidore-benchmark

    Vision Document Retrieval (ViDoRe) benchmark. Evaluation code for the ColPali paper.

    Python · 244 stars · 32 forks

  3. AnswerDotAI/byaldi

    Use late-interaction multi-modal models such as ColPali in just a few lines of code.

    Python · 824 stars · 92 forks

  4. colpali-cookbooks

    Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳

    336 stars · 27 forks

  5. huggingface/transformers

    🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

    Python · 151k stars · 30.7k forks

  6. dotfiles

    My personal dotfiles.

    Ruby
