👻
🚀 AI Researcher & Engineer @ LLM & NLP
🎓 PhD @ Decision Science & Managerial Economics
💻 MSc @ Computer Science
Pinned
-
DeepEnlighten (Public)
Pure RL to post-train base models for social reasoning capabilities. A lightweight replication of DeepSeek-R1-Zero on the Social IQa dataset.
Python 38
-
Logic-RL-Lite (Public)
A lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Python 48
-
Awesome-LLM-Interview-Questions-and-Answers (Public)
Interviews for LLM algorithm engineers and LLM agent development engineers: common questions and answers.