initial-h

🎯

Focusing

Hongming Zhang initial-h

🎯

Focusing

Shape the way you think.

40 followers · 32 following

www.cnblogs.com/initial-h/

Achievements

Highlights

Stars

bytedance / deer-flow

DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 15,356 1,904 Updated Jul 18, 2025

zilliztech / deep-searcher

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 6,544 644 Updated Jul 10, 2025

jina-ai / node-DeepResearch

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,628 422 Updated Jul 9, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 11,159 1,855 Updated Jul 18, 2025

jennyzzt / dgm

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,524 323 Updated Jun 12, 2025

codelion / openevolve

Open-source implementation of AlphaEvolve

Python 3,241 429 Updated Jul 18, 2025

MingshiYangUIUC / AI-Doudizhu

easydou

Python 5 Updated Dec 22, 2024

deecamp2019-group20 / RuleBasedModelV2

改进过的rule版本

Python 9 4 Updated Aug 14, 2019

1310183534 / DouDiZhu

Python 13 7 Updated Sep 14, 2021

kuleshov-group / bd3lms

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 734 42 Updated Jul 10, 2025

CUHK-ARISE / GAMABench

Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

Jupyter Notebook 83 1 Updated May 1, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 9,580 2,545 Updated Jul 16, 2025

apexrl / Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

585 26 Updated Nov 29, 2024

opendilab / awesome-diffusion-model-in-rl

A curated list of Diffusion Model in RL resources (continually updated)

1,256 66 Updated Feb 15, 2025

qordmlwls / WDPOP

Forked from OpenRLHF/OpenRLHF

Jupyter Notebook 1 Updated May 7, 2025

Sun-Haoyuan23 / Awesome-RL-based-Reasoning-MLLMs

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

1,005 44 Updated Jul 16, 2025