
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

Python 15,356 1,904 Updated Jul 18, 2025

Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.

Python 6,544 644 Updated Jul 10, 2025

Keep searching, reading webpages, and reasoning until it finds the answer (or exceeds the token budget)

TypeScript 4,628 422 Updated Jul 9, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 11,159 1,855 Updated Jul 18, 2025

Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents

Python 1,524 323 Updated Jun 12, 2025

Open-source implementation of AlphaEvolve

Python 3,241 429 Updated Jul 18, 2025

easydou

Python 5 Updated Dec 22, 2024

Improved rule-based version

Python 9 4 Updated Aug 14, 2019
Python 13 7 Updated Sep 14, 2021

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Python 734 42 Updated Jul 10, 2025

Benchmarking LLMs' Gaming Ability in Multi-Agent Environments

Jupyter Notebook 83 1 Updated May 1, 2025

A framework for few-shot evaluation of language models.

Python 9,580 2,545 Updated Jul 16, 2025

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

585 26 Updated Nov 29, 2024

A curated list of Diffusion Model in RL resources (continually updated)

1,256 66 Updated Feb 15, 2025
Jupyter Notebook 1 Updated May 7, 2025

This repository provides a valuable reference for researchers in the field of multimodality. Start exploring RL-based reasoning MLLMs here!

1,005 44 Updated Jul 16, 2025

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 318 34 Updated Aug 6, 2024
Python 147 17 Updated Dec 20, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,800 134 Updated Jan 17, 2025

Official PyTorch implementation for "Large Language Diffusion Models"

Python 2,584 173 Updated Jun 17, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,146 62 Updated Feb 25, 2025

Source code of the ICML24 paper "Self-Composing Policies for Scalable Continual Reinforcement Learning" (selected for oral presentation)

Python 21 3 Updated Jul 20, 2024

A Doudizhu reinforcement learning AI

Python 35 12 Updated May 20, 2025

C++/Python Fight the Lord with pybind11 (a reinforcement learning AI for Dou Dizhu), accepted to AIIDE-2020

Python 160 40 Updated Jun 17, 2021
Python 3 Updated Mar 2, 2025

A customized AI based on DouZero for real games of Happy Dou Dizhu (欢乐斗地主)

Python 2,030 444 Updated May 28, 2022

Using DouZero to automate Happy Dou Dizhu (欢乐斗地主)

Python 738 177 Updated May 19, 2024

Douzero with ResNet and GPU support for Windows

Python 43 18 Updated Dec 23, 2021

Evaluation of PerfectDou vs DouZero ResNet.

Python 5 2 Updated Oct 11, 2022