Project

This repo has been populated by an initial template to help get you started. Please make sure to update the content to build a great experience for community-building.

As the maintainer of this project, please make a few updates:

Improving this README.MD file to provide a great experience
Updating SUPPORT.MD with content about this project's support experience
Understanding the security reporting process in SECURITY.MD
Remove this section from the README

Motivation

R&D Agent: Focusing on automating the most core and valuable part of the industrial R&D process.

Core method: Evolving;

Scenarios

Data-driven R&D

TODO: importance justification

🎯 Goal

In this project, we are aiming to build a Data-Centric R&D Agent that can

Read real-world material (reports, papers, etc.) and extract key formulas, descriptions of interested features, factors and models.
Implement the extracted formulas, features, factors and models in runnable codes.
- Due the limited ability for LLM in implementing in once, evolving the agent to be able to extend abilities by learn from feedback and knowledge and improve the agent's ability to implement more complex models.
Further propose new ideas based on current knowledge and observations.

🛣️ Brief Roadmap

In this section, we will briefly introduce the roadmap/technical type of this project.

Backbone LLM: We use GPT series as main backbone of the agent. .env file uis used to config settings (such as APIkey, APIEndpoint and etc) in the environment variables way. Check this Readme for environment set up.
KnowledgeGraph based evolving: We do not do any further pertain or fine-tune on the LLM model. Instead, we modify prompts like RAG, but use knowledge graph query information to evolve the agent's ability to implement more complex models.
- Typically, we build a knowledge consisted with Error, Component(you can think of it as a numeric operation or function), Trail and etc. We add nodes of these types to the knowledge graph with relationship while the agent tries to implement a model. For each attempts, the agent will query the knowledge graph to get the information of current status as prompt input. The agent will also update the knowledge graph with the new information after the attempt.

Code Refinement

Example: code standard, design. Lint

🔧 Development

Set up the development environment.
```
make dev
```
Run linting and formatting.
```
make lint
```

📚 Code Structure

Backbone/APIBackend of LLm are encapsulated in src/finco/llm.py. All chat completion request are managed by this file.
All frequently modified codes under tense development are included in the src/scripts folder.
- The most important task is to improve the agent's performance in the benchmark of factor implementation.
Currently, factor implementation is the main task. We define basic class of factor implementation in [src/scripts/factor_implementation/share_modules] and implementation strategies in [src/scripts/factor_implementation/baselines].

🔮 Future Code Structure

Currently, the code structure is unstable and will frequently change for quick updates. The code will be refactored before a standard release. Please try to align with the following principles when developing to minimize the effort required for future refactoring.

📂 src
➥ 📂 <project name>: avoid namespace
  ➥ 📁 core
  ➥ 📁 component A
  ➥ 📁 component B
  ➥ 📁 component C
  ➥ 📂 app
    ➥ 📁 scenario1
    ➥ 📁 scenario2
➥ 📁 scripts

Folder Name	Description
📁 core	The core framework of the system. All classes should be abstract and usually can't be used directly.
📁 component X	Useful components that can be used by others(e.g. scenario). Many subclasses of core classes are located here.
📁 app	Applications for specific scenarios (usually built based on components). Removing any of them does not affect the system's completeness or other scenarios.
📁 scripts	Quick and dirty things. These are candidates for core, components, and apps.

Configuration:

You can manually source the .env file in your shell before running the Python script: Most of the workflow are controlled by the environment variables.

# Export each variable in the .env file; Please note that it is different from `source .env` without export
export $(grep -v '^#' .env | xargs)
# Run the Python script
python your_script.py

Naming convention

File naming convention

Name	Description
`conf.py`	The configuration for the module & app & project

Contributing

Guidance

This project welcomes contributions and suggestions. You can find issues in the issues list or simply running grep -r "TODO:".

Making contributions is not a hard thing. Solving an issue(maybe just answering a question raised in issues list ), fixing/issuing a bug, improving the documents and even fixing a typo are important contributions to RDAgent.

Policy

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Name		Name	Last commit message	Last commit date
Latest commit History 90 Commits
.github		.github
constraints		constraints
docs		docs
rdagent		rdagent
requirements		requirements
test		test
.env.example		.env.example
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Project

Motivation

Scenarios

Data-driven R&D

🎯 Goal

🛣️ Brief Roadmap

Code Refinement

🔧 Development

📚 Code Structure

🔮 Future Code Structure

Configuration:

Naming convention

File naming convention

Contributing

Guidance

Policy

Trademarks

About

Uh oh!

Releases

Packages

Languages

License

thefivespace/RD-Agent

Folders and files

Latest commit

History

Repository files navigation

Project

Motivation

Scenarios

Data-driven R&D

🎯 Goal

🛣️ Brief Roadmap

Code Refinement

🔧 Development

📚 Code Structure

🔮 Future Code Structure

Configuration:

Naming convention

File naming convention

Contributing

Guidance

Policy

Trademarks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages