+

erfanMhi

Follow

Erfan Miahi erfanMhi

Follow

Working on really hard problems, especially the problem of intelligence.

124 followers · 76 following

Fronix
Canada, Toronto
https://www.linkedin.com/in/erfan-miahi-8637a1130/
https://orcid.org/0000-0001-7510-083X
@erfan_mhi
erfan_mhi
in/erfan-miahi-8637a1130

Achievements

Achievements

Organizations

Pinned Loading

rlvr_pipeline rlvr_pipeline Public

A composable component orchestrator for Reinforcement Learning from Verifiable Rewards (RLVR) training of Large Language Models on reasoning tasks.

Python 1
intractai/IntractCodeAPI intractai/IntractCodeAPI Public

An API designed for code completion and fine-tuning of open-source large language models on internal codebases and documents.

Python 9 2
base_reinforcement_learning base_reinforcement_learning Public

This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.

Python 12 1
flypi flypi Public

Circuit Analysis for Extracting Components and Connections for XR (Toronto Meta Llama Hackathon)

Python 6
Deep-Reinforcement-Learning-CS285-Pytorch Deep-Reinforcement-Learning-CS285-Pytorch Public

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Python 138 11
A-quantum-inspired-genetic-algorithm-for-k-means-clustering A-quantum-inspired-genetic-algorithm-for-k-means-clustering Public

Implementation of a Quantum inspired genetic algorithm proposed by A quantum-inspired genetic algorithm for k-means clustering paper.

Jupyter Notebook 37 10

点击这是indexloc提供的php浏览器服务，不要输入任何密码和下载