-
datatrove Public
Forked from huggingface/datatroveFreeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Apache License 2.0 UpdatedMar 4, 2025 -
procqa Public
Community-based Programming Question Answering
-
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedMay 18, 2024 -
nanotron Public
Forked from huggingface/nanotronMinimalistic large language model 3D-parallelism training
Python Apache License 2.0 UpdatedMar 16, 2024 -
embeddings Public
Training Large-scale Text Embedding Models with 🤗 Transformers
-
huggingface_hub Public
Forked from huggingface/huggingface_hubThe official Python client for the Huggingface Hub.
Python Apache License 2.0 UpdatedFeb 6, 2024 -
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedJul 11, 2023 -
tevatron Public
Forked from texttron/tevatronTevatron - A flexible toolkit for dense retrieval research and development.
Python Apache License 2.0 UpdatedApr 30, 2023 -
mteb Public
Forked from embeddings-benchmark/mtebMTEB: Massive Text Embedding Benchmark
Python Apache License 2.0 UpdatedApr 25, 2023 -
-
dual-cross-encoder Public
Dual Cross Encoder for Dense Retrieval
-
-
-
conv-diff-mp Public
Solving 2D convection diffusion equation using julia multiprocessing
Julia UpdatedSep 6, 2022 -
yologo Public
Logo ASCII Art based on YOLO and PaddleOCR
-
-
beir Public
Forked from beir-cellar/beirA Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Python Apache License 2.0 UpdatedMay 12, 2022 -
GR-for-KBQG Public
Graph Retrieval for Question Generation over Knowledge Base
-
-
BART4KBQG Public
Finetuning a BART for Question Generation over Knowledge Bases
-
triple2seq Public
PyTorch reimplementation of Serban et al.'s paper "Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus" at ACL'2016
-
yologo-dataset Public
YOLO based logo detection
Python GNU General Public License v3.0 UpdatedAug 26, 2021 -
ASCIIArt Public
Converting an RGB image to its ASCII encoding
Python Apache License 2.0 UpdatedAug 25, 2021 -
MachineLearning Public
Implementation of some common machine learning models from scratch with Numpy
Python UpdatedAug 5, 2021 -
ParaSolver Public
Numerical simulation of particle deformation in the fluid flow
-
Collision Public
A virtual physics experiment system for the quantitative simulation of physical collision based on Unity and C#
-
-