- Yale University / Allen AI (AI2)
- New Haven, CT
- http://armancohan.com/
Stars
A bibliography and survey of the papers surrounding o1
hamishivi / EasyLM
Forked from young-geng/EasyLM
Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.
Simple code for generating a color-coded LaTeX table from raw data
Reconquer the canvas: beautiful TikZ figures without clunky TikZ code
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
Helper to create sheet music from flowkey songs
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Provides a common interface to many IR ranking datasets.
Mesh TensorFlow: Model Parallelism Made Easier
An end-to-end neural ad-hoc ranking pipeline.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
cybertronai / transformer-xl
Forked from kimiyoung/transformer-xl
Training Transformer-XL on 128 GPUs
Tool to find already running ssh-agent compatible agents
A Python wrapper for the ROUGE summarization evaluation package
Code for the paper "Language Models are Unsupervised Multitask Learners"
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
📡 Organized Resources for Deep Learning in Natural Language Processing
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Measuring the Evolution of a Scientific Field through Citation Frames
mathsyouth / awesome-text-summarization
Forked from lipiji/App-DL
A curated list of resources dedicated to text summarization
Interactive in-browser attention visualizer tool for recurrent networks
The Berkeley Document Summarizer is a learning-based, single-document summarization system that extracts source document content, exploits syntactic information to compress it, and uses coreference…
Preparation links and resources for system design questions
TensorFlow Neural Machine Translation Tutorial
Deep Learning papers reading roadmap for anyone who is eager to learn this amazing tech!