- Groningen
- http://andreasvc.github.io
Stars
Top2Vec learns jointly embedded topic, document and word vectors.
Enhancing Translation with RAG-Powered Large Language Models
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
Official code repo for the O'Reilly book "Hands-On Large Language Models"
Efficient few-shot learning with Sentence Transformers
Replication code for Chaudhuri et al., "A small set of stylometric features differentiates Latin prose and verse," Digital Scholarship in the Humanities 2018
Package to extract connotation frames
A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python
Material for UvA course Coding the Humanities 2023
Positive-unlabeled learning with Python.
Code and documentation to train Stanford's Alpaca models, and generate the data.
Extraction of structured and unstructured information from fandom.com pages
A simple, bare-bones, no-frills note taking app for Android.
Packaging of ungoogled-chromium for Debian, Ubuntu, and other distributions
Using machine learning to classify book reviews based on genre
The code used for the paper "Evaluating and Improving the Coreference Capabilities of Machine Translation Models"
An extremely fast Python linter and code formatter, written in Rust.
Cultural Analytics Open Science Guide (powered by Quarto)
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)