- Palo Alto Networks
- Raleigh, NC
- @evanqjones
- in/evanqjones
Stars
Create and edit images using your voice
Sean-Bradley / three.js
Forked from mrdoob/three.js
JavaScript 3D library.
A library for constraining triangulations from Delaunator
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
The framework/SDK that lets you build browser-controlling agents in 3 lines of code. Join the chat at https://discord.gg/umgnyQU2K8
An MLX port of FLUX based on the Hugging Face Diffusers implementation.
Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL
VideoSys: An easy and efficient system for video generation
Speech To Speech: an effort for an open-sourced and modular GPT4-o
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Llama-3 agents that can browse the web by following instructions and talking to you
Navigation mesh utilities for three.js, based on PatrolJS.
An SDK/Python library for Automatic 1111 to run state-of-the-art diffusion models
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
open Multiple View Geometry library. Basis for 3D computer vision and Structure from Motion.
Lifting ControlNet for Generalized Depth Conditioning
Infinite Photorealistic Worlds using Procedural Generation
Bitmap & tilemap generation from a single example with the help of ideas from quantum mechanics
Probabilistic language based on pattern matching and constraint propagation, 153 examples