University of Western Australia
- Perth, WA 6009, Australia
Lists (11)
🎤ASR/TTS
Repos for voice assistants
AUTO4508
Pioneer-related repos only
🕹️ gamepad visualisation
Visualize a joystick controller or gamepad for video. The control signal comes from rosbag and stream.
gba-dev
Geo&Construction
humanoidRobotics
🔒LLM-safety
🏭mopu
Stars
It is said that Ilya Sutskever gave John Carmack this reading list of ~30 research papers on deep learning.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
aider is AI pair programming in your terminal
CMake based toolchain for GBA homebrew development
A random-event-driven, text-based game engine.
Supercharge Your LLM Application Evaluations 🚀
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
ROS2 STT node. An out-of-the-box speech-to-text recognizer using the standalone Vosk speech recognition toolkit. This is a MIRRORED REPOSITORY. Refer to the GitLab page for the origin.
A ROS2-based Conversational AI system that processes speech input, interacts with various chatbot APIs (including ChatGPT, Hugging Face, and local LLMs via Ollama), and generates spoken responses u…
A Study on Prompt Injection Attack Against LLM-Integrated Mobile Robotic Systems
Everything about the SmolLM and SmolVLM family of models
Public code release associated with SceneScript.
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
Detect and redact PII locally with SOTA performance
Official code for article "LLMLight: Large Language Models as Traffic Signal Control Agents".
Docs2KG: A Human-LLM Collaborative Approach to Unified Knowledge Graph Construction from Heterogeneous Documents
This repository contains the code for the paper “iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement”.
Simple, hackable offline speech to text - using the VOSK-API.
Discover the GPT-4o multimodal model at Microsoft Build 2024, now with text and image capabilities. My prototype enhances chats with real-time camera snapshots, powered by Flask, OpenCV, and Azure’…
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking (CVPR 2023)
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
PyTorch code and models for the DINOv2 self-supervised learning method.