AI Singapore SEA-LION model served by Text Generation Inference (TGI) with Docker Compose

Model

Llama-SEA-LION-v3-8B-IT

Requirements

Docker
GPU: https://huggingface.co/docs/text-generation-inference/en/quicktour#supported-hardware
80GB of disk storage for the model and docker image

Quick Start

Start the service.
```
docker compose up
```

TGI is deployed as a server that implements the OpenAI API protocol. The server can be queried via http://localhost:8080 in the same format as the OpenAI API. For example:

curl http://localhost:8080/v1/completions \
  -H "Content-Type: application/json" \
  -d '{
      "model": "Llama-SEA-LION-v3-8B-IT",
      "prompt": "Artificial Intelligence is",
      "max_tokens": 50,
      "temperature": 0.8,
      "repetition_penalty": 1.2
  }'

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
test.sh		test.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Singapore SEA-LION model served by Text Generation Inference (TGI) with Docker Compose

Model

Requirements

Quick Start

About

Uh oh!

Languages

License

aisingapore/sealion-tgi

Folders and files

Latest commit

History

Repository files navigation

AI Singapore SEA-LION model served by Text Generation Inference (TGI) with Docker Compose

Model

Requirements

Quick Start

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages