This Java backend service enables communication with Large Language Models (LLMs) through Ollama. It supports the llama3 8B model and its 2-bit quantized variant, providing separate functionality for premium and free users that differs in response quality and speed.
The service uses Meta's LLaMA3 models. LLaMA is a family of transformer-based models optimized for efficiency and performance across a range of computing scales. These models suit a variety of applications, from text generation to question answering, and their main advantage is deployment flexibility: they run on anything from small-scale devices to large server-based systems without compromising the quality of results.
LLaMA3 models are the latest versions in the LLaMA family, designed with advanced capabilities in language understanding and generation and building on the architecture and training approaches of earlier models.
The following models are used in this service:
- llama3 (the 8B model), backing the premium functionality
- llama3:8b-instruct-q2_K (a 2-bit quantized 8B instruct variant), backing the free functionality
Before you begin, ensure you have the following installed:
- Java JDK 17 or higher
- Docker (optional, for running Ollama without local installation)
The service requires setting up the following environment variables:
- MONGODB_URI: The connection string for your MongoDB service.
- JWT_SECRET: The secret key for signing and verifying JWT tokens.
Contact the repository owners to obtain these variables.
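How the service consumes these variables internally is not shown here; as an illustration only, the sketch below shows one way a fail-fast startup check for them could look. The class and method names are hypothetical and are not code from this repository.

```java
// Hypothetical startup check: fail fast if a required environment variable
// is missing. Illustrative only; not the repository's actual configuration code.
public class RequiredEnv {

    static String require(String name) {
        String value = System.getenv(name);
        if (value == null || value.isBlank()) {
            throw new IllegalStateException("Missing required environment variable: " + name);
        }
        return value;
    }

    public static void main(String[] args) {
        String mongoUri = require("MONGODB_URI"); // MongoDB connection string
        String jwtSecret = require("JWT_SECRET"); // key for signing/verifying JWTs
        System.out.println("MONGODB_URI and JWT_SECRET are set ("
                + mongoUri.length() + " and " + jwtSecret.length() + " characters).");
    }
}
```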
If you want to run the Ollama dependency locally, you can either download Ollama directly or use Docker. Ollama's Docker image simplifies the setup for running large language models locally.
To get started with Ollama on Docker, first pull Ollama's image from the Docker repository:
docker pull ollama/ollama
To run the Ollama Docker container on a CPU-only machine, use the following command:
docker run -d -v ollama:/root/.ollama -p 11434:11434 --name ollama ollama/ollama
For running on GPUs, refer to Ollama's Docker documentation.
Run both models locally:
docker exec -it ollama ollama run llama3
docker exec -it ollama ollama run llama3:8b-instruct-q2_K
You should set the ollama.base-url variable in application.properties to the correct host and port of your LLM model deployment. The default is http://localhost:11434.
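To confirm that the configured base URL actually points at a running Ollama instance with the llama3 model available, a quick standalone check can help. The sketch below is illustrative and not part of the service itself; it assumes Ollama's standard REST API (POST /api/generate) is exposed at the default base URL and that the llama3 model has already been pulled. The class name is hypothetical.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Standalone check (not part of the service): sends a single non-streaming
// prompt to the Ollama deployment behind the configured base URL.
public class OllamaSmokeTest {
    public static void main(String[] args) throws Exception {
        String baseUrl = "http://localhost:11434"; // must match ollama.base-url
        String body = """
                {
                  "model": "llama3",
                  "prompt": "Say hello in one short sentence.",
                  "stream": false
                }""";
        HttpRequest request = HttpRequest.newBuilder(URI.create(baseUrl + "/api/generate"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode());
        System.out.println(response.body()); // raw JSON returned by Ollama
    }
}
```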
Once the service is running, you can interact with the chat through the exposed endpoints. Here is a cURL example:
curl --location 'http://localhost:8081/api/chat/generate/premium' \
--header 'Content-Type: application/json' \
--header 'Authorization: Bearer eyJhbGciOiJIUzI1NiJ9.eyJzdWIiOiJhcmVnYWsiLCJpYXQiOjE3MTUyNjQ3ODgsImV4cCI6MTcxNTI2NjIyOH0.sjYaVkPGiVQbMWMjahn_ZFosFGVQSXwH_TkkjFbVMk0' \
--header 'Cookie: paa-did=a.c.komx7yxu.7f982d5d-6b8e-4414-b7fd-e533851f28e7' \
--data '{
"prompt" : "Tell me a joke",
"context" : []
}'
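The same request can also be issued from code. The sketch below mirrors the cURL call using Java's built-in HTTP client; the class name is illustrative, and <JWT_TOKEN> stands in for a token obtained from the login endpoint. It prints the raw response body, since the response schema is not documented here.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Java equivalent of the cURL example above.
public class PremiumChatExample {
    public static void main(String[] args) throws Exception {
        String body = """
                {
                  "prompt": "Tell me a joke",
                  "context": []
                }""";
        HttpRequest request = HttpRequest.newBuilder(
                        URI.create("http://localhost:8081/api/chat/generate/premium"))
                .header("Content-Type", "application/json")
                .header("Authorization", "Bearer <JWT_TOKEN>") // token from the login endpoint
                .POST(HttpRequest.BodyPublishers.ofString(body))
                .build();
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.body()); // raw JSON response from the service
    }
}
```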
Except for the login and sign-up endpoints, all endpoints require a JWT token in the Authorization header.
For any technical issues or questions, contact:
Gohar Aghajanyan - [email protected]
Alice Martirosyan - [email protected]
Mariam Yevdokimova - [email protected]
Inna Khachikyan - [email protected]
Melanya Khachatryan - [email protected]
Levon Keshishoghlyan - [email protected]