DocProcAI-Service

This service processes and manages uploaded lecture material (video recordings, documents, slides) to support advanced features of the MEITREX platform.

Features

Splitting of lecture videos into sections based on slide changes detected via computer vision
OCR of on-screen text in lecture videos
Transcript and closed caption generation for lecture videos
Generation of text embeddings on a per-section basis for videos and a per-page basis for documents
Semantic search, i.e. fetching of semantically similar sections of lecture material
Automatic generation of titles for the detected video sections
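
The semantic search feature listed above can be illustrated with a minimal sketch. It assumes that each section or page already has an embedding vector and ranks candidates by cosine similarity to a query embedding. The function names `cosine_similarity` and `top_k_similar` and the sample data are hypothetical and are not taken from this service's code; the service may well use a different similarity measure or a vector database.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine similarity of two equally long vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # A zero vector has no direction; treat it as dissimilar to everything.
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(
    query: Sequence[float],
    sections: Dict[str, Sequence[float]],
    k: int = 3,
) -> List[Tuple[str, float]]:
    """Rank sections (hypothetical id -> embedding) by similarity to the query."""
    scored = [(sid, cosine_similarity(query, emb)) for sid, emb in sections.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Toy two-dimensional embeddings; real embeddings have hundreds of dimensions.
example_sections = {
    "video-section-1": [1.0, 0.0],
    "document-page-4": [0.0, 1.0],
    "video-section-2": [1.0, 1.0],
}
best_matches = top_k_similar([1.0, 0.0], example_sections, k=2)
```

Here `best_matches` lists `video-section-1` first, since its embedding points in the same direction as the query, followed by `video-section-2`.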

Installation

This service requires PyTorch. Because some features need PyTorch with GPU support, the default pip-distributed build of PyTorch cannot be used; a platform-specific build has to be installed instead. By default, the PyTorch build for NVIDIA CUDA 12.4 is installed, as it should offer the broadest compatibility with widely used GPUs. If you need a different PyTorch build, change the install command in the Dockerfile.

Caution

GPU features require a supported GPU and operating system. This matters especially in combination with Docker, because the service runs in a Docker container.

Docker does not currently provide GPU support on macOS, so the GPU features of the service do not work on macOS.

GPU features can be disabled in config.yaml.
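
To check whether GPU features can work in a given environment, it can help to ask PyTorch directly whether CUDA is usable. The following is a minimal sketch, not the service's actual startup logic: the function name `select_device` and the `gpu_enabled` parameter are hypothetical, and the real option names are documented in config.yaml. Only `torch.cuda.is_available()` is an actual PyTorch call.

```python
def select_device(gpu_enabled: bool = True) -> str:
    """Return "cuda" if GPU use is requested and possible, otherwise "cpu".

    `gpu_enabled` is a hypothetical stand-in for whatever switch
    config.yaml provides to disable GPU features.
    """
    if not gpu_enabled:
        return "cpu"
    try:
        import torch  # imported lazily so the sketch also runs without PyTorch
    except ImportError:
        return "cpu"
    # True only if this PyTorch build has CUDA support and a usable GPU is visible.
    return "cuda" if torch.cuda.is_available() else "cpu"


device = select_device(gpu_enabled=True)
print(f"Running on: {device}")
```

If this prints `cpu` inside the container even though the host has an NVIDIA GPU, the likely causes are a CPU-only PyTorch build or a container that was started without GPU access.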

Configuration

The service is configured through the config.yaml file located in the root directory. All configuration properties are explained by comments in that file.
