⚠️ SuntoBot is in an early development stage. Expect issues.

SuntoBot - Telegram Chat Summarization Bot

A Telegram bot that automatically saves chat messages and provides personalized summaries using an LLM when requested by users. The bot operates in groups authorized by admins and stores conversation history in a PostgreSQL database.

Features

  • Dynamic Group Management: Admins can allow/deny groups using commands
  • Message Storage: Save all text messages and compressed images to PostgreSQL
  • Forwarded Message Support: Tracks and displays information about forwarded messages
  • Personalized Summaries: Generate summaries tailored to the requesting user
  • Flexible Time Ranges: Support custom intervals (10m, 1h, 24h, 10d)
  • LLM Integration: Uses OpenAI API with configurable base URL
    • We recommend running a local Ollama server for LLM processing, using gemma3.

Quick Start

The easiest way to get started is using Docker Compose with the included PostgreSQL database.

Option 1: Docker Compose Setup (Recommended)

  1. Clone the repository:
git clone <repository-url>
cd suntobot
  2. Set up environment variables:
cp .env.example .env
  3. Edit the .env file with your configuration:
# Required: Get your bot token from @BotFather on Telegram
TELEGRAM_BOT_TOKEN=your_bot_token_here

# Required: Your Telegram user ID (get from @userinfobot)
ADMIN_IDS="123456789"

# Database (use this for Docker setup)
DATABASE_URL=postgresql://suntobot:suntopassword@db:5432/suntobot

# Required: OpenAI API key
OPENAI_API_KEY=your_openai_api_key_here

# Optional: Use different OpenAI-compatible API
OPENAI_BASE_URL=https://api.openai.com/v1

# Optional: Change the models (defaults work with OpenAI)
SUMMARY_MODEL=gpt-4o-mini
IMAGE_MODEL=gpt-4o-mini

# Optional: Mention behavior configuration
MENTION_CONTEXT_SIZE=30
MENTION_CONTEXT_HOURS=4
OLD_MENTION_CONTEXT_SIZE=10

# Optional: Image analysis configuration
IMAGE_ANALYSIS_ENABLED=true

# Optional: Summary processing configuration
SUMMARY_CHUNK_SIZE=70
MAX_PARALLEL_CHUNKS=2

# Optional: Custom system prompts (uncomment to use)
# SYSTEM_PROMPT="Your custom summary prompt..."
# MENTION_SYSTEM_PROMPT="Your custom mention reply prompt..."
# CHUNK_SYSTEM_PROMPT="Your custom chunk processing prompt..."
# META_SUMMARY_SYSTEM_PROMPT="Your custom meta-summary prompt..."
# SYSTEM_PROMPT_CHUNK_PREAMBLE="Your custom chunk preamble..."
# META_SUMMARY_SYSTEM_PROMPT_SUFFIX="Your custom meta-summary suffix..."
  4. Start the bot with Docker Compose:
# Copy the example docker-compose file
cp docker-compose.example.yml docker-compose.yml

# Start the bot and database
docker-compose up -d

# Check logs
docker-compose logs -f bot

Option 2: Local Development Setup

For development or if you prefer running locally:

Prerequisites

  • Python 3.13+
  • PostgreSQL database
  • Telegram Bot Token (from @BotFather)
  • OpenAI API key

Installation

  1. Clone and setup:
git clone <repository-url>
cd suntobot
  2. Install dependencies using uv:
uv sync
  3. Set up environment variables:
cp .env.example .env
# Edit .env with your configuration
  4. Set up PostgreSQL database:

    • Create a PostgreSQL database
    • Update DATABASE_URL in .env with your database connection string
    • Example: postgresql://username:password@localhost:5432/suntobot
  5. Run database migrations:

uv run python run_migration.py
  6. Run the bot:
uv run python src/main.py

Configuration

Environment Variables

All configuration is done through environment variables in the .env file:

Variable | Description | Required | Default | Example
--- | --- | --- | --- | ---
TELEGRAM_BOT_TOKEN | Telegram bot token from @BotFather | Yes | - | 1234567890:ABCdefGHIjklMNOpqrSTUvwx
ADMIN_IDS | Comma-separated admin user IDs | Yes | - | "123456789,987654321"
DATABASE_URL | PostgreSQL connection string | Yes | - | postgresql://user:pass@host:5432/db
OPENAI_API_KEY | OpenAI API key | Yes | - | sk-...
OPENAI_BASE_URL | OpenAI API base URL | No | https://api.openai.com/v1 | http://localhost:11434/v1
SUMMARY_MODEL | Model name for summaries | No | gpt-4o-mini | gemma3:27b
IMAGE_MODEL | Model name for image analysis | No | gpt-4o-mini | gemma3:27b
MENTION_CONTEXT_SIZE | Messages to include when mentioned | No | 30 | 50, 20
MENTION_CONTEXT_HOURS | Hours to look back for context | No | 4 | 6, 2
OLD_MENTION_CONTEXT_SIZE | Older messages for context | No | 10 | 15, 5
IMAGE_ANALYSIS_ENABLED | Enable/disable image analysis | No | true | false
SUMMARY_CHUNK_SIZE | Messages per chunk for large summaries | No | 70 | 100, 50
MAX_PARALLEL_CHUNKS | Maximum parallel chunks to process | No | 2 | 4, 1

Note: System prompts are now managed through text files. See the Prompt Customization section for details.

Getting Required Values

1. Telegram Bot Token

  1. Message @BotFather on Telegram
  2. Use /newbot command and follow instructions
  3. Copy the token provided

2. Admin User IDs

  1. Message @userinfobot on Telegram
  2. Copy your user ID from the response
  3. For multiple admins, separate IDs with commas: "123456789,987654321"
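
For illustration only, a value like "123456789,987654321" would typically be split into individual integer IDs at startup; the snippet below is a hypothetical sketch, not the bot's actual parsing code.

import os

# Hypothetical sketch: split ADMIN_IDS="123456789,987654321" into integer IDs
raw = os.environ.get("ADMIN_IDS", "").strip('"')
admin_ids = {int(part) for part in raw.split(",") if part.strip()}
print(admin_ids)  # e.g. {123456789, 987654321}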

3. OpenAI API Key

  1. Go to https://platform.openai.com/api-keys
  2. Create a new API key
  3. Copy the key (starts with sk-)

Using Alternative LLM Providers

SuntoBot supports any OpenAI-compatible API. Examples:

Local Ollama

# In .env
OPENAI_BASE_URL=http://localhost:11434/v1
OPENAI_API_KEY=ollama  # Can be anything for local Ollama
SUMMARY_MODEL=gemma3:27b
IMAGE_MODEL=gemma3:27b

Anthropic Claude (via proxy)

# In .env
OPENAI_BASE_URL=https://your-claude-proxy.com/v1
OPENAI_API_KEY=your_anthropic_api_key
SUMMARY_MODEL=claude-3-sonnet-20240229
IMAGE_MODEL=claude-3-sonnet-20240229

Advanced Configuration

Mention Behavior Settings

When users mention the bot in a group, these parameters control how the bot responds:

  • MENTION_CONTEXT_SIZE: Number of recent messages to consider (default: 30)
  • MENTION_CONTEXT_HOURS: Hours of chat history to look back (default: 4)
  • OLD_MENTION_CONTEXT_SIZE: Additional older messages for context (default: 10)

Example: If a user mentions the bot, it will analyze the last 30 messages within 4 hours, plus 10 older messages for additional context.
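
As a rough illustration of how the three settings above could combine (a hypothetical sketch, not the bot's actual implementation), the context selection might look like this:

from datetime import datetime, timedelta, timezone

def build_mention_context(messages, context_size=30, context_hours=4, old_context_size=10):
    """Hypothetical sketch: recent messages inside the time window,
    plus a few older messages prepended for extra context."""
    cutoff = datetime.now(timezone.utc) - timedelta(hours=context_hours)
    recent = [m for m in messages if m["date"] >= cutoff][-context_size:]
    older = [m for m in messages if m["date"] < cutoff][-old_context_size:]
    return older + recent  # assumes messages is sorted oldest to newest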

Image Analysis Configuration

  • IMAGE_ANALYSIS_ENABLED: Enable or disable image analysis functionality (default: true)

Summary Processing Configuration

These settings control how large conversations are processed (a short sketch follows the list below):

  • SUMMARY_CHUNK_SIZE: Number of messages per chunk for large summaries (default: 70)
  • MAX_PARALLEL_CHUNKS: Maximum number of chunks to process in parallel (default: 2)
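
As a rough sketch of how these two settings might be applied, assuming chunks are summarized independently and then merged into a meta-summary (illustrative only, not the project's actual code):

import asyncio

async def summarize_in_chunks(messages, summarize_chunk, chunk_size=70, max_parallel=2):
    """Hypothetical sketch: split messages into SUMMARY_CHUNK_SIZE-sized chunks
    and summarize at most MAX_PARALLEL_CHUNKS of them concurrently."""
    chunks = [messages[i:i + chunk_size] for i in range(0, len(messages), chunk_size)]
    semaphore = asyncio.Semaphore(max_parallel)

    async def run(chunk):
        async with semaphore:
            return await summarize_chunk(chunk)

    # The per-chunk summaries are later combined into a single meta-summary
    return await asyncio.gather(*(run(chunk) for chunk in chunks))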

Model Configuration

  • SUMMARY_MODEL: Model used for generating chat summaries
  • IMAGE_MODEL: Model used for analyzing images in chat (must support vision)

Custom System Prompts

You can customize how the bot behaves by setting custom prompts. See the Prompt Customization section below for detailed instructions on editing prompt files or using environment variables.

Available prompts:

  • SYSTEM_PROMPT: Controls how summaries are generated (default is in Italian)
  • MENTION_SYSTEM_PROMPT: Controls how the bot responds to mentions (default is in Italian)
  • CHUNK_SYSTEM_PROMPT: Controls how individual chunks are processed (default is in Italian)
  • META_SUMMARY_SYSTEM_PROMPT: Controls how final summaries are created from chunks (default is in Italian)
  • SYSTEM_PROMPT_CHUNK_PREAMBLE: Additional context for chunk processing (default is in Italian)
  • META_SUMMARY_SYSTEM_PROMPT_SUFFIX: Additional context for meta-summary generation (default is in Italian)

Note: The default prompts are in Italian. If you want English responses, you'll need to customize the prompts (see Prompt Customization section below).

Prompt Customization

SuntoBot's behavior is controlled by prompt templates stored in text files. You can customize these prompts in two ways:

Method 1: Edit Prompt Files Directly (Docker Volume Mount)

For easy prompt editing without rebuilding the Docker image, you can mount the prompts directory as a volume:

  1. Uncomment volume mount in docker-compose.yml:
  bot:
    image: git.marcorealacci.me/marcorealacci/suntobot:latest
    env_file: .env
    volumes:
      - ./prompts:/app/prompts  # Uncomment this line
  2. Copy prompts to your local directory:
# Create prompts directory if it doesn't exist
mkdir -p prompts

# Copy default prompts from a running container
docker-compose up -d
docker cp $(docker-compose ps -q bot):/app/prompts/. ./prompts/

# Or extract from the image directly
docker run --rm -v $(pwd)/prompts:/tmp/prompts git.marcorealacci.me/marcorealacci/suntobot:latest cp -r /app/prompts/. /tmp/prompts/
  3. Edit the prompt files directly. The following prompt files are available for customization:
  • prompts/system_prompt.txt - Main summary generation prompt
  • prompts/system_prompt_suffix.txt - Summary completion instructions
  • prompts/mention_system_prompt.txt - Bot mention response behavior
  • prompts/chunk_system_prompt.txt - Individual chunk processing
  • prompts/chunk_system_prompt_suffix.txt - Chunk processing completion
  • prompts/meta_summary_system_prompt.txt - Final summary from chunks
  • prompts/meta_summary_system_prompt_suffix.txt - Meta-summary completion
  • prompts/system_prompt_chunk_preamble.txt - Context for chunk processing
  4. Restart the bot to apply changes:
docker-compose restart bot

Method 2: Environment Variables (Legacy)

You can still override prompts using environment variables (these take precedence over file-based prompts):

# In .env
SYSTEM_PROMPT="Your custom summary prompt..."
MENTION_SYSTEM_PROMPT="Your custom mention reply prompt..."
CHUNK_SYSTEM_PROMPT="Your custom chunk processing prompt..."
META_SUMMARY_SYSTEM_PROMPT="Your custom meta-summary prompt..."
SYSTEM_PROMPT_CHUNK_PREAMBLE="Your custom chunk preamble..."
META_SUMMARY_SYSTEM_PROMPT_SUFFIX="Your custom meta-summary suffix..."
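
A minimal sketch of the precedence described above, where the environment variable wins and the prompt file is the fallback (the helper name is illustrative, not the project's actual loader):

import os
from pathlib import Path

def load_prompt(env_var: str, filename: str, prompts_dir: str = "prompts") -> str:
    """Hypothetical loader: return the environment variable if set,
    otherwise read the corresponding file from the prompts directory."""
    value = os.environ.get(env_var)
    if value:
        return value
    return Path(prompts_dir, filename).read_text(encoding="utf-8")

# Example: SYSTEM_PROMPT overrides prompts/system_prompt.txt when set
system_prompt = load_prompt("SYSTEM_PROMPT", "system_prompt.txt")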

Prompt Template Placeholders

The custom system prompts support the following placeholders that are automatically replaced:

SYSTEM_PROMPT placeholders:

  • {username}: The Telegram username of the user requesting the summary
  • {time_range}: A description of the time period being summarized (e.g., "last 2 hours", "since your last message")

CHUNK_SYSTEM_PROMPT placeholders:

  • {chunk_index}: The current chunk number being processed
  • {total_chunks}: The total number of chunks in the conversation

META_SUMMARY_SYSTEM_PROMPT placeholders:

  • {num_chunks}: The number of chunk summaries being combined

META_SUMMARY_SYSTEM_PROMPT_SUFFIX placeholders:

  • {time_range}: A description of the time period being summarized

MENTION_SYSTEM_PROMPT and SYSTEM_PROMPT_CHUNK_PREAMBLE do not use placeholders.

Example custom system prompt:

SYSTEM_PROMPT="You are a helpful assistant that creates summaries for {username}. 
Analyze the messages from {time_range} and provide a concise summary.
Focus on topics that mention or involve {username} directly."
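
Assuming the placeholders are filled with Python-style string formatting (illustrative only; the actual substitution code may differ), the prompt above would be rendered roughly like this:

# Illustrative only: placeholders replaced before the prompt is sent to the model
template = (
    "You are a helpful assistant that creates summaries for {username}. "
    "Analyze the messages from {time_range} and provide a concise summary."
)
rendered = template.format(username="alice", time_range="the last 2 hours")
print(rendered)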

Usage

Setting Up Your Bot

Step 1: Add Bot to Group

  1. Invite your bot to a Telegram group
  2. Make sure the bot has permission to read messages

Step 2: Allow the Group

An admin must run the /allow command in the group:

/allow

The bot will respond confirming the group is now allowed.

Step 3: Start Using Summary Commands

Once allowed, any group member can request summaries:

/sunto          # Summary since your last message
/sunto 1h       # Summary for last hour
/sunto 30m      # Summary for last 30 minutes
/sunto 2d       # Summary for last 2 days

Bot Commands

User Commands (in allowed groups):

  • /start - Show welcome message and help
  • /help - Show help information
  • /sunto - Get summary since your last message
  • /sunto <time> - Get summary for specified time period

Admin Commands:

  • /allow - Allow the current group to use the bot (use in group)
  • /deny <group_id> - Deny a group from using the bot (use in private chat with bot)
  • /list - List all allowed groups with IDs (use in private chat with bot)

Advanced Usage

Custom Time Formats

  • m - minutes (e.g., 30m, 45m)
  • h - hours (e.g., 2h, 12h)
  • d - days (e.g., 7d, 30d)
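
A minimal sketch of how an interval such as 30m, 2h, or 7d could be turned into a time delta (a hypothetical helper, not the bot's actual parser):

from datetime import timedelta

UNITS = {"m": "minutes", "h": "hours", "d": "days"}

def parse_interval(text: str) -> timedelta:
    """Hypothetical parser: '30m' -> 30 minutes, '2h' -> 2 hours, '7d' -> 7 days."""
    value, unit = int(text[:-1]), text[-1]
    return timedelta(**{UNITS[unit]: value})

print(parse_interval("2d"))  # 2 days, 0:00:00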

Managing Multiple Groups

Admins can manage multiple groups from a private chat with the bot:

  1. Get group IDs using /list command
  2. Deny groups using /deny <group_id>
  3. Allow new groups by using /allow in the target group

Troubleshooting

Bot Not Responding

  1. Check bot logs: docker-compose logs bot
  2. Verify the bot token in .env
  3. Ensure the bot has message reading permissions in the group

Database Issues

  1. Check database logs: docker-compose logs db
  2. Verify DATABASE_URL is correct in .env
  3. Try restarting: docker-compose restart

Summary Generation Issues

  1. Check your OpenAI API key and usage limits
  2. Verify OPENAI_BASE_URL if using alternative providers
  3. Check logs for specific error messages

Group Not Allowed

  1. Ensure an admin has run /allow in the group
  2. Check allowed groups with /list in private chat
  3. Verify your user ID is in ADMIN_IDS

Docker Management

Common Docker Commands

# Start the bot
docker-compose up -d

# Stop the bot
docker-compose down

# Restart the bot
docker-compose restart

# View logs
docker-compose logs -f bot        # Bot logs
docker-compose logs -f db         # Database logs  
docker-compose logs -f            # All logs

# Update to latest version
docker-compose pull
docker-compose up -d

# Reset database (⚠️ deletes all data)
docker-compose down -v
docker-compose up -d

Updating SuntoBot

# Pull latest image
docker-compose pull bot

# Restart with new version
docker-compose up -d bot

License

This project is licensed under the MIT License - see the LICENSE file for details.

Support

For issues and questions:

  • Open an issue on the project's repository
  • Check the documentation
  • Review the logs for error details