DEVAN Agent transforms Excel spreadsheets and PDF documents into structured, queryable, AI-ready data — entirely on your machine. Zero cloud. Zero compromise.
Businesses hold years of operational knowledge locked inside Excel files and PDF reports. That data is too structured for LLMs to read directly, and too sensitive to send to the cloud.
Data trapped in silos
Excel files with dozens of sheets, formulae, and references. PDFs with tables, charts, and embedded data. None of it natively digestible by language models.
Privacy prevents cloud upload
Financial records, HR data, client information — you can't upload it to a third-party service. GDPR, compliance, and plain common sense stand in the way.
Custom pipelines take months
Writing custom ETL pipelines to extract, clean, and structure this data takes engineering teams months — before a single LLM query can run.
DEVAN bridges the gap
Upload your files. Ask questions in plain English. Get structured data, charts, and SQL queries — all running locally in minutes, not months.
No configuration. No cloud sign-up. No data leaving your machine.
Upload
Drop any Excel workbook or PDF document into the DEVAN interface. Multi-sheet workbooks, scanned PDFs, and complex layouts all supported.
Ask
Type a question in plain English. DEVAN's AI selects the right MCP tool, extracts structured data, and runs queries — automatically.
Export
Download clean CSV, JSON, Excel, or Parquet. Copy the generated SQL. Share a chart. Your data — your format.
System overview
┌─────────────────────────────────────────────────────────┐ │ Your Machine │ │ │ │ ┌──────────────┐ ┌───────────────────────────┐ │ │ │ Your files │─────▶│ DEVAN Agent (Docker) │ │ │ │ Excel · PDF │ │ │ │ │ └──────────────┘ │ ┌─────┐ ┌──────────┐ │ │ │ │ │ LLM │ │ 8 MCP │ │ │ │ ┌──────────────┐ │ │local│ │ servers │ │ │ │ │ Browser UI │◀────▶│ └─────┘ └──────────┘ │ │ │ │ localhost: │ │ │ │ │ │ 5001 │ └───────────────────────────┘ │ │ └──────────────┘ │ │ Zero data exits ──▶ ✗ │ └─────────────────────────────────────────────────────────┘
Each MCP server is purpose-built for a specific data task, orchestrated by DEVAN's AI to answer your questions end-to-end.
Excel Retriever
Multi-sheet extraction, formula resolution, data type inference, and relationship detection across complex workbooks.
PDF Extractor
Layout-aware text extraction, OCR for scanned documents, table parsing, and chunking for RAG pipelines.
Data Modelling
Infer relational structure from flat files. Export to JSON, Parquet, Arrow, or SQLite. Run DuckDB queries directly.
Prompt Engineering
Curated prompt templates for document summarisation, data dictionary generation, and structured extraction tasks.
Guardrail
LLM output safety validation with PII detection, content filtering, and audit logging for compliance-sensitive workflows.
Web Design
React component generation from natural language descriptions. Ideal for rapid prototyping data dashboards.
Document Intelligence
Universal document understanding with Apache Arrow / Parquet compute tools. Handles mixed-format datasets: Excel, PDF, CSV, and JSON in a unified pipeline.
Audit & Compliance
Full audit trail of every LLM call, tool invocation, and data access event. Exportable log for enterprise compliance teams.
DEVAN runs entirely in a local Docker container. No API calls for your data. No telemetry. No third-party storage. Your files are processed and discarded on your own hardware — full stop.
Outbound network requests
0
No account required. No credit card. Just Docker.
Pull the image
Grab the latest DEVAN Agent image from Docker Hub.
docker pull m2laborg/devan-agent
Run it
Mount your data folder and start the agent.
docker run -p 5001:5001 \
-v ~/data:/data \
m2laborg/devan-agent
Open the UI
Navigate to the agent in your browser and start asking questions.
open http://localhost:5001
Ready to make your data speak?
Star the repo, pull the image, or read the docs — everything is free and open source.