🧠 Local-DocuBrain-RAG-Engine

An enterprise-grade, memory-optimized Retrieval-Augmented Generation (RAG) engine designed to chat with complex local documentation. Powered 100% locally and offline by Ollama (Phi-3 & Nomic-Embed-Text) using the LlamaIndex framework, this architecture guarantees absolute data privacy within an isolated local system environment.

🎯 Engineering Highlights & Capabilities

Persistent Vector Indexing: Computes mathematical document embeddings once and caches the indices to disk (db/). Future applications boot instantly with zero CPU overhead.
Bounded Context Windows: Hard-restricts model context loops to 2048 tokens to prevent system memory overload or massive RAM allocations during long generation passes.
Deterministic Local Grounding: Integrates strict prompt injection to force the model to answer questions using only localized facts, completely eliminating hallucinations.

⚙️ Core System Architecture

graph TD
    A[Raw Source Documents] --> B(SimpleDirectoryReader)
    B --> C(OllamaEmbedding: nomic-embed-text)
    C --> D[Persistent Storage Cache: /db]
    E[User Query] --> F(LlamaIndex Retrieval Node)
    D --> F
    F -->|Bounded Context Window| G[Local Ollama Daemon: phi3]
    G --> H[Deterministic Answer Output]

    style C fill:#2a2a2a,stroke:#4f46e5,stroke-width:2px;
    style D fill:#1e1b4b,stroke:#818cf8,stroke-width:2px;
    style H fill:#064e3b,stroke:#34d399,stroke-width:2px;

🚀 Local Environment Initialization & Setup

1. Prerequisites

Operating System: Windows 11 (PowerShell Isolation)
Python Runtime: Python 3.10+
Local Inference Daemon: Ollama Runtime Engine active with models pre-pulled:
```
ollama pull phi3
ollama pull nomic-embed-text
```

2. Sandbox Setup & Virtual Environment Isolation

# Navigate to project root folder
cd "F:\Local AI Library"

# Initialize and activate isolated workspace
python -m venv fresh_venv
.\fresh_venv\Scripts\Activate.ps1

3. Production Framework Installation

pip install llama-index llama-index-llms-ollama llama-index-embeddings-ollama pypdf

🛠️ Execution Specification

Drop target document assets (PDF, TXT, or MD format) directly into your local documents/ directory and execute the primary runtime orchestration file:

python app.py

On its first run, the framework builds the vector index from scratch. Subsequent boots read instantly from the local cached db/ block.

📁 Repository Directory Matrix

Local-DocuBrain-RAG-Engine/
├── fresh_venv/               # Isolated Local Virtual Environment (Ignored)
├── db/                       # Persistent Vector Database Indices (Ignored Cache)
├── documents/                # Target ingestion directory for source context files
├── .gitignore                # Production Isolation and Safety Tracking Matrix
├── app.py                    # Core Unified LlamaIndex RAG Orchestration Engine
└── README.md                 # System Overview & Architecture Documentation

🔐 Security & Data Isolation

The project's .gitignore asset completely isolates heavy background environment binaries (fresh_venv/), local database binaries (db/), and unindexed raw data files. This structure allows seamless team collaboration while ensuring private assets remain 100% confidential.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
Local_DocuBrain_Project_Report.docx		Local_DocuBrain_Project_Report.docx
README.md		README.md
app.py		app.py
generate_report.py		generate_report.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🧠 Local-DocuBrain-RAG-Engine

🎯 Engineering Highlights & Capabilities

⚙️ Core System Architecture

🚀 Local Environment Initialization & Setup

1. Prerequisites

2. Sandbox Setup & Virtual Environment Isolation

3. Production Framework Installation

🛠️ Execution Specification

📁 Repository Directory Matrix

🔐 Security & Data Isolation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🧠 Local-DocuBrain-RAG-Engine

🎯 Engineering Highlights & Capabilities

⚙️ Core System Architecture

🚀 Local Environment Initialization & Setup

1. Prerequisites

2. Sandbox Setup & Virtual Environment Isolation

3. Production Framework Installation

🛠️ Execution Specification

📁 Repository Directory Matrix

🔐 Security & Data Isolation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages