# RAG Engine MVP Tasks - AI Study Assistant

## 🧠 MVP RAG ENGINE ROADMAP (8-12 WEEKS)

> ⚠️ **IMPORTANT - OUTDATED DOCUMENT**: This document describes a Python/FAISS architecture that **WAS NOT IMPLEMENTED**.
>
> **Actual Implementation:**
> - **Language**: Dart (Flutter)
> - **Location**: `lib/core/services/materials_rag_service.dart`, `lib/core/services/rag_ai_service.dart`
> - **Vector Store**: Firestore with mock (hash-based) embeddings
> - **PDF Processing**: `syncfusion_flutter_pdf` (not Python)
> - **Search**: Keyword window search (not FAISS)
>
> **NOT PRESENT:** Python, FAISS, Sentence Transformers, OpenAI, Anthropic

---

## 🧠 MVP RAG ENGINE ROADMAP (ORIGINAL DOCUMENTATION - NOT IMPLEMENTED)

---

## 📚 WEEK 1-2: FOUNDATION & SETUP (NOT IMPLEMENTED)

### Task 1.1: Vector Database Setup
**Priority**: Critical
**Estimated Time**: 8 hours
**Dependencies**: None
**Status**: ❌ NOT IMPLEMENTED - FAISS is not used

#### Subtasks:
- [ ] Choose vector database technology (FAISS for MVP)
- [ ] Set up development environment
- [ ] Install required dependencies
- [ ] Configure storage for vector indices
- [ ] Create basic vector operations
- [ ] Set up backup and recovery

#### What Actually Exists:
```dart
// lib/core/services/vector_service.dart
class VectorService {
  // Mock embedding generation using text hashing
  static List<double> generateEmbedding(String text) {
    final embedding = List<double>.filled(384, 0.0);
    // Hash-based deterministic embeddings (not ML)
    return embedding;
  }
}

// lib/core/services/materials_rag_service.dart
class MaterialsRAGService {
  // Keyword-based window search for PDFs
  static Future<String> getContextForQuestion(...) async {
    // 1. Extract PDF text with syncfusion_flutter_pdf
    // 2. Find keyword matches
    // 3. Return window of 1200 chars around match
    // NO FAISS, NO VECTOR SEARCH, NO PYTHON
  }
}
```
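
The outline above elides the bodies of both methods. As a rough illustration only, the two ideas it names (a deterministic hash-based mock embedding and a keyword window search) could be sketched as follows. The function names `hashEmbedding` and `keywordWindow`, the rolling hash, the tokenizer, and the "keywords are words longer than 3 characters" rule are all assumptions made for this sketch, not the project's actual code; only the 384 dimensions and the 1200-character window come from the outline.

```dart
import 'dart:math' as math;

// Splits on anything that is not a Unicode letter or digit, so accented
// words stay intact. (Assumed tokenizer, not taken from the project.)
final _nonWord = RegExp(r'[^\p{L}\p{N}]+', unicode: true);

/// Hypothetical mock embedding: buckets token hashes into a fixed-size
/// vector and L2-normalizes it. Deterministic, but not semantic.
List<double> hashEmbedding(String text, {int dimensions = 384}) {
  final vector = List<double>.filled(dimensions, 0.0);
  for (final token in text.toLowerCase().split(_nonWord)) {
    if (token.isEmpty) continue;
    // Simple polynomial rolling hash, masked to 31 bits so it stays
    // independent of String.hashCode and safe on every Dart platform.
    var hash = 7;
    for (final unit in token.codeUnits) {
      hash = (hash * 31 + unit) & 0x7fffffff;
    }
    vector[hash % dimensions] += 1.0;
  }
  final norm = math.sqrt(vector.fold<double>(0.0, (s, v) => s + v * v));
  return norm == 0 ? vector : [for (final v in vector) v / norm];
}

/// Hypothetical keyword window search: returns up to [windowSize]
/// characters around the first keyword found, or '' when nothing matches.
String keywordWindow(String documentText, String question,
    {int windowSize = 1200}) {
  // Assumption: treat words longer than 3 characters as keywords.
  final keywords = question.split(_nonWord).where((w) => w.length > 3);
  for (final keyword in keywords) {
    final match = RegExp(RegExp.escape(keyword), caseSensitive: false)
        .firstMatch(documentText);
    if (match == null) continue;
    final start = math.max(0, match.start - windowSize ~/ 2);
    final end = math.min(documentText.length, start + windowSize);
    return documentText.substring(start, end);
  }
  return '';
}

void main() {
  final first = hashEmbedding('keyword window search');
  final second = hashEmbedding('keyword window search');
  print(first.length); // 384
  print(first.toString() == second.toString()); // true

  final text = ('filler ' * 400) + 'Photosynthesis converts light.';
  final context = keywordWindow(text, 'What is photosynthesis?');
  print(context.contains('Photosynthesis')); // true
}
```

Because the vector depends only on which tokens appear, two texts score as similar only when they share words, which is why the status note describes these embeddings as mock rather than ML-based.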

#### Technology Stack (Original - NOT USED):
```bash
# Core dependencies
pip install faiss-cpu  # or faiss-gpu for GPU acceleration
pip install sentence-transformers
pip install numpy
pip install pandas
pip install scikit-learn

# Text processing
pip install nltk
pip install spacy
python -m spacy download en_core_web_sm

# Storage and utilities
pip install firebase-admin
pip install google-cloud-storage
# pickle ships with the Python standard library; no pip install is needed
pip install h5py

# These dependencies DO NOT EXIST in the project:
# ❌ pip install faiss-cpu
# ❌ pip install sentence-transformers
# ❌ pip install numpy
# ❌ pip install nltk
# ❌ pip install spacy
```

#### Project Structure: