An intuitive, AI-guided modeling platform that helps non-expert analysts build, explore, and deploy machine learning models—without writing a line of code.
This project aims to democratize modeling by combining powerful ML automation, storytelling-driven user experience, and visual workflows, wrapped in a seamless SaaS front end. The long-term vision includes a fully open-source deployment engine and intelligent model lifecycle management.
- Drag-and-drop data ingestion (CSV/XLSX)
- Automated EDA & visual insights
- AI-powered model recommendations
- Narrative-driven workflow guidance
- Feature engineering & preprocessing
- Feature Store - Centralized feature repository with versioning and reusability
- Model training and explainability tools
Future phases will include:
- Open-source deployment layer
- Automated model monitoring & retraining
- Data integration & streaming pipelines
narrative-modeling-app/
├── apps/
│ ├── frontend/ # Next.js + Tailwind UI
│ ├── backend/ # FastAPI backend for ML orchestration
│ └── mcp/ # MCP server for advanced data processing
├── ml/ # Python modeling scripts & training logic
├── shared/ # Shared types, constants, and utilities
├── infrastructure/ # Infrastructure as code (deployment configs)
├── scripts/ # Utility scripts for development
├── docs/ # Project documentation
├── .github/ # GitHub Actions / CI workflows
├── README.md
└── .gitignore
- Frontend: Next.js, Tailwind CSS, NextAuth v5 (Auth), React Flow
- Backend: FastAPI, Python, Pydantic, Beanie ODM
- Modeling: scikit-learn, pandas, XGBoost, SHAP
- Database: MongoDB Atlas (cloud-hosted) with Redis caching
- Storage: AWS S3
- Auth: NextAuth with Google/GitHub providers
- Dev Tools: GitHub, Linear (issue tracking), uv (Python), Docker
⚠️ Project is in active solo development. Contributions and deployment tooling will be part of Phase 2+.
cd apps/backend
uv sync # Install dependencies
uvicorn app.main:app --reloadcd apps/frontend
npm install
npm run dev- Backend: Copy
.env.exampleto.envand configure- MongoDB Atlas connection required (no local MongoDB needed)
- Set
MONGODB_URIto your Atlas connection string - Configure AWS S3 credentials for file storage
- Frontend: Copy
.env.local.exampleto.env.localand configure- Set
NEXT_PUBLIC_API_URLto backend URL
- Set
- Development: Set
SKIP_AUTH=trueto bypass authentication
✅ Sprint 11 Complete: Data Model Refactoring & Performance Benchmarking
- ✅ Model Architecture Refactoring - UserData split into DatasetMetadata, TransformationConfig, ModelConfig
- ✅ Data Versioning Foundation - Content-based hashing, lineage tracking, S3 integration
- ✅ Migration Testing Infrastructure - Volume testing, rollback procedures, data integrity verification
- ✅ Performance Benchmarking - pytest-benchmark framework with throughput targets
- ✅ 100% Test Pass Rate - 214/214 tests passing with 85%+ coverage
🟢 Sprint 12: 87% Complete - API Integration & Production Readiness (33/38 story points)
- ✅ API Integration - Version API routes with 23/23 tests passing
- ✅ Data Versioning API - Version tracking, lineage, recipe management
- ✅ Production Deployment - Model deployment API with 45/45 tests passing
- ✅ MongoDB Atlas Migration - Integration tests now use cloud-hosted Atlas
- ✅ Critical Bug Fixes (2025-11-11) - Fixed 11 runtime bugs + 1 critical security vulnerability (PR #48)
- 🚧 AutoML Integration - In progress
- 🚧 CI/CD Pipeline - Integration tests migrated to Atlas
For comprehensive documentation, see DOCUMENTATION_INDEX.md
Quick links:
- Local Development Setup
- Current Sprint (Sprint 12)
- Sprint History
- User Stories
- Production Deployment
- Production API Guide
Copyright © Frank Bria Future deployment engine intended for release under an open-source license (MIT or Apache 2.0 TBD)
Frank Bria
Building solo with help from ChatGPT & GitHub Copilot
frankbria.com