warbler-cda / DEPLOYMENT.md
Bellok
staged changes are still showing even after forced push.
55d584b
|
raw
history blame
2.47 kB

Warbler CDA HuggingFace Deployment

This directory contains the Warbler CDA package prepared for HuggingFace deployment.

Quick Start

Local Testing

# Install dependencies
pip install -r requirements.txt

# Install package in development mode
pip install -e .

# Run Gradio demo
python app.py

Deploy to HuggingFace Space

Option 1: Manual Deployment

# Install HuggingFace CLI
pip install huggingface_hub

# Login
huggingface-cli login

# Upload to Space
huggingface-cli upload YOUR_USERNAME/warbler-cda . --repo-type=space

Option 2: GitLab CI/CD (Automated)

  1. Set up HuggingFace token in GitLab CI/CD variables:

    • Go to Settings > CI/CD > Variables
    • Add variable HF_TOKEN with your HuggingFace token
    • Add variable HF_SPACE_NAME with your Space name (e.g., username/warbler-cda)
  2. Push to main branch or create a tag:

    git tag v0.1.0
    git push origin v0.1.0
    
  3. The pipeline will automatically sync to HuggingFace!

Package Structure

warbler-cda-package/
β”œβ”€β”€ warbler_cda/              # Main package
β”‚   β”œβ”€β”€ __init__.py
β”‚   β”œβ”€β”€ retrieval_api.py      # Core RAG API
β”‚   β”œβ”€β”€ semantic_anchors.py   # Semantic memory
β”‚   β”œβ”€β”€ stat7_rag_bridge.py   # STAT7 hybrid scoring
β”‚   β”œβ”€β”€ embeddings/           # Embedding providers
β”‚   β”œβ”€β”€ api/                  # FastAPI service
β”‚   └── utils/                # Utilities
β”œβ”€β”€ app.py                    # Gradio demo for HF Space
β”œβ”€β”€ requirements.txt          # Dependencies
β”œβ”€β”€ pyproject.toml            # Package metadata
β”œβ”€β”€ README.md                 # Documentation
└── LICENSE                   # MIT License

Features

  • Semantic Search: Natural language document retrieval
  • STAT7 Addressing: 7-dimensional multi-modal scoring
  • Hybrid Scoring: Combines semantic + STAT7 for superior results
  • Production API: FastAPI service with concurrent query support
  • CLI Tools: Command-line interface for management
  • HF Integration: Direct dataset ingestion

Testing

# Run tests
pytest

# Run specific experiments
python -m warbler_cda.stat7_experiments

Documentation

See README.md for full documentation.

Support