
Bugfix/backend fixes and test suite stabilization#107

Merged
MasumRab merged 17 commits into main from bugfix/backend-fixes-and-test-suite-stabilization
Sep 28, 2025

Conversation

Owner

@MasumRab MasumRab commented Sep 28, 2025

Summary by CodeRabbit

  • New Features

    • Backend now serves the Single-Page App with a catch‑all route.
    • Email search supports combining a category filter with a search term.
    • Default categories and data files initialized.
  • Bug Fixes

    • Clearer Gmail API error messages in failures.
  • Documentation

    • Overhauled README with simplified local setup.
    • Added workflow documentation.
  • Chores

    • Added CI and linting/type-check configs.
    • Simplified launcher (Python 3.12 supported; removed CUDA/Gradio options).
    • Replaced Node/Vite tooling with a Python-based run script.
  • Style

    • UI tweaks: reduced border radius; base font size set to 14px.
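The catch-all SPA behavior listed above reduces to "serve the asset if it exists, otherwise index.html". A stdlib sketch of that resolution, assuming a hypothetical `client/dist` build output directory (the PR's actual FastAPI route and paths may differ):

```python
# Hypothetical sketch of catch-all SPA path resolution. "client/dist"
# is an assumed build output location, not taken from the PR's code.
from pathlib import Path

DIST = Path("client/dist")

def resolve_spa_path(url_path: str) -> Path:
    candidate = (DIST / url_path.lstrip("/")).resolve()
    # Only serve real files inside the dist directory; any other URL
    # falls through to the SPA shell so client-side routing takes over.
    if candidate.is_file() and DIST.resolve() in candidate.parents:
        return candidate
    return DIST / "index.html"
```

In a FastAPI app this function would back a `/{full_path:path}` route registered after the API routes.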

google-labs-jules bot and others added 17 commits June 18, 2025 07:08
… done so far and provide feedback for Jules to continue.
This commit introduces the 'scientific' branch, a significantly streamlined version of the application tailored for simpler deployments and focused on core functionalities.

Key changes include:

1.  **Branch Creation:**
    *   Created the `scientific` branch from the main development line.

2.  **Codebase Slimming & Refactoring:**
    *   **Removed Enterprise & Docker Configurations:** Deleted the `deployment/` directory and root Docker Compose files.
    *   **Simplified Backend Data Storage:**
        *   Refactored `server/python_backend/database.py` to use JSON files (`emails.json`, `categories.json`, `users.json`) instead of PostgreSQL.
        *   Removed PostgreSQL dependencies (`psycopg2-binary`, `asyncpg`) from Python requirements.
        *   Removed Node.js PostgreSQL dependencies (`pg`, `drizzle-orm`, `connect-pg-simple`, `drizzle-kit`) and related files (`server/db.ts`, `shared/schema.ts`).
        *   Simplified `smart_filters.db` (SQLite) schema by removing the unused `google_scripts` table.
    *   **Simplified Frontend (UI):**
        *   Removed `StatsCards`, `RecentActivity`, and `CategoryOverview` components from the dashboard.
        *   Simplified the AI Control Panel and header elements on the dashboard.
        *   Integrated `AIAnalysisPanel` to display when an email is selected.
        *   Removed `recharts` (charting library) from client dependencies.
    *   **Streamlined Python Backend & NLTK Pipeline:**
        *   Removed `dashboard_routes.py`, `gradio_app.py`, performance monitoring (`performance_monitor.py`, `metrics.py`), action item extraction features (`action_routes.py`, `action_item_extractor.py`), and AI training (`ai_training.py`).
        *   Removed unused NLP utilities (`data_strategy.py`, `retrieval_monitor.py`).
        *   Updated `NLPEngine` and `AdvancedAIEngine` to remove dependencies on deleted modules.
        *   Removed associated test files for many of these components.

3.  **Styling Updates:**
    *   Adjusted global CSS (`client/src/index.css`) for a more compact appearance (reduced corner radius, smaller base font size) inspired by functional UIs.

4.  **Environment & Setup Simplification:**
    *   Removed `gradio`, `pyngrok` from Python requirements.
    *   Significantly simplified `launch.py` by removing Gradio UI, ngrok/share, PyTorch/CUDA specifics, and extension/model management features.
    *   Created a new `README.md` tailored for the `scientific` branch, detailing the simplified setup process.

This branch is intended for you if you need the core email analysis and smart filtering capabilities with a minimal setup footprint, suitable for local development, research, or scientific use cases.
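The JSON-file storage described above boils down to a pattern like the following minimal sketch; the real `DatabaseManager` in `backend/python_backend/database.py` is more involved, and the class and file names here are illustrative only:

```python
import json
from pathlib import Path

class JsonStore:
    """Tiny JSON-file-backed store standing in for the removed PostgreSQL layer."""

    def __init__(self, path: str) -> None:
        self.path = Path(path)
        # An absent file simply means an empty dataset.
        self.records = json.loads(self.path.read_text()) if self.path.exists() else []

    def add(self, record: dict) -> None:
        self.records.append(record)
        # Persist the whole list on every write; acceptable for small local datasets.
        self.path.write_text(json.dumps(self.records, indent=2))

store = JsonStore("emails_demo.json")  # hypothetical filename
store.add({"id": 1, "subject": "hello"})
```

Note this write-whole-file approach is only safe with a single process; concurrent workers would need locking or a real database.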
…ix-tests

Fix/refactor email routes and fix tests
This commit refactors the application to be a pure Python server, removing the Node.js/TypeScript backend and all associated dependencies.

Changes:
- All Python source code from `server/python_backend` and `server/python_nlp` has been consolidated into a new, single `backend` directory.
- The `extensions` directory and database files have also been moved into the `backend` directory.
- All Python import statements and hardcoded file paths have been updated to reflect the new directory structure.
- The FastAPI server has been modified to serve the frontend assets.
- A new `run.py` script has been created at the project root to provide a simple entrypoint for the application.

Known Issues and Next Steps:
- Due to persistent environment errors (`TMP RAM FS is not large enough`), I was unable to build the frontend assets or remove the leftover Node.js files. The application is configured to serve the raw frontend files from the `client` directory as a temporary measure. The next step is to build the frontend and update the server to serve the built assets from the `dist` directory.
- The static file paths in `backend/python_backend/main.py` are likely incorrect and need to be adjusted to be relative to the `main.py` file.
- The `run.py` entrypoint could be improved by moving it into the `backend` directory and adjusting the run command accordingly.
- The application requires downloading large machine learning models, which may cause timeouts in some environments. Running the `download_hf_models.py` script before starting the server is recommended.
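A `run.py` entrypoint of the kind described might look like this sketch; the ASGI path `backend.python_backend.main:app` follows the directory layout above, and the host/port defaults are assumptions:

```python
# Hypothetical run.py sketch: a single entrypoint that starts the
# FastAPI app with Uvicorn. Invoke main() from a standard
# `if __name__ == "__main__":` guard in the real script.
import os

def main() -> None:
    # uvicorn is imported lazily so the entrypoint can be inspected
    # without the server dependencies installed.
    import uvicorn

    uvicorn.run(
        "backend.python_backend.main:app",
        host=os.getenv("HOST", "127.0.0.1"),
        port=int(os.getenv("PORT", "8000")),
        reload=os.getenv("NODE_ENV") == "development",
    )
```
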
Introduces configuration files for linting (.flake8, .pylintrc), ignore rules (.gitignore), and project templates (.continue/ and codebuff.json). Adds project knowledge documentation (knowledge.md) and initial rule, model, and prompt YAMLs for the EmailIntelligence project.
commit 94375f0
Author: MasumRab <8943353+MasumRab@users.noreply.github.com>
Date:   Mon Jun 16 17:07:39 2025 +1000

    Create diagnosis_message.txt
- Add dependabot-auto-merge.yml workflow that automatically merges Dependabot PRs when tests pass
- Add ci.yml workflow for comprehensive testing on all PRs and pushes
- Include safety checks: test execution, linting, formatting, and merge readiness verification
- Add pytest-cov dependency for coverage reporting
- Add documentation for workflow setup and customization

Co-authored-by: openhands <openhands@all-hands.dev>
CRITICAL FIXES:
- Replace fragile bash JSON parsing with GitHub's native PR status checks
- Consolidate auto-merge steps into single action with comprehensive error handling
- Remove unnecessary matrix strategy from single-version CI
- Add proper error handling for GitHub CLI operations with graceful degradation
- Eliminate workflow duplication by trusting CI results instead of re-running tests

IMPROVEMENTS:
- Use GitHub context variables (mergeable_state, draft) instead of API calls
- Implement wait-for-check action to properly depend on CI completion
- Add set -e for proper error propagation in bash scripts
- Fix mypy configuration to show meaningful errors
- Update documentation to reflect architectural improvements

This addresses all fundamental reliability and complexity issues identified in code review.

Co-authored-by: openhands <openhands@all-hands.dev>
- Updated all dependencies to latest versions (64 packages upgraded)
  * FastAPI 0.115.12 → 0.117.1
  * Pydantic 2.11.5 → 2.11.9 (with v2 migration)
  * PyTorch 2.7.1 → 2.8.0
  * Transformers 4.52.4 → 4.56.2
  * And many more core dependencies

- Fixed Pydantic v2 compatibility issues:
  * Migrated @validator to @field_validator
  * Updated Config to ConfigDict
  * Fixed min_items → min_length
  * Resolved syntax errors in models
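The Pydantic v1 to v2 changes listed above can be illustrated on a toy model (this is a sketch, not one of the PR's actual models):

```python
# Sketch of the v1 -> v2 migrations: @validator -> @field_validator,
# class Config -> model_config = ConfigDict(...), min_items -> min_length.
from pydantic import BaseModel, ConfigDict, Field, field_validator

class EmailIn(BaseModel):
    # v1: class Config: anystr_strip_whitespace = True
    model_config = ConfigDict(str_strip_whitespace=True)

    subject: str
    # v1: Field(min_items=1)   ->   v2: Field(min_length=1)
    recipients: list[str] = Field(min_length=1)

    # v1: @validator("subject")   ->   v2: @field_validator("subject")
    @field_validator("subject")
    @classmethod
    def subject_not_empty(cls, v: str) -> str:
        if not v:
            raise ValueError("subject must not be empty")
        return v

email = EmailIn(subject="  Hi  ", recipients=["a@example.com"])
```
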

- Modernized launcher system:
  * Replaced deprecated pkg_resources with importlib.metadata
  * Extended Python support to 3.11-3.12 range
  * Fixed module import paths (server → backend)
  * Improved async database initialization
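The `pkg_resources` to `importlib.metadata` swap mentioned above typically looks like this (a sketch of the general pattern, not the launcher's exact code):

```python
# importlib.metadata replaces the deprecated pkg_resources API for
# querying installed package versions.
from importlib import metadata
from typing import Optional

def installed_version(package: str) -> Optional[str]:
    # Replacement for pkg_resources.get_distribution(package).version
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None

print(installed_version("pip"))
```
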

- Code quality improvements:
  * Removed unused imports using unimport
  * Fixed async/await patterns
  * Enhanced error handling

- Added comprehensive repository documentation:
  * Created .openhands/microagents/repo.md
  * Documented project structure and setup
  * Included development guidelines

- Verified functionality:
  * All tests passing (category routes: 4/4)
  * API server running correctly
  * Launcher system working properly
  * Dependencies properly updated and locked

Co-authored-by: openhands <openhands@all-hands.dev>
Combines latest repository updates with the improved GitHub Actions workflows:
- Maintains all critical workflow fixes (native GitHub API usage, error handling)
- Preserves pytest-cov dependency for coverage reporting
- Integrates new backend improvements and test updates

Co-authored-by: openhands <openhands@all-hands.dev>
The `get_emails` endpoint did not previously support searching within a specific category. This change adds the ability to filter emails by both a search term and a category ID simultaneously.

A new `search_emails_by_category` method has been added to the `DatabaseManager` to handle the combined query. The `get_emails` route in `email_routes.py` has been updated to use this new method when both `search` and `category_id` are provided.

A new test case has been added to verify the new functionality, and existing tests have been refactored for clarity and maintainability.
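The combined filter could be sketched like this over the JSON-backed records; field names such as `category_id` are assumptions, and the actual `DatabaseManager.search_emails_by_category` differs:

```python
def search_emails_by_category(emails, term, category_id):
    """Filter by category id AND a case-insensitive term in subject/content."""
    term = term.lower()
    return [
        e for e in emails
        if e.get("category_id") == category_id
        and (term in e.get("subject", "").lower()
             or term in e.get("content", "").lower())
    ]

emails = [
    {"id": 1, "subject": "Invoice due", "content": "", "category_id": 2},
    {"id": 2, "subject": "Invoice paid", "content": "", "category_id": 3},
]
print(search_emails_by_category(emails, "invoice", 2))  # only email 1 matches
```
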
This commit addresses several bugs in the Python backend and improves the reliability of the test suite.

- **ai_engine.py:**
  - Fixed a bug where a database call was made unnecessarily when the AI analysis returned no categories.
  - Added a check to ensure `db.get_all_categories()` is only called when there are categories to match.

- **filter_routes.py:**
  - Added missing `await` keywords to `async` function calls in the `generate_intelligent_filters` and `prune_filters` routes.
  - Fixed a bug in the `create_filter` route where it was not correctly serializing the `actions` object.
  - Corrected the `description` attribute access in the `create_filter` route.

- **gmail_routes.py:**
  - Improved error handling for `GoogleApiHttpError` to prevent crashes when the error response has an unexpected format.

- **smart_retrieval.py:**
  - Fixed a command-line argument parsing error by adding `--strategies` as an alias for `--strategy-names`.

- **Test Suite:**
  - Stabilized the test suite by fixing test isolation issues, correcting mock setups, and updating test payloads to match Pydantic models.
  - All 28 tests in the backend test suite now pass.
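The `--strategies` alias fix above can be reproduced with argparse's support for multiple option strings on one argument (a sketch, not the PR's exact code):

```python
import argparse

parser = argparse.ArgumentParser()
# Both spellings feed the same destination, so existing invocations
# using --strategy-names keep working alongside the new --strategies.
parser.add_argument("--strategies", "--strategy-names", dest="strategies", nargs="+")

args = parser.parse_args(["--strategy-names", "recent", "important"])
print(args.strategies)  # ['recent', 'important']
```
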

Contributor

coderabbitai bot commented Sep 28, 2025

Walkthrough

This PR transitions the app from a mixed Node/Express + Python stack to a Python-first FastAPI setup. It removes the Node server and TS toolchain, adds CI workflows, introduces SPA static serving, refactors Python imports and data handling, removes action_items and training from NLP paths, and adds new configs/data.

Changes

Cohort / File(s) Summary
GitHub Workflows
.github/workflows/ci.yml, .github/workflows/dependabot-auto-merge.yml, .github/workflows/README.md
Adds CI (tests, lint, type-check), Dependabot auto-merge workflow, and documentation of workflows.
Continue Configs
.continue/models/new-model.yaml, .continue/prompts/new-prompt.yaml, .continue/rules/new-rule.yaml
Adds model, prompt, and rule YAMLs for tooling integration.
Docs & Knowledge
README.md, knowledge.md, .openhands/microagents/repo.md, diagnosis_message.txt, server/README.md
Rewrites/introduces project docs; adds repo overview and diagnostic transcript; removes server README.
Dev Tooling & Config
.flake8, .pylintrc, .gitignore, pyproject.toml, codebuff.json, postcss.config.js, tailwind.config.ts, tsconfig.json, vite.config.ts, setup.js
Adds Python lint configs; updates deps; adds CodeBuff config; removes JS/CSS build configs and setup script; adjusts ignores.
Client Assets
client/src/index.css, client/package.json
Tweaks CSS radius and base font-size; removes client package manifest.
Launcher & Run
launch.py, run.py
Simplifies launcher (drops CUDA/ngrok/gradio flags/flows), updates ASGI target, adds simple Uvicorn runner.
Backend Package & Data
backend/__init__.py, backend/data/categories.json, backend/data/emails.json, backend/data/users.json
Marks backend as package; adds default categories; initializes empty emails/users datasets.
Python Backend Core
backend/python_backend/main.py, .../run_server.py, .../email_routes.py, .../filter_routes.py, .../gmail_routes.py, backend/extensions/example/example.py, backend/python_backend/__init__.py
Switches imports from server.* to backend.*; mounts SPA static files and catch-all route; enhances email search+category logic; awaits async filter ops; improves Gmail error detail logging; simplifies relative imports.
Database Manager Enhancements
backend/python_backend/database.py
Centralizes constants, lazy-inits data, adds category enrichment, expands search (incl. by category), normalizes fields, updates save/load, adds helpers and new method search_emails_by_category.
AI/NLP Engine Changes
backend/python_backend/ai_engine.py, backend/python_nlp/nlp_engine.py, backend/python_nlp/ai_training.py, backend/python_nlp/smart_retrieval.py, backend/python_nlp/gmail_service.py
Removes action_items support and train_models; adjusts imports; adds ModelConfig stub; adds --strategies CLI option; updates import paths.
Performance Monitoring
backend/python_backend/performance_monitor.py
Adds simple metrics recorder with context manager timing.
Tests (Python)
backend/python_backend/tests/*, backend/python_nlp/tests/analysis_components/*
Updates imports to backend.*; adds Gmail routes test module; adjusts fixtures and new search-in-category test; aligns with API shape changes.
Node/Express Server Removal
server/... (all removed: index.ts, routes.ts, storage.ts, route modules and tests, ai-engine, python-bridge, vite, init-db, etc.)
Deletes the Node server, routes, storage layer, AI engine, Python bridge, Vite integration, and associated tests.
Project JS/Build Removal
package.json, drizzle.config.ts
Removes root Node package and Drizzle config.

Sequence Diagram(s)

sequenceDiagram
  autonumber
  participant C as Client
  participant API as FastAPI email_routes
  participant DB as DatabaseManager

  C->>API: GET /api/emails?search=&category_id=
  alt search and category_id provided
    API->>DB: search_emails_by_category(search, category_id)
  else search only
    API->>DB: search_emails(search)
  else category_id only (not None)
    API->>DB: get_emails_by_category(category_id)
  else none
    API->>DB: get_all_emails()
  end
  DB-->>API: emails[]
  API-->>C: 200 emails[]
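The branching in the first diagram reduces to a small dispatch, sketched here without the FastAPI plumbing (the `db` method names follow the diagram; the fake class below is purely illustrative):

```python
def select_query(db, search=None, category_id=None):
    # Mirrors the sequence diagram: the most specific query wins.
    if search and category_id is not None:
        return db.search_emails_by_category(search, category_id)
    if search:
        return db.search_emails(search)
    if category_id is not None:
        return db.get_emails_by_category(category_id)
    return db.get_all_emails()

class FakeDB:
    """Stand-in recording which DatabaseManager method would be hit."""
    def search_emails_by_category(self, s, c): return f"by_cat:{s}:{c}"
    def search_emails(self, s): return f"search:{s}"
    def get_emails_by_category(self, c): return f"cat:{c}"
    def get_all_emails(self): return "all"

print(select_query(FakeDB(), search="inv", category_id=2))  # by_cat:inv:2
```
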
sequenceDiagram
  autonumber
  participant GH as GitHub
  participant WF as Dependabot Auto-Merge Workflow
  participant CI as CI Workflow

  GH-->>WF: PR event (opened/sync) by dependabot[bot]
  WF->>CI: Wait for check "test" to complete
  alt CI success
    WF->>GH: gh pr review --approve
    WF->>GH: gh pr merge --auto --merge
    alt Auto-merge already enabled
      WF-->>GH: log "already enabled"
    else Enabled now
      WF-->>GH: log "auto-merge enabled"
    end
  else CI failed/timeout
    WF-->>GH: Exit with error
  end

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120 minutes

Suggested labels

enhancement

Poem

A bunny taps keys with a dancer’s delight,
Node sails away; FastAPI takes flight.
New rules whisper, flake8 stands tall,
Emails and categories answer the call.
CI keeps watch through the starry night—
Hop, hop, ship it, the burrow feels right! 🐇✨

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

  • Title Check (⚠️ Warning)
    Explanation: The title “Bugfix/backend fixes and test suite stabilization” suggests a minor backend bugfix and test adjustments, but the changeset actually undertakes a major architectural restructuring—removing the Node/Express server, introducing a new Python FastAPI backend, adding CI workflows, new configuration files, and extensive renaming and deletion of modules—so the title is misleading and does not capture the primary scope.
    Resolution: Please update the pull request title to accurately reflect the main changes; for example: “Migrate server from Express/TypeScript to Python FastAPI and restructure project architecture,” or a similarly concise summary that highlights the core refactoring and new backend integration.

✅ Passed checks (2 passed)

  • Description Check (✅ Passed): Check skipped - CodeRabbit’s high-level summary is enabled.
  • Docstring Coverage (✅ Passed): No functions found in the changes. Docstring coverage check skipped.

Contributor

@sourcery-ai sourcery-ai bot left a comment


Sorry @MasumRab, your pull request is larger than the review limit of 150000 diff characters

@MasumRab MasumRab marked this pull request as ready for review September 28, 2025 12:53
Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 22

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (14)
backend/python_nlp/gmail_service.py (4)

120-129: Success path returns 'error' field for non‑JSON output

Returning {"success": True, "error": "..."} on success is misleading. Use a warning field.

-                    return {
-                        "success": True,
-                        "output": stdout_decoded,
-                        "error": f"Non-JSON output: {str(e)}",
-                    }  # Script success, but output not JSON
+                    return {
+                        "success": True,
+                        "output": stdout_decoded,
+                        "warning": f"Non-JSON output: {str(e)}",
+                    }

338-456: Robustness: guard optional fields in metadata

encryption_info/attachments/label structures may be None; .get() or iteration would raise.

-        analysis_metadata_payload.update(
+        enc_info = gmail_metadata.encryption_info or {}
+        thread_info = gmail_metadata.thread_info or {}
+        attachments = gmail_metadata.attachments or []
+        analysis_metadata_payload.update(
             {
-                "importance_markers": gmail_metadata.importance_markers,
-                "thread_info": gmail_metadata.thread_info,
-                "custom_headers": gmail_metadata.custom_headers,
+                "importance_markers": gmail_metadata.importance_markers,
+                "thread_info": thread_info,
+                "custom_headers": gmail_metadata.custom_headers,
                 "attachments_summary": [
-                    {"filename": att.get("filename"), "size": att.get("size")}
-                    for att in gmail_metadata.attachments
+                    {"filename": (att or {}).get("filename"), "size": (att or {}).get("size")}
+                    for att in attachments
                 ],
             }
         )
@@
-            "isEncrypted": gmail_metadata.encryption_info.get("tls_encrypted", False)
-            or gmail_metadata.encryption_info.get("end_to_end_encrypted", False),
-            "isSigned": gmail_metadata.encryption_info.get("signed", False),
+            "isEncrypted": enc_info.get("tls_encrypted", False) or enc_info.get("end_to_end_encrypted", False),
+            "isSigned": enc_info.get("signed", False),

566-634: Potential None dereferences when reading subject/labels

.subject and .label_ids may be None; .lower() and iteration would fail.

-        if metadata.category == "primary":
+        subject = metadata.subject or ""
+        subject_lower = subject.lower()
+        label_ids = metadata.label_ids or []
+        if metadata.category == "primary":
@@
-            if any(label in ["CATEGORY_PERSONAL"] for label in metadata.label_ids):
+            if any(label in ["CATEGORY_PERSONAL"] for label in label_ids):
@@
-            elif metadata.mailing_list or any(
-                word in metadata.subject.lower() for word in ["newsletter", "promotion", "offer"]
+            elif metadata.mailing_list or any(
+                word in subject_lower for word in ["newsletter", "promotion", "offer"]
             ):
                 return "promotions"
@@
-        subject_lower = metadata.subject.lower()
+        subject_lower = (metadata.subject or "").lower()

460-565: Broken method: undefined attributes (self.model_trainer, self.prompt_engineer)

Calling train_models_from_gmail_data will raise AttributeError. Either wire required dependencies or deprecate/remove this method.

-    async def train_models_from_gmail_data(
-        self, training_query: str = "newer_than:30d", max_training_emails: int = 5000
-    ) -> Dict[str, Any]:
-        self.logger.info(
-            f"Starting model training from Gmail data. Query: {training_query}, Max emails: {max_training_emails}"
-        )
-        try:
-            ...
-            return {
-                "success": True,
-                "training_samples_count": len(training_samples),
-                "models_trained": training_results,
-                "training_timestamp": datetime.now().isoformat(),
-            }
-        except Exception as e:
-            self.logger.error(f"Model training failed: {e}", exc_info=True)
-            return {"success": False, "error": str(e), "training_samples_count": 0}
+    async def train_models_from_gmail_data(self, *args, **kwargs) -> Dict[str, Any]:
+        # Temporarily disabled until model_trainer/prompt_engineer are reintroduced
+        self.logger.warning("train_models_from_gmail_data is not available in this refactor.")
+        raise NotImplementedError("Training pipeline is not part of the current service.")
backend/python_nlp/tests/analysis_components/test_sentiment_model.py (1)

41-44: Fix the patched module path after the import relocation.

Line 8 now imports SentimentModel from backend..., but the three patch() blocks (Line 41 onward) still target server.... unittest.mock.patch will raise ModuleNotFoundError, so these tests will crash before their assertions run. Point the patches at the relocated module path.

         with patch(
-            "server.python_nlp.analysis_components.sentiment_model.TextBlob",
+            "backend.python_nlp.analysis_components.sentiment_model.TextBlob",
             return_value=mock_textblob_instance,
         ) as mock_textblob_class:
...
         with patch(
-            "server.python_nlp.analysis_components.sentiment_model.TextBlob",
+            "backend.python_nlp.analysis_components.sentiment_model.TextBlob",
             side_effect=Exception("TextBlob error"),
         ):
...
         with patch(
-            "server.python_nlp.analysis_components.sentiment_model.TextBlob",
+            "backend.python_nlp.analysis_components.sentiment_model.TextBlob",
             return_value=mock_textblob_instance,
         ) as mock_textblob_class:

Also applies to: 57-59, 73-75

backend/python_backend/run_server.py (1)

44-52: Multi‑process + JSON files = corruption risk

With a file‑backed JSON “DB”, multiple workers can interleave writes and corrupt data. Run a single worker until a real DB or file locking is introduced.

-    config = {
+    is_dev = os.getenv("NODE_ENV") == "development"
+    use_json_db = os.getenv("USE_JSON_DB", "1") == "1"  # default: file-backed storage
+    config = {
         "host": host,
         "port": port,
         "log_level": "info",
         "access_log": True,
-        "reload": os.getenv("NODE_ENV") == "development",
-        "workers": 1 if os.getenv("NODE_ENV") == "development" else 4,
+        "reload": is_dev,
+        # Keep single worker when using JSON files to avoid race conditions/corruption
+        "workers": 1 if (use_json_db or is_dev) else 4,
     }
backend/python_backend/gmail_routes.py (2)

100-108: Redact Gmail error payloads to avoid logging PII and oversized blobs.

full_gmail_error can include message bodies and addresses. Log only minimal fields.

-        log_data = {
+        log_data = {
             "message": "Gmail API operation failed during sync",
             "endpoint": str(req.url),
             "error_type": type(gmail_err).__name__,
-            "error_detail": error_detail_message,
+            "error_detail": error_detail_message[:512],  # cap length
             "gmail_status_code": getattr(gmail_err.resp, "status", None),
-            "full_gmail_error": error_details_dict,
+            "gmail_error_summary": {
+                "code": (error_content.get("code") if isinstance(error_content, dict) else None),
+                "message_present": bool(error_detail_message),
+            },
         }
-        log_data = {
+        log_data = {
             "message": "Gmail API operation failed during smart retrieval",
             "endpoint": str(req.url),
             "error_type": type(gmail_err).__name__,
-            "error_detail": error_detail_message,
+            "error_detail": error_detail_message[:512],
             "gmail_status_code": getattr(gmail_err.resp, "status", None),
-            "full_gmail_error": error_details_dict,
+            "gmail_error_summary": {
+                "code": (error_content.get("code") if isinstance(error_content, dict) else None),
+                "message_present": bool(error_detail_message),
+            },
         }

Also applies to: 162-170


21-27: Remove module‐level instantiation; inject GmailAIService via Depends
Module‐level DatabaseManager() bypasses the async initialize() in get_db (database.py lines 623–629), leading to uninitialized state and flaky tests. In backend/python_backend/gmail_routes.py (21–27, 30–36, 137–146), replace:

-db_manager_for_gmail_service = DatabaseManager()
-ai_engine_for_gmail_service = AdvancedAIEngine()
-gmail_service = GmailAIService(
-    db_manager=db_manager_for_gmail_service,
-    advanced_ai_engine=ai_engine_for_gmail_service,
-)

with per‐request wiring, e.g.:

from fastapi import Depends
from .database import get_db

async def get_gmail_service(
    db = Depends(get_db),
):
    return GmailAIService(db_manager=db, advanced_ai_engine=AdvancedAIEngine())

@router.post("/sync")
async def sync_gmail(
    req: Request,
    request_model: GmailSyncRequest,
    background_tasks: BackgroundTasks,
    gmail_service: GmailAIService = Depends(get_gmail_service),
):
    ...

This ensures initialize() is awaited and each request gets an isolated, fully initialized service.

backend/python_backend/email_routes.py (1)

5-12: Guard psycopg2 import to keep tests/envs without Postgres from crashing.

Importing psycopg2 at module import-time will fail in lightweight test runners. Guard it and catch a local alias.

-import psycopg2
+try:
+    import psycopg2
+    PsycopgError = psycopg2.Error
+except Exception:  # psycopg2 unavailable
+    class PsycopgError(Exception):
+        pass
-    except psycopg2.Error as db_err:
+    except PsycopgError as db_err:
-    except psycopg2.Error as db_err:
+    except PsycopgError as db_err:
-    except psycopg2.Error as db_err:
+    except PsycopgError as db_err:
-    except psycopg2.Error as db_err:
+    except PsycopgError as db_err:

Also applies to: 50-60, 88-98, 154-164, 200-208

launch.py (2)

334-336: Venv Python check contradicts widened support (forces 3.11.x).

You allow 3.11–3.12 globally but here you treat any venv not exactly 3.11 as “incompatible” and prompt to recreate with 3.11.x. This will incorrectly flag perfectly fine 3.12 venvs and lead to avoidable churn.

Apply this diff to accept any interpreter within [PYTHON_MIN_VERSION, PYTHON_MAX_VERSION] and align prompts:

-                            target_major, target_minor = PYTHON_MIN_VERSION
-                            if not (venv_major == target_major and venv_minor == target_minor):
+                            min_major, min_minor = PYTHON_MIN_VERSION
+                            max_major, max_minor = PYTHON_MAX_VERSION
+                            if (venv_major, venv_minor) < (min_major, min_minor) or (venv_major, venv_minor) > (max_major, max_minor):
                                 logger.warning(
-                                    f"WARNING: The existing virtual environment at './{VENV_DIR}' was created with Python {venv_major}.{venv_minor}. "
-                                    f"This project requires Python {target_major}.{target_minor}."
+                                    f"WARNING: The existing virtual environment at './{VENV_DIR}' was created with Python {venv_major}.{venv_minor}. "
+                                    f"This project supports Python {min_major}.{min_minor}–{max_major}.{max_minor}."
                                 )
...
-                                            "Do you want to delete and recreate the virtual environment with "
-                                            f"Python {target_major}.{target_minor}? (yes/no): "
+                                            "Do you want to delete and recreate the virtual environment with a supported Python version "
+                                            f"({min_major}.{min_minor}–{max_major}.{max_minor})? (yes/no): "
                                             )

And fix the earlier corrupted‑venv prompt:

-                            f"It might be corrupted. Do you want to delete and recreate it with Python 3.11.x? (yes/no): "
+                            f"It might be corrupted. Do you want to delete and recreate it with a supported Python ({PYTHON_MIN_VERSION[0]}.{PYTHON_MIN_VERSION[1]}–{PYTHON_MAX_VERSION[0]}.{PYTHON_MAX_VERSION[1]})? (yes/no): "

Also applies to: 393-449


745-748: Respect --api-url when starting the frontend.

VITE_API_URL always points to host:port even if --api-url is provided.

Apply this diff:

-    env = os.environ.copy()
-    env["VITE_API_URL"] = f"http://{args.host}:{args.port}"  # Backend URL for Vite
+    env = os.environ.copy()
+    env["VITE_API_URL"] = args.api_url or f"http://{args.host}:{args.port}"  # Backend URL for Vite
backend/python_backend/ai_engine.py (2)

215-220: Map status to "healthy" (not "ok") to satisfy ServiceHealth model.

ServiceHealth.status only allows healthy|degraded|unhealthy; returning ok will fail validation downstream.

Apply this diff:

-            status = "ok"
+            status = "healthy"
             if not all_models_loaded:
                 status = "degraded"
             if not nltk_available or not sklearn_available:
                 status = "degraded"  # Or "unhealthy" depending on severity

91-101: Harden category matching against non-string entries.

Guard against None or non-str items from NLPEngine to avoid AttributeError on lower().

Apply this diff:

-            for ai_cat_str in ai_categories:
-                for db_cat in all_db_categories:
-                    name_lower = db_cat["name"].lower()
-                    ai_cat_lower = ai_cat_str.lower()
+            for ai_cat_str in ai_categories:
+                if not isinstance(ai_cat_str, str) or not ai_cat_str:
+                    continue
+                ai_cat_lower = ai_cat_str.lower()
+                for db_cat in all_db_categories:
+                    name = db_cat.get("name")
+                    if not isinstance(name, str):
+                        continue
+                    name_lower = name.lower()
                     if name_lower in ai_cat_lower or ai_cat_lower in name_lower:
                         log_msg = (
                             f"Matched AI category '{ai_cat_str}' to DB "
-                            f"category '{db_cat['name']}' (ID: {db_cat['id']})"
+                            f"category '{name}' (ID: {db_cat.get('id')})"
                         )
                         logger.info(log_msg)
-                        return db_cat["id"]
+                        return db_cat.get("id")
backend/python_backend/models.py (1)

81-98: Fix EmailResponse parsing from DB (snake_case → camelCase).

Current model cannot parse DB records (message_id, category_id, etc.), causing ValidationError in email_routes.create_email. Map validation aliases to snake_case keys.

Apply this diff:

 class EmailResponse(EmailBase):
     id: int
-    messageId: Optional[str]
-    threadId: Optional[str]
+    messageId: Optional[str] = Field(validation_alias="message_id")
+    threadId: Optional[str] = Field(validation_alias="thread_id")
     preview: str
     category: Optional[str]
-    categoryId: Optional[int]
+    categoryId: Optional[int] = Field(validation_alias="category_id")
     labels: List[str]
     confidence: int = Field(ge=0, le=100)
-    isImportant: bool
-    isStarred: bool
-    isUnread: bool
-    hasAttachments: bool
-    attachmentCount: int
-    sizeEstimate: int
-    aiAnalysis: Dict[str, Any] = Field(default_factory=dict)
+    isImportant: bool = Field(validation_alias="is_important")
+    isStarred: bool = Field(validation_alias="is_starred")
+    isUnread: bool = Field(validation_alias="is_unread")
+    hasAttachments: bool = Field(validation_alias="has_attachments")
+    attachmentCount: int = Field(validation_alias="attachment_count")
+    sizeEstimate: int = Field(validation_alias="size_estimate")
+    aiAnalysis: Dict[str, Any] = Field(default_factory=dict, validation_alias="analysis_metadata")
     filterResults: Dict[str, Any] = Field(default_factory=dict)
🧹 Nitpick comments (42)
client/src/index.css (2)

25-25: Token change: check component rounding consistency

Changing --radius to 0.375rem subtly alters all components using this token. Verify buttons, inputs, and menus still match the design system and Tailwind rounded-* utilities if mapped to this var.


64-68: Global 14px body font can hurt readability/accessibility

A 14px base is small; prefer 16px (1rem) or a responsive clamp. Example:

-    font-size: 14px;
+    font-size: 1rem; /* or: clamp(0.9375rem, 0.9vw + 0.6rem, 1rem) */
pyproject.toml (3)

10-10: psycopg2-binary in prod: confirm suitability

psycopg2-binary is convenient, but its bundled libpq/libssl can conflict with other native libraries in long-lived production deployments, which is why upstream discourages it there. Consider plain psycopg2 built against the system libpq, or document why the binary wheel is acceptable for your deployment.


14-15: Align and harden uvicorn dependency

  • Avoid duplicating uvicorn in both runtime and dev groups; keep one source of truth.
  • Consider uvicorn[standard] for production and align to >=0.35.0 if compatible with Python 3.11.
-    "uvicorn>=0.34.3",
+    "uvicorn[standard]>=0.35.0",

Based on learnings

Also applies to: 41-42


6-15: Prefer pinning/constraints for reproducible builds

Wide >= ranges can cause unexpected CI drift. Add a constraints/lock (e.g., requirements.lock/uv pip compile) or pin critical infra deps (fastapi, uvicorn, httpx).

backend/python_nlp/gmail_service.py (3)

53-61: DB manager constructed but never initialized

DatabaseManager often needs initialize() before use. Consider an explicit async initializer for the service.

 class GmailAIService:
@@
-        self.db_manager = db_manager
+        self.db_manager = db_manager
@@
-            self.db_manager = DatabaseManager()
+            self.db_manager = DatabaseManager()
+
+    async def initialize(self) -> None:
+        # Call this after constructing the service
+        try:
+            if hasattr(self.advanced_ai_engine, "initialize"):
+                self.advanced_ai_engine.initialize()  # sync per engine summary
+            if hasattr(self.db_manager, "initialize"):
+                await self.db_manager.initialize()
+        except Exception:
+            self.logger.exception("Service initialization failed")

85-95: Consider subprocess timeout to avoid hangs

Wrap communicate() with asyncio.wait_for and surface timeout errors.

-            stdout, stderr = await process.communicate()
+            try:
+                stdout, stderr = await asyncio.wait_for(process.communicate(), timeout=300)
+            except asyncio.TimeoutError:
+                process.kill()
+                await process.communicate()
+                return {"success": False, "error": "Command timed out", "return_code": None}

401-404: Optional: extract clean sender email

senderEmail currently mirrors the raw From header; consider parsing out the address part with the stdlib email.utils.parseaddr instead of splitting on "<" by hand (add `from email.utils import parseaddr` to the imports):

-            "senderEmail": gmail_metadata.from_address,
+            "senderEmail": parseaddr(gmail_metadata.from_address)[1] or gmail_metadata.from_address,
.pylintrc (1)

1-20: Reasonable baseline; keep an eye on disabled checks

Good starting point. Consider re-enabling R0913 (too-many-arguments) later to curb API bloat as the FastAPI surface grows.

.continue/models/new-model.yaml (1)

5-11: Tooling config: keep it out of runtime packaging

Ensure this Continue config isn’t included in production builds/containers and secrets are injected only via CI. Confirm anthropic client is not an app dependency.

backend/python_nlp/ai_training.py (1)

6-20: Prefer default_factory over manual post-init dict

We can drop the custom __post_init__ and let the dataclass build a fresh dict for each instance with field(default_factory=dict), which is the idiomatic pattern and removes the branch entirely.

-from dataclasses import dataclass
+from dataclasses import dataclass, field
@@
-    parameters: Dict[str, Any] = None
+    parameters: Dict[str, Any] = field(default_factory=dict)
@@
-    
-    def __post_init__(self):
-        if self.parameters is None:
-            self.parameters = {}
run.py (2)

7-7: Avoid sys.path mutation or at least prepend safely

Appending can cause shadowing/duplication. Prefer insert(0) with a guard, or remove entirely by relying on proper packaging.

-# Add the current directory to the path to ensure modules can be found
-sys.path.append(str(Path(__file__).parent))
+# Add this file's directory to sys.path (prepend) only if missing
+p = str(Path(__file__).resolve().parent)
+if p not in sys.path:
+    sys.path.insert(0, p)

11-12: Avoid drift with run_server.py

run.py omits startup initialization used in backend/python_backend/run_server.py (database init, logging). Consider deleting run.py or delegating to run_server.py for consistency.

backend/data/categories.json (1)

1-37: Static seed looks good; consider adding stable slugs

Names work, but downstream matching currently relies on substring comparisons. Adding a stable slug per category can prevent ambiguous matches and ease i18n.
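With slugs, matching becomes an exact lookup instead of bidirectional substring tests. A minimal sketch — the `slug` field and the `match_category` helper are illustrative, not existing code:

```python
# Hypothetical slug-based lookup; "slug" and match_category() do not
# exist in the current codebase.
import re
from typing import Dict, List, Optional


def slugify(name: str) -> str:
    """Lowercase and collapse non-alphanumerics into single hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")


def match_category(ai_category: str, categories: List[Dict]) -> Optional[int]:
    """Return the id of the category whose slug equals the AI label's slug."""
    wanted = slugify(ai_category)
    for cat in categories:
        # Fall back to a derived slug if the seed data has no "slug" field yet.
        if cat.get("slug", slugify(cat.get("name", ""))) == wanted:
            return cat.get("id")
    return None
```

Exact slug equality avoids the false positives substring checks produce (e.g. "Work" matching "Network").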

backend/python_backend/tests/test_ai_engine.py (1)

21-40: Tighten patching and remove redundant rebinding

Use autospec to keep the method signature honest and avoid reassigning the instance attribute; the class patch already covers the instance.

-    with patch.object(NLPEngine, "analyze_email") as mock_nlp_analyze:
+    with patch.object(NLPEngine, "analyze_email", autospec=True) as mock_nlp_analyze:
         # Configure the mock for NLPEngine().analyze_email
         mock_nlp_analyze.return_value = {
@@
         }
         engine = AdvancedAIEngine()
-        # Store the mock for assertions if needed directly on nlp_engine's mock
-        engine.nlp_engine.analyze_email = mock_nlp_analyze
         yield engine
backend/python_backend/run_server.py (3)

41-43: Allow HOST override

Minor: make host configurable via HOST env for container friendliness.

-    host = "0.0.0.0"
+    host = os.getenv("HOST", "0.0.0.0")

23-37: Confirm db instance wiring

Startup creates a DatabaseManager but doesn’t store it on app.state. If routes rely on a different/global instance, this is fine; otherwise wire it: app.state.db = db.


59-59: Production extras for Uvicorn

When deploying, prefer installing uvicorn[standard] for better performance (uvloop, httptools). Based on learnings.

backend/python_backend/tests/test_category_routes.py (2)

29-35: Use an async override for get_db to match the dependency’s async signature.

Prevents subtle sync/async mismatches and mirrors the real dependency behavior.

-    app.dependency_overrides[get_db] = lambda: mock_db_manager_cat
+    async def _override_db():
+        return mock_db_manager_cat
+    app.dependency_overrides[get_db] = _override_db

17-19: Remove unused mock_performance_monitor_cat_instance.

It’s never referenced.

-# Mock PerformanceMonitor
-mock_performance_monitor_cat_instance = MagicMock()
backend/python_backend/email_routes.py (1)

127-136: Make analysisMetadata extraction resilient to different AI result shapes.

Avoid attribute errors if analyze_email returns a dict/Pydantic model.

-        email_data.update(
-            {
-                "confidence": int(ai_analysis.confidence * 100),
-                "categoryId": ai_analysis.category_id,
-                "labels": ai_analysis.suggested_labels,
-                "analysisMetadata": ai_analysis.to_dict(), # Assuming AIAnalysisResult has to_dict, or use model_dump if Pydantic
-            }
-        )
+        def _field(name, default=None):
+            """Read a field from either a dict or an object-shaped result."""
+            if isinstance(ai_analysis, dict):
+                return ai_analysis.get(name, default)
+            return getattr(ai_analysis, name, default)
+
+        if hasattr(ai_analysis, "to_dict"):
+            analysis_metadata = ai_analysis.to_dict()
+        elif hasattr(ai_analysis, "model_dump"):  # Pydantic v2 model
+            analysis_metadata = ai_analysis.model_dump()
+        elif isinstance(ai_analysis, dict):
+            analysis_metadata = dict(ai_analysis)
+        else:
+            analysis_metadata = {}
+
+        email_data.update(
+            {
+                "confidence": int(_field("confidence", 0.5) * 100),
+                "categoryId": _field("category_id"),
+                "labels": _field("suggested_labels", []),
+                "analysisMetadata": analysis_metadata,
+            }
+        )
backend/python_backend/main.py (3)

48-57: Fix CORS for wildcard subdomains.

allow_origins doesn’t support patterns. Use allow_origin_regex for *.replit.dev.

 app.add_middleware(
     CORSMiddleware,
-    allow_origins=[
+    allow_origins=[
         "http://localhost:5000",
         "http://localhost:5173",
-        "https://*.replit.dev",
     ],
+    allow_origin_regex=r"^https://.*\.replit\.dev$",
     allow_credentials=True,
     allow_methods=["*"],
     allow_headers=["*"],
 )

89-94: Silence linter: unused full_path.

Rename param since it’s not used.

-@app.get("/{full_path:path}")
-async def catch_all(full_path: str):
+@app.get("/{full_path:path}")
+async def catch_all(_: str):

135-136: Prefer uvicorn.run(app, …) or correct dotted path.

"main:app" may fail when run outside module root. Running the object avoids import path issues.

-    uvicorn.run("main:app", host="0.0.0.0", port=port, reload=True, log_level="info")
+    uvicorn.run(app, host="0.0.0.0", port=port, reload=True, log_level="info")
backend/python_nlp/nlp_engine.py (2)

21-21: Use a relative import for package resilience.

Prevents failures when the project isn’t installed as “backend” package.

-from backend.python_nlp.text_utils import clean_text
+from .text_utils import clean_text

719-721: Mark unused parameter to satisfy linters.

Keep signature but underscore the arg.

-    def _analyze_action_items(self, text: str) -> List[Dict[str, Any]]:
+    def _analyze_action_items(self, _: str) -> List[Dict[str, Any]]:
launch.py (5)

1182-1187: Remove stale --gradio-ui argument (feature removed; help text is misleading).

The flag is kept but does nothing and its help duplicates --api-only. Remove to avoid confusion.

Apply this diff:

-    parser.add_argument(
-        "--gradio-ui",
-        action="store_true",
-        help="Run only the API server without the frontend", # Description kept, but --gradio-ui removed
-    )
-    # Gradio UI argument removed
+    # --gradio-ui removed

694-706: Bail out early if npm is missing to avoid noisy failures.

You log the absence of npm but proceed to run npm commands that will fail later.

Apply this diff:

         if npm_executable_path is None:
             logger.error(
                 f"The 'npm' command was not found in your system's PATH. "
                 f"Please ensure Node.js and npm are correctly installed and that the npm installation directory is added to your PATH environment variable. "
                 f"Attempted to find 'npm' for the client in: {client_dir}"
             )
-            # Potentially return None here if npm is essential and not found,
-            # or let it proceed to fail at the npm install line, which will now be more informed.
-            # For now, let's log and let it try, as the original code attempts to continue.
-            # If we want to stop it here, uncomment the next line:
-            # return None
+            return None
         else:
             logger.info(f"Found 'npm' executable at: {npm_executable_path}")
@@
     try:
-        logger.info(f"Running frontend command: {' '.join(cmd)} in {str(ROOT_DIR / 'client')}")
+        logger.info(f"Running frontend command: {' '.join(cmd)} in {str(ROOT_DIR / 'client')}")
         process = subprocess.Popen(cmd, cwd=str(ROOT_DIR / "client"), env=env)

Also applies to: 709-729, 755-759


19-20: Update usage doc to match supported stages.

Docstring advertises {dev,test,staging,prod} but argparse only allows ["dev","test"].

Apply this diff:

-    --stage {dev,test,staging,prod}  Specify the application stage to run
+    --stage {dev,test}               Specify the application stage to run

1269-1279: Align interpreter‑discovery comments/logs with supported range.

Comment still says “Ensure 3.11.x”; log now reflects 3.11–3.12. Make the intent consistent.

Apply this diff:

-    # Goal: Ensure launch.py runs with Python 3.11.x
+    # Goal: Ensure launch.py runs with a supported Python in [PYTHON_MIN_VERSION, PYTHON_MAX_VERSION]

690-733: Optional: Skip npm install when package.json is present but lockfile unchanged.

Consider a fast path: run npm ci when lockfile exists; or skip install if node_modules cache is valid. This speeds up local/dev runs.

backend/python_backend/performance_monitor.py (2)

17-20: Make metrics thread‑safe and accumulate values; remove unused start_times.

Current dict overwrites on repeated measurements and is not concurrency‑safe under ASGI. Use a lock and store lists of samples; drop unused start_times.

Apply this diff:

-from typing import Dict, Any
+from typing import Dict, Any, List
+from threading import RLock
+from copy import deepcopy
@@
 class PerformanceMonitor:
     """Monitor and log performance metrics for the application."""
     
     def __init__(self):
-        self.metrics: Dict[str, Any] = {}
-        self.start_times: Dict[str, float] = {}
+        self.metrics: Dict[str, List[Any]] = {}
+        self._lock = RLock()
@@
     def record_metric(self, name: str, value: Any):
         """Record a performance metric."""
-        self.metrics[name] = value
-        logger.debug(f"Performance metric recorded: {name} = {value}")
+        with self._lock:
+            self.metrics.setdefault(name, []).append(value)
+        logger.debug(f"Performance metric recorded: {name} += {value}")
@@
     def get_metrics(self) -> Dict[str, Any]:
         """Get all recorded metrics."""
-        return self.metrics.copy()
+        with self._lock:
+            return deepcopy(self.metrics)
@@
     def clear_metrics(self):
         """Clear all recorded metrics."""
-        self.metrics.clear()
-        self.start_times.clear()
+        with self._lock:
+            self.metrics.clear()

Also applies to: 21-35, 36-43


21-30: Optional: Expose summary helpers (count/avg/p95).

If these metrics feed endpoints, consider computed summaries to avoid large arrays in responses.
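For example, a small nearest-rank summary over list-valued samples (illustrative only; `summarize` is not part of the current PerformanceMonitor API):

```python
# Illustrative summary helper for list-valued metrics.
from typing import Dict, List


def summarize(samples: List[float]) -> Dict[str, float]:
    """Return count, mean, and nearest-rank p95 for numeric samples."""
    if not samples:
        return {"count": 0, "avg": 0.0, "p95": 0.0}
    ordered = sorted(samples)
    # Nearest-rank p95: the ceil(0.95 * n)-th smallest sample (1-based).
    rank = max(1, -(-len(ordered) * 95 // 100))  # ceil without math.ceil
    return {
        "count": len(ordered),
        "avg": sum(ordered) / len(ordered),
        "p95": ordered[rank - 1],
    }
```

Endpoints can then return the summary dict instead of shipping the raw sample arrays.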

backend/python_backend/tests/test_gmail_routes.py (1)

51-80: Assert parameter mapping for sync_gmail to catch regressions.

Validate that camelCase request fields map to service kwargs.

Apply this diff:

     response = client_gmail.post("/api/gmail/sync", json=request_payload)
@@
     mock_gmail_service_instance.sync_gmail_emails.assert_called_once()
+    # Verify arg mapping
+    args, kwargs = mock_gmail_service_instance.sync_gmail_emails.call_args
+    assert kwargs.get("query_filter") == "test-query"
+    assert kwargs.get("max_emails") == 100
backend/python_backend/ai_engine.py (3)

123-125: Normalize AI categories before matching.

Pre-filter to non-empty strings to reduce noise and exceptions.

Apply this diff:

-            ai_categories = analysis_data.get("categories")
-            if db and ai_categories:
+            ai_categories = [
+                c for c in analysis_data.get("categories", [])
+                if isinstance(c, str) and c.strip()
+            ]
+            if db and ai_categories:

107-107: Use logger.exception for caught exceptions.

Improves traceback visibility; aligns with TRY400.

Based on static analysis hints

Apply this diff:

-            logger.error(f"Error during category matching: {e}", exc_info=True)
+            logger.exception(f"Error during category matching: {e}")
-            logger.error(f"An unexpected error occurred during AI analysis: {e}", exc_info=True)
+            logger.exception(f"An unexpected error occurred during AI analysis: {e}")
-            logger.error(f"AI health check failed during direct inspection: {e}", exc_info=True)
+            logger.exception(f"AI health check failed during direct inspection: {e}")
-                    except OSError as e:
-                        err_msg = f"Error removing temp file {temp_file} " f"during cleanup: {e}"
-                        logger.error(err_msg)
+                    except OSError as e:
+                        logger.exception(f"Error removing temp file during cleanup: {temp_file}")
-        except Exception as e:
-            logger.error(f"AI Engine cleanup failed: {e}")
+        except Exception as e:
+            logger.exception("AI Engine cleanup failed")
-            logger.error(f"Error generating fallback analysis itself: {e}", exc_info=True)
+            logger.exception(f"Error generating fallback analysis itself: {e}")

Also applies to: 140-140, 229-229, 254-254, 259-259, 305-305


278-303: Private API usage for fallback.

Calling NLPEngine._get_simple_fallback_analysis uses a private method; low risk but brittle to internal changes. Consider exposing a public simple_fallback(...) API in NLPEngine.
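A thin public wrapper keeps the private helper free to change. Sketch only — the method bodies are stand-ins, not NLPEngine's real internals:

```python
# Stand-in NLPEngine showing the public-wrapper pattern; the fallback
# body here is fabricated for illustration.
class NLPEngine:
    def _get_simple_fallback_analysis(self, text: str) -> dict:
        # Private implementation detail; free to change shape internally.
        return {"sentiment": "neutral", "confidence": 0.5, "source": "fallback"}

    def simple_fallback(self, text: str) -> dict:
        """Public, stable entry point that delegates to the private helper."""
        return self._get_simple_fallback_analysis(text)
```

Callers in ai_engine.py would then depend only on `simple_fallback`, which can be kept stable across internal refactors.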

backend/python_backend/models.py (1)

10-10: Type-hint EmailCreate preview validator for clarity.

Minor hygiene; also import FieldValidationInfo.

Apply this diff:

-from pydantic import BaseModel, Field, field_validator, ConfigDict
+from pydantic import BaseModel, Field, field_validator, ConfigDict, FieldValidationInfo
-    def set_preview(cls, v, info):
-        if not v and info.data and "content" in info.data:
+    def set_preview(cls, v: Optional[str], info: FieldValidationInfo) -> Optional[str]:
+        if not v and info.data and "content" in info.data:
             content = info.data["content"]
             return (
                 content[:200] + "..."
                 if len(content) > 200
                 else content
             )
         return v

Also applies to: 57-67

backend/python_backend/database.py (4)

84-85: Specify UTF‑8 encoding for JSON I/O.

Prevents locale-dependent behavior.

Apply this diff:

-                    with open(file_path, 'r') as f:
+                    with open(file_path, 'r', encoding='utf-8') as f:
                         data = await asyncio.to_thread(json.load, f)
-            with open(file_path, 'w') as f:
+            with open(file_path, 'w', encoding='utf-8') as f:
                 await asyncio.to_thread(json.dump, data_to_save, f, indent=4)

Also applies to: 116-117


193-195: Replace unnecessary dict comprehension.

Minor cleanup flagged by Pylint R1721.

Based on static analysis hints

Apply this diff:

-            update_payload = {k: v for k, v in email_data.items()}
+            update_payload = dict(email_data)

331-347: Sorting on ISO strings is okay; add defensive parse if mixed formats appear.

If you see heterogeneous time formats, consider parsing to datetime for consistent ordering; keep current fallback for performance.

Also applies to: 453-467, 500-514
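If mixed formats do appear, a defensive key function keeps the sort total without raising. A sketch — the `time` field name is an assumption about the stored email records:

```python
# Defensive sort key for possibly mixed or missing timestamp strings.
from datetime import datetime, timezone

EPOCH = datetime.min.replace(tzinfo=timezone.utc)  # sentinel for unparseable values


def sort_key(record: dict) -> datetime:
    """Parse ISO-8601 strings; fall back to the epoch sentinel on failure."""
    raw = record.get("time")
    if not isinstance(raw, str):
        return EPOCH
    try:
        # fromisoformat() rejects a trailing "Z" before Python 3.11.
        parsed = datetime.fromisoformat(raw.replace("Z", "+00:00"))
    except ValueError:
        return EPOCH
    if parsed.tzinfo is None:  # make naive values comparable with aware ones
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


emails = [
    {"time": "2025-09-28T10:00:00Z"},
    {"time": "bad"},
    {"time": "2025-09-27 08:30:00"},
]
newest_first = sorted(emails, key=sort_key, reverse=True)
```

Unparseable records sink to the end of a newest-first listing instead of crashing the query.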


623-629: Singleton init: ensure idempotence under concurrent startup.

Low risk, but two coroutines could hit _db_manager_instance is None before assignment. Consider an asyncio.Lock if startup races are observed.

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

Disabled knowledge base sources:

  • Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 99a114d and 9199144.

⛔ Files ignored due to path filters (9)
  • backend/email_cache.db is excluded by !**/*.db
  • backend/python_nlp/intent_model.pkl is excluded by !**/*.pkl
  • backend/python_nlp/sentiment_model.pkl is excluded by !**/*.pkl
  • backend/python_nlp/sync_checkpoints.db is excluded by !**/*.db
  • backend/python_nlp/topic_model.pkl is excluded by !**/*.pkl
  • backend/python_nlp/urgency_model.pkl is excluded by !**/*.pkl
  • backend/smart_filters.db is excluded by !**/*.db
  • package-lock.json is excluded by !**/package-lock.json
  • uv.lock is excluded by !**/*.lock
📒 Files selected for processing (77)
  • .continue/models/new-model.yaml (1 hunks)
  • .continue/prompts/new-prompt.yaml (1 hunks)
  • .continue/rules/new-rule.yaml (1 hunks)
  • .flake8 (1 hunks)
  • .github/workflows/README.md (1 hunks)
  • .github/workflows/ci.yml (1 hunks)
  • .github/workflows/dependabot-auto-merge.yml (1 hunks)
  • .gitignore (1 hunks)
  • .openhands/microagents/repo.md (1 hunks)
  • .pylintrc (1 hunks)
  • README.md (1 hunks)
  • backend/__init__.py (1 hunks)
  • backend/data/categories.json (1 hunks)
  • backend/data/emails.json (1 hunks)
  • backend/data/users.json (1 hunks)
  • backend/extensions/example/example.py (3 hunks)
  • backend/python_backend/__init__.py (1 hunks)
  • backend/python_backend/ai_engine.py (7 hunks)
  • backend/python_backend/database.py (13 hunks)
  • backend/python_backend/email_routes.py (2 hunks)
  • backend/python_backend/filter_routes.py (4 hunks)
  • backend/python_backend/gmail_routes.py (3 hunks)
  • backend/python_backend/main.py (2 hunks)
  • backend/python_backend/models.py (16 hunks)
  • backend/python_backend/performance_monitor.py (1 hunks)
  • backend/python_backend/run_server.py (1 hunks)
  • backend/python_backend/tests/test_ai_engine.py (2 hunks)
  • backend/python_backend/tests/test_category_routes.py (2 hunks)
  • backend/python_backend/tests/test_email_routes.py (9 hunks)
  • backend/python_backend/tests/test_filter_routes.py (4 hunks)
  • backend/python_backend/tests/test_gmail_routes.py (1 hunks)
  • backend/python_nlp/ai_training.py (1 hunks)
  • backend/python_nlp/gmail_service.py (2 hunks)
  • backend/python_nlp/nlp_engine.py (7 hunks)
  • backend/python_nlp/smart_retrieval.py (2 hunks)
  • backend/python_nlp/tests/analysis_components/test_intent_model.py (1 hunks)
  • backend/python_nlp/tests/analysis_components/test_sentiment_model.py (1 hunks)
  • backend/python_nlp/tests/analysis_components/test_topic_model.py (1 hunks)
  • backend/python_nlp/tests/analysis_components/test_urgency_model.py (1 hunks)
  • client/package.json (0 hunks)
  • client/src/index.css (2 hunks)
  • codebuff.json (1 hunks)
  • diagnosis_message.txt (1 hunks)
  • drizzle.config.ts (0 hunks)
  • knowledge.md (1 hunks)
  • launch.py (17 hunks)
  • package.json (0 hunks)
  • postcss.config.js (0 hunks)
  • pyproject.toml (2 hunks)
  • run.py (1 hunks)
  • server/README.md (0 hunks)
  • server/activityRoutes.test.ts (0 hunks)
  • server/activityRoutes.ts (0 hunks)
  • server/ai-engine.ts (0 hunks)
  • server/aiRoutes.test.ts (0 hunks)
  • server/aiRoutes.ts (0 hunks)
  • server/categoryRoutes.test.ts (0 hunks)
  • server/categoryRoutes.ts (0 hunks)
  • server/dashboardRoutes.test.ts (0 hunks)
  • server/dashboardRoutes.ts (0 hunks)
  • server/emailRoutes.test.ts (0 hunks)
  • server/emailRoutes.ts (0 hunks)
  • server/gmail-ai-service.ts (0 hunks)
  • server/gmailRoutes.test.ts (0 hunks)
  • server/gmailRoutes.ts (0 hunks)
  • server/index.ts (0 hunks)
  • server/init-db.ts (0 hunks)
  • server/performanceRoutes.ts (0 hunks)
  • server/python-bridge.ts (0 hunks)
  • server/python_backend/tests/test_gmail_routes.py (0 hunks)
  • server/routes.ts (0 hunks)
  • server/storage.ts (0 hunks)
  • server/vite.ts (0 hunks)
  • setup.js (0 hunks)
  • tailwind.config.ts (0 hunks)
  • tsconfig.json (0 hunks)
  • vite.config.ts (0 hunks)
💤 Files with no reviewable changes (31)
  • postcss.config.js
  • server/categoryRoutes.ts
  • server/dashboardRoutes.ts
  • package.json
  • server/performanceRoutes.ts
  • server/gmail-ai-service.ts
  • server/gmailRoutes.ts
  • server/README.md
  • server/vite.ts
  • server/init-db.ts
  • tsconfig.json
  • server/aiRoutes.ts
  • server/dashboardRoutes.test.ts
  • server/storage.ts
  • server/activityRoutes.test.ts
  • server/routes.ts
  • client/package.json
  • tailwind.config.ts
  • server/emailRoutes.ts
  • server/activityRoutes.ts
  • server/python-bridge.ts
  • server/categoryRoutes.test.ts
  • server/ai-engine.ts
  • server/python_backend/tests/test_gmail_routes.py
  • server/aiRoutes.test.ts
  • setup.js
  • server/gmailRoutes.test.ts
  • server/index.ts
  • vite.config.ts
  • drizzle.config.ts
  • server/emailRoutes.test.ts
🧰 Additional context used
🧬 Code graph analysis (20)
backend/python_nlp/tests/analysis_components/test_topic_model.py (1)
backend/python_nlp/analysis_components/topic_model.py (1)
  • TopicModel (7-132)
backend/python_backend/gmail_routes.py (1)
backend/python_nlp/gmail_service.py (1)
  • GmailAIService (30-770)
backend/python_nlp/gmail_service.py (2)
backend/python_backend/ai_engine.py (1)
  • AdvancedAIEngine (55-321)
backend/python_backend/database.py (1)
  • DatabaseManager (50-618)
backend/python_backend/run_server.py (1)
backend/python_backend/database.py (1)
  • DatabaseManager (50-618)
backend/python_nlp/nlp_engine.py (1)
backend/python_nlp/text_utils.py (1)
  • clean_text (4-16)
backend/python_backend/tests/test_gmail_routes.py (2)
backend/python_nlp/gmail_service.py (4)
  • sync_gmail_emails (151-207)
  • execute_smart_retrieval (649-706)
  • get_retrieval_strategies (708-735)
  • get_performance_metrics (737-770)
backend/python_backend/gmail_routes.py (1)
  • get_retrieval_strategies (201-214)
backend/extensions/example/example.py (1)
backend/python_nlp/nlp_engine.py (1)
  • NLPEngine (59-883)
backend/python_backend/tests/test_category_routes.py (1)
backend/python_backend/database.py (1)
  • get_db (623-629)
backend/python_backend/__init__.py (2)
backend/python_nlp/gmail_service.py (1)
  • GmailAIService (30-770)
backend/python_nlp/smart_filters.py (2)
  • EmailFilter (17-31)
  • SmartFilterManager (50-1530)
backend/python_backend/tests/test_ai_engine.py (3)
backend/python_backend/ai_engine.py (2)
  • AdvancedAIEngine (55-321)
  • AIAnalysisResult (21-52)
backend/python_nlp/nlp_engine.py (1)
  • NLPEngine (59-883)
backend/python_backend/database.py (1)
  • get_all_categories (273-276)
backend/python_nlp/tests/analysis_components/test_sentiment_model.py (1)
backend/python_nlp/analysis_components/sentiment_model.py (1)
  • SentimentModel (18-156)
backend/python_backend/tests/test_filter_routes.py (2)
backend/python_nlp/smart_filters.py (6)
  • main (1533-1570)
  • EmailFilter (17-31)
  • get_active_filters_sorted (1405-1427)
  • add_custom_filter (707-735)
  • create_intelligent_filters (385-403)
  • prune_ineffective_filters (737-853)
backend/python_backend/database.py (2)
  • get_recent_emails (524-526)
  • get_db (623-629)
backend/python_backend/database.py (2)
backend/python_backend/email_routes.py (1)
  • create_email (111-172)
backend/python_backend/category_routes.py (1)
  • create_category (53-88)
backend/python_nlp/tests/analysis_components/test_urgency_model.py (1)
backend/python_nlp/analysis_components/urgency_model.py (1)
  • UrgencyModel (8-76)
backend/python_nlp/tests/analysis_components/test_intent_model.py (1)
backend/python_nlp/analysis_components/intent_model.py (1)
  • IntentModel (8-83)
backend/python_backend/filter_routes.py (1)
backend/python_nlp/smart_filters.py (3)
  • add_custom_filter (707-735)
  • create_intelligent_filters (385-403)
  • prune_ineffective_filters (737-853)
backend/python_backend/email_routes.py (1)
backend/python_backend/database.py (2)
  • search_emails_by_category (477-521)
  • search_emails (436-474)
backend/python_backend/main.py (2)
backend/python_nlp/gmail_service.py (1)
  • GmailAIService (30-770)
backend/python_nlp/smart_filters.py (1)
  • SmartFilterManager (50-1530)
backend/python_backend/tests/test_email_routes.py (3)
backend/python_backend/database.py (4)
  • search_emails_by_category (477-521)
  • search_emails (436-474)
  • create_email (185-264)
  • get_email_by_id (266-271)
backend/python_backend/ai_engine.py (2)
  • to_dict (38-52)
  • analyze_email (110-141)
backend/python_backend/email_routes.py (1)
  • create_email (111-172)
backend/python_backend/ai_engine.py (2)
backend/python_backend/database.py (1)
  • DatabaseManager (50-618)
backend/python_nlp/nlp_engine.py (1)
  • NLPEngine (59-883)
🪛 Pylint (3.3.8)
backend/python_nlp/gmail_service.py

[error] 19-19: Attempted relative import beyond top-level package

(E0402)


[error] 20-20: Attempted relative import beyond top-level package

(E0402)

backend/python_nlp/ai_training.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

backend/python_backend/tests/test_gmail_routes.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

backend/python_backend/tests/test_category_routes.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

backend/python_backend/performance_monitor.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

run.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

backend/python_backend/database.py

[refactor] 193-193: Unnecessary use of a comprehension, use dict(email_data.items()) instead.

(R1721)

backend/__init__.py

[refactor] 1-1: Useless option value for '--disable', 'C0330' was removed from pylint, see pylint-dev/pylint#3571.

(R0022)

🪛 markdownlint-cli2 (0.18.1)
.openhands/microagents/repo.md

40-40: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🪛 Ruff (0.13.1)
backend/python_nlp/nlp_engine.py

715-715: Unused method argument: text

(ARG002)

run.py

12-12: Possible binding to all interfaces

(S104)

backend/python_backend/database.py

93-93: Use logging.exception instead of logging.error

Replace with exception

(TRY400)
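The TRY400 fix in context, as a hedged sketch (the function and message below are illustrative, not the actual `database.py` code):

```python
import logging

logger = logging.getLogger("emailintel.demo")

def save_email(data):
    """Persist an email record; log failures with a traceback."""
    try:
        raise OSError("disk full")  # stand-in for the real write failure
    except OSError:
        # TRY400: logger.exception logs at ERROR level *and* appends the
        # active traceback; logger.error would drop the traceback unless
        # exc_info=True were passed explicitly.
        logger.exception("Failed to save email")
        return None
```

Swapping `logging.error` for `logging.exception` inside an `except` block is behavior-preserving apart from the added traceback in the log record.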

backend/python_backend/main.py

90-90: Unused function argument: full_path

(ARG001)

🪛 actionlint (1.7.7)
.github/workflows/ci.yml

18-18: the runner of "actions/setup-python@v4" action is too old to run on GitHub Actions. update the action's version to fix this issue

(action)

🔇 Additional comments (24)
pyproject.toml (1)

8-10: All specified package versions are available on PyPI

fastapi ≥ 0.115.12 (latest 0.117.1), httpx ≥ 0.28.1 (latest 0.28.1), and psycopg2-binary ≥ 2.9.10 (latest 2.9.10) are published and installable.

backend/python_nlp/gmail_service.py (1)

660-668: Subcommands supported
Verified that smart_retrieval.py defines the execute-strategies and list-strategies positional subcommands via add_subparsers and handles them in main_cli()—no changes needed.

backend/python_nlp/tests/analysis_components/test_intent_model.py (1)

5-5: Import path migration — LGTM

The updated import aligns with the new package layout.

backend/python_backend/tests/test_ai_engine.py (1)

15-18: Good test isolation on AsyncMock reset

Resetting side effects between tests prevents cross‑test leakage.

backend/python_backend/tests/test_category_routes.py (1)

6-6: Import path update LGTM.

The app import now correctly points to backend.python_backend.main.

backend/python_backend/gmail_routes.py (1)

93-99: Error detail extraction improvement LGTM.

Handling dict/str shapes from Gmail errors is robust and avoids noisy logs.

Also applies to: 155-161, 166-166

backend/python_backend/email_routes.py (2)

35-41: Search + category fan-in logic LGTM.

Explicit None checks avoid false negatives for category_id=0.


118-123: Confirm AIAnalysisResult attribute mapping
ai_engine.analyze_email returns an AIAnalysisResult instance (not a dict), so attribute access won’t fail on the return type. Still, verify that AIAnalysisResult actually sets or proxies every field read on ai_analysis (e.g. sentiment, categories) in email_routes.py.
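The distinction the comment draws, in miniature (the dataclass below is a simplified stand-in, not the real AIAnalysisResult):

```python
from dataclasses import dataclass, field

@dataclass
class AIAnalysisResult:  # simplified stand-in for the real model
    sentiment: str = "neutral"
    categories: list = field(default_factory=list)

result = AIAnalysisResult(sentiment="positive", categories=["work"])

# Attribute access works because the field is declared on the class...
assert result.sentiment == "positive"

# ...but any field read in email_routes.py that the class neither sets
# nor proxies would raise AttributeError, which is the risk to verify.
assert not hasattr(result, "urgency")
```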

.github/workflows/ci.yml (2)

3-7: LGTM! Well-configured CI triggers.

The workflow correctly triggers on pushes and pull requests to both main and scientific branches, which aligns with the project's branching strategy.


31-43: Comprehensive testing and quality checks.

The CI workflow includes all essential quality gates: testing with coverage, linting (flake8), formatting checks (black, isort), and type checking (mypy). The configuration is appropriate for the Python-first transition mentioned in the PR objectives.

.github/workflows/README.md (2)

1-15: Excellent documentation structure and coverage.

The README provides comprehensive documentation for the CI workflows, including triggers, purposes, and features. The structure is clear and informative.


17-27: Documentation reference verified. The dependabot-auto-merge.yml file exists at .github/workflows/dependabot-auto-merge.yml, so no changes needed.

backend/python_backend/tests/test_email_routes.py (4)

10-38: Excellent refactoring with helper function.

The create_mock_email helper function centralizes mock email creation and ensures consistent structure across tests. This reduces code duplication and makes tests more maintainable.


136-145: Good test coverage for combined search and category filtering.

The new test test_search_emails_in_category properly validates the combined search and category functionality, ensuring the correct database method is called with the right parameters.


215-237: Comprehensive error handling test with fallback.

The test properly handles the case where psycopg2 might not be available in the test environment by creating a mock error class. The side effect reset ensures test isolation.
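The fallback pattern described above can be sketched as follows (names are illustrative; the real test builds its own mock error class):

```python
try:
    from psycopg2 import OperationalError as DBError
except ImportError:
    # psycopg2 may be absent in the test environment; substitute a
    # stand-in class that plays the same role in except clauses.
    class DBError(Exception):
        """Mock replacement for psycopg2.OperationalError."""

def fetch_emails(connect):
    """Return emails, degrading to an empty list on connection errors."""
    try:
        return connect()
    except DBError:
        return []

def broken_connect():
    raise DBError("connection refused")

assert fetch_emails(broken_connect) == []
assert fetch_emails(lambda: ["msg"]) == ["msg"]
```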


182-186: Test mocks correctly match API model. create_mock_email intentionally returns camelCase keys to satisfy EmailResponse; ignore the database’s snake_case output here.

Likely an incorrect or invalid review comment.

backend/python_backend/tests/test_filter_routes.py (3)

11-15: Update mock method name to match implementation.

The test correctly updates the mock to use get_active_filters_sorted instead of the old get_all_filters method name, aligning with the actual smart filters implementation shown in the relevant code snippets.


62-98: Comprehensive test payload with proper validation.

The test now includes a complete filter payload with all required fields (description, criteria, actions) that matches the EmailFilter structure from the smart filters module.


33-40: Proper test isolation with mock resets.

The fixture correctly resets all mocks before each test to ensure proper test isolation and prevent state leakage between tests.

backend/python_backend/tests/test_gmail_routes.py (4)

97-116: LGTM: smart retrieval route contract and arg mapping covered.


118-141: LGTM: strategies endpoint happy-path covered; logs on error in route are appropriate.


143-157: LGTM: performance endpoint happy-path covered.


83-90: Verified google-api-python-client presence: The package is listed in both requirements.txt and requirements_versions.txt, so importing HttpError is supported.

backend/python_backend/models.py (1)

351-356: Align health status vocabulary with AI engine.

ServiceHealth restricts status; after fixing ai_engine to use healthy/degraded/unhealthy, this model is fine. If you intend to accept "ok", extend the regex.
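Assuming the ServiceHealth constraint is a regex over those three values (the actual pattern lives in models.py), extending it to accept "ok" is a one-token change:

```python
import re

# Assumed current vocabulary; not copied from models.py.
STATUS_RE = re.compile(r"^(healthy|degraded|unhealthy)$")

# Extended variant if "ok" should also be accepted.
STATUS_RE_WITH_OK = re.compile(r"^(healthy|degraded|unhealthy|ok)$")

assert STATUS_RE.fullmatch("degraded")
assert STATUS_RE.fullmatch("ok") is None
assert STATUS_RE_WITH_OK.fullmatch("ok")
```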

Comment on lines +18 to +20
uses: actions/setup-python@v4
with:
python-version: '3.11'

⚠️ Potential issue | 🟡 Minor

Update GitHub Actions to use the latest version.

The static analysis tool correctly identified that actions/setup-python@v4 is deprecated. The latest version is v5, which provides better performance and security updates.

Apply this diff to update the action:

-      - name: Set up Python
-        uses: actions/setup-python@v4
-        with:
+      - name: Set up Python
+        uses: actions/setup-python@v5
+        with:
🧰 Tools
🪛 actionlint (1.7.7)

18-18: the runner of "actions/setup-python@v4" action is too old to run on GitHub Actions. update the action's version to fix this issue

(action)

🤖 Prompt for AI Agents
.github/workflows/ci.yml around lines 18 to 20: update the GitHub Action
reference from actions/setup-python@v4 to actions/setup-python@v5 to use the
latest supported release; modify the uses line accordingly so the workflow uses
actions/setup-python@v5.

Comment on lines +15 to +16
if: github.actor == 'dependabot[bot]' && github.event.pull_request.mergeable_state == 'clean' && !github.event.pull_request.draft


⚠️ Potential issue | 🟠 Major

Workflow guard skips all Dependabot merges

github.event.pull_request.mergeable_state is usually unknown/blocked on opened and synchronize events (especially while CI is pending), so this job never runs and auto-merge never activates. Let the job execute for Dependabot PRs and rely on the explicit wait + gh checks instead.

-    if: github.actor == 'dependabot[bot]' && github.event.pull_request.mergeable_state == 'clean' && !github.event.pull_request.draft
+    if: github.actor == 'dependabot[bot]' && !github.event.pull_request.draft
🤖 Prompt for AI Agents
.github/workflows/dependabot-auto-merge.yml lines 15-16: the current workflow
guard includes github.event.pull_request.mergeable_state == 'clean', which
prevents the job from running for Dependabot PRs because mergeable_state is
often 'unknown'/'blocked' on opened/synchronize events; remove the
mergeable_state check so the if condition only checks the actor is
dependabot[bot] and that the PR is not a draft, allowing the job to run and rely
on the existing explicit wait and gh checks to determine mergeability.

Comment on lines +50 to +53
- Edit the conditions in the workflow `if` statement (line 15)
- Adjust the CI check name in the `wait-for-check` action (line 23)
- Modify the merge strategy (currently uses `--merge`, could use `--squash` or `--rebase`)
- Change timeout values for CI wait (currently 600 seconds)

🛠️ Refactor suggestion | 🟠 Major

🧩 Analysis chain

Update line references to match actual CI workflow.

The customization section references line numbers that may not align with the actual CI workflow structure. The wait-for-check action and specific line references should be verified against the actual workflow files.


🏁 Script executed:

#!/bin/bash
# Description: Check the actual structure of CI workflow files

find .github/workflows -name "*.yml" -exec echo "=== {} ===" \; -exec cat {} \;

Length of output: 3038


Update README.md workflow customization section to use correct references
In .github/workflows/README.md, replace the hard-coded line numbers for the job-level if clause, the checkName in the wait-for-check step, merge strategy flag, and timeout settings so they align with the actual .github/workflows/dependabot-auto-merge.yml.

🤖 Prompt for AI Agents
.github/workflows/README.md around lines 50 to 53: the README references
hard-coded line numbers, a checkName, merge flag, and timeout that don't match
.github/workflows/dependabot-auto-merge.yml; update the README to reference the
actual job-level if clause location (or remove line-number references entirely),
use the exact checkName string from dependabot-auto-merge.yml for the
wait-for-check step, change the merge strategy flag to match the workflow's
configured strategy (--merge/--squash/--rebase) and set the timeout value to the
same seconds used in the workflow (replace 600 with the workflow's timeout),
ensuring all README examples mirror the live file rather than hard-coded line
numbers.

Comment on lines +17 to +19
pnpm-lock.yaml
package-lock.json
uv.lock

⚠️ Potential issue | 🟠 Major

Do not ignore package manager lockfiles.

Filtering out pnpm-lock.yaml, package-lock.json, and uv.lock prevents us from committing the dependency locks that guarantee reproducible installs for both the remaining JS toolchain and the new Python uv workflow. Dropping these files from version control will cause CI/local environments to drift from each other, undermining the PR’s stated goal of stabilizing the backend and the test suite. Please keep these lockfiles tracked (remove the ignore patterns) so installs stay deterministic.

Apply this diff to fix the issue:

- pnpm-lock.yaml
- package-lock.json
- uv.lock
🤖 Prompt for AI Agents
In .gitignore around lines 17 to 19, the lockfiles pnpm-lock.yaml,
package-lock.json, and uv.lock are being ignored which prevents committing
dependency lockfiles; remove those three ignore entries (delete the lines or
comment them out) from .gitignore and then add/commit the corresponding
lockfiles to the repo so installs remain deterministic for CI and local
environments.

Comment on lines +110 to +113
## CI/CD Information

**Note**: No GitHub Actions workflows are currently configured in this repository. The project relies on local development tools and manual testing processes.


⚠️ Potential issue | 🟠 Major

CI note contradicts the new workflows

We now ship .github/workflows/ci.yml and Dependabot auto-merge; documenting “no workflows” is misleading for anyone auditing automation. Please align the note with reality.

-**Note**: No GitHub Actions workflows are currently configured in this repository. The project relies on local development tools and manual testing processes.
+**Note**: GitHub Actions workflows (e.g., `.github/workflows/ci.yml`, `.github/workflows/dependabot-auto-merge.yml`) handle automated testing and Dependabot merges alongside the local tooling described above.
🤖 Prompt for AI Agents
.openhands/microagents/repo.md around lines 110 to 113: the CI note incorrectly
claims there are no GitHub Actions workflows while the repo now ships
.github/workflows/ci.yml and Dependabot auto-merge; update the CI/CD section to
reflect current automation by removing or replacing the “No GitHub Actions
workflows” sentence with a brief accurate statement that the repository uses a
CI workflow at .github/workflows/ci.yml and Dependabot auto-merge for dependency
updates, optionally adding a one-line pointer to where to find workflow and
Dependabot configuration files.

Comment on lines +6 to +38
## Architecture
- **Frontend**: React (client/) with TypeScript, TailwindCSS, and Radix UI components
- **Backend**: Express.js (server/) with Python AI/ML integration
- **AI Engine**: Python-based NLP models for sentiment, intent, topic, and urgency analysis
- **Database**: SQLite for local storage and caching

## Development Setup
- Run `npm run dev` to start the Node.js server in development mode
- Run `npm run test:py` for Python tests
- Run `npm run test:ts` for TypeScript tests
- Client development server runs on separate port via Vite

## Project Structure
- `/client/` - React frontend application
- `/server/` - Express.js backend and API routes
- `/server/python_nlp/` - Python NLP models and analysis components
- `/server/python_backend/` - Python backend services
- `/extensions/` - Extensible plugin system
- `/shared/` - Shared TypeScript schemas

## Key Features
- Gmail integration with OAuth
- AI-powered email analysis (sentiment, intent, topic, urgency)
- Smart filtering and categorization
- Performance metrics and analytics
- Dashboard with email insights

## Technology Stack
- **Frontend**: React 18, TypeScript, TailwindCSS, Radix UI, Wouter (routing), React Query
- **Backend**: Express.js, TypeScript, Python (FastAPI/Flask)
- **AI/ML**: Python NLP models, scikit-learn
- **Database**: SQLite, Drizzle ORM
- **Build Tools**: Vite, esbuild

⚠️ Potential issue | 🟠 Major

Update knowledge doc to match the refactored backend

This file still describes an Express/TypeScript backend under /server, but the repository now serves APIs from the Python FastAPI modules in backend/python_backend/ (see this PR’s changes). New contributors will follow the wrong stack/setup instructions unless this section is corrected.

🤖 Prompt for AI Agents
knowledge.md lines 6-38: Update the "Architecture", "Development Setup",
"Project Structure", and "Backend" references to reflect the refactor: replace
mentions of Express.js/TypeScript backend under /server with the FastAPI Python
backend located at backend/python_backend/, update development commands to show
how to run the Python FastAPI server (e.g., python -m uvicorn ... or the repo's
specific start script) and any changed test commands, adjust project structure
paths to remove or de-emphasize /server and point to backend/python_backend and
backend/python_nlp as appropriate, and ensure the Technology Stack lists
FastAPI/Python instead of Express.js/TypeScript where applicable so new
contributors have accurate setup and run instructions.

"httpx>=0.28.1",
"psycopg2-binary>=2.9.10",
"pydantic>=2.11.5",
"pytest-asyncio>=1.2.0",

🛠️ Refactor suggestion | 🟠 Major

Move test-only deps out of runtime

pytest-asyncio should be in the dev group, not production deps.

-    "pytest-asyncio>=1.2.0",

And add under [dependency-groups].dev:

+    "pytest-asyncio>=1.2.0",
🤖 Prompt for AI Agents
In pyproject.toml around lines 12 to 12, pytest-asyncio is listed as a
runtime/production dependency; move it to the dev dependency group by removing
"pytest-asyncio>=1.2.0" from the main dependencies table and add it under
[dependency-groups].dev (e.g., dependency-groups.dev.dependencies =
["pytest-asyncio>=1.2.0"] or equivalent format used in this file) so test-only
deps are not installed in production.

Comment on lines +13 to 19
1. **Clone the Repository and Switch to the `scientific` Branch:**
```bash
./launch.sh --gradio-ui
```
- On Windows:
```bash
launch.bat --gradio-ui
# Replace <repo_url> and <repo_name> with actual values
git clone <repo_url>
cd <repo_name>
git checkout scientific
```

⚠️ Potential issue | 🟠 Major

Checkout instructions reference the wrong branch

Telling readers to switch to a scientific branch prevents them from running the code on main, which is where this README ships. Please point to the default branch (or drop the extra checkout step).

-    git checkout scientific
+    git checkout main
🤖 Prompt for AI Agents
In README.md around lines 13 to 19, the checkout instructions wrongly tell users
to switch to a non-existent "scientific" branch; update this to reference the
repository's default branch or remove the checkout step entirely. Replace the
"git checkout scientific" step with either "git checkout main" (or the actual
default branch name) or delete the checkout command and add a brief note saying
the README assumes the default branch is already checked out; ensure the clone
example uses placeholder values and that the instructions remain accurate for
users cloning the repo.

Comment on lines +49 to +63
* **Main Application Data:** Email data, categories, and user information are stored as JSON files in the `server/python_backend/data/` directory.
* **Smart Filter Rules:** Configuration for smart filters is stored in an SQLite database file named `smart_filters.db` located in the project root.
* **Email Cache:** A local cache for fetched email content is stored in `email_cache.db` in the project root.

## Database
These files will be created automatically when the application runs if they don't already exist.

The application uses a PostgreSQL database.
- Configure `DATABASE_URL` environment variable (see [Configuration](#configuration) and [Database Setup for Development](#database-setup-for-development)).
- Schema migrations are handled by Drizzle ORM:
- `npm run db:push`: Applies schema changes to the database.
- `npm run db:generate`: Generates new migration files if you change Drizzle schema definitions (typically in `shared/schema.ts` or similar).
(Or via `python deployment/deploy.py <env> migrate` for Dockerized environments as part of a deployment workflow).
## Stopping the Application

## Extension System
To stop both the backend and frontend servers, press `Ctrl+C` in the terminal window where `launch.sh` or `python launch.py` is running. The launcher script is designed to shut down all started processes gracefully.

EmailIntelligence features an extension system for adding custom functionality.
- Manage extensions using `launch.py` (e.g., `--list-extensions`, `--install-extension`).
- For developing extensions and more details, see the [Extensions Guide](docs/extensions_guide.md) and the [Environment Management Guide](docs/env_management.md#extension-system).
## Development Notes

## Debugging Hangs

### Debugging Pytest Hangs
* Use `pytest -vvv` or `pytest --capture=no`.
* Isolate tests: `pytest path/to/test_file.py::test_name`.
* Use `breakpoint()` or `import pdb; pdb.set_trace()`.
* Check for timeouts logged by `deployment/run_tests.py`.

### Debugging NPM/Build Hangs
* Examine verbose output (e.g., Vite's `--debug`, esbuild's `--log-level=verbose`).
* Use `node --inspect-brk your_script.js`.
* Check resource limits (memory, CPU).
* Try cleaning cache/modules: `npm cache clean --force`, remove `node_modules` & `package-lock.json`, then `npm install`.

### General Debugging on Linux
* Monitor resources: `top`, `htop`, `vmstat`.
* Trace system calls: `strace -p <PID>`.
* Check kernel messages: `dmesg -T`.
* Ensure adequate disk space.

For more detailed guides and specific component documentation, please refer to the [Documentation](#documentation) section.

## Known Vulnerabilities

- Four moderate severity vulnerabilities related to `esbuild` persist as of the last audit.
- These vulnerabilities are due to `drizzle-kit` (and its transitive dependencies like `@esbuild-kit/core-utils`) requiring older, vulnerable versions of `esbuild`. Specifically, `drizzle-kit`'s dependency tree pulls in `esbuild@0.18.20` and `esbuild@0.19.12`, both of which are vulnerable (<=0.24.2).
- Attempts to override these nested `esbuild` versions to a non-vulnerable version (e.g., `^0.25.5`, which is used by other parts of this project like Vite) using npm's `overrides` feature in `package.json` were made. However, these overrides were not fully effective, with `npm list` indicating version incompatibilities for the overridden packages. `npm audit` continued to report the vulnerabilities.
- These `esbuild` vulnerabilities cannot be fully remediated without an update to `drizzle-kit` itself that addresses its `esbuild` dependency requirements, particularly for the deprecated `@esbuild-kit/*` packages.
- On a related note, `vite` and `@vitejs/plugin-react` were successfully updated to their latest compatible versions (`vite@6.3.5` and `@vitejs/plugin-react@4.5.2` respectively) during the audit process to address other potential issues and ensure compatibility.

## Testing

This project includes unit tests for the Python backend components, primarily focusing on the NLP functionalities.

### Python Test Setup

1. **Install Python Development Dependencies:**
Ensure you have Python installed (as per `pyproject.toml`, e.g., Python 3.11+). The development dependencies, including `pytest` and libraries like `textblob` and `nltk`, are listed in `pyproject.toml` under the `[project.group.dev.dependencies]` section. Install them using pip:
```bash
pip install .[dev]
```
(If you encounter issues with this, ensure your pip is up to date (`pip install --upgrade pip`) as support for `project.group` is relatively new. Alternatively, you might need to manually install the packages listed in the `dev` group.)

2. **NLTK Data (for NLP tests):**
The NLP tests require certain NLTK data packages. Download the 'punkt' tokenizer data:
```bash
python -m nltk.downloader punkt
```

### Running Python Tests

To run all available Python unit tests, use the following npm script:

```bash
npm test
* The Python backend is located in `server/python_backend/`.
* The NLP processing logic is in `server/python_nlp/`.
* The frontend client (React/Vite) is in `client/`.

⚠️ Potential issue | 🟠 Major

Paths still refer to the removed server/ hierarchy

The backend now lives under backend/.... These outdated paths steer users into dead directories and break setup. Update the docs to match the current layout.

-*   **Main Application Data:** Email data, categories, and user information are stored as JSON files in the `server/python_backend/data/` directory.
+*   **Main Application Data:** Email data, categories, and user information are stored as JSON files in the `backend/data/` directory.
-*   The Python backend is located in `server/python_backend/`.
-*   The NLP processing logic is in `server/python_nlp/`.
+*   The Python backend is located in `backend/python_backend/`.
+*   The NLP processing logic is in `backend/python_nlp/`.
🤖 Prompt for AI Agents
In README.md around lines 49 to 63, the documentation still references the
removed server/ hierarchy (e.g., server/python_backend/, server/python_nlp/)
which is now backend/; update all paths and examples in this section to the
current repo layout (replace server/python_backend/ → backend/python_backend/ or
the exact new path if different, server/python_nlp/ → backend/python_nlp/ or new
path, and adjust any file locations mentioned such as data/ or
smart_filters.db/email_cache.db to their current locations), ensure the
statements about automatic file creation and stopping the application still
reflect the launcher script location (launch.sh / python launch.py) in the new
structure, and verify the frontend path (client/) is correct — make the minimal
path edits so links and instructions point to existing directories.

Comment on lines +10 to +12
port = int(os.getenv("PORT", 8000))
# We use a string to specify the app location to allow for reloading.
uvicorn.run("backend.python_backend.main:app", host="0.0.0.0", port=port, reload=True)

⚠️ Potential issue | 🟠 Major

Don’t bind to 0.0.0.0 with reload enabled by default

Binding to all interfaces (S104) plus unconditional reload=True is risky; gate both by env (dev vs prod) and allow HOST override.

 if __name__ == "__main__":
-    port = int(os.getenv("PORT", 8000))
-    # We use a string to specify the app location to allow for reloading.
-    uvicorn.run("backend.python_backend.main:app", host="0.0.0.0", port=port, reload=True)
+    port = int(os.getenv("PORT", 8000))
+    env = os.getenv("NODE_ENV", "development")
+    host = os.getenv("HOST", "127.0.0.1" if env == "development" else "0.0.0.0")
+    reload = env == "development"
+    # Use string app path to support reload
+    uvicorn.run("backend.python_backend.main:app", host=host, port=port, reload=reload)
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

Replace:

    port = int(os.getenv("PORT", 8000))
    # We use a string to specify the app location to allow for reloading.
    uvicorn.run("backend.python_backend.main:app", host="0.0.0.0", port=port, reload=True)

with:

    if __name__ == "__main__":
        port = int(os.getenv("PORT", 8000))
        env = os.getenv("NODE_ENV", "development")
        host = os.getenv(
            "HOST",
            "127.0.0.1" if env == "development" else "0.0.0.0"
        )
        reload = env == "development"
        # Use string app path to support reload
        uvicorn.run(
            "backend.python_backend.main:app",
            host=host,
            port=port,
            reload=reload
        )
🧰 Tools
🪛 Ruff (0.13.1)

12-12: Possible binding to all interfaces

(S104)

🤖 Prompt for AI Agents
In run.py around lines 10-12, currently the server always binds to 0.0.0.0 and
sets reload=True; change this to read HOST and a dev/reload flag from
environment so you don't unconditionally bind to all interfaces or enable
auto-reload in production. Use os.getenv("HOST", "127.0.0.1") to allow override
of the bind address, and gate reload via something like os.getenv("ENV",
"production") == "development" or a dedicated RELOAD env var; pass the evaluated
host and reload values into uvicorn.run instead of hardcoding "0.0.0.0" and
True. Ensure port still falls back to 8000.
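The fix both suggestions describe boils down to a small, testable resolver. The sketch below uses a hypothetical `resolve_server_config` helper and assumes the `NODE_ENV`/`HOST`/`PORT` variable names from the review; the project's actual code may differ.

```python
import os

def resolve_server_config(env=None):
    """Derive (host, port, reload) for uvicorn from environment settings.

    Development (the default) binds to localhost with auto-reload on;
    production binds to all interfaces with reload off. Explicit
    HOST/PORT values always take precedence.
    """
    env = os.environ if env is None else env
    dev = env.get("NODE_ENV", "development") == "development"
    host = env.get("HOST", "127.0.0.1" if dev else "0.0.0.0")
    port = int(env.get("PORT", 8000))
    return host, port, dev
```

The `__main__` block would then unpack the tuple and pass it to `uvicorn.run("backend.python_backend.main:app", host=host, port=port, reload=reload)`, keeping the policy decision in one easily unit-tested place.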
