Spaces:

MCP-1st-Birthday
/

DETERMINATOR

Running

App Files Files Community

Joseph Pollack commited on 11 days ago

Commit

35d9120

unverified ·

1 Parent(s): d5a01e1

adds file returns , configuration enhancements , oauth fixes , and interface fixes

Browse files

This view is limited to 50 files because it contains too many changes. See raw diff

Files changed (50) hide show

FILE_OUTPUT_IMPLEMENTATION_PLAN.md +237 -0
REPORT_WRITING_AGENTS_ANALYSIS.md +2 -0
SERPER_WEBSEARCH_IMPLEMENTATION_PLAN.md +2 -0
docs/api/agents.md +2 -0
docs/api/models.md +2 -0
docs/api/orchestrators.md +2 -0
docs/api/services.md +2 -0
docs/api/tools.md +2 -0
docs/architecture/agents.md +2 -0
docs/architecture/middleware.md +2 -0
docs/architecture/services.md +2 -0
docs/architecture/tools.md +2 -0
docs/contributing/code-quality.md +2 -0
docs/contributing/code-style.md +2 -0
docs/contributing/error-handling.md +2 -0
docs/contributing/implementation-patterns.md +2 -0
docs/contributing/index.md +2 -0
docs/contributing/prompt-engineering.md +2 -0
docs/contributing/testing.md +2 -0
docs/getting-started/examples.md +2 -0
docs/getting-started/installation.md +2 -0
docs/getting-started/mcp-integration.md +2 -0
docs/getting-started/quick-start.md +2 -0
docs/implementation/IMPLEMENTATION_SUMMARY.md +2 -0
docs/implementation/TTS_MODAL_IMPLEMENTATION.md +2 -0
docs/license.md +2 -0
docs/overview/architecture.md +2 -0
docs/overview/features.md +2 -0
docs/team.md +2 -0
new_env.txt +2 -0
src/agent_factory/judges.py +45 -18
src/app.py +8 -1
src/middleware/state_machine.py +2 -0
src/orchestrator/graph_orchestrator.py +40 -0
src/orchestrator/research_flow.py +63 -0
src/services/image_ocr.py +2 -0
src/services/report_file_service.py +269 -0
src/tools/crawl_adapter.py +2 -0
src/tools/searchxng_web_search.py +2 -0
src/tools/serper_web_search.py +2 -0
src/tools/vendored/__init__.py +2 -0
src/tools/vendored/searchxng_client.py +2 -0
src/tools/vendored/serper_client.py +2 -0
src/tools/vendored/web_search_core.py +2 -0
src/tools/web_search_factory.py +2 -0
src/utils/config.py +18 -0
tests/unit/middleware/__init__.py +2 -0
tests/unit/middleware/test_budget_tracker_phase7.py +2 -0
tests/unit/middleware/test_state_machine.py +2 -0
tests/unit/middleware/test_workflow_manager.py +2 -0

FILE_OUTPUT_IMPLEMENTATION_PLAN.md ADDED Viewed

	@@ -0,0 +1,237 @@

+# File Output Implementation Plan
+## Overview
+This plan implements file writing and return functionality for report-writing agents, enabling reports to be saved as files and returned through the Gradio ChatInterface.
+## Current State Analysis
+✅ **Report Generation**: All agents generate markdown strings
+✅ **File Output Integration**: `event_to_chat_message()` supports file paths
+✅ **Graph Orchestrator**: Can handle file paths in results
+❌ **File Writing**: No agents write files to disk
+❌ **File Service**: No utility service for saving reports
+---
+## Implementation Plan
+### PROJECT 1: File Writing Service
+**Goal**: Create a reusable service for saving reports to files
+#### Activity 1.1: Create Report File Service
+**File**: `src/services/report_file_service.py` (NEW)
+**Tasks**:
+1. Create `ReportFileService` class
+2. Implement `save_report()` method
+   - Accepts: report content (str), filename (optional), output_dir (optional)
+   - Returns: file path (str)
+   - Uses temp directory by default
+   - Supports custom output directory
+   - Handles file naming with timestamps
+3. Implement `save_report_multiple_formats()` method
+   - Save as .md (always)
+   - Optionally save as .html, .pdf (future)
+4. Add configuration support
+   - Read from settings
+   - Enable/disable file saving
+   - Configurable output directory
+5. Add error handling and logging
+6. Add file cleanup utilities (optional)
+**Line-level subtasks**:
+- Line 1-20: Imports and class definition
+- Line 21-40: `__init__()` method with settings
+- Line 41-80: `save_report()` method
+  - Line 41-50: Input validation
+  - Line 51-60: Directory creation
+  - Line 61-70: File writing
+  - Line 71-80: Error handling
+- Line 81-100: `save_report_multiple_formats()` method
+- Line 101-120: Helper methods (filename generation, cleanup)
+---
+### PROJECT 2: Configuration Updates
+**Goal**: Add settings for file output functionality
+#### Activity 2.1: Update Settings Model
+**File**: `src/utils/config.py`
+**Tasks**:
+1. Add `save_reports_to_file: bool` field (default: True)
+2. Add `report_output_directory: str | None` field (default: None, uses temp)
+3. Add `report_file_format: Literal["md", "md_html", "md_pdf"]` field (default: "md")
+4. Add `report_filename_template: str` field (default: "report_{timestamp}_{query_hash}.md")
+**Line-level subtasks**:
+- Line 166-170: Add `save_reports_to_file` field after TTS config
+- Line 171-175: Add `report_output_directory` field
+- Line 176-180: Add `report_file_format` field
+- Line 181-185: Add `report_filename_template` field
+---
+### PROJECT 3: Graph Orchestrator Integration
+**Goal**: Integrate file writing into graph execution
+#### Activity 3.1: Update Graph Orchestrator
+**File**: `src/orchestrator/graph_orchestrator.py`
+**Tasks**:
+1. Import `ReportFileService` at top
+2. Initialize service in `__init__()` (optional, can be lazy)
+3. Modify `_execute_agent_node()` for synthesizer node
+   - After `long_writer_agent.write_report()`, save to file
+   - Return dict with `{"message": report, "file": file_path}`
+4. Update final event generation to handle file paths
+   - Already implemented, verify it works correctly
+**Line-level subtasks**:
+- Line 1-35: Add import for `ReportFileService`
+- Line 119-148: Update `__init__()` to accept optional file service
+- Line 589-650: Modify `_execute_agent_node()` synthesizer handling
+  - Line 642-645: After `write_report()`, add file saving
+  - Line 646-650: Return dict with file path
+- Line 534-564: Verify final event generation handles file paths (already done)
+---
+### PROJECT 4: Research Flow Integration
+**Goal**: Integrate file writing into research flows
+#### Activity 4.1: Update IterativeResearchFlow
+**File**: `src/orchestrator/research_flow.py`
+**Tasks**:
+1. Import `ReportFileService` at top
+2. Add optional file service to `__init__()`
+3. Modify `_create_final_report()` method
+   - After `writer_agent.write_report()`, save to file if enabled
+   - Return string (backward compatible) OR dict with file path
+**Line-level subtasks**:
+- Line 1-50: Add import for `ReportFileService`
+- Line 48-120: Update `__init__()` to accept optional file service
+- Line 622-667: Modify `_create_final_report()` method
+  - Line 647-652: After `write_report()`, add file saving
+  - Line 653-667: Return report string (keep backward compatible for now)
+#### Activity 4.2: Update DeepResearchFlow
+**File**: `src/orchestrator/research_flow.py`
+**Tasks**:
+1. Add optional file service to `__init__()` (if not already)
+2. Modify `_create_final_report()` method
+   - After `long_writer_agent.write_report()` or `proofreader_agent.proofread()`, save to file
+   - Return string (backward compatible) OR dict with file path
+**Line-level subtasks**:
+- Line 670-750: Update `DeepResearchFlow.__init__()` to accept optional file service
+- Line 954-1005: Modify `_create_final_report()` method
+  - Line 979-983: After `write_report()`, add file saving
+  - Line 984-989: After `proofread()`, add file saving
+  - Line 990-1005: Return report string (keep backward compatible)
+---
+### PROJECT 5: Agent Factory Integration
+**Goal**: Make file service available to agents if needed
+#### Activity 5.1: Update Agent Factory (Optional)
+**File**: `src/agent_factory/agents.py`
+**Tasks**:
+1. Add optional file service parameter to agent creation functions (if needed)
+2. Pass file service to agents that need it (currently not needed, agents return strings)
+**Line-level subtasks**:
+- Not required - agents return strings, file writing happens at orchestrator level
+---
+### PROJECT 6: Testing & Validation
+**Goal**: Ensure file output works end-to-end
+#### Activity 6.1: Unit Tests
+**File**: `tests/unit/services/test_report_file_service.py` (NEW)
+**Tasks**:
+1. Test `save_report()` with default settings
+2. Test `save_report()` with custom directory
+3. Test `save_report()` with custom filename
+4. Test error handling (permission errors, disk full, etc.)
+5. Test file cleanup
+**Line-level subtasks**:
+- Line 1-30: Test fixtures and setup
+- Line 31-60: Test basic save functionality
+- Line 61-90: Test custom directory
+- Line 91-120: Test error handling
+#### Activity 6.2: Integration Tests
+**File**: `tests/integration/test_file_output_integration.py` (NEW)
+**Tasks**:
+1. Test graph orchestrator with file output
+2. Test research flows with file output
+3. Test Gradio ChatInterface receives file paths
+4. Test file download in Gradio UI
+**Line-level subtasks**:
+- Line 1-40: Test setup with mock orchestrator
+- Line 41-80: Test file generation in graph execution
+- Line 81-120: Test file paths in AgentEvent
+- Line 121-160: Test Gradio message conversion
+---
+## Implementation Order
+1. **PROJECT 2** (Configuration) - Foundation
+2. **PROJECT 1** (File Service) - Core functionality
+3. **PROJECT 3** (Graph Orchestrator) - Primary integration point
+4. **PROJECT 4** (Research Flows) - Secondary integration points
+5. **PROJECT 6** (Testing) - Validation
+6. **PROJECT 5** (Agent Factory) - Not needed, skip
+---
+## File Changes Summary
+### New Files
+- `src/services/report_file_service.py` - File writing service
+- `tests/unit/services/test_report_file_service.py` - Unit tests
+- `tests/integration/test_file_output_integration.py` - Integration tests
+### Modified Files
+- `src/utils/config.py` - Add file output settings
+- `src/orchestrator/graph_orchestrator.py` - Add file saving after report generation
+- `src/orchestrator/research_flow.py` - Add file saving in both flows
+---
+## Gradio Integration Notes
+According to Gradio ChatInterface documentation:
+- File paths in chat message content are automatically converted to download links
+- Markdown links like `[Download: filename](file_path)` work
+- Files must be accessible from the Gradio server
+- Temp files are fine as long as they exist during the session
+Current implementation in `event_to_chat_message()` already handles this correctly.
+---
+## Success Criteria
+✅ Reports are saved to files when generated
+✅ File paths are included in AgentEvent data
+✅ File paths appear as download links in Gradio ChatInterface
+✅ File saving is configurable (can be disabled)
+✅ Backward compatible (existing code still works)
+✅ Error handling prevents crashes if file writing fails

REPORT_WRITING_AGENTS_ANALYSIS.md CHANGED Viewed

	@@ -181,3 +181,5 @@ return {
181	The infrastructure to handle file outputs in Gradio is in place, but the agents themselves do not yet write files. They would need to be enhanced or wrapped to add file writing capability.
182
183


181	The infrastructure to handle file outputs in Gradio is in place, but the agents themselves do not yet write files. They would need to be enhanced or wrapped to add file writing capability.
182
183
184	+
185	+

SERPER_WEBSEARCH_IMPLEMENTATION_PLAN.md CHANGED Viewed

	@@ -395,3 +395,5 @@ This plan details the implementation of SERPER-based web search by vendoring cod
395	- Consider adding relevance scoring in the future
396
397


395	- Consider adding relevance scoring in the future
396
397
398	+
399	+

docs/api/agents.md CHANGED Viewed

	@@ -272,3 +272,5 @@ def create_input_parser_agent(model: Any \| None = None) -> InputParserAgent
272
273
274


272
273
274
275	+
276	+

docs/api/models.md CHANGED Viewed

	@@ -250,3 +250,5 @@ class BudgetStatus(BaseModel):
250
251
252


250
251
252
253	+
254	+

docs/api/orchestrators.md CHANGED Viewed

	@@ -197,3 +197,5 @@ Runs Magentic orchestration.
197
198
199


197
198
199
200	+
201	+

docs/api/services.md CHANGED Viewed

	@@ -203,3 +203,5 @@ Analyzes a hypothesis using statistical methods.
203
204
205


203
204
205
206	+
207	+

docs/api/tools.md CHANGED Viewed

	@@ -237,3 +237,5 @@ Searches multiple tools in parallel.
237
238
239


237
238
239
240	+
241	+

docs/architecture/agents.md CHANGED Viewed

	@@ -194,3 +194,5 @@ Factory functions:
194
195
196


194
195
196
197	+
198	+

docs/architecture/middleware.md CHANGED Viewed

	@@ -144,3 +144,5 @@ All middleware components use `ContextVar` for thread-safe isolation:
144
145
146


144
145
146
147	+
148	+

docs/architecture/services.md CHANGED Viewed

	@@ -144,3 +144,5 @@ if settings.has_openai_key:
144
145
146


144
145
146
147	+
148	+

docs/architecture/tools.md CHANGED Viewed

	@@ -177,3 +177,5 @@ search_handler = SearchHandler(
177
178
179


177
178
179
180	+
181	+

docs/contributing/code-quality.md CHANGED Viewed

	@@ -83,3 +83,5 @@ async def search(self, query: str, max_results: int = 10) -> list[Evidence]:
83
84
85


83
84
85
86	+
87	+

docs/contributing/code-style.md CHANGED Viewed

	@@ -63,3 +63,5 @@ result = await loop.run_in_executor(None, cpu_bound_function, args)
63
64
65


63
64
65
66	+
67	+

docs/contributing/error-handling.md CHANGED Viewed

	@@ -71,3 +71,5 @@ except httpx.HTTPError as e:
71
72
73


71
72
73
74	+
75	+

docs/contributing/implementation-patterns.md CHANGED Viewed

	@@ -86,3 +86,5 @@ def get_embedding_service() -> EmbeddingService:
86
87
88


86
87
88
89	+
90	+

docs/contributing/index.md CHANGED Viewed

	@@ -165,3 +165,5 @@ Thank you for contributing to DeepCritical!
165
166
167


165
166
167
168	+
169	+

docs/contributing/prompt-engineering.md CHANGED Viewed

	@@ -71,3 +71,5 @@ This document outlines prompt engineering guidelines and citation validation rul
71
72
73


71
72
73
74	+
75	+

docs/contributing/testing.md CHANGED Viewed

	@@ -67,3 +67,5 @@ async def test_real_pubmed_search():
67
68
69


67
68
69
70	+
71	+

docs/getting-started/examples.md CHANGED Viewed

	@@ -211,3 +211,5 @@ USE_GRAPH_EXECUTION=true
211
212
213


211
212
213
214	+
215	+

docs/getting-started/installation.md CHANGED Viewed

	@@ -150,3 +150,5 @@ uv run pre-commit install
150
151
152


150
151
152
153	+
154	+

docs/getting-started/mcp-integration.md CHANGED Viewed

	@@ -217,3 +217,5 @@ You can configure multiple DeepCritical instances:
217
218
219


217
218
219
220	+
221	+

docs/getting-started/quick-start.md CHANGED Viewed

	@@ -121,3 +121,5 @@ What are the active clinical trials investigating Alzheimer's disease treatments
121
122
123


121
122
123
124	+
125	+

docs/implementation/IMPLEMENTATION_SUMMARY.md CHANGED Viewed

	@@ -180,3 +180,5 @@ Located in `src/app.py` lines 667-712:
180
181
182


180
181
182
183	+
184	+

docs/implementation/TTS_MODAL_IMPLEMENTATION.md CHANGED Viewed

	@@ -134,3 +134,5 @@ To test TTS:
134
135
136


134
135
136
137	+
138	+

docs/license.md CHANGED Viewed

	@@ -41,3 +41,5 @@ SOFTWARE.
41
42
43


41
42
43
44	+
45	+

docs/overview/architecture.md CHANGED Viewed

	@@ -198,3 +198,5 @@ The system supports complex research workflows through:
198
199
200


198
199
200
201	+
202	+

docs/overview/features.md CHANGED Viewed

	@@ -150,3 +150,5 @@ DeepCritical provides a comprehensive set of features for AI-assisted research:
150
151
152


150
151
152
153	+
154	+

docs/team.md CHANGED Viewed

	@@ -46,3 +46,5 @@ We welcome contributions! See the [Contributing Guide](contributing/index.md) fo
46
47
48


46
47
48
49	+
50	+

new_env.txt CHANGED Viewed

	@@ -96,3 +96,5 @@ MODAL_TOKEN_SECRET=your_modal_token_secret_here
96
97
98


96
97
98
99	+
100	+

src/agent_factory/judges.py CHANGED Viewed

@@ -33,34 +33,61 @@ def get_model(oauth_token: str | None = None) -> Any:
     Explicitly passes API keys from settings to avoid requiring
     users to export environment variables manually.
-    Priority: If OAuth token is available, prefer HuggingFace (even if provider is set to OpenAI).
     This ensures users logged in via HuggingFace Spaces get the free tier.
     Args:
         oauth_token: Optional OAuth token from HuggingFace login (takes priority over env vars)
     """
     # Priority: oauth_token > settings.hf_token > settings.huggingface_api_key
     effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
-    # HuggingFaceProvider requires a token - cannot use None
-    if not effective_hf_token:
-        raise ConfigurationError(
-            "HuggingFace token required. Please either:\n"
-            "1. Log in via HuggingFace OAuth (recommended for Spaces)\n"
-            "2. Set HF_TOKEN environment variable\n"
-            "3. Set huggingface_api_key in settings"
         )
-    # Always use HuggingFace with available token
-    model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
-    hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
-    logger.info(
-        "using_huggingface_with_token",
-        has_oauth=bool(oauth_token),
-        has_settings_token=bool(settings.hf_token or settings.huggingface_api_key),
-        model=model_name,
     )
-    return HuggingFaceModel(model_name, provider=hf_provider)
 class JudgeHandler:

     Explicitly passes API keys from settings to avoid requiring
     users to export environment variables manually.
+    Priority order:
+    1. HuggingFace (if OAuth token or API key available - preferred for free tier)
+    2. OpenAI (if API key available)
+    3. Anthropic (if API key available)
+    If OAuth token is available, prefer HuggingFace (even if provider is set to OpenAI).
     This ensures users logged in via HuggingFace Spaces get the free tier.
     Args:
         oauth_token: Optional OAuth token from HuggingFace login (takes priority over env vars)
+    Returns:
+        Configured Pydantic AI model
+    Raises:
+        ConfigurationError: If no LLM provider is available
     """
     # Priority: oauth_token > settings.hf_token > settings.huggingface_api_key
     effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
+    # Try HuggingFace first (preferred for free tier)
+    if effective_hf_token:
+        model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
+        hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
+        logger.info(
+            "using_huggingface_with_token",
+            has_oauth=bool(oauth_token),
+            has_settings_token=bool(settings.hf_token or settings.huggingface_api_key),
+            model=model_name,
         )
+        return HuggingFaceModel(model_name, provider=hf_provider)
+    # Fallback to OpenAI if available
+    if settings.has_openai_key:
+        assert settings.openai_api_key is not None  # Type narrowing
+        model_name = settings.openai_model
+        openai_provider = OpenAIProvider(api_key=settings.openai_api_key)
+        logger.info("using_openai", model=model_name)
+        return OpenAIModel(model_name, provider=openai_provider)
+    # Fallback to Anthropic if available
+    if settings.has_anthropic_key:
+        assert settings.anthropic_api_key is not None  # Type narrowing
+        model_name = settings.anthropic_model
+        anthropic_provider = AnthropicProvider(api_key=settings.anthropic_api_key)
+        logger.info("using_anthropic", model=model_name)
+        return AnthropicModel(model_name, provider=anthropic_provider)
+    # No provider available
+    raise ConfigurationError(
+        "No LLM provider available. Please configure one of:\n"
+        "1. HuggingFace: Log in via OAuth (recommended for Spaces) or set HF_TOKEN\n"
+        "2. OpenAI: Set OPENAI_API_KEY environment variable\n"
+        "3. Anthropic: Set ANTHROPIC_API_KEY environment variable"
     )
 class JudgeHandler:

src/app.py CHANGED Viewed

@@ -158,6 +158,7 @@ def configure_orchestrator(
         judge_handler=judge_handler,
         config=config,
         mode=effective_mode,  # type: ignore
     )
     return orchestrator, backend_info
@@ -570,7 +571,13 @@ async def research_agent(
     if oauth_token is not None:
         # OAuthToken has a .token attribute containing the access token
-        token_value = oauth_token.token if hasattr(oauth_token, "token") else None
     if oauth_profile is not None:
         # OAuthProfile has .username, .name, .profile_image attributes

         judge_handler=judge_handler,
         config=config,
         mode=effective_mode,  # type: ignore
+        oauth_token=oauth_token,
     )
     return orchestrator, backend_info
     if oauth_token is not None:
         # OAuthToken has a .token attribute containing the access token
+        if hasattr(oauth_token, "token"):
+            token_value = oauth_token.token
+        elif isinstance(oauth_token, str):
+            # Handle case where oauth_token is already a string (shouldn't happen but defensive)
+            token_value = oauth_token
+        else:
+            token_value = None
     if oauth_profile is not None:
         # OAuthProfile has .username, .name, .profile_image attributes

src/middleware/state_machine.py CHANGED Viewed

	@@ -135,3 +135,5 @@ def get_workflow_state() -> WorkflowState:
135
136
137


135
136
137
138	+
139	+

src/orchestrator/graph_orchestrator.py CHANGED Viewed

@@ -32,6 +32,7 @@ from src.legacy_orchestrator import JudgeHandlerProtocol, SearchHandlerProtocol
 from src.middleware.budget_tracker import BudgetTracker
 from src.middleware.state_machine import WorkflowState, init_workflow_state
 from src.orchestrator.research_flow import DeepResearchFlow, IterativeResearchFlow
 from src.utils.models import AgentEvent
 if TYPE_CHECKING:
@@ -147,6 +148,9 @@ class GraphOrchestrator:
         self.oauth_token = oauth_token
         self.logger = logger
         # Initialize flows (for backward compatibility)
         self._iterative_flow: IterativeResearchFlow | None = None
         self._deep_flow: DeepResearchFlow | None = None
@@ -155,6 +159,21 @@ class GraphOrchestrator:
         self._graph: ResearchGraph | None = None
         self._budget_tracker: BudgetTracker | None = None
     async def run(self, query: str) -> AsyncGenerator[AgentEvent, None]:
         """
         Run the research workflow.
@@ -649,6 +668,27 @@ class GraphOrchestrator:
             estimated_tokens = len(final_report) // 4  # Rough token estimate
             context.budget_tracker.add_tokens("graph_execution", estimated_tokens)
             return final_report
         # Standard agent execution

 from src.middleware.budget_tracker import BudgetTracker
 from src.middleware.state_machine import WorkflowState, init_workflow_state
 from src.orchestrator.research_flow import DeepResearchFlow, IterativeResearchFlow
+from src.services.report_file_service import ReportFileService, get_report_file_service
 from src.utils.models import AgentEvent
 if TYPE_CHECKING:
         self.oauth_token = oauth_token
         self.logger = logger
+        # Initialize file service (lazy if not provided)
+        self._file_service: ReportFileService | None = None
         # Initialize flows (for backward compatibility)
         self._iterative_flow: IterativeResearchFlow | None = None
         self._deep_flow: DeepResearchFlow | None = None
         self._graph: ResearchGraph | None = None
         self._budget_tracker: BudgetTracker | None = None
+    def _get_file_service(self) -> ReportFileService | None:
+        """
+        Get file service instance (lazy initialization).
+        Returns:
+            ReportFileService instance or None if disabled
+        """
+        if self._file_service is None:
+            try:
+                self._file_service = get_report_file_service()
+            except Exception as e:
+                self.logger.warning("Failed to initialize file service", error=str(e))
+                return None
+        return self._file_service
     async def run(self, query: str) -> AsyncGenerator[AgentEvent, None]:
         """
         Run the research workflow.
             estimated_tokens = len(final_report) // 4  # Rough token estimate
             context.budget_tracker.add_tokens("graph_execution", estimated_tokens)
+            # Save report to file if enabled
+            file_path: str | None = None
+            try:
+                file_service = self._get_file_service()
+                if file_service:
+                    file_path = file_service.save_report(
+                        report_content=final_report,
+                        query=query,
+                    )
+                    self.logger.info("Report saved to file", file_path=file_path)
+            except Exception as e:
+                # Don't fail the entire operation if file saving fails
+                self.logger.warning("Failed to save report to file", error=str(e))
+                file_path = None
+            # Return dict with file path if available, otherwise return string (backward compatible)
+            if file_path:
+                return {
+                    "message": final_report,
+                    "file": file_path,
+                }
             return final_report
         # Standard agent execution

src/orchestrator/research_flow.py CHANGED Viewed

@@ -25,6 +25,7 @@ from src.middleware.budget_tracker import BudgetTracker
 from src.middleware.state_machine import get_workflow_state, init_workflow_state
 from src.middleware.workflow_manager import WorkflowManager
 from src.services.llamaindex_rag import LlamaIndexRAGService, get_rag_service
 from src.tools.tool_executor import execute_tool_tasks
 from src.utils.exceptions import ConfigurationError
 from src.utils.models import (
@@ -112,6 +113,24 @@ class IterativeResearchFlow:
         # Graph orchestrator (lazy initialization)
         self._graph_orchestrator: Any = None
     async def run(
         self,
         query: str,
@@ -659,6 +678,19 @@ FINDINGS:
             tokens=estimated_tokens,
         )
         # Note: Citation validation for markdown reports would require Evidence objects
         # Currently, findings are strings, not Evidence objects. For full validation,
         # consider using ResearchReport format or passing Evidence objects separately.
@@ -725,6 +757,24 @@ class DeepResearchFlow:
         # Graph orchestrator (lazy initialization)
         self._graph_orchestrator: Any = None
     async def run(self, query: str) -> str:
         """
         Run the deep research flow.
@@ -1000,6 +1050,19 @@ class DeepResearchFlow:
                 agent="long_writer" if self.use_long_writer else "proofreader",
             )
         self.logger.info("Final report created", length=len(final_report))
         return final_report

 from src.middleware.state_machine import get_workflow_state, init_workflow_state
 from src.middleware.workflow_manager import WorkflowManager
 from src.services.llamaindex_rag import LlamaIndexRAGService, get_rag_service
+from src.services.report_file_service import ReportFileService, get_report_file_service
 from src.tools.tool_executor import execute_tool_tasks
 from src.utils.exceptions import ConfigurationError
 from src.utils.models import (
         # Graph orchestrator (lazy initialization)
         self._graph_orchestrator: Any = None
+        # File service (lazy initialization)
+        self._file_service: ReportFileService | None = None
+    def _get_file_service(self) -> ReportFileService | None:
+        """
+        Get file service instance (lazy initialization).
+        Returns:
+            ReportFileService instance or None if disabled
+        """
+        if self._file_service is None:
+            try:
+                self._file_service = get_report_file_service()
+            except Exception as e:
+                self.logger.warning("Failed to initialize file service", error=str(e))
+                return None
+        return self._file_service
     async def run(
         self,
         query: str,
             tokens=estimated_tokens,
         )
+        # Save report to file if enabled
+        try:
+            file_service = self._get_file_service()
+            if file_service:
+                file_path = file_service.save_report(
+                    report_content=report,
+                    query=query,
+                )
+                self.logger.info("Report saved to file", file_path=file_path)
+        except Exception as e:
+            # Don't fail the entire operation if file saving fails
+            self.logger.warning("Failed to save report to file", error=str(e))
         # Note: Citation validation for markdown reports would require Evidence objects
         # Currently, findings are strings, not Evidence objects. For full validation,
         # consider using ResearchReport format or passing Evidence objects separately.
         # Graph orchestrator (lazy initialization)
         self._graph_orchestrator: Any = None
+        # File service (lazy initialization)
+        self._file_service: ReportFileService | None = None
+    def _get_file_service(self) -> ReportFileService | None:
+        """
+        Get file service instance (lazy initialization).
+        Returns:
+            ReportFileService instance or None if disabled
+        """
+        if self._file_service is None:
+            try:
+                self._file_service = get_report_file_service()
+            except Exception as e:
+                self.logger.warning("Failed to initialize file service", error=str(e))
+                return None
+        return self._file_service
     async def run(self, query: str) -> str:
         """
         Run the deep research flow.
                 agent="long_writer" if self.use_long_writer else "proofreader",
             )
+        # Save report to file if enabled
+        try:
+            file_service = self._get_file_service()
+            if file_service:
+                file_path = file_service.save_report(
+                    report_content=final_report,
+                    query=query,
+                )
+                self.logger.info("Report saved to file", file_path=file_path)
+        except Exception as e:
+            # Don't fail the entire operation if file saving fails
+            self.logger.warning("Failed to save report to file", error=str(e))
         self.logger.info("Final report created", length=len(final_report))
         return final_report

src/services/image_ocr.py CHANGED Viewed

	@@ -243,3 +243,5 @@ def get_image_ocr_service() -> ImageOCRService:
243
244
245


243
244
245
246	+
247	+

src/services/report_file_service.py ADDED Viewed

	@@ -0,0 +1,269 @@

+"""Service for saving research reports to files."""
+import hashlib
+import tempfile
+from datetime import datetime
+from pathlib import Path
+from typing import Literal
+import structlog
+from src.utils.config import settings
+from src.utils.exceptions import ConfigurationError
+logger = structlog.get_logger()
+class ReportFileService:
+    """
+    Service for saving research reports to files.
+    Handles file creation, naming, and directory management for report outputs.
+    Supports saving reports in multiple formats (markdown, HTML, PDF).
+    """
+    def __init__(
+        self,
+        output_directory: str | None = None,
+        enabled: bool | None = None,
+        file_format: Literal["md", "md_html", "md_pdf"] | None = None,
+    ) -> None:
+        """
+        Initialize the report file service.
+        Args:
+            output_directory: Directory to save reports. If None, uses settings or temp directory.
+            enabled: Whether file saving is enabled. If None, uses settings.
+            file_format: File format to save. If None, uses settings.
+        """
+        self.enabled = enabled if enabled is not None else settings.save_reports_to_file
+        self.file_format = file_format or settings.report_file_format
+        self.filename_template = settings.report_filename_template
+        # Determine output directory
+        if output_directory:
+            self.output_directory = Path(output_directory)
+        elif settings.report_output_directory:
+            self.output_directory = Path(settings.report_output_directory)
+        else:
+            # Use system temp directory
+            self.output_directory = Path(tempfile.gettempdir()) / "deepcritical_reports"
+        # Create output directory if it doesn't exist
+        if self.enabled:
+            try:
+                self.output_directory.mkdir(parents=True, exist_ok=True)
+                logger.debug(
+                    "Report output directory initialized",
+                    path=str(self.output_directory),
+                    enabled=self.enabled,
+                )
+            except Exception as e:
+                logger.error("Failed to create report output directory", error=str(e), path=str(self.output_directory))
+                raise ConfigurationError(f"Failed to create report output directory: {e}") from e
+    def _generate_filename(self, query: str | None = None, extension: str = ".md") -> str:
+        """
+        Generate filename for report using template.
+        Args:
+            query: Optional query string for hash generation
+            extension: File extension (e.g., ".md", ".html")
+        Returns:
+            Generated filename
+        """
+        # Generate timestamp
+        timestamp = datetime.now().strftime("%Y%m%d_%H%M%S")
+        # Generate query hash if query provided
+        query_hash = ""
+        if query:
+            query_hash = hashlib.md5(query.encode()).hexdigest()[:8]
+        # Generate date
+        date = datetime.now().strftime("%Y-%m-%d")
+        # Replace template placeholders
+        filename = self.filename_template
+        filename = filename.replace("{timestamp}", timestamp)
+        filename = filename.replace("{query_hash}", query_hash)
+        filename = filename.replace("{date}", date)
+        # Ensure correct extension
+        if not filename.endswith(extension):
+            # Remove existing extension if present
+            if "." in filename:
+                filename = filename.rsplit(".", 1)[0]
+            filename += extension
+        return filename
+    def save_report(
+        self,
+        report_content: str,
+        query: str | None = None,
+        filename: str | None = None,
+    ) -> str:
+        """
+        Save a report to a file.
+        Args:
+            report_content: The report content (markdown string)
+            query: Optional query string for filename generation
+            filename: Optional custom filename. If None, generates from template.
+        Returns:
+            Path to saved file
+        Raises:
+            ConfigurationError: If file saving is disabled or fails
+        """
+        if not self.enabled:
+            logger.debug("File saving disabled, skipping")
+            raise ConfigurationError("Report file saving is disabled")
+        if not report_content or not report_content.strip():
+            raise ValueError("Report content cannot be empty")
+        # Generate filename if not provided
+        if not filename:
+            filename = self._generate_filename(query=query, extension=".md")
+        # Ensure filename is safe
+        filename = self._sanitize_filename(filename)
+        # Build full file path
+        file_path = self.output_directory / filename
+        try:
+            # Write file
+            with open(file_path, "w", encoding="utf-8") as f:
+                f.write(report_content)
+            logger.info(
+                "Report saved to file",
+                path=str(file_path),
+                size=len(report_content),
+                query=query[:50] if query else None,
+            )
+            return str(file_path)
+        except Exception as e:
+            logger.error("Failed to save report to file", error=str(e), path=str(file_path))
+            raise ConfigurationError(f"Failed to save report to file: {e}") from e
+    def save_report_multiple_formats(
+        self,
+        report_content: str,
+        query: str | None = None,
+    ) -> dict[str, str]:
+        """
+        Save a report in multiple formats.
+        Args:
+            report_content: The report content (markdown string)
+            query: Optional query string for filename generation
+        Returns:
+            Dictionary mapping format to file path (e.g., {"md": "/path/to/report.md"})
+        Raises:
+            ConfigurationError: If file saving is disabled or fails
+        """
+        if not self.enabled:
+            logger.debug("File saving disabled, skipping")
+            raise ConfigurationError("Report file saving is disabled")
+        saved_files: dict[str, str] = {}
+        # Always save markdown
+        md_path = self.save_report(report_content, query=query, filename=None)
+        saved_files["md"] = md_path
+        # Save additional formats based on file_format setting
+        if self.file_format == "md_html":
+            # TODO: Implement HTML conversion
+            logger.warning("HTML format not yet implemented, saving markdown only")
+        elif self.file_format == "md_pdf":
+            # TODO: Implement PDF conversion
+            logger.warning("PDF format not yet implemented, saving markdown only")
+        return saved_files
+    def _sanitize_filename(self, filename: str) -> str:
+        """
+        Sanitize filename to remove unsafe characters.
+        Args:
+            filename: Original filename
+        Returns:
+            Sanitized filename
+        """
+        # Remove or replace unsafe characters
+        unsafe_chars = '<>:"/\\|?*'
+        sanitized = filename
+        for char in unsafe_chars:
+            sanitized = sanitized.replace(char, "_")
+        # Limit length
+        if len(sanitized) > 200:
+            name, ext = sanitized.rsplit(".", 1) if "." in sanitized else (sanitized, "")
+            sanitized = name[:190] + ext
+        return sanitized
+    def cleanup_old_files(self, max_age_days: int = 7) -> int:
+        """
+        Clean up old report files.
+        Args:
+            max_age_days: Maximum age in days for files to keep
+        Returns:
+            Number of files deleted
+        """
+        if not self.output_directory.exists():
+            return 0
+        deleted_count = 0
+        cutoff_time = datetime.now().timestamp() - (max_age_days * 24 * 60 * 60)
+        try:
+            for file_path in self.output_directory.iterdir():
+                if file_path.is_file() and file_path.stat().st_mtime < cutoff_time:
+                    try:
+                        file_path.unlink()
+                        deleted_count += 1
+                    except Exception as e:
+                        logger.warning("Failed to delete old file", path=str(file_path), error=str(e))
+            if deleted_count > 0:
+                logger.info("Cleaned up old report files", deleted=deleted_count, max_age_days=max_age_days)
+        except Exception as e:
+            logger.error("Failed to cleanup old files", error=str(e))
+        return deleted_count
+def get_report_file_service() -> ReportFileService:
+    """
+    Get or create a ReportFileService instance (singleton pattern).
+    Returns:
+        ReportFileService instance
+    """
+    # Use lru_cache for singleton pattern
+    from functools import lru_cache
+    @lru_cache(maxsize=1)
+    def _get_service() -> ReportFileService:
+        return ReportFileService()
+    return _get_service()

src/tools/crawl_adapter.py CHANGED Viewed

	@@ -64,3 +64,5 @@ async def crawl_website(starting_url: str) -> str:
64
65
66


64
65
66
67	+
68	+

src/tools/searchxng_web_search.py CHANGED Viewed

	@@ -118,3 +118,5 @@ class SearchXNGWebSearchTool:
118	raise SearchError(f"SearchXNG search failed: {e}") from e
119
120


118	raise SearchError(f"SearchXNG search failed: {e}") from e
119
120
121	+
122	+

src/tools/serper_web_search.py CHANGED Viewed

	@@ -118,3 +118,5 @@ class SerperWebSearchTool:
118	raise SearchError(f"Serper search failed: {e}") from e
119
120


118	raise SearchError(f"Serper search failed: {e}") from e
119
120
121	+
122	+

src/tools/vendored/__init__.py CHANGED Viewed

	@@ -25,3 +25,5 @@ __all__ = [
25	]
26
27


25	]
26
27
28	+
29	+

src/tools/vendored/searchxng_client.py CHANGED Viewed

	@@ -97,3 +97,5 @@ class SearchXNGClient:
97	raise SearchError(f"SearchXNG search failed: {e}") from e
98
99


97	raise SearchError(f"SearchXNG search failed: {e}") from e
98
99
100	+
101	+

src/tools/vendored/serper_client.py CHANGED Viewed

	@@ -93,3 +93,5 @@ class SerperClient:
93	raise SearchError(f"Serper search failed: {e}") from e
94
95


93	raise SearchError(f"Serper search failed: {e}") from e
94
95
96	+
97	+

src/tools/vendored/web_search_core.py CHANGED Viewed

	@@ -204,3 +204,5 @@ def is_valid_url(url: str) -> bool:
204	return True
205
206


204	return True
205
206
207	+
208	+

src/tools/web_search_factory.py CHANGED Viewed

	@@ -72,3 +72,5 @@ def create_web_search_tool() -> SearchTool \| None:
72	return None
73
74


72	return None
73
74
75	+
76	+

src/utils/config.py CHANGED Viewed

@@ -164,6 +164,24 @@ class Settings(BaseSettings):
         description="Modal GPU type for TTS (T4, A10, A100, L4, L40S). None uses default T4.",
     )
     @property
     def modal_available(self) -> bool:
         """Check if Modal credentials are configured."""

         description="Modal GPU type for TTS (T4, A10, A100, L4, L40S). None uses default T4.",
     )
+    # Report File Output Configuration
+    save_reports_to_file: bool = Field(
+        default=True,
+        description="Save generated reports to files (enables file downloads in Gradio)",
+    )
+    report_output_directory: str | None = Field(
+        default=None,
+        description="Directory to save report files. If None, uses system temp directory.",
+    )
+    report_file_format: Literal["md", "md_html", "md_pdf"] = Field(
+        default="md",
+        description="File format(s) to save reports in. 'md' saves only markdown, others save multiple formats.",
+    )
+    report_filename_template: str = Field(
+        default="report_{timestamp}_{query_hash}.md",
+        description="Template for report filenames. Supports {timestamp}, {query_hash}, {date} placeholders.",
+    )
     @property
     def modal_available(self) -> bool:
         """Check if Modal credentials are configured."""

tests/unit/middleware/__init__.py CHANGED Viewed

	@@ -18,6 +18,8 @@
18
19
20


21
22
23


18
19
20
21	+
22	+
23
24
25

tests/unit/middleware/test_budget_tracker_phase7.py CHANGED Viewed

	@@ -176,6 +176,8 @@ class TestIterationTokenTracking:
176
177
178


179
180
181


176
177
178
179	+
180	+
181
182
183

tests/unit/middleware/test_state_machine.py CHANGED Viewed

	@@ -373,6 +373,8 @@ class TestContextVarIsolation:
373
374
375


376
377
378


373
374
375
376	+
377	+
378
379
380

tests/unit/middleware/test_workflow_manager.py CHANGED Viewed

	@@ -303,6 +303,8 @@ class TestWorkflowManager:
303
304
305


306
307
308


303
304
305
306	+
307	+
308
309
310