Web Fetch Configuration¶
Configure how MAESTRO fetches and processes content from web pages discovered during research.
Overview¶
Web fetching extracts full content from web pages found during searches, enabling MAESTRO to read entire articles beyond search snippets.
Quick Setup¶
- Navigate to Settings → Web Fetch
- Select fetch provider (Original + Jina Fallback recommended)
- Configure Jina settings if using Jina features
- Adjust timeout and options as needed
- Click "Save & Close"
Fetch Providers¶
Original + Jina Fallback (Recommended)¶
Built-in fetcher with automatic Jina fallback for problematic sites.
How it Works:
- First attempts with built-in fetcher (fast)
- On 403 errors or blocks, automatically uses Jina
- Best of both worlds approach
Best For:
- Most use cases
- Balanced speed and reliability
- Automatic handling of difficult sites
Original (Built-in Fetcher)¶
- Fast and free - No API limits or costs
- Limited - Can't handle JavaScript sites, many sites block bot access
- Best for - Static HTML, news articles, documentation
Jina Reader API¶
- Advanced - Handles JavaScript and complex sites
- Requires API key - Get from jina.ai
- Best for - JavaScript apps, sites that block scrapers
Jina Configuration Options¶
Browser Engine¶
- Default - Balanced speed and quality
- Fast - Quick but may miss dynamic content
- Complete - Thorough but slower
Content Format¶
- Markdown (Recommended) - Structured, clean format
- HTML - Raw output for special needs
- Text - Plain text, smallest size
Timeout Settings¶
- Simple sites: 10-15 seconds
- Complex sites: 30 seconds
- Heavy apps: 45-60 seconds
Processing Options¶
- Gather links - Extract all hyperlinks
- Gather images - Collect image URLs
Configuration Tips¶
For Speed¶
- Use Original fetcher
- Short timeouts (10-15s)
- Fast browser engine
For Quality¶
- Use Jina Reader
- Longer timeouts (30-45s)
- Complete browser mode
For Cost Efficiency¶
- Use Original + Jina fallback
- Cache fetched content
Troubleshooting¶
- 403 Errors: Switch to Jina
- Incomplete Content: Increase timeout, use Complete mode
- Slow Fetching: Reduce timeout, use Fast mode
- Format Issues: Try Markdown format, enable link gathering
Provider Comparison¶
Feature | Original | Jina | Original + Fallback |
---|---|---|---|
Speed | Fast | Slow | Variable |
JavaScript | No | Yes | Yes (fallback) |
Cost | Free | Paid | Minimal |
Reliability | medium | high | high |
Setup | None | API Key | API Key |
Next Steps¶
After configuring web fetch:
- Set up Research Parameters
- Test with various website types
- Monitor performance and costs
- Optimize based on usage patterns