Example Research Reports¶
Explore research reports generated by MAESTRO using various Large Language Models, showcasing quality, style, and capabilities across different model sizes and deployment options.
Overview¶
These example reports demonstrate:
- Model Capabilities - Quality differences between model sizes
- Writing Styles - Various tones and formats achieved through prompting
- Research Depth - How models handle complex topics
- Context Handling - Long-form content generation abilities
- Deployment Options - Self-hosted vs cloud-based models
Note on Report Quality
These reports include examples of mistakes made by various models. We are improving the framework to catch common failure modes, but model behavior is unpredictable and no system can account for every eventuality or work equally well with every model. If the model you're using keeps producing poor results, try a different model.
Cloud-Based Model Combinations¶
Reports generated using API-based models (OpenAI, Google Gemini) in various combinations for Fast, Mid, and Intelligent tiers.
- GPT-4o-mini + Gemini 2.5 Flash
  - Fast: gpt-4o-mini
  - Mid: gemini-2.5-flash-lite
  - Smart: gemini-2.5-flash
  - Example Reports:
- GPT-5 Nano + Mini + Full
  - Fast: gpt-5-nano
  - Mid: gpt-5-mini
  - Smart: gpt-5
  - Example Reports:
Self-Hosted Open Source Models¶
Reports generated using locally deployed models via vLLM/SGLang, providing complete data privacy and customization.
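Both vLLM and SGLang expose an OpenAI-compatible API, so MAESTRO (or any OpenAI-style client) only needs the local server's base URL and a model name. The snippet below is a minimal sketch of that pattern; the port, base URL, and model identifier are placeholders for whatever your own deployment uses, not values required by MAESTRO.

```python
# Minimal sketch: calling a locally served model through the OpenAI-compatible
# endpoint that vLLM and SGLang both provide. The base_url, api_key, and model
# name are placeholders -- substitute your own deployment's values.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # local vLLM/SGLang server (placeholder)
    api_key="not-needed",                 # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="your-local-model",  # placeholder model identifier
    messages=[{"role": "user", "content": "Give a two-sentence overview of retrieval-augmented generation."}],
    max_tokens=200,
)
print(response.choices[0].message.content)
```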
Large Models (70B+ Parameters)¶
- One of the most capable open models, excellent for complex research.
  Highlights:
  - Superior reasoning and analysis
  - Excellent context handling
  - Strong technical writing
- The largest open model tested, with exceptional comprehension.
  Highlights:
  - Best-in-class synthesis
  - Creative writing excellence
  - Deep technical analysis
Medium Models (20-34B Parameters)¶
- Excellent balance of quality and performance.
  Highlights:
  - Fast inference speed
  - Strong general capabilities
  - Good context retention
- Specialized variant with unique strengths.
  Highlights:
  - Strong economic analysis
  - Creative content generation
  - Policy research expertise
- Google's model with domain-specific strengths.
  Highlights:
  - Academic writing
  - Business strategy
  - Professional tone
- Smaller but efficient for focused tasks.
  Highlights:
  - Fast inference
  - Good for summaries
  - Psychology/business topics
Research Styles¶
Explore reports organized by writing style and target audience.
- Scholarly papers with citations, methodology sections, and technical depth.
  Best for: Research papers, technical documentation, academic analysis
- Accessible explanations of complex topics for general audiences.
  Best for: Blog posts, magazine articles, educational content
- Executive summaries, market analyses, and strategic recommendations.
  Best for: Business reports, market research, strategy documents
- Storytelling approaches including travel writing and fictional narratives.
  Best for: Travel guides, creative content, narrative reports
Performance Observations¶
Local Model Insights¶
Context Usage Patterns
The above reports were generated using the following context settings in the Settings → Research Tab:
Context Settings Tested:
- Planning: 100,000 - 150,000 tokens
- Writing preview: 20,000 - 30,000 tokens
- Note content: 10,000 - 25,000 tokens
- Writing agent: 200,000 - 250,000 tokens
KV Cache Observations:
- Normal usage: ~25% cache utilization with 25 concurrent requests
- Peak usage: 80% observed with a 100K-token context (multiple reports running at the same time)
- Extreme case: 97% with 12 concurrent requests (also with multiple reports running simultaneously)
- Most users are unlikely to reach these limits during normal operation (a rough sizing sketch follows below)
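A rough way to reason about these figures: KV cache demand grows with the total number of tokens held in flight across all concurrent requests. The sketch below estimates cache size from illustrative model dimensions; the layer count, KV-head count, head size, and precision are assumptions for a 70B-class model with grouped-query attention, not measurements of any model listed above.

```python
# Back-of-the-envelope KV cache estimate. All dimensions are illustrative
# assumptions (roughly a 70B-class model with grouped-query attention),
# not measurements of any model referenced on this page.
def kv_cache_gib(tokens_in_flight: int,
                 n_layers: int = 80,
                 n_kv_heads: int = 8,
                 head_dim: int = 128,
                 bytes_per_value: int = 2) -> float:
    """Approximate KV cache size in GiB for a given number of cached tokens."""
    per_token_bytes = 2 * n_layers * n_kv_heads * head_dim * bytes_per_value  # keys + values
    return tokens_in_flight * per_token_bytes / (1024 ** 3)

# Example: 25 concurrent requests each holding ~20K tokens of context.
print(f"{kv_cache_gib(25 * 20_000):.1f} GiB")  # ~152.6 GiB with these assumed dimensions
```

Whether a given utilization percentage is actually reached then depends on the total KV budget your inference server allocates on the GPU.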
Model-Specific Issues
- Qwen 3 30B-A3B: Issues generating valid structured outputs
- GPT-OSS 20B: Instruction-following problems, especially in non-traditional reports
Cloud vs Local Comparison¶
| Aspect | Cloud Models | Local Models |
|---|---|---|
| Privacy | Data sent to provider | Complete data control |
| Cost | Per-token pricing | One-time hardware cost |
| Latency | Faster in most cases | Depends on hardware |
| Availability | Subject to rate limits | Always available |
| Model Selection | Proprietary and open-weight models | Any open model |
| Maintenance | Provider-managed | Self-managed |
Getting Started¶
- Choose Deployment Type:
  - Cloud: Quick setup, no hardware required
  - Local: Privacy-focused, requires a GPU
- Select Model Tier:
  - Fast: Quick responses, simple tasks
  - Mid: Balanced performance
  - Intelligent: Complex analysis
- Configure in Settings:
  - Navigate to Settings → AI Config
  - Select provider and models
  - Test the configuration (a quick connectivity sketch follows this list)
- Start Researching:
  - Use example prompts as inspiration
  - Adjust parameters for your needs
  - Experiment with different models
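Before launching a full research run, it can help to confirm that each configured model actually responds. The following sketch uses the OpenAI Python client as a generic connectivity check; the base URLs, API keys, and model names are placeholders for your own provider settings and are not part of MAESTRO's API.

```python
# Generic connectivity check for whichever models you configured.
# All base URLs, keys, and model names below are placeholders.
from openai import OpenAI

def check_model(base_url: str, api_key: str, model: str) -> None:
    """Send a one-line prompt and print the reply to confirm the model answers."""
    client = OpenAI(base_url=base_url, api_key=api_key)
    reply = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": "Reply with the single word: ok"}],
        max_tokens=5,
    )
    print(f"{model} -> {reply.choices[0].message.content.strip()}")

# Example: a fast-tier cloud model and a locally served model.
check_model("https://api.openai.com/v1", "YOUR_OPENAI_KEY", "gpt-4o-mini")
check_model("http://localhost:8000/v1", "not-needed", "your-local-model")
```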