# Researcher **Repository Path**: local-scholar/Researcher ## Basic Information - **Project Name**: Researcher - **Description**: No description available - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: main - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2025-11-24 - **Last Updated**: 2026-04-18 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # AI-powered Research and Review Ecosystem [ICLR 2025, ACL 2025]
## 🚀 Getting Started
### Installation
```bash
pip install ai_researcher
```
### Using CycleResearcher
```python
# Import necessary libraries
from ai_researcher import CycleResearcher
from ai_researcher.utils import print_paper_summary
# Initialize CycleResearcher with default 12B model
researcher = CycleResearcher(model_size="12B")
# Load references from BibTeX file
with open('cycleresearcher_references.bib', 'r') as f:
references_content = f.read()
# Generate paper with specific references
generated_papers = researcher.generate_paper(
topic="AI Researcher",
references=references_content,
n=1 # Generate a single paper
)
# Print summary of generated paper
print_paper_summary(generated_papers[0])
```
### Using CycleReviewer
```python
# Import necessary libraries
from ai_researcher import CycleReviewer
# Initialize CycleReviewer with default 8B model
reviewer = CycleReviewer(model_size="8B")
# Review a paper (assuming paper_text contains the paper content)
review_results = reviewer.evaluate(paper_text)
# Print review results
print(f"Average score: {review_results[0]['avg_rating']}")
print(f"Decision: {review_results[0]['paper_decision']}")
```
### Using DeepReviewer
```python
# Import necessary libraries
from ai_researcher import DeepReviewer
# Initialize DeepReviewer with 14B model
deep_reviewer = DeepReviewer(model_size="14B")
# Review a paper with multiple simulated reviewers in Standard Mode
review_results = deep_reviewer.evaluate(
paper_text,
mode="Standard Mode", # Options: "Fast Mode", "Standard Mode", "Best Mode"
reviewer_num=4 # Simulate 4 different reviewers
)
# Print review results
for i, review in enumerate(review_results[0]['reviews']):
print(f"Reviewer {i+1} Rating: {review.get('rating', 'N/A')}")
print(f"Reviewer {i+1} Summary: {review.get('summary', 'N/A')[:100]}...")
```
## 📊 Model Evaluation
CycleResearcher-12B achieves an average score of 5.36, approaching the 5.69 average for conference-accepted papers and surpassing AI Scientist's score of 4.31.
CycleReviewer outperforms both proprietary systems and human experts with a 48.77% reduction in Proxy MSE and a 26.89% reduction in Proxy MAE compared to human reviewers. With a decision accuracy of 74.24%, our model demonstrates a significant lead over other closed-source systems.
DeepReviewer provides multi-perspective simulation with self-verification, enabling more comprehensive and balanced feedback. It offers three distinct review modes: Fast Mode, Standard Mode, and Best Mode to accommodate different use cases.
| Model Name | Pre-training Language Model | HF Link |
|---|---|---|
| CycleResearcher-ML-12B | Mistral-Nemo-Instruct-2407 | 🤗 link |
| CycleResearcher-ML-72B | Qwen2.5-72B-Instruct | 🤗 link |
| CycleResearcher-ML-123B | Mistral-Large-2 | 🤗 link |
| Model Name | Pre-training Language Model | HF Link |
|---|---|---|
| CycleReviewer-ML-Llama3.1-8B | Llama3.1-8B-Instruct | 🤗 link |
| CycleReviewer-ML-Llama3.1-70B | Llama3.1-70B-Instruct | 🤗 link |
| CycleReviewer-ML-Pro-123B | Mistral-Large-2 | 🤗 link |
| Model Name | Parameters | HF Link |
|---|---|---|
| DeepReviewer-7B | 7B | 🤗 link |
| DeepReviewer-14B | 14B | 🤗 link |
| Dataset Name | Train Data | Test Data | Description | HF Link |
|---|---|---|---|---|
| Review-5K | 4,189 | 781 | Peer review dataset for CycleReviewer training | 🤗 link |
| Research-14K | 12,696 | 802 | Research paper dataset for CycleResearcher training | 🤗 link |
| DeepReview-13K | 13,378 | 1,286 | Multi-perspective review dataset for DeepReviewer training | 🤗 link |
Quick review generation for rapid feedback. Provides essential evaluation without multi-reviewer simulation.
Default mode that simulates multiple reviewers and includes self-verification to ensure reliable assessments.
Most comprehensive mode with background knowledge search, multi-reviewer simulation, and self-verification for in-depth analysis.