examples/open_deep_research/README.md
Welcome to this open replication of OpenAI's Deep Research! This agent attempts to replicate OpenAI's model and achieve similar performance on research tasks.
Read more about this implementation's goal and methods in our blog post.
This agent achieves 55% pass@1 on the GAIA validation set, compared to 67% for the original Deep Research.
To get started, follow the steps below:
git clone https://github.com/huggingface/smolagents.git
cd smolagents/examples/open_deep_research
Run the following command to install the required dependencies from the requirements.txt file:
pip install -r requirements.txt
smolagentspip install -e ../../.[dev]
The agent uses the GoogleSearchTool for web search, which requires an environment variable with the corresponding API key, based on the selected provider:
SERPAPI_API_KEY for SerpApi: Sign up here to get a keySERPER_API_KEY for Serper: Sign up here to get a keyDepending on the model you want to use, you may need to set environment variables.
For example, to use the default o1 model, you need to set the OPENAI_API_KEY environment variable.
Sign up here to get a key.
[!WARNING] The use of the default
o1model is restricted to tier-3 access: https://help.openai.com/en/articles/10362446-api-access-to-o1-and-o3-mini
Then you're good to go! Run the run.py script, as in:
python run.py --model-id "o1" "Your question here!"
The data used in our submissions to GAIA was augmented in this way:
This process was done manually but could be automatized.
After processing, the annotated was uploaded to a new dataset. You need to request access (granted instantly).