Algorithmic Research Group

About Us

We're dedicated to pushing the boundaries of scientific exploration through safe and responsible AI. Our mission is to create advanced AI models and agents that accelerate scientific progress.

Featured Open Source Repositories

ML Research Benchmark arXiv

  • ML-Research-Agent: A baseline agent for ML Research Benchmark. This agent provides a foundation for comparing and evaluating machine learning research and development tasks that agents can perform.

  • ML-Research-Agent-Tasks: Tasks for ML Research Benchmark, a benchmark designed to evaluate the capabilities of AI agents in accelerating AI research and development.

  • ML-Research-Agent-Evals: Agent-Eval is a library for evaluating the performance of an agent on ML Research Benchmark tasks.

ARIA

  • ARIA: ARIA Benchmarks is a suite of closed-book benchmarks designed to assess an LLM's knowledge and understanding of machine learning research and methodologies.

Agent States

  • Agent-States: Agent States is a library designed to manage the state and decision-making processes of AI agents.

Featured Open Source Datasets

  • ArXivDLInstruct: ArXivDLInstruct is a dataset for instruction tuning on Python research code, intended for pretraining and fine-tuning language models on code generation tasks.

  • ArXiv Research Code: ArtifactAI/arxiv_research_code contains over 21.8GB of source code files referenced strictly in ArXiv papers. It serves as a curated dataset for Code LLMs.

  • ArXiv Python_Research_Code: AlgorithmicResearchGroup/arxiv_python_research_code contains over 4.13GB of source code files referenced strictly in ArXiv papers. It serves as a curated dataset for Code LLMs.

  • ArXiv C++ Research_Code: ArtifactAI/arxiv_python_research_code contains over 10.6GB of source code files referenced strictly in ArXiv papers. It serves as a curated dataset for Code LLMs.

Popular repositories

  1. ML-Research-Agent (Public)

    A baseline agent for ML Research Benchmark. This agent provides a foundation for comparing and evaluating machine learning research and development tasks that agents can perform.

    Python 7 1

  2. ML-Research-Agent-Public (Public)

    A public, general-purpose agent for ML Research Benchmark. This agent provides a foundation for comparing and evaluating machine learning research and development tasks that agents can perform.

    Python 2 1

  3. ML-Research-Agent-Tasks (Public)

    Tasks for ML Research Benchmark, a benchmark designed to evaluate the capabilities of AI agents in accelerating AI research and development.

    Python 1

  4. ML-Research-Agent-Baselines (Public)

    Python 1

  5. Agent-States (Public)

    The AI Agent State Library is a library designed to manage the state and decision-making processes of AI agents.

    Python 1

  6. Doc-Downloader (Public)

    Doc Downloader is a library for downloading web pages from a list of URLs in parallel.

    Python
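
The Doc-Downloader description above mentions downloading web pages from a list of URLs in parallel. As a rough illustration of that general idea only, here is a minimal sketch using the Python standard library. It is not Doc-Downloader's actual API: the names `fetch`, `download_all`, `fetch_fn`, and `max_workers` are hypothetical and chosen for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from urllib.request import urlopen


def fetch(url, timeout=10):
    """Download one page and return its body as bytes (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read()


def download_all(urls, fetch_fn=fetch, max_workers=8):
    """Fetch every URL concurrently.

    Returns a dict mapping each URL to its bytes, or to the exception
    raised while fetching it, so one failure does not abort the batch.
    """
    urls = list(urls)  # allow any iterable, and iterate it twice safely

    def safe_fetch(url):
        try:
            return fetch_fn(url)
        except Exception as exc:  # record the failure and keep going
            return exc

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so zip pairs each URL with its result.
        return dict(zip(urls, pool.map(safe_fetch, urls)))
```

Threads suit this kind of work because each download spends most of its time waiting on the network. Passing a different `fetch_fn` makes it possible to swap in another HTTP client or to test the function without network access.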
