ReasonIR-8B: Achieve State-of-the-Art Reasoning with This New Retriever Model

Want to boost your reasoning tasks with cutting-edge AI? ReasonIR-8B, a groundbreaking retriever model, is here to revolutionize retrieval-augmented generation (RAG). This article dives deep into ReasonIR, covering its features, usage, and how it achieves state-of-the-art performance in reasoning-intensive retrieval.

ReasonIR Logo

What is ReasonIR-8B?

ReasonIR-8B is the first retriever specifically trained for general reasoning tasks. It achieves state-of-the-art retrieval performance on BRIGHT, a challenging reasoning-intensive retrieval benchmark. Moreover, when integrated into applications using retrieval-augmented generation (RAG), ReasonIR-8B provides great improvements on complex benchmarks like MMLU and GPQA.

Why Use ReasonIR for Reasoning Tasks?

Superior Reasoning Performance: Trained specifically for reasoning, ReasonIR-8B outperforms general-purpose retrievers in complex tasks. By focusing on reasoning-intensive retrieval, it sets itself apart.
RAG Integration: ReasonIR excels when used for Retrieval-Augmented Generation, significantly boosting performance on MMLU and GPQA benchmarks.
State-of-the-Art Results: ReasonIR is currently the top-performing retriever on the BRIGHT benchmark.

Getting Started with ReasonIR: Installation & Setup

Ready to get started? Here's how to set up ReasonIR for your projects. First, ensure you have the necessary libraries installed.

Install Transformers: Make sure you have a recent version of the transformers library.
```
pip install transformers>=4.47.0
```
Load the Model and Tokenizer: Use the following code snippet to load the ReasonIR-8B model:
```
from transformers import AutoModel, AutoTokenizer
model = AutoModel.from_pretrained("reasonir/ReasonIR-8B", torch_dtype="auto", trust_remote_code=True)
```
- trust_remote_code=True is crucial to load the custom bidirectional encoding architecture.
- torch_dtype="auto" activates bf16 precision for faster and more efficient computation. If you don't include this, the model will default to fp32.
Move the Model to GPU (Optional): For faster processing, move the model to your GPU:
```
model = model.to("cuda")
model.eval()
```

ReasonIR Usage Examples

Here's how to use ReasonIR for encoding queries and documents:

Define Query and Document:

query = "The quick brown fox jumps over the lazy dog."
document = "The quick brown fox jumps over the lazy dog."
query_instruction = ""
doc_instruction = ""

Encode and Calculate Similarity:

query_emb = model.encode(query, instruction=query_instruction)
doc_emb = model.encode(document, instruction=doc_instruction)
sim = query_emb @ doc_emb.T

Using Sentence Transformers with ReasonIR

You can also leverage ReasonIR with Sentence Transformers. This allows for simpler integration with existing workflows, but comes with a slight precision trade-off.

Install Sentence Transformers: If you don't have it, install the library:
```
pip install sentence-transformers
```

Load the Model:

from sentence_transformers import SentenceTransformer
model_kwargs = {"torch_dtype": "auto"}
model = SentenceTransformer("reasonir/ReasonIR-8B", trust_remote_code=True, model_kwargs=model_kwargs)
model.set_pooling_include_prompt(include_prompt=False) # exclude the prompt during pooling

Encode and Calculate Similarity:

query = "The quick brown fox jumps over the lazy dog."
document = "The quick brown fox jumps over the lazy dog."
query_instruction = ""
doc_instruction = ""
query_emb = model.encode(query, instruction=query_instruction)
doc_emb = model.encode(document, instruction=doc_instruction)
sim = query_emb @ doc_emb.T

NOTE: there seems to be some very slight floating point discrepancy when using the SentenceTransformer (because it does not support bf16 precision), though it should not affect the results in general.

Diving Deeper: Evaluations and Training the Retriever Model

For advanced users, ReasonIR offers resources for evaluations, synthetic data generation, and retriever training.

Evaluations: Follow the instructions in the evaluation section of the repository to assess ReasonIR's performance on various tasks.
Synthetic Data Generation: Create synthetic data for training using the provided guidelines in the synthetic_data_generation section.
Retriever Training: Train your own custom ReasonIR models following the training instructions.

License and Acknowledgments

ReasonIR is released under the FAIR Noncommercial Research License. The developers express their gratitude to the open-source repositories, BRIGHT, GritLM, and MassiveDS.

ReasonIR: The Future of Reasoning-Intensive Retrieval

ReasonIR-8B represents a significant leap forward in retriever technology, especially for reasoning-heavy applications. By implementing it, you can drastically enhance the performance of RAG systems and unlock new insights from complex data. Give ReasonIR a try and experience the advantages of a specifically trained retriever for general reasoning tasks.

ReasonIR-8B: Achieve State-of-the-Art Reasoning with This New Retriever Model

ReasonIR Logo

What is ReasonIR-8B?

Why Use ReasonIR for Reasoning Tasks?

Superior Reasoning Performance: Trained specifically for reasoning, ReasonIR-8B outperforms general-purpose retrievers in complex tasks. By focusing on reasoning-intensive retrieval, it sets itself apart.
RAG Integration: ReasonIR excels when used for Retrieval-Augmented Generation, significantly boosting performance on MMLU and GPQA benchmarks.
State-of-the-Art Results: ReasonIR is currently the top-performing retriever on the BRIGHT benchmark.

Getting Started with ReasonIR: Installation & Setup

Ready to get started? Here's how to set up ReasonIR for your projects. First, ensure you have the necessary libraries installed.

Install Transformers: Make sure you have a recent version of the transformers library.
```
pip install transformers>=4.47.0
```
Load the Model and Tokenizer: Use the following code snippet to load the ReasonIR-8B model:
```
from transformers import AutoModel, AutoTokenizer
model = AutoModel.from_pretrained("reasonir/ReasonIR-8B", torch_dtype="auto", trust_remote_code=True)
```
- trust_remote_code=True is crucial to load the custom bidirectional encoding architecture.
- torch_dtype="auto" activates bf16 precision for faster and more efficient computation. If you don't include this, the model will default to fp32.
Move the Model to GPU (Optional): For faster processing, move the model to your GPU:
```
model = model.to("cuda")
model.eval()
```

ReasonIR Usage Examples

Here's how to use ReasonIR for encoding queries and documents:

Define Query and Document:

query = "The quick brown fox jumps over the lazy dog."
document = "The quick brown fox jumps over the lazy dog."
query_instruction = ""
doc_instruction = ""

Encode and Calculate Similarity:

query_emb = model.encode(query, instruction=query_instruction)
doc_emb = model.encode(document, instruction=doc_instruction)
sim = query_emb @ doc_emb.T

Using Sentence Transformers with ReasonIR

You can also leverage ReasonIR with Sentence Transformers. This allows for simpler integration with existing workflows, but comes with a slight precision trade-off.

Install Sentence Transformers: If you don't have it, install the library:
```
pip install sentence-transformers
```

Load the Model:

from sentence_transformers import SentenceTransformer
model_kwargs = {"torch_dtype": "auto"}
model = SentenceTransformer("reasonir/ReasonIR-8B", trust_remote_code=True, model_kwargs=model_kwargs)
model.set_pooling_include_prompt(include_prompt=False) # exclude the prompt during pooling

Encode and Calculate Similarity:

query = "The quick brown fox jumps over the lazy dog."
document = "The quick brown fox jumps over the lazy dog."
query_instruction = ""
doc_instruction = ""
query_emb = model.encode(query, instruction=query_instruction)
doc_emb = model.encode(document, instruction=doc_instruction)
sim = query_emb @ doc_emb.T

Diving Deeper: Evaluations and Training the Retriever Model

For advanced users, ReasonIR offers resources for evaluations, synthetic data generation, and retriever training.

Evaluations: Follow the instructions in the evaluation section of the repository to assess ReasonIR's performance on various tasks.
Synthetic Data Generation: Create synthetic data for training using the provided guidelines in the synthetic_data_generation section.
Retriever Training: Train your own custom ReasonIR models following the training instructions.

License and Acknowledgments

ReasonIR is released under the FAIR Noncommercial Research License. The developers express their gratitude to the open-source repositories, BRIGHT, GritLM, and MassiveDS.