llama-index-integrations/postprocessor/llama-index-postprocessor-siliconflow-rerank/README.md
SiliconCloud provides cost-effective GenAI services based on an excellent open-source foundation model. introduction: https://docs.siliconflow.cn/introduction
As a one-stop cloud service platform that integrates top large models, SiliconCloud is committed to providing developers with faster, cheaper, more comprehensive, and smoother model APIs.
SiliconCloud has been listed on Qwen2.5-72B, DeepSeek-V2.5, Qwen2, InternLM2.5-20B-Chat, BCE, BGE, SenseVoice-Small, Llama-3.1, FLUX.1, DeepSeek-Coder-V2, SD3 Medium, GLM-4-9B-Chat, A variety of open-source large language models, image generation models, code generation models, vector and reordering models, and multimodal large models, including InstantID.
Among them, Qwen 2.5 (7B), Llama 3.1 (8B) and other large model APIs are free to use, so that developers and product managers do not need to worry about the computing power costs caused by the R&D stage and large-scale promotion, and realize "token freedom".
Provide out-of-the-box large model inference acceleration services to bring a more efficient user experience to your GenAI applications.
import os
from llama_index.postprocessor.siliconflow_rerank import SiliconFlowRerank
reranker = SiliconFlowRerank(
model="BAAI/bge-reranker-v2-m3",
api_key=os.getenv("SILICONFLOW_API_KEY"),
top_n=4,
)
nodes = [
...
]
response = reranker.postprocess_nodes(nodes=nodes, query_str="...")
print(response)