Posts on Artificial Intelligence
-
AI in 2024: Year in Review and Predictions for 2025
The past year has been transformative for artificial intelligence, marked by breakthrough innovations, emerging regulations, and a shift toward practical AI tools that enhance productivity. As we look ahead to 2025, let's review the major developments of 2024 and explore what th...
-
Is the EU Falling Behind in the AI Race?
The recent announcement that Meta's Llama 3.2 Vision models won't be available in the European Union has reignited discussions about the impact of EU regulations on AI innovation and accessibility. This development joins a growing list of AI technologies from major tech companie...
-
Build an Advanced RAG App: Query Routing
Conclusion: Query Routing is a great step towards a more advanced RAG application. It lets us set up a base for a more complex system, where our app can better plan how to answer questions. Also, Query Routing can be the glue that ties together other advance...
-
Build an Advanced RAG App: Query Rewriting
The new query now matches the chunk of information I wanted my answer to come from, giving the LLM a better chance of producing a much better response to my question. Conclusion: We have taken our first step out of basic RAG pipelines and into Advanced RAG. Query Rewriting i...
-
How to build a basic RAG app
Common problems and pitfalls: As the title implies, this solution is a basic or naïve RAG implementation. It will empower your application to make the most out of the LLM it’s using and your data. But it won’t work for all cases. These are just some of the most common problems wi...
-
How to use LLMs: Summarize long documents
And that’s it! You now have a short summary of the most important points of a large document. But before you start processing your whole documentation, there are a few important notes you need to consider: This MapReduce method might not be less expensive than using an LLM with...
-
Understanding LLMs: Mixture of Experts
Another paper, named Switch Transformers, looked at techniques to reduce communication costs between devices and reduce training instabilities. To optimize parallelism, they proposed using a single-expert approach and reduce the capacity factor to almost all tokens being equall...
-
What to Expect for AI in 2024?
2023 was a great year for AI. Large Language Models were already in the spotlight for both users and businesses. ChatGPT had just been released in late 2022 and was taking the world by storm. Still, 2023 brought more rapid change in the field than we could have imagined. Thi...
-
How to supercharge your LLM with Langchain Agents
Tools and toolkits: Tools are functions that perform actions on behalf of the LLM. An agent gets a list of tools and can request to use one, several, or none of them. The Agent Executor executes the requested tools and feeds the result back to the Agent. An examp...
-
Maximizing the Potential of LLMs: Using Vector Databases
What do vector databases do? A vector database stores and indexes vector embeddings, which makes it fast to retrieve vectors and to look for similar ones. Similarity search: We can find similar vectors by calculating a vector's distance to all other vectors. The nea...
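The similarity search mentioned in the excerpt above can be sketched in a few lines. This is only a minimal brute-force illustration, assuming Euclidean distance and tiny hand-written vectors; the function names `euclidean_distance` and `nearest_neighbors` are hypothetical and do not come from the post or from any vector database library.

```python
import math


def euclidean_distance(a, b):
    """Straight-line distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def nearest_neighbors(query, vectors, k=1):
    """Return the indices of the k stored vectors closest to the query.

    Brute force: the query is compared against every stored vector,
    which is the "distance to all other vectors" approach.
    """
    scored = [(euclidean_distance(query, v), i) for i, v in enumerate(vectors)]
    scored.sort()  # smallest distance first
    return [i for _, i in scored[:k]]


# Toy data (made up for illustration), standing in for real embeddings.
vectors = [[0.0, 0.0], [1.0, 1.0], [5.0, 5.0]]
query = [0.9, 1.2]
closest = nearest_neighbors(query, vectors, k=2)
print(closest)
```

Comparing against every vector costs time proportional to the size of the collection, which is why a vector database builds an index rather than scanning everything on each query.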