TensorOps
Mar 3 · 1 min read
Emerging Architectures of LLM Applications 2025
Watch the full webinar, "Building the Future of AI: Emerging Architectures of LLM Applications in 2025"

TensorOps
Feb 19 · 1 min read
DeepSeek - Open Source Revolution in AI Models - (AI Lover podcast)
The conversation explores the emergence of DeepSeek as a significant player in the AI landscape, particularly in the context of open-source

Miguel Carreira Neves
Feb 13 · 13 min read
DeepSeek-V3 Technical Analysis - MoE, Fine-Grained Quantization, DualPipe, MLA
Analysis of Performance and Technical Innovations. Dive into Mixture of Experts (MoE), Fine-Grained Quantization, DualPipe, MLA and more.

Diogo Gonçalves
Jan 6 · 5 min read
Vector DBs Will Not Save Your RAG
The AI world has collectively fixated on Vector Databases as the holy grail for scalable, accurate information retrieval and synthesis to so

Gad Benram
Dec 25, 2024 · 6 min read
Agents Are Just Long-Running Jobs: A Pragmatic View of an Overhyped AI
[Figure: The Routing Workflow. Source: Anthropic, "Building effective agents"]
Building an “AI agent” sounds exciting: visions of an autonomous...

TensorOps
Dec 8, 2024 · 4 min read
Emerging Architectures of LLM Applications (2025 Update)
The world of AI applications is changing rapidly. Not too long ago, most AI systems were simple: a single model received input, made a...

Gad Benram
Dec 5, 2024 · 5 min read
Faster Than XGBoost: Using Catboost with C++
Integrating machine learning models into production environments often requires a balance between performance, compatibility, and ease of...

Miguel Carreira Neves
Nov 7, 2024 · 6 min read
Contextual Retrieval - Enhancing RAG Performance
Traditional RAG systems cannot maintain context in retrieved information. Contextual Retrieval addresses this by enriching data with context

Diogo Azevedo
Oct 18, 2024 · 3 min read
Deploying LLM Proxy on Google Kubernetes Engine: A Step-by-Step Guide
In our previous post, we explored the concept of an LLM Proxy and its importance in scalable LLM application architectures. In this...


Higor Ribeiro de Oliveira
Oct 17, 2024 · 4 min read
Cohort-Based Forecasting: A Technical Deep Dive
At TensorOps, we specialize in implementing AI solutions that drive business growth. One powerful application of AI in the business...

Clara Gadelho
Oct 14, 2024 · 9 min read
Building AI and LLM Agents from the Ground Up: A Step-by-Step Guide
OpenAI’s vision of creating artificial general intelligence (AGI) might still be futuristic, but today’s AI agents are already making a sign

Bruno Alho
Oct 14, 2024 · 4 min read
Comparing Context Caching in LLMs: OpenAI vs. Anthropic vs. Google Gemini
Compare context caching in LLMs: OpenAI, Anthropic, and Google Gemini. Discover the best option for your project's cost, ease, and features.

Gad Benram
Oct 13, 2024 · 5 min read
10 Essential AI Technologies for Software Supply Chain Companies
Table of Contents: Introduction; The Software Supply Chain; AI in Software Development: The Rise of Code Assistants...

Gad Benram
Oct 13, 2024 · 6 min read
Knowledge Graph RAG vs. Vector DB RAG: Is It Time for GraphDBs to Shine?
The emergence of AI has revolutionized the way we interact with data—or even knowledge itself. Among the buzzwords circulating in the...

Diogo Gonçalves
Oct 8, 2024 · 6 min read
Moving from Chatbots to Agents
While the terms “Chatbot” and “AI agent” are sometimes used interchangeably, there are notable differences between them:

Gad Benram
Sep 24, 2024 · 5 min read
Prompt Translation: The Way to Switch Between LLMs Without Losing Performance
Since the debut of ChatGPT in late 2022, the landscape of Large Language Models (LLMs) has evolved dramatically. Back then, the primary...

Gad Benram
Sep 20, 2024 · 4 min read
UX in LLM Applications: Examples of 4 Companies Getting It Right and 1 That Missed the Mark
Over the past year, TensorOps has observed a recurring scenario: organizations invest significant time—often 5-7 months—fine-tuning...

Gad Benram
Sep 12, 2024 · 3 min read
OpenAI Unveils o1 Model: The Biggest Leap Towards AGI since ChatGPT
September 12, 2024: OpenAI has unveiled its latest breakthrough in artificial intelligence, the o1 model series, now available in Preview....

Gad Benram
Aug 31, 2024 · 5 min read
What can shift Nvidia's stock up or down?
NVIDIA's stock soared 2750% due to $10B in Q2 data center sales. Buyers like Google and Meta aim to leverage AI tech, but is it a bubble?


TensorOps
Aug 29, 2024 · 2 min read
Lessons Learned From Managing AI Innovation Projects
In this session, senior engineering managers will share the pains and successes of more than 18 months in the GenAI...