October 17, 2024 No Comments

How to use Function calling with OpenAI Realtime API

Introduction Realtime communication has changed the way businesses connect with customers, handle information, and offer services. OpenAI Realtime APIs make it easy to enable these fast interactions between apps and users. In this article, we will look at the main benefits of OpenAI Realtime APIs, examples of how they are used, and how they can […]

October 14, 2024 No Comments

Simplifying OpenAI Function Calling with Structured Output: A 2024 Guide

Introduction OpenAI’s function calling feature, introduced earlier, empowers developers to create interactive applications by enabling language models to call functions. This feature allows models to return structured data that can trigger APIs, extract data, or perform tasks. Recently, OpenAI has enhanced this capability with Structured Outputs, simplifying the way developers interact with models. Now, instead […]

October 9, 2024 No Comments

Introducing Structured Outputs with OpenAI

Introduction In application development, ensuring that data exchanged between systems is properly structured is critical—especially when using formats like JSON, which is widely adopted for data communication. However, getting models to produce valid JSON outputs consistently can be a challenging task. Developers often need to invest significant time in crafting complex prompts to ensure the […]

September 10, 2024 No Comments

Building a RAG Bot for Slack Using LangChain and OpenAI

Introduction​ In today’s busy work environments, getting information quickly is very important. With so many documents, finding what you need can be hard. A PDF Q&A bot can help, especially when used with Slack, a popular tool for team communication. This bot lets you ask questions about PDF documents and get instant answers, making it […]

June 25, 2024 No Comments

VapiAI: A Developer’s Guide To Creating A Conversational Voice Bot In Minutes

What is VapiAI? Vapi is the Voice AI platform designed specifically for developers, streamlining the process of creating voice AI agents. With Vapi, developers can build, test, and deploy voice AI applications in mere minutes instead of the typical months. This platform addresses the fundamental challenges inherent in voice AI development, offering a robust and […]

June 6, 2024 No Comments

Comparing Q&A Performance of Phi-3, ChatGPT, Gemini, and Claude on Text, Tables, and Graphs

Introduction In today’s digital age, extracting meaningful insights from PDFs is a common task. Whether it’s for academic research, business analysis, or everyday information retrieval, we often rely on advanced models to perform these tasks efficiently. This blog aims to compare four popular models—Phi-3, GPT-3.5, Gemini1.5, and Claude2.1—in handling various types of data within PDFs, […]

June 5, 2024 No Comments

PaliGemma: A Lightweight Open-Source VLM for Image Analysis and Understanding

PaliGemma stands out as a lightweight vision-language model (VLM) that’s freely available. It goes beyond generating simple captions for your images, offering deeper understanding through insightful analysis. Inspired by the PaLI-3 VLM, PaliGemma is built on open-source components like the SigLIP vision model (SigLIP-So400m/14) and the Gemma 2B language model. PaliGemma’s architecture combines a powerful […]

May 29, 2024 No Comments

Evaluating GPT-4o And Gemini 1.5-Pro: Which AI Reigns Supreme?

OpenAI recently unveiled its flagship GPT-4o model at the Update event, offering it for free to everyone. This model is multimodal, capable of accepting both text and image inputs and producing text outputs, enhancing its versatility and application. The announcement marked a significant milestone in the accessibility of advanced AI technology. In a rapid follow-up, […]

May 2, 2024 No Comments

Parallel Function Calling in OpenAI using ChatGPT

Introduction We’ve already explored the functionality of function calling within OpenAI, where it identifies the requirement for a function call based on user prompts and responds with a JSON object containing arguments to call your functions. Upon requesting a function call, the model executes it and sends back the results to the API. For further […]