Tag: LLM

Gemini LLM Machine Learning July 22, 2024

Is a Zero Temperature Deterministic? - Learn more about a crucial LLM model parameter, and how to configure it on Gemini Pro with Vertex AI.

LLM Vertex AI July 22, 2024

Control LLM output with response type and schema - Vertex AI introduces two new features, response_mime_type and response_schema, to control the output format of large language models (LLMs).

Billing Generative AI LLM July 22, 2024

Control LLM costs with context caching - Context caching is a cost-saving technique for large language models (LLMs) with extensive context windows. The cached content can be used for subsequent prompts, and the number of input tokens cached are billed at a reduced rate.

LLM July 15, 2024

Caching Out with Gemini: Making AI Chat Less Taxing (on Your Wallet) - Context caching, a feature of Google's Gemini API, optimizes AI chat interactions by storing frequently used data and reducing repetitive requests. It saves computational costs and enhances efficiency, particularly for chatbots with extensive backstories, video analysis, large document processing, and code analysis. By caching content like PDFs and videos, users can ask questions based on the cached data, leading to more streamlined and cost-effective AI conversations.

Generative AI Google Kubernetes Engine Kubernetes LLM Machine Learning July 15, 2024

Distributed OpenSource LLM Fine-Tuning with LLaMA-Factory on GKE - This blog post explores distributed fine-tuning for LLMs using open-source tool LLaMA-Factory on Google Kubernetes Engine. LLaMA-Factory empowers researchers and developers to leverage pre-trained LLaMA models and efficiently fine-tune them on their own datasets.

AI Generative AI LLM July 8, 2024

Quizaic — A Generative AI Case Study - Continuing with the series application Quizaic, which uses generative AI to create and play high quality trivia quizzes. This article explores how best to assess the accuracy of our AI-generated quizzes.

Generative AI LLM Official Blog Vertex AI July 8, 2024

How to evaluate the impact of LLMs on business outcomes - The Vertex Gen AI Evaluation Service provides a toolkit with quality-controlled and explainable methods to evaluate the impact of large language models (LLMs) on business outcomes. It offers online and offline evaluations, auto-logging in Vertex AI Experiments, and pre-built pipeline components for production monitoring.

BigQuery Data Analytics Generative AI LLM Official Blog July 8, 2024

Prompting best practices for BigQuery data canvas - Tips to increase Natural Language to SQL or Chart queries in Bigquery Data Canvas.

Gemini LLM Python July 1, 2024

How to prompt Gemini asynchronously using Python on Google Cloud - How to send all your prompts at the same time and collect the answers, rather than sending them one by one, using Python.

LLM July 1, 2024

Building a Custom Classification API on Google Cloud: A Technical Deep Dive - Unlock the potential of LLMs with a custom API that streamlines content classification for many real-world applications.

Generative AI LLM June 24, 2024

Quizaic — A Generative AI Case Study - Part 3— Prompting and Image Generation.

Generative AI LLM Official Blog Vertex AI June 24, 2024

Announcing Anthropic’s Claude 3.5 Sonnet on Vertex AI, providing more choice for enterprises - Anthropic's newly released model, Claude 3.5 Sonnet, is now generally available on Google Cloud's Vertex AI platform. With advanced capabilities in reasoning, knowledge, math, and coding, it can power various applications, including coding assistance, customer support, data analysis, and visual processing. Enterprises can leverage Vertex AI's enterprise-grade infrastructure, tooling, and security to build and deploy production-grade generative AI applications.

AI Generative AI LLM Networking Official Blog June 24, 2024

Exploring Google Cloud networking enhancements for generative AI applications - Google Cloud offers new networking capabilities to optimize traffic for generative AI applications. These capabilities include Cross-Cloud Network for accelerated AI training and inference, Model as a Service Endpoint for secure and reliable access to AI models, custom AI-aware load balancing for minimized inference latency, optimized traffic distribution for AI inference applications, and Service Extensions for enhanced gen AI serving.

LLM Machine Learning June 17, 2024

Implementing Semantic Caching: A Step-by-Step Guide to Faster, Cost-Effective GenAI Workflows - This article is a focused, in-depth exploration of semantic caching, its intricate implementation process, its relationship to LLMs, and its strategic positioning within the broader AI landscape.

BigQuery Data Analytics LLM Official Blog June 17, 2024

Exploring synthetic data generation with BigQuery DataFrames and LLMs - BigQuery DataFrames enables the generation of synthetic data directly within BigQuery, eliminating the need for third-party solutions or data movement. It integrates seamlessly with Vertex AI, allowing users to leverage advanced language models like Gemini to generate code that produces synthetic data based on specified schemas or existing table structures. This approach addresses data privacy concerns and accelerates AI development by providing a scalable and cost-efficient platform for synthetic data generation.

BigQuery dbt Generative AI LLM Terraform Vertex AI June 10, 2024

Productionise genAI directly in dbt - Using Vertex AI in DBT.

AI Flutter LLM Machine Learning June 10, 2024

Quizaic — A Generative AI Case Study - Quizaic is a demo application that uses generative AI to create high-quality trivia quizzes and manage the interactive quiz-playing experience. The app is built using Google Cloud Platform, AI, Flutter, Machine Learning, and LLM.

BigQuery LLM Official Blog June 10, 2024

Getting started with retrieval augmented generation on BigQuery with LangChain - The blog demonstrates how to build a simple RAG pipeline using BigQuery and LangChain, and highlights the benefits of using BigQuery Vector Search, which is optimized for large-scale analytical workloads and offers features like scalability, serverless operation, and fine-grained access control.

LLM Official Blog Vertex AI June 3, 2024

Vertex AI's Grounding with Google Search: how to use it and why - Vertex AI's Grounding with Google Search helps improve the factuality and up-to-date information of large language models (LLMs) by grounding their responses on trusted Google Search world knowledge and public facts. It addresses the limitations of LLMs, such as hallucinations, staleness, lack of citation, and limited relation to private data. With Grounding, LLMs can provide more reliable and trustworthy responses, especially for questions that require recent or factual information.

BigQuery Gemini LLM May 27, 2024

Unlocking Multimodal AI with Google Gemini, Embeddings, Vertex Search, and RAG: A Practical Guide with BigQuery - Google's latest AI innovations, including Gemini, embeddings, Vertex Search, and Retrieval Augmented Generation (RAG), are revolutionizing how we interact with and extract insights from data. By leveraging these concepts with BigQuery, users can unlock powerful AI capabilities such as image tagging, vector search, and retrieval augmented generation. This enables enhanced image discovery, improved user experience, efficient scalability, and the generation of creative ideas and insights. The combination of these technologies opens up a world of possibilities for building recommendation systems, question-answering bots, and interactive multimodal experiences.

BigQuery Data Analytics LLM Official Blog May 27, 2024

Unlocking enhanced LLM capabilities with RAG in BigQuery - Now you can build smarter AI applications from right inside your data warehouse.

AI LLM Official Blog May 20, 2024

To tune or not to tune? A guide to leveraging your data with LLMs

LLM Official Blog Translation API May 13, 2024

LLMs, AI Studio, higher quality, oh my! Our latest Translation AI advancements - Announcing new generative model for Google Cloud’s Translation API.

BigQuery Gemini LLM Official Blog May 6, 2024

Simplifying data modeling and schema generation in BigQuery using multi-modal LLMs - Now you can pass multi-modal input to Gemini to create data models for your data warehouse.

AI LLM April 29, 2024

LLm infini-attention with linear complexity - Introducing Google’s Infini-attention to increase LLM attention windows and reduce quadratic complexity.

Cloud Spanner Generative AI LLM April 29, 2024

LLM in your favorite Transactional Database: Spanner - Build a Patent Search App with Spanner, Vector Search & Gemini 1.0 Pro!

Gemini Generative AI LLM April 29, 2024

Gemini has entered the chat: building an LLM-powered Discord bot - Take your first steps into the world of Generative AI by building a Discord bot that uses Gemini to talk with other users.

LLM Official Blog Vertex AI April 22, 2024

Meta Llama 3 Available Today on Google Cloud Vertex AI - Meta Llama 3 model is available on Vertex AI Model Garden.

BigQuery LLM Official Blog April 22, 2024

Introducing LLM fine-tuning and evaluation in BigQuery - Supervised fine-tuning via BigQuery uses a dataset which has examples of input text (the prompt) and the expected ideal output text (the label), and fine-tunes the model to mimic the behavior or task implied from these examples.

AI Google Kubernetes Engine LLM April 22, 2024

GKE Orchestration : Deploy your Gemma LLM - Deploying Gemma - lightweight open model on GKE.

Generative AI LLM April 8, 2024

Shh, It’s Free: But Let’s Not Tell Google! Exploring Gemini’s Multimodal Capabilities on Vertex AI - Consider this your backdoor pass into a free club, where the only membership requirement is your curiosity.

AI Gemini LLM Python April 1, 2024

Crafting Bespoke Output Formats with Gemini API - Propose a method using question phrasing and API calls to craft a bespoke output, enabling seamless integration with user applications.

Generative AI LLM Machine Learning April 1, 2024

Demystifying Generative AI for Enterprise Developers - Guide to kickstart your Enterprise GenAI journey.

Google Kubernetes Engine Kubernetes LLM April 1, 2024

GKE + Gemma + Ollama: The Power Trio for Flexible LLM Deployment - Deploying Gemma on GKE.

BigQuery Generative AI LLM March 25, 2024

In-Place LLM Insights: BigQuery & Gemini for Structured & Unstructured Data Analytics - Introduction.

BigQuery LLM March 11, 2024

Apply GenAI on Dataset in Data Mesh with HandOns experiment (GCP BigQuery) - Apply GenAI on Dataset in Data Mesh with HandOns experiment (GCP BigQuery).

Cloud Memorystore LLM Official Blog March 11, 2024

Memorystore for Redis vector search and LangChain integrations for gen AI - An example of how to combine Memorystore for Redis with LangChain to create a chatbot that answers questions about movies.

AI LLM Official Blog March 11, 2024

Domain-specific AI apps: A three-step design pattern for specializing LLMs - This article embarks on a journey through the key advantages of domain-specific LLMs.

LLM Machine Learning Vertex AI Feb. 26, 2024

Making AI more Open and Accessible to Cloud Developers with Gemma on Vertex AI - Gemma is a family of open, lightweight, and easy-to-use models developed by Google Deepmind.

LLM Python Feb. 26, 2024

Using and Finetuning Google’s State-of-the-Art Open Source Model Gemma-2B - This article describes how to use and fine-tune Gemma model.

Google Kubernetes Engine Infrastructure Kubernetes LLM Feb. 19, 2024

Serving Open Source LLMs on GKE using vLLM framework - This post shows how to serve Open source LLM models(Mistrial 7B, Llama2 etc) on Nvidia GPUs(L4, Tesla-T4, for example) running on Google Cloud Kubernetes Engine (GKE).

AI LLM Machine Learning Official Blog Feb. 19, 2024

Your RAGs powered by Google Search technology, part 2 - A deeper look at the critical technologies that are essential for building a successful RAG system to help ground large language models (LLM) when building applications.

LLM Official Blog Feb. 19, 2024

Your RAGs powered by Google Search technology, part 1 - Exploring the key features that power Google-quality retrieval in LLM and RAG-based applications.

AI Data Science LLM Machine Learning Feb. 19, 2024

BigQuery Data Analyses With Gemini LLM - The Gemini-Pro LLM model is now available in BigQuery ML. Here’s how to use it.

Cloud Dataflow LLM Official Blog Feb. 11, 2024

Leveraging streaming analytics for actionable insights with gen AI and Dataflow - In this blog post, we showcase how to get real-time LLM insights in an easy and scalable way using Dataflow.

Cloud Workstations Generative AI LLM Official Blog Feb. 11, 2024

No GPU? No problem. localllm lets you develop gen AI apps on local CPUs - In this post, we introduce you to a novel solution that allows developers to harness the power of LLMs locally on CPU and memory, right within Cloud Workstations, Google Cloud’s fully managed development environment.

Generative AI LLM Official Blog Feb. 5, 2024

Build enterprise gen AI apps with Google Cloud databases - An overview of databases on GCP that can be used to store and query vector embeddings.

AlloyDB LLM Official Blog Translation API Jan. 22, 2024

How to create a multilingual chatbot that queries AlloyDB with Langchain, Streamlit, LLMs, and Google Translate

AI BigQuery LLM Machine Learning Official Blog Jan. 22, 2024

Integrating BigQuery data into your LangChain application - See how to integrate your BigQuery data into LLM solutions.

LLM Vertex AI Jan. 15, 2024

Large Language Models(LLMs) in Google Cloud with VertexAI - From concept to code: Everything you need to know to start building an application with GenAI’s LLMs.

LLM Vertex AI Dec. 18, 2023

Fine Tuning of LLM’S in GCP Vertex AI - This article delves into fine-tuning of LLM’S why it is required, how to fine-tune it, and the results that can be achieved through fine-tuning.

Generative AI LLM Machine Learning Python Dec. 18, 2023

Google Imagen (through GCP Vertex AI Studio) as fashion design assistant - In this article, we will explore how generative AI can assist fashion designers in generating new ideas and designs using Google’s suite of generative models for text and image generation.

BigQuery LLM Dec. 18, 2023

BigQuery Meets LLM: Unlocking New Frontiers in AI-Driven Data Analytics - Unlocking a level of understanding that was previously unimaginable.

API Colab LLM Dec. 11, 2023

Fine-tune and deploy an LLM on Google Colab Notebook with QLoRA and VertexAI - An example of fine-tuning and deploying MistralAI 7B model using QLoRA on your data and VertexAI endpoint, in Google Colab Notebook .

Generative AI LLM Official Blog Dec. 3, 2023

Introducing sample GenAI Databases Retrieval App – augment your LLMs with Google Cloud databases

LLM Machine Learning Vertex AI Nov. 27, 2023

Vertex AI Model Garden - Vertex AI Model Garden is a collection of pre-built foundation models, task-specific models, and Google ML APIs.

LLM Official Blog Vertex AI Oct. 30, 2023

Serving open-source large language models efficiently on Vertex AI Model Garden - An updated LLM-efficient serving solution that improves serving throughput in Vertex AI.

 

Latest Issues




Contact

Zdenko Hrček
Třebanická 183
Prague, Czech Republic
Phone: +420 777 283 075
Email: [email protected]