Changelog

Follow up on the latest improvements and updates.

RSS

New
set global default 2
  • Set the default LLM for a specific tenant from the admin dashboard.
set default LLM for tenant
  • Set the default LLM as a General Admin.
set default as general admin
  • Disable LLMs Globally.
Screen Recording 2025-06-12 at 10
  • Disable LLMs for specific tenants.
tenant disable llm
  • Disable LLM as General Admin
disable as general admin2
Fixed
  • Fixed a bug causing some users to not be able to save edits to apps and workflows created by another user.
New
New LLMs including Open AI's o3 and o4-mini, Google Gemini 2.5 Pro and Flash, and Llama 4 Maverick and Scout.
image
  • Description from OpenAI:
    OpenAI o3 is our most powerful reasoning model that pushes the frontier across coding, math, science, visual perception, and more. It sets a new SOTA on benchmarks including Codeforces, SWE-bench (without building a custom model-specific scaffold), and MMMU. It’s ideal for complex queries requiring multi-faceted analysis and whose answers may not be immediately obvious. It performs especially strongly at visual tasks like analyzing images, charts, and graphics. In evaluations by external experts, o3 makes 20 percent fewer major errors than OpenAI o1 on difficult, real-world tasks—especially excelling in areas like programming, business/consulting, and creative ideation. Early testers highlighted its analytical rigor as a thought partner and emphasized its ability to generate and critically evaluate novel hypotheses—particularly within biology, math, and engineering contexts.
image
  • Description from OpenAI:
    OpenAI o4-mini is a smaller model optimized for fast, cost-efficient reasoning—it achieves remarkable performance for its size and cost, particularly in math, coding, and visual tasks. It is the best-performing benchmarked model on AIME 2024 and 2025. In expert evaluations, it also outperforms its predecessor, o3‑mini, on non-STEM tasks as well as domains like data science. Thanks to its efficiency, o4-mini supports significantly higher usage limits than o3, making it a strong high-volume, high-throughput option for questions that benefit from reasoning.
image
  • Description from Google:
    Gemini 2.5 Pro is our state-of-the-art thinking model, capable of reasoning over complex problems in code, math, and STEM, as well as analyzing large datasets, codebases, and documents using long context. Gemini 2.5 Pro rate limits are more restricted since it is a preview model.
image
  • Description from Google:
    Our best model in terms of price-performance, offering well-rounded capabilities. Gemini 2.5 Flash rate limits are more restricted since it is an experimental / preview model.
image
  • Description from Meta:
    Llama 4 Scout, a 17 billion active parameter model with 16 experts, is the best multimodal model in the world in its class and is more powerful than all previous generation Llama models, while fitting in a single NVIDIA H100 GPU. Additionally, Llama 4 Scout offers an industry-leading context window of 10M and delivers better results than Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 across a broad range of widely reported benchmarks.
image
  • Description from Meta:
    Llama 4 Maverick, a 17 billion active parameter model with 128 experts, is the best multimodal model in its class, beating GPT-4o and Gemini 2.0 Flash across a broad range of widely reported benchmarks, while achieving comparable results to the new DeepSeek v3 on reasoning and coding—at less than half the active parameters. Llama 4 Maverick offers a best-in-class performance to cost ratio with an experimental chat version scoring ELO of 1417 on LMArena.
  • Display of thinking tokens fro reasoning models. Current supported models include DeepSeek, o3, and o4-mini.
thinking tokens click
Improved
  • Improved performance and speed for DeepSeek.
New
  • New LLM Selector with featured LLMs, model attributes, credit bands, searching, and other improvements.
View Featured LLMs
featured models2
Sidebar for Viewing All Models
view all models
Filter by Developer
filter by developer
Model Attributes
  • Model Descriptions
  • Capabilities (Low, Medium, High)
  • Speed (Slow, Medium, Fast)
  • Credit Usage (Lowest, Low, Medium Low, Medium High, High, Premium)
  • Vision capabilities (Images)
  • Tool usage (AI-Powered Search + Other Tools)
image
  • New model selector and sidebars in workflows and apps
image
image
  • Scrape websites as a step in workflows. Use inputs, outputs, or text.
image
website step 5
Screenshot 2025-05-29 at 9
Screenshot 2025-05-29 at 9
Screenshot 2025-05-29 at 9
Improved
  • Autoscroll in chat adjusted to make it easier for users to read outputs as they stream.
autoscroll2
  • Better responsiveness with the chat sidebar and other chat improvements.
Fixed
  • Fixed a bug causing some users to experience stream errors with long outputs lasting longer than two minutes.
New
  • New LLMs: Claude 4 Sonnet and Claude 4 Opus. Claude Opus 4 is the world’s best coding model, with sustained performance on complex, long-running tasks and agent workflows. Claude Sonnet 4 is a significant upgrade to Claude Sonnet 3.7, delivering superior coding and reasoning while responding more precisely to your instructions.
image
Improved
  • Various chat improvements including styling updates to code blocks, text-input auto resizing, and pasting user experience.
Fixed
  • Fixed bug causing legacy word files to sometimes fail file upload.
New
  • View
    your
    run history of AI Workflows. You can open the workflow run history for a specific workflow from the run view or view all run history from the "Run History" tab in Workshop. Review credits used for each step, time to run each step, and compare inputs with outputs.
run history3
Screenshot 2025-05-16 at 8
Run History Management Part 1
Run History Management3
  • Generate, view, and manage API keys outside of the customer dashboard. End customers can now generate their own API Keys.
Screenshot 2025-05-16 at 8
  • End customer MFA settings can now be accessed by navigating to Workspace > Account
Improved
  • Various improvements to AI-Powered search including the swapping of Brave Search for Google Search. The AI-Powered search button will also now stay clicked throughout a chat.
Fixed
  • Various fixes for File Upload. Some users experienced errors when uploading relatively larger files that were still under the file size limit. Improvements made to the copying and pasting of files and Office Document Contents sometimes pasting as images.
New
  • Introduced a redesigned chat experience with a new chat sidebar, chat window, and chat input, supporting chat creation, pinning, editing, deletion, and advanced file uploads (including OneDrive integration).
  • Added the ability to rename chats with better deletion from the chat sidebar.
new chat
  • Added a powerful, filterable, and sortable chat history table with multi-select, bulk actions, and improved search and filtering options.
chat sorting
  • Integrated AI-assisted HTML editing and suggestions in email template editing.
email editor leet
Improved
  • Improved the chat input box to add more spacing and an adjustable height.
  • Enhanced chat management with real-time updates, optimistic UI, and animated transitions.
  • Improved chat input with file attachment previews and AI-powered search toggle.
  • Updated markdown rendering with improved code block copying and syntax highlighting.
Bug Fixes
  • Adjusted muted text color for better readability in both light and dark themes.
  • Improved error handling and user feedback in file upload and chat streaming features.
New
  • Use an AI-powered combination of multiple search and web crawling tools inside of the chat interface. The AI will choose what providers to use and can select multiple tools for the task. Currently the AI has access to Brave Search, Perplexity, and Firecrawl. Note: Some operations involving multiple tools and reading multiple web pages make take time to respond.
tell me about hatz ai
web scrape hatz
search enabled models
  • Create and run multi-step workflows (beta). Chain multiple prompts together. Use the output of one prompt into another step.
  • Select different LLMs for each node of the workflow.
  • New "Workflows" section in workshop.
  • See output from each step of the workflow result.
multi-step
workflow demo
  • Create custom email templates for sending password resets.
email template
Improved
  • Better chat input and clickable links.
clickable links
  • Drag and drop file upload in app builder.
drag and drop
  • Private Apps populate in "My Apps"
New
  • GPT-4.1, GPT-4.1 Mini, and GPT-4.1 Nano added to the model selector. These models offer a large context window and a credit usage similar to 4o. They are excellent for coding or tasks requiring a lot of input.
New
  • Grok 3 model family added to the model selector. These models are currently marked beta by xAI. The release includes Grok 3, Grok 3 mini, Grok 3 fast, and Grok 3 mini fast.
  • Amazon Nova model family added to the model selector. The release includes Amazon Nova Micro, Amazon Nova Pro (vision), and Amazon Nova Lite (vision)
new llms grok amazon
Fixed
  • Fixed a bug sometimes preventing Client Admins from deleting apps.
Improved
  • Dual audio recording for AI Phone Agents
Fixed
  • Fixed bug causing constants to not properly load when invoked via the API
Load More