terminal
howtonotcode.com
business

Gemini 3.1 Pro

Ai Tool

Gemini is a family of multimodal large language models (LLMs) developed by Google DeepMind, and the successor to LaMDA and PaLM 2. Comprising Gemini Pro, Gemini Deep Think, Gemini Flash, and Gemini Flash Lite, it was announced on December 6, 2023. It powers the chatbot of the same name.

article 4 storys calendar_today First seen: 2026-02-20 update Last seen: 2026-03-03 open_in_new Website menu_book Wikipedia

Resources

Links to check for updates: homepage, feed, or git repo.

home Homepage

Stories

Showing 1-4 of 4

Google’s Gemini 3.1 Flash-Lite targets high-volume, low-latency workloads

Google released Gemini 3.1 Flash-Lite, a faster, cheaper model aimed at high-volume developer workloads and signaling a broader shift to lighter LLMs for routine backend and data tasks. Google’s launch of [Gemini 3.1 Flash-Lite](https://thenewstack.io/google-gemini-3-1-flash-lite/) emphasizes low-latency responses for tasks where cost is critical, with preview access via the Gemini API in Google AI Studio and enterprise access in Vertex AI, alongside industry moves like OpenAI’s GPT-5.3 Instant toward lighter models ([context and availability](https://www.thedeepview.com/articles/openai-google-target-lighter-models)). Independent coverage pegs Flash-Lite at $0.25/million input tokens and $1.5/million output tokens—about one-eighth the price of Gemini 3.1 Pro—and notes support for four “thinking” levels to trade speed for reasoning when needed ([pricing and modes](https://simonwillison.net/2026/Mar/3/gemini-31-flash-lite/#atom-everything)). For backend/data teams, this sweet spot makes Flash-Lite a strong default for translation, content moderation, summarization, and structured generation (dashboards/simulations), reserving heavier models for only the hardest requests ([use cases](https://www.thedeepview.com/articles/openai-google-target-lighter-models)). If your pipelines push files, mind Gemini’s surface-specific limits across Apps (including NotebookLM notebooks), API, and enterprise tools—think up to 10 files per prompt, 100MB per file/ZIP with caveats, strict video caps, and code folder/GitHub repo constraints—so ingestion doesn’t silently truncate or fail ([file-handling constraints](https://www.datastudios.org/post/gemini-file-upload-support-explained-supported-formats-size-constraints-and-document-handling-acr)). Zooming out, the race to lighter models (OpenAI’s GPT-5.3 Instant and Alibaba’s Qwen Small Model Series) underscores a clear pattern: push routine throughput to cheaper, faster tiers and escalate to heavyweight reasoning only on ambiguity or failure ([trend snapshot](https://www.thedeepview.com/articles/openai-google-target-lighter-models)).

calendar_today 2026-03-03
google gemini-31-flash-lite gemini-api google-ai-studio vertex-ai

Windsurf ships new models, Linux ARM64, and enterprise hooks

Windsurf rolled out new frontier coding models, full Linux ARM64 support, and enterprise-grade Cascade Hooks while community feedback spotlights its transparent crediting versus rivals' opaque limits. Windsurf’s latest updates add Gemini 3.1 Pro, Claude Sonnet 4.6, GLM-5, Minimax M2.5, and GPT-5.3-Codex-Spark with time-limited credit multipliers, plus quality-of-life fixes and features like automatic Plan→Code switching, skills loading from .agents/skills, tracked rules in post_cascade_response, and diff zones auto-closing on commit; importantly, it now provides full Linux ARM64 deb/rpm packages and enterprise cloud config for Cascade Hooks with Devin service key auth, as detailed in the [Windsurf changelog](https://windsurf.com/changelog). A power user’s comparison underscores cost control and predictability: they favored Windsurf’s clear credit model over Cursor/Claude Code’s rate-limit surprises, keeping GitHub Copilot Pro+ for predictable premium requests while continuing to code primarily in Windsurf, per this [Reddit write-up](https://www.reddit.com/r/windsurf/comments/1r9b58e/i_almost_left_windsurf/).

calendar_today 2026-02-20
windsurf gemini-31-pro claude-sonnet-46 glm-5 minimax-m25