What can this AI actually do?

A plain-English reference for AI capabilities, plans, constraints, and implementations.

  1. Pick a capability Learn what AI can do today: understand, create, act for me, and more.
  2. Check your plan See which subscription tier unlocks it.
  3. Get the real answer Find the limits, caveats, and platform support you need. Ditch the hype that you don't.
18 capabilities 66 implementations Last built: 2026-04-03

Understand

3 capabilities

Hear Audio and Speech

Understand

Can accept spoken or audio input for understanding.

4 of 5 free No terminalNo Linux Region-limited

What Counts

  • microphone input in conversation
  • understanding spoken prompts
  • multimodal listening modes

What Does Not Count

  • text-only chat
  • outputting audio without taking spoken input
Voice inputLiveVoice Mode

Read Text and Documents

Understand

Can interpret text, documents, PDFs, and similar uploaded or pasted materials.

4 of 5 free No terminal

What Counts

  • reading attached documents
  • parsing structured textual files
  • understanding document content for analysis

What Does Not Count

  • merely storing a file without interpreting it
  • only connecting to a system without reading its contents
file analysisdocument understandingspreadsheet parsing

See Images and Screens

Understand

Can interpret images, screenshots, camera views, or other visual interface context.

4 of 5 free No terminal Region-limited

What Counts

  • image understanding
  • screenshot interpretation
  • live visual context in apps or browsers

What Does Not Count

  • image generation
  • text-only reasoning with no visual input
VisionProject Astrascreen understanding

Respond

2 capabilities

Speak Back in Real Time

Respond

Can respond with live or near-live speech in a conversational flow.

4 of 5 free No terminalNo Linux Region-limited

What Counts

  • back-and-forth spoken conversation
  • real-time voice replies
  • low-latency live voice modes

What Does Not Count

  • downloadable audio only
  • delayed audio generation without conversation
Advanced Voice ModeGemini LiveVoice Mode

Write and Explain

Respond

Can produce useful written responses, explanations, summaries, and structured output.

5 of 6 free No terminal Region-limited

What Counts

  • core chat responses
  • explanatory writing
  • structured written answers

What Does Not Count

  • model entitlement with no distinct output behavior
  • actions or tool use without a written result
chatexplanationsummary

Create

4 capabilities

Generate Images

Create

Can create or transform images from prompts, edits, or multimodal instructions.

1 of 4 free No terminal Region-limited

What Counts

  • text-to-image generation
  • image editing or variation
  • branded image generation tools

What Does Not Count

  • only understanding an image
  • producing text descriptions without making an image
DALL-EImagenDesignerAurora

Generate Video

Create

Can create or transform video outputs from prompts or edits.

0 of 3 free No terminal Region-limited

What Counts

  • text-to-video generation
  • short generated clips
  • vendor video generation modes

What Does Not Count

  • static image generation
  • ordinary screen recording tools
SoraVeoImagine

Make and Edit Documents

Create

Can create, revise, or iteratively develop documents, presentations, notes, or other standalone work products.

5 of 6 free No terminal

What Counts

  • writing workspaces
  • artifact/document generation
  • editing inside office or document environments

What Does Not Count

  • simple single-message answers without document context
  • code-only workflows when no document artifact is involved
ArtifactsCanvasOffice integration

Write and Edit Code

Create

Can generate, refactor, debug, or execute code-oriented work.

4 of 4 free

What Counts

  • coding agents
  • code editing workspaces
  • code generation and review

What Does Not Count

  • general writing features with no coding support
  • model access alone without a coding workflow
Claude CodeCodexAI StudioGrok Studio

Work With My Stuff

3 capabilities

Organize Work in Projects

Work With My Stuff

Can keep ongoing work organized in named projects, spaces, collections, or similar containers.

4 of 4 free No terminal

What Counts

  • persistent project workspaces
  • reusable folders of context
  • grouped research or collaboration spaces

What Does Not Count

  • one-off chats with no persistent organization
  • generic chat history without project structure
ProjectsSpacesCollections

Remember Context Over Time

Work With My Stuff

Can retain preferences, memory, or persistent context across sessions.

4 of 4 free No terminal Region-limited

What Counts

  • explicit memory features
  • saved preferences reused later
  • persistent user context across chats

What Does Not Count

  • only remembering content inside a single active session
  • temporary context windows with no persistence
Memorysaved preferences

Use Files I Provide

Work With My Stuff

Can work with files or documents the user uploads or attaches directly.

4 of 4 free No terminal

What Counts

  • uploaded documents
  • attached spreadsheets
  • local file analysis in chat or project workspaces

What Does Not Count

  • live app integrations without file upload
  • generic chat with no file handling
uploadsattachmentsfile analysis

Act for Me

3 capabilities

Do Multi-Step Research

Act for Me

Can plan, search, synthesize, and return a more developed research output across multiple steps or sources.

3 of 5 free No terminal

What Counts

  • deep research reports
  • multi-query synthesis
  • cited research workflows

What Does Not Count

  • a single simple web lookup
  • unsupported claims without evidence gathering
Deep ResearchDeepSearchPro SearchPages

Search the Web

Act for Me

Can retrieve current information from the web as part of an interaction.

3 of 4 free No terminal Region-limited

What Counts

  • built-in web search
  • browser-assisted web lookup
  • current-information retrieval

What Does Not Count

  • answers based only on training data
  • private system connectors that do not search the open web
SearchBrowseWeb lookup

Take Actions and Run Tools

Act for Me

Can go beyond plain chat by taking actions, using tools, browsing interfaces, or running task steps on the user's behalf.

3 of 5 free Region-limited

What Counts

  • agentic actions
  • tool use
  • form filling or workflow execution

What Does Not Count

  • text suggestions that the user must execute manually
  • static answers without tool or action support
Agent ModeComet actionsCode agents

Connect

2 capabilities

Build Reusable AI Workflows

Connect

Can create reusable assistants, agents, GPTs, gems, skills, or other repeatable AI setups.

1 of 4 free Region-limited

What Counts

  • building a named reusable assistant
  • packaging instructions or tools for repeated use
  • creating no-code or low-code agent workflows

What Does Not Count

  • one-off prompts in a single chat
  • temporary settings that do not create a reusable artifact
Custom GPTsGemsSkillsAgent Builder

Connect to External Systems

Connect

Can connect to external apps, services, APIs, data sources, or tool ecosystems beyond the base chat product.

2 of 6 free Region-limited

What Counts

  • connectors to cloud services
  • MCP or similar tool connections
  • actions that call external systems

What Does Not Count

  • uploading a file directly into chat
  • reading only the text already present in the conversation
ConnectorsMCPActionsExtensions

Access Context

1 capability

Use It on My Surfaces

Access Context

Can be accessed across the surfaces that matter to the user, such as web, desktop, mobile, terminal, API, or embedded environments.

5 of 6 free Region-limited

What Counts

  • access on distinct user surfaces
  • native or embedded interface availability
  • API or terminal availability when relevant

What Does Not Count

  • a capability claim with no actual surface support
  • broad branding claims with no surface detail
webmobileterminalAPI