MemCast

AI Interfaces Of The Future | Design Review

Exploring cutting-edge AI interfaces that move beyond chat UIs, featuring voice agents, autonomous AI workflows, adaptive interfaces, and AI-generated video production.

36m · Guest: Raphael Shad · Host: Aaron

Verbs Over Nouns

1 / 17

The shift from static UI elements (nouns) to dynamic workflows (verbs) as the foundation of AI interfaces, requiring new design paradigms to visualize and control autonomous actions.

Software today is mostly nouns; AI requires verbs for workflows and actions.
  • Static interfaces use nouns like text forms, buttons
  • AI introduces verbs like auto-complete, auto-suggest, gather info
  • Current tools lack ways to visually represent verbs on screen
Software of today, or up until this point, was mostly clear things you can point at on the screen that are nouns: text forms, dropdowns, buttons, etc. With AI, what really changes is that so much of the design of what AI does is more verbs. It's more the workflows: auto-complete, auto-suggest, go out and gather some information for me, etc. And we don't really have the tooling yet to draw verbs on the screen. (Raphael Shad)
These are all verbs: we're creating videos, we have agents going out executing tasks, and so much of it is how you keep the user in the loop and in control while AI does its magic. (Host)
AI interfaces are at a touch-device 2010 moment, requiring full reimagining of software components.
  • Similar to how touch devices forced UI redesigns
  • Every component is being rethought for AI-native interactions
  • Current static interfaces are being replaced by dynamic, action-oriented designs
It almost feels like back in 2010 or so, when touch devices really came on the market and everything had to be reinvented touch-first. We're at one of those moments again, where all of software, all the components we took for granted, are really being reimagined and reshaped by the builders, startups, and designers out there right now. (Host)
When we first started to get this LLM technology, everything was sort of a chat box and people just prompting it. Now, within just a few short months, or one to two years, we see this explosion of AI interfaces and AI components that are built AI-natively, with totally different modalities for interacting with this new technology, with the LLMs, and really just endless opportunity for iteration and building a new world of software. (Raphael Shad)
Current tools lack visualization for AI workflows as verbs.
  • No existing design patterns for drawing verbs on screen
  • Traditional UI elements (buttons, forms) are nouns, not actions
  • Requires new metaphors for autonomous agent workflows
We don't really have the tooling yet to draw verbs on the screen, and so that's what's really fascinating: how this software is now emerging in this new AI world. (Raphael Shad)
From a high level, what are the differences between, say, the static web-based 2D interfaces we're used to today and where things are going in the future? (Host)
Opening question about interface evolution

From Nouns to Verbs: The Shift in AI Interface Design

2 / 17

Traditional interfaces focused on static elements (nouns) like buttons and forms, while AI interfaces emphasize actions (verbs) like auto-complete and autonomous workflows. This shift requires new design paradigms to visualize dynamic processes.

AI interfaces focus on workflows and actions rather than static UI elements
  • Traditional software UI consists of static elements like buttons, forms, and dropdowns (nouns)
  • AI interfaces emphasize dynamic processes like auto-complete, auto-suggest, and information gathering (verbs)
  • Current design tooling isn't optimized for visualizing these dynamic workflows
  • The challenge is representing actions visually in interfaces where the AI handles complex processes
Software of today, or up until this point, was mostly clear things you can point at on the screen that are nouns: text forms, dropdowns, buttons, etc. With AI, what really changes is that so much of the design of what AI does is more verbs. It's more the workflows: auto-complete, auto-suggest, go out and gather some information for me, etc. (Raphael Shad)
We don't really have the tooling yet to draw verbs on the screen, and so that's what's really fascinating: how this software is now emerging in this new AI world. (Raphael Shad)
Latency becomes a critical UI element in conversational interfaces
  • In voice interfaces, response latency directly impacts perceived naturalness
  • Sub-200ms responses feel human-like while longer delays reveal the robotic nature
  • Some interfaces now expose latency metrics to help developers understand thresholds
  • Visual feedback during voice interactions helps maintain user confidence
The latency is the interface in some ways: how fast it responds to you. The longer it takes, the less it feels like a natural conversation and the more it feels like you're talking to a robot. (Aaron)
They always rendered a little label that shows you, instantly for each answer, the milliseconds of delay, really building you an intuition for how many milliseconds feels natural versus when it feels like you're talking to a robot. (Raphael Shad)

Nouns vs Verbs in AI Interfaces

3 / 17

Traditional software interfaces are built around static 'nouns' like buttons and forms, but AI introduces dynamic 'verbs' that execute workflows autonomously. This shift requires entirely new design paradigms to visualize and control AI-driven actions, marking a fundamental reset for software interfaces similar to the touch revolution of 2010.

AI interfaces shift from static 'nouns' to dynamic 'verbs'

Traditional software interfaces are built around static elements like text fields, buttons, and dropdowns—nouns that users interact with. AI introduces workflows where software autonomously performs actions like gathering information, auto-completing tasks, or executing processes. This shift requires new design paradigms to visualize and manage these verb-based actions, as current UI tools aren't built to represent dynamic workflows. The core challenge is translating abstract AI behaviors into tangible user controls.

Software of today, or up until this point, was mostly clear things you can point at on the screen that are nouns: text forms, dropdowns, buttons, etc. With AI, what really changes is that so much of the design of what AI does is more verbs. It's more the workflows: auto-complete, auto-suggest, go out and gather some information for me, etc. And we don't really have the tooling yet to draw verbs on the screen, and so that's what's really fascinating: how this software is now emerging in this new AI world. (Raphael Shad)
These are all verbs: we're creating videos, we have agents going out executing tasks, and so much of it is how you keep the user in the loop and in control while AI does its magic. We've seen some pretty amazing interfaces to get that level of control and make sure it's doing the right thing, which leads to incredible output that would have taken days or years. It almost feels like back in 2010 or so, when touch devices came on the market and everything had to be reinvented touch-first. We're at one of those moments again, where all of software, all the components we took for granted, are really being reimagined and reshaped by the builders, startups, and designers out there right now. The future is going to be incredible. (Host)
Current UI tools lack verbs, requiring new design paradigms

Existing design tools are optimized for static elements, but AI-driven workflows require dynamic, context-aware interactions that don't fit traditional UI patterns. Designers must invent new ways to represent processes like 'go gather information' or 'auto-complete this task', which aren't just clickable buttons but ongoing actions. This is a fundamental shift requiring rethinking how users interact with software beyond static screens.

We don't really have the tooling yet to draw verbs on the screen, and so that's what's really fascinating: how this software is now emerging in this new AI world. (Raphael Shad)
It almost feels like back in 2010 or so, when touch devices came on the market and everything had to be reinvented touch-first. We're at one of those moments again, where all of software, all the components we took for granted, are really being reimagined and reshaped by the builders, startups, and designers out there right now. (Host)
AI interfaces require reimagining software components from scratch

Just as touch interfaces in 2010 forced a complete redesign of software (e.g., no more right-click menus), AI is now forcing a similar reset. Components like buttons, forms, and navigation menus are being replaced by dynamic, context-aware interactions that adapt to user needs. This isn't incremental improvement but a fundamental rethinking of how users interact with software across all domains.

Back in 2010 or so, when touch devices came on the market, everything had to be reinvented touch-first. We're at one of those moments again, where all of software, all the components we took for granted, are really being reimagined and reshaped by the builders, startups, and designers. (Host)
Within just a few short months, or one to two years, we see this explosion of AI interfaces and AI components that are built AI-natively, with totally different modalities for interacting with this new technology, with the LLMs. (Raphael Shad)

Voice Interface Nuances

4 / 17

Effective voice interfaces require attention to latency, multimodal feedback, and interruption handling to maintain natural conversation flow and user trust.

Latency is the interface — longer response times break the illusion of a human conversation.
  • Delays make interactions feel robotic
  • Real-time responsiveness is critical for natural feel
  • Developers should expose latency metrics for debugging
Latency is an issue, huh? That's what breaks the illusion of this being a real person. (Host)
The latency is the interface in some ways: how fast it responds to you. The longer it takes, the less it feels like a natural conversation and the more it feels like you're talking to a robot. The whole point is to make it seem like you're talking to a human. (Raphael Shad)
Multimodal cues are essential for voice interfaces to indicate active listening.
  • Visual feedback for microphone status (e.g., recording indicator)
  • No visual cues during voice input/output leads to confusion
  • Screen-based feedback complements audio for clarity
When I was speaking, there was no visual feedback making it clear that my voice was actually recognized by the microphone, and similarly, when the voice was answering, there was no visual indication that that's what was happening. (Host)
It's important to pair multimodal cues, so you're not just relying on voice, in these types of scenarios where you do have a screen. On the phone, that would be a different scenario. (Raphael Shad)
Latency metrics in dev mode build intuition for developers.
  • Showing milliseconds of delay helps developers understand performance
  • Metrics provide transparency into system responsiveness
  • Dev mode features aid in debugging and optimization
They always rendered a little label that shows you, instantly for each answer, the milliseconds of delay, really building you an intuition for how many milliseconds feels natural versus when it feels like you're talking to a robot. (Host)
The latency is the interface in some ways: how fast it responds to you. The longer it takes, the less it feels like a natural conversation and the more it feels like you're talking to a robot. The whole point is to make it seem like you're talking to a human. (Raphael Shad)
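A minimal sketch of that dev-mode latency label, in TypeScript. The thresholds, element wiring, and the `renderLatencyBadge`/`askVoiceAgent` helpers are illustrative assumptions, not the demoed product's API.

```typescript
// Minimal sketch of a dev-mode latency badge for a voice agent UI.
// Thresholds and DOM wiring are illustrative assumptions.

const NATURAL_MS = 500;   // assumed: still feels conversational
const ROBOTIC_MS = 1200;  // assumed: clearly breaks the illusion

function renderLatencyBadge(container: HTMLElement, latencyMs: number): void {
  const badge = document.createElement("span");
  badge.textContent = `${Math.round(latencyMs)} ms`;
  badge.style.color =
    latencyMs <= NATURAL_MS ? "green" :
    latencyMs <= ROBOTIC_MS ? "orange" : "red";
  container.appendChild(badge);
}

// Wrap a request/response cycle so every answer gets its own badge,
// building the developer's intuition for what "natural" feels like.
async function askVoiceAgent(
  question: string,
  fetchReply: (q: string) => Promise<string>,
  transcriptEl: HTMLElement,
): Promise<string> {
  const started = performance.now();
  const reply = await fetchReply(question);      // time to first reply
  const latency = performance.now() - started;

  const row = document.createElement("div");
  row.textContent = reply + " ";
  renderLatencyBadge(row, latency);
  transcriptEl.appendChild(row);
  return reply;
}
```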

Voice Interface Latency & Multimodal Feedback

5 / 17

Latency and visual feedback are critical for voice interfaces to feel natural. Delays break immersion, while multimodal cues (like visual indicators) ensure users understand system state. Effective interruption handling and immediate feedback are essential for human-like interactions.

Latency is the core of voice interface quality

In voice interactions, response time directly affects perceived naturalness. Delays as short as hundreds of milliseconds make the system feel robotic, while near-instant responses (under 200ms) create the illusion of human conversation. This latency is not just a technical metric but a critical design element that shapes user trust and engagement—longer delays break immersion and force users to question whether the system is working.

The latency is the interface in some ways: how fast it responds to you. The longer it takes, the less it feels like a natural conversation and the more it feels like you're talking to a robot. (Raphael Shad)
Latency is an issue, huh? That's what breaks the illusion of this being a real person. (Host)
Multimodal feedback is essential for voice interactions

Voice interfaces must provide visual cues alongside audio to confirm input/output states. Without visual indicators (e.g., microphone active, processing status), users can't tell if the system is listening or responding, leading to confusion. This is especially critical in screen-based environments where users expect visual feedback for all actions, unlike phone-only voice interactions where audio alone suffices.

When I was speaking, there was no visual feedback making it clear that my voice was actually recognized by the microphone, and similarly, when the voice was answering, there was no visual indication that that's what was happening. For example, if our laptop was on mute, we weren't sure whether the demo was broken or what was going on. So it's important to pair multimodal cues and not just rely on voice in these types of scenarios where you do have a screen; on the phone, that would be a different scenario. (Host)
It's important to pair multimodal cues, so you're not just relying on voice, in these types of scenarios where you do have a screen. On the phone, that would be a different scenario. (Raphael Shad)
Interrupt handling requires real-time processing

Current voice agents often fail to handle interruptions gracefully, continuing to speak even when the user tries to cut in. This disrupts natural conversation flow and highlights the need for systems that can pause, reprocess inputs, and dynamically adjust responses. Effective interruption handling is a key differentiator between robotic and human-like voice interfaces.

It didn't pause when you were interrupting, and then it entirely missed your question when it actually got done with its own agenda. (Host)
It entirely missed your question when it actually got done with its own agenda. (Raphael Shad)
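A rough sketch of barge-in handling along the lines discussed above, assuming the agent exposes pause/cancel on its speech playback and that voice-activity detection fires when the user starts talking; the `AgentSpeech` interface and `BargeInController` are hypothetical.

```typescript
// Sketch: pause agent speech when the user barges in, then answer the
// interruption instead of finishing the original "agenda".

interface AgentSpeech {
  pause(): void;
  cancel(): void;
  isSpeaking(): boolean;
}

type State = "idle" | "speaking" | "listening";

class BargeInController {
  private state: State = "idle";

  constructor(
    private speech: AgentSpeech,
    private respond: (utterance: string) => Promise<void>,
  ) {}

  // Called by voice-activity detection as soon as the user starts talking.
  onUserSpeechStart(): void {
    if (this.speech.isSpeaking()) {
      this.speech.pause();      // stop talking over the user
      this.state = "listening";
    }
  }

  // Called once the user's utterance has been transcribed.
  async onUserSpeechEnd(transcript: string): Promise<void> {
    this.speech.cancel();       // drop the rest of the old answer
    this.state = "speaking";
    await this.respond(transcript); // address the interruption first
    this.state = "idle";
  }
}
```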

Voice Interfaces: The New Frontier

6 / 17

Voice AI interfaces are achieving human-like interaction quality, enabling natural conversations with software. However, challenges remain around latency, interruption handling, and multimodal feedback.

Voice interfaces require multimodal feedback to maintain user confidence
  • Pure voice interfaces without visual feedback create uncertainty
  • Users can't tell if the system is listening or responding without visual cues
  • Combining voice with visual indicators creates more robust interactions
  • The modality should match the device context (phone vs screen)
When I was speaking, there was no visual feedback making it clear that my voice was actually recognized by the microphone, and similarly, when the voice was answering, there was no visual indication that that's what was happening. (Raphael Shad)
It's important to pair multimodal cues, so you're not just relying on voice, in these types of scenarios where you do have a screen. On the phone, that would be a different scenario. (Raphael Shad)
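One way the multimodal-cue idea could look in code: a small on-screen indicator that mirrors the voice pipeline state. The states, labels, and `bindVoiceIndicator` helper are assumptions for illustration.

```typescript
// Sketch: a tiny state machine mirrored on screen, so the user always knows
// whether the mic heard them or the agent is replying.

type VoiceState = "idle" | "listening" | "thinking" | "speaking";

const LABELS: Record<VoiceState, string> = {
  idle: "Tap to talk",
  listening: "Listening…",
  thinking: "Thinking…",
  speaking: "Speaking…",
};

function bindVoiceIndicator(el: HTMLElement) {
  return (state: VoiceState, micLevel = 0) => {
    el.textContent = LABELS[state];
    // Pulse the indicator with the live mic level while listening,
    // confirming that speech is actually being picked up.
    el.style.opacity = state === "listening"
      ? String(0.5 + Math.min(micLevel, 1) * 0.5)
      : "1";
    el.dataset.state = state; // hook for per-state CSS styling
  };
}

// Usage sketch:
// const setState = bindVoiceIndicator(document.getElementById("voice")!);
// setState("listening", 0.8);
```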
Natural conversation requires handling interruptions gracefully
  • Human conversations involve frequent interruptions and overlaps
  • Current voice AI struggles with mid-speech interruptions
  • Systems either ignore interruptions or lose context
  • Future interfaces need to manage conversational flow more dynamically
When you're talking to a human, the latency is really important, and also interruptions. It felt pretty fast and pretty natural when we were conversing. I wonder what would happen if we tried to interrupt it, would it be able to handle it? (Aaron)
Two things happened: one, it didn't pause when you were interrupting, and two, it entirely missed your question when it actually got done with its own agenda. (Raphael Shad)

Visual Workflow Modeling

7 / 17

Canvases and flowcharts provide intuitive ways to design, monitor, and control complex AI agent workflows through visual representation of branching logic and multi-dimensional processes.

Canvases are a new document type ideal for modeling AI agent workflows.
  • Visual pan/zoom interfaces allow complex process mapping
  • Color-coded blocks distinguish input, actions, outputs
  • Enables non-linear, multi-dimensional process design
Canvases have really emerged as an interesting, almost new document type that seems to lend itself pretty well, not just to design tools or brainstorming tools, but to modeling these kinds of AI processes. (Host)
It's great because it gives us, the user, a visual overview of exactly what steps the agent is going to take, and we can control what it should do at each of these steps. (Raphael Shad)
Branching logic is the key power of AI workflow modeling.
  • Linear flows are insufficient for complex agent decisions
  • Multi-dimensional branching handles real-world unpredictability
  • Visual tools must support non-linear, conditional paths
The power is in the multi-dimensionality, in the branching. So as a starter template to explain the power of this tool for modeling these processes, I think one that is multi-dimensional would really showcase it. (Raphael Shad)
It's always historically been static, and it seems like what's new is actually making it interactive. (Host)
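A sketch of what a branching workflow might look like as data, the structure a canvas tool could render as color-coded nodes and edges and an executor could walk to run the agent. Node kinds, field names, and the example flow are assumptions.

```typescript
// Sketch of a branching (non-linear) agent workflow as data.

type NodeKind = "input" | "action" | "decision" | "output";

interface WorkflowNode {
  id: string;
  kind: NodeKind;
  label: string;
  // Decision nodes branch: each outgoing edge can carry a condition.
  next: { to: string; condition?: string }[];
}

const leadResearchFlow: WorkflowNode[] = [
  { id: "start",  kind: "input",    label: "Company name",           next: [{ to: "search" }] },
  { id: "search", kind: "action",   label: "Search the web",         next: [{ to: "found" }] },
  { id: "found",  kind: "decision", label: "Did we find a website?", next: [
      { to: "scrape", condition: "yes" },
      { to: "ask",    condition: "no" },
    ] },
  { id: "scrape", kind: "action",   label: "Scrape funding data",    next: [{ to: "report" }] },
  { id: "ask",    kind: "action",   label: "Ask the user for a URL", next: [{ to: "scrape" }] },
  { id: "report", kind: "output",   label: "Write summary row",      next: [] },
];

// A renderer draws the branches from `next`; an executor walks the same
// structure, keeping the picture and the process in sync.
```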
Flowcharts resurface in AI era as interactive tools.
  • Legacy flowchart techniques from chip design are being reused
  • Modern interactivity adds real-time control and feedback
  • Combines historical paradigms with new AI capabilities
It's interesting to see this paradigm getting resurfaced in the AI era. We didn't invent this today; we're building on a lot of legacy and standing on the shoulders of giants here. It's always historically been static, and it seems like what's new is actually making it interactive. (Raphael Shad)
Flowcharts, etc.: chip designers 50 years ago would probably say, "Oh yeah, we used to model our things like that." So it's interesting to see this paradigm getting resurfaced in the AI era. (Host)

Adaptive Contextual UIs

8 / 17

Interfaces that dynamically adjust based on content context reduce cognitive load by showing only relevant controls. Consistent keyboard shortcuts maintain usability despite changing UI elements, but clear focus states prevent unintended actions when typing.

UIs that adapt to content context reduce cognitive load

Traditional interfaces show all possible options regardless of context, overwhelming users. Adaptive UIs dynamically surface only relevant actions based on current content—like email-specific response buttons or document-specific formatting tools. This reduces clutter and streamlines workflows, but requires precise context understanding to avoid unpredictability.

Microsoft Word, right, where the thing everybody is so familiar with is a billion buttons in the top row, because they're never sure which one you might need; they don't know the context of how you're editing. With AI, now we don't need to show all the buttons, we can just show you the buttons that are relevant. (Host)
The interface then dynamically changes, which typically wasn't the case with static software. Here, the input is the actual content, and the output of the AI, the LLM, is then the UI to interact back with that content. (Raphael Shad)
Keyboard shortcuts maintain consistency in adaptive UIs

Even as UI elements change based on context, consistent keyboard shortcuts (e.g., pressing 'Y' to confirm) allow users to interact without relearning new controls. This preserves muscle memory while enabling dynamic behavior—critical for high-efficiency workflows like email processing where speed matters.

The buttons and the responses are technically changing for every single email, but the keys that you're pressing do not, and so you can keep your hand right there and know what to expect each time. (Host)
Being able to access all these adaptive options with just a keyboard shortcut, a single letter, is really on point. (Raphael Shad)
Input focus ambiguity causes unintended actions

Adaptive UIs risk accidental actions when keyboard input is ambiguous—e.g., pressing 'Y' to type a letter in a text field versus confirming a button. Clear visual indicators of focus state are essential to prevent unintended commands, especially in high-speed workflows where users expect immediate feedback.

What if I think my cursor is focused inserting text and I want to reply "yes"? Then basically my first "y" keystroke submits a button. So there's always this challenge of being very clear about when an input element is focused and you're typing, versus when typing on the keyboard will just do stuff in your UI. (Host)
Really being very clear about when an input element is focused and you're typing, versus when typing on the keyboard will just do stuff in your UI. (Raphael Shad)
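A minimal sketch of the focus-guard idea: single-letter shortcuts only fire when no text input has focus. The specific key bindings are illustrative assumptions.

```typescript
// Sketch: a global key handler that treats single letters as commands only
// when no text input is focused, avoiding the "my first 'y' submitted a
// button" problem.

const shortcuts: Record<string, () => void> = {
  y: () => console.log("Accept suggested reply"),
  a: () => console.log("Archive email"),
  s: () => console.log("Open scheduling options"),
};

function isTypingContext(target: EventTarget | null): boolean {
  if (!(target instanceof HTMLElement)) return false;
  return (
    target instanceof HTMLInputElement ||
    target instanceof HTMLTextAreaElement ||
    target.isContentEditable
  );
}

document.addEventListener("keydown", (event) => {
  // Keystrokes inside a focused input are text, never commands.
  if (isTypingContext(event.target)) return;
  const action = shortcuts[event.key.toLowerCase()];
  if (action) {
    event.preventDefault();
    action();
  }
});
```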

Visualizing AI Workflows

9 / 17

As AI agents perform complex, autonomous tasks, new interface paradigms like canvas-based flowcharts emerge to help users understand and control these processes.

Canvas interfaces are ideal for modeling complex AI decision trees
  • Traditional linear documentation can't capture branching AI workflows
  • Canvas interfaces allow spatial organization of multi-step processes
  • Color coding helps distinguish different types of actions/nodes
  • The paradigm resembles chip design flowcharts from decades past
Canvases have really emerged as an interesting, almost new document type that seems to lend itself pretty well, not just to design tools or brainstorming tools, but to modeling these kinds of AI processes. (Raphael Shad)
The canvas, and modeling these AI agent decision trees, gets really powerful when it isn't something you could just linearly write in a document like a recipe: first do this, then do this, then do this. Really, the power is in the multi-dimensionality, in the branching. (Raphael Shad)
Zoom levels should adapt to show relevant workflow detail
  • At high zoom levels, detailed text becomes unreadable
  • Interfaces should collapse nodes to colored blocks when zoomed out
  • Different zoom levels should show different fidelity of information
  • This maintains overview while preserving ability to drill into details
Because it is a canvas, you can have different zoom levels showing different fidelity. Right now we're so zoomed out I can't read any of the small text, so why not just hide it and almost collapse the node into, in this case, a brown block, in this case, a yellow block, to give different zoom levels different fidelities? (Raphael Shad)
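A sketch of that zoom-level idea as level-of-detail rendering on a 2D canvas; the readability threshold and `drawNode` helper are assumptions, not how any particular tool implements it.

```typescript
// Sketch: collapse a node to a plain colored block once its text would be
// too small to read at the current zoom level.

interface CanvasNode {
  x: number; y: number; width: number; height: number;
  color: string;   // e.g. brown for actions, yellow for outputs
  label: string;
  detail: string;
}

const MIN_READABLE_PX = 10; // assumed: below this, text is just noise

function drawNode(ctx: CanvasRenderingContext2D, node: CanvasNode, zoom: number): void {
  const w = node.width * zoom;
  const h = node.height * zoom;
  ctx.fillStyle = node.color;
  ctx.fillRect(node.x * zoom, node.y * zoom, w, h);

  const fontPx = 14 * zoom;
  if (fontPx < MIN_READABLE_PX) return; // zoomed out: just the colored block

  ctx.fillStyle = "#000";
  ctx.font = `${fontPx}px sans-serif`;
  ctx.fillText(node.label, node.x * zoom + 4, node.y * zoom + fontPx + 4);

  // Fine-grained detail only appears when zoomed in far enough.
  if (fontPx >= MIN_READABLE_PX * 1.5) {
    ctx.fillText(node.detail, node.x * zoom + 4, node.y * zoom + fontPx * 2 + 4);
  }
}
```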

Data Extraction & Trust

10 / 17

Structured data outputs from AI agents require clear source attribution and transparency to build user trust, especially when handling sensitive or critical information.

Every spreadsheet cell can have its own AI agent for data extraction.
  • AI processes data at cell-level granularity
  • Eliminates need for predefined columns; dynamic column creation
  • Enables on-demand data gathering from multiple sources
It's like a spreadsheet on steroids. (Host)
It's almost like every cell of the spreadsheet gets its own AI agent to get the data that we want, which is pretty incredible. (Raphael Shad)
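A sketch of the per-cell-agent idea: each cell is an independent task that fetches one data point and keeps its source attached, and adding a column only runs agents for the new cells. `runCellAgent` is a hypothetical placeholder for whatever LLM/search backend sits behind it.

```typescript
// Sketch: one "agent" task per cell, with provenance traveling alongside
// each value.

interface CellResult {
  value: string;
  sourceUrl: string;  // where this data point came from
}

type Row = Record<string, CellResult>;

async function runCellAgent(company: string, column: string): Promise<CellResult> {
  // Placeholder: a real system would prompt an LLM with web access here.
  return { value: `<${column} for ${company}>`, sourceUrl: "https://example.com" };
}

// Adding a column re-runs one agent per row, for just the new cell,
// instead of regenerating the whole table.
async function addColumn(rows: Map<string, Row>, column: string): Promise<void> {
  await Promise.all(
    [...rows.keys()].map(async (company) => {
      rows.get(company)![column] = await runCellAgent(company, column);
    }),
  );
}
```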
Source attribution is critical for validating AI-generated data.
  • Inline references allow users to verify information
  • Reduces hallucination risks by showing provenance
  • Builds trust through transparency in data sourcing
By having a source closely attached that you can just click on, each of these right here, you can see immediately where the sources came from. It helps us validate and trust the data that the AI agent is bringing back. (Host)
It's also interesting, you mentioned before how we've always had flowcharts, and these are like modern flowcharts with the canvases. Citing sources in footnotes isn't a new thing either; that's been around since the beginning of books, but now it's being used in a new way, to validate and verify in real time the information an agent brings back, which is really cool. (Raphael Shad)
Dynamic column creation enables human-guided data extraction.
  • Users define new data points on the fly
  • Agents fetch and populate columns automatically
  • Eliminates rigid pre-defined schemas
What we can do is add columns and have the agent go out again, not on a static set of columns that were predefined, but our columns, things we want to know, kind of putting the human back into the loop. (Host)
It's like a spreadsheet on steroids. (Host)

Visual Workflow Modeling for AI Agents

11 / 17

Canvas-based interfaces enable complex AI agent decision trees through spatial layouts, zoom levels, and color coding. Legacy flowchart paradigms are resurfacing in AI for dynamic, interactive workflows that replace static diagrams with executable processes.

Canvases enable complex agent decision trees

Traditional linear workflows fail to represent the branching, multi-dimensional logic of AI agents. Visual canvases allow designers to model complex decision paths, conditional branches, and parallel tasks in a spatial layout. This makes it easier to understand, debug, and iterate on agent behavior—especially for non-technical users who need to see the entire process flow at once.

It's great because it gives us, the user, a visual overview of exactly what steps the agent is going to take, and we can control what it should do at each of these steps. (Raphael Shad)
The power is in the multi-dimensionality, in the branching. So as a starter template to explain the power of this tool for modeling these processes, I think one that is multi-dimensional would really showcase it. (Host)
Zoom levels and color coding improve workflow clarity

Effective visual workflows use zoom levels to show high-level overviews or detailed steps, and color-coded elements to distinguish input, action, and output nodes. Without these, complex diagrams become unreadable. A legend or consistent visual language is critical for users to quickly interpret the workflow structure without getting lost in details.

Using color to show different types of nodes, like input, actions, output, etc.: I almost feel like I would want a legend showing which color is what. (Host)
Because it is a canvas, you can have different zoom levels showing different fidelity. Right now we're so zoomed out I can't read any of the small text, so why not just hide it and almost collapse the node into, in this case, a brown block, in this case, a yellow block, to give different zoom levels different fidelities? (Raphael Shad)
Legacy flowchart paradigms are resurfacing in AI

While the concept of flowcharts isn't new (used by chip designers decades ago), AI agents are reviving this approach for dynamic, interactive workflows. Modern tools now allow real-time editing and execution of these diagrams, transforming static diagrams into living, executable processes that guide AI behavior in real time.

It's interesting to see this paradigm getting resurfaced in the AI era. We didn't invent this today; we're building on a lot of legacy and standing on the shoulders of giants here. (Raphael Shad)
It's always historically been static, and it seems like what's new is actually making it interactive. (Host)

Adaptive Interfaces: UI That Responds to Content

12 / 17

AI enables interfaces that dynamically adapt based on content context, moving beyond static layouts to personalized interaction flows.

Email interfaces can adapt response options based on message content
  • Traditional email clients offer static reply options
  • AI can analyze email content to suggest context-specific responses
  • Response buttons adapt to each email's needs (e.g., scheduling options)
  • Keyboard shortcuts maintain consistency despite changing button labels
It's pulling up the user's email and suggesting specific responses to that email based on the content of that email. It's almost changing what the reaction buttons are, exactly. (Aaron)
The buttons and the responses are technically changing for every single email, but the keys that you're pressing do not, and so you can keep your hand right there and know what to expect each time. (Aaron)
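A sketch of adaptive reply buttons bound to stable keys: the labels change per email, the positions and keys do not. `suggestActions` is a placeholder stub standing in for an LLM call, and the 1/2/3 key scheme is an assumption.

```typescript
// Sketch: LLM-proposed, email-specific actions rendered onto fixed slots
// so muscle memory survives the changing labels.

interface SuggestedAction { label: string; draftReply: string; }

async function suggestActions(emailBody: string): Promise<SuggestedAction[]> {
  // Placeholder: a real implementation would prompt an LLM with the email.
  return [
    { label: "Accept meeting", draftReply: "Sounds good, see you then." },
    { label: "Propose new time", draftReply: "Could we do Thursday instead?" },
    { label: "Decline politely", draftReply: "Thanks, but I'll pass this time." },
  ];
}

async function renderAdaptiveReplies(emailBody: string, container: HTMLElement): Promise<void> {
  const actions = (await suggestActions(emailBody)).slice(0, 3);
  container.innerHTML = "";
  actions.forEach((action, i) => {
    const key = String(i + 1);                       // stable keys: 1, 2, 3
    const button = document.createElement("button");
    button.textContent = `[${key}] ${action.label}`; // label changes per email
    button.onclick = () => console.log("Send:", action.draftReply);
    container.appendChild(button);
  });
}
```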
Prompt builders should offer structured input options
  • Freeform text prompts are powerful but intimidating
  • Many users don't know specialized terminology (e.g., 'glass morphic')
  • Interface could offer visual building blocks alongside text input
  • This lowers the barrier while maintaining flexibility
Having maybe an interface here that gives me selection and ideas, almost like pills, maybe design terms that I can drag in like Lego bricks, versus needing to know these terms and just type them or learn them from the examples. (Raphael Shad)
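A sketch of that "pill" prompt-builder idea: clickable design terms that compose into the freeform prompt instead of replacing it. The term list and wiring are assumptions.

```typescript
// Sketch: "Lego brick" pills of design vocabulary for users who don't
// already know the jargon.

const STYLE_TERMS = ["glassmorphic", "neumorphic", "flat", "brutalist", "pastel palette"];

function mountPromptPills(pillBar: HTMLElement, promptInput: HTMLTextAreaElement): void {
  for (const term of STYLE_TERMS) {
    const pill = document.createElement("button");
    pill.textContent = term;
    pill.onclick = () => {
      // Append the term to the freeform prompt so pills and typed text compose.
      promptInput.value = promptInput.value
        ? `${promptInput.value.trim()}, ${term}`
        : term;
    };
    pillBar.appendChild(pill);
  }
}
```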

Trust & Transparency in AI Outputs

13 / 17

Inline source citations and footnotes validate AI-generated data, transforming passive references into active verification tools. Per-cell AI agents in spreadsheets dynamically fetch specific data points, while academic-style footnotes ensure accountability for real-time information.

Inline sources validate AI-generated data

AI systems that cite sources directly within outputs (e.g., footnotes in spreadsheets) build trust by allowing users to verify information instantly. This is especially critical for factual data where hallucinations are common—users can quickly check the origin of each data point without leaving the interface, reducing uncertainty about accuracy.

This is another common pattern that we see: if AI is going out and doing a thing, how do you know you can trust the results it brings back? Sometimes it hallucinates, sometimes it gets the wrong thing. By having a source closely attached that you can just click on, each of these right here, you can see immediately where the sources came from. It helps us validate and trust the data that the AI agent is bringing back. (Host)
It's also interesting, you mentioned before how we've always had flowcharts, and these are like modern flowcharts with the canvases. Citing sources in footnotes isn't a new thing either; that's been around since the beginning of books, but now it's being used in a new way, to validate and verify in real time the information an agent brings back, which is really cool. (Raphael Shad)
Footnotes as real-time validation in software

Just as academic papers use footnotes to cite sources, modern AI interfaces integrate inline references that link directly to the data origin. This transforms passive citations into active verification tools—users can click to see the source page, ensuring transparency and accountability for AI-generated content in real-time workflows.

When you Googled in the past, you just had a list of websites, a list of basically the references or the links, and they were your destination. But now that you ask a chat box and get the answer back, you want to have the links, the references, inlined. I believe it was maybe Perplexity that did that pattern first, where you had these little round numbered dots right in line with the answer. (Host)
This is a really nice pattern that is used here, and it could even be used in other contexts or be inlined here. In a spreadsheet it works to pull it out into its own popover when you're looking for space and want to condense things; otherwise, having the footnotes almost directly in the answer is a really successful pattern. (Raphael Shad)
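A sketch of the inline-footnote pattern as data plus a renderer. The `[n]` marker format and `CitedAnswer` shape are assumptions rather than any specific product's implementation.

```typescript
// Sketch: render an answer with inline numbered citations plus a footnote list.

interface Citation { index: number; url: string; title: string; }

interface CitedAnswer {
  // Text with [1]-style markers, e.g. "Raised a Series B [1] in 2023 [2]."
  text: string;
  citations: Citation[];
}

function renderCitedAnswer(answer: CitedAnswer): string {
  // Replace each [n] marker with an inline link to its source.
  const linked = answer.text.replace(/\[(\d+)\]/g, (match, n) => {
    const cit = answer.citations.find((c) => c.index === Number(n));
    return cit ? `<a href="${cit.url}" title="${cit.title}">[${n}]</a>` : match;
  });
  const footnotes = answer.citations
    .map((c) => `<li value="${c.index}"><a href="${c.url}">${c.title}</a></li>`)
    .join("");
  return `<p>${linked}</p><ol>${footnotes}</ol>`;
}
```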
Per-cell AI agents in spreadsheets enhance precision

By treating each spreadsheet cell as an independent AI agent, systems can fetch specific data points on demand without predefined columns. This allows users to dynamically add columns (e.g., 'funding raised'), with each cell's AI agent sourcing the correct information—turning spreadsheets into intelligent, self-updating data tables that adapt to user needs.

It's almost like every cell of the spreadsheet gets its own AI agent to get the data that we want, which is pretty incredible. It's like a spreadsheet on steroids. (Host)
We took a prompt as an input and we got the spreadsheet, structured data, as an output. In the background it went to these websites, scraped them, and assembled this spreadsheet. Now what we can do is add columns and have the agent go out again, not on a static set of columns that were predefined, but our columns, things we want to know, kind of putting the human back into the loop. (Raphael Shad)

AI Video Generation: Trading Fidelity for Iteration Speed

14 / 17

AI video production tools use clever UX patterns to enable rapid iteration despite the computational demands of high-quality output generation.

Blurred previews enable fast iteration before full rendering
  • High-quality AI video generation takes significant time (minutes)
  • Showing blurred previews with audio allows immediate feedback
  • Users can iterate on script and timing before committing to render
  • This maintains creative flow despite technical constraints
They're trading off fidelity for immediacy and putting the human kind of back in the loop, because if it were just a generate button, we would wait for 12 minutes, figure out that something is not quite right, and then give the machine a new prompt and wait until it comes back. (Raphael Shad)
Blurring the video is a great design approach to do that. (Raphael Shad)
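A sketch of the fidelity-for-immediacy trade: play the fast-to-generate voice over a blurred still right away, and only trigger the slow lip-synced render once the user commits. The `VideoJob` shape and helpers are hypothetical placeholders, not the demoed product's API.

```typescript
// Sketch: progressive fidelity for AI video generation.

interface VideoJob {
  audioUrl: string;                    // fast to generate
  previewFrameUrl: string;             // a single still, shown blurred
  renderFinal: () => Promise<string>;  // slow: resolves to final video URL
}

function showBlurredPreview(job: VideoJob, stage: HTMLElement): void {
  const img = document.createElement("img");
  img.src = job.previewFrameUrl;
  img.style.filter = "blur(12px)";        // fidelity traded for immediacy
  const audio = new Audio(job.audioUrl);  // script changes can be judged now
  stage.replaceChildren(img);
  void audio.play();
}

async function confirmAndRender(job: VideoJob, stage: HTMLElement): Promise<void> {
  // Only after the user is happy with script + voice do we pay the
  // multi-minute cost of the full lip-synced render.
  const finalUrl = await job.renderFinal();
  const video = document.createElement("video");
  video.src = finalUrl;
  video.controls = true;
  stage.replaceChildren(video);
}
```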
AI video generation enables personalized content at scale
  • Deepfake technology can clone voices and appearances with minimal samples
  • Videos can be dynamically personalized (e.g., changing names)
  • This enables mass customization previously impossible with traditional production
  • Ethical considerations around consent and misuse become critical
They just need a few minutes of video of me, or whoever, talking, and then they can basically process it automatically in their models to create the deepfake. And you were saying something completely different than this here, that's incredible. (Aaron)

Adaptive UIs

15 / 17

Context-aware interfaces dynamically change based on content, showing only relevant actions and reducing cognitive load by eliminating unnecessary UI elements.

Adaptive UIs show only relevant actions based on content context.
  • Eliminates static button clutter (e.g., Microsoft Word's toolbar)
  • UI changes dynamically based on current task or document
  • Reduces cognitive load by focusing on what's needed now
Microsoft Word, right, where the thing everybody is so familiar with is a billion buttons in the top row, because they're never sure which one you might need; they don't know the context of how you're editing. With AI, now we don't need to show all the buttons, we can just show you the buttons that are relevant. (Host)
Some of the adaptive interfaces that we see emerge: based on the content of, for example, an email or document, the interface then dynamically changes, which typically wasn't the case with static software. Here, the input is the actual content, and the output of the AI, the LLM, is then the UI to interact back with that content. (Raphael Shad)
Hot keys remain consistent even as UI elements change.
  • Keyboard shortcuts stay fixed despite dynamic button changes
  • Allows users to maintain muscle memory for common actions
  • Balances adaptability with predictable interaction patterns
The buttons and the responses are technically changing for every single email, but the keys that you're pressing do not, and so you can keep your hand right there and know what to expect each time. (Host)
Being able to access all these adaptive options with just a keyboard shortcut, a single letter, is really on point. (Raphael Shad)
Adaptive UIs balance dynamic changes with consistent interaction patterns.
  • Buttons change per context but keyboard shortcuts remain fixed
  • Users value consistency in interaction patterns
  • Prevents confusion while allowing context-aware actions
The buttons and the responses are technically changing for every single email, but the keys that you're pressing do not, and so you can keep your hand right there and know what to expect each time. (Host)
The challenge, of course, is predictability; people love to have their billion buttons in the exact same place. (Host)

Iterative Human-AI Collaboration

16 / 17

Progressive fidelity (blurred previews) accelerates iteration by letting users confirm direction before full generation. Incremental updates preserve existing work, while prompt feedback highlights respected vs ignored elements to refine instructions and improve AI understanding.

Progressive fidelity improves iteration speed

AI systems that show low-fidelity previews (e.g., blurry video) while generating high-fidelity outputs let users iterate quickly without waiting for full renders. This balances immediacy with quality—users can confirm the direction early, then refine before final generation, avoiding wasted time on incorrect outputs.

Trading off fidelity for immediacy and putting the human kind of back in the loop, because if it were just a generate button, we would wait for 12 minutes, figure out that something is not quite right, and then give the machine a new prompt and wait until it comes back. So this is a really clever trick to create this iterative human-machine collaboration interface. (Raphael Shad)
The easier, or faster, part in generating this is actually creating the voice. The hard part is that it takes many minutes to actually process and generate the video with the right lip movement to match the text you've entered. So rather than showing you lips moving that are off from what you put in, they first show you just a blurry version with the audio, so you can get a sense of what it's going to be like. Then you click generate, and it says 12 minutes right here is how long it's going to take. (Host)
Incremental updates preserve existing work

When modifying AI-generated outputs (e.g., changing a color in a design), systems that only update the changed elements—rather than regenerating everything—save time and maintain consistency. This delta-based approach is crucial for iterative design, where small tweaks shouldn't require full reprocessing of complex assets.

What if I just took this and said, okay, explain your changes. Okay, so make the sidebar blue, run revision. Okay, now we're waiting again. Hopefully this will be faster, hopefully it's an incremental change where we can only submit the delta and not do the whole thing over again in a single shot, not just for the wait but also for resources, and to preserve the existing design that we did like and didn't want to turn blue. So let's see how it deals with diffs. And for consistency too, because especially when you're prompting to create graphics, one of the challenges is that if you want to change one element, like a hat on a person, it's hard to keep the rest of it consistent. That's a common challenge right now, and so if they're able to do this, I think that speaks pretty highly. (Host)
If they're able to do this, I think that speaks pretty highly. Any interface designer or technical team figuring out the challenge of how to add sub-prompts, or how to only change things iteratively, that's really the frontier. (Raphael Shad)
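A sketch of delta-based revision: only the changed element is opened for editing while everything else is locked, so the parts of the design you already like are preserved. The `RevisionRequest` shape and the `/revise` endpoint are assumptions for illustration.

```typescript
// Sketch: submit only the delta of a revision instead of regenerating the
// whole design.

interface DesignElement { id: string; spec: string; }

interface RevisionRequest {
  baseDesign: DesignElement[]; // what we already have and like
  instruction: string;         // e.g. "make the sidebar blue"
  lockedIds: string[];         // elements the model must not touch
}

function buildRevision(
  design: DesignElement[],
  instruction: string,
  changedIds: string[],
): RevisionRequest {
  return {
    baseDesign: design,
    instruction,
    // Everything outside the edit is locked, which is what keeps a change
    // to one element (the hat on the person) from rippling through the rest.
    lockedIds: design.map((e) => e.id).filter((id) => !changedIds.includes(id)),
  };
}

// Usage sketch:
// const req = buildRevision(currentDesign, "make the sidebar blue", ["sidebar"]);
// await fetch("/revise", { method: "POST", body: JSON.stringify(req) });
```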
Prompt feedback highlights respected vs ignored elements

When AI generates outputs from prompts, showing which parts of the prompt were successfully executed versus ignored (e.g., through visual highlights) helps users refine their instructions. This feedback loop allows humans to learn how to communicate effectively with AI systems, improving future prompts and reducing trial-and-error.

What did it index on from the prompt and execute on, and what did it maybe fail on? If there could be that feedback loop, it could help the human refine their prompt and learn how to interact with the machine, and help the AI figure out what it did well and should keep doing, and what it didn't do well and should get more data to improve. (Host)
Given the output generated, what are the things the machine actually respected from your prompt, and where did it ignore or struggle? Giving that feedback back to the human and the prompt, maybe with little squiggly lines or with color, showing what it indexed on from the prompt and executed on, and what it maybe failed on. (Raphael Shad)
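Since this feedback loop is still a proposal in the conversation, here is only a sketch of a data shape it might use: prompt spans tagged as respected or ignored, rendered as highlights over the original prompt. The span format is an assumption.

```typescript
// Sketch: mark which parts of a prompt the model respected vs. ignored,
// and wrap them in tags the UI can style (e.g. a squiggly underline).

type SpanStatus = "respected" | "ignored" | "uncertain";

interface PromptSpan {
  start: number; // character offsets into the prompt text
  end: number;
  status: SpanStatus;
}

function highlightPrompt(prompt: string, spans: PromptSpan[]): string {
  // Spans are assumed non-overlapping and sorted by start offset.
  let out = "";
  let cursor = 0;
  for (const span of spans) {
    out += prompt.slice(cursor, span.start);
    out += `<mark class="${span.status}">${prompt.slice(span.start, span.end)}</mark>`;
    cursor = span.end;
  }
  return out + prompt.slice(cursor);
}
```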

Video Generation Workflow

17 / 17

Balancing real-time feedback with full-generation latency requires clever UI design, such as blurred previews, to enable iterative refinement before final output.

Blurring video previews trades fidelity for immediacy during generation.
  • Low-fidelity previews allow quick iteration on scripts
  • Full generation happens later, reducing wait time
  • Maintains user engagement during long processing times
The easier, or faster, part in generating this is actually creating the voice. The hard part is that it takes many minutes to actually process and generate the video with the right lip movement to match the text you've entered. So rather than showing you lips moving that are off from what you put in, they first show you just a blurry version with the audio, so you can get a sense of what it's going to be like. Then you click generate, and it says 12 minutes right here is how long it's going to take. (Host)
Trading off fidelity for immediacy and putting the human kind of back in the loop, because if it were just a generate button, we would wait for 12 minutes, figure out that something is not quite right, and then give the machine a new prompt and wait until it comes back. So this is a really clever trick to create this iterative human-machine collaboration interface. (Raphael Shad)
Iterative human-machine collaboration is key in video generation.
  • Blurred previews allow quick script adjustments
  • Users can refine content before full rendering
  • Reduces wasted time on incorrect outputs
A clever trick to create this iterative human-machine collaboration interface. (Raphael Shad)
Blurring the video is a great design approach to do that. (Host)
Text-to-video systems need better script-to-action mapping.
  • Manual selection of body language gestures is tedious
  • Future systems should auto-detect gestures from script
  • Reduces friction in creating expressive video content
You have to select it manually. Yeah, so I selected it manually; you could imagine in the future they would auto-detect it, right? That would be really interesting. (Host)
You can almost imagine how you could highlight certain parts of the script and then, from a drop-down there, choose suggested but also standard parts of this library. I'll just try: point to myself, what is it, "I'm crushing it," or "I'm crushed." So there's a lot of interplay with the text interface to the left. (Host)
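A sketch of the auto-detection idea the hosts imagine: a naive keyword pass that maps script sentences to gestures from a library, instead of selecting each gesture manually. The trigger phrases and gesture names are made-up assumptions.

```typescript
// Sketch: naive keyword-based gesture detection from a video script.

const GESTURE_TRIGGERS: Record<string, RegExp> = {
  "point-to-self": /\b(I|I'm|my|myself)\b/i,
  "thumbs-up": /\b(great|crushing it|awesome)\b/i,
  "shrug": /\b(not sure|maybe|who knows)\b/i,
};

interface GestureCue { gesture: string; sentence: string; }

function detectGestures(script: string): GestureCue[] {
  const cues: GestureCue[] = [];
  for (const sentence of script.split(/(?<=[.!?])\s+/)) {
    for (const [gesture, pattern] of Object.entries(GESTURE_TRIGGERS)) {
      if (pattern.test(sentence)) {
        cues.push({ gesture, sentence });
        break; // at most one gesture per sentence keeps delivery natural
      }
    }
  }
  return cues;
}
```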
⚙ Agent-readable JSON index
{
  "memcast_version": "0.1",
  "episode":  {
    "id": "DBhSfROq3wU",
    "title": "AI Interfaces Of The Future | Design Review",
    "podcast": "Y Combinator",
    "guest": "Raphael Shad",
    "host": "Aaron",
    "source_url": "https://www.youtube.com/watch?v=DBhSfROq3wU",
    "duration_minutes": 37
  },
  "concepts":  [
    {
      "id": "verbs-over-nouns",
      "title": "Verbs Over Nouns",
      "tags":  [
        "3d-ai"
      ]
    },
    {
      "id": "from-nouns-to-verbs-the-shift-in-ai-interface-design",
      "title": "From Nouns to Verbs: The Shift in AI Interface Design",
      "tags":  [
        "ai-alignment",
        "design-paradigms"
      ]
    },
    {
      "id": "nouns-vs-verbs-in-ai-interfaces",
      "title": "Nouns vs Verbs in AI Interfaces",
      "tags":  [
        "ai-alignment",
        "user-experience",
        "design-paradigms"
      ]
    },
    {
      "id": "voice-interface-nuances",
      "title": "Voice Interface Nuances",
      "tags":  [
        "ai-adoption",
        "adaptive-ui"
      ]
    },
    {
      "id": "voice-interface-latency-multimodal-feedback",
      "title": "Voice Interface Latency & Multimodal Feedback",
      "tags":  [
        "latency",
        "multimodal-design"
      ]
    },
    {
      "id": "voice-interfaces-the-new-frontier",
      "title": "Voice Interfaces: The New Frontier",
      "tags":  [
        "ai-adoption",
        "latency",
        "adaptive-ui"
      ]
    },
    {
      "id": "visual-workflow-modeling",
      "title": "Visual Workflow Modeling",
      "tags":  [
        "ai-adoption",
        "workflow-design"
      ]
    },
    {
      "id": "adaptive-contextual-uis",
      "title": "Adaptive Contextual UIs",
      "tags":  [
        "adaptive-ui",
        "cost-efficiency",
        "context-aware"
      ]
    },
    {
      "id": "visualizing-ai-workflows",
      "title": "Visualizing AI Workflows",
      "tags":  [
        "ai-adoption",
        "workflow-design"
      ]
    },
    {
      "id": "data-extraction-trust",
      "title": "Data Extraction & Trust",
      "tags":  [
        "3d-ai",
        "confidence"
      ]
    },
    {
      "id": "visual-workflow-modeling-for-ai-agents",
      "title": "Visual Workflow Modeling for AI Agents",
      "tags":  [
        "ai-adoption",
        "workflow-design",
        "multimodal-design"
      ]
    },
    {
      "id": "adaptive-interfaces-ui-that-responds-to-content",
      "title": "Adaptive Interfaces: UI That Responds to Content",
      "tags":  [
        "adaptive-ui",
        "context-aware"
      ]
    },
    {
      "id": "trust-transparency-in-ai-outputs",
      "title": "Trust & Transparency in AI Outputs",
      "tags":  [
        "ai-adoption",
        "data-quality"
      ]
    },
    {
      "id": "ai-video-generation-trading-fidelity-for-iteration-speed",
      "title": "AI Video Generation: Trading Fidelity for Iteration Speed",
      "tags":  [
        "3d-ai",
        "latency",
        "iterative-design"
      ]
    },
    {
      "id": "adaptive-uis",
      "title": "Adaptive UIs",
      "tags":  [
        "adaptive-ui",
        "context-aware"
      ]
    },
    {
      "id": "iterative-human-ai-collaboration",
      "title": "Iterative Human-AI Collaboration",
      "tags":  [
        "ai-adoption",
        "iterative-design"
      ]
    },
    {
      "id": "video-generation-workflow",
      "title": "Video Generation Workflow",
      "tags":  [
        "3d-ai",
        "latency"
      ]
    }
  ]
}