Input Simulation
xa11y can drive a target application in two distinct ways:
- Accessibility actions —
locator.press(),locator.type_text(),locator.toggle(), etc. These call the platform accessibility API directly. They work even when the target window is not focused, are deterministic, and are the preferred way to drive a UI. - Input simulation —
InputSimgenerates OS-level pointer and keyboard events at the system event layer (CGEvent on macOS, SendInput on Windows, XTest on X11).
Prefer accessibility actions wherever possible. Reach for input simulation only for gestures that have no accessibility equivalent: drag-and-drop, scroll wheels, global shortcuts, or tests that must exercise the event loop itself.
The two mechanisms are not bridged. A failed accessibility action never falls back to input simulation, and input simulation never inspects the accessibility tree for you. If you want to click an element with synthesised input, you compute its bounds via the a11y API and hand a point or Element to InputSim.
When to use which
Section titled “When to use which”| Scenario | Mechanism |
|---|---|
| Press a button, toggle a checkbox, expand a menu | Accessibility action (locator.press(), locator.toggle(), locator.expand()) |
| Type text into a focused field with IME support | Accessibility action (locator.type_text()) |
| Drag-and-drop between two elements | Input simulation (mouse.drag) |
| Scroll a list with the wheel | Input simulation (mouse.scroll) |
| Global shortcut without a focused element | Input simulation (keyboard.chord) |
| Click at a fixed pixel coordinate | Input simulation (mouse.click((x, y))) |
Permissions
Section titled “Permissions”| Platform | Requirement |
|---|---|
| macOS | Accessibility + Input Monitoring for the terminal/IDE running your code |
| Windows | None for normal user sessions |
| Linux (X11) | XTest extension enabled on the X server. Zero setup. |
| Linux (Wayland / headless) | Membership in the input group (sudo usermod -aG input $USER then re-login). xa11y opens /dev/uinput and registers a virtual evdev keyboard+pointer — same mechanism used by xdotool --using-uinput, ydotool, wtype, Steam Input, and Wine. The privilege grants global input read/write access — comparable to macOS Input Monitoring. |
On macOS, input simulation usually also requires the target window to be foregrounded. Activate the window explicitly before synthesising input.
xa11y picks the Linux backend at runtime: if DISPLAY is set we drive XTest; otherwise we fall through to uinput. There is no compile-time feature flag — both backends are always built into xa11y-linux. The uinput path is compositor-agnostic, so it works on every Wayland desktop (GNOME, KDE Plasma, sway, Hyprland, Cosmic, weston) and also on headless Linux servers. Errors from a missing input group surface as PermissionDenied with an actionable message; a missing uinput kernel module surfaces as Unsupported.
Basic usage
Section titled “Basic usage”use xa11y::*;
fn main() -> Result<()> { let sim = input_sim()?;
// Click at a screen coordinate sim.mouse().click(Point::new(400, 300))?;
// Click at the centre of an element's current bounds let app = App::by_name("Finder", std::time::Duration::from_secs(5))?; let button = app.locator("button[name='New Folder']").element()?; sim.mouse().click(&button)?;
// Type text into whatever has keyboard focus sim.keyboard().type_text("Hello, world!")?;
// Keyboard shortcut — Cmd/Ctrl+S sim.keyboard().chord(Key::Char('s'), &[Key::Meta])?;
Ok(())}import xa11y
sim = xa11y.input_sim()
# Click at a screen coordinatesim.click((400, 300))
# Click at the centre of an element's current boundsapp = xa11y.App.by_name("Finder")button = app.locator("button[name='New Folder']").element()sim.click(button)
# Type text into whatever has keyboard focussim.type_text("Hello, world!")
# Keyboard shortcut — Cmd/Ctrl+Ssim.chord("s", ["Meta"])import { inputSim, App } from '@crowecawcaw/xa11y';
const sim = inputSim();
// Click at a screen coordinateawait sim.click([400, 300]);
// Click at the centre of an element's current boundsconst app = await App.byName('Finder');const button = await app.locator("button[name='New Folder']").element();await sim.click(button);
// Type text into whatever has keyboard focusawait sim.typeText('Hello, world!');
// Keyboard shortcut — Cmd/Ctrl+Sawait sim.chord('s', ['Meta']);Targets
Section titled “Targets”Every pointer method accepts either a screen-space point or an Element:
- Point — absolute screen coordinates in the platform’s native coordinate space (points on macOS, physical pixels on Windows and Linux). Origin is the top-left of the primary display; negative values are valid on multi-monitor setups.
- Element — uses the centre of the element’s
boundsat the time of the call.InputSimdoes not re-read the accessibility tree for you — re-fetch via a locator first if the UI may have moved.
In Rust, additional anchors are available via ClickOptions.anchor (Anchor::TopLeft, Anchor::Offset { dx, dy }, etc.).
let m = sim.mouse();
m.click((100, 100))?; // left clickm.double_click(&element)?; // double clickm.right_click((100, 100))?; // right clickm.move_to((200, 200))?; // move without clickingm.drag((50, 50), (200, 200))?; // left-drag with 150ms durationm.scroll((400, 300), ScrollDelta::vertical(-3))?; // 3 ticks down
// Shift-click with explicit optionsm.click_with( ClickTarget::from(&element), ClickOptions { held: vec![Key::Shift], anchor: Anchor::TopLeft, ..Default::default() },)?;
// Slow drag for apps that need more framesm.drag_with( (50, 50), (200, 200), DragOptions { duration: Duration::from_millis(500), ..Default::default() },)?;sim.click((100, 100)) # left clicksim.double_click(element) # double clicksim.right_click((100, 100)) # right clicksim.move_to((200, 200)) # move without clickingsim.drag((50, 50), (200, 200)) # left-dragsim.scroll((400, 300), dy=-3) # 3 ticks downsim.scroll((400, 300), dx=2, dy=0) # 2 ticks rightawait sim.click([100, 100]); // left clickawait sim.doubleClick(element); // double clickawait sim.rightClick([100, 100]); // right clickawait sim.moveTo([200, 200]); // move without clickingawait sim.drag([50, 50], [200, 200]); // left-dragawait sim.scroll([400, 300], 0, -3); // dx=0, dy=-3 (3 ticks down)Scroll: positive dx scrolls right, positive dy scrolls content down (moves the viewport up), in platform “ticks” (typically one notch of a physical scroll wheel).
Keyboard
Section titled “Keyboard”Modifier keys (Shift, Ctrl, Alt, Meta) are ordinary keys — pass them alongside character keys. Meta is the platform’s “command” modifier: Cmd on macOS, Win on Windows, Super on Linux.
let k = sim.keyboard();
// Tap a single keyk.press(Key::Enter)?;k.press(Key::Char('a'))?;
// Shortcuts — modifiers are just keys held during the tapk.chord(Key::Char('a'), &[Key::Meta])?; // Cmd/Ctrl+Ak.chord(Key::Char('z'), &[Key::Meta, Key::Shift])?; // Cmd/Ctrl+Shift+Z
// Uppercase letters — hold Shift explicitlyk.chord(Key::Char('a'), &[Key::Shift])?; // 'A'
// Hold and release manuallyk.down(Key::Shift)?;k.press(Key::Char('a'))?;k.press(Key::Char('b'))?;k.up(Key::Shift)?;
// Type literal text (handles case and IME for you)k.type_text("Hello, world!")?;Key::Char rejects ASCII uppercase letters at the API boundary to prevent the common bug where chord(Key::Char('K'), &[Key::Meta]) is read as “Cmd+K” but would mean “Cmd+Shift+K” under auto-shift semantics. For arbitrary text, call type_text.
# Tap a single keysim.press("Enter")sim.press("a")
# Shortcuts — modifiers are just keys listed in `held`sim.chord("a", ["Meta"]) # Cmd/Ctrl+Asim.chord("z", ["Meta", "Shift"]) # Cmd/Ctrl+Shift+Z
# Type literal text (handles case and IME for you)sim.type_text("Hello, world!")Key names: single printable characters are literal ("a", "7", ";"); named keys use their Pascal name ("Enter", "Escape", "ArrowUp", "F5"); modifiers are "Shift", "Ctrl", "Alt", "Meta". Aliases: "Control", "Option", "Cmd"/"Command", "Esc", "Up"/"Down"/"Left"/"Right".
Uppercase letters are rejected by press/chord — pass "a" with "Shift" held, or use type_text for arbitrary text.
// Tap a single keyawait sim.press('Enter');await sim.press('a');
// Shortcuts — modifiers are just keys listed in `held`await sim.chord('a', ['Meta']); // Cmd/Ctrl+Aawait sim.chord('z', ['Meta', 'Shift']); // Cmd/Ctrl+Shift+Z
// Type literal text (handles case and IME for you)await sim.typeText('Hello, world!');Key names match the Python binding: literal characters, Pascal-cased named keys ('Enter', 'ArrowUp', 'F5'), modifiers 'Shift' / 'Ctrl' / 'Alt' / 'Meta'.
Example: drag-and-drop
Section titled “Example: drag-and-drop”Drag-and-drop has no accessibility equivalent on most platforms. Resolve both endpoints via the a11y tree first, then synthesise the drag.
let app = App::by_name("Finder", std::time::Duration::from_secs(5))?;let source = app.locator("list_item[name='notes.txt']").element()?;let target = app.locator("list_item[name='Archive']").element()?;
let sim = input_sim()?;sim.mouse().drag(&source, &target)?;app = xa11y.App.by_name("Finder")source = app.locator("list_item[name='notes.txt']").element()target = app.locator("list_item[name='Archive']").element()
sim = xa11y.input_sim()sim.drag(source, target)const app = await App.byName('Finder');const source = await app.locator("list_item[name='notes.txt']").element();const target = await app.locator("list_item[name='Archive']").element();
const sim = inputSim();await sim.drag(source, target);Error handling
Section titled “Error handling”| Error | When |
|---|---|
PermissionDenied | OS denied the synthesis permission (macOS Input Monitoring, etc.) |
Unsupported | Platform has no backend for the operation (e.g. Linux session with neither DISPLAY nor WAYLAND_DISPLAY set) |
NoElementBounds | Target Element has no bounds — re-fetch it or anchor on a point |
InvalidActionData | Uppercase Char key, malformed key name, or bad target tuple |
Platform | Raw OS failure (FFI return code etc.) |