Post-Touch

Expressive gesture in touchless creative interfaces · Design Project 1 · Department of Design, IIT Delhi

Long before language we made meaning with our hands. This project asks what that knowledge looks like when the hand, and not a learned device grammar, becomes the interface.

A hand rendered as five strokes of blue ink
Project
Design Project 1, Department of Design, IIT Delhi
Role
Concept, system design, build, study design
Guides
Prof. Sabyasachi Paldas · Prof. Gourab Kar
Status
Probe built · formal study forthcoming

To create
is human.

Hand stencils on the rock wall of the Cueva de las Manos
Cueva de las Manos, Patagonia. Hands pressed in pigment around nine thousand years ago.

The oldest marks we have are handmade. People pressed pigment around their own hands on a cave wall. The gesture came first and the image followed.

We point before we can speak. We push away what we fear and shape the air to describe what we mean. The hand is, as Kant put it, the visible part of the brain.

A banana fixed to a wall with tape
The near touch of two hands in Michelangelo's Creation of Adam
From the reach of Michelangelo's Adam to a banana taped to a wall, our charged images are about the gesture, not the object.

It began as a tool.
It became an instrument.

Post-Touch started as a proposal for a tool, a fluid medium that would mirror the fluid nature of the hand. A canvas with no mouse and no stylus, where brush size and colour mapped to movement. The refinement turned that tool into a research instrument. The question was no longer whether we could paint without hardware. It was what expressive gesture actually is, and why no one had measured it.

Functional gesture

Studied as a command. Swipe, pinch, point. Success is measured by accuracy, agreement, and error rate. The body is asked to be precise and repeatable.

Expressive gesture

Modulated on purpose. Precision is sometimes abandoned and rhythm carries meaning. No prior study used a generative canvas to capture it, so it stayed invisible. The gap is methodological as much as empirical.

What does expressive gesture look like when the body, and not a learned device grammar, is the interface for creative mark making?

  1. 01

    What kinematic parameters, velocity, depth, and dwell, characterise expressive gesture in mid air drawing?

  2. 02

    Which spatial zones of the interaction volume are preferentially used, and how does this relate to fatigue?

  3. 03

    What spontaneous gestural vocabularies do participants develop without prior instruction?

  4. 04

    How should kinematic data be encoded visually so the resulting mark preserves expressive intent?

  5. 05

    What design guidelines can be derived for inclusive expressive touchless interfaces?

The work rests on four ideas. Phenomenology, that the body is the site where perception and meaning happen and not a tool the mind drives. Kinematics, that speed, depth, and dwell describe how a gesture is made and not only what it points at. Creativity, that expressive work depends on uninterrupted attention an interface can support or break. And ergonomics, that every movement made in the air carries a physical cost the design has to account for.

Reading the hand used to
mean wearing the machine.

Each generation removed a layer of hardware until nothing was left between the hand and the camera.

1990s
A child wearing a Nintendo Power Glove

Interaction meant suiting up. The Power Glove was expensive and tethered, and the machine, not the hand, was the focus.

2010
A Microsoft Kinect sensor bar

The Kinect dropped the glove but still read the room with a special depth sensor.

Today
A hand with a MediaPipe skeleton of tracked joints overlaid

A standard webcam is enough. Machine learning finds the hand in plain RGB video, which is what Post-Touch runs on.

The canvas is the instrument,
not the product.

Post-Touch tracks both hands in real time and turns bare movement into painted marks. It runs at thirty frames per second, logging position, velocity, and depth for each hand. The painting is not the goal. It is the trace the research reads.

  1. 01

    Camera

    Laptop webcam, 30 fps, 1280×720

  2. 02

    Landmarks

    MediaPipe Hands, 21 points per hand

  3. 03

    Kinematics

    Per frame speed, depth, dwell

  4. 04

    Gesture

    Open, pinch, fist, point

  5. 05

    Brush

    Width, alpha, pigment, then canvas

A vocabulary the
hand already knows.

Nothing here has to be learned. The system listens for gestures already in the body, so a first time participant can begin without instruction. This is the principle of primal mimesis.

A hand with the index finger extended

Draw

The index fingertip is tracked in real time and its position drives the paint mark.

A pinching hand, thumb and index together

Pinch

Sliding a pinched hand left and right maps to the colour wheel, shifting the active hue.

A closed fist

Fist

Both hands closing at once changes the brush, the left held still while the right advances through the set.

An open palm

Open palm

The palm becomes an eraser. Any stroke within its radius is lifted from the canvas.

Concentric rings marking the point of contact between finger and thumb

Closing the
proprioceptive gap

Mid air has no surface to rest against, so the hand cannot feel where a mark will land. A ring anchored between the index and the thumb projects the exact point of contact and scales with the pinch aperture.

It is a small piece of pseudo haptics, a stand in for the resistance that is missing, and it resolved the disorientation almost immediately.

Eight brushes are organised into three families, with a ninth neutral line kept as a control. Each answers the same movement differently, mapping depth to weight, velocity to energy, and dwell to bloom. The mapping is where kinematics becomes image.

One gesture, many responses.

BrushFamilyDepthVelocityDwell
GlowMarkStroke weightBlur radiusCore opacity
BlossomMarkBranching countPetal scaleBloom radius
RibbonMarkNib widthNib angleInk density
NeonMarkGlow spreadStrobe frequencyCore thickness
ScatterWetParticle sizeSpread radiusParticle count
WatercolorWetWash opacityEdge grainingBloom delay
ElectricAtmosphericBranch depthArc frequencyResidual glow
ConstellationAtmosphericNode densityEdge lengthStar magnitude

What the instrument records

Every frame is written to a session log as position, velocity, depth, gesture, and active brush. A thirty minute session becomes a structured record that can be replayed, measured, and compared across participants.

Dominant
Non-dominant
0102030 min

Tracked time · Dominant 72% · Non-dominant 15% · Both hands 10%

Across early sessions the dominant hand carried most of the drawing while the non-dominant hand steadied and framed. Velocity and dwell separated the brushes cleanly, and most activity stayed within a comfortable central zone, with reach to the edges falling off as fatigue set in.

Claude as a witness

When a session ends, the finished canvas and a compact kinematic summary are sent to a vision language model. It returns a short metaphorical title and a two sentence reading grounded in the body and its movement. The human gesture makes the work and the machine reads it afterward.

Expressive gesture is not functional gesture performed artistically. It is a distinct register, marked by continuous modulation, spatial exploration, and rhythmic variation.

Participants consistently moved beyond the functional gesture space. They invented compound gestures the classifier never anticipated, like a slow pinch and draw to blend a wash. Four ideas carried the work. Primal mimesis, the reticle as pseudo haptics, the mapping of kinematics to visual form, and the machine as a respondent rather than a generator.

The system in motion

Live session capture, forthcoming

A recorded walkthrough of a painting session will live here.
The final Post-Touch research poster
Open in a new tab for full resolution.

The synthesis

The final poster gathers the pivot, the system, and the early findings onto one sheet. It is the dark artifact this lighter page unpacks, reproduced as it was presented.

Outputs and what comes next

  1. D1

    Kinematic Heatmap

    Instrument built
  2. D2

    Spatial Lexicon

    Data collection pending
  3. D3

    Kinematic to visual mappings

    Prototype implemented
  4. D4

    Universal reach guidelines

    Analysis pending

The best interface is
the one we were born with.

← Back to Work