Google DeepMind open-sources LayoutLM-Next for interactive design understanding
AI ยท 6 min read
LayoutLM-Next is an evolution of layout and document understanding models but reoriented to UI semantics: component types, hierarchy, accessibility attributes, and interaction affordances. It produces structured outputs that are easier to map into design tool primitives or automated testing scripts.
DeepMind provided prebuilt adapters for common formats (HTML, Figma, JSON layer maps) and released evaluation datasets focusing on UI comprehension benchmarks. The permissive license aims to accelerate academic and commercial tooling innovations while encouraging dataset citations.
Toolmakers praised the clarity of the structured output format, saying it reduces glue code between recognition and generation stages. DeepMind also announced a community challenge with prize funding to build useful integrations into prototyping flows.