FLAIDec 28, 2020

FOREST: An Interactive Multi-tree Synthesizer for Regular Expressions

arXiv:2012.14235v113 citations
AI Analysis

This work addresses the challenge of writing regular expressions for form validators, making it easier for users to create robust digital forms.

This paper introduces FOREST, a regular expression synthesizer for digital form validations that generates regular expressions and conditions over capturing groups. FOREST successfully returns the desired regular expression in 72% of real-world instances, outperforming the state-of-the-art REGEL.

Form validators based on regular expressions are often used on digital forms to prevent users from inserting data in the wrong format. However, writing these validators can pose a challenge to some users. We present FOREST, a regular expression synthesizer for digital form validations. FOREST produces a regular expression that matches the desired pattern for the input values and a set of conditions over capturing groups that ensure the validity of integer values in the input. Our synthesis procedure is based on enumerative search and uses a Satisfiability Modulo Theories (SMT) solver to explore and prune the search space. We propose a novel representation for regular expressions synthesis, multi-tree, which induces patterns in the examples and uses them to split the problem through a divide-and-conquer approach. We also present a new SMT encoding to synthesize capture conditions for a given regular expression. To increase confidence in the synthesized regular expression, we implement user interaction based on distinguishing inputs. We evaluated FOREST on real-world form-validation instances using regular expressions. Experimental results show that FOREST successfully returns the desired regular expression in 72% of the instances and outperforms REGEL, a state-of-the-art regular expression synthesizer.

Foundations

The foundational work for this paper's niche, ranked by how specifically the neighbourhood builds on it — not by global fame.

Your Notes