robinbressan / regex-parser
AST for PCRE Regex
Installs: 12 436
Dependents: 0
Suggesters: 0
Security: 0
Stars: 23
Watchers: 5
Forks: 4
Open Issues: 0
Requires (Dev)
- phpspec/prophecy: ~1.0
- phpunit/phpunit: 3.7.32
This package is not auto-updated.
Last update: 2025-01-18 18:16:10 UTC
README
RegexParser is a parser for PCRE regex. It produces an AST which represents your regex. It can help you generating some inputs which match your regex.
Installation
It is available with Composer:
composer install robinbressan/regex-parser
Usage
To build an AST you need to create a parser:
$parser = \RegexParser\Parser\Parser::create(); $ast = $parser->parse('YOUR_REGEX');
You can now use a formatter to convert the AST to several format (only XML is supported today):
$formatter = new \RegexParser\Parser\Formatter\XMLFormatter(); $xml = $formatter->format($ast); // $xml is now an instance of DOMDocument
If you wish you can display it easily:
$xml->formatOutput = true; echo $xml->saveXML();
Because you can get a DOMDocument, you are able to use XPath engine to query your AST.
Example
The regex ^[_a-z0-9-]+(\.[_a-z0-9-]+)*@[a-z0-9-]+(\.[a-z0-9-]+)*(\.[a-z]{2,4})$
will produce the following AST:
<?xml version="1.0" encoding="utf-8"?> <ast> <begin> <repetition min="1"> <block sub-pattern="false"> <token type="underscore">_</token> <character-class> <token type="char">a</token> <token type="char">z</token> </character-class> <character-class> <token type="integer">0</token> <token type="integer">9</token> </character-class> <token type="minus">-</token> </block> </repetition> </begin> <repetition min="0"> <block sub-pattern="true"> <token type="char">.</token> <repetition min="1"> <block sub-pattern="false"> <token type="underscore">_</token> <character-class> <token type="char">a</token> <token type="char">z</token> </character-class> <character-class> <token type="integer">0</token> <token type="integer">9</token> </character-class> <token type="minus">-</token> </block> </repetition> </block> </repetition> <token type="at">@</token> <repetition min="1"> <block sub-pattern="false"> <character-class> <token type="char">a</token> <token type="char">z</token> </character-class> <character-class> <token type="integer">0</token> <token type="integer">9</token> </character-class> <token type="minus">-</token> </block> </repetition> <repetition min="0"> <block sub-pattern="true"> <token type="char">.</token> <repetition min="1"> <block sub-pattern="false"> <character-class> <token type="char">a</token> <token type="char">z</token> </character-class> <character-class> <token type="integer">0</token> <token type="integer">9</token> </character-class> <token type="minus">-</token> </block> </repetition> </block> </repetition> <end> <block sub-pattern="true"> <token type="char">.</token> <repetition min="2" max="4"> <block sub-pattern="false"> <character-class> <token type="char">a</token> <token type="char">z</token> </character-class> </block> </repetition> </block> </end> </ast>
Generator
You can also create a generator based on your AST which will generate a string that match your regex:
$generator = new \RegexParser\Generator\RandomGenerator($ast); $generator->generate($seed = null);
If you wish you can also create directly the generator :
$generator = new \RegexParser\Generator\RandomGenerator::create('YOUR_REGEX'); $generator->generate($seed = null);
Test
To run the test you must run phpunit
command.
Contributing
All contributions are welcome. If you add a new feature, ensure all tests are green!
License
This application is available under the MIT License.