digiaonline / common-sanitization-stages
A collection of pipeline stages that are commonly needed when sanitizing data
Installs: 35 319
Dependents: 0
Suggesters: 0
Security: 0
Stars: 0
Watchers: 6
Forks: 0
Open Issues: 0
Requires
- php: >=7.1
- league/pipeline: ^0.3.0 | ^1.0
Requires (Dev)
- ezyang/htmlpurifier: ^4.10
- phpstan/phpstan: ^0.9.2
- phpunit/phpunit: ^7.2
Suggests
- ezyang/htmlpurifier: Required to use the HTML purification stage
README
A collection of pipeline stages that are commonly needed when sanitizing data
Requirements
- PHP >= 7.1
Installation
composer require digiaonline/common-sanitization-stages
Usage
Let's say you're importing some legacy data into a new system. The original data is user generated content, so it cannot be trusted. Additionally, it contains some simple HTML that you want to strip. On top of all this, the legacy data must be possible to store in a CSV file, so you need to encode it somehow so the CSV delimiter is guaranteed not to occur in the text value.
To accomplish this, just combine the stages you need into a pipeline, then run the pipeline against your data:
<?php require_once(__DIR__.'/vendor/autoload.php'); $rawInputData = <<<EOT The quick brown fox<br /> jumped over the <i>incredibly lazy dog</i> & it ran away. EOT; $encodedInputData = \base64_encode($rawInputData); /** @var \League\Pipeline\Pipeline $pipeline */ $pipeline = (new \League\Pipeline\Pipeline()) ->pipe(new \Digia\Sanitization\Stages\Base64DecodeStage()) ->pipe(new \Digia\Sanitization\Stages\HtmlPurifierStage()) ->pipe(new \Digia\Sanitization\Stages\HtmlEntityDecodeStage()) ->pipe(new \Digia\Sanitization\Stages\StripLineFeedsStage(["\n"], true)) ->pipe(new \Digia\Sanitization\Stages\TrimStringStage()); $outputData = $pipeline->process($encodedInputData); var_dump($outputData); // string(70) "The quick brown fox jumped over the incredibly lazy dog & it ran away."
License
MIT