Automatic sanitization of textual documents

  • Fundació URV
  • From Spain
  • Responsive
  • Innovative Products and Technologies

Summary of the technology

Redaction or sanitization is required to declassify sensitive textual documents or make them available for secondary use. This task, which is complex, time-consuming and prone to errors, is performed manually by one or several human experts. Our technology automatizes the process by automatically detecting terms and term combinations appearing in the documents that may disclose sensitive information. Such terms are then subject to redaction or generalization.

Fundació URV

Details of the Technology Offer

Our solution consists on a semantic privacy model by which the users can intuitivelly define their privacy requirements on the document contents, that is, which topics they consider sensitive.

Then, an automated algorithm analyses the document content in order to detect individual terms or combinations of terms that partially or totally disclose any of the sensitive topics stated by the user. This assessment relies on the information distribution in the Web, which represents the knowledge an attacker may use when attempting to disclose sensitive data in the protected document.

Finally, another automated algorithm redacts (supresses) or generalizes risky terms consistently with the privacy requirements stated by the user.

More technical details are provided in the following papers:
1) https://arxiv.org/abs/1406.4285
2) https://arxiv.org/abs/1701.00436

Intellectual property status

Other forms of protection

Current development status

Experimental technologies

Desired business relationship

Technology development

New technology applications

Adaptation of technology to other markets

Related Keywords

  • Data Protection, Storage Technology, Cryptography, Data Security
  • Information Technology/Informatics
  • Computer related
  • Computer Software Market
  • Applications software
  • semantics
  • sanitization
  • document redaction
  • ontologies
  • privacy

About Fundació URV

The Technology Transfer and Innovation Center (CTTi) meets from the University environment the technological needs and services generated by the productive sectors and administration, through the management of Transfer of Technology and Knowledge, the Intellectual and Intellectual Property management, Technology Watch, Entrepreneurship, and Technology Infrastructures Offer (business incubator).

Fundació URV

Never miss an update from Fundació URV

Create your free account to connect with Fundació URV and thousands of other innovative organizations and professionals worldwide

Fundació URV

Send a request for information
to Fundació URV

About Technology Offers

Technology Offers on Innoget are directly posted
and managed by its members as well as evaluation of requests for information. Innoget is the trusted open innovation and science network aimed at directly connect industry needs with professionals online.

Help

Need help requesting additional information or have questions regarding this Technology Offer?
Contact Innoget support