Skip to content

Latest commit

 

History

History
30 lines (25 loc) · 1.45 KB

README.md

File metadata and controls

30 lines (25 loc) · 1.45 KB

Privacy Preserving Record Linkage (PPRL)

This project demonstrates steps to implement privacy preserving record linkage using different approaches of bloom filter encoding. This project was implemented using the FEBRL synthetic dataset.

Project Features

  1. Data pre-processing (data cleaning, phonetic encoding)
  2. Privacy preservation (field-level & record-level bloom filters, bloom filter hardening techniques)
  3. Blocking and indexing
  4. Comparison (dice coefficient similarity)
  5. Classification (supervised and unsupervised)
  6. Evaluation (blocking:pair completeness, reduction ratio; linkage: accuracy, f1 score, precision, recall)

Requirements

Complied using:

Referenced projects

This project uses a few other projects as below:


License

This project is licensed under the GNU General Public License v3.0