Skip to content
/ PPRL Public

privacy preserving record linkage using different approaches of bloom filter encoding

License

Notifications You must be signed in to change notification settings

caniekwe/PPRL

Repository files navigation

Privacy Preserving Record Linkage (PPRL)

This project demonstrates steps to implement privacy preserving record linkage using different approaches of bloom filter encoding. This project was implemented using the FEBRL synthetic dataset.

Project Features

  1. Data pre-processing (data cleaning, phonetic encoding)
  2. Privacy preservation (field-level & record-level bloom filters, bloom filter hardening techniques)
  3. Blocking and indexing
  4. Comparison (dice coefficient similarity)
  5. Classification (supervised and unsupervised)
  6. Evaluation (blocking:pair completeness, reduction ratio; linkage: accuracy, f1 score, precision, recall)

Requirements

Complied using:

Referenced projects

This project uses a few other projects as below:


License

This project is licensed under the GNU General Public License v3.0

About

privacy preserving record linkage using different approaches of bloom filter encoding

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published