Structured data classification


Data classification is the process of assigning data, either as a whole (e.g. a database schema) or in part (e.g. a column name or column values), to categories. The data can also be evaluated for its identifiability, sensitivity and/or confidentiality. In this work, we focus on structured and semi-structured data.
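
To make the distinction between classifying by column name and classifying by column values concrete, here is a minimal Python sketch. The categories, regular expressions, the `classify_column` helper and the 0.8 match threshold are all hypothetical, chosen only for illustration; they are not the method developed in this work.

```python
import re

# Hypothetical categories and patterns, chosen only for illustration.
NAME_PATTERNS = {
    "email": re.compile(r"e[-_]?mail", re.IGNORECASE),
    "phone": re.compile(r"phone|mobile", re.IGNORECASE),
}
VALUE_PATTERNS = {
    "email": re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+"),
    "phone": re.compile(r"\+?\d[\d\s().-]{6,}\d"),
}


def classify_column(name, values, min_match_ratio=0.8):
    """Return a category for one column, or "unknown".

    `values` is assumed to be a list of strings sampled from the column.
    """
    # Metadata signal: does the column name suggest a category?
    for category, pattern in NAME_PATTERNS.items():
        if pattern.search(name):
            return category

    # Content signal: do most non-empty values look like a category?
    non_empty = [v for v in values if v]
    if non_empty:
        for category, pattern in VALUE_PATTERNS.items():
            hits = sum(1 for v in non_empty if pattern.fullmatch(v))
            if hits / len(non_empty) >= min_match_ratio:
                return category

    return "unknown"
```

In this sketch the column name is checked first and the values are used only as a fallback; a real classifier would more likely combine both signals and weigh them against each other.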

Our goal is to identify, classify and understand the data residing in structured repositories such as relational databases (tables), object storage (sets of related semi-structured files) and single semi-structured files (e.g. a patient release form in XML).

