From: Addressing big data variety using an automated approach for data characterization
Confidential data | Regular expression |
---|---|
| (?:5[1-5] [0-9]{2}|222[1-9]|22[3-9] [0-9]|2[3-6] [0-9]{2}|27[01][0-9]|2720)[0-9]{12} |
| 4[0-9]{12}(?:[0-9]{3})? |
3[47][0-9]{13} | |
| 3(?:0[0-5]|[68][0-9])[0-9]{11} |
Gulf Countries Civil ID | \d{1} (?!00)\d{2} (?!00)\d{2} (?!00)\d{2} (?!0000)\d{4} |
Greek Civil ID | [Α-Ω]{1,2}[0-9]{6} |
International Passport | [A-Z0-9<]{9}[0-9]{1}[A-Z]{3}[0-9]{7}[A-Z]{1}[0-9]{7}[A-Z0-9<]{14}[0-9]{2} |
IBAN | [a-zA-Z]{2}[0-9]{2}[a-zA-Z0-9]{4}[0-9]{7}([a-zA-Z0-9]?){0,16} |
(?:[a-z0-9!#$%&'*+/=?^_`{|}~-]+(?:\.[a-z0-9!#$%&'*+/=?^_`{|}~-]+)*|\"(?:[\×01-\×08\x0b\x0c\x0e-\x1f\×21\×23-\x5b\x5d-\x7f]|\\[\×01-\×09\x0b\x0c\x0e-\x7f])*\")@(?:(?:[a-z0-9](?:[a-z0-9-]*[a-z0-9])?\.)+[a-z0-9](?:[a-z0-9-]*[a-z0-9])?|\[(?:(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?)\.){3}(?:25[0-5]|2[0-4][0-9]|[01]?[0-9][0-9]?|[a-z0-9-]*[a-z0-9]:(?:[\×01-\×08\x0b\x0c\x0e-\x1f\×21-\x5a\×53-\x7f]|\\[\×01-\×09\x0b\x0c\x0e-\x7f])+)\]) | |
MAC Address | ([0-9A-Fa-f]{2}[:-]){5}([0-9A-Fa-f]{2}) |