Entity extraction identifies sensitive information in your data, such as credit card numbers and bank routing numbers.
The following table lists the entity types that Data Cube can extract from your data source.
Entity Type | Description | Sensitivity Level | Entity Match Pattern |
---|---|---|---|
Social Security Number | Social security numbers. | High | entity_ssn:* |
Credit Card Number | Credit card numbers. | High | entity_ccn:* |
Email | Email addresses. | Low | entity_email:* |
Phone | Telephone numbers. | Low | entity_phone:* |
IP Address | IP addresses. | Low | entity_ip:* |
Routing Transit Number | Bank routing numbers. | Medium | entity_rtn:* |
Individual Taxpayer ID Number | Tax identification numbers. | High | entity_itin:* |
US Driving License | United States driving license identity numbers. | Medium | entity_usdl:* |
Hostname | Fully qualified domain names (FQDN). | Low | entity_hostname:* |
FinanceTags | Patterns of words and phrases that are significant to finance and other businesses. To view the complete list of words and phrases that are identified as entities using the FinanceTags entity type, see finance_tags.txt. | Medium | entity_finance_tags:* |
FinlandHetu | Finnish personal identity numbers (HETU). | Medium | entity_finland_hetu:* |
FranceINSEE | French national identity numbers (INSEE). | Medium | entity_france_insee:* |
GreeceAFM | Greek tax identity numbers (AFM). | Medium | entity_greece_afm:* |
GreeceTautotita | Greek national identity numbers (Tautotita). | Medium | entity_greece_tautotita:* |
GermanyPK | German personal identity numbers (Personenanzahl). | Medium | entity_germany_pk:* |
GermanySteuerID | German tax identity numbers (Steuer-ID). | Medium | entity_germany_steuer_id:* |
GermanyVSNR | German insurance identity numbers (Versicherungsnummer). | Medium | entity_germany_vsnr_rvnr:* |
UKNINO | United Kingdom National Insurance numbers. | Medium | entity_uk_nino:* |
UKNHS | United Kingdom National Health Service numbers. | Medium | entity_uk_nhs:* |
UKDL | United Kingdom driving license numbers. | Medium | entity_ukdl:* |
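
To show how the sensitivity levels and match patterns in the table relate, here is a minimal Python sketch that encodes the table as a lookup. It is not part of Data Cube. The names `ENTITY_SENSITIVITY`, `patterns_for`, and `highest_sensitivity` are hypothetical, and treating the part of each pattern before `:*` as a field name is an assumption made for illustration.

```python
from typing import Dict, Iterable, List, Optional

# Sensitivity levels used in the table, ordered lowest to highest.
LEVELS = ["Low", "Medium", "High"]

# Assumed field name (the part of the match pattern before ":*")
# mapped to the sensitivity level listed in the table.
ENTITY_SENSITIVITY: Dict[str, str] = {
    "entity_ssn": "High",
    "entity_ccn": "High",
    "entity_email": "Low",
    "entity_phone": "Low",
    "entity_ip": "Low",
    "entity_rtn": "Medium",
    "entity_itin": "High",
    "entity_usdl": "Medium",
    "entity_hostname": "Low",
    "entity_finance_tags": "Medium",
    "entity_finland_hetu": "Medium",
    "entity_france_insee": "Medium",
    "entity_greece_afm": "Medium",
    "entity_greece_tautotita": "Medium",
    "entity_germany_pk": "Medium",
    "entity_germany_steuer_id": "Medium",
    "entity_germany_vsnr_rvnr": "Medium",
    "entity_uk_nino": "Medium",
    "entity_uk_nhs": "Medium",
    "entity_ukdl": "Medium",
}


def patterns_for(level: str) -> List[str]:
    """Return the match patterns of every entity type at the given level."""
    return [f"{field}:*" for field, lvl in ENTITY_SENSITIVITY.items() if lvl == level]


def highest_sensitivity(fields: Iterable[str]) -> Optional[str]:
    """Return the highest level among the known fields, or None if none are known."""
    known = [ENTITY_SENSITIVITY[f] for f in fields if f in ENTITY_SENSITIVITY]
    return max(known, key=LEVELS.index) if known else None
```

For example, `patterns_for("High")` returns the patterns for Social Security Number, Credit Card Number, and Individual Taxpayer ID Number, and `highest_sensitivity(["entity_email", "entity_rtn"])` returns `"Medium"`.
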
For instructions on adding a custom entity type, see Configuring Custom Entities for Entity Extraction.