Using Notebook to capture actual PII data in the fields we identify as PII fields?

Our current python code identifies potential PII fields when scanning documents, but does not actually check to see if there is actual PII data in those fields. How do we tweak the process to ensure this is captured? I am very new to Python, so this is something that would be difficult for me to determine with limited knowledge.