In the first step, the XML file provided by the EPO is loaded, and the headnotes and catchwords are extracted using the script "extract_headnotes.py".
In the second step, the information in the CSV file is transformed into documents (TXT, DOCX, and MD) using the script "headnotes_to_doc.py".
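
The two steps can be illustrated with a minimal sketch. It is not the code of "extract_headnotes.py" or "headnotes_to_doc.py": the XML layout (`decisions`, `decision`, `headnote`, `catchword`), the CSV column names, and the function names `extract_rows`, `rows_to_csv`, and `csv_to_markdown` are all assumptions made for illustration, and the real EPO file will likely use different tag names. Only the Markdown output is shown, and only the Python standard library is used.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical XML layout; the real EPO file will likely differ.
SAMPLE_XML = """<decisions>
  <decision id="T 0001/20">
    <headnote>First headnote text.</headnote>
    <catchword>First catchword.</catchword>
  </decision>
  <decision id="T 0002/20">
    <headnote>Second headnote text.</headnote>
  </decision>
</decisions>"""

# Assumed CSV columns shared by both steps.
FIELDS = ["id", "headnote", "catchword"]


def extract_rows(xml_text):
    """Step 1a: collect one dict per decision from the XML."""
    root = ET.fromstring(xml_text)
    rows = []
    for decision in root.iter("decision"):
        rows.append({
            "id": decision.get("id", ""),
            # findtext returns None for a missing element, hence the fallback.
            "headnote": (decision.findtext("headnote") or "").strip(),
            "catchword": (decision.findtext("catchword") or "").strip(),
        })
    return rows


def rows_to_csv(rows):
    """Step 1b: serialize the extracted rows as CSV text."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def csv_to_markdown(csv_text):
    """Step 2: render each CSV row as a Markdown section."""
    sections = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        lines = [f"## {row['id']}", "", row["headnote"]]
        if row["catchword"]:
            lines += ["", f"Catchword: {row['catchword']}"]
        sections.append("\n".join(lines))
    return "\n\n".join(sections) + "\n"


if __name__ == "__main__":
    csv_text = rows_to_csv(extract_rows(SAMPLE_XML))
    print(csv_to_markdown(csv_text))
```

A TXT export could reuse the same loop with plain-text formatting. A DOCX export would need a third-party package such as python-docx, which is left out here to keep the sketch dependency-free.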