I am using Python and spaCy as my NLP library. I am new to NLP work and I hope for some guidance in order to extract tabular information from a text. My goal is to find what type of expenses are frozen or not. Any guidance would be highly appreciated.
TYPE_OF_EXPENSE FROZEN? NOT_FROZEN?
purchase order frozen null
capital frozen null
consulting frozen null
business meetings frozen null
external hires frozen null
KM&L null not frozen
travel null not frozen
import spacy
nlp = spacy.load('en_core_web_sm')
doc = nlp(u'Non-revenue-generating purchase order expenditures will be frozen. All capital
related expenditures are frozen effectively for Q4. Following spending categories
are frozen: Consulting, (including existing engagements), Business meetings.
Please note that there is a hiring freeze for external hires, subcontractors
and consulting services. KM&L expenditure will
not be frozen. Travel cost will not be on ‘freeze’.)
My ultimate goal is to extract all this table into an excel file. Even if you can advise for few of the categories above I would be deeply grateful. Thank you very much in advance.