I want to get all 'xlsx' files that somewhere have 'feedback report' in them. I want to make this filter very strong. So any partial matches like 'feedback_report', 'feedback report', 'Feedback Report' should all return true.
Example file names :
- ZSS Project_JKIAL-SA_FEEDBACK_REPORT_Jan 29th 2015.xlsx
- ZL-SA_feedback report_012844.xlsx
- ASARanem-SA_Feedback Report_012844.xlsx
A futile attempt below.
regex = re.compile(r"[a-zA-Z0-0]*[fF][eE][eE][dD][bB][aA][cC][kK]\s[rR][eE][pP][oO][rR][tT][a-zA-Z0-0]*.xlsx")
.*feedback[\s_]report.*\.xlsxseems to be sufficient with theIGNORECASEoption.