Here are four lines of a sample text file:
The Star Schema is the simplest style of data mart schema
The star schema consists of one or more fact tables referencing any number of dimension tables
Pay attention to bogus schema
Cheers
The Python code should create a list of the words in the file, sorted alphabetically, with duplicate words removed and capitalized words sorted first.
The final output should look like this:
["Cheers", "Pay", "Schema", "Star", "The", "any", "attention", "bogus", ...]
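
One possible approach, as a minimal sketch rather than a definitive answer: split the text on whitespace, put the words in a set to drop duplicates, and sort the result. Python's default string ordering compares code points, so ASCII uppercase letters come before lowercase ones, which gives the "capitalized words first" order. The names `SAMPLE_TEXT` and `unique_sorted_words` are made up for illustration, and the sketch assumes the words are separated by whitespace with no punctuation to strip.

```python
# The four sample lines, embedded as a string so the example is self-contained.
# To read a real file instead, something like open(path).read() could supply the text.
SAMPLE_TEXT = """\
The Star Schema is the simplest style of data mart schema
The star schema consists of one or more fact tables referencing any number of dimension tables
Pay attention to bogus schema
Cheers
"""


def unique_sorted_words(text):
    """Return the distinct whitespace-separated words of `text`, sorted."""
    # set() removes duplicates; sorted() orders strings by code point,
    # so "The" sorts before "any" because uppercase precedes lowercase.
    return sorted(set(text.split()))


words = unique_sorted_words(SAMPLE_TEXT)
print(words)
```

Note that "The" and "the" are kept as two separate words here, since the comparison is case-sensitive. If words in the real file carry punctuation such as trailing commas or periods, they would need to be stripped before deduplicating.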