The first column is the project number and the columns after that are the applications that are impacted by each project (quick example made up). My question I want to solve is how many projects are in a application?
The problem is I can't count it by a specific text string because the data is not all consistent. If I put a wild card for a specific string, it will count those apps who may contain that text or letters. There is not a uniqueness in the naming convention. Also, the data set is too large to manually convert every instance of an application to be the exact same. Replace all won't be able to replace all the different instances of how an app is spelled, cap differences, or # of spaces without manually changing the values to be consistent. For example, pink white is the same as Pink White, pink White, Pink white, etc.
Please help or ideas to help minimize manually clean up.