26

I have a directory which contains the following files:

apple1.json.gz
apple2.json.gz
banana1.json.gz
melon1.json.gz
melon2.json.gz

I wish to find all of the apple, banana and melon file types.

From this SO answer I know that I can find by file type by:

import glob, os
os.chdir("/mydir")
for file in glob.glob("*.json.gz"):
    print(file)

However, in my case I can't match by file name or file type. Rather it is a partial file name match (all apple's and so on)

In this SO question, this solution was proposed:

[in] for file in glob.glob('/path/apple*.json.gz'):
    print file

However, this returns zero

[out]
     0
Community
  • 1
  • 1
LearningSlowly
  • 8,641
  • 19
  • 55
  • 78

2 Answers2

36

Having the files in /mydir as follows

mydir
├── apple1.json.gz
├── apple2.json.gz
├── banana1.json.gz
├── melon1.json.gz
└── melon2.json.gz

you could either do

import glob
import os

os.chdir('/mydir')
for file in glob.glob('apple*.json.gz'):
    print file

or

import glob

for file in glob.glob('/mydir/apple*.json.gz'):
    print file

Changing directories will not effect glob.glob('/absolute/path').

Simon Fromme
  • 3,104
  • 18
  • 30
3

Double List Comprehension Method

I was looking for a similar tool and developed a double list comprehension method that should work well for your case (I tested it for my case) ...

import os

def get_file_names_with_strings(str_list):
    full_list = os.listdir("path_to_your_dir")
    final_list = [nm for ps in str_list for nm in full_list if ps in nm]

    return final_list


print(get_file_names_with_strings(['apple', 'banana', 'melon']))
Thom Ives
  • 3,642
  • 3
  • 30
  • 29