Welcome to Software Development on Codidact!

Will you help us build our independent community of developers helping developers? We're small and trying to grow. We welcome questions about all aspects of software development, from design to code to QA and more. Got questions? Got answers? Got code you'd like someone to review? Please join us.

Open file in script's own folder

−0

I have a Python script that needs to access some data (or configuration) file in its very own folder. For example, say script.py does something like this:

with open('data.txt') as file:
    data = file.read()

The script will find the file, data.txt, if it is run in the terminal via python script.py from the same folder the script itself is in. But I want to call the script from any other folder, then with a relative or absolute path: python path/to/script.py. In which case it will fail to find the data file, raising FileNotFoundError.

How can I make sure the script finds the external file in its own folder?

python file-handling

posted over 3 years ago

CC BY-SA 4.0

3y ago by Alexei‭

J-hen‭

171 reputation 2 1 30 1

Raw

Markdown

History

is a duplicate

This question has been asked before and has already been answered. It should be marked as a duplicate.

Please enter the URL of the proposed duplicate in the details field below.

not constructive

This question cannot be answered in a way that is helpful to anyone. It's not possible to learn something from possible answers, except for the solution for the specific problem of the asker.

0 comment threads

2 answers

Score Active Age

+10

−0

Worked for J-hen‭

The following users marked this post as Works for me:

User	Comment	Date
J-hen‭	(no comment)	Dec 16, 2021 at 16:30

You can use the pathlib standard module with __file__ to make things simple.

from pathlib import Path

scriptFolder = Path(__file__).parent

with open(scriptFolder / 'data.txt') as file:
	data = file.read()

posted over 3 years ago

CC BY-SA 4.0

Moshi‭

2366 reputation 16 36 273 55

Copy Link

Raw

Markdown

History

0 comment threads

−0

Theory

Relative paths are relative to the current working directory of the Python process. This can be checked from within Python using the os.getcwd function and set using the os.chdir function.

Whenever Python loads a module, a __file__ attribute may be set on the resulting module object, giving the path to the source code file that the module is based on. (Normally, this will happen for all user code; but the attribute could be missing for built-in modules like sys, or it could be the name of a file containing compiled code if the module was implemented in C.) Within the code for that module, these attributes are just the global variables for that code. (There is also no __file__ global variable available when using the REPL.)

The technique

Therefore, the code in script.py can simply check the __file__ global variable to find a full path to script.py; and the containing folder for script.py can thus be determined by parsing that path, in any number of ways. The simplest way is to take advantage of the pathlib standard library module (introduced in Python 3.4), as shown in Moshi's answer.

In Python 3.9 and above, the same technique also works if script.py was run directly as a module. However, in Python 3.4 through 3.8, the __file__ attribute will be a relative path if script.py was run directly as a module. (For example, if the command was python script.py, the __file__ value will be simply 'script.py', and the Pathlib logic will produce just '.'.)

As a result, this technique will break if the current working directory has changed between the time the script was started and the time that the path taken from __file__ is used.

To avoid this issue, make sure to determine __file__ immediately, compute an absolute path immediately, and store that path until needed:

from pathlib import Path

# when the script starts
here = Path(__file__).parent.absolute()

# later, possibly after an `os.chdir` call
with open(here / 'data.txt') as file:
    data = file.read()

Legacy support (Python 3.3 and earlier)

In older versions of Python, __file__ is still documented to be present on module objects (and thus be available as a global variable), with the same purpose. However, it is not guaranteed to be an absolute path. (For an absolute import, it will depend upon the entry in sys.path that was used for the import.) Further, pathlib is not available.

Thus, the necessary code might look like:

import os

# when the script starts
here = os.path.split(os.path.abspath(__file__))[0]

# later, possibly after an `os.chdir` call
with open(scriptFolder / 'data.txt') as file:
    data = file.read()

Here, os.path.abspath creates an absolute path, and os.path.split splits it into two parts - the "path" (i.e., to the containing folder) and the filename itself.

Credit to kindall and wim on Stack Overflow for the corresponding research about the semantics of __file__. kindall uncovered the legacy semantics, which agf then described in that answer; later, wim edited to supply the updated semantics in Python 3.4+.

posted over 1 year ago

CC BY-SA 4.0

Karl Knechtel‭

2455 reputation 40 59 314 87

Copy Link

Raw

Markdown

History

1 comment thread

Also mention package_data? (2 comments)

Communities

Open file in script's own folder

0 comment threads

2 answers

0 comment threads

Theory

The technique

Legacy support (Python 3.3 and earlier)

1 comment thread