Python & Perl: Split File Path into Parts

,

Often, we are given a file fullpath and we need to split it into the directory name and file name. The file name is often split into a core part and a extension part. For example:

'/Users/t/web/perl-python/I_Love_You.html'
becomes

'/Users/t/web/perl-python/'  (directory name)
'I_Love_You'                 (file's base name)
'.html'                      (file's “extension”)

Depending on the language, some language will remove the trailing slash after the dir name, and some will omit the dot before the suffix.

Python

In Python, to split a full path into parts is done with the os.path module. Example:

# -*- coding: utf-8 -*-
# python

import os.path

myPath = '/Users/t/web/perl-python/I_Love_You.html'
(dirName, fileName) = os.path.split(myPath)
(fileBaseName, fileExtension)=os.path.splitext(fileName)

print dirName         # /Users/t/web/perl-python
print fileName        # I_Love_You.html
print fileBaseName    # I_Love_You
print fileExtension   # .html

Perl

In Perl, splitting a full path into parts is done like this:

# -*- coding: utf-8 -*-
# perl

use File::Basename;

$myPath = '/Users/t/web/perl-python/I_Love_You.html';

($fileBaseName, $dirName, $fileExtension) = fileparse($myPath, ('\.html') );

print $fileBaseName, "\n";   # I_Love_You
print $dirName, "\n";        # /Users/t/web/perl-python/
print $fileExtension, "\n";  # .html

Note: the second argument to fileparse() is a list of regex. In particular, you need to escape the dot. Normally, one gives it a value such as ('\.html', '\.HTML', '\.jpg', '\.JPG'). Yes, it is case sensitive. If you want to match any extension (that is, the string after the last dot), use ('\.[^.]+$').

For the official doc, type in command line: “perldoc File::Basename” or http://perldoc.perl.org/File/Basename.html

blog comments powered by Disqus