What is an image?#

In this exercise we explore the nature of NIfTI images.

First we will have a look at Python strings. Here is a variable called my_string with value “neuroimaging is hard but fair”

my_string = "neuroimaging is hard but fair"
# Show the result
my_string
'neuroimaging is hard but fair'

We can see what type of thing this variable contains (points to) using the type function:

type(my_string)
str

We can see how many characters the string has with the len function:

len(my_string)
29
# The first character of the string
print(my_string[0])
n
# The last character of the string (don't forget indexing starts at 0)
print(my_string[28])
r

Now we introduce string slicing. This is where you take some sequential characters from the string, using the colon (:) between the square brackets. The value before the colon is the index to the first character you want, and the value after the colon is the index to the character after the last character you want. It sounds strange, but you will get used to it…

# The first two characters of the string
print(my_string[0:2])  # from index 0 up to, but not including, 2
ne
# The first 5 characters of the string
print(my_string[0:5])  # from index 0 up to, but not including, 5
neuro

We will go into more details on strings and slicing soon.

Now we will try loading an example image and seeing if we can understand the image data.

Exploring modules and objects#

If you want to explore modules or objects, type their name followed by a period, and press tab to see what functions or classes are available.

Let us start by making an object to point to the current working directory. We can do that with the Path class from the pathlib module:

# Import the Path class
from pathlib import Path
# Path() points to the current working directory by default.
cwd = Path()
cwd
PosixPath('.')

The working directory of the notebook is the directory that contains this notebook file.

The new cwd object is if type (class) Path.

type(cwd)
pathlib.PosixPath

Try exploring this cwd object now. Type cwd. (cwd followed by a period) then press the Tab key, to see everything attached to the cwd object. Continue typing so you have cwd.absolute, and then type ? followed by Return. This shows you the help for the Path absolute method. (Remember, a method is a function attached to an object).

# Use this cell to explore the "cwd" object.

Using the absolute method, we can print out the full (absolute) path to Python’s working directory:

cwd.absolute()
PosixPath('/home/runner/work/dipy-textbook/dipy-textbook')

We next fetch a data file from the web. We have a special utility to do that, that knows where the data files are for this course. The utility is called, simply, nipraxis.

# Load utility that will fetch data from the web and store it.
import nipraxis

Here we ask Nipraxis to download the data to the local hard disk.

structural_fname = nipraxis.fetch_file('ds114_sub009_highres.nii')
# Show the filename.
structural_fname
'/home/runner/.cache/nipraxis/0.5/ds114_sub009_highres.nii'

Let’s read the bytes from the image into memory using the read_bytes method of the Path object:

# Open a file, read in binary bytes.
contents = Path(structural_fname).read_bytes()

How do I find out what type of object is attached to this variable called contents?

# your code here

How big is this file in terms of bytes? Can you find out from the contents variable? (Hint: you want to know the length of contents).

# n_bytes = ?

If 1 mebibyte (MiB) (http://en.wikipedia.org/wiki/Megabyte) is size 1024 * 1024, what is the file size in MiB? (Hint - the right answer is between 0 and 100).

# n_mib = ?

This is a NIfTI1 format file. That means that the first 352 bytes contains the “header” that describes the parameters of the image and the data following.

We might want to print out the contents of the first 352 bytes of contents to have a look at it.

To do this, you will need to use string slicing to get the first 352 bytes:

# Here you print out the first 352 characters of `contents`
# Your code here:
# print(...)

Which software wrote this image?

Here is the format of the NIfTI1 header : http://nifti.nimh.nih.gov/nifti-1/documentation/nifti1fields

We are now going to try and work out the datatype of this image. This is stored in the datatype field of the header. Careful - there is also a data_type field (with an underscore), which we will ignore.

Looking at the web page above, how many bytes is the datatype value stored in?

How would you get the bytes in contents that contain the datatype value?

Hint - you need slicing again, and the information from Byte offset column in the NIfTI1 header page above):

Hint2 - if you want to work it out, don’t look at the cell further down!

# data_type_chars = ?

The datatype value is stored in binary form (rather than text form). The value for datatype is stored in the header in the same format that the computer stores the number in memory. We want to convert this binary format to a number that Python understands. To do that, we use the struct module.

import struct

We are going to use the struct.unpack function. Open a new cell below this one with b and type struct.unpack? followed by Shift-Return to see the help for this function.

Now we have read the help, we know we need two things. The first is a string that give the code for the binary format of the data. This is the “format string”. The second is the string containing the bytes of the data.

We first need to specify the format of the character data. Have a look at the help on format strings in the Python documentation and the NIfTI web page above.

Here is the format specifier for our value:

fmt_specifier = 'h'  # Why? (check the web pages above)

Now we read the datatype value into a number that Python understands:

# In case you didn't do this above, this is the way to get the bytes
# we need from the header.
data_type_chars = contents[70:72]
datatype = struct.unpack(fmt_specifier, data_type_chars)
print(datatype)
(16,)

This is a numerical code for a data type. What actual data type is this? (See: http://nifti.nimh.nih.gov/nifti-1/documentation/nifti1fields/nifti1fields_pages/datatype.html)

We could continue reading the NIfTI header in the same way, but luckily someone has done that work for us. Enter the nibabel package:

import nibabel

For now, we will use this package without worrying much about how it works. Have a look to see what nibabel can do by opening up a new cell with b and typing nibabel? and nibabel. followed by Tab.

As with most Python packages, you can check what version of nibabel you have by printing the __version__ variable of the package:

print(nibabel.__version__)
5.1.0

If you have a nibabel version below 3.0, please let your instructor know so they can fix that.

You can make an image object by passing an image file name to nibabel.load:

img = nibabel.load(structural_fname)

We now have a Nibabel image object - and specifically, a Nifti type of image object.

type(img)
nibabel.nifti1.Nifti1Image

Let’s have a look at the header that Nibabel read in when it made the image object.

print(img.header)
<class 'nibabel.nifti1.Nifti1Header'> object, endian='<'
sizeof_hdr      : 348
data_type       : b''
db_name         : b''
extents         : 0
session_error   : 0
regular         : b'r'
dim_info        : 0
dim             : [  3 256 156 256   1   1   1   1]
intent_p1       : 0.0
intent_p2       : 0.0
intent_p3       : 0.0
intent_code     : none
datatype        : float32
bitpix          : 32
slice_start     : 0
pixdim          : [1.        1.        1.3002223 1.        0.00972   0.        0.
 0.       ]
vox_offset      : 0.0
scl_slope       : nan
scl_inter       : nan
slice_end       : 0
slice_code      : unknown
xyzt_units      : 10
cal_max         : 0.0
cal_min         : 0.0
slice_duration  : 0.0
toffset         : 0.0
glmax           : 0
glmin           : 0
descrip         : b'FSL5.0'
aux_file        : b''
qform_code      : scanner
sform_code      : scanner
quatern_b       : -0.11747453
quatern_c       : 0.008146102
quatern_d       : 0.022481605
qoffset_x       : -129.82573
qoffset_y       : -119.09057
qoffset_z       : -143.41777
srow_x          : [ 9.9885648e-01 -6.0528666e-02  1.0895197e-02 -1.2982573e+02]
srow_y          : [ 4.2725317e-02  1.2630211e+00  2.3362094e-01 -1.1909057e+02]
srow_z          : [-2.1454209e-02 -3.0280653e-01  9.7226673e-01 -1.4341777e+02]
intent_name     : b''
magic           : b'n+1'

As you can see, it has worked out the datatype for us.

Soon, we do some more work to get used to basic Python. After that we will start playing with the image using the Python tools for arrays, and for plotting.