Introduction to Matrices and Modeling in Biology
A central theme in biology is that biological objects of
interest have structure that may affect the way those objects should be
analyzed. Moving up the hierarchical biological scale, examples include the various
components of cells, the cellular composition of tissues, the tissue
composition of individual organisms, the composition of different individuals
which make up a population, the species which make up a community and the types
of habitat across a landscape. Each of these biological entities has a
sub-structure. Think of different types of individuals within a population: the
males and females of different ages in a meerkat population, the demographic
(age, location, economic status, etc.) structure of a human population, the
chemical composition of cells, etc. One of the most useful mathematical
concepts provides a way to describe the composition of these biological
objects, and provides a mechanism to analyze how the composition might change
through time or across space.
A motivating example that we will use regularly in this
section of the course concerns patterns of species across a landscape. Think of
taking aerial photographs of a plot of land once each decade, following the
changes in pattern across the landscape. On a real landscape, this could be
looking at species, communities of plants (deciduous hardwoods, herbacious,
grassland, etc.), or land-use (agricultural, urban, suburban, etc.). For each
decade you have an photo in which you might be able to classify each area as
consisting of one of a few species. Then one descriptive summary you might use
for the landscape is to describe it by the fraction of the total area which is
covered by each of the species. In the EcoBeaker example, this corresponds to
what fraction of the landscape pictured on the computer is of the different
colors, representing different types of organisms (grass, blackberry bushes,
hickory trees, etc.).
Describing this landscape at a particular time then could
use a list of numbers giving the fraction of the landscape of each species, in
other words a vector of numbers which sum to one. Pictorially, this description
corresponds to the bar charts illustrated in the EcoBeaker example, showing the
number of squares in each landscape covered by each type of organism. There is
of course a loss of information in going from the picture of the landscape to a
simple list of numbers representing the fraction of the landscape of each type.
All information about the spatial arrangement is lost - that is we can't tell
from just a vector of numbers whether all the grass is clustered on one side of
the landscape or whether it is dispersed throughout the landscape.
As time goes on (e.g. we take a picture of the landscape
every decade), the vector describing the fraction covered by each type of
organism will change. If we could determine the rules by which this vector
changes, we would have a basis for a model of the landscape's dynamics (called
succession in ecology). From this model we might then be able to determine from
just a few decades of pictures, what the landscape might look like many decades
from now (the process we called extrapolation when we were talking about
regressions).
Note that this entire area of looking at landscapes and
how they can be described and how they change is part of the field of
geography. The main tool used to analyze changes is called Geographical
Information Systems (GIS), which is a fancy name for software that allows you
to look at, build and manipulate maps.
Our objective in this section of the course is to
construct the mathematical tools needed to analyze changes in vectors
describing landscapes. This doesn't just apply to landscapes though. It applies
to any biological entity which can be broken down into discrete classes. So we
will be able to use this to describe changes in the age breakdown of human
population of Knoxville (or any other region). This is the basis for the entire
field of demography - the study of changes in the structure of human populations
through time and space. But this is just as easily applied to any population -
bears in the Smokies, killer whales in the Pacific, meerkats in South Africa,
etc. More than that however, it applies to an immense number of biological
problems from cell population dynamics (think of bacterial populations in which
you follow which bacteria are resistant to each of a list of antibiotics), to
behavior (think of the fraction of a group of meerkats carrying out each of a
list of behaviors), to pharmacokinetics (think of the fraction of a drug
infused into the body in each of a list of tissues and how this changes through
time - there would be a class here called "removed" in which the drug
has been excreted).
It turns out that the mathematics which allows us to
describe changes in the structure of biological entities or landscapes is
simply described by building an appropriate set of rules for manipulating
numbers that are arranged in particular orders. The order matters because the
way we describe these structures depends upon the order that you list the
numbers. So a vector that describes the fraction of the landscape in (grass,
shrub, trees) as (.2, .5, .3) is different from the vector (.5, .2, .3) - these
represent very different landscapes - the order of the numbers matters. So we
have to build up a way of manipulating numbers in which order matters - this is
different from the typical rules of algebra you have seen since middle school.
We essentially have to come up with a way to manipulate vectors in the same way
you used the algebraic manipulations (addition, subtraction, multiplication and
division) for single numbers (single numbers are called "scalars" to
differentiate them from vectors which consist of lists of numbers in particular
orders). Just as numbers can be represented in general by a letter (the
essential idea of algebra you've seen since high school), vectors can be
represented by letters, and the elements of a vector (the numbers which make it
up) can be letters (algebraic entities) as well.
So what is a "Matrix" - yes it is the movie,
but in our case it simply represents a list of numbers arranged in a
2-dimensional array:
Yes this is just like the cascading stream of numbers in
the movie, but in this case they are not cascading - just static. What we will
do in this section of the course is first describe an algebra for how to do the
basic algebraic manipulations of vectors (just a row or column of numbers
arranged in a particular order) and matrices (the plural of matrix) which are
rectangular arrays of numbers.
Our objective in describing this area of mathematics
(called Matrix Algebra or Linear Algebra) is to get to the point of being able
to describe mathematically how the structure of a landscape, population, etc.
changes through time.
One way to express this is similar to the notion that had
for the equation describing exponential growth of a population - N(t)= N(0)
exp(-kt). Only this just describes how a single number giving the entire
population size of density (numbers of individuals per unit area or per unit
volume) changes with time. When we want to describe how a population with
structure changes, we need to express how a vector changes with time. If this
is in discrete periods, say from one year to the next, then we might express
how we go from a vector of population structure at time t, N(t) to a vector of population structure at time t+1,
N(t+1). (Note that I am using Bold
Type here to indicate that we are dealing
with a vector or matrix, not a scalar). To be able to do this, we need to learn
some basics of matrix manipulations.
Matrix algebra - basic operations
These are described in Section 55 of the text. This
includes the definition of an m x n matrix (m rows and n columns), what it
means to say that two matrices are equal (the element in a row and column of
one matrix is the same as the element in the same row and column of the second
matrix, and this holds for all elements), how to add matrices (you can only do
this if they are the same size and then you add each element of the first
matrix to the corresponding element of the second matrix to get the sum), how
to subtract matrices (the same as addition, only you subtract element by
element), how to multiply (can only do this if the number of columns of the
first matrix is the same as the number of rows of the second matrix), and the
basic laws of matrix algebra (the same as standard arithmetic, except that the
commutative law doesn't hold in general, so that A B is not the same as B A for two matrices A and B.