CSC 210

Final Project


Logistics:


Option 1: Your Choice

For this choice you are asked to construct a project of your own choosing. Talk to your professors (including us), TAs, consult books, journals, the library of equations in Phaser, etc. as possible sources. For example, you can find a model equation of your choice and simulate it on the computer to uncover a phenomenon; you can write code to implement some numerical algorithms. If you have access to some experimental data, you can try to fit the data to a dynamic model.


Option 2: Professor's Suggestion

  1. Try to read parts of the original paper Deterministic Nonperiodic Flow by E. N. Lorenz, Journal of the Atmospheric Sciences, 1963; 20: 130-141.
  2. These famous differential equations (25-27) of this paper are in the ODE library of PHASER.
  3. Try to reproduce some of the computational experiments descibed in the paper and create similar illustrations. Interpret your computational findings in terms of fluid mechanics.
  4. You should explore the three-dimensional plotting facilities of PHASER to visualize the solutions.
  5. You should create a sequence of simulations as the parameter r is varied and explain the dynamical changes (bifurcations) that can occur.
  6. You should consult other sources on this famous example.


Option 2: Professor's Suggestion

On the ncbi genome site, find a bacterial species with at least two chromosomes and download these two chromosomes. Then find another related species (in the same genus) and download one more chromosome from that one. Now compute the distribution of overlapping 4-mers (substrings of length 4) for each of these chromosomes, by counting the number of appearances of each 4-mer in each chromosome. Compute the correlation of the distributions using the sample Pearson's correlation coefficient. What do you observe? What happens if we vary the size of the k-mers (instead of 4 use another value)? How do these distributions correlate to a chromosome of another unrelated species from a different genus?


Option 2: Professor's Suggestion

Download a large chromosome (> 5Mb) of a bacterial species. Identify all open reading frames (ORFs) in this chromosome. Compare this number to the number of known genes in this chromosome. What are your observations? Did you discover any regions which could potentially indicate the existence of novel undiscovered genes?