Maximally distant genomes under the DCJ operation

Similar documents
Advanced Digital Signal Processing Part 4: DFT and FFT

Monomino-Domino Tatami Coverings

Domino Tatami Covering is NP-complete

Smart Grid Tables. Exploring the Electrical Infrastructure Grades 9-12

«9 ~ Leucaena hedgerow c c c c c c c Maize plants in

Symmetries, patterns and orbifolds

arxiv: v1 [math.co] 19 Mar 2014

Lecture 8 OPENINGS &ORGANIZATION OF FORM AND SPACE

GENETICS. (a) Using appropriate letter symbols. Work out the genotype of the f1 generation

Corn Rootworm Control in Field Corn with planting time treatments. Soil Insecticide Test # 3, 2005

3-5 / Math: Planting a Square Foot Garden

A New Construction of 16-QAM Golay Complementary Sequences

2-4/114 Number Board CC.2.1.K.A.1

MODEL SF-10 CONTROL OPERATION AND INSTRUCTION MANUAL

Lecture 4 Circulation MoveMent through Space

Plumber Level 3 Rev. May 2011

Table of Contents PART I: The History and Current Status of the Industrial HMI PART II: Fundamentals of HMI Design and Best Practices

Selecting Circulators TD10 EFFECTIVE: SUPERSEDES:

MINTRAC MEAT INDUSTRY ENVIRONMENT CONFERENCE WASTE HEAT RECOVERY FROM RENDERING PLANTS PRESENTED BY DEREK HENDERSON

Lesson Plan Contest 2018 Growing Food for Our Classroom Animals: Exploring Hydroponics Grades PR-K 2 nd

Orchard Establishment

SMART IRRIGATION PROGRAM APPLICATION

LOWE'S EDI 850 Purchase Order

Laboratory Exercise #6

Algorithmic Generation of Chinese Lattice Designs

LEARNING LYNX Building Architectural Awareness

P2267 NETWORK INTERFACE

Fertilization of the common pea plant

Force and Bungee Cord Length: Will Longer Length Un-Stretched of Cord result in a Greater or Lesser Force than a Shorter Length of Un-Stretched Cord

Cereal Seed Health and Seed Treatment Strategies: Exploiting new seed testing technology to optimise seed health decisions for wheat.

Performance Rating of Variable Refrigerant Flow (VRF) Multi-split Air-conditioning and Heat Pump Equipment

The Performance of Thermocompressors as Related to Paper Machine Dryer Drainage Systems

Benefits of Enhanced Event Analysis in. Mark Miller

LRPC VISION GOALS-OBJECTIVES-STRATEGIES

Smart Combiners Installation Guide. For Obvius A89DC-08 sensor modules

armfield COMPUTER CONTROLLED HEAT EXCHANGER SERVICE MODULE HT30XC series issue 2

Recent Researches in Energy, Environment and Landscape Architecture

ANNUAL REPORT TO NC DWARF APPLE ROOTSTOCK TRIAL SUMMARY FOR THE 2010 SEASON

Comparing Communication iapps. SET-BC District Partner Conference February 21 st and 22 nd, 2017

Action Plan. Saving Norfolk s Wildlife for the Future. Key Stage 1. House Sparrow. Summary of Activity. Health and Safety Consideration

VIBRA SCHULTHEIS. Vibrating Spiral Conveyors.

Lesson Objectives At the end of this lesson you should be able to

Page1. Title: Planning a Garden Using a Grid Grades: 3 Subject: Science, Math Time: 45 minutes

Low Cost Kinematic Couplings 2.75 Fall 2001

SNC1P Electricity Practice Test

The Effect of Spray Sprinkler Spacing on Distribution Uniformity

Compressibility and One Dimensional Consolidation of Soil

Facilities Operations And Maintenance

South Dakota 100 Corridor Preservation

Building Technology III New York City College of Technology. Course Day 21 Revit Day 12 (Project Day 10)

Visualization of Evaporatively Cooled Heat Exchanger Wetted Fin Area

BI-AMPING THE ACOUSTIC ENERGY 120 SPEAKERS

Page 1 of 16 FLASH PUSLE WIDTH EFFECTIVENESS IN NOTIFICATION APPLIANCES

Cassette units Model Comfort Circle. - heating - cooling - ventilation

Potassium Applications and Yellow Shoulder Disorder of Tomatoes in High Tunnels

Sprinkler System Installer Consultation Discussion Guide

DRAFT 1. Goal: To use knowledge of coordinates to plot given points in a garden designed as a coordinate plane.

Shear Strength of Soils

1. FREE-BODY DIAGRAM. Pretend the dot at right is your mousetrap car. Draw and corectly label the four force vectors acting on the mouse cart.

RC802/ B 8E1 Modular Fiber-Optic Multiplexer (Rev. M) User Manual

WHAT WE ARE LEARNING TODAY

EJB Custom-Built Control Panels

Installation and Servicing Instructions. Alpha FlowSmart

The meeting house is of low-key design typical of the inter-war period. It has an attractive appearance with a spacious interior and good amenities.

TOUCH-SCREEN WEATHER STATION MODEL WS Operation Manual

Landscaping Home Grounds Service and Private Areas Introduction ming pool or a reflection pool. (See Figure 1.) In this leaflet you are going to add t

ELECTROPROJECT GTV-A POWER SUPPLY FOR DOOR MAGNETS

Smoke Layer Height and Heat Flow through a Door

You can ignore the Workbook until we ask you to do an exercise in it.

Collaboration on the Integration of Sculpture and Architecture in The Eden Project

The effect of organic fertilizers and thinning methods on quality parameters and the yield on apple cultivars Aroma and Discovery in Norway

Introduction... 1 System Overview... 1 System Diagram... 2

INTROGRESSING MULTIPLE QTL IN BACKCROSS BREEDING PROGRAMS OF LIMITED SIZE

Travaglini EQUIPMENT FOR FERMENTING AND AGING OF SALAMI. Travaglini_inglese.indd 6 05/07/

The Vanguard System TM VM-2900 Air Seeder Monitor. Operator s Manual

Process Simulation and Optimization of Cryogenic Operations Using Multi-Stream Brazed Aluminum Exchangers

EXECUTING HIGHLY ACCURATE FIBER MEASUREMENTS IN A FRACTION OF THE TIME

Landscaping and Turf Management Instructional Framework

TECHNICAL HANDBOOK. for MULTISPLIT NETWORK SYSTEMS OR GROUP CONTROL

Versatility 96 % ::::::::::::::::::::::::::::::::::::::: the best. mod/con Performance. and UP TO. DOE AFUE Efficiency.

Operating Manual. the semi automatic

4 Glow Bowling. Glow Bowling Products. Help for Dual Pindeck Lamp Fixtures. Products. Pindeck Light Relay Control #400154

Energy Conservation with PARAG Energy Efficient Axial Flow FRP Fans

Name: Date: Period: Genetics and Heredity S8.B2.2 What is DNA? Cells use a chemical code called

Yesler Terrace Explorations of Ideas 11 March 2009

Fibre Specification Standards

A hydronic system controls comfort by delivering heated or cooled fluid to the conditioned space through pipes.

1. Odd one out Give students the sheet Odd one out or simply read out the lists. The answers are all related to the activities during the lesson.

FAN ENGINEERING. Surge, Stall and Instabilities in Fans

Stanford Linear Accelerator Center. Record Details 2. BaBar-ESD Training May, 2002

SHP High Pressure Radiant Gas Brooder Series

CEILINGS BACKSPLASHES WALLS PROJECTS INSTALLATION GUIDE.

UNIQUE COLLEGE OF COMPUTER SCIENCES General Certificate of Education Ordinary Level

LOWE'S EDI 832 Price/Sales Catalog Version: 4010 Final Publication: June 7, 2004


State of Nevada Department of Transportation Materials Division METHOD OF TEST FOR EVALUATING CLEANNESS OF COARSE AGGREGATE

Technical Presentation on

Real Log Homes. Photos by Great Island Photography Builder: David Anderson Hill, Inc. The best of BOTH WORLDS. Real Log Homes and Timberpeg

Comparing & Validating Deviation Surveys

Performance of Geosynthetics in the Filtration of High Water Content Waste Material

Transcription:

Maximally distant genomes under the DCJ operation Manda Riehl Permutation Patterns, 2010 Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 1 / 37

Genomes Made of chromosomes. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 2 / 37

Genomes Made of chromosomes. Made of genes. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 2 / 37

Replication Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 3 / 37

Mouse and Human Genomes 90.2% of the human genome and 93.3% of the mouse genome lie in conserved syntenic segments. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 4 / 37

Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 5 / 37

Chromosomes as Permutations 3.5 Billion base pairs Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as Permutations 3.5 Billion base pairs But only around 23,000 genes! Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as Permutations 3.5 Billion base pairs But only around 23,000 genes! Direction matters: Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as Permutations 3.5 Billion base pairs But only around 23,000 genes! Direction matters: GAU is aspartic acid. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as Permutations 3.5 Billion base pairs But only around 23,000 genes! Direction matters: GAU is aspartic acid. UAG is STOP. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as Permutations 3.5 Billion base pairs But only around 23,000 genes! Direction matters: GAU is aspartic acid. UAG is STOP. Even so, Christie (1996), Pevzner (1998), Labarre (2005) have also considered unsigned versions. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 6 / 37

Chromosomes as SIGNED Permutations 1 2 4 3 indicates the substring 3 4 was attached to 1 2 backwards". Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 7 / 37

Distances between permutations Fundamental Question: Given two genomes/permutations, how many mistakes/mutations/operations do we need to change one into the other? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 8 / 37

Distances between permutations Fundamental Question: Given two genomes/permutations, how many mistakes/mutations/operations do we need to change one into the other? Fundamental Answer: It depends on your operation. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 8 / 37

(Note: Because in this talk, we are using a multichromosal model, our signed permutations are more like broken" permutations, or ordered set partitions) Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 9 / 37

Inversions: Reverse the order of a chromosome or part of the genome Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Inversions: Reverse the order of a chromosome or part of the genome Interchanges: Switch two segments of the genome Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Inversions: Reverse the order of a chromosome or part of the genome Interchanges: Switch two segments of the genome Translocations: Swap the ends of two chromosomes Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Inversions: Reverse the order of a chromosome or part of the genome Interchanges: Switch two segments of the genome Translocations: Swap the ends of two chromosomes Fusions: two segments are joined Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Inversions: Reverse the order of a chromosome or part of the genome Interchanges: Switch two segments of the genome Translocations: Swap the ends of two chromosomes Fusions: two segments are joined Fissions: one segment is split into two Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Inversions: Reverse the order of a chromosome or part of the genome Interchanges: Switch two segments of the genome Translocations: Swap the ends of two chromosomes Fusions: two segments are joined Fissions: one segment is split into two Circularizations and Linearizations: Convert between linear and circular chromosomes Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 10 / 37

Double Cut and Join Includes them all! Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 11 / 37

Double Cut and Join Includes them all! Pros: Very general Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 11 / 37

Double Cut and Join Includes them all! Pros: Very general Cons: Very general Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 11 / 37

Definitions An external vertex or telomere is half an element of the signed permutation whose tail or head is not connected to any other element. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 12 / 37

Definitions An external vertex or telomere is half an element of the signed permutation whose tail or head is not connected to any other element. An internal vertex or adjacency is half an element of the signed permutation with its tail or head joined to other elements (or itself). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 12 / 37

Definitions An external vertex or telomere is half an element of the signed permutation whose tail or head is not connected to any other element. An internal vertex or adjacency is half an element of the signed permutation with its tail or head joined to other elements (or itself). Example: 2 1 4 3 5C... Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 12 / 37

Definitions An external vertex or telomere is half an element of the signed permutation whose tail or head is not connected to any other element. An internal vertex or adjacency is half an element of the signed permutation with its tail or head joined to other elements (or itself). Example: 2 1 4 3 5C... (2t), (4t), (3h), and (3t) are external vertices. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 12 / 37

Definitions An external vertex or telomere is half an element of the signed permutation whose tail or head is not connected to any other element. An internal vertex or adjacency is half an element of the signed permutation with its tail or head joined to other elements (or itself). Example: 2 1 4 3 5C... (2t), (4t), (3h), and (3t) are external vertices. (2h,1t), (1h,4h), and (5h,5t) are internal vertices. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 12 / 37

Double Cut and Join (Yancoupoulos 2005) A DCJ operation involves making two cuts in a genome and rejoining the pieces in one of the following ways: Two internal vertices (a,b) and (c,d) can be replaced with two new internal vertices (a,d) and (c,b) or (a,c) and (b,d). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 13 / 37

Double Cut and Join (Yancoupoulos 2005) A DCJ operation involves making two cuts in a genome and rejoining the pieces in one of the following ways: Two internal vertices (a,b) and (c,d) can be replaced with two new internal vertices (a,d) and (c,b) or (a,c) and (b,d). An internal vertex (a,b) and an external vertex (c) can be replaced with a new internal and external vertex (a,c) and (b) or (b,c) and (a). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 13 / 37

Double Cut and Join (Yancoupoulos 2005) A DCJ operation involves making two cuts in a genome and rejoining the pieces in one of the following ways: Two internal vertices (a,b) and (c,d) can be replaced with two new internal vertices (a,d) and (c,b) or (a,c) and (b,d). An internal vertex (a,b) and an external vertex (c) can be replaced with a new internal and external vertex (a,c) and (b) or (b,c) and (a). Two external vertices (a) and (b) can be replaced by an internal vertex (a,b). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 13 / 37

Double Cut and Join (Yancoupoulos 2005) A DCJ operation involves making two cuts in a genome and rejoining the pieces in one of the following ways: Two internal vertices (a,b) and (c,d) can be replaced with two new internal vertices (a,d) and (c,b) or (a,c) and (b,d). An internal vertex (a,b) and an external vertex (c) can be replaced with a new internal and external vertex (a,c) and (b) or (b,c) and (a). Two external vertices (a) and (b) can be replaced by an internal vertex (a,b). An internal vertex (a,b) can be replaced by two external vertices (a) and (b). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 13 / 37

The example below shows how a DCJ operation can transform one genome into another. a b a c c d iii b d Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 14 / 37

DCJ Distance The DCJ distance between two genomes on the same set of genes is defined to be the fewest number of Double-Cut-and-Join operations that it takes to transform one genome into the other. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 15 / 37

DCJ Distance The DCJ distance between two genomes on the same set of genes is defined to be the fewest number of Double-Cut-and-Join operations that it takes to transform one genome into the other. What for? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 15 / 37

DCJ Distance The DCJ distance between two genomes on the same set of genes is defined to be the fewest number of Double-Cut-and-Join operations that it takes to transform one genome into the other. What for? Audience Poll: Which has a more recent common ancestor: humans and rabbits, humans and camels, or humans and pigs? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 15 / 37

Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 16 / 37

Adjacency/Breakpoint Graph Any genome can be represented by a distinct arrangement of sets of internal vertices and external vertices. A bipartite adjacency graph is constructed with vertices corresponding to the sets of internal and external vertices of the two genomes. Two vertices are connected with an edge for every head or tail that they share. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 17 / 37

1t 1h2h 2t 3h4t 3t4h 1t 1h2h 2t 3h4t 3t4h 1t 1h4t 4h 2t2h 3t 3h 1t 1h4t 4h 2h2t 3t 3h Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 18 / 37

Distance Formula Distance Formula Theorem (Bergeron, Mixtacki, Stoye 2008) The DCJ distance between two genomes, A and B, defined on the same set of N genes is given by d DCJ (A, B) = N (C + I/2), where C is the number of cycles and I is the number of odd paths in the adjacency graph of A and B. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 19 / 37

Example d DCJ (1 2 3 4 5, 1 4 2 5 3) Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 20 / 37

Distance distribution We were interested in exploring, for a starting permutation A, how the other permutations of the same length were distributed in terms of their distance from A. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 21 / 37

Distance distribution We were interested in exploring, for a starting permutation A, how the other permutations of the same length were distributed in terms of their distance from A. Questions: Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 21 / 37

Distance distribution We were interested in exploring, for a starting permutation A, how the other permutations of the same length were distributed in terms of their distance from A. Questions: Are most" genomes near A or far from A? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 21 / 37

Distance distribution We were interested in exploring, for a starting permutation A, how the other permutations of the same length were distributed in terms of their distance from A. Questions: Are most" genomes near A or far from A? What features of A will this distribution depend on? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 21 / 37

Distance distribution We were interested in exploring, for a starting permutation A, how the other permutations of the same length were distributed in terms of their distance from A. Questions: Are most" genomes near A or far from A? What features of A will this distribution depend on? Are there symmetry properties of this distribution? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 21 / 37

Maximum Distance Obvious Corollary to BMS: The maximum distance between two genomes is N and occurs when C + I/2 = 0. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 22 / 37

Maximum Distance Obvious Corollary to BMS: The maximum distance between two genomes is N and occurs when C + I/2 = 0. This means that there are no cycles and no odd paths in the adjacency graph of two maximally distant genomes. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 22 / 37

Maximum Distance By considering an arbitrary starting genome, A, defined on N signed genes and counting the number of distinct adjacency graphs that could be created from it containing only even paths we showed: Theorem The number of maximally distant genomes is given by G max (m, n) = (2m 1)!! n ( n+m 1 k=0 k )( n k ) 2 k k!, where 2m is the number of telomeres, and n is the number of adjacencies in A. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 23 / 37

Show me the values! m n 0 1 2 3 4 5 0 N/A 1 5 37 361 4361 1 1 3 17 139 1473 19091 2 3 15 111 1083 13083 188103 3 15 105 975 11265 155535 2495865 4 105 945 10605 142485 2228625 39757305 5 945 10395 137025 2104515 36893745 726753195 Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 24 / 37

Theorem G(m, 1) = G(m+1, 0) Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 25 / 37

Theorem G(m, 1) = G(m+1, 0) Conjecture: G(m, n) G(s, t) otherwise. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 25 / 37

Generating Functions (for fixed m) Theorem The exponential generating function for the sequence {g m } is given by x 1 2x e f m (x) = (2m 1)!! (1 2x) m, Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 26 / 37

Chromosomes as UNSIGNED permutations No known nice formula for distance as in signed case. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 27 / 37

Chromosomes as UNSIGNED permutations No known nice formula for distance as in signed case. The method of BMS breaks down thoroughly. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 27 / 37

Chromosomes as UNSIGNED permutations No known nice formula for distance as in signed case. The method of BMS breaks down thoroughly. Python program to generate data. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 27 / 37

Number of unsigned genomes distance D from a single linear chromosome of length N N \ D 0 1 2 3 4 5 1 1 1 2 1 4 1 3 1 10 12 1 4 1 18 64 39 1 5 1 28 208 387 149 1 6 1 40 501 2096 2478 661 Column when n = 1 is A028552 in OEIS. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 28 / 37

Strategy The key to BMS s success was a clever data structure. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 29 / 37

Strategy The key to BMS s success was a clever data structure. Graphs don t seem to work. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 29 / 37

Strategy The key to BMS s success was a clever data structure. Graphs don t seem to work. Find a new data structure! Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 29 / 37

Strategy The key to BMS s success was a clever data structure. Graphs don t seem to work. Find a new data structure! Must incorporate the symmetries. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 29 / 37

*In progress* Consider the vertices of your two genomes as ordered pairs, with external vertices having a 0 in their pair. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 30 / 37

*In progress* Consider the vertices of your two genomes as ordered pairs, with external vertices having a 0 in their pair. Plot the vertices, and their reflections across the line x = y, on the upper right quarter plane. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 30 / 37

*In progress* Consider the vertices of your two genomes as ordered pairs, with external vertices having a 0 in their pair. Plot the vertices, and their reflections across the line x = y, on the upper right quarter plane. Use circles for your start genome, crosses for your destination. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 30 / 37

*In progress* Consider the vertices of your two genomes as ordered pairs, with external vertices having a 0 in their pair. Plot the vertices, and their reflections across the line x = y, on the upper right quarter plane. Use circles for your start genome, crosses for your destination. Imagine an infinite source of circles at (0, 0). Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 30 / 37

About the Game Board No more than 2 of the same symbol at each grid point. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 31 / 37

About the Game Board No more than 2 of the same symbol at each grid point. Row and column restrictions arise from genome motivation. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 31 / 37

About the Game Board No more than 2 of the same symbol at each grid point. Row and column restrictions arise from genome motivation. Want to abolish all crosses by moving circles on top of them. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 31 / 37

Rules of the Game Take a circle, move it in its row or column. Recording these moves gives a sequence of unsigned DCJ operations. When all crosses have been destroyed, you have reached your destination genome. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 32 / 37

Rules of the Game Take a circle, move it in its row or column. (Do the reflection of this move simultaneously with the reflected circle.) Recording these moves gives a sequence of unsigned DCJ operations. When all crosses have been destroyed, you have reached your destination genome. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 32 / 37

Rules of the Game Take a circle, move it in its row or column. (Do the reflection of this move simultaneously with the reflected circle.) At a right angle to the new location, move the next circle an opposite move, if it exists. Recording these moves gives a sequence of unsigned DCJ operations. When all crosses have been destroyed, you have reached your destination genome. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 32 / 37

Rules of the Game Take a circle, move it in its row or column. (Do the reflection of this move simultaneously with the reflected circle.) At a right angle to the new location, move the next circle an opposite move, if it exists. OR: Take two circles (0, a),(0, b) and create one circle (a, b), or vice versa. Recording these moves gives a sequence of unsigned DCJ operations. When all crosses have been destroyed, you have reached your destination genome. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 32 / 37

Strategy of the Game Want to move circles onto crosses as much as possible. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 33 / 37

Strategy of the Game Want to move circles onto crosses as much as possible. Do doubles when you can, and choose doubles to get the most doubles immediately. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 33 / 37

Strategy of the Game Want to move circles onto crosses as much as possible. Do doubles when you can, and choose doubles to get the most doubles immediately. A double is always better than a single. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 33 / 37

Strategy of the Game Want to move circles onto crosses as much as possible. Do doubles when you can, and choose doubles to get the most doubles immediately. A double is always better than a single. No single is better than any other! Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 33 / 37

In progress: Complicated proof by contradiction outlining why no single is better than any other. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 34 / 37

In progress: Complicated proof by contradiction outlining why no single is better than any other. Basically, large loops of dependencies terminate. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 34 / 37

In progress: Complicated proof by contradiction outlining why no single is better than any other. Basically, large loops of dependencies terminate. Not only shows that the maximum distance is n, but also gives the sequences of DCJ s. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 34 / 37

Questions: Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 35 / 37

Questions: Is there a better data structure that yields a distance without the work of finding the sequence of moves? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 35 / 37

Questions: Is there a better data structure that yields a distance without the work of finding the sequence of moves? The total number of these unsigned genomes is not known. Is there a smart way to count them? Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 35 / 37

Thank you to the organizers for allowing me to speak and all their hard work. Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 36 / 37

Manda Riehl (UW Eau Claire) Max Distance Genomes PP2010 37 / 37