Common DNA Subsequence of Genes
The Version 7 functions LongestCommonSequence and LongestCommonSubsequence are now supplemented by their positional counterparts LongestCommonSequencePositions and LongestCommonSubsequencePositions.
Compare the DNA sequences of random genes of the Y chromosome.
In[1]:=
data:image/s3,"s3://crabby-images/e52f3/e52f3af1448a678d2ca2de4caf5a0836b1bba7e6" alt="Click for copyable input"
genes = RandomSample[GenomeData["ChromosomeYGenes"], 4]
Out[1]=
data:image/s3,"s3://crabby-images/e97f0/e97f00e5064eec5a57ac23a8910a689142cd15ab" alt=""
Group these genes by pairs.
In[2]:=
data:image/s3,"s3://crabby-images/9232b/9232bf2b6ef3b58fd80dcdd2568357c66f0c9ec3" alt="Click for copyable input"
With[{subsets = Subsets[genes, {2}]},
Table[pair[i] = subsets[[i]], {i, 1, Length[subsets]}]];
Define a function that will get the positions of the longest continuous DNA sequence common to each pair, together with the sequence itself.
In[3]:=
data:image/s3,"s3://crabby-images/c0b74/c0b7481ab72488415bffec91bff21546ab9bb579" alt="Click for copyable input"
commonDNASubequence[{g1_, g2_}] :=
With[{d1 = GenomeData[g1], d2 = GenomeData[g2]}, {{g1, g2},
LongestCommonSubsequencePositions[d1, d2],
LongestCommonSubsequence[d1, d2]}]
Longest common subsequence of the first pair.
In[4]:=
data:image/s3,"s3://crabby-images/a06e7/a06e75883472317f27f7c5807e57424db859b871" alt="Click for copyable input"
commonDNASubequence[pair[1]]
Out[4]=
data:image/s3,"s3://crabby-images/c6dcd/c6dcdfc5af359b7e434fc6a2f0d18fe21b7cceb1" alt=""
Longest common subsequence of the second pair.
In[5]:=
data:image/s3,"s3://crabby-images/f8976/f8976e61379996c96d786e501fd62001a8b1293b" alt="Click for copyable input"
commonDNASubequence[pair[2]]
Out[5]=
data:image/s3,"s3://crabby-images/a97d2/a97d2ea477d39046a0be35e69b55f1cab7ed2eb4" alt=""
Longest common subsequence of the third pair.
In[6]:=
data:image/s3,"s3://crabby-images/d07cd/d07cdaa0fc435437ea98ba9618386d21b1dcdd4b" alt="Click for copyable input"
commonDNASubequence[pair[3]]
Out[6]=
data:image/s3,"s3://crabby-images/0d0c2/0d0c25e47b9c3faf88b0d752aea672ad83503ad0" alt=""
Longest common subsequence of the fourth pair.
In[7]:=
data:image/s3,"s3://crabby-images/c67ba/c67ba6ba0d3193e262f0be9069324f473d6e407b" alt="Click for copyable input"
commonDNASubequence[pair[4]]
Out[7]=
data:image/s3,"s3://crabby-images/e47f1/e47f177eb6983dcc14b6e0b2f29f10efb0fd9c99" alt=""
Longest common subsequence of the fifth pair.
In[8]:=
data:image/s3,"s3://crabby-images/6b83c/6b83c9e24ade536e896b83db594b32dcc0764e78" alt="Click for copyable input"
commonDNASubequence[pair[5]]
Out[8]=
data:image/s3,"s3://crabby-images/03efa/03efaec1e70fe31cbd2c0b840276ca3e66d345bb" alt=""
Longest common subsequence of the sixth pair.
In[9]:=
data:image/s3,"s3://crabby-images/72ef8/72ef89309d12f3cc04cbd94c31d380a6bce255d6" alt="Click for copyable input"
commonDNASubequence[pair[6]]
Out[9]=
data:image/s3,"s3://crabby-images/195a8/195a80bbad2e18f26e0e7d889790445b530ae241" alt=""