Published 2017-12-08 18:00 | Ellinor Lindholm

Since last time, we have used ABySS to assemble the single-read data of P. andina with several different k-mer sizes, and looked at the N50 and E-size values to get a rough measure of the quality of the contigs.
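For reference, here is a minimal sketch of how the two metrics can be computed from a list of contig lengths. It is not the script we ran. The function names and the toy lengths are made up for illustration, and the E-size here assumes the total contig length as the denominator (the true genome size can be passed in instead if it is known).

```python
def n50(lengths):
    """Length of the contig at which the running total, counted from the
    longest contig downwards, first reaches half of the total length."""
    if not lengths:
        raise ValueError("no contig lengths given")
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length


def e_size(lengths, genome_size=None):
    """Expected size of the contig containing a randomly chosen base:
    sum of squared contig lengths divided by the total length.
    Assumption: the total contig length is used unless genome_size is given."""
    if not lengths:
        raise ValueError("no contig lengths given")
    total = genome_size if genome_size is not None else sum(lengths)
    return sum(length * length for length in lengths) / total


# Hypothetical contig lengths, only to show the calls.
contig_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(contig_lengths), e_size(contig_lengths))
```

Running both functions on the contig lengths from each ABySS run gives one N50 and one E-size value per k-mer size, which is the kind of data plotted in the graphs.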

Below are two graphs where the k-mer sizes have been plotted against N50 and E-size, respectively.

K-mers vs N50

Different k-mer sizes plotted against the corresponding N50 values. The highest N50 value corresponds to a k-mer of 23.

K-mers vs E-size

Different k-mer sizes plotted against the corresponding E-size values. The highest E-size value corresponds to a k-mer of 23.

Based on what we see here, a k-mer size of 23 seems like the best option for now. We did not see any difference in these metrics when assembling the data trimmed with the tool sickle. We brought this up in our meeting with Lars Arvestad today, and as we understood it, the data had already been trimmed (e.g. of adapters), so trimming it again does not make any real difference.

After explaining at the meeting what we had done so far and the results we had obtained, we could conclude that the assembly is more or less done and that scaffolding is, as expected, not possible (since we do not have paired reads). The purpose of our assembly is therefore not to assemble the whole genome completely, but rather to create contigs large enough for whole genes to fit inside them, and hopefully those will include the CHS genes. The problem of not being able to access the CHS-gene files was also solved during the meeting, as Lars gave us access.

During a group meeting after the meeting with Lars, we sat down and started preparing the presentation for Seminar 2.

In this presentation we have focused on the three main tasks that define the project, as well as what we have done so far and some issues we have had. The presentation is not finished at the time of writing, as we will work on it during the weekend and probably add more results.

To summarize what we have done so far in the project:

Upcoming days

The main goal is now to finish Task 2, which is the identification of CHS genes. The CHS-gene file contains genes from four different species (all possible hybrids with P. andina):

CHS-gene.fa
>chs_Psoj1 Phytophthora sojae
MSGAPPPSSGFAPRSYGQQPLSHAPRSSMMSVEYDGIPLPPPSIRSCGSQQYVTSYIPTGAAFPPSSVQDMISSMKSYASATDLVRTYSEIPSVEEALSTLDRAAAALNARRYRDALKLYLEGGYAMANVAERQANPKICNLLTSKGFETLNWCARLCDWIEGRIKEKHPRPGVHKVGIPVSNWDEDWVGPFMDEEEARRMWYTPVYCPHPIDFSNLGYRLRCVETGRRPRLMICITMYNEGPQQLKATLKKLANNLAYLKEQMPGDEKSLTGAFAGDDVWQNVLVCIVADGREQVHPKTLDYLEAIGLYDEDLLTINSAGIGAQCHLFEHTLQLSVNGKCLLPIQTVFALKENKASKLDSHHWYFNAFAEQIQPEYTAVMDVGTMLTKSALYHLLFAFERNHQIGGACGQLTVDNPFENLSNWVISAQHFEYKISNILDKSLESCFGFISVLPGAFSAYRYEAIRGAPLDAYFQTLNIELDVLGPFIGNMYLAEDRILSFEVVARKNCNWTMHYVKDAVARTDVPHDLVGLISQRKRWLNGAFFATLFSIWNWGRIYSESKHTFVRKMAFLVFYVYHLLYTAFGFFLPANLYLALFFIVFQGFQQNRLEFIDTSEYSQTVLDCAVYIYNFSYLFGLLMLIIIGLGNNPKHMKLTYYFVGAVFGLMMMLSSLVGAGIFFSTPATVHSIVVSILTVGVYFIASALHGEVHHIFMTFTHYTALIPSFVNIFTIYSFCNLQDLSWGTKGLHDDPLLAASLDETEKGDFKDVIAKRRALEELRREEKERVENRKKNFEAFRTNVLLTWAFSNLIFALFVVYFASSSTYMPVLYIFVASLNTCRLLGSIGHWVYIHTEGLRGRVIDKSECGNGTGRYPQNSYVQLEEHYAALAEDQRTYASGRTNASVRTVNDVSSAA
>chs_Psoj2 Phytophthora sojae
MEQDGGGYVSSNAEEYDAAPSSGLPTPDGPDVWRDTRPTSHSYVDEPHANQPIERQLSYRQPGLNPAAEETYEAAFRLIQLAADAERSGSPLQAIQLYTDAGDVLIKVGKREPDPLLKQGIRQKANEIMKRAEELDEWYYSVQESARKAALPPQLQIQRTQVPRVQEAWQGRKPTLSQAPEFTHMRYTAVSTKDPVKFSDDGFQLRVEEAGKKIRVFITITMYNEEGSELQGTLKGIARNLEFMAEQWGDRAWESVAVAVVSDGRTKASASCLDFLAGLGAFDEEIMTVTSMGVDVQLHLFEATVQLVRDDNFESFYPPIQLIYALKENNAGKLNSHLWFFNAFSEQLLPTYTVLVDVGTIPGPTSIFKLIRSMDRNPQIGGVAGEIAVDHPNYFNPVIAAQHFEYKISNIMDKSLESVLGFISVLPGAFSAYRYEAIRAEKGVGPLPEYFKSLTTSTRDLGPFKGNMYLAEDRILCFELLARRGRNWTMHYVKDAIARTDVPETLVDLIKQRRRWLNGSFFAGLFAISNFSRVWKESGHSLSRKIVLTLQFIYLSVQNVLSWFLLSNLFLTFYYVLTLTLYSKYPALLAIVLGIYLAIVGGIVVFALGNRPEKRTAAFYSFSYVFMGLVMLVVSVISIYGLVADVSVTDPRDDLASCSVSNFELVGGVVTAIGLVFASAFIHGEFSVLLSTLQYYFMLPTFVNILGIYAYSNLHDLSWGTKGLETAGHGAAKVAQGNGNLKEIVAQQKRIEAEKQRAAQEKEDVDNSFRAFRSSLLLFWLVTNAVWMYCMTYFVSSSCYLKFISYVVAVFNVVRFFGSAVFLCFRIARRLGTCGASRSGKSGRNYQAHLPAEWQAHYKRSSNATTETTTTGNGQQTYVGLNISDLSSPAATNYRNMEEVR
>chs_Pram1 Phytophthora ramorum
MSGGPPPSSGFVPRSFGLPPISQAPRSSMMSVEYDGIPLPPPSIRSCGSQQYVTSYIPTGAAFPSGSVQEMISSMKSYASATDLVRTYSEIPSVEEALVTLDRAAVALNARRYRDALKLYLEGGYAMANVAERQANPKICNLLTSKGFETLNWCARLCDWIEGRIKEKHPRPGVHKVGIPVSNWDEDWVGPYLDEEEARRMWYTPVYCPHPIDFSNMGYRLRCVETGRRPRLMICITMYNEGPQQLKATLKKLANNLTYLKEQDSGHDKALSETFKGNDVWQNVLICIVADGREQVNDKMLDYLEAVGLYDEDLLTINSAGIGAQCHLFEHTLQLSVNGKCLLPIQTVFALKETKSNKLDSHHWYFNAFAEQIQPEFTAVMDVGTMLTKSALYHLLFAFERNHQIGGACGQLTVDKPFENLSNWVISAQHFEYKISNILDKSLESCFGFISVLPGAFSAYRYEAIRGAPLDAYFQTLNVDLDVLGPFIGNMYLAEDRILSFEVVARKNCNWTMHYVKDAVARTDVPHDLVGLISQRKRWLNGAFFATLFSIWNWGRIYSESSHTFTRKVAFLVFYGYHLLYTAFGFFLPANLYLALFFIVFQGFQQNRLQFIDTSDFSQTVLDCAVYIYNFAYLFGLLMLIIIGLGNNPKHMKLTYYFVGAVFGVMMMLSSLVGLGIFFSTPATINSIVVSILTVGVYFIGSALHGEMHHIFMTFTHYTALIPSFVNIFTIYSFCNLQDLSWGTKGLHDDPLLAASLDETEKGDFKDVIAKRRAMEERRREEKERTDNRKKNFEAFRTNVLLTWAFSNLILALFVVYFTDSSTYMPILYYFVASLNSCRLLGCIGHWVYIHTDGLRGRVLDKTECGNGTGRYPQNSYIQLDEHYAALVEDQRTYASGRTNASVRTNNDISSAA
>chs_Pram2 Phytophthora ramorum
MSNGRSTEDLRARLQSLRASRTSSLQPQEAARDWGDTFQPGELEELQSMLEQDAAGDFSSGTEEYDAIASSGLSTPDNVDVWRDTRPTSHSYVDEPHPNQPIERQLSYRQSGLNPAVEETYEAAFRLIQLAAEAERSGTPLQAIQLYTDAGDVLVKVGRREPDPLLKQGIREKANEIMKRAEELDEWYYSVQESARKAALPPQLQIQRTQVPRVQEAWQGRKPTLNQPPEFTHMRYTAVSTKDPVKFTDDGFQLRIEQAGKKIRVFITITMYNEEGSELQGTLTGIARNLEFMAEQWGDRAWESVAVAVVSDGRTKASASCLDFLTGLGAFDEEIMTVTSVGVDVQLHLFEATVQLVRDDNFESFYPPIQLIFALKESNAGKLNSHLWFFNAFSEQLLPTYTVLVDVGTIPGPTSIFKLIRSMDRNPQIGGVAGEIAVDHPNYFNPVIAAQHFEYKISNVMDKSLESVLGFISVLPGAFSAYRYEAIRAEKGVGPLPEYFKSLTTSTRDLGPFKGNMYLAEDRILCFELLARRGRSWTMHYVKDAIARTDVPETLVDLIKQRRRWLNGSFFAGLFAISNFSRVWRESGHSLSRKLVLTLQFVYLGVQNLLSWFLLSNLFLTFYYVLTLTLYSDYPVLLAIVLGIYLVIVGGIVVFALGNRPEKRTAAFYSFSYVFMGLVMLVVSVISIYALVADISVTDPRDDLASCSVSNFELVGGVATAIGLVFASAFIHGEFSVLLSTLQYYFMLPTFVNILGIYAYSNLHDLSWGTKGLETAGHAAAKVTQGNGNLKEIVAQQKRLEALKQRAAEEKEDVDNSFRAFRSSLLIFWLVTNAVWMYCMTYFVSSSCYLKLISYVVAVFNMARFLGSAVFLCFRIARRLGACGTVRSGASGRNYQASLPIEWQAHFKHSSNTTTETTTRNGHIGLNMSDLSSPVATNYKNMEEAM
>chs_Pinf Phytophthora infestans
MSGAPPPSSGFQPRSIGLPPLSHGPRSSMMSVEYDGIPLPPPSIRSCGSQQYVTSYIPTGAAFPSGSVQDLISSMKSYASATDLVRTYSEIPSVEEALVTLDRAAVALGARRYRDALKLYLEGGYAMANVAERQANPKICNLLTSKGFETLNWCARLCDWIEGRVKEKHPRPGVHKVGIPVSNWDEDWVGPYLDEEEARRMWYTPVYCPHPIDFSNMGYRLRCVETGRRPRLMICITMYNEGPQQLKATLKKLANNLAYLKEQKKDHEKTLSRDFAGDDVWQNVLLCIVADGREQVNDKMLDYMEAIGLYDEDLLTINSAGIGAQCHMFEHTLQLNVNGKSLLPIQTVFALKESKSSKLDSHHWYFNAFAEQIQPEYTAVMDVGTMLTKSALYHLLFAFERNHQIGGACGQLTVEKPFENLSNWVISAQHFEYKISNILDKSLESCFGFISVLPGAFSAYRYEAIRGAPLDAYFQTLNIDLDVLGPFIGNMYLAEDRILSFEVVARKDCKWTMHYVKDAVARTDVPHDLVGLISQRKRWLNGAFFATLFSIWNWGRIYSESNHSFTRKMAFLVFYLYHLLYTAFTFFLPANLYLALFFIVFQGFQQNRLEFVDTSEYSQTVLDCAVYMYNFVYLFGLLMLIIIGLGNNPKHMKLTYYFVGAVFGVMMMLSSLVGMGIFFSTPATTHSIVVSILTVGVYFIGSALHGELHHIFMTFTHYTALIPSFVNIFTIYSFCNLQDLSWGTKGLHDDPLLAASLDETEKGDFKDVIAKRRAMEERRREENERMENRKKNFEAFRSNVLLTWTFSNLIFALFVAYFADSSTYMPILYIFVASINSCRLLGCIGHWIYIHTTGLRESFLDKSECGNGTGRYPQNSYVQLDEHYAALAEDQRTYASGRTNASVRTNNDVSSIA
>chs_Spar1 Saprolegnia parasitica
MPPKRPTEASGRRYAPPAGRPSNNAANAKPRAPRKGVSSRASNVPSAASSYEYDYEYNMMPMMQAPPKSQPTFLSNIAPISAKEASMKGSNAMQLLLQGTSFTIDDAFRAIERAIQAENEGRFREALKHFLDGGEMIVTAAEKEASQKVRNLLLHKGKEVLEWAEHLAEWIERYNTSTPPVRIAKPMAVEVTYDRTMNSPDLDETEARMMFYTPVCSGPKAFTETGYRLQCIQSGRRPRLMVVITMYNEDENELRSTLRKVCNNVLYLKQHSLPGYEGDDAWKQVLVVVVSDGRTKANKGTLEWLANVGLYDEDVMNITSTGVKVQCHLFEHSLQMTKENSIRFPPLQLDSHLWYFDAFAEQIMPDYTVLLDVGTMPTKSSFYKLLTALEINAQIGGVCGEIAVDKPLPNMCNWVIAAQHFEYKISNILDKSLESCFGFISVLPGAFSAYRYKAIRGAPLQAYFKSLTTDMAELGPFAGNMYLAEDRILCFELLARKDCNWTMHYVKDAIARTDVPTNLIDLVGQRRRWLNGSFFATLFAIWNWGRVYTESNHSLTRKLALLVHALLGVSAANFYLALYFVIFQGFRDNRWNFIDTSEYPQWVLDGLPTAFNVFYAVTVFTQVTIGLGNKPKHVKGTHYLISVLFGLLMLLASGVAIVIFITSSKDAMAIVLAVLILGTFFIGSALHCEVHHIVLTFVQYTALMPSFVNILMVYSFCNLHDLSWGTKGIDTGHEAHKTEAVGQYKDIVARQKALEAKKAQDARNQDELKKRFDSFRSNLLLVWVMSNMSMVIICVNTVGADSFLPFLYAFVAAFNGIRLLGCIGYLIYYARQFLLFNTLSATGVLHKRHEARKHKKAEDPDPIDMELGTFNEPATSEIGAPMMQAPYNRMR
>chs_Spar2 Saprolegnia parasitica
MSDSNLDLAARLRALREGGAEPAPAPAPTPYMHSPPSRTRPTPLYTQESLEFGGTYTTGSPVGAEADGVYTQVPVWKDSKEKTYGYLDDEPAPQAQTLLNKANDLVQRQASNKAFRRQHTAAFRPLPNTVEELLDGSPTYEGAFRLVQLAVQMEQDGDPQAAINLYADAGATLVEVGRKEVDPLLQKGIRQKAQELLQRAEDLEAWMNGVAEEARKAALPPSLRIARTNVPTVEQTWAGRPPPFHDANEFKLMRYTAVATKDPIQFSDDGYVLRVHELQRPIKVFITITMYNEEGSEIKGTLTGLAKGLAYMCKEYGDDFWQQVAVAIVSDGRTKASKTCLEYLKAVGAFDEEIMTVTSLGVDVQMHLFESTLQLVENQNFEAYYPPLQVIYALKENNGGKLNSHLWFFNAFSEQLNPKYTVLVDVGTIPAETSVFRLIRSMERNAQIGGVAGEIAVEAPNFFNPVIAAQHFEYKISNIMDKSLESVFGFISVLPGAFSAYRYEAIRAVKGVGPLPEYFKSLTSTTKELGPFQGNMYLAEDRILCFELLARKQRRWTMHYVKDAIARTDVPETLVDLIKQRRRWLNGSFFAGLFAIGHFGRVWSQSSHSFGRKLVFTFQFVYLALQNLLSWFLLSNLFLTFYFVLTLAFTESAPALLQTMLTVYLAIIGGLIVFALGNKPEPRTASFYLFSCLYMGIIMLLVTGISIYGLIGKGTSAVKDPRTITGIFSNCTVSDAELAGGVITSLGLIFLSAFVHGEFGILLSFVQYFFMLPTFVNVLGIYAYSNLHDLSWGTKGLESGGGHGPAKAGGGNVKDVVEQQKKIEAARQAAAREKEDVDNSFRAFRSTLLLSWLTTNGIWLYVVTDYMSSGCYLKGLSYIVGFFNVVRFTGCVVFVILRMFRRFGCGARASRDNYQEALPAEWQTHYNVTNRTDGRVAPPPKHAASMDPTTPHGGVYQQV
>chs_Spar3 Saprolegnia parasitica
MGVPTLSKASVFRFARQPRHHGVKTRLLRSRSKTLMLGGQSTAETAQYASCNPNGDETSSNSLKPLKLKDMSTDDLLRHAEQLDMHLAKIYIESQKTKALAPIVTKSIGLPQQLWEDAGVAPPYHSAAEFQDLRYTAVRTADPIAFSADGYSLRVHTLGKSIKVFITVTMYNEPASQLQATLTGLAGGIDYLCHQYGYDFWQEVAVVVVADGRSKTHHSVLPYLESFGAFEKNLLAQAIAASKDTHVHLFESTIQLRKTNGSFHAPMQLIFALKEHNAGKLHSHLWFFNAFSEQVDPTYTALVDVGTVPAESSVYRLIRSMERNPQIGGVAGEIAVDDPDFFNPVIAAQHFEYKIANIMDASLQSVFGFIGVLPGAFSAYRYEAIRPINGVGPLAEYFKSLTASKKELGLCVGNMYLAEDRILCFEILARKNCDWTMHYVKDAIAHTDVPETLVDLIKQRRRWLNGSFFAGLFAIWNFGRVWTQSAHSLPRKCAFSLQFLYLAFQNVMNWFLLSNLFLTFFYILSLALYYKSIELLHVVLGTYFVLVGSLIVFALGNKPGHRTAIYYRVSSYIMGTIMLCVTCISLYALLGNVQFVDPRSDLPSCSVSNYELEAGAFFSLGIIFVCAFMHGEFGIVRSTVQYFFMLPTFVNVLGIYAYSNLHDLSWGTKGIETSAGHNGLPTSKFGSVKDMVALHLNATSTTDVVSEADKRKGVVAAEHEDVDNRFRVFRSLLLLTWLLTNGCWLYYATSFISCSCYLKYLSYIVAVFNTFRFLGGLLFLSFRMARGALHCCRQGVKKTRPLRCGNPQAGDDCSPV
>chs_Spar4 Saprolegnia parasitica
MTTLPERLLARVTSSMSALGGTAKLVTAQEGFRLIEQGVLAERQQHYKEAVDRFLGAAGVLDAVAATEADLHVRRLLHAKASDVVAWTEGIVAWMQHRPEVRPAYPPRQSKGISMPTTTVSAATLAFGMDEVERTSLHYTPVLTRSPSEFSRDGYELQVLRRHRRPRMLIVITMYNEDGSEIEATLRKVGNNVAYLCRHDLPGYEGELAWQNVLVVIVSDGRAKASASTLITLREMGVYDEDTLRITSAGLATSMHLFERTLLLPEAPGAKKLWTHTSETMPPLQVVFALKEENAGKLHSHLWFFHGFCNQVDPTYTVLLDVGTLPTKSALYKLVSAMEVNQQVGGVCGEIAVSQPLPHLTSLIISTQHYEYKISNVLDKATESCFGFISVLPGAFSAYRFKAIQGAPLDAYFKSLTTDMLALGPFQGNMYLAEDRILCFELLARKNCSWTMMYVKDAIARTDVPTTLVDLMAQRRRWLNGSFFAMLYTIFNWGRVYSEARHSLCRGLALLVQYTFMTVQVVFNWFLVANFYLTVYYVIFYALERNALGVLDTRAFYASHGALAKGLFNVVYGLVFVVQIILGMGNRPKHVARTYRAIGAYYMLLVVLTTAASVLTLVHTGAAALAPKEIALGIAVFGVYFIAAACHCELHHIVLSFVQYSLLLPVMINTLTIYSFCNLQDLSWGTKGIDTSSHDTGASENGEYKDVVARQKAAEDRAKQAAATTDLVRRRFDSFRSNLLLLWLVSNAALVGGLLYGALLDVYLPCLFVAIGAFNTYRLLGSLLFLLYTGRQWLLLQLCLCCGCLRRRYDRERRRGSDDRTMAILSPRDPLATI
>chs_Spar5 Saprolegnia parasitica
MVSSQLSTGLPAIARRPSVRSTRVGSHVRNDNDCVTTVDAFRYIERGVRAEYDMFYSEAINCFVNAGECLLIVAEQNDDDVSQMLLAKSQEVIGWAEELSIWLENGRAGPLPSRNCRGIQIPFTKEYEGGEHYEEAAELSYTPVATVNPINFTLDGYRMQCVTRGRKPTMMLVITMYNEDGAELAQTLRKVCNNVKYIQKNALPGYEGDDAWQNIVVCIVSDGRTKANPSATSFLRDIGVFNEDAMTIFSSGAATQMHLFERTVRLAKDPLNKQSVIMSNNSTIGADYPPLQMVYALKEHNAGKLNSHLWFFNAFCNQVDPEYNILLDVGTLPTKAALYKLLATLEMKADIGGVCGEIAVSRPIPNLWNFVIATQHFEYKVSNLLDKATESCFGFVSVLPGAFSAYRFAAIKGAPLQAYFKSLTTDMAELGPFYGNMYLAEDRILCFELLARTNGAWKLKYIKDAVARTDVPSTLVDLMAQRRRWLNGSFFAMLYSIVQWGRLYSHTNHSLFTKAGLLIQYFQLLVQLFFGWFMCGFFYLSVYYVVFTTLKKSKLPFWDSEEWYDDHHSMAMSIFNIVYAFLIMVQIIFGLGNKPKHVKWLYTFLSIFYAIVVITAVFFSVCSLSSHNGMSSFNIVLLAATFGVYFIAAAFHFELHHVVFTFVQYLVMLPTTINILMIYAFCNIQDLSWGTKGLGDSAGHGPTKGGGQRLGSGGYSDLVAQRKAAEAAARHDAVVADQVRRRFDSFRSYTLLFWLISNALLIMTCIYFVGANVFLPSLFLFIALFNVTRLLGSIAFVLATGRDWLLLKLCLCSGGMAKRKQKKEKVKDQDGFGALDSSKVLRHSA
>chs_Spar6 Saprolegnia parasitica
MSRRNYAPAARGGNGPRPNMNPANMGPPPQMPPPQHGSGRNLRAPPPRQVQQQGSFDRDSDDDMYGQHGANGVLGVPVWQDSAAYEHGSYLQENTPPSQRIPYPGPQGGGMMPPPGGMMPPPQQFAPAPVRMLNAGALQNQSNQQMQVMRDSNVGQAVPASTPEAFRIIASGVTAEAEARYQSAVNDFLAGGEMLALVSEREADPHIRSLLNTKAIQVLEWSKNLHDWYQQGMRGPMPRRFVGKIGVNVVNRLGACAGRIDAGSPSELRTMYYTPAANKVVSDFTKDGYRLQCIEEGRTPQLMVVITMYNEDQVEMYSTLKKVANNIAHIKSQKLPGYEGDDAWKNILVVIVSDGRTKANKGTLAFLRDVGAFDEDVMNILMVGVDVMCHVFEFCVQLKKANTIEASASSSERYPPTQVVFALKEHNGGKLNSHEWYFNAFAEQIQPEYTVLLDVGTMPTAKAFYLLLCAMEIDPQIGGTCGEIAVDKPIPHLCNWVIAAQHFEYKISNVMDKSLESVFGFISVLPGAFSAYRYKAIRGAPLEAYFKSLTTPMNELGPFQGNMYLAEDRILCFELLARRNCRWTMQYVKDAIARTDVPTDLVALIGQRRRWLNGSFFALVYTILNWGRVYTESNHSYIRKFFLCIQYAYMTANVALSWVLPANFFLVTYFLVIVGFKQNNWGYIPTSGIPENTKDIIVQVFSLLYGTSFLIQLVAGLGNKPKHIKGVYRLTAVFYALVMLLTSVIAFGFIMKPWIDSLRSGVPFAAMVTSFEIKDIAAFVASVGVFFLASCLHCEMHHIAMSFIQYMCLLPTFVNILNTYSFCNLHDLSWGTKGLESSDGHGPKAGGGGNYKDAAEAKKAEEARKKKEGEIKDKMEGDFQYFRSKLLIFWLLSQMGFAYLIISFDSVNGQEGANYLKFLFYIVAGFNLFRLFGSTYFLLIEARICVEKMCIRGTMRDRLKAKKKKQAIRAAQTQHV
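To get a quick overview of a file like this, the records can be grouped by the species name in each header. The following is a rough sketch that assumes the header layout seen above (an identifier, a space, then the species name). The function names and the two toy records are hypothetical and are not taken from the real file.

```python
from collections import defaultdict


def read_fasta(lines):
    """Yield (identifier, description, sequence) for each FASTA record."""
    name, description, chunks = None, "", []
    for raw in lines:
        line = raw.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, description, "".join(chunks)
            # Assumed header layout: ">identifier species name"
            name, _, description = line[1:].partition(" ")
            chunks = []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        yield name, description, "".join(chunks)


def lengths_by_species(lines):
    """Map each species to a list of (identifier, sequence length)."""
    summary = defaultdict(list)
    for name, species, sequence in read_fasta(lines):
        summary[species].append((name, len(sequence)))
    return dict(summary)


# Toy records, made up for illustration only.
demo = [
    ">chs_A1 Species one",
    "MSGAPP",
    "PSSG",
    ">chs_B1 Species two",
    "MEQDG",
]
print(lengths_by_species(demo))
```

For the real file, an open file handle can be passed in place of the list, for example `lengths_by_species(open("CHS-gene.fa"))`, since the functions only iterate over lines.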

For the next meeting, which will take place on Monday the 11th after Seminar 2, we plan to have done the following:

Additionally, after presenting to each other what we have found individually, we plan to start on the next task, which is to extract the largest contig and BLAST it against the P. infestans genome, which was assembled in 2009. We will also look into the tool MUMmer, which should already be available at UppMax, for the comparison of contigs.
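Picking out the largest contig could be done with a small script along these lines. This is only a sketch under the assumption that the contigs are in FASTA format. The function name and the toy contigs are hypothetical, and the real input would be the contig file written by ABySS.

```python
from itertools import chain


def longest_contig(lines):
    """Return (header, sequence) of the longest record in FASTA-formatted
    lines, or (None, "") if there are no records."""
    best = (None, "")
    header, chunks = None, []
    # A trailing sentinel header makes the loop flush the final record.
    for raw in chain(lines, [">"]):
        line = raw.strip()
        if line.startswith(">"):
            sequence = "".join(chunks)
            if header is not None and len(sequence) > len(best[1]):
                best = (header, sequence)
            header, chunks = line[1:], []
        elif line:
            chunks.append(line)
    return best


# Toy contigs, made up for illustration only.
toy = [">c1", "ACGT", ">c2", "ACGTAC", "GT", ">c3", "AC"]
header, sequence = longest_contig(toy)
print(header, len(sequence))
```

The returned header and sequence can then be written to a one-record FASTA file and used as the query in the BLAST search.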