Sushi Collection information
Date: Tuesday, August 27 2019
Time: 5:00pm
Location: Ginza Sushi & Bar
Mini Sashimi Matsu
(from l to r: akami, salmon, yellowtail)
bottom row (used for lab assignment): bara sushi roll (fished used – maguro)
- The fish were stored in the freezer until lab on Tuesday, September 3rd.
Tuesday, September 3rd 2019
DNA Extraction From Animal Tissue Protocol
Reagents: Commercial DNA Extraction Lit (Sigma REDExtract-N-Amp Tissue PCR Kit)
- Extraction Solution (labeled ES)
- Tissue Preparation Solution (labeled TPS)
- Neutralizing Solution (labeled NS)
Materials:
- P200 microcentrifuge
- 5 mL microcentrifuge tubes
- Razor blades
- Heat block
- Vortex
- Ice
- Sharpie
- Gloves
1. Recorded the information about my samples on the “Animal Tissue DNA Extraction” data sheet along with giving each of my samples a unique ID code.
MM01 – Akami, MM02 – Salmon, MM03 – Yellowtail, MM04 – Maguro
2. Put on gloves and labeled one 1.5 mL screw-cap/lock lid microcentrifuge tube for each of my samples with the unique ID code. Used a sharpie to write the unique ID code on the side and on the top of each of the tubes.
3. Took my samples to free table to cut out a small piece. Using a razor blade, I carefully cut out a small square of tissue on a paper plate. Also made sure to clean the cutting utensil with Ethanol between specimens, and cut them on different parts of plate.
4. Weighed my sample on the scale using a weigh boat ( ~ 2 – 10 mg ) of sample tissue.
5. Added 100 mL of Extraction Solution (ES) to each of your labeled sample tubes using a p200 µL micropipette and filtered tips
6. Added 25 mL of Tissue Preparation Solution (TPS) to the microcentrifuge tubes with the 100 mL of Extraction Solution (ES) and micropipetted up and down to mix using a p200 µL micropipette and filtered tips.
7. Carefully added each sample to its corresponding extraction microcentrifuge tube using forceps.
8. Took a disposable non-filtered pipette tip and mashed my tissue sample up a bit.
9. Incubated the sample at room temperature for 10 minutes.
10. Moved my samples to the heat block and incubated the sample at 95o C for 3 minutes.
11. Took the sample out of the heat block. Added 100 ml of Neutralizing Solution (NS) using a p200 pipette and filtered tips and mixed by vortexing to vigorously mix up my samples.
12. Placed my samples on ice for the time being.
Amplifying CO1 from Fish Procedure
To amplify COI from my fish DNA, I used components that came in the same kit that I used to isolate genomic DNA (gDNA). The Sigma REDExtract-N-Amp Kit included a solution for use in PCR amplification that includes Taq, dNTPs, and buffers that work optimally given the reagents that are still in your gDNA sample from the isolation procedure
USED FILTER TIPS FOR ALL PCR STEPS
Diluting gDNA
10x dilution of my gDNA:
1. Labeled a microcentrifuge tube at my lab bench with “1:10”, the name of each my samples and initials on the top and side of the tube.
2. Added 18 mL of purified water to the labeled tubes.
3. Added 2 mL of my gDNA to the tube.
4. Gently flicked the tube to mix the solution.
PCR Reaction
Each of the PCR reactions included the following reagents and volumes:
Reagent Volume
Water (PCR Quality – autoclaved, filtered) 6.4 µl
REDExtract-N-Amp PCR rxn mix 10 µl
Forward Primer FbcF 0.8 µl
Reverse Primer FbcR 0.8 µl
Tissue Extract (gDNA) (1:10 dilution) 2 µl
Total Volume 20 µl
The standard procedure when setting up multiple PCR reactions under the same conditions is to make a master mix that includes the combined volumes of all of the reagents for multiple reactions, except for the component that is different. This improves success by minimizing errors due to pipetting small volumes multiple times. For this reaction, I made up a master mix that included all reaction components except gDNA. One master mix was made for my entire lab table.
The convention for making a master mix is to use the volumes for the individual reactions (listed above) multiplied by the number of reactions, plus some extra just in case. For example, if we have 6 samples, one negative control, and we want to make sure we have a little extra. So we would multiply the volumes times 6 + 1+ 1 = 8.
The additional reagents are needed to account for minor errors in pipetting that could result in not having enough total volume in the master mix for all individual reactions. A negative control is all the reagents required for a PCR reaction, but NO gDNA template. Thus, if anything is amplified in the negative control, you know there is a source of contaminating gDNA, other than the samples you are trying to PRC amplify.
Master Mix Recipe:
Reagent Volume (1x) Master (volume x 18)
Water (PCR Quality) 6.4 µl 115.2 µl
REDExtract-N-Amp PCR rx mix 10 µl 180 µl
Forward Primer 0.8 µl 14.4 µl
Reverse Primer 0.8 µl 14.4 µl
Tissue Extract (gDNA- 1:10 dilution) 2 µl ———
Total Volume 20 µl 324 µl
1. Wrote the labels of my gDNA sample on my PCR tubes on the top and side below the lid.
2. Added 2 mL of the 1:10 dilution of my gDNA to each of the PCR tubes, except the negative control. Made sure that my gDNA and mastermix were mixed together and changed tips between each sample.
3. Pipetted 18 mL of the master mix into each of my PCR tubes, including the negative control.
I left this reaction on ice next to the thermocycler until all PCR reactions were set up. Then I placed the PCR tubes (all samples and the negative control) in the thermocycler and started the reaction, which took between 1.5-2 hours. MyPCR reactions will be placed in the freezer when the cycling is complete.
Settings for the Thermocycler:
94o C – 4 min (initial denaturation)
30 cycles of:
94o C for 30 sec (denaturing)
52o C for 40 sec (annealing)
72o C for 1 min (extension)
72o C for 10 min (final extension)
10o C hold
Fish DNA Barcode
Tuesday, September 17, 2019
Gel Electrophoresis/PCR Cleanup
Gel Electrophoresis of PCR products:
- Grabbed PCR tubes from ice and thawed
- Dotted out 16 loading dye dots (1 µL) on a sheet of parafilm
- Pipetted 3 µL of each PCR product into its own dot
- Loaded all the dots into the gel (5 µL each)
- Ran the gel at 130 volt for 30 minutes
Gel Electrophoresis Loading Order for Queen Salmon:
Lane # Sample ID #
TOP
1 OY01
2 OY02
3 OY03
4 OY04
5 MM01
6 MM02
7 MM03
8 MM04
9 KRS 01
10 KRS02
11 KRS03
12 KRS04
13 Negative Control
14 EB01
15 Ladder
BOTTOM
16 EB02
17 EB03
18 EB04
19 Ladder
Clean-up of PCR products for sequencing – ExoSAP:
- Labeled new 0.2 µL PCR tubes with each of my sample codes (MM01, MM02, MM03, MM04) and shared an 8-strip with my lab partner which will be sent off for sequencing.
- Determined the number of PCR clean-ups my table will make up.
- Calculated the volume of Master mix needed (calculated for a couple additional cleanups to account for pipette errors).
- Placed reagents on ice.
- Pipetted 7.5 uL of each PCR product into a clean, labeled 0.2 uL PCR tube.
- Made up the ExoSap master mix, keeping the reagents on ice while it was made.
- Pipetted 12.5 uL into each PCR product tube.
- Placed the tubes in a thermocycler and started the EXOSAP program.
- After program completed (~ 45 minutes) placed PCR tubes in a labeled tube rack and put in the freezer.
ExoSap PCR Clean-Up Protocol
Recipe to clean-up one PCR reaction of 7.5 uL
Master Mix: Rxn: 1 Rxns: 18
H2O 10.59 uL 190.6 µL
10x buffer (Sap 10x) 1.25 uL 22.5 µL
SAP 0.44 uL 7.92 µL
Exo 0.22 uL 3.96 µL
Master Mix Total 12.5 uL 225 µL
PCR Product:
PCR 7.5 ul
Total Cleaned-up Volume 20.0 uL
Tuesday, October 1st 2019
Geneious Lab (DNA and protein sequence data)
*Followed protocol*
- Created a folder to store my fish barcodes and downloaded the classes DNA barcoding sequences (forward and reverse reads) from Canvas which were dropped into my fish barcode folder
- My PCR failed completely because the sequence was very short. Therefore, I had the DNA sequences of another students from a previous year (KJ01 and KJ02)
- At the sequence view, reverse complimented my sequence so that the orientation was switched and the base call were complimentary to the previous one.
- Assembled forward and reverse sequence reads of the same sample —- copied the reverse sequences from the ‘Fish barcode Reverse Reads’ folder and pasted them in the ‘Forward Reads’ folder. Selected two of my sequences that have the same ID but one was indicative of forward and the other of reverse. Right clicked and chose ‘De novo assembly.’
- Clicked ‘Allow Editing’ and deleted any bases on the two ends that were unreadable — this is important because it only keeps the base calls that I have confidence in.
- Looked at the consensus sequence for any ambiguities (N, Y, R) and determined its discrepancy; edited the sequence with the poor read to match the base call of the good read.
- Closed and saved my sequence assembly.
- Selected the assembly file that I just saved and right clicked ‘Generat consenseus sequence’ and kept the default settings.
- Selected the new consensus sequence file and right clicked ‘BLAST’ that will search the database for highly similar sequences
- Hit Table opened up a set of the 100 top matches to my sequence.
- Built an alignment with my sequence and some of the top 100 hits
- Created a new folder ‘Fish barcode test alignment’, went back to my fish barcode colder and selected ‘…assembly consensus sequence’ file, right clicked and chose ‘Copy documnet’ and selected the new ‘Fish barcode test alignment’ folder to paste it in.
- Scrolled through BLAST search results and selected about 5-10 hits and copied it into the ‘Fish barcode alignmnet’ tab.
- Double clicked the ‘nucleotide alignment file’ to open the alignment and noticed any polymorphisms in my alignment (looked at the consensus identity for the ambiguity code)
- The sequences for the species sampled (KJ01) stated it was a Thunnus albacares (yellowfin tuna) which was the same type of fish that the student thought their fish to be. The sequences for the species sampled (KJ02) stated that it was a Oncorhynchus tshantytscha voucher (chinook salmon) otherwise known as king salmon which was the same type of fish that the student thought their fish to be.
- First 10 polymorphic sites found in the alignment based on the column found for KJ01 (4 sites) and KJ02 (13 sites)
KJ01
- Column 9 – [C] consensus identity: GU673630
- Column 21 – [T] consensus identity: GU672630
- Column 42 – [A] consensus identity: GU324196
- Column 273 – [T] consensus identity: GU673630
KJ02
- Column 55 – [G] consensus identity: JX960909; [C] consensus identity: FJ998625; [A] consensus identity: FJ998949
- Column 69 – [A] consensus identity: FJ998625
- Column 78 – [G] consensus identity: FJ998625
- Column 108 – [T] consensus identity: FJ998625
- Column 113 – [T] consensus identity: FJ998625
- Column 117 – [C] consensus identity: FJ998625
- Column 123 – [A] consensus identity: FJ998625
- Column 165 – [C] consensus identity: FJ998625
- Column 171 – [A] consensus identity: JX960909; [A] consensus identity: FJ998625
- Column 204 – [C] consensus identity: FJ998625
Consensus sequence of KJ01, KJ02, and 25 sequences of the COI gene from Actinopterygii
-included one ray (outgroup)
Tuesday, October 8th 2019
Geneious Lab (Phylogenetic Inference)
- Cleaned up my alignment of COI sequences (fish DNA barcode sequences and sequences downloaded from NCBI through Geneious) by editing the alignment down so that all sequences begin and end at the same point — clicked ‘Allow Editing’ and chose the bases (all the columns in the alignment) that precede where the sequences start (blue selected parts) and the ones that are beyond the sequences (blue selected parts)
- Looking at my sequence, I found 9 columns of my alignment to be polymorphic \
Choosing the best model of Molecular Evolution:
- Used the jModelTest2 program (downloaded) to figure out the best model of molecular evolution for my sequences.
- Exported my alignment in Geneious in Phylip format (relaxed) – selected my alignment and from the top ‘File’ menu chose ‘Export > Selected documents >’ and once a new window appeared, scrolled down to ‘Phylip(*.phy)’ and clicked ok. A window appeared after I chose where to save the file and selected the ‘releaxed’ format.
- Went back to the jMOdelTest2 program and opened the exported alignment using ‘File > Load DNA Alignment’ in the menu.
- Selected ‘Analysis > Compute likelihood scores’ from the menu. Kept the defaults that are in the next window and chose ‘Compute Likelihoods.’ The program took about 10-20 minutes because it went through a hierarchal set of 88 models (increasing complexity) and calculated the likelihood for each of those models of molecular evolution, based on an initial tree that is used for all tests.
- Two methods are used: Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC). Under the Analysis menu chose ‘Do AIC,’ a window appeared, kept the defaults, and clicked ‘Do AIC calculations.’ This analysis is done instantaneously. In the main jModelTest window clicked ‘Results > Show results table’ and clicked the ‘AIC’ tab at the top. Clicked on the ‘AIC’ column to bring the top scoring model to the top (red). My best model based on AIC is: GTR + I + G
- Followed the same steps from above to check the best model based on BIC. My best model based on BIC: GTR + I + G. The AIC and BIC chose the same model. The comparison of these models are important because more complicated models will always have a better fit, but it is necessary to see if the better fit is worth the cost of the extra parameters.
Bayesian Inference
- Selected my alignment in Geneious, right clicked to select “Tree…” and chose ‘MrBayes.’ Chose the ‘GTR’ (complicated) substitution model — closest to the jModelTest choice in complexity.
- Looked at the ‘Rate Variation’ and chose ‘I + G( (invgamma).’ Chose the Chondrostroma voucher as my outgroup. Looked under the MCMC settings. The ‘Chain length’ is how many samples of the posterior distribution my analysis will use. In a real analysis this number will be very large (10-50 million). Set it to 10,100. My ‘sample frequency’ is how often I actually save the inferred trees for the posterior distribution – don’t want this to be too small (less than 100) because I may sample trees with only very slight differences. It’s better to sample a little bit more sparsely — used the default of 200. Below ‘Subsampling freq’ there is ‘Burn-in length.’ This is the # of samples that should be removed at the beginning of my analysis because MCMC is a ‘hill-climbing’ algorithm indicating that the samples from the beginning are probably poor estimates of the phylogeny. This set to 100.
- The number of trees sampled will be the number of samples (10,100) – the burn in (100) = 50. The heated chains remain at 4 and the heated chain temp and random seed defaults remain the same.
- Priors used: ‘Unconstrained Branch Length: Exponential (10), Shape Parameter (10) — these affect the shape of the prior statistical distribution.
- Ran my analysis. Double clicked the ‘Posterior Output’ line which opened a new window. Clicked the ‘Parameter estimates tab.’ Clicked on the ‘Trace’ tab which showed an upward slope (increase in posterior probability). The analysis was too short and did not do a good job sampling the posterior distribution.
- Clicked the ‘Tree’ tab to look at the inferred tree. Clicked ‘Show Branch Labels’ and for ‘Display’ chose posterior probability. This labels the branches with their support values based on the percentage of the trees that had a particular clade present.
- Ran the analysis again but used 100,000 for the ‘chain length’ and 10 for the ‘burn in.’ I recovered the same clades in my tree but they didn’t have the same similar support values.
Maximum Likelihood
- Used maximum likelihood to infer a phylogenetic tree of my aligned data set. Installed the plugin RAxML.
- Chose my evolutionary model (similar choices to the Bayesian analysis) and chose ‘Rapid bootstrap with rapid hill climbing.’
- When RAxML is done, there is a new line called ‘RAxML bootstrapping trees.’ To build a consensus tree, selected this line, right clicked ‘Tree’ ‘Consensus Tree Builder,’ ‘Create consensus tree,’ and ‘Support threshold’ of 50% and ok. The consensus tree will be listed as the 1st tree in the ‘RAxML bootstrapping trees’ file. The clades with higher support don’t match those from my Bayesian analyses.
- Downloaded the PHYML plugin and ran a maximum likelihood with bootstrapping in PHYML. Used the HKY85 model of molecular evolution for this and the final Bayesian tree. Used the tree window to manipulate the resulting best ML tree with bootstrap support. Made a ray species the root, showed the bootstrap proportions, and made the tree aesthetically pleasing by adjusting the line weights, font size, and tree display.
- Ran the same alignment and the HYY85 model of molecular evolution to infer my best tree using MrBayes. This was ran at 3,000,000 generations, subsample frequency set to 500, and set my burn-in 300,000. Chose the ray species as the outgroup. When the run is complete, the final tree with support values, posterior distribution, and the trace was exported.