admin管理员组

文章数量:1530987

2024年1月21日发(作者:)

生物圈/Tour Guide for Windows and Macintosh

Introduction

SEQUENCHER has been developed to work with a wide range of sequencing

applications. For example, SEQUENCHER can be used to:

Create assemblies for shotgun or EST sequencing projects

Edit contigs while viewing all relevant trace data

Assemble multiple sequences to a user-defined Reference Sequence

Detect and annotate polymorphisms

Align cDNAs to their genomic sequence using the Large Gap algorithm

Discover heterozygous peaks

Create difference reports for SNP discovery

Display restriction maps, ORF maps, protein translations

Automatically trim poor quality and vector sequences

Macintosh and PC Support

SEQUENCHER is available for both Macintosh and PC platforms. The demo CD

provides software for both environments.

Unlimited Trial

With the SEQUENCHER demo, you can use your own data and enjoy an unlimited

evaluation period. The only functions not available are copying data to other

applications, saving, exporting, printing, and reporting. Some sample data have been

included to get you started, but once you have tried SEQUENCHER with the sample

files, please try using your own data so you can see precisely what SEQUENCHER

can do for you.

What You Will Learn in This Tutorial

The purpose of this tutorial is to guide you through SEQUENCHER‘s core assembly

and editing functions. Additional application-specific tutorials are included on your

CD in PDF format. In this tutorial, you will

1. Install the SEQUENCHER demo

2. Create a new project

3. Import data

4. Trim sequences

5. Assemble a contig

6. View contig assembly

7. Edit assembled chromatograms

8. Find heterozygotes

9. Work with a Reference Sequence

10. Translate sequences to amino acids

生物圈11. Annotate a sequence

12. Create a Variance Table and Report

13. Create a Translated Variance Table

14. What else can I do with SEQUENCHER?

/Once you have mastered these techniques, you will be ready to explore

SEQUENCHER ’s other powerful features.

Conventions Used in this Guide

Menu items or keys that you are to select are in bold. The purple text provides step-by-step instructions for running through the tour guide and the black text provides

additional information. Greater than symbols define menu > submenu commands.

Before you start

Check that you have the appropriate hardware and disk space.

Additional Requirements will vary with your project.

MACINTOSH RECOMMENDED REQUIREMENTS:

10.3.9 and higher

512 MB RAM

150 MB hard disk space

WINDOWS MINIMUM REQUIREMENTS:

Windows 2000 and higher

512 MB RAM

150 MB hard disk space

1. Install SEQUENCHER Demo

Insert the demo CD into your machine.

Double-click on the Sequencher 4.9 Demo installer icon.

The installer will create a Sequencher 4.9 Demo folder in your Applications (Mac) or

Program Files > Gene Codes (Windows) folder.

2. Create a New Project

Launch SEQUENCHER by double-clicking on the Sequencher Demo icon.

On your Macintosh, this will be in the opened Sequencher Demo folder. On a

PC there is a Sequencher Demo icon on your desktop.

When SEQUENCHER is launched, a dialog will alert you that you are running in

Demo Mode. After you hit the OK command button on the alert dialog,

生物圈/SEQUENCHER will present you with an empty Project window. This is where you

import, manipulate, and display sequence fragments and assembled contigs.

New Project Window

3. Import Data

To import data, select Import > Folder from the File menu.

Browse to the Applications (Mac) or Program Files > Gene Codes (Windows)

then Sequencher 4.9 Demo > Sample Data > Demo Sample Data > Demo

Sequences folder.

Click on the Choose (Mac) or OK (Windows) button at the bottom window.

When prompted to import the 9 files, select the Import All Files in Folder

command button. The Project now contains the 9 sequences.

Imported Sequences

These imported files, with their associated quality scores, are just one example of the

wide variety of file types SEQUENCHER accepts for import. Note that the Quality

column displays the % quality for each of the imported sequences—the percent of

bases that are above the low quality threshold as set in the Confidence User

Preference pane.

4. Trim Sequences

SEQUENCHER has tools that allow you to trim imported sequences based on several

different criteria: ambiguous data, data that have low confidence scores, or data

contaminated with vector sequence. The trimmed data are fully recoverable within the

SEQUENCHER project. To trim the low confidence sequence:

生物圈•

/From the menu bar, choose Select > Select All to highlight all sequences if

they aren’t already highlighted.

From the menu bar, choose Sequence > SEQUENCHER displays

the Ends Trimming window for the default trimming parameters.

SEQUENCHER recognizes poor quality sequence based on a number of criteria. The

confidence score, provided in these samples, is one of the most sensitive. Increasing

the stringency of the trim criteria further increases the quality of your data.

Select the Change Trim Criteria button.

Uncheck all but the three criteria checked below, and adjust the values of the

two confidence trims to match.

Click OK to return to the overview for Ends Trimming.

The Ends Trimming window displays how much poor quality data will be trimmed

based on the defined criteria. You have the additional option to individually deselect a

fragment for trimming on either the 5' or 3' end by removing the "X" from the

appropriate box below the trim graphic.

Ends Trimming

Click the Trim Checked Items button at the top of the window, and then click

the Trim button when asked for confirmation.

生物圈/The data that you have removed are completely recoverable. SEQUENCHER always

stores two copies of every imported Sequence, the original sequence and the data as

you have edited it in SEQUENCHER.

Close the Ends Trimming window and return to the Project window by

clicking on the close control in the upper corner of the window.

5. Assemble a Contig

Several alignment algorithms are provided with SEQUENCHER to accommodate the

wide variety of assembly applications. For this example, you will use the Dirty Data

algorithm because it is best suited for data that may include the occasional

ambiguities or miss-calls generated by automated sequencers. The Assembly

Strategies and the Assemble by Name tutorials in the Sequencher 4.9 DemoTutorials

folder explain the other assembly options.

生物圈•

/Click on the Assembly Parameters button at the top of the Project window.

Accept the defaults for the Assembly Algorithm, Minimum Match Percentage,

and Minimum Overlap parameters. They should be Dirty Data, 85% and 20

bases, respectively.

Optimize gap placement by selecting Use ReAligner and Prefer 3’ Gap

Placement, if not already selected.

Select OK.

Once you have returned to the Project window, you are ready to begin assembly.

All of the sequence fragments should be selected. If they are not, select them

now using Select > Select All.

Click on the Assemble Automatically button at the top of the Project window.

Click Close to dismiss the dialog.

6. View Contig Assembly

The Contig User Preference defines sorting criteria for new assemblies. The default

preference sorts the fragments according to position, 5' to 3', within the contig.

SEQUENCHER provides a number of alternative sorting options.

Double-click on the Contig[0001] icon to open the contig Overview window.

Click on the Sort button, select the by Strand radio button, and click on OK to

sort the fragments by strand.

Contig Overview

Click on the Sort button, select the by Position radio button, and click OK to

return to the original sorting order.

The Overview contains three sections. The top section displays a schematic of how

the fragments are assembled in this contig. The arrows indicate the direction of the

fragment in relation to the assembly.

生物圈/

SEQUENCHER provides a Selection Marquee that allows you to navigate from

within the Overview into the Bases window.

The next section provides coverage information. For instance, the consensus called

around base 1206 has less coverage than the surrounding consensus bases.

Below the coverage bar is the open reading frame map. Three bars marked with green

flags and red lines, representing start and stop codons respectively.

Click and drag on the Selection Marquee in the Overview so that it selects the

region around base 1,206..

Select the Bases button at the top of the window to open the Contig Editor and

view the base sequences that assemble at this position.

7. Edit Assembled Chromatograms

The Contig Editor provides the tools for checking and editing sequences. It is divided

into four quadrants. You can modify the appearance of the Contig Editor from the

View menu and in your User Preferences.

Select View > Display Color Bases.

Under Window > User Preferences, change Display > Contig so the Font is

Courier New and the Size is 18.

Close the User Preferences window.

生物圈/

Contig Editor

The two upper panels show the individual fragment names to the left with their

sequences to the right. The Agent Box contains descriptive information about your

sequences and your selection. The lower right panel displays consensus information

including ambiguities <+> and disagreements <•> if there are any. The blue shading

describes the confidence, low, medium, and high, with the lighter shades defining

higher confidence. Note that you may toggle off this shading by selecting View >

Display Base Confidences.

To begin the editing process, move your selection to base one in the consensus.

The Select menu provides several tools to navigate to areas of interest in the contig.

From the Select menu, choose the command Next Ambiguous Base. The

Agent box reads "4 frag bases selected at consensus position 83," and "Select

Next Ambiguous Base = spacebar".

You have now moved your position in the consensus to the position of the first

ambiguous base. An ambiguous base in the consensus is any that includes: 1) A

contributing fragment base that is not an A, C, G, or T, 2) Disagreements between

other fragment bases, and 3) All fragment bases that contribute to the consensus have

low quality scores. Position 83 is flagged, because the base called in three of the

fragments is an "N".

To view the chromatograms at this position, while your selection is still on the

ambiguous consensus position, select the Show Chromatograms button at the

top of the window.

生物圈/

Assembled Trace Windows

Each chromatogram window displays the current version of the base calls in black

and above a line that separates them from the original base calls. At the left of the

trace window, SEQUENCHER provides tools that allow you to manipulate how the

traces are displayed. The volume control bars allow you to adjust the chromatogram

peak height. The A, C, G, and T buttons allow you to turn off the display of the signal

from any or all of the bases. Additional controls to format the traces are available

through the Window menu under Display pane.

The base call at position 83 is ambiguous in the forward direction, probably

because the irregular spacing challenged the original base caller. To change

the "N" to "C", with your selection still on the consensus base, type "C". Your

edit will correct the consensus and every sequence in the contig at that

position.

Note that SEQUENCHER displays the edited base call in a contrasting color. You are

now ready to continue editing your sequence. You can move in your contig from one

region of ambiguity to the next using SEQUENCHER ’s navigational tools.

From the Select menu, choose Next Ambiguous Base or use the spacebar to

execute the previous Select command.

SEQUENCHER jumps to the next ambiguity in the contig.

生物圈/8. Call Heterozygous bases

The next ambiguous base in the consensus is at position 126. One forward sequence

calls a T and three forward sequences call a C. A quick look at the chromatogram,

however, shows that the forward and the reverse sequences have both C and T peaks

at position 126. This sequence is from a mixed population.

Raise the slider for the 1-Reverse sequence to amplify the signal.

Type "Y" in the consensus line at position 126.

In SEQUENCHER, you can manually call each heterozygote one at a time, or you can

use the Call function to automatically find the heterozygous bases.

From the Sequence menu, select Call

Change the Minimum lower peak height to 35%.

Click on the Only make changes that result in an ambiguity checkbox to turn it

on.

Click OK.

SEQUENCHER displays a dialog so that you can confirm the changes to 41 base calls.

Click Continue.

With your cursor still in the consensus, click the spacebar to select the next

ambiguous base position.

生物圈/

The next ambiguous base position is 320. At this position, you will find that all of the

contributing sequences have bold magenta "R"s. The color indicates that the

sequences have been edited, in this case automatically by the Call Secondary Peaks…

function. The "R" is the IUPAC abbreviation for a mix of puRines, or "A or G". The

"+" under the consensus line flags the ambiguous position.

Continue selecting and editing until all the ambiguities are resolved.

Note that not all of the automatically called bases will be true heterozygotes nor will

this function capture all heterozygotes, but it will find all bases that have secondary

peaks that are at least 35% of the primary peak.

Close the Contig and Chromatogram windows by clicking on the close buttons

in the corners of each.

9. Translate Sequences to Amino Acids

There are numerous potential applications for the Reference Sequence. The Reference

Sequence facilitates comparative sequence alignments, defines base numbering, and

boosts assembly speed. In this Tour Guide, we will use the Reference Sequence

function to compare the assembled trace sequences to a known text sequence.

• From the Project window, select File > Import > Sequencher Project… and

navigate to the Demo Sample Data folder from which you imported the trace files.

• Click on the file HepC , then click on the Open button.

The new sequence, AB049090, is now in your project. You can tell that this sequence

is already a Reference, because it has an "R" in the icon, and "Ref:" precedes "DNA

Fragment" in the Kind column. You can make any sequence a Reference Sequence in

SEQUENCHER from the Sequence menu. You can also right click or Ctrl+click on

the sequence name to invoke the context sensitive menu. The Reference Sequence

tutorial provides more information on the Reference functions.

生物圈/You can use the same assembly parameters that we used for the autoseq fragments.

Note that the current parameters are listed in the Project window just below the button

bar.

Choose Select > Select All and click on the Assemble to Reference button.

Click Close to dismiss the Assembly Completed dialog.

The Reference Sequence is now incorporated into Contig[0001].

Double-click on the Contig[0001] icon to display the contig Overview.

Immediately you can see that the Reference contributes to the contig in a different

manner than the non-Reference sequences. For example, the numbering of the

sequence, as it appears on the coverage bar, is negative until the Reference begins to

contribute. Note the white space at the 3’ end of the coverage bar indicates that the

Reference does not contribute coverage at all. The blue bar in the Reference is a

graphic representation of the CDS feature.

When the data that extends beyond the Reference is not of interest, you can trim the

contig sequences to the Reference.

From the Contig menu, select the Trim to Reference Sequence command.

Contig[0001] now starts at base position 1. It is useful to view a Reference Sequence

when editing a sample sequence. The Reference Sequence will guide you so you can

critically examine the differences between the sample consensus and the Reference.

Yet the Reference will not contribute to the consensus nor will the Reference be

affected by edits in the consensus.

10. Translate Sequences to Amino Acids

SEQUENCHER provides a variety of ways to display the translation of DNA

sequence. For instance, you can display one or all three of the reading frames below

the sequence while editing.

Drag the Selection Marquee in the Overview to the 5’ end of the contig.

Click on the Bases button.

Place your cursor on base position 1 in the consensus line and click on it.

Click on the [ • ] button in the lower right corner of the Contig Editor.

SEQUENCHER displays the translated consensus.

The first click changes the [ • ] to a [ 1 ], displaying the translation in the first reading

frame. This button will toggle through each of the reading frames, followed by the

display of all three and then concluding with the display of an [ r ] which displays

生物圈/both the Reference Sequence translation and the consensus translation in the

Reference Frame.

Click on the upper translation button until it reaches a [ 1 ] to leave the

translation in the first reading frame.

To toggle the display of the Reference Sequence translation, click on the lower

translation button, with an R icon.

Contig With Protein Translation of Consensus and Reference Sequence

11. Annotate a Sequence

SEQUENCHER provides for sequence annotation by allowing you to create a feature

for a single base or a range of bases.

If your cursor is not in the consensus, select a base in the consensus.

From the menu bar, choose Select > Bases by Number….

In the Select Base dialog, enter 208 in both text boxes.

Click OK.

At this position, the sample sequence has a Valine, GTG, and the Reference codes for

a Methionine, ATG.

Choose Sequence > Mark Selection As Feature.

From the drop down Feature Key menu, choose variation.

In the Feature Name box, give your feature a brief name like "A-> G, Met->

Val".

The default Feature Style for a variation is red and underlined, but you may choose

any style you wish.

Click OK.

If the View > Display Features menu item is not checked, select it.

Turn off Display Color Bases from the View menu.

生物圈/The feature will now be displayed in both the Bases view and the Overview. Note that

in the Overview, the red single base feature displays over the range of bases covered

in the blue CDS feature.

Feature in the Bases View

Feature in the Overview

12. Create a Variance Table and Report

After you have edited the sample sequence so that you are confident that the

consensus base calls are correct, you can generate a comparison report of how the

sample differs from the Reference.

Close the Contig Editor window.

With Contig[0001] selected, choose Contig > Compare Consensus to

Reference.

SEQUENCHER displays a table listing the differences between the consensus

sequence that you edited and the Reference Sequence that you imported. Note that

base 208 is still red and underlined.

You can also modify the look of this table.

Click on the symbol that looks like an open elevator button in the bottom left

corner to expand the width of the columns.

From the View menu, choose Colors As Backgrounds.

生物圈/This report lists the differences between the consensus of one contig and the

Reference, but it is also possible to create this report for hundreds of contigs, when

they share the same Reference.

Variance Table

The Variance Table also acts as a link to the original data.

Double-click on the cell at base position 208.

SEQUENCHER rearranges the windows and opens the Contig and Chromatogram

Editors for that base position.

Use the arrow keys on your keyboard to navigate to bases of interest.

Create a new variation feature from the Variance Table at position 338 by

selecting that cell in the Variance Table and then executing the Sequence >

Mark Selection As Feature command.

Click OK.

SEQUENCHER defaults to the variation feature key, because this was the last feature

key used. The new feature name is "variation" and it is also red and underlined.

Click on the Reports button to open the Reports dialog

You have several reporting options available to you, so that you can share the

contents of the Variance Table outside of SEQUENCHER. Reporting is disabled in

the Demo, but you can view sample reports in SEQUENCHER help. The image

below is the result of creating a Variance Table from the Row Selection for positions

153 – 338.

生物圈/

13. Create a Translated Variance Table

In addition to the Variance Table, which displays the results of the comparison of

DNA sequences, Sequencher provides a Translated Variance Table, which displays

the differences in the translation of DNA sequences. The following directs you to

create the translated "sister" table to the currently open Variance Table, but you can

also create a Translated Variance Table directly from a contig by selecting sequence

names in the contig or by selecting contigs in a Project window.

While you are still in the Review mode of this Variance Table, click on the

Translation button on the button bar of the Variance Table window.

生物圈/

Sequencher opens the Translated Variance Table. It is configured in the same way as

the Variance Table. Note that the positions of the other three windows also adjust to

accommodate the new table.

The row of numbers in the left most reference column refer to both the first

base of the codon, above, and amino acid, below, of the Reference Sequence.

Adjacent to the numbers are the corresponding Reference codon and its

translation.

The sequences for comparison are in the columns to the right of the Reference.

In this case, there is only one.

The pink header is to flag any comparison in which the entire length of the

Reference is not covered by the sample sequence.

14. What else can I do in Sequencher?

You have now tried the basic capabilities of SEQUENCHER for assembly and

alignment of DNA sequences. Continue to explore the power of SEQUENCHER

using your own data. We invite you to explore the additional tutorials available as

PDFs on the Demo CD.

Thousands of laboratories around the world have made SEQUENCHER the desktop

standard for DNA assembly and alignment. Once you have worked with

SEQUENCHER, you will understand why.

生物圈

/

本文标签: 软件说明拼接序列