SEQED

[ Program Manual | User's Guide | Data Files | Databases ]

Table of Contents
FUNCTION
DESCRIPTION
EXAMPLE
EDIT NEW OR EXISTING SEQUENCES
SCREEN MODE
COMMAND MODE
DIGITIZER MODE
COMMENT MODE
HEADING MODE
SYSTEM CRASH OR HANGUP
RESTRICTIONS
ACKNOWLEDGEMENTS
SEQUENCE TYPE
COMMAND-LINE SUMMARY
LOCAL DATA FILES
PARAMETER REFERENCE

FUNCTION

[ Top | Next ]

SeqEd is an interactive editor for entering and modifying sequences and for assembling parts of existing sequences into new genetic constructs. You can enter sequences from the keyboard or from a digitizer.

DESCRIPTION

[ Previous | Top | Next ]

SeqEd uses the screen of your terminal as a window into a sequence. It works like the UNIX vi text editor. Changes you make in the sequence take place at the cursor position and are reflected immediately on the screen. You can insert or delete symbols, move the cursor, search for patterns, check sequences by reentering them, and edit documentation and embedded comments.

SetKeys lets you change the positions of the keys on your terminal keyboard to make it more convenient to enter the letters G, A, T, and C.

You can enter a sequence and control SeqEd either from the terminal keyboard or from a Graf/Bar digitizer.

EXAMPLE

[ Previous | Top | Next ]

If you are already familiar with the UNIX vi editor, you can learn to use SeqEd quickly. When you run SeqEd with a command like % seqed sample.seq, your screen will look something like this:


sample.seq            ***** K E Y B O A R D *****                 seqed
    : Some documentary text about your sequence can be placed          :
    : here in the heading.                                             :
    : You can have as many lines of header comments as you wish, and   :
    : you can edit them in this space with the HEAding command.        :

        1 $The scale below has extra dots where comments occur$
        9 <The number indicates which symbol the comment precedes.<
       20 >There can be as many comments as you like>
       28 $The four comments closest to the cursor are displayed$

      AGTCTTAGTCGATCGTAcTGCATRCGA
 ....|:.......:|.........i.......:.|.........|.........|.........|.........|..
      0        10        20        30        40        50        60       70

     ~~~~~~~~~~~~~~~~~~~^
     |......|......|......|......|......|......|......|......|......|......|
      0     10     20     30     40     50     60     70     80     90    100

   "sample.seq"   27 nucleotides

EDIT NEW OR EXISTING SEQUENCES

[ Previous | Top | Next ]

If you name a sequence file that already exists, SeqEd displays the first four lines of documentation on the top of the screen followed by up to four embedded comments and the base number with which each is associated. SeqEd shows the end of the sequence across the middle of the screen.

If the sequence you name does not exist, SeqEd starts in Heading Mode (see below) to allow you to enter documentation for the new sequence. Use <Ctrl>D to stop editing the documentation.

SCREEN MODE

[ Previous | Top | Next ]

Entering a Sequence

In Screen Mode the cursor shows your position in the sequence. You can move around in the sequence, add symbols, delete symbols, and search for patterns. You can insert any valid GCG sequence symbol (see Appendix III) into the sequence by typing the symbol. It is inserted at the cursor.

Deleting a Sequence

The <Delete> key and <Ctrl>H delete the symbols to the left of the cursor, one by one.

Moving the Cursor

To move the cursor to the right, use the <Right-arrow> key; to move to the left, use the <Left-arrow> key. Movements are confined to the length of the sequence.

If you type a number followed by <Return>, the cursor moves to that sequence position.

The arrow keys can be preceded by a number indicating how many symbols to move to the left or right. 10<Right-arrow> moves 10 symbols to the right.

Finding Patterns

To search for a pattern, type a / (slash) in Screen Mode. The cursor moves to the lower-left corner of the screen to let you enter a sequence pattern that you wish to find. You may type in a pattern up to 40 characters long. You can repeat the last search by simply typing /<Return>. SeqEd treats all nucleic acid sequences as circular and finds your pattern even if it wraps from the end of the sequence into the beginning. SeqEd uses the same rules for pattern definition and recognition as the programs FindPatterns, MapPlot, Map, and MapSort.

The command-line parameters -PROtein and -PERFect or the PROtein or PERFect commands in Command Mode make SeqEd treat the sequence as linear and disable the nucleic acid ambiguity meanings of the GCG sequence symbols (see Appendix III) during pattern searches.

The NUCleotide command in Command Mode tells SeqEd to recognize patterns containing IUB nucleotide ambiguity symbols during searches.

Even if SeqEd thinks your sequence is nucleotide, you can request a perfect-match search by typing = after the /. For example, /=RCT only matches RCT (case does not matter) no matter which kind of sequence SeqEd thinks you have.

Finding a Marked Position

You can mark a position in a sequence to which you wish to return. You give the marked position a letter (like giving it a name) using the Command Mode Mark command (see below). Then, in Screen Mode, a single quote followed by the letter used to mark the sequence moves the cursor to the position where that mark was defined.

Leaving Screen Mode

Use <Ctrl>D to leave Screen Mode and enter Command Mode.

Screen Mode Summary

Here is the summary of Screen Mode commands in the on-line help:


                       Screen Mode

           [n] is an optional numeric parameter.

      G, A, T, . . .   - insert a sequence character
      <Delete>         - delete a sequence character
      <Ctrl>H          - delete a sequence character
      /TAACG<Return>   - find the next occurrence of TAACG
                           (last pattern entered is the default)
      1<Return>        - move to start of the sequence
      <Ctrl>E          - move to end of the sequence
      [n]<Right-arrow> - go ahead n characters
      [n]<Left-arrow>  - go back  n characters
      <Up-arrow>       - go up to check sequence
      <Down-arrow>     - go down to original sequence
      'markcharacter   - go to marked position
      37<Return>       - go to position 37 (any positive integer)
      <                - go back  50 characters
      >                - go ahead 50 characters
      <Ctrl>R          - redraw the screen
      <Ctrl>D          - enter command mode

COMMAND MODE

[ Previous | Top | Next ]

Use <Ctrl>D in Screen Mode to enter Command Mode. The cursor moves down to the lower-left corner of the screen next to a colon prompt after which you can enter any of the commands shown below followed by <Return>.

Editing SeqEd Commands

SeqEd command editing is modeled on OpenVMS DCL command-line editing. The <Left-arrow> and <Right-arrow> keys let you move your cursor around in a command that you have typed so you can insert or delete characters at any position. <Ctrl>E moves the cursor to the end of the line. <Ctrl>U deletes all the characters from the current cursor position to the start of the line.

Editing Previous SeqEd Commands

SeqEd lets you modify and execute previous commands. The <Up-arrow> key displays previous commands.

Returning to Screen Mode

If you press <Return> without entering a command, SeqEd returns to Screen Mode (described above). If you have -SINGlecommand on the command line or in your command-line initializing file, SeqEd returns to Screen Mode immediately after executing each command.

Commands May Be Shortened

Only the capitalized portion of the commands described in the documentation below needs to be typed.

Parameters are Used with Commands

Some commands can be preceded with numeric parameters or succeeded with a file name. The square brackets ([ and ]) in the documentation below show command parameters that are optional, meaning you can leave them out.

Command Mode Summary

Here is the summary of Command Mode commands you would see with the Help command:


                       Command Mode

 Commands end with <Return>. [n] indicates an optional parameter.
 s and f are numbers for start and finish of a range of interest
 Only the capitalized part of the command is necessary.

      EDit seqname       - get a new sequence file to edit
[n]   Include [seqname]  - insert another sequence [at position n]
                           (SeqEd prompts for range and strand)
s,f   Delete             - delete a range of bases
[s]   Check [/Blind]     - check a range of bases [beginning at s]
      37                 - go to base 37
      REDraw             - redraw the screen
[n]   COmment comment    - insert a comment [at position n]
[n]   COmment            - enter comment editing mode [at position n]
[n]   HEAding            - edit documentary heading [at line n]
      change             - enter screen mode (<Return> is sufficient)
      screen             - enter screen mode (<Return> is sufficient)
      OVERstrike         - enter overstrike mode
      INSert             - enter insert mode
[n]   Mark markcharacter - mark the sequence [at position n]
      PERFect            - require finds to be perfect matches
      PROtein            - set sequence type to PROTEIN
      NUCleotide         - set sequence type to NUCLEOTIDE
[s,f] Write [seqname]    - write [a part of] the sequence to a file
      DIGitizer          - enter digitizer mode
      RELoad             - enter reload mode
      ACCept             - terminate reload mode
      Help               - show commands in screen and command modes
[s,f] EXit  [seqname]    - write [a part of] the sequence and quit
      Quit               - quit the editor without writing the sequence

EDit SeqName

gets a new sequence from the file you have named for editing with SeqEd. The sequence you are currently editing is lost if you have not written it out before using the EDit command.

[s] Include [filename]

includes another sequence within the sequence being edited at the current cursor position or at the position specified by the optional parameter. SeqEd creates two embedded comments at the start and end of the included section to show what was included. If you do not supply a file name with this command, SeqEd prompts you for one.

s,f Delete

deletes some or all of the sequence. You must specify a beginning and ending coordinate for the range of symbols you want to delete.

[s] CHeck [/Blind]

lets you check a sequence entry in Screen Mode. A sequence already entered may be typed in again. If a symbol is typed that disagrees with the first entry, a ^ is printed at the point of disagreement and the terminal bell rings. While checking, the <Up-arrow> and <Down-arrow> keys move the cursor back and forth between the second entry and the original sequence, allowing you to make changes in either one as mistakes are found. If the optional starting coordinate precedes the command, it specifies where checking begins. If you wish to check your sequence without seeing the original version, type /Blind following the CHeck command (there must be a blank between the CHeck command and the /).

REDraw

redraws your terminal screen. This is useful if noise in the line between your terminal and the computer has changed the screen in some unreasonable way or if a system message appears on your screen.

[s] COmment [comment text]

allows you to enter, delete or modify embedded comments to document your sequence. In its simplest use, the COmment command lets you insert new comments. You simply type the entire comment on the command line. Deletion and modification of existing comments is handled by entering Comment Mode. To do this, you type only the COmment command and optional position but no comment text. See the COMMENT MODE topic for more information.

Whenever you enter a comment, SeqEd ensures that comment-delimiting characters are placed around it. A $, <, or > must appear at each end of, and not within, your comment. (SeqEd deletes comment delimiting characters found within a comment when they are the same as the flanking comment delimiting characters.)

SeqEd inserts new comments at your current cursor position or at the position specified by the optional position number and then returns to Command Mode.

[s] HEAding

enters Heading Mode, which lets you edit the documentary heading. You can modify any part of the heading. Heading Mode is terminated with <Ctrl>D. The optional parameter specifies which line of the heading you want to start editing.

change

returns your session to Screen Mode. Note that the entire command is optional and a simple <Return> is equivalent.

screen

returns your session to Screen Mode. Note that the entire command is optional and a simple <Return> is equivalent.

OVERstrike

enters overstrike mode. Typing in a new symbol deletes the old symbol at that position and replaces it with the new symbol.

INSert

enters insert mode. Typing in a new symbol shifts all symbols from the current position to the end of the sequence by one position to the right and adds a new symbol at the current position.

[n] Mark markcharacter

You can mark a position in the sequence if you wish to return to it later. If the optional position number is absent, the position marked is the current cursor position. You give the marked position a letter (like giving it a name) using this command. Then, in Screen Mode, a single quote followed by the letter used to mark the sequence moves the cursor to the position where that mark was defined.

PERFect

makes searches linear and disables the nucleic acid ambiguity meanings of the GCG sequence symbols (see Appendix III).

PROtein

sets the sequence type to protein. This makes searches linear and disables the nucleic acid ambiguity meanings of the GCG sequence symbols. This also makes SeqEd ignore any set.keys file in your local directory.

NUCleotide

sets the sequence type to nucleotide. This makes searches circular and tells SeqEd to recognize patterns containing IUB nucleotide ambiguity symbols. SeqEd also remaps the keys if a set.keys file is in your local directory.

[s,f] Write [filename]

writes the current form of the sequence into a file. If you supply starting and finishing coordinates, SeqEd only writes the indicated segment. For example, 1,56 Write would write symbols 1 to 56 into a file. If you name a file, SeqEd writes the sequence into a file with that name instead of into the input file.

DIGitizer [G,A,C,T] [*]

enters Digitizer Mode. The lane order from left to right may be redefined. See below for a detailed discussion of digitized input.

The DIGitizer command has two options.

You can specify the left-to-right order of your lanes. The default order is alphabetic (A,C,G,T). You can change the default order with the command-line parameter -LANes=T,A,G,C or with the SeqEd command DIGitizer G,T,C,A.

Maxam and Gilbert sequencers find it useful to use -LANes=G,AG,TC,C. This order is equivalent to G,A,T,C because SeqEd uses the first of multiple letters when assigning a base to a lane. However, specifying pairs of letters causes the screen display of your gel to look more like the autoradiograph.

The second parameter with the DIGitizer command is a simple asterisk (*). This suppresses prompting for lane and menu positions. If you have been digitizing some lanes and return to the keyboard for some reason, you can go back to digitizing the same lanes without having to redefine their positions.

RELoad

goes into Reload Mode, which is similar to Checking Mode, except that the reloaded sequence grows leftwards from the right end of the main sequence. This is designed to help find the overlap of two loadings of the same reaction. Mismatched bases are marked with ^ (caret) characters, as in Checking Mode. Also, you can use the arrow keys to move around in and edit either the main sequence or the reloaded sequence. When the match becomes especially good, the terminal bell rings. You are free to accept or reject SeqEd's rules of what constitutes a good overlap. See the COMMAND LINE SUMMARY topic below for more information.

ACCept

terminates Reload Mode. The display of the reloaded sequence goes away, leaving you with only the main sequence with the cursor at the end, ready for more input. SeqEd helps you to decide when to ACCept an overlap, but the decision is yours.

Help

shows the commands available in the Screen and Command Modes of SeqEd.

[s,f] EXit [filename]

works exactly like Write except that your session with SeqEd ends after the sequence is written out into a new sequence file.

Quit

terminates a session with SeqEd without writing a new sequence file.

DIGITIZER MODE

[ Previous | Top | Next ]

This part explains how to use the the Graf/Bar GP-7 Sonic Digitizer with SeqEd for sequence entry.

Preparation

We use the Graf/Bar GP-7 Sonic Digitizer with stylus S-7 formerly manufactured by the Science Accessories Corporation which was purchased by GTCO, 7125 Riverwood Dr., Columbia, MD 21046, Phone: (410) 381-6688, Toll-free: (800) 344-4723, URL: http://www.gtco.com. Here is how to set up the GP-7 to work with SeqEd:

Baud Rate

Remove the cover, find the DIP switches, and set the baud rate and parity according to the instructions in the Graf/Bar Operator's Manual. The parity and the baud rate must match your modem or terminal settings.

Special Cable

We use a special cable when connecting the Graf/Bar to the system, whether it is connected directly or through a data splitter. Build a cable between two 25-pin RS-232 connectors as follows:

Graf/Bar               Computer
   pin                   pin
    1                     1
    7                     7
    3                     2

Connecting the Digitizer to Your Terminal

We connect our Graf/Bar through a modem data splitter to the same line that we use for the terminal. The modem data splitter we use is model 232MDS from B & B Electronics, 1500 Boyce Memorial Drive, Ottawa, Illinois, 61350, USA, Phone: (815) 434-0846

Connecting the Digitizer to Another UNIX Port

The program reads input from the digitizer through a line whose logical name is DigitizerPort. This is assigned to TERM unless you (or your system manager) assign it somewhere else. If you have a port other than your terminal available, put a command like:

% name -s DigitizerPort /dev/tty11

in your login file. (Such commands are usually made at login time so you don't have to remember them every time you want to use the digitizer.)

Layout

See the figure at the back of this document to get an overview of how to lay out the digitizer.

The Graf/Bar digitizes an active area about 24 inches wide and 18 inches deep, starting 2 inches from the front of the unit. The region of interest in your radiograph must lie entirely within this active area, with the gel lanes perpendicular to the face of the digitizer. The menu must also fit in this area, with its long axis perpendicular to the digitizer. Tape or clamp the radiograph and the menu tightly so that their positions with respect to the digitizer do not change.

Menu

The digitizer controls SeqEd through a printed menu, which you place in your work area next to the radiograph. Cut out one of menus in the figures below or Fetch the file seqedmenu.fig and use the Figure program to draw more copies. The menu may be drawn to whatever absolute size you like, as long as it lies within the active area of the digitizer.

Entering Digitizer Mode

The DIGitizer command switches SeqEd into Digitizer Mode. All input comes from the digitizer stylus in this mode. After you issue this command, SeqEd asks you to define your gel lanes (described below) and to locate opposite corners of the menu. The KEYBOARD key on the menu turns off Digitizer Mode and returns control to the keyboard. A banner across the top of your screen reminds you whether the keyboard or the digitizer is in control. In Digitizer Mode, keyboard input is ignored and in Keyboard Mode, digitizer input is ignored.

Defining Gel Lanes

The basic idea is this: click (with the stylus) in the middle of each lane where you begin (or resume) reading the gel. See the figure below. All of the points should be in the center of the lane and the stylus must always be held at the same angle and orientation. If the program is uncertain of which base to assign a digitized position, it beeps twice and asks you to redefine your lanes. The program uses the command-line value of tolerance as the basis for certainty. A tolerance of 0 is the least tolerant setting and the slightest deviation would require you to redefine your lanes. A tolerance of 1.0 is the most tolerant setting, such that any deviation is accepted. Based on our limited experience, you should not use a tolerance value less than 0.25 or greater than 0.6. The default value (0.4) was chosen because it has seldom made an incorrect assignment and does not require you to redefine the lanes too frequently. The algorithm employed is that of Staden (Nucl. Acids Res., 14, 217 (1986)).

Remember, when answering the prompts for lane position, digitize the centers of the lanes near the first bands you intend to read.

Defining Menu Position

After clicking on the fourth (right-most) lane or your radiograph, you are prompted to click on two opposite corners of the menu.

Using Digitizer Mode

The digitizer replaces the keyboard in Digitizer Mode. You now control all of SeqEd's functions by clicking on the "keys" in the menu with the digitizer stylus. When you click on the KEYBOARD key, control is returned to the keyboard.

Entering a Sequence

Enter bases into the sequence by clicking the bands on the radiograph. SeqEd behaves exactly as though you were typing uppercase letters. You can also click the menu keys labeled A, C, G and T for exactly the same effect. The menu also includes ambiguity codes R, X, and Y, which are entered into your sequence in lowercase. The question mark is like a shift key -- the next base you enter from the gel lanes after clicking the question mark is entered in lowercase.

Deleting a Sequence

Remove bases from the sequence by clicking the DELETE key on the menu. This behaves just like the <Delete> key, removing the base to the left of the cursor.

Moving the Cursor

Move around in your sequence by clicking on the left and right arrow keys. You can click on numbers first in order to move more than one base at a time. You can click on the numbers and then click the RETURN key to move the cursor to any position. The BEGIN and END keys jump to the ends of the sequence. The FIND key lets you move to the beginning of a pattern. You can enter the pattern by clicking the letter keys on the menu ending with a click on the RETURN key.

Loading Second Fragment

RELOAD and ACCEPT function together to help you find the overlap of two loadings of the same sequence. RELOAD prompts you to define new lane positions. (It is not necessary to redefine the menu position.) See the documentation on RELoad and ACCept under Command Mode above.

Checking Entered Sequence

CHECK puts you in Checking Mode so you can re-enter the data and increase your confidence. Use the up and down arrows to move between the main sequence and the check sequence.

Writing Sequences Into Files

WRITE FILE writes your sequence into a file without leaving Digitizer Mode.

Redraw Screen

FRESH SCREEN refreshes your terminal screen if line noise or broadcast messages have interrupted it.

New Lanes

SeqEd should handle slight bends and bulges in your lanes. However, if your bands are not being recorded correctly, it may be necessary to delete the incorrect bases and redefine your lane positions starting with a click on the NEW LANES key.

You can also use this key when you have finished one set of lanes and wish to start another without returning to Keyboard Mode.

Return to Keyboard Control

The KEYBOARD key returns control to the terminal keyboard. You cannot use the digitizer again until you use SeqEd's DIGitizer command.

Help

The HELP key displays the Digitizer Mode help message (below) on your terminal screen. The next click you make, whatever it is, redraws your screen the way it was before the help screen was displayed.


                     Digitizer Mode

  37<Return>         - go to position 37 (any positive integer)
  <Up-arrow>         - go up to check sequence (Checking Mode)
  <Down-arrow>       - go down to original sequence (Checking Mode)
  [n]<Right-arrow>   - go ahead [n] characters
  [n]<Left-arrow>    - go back  [n] characters
  FIND TAACG<Return> - find next occurrence of TAACG
  Begin              - go to beginning of sequence
  End                - go to end of sequence
  A C G T R X Y      - insert A C G ... into sequence (at cursor)
  ?                  - enter next sequence character in lowercase
  <Delete>           - delete a sequence character
  New Lanes          - define new lanes (another loading) for this
                         fragment
  Reload             - define new lanes and slide new sequence
                         back over old until overlap is "Accepted"
  Accept             - accept overlap from "Reload"
  Check              - enter Checking Mode to re-enter and verify
                         sequence
  Write File         - write out fragment into a file
  Fresh Screen       - redraw the terminal screen
  Keyboard           - return control to terminal keyboard

  Click the digitizer anywhere to go back to SEQED:

COMMENT MODE

[ Previous | Top | Next ]

Comment Mode allows you to add, change, or delete embedded comments and helps you move quickly to any position in your sequence where a comment is associated. To enter Comment Mode, you must first enter Command Mode with <Ctrl>D.

Entering New Comments

If you type the COmment command without any comment text, SeqEd creates a new, empty comment at the position indicated by the optional sequence position number, if present, or at your current position in the sequence. The cursor moves to the part of the screen where embedded comments are displayed. Initially, the cursor is adjacent to a position number followed by an empty comment. You may then type a new comment or move to an existing comment that you wish to modify. Only one new comment can be created each time you enter Comment Mode.

Cursor Movement in Comment Mode

While in Comment Mode you can move around in the comment using the <Left-arrow> and <Right-arrow> keys, insert text by typing, or delete text using the <Delete> key or <Ctrl>H. <Ctrl>E positions the cursor at the end of the comment. <Ctrl>U deletes all characters from the beginning of the comment to the cursor position. You can move from one comment to another using the <Up-arrow> or <Down-arrow> keys.

Deleting Comments

When you move the cursor off of a comment that is empty, the comment is deleted. You can delete a comment by entering Comment Mode, moving to the end of the comment you wish to delete, and using <Ctrl>U. When you move to another comment or leave Comment Mode, the comment disappears. Likewise, the empty comment created when you enter Comment Mode is deleted if you don't type anything at the new comment position.

Comment Delimiters

Comments must start and end with one of the characters <, >, or $. A comment must start and end with the same delimiting character. If you try to move your cursor off of a comment that does not have one of these characters at the ends, or if the delimiters aren't identical, then SeqEd makes sure the delimiters are corrected.

Changing Sequence Position With Comment Mode

As you move to different comments, your position in the sequence in Screen Mode changes to the symbol with which that comment is associated. This allows you to move quickly to any symbol with which a comment is associated when you leave Comment Mode. By marking your place with a comment at the end of one session with SeqEd, you can easily restore your place at the next session.

Leaving Comment Mode

To exit Comment Mode, press <Return> or use <Ctrl>D.

Comments Are Associated With Sequence Symbols

Comments may be associated with any base. They stay with that base, even though the base's position may change, unless the base is deleted (see below). They can also be associated with either end of the sequence. For example, you may issue the command, 0 CO to create a comment associated with the left end of the sequence. This comment must be delimited with < (SeqEd makes sure of this). Similarly, a comment can be created at the extreme right of the sequence and must be delimited with > or $.

Comments Can Be Used in Pairs to Bracket Sections of the Sequence.

Comments can document a whole fragment as well as an individual sequence symbol. For example, the Include command automatically puts an identifying comment at each end of the included fragment. The characters < and > were selected as comment delimiters because they imply direction; the comments bracketing the included fragment point at the fragment. A >-comment is associated with the first base of the fragment and a <-comment with the last. When the sequence is saved in a file, all >- and $-comments are written before the base they are associated with and all the <-comments after. This way the bracketing comments surround the entire fragment and point to it.

Between two bases in a sequence file there may be several comments. The <-comments are always associated with the base to the left, the >- and $-comments with the base to the right.

Deleting Comments

The only way to delete a comment is to go into Comment Mode and delete all the characters of the comment. When you move your cursor away from the empty comment, it goes away.

Deleting Bases Associated With Comments

If you delete a base with which a comment is associated, the comments do not go away. They just attach themselves to adjacent bases. To preserve the properties of fragment bracketing comments, the <-comments become associated with the left-hand base, the >- and $-comments with the right-hand base.

HEADING MODE

[ Previous | Top | Next ]

Heading Mode allows you to edit the documentation that appears above the sequence. When a new sequence is edited, SeqEd goes directly into Heading Mode to let you identify the new sequence.

Entering Heading Mode

SeqEd lets you enter Heading Mode by using the HEAding command.

Leaving Heading Mode

Use <Ctrl>D to return to Command Mode.

Moving the Cursor

You can move around using the arrow keys and make insertions and deletions as you wish. Although the editing window is only four lines high, it scrolls over the heading vertically to let you see and modify any part. <Ctrl>E positions the cursor at the end of the current line.

Editing in Heading Mode

As with many text editors, typing inserts text at the cursor and the <Delete> key or <Ctrl>H delete characters to the left of the cursor. <Ctrl>U deletes everything from the current cursor position to the start of the line; <Return> creates a new line starting at the current position in the heading.

SYSTEM CRASH OR HANGUP

[ Previous | Top | Next ]

While you are editing a sequence, SeqEd records your session in a file called seqed.log. This file is automatically deleted when the editor exits normally. If you are accidentally disconnected or the system crashes, your work can be recovered by logging back in, moving to the directory where the crash occurred, and running SeqEd again. SeqEd finds seqed.log and restores the sequence to the state it was in just before you were cut off.

If you do not want SeqEd to restore the session, delete the file seqed.log.

RESTRICTIONS

[ Previous | Top | Next ]

The total length of all vector sequences specified with the -VECtors command-line parameter may not exceed 100,000 bases. If the total vector sequence length exceeds 100,000 bases, SeqEd notifies you that only the first 100,000 vector bases will be checked.

SeqEd only works on terminals that can provide screen support. Your system manager may be able to help if your terminal is not behaving correctly.

Many terminal conditions must be set if you are using the Graf/Bar sonic digitizer; see the paragraph on Preparation under the DIGITIZER MODE topic.

ACKNOWLEDGEMENTS

[ Previous | Top | Next ]

SeqEd was originally designed by Paul Haeberli and implemented for VAX/VMS by Paul Haeberli and John Devereux. It was completely revised for GCG Version 4 by William Winsborough. The digitizer interface and the RELoad command were implemented for Version 5 by Philip Delaquess. We are very grateful for the collaboration of Dr. William Boorstein.

SEQUENCE TYPE

[ Previous | Top | Next ]

When it opens a new sequence file, SeqEd initially assumes it is nucleic acid. When you write the file, SeqEd examines the sequence to see if it contains only IUB-IUPAC nucleotide symbols in the first 300 symbols. If so, it writes the new sequence as a nucleic acid sequence; if not, it writes it as a peptide sequence.

When it opens a pre-existing GCG sequence file, SeqEd obtains the sequence type (nucleotide or protein) from the Type: field of the dividing line (the line that contains two successive periods). If the Type: field is absent, as in the case of sequence files created prior to Version 7 of the Wisconsin Package, SeqEd infers the type of the sequence from the composition of the sequence characters. When SeqEd writes the edited file, it writes the Type: field according to its current understanding of the sequence type.

It is possible for SeqEd to make a mistake. If the Type: field of an existing file is incorrect, SeqEd will accept the incorrect type; it doesn't check the composition in this case. For files without a Type: field, it is possible for SeqEd to infer the wrong sequence type. For example, a peptide sequence that contains only those amino acids that share IUB-IUPAC symbols with nucleotides will be incorrectly typed as nucleic acid (see Appendix III).

You can override SeqEd's assignment of sequence type in two ways. When you run SeqEd, you can add -PROtein or -NUCleotide to the command line to tell SeqEd which type of sequence it will be editing. Once SeqEd is running, you can use the Command Mode commands PROtein and NUCleotide to force the assignment of sequence type.

COMMAND-LINE SUMMARY

[ Previous | Top | Next ]

All parameters for this program may be added to the command line. Use -CHEck to view the summary below and to specify parameters before the program executes. In the summary below, the capitalized letters in the parameter names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.


Minimal Syntax: % seqed [-INfile1=]sample.seq

Prompted Parameters: None

Local Data Files:

set.keys (must be in your current working directory to be used)

Optional Parameters:

-SINGlecommand        automatically returns to screen mode after commands
-PROtein              sets sequence type to protein, and sets find to
                        search for perfect symbol matches
-NUCleotide           sets sequence type to nucleotide, and sets find to
                        allow nucleotide ambiguity symbol matches
-PERFect              sets find to search for perfect symbol matches,
                        even if sequence type is nucleotide
-VECtors=GB:SynpBR322 highlights sequences from pBR322
-SITes=GAATTC         highlight GAATTC patterns
-LANes=A,C,G,T        sets the default lane order for digitizer
-MINOverlap=10        sets minimum overlap length for Reload command
-PCTOverlap=95        sets stringency for the Reload command
-TOLerance=0.4        sets tolerance for digitizing ambiguity (0 to 1),
                        with 1 being the most tolerant

LOCAL DATA FILES

[ Previous | Top | Next ]

The files described below supply auxiliary data to this program. The program automatically reads them from a public data directory unless you either 1) have a data file with exactly the same name in your current working directory; or 2) name a file on the command line with an expression like -DATa1=myfile.dat. For more information see Chapter 4, Using Data Files in the User's Guide.

Customizing Your Keyboard With SetKeys

You can use the program SetKeys to create a set.keys file that tells the SeqEd, GelEnter, LineUp,GelAssemble, and SeqLab sequence editors how to interpret the letters you type at the terminal. When entering gel readings, it is useful to have the symbols for G, A, T, and C under the fingers of one hand in the same positions as the lanes in your gel. SeqEd, GelEnter, LineUp, GelAssemble, and the SeqLab sequence editor automatically read the file set.keys if it is present in your local directory. If set.keys is absent, or if the sequence type is set to Protein (in SeqEd and LineUp only) the terminal keys retain their conventional meanings.

If you have a set.keys file in your directory, SeqEd, GelEnter, LineUp, and GelAssemble only respond to the keys that it redefines. You can edit the file set.keys with a text editor if some of the keys you want to use are not in it. Any keys not mentioned in set.keys appear to be dead in these sequence editors. In the SeqLab sequence editor, keys that are not redefined retain their normal meanings.

Several keys are vital for the control of SeqEd, LineUp, GelEnter, and GelAssemble; this means you are not allowed to redefine the keys for /, [, ], {, }, (, ), :, ,, 1, 2, 3, 4, 5, 6, 7, 8, 9, 0, <Ctrl>R, <Ctrl>D, <Ctrl>H, <Return>, and <Ctrl>E.

PARAMETER REFERENCE

[ Previous | Top | Next ]

You can set the parameters listed below from the command line. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.

-SINGlecommand

sets SeqEd to return automatically to Screen Mode after every command in Command Mode.

-PROtein

sets the sequence type to be protein, and makes pattern searches use perfect symbol matches. SeqEd treats protein sequences as linear and will not find patterns that start at the end and continue into the beginning of the sequence. Furthermore, -PROtein causes SeqEd to ignore any set.keys file in your local directory.

-NUCleotide

sets the sequence type to be nucleotide, and makes pattern searches use nucleotide ambiguity symbol matches (unless you force the program to use perfect symbol matches by including -PERFect on the command line or by entering the PERFect command in Command Mode.) SeqEd treats nucleotide sequences as circular and will find patterns that start at the end and continue into the beginning of the sequence. Furthermore, -NUCleotide causes SeqEd to use a set.keys file in your local directory.

-PERFect

makes pattern searches use perfect symbol matches. Normally if you type /GARC in Screen Mode, the patterns GAAC or GAGC could be found. If you have -PERFect on the command line, /GARC would only find the pattern GARC. This also makes SeqEd treat sequences as linear and not find patterns that start at the end and continue into the beginning of the sequence.

-VECtors=GB:synpbr322,GB:m13mp18

tells SeqEd which cloning vector or vectors are of interest to you. SeqEd checks your sequence against them to make sure you are not entering a vector sequence. If it finds that you are entering vector sequence, the terminal bell rings and the vector sequence characters are highlighted with reverse video.

-SITes=GAATTC,GAnTC

tells SeqEd to highlight enzyme recognition sites that interest you.

-LANes=A,C,G,T

establishes the default left-to-right order of gel lanes. The default may be over-ridden when you issue a DIGitizer command in Command Mode.

-MINOverlap=10

sets the minimum overlap length regarded as meaningful by the RELoad command. SeqEd ignores matches shorter than this, even if they are perfect. However, you are always free to end a reload with the ACCept command.

-PCTOverlap=95

sets the minimum percentage of matching bases regarded as meaningful by the RELoad command. In Reload Mode, when the overlap is long enough and good enough, the terminal bell rings to alert you. Again, you have complete freedom to reject or ACCept SeqEd's opinion.

-TOLerance=0.4

sets the tolerance for digitizing. When digitizing, the program must determine which base lane the sonic pen has touched. Since the gel lane may bend, the program must have some tolerance for deviation. The tolerance value determines how great this deviation can be before you must redefine your lanes. A tolerance of 0 is the least tolerant setting and the slightest deviation would require you to redefine your lanes. A tolerance of 1.0 is the most tolerant setting such that any deviation is accepted. Based on our limited experience, you should not use a tolerance value less than 0.25 or greater than 0.6. The default value (0.4) was chosen because it has seldom made an incorrect assignment and does not require you to redefine the lanes too frequently. The algorithm employed is that of Staden (Nucl. Acids Res., 14; 217 (1986)).

Printed: December 9, 1998 16:24 (1162)



[ Program Manual | User's Guide | Data Files | Databases ]


Documentation Comments: doc-comments@gcg.com
Technical Support: help@gcg.com

Copyright (c) 1982, 1983, 1985, 1986, 1987, 1989, 1991, 1994, 1995, 1996, 1997, 1998 Genetics Computer Group Inc., a wholly owned subsidiary of Oxford Molecular Group, Inc. All rights reserved.

Licenses and Trademarks Wisconsin Package is a trademark of Genetics Computer Group, Inc. GCG and the GCG logo are registered trademarks of Genetics Computer Group, Inc.

All other product names mentioned in this documentation may be trademarks, and if so, are trademarks or registered trademarks of their respective holders and are used in this documentation for identification purposes only.

Genetics Computer Group

www.gcg.com