REVERSE

[ Program Manual | User's Guide | Data Files | Databases ]

Table of Contents
FUNCTION
DESCRIPTION
EXAMPLE
OUTPUT
INPUT FILES
RESTRICTIONS
INPUT FILE
COMMAND-LINE SUMMARY
LOCAL DATA FILES
PARAMETER REFERENCE

FUNCTION

[ Top | Next ]

Reverse reverses and/or complements a sequence.

DESCRIPTION

[ Previous | Top | Next ]

Reverse reverses and/or complements the symbols in a sequence. The complements of all of the supported IUPAC-IUB nucleic acid symbols are listed in Appendix III. The output is written into a new sequence file.

EXAMPLE

[ Previous | Top | Next ]

The file test.seq contains all of the legitimate GCG sequence characters. Here is a session using Reverse to show how they would look on the opposite strand:


% reverse

  REVERSE of what sequence ?  test.seq

                    Begin (* 1 *) ?
                  End (*   389 *) ?

  Do you want to:

       1) reverse only
       2) complement only
       3) reverse and complement

  Please choose one (* 3 *):

  What should I call the output file (* test.rev *)

%

OUTPUT

[ Previous | Top | Next ]

Here is the output file test.rev:


!!NA_SEQUENCE 1.0
  REVERSE-COMPLEMENT of: test.seq  check: 3365  from: 1  to: 389

This sequence contains every symbol in the alphabet of
legitimate GCG sequence characters (Appendix III).

test.rev  Length: 389  October 21, 1996 10:15  Type: N  Check: 8096  ..

       1  .~....rxwb aasy...nk. m..dc..hgv t...~..RXW BAASY...NK

      51  .M..DC..HG VTYYATYAYT ATCACTATTA YTSCTGTTGC TCTTCRTAGT

     101  AATAXXXCCA XACCACTACG ACAACXGTCG TTGTGGTAGT XSWRCTXGAG

     151  CTACTCGATG AGGAAGAXCK YCKCCKTCKY CTXCGCCTTC TCCGCCGTCG

     201  GCGACGYTGC TGTTGXGGCG GTGGGGGAGG RTTGTTATTC ATXARYARCA

     251  RTARXAGYAA CAGTAGGAGA AGTAACAAYT TCTTTTTDAT TATGATAATR

     301  TGGTGATGXC CCCCTCCGCC ACCRAAGAAA AAYTCCTCTT CRTCGTCATC

     351  RCAGCAACAR TYGTTATTGT CATCXGCCGC TGCGGCAGC

INPUT FILES

[ Previous | Top | Next ]

Reverse accepts a single sequence as input. The function of Reverse depends on whether your input sequence(s) are protein or nucleotide. Programs determine the type of a sequence by the presence of either Type: N or Type: P on the last line of the text heading just above the sequence. If your sequence(s) are not the correct type, see Appendix VI for information on how to change or set the type of a sequence.

RESTRICTIONS

[ Previous | Top | Next ]

A reversed sequence is renumbered so that the first base corresponds to the last base of the range you chose. It only makes sense to complement nucleotide sequences! If you don't reverse and complement a sequence, you are in danger of having the sequence in 3' to 5' orientation. All GCG programs and all databases assume that nucleotide sequences are in 5' to 3' order, so be careful. Peptide sequences are generally kept in amino-to-carboxyl orientation. Many legitimate sequence symbols are not IUPAC-IUB-supported nucleic acid symbols, so they have no sensible complement (see Appendix III).

Embedded comments are lost.

INPUT FILE

[ Previous | Top | Next ]

Here is the input file test.seq used in the example above:


!!NA_SEQUENCE 1.0
This sequence contains every symbol in the alphabet of
legitimate GCG sequence characters (Appendix III).

Test.Seq  Length: 389  September 30, 1998 10:49  Type: N  Check: 3365  ..

       1
          >starts with the codons from appendix iii>
          GCTGCCGCAG CGGCXGATGA CAATAACRAY TGTTGCTGYG ATGACGAYGA

      51  AGAGGARTTT TTCTTYGGTG GCGGAGGGGG XCATCACCAY ATTATCATAA

     101  THAAAAAGAA RTTGTTACTT CTCCTACTGT TRCTXYTAYT GYTRYTXATG

     151  AATAACAAYC CTCCCCCACC GCCXCAACAG CARCGTCGCC GACGGCGGAG

     201  AAGGCGXAGR MGAMGGMGRM GXTCTTCCTC ATCGAGTAGC TCXAGYWSXA

     251  CTACCACAAC GACXGTTGTC GTAGTGGTXT GGXXXTATTA CTAYGAAGAG

     301  CAACAGSART AATAGTGATA RTRATRR
                                       >continues with all
          uppercase sequence characters>
                                       ABC DEFGHIJKLM NOPQRSTUVW

     351  XYZ.~@&*ab cdefghijkl mnopqrstuv wxyz*@&~.

                                                    <ends with
          all lowercase sequence characters<

COMMAND-LINE SUMMARY

[ Previous | Top | Next ]

All parameters for this program may be added to the command line. Use -CHEck to view the summary below and to specify parameters before the program executes. In the summary below, the capitalized letters in the parameter names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.


Minimal Syntax: % reverse [-INfile=]ggamma.seq

Prompted Parameters:

-BEGin=1 -END=1700      sets the range of interest
-NOREVerse              doesn't reverse the strand
-NOCOMPlement           doesn't complement the strand
[-OUTfile=]ggamma.rev   names the output file

Local Data Files: None

Optional Parameters: None

LOCAL DATA FILES

[ Previous | Top | Next ]

None.

PARAMETER REFERENCE

[ Previous | Top | Next ]

You can set the parameters listed below from the command line. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.

-NOCOMPlement

doesn't complement the strand.

Printed: December 9, 1998 16:29 (1162)

[ Program Manual | User's Guide | Data Files | Databases ]


Documentation Comments: doc-comments@gcg.com
Technical Support: help@gcg.com

Copyright (c) 1982, 1983, 1985, 1986, 1987, 1989, 1991, 1994, 1995, 1996, 1997, 1998 Genetics Computer Group Inc., a wholly owned subsidiary of Oxford Molecular Group, Inc. All rights reserved.

Licenses and Trademarks Wisconsin Package is a trademark of Genetics Computer Group, Inc. GCG and the GCG logo are registered trademarks of Genetics Computer Group, Inc.

All other product names mentioned in this documentation may be trademarks, and if so, are trademarks or registered trademarks of their respective holders and are used in this documentation for identification purposes only.

Genetics Computer Group

www.gcg.com