TOPIR

[ Program Manual | User's Guide | Data Files | Databases ]

Table of Contents
FUNCTION
DESCRIPTION
EXAMPLE
OUTPUT
INPUT FILES
RELATED PROGRAMS
COMMAND-LINE SUMMARY
LOCAL DATA FILES
PARAMETER REFERENCE

FUNCTION

[ Top | Next ]

ToPIR writes GCG sequence(s) into a single file in PIR format.

DESCRIPTION

[ Previous | Top | Next ]

Any sequence file in GCG format can be converted with ToPIR into a format suitable for use with PIR programs. ToPIR accepts one or more GCG sequence files as input and creates one output file, containing all the sequences converted to PIR format.

EXAMPLE

[ Previous | Top | Next ]

In this session with ToPIR, all of the individual files generated in the sample run of FromPIR are put back into PIR format.


% topir

 TOPIR of what GCG sequence(s) ?  *.gcg

 What should I call the output file (* fwsyg3.pir *) ?  sample.pir

 FWSYG3 516 characters.
 JDVLS 881 characters.
 KGRT 179 characters.

%

OUTPUT

[ Previous | Top | Next ]

Here is some of the output file:


>P1;FWSYG3
fwsyg3.gcg;2 => FWSYG3
          MGKPFFTLSL SSLCLLLLSS ACFAITSSKF NECQLNNLNA LEPDHRVESE
          GGLIETWNSQ HPELQCAGVT VSKRTLNRNG SHLPSYLPYP QMIIVVQGKG
          AIGFAFPGCP ETFEKPQQQS SRRGSRSQQQ LQDSHQKIRH FNEGDVLVIP
          LGVPYWTYNT GDEPVVAISP LDTSNFNNQL DQNPRVFYLA GNPDIEHPET
          MQQQQQQKSH GGRKQGQHRQ QEEEGGSVLS GFSKHFLAQS FNTNEDTAEK
          LRSPDDERKQ IVTVEGGLSV ISPKWQEQED EDEDEDEEYG RTPSYPPRRP
          SHGKHEDDED EDEEEDQPRP DHPPQRPSRP EQQEPRGRGC QTRNGVEENI
          CTMKLHENIA RPSRADFYNP KAGRISTLNS LTLPALRQFG LSAQYVVLYR
          NGIYSPDWNL NANSVTMTRG KGRVRVVNCQ GNAVFDGELR RGQLLVVPQN
          PAVAEQGGEQ GLEYVVFKTH HNAVSSYIKD VFRVIPSEVL SNSYNLGQSQ
          VRQLKYQGNS GPLVNP*
C;P1;FWSYG3 - Glycinin, A3B4 chain precursor - Soybean
C;N;Alternate names: 11S globulin
C;Species: Glycine max (soybean)
C;Accession: A22615
C;R;Fukazawa, C., Momma, T., Hirano, H., Harada, K., and Udaka, K.
C;J. Biol. Chem. 260, 6234-6239, 1985 (cv. Bonminori, sequence . . .

////////////////////////////////////////////////////////////////////

INPUT FILES

[ Previous | Top | Next ]

ToPIR accepts multiple (one or more) nucleotide or protein sequences as input. You can specify multiple sequences in a number of ways: by using a list file, for example @project.list; by using an MSF or RSF file, for example project.msf{*}; or by using a sequence specification with an asterisk (*) wildcard, for example GenEMBL:*. The input files in this example are the output files from the example for FromPIR. To create the input files for this example, run FromPIR on qqqss.psq.

RELATED PROGRAMS

[ Previous | Top | Next ]

The following programs convert sequences between other formats and GCG format: FromEMBL, FromGenBank, FromIG, FromPIR, FromStaden, FromFastA, ToIG, ToPIR, ToStaden and ToFastA.

DataSet creates a GCG data library from any set of sequences in GCG format. GCGToBLAST creates a database that can be searched by the BLAST program from any set of sequences in GCG format.

COMMAND-LINE SUMMARY

[ Previous | Top | Next ]

All parameters for this program may be added to the command line. Use -CHEck to view the summary below and to specify parameters before the program executes. In the summary below, the capitalized letters in the parameter names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.


Minimal Syntax: % topir [-INfile=]seqname(s) -Default

Prompted Parameters:

-BEGin=1 -END=444       sets the range of sequence to convert
-REVerse                uses the reverse strand (nucleic acid sequences)
[-OUTfile=]seqname.pir  names the output file

Local Data Files:  None

Optional Parameters:

-CIRcular                indicates that the input sequence(s) are circular
-NUMbering               numbers the output sequence(s)
-NODOCumentation         suppresses documentation accompanying each sequence

LOCAL DATA FILES

[ Previous | Top | Next ]

None.

PARAMETER REFERENCE

[ Previous | Top | Next ]

You can set the parameters listed below from the command line. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.

-CIRcular

treats the sequence(s) as circular. The header line for each circular sequence in the output file begins with >DC;.

-NUMbering

adds numbering next to each sequence line.

-NODOCumentation

suppresses writing the sequence documentation in the output file.

Printed: December 9, 1998 16:27 (1162)

[ Program Manual | User's Guide | Data Files | Databases ]


Documentation Comments: doc-comments@gcg.com
Technical Support: help@gcg.com

Copyright (c) 1982, 1983, 1985, 1986, 1987, 1989, 1991, 1994, 1995, 1996, 1997, 1998 Genetics Computer Group Inc., a wholly owned subsidiary of Oxford Molecular Group, Inc. All rights reserved.

Licenses and Trademarks Wisconsin Package is a trademark of Genetics Computer Group, Inc. GCG and the GCG logo are registered trademarks of Genetics Computer Group, Inc.

All other product names mentioned in this documentation may be trademarks, and if so, are trademarks or registered trademarks of their respective holders and are used in this documentation for identification purposes only.

Genetics Computer Group

www.gcg.com