commit f8b7bfff1b9e933830c089d9d7f24a7349326363
Author: starsareintherose <starsareintherose@outlook.com>
Date:   Sat Dec 4 05:07:58 2021 +0000

    init

diff --git a/README.txt b/README.txt
new file mode 100644
index 0000000..f8c2986
--- /dev/null
+++ b/README.txt
@@ -0,0 +1,296 @@
+                       General Information
+                   (Not for the faint hearted)
+
+			30 September 1992
+
+
+0. Introduction
+---------------
+
+This document contains information on the following subjects:
+
+   1. Installing the Staden Package on SPARCstations and DECstations
+   2. Installing the Staden Package on Other Machines
+   3. A Quick Guide to What's on the Release Tape
+   4. Overview of Data Flow During Sequence Assembly
+   5. Acknowledgements
+
+
+
+1. Installing the Staden Package on SPARCstations and DECstations
+-----------------------------------------------------------------
+
+We are endeavouring to make the installation of the Staden Package as
+quick and as easy as possible. In this current release we provide
+statically linked sparc and mips executables as well as all sources.
+
+To install the package:
+
+1) Create a new directory for the software. You may have to log on as
+superuser to do this.
+
+	% mkdir -p /home/BioSW/staden
+
+2) Place the distribution tape in the drive and down load the package:
+
+   -sun-
+	% tar xvf /dev/rst0
+	...system messages...
+
+   -dec- 
+	% tar xvf /dev/rmt0h
+	...system messages...
+
+3) Users of the C Shell should add the following to his/her .login
+file:
+
+	setenv STADENROOT /home/BioSW/staden
+	source $STADENROOT/staden.login
+
+Users of the Bourne shell should add the following to their .profile
+file: 
+
+	STADENROOT=/home/BioSW/staden
+	export STADENROOT
+	. $STADENROOT/staden.profile
+
+
+4) When the user next logs onto the work station the required
+initialisation will automatically be performed, and the programs in
+the Staden package can be run. Refer to the help/*.MEM files for
+information on the various program. (eg help on xdap is in
+help/DAP.MEM)
+
+
+2. Installing the Staden Package on Other Machines
+--------------------------------------------------
+
+This is a little more difficult as you will need to remake all the
+executables. Your system configuration may also mean that some changes
+will need to be made, though hopefully only to makefiles. We provide
+a script to aid installation (we hope!), but you may prefer to make
+all the components manually.
+
+To remake the Staden package you will require the following:
+	1) A Fortran77 compiler
+	2) An ANSI C compiler
+	3) X11 Release 4, including the Athena Widget libraries.
+
+Start by following step 1 through 3 above, to unload the sources and
+perform initialisations. Read the rest of this document and the other
+help files. Look at the make files. Follow your nose!
+
+If you have any problems or successes porting our software to other
+platforms we would love to hear from you. We would also appreciate
+receiving your general comments on the package.
+
+Rodger Staden (principle author)
+  phone: +44 223 402389  email: rs@mrc-lmba.cam.ac.uk
+  post: MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, U.K.
+Simon Dear:    
+  phone: +44 223 402266  email: sd@mrc-lmba.cam.ac.uk
+  post: MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, U.K.
+James Bonfield:
+  phome: +44 223 402499  email: jkb@mrc-lmba.cam.ac.uk
+  post: MRC Laboratory of Molecular Biology, Hills Road, Cambridge CB2 2QH, U.K.
+
+
+
+3. A Quick Guide to What's on the Release Tape
+----------------------------------------------
+
+The directory structure on this tape is very important. Once set up, the Staden
+package expects things to be in a predefined place. The root directory
+of the structure is referred to by the environment variable
+STADENROOT. Below this there should be at least the following:
+
+1) bin/
+All executable files and scripts should be in this directory.
+$STADENROOT/bin is added to the search path by the script staden.login
+(or staden.profile if you are using the Bourne Shell). Though you are
+not forced to keep programs here, we find it is the simplest place to
+keep them.
+
+2) help/
+All on-line help files are in this directory. Files of the form *.MEM
+or *.mem are formatted ascii files and can be printed for personal
+reference. The script staden.login sets up many environment variables
+that refer to files in this directory, as well as modifying
+XFILESEARCHPATH, which is used by X programs.
+
+3) manl/
+Local manual pages for ted and the staden package are in this directory. The
+environment variable MANPATH is modified in staden.login to search
+here too.
+
+4) staden.login and staden.profile
+These two files are scripts to set up environment variables required
+by the Staden package. C Shell users should source staden.login from
+their .login file, and Bourne Shell users should "source" staden.profile
+from their .profile directory. See "Installing the Staden Package on
+SPARCstations and DECstations", Part 3.
+
+5) tables/
+Configuration files for the Staden package are in this directory.
+Various environment variables are set in staden.login to refer to
+files in this directory.
+
+Also of use are the following:
+
+doc/           - Miscellaneous documentation.
+userdata/      - Sample databases
+src/           - program sources
+ReleaseNotes   - Notes on this and future releases
+Staden_install - Installation script
+SequenceLibraries - Notes on the use and installation of sequence libraries
+
+
+Program Sources
+---------------
+
+All the program sources are found in the directories in $STADENROOT/src:
+
+0) Misc/
+Sources for a library of useful routines used by the staden package.
+** Should be made before the programs in staden/ **
+
+1) staden/
+Sources for the Staden suite: mep, xmep, nip, xnip, nipl, pip, xpip,
+pipl, sap (now superseded by dap), xsap (now superceded by xdap), sip,
+xsip, sipl, dap, xdap, splitp1, splitp2, splitp3, gip and convert_project.
+
+2) ted/
+Sources for the trace display and sequence editing program ted.
+
+3) abi/
+Sample scripts and programs for handling ABI 373A data files.
+
+4) alf/
+Sample scripts and programs for handling Pharmacia A.L.F. data files.
+
+Each directory has appropriate makefiles and README files.
+
+
+
+4. Overview of Data Flow During Sequence Assembly
+-------------------------------------------------
+
+During a sequence assembly project the data can enter the sequence
+assembly program from various routes (See Figure below).
+
+
+       
+             Fluorescent Based
+             Sequencing Machine
+                Chromatogram                      Autoradiogram
+
+	 ABI 373A     	Pharmacia A.L.F.                |  
+             |                 |                        |
+             |                 |                        |
+             |             alfsplit                     |
+             |                 |                        |
+             +--------+--------+                        |
+                      |                                 |
+                      |                                 |
+                     ted                              (gip)
+                      |                                 |
+                      +----------------+----------------+
+                                       |
+                                       |
+                                     xdap
+
+
+                 Figure 1: Data Flow Through The Staden Suite
+
+
+The Pharmacia A.L.F. data files in their original format consist of
+one file for the (up to 10) samples that were on the gel. The program
+alfsplit divides the file up so that each sample is in a file of
+its own. From then on each gel reading can be handled individually.
+Whether these files can be transferred back to the Compaq for
+reprocessing is unknown.
+
+All data from fluorescent based sequencing machines must pass through
+the trace editing program ted. Ted allows data vector sequence at the
+5' end and unreliable data at the 3' end to be clipped. The sequence
+can be edited if desired, though we should stress that this is NOT
+RECOMMENDED when used in conjunction with xdap. Ted translates all
+Pharmacia A.L.F. uncertainty codes to a hyphen ("-") and outputs the
+clipped sequence, along with additional information on the position
+and content of cutoffs, to a file.
+
+People wanting to use xdap with ABI and Pharmacia files, but who have
+written their own trace clipping software should be aware that xdap
+requires information to be passed in the sequence file so that
+traces can be displayed. You may want to modify your software to be
+compatible with our file format. The file consists of four parts:
+
+	1) Cut off information (Optional).
+	Format is ";%6d%6d%6d%-4s%-16s", where
+	field 1 = total number of bases called
+	      2 = number of bases in the clipped sequence at the 5' end
+	      3 = number of bases in the sequence in this file
+	      4 = type of trace file.
+	          "ALF " - Pharmacia A.L.F.
+		  "ABI " - ABI 373A
+		  "SCF " - SCF
+		  "PLN " - Text only
+	      5 = name of trace file.
+
+	2) Content of the clipped sequence at the 5' end (Optional).
+	The sequence can extend over several lines. Each line must
+	begin with ";<" and should be less than 80 characters in
+	length.
+
+	3) Content of the clipped sequence at the 3' end (Optional).
+	The sequence can extend over several lines. Each line must
+	begin with ";>" and should be less than 80 characters in
+	length.
+
+	4) Initial tags for the sequence (Optional)
+	Format is: ";;%4s %6d %6d %s\n", where
+	field 1 = type of tag to be created (see $STADTABL/TAGDB)
+	      2 = position of tag
+	      3 = length of tag
+	      4 = annotation for tag (optional)
+	This feature is only available in the program xbap, which
+	at the time of writing is not yet being distributed with
+	the package.
+
+	5) The sequence, which can extend over several lines. Each
+	line should be less than 80 characters in length.
+
+Here is a sample file:
+
+;   660    55   450ABI a21d12.s1RES
+;<AGCTTGCATGCCTGCAGGTCGACTCTAGAGGATCCCCCGGTTCCTTCTGG
+;<ATATC
+;>-GATAAGCTGATTTG-TTT-CCATTATGGC-GGTTTGAGCCTC-G-GGTC
+;>GACCACTCGGTGTGCCAGGAAGGGGTCTGAAATTGAATGGGTTATCACTA
+;>GGCGACGTTT--TTTTCAAATTCCGGGCTAAATTTTACGGC-GGA-CGGT
+;>TCCG-
+;;COMM      1     10 M13mp18 subclone
+CAAGACATTTTGAAATACTTGGAATACTGAATCCAAGATGTGGAACATTA
+GACATATCCGTGTGCTCAACAATCGACATTTGATCCACTGATGAAAATGT
+TCTTCGTTTAGAATTTCTCATAGCATCAGCCACTTTTGCATAATACTCGA
+TTGAAGGTTCATGGAAAAAGCTGCGTAGAAGGCATGTCATTGTGCTTACG
+AGCCATTTCGGATATCTTGTGAATTTAGCAGGAAGTTCTGTAACTGGTTG
+GAATTCAAATATATCAGTTCTTCTTCCTGGATCTCGTCCTTTTTGCACTA
+AAACCATTGCGATTGCATCCGGATTCTGAGTAAGAGCCACTACAGCTTTA
+TGATACAGGCTCTTGTTATTCCTTTCGTGCTCGAATGGGAACTTTCCAGT
+GGCACAAAAATATAGTGTACATCCCAGAGCCCATAGATCACATGTTCCGA
+
+
+
+5. Acknowledgements
+
+We would like to thank Applied Biosystems, Inc. and Pharmacia LKB
+Biotechnology for their cooperation in agreeing to our routines
+accessing the data files of their fluorescent sequencing machines.
+
+373A sequence data file formats are the exclusive property of Applied
+Biosystems, Inc.
+
+ALF sequence data file formats are the exclusive property of Pharmacia
+LKB Biotechnology, Inc.
+
diff --git a/ReleaseNotes b/ReleaseNotes
new file mode 100644
index 0000000..896cae1
--- /dev/null
+++ b/ReleaseNotes
@@ -0,0 +1,190 @@
+		Release Notes for Staden Package 1992.3
+		---------------------------------------
+
+
+	Installation guide
+	------------------
+
+The file doc/install.PS contain installation instructions.
+
+
+	Manual for the Staden Package
+	-----------------------------
+
+There is now a 135 page manual on the Staden Package. It is currently
+being distributed on a Word4 document on a Macintosh floppy disk.
+
+
+	Feedback and bug reports
+	------------------------
+
+We welcome comments and suggestions on all aspects of the package and are
+best contacted by email: rs@uk.ac.cam.mrc-lmb and sd@uk.ac.cam.mrc-lmb.
+All abnormal terminations are bugs and we would like to be told of them
+so they can be fixed. We recommend that you request an update at least once
+a year as the package is evolving very rapidly.
+
+Note due to popular demand we have decided to release new routines earlier 
+than in the past so please report bugs. The documentation for additions may
+be sparser than before, or non-existent, but if there is something with which 
+you need help, email us.
+
+
+	Changes this release
+	--------------------
+
+
+	The assembly programs bap and xbap heve several new functions:
+	1. Find single stranded regions and try to fill them with "hidden"
+	data from the adjacent readings.
+	2. Find single stranded regions (includes ends of contigs) and 
+	select primers and templates for double stranding them (joining
+	them).
+	3. Pre assembly screening for readings to find those that align
+	best. Optionally the hidden data can also be included in the
+	comparison (part of assembly function).
+	4. Find pairs of readings taken from opposite ends of the same
+	template (ie forward and reverse read pairs). List or plot their
+	positions.
+	5. A new function to check that readings have been assembled into
+	the correct positions. It aligns the hidden (previously termed "unused")
+	parts of readings with the consensus they overlap to see how well
+	they align. Poor alignments are reported.
+	6. During assembly each reading is now allowed to match up to 100
+	different places.
+
+	It might be guessed from the above that we are trying to improve our
+	ability to deal with the assembly of human data. Hence, also the next
+	addition.
+
+	A new experimental program (rep) for screening readings for Alu
+	sequences prior to assembly. The Alu containing segments are tagged
+	so they can be seen in the contig editor. A library of Alu sequences
+	is included in /tables/alus. The program is quite slow as it compares
+	each reading in both orientations with all of the Alu sequences (126
+	of them) in order to find the best match. Only time and more data will
+	tell how sensitive it is, and whether the current default score 0f 0.6
+	is "correct". BEWARE rep modifies the original reading files to include
+	the tag information. The only information is in /help/alu.help
+
+	A new program for extracting sets of sequences and their annotations
+	from the sequence libraries (lip). The only information is in
+	/help/lip.help
+
+	Changes to the xterm userinterface. These routines have been completely
+	rewritten. One addition is that now ?? in response to a question will
+	allow the user to get help on any function in a program. help is also
+	improved in the x version.
+
+
+	Changes last release
+	--------------------
+
+
+	DAP, XDAP have been replaced by BAP and XBAP (see below)
+
+	A new function for examining repeats has been added to NIP
+
+	A new repeat search has been added to SIP
+
+	Some outputs have been changed to produce FASTA format files
+	instead of PIR.
+
+	MEP now allows searches for motifs in which any 8 out of a string
+	of 20 can be switched on.
+
+	The manual has been updated.
+
+        Keyword and author searches on sequence libraries
+
+	All programs that use the libraries can now perform author
+and keyword searches on all libraries (only nip did so before).
+
+	Postscript output
+
+	All graphics can now be saved to disk in postscript form by
+use of a sub-option in "Redirect output".
+	
+
+
+	Sequence assembly
+
+BAP, XBAP replace DAP and XDAP. A program to convert DAP databases to BAP
+databases (convert) is included. BAP databases can contain up to 8000 readings
+and a consensus of 500,000 bases. A minor edit and recompilation will allow
+up to 99,999 readings. The space is used more efficiently now as the databases
+grow as the number of readings increases. Reading names can be 16 characters
+in length. In addition:
+
+1) Assembly is 4 times as fast as in the DAP.
+
+2) Find internal joins is 5 times as fast and now brings up the join editor
+with the two contigs in the correct orientation and aligned.
+
+3) The assembly routines align pads better, plus a new automatic function can
+also be used to align them prior to editing.
+
+4) The contig editor has been greatly speeded up and its functionality
+has been enhanced.
+
+5) A routine for selecting oligos for primer walking is included. 
+
+6) A new routine allows batches of readings to be removed from a database. 
+
+7) We have also included routines for making SCF files, for getting the 
+sequence from SCF files, and one for marking the poor quality data in 
+readings. See the manual.
+
+	Sequence library formats
+
+	The standard sequence library indexing method is now that used on the
+EMBL CD-ROM. The libraries (EMBL nucleotide and SWISSPROT protein) can be
+left on the CD-ROM or copied to disk. We include in the package programs
+for creating this type of index for EMBL updates, PIR in codata format,
+NRL3D and GenBank. If the indexes are created all programs can read all
+these libraries. Programs and scripts for this task are contained in the
+directory indexseqlibs.
+	The keyword and author searches are particularly fast and the
+keyword index is based on ALL text in the files - not just the keywords.
+
+	Feature table formats
+
+	The programs now use the new feature table format common to EMBL
+and GenBank, but retain the old format for SWISSPROT which has not yet
+changed. 
+
+	For details of the above see file SequenceLibraries.
+
+	Pattern searches
+
+	Pipl and Nipl now have the facility to find only the best scoring
+match for each sequence. The prompt is "? report all matches", so typing
+only return means all matches will be shown and typing n means only the
+highest scoring will be reported. It is particularly useful when employed
+to create alignments. The corresponding help file has not been updated.
+Also to incorporate long unix file names the pattern files no longer include
+the annotation "filename".
+
+
+	Nip
+
+	Option 38 in nip "translate and list" has been removed as the the
+more flexible routines of option 39 incorporate all its functionality. Many
+options that relate to feature tables have been modified but their help files
+are not yet up to date.
+
+
+        Vep
+
+	A program (vep) for automatic excising of vector (either
+sequencing vector or cosmid vector) sequences from readings is now
+included in the package.
+
+
+
+
+	Rodger Staden, Simon Dear, James Bonfield
+
+  
+
+
diff --git a/SequenceLibraries b/SequenceLibraries
new file mode 100644
index 0000000..5f22ff9
--- /dev/null
+++ b/SequenceLibraries
@@ -0,0 +1,420 @@
+	Notes on library handling
+	-------------------------
+
+Contents of this document:
+
+I)   Introduction
+II)  Details of file organisation and use
+III) Options currently available
+IV)  Installation guide
+V)   New feature table handling routines
+VI)  Indexing the sequence libraries
+
+
+		Section I Introduction
+		----------------------
+
+Available sequence libraries
+
+There are a number of different sequence libraries for nucleotide and protein:
+PIR, GenBank, EMBL, Swissprot, and the Japanese Databank. Even after all the
+years of their existence they still use different formats for their data. This
+provides tedious and unrewarding work for software developers. Recently EMBL
+and GenBank agreed a new and common way of writing their feature tables, which
+is great help, although the rest of their format is different. Swissprot still
+uses the old embl style feature table format and PIR yet another.
+
+All the libraries distribute their data on magnetic tapes and EMBL and GenBank
+have started to distribute on cdrom. The EMBL cdrom also contains Swissprot.
+The GenBank and EMBL cdroms use different formats and have different contents.
+The EMBL cdrom has useful indexes sorted alphabetically: those for entry name
+and accession number, brief descriptions, keywords and freetext indexes are
+already available and others are expected. These indexes point to the data for
+each entry, and can be used to extract the data for any entry quickly.
+
+Moving to unix
+
+The VAX version of our package used PIR format which meant reformatting all
+libraries other than PIR into that format.  This required, at least
+temporarily, having space for two copies of the libraries, and quite a lot of
+cpu time. The software for doing this was provided by PIR, and is very VAX
+specific and hence will not run under unix. For the unix version of our package
+I have decided to use the EMBL cdrom format and its indexes as the primary
+format.  The current programs also support the use of PIR format libraries
+without indexes - ie just the sequence and annotation files.
+
+Indexing GenBank, EMBL updates, PIR and NRL3D
+
+We include programs to create indexes for the above libraries. See below and
+the README file in indexseqlibs. The programs can read all the above libraries
+once the indexes are created. The indexing programs index the data in its
+distributed form: WE DO NOT REFORMAT OR COPY THE LIBRARIES but simply create
+indexes to the original files. Obviously this saves a lot of disk space, and
+for those content to use only embl and swissprot from the cdrom, almost no disk
+space is required. We havent tried it yet, but for genbank on cdrom, the only
+extra disk space required would be for the indexes.
+
+ ---------------------------------------------------------------------------
+
+		Section II Details of file organisation and use
+		-----------------------------------------------
+
+The following strategy has been used to try to deal with alternate
+and changing sequence library formats.
+
+1) libraries are described at several levels:
+
+	a) the top level file is a list of available libraries which contains:
+ 		the library type, the name of the file containing the name of
+		each libraries individual files, and the prompt to appear on
+		the users screen: LTYPE LOGNAM PROMPT
+
+	b) the file containing the names of the libraries individual files
+		contains flags to define the file types: FTYPE LOGNAM
+
+	c) the individual library files
+
+
+
+2) libary types handled: 
+
+	a) EMBL/SWISSPROT in distributed format with cdrom index format
+	   LTYPE = 'A'
+	b) GenBank in distributed format with cdrom index format LTYPE = 'C'
+	c) PIR/NRL3D in CODATA format with cdrom index format LTYPE = 'B'
+	d) PIR/NBRF .seq files can be read sequentially as "personal files
+	   in PIR format" and do not appear in the list of available libraries.
+	e) FASTA format files can be read sequentially as "personal files
+	   in FASTA format" and do not appear in the list of available
+	   libraries.
+
+3) EMBL, SWISSPROT and other libraries for which EMBL-style indexes have been
+created
+
+		current file types: 
+
+		A division.lookup
+		B entryname.index
+		C accession.target
+		D accession.hits
+                E brief description
+                F freetext.target
+                G freetext.hits
+                H author.target
+                I author.hits
+
+
+                   Library list
+level 1
+			|
+			|
+	        -----------------------------------------------------------
+		|			|			|
+	  lib 1 file list		lib 2 file list		lib 3 file list
+level 2
+		|			|
+ 	  --------		---------
+level 3
+	  file 1			file 1
+	  file 2			file 2
+	  .			.
+	  file n			file n
+
+ ---------------------------------------------------------------------------
+
+
+Example
+-------
+
+Level 1
+
+	File name: sequence.libs 
+	Environment variable: SEQUENCELIBRARIES
+	Contents:
+
+A EMBLFILES EMBL nucleotide library ! in cdrom format
+C GENBFILES GenBank nucleotide library!
+A SWISSFILES SWISSPROT protein library! in cdrom format
+B PIRFILES PIR protein library!
+B NRL3DFILES NRL3D protein library!
+
+	Notes: 
+
+The libraries have types A,B,C.  The logical names are EMBLLIBDESCRP and
+SWISSLIBDESCRP, etc and the prompts are 'EMBL nucleotide library' and
+'SWISSPROT protein library', etc.  Anything to the right of a ! is a comment.
+
+Level 2: the list of library files (using embl as an example)
+
+	File name: embl.files
+	Environment variable: EMBLFILES
+	Contents:
+
+A EMBLDIVPATH/embl_div.lkp
+B EMBLINDPATH/entrynam.idx
+C EMBLINDPATH/acnum.trg
+D EMBLINDPATH/acnum.hit
+E EMBLINDPATH/brief.idx
+F EMBLINDPATH/freetext.trg
+G EMBLINDPATH/freetext.hit
+H EMBLINDPATH/author.trg
+I EMBLINDPATH/author.hit
+
+
+Level 3: the sequence and annotation files (eg 15 for embl, 1 for swissprot).
+
+	Paths and file names:
+
+   EMBLPATH/bb.dat
+   EMBLPATH/fun.dat
+   EMBLPATH/inv.dat
+   EMBLPATH/mam.dat
+   EMBLPATH/org.dat
+   EMBLPATH/patent.dat
+   EMBLPATH/phg.dat
+   EMBLPATH/pln.dat
+   EMBLPATH/pri.dat
+   EMBLPATH/pro.dat
+   EMBLPATH/rod.dat
+   EMBLPATH/syn.dat
+   EMBLPATH/una.dat
+   EMBLPATH/vrl.dat
+   EMBLPATH/vrt.dat
+
+All files from the division lookup file down are exactly as they appear on the
+cdrom.  The division lookup file relates numbers stored in the indexes to
+actual division (or data) files stored on the disk. We rewrite it so the
+directory structure and file names can be chosen locally. Its format is
+I6,1x,A. An example is given below.
+
+	Division lookup file 
+
+	File name: STADTABL/embl_div.lkp
+	Environment variable path EMBLDIVPATH
+	Contents:
+
+     1 EMBLPATH/bb.dat
+     2 EMBLPATH/fun.dat
+     3 EMBLPATH/inv.dat
+     4 EMBLPATH/mam.dat
+     5 EMBLPATH/org.dat
+     6 EMBLPATH/patent.dat
+     7 EMBLPATH/phg.dat
+     8 EMBLPATH/pln.dat
+     9 EMBLPATH/pri.dat
+    10 EMBLPATH/pro.dat
+    11 EMBLPATH/rod.dat
+    12 EMBLPATH/syn.dat
+    13 EMBLPATH/una.dat
+    14 EMBLPATH/vrl.dat
+    15 EMBLPATH/vrt.dat
+ ---------------------------------------------------------------------------
+
+
+		Section III Options currently available
+		---------------------------------------
+
+Facilities currently offered in nip,pip,sip,nipl,pipl,sipl:
+
+	Get a sequence by knowing its entry name
+	Get a sequences' annotation by knowing its entry name
+	Get an entry name by knowing its accession number
+        Search the freetext index
+        Search the author index
+
+Facilities currently offered in nipl,pipl,sipl:
+
+	Search whole library
+	Search only a list of entry names
+	Search all but a list of entry names
+
+Outline of each type of operation
+
+Looking for an entry by name: the programs will open the library description
+file and read the names of its files and their file types. Then they will open
+the entrynam.idx file, and find the sequence offset, annotation offset and
+division number. Then open the division lookup file, find the file name for the
+division required, open that file, seek to the required byte and get the data.
+
+Looking for an entry by accession number: the programs will open the library
+description file and read the names of its files and their file types. Then
+they open the acnum.trg and acnum.hit files. The acnum.trg file is read to find
+the accession number and a pointer to the acnum.hit file and the number of
+hits.  That file is read and the corresponding entry names displayed. At
+present no further action is performed, although I expect to list out the
+titles for the entries found.
+
+Searching the whole of a library: the programs will open the library
+description file and read the names of its files and their file types. Then
+they open the division lookup file, read the names and numbers of the sequence
+files, open all of them, then open the entryname file. Then the library is
+processed sequentially by reading the entry names, their sequence offsets and
+division numbers from the entry names file, and then the sequence from the
+appropriate data file.
+
+Searching the whole of a library using a list of entry names to include: the
+programs will open the library description file and read the names of its files
+and their file types. Then they open the division lookup file, read the names
+and numbers of the sequence files, open all of them, then open the entryname
+file. Then the library is processed by reading the list of entry names and
+finding the names in the entry names file to get their sequence offsets and
+division numbers, and then the sequence from the appropriate data file. It will
+stop when it reaches the end of the list of entry names. The list of entry
+names can be in any order.
+
+Searching the whole of a library using a list of entry names to exclude: the
+programs will open the library description file and read the names of its files
+and their file types. Then they open the division lookup file, read the names
+and numbers of the sequence files, open all of them, then open the entryname
+file. Then the library is processed sequentially by reading the list of entry
+names, reading the next entry in the entry names file to make sure it does not
+match, then getting the sequence offsets and division numbers, and then the
+sequence from the appropriate data file. If a the next name matches the name on
+the list of entry names, it will be skipped, and the next name to exclude read.
+If the list of excluded names is finished the rest of the library is searched
+sequentially. The list of entry names must be in the same order as those in the
+library (ie sorted alphabetically).
+
+Searching a whole library using a PIR format file is performed by reading it
+sequentially. If as list of entry names is used it must be in the same order as
+the entries in the library file.
+ ---------------------------------------------------------------------------
+
+
+
+
+		Section IV Installation guide
+		-----------------------------
+
+EMBL CDROM
+
+ The data can be left on the cdrom or copied to hard disk. The files
+staden.login and staden.profile source the file $STADTABL/libraries.config.csh
+and $STADTABL/libraries.config.sh respectively. Refer to this file to see what
+is required to install, add or move a sequence library that you want to be used
+by the programs.
+
+Other libraries (PIR, Genbank, EMBL updates)
+
+Create the indexes then edit the files that tell the programs where the data is
+stored.  The files staden.login and staden.profile source the file
+$STADTABL/libraries.config Refer to this file to see what is required to
+install, add or move a sequence library that you want to be used by the
+programs.
+
+
+------------------------------------------------------------------------------
+
+
+		Section V New feature table handling facilities
+		-----------------------------------------------
+
+As mentioned above EMBL and GenBank have recently introduced new feature tables
+for annotating the sequences. They are a great improvement on the previous ones
+and, among other things, now permit correct translation of spliced genes.
+Various options within nip have been added or modified to take advantage of
+these changes.  The routine to translate DNA to protein and write the protein
+to disk now gives correct results for spliced genes. The routine to translate
+DNA to protein and display the two together now gives correct translations
+except for the amino acids spanning intron/exon junctions. The routine to plot
+maps from feature tables can use the new style. The open reading frame finding
+routine writes out its results in the new style. The routine that finds open
+reading frames and writes their translations to disk also writes a title in the
+form of a new style feature table entry. The feature table format output from
+the pattern searches in nip also uses the new style.
+
+ 
+
+----------------------------------------------------------------------------
+
+                Section VI Indexing the sequence libraries
+                --------------------------------------------
+
+We handle EMBL, SwissProt, and GenBank in their distributed format, plus
+PIR and NRL3D in codata format. All programs and scripts are in directory 
+indexseqlibs.
+
+Currently we produce entryname index, accession number index freetext index,
+and brief index (brief index contains the entry name the primary accession 
+number the sequence length and an 80 character description).
+
+To produce any of the indexes requires the creation of several intermediate
+files and the indexing programs are written so that the intermediate files
+are the same for all libraries. This means that only the programs that read
+the distributed form of each library need to be unique to that library, and
+all the other processing programs can be used for all libraries.
+
+
+However even the though the indexes have the same format, programs (like nip)
+that read the libraries need to treat each library separately because their
+actual contents are written differently.
+
+Making the entry name index
+---------------------------
+
+Common program entryname2
+
+EMBL		emblentryname1
+SwissProt	emblentryname1
+
+GenBank		genbentryname1
+
+PIR		pirentryname1
+NRL3D		pirentryname1
+
+
+Making the accession number index
+---------------------------------
+
+Common programs access2 access3 access4
+
+EMBL		emblaccess1
+SwissProt	emblaccess1
+
+GenBank		genbaccess1
+
+PIR		piraccess1 piraccess2 
+NRL3D		No accession numbers
+
+Making the brief index
+----------------------
+
+Common program title2
+
+EMBL		embltitle1
+SwissProt	embltitle1
+
+GenBank		genbtitle1
+
+PIR		pirtitle1 pirtitle2 (pir3 has no accession numbers)
+NRL3D		pirtitle2
+
+Scripts
+-------
+
+emblentryname.script
+emblaccession.script
+embltitle.script
+
+swissentryname.script
+swissaccession.script
+swisstitle.script
+
+genbentrynamescript
+genbaccession.script
+genbtitle.script
+
+pirentryname.script
+piraccession.script
+pirtitle.script
+
+nrl3dentryname.script
+nrl3dtitle.script
+
+
+ 
+
+
+
+
+
diff --git a/Staden_install-alpha b/Staden_install-alpha
new file mode 100644
index 0000000..34a9b69
--- /dev/null
+++ b/Staden_install-alpha
@@ -0,0 +1,453 @@
+#! /bin/csh -f
+#
+# staden_install - version 2.4
+#
+#	This is a prototype installation program.
+#
+# 9 March 1992
+#	Modified for installation on Sun, Alliant, etc
+#	No longer install 2rs
+#
+# 20 November 1992
+#	Now includes convert, cop, frog, getMCH and scf
+#
+# 25 November 1992
+#	SGI supported
+# 
+# 19 May 1993
+#	DEC Alpha, Solaris supported
+# 
+# Written by sd@uk.ac.cam.mrc-lmb
+#
+
+# prelim
+set prog = $0 ; set prog = $prog:t
+
+# Machines supported: al sun dec sgi alpha solaris
+#set MACHINE = `echo $prog | sed 's/.*-//'`
+set MACHINE = alpha
+
+# For local (MRC-LMB) setup only
+#set LOCAL = `echo $prog | awk '/local/{print "YES";exit;}{print "NO";}'`
+set LOCAL = NO
+
+
+echo ""
+echo -n "Staden Package installation procedure - "
+switch (${MACHINE})
+	case "al":
+		echo "Alliant FX/2800 Concentrix version"
+		set MAKE = "make -sk"
+		breaksw
+	case "sun":
+		echo "SunOS version"
+		set MAKE = "make -sk"
+		breaksw
+	case "dec":
+		echo "DEC Ultrix (mips) version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "sgi":
+		echo "Silicon Graphics Iris version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "alpha":
+		echo "DEC Alpha OSF/1 version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "solaris":
+		echo "Solaris version"
+		set MAKE = "make -sk"
+		breaksw
+	default:
+		echo "Panic. Unknown version"
+		exit 1
+endsw
+echo ""
+echo "* starting initialization...please wait."
+echo ""
+
+# Binary fork of source directory
+if ($LOCAL == "YES") then
+    set DIR_BINARIES = ${MACHINE}-binaries
+    set DIR_PROGS = ${MACHINE}-bin
+else
+    set DIR_BINARIES = .
+    set DIR_PROGS = bin
+    set MAKE = "$MAKE -f makefile-${MACHINE}"
+endif
+
+init:
+# Set useful shell variables
+set YES="YES";
+set NO="NO"
+
+# set/unset some .cshrc envs.
+unset noclobber
+set noglob
+
+# set interrupt trap
+onintr end_failure
+
+# Make dir command
+set MKDIR = "mkdir"
+
+# Copy command
+set CP = "cp -p"
+
+# Install command
+#set INSTALL = "install"
+#set INSTALL = "mv"
+set INSTALL = "cp"
+
+# Set up default responses
+set DEF_STADEN_ROOT = `pwd`
+
+set DEF_REQ_NONX = "$YES"
+set DEF_REQ_X = "$YES"
+set DEF_REQ_TED = "$YES"
+set DEF_REQ_MISC = "$YES"
+
+# directories
+set DIR_SRC = $DEF_STADEN_ROOT/src
+set DIR_BIN = $DEF_STADEN_ROOT/$DIR_PROGS
+set DIR_MISC = $DIR_SRC/Misc
+set DIR_STADEN = $DIR_SRC/staden
+set DIR_TED = $DIR_SRC/ted
+set DIR_ABI = $DIR_SRC/abi
+set DIR_ALF = $DIR_SRC/alf
+set DIR_BAP = $DIR_SRC/bap
+set DIR_OSP = $DIR_SRC/bap/osp-bits
+set DIR_CONVERT = $DIR_SRC/convert
+set DIR_COP = $DIR_SRC/cop
+set DIR_FROG = $DIR_SRC/frog
+set DIR_GETMCH = $DIR_SRC/getMCH
+set DIR_SCF = $DIR_SRC/scf
+
+
+main:
+
+	
+preamble:
+	echo ""
+	echo ""
+	echo "* Please answer the following questions."
+	echo "  Default answers to questions are given in square brackets."
+	echo "  If you require help at any stage respond with a ? to the question."
+	echo ""
+
+ask_staden_root:
+	set ANS_STADEN_ROOT = $DEF_STADEN_ROOT
+
+ask_require_nonx_progs: 
+	echo -n "Compile all the non-X programs in the Staden Package [$DEF_REQ_NONX]? "
+	set ANS_REQ_NONX = $<
+	if ("$ANS_REQ_NONX" == "?") then
+		echo "* If you do not have X windows on your system you will require"
+		echo "  these.  However, you will require Tektronics terminal emulation."
+		echo "  If you do not require all of the non-X programs, you should abort"
+		echo "  and manually make the ones you require."
+		echo ""
+		goto ask_require_nonx_progs
+	else if ("$ANS_REQ_NONX" != "") then
+		if ("$ANS_REQ_NONX" =~ [yY]*) then
+			set ANS_REQ_NONX=$YES
+		else if ("$ANS_REQ_NONX" =~ [nN]*) then
+			set ANS_REQ_NONX=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_NONX=$DEF_REQ_NONX
+	endif
+
+ask_require_x_progs:
+	echo -n "Compile all the X programs in the Staden Package [$DEF_REQ_X]? "
+	set ANS_REQ_X = $<
+	if ("$ANS_REQ_X" == "?") then
+		echo "* These are the programs that require X windows."
+		echo "  If you do not require all of the X programs, you should abort"
+		echo "  and manually make the ones you require."
+
+		echo ""
+		goto ask_require_x_progs
+	else if ("$ANS_REQ_X" != "") then
+		if ("$ANS_REQ_X" =~ [yY]*) then
+			set ANS_REQ_X=$YES
+		else if ("$ANS_REQ_X" =~ [nN]*) then
+			set ANS_REQ_X=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_X=$DEF_REQ_X
+	endif
+
+
+ask_require_ted:
+	echo -n "Compile the trace editing program ted [$DEF_REQ_TED]? "
+	set ANS_REQ_TED = $<
+	if ("$ANS_REQ_TED" == "?") then
+		echo "* This is the trace editor program.  It allows you to look at"
+		echo "  traces obtained from automated fluorescent sequencing machines."
+		echo ""
+		goto ask_require_ted
+	else if ("$ANS_REQ_TED" != "") then
+		if ("$ANS_REQ_TED" =~ [yY]*) then
+			set ANS_REQ_TED=$YES
+		else if ("$ANS_REQ_TED" =~ [nN]*) then
+			set ANS_REQ_TED=$NO
+		else
+			goto ask_require_ted
+		endif
+	else
+		set ANS_REQ_TED=$DEF_REQ_TED
+	endif
+
+
+
+ask_require_misc:
+	echo -n "Compile other programs [$DEF_REQ_MISC]? "
+	set ANS_REQ_MISC = $<
+	if ("$ANS_REQ_MISC" == "?") then
+		echo "* Other programs include:"
+		echo "    alfsplit"
+		echo "    getABISampleName"
+		echo ""
+		goto ask_require_misc
+	else if ("$ANS_REQ_MISC" != "") then
+		if ("$ANS_REQ_MISC" =~ [yY]*) then
+			set ANS_REQ_MISC=$YES
+		else if ("$ANS_REQ_MISC" =~ [nN]*) then
+			set ANS_REQ_MISC=$NO
+		else
+			goto ask_require_misc
+		endif
+	else
+		set ANS_REQ_MISC=$DEF_REQ_MISC
+	endif
+
+
+
+time_taken_warning:
+	echo ""
+	echo "The installation procedure is now ready to start."
+	echo ""
+	echo "**** Warning:"
+	echo "    The installation will take considerable time to complete.  If you"
+	echo "    are installing the whole Staden Package from scratch it could"
+	echo "    take as long as an hour for all exectuables to be compiled and"
+	echo "    installed."
+	echo ""
+
+ask_goahead:
+	echo -n "Proceed with the installation [YES]? " 
+	set ANSWER=$<
+	if ("$ANSWER" == "?") then
+		echo "* Final confirmation to proceed with the installation.  Answer"
+		echo "  YES to proceed; otherwise, answer NO to abort the installation."
+		echo ""
+		goto ask_goahead
+	else if ("$ANSWER" != "") then
+		if ("$ANSWER" =~ [nN]*) then
+			goto chickens_exit
+		else if ("$ANSWER" !~ [yY]*) then
+			goto ask_goahead
+		endif
+	endif
+
+installation_proper:
+
+# make binaries directory if it doesn't exist
+
+	if (! -d $DIR_BIN) then
+		$MKDIR $DIR_BIN
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES" || "$ANS_REQ_X" == "$YES" || "$ANS_REQ_NONX" == "$YES" ) then
+		echo ""
+		echo "+ Compiling miscellaneous library"
+
+		pushd $DIR_MISC > /dev/null
+
+		cd $DIR_BINARIES
+	        $MAKE all
+
+		popd > /dev/null
+
+	endif
+
+        if ("$ANS_REQ_NONX" == "$YES") then
+		echo ""
+		echo "+ Installing non X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE nprogs lprogs
+                $INSTALL mep $DIR_BIN
+		$INSTALL nip $DIR_BIN
+		$INSTALL pip $DIR_BIN
+		$INSTALL sap $DIR_BIN
+		$INSTALL sapf $DIR_BIN
+		$INSTALL sip $DIR_BIN
+		$INSTALL splitp1 $DIR_BIN
+		$INSTALL splitp2 $DIR_BIN
+		$INSTALL splitp3 $DIR_BIN
+		$INSTALL sethelp $DIR_BIN
+		$INSTALL gip $DIR_BIN
+		$INSTALL nipl $DIR_BIN
+		$INSTALL pipl $DIR_BIN
+		$INSTALL sipl $DIR_BIN
+		$INSTALL dap $DIR_BIN
+		$INSTALL nipf $DIR_BIN
+		$INSTALL vep $DIR_BIN
+		$INSTALL rep $DIR_BIN
+		$INSTALL lip $DIR_BIN
+		#$INSTALL convert_project $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE bap
+                $INSTALL bap $DIR_BIN
+		popd > /dev/null
+
+	endif
+
+	if ("$ANS_REQ_TED" == "$YES") then
+		echo ""
+		echo "+ Installing Trace editor"
+
+		pushd $DIR_TED > /dev/null
+		cd $DIR_BINARIES
+                $MAKE ted
+		$INSTALL ted $DIR_BIN
+		popd > /dev/null
+	endif
+
+	if ("$ANS_REQ_X" == "$YES") then
+		echo ""
+		echo "+ Installing X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+                $MAKE xprogs
+		$INSTALL xmep $DIR_BIN
+		$INSTALL xnip $DIR_BIN
+		$INSTALL xpip $DIR_BIN
+		$INSTALL xsap $DIR_BIN
+		$INSTALL xsip $DIR_BIN
+		$INSTALL xdap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE xbap
+                $INSTALL xbap $DIR_BIN
+		popd > /dev/null
+
+
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES") then
+		echo ""
+		echo "+ Installing miscellaneous programs"
+
+		pushd $DIR_ABI > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL getABISampleName $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_ALF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE alfsplit
+		$INSTALL alfsplit $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_CONVERT > /dev/null
+		cd $DIR_BINARIES
+                $MAKE convert
+		$INSTALL convert $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_COP > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL cop $DIR_BIN
+		$INSTALL cop-bap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_FROG > /dev/null
+		cd $DIR_BINARIES
+                $MAKE frog
+		$INSTALL frog $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_GETMCH > /dev/null
+		cd $DIR_BINARIES
+                $MAKE trace2seq
+		$INSTALL trace2seq $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_SCF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE makeSCF
+		$INSTALL makeSCF $DIR_BIN
+		popd > /dev/null
+
+
+
+	endif
+
+
+installation_done:
+	echo ""
+	echo "+ Installation completed"
+	echo ""
+
+	echo "  Some further initialisation is required in order to use the"
+	echo "  package.  csh users should insert the following in their .login"
+	echo "  files:"
+	echo "  "
+	echo "  	setenv STADENROOT $ANS_STADEN_ROOT"
+	echo '  	source $STADENROOT/staden.login'
+	echo "  "
+	echo "  Users of the Bourne shell, sh, should insert the following in"
+	echo "  their .profile:"
+	echo "  "
+	echo "  	STADENROOT=$ANS_STADEN_ROOT"
+	echo "  	export STADENROOT"
+	echo '  	. $STADENROOT/staden.profile'
+	echo "  "
+	echo "  These initialisations will alter the shell's search path so that"
+	echo "  it can find the programs in the STADEN Package"
+	echo "  "
+
+normal_exit:
+	exit 0
+
+chickens_exit:
+	echo ""
+	echo "+ Installation cancelled"
+	echo ""
+
+	exit 0
+
+end_failure:
+	unset noglob
+	echo ""
+	echo "Aborted STADEN Package installation on `date`" 
+	echo ""
+	exit 1
+
diff --git a/Staden_install-dec b/Staden_install-dec
new file mode 100644
index 0000000..9b06240
--- /dev/null
+++ b/Staden_install-dec
@@ -0,0 +1,453 @@
+#! /bin/csh -f
+#
+# staden_install - version 2.4
+#
+#	This is a prototype installation program.
+#
+# 9 March 1992
+#	Modified for installation on Sun, Alliant, etc
+#	No longer install 2rs
+#
+# 20 November 1992
+#	Now includes convert, cop, frog, getMCH and scf
+#
+# 25 November 1992
+#	SGI supported
+# 
+# 19 May 1993
+#	DEC Alpha, Solaris supported
+# 
+# Written by sd@uk.ac.cam.mrc-lmb
+#
+
+# prelim
+set prog = $0 ; set prog = $prog:t
+
+# Machines supported: al sun dec sgi alpha solaris
+#set MACHINE = `echo $prog | sed 's/.*-//'`
+set MACHINE = dec
+
+# For local (MRC-LMB) setup only
+#set LOCAL = `echo $prog | awk '/local/{print "YES";exit;}{print "NO";}'`
+set LOCAL = NO
+
+
+echo ""
+echo -n "Staden Package installation procedure - "
+switch (${MACHINE})
+	case "al":
+		echo "Alliant FX/2800 Concentrix version"
+		set MAKE = "make -sk"
+		breaksw
+	case "sun":
+		echo "SunOS version"
+		set MAKE = "make -sk"
+		breaksw
+	case "dec":
+		echo "DEC Ultrix (mips) version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "sgi":
+		echo "Silicon Graphics Iris version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "alpha":
+		echo "DEC Alpha OSF/1 version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "solaris":
+		echo "Solaris version"
+		set MAKE = "make -sk"
+		breaksw
+	default:
+		echo "Panic. Unknown version"
+		exit 1
+endsw
+echo ""
+echo "* starting initialization...please wait."
+echo ""
+
+# Binary fork of source directory
+if ($LOCAL == "YES") then
+    set DIR_BINARIES = ${MACHINE}-binaries
+    set DIR_PROGS = ${MACHINE}-bin
+else
+    set DIR_BINARIES = .
+    set DIR_PROGS = bin
+    set MAKE = "$MAKE -f makefile-${MACHINE}"
+endif
+
+init:
+# Set useful shell variables
+set YES="YES";
+set NO="NO"
+
+# set/unset some .cshrc envs.
+unset noclobber
+set noglob
+
+# set interrupt trap
+onintr end_failure
+
+# Make dir command
+set MKDIR = "mkdir"
+
+# Copy command
+set CP = "cp -p"
+
+# Install command
+#set INSTALL = "install"
+#set INSTALL = "mv"
+set INSTALL = "cp"
+
+# Set up default responses
+set DEF_STADEN_ROOT = `pwd`
+
+set DEF_REQ_NONX = "$YES"
+set DEF_REQ_X = "$YES"
+set DEF_REQ_TED = "$YES"
+set DEF_REQ_MISC = "$YES"
+
+# directories
+set DIR_SRC = $DEF_STADEN_ROOT/src
+set DIR_BIN = $DEF_STADEN_ROOT/$DIR_PROGS
+set DIR_MISC = $DIR_SRC/Misc
+set DIR_STADEN = $DIR_SRC/staden
+set DIR_TED = $DIR_SRC/ted
+set DIR_ABI = $DIR_SRC/abi
+set DIR_ALF = $DIR_SRC/alf
+set DIR_BAP = $DIR_SRC/bap
+set DIR_OSP = $DIR_SRC/bap/osp-bits
+set DIR_CONVERT = $DIR_SRC/convert
+set DIR_COP = $DIR_SRC/cop
+set DIR_FROG = $DIR_SRC/frog
+set DIR_GETMCH = $DIR_SRC/getMCH
+set DIR_SCF = $DIR_SRC/scf
+
+
+main:
+
+	
+preamble:
+	echo ""
+	echo ""
+	echo "* Please answer the following questions."
+	echo "  Default answers to questions are given in square brackets."
+	echo "  If you require help at any stage respond with a ? to the question."
+	echo ""
+
+ask_staden_root:
+	set ANS_STADEN_ROOT = $DEF_STADEN_ROOT
+
+ask_require_nonx_progs: 
+	echo -n "Compile all the non-X programs in the Staden Package [$DEF_REQ_NONX]? "
+	set ANS_REQ_NONX = $<
+	if ("$ANS_REQ_NONX" == "?") then
+		echo "* If you do not have X windows on your system you will require"
+		echo "  these.  However, you will require Tektronics terminal emulation."
+		echo "  If you do not require all of the non-X programs, you should abort"
+		echo "  and manually make the ones you require."
+		echo ""
+		goto ask_require_nonx_progs
+	else if ("$ANS_REQ_NONX" != "") then
+		if ("$ANS_REQ_NONX" =~ [yY]*) then
+			set ANS_REQ_NONX=$YES
+		else if ("$ANS_REQ_NONX" =~ [nN]*) then
+			set ANS_REQ_NONX=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_NONX=$DEF_REQ_NONX
+	endif
+
+ask_require_x_progs:
+	echo -n "Compile all the X programs in the Staden Package [$DEF_REQ_X]? "
+	set ANS_REQ_X = $<
+	if ("$ANS_REQ_X" == "?") then
+		echo "* These are the programs that require X windows."
+		echo "  If you do not require all of the X programs, you should abort"
+		echo "  and manually make the ones you require."
+
+		echo ""
+		goto ask_require_x_progs
+	else if ("$ANS_REQ_X" != "") then
+		if ("$ANS_REQ_X" =~ [yY]*) then
+			set ANS_REQ_X=$YES
+		else if ("$ANS_REQ_X" =~ [nN]*) then
+			set ANS_REQ_X=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_X=$DEF_REQ_X
+	endif
+
+
+ask_require_ted:
+	echo -n "Compile the trace editing program ted [$DEF_REQ_TED]? "
+	set ANS_REQ_TED = $<
+	if ("$ANS_REQ_TED" == "?") then
+		echo "* This is the trace editor program.  It allows you to look at"
+		echo "  traces obtained from automated fluorescent sequencing machines."
+		echo ""
+		goto ask_require_ted
+	else if ("$ANS_REQ_TED" != "") then
+		if ("$ANS_REQ_TED" =~ [yY]*) then
+			set ANS_REQ_TED=$YES
+		else if ("$ANS_REQ_TED" =~ [nN]*) then
+			set ANS_REQ_TED=$NO
+		else
+			goto ask_require_ted
+		endif
+	else
+		set ANS_REQ_TED=$DEF_REQ_TED
+	endif
+
+
+
+ask_require_misc:
+	echo -n "Compile other programs [$DEF_REQ_MISC]? "
+	set ANS_REQ_MISC = $<
+	if ("$ANS_REQ_MISC" == "?") then
+		echo "* Other programs include:"
+		echo "    alfsplit"
+		echo "    getABISampleName"
+		echo ""
+		goto ask_require_misc
+	else if ("$ANS_REQ_MISC" != "") then
+		if ("$ANS_REQ_MISC" =~ [yY]*) then
+			set ANS_REQ_MISC=$YES
+		else if ("$ANS_REQ_MISC" =~ [nN]*) then
+			set ANS_REQ_MISC=$NO
+		else
+			goto ask_require_misc
+		endif
+	else
+		set ANS_REQ_MISC=$DEF_REQ_MISC
+	endif
+
+
+
+time_taken_warning:
+	echo ""
+	echo "The installation procedure is now ready to start."
+	echo ""
+	echo "**** Warning:"
+	echo "    The installation will take considerable time to complete.  If you"
+	echo "    are installing the whole Staden Package from scratch it could"
+	echo "    take as long as an hour for all exectuables to be compiled and"
+	echo "    installed."
+	echo ""
+
+ask_goahead:
+	echo -n "Proceed with the installation [YES]? " 
+	set ANSWER=$<
+	if ("$ANSWER" == "?") then
+		echo "* Final confirmation to proceed with the installation.  Answer"
+		echo "  YES to proceed; otherwise, answer NO to abort the installation."
+		echo ""
+		goto ask_goahead
+	else if ("$ANSWER" != "") then
+		if ("$ANSWER" =~ [nN]*) then
+			goto chickens_exit
+		else if ("$ANSWER" !~ [yY]*) then
+			goto ask_goahead
+		endif
+	endif
+
+installation_proper:
+
+# make binaries directory if it doesn't exist
+
+	if (! -d $DIR_BIN) then
+		$MKDIR $DIR_BIN
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES" || "$ANS_REQ_X" == "$YES" || "$ANS_REQ_NONX" == "$YES" ) then
+		echo ""
+		echo "+ Compiling miscellaneous library"
+
+		pushd $DIR_MISC > /dev/null
+
+		cd $DIR_BINARIES
+	        $MAKE all
+
+		popd > /dev/null
+
+	endif
+
+        if ("$ANS_REQ_NONX" == "$YES") then
+		echo ""
+		echo "+ Installing non X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE nprogs lprogs
+                $INSTALL mep $DIR_BIN
+		$INSTALL nip $DIR_BIN
+		$INSTALL pip $DIR_BIN
+		$INSTALL sap $DIR_BIN
+		$INSTALL sapf $DIR_BIN
+		$INSTALL sip $DIR_BIN
+		$INSTALL splitp1 $DIR_BIN
+		$INSTALL splitp2 $DIR_BIN
+		$INSTALL splitp3 $DIR_BIN
+		$INSTALL sethelp $DIR_BIN
+		$INSTALL gip $DIR_BIN
+		$INSTALL nipl $DIR_BIN
+		$INSTALL pipl $DIR_BIN
+		$INSTALL sipl $DIR_BIN
+		$INSTALL dap $DIR_BIN
+		$INSTALL nipf $DIR_BIN
+		$INSTALL vep $DIR_BIN
+		$INSTALL rep $DIR_BIN
+		$INSTALL lip $DIR_BIN
+		#$INSTALL convert_project $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE bap
+                $INSTALL bap $DIR_BIN
+		popd > /dev/null
+
+	endif
+
+	if ("$ANS_REQ_TED" == "$YES") then
+		echo ""
+		echo "+ Installing Trace editor"
+
+		pushd $DIR_TED > /dev/null
+		cd $DIR_BINARIES
+                $MAKE ted
+		$INSTALL ted $DIR_BIN
+		popd > /dev/null
+	endif
+
+	if ("$ANS_REQ_X" == "$YES") then
+		echo ""
+		echo "+ Installing X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+                $MAKE xprogs
+		$INSTALL xmep $DIR_BIN
+		$INSTALL xnip $DIR_BIN
+		$INSTALL xpip $DIR_BIN
+		$INSTALL xsap $DIR_BIN
+		$INSTALL xsip $DIR_BIN
+		$INSTALL xdap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE xbap
+                $INSTALL xbap $DIR_BIN
+		popd > /dev/null
+
+
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES") then
+		echo ""
+		echo "+ Installing miscellaneous programs"
+
+		pushd $DIR_ABI > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL getABISampleName $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_ALF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE alfsplit
+		$INSTALL alfsplit $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_CONVERT > /dev/null
+		cd $DIR_BINARIES
+                $MAKE convert
+		$INSTALL convert $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_COP > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL cop $DIR_BIN
+		$INSTALL cop-bap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_FROG > /dev/null
+		cd $DIR_BINARIES
+                $MAKE frog
+		$INSTALL frog $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_GETMCH > /dev/null
+		cd $DIR_BINARIES
+                $MAKE trace2seq
+		$INSTALL trace2seq $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_SCF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE makeSCF
+		$INSTALL makeSCF $DIR_BIN
+		popd > /dev/null
+
+
+
+	endif
+
+
+installation_done:
+	echo ""
+	echo "+ Installation completed"
+	echo ""
+
+	echo "  Some further initialisation is required in order to use the"
+	echo "  package.  csh users should insert the following in their .login"
+	echo "  files:"
+	echo "  "
+	echo "  	setenv STADENROOT $ANS_STADEN_ROOT"
+	echo '  	source $STADENROOT/staden.login'
+	echo "  "
+	echo "  Users of the Bourne shell, sh, should insert the following in"
+	echo "  their .profile:"
+	echo "  "
+	echo "  	STADENROOT=$ANS_STADEN_ROOT"
+	echo "  	export STADENROOT"
+	echo '  	. $STADENROOT/staden.profile'
+	echo "  "
+	echo "  These initialisations will alter the shell's search path so that"
+	echo "  it can find the programs in the STADEN Package"
+	echo "  "
+
+normal_exit:
+	exit 0
+
+chickens_exit:
+	echo ""
+	echo "+ Installation cancelled"
+	echo ""
+
+	exit 0
+
+end_failure:
+	unset noglob
+	echo ""
+	echo "Aborted STADEN Package installation on `date`" 
+	echo ""
+	exit 1
+
diff --git a/Staden_install-sgi b/Staden_install-sgi
new file mode 100644
index 0000000..7ce5b1e
--- /dev/null
+++ b/Staden_install-sgi
@@ -0,0 +1,453 @@
+#! /bin/csh -f
+#
+# staden_install - version 2.4
+#
+#	This is a prototype installation program.
+#
+# 9 March 1992
+#	Modified for installation on Sun, Alliant, etc
+#	No longer install 2rs
+#
+# 20 November 1992
+#	Now includes convert, cop, frog, getMCH and scf
+#
+# 25 November 1992
+#	SGI supported
+# 
+# 19 May 1993
+#	DEC Alpha, Solaris supported
+# 
+# Written by sd@uk.ac.cam.mrc-lmb
+#
+
+# prelim
+set prog = $0 ; set prog = $prog:t
+
+# Machines supported: al sun dec sgi alpha solaris
+#set MACHINE = `echo $prog | sed 's/.*-//'`
+set MACHINE = sgi
+
+# For local (MRC-LMB) setup only
+#set LOCAL = `echo $prog | awk '/local/{print "YES";exit;}{print "NO";}'`
+set LOCAL = NO
+
+
+echo ""
+echo -n "Staden Package installation procedure - "
+switch (${MACHINE})
+	case "al":
+		echo "Alliant FX/2800 Concentrix version"
+		set MAKE = "make -sk"
+		breaksw
+	case "sun":
+		echo "SunOS version"
+		set MAKE = "make -sk"
+		breaksw
+	case "dec":
+		echo "DEC Ultrix (mips) version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "sgi":
+		echo "Silicon Graphics Iris version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "alpha":
+		echo "DEC Alpha OSF/1 version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "solaris":
+		echo "Solaris version"
+		set MAKE = "make -sk"
+		breaksw
+	default:
+		echo "Panic. Unknown version"
+		exit 1
+endsw
+echo ""
+echo "* starting initialization...please wait."
+echo ""
+
+# Binary fork of source directory
+if ($LOCAL == "YES") then
+    set DIR_BINARIES = ${MACHINE}-binaries
+    set DIR_PROGS = ${MACHINE}-bin
+else
+    set DIR_BINARIES = .
+    set DIR_PROGS = bin
+    set MAKE = "$MAKE -f makefile-${MACHINE}"
+endif
+
+init:
+# Set useful shell variables
+set YES="YES";
+set NO="NO"
+
+# set/unset some .cshrc envs.
+unset noclobber
+set noglob
+
+# set interrupt trap
+onintr end_failure
+
+# Make dir command
+set MKDIR = "mkdir"
+
+# Copy command
+set CP = "cp -p"
+
+# Install command
+#set INSTALL = "install"
+#set INSTALL = "mv"
+set INSTALL = "cp"
+
+# Set up default responses
+set DEF_STADEN_ROOT = `pwd`
+
+set DEF_REQ_NONX = "$YES"
+set DEF_REQ_X = "$YES"
+set DEF_REQ_TED = "$YES"
+set DEF_REQ_MISC = "$YES"
+
+# directories
+set DIR_SRC = $DEF_STADEN_ROOT/src
+set DIR_BIN = $DEF_STADEN_ROOT/$DIR_PROGS
+set DIR_MISC = $DIR_SRC/Misc
+set DIR_STADEN = $DIR_SRC/staden
+set DIR_TED = $DIR_SRC/ted
+set DIR_ABI = $DIR_SRC/abi
+set DIR_ALF = $DIR_SRC/alf
+set DIR_BAP = $DIR_SRC/bap
+set DIR_OSP = $DIR_SRC/bap/osp-bits
+set DIR_CONVERT = $DIR_SRC/convert
+set DIR_COP = $DIR_SRC/cop
+set DIR_FROG = $DIR_SRC/frog
+set DIR_GETMCH = $DIR_SRC/getMCH
+set DIR_SCF = $DIR_SRC/scf
+
+
+main:
+
+	
+preamble:
+	echo ""
+	echo ""
+	echo "* Please answer the following questions."
+	echo "  Default answers to questions are given in square brackets."
+	echo "  If you require help at any stage respond with a ? to the question."
+	echo ""
+
+ask_staden_root:
+	set ANS_STADEN_ROOT = $DEF_STADEN_ROOT
+
+ask_require_nonx_progs: 
+	echo -n "Compile all the non-X programs in the Staden Package [$DEF_REQ_NONX]? "
+	set ANS_REQ_NONX = $<
+	if ("$ANS_REQ_NONX" == "?") then
+		echo "* If you do not have X windows on your system you will require"
+		echo "  these.  However, you will require Tektronics terminal emulation."
+		echo "  If you do not require all of the non-X programs, you should abort"
+		echo "  and manually make the ones you require."
+		echo ""
+		goto ask_require_nonx_progs
+	else if ("$ANS_REQ_NONX" != "") then
+		if ("$ANS_REQ_NONX" =~ [yY]*) then
+			set ANS_REQ_NONX=$YES
+		else if ("$ANS_REQ_NONX" =~ [nN]*) then
+			set ANS_REQ_NONX=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_NONX=$DEF_REQ_NONX
+	endif
+
+ask_require_x_progs:
+	echo -n "Compile all the X programs in the Staden Package [$DEF_REQ_X]? "
+	set ANS_REQ_X = $<
+	if ("$ANS_REQ_X" == "?") then
+		echo "* These are the programs that require X windows."
+		echo "  If you do not require all of the X programs, you should abort"
+		echo "  and manually make the ones you require."
+
+		echo ""
+		goto ask_require_x_progs
+	else if ("$ANS_REQ_X" != "") then
+		if ("$ANS_REQ_X" =~ [yY]*) then
+			set ANS_REQ_X=$YES
+		else if ("$ANS_REQ_X" =~ [nN]*) then
+			set ANS_REQ_X=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_X=$DEF_REQ_X
+	endif
+
+
+ask_require_ted:
+	echo -n "Compile the trace editing program ted [$DEF_REQ_TED]? "
+	set ANS_REQ_TED = $<
+	if ("$ANS_REQ_TED" == "?") then
+		echo "* This is the trace editor program.  It allows you to look at"
+		echo "  traces obtained from automated fluorescent sequencing machines."
+		echo ""
+		goto ask_require_ted
+	else if ("$ANS_REQ_TED" != "") then
+		if ("$ANS_REQ_TED" =~ [yY]*) then
+			set ANS_REQ_TED=$YES
+		else if ("$ANS_REQ_TED" =~ [nN]*) then
+			set ANS_REQ_TED=$NO
+		else
+			goto ask_require_ted
+		endif
+	else
+		set ANS_REQ_TED=$DEF_REQ_TED
+	endif
+
+
+
+ask_require_misc:
+	echo -n "Compile other programs [$DEF_REQ_MISC]? "
+	set ANS_REQ_MISC = $<
+	if ("$ANS_REQ_MISC" == "?") then
+		echo "* Other programs include:"
+		echo "    alfsplit"
+		echo "    getABISampleName"
+		echo ""
+		goto ask_require_misc
+	else if ("$ANS_REQ_MISC" != "") then
+		if ("$ANS_REQ_MISC" =~ [yY]*) then
+			set ANS_REQ_MISC=$YES
+		else if ("$ANS_REQ_MISC" =~ [nN]*) then
+			set ANS_REQ_MISC=$NO
+		else
+			goto ask_require_misc
+		endif
+	else
+		set ANS_REQ_MISC=$DEF_REQ_MISC
+	endif
+
+
+
+time_taken_warning:
+	echo ""
+	echo "The installation procedure is now ready to start."
+	echo ""
+	echo "**** Warning:"
+	echo "    The installation will take considerable time to complete.  If you"
+	echo "    are installing the whole Staden Package from scratch it could"
+	echo "    take as long as an hour for all exectuables to be compiled and"
+	echo "    installed."
+	echo ""
+
+ask_goahead:
+	echo -n "Proceed with the installation [YES]? " 
+	set ANSWER=$<
+	if ("$ANSWER" == "?") then
+		echo "* Final confirmation to proceed with the installation.  Answer"
+		echo "  YES to proceed; otherwise, answer NO to abort the installation."
+		echo ""
+		goto ask_goahead
+	else if ("$ANSWER" != "") then
+		if ("$ANSWER" =~ [nN]*) then
+			goto chickens_exit
+		else if ("$ANSWER" !~ [yY]*) then
+			goto ask_goahead
+		endif
+	endif
+
+installation_proper:
+
+# make binaries directory if it doesn't exist
+
+	if (! -d $DIR_BIN) then
+		$MKDIR $DIR_BIN
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES" || "$ANS_REQ_X" == "$YES" || "$ANS_REQ_NONX" == "$YES" ) then
+		echo ""
+		echo "+ Compiling miscellaneous library"
+
+		pushd $DIR_MISC > /dev/null
+
+		cd $DIR_BINARIES
+	        $MAKE all
+
+		popd > /dev/null
+
+	endif
+
+        if ("$ANS_REQ_NONX" == "$YES") then
+		echo ""
+		echo "+ Installing non X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE nprogs lprogs
+                $INSTALL mep $DIR_BIN
+		$INSTALL nip $DIR_BIN
+		$INSTALL pip $DIR_BIN
+		$INSTALL sap $DIR_BIN
+		$INSTALL sapf $DIR_BIN
+		$INSTALL sip $DIR_BIN
+		$INSTALL splitp1 $DIR_BIN
+		$INSTALL splitp2 $DIR_BIN
+		$INSTALL splitp3 $DIR_BIN
+		$INSTALL sethelp $DIR_BIN
+		$INSTALL gip $DIR_BIN
+		$INSTALL nipl $DIR_BIN
+		$INSTALL pipl $DIR_BIN
+		$INSTALL sipl $DIR_BIN
+		$INSTALL dap $DIR_BIN
+		$INSTALL nipf $DIR_BIN
+		$INSTALL vep $DIR_BIN
+		$INSTALL rep $DIR_BIN
+		$INSTALL lip $DIR_BIN
+		#$INSTALL convert_project $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE bap
+                $INSTALL bap $DIR_BIN
+		popd > /dev/null
+
+	endif
+
+	if ("$ANS_REQ_TED" == "$YES") then
+		echo ""
+		echo "+ Installing Trace editor"
+
+		pushd $DIR_TED > /dev/null
+		cd $DIR_BINARIES
+                $MAKE ted
+		$INSTALL ted $DIR_BIN
+		popd > /dev/null
+	endif
+
+	if ("$ANS_REQ_X" == "$YES") then
+		echo ""
+		echo "+ Installing X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+                $MAKE xprogs
+		$INSTALL xmep $DIR_BIN
+		$INSTALL xnip $DIR_BIN
+		$INSTALL xpip $DIR_BIN
+		$INSTALL xsap $DIR_BIN
+		$INSTALL xsip $DIR_BIN
+		$INSTALL xdap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE xbap
+                $INSTALL xbap $DIR_BIN
+		popd > /dev/null
+
+
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES") then
+		echo ""
+		echo "+ Installing miscellaneous programs"
+
+		pushd $DIR_ABI > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL getABISampleName $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_ALF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE alfsplit
+		$INSTALL alfsplit $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_CONVERT > /dev/null
+		cd $DIR_BINARIES
+                $MAKE convert
+		$INSTALL convert $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_COP > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL cop $DIR_BIN
+		$INSTALL cop-bap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_FROG > /dev/null
+		cd $DIR_BINARIES
+                $MAKE frog
+		$INSTALL frog $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_GETMCH > /dev/null
+		cd $DIR_BINARIES
+                $MAKE trace2seq
+		$INSTALL trace2seq $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_SCF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE makeSCF
+		$INSTALL makeSCF $DIR_BIN
+		popd > /dev/null
+
+
+
+	endif
+
+
+installation_done:
+	echo ""
+	echo "+ Installation completed"
+	echo ""
+
+	echo "  Some further initialisation is required in order to use the"
+	echo "  package.  csh users should insert the following in their .login"
+	echo "  files:"
+	echo "  "
+	echo "  	setenv STADENROOT $ANS_STADEN_ROOT"
+	echo '  	source $STADENROOT/staden.login'
+	echo "  "
+	echo "  Users of the Bourne shell, sh, should insert the following in"
+	echo "  their .profile:"
+	echo "  "
+	echo "  	STADENROOT=$ANS_STADEN_ROOT"
+	echo "  	export STADENROOT"
+	echo '  	. $STADENROOT/staden.profile'
+	echo "  "
+	echo "  These initialisations will alter the shell's search path so that"
+	echo "  it can find the programs in the STADEN Package"
+	echo "  "
+
+normal_exit:
+	exit 0
+
+chickens_exit:
+	echo ""
+	echo "+ Installation cancelled"
+	echo ""
+
+	exit 0
+
+end_failure:
+	unset noglob
+	echo ""
+	echo "Aborted STADEN Package installation on `date`" 
+	echo ""
+	exit 1
+
diff --git a/Staden_install-solaris b/Staden_install-solaris
new file mode 100644
index 0000000..93c2feb
--- /dev/null
+++ b/Staden_install-solaris
@@ -0,0 +1,453 @@
+#! /bin/csh -f
+#
+# staden_install - version 2.4
+#
+#	This is a prototype installation program.
+#
+# 9 March 1992
+#	Modified for installation on Sun, Alliant, etc
+#	No longer install 2rs
+#
+# 20 November 1992
+#	Now includes convert, cop, frog, getMCH and scf
+#
+# 25 November 1992
+#	SGI supported
+# 
+# 19 May 1993
+#	DEC Alpha, Solaris supported
+# 
+# Written by sd@uk.ac.cam.mrc-lmb
+#
+
+# prelim
+set prog = $0 ; set prog = $prog:t
+
+# Machines supported: al sun dec sgi alpha solaris
+#set MACHINE = `echo $prog | sed 's/.*-//'`
+set MACHINE = solaris
+
+# For local (MRC-LMB) setup only
+#set LOCAL = `echo $prog | awk '/local/{print "YES";exit;}{print "NO";}'`
+set LOCAL = NO
+
+
+echo ""
+echo -n "Staden Package installation procedure - "
+switch (${MACHINE})
+	case "al":
+		echo "Alliant FX/2800 Concentrix version"
+		set MAKE = "make -sk"
+		breaksw
+	case "sun":
+		echo "SunOS version"
+		set MAKE = "make -sk"
+		breaksw
+	case "dec":
+		echo "DEC Ultrix (mips) version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "sgi":
+		echo "Silicon Graphics Iris version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "alpha":
+		echo "DEC Alpha OSF/1 version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "solaris":
+		echo "Solaris version"
+		set MAKE = "make -sk"
+		breaksw
+	default:
+		echo "Panic. Unknown version"
+		exit 1
+endsw
+echo ""
+echo "* starting initialization...please wait."
+echo ""
+
+# Binary fork of source directory
+if ($LOCAL == "YES") then
+    set DIR_BINARIES = ${MACHINE}-binaries
+    set DIR_PROGS = ${MACHINE}-bin
+else
+    set DIR_BINARIES = .
+    set DIR_PROGS = bin
+    set MAKE = "$MAKE -f makefile-${MACHINE}"
+endif
+
+init:
+# Set useful shell variables
+set YES="YES";
+set NO="NO"
+
+# set/unset some .cshrc envs.
+unset noclobber
+set noglob
+
+# set interrupt trap
+onintr end_failure
+
+# Make dir command
+set MKDIR = "mkdir"
+
+# Copy command
+set CP = "cp -p"
+
+# Install command
+#set INSTALL = "install"
+#set INSTALL = "mv"
+set INSTALL = "cp"
+
+# Set up default responses
+set DEF_STADEN_ROOT = `pwd`
+
+set DEF_REQ_NONX = "$YES"
+set DEF_REQ_X = "$YES"
+set DEF_REQ_TED = "$YES"
+set DEF_REQ_MISC = "$YES"
+
+# directories
+set DIR_SRC = $DEF_STADEN_ROOT/src
+set DIR_BIN = $DEF_STADEN_ROOT/$DIR_PROGS
+set DIR_MISC = $DIR_SRC/Misc
+set DIR_STADEN = $DIR_SRC/staden
+set DIR_TED = $DIR_SRC/ted
+set DIR_ABI = $DIR_SRC/abi
+set DIR_ALF = $DIR_SRC/alf
+set DIR_BAP = $DIR_SRC/bap
+set DIR_OSP = $DIR_SRC/bap/osp-bits
+set DIR_CONVERT = $DIR_SRC/convert
+set DIR_COP = $DIR_SRC/cop
+set DIR_FROG = $DIR_SRC/frog
+set DIR_GETMCH = $DIR_SRC/getMCH
+set DIR_SCF = $DIR_SRC/scf
+
+
+main:
+
+	
+preamble:
+	echo ""
+	echo ""
+	echo "* Please answer the following questions."
+	echo "  Default answers to questions are given in square brackets."
+	echo "  If you require help at any stage respond with a ? to the question."
+	echo ""
+
+ask_staden_root:
+	set ANS_STADEN_ROOT = $DEF_STADEN_ROOT
+
+ask_require_nonx_progs: 
+	echo -n "Compile all the non-X programs in the Staden Package [$DEF_REQ_NONX]? "
+	set ANS_REQ_NONX = $<
+	if ("$ANS_REQ_NONX" == "?") then
+		echo "* If you do not have X windows on your system you will require"
+		echo "  these.  However, you will require Tektronics terminal emulation."
+		echo "  If you do not require all of the non-X programs, you should abort"
+		echo "  and manually make the ones you require."
+		echo ""
+		goto ask_require_nonx_progs
+	else if ("$ANS_REQ_NONX" != "") then
+		if ("$ANS_REQ_NONX" =~ [yY]*) then
+			set ANS_REQ_NONX=$YES
+		else if ("$ANS_REQ_NONX" =~ [nN]*) then
+			set ANS_REQ_NONX=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_NONX=$DEF_REQ_NONX
+	endif
+
+ask_require_x_progs:
+	echo -n "Compile all the X programs in the Staden Package [$DEF_REQ_X]? "
+	set ANS_REQ_X = $<
+	if ("$ANS_REQ_X" == "?") then
+		echo "* These are the programs that require X windows."
+		echo "  If you do not require all of the X programs, you should abort"
+		echo "  and manually make the ones you require."
+
+		echo ""
+		goto ask_require_x_progs
+	else if ("$ANS_REQ_X" != "") then
+		if ("$ANS_REQ_X" =~ [yY]*) then
+			set ANS_REQ_X=$YES
+		else if ("$ANS_REQ_X" =~ [nN]*) then
+			set ANS_REQ_X=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_X=$DEF_REQ_X
+	endif
+
+
+ask_require_ted:
+	echo -n "Compile the trace editing program ted [$DEF_REQ_TED]? "
+	set ANS_REQ_TED = $<
+	if ("$ANS_REQ_TED" == "?") then
+		echo "* This is the trace editor program.  It allows you to look at"
+		echo "  traces obtained from automated fluorescent sequencing machines."
+		echo ""
+		goto ask_require_ted
+	else if ("$ANS_REQ_TED" != "") then
+		if ("$ANS_REQ_TED" =~ [yY]*) then
+			set ANS_REQ_TED=$YES
+		else if ("$ANS_REQ_TED" =~ [nN]*) then
+			set ANS_REQ_TED=$NO
+		else
+			goto ask_require_ted
+		endif
+	else
+		set ANS_REQ_TED=$DEF_REQ_TED
+	endif
+
+
+
+ask_require_misc:
+	echo -n "Compile other programs [$DEF_REQ_MISC]? "
+	set ANS_REQ_MISC = $<
+	if ("$ANS_REQ_MISC" == "?") then
+		echo "* Other programs include:"
+		echo "    alfsplit"
+		echo "    getABISampleName"
+		echo ""
+		goto ask_require_misc
+	else if ("$ANS_REQ_MISC" != "") then
+		if ("$ANS_REQ_MISC" =~ [yY]*) then
+			set ANS_REQ_MISC=$YES
+		else if ("$ANS_REQ_MISC" =~ [nN]*) then
+			set ANS_REQ_MISC=$NO
+		else
+			goto ask_require_misc
+		endif
+	else
+		set ANS_REQ_MISC=$DEF_REQ_MISC
+	endif
+
+
+
+time_taken_warning:
+	echo ""
+	echo "The installation procedure is now ready to start."
+	echo ""
+	echo "**** Warning:"
+	echo "    The installation will take considerable time to complete.  If you"
+	echo "    are installing the whole Staden Package from scratch it could"
+	echo "    take as long as an hour for all exectuables to be compiled and"
+	echo "    installed."
+	echo ""
+
+ask_goahead:
+	echo -n "Proceed with the installation [YES]? " 
+	set ANSWER=$<
+	if ("$ANSWER" == "?") then
+		echo "* Final confirmation to proceed with the installation.  Answer"
+		echo "  YES to proceed; otherwise, answer NO to abort the installation."
+		echo ""
+		goto ask_goahead
+	else if ("$ANSWER" != "") then
+		if ("$ANSWER" =~ [nN]*) then
+			goto chickens_exit
+		else if ("$ANSWER" !~ [yY]*) then
+			goto ask_goahead
+		endif
+	endif
+
+installation_proper:
+
+# make binaries directory if it doesn't exist
+
+	if (! -d $DIR_BIN) then
+		$MKDIR $DIR_BIN
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES" || "$ANS_REQ_X" == "$YES" || "$ANS_REQ_NONX" == "$YES" ) then
+		echo ""
+		echo "+ Compiling miscellaneous library"
+
+		pushd $DIR_MISC > /dev/null
+
+		cd $DIR_BINARIES
+	        $MAKE all
+
+		popd > /dev/null
+
+	endif
+
+        if ("$ANS_REQ_NONX" == "$YES") then
+		echo ""
+		echo "+ Installing non X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE nprogs lprogs
+                $INSTALL mep $DIR_BIN
+		$INSTALL nip $DIR_BIN
+		$INSTALL pip $DIR_BIN
+		$INSTALL sap $DIR_BIN
+		$INSTALL sapf $DIR_BIN
+		$INSTALL sip $DIR_BIN
+		$INSTALL splitp1 $DIR_BIN
+		$INSTALL splitp2 $DIR_BIN
+		$INSTALL splitp3 $DIR_BIN
+		$INSTALL sethelp $DIR_BIN
+		$INSTALL gip $DIR_BIN
+		$INSTALL nipl $DIR_BIN
+		$INSTALL pipl $DIR_BIN
+		$INSTALL sipl $DIR_BIN
+		$INSTALL dap $DIR_BIN
+		$INSTALL nipf $DIR_BIN
+		$INSTALL vep $DIR_BIN
+		$INSTALL rep $DIR_BIN
+		$INSTALL lip $DIR_BIN
+		#$INSTALL convert_project $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE bap
+                $INSTALL bap $DIR_BIN
+		popd > /dev/null
+
+	endif
+
+	if ("$ANS_REQ_TED" == "$YES") then
+		echo ""
+		echo "+ Installing Trace editor"
+
+		pushd $DIR_TED > /dev/null
+		cd $DIR_BINARIES
+                $MAKE ted
+		$INSTALL ted $DIR_BIN
+		popd > /dev/null
+	endif
+
+	if ("$ANS_REQ_X" == "$YES") then
+		echo ""
+		echo "+ Installing X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+                $MAKE xprogs
+		$INSTALL xmep $DIR_BIN
+		$INSTALL xnip $DIR_BIN
+		$INSTALL xpip $DIR_BIN
+		$INSTALL xsap $DIR_BIN
+		$INSTALL xsip $DIR_BIN
+		$INSTALL xdap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE xbap
+                $INSTALL xbap $DIR_BIN
+		popd > /dev/null
+
+
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES") then
+		echo ""
+		echo "+ Installing miscellaneous programs"
+
+		pushd $DIR_ABI > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL getABISampleName $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_ALF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE alfsplit
+		$INSTALL alfsplit $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_CONVERT > /dev/null
+		cd $DIR_BINARIES
+                $MAKE convert
+		$INSTALL convert $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_COP > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL cop $DIR_BIN
+		$INSTALL cop-bap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_FROG > /dev/null
+		cd $DIR_BINARIES
+                $MAKE frog
+		$INSTALL frog $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_GETMCH > /dev/null
+		cd $DIR_BINARIES
+                $MAKE trace2seq
+		$INSTALL trace2seq $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_SCF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE makeSCF
+		$INSTALL makeSCF $DIR_BIN
+		popd > /dev/null
+
+
+
+	endif
+
+
+installation_done:
+	echo ""
+	echo "+ Installation completed"
+	echo ""
+
+	echo "  Some further initialisation is required in order to use the"
+	echo "  package.  csh users should insert the following in their .login"
+	echo "  files:"
+	echo "  "
+	echo "  	setenv STADENROOT $ANS_STADEN_ROOT"
+	echo '  	source $STADENROOT/staden.login'
+	echo "  "
+	echo "  Users of the Bourne shell, sh, should insert the following in"
+	echo "  their .profile:"
+	echo "  "
+	echo "  	STADENROOT=$ANS_STADEN_ROOT"
+	echo "  	export STADENROOT"
+	echo '  	. $STADENROOT/staden.profile'
+	echo "  "
+	echo "  These initialisations will alter the shell's search path so that"
+	echo "  it can find the programs in the STADEN Package"
+	echo "  "
+
+normal_exit:
+	exit 0
+
+chickens_exit:
+	echo ""
+	echo "+ Installation cancelled"
+	echo ""
+
+	exit 0
+
+end_failure:
+	unset noglob
+	echo ""
+	echo "Aborted STADEN Package installation on `date`" 
+	echo ""
+	exit 1
+
diff --git a/Staden_install-sun b/Staden_install-sun
new file mode 100644
index 0000000..0d6711e
--- /dev/null
+++ b/Staden_install-sun
@@ -0,0 +1,453 @@
+#! /bin/csh -f
+#
+# staden_install - version 2.4
+#
+#	This is a prototype installation program.
+#
+# 9 March 1992
+#	Modified for installation on Sun, Alliant, etc
+#	No longer install 2rs
+#
+# 20 November 1992
+#	Now includes convert, cop, frog, getMCH and scf
+#
+# 25 November 1992
+#	SGI supported
+# 
+# 19 May 1993
+#	DEC Alpha, Solaris supported
+# 
+# Written by sd@uk.ac.cam.mrc-lmb
+#
+
+# prelim
+set prog = $0 ; set prog = $prog:t
+
+# Machines supported: al sun dec sgi alpha solaris
+#set MACHINE = `echo $prog | sed 's/.*-//'`
+set MACHINE = sun
+
+# For local (MRC-LMB) setup only
+#set LOCAL = `echo $prog | awk '/local/{print "YES";exit;}{print "NO";}'`
+set LOCAL = NO
+
+
+echo ""
+echo -n "Staden Package installation procedure - "
+switch (${MACHINE})
+	case "al":
+		echo "Alliant FX/2800 Concentrix version"
+		set MAKE = "make -sk"
+		breaksw
+	case "sun":
+		echo "SunOS version"
+		set MAKE = "make -sk"
+		breaksw
+	case "dec":
+		echo "DEC Ultrix (mips) version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "sgi":
+		echo "Silicon Graphics Iris version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "alpha":
+		echo "DEC Alpha OSF/1 version"
+		set MAKE = "gmake -sk"
+		breaksw
+	case "solaris":
+		echo "Solaris version"
+		set MAKE = "make -sk"
+		breaksw
+	default:
+		echo "Panic. Unknown version"
+		exit 1
+endsw
+echo ""
+echo "* starting initialization...please wait."
+echo ""
+
+# Binary fork of source directory
+if ($LOCAL == "YES") then
+    set DIR_BINARIES = ${MACHINE}-binaries
+    set DIR_PROGS = ${MACHINE}-bin
+else
+    set DIR_BINARIES = .
+    set DIR_PROGS = bin
+    set MAKE = "$MAKE -f makefile-${MACHINE}"
+endif
+
+init:
+# Set useful shell variables
+set YES="YES";
+set NO="NO"
+
+# set/unset some .cshrc envs.
+unset noclobber
+set noglob
+
+# set interrupt trap
+onintr end_failure
+
+# Make dir command
+set MKDIR = "mkdir"
+
+# Copy command
+set CP = "cp -p"
+
+# Install command
+#set INSTALL = "install"
+#set INSTALL = "mv"
+set INSTALL = "cp"
+
+# Set up default responses
+set DEF_STADEN_ROOT = `pwd`
+
+set DEF_REQ_NONX = "$YES"
+set DEF_REQ_X = "$YES"
+set DEF_REQ_TED = "$YES"
+set DEF_REQ_MISC = "$YES"
+
+# directories
+set DIR_SRC = $DEF_STADEN_ROOT/src
+set DIR_BIN = $DEF_STADEN_ROOT/$DIR_PROGS
+set DIR_MISC = $DIR_SRC/Misc
+set DIR_STADEN = $DIR_SRC/staden
+set DIR_TED = $DIR_SRC/ted
+set DIR_ABI = $DIR_SRC/abi
+set DIR_ALF = $DIR_SRC/alf
+set DIR_BAP = $DIR_SRC/bap
+set DIR_OSP = $DIR_SRC/bap/osp-bits
+set DIR_CONVERT = $DIR_SRC/convert
+set DIR_COP = $DIR_SRC/cop
+set DIR_FROG = $DIR_SRC/frog
+set DIR_GETMCH = $DIR_SRC/getMCH
+set DIR_SCF = $DIR_SRC/scf
+
+
+main:
+
+	
+preamble:
+	echo ""
+	echo ""
+	echo "* Please answer the following questions."
+	echo "  Default answers to questions are given in square brackets."
+	echo "  If you require help at any stage respond with a ? to the question."
+	echo ""
+
+ask_staden_root:
+	set ANS_STADEN_ROOT = $DEF_STADEN_ROOT
+
+ask_require_nonx_progs: 
+	echo -n "Compile all the non-X programs in the Staden Package [$DEF_REQ_NONX]? "
+	set ANS_REQ_NONX = $<
+	if ("$ANS_REQ_NONX" == "?") then
+		echo "* If you do not have X windows on your system you will require"
+		echo "  these.  However, you will require Tektronics terminal emulation."
+		echo "  If you do not require all of the non-X programs, you should abort"
+		echo "  and manually make the ones you require."
+		echo ""
+		goto ask_require_nonx_progs
+	else if ("$ANS_REQ_NONX" != "") then
+		if ("$ANS_REQ_NONX" =~ [yY]*) then
+			set ANS_REQ_NONX=$YES
+		else if ("$ANS_REQ_NONX" =~ [nN]*) then
+			set ANS_REQ_NONX=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_NONX=$DEF_REQ_NONX
+	endif
+
+ask_require_x_progs:
+	echo -n "Compile all the X programs in the Staden Package [$DEF_REQ_X]? "
+	set ANS_REQ_X = $<
+	if ("$ANS_REQ_X" == "?") then
+		echo "* These are the programs that require X windows."
+		echo "  If you do not require all of the X programs, you should abort"
+		echo "  and manually make the ones you require."
+
+		echo ""
+		goto ask_require_x_progs
+	else if ("$ANS_REQ_X" != "") then
+		if ("$ANS_REQ_X" =~ [yY]*) then
+			set ANS_REQ_X=$YES
+		else if ("$ANS_REQ_X" =~ [nN]*) then
+			set ANS_REQ_X=$NO
+		else
+			goto ask_require_nonx_progs
+		endif
+	else
+		set ANS_REQ_X=$DEF_REQ_X
+	endif
+
+
+ask_require_ted:
+	echo -n "Compile the trace editing program ted [$DEF_REQ_TED]? "
+	set ANS_REQ_TED = $<
+	if ("$ANS_REQ_TED" == "?") then
+		echo "* This is the trace editor program.  It allows you to look at"
+		echo "  traces obtained from automated fluorescent sequencing machines."
+		echo ""
+		goto ask_require_ted
+	else if ("$ANS_REQ_TED" != "") then
+		if ("$ANS_REQ_TED" =~ [yY]*) then
+			set ANS_REQ_TED=$YES
+		else if ("$ANS_REQ_TED" =~ [nN]*) then
+			set ANS_REQ_TED=$NO
+		else
+			goto ask_require_ted
+		endif
+	else
+		set ANS_REQ_TED=$DEF_REQ_TED
+	endif
+
+
+
+ask_require_misc:
+	echo -n "Compile other programs [$DEF_REQ_MISC]? "
+	set ANS_REQ_MISC = $<
+	if ("$ANS_REQ_MISC" == "?") then
+		echo "* Other programs include:"
+		echo "    alfsplit"
+		echo "    getABISampleName"
+		echo ""
+		goto ask_require_misc
+	else if ("$ANS_REQ_MISC" != "") then
+		if ("$ANS_REQ_MISC" =~ [yY]*) then
+			set ANS_REQ_MISC=$YES
+		else if ("$ANS_REQ_MISC" =~ [nN]*) then
+			set ANS_REQ_MISC=$NO
+		else
+			goto ask_require_misc
+		endif
+	else
+		set ANS_REQ_MISC=$DEF_REQ_MISC
+	endif
+
+
+
+time_taken_warning:
+	echo ""
+	echo "The installation procedure is now ready to start."
+	echo ""
+	echo "**** Warning:"
+	echo "    The installation will take considerable time to complete.  If you"
+	echo "    are installing the whole Staden Package from scratch it could"
+	echo "    take as long as an hour for all exectuables to be compiled and"
+	echo "    installed."
+	echo ""
+
+ask_goahead:
+	echo -n "Proceed with the installation [YES]? " 
+	set ANSWER=$<
+	if ("$ANSWER" == "?") then
+		echo "* Final confirmation to proceed with the installation.  Answer"
+		echo "  YES to proceed; otherwise, answer NO to abort the installation."
+		echo ""
+		goto ask_goahead
+	else if ("$ANSWER" != "") then
+		if ("$ANSWER" =~ [nN]*) then
+			goto chickens_exit
+		else if ("$ANSWER" !~ [yY]*) then
+			goto ask_goahead
+		endif
+	endif
+
+installation_proper:
+
+# make binaries directory if it doesn't exist
+
+	if (! -d $DIR_BIN) then
+		$MKDIR $DIR_BIN
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES" || "$ANS_REQ_X" == "$YES" || "$ANS_REQ_NONX" == "$YES" ) then
+		echo ""
+		echo "+ Compiling miscellaneous library"
+
+		pushd $DIR_MISC > /dev/null
+
+		cd $DIR_BINARIES
+	        $MAKE all
+
+		popd > /dev/null
+
+	endif
+
+        if ("$ANS_REQ_NONX" == "$YES") then
+		echo ""
+		echo "+ Installing non X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE nprogs lprogs
+                $INSTALL mep $DIR_BIN
+		$INSTALL nip $DIR_BIN
+		$INSTALL pip $DIR_BIN
+		$INSTALL sap $DIR_BIN
+		$INSTALL sapf $DIR_BIN
+		$INSTALL sip $DIR_BIN
+		$INSTALL splitp1 $DIR_BIN
+		$INSTALL splitp2 $DIR_BIN
+		$INSTALL splitp3 $DIR_BIN
+		$INSTALL sethelp $DIR_BIN
+		$INSTALL gip $DIR_BIN
+		$INSTALL nipl $DIR_BIN
+		$INSTALL pipl $DIR_BIN
+		$INSTALL sipl $DIR_BIN
+		$INSTALL dap $DIR_BIN
+		$INSTALL nipf $DIR_BIN
+		$INSTALL vep $DIR_BIN
+		$INSTALL rep $DIR_BIN
+		$INSTALL lip $DIR_BIN
+		#$INSTALL convert_project $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE bap
+                $INSTALL bap $DIR_BIN
+		popd > /dev/null
+
+	endif
+
+	if ("$ANS_REQ_TED" == "$YES") then
+		echo ""
+		echo "+ Installing Trace editor"
+
+		pushd $DIR_TED > /dev/null
+		cd $DIR_BINARIES
+                $MAKE ted
+		$INSTALL ted $DIR_BIN
+		popd > /dev/null
+	endif
+
+	if ("$ANS_REQ_X" == "$YES") then
+		echo ""
+		echo "+ Installing X programs"
+
+		pushd $DIR_STADEN > /dev/null
+		cd $DIR_BINARIES
+                $MAKE xprogs
+		$INSTALL xmep $DIR_BIN
+		$INSTALL xnip $DIR_BIN
+		$INSTALL xpip $DIR_BIN
+		$INSTALL xsap $DIR_BIN
+		$INSTALL xsip $DIR_BIN
+		$INSTALL xdap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_OSP > /dev/null
+		cd $DIR_BINARIES
+		$MAKE
+		popd > /dev/null		
+
+		pushd $DIR_BAP > /dev/null
+		cd $DIR_BINARIES
+	        $MAKE xbap
+                $INSTALL xbap $DIR_BIN
+		popd > /dev/null
+
+
+	endif
+
+	if ("$ANS_REQ_MISC" == "$YES") then
+		echo ""
+		echo "+ Installing miscellaneous programs"
+
+		pushd $DIR_ABI > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL getABISampleName $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_ALF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE alfsplit
+		$INSTALL alfsplit $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_CONVERT > /dev/null
+		cd $DIR_BINARIES
+                $MAKE convert
+		$INSTALL convert $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_COP > /dev/null
+		cd $DIR_BINARIES
+                $MAKE all
+		$INSTALL cop $DIR_BIN
+		$INSTALL cop-bap $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_FROG > /dev/null
+		cd $DIR_BINARIES
+                $MAKE frog
+		$INSTALL frog $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_GETMCH > /dev/null
+		cd $DIR_BINARIES
+                $MAKE trace2seq
+		$INSTALL trace2seq $DIR_BIN
+		popd > /dev/null
+
+		pushd $DIR_SCF > /dev/null
+		cd $DIR_BINARIES
+                $MAKE makeSCF
+		$INSTALL makeSCF $DIR_BIN
+		popd > /dev/null
+
+
+
+	endif
+
+
+installation_done:
+	echo ""
+	echo "+ Installation completed"
+	echo ""
+
+	echo "  Some further initialisation is required in order to use the"
+	echo "  package.  csh users should insert the following in their .login"
+	echo "  files:"
+	echo "  "
+	echo "  	setenv STADENROOT $ANS_STADEN_ROOT"
+	echo '  	source $STADENROOT/staden.login'
+	echo "  "
+	echo "  Users of the Bourne shell, sh, should insert the following in"
+	echo "  their .profile:"
+	echo "  "
+	echo "  	STADENROOT=$ANS_STADEN_ROOT"
+	echo "  	export STADENROOT"
+	echo '  	. $STADENROOT/staden.profile'
+	echo "  "
+	echo "  These initialisations will alter the shell's search path so that"
+	echo "  it can find the programs in the STADEN Package"
+	echo "  "
+
+normal_exit:
+	exit 0
+
+chickens_exit:
+	echo ""
+	echo "+ Installation cancelled"
+	echo ""
+
+	exit 0
+
+end_failure:
+	unset noglob
+	echo ""
+	echo "Aborted STADEN Package installation on `date`" 
+	echo ""
+	exit 1
+
diff --git a/Version-1993.0.7 b/Version-1993.0.7
new file mode 100644
index 0000000..1689cc5
--- /dev/null
+++ b/Version-1993.0.7
@@ -0,0 +1,91 @@
+Wed Jul 7
+    *Version-1993.0.7*
+    New xbap and ted.
+    Can use Ctrl as well as Meta to shift cutoffs in contig editor.
+    Code to read in ABI traces now robust to ABI problem files, where
+    called base order is not base position order.
+
+Thu Jul 1
+    *Version-1993.0.6*
+    New xbap and bap, to fix bugs.
+    Break Contig was sometimes not recalculating consensus length correctly.
+    Contig Edit was trucating reading name lengths at 10 characters.
+
+Thu Jun 16
+    *Version-1993.0.5*
+    New xbap and bap executables. RS changed assembly in bap so that
+    when entry is not permitted the program asks for the percentage
+    mismatch - this allows display of alignments for all levels of
+    mismatch.
+
+Mon Jun 14 14:54:43 BST 1993
+    *Version-1993.0.4*
+    Bug in xdap. It was compiled with xbap's edUtils.h by mistake.
+
+Fri Jun 11 17:50:13 BST 1993
+    *Version-1993.0.3*
+    Bugs in bap/xbap fixed. New executables included.
+
+Thu Jun  3 13:53:38 BST 1993
+    *Version-1993.0.2*
+    Bugs in bap/xbap fixed. New executables included.
+
+Thu May 20 14:45:38 BST 1993
+    *Version-1993.0.1*
+    Changes to makefiles and Staden_install
+
+Fri Mar  5 11:27:22 GMT 1993
+    *Version-1993.0*
+    Now for DEC Alpha and Solaris
+    bap/xbap now includes double stranding and auto-creation of oligos
+
+Tue Jan 26 11:54:36 GMT 1993
+    *Version-1992.3.1*
+    Bug fixes
+    1. indexseqlibs/genbentryname1.c
+    2. convert bugs + new programs
+
+Mon Nov 23 13:50:39 WET 1992
+    *Version-1992.3*
+    Includes bap/xbap and utility programs
+
+
+Wed Sep 30 11:18:09 BST 1992
+    *Version-1992.2.1*
+    Source changes since last release
+	bug fixes to postscript output, sequence library programs
+    New sun and dec executables
+
+
+Thu Aug 27 15:27:05 BST 1992
+
+    *Version-1992.2*
+    
+
+Mon Jul 27 13:01:37 WET 1992
+
+    *Version-1992.1.3*
+    Miscellaneous bug fixes and enhancements
+    New sun and dec executables
+
+
+Tue Jun 16 16:07:41 BST 1992
+
+    *Version-1992.1.2*
+    Sun sparc executables now linked with cc and not gcc.
+    New makefile-sun files
+    New sources for hitNtrg.c and freetext4.c (indexseqlibs), and
+    tagU2.c (staden)
+
+
+Wed May 27 17:12:36 BST 1992
+
+    *Version-1992.1.1*
+    Inclusion of vep (vector excision program), plus minor changes and bug fixes
+
+
+Tue May 26 11:10:28 WET 1992
+
+    *Version-1992.1*
+    This version includes the port to DEC Ultrix (mips)
+
diff --git a/bin/alfsplit b/bin/alfsplit
new file mode 100644
index 0000000..97f5008
Binary files /dev/null and b/bin/alfsplit differ
diff --git a/bin/bap b/bin/bap
new file mode 100644
index 0000000..2f16f89
Binary files /dev/null and b/bin/bap differ
diff --git a/bin/convert b/bin/convert
new file mode 100644
index 0000000..2f59c33
Binary files /dev/null and b/bin/convert differ
diff --git a/bin/cop b/bin/cop
new file mode 100644
index 0000000..7b2b403
Binary files /dev/null and b/bin/cop differ
diff --git a/bin/cop-bap b/bin/cop-bap
new file mode 100644
index 0000000..48ea21a
Binary files /dev/null and b/bin/cop-bap differ
diff --git a/bin/dap b/bin/dap
new file mode 100644
index 0000000..476dd0a
Binary files /dev/null and b/bin/dap differ
diff --git a/bin/frog b/bin/frog
new file mode 100644
index 0000000..53485da
Binary files /dev/null and b/bin/frog differ
diff --git a/bin/getABISampleName b/bin/getABISampleName
new file mode 100644
index 0000000..17ae99d
Binary files /dev/null and b/bin/getABISampleName differ
diff --git a/bin/gip b/bin/gip
new file mode 100644
index 0000000..6b69ebb
Binary files /dev/null and b/bin/gip differ
diff --git a/bin/lip b/bin/lip
new file mode 100644
index 0000000..92f266c
Binary files /dev/null and b/bin/lip differ
diff --git a/bin/makeSCF b/bin/makeSCF
new file mode 100644
index 0000000..b5c1610
Binary files /dev/null and b/bin/makeSCF differ
diff --git a/bin/mep b/bin/mep
new file mode 100644
index 0000000..0d3ce7b
Binary files /dev/null and b/bin/mep differ
diff --git a/bin/nip b/bin/nip
new file mode 100644
index 0000000..c053e7e
Binary files /dev/null and b/bin/nip differ
diff --git a/bin/nipf b/bin/nipf
new file mode 100644
index 0000000..6fcc502
Binary files /dev/null and b/bin/nipf differ
diff --git a/bin/nipl b/bin/nipl
new file mode 100644
index 0000000..d91fc3d
Binary files /dev/null and b/bin/nipl differ
diff --git a/bin/pip b/bin/pip
new file mode 100644
index 0000000..5bb7464
Binary files /dev/null and b/bin/pip differ
diff --git a/bin/pipl b/bin/pipl
new file mode 100644
index 0000000..c4ab009
Binary files /dev/null and b/bin/pipl differ
diff --git a/bin/rep b/bin/rep
new file mode 100644
index 0000000..0c3775b
Binary files /dev/null and b/bin/rep differ
diff --git a/bin/sap b/bin/sap
new file mode 100644
index 0000000..09515fa
Binary files /dev/null and b/bin/sap differ
diff --git a/bin/sapf b/bin/sapf
new file mode 100644
index 0000000..cd8b574
Binary files /dev/null and b/bin/sapf differ
diff --git a/bin/sethelp b/bin/sethelp
new file mode 100644
index 0000000..858dcdd
Binary files /dev/null and b/bin/sethelp differ
diff --git a/bin/sip b/bin/sip
new file mode 100644
index 0000000..4730591
Binary files /dev/null and b/bin/sip differ
diff --git a/bin/sipl b/bin/sipl
new file mode 100644
index 0000000..f700267
Binary files /dev/null and b/bin/sipl differ
diff --git a/bin/splitp1 b/bin/splitp1
new file mode 100644
index 0000000..359ef70
Binary files /dev/null and b/bin/splitp1 differ
diff --git a/bin/splitp2 b/bin/splitp2
new file mode 100644
index 0000000..f7c6df5
Binary files /dev/null and b/bin/splitp2 differ
diff --git a/bin/splitp3 b/bin/splitp3
new file mode 100644
index 0000000..36b4baf
Binary files /dev/null and b/bin/splitp3 differ
diff --git a/bin/ted b/bin/ted
new file mode 100644
index 0000000..dae6b70
Binary files /dev/null and b/bin/ted differ
diff --git a/bin/trace2seq b/bin/trace2seq
new file mode 100644
index 0000000..b4cb9d0
Binary files /dev/null and b/bin/trace2seq differ
diff --git a/bin/vep b/bin/vep
new file mode 100644
index 0000000..f2e3c81
Binary files /dev/null and b/bin/vep differ
diff --git a/bin/xbap b/bin/xbap
new file mode 100644
index 0000000..70dd0c1
Binary files /dev/null and b/bin/xbap differ
diff --git a/bin/xbap.1 b/bin/xbap.1
new file mode 100644
index 0000000..70dd0c1
Binary files /dev/null and b/bin/xbap.1 differ
diff --git a/bin/xdap b/bin/xdap
new file mode 100644
index 0000000..28ba2bf
Binary files /dev/null and b/bin/xdap differ
diff --git a/bin/xmep b/bin/xmep
new file mode 100644
index 0000000..b58f4f9
Binary files /dev/null and b/bin/xmep differ
diff --git a/bin/xnip b/bin/xnip
new file mode 100644
index 0000000..a5e9550
Binary files /dev/null and b/bin/xnip differ
diff --git a/bin/xpip b/bin/xpip
new file mode 100644
index 0000000..761f4ce
Binary files /dev/null and b/bin/xpip differ
diff --git a/bin/xsap b/bin/xsap
new file mode 100644
index 0000000..0aa8f9b
Binary files /dev/null and b/bin/xsap differ
diff --git a/bin/xsip b/bin/xsip
new file mode 100644
index 0000000..c230988
Binary files /dev/null and b/bin/xsip differ
diff --git a/doc/Converting_Sap_Databases b/doc/Converting_Sap_Databases
new file mode 100644
index 0000000..bfeecea
--- /dev/null
+++ b/doc/Converting_Sap_Databases
@@ -0,0 +1,32 @@
+Converting Sap Databases For Be Used With XDAP         SD  10 July 1991
+=======================================================================
+
+The sequence assembly programmes dap and xdap are based on the programs
+sap and xsap, with major modifications. For a concise summary of the
+new features I refer you to Rodger and my paper, "A sequence assembly
+and editing program for efficient management of large projects"
+(Nucleic Acids Research, in press)
+
+The need for storing extra information in project databases has
+resulted in the creation of two files. For users who wish you use old
+(sap) databases with xdap, additional files must be created to use all
+the new features. The program 'convert_project' does this.  It is
+interactive, and asks you for names of relevant files, version numbers
+etc. Here is a sample program dialogue:
+
+
+    % convert_project
+    Database conversion program
+    Converts *.RD? file to *.TG? and *.CC? files
+
+    Project name ? test
+    Version ? 0
+    Conversion completed.
+
+
+Further, please ensure that the file TAGDB is in your project
+directory. Copies can be found in $STADTABL.  Alternatively ensure that
+the environment TAGDB variable is set to $STADTABL/TAGDB
+
+    setenv TAGDB $STADTABL/TAGDB
+
diff --git a/doc/README b/doc/README
new file mode 100644
index 0000000..26e473d
--- /dev/null
+++ b/doc/README
@@ -0,0 +1,30 @@
+Processing and printing LaTeX sources
+-------------------------------------
+
+Given a source file src.tex, run LaTeX to generate the bibliographic
+references:
+
+	latex src
+
+Now run BibTeX to search the bibliography for them:
+
+	bibtex src
+
+Now run LaTeX twice, first to pick up the references, second to bind
+forward references:
+
+	latex src
+	latex src
+
+This will have generated a src.dvi output file. Now we convert this
+to PostScript:
+
+	dvi2ps src.dvi >src.ps
+
+Now we can print this out:
+
+	lpr src.ps
+
+Most of the above is only necessay if you are building something from
+scratch, but it's best to go through it anyway until you fully
+understand how LaTeX works.
diff --git a/doc/gip-menu.PS b/doc/gip-menu.PS
new file mode 100644
index 0000000..17d7616
--- /dev/null
+++ b/doc/gip-menu.PS
@@ -0,0 +1,131 @@
+%!
+/cm {28.2 mul} def
+/BOXSIZE 2 cm def
+
+/boxcen
+{
+% move to centre of box
+BOXSIZE mul 2 div BOXSIZE 2 div rmoveto
+exch
+% move back by correct amount to ensure letter is in centre of box
+dup stringwidth
+pop 2 div neg % halve & neg x offset
+% y offset appears to be zero! - so use constant 'square' char (eg X)
+(X) stringwidth pop 2 div neg
+} def
+
+/letter
+{
+dup BOXSIZE mul 0 rlineto
+0 BOXSIZE rlineto
+dup BOXSIZE mul neg 0 rlineto
+0 BOXSIZE neg rlineto
+closepath
+gsave
+dup boxcen rmoveto
+show
+stroke
+grestore
+BOXSIZE mul 0 rmoveto
+} def
+
+/nextline {0 BOXSIZE neg rmoveto} def
+
+/line
+{
+gsave
+1 letter
+1 letter
+1 letter
+1 letter
+grestore
+nextline
+} def
+
+/Times-Roman findfont 50 scalefont setfont
+newpath
+5 setlinewidth
+200 650 translate
+0 0 moveto
+%2 setlinecap
+
+gsave
+(A) (G) (C) (T) line
+(3) (4) (1) (2) line
+(B) (H) (D) (V) line
+(M) (N) (K) (L) line
+(-) (X) (Y) (R) line
+(8) (7) (6) (5) line
+/Times-Roman findfont 25 scalefont setfont
+gsave
+(DELETE) 2 letter
+(RESET) 2 letter
+grestore
+nextline
+/Times-Roman findfont 35 scalefont setfont
+gsave
+(STOP) 4 letter
+grestore
+nextline
+gsave
+(START) 4 letter
+grestore
+nextline
+gsave
+(CONFIRM) 4 letter
+grestore
+nextline
+% yukky from here on
+gsave
+0 BOXSIZE rmoveto
+1 cm 0 rlineto stroke
+grestore
+(ORIGIN) dup 4 boxcen rmoveto show pop
+(ORIGIN) stringwidth neg exch neg exch rmoveto
+(X) stringwidth exch 2 div rmoveto
+-5 0 rmoveto
+2 setlinewidth
+-45 21 rlineto
+6 0 rlineto
+-6 0 rmoveto
+0 -6 rlineto
+stroke
+grestore
+2 setlinewidth
+0 BOXSIZE 1.4 mul rmoveto
+6 6 rlineto
+-6 -6 rmoveto
+6 -6 rlineto
+-6 6 rmoveto
+80 0 rlineto
+5 -6 rmoveto
+/Times-Roman findfont 30 scalefont setfont
+(8 cm) show
+5 6 rmoveto
+76 0 rlineto
+-6 6 rlineto
+6 -6 rmoveto
+-6 -6 rlineto
+stroke
+0 0 moveto
+BOXSIZE .4 mul neg BOXSIZE rmoveto
+currentpoint translate
+newpath
+0 0 moveto
+90 rotate
+-6 6 rlineto
+6 -6 rmoveto
+-6 -6 rlineto
+6 6 rmoveto
+-244 0 rlineto
+-84 0 rmoveto
+0 -6 rmoveto
+(20 cm) show
+0 6 rmoveto
+-84 0 rmoveto
+-227 0 rlineto
+6 6 rlineto
+-6 -6 rmoveto
+6 -6 rlineto
+stroke
+showpage
diff --git a/doc/install.PS b/doc/install.PS
new file mode 100644
index 0000000..0785781
--- /dev/null
+++ b/doc/install.PS
@@ -0,0 +1,2426 @@
+%!              for use by dvi2ps Version 2.00
+% $Header: tex.ps,v 2.0 88/06/07 15:12:32 peterd Rel2 $
+% a start (Ha!) at a TeX mode for PostScript.
+% The following defines procedures assumed and used by program "dvi2ps"
+% and must be downloaded or sent as a header file for all TeX jobs.
+
+% By:  Neal Holtz, Carleton University, Ottawa, Canada
+%      <holtz@cascade.carleton.cdn>
+%      <holtz%cascade.carleton.cdn@ubc.csnet>
+%      June, 1985
+%      Last Modified: Aug 25/85
+% oystr 12-Feb-1986
+%   Changed @dc macro to check for a badly formed bits in character
+%   definitions.  Can get a <> bit map if a character is not actually
+%   in the font file.  This is absolutely guaranteed to drive the
+%   printer nuts - it will appear that you can no longer define a
+%   new font, although the built-ins will still be there.
+% mackay 4-Jan-1988
+%   Changed size of character array to reflect gf usage (256 characters)
+
+% To convert this file into a downloaded file instead of a header
+% file, uncomment all of the lines beginning with %-%
+
+%-%0000000 			% Server loop exit password
+%-%serverdict begin exitserver
+%-%  systemdict /statusdict known
+%-%  {statusdict begin 9 0 3 setsccinteractive /waittimeout 300 def end}
+%-% if
+
+/TeXDict 200 dict def   % define a working dictionary
+TeXDict begin           % start using it.
+
+                        % units are in "dots" (300/inch)
+/Resolution 300 def
+/Inch  {Resolution mul} def  % converts inches to internal units
+
+/Mtrx 6 array def
+
+%%%%%%%%%%%%%%%%%%%%% Page setup (user) options %%%%%%%%%%%%%%%%%%%%%%%%
+
+% dvi2ps will output coordinates in the TeX system ([0,0] 1" down and in
+% from top left, with y +ive downward).  The default PostScript system
+% is [0,0] at bottom left, y +ive up.  The Many Matrix Machinations in
+% the following code are an attempt to reconcile that. The intent is to
+% specify the scaling as 1 and have only translations in the matrix to
+% properly position the text.  Caution: the default device matrices are
+% *not* the same in all PostScript devices; that should not matter in most 
+% of the code below (except for lanscape mode -- in that, rotations of
+% -90 degrees resulted in the the rotation matrix [ e 1 ]
+%                                                 [ 1 e ]
+% where the "e"s were almost exactly but not quite unlike zeros.
+
+/@letter
+  { letter initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    310 -3005 translate      % move origin to top (these are not exactly 1"
+    Mtrx currentmatrix pop   % and -10" because margins aren't set exactly right)
+  } def
+        % note mode is like letter, except it uses less VM
+/@note
+  { note initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    310 -3005 translate                      % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@landscape
+  { letter initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+%    -90 rotate                              % it would be nice to be able to do this
+    Mtrx currentmatrix 0 0.0 put             % but instead we have to do things like this because what
+    Mtrx 1 -1.0 put                          % should be zero terms aren't (and text comes out wobbly)
+    Mtrx 2 1.0 put                           % Fie!  This likely will not work on QMS printers
+    Mtrx 3 0.0 put                           % (nor on others where the device matrix is not like
+    Mtrx  setmatrix                          %  like it is on the LaserWriter).
+    300 310  translate                       % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@legal
+  { legal initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    295 -3880 translate                      % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@manualfeed
+   { statusdict /manualfeed true put
+     statusdict /manualfeedtimeout 300 put  % 5 minutes
+   } def
+        % n @copies -   set number of copies
+/@copies
+   { /#copies exch def
+   } def
+
+%%%%%%%%%%%%%%%%%%%% Procedure Defintions %%%%%%%%%%%%%%%%%%%%%%%%%%
+
+/@newfont       % id @newfont -         -- initialize a new font dictionary
+  { /newname exch def
+    pop
+    newname 7 dict def          % allocate new font dictionary
+    newname load begin
+        /FontType 3 def
+        /FontMatrix [1 0 0 -1 0 0] def
+        /FontBBox [0 0 1 1] def
+% mackay 4-Jan-1987 changed size of array from 128 to 256 for gf fonts
+        /BitMaps 256 array def
+        /BuildChar {CharBuilder} def
+        /Encoding 256 array def
+        0 1 255 {Encoding exch /.undef put} for
+        end
+    newname newname load definefont pop
+  } def
+
+
+% the following is the only character builder we need.  it looks up the
+% char data in the BitMaps array, and paints the character if possible.
+% char data  -- a bitmap descriptor -- is an array of length 6, of 
+%          which the various slots are:
+
+/ch-image {ch-data 0 get} def   % the hex string image
+/ch-width {ch-data 1 get} def   % the number of pixels across
+/ch-height {ch-data 2 get} def  % the number of pixels tall
+/ch-xoff  {ch-data 3 get} def   % number of pixels below origin
+/ch-yoff  {ch-data 4 get} def   % number of pixels to left of origin
+/ch-tfmw  {ch-data 5 get} def   % spacing to next character
+
+/CharBuilder    % fontdict ch Charbuilder -     -- image one character
+  { /ch-code exch def           % save the char code
+    /font-dict exch def         % and the font dict.
+    /ch-data font-dict /BitMaps get ch-code get def     % get the bitmap descriptor for char
+    ch-data null eq not
+      { ch-tfmw   0   ch-xoff neg   ch-yoff neg   ch-width ch-xoff sub   ch-height ch-yoff sub
+            setcachedevice
+        ch-width ch-height true [1 0  0 1  ch-xoff ch-yoff]
+            {ch-image} imagemask
+      }
+    if
+  } def
+
+
+/@sf            % fontdict @sf -        -- make that the current font
+  { setfont() pop
+  } def
+
+                % in the following, the font-cacheing mechanism requires that
+                % a name unique in the particular font be generated
+
+/@dc            % char-data ch @dc -    -- define a new character bitmap in current font
+  { /ch-code exch def
+% ++oystr 12-Feb-86++
+    dup 0 get
+    length 2 lt
+      { pop [ <00> 1 1 0 0 8.00 ] } % replace <> with null
+    if
+% --oystr 12-Feb-86--
+    /ch-data exch def
+    currentfont /BitMaps get ch-code ch-data put
+    currentfont /Encoding get ch-code 
+       dup (   ) cvs cvn   % generate a unique name simply from the character code
+       put
+  } def
+
+/@bop0           % n @bop0 -              -- begin the char def section of a new page
+  { 
+  } def
+
+/@bop1           % n @bop1 -              -- begin a brand new page
+  { pop
+    erasepage initgraphics 
+    Mtrx setmatrix
+    /SaveImage save def() pop
+  } def
+
+%--  tjh sept. 87: if this page has a mac drawing on it, we have to
+%--  use showpage in the md dictionary.
+/@eop           % - @eop -              -- end a page
+  {
+    userdict /md known {
+	userdict /md get type /dicttype eq {
+	    md /MacDrwgs known {
+		md begin showpage end
+	    }{
+		showpage
+	    } ifelse
+	}{
+	    showpage
+	} ifelse
+    }{
+	showpage
+    } ifelse
+    SaveImage restore() pop
+  } def
+
+/@start         % - @start -            -- start everything
+  { @letter                             % (there is not much to do)
+  } def
+
+/@end           % - @end -              -- done the whole shebang
+  { end
+  } def
+
+/p              % x y p -               -- move to position
+  { moveto
+  } def
+
+/r              % x r -                 -- move right
+  { 0 rmoveto
+  } def
+
+/s              % string s -            -- show the string
+  { show
+  } def
+
+/c              % ch c -                -- show the character (code given)
+  { c-string exch 0 exch put
+    c-string show
+  } def
+
+/c-string ( ) def
+
+/ru             % dx dy ru -   -- set a rule (rectangle)
+  { /dy exch neg def    % because dy is height up from bottom
+    /dx exch def
+    /x currentpoint /y exch def def   % remember current point
+    newpath x y moveto
+    dx 0 rlineto
+    0 dy rlineto
+    dx neg 0 rlineto
+    closepath fill
+    x y moveto
+  } def
+
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%%     the \special command junk
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%   The structure of the PostScript produced by dvi2ps for \special is:
+%         @beginspecial
+%           - any number of @hsize, @hoffset, @hscale, etc., commands
+%         @setspecial
+%           - the users file of PostScript commands
+%         @endspecial
+
+% The @beginspecial command recognizes whether the Macintosh Laserprep
+% has been loaded or not, and redfines some Mac commands if so.
+% The @setspecial handles the users shifting, scaling, clipping commands
+
+%-- tjh sept. 87: made changes to allow postscript and macdrawing to
+%-- to be inserted with version 65 of the md dictionary.  Many bugs
+%-- were fixed:
+%--	vo changed to vof, name conflict with md
+%--	vs changed to vsz, name conflict with md
+%--	substantially changed @setspecial and @MacSetUp
+%-- Also, made changes to allow users to specify offsets
+%-- and clip rectangles in inches.
+
+% The following are user settable options from the \special command.
+
+/@SpecialDefaults
+  { /hs 8.5 72 mul def
+    /vsz 11 72 mul def
+    /ho 0 def
+    /vof 0 def
+    /hsc 1 def
+    /vsc 1 def
+    /CLIP false def
+  } def
+
+%       d @hsize -      specify a horizontal clipping dimension
+%                       these 2 are executed before the MacDraw initializations
+/@hsize {72 mul /hs exch def /CLIP true def} def
+/@vsize {72 mul /vsz exch def /CLIP true def} def
+
+%       d @hoffset -    specify a shift for the drwgs
+/@hoffset {72 mul /ho exch def} def
+/@voffset {72 mul /vof exch def} def
+
+%       s @hscale -     set scale factor
+/@hscale {/hsc exch def} def
+/@vscale {/vsc exch def} def
+
+/@setclipper
+  { hsc vsc scale
+    CLIP
+      { newpath 0 0 moveto hs 0 rlineto 0 vsz rlineto hs neg 0 rlineto closepath clip }
+    if
+  } def
+
+% this will be invoked as the result of a \special command (for the
+% inclusion of PostScript graphics).  The basic idea is to change all
+% scaling and graphics back to defaults, but to shift the origin
+% to the current position on the page.  Due to TeXnical difficulties,
+% we only set the y-origin.  The x-origin is set at the left edge of
+% the page.
+
+/@beginspecial
+  { gsave /SpecialSave save def
+          % the following magic incantation establishes the current point as
+          % the users origin, and reverts back to default scalings, rotations
+    currentpoint transform initgraphics itransform translate
+    @SpecialDefaults    % setup default offsets, scales, sizes
+    @MacSetUp           % fix up Mac stuff 
+  } def
+
+
+%--  tjh: assume this is raw postscript, but save some state in case its not.
+/@setspecial
+  {
+    /specmtrx matrix currentmatrix def
+    ho vof translate @setclipper
+  } def
+
+
+/@endspecial
+  { SpecialSave restore
+    grestore
+  } def
+
+
+% - @MacSetUp -   turn-off/fix-up all the MacDraw stuff that might hurt us
+                % we depend on 'psu' being the first procedure executed
+                % by a Mac document.  We redefine 'psu' to adjust page
+                % translations, and to do all other the fixups required.
+                % This stuff will not harm other included PS files
+/@MacSetUp
+  { userdict /md known  % if md is defined
+      { userdict /md get type /dicttype eq      % and if it is a dictionary
+         {
+           md begin                             % then redefine some stuff
+              /psu                              % redfine psu to set origins, etc.
+                /psu load
+                        % this procedure contains almost all the fixup code
+                {
+%		  /letter {} def    % it is bad manners to execute the real
+%		  /note {} def      %  versions of these (clears page image, etc.)
+%		  /legal {} def
+		  /MacDrwgs true def
+		  specmtrx setmatrix % restore pre-@setspecial state.
+		  initclip	     % ditto
+		  % change smalls to prevent page clearing.
+		  /smalls [ lnop lnop lnop lnop lnop lnop lnop lnop lnop ] def
+		  0 0 0 0 ppr astore pop  % prevents origin translation.
+		  % redifine cp, do the showpage later, see @eop
+                  /cp {
+			pop
+			pop
+			pm restore
+		  } def 		 	% no printing of pages
+                }
+                concatprocs 
+              def
+              /od
+		% redefine od to translate and scale.
+		% redfine load to set clipping region.
+                /od load
+                {
+		  ho vof translate
+		  hsc vsc scale
+		  CLIP {
+		      /nc
+			/nc load
+			{ newpath 0 0 moveto hs 0 rlineto 0 vsz rlineto
+			hs neg 0 rlineto closepath clip }
+			concatprocs
+		      def
+		  } if
+		}
+                concatprocs
+              def
+           end }
+        if }
+    if    
+  } def
+
+%       p1 p2 concatprocs p       - concatenate procedures
+/concatprocs
+  { /p2 exch cvlit def 
+    /p1 exch cvlit def
+    /p p1 length p2 length add array def
+    p 0 p1 putinterval 
+    p p1 length p2 putinterval
+    p cvx
+  } def
+
+end                     % revert to previous dictionary
+TeXDict begin @start
+%%Title: install.dvi
+%%Creator: dvi2ps
+%%EndProlog
+1 @bop0
+[ 300 ] /cmr17.300 @newfont
+cmr17.300 @sf
+[<FFFF80FFFF8007F00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E0
+  0003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003
+  E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00007F000FFFF80FFFF80> 24 49 -3 0 23.499] 73 @dc
+[<FFFC1FFF80FFFC1FFF800FC001F800078000F000078000F000078000F000078000F000078000F000078000F000078000F000
+  078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000
+  078000F000078000F000078000F00007C000F00007C000F00007A000F00007A001E0000F9001E000FF8C03C000FF83078000
+  0780FE0000> 40 31 -2 0 36.644] 110 @dc
+[<80FE00C301C0CC0060F00030F00038E00018E0001CC0001CC0001C80001C80003C80003C0000F80001F8003FF003FFE00FFF
+  C01FFF003FF0007E0000F80000F00010E00010E00010E00010E000306000303000701800F00E033001FC10> 24 31 -2 0 25.776] 115 @dc
+[<001F000078C000E04001E02001C02003C01003C01003C01003C01003C01003C01003C01003C01003C00003C00003C00003C0
+  0003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C000FFFFE01FFFE00FC00007C00003
+  C00001C00001C00000C00000C00000C000004000004000004000004000004000> 24 44 -1 0 25.402] 116 @dc
+[<03FC03E00F0307F03E008F087C005E0478003E04F8003E04F8003E04F8001E04F8001E04F8001E047C001E003C001E003E00
+  1E001F001E000F801E0003E01E0000FC1E00000FFE0000001E0000001E0000001E0008001E003E001E003E001E003E001C00
+  3C003C0010003800100070000C00E0000303C00000FE0000> 32 31 -3 0 32.896] 97 @dc
+[<FFFCFFFC0FC00780078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  0780078007800780078007800780078007800780078007800780078007800780078007800780078007800F80FF80FF800780> 16 50 -2 0 17.907] 108 @dc
+[<FFF8FFF80FC00780078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  078007800F80FF80FF8007800000000000000000000000000000000000000000000007000F801F801F800F800700> 16 48 -2 0 17.907] 105 @dc
+[<003FE00001C01C00070007001C0001C0380000E07000007070000070E0000038E0000038E0000038E0000038E00000387000
+  0070300000F0180001E00E000FC003FFFF8007FFFF000FFFF8000E0000001C00000018000000180000001800000018000000
+  18000000087F000009C1C0000780E000070070000F0078001E003C001E003C003E003E003E003E003E003E003E003E003E00
+  3E003E003E001E003C001E003C000F007800070070080380E81C01C1C41C007F0308000000F0> 32 47 -2 15 32.896] 103 @dc
+[<FFFC1FFF80FFFC1FFF800FC001F800078000F000078000F000078000F000078000F000078000F000078000F000078000F000
+  078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000078000F000
+  078000F000078000F000078000F00007C000F00007C000F00007A000F00007A001E000079001E000078C03C0000783078000
+  0780FE0000078000000007800000000780000000078000000007800000000780000000078000000007800000000780000000
+  0780000000078000000007800000000780000000078000000007800000000F80000000FF80000000FF800000000780000000> 40 50 -2 0 36.644] 104 @dc
+[<001FC00000F0300001C00C00078002000F0002000E0001001E0000803C0000803C0000007C00000078000000F8000000F800
+  0000F8000000F8000000F8000000F8000000FFFFFF80F8000780F80007807800078078000F807C000F003C000F001C000F00
+  1E001E000E001E0007003C000380380000E0E000003F8000> 32 31 -2 0 29.149] 101 @dc
+[<800FF000807FFC00C1F01E00C7000700EC000380F80001C0F00000E0E00000E0E00000F0C0000070C0000078800000788000
+  0078800000788000007880000078000000F8000000F8000000F0000001F0000003F0000007E000001FE00000FFC0000FFF80
+  00FFFF0003FFFC0007FFF8000FFF80001FF800003FC000003F0000007E0000007C000000F8000000F8000020F0000020F000
+  0020F0000020F0000060F000006070000060700000E0780000E0380001E03C0003E01E0006E00F001C6007C0786001FFE020
+  007F8020> 32 51 -4 1 36.644] 83 @dc
+[<003F81FF0000E061FF00038011F000070009E0000E0005E0001E0003E0001C0001E0003C0001E0003C0001E000780001E000
+  780001E000F80001E000F80001E000F80001E000F80001E000F80001E000F80001E000F80001E000F80001E000F80001E000
+  780001E0007C0001E0003C0001E0003C0001E0001E0001E0000E0003E0000F0003E000078005E00001C019E00000F061E000
+  001F81E000000001E000000001E000000001E000000001E000000001E000000001E000000001E000000001E000000001E000
+  000001E000000001E000000001E000000001E000000001E000000001E000000003E00000003FE00000003FE000000001E000> 40 50 -3 0 36.644] 100 @dc
+[<FFFF800000FFFF80000007F000000003E000000003E000000003E000000003E000000003E000000003E000000003E0000000
+  03E000000003E000000003E000000003E000000003E000000003E000000003E000000003E000000003E000000003E0000000
+  03E000000003E000000003E000000003E000000003FFFFF00003E0007C0003E0001F0003E000078003E00003C003E00003E0
+  03E00001E003E00001F003E00000F003E00000F803E00000F803E00000F803E00000F803E00000F803E00000F803E00000F0
+  03E00001F003E00001F003E00001E003E00003C003E000078003E0001F0007E0007E00FFFFFFF800FFFFFFC000> 40 49 -4 0 45.061] 80 @dc
+[<003F800000E0600003801800070004000F0002001E0002001E0001003C0001007C0000007C00000078000000F8000000F800
+  0000F8000000F8000000F8000000F8000000F8000000F8000000F8000000780000007C0008007C003E003C003E001C003E00
+  1E001E000F000400070004000380180000E06000003F8000> 32 31 -3 0 29.149] 99 @dc
+[<FFFC0FFF00FFFC0FFF000FC007F800078003E000078003C000078007C00007800F800007800F000007801F000007803E0000
+  07803C000007807C000007807800000780F0000007C1F0000007A1E000000793C00000078FC0000007878000000783800000
+  07810000000780800000078060000007801000000780080000078004000007800600000780078000078007E00007801FFC00
+  07801FFC00078000000007800000000780000000078000000007800000000780000000078000000007800000000780000000
+  0780000000078000000007800000000780000000078000000007800000000F80000000FF80000000FF800000000780000000> 40 50 -2 0 34.770] 107 @dc
+[ 300 ] /cmr12.300 @newfont
+cmr12.300 @sf
+[<81FC00C60700C80180F000C0E000C0C00060C000608000708000708000708000700000700000F00000F00001E00007E0003F
+  C003FF800FFF001FFE003FF0007F0000780000F00000F00000E00020E00020E00020E00060E000606000607000E03001E018
+  02600C0C6003F020> 24 36 -3 1 27.097] 83 @dc
+[<FFC00E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E001E00FE000E000000000000000000
+  00000000000000001C001E003E001E001C00> 16 34 -1 0 13.548] 105 @dc
+[<FFE3FF8FFE0E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E0
+  0E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00F003C00E00F003C00E01E807201C0FE60E18380
+  0E1FC07F00> 40 21 -1 0 40.645] 109 @dc
+[<01FC000707000E03801C01C03800E07800F0700070F00078F00078F00078F00078F00078F00078F000787000707000703800
+  E01800C00C018007070001FC00> 24 21 -1 0 24.387] 111 @dc
+[<FFE7FF000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E00
+  70000E0070000E0070000E0070000F0070000F0070001E80E000FE60C0000E1F8000> 32 21 -1 0 27.097] 110 @dc
+[<FFFFF8000F801E0007800700078003C0078001E0078000E0078000F007800078078000780780007C0780003C0780003C0780
+  003E0780003E0780003E0780003E0780003E0780003E0780003E0780003E0780003E0780003C0780003C0780003C0780007C
+  0780007807800078078000F0078000E0078001C0078003C0078007000F801E00FFFFF000> 32 34 -2 0 37.249] 68 @dc
+[<00FC000703000E00801C0040380020780020700000F00000F00000F00000F00000F00000FFFFE0F000E07000E07801E03801
+  C01C01C00C038007070001FC00> 24 21 -1 0 21.677] 101 @dc
+[<0FC1E03C2390781708F00F08F00708F00708F007087007007807003C07001E070007C70000FF000007000007000007001807
+  003C0E003C0C001838000FE000> 24 21 -2 0 24.387] 97 @dc
+[<FFF0000F00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000F00000F06
+  000F0F001E8F00FE46000E3C00> 24 21 -1 0 18.968] 114 @dc
+[<FFFFC07FFFC03FFFC030004018006008002004002002002001000001800000C000006000003000001800001C00000E000007
+  000007800003C00003C00003E02003E0F801E0F801E0F801E0F003E08003E04003C04003C02007801007000C1C0003F000> 24 33 -2 0 24.387] 50 @dc
+[<FFFE07C003800380038003800380038003800380038003800380038003800380038003800380038003800380038003800380
+  0380038003800380F3800F8003800080> 16 33 -4 0 24.387] 49 @dc
+[<FFE0203FFF001F007003F0000E007001E00004007001E0000400F801E0000400F801E0000400F801E0000401E401E0000401
+  E401E0000401E401E0000403C201E0000403C201E00004078101E00004078101E00004078101E000040F0081E000040F0081
+  E000040F0081E000041E0041E000041E0041E000043C0021E000043C0021E000043C0021E00004780011E00004780011E000
+  04780011E00004F00009E00004F00009E00004F00009E00005E00005E00005E00005E00007C00003E0000FC00003F000FFC0
+  0003FF00> 48 34 -2 0 44.692] 77 @dc
+[<3C0000430000F18000F08000F0400000400000200000200000200000100000100000380000380000380000740000740000E2
+  0000E20000E20001C10001C1000380800380800380800700400700400E00200E00200E00301E0078FFC1FE> 24 31 -1 10 25.742] 121 @dc
+[<0FC000103000201800700C007806007807003003000003800003800001C00001C00001C003E1E00619E00C05E01805E03803
+  E07003E07001E0F001E0F001E0F001E0F001E0F001E0F001C0F001C0F001C07003807003803803801807000C0600060C0001
+  F000> 24 34 -2 1 24.387] 57 @dc
+[<03F0000C1C00100F002007804007804003C0F003C0F803E0F803E07003E02003E00003E00003C00003C0000780000780000F
+  00001C0003F000003800000E00000F000007000007800007803807C07807C07803C07807C04007C02007801007000C1E0003
+  F800> 24 34 -2 1 24.387] 51 @dc
+[ 432 ] /cmbx10.432 @newfont
+cmbx10.432 @sf
+[<7FFFFE7FFFFE7FFFFE00FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE
+  0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE0000FE00F8FE00FF
+  FE00FFFE0007FE00007E00001E00000E00> 24 39 -5 0 34.370] 49 @dc
+[<FFFFFCFFFFFCFFFFFC01FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE
+  0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001FE0001
+  FE0001FE0001FE0001FE0001FE00FFFFFCFFFFFCFFFFFC> 24 41 -2 0 26.068] 73 @dc
+[<FFFE3FFF80FFFE3FFF80FFFE3FFF800FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F800
+  0FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FF003F800
+  0FF003F8000FD803F8000FD803F0000FCE03F000FFC787E000FFC1FFC000FFC07F0000> 40 27 -3 0 38.189] 110 @dc
+[<001F8000FFC001F86003F87003F03807F03807F03807F03807F03807F03807F03807F00007F00007F00007F00007F00007F0
+  0007F00007F00007F00007F00007F00007F00007F000FFFFF0FFFFF01FFFF007F00003F00003F00001F00000F00000F00000
+  F000007000007000007000007000> 24 38 -1 0 26.732] 116 @dc
+[<FFFF0000FFFF0000FFFF00000FE000000FE000000FE000000FE000000FE000000FE000000FE000000FE000000FE000000FE0
+  00000FE000000FE000000FE000000FE000000FE000000FE000000FF01C000FF03E000FF07F000FD87F000FD87F00FFCE3E00
+  FFC7FC00FFC1F000> 32 27 -2 0 28.310] 114 @dc
+[<003FE00001FFFC0007F07F000FC01F801F800FC03F800FE03F800FE07F0007F07F0007F0FF0007F8FF0007F8FF0007F8FF00
+  07F8FF0007F8FF0007F8FF0007F8FF0007F87F0007F07F0007F07F0007F03F0007E03F800FE01F800FC00FC01F8003F07E00
+  01FFFC00003FE000> 32 27 -2 0 34.370] 111 @dc
+[<003FC3FF8000FFF3FF8003F03BFF8007C00FF8000F8007F8001F8003F8003F8003F8007F0003F8007F0003F8007F0003F800
+  FF0003F800FF0003F800FF0003F800FF0003F800FF0003F800FF0003F800FF0003F8007F0003F8007F0003F8007F0003F800
+  3F8003F8001F8003F8000FC007F80007E00FF80003F03FF80000FFFBF800001FE3F800000003F800000003F800000003F800
+  000003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F80000003FF800
+  00003FF80000003FF800> 40 42 -2 0 38.189] 100 @dc
+[<003FC3FF8001FFF3FF8003F03BFF8007E00FF80007E007F8000FE007F8000FE003F8000FE003F8000FE003F8000FE003F800
+  0FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F800
+  0FE003F8000FE003F8000FE003F8000FE003F800FFE03FF800FFE03FF800FFE03FF800> 40 27 -3 0 38.189] 117 @dc
+[<001FE00000FFFC0003F01E0007E007000FC003801F8001C03F8001C07F8000007F0000007F000000FF000000FF000000FF00
+  0000FF000000FF000000FF000000FF0000007F0000007F0000007F800E003F801F001F803F800FC03F8007E03F8003F01F00
+  00FFFE00001FF800> 32 27 -2 0 30.551] 99 @dc
+[<FFFEFFFEFFFE0FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE0FFE0
+  FFE0FFE0000000000000000000000000000007000FC01FE03FE03FE03FE01FE00FC00700> 16 43 -3 0 19.094] 105 @dc
+[ 329 ] /cmr10.329 @newfont
+cmr10.329 @sf
+[<001F800000F0F00001C0380007801E000F000F000E0007001E0007803C0003C03C0003C07C0003E07C0003E0780001E0F800
+  01F0F80001F0F80001F0F80001F0F80001F0F80001F0F80001F0F80001F0F80001F0780001E0780001E07C0003E03C0003C0
+  3C0003C01E0007800E0007000F000F0007801E0001C0380000F0F000001F8000> 32 33 -3 1 35.353] 79 @dc
+[<FFE7FC0E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00F01C00F01
+  C00E8180FE43000E3E00> 24 20 -1 0 25.252] 110 @dc
+[<01E0031006100E080E080E080E080E080E000E000E000E000E000E000E000E000E000E000E00FFF83E000E000E0006000600
+  020002000200> 16 28 -1 0 17.676] 116 @dc
+[<FFE7FC0E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00F01C00F01
+  C00E81800E43000E3E000E00000E00000E00000E00000E00000E00000E00000E00000E00000E0000FE00000E0000> 24 32 -1 0 25.252] 104 @dc
+[<01F8000706000C0100180080380080700000700000F00000F00000F00000FFFF80F00380F003807003807007003807003807
+  001C0E000E1C0003F000> 24 20 -1 0 20.202] 101 @dc
+[<0F83C0386720781E10F01E10F00E10F00E10F00E10780E00380E001E0E00078E0000FE00000E00000E00000E00300E00781C
+  007818003030001FE000> 24 20 -2 0 22.727] 97 @dc
+[<03F0000E0C001C0200380100380100700000700000F00000F00000F00000F00000F00000F00000700000700000380C00381E
+  001C1E000E0C0003F800> 24 20 -2 0 20.202] 99 @dc
+[<01F800070E001C03803801C03801C07000E07000E0F000F0F000F0F000F0F000F0F000F0F000F07000E07000E03801C03801
+  C01C0380070E0001F800> 24 20 -1 0 22.727] 111 @dc
+[<FFE7FE7FE00E00E00E000E00E00E000E00E00E000E00E00E000E00E00E000E00E00E000E00E00E000E00E00E000E00E00E00
+  0E00E00E000E00E00E000E00E00E000E00E00E000E00E00E000F00F00E000F00F00E000E81C81C00FE618618000E1F01F000> 40 20 -1 0 37.878] 109 @dc
+[<FFE0000E00000E00000E00000E00000E00000E00000E00000E00000E3E000EC3000F01C00F01E00E00E00E00F00E00700E00
+  780E00780E00780E00780E00780E00780E00700E00F00E00E00F00E00F01C0FEC3800E3E00> 24 29 -1 9 25.252] 112 @dc
+[<3C0000620000F10000F08000F0800000400000400000400000200000200000700000700000700000E80000E80001EC0001C4
+  0001C4000382000382000382000701000701000E00800E00800E00801C00C01E01E0FF83F8> 24 29 -1 9 23.989] 121 @dc
+[<FFC00E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E007E000E0000000000000000000000
+  00001C001E003E001E001C00> 16 31 0 0 12.626] 105 @dc
+[<03FC001C03803000C0600060C00030C00030C00030C000306000703001E00FFFC01FFF803FFE003000003000002000002000
+  0033E0001E38001C1C00380E00780F00780F00780F00780F00780F00380E001C1C300E3C3003E3300000E0> 24 31 -1 10 22.727] 103 @dc
+[<01F1FC030DC00603C00E03C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01C00E01
+  C00E01C0FE1FC00E01C0> 24 20 -1 0 25.252] 117 @dc
+[<004008000060180000E01C0000E01C0000F03C0001D03A0001D0320003C8730003887100038861000704E0800704C0800707
+  C0800E03C0400E0380400E0380401C0380201C0300603C078070FF9FE1FC> 32 20 -1 0 32.828] 119 @dc
+[<FFE00E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E00
+  0E000E000E000E000E00FE000E00> 16 32 0 0 12.626] 108 @dc
+[<7FC3FE0700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700
+  E00700E00700E0FFFFE00700000700000700000700000700000700000701E00701E00381E001C0C000E0C0003F00> 24 32 0 0 25.252] 12 @dc
+[<03E3F80E1B801C0780380780380380700380700380F00380F00380F00380F00380F00380F003807003807003803803803803
+  801C0780061B8003E380000380000380000380000380000380000380000380000380000380000380003F80000380> 24 32 -2 0 25.252] 100 @dc
+[<FF07FE1E01E00601E00201C0030380010700008700004E00007C00003C0000380000780000EC0000E40001C2000381000701
+  800701C00F01E07FC3FC> 24 20 0 0 23.989] 120 @dc
+[<083E000CC3000D01C00F00E00E00E00E00700E00700E00780E00780E00780E00780E00780E00780E00700E00700E00E00F00
+  E00F01C00EC3800E3E000E00000E00000E00000E00000E00000E00000E00000E00000E00000E0000FE00000E0000> 24 32 -1 0 25.252] 98 @dc
+[<8F80D060E030C018C01880188018803800700FF03FE07F807800E000C010C010C010403030701F90> 16 20 -2 0 17.929] 115 @dc
+[<7FF0000700000700000700000700000700000700000700000700000700000700000700000700000700000700000700000700
+  00070000070000FFF000070000070000070000070000070000070000070000070600038F00018F0000C600007C00> 24 32 0 0 13.889] 102 @dc
+[<FFE00E000E000E000E000E000E000E000E000E000E000E000E000E000E000F0C0F1E0F1EFE8C0E78> 16 20 -1 0 17.803] 114 @dc
+[<81F800CE0C00F00600E00300C00380C001808001C08001C08001C08001C00001C00003C00003C0000780001F8003FF000FFE
+  001FFC003FF0007F0000780000F00000F00000E00080E00080E00080E001806001806001803003801007800C198007E080> 24 33 -3 1 25.252] 83 @dc
+[<00FFE0000E00000E00000E00000E00000E00000E00000E00FFFFF0C00E00400E00200E00200E00100E00080E00080E00040E
+  00020E00020E00010E00008E00008E00004E00002E00002E00001E00000E00000E00000600000600> 24 30 -1 0 22.727] 52 @dc
+[<70F8F8F870> 8 5 -4 0 12.626] 46 @dc
+[<40201010080804040474FCFCF870> 8 14 -4 9 12.626] 44 @dc
+[<FFFF807FFF803FFF801000801800400C004006004003000001800000C000006000007000003800001C00000E00000F000007
+  800007800007C02007C0F803C0F803C0F803C0F007C0800780400780200700100E000C1C0003F000> 24 30 -2 0 22.727] 50 @dc
+[<FFFFE0000F803C0007800E000780070007800380078003C0078001E0078001E0078000F0078000F0078000F0078000F80780
+  00F8078000F8078000F8078000F8078000F8078000F8078000F8078000F8078000F0078000F0078001F0078001E0078001E0
+  078003C0078003800780070007801E000F803C00FFFFE000> 32 31 -2 0 34.721] 68 @dc
+[<FFFFFF800F800F8007800180078000C0078000C0078000400780004007800040078000200780002007800020078080000780
+  800007808000078080000781800007FF80000781800007808000078080000780800007808080078000800780008007800080
+  078001800780010007800300078003000F800F00FFFFFF00> 32 31 -2 0 30.934] 69 @dc
+[<000FC0000070380001C0040003800200070001000E0000801E0000801C0000403C0000407C0000407C00004078000000F800
+  0000F8000000F8000000F8000000F8000000F8000000F8000000F8000000F8000000780000407C0000407C0000403C0000C0
+  1C0000C01E0000C00E0001C0070003C0038005C001C009C0007030C0000FC040> 32 33 -3 1 32.828] 67 @dc
+[<000FC000003820000070180000E0080001C0040001C002000380020003800200078001000780010007800100078001000780
+  0100078001000780010007800100078001000780010007800100078001000780010007800100078001000780010007800100
+  07800100078001000780010007800100078003800FC007C0FFFC3FF8> 32 32 -2 1 34.090] 85 @dc
+[<FFFE00000FC00000078000000780000007800000078000000780000007800000078000000780000007808000078080000780
+  8000078080000781800007FF8000078180000780800007808000078080000780800007800080078000800780008007800080
+  078001800780010007800300078003000F800F00FFFFFF00> 32 31 -2 0 29.671] 70 @dc
+[<C00000C00000C000006000006000006000003000003000003000001800001800001800000C00000C00000C00000600000600
+  0006000003000003000003000001800001800001800000C00000C00000C00000600000600000600000300000300000300000
+  1800001800001800000C00000C00000C00000600000600000600000300000300000100> 24 45 -3 11 22.727] 47 @dc
+[<FFFE07C003800380038003800380038003800380038003800380038003800380038003800380038003800380038003800380
+  0380F3800F8003800180> 16 30 -4 0 22.727] 49 @dc
+[<000FE0000078182000E00460038002E0070001E00F0001E01E0001E01E0001E03C0001E03C0001E07C0001E0780001E0F800
+  03E0F8007FFCF8000000F8000000F8000000F8000000F8000000F8000000F8000000780000207C0000203C0000203C000060
+  1E0000601E0000600F0000E0070001E0038002E000E004E000781860000FE020> 32 33 -3 1 35.668] 71 @dc
+[<FFFC0FC007800780078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  07800780078007800FC0FFFC> 16 31 -1 0 16.414] 73 @dc
+[<FFC03FFE1F0007E00E0003C0040003C002000780020007800200078001000F0001000F0001000F0000FFFE0000801E000080
+  1E0000403C0000403C0000403C000020780000207800002078000010F0000010F0000010F0000009E0000009E0000009E000
+  0007C0000007C0000007C00000038000000380000003800000010000> 32 32 -1 0 34.090] 65 @dc
+[<FFCFF80E03E00E03C00E03800E07800E07000E0E000E1E000E1C000F38000EF8000E70000E30000E10000E08000E04000E02
+  000E03000E03C00E0FF00E00000E00000E00000E00000E00000E00000E00000E00000E00000E0000FE00000E0000> 24 32 -1 0 23.989] 107 @dc
+[<FFFC00000FC00000078000000780000007800000078000000780000007800000078000000780000007800000078000000780
+  0000078000000780000007FFE0000780780007801C0007801E0007800F0007800F8007800F8007800F8007800F8007800F80
+  07800F8007800F0007801E0007801C000F807800FFFFE000> 32 31 -2 0 30.934] 80 @dc
+[<70F8F8F8700000000000000000000070F8F8F870> 8 20 -4 0 12.626] 58 @dc
+[ 329 ] /cmbx10.329 @newfont
+cmbx10.329 @sf
+[<FFE3FF8FFEFFE3FF8FFE1F007C01F01F007C01F01F007C01F01F007C01F01F007C01F01F007C01F01F007C01F01F007C01F0
+  1F007C01F01F007C01F01F007C01F01F007C01F01F807E01F01E807E01F01E407D01F01E70F9C3E0FE1FF07FC0FE0FE03F80> 40 20 -3 0 43.559] 109 @dc
+[<00FF8007FFE00F80701E00183E00187C00007C0000FC0000FC0000FC0000FFFFF8FFFFF8FC00F87C00F87C00F03E00F01E01
+  E00F83C007FF8001FE00> 24 20 -1 0 23.958] 101 @dc
+[<FFE00000FFE000001F0000001F0000001F0000001F0000001F0000001F0000001F0000001F1F80001F7FE0001FC1F8001F80
+  FC001F007C001F007E001F003E001F003F001F003F001F003F001F003F001F003F001F003F001F003E001F007E001F007E00
+  1F80FC001FE1F800FF7FF000FF1FC000> 32 29 -2 9 29.040] 112 @dc
+[<0FE07E3FF8FE7E0DE0FC05E0F803E0F803E0F803E07C03E03C03E01F03E007FBE0007FE00003E00C03E03F03E03F03E03F07
+  C03F0F801FFF0007FC00> 24 20 -1 0 25.410] 97 @dc
+[<FFE3FF00FFE3FF001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00
+  F8001F00F8001F00F8001E80F8001EC0F8001E61F000FE3FE000FE0FC000> 32 20 -3 0 29.040] 110 @dc
+[<03F8FF000FFEFF001F07F8003E01F8007E00F8007C00F8007C00F800FC00F800FC00F800FC00F800FC00F800FC00F800FC00
+  F8007C00F8007C00F8007E00F8003E01F8001F83F8000FFEF80001F8F8000000F8000000F8000000F8000000F8000000F800
+  0000F8000000F8000000F8000000F8000000F8000007F8000007F800> 32 32 -2 0 29.040] 100 @dc
+[<FF81FF80FF81FF800E00FC000701F8000381F0000183E00000C7C000006F8000007F8000003F0000003E0000007E000000FF
+  000001F3800003E1800007E0C00007C0E0000F807000FFE1FF00FFE1FF00> 32 20 -1 0 27.588] 120 @dc
+cmr10.329 @sf
+[<FFE0C1FFF81F00C01F800E00C00F000401E00F000401E00F000401E00F000403900F000403900F000407080F000407080F00
+  0407080F00040E040F00040E040F00040E040F00041C020F00041C020F00041C020F000438010F000438010F000438010F00
+  0470008F000470008F000470008F0004E0004F0004E0004F0005C0002F0005C0002F0005C0002F000780001F000F80001F80
+  FF80001FF8> 40 31 -2 0 41.666] 77 @dc
+cmbx10.329 @sf
+[<FFE0FFE01F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F00FF00FF0000000000000000000000
+  00001C003E003F007F003F003E001C00> 16 33 -2 0 14.520] 105 @dc
+cmr10.329 @sf
+[<FFE001001F0003000E000300040007000400070004000F0004001F0004001F0004003D0004003D00040079000400F1000400
+  F1000401E1000401E1000403C1000407810004078100040F0100040F0100041E0100043C0100043C01000478010004780100
+  04F0010005E0010005E0010007C0038007C007C0FF803FF8> 32 31 -2 0 34.090] 78 @dc
+cmbx10.329 @sf
+[<FFE0FFE01F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F00
+  1F001F001F001F001F00FF00FF00> 16 32 -2 0 14.520] 108 @dc
+cmr10.329 @sf
+[<0020004000800100020006000C000C00180018003000300030007000600060006000E000E000E000E000E000E000E000E000
+  E000E000E000E0006000600060007000300030003000180018000C000C00060002000100008000400020> 16 46 -3 12 17.676] 40 @dc
+[<800040002000100008000C00060006000300030001800180018001C000C000C000C000E000E000E000E000E000E000E000E0
+  00E000E000E000E000C000C000C001C001800180018003000300060006000C0008001000200040008000> 16 46 -3 12 17.676] 41 @dc
+cmbx10.329 @sf
+[<C7F000FFFC00F81E00F00600E00700C00700C00700001F0003FE001FFE003FFC007FF800FFC000F80000E00600E00600600E
+  00701E003FFE000FE600> 24 20 -2 0 20.618] 115 @dc
+cmr10.329 @sf
+[<00200000700000700000700000E80000E80001EC0001C40001C4000382000382000382000701000701000E00800E00800E00
+  801C00C01E01E0FF83F8> 24 20 -1 0 23.989] 118 @dc
+[<FFC0FFC0> 16 2 -1 -9 15.151] 45 @dc
+[<003FF800038000038000038000038000038000038000038000038003E3800E13801C0B80380780380380780380700380F003
+  80F00380F00380F00380F00380F003807003807803803803803C07801C058006198003E080> 24 29 -2 9 23.989] 113 @dc
+[<07FFFE00001F8000000F0000000F0000000F0000000F0000000F0000000F0000000F0000000F0000000F0000000F0000000F
+  0000000F0000000F0000000F0000000F0000000F0000000F0000000F0000000F0000800F0010800F0010800F0010800F0010
+  C00F0030400F0020400F0020600F0060780F01E07FFFFFE0> 32 31 -2 0 32.828] 84 @dc
+cmbx10.329 @sf
+[<181F80001C7FE0001EC1F8001F807C001F007C001F003E001F003E001F003F001F003F001F003F001F003F001F003F001F00
+  3F001F003E001F003E001F007E001F807C001FE0F8001F7FF0001F1FC0001F0000001F0000001F0000001F0000001F000000
+  1F0000001F0000001F0000001F0000001F000000FF000000FF000000> 32 32 -2 0 29.040] 98 @dc
+[ 329 ] /cmti10.329 @newfont
+cmti10.329 @sf
+[<1F000031C00060E000607000E03800E03C00E01C00E01E00E01E00E01E00700F00700F00700F00700F00380F00380F003C0E
+  003A0E001D0C001CF0001C00001C00000E00000E00000E00000E00000700000700000700000700003F8000078000> 24 32 -5 0 20.908] 98 @dc
+[<0F0700308C80705C40703C40F01C40F01C40F00E20F00E00F00E00F00E007807007807007807003807003C03801C03800E03
+  800707800389C000F180> 24 20 -4 0 23.232] 97 @dc
+[<FFC0001C00001C00000E00000E00000E00000E0000070000070000071E0007238003C1C00380E003807003807801C03801C0
+  3C01C03C01C03C00E01E00E01E00E01E08E01E04701E04701E04781C04741C02621801C1E0> 24 29 0 9 23.232] 112 @dc
+cmr10.329 @sf
+[<FFFFFC0F807C07801C07800C0780040780040780060780020780020780020780020780000780000780000780000780000780
+  000780000780000780000780000780000780000780000780000780000780000780000780000FC000FFFE00> 24 31 -2 0 28.408] 76 @dc
+cmbx10.329 @sf
+[<FFF000FFF0001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F03001F8FC01E8FC01E8F
+  C01ECFC0FE7F80FE3E00> 24 20 -2 0 21.527] 114 @dc
+cmr10.329 @sf
+[<000003E0FFFC0F100FC01E0807803E0407807E0407807C0407807C0007807C0007807C000780780007807800078078000780
+  70000780F0000780E0000781C00007FF80000780F0000780780007803C0007801E0007801E0007801F0007801F0007801F00
+  07801F0007801E0007801E0007803C00078078000F80F000FFFF8000> 32 32 -2 1 33.459] 82 @dc
+cmbx10.329 @sf
+[<01F003F807CC0F860F860F860F860F860F800F800F800F800F800F800F800F800F800F80FFFCFFFC3F800F80078003800380
+  0380018001800180> 16 29 -1 0 20.328] 116 @dc
+cmr10.329 @sf
+[<FFE01FFE1F8007E0070003C0020007C00100078001800F0000801F0000401E0000603C0000207C00001078000018F8000009
+  F0000005E0000007E0000003C00000078000000F8000000F8000001E4000003E4000003C2000007C10000078100000F00800
+  01F00C0001E0040003C0020007C007000FE00FC07FF83FF8> 32 31 -1 0 34.090] 88 @dc
+[<7FE7FE0700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700E00700
+  E00700E00700E0FFFFE00700E00700E00700E00700E00700E00700E00700E00700E00381E001C1E000E0E0003FE0> 24 32 0 0 25.252] 13 @dc
+1 @bop1
+cmr17.300 @sf
+511 489 p (Installing) s
+22 r (the) s
+21 r (Staden) s
+22 r 80 c
+-1 r (ac) s
+-2 r 107 c
+-4 r (age) s
+cmr12.300 @sf
+810 616 p (Simon) s
+17 r (Dear) s
+800 718 p (21) s
+16 r (Ma) s
+0 r 121 c
+15 r (1993) s
+cmbx10.432 @sf
+224 911 p 49 c
+69 r (In) s
+-1 r (tro) s
+1 r (duction) s
+cmr10.329 @sf
+224 1012 p (On) s
+18 r (the) s
+17 r (accompan) s
+0 r (ying) s
+17 r (tap) s
+1 r 101 c
+18 r 121 c
+-1 r (ou) s
+17 r (will) s
+18 r (\014nd) s
+17 r (executables) s
+18 r (for) s
+17 r (one) s
+18 r (of) s
+18 r (SunOS) s
+224 1069 p (4.x,) s
+14 r (Sun) s
+13 r (Solaris) s
+14 r (2.x,) s
+14 r (DEC) s
+13 r (Ultrix,) s
+14 r (DEC) s
+13 r (OSF/1) s
+14 r (and) s
+13 r (Silicon) s
+14 r (Graphics) s
+13 r (SGI) s
+224 1125 p (op) s
+1 r (erating) s
+20 r (systems.) s
+33 r (Also) s
+19 r (there) s
+19 r (are) s
+20 r (sources) s
+19 r (for) s
+20 r (all) s
+19 r (the) s
+19 r (programs) s
+20 r (in) s
+19 r (the) s
+224 1181 p (Staden) s
+15 r (pac) s
+0 r 107 c
+-2 r (age.) s
+19 r (Programs) s
+15 r (in) s
+15 r (the) s
+15 r (pac) s
+0 r 107 c
+-3 r (age) s
+15 r (are:) s
+cmbx10.329 @sf
+224 1275 p (mep) s
+18 r (and) s
+17 r (xmep) s
+cmr10.329 @sf
+23 r (Motif) s
+15 r (exploration) s
+15 r (program.) s
+cmbx10.329 @sf
+224 1369 p (nip) s
+18 r (and) s
+17 r (xnip) s
+cmr10.329 @sf
+23 r (Nucleotide) s
+15 r (in) s
+0 r (terpretation) s
+14 r (program.) s
+cmbx10.329 @sf
+224 1463 p (nipl) s
+cmr10.329 @sf
+23 r (Nucleotide) s
+19 r (in) s
+0 r (terpretation) s
+19 r (program) s
+19 r (\(library\).) s
+34 r (Searc) s
+0 r (hes) s
+18 r 110 c
+0 r (ucleotide) s
+338 1519 p (libraries) s
+15 r (for) s
+15 r (patterns) s
+15 r (of) s
+15 r (motifs.) s
+cmbx10.329 @sf
+224 1613 p (pip) s
+18 r (and) s
+17 r (xpip) s
+cmr10.329 @sf
+23 r (Protein) s
+15 r (in) s
+0 r (terpretation) s
+14 r (program.) s
+cmbx10.329 @sf
+224 1707 p (pipl) s
+cmr10.329 @sf
+23 r (Protein) s
+11 r (in) s
+0 r (terpretation) s
+10 r (program) s
+11 r (\(library\).) s
+19 r (Searc) s
+0 r (hes) s
+10 r (protein) s
+12 r (libraries) s
+338 1763 p (for) s
+15 r (patterns) s
+15 r (of) s
+15 r (motifs.) s
+cmbx10.329 @sf
+224 1857 p (sip) s
+18 r (and) s
+17 r (xsip) s
+cmr10.329 @sf
+23 r (Similarit) s
+-1 r 121 c
+15 r (in) s
+-1 r 118 c
+-1 r (estigation) s
+14 r (program.) s
+cmbx10.329 @sf
+224 1951 p (sipl) s
+cmr10.329 @sf
+23 r (Similarit) s
+0 r 121 c
+14 r (in) s
+0 r 118 c
+-1 r (estigation) s
+14 r (program) s
+16 r (\(library\).) s
+21 r (Compares) s
+16 r 97 c
+16 r (prob) s
+1 r 101 c
+15 r (pro-) s
+338 2008 p (tein) s
+15 r (or) s
+15 r 110 c
+0 r (ucleic) s
+14 r (acid) s
+15 r (sequence) s
+15 r (against) s
+15 r 97 c
+16 r (library) s
+15 r (of) s
+15 r (sequences.) s
+cmbx10.329 @sf
+224 2101 p (sap) s
+18 r (and) s
+17 r (xsap) s
+cmr10.329 @sf
+23 r (The) s
+15 r (original) s
+15 r (sequence) s
+15 r (assem) s
+0 r (bly) s
+14 r (program.) s
+cmbx10.329 @sf
+224 2195 p (bap) s
+18 r (and) s
+17 r (xbap) s
+cmr10.329 @sf
+23 r (Our) s
+13 r (latest,) s
+13 r (most) s
+13 r (adv) s
+-1 r (anced) s
+12 r (sequence) s
+13 r (assem) s
+0 r (bly) s
+12 r (program.) s
+cmbx10.329 @sf
+224 2289 p (dap) s
+18 r (and) s
+17 r (xdap) s
+cmr10.329 @sf
+23 r (An) s
+15 r (obsolete) s
+15 r (assem) s
+0 r (bly) s
+14 r (program,) s
+15 r (sup) s
+1 r (erceded) s
+15 r 98 c
+0 r 121 c
+cmti10.329 @sf
+14 r 98 c
+-1 r (ap) s
+cmr10.329 @sf
+0 r 46 c
+cmbx10.329 @sf
+224 2383 p (lip) s
+cmr10.329 @sf
+23 r (Library) s
+15 r (in) s
+0 r (terface) s
+14 r (program.) s
+cmbx10.329 @sf
+224 2477 p (rep) s
+cmr10.329 @sf
+23 r (Rep) s
+1 r (eat) s
+15 r (examination) s
+15 r (program.) s
+cmbx10.329 @sf
+224 2570 p (ted) s
+cmr10.329 @sf
+23 r 88 c
+14 r (windo) s
+0 r (ws) s
+14 r (utilit) s
+-1 r 121 c
+14 r (for) s
+14 r (displa) s
+0 r (ying) s
+13 r (and) s
+15 r (editing) s
+14 r (\015uorescen) s
+0 r 116 c
+13 r (sequencing) s
+338 2627 p (mac) s
+0 r (hine) s
+14 r (traces.) s
+925 2776 p 49 c
+@eop
+2 @bop0
+cmbx10.329 @sf
+[<FFFF80FFFF8003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E00003E0
+  0003E00003E00003E00003E00003E00003E00003E000F3E000FFE0000FE00001E00000E000> 24 29 -4 0 26.136] 49 @dc
+[<2000700018000C000E0006000600030003003B007F00FF00FF00FE007C003800> 16 16 -4 9 14.520] 44 @dc
+[<FFFFC0FFFFC07FFFC03FFFC01FFFC01C00E00E006007006003806001C00000E00000F000007800003E00001F00001F80000F
+  C0000FC0000FE03807E07C07E0FE07E0FE0FE0FE0FC0FE0FC07C1F80383F001FFE0007F000> 24 29 -3 0 26.136] 50 @dc
+[<03FC001FFF803C0FC07807E0FC03F0FE03F0FE03F8FE03F87C03F83803F80003F80003F00003E00007C0000F8001FC0001FC
+  00001F00000F80000FC01E0FC03F07E03F07E03F07E03F07E01E0FC00E0F8007FF0001FC00> 24 29 -2 0 26.136] 51 @dc
+[<FFE3FF00FFE3FF001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00
+  F8001F00F8001F00F8001F80F8001FC0F8001F61F0001F3FE0001F0FC0001F0000001F0000001F0000001F0000001F000000
+  1F0000001F0000001F0000001F0000001F000000FF000000FF000000> 32 32 -3 0 29.040] 104 @dc
+cmr10.329 @sf
+[<FFFFF0000F803C0007801F0007800F0007800F8007800780078007C0078007C0078007C0078007C0078007C0078007800780
+  0F8007800F0007801E0007803C0007FFF00007803C0007801E0007801F0007800F0007800F8007800F8007800F8007800F80
+  07800F8007800F0007801F0007801E000F803800FFFFE000> 32 31 -2 0 32.196] 66 @dc
+cmbx10.329 @sf
+[<01FF000FFFE03F01F878003C78003CF0001EF0001EF0001E70003E3C007C1FFFFC07FFF80FFFF01FFF801C00001800001800
+  0009FC000FFF000F07801E03C03E03E03E03E03E03E03E03E03E03E01E03DE0F079E07FFFE01FC3C> 24 30 -1 10 26.136] 103 @dc
+[<01FC0007FF001F81C03F00C03E00607E00007C0000FC0000FC0000FC0000FC0000FC0000FC00007C03007C0FC03E0FC03E0F
+  C01F0FC007FF8001FE00> 24 20 -2 0 23.232] 99 @dc
+[<01FF0007FFC01F83F03E00F83E00F87C007C7C007CFC007EFC007EFC007EFC007EFC007EFC007E7C007C7C007C3E00F83E00
+  F81F83F007FFC001FF00> 24 20 -1 0 26.136] 111 @dc
+[<001C0000001C0000003E0000003E0000007F0000007F000000FF800000F9800001F9C00001F0C00001F0C00003E0600003E0
+  600007C0300007C030000F8018000F8018001F001C00FFE07F80FFE07F80> 32 20 -1 0 27.588] 118 @dc
+cmti10.329 @sf
+[<78780084C600E58100F38100F3808063808001C04001C00001C00001C00000E00000E00000E00040E0002070C02071E01071
+  E01068E00CC440038380> 24 20 -3 0 21.085] 120 @dc
+[<0F0700308C80705C40703C40F01C40F01C40F00E20F00E00F00E00F00E007807007807007807003807003C03801C03800E03
+  800707800389C000F1C00001C00001C00000E00000E00000E00000E00000700000700000700000700003F8000078> 24 32 -4 0 23.232] 100 @dc
+cmbx10.329 @sf
+[<FFF0FFF0FFF0FFF0> 16 4 -1 -8 17.424] 45 @dc
+[<0007FF000007FF000000F8000000F8000000F8000000F8000000F8000000F8000000F80003F8F8000FFEF8001F87F8003F01
+  F8007E00F8007E00F8007C00F800FC00F800FC00F800FC00F800FC00F800FC00F800FC00F8007C00F8007E00F8003E01F800
+  3F01F8001F87780007FE380001F81800> 32 29 -2 9 27.588] 113 @dc
+[<FFE00FFFE0FFE00FFFE00E0000FE00060000FC00060001FC00030001F800030001F800018003F000018003F00001FFFFF000
+  00FFFFE00000C007E00000600FC00000600FC00000701FC00000301F800000303F800000183F000000183F0000000C7E0000
+  000C7E0000000EFE00000006FC00000006FC00000003F800000003F800000001F000000001F000000001F000000000E00000
+  0000E00000> 40 31 -2 0 39.519] 65 @dc
+[<FFFFFF00FFFFFFE007C007F007C003F807C001FC07C000FC07C000FE07C000FE07C000FE07C000FE07C000FE07C000FC07C0
+  01FC07C001F807C003F007FFFFC007FFFF8007C00FE007C007F007C003F807C001F807C001FC07C001FC07C001FC07C001FC
+  07C001FC07C001F807C003F007C007E0FFFFFFC0FFFFFE00> 32 31 -2 0 37.183] 66 @dc
+[<FFFF00FFFF0007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E0
+  0007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E00007E000FFFF00FFFF00> 24 31 -2 0 19.823] 73 @dc
+[<81FF00E7FFC0FE01E0F80070E00078E00038C0003CC0003CC0003C00003C00007C0000FC0007F800FFF807FFF00FFFF01FFF
+  E03FFF807FFE007FC000FC0000F80000F00018F00018F000387000387000783800F81E03F80FFF3803FC08> 24 31 -3 0 29.040] 83 @dc
+[<FFF0000600FFF00006000600000E000600001E000600003E000600007E00060000FE00060000FE00060001FE00060003F600
+  060007E60006000FE60006001FC60006001F860006003F060006007E06000600FC06000601FC06000601F806000603F00600
+  0607E00600060FC00600061F800600063F800600063F000600067E00060006FC00060007F800060007F0000600FFF000FFF0
+  FFE000FFF0> 40 31 -2 0 40.908] 78 @dc
+[<FFC3FF00FFC3FF001E00F8001E01F0001E01E0001E03C0001E07C0001E0F80001F0F00001F9E00001FFC00001F7C00001F18
+  00001F0C00001F0700001F0380001F00C0001F00F0001F01FE001F01FE001F0000001F0000001F0000001F0000001F000000
+  1F0000001F0000001F0000001F0000001F000000FF000000FF000000> 32 32 -2 0 27.588] 107 @dc
+[<0007FC00003FFF8000FE01C003F0007007E000380FC000181F80000C3F00000C3F0000067F0000067E0000067E000000FE00
+  0000FE000000FE000000FE000000FE000000FE000000FE0000007E0000067E0000067F0000063F00000E3F00000E1F80001E
+  0FC0001E07E0003E03F000FE00FE03DE003FFF0E0007FC02> 32 31 -3 0 37.751] 67 @dc
+[<FFFF8000FFFF800007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0600007E0600007E0
+  600007E0E00007E1E00007FFE00007FFE00007E1E00007E0E00007E0600007E0603007E0603007E0003007E0003007E00070
+  07E0006007E000E007E001E007E007E0FFFFFFE0FFFFFFE0> 32 31 -2 0 32.890] 70 @dc
+[<3FFC003FFC0007C00007C00007C00007C00007C00007C00007C00007C00007C00007C00007C00007C00007C00007C00007C0
+  0007C000FFFC00FFFC0007C00007C00007C00007C00007C00007C3C007C7E003C7E003E7E001F3E000FFC0001F80> 24 32 -1 0 15.972] 102 @dc
+[<0000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E00000
+  0000E000000000E000000000E000000000E000000000E000000000E00000FFFFFFFFC0FFFFFFFFC0FFFFFFFFC00000E00000
+  0000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E000000000E00000
+  0000E000000000E000000000E000000000E000000000E00000> 40 35 -3 6 40.655] 43 @dc
+[<03F8FF0007FCFF000F06F8001F01F8001F01F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00F8001F00
+  F8001F00F8001F00F8001F00F8001F00F8001F00F800FF07F800FF07F800> 32 20 -3 0 29.040] 117 @dc
+[<0018007000E001C00380038007000E000E001E001C003C003C007800780078007800F800F000F000F000F000F000F000F000
+  F000F000F80078007800780078003C003C001C001E000E000E0007000380038001C000E000700018> 16 45 -3 11 20.328] 40 @dc
+cmti10.329 @sf
+[<1F8000206000401000E00800F00C00F00C00700E00000E00003E0003FC0007F8000FF0000F80000C00000C06000C07000C03
+  0006010003020000FC00> 24 20 -3 0 18.585] 115 @dc
+[<0FFE0000E00000E0000070000070000070000070000038000038000F380030B800705C00703C00F01C00F01C00F00E00F00E
+  00F00E00F00E007807007807007807003807003C03801C03800E03800705800388C000F040> 24 29 -4 9 20.908] 113 @dc
+[<07C3800C26401C1E20180E20180E201C0E201C07101C07001C07001C07000E03800E03800E03808703804701C04301C04381
+  C02301C03300E00E00C0> 24 20 -4 0 24.393] 117 @dc
+[<1C003300310070803080388038401C001C001C000E000E000E008700470043004380230033000E0000000000000000000000
+  0000000001C001E001E000C0> 16 31 -4 0 13.939] 105 @dc
+[<3000007000003800003800003800003800001C00001C00001C00001C00000E00000E00000E00008E00004703004707804787
+  804783802661001C1E00> 24 20 -4 0 19.166] 114 @dc
+[<07C000183800380400700200700100700000F00000F00000F00000F000007C00007BF000780C003802003C01001C01000E01
+  0007010001C200007C00> 24 20 -4 0 20.908] 101 @dc
+[<38006400E200E200E200E200710070007000700038003800380038001C001C001C001C000E000E000E000E00070007000700
+  070003800380038003801FC003C0> 16 32 -4 0 11.616] 108 @dc
+[<080000100000100000200000600000600000400000C00000C00000C00000C00000C00000C00000C00000C00000C00000C000
+  00C00000C00000C00000E00000E000006000006000007000007000003000003800003800001800001C00000C00000E000006
+  000007000003000001800001800000C000006000002000001000000800000400000200000100> 24 46 -7 12 18.585] 40 @dc
+[<03C0000E30001C08001C04001C04001C02001C02001C01001C01001C01000E00800E00800E00808700804700C04301C04383
+  C02307C03307800E0380> 24 20 -4 0 20.908] 118 @dc
+[<FFF80780038003800380038001C001C001C001C000E000E000E000E000700070007000700038003800380038001C031C00DC
+  003C000E000600020002> 16 30 -5 0 23.232] 49 @dc
+[<E0F0F8F870> 8 5 -5 0 13.939] 46 @dc
+[<006000007000007000003800003800003800003800001C00001C00801FC0607C003F8E00080E00040E00060E000307000107
+  0000870000C700006300003000003000001800001800000C00000E000006000007000007000003000003800003800001C000
+  01C00001C00000E00000E00000E0000060> 24 39 -2 9 23.232] 52 @dc
+[<8000006000003000001800000C000006000003000001000001800000C00000E000006000007000003000003800001800001C
+  00000C00000C00000E0000060000060000070000070000030000030000030000038000038000038000018000018000018000
+  0180000180000180000180000100000300000300000300000200000600000400000800001000> 24 46 0 12 18.585] 41 @dc
+cmbx10.329 @sf
+[<C000700038001C000E000E0007000380038003C001C001E001E000F000F000F000F000F80078007800780078007800780078
+  0078007800F800F000F000F000F001E001E001C003C00380038007000E000E001C0038007000C000> 16 45 -3 11 20.328] 41 @dc
+cmbx10.432 @sf
+[<FFFFFF80FFFFFF80FFFFFF807FFFFFC03FFFFFC01FFFFFC00FFFFFC0070001C0038001C001C001C000E000E0007000E0003C
+  00E0001E0000000F0000000780000003E0000003F0000001F8000000FC000000FE0000007F0000007F8000003FC000003FC0
+  00003FE03C003FE07E001FE0FF001FE0FF001FE0FF003FE0FF003FC0FE003FC07C007F803800FF001F03FE000FFFFC0003FF
+  F00000FF8000> 32 39 -3 0 34.370] 50 @dc
+[<FFFFF0003FF000FFFFF001FFF800FFFFF003FE1C0003FC0007F80E0003FC000FF8070003FC001FF8070003FC001FF0070003
+  FC001FF0000003FC001FF0000003FC001FF0000003FC001FE0000003FC001FE0000003FC001FE0000003FC001FE0000003FC
+  001FE0000003FC001FC0000003FC003F80000003FC003F80000003FC007F00000003FC00FC00000003FFFFF000000003FFFF
+  FE00000003FC007FC0000003FC001FE0000003FC0007F8000003FC0007F8000003FC0003FC000003FC0003FE000003FC0003
+  FE000003FC0003FE000003FC0003FE000003FC0003FE000003FC0003FE000003FC0003FC000003FC0007FC000003FC0007F8
+  000003FC000FF0000003FC007FE00000FFFFFFFF800000FFFFFFFE000000FFFFFFE0000000> 56 41 -3 0 51.555] 82 @dc
+[<001FF00000FFFE0003F81F0007E003800FC001C01F8000E03F8000E07F0000007F0000007F000000FF000000FF000000FF00
+  0000FFFFFFE0FFFFFFE0FF0007E0FF0007E07F0007E07F0007C07F000FC03F800FC01F800F800F801F8007C01F0003F07E00
+  01FFF800003FE000> 32 27 -2 0 31.506] 101 @dc
+[<00003FFF8000003FFF8000003FFF80000003F800000003F800000003F800000003F800000003F800000003F800000003F800
+  000003F800000003F800003FC3F80000FFF3F80003F07BF80007E01FF8000FC007F8001F8007F8003F8003F8007F8003F800
+  7F0003F8007F0003F800FF0003F800FF0003F800FF0003F800FF0003F800FF0003F800FF0003F800FF0003F8007F0003F800
+  7F8003F8007F8003F8003F8003F8001FC007F8000FC007F80007E00DF80003F838F80000FFF07800001FC03800> 40 39 -2 12 36.280] 113 @dc
+[<FFFE1FFFC3FFF8FFFE1FFFC3FFF8FFFE1FFFC3FFF80FE001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800F
+  E001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FE0
+  01FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FE001FC003F800FF001FE003F800FF001FE003F800FD801
+  FF003F800FCC01F9803F000FC603F8C07F00FFC383F0707E00FFC1FFE03FFC00FFC07F800FF000> 56 27 -3 0 57.283] 109 @dc
+[<C1FF00F7FFC0FF01E0FC0070F80038F00038F0003CE0003C60007C0000FC0003FC00FFF807FFF81FFFF03FFFE07FFF807FFE
+  00FFE000FC0000F80070F00070F000707000703800F01E03F00FFFF003FE30> 24 27 -2 0 27.114] 115 @dc
+cmr10.329 @sf
+[<007FFE00000007C000000003C000000003C000000003C000000003C000000003C000000003C000000003C000000003C00000
+  0003C000000003C000000003C000000007C000000007A00000000FB00000001F100000001E080000003E080000003C040000
+  007C04000000F802000000F003000001F001000001E000800003E000800007C000400007800040000F800060001F8000F800
+  FFF003FF00> 40 31 -1 0 34.090] 89 @dc
+[<7FE3FF0007007000070070000700700007007000070070000700700007007000070070000700700007007000070070000700
+  7000070070000700700007007000070070000700700007007000FFFFFF800700700007007000070070000700700007007000
+  07007000070070000300F0300380F87801C0787800F06E30001F83E0> 32 32 0 0 26.515] 11 @dc
+[<0000078000000FC000001FE000001FE000003FF0000038700000383000003010001FB01000F0F01001E0380007A03E000F20
+  4F000E2047001E1087803C0F03C03C0003C07C0003E0780001E0780001E0F80001F0F80001F0F80001F0F80001F0F80001F0
+  F80001F0F80001F0F80001F0F80001F0780001E07C0003E07C0003E03C0003C03C0003C01E0007800E0007000F000F000780
+  1E0001C0380000F0F000001F8000> 32 41 -3 9 35.353] 81 @dc
+[<03E0000C3800100E00200600400700400380E00380F003C0F003C07003C00003C00003C00003C00003800003801007801007
+  00180E00161C0011F0001000001000001000001000001000001000001FE0001FF8001FFC001FFE00180300> 24 31 -2 1 22.727] 53 @dc
+[<03F0000E1C001C0E00180600380700780780700380700380700380F003C0F003C0F003C0F003C0F003C0F003C0F003C0F003
+  C0F003C0F003C0F003C0F003C0F003C07003807003807003807003803807001806001C0E000E1C0003F000> 24 31 -2 1 22.727] 48 @dc
+[<FFFC3FFC0FC007E0078007C0078007800780078007800F0007801E0007801E0007803C0007803C00078078000780F00007C0
+  F00007A1E0000791E0000793C000078F80000787800007830000078100000780800007804000078020000780100007800800
+  078004000780020007800100078001800FC003E0FFFC0FFC> 32 31 -2 0 35.353] 75 @dc
+[<0FC000107000201800700C00780E0078060030070000070000038000038000038003E3C00E13C0180BC03807C07007C07007
+  C0F003C0F003C0F003C0F003C0F003C0F00380F003807003807007003807003806001C0C000E180003F000> 24 31 -2 1 22.727] 57 @dc
+[<03F0001C3C00200E00400F00400780F00780F807C0F807C0F807C02007C00007C0000780000780000F00000E00003C0003F0
+  00003800001C00000E00000F00000F00000F80380F80780780780780780F80200F00100E000C1C0003F000> 24 31 -2 1 22.727] 51 @dc
+[<01F000061C000C0E001807003807003803807003807003C07003C0F003C0F003C0F003C0F003C0F003C0F80380F80380F807
+  00F40600F21C00F1F0007000007000007800003800003803001C07800C07800E0380070100018200007C00> 24 31 -2 1 22.727] 54 @dc
+[<03000007800007800007800007800007800007800007800003800003800003800003800001800001C00000C00000C0000040
+  000040000020000020000010000008000008008004008002008002004001007FFF807FFF807FFFC0400000> 24 31 -3 1 22.727] 55 @dc
+[ 329 ] /cmsy10.329 @newfont
+cmsy10.329 @sf
+[<03C0000FF0001FF8003FFC007FFE007FFE00FFFF00FFFF00FFFF00FFFF00FFFF00FFFF007FFE007FFE003FFC001FF8000FF0
+  0003C000> 24 18 -3 -2 22.727] 15 @dc
+cmbx10.432 @sf
+[<00FF800007FFF0001FFFFC003F01FE007C007F007E007F80FF007FC0FF003FC0FF003FE0FF003FE07E003FE03C003FE00000
+  3FE000003FE000003FC000003FC000007F8000007F0000007E000001FC0000FFF00000FFC0000007F0000001F8000001FC00
+  0000FE000000FF000000FF000F007F801F807F803F807F803F807F803F807F803F80FF001F00FF000F81FE0007FFFC0003FF
+  F000007F8000> 32 39 -3 0 34.370] 51 @dc
+[<01FC03FC0FFF0FFC3F839FFC7F00DF807E007F80FE003F80FE003F80FE003F80FE003F807F003F803F003F803F803F800FE0
+  3F8007FC3F8000FFFF80000FFF8000003F8000003F8000003F8007003F800F803F801FC03F001FC07E001FC07E000F81F800
+  07FFF00001FF8000> 32 27 -2 0 33.415] 97 @dc
+[<FFFEFFFEFFFE0FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE0
+  0FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE00FE0FFE0FFE0FFE0> 16 42 -3 0 19.094] 108 @dc
+2 @bop1
+cmbx10.329 @sf
+224 307 p (splitp1,) s
+18 r (splitp2) s
+17 r (and) s
+17 r (splitp3) s
+cmr10.329 @sf
+23 r (Refer) s
+15 r (to) s
+15 r (help/SPLITP) s
+-2 r (.MEM.) s
+cmbx10.329 @sf
+224 401 p (sethelp) s
+cmr10.329 @sf
+23 r (Builds) s
+15 r (online) s
+15 r (help) s
+15 r (\014les.) s
+cmbx10.329 @sf
+224 494 p (gip) s
+cmr10.329 @sf
+23 r (Gel) s
+15 r (input) s
+15 r (program.) s
+cmbx10.329 @sf
+224 588 p (con) s
+0 r 118 c
+-2 r (ert) s
+cmr10.329 @sf
+22 r (Con) s
+0 r 118 c
+-2 r (erts) s
+15 r 98 c
+1 r (et) s
+0 r 119 c
+-2 r (een) s
+cmti10.329 @sf
+14 r (xdap) s
+cmr10.329 @sf
+19 r (and) s
+cmti10.329 @sf
+15 r (xb) s
+-2 r (ap) s
+cmr10.329 @sf
+17 r (databases.) s
+cmbx10.329 @sf
+224 682 p (cop) s
+18 r (and) s
+17 r (cop-bap) s
+cmr10.329 @sf
+23 r (Chec) s
+-1 r (ks) s
+12 r (completed) s
+cmti10.329 @sf
+13 r (xdap) s
+cmr10.329 @sf
+16 r (and) s
+cmti10.329 @sf
+13 r (xb) s
+-1 r (ap) s
+cmr10.329 @sf
+14 r (databases) s
+13 r (for) s
+13 r (edit-) s
+338 738 p (ing) s
+15 r (errors.) s
+cmbx10.329 @sf
+224 832 p (trace2seq) s
+cmr10.329 @sf
+23 r (Extracts) s
+15 r (sequence) s
+15 r (from) s
+15 r (trace) s
+15 r (\014les.) s
+cmbx10.329 @sf
+224 925 p (getABISampleName) s
+cmr10.329 @sf
+23 r (Extracts) s
+15 r (sample) s
+15 r (names) s
+15 r (from) s
+15 r (ABI) s
+16 r (trace) s
+15 r (\014les.) s
+cmbx10.329 @sf
+224 1019 p (mak) s
+0 r (eSCF) s
+cmr10.329 @sf
+21 r (Con) s
+0 r 118 c
+-1 r (erts) s
+14 r (existing) s
+15 r (trace) s
+15 r (\014les) s
+16 r (to) s
+15 r (the) s
+15 r (compact) s
+15 r (SCF) s
+15 r (format.) s
+cmbx10.329 @sf
+224 1113 p (alfsplit) s
+cmr10.329 @sf
+23 r (Splits) s
+16 r (the) s
+17 r (Pharmacia) s
+16 r (A.L.F.) s
+16 r (gel) s
+17 r (\014le) s
+16 r (in) s
+0 r (to) s
+15 r 109 c
+0 r (ultiple) s
+15 r (\014les,) s
+17 r (one) s
+16 r (for) s
+338 1169 p (eac) s
+0 r 104 c
+14 r (sample.) s
+cmbx10.329 @sf
+224 1263 p (frog) s
+cmr10.329 @sf
+23 r (Relab) s
+1 r (els) s
+15 r (lanes) s
+15 r (in) s
+16 r (ABI) s
+15 r (trace) s
+15 r (\014les.) s
+cmbx10.329 @sf
+224 1356 p 43 c
+18 r 110 c
+-1 r (umerous) s
+17 r (scripts) s
+17 r (\(including) s
+cmti10.329 @sf
+17 r (squirr) s
+-1 r (el) s
+15 r (\(v1.4\)) s
+cmbx10.329 @sf
+2 r 41 c
+cmbx10.432 @sf
+224 1499 p 50 c
+69 r (Requiremen) s
+-1 r (ts) s
+cmr10.329 @sf
+224 1601 p 89 c
+-3 r (ou) s
+14 r (will) s
+15 r (need) s
+14 r 97 c
+15 r (tap) s
+1 r 101 c
+15 r (driv) s
+0 r 101 c
+13 r (to) s
+15 r (read) s
+15 r (the) s
+14 r (soft) s
+0 r 119 c
+-1 r (are) s
+13 r (o\013) s
+15 r (the) s
+15 r (distribution) s
+14 r (tap) s
+2 r 101 c
+224 1657 p (\(QIC-150,) s
+14 r (TK50,) s
+13 r (or) s
+13 r (Exab) s
+0 r (yte\).) s
+18 r 89 c
+-2 r (ou) s
+12 r (will) s
+13 r (also) s
+13 r (need) s
+13 r 97 c
+13 r (large) s
+13 r (amoun) s
+-1 r 116 c
+12 r (of) s
+13 r (disk) s
+224 1714 p (storage) s
+16 r (to) s
+16 r (accommo) s
+2 r (date) s
+16 r (the) s
+16 r (whole) s
+16 r (pac) s
+-1 r 107 c
+-2 r (age.) s
+22 r 70 c
+-3 r (or) s
+15 r (release) s
+16 r 118 c
+0 r (ersion-1993.0,) s
+224 1770 p (requiremen) s
+0 r (ts) s
+20 r 119 c
+0 r (ere) s
+21 r (31Mb) s
+21 r (\(SunOS) s
+22 r (4.x\),) s
+23 r (36Mb) s
+22 r (\(Sun) s
+21 r (Solaris) s
+22 r (2.x\)) s
+21 r (30Mb) s
+224 1827 p (\(DEC) s
+15 r (Ultrix\)) s
+15 r (37Mb) s
+16 r (\(DEC) s
+15 r (OSF/1\)) s
+15 r (and) s
+15 r (27Mb) s
+15 r (\(Silicon) s
+15 r (Graphics) s
+16 r (SGI.\)) s
+295 1883 p 84 c
+-3 r 111 c
+14 r (compile) s
+15 r (the) s
+15 r (Staden) s
+16 r (pac) s
+-1 r 107 c
+-2 r (age) s
+14 r 121 c
+0 r (ou) s
+14 r (will) s
+15 r (require:) s
+cmsy10.329 @sf
+292 1976 p 15 c
+cmr10.329 @sf
+23 r (An) s
+15 r (ANSI) s
+15 r 67 c
+16 r (compiler.) s
+cmsy10.329 @sf
+292 2070 p 15 c
+cmr10.329 @sf
+23 r 65 c
+15 r 70 c
+0 r (OR) s
+-4 r (TRAN-77) s
+14 r (compiler.) s
+cmsy10.329 @sf
+292 2164 p 15 c
+cmr10.329 @sf
+23 r (X11) s
+15 r (\(Release) s
+15 r 52 c
+16 r (or) s
+15 r (5\).) s
+cmsy10.329 @sf
+292 2257 p 15 c
+cmr10.329 @sf
+23 r (GNU) s
+15 r (mak) s
+0 r 101 c
+14 r (\(except) s
+15 r (with) s
+15 r (SunOS) s
+16 r (and) s
+15 r (Solaris) s
+15 r (2.x.\)) s
+cmbx10.432 @sf
+224 2400 p 51 c
+69 r (Installation) s
+cmr10.329 @sf
+224 2502 p 84 c
+-3 r 111 c
+15 r (install) s
+15 r (the) s
+15 r (pac) s
+0 r 107 c
+-3 r (age,) s
+280 2595 p (1.) s
+22 r (Create) s
+22 r 97 c
+22 r (directory) s
+21 r (for) s
+22 r (where) s
+21 r 121 c
+0 r (ou) s
+21 r 119 c
+-1 r (ould) s
+21 r (lik) s
+0 r 101 c
+20 r (the) s
+22 r (soft) s
+-1 r 119 c
+-1 r (are) s
+21 r (to) s
+21 r 98 c
+2 r 101 c
+338 2652 p (placed.) s
+20 r 89 c
+-3 r (ou) s
+14 r (ma) s
+0 r 121 c
+14 r (ha) s
+0 r 118 c
+-1 r 101 c
+14 r (to) s
+15 r 98 c
+1 r 101 c
+15 r (sup) s
+2 r (eruser) s
+15 r (to) s
+15 r (do) s
+15 r (this.) s
+925 2776 p 50 c
+@eop
+3 @bop0
+[ 329 ] /cmtt10.329 @newfont
+cmtt10.329 @sf
+[<7F1F1F00FFBFBF807F1F1F001C1C1C001C1C1C001C1C1C001C1C1C001C1C1C001C1C1C001C1C1C001C1C1C001C1C1C001C1C
+  1C001C1C1C001E1E1C001E1E1C001F1F1C007FFFF800FFFBF8007CE0E000> 32 20 1 0 23.863] 109 @dc
+[<FFC7F8FFC7F8FFC7F80E03C00E03800E07000E0F000F0E000F9C000FFC000FF8000EF0000E78000E3C000E1E000E0F000E07
+  800E3FF00E7FF00E3FF00E00000E00000E00000E00000E0000FE0000FE0000FE0000> 24 28 -1 0 23.863] 107 @dc
+[<03E3F00FFBF81FFFF03C1F80380F80700780700780E00380E00380E00380E00380E00380E00380700380700780380F803C1F
+  801FFF800FFB8003E380000380000380000380000380000380001F80003F80001F80> 24 28 -2 0 23.863] 100 @dc
+[<FFFF00FFFF80FFFF0001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C0
+  007FC000FFC0007FC00000000000000000000000000003800007C00007C00007C000038000> 24 29 -4 0 23.863] 105 @dc
+[<7FFE00FFFF007FFE0003800003800003800003800003800003800003800003800003800003C00003C00003E00003F03003F8
+  787FBFF8FF9FF07F87E0> 24 20 -1 0 23.863] 114 @dc
+[<600000F00000F00000F800007800007C00003C00003C00003E00001E00001F00000F00000F00000F800007800007C00003C0
+  0003C00003E00001E00001F00000F00000F800007800007800007C00003C00003E00001E00001E00001F00000F00000F8000
+  0780000780000300> 24 36 -3 4 23.863] 47 @dc
+[<7FC3FCFFE7FE7FC3FC0E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00F00E00F80E00FC1
+  C00FFFC00EFF800E3E000E00000E00000E00000E00000E00007E0000FE00007E0000> 24 28 0 0 23.863] 104 @dc
+[<01F0000FFE001FFF003E0F803C07807803C07001C0F001E0E000E0E000E0E000E0E000E0E000E07001C07001C03803803E0F
+  801FFF000FFE0001F000> 24 20 -2 0 23.863] 111 @dc
+[<01FC0007FF001FFF803E03C03801C07001C0700000E00000FFFFC0FFFFC0FFFFC0E001C0E001C07003807003803807803E0F
+  001FFE0007FC0001F000> 24 20 -3 0 23.863] 101 @dc
+[<E7F800FFFE00FFFF80F80780F001C0E001E0E000E06000E00000E00000E00001C00003C0000F8000FF0007FE001FF0003F00
+  00780000700000E00000E00380E00380E003807007807C0F803FFF801FFF8003F380> 24 28 -2 0 23.863] 83 @dc
+[<003E0000FF8001FFC001C1C00380E00380E00380E00380400380000380000380000380000380000380000380000380000380
+  00FFFFC0FFFFC07FFFC0038000038000038000038000018000> 24 25 -1 0 23.863] 116 @dc
+[<07E1F01FFBF03FFFF0781F00F00F00E00700E00700E007007807007F07001FFF0007FF0000FF00000700000700300E00781E
+  007FFC003FF8001FE000> 24 20 -3 0 23.863] 97 @dc
+[<7FC3FCFFE7FE7FC3FC0E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00F00E00F80E00FC1
+  C07FFFC0FEFF807E3E00> 24 20 0 0 23.863] 110 @dc
+[<01FC0007FF001FFF803E03C03801C07001C0700000E00000E00000E00000E00000E00000E000007000007000003803003E07
+  801FFF8007FF0001FE00> 24 20 -3 0 23.863] 99 @dc
+[<7F8FF0FF8FF87F8FF00F0780070700038E00039E0001DC0000F80000F00000700000F80001F80001DC00039E00078E000707
+  007F8FF07F9FF07F8FF0> 24 20 -1 0 23.863] 120 @dc
+[<00700000F80000F80001DC0001DC0001DC00038E00038E00038E00038E000707000707000707000E03800E03800E03801E03
+  C07F8FF0FF8FF87F8FF0> 24 20 -1 0 23.863] 118 @dc
+[<7FFF007FFF007FFF0001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C00001C0
+  00FFFFC0FFFFC07FFFC001C00001C00001C00001C0C000E1E000FFE0007FC0001F80> 24 28 -1 0 23.863] 102 @dc
+[<E7F000FFFC00FFFF00F80F00F00380E00380E00380600780001F0003FE001FFC007FE0007C0000E00700E00700E00700780F
+  007FFF003FFF0007F700> 24 20 -3 0 23.863] 115 @dc
+[<01F00007FC000FFE001F1F001C07003803807803C07001C07001C0F001E0E000E0E000E0E000E0E000E0E000E0E000E0E000
+  E0E000E0E000E07001C07001C07803C03803801C07001F1F000FFE0007FC0001F000> 24 28 -2 0 23.863] 48 @dc
+cmbx10.329 @sf
+[<03FFFFC003FFFFC00007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007
+  E0000007E0000007E0000007E0000007E0000007E0000007E000C007E006C007E006C007E006C007E006E007E00E6007E00C
+  6007E00C7007E01C7C07E07C7FFFFFFC7FFFFFFC> 32 30 -2 0 36.362] 84 @dc
+[<FFFFFE0000FFFFFFC00007E007F00007E001F80007E000FC0007E0007E0007E0003F0007E0003F0007E0001F8007E0001F80
+  07E0001F8007E0001FC007E0001FC007E0001FC007E0001FC007E0001FC007E0001FC007E0001FC007E0001FC007E0001FC0
+  07E0001F8007E0001F8007E0001F8007E0003F0007E0003F0007E0007E0007E000FC0007E001F80007E007F000FFFFFFC000
+  FFFFFE0000> 40 31 -2 0 40.087] 68 @dc
+[<FFFFFFF0FFFFFFF007E003F007E000F807E0007807E0003807E0001807E0001807E0001807E0000C07E0600C07E0600C07E0
+  600007E0E00007E1E00007FFE00007FFE00007E1E00007E0E00007E0600007E0603007E0603007E0003007E0003007E00070
+  07E0006007E000E007E001E007E007E0FFFFFFE0FFFFFFE0> 32 31 -2 0 34.342] 69 @dc
+[<FFFF007F80FFFF01FFC007E003F86007E007F03007E00FE03007E00FE00007E00FE00007E00FE00007E00FC00007E00FC000
+  07E00FC00007E00F800007E00F800007E01F000007E03E000007FFFC000007FFFF000007E01FC00007E007E00007E003F000
+  07E003F00007E003F80007E003F80007E003F80007E003F80007E003F00007E003F00007E007E00007E01FC000FFFFFF0000
+  FFFFF80000> 40 31 -2 0 39.203] 82 @dc
+[<001FF8000000FFFF000001F81F800007E007E0000FC003F0001F8001F8003F8001FC003F0000FC007F0000FE007F0000FE00
+  7E00007E00FE00007F00FE00007F00FE00007F00FE00007F00FE00007F00FE00007F00FE00007F00FE00007F00FE00007F00
+  7E00007E007E00007E007F0000FE003F0000FC001F0000F8001F8001F8000FC003F00007E007E00001F81F800000FFFF0000
+  001FF80000> 40 31 -3 0 39.266] 79 @dc
+cmti10.329 @sf
+[<1E003100708070807040704038203800380038001C001C001C001C000E000E000E000E000700FFF007000700038003800380
+  038001C00180> 16 28 -4 0 15.101] 116 @dc
+[<3001C07003303803103807083803083803881C03841C01C01C01C01C01C00E00E00E00E00E00E08E00E04700704700704780
+  604740602630C01C0F80> 24 20 -4 0 25.555] 110 @dc
+[<07C000187000301800700E00700F00F00700F00780F003C0F003C0F003C07801E07801E07801E03C01E01C01E01E01C00E01
+  C003018001C300007C00> 24 20 -4 0 23.232] 111 @dc
+[<3F800060E000F07000783800301C00001C00001C00000E00000E0003CE000C2E001C17001C0F003C07003C07003C03803C03
+  803C03803C03801E01C01E01C01E01C00E01C00F00E00700E00380E001C1E000E270003C60> 24 29 -2 9 20.908] 103 @dc
+cmtt10.329 @sf
+[<07FF0007FF0007FF000070000070000070000070000070000070000070000070000070000070000070000070000070000070
+  00007000007000007000007000E07038E07038E07038E07038FFFFF8FFFFF87FFFF8> 24 28 -1 0 23.863] 84 @dc
+[<7F07F0FF8FF87F07F01C01C01C01C00E03800E03800FFF800FFF800FFF80070700070700070700070700030600038E00038E
+  00038E00038E00018C0001DC0001DC0001DC0000D80000D80000F80000F800007000> 24 28 -1 0 23.863] 65 @dc
+[<7FF800FFFE007FFF001C0F801C03C01C01C01C01E01C00E01C00E01C00F01C00701C00701C00701C00701C00701C00701C00
+  701C00701C00F01C00E01C00E01C01E01C03C01C03C01C0F807FFF00FFFE007FF800> 24 28 -1 0 23.863] 68 @dc
+[<FFFFF8FFFFF8FFFFF81C00381C00381C00381C00381C00381C00001C00001C0E001C0E001C0E001FFE001FFE001FFE001C0E
+  001C0E001C0E001C00001C00001C00701C00701C00701C0070FFFFF0FFFFF0FFFFF0> 24 28 -1 0 23.863] 69 @dc
+[<7F03C0FF87C07F07C01C0DC01C0DC01C0DC01C1DC01C19C01C19C01C39C01C39C01C39C01C31C01C71C01C71C01C61C01CE1
+  C01CE1C01CE1C01CC1C01CC1C01DC1C01D81C01D81C01D81C07F07F0FF0FF87E07F0> 24 28 -1 0 23.863] 78 @dc
+[<7F00F0FF81F87F01F81C039C1C039C1C039C1C03801C03801C03801C03801C03801C07001C0F001FFE001FFE001FFF001C0F
+  801C03801C03C01C01C01C01C01C01C01C03C01C03801C0F807FFF00FFFE007FF800> 24 28 -1 0 23.863] 82 @dc
+[<0FF8003FFE007FFF00780F00700700F00780E00380E00380E00380E00380E00380E00380E00380E00380E00380E00380E003
+  80E00380E00380E00380E00380E00380F00780700700780F007FFF003FFE000FF800> 24 28 -3 0 23.863] 79 @dc
+[<01FCFC03FFFE07FFFC0F03E00E01E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00E00
+  E07E07E0FE0FE07E07E0> 24 20 0 0 23.863] 117 @dc
+[<00C00001C00001C00001C00007E0001FF8003FFE0079DE0071C700E1C700E1C380F1C380F1C38061C38001C70001CF0001DE
+  0003FC000FF8001FE0003DC00079C000F1C000E1C780E1C780E1C780E1C38071C7007DCF003FFE000FFC0003F00001C00001
+  C00001C00000C000> 24 36 -3 4 23.863] 36 @dc
+[<3078FCFC7830> 8 6 -9 0 23.863] 46 @dc
+[<7FFFC0FFFFE07FFFC000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E0
+  0000E00000E00000E00000E00000E00000E00000E00000E0007FE000FFE0007FE000> 24 28 -2 0 23.863] 108 @dc
+[<01FC000FFF801FFFC07E03F07800F0E00038E00038E00038E000387000707801F03FFFE01FFFC01FFE001C000038000039E0
+  001FF8001FFC001E1E001C0E003807003807003807003807003807001C0E001E1E300FFFF807FFF801E1F0> 24 31 -1 11 23.863] 103 @dc
+cmti10.329 @sf
+[<7FE0FFE07FF0> 16 3 -3 -8 16.262] 45 @dc
+[<3C00000062000000F3000000798000003180000001C0000001C0000000C0000000E0000000E0038000E0064000E00E200070
+  0E2000700E2000700E2000700710007007000038070000380700003803800038038000380380001C0380001C01C0001C01C0
+  001C01C0001C01C0000E00E000FFFFE0000E0000000E0000000E000000070000000700000007000000070030000300780003
+  8078000180380000E01000003FE0> 32 41 2 9 25.555] 12 @dc
+cmtt10.329 @sf
+[<7FFFC0FFFFE0FFFFE0FFFFE0000000000000000000000000FFFFE0FFFFE0FFFFE07FFFC0> 24 12 -2 -8 23.863] 61 @dc
+[<7FC000FFE0007FC0000E00000E00000E00000E00000E00000E00000E00000E3E000EFF800FFFC00FC1E00F80E00F00700F00
+  700E00380E00380E00380E00380E00380E00380E00700F00700F80E00FC1E07FFFC0FEFF807E3E00> 24 30 0 10 23.863] 112 @dc
+cmr10.329 @sf
+[<40201010080804040474FCFCF870> 8 14 -4 -18 12.626] 39 @dc
+[<000400020000000C00030000000E00070000000E00070000001E00078000001F000F8000001F000F8000001F000F8000003C
+  801E4000003C801E4000003C801E40000078C03E20000078403C20000078403C200000F0403C100000F02078100000F02078
+  100001F02078080001E010F0080001E010F0080003E010F00C0003C009E0040003C009E0040003C009E00400078007C00200
+  078007C00200078007C002000F0007C001000F00078001000F00078003801F800FC007C0FFF07FF81FF0> 48 32 -1 1 46.716] 87 @dc
+cmbx10.432 @sf
+[<007FFFF8007FFFF8007FFFF80000FE000000FE000000FE000000FE000000FE000000FE000000FE000000FE00FFFFFFF8FFFF
+  FFF8FFFFFFF8E0007E0070007E0038007E001C007E000E007E000E007E0007007E0003807E0001C07E0000E07E0000E07E00
+  00707E0000387E00001C7E00000E7E00000E7E0000077E000003FE000001FE000000FE000000FE0000007E0000003E000000
+  1E0000000E00> 32 39 -2 0 34.370] 52 @dc
+[<00001FF800000001FFFF00000007FFFFC000000FF007E000003FC000F000007F00003800007E00001C0000FE00001C0001FE
+  00000E0001FC00000E0003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000
+  070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC00000700
+  03FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC0000070003FC
+  0000070003FC0000070003FC0000070003FC0000070003FC00000700FFFFF001FFFCFFFFF001FFFCFFFFF001FFFC> 48 41 -3 0 52.883] 85 @dc
+[<FFFE000000FFFE000000FFFE0000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE0000000
+  0FE00000000FE00000000FE1FC00000FE7FF80000FFE0FE0000FF803F0000FF001F8000FE001FC000FE000FE000FE000FF00
+  0FE000FF000FE0007F000FE0007F800FE0007F800FE0007F800FE0007F800FE0007F800FE0007F800FE0007F800FE0007F00
+  0FE0007F000FE000FF000FE000FE000FE000FC000FF001F8000FF803F000FFFE07E000FFE7FF8000FFE1FE0000> 40 39 -2 12 38.189] 112 @dc
+[<FFFFF00000FFFFF00000FFFFF0000003FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FC000000
+  03FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FFFFFE00
+  03FFFFFF8003FC003FE003FC000FF003FC0007F803FC0007FC03FC0003FC03FC0003FE03FC0003FE03FC0003FE03FC0003FE
+  03FC0003FE03FC0003FE03FC0003FE03FC0003FC03FC0007FC03FC0007F803FC000FF003FC003FE0FFFFFFFFC0FFFFFFFF00
+  FFFFFFF800> 40 41 -3 0 46.989] 80 @dc
+[<7FFF80007FFF80007FFF800007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0
+  000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F00000FFFFC000
+  FFFFC000FFFFC00007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F03E0007F07F0003F07F0003F8
+  7F0001F87F0000FE3E00003FFC000007F000> 32 42 -2 0 21.004] 102 @dc
+cmti10.329 @sf
+[<81F80000C6060000E80380007000C0006000E000600060006000700020003000200038002000380000003800000038000000
+  7800000078000001F800001FF000007FF00001FFE00001FF800003F8000003C0000003C00000038000000380010003800100
+  038001000180010001C0018000C003800060038000300580001C18C00007E040> 32 33 -3 1 25.555] 83 @dc
+cmbx10.329 @sf
+[<FFF00C03FFF8FFF01E03FFF806001E001F0006003F001F0006003F001F0006003F001F0006007D801F0006007D801F000600
+  F8C01F000600F8C01F000601F0601F000601F0601F000603E0301F000603E0301F000607C0181F000607C0181F000607C018
+  1F00060F800C1F00060F800C1F00061F00061F00061F00061F00063E00031F00063E00031F00067C00019F00067C00019F00
+  06F80000DF0006F80000DF0006F80000DF0007F000007F00FFF000007FF8FFE000003FF8> 48 31 -2 0 49.620] 77 @dc
+cmti10.329 @sf
+[<300300380070070066003803806200380380E100380380610038038071001C01C070801C01C038001C01C038001C01C03800
+  0E00E01C000E00E01C000E00E01C008E00E01C004700700E004700700E004780680E004740640C002630C318001C0F80F000> 40 20 -4 0 37.171] 109 @dc
+3 @bop1
+cmtt10.329 @sf
+475 307 p (mkdir) s
+24 r (/home/Staden) s
+cmr10.329 @sf
+280 417 p (2.) s
+22 r (Change) s
+16 r (to) s
+15 r (this) s
+15 r (directory) s
+-3 r 46 c
+cmtt10.329 @sf
+475 527 p (cd) s
+24 r (/home/Staden) s
+cmr10.329 @sf
+280 638 p (3.) s
+22 r (Place) s
+16 r (the) s
+15 r (tap) s
+1 r 101 c
+15 r (in) s
+0 r (to) s
+14 r (the) s
+15 r (tap) s
+1 r 101 c
+16 r (unit.) s
+280 731 p (4.) s
+22 r (Extract) s
+17 r (the) s
+17 r (soft) s
+0 r 119 c
+-1 r (are) s
+15 r (o\013) s
+17 r (the) s
+17 r (distribution) s
+17 r (tap) s
+1 r 101 c
+17 r (\(NOTE:) s
+17 r (the) s
+17 r (device) s
+338 787 p (name) s
+15 r (ma) s
+0 r 121 c
+14 r 98 c
+1 r 101 c
+15 r (di\013eren) s
+0 r 116 c
+14 r (on) s
+15 r 121 c
+0 r (our) s
+14 r (mac) s
+0 r (hine\):) s
+cmtt10.329 @sf
+475 897 p (tar) s
+24 r (xvf) s
+24 r (/dev/rst0) s
+cmr10.329 @sf
+280 1007 p (5.) s
+22 r 67 c
+11 r (shell) s
+10 r (users) s
+10 r (should) s
+10 r (set) s
+10 r (the) s
+10 r (en) s
+0 r (vironmen) s
+-1 r 116 c
+9 r 118 c
+-2 r (ariable) s
+cmbx10.329 @sf
+9 r (ST) s
+-3 r (ADENR) s
+-2 r (OOT) s
+cmr10.329 @sf
+338 1064 p (to) s
+17 r 98 c
+1 r 101 c
+18 r (the) s
+17 r (directory) s
+17 r (where) s
+18 r (the) s
+17 r (pac) s
+0 r 107 c
+-3 r (age) s
+16 r (is) s
+18 r (installed) s
+17 r (and) s
+17 r (source) s
+17 r (the) s
+338 1120 p (\014le) s
+cmti10.329 @sf
+16 r (staden.lo) s
+-1 r (gin) s
+cmr10.329 @sf
+15 r (found) s
+16 r (there.) s
+24 r (This) s
+16 r (is) s
+17 r 98 c
+1 r (est) s
+16 r (done) s
+16 r 98 c
+0 r 121 c
+16 r (adding) s
+16 r (lines) s
+16 r (to) s
+338 1177 p (their) s
+cmti10.329 @sf
+15 r (.lo) s
+-1 r (gin) s
+cmr10.329 @sf
+14 r (\014le:) s
+cmtt10.329 @sf
+433 1287 p (setenv) s
+24 r (STADENROOT) s
+24 r (/home/Staden) s
+433 1343 p (source) s
+24 r ($STADENROOT/staden.login) s
+cmr10.329 @sf
+338 1453 p (Users) s
+14 r (of) s
+14 r (the) s
+13 r (Bourne) s
+14 r (shell,) s
+14 r (sh,) s
+15 r (should) s
+13 r (similarly) s
+14 r (add) s
+14 r (lines) s
+14 r (their) s
+cmti10.329 @sf
+14 r (.pr) s
+-1 r (o-) s
+338 1510 p (\014le) s
+cmr10.329 @sf
+15 r (\014le:) s
+cmtt10.329 @sf
+433 1620 p (STADENROOT=/home/Staden) s
+433 1676 p (export) s
+24 r (STADENROOT) s
+433 1733 p 46 c
+24 r ($STADENROOT/staden.profile) s
+cmr10.329 @sf
+338 1843 p (The) s
+22 r (startup) s
+22 r (routines) s
+23 r (set) s
+22 r (en) s
+0 r (vironmen) s
+-1 r 116 c
+21 r 118 c
+-2 r (ariables) s
+22 r (and) s
+22 r (mo) s
+1 r (dify) s
+23 r (the) s
+338 1899 p (shell's) s
+16 r (searc) s
+0 r 104 c
+15 r (path) s
+16 r (so) s
+16 r (that) s
+16 r (it) s
+16 r (can) s
+16 r (\014nd) s
+17 r (the) s
+16 r (programs) s
+16 r (in) s
+16 r (the) s
+16 r (Staden) s
+338 1956 p 80 c
+0 r (ac) s
+-2 r 107 c
+-2 r (age.) s
+20 r (When) s
+16 r (users) s
+15 r (next) s
+16 r (log) s
+15 r (on) s
+16 r (to) s
+15 r (the) s
+16 r (system,) s
+15 r (they) s
+16 r (will) s
+15 r 98 c
+2 r 101 c
+15 r (able) s
+338 2012 p (to) s
+15 r (use) s
+15 r (the) s
+15 r (programs.) s
+cmbx10.432 @sf
+224 2155 p 52 c
+69 r (Installation) s
+23 r (on) s
+23 r (Unsupp) s
+2 r (orted) s
+23 r (Platforms) s
+cmr10.329 @sf
+224 2256 p (Install) s
+12 r (the) s
+12 r (soft) s
+0 r 119 c
+-1 r (are) s
+11 r (as) s
+12 r 121 c
+0 r (ou) s
+11 r 119 c
+0 r (ould) s
+11 r (for) s
+12 r 97 c
+12 r (supp) s
+1 r (orted) s
+12 r (mac) s
+0 r (hine.) s
+18 r 89 c
+-3 r (ou) s
+11 r (will) s
+13 r (need) s
+224 2313 p (to) s
+15 r (remak) s
+-1 r 101 c
+14 r (all) s
+15 r (executables.) s
+20 r (The) s
+14 r (script) s
+cmti10.329 @sf
+15 r (Staden) s
+3 r 14 2 ru
+13 r (instal) s
+3 r 108 c
+cmr10.329 @sf
+14 r (can) s
+15 r 98 c
+1 r 101 c
+15 r (used) s
+14 r (to) s
+15 r (help) s
+224 2369 p (recompile) s
+14 r (the) s
+14 r (pac) s
+-1 r 107 c
+-2 r (age.) s
+19 r 65 c
+13 r (large) s
+14 r 110 c
+0 r (um) s
+-1 r 98 c
+0 r (er) s
+14 r (of) s
+13 r (assumptions) s
+14 r (ha) s
+0 r 118 c
+-1 r 101 c
+12 r 98 c
+2 r (een) s
+13 r (made,) s
+224 2426 p (and) s
+15 r 121 c
+0 r (ou) s
+14 r (ma) s
+0 r 121 c
+14 r (need) s
+15 r (to) s
+15 r 99 c
+0 r (hange) s
+14 r (the) s
+15 r (mak) s
+0 r (e\014les) s
+14 r (to) s
+15 r (suit) s
+16 r 121 c
+-1 r (our) s
+15 r (system.) s
+295 2482 p (The) s
+15 r (sources) s
+15 r (ha) s
+-1 r 118 c
+-1 r 101 c
+14 r 98 c
+1 r (een) s
+15 r (organised) s
+15 r (in) s
+0 r (to) s
+14 r (sub) s
+1 r (directories) s
+15 r (of) s
+15 r (the) s
+15 r (directory) s
+cmbx10.329 @sf
+224 2539 p (src) s
+cmr10.329 @sf
+0 r 46 c
+22 r (In) s
+cmbx10.329 @sf
+16 r (Misc) s
+cmr10.329 @sf
+15 r (are) s
+16 r (routines) s
+16 r (common) s
+15 r (to) s
+16 r (man) s
+0 r 121 c
+15 r (programs.) s
+21 r (They) s
+16 r (should) s
+16 r 98 c
+1 r 101 c
+224 2595 p (made) s
+19 r (\014rst.) s
+31 r (In) s
+cmbx10.329 @sf
+19 r (staden) s
+cmr10.329 @sf
+19 r (are) s
+19 r (all) s
+19 r (the) s
+18 r (programs) s
+19 r (of) s
+19 r (the) s
+19 r (Staden) s
+19 r (suite) s
+18 r 40 c
+cmti10.329 @sf
+0 r (mep) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+224 2652 p (nip) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+18 r (pip) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+18 r (sap) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+17 r (sip) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+18 r (dap) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+18 r (gip) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+17 r (vep) s
+cmr10.329 @sf
+0 r 44 c
+cmti10.329 @sf
+18 r (lip) s
+cmr10.329 @sf
+17 r (and) s
+cmti10.329 @sf
+17 r 114 c
+-1 r (ep) s
+cmr10.329 @sf
+0 r 41 c
+16 r (with) s
+17 r (the) s
+18 r (exception) s
+17 r (of) s
+cmti10.329 @sf
+17 r 98 c
+-1 r (ap) s
+cmr10.329 @sf
+0 r 46 c
+925 2776 p 51 c
+@eop
+4 @bop0
+cmbx10.329 @sf
+[<C00000E00000E000007000007000003800003800003800001C00001C00001C00000E00000E00000700000700000700000380
+  0003800003800001C00001C00000E00000E00000E000007000007000003800003800003800001C00001C00001C00000E0000
+  0E000007000007000007000003800003800003800001C00001C00000E00000E0000060> 24 45 -3 11 26.136] 47 @dc
+cmr10.329 @sf
+[<3F006180F0C0F060607000700070007000700070007000700070007000700070007000700070007000700070007000700070
+  007000F007F0007000000000000000000000000000E001F001F001F000E0> 16 40 2 9 13.889] 106 @dc
+cmbx10.432 @sf
+[<00FF800003FFF0000FFFF8001F01FE003C007F0078003F8078003F80FC001FC0FE001FC0FE001FE0FE001FE0FE001FE07C00
+  1FE018001FE000001FE000001FE000001FC000001FC000001F800C003F000E003E000F80FC000FFFF8000E7FC0000E000000
+  0E0000000E0000000E0000000E0000000E0000000FFE00000FFFC0000FFFE0000FFFF0000FFFF8000FFFFC000FFFFE000F80
+  3F000C000300> 32 39 -3 0 34.370] 53 @dc
+[<0000FFE00000000FFFFE0000003FC07F800000FF001FE00001FC0007F00003F80003F80007F00001FC000FF00001FE001FE0
+  0000FF001FE00000FF003FC000007F803FC000007F807FC000007FC07FC000007FC07F8000003FC0FF8000003FE0FF800000
+  3FE0FF8000003FE0FF8000003FE0FF8000003FE0FF8000003FE0FF8000003FE0FF8000003FE0FF8000003FE0FF8000003FE0
+  7F8000003FC07F8000003FC07F8000003FC07FC000007FC03FC000007F803FC000007F801FE00000FF001FE00000FF000FF0
+  0001FE0007F00001FC0003F80003F80001FC0007F000007F001FC000003FC07F80000007FFFC00000000FFE00000> 48 41 -4 0 51.638] 79 @dc
+[<FFFE3FFF80FFFE3FFF80FFFE3FFF800FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F800
+  0FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FE003F8000FF003F800
+  0FF003F8000FF803F8000FF803F0000FEE03F0000FE787E0000FE1FFC0000FE07F00000FE00000000FE00000000FE0000000
+  0FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE0000000FFE0000000
+  FFE0000000FFE0000000> 40 42 -3 0 38.189] 104 @dc
+[<C01FF000E0FFFE00F3FFFF00FFE01F80FF0007C0FC0003E0F80003F0F00001F0F00001F0E00001F8E00001F8E00001F86000
+  01F8000003F8000007F800000FF800007FF00007FFF000FFFFE003FFFFE007FFFFC00FFFFF801FFFFE003FFFFC007FFFE000
+  7FFE0000FFC00000FF000000FE0000E0FE0000E0FC0000E0FC0001E0FC0001E07C0001E07C0003E03E0007E01F001FE00F80
+  7FE007FFF9E003FFF0E0007F8060> 32 41 -4 0 38.189] 83 @dc
+[<00078003C00000078003C000000FC007E000000FC007E000000FC007E000001FE00FF000001FE00FF000003FF01FF800003F
+  F01FB800003FF01FB800007F783F3C00007F383F1C0000FF383F1E0000FE1C7E0E0000FE1C7E0E0001FE1EFC0F0001FC0EFC
+  070001FC0EFC070003F807F8038003F807F8038007F807F803C007F003F001C007F003F001C00FE007E000E0FFFE7FFC0FFE
+  FFFE7FFC0FFEFFFE7FFC0FFE> 48 27 -1 0 49.646] 119 @dc
+[<0001C000000003E000000003E000000007F000000007F00000000FF80000000FF80000000FF80000001FDC0000001FDC0000
+  003FDE0000003F8E0000007F8F0000007F070000007F07000000FE03800000FE03800001FC01C00001FC01C00003FC01E000
+  03F800E00007F800F00007F000700007F0007000FFFE03FF80FFFE03FF80FFFE03FF80> 40 27 -1 0 36.280] 118 @dc
+cmbx10.329 @sf
+[<FFFF0FFFF0FFFF0FFFF007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E00
+  07E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007FFFFFE0007FFFFFE0007E0007E0007E0007E0007E0007E00
+  07E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E0007E00FFFF0FFFF0
+  FFFF0FFFF0> 40 31 -2 0 40.908] 72 @dc
+[<0007FC0600003FFF8E0000FE01FE0003F000FE0007E0007E000FC0007E001F80007E003F00007E003F00007E007F00007E00
+  7E00007E007E00007E00FE003FFFE0FE003FFFE0FE00000000FE00000000FE00000000FE00000000FE000000007E00000600
+  7E000006007F000006003F00000E003F00000E001F80001E000FC0001E0007E0003E0003F000FE0000FE03DE00003FFF0E00
+  0007FC0200> 40 31 -3 0 41.097] 71 @dc
+[<0030018000007803C000007803C000007803C00000FC07E00000FC07E00001F60FB00001F60F300001F60F300003E31E1800
+  03E31E180007C1BE0C0007C1BC0C0007C1BC0C000F80F806000F80F806001F00F803001F00F00300FFE7FE1FE0FFE7FE1FE0> 40 20 -1 0 37.751] 119 @dc
+[<387CFEFEFE7C38> 8 7 -4 0 14.520] 46 @dc
+[<00FFFE00FFFE0007C00007C00007C00007C00007C00007C0FFFFFEFFFFFEE003C07003C03803C01803C00C03C00E03C00703
+  C00383C00183C000C3C00063C00073C0003BC0001FC0000FC00007C00007C00003C00001C0> 24 29 -1 0 26.136] 52 @dc
+[<FFFFFF80FFFFFF8007E01F8007E00F8007E0038007E0038007E001C007E001C007E001C007E000C007E000C007E000C007E0
+  000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E00000
+  07E0000007E0000007E0000007E00000FFFF8000FFFF8000> 32 31 -2 0 31.438] 76 @dc
+[<0003FC0000001FFF8000007E03C00000F800600001F000300001F000180003E000180007E0000C0007E0000C0007E0000C00
+  07E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C00
+  07E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C0007E0000C00FFFF01FFE0
+  FFFF01FFE0> 40 31 -2 0 40.213] 85 @dc
+[<001FFFE000001FFFE0000000FC00000000FC00000000FC00000000FC00000000FC00000000FC00000000FC00000000FC0000
+  0000FC00000000FC00000000FC00000001FC00000001FE00000003FF00000007F30000000FE18000000FE1C000001FC0C000
+  003F806000003F807000007F00300000FE00180001FC001C0001FC000C0003F800060007F000070007F0000380FFFE003FF8
+  FFFE003FF8> 40 31 -1 0 39.519] 89 @dc
+[<FFF00FFFE0FFF00FFFE0070000FE00030001FC00018003F80000C003F80000E007F00000700FE00000300FC00000181FC000
+  001C3F8000000E7F000000067E00000003FE00000003FC00000003F800000003F800000007F00000000FF00000000FF80000
+  001FD80000003F8C0000007F060000007E07000000FE03800001FC01800003F800C00003F800E00007F0007000FFFE07FF80
+  FFFE07FF80> 40 31 -2 0 39.519] 88 @dc
+[<FFFF0000FFFF000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0000007E0
+  000007E0000007FFFE0007FFFF8007E00FE007E003F007E001F807E001F807E001FC07E001FC07E001FC07E001FC07E001FC
+  07E001F807E001F807E003F007E00FE0FFFFFF80FFFFFE00> 32 31 -2 0 35.731] 80 @dc
+[<00018000300000000380003800000003C0007800000007C0007C00000007C0007C00000007E000FC0000000FE000FE000000
+  0FF001FE0000000FF001FE0000001FB001FB0000001F9803F30000003F9803F38000003F1803F18000003F0C07E18000007E
+  0C07E0C000007E0E0FE0C000007E060FC0C00000FC060FC0600000FC031F80600001FC031F80700001F8031F80300001F801
+  BF00300003F001BF00180003F001FF00180003F000FE00180007E000FE000C0007E000FE000C000FE000FC000E000FC000FC
+  000E00FFFE0FFFC0FFE0FFFE0FFFC0FFE0> 56 31 -1 0 54.039] 87 @dc
+[<FFFF03FFF0FFFF03FFF007E0007F0007E0007E0007E000FC0007E000FC0007E001F80007E003F00007E007E00007E00FE000
+  07E00FC00007E01F800007F03F000007F87F000007EC7E000007E6FC000007E3F8000007E1F8000007E0F0000007E0300000
+  07E018000007E00C000007E006000007E003800007E001C00007E000600007E000300007E000180007E0001E00FFFF00FFE0
+  FFFF00FFE0> 40 31 -2 0 40.971] 75 @dc
+cmbx10.432 @sf
+[<003FC00000FFF00003FFFC0007E07E000FC03F001F803F801F801FC03F001FC03F001FE07F001FE07F001FE07F001FE07F00
+  1FE0FF001FE0FF001FE0FF001FE0FF001FC0FF801FC0FF801F80FFC03F00FFC03E00FF707C00FF3FF800FF0FC000FF000000
+  7F0000007F0000007F0000003F8000003F801F001F803F801FC03F800FC03F8007E03F8003F01F8001FC0F0000FFFE00003F
+  FC000007F000> 32 39 -3 0 34.370] 54 @dc
+[<00000E00000700000000001F00000F80000000001F00000F80000000001F80001F80000000003F80001FC0000000003F8000
+  1FC0000000003FC0003FC0000000007FC0003FE0000000007FC0003FE000000000FFE0007FF000000000FFE0007FF0000000
+  00FFF000FFF000000001FE7000FF3800000001FE7000FF3800000001FE7801FF3800000003FC3801FE1C00000003FC3801FE
+  1C00000007FC1C03FC1E00000007F81C03FC0E00000007F81E07FC0E0000000FF00E07F8070000000FF00E07F8070000001F
+  F00F0FF0078000001FE0070FF0038000001FE0070FF0038000003FE0039FE003C000003FC0039FE001C000003FC003FFE001
+  C000007F8001FFC000E000007F8001FFC000E00000FF8000FF8000F00000FF0000FF8000700000FF0000FF8000700001FF00
+  00FF0000780001FE0000FF0000380001FE0001FF0000380003FC0001FE00001C0003FC0001FE00001C00FFFFE07FFFF007FF
+  F0FFFFE07FFFF007FFF0FFFFE07FFFF007FFF0> 72 41 -1 0 71.065] 87 @dc
+[<FFFE001FFFFEFFFE001FFFFEFFFE001FFFFE01C00000FF0001C00000FF0001E00001FF0000E00001FE0000E00001FE000070
+  0003FC0000700003FC0000780007FC00003FFFFFF800003FFFFFF800001FFFFFF000001C000FF000001E001FF000000E001F
+  E000000E003FE0000007003FC0000007003FC0000007807FC0000003807F80000003C0FF80000001C0FF00000001C0FF0000
+  0000E1FE00000000E1FE00000000F3FE0000000073FC0000000073FC000000003FF8000000003FF8000000003FF800000000
+  1FF0000000001FF0000000000FE0000000000FE0000000000FE00000000007C00000000007C00000000003800000> 48 41 -2 0 51.970] 65 @dc
+[<FFFFFFFFF0FFFFFFFFF0FFFFFFFFF003FC001FF803FC0003F803FC0001F803FC0000F803FC00007803FC00003803FC00003C
+  03FC00001C03FC00001C03FC00001C03FC01C01C03FC01C00E03FC01C00E03FC01C00E03FC03C00003FC03C00003FC0FC000
+  03FFFFC00003FFFFC00003FFFFC00003FC0FC00003FC03C00003FC03C00003FC01C03803FC01C03803FC01C03803FC01C078
+  03FC00007003FC00007003FC00007003FC0000F003FC0001F003FC0001F003FC0007F003FC001FE0FFFFFFFFE0FFFFFFFFE0
+  FFFFFFFFE0> 40 41 -3 0 45.163] 69 @dc
+[<FFFFFC0000FFFFFC0000FFFFFC000003FC00000003FC00000003FC00000003FC00000003FC00000003FC00000003FC000000
+  03FC00000003FC00000003FC03800003FC03800003FC03800003FC03800003FC07800003FC07800003FC1F800003FFFF8000
+  03FFFF800003FFFF800003FC1F800003FC07800003FC07800003FC03800003FC03807003FC03807003FC03807003FC0000F0
+  03FC0000E003FC0000E003FC0000E003FC0001E003FC0001E003FC0003E003FC000FE003FC003FC0FFFFFFFFC0FFFFFFFFC0
+  FFFFFFFFC0> 40 41 -3 0 43.253] 70 @dc
+[<1C003E007F00FF80FF80FF807F003E001C00> 16 9 -5 0 19.094] 46 @dc
+cmti10.329 @sf
+[<00FF80000300F8000C000F00100001C01000000020000000403E0F004061988040C0F84041C0382081C03C2081C01C108180
+  1C1041C01C1041C00E0841C00E0841C00E0841C00E0820E0070820E00704107007041030070408180408040C080804061808
+  0201E008010000100080002000600020001800C0000603000001FC00> 32 32 -6 0 34.847] 64 @dc
+[<07C000183800300400700200700100F00000F00000F00000F00000F000007800007800007800003C02001C07001E07800E07
+  8003008001C100007E00> 24 20 -4 0 20.908] 99 @dc
+[<601E00E0310070310070708070708070708038384038700038700038E0001FC0001E00001D00001C80000E40C00E21E00E11
+  E00E08E00704200703C007000007000003800003800003800003800001C00001C00001C00001C0000FE00001E000> 24 32 -3 0 20.908] 107 @dc
+cmr10.329 @sf
+[<0F800030E000407000407800F03800F83C00F83C00F83C00203C00003C00003C00003C00003C00003C00003C00003C00003C
+  00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00003C00007C000FFFC0> 24 32 -2 1 23.358] 74 @dc
+cmti10.329 @sf
+[<3C0000630000F1800079C00030E00000E00000E000007000007000007000007000003800003800003800003800001C00001C
+  00001C00001C00000E00000E00000E00020E00010700010700008700008700004600003C0000000000000000000000000000
+  00000000000000000003800003C00003C0000180> 24 40 2 9 13.939] 106 @dc
+cmr10.329 @sf
+[<FFF8FFF80F800F8007800F0007800F0007800F0007800F0007800F0007800F0007800F0007800F0007800F0007800F000780
+  0F0007800F0007800F0007800F0007FFFF0007800F0007800F0007800F0007800F0007800F0007800F0007800F0007800F00
+  07800F0007800F0007800F0007800F000F800F80FFF8FFF8> 32 31 -2 0 34.090] 72 @dc
+4 @bop1
+cmr10.329 @sf
+224 307 p (Co) s
+1 r (de) s
+14 r (for) s
+14 r (our) s
+14 r (latest) s
+14 r (sequence) s
+14 r (assem) s
+0 r (bly) s
+13 r (program) s
+cmti10.329 @sf
+14 r 98 c
+-2 r (ap) s
+cmr10.329 @sf
+13 r (is) s
+14 r (in) s
+14 r (directories) s
+cmbx10.329 @sf
+14 r (bap) s
+cmr10.329 @sf
+224 364 p (and) s
+cmbx10.329 @sf
+16 r (bap/osp-bits) s
+cmr10.329 @sf
+0 r 46 c
+24 r (Mak) s
+0 r 101 c
+15 r (the) s
+16 r (ob) s
+3 r (jects) s
+16 r (in) s
+cmbx10.329 @sf
+16 r (staden) s
+cmr10.329 @sf
+17 r (\014rst,) s
+16 r (then) s
+17 r (the) s
+16 r (ones) s
+16 r (in) s
+cmbx10.329 @sf
+224 420 p (bap/osp-bits) s
+cmr10.329 @sf
+0 r 44 c
+17 r (and) s
+16 r (\014nally) s
+16 r (the) s
+16 r (ones) s
+16 r (in) s
+cmbx10.329 @sf
+16 r (bap) s
+cmr10.329 @sf
+0 r 46 c
+24 r (In) s
+cmbx10.329 @sf
+16 r (ted) s
+cmr10.329 @sf
+16 r (is) s
+16 r (the) s
+16 r (trace) s
+16 r (editing) s
+224 477 p (program.) s
+cmbx10.432 @sf
+224 620 p 53 c
+69 r (Other) s
+23 r (Soft) s
+-1 r 119 c
+-2 r (are) s
+22 r (Pro) s
+-1 r (vided) s
+cmr10.329 @sf
+224 721 p (Other) s
+13 r (soft) s
+0 r 119 c
+-1 r (are) s
+11 r (and) s
+13 r (scripts) s
+13 r (can) s
+13 r 98 c
+1 r 101 c
+13 r (found) s
+13 r (in) s
+13 r (the) s
+cmbx10.329 @sf
+13 r (alf) s
+cmr10.329 @sf
+5 r 44 c
+cmbx10.329 @sf
+13 r (abi) s
+cmr10.329 @sf
+0 r 44 c
+cmbx10.329 @sf
+13 r (cop) s
+cmr10.329 @sf
+0 r 44 c
+cmbx10.329 @sf
+14 r (getMCH) s
+cmr10.329 @sf
+0 r 44 c
+cmbx10.329 @sf
+224 778 p (scf) s
+cmr10.329 @sf
+5 r 44 c
+cmbx10.329 @sf
+17 r (frog) s
+cmr10.329 @sf
+18 r (and) s
+cmbx10.329 @sf
+17 r (scripts) s
+cmr10.329 @sf
+17 r (directories.) s
+25 r (Eac) s
+0 r 104 c
+15 r (directory) s
+17 r (con) s
+0 r (tains) s
+16 r (do) s
+1 r (cumen) s
+0 r (ta-) s
+224 834 p (tion) s
+15 r (describing) s
+15 r (the) s
+16 r (programs) s
+15 r (con) s
+-1 r (tained.) s
+295 891 p (Since) s
+18 r (release) s
+19 r 118 c
+0 r (ersion-1993.0) s
+18 r 119 c
+-1 r 101 c
+18 r (ha) s
+0 r 118 c
+-2 r 101 c
+18 r (distributed) s
+19 r (the) s
+cmti10.329 @sf
+19 r (squirr) s
+-2 r (el) s
+19 r (\(v1.4\)) s
+cmr10.329 @sf
+224 947 p (pac) s
+0 r 107 c
+-3 r (age.) s
+19 r (Please) s
+13 r (read) s
+12 r (the) s
+13 r (disclaimer) s
+13 r (that) s
+13 r (accompanies) s
+13 r (this) s
+13 r (soft) s
+-1 r 119 c
+-1 r (are.) s
+18 r (Ad-) s
+224 1003 p (ditional) s
+12 r (sources) s
+12 r (and) s
+12 r (scripts) s
+12 r (can) s
+13 r 98 c
+1 r 101 c
+12 r (found) s
+12 r (in) s
+cmbx10.329 @sf
+12 r (expGetSeq) s
+cmr10.329 @sf
+0 r 44 c
+cmbx10.329 @sf
+13 r 118 c
+-1 r (ep) s
+1 r 101 c
+cmr10.329 @sf
+0 r 44 c
+cmbx10.329 @sf
+12 r (newted) s
+cmr10.329 @sf
+224 1060 p (and) s
+cmbx10.329 @sf
+15 r (squirrel-1.4) s
+cmr10.329 @sf
+15 r (directories.) s
+295 1116 p (Man) s
+-1 r 121 c
+19 r (scripts) s
+19 r (\(including) s
+cmti10.329 @sf
+20 r (squirr) s
+-2 r (el) s
+cmr10.329 @sf
+0 r 41 c
+19 r (and) s
+19 r (\014lters) s
+19 r 119 c
+0 r (ere) s
+18 r (dev) s
+0 r (elop) s
+0 r (ed) s
+20 r (at) s
+19 r (the) s
+224 1173 p (MR) s
+0 r (C-LMB) s
+15 r (for) s
+cmbx10.329 @sf
+16 r (INTERNAL) s
+19 r (USE) s
+19 r (ONL) s
+-4 r 89 c
+cmr10.329 @sf
+0 r 46 c
+16 r 87 c
+-3 r 101 c
+15 r (are) s
+17 r 97 c
+-1 r 119 c
+-1 r (are) s
+15 r (that) s
+17 r 112 c
+1 r (eople) s
+224 1229 p (elsewhere) s
+21 r (will) s
+21 r 119 c
+0 r (an) s
+-1 r 116 c
+20 r (to) s
+21 r (dev) s
+-1 r (elop) s
+20 r (similar) s
+21 r (soft) s
+0 r 119 c
+-1 r (are.) s
+37 r 87 c
+-3 r 101 c
+20 r (include) s
+21 r (them) s
+21 r (in) s
+224 1286 p (the) s
+14 r (Staden) s
+14 r 80 c
+-1 r (ac) s
+-1 r 107 c
+-2 r (age) s
+12 r (merely) s
+14 r (as) s
+cmbx10.329 @sf
+14 r (EXAMPLES) s
+cmr10.329 @sf
+14 r (of) s
+14 r (what) s
+13 r (has) s
+14 r 98 c
+1 r (een) s
+14 r (ac) s
+0 r (hiev) s
+-1 r (ed) s
+224 1342 p (elsewhere.) s
+cmbx10.329 @sf
+31 r (THESE) s
+21 r (SCRIPTS) s
+21 r (WILL) s
+21 r (NOT) s
+22 r 87 c
+-1 r (ORK) s
+21 r (ON) s
+21 r (YOUR) s
+224 1399 p (SYSTEM) s
+18 r (WITHOUT) s
+17 r (MODIFICA) s
+-3 r (TION.) s
+cmbx10.432 @sf
+224 1542 p 54 c
+69 r (When) s
+23 r (All) s
+23 r (Else) s
+23 r 70 c
+-5 r (ails...) s
+cmr10.329 @sf
+224 1643 p (If) s
+22 r 121 c
+0 r (ou) s
+21 r (ha) s
+-1 r 118 c
+-1 r 101 c
+21 r (an) s
+0 r 121 c
+21 r (problems) s
+21 r (please) s
+22 r (con) s
+0 r (tact) s
+21 r (the) s
+22 r (authors,) s
+23 r (Ro) s
+2 r (dger) s
+15 r (Staden) s
+224 1700 p 40 c
+cmti10.329 @sf
+0 r (rs@mr) s
+-1 r (c-lmb) s
+-2 r (a.c) s
+-3 r (am.ac.uk) s
+4 r 41 c
+cmr10.329 @sf
+0 r 44 c
+11 r (Simon) s
+15 r (Dear) s
+10 r 40 c
+cmti10.329 @sf
+0 r (sd@mr) s
+-1 r (c-lmb) s
+-2 r (a.c) s
+-3 r (am.ac.uk) s
+4 r 41 c
+cmr10.329 @sf
+10 r (and) s
+11 r (James) s
+15 r (Bon\014eld) s
+224 1756 p 40 c
+cmti10.329 @sf
+0 r (jkb@mr) s
+-1 r (c-lmb) s
+-2 r (a.c) s
+-3 r (am.ac.uk) s
+4 r 41 c
+cmr10.329 @sf
+0 r 44 c
+17 r 98 c
+-1 r 121 c
+15 r (email) s
+17 r (or) s
+16 r 98 c
+0 r 121 c
+15 r (writing) s
+16 r (to) s
+16 r (us) s
+16 r (at:) s
+23 r (MR) s
+0 r 67 c
+15 r (Lab-) s
+224 1813 p (oratory) s
+18 r (of) s
+19 r (Molecular) s
+18 r (Biology) s
+-3 r 44 c
+18 r (Hills) s
+19 r (Road,) s
+19 r (Cam) s
+0 r (bridge,) s
+18 r (CB2) s
+15 r (2QH) s
+0 r 44 c
+19 r (U.K.) s
+224 1869 p 87 c
+-3 r 101 c
+15 r (also) s
+15 r 119 c
+-1 r (elcome) s
+15 r (general) s
+15 r (commen) s
+-1 r (ts) s
+15 r (on) s
+15 r (the) s
+15 r (pac) s
+0 r 107 c
+-3 r (age.) s
+925 2776 p 52 c
+@eop
+@end
diff --git a/doc/install.tex b/doc/install.tex
new file mode 100644
index 0000000..37515cc
--- /dev/null
+++ b/doc/install.tex
@@ -0,0 +1,172 @@
+\documentstyle[a4,11pt]{article}
+
+\title{Installing the Staden Package}
+\author{Simon Dear}
+\date{21 May 1993}
+
+
+
+\begin{document}
+\maketitle
+
+
+
+\section{Introduction}
+
+On the accompanying tape you will find executables for 
+one of SunOS 4.x, Sun
+Solaris 2.x, DEC Ultrix, DEC OSF/1 and Silicon Graphics SGI operating systems.
+Also there are sources for all the programs in the Staden package.
+Programs in the package are:
+\begin{description}
+
+\item[mep and xmep] Motif exploration program.
+\item[nip and xnip] Nucleotide interpretation program.
+\item[nipl] Nucleotide interpretation program (library).
+Searches nucleotide libraries for patterns of motifs.
+\item[pip and xpip] Protein interpretation program.
+\item[pipl] Protein interpretation program (library).
+Searches protein libraries for patterns of motifs.
+\item[sip and xsip] Similarity investigation program.
+\item[sipl] Similarity investigation program (library).
+Compares a probe protein or nucleic acid sequence against
+a library of sequences.
+\item[sap and xsap] The original sequence assembly program.
+\item[bap and xbap] Our latest, most advanced sequence assembly program.
+\item[dap and xdap] An obsolete assembly program, superceded by {\em bap}.
+\item[lip] Library interface program.
+\item[rep] Repeat examination program.
+\item[ted] X windows utility for displaying and editing
+fluorescent sequencing machine traces.
+\item[splitp1, splitp2 and splitp3] Refer to help/SPLITP.MEM.
+\item[sethelp] Builds online help files.
+\item[gip] Gel input program.
+\item[convert] Converts between {\em xdap\/} and {\em xbap\/} databases.
+\item[cop and cop-bap] Checks completed {\em xdap\/} and {\em xbap\/}
+databases for editing errors.
+\item[trace2seq] Extracts sequence from trace files.
+\item[getABISampleName] Extracts sample names from ABI trace files.
+\item[makeSCF] Converts existing trace files to the compact
+SCF format.
+\item[alfsplit] Splits the Pharmacia A.L.F. gel
+file into multiple files, one for each sample.
+\item[frog] Relabels lanes in ABI trace files.
+\item[+ numerous scripts (including {\em squirrel (v1.4)\/})]
+
+\end{description}
+
+
+\section{Requirements}
+
+You will need a tape drive to read the software off the distribution
+tape (QIC-150, TK50, or Exabyte). You will also need a large amount of
+disk storage to accommodate the whole package. For release
+version-1993.0, requirements were
+31Mb (SunOS 4.x),
+36Mb (Sun Solaris 2.x)
+30Mb (DEC Ultrix)
+37Mb (DEC OSF/1)
+and
+27Mb (Silicon Graphics SGI.)
+
+
+To compile the Staden package you will require:
+\begin{itemize}
+\item An ANSI C compiler.
+\item A FORTRAN-77 compiler.
+\item X11 (Release 4 or 5).
+\item GNU make (except with SunOS and Solaris 2.x.)
+\end{itemize}
+
+\section{Installation}
+
+To install the package,
+\begin{enumerate}
+\item Create a directory for where you would like the software to be
+placed. You may have to be superuser to do this.
+      \begin{verbatim} mkdir /home/Staden\end{verbatim}
+\item Change to this directory.
+      \begin{verbatim} cd /home/Staden\end{verbatim}
+\item Place the tape into the tape unit.
+\item Extract the software off the distribution tape (NOTE: the device name may be
+different on your machine):
+      \begin{verbatim} tar xvf /dev/rst0\end{verbatim}
+\item C shell users should set the environment variable {\bf STADENROOT}
+to be the directory where the package is installed and source the file
+{\em staden.login} found there. This is best done by adding lines to their
+{\em .login} file:
+\begin{verbatim}
+    setenv STADENROOT /home/Staden
+    source $STADENROOT/staden.login
+\end{verbatim}
+Users of the Bourne shell, sh, should similarly add lines their {\em .profile} file:
+\begin{verbatim}  
+    STADENROOT=/home/Staden
+    export STADENROOT
+    . $STADENROOT/staden.profile
+\end{verbatim}
+
+The startup routines set environment variables and modify the shell's
+search path so that it can find the programs in the Staden Package.
+When users next log on to the system, they will be able to use the
+programs.
+
+\end{enumerate}
+
+
+\section {Installation on Unsupported Platforms}
+
+Install the software as you would for a supported machine.  You will
+need to remake all executables.  The script {\em Staden\_install} can
+be used to help recompile the package. A large number of
+assumptions have been made, and you may need to change the makefiles
+to suit your system.
+
+The sources have been organised into subdirectories of the directory
+{\bf src}. In {\bf Misc} are routines common to many programs. They
+should be made first.  In {\bf staden} are all the programs of the
+Staden suite ({\em mep}, {\em nip}, {\em pip}, {\em sap}, {\em sip},
+{\em dap}, {\em gip}, {\em vep}, {\em lip} and {\em rep}) with the
+exception of {\em bap}.  Code for our latest sequence assembly program
+{\em bap} is in directories {\bf bap} and {\bf bap/osp-bits}.  Make
+the objects in {\bf staden} first, then the ones in {\bf
+bap/osp-bits}, and finally the ones in {\bf bap}. In {\bf ted} is the
+trace editing program.
+
+
+\section {Other Software Provided}
+
+Other software and scripts can be found in the {\bf alf\/}, {\bf
+abi\/}, {\bf cop\/}, {\bf getMCH\/}, {\bf scf\/}, {\bf frog\/} and {\bf
+scripts}
+directories.
+Each directory contains documentation describing the programs
+contained.
+
+Since release version-1993.0 we have distributed the {\em squirrel (v1.4)}
+package. Please read the disclaimer that accompanies this software.
+Additional sources and scripts can be found in {\bf expGetSeq}, {\bf vepe},
+{\bf newted} and {\bf squirrel-1.4} directories.
+
+Many scripts (including {\em squirrel}) and filters were developed at the MRC-LMB for
+{\bf INTERNAL USE ONLY}.
+We are aware that people elsewhere will want to develop
+similar software.
+We include them in the Staden Package merely as {\bf EXAMPLES} of
+what has been achieved elsewhere.
+{\bf THESE SCRIPTS WILL NOT WORK ON YOUR SYSTEM WITHOUT MODIFICATION.}
+
+\section {When All Else Fails...}
+If you have any problems please contact the authors,
+\mbox{Rodger Staden}
+\mbox{(\em rs@mrc-lmba.cam.ac.uk\/)},
+\mbox{Simon Dear}
+\mbox{(\em sd@mrc-lmba.cam.ac.uk\/)}
+and
+\mbox{James Bonfield}
+\mbox{(\em jkb@mrc-lmba.cam.ac.uk\/)},
+by email or by writing to us at:
+MRC Laboratory of Molecular Biology, Hills Road, Cambridge, \mbox{CB2 2QH}, U.K.
+We also welcome general comments on the package.
+
+\end{document}
diff --git a/doc/manual.rtf b/doc/manual.rtf
new file mode 100644
index 0000000..cbc023c
--- /dev/null
+++ b/doc/manual.rtf
@@ -0,0 +1,5154 @@
+{\rtf1\mac\deff2 {\fonttbl{\f0\fswiss Chicago;}{\f2\froman New York;}{\f3\fswiss Geneva;}{\f4\fmodern Monaco;}{\f5\fscript Venice;}{\f6\fdecor London;}{\f7\fdecor Athens;}{\f8\fdecor San Francisco;}{\f11\fnil Cairo;}{\f12\fnil Los Angeles;}
+{\f13\fnil Zapf Dingbats;}{\f14\fnil Bookman;}{\f15\fnil N Helvetica Narrow;}{\f16\fnil Palatino;}{\f18\fnil Zapf Chancery;}{\f20\froman Times;}{\f21\fswiss Helvetica;}{\f22\fmodern Courier;}{\f23\ftech Symbol;}{\f24\fnil Mobile;}{\f33\fnil Avant Garde;}
+{\f34\fnil New Century Schlbk;}}{\colortbl\red0\green0\blue0;\red0\green0\blue255;\red0\green255\blue255;\red0\green255\blue0;\red255\green0\blue255;\red255\green0\blue0;\red255\green255\blue0;\red255\green255\blue255;}{\stylesheet{\s243\qc\sa60\sl280 
+\f20 \sbasedon222\snext0 footer;}{\s244\sl220\tqc\tx4320\tqr\tx8640 \f4\fs16 \sbasedon0\snext0 header;}{\sl220 \f4\fs16 \sbasedon222\snext0 Normal,Screen Font;}{\s2\qc\sa200\sl480 \b\f20\fs36 \sbasedon222\snext2 Chapter Heading;}{\s3\sb200\sa120\sl360 
+\b\f20\fs32 \sbasedon222\snext0 Main Subheading;}{\s4\qj\sa120\sl280 \f20 \sbasedon222\snext4 Body text;}{\s5\sb400\sa60\sl320\tx560 \b\f20\fs28 \sbasedon222\snext5 Subheading;}{\s6\sb240\sa60\sl280\tx560\tx860 \b\f20 \sbasedon5\snext6 SubSub heading;}{
+\s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 \sbasedon4\snext7 Indent Body;}{\s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 \sbasedon222\snext8 Figure legends;}{\s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 \sbasedon6\snext9 SubSubSub heading;}}
+\paperw11880\paperh16820\margl1440\margr1440\widowctrl\ftnbj\ftnrestart \sectd \linemod0\linex0\cols1\endnhere \pard\plain \s2\qc\sa600\sl480 \b\f20\fs36 {\i\fs48 Contents\par 
+}\pard\plain \s7\qj\fi-560\li560\sa120\sl400\tx560\tqr\tldot\tx8980 \f20 1\tab Preface\tab 1\par 
+2\tab Introduction\tab 3\par 
+3\tab Sequence input, editing and sequence library use\tab 17\par 
+4\tab Managing sequencing projects\tab 26\par 
+5\tab Analysing sequences to find genes\tab 51\par 
+6\tab Searching for motifs in nucleic acid sequences\tab 60\par 
+7\tab Using patterns to analyse nucleic acid sequences\tab 69\par 
+8\tab Searching for restriction sites\tab 77\par 
+9\tab Statistical and structural analysis of nucleotide sequences\tab 83\par 
+10\tab Translating and listing nucleic acid sequences\tab 93\par 
+11\tab Statistical and structural analysis of protein sequences\tab 99\par 
+12\tab Searching for motifs in protein sequences\tab 104\par 
+13\tab Using patterns to analyse protein sequences\tab 112\par 
+\pard \s7\qj\fi-560\li560\sa120\sl400\tx560\tqr\tldot\tx8980 14\tab Comparing sequences\tab 123\par 
+\pard\plain \s2\qc\sa200\sl480\tqr\tldot\tx8980 \b\f20\fs36 \sect \sectd \pgnrestart\linemod0\linex0\cols1\endnhere {\footer \pard\plain \s243\qc\sa60\sl280 \f20 \chpgn \par 
+}\pard\plain \s2\qc\sa600\sl480 \b\f20\fs36 1.  Preface (November, 1992)\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This second edition of the manual contains only minor revisions. The changes are mostly to do with managing sequencing pro
+jects which is the subject on which we are currently concentrating our efforts. We have replaced our previous Developing Assembly Program DAP with another developing assembly program BAP that can assemble Bigger projects. Although this new program can hand
+le 8000 readings as opposed to the miserly 1000 of the previous version, it actualy uses its space more efficiently over the course of a project. It contains a mechanism for preventing simultaneous use (and hence corruption) of databases. In addition it is
+ approximately four times faster during assembly and five times faster when looking for "internal joins". It now contains a routine for selecting primers and templates during the "walking" stage of a project . The "find internal joins" function now calls u
+p the contig joining editor with the two contigs aligned in the window and the editor has also been speeded up. Numerous other changes have also been made but we still regard BAP as temporary, and are actively working on its replacement which we believe wi
+ll overcome the limitations that BAPs aged structure has imposed on it. We have also included routines for converting ABI 373A and Pharmacia A.L.F. data to our new trace file format, for automatically marking poor quality regions of readings from these mac
+hines and for converting DAP databases to BAP databases.\par 
+\pard \s4\qj\sa120\sl280 Other changes include providing a postscript option for saving graphics output, and  facilities for using the author and freetext indexes of the sequence libraries. The sequence library indexes are v
+ery useful and allow rapid searching. The freetext index is derived from ALL the text in the annotations - not just the keywords. We have also added a new repeat examining routine in NIP and a new repeat listing option in SIP.\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard\plain \s2\qc\sa600\sl480 \b\f20\fs36 1. 1 Preface to first edition \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+It could be said that this manual is long overdue, for, apart from the extensive online help available from within the programs, it is the first printed guide to using a package that has been around for longer than I care remember. On the other hand, to
+ misquote a cliche much used by reviewers, it could be said that this manual fills a much needed gap, in that I believe the best way to learn about computer programs is to use them. Those who are prepared to experiment and play with programs will discover 
+far more than any manual of reasonable size can hope to convey. However the manual serves to give users an overview of what is available and a starting point for their exploration of the programs.\par 
+\pard \s4\qj\sa120\sl280 One of my objectives was to be able to distribute the manua
+l on floppy disk so that each site using the programs could print as many copies as they need. We had to balance the quality of the graphics and the sophistication of the layout, against the ease of producing updates and the availability of software, and d
+ecided to to use the WORD4 program running on the Apple Macintosh. The graphics figures reproduced in the manual are far below the quality seen on the terminal screen, and in some cases should be viewed as merely schematic.\par 
+\pard \s4\qj\sa120\sl280 Most of the chapters are self-contained but users are strongly advised to read sections 3 to 7 in chapter 1, as to do so will save a lot of time.\par 
+\pard \s4\qj\sa120\sl280 In future editions we will add chapters on other programs in the package and expand the Notes sections to give more information about the theory and algorithms used. We welcome comments and suggestions for improvements.\par 
+\pard \s4\qj\sa120\sl280 I thank Brian Pashley for transforming my original documents into, what I hope will be, a useful manual.\par 
+\pard\plain \s3\sb200\sa120\sl360 \b\f20\fs32 Rodger Staden,   March 1992.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 2.  Introduction\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Materials\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1720 2.1\tab Versions\par 
+2.2\tab Terminals\par 
+2.3\tab Digitizers\par 
+2.4\tab Sequencing machines\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab User interfaces\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 3.1\tab The xterm and VAX interface\par 
+3.2 \tab The X interface\par 
+3.3\tab Use of the bell\par 
+3.4\tab Printing and saving results in files\par 
+3.5\tab Use of feature tables\par 
+3.6\tab Use of graphics\par 
+3.7\tab The active region\par 
+3.8\tab Files of file names\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Character sets\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1720 4.1\tab Character sets for finished sequences\par 
+4.2\tab Symbols used in gel readings\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Sequence formats\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1720 5.1\tab Personal sequence files\par 
+5.2\tab Sequence libraries\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Conventions used in text\par 
+7.\tab Notes\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+In this chapter we give an overview of the chapters on the "Staden Package" of programs. Here we describe the equipment required and outline the scope of the package and the user interfaces. In the next chapter we cover character sets, sequence formats and
+ sequence library access.\par 
+\pard \s4\qj\sa120\sl280 The main programs in the package are as follows\:\par 
+\pard\plain \s7\qj\sa120\sl280\tx1120 \f20 GIP\tab Gel input program\par 
+\pard \s7\qj\sa120\sl280\tx1120\tx1580 SAP\tab Sequence assembly program\par 
+\pard \s7\qj\sa120\sl280\tx1120 BAP\tab Sequence assembly program\par 
+NIP\tab Nucleotide interpretation program\par 
+PIP\tab Protein interpretation program\par 
+SIP\tab Similarity investigation program\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx1120 MEP\tab Motif exploration program\par 
+NIPL\tab Nucleotide interpretation program (library)\par 
+PIPL\tab Protein interpretation program (library)\par 
+SIPL\tab Similarity investigation program (library)\par 
+XBAP\tab Sequence assembly program\par 
+XNIP\tab Nucleotide interpretation program\par 
+XPIP\tab Protein interpretation program\par 
+XSIP\tab Similarity investigation program\par 
+XMEP\tab Motif exploration program\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 GIP uses a digitiser for entry of DNA sequences from autoradiographs. SAP, BAP and XBAP handle everything relating to assembling and edi
+ting gel readings. NIP provides functions for analysing and interpretting individual nucleotide sequences. PIP provides functions for analysing and interpretting individual protein sequences. MEP analyses families of nucleotide sequences to help discover n
+ew motifs. NIPL performs pattern searches on nucleotide sequence libraries. PIPL performs pattern searches on protein sequence libraries. SIP provides functions for comparing and aligning pairs of protein or nucleotide sequences. SIPL searches nucleotide a
+nd protein sequence libraries for entries similar to probe sequences.  The programs whose names begin with a letter X are X11 (see below) versions of the programs. For example XNIP is an X11 version of NIP.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Materials\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Versions.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The programs run on Apple Macintosh computers, on VAX computers using the VMS operating system, and on SUN workstations (which use the UNIX operating system.) The SUN version should run, with only minor changes, on other machines running UNIX and currently
+ we are aware of versi
+ons running on DEC ULTRIX, Silicon Graphics, Alliant FX2800 and Convex machines. Currently the Macintosh version is "frozen" in its April 1990 state, the VAX version is "frozen" in its April 1991 state and all development is being done on the SUN version.
+\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.1.1\tab VAX version.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The VAX version will run on any VAX using the VMS operating system. A FORTRAN compiler is required.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.1.2\tab UNIX version.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The UNIX version is being used here on SPARCstations and DECstation 5000/240s with at least 8 megabytes of memory, 20
+0 megabyte internal disk drives and 700 megabyte external disks. Colour monitors such as the GX are preferable for running the programs which display traces from fluorescent sequencing machines, but monochrome displays are adequate for all other programs. 
+We also use tape desktop backup packs for archiving, and a cdrom drive for handling the sequence libraries.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.1.3\tab Other UNIX versions.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Users of UNIX machines other than SUN SPARCstations, DECstation 5000/240 and SGI Indigo R3000 will require a FORTRAN comp
+iler and ANSI C. When operated directly on the workstation screen all UNIX versions require X11 release 4.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.1.4\tab The Macintosh version\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The Macintosh version of the package requires a machine with at least 1 megabyte of memory and a 20 megabyte hard disk.  It only operates on monochrome screens or colour screens set to black/white mode. The package contains only programs SAP, GIP, NIP, PIP
+ and SIP. All further information about this version of the package is contained in the notes.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Terminals.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program
+s can also be operated via a serial port using Tektronix terminals, PC's running MS-Kermit, or Apple Macintoshs running Versaterm Pro. The UNIX versions can also be run from X teminals or microcomputers running X emulators.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Digitizers.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The gel reading input program uses a sonic digitizer called a GRAPHBAR GP7 made by Science Accessories Corp., 200 Watson Blvd., Stratford, CT 06497, USA. When ordering specify that the device should be set to use metric units.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Sequencing machines.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The programs can handle data produced by the Applied Biosystems Inc. 373A and Pharmacia A.L.F fluorescent sequencing machines.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab User Interfaces\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The programs have two user interfaces. The first runs under the terminal emulator xterm and the second runs directly under X. On the VAX, at present only the xterm interface is available, but on UNIX systems either interface can be used. The xterm version 
+of the package will operate on the workstation screen, X terminals, Tektronix terminals, PC's or Macintoshes (see above). When run
+ on the workstation screen the programs have separate text and graphics windows, each of which can be moved, resized and iconized, and the text windiow can be scrolled in both directions. The versions that run directly under X can only be used on the works
+tation screen, X terminals or using an X emulator. They produce separate text and graphics windows, an independent, constantly available help window and a separate dialogue window. All input is controlled by mouse selection and dialogue boxes.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.1\tab The xterm and VAX interface\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The user interface is common to all programs. It consists of a set of menus and a uniform way of presenting choices and obtaining input from the user. This section describes\:
+ the menu system; how options are selected and other choices made; how values are 
+supplied to the program; how help is obtained, and how to escape from any part of a program. In addition it gives information about saving results in files and the use of graphics for presenting results.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.1.\tab Menus and option selection\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Each program has several menus and numerous options. Each menu or option has a unique number that is used to identify it. Menu numbers are distinguished from option numbers by being preceded by the letter m (or M, all programs make no distinction between u
+pper and lower case letters). With the exception of some parts of program SAP, the menus are not hierachical, rather the options they each contain are simply lists of related functions and their identifying numbers. Therefore options can be selected inde
+pendently of the menu that is currently being shown on the screen, and the menus are simply memory aides. All options and menus are selected by typing their option number when the programs present the prompt \par 
+\pard \s4\qc\sb120\sa180\sl280 "? Menu or option number =" \par 
+\pard \s4\qj\sa120\sl280 
+To select a menu type its number preceded by the letter M. To select an option type its number. If users type only "return" they will get menu m0 which is simply a list of menus. If users select an option they will return to the current menu after the func
+tion is completed. Where possible, equivalent or identical options have been given the same numbers in all programs, and so users quickly learn the numbers for the functions they employ most often.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.2\tab Execution and dialogue\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+All inputs requested by the program (apart from file names) have default values. In addition most of the analytical functions have a default path through which they will pass, so when users select an option, in many cases the program will immediately perfo
+rm the operation selected without further dialo
+gue. However if users precede an option number by the letter d (e.g. D17), they will force the program to offer dialogue about the selected option before the function operates, hence allowing them to change the value of any of its parameters. In addition, 
+alternative suboptions will be made available.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.3\tab Help\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Help about each option can be obtained by preceding the option number by the symbol ? when users are presented with the prompt "? Menu or option number", (e.g. ?17 gives help on the option 17), but 
+there are two further ways of obtaining help. Whenever the program asks a question users can respond by typing the symbol ? and they will receive information about the current option. In addition, option number 1 in all the programs will give help on all o
+f a programs functions. \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.4.\tab Quitting \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 To exit from any point in a program users type ! for quit. If a menu is on the screen this will stop the program, otherwise they will be returned to the last menu. \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.5.\tab Making selections\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Questions and choices are dealt with in three ways. Where there are choices that are not obvious opposites, or there are more than two choices, "radio buttons" and "check boxes" are used.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\pagebb\tx1140 \b\f20 3.1.5.1.\tab Choosing between opposites.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Obvious opposites such as "clear screen" and "keep picture" are presented with only the default shown. For example in this case the default is generally "keep picture" so the program will display\: \par 
+\pard\plain \li1720\sa200\sl220 \f4\fs16 "Keep picture  (y/n) (y) =" \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 and the picture will be retained if the user types Y or y or only return. If the user types N or n the picture will be cleared. Anything other than these or ? or ! will cause the question to be asked again.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.5.2. \tab Choosing one from many.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Radio buttons are used when only one of a number of choices can be made at any one time. The choices are presented arranged one above the other, each choice with a number for its selection, and the default choice marked with an X. For example when the user
+ is reading a new sequence file the following choices of format are offered.\par 
+\pard\plain \li1720\sb300\sl220\tx2460\tx3400 \f4\fs16 Select sequence file format\par 
+\pard \li1720\sl220\tx2460\tx3400 \tab 1\tab Staden\par 
+\tab 2\tab EMBL\par 
+X\tab 3\tab GenBank\par 
+\tab 4\tab PIR\par 
+\tab 5\tab GCG\par 
+       6        FASTA\par 
+\pard \li1720\sa300\sl220\tx2460\tx3400 ? Selection (1-5) (3) =\par 
+\pard\plain \s4\qj\sb60\sa120\sl280 \f20 Any single option can be selected by typing the option number, and the default option, (here shown as 3), is also obtained by typing only "return". Again help can be obtained by typing ? and  quit by typing !. 
+\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.5.3.\tab Choosing at least one from many.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Check boxes are used when any number of a set of choices can be made (i.e. the choices are not exclusive). Choices are made by typing choice numbers. Each choice c
+an be considered as a switch whose setting is reversed when it is selected. Choices that are currently switched on are marked with an X. The user quits from making selections by typing only "return". For example in the routine that plots base composition u
+sers can elect to plot the frequencies of any combination of bases, e.g. only A, or A+T, or A+T+G etc. The following check box is offered to the user\: \par 
+\pard\plain \li1720\sb300\sl220\tx2420\tx3400 \f4\fs16 X\tab 1\tab T\par 
+\pard \li1720\sl220\tx2420\tx3400 \tab 2\tab C\par 
+X\tab 3\tab A\par 
+\tab 4\tab G\par 
+\pard \li1720\sa300\sl220\tx2420\tx3400 ? Selection (1-4) ( ) =\par 
+\pard\plain \s4\qj\sb60\sa120\sl280 \f20 As shown this will plot the A+T composition. To switch off T  select 1, to switch on C select 2, etc, to quit, having set the bases required type only "return". \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 \page 3.1.6.\tab Input of numerical values \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 All input of integer or decimal numbers is presented in a standard way with the allowed range shown in brackets and the default value also in brackets. For example\: \par 
+\pard\plain \li1700\sb160\sa300\sl220 \f4\fs16 ? Window (5-31) (11) = \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this example users could type any number between 5 and 31, or "return" only, or ! or ? (see above). Any other input will cause the program to ask the question again. Typing only "r
+eturn" gives the default value (here 11). \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.1.7.\tab Input of character strings\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Character strings are requested using informative prompts of the form\:\par 
+\pard\plain \li1720\sb160\sa300\sl220 \f4\fs16 ? Search string =\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Or where possible the prompt will be preceded by a default value as in\:\par 
+\pard\plain \li1720\sb160\sl220 \f4\fs16 Default search string = atatatata\par 
+\pard \li1720\sa300\sl220 ? Search string =\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Question mark (?) or ! will get help or quit. Where appropriate, for example when a whole list of strings have been defined one after the other, typing return only will be a signal to the program that input is complete.
+\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.2.\tab The X interface\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This interface deals with all the types of interactions described above but options are selected using pulldown menus and all inputs are via appropriately styled dialogue boxes and buttons. Default values are accepted by clicking on an "OK" button, or typi
+ng return on the keyboard. Values are changed by overtyping the defaults. Quit is available from each dialogue via a "CANCEL" button. Help is constantly available via a "HELP" button in the main dialogue window. Details such as requestin
+g dialogue when an option is selected are dealt with using a button labelled "execute with dialogue" which toggles to "execute".\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.3.\tab Use of the bell \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The programs use the bell to indicate that a task is completed. When the bell sounds, the programs will wait until return is typed. Users can quit from these points by typing ! but no help is available.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.4.\tab Printing and saving results in files \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 A few of the functions in the programs automatically write their textual results to disk files, but for most functi
+ons users can choose whether results appear on the terminal screen or go to a file. For these functions the normal, or default, place for results to appear is on the screen, and users need to decide before the function is selected if they want to redirect 
+the results to a file. In all programs the option "Redirect output" gives control over whether results appear on the screen or go to a file. When a program is started results will be sent to the screen. If the option "Redirect output" is selected users wil
+l be given the choice of redirecting either text or graphics to a file or of creating a postscript file for the graphics. The program will then ask users to supply a file name. If users elect to redirect output, from that point on ,all results will be sent
+ to the file until the option is selected again, in which case the "redirection file" will be closed, and results will again appear on the screen. If these files contain textual results they can be looked at from within the programs by using option  "List 
+a text file". Once the program is left users can employ an appropriate system command to print the files. There is no function within the programs to direct files to a printer. If users elect to create a postscript file for the graphics the graphics will a
+lso appear on the screen. If they redirect graphics the graphics commands (in Tektronix codes) will only go to the file and will not appear on the screen\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 3.5.\tab Use of feature tables\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 One particular use of redirection should be noted. The programs can use EMB
+L/GenBank feature tables as input for directing translation of DNA to protein, etc, but the tables must be stored in separate text files, and cannot be read directly from the sequence libraries. The only routines that can read the sequence libraries are th
+ose available under "Read a sequence". So to create a text file containing the feature table for a particular library entry users must redirect text output to disk, and then use the "Read a sequence" to display the appropriate feature table. The feature ta
+ble will be written to the file, and then the file can be used for controlling translation etc. Note however that the redirection mechanism is a general function and it therefore does not add the required header and tail to saved files. To make the files u
+seable as feature tables they need, as a minimum, a line at the top with the word FEATURES starting in column 1, and two empty lines at the end of the file!\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.6.\tab Use of graphics \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The analytical programs including NIP, PIP and SIP present the results of many of their analyses graphically.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.1.\tab The drawing board and plot positions\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The position at which the results for any function appear on the screen is defined relative to a notional users "drawing board" of dimension 10,000 by 10,000. This drawing board fills the screen and results are drawn in windows defined using symbols x0,y0 
+and xlength,ylength, where x0,y0 is the position of the bottom left hand corner of the window, and xlength is the width of the window and ylength the height of the window. The win
+dow positions for each option are read from a file when a program is started. If required individual users can have their own set of plot positions, and also the positions can be redefined from within the programs using the option "Reposition plots".
+\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.2.\tab The plot interval\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+For those analyses that draw continuous lines to represent results (for example a plot of base composition) the user is asked to supply the "Plot interval". All the analyses produce a value for every point along the sequence but often i
+t is unnecessary to actually plot the values for all the points. The plot interval is simply the distance between the points shown on the screen. If the user selects a plot interval of 1, every point will be plotted; a plot interval of 3 will show every th
+ird point. \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.3.\tab The window length\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The word "window" is used in a further way by the programs. Most of the functions that analyse the content of a sequence (the simplest such routine plots the base composition) perform their calculations over a segment o
+f the sequence of a certain length, display the result, then move on by 1 position, and recalculate. The fixed size of segment over which a calculation is performed is called a "window" and the segment size is the "window length". Many analytical functions
+ request "? Window length =", or more frequently "? Odd window length =". An odd number is used so that when a result is displayed for a particular window position it is derived from an equal number of points either side of the windows' midpoint.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.4.\tab Use of the cross hair\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+All programs that produce graphical output provide a function for using a cross hair to examine the plots. After the cross hair function is selected the cross will appear in the graphics window and can be steered around using the mouse or directional keys.
+ Special keyboard characters hit while the function is in operation produce the following results. For all programs the letter s (for sequence) will show the local sequence around the cross hair position. For the sequence comparison pro
+grams that show a dot matrix the two sequences will be displayed above one another. For the sequencing project management programs all the aligned sequences in the contig will be displayed. For the sequence comparison programs the letter m (for matrix) wil
+l show a matrix in which all identical characters for a window around the cross hair are marked. The punctuation symbol , will show the local position in sequence units, but leave the cross hair on the screen, whereas the space bar and any other non-specia
+l character will show the local position and exit the cross hair function. Further special characters are defined in the chapter on managing sequencing projects.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.5\tab Drawing scales on plots\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 All the programs have a function "Draw a ruler" which will allow users to add scales to the axes of graphical plots. The scale can be positioned anywhere on the plot.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 3.6.6\tab Saving graphics\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The best way of saving the graphics is to use the "Redirect output" function to open a postscript file which will then contain a co
+py of all plots that appear on the screen. This of course requires the file to be opened before the plots are drawn. Many terminals are not capable of dumping their screen contents to a file for subsequent printing. One convenient way of obtaining hard cop
+y of graphical results is to use a micro computer as a terminal. On the Macintosh we use the terminal emulator versa termPro. This allows graphics to be saved as Macintosh files that can be annotated and printed using Macdraw and other painting programs. A
+lternatively graphics can be redirected to a file and printed using a laser printer with tektronix capability (see "Printing and saving results in files"). \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.7.\tab The active region\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+All the analytical programs use an "active region" for most of their functions. This is simply the current section of the sequence over which the analysis will be applied. When a sequence is first read in the active region will be set to its whole length, 
+but the user can restrict the scope of analytical functions by use of an opt
+ion called "Define active region". However some functions such as "List the sequence" are always given access to the whole sequence and will allow the user to define a limited range after they have been selected.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 3.8.\tab Files of file names\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+A useful device that is employed by many of the programs is that of "files of file names". If a program needs to perform the same operation in turn on each of 20 files, the user should not have to type in 20 file names. Instead the user types in the name o
+f a single file which contains the names of the other 20 files. This single file is a file of file names. They are used, for example, to process batches of gel readings, or to compare a sequence against a library of motifs.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab Character Sets\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 There are two types of character sets employed by the programs\: those for finished sequences and those used during sequencing projects.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 4.1\tab Character sets for finished sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The analytical programs will operate with uppercase or lowercase sequence characters. For nucleic acids T and 
+U are equivalent. For proteins the standard 1 letter codes are used. The analytical programs also use IUB symbols for redundancy in back translations and for sequence searches. The symbols are shown in table 2.1  \par 
+\pard \s4\qj\li2260\ri2220\sb300\sa120\sl280\box\brsp100\brdrth \tx3420\tx4800 A,C,G,T\par 
+\pard \s4\qj\li2260\ri2220\sa120\sl280\box\brsp100\brdrth \tx3420\tx4800 R\tab (A,G)\tab 'puRine'\par 
+Y\tab (T,C)\tab 'pYrimidine'\par 
+W\tab (A,T)\tab 'Weak'\par 
+S\tab (C,G)\tab 'Strong'\par 
+M\tab (A,C)\tab 'aMino'\par 
+K\tab (G,T)\tab 'Keto'\par 
+H\tab (A,T,C)\tab 'not G'\par 
+B\tab (G,C,T)\tab 'not A'\par 
+V\tab (G,A,C)\tab 'not T'\par 
+D\tab (G,A,T)\tab 'not C'\par 
+\pard \s4\qj\li2260\ri2220\sa120\sl280\keepn\box\brsp100\brdrth \tx3420\tx4800 N\tab (G,A,C,T)\tab 'aNy'\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Table 1.1\tab The NC-IUB characters used by the analytical programs\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 4.2\tab Symbols used in gel readings\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Th
+e information stored about a sequence reading has to show the original sequence, recording any doubts about its interpretation, and also, where possible, allow the changes made during editing to be indicated. Lowercase characters are used by the sequence p
+roject management programs for recording readings, and uppercase symbols are used when changes are made during editing. Alternatively the reverse convention can be used. Any other characters in a sequence are treated as dash (-) characters. The symbols are
+ shown in table 2.2.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 5.\tab Sequence Formats\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The data formats for the programs that deal with sequencing projects are described in the chapter on managing sequencing projects. All analytical programs can read sequences stored in several formats. We distinguish between two sources of input namely\:
+ "sequence libraries" and "personal files".\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard \s4\qj\li1120\ri1200\sa120\sl280\box\brsp100\brdrth \tqc\tx2800 {\b Symbol \tab Meaning}\par 
+\pard \s4\qj\li1120\ri1200\sl280\box\brsp100\brdrth \tx1400\tqc\tx2800\tqc\tx4240\tqc\tx5640\tx6820 \tab c\tab Definitely\tab c\par 
+\tab t\tab "\tab t\par 
+\tab a\tab "\tab a\par 
+\tab g\tab "\tab g\par 
+\tab 1\tab Probably\tab c\par 
+\tab 2\tab "\tab t\par 
+\tab 3\tab "\tab a\par 
+\tab 4\tab "\tab g\par 
+\tab d\tab "\tab c\tab Possibly\tab cc\par 
+\tab v\tab "\tab t\tab "\tab tt\par 
+\tab b\tab "\tab a\tab "\tab aa\par 
+\tab h\tab "\tab g\tab "\tab gg\par 
+\tab k\tab "\tab c\tab "\tab c-\par 
+\tab l\tab "\tab t\tab "\tab t-\par 
+\tab m\tab "\tab a\tab "\tab a-\par 
+\tab n\tab "\tab g\tab "\tab g-\par 
+\tab r\tab a or g\par 
+\tab y\tab c or t\par 
+\tab 5\tab a or c\par 
+\tab 6\tab g or t\par 
+\tab 7\tab a or t\par 
+\tab 8\tab g or c\par 
+\tab -\tab a or g or c or t\par 
+\pard \s4\qj\li1120\ri1200\sl280\box\brsp100\brdrth \tx1400\tqc\tx3780\tqc\tx4240\tqc\tx5640\tx6820 \tab A\tab a set by auto edit or corrected by user\par 
+\tab C\tab c set by auto edit or corrected by user\par 
+\tab G\tab g set by auto edit or corrected by user\par 
+\tab T\tab t set by auto edit or corrected by user\par 
+\pard \s4\qj\li1120\ri1200\sl280\box\brsp100\brdrth \tx1400\tqc\tx4020\tqc\tx5640\tx6820 \tab *\tab padding character placed by auto assembler\par 
+\pard \s4\qj\li1120\ri1200\sl280\keepn\box\brsp100\brdrth \tx1400\tqc\tx2800\tqc\tx4240\tqc\tx5640\tx6820 else = -\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa300\sl240\tx1140 \f21\fs20 Table 2.2\tab The symbols used to record gel readings\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 5.1\tab Personal sequence files\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The programs can read sequences from files in PIR, EMBL, GenBank, GCG, FASTA and Staden formats. Staden format
+ means text files with records of up to 80 characters; all spaces are removed; lines with ";" in the first position are treated as comments and will be displayed when the file is read but not included in the sequence; if the first line of data contains a 2
+0 character header of the form <---abcdefghij-----> it too will not be included in the processed sequence. This last facility allows the programs to read consensus sequences created by the sequence project management programs. Files in PIR format can conta
+in any number of entries (which the user selects by entry name), but all other formats are expected to contain only one sequence. If they contain more only the first will be read.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 5.2\tab Sequence libraries\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Users may not appreciate the fact that because the sequence libraries are so large, programs need to use indexes to provide rapid retrieval of individual entries. An index is a list of entry names and pairs of offsets. For each entry name the offsets defin
+e the position at which its sequence and annotation s
+tart in the large file. The index, which is in any case relatively small, is arranged so that it can be searched quickly - for example the EMBL cdrom index is sorted alphabetically. When the user supplies an entry name the program rapidly finds it in the i
+ndex file and then uses the associated offsets to locate the entry in the larger sequence files.\par 
+\pard \s4\qj\sa120\sl280 The sequence libraries are stored in different ways on the VAX and the SUN. On the VAX we adopted the widely used PIR format and indexing method and on the SUN we use the EMBL cdrom format and indexes.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 5.2.1\tab Sequence libraries on the VAX\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+On the VAX all libraries are stored in PIR format, and except for the facility to select entries by accession number, the same functions are provided as those on the SUN.  Note that this means that most libraries need reformatting after they have been read
+ from the distribution media. Because, for each entry, the sequence and its annotation are stored separately, the reformatting process consumes significant computer resources. T
+hese reformatting programs are available from PIR and we give no further information here. The programs that search whole libraries of sequences also expect the libraries to be in PIR format.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 5.2.2.\tab Sequence libraries for the UNIX version\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+For the UNIX version of the programst we use the EMBL cdrom as the primary source of sequence data and have chosen their indexing method for all libraries. These indexes leave the sequence libraries in their distribution format and simply provide offsets t
+o the original fi
+les. The cdrom provides the EMBL nucleic acid sequence library and the SWISSPROT protein sequence library. Currently it also includes indexes for entry names, accession numbers, authors and freetext and has an additional "title" file which, for each entry,
+ consists of entry name, entry length and an 80 character description of the entry. These indexes allow rapid retrieval of entries by name or accession number, and the author and freetext indexes can be searched very rapidly.  The files can be left on the 
+cdrom or transfered to a hard disk. The programs that search whole libraries of sequences expect the libraries to be in cdrom format or PIR format.\par 
+\pard \s4\qj\sa120\sl280 
+We have written our own programs for producing EMBL cdrom type indexes for other sequence libraries. These allow us to use the PIR protein libraries in CODATA format and between release updates of the EMBL nucleotide library. Others may wish to use them to
+ produce indexes for libraries such as GenBank. In addition to our own programs the scripts that produce the indexes also use the UNIX sort program. We give no further details here but the programs are described in Staden and Dear, 1992.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 5.2.2.1\tab Library description files.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The following information is only relevent to those installing the sequence libraries on a SUN. To make the sequence library handling as flexible as possible we use several level of files. As stated above, at present we only deal with the EMBL and SWISSPRO
+T libraries as distributed on cdrom and the PIR protein library in CODATA format. By including a "library type" flag in the library description file we also leave open the possibility of using alternative formats. \par 
+\pard \s4\qj\sa120\sl280 We describe the libraries at 3 levels\:
+ 1) a list of libraries and their types, which points to 2) the files which name the libraries individual files and their file types, then, finally 3) the librairies individual files. The files used are described below.\par 
+\pard\plain \s7\qj\fi-1120\li1120\sa120\sl280\pagebb\tx1120 \f20 Level 1)\tab The top level file is a list of available libraries which contains\:  the library type, the name of the file containing th
+e names of each libraries individual files, and the prompt to appear on the users screen. \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Example\: \par 
+\pard \s4\qj\li1100\sa120\sl280 File name\: SEQUENCELIBRARIES\par 
+File contents\:\par 
+\pard\plain \li1120\sl220 \f4\fs16 A\tab EMBLLIBDESCRP EMBL nucleotide library ! in cdrom format\par 
+A\tab SWISSLIBDESCRP SWISSPROT protein library! in cdrom format\par 
+\pard \li1120\sa300\sl220 B\tab PIRLIBDESCRP PIR protein library! in CODATA format\par 
+\pard\plain \s4\qj\sa180\sl280 \f20 The first two libraries are of type A. The logical names are EMBLLIBDESCRP and SWISSLIBDESCRP, and the prompts are "EMBL nucleotide library" and "SWISSPROT protein library". The third library is o
+f type B with logical name PIRLIBDESCRP. Space is used as a delimiter and anything to the right of a ! is a comment.\par 
+\pard\plain \s7\qj\fi-1100\li1100\sa120\sl280\tx1120 \f20 Level 2)\tab The file containing the names of the libraries individual files contains flags to define the file types and the path or logical names of the files. Current file types are\: \par 
+\pard\plain \fi100\li980\sl220 \f4\fs16 A\tab Division_lookup\par 
+B\tab Entryname_index\par 
+C\tab Accession_target\par 
+D\tab Accession_hits\par 
+E\tab Brief_directory.\par 
+F\tab Freetext_target\par 
+G\tab Freetext_hits\par 
+H\tab Author_target\par 
+I\tab Author_hits\par 
+\pard\plain \s4\qj\sa120 \f20 Example\par 
+\pard \s4\qj\li1120\sa120 File name\: EMBLLIBDESCRP\par 
+File contents\:\par 
+\pard\plain \fi100\li980\sl220 \f4\fs16 A\tab STADTABL/EMBLdiv.lkp\par 
+B\tab /cdrom/indices/embl/entrynam.idx\par 
+C\tab /cdrom/indices/embl/acnum.trg\par 
+D\tab /cdrom/indices/embl/acnum.hit\par 
+E\tab /cdrom/indices/embl/brief.idx\par 
+F\tab /cdrom/indices/embl/freetext.trg\par 
+G\tab /cdrom/indices/embl/freetext.hit\par 
+H\tab /cdrom/indices/embl/author.trg\par 
+I\tab /cdrom/indices/embl/author.hit\par 
+\pard \li1120\sa300\sl220 \par 
+\pard\plain \s7\qj\fi-1120\li1120\sa120\sl280\tx1120 \f20 Level 3)\tab 
+The individual library files. The contents of all files below Division_lookup are exactly as they appear on the cdrom. The Division_lookup file is rewritten so the directory structure and file names can be chosen locally. Its format is I6,1x,A. \par 
+\pard\plain \s4\qj\sb300\sa180\sl280 \f20 The files which define all the programs and standard data files used by the package\:
+ staden.login and staden.profile, define the file SEQUENCELIBRARIES which contains the list of available libraries. As should be clear from the description above the three 
+levels need to be created (actually modified from the contents of the distribution tape) and all names can be changed locally, or set to be the same as those on the cdrom.\par 
+\pard\plain \s7\qj\fi-1120\li1120\sa120\sl280\tx1120 \f20 \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Example of Division_lookup file \par 
+\pard \s4\qj\li1120\sa120\sl280 File name\: STADTABL/EMBLdiv.lkp\par 
+Contents\:\par 
+\pard\plain \li1120\sl220 \f4\fs16 1\tab /cdrom/embl/fun.dat\par 
+2\tab /cdrom/embl/inv.dat\par 
+3\tab /cdrom/embl/mam.dat\par 
+4\tab /cdrom/embl/org.dat\par 
+5\tab /cdrom/embl/phg.dat\par 
+6\tab /cdrom/embl/pln.dat\par 
+7\tab /cdrom/embl/pri.dat\par 
+8\tab /cdrom/embl/pro.dat\par 
+9\tab /cdrom/embl/rod.dat\par 
+10\tab /cdrom/embl/syn.dat\par 
+11\tab /cdrom/embl/una.dat\par 
+12\tab /cdrom/embl/vrl.dat\par 
+13\tab /cdrom/embl/vrt.dat\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 6.\tab Conventions Used In Text\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Obviously the programs can perform many more operations than there is space to describe but, in the selection of uses shown, we have tried to give some feel for the programs' sco
+pe. For this reason, and the need to conform as closely as possible to the format of the book, we have chosen specific paths through the programs, rather than attempt to describe all routes. For some sections, such as that on the facilities available for e
+diting contigs, this has not been possible and we have instead described how the major commands are used. It should also be noted that the user interactions described in the methods sections are those that would be required if the options were selected in 
+the "Execute with dialogue" mode. In practice many of the options would normally be used without any dialogue being required.\par 
+\pard \s4\qj\sa120\sl280 
+In the section on the user interface we outlined the different modes of obtaining input from users. Throughout the specific chapters we have adopted the following conventions to indicate which mode of input is being employed. When a program requests numeri
+cal or string input we have used the term "Define", as in Define "Minimum search score". When a program requests that a choice is 
+made between several options, as in the case of radio buttons or check boxes, we have used the term "Select". When a program offers a choice between two options in the form of a yes or no answer, as in "Hide translation", we use the terms "Accept" or "Reje
+ct". When the digitizer program uses the stylus for input we have used the term "Hit".\par 
+\pard \s4\qj\sa120\sl280 Because it is difficult to produce figures including pull down menus and dialogue boxes, almost all examples containing user input are taken from the xterm interface. Ho
+wever the actual wording of the prompts is the same for both interfaces.\par 
+\pard \s4\qj\sa120\sl280 
+The programs contain routines for drawing scales on plots and for simple annotation, but in general such embellishment is not done automatically by the programs. This is because the programs are designed so that many plots can be superimposed, and it is be
+tter for the user to explicitly decide to add scales and annotation. More elaborate annotation can be added by saving the graphics output to files which can be handled by, say Macinto
+sh, painting and drawing programs. None of the examples of graphical results shown in the following chapters have added scales\: all are exactly as drawn by the programs.\par 
+\pard\plain \s5\sa60\sl320\tx560 \b\f20\fs28 \par 
+\par 
+\par 
+\par 
+7.\tab NOTES\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 7.1\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Although all the programs in the Macintosh version of the package work, the conversion to this machine was never finished. The package does not provide access to the sequence libraries, handling only simple text files containing sequences, or those generat
+ed by the assembly program SAP. The user interface, although using pu
+ll down menus and dialogue boxes for all interactions, is not as "Mac like" as many would expect. However many people find this version very useful, and for others, the digitizer program alone makes the package worth having. Data input from a digitizer is 
+a task suited to a machine like the Macintosh, and the data files can be transferred to a larger machine for assembly and other analysis. With the exception of sequence library access, all the options available in the 1990 VAX version are contained in the 
+package (See Staden, 1990). We give no further details specific to the Macintosh version.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 8.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R. 1990. An improved sequence handling package that runs on the Apple Macintosh. Comput. {\i Applic. Biosc}. {\b 4}, 387-393.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Staden, R. and Dear, S. 1992. Indexing the sequence libraries\: Software providing a common indexing system for all the standard sequence libraries. {\i DNA Sequence} {\b 3}, 99-105.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 3.  Sequence Input, Editing and Sequence Library Use\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 1.1\tab Introduction to sequence input\par 
+1.2 \tab Introduction to keyboard input\par 
+1.3\tab Introduction to input from digitizer\par 
+1.4\tab Introduction to editing single sequences\par 
+1.5\tab Introduction to using the sequence libraries\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Sequence input from keyboard\par 
+2.2\tab Sequence input from digitizer\par 
+2.3\tab Sequence input from the Pharmacia A.L.F.\par 
+2.4\tab Sequence input from the ABI 373A.\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.5\tab Editing a nucleic acid sequence using restriction sites and a translation and base numbering as landmarks.\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.6\tab Searching the freetext and author indexes of a sequence library\par 
+2.7\tab Using accession numbers to retrieve data from a sequence library\par 
+2.8\tab Displaying the annotations for an entry in a sequence library\par 
+2.9\tab Reading a sequence from sequence library\par 
+2.10\tab Worked example of sequence library access\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this chapter we describe sequence input and editing and the use of sequence libraries.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 1.1\tab Introduction to sequence input and editing\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The package contains facilities for input of sequence data from the keyboard, sonic digitizer
+s, and ABI 373A and Pharmacia A.L.F fluorescent sequencing machines.  Editing of single sequences can be performed using system editors such as EDT on the VAX and EMACS on the SUN. Editing of sequence alignments is discussed in the chapter on managing sequ
+encing projects.\par 
+\pard\plain \s6\sa60\sl280\pagebb\tx560\tx860 \b\f20 1.2\tab Introduction to keyboard input\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program SAP contains an option to enter sequence at the keyboard. It also creates a file of file names and will list the sequences. Users may choose any 4 keys to represent the characters A, C, G and 
+T. For example 4 adjacent keys in the same order as the lanes on a gel could be used. The program translates these symbols to A, C, G and T, and any other characters are left unchanged. No line of input should be longer than 80 characters. Terminate input 
+with the symbol @.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 1.3\tab Introduction to input from digitizer\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Digitisers provide a convenient way of entering sequences from films into a computer. The digitiser, which is connected directly to the computer, operates on a light box, and is controlled by a pr
+ogram named GIP (1). The film to be read is taped firmly to the surface of the light box, and the user defines the lane order and the centres of the four lanes to be read. These positions are defined at the point where reading will commence and the program
+ adjusts their values as the film is read. The user reads the sequence and transfers it to the computer by hitting the centres of the bands progressing up the film. Any number of sets of lanes and films can be read in a single run of the program. Each sequ
+ence is stored in a separate file and a file of file names is also written. The program also uses a menu, which is a series of reserved areas of the light box surface, for entering commands and uncertainty codes. When the pen is pressed in these areas the 
+program responds accordingly. Each time the pen tip is depressed in the digitizing area the program sounds the bell on the terminal to indicate to the user that a point has been recorded. As the sequence is read the program displays it on the screen. 
+\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 1.4\tab Introduction to editing single sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The editing method used by the programs is designed to give users access to an editor with which they are familiar - i.e. the one on their machine, say EDT on a VAX or EMACS on a UNIX system, and yet to allow them to edit a sequence which contains all the 
+landmarks they need in order to know where they are. Users can create a file containing a simple listing of the sequence (single stranded) with numbering, using "list the sequence", and then edit it with their syste
+m editor, using the numbering to know where they are within the sequence. When the edits are complete they exit from the editor and the program "analyses" the edited file to extract only the sequence characters. Similarly a file containing a three phase tr
+anlslation, or a file containing a sequence plus its three phase translation, plus its restriction sites marked above the sequence (see figure 3.1), can be edited. In order to be able to "analyse" such complicated listings and correctly extract the sequenc
+e the following simple rule is used\:
+ all lines in the file that contain a character that is not A,C,T,G or U are deleted. It is obviously important to be aware of this rule and its implications. For protein sequences only a simple listing i.e. the sequence plus numbering, can be used.\par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 1.5\tab Introduction to using the sequence libraries\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The installation of the sequence libraries is described in the introductory chapter.  Direct access to the libraries is provided by all programs that need such a facility\: it is
+ not performed by separate programs. The facilities currently offered in NIP, PIP, SIP, NIPL, PIPL, and SIPL include the following\:\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 \tab Get a sequence by knowing its entry name\par 
+\tab Get a sequences' annotation by knowing its entry name\par 
+\tab Get an entry name by knowing its accession number\par 
+\pard\plain \li1120\ri1240\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16  HapII\par 
+\pard \li1120\ri1240\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth  HpaII\par 
+ MspI        MseI\par 
+.            .HincII\par 
+.            .HindII\par 
+.            .HpaI    DsaV\par 
+.            ..       EcoRII\par 
+.            ..       TspAI\par 
+.            ..       . ApyI\par 
+.            ..       . BstNI\par 
+.            ..       . MvaI\par 
+.            ..       . ScrFI                 MaeIII\par 
+.            ..       . .                     . BsrI  MseI\par 
+ccggttagactgttaacaacaaccaggttttctactgatataactggttacatttaacgc\par 
+        10        20        30        40        50        60\par 
+ P  V  R  L  L  T  T  T  R  F  S  T  D  I  T  G  Y  I  *  R\par 
+  R  L  D  C  *  Q  Q  P  G  F  L  L  I  *  L  V  T  F  N  A\par 
+\pard \li1120\ri1240\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth    G  *  T  V  N  N  N  Q  V  F  Y  *  Y  N  W  L  H  L  T  P\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb80\sa120\sl240\tx1140 \f21\fs20 Figure 3.1\tab The first page width of a sequence display that can be edited by the program.\par 
+\pard\plain \s7\qj\fi-560\li560\sb360\sa120\sl280\tx560 \f20 \tab Search the author index for author names\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab Search the freetext index for keywords\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The facilities currently offered in NIPL, PIPL and SIPL include\:\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 \tab Search whole library\par 
+\tab Search only a list of entry names\par 
+\tab Search all but a list of entry names\par 
+\pard\plain \s5\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Sequence input from keyboard\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Type in gel readings".\par 
+2.\tab Accept "Use special keys for A,C,T,G".\par 
+3.\tab Define the keys in turn.\par 
+4.\tab Define "File file names". A file of file names so the readings can be processed as a batch.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define in the sequence by typing it in using the selected keys. Finish by typing an @ symbol.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "File name for this gel reading". This is the name for the sequence just entered.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Accept "Type in another reading". This cycles round to step 5. If rejected the next step follows.\par 
+8.\tab Accept "List gel readings". The batch of readings entered will each be listed, one after the other, headed by their file names, on the screen.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Sequence input from digitizer\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Tape the autoradiograph down securely on the light box.\par 
+2.\tab Start the program (GIP).\par 
+3.\tab Define "File of file names".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Using the digitizer pen hit the digitizer menu ORIGIN, program menu ORIGIN, program menu START.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab After the bell has sounded the program will give the default lane order. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab If correct hit CONFIRM otherwise hit RESET. To reset the lane order hit the A,C,G,T boxes in the menu in left to right order.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Hit START, then hit in left to right order, at a height level with the first band to be read, the start positions for the next four lanes. The progr
+am will report the mean lane separations and asks for confirmation that they are correct.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Hit START\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Hit the bands on the film in sequence order. If necessary use the uncertainty codes in the program menu. Continue until the sequence is finished.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab Hit STOP.\par 
+10.\tab Define "Name for this reading".\par 
+11.\tab Accept "Read another sequence". Otherwise the program will stop.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Sequence input from the Pharmacia A.L.F.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 After processing and base calling on the PC the data for all 10 clones is contained in a single f
+ile, and the user names each using local conventions. Then this single file is transfered to the SUN using PC-NFS. This program allows SUN directories to be mounted as if they were DOS disks and data can be transfered by use of the DOS copy command.  On th
+e SUN, to prepare for processing by program XBAP the 10 clones  are split into 10 separate files each with the names given on the PC. In addition a file of file names is written Then the reads for the individual clones need to be examined to clip off the v
+ector sequence and the poor data at the 5' end.  See note 2.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Sequence input from the ABI 373A.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 After processing and base calling on the Macintosh the data for each clone is contained in 2 files\:
+ one is simply the sequence but the main file contains the raw data, trace data and sequence. For our processing we do not use the sequence file as we can ex
+tract all we need from the main file. The user names each file using local conventions and then the folder is transfered to the SUN using TOPS. This program 
+allows SUN directories to be mounted as if they were on the Macintosh and data can be transfered by simply dragging folders on the Macintosh screen.  On the SUN, to prepare for processing by program XBAP, a file of file names is written and the reads for t
+he individual clones are examined to clip off the vector sequence and the poor data at the 5' end.  See note 2.\par 
+\pard\plain \s6\fi-560\li560\sb240\sa120\sl280\tx560\tx980 \b\f20 2.5\tab Editing a nucleic acid sequence using restriction sites and a translation and base numbering as landmarks.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select NIP.\par 
+2.\tab Read in the sequence to be edited.\par 
+3.\tab Direct output to disk, say creating file edit.seq.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Use the restriction enzyme site search routine (See the relevant chapter) to create a file showing "Names above the sequence", as in figure 3.1.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Close the redirection file.\par 
+6.\tab Select "Edit the sequence".  \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Name of file to edit". This is the file containing the sequence listing, say edit.seq.The sytem editor will start up.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Edit the sequence.\par 
+9.\tab Exit from the editor.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Accept "Make edited sequence active". The edited sequence will replace the original sequence. \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.6\tab Searching the freetext (or author) index of a sequence library\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Read new sequence".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Select "Sequence library". The alternative is "Personal file", and if taken would be followed by questions about which of the formats "Staden, EMBL, GenBank, PIR, GCG or FASTA" it was stored in.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Select, say, "EMBL nucleotide library".\par 
+4.\tab Select "Search text index for keywords".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Keywords". Type up to 5 keywords separated by spaces - i.e.space is the delimiting character (see note below about author searches).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab 
+The search will start and for each match the program will display the contents of the matching line which includes the entry name, primary accession number,  its length and a 80 character description. After every 20 matches the program will ring the bel
+l and the user can escape by typing "!".\par 
+\tab The commands for searching the author index are effectively the same. Note that for authors it is useful to be able to link words together for names s
+uch as De Gaule or von Meyenberg. The symbol underscore (_) can be used for this purpose - e.g. De_Gaule or von_meyenberg. The same facility is available for the keyword searches.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.7\tab Using accession numbers to retrieve data from a sequence library\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Read new sequence".\par 
+2.\tab Select "Sequence library".\par 
+3.\tab Select, say, "EMBL nucleotide library".\par 
+4.\tab Select "Get entry names from accession numbers".\par 
+5.\tab Define "Accession number". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab The program will display the entry names corresponding to the accession number. The last entry name found will become the default entry name.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.8\tab Displaying the annotations for an entry in a sequence library\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Read new sequence".\par 
+2.\tab Select "Sequence library".\par 
+3.\tab select, say, "EMBL nucleotide library".\par 
+4.\tab Select "Get annotations".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Entry name". The program will display the annotation for the entry. After every 20 lines the program will ring the bell and the user can escape by typing "!".\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.9\tab Reading a sequence from a sequence library\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Read new sequence".\par 
+2.\tab Select "Sequence library".\par 
+3.\tab Select, say, "EMBL nucleotide library".\par 
+4.\tab Select "Get a sequence".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Entry name".  The program will make the sequence the active sequence and display its base composition.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.10\tab Worked example of sequence library access\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The worked example in figure 3.2 shows a search of the text index for the keywords p53 and mouse, followed by a search of the author index for the names sanger and coulson, followed by search on accession number v00636, followed by "Get annotatio
+ns" for entry lambda, and finally "Get a sequence" for entry lambda. \par 
+\pard\plain \sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 {\f22\fs18  Select sequence source\par 
+}\pard \sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth {\f22\fs18  X  1 Personal file\par 
+    2 Sequence library\par 
+ ? Selection  (1-2) (1) =2\par 
+ Select a library\par 
+ X  1 EMBL 29 nucleotide library Dec 91\par 
+    2 SWISSPROT 20 protein library Nov 91\par 
+    3 PIR 31 protein library Dec 91\par 
+    4 NRL3D 58 From Brookhaven protein library Dec 91\par 
+    5 GenBank example\par 
+ ? Selection  (1-5) (1) =\par 
+Library is in EMBL format with indexes\par 
+ Select a task\par 
+ X  1 Get a sequence\par 
+    2 Get annotations\par 
+    3 Get entry names from accession numbers\par 
+    4 Search author index\par 
+    5 Search text index for keywords\par 
+ ? Selection  (1-5) (1) =5\par 
+ Search for keywords\par 
+ ? Keywords=p53 mouse\par 
+P53 hits  73\par 
+MOUSE hits  10140\par 
+\'00\par 
+ MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen\par 
+ MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen\par 
+ MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen\par 
+ MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen\par 
+ MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen\par 
+ MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen\par 
+ MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen\par 
+ MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen\par 
+}\pard \sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth {\f22\fs18  MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen\par 
+ MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon\par 
+ MMANT11    X00885         562 Murine p53 gene 3' region with exon 11\par 
+ MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.\par 
+ MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.\par 
+ MMP53      X00741        1377 Mouse mRNA for transformation associated protein\par 
+ MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.\par 
+ MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.\par 
+ MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.\par 
+ MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou\par 
+ MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4\par 
+\'00\par 
+ MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53\par 
+ MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53\par 
+ MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5\par 
+ MMSFFV1    X64656         165 M.musculus Friend spleen focus forming virus (SFF\par 
+ MMSFFV2    X64657         142 M.musculus Friend spleen focus forming virus (SFF\par 
+      24 different entries found\par 
+\'00\par 
+ Select a task\par 
+ X  1 Get a sequence\par 
+    2 Get annotations\par 
+    3 Get entry names from accession numbers\par 
+    4 Search author index\par 
+    5 Search text index for keywords\par 
+ ? Selection  (1-5) (1) =4\par 
+ Search for keywords\par 
+ ? Keywords=coulson sanger\par 
+COULSON hits  935\par 
+SANGER hits  15\par 
+\'00\par 
+ LAMBDA     V00636       48502 Genome of the bacteriophage lambda (Styloviridae)\par 
+ MIBTXX     V00654       16338 Complete bovine mitochondrial genome.\par 
+ MIHSCG     J01415       16569 Human mitochondrion, complete genome.\par 
+ MIHSM1     M10546        2771 Human mitochondrial DNA, fragment M1, encoding tr\par 
+ MIHSXX     V00662       16569 H.sapiens mitochondrial genome\par 
+ MIPX1C01   M10860         130 Bacteriophage phi-X174, nucleotides 3920-4049.\par 
+ MIPX1C02   M10861         115 Bacteriophage phi-X174, nucleotides 3480-3595.\par 
+ MIPX1C03   M10862         121 Bacteriophage phi-X174, nucleotides 4260-4380.\par 
+ MIPX1CTI   M10849         130 Bacteriophage phi-X174, nucleotides 3389-3520.\par 
+ PHIX174    V01128        5386 Bacteriophage phi-X174 (cs70 mutation) complete g\par 
+ R17CPRAA   M24826          61 Bacteriophage R17 coat protein RNA fragment.\par 
+      11 different entries found\par 
+\'00\par 
+ Select a task\par 
+ X  1 Get a sequence\par 
+    2 Get annotations\par 
+    3 Get entry names from accession numbers\par 
+    4 Search author index\par 
+    5 Search text index for keywords\par 
+ ? Selection  (1-5) (1) =3\par 
+ ? Accession number=v00636\par 
+Entry name LAMBDA\par 
+ Select a task\par 
+ X  1 Get a sequence\par 
+    2 Get annotations\par 
+    3 Get entry names from accession numbers\par 
+    4 Search author index\par 
+    5 Search text index for keywords\par 
+ ? Selection  (1-5) (1) =2\par 
+ Default Entry name=LAMBDA\par 
+ ? Entry name=\par 
+ID   LAMBDA     standard; DNA; PHG; 48502 BP.\par 
+}\pard \sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth {\f22\fs18 XX\par 
+AC   V00636; J02459; M17233; X00906;\par 
+XX\par 
+DT   09-JUN-1982 (Rel. 01, Created)\par 
+DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)\par 
+XX\par 
+DE   Genome of the bacteriophage lambda (Styloviridae).\par 
+XX\par 
+KW   circular; coat protein; DNA binding protein; genome;\par 
+KW   origin of replication.\par 
+XX\par 
+OS   Bacteriophage lambda\par 
+OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.\par 
+XX\par 
+RN   [1]\par 
+RP   1-48502\par 
+RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;\par 
+RT   "Nucleotide sequence of bacteriophage lambda DNA";\par 
+RL   J. Mol. Biol. 162\:729-773(1982).\par 
+XX\par 
+\'00\par 
+ Select a task\par 
+ X  1 Get a sequence\par 
+    2 Get annotations\par 
+    3 Get entry names from accession numbers\par 
+    4 Search author index\par 
+    5 Search text index for keywords\par 
+ ? Selection  (1-5) (1) =\par 
+ Default Entry name=LAMBDA\par 
+ ? Entry name=\par 
+DE   Genome of the bacteriophage lambda (Styloviridae).\par 
+ Sequence length  48502\par 
+ Sequence composition\par 
+           T          C          A          G          -\par 
+      11988.     11360.     12336.     12818.         0.\par 
+}\pard \sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth {\f22\fs18          24.7%      23.4%      25.4%      26.4%       0.0%\par 
+}\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 3.2\tab A worked example of sequence library use.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab NOTES\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+The program menu for GIP is simply a set of boxes drawn on the digitizing surface that each contain a command or uncertainty code.  Right handed users will find it is best to position the menu to the right of the digitizing 
+area, but in practice as long as its top edge is parallel to the digitizer box, it can be put anywhere in the active region. As well as the codes a,c,g,t,1,2,3,4,b,d,h,v,r,y,x,-,5,6,7,8 the following commands are included in the menu\:
+ DELETE removes the la
+st character from the sequence; RESET allows the lane centres to be redefined; START means begin the next stage of the procedure; STOP means stop the current stage in the procedure; CONFIRM means confirm that the last command or set of coordinates are corr
+ect. \par 
+\tab 
+The digitizing device also has a menu of its own. This lies in a two inch wide strip immediately in front of the digitizing box. Pen positions within this two inch strip are interpretted as commands to the digitizer and are not sent to the GIP program. In
+ general the only time users will need to use the device menu is when they tell GIP where the program menu lies in the digitizing area. This is done by first hitting ORIGIN in the device menu and then hitting the bottom left hand corner of the progra
+m menu. If the bell does not sound after hitting START  try hitting METRIC in the device menu (the program uses metric units, and some digitizers are set to default to use inches; hitting metric switches between the two).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab 
+The user should try to hit the bands as near as possible to the centre of the lanes because the program tracks the lanes up the film using the pen positions. If the lane centres get too close the program stops responding to the pen positions of bands and 
+hence does not ring the bell. If t
+his occurs users must hit the reset box in the menu and the program will request them to redefine the lane centres at the current reading position. Then they can continue reading. As a further safeguard the program will only respond to pen positions either
+ in the menu or very close to the current reading position.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Details about preparing the data from fluorescent sequencing machines for processing by XBAP are contained in the notes for the chapter on managing sequencing projects. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab All of the operation
+s described for the EMBL nucleotide library can be performed in exactly the same way for GenBank and the SWISSPROT and PIR protein libraries.  For keyword searching the freetext index is most useful because it contains all words in feature tables, definiti
+on lines, title lines, keywords and comment lines. The searches are very fast. The search will find all words that start with the given keywords\:
+ e.g. keyword sugar will match with sugar, sugaractivating, sugars, etc. When several keywords are used together, only entries indexed on all the words will be reported. On the VAX, EMBL, GenBank, SWISSPROT and PIR can all be processed. \par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R. 1984. A computer program to enter DNA gel reading data into a computer. {\i Nucl. Acids Res}. {\b 12}, 499-503.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 4.  Managing Sequencing Projects\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Starting a project database\par 
+2.2\tab Screening against restriction enzyme recognition sequences\par 
+2.3 \tab Screening against vector sequences\par 
+2.4 \tab Entering readings in to the project database (assembly)\par 
+2.5\tab Searching for internal joins\par 
+2.6\tab Editing in XBAP\par 
+2.7\tab Joining contigs interactively in XBAP\par 
+2.8\tab Selecting primers and templates\par 
+2.9\tab Examining the quality of a consensus\par 
+2.10\tab Using graphical displays to examine contigs\par 
+2.11\tab Disassembling contigs\par 
+2.12\tab Shuffling pads\par 
+2.13\tab Displaying a contig\par 
+2.14\tab Highlighting differences between readings and the consensus\par 
+2.15\tab Screen editing contigs in SAP\par 
+2.16\tab Automatic editing in SAP\par 
+2.17\tab Using the original editor in SAP\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Data input, assembly, checking and editing are the major tasks of sequence project management. Data input is described in a previous chapter and here we cover everything else. The programs can deal with data derived from autoradiographs and from automated 
+gel reading machines such as the Applied Biosystems 373A and the Pharmacia A.L.F. and film readers such as the Amersham scanner \par 
+\pard \s4\qj\sa120\sl280 We describe two alternative programs for managing sequencing projects. They contain the same assembly and vector screen
+ing routines but they differ in their editing methods. One program SAP (see references 1 and 2) can be operated from simple terminals and emulators but the other XBAP (3) requires an X terminal or emulator. XBAP contains a superior editor plus the facility
+ to annotate sequences and display the coloured traces for data derived from fluorescent sequencing machines. Those using autoradiographs will find that SAP is adequate but XBAP is essential for users of fluorescent sequencing machines. Readers should note
+ that several of the methods for displaying contigs described below are probably of value only to those unable to use the screen based contig editor in XBAP.\par 
+\pard \s4\qj\sa120\sl280 
+Fluorescent sequencing machines provide machine readable data. This means, given appropriate software,  that while making editing decisions the user can see, displayed on the screen, the coloured traces used to derive the sequence. However data from these 
+machines requires some extra processing. First the machines tend to produce long sequences with po
+or quality at their 3' ends and so we have to decide how much of the data to use. Secondly the sequencing machine does not recognise the primer region (as the user would) so we need to have some way of removing it from the data. The poor quality data from 
+both ends of the sequence and the vector sequences are identified non-interactively by programs clip-seqs and vep. Alternatively these tasks can be performed interactively using program TED (4). We term the data from the 3' end of a reading that is not emp
+loyed in the assembly process "unused" sequence. Note that we do not lose this data but simply ignore it until such time as it can be useful for locating joins between contigs, or for double stranding regions of the sequence.\par 
+\pard \s4\qj\sa120\sl280 
+The method described here uses a database to store all the data for each sequencing project. The individual sequence readings derived from autoradiographs or from sequencing machines are initially stored in separate files but the program copies them into t
+he database during the assembly process. For normal operation the program handles batches of readings - say 24 from a film or machine run. Batch processing is achieved by use of files of file names. \par 
+\pard \s4\qj\sa120\sl280 Depending on the strategy employed and the stage of the project the following operations may be performed.\par 
+\pard\plain \s7\qj\fi-560\li560\sb100\sa120\sl280\tx560 \f20 1)\tab Start a project database.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2)\tab Select primers and templates.\par 
+3)\tab Obtain readings.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4)\tab Put individual readings into the computer and write a file of file names. For data derived from fluorescent sequencing machines choose which data from
+ the 3' end of the reading should not be used for the assembly process.\par 
+5)\tab Screen the batch against any vectors that may be present, excising any vector sequence found and passing to the next step, the names of those readings that contain some non-vector sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6)\tab Screen the batch against any restriction sites whose presence would indicate a problem, passing those that do not match on to the next step.\par 
+7)\tab Compare each reading in the batch with the current contents of the project database adding them to the contigs they overlap, joining contigs or starting new contigs.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8)\tab 
+Check the number of contigs and the quality of the consensus sequence and plan further experiments. Try to join contigs by searching for overlaps between their ends. (This is particularly useful for those using data from fluorescent sequencing machines,
+ where although the 3' end of the sequence is not good enough for automatic assembly, it can be valuable for finding overlaps between contigs).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9)\tab Edit the contigs to resolve dissagreements.\par 
+10)\tab Produce a consensus sequence.\par 
+11)\tab Analyse the consensus sequence, possibly discovering further errors.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Subsets of these operations will be cycled through repeatedly. A pure shotgun strategy would continue using steps 3-7, a pure primer walking strategy would also include step 2. A number of the steps require almost no user intervention, however checking qua
+lity and final editing decisions are still interactive procedures. The program contains several options, such as displays of the overlapping reading
+s in a contig, to help indicate, not only the poorly determined regions, but also which clones could be resequenced to resolve ambiguities, or those which can usefully be extended or sequenced in the reverse direction, to cover difficult regions. It is bes
+t to use a command procedure or script for handling steps 5-7.\par 
+\pard \s4\qj\sa120\sl280 For our projects we have a script which users employ by typing "assemble filename", where filename is the file of file names for the current batch of readings. This script calls all the necessa
+ry options in SAP or BAP (see notes) in order to make a backup of the database, screen against any vectors, assemble readings and print a report. In the text below we describe how these operations are performed interactively. \par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Starting a project database\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The assembled data for each project is stored in a database. At the beginning of a project it is necessary to create an empty database using program SAP or XBAP.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Open database"\par 
+2.\tab Select "Start new database"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define the database name. Database names can have from one to 12 letters and must not include full stop (.). \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Accept "Database is for DNA"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+Define "Database size". This is an initial size and if necessary can be increased later using "Copy database".  Roughly speaking it is the number of readings expected to be needed to complete the project. Currently BAP limits the maximum to 8000 and SAP
+ has a limit of 1000.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Maximum reading length". This is the length of the longest reading that will be added to the database. The minimum is 512 bases, and the maximum 4096.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program should confirm that "copy 0" of the database has been started. See Note 14 for important information.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Screening against restriction enzyme recognition sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+For some strategies it is necessary to compare readings against any restriction enzyme recognition sequences that may have been used during cloning and which should not be present in the data. The function operates on single readings or processes batches a
+ccessed through files of file names. The algorithm looks for exact matches to recognition sequences. The recognition sequences should be stored in a simple text file with one recognition sequence per record.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Accept "Use file of filenames".\par 
+2.\tab Define "File of gel reading names". The input file of file names.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Define "File for names of sequences that pass". A file of file names for those readings that do not contain the recognition sequences. After the run it will contain the names of all the files in the batch that do not match any
+ of the restriction enzyme recognition sequences. Hence it can be used for further processing of the batch.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "File name of recognition sequences". The name of the file of recognition sequences.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Screening against vector sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+For most strategies it is necessary to compare readings against any vector sequences that may have been picked up during cloning. The package contains two routines for screening against vectors. The original function simply reports any matches between the 
+readings and t
+he vector sequences and only passes on those that do not match. This function should now only be used to screen for any other sequences that should be excluded from the database, because the newer one (program name VEP for vector excising program) is capab
+le of both finding the vector sequences and editing them out automatically. \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.3.1\tab Clipping off vector sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 There are two types of vector that may need to be screened out of gel readings\: the sequencing vector and, for cases where, say, whole cosmids 
+have been shotgunned, the cloning vector. The two tasks are different.  When screening out the sequencing vector we may expect to find data to exclude, both from the primer region and from the other side of the cloning site (when, for example, the insert i
+s short). When screening out cosmid vector we may find that either the 5' end, or the 3' end, or the whole of the sequence is vector. Also for the cosmid search we need to compare both strands of the sequence. The program (VEP) works slightly differently f
+or each of the two cases. Having read the vector sequence from a file the program asks for the "Position of the cloning site". A value of zero signifies that the search will be for the cosmid vector. A nonzero value signifies that the search is for the seq
+uencing vector, and so in this case the program then asks for the "Relative position of the primer site". A negative relative position signifies that a reverse primer is being used, otherwise a forward primer is assumed.\par 
+\pard \s4\qj\sa120\sl280 The program screens a batch of read
+ings using a file of file names and creates a new file of file names which contains the names of all those sequences that include some nonvector sequence. For each sequence that contains some vector it writes out a new copy of the file in which the vector 
+portion is identified.\par 
+\pard \s4\qj\sa120\sl280 
+The search, which uses a hashing algorithm, is very rapid. Users specify a "Word length", the "Number of diagonals to combine" and a "Minimum score". The word length is the minimum number of consecutive bases that will count as a mat
+ch. The algorithm treats the problem like a dot matrix comparison and finds the diagonal with the highest score. Then it adds the scores for the adjacent "Minimum number of diagonals to combine". If the combined score is at least "Minimim score" the sequen
+ce is marked to indicate that it contains vector. The score represents the proportion of a diagonal that contains matching words, so the maximum score for any diagonal is 1.0.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Define "Input file of file names". This is the file containing the names of all the readings to be screened.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "File name of vector sequence". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Define "Position of cloning site". This is the base number, relative to the beginning of the vector sequence, that is on the 3' side of the insert site. For example for m13mp18 the SmaI site is at 6249. A zero value signifies that the search is for cosm
+id vector.\par 
+4.\tab Define "Relative position of 3' end of primer site". This is the position, relative to the cloning site, of the first base that could be included in the sequence. For m13mp18, the 17mer Sequencing Primer  and the SmaI site, the position is 41.
+\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Word length". Only words of this length will be counted as matches.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Number of diagonals to combine". The scores for this number of diagonals around the highest scoring diagonal will be combined to give the total score.\par 
+7. \tab Define "Cutoff score". For a match, at least this proportion of the total length of the summed diagonals must contain identical words. \par 
+8.\tab Define "Output file of passed file names". The name of the file to contain the names of the readings to pass on to the assembly program.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Processing will commence and finishes with a summary stating the number of files processed, the number completely vector, the number partly vector and the number free of vector.\par 
+\pard\plain \s9\fi-560\li860\sb160\sa60\sl280\tx1140 \b\f20 2.3.2\tab Screening for "vectors"\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function is contained in both SAP and XBAP and operates on single readings or processes batches accessed through files of file names. The algorithm looks for exact matches of length "minimum match length" and disp
+lays the overlapping sequences.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Accept "Use file of filenames".\par 
+2.\tab Define "File of gel reading names". The input file of file names.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Define "File for names of sequences that pass". A file of file names for those readings that do not contain the vector sequence. After the run it will contain the names of all the files in the batch that do not match the vector sequence. Hence it can be
+ used for further processing of the batch.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "File name of vector sequence". The name of the file containing the vector sequence.\par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Entering readings into the project database (Assembly)\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Readings are entered into the database using the auto assemble function. This function compares each reading and its complement with a consensus of all the readings already stored in the database. If it finds any overlaps it aligns the overlapping sequence
+s by inserting padding characters, and then adds the new reading to the database. Readings that overlap are added to existing contigs and readings that do not overlap any data in 
+the database start new contigs. If a new reading overlaps two contigs they are joined. Any readings that appear to overlap but which cannot be aligned sufficiently well are not entered and have their names written to a file of failed gel reading names. Not
+e that it is possible that a reading may align well with two contigs (indicating a possible join) but that after it has been added to one of the contigs, the two contigs do not align sufficiently well. In this case, although the reading has been entered in
+to the database its name will also be added to the file of failed readings. Alignments using more than the maximum number of paddings characters, or exceeding the maximum mismatch may be displayed, but the readings will not be entered into the database. It
+ is advisable to set the consensus cutoff to 51% before running the assembly routine as this will improve the alignments. A typical run of the assembly routine is shown in figure 4.1.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Accept "Permit entry"\par 
+2.\tab Accept "Use file of file names"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "File of gel reading names". The name of the input file of file names, probably passed on from "Screen against vector".\par 
+4.\tab Define "File for names of failures". A file to contain the names of the readings that the program fails to enter, or for which joins are not made.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Select "Perform normal shotgun assembly"\par 
+6.\tab Accept "Permit joins"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Minimum initial match". Only possible overlaps containing exact matches of at least this number of consecutive identical characters will be considered for alignment.\par 
+8.\tab Define "Maximum number of pads per reading" This is the maximum number of padding characters permitted in any new reading during the alignment procedure\par 
+9.\tab Define "Maximum number of pads per reading in contig"  This is the maximum number of padding characters permitted in the contig in order to align any new reading.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Define "Maximum percent mismatch after alignment"\par 
+\pard\plain \li560\ri500\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 Automatic sequence assembler\par 
+\pard \li560\ri500\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Database is logically consistent\par 
+? (y/n) (y) Permit entry\par 
+? (y/n) (y) Use file of file names\par 
+? File of gel reading names=demo.nam\par 
+? File for names of failures=demo.fail\par 
+Select entry mode\par 
+X  1 Perform normal shotgun assembly\par 
+   2 Put all sequences in one contig\par 
+   3 Put all sequences in new contigs\par 
+? Selection  (1-3) (1) =\par 
+? (y/n) (y) Permit joins\par 
+? Minimum initial match (12-4097) (15) =\par 
+? Maximum pads per gel (0-25) (8) =\par 
+? Maximum pads per gel in contig (0-25) (8) =\par 
+? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =\par 
+\par 
+Results skipped to save space\par 
+\par 
+>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>\par 
+Processing  4 in batch\par 
+Gel reading name=hinw.009                                \par 
+Gel reading length=   292\par 
+Working\par 
+Contig    1 position   263 matches strand 1 at position   14\par 
+Contig    2 position     1 matches strand 1 at position  156\par 
+\pard \li560\ri500\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Total matches found  2\par 
+Trying to align with contig  1\par 
+Padding in contig=    1 and in gel=    0\par 
+Percentage mismatch after alignment =  2.9\par 
+Best alignment found\par 
+       251        261        271        281\par 
+         aattacagcg tt,cctattg acgggcgcat ccac\par 
+         ********** ** ** **** ********** ****\par 
+         aattacagcg ttcccvattg acgggcgcat ccac\par 
+         1         11         21         31\par 
+Trying to align with contig  2\par 
+Padding in contig=    0 and in gel=    2\par 
+Percentage mismatch after alignment =  1.4\par 
+Best alignment found\par 
+         1         11         21         31         41         51\par 
+         tgcacgacat cgagtatgag agttatatcc cgggcgcgct ctgcttgtac atggacctca\par 
+         ********** ********** ********** ********** ********** **********\par 
+         tgcacgacat cgagtatgag agttatatcc cgggcgcgct ctgcttgtac atggacctca\par 
+       156        166        176        186        196        206\par 
+        61         71         81         91        101        111\par 
+         tgtacctctt tgtctccgtg ctctacttca tgccctccga gcccggcagc gcccacactg\par 
+         ********** ********** ********** ********** ***** ** * **********\par 
+         tgtacctctt tgtctccgtg ctctacttca tgccctccga gcccg,ca,c gcccacactg\par 
+       216        226        236        246        256        266\par 
+       121        131\par 
+         ctcagacgac ggtcgctgc\par 
+         ********** *********\par 
+         ctcagacgac ggtcgctgc\par 
+       276        286\par 
+Overlap between contigs     2 and     1\par 
+Length of overlap between the contigs=  -122\par 
+Entering the new gel reading into contig     1\par 
+This gel reading has been given the number      4\par 
+Working\par 
+Trying to align the two contigs\par 
+Padding in contig=    2 and in gel=    0\par 
+Percentage mismatch after alignment =  1.5\par 
+Best alignment found\par 
+       406        416        426        436        446        456\par 
+         tgcacgacat cgagtatgag agttatatcc cgggcgcgct ctgcttgtac atggacctca\par 
+         ********** ********** ********** ********** ********** **********\par 
+         tgcacgacat cgagtatgag agttatatcc cgggcgcgct ctgcttgtac atggacctca\par 
+         1         11         21         31         41         51\par 
+       466        476        486        496        506        516\par 
+         tgtacctctt tgtctccgtg ctctacttca tgccctccga gcccg,ca,c gcccacactg\par 
+         ********** ********** ********** ********** ***** ** * **********\par 
+         tgtacctctt tgtctccgtg ctctacttca tgccctccga gcccggcagc gcccacactg\par 
+        61         71         81         91        101        111\par 
+       526        536\par 
+         ctcagacgac ggtcgct\par 
+         ********** *******\par 
+         ctcagacgac ggtcgct\par 
+       121        131\par 
+Editing contig     1\par 
+\pard \li560\ri500\sa100\sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth Completing the join between contigs     1 and     2\par 
+     (Results for other readings skipped to save space)\par 
+\pard \li560\ri500\sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth            Batch finished\par 
+           9 sequences processed\par 
+           9 sequences entered into database\par 
+\pard \li560\ri500\sa100\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth            2 joins made\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb60\sa120\sl240\tx1140 \f21\fs20 Figure 4.1\tab Part of a typical run of "Auto assemble".\par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Searching for internal joins \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The purpose of this function is to use data already in the database to find possible joins between contigs. Although most joins will be made automatically during assembly, due to poor alignments, some may not have been done. The function is particularly us
+eful for sequences from fluorescent sequencing machines because it may be possible to find potential joins within the unused data from the 3' ends of  readings. For each potential
+ join found, when the X version is used, the contig joining editor is automatically called up with the two contigs aligned in the edit windows.\par 
+\pard \s4\qj\sa120\sl280 
+The program strategy is as follows. Take the first contig and calculate its consensus. If unused data is being employed, examine all readings that are in the complementary orientation, and sufficiently near to the contigs left end, to see if they have suff
+iciently good unused sequence which, if present, would protrude from the left end of the contig. If found add th
+e longest such sequence to the left end of the consensus. Do the same for the right end by examining readings that are in their original orientation. Repeat the consensus calculations and extensions for all contigs hence producing an extended consensus for
+ the whole database. If unused data is not being employed simply calculate the consensus for the whole database. Now look for possible joins by processing the extended consensus in the following way. Take the last, say 500, bases (termed the "probe length"
+ by the program) of the rightmost consensus, compare it in both orientations with the extended consensus of all the other contigs. Display any sufficiently good alignments. Repeat with the left end of the rightmost contig. Do the same for the ends of all t
+he contigs, always comparing only with the contigs to their left, so that the same matches do not appear twice. \par 
+\pard \s4\qj\sa120\sl280 Good unused data is defined by sliding a window of "Window size for good data scan" bases outwards along the sequence and stopping when greater
+ than "Maximum number of dashes in scan window" appear in the window. Note that it is advisable to have some sort of cutoff because if we simply take all the data it might be of such poor quality that we wont find any good matches. An initial run employing
+ no unused data is also recommended. Sufficiently good alignments are defined by criteria equivalent to those used in auto assemble, however here we only display alignments that pass all tests.\par 
+\pard \s4\qj\sa120\sl280 All numbering is relative to base number one in the contig\: ma
+tches to the left (i.e. in the unused data) have negative positions, matches off the right end of the contig (i.e. in the unused data) have positions greater than the contig length. The convention for reporting the orientations of overlaps is as follows\:
+ i
+f neither contig needs to be complemented the positions are as shown. If the program says "contig x in the - sense" then the positions shown assume contig x has been complemented. For example in the results given in figure 4.2 the positions for the first o
+verlap are as reported, but those for the second assume that the contig in the minus sense (i.e. 443) has been complemented.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find internal joins".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Minimum initial match". Only matches containing this number of consecutive identical characters will be found.\par 
+3.\tab Define "Maximum pads per sequence". Only alignments containing less than or equal this number of padding characters in each sequence will be found.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Maximum percent mismatch after alignment". Only alignments with at lea
+st this level is similarity will be found. Particularly when poor data from the 3' ends of sequences derived from fluorescent sequencing machines is used, it is important to allow for a high degree of mismatch - say around 75%.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Probe length". This is the size of sequence from each end of each contig, that is compared with the total length of all other contigs.\par 
+6.\tab Accept "Employ unused data". This means, where available, add the unused data from the 3' ends of sequences, to the ends of the contigs.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab 
+Define "Window size for good data scan". To decide how much of the unused data should be added to the end of a contig the program scans outwards, counting the numbers of dashes (-) over a window of the size defined here.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Define "Number of dashes in scan window". If the program finds this many dashes in the scan window it will add no more of the unused data to the end of the contig.\par 
+\pard\plain \qj\li680\ri780\sl220\pagebb\box\brsp100\brdrth \f4\fs16 Possible join between contig   445 in the + sense and contig   405\par 
+\pard \li680\ri780\sl220\box\brsp100\brdrth Percentage mismatch after alignment =  4.9\par 
+   412        422        432        442        452        462\par 
+405  TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAG,TT AGCTCACTCA\par 
+      ********* * ********  ***** *** ********** ********** **********\par 
+445  -TTCCCGACT G,AAAGCGGG TAGTGA,CGC AACGCAATTA ATGTGAG-TT AGCTCACTCA\par 
+  -127       -117       -107        -97        -87        -77\par 
+   472        482        492        502        512\par 
+405  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT\par 
+     ********** ********** ********** ********** **\par 
+445  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT\par 
+   -67        -57        -47        -37        -27\par 
+Possible join between contig   443 in the - sense and contig   423\par 
+Percentage mismatch after alignment = 10.4\par 
+    64         74         84         94        104        114\par 
+423  ATCGAAGAAA GAAAAGGAGG AGAAGATGAT TTTAAAAATG AAACG-CGAT GTCAGATGGG\par 
+     **** ***** ********** ********** ******  ** ***** **** *********\par 
+443  ATCG,AGAAA GAAAAGGAGG AGAAGATGAT TTTAAA,,TG AAACGACGAT GTCAGATGG,\par 
+  3610       3620       3630       3640       3650       3660\par 
+   124        134        144        154        164\par 
+423  TTG-ATGAAG TAGAAGTAGG AG-AGGTGGA AGAGAAGAGA GTGGGA\par 
+     *** ****** ********** ** *******  *** ***** ** **\par 
+443  TTGGATGAAG TAGAAGTAGG AGGAGGTGGA ,GAG,AGAGA GTTGG-\par 
+\pard \li680\ri780\sl220\keepn\box\brsp100\brdrth   3670       3680       3690       3700       3710\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.2\tab Typical output from "Find internal joins".\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.6\tab Editing in XBAP\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The XBAP editor is mouse-driven and can insert, delete and change readings in contigs. It has facilities to display the traces for data from fluorescent sequenci
+ng machines and for annotation of readings. In addition it allows the poor quality data from the ends of readings to be viewed and, if required, added to the sequences. \par 
+\pard \s4\qj\sa120\sl280 
+A typical view of the editor is shown in figure 4.3. This includes the edit window showing an 80 character section of a contig, (position 3899 to 3978). Each reading is numbered and named in the left hand panel, minus signs indicating those in their revers
+e orientation. Underneath is their consensus. Some of the sequence letters are lighter
+ than the majority showing that they are "unused". One segment (3933 to 3949) is shaded which signifies that it has been annotated. The editing cursor is at position 3921. Above this window are the main buttons the user employs to direct the editing proces
+s. Below the edit window is a panel showing the traces for readings 37 and 123. Notice they are centred on the cursor position. Here the traces are shown in four different line styles, but on a colour screen they each have different colours. In the bottom 
+of the figure is the search window. These features are described in the relevant sections below.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.1\tab Scrolling through the contig\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The editor allows scrolling from one end of a contig to the other using the scroll bar and scroll buttons and also the arrow keys.\par 
+\pard \s4\qj\sa120\sl280 Action of mouse button presses when the mouse pointer is in the scroll bar\:\par 
+\pard \s4\qj\li1720\sa120\sl280\tx4520 Middle Mouse Button\tab Set editor position\par 
+Left  Mouse Button\tab Scroll forward one screenful\par 
+Right  Mouse Button\tab Scroll backwards one screenful\par 
+\pard\plain \li80\ri20\sl220\keepn\box\brsp20\brdrth \f4\fs16 {{\pict\macpict\picw444\pich344 
+82daffffffff015701bb1101a0008201000affffffff015701bb0900000000000000003100000000015601ba98007e00000000030703e900000000030703e900000000015601ba000102830002830002830007000286aa01a00007000186550140000700028600012000070001860001400007000286000120000b02013ff8
+8a00030ffe40000d0402200807c18c0003089220000f06012c28040110808e0003089240000f06022648040100808e0003089220001007012348040f31e3968f00030f924000100702220807911084598f000308122000100701258804111084508f00030812400010070224c804111084508f00030ff22000100701286804
+111094508f000308024000100702200807cf3863908f0003080220000b02013ff88a00030ffe4000070002860001200007000186000140000700028600012000070001865501400007000286aa01a00002830002830002830002830026e500001ff9ff0087f8ff01f87ff5ff01fe1fefff0087fcff01fe1ff2ff01f87ff2ff
+00e0fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd00380200003cfa
+000203fc03fa0008630c1800018180001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd006502000066fa0002030003fa000ac30c38000380c0001f807ffbff05841f8000003cfd0002087f87fbff07e01fe7fffffe1f81fcff03f0ffff87fdff0dc3f0ffff84186000060002106180fd
+00051f800000600ffe0002084180fb00021fe018fc000020fd006b020000c3fe0008c01800000300030603fd000a01830c7800078060001ff3faff058418c000000cfd0002087f33fbff07e7ffe7fffffe1f9cfcff03fcffff33fdff0d99e67fff84186000060002107180fd000518c000006003fe0002084180fb00021800
+18fc000020fd0072020000c0fe0008c01800000300030603fd000a01830cd8000d8060001ff3fcff07f9ff84186000000cfd0002087e79fbff08e7ffe7cfe7fe1f9e7ffdff1ffcfffe79fff9ffff99e67fff8418600006000210718000006000186000006003fe0002084180fb00041800183018fe000020fd0072020000c0
+fe0000c0fe00040300030003fd000a0301989800098030001ff3fcff07f9ff84186000000cfd0002087e7ffbff08e7ffe7cfe7fe1f9e7ffdff1ffcfffe7ffff9ffff9fe7ffff8418600006000210798000006000186000006003fe0002084180fb00041800183018fe000020fd00731d0000c00f0dc3f0781f4003003b1e0f
+c0f0de000301981800018030001ff3fcff07f9ff84186000000cfd0002087e7ffbff08e7ffe7ffe7fe1f9e7ffdff1ffcfffe7ffff9ffff9fe7ffff8418600006000210798000006000186000006003fe0002084180fb00041800180018fe000020fd007d790000c0198e60c01831c003f06706030198730003019818000180
+30001ff3e47c0f8790e07f841861e1b80c0fc1f078087f3f9e647e1e43ffe7fe270f81fe1f9e7879e787c0fcfffe7f9e607e1f9fe7f03f841866e0761e02106d878619f8001866f0786e0301e16c0841801e0fc61878001801d8f07e0786f020fd007d790000c030cc30c01831800300c30603030c60000300f01800018030
+001ff3e339e733c679ff8418c331cc0c186318cc087f879e633ccf19ffe07cc7cfe7fe1f9cf339e7339e7cfffe7f9e79fccf9fe7e79f84186730ce3302106d8cc330600018c398cc73030331fe08418033186618cc001f833830180cc39820fd007d790000c030cc30c01831800300c30603030c60000300f0180001803000
+1ff3e799fe79cff9ff841f8619860c00660186087ff39e6799e73fffe7f9e7cfe7fe1f81e79cce79fe7cfffe7f9e79f9e60781e7ff8418661986618210679861e060001f83018661830619b6084180618063318600180618301818630020fd007d790000c030cc30c01831800300c30603030c60000180f01800018060001f
+f3e79c0e01cff9ff841987f9860c0fe601fe087ff99e6798073fffe7f9e7cfe7fe1f99e01cce01c07cfffe7f9e79f9e79fe7f03f8418661986618210679fe0c0600018030186618307f9b60841807f8fe331fe00180618301818630020fd007d790000c330cc30c0181f000300c30603030c60000180601800018060001ff3
+e79fe67fcff9ff8418c601860c18660180087ff99e6799ff3fffe7f9e7cfe7fe1f9ce7fe1e7f9e7cfffe7f9e79f9e79fe7ff9f8418661986618210639800c060001803018661830601b6084180601861e18000180618301818630020fd007d79000066198c30cc18300003006706033198600000c06018070180c0001ff3e7
+9fe67fcff9ff8418c601860c18660180087e799e6799ff3fffe7f9e7cfe7fe1f9ce7fe1e7f9e7cfffe799e79f9e79fe7ff9f8418661986618210639801e060001803018661830601b6084180601861e18000180618301818630020fd007d7900003c0f0c3078ff1f8003fc3b3fc1e0f06000006060ff070ff180001ff3e799
+e739cff99f84186319cc0c186318c6087f33cc633ce73fffe7fcc7cfe67e1f9e739f3f399e7cffff33cc799ccf9fe7e79f840cc618ce330210618c63306600180300cc73030319b6084180319860c0c60018033830198cc30020fd0068f9000130c0ef005d1f80679c0f83cffc3f841861f1b87f8fa1f07c087f87e2647e0f
+3fffe01e2601f0fe1f9e783f3f83c1601fff87e27c3e1f9fe7f03f84078618761e02106187c6183c00180300786e1fe1f1b60841fe1f0fa0c07c001fe1d9fe0f07830020fd0032f9000130c0ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe1fefff0084fc00010210f200010840f2000020fd0032f9000130c0
+ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe1fefff0084fc00010210f200010840f2000020fd0032f900011f80ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe1fefff0084fc00010210f200010840f2000020fd002de500001ff9ff048400000180fc0004087fffffe7f8ff01fe1fefff0084fc
+00010210f200010840f2000020fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2000020fd0026e500001ff9ff0084f80001087ff5ff01fe1fefff0084fc00010210f200010840f2
+000020fd0026e500001ff9ff0087f8ff01f87ff5ff01fe1fefff0087fcff01fe1ff2ff01f87ff2ff00e0fd000283000283000283000283000283000283000283000283000901001f88ff00feff001a010010fc000006fe00010180fe000060fc00000c9d000002ff001f010010fc000006fe00010180fe000060fc00000cc2
+000102a8de000002ff001f010010fc000006fe00010180fe000060fc00000cc200010554de000002ff001f010010fc000006fe00010180fe000060fc00000cc2000102a8de000002ff002316001000004010000600004001800200006000100400000cc200010554de000002ff00231600100000c03000060000c001800300
+006000180600000cc2000102a8de000002ff0023160010000180600006000180018001800060000c0300000cc200010554de000002ff0023160010000300c00006000300018000c0006000060180000cc2000102a8de000002ff0023160010000601800006000600018000600060000300c0000cc200010554de000002ff00
+23160010000c03000006000c0001800030006000018060000cc2000102a8de000002ff00231600100018060000060018000180001800600000c030000cc200010554de000002ff0023160010000c03000006000c0001800030006000018060000cc2000102a8de000002ff0023160010000601800006000600018000600060
+000300c0000cc200010554de000002ff0023160010000300c00006000300018000c0006000060180000cc2000102a8de000002ff0023160010000180600006000180018001800060000c0300000cc200010554de000002ff00231600100000c03000060000c001800300006000180600000cc2000102a8de000002ff002316
+001000004010000600004001800200006000100400000cc200010554de000002ff001f010010fc000006fe00010180fe000060fc00000cc2000102a8de000002ff001f010010fc000006fe00010180fe000060fc00000cc200010554de000002ff001f010010fc000006fe00010180fe000060fc00000cc2000102a8de0000
+02ff001f010010fc000006fe00010180fe000060fc00000cc200010554de000002ff001f010010fc000006fe00010180fe000060fc00000cc2000102a8de000002ff001f010010fc000006fe00010180fe000060fc00000cc200010554de000002ff001a010010fc000006fe00010180fe000060fc00000c9d000002ff0009
+01001f88ff00feff000901001f88ff00feff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff004a010010ed00030c0300c0fa00040781e0300cf90004781e
+0780c0fa00040781e0780cf90004781e0040c0fa00040781e1fe0cf90004781e0780c0fa00040781e1fe0cf90002781e02ff004a010010ed00030c0781e0fa00040cc330701ef90004cc330cc1e0fa00040cc330cc1ef90004cc3300c1e0fa00040cc331801ef90004cc330cc1e0fa00040cc330061ef90002cc3302ff004e
+010010ed00030c0cc330fa0004186618f033fa0005018661986330fa00041866198633fa000501866181c330fa00041866198033fa0005018661986330fa00041866180633fa000301866182ff004e010010ed00030c0cc330fa0004186619b033fa0005018661986330fa00041866198633fa000501866183c330fa000418
+66198033fa0005018661980330fa00041866180c33fa000301866182ff004a010010ed00030c186618f900046619306180fa00040661806618f900046618066180fa0004066186c618f900046619806180fa00040661980618f9000466180c6180fa0002066182ff004a010010ed00030c186618f90004c618306180fa0004
+0c61806618f90004c6180c6180fa00040c618cc618f90004c619b86180fa00040c619b8618f90004c618186180fa00020c6182ff004e010010ed00030c186618fa0005038338306180fa0004383380c618fa0005038338386180fa0004383398c618fa0005038339cc6180fa000438339cc618fa0005038338186180fa0002
+383382ff004a010010ed00030c186618f90004c1d8306180fa00040c1d838618f90004c1d80c6180fa00040c1d98c618f90004c1d8066180fa00040c1d986618f90004c1d8306180fa00020c1d82ff004a010010ed00030c186618f900046018306180fa00040601860618f900046018066180fa000406019fe618f9000460
+18066180fa00040601986618f900046018306180fa0002060182ff004e010010ed00030c0cc330fa00041860183033fa00050186018c0330fa00041860198633fa000501860180c330fa00041860180633fa0005018601986330fa00041860186033fa000301860182ff004e010010ed00030c0cc330fa00041866183033fa
+0005018661980330fa00041866198633fa000501866180c330fa00041866198633fa0005018661986330fa00041866186033fa000301866182ff004a010010ed00030c0781e0fa00040cc330301ef90004cc331801e0fa00040cc330cc1ef90004cc3300c1e0fa00040cc330cc1ef90004cc330cc1e0fa00040cc330c01ef9
+0002cc3302ff004a010010ed00030c0300c0fa00040781e1fe0cf90004781e1fe0c0fa00040781e0780cf90004781e00c0c0fa00040781e0780cf90004781e0780c0fa00040781e0c00cf90002781e02ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d0100
+10ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff0076010010fc00041e0003c078f700650c0781e0301e0307f9fe0c0780c0300c0787f9fe0c0781e1fe1e1fe1e0000c0300c0781e0787f8781e0300c1fe1e0300c0781e0780c0301e1fe7f8
+301e0300c0787f8781e1fe1e0301e0300c1fe1e0300c1fe0c1fe7f8307f8780c0301e0780c0781e0300c02ff0076010010fc0004330000c0ccf700650c0cc33078330780c0301e0cc1e0781e0cc0c0301e0cc3303033030330001e0781e0cc330cc0c0cc330781e030330781e0cc330cc1e078330300c078330781e0cc0c0c
+c3303033078330781e030330781e0301e0300c0780c0cc1e078330cc1e0cc330781e02ff0076010010fc0004618000c186f700650c186618cc618cc0c03033186330cc331860c03033186618306183061800330cc33186619860c186618cc33030618cc3318661986330cc618300c0cc618cc331860c18661830618cc618cc
+33030618cc33030330300c0cc0c186330cc6198633186618cc3302ff0076010010fc0004618000c186f700650c180600cc600cc0c03033180330cc331800c030331806003060030600cc330cc33180601800c180600cc33030600cc3318060180330cc600300c0cc600cc331800c18060030600cc600cc33030600cc330303
+30300c0cc0c180330cc6018033180600cc3302ff0076010010fc0004018000c006f700650c18060186601860c0306198061986619800c030619806003060030600cc6198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c18061
+9866018061980601866182ff007b010010fc0009018000c006000fc1e076fc00650c18060186601860c0306198061986619800c030619806003060030600786198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c18061986601
+8061980601866182ff007d010010fe000b01fe030000c00c00186330cefc00650c18067986679860c03061980619866199e0c030619806003067830601fe61986619806799e0c19e6018661830679866199e6018061986678300c18667986619800c180678306018667986618306798661830618300c1860c1806198667980
+6199e601866182ff007b010010fc00090e0000c0380018061986fc00650c180619fe619fe0c0307f9807f9fe7f9860c0307f9806003061830600787f9fe7f980619860c186601fe7f830619fe7f986601807f9fe618300c1fe619fe7f9800c18061830601fe619fe7f830619fe7f8307f8300c1fe0c1807f9fe619807f9866
+01fe7f82ff007b010010fc0009180000c060000fc7f986fc00650c18061986619860c0306198061986619860c030619806003061830600cc6198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c1806198661980619866018661
+82ff007b010010fc0009300000c0c00000660186fc00650c18061986619860c0306198061986619860c030619806003061830600cc6198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c180619866198061986601866182ff00
+7b010010fc0009600000c1800000660186fc00650c18661986619860c0306198661986619860c030619866183061830618006198661986619860c186619866183061986619866198661986618300c18661986619860c186618306198661986618306198661830618300c1860c186619866198661986619866182ff007b0100
+10fc0009600000c1800e186318cefc00650c0cc33986339860c030618cc61986618ce0c030618cc33030338303300061986618cc338ce0c0ce331866183033986618ce330cc61986338300c18633986618cc0c0cc338303318633986618303398661830618300c1860c0cc61986338cc618ce331866182ff007b010010fc00
+097f8007f9fe0e0fc1f076fc00650c0781e9861e9860c03061878619866187a0c030618781e0301e8301e00061986618781e87a0c07a1e186618301e9866187a1e078619861e8300c1861e986618780c0781e8301e1861e986618301e98661830618300c1860c078619861e8786187a1e1866182ff0011010010f3000006fc
+00000c9d000002ff0011010010f3000006fc00000c9d000002ff0011010010f3000006fc00000c9d000002ff0011010010f3000006fc00000c9d000002ff000d010010ed00000c9d000002ff0013010010ed00000cd7000002ecaadd000002ff0013010010ed00000cd7000001ec55dd000002ff007e010010fd000c787f80
+0000041e0001e0000003fe00650c0781e0301e0307f9fe0c0780c0300c0787f9fe0c0781e1fe1e1fe1e0000c0300c0781e0787f8781e0300c3febeabaaeafabeafaaeababebfeffababeabaaeafa7f8781e1fe1e0301e0300c1fe1e0300c1fe0c1fe7f8307f8780c0301e0780c0781e0300c02ff007e010010fd000ccc0180
+00000c33000330000007fe00650c0cc33078330780c0301e0cc1e0781e0cc0c0301e0cc3303033030330001e0781e0cc330cc0c0cc330781e1757757d5f5dd775dd5f57d775755d57d7757d5f5dd0c0cc3303033078330781e030330781e0301e0300c0780c0cc1e078330cc1e0cc330781e02ff007f010010fe000d018601
+8000001c6180061800000ffe00650c186618cc618cc0c03033186330cc331860c03033186618306183061800330cc33186619860c186618cc332baebaeebbbaeebbaebbaeeebabaaeaeeebaeebbbae0c18661830618cc618cc33030618cc33030330300c0cc0c186330cc6198633186618cc3302ff007f010010fe00020186
+03fe00073c6000061800001bfe00650c180600cc600cc0c03033180330cc331800c030331806003060030600cc330cc33180601800c180600cc33175755dd775d5755d5775dd755755d5dd755dd775d50c18060030600cc600cc33030600cc33030330300c0cc0c180330cc6018033180600cc3302ff007e010010fd000106
+03fe00076c60000618000013fe00650c18060186601860c0306198061986619800c030619806003060030600cc6198661980601800c1806018661abaeabaeebbaaeabaaebbaeeaabaaebaeeabaeebbaa0c180600306018660186618306018661830618300c1860c180619866018061980601866182ff007e010010fd000c0c
+060003f0cc6e07c618003f03fe00650c18060186601860c0306198061986619800c030619806003060030600786198661980601800c1806018661975755d775dd5755d575dd7755755d5d7755d775dd50c180600306018660186618306018661830618300c1860c180619866018061980601866182ff007f120010000007f8
+38060006198c730c6338006183fe00650c18067986679860c03061980619866199e0c030619806003067830601fe61986619806799e0c19e6018661abaefbaeebbbeeabaaebbaeefabaaebaeefbaeebbaa0c180678306018667986618306798661830618300c1860c18061986679806199e601866182ff007e010010fd000c
+0c0c0000198c619801d8006003fe00650c180619fe619fe0c0307f9807f9fe7f9860c0307f9806003061830600787f9fe7f980619860c186601fe7f97575dff7fdd7755d57fdff75d755d5ff75dff7fdd50c18061830601fe619fe7f830619fe7f8307f8300c1fe0c1807f9fe619807f986601fe7f82ff007e010010fd000c
+060c0003f9fe61980018003f03fe00650c18061986619860c0306198061986619860c030619806003061830600cc6198661980619860c1866018661abaebbaeebbaeeabaaebbaeebabaaebaeebbaeebbaa0c180618306018661986618306198661830618300c1860c180619866198061986601866182ff007f010010fe000d
+0186180006180c61980018000183fe00650c18061986619860c0306198061986619860c030619806003061830600cc6198661980619860c186601866197575dd775dd7755d575dd775d755d5d775dd775dd50c180618306018661986618306198661830618300c1860c180619866198061986601866182ff007f010010fe00
+0d0186180006180c61980618000183fe00650c18661986619860c0306198661986619860c030619866183061830618006198661986619860c1866198661abaebbaeebbaeebbaeebbaeebabaaebaeebbaeebbae0c186618306198661986618306198661830618300c1860c186619866198661986619866182ff007e010010fd
+000ccc300006180c330c6330386183fe00650c0cc33986339860c030618cc61986618ce0c030618cc33030338303300061986618cc338ce0c0ce331866197577dd775ddf775dd75dd777d755d5d777dd775ddd0c0cc338303318633986618303398661830618300c1860c0cc61986338cc618ce331866182ff007e010010fd
+007578300003e80c1e07c1e0383f1fe000000c0781e9861e9860c03061878619866187a0c030618781e0301e8301e00061986618781e87a0c07a1e18661ababebaeebafabeafaebbaebeabaaebaebebaeebafa0c0781e8301e1861e986618301e98661830618300c1860c078619861e8786187a1e1866182ff0013010010ed
+00000cd7000001ec55dd000002ff0013010010ed00000cd7000002ecaadd000002ff0013010010ed00000cd7000001ec55dd000002ff0013010010ed00000cd7000002ecaadd000002ff0013010010ed00000cd7000001ec55dd000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff007f010010
+fe000dc0781e000001fe0c000010000003fe00650c02808028080aa2a8200a02000020080282a8aa080280a0aa001fe1e0300c0300c0781e0787f8781e0300c1fe1e0300c0781e0780c0301e1fe7f8301e0300c0787f8781e1fe1e0301e0300c1fe1e0300c1fe0c1fe7f8307f8780c0301e0780c0781e0300c02ff007f1200
+10000001c0cc33000001801c000030000007fe00650c04414044140100405011050000501404404010140441101000030330781e0781e0cc330cc0c0cc330781e030330781e0cc330cc1e078330300c078330781e0cc0c0cc3303033078330781e030330781e0301e0300c0780c0cc1e078330cc1e0cc330781e02ff007f12
+0010000003c18661800001803c00007000000ffe00650c08222082220200808820888000882208208020220822082000030618cc330cc33186619860c186618cc33030618cc3318661986330cc618300c0cc618cc331860c18661830618cc618cc33030618cc33030330300c0cc0c186330cc6198633186618cc3302ff007f
+120010000006c18661800001806c0000f000001bfe00650c10011100110100404440044000441110004010111004001000030600cc330cc33180601800c180600cc33030600cc3318060180330cc600300c0cc600cc331800c18060030600cc600cc33030600cc33030330300c0cc0c180330cc6018033180600cc3302ff00
+7f120010000004c00601800001804c0001b0000013fe00650c08020880208200808220082000822088008020208802002000030601866198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c180619866018061980601866182ff
+007f010010fe000dc006030003f1b80c07c330003f03fe00650c10041100410100410440104001044110004010411004001000030601866198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c180619866018061980601866182
+ff007f120010001fe0c00c0e000619cc0c0c6630006183fe00650c08a2088a2082008082200822a8822088a0802020880200202a8306018661986619806799e0c19e6018661830679866199e6018061986678300c18667986619800c180678306018667986618306798661830618300c1860c18061986679806199e6018661
+82ff007f010010fe000dc03803000018060c180630006003fe00650c10455104550100415440154001545510404010551004001000030601fe7f9fe7f980619860c186601fe7f830619fe7f986601807f9fe618300c1fe619fe7f9800c18061830601fe619fe7f830619fe7f8307f8300c1fe0c1807f9fe619807f986601fe
+7f82ff007f010010fe000dc060018003f8060c1807f8003f03fe00650c08220882208200808220082000822088208020208802002000030601866198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c180619866198061986601
+866182ff007f010010fe000dc0c061800618060c180030000183fe00650c10441104410100410440104001044110404010411004001000030601866198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c1806198661980619866
+01866182ff007f010010fe000dc18061800619860c180030000183fe00650c08220882208200808220882000822088208020208822082000030619866198661986619860c186619866183061986619866198661986618300c18661986619860c186618306198661986618306198661830618300c1860c18661986619866198
+6619866182ff007f010010fe000dc18033000618cc0c0c6030386183fe00650c044410444101004104111040010441044040104104411010000303318661986618cc338ce0c0ce331866183033986618ce330cc61986338300c18633986618cc0c0cc338303318633986618303398661830618300c1860c0cc61986338cc61
+8ce331866182ff007f7b0010000007f9fe1e0003e8787f87c030383f1fe000000c02a2082a20820080820a082000822082a08020208280a020000301e18661986618781e87a0c07a1e186618301e9866187a1e078619861e8300c1861e986618780c0781e8301e1861e986618301e98661830618300c1860c078619861e878
+6187a1e1866182ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff0040010010fe000bc0307f800001fe0c0000c030fe
+0002c0000cbf002201e1fe1e0301e0300c1fe1e0300c1fe0c1fe7f8307f8780c0301e0780c0781e0300c02ff0040160010000001c07801800001801c0001c070000001c0000cbf002203303033078330781e030330781e0301e0300c0780c0cc1e078330cc1e0cc330781e02ff0040160010000003c0cc01800001803c0003
+c0f0000003c0000cbf0022061830618cc618cc33030618cc33030330300c0cc0c186330cc6198633186618cc3302ff0040160010000006c0cc03000001806c0006c1b0000006c0000cbf0022060030600cc600cc33030600cc33030330300c0cc0c180330cc6018033180600cc3302ff0040160010000004c1860300000180
+4c0004c130000004c0000cbf00220600306018660186618306018661830618300c1860c180619866018061980601866182ff0040010010fe0011c186060003f1b80c0fc0c030000fc0c0000cbf00220600306018660186618306018661830618300c1860c180619866018061980601866182ff0040010010fe0011c1860600
+0619cc0c1860c030001860c0000cbf00220678306018667986618306798661830618300c1860c18061986679806199e601866182ff0040010010fe0011c1860c000018060c0060c030001800c0000cbf0022061830601fe619fe7f830619fe7f8307f8300c1fe0c1807f9fe619807f986601fe7f82ff0040010010fe0011c1
+860c0003f8060c0fe0c030000fc0c0000cbf00220618306018661986618306198661830618300c1860c180619866198061986601866182ff0040010010fe0011c0cc18000618060c1860c030000060c0000cbf00220618306018661986618306198661830618300c1860c180619866198061986601866182ff0040010010fe
+0011c0cc18000619860c1860c030000060c0000cbf00220618306198661986618306198661830618300c1860c186619866198661986619866182ff0040010010fe0011c07830000618cc0c1860c0300e1860c0000cbf00220338303318633986618303398661830618300c1860c0cc61986338cc618ce331866182ff004016
+0010000007f830300003e8787f8fa7f9fe0e0fc7f8000cbf002201e8301e1861e986618301e98661830618300c1860c078619861e8786187a1e1866182ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c
+9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff007e010010fd0001781efe0007041e03c1e0000003fe00650c0200a0aa2a8200a0282a8282a82808020080280a0282a8280a020080aa0a020080280a028080200a0aa2a8200a020080282a8280a0aa0a0200a020080aa0a020080002a820
+2a8aa080aa0a020080280a0200a028080200a028000280a0280a0202a8aa0a02ff007e010010fd0001cc33fe00070c33066330000007fe00650c050110100405011044040440404414050140441104404044110501401011050140441104414050110100405011050140440404411010110501105014010110501400004050
+0401014010110501404411050110441405011044000441104411050040101102ff007f010010fe000d0186618000001c6186661800000ffe001b0c088208200808820882080820808222088220822088208082208882fe20298882208220882220882082008088208882208208082208202088820888220202088822000080
+88080202fe20198882208220888208822208820882000822088220888080202082ff007f010010fe0002018060fe00073c6006061800001bfe00650c04440010040444010004100041001104411100401000410040044110104004411100401001104440010040444004411100041004001040044400441101040044110000
+40440401011010400441110040044401001104440100001004010040044040104002ff007f010010fe0002018060fe00076c60060018000013fe00650c082200200808220080080800808020882208802008008080200822082020082208802008020882200200808220082208800808020020200822008220820200822080
+0080820802020820200822088020082200802088220080000802008020082080202002ff007f010010fe000d01b86e0003f0cc6e060030003f03fe00650c1044001004104401000410004100411044110040100041004010441010401044110040100411044001004104401044110004100400104010440104410104010441
+000041040401041010401044110040104401004110440100001004010040104040104002ff007f120010000007f9cc730006198c730600e0006183fe00650c0822282008082200800808a0808020882208802288a0808a20082208202288220880200802088222820080822288220880080802282020082228822082022882
+208aa080820802020820200822088a2008222880208822008a2a8802288a20082080202282ff007f010010fe000d0186618000198c619f8030006003fe00650c154410100415440100041040410055154551004110404104401545501041154551004010055154410100415441154551000410041010401544115455010411
+5455000041540401055010401545510440154411005515440104001004110440154040104102ff007f010010fe000d0186618003f9fe61860018003f03fe00650c0822082008082200800808208080208822088020882080822008220820208822088020080208822082008082208822088008080208202008220882208202
+088220800080820802020820200822088220082208802088220082000802088220082080202082ff007f010010fe000d0186618006180c61860618000183fe00650c10441010041044010004104041004110441100411040410440104410104110441100401004110441010041044110441100041004101040104411044101
+04110441000041040401041010401044110440104411004110440104001004110440104040104102ff007f010010fe000d0186618006180c61860618000183fe00650c082208200808220882080820808220882208822088208082208822082020882208822088220882208200808220882208820808220820208822088220
+8202088220800080820802020820208822088220882208822088220882000822088220882080202082ff007e010010fd000ccc330006180c33060330386183fe00650c104110100410411044040440404441104410441104404044111044101011104410441104441104110100410411104410440404411010111041110441
+0101110441000041040401041010111044104411104110444110411044000441104411104040101102ff007e010010fd0075781e0003e80c1e0601e0383f1fe000000c0820a820080820a0280802a0802820882208280a82a0802a0a082208200a882208280a028208820a820080820a88220828080280a8200a0820a88220
+8200a882208000808208020208200a0822082a0a0820a828208820a02a000280a82a0a082080200a82ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff0012010010ed00000ce6000103ffba
+000002ff0012010010ed00000ce6000103ffba000002ff007b010010fa007201e078618787f9861e1861e0000c0781e0301e0307f9fe0c0780c0300c0787f9fe0c0781e1fe1e1fe1e3ff0c0300c0781e0787f8781e0300c1fe1e0300c0781e0780c0301e1fe7f8301e0300c0787f8781e1fe1e0301e0300c1fe1e0300c1fe0
+c1fe7f8307f8780c0301e0780c0781e0300c02ff007b010010fa00720330cc718cc601c633186330000c0cc33078330780c0301e0cc1e0781e0cc0c0301e0cc3303033030333ff1e0781e0cc330cc0c0cc330781e030330781e0cc330cc1e078330300c078330781e0cc0c0cc3303033078330781e030330781e0301e0300c
+0780c0cc1e078330cc1e0cc330781e02ff007b010010fa007206198671986601c661986618000c186618cc618cc0c03033186330cc331860c03033186618306183061bff330cc33186619860c186618cc33030618cc3318661986330cc618300c0cc618cc331860c18661830618cc618cc33030618cc33030330300c0cc0c1
+86330cc6198633186618cc3302ff007b010010fa007206018679980601e660186600000c180600cc600cc0c03033180330cc331800c030331806003060030603ff330cc33180601800c180600cc33030600cc3318060180330cc600300c0cc600cc331800c18060030600cc600cc33030600cc33030330300c0cc0c180330c
+c6018033180600cc3302ff007b010010fa007206018679980601e660186600000c18060186601860c0306198061986619800c030619806003060030603ff6198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c1806198660180
+61980601866182ff007b010010fa00720601866d8c0601b630186300000c18060186601860c0306198061986619800c030619806003060030603ff6198661980601800c180601866183060186619806018061986600300c18660186619800c180600306018660186618306018661830618300c1860c1806198660180619806
+01866182ff007b010010fa00720601866d8787e1b61e1861e0000c18067986679860c03061980619866199e0c0306198060030678306020161986619806799e0c19e6018661830679866199e6018061986678300c18667986619800c180678306018667986618306798661830618300c1860c18061986679806199e6018661
+82ff007b010010fa00720601866780c6019e03186030000c180619fe619fe0c0307f9807f9fe7f9860c0307f9806003061830603ff7f9fe7f980619860c186601fe7f830619fe7f986601807f9fe618300c1fe619fe7f9800c18061830601fe619fe7f830619fe7f8307f8300c1fe0c1807f9fe619807f986601fe7f82ff00
+7b010010fa0072060186678066019e01986018000c18061986619860c0306198061986619860c030619806003061830603ff6198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c180619866198061986601866182ff007b0100
+10fa0072060186638066018e01986018000c18061986619860c0306198061986619860c030619806003061830603ff6198661980619860c186601866183061986619866018061986618300c18661986619800c180618306018661986618306198661830618300c1860c180619866198061986601866182ff007b010010fa00
+72061986639866018e61986618000c18661986619860c0306198661986619860c03061986618306183061bff6198661986619860c186619866183061986619866198661986618300c18661986619860c186618306198661986618306198661830618300c1860c186619866198661986619866182ff007b010010fa00720330
+cc618cc60186330cc330000c0cc33986339860c030618cc61986618ce0c030618cc3303033830333ff61986618cc338ce0c0ce331866183033986618ce330cc61986338300c18633986618cc0c0cc338303318633986618303398661830618300c1860c0cc61986338cc618ce331866182ff007b010010fa007201e0786187
+87f9861e0781e0000c0781e9861e9860c03061878619866187a0c030618781e0301e8301e3ff61986618781e87a0c07a1e186618301e9866187a1e078619861e8300c1861e986618780c0781e8301e1861e986618301e98661830618300c1860c078619861e8786187a1e1866182ff0012010010ed00000ce6000103ffba00
+0002ff0012010010ed00000ce6000103ffba000002ff0012010010ed00000ce6000103ffba000002ff0012010010ed00000ce6000103ffba000002ff0012010010ed00000ce6000103ffba000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed00000c9d000002ff000d010010ed
+00000c9d000002ff000901001f88ff00feff0006fe008955fe0006fe0089aafe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe000dfe000187ff8e000201ffc2fe000ffe0003440100f8900002011241fe000ffe000385850020900002011242fe000ffe000344c90020900002011241fe
+0013fe000784690022c71c71c094000201f242fe0013fe00074441002320a28a20940002010241fe0013fe000784b1002207a0f980940002010242fe0013fe00074499002208a0804094000201fe41fe0013fe0007850d002208a28a20940002010042fe0013fe000744010022079c71c0940002010041fe000dfe000187ff
+8e000201ffc2fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe0006fe0089aafe0006fe008955fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe0012fe000083fdff00f0fdff00fc95000002fe0013fe000042fd000110
+80fe00000495000001fe0013fe000082fd00011080fe00000495000002fe0013fe000042fd00011080fe00000495000001fe0013fe000082fd00011080fe00000495000002fe0014fe000a420000180010860301800495000001fe0014fe000a82000018c010860301800495000002fe0014fe000042fe0006c01086000180
+0495000001fe0014fe000a820fb338f01087c70f9e0495000002fe0014fe000a4219b318c010866319b30495000001fe0014fe000a8219b318c010866319bf0495000002fe0014fe000a4219b318c010866319b00495000001fe0014fe000a820fb318cc10866319b30495000002fe0014fe000a42019f7e7810866fcf9e04
+95000001fe0014fe000682018000001080fe00000495000002fe0014fe000642018000001080fe00000495000001fe0013fe000082fd00011080fe00000495000002fe0013fe000042fd00011080fe00000495000001fe0012fe000083fdff00f0fdff00fc95000002fe000afe0000408b000001fe000afe0000808b000002
+fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe000efe000080f10000019cff0082fe000efe000040f10000019c000081fe0014fe000080f1000001d000fdaa00a8d20000
+82fe0016fe000040f1000001d1000001fd550050d2000081fe0014fe000080f1000001d000fdaa00a8d2000082fe0016fe000040f1000001d1000001fd550050d2000081fe0014fe000080f1000001d000fdaa00a8d2000082fe0016fe000040f1000001d1000001fd550050d2000081fe0021fe000080fe0009079f80000c
+f003c0000cfe000001d000fdaa00a8d2000082fe0023fe000040fe00090cc180001d980660001cfe000001d1000001fd550050d2000081fe0020fe000080fd0008c180003d800660002cfe000001d000fdaa00a8d2000082fe0022fe000040fd0008c3003c6d81e6600f0cfe000001d1000001fd550050d2000081fe0021fe
+000d80000007e3830006cdf333e0198cfe000001d000fdaa00a8d2000082fe0023fe000d40000007e0c6003ecd9b00600c0cfe000001d1000001fd550050d2000081fe001afe000080fd0008c60066fd9b0060030cfe0000019c000082fe001bfe000040fe00090ccc00660d9b3663198cfe0000019cff0081fe001bfe0000
+80fe0009078c003e0cf1e3c78f3ffe0000019cff0082fe0012fe000040f7000003fc0000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe0070fe000080f1000301f00002fe0006e0000001000002fe00
+1e8000003800003e0001f00000010000e000000e000007c00007000001f0001cfe000610000001000002fe00247000000e00000380007c00007000001c000004000020000003e00007000004000020000007fe00011c82fe0070fe000040f1002f014000050000011000000280000500000140000044000008000040000002
+800110000011000001000008800000400022fe000628000002800005fe00158800001100000440001000008800002200000a000050fe001080000880000a0000500000088000002281fe0070fe000080f1000601400008800001fe001504400008800002200000400000080000400000044001fe000610000001000008fe00
+02400020fe001f4400000440000880000080000010000004000010000080000020000011000088fe000b800008000011000088000008fe00012082fe0075fe0001401ffaff00f8fa000601400008800001fe001504400008800002200000400000080000400000044001fe000610000001000008fe0002400020fe001f4400
+000440000880000080000010000004000010000080000020000011000088fe000b800008000011000088000008fe00012081fe0075fe00018010fa000008fa000601400008800001fe001504400008800002200000400000080000400000044001fe000610000001000008fe0002400020fe001f4400000440000880000080
+000010000004000010000080000020000011000088fe000b800008000011000088000008fe00012082fe0075fe00014010fa000008fa00060140000f800001fe001507c0000f800003e000004c000008000040000007c001fe000c10000001000009800000400020fe001f7c000007c0000f80000080000013000004c00010
+00009800002000001f0000f8fe001080000980001f0000f80000098000002081fe0075fe00018010fa000008fa000601400008800001fe001504400008800002200000440000080000400000044001fe000c10000001000008800000400020fe001f4400000440000880000080000011000004400010000088000020000011
+000088fe00108000088000110000880000088000002082fe0075fe00014010fa000008fa002f014000088000011000000440000880000220000044000008000040000004400110000011000001000008800000400022fe001f4400000440000880000088000011000004400010000088000022000011000088fe0010800008
+8000110000880000088000002281fe0079fe0005801078000380fe000008fa002901400008800000e0000004400008800002200000380000080000400000044000e000000e000001000007fe000240001cfe001f440000044000088000007000000e00000380001000007000001c000011000088fe000b8000070000110000
+88000007fe00011c82fe0017fe00054010cc000180fe000008fa0000019c000081fe0017fe00058010c0000180fe000008fa0000019c000082fe0017fe00094010c0f1e18780337c08fa0000019c000081fe0017fe000980107998318cc0336608fa0000019cff0082fe0017fe000940100d81f18fc0336608fa0000019cff
+0081fe0017fe000980100d83318c00336608fa0000019c000082fe0017fe00094010cd9b318cc0337c08fa0000019c000081fe0017fe0009801078f1f7e7801f6008fa0000019c000082fe0014fe00014010fb00016008fa0000019c000081fe0014fe00018010fb00016008fa0000019c000082fe0013fe00014010fa0000
+08fa0000019c000081fe0013fe00018010fa000008fa0000019c000082fe0013fe0001401ffaff00f8fa0000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe0013fe0001801ff8ff00e0fc0000019c00
+0082fe0013fe00014010f8000020fc0000019c000081fe0013fe00018010f8000020fc0000019c000082fe0013fe00014010f8000020fc0000019c000081fe0013fe00018010f8000020fc0000019c000082fe0019fe00074010780003800003fe000020fc0000019c000081fe0019fe00078010cc0001800003fe000020fc
+0000019c000082fe0019fe00074010c00001800003fe000020fc0000019c000081fe0019fe000b8010c0f1e187801f3ccdf020fc0000019c000082fe0019fe000b40107998318cc03366cd9820fc0000019c000081fe0019fe000b80100d81f18fc03366cd9820fc0000019c000082fe0019fe000b40100d83318c003366fd
+9820fc0000019c000081fe0019fe000b8010cd9b318cc03366fd9820fc0000019c000082fe0019fe000b401078f1f7e7801f3c499820fc0000019c000081fe0013fe00018010f8000020fc0000019c000082fe0013fe00014010f8000020fc0000019c000081fe0013fe00018010f8000020fc0000019c000082fe0013fe00
+014010f8000020fc0000019c000081fe0013fe0001801ff8ff00e0fc0000019c000082fe000efe000040f10000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe0012fe000080f1000001dc000006c2000082fe0018fe0001401ffdff0080f7000001dc00010180c3000081fe0013
+fe00018010fd000080f70000019c000082fe0017fe00014010fd000080f7000001dc000040c2000081fe001bfe00018010fd000080f7000001dc000040e800000cdc000082fe001bfe00014010fd000080f7000001dc000040e7000080dd000081fe001dfe000680100000c00080f7000001dc00018020e8000080dd000082
+fe001cfe000640100000c60080f7000001db000010e9000020dc000081fe001cfe000680100000060080f7000001db000010e9000040dc000082fe001cfe000640107d99c78080f7000001db000010e8000010dd000081fe0018fe00068010cd98c60080f7000001c1000010dd000082fe001cfe00064010cd98c60080f700
+0001dd000002e7000080dc000081fe001cfe00068010cd98c60080f7000001dd000002e7000080dc000082fe001cfe000640107d98c66080f7000001dd000002e6000004dd000081fe001efe000680100cfbf3c080f7000001dd0002020004e8000004dd000082fe001cfe000240100cfe000080f7000001db000004ea0000
+01db000081fe001cfe000280100cfe000080f7000001db000004ea000001db000082fe001bfe00014010fd000080f7000001db000004e8000001dd000081fe001ffe00018010fd000080f7000001c1000001fd000040f3000061f1000082fe0026fe0001401ffdff0080f7000001dd000008e8000002fb000030f300018080
+f40002014081fe0022fe000080f1000001dd000008e8000002fb000008f40002010080f40002041082fe001afe000040f1000001dd000008e5000080ee000040f2000081fe001afe000080f1000001dd0002080001e7000040e00002100482fe001afe000040f1000001db000001ea000004fc000002e1000081fe001cfe00
+0080f1000001db000001ea000004fc000002e30002400282fe001efe000040f1000001db000001e700042000000202f4000010f0000081fe001efe000080f1000001c000041000000401f40002100020f40002800282fe0024fe000040f1000001ee00004cf1000020e8000008fb000001f40002200020f2000081fe0028fe
+000080f100010180ef000080f1000020e8000008fb000001f40002200010f5000301000182fe001cfe000040f100010140de000020e500010332ef000010f2000081fe0020fe000080f1000001ee000001f1000320000080e7000001e2000302000182fe0020fe000040f1000001ef0002020080ef000080eb000010fc0000
+08e1000081fe0021fe000080f1000001ef000002ed000080eb000010fc000008e4000304000082fe0021fe000040f1000001e2000040fa000080e6000380100080f5000040f0000081fe002cfe000080f100010110ee000020f800010110f9000003e7000340100080f50002800008f5000304000082fe002bfe000040f100
+010108f00002040020f2000040fd000020ed000020fa000080f50002800008f2000081fe0031fe000080f100010108f0000008f600010404fd000040fd000010ed000020fa000040f50002800004f5000308000082fe001ffe000040f100010104de000040fe000010e7000020f0000004f2000081fe0027fe000080f10000
+01ed000008f800010801fd0005800000401002e8000010e3000310000082fe0022fe000040f1000001ef0002100008ef0002400002ed000040fc000020e1000081fe002bfe000080f1000001ef000020f60002100040fb000040f70000fcf6000040fc000020e4000310000082fe0029fe000040f1000001e10002100028fd
+00014040f900010102f1000308400020f6000002ef000081fe0037fe000080f100010102ee000002f8000020fe000002fc0002400080fb00010202f1000308400020f6000302000002f5000320000082fe0039fe000040f100010102f000044000020070f800040800800080fc0002800040fd00010201f6000040fa000020
+f6000304000002f2000081fe003ffe000080f100010102f0000040fe000080fa000620000202010080fa000020fd0002040080f7000080fa000020f60005040000020006f7000340000082fe0032fe000040f100010102ea000001f90002880001fd00048000000810fd0002040040f2000004f0000301002040f5000081fe
+003bfe000080f1000001ed00050100010000c0fd0005400000200081fe0005208000200808fd0002080020f2000002ee00012020f8000340000082fe002ffe000040f1000001ef00078000010000c06020f400042000001010fc0002100010f7000080fc000080e1000081fe0040fe000080f1000001f500000efc000080fd
+00012180fc000080fd000040fe000020fe000010fc0002100010f8000001fb000080fb000014eb000380000082fe003ffe000040f1000001f5000311000003fc000004f0000011f80002200008f2000302800008fd00034000001cfe000008fd00018004fe000101e0fa000081fe0057fe000080f10002010080fe00000afb
+00042080000c80fe00010108fa000001fc000020fe000301000008fb0002200004f2000b018000080060000002000022fe000010fe0007808002003c000218fd000380000082fe0053fe000040f10002010080fd000080fc0005204000306001fe000088f4000002fb0002080002fd0002400004f8000002f9000008fd0007
+8000004100004010fe000080fe000342000406fe000080fe000081fe005bfe000080f100070100800e00002020fc0005404000401001fe000010fe000004fe000002fc000024f9000002fd0002400002f8000002f9000c0802060000010000808000b010fe000080fe00074100080100000c41fe000082fe0051fe000040f1
+00070100400980008010fc00044020004008f9000002f8000004fe000002fe00018002fd0002800001f2000001fe000a0201000200000100400108fd00078200010080801001fa000081fe005afe000080f100040100001040f900048010008004fd000040fe000002fe000002fc000014fe0005140000048001fe000001fe
+0000c0f3000001fc0008800000400100200208fd0002020000fe80052000c0000009fe000082fe0057fe000040f10012010000202002000800000c0000800801000402fe000040fe000002f400040800000480fd000001fe00013ffcfa000004fc000001fb00070400000200200204f900078040400020004004fe000081fe
+0056fe000080f100040100004010fc0008130001000402000202f6000004fc000008fe000308000001fc000002fd000003fa000004fc000002fe000008fd0005200400100404fa0008010040800020008002fe000082fe004efe000040f10011010000400804000400002080010002040002fd000040f0000008f9000002fc
+0000c0f5000f02800004100020080000040010080220fe0008080000810021000010fb000081fe005cfe000080f100040100208008fc00074040020001180002fd000090fa000008fc000004fe000308000001fe0002540004fc000030f5000f02800002000010000020080010080120fe000b48000042001e000008000003
+fe000082fe005dfe000040f10012010020800408000100018040020000e0000104fe000090fb000007fb000010fb000601000041000008fc000008fc00014008f9000002fe0008200000080008100140fe000340000002fd000308010001fe000081fe005afe000080f100040100110004fd000302002004fd000401040000
+01fc0003400018c8fc000012f8000340010008fc000008fd0002011010f900010220fd0006101000081000c0fe000340000004fd000304020002fe000082fe005cfe000040f1000c01001100021000008004002004fd000001f8000340002020fc000020fe000910000002000024000010fc000004fd00010404fa000e4000
+00400004800000100004600040fe000350000024fd000002fb000081fe0061fe000080f100040100020001fd000318001008fd000001fd000004fd000320004030fc000d2102000014000004800020004020fc000502000e000010f9000040fd000906000008200003800040fe000310000018fd00070200000480000082fe
+005bfe000040f1000c01000200009000004020000808fc000090fe000004fd000320018010fb000c08800004000004800010000020fc00070100118000400120fc000008f8000040fd000020fb000008fd00070104000040000081fe005afe000080f10005010004000080fe000320000410fc000090f800020e0010fb0006
+a0200004000004fd00011040fb000680204001000020fc000008fe000680000800000880fd000020fb000008fd000301080008fe000082fe005afe000040f1000c010004000060000020c0000220fc000340000004fb0002f00008f8000022fc000320000480fb000640c02004000080fc000610200001000002fe000080fe
+00010120fe000340000018fd000001fb000081fe005cfe000080f1000c01000c000020000003000001c0fc00044000000802fd000303000028fa0002100040fe00006afd000080fb0002230020f8000910100001000011000005fd00010210fe000360000014fc000685000810000082fe0059fe000040f100080100080000
+60000014fd000018fd00046000001002fd000304000008fc000080fa0004c000880001fa00061c001010000040f9000001fc000002fd00010410fe0003202a8020fc000690000010000081fe005cfe000080f1000801000c00001000000cfd00018180fe000c6000001000000140000c000088fc0002800008fb0003820800
+01f8000010fe000040f9000601000020000004fd00010408fe000310800020fc0002b04010fe000082fe005cfe000040f1000801000c000090000011fe000001fc000020fc000604100008000004fd000001fd000980000021002804000280f900040820000040fb0002280002fe0002800008fc000008fe000390002042fc
+000040fc000081fe005cfe000080f10008010010000008000010fc000010fe00001efd001a8000040014000104000003c002000004009f800020100004000420f9000004f700008afd000380400012fc000004fe000382000841fc000660102004000082fe005dfe000040f1000901001000010800002040fe00050a080000
+0110fd000b401001002400000200000420fc00046140004410fe00010408f900040280000120fc000380000002fc000060fc000002fb000080fc000660000004000081fe0066fe000080f10008010020000008000040fe0008080001000001081802fe000a40005040001b0200000810fe00050200813c0080fd000008f800
+0302000001fe000c7800010100800a800100000181fd00012001fe000308000280fc0006e0044000007082fe0067fe000040f1001501004000010400008010000010208080000008061d40fd0008048001808100001008fd00040180838010fd00011002f9000003fe001d1006008407860402002440010010020001e00000
+4000c000012000010080fd00011001fe00018c81fe0064fe000080f100070100410000040001fa001d014000050000100019000001000804110000100400004001010000604004fe000010f700208000080081808380055002401e60000010020080015000800020000104000280bcfd000690000002010282fe006bfe0000
+40f1001501208100020200020004014000404004101904400404fd00170240101008800020026001b0000100001d00040001002001fc00220400000440001e090001070011fc0000e1200200000c0000040001000010000087fe0cfc00070208000001020281fe006efe000080f1002a01cd00800001000403870410400000
+18040212101000000200000c11804000400061c180060c0082000017fe00040100400004fd000051fe00152000a1820002008006038010912807c0003000200008fe000628000a0e01f040fd00010308fe0002040182fe0070fe000040f1002b0102c8a0040080040401d386810010100086020540010001001ff001850000
+40018020800803000c000060c0fe0021818001c300000755004000101009c044000480405800610590c01c3004c000001102fe00088680a4320000042020fe00060401e000040c81fe0070fe000080f100660102860800004008000004058006004181004101f0000fe881e00005500001a006000040100099d00040003001
+038082000200800091800010804009938030000b5520600018830022600c0700003040c080000205ea084000001810183d403404021800283082fe0070fe000040f10066013c28c20887a0300000cc05048184079c0100860e00b03e1200001820000011e80018202001c0200029000f00bc78020004100803401f87870780
+9f86078d103000518000060c00119003fde000c10021ea0008086c018000002005441038003a0c0700100081fe000efe000080f10000019cff0082fe000efe000040f10000019cff0081fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b00
+0002fe000afe0000408b000001fe000efe000080f10000019cff0082fe000efe000040f10000019c000081fe0014fe000080f1000001e000fbaa00a0c4000082fe0016fe000040f1000001e1000001fb550040c4000081fe0014fe000080f1000001e000fbaa00a0c4000082fe0016fe000040f1000001e1000001fb550040
+c4000081fe0014fe000080f1000001e000fbaa00a0c4000082fe0016fe000040f1000001e1000001fb550040c4000081fe0021fe000d80000001878f0000fc600060000cfe000001e000fbaa00a0c4000082fe0023fe000d400000038cd98000c0e000e0001cfe000001e1000001fb550040c4000081fe0021fe000d800000
+058cc18000c16001e0002cfe000001e000fbaa00a0c4000082fe0023fe000d4000000180c1803cf861e3600f0cfe000001e1000001fb550040c4000081fe0021fe000d800003f183870006cc633660198cfe000001e000fbaa00a0c4000082fe0023fe000d400003f18601803e0c6306600c0cfe000001e1000001fb550040
+c4000081fe001bfe000d800000018c0180660c6307e0030cfe0000019c000082fe001bfe000d400000018c198066cc633063198cfe0000019cff0081fe001bfe000d80000007efcf003e79f9e0678f3ffe0000019cff0082fe0012fe000040f7000003fc0000019c000081fe000efe000080f10000019c000082fe000efe00
+0040f10000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe006ffe000080f1000d01007c00001f0000200001c00010fe0017040000100000e0001f000007c002000001c00000e000007cfd001d3e00003800000400008000400080007000001c0000070003e0001c00001cfe0018
+40004000007c000e0000200000080001c000000e00000e0082fe006ffe000040f1000d0100100000040000500002200028fe00170a0000280001100004000001000500000220000110000010fd001d0800004400000a00014000a0014000880000220000088000800022000022fe0018a000a0000010001100005000001400
+02200000110000110081fe006ffe000080f1000d0100100000040000880002000044fe00131100004400010000040000010008800002000001fe000010fd003008000040000011000220011002200080000020000008000080002000002000000110011000001000100000880000220002fe0005100000100082fe0074fe00
+01401ffaff00f8fa000d0100100000040000880002000044fe00131100004400010000040000010008800002000001fe000010fd003008000040000011000220011002200080000020000008000080002000002000000110011000001000100000880000220002fe0005100000100081fe0075fe00018010fa000008fa0024
+0100100000040000880002000044001f001100004400010000040000010008800002000001fe0035100000f80008000040000011000220011002200080000020000008000080002000002000000110011000001000100000880000220002fe0005100000100082fe0074fe00014010fa000008fa000d0100100000040000f8
+000200007cfe00131f00007c0001300004000001000f800002000001fe000010fd00390800004000001f0003e001f003e000800000260000098000800026000020000001f001f000001000130000f800003e0002600000100000100081fe0074fe00018010fa000008fa000d0100100000040000880002000044fe00131100
+004400011000040000010008800002000001fe000010fd003908000040000011000220011002200080000022000008800080002200002000000110011000001000110000880000220002200000100000100082fe0074fe00014010fa000008fa000d0100100000040000880002200044fe0017110000440001100004000001
+000880000220000110000010fd003908000044000011000220011002200088000022000008800080002200002200000110011000001000110000880000220002200000110000110081fe0078fe0005801078000380fe000008fa000d0100100000040000880001c00044fe0017110000440000e000040000010008800001c0
+0000e0000010fd00390800003800001100022001100220007000001c000007000080001c00001c000001100110000010000e0000880000220001c000000e00000e0082fe0017fe00054010cc000180fe000008fa0000019c000081fe0017fe00058010c0000180fe000008fa0000019c000082fe0017fe00094010c0f1e187
+80337c08fa0000019c000081fe0017fe000980107998318cc0336608fa0000019cff0082fe0017fe000940100d81f18fc0336608fa0000019cff0081fe0017fe000980100d83318c00336608fa0000019c000082fe0017fe00094010cd9b318cc0337c08fa0000019c000081fe0017fe0009801078f1f7e7801f6008fa0000
+019c000082fe0014fe00014010fb00016008fa0000019c000081fe0014fe00018010fb00016008fa0000019c000082fe0013fe00014010fa000008fa0000019c000081fe0013fe00018010fa000008fa0000019c000082fe0013fe0001401ffaff00f8fa0000019c000081fe000efe000080f10000019c000082fe000efe00
+0040f10000019c000081fe000efe000080f10000019c000082fe000efe000040f10000019c000081fe0013fe0001801ff8ff00e0fc0000019c000082fe0013fe00014010f8000020fc0000019c000081fe0013fe00018010f8000020fc0000019c000082fe0013fe00014010f8000020fc0000019c000081fe0013fe000180
+10f8000020fc0000019c000082fe0019fe00074010780003800003fe000020fc0000019c000081fe0019fe00078010cc0001800003fe000020fc0000019c000082fe0019fe00074010c00001800003fe000020fc0000019c000081fe0019fe000b8010c0f1e187801f3ccdf020fc0000019c000082fe0019fe000b40107998
+318cc03366cd9820fc0000019c000081fe0019fe000b80100d81f18fc03366cd9820fc0000019c000082fe0019fe000b40100d83318c003366fd9820fc0000019c000081fe0019fe000b8010cd9b318cc03366fd9820fc0000019c000082fe0019fe000b401078f1f7e7801f3c499820fc0000019c000081fe0013fe000180
+10f8000020fc0000019c000082fe0013fe00014010f8000020fc0000019c000081fe0017fe00018010f8000020fc000001c3000006db000082fe0013fe00014010f8000020fc0000019c000081fe001bfe0001801ff8ff00e0fc000001c3000010e0000040fd000082fe0017fe000040f1000001c300012040e1000090fd00
+0081fe0016fe000080f1000001c2000020e1000004fd000082fe0012fe000040f1000001a2000002fc000081fe0016fe000080f1000001c3000040e0000002fd000082fe001cfe0001401ffdff0080f7000001c300018010e2000004fc000081fe001ffe00018010fd000080f7000001da000080ea000008e1000001fd0000
+82fe001bfe00014010fd000080f7000001db000001c9000004fc000081fe001ffe00018010fd000080f7000001db000001ea000080e0000001fd000082fe0022fe00014010fd000080f7000001db00010208ec0002010004e2000008fc000081fe0020fe000680100000c00080f7000001da000004ea000002e0000080fe00
+0082fe001cfe000640100000c60080f7000001da000004ca000008fc000081fe0020fe000680100000060080f7000001da000004ec000002de000080fe000082fe001efe000640107d99c78080f7000001c40002020001e2000010fc000081fe0020fe00068010cd98c60080f7000001db000008e9000001e0000080fe0000
+82fe001cfe00064010cd98c60080f7000001db000010c9000010fc000081fe0020fe00068010cd98c60080f7000001db000010eb000002de000040fe000082fe0024fe000640107d98c66080f7000001db00011001ec000304000080e3000020fc000081fe0024fe000680100cfbf3c080f7000001e0000080fc000001e900
+0040e1000040fe000082fe0021fe000240100cfe000080f7000001e100010220fc000001ca000020fc000081fe0020fe000280100cfe000080f7000001da000001ec000004de000020fe000082fe0023fe00014010fd000080f7000001e100010408e6000304000020e3000020fc000081fe001ffe00018010fd000080f700
+0001db000040e8000020e1000020fe000082fe0020fe0001401ffdff0080f7000001e100010804fd000040c9000040fc000081fe001efe000080f1000001db000080eb000008ed000080f3000020fe000082fe002cfe000040f1000001fd000060e600011002fd0002800080ed000308000010f1000001f4000040fc000081
+fe002dfe000080f1000001fd000018e40002800280fe000080ea00040800000380f500010208f3000010fe000082fe002bfe000040f1000001eb000080f800042000000420fe000080e7000004f400010408f5000040fc000081fe002bfe000080f1000001fe000008ef000040f600014010fd000040ed000010ed000004f3
+000010fe000082fe0029fe000040f1000001fe000010e500042000100010e9000310000002f0000004f5000080fc000081fe0026fe000080f1000001fe000020f000010408f6000305400001e7000001e1000010fe000082fe002cfe000040f1000001fe00014001f100010404f8000020fe00011002e6000320000010e700
+0080fc000081fe0029fe000080f1000001fc000080e2000002ea000010fe000310001010f5000020f2000008fe000082fe002ffe000040f1000001fc000080e7000040fe00040802000020ed000020fc00012010f5000020f5000001fb000081fe002bfe000080f1000001fc000080f200011002f0000020e9000302002010
+f500014002f3000008fe000082fe002efe000040f1000001ec00012002f8000040fe000004fe000020e90002020020f400014002f6000001fb000081fe0021fe000080f1000301000004dc000020ed000020ed000001f3000004fe000082fe0029fe000040f1000301000008e4000080fe000004fd000003ee000020ed0000
+01f6000001fb000081fe0027fe000080f1000301000030ef00014001f3000004fe00010820ea000080e3000004fe000082fe0032fe000040f1000201001cfe000020f20002400080fa000001fd00010204fe00011010ea0002800004e8000001fb000081fe002dfe000080f10002010010fe000020e2000008fb000010f100
+0040fc00018004f6000001f1000004fe000082fe0033fe000040f1000001fc000010e8000001fd0008020800001000040028f1000040fc00018004f6000002f4000002fb000081fe0031fe000080f1000001fc000010f20002800080f100041040040040ec0002410004f6000302000080f4000004fe000082fe0031fe0000
+40f1000001ed000301000040fa000001fd000002fe00011080e9000041f4000302000080f7000002fb000081fe001efe000080f1000001d9000010ed000080ec000080f4000002fe000082fe002efe000040f10002010080e4000002fd000001fc0002010001fc000004f7000080ec000040f7000002fb000081fe0032fe00
+0080f100010101ee000301000040f40008100000010001000080fd00000bf2000020f0000003f5000002fe000082fe003afe000040f100010101fd000008f3000302000020fa000002fd000901100000010000020080fd00011080f30002100002e8000002fb000081fe003dfe000080f100010101fd000004ef000002f500
+0010fc0002020080fd00012040f8000080fd0002020002f6000008fe00011020f600040200080082fe0042fe000040f1000001fc000004ef000001fb000004fc0006a0000004000084fb00012020f9000001fc0002020002f6000008fe00012010f9000004fd0002200081fe0042fe000080f1000001fc000004f700000efe
+00070200001000c003c0f5000306000084fb00014010f30002120001fd000008fb000310000020f400040100020082fe0046fe000040f1000001f7000040fc000009fe0007040000104000c002fe000008fc000340000006f800018010f3000014fb000002fb000310000020f7000004fd0002400081fe0042fe000080f100
+0001f7000010fc000310800070fd000340030001f5000004fb00042a00008008f9000001f5000020fc000060fe0002204004f600040100010082fe004efe000040f100010104f900010104fc000320400088fd000080fe000380000008fc000040fc0008400020008000010004f9000001f5000001fc000090fe0002204004
+f9000008fd0002800081fe0049fe000080f100010108f20008202000840800000880fe000040f9000f4000000400002000200000800100020ef4000008fe00030c000040fc000090f8000103e0fa000380010082fe0057fe000040f100010108fd000002fe00010201fc00074020010208000008fa000008fc000f60000004
+000020001001000002000231f400040800008040fe000080fe00010108f800010c10fe000008fe000301000081fe0051fe000080f100010108fd000002fa000670000040100202f0000040fc000a200010000040020001c0c0fb000002fc00070800008040c00080fd0002020840fe00018001fe0001700cfa000340008082
+fe005efe000040f1000001fc000001fe000a0400400188000080080401f6000010fc000ca0000002000060000002000004fe000040fb000002fc000310000080fd000040fe0010020440000001000100c000800200026008fe000302000081fe0056fe000080f1000001fc000001fb000b020600008008040110000008f100
+030a000050fd00011004fe000020fc000050fb000614000080000001fe00100180040440000010000001300080020004fc000340004082fe005cfe000040f1000001f8000e080020020100010004080090000004fd000308000010fc000310000009fc000304000008fe000020f5001c140000010008000020000240040240
+0000100000020803000100000410fe000304000081fe0053fe000080f1000001f5000b0400c0010004100080000004fd000008f6000001fa00010808fe000010fd0002010804f9000301000402fe00030c200802fe000a1200008204040001000002fd000310002082fe0057fe000040f100010140f9000e10000808002002
+0002200080000008fd000304000020fc000008fd00077810000804000010fe000010fb000004f40005200030100802fe000b120000840218000080100010fb000081fe0057fe000080f100010140f6000b100010020001c0006000000afd0001041cfb000001fe000910000008000400000220fe000008fd00010202fb0000
+02fc00070400001fc0081002fb00060801e000008010fa00012082fe005afe000040f100010180fc00098000001000041000080cfd000340000002fc0002620040fd0005020400001004fe00040408000020fe000008f5000d0200004400020000102000081001fb000010fd000340000090fb000081fe005dfe000080f100
+010180fc000040fc000320000410fd000040f900018180fc000002fd000008fe000402000001c0fe000004fd0002020108fc000d4000004400010400004000042001fe000304000030fd000340000080fc00011082fe005efe000040f1000001fb000941c000200002200003e0fd000040f90002804080fd00010204fe0007
+3000040000100001fd000304000001fe000008fc000340000040fd000c08800002200180000004000020fd000340400020fb000081fe005afe000080f1000001fb00014220fd000040fa000060fe000080fe0002010020f8000320000004fd00010e40fe000704000006c0040080fc0014810000400000080000800001c002
+80000008000020fd00014040fa00010982fe005bfe000040f1000001fb000604180040000140fa0004a000002080fe0002030011fb000302000020fc0002200010fd00050201f0082004fa0008810000080000800009fd00070285000008000040fd000320000020fc00010181fe0052fe000080f1000001fb00010804fd00
+0080fd000608000010000020fd0002030008f000012020fe00070202081010000020f90005080000900001fc0006d0400008000090fd000320000020fc00010682fe0058fe000040f100010180fc000608040080000080fd000602000010000040fd0002050006fb000001fc0005020001200040fd0007010206e010080020
+f4000006fd0007014000000c000090fd000310800040fc00010281fe005dfe000080f100010180fc00011003fe000001fc00072000011000004040fe0002048002fc000004fe000940000001000080004010fd00068c010008000010fb000080fd0002100004fd0002044020fe000001fc000011f900010282fe005cfe0000
+40f100010140fc000610008080000140fd000301000108fe000040fe0002040006fc00040800800040fc0002c00080fc00049000000810f900074000100000200006fd00011040fd000003fc000310000048fc00010481fe005afe000080f100010140fc0005300080000002fc000340000008fa0002080001fc000008f800
+0340010008fd000660000004000044fd000004fe0005100000200008fd00070820100010000408fd000310000008fc00010582fe005efe000040f100010140fc0008300041000002200001fe0002400008fc0004e000080009fe0004c000100040fc000380004002f8000304100080fd000008fe000010fd000009fd000748
+20000010000408fd00030a000080fc00010881fe005ffe000080f100010140fc000e480020000004000007400080000404fe000a2000940010000080000130fc000080fe0005800000020004fa000302000002fd000910202000100000400010fd000610100800020008fc00000af900010882fe0060fe000040f100010120
+fc001c4000120000042000040f80002004040002001001080010202040000208fe00021000e0fc00018004f800010220fb000a1008200040000010002088fe0006901000000a0010fc000308000104fd0002c01081fe0062fe000080f100010120fc000a8000100000080000081819fe000e0200020000024a002020004000
+0208fd00010110fb0002040002fa000301000601fc00098200007000008800c022fd000610040022002004fd000304000002fe000301201082fe0064fe000040f100010120fc000680000e00000810fe00120600100002000c005004040020208020000404fe0002080210fe000340011008f90002028140fa0003f0000048
+fe000d0700008000020008000022002002fd000a0c000200f8000002182081fe006afe000080f100010110fd002301000001c000100000208803000010020301f8080484804092001000040400800000050cfe000620001010000080fc0011082080200080003800030910008600010038fd00070200080400808040fc000a
+0a00080104000004044082fe006afe000040f100010110fe002b3d020300083000100800010000800410010e8e07040802004228000800080200800004060200e00000020990f9001820088040180000c6010204480185000005c0000040001c0008fd0002800020fe000a0200a00104000008038081fe006ffe000080f100
+240108000400c702064000080020000040040410000001003001810a02408800000800100103fe00210802013a001c0fdc300000400a000a003800024200e72001010206020002848002fefd000a0610000402024021000188fe000a0102000683000008000082fe006ffe000040f10023010400114110cc0100110f002002
+01980000080280008810044010010080000006001001fe002101100bae848073f4a0680078000080388040c0003c0100800100c40c012004024007fd000b1008000004cc008002000302fe000a23a8800800800010000181fe0070fe000080f10048011500400200701118420a08c000060002080801aa8160680040e400a1
+20000001006003c00000cc240088022088091c800800602000c070002001900607f057552410010408022018fe000c0e00114000020084008c0009e0fe000a688371c800800321c00082fe0070fe000040f10055010251001dd90441858d919f8001098000a007980026a38008302800ce800fcc03c0cc8043fc1e00e8a070
+010301980301f00018802000080b99980ff800150800505000ca12a01620009000101e140000050090004afe000d81e000805cec31207f1070200081fe000efe000080f10000019cff0082fe000efe000040f10000019cff0081fe000afe0000808b000002fe000afe0000408b000001fe000afe0000808b000002fe000afe
+0000408b000001fe000afe0000808b000002fe0006fe008955fe0006fe0089aafe000283000283000283000afe000002b5aa00a8d4000afe000005b5550050d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b500
+0010d4000afe000002b5000008d4000afe000004b5000010d40015fe00020200f0fe00053000cc600060c0000008d40015fe0002040198fe00053000cc600060c0000010d40015fe0002020180fe00053000cc000060c0000008d40017fe000d040181e3cf8f3e00cce3e3e79980c2000010d40017fe000d0200f3306cd9b3
+00cc63366cd980c2000008d40017fe000d04001bf3ec183300fc63366cd980c2000010d40017fe000d02001b066c183300fc63366cdf80c2000008d40017fe000d04019b366c19b300cc63366cdf80c2000010d40016fe000c0200f1e3ec0f330085fb33e789c1000008d4000afe000004b5000010d4000afe000002b50000
+08d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d40014fe000004f600007ff9ff00c3f9
+ffd2000010d40014fe000002f6000040f9000043f9ffd2000008d40014fe000004f6000040f9000043f9ffd2000010d40014fe000002f6000040f9000043f9ffd2000008d4001cfe000304001f0cfe00010180fe000040f9000043f9ffd2000010d40025fe00080200198c0000030180fe000343000018fe0003180043f8fc
+ff019fffd2000008d40026fe000604001980000003fe00050c0043000018fe0004180043f27ffdff019fffd2000010d40025fe000f0200199c7c78f3c3879f1e0043000018fe0003180043f3fcff019fffd2000008d40027fe001d0400198c66cd9b018cd98c0043e3c799b33cf8f9e043f3e183330c1c187fd2000010d400
+27fe001d0200198c60fd83018cd9800043306cdb3306cd9b3043e1cc9933e4c9933fd2000008d40027fe001d0400198c60c183018cd980004333ec1e333ec1998043f3cc9f3304f999ffd2000010d40027fe001d0200198c60cd9b318cd98c0043366c1e3f66c1986043f3cc9f0264f99e7fd2000008d40027fe001d04001f
+3f6078f1e7e7999e0043366cdb3f66c19b3043f3cc9f0264f9933fd2000010d40021fe000002f800130c0043e3e799923ec0f9e043f3e19fb704fc187fd2000008d40014fe000004f6000040f9000043f9ffd2000010d40014fe000002f6000040f9000043f9ffd2000008d40014fe000004f6000040f9000043f9ffd20000
+10d40014fe000002f6000040f9000043f9ffd2000008d40014fe000004f600007ff9ff00c3f9ffd2000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004
+b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4002afe000002f600007ffaff00e1f6ff01f87ffaff00e1f8ff01fe1ffaff01f87ffbff00f0faff01e008d4002bfe000004f6000040fa000021f600010840fa000021f8ff01fe10fa00010840fb00
+011080fb00012010d4002bfe000002f6000040fa000021f600010840fa000021f8ff01fe10fa00010840fb00011080fb00012008d4002bfe000004f6000040fa000021f600010840fa000021f8ff01fe10fa00010840fb00011080fb00012010d40033fe000202000ffe000203000cfe000040fa000021f600010840fa0000
+21f8ff01fe10fa00010840fb00011080fb00012008d4004ffe000804001980000003000cfe001943e0000600180000210f8000063000000cc000000843f000003ffe000221f87ffdff05fe7ffffe1078fb00110843e000181c00001083c0001c1800002010d40050fe0002020018fe001f03000c000c004330000630180000
+210cc000063000000cc000000840c000000cfe000c21f33fffff3ffcfe7ffffe10ccfb001108433000180c0000108660000c18c0002008d40050fe00100400181e3cf8f3e00f999e004330000030fe0004210cc00006fe00090ec000000840c000000cfe000621f33fffff3ffcfeff02fe10c0fb001108433000180c000010
+8660000c00c0002010d40053fe004d02000f3306cd9b300cd98c004333c78e3c3879f0210ccf1e3e71f1d00ecf363c0840c3c7400c66f8f021f320c1c30f0c3c7860fe10c0f1f6679f1e3c084337c79f0c3cd810866ccf0c38f1982008d40053fe004d040001bf3ec183300cd9800043e66cc63018cd98210f998366319b30
+0fc1bf660840c06cc00c66cd9821f0264c993fe4fe73267e10799b366cd9b3660843e66cd98c66fc10866cc18c18c1982010d40053fe004d020001b066c183300cd98000430666063018cd98210f1f9f66319b300dcfbf7e0840c3ecc00c66cdf821f3264c993f04fe73267e100dfb366fd9b07e0843060cd98c7efc10866c
+cf8c18c1982008d40053fe004d040019b366c19b300ccf8c00430661863018cd98210d9833663199e00dd9b3600840c667800c66cd8021f3264c993e64fe73267e100d83366c19b0600843060cd98c60cc10876cd98c18c1982010d40053fe004d02000f1e3ec0f3300f819e0043066cc63318cd98210cd9b366319b000cd9
+b3660840c66c000c3ef99821f3264c993264ce73267e10cd99f66cd9b3660843060cd98c66cc1086ecd98c18ccf82008d4004efe000004f90044198c004303c79f9e7e7998210c4f1f3efd99e00ccfb33c0840c3e7800c06c0f021f3264cc387061818667e1078f033e7999e3c084306079f3f3ccc1083c7cfbf7e78182010
+d4003dfe000002f900030f000040fa000021fc00010330fd00090840000cc00066c00021f8ff04fe10000030fd00010840fb0002108060fe000301982008d40038fe000004f6000040fa000021fc000101e0fd00090840000780003cc00021f8ff04fe10000030fd00010840fb00011080fc0002f02010d4002bfe000002f6
+000040fa000021f600010840fa000021f8ff01fe10fa00010840fb00011080fb00012008d4002bfe000004f6000040fa000021f600010840fa000021f8ff01fe10fa00010840fb00011080fb00012010d4002afe000002f600007ffaff00e1f6ff01f87ffaff00e1f8ff01fe1ffaff01f87ffbff00f0faff01e008d4000afe
+000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b50000
+08d40015fe000004f600007ff6ff01fe1ff5ffda000010d40017fe000002f6000040f600010210f6000001da000008d40017fe000004f6000040f600010210f6000001da000010d40017fe000002f6000040f600010210f6000001da000008d4001efe000004fd000301980380fe000040f600010210f6000001da000010d4
+001ffe000002fd000301980180fe000040f600020210fcf7000001da000008d40020fe000004fd000701980180000c0040f60002021030f800010c01da000010d40022fe000002fc000691e18ccf1e0040f60005021030000003fb00010c01da000008d40025fe000004fc000690318cd98c0040f6000d0210319be3c7801e
+3cd9b1e7cf01da000010d40025fe000002fc0006f1f18cdf800040f6000d0210319b3663003366fdfb366c01da000008d40025fe000004fc000663318cd8000040f6000d0210319b37e0003066fdfbf66c01da000010d40025fe000002fc000663318cd98c0040f6000d0210319b3600003066cd9b066c01da000008d40025
+fe000004fc000661f7e7cf1e0040f6000d021030fbe663003366cd9b366cc1da000010d40021fe000002f800020c0040f6000d0210301b03c7801e3ccd99e66781da000008d4001bfe000004f6000058f600050210019b0003fa000001da000010d40019fe000002f600007cf60003021000f3f8000001da000008d40017fe
+000004f6000066f600010210f6000001da000010d40017fe000002f6000040f600010210f6000001da000008d40015fe000004f600007ff6ff01fe1ff5ffda000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe0000
+04b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d40012fe00010201fbff03e1fffffec0000008d40012fe00010401fb000321000002c0000010d40012fe00010201fb000321000002c00000
+08d40012fe00010401fb000321000002c0000010d40012fe00010201fb000321000002c0000008d40014fe00010401fd0005018021001802c0000010d40014fe00010201fd0005018021001802c0000008d40014fe00010401fd0005018021001802c0000010d40015fe000b0201078f1e7c79f021079982c0000008d40015
+fe000b04010cd98366cd98210cdb02c0000010d40015fe000b0201061f9f60c198210cde02c0000008d40015fe000b040101983360c198210cde02c0000010d40015fe000b02010cd9b360cd98210cdb02c0000008d40015fe000b0401078f1f60799821079982c0000010d40012fe00010201fb000321000002c0000008d4
+0012fe00010401fb000321000002c0000010d40012fe00010201fb000321000002c0000008d40012fe00010401fb000321000002c0000010d40012fe00010201fbff03e1fffffec0000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000004b5
+000010d4000afe000002b5000008d4000afe000004b5000010d4000afe000002b5000008d4000afe000005b5550050d4000afe000002b5aa00a8d400028300028300028300028300028300028300028300028300028300028300028300a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.3\tab A typical display from the contig editor in XBAP\par 
+\pard\plain \s4\qj\sb160\sa120\sl280 \f20 The four scroll buttons operate as follows\:\par 
+\pard \s4\qj\li1720\sa120\sl280\tx4520 "<<"\tab Scroll left half a screenful\par 
+"<"\tab Scroll left one character\par 
+">"\tab Scroll right one character\par 
+">>"\tab Scroll right half a screenful\par 
+\pard \s4\qj\sa120\sl280 
+The Editor cursor can be positioned anywhere in the edit window by moving the mouse pointer over the character of interest, then pressing the left mouse button. The Editor cursor can also be moved by using the direction arrow keys.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.2\tab Editing operations \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The editor operates in two main edit modes - Replace
+ and Insert. Replace allows a character to be replaced by another. Insert allows characters to be inserted into a reading. Characters are entered by typing them from the keyboard. Only valid characters are permitted. Characters can be deleted by positionin
+g the cursor one character to their right, then pressing the delete key. Normally Insert and Delete apply to the consensus line of the contig only. This restraint can be overridden by using the "Super Edit" mode of operation, though it should be employed w
+ith caution as misuse may corrupt alignments.\par 
+
+Edits can also be performed on the consensus, though they are restricted to insertion and deletion of padding characters ("*"). These edits also have special meanings. A deletion will delete all characters at the position to the left of the cursor in the c
+ontig, and move the relative positions of all sequences starting to the right of the cursor position left one character. An insertion will insert the character typed ("*") into all gel reading sequences at the 
+cursors position in the contig, and move the relative positions of all sequences starting to the right of the cursor position right one character.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.3\tab Use of buttons \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The effect of the last edit can be undone by pressing the "Undo" button at the top of the editor window. Pressing it n times will undo the last n edits.\par 
+\pard \s4\qj\sa120\sl280 The cursor will automatically be positioned at the next problem when the "Find Next Problem" button is selected. The next problem is where the consensus shows either a disagreement ("-") or a pad ("*") character.\par 
+\pard \s4\qj\sa120\sl280 The edits to the contig can be saved by pressing the "Leave Editor" button and replying "Yes" to the prompt to "Save changes?".\par 
+As no changes are made to the working copy of the database until this point it is possible to abort the editor if the edit session ends up in an unsatisfactory state.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.4\tab Displaying traces for readings from fluorescent sequencing machines\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The original trace data from which the gel reading sequences were derived can be seen by double clicking (two quick clic
+ks) with the middle mouse button on the area of interest. The trace will be displayed with the point clicked at the centre of the trace viewport. All traces that are displayed are maintained in one window, which will display a maximum of four traces. When 
+four traces are already being displayed and a new one is requested, the one at the top of the window is removed and the new one is added to the bottom. Traces can be removed individually by using the "quit" button in the panel next to the trace. \par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.5\tab Extending reads with the unused data\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Sequence data from fluorescent sequencing machines is normally clipped to remove the primer region and the poor quality data from the 3' end is marked to be ignored during assembly. Only the sequence used during assembly is made visible in the XBAP editor.
+ However the unused data is copied into the database and can be viewed from within the editor. Also the position of this "cutoff" can be altered. To display the unused sequences, press the "Display Cutoff" button at the to
+p of the editor window. The cutoff sequence appears in grey. This sequence can be incorporated into the editable sequence, by moving the cutoff position. This is done by positioning the cursor at the end of the sequence, and using Meta-Left-Arrow and Meta-
+Right-Arrow to adjust the point of cutoff. The Meta key is a diamond on the Sun keyboard.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.6\tab Using the pop-up menu\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 A pop-up menu is revealed by depressing the "Control" key on the keyboard and at the same time pressing the left mouse button.\par 
+\pard \s4\qj\sa120\sl280 The menu has the following functions\:\par 
+\pard\plain \li1880\sl220 \f4\fs16 Find Next Problem\par 
+Highlight Disagreements\par 
+Save Contig\par 
+Create Tag\par 
+Edit Tag\par 
+Delete Tag\par 
+Search\par 
+Select Oligo\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 \par 
+\pard \s4\qj\sa120\sl280 "Find Next Problem" and "Save Contig" are described above. Operations on tags are described in the section on annotation below, and then searching is outlined.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.7\tab Annotating readings\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Parts of a sequence can be annotated to record the positions of primers used for walking, or to mark sites, such as compressions, that have caused problems during sequencing. The annotations ar
+e termed "tags".  Each tag has a type such as "primer", a position, a length and a comment. Each type has an associated colour that will be shown on the display. First the segment to tag is selected, then it is annotated. The consensus sequence cannot be a
+nnotated.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.8\tab Creating a new annotation\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Use the left mouse button to position the start of the selection. While this button is being held down, move the mouse to the other end of the segment. The selection can be extended further using the right mouse bu
+tton. To create the annotation, invoke the pop-up menu, and select the "Create Tag" function. A small "tag editor" will appear which allows users to select the type of the annotation from a pull-down menu, and specify a comment if desired. To select a new 
+type pull down the Type menu, and select the entry desired. To enter a comment, simply type into the text window in the tag editor. The annotation is created when the "Leave" button on the tag editor is pressed, and is displayed in the colour defined in th
+e tag database file (TAGDB).\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.6.9\tab Editing an existing annotation\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Position the cursor with the left mouse button on the tag, and select the "Edit Tag" off the pop-up menu. This invokes the tag editor, and changes to the type and comment of the annotation can be made. The tag is updated when the "Leave" button is pressed.
+\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1180 \b\f20 2.6.10\tab Deleting an annotation\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 To delete an existing annotation, position the cursor with the left mouse button on the tag, and select the "Delete Tag" off the pop-up menu.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1160 \b\f20 2.6.11\tab Searching\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Selecting "Search" brings up a window which can remain present during normal editor operation. The window allows the user to select the direction of search, the type of search and a value to search on. The value is entered into a value text window, then pr
+essing the "search" button performs the search. If successful, the cursor is positioned accordingly. An audible tone indicates failure. Pressing the "ok" button removes the search window. The search window is automatically removed when the contig editor is
+ exited. There are seven different search modes.\par 
+\pard\plain \s9\fi-560\li1120\sb180\sa60\sl280\tx1700 \b\f20 2.6.11.1\tab Search by position\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the numeric position specified in the value text window. Eg a value of "1234" causes the cursor to be placed at base number 1234 in the contig. Positioning withing a reading is achieved by prefixing the number with the "@" char
+acter, eg "@123" positions the cursor at base 123 of the sequence in which the cursor lies. Relative positions can be specified by prefixing the number with a plus or minus charac
+ter. Eg "+1234" will advance the cursor 1234 bases. If possible, the cursor is positioned within the same sequence. The direction buttons have no effect on the operation of "search by position".\par 
+\pard\plain \s9\fi-560\li1120\sb180\sa60\sl280\tx1720 \b\f20 2.6.11.2\tab Search by reading name\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the left end of the gel reading specified in the value text window. If the value is prefixed with a slash it is assumed to be a gel reading name. Otherwise it is assumed to be a gel reading number. Eg "123" positions the cursor
+ at the left end of gel readi
+ng number 123. "/a16a12.s1" positions at the start of reading a16a12.s1. If the value was "/a16" the cursor is positioned at the first reading which starts with "a16". The direction buttons have no effect on the operation of "search by reading name".
+\par 
+\pard\plain \s9\fi-560\li1120\sb180\sa60\sl280\tx1700 \b\f20 2.6.11.3\tab Search by tag type\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the start of the next tag which has the the same type as specified by the type value menu. To change the type, select from the menu that pops up when the mouse is clicked on the button labeled "Type\:". Th
+e search can be performed either forwards or backwards from the current cursor position. To find all tags, use "search by annotation", with a null text value string.\par 
+\pard\plain \s9\fi-560\li1120\sa60\sl280\tx1700 \b\f20 2.6.11.4\tab Search by annotation\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the start of the next tag which has a comment containing the string specified in the value text window. The search performed is a regular expression search, and certain characters have special meanings. Be careful when your val
+ue string contains ".", "*", "[", "^" or "$". The search can be performed either forwards or backwards from the current cursor position.\par 
+\pard\plain \s9\fi-560\li1120\sa60\sl280\tx1700 \b\f20 2.6.11.5\tab Search by sequence\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the start of the next piece of sequence that matches the value specified in the text value window. The search is for an exact match, which means that the case of the value string is important. The search is performed on the gel
+ readings themselves, rather than the consensus sequence. The search can be performed either forwards or backwards from the current cursor position.\par 
+\pard\plain \s9\fi-560\li1120\sa60\sl280\tx1700 \b\f20 2.6.11.6\tab Search by problem\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This positions the cursor at the next place in the consensus sequence which is not  "A", "C", "G" or "T". The search can be performed either forwards or backwards from the current cursor position.\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard\plain \s9\fi-560\li1120\sa60\sl280\tx1700 \b\f20 2.6.11.7\tab Search by quality\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This positions the cursor at the next place in the consensus sequence where the consensus for each strand is not  "A", "C", "G" or "T" or where the two strands disagree. The search can be performed either forwards or backwards from the current cursor posit
+ion.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 \par 
+2.7\tab Joining contigs interactively using XBAP\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The operation of the join editor in XBAP is very similar to the one for single contigs described above. It allows the user to align the ends of the two contigs by editing each contig separately. First specify which two contigs are to be joined. The program
+ checks that the two contig numbers are different (it will not allow circles to be formed!) The Join Editor consists of two Contig Editors in between which is sandwiched a disagreement box. This disagreement box
+ uses exclamation marks to denote mismatches between the two consensuses. A typical example is shown in figure 4.4. Here we see in the top window the right end of one contig and in the bottom window the left end of another. The left end of the overlap is c
+orrectly aligned, as indicated by an absense of exclamation marks, but the top contig has an extra character at position 558 which is spoiling the alignment over the next segment. Notice that the "lock" button is highlighted denoting that the user has aske
+d for the two contigs to scroll together.\par 
+\pard \s4\qj\sa120\sl280 The best strategy for joining is to align the leftmost character of the right contig with its counterpart in the left contig. Then press the \'d2Lock\'d3
+ button before editing the contigs to make them align for the whole overlap. The overlap must be of at least 
+one character. Use the scroll bar and the scroll buttons ("<<", "<", ">", and ">>") for positioning the relative positions of the two contigs. The join position can be fixed by pressing the "lock" button at the top
+ of the Join Editor. Locking allows the two contigs to be scrolled as one when using the scroll bar and buttons, the left ends always in the same position relative to each other. Once locked, it is best to proceed to the right along the contigs, inserting 
+padding characters ("*") into the consensuses to minimise the disagreements. It is important that the user aligns the two contigs throughout the whole region of overlap before completing the join because it is only at this stage that the two contigs can be
+ edited independently. If a join is completed leaving a region of mismatch the consensus will consist of dashes and the assembly function will fail to find overlaps in the bad section. Misaligned sections can be corrected using the "super edit" mode of the
+ editor. The join can be completed by pressing the "Leave Editor" button. The percentage mismatch is displayed, and users are required to confirm that they want to perform the join.\par 
+\pard\plain \li100\ri80\sb100\sl220\keepn\box\brsp20\brdrth \f4\fs16 {{\pict\macpict\picw441\pich144 
+4685ffffffff008f01b81101a0008201000affffffff008f01b80900000000000000003100000000008e01b798007c00000000014003db00000000014003db00000000008e01b7000102850002850026e600001ff9ff0087f8ff01f87ff5ff01fe1fefff0087fcff01fe1ffcff01f87ff2ff00e0f40026e600001ff9ff0084
+f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0084f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0084f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f4003701003cfa000203fc03fa0008630c18000181
+80001ff9ff0084f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f4005b010066fa0002030003fa000ac30c38000380c0001f807ffbff05841f8000003cfd0002087f87fbff07e01fe7fffffe107efc00030f000078fd00133c0f0000841860000600021f9ffffffe7ff84180fb00021fe018fc000020
+f400610100c3fe0008c01800000300030603fd000a01830c7800078060001ff3faff058418c000000cfd0002087f33fbff07e7ffe7fffffe1063fc0003030000ccfd001366198000841860000600021f9ffffffe7ff84180fb0002180018fc000020f400670100c0fe0008c01800000300030603fd000a01830cd8000d8060
+001ff3fcff07f9ff84186000000cfd0002087e79fbff08e7ffe7cfe7fe106180fd001b030001860006000066198000841860000600021f9ffffffe7ff84180fb00041800183018fe000020f400670100c0fe0000c0fe00040300030003fd000a0301989800098030001ff3fcff07f9ff84186000000cfd0002087e7ffbff08
+e7ffe7cfe7fe106180fd001b030001800006000060180000841860000600021f9ffffffe7ff84180fb00041800183018fe000020f400681c00c00f0dc3f0781f4003003b1e0fc0f0de000301981800018030001ff3fcff07f9ff84186000000cfd0002087e7ffbff08e7ffe7ffe7fe106180fd001b03000180000600006018
+0000841860000600021f9ffffffe7ff84180fb00041800180018fe000020f400726e00c0198e60c01831c003f0670603019873000301981800018030001ff3e47c0f8790e07f841861e1b80c0fc1f078087f3f9e647e1e43ffe7fe270f81fe1061878618783f03000180619f81e060180fc0841866e0761e021f9ff87e0e73
+f841801e0fc61878001801d8f07e0786f020f400726e00c030cc30c01831800300c30603030c60000300f01800018030001ff3e339e733c679ff8418c331cc0c186318cc087f879e633ccf19ffe07cc7cfe7fe10630cc618cc6183000180618603306018186084186730ce33021f9ff33ce667f8418033186618cc001f8338
+30180cc39820f400726e00c030cc30c01831800300c30603030c60000300f01800018030001ff3e799fe79cff9ff841f8619860c00660186087ff39e6799e73fffe7f9e7cfe7fe107e18633186018300018061860619f87e1800841866198661821f9fe799fe4ff84180618063318600180618301818630020f400726e00c0
+30cc30c01831800300c30603030c60000180f01800018060001ff3e79c0e01cff9ff841987f9860c0fe601fe087ff99e6798073fffe7f9e7cfe7fe10661fe331fe3f830001806186061860180fc0841866198661821f9fe799fe1ff841807f8fe331fe00180618301818630020f400726e00c330cc30c0181f000300c30603
+030c60000180601800018060001ff3e79fe67fcff9ff8418c601860c18660180087ff99e6799ff3fffe7f9e7cfe7fe10631801e18061830001806186061860180060841866198661821f9fe799fe0ff84180601861e18000180618301818630020f400726e0066198c30cc18300003006706033198600000c06018070180c0
+001ff3e79fe67fcff9ff8418c601860c18660180087e799e6799ff3fffe7f9e7cfe7fe10631801e18061830001866186061860180060841866198661821f9fe799fe47f84180601861e18000180618301818630020f400726e003c0f0c3078ff1f8003fc3b3fc1e0f06000006060ff070ff180001ff3e799e739cff99f8418
+6319cc0c186318c6087f33cc633ce73fffe7fcc7cfe67e10618c60c0c661830000cc3386633060181860840cc618ce33021f9ff33ce663f84180319860c0c60018033830198cc30020f4005efa000130c0ef00531f80679c0f83cffc3f841861f1b87f8fa1f07c087f87e2647e0f3fffe01e2601f0fe106187c0c07c3e9fe0
+00781d83c1e060180fc084078618761e021f80787e0e71f841fe1f0fa0c07c001fe1d9fe0f07830020f40032fa000130c0ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe10ef000084fc0001021ffcff01f840f2000020f40032fa000130c0ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe10ef00
+0084fc0001021ffcff01f840f2000020f40032fa00011f80ef00001ff9ff048400000180fc0004087fffffe7f8ff01fe10ef000084fc0001021ffcff01f840f2000020f4002de600001ff9ff048400000180fc0004087fffffe7f8ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0084f80001
+087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0084f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0084f80001087ff5ff01fe10ef000084fc0001021ffcff01f840f2000020f40026e600001ff9ff0087f8ff01f87ff5ff01fe1fefff00
+87fcff01fe1ffcff01f87ff2ff00e0f40002850002850002850002850002850002850002850002850007001f88ff01fe00180010fc000006fe00010180fe000060fc00000c9d00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000c
+c9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de0001020024151000004010000600004001800200006000100400000cc9000001fa550040de000102002415100000c03000060000c001800300006000180600000cc9000002faaa00a0de0001020024151000018060
+0006000180018001800060000c0300000cc9000001fa550040de00010200241510000300c00006000300018000c0006000060180000cc9000002faaa00a0de00010200241510000601800006000600018000600060000300c0000cc9000001fa550040de00010200241510000c03000006000c000180003000600001806000
+0cc9000002faaa00a0de000102002415100018060000060018000180001800600000c030000cc9000001fa550040de00010200241510000c03000006000c0001800030006000018060000cc9000002faaa00a0de00010200241510000601800006000600018000600060000300c0000cc9000001fa550040de000102002415
+10000300c00006000300018000c0006000060180000cc9000002faaa00a0de00010200241510000180600006000180018001800060000c0300000cc9000001fa550040de000102002415100000c03000060000c001800300006000180600000cc9000002faaa00a0de00010200241510000040100006000040018002000060
+00100400000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00
+010180fe000060fc00000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000cc9000001fa550040de00010200180010fc000006fe00010180fe000060fc00000c9d0001020007001f88ff01fe0007001f88
+ff01fe000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200420010ed00000cfb000307f8780cf800037f9fe0c0f9000307f8780cf800037f8780c0f9000301e0300cf800031e0300c0f900
+0301e0780cf800071e0780c000000200420010ed00000cfb00030600cc1ef80003600061e0f900030600cc1ef80003600cc1e0f900030330781ef80003330701e0f900030330cc1ef80007330cc1e000000200420010ed00000cfb000306018633f8000360006330f9000306018633f8000360186330f900030618cc33f800
+03618f0330f9000306198633f800076198633000000200420010ed00000cfb000306018033f800036000c330f9000306018633f8000360186330f900030600cc33f80003601b0330f9000306018633f800076018633000000200460010ed00000cfb00040601806180f900036000c618f900040601866180f9000360186618
+f900040601866180f9000360130618f900040600066180f900076000661800000200460010ed00000cfb000406e1b86180f900036e018618f9000406e0cc6180f900036e186618f9000406e1866180f900036e030618f9000406e0066180f900076e00c61800000200460010ed00000cfb00040731cc6180f9000373018618
+f900040730786180f90003730ce618f900040731866180f9000373030618f9000407300c6180f900077303861800000200440010ed00000cfa000319866180f9000301830618f8000318cc6180f9000301876618f900040619866180f9000361830618f900040618386180f900076180c61800000200440010ed00000cfa00
+0319866180f9000301830618f8000319866180f9000301806618f900040619866180f9000361830618f900040618606180f900076180661800000200400010ed00000cfa0002198633f8000301860330f80002198633f8000301806330f900030618cc33f8000361830330f900030618c033f8000761986330000002004200
+10ed00000cfb000306198633f8000361860330f9000306198633f8000361986330f900030618cc33f8000361830330f9000306198033f800076198633000000200420010ed00000cfb00030330cc1ef80003330c01e0f900030330cc1ef80003330cc1e0f900030330781ef80003330301e0f900030331801ef80007330cc1
+e000000200420010ed00000cfb000301e0780cf800031e0c00c0f9000301e0780cf800031e0780c0f9000301e0300cf800031e1fe0c0f9000301e1fe0cf800071e0780c0000002000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c
+9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102004b0010fe000dc0781e000001fe1e0001e0000003fe002f0c1fe0c0781e000001fe7f9fe7f8780c0787f9fe7f8301e0300c1fe1e0301e0301e1fe7f8780c1fe7f9fe0c0307f8780cc000102004b1110000001c0cc330000018033000330000007fe00
+050c0301e0cc33fe0026300c0300c0cc1e0cc0c0300c078330781e0303307833078330300c0cc1e0300c0301e0780c0cc0cc000102004b1110000003c18661800001806180061800000ffe002f0c0303318661800000300c0300c186331860c0300c0cc618cc33030618cc618cc618300c186330300c030330cc0c1860cc00
+0102004a1110000006c18660000001806000061800001bfe00050c0303318060fe0025300c0300c180331800c0300c0cc600cc33030600cc600cc600300c180330300c030330cc0c18cb000102004a1110000004c186600000018060000618000013fe00050c0306198060fe0025300c0300c180619800c0300c1866018661
+8306018660186600300c180618300c030619860c18cb000102004a0010fe000dc0cc6e0003f1b86e0fc618003f03fe00050c0306198060fe0025300c0300c180619800c0300c18660186618306018660186600300c180618300c030619860c18cb000102004b0010fe000dc07873000619cc73186338006183fe002f0c0306
+199e679fe7f8300c0300c1806199e0c0300c18667986618306018667986678300c180618300c030619860c19e0cc000102004b0010fe000dc0cc6180001806618061d8006003fe002f0c0307f98661800000300c0300c1807f9860c0300c1fe619fe7f830601fe619fe618300c1807f8300c0307f9fe0c1860cc000102004b
+0010fe000dc186618003f806618fe018003f03fe002f0c0306198661800000300c0300c180619860c0300c18661986618306018661986618300c180618300c030619860c1860cc000102004b0010fe000dc186618006180661986018000183fe002f0c0306198661800000300c0300c180619860c0300c1866198661830601
+8661986618300c180618300c030619860c1860cc000102004b0010fe000dc186618006198661986618000183fe002f0c0306198661800000300c0300c186619860c0300c18661986618306198661986618300c186618300c030619860c1860cc000102004b0010fe000dc0cc33000618cc33186330386183fe002f0c030618
+ce33800000300c0300c0cc618ce0c0300c18633986618303318633986338300c0cc618300c030619860c0ce0cc000102004b4410000007f8781e0003e8781e0fa1e0383f1fe000000c0306187a1e800000300c0300c0786187a0c0300c1861e986618301e1861e9861e8300c078618300c030619860c07a0cc000102000b00
+10ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200630010fe000dc0041e000001fe0c03c7f8000003fe00470c1fe0c0780c030001fe7f9fe7f8780c0787f9fe7f83
+01e0300c1fe1e0301e0301e1fe7f8780c1fe7f9fe0c0307f8787f9fe1e0301e1fe7f9fe1e0780c0301e0781e0307f8781e03e400010200640d10000001c00c33000001801c0666fe000007fe00480c0301e0cc1e078000300c0300c0cc1e0cc0c0300c078330781e0303307833078330300c0cc1e0300c0301e0780c0cc0c0
+3033078330300c030330cc1e078330cc330780c0cc330780e500010200640d10000003c01c61800001803c0666fe00000ffe00480c03033186330cc000300c0300c186331860c0300c0cc618cc33030618cc618cc618300c186330300c030330cc0c1860c030618cc618300c03061986330cc61986618cc0c186618cc0e500
+010200640d10000006c03c60000001806c0606fe00001bfe00480c03033180330cc330300c0300c180331800c0300c0cc600cc33030600cc600cc600300c180330300c030330cc0c1800c030600cc600300c03060180330cc60180600cc0c180600cc0e500010200640d10000004c06c60000001804c0606fe000013fe0048
+0c0306198061986330300c0300c180619800c0300c18660186618306018660186600300c180618300c030619860c1800c03060186600300c030601806198660180601860c180601860e500010200640010fe000dc0cc6e0003f1b80c0606e0003f03fe00480c03061980619861e0300c0300c180619800c0300c1866018661
+8306018660186600300c180618300c030619860c1800c03060186600300c030601806198660180601860c180601860e500010200640010fe000dc18c73000619cc0c060730006183fe00480c0306199e619867f8300c0300c1806199e0c0300c18667986618306018667986678300c180618300c030619860c19e0c0306018
+6678300c0306019e6198660180601860c180601860e500010200640010fe000dc18c61800018060c1f8018006003fe00480c0307f9867f9fe1e0300c0300c1807f9860c0300c1fe619fe7f830601fe619fe618300c1807f8300c0307f9fe0c1860c030601fe618300c030601867f9fe60180601fe0c180601fe0e500010200
+640010fe000dc1fe618003f8060c060018003f03fe00480c0306198661986330300c0300c180619860c0300c18661986618306018661986618300c180618300c030619860c1860c03060186618300c030601866198660180601860c180601860e500010200640010fe000dc00c61800618060c060018000183fe00480c0306
+198661986330300c0300c180619860c0300c18661986618306018661986618300c180618300c030619860c1860c03060186618300c030601866198660180601860c180601860e500010200640010fe000dc00c61800619860c060618000183fe00480c0306198661986000300c0300c186619860c0300c1866198661830619
+8661986618300c186618300c030619860c1860c03061986618300c030619866198661986619860c186619860e500010200640010fe000dc00c33000618cc0c060330386183fe00480c030618ce61986000300c0300c0cc618ce0c0300c18633986618303318633986338300c0cc618300c030619860c0ce0c0303318633830
+0c030330ce61986330cc331860c0cc331860e500010200645d10000007f80c1e0003e8787f8601e0383f1fe000000c0306187a61986000300c0300c0786187a0c0300c1861e986618301e1861e9861e8300c078618300c030619860c07a0c0301e1861e8300c0301e07a619861e0781e1860c0781e1860e5000102000b0010
+ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102007d1110000001e03001000001fe1e0067f8000003fe00660c1fe0c0780c030001fe7f9fe7f8780c0787f9fe7f8301
+e0300c1fe1e0301e0301e1fe7f8780c1fe7f9fe0c0307f8787f9fe1e0301e1fe7f9fe1e0780c0301e0781e0307f8781e0300c0780c1fe1e0307f9fe000780c1fe1e1fe1e0787f8781e0780c0787f82007d0d1000000330700300000180330066fe000007fe00660c0301e0cc1e078000300c0300c0cc1e0cc0c0300c078330
+781e0303307833078330300c0cc1e0300c0301e0780c0cc0c03033078330300c030330cc1e078330cc330780c0cc330781e0cc1e030330780c030000cc1e03033030330cc0c0cc330cc1e0cc0c02007d0d1000000618f00700000180618066fe00000ffe00660c03033186330cc000300c0300c186331860c0300c0cc618cc
+33030618cc618cc618300c186330300c030330cc0c1860c030618cc618300c03061986330cc61986618cc0c186618cc3318633030618cc0c030001863303061830619860c18661986331860c02007d0d1000000619b00f00000180600066fe00001bfe00660c03033180330cc330300c0300c180331800c0300c0cc600cc33
+030600cc600cc600300c180330300c030330cc0c1800c030600cc600300c03060180330cc60180600cc0c180600cc3318033030600cc0c030001803303060030601800c18060180331800c02007d0010fe000919301b00000180600066fe000013fe00660c0306198061986330300c0300c180619800c0300c186601866183
+06018660186600300c180618300c030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c030001806183060030601800c18060180619800c02007d0010fe000d1830330003f1b86e0766e0003f03fe00660c03061980619861e0300c0300c180619800c0300c18660186618306
+018660186600300c180618300c030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c030001806183060030601800c18060180619800c02007d0010fe000d303063000619cc730ce730006183fe00660c0306199e619867f8300c0300c1806199e0c0300c1866798661830601
+8667986678300c180618300c030619860c19e0c03060186678300c0306019e6198660180601860c180601866198061830601860c0307f9806183060030601800c18060180619800c02007d0010fe000de030630000180661986018006003fe00660c0307f9867f9fe1e0300c0300c1807f9860c0300c1fe619fe7f830601fe
+619fe618300c1807f8300c0307f9fe0c1860c030601fe618300c030601867f9fe60180601fe0c180601fe7f9807f830601fe0c030001807f83060030601800c180601807f9800c02007d111000000180307f8003f80661986018003f03fe00660c0306198661986330300c0300c180619860c0300c18661986618306018661
+986618300c180618300c030619860c1860c03060186618300c030601866198660180601860c180601866198061830601860c030001806183060030601800c18060180619800c02007d11100000030030030006180661986018000183fe00660c0306198661986330300c0300c180619860c0300c1866198661830601866198
+6618300c180618300c030619860c1860c03060186618300c030601866198660180601860c180601866198061830601860c030001806183060030601800c18060180619800c02007d11100000060030030006198661986618000183fe00660c0306198661986000300c0300c186619860c0300c186619866183061986619866
+18300c186618300c030619860c1860c03061986618300c030619866198661986619860c186619866198661830619860c030001866183061830619860c18661986619860c02007d1110000006003003000618cc330ce330386183fe00660c030618ce61986000300c0300c0cc618ce0c0300c18633986618303318633986338
+300c0cc618300c030619860c0ce0c03033186338300c030330ce61986330cc331860c0cc33186618cc61830331860c030000cc6183033030330cc0c0cc330cc618cc0c02007d7b10000007f9fe030003e8781e0761e0383f1fe000000c0306187a61986000300c0300c0786187a0c0300c1861e986618301e1861e9861e830
+0c078618300c030619860c07a0c0301e1861e8300c0301e07a619861e0781e1860c0781e18661878618301e1860c03000078618301e0301e0780c0781e078618780c02000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102
+000b0010ed00000c9d000102000b0010ed00000c9d000102007d0010fe000dc0787f800001fe0c180010000003fe00660c1fe0c0780c030001fe7f9fe7f8780c0787f9fe7f8301e0300c1fe1e0301e0301e1fe7f8780c1fe001fe0c0307f8787f9fe1e0301e1fe7f9fe1e0780c0301e0781e0307f8781e0300c0780c1fe1e0
+307f9fe7f8780c1fe1e1fe1e0787f8781e0780c0787f82007d1110000001c0cc01800001801c180030000007fe00660c0301e0cc1e078000300c0300c0cc1e0cc0c0300c078330781e0303307833078330300c0cc1e030000301e0780c0cc0c03033078330300c030330cc1e078330cc330780c0cc330781e0cc1e03033078
+0c0300c0cc1e03033030330cc0c0cc330cc1e0cc0c02007d1110000003c18601800001803c18007000000ffe00660c03033186330cc000300c0300c186331860c0300c0cc618cc33030618cc618cc618300c1863303000030330cc0c1860c030618cc618300c03061986330cc61986618cc0c186618cc3318633030618cc0c
+0300c1863303061830619860c18661986331860c02007d1110000006c18003000001806c1800f000001bfe00660c03033180330cc330300c0300c180331800c0300c0cc600cc33030600cc600cc600300c1803303000030330cc0c1800c030600cc600300c03060180330cc60180600cc0c180600cc3318033030600cc0c03
+00c1803303060030601800c18060180331800c02007d1110000004c18003000001804c1801b0000013fe00660c0306198061986330300c0300c180619800c0300c18660186618306018660186600300c1806183000030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c0300
+c1806183060030601800c18060180619800c02007d0010fe000dc1b8060003f1b80c1b8330003f03fe00660c03061980619861e0300c0300c180619800c0300c18660186618306018660186600300c1806183000030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c0300c1
+806183060030601800c18060180619800c02007d1110001fe0c1cc06000619cc0c1cc630006183fe00660c0306199e619867f8300c0300c1806199e0c0300c18667986618306018667986678300c180618307f830619860c19e0c03060186678300c0306019e6198660180601860c180601866198061830601860c0300c180
+6183060030601800c18060180619800c02007d0010fe000dc1860c000018060c186630006003fe00660c0307f9867f9fe1e0300c0300c1807f9860c0300c1fe619fe7f830601fe619fe618300c1807f830000307f9fe0c1860c030601fe618300c030601867f9fe60180601fe0c180601fe7f9807f830601fe0c0300c1807f
+83060030601800c180601807f9800c02007d0010fe000dc1860c0003f8060c1867f8003f03fe00660c0306198661986330300c0300c180619860c0300c18661986618306018661986618300c1806183000030619860c1860c03060186618300c030601866198660180601860c180601866198061830601860c0300c1806183
+060030601800c18060180619800c02007d0010fe000dc18618000618060c186030000183fe00660c0306198661986330300c0300c180619860c0300c18661986618306018661986618300c1806183000030619860c1860c03060186618300c030601866198660180601860c180601866198061830601860c0300c180618306
+0030601800c18060180619800c02007d0010fe000dc18618000619860c186030000183fe00660c0306198661986000300c0300c186619860c0300c18661986618306198661986618300c1866183000030619860c1860c03061986618300c030619866198661986619860c186619866198661830619860c0300c18661830618
+30619860c18661986619860c02007d0010fe000dc0cc30000618cc0c186030386183fe00660c030618ce61986000300c0300c0cc618ce0c0300c18633986618303318633986338300c0cc6183000030619860c0ce0c03033186338300c030330ce61986330cc331860c0cc33186618cc61830331860c0300c0cc6183033030
+330cc0c0cc330cc618cc0c02007d7b10000007f878300003e8787f986030383f1fe000000c0306187a61986000300c0300c0786187a0c0300c1861e986618301e1861e9861e8300c0786183000030619860c07a0c0301e1861e8300c0301e07a619861e0781e1860c0781e18661878618301e1860c0300c078618301e0301e
+0780c0781e078618780c02000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200790010fa007301e078618787f9861e1861e0000c1fe0c0780c030001fe7f9f
+e7f8780c0787f9fe7f8301e0300c1fe1e0301e0301e1fe7f8780c1fe7f9fe0c0307f8787f9fe1e0301e1fe7f9fe1e0780c0301e0781e0307f8781e0300c0780c1fe1e0307f9fe7f8780c1fe1e1fe1e0787f8781e0780c0787f8200790010fa00730330cc718cc601c633186330000c0301e0cc1e078000300c0300c0cc1e0c
+c0c0300c078330781e0303307833078330300c0cc1e0300c0301e0780c0cc0c03033078330300c030330cc1e078330cc330780c0cc330781e0cc1e030330780c0300c0cc1e03033030330cc0c0cc330cc1e0cc0c0200790010fa007306198671986601c661986618000c03033186330cc000300c0300c186331860c0300c0c
+c618cc33030618cc618cc618300c186330300c030330cc0c1860c030618cc618300c03061986330cc61986618cc0c186618cc3318633030618cc0c0300c1863303061830619860c18661986331860c0200790010fa007306018679980601e660186600000c03033180330cc330300c0300c180331800c0300c0cc600cc3303
+0600cc600cc600300c180330300c030330cc0c1800c030600cc600300c03060180330cc60180600cc0c180600cc3318033030600cc0c0300c1803303060030601800c18060180331800c0200790010fa007306018679980601e660186600000c0306198061986330300c0300c180619800c0300c1866018661830601866018
+6600300c180618300c030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c0300c1806183060030601800c18060180619800c0200790010fa00730601866d8c0601b630186300000c03061980619861e0300c0300c180619800c0300c18660186618306018660186600300c18
+0618300c030619860c1800c03060186600300c030601806198660180601860c180601866198061830601860c0300c1806183060030601800c18060180619800c0200790010fa00730601866d8787e1b61e1861e0000c0306199e619867f8300c0300c1806199e0c0300c18667986618306018667986678300c180618300c03
+0619860c19e0c03060186678300c0306019e6198660180601860c180601866198061830601860c0300c1806183060030601800c18060180619800c0200790010fa00730601866780c6019e03186030000c0307f9867f9fe1e0300c0300c1807f9860c0300c1fe619fe7f830601fe619fe618300c1807f8300c0307f9fe0c18
+60c030601fe618300c030601867f9fe60180601fe0c180601fe7f9807f830601fe0c0300c1807f83060030601800c180601807f9800c0200790010fa0073060186678066019e01986018000c0306198661986330300c0300c180619860c0300c18661986618306018661986618300c180618300c030619860c1860c0306018
+6618300c030601866198660180601860c180601866198061830601860c0300c1806183060030601800c18060180619800c0200790010fa0073060186638066018e01986018000c0306198661986330300c0300c180619860c0300c18661986618306018661986618300c180618300c030619860c1860c03060186618300c03
+0601866198660180601860c180601866198061830601860c0300c1806183060030601800c18060180619800c0200790010fa0073061986639866018e61986618000c0306198661986000300c0300c186619860c0300c18661986618306198661986618300c186618300c030619860c1860c03061986618300c030619866198
+661986619860c186619866198661830619860c0300c1866183061830619860c18661986619860c0200790010fa00730330cc618cc60186330cc330000c030618ce61986000300c0300c0cc618ce0c0300c18633986618303318633986338300c0cc618300c030619860c0ce0c03033186338300c030330ce61986330cc3318
+60c0cc33186618cc61830331860c0300c0cc6183033030330cc0c0cc330cc618cc0c0200790010fa007301e078618787f9861e0781e0000c0306187a61986000300c0300c0786187a0c0300c1861e986618301e1861e9861e8300c078618300c030619860c07a0c0301e1861e8300c0301e07a619861e0781e1860c0781e18
+661878618301e1860c0300c078618301e0301e0780c0781e078618780c02000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200
+0b0010ed00000c9d0001020007001f88ff01fe0007001f88ff01fe000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c
+0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe
+000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0
+fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c03
+0000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0
+f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c
+0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300
+c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f0000102000b0010ed00000c9d00010200590010ed
+00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c00000030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f000010200590010ed00000cfa0000c0fd0003c0300c03fe00290c0300c0000c0300c0300c0300c0000c0300c000
+00030000300c030000300c0300c000000300c0300003fe000e0c0300c0000c0000c0300c0300c030fe0000c0f0000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed0000
+0c9d000102000b0010ed00000c9d000102000b0010ed00000c9d0001020007001f88ff01fe0007001f88ff01fe00180010fc000006fe00010180fe000060fc00000c9d00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000cc90000
+01fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de0001020024151000004010000600004001800200006000100400000cc9000001fa550040de000102002415100000c03000060000c001800300006000180600000cc9000002faaa00a0de0001020024151000018060000600
+0180018001800060000c0300000cc9000001fa550040de00010200241510000300c00006000300018000c0006000060180000cc9000002faaa00a0de00010200241510000601800006000600018000600060000300c0000cc9000001fa550040de00010200241510000c03000006000c0001800030006000018060000cc900
+0002faaa00a0de000102002415100018060000060018000180001800600000c030000cc9000001fa550040de00010200241510000c03000006000c0001800030006000018060000cc9000002faaa00a0de00010200241510000601800006000600018000600060000300c0000cc9000001fa550040de000102002415100003
+00c00006000300018000c0006000060180000cc9000002faaa00a0de00010200241510000180600006000180018001800060000c0300000cc9000001fa550040de000102002415100000c03000060000c001800300006000180600000cc9000002faaa00a0de00010200241510000040100006000040018002000060001004
+00000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180
+fe000060fc00000cc9000001fa550040de00010200200010fc000006fe00010180fe000060fc00000cc9000002faaa00a0de00010200200010fc000006fe00010180fe000060fc00000cc9000001fa550040de00010200180010fc000006fe00010180fe000060fc00000c9d0001020007001f88ff01fe0007001f88ff01fe
+000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200340010ed00000cf700020300c0f70001780cf700020780c0f70001040cf700021fe0c0f70001780cf700021fe0c0f70003780c020033
+0010ed00000cf700020701e0f70001cc1ef700020cc1e0f700010c1ef700021801e0f70001cc1ef6000161e0f70003cc1e0200360010ed00000cf700020f0330f80002018633f70002186330f700011c33f70002180330f80002018633f600016330f800040186330200360010ed00000cf700021b0330f80002018633f700
+02186330f700013c33f70002180330f80002018033f60001c330f800040186330200370010ed00000cf70002130618f70002066180f700016618f700026c6180f80002180618f8000301806180f70001c618f800040186618200370010ed00000cf70002030618f70002066180f70001c618f70002cc6180f800021b8618f8
+000301b86180f80002018618f70003cc618200390010ed00000cf70002030618f700020c6180f80002038618f80003018c6180f800021cc618f8000301cc6180f80002018618f7000378618200370010ed00000cf70002030618f70002386180f70001c618f80003018c6180f700016618f8000301866180f80002030618f7
+0003cc618200380010ed00000cf70002030618f70002606180f700016618f8000301fe6180f700016618f8000301866180f80002030618f800040186618200350010ed00000cf70002030330f70001c033f70002186330f700010c33f600016330f80002018633f70002060330f800040186330200370010ed00000cf70002
+030330f80002018033f70002186330f700010c33f70002186330f80002018633f70002060330f800040186330200350010ed00000cf700020301e0f8000201801ef700020cc1e0f700010c1ef700020cc1e0f70001cc1ef700020c01e0f70003cc1e0200350010ed00000cf700021fe0c0f8000201fe0cf700020780c0f700
+010c0cf700020780c0f70001780cf700020c00c0f70003780c02000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d00010200100010ed00000cad0001ffc0f300010200100010ed00000cad0001ffc0f3000102007d0010fe0002
+01fe01fe0007041e0001e0000003fe00660c1fe0c0780c0307f9fe7f9fe1e0301e1fe7f9fe0c0780c0307f8780c0780c0787f9fe1e0307f9fe7f8300c1fe1e1fe7f8780c0787f9fe7f8781e0300c0781e0780c1fe1e0780c0301e0307f8780c1fe7f9fe00078f3dfe1e1fe1e0787f8781e0780c0787f82007d0010fe000201
+8003fe00070c33000330000007fe00660c0301e0cc1e0780c0300c03033078330300c0301e0cc1e0780c0cc1e0cc1e0cc0c030330780c0300c0781e030330300c0cc1e0cc0c0300c0cc330781e0cc330cc1e030330cc1e078330780c0cc1e0300c030000cce1c3033030330cc0c0cc330cc1e0cc0c02007d0010fe00020180
+07fe00071c6180061800000ffe00660c03033186330cc0c0300c030618cc618300c03033186330cc0c18633186331860c030618cc0c0300c0cc33030618300c186331860c0300c186618cc33186619863303061986330cc618cc0c186330300c03000186ccc3061830619860c18661986331860c02007d0010fe000201800f
+fe00073c6000061800001bfe00660c03033180330cc0c0300c030600cc600300c03033180330cc0c18033180331800c030600cc0c0300c0cc33030600300c180331800c0300c180600cc33180601803303060180330cc600cc0c180330300c03000180ccc3060030601800c18060180331800c02007d0010fe000201801bfe
+00076c60000018000013fe00660c03061980619860c0300c03060186600300c03061980619860c18061980619800c030601860c0300c18661830600300c180619800c0300c180601866198060180618306018061986601860c180618300c030001809e43060030601800c18060180619800c02007d0010fe000d01b8330003
+f0cc6e078030003f03fe00660c03061980619860c0300c03060186600300c03061980619860c18061980619800c030601860c0300c18661830600300c180619800c0300c180601866198060180618306018061986601860c180618300c030001809e43060030601800c18060180619800c02007d0010fe000d01cc63000619
+8c730cc0e0006183fe00660c0306199e619860c0300c03060186678300c0306199e619860c1806199e6199e0c030601860c0300c18661830678300c1806199e0c0300c180679866198060180618306018061986601860c180618300c0307f9809e43060030601800c18060180619800c02007c0010fd000c06630000198c61
+986030006003fe00660c0307f9867f9fe0c0300c030601fe618300c0307f9867f9fe0c1807f9867f9860c030601fe0c0300c1fe7f830618300c1807f9860c0300c180619fe7f980601807f830601807f9fe601fe0c1807f8300c030001808043060030601800c180601807f9800c02007c0010fd000c067f8003f9fe619fe0
+18003f03fe00660c03061986619860c0300c03060186618300c03061986619860c18061986619860c030601860c0300c18661830618300c180619860c0300c180619866198060180618306018061986601860c180618300c030001809e43060030601800c18060180619800c02007c0010fd000c06030006180c6198061800
+0183fe00660c03061986619860c0300c03060186618300c03061986619860c18061986619860c030601860c0300c18661830618300c180619860c0300c180619866198060180618306018061986601860c180618300c030001809e43060030601800c18060180619800c02007d0010fe000d0186030006180c619806180001
+83fe00660c03061986619860c0300c03061986618300c03061986619860c18661986619860c030619860c0300c18661830618300c186619860c0300c186619866198661986618306198661986619860c186618300c030001869e43061830619860c18661986619860c02007c0010fd000ccc030006180c330c6330386183fe
+00660c030618ce619860c0300c03033186338300c030618ce619860c0cc618ce618ce0c030331860c0300c18661830338300c0cc618ce0c0300c0cc33986618cc330cc61830330cc61986331860c0cc618300c030000cc9e43033030330cc0c0cc330cc618cc0c02007c0010fd007678030003e80c1e07c1e0383f1fe00000
+0c0306187a619860c0300c0301e1861e8300c0306187a619860c0786187a6187a0c0301e1860c0300c186618301e8300c0786187a0c0300c0781e986618781e078618301e078619861e1860c078618300c030000789e4301e0301e0780c0781e078618780c0200100010ed00000cad0001ffc0f300010200100010ed00000c
+ad0001ffc0f300010200100010ed00000cad0001ffc0f300010200100010ed00000cad0001ffc0f300010200100010ed00000cad0001ffc0f3000102000b0010ed00000c9d000102000b0010ed00000c9d00010200790010fa007301e078618787f9861e1861e0000c1fe0c0780c0307f9fe7f9fe1e0301e1fe7f9fe0c0780
+c0307f8780c0780c0787f9fe1e0307f9fe7f8300c1fe1e1fe7f8780c0787f9fe7f8781e0300c0781e0780c1fe1e0780c0301e0307f8780c1fe7f9fe000780c1fe1e1fe1e0787f8781e0780c0787f8200790010fa00730330cc718cc601c633186330000c0301e0cc1e0780c0300c03033078330300c0301e0cc1e0780c0cc1
+e0cc1e0cc0c030330780c0300c0781e030330300c0cc1e0cc0c0300c0cc330781e0cc330cc1e030330cc1e078330780c0cc1e0300c030000cc1e03033030330cc0c0cc330cc1e0cc0c0200790010fa007306198671986601c661986618000c03033186330cc0c0300c030618cc618300c03033186330cc0c18633186331860
+c030618cc0c0300c0cc33030618300c186331860c0300c186618cc33186619863303061986330cc618cc0c186330300c030001863303061830619860c18661986331860c0200790010fa007306018679980601e660186600000c03033180330cc0c0300c030600cc600300c03033180330cc0c18033180331800c030600cc0
+c0300c0cc33030600300c180331800c0300c180600cc33180601803303060180330cc600cc0c180330300c030001803303060030601800c18060180331800c0200790010fa007306018679980601e660186600000c03061980619860c0300c03060186600300c03061980619860c18061980619800c030601860c0300c1866
+1830600300c180619800c0300c180601866198060180618306018061986601860c180618300c030001806183060030601800c18060180619800c0200790010fa00730601866d8c0601b630186300000c03061980619860c0300c03060186600300c03061980619860c18061980619800c030601860c0300c18661830600300
+c180619800c0300c180601866198060180618306018061986601860c180618300c030001806183060030601800c18060180619800c0200790010fa00730601866d8787e1b61e1861e0000c0306199e619860c0300c03060186678300c0306199e619860c1806199e6199e0c030601860c0300c18661830678300c1806199e0
+c0300c180679866198060180618306018061986601860c180618300c0307f9806183060030601800c18060180619800c0200790010fa00730601866780c6019e03186030000c0307f9867f9fe0c0300c030601fe618300c0307f9867f9fe0c1807f9867f9860c030601fe0c0300c1fe7f830618300c1807f9860c0300c1806
+19fe7f980601807f830601807f9fe601fe0c1807f8300c030001807f83060030601800c180601807f9800c0200790010fa0073060186678066019e01986018000c03061986619860c0300c03060186618300c03061986619860c18061986619860c030601860c0300c18661830618300c180619860c0300c18061986619806
+0180618306018061986601860c180618300c030001806183060030601800c18060180619800c0200790010fa0073060186638066018e01986018000c03061986619860c0300c03060186618300c03061986619860c18061986619860c030601860c0300c18661830618300c180619860c0300c180619866198060180618306
+018061986601860c180618300c030001806183060030601800c18060180619800c0200790010fa0073061986639866018e61986618000c03061986619860c0300c03061986618300c03061986619860c18661986619860c030619860c0300c18661830618300c186619860c0300c1866198661986619866183061986619866
+19860c186618300c030001866183061830619860c18661986619860c0200790010fa00730330cc618cc60186330cc330000c030618ce619860c0300c03033186338300c030618ce619860c0cc618ce618ce0c030331860c0300c18661830338300c0cc618ce0c0300c0cc33986618cc330cc61830330cc61986331860c0cc6
+18300c030000cc6183033030330cc0c0cc330cc618cc0c0200790010fa007301e078618787f9861e0781e0000c0306187a619860c0300c0301e1861e8300c0306187a619860c0786187a6187a0c0301e1860c0300c186618301e8300c0786187a0c0300c0781e986618781e078618301e078619861e1860c078618300c0300
+0078618301e0301e0780c0781e078618780c02000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102000b0010ed00000c9d000102
+0007001f88ff01fe00028500028500a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.4\tab A typical display from the join editor in XBAP.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.8\tab Selecting primers and templates\par 
+\par 
+\pard\plain \qj \f4\fs16 {\plain \f20 1.   Select "Edit contig". The primer and template selection function is available from the popup menu of the contig editor.\par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 2.   Open the oligo selection window, by selecting "Select Oligo" from the contig editor popup menu.\par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 3.   Position the cursor to where you want the oligo to be chosen. While the oligo selection window is visible, you will still have complete control over positioning and editing within the contig editor.\par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 4.    Indicate the strand for which you require an oligo. This is done by toggling the direction arrow ("----->" or "<------"), if necessary.\par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 
+5.    Press the "Find Oligos" button to find all suitable oligos (See "Oligo selection" in  Note 17.) Information for the closest oligo to the cursor position is given in the output text window. In the contig editor the position of the oligo is marked by a
+ temporary tag on the consensus. The window is recentered if the oligo is off the screen. Selecting "Display Selection Information" will print a short report on the numbers of oligos considered and rejected during oligo selection. \par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 6.    If this oligo is not suitable (it may have been previously chosen, and found to be unsuitable by experimentation, say), the next closest oligo can be viewed by pressing "Select Next".  \par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 
+7.   Suitable templates are automatically identified for the currently displayed oligo (See "Template selection" in Note 18.) By default, the template is that closest to the oligo site. If the choice is not suitable (it may be known to be a poor quality 
+template, say) another can be chosen from the "Choose Template for this Oligo" menu. Templates that do not appear on the menu can be specified by selecting "other". However, the template must be on the correct strand and be upstream of the oligo.  \par 
+}\pard \qj {\plain \f20 \par 
+}\pard \qj {\plain \f20 
+8.    A tag can be created for the current oligo by pressing the button "Create a tag for this oligo". The annotation for this tag holds the name of the template and the oligo primer sequence.  There are fields to allow the user to specify their own primer
+ name ("serial#") and comments ("flags") for this tag. An example of oligo tag annotation\: \par 
+}\pard \qj {\plain \f20 \par 
+         serial#=         \par 
+         template=a16a9.s1           \par 
+         sequence=CGTTATGACCTATATTTTGTATG         \par 
+         flags=\par 
+\par 
+}\pard \qj {\plain \f20  9.   The oligo selection window is closed when "Create a tag for this oligo" or "Quit" is selected. \par 
+}\pard \qj {\plain \f20 \par 
+}\pard\plain \s6\qj\sa60\tx560\tx860 \b\f20 \par 
+\pard \s6\sa60\sl280\tx560\tx860 2.9\tab Examining the "quality" of a contig\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function reports on the proportion of the consensus that is "well determined" and will display a sequence of symbols that indicate the quality 
+of the consensus at each position or produce a graphical display.  Each strand of the contig is analysed separately using the consensus algorithm, and a position is declared "well determined" if it is assigned one of the symbols a,c,g,t. The current consen
+sus calculation cutoff score is used.\par 
+\pard \s4\qj\sa120\sl280 A summary showing the percentage of the consensus that falls into each category of quality is shown. The analysis divides the data into five categories, assigning each a code as shown in figure 4.5.  Code 0 means well 
+determined on both strands and they agree,    1 means well determined on the plus strand only, 2 means well determined on the minus strand only, 3 means not well determined on either strand and 4 means well determined on both strands but they disagree. If 
+the user chooses to have the data displayed graphically the following scheme is used. A rectangular box is drawn so that the x coordinate represents the length of the contig. The box is notionally divided vertically into 5 possible levels which are given t
+he y values\:
+ -2,-1,0,1,2. The quality codes assigned to each base position are plotted as rectangles. Each rectangle represents a region in which the quality codes are identical, so a single base having a different code from its immediate neighbours will a
+ppear as a very narrow rectangle. Obviously a single  line  at  the  midheight  shows  a  perfect sequence. In figure 4.6 we show the result for the section of contig shown in figure 4.8.\par 
+\pard \s4\qj\sa120\sl280 \par 
+\par 
+\par 
+\par 
+\par 
+\pard \s4\qj\li1580\ri1760\sb160\sl280\box\brsp100\brdrth \tqc\tx2000\tqc\tx3960\tqc\tx6360 \tab {\b Strands\tab Quality\tab Y cordinates\par 
+}\pard \s4\qj\li1580\ri1760\sa120\sl280\box\brsp100\brdrth \tqc\tx2000\tqc\tx3960\tqc\tx6200 {\b \tab OK\tab code\par 
+}\pard \s4\qj\li1580\ri1760\sa120\sl280\box\brsp100\brdrth \tqc\tx1780\tqc\tx2120\tx2380\tqc\tx3960\tqr\tx6000\tx6220\tqr\tx6740 \tab +\tab -\tab and the same\tab 0\tab 0\tab to\tab 0\par 
+\pard \s4\qj\li1580\ri1760\sa120\sl280\box\brsp100\brdrth \tqc\tx1780\tqc\tx2120\tqc\tx3960\tqr\tx6000\tx6220\tqr\tx6740 \tab +\tab \tab 1\tab 0\tab to\tab 1\par 
+\tab -\tab \tab 2\tab -1\tab to\tab 0\par 
+\pard \s4\qj\li1580\ri1760\sa120\sl280\box\brsp100\brdrth \tqc\tx2120\tqc\tx3960\tqr\tx6000\tx6220\tqr\tx6740 \tab neither\tab 3\tab -1\tab to\tab 1\par 
+\pard \s4\qj\li1580\ri1760\sa60\sl280\keepn\box\brsp100\brdrth \tqc\tx1780\tqc\tx2120\tx2400\tqc\tx3960\tqr\tx6000\tx6220\tqr\tx6740 \tab +\tab -\tab but different \tab 4\tab -2\tab to\tab 2\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.5\tab The codes and coordinates used by the "Quality plot". \par 
+\par 
+\pard\plain \li1500\ri1660\sb400\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 94.67 % OK on both strands and they agree(0)\par 
+\pard \li1500\ri1660\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth  0.67 % OK on plus strand only(1)\par 
+ 2.00 % OK on minus strand only(2)\par 
+ 2.67 % Bad on both strands(3)\par 
+ 0.00 % OK on both strands but they disagree(4)\par 
+\pard \li1500\ri1660\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth {\fs22 \par 
+}\pard \li1500\ri1660\sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth       3310       3320       3330       3340       3350\par 
+0000000000 0000000000 0000000000 0000000000 0000000000\par 
+\par 
+      3360       3370       3380       3390       3400\par 
+0020000000 0000000032 0000032000 0000000000 0300000030\par 
+\par 
+      3410       3420       3430       3440       3450\par 
+\pard \li1500\ri1660\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth 0000000000 0010000000 0000000000 0000000000 0000000000\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4 6\tab Listed output from "Examine Quality" showing the results for the section of contig displayed in figure 4.8.\par 
+\pard \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.10\tab Using graphical displays to examine contigs\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The programs contain three graphical displays to aid the examination of contigs. The first simply gives an overview of all the contigs in the database and provides, with the use of a 
+crosshair, a mechanism for the other two displays to select contigs. One of these displays produces a schematic representation of each of the readings in a contig. The lines in the display show the relative positions of each reading and also their sense. T
+he plot is divided vertically into two sections by a line that is identified by an asterisk drawn at each end. All lines that lie above this line represent readings that are in their original sense, all lines below show readings that are in the complementa
+ry sense. The final graphical display is of the "quality" of the data as described above.\par 
+\pard \s4\qj\sa120\sl280 
+When these graphical displays are visible users may employ a crosshair, moved by mouse or keyboard commands, to examine the data in more detail. The crosshair is positioned and when keyboard characters S, Q, N or Z are typed the program will show the local
+ aligned sequences in a text window, produce the quality plot, give the names of the nearest readings or zoom into the display. \par 
+\pard \s4\qj\sa120\sl280 A typical display of all three plots 
+is shown in figure 4.7. The top rectangle shows a separate line for each of the projects contigs. The righthand one is bisected by a vertical line indicating that it has been selected by the user. The next rectangle below is divided by a horizontal line ma
+rked at each end by an asterisk. Each of the other horizontal lines in the box represents one of the selected contigs gel readings. Those above the dividing line are in their original orientation, those below have been complemented. The box below is also d
+ivided by a horizontal line and shows the "quality" for each base in the contig. Rectangluar areas marked above the central line show sections that only have a good consensus on the minus strand, and rectangles below show good sections from the other stran
+d. Places where the vertical lines reach the top and bottom of the box show disagreements between the two strands. Places with only the midline have a good consensus on both strands.\par 
+\pard\plain \li80\sl220\keepn\tx720 \f4\fs16 {{\pict\macpict\picw441\pich231 
+237effffffff00e601b81101a0008201000affffffff00e601b8090000000000000000310000000000e501b79800780000000001f103bb0000000001f103bb0000000000e501b70001028900028900028900028900028900090100158e550054ff000901000f8eff00f0ff000d0100089b000008f5000010ff000d0100089b
+000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b00
+0008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b0000
+08f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff005d14000801ffc0003ff803ff801ffc00007fff003fff80fb00003ffdff0ef000007fffe0
+003ffc03ffff00001ffdff00c0fd00003ffdff03f000007ff8ff04f003fffffefe00003ffcff00f0fc00000ffdff00fcfd000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff
+000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff00
+0d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d
+0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff005813000fff007fffe00ffe00fff007ffffc001ffe000faff00e0fd000e1fffffc0003fffe007fe0001fffff0fd00007ffdff00e0fd00031fffffc0f800041ffe000003feff00e0fc00001ffcff00f8fd000007f0ff00
+f0ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010
+ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff
+000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000d0100089b000008f5000010ff000901000f8eff00f8ff0009010008
+8e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0009
+0100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0018010008ba00007ffaff038000003ffaff00
+f8e9000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00
+090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001d010008c40000
+1ffbff00e0f7000007faff00fcef000007f9ff00f0ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0009010008
+8e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001d
+010008e2000307fffffcd600000ffaff00feee000007faff01fe10ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010
+ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e
+000010ff00090100088e000010ff001f010008fc000001fbff00c0c000007ffaff00c0ee000007fbff02fe0010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00
+090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e0000
+10ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001a02000801f9ff00f8c000faff02c00000faff00f0eb000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e0000
+10ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100188e000030ff00090100188e000030ff00090100188e000030ff00090100188e00023000000b020718
+e09000030e31c0000b020718e09000030e31c0000b020789e09000030f13c0000a0100ff8f000101feff00090100188e000030ff000901003f8eff02f800000b0201db8090000303b700000b020799e09000030f33c0000b020718e09000030e31c0000b020618609000030c30c000090100188e000030ff000901003c8e00
+0078ff00090100188e000030ff00090100188e000030ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100
+088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001d01000ffaff00c0d5000007f0ff00e0f3000001f0ff00e0f6000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00
+090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e0000
+10ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001c02000803fbff00f8c400007ff0ff0200000ff9ff0080f1000010ff00090100088e000010ff00090100088e000010ff00090100088e
+000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff000901
+00088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001f010008f800f1ff00c0d500007ffbff00c0ed00003ffbff00c0f8000010ff00090100088e000010ff00090100088e0000
+10ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0009010008
+8e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff001f010008e900007ff3ffe2000007fbff00fcea00007ffaff00f0fc000010ff
+00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e00
+0010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0019010008dc000001f2ff00feef
+000003faff0080df000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0009010008
+8e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff0019
+010008db00007ffbff00f0de000003faff00c0e8000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff000901
+00088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff00090100088e000010ff
+000901000f8eff00f0ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001
+f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce
+000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f800
+0008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff00190100
+08f8000008ce000001f3000010fc000004e1000010ff004203000fffb0fa00034800027fefff03fc02065fe7ff03f3800001fe00133fffffc201880000105177000001042408006002fe0001425ff4ff00fcfb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe001320
+00244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410
+ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe0013200024420188
+0000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff00520300
+0873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177
+000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa00
+0348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe001320002442018800001051770000010424
+08006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240
+ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe
+00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402
+065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe
+000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc0002
+6c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe000301000040fe0003e0000004fb0002400410ff005203000873b0fa000348000240ef00030402065cfc00026c00c0ef000313800001fe00132000244201880000105177000001042408006002fe00014250fe0003010000
+40fe0003e0000004fb0002400410ff000901000f8eff00f0ff005703000873b8fc00052800c8000240ef00030402067cfc00026c00c0ef000313800001fe00136000244201880800105177000001043408086002fe000163f0fe000301000040fe0006e0000004000010fe0002400410ff004c03000852b8fc00022800c8e9
+000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe00
+00e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc0012400024000188
+08001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc0002
+2800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe00030100
+0040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc00124000
+2400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852
+b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe
+000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc
+001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c
+03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd00
+0163f0fe000301000040fe0000e0fc000010fe0002400410ff004c03000852b8fc00022800c8e9000020fc00026c0040ef00010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc000010fe0002400410ff004a03000852b8fc00022fffc8e9000020fc00026c0040ef00
+010180fc001240002400018808001041770000010434000820fd000163f0fe000301000040fe0000e0fc00001ffcff00f0ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010
+ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1
+000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc00
+0004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff0019010008f8000008ce000001f30000
+10fc000004e1000010ff0019010008f8000008ce000001f3000010fc000004e1000010ff000901000f8eff00f0ff00028900028900028900028900028900028900a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.7\tab A typical graphical display from XBAP or SAP.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \par 
+2.11\tab Disassembling contigs\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Sometimes it is necessary to drastically alter contigs. We may need to break a contig in two, remove a single reading, remove a whole set of consecutive readings from a contig, or remove a set of readings from the database independent of which contigs they
+ are in. \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.11.1\tab Removing a single reading\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function is found in the "Alter relationships" menu. The user types in the number of the reading to be removed. If the reading is required to hold the contig together - i.e. is the only one cove
+ring a particular region - the program will create an extra contig consisting of the data to the right of the removed reading. The original contig will be shortened accordingly.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.11.2\tab Removing a set of readings\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function is called "Disassemble readings" and can remove any group of readings from a database. It works in two modes\:
+ 1. A set of adjacent readings in a contig can be removed by the user naming the two end ones (the left one first); 2. A set of readings from any number of contigs can be remove
+d by the user giving the name of a file that contains their names. In both modes the program cleans up the database by moving data to fill up any holes made in the files.\par 
+
+ For both modes of operation the program request a file of file names. If the user creates their own file (i.e. mode 2) each reading name must be on a separate line of the file. For mode 1 the user names the leftmost then the rightmost reading for removal.
+ They MUST be in left to right order. They and all intervening readings will be removed. For both modes, if necessary, new contigs will be created.  \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.11.3\tab Breaking a contig\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This function is found in the "Alter relationships" menu. It can be used to break a contig at the beginning of a particular reading so that the identified reading becomes the left end of a new contig. The user types in the number of the reading that will b
+ecome the left end.\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.12\tab Shuffling pads\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 One weakness of the assembly routine is that padding characters introduced to line up the readings are not always aligned with the pads in other sequences\:
+ a single problem such as a compression can give rise to pads apparently randomly arranged in the different readings covering the region. This function attempts to shuffle the pads around so that they align with one another, h
+ence simplifying editing. No information is lost in the process\: only the positions of padding characters are changed. The function is best used prior to editing.\par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 2.13\tab Displaying a contig\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The "Display a contig" option shows the aligned readings for any par
+t of a contig. Users select "Display a contig", then select the contig. The number, name and strandedness of each reading is shown and the consensus is written below.  A typical example, showing part of a contig from positions 3301 to 3450, is seen in figu
+re 4.8. Overlapping this region are readings 3, 40, 8, 37, 35 and 2, with archive names L3.SEQ, A21A7.S1 and so on. Readings 3, 8, 35 and 2 are in reverse orientation as indicated by the minus signs. There are a few padding characters in the working versio
+ns, but the consensus (shown below each page width) has a definite assignment for every position except 3376. \par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 2.14\tab Highlighting differences between readings and the consensus\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+During the latter stages of a project this option is used to highlight disagreements between individual gel readings and their consensus sequences. Typical output is seen in the figure 4.9 which shows the result for the section of contig shown in figure 4.
+8. Characters that agree with the consensus are shown as + symbols for the plus 
+strand and - for the minus strand. Characters that disagree with the consensus are left unchanged and so stand out clearly. Note that a similar display is now more conveniently available within the contig editor.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Set the consensus cutoff score.\par 
+2.\tab Redirect output to disk.\par 
+3.\tab Display the contig.\par 
+4.\tab Close the redirection file.\par 
+5.\tab Select "Highlight disagreements".\par 
+6.\tab Define the name of the redirection file.\par 
+7.\tab Define an output file name.\par 
+8.\tab Select a symbol for good plus strand data.\par 
+9.\tab Select a symbol for good minus strand data.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \page \par 
+\pard\plain \li760\ri760\sl220\box\brsp100\brdrth \tqr\tx8240 \f4\fs16 10.\tab Print the file.{\plain \f20 \par 
+}\pard \li760\ri760\sl220\box\brsp100\brdrth \tqr\tx8240 \tab 3310      3320      3330      3340      3350\par 
+\pard \li760\ri760\sl220\box\brsp100\brdrth -3\tab L3.SEQ \tab atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+40\tab A21A7.S1\tab atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+-8\tab A16A2.S1\tab atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+37\tab A21A2.S1\tab atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+\tab CONSENSUS\tab atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+\par 
+\tab 3360      3370      3380      3390      3400\par 
+-3\tab L3.SEQ\tab gatctgaccaagcgacag*tttaaa*gtgctgcttgccatt*ctgcgt*a\par 
+40\tab A21A7.S1\tab gatctgaccaagcgacag*gttaaagttgctgctt\par 
+-8\tab A16A2.S1\tab gatctgaccaagcgacag*tttaaa*gtgctgcttgccatt*ctgcgt*a\par 
+37\tab A21A2.S1\tab ga-ctgaccaagcgacag*tttaaa*gtgctgcttgccatt*ctgcgt*a\par 
+35\tab A16D12.S1\tab gttttaaa-gtgctgcttgccatttctgcgtaa\par 
+-2\tab L2.SEQ\tab t*ctgcgt*a\par 
+\tab CONSENSUS\tab gatctgaccaagcgacag*tttaaa-gtgctgcttgccatt*ctgcgt*a\par 
+\par 
+\tab 3410      3420      3430      3440      3450\par 
+-3\tab L3.SEQ\tab aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+-8\tab A16A2.S1\tab aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+37\tab A21A2.S1\tab aaacctatgggtgggaataaaccaatggacagaatcaccgattctcaact\par 
+35\tab A16D12.S1\tab aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+-2\tab L2.SEQ\tab aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+\pard \li760\ri760\sl220\box\brsp100\brdrth   \tab CONSENSUS\tab aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.8\tab Typical output from "Display contig".\par 
+\pard\plain \li840\ri940\sb320\sl220\box\brsp100\brdrth \f4\fs16                        3310      3320      3330      3340      3350\par 
+\pard \li840\ri940\sl220\box\brsp100\brdrth  -3  L3.SEQ      --------------------------------------------------\par 
+ 40  A21A7.S1    ++++++++++++++++++++++++++++++++++++++++++++++++++\par 
+ -8  A16A2.S1    --------------------------------------------------\par 
+ 37  A21A2.S1    ++++++++++++++++++++++++++++++++++++++++++++++++++\par 
+                 atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+\par 
+                       3360      3370      3380      3390      3400\par 
+ -3  L3.SEQ      -------------------------*------------------------\par 
+ 40  A21A7.S1    +++++++++++++++++++g+++++gt++++++++\par 
+ -8  A16A2.S1    -------------------------*------------------------\par 
+ 37  A21A2.S1    ++-++++++++++++++++++++++*++++++++++++++++++++++++\par 
+-35  A16D12.S1                    -t----------------------t------a-\par 
+ -2  L2.SEQ                                              ----------\par 
+                 gatctgaccaagcgacag*tttaaa-gtgctgcttgccatt*ctgcgt*a\par 
+\par 
+                       3410      3420      3430      3440      3450\par 
+ -3  L3.SEQ      --------------------------------------------------\par 
+ -8  A16A2.S1    --------------------------------------------------\par 
+ 37  A21A2.S1    ++++++++++++g+++++++++++++++++++++++++++++++++++++\par 
+-35  A16D12.S1   --------------------------------------------------\par 
+ -2  L2.SEQ      --------------------------------------------------\par 
+\pard \li840\ri940\sl220\keepn\box\brsp100\brdrth                  aaacctatgggt*ggaataaaccaatggacagaatcaccgattctcaact\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 \par 
+\pard \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 Figure 4.9\tab Typical output from "Highlight disagreements", showing the results for the section of contig displayed in figure 4.8.\par 
+\pard \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.15\tab Screen editing contigs in SAP\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 When using SAP the best way for users to edit a whole contig interactively is to use their prefered external editor on the standard display of a contig. When the screen edit function is selected SAP writ
+es a text file containing a display of the contig and passes it to an external editor - say EDT on the VAX or emacs on a UNIX system. The user modifies the file using the editor and when the editor is exited SAP moves the changed contig back into the proje
+ct database.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Screen edit".\par 
+2.\tab Select the contig to edit.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define a temporary file for use by the editor. After a slight pause the editor will start and the first page of the contig will appear on the screen.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Edit the contig using the editors standard commands.\par 
+5.\tab Exit from the editor.\par 
+6.\tab Accept "Put contig back into the database".\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.16\tab Automatic editing of contigs in SAP\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This function automatically changes characters in gel readings to make them agree with the consensus sequence. At first sight this may seem like an unethical procedure but as is explained in the notes it is quite legitimate and saves a great deal of time. 
+In figure 4.10 we show the effect on using autoedit on the section of contig displayed in figure 4.8. All changed characte
+rs (for example position 3369, reading A21A7.S1) are denoted by uppercase letters. Note that apart from position 3375 which has an unresolved consensus all other changes have been made. These edits were made using a combined consensus for both strands, but
+ the standard version of the program treats each strand separately and will only make a change if the consensus for the two strands agree.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Redirect output to disk.\par 
+2.\tab Select "Display contig".\par 
+3.\tab Identify the contig to edit/display.\par 
+4.\tab Close the redirection file.\par 
+5.\tab Print the file containing the displayed contig.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Check the contig and the original films and annotate the printout to indicate the required edits.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Set the cutoff for the consensus calculation.\par 
+8.\tab Select "Auto edit".\par 
+9.\tab Identify the contig and the section to edit. \par 
+10.\tab The program will display a summary of changes made.\par 
+11.\tab Display the contig and compare it with the annotated printout.\par 
+12.\tab Use another editing method to finish the editing.\par 
+\pard\plain \li820\ri960\sl220\pagebb\box\brsp100\brdrth \f4\fs16                        3310      3320      3330      3340      3350\par 
+\pard \li820\ri960\sl220\box\brsp100\brdrth  -3  L3.SEQ      atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+ 40  A21A7.S1    atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+ -8  A16A2.S1    atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+ 37  A21A2.S1    atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+     CONSENSUS   atggttacgccagactatcaaatatgctgcttgaggcttattcgggcgca\par 
+\par 
+                       3360      3370      3380      3390      3400\par 
+ -3  L3.SEQ      gatctgaccaagcgacagtttaaa*gtgctgcttgccattctgcgtaaaa\par 
+ 40  A21A7.S1    gatctgaccaagcgacagTttaaagGtgctg\par 
+ -8  A16A2.S1    gatctgaccaagcgacagtttaaa*gtgctgcttgccattctgcgtaaaa\par 
+ 37  A21A2.S1    gaTctgaccaagcgacagtttaaa*gtgctgcttgccattctgcgtaaaa\par 
+-35  A16D12.S1                    gtttaaa-gtgctgcttgccattctgcgtaaaa\par 
+ -2  L2.SEQ                                             tctgcgtaaaa\par 
+     CONSENSUS   gatctgaccaagcgacagtttaaa-gtgctgcttgccattctgcgtaaaa\par 
+\par 
+                       3410      3420      3430      3440      3450\par 
+ -3  L3.SEQ      cctatgggtggaataaaccaatggacagaatcaccgattctcaacttag\par 
+ -8  A16A2.S1    cctatgggtggaataaaccaatggacagaatcaccgattctcaacttagc\par 
+ 37  A21A2.S1    cctatgggtggaataaaccaatggacagaatcaccgattctcaacttagc\par 
+-35  A16D12.S1   cctatgggtggaataaaccaatggacagaatcaccgattctcaacttagc\par 
+ -2  L2.SEQ      cctatgggtggaataaaccaatggacagaatcaccgattctcaacttagc\par 
+\pard \li820\ri960\sl220\keepn\box\brsp100\brdrth      CONSENSUS   cctatgggtggaataaaccaatggacagaatcaccgattctcaacttagc{\fs22 \par 
+}\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 4.10\tab The result of applying the "Auto editor" to the section of contig displayed in figure 4.5.\par 
+\pard\plain \s6\sb400\sa60\sl280\tx560\tx860 \b\f20 2.17\tab Using the original editor in SAP\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This simple editor can insert, delete
+ and change gel reading sequences by performing one selected operation at a time. It is used during the interactive entry of new readings and interactive joining of contigs. The commands request the position at which the edit is required and the number of 
+characters to insert, delete or change.\par 
+\pard\plain \s5\sb400\sa160\sl320\tx560 \b\f20\fs28 3. NOTES\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+As each reading is entered into a project database it is given a unique number. The first is numbered 1, the second 2 and so on. Their original file names (known as "archives" because they are kept outsid
+e the database and never edited) are also copied into the database. During assembly contigs are constantly being changed and reordered so the program identifies them by the numbers or names of the readings they contain. Whenever the program asks users to i
+dentify a contig or reading they can type its number or its archive name. If they type its archive name they must precede the name by a slash "/" symbol to denote that it is a name rather than a number. For example if the archive name is fred.gel with numb
+er 99, users should type /fred.gel or 99 when asked to identify the contig. Generally, when it asks for the reading to be identified, the program will offer the user a default name, and if the user types only return, that contig will be accessed. When a da
+tabase is opened the default contig will be the longest one, but if another is accessed, it will subsequently become the current default. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab An XBAP database is made from five separate files\: the "archive names" file *.ARN, the "relationships" file *.RLN,
+ the "sequences" file *.SQN, the "tag" file *.TGN, and the "comments" file *.CCN. If the database is called FRED then version 0 of database FRED comprises files FRED.AR0, FRED.RL0, FRED.SQ0, FRED.TG0 and FRED.CC0. The version is the last symbol in the file
+ names. If the "copy database" option is used it will ask the user to define a new "version".  The normal strategy is to use version 0 for all work and to use other versions as backups. Program SAP uses databases formed from only the first three of these f
+iles. Normally the program is used to handle DNA sequences but many of the functions also work on protein sequences. The choice of sequence type is made when the database is started.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab The  vector  sequence  should  be stored in a simple text file   with up to 80 characters of data per line.  \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Almost all readings are assembled automatically in their first pass through the assembly routine. Those that are not can be dealt with in two ways. Either they can be put through assembly again as single named rea
+dings (Users should type n when asked "Use file of file names"), with the parameters set to allow the reading in. Or they can be entered through the assembly routine using the "Put all readings in new contigs" mode, and then joined to the contig they overl
+ap using the Contig Joining Editor. If it is found that readings are not being assembled in their first pass through the assembler, then it is likely that the contigs require some editing to improve the consensus. Also it may be that poor quality data is b
+eing used, possibly by users overinterpretting films or traces. In the long term it can be more efficient to stop reading early and save time on editing. For those using fluorescent sequencing machines the unused data can be incorporated after assembly.
+\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Obviously we cannot use a script to operate a program that expects to be controlled by mouse clicks! The program BAP is an xterm version of XBAP which can be used from a script.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab There is a remote possibility of a join being missed by the "Find internal
+ joins" routine. If a small contig is wholly contained within a larger one, such that its ends are further than ("Probe length" - "Minimum initial match length") from the ends of the larger contig, and the consensus for the small contig lies to the left of
+ the consensus for large contig, the overlap will not be discovered. (See the search strategy).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab For those using fluorescent sequencing machines and XBAP the combination of the contig editor and the graphical displays of consensus "quality" will probably
+ be sufficient for checking and editing contigs as everything can be done at the computer screen. For those using autoradiographs the facility to produce printouts of "display" and "highlight disagreements" options for use while checking films, and the aut
+oedit command are most appropriate.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab 
+In general the quality of a reading deteriorates along the length of the gel and so it is also possible to use a length cutoff for the quality calculation. Only the data from the first section of each reading will be included in the calculation.  \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab 
+There are some limitations on the changes that can be made to the contigs when using the SAP screen editor. Alignments must be maintained during editing. Whole lines of sequence should not be deleted or added unless the order of the gel readings in the 
+contig is preserved. Each line in the contig display consists of gel reading numbers, their names and 50 character sections of sequence. Insertions are limited in the following way. No line of sequence can be extended rightwa
+rds more than 5 characters beyond the end of a full length line (a full length line is 50 characters long). Only one character can be added to the left end of full length lines, but sections of sequence beginning further into a line can be extended leftwar
+ds up to an equivalent position. Do not delete any non-sequence lines in the file. Before returning the contig to the database the program checks that the rules have been obeyed. If an error is found the number of the erroneous line in the file is displaye
+d and the contig will not be changed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab 
+The following is a justification for using the auto edit function. The general strategy employed when collecting shotgun sequence data is to keep sequencing until the redundancy in the contigs is fairly high, and then to get a printout of a contig, che
+ck problems against the films, note corrections on the printout, and make the changes using an interactive editor. In general the consensus is correct except for places where padding characters have been used to accomm
+odate a single gel with an extra character, or where the consensus is dash. The important point for the auto editor is that most edits simply make the gel readings conform to the consensus, or remove columns of pads.  The auto editor does the following. 1)
+ calculates a consensus for the contig (or part of a contig) to be edited, and then uses this consensus to direct the editing of the contig in 3 stages 2) stage 1\:
+ find and correct all places where, if the order of two adjacent characters is swapped, they will both agree with the consensus (given that they did not match the consensus before). These corrections are termed "transpositions" 3) stage 2\:
+ find and correct all places where there is a definite consensus but the gel reading has a different character. These corrections are termed "changes". 4) stage 3\:
+ delete all positions in which the consensus is a padding character. These corrections are termed "deletions". All changed characters are shown in uppercase letters so it will be obvious which characters
+ have been assigned by the program (except for deletions). The number of each type of correction will be displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 11.\tab 
+The "calculate consensus" function, the "display contig" routine, the contig editor and the "show quality" option use the rules outlined here to calculate a consensus from aligned gel readings. The consensus sequence can contain any of 6 possble symbols\:
+ a,c,g,t,* and -. The last symbols is assigned if none of the others makes up a sufficient proportion of the aligned characters at any pos
+ition in the contig. The following calculation is used to decide which symbol to place in the consensus at each position. Each uncertainty code contributes a score to one of a,c,g,t,* and also to the total at each point. Symbols like r and y which don't co
+rrespond to a single base type contribute only to the total at each point. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab Definite assignments i.e. A,C,G,T,a,c,g,t,b,d,h,v,k,l,m,n,a,c,g,t,* =1 probable assignments i.e. 1,2,3,4 = 0.75 other uncertainty codes including r,y,5,6,7,8,- = 0.1 A cutoff scor
+e between 51 and 100% is set by the user. (When the program starts this is set to 75%.). At each position in the contig we calculate the total score for each of the 5 symbols a,c,g,t and * (denote these by Xi, where i=a,c,g,t or *), and also the sum of the
+se totals (denote this by S). Then if 100 Xi / S > the cutoff for any i, symbol i is placed in the consensus; otherwise - is assigned. For the "examine quality" algorithm each strand is treated separately but the calculation is the same. \par 
+12.\tab Databases can
+ become corrupted if the machine crashes so the programs contain a function "Check database for logical consistency" which checks to see if all the relational data is internally consistent. Some routines automatically perform this check before they start. 
+ Users are advised to make frequent copies of their databases using the "Copy database" option. Note that if BAP is used in "execute with dialogue" mode the "Check logical consistency" function also creates a consensus for the whole database and scans it t
+o find any regions which contain 15 dashes in 20 characters. Such a finding would indicate problems with the database.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\pagebb\tx560 13.\tab 
+We have covered many of the most important or complicated operations peformed by SAP and XBAP, but several others have not been mentioned. These include those for creation of consensus sequence files for processing by other programs, and complementing 
+contigs, both of which are trivial. There is also a set of routines for fixing corrupted databases.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 14\tab The VAX version of SAP will only a
+llow one person to access a sequencing database at a time - producing an "unable to open database" error message if a second person tries. On UNIX machines there is no such check in program SAP so users need to make sure that simultaneous use does not occu
+r. Otherwise the data will be corrupted. Program BAP prevents more than one person from using a database at any time. It does so using the following mechanism. When a user requests to open a particular copy (say 0) of a database (say DB) the program checks
+ for the existence of a file named DB_BUSY0 in the current directory. In normal circumstances, if the file exists, it indicates that somebody else is currently using the database and the program displays the message "Sorry database busy" and does not open 
+the files. If the file does not exist the program creates it and opens the database. When a user stops using the database (usually by quitting the program) the "busy file" is deleted, hence allowing others to use the database. If the program terminates abn
+ormally the busy file will not be deleted and so the database will not be useable until the busy file is explicitly deleted using the rm command. Obviously it is dangerous to delete the file before checking if another user is using the database.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 15\tab 
+After a run of the assembly routine, reading names can appear in the file of failed reading names for the following reasons. 1. The reading file was not found; 2. the reading file was too short (less than the minimum match length); 3. the reading appear
+ed to matc
+h somewhere but failed to align sufficiently well (too many padding characters or too high a percentage mismatch); 4. a reading of the same name was already present in the database; 5. the reading was entered but also appeared to match another contig and t
+he join was not made. This can occur for two reasons\: a. because the overlap between the two contigs was too large, or b. because after the reading is entered into one contig a new consensus is calculated and compared to the other contig\:
+ it may then not match as well as it did originally, and the join will not be made.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 16\tab 
+We have recently devised our own file format (called SCF) for storing traces, sequences and confidence values for data produced by automated sequence readers (Dear and Staden, 1992). For ABI data these typically reduce the storage required to 30% of the
+ original. Data from the ABI 373A and the Pharmacia A.L.F. can be converted to this form using the program makeSCF. Note that A.L.F. files must first be processed by program alfsplit which s
+plits the original data into one file per reading. Sequences can be extracted from SCF files in a form suitable for assembly by use of the program trace2seq. To locate and mark regions of a sequence from an automated sequence reader that are of too low a q
+uality to be used for assembly we use the script clip-seqs. This script takes as input a file of reading file names. For each reading it renames the original file "original-filename~" and writes a new file called "original-filename" in which the poor quali
+ty regions are marked.\par 
+\pard\plain \qj \f4\fs16 {\plain \f20 \par 
+}\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 17\tab The oligo selection engine is the one used in the program OSP. It is described in some detail in\:
+ Hillier, L., and Green, P. (1991).  The parameters controlling the selection of oligos can be changed in the "Oligo Selection Parameters" window. The weigh
+ts controlling the scoring of selected oligos can be changed in the "Oligo Selection Weights" window.  By default, the oligos are selected from a window that extends 40 bases either side of the cursor. The size and location of this 
+window relative to the cursor position can be changed in the "Parameters" window.  In XBAP oligos are ranked according to their proximity to the cursor position, rather than by their scores.   \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 18\tab For simplicity, each reading is considered to represent a template. In practise, many readings can be made off the same template. Suitable templates that are identified are those that\:
+     1. are in the appropriate sense,     2. have 5' ends that start upstream of the oligo, and 3. are sufficiently close to the o
+ligo to be useful.  This last criterion relates to the insert size for the subclones used for sequencing and the average reading length. A template is considered useful if a full reading can be made from it, taking into account both of these factors. The d
+efault insert size is 1000 bases, and the default average reading length is 400 bases. These values can be changed in the "Parameters" window. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R. 1982. Automation of the computer handling of gel reading data produced by the shotgun method of DNA sequencing. {\i Nucl. Acids Res}. {\b 10 }(15)\:4731-4751.\par 
+2.\tab Staden, R. 1990. An improved sequence handling package that runs on the Apple Macintosh. Comput. {\i Applic. Biosci}. {\b 4}, 387-393.\par 
+3.\tab Dear S and Staden,R. 1991.  A sequence assembly and editing for efficient management of large projects. {\i Nucl. Acids Res}. {\b 19}, 3907-3911.\par 
+4.\tab Hillier, L., and Green, P. 1991. "OSP\: an oligonucleotide selection program," PCR Methods and Applications, {\b 1}\:124-128. \par 
+5.\tab Dear S and Staden, R. 1992. A standard file format for data from DNA sequencing instruments. DNA Sequence, {\b 3}, 107-110.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 5.  Analysing Sequences to Find Genes\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1720 2.1\tab The uneven positional base frequencies method.\par 
+2.2\tab The positional base preferences method\par 
+2.3\tab The codon usage method\par 
+2.4\tab Searching for open reading frames\par 
+2.5\tab Searching for tRNA genes\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 We outline three methods for finding protein genes and one for locating tRNA genes, plus routines for finding open reading frames and displaying the p
+ositions of stop codons. All the methods are contained in the program NIP. The correct interpretation of the analyses presented requires a good understanding of the underlying ideas used by the methods. Despite this we concentrate here on the use of the te
+chniques and refer the reader to earlier publications (1-5) for more background information. \par 
+\pard \s4\qj\sa120\sl280 The assumption made by the methods for finding protein genes is that protein coding regions, when analysed in terms of 3 letter nonoverlapping "words", will look 
+different to noncoding regions analysed in the same way. Suppose we analyse a sequence in one reading frame and count its codons. Then we define the "positional base composition" as the frequency at which each of the four base types occupies each of the th
+ree positions in codons. In coding regions the positional base frequencies will be less random than they are in noncoding regions. This is the basis of method 1\:
+ the "Uneven positional base frequencies method". If this reading frame is coding for a protein
+ the positional base composition will tend towards a particular bias which is common to the majority of genes. This is the basis of method 2 the "Positional base preferences method". If the sequence has a very biased base composition then in protein genes 
+this may effect the choice of amino acids, and will effect the use of bases in the third positions of codons. This bias is also utilised by the positional base preferences method. Finally if the reading frame is coding for a protein its use of codons is al
+so likely to be nonrandom and this is the basis of method 3, the "Codon usage method".\par 
+\pard \s4\qj\sa120\sl280 
+All the methods perform their analyses over segments of the sequence of size "window", and then move the window on by three bases and repeat the calculation. The "Uneven positional base frequencies" method only produces a single value for each segment and 
+hence cannot distinguish between frames or strand - it only measures the probability that a region is coding and nothing more. The other two methods produce different va
+lues for each of the three potential reading frames and hence can help to decide which is coding. Their results are plotted in three separate boxes arranged one above the other. For these we also indicate which of the three reading frames is the highest sc
+oring at each position along the sequence. This is done by plotting a single dot at the mid-height of the box that contains the highest score, so that if one frame is the highest scoring for many consecutive positions, the dots will produce a solid line at
+ the mid-height of its box. We also mark the positions of stop codons. These are represented by short vertical lines and are positioned so that they bisect the mid-height of each box. Start codons are marked at the base of the box for each reading frame.
+\par 
+\pard \s4\qj\sa120\sl280 The search for tRNA genes involves looking for segments that could fold into the cloverleaf structure and which have the expected conserved bases in the appropriate positions.\par 
+\pard \s4\qj\sa120\sl280 Notice that we have not mentioned searches for relevent "signals" like promoters 
+or splice junctions which are also useful for finding genes. These searches are described in the chapter on searching for motifs. In the current chapter the only "signal" we include is the stop codon. However as all results are presented graphically it is 
+easy for users to overlay the displays of signal searches with those presented here and so effectively combine them.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab The uneven positional base frequencies method.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method produces a single value for each segment of the sequence, and wou
+ld give the same result if applied to each reading frame or to the complementary strand. The results are plotted in a box that is cut by a horizontal line. This line is labelled 76% and we expect 76% of noncoding sequences to score below this line and 76% 
+of coding sequence to score above it. Of the methods described this one makes the fewest assumptions and so is a good unbiased indicator of the probability that a sequence is coding.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Uneven positional base frequencies".\par 
+2.\tab Define "Odd window length". \par 
+3.\tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 5.1. In the example shown the 5' end of the sequence codes for several proteins and the 3' end codes for ribosomal RNAs.\par 
+\pard\plain \li100\sb300\sl160\keepn \f4\fs16 {\plain \fs16 {\pict\macpict\picw436\pich41 
+1103ffffffff002801b31101a00082a0008c01000affffffff002801b3070000000022000100010000a000a0a100a400020de801000a00000000000000000700010001220027000100da23000021000101b22300002300262300002100270001230000a000a301000affffffff002801b32300da21000101b2230026210027
+0001a000a12000170001001701b2220025000100df2300032300062301002300fb2300fd2300022300fe2302032300ff2300002300fe2300fd2301002300032300002300fd2302022300042300002300052300002301fd2300002300032300002300012302fd2300fd2300002300fd2300ff2301fe23000023000023000223
+00062302012300fc2300032300012300002301052300062300fa2300f82302ff2300fb2300002300002300002301002300002300002300002300002302002300002300032300052300012301092300ff2300042300022300002302042300fa2300fc2300fe2300022301002300fd2300002300002302032300002300fe2300
+ff2300012300002300022301012300fc2300062300012300032302ff2300002300032300fe2300022301082300f92300fd2300032302022300032300fb2300fa2300002302ff2300fe2300002300fc2300fe2301f92300fb230000230000230000230200230000230000230000230000230100230000230000230000230003
+2302fd2300002300002300002301002300002300002300002300002302002300002300002300002300002301002300002300002300032300032302082300fa23000723000223000423010b2300f82300fd2300fa2300fc2302fe2300fd2300fc2300fe23010023000023000023000023000023020023000023000023000023
+00002301002300002300002300002300002300022300fe2302002300002300002300002301002300002300002300002300002302022300002300fe2300002300052301002300fb2300002300062300fc2302032300012300002300fc2300032301002300012300ff2300012300032302022300fb2300fd2300ff2302fe2300
+032300ff2300042300ff2301fb2300002300002300002300002302002300002300022300032300fe2301002300052300012300032300ff2302042300052300042300002300082301fb2300fc2300fb2300fa2302ff2300fd2300002300fa2300012301ff2300042300ff2300032300012302fc2300012300032300fd230006
+2301032300032300032300022300062300fd2302fb2300032300f92300002300002301fb2300f72300022300002300002302002300fe2300ff2300fe2300002301002300052300fe2300ff2300002302fe2300002300002300022300072301fa2300032300ff2300042302ff2300fa23000023000323000323010023000623
+00fd2300032300fb2302032300ff2300012300fd2300052302042300fa2300fd2300ff2300002301f82300002300032300fd2300002302002300002300002300002301032300092300062300022300032302fd2300fa2300fe2300fd2300ff2301fd2300fb2300062300022300fe2302fc2300012300062300fc2300032301
+032300fe2300002300022300032302002300032300002300032300fe2300022301fe2300052300032300fe2300022302fa2300032300fa2300fe2300002301062300032300ff2300fe2302fd2300ff2300f22300022300fb2301022300fe230000230000230000230200230000230000230000230000230100230000230000
+2300002300002302002300002300002300002300022301062300012300032300002302fd2300022300062300042300fd2301002300fd2300032300fc2300fd2302fb2300002300fa2300022300002302fe2300002300002300002300002301002300002300002300002300052302042300022300002300fe23000323010323
+00002300002300022302fd2300002300fd2300fe2300ff2301032300062300012300032300f72300002300fa2302062300032300fd2300fd2301ff2300fe2300002300fc2300fe2302002300002300002300002300002301002300022300032300012300002302ff2300042300002300082300042301032300052300fa2300
+012300ff2302fe2300fc2300042300032301002300fd2300fa2300062300002302fd2300ff2300032300fa2300fe2301092300fc2300fe2300032300002302052300fa2300fe2300fc2300032301fd2300002300002300fa2300fe2302032300fd2300052300032301032300f82300032300fc2300072302062300fa230005
+2300002300032302fd2300fe2300f92300fe2300ff2301012300ff2300fb2300032300002300052302012300ff2300fd2300002300032301002300012300fa2300022300fb23020023000023000023000023000023010023000023000023000023000023020023000023000023000023000023010023000023000023000323
+02032300022300012300062300fd2301032300f92300012300ff2300032302fb2300ff2300012300fd2300032301fd2300022300002300032300042302fd2300062300032300fd2300002301ff2300042300032300002302ff2300f823000b2300fb2300f92301002300002300fe2300fd2300fd2302ff2300fe2300002300
+002300002301002300002300022300002300fe2302002300002300002300062300022302012300ff2300fd2300012300022300002301fd2300012300022300fe2300ff2302fd2300fe230000230000230000230100230000230000230000230000230200230000230000230000230100230000230000230000230000230200
+2300002300002300002300002301002300002300002300002300002302002300002300002300002300002301002300002300002300002300002302002300002300002300022301fe2300002300002300002300002302002300002300002300002300002301002300022300032300062300fd23020423000323000323000023
+00fc2301fe2300022300fb2300fc2300fe2302fd2300002300002300002301002300002300002300002300002302002300002300002300002300002300002300002301002300002300002300002300002302002300002300002300002302002300002300002300002300002301002300002300002300002300002302002300
+002300002300002300002301002300002300002300032300032302fa2300032300ff2300fe2300022301092300012300062300ff2302002300012300062300022300fb2301002300002300fd2300032300fd2302fd2300032300052300062300fd2301fe23000a2300fb2300032300fd230203230003230000230000230000
+2301fc2300fb2300fd2300f82302002300022300002300012300032301022300032300012300022300002302002300f82300082300fa2300012301002300052300002300fe2300002300042302fe2300fe2300ff2300032300032301002300fb2300072300002300fd23020423000023000023000023000023010023000023
+00002300002300002302002300002300002300fd2300ff2302042300002300002300fd2301022300fd2300fe2300fa2300032302002300002300002300fe2300022301002300032300032300f823000a2302012300fd2300002300032300002301ff2300fe2300002300002300032302002300002300fd2300032301002300
+fa2300032300002300ff2302032300012300f82300002300002301022300032300032300002300002302002300002300fa2300fe2300052301fd2300fd2300032300002300022300002302002300012300032300fd2300022301012300fd2300ff2300032300fb2302fe2300022300002300052300f8230101230000230002
+2300002300022302002300fb2300fd2300012301ff2300012300032300042300fb2302032300fb2300032300042300fc2302fa2300032300002300002300072301f92300022300012300fa23000a2302042300002300ff2300fd2300002301f52300012300fd23000223020023000023000123000223000323010523000023
+00032300012300002302002300002300002300f52300002301ff2300032300052300fe2300fe2302ff2300fd2300002300062300062301002300ff2300002300fd2302032300f92300ff2300032300fd2300002300fd2301042300ff2300fb2300002300022302002300fe2300032300ff2300012301022300fd2300032300
+032302fe2300072300fd2300012300002301002300fd2300062300002300fd2302fb2300002300ff2300fd2300002301042300fd2300ff2300012300022302002300fb2300fd2300052300042301fc2300012300ff2300012302fd2300032300fa2300062300022302002300fe2300022300fb2300022301fe230003230005
+2300fd2300032302022300fb2300fd2300002300042301fc2300012300082300032300002302002300002300ff2300002301012300002300002300fd2300fd2302002300022300fe2300052300f62301ff2300fe2300ff2300062300032300fe2300022302fb2300fd2300062300042301fe23000323000323000023000023
+02ff2300fb2300002300032300022301f92300fa2300fc2300fb2300002302ff2300072300002300ff2300002301fe2300062300022300092300fd2302002300022300fe2300032301022300002300012300002300fc2302fe2300062300fc2300042300002301002300002300002300002300ff2302fe2300032300fc2300
+012300032301002300002300ff2300002300002302fe2300002300ff2300042302fc2300042300002300ff2300fd2301012300ff2300042300002300002302002300002300002300002300002301002300002300002300fd2300002300022302012300fd2300022300fb2300fb2301032300fc23000b2300fd230000230204
+2300ff2300fe2300022300fd2301fe2300002300fd2300002300fd2302fe2300032300fc2300032300fe2301ff2300042300fc2300012302002300002300022300062300fd2301032300052300fd2300012300ff2302012300fd2300fe2300022300062301fc2300032300fb2300022300fc2302042300002300fb2300fa23
+00002301fb23000623000a2300012302022300fe2300032300fa2300022301fe2300fe2300ff2300fb23000a2302002300012300022300012300002302002300fd2300fd2300032300fa2301012300ff2300012300fd2300052302002300fb2300052300fb2300ff2300fe2301022300032300032300fe2300022302002300
+002300002300032300002301fd2300fe2300f72300f923000023020123000623000b2300002301002300fe2300002300fd2300022302fe23000b2300fa2300022300032301fd2300f92300072300042300002302002300ff2300012300fd2300fa2301002300012300042300032300fb2302fe2300fd2300032300022301fd
+2300052300042300ff2300012302002300002300002300fc2300042301fd2300022300fd2300fe2300fe2302fd2300032300002300fa2300052301032300002300002300fa2300012302032300052300022300002301fd2300fe2300fe2300002300022302fa2300fe2300fa2300032300022300fe2300032302fa23000623
+00ff2300032301fa2300032300032300012300ff2302fe2300052300052300f92300022301fb2300062300ff2300012300002302052300fb2300fc2300002300fb2301032300062300072300012300fa2302fd2300082300f82300fd2300002301012300072300fc2300042302fe2300fa2300f82300022300042301022300
+fe2300022300002300fe2302002300fd2300002300f92300fd2301092300002300092300012300022302002300032300002300032300002301002300f52300fa2300032302fd2300002300fc23000923000b2301f92300002300fd2300fa2300fd2302062300052300012300ff2300032300fb2301072300042300002300ff
+2300fe2302022300012300002300002300002302002300002300fc2300fea0008da00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 5.1.\tab Example output from the uneven positional base frequencies method. The 5' end codes for proteins and the 3' end contains ribosomal RNA genes.\par 
+\pard\plain \s6\sb360\sa60\sl280\tx560\tx860 \b\f20 2.2\tab The positional base preferences method\par 
+\pard\plain \s4\qj\sa120\sl260 \f20 As a result of the genetic code and the relative frequencies with which amino acids are used in proteins, DNA sequences codi
+ng for proteins have a particular bias in their positional base frequencies. This method scans DNA sequences and measures the closeness of each reading frame to this bias in their positional base frequencies. The closeness to the expected bias is expressed
+ as a \:
+"score". By default the program will use a "global" set of expected values for the positional base frequencies which are derived from average amino acid compositions in known proteins. Alternatively users may create their own set of expected values 
+by analysing known genes from the same genome. In addition users can combine the "global"  values for the first two positions in codons with third position values derived from other genes of the same genome.\par 
+\pard \s4\qj\sa80\sl260 
+In order to use a nonglobal standard, a codon table in the format described in the chapter on statistical analysis of nucleic acid sequences, can be created using the method "Creating a codon usage file". Alternatively a section of the sequence being analy
+sed can be scanned to produce an internal standard. The method is particularly useful for selecting which reading frame is coding.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.2.1\tab Using the global standard\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Positional base preferences method".\par 
+2.\tab Select "Standard source" as "Global".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "Window length". The default length of 67 should be used for most cases. Shorter windows give noisier plots and the longer the window the more chance there is of missing a short exon.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl260 \f20 The plot will appear as in figure 5.2. This shows a 10,000 base section of sequence tha
+t codes for several proteins in each of the three reading frames. See the introduction for an explanation of the plotting scheme used.\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb300\sa120\sl240\keepn\tx1140 \f21\fs20 {{\pict\macpict\picw447\pich225 
+0d7effffffff00e001be1101a0008201000affffffff00e001be090000000000000000310000000000df01bd98002400000000008d012000000000008d011f0000000000df01bd000102dd0006007fdfff00fc140040ed000e01f000e1ffffebffff87ff83d40004140040ed000e0110009200002a00008800425c00041400
+40ed000e2908009200002c00007800442a0004140040ed000e5a08008c0000140000400044220004170040f3000008fc00068608010c000010fd000324220004170078f3000008fc0002860501fe000010fd000328020004130040f3000008fc0002800701f9000328020004130040f3000014fc0002800101f90003300100
+04150040f30008140000100000800087f9000310010004150040f300081400001800010000a4f9000310010004130040f300081400002400010000e4f700018004130040f30008240000240001000018f700018004130040f30008220000240001000018f700018004130040f30008220000220002000018f7000180041300
+40f30008220000420002000008f700018004140040fa000002fb0005210000420002f400018004140040fa000002fb000541000042001cf400014004160040fa000003fd0007440041040042001cf400014004170040fb00011003fd0007cc00410c00410024f4000140041d1476befc5eafdbeff59adfb1e0d6ddbbc5ad0f
+e1bd24f600031000f7bc1d1476befc5eafdbeff59edfb1e0d6fdbbc5ff0fe1bd20f600031000f7bc1d1476befc5eafdbeff59affb1e0d6ffbbc5ff0fe1bd40f600031000f7bc1b1476befc5eafdbeff59fffb1e0d7ffbbffffdfffbdbff4ff01f7bc1a014008fd000e2288080a0000010380800299008180f4000120041901
+400efd000d2288100a00000102810000690081f30001200419016016fd000d5588100200000102810000650081f30001101419016012fd000d5588100100000100410000050081f30001101419016022fd000d9508100100000100410000050081f3000110141a026021b0fe000d8d08100100000600410000030081f30001
+102c1a136041c80000030808200100000600410000020041f300010c2c1a1350410e2800030806200100000a00210000020041f300010c6c1a135081015900020006200100000800210000020049f300010aec190e5081008700040005a0014200080022fd000036f300010304180e4880000700040005a000c200080022fd
+000012f2000004140e48800004c0240001a000c600080022ed000004140045fe000aa03400004000a600080012ed000004140045fe000aa048000040002602c8001aed000004140045fe000a1048000040002a03480014ed00000413007dfe00011080fd00041a04480014ed000004120043fe000011fc00041904280010ed
+000004100042fe000015fc0002016428eb000004100042fe00000dfc0002016430eb0000040f0042fe00000afc00010198ea0000042523400a00000a44013c4001109a0034842208e0400200808100020806088001c094080800042501400afe001e44013c4001109a0034842208e0400200808100020806088001c0940808
+000406007fdfff00fc0a0040fb00000ce60000040a0040fb00000ae60000040a0040fb000012e60000040a0040fb000011e60000040b0040fc00010191e60000040b0078fc000101a1e60000040b0040fc00010941e6000004100040fc0002094080fc000010ed000004100040fc00020a0080fc000010ed000004100040fc
+00020e0080fc000018ed000004100040fc00020e00c0fc00001ced000004100040fc00020e0040fc000024ed000004130040fc00020a0040fc000324000002f0000004130040fc0002080040fc000322000006f0000004130040fc0002100020fc000322000006f0000004130040fc0002100020fc000322000005f0000004
+130040fc0002100020fc000322000009f0000004140040fd000318100020fc000322000009f0000004170040fd000318100020fc0006220000090000c0f300000425235dea924fb4a5900076f67fdddb6f23effd311f5fe9f8769dc2bbc579fa7e5fd7e7f7fd7c25235dea924fb4a6600076f67fdddb6f63effd311f5fedf8
+769dc2bbc579fa7e5fd7e7f7fd7c25235debd24fb4a6600076f67fdddb6f63effd209f5fedf8769dc2bbc579fa7e5fd7e7f7fd7c25045debfa4fb4feff1bf6f67fdddb6f7feffd3f9f5feff8769dc2bbc579fa7e5fd7e7f7ff7c1b1440010800004020001000003000004100002080021af4000102041e1740020400004000
+001000002a00004100002080021a000004f7000102041e1760020700004000000800002e000040800020800419400004f7000165042017600205000080000008200022002040c0004080840140000af90003100055a4201760040100008000000a680041805280c0024040c400a4000bf9000310004d6420045004008001fe
+000f0eac0041825280c00340413800a6000bf900032800806c20045008008001fe000f079200418355802803804100002a0011f900032800803c21045008008001fe001002120081834d803402004100001a0030c0fa00032400801421044808008002fe001002118180458d003404004200001a0040c0fa00034400801421
+044808008002fc000e41005c8000030400420000110080a0fc000540004400800421044808004002fc000e6100548000029400240000010080a0fc0005a0004400800421044410004004fc0002220064fe000894002c000001010020fc0005a0004201000423044410004004fc0002220020fe001358001000000101002400
+0010000120008201000420044210002004fc000012fc000068fd000e0101002c00001000011000820100041f044210003014fc000012fc000060fc000d82003c00001800021000820200041f047a20000818fc00001efc000060fc000d82000200081c02021000820200041f044220000838fc000010fc000020fc000d8200
+0200142c020410110204000417044240000828f0000d44000200146205040a1101080004170442400008e8f0000d4400020022a315040a1101880004160342400005ef000d48000100e2a335040d2a01700004250642d00307440c06fe001910a040025000c00000040800340401018100c8a4456a21741304250643d00304
+440c06fe001910a040025000c00000040800340401fe010088dc45ac2074130406007fdfff00fc0a0043fe000008e30000040a0043fe000008e30000040a0043fe000008e30000040e044280000414fc000010e90000040e044280000494fc000018e900000410047a80000776fc0002188020eb00000410044280000756fc
+0002148030eb00000410044480000402fc0002278030eb00000410044440000802fc0002278048eb00000410044440000801fc0002264048eb00000410044440000801fc0002224048eb00000410044440000801fc0002224048eb00000410044440000801fc0002204048eb00000410044820001001fc0002205848eb0000
+0410044820002001fc0002405948eb0000041105482000400080fd0002402588eb0000041105482000400080fd0002c02588eb0000041105481000800080fd0002800588eb0000041205501000800040fe000301000684eb0000042523701fefe001cb3d2bffeb00020629f73b0ef1c60fef7ddff6f7dfe5f75e54fbacfd37
+34fc2523701fefe001cb3d2bffeb00020629f73b0ef1c61fef7ddff6f7dfe5f75e54fbacfd3734fc2523701fefe001cb3d2bffeb00020229f73b4ef1c62fef7ddff6f7dfe5f75e54fbacfd3734fc25237fffefffffcb3d2bffebfffffe29f73b4ef1c67fef7ddff6f7dfe5f75e54fbacfd373dfc1a05501001000040fe0003
+01000002fe000360000044f3000109b41a05600801000020fe000301000002fe000360000042f300010ab41a05600805000020fe000302000001fe000360000042f300010e141a05600806000020fe000302000001fe000390000042f3000116041a0560080800002cfe000302000001fe000398000082f3000110041a0540
+0908000014fe000012fe000681000088400082f3000110041a05400d08000014fe00001afe000682c00084c00081f3000110041a05400308000002fe00001afe000682400104c00081f3000110041a094002f00000020000082afe000682420103200101f3000120041a0940009000000300003426fe00069c250100200101
+f3000120041b09400080000001000054a4fe000664248100200201f400022020041f0040fc0003c0004364fe000c60288200204201000004000002fa00023040041e0040fc0002a00043fd000c4018820020a40100000c000102fa00023040041e0040fc0002a00040fd000c4010620020a40100000a000102fa0002314004
+1c0040fc0002200080fb000a6200132801000012000142fa00023140041f0078fc0002100080fb000a24001318010000121002cdfd00050800004a80041f0040fc0002100080fb000a14001b10010000111802adfd00050800004a80041f0040fc0002080080fb000a14001c0000880021e802b5fd0005140000ca80041f00
+40fc00010801fa00141400040000740020280c35803400001400014400041f0040fc00010c01fa00141800040000440040240c04803600001400010400042523400000aa0a020ec280020c801021001c809050009204c405501c846aee0573625284900c2523400000aa0a020ffe80020c801021001c8090500091ffc407f0
+1cffebfffff3fffe84900c06007fdfff00fc02dd00a00083ff}}\par 
+\pard \s8\qj\fi-1140\li1140\sa120\sl240\tx1140 Figure 5.2\tab Example output from the positional base preferences method. Most of the sequence is coding for proteins.\par 
+\pard\plain \s9\fi-560\li860\sb400\sa60\sl280\tx1140 \b\f20 2.2.2\tab Using a nonglobal standard\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Make an appropriate codon usage file as described in the chapter on statistical analysis of nucleotide sequences.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Select "Positional base preferences method".\par 
+3.\tab Select "Standard source" as "Codon usage table".\par 
+4.\tab Define "File name of standard". The file will be read and displayed on the screen.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+Select "Normalisation" as "Combine with global standard". This alternative means we will use the values for the first two positions of codons combined with the third position values from our codon table. Otherwise ("Use observed frequencies") will use a
+ll three positions from our codon table. The positional base frequencies to be used will be displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Accept "Use 1.0 for positional weights". The alternative allows users to 
+give greater or lesser emphasis to any of the three positions by defining weights for each. The program displays the "Expected scores per codon in each frame".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Window length". Windows shorter than the default of 67 may be useful if the bias is sufficiently strong. Look at the "Expected scores in each frame" to help decide.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Define "Plot interval".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab Accept "Plot relative scores". This means that for each frame we plot its score divided by the sum of the scores for all three frames. It produces
+ smoother plots than the alternative "Plot absolute scores" which simply plots the scores for each frame. The minimum and maximum expected scores for the given standard and window length are displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Accept "Leave scaling values unchanged". The expected scores just displayed will be used to scale the plots. If required the user can change the scaling values at this point.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 The plot will now appear as in figure 5.2. Typical dialogue is shown in figure 5.3.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab The codon usage method\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The codon usage meth
+od scans along a sequence and measures the closeness of each reading frames codon composition to an expected set of codons. Of the methods described it is the most sensitive, but consequently has to make the strongest assumption, namely that we know the ap
+proximate codon usage for the genes being searched for. The codon usage will depend on the codon preferences and the amino acid composition of the protein product. For this reason the program contains three methods of "normalisation". The table of codon us
+age may be used as read "Observed frequencies"; the table may be transformed to reflect an average amino acid composition "Normalise to average amino acid composition"; the table may be transformed to have no amino acid bias "Normalise to no amino acid bia
+s". The table can be read from a file produced by "Creating a codon usage file" as described in the chapter on statistical analysis of nucleic acid sequences, or an "internal standard" can be used by the user defining a region of the current sequence. In t
+he latter case the program will calculate the codon usage for the defined region.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Codon usage method".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Reject "Define internal standard". If an internal standard is used the program will ask for the end points of the segments over which to calculate the codon usage.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "File name of standard". The file will be read and displayed on the screen.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Select "Normalisation" as "Average amino acid composition". The program will display the expected values for each reading frame for the window lengths 21, 31 and 41 codons. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Select "Window length".\par 
+6.\tab Select "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The plot will appear as in figure 5.4. This shows a 10,000 base section of sequence that codes for several proteins in each of the three reading frames. See the introduction for an explanation of the plotting scheme used.\par 
+\pard\plain \li1840\ri1980\sl220\pagebb\box\brsp100\brdrth \f4\fs16 Positional base preferences method to find protein genes\par 
+\pard \li1840\ri1980\sl220\box\brsp100\brdrth Select standard source\par 
+X  1 Use global standard\par 
+   2 Use internal standard\par 
+   3 Use codon usage table\par 
+? Selection  (1-3) (1) =3\par 
+? File name of standard=atpase.cods\par 
+     ===========================================\par 
+     F TTT  21. S TCT  33. Y TAT  15. C TGT   5.\par 
+     F TTC  55. S TCC  40. Y TAC  40. C TGC   4.\par 
+     L TTA   8. S TCA   7. * TAA   8. * TGA   0.\par 
+     L TTG  19. S TCG  12. * TAG   1. W TGG  17.\par 
+     ===========================================\par 
+     L CTT  22. P CCT  17. H CAT   6. R CGT  73.\par 
+     L CTC  21. P CCC   4. H CAC  30. R CGC  23.\par 
+     L CTA   1. P CCA  10. Q CAA  19. R CGA   5.\par 
+     L CTG 168. P CCG  48. Q CAG  80. R CGG   3.\par 
+     ===========================================\par 
+     I ATT  47. T ACT  14. N AAT  17. S AGT   8.\par 
+     I ATC  98. T ACC  54. N AAC  52. S AGC  26.\par 
+     I ATA   6. T ACA   7. K AAA  85. R AGA   0.\par 
+     M ATG  75. T ACG  13. K AAG  28. R AGG   0.\par 
+     ===========================================\par 
+     V GTT  67. A GCT  56. D GAT  41. G GGT  90.\par 
+     V GTC  29. A GCC  53. D GAC  66. G GGC  66.\par 
+     V GTA  49. A GCA  59. E GAA 101. G GGA   5.\par 
+     V GTG  57. A GCG  64. E GAG  41. G GGG   8.\par 
+     ===========================================\par 
+Select normalisation\par 
+X  1 Use observed frequencies\par 
+   2 Combine with global standard\par 
+? Selection  (1-2) (1) =2\par 
+          T      C      A      G      Range\par 
+      1  0.177  0.211  0.277  0.336  0.159\par 
+      2  0.271  0.238  0.310  0.182  0.128\par 
+      3  0.242  0.301  0.168  0.289  0.132\par 
+? Use 1.0 for positional weights (y/n) (y) =\par 
+ Expected scores per codon in each frame\par 
+       0.785     0.736     0.736\par 
+? odd span length (31-101) (67) =\par 
+? plot interval (1-11) (5) =\par 
+? Plot relative scores (y/n) (y) =\par 
+\par 
+   Minimum  maximum    range\par 
+    0.3219   0.3519   0.0214\par 
+\pard \li1840\ri1980\sl220\keepn\box\brsp100\brdrth ? Leave scaling values unchanged (y/n) (y) =\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 5.3\tab 
+Typical dialogue from the "Positional base preferences method" using a nonglobal standard in the form of a codon table to specify the values for the third positions in codons.\par 
+\pard\plain \s6\sb400\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Searching for open reading frames\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This routine finds all open reading frames of some minimum length and writes its results in the form of an EMBL feature table. \par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find open reading frames".\par 
+\pard\plain \sl220\keepn \f4\fs16 {{\pict\macpict\picw442\pich218 
+0f42ffffffff00d901b91101a0008201000affffffff00d901b9090000000000000000310000000000d801b898002400000000008d012000000000008d011f0000000000d801b8000102dd0006007fdfff00fc1e0040fb000ef0fe00f26100dc0e004000180ffa40fe00020ffdc0fa0000041f0070fc000f01110101159180
+a412004000280906c0fe0002100240fa0000041f0040fc000f011101010d92808232004008280802a0fe0002100240fa0000041f0040fc000f02110101080a81027200c008288801a0fe0002100040fa0000041f0070fc000f02110081080a89037108d49425900120fe0002200040fa0000041f0040fc000f02090082000a
+8900a118d59445700120fe0002200040fa0000041f0070fc0015040a00c200048900a114dd9446700120000003c00020fa0000041f0040fc0015040a002200048a002124db5446100020000002000020fa0000041f0040fc0015040a002e00044a000123526282100020000002000020fa0000041f0070fc0015040a001000
+004a000121226280000020000002000020fa000004220040fc000b040a001000005a0001212223fe000920000002000020000008fd000004210040fc00010404fd00055c0000a02003fe000920000002000020000018fd000004210070fc00010404fd0005540000a02003fe000910000002000020000018fd000004200040
+fc000004fc0005740000a02001fe000910000002000020000018fd000004210070fc000008fc0005500000c02001fe000e100000040000200000180020000004210040fc000008fc0005100000c00001fe000e1000000400001000001400500000041e0070fc000008f90002c00001fe000e10000004000010000014005000
+00041c0040fc000018f90000c0fc000e1000001400001000001400480000041c0040fc000018f9000040fc000e1000001c00001000002400480000041e067c66de6dd21858f9000040fc000e1e7ff6fc00003dbebfe797cf9ddefc1e066c66de6dd21850f9000040fc000e1e7ff6e400003dbebfe797cf9ddefc1a067c66de
+6dd2185ff3ff02fe7ff6feff08fdbebfff97ff9ddefc1a066c66de6dd21850f3000e1e7ff6e400003dbebfe797cf9ddefc180040fc000010f3000e100000200000094002a40584001004180070fc000010f3000e080000400000094002a40584003004180040fc000010f3000e080000400000094007424804002804180040
+fc000010f3000e0800004000000a400543c804002804180070fc000020f3000e0800004000000a4004033804002804180060fc000020f3000e0800004000000a3004023804002804180070fc000020f3000e04000040000006300c023004002804180060fc000020f3000e040000400000063008003002002804190060fd00
+010220f3000e040000400000063008001002004804190078fd00010220f3000e040000400000060808001002004804190058fd00010220f3000e040000400000060808000002004804190078fd00010220f3000e040000800000020808000002004804190048fd00010320f3000302000080fe000708080000020048041900
+44fd00010520f3000302000080fe00070810000002004804190074fd00010520f3000302000080fe000708100000020044041a0644040000014520f3000302000080fe000704100000020084041a067406008001c520f3000302000080fe000704100000020084041a06440a0080012540f3000302000180fe000704100000
+010084041a06440a0080022540f3000302000180fe000704100000010084041a06720901400224c0f3000301800280fe00070220000001008704252362796940a3daec02e005042000000800400000a70019e403041201200220210005b90484252373f9df7fffdaec02e005042000000800400000a70019ffff0412012003
+e0210005ff04fc06007fdfff00fc180643803f0e1e00c0f2000171eefd000101e0fe0002ff00041906728041111200c0f20002891280fe00010120fe0002810004190642804090a200c0f200028a1280fe00010120fe00028100041906424040a0a10140f20002860180fe00010120fe00028100041a06724080e0618140f3
+000301060180fe00010220fe00028080041a0642408000018120f3000301040180fe00010210fe00028080041a0672298000004920f3000301000080fe000702100000010080041a014419fe00014920f3000301000080fe000702100000010080041a014416fe00012920f3000301000080fe000702100000010080041a01
+7406fe00013620f3000e0100004000000202100000010080041a014406fe00011620f3000e020000400000020210000001004004190074fd00011620f3000e020000400000060210000001004004190044fd00010620f3000e020000400000060210000001004004180044fc000020f3000e02000040000006021000000100
+4004180074fc000020f3000e020000400000060208020002002004180044fc000020f3000e0200004000000a0408020002002004180074fc000020f3000e0200004000000a0408030002002004180048fc000010f3000e0400004000000a0408030002002004180048fc000010f3000e040000400000090408030002002004
+230078fc001d3c7a36ac17fffffdf7dddefebfffb1fc0000768bba9b5c0e85c31a003cfc230068fc001d3c7a36ac17fffffdf7dddefebfffb1fc0000768bba9b5c0e85c39c003cfc230068fc001d3c7a36ac17fffffdf7dddefebfffb1f40000768bba9b5c0ec5c39c003cfc23007ffcff0efc7a36ac17fffffdf7dddefebf
+ffb1feff0bf68bba9f5ffec7c39ffffcfc180048fc000010f3000e100000200000111404c482880010041c0078fc000010f9000040fc000e100000200000111404c484880010041c0068fc000010f9000040fc000e100000200000111404c484880010041c0070fc000010f90000c0fc000e10000020000011340524848800
+10041e0070fc000008f90002c00001fe000e10000020000010b4052454480008041e0070fc000008f90002c00001fe000e10000020000020f405285850000804210070fc000008fc0005400000c00001fe000e10000020000020e805285850000804210050fc000008fc0005640000a00003fe000e10000020000020880628
+7850000804220040fc00010804fd0005640000a00003fe000e100000200000208806287850000804220070fc00010404fd0005640001200003fe000e200000200000208806283050000804230040fc000b040a001000005a0001210203fe000e2000003c0000200800283050000804230070fc000b040a003000005a000123
+0203fe000e200000240000200800280050000804230040fc001d040a002800005a0001230203001000200000240000200800180060000804230040fc001d040a004c00009a0001250302861000200000020000200000180020000404230070fc001d040a0044000099008114830286300120000003c0004000001000200004
+04230040fc001d020900820000890081148302853001200000022000400000100020000404230070fc00180211008200048900c108850445480120000002200040000010fe00010404230040fc000f02110102000a8900c208850429480120fe0005100240000010fe00010404200040fc000f01110102018a808122088484
+294802a0fe0002100240fb00010404200070fc000f011101010291808122004484288906a0fe0002100540fb000104042523400184262c0000949223065500813a00449418898ac68212084805400420800000106d6c2523700184262c0000f4ee22fe7500ff3e007c7c1887fac68212084ff8800420800000106ffc06007f
+dfff00fc070040e0000104fc070070e000010704070040e000010404070040e000010404070070e000010404070040e000010404070070e000010404070040e0000104040b0040e6000008fc000108040b0070e6000008fc000108040b0040e6000008fc000108040b0070e6000008fc000108040b0040e6000008fc000108
+040b0050e6000008fc000108040c0070e600013404fd000108040d0070e6000734040028000008040d0070e6000774040038000008040d0070e6000754070048000008040d0068e60007540700480000080425107fdcef8d2bebf7efdfffc720ffcda7fdfbfeef0fff303dfbe9fc7fdff44eddf6976ef80425107fdcef8d2b
+ebf7efdfffc720ffcda7fdfbfeef0fff303dfbe9fc7fdfb44efdf6976ef00425107fdcef8d2bebf7efdfffc720ffcda7fdfbfeef0fff303dfbe9fc7fdfb44cfdf6976ef00425107fdcef8d2bebf7efdfffc720ffcda7fdfbfeef0fff303dfbedfc7fdffc4ffdfe976efffc140048ed00030800002cfe000784082084840010
+04180078fc000008f300030c00003cfe00078408208486001004180048fc000008f300030c000034fe0007840821034a001004180044fc000008f3000312000034fe0007840811034a002004180074fc000008f3000e12000024000001040811024a0020041c0044fc000018fc000010f9000e12000024000001020811003a
+002004200074fc000018fc000010fe000020fd000e120000240000010210110029002004200044fc000018fc000010fe000020fd000e120000220000010210110001004004210044fd00010418fc000010fe000020fd000e12000022000019021012000100400422017404fe00011424fc000010fe000020fd000e12000022
+000016021012000100400423014406fe00011424fc000018fe00012020fe000e11000042000016021012000100400423017406fe00013a24fc00002cfe0013306080000021000042000016021014000100800425014419fe00072a2400000600002cfe00135250c0000021000042000026021014000100800425014429fe00
+1e4a2400000600042c0020015a50c000002100004200002202100c000100800425237229800000492400000600042a002001de914040002100004200002202100c000100800425234240800000412400000600062a002002d695404000210401420000220210080000808004252372404060604126000006080a6a03300ad6
+8f40c000210601820000220220080000808004252342404090a081460400090c0a4a02b016c18820c001208a01820000200120080000810004251d428040912080c20c00090c1a4102b010418820a001a08a12810000200120fe0002810004251d728040911080c10a00091419812470204100212002c08912811000400120
+fe00028100042523529a212a1190d95e0dcb3aa381ddf873c10835a20ac0972e8338a04801202028048108a42523739a3f2e1e90d9ffffbbfb6381ddfff3c1081fbffec0f7ec83ffffc801e0202804ff08a406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 5.4\tab Example output from the codon usage method. Most of the sequence is coding for proteins.\par 
+\pard\plain \s7\qj\fi-560\li560\sb400\sa120\sl280\tx560 \f20 2.\tab Define "Minimum open frame in amino acids".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Select "Strands". The alternatives are\: + strand only, - strand only, or both strands. Typical output is shown in figure 5.5.\par 
+\pard\plain \li2120\ri2240\sb400\sl220\box\brsp100\brdrth \f4\fs16 FT   CDS             525..965              \par 
+\pard \li2120\ri2240\sl220\box\brsp100\brdrth FT   CDS             956..1789             \par 
+FT   CDS             2128..2607            \par 
+FT   CDS             2604..3155            \par 
+FT   CDS             3159..4709            \par 
+FT   CDS             4733..5623            \par 
+FT   CDS             5539..7032            \par 
+FT   CDS             7044..7454            \par 
+FT   CDS             7797..8134            \par 
+FT   CDS             complement(2227..2634)\par 
+FT   CDS             complement(2250..3023)\par 
+FT   CDS             complement(3027..3899)\par 
+FT   CDS             complement(3903..4760)\par 
+FT   CDS             complement(4327..4626)\par 
+FT   CDS             complement(4646..5332)\par 
+FT   CDS             complement(5345..5647)\par 
+FT   CDS             complement(5635..6012)\par 
+FT   CDS             complement(6016..6441)\par 
+FT   CDS             complement(6445..7083)\par 
+FT   CDS             complement(7035..7445)\par 
+\pard \qj\li2120\ri2240\sl220\keepn\box\brsp100\brdrth FT   CDS             complement(7406..7777)\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 5.5\tab Typical output from "Find open reading frames"\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Searching for tRNA genes\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 tRNA genes have two classes of feature that can be used to locate them in genomic sequences\:
+ their ability to fold into the cloverleaf secondary structure, and the presence of specific "conserved" bases at particular positions relative to this structure. The level of congruence with the canonical structure is quite variable\:
+ some tRNA genes contain intervening sequences and others, particular those from organelles, have few of the conserved bases. The program searches for potential cloverleaf forming str
+uctures and optionally the presence of conserved bases. The user can define the range of loop sizes, the minimum numbers of potential base pairs, a range of intron sizes, and which, if any, of the conserved bases should be present. The results are presente
+d either textually or graphically. \par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "tRNA search".\par 
+2.\tab Define "Maximum tRNA length".\par 
+3.\tab Define "Aminoacyl stem score". See note 8.\par 
+4.\tab Define "Tu stem score".\par 
+5.\tab Define "Anticodon stem score".\par 
+6.\tab Define "D stem score".\par 
+7.\tab Define "Minimum base pairing total".\par 
+8.\tab Define "Minimum intron length".\par 
+9.\tab Define "Maxmimum intron length".\par 
+10.\tab Define "Minimum length for TU loop".\par 
+11.\tab Define "Maximum length for TU loop".\par 
+12.\tab Accept "Skip search for conserved bases". See notes section.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 13.\tab Reject "Plot results". 
+This gives listed output in which the potential cloverleafs are displayed. The alternative plotted output simply draws a vertical line to represent the score for the potential gene, at the position it has been found. Typical dialogue and the beginning of s
+ome listed output is shown in figure 5.6.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+In general, for finding protein genes, we recommend the use of all the methods. The "Uneven positonal base frequencies" method can show which regions are likely to be coding but not which strand or fram
+e. The "Positional base preferences" method can show the correct frame and also help to find which regions are coding. The "Codon usage" method has the greatest resolution, having been used successfully with windows of 11 codons, and can help find small ex
+ons and to pinpoint exon/intron boundaries.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab 
+When the "Uneven positional base frequencies" calculation was applied to all the sequences in the 1984 version of the EMBL library 14% of noncoding segments failed to reach the value represented by the base of
+ the box, whereas all coding segments did. The top value of the box was not reached by any noncoding segments but was exceeded by 16% of coding sequences. 76% of noncoding segments failed to reach the line labelled 76% but 76% of coding segments fell above
+ it. We would not expect this result change significantly if it were to be recalculated on the current libraries.\par 
+3.\tab When the "Positional base preferences" method, using "global" values, was applied to all the {\i E. coli} genes in the 1984 version of the EMBL library it chose the correct reading frame for 91% of coding segments. {\i E. coli}
+ sequences were used for technical rather than scientific reasons and we have no reason to believe that other organisms should give significantly different results. This result used only the values for the first two positions in codons and so for genes wit
+h a strongly biased base composition we would expect even better discrimination.\par 
+\pard\plain \li1180\ri1440\sb100\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 tRNA search\par 
+\pard \li1180\ri1440\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth ? Maximum trna length (70-130) (92) =\par 
+? Aminoacyl stem score (0-14) (11) =\par 
+? Tu stem score (0-10) (8) =\par 
+? Anticodon stem score (0-10) (8) =\par 
+? D stem score (0-8) (3) =\par 
+? Minimum base pairing total (30-44) (30) =\par 
+? Minimum intron length (0-30) (0) =\par 
+? Maximum intron length (0-30) (0) =\par 
+? Minimum length for TU loop (4-12) (6) =\par 
+? Maximum length for TU loop (6-12) (9) =\par 
+? Skip search for conserved bases (y/n) (y) =n\par 
+Give a score for each base, then a minimum total at the end\par 
+? Base  8, T is 100% conserved. Score (0-100) (0) =\par 
+? Base 10, G is  95% conserved. Score (0-100) (0) =\par 
+? Base 11, Y is  96% conserved. Score (0-100) (0) =\par 
+? Base 14, A is 100% conserved. Score (0-100) (0) =\par 
+? Base 15, R is 100% conserved. Score (0-100) (0) =\par 
+? Base 21, A is  97% conserved. Score (0-100) (0) =\par 
+? Base 32, Y is 100% conserved. Score (0-100) (0) =\par 
+? Base 33, T is  98% conserved. Score (0-100) (0) =\par 
+? Base 37, A is  91% conserved. Score (0-100) (0) =\par 
+? Base 48, Y is 100% conserved. Score (0-100) (0) =\par 
+? Base 53, G is 100% conserved. Score (0-100) (0) =\par 
+? Base 54, T is  95% conserved. Score (0-100) (0) =\par 
+? Base 55, T is  97% conserved. Score (0-100) (0) =\par 
+? Base 56, C is 100% conserved. Score (0-100) (0) =\par 
+? Base 57, R is 100% conserved. Score (0-100) (0) =\par 
+? Base 58, A is 100% conserved. Score (0-100) (0) =\par 
+? Base 60, Y is  92% conserved. Score (0-100) (0) =\par 
+? Base 61, C is 100% conserved. Score (0-100) (0) =\par 
+? Minimum total conserved base score (0-0) (0) =\par 
+? Plot results (y/n) (y) =n\par 
+       264\par 
+                   t\par 
+                 t-a\par 
+                 c-g\par 
+                 a-t\par 
+                 t+g\par 
+\pard \li1180\ri1440\sl220\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth                  a-t\par 
+                 a a\par 
+                 a-t     gta\par 
+                c   aacgc\par 
+          a    t     !!!!   c\par 
+           cgt      gtgcg   a\par 
+           !!!      t    cga\par 
+         a gca       c\par 
+          g    t      g\par 
+                c aa   t\par 
+                a-t a\par 
+                t-a  t a\par 
+                t-a\par 
+                t-a\par 
+               g   t\par 
+               c   g\par 
+\pard \li1180\ri1440\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth                 caa\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 5.6\tab Typical dialogue and textual output from "Find tRNA genes".\par 
+\pard\plain \s7\qj\fi-560\li560\sa80\sl280\tx560 \f20 4.\tab If the codon table used by the "Codon usage" me
+thod is normalised to have average amino acid composition it retains its codon preference bias for each amino acid type but now the amino acid composition is the average of all proteins. In general this is optimal\:
+ we have the expected codon preference bia
+s plus an expected amino acid bias. If we normalise to no amino acid bias we are safeguarding ourselves against missing a protein of anomalous composition but at the expense of not employing all of the useful information for distinguishing coding from nonc
+oding. \par 
+\pard \s7\qj\fi-560\li560\sa80\sl280\tx560 5.\tab 
+The program also contains a graphical version of Ficketts method (6), except here we use a window to analyse each segment of the sequence rather than giving a single value for each open reading frame. The tables used are those from the original publicat
+ion.\par 
+\pard \s7\qj\fi-560\li560\sa80\sl280\tx560 6.\tab If the results from the "Find open reading frames" option are directed to disk (See the introductory chapter), the file can be used by the routines that use feature tables as input.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab The program also contains several routines for plotting the positions of stop and start codons for either strand of the sequence. One form of the output is included in figures 5.2 and 5.4.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab The tRNA gene search using a simple scoring system for base pairing\:
+ A-T and G-C base pairs each score 2 and G-T scores 1. The use of a "Minimum base pairing total" allows low cutoffs to be set for each individual stem, but that overall some reasonable 
+level of stability is possible. In this way a low score for one stem can be compensated by a high score in another.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Th
+e cloverleaf is composed of four base-paired stems and four loops. Three of the stems are of fixed length but the fourth, the dhu stem which usually has four base pairs, sometimes has only three. All of the loops can vary in size. The following relationshi
+ps between the stems in the cloverleaf are assumed in the program\:
+ (a) there are no bases between one end of the aminoacyl stem and the adjoining tuc stem; (b) there are two bases between the aminoacyl stem and the dhu stem; (c) there is one base between t
+he dhu stem and the anticodon stem; (d) there are at least three bases between the anticodon stem and the tuc stem.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R. and McLachlan, A.D. 1982. Codon preference and its use in identifying protein coding regions in long DNA sequences. {\i Nucl. Acids Res.} {\b 10}\:151-156.\par 
+2.\tab Staden, R.  1984.  Measurements of the effects that coding for a protein has on a DNA sequence and their use for finding genes. {\i Nucl. Acids Res}. {\b 12}\:551-567.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Staden, R.  1985.  Computer methods to locate genes and signals in nucleic acid sequences.  (in) {\i Genetic Engineering, Principle and Methods}, Setlow J.K., Hollaender A., (eds.), {\b 7}\:
+67-114, (Plenum Press, New York).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Staden, R. 1990. Finding Protein Coding Regions in Genomic Sequences. (in) {\i Methods in Enzymology} R.F. Doolittle (ed.),  {\b 183}\:163-180  (Academic Press, New York).\par 
+5.\tab Staden, R.  1980.  A computer program to search for tRNA genes. {\i Nucl. Acids Res}. {\b 8}\:817-825.\par 
+6.\tab Fickett, J.W. 1982. Recognition of protein coding regions in DNA sequences. {\i Nucl. Acids Res}. {\b 10}\:5303-5318.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 6.  Searching for Motifs in Nucleic Acid Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Searching for percentage matches to consensus sequences\par 
+2.2\tab Searching for consensus sequences using a score matrix\par 
+2.3\tab Using weight matrices for searching nucleotide sequences\par 
+2.4\tab Using "hardwired" motif searches.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program NIP contains several ways of defining and searching for motifs (1-4), and also contains a number of "hardwired" motifs that are already 
+defined and can be selected as separate searches. We describe searches for percentage matches to consensus sequences, the use of score matrices and the creation and use of nucleotide and dinucleotide weight matrices (see note 7). In addition we give detail
+s of the "hardwired" motifs available from the program. In another chapter we have covered searches for exact matches to consensus sequences by describing how to find restriction enzyme recognition sequences. When searching for exact matches, percentage ma
+tches or using a score matrix the search string or consensus sequence may include IUB redundancy codes. All of the searches produce both listed and graphical output. The listed output displays the matching sequence and its position and the graphical output
+ draws a box to represent the length of the sequence, and plots vertical lines within the box at the positions of matches. The heights of the lines are proportional to the match score (see figure 6.1).\par 
+\pard\plain \sb200\sl220\keepn \f4\fs16 {{\pict\macpict\picw447\pich44 
+032fffffffff002b01be1101a0008201000affffffff002b01be0900000000000000003100000000002a01bd98002400000000001d012000000000001d011f00000000002a01bd000102dd0006007fdfff00fc060040df000004060040df000004060040df0000041002400088f7000020f1000001fd0000041002400088f7
+000020f1000001fd0000041002400088f7000020f1000001fd00000421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe0005014200
+05c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020
+012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc0003021004
+60fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482
+b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00421044482b81210fc000302100460fc00078080000020012008fe000501
+420005c00421044482b81210fc000302100460fc00078080000020012008fe000501420005c00406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.1\tab Typical graphical output from a motif sea
+rch. It shows a rectangular box in which each match is identified by a vertical line whose height gives the match score and whose x coordinate indicates the position in the sequence.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Searching for percentage matches to consensus sequences\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find percentage matches".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "Type in strings". The alternative allows the string to be extracted from a named file.\par 
+3.\tab Reject "Keep picture". This will cause the graphics window to be cleared. The alternative leaves it unchanged.\par 
+4.\tab Define "String". Type in the search string. When the program cycles round to this point again the previous string will be offered as a default.\par 
+5.\tab Accept "This sense". The alternative directs the program to search for the complement of the string.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Percent match". The search is performed, the results are presented graphically (see figure 6.1), the number of matches displayed, and the scores and positions of the top 10 matches displayed.
+\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define the number of matches to "Display". For the number of mat
+ches chosen the program will display the search string and matching sequence written one above the other with matching characters indicated by asterisk symbols. The program now cycles round to step 3. See figure 6.2.\par 
+\pard\plain \li220\ri280\sb400\sl220\box\brsp100\brdrth \f4\fs16  Find percentage matches\par 
+\pard \li220\ri280\sl220\box\brsp100\brdrth ? Type in string (y/n) (y) =\par 
+ ? Keep picture (y/n) (y) =\par 
+ ? String=AAAATTTT\par 
+STRING=AAAATTTT\par 
+? This sense (y/n) (y) =\par 
+ ? Percent match (1.00-100.00) (70.00) =\par 
+\par 
+Total scoring positions above 70.000 percent =  41\par 
+Scores         7      7      7      7      6      6      6      6      6      6\par 
+Positions    428    534   2994   7026    130    191    192    372    427    429\par 
+? Display (0-41) (0) =4\par 
+\par 
+       428\par 
+         aaaatatt\par 
+         ***** **\par 
+         AAAATTTT\par 
+         1\par 
+\par 
+       534\par 
+         aaagtttt\par 
+         *** ****\par 
+         AAAATTTT\par 
+         1\par 
+      2994\par 
+         aaaatttc\par 
+         *******\par 
+         AAAATTTT\par 
+         1\par 
+\par 
+      7026\par 
+         aaaacttt\par 
+         **** ***\par 
+         AAAATTTT\par 
+\pard \li220\ri280\sl220\keepn\box\brsp100\brdrth          1\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.2\tab Worked example for the percentage match search\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.2\tab Searching for consensus sequences using a score matrix\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+A score matrix gives a score for the alignment of each possible pair of sequence symbols. The matrix used by this program includes all the IUB redundancy codes and gives scores that represent the level of redundancy. The matrix is shown in figure 6.3.
+\par 
+\pard\plain \s7\qj\fi-560\li560\sb200\sa120\sl280\tx560 \f20 1.\tab Select "Find matches using a score matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "Type in strings". The alternative allows the string to be extracted from a named file.\par 
+3.\tab Reject "Keep picture". This will cause the graphics window to be cleared. The alternative leaves it unchanged.\par 
+4.\tab Define "String". Type in the search string. When the program cycles round to this point again the previous string will be offered as a default.\par 
+5.\tab Accept "This sense". The alternative directs the program to search for the complement of the string. The program displays the maximum possible score for the string.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Score". The search is performed, the results are presented graphically (see figure 6.1), the number of matches displayed, and the scores and positions of the top 10 matches displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab 
+Define the number of matches to "Display". For the number of matches chosen the program will display the search string and matching sequence written one above the other with matching characters indicated by asterisk symbols. The program now cycles round
+ to step 3. The dialogue shown in figure 6.2 is almost exactly the same as that for "Searching for consensus sequences using a score matrix".\par 
+\pard\plain \li1580\ri1560\sb300\sl220\box\brsp100\brdrth \f4\fs16      T  C  A  G  -  R  Y  W  S  M  K  H  B  V  D  N  ?\par 
+\pard \li1580\ri1560\sl220\box\brsp100\brdrth T   36  0  0  0  9  0 18 18  0  0 18 12 12  0 12  9  0\par 
+C    0 36  0  0  9  0 18  0 18 18  0 12 12 12  0  9  0\par 
+A    0  0 36  0  9 18  0 18  0 18  0 12  0 12 12  9  0\par 
+G    0  0  0 36  9 18  0  0 18  0 18  0 12 12 12  9  0\par 
+-    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0\par 
+R    0  0 18 18 18 36  0  9  9  9  9  6  6 12 12 18  0\par 
+Y   18 18  0  0 18  0 36  9  9  9  9 12 12  6  6 18  0\par 
+W   18  0 18  0 18  9  9 36  0  9  9 12  6  6 12 18  0\par 
+S    0 18  0 18 18  9  9  0 36  9  9  6 12 12  6 18  0\par 
+M    0 18 18  0 18  9  9  9  9 36  0 12  6 12  6 18  0\par 
+K   18  0  0 18 18  9  9  9  9  0 36  6 12  6 12 18  0\par 
+H   12 12 12  0 27  6 12 12  6 12  6 36  8  8  8 27  0\par 
+B   12 12  0 12 27  6 12  6 12  6 12  8 36  8  8 27  0\par 
+V    0 12 12 12 27 12  6  6 12 12  6  8  8 36  8 27  0\par 
+D   12  0 12 12 27 12  6 12  6  6 12  8  8  8 36 27  0\par 
+N    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0\par 
+\pard \li1580\ri1560\sl220\keepn\box\brsp100\brdrth ?    0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.3\tab The DNA score matrix using IUB symbols\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Using weight matrices for searching nucleotide sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 A we
+ight matrix is the most sensitive way of defining a motif. It is a table of values that gives scores for each base type in each position along a motif. For a motif of length 8 bases the weight matrix would be a table 8 positions long and 4 deep. The simple
+st way of choosing the values for the table is to take an alignment of all known examples of the motif and to count the frequency of occurrence of each base type at each position. These frequencies can be used as the table of weights. When the table is use
+d to search a new sequence the program calculates a score for each position along the sequence by adding or multiplying (see note 6) the relevant values in the table. All positions that exceed some cutoff score are reported as matching the original set of 
+motifs.\par 
+\pard \s4\qj\sa120\sl280 
+How can we select a suitable cutoff score? The simplest way is to apply the weight matrix to all the known occurrences of the motif - i.e. the set of sequence segments used to create the table - and to see what scores they achieve. The cutoff can b
+e selected accordingly. For convenience the weight matrix is stored as a file along with its cutoff score, a title that is displayed when the file is read, and a few other values need by the program. A routine for creating weight matrix files from sets of 
+aligned sequences is included in the program. When a search using the weight matrix is performed the program will either list the matching sequence segments or plot their positions as for the other motif search methods.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.3.1\tab Creating a weight matrix file from a set of aligned sequences\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Motif search using weight matrix".\par 
+2.\tab Select "Make weight matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 3.\tab 
+Define "Name of aligned sequences file". We assume the file of aligned sequences has already been created (See note 3). The program reads and displays the contents of the file numbering each sequence as it goes. Then it displays the length of the longes
+t sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Accept "Sum logs of weights". The alternative is to sum the weights when calculating scores (see note 4). \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Accept "Use all motif
+ positions". The alternative allows the user to define a "mask" which identifies positions within the motif that should be ignored when the matrix is created (see note 5). The program now calculates the weights and applies them in turn to each of the seque
+nces in the file. The number and score for each sequence is displayed, followed by the top, bottom and mean scores and the standard deviation. In addition the mean plus and minus 3 standard deviations is displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Cutoff score". The default is the mean minus 3 standard deviations, but users may, for example, decide to use the lowest score obtained by the sequences in the file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Top score for scaling plots". This parameter is used by the graphics output routine when scaling the plots. Its value will influence the height of lines plotted to represent matches.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Define "Position to identify". When a search is performed it is not always appropriate to report the position of a match relative to the leftmost base in the motif. For example wh
+en performing a splice junction search we may want to know the position of the G in the conserved GT, rather than the position of the first base in the matrix. The "Position to identify" allows the user to define which base is marked. The bases in the tabl
+e are number 1,2,3 and so on.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab Define a "Title". This is a title that will be displayed when the matrix file is read prior to performing a search. It is limited to 60 characters.\par 
+10.\tab Define "Name for new weight matrix file". Give a name for the weight matrix file. Typical dialogue is shown in figure 6.4.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 \page 2.3.2\tab Searching using a weight matrix\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Once a weight matrix has been stored in a file it can be used to search any sequence. Results can be displayed graphically or the matching sequence segments can be listed out with their scores.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Motif search using weight matrix".\par 
+2.\tab Select "Use weight matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "Motif weight matrix file". The name of the file containing the weight matrix. The program reads the file and displays its title.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define 
+"Cutoff score". The default will be the value set when the weight matrix file was created. If the score is negative the program will calculate sums of logs of frequencies, otherwise it will add frequencies.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Accept "Plot results". Alternatively they will be listed.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The results will appear as in figure 6.5\par 
+\pard\plain \li1440\ri1500\sb200\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 Motif search using weight matrix\par 
+\pard \li1440\ri1500\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Select operation\par 
+X  1 Use weight matrix\par 
+   2 Make weight matrix\par 
+   3 Rescale weight matrix\par 
+? Selection  (1-3) (1) =2\par 
+? Name of aligned sequences file=heatshock.seq\par 
+     1 ATAAAGAATATTCTAGAA\par 
+     2 CTCGAGAAATTTCTCTGG 144\par 
+     3 TTCTCGTTGCTTCGAGAG  36\par 
+     4 GCCTCGAATGTTCGCGAA  15\par 
+     5 GACTGGAATGTTCTGACC  45 DROSOPHILA HSP68\par 
+     6 ATCTCGAATTTTCCCCTC  12\par 
+     7 ATCCAGAAGCCTCYAGAA  35 DROSOPHILA HSP83\par 
+     8 CTCTAGAAGTTTCTAGAG  25\par 
+     9 TTCTAGAGACTTCCAGTT  15\par 
+    10 CCCCAGAAACTTCCACGG 147 DROSOPHILA HSP22\par 
+    11 GCGAAGAAAATTCGAGAG  46\par 
+    12 TGCCGGTATTTTCTAGAT  26\par 
+    13 CCCGAGAAGTTTCGTGTC  97 DROSOPHILA HSP23\par 
+    14 TTCCGGACTCTTCTAGAA  13 DROSOPHILA HSP26\par 
+    15 CTCGAGAAAGCTCGCGAA 204 XENOPUS HSP70\par 
+    16 CTCGCGAATCTTCCGCGA 194\par 
+    17 CTCGCGAAAGTTCTTCGG 139\par 
+    18 CTCGGGAAACTTCGGGTC  72\par 
+    19 TGCCAGAAGTTGCTAGCA 124 XENOPUS HSP30\par 
+    20 CTCGGGAACGTCCCAGAA  14\par 
+    21 ATCCCGAAACTTCTAGTT 129 SOYBEAN HSP17\par 
+    22 GTCCAGAATGTTTCTGAA  98\par 
+    23 TTTCAGAAAATTCTAGTT  78\par 
+    24 CCCAAGGACTTTCTCGAA  28\par 
+    25 TTTTAGAATGTTCTAGAA 179 DICTYOSTELIUM DIRS-1\par 
+    26 TTCTAGAACATTCGAAGA 169\par 
+Length of motif    18\par 
+? Sum logs of weights (y/n) (y) =\par 
+ ? Use all motif positions (y/n) (y) =\par 
+ Applying matrix to input sequences\par 
+   1      -15.609 ATAAAGAATATTCTAGAA\par 
+   2      -15.965 CTCGAGAAATTTCTCTGG\par 
+   3      -18.186 TTCTCGTTGCTTCGAGAG\par 
+   4      -15.331 GCCTCGAATGTTCGCGAA\par 
+   5      -20.897 GACTGGAATGTTCTGACC\par 
+   6      -17.347 ATCTCGAATTTTCCCCTC\par 
+   7      -16.271 ATCCAGAAGCCTCYAGAA\par 
+   8      -12.227 CTCTAGAAGTTTCTAGAG\par 
+   9      -15.933 TTCTAGAGACTTCCAGTT\par 
+  10      -15.604 CCCCAGAAACTTCCACGG\par 
+  11      -17.866 GCGAAGAAAATTCGAGAG\par 
+  12      -17.159 TGCCGGTATTTTCTAGAT\par 
+  13      -16.399 CCCGAGAAGTTTCGTGTC\par 
+  14      -14.646 TTCCGGACTCTTCTAGAA\par 
+  15      -14.801 CTCGAGAAAGCTCGCGAA\par 
+  16      -16.163 CTCGCGAATCTTCCGCGA\par 
+  17      -16.280 CTCGCGAAAGTTCTTCGG\par 
+  18      -15.598 CTCGGGAAACTTCGGGTC\par 
+  19      -17.721 TGCCAGAAGTTGCTAGCA\par 
+  20      -16.257 CTCGGGAACGTCCCAGAA\par 
+  21      -14.243 ATCCCGAAACTTCTAGTT\par 
+  22      -16.456 GTCCAGAATGTTTCTGAA\par 
+  23      -15.453 TTTCAGAAAATTCTAGTT\par 
+  24      -17.443 CCCAAGGACTTTCTCGAA\par 
+  25      -13.335 TTTTAGAATGTTCTAGAA\par 
+  26      -15.914 TTCTAGAACATTCGAAGA\par 
+Top score     -12.227  Bottom score     -20.897\par 
+Mean     -16.119  Standard deviation       1.636\par 
+Mean minus 3.sd     -21.028  Mean plus 3.sd     -11.210\par 
+? Cutoff score (-999.00-9999.00) (-21.03) =\par 
+? Top score for scaling plots (-21.03-999.00) (-11.21) =\par 
+? Position to identify (0-18) (1) =\par 
+? Title=Heatshock weights 24-10-91\par 
+\pard \li1440\ri1500\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth ? Name for new weight matrix file=heatshock.wts\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.4\tab An example run of creating a weight matrix\par 
+\pard\plain \li1400\ri1500\sb300\sl220\box\brsp100\brdrth \f4\fs16  Motif search using weight matrix\par 
+\pard \li1400\ri1500\sl220\box\brsp100\brdrth Select operation\par 
+X  1 Use weight matrix\par 
+   2 Make weight matrix\par 
+   3 Rescale weight matrix\par 
+? Selection  (1-3) (1) =\par 
+? Motif weight matrix file=heatshock.wts\par 
+ Heatshock weights 24-10-91\par 
+? Cutoff score (-9999.00-9999.00) (-21.03) =\par 
+? Plot results (y/n) (y) =\par 
+\par 
+    619    -20.84 gctcggaagcttctgctc\par 
+    818    -20.74 ttggcgaagctttcaaag\par 
+   1190    -21.02 gccaggtaagtttcagac\par 
+   1601    -20.91 tttgcgactgttcggtaa\par 
+   2387    -20.24 cgctcgcagattctggac\par 
+   2534    -20.87 gccgagaagatcatcgaa\par 
+   2890    -16.38 ctcccggatgttctggag\par 
+   2989    -19.54 ctcgcgaaaatttctgct\par 
+   3451    -20.76 atcctggaagttccggtt\par 
+   6020    -20.73 tctcaggaactgctggaa\par 
+   6335    -20.51 gctgagaaattccgtgac\par 
+   7107    -20.31 ctctggtctggtcgagaa\par 
+   7117    -19.61 gtcgagaaaatccaggta\par 
+\pard \li1400\ri1500\sl220\keepn\box\brsp100\brdrth    7892    -20.18 cttccgaaagtgctgcat\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.5\tab Example run of a search using a weight matrix to produce text output.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Using "hardwired" motif searches.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program contains predefined motif definitions for the following\:\par 
+\pard \s4\qj\li1120\sa120\sl280 {\i E. coli} promoters\par 
+prokaryotic ribosome binding sites\par 
+mRNA splice junctions\par 
+eukaryotic ribosome binding sites\par 
+polyadenylation sites\par 
+\pard \s4\qj\sb240\sa120\sl280 All except the po
+lyadenylation site, which is simply defined as an exact match to the string AATAAA, are represented as weight matrices. Each search is performed simply by the user selecting the appropriate option from the menu and each plots its results in its own graphic
+s window. The ribosome binding site searches are reading frame specific and so they normally plot their results to fit nicely with the output from the "gene search by content" methods described in the chapter on finding genes. Likewise the splice junction 
+searches produce separate output for each of the three reading frames. Below, as an example of using the hardwired motifs, we show how to perform such a search.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.4.1\tab Searching for splice junctions\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Splice search using weight matrix". The program automatically reads in weight matrices that define the donor and acceptor sites and displays their titles.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Donor cutoff score". The default is stored in the file.\par 
+3.\tab Define "Acceptor cutoff score". The default is stored in the file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4. \tab Accept "Plot results". The alternative lists the results giving the position, score, matching sequence and reading frame. A typical plotted result appears in figure 6.6.\par 
+\pard\plain \qj\ri-100\sb240\sl480\keepn \f4\fs16 {{\pict\macpict\picw454\pich123 
+04be00000000007b01c6001102ff0c00fffe0000002d8f9e002d8f9e00000000004e011f000000000001000a00000000004e011f0098802400000000004e011f0000000000000000002d8f9e002d8f9e000000010001000100000000000000000000000000439867000000010000ffffffffffff0001000000000000000000
+00004e011f00000000004e011f000002dd0006007fdfff00fc060040df000004060040df0000040a0040e9000020f80000040a0040e9000020f80000040c0040e9000020fa00022000040c0040e9000020fa0002200004110040eb0005200020000080fd0002200004170040fd000001f200071000200020000090fd000220
+0004170040fd000001f200071000600020080090fd0002200004170040fd000001f200071000600020080090fd0002200004180040fe00011001f2000712006000200c0090fd000224000406007fdfff00fc060040df0000040a0040ee000008f30000040a0040ee000008f30000040a0040ee000008f30000040a0040ee00
+0008f30000040a0040ee000008f30000040a0040ee000008f300000c0a0040ee000008f300000c0a0040ee000008f300000c0e0040ee000008fe000010f700000c180040f6000001fc0002010008fe000010fc000008fd00000c2002400004fd0005400010000001fc000601000808001010fc000008fe0001800c06007fdf
+ff00fc060040df000004060040df0000040a0040fc000004e50000040a0040fc000004e50000040c0040fc000004e700020104040c0041fc000004e70002010404100041fc000004fe000008eb0002010404100041fc000004fe000008eb0002010404150041fc000004fe00010814f3000010fb00020104041a014180fd00
+0004fe0005081400400040f7000010fb00020904041b02498008fe000904400200081400400040f7000050fb000209040406007fdfff00fc060040df000004060040df000004060040df000004060040df000004060040df0000040a0040fe000010e30000040e0040fe000010f4000001f10000040e0040fe000010f40000
+01f10000040e0040fe000010f4000001f10000040e0040fe000018f4000001f1000004180040fe000018f60002080001fb00040800008001fc0000041d04400000081afd000005fc000308080001fb00040800008001fc00000406007fdfff00fc060040df000004060040df0000040a0040f8000008e90000040a0040f800
+0008e90000040a0040f8000008e90000040a0040f8000008e90000040e0040f8000008ee000004fd000004140040fa0002400008f6000002fa000004fd000004180040fe000040fe0002400008f6000002fa000004fd000004190040fe000040fe0002400008f600010a02fb000004fd000004220048fe000a402000004000
+4801000001fe0006408000000a0202fc000004fd00000406007fdfff00fc060040df000004060040df000004060040df000004060040df000004090040e2000340000004090040e20003400000040c0340000002e50003400000040c0340000002e50003400000040e0340000002e70005080040020004120340000002eb00
+0001fe0005080040020004120340080002eb000001fe00050800400200041b044008020280f6000040fd000008fd000001fe000508004002000406007fdfff00fc02dd000000ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 6.6\tab 
+Typical graphical output from using the hardwired splice junction search. The results are presented in a reading frame specific way so it shows, in the bottom three boxes, results for donor sites and in the top three boxes those for acceptor sit
+es.  In both cases the vertical ordering of the boxes is frame 0 at the bottom, frame 1 in 
+the middle and frame 2 at the top. For example there is a very strong peak corresponding to an acceptor  in frame 1 that can be seen just over halfway along the sequence .\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+For this program a motif is a short segment of sequence of fixed length. More complex structures termed "patterns" which we define as sets of motifs separated by varying gaps, are covered in another chapter. The current chapter should be read before the
+ chapter on patterns. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab It is debateable whether the gain in sensitivity that is afforded by the use of a score matrix is of value for searching nucleotide sequences, however it is very important for protein sequences.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The files of aligned sequences used to make weight matrices have the following format. Each sequence should be on a separate line. The sequence should start in column 2 and is terminated by a new line or a space.  Anything after the space is treated as 
+a comment. The files can be created by previous searches or using an editor.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab The frequencies in the weight 
+matrix can be used in two ways to calculate scores for sequences. Some users prefer to add the frequencies to give a total score, and others to multiply them by summing their logs. If we regard the frequencies as probabilities then multiplication seems the
+ correct procedure. The user chooses which method will be employed when the weight matrix is created, however the choice can be overridden when the matrix is used. If multiplication is selected then all results will presented as sums of logs.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Masking th
+e weight matrix is particularly useful in cases where a limited number of examples of a motif are available, or when the motif may have several components. In the first case the limited number of examples may make the matrix unrepresentative of the motif b
+ecause the bases in the unconserved positions may bias the results of searches. When a large number of examples is available to create the matrix, the unconserved positions should tend towards equal base composition and hence have no influence on the overa
+ll score. We stated that a motif might have several components\: for example a motif might have both structural and specificity components. We may want to separate out the two parts and masking provides such a facility.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab 
+The weight matrix handling routine contains a further option "Rescale weight matrix". If the user has edited a weight matrix to change the frequency values this provides a way of selecting a new cutoff score.  It allows users to read in a set of aligned
+ sequences and a weight matrix and to apply the matrix to the set of sequences to see the range of scores achieved. A new weight matrix file containing the selected cutoff score is written to disk.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab The program also contains a set of routines identical to those used to create and search for nucleotide weight matrices, but which deal instead with dinucleotide weight matrices. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab The reader is reminded that most options in the program, if selected when in "execute without dialogue" mode, will automatically use a set of defaults and produce a 
+result with little or no user input. Most motif searches require far less user input than that shown above, where we have tried to show the scope of the methods.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab 
+Although the program contains hardwired motifs we expect most sites that use the programs to accumulate their own libraries of motifs and patterns, which users can employ by simply knowing the names of the corresponding files.\par 
+\pard\plain \s5\sa60\sl320\tx560 \b\f20\fs28 \page 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R. 1984. Computer methods to locate signals in nucleic acid sequences. {\i Nucl. Acids Res}. {\b 12}\:521-538.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Staden, R.  1985.  Computer methods to locate genes and signals in nucleic acid sequences.  (in) {\i Genetic Engineering, Principle and Methods, }Setlow J.K., Hollaender A., (eds.), {\b 7}\:
+67-114, (Plenum Press, New York).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Staden, R.  1988.  Methods to define and locate patterns of motifs in sequences.  {\i CABIOS} {\b 4 (1)}\:53-60.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Staden, R. 1990. Searching for patterns in protein and nucleic acid sequences.  (in) {\i Methods in Enzymology} R.F. Doolittle (ed.),  {\b 183}\:193-211  (Academic Press, New York).\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 7.  Using Patterns to Analyse Nucleic Acid Sequences\par 
+\pard\plain \s5\sb200\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Creating a pattern file containing an exact match motif and weight matrix motif.\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.2\tab Searching a sequence using a pattern file\par 
+2.3\tab Comparing a sequence against a library of patterns\par 
+2.4\tab Searching sequence libraries for patterns\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb200\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Here we describe one of the most powerful facilities provided by the program NIP\: the ability to define and search for complex patterns of motifs (1-3). 
+In  another chapter we give details of seaching for individual motifs but here we show how to create patterns and libraries of patterns and to use them to search single sequences and sequence libraries. Once a pattern has been defined and stored in a file 
+it can used to search any sequence. In addition if users want to routinely screen sequences against libraries of patterns this can be achieved by use of files of file names. The program can produce several alternative forms of output. It will display the s
+egment of sequence matching each individual motif in the pattern, display all the sequence between and including the two outermost motifs, produce a description of the match in the form of an EMBL feature table, or draw a simple graphical plot.\par 
+\pard \s4\qj\sa120\sl280 At the end of the chapter we describe how a related program NIPL is used to search libraries of sequences to find patterns. NIPL is capable of producing alignments of sequence families.\par 
+\pard \s4\qj\sa120\sl280 Patterns are defined as sets of motifs with variable spacing. Each motif in a pat
+tern can be defined using any of several methods, and their positions relative to one other are defined in terms of minimum and maximum separations. In addition, by the use of logical operators, each motif can be declared to be essential (the AND operator)
+, optional (the OR operator), or forbidden (the NOT operator). The following methods (termed "classes" by the program) for defining motifs are provided\:
+ 1) exact match to a short sequence; 2) percentage match to a short sequence; 3) match to a short sequen
+ce using a score matrix and cutoff score; 4) match to a weight matrix; 5) match to the complement of a weight matrix; 6) inverted repeat or stem-loop; 7) exact match to a short sequence with a defined step; 8) direct repeat. Classes 1, 2 , 3 and 7 permit t
+he use of IUB redundancy codes.\par 
+\pard \s4\qj\sa120\sl280 The motifs in a pattern are numbered sequentially and motif spacing is defined in the following way. When a new motif is added to a pattern the user specifies the "Reference motif" by its number and then a "Relative start po
+sition". The "Relative start position" is defined by taking the first base of the "Reference motif" as position 1, the next as 2, and so on. Then the user defines the allowed variation in the spacing by specifying the "Number of extra positions". Notice th
+at the position of a motif can be defined relative to any other motif, and that a negative "Relative start position" declares the motif to be to the left of its "Reference motif".\par 
+\pard \s4\qj\sa120\sl280 The probability of finding each individual motif in the current sequence, th
+e product of the probabilities for all the motifs in a pattern "Probability of finding pattern", and the "Expected number of matches" is calculated and displayed by the program. In addition to the cutoffs used for the individual motifs, users can apply two
+ pattern cutoffs\: "Maximum pattern probability" and "Minimum pattern score".\par 
+Below we describe\: how to create a pattern;  how to use a pattern file to search a sequence; how to use a "File of pattern file names" to search a sequence for a whole library of 
+patterns. To describe how to create a pattern file we first show all the steps to make one containing two motifs, and then, to save space, the parts specific to the individual motif types are sketched in the notes section.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2. Methods\par 
+\pard\plain \s6\fi-560\li560\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Creating a pattern file containing an exact match motif and weight matrix motif.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher".\par 
+2.\tab Select "Pattern definition mode" as "Use keyboard".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Select "Results display mode" as "Motif by motif". The alternatives are listed in the introduction.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Select "Motif definition mode" as "Exact match".\par 
+5.\tab Define "Motif name". Each motif can be given an 8 character name.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "String". Type in the sequence of the motif. The program will display the probability of finding the motif.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Select "Motif definition mode" as "Weight matrix".\par 
+8.\tab Define "Motif name".\par 
+9.\tab Select "Logical operator" as "AND". The alternatives are "OR" and "NOT".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Select "Number of reference motif". At this stage the only choice is 1 and this is the default.\par 
+11.\tab Define "Relative start position". The base position relative to the "Reference motif". See the introduction.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 12.\tab Define "Number of extra positions".\par 
+13.\tab Define "Weight matrix file name". Type the name of the file containing the weight matrix.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab The program now cycles round to step 7 and all subsequent passes round the loop to add further motifs to the pattern would differ only in the details for the different motif "classes".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 14.\tab Select "Pattern complete"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 15.\tab Accept "Save pattern in a file". The alternative does not save the pattern and so it can only be used once on the current sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 16.\tab Define "Pattern definition file". Give a name for the new file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 17. \tab 
+"Define "Pattern title". All patterns can have a 60 character title that can be displayed when the pattern file is read and the sequence searched. The program will now display a detailed textual description of the pattern, the "Probability of finding 
+the pattern" and the "Expected number of matches".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 18.\tab Define "Maximum pattern probability". Yes maximum\: any match with a greater probability of being found will be rejected. If no value is specified the search will be quicker (see notes).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 19.\tab 
+Define "Minimum pattern score". A minimum pattern score only makes sense if all the motifs in the pattern are defined with compatible scoring methods. For example percentage matches and weight matrices using sums of logs are incompatible. Searching wil
+l now commence and any matches displayed using the chosen method. A worked example of creating such a pattern and performing a search is shown in figure 7.1, and the actual pattern file is shown in figure 7.2.\par 
+\pard\plain \li1360\ri1300\sb200\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16  Pattern searcher\par 
+\pard \li1360\ri1300\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Select pattern definition mode\par 
+X  1 Use keyboard                  \par 
+   2 Use pattern file              \par 
+   3 Use file of pattern file names\par 
+? Selection  (1-3) (1) =\par 
+Select results display mode\par 
+X  1 Motif by motif                \par 
+   2 Inclusive                     \par 
+   3 Graphical                     \par 
+   4 EMBL feature table            \par 
+? Selection  (1-4) (1) =\par 
+Select motif definition mode\par 
+X  1 Exact match                    \par 
+   2 Percentage match               \par 
+   3 Cut-off score and score matrix \par 
+   4 Cut-off score and weight matrix\par 
+   5 Complement of weight matrix    \par 
+   6 Inverted repeat or stem-loop   \par 
+   7 Exact match, defined step      \par 
+   8 Direct repeat                  \par 
+   9 Pattern complete               \par 
+? Selection  (1-9) (1) =\par 
+? Motif name=T run\par 
+? String=TTTTT\par 
+Probability of score     5.0000 = 0.870E-03\par 
+Select motif definition mode\par 
+X  1 Exact match                    \par 
+   2 Percentage match               \par 
+   3 Cut-off score and score matrix \par 
+   4 Cut-off score and weight matrix\par 
+   5 Complement of weight matrix    \par 
+   6 Inverted repeat or stem-loop   \par 
+   7 Exact match, defined step      \par 
+   8 Direct repeat                  \par 
+   9 Pattern complete               \par 
+? Selection  (1-9) (1) =4\par 
+? Motif name=heat\par 
+Select logical operator\par 
+X  1 And                            \par 
+   2 Or                             \par 
+   3 Not                            \par 
+? Selection  (1-3) (1) =\par 
+? Number of reference motif (1-1) (1) =\par 
+? Relative start position (-1000-1000) (6) =10\par 
+? Number of extra positions (0-1000) (0) =20\par 
+? Weight matrix file name=heatshock.wts\par 
+ Heatshock weights 18-12-90     \par 
+Probability of score   -21.0280 = 0.117E-02\par 
+Select motif definition mode\par 
+   1 Exact match                    \par 
+   2 Percentage match               \par 
+   3 Cut-off score and score matrix \par 
+X  4 Cut-off score and weight matrix\par 
+   5 Complement of weight matrix    \par 
+   6 Inverted repeat or stem-loop   \par 
+   7 Exact match, defined step      \par 
+   8 Direct repeat                  \par 
+   9 Pattern complete               \par 
+? Selection  (1-9) (4) =9\par 
+? Save pattern in a file (y/n) (y) =\par 
+? Pattern definition file=_paper.pat\par 
+? Pattern title=demo pattern\par 
+Pattern description\par 
+\par 
+demo pattern          \par 
+Motif  1 named T run    is of class    1\par 
+Which is an exact match to the string\par 
+TTTTT\par 
+Motif  2 named heat     is of class    4\par 
+Which is a match to a weight matrix with score -21.028\par 
+and the 5 prime base can take positions     10 to      30\par 
+relative to the 5 prime end of motif   1\par 
+It is anded with the previous motif.\par 
+Probability of finding pattern = 0.1015E-05\par 
+Expected number of matches  = 0.1734E+00\par 
+? Maximum pattern probability (0.00-1.00) (1.00) =\par 
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =\par 
+Working\par 
+Match\par 
+    505 T run   \par 
+      ttttt\par 
+    528 heat    \par 
+      ttaaagaaagttttatac\par 
+Total matches found      1\par 
+\pard \li1360\ri1300\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth Minimum and maximum observed scores      -15.34      -15.34\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 7.1\tab Worked example of creating a simple pattern and performing a search.\par 
+\pard\plain \li2380\ri2520\sb300\sl220\box\brsp100\brdrth \f4\fs16 demo pattern \par 
+\pard \li2380\ri2520\sl220\box\brsp100\brdrth  A1          T run    Class \par 
+ TTTTT\par 
+ @ End of string\par 
+ A4          heat     Class \par 
+      1      Relative motif\par 
+     10      Relative start position\par 
+     20      Number of extra positions\par 
+\pard \li2380\ri2520\sl220\keepn\box\brsp100\brdrth heatshock.wts\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb80\sa120\sl240\tx1140 \f21\fs20 Figure 7.2\tab The pattern file created by the work shown in figure 7.1.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.2\tab Searching a sequence using a pattern file\par 
+\pard\plain \s7\qj\fi-560\li560\sb160\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Select "Pattern definition mode" as "Use pattern file".\par 
+3.\tab Select "Results display mode" as "Inclusive"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Pattern definition file". Type the name of the file containing the pattern. The pr
+ogram will read the file then display its title, a detailed textual description of the pattern, the "Probability of finding the pattern", and the "Expected number of matches".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Maximum pattern probability". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Minimum pattern score". Searching will now commence and any matches displayed using the chosen method. A worked example, using the pattern file created in figure 7.1 is shown in figure 7.3.\par 
+\pard\plain \li1300\ri1320\sb300\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16  Pattern searcher\par 
+\pard \li1300\ri1320\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Select pattern definition mode\par 
+X  1 Use keyboard                  \par 
+   2 Use pattern file              \par 
+   3 Use file of pattern file names\par 
+? Selection  (1-3) (1) =2\par 
+? Pattern definition file=_paper.pat\par 
+Select results display mode\par 
+X  1 Motif by motif                \par 
+   2 Inclusive                     \par 
+   3 Graphical                     \par 
+   4 EMBL feature table            \par 
+? Selection  (1-4) (1) =2\par 
+Probability of score     5.0000 = 0.870E-03\par 
+ Heatshock weights 18-12-90    \par 
+Probability of score   -21.0280 = 0.117E-02\par 
+\par 
+Pattern description\par 
+\par 
+ demo pattern      \par 
+Motif  1 named T run    is of class    1\par 
+Which is an exact match to the string\par 
+TTTTT\par 
+Motif  2 named heat     is of class    4\par 
+Which is a match to a weight matrix with score -21.028\par 
+and the 5 prime base can take positions     10 to      30\par 
+relative to the 5 prime end of motif   1\par 
+It is anded with the previous motif.\par 
+Probability of finding pattern = 0.1015E-05\par 
+Expected number of matches  = 0.1734E+00\par 
+? Maximum pattern probability (0.00-1.00) (1.00) =\par 
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =\par 
+Working\par 
+    505 T run   \par 
+      tttttgatgcttgactctaagccttaaagaaagttttatac\par 
+Total matches found      1\par 
+\pard \li1300\ri1320\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth Minimum and maximum observed scores      -15.34      -15.34\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 7.3\tab Worked example of using a pattern file as input.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.3\tab Comparing a sequence against a library of patterns\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This mode of operation allows a sequence to be searched, in turn, for any number of patterns each stored in a separate pattern file. The names of the files containing the individual patterns must be stored in a simple text file. This file is called "a file
+ of pattern file names" and its name is the only user input required to define the search.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher"\par 
+2.\tab Select "Pattern definition mode" as "Use file of pattern file names".\par 
+3.\tab Select "Results display mode" as "Inclusive"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Define "File of pattern file names". Type the name of the file containing the list of pattern file names. The program will read the file and then, in turn, all the pattern files it names. Each of these patterns will be compared against the current seque
+nce but only those that give matches will produce any output. The pattern title and each match will be displayed.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Searching sequence libraries for patterns\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The program NIPL can be used to search sequence libraries for patterns. Its use is similar to the pattern search routine described above, except that it does not have the facility for creating pattern files, so they must be created beforehand using NIP. In
+ addition to its obvious application of finding new occurrences of patterns or checking on their frequency it is a usef
+ul way of obtaining sequence alignments. It can restrict its search to a list of named entries or can search all but those on a list of entries. It can restrict its output to showing the highest scoring match in each sequence, but by default it will show a
+ll matches.\par 
+\pard \s4\qj\sa120\sl280 
+Of its modes of output, two require further description. The first "Padded sections" creates a new file for each match. The file will contain the sequence between and including the two outermost motifs in the pattern. It will be gapped to the f
+urthest extent defined by the pattern, which means that if all the files were subsequently written one above the other all the motifs in the pattern would be exactly aligned, with the sections between them containing the requisite numbers of padding charac
+ters. The second such mode of output is called "Complete padded sequences". Here the user must know the maximum distance between the leftmost motif and the start of all the sequences that match. A trial run in which only the positions of matches are report
+ed is usually required. The user gives this maximum distance to the program. The program then writes a new file containing the full length of all matching sequences, again maximally gapped (including their left ends) so that they would all align if written
+ above one another. For both of these modes of output the files created are named "entryname" where "entryname" is the name given to the sequence in the sequence library. These modes are best used with the option "Report all matches" rejected, so that only
+ the best match for each sequence is reported. The sequences can be lined up using the sequence assembly program SAP.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select NIPL.\par 
+2.\tab Define "Name for results file."\par 
+3.\tab Select a library.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Select "Search whole library". The alternatives are "Search only a list of entries" and "Search all but a list of entries". The files containing the list of entries should contain one entry name per line, left justified.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Select "Results display mode" as "Inclusive". The alternatives include "Motif by motif", "Scores only", "Complete padded sequences" and "Padded sections".\par 
+6.\tab Accept "Report all matches". The alternative only shows the best match for each sequence.\par 
+7.\tab Define "Pattern definition file". The name of the file containing the pattern created using NIP. \par 
+\tab The program displays a textual description of the pattern and the expected number of matches per 1000 residues assuming an average nucleic acid composition.\par 
+8.\tab Define "Maximum pattern probability". The program will run much more quickly if none is given.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab Define "Minimum pattern score".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The search will start.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+The "exact match" motif class requires a consensus sequence. The "percentage match" motif class requires a consensus sequence and a cutoff score. The "score matrix" motif class requires a consensus sequence and a cutoff score.  The "weight matrix" searc
+h and the "complement of a weight matrix" only require the name of the file containing the matrix. The "inverted repeat" or "stem-loop" requires a stem length, minimum and maximum loop sizes,
+ and  a cutoff score using scores A-T = G-C = 2, G-T = 1. Note that if the user defines an inverted repeat as a "Reference motif" the "Relative position" can be defined from either its 5' or 3' ends. The "direct repeat" motif class requires a repeat length
+, the minimum and maximum gap between the two occurrences of the repeat, and a minimum score.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab The motif class "Exact match, defined step" is rarely used. A typical use might be to find a start codon followed, for some minimum distance, by no stop codons
+ in the same reading frame. The step would have the value 3 to keep the reading frame the same as that of the start codon, and the stop codon searches would be included using the NOT operator.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The details of the probabilty calculations are outside the scope of this article. They are quite rapid and are essential both for assessing the statistical significance of any matches found and for allowing meaningful cutoffs to be applied to patterns. 
+Obviously, in general, cutoff scores are inappropriate for patterns containing a mixture of motif classes.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+The program calculates the "Probability of finding the pattern" and the "Expected number of matches". The first figure is actually the product of the individual motif probabilities but the latter figure is more useful because it takes into account the a
+llowed variation in spacing between motifs and the length of the current sequence. In both cases the composition of the current sequence is also used so that different probabilities would be calculated for other sequences.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+The pattern definition system is very flexible. Assume that a laboratory has a large library of patterns stored in its computer. Different groups or users may want to screen their sequences against different subsets of a pattern library. Each group ther
+efore uses its own "File of pattern file names" which contains only the names of the pattern files that are relevant to their sequences. Of course a pattern may contain only one motif. Hence a library of patterns can include both simple and comp
+lex patterns. In the same way a laboratory may have a large library of weight matrices defining different motifs and different users may want to combine them in different ways to produce their own patterns. \par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R.  1988.  Methods to define and locate patterns of motifs in sequences.  {\i CABIOS} {\b 4(1)}\:53-60.\par 
+2.\tab Staden, R.  1989.  Methods for calculating the probabilities of finding patterns in sequences.  {\i CABIOS} {\b 5(2)}\:89-96.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Staden, R. 1990. Searching for patterns in protein and nucleic acid sequences.  (in) {\i Methods in Enzymology} R.F. Doolittle (ed.),  {\b 183}\:193-211  (Academic Press, New York).\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 8.  Searching for Restriction Sites\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Search for restriction sites and list them enzyme by enzyme\par 
+2.2\tab Search for restriction sites and list them by position\par 
+2.3\tab Search for restriction sites and list their names above the sequence\par 
+2.4\tab Search for restriction sites and plot their positions\par 
+2.5\tab Find restriction enzymes that cut infrequently\par 
+2.6\tab Producing a back translation from a protein sequence\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The program NIP contains a routine for finding and displaying the positions of the cut sites of restriction enzyme recognition sequences. Linear or circular sequences can be searched and the results can be listed in various forms or displayed graphically. 
+The recognition sequences to be searched for can be typed on the keyboard or read from files. The format of these files is given in note 1. At the end of the chapter we also describe how to pro
+duce back translations of protein sequences so that these routines can be used to search them for restriction sites.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Search for restriction enzyme sites and list them enzyme by enzyme\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Select "Input source" as "All enzymes file". A number of standard files are available and users may also have their own.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Accept "Search for all names". \par 
+4.\tab Select "Order results enzyme by enzyme".\par 
+5.\tab Accept "List matches".\par 
+6.\tab Accept "The sequence is linear". The alternative is circular.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Accept "Search for definite matches". The alternative is to search for possible matches in a sequence containing IUB redundancy codes.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The results will then appear in the form shown in figure 8.1 Each match is numbered and its enzyme name given, followed by the matching sequence with the cut site indicated by a ' symbol. The position of the cut site is given followed by the length of the 
+potential fragment ending at that site, followed by a list of fragments sizes sorted on length.\par 
+\pard\plain \li1160\ri1380\sl220\box\brsp100\brdrth \f4\fs16 Matches found=   3\par 
+\pard \li1160\ri1380\sl220\box\brsp100\brdrth     Name          Sequence          Position Fragment length\par 
+  1 AccII         cg'cg                  313    312     51\par 
+  2 AccII         cg'cg                  364     51    188\par 
+  3 AccII         cg'cg                  552    188    312\par 
+                                                449    449\par 
+Matches found=   6\par 
+    Name          Sequence          Position Fragment length\par 
+  1 AciI          cc'gc                  503    502     12\par 
+  2 AciI          gc'gg                  553     50     12\par 
+  3 AciI          gc'gg                  714    161     50\par 
+  4 AciI          gc'gg                  872    158    105\par 
+  5 AciI          gc'gg                  884     12    158\par 
+  6 AciI          cc'gc                  896     12    161\par 
+                                                105    502\par 
+Matches found=   3\par 
+    Name          Sequence          Position Fragment length\par 
+  1 AcyI          gg'cgtc                698    697      5\par 
+  2 AcyI          gg'cgtc                765     67     67\par 
+\pard \li1160\ri1380\sl220\keepn\box\brsp100\brdrth   3 AcyI          ga'cgcc                996    231    231\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 8.1\tab Typical output from "List enzyme by enzyme".\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Search for restriction enzyme sites and list them by position\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search".\par 
+2.\tab Select "Input source" as "All enzymes file". \par 
+3.\tab Accept "Search for all names". \par 
+4.\tab Select "Order results by position".\par 
+5.\tab Accept "List matches". \par 
+6.\tab Accept "The sequence is linear".\par 
+7.\tab Accept "Search for definite matches". \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The results will then appear in the form shown in figure 8.2 Each match is numbered and its enzyme name given, followed b
+y the matching sequence with the cut site indicated by a ' symbol. The position of the cut site is given followed by the length of the potential fragment ending at that site.\par 
+\pard\plain \s6\fi-540\li560\sb240\sa60\sl280\tx560 \b\f20 2.3\tab Search for restriction enzyme sites and list their names above the sequence\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search".\par 
+2.\tab Select "Input source" as "All enzymes file". \par 
+3.\tab Accept "Search for all names". \par 
+4.\tab Select "Show names above the sequence".\par 
+5.\tab Reject "Hide translation".\par 
+6.\tab Accept "Use 1 letter codes".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Line length". This is the number of bases that will appear on each line of output. It must be a multiple of 30. \par 
+\pard\plain \li1640\ri1720\sl220\pagebb\box\brsp100\brdrth \f4\fs16    Name         Sequence             Position Fragment length\par 
+\pard \li1640\ri1720\sl220\box\brsp100\brdrth  1 HapII        c'cgg                       2      1\par 
+ 2 HpaII        c'cgg                       2      0\par 
+ 3 MspI         c'cgg                       2      0\par 
+ 4 MseI         t'taa                      14     12\par 
+ 5 HincII       gtt'aac                    15      1\par 
+ 6 HindII       gtt'aac                    15      0\par 
+ 7 HpaI         gtt'aac                    15      0\par 
+ 8 DsaV         'ccagg                     23      8\par 
+ 9 EcoRII       'ccagg                     23      0\par 
+10 TspAI        'ccagg                     23      0\par 
+11 ApyI         cc'agg                     25      2\par 
+12 BstNI        cc'agg                     25      0\par 
+13 MvaI         cc'agg                     25      0\par 
+14 ScrFI        cc'agg                     25      0\par 
+15 MaeIII       'gttac                     47     22\par 
+16 BsrI         actggt'                    49      2\par 
+17 MseI         t'taa                      55      6\par 
+18 MaeII        a'cgt                      63      8\par 
+19 SfaNI        gcatcaacaa'gata            86     23\par 
+\pard \li1640\ri1720\sl220\keepn\box\brsp100\brdrth 20 MaeII        a'cgt                      91      5\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 8.2\tab Typical output from "List by position".\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 8.\tab Accept "The sequence is linear".\par 
+9.\tab Accept "Search for definite matches". \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The results will then appear in the form shown in figure 8.3 The sequence is listed with a 3 phase translation underneath and every tenth base numbered. Above the sequence the positions of the cut sites of res
+triction enzymes are marked.\par 
+\pard\plain \s6\sb160\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Search for restriction enzyme sites and plot their positions \par 
+\pard\plain \s7\qj\fi-560\li560\sa80\sl260\tx560 \f20 1.\tab Select "Search".\par 
+2.\tab Select "Input source" as "All enzymes file". \par 
+3.\tab Accept "Search for all names". \par 
+4.\tab Select "Order results by position".\par 
+5.\tab Reject "List matches". \par 
+6.\tab Accept "The sequence is linear".\par 
+7.\tab Accept "Search for definite matches".\par 
+\pard\plain \s4\qj\sa80\sl260 \f20 The results will then appear in the form shown in figure 8.4. Each enzyme that has a match is named at the left edge of the display and its cut sites are marked by short 
+vertical lines. If the display window fills up the bell will ring. Users may then take a screen dump before typing return. The program then displays the message " ? Restart plotting from bottom of frame". To do so type return. To quit type !.\par 
+\pard\plain \li1200\ri1240\sl220\pagebb\box\brsp100\brdrth \f4\fs16 Search for restriction enzyme sites\par 
+\pard \li1200\ri1240\sl220\box\brsp100\brdrth Select operation\par 
+X  1 Search\par 
+   2 List enzyme file\par 
+   3 Clear text\par 
+   4 Clear graphics\par 
+? Selection  (1-4) (1) =\par 
+Select input source\par 
+   1 All enzymes file\par 
+X  2 Six cutter file\par 
+   3 Four cutter file\par 
+   4 Personal file\par 
+   5 Keyboard\par 
+? Selection  (1-5) (2) =1\par 
+? Search for all names (y/n) (y) =\par 
+ Select results display mode\par 
+X  1 Order results enzyme by enzyme\par 
+   2 Order results by position\par 
+   3 Show only infrequent cutters\par 
+   4 Show names above the sequence\par 
+? Selection  (1-4) (1) =4\par 
+? Hide translation (y/n) (y) =n\par 
+ ? Use 1 letter codes (y/n) (y) =\par 
+ ? Line length (30-90) (60) =\par 
+? The sequence is linear (y/n) (y) =\par 
+ ? Search for definite matches (y/n) (y) =\par 
+\par 
+ HapII\par 
+ HpaII\par 
+ MspI        MseI\par 
+ .           .HincII\par 
+ .           .HindII\par 
+ .           .HpaI    DsaV\par 
+ .           ..       EcoRII\par 
+ .           ..       TspAI\par 
+ .           ..       . ApyI\par 
+ .           ..       . BstNI\par 
+ .           ..       . MvaI\par 
+ .           ..       . ScrFI                 MaeIII\par 
+ .           ..       . .                     . BsrI  MseI\par 
+ccggttagactgttaacaacaaccaggttttctactgatataactggttacatttaacgc\par 
+        10        20        30        40        50        60\par 
+ P  V  R  L  L  T  T  T  R  F  S  T  D  I  T  G  Y  I  *  R\par 
+  R  L  D  C  *  Q  Q  P  G  F  L  L  I  *  L  V  T  F  N  A\par 
+\pard \li1200\ri1240\sl220\keepn\box\brsp100\brdrth    G  *  T  V  N  N  N  Q  V  F  Y  *  Y  N  W  L  H  L  T  P\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 8.3\tab Typical dialogue and output for a "Names above the sequence" search.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Finding restriction enzymes that cut infrequently\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search".\par 
+2.\tab Select "Input source" as "All enzymes file". \par 
+3.\tab Accept "Search for all names". \par 
+4.\tab Select "Show only infrequent cutters".\par 
+5.\tab Define "Maximum number of cuts".\par 
+6.\tab Accept "The sequence is linear".\par 
+\pard\plain \li160\ri200\sl220\keepn\box\brsp100\brdrth \f4\fs16 {{\pict\macpict\picw430\pich254 
+0b99ffffffff00fd01ad1101a0008201000affffffff00fd01ad090000000000000000310000002400fa01ac9800240000000000b7011f0000000000b7011f0000002400fa01ac000102dd001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001
+f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000210000006007fdfff00fc06f5000020ea0006f5000020ea0006f5000020ea0006f5000020ea0006f5000020ea0006f5000020ea0006007f
+dfff00fc06fe000008e10006fe000008e10006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc06fe000008e10006fe000008e10006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc040020de00040020de00040020de00040020de00040020de00040020de0006007fdfff00fc1402
+000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000010ff001402000080fd000001f00002100040fc000210000006007fdfff00fc06fb000004e40006fb000004e40006fb00
+0004e40006fb000004e40006fb000004e40006fb000004e40006007fdfff00fc0af8000080fe000080eb000af8000080fe000080eb000af8000080fe000080eb000af8000080fe000080eb000af8000080fe000080eb0006007fdfff00fc040020de00040020de00040020de00040020de00040020de00040020de0006007f
+dfff00fc0602000040e0000602000040e0000602000040e0000602000040e0000602000040e00006007fdfff00fc0602000040e0000602000040e0000602000040e0000602000040e0000602000040e00006007fdfff00fc0602000040e0000602000040e0000602000040e0000602000040e0000602000040e00006020000
+40e00006007fdfff00fc06eb000040f40006eb000040f40006eb000040f40006eb000040f40006eb000040f40006007fdfff00fc06eb000040f40006eb000040f40006eb000040f40006eb000040f40006eb000040f40006eb000040f40006007fdfff00fc06eb000010f40006eb000010f40006eb000010f40006eb000010
+f40006eb000010f40006007fdfff00fc040020de00040020de00040020de00040020de00040020de00040020de0006007fdfff00fc06eb000008f40006eb000008f40006eb000008f40006eb000008f40006eb000008f40006007fdfff00fc06fe000020e10006fe000020e10006fe000020e10006fe000020e10006fe0000
+20e10006fe000020e10006007fdfff00fc06fe000020e10006fe000020e10006fe000020e10006fe000020e10006fe000020e10006007fdfff00fc06fe000020e10006fe000020e10006fe000020e10006fe000020e10006fe000020e10006007fdfff00fc06eb000008f40006eb000008f40006eb000008f40006eb000008
+f40006eb000008f40006eb000008f40006007fdfff00fc06eb000010f40006eb000010f40006eb000010f40006eb000010f40006eb000010f40006007fdfff00fc06fe000008e10006fe000008e10006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc06fe000008e10006fe000008e10006fe000008e1
+0006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc06fa000080e50006fa000080e50006fa000080e50006fa000080e50006fa000080e50006007fdfff00fc06eb000008f40006eb000008f40006eb000008f40006eb000008f40006eb000008f40006eb000008f40006007fdfff00fc06fe000008e100
+06fe000008e10006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc06fe000008e10006fe000008e10006fe000008e10006fe000008e10006fe000008e10006fe000008e10006007fdfff00fc02dd00a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00020000000e00
+252c000800140554696d65730300140d00092e0004000001002b010b055472753949a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a000c0000001800252a0a055366614e49a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a0014000000
+2000252a08055363724649a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a001c0000002800252a08044d766149a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00260000003200252a0a044d737049a00097a10096000c010000000200
+000000000000a1009a0008fffd00000011000001000a002e0000003a00252a08044d736549a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00370000004300252a09064d6165494949a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00
+400000004c00252a09054d61654949a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00490000005500252a09054d70614949a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00510000005d00252a08044d706149a00097a10096000c01
+0000000200000000000000a1009a0008fffc00000011000001000a00590000006500252a080648696e644949a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00630000006f00252a0a0648696e634949a00097a10096000c010000000200000000000000a1009a0008fffc000000
+11000001000a006b0000007700252a080648696e503149a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00750000008100252a0a0548696e3649a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a007d0000008900252a080448686149a0
+0097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00870000009300252a0a054861704949a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a008f0000009b00252a08054861654949a00097a10096000c010000000200000000000000a1009a00
+08fffd00000011000001000a0098000000a400252a090645636f524949a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00a1000000ad00252a090745636c31333649a00097a10096000c010000000200000000000000a1009a0008fffc00000011000001000a00a9000000b50025
+2a080444736156a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00b2000000be00252a090444646549a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00ba000000c600252a080443666f49a00097a10096000c01000000020000000000
+0000a1009a0008fffc00000011000001000a00c3000000cf00252a09054273744f49a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00cc000000d800252a09054273744e49a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00d4000000
+e000252a080442737249a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00de000000ea00252a0a084273703134334949a00097a10096000c010000000200000000000000a1009a0008fffd00000011000001000a00e6000000f200252a08054273694c49a00097a10096000c0100
+00000200000000000000a1009a0008fffd00000011000001000a00f0000000fc00252a0a0441707949a00097a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 8.4\tab Typical output from "Plot positions".\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 7.\tab Accept "Search for definite matches". \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The names and number of cut sites of all enzymes that cut less than or equal to the "Maximum number of cuts" will then be displayed.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.6\tab Producing a back translation from a protein sequence \par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The routine for producing back translations is contained in the program PIP. It back translates protein sequences into DNA using the standard genetic code.  The translation can use either the IUB symbols or a set of codon preferences. If a set of codon pre
+ferences is used they must conform to the format of codon tables pr
+oduced by the nucleotide interpretation program, and the back translation will contain the favoured codons. If, for any amino acid there is no favoured codon, the IUB symbols will be employed. The program will plot the redundancy along the sequence and hen
+ce can be used to find the best sequences to use as primers. The DNA sequence can be saved to a file and analysed using the nucleotide analysis program. \par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Back translate".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "No codon preference". The alternative will cause the program to ask for "File name of codon table", which should be in the same format as those created by the nucleotide interpretation program.
+\par 
+3.\tab Reject "Plot redundancy". The alternative will ask for a window length to use for the plot. The window length is in codons. A plot will appear in which the best primers are sited at the peaks and the worst at the troughs.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Accept "Save DNA to disk"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "File name for DNA sequence". This file can later be read into program NIP and all the searches described above employed.\par 
+\pard\plain \s5\sa60\sl320\tx560 \b\f20\fs28 \page 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+The file containing the definitions of the restriction enzymes names and their recognition sequences uses the standard IUB redundancy symbols and has the following format. Each name is followed by a /, then each of its recognition sequences is followed 
+by a /. The last recognition sequence for each enzyme is followed by //. The cut sites should be indicated by a '. If the cut site is not contained in the recognition sequence, the recognition sequence should be extended by sufficent N symbo
+ls. For example the two lines from the standard file shown below define the enzymes Alw21I and Alw26I. These files are kindly updated each month by Dr. Rich Roberts.\par 
+\pard \s7\qj\li1720\sa120\sl280\tx1720 Alw21I/GWGCW'C//\par 
+Alw26I/GTCTCN'NNNN/'NNNNNGATCC//\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab 
+To search for a subset of the restriction enzymes in a file the user should reject "Search for all names" and the program will ask for the names of the enzymes wanted and extract their recognition sequences from the file. Alternatively, if a user was al
+ways using the same subset, then a file containing only those enzymes could be created by editing the standard file. This file would then be selected as "Personal file" for "Input source".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The routine also allows names and recognition sequence to be entered on the keyboard. This is selected as "Keyboard" for "Input source", and the program will prompt for names and their recognition sequences. In this way the routine can be used to search
+ for exact matches to any short sequence. Again IUB redundancy codes can be used.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab When back translating 
+from proteins it is often useful to produce a back translation using both a table of codon preferences and one using the IUB symbols. This is because the restriction enzyme search program can distinguish between definite and possible cuts in the sequence. 
+Those matches that the program terms "definite matches" are ones in which the specification of the recognition sequence corresponds exactly to that of the back translation. The program will also find what it terms "possible matches" which are ones that dep
+end on the particular codons chosen for each amino acid. These are sites at which recognition sequences could be engineered to produce a cut in the DNA without changing the amino acid, but which are not necessarily found in the original sequence.  \par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 9.  Statistical and Structural Analysis of Nucleotide Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Calculating the base composition\par 
+2.2\tab Calculating the dinucleotide composition\par 
+2.3\tab Calculating the codon composition\par 
+2.4 \tab Creating a codon usage file\par 
+2.5\tab Plotting the base composition\par 
+2.6 \tab Searching for anomalous compositions\par 
+2.7\tab Search for anomalous word usage\par 
+2.8\tab Calculate codon constraint\par 
+2.9 \tab Searching for stem-loops\par 
+2.10\tab Searching for long range inverted repeats\par 
+2.11\tab Searching for long range repeats\par 
+2.12\tab Searching for repeated words\par 
+2.13\tab Searching for possible Z DNA\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this chapter we deal with performing simple statistical and structural analysis of nucleotide sequences and also describe some more unusual test
+s. We cover base, dinucleotide and codon compositions, potential amino acid compositions, and the relative frequencies of each base in each position of codons. We describe how to produce plots to show regions of unusual composition and to measure the codon
+ bias for a gene. In addition we describe a set of functions for finding "structures" in nucleotide sequences, including short range inverted repeats or stem-loops, long range inverted repeats, long range direct repeats, and Z DNA. All the methods are cont
+ained in the program NIP.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Calculating the base composition\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 \tab Select "Calculate base composition". The composition of the active region is shown.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.2\tab Calculating the dinucleotide composition\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 \tab 
+Select "Calculate dinucleotide composition". The dinucleotide composition of the active region and an expected dinucleotide composition is shown. The expected composition is calculated from the base composition assuming a random order of bases in the sequ
+ence. See figure 9.1.\par 
+\pard\plain \li1180\ri1440\sb200\sl220\box\brsp100\brdrth \f4\fs16          T             C             A             G\par 
+\pard \li1180\ri1440\sl220\box\brsp100\brdrth     Obs  Expected Obs  Expected Obs  Expected Obs  Expected\par 
+T   5.86   5.97   6.18   5.99   4.24   5.91   8.14   6.56\par 
+C   6.10   5.99   5.14   6.02   5.91   5.93   7.38   6.59\par 
+A   5.57   5.91   5.64   5.93   7.91   5.84   5.05   6.49\par 
+\pard \li1180\ri1440\sl220\keepn\box\brsp100\brdrth G   6.90   6.56   7.56   6.59   6.11   6.49   6.30   7.22\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa200\sl240\tx1140 \f21\fs20 Figure 9.1\tab The dinucleotide composition display\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Calculating the codon composition\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function counts codons, amino acid composition, protein molecular weights, hydrophobicity and base compos
+itions. Users select the segments of the sequence to be analysed. The segments can be defined on the keyboard or from an EMBL/GenBank feature table.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Calculate codon composition".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "Show observed counts". The alternative displays its codon tables so that the total for each amino acid sums to 100. This makes it easier to see any bias present in the codon usage.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Accept "Define segments using keyboard". The alternative is to use a feature table.\par 
+4.\tab Define "From". The start of the segment to be analysed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+Define "To". The end of the segment to be analysed. The results will be displayed as in figure 9.2 and then the program will again ask "From". The user should define a zero value for "From" when all segments of interest have been analysed. The program w
+ill then display a cummulative total for all the values it calculates.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The counts are broken down into several figures. Apart from the codon counts we see the base composition by position in codon expressed as a percentage of each bases own
+ frequency; base composition by  position in codon expressed as a percentage of the overall base composition of the segment; base composition expected for the observed amino acid composition if there was no codon preference; percentage deviations of the ob
+served amino acid composition from an average amino acid composition (1) ; the molecular weight and hydrophobicity (2) of the putative amino acid sequence.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4  Creating a codon usage file\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method writes a file of codon usage in the form of a codon tab
+le (see figure 9.2). Such tables can be used by several other methods contained within the programs. If required the user can start with an existing file and add to it.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Calculate a codon table and write it to disk".\par 
+2.\tab Accept "Start with empty table".\par 
+\pard\plain \li440\ri500\sl220\pagebb\box\brsp100\brdrth \f4\fs16 Calculate base, codon and amino acid compositions\par 
+\pard \li440\ri500\sl220\box\brsp100\brdrth ? Show observed counts (y/n) (y) =\par 
+ ? Define segments using keyboard (y/n) (y) =\par 
+\par 
+? From (0-8134) (0) =1\par 
+? To (1-8134) (8134) =1000\par 
+? + strand (y/n) (y) =\par 
+     ===========================================\par 
+     F TTT   5. S TCT   7. Y TAT   4. C TGT   2.\par 
+     F TTC  17. S TCC   3. Y TAC   5. C TGC   3.\par 
+     L TTA   3. S TCA   4. * TAA   3. * TGA   1.\par 
+     L TTG   4. S TCG   3. * TAG   0. W TGG   7.\par 
+     ===========================================\par 
+     L CTT   3. P CCT   6. H CAT   6. R CGT   3.\par 
+     L CTC   1. P CCC   1. H CAC   4. R CGC   2.\par 
+     L CTA   0. P CCA   4. Q CAA   3. R CGA   1.\par 
+     L CTG  36. P CCG   6. Q CAG   5. R CGG   4.\par 
+     ===========================================\par 
+     I ATT  12. T ACT   3. N AAT   6. S AGT   0.\par 
+     I ATC  13. T ACC   5. N AAC   7. S AGC   7.\par 
+     I ATA   1. T ACA   2. K AAA   9. R AGA   0.\par 
+     M ATG   9. T ACG   7. K AAG   3. R AGG   1.\par 
+     ===========================================\par 
+     V GTT   6. A GCT   5. D GAT   7. G GGT   9.\par 
+     V GTC   3. A GCC   6. D GAC   6. G GGC   9.\par 
+     V GTA   7. A GCA   2. E GAA   5. G GGA   5.\par 
+     V GTG   9. A GCG   7. E GAG   3. G GGG   3.\par 
+     ===========================================\par 
+ Total codons=      333.\par 
+         T          C          A          G\par 
+1     25.00      34.27      40.28      35.94\par 
+2     45.42      28.63      36.02      22.27\par 
+3     29.58      37.10      23.70      41.80\par 
+      -----      -----      -----      -----\par 
+=     100%       100%       100%       100%\par 
+1     21.32      25.53      25.53      27.63  = 100%\par 
+2     38.74      21.32      22.82      17.12  = 100%\par 
+3     25.23      27.63      15.02      32.13  = 100%\par 
+%     28.43      24.82      21.12      25.63 Observed, overall totals\par 
+%     29.65      23.25      23.95      23.15 Expected, even codons per acid\par 
+        A    C    D    E    F    G    H    I    K    L\par 
+       20.   5.  13.   8.  22.  26.  10.  26.  12.  47.\par 
+O-E % -27. -11. -25. -61.  71.  10.  38.  52. -36.  59.\par 
+         M    N    P    Q    R    S    T    V    W    Y\par 
+         9.  13.  17.   8.  11.  24.  17.  25.   7.   9.\par 
+O-E %   14. -10.   1. -39. -41.   6. -11.  15.  64. -15.\par 
+\pard \li440\ri500\sl220\keepn\box\brsp100\brdrth Total acids=  329. Molecular weight=    36493. Hydrophobicity=  64.7\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb80\sa280\sl240\tx1140 \f21\fs20 Figure 9.2\tab A worked example of calculating codon, base and amino acid compositions.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 3.\tab Accept "Show observed counts". The alternative is to have the counts for each amino acid type sum to 100.\par 
+4.\tab Accept "Define segments using keyboard". The alternative is to use an EMBL/GenBank feature table.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "From". The start of the segment to count over.\par 
+6.\tab Define "To". The end of the segment.\par 
+7.\tab Accept "+ strand". Alternatively the minus strand.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab The table will appear on the screen and the program will cycle round to step 5. When all segments have been defined a zero v
+alue for "From" will instruct the program to display on the screen a table which is the sum of all the individual tables.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab Define "Name for codon table file". Give the name of the file in which to save the final table. \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Plotting the base composition\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function plots the base composition for each "window length" of the sequence. The frequency of any combinations of bases can be plotted.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot base composition".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Select which combination of bases to plot. The default is A+T, but any single base or combination of bases can be used.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Select "Odd window length". This is the size of window over which each count is made, it is "odd" so that the plotted point exactly corresponds to the centre of each window. The count is made over the window and then the window is moved on by 1 base, an
+d the count repeated.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Plot interval". Especially when using long windows it is unnecessary to plot the results for every point along the sequence. A plot interval of 5 will mean the value for every fif
+th point will be plotted. The plot will appear in the form shown in figure 9.3\par 
+\pard\plain \ri-100\sb360\sl220\keepn \f4\fs16 {\plain \fs16 {\pict\macpict\picw451\pich82 
+343affffffff005101c21101a00082a0008c01000affffffff005101c2070000000022000100010000a000a0a100a400020de801000a0000000000000000070001000122004f000100b223000021000101c123000023004e23000021004f0001230000a000a301000affffffff005101c22300b221000101c123004e21004f
+0001a000a122003c000100ff2300fb2300fa2300f82300fa2300fb2300fe2300022300012300022301002300002300022300042300ff2300012300002300ff2300fe2300032300022300022300032300012300fd2302022300fe2300022300fe2300fd2300032300fd23000323000223000023000023000023000123000523
+00fe2301002300ff2300fe2300ff2300002300012300002300fd2300002300032300022300002300fe2300002300ff2300fd2302002300032300fd2300fe2300fe2300002300002300022300032300022300002300012300002300022300002301012300fd2300022300fe2300002300ff2300fe2300002300032300002300
+022300fe2300ff2300fe2300fd2302002300002300fe2300022300002300fe2300022300fe2300022300032300032300ff2300002300fe2300032302ff2300012300032300002300fa2300ff2300012300002300002300002300fb2300002300002300022300002300022301fe2300052300002300032300002300fe2300ff
+2300002300fe2300032300ff2300fd2300002300012300ff2302012300032300ff2300fd2300002300062300fe2300022300fe2300ff2300fe2300022300002300fe2300ff2301002300002300012300022300fe2300002300022300fd2300012300fd2300002300022300002300fe2300002302022300012300022300fe23
+00032300002300022300fe2300022300fe2300032300022300022300fe2300fe2301002300002300002300002300ff2300fb2300022300fe2300002300002300002300002300fd2300002300032300ff2302002300032300fe2300022300002300fe2300002300022300fe2300002300ff2300002300fe2300002300032302
+002300022300fd2300012300032300002300ff2300002300fd2300fc2300022300002300022300fe2300022301012300002300022300032300012300002300022300fe2300ff2300fd2300032300fe2300002300fd2300022302012300002300002300022300fd2300012300002300022300fe2300f8230002230002230000
+2300032300002300fd2300002300002300062300002301012300ff2300fe2300002300052300022300002300fc2300002300ff2300002300012300ff2300002300002300fe2302032300fd2300ff2300012300ff2300002300042300ff2300fe2300022300012300002300ff2300002300002301fe2300ff2300fe23000023
+00ff2300012300002300002300022300002300012300022300002300002300002302012300022300fe2300002300022300fe2300002300002300002300ff2300fe2300032300022300fd2300002302012300002300fd2300022300fe2300ff2300fe2300022300012300ff2300002300002300002300012300ff2300002301
+002300fe2300032300ff2300002300012300032300042300fe2300022300fe2300fe2300022300002300002302fd2300fe2300022300002300fe2300022300fe2300002300022300012300002300002300ff2300002300012301ff2300012300002300002300ff2300002300fe230000230000230002230000230000230000
+2300fd2300012302002300022300fd2300fe2300002300ff2300fe2300022300002300012300032300ff2300042300002300002302002300022300fe2300022300fd2300002300fb2300002300002300022300fe2300022300002300012300022300012301002300042300002300002300002300002300002300fe2300fe23
+00022300002300002300052300002300002302fe2300022300002300002300fe2300ff2300fc2300002300022300022300002300002300fe2300fe2300022301002300fd2300fe2300002300022300012300022300022300002300002300fe2300022300002300fb2300fe2302ff2300fe2300002300002300032300ff2300
+032300012300002300022300fe2300002300022300fe2300002301002300ff2300012300fd2300002300002300002300ff2300002300002300002300fd2300002300002300012300ff2302012300032300002300032300022300022300032300fe2300002300002300ff2300002300fc2300002300002302ff230000230001
+2300022300022300002300012300ff2300012300022300012300fd2300ff2300fe2300fe2301ff2300fe2300fd2300fd2300002300fe2300022300052300002300012300022300032300032300ff2300012300002300ff2300fe2300002300002302022300fe2300002300fe2300ff2300012300002300ff23000023000123
+00ff2300fd2300002300012300ff2300fd2301fc2300fd2300022300012300002300ff2300002300fd2300032300032300002300fe2300ff2300002300032302022300002300002300012300022300002300002300012300ff2300fe2300032300002300ff2300012300032301002300002300022300002300fe2300ff2300
+012300fd2300ff2300fe2300022300002300002300002300fe2302032300022300012300ff2300012300fd2300022300012300022300fb2300ff2300012300002300002300002302002300fd2300022300012300022300fe2300ff2300002300012300002300002300ff2300fd230001230000230000230103230000230000
+2300022300002300012300022300fe2300002300002300022300fd2300012300ff2300012302ff2300fe2300022300fe2300ff2300002300012300fd2300002300022300002300002300002300002300002301012300022300fe2300ff2300012300002300022300fe2300002300ff2300012300002300002300ff23000023
+02fe2300022300fe2300052300fe2300ff2300032300002300002300002300012300042300fb2300032300002300fd2301002300012300fd2300002300ff2300012300ff2300032300002300fd2300fd2300002300012300002300032302ff2300002300012300052300022300012300fb2300002300ff2300012300fd2300
+002300022300fe2300022302012300ff2300fe2300002300032300fd2300002300032300fc2300012300002300032300ff2300012300022301fd2300fe2300ff2300032300fd2300012300fd2300002300002300002300022300012300ff2300032300fe2302032300002300022300002300fe2300fd2300002300ff230001
+2300ff2300002300fe2300002300002300032300fd2301ff2300012300ff2300002300032300002300fb2300fd2300032300022300002300002300012300fd2300032302002300022300012300fd2300032300002300ff2300012300022300fe2300002300022300002300002300002300fe2300002300002300ff23000023
+02012300ff2300012300002300002300002300fd2300002300002300002300fd2300fe2300022300022300fe2301032300032300ff2300012300022300fe2300002300002300002300ff2300002300fe2300022300012300032302fd2300002300022300012300042300032300fd2300fe2300fe2300002300022300fe2300
+002300fc2300012300022301002300012300ff2300002300002300002300002300012300002300002300022300022300002300012300002302ff2300002300fe2300fd2300012300ff2300fe2300002300002300ff2300fe2300ff2300032300fe2300022301fe2300ff2300fe2300fe2300002300022300fe2300042300fe
+2300022300002300012300032300022300fe2302002300022300fe2300002300022300fe2300002300ff2300002300fe2300002300022300fe2300002300ff2300012302ff2300002300002300002300012300002300022300002300002300012300022300012300022300002300fe2301002300022300fe2300002300fd23
+00ff2300fd2300002300fe2300002300002300fd2300012300042300032302012300002300ff2300002300032300012300002300022300fd2300012300002300022300fe2300ff2300fe2301002300ff2300012300002300fd2300022300002300012300002300fd2300ff2300fe2300022300002300002302032300002300
+fe2300022300012300032300ff2300032300002300fe2300022300002300022300002300012300002301ff2300012300ff2300002300012300022300012300002300ff2300002300fd2300002300fe2300fe2300ff2302fd2300002300032300012300002300022300002300002300002300022300fe2300fe2300ff230001
+2300022302fe2300022300fe2300002300022300022300002300002300002300fc2300002300022300002300fe2300022301fe2300002300002300022300002300002300002300022300fc2300ff2300012300ff2300002300002300fe2302ff2300fd2300012300002300fb2300ff2300032300fe23000223000223000123
+00002300002300032300022300fd2301012300022300fd2300002300002300002300012300022300fe2300002300fd2300022300fe2300022300012300fc2300012300ff2300012300002302022300002300002300002300012300022300012300022300fd2300fe2300022300012300fd2300022300012302002300002300
+022300002300fd2300002300012300002300fd2300ff2300012300032300fd2300022300012301022300fe2300022300022300002300fe2300002300002300fd2300012300ff2300fe2300ff2300002300002300fe2302022300032300fe2300032300002300002300002300ff2300fe2300fd230000230002230000230001
+2300002301002300ff2300002300012300ff2300012300022300002300002300012300042300032300fb2300022300fe2302002300fe2300fd2300002300ff2300fe2300022300012300002300002300022300002300012300022300022301002300002300fe2300fd2300002300012300002300ff2300fe2300ff23000323
+00fe2300022300002300fe2302022300002300fe2300ff2300fe2300032300ff2300002300fe2300002300032300002300ff2300002300012300032302002300002300002300fd2300022300012300002300fd2300002300002300022300fd2300002300012300ff2301fe2300002300002300002300022300012300ff2300
+012300022300fd2300fe2300002300002300002300ff2302012300002300ff2300012300022300042300ff2300fe2300002300002300ff2300032300012300002300ff2301002300002300fe2300ff2300032300002300fe2300002300002300032300ff2300032300fe2300002300002302ff230000230000230000230000
+2300012300ff2300002300fe2300002300ff2300002300fd2300002300fe2300002301fe2300ff2300002300002300002300052300012300022300002300032300032300002300002300022300fe2302fd2300002300002300002300fb2300052300012300022300fe2300052300002300ff2300032300012300ff2302fd23
+00002300012300ff2300fe2300002300002300fb2300022300012300fd2300ff2300002300002300fd2301012300022300fe2300032300022300012300ff2300002300012300ff2300012300002300fd2300fd2300002300002302032300ff2300032300002300fe2300022300fe2300022300012300002300ff2300012300
+ff2300002300002300032300fe2300022300022300002301fe2300fe2300002300ff2300fd2300032300002300fe2300fd2300032300022300fe2300032300022300022302002300002300002300fe2300fe2300042300fe2300002300fe2300ff2300fe2300ff2300002300fe2300ff2301002300fe230000230002230003
+2300002300012300022300012300ff2300002300fd2300fe2300022300fe2302002300ff2300fc2300002300042300012300032300fd2300002300022300012300022300fe2300ff2300002300012302002300ff2300002300002300012300022300032300fe2300002300002300022300fe2300022300fd2300fe23010023
+00002300ff2300fe2300002300022300002300012300ff2300002300012300ff2300fe2300002300ff2302032300fd2300042300ff2300012300022300032300002300fe2300002300ff2300fb2300002300002300022301002300002300012300022300fd2300062300002300002300fe2300ff2300fe2300002300ff2300
+012300ff2300012302fd2300022300002300002300012300002300002300002300022300012300002300022300002300fe2300022302002300002300fe2300022300fe2300ff2300002300012300022300fe2300ff2300002300fe2300022300002301012300002300ff2300002300002300012300002300ff2300fe230000
+2300002300ff2300002300002300012302002300032300ff2300002300fe2300022300032300002300fe2300fd2300fd2300002300022300012300022301fe2300022300002300002300012300052300ff2300fe2300002300002300fe2300022300fe2300ff2300002300fe23020023000023000223000023000023000023
+00012300ff2300032300022300fe2300fe2300002300002300002301002300002300002300022300002300002300fe2300022300022300fe2300002300fe2300ff2300012300022302022300002300002300fe2300022300002300002300002300002300002300002300002300012300002300002302022300fb2300022300
+fe2300fd2300012300fd2300fd2300002300fd2300022300002300002300002300002300fe2300022300fc2300002300022300fe2301002300022300022300032300fd2300012300002300022300012300fd2300002300fd2300022300fe2300032302002300002300ff2300002300fe230000230000230000230002230000
+2300012300ff2300032300fe2300002301002300002300002300002300ff2300002300012300032300002300022300012300002300ff2300012300ff2302002300fe2300002300ff2300fd2300fc2300022300002300032300fd2300052300fe2300022300012300032301002300022300fe2300022300fd2300002300fe23
+00002300002300ff2300fd2300fe2300022300fe2300022300002302002300fe2300022300032300012300022300002300002300fe2300ff2300012300002300fa2300032300002302002300ff2300032300fe2300fd2300002300022300fe2300fe2300022300022300002300012300032300002301ff2300012300002300
+022300002300fd2300fd2300002300fe2300032300022300012300002300ff2300002302032300fe2300022300fe2300002300022300fb2300002300022300fe2300032300022300fe2300ff2300fe2301032300ff2300012300fd2300002300002300002300002300022300fe2300022300fd2300032300002300002300fe
+2302ff2300002300fc2300042300032300fe2300ff2300fe2300fe2300022300022300002300012300fd2300fe2301022300022300042300002300002300002300002300002300002300002300ff2300012300022300fd2300002302002300fe2300022300002300032300fe2300ff23000123000223000123000223000023
+00002300fe2300002302042300fe2300fe2300002300042300fe2300032300ff2300012300002300ff2300012300ff2300002300fc2301002300022300002300fd2300002300002300012300002300022300002300002300fd2300032300002300fe2300ff2302fe2300ff2300012300002300ff2300002300002300012300
+002300022300012300022300fb2300072300fc2301042300002300012300002300ff2300fe2300032300fd2300fb2300002300fd2300ff2300002300002300012302ff2300002300012300002300032300002300fd2300ff2300002300002300012300022300002300012300fd2300022300002300032300fe2300002302ff
+2300002300fb2300052300002300fe2300002300002300002300002300022300032300fe2300fc2300002300fe2301032300ff2300012300ff2300fb2300002300002300032300022300002300012300032300ff2300012300002302052300fe2300002300ff2300fd2300002300002300002300fe23000223000023000023
+00002300002300002301012300ff2300002300fe2300ff2300002300002300012300ff2300002300002300002300002300042300022302fe2300ff2300012300ff2300fe2300ff2300fe2300fd2300fe2300ff2300012300002300022300002300032301032300002300ff2300042300002300002300ff2300fe2300002300
+ff2300012300022300fe2300fd2300fd2300002302002300fe2300002300ff2300fb2300012300ff2300032300fe2300022300032300002300002300032300002302042300002300012300002300ff2300002300012300022300fb2300002300032300fd2300002300022300fe2301022300fe230002230000230001230003
+2300fd2300fd2300032300ff2300002300002300002300fe2300fe2302052300ff2300002300002300fc2300002300022300002300002300fd2300012300fd2300022300fe2300ff2300fc2301002300fd2300022300fe2300002300ff2300012300002300022300fe2300032300002300ff2300012300ff2302fe23000023
+00fd2300ff2300fe2300002300002300002300002300002300022300012300032300022300012301ff2300032300002300022300fe2300fe2300ff2300fd2300002300fe2300fd2300fe2300022300022300002302012300022300012300ff2300042300002300ff2300012300fd2300ff2300fe2300ff2300012300fd2300
+fe2302022300fe2300052300022300012300002300002300032300022300002300002300002300fd2300002300002300012301fd2300002300fd2300022300fe2300022300fe2300ff2300fe2300002300002300002300002300022300002302032300002300012300002300022300002300012300fc2300fe2300022300fe
+2300ff2300fe2300fe2300022301022300012300ff2300012300002300032300022300002300002300fe2300002300002300022300012300ff2300fb2300022300fe2300022300002302002300fe2300fd2300032300fd2300fe2300ff23000023000123000223000223000323000123000523000523020523000323000123
+00ff2300fe2300002300fd2300fd2300002300fe2300002300002300022300012300022300032301022300002300fe2300002300ff2300fe2300fe2300ff2300fe2300ff2300fe2300fd2300002300002300fe2302022300002300012300fd2300002300002300022300fe2300002300022300002300fe2300022300fd2300
+012301002300022300052300fe2300fd2300002300012300022300022300032300002300fd2300012300022300012302022300fd2300002300fe2300ff2300012300ff2300fe2300022300032300012300ff2300002300fe2300ff2300fe2301fe2300002300ff2300fd2300032300fd2300002300032300fe2300032300fd
+2300ff2300002300012300002302022300012300022300fe2300042300fe2300022300fe2300022300fc2300fd2300ff2300012300fd2300032302022300012300fd2300002300022300012300ff2300012300ff2300002300fb2300022300012300002300022301012300ff2300fe23000223000323000023000023000223
+00fe2300fe2300022300022300002300012300022302fe2300ff2300032300002300fe2300002300002300002300002300002300ff2300002300002300fe2300fe2300ff2301002300002300fe2300022300fd2300002300002300032300002300fd2300002300012300ff2300012300002302ff2300002300002300012300
+002300ff2300012300022300002300012300ff2300002300002300012300022301022300fc2300022300002300002300002300022300fe2300fd2300002300fd2300012300022300fe2300002302ff2300032300012300002300052300002300002300fd2300032300022300fb2300fe2300002300ff230000230203230000
+2300002300fd2300032300022300002300fe2300022300fe2300fe2300022300fd2300002300fe2300002301002300ff2300012300002300ff2300fe2300022300002300fe2300ff2300012300002300002300022300012302ff2300012300022300012300fd2300022300012300002300022300002300fd2300fe2300ff23
+00042300ff2300fd2300002300002300fe2300022301032300002300fd2300002300002300032300fe2300052300002300fd2300fe2300032300002300022300002302fe2300002300022300032300002300ff2300032300fe2300ff2300012300022300012300ff2300012300042300fc2301ff2300012300022300002300
+fe2300022300022300fe2300002300fe2300022300fe2300fd2300022300fe2302ff2300fc2300002300002300ff2300002300fd2300fe2300fd2300fd2300012300ff2300012300ff2300002302012300042300032300002300fd2300002300042300022300012300042300002300002300032300002300032301fe230000
+2300002300002300022300fe2300002300022300fe2300042300002300002300fe2300002300fe2302ff2300fe2300002300ff2300002300fe2300002300002300002300002300002300022300fe2300022300002300032301fe2300002300022300012300fd2300022300012300fd2300ff23000323000023000023000023
+00032300002302002300fe2300022300022300fe2300002300022300fe2300fe2300002300042300002300fb2300fd2300002302fe2300052300fe2300002300fd2300002300022300012300022300012300fc2300012300ff2300002300012301022300fe2300022300012300002300ff2300fe2300022300002300012300
+002300ff2300002300002300012302042300fe2300fd2300fe2300032300002300002300002300022300002300032300022300002300fe2300002300ff2301002300012300ff2300fc2300fd2300022300002300002300012300002300002300002300002300022300002302022300fc2300fd2300022300002300fd230004
+2300ff2300fe2300fd2300002300022300fc2300022300022301fc2300ff2300032300022300012300fd2300002300fd2300012300002300ff2300002300002300012300042302fe2300032300002300002300002300ff2300fe2300002300fe2300002300fd2300ff23000123000223000123000223020023000223000123
+00002300022300fe2300022300002300012300fd2300002300002300022300032300022300fe2300fe2300002300002300002301002300022300fd2300012300022300002300002300022300002300fc2300002300002300002300ff2300012302ff2300fe2300002300022300002300002300032300fe2300ff2300032300
+022300fe2300002300002300002301fe2300022300022300012300ff2300002300002300012300fd2300022300002300fe2300002300002300022302002300002300002300012300022300fe2300ff2300fc2300002300fd2300022300fd2300002300fc2300042300fc2301022300022300002300fe230002230000230001
+2300ff2300012300022300002300032300002300022300fe2302002300fe2300022300fd2300012300ff2300012300002300002300022300002300002300022300012300ff2302fc2300002300002300022300fe2300ff2300032300fd2300002300002300002300002300fd2300012300002301ff23000023000023000323
+00fb2300032300032300ff2300002300012300022300002300002300022300012300fd2302002300032300fd2300fe2300022300fe2300022300fe2300022300fe2300002300002300042300fc2300042301002300fe2300002300022300002300002300002300002300012300ff2300012300022300fe2300022300fe2302
+022300012300fd2300022300fe2300ff2300032300fb2300032300ff2300fe2300022300fe2300022300032301fe2300032300fd2300002300002300002300022300fd2300002300012300fd2300022300032300012300fd2302032300002300002300002300022300fd2300002300fe2300002300ff2300012300ff230000
+2300fe2300022300fe2302032300fd2300022300fe2300fe2300042300002300002300002300002300002300fe2300022300002300002301fe2300fe2300022300022300012300022300002300002300fe2300022300002300002300002300fd2300002302002300002300fc2300022300fe23000223000223000023000023
+00002300fe2300022300012300022300fb2301002300fe2300022300002300022300002300012300ff2300012300022300012300002300002300fd2300ff2302002300fe2300fe2300002300002300ff2300002300032300002300002300022300002300fe2300002300032300ff2300012300002300022300002300012302
+022300012300fc2300002300002300fe2300ff2300fe2300022300fc2300002300ff2300012300ff2300fe2301ff2300002300fc2300022300fd2300002300012300002300002300052300fd2300002300022300032300fe2302002300002300032300022300002300002300022300fb2300002300012300022300fd230001
+2300ff2300002301fd2300012300022300fb2300022300012300002300fd2300022300012300002300002300022300fe2300022300fe2302ff2300002300fe2300002300022300fe2300002300fe2300ff2300012300002300002300002300fc2300012301002300002300002300022300032300fd23000123000423000123
+00022300002300002300fe2300ff2300002302fe2300002300002300fe2300002300002300022300022300002300032300fe2300ff2300012300002300022302fd2300002300012300fd2300002300032300002300002300ff2300002300fe2300002300022300002300fe2301002300002300002300032300022300002300
+012300ff2300002300002300002300fe2300ff2300002300002300002302002300012300002300002300ff2300002300002300012300002300002300ff2300fe2300022300002300002301fe2300002300fe2300022300002300002300fe2300002300002300042300002300fe2300002300022300002302002300fe230000
+2300fb2300022300012300ff2300012300fd2300002300022300fe2300022300fe2300002301032300ff2300052300032300002300032300002300022300002300fe2300022300fe2300fe2300ff2300002302002300002300012300002300002300022300002300002300032300032300002300ff2300002300fe23000223
+00002302002300002300fe2300002300ff2300012300002300002300002300022300fd2300fc2300002300022300002301022300fe2300002300002300002300002300002300fe2300fc2300fc2300022300fd2300032300fb2300032302fc2300fe2300022300032300fe2300002300032300042300002300012300022300
+012300ff2300fe2300022301fe2300fd2300032300002300032300ff2300032300fe2300fd2300002300002300fd2300022300012300032300ff2300032300fd2300012300002300042302fe2300fd2300002300012300ff2300002300fe2300ff2300fc2300022300022300fe2300fd2300fe2300032302ff230000230001
+2300ff2300002300012300022300022300002300012300002300002300002300002300032301ff2300002300fb2300002300002300002300002300fd2300fd2300012300ff2300012300022300fe2300022302032300022300032300002300002300fe2300032300002300042300fe2300fe2300fd23000223000123000223
+01fe2300002300002300fd2300022300002300032300fd2300fd2300002300fe2300002300022300fe2300fd2300fe2302ff2300002300012300002300022300002300012300002300002300022300032300fd2300fe2300ff2300fe2301022300002300012300002300ff2300002300032300032300022300012300022300
+002300022300002300012302ff2300fe2300fe2300022300fe2300002300fd2300022300012300002300002300042300fc2300022300022302012300002300ff2300002300002300fe2300fe2300022300002300fe2300ff2300002300012300ff2300002300002301002300fe2300002300022300002300fe2300ff230001
+2300ff2300012300fd2300fe2300002300ff2300012302042300012300032300002300ff2300fe2300ff2300042300ff2300fd2300fe2300fe2300ff2300032300002301032300ff2300fe2300002300022300002300012300002300002300002300002300022300002300002300002302012300ff2300002300fe2300ff23
+00032300fe2300002300002300fd2300002300052300002300fe2300002301ff2300fc2300002300022300fb2300002300002300032300002300002300022300022300012300032300022300022302fe2300022300fe2300022300002300012300002300ff2300012300002300ff2300012300022300012300fd2302002300
+ff2300fc2300ff2300002300fe2300ff2300fe2300032300002300002300022300002300012300022301022300012300fd2300fe2300002300002300002300002300002300fd2300ff2300012300002300ff2300002300fe2300032300002300fd2300022302012300002300002300002300022300fe230000230003230000
+2300fd2300fd2300fe2300ff2300032300022301002300002300002300002300062300052300002300032300fd2300002300002300002300002300002300fe2300002302ff2300012300022300002300012300022300fe2300032300002300002300002300fd2300002300002300002301ff2300fe2300ff2300002300fc23
+00022300002300fe2300022300002300022300002300002300032300fe2302022300002300fe2300032300ff2300012300ff2300002300012300ff2300012300002300ff2300fe2300032302002300ff2300012300ff2300012300002300032300ff2300fe2300ff2300012300ff2300012300ff2300002300fe2301002300
+ff2300012300022300fe2300002300002300002300002300fd2300032300022300fd2300002300012302fd2300022300012300ff2300002300fe2300002300022300fc2300022300fe2300002300042300012300002301002300022300012300002300002300002300ff2300fe2300002300022300fd2300012300fd230002
+2300012302ff2300002300fe2300fe2300002300ff2300032300fd2300fe2300022300fd2300002300012300002300002302002300002300002300ff2300032300032300fe2300002300042300fe2300022300002300012300022300fe2300002301002300ff2300fe2300022300002300fe23000023000223000023000023
+00fe2300022300002300002300012302002300ff2300012300022300fe2300022300002300fe2300002300ff2300002300002300fc2300022300fe2301ff2300012300ff2300fe2300002300002300022300012300002300022300fe2300022300002300fe2300ff2302fe2300ff2300032300032300fd2300012300ff2300
+002300032300022300002300fe2300fe2300042300002301fe2300022300fc2300002300022300fe2300ff2300002300fe2300002300ff2300002300002300fe2300022300012302ff2300002300032300032300022300002300002300002300002300002300fc2300002300ff2300fe2300002302002300002300fd230003
+2300022300012300002300002300022300fe2300022300022300fc2300022300fe2300002300042300fe2300022300012301022300002300fe2300fd2300022300002300fc2300002300ff2300fd2300002300012300032300002300002302002300002300002300022300022300fe2300002300002300fe23000223000523
+00fe2300022300fe2300002300ff2301012300ff2300fe2300002300022300fc2300022300002300002300002300002300022300fe2300002300022302fe2300fe2300fd2300002300002300022300fe2300022300002300fd2300032300012300042300002300002301032300002300012300002300ff2300fe2300002300
+002300ff2300fe2300022300fc2300022300022300012302ff2300002300002300012300ff2300012300fd2300fe2300002300042300fe2300fe2300022300fd2300002302012300ff2300012300ff2300fe2300022300002300012300002300022300fe2300002300fd2300032300ff2300002301002300002300002300fe
+2300ff2300012300002300002300032300002300ff2300012300ff2300012300002302022300fe2300002300ff2300fe2300022300012300002300ff2300fe2300002300022300fe2300002300002301ff2300fb2300012300022300022300fe2300002300002300022300012300032300022300fd2300012300022302fe23
+00022300002300fe2300ff2300012300022300002300fe2300002300002300022300fe2300002300002301022300022300032300fe2300ff2300012300022300012300002300002300ff2300fe2300ff2300012300ff2300fc2302002300002300002300fd2300ff2300012300022300012300002300022300052300fe2300
+002300032300002302ff2300032300002300fe2300fd2300002300002300022300002300002300fe2300002300022300012300ff2301012300ff2300fd2300002300002300002300fe2300022300fc2300ff2300fd2300fe2300fe2300042300fc2302002300ff230003230000230005230003230002230000230001230000
+2300022300fd2300002300002300002300fb2301002300fe2300ff2300002300032300012300ff2300002300032300022300fe2300032300032300ff2300fe2302022300012300ff2300fd2300012300ff2300fe2300022300012300ff2300002300012300032300fc2300012300ff2300fc2300fd2300fd23000023020023
+00fd2300032300002300022300032300002300052300012300002300022300fe2300032300fc2300002301012300fd2300002300fe2300002300022300fd2300012300022300fe2300022300fe2300002300ff2300002302002300fe2300022300002300012300022300fd2300002300002300032300002300fe2300002300
+ff2300012300fd2301032300022300022300002300032300002300002300002300012300002300fd2300032300fc2300002300002302002300002300002300012300022300fd2300032300fe2300ff2300012300002300ff2300fe2300002300002301002300002300032300fd2300fe2300022300022300fe2300fe230005
+2300fd2300002300022300012300002302ff2300002300fe2300fe2300002300042300002300fc2300022300022300032300032300002300032300002300fe2302002300002300002300fc2300fe2300fd2300fe2300fd2300ff2300012300ff2300002300032300fe2300022301fe2300002300ff23000023000023000023
+00fe2300002300002300032300002300002300002300022300012302ff2300002300002300fe2300032300022300fe2300ff2300fe2300022300002300fd2300002300fb2300fd2301002300fd2300012300002300022300fe2300032300032300052300032300022300002300002300022300012302002300ff2300002300
+fc2300042300fe2300fe2300ff2300fe2300fa2300fd2300fe2300002300022300002300002301002300012300002300032300ff2300012300ff2300fd2300002300fe2300022300002300002300002300fe2302002300032300002300002300ff2300012300ff2300002300002300032300fe230002230003230000230000
+2302002300032300002300ff2300fe2300fe2300fc2300012300022300012300002300022300002300052300002301002300fe2300ff2300fe2300002300002300fe2300ff2300032300022300fe2300002300002300fe2300002302002300002300ff2300002300032300032300ff2300fe2300032300002300002300fd23
+00002300002300002300032300002300ff2300002300002300002301012300002300022300fd2300002300fc2300022300fe2300052300022300002300002300002300012300002302002300002300002300ff2300002300002300012300ff2300012300022300fe2300fd2300ff2300fe2300002302fd2300fe2300032300
+002300022300fe230002230003230000230008230003230003230005a0008da00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa340\sl240\tx1140 \f21\fs20 Figure 9.3\tab A typical base composition plot. This is an A+T plot for bacteriophage Lambda and shows that one half is A+T rich and the other G+C rich.\par 
+\pard\plain \s6\sb240\sa100\sl280\tx560\tx860 \b\f20 2.6\tab Searching for anomalous compositions\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This "search" is performed by comparing a standard composition against each segment of the sequence and plotting the difference. The difference between the observed and expected composition at each point is expressed as the chi-square value.
+ Any one of the base, dinucleotide or trinucleotide compositions can be used as the standard. No expected level of divergence is used so the program always displays the results so that the plots fill the alloted space on the screen. At the end the observed
+ range is displayed.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot dinucleotide composition differences as chi squared". Alternatively select base or trinucleotides.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Start". Define the position of the first base to be used in the standard.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "End". Define last base of the standard. The default standard region is the whole sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Odd window length". \par 
+5.\tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 9.4\par 
+\pard\plain \sb200\sl220\keepn \f4\fs16 {{\pict\macpict\picw448\pich119 
+06f6ffffffff007601bf1101a0008201000affffffff007601bf0900000000000000003100000000007501be98002400000000004e012000000000004e011f00000000007501be000102dd0006007fdfff00fc0a0040fc000002e50000040a0040fc000002e50000040a0040fc000003e50000040a0040fc000003e5000004
+0a0040fc000003e50000040b0040fc00010380e60000040b0040fc00010280e60000040b0040fc00010280e60000040b0040fc00010240e60000040b0040fc00010240e60000040d0040fc0003027ffff8e80000040d0040fc000302000008e80000040d0040fc000302000008e80000040d0040fc000302000008e8000004
+0d0040fc000302000008e80000040d0040fc000302000008e80000040d0040fc000304000008e80000040d0040fc000304000004e80000040d0040fc000304000004e80000040d0040fc000304000004e80000040d0040fc000304000004e80000040d0040fc000304000004e80000040d0040fc000304000004e80000040e
+0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc00030400
+0004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000004e9000102040e0040fc000304000002e9000102040f0040fc000304000002ea00028002040f0040fc000304000002ea00028002040f0040fc000304000002ea0002a006040f0040fc000304000002ea00
+02a00604130040fc000304000002f6000080f60002a00604130040fc000304000002f6000080f60002a00604130040fc000304000002f6000080f60002a00544140040fc000308000002f700010180f60002a00564150040fc000308000002f700010180f70003017005641a0040fc000308000002fd000003fc00010180f8
+000420015005441a0040fc000308000002fd000003fc00010140f8000420015005441a0040fc000308000002fd000005fc00010240f8000420015005441e0040fe000620000800000202fe000005fd0002400240f8000430015005841e0040fe000620000800000202fe000005fd0002400240f8000430015005841f0040fe
+000620000800000202fe00010480fe0002400240f8000450015005841f0040fe0011200008000001060002020480000001c00240f8000450015005841f0040fe0011300008000001060002020480000001c00240f800045001100584230040fe0011300008000001050002020480000001c00440fd000018fd000450011005
+84230040fe0011300008000001050006060480020001a00440fd000014fd00044801080584230040fe0011300008000001050006060480020001200440fd000017fd0004c801080984241540018000500008000001090006060880060001200440fd0009110008800088020809842415400160005000080000010900050608
+80050002200420fd0009110008800108020809042523400120005000080000010900050608800500022004200180080021000880010802080904252340012000500008000001110005050880050002200420028008002080088001080208090425234002200050001000000111000905888005040221842002800810208018
+a001080208090425234002200050001000000111000905888009040221882002800810608018a00108020a1104250640022000500010fe001990800909884008870211882004401410608015600108020a1004250640021000500010fe001990800989884008950211882004401430808075600105020a1004250640021000
+500010fe001990800889484008f90212882004401428808045500105020a1004250640021000500010fe0019a080088948400809020a88200440142880804550010502061004250640021800900010fe0019a080108948400808840a48100440142880804550010704051004250640041800880050fe0019a08010b9504090
+08840a4810044012ac8040455101070405a004250640041802880050fe0019a080106050409000840a5010042022c50040835101068405a00425064004084b0800d0fe0019a08010605048900094065010082021430040821281028405a00425064004064d0dffa0fe0019a04010403048d000b40450104831e1030044800e
+8900840560042506780406b4020020fe00196040204020555000b804700c48124003004480088900840560042506480805b4020020fe000d40402040203560006804100ca80cfe00087a80085600880040042203444805b0fb000d405fa0000033600048000003b80cfe00080a80005600d000400406007fdfff00fc02dd00
+a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 9.4\tab An anomalous composition plot. This shows an immunoglobulin switch region and the plateau corresponds to a segment composed entirely of A and G bases.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.7\tab Search for anomalous word usage\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function is designed to examine the abundances of short words in a nucleotide sequence to see if particular ones are either under or over repre
+sented (3). It compares the observed and expected frequencies and plots them for each segment of the sequence. There has been some work on the relative abundances of CG dinucleotides in eukaryotic sequences (e.g. reference 4) and this routine can be used t
+o examine such biases or any others that might be of interest.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot observed-expected word usage".\par 
+2.\tab Define "String". That is the word to search for. The default is CG.\par 
+3.\tab Define "Odd window length".\par 
+4.\tab Define "Plot interval".\par 
+5.\tab Define "Maximum plot value". Define the maximum expected value for the plot.\par 
+6.\tab Define "Minimum plot value".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 9.5.\par 
+\pard\plain \ri-60\sb200\sl220\keepn \f4\fs16 {\plain \fs16 {\pict\macpict\picw453\pich122 
+0800ffffffff007901c41101a00082a0008c01000affffffff007901c4070000000022000100010000a000a0a100a400020de801000a000000000000000007000100012200770001008a23000021000101c32300002300762300002100770001230000a000a301000affffffff007901c423008a21000101c3230076210077
+0001a000a120003b0001003b01c322003b00011a082300022302002300fe2302022300fe2301fe2300022300fe2302022300fe2301022300002302002300002300fe2302022300002301022300002302fc2300042300fe2301fe2300ff2302012300022301002300fd2300002302002300002302002300002301fe23000223
+00002302fe2300022301fd2300032300002302fd2300002302fe2300002300002301ff2300002302fe2300002301002300002300022302012300022301002300012302002300002300022302002300002301002300002302002300002301002300012300022302fe2300022302fe2300002301002300022300002302fe2300
+022301002300002300fe2302022300002301002300002300022302002300012302002300022301002300002300012302002300022301fe2300002302022300002300002302002300002301002300032302002300002300ff2301012300002302ff2300002301002300002300012302ff2300002302012300002301022300fe
+2300002302022300fe2300002301fd2300002302022300002302012300002300ff2301fc2300002302002300ff2301012300002300002302022300fd2302002300fe2301ff2300032302fe2300002300022301002300012302002300002301ff2300002300002302002300012302ff2300012301ff2300fe23000223020123
+00002301002300022300002302fe2300022302fd2300fe2300002301022300002302002300002301002300fe2300022302002300012301002300022302fe2300002300002302002300002301002300022302002300002300002301002300002302002300002302002300032300fb2301002300022302002300022301012300
+002300022302002300002300002301012300022302012300ff2302002300002300fe2301002300002302002300002301002300022300002302002300fe2302022300012301ff2300002302fe2300ff2300002301fe2300022302fe2300002301ff2300002300002302002300fe230200230000230100230000230002230201
+2300ff2300012301022300002302fe2300002302002300002300002301022300fe2302fd2300002301002300002300022302002300fe2301022300012302022300002300fe2302022300002301002300fe2302ff2300002300012301ff2300042302002300002302002300ff2300012301fd2300002302022300002301fe23
+00002300002302002300002300002302002300002301002300002302002300ff2300002301fe2300022302fe2300022301fe2300fe2300002302002300022302fe2300022301fe2300ff2302fe2300022300002301002300002302002300012302022300002300fe2301022300fe2302fd2300022301002300fe2300002302
+022300002300002301012300ff2302fe2300032302002300ff2300002301012300ff2302fe2300002301002300002300002302ff2300012302002300ff2301002300012300002302fd2300022301002300fe2302032300002300002301022300fe2302022300012302ff2300002300012301ff230000230200230000230000
+2301002300012302ff2300002300012302002300ff2301002300002302012300002300022301002300002302002300002301022300002302012300002300002302ff2300012301022300fe23020023000023000023010023000223020023000123020223000123000023010223000023020023000023010023000023000223
+02012300002300022301002300002302002300012302002300002300ff2301002300002302002300002301012300ff2300002302002300002302012300002301002300002300002302002300002301ff2300002302002300012300002301ff2300fe2302032300ff2302002300002300002301002300002302002300fe2300
+002301022300012302002300002300ff2302002300002301002300002302002300002300012301002300002302002300002302022300fd2301fe2300022300012302ff2300012301002300ff2302002300012300fd2301022300fe2302002300002302022300002300002301fe2300002302fd2300fe2300002301022300fe
+2302002300002300022302002300032301ff2300002302002300012300ff2301012300002302032300002301022300022300fb23020123000023020023000023010023000223000023020023000023010223000023020023000023000123020023000023010223000023020023000023000023010023000023000023020023
+00012301002300002302ff2300012300002302022300002301fe2300022302002300012300002301042300002302002300012302002300002301002300002300002302002300002301002300ff2302fe2300fe2300002301ff2300002302032300fe2302022300002300fe2301ff2300002302002300002300fe2301002300
+002302002300fd2300ff2302002300012301fb2300002302002300fd2300ff2301fe2300032302ff2300002301fe2300fd2300012302ff2300012302fd2300ff2301002300002300fe2302ff2300fe2301002300002302002300fe2300002302002300002301002300ff2302002300fe2300ff2301fe2300fd2300002302fe
+2300ff2302fe230000a0008da00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 9.5\tab 
+A plot of anomalous word usage. This shows a plot of CG usage for the Human CMV immediate-early region. The frequency of CG is much lower than would be expected from the composition.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.8\tab Calculate codon constraint\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method measures the level of constraint imposed on a sequence by coding for a protein. The codon constraint is the difference between the observe
+d codon improbability and the mean improbability for a sequence of the same composition. That is it is a measure of the codon bias and the program performs the calculation over windows of length 99 codons. See reference 5. The user can select segments to a
+nalyse either by defining them on the keyboard or by using an EMBL/GenBank feature table. The result for each selected segment, which is simply a single number, is displayed.\par 
+\pard\plain \s7\qj\fi-560\li560\sa80\sl280\tx560 \f20 1.\tab Select "Calculate codon constraint".\par 
+2.\tab Accept "Define segments using keyboard".\par 
+3.\tab Define "From". The start of the segment.\par 
+4.\tab Define "To". The end of the segment.\par 
+5.\tab Accept "+ strand".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The result will be displayed, and the program will ask for the next segment to be defined. \par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.9\tab Searching for stem-loop structures\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This routine finds simple putative stem-loop structures having a minimum number of base pairs in their stems. Results can be listed or plotted.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search for hairpin loops".\par 
+2.\tab Define "Minimum loop size".\par 
+3.\tab Define "Maximum loop size".\par 
+4.\tab Define "Minimum number of base pairs"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Reject "Plot results". The alternative writes out the stem-loops as shown in figure 9.6. The plotted output marks the position of each stem, the height of the mark showing the length of the stem.\par 
+\pard\plain \li3480\ri3940\sb200\sl220\box\brsp100\brdrth \f4\fs16       g\par 
+\pard \li3480\ri3940\sl220\box\brsp100\brdrth      g.t\par 
+     t.g\par 
+     c-g\par 
+     a-t\par 
+     t.g\par 
+     t.g\par 
+     g-c\par 
+     t.g\par 
+     g.t\par 
+     g.t\par 
+     t.g\par 
+     t.g\par 
+     g-c\par 
+     t.g\par 
+tggcga gttttaa\par 
+\pard \li3480\ri3940\sl220\keepn\box\brsp100\brdrth    843\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 9.6\tab A typical textual display from the routine for finding simple hairpin loops.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.10\tab Searching for long range inverted repeats\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method finds inverted repeats. It allows for no mismatches, insertions or deletions within the matching segments.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find long range inverted repeats".\par 
+2.\tab Accept "Plot results". The alternatve lists out all the matching segments.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "Start". The beginning of the region to analyse. In general the whole sequence will be analysed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "End".\par 
+5.\tab Define "Minimum inverted repeat". The length of the minimum match.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The results will now be plotted in an unusual way as shown in figure 9.7 in which the positions of matching segments are joined by rectangular lines.\par 
+\pard\plain \li100\sb200\sl220\keepn\box\brsp20\brdrth \f4\fs16 {{\pict\macpict\picw445\pich118 
+0448ffffffff007501bc1101a0008201000affffffff007501bc0900000000000000003100000000007401bb98001e00000000003d00f000000000003d00ec00000000007401bb000102e3000701001fe6ff00c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c0070100
+18e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c007010018e60000c00a00
+7ff1ff00c0f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00b014018f2000040f60000c00e007ff5ff00e0fe000040f60000c00f017818fb0000
+01f4ff00f0fb0000c019017818fb000501c1800000e0fe000040fe00017030fb0000c019017818fb000501c1800000e0fe000040fe00017030fb0000c019017818fb000501c1800000e0fe000040fe00017030fb0000c01502781807f7ff00e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040
+fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01a02781804fc000501c1
+800000e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01a02781804fc000501c1800000e0fe000040fe00017030fb0000c01102781804fc000001f5ff01f030fb0000c01a02781804fc000e01c1800000f0006000400008007030fb0000c01a02781804fc000e01c180
+0000f0006000400008007030fb0000c01a02781804fc000e01c1800000f0006000400008007030fb0000c01a02781804fc000e01c1800000f0006000400008007030fb0000c01a02781804fc000e01c1800000f0006000400008007030fb0000c01a02781804fc000e01c1800000f0006000400008007030fb0000c01c1678
+1804000007ffffe1c1800000f0006000400008007030fb0000c01c167ffffc000007ffffe1c1800000f0006000400008007030fb0000c01c16781804000007ffffe1c1800000f0006000400008007030fb0000c01c16781804000007ffffe1c1800000f0006000400008007030fb0000c01c1678180407fe07ffffe1c18000
+00f0006000400008007030fb0000c01c1678180407fe07ffffe1c1800000f0006000400008007030fb0000c01c1678180407fe07ffffe1c1800000f0006000400008007030fb0000c002e300a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl20\tx1140 \f21\fs20 Figure 9.7\tab 
+A plot of direct or inverted repeats. Each matching segment is joined by a rectangular line. Here we show the direct repeats of at least 25 bases in a mouse immunoglobulin switch region.\par 
+\pard\plain \s6\sb120\sa40\sl280\tx560\tx860 \b\f20 2.11\tab Searching for long range repeats\par 
+\pard\plain \s4\qj\sa120\sl260 \f20 This method finds direct repeats. It allows for no mismatches, insertions or deletions within the matching segments.\par 
+\pard\plain \s7\qj\fi-560\li560\sa80\sl260\tx560 \f20 1.\tab Select "Find long range repeats".\par 
+2.\tab Accept "Plot results". The alternatve lists out all the matching segments.\par 
+\pard \s7\qj\fi-560\li560\sa80\sl260\tx560 3.\tab Define "Start". The beginning of the region to analyse. In general the whole sequence will be analysed.\par 
+\pard \s7\qj\fi-560\li560\sa80\sl260\tx560 4.\tab Define "End".\par 
+5.\tab Define "Minimum repeat". The length of the minimum match.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 The results will now be plotted in an unusual way as shown in figure 9.7 in which the positions of matching segments are joined by rectangular lines.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.12\tab Searching for repeated words\par 
+\pard\plain \s7\qj\sa120\sl260\tx540 \f20 \tab This function can be used to examine the frequencies of repeated words within a sequence. It finds all words that occ
+ur more than once. A "word" is a particular sequence of bases so we are dealing only with exact repeats. The user selects a minimum word length and the program finds all words of that length that occur more than once. Then it "follows" each repeated word u
+ntil it becomes unique. For each word length it can report the number of different repeated words, the number of occurrences of each word, and their actual sequences and positions.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 1.\tab Select "Examine repeats".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 2.\tab Define "Minimum word length". The maximum expected and observed word lengths are displayed.\par 
+3.\tab Define "Minimum word length for display of repeated word frequencies". The number of different repeated words of each length is listed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 4.\tab Define "Minimum frequency for display of repeated words". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 5.\tab Define "Minimum word length for display of repeated words".  All words occurring this number of times and of this given word length will be displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 \par 
+\pard\plain \sl220\box\brsp100\brdrth \f4\fs16 {\f22\fs18 Expected length of longest repeat    12\par 
+}\pard \sl220\box\brsp100\brdrth {\f22\fs18  ? Minumim word length (1-6) (6) = \par 
+Working\par 
+ Memory used in bytes   75164. Length of longest repeat    13\par 
+ ? Show repeat frequencies for words of at least length (6-13) (13) = 10\par 
+ For length    10 the number of different repeated words is    86\par 
+ For length    11 the number of different repeated words is    21\par 
+ For length    12 the number of different repeated words is     5\par 
+ For length    13 the number of different repeated words is     2\par 
+ ? Show repeats for words of length (6-13) (13) = 10\par 
+ ? Show repeats for words occuring with frequency (2-9999) (2) = 3\par 
+ aaggcatcat\par 
+ occurs at    276\par 
+ occurs at    969\par 
+ occurs at   6938\par 
+ gtctggcggc\par 
+ occurs at   1891\par 
+ occurs at   4714\par 
+ occurs at   7250\par 
+ ? Show repeats for words of length (6-13) (13) = 12\par 
+ ? Show repeats for words occuring with frequency (2-9999) (2) = \par 
+ gttactggtggt\par 
+ occurs at    641\par 
+ occurs at    851\par 
+ aaaggcatcatg\par 
+ occurs at    968\par 
+ occurs at   6937\par 
+ aaggcatcatgg\par 
+ occurs at    969\par 
+ occurs at   6938\par 
+ ttactggtggtg\par 
+ occurs at    642\par 
+ occurs at    852\par 
+ ctgctgggccgt\par 
+ occurs at   3477\par 
+ occurs at   6424\par 
+}\pard \sl220\box\brsp100\brdrth {\f22\fs18  ? Show repeats for words of length (6-13) (13) =!\par 
+}\pard \sl220 {\f22\fs18 \par 
+}{\f22\fs20 Figure 9.8  Typical output from "Examine repeats".\par 
+}\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 \par 
+2.13\tab Searching for possible Z DNA\par 
+\pard\plain \s4\qj\sa60\sl260 \f20 
+The program contains three algorithms for searching for sequences with the potential for forming Z DNA. In varying ways they look for segments of alternating purines and pyrimidines and they all plot their results. A typical result is shown in figure 9.9.
+\par 
+\pard\plain \sl220\keepn \f4\fs16 {{\pict\macpict\picw447\pich119 
+0512ffffffff007601be1101a0008201000affffffff007601be0900000000000000003100000000007501bd98002400000000004e012000000000004e011f00000000007501bd000102dd0006007fdfff00fc060040df000004060040df000004060040df000004060040df000004060040df000004060040df0000040600
+40df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040
+df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df0000040e0040f4000001fb000080f40000040e0040f4000001fb000080f40000040e0040f4000001fb000080f40000040e0040f4000001fb000080f40000040e0040f4000001
+fb000080f40000040e0040f4000001fb000080f40000040e0040f4000001fb000080f40000041202400040f6000001fb000080f600022000041202400040f6000001fb000080f600022000041202400040f6000001fb000080f600022000041202400040f6000001fb000080f600022000041202400040f6000001fb000080
+f600022000041202400040f6000001fb000080f600022000041202400040f6000001fb000080f600022000041702400040f6000001fb000380000004fb000440002000041702400040f6000001fb000380000004fb000440002000041702400040f6000001fb000380000004fb000440002000041702400040f6000001fb00
+0380000004fb000440002000041802400040f600010180fc0003c0000004fb000440002000041802400040f600010180fc0003c0000004fb000440002000041802400040f600010180fc0003c0000004fb0004400020000421044000400020fc0005020002000181fc0006c0004004000440fe000440003084142104400040
+0020fc0005020002000181fc0006c0004004000440fe0004400030841421044000400020fc0005020002000181fc0006c0004004000440fe0004400030841421044000400020fc0005020002000181fc0006c0004004000440fe0004400030841422044000c00030fc0005020003000281fd0007014000c006000440fe0004
+600051843c22044000c00030fc0005020003000281fd0007014000c006000440fe0004600051843c22044000c00030fc0005020003000281fd0007014000c006000440fe0004600051843c23044000c01430fc001903004300028181020042014000c0060146600040006404d3563c23044000c01430fc0019030043000281
+81020042014000c0060146600060006404d3563c23044000c01c30fc001903004300028181020042014000c00601e6600060006406d3563c23044000c01c30fc001903004300028181020042014000c00601e6600050006406d3563c23044000c01e28fc0019030062800282818600a3014000c00601e6a00088006406d55e
+3c23044000c01628fc0019030062800282818600a3014000c0060156a00088006405d55e3c23044000c01628fc0019030062800282818600a3014000c0060156a00084006405d55e3c20045ffffff7effaff03fefffefefeff02bfff7ffdff095fbfff87fffffddd7ffc060050df000004060060df000004060060df000004
+060060df000004060040df00000406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 9.9\tab A plot of predictions for potential Z DNA containing some high peaks produced by regions of alternating purines and pyrimidines.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Whenever the program reads a sequence file it always displays the base composition to provide the user with a check on the correctness of the file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab 
+The search for anomalous words function operates in the following way. Users select a "word" - say CG and a window length. The program examines each successive window length along the sequence, with each window overlapping the previous one by windowleng
+th-1 bases. For each window position the program calculates the base composition and the number of 
+occurrences of the chosen word. From the base composition it calculates an expected number of occurrences of the chosen word by simply multiplying the relevent frequencies and assuming random ordering. It plots observed - expected hence showing regions tha
+t are enriched or depleted in the chosen word.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The codon constraint calculation offers a measure of the codon bias that is independent of any set tables of expected codons. Although some users may find the underlying mathematics difficult to understand 
+the values obtained provide an interesting measure. It was shown (5) for a set of {\i E. coli} genes that their values of codon constraint correlated with their levels of expression. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab The algorithm for finding possible stem loops counts A-T, G-C and G-T pairs as matching but will only find stems with no mismatches or loopouts.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab The long range inverted and direct repeat routines are fast but only find exact matches. More flexible and exhaustive methods are described in the chapter on sequence comparisons.\par 
+6.\tab It is also possible to use the pattern searching routines to define and search for inverted and direct repeats. They are particularly useful for finding specific structures - for example tRNA folds.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl260\tx560 7.\tab 
+It is possible that the "Examine repeats" algorithm may run out of memory, particularly if a short minimum word length is chosen or the sequence is very long or very repetitive. If this occurs the maximum word length reported may not be the longest in t
+he sequence\: the memory will have been consumed before it was found.\par 
+\pard\plain \s5\sb320\sa60\sl320\tx560 \b\f20\fs28 \page 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab McCaldon,P. and Argos,P. 1988 Oligopeptide biases in protein sequences and their use in predicting protein coding regions in nucleotide sequences. {\i Proteins} {\b 4}, 99-122.\par 
+2.\tab Sweet,R.M. and Eisenberg,D. 1983. Correlation of sequence hydrophobicity measures similarity in three-dimensional protein structure. {\i J. Mol. Biol}. {\b 171}\:479-488.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Honess,R.W., Gompels,U.A., Barrell,B.G., Craxton,M., Cameron,K.R., Staden,R., Chang,Y.-N and Hayward,G.S. 1989 Deviations from expected frequencies 
+of CpG dinucleotides in herpesvirus DNAs may be diagnostic of differences in the states of their latent genomes. {\i J. Gen. Virol}, {\b 70}, 837-855.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Bird,A.P. 1980 DNA methylation and the frequency of CpG in animal DNA. {\i Nucl. Acids Res}. {\b 8}, 1499-1504.\par 
+5.\tab McLachlan, A.D., Staden, R., and Boswell, D.R.  1984.  A method for measuring the non-random bias of a codon usage table.  {\i Nucl. Acids Res}. {\b 12}\:9567-9575.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 10.  Translating and Listing Nucleic Acid Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Listing the sequence with all six reading frames translated\par 
+2.2\tab Listing the sequence with its open reading frames translated\par 
+2.3\tab Listing the sequence with defined segments  translated\par 
+2.4\tab Listing the sequence with translated segments defined from a feature table\par 
+2.5\tab Producing a file of protein sequences for all open reading frames.\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.6\tab Producing a file of protein sequences for segments defined from a feature table\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this chapter we deal with producing simple listings from nucleotide seque
+nces. All functions are contained in the program NIP.  We can list the sequence alone, in single or doubled stranded format or with translations to protein. The translations can be of all six phases, all open reading frames, or of specified segments. The p
+ositions of these segments can be defined on the keyboard or read from a EMBL/GenBank feature table. Translations can use the one letter or three letter codes. In addition we can produce files containing only the protein translations, and which are suitabl
+e for processing by other programs. Again the positions of the translated segments can be defined on the keyboard, read from a feature table, or be all open reading frames. For the user, producing all these results is very simple, so we only give examples 
+of "methods"  and show what the results look like. All outputs that list the sequence can be produced from the menu option named "Translate and list".\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Listing the sequence with all six reading frames translated\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and list".\par 
+2.\tab Accept "Show translation".\par 
+3.\tab Select "The segments to translate will be "All six frames"".\par 
+4.\tab Accept "Use 1 letter codes".\par 
+5.\tab Define "Start". Where to list from.\par 
+6.\tab Define "End". Where to list to.\par 
+7.\tab Define "Line length". The number of characters in each line of output.\par 
+8.\tab Reject "Number ends of lines". This alternative writes the positions underneath each line.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The listing will then appear. Given the choices taken it will look the same as figure 10.1.\par 
+\pard\plain \li1240\ri1280\sb200\sl220\box\brsp100\brdrth \f4\fs16  Q  D  Y  I  G  H  H  L  N  N  L  Q  L  D  L  R  T  F  S  L\par 
+\pard \li1240\ri1280\sl220\box\brsp100\brdrth   R  I  T  *  D  T  T  *  I  T  F  S  W  T  C  V  H  S  R  W\par 
+   G  L  H  R  T  P  P  E  *  P  S  A  G  P  A  Y  I  L  A\par 
+caggattacataggacaccacctgaataaccttcagctggacctgcgtacattctcgctg\par 
+      1010      1020      1030      1040      1050      1060\par 
+gtcctaatgtatcctgtggtggacttattggaagtcgacctggacgcatgtaagagcgac\par 
+ L  I  V  Y  S  V  V  Q  I  V  K  L  Q  V  Q  T  C  E  R  Q\par 
+  P  N  C  L  V  G  G  S  Y  G  E  A  P  G  A  Y  M  R  A  P\par 
+   S  *  M  P  C  W  R  F  L  R  *  S  S  R  R  V  N  E  S\par 
+\par 
+ V  D  P  Q  N  P  P  A  T  F  W  T  I  N  I  D  S  M  F  F\par 
+  W  I  H  K  T  P  Q  P  P  S  G  Q  S  I  L  T  P  C  S  S\par 
+G  G  S  T  K  P  P  S  H  L  L  D  N  Q  Y  *  L  H  V  L\par 
+gtggatccacaaaaccccccagccaccttctggacaatcaatattgactccatgttcttc\par 
+      1070      1080      1090      1100      1110      1120\par 
+cacctaggtgttttggggggtcggtggaagacctgttagttataactgaggtacaagaag\par 
+ H  I  W  L  V  G  W  G  G  E  P  C  D  I  N  V  G  H  E  E\par 
+  P  D  V  F  G  G  L  W  R  R  S  L  *  Y  Q  S  W  T  R  R\par 
+T  S  G  C  F  G  G  A  V  K  Q  V  I  L  I  S  E  M  N  K\par 
+\par 
+ S  V  V  L  G  L  L  F  L  V  L  F  R  S  V  A  K  K  A  T\par 
+  R  W  C  W  V  C  C  S  W  F  Y  S  V  A  *  P  K  R  R  P\par 
+L  G  G  A  G  S  V  V  P  G  F  I  P  *  R  S  Q  K  G  D\par 
+tcggtggtgctgggtctgttgttcctggttttattccgtagcgtagccaaaaaggcgacc\par 
+      1130      1140      1150      1160      1170      1180\par 
+agccaccacgacccagacaacaaggaccaaaataaggcatcgcatcggtttttccgctgg\par 
+ R  H  H  Q  T  Q  Q  E  Q  N  *  E  T  A  Y  G  F  L  R  G\par 
+  P  P  A  P  D  T  T  G  P  K  I  G  Y  R  L  W  F  P  S  W\par 
+E  T  T  S  P  R  N  N  R  T  K  N  R  L  T  A  L  F  A  V\par 
+\par 
+ S  G  V  P  G  K  F  Q  T  A  I  E  L  V  I  G  F  V  N  G\par 
+  A  V  C  Q  V  S  F  R  P  R  L  S  W  *  S  A  L  L  M  V\par 
+Q  R  C  A  R  *  V  S  D  R  D  *  A  G  D  R  L  C  *  W\par 
+agcggtgtgccaggtaagtttcagaccgcgattgagctggtgatcggctttgttaatggt\par 
+      1190      1200      1210      1220      1230      1240\par 
+tcgccacacggtccattcaaagtctggcgctaactcgaccactagccgaaacaattacca\par 
+ A  T  H  W  T  L  K  L  G  R  N  L  Q  H  D  A  K  N  I  T\par 
+  R  H  A  L  Y  T  E  S  R  S  Q  A  P  S  R  S  Q  *  H  Y\par 
+\pard \li1240\ri1280\sl220\keepn\box\brsp100\brdrth L  P  T  G  P  L  N  *  V  A  I  S  S  T  I  P  K  T  L  P\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 10.1\tab A six phase translation using the 1 letter codes\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Listing the sequence with its open reading frames translated\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and list".\par 
+2.\tab Accept "Show translation".\par 
+3.\tab Select "The segments to translate will be "Open reading frames"".\par 
+4.\tab Define "Minimum open frame in amino acids".\par 
+5.\tab Accept "Use 1 letter codes".\par 
+6.\tab Define "Start". Where to list from.\par 
+7.\tab Define "End". Where to list to.\par 
+8.\tab Define "Line length". The number of characters in each line of output.\par 
+9.\tab Select "Both strands"\par 
+10.\tab Accept "Number ends of lines".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 A typical result is shown in figure 10.2.\par 
+\pard\plain \li720\ri680\sb200\sl220\box\brsp100\brdrth \tx7780 \f4\fs16  Q  D  Y  I  G  H  H  L  N  N  L  Q  L  D  L  R  T  F  S  L\par 
+\pard \li720\ri680\sl220\box\brsp100\brdrth \tx7780 caggattacataggacaccacctgaataaccttcagctggacctgcgtacattctcgctg\tab 1060\par 
+    .    \:    .    \:    .    \:    .    \:    .    \:    .    \:\par 
+gtcctaatgtatcctgtggtggacttattggaagtcgacctggacgcatgtaagagcgac\par 
+ L  I  V  Y  S  V  V  Q  I  V  K  L  Q  V  Q  T  C  E  R  Q\par 
+                                 *  S  S  R  R  V  N  E  S\par 
+\par 
+ V  D  P  Q  N  P  P  A  T  F  W  T  I  N  I  D  S  M  F  F\par 
+gtggatccacaaaaccccccagccaccttctggacaatcaatattgactccatgttcttc\tab 1120\par 
+    .    \:    .    \:    .    \:    .    \:    .    \:    .    \:\par 
+cacctaggtgttttggggggtcggtggaagacctgttagttataactgaggtacaagaag\par 
+ H  I  W  L  V  G  W  G  G  E  P  C  D  I  N  V  G  H  E  E\par 
+T  S  G  C  F  G  G  A  V  K  Q  V  I  L  I  S  E  M  N  K\par 
+\par 
+ S  V  V  L  G  L  L  F  L  V  L  F  R  S  V  A  K  K  A  T\par 
+tcggtggtgctgggtctgttgttcctggttttattccgtagcgtagccaaaaaggcgacc\tab 1180\par 
+    .    \:    .    \:    .    \:    .    \:    .    \:    .    \:\par 
+agccaccacgacccagacaacaaggaccaaaataaggcatcgcatcggtttttccgctgg\par 
+ R  H  H  Q  T  Q  Q  E  Q  N  *  E  T  A  Y  G  F  L  R  G\par 
+E  T  T  S  P  R  N  N  R  T  K  N  R  L  T  A  L  F  A  V\par 
+\par 
+ S  G  V  P  G  K  F  Q  T  A  I  E  L  V  I  G  F  V  N  G\par 
+agcggtgtgccaggtaagtttcagaccgcgattgagctggtgatcggctttgttaatggt\tab 1240\par 
+    .    \:    .    \:    .    \:    .    \:    .    \:    .    \:\par 
+tcgccacacggtccattcaaagtctggcgctaactcgaccactagccgaaacaattacca\par 
+ A  T  H  W  T  L  K  L  G  R  N  L  Q  H  D  A  K  N  I  T\par 
+L  P  T  G  P  L  N  *  V  A  I  S  S  T  I  P  K  T  L  P\par 
+\par 
+ S  V  K  D  M  Y  H  G  K  S  K  L  I  A  P  L  A  L  T  I\par 
+agcgtgaaagacatgtaccatggcaaaagcaagctgattgctccgctggccctgacgatc\tab 1300\par 
+    .    \:    .    \:    .    \:    .    \:    .    \:    .    \:\par 
+tcgcactttctgtacatggtaccgttttcgttcgactaacgaggcgaccgggactgctag\par 
+ A  H  F  V  H  V  M  A  F  A  L  Q  N  S  R  Q  G  Q  R  D\par 
+\pard \li720\ri680\sl220\keepn\box\brsp100\brdrth \tx7780 L  T  F  S  M  Y  W  P  L  L  L  S  I  A  G  S  A  R  V  I\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa180\sl240\tx1140 \f21\fs20 Figure 10.2\tab A listing showing the translation of open reading frames from both strands of a sequence from position 1001 to 1300\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Listing the sequence with defined segments  translated\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and list".\par 
+2.\tab Accept "Show translation".\par 
+3.\tab Select "The segments to translate will be "Typed on the keyboard"".\par 
+4.\tab Accept "Use 1 letter codes".\par 
+5.\tab Define "Start". Where to list from.\par 
+6.\tab Define "End". Where to list to.\par 
+7.\tab Define "Line length". The number of characters in each line of output.\par 
+8.\tab Select "Both strands".\par 
+9.\tab Accept "Number ends of lines".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Define "Translate from". Define the start of the next segment to translate - say the next exon.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 11.\tab Define "Translate to". Define the end of the next segment to translate.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 12.\tab Select "Strand". As both strands have been selected above the program will allow either to be translated for each defined segment.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program will now cycle around through steps 10, 11 and 12 until a zero value is defined for "Translate from". At which point the listing will appear. Given the choices made it will look the same as figure 10.2.
+\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Listing the sequence with translated segments defined from a feature table\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and list".\par 
+2.\tab Accept "Show translation".\par 
+3.\tab Select "The segments to translate will be "Read from a feature table"".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Feature table file name". Type the name of the file containing the appropriate feature table in EMBL/GenBank format.\par 
+5.\tab Define "Operator". This defines which feature table operators should be employed when selecting the segments to translate.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Accept "Use 1 letter codes"\par 
+7.\tab Define "Start". Where to list from.\par 
+8.\tab Define "End". Where to list to.\par 
+9.\tab Define "Line length". The number of characters in each line of output.\par 
+10.\tab Select "Both strands"\par 
+11.\tab Accept "Number ends of lines".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program will now read the feature table file and translate the segments defined using the selected operator(s) and the listing will appear as in figure 10.2.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Producing a file of protein sequences for all open reading frames.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and write protein sequences to disk".\par 
+2.\tab Reject "Translate selected regions". The alternative is "Open reading frames".\par 
+3.\tab Define "Minimum open frame in amino acids".\par 
+4.\tab Select "Both strands".\par 
+5.\tab Define "File name for translation".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+A typical results file is shown in figure 10.3. It shows that the file is written in FASTA format (i.e. an entry name line starting with a > symbol (here the first entry name is 188, the start of the DNA segment), followed by a title (here in EMBL feature 
+table format giving the start and end of the DNA that produced the protein), followed by the sequence terminated by an *.\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard\plain \sl220 \f4\fs16 {\f22\fs18 \par 
+}\pard \li1260\ri1360\sl220\box\brsp100\brdrth {\f22\fs18 >188                 188..733\par 
+}\pard \li1260\ri1360\sl220\box\brsp100\brdrth {\f22\fs18  TMEVNKKQLADIFGASIRTIQNWQEQGMPVLRGGGKGNEVLYDSAAVIKWYAERDAEIEN\par 
+ EKLRREVEELRQASEADLQPGTIEYERHRLTRAQADAQELKNARDSAEVVETAFCTFVLS\par 
+ RIAGEIASILDGLPLSVQRRFPELENRHVDFLKRDIIKAMNKAAALDELIPGLLSEYIEQ\par 
+ SG*\par 
+>711                 711..2633\par 
+ VNISNSQVNRLRHFVRAGLRSLFRPEPQTAVEWADANYYLPKESAYQEGRWETLPFQRAI\par 
+ MNAMGSDYIREVNVVKSARVGYSKMLLGVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVE\par 
+ PTIRDIPSLLALAPWYGKKHRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDEL\par 
+ AAFDDDIEQEGSPTFLGDKRIEGSVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVAC\par 
+ PHCGEEQYLKFGDKETPFGLKWTPDDPSSVFYLCEHNACVIRQQELDFTDARYICEKTGI\par 
+ WTRDGILWFSSSGEEIEPPDSVTFHIWTAYSPFTTWVQIVKDWMKTKGDTGKRKTFVNTT\par 
+ LGETWEAKIGERPDAEVMAERKEHYSAPVPDRVAYLTAGIDSQLDRYEMRVWGWGPGEES\par 
+ WLIDRQIIMGRHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGIDPTIVYERSKK\par 
+ HGLFRVIPIKGASVYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYNRFTLTPEGDEPLP\par 
+ GAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKRRNEALDCFVYALAALRI\par 
+ SISRWQLDLSALLASLQEEDGAATNKKTLADYARALSGEDE*\par 
+>74                  complement(74..727)\par 
+ LFDIFTQQPRYQFIQRGCFVHGFDDIPFQEINMSVFQFRKTPLHRQGEPVENTGNFTCDP\par 
+ RQHESTECGFHHFSGVSGILQFLCVGLRTRKSMAFVLNSSWLEICLAGLPQFFNLPAQLF\par 
+ VLNFSIPFGIPFYDGGRVIKHLITLATASQNGHSLFLPVLNGTDTRTENVSQLLFVDFHC\par 
+ SFHGQKQRKETTEAKKPRFQHLSFPFFSEGILNKNIKL*\par 
+>313                 complement(313..732)    \par 
+ PDCSIYSLSNPGISSSSAAALFMALMISRFRKSTCRFSSSGKRRCTDRGSPSRILAISPA\par 
+ IRDSTKVQNAVSTTSAESLAFFSSCASACARVSRWRSYSIVPGWRSASLACRSSSTSRRS\par 
+}\pard \li1260\ri1360\sl220\box\brsp100\brdrth {\f22\fs18  FSFSISASLSAYHFMTAAES*\par 
+}\pard \li1260\ri1360\sl220 {\f22\fs18 \par 
+}\pard\plain \s8\qj\fi-1140\li1140\sb120\sa240\sl240\tx1140 \f21\fs20 Figure 10.3\tab The contents of a file containing the protein sequences of the open reading frames found by the program\par 
+\pard\plain \s6\fi-560\li560\sb240\sa60\sl280\tx560 \b\f20 2.6\tab Producing a file of protein sequences for segments defined from a feature table\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Translate and write protein sequences to disk".\par 
+2.\tab Accept "Translate selected regions".\par 
+3.\tab Reject "Define segments using keyboard". The alternative is to use a feature table.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Feature table file name". Type the name of the file containing the appropriate feature table in EMBL/GenBank format.\par 
+5.\tab Define "Operator". This defines which feature table operators should be employed when selecting the segments to translate.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "File name for translation"\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program will now read the feature table file and translate the segments defined using the selected operator(s). The results will be stored as in figure 10.3.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab To produce a listing without translation the "Translate and list" function can be used with the "Show translation" option rejected. Alternatively the function "List the sequence" can be used.
+\par 
+2.\tab Some users may be confused by the fact that the program asks "Where to list from, and to" and also "Define segments to translate". This allows for 5' and 3' untranslated regions to be included in the listing.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The feature table file employed by the programs is a simple text file containing the data for the current sequence. Because of the multiplicity of different sequence library formats we have not provided the facility of reading such data directly from li
+braries. The feature tables for individual library entries must be extracted (see the introductory chapter) or files can be created for new sequences.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+The current feature tables use "operators" such as "join" or "order" to specify which segments should be translated together to make a complete protein sequence. The program allows users to select which ones to employ, the default being "Use all operato
+rs".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab The program contains a function "Set genetic code" which allows users to choose from a menu of codes or to define their own by specifying amino acid and codon pairs. This sets the code for all functions.
+\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 11.  Statistical and Structural Analysis of Protein Sequences\par 
+\pard\plain \s3\sb200\sa120\sl360 \b\f20\fs32 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Plotting hydrophobicity\par 
+2.2 \tab Plotting charge\par 
+2.3\tab Plotting hydrophobic moment and hydrophobicity\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700\tx1980 2.4\tab Drawing helical wheels\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.5\tab Producing a Robson secondary structure prediction\par 
+2.6\tab Calculating the amino acid composition and molecular weight\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this chapter we describe the use of routines for plotting hydrophobicity, charge and hydrophobic moments, drawing helix wheels and predicting second
+ary structure. Use of all these routines is very straightforward and they are contained in the program PIP.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Plotting hydrophobicity\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method uses the values of Kyte and Doolittle (1)\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot hydrophobicity".\par 
+2.\tab Define "Window length".\par 
+3.\tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 11.1.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Plotting charge\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot charge".\par 
+2.\tab Define "Window length".\par 
+3. \tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear and will be similar to that shown in figure 11.1.\par 
+\pard\plain \sl220\keepn \f4\fs16 {{\pict\macpict\picw448\pich81 
+0396ffffffff005001bf1101a0008201000affffffff005001bf0900000000000000003100000000004f01be9800240000000000350120000000000035011f00000000004f01be000102dd0006007fdfff00fc060040df000004060040df000004060040df000004060040df000004060078df000004060040df0000040600
+40df000004060040df000004060040df00000407017840e0000004070140b0e000000407014108e000000407014104e000000407014204e00000040b017a02fc000020e60000040c014202fd00010250e60000040f014402fd00010590e900031000000418014401fd00010490f800010380f8000020fe0003700000041c01
+4801fd00010808f800010480fd000010fd000060fe0003880000041d017801fd00010808f800010440fe00010428fd000090fe00038804000424074801000002000804fe00010110fd00010440fe00010a28fe000801108004010806000424075000800005001004fe000101a8fd00010840fe000d09440000020111800b01
+04090004240c500080000480100200800002a4fd00010840fe000d10c40020030a0a4009010709002425236000800004e0100201400002440020000210400004401002005004960c400882010881e42523780080000810100202300002430050000d1020001ba0200100900490004810820090412425234000800008102001
+02080002008088001120200020104000811004600037f0c20090222406007fdfff00fc2523400040002008e00104032c040022020020c0180080038000420808000020000c00600a14241440004000200500010400c204001201002080080080fe0002420410fd000404004004142113780030004005000084000104001401
+0040000401fd0002220410fb00024004141f13400008004005000098000088000c010040000401fd0002240410f900000c1f134000080040020000e00000900000010040000407fd00021c0220f90000041c05400009008002fc0008600000010e80000208fc000103e0f900000416044000068080fb000040fe0004918000
+0210f2000004110378000441f6000490800003f0f20000040d0340000022f60000a0ee0000040d0340000022f6000060ee000004090340000014e2000004090340000008e2000004060078df000004060040df000004060040df000004060040df000004060040df000004060078df000004060040df000004060040df0000
+04060040df000004060040df00000406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 11.1\tab A hydrophobicity plot using the values of Kyte and Doolittle.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Plotting hydrophobic moment and hydrophobicity\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method plots the hydrophobic moment and the hydrophobicity as defined by Eisenberg {\i et al} (2).\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Plot hydrophobic moment".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Angle". This is the angle between the residues when the helix is viewed end on. The default value of 100 degrees is that found in alpha helices.\par 
+3.\tab Define "Window length". The default of 18, if used in conjunction with the default "Angle", is equivalent to 5 turns of the helix.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Plot interval".\par 
+\pard\plain \s4\qj\sa120\sl280\tx560 \f20 
+The plot will appear as in figure 11.2. with the hydrophobicity shown above the hydrophobic moment. The scale for the hydrophobicity runs from -1.0 to 1.5 and for the hydrophobic moment from 0.0 to 1.5. The program plots the mean values for each window pos
+ition with the value at position x representing the segment from x-window length+1 to x.\par 
+\pard\plain \sb200\sl220\keepn \f4\fs16 {{\pict\macpict\picw447\pich160 
+0659ffffffff009f01be1101a0008201000affffffff009f01be0900000000000000003100000000009e01bd9800240000000000670120000000000067011f00000000009e01bd000102dd0006007fdfff00fc060040df000004060070df000004060040df000004060070df000004060040df000004060070df0000040600
+40df000004060070df000004060040df00000406007edf000004060040df000004060070df000004060040df000004060070df000004060040df000004060078df000004060074df0000040a0072fc000008e50000040a0061fc000038e50000040e007ffc000044f2000020f500000413016080fd000084f2000050f90000
+08fe00000414017080fe00010104f2000048f9000016fe00000419016040fe00010202f2000088fe000001fe000502110008000419017030fe00010401f2000084fe000902818000052088140004200c40080001000401e00800000380f900012102fe00090242600004e096120004240c70080002800800101400000c40fd
+000008fe000e320380000004741000040061210004240c40080004800800082200003040fd000034fe000e4a004000001c0c0c00080001210004250c70040004500800042200004020fe00131c440000e04c0020000020000b0e080000c08004250c400400086f10000241f8038010fe001323820001104000100000400000
+91100000808004241e7e02000800f00001800404001100001c4002000e0880000c400080000060e0fd0000041f0340020008fd0012800208000ef000244002001004800003b00080f90000041e0370010010fc000b023800000fc0228001001003fe0002080080f90000041c0340010010fc000101c0fe00053042800100a0
+fd00010801f8000004170370008020f70005084280008160fd00010401f8000004160340008040f700040481000041fc000107c6f8000004150370004440f700040301000021fb000028f8000004110340004a80f300001afb000010f80000041002700031f2000006fb000010f8000004060040df00000406007edf000004
+060040df000004060070df000004060040df000004060070df000004060040df000004060070df000004060040df000004060070df000004060040df00000406007fdfff00fc060040df000004060040df000004060070df000004060040df000004060040df000004060040df000004060070df000004060040df00000406
+0040df000004060070df000004060040df000004060040df000004060070df000004060040df000004060040df000004060040df00000406007edf000004060040df000004060040df000004060070df000004060040df000004060040df000004060070df000004060040df000004060040df000004060040df0000040600
+70df0000040a0040f6000010eb0000040a0040f600002ceb0000040a0070f6000024eb0000040a0040f6000042eb0000040c0040f80002800082eb00000411007efe000040fd000303400082eb000004110040fe0000a0fd000304400101eb000004130040fe0000a0fd0005082002010001ed0000041e044000000110fd00
+070820040100030010f9000040fe000103e0fd0000042204700000011cfd000730100800c0048128fd000480000004c0fe00010410fd00000423044000000204fd000720081000200482c4fe00050380000006a0fe00012808fd00000424044000000402fe0008204008100020088404fe0005024000000920fe00015808fd
+00000424047000001c02fe0008504004100020084404fe0005044000004920fe00018004fd00000425104000002002700000484004200010084802fe000f84400000a820040001000400e00000042523400000200190000048800420001010480200000144420000b0100a018a000403100080042523700001200109000084
+80044000087030020000024826000110105206740002241003000425234001c2c0000a8008848002800008801002000804482508071010b208000003d812040004251b40022440000a601703000280000500000104340428291408000d0108fe0004080d080004241b60041800000411a00200030000070000012a44043019
+240800030108fd0003088800041e077f88100000040a60f9000701d982880000c408fe000090fc00025000041b014050fd00000cf7000601037800008210fe000070fc00025000040e014030ed000103f0f8000220000406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 11.2\tab A hydrophobic moment (below) and hydrophobicity plot. The hydrophobicity plot displays the mean va
+lues on a scale of -1.5 to 1.0 and the hydrophobic moment on a scale of 0.0 to 1.5.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Drawing helical wheels\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method draws helical wheels for any segment of the sequence (3). In addition it displays the hydrophobic moment for the segment (2).\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Draw helix wheel".\par 
+2.\tab Define "Angle". The default angle of 100 degrees is that found in alpha helices.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "Window length". The default of 18, if used in conjunction with the default "Angle", is equivalent to 5 turns of the helix.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Step
+". To produce a display for a sequence position N bases from the current one type N, and the display will appear in place of the previous one. The default value of N is 1, so by repeatedly hitting carriage return the user can step, residue by residue, thro
+ugh the sequence.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The display for the current position in the sequence will appear as in figure 11.3. and the bell will ring. The program now allows the user to "step through the sequence displaying the helix wheel for each position.
+\par 
+\pard\plain \li900\ri960\sb500\sl220\keepn\box\brsp120\brdrth \f4\fs16 {{\pict\macpict\picw355\pich329 
+0c64ffffffff014801621101a00082a0008c01000affffffff0148016209000000000000000031010f01050121011338a10096000c010000000200000000000000a1009a0008fffd00000004000001000a01100106011e01112c000c00150948656c76657469636103001504010d000c2e00040000010028011a01070144a0
+0097a0008da0008c01000affffffff0148016231012600ba013800c838a10096000c010000000200000000000000a1009a0008fffc00000004000001000a012700bb013500c628013100bc014ca00097a0008da0008c01000affffffff0148016231011d0087012e009538a10096000c010000000200000000000000a1009a
+0008fffc00000004000001000a011d0088012b009328012700890146a00097a0008da0008c01000affffffff014801623100df004600f1005438a10096000c010000000200000000000000a1009a0008fffd00000004000001000a00e0004700ee00532800ea00480156a00097a0008da0008c01000affffffff0148016231
+0097003900a8004738a10096000c010000000200000000000000a1009a0008fffc00000004000001000a0097003a00a500452800a1003b0159a00097a0008da0008c01000affffffff0148016231006b004d007c005b38a10096000c010000000200000000000000a1009a0008fffc00000004000001000a006b004e007900
+59280075004f014ca00097a0008da0008c01000affffffff01480162310032008a0044009838a10096000c010000000200000000000000a1009a0008fffc00000004000001000a0033008b0041009628003d008c014ba00097a0008da0008c01000affffffff0148016231002b00ba003d00c838a10096000c010000000200
+000000000000a1009a0008fffd00000004000001000a002c00bb003a00c628003600bc0144a00097a0008da0008c01000affffffff0148016231003300f1004500ff38a10096000c010000000200000000000000a1009a0008fffd00000004000001000a003400f2004200fd2b37080148a00097a0008da0008c01000affff
+ffff0148016231005101190063012738a10096000c010000000200000000000000a1009a0008fffd00000004000001000a0052011a006001252b281e0145a00097a0008da0008c01000affffffff014801623100b9014400cb015238a10096000c010000000200000000000000a1009a0008fffc00000004000001000a00b9
+014500c701512b2b67014ba00097a0008da0008c01000affffffff01480162310098014400aa015238a10096000c010000000200000000000000a1009a0008fffc00000004000001000a0099014500a701512800a30146014ba00097a0008da0008c01000affffffff0148016231003e00ba004f00c838a10096000c010000
+000200000000000000a1009a0008fffc00000004000001000a003f00bb004d00c728004900bc0131a00097a0008da0008c01000affffffff014801623100b9013100ca013f38a10096000c010000000200000000000000a1009a0008fffd00000005000001000a00ba013200c8013e2b777b0132a00097a0008da0008c0100
+0affffffff014801623101080090011a009e38a10096000c010000000200000000000000a1009a0008fffc00000004000001000a010900910117009c28011300920133a00097a0008da0008c01000affffffff01480162310075005b0087006938a10096000c010000000200000000000000a1009a0008fffd000000050000
+01000a0076005c00840068280080005d0134a00097a0008da0008c01000affffffff0148016231005c0109006e011738a10096000c010000000200000000000000a1009a0008fffc00000005000001000a005d010a006b0116280067010b0135a00097a0008da0008c01000affffffff014801623100f900fe010b010c38a1
+0096000c010000000200000000000000a1009a0008fffd00000004000001000a00fa00ff0108010b28010401000136a00097a0008da0008c01000affffffff014801623100d5005700e7006538a10096000c010000000200000000000000a1009a0008fffd00000004000001000a00d6005800e400632800e000590137a000
+97a0008da0008c01000affffffff014801623100480093005a00a138a10096000c010000000200000000000000a1009a0008fffc00000005000001000a00490094005700a028005300950138a00097a0008da0008c01000affffffff01480162310098013200a9014038a10096000c010000000200000000000000a1009a00
+08fffc00000004000001000a0099013300a7013e2b9f500139a00097a0008da0008c01000affffffff0148016231010f00b7011c00d038a10096000c010000000200000000000000a1009a0008fffd00000009000001000a011000b8011e00cd28011a00b9023130a00097a0008da0008c01000affffffff01480162310097
+004a00a6006338a10096000c010000000200000000000000a1009a0008fffd00000009000001000a0098004b00a600602800a2004c023131a00097a0008da0008c01000affffffff0148016231004600e3005700f838a10096000c010000000200000000000000a1009a0008fffc00000008000001000a004700e4005500f6
+28005100e5023132a00097a0008da0008c01000affffffff014801623100e2011700f3012c38a10096000c010000000200000000000000a1009a0008fffc00000007000001000a00e3011800f101292b349c023133a00097a0008da10096000c010000000200000000000000a1009a0008fffd0000003a000001000a000000
+00000e007728000a00010d444b464c4544564b4b4c594853a00097a10096000c010000000200000000000000a1009a0008000400000007000001000a00180002003400132b0218044d20200d2a0e0148a00097a10096000c030000000200000000000000a1009a0008000b00000004000001000a0018000d00420031280022
+001a05372e38310d2800300016062d322e39370d2b070e03313532a00097a0008c01000affffffff0148016231003300890045009738a10096000c010000000200000000000000a1009a0008fffd00000005000001000a0034008a00420096296e014ba00097a0008da0008c01000affffffff014801623100f30123010401
+3138a10096000c010000000200000000000000a1009a0008fffd00000005000001000a00f40124010201302b9ac00153a00097a0008d01000affffffff0148016207000000002200bc01210000a000a0a100a4000209fd01000a0000000000000000070001000109ffffffffffffffff22005900bf62632300002100fc009f
+23000023cc8723000021006d00fe2300002100ee00fe2300002100d8006b23000023338723000021009f0120230000239f6323000023a29c23000021005f00e2230000233278230000a000a301000affffffff0148016222005900bf62632100fc009f23cc8721006d00fe2100ee00fe2100d8006b23338721009f0120239f
+6323a29c21005f00e2233278a000a1a10096000c030000000200000000000000a1009a0008fffc00000003000001000a002000f9003101020d000e28002c00fa012ba00097a10096000c030000000200000000000000a1009a0008fffc00000003000001000a002100820032008b28002d0083012ba00097a10096000c0300
+00000200000000000000a1009a0008fffc00000003000001000a0096015800a701612bd675012ba00097a10096000c030000000200000000000000a1009a0008fffc00000003000001000a00b7015700c801602800c30158012ba00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a00
+4401250055012f280050012a012da00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a001900b7002a00c128002500bc012da00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a011d0107012e0111280129010c012da00097a10096000c030000
+000200000000000000a1009a0008fffc0000ffff000001000a013600b6014700c028014200bc012ea00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a012a007c013b00862801360082012ea00097a10096000c030000000200000000000000a1009a0008fffc0000fffe000001000a
+00e4003100f5003b2800f00037012ea00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a0092002400a3002e28009e002a012ea00097a10096000c030000000200000000000000a1009a0008fffc0000ffff000001000a005a003e006b00472800660043012ea00097a00083ff}}
+\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa240\sl240\tx1140 \f21\fs20 Figure 11.3\tab A typica
+l helix wheel display using a window of only 13 residues. The display includes a schematic of the helix showing the links between residues, with each vertex numbered according to position; the residue type at each vertex; a symbol denoting a classification
+ as hydrophobic (.), positively charged (+), negatively charged (-), or otherwise (). The residue number of the first sequence element in the current window is displayed at the top left corner along with the sequence. Below this is the total hydrophobicity
+ and hydrophobic moment according to Eisenberg {\i et al }(2).\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Producing a Robson secondary structure prediction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method uses the method of Garnier {\i et al} (4) to predict the positions of alpha helices, beta sheets, turns and random coil. The results can be either plotted or listed.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Robson secondary structure prediction".\par 
+\pard \s7\qj\fi-560\li560\ri-100\sa120\sl280\tx560 \page 2.\tab Accept "Plot results". The alternative produces a listing like that shown in figure 11.4.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 11.5. and the program also prints a count of the number of positions at which each of the 4 structure types is the highest scoring.\par 
+\pard\plain \li1500\ri1460\sb200\sl220\box\brsp100\brdrth \tqr\tx3220\tqr\tx4700\tqr\tx6140\tqr\tx7420 \f4\fs16 350 P\tab 274\tab -178\tab -84\tab -77\par 
+\pard \li1500\ri1460\sl220\box\brsp100\brdrth \tqr\tx3220\tqr\tx4700\tqr\tx6140\tqr\tx7420 351 L\tab 16\tab -192\tab -21\tab -38\par 
+352 K\tab 371\tab -223\tab -75\tab -68\par 
+353 L\tab 365\tab -152\tab -101\tab -65\par 
+354 S\tab 331\tab -82\tab -84\tab -63\par 
+355 K\tab 311\tab -43\tab -110\tab -88\par 
+356 A\tab 280\tab -23\tab -110\tab -80\par 
+357 V\tab 234\tab -12\tab -135\tab -75\par 
+358 H\tab 177\tab -10\tab -143\tab -92\par 
+359 K\tab 153\tab 2\tab -180\tab -138\par 
+360 A\tab 158\tab 52\tab -175\tab -130\par 
+361 V\tab 144\tab 78\tab -187\tab -115\par 
+362 L\tab 132\tab 58\tab -186\tab -80\par 
+363 T\tab 124\tab 63\tab -142\tab -78\par 
+364 I\tab 144\tab 32\tab -111\tab -43\par 
+365 D\tab 120\tab -49\tab -29\tab 5\par 
+366 E\tab 103\tab -80\tab 13\tab 43\par 
+367 K\tab 111\tab -113\tab 23\tab 42\par 
+368 G\tab 132\tab -127\tab -13\tab 64\par 
+369 T\tab 172\tab -132\tab -42\tab 52\par 
+\pard \li1500\ri1460\sl220\keepn\box\brsp100\brdrth \tqr\tx3220\tqr\tx4700\tqr\tx6140\tqr\tx7420 370 E\tab 216\tab -170\tab -122\tab -4{\b \par 
+}\pard\plain \s8\qj\fi-1140\li1140\sb120\sa200\sl240\tx1140 \f21\fs20 Figure 11.4\tab A listing of the Robson secondary structure prediction. It includes the sequence position, the residue type and the values for the four structure classes.\par 
+\pard\plain \sb200\sl220\keepn \f4\fs16 {{\pict\macpict\picw446\pich256 
+0d0fffffffff00ff01bd1101a0008201000affffffff00ff01bd090000000000000000310000000000fe01bc9800240000000000a601200000000000a6011f0000000000fe01bc000102dd0006007fdfff00fc060040df000004060040df000004060040df000004060041df00000407014280e0000004060042df0000040b
+0042fd00010140e500000410014380fe000101a0fd0000c0ea000004110640000008800120fe000101a0ea000004160c40000019900124000010022380fc00000cf1000004200c40000066e80216000070022240fc000012fe00014001fc000060fd00010404240e400000a68804190000908214400006fe000612200000a0
+0180fe00010190fd00010c04252340001100080409020111421820000b02800012500000a181900000040f10001006001204252340003f0008040903c20d3c0020001103c00022900000a1827900038e080800380a90120425234061410004340104a205200010021084200022910000a272279ea495300800440ba8218425
+234292e03b1c5f070a12f300e0d0059e9830002751860122f20493de5161f800440bb82d042508429400000648010814fe00171004a0b814002009b621221400600860c008008508a44104250843140000018000880cfe000d0f0460c01ad9200a4952220c0020fe000604808590424004230842140000018000880cfe000b
+0d046080032ac006014a2204fc000605e883904380041f014018fc000050fc000084fe000604400401861c04fc0006051500500200041e014008fc000030fc000048fe0005040000018018fb0006060500600000041a0040fb000020fc000070fb0002010018fb0006020600200000040e0040f5000050f2000002fc000004
+0a0040f5000040ec000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df00000406007fdfff00fc060040df000004060040df000004060040df00000407014380e0000004060041df000004060041df0000040a0041f7000020ea000004100041fe0002
+8000c0fd000060ea000004190640000008e00120fe00010161fb000008fd000001f60000041b064000000da00120fe000202e280fc000008fe00018001f60000041c0640600053200110fe000202e280fc00000cfe0002c00180f70000042306405010a3100212fe000202a280fe0002c00014fe0004c001800002fc000304
+00100425064090110310020efe001904948000000140002400000121018000c200018000000420120425234090190210040e0000c0049c8000000220002480000121814000a38c014000000c202e8425234090190210140d020330280880000c222000248100022281430123920f6000000ab02d04252340891d02182c0503
+0234d80040001a54202025830022224124821291083000300ab44d04252342f766eef868e50484150323c0002a55d0303be2e033e34124c21279381000500efc560425234287e60018800504c4170000440722881049c39491322231242a1a01e00880580adc440425234306a00015000108241600003c05418810c943140b
+4c1211343c0c00000980480a8a440425234106000016000108141400001218c1800d0902140780140d38240c000009a0841b0a8004252340040000060000881c080000111000000f0e02140780140608000c00000950852b0a8004252340040000040000880008000001900000070002080500180600000400000908872b0b
+000420014004fc0002900008fe000050fb000f040018060000040000050481230300041b0040fb000060fc000050fb000304000802fc0006060700a30000041b0040fb000020fc000060fb000304000802fc0006020100e2000004140040f5000060f9000008fb0006020000220000040a0040e5000002fc0000040a0040e5
+000002fc000004060040df000004060040df000004060040df000004060040df00000406007fdfff00fc060040df000004060040df000004060040df0000040b014380f8000008ea0000040a0042f700000cea0000040e0043f700000cf1000008fb0000040e0042f700000af100000cfb0000040f014380f8000012f10000
+14fb0000041c014004fd000008fd000012fd000004fc00010204fd000014fb0000041e01410afd000014fd000012fd00000afc0001060cfe00010224fc000101041f014292fd000014fd000012fd00000bfc00010a0cfe0001c522fd000203010421014291fd000024fd000621200000020904fd00060a12000001a522fd00
+02028284240642910000a00024fd00182150000007110f000007000a12000002252200010000028204252342508000d006443804000820900000091109c0000900091200300218e2180380000282042523426043f0918582280c000e20900030089109200008801112002802180128048000028404252343bfc32912899a68
+1200113e3000380e9f8a60001c802f1e002803f7ff2804802002858425234000440f0e480184122011201180c80890902000204e21120044040001281c406164840425234000440208500184225320c009410808a09010002052e0a110441c0001241040b2a44404252340002402085001044254a00009220810a060100040
+2380a12d841000012620210a34440425234000240000300002829c60000924041040600800802080c127022000014220210e08440425234000180000200002818000000614041000000881000080812002200001424011000848042402400018fd000c0300800000061c04a000000981fe000c01200240000082401b000028
+04200040fb000002fc00061004c000000a42fe000c012002400000014004000030041f0040fb000002fb0005028000000a64fe00070120014000000180fe000130041b0040f40005038000000614fe000301200180fe000080fe00011004160040f4000002fe00010614fe000301400080f9000004110040ef000008fe0003
+01400080f90000040a0040ea0000c0f70000040a0040ea000080f7000004060040df000004060040df000004060040df00000406007fdfff00fc0a0040e40000c0fd0000040a0040e40000a0fd00000413014280f4000080fe000006f7000090fd00000417014280f600021000c0fe000009f80006011001c00000041c0143
+80f600026800a0fe00010880fe000001fd00060110032000000421014280fa000002fe0002480110fe00011080fe00010280fe00060108042000000421014280fa000005fe0002c40110fe00011040fe00010640fe000602080420000004250f4000060001c000380405300001040110fe00011040fe000b3420e000000204
+0820008004250f40000500023000280a04a80003020210fe00101040080000482090000004060820018004252340000900021000440a0868000c0102080038001020140000c0211000000405901002400425234000110002080082120864001401020801e4002021e21c0100111000000801d0100240042523400011000404
+00826108040010008208030300202222240100121000000800201004200425234000108004040103a09002001000e208040381c012022201000a080000080000100420042523400020600404010000900200200024078c00418014022202000c08010008000008c82104252341c06020080401000060020020001804f00022
+001c0243e200000801c008000008a81704252342a0ff100ffe01ffff0ffe105fffebf811fda3bff3ff789ffffff802201ffffffdeff5042506401b000e880202fe0002012840fe000610003200000180fe00090b06201000000518148423064004000bc80202fe0002012480fc00001cfe000080fe00090a84383000000300
+0c841e0640040002480104fd0001a680fc000008fa00090a8c2ce00000020000041e0640000002280104fd0001e380fc000008fa0002045005fe000302000004170040fe0002100104fd00018080f40002042003fb000004140040fe0002100104fd000080f1000003fb0000040a0040fc0000fce50000040a0040fc000020
+e5000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df000004060040df00000406007fdfff00fc060040df000004060043df00000407014280e000000407014280e000000407014280e000000407014380e000000425134000203b1807070200f000e0c0000e00300006
+40fe000cf000000e1001f800000b980c04060040df000004060040df000004060040df000004060040df000004060040df0000042406407000eee000e0fe0011032380000001c00038800001e10100000208fd000304201204060040df000004060040df000004060040df000004060040df000004060040df000004060040
+df0000042202439f80fe000018fd00111e200010061f0260000c000e1e000001f7fefc00010184060040df000004060040df000004060040df000004060040df000004060040df000004252340005f0007fc01ffff0ffc001fffe3f801fd81bff3fe3801fffff800000ffffff8cfe004060040df000004060040df00000406
+0040df000004060040df000004060040df000004060040df00000406007fdfff00fc02dd00a00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 11.5\tab A secondary structure plot using the method of Robson. The likelihood that each 17 residue segment of the sequence forms one of the four structure classes\:
+ helix (H), extended (E) normally termed sheet, turn (T) and coil (C) are each plotted out across the screen in four strips. Below this 
+is a "decision" strip (D) in which a single dot is poltted for the higest scoring structure class at each point. Here we see a sequence that is predicted to be predominantly helical.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.6\tab Calculating the composition and molecular weight of a sequence.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+Select "Count amino acid composition". The composition and molecular weight are displayed as in figure 11.6.. Each column contains the one letter code for the amino acid, the number of occurrences of that amino acid in the sequence, and the number expresse
+d as a percentage, and its molecular weight.\par 
+\pard\plain \li220\ri280\sb200\sl220\box\brsp100\brdrth \f4\fs16 Sequence composition\par 
+\pard \li220\ri280\sl220\box\brsp100\brdrth A   C     S     T     P     A     G     N     D     E     Q     B     Z     H\par 
+N   0.   14.   19.   12.   30.   26.    3.   10.   11.    4.    0.    0.    0.\par 
+%   0.0   5.3   7.3   4.6  11.5   9.9   1.1   3.8   4.2   1.5   0.0   0.0   0.0\par 
+W    0. 1219. 1921. 1165. 2132. 1483.  342. 1151. 1420.  513.    0.    0.    0.\par 
+\par 
+A   R     K     M     I     L     V     F     Y     W     -     X     ?      \par 
+N   7.    7.   10.   15.   39.   23.   13.   11.    8.    0.    0.    0.    0.\par 
+%   2.7   2.7   3.8   5.7  14.9   8.8   5.0   4.2   3.1   0.0   0.0   0.0   0.0\par 
+W 1093.  897. 1312. 1697. 4413. 2280. 1913. 1795. 1490.    0.    0.    0.    0.\par 
+\pard \li220\ri280\sl220\keepn\box\brsp100\brdrth Total molecular weight=   28256.254\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 11.6\tab A typical molecular weight and composition display. It includes the residue type, their number, their percentage and their contribution to the molecular weight.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab The methods described in the chapters on motif and pattern searching can also be used to search for specifi
+c structures. For example a sequence can be searched for all the structures contained in the PROSITE motif library.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab It is often convenient to produce displays in which several of the plots described above appear together on the screen.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Kyte, J. and Doolittle, R.F. 1982. A simple method for displaying the hydropathic character of a protein.  {\i J.Mol. Biol}. {\b 157}\:105-132. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Eisenberg, D., Schwarz, E., Komaromy, M., and Wall, R. 1984. Analysis of membrane and surface protein sequences with the hydrophobic moment plot.  {\i J. Mol. Biol.} {\b 179}\:125-142.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Schiffer,M and Edmundson,A.B. 1967 Use of helical wheels to represent the structures of proteins and to identify the segments with helical potential.  {\i Biophys. J}. {\b 7}, 121-135.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Garnier, J., Osguthorpe, D.J., and Robson, B. 1978. Analysis of the accuracy and implications of simple methods for  predicting the secondary structure of globular proteins.  {\i J. Mol. Biol}. {\b 120}\:
+97-120.\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 12.  Searching for Motifs in Protein Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Searching for exact matches.\par 
+2.2\tab Searching for percentage matches to consensus sequences\par 
+2.3\tab Searching for consensus sequences using a score matrix\par 
+2.4\tab Using weight matrices for searching protein sequences\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The program PIP contains several ways of defining and searching for motifs (1,2).  We describe searches for exact matches and percentage matches, the use of score matrices and the creation and use of weight matrices. All of the searches produce 
+both listed and graphical output.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Searching for exact matches.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The routine for finding and displaying the positions of exact matches to sequences can display its results in various forms. It is equivalent to the restriction enzyme search routine in the nucleotide analysis programs.  The sequences to be searched for ca
+n be typed on the keyboard or read from files. The format of these files is given in the notes. Here we give only a single example of the use of the routine which shows how to produce a plot of the positions of all amino acid types in a sequence.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Search".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab 
+Select "Input source" as "All acids file". A number of standard files are available and users may also have their own. The one selected simply contains the one letter codes for all the standard amino acids.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Accept "Search for all names". The alternative allows users to select a subset of the entries in the file by name.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Select "Order results name by name".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Reject "List matches". If results are listed the output gives the name and position of each match and also the separations between matches.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The results will then appear in the form shown in figure 12.1. \par 
+\pard\plain \li80\ri80\sl220\keepn\box\brsp40\brdrth \f4\fs16 {{\pict\macpict\picw441\pich182 
+14a4ffffffff00b501b81101a0008201000affffffff00b501b8090000000000000000310000000000b201b798002a000000000083014400000000008301440000000000b201b7000102d70020f90002020080fd000a8000008000010204200201fb000620000200401004fe0020f90002020080fd000a8000008000010204
+200201fb000620000200401004fe00220050fa0002020080fd000a8000008000010204200201fb000620000200401004fe00220050fa0002020080fd000a8000008000010204200201fb000620000200401004fe00220070fa0002020080fd000a8000008000010204200201fb000620000200401004fe00220020fa000202
+0080fd000a8000008000010204200201fb000620000200401004fe00220020fa0002020080fd000a8000008000010204200201fb000620000200401004fe0009fd000007deff01c00005d900014000070050da00014000070050da00014000070050da00014000070070da000140000b0070fe000007deff01c00025fb0018
+c340000e04308008944000004080c01000400404880128004afe000340481008fd00270050fc0018c340000e04308008944000004080c01000400404880128004afe000340481008fd00270050fc0018c340000e04308008944000004080c01000400404880128004afe000340481008fd00270050fc0018c340000e043080
+08944000004080c01000400404880128004afe000340481008fd00270050fc0018c340000e04308008944000004080c01000400404880128004afe000340481008fd000b0020fe000007deff01c00026fc00018004fe000a0100400001000a00020120fc0008224412200041000820fe000308010000280070fd00018004fe
+000a0100400001000a00020120fc0008224412200041000820fe000308010000280020fd00018004fe000a0100400001000a00020120fc0008224412200041000820fe000308010000280020fd00018004fe000a0100400001000a00020120fc0008224412200041000820fe000308010000280020fd00018004fe000a0100
+400001000a00020120fc0008224412200041000820fe000308010000280020fd00018004fe000a0100400001000a00020120fc0008224412200041000820fe00010801ff0009fd000007deff01c00028fd0005014041802010fe00fe101900040500180001080080010084001804028000500500000480002a0020fe000501
+4041802010fe00fe101900040500180001080080010084001804028000500500000480002a0050fe0005014041802010fe00fe101900040500180001080080010084001804028000500500000480002a0060fe0005014041802010fe00fe101900040500180001080080010084001804028000500500000480002a0010fe00
+05014041802010fe00fe101900040500180001080080010084001804028000500500000480000b0070fe000007deff01c00026fc0014040010042000890000400310080040004180112058fe0008010401100800008004fd00280060fd0014040010042000890000400310080040004180112058fe00080104011008000080
+04fd00280050fd0014040010042000890000400310080040004180112058fe0008010401100800008004fd00280070fd0014040010042000890000400310080040004180112058fe0008010401100800008004fd00280050fd0014040010042000890000400310080040004180112058fe0008010401100800008004fd0028
+0050fd0014040010042000890000400310080040004180112058fe0008010401100800008004fd0009fd000007deff01c00027fd0004040a000080fc00092a010808100001000090fe000e040100008021048630000050008000290020fe0004040a000080fc00092a010808100001000090fe000e04010000802104863000
+0050008000290050fe0004040a000080fc00092a010808100001000090fe000e040100008021048630000050008000290050fe0004040a000080fc00092a010808100001000090fe000e040100008021048630000050008000290050fe0004040a000080fc00092a010808100001000090fe000e0401000080210486300000
+500080000b0070fe000007deff01c000230020fa00070800801009010408fc000920000090000020200120fe000301000001fd00230060fa00070800801009010408fc000920000090000020200120fe000301000001fd00230050fa00070800801009010408fc000920000090000020200120fe000301000001fd00230070
+fa00070800801009010408fc000920000090000020200120fe000301000001fd00230040fa00070800801009010408fc000920000090000020200120fe000301000001fd00230040fa00070800801009010408fc000920000090000020200120fe000301000001fd0009fd000007deff01c00021fd00080100880004800000
+40fd0002100101fc0005020000101440fa000022fe00230050fe0008010088000480000040fd0002100101fc0005020000101440fa000022fe00230070fe0008010088000480000040fd0002100101fc0005020000101440fa000022fe00230070fe0008010088000480000040fd0002100101fc0005020000101440fa0000
+22fe00230050fe0008010088000480000040fd0002100101fc0005020000101440fa000022fe000b0050fe000007deff01c0001ffd000604000001400108fd000010fc000028f8000020fe000380000080fe00210050fe000604000001400108fd000010fc000028f8000020fe000380000080fe00210070fe000604000001
+400108fd000010fc000028f8000020fe000380000080fe00210050fe000604000001400108fd000010fc000028f8000020fe000380000080fe00210050fe000604000001400108fd000010fc000028f8000020fe000380000080fe000b0050fe000007deff01c00029fd00230220000410c020462080000081000024028812
+06016000a0005000084842100c48208028ff0029fd00250220000410c02046208000008100002402881206016000a0005000084842100c4820802800002b0040fe00250220000410c02046208000008100002402881206016000a0005000084842100c4820802800002b0040fe00250220000410c020462080000081000024
+02881206016000a0005000084842100c4820802800002b0040fe00250220000410c02046208000008100002402881206016000a0005000084842100c4820802800002b0040fe00250220000410c02046208000008100002402881206016000a0005000084842100c4820802800000b0070fe000007deff01c00026fc000008
+fd000c06000200008004010840000001fe000016fd000a5800044c00040000620000280050fd000008fd000c06000200008004010840000001fe000016fd000a5800044c00040000620000280060fd000008fd000c06000200008004010840000001fe000016fd000a5800044c00040000620000280060fd000008fd000c06
+000200008004010840000001fe000016fd000a5800044c00040000620000280060fd000008fd000c06000200008004010840000001fe000016fd000a5800044c000400006200000b0070fe000007deff01c00027fc0012540430210000800802800860b2a20100001808fe0004100a821022fd0005020020900000290020fd
+0012540430210000800802800860b2a20100001808fe0004100a821022fd0005020020900000290020fd0012540430210000800802800860b2a20100001808fe0004100a821022fd0005020020900000290020fd0012540430210000800802800860b2a20100001808fe0004100a821022fd0005020020900000290020fd00
+12540430210000800802800860b2a20100001808fe0004100a821022fd0005020020900000290020fd0012540430210000800802800860b2a20100001808fe0004100a821022fd000302002090ff0009fd000007deff01c0001bfb00011008fc000040fc000008fd000001f9000001fe000002fd001d0050fc00011008fc00
+0040fc000008fd000001f9000001fe000002fd001d0050fc00011008fc000040fc000008fd000001f9000001fe000002fd001d0070fc00011008fc000040fc000008fd000001f9000001fe000002fd001d0050fc00011008fc000040fc000008fd000001f9000001fe000002fd000b0050fe000007deff01c00027fb002304
+488809088d15210106240210080004400048001502010223060000800082000c500000290020fc002304488809088d15210106240210080004400048001502010223060000800082000c500000290050fc002304488809088d15210106240210080004400048001502010223060000800082000c500000290070fc00230448
+8809088d15210106240210080004400048001502010223060000800082000c500000290050fc002304488809088d15210106240210080004400048001502010223060000800082000c500000290070fc002104488809088d15210106240210080004400048001502010223060000800082000c50ff0009fd000007deff01c0
+0020fc000001fa000020fa0014010000120020004048000003000004010000020000220070fd000001fa000020fa0014010000120020004048000003000004010000020000220040fd000001fa000020fa0014010000120020004048000003000004010000020000220060fd000001fa000020fa0014010000120020004048
+000003000004010000020000220040fd000001fa000020fa00140100001200200040480000030000040100000200000b0040fe000007deff01c00028fc0024a02800404010200008400080000010080240002021880800200000100020010880418000002a0070fd0024a02800404010200008400080000010080240002021
+880800200000100020010880418000002a0040fd0024a02800404010200008400080000010080240002021880800200000100020010880418000002a0060fd0024a02800404010200008400080000010080240002021880800200000100020010880418000002a0040fd0024a0280040401020000840008000001008024000
+2021880800200000100020010880418000000b0070fe000007deff01c00024fa000a820010400004c008201044fd0006a6000400000102fd000662000020040204ff0024fa000a820010400004c008201044fd0006a6000400000102fd0008620000200402040000260060fb000a820010400004c008201044fd0006a60004
+00000102fd0008620000200402040000260050fb000a820010400004c008201044fd0006a6000400000102fd0008620000200402040000260050fb000a820010400004c008201044fd0006a6000400000102fd0008620000200402040000260050fb000a820010400004c008201044fd0006a6000400000102fd0008620000
+2004020400000b0070fe000007deff01c0000dfa000301000002fb000001e9000f0020fb000301000002fb000001e9000f0050fb000301000002fb000001e9000f0040fb000301000002fb000001e9000f0040fb000301000002fb000001e9000b0070fe000007deff01c00028fc0024022004030450001016004001c02806
+369020a0101a404280048180c49001222052000100002a0020fd0024022004030450001016004001c02806369020a0101a404280048180c49001222052000100002a0050fd0024022004030450001016004001c02806369020a0101a404280048180c49001222052000100002a0070fd0024022004030450001016004001c0
+2806369020a0101a404280048180c49001222052000100002a0050fd0024022004030450001016004001c02806369020a0101a404280048180c49001222052000100000b0050fe000007deff01c00002d700a0008c310002000100b5001038a10096000c010000000200000000000000a1009a0008fffd0000000300000100
+0a00050002000f000a2c000c00150948656c76657469636103001504010d00082e0004000001002b030c0159a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a000e00020018000a2a090157a00097a10096000c010000000200000000000000a1009a0008fffd0000000300000100
+0a00150002001f000a2a070156a00097a10096000c010000000200000000000000a1009a0008fffd00000003000001000a001f00020029000a2a0a0154a00097a10096000c010000000200000000000000a1009a0008fffd00000003000001000a002700020031000a2a080153a00097a10096000c01000000020000000000
+0000a1009a0008fffe00000003000001000a00300002003a000a2a090152a00097a10096000c010000000200000000000000a1009a0008fffd00000003000001000a003900020043000a2a090151a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a00420002004c000a2a090150a0
+0097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a004a00020054000a2a08014da00097a10096000c010000000200000000000000a1009a0008fffd00000003000001000a00530002005d000a2a090148a00097a10096000c010000000200000000000000a1009a0008fffe0000000300
+0001000a005c00020066000a2a09014ca00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a00640002006e000a2a08014ba00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a006e00020078000a2a0a0149a00097a10096000c01000000020000
+0000000000a1009a0008fffe00000003000001000a007600020080000a2a080148a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a00800002008a000a2a0a0147a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a008800020092000a2a08
+0146a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a00900002009a000a2a080145a00097a10096000c010000000200000000000000a1009a0008fffd00000003000001000a0099000200a3000a2a090144a00097a10096000c010000000200000000000000a1009a0008fffe0000
+0003000001000a00a2000200ac000a2a090143a00097a10096000c010000000200000000000000a1009a0008fffe00000003000001000a00aa000200b4000a2a080141a00097a0008da00083ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb40\sa120\sl240\tx1140 \f21\fs20 Figure 12.1\tab Typical graphical output from "Search for exact matches" in which the position of each matching string (here individual amino acid types) is marked.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Searching for percentage matches to sequences\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find percentage matches".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "Type in strings". The alternative allows the string to be extracted from a named file.\par 
+3.\tab Reject "Keep picture". This will cause the graphics window to be cleared. The alternative leaves it unchanged.\par 
+4.\tab Define "String". Type in the search string. When the program cycles round to this point again the previous string will be offered as a default.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Percent match". The search is performed, the results are presented graphically, the number of matches displayed, and the scores and positions of the top 10 matches displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define the number of matches to "Display". For the number of matches chose
+n the program will display the search string and matching sequence written one above the other with matching characters indicated by asterisk symbols. The program now cycles round to step 3.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Searching for sequences using a score matrix\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+A score matrix gives a score for the alignment of each possible pair of sequence symbols. This method is more sensitive than the simple percentage match search. The default matrix MDM78 used by this program is shown in figure 12.2.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Find matches using a score matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Accept "Type in strings". The alternative allows the string to be extracted from a named file.\par 
+3.\tab Reject "Keep picture". This will cause the graphics window to be cleared. The alternative leaves it unchanged.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Define "String". Type in the search string. When the program cycles round to this point again the previous string will be offered as a default. The program displays the minimum and maximum possible scores for the string.\par 
+5.\tab Define "Score". The search is performed, the results are presented graphically, the number of matches displayed, and the scores and positions of the top 10 matches displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab 
+Define the number of matches to "Display". For the number of matches chosen the program will display the search string and matching sequence written one above the other with matching characters indicated by asterisk symbols. The program now cycles round
+ to step 3. An example run is shown in figure 12.3.\par 
+\pard\plain \li220\ri280\sb200\sl220\box\brsp100\brdrth \f4\fs16    C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ? \par 
+\pard \li220\ri280\sl220\box\brsp100\brdrth C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10\par 
+S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10\par 
+T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10\par 
+P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10\par 
+A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10\par 
+G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10\par 
+N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10\par 
+D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10\par 
+E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10\par 
+Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10\par 
+B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10\par 
+Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10\par 
+H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10\par 
+R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10\par 
+K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10\par 
+M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10\par 
+I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10\par 
+L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10\par 
+V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10\par 
+F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10\par 
+Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10\par 
+W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10\par 
+- 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+\pard \li220\ri280\sl220\keepn\box\brsp100\brdrth   10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa300\sl240\tx1140 \f21\fs20 Figure 12.2\tab The amino acid score matrix MDM78.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.4\tab Using weight matrices for searching protein sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+A weight matrix is the most sensitive way of defining a motif. It is a table of values that gives scores for each amino acid type in each position along a motif. For a motif of length 8 amino acids the weight matrix would be a table 8 positions long and, a
+llowing for 26 amino acid symbols, 26  deep. The simplest way of choosing the values for the table is to take an alignment of all known 
+examples of the motif and to count the frequency of occurrence of each amino acid type at each position. These frequencies can be used as the table of weights. When the table is used to search a new sequence the program calculates a score for each position
+ along the sequence by adding or multiplying (see notes) the relevant values in the table. All positions that exceed some cutoff score are reported as matching the original set of motifs.\par 
+\pard \s4\qj\sa120\sl280 How can we select a suitable cutoff score? The simplest way is to ap
+ply the weight matrix to all the known occurrences of the motif - i.e. the set of sequence segments used to create the table - and to see what scores they achieve. The cutoff can be selected accordingly. For convenience the weight matrix is stored as a fil
+e along with its cutoff score, a title that is displayed when the file is read, and a few other values need by the program. A routine for creating weight matrix files from sets of aligned sequences is included in the program. When a search using the weight
+ matrix is performed the program will either list the matching sequence segments or plot their positions as for the other motif search methods.\par 
+\pard\plain \li2000\ri2260\sb200\sl220\box\brsp100\brdrth \f4\fs16 Find matches using a score matrix\par 
+\pard \li2000\ri2260\sl220\box\brsp100\brdrth ? Keep picture (y/n) (y) =\par 
+ ? String=ALPHA\par 
+Minimum score=    23 Maximum score=    72\par 
+? Score (23-72) (72) =60\par 
+\par 
+For score    60 the number of matches=     5\par 
+Scores        62     62     62     61     61\par 
+Positions    120    217    420     54    326\par 
+? Display (0-5) (0) =\par 
+\par 
+       120\par 
+         PLDHD\par 
+          * *\par 
+         ALPHA\par 
+         1\par 
+\par 
+       217\par 
+         ALANT\par 
+         **\par 
+         ALPHA\par 
+         1\par 
+\par 
+       420\par 
+         QLDHG\par 
+          * *\par 
+         ALPHA\par 
+         1\par 
+\par 
+        54\par 
+         SLPGN\par 
+          **\par 
+         ALPHA\par 
+         1\par 
+\par 
+       326\par 
+         ALPII\par 
+         ***\par 
+         ALPHA\par 
+         1\par 
+? Keep picture (y/n) (y) =\par 
+ Default String=ALPHA\par 
+\pard \li2000\ri2260\sl220\keepn\box\brsp100\brdrth ? String=!\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa420\sl240\tx1140 \f21\fs20 Figure 12.3\tab An example of the listed output from "Search using a score matrix".\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.4.1\tab Creating a weight matrix file from a set of aligned sequences\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Motif search using weight matrix".\par 
+2.\tab Select "Make weight matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Define "Name of aligned sequences file". We assume the file of aligned sequences has already been created (see note 5). The program reads and displays the contents of the file numbering each sequence as it goes. Then it displays the length of the longes
+t sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Accept "Sum logs of weights". The alternative is to sum the weights when calculating scores (see note 6). \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Accept "Use all motif positions". The alternative allows the user to define a "mask" which i
+dentifies positions within the motif that should be ignored when the matrix is created (see note 7). The program now calculates the weights and applies them in turn to each of the sequences in the file. The number and score for each sequence is displayed, 
+followed by the top, bottom and mean scores and the standard deviation. In addition the mean plus and minus 3 standard deviations is displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Cutoff score". The default is the mean minus 3 standard deviations, but users may, for example, decide to use the lowest score obtained by the sequences in the file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Define "Top score for scaling plots". This parameter is used by the graphics output routine when scaling the plots. Its value will influence the height of lines plotted to represent matches.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab 
+Define "Position to identify". When a search is performed it is not always appropriate to report the position of a match relative to the leftmost amino acid in the motif. For example when performing a helix-turn-helix motif search we may want to know
+ the position of the well conserved glycine rather than the position of the first amino acid in the matrix. The "Position to identify" allows the user to define which amino acid is marked. The amino acids in the table are number 1,2,3 and so on.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Define a "Title". This is a title that will be displayed when the matrix file is read prior to performing a search. It is limited to 60 characters.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 11.\tab Define "Name for new weight matrix file". Give a name for the weight matrix file.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 See the example run in figure 12.4.\par 
+\pard\plain \li1240\ri1180\sb300\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 Motif search using weight matrix\par 
+\pard \li1240\ri1180\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Select operation\par 
+X  1 Use weight matrix\par 
+   2 Make weight matrix\par 
+   3 Rescale weight matrix\par 
+? Selection  (1-3) (1) =2\par 
+? Name of aligned sequences file=atpbinding.seq\par 
+     1 GETLGIVGESGSG\par 
+     2 GESLGVVGESGGGKSTFAR OppF\par 
+     3 GDVISIDGSSGSGKSTFLR HisP\par 
+     4 GEFVVFVGPSGGGKSTLLR MalK E. coli\par 
+     5 NQVTAFIGPSGGGKSTLLR PstB\par 
+     6 GRVMALVGENGAGKSTMMK RbsA(N)\par 
+     7 GEVIGIVGRSGSGKSTLTK HlyB\par 
+     8 GECFGLLGPNGAGKSTITR NodI R. leguminosarum\par 
+     9 GEMAFLTGHSGAGKSTLLK FtsE E. coli\par 
+    10 GQRELIIGDRQTGKTALAI ATPase\par 
+    11 GGKVGLFGGAGVGKTVNMM ATPase\par 
+    12 GRIVEIYGPESSGKTTLTL RecA\par 
+    13 RSNLLVLAGAGSGKTRVLV UvrD\par 
+    14 GGKIGLFGGAGVGKTVGIM ATPase Bovine\par 
+    15 SKIIFVVGGPGSGKGTQCE Adenylate Kinase Rabbit\par 
+    16 NQSILITGESGAGKTVNTK Myosin Rabbit\par 
+    17 HVNVGTIGHVDHGKTTLTA EF-Tu E. coli\par 
+    18 YRNIGISAHIDAGKTTERI EF-G E. coli\par 
+    19 EYKLVVVGARGVGKSALTI v-ras (HARVEY)\par 
+    20 EYKLVVVGASGVGKSALTI v-ras (KIRSTEN)\par 
+    21 EYKLVVVGAVGVGKSALTI pEJ BLADDER CARCINOMA TRANSFORMING\par 
+    22 EYKLVVVGAGGVGKSALTI pEJ BLADDER CARCINOMA CELLULAR\par 
+Length of motif    19\par 
+? Sum logs of weights (y/n) (y) =\par 
+ ? Use all motif positions (y/n) (y) =\par 
+Applying weights to input sequences\par 
+   1      -36.651 GETLGIVGESGSGKSQSLR\par 
+   2      -35.780 GESLGVVGESGGGKSTFAR\par 
+   3      -38.180 GDVISIDGSSGSGKSTFLR\par 
+   4      -35.403 GEFVVFVGPSGGGKSTLLR\par 
+   5      -39.039 NQVTAFIGPSGGGKSTLLR\par 
+   6      -40.653 GRVMALVGENGAGKSTMMK\par 
+   7      -34.017 GEVIGIVGRSGSGKSTLTK\par 
+   8      -37.454 GECFGLLGPNGAGKSTITR\par 
+   9      -36.474 GEMAFLTGHSGAGKSTLLK\par 
+  10      -43.431 GQRELIIGDRQTGKTALAI\par 
+  11      -40.210 GGKVGLFGGAGVGKTVNMM\par 
+  12      -40.720 GRIVEIYGPESSGKTTLTL\par 
+  13      -45.143 RSNLLVLAGAGSGKTRVLV\par 
+  14      -40.684 GGKIGLFGGAGVGKTVGIM\par 
+  15      -45.197 SKIIFVVGGPGSGKGTQCE\par 
+  16      -39.098 NQSILITGESGAGKTVNTK\par 
+  17      -43.832 HVNVGTIGHVDHGKTTLTA\par 
+  18      -44.817 YRNIGISAHIDAGKTTERI\par 
+  19      -36.305 EYKLVVVGARGVGKSALTI\par 
+  20      -35.101 EYKLVVVGASGVGKSALTI\par 
+  21      -36.305 EYKLVVVGAVGVGKSALTI\par 
+  22      -36.711 EYKLVVVGAGGVGKSALTI\par 
+Top score     -34.017  Bottom score     -45.197\par 
+Mean     -39.146  Standard deviation       3.441\par 
+Mean minus 3.sd     -49.470  Mean plus 3.sd     -28.822\par 
+? Cutoff score (-999.00-9999.00) (-49.47) =\par 
+? Top score for scaling plots (-49.47-999.00) (-28.82) =\par 
+? Position to identify (0-19) (1) =13\par 
+? Title=ATP binding motif\par 
+\pard \li1240\ri1180\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth ? Name for new weight matrix file=atpbinding.wts\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa320\sl240\tx1140 \f21\fs20 Figure 12.4\tab An example run of the creation of a weight matrix from a set of aligned sequences.\par 
+\pard\plain \s9\fi-560\li860\sa60\sl280\tx1140 \b\f20 2.4.2\tab Searching using a weight matrix\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Once a weight matrix has been stored in a file it can be used to search any sequence. Results can be displayed graphically or the matching sequence segments can be listed out with their scores.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Motif search using weight matrix".\par 
+2.\tab Select "Use weight matrix".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Define "Motif weight matrix file". The name of the file containing the weight matrix. The program reads the file and displays its title.\par 
+4.\tab Accept "Use frequencies as weights". The alternative will use the weight matrix file as a definition of a "Membership of set" motif (see note 10).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+Define "Cutoff score". The default will be the value set when the weight matrix file was created. If the score is negative the program will calculate sums of logs of frequencies, otherwise it will add frequencies.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Accept "Plot results". Alternatively they will be listed.\par 
+The results will appear.\par 
+\pard\plain \s5\sa60\sl320\tx560 \b\f20\fs28 \page 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab The files containing the definitions of peptides that can be be searched for by the exact match search routine have the following format. Each name is followed by a /, th
+en each of its peptide sequences is followed by a /. The last peptide sequence for each name is followed by //.  For example a file might contain the following.\par 
+\pard \s7\qj\li1720\sb200\sa120\sl280\tx1880 Acidic/D/E//\par 
+\pard \s7\qj\li1720\sa120\sl280\tx1880 Basic/R/K/H//\par 
+Glyco/N-S/N-T//\par 
+\pard \s7\qj\fi-560\li560\sb200\sa120\sl280\tx560 \tab Users could then search for these named sets of sequences. Note that the symbol - matches any amino acid.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab To search for a subset of the names in a file employed by exact match routine the user should reject "Search for all names" and the program will ask for the names wanted and extract their sequences 
+from the file. Alternatively, if a user was always using the same subset, then a file containing only those names could be created. This file would then be selected as "Personal file" for "Input source".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The exact match routine also allows names and their sequences to be entered on the keyboard. This is selected as "Keyboard" for "Input source", and the program will prompt for names and their sequences. In this way the routine can be used to search for 
+exact matches to any short sequence. \par 
+4.\tab For this pr
+ogram a motif is a short segment of sequence of fixed length. More complex structures termed "patterns" which we define as sets of motifs separated by varying gaps, are covered in another chapter. The current chapter should be read before the chapter on pa
+tterns. \par 
+5.\tab The files of aligned sequences used to make weight matrices have the following format. Each sequence should be on a separate line. The sequence should start in column 2 and is terminated by a new line or a space.  Anything after the space is tr
+eated as a comment. The files can be created by previous searches or using an editor.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab 
+The frequencies in the weight matrix can be used in two ways to calculate scores for sequences. Some users prefer to add the frequencies to give a total score, and others to multiply them by summing their logs. If we regard the frequencies as probabilit
+ies then multiplication seems the correct procedure. The user chooses which method will be used when the weight matrix is created, however the choice can be overridden wh
+en the matrix is used. If multiplication is selected then all results will presented as sums of logs.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab 
+Masking the weight matrix is particularly useful in cases where a limited number of examples of a motif are available, or when the motif may have several components. In the first case the limited number of examples may make the matrix unrepresentative o
+f the motif because the amino acids in the unconserved positions may bias the results of searches.  We stated that a motif might have several components\: 
+for example it might have both structural and specificity components. We may want to separate out the two parts and again masking provides such a facility.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 8.\tab 
+The weight matrix handling routine contains a further option "Rescale weight matrix". If the user has edited a weight matrix to change the frequency values this provides a way of selecting a new cutoff score.  It allows users to read in a set of aligned
+ sequences and a weight matrix and to apply the matrix to the set of sequences to see the range of scores achieved. A new weight matrix file contining the selected cutoff score is written to disk.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab 
+The program contains no hardwired motifs as we expect most sites that use the programs to accumulate their own libraries of motifs and patterns, and to use the PROSITE library, both of which users can employ by simply knowing the names of the correspond
+ing files.\par 
+10.\tab The weight matrix search can also used as a "Membership of a set" search. This means that at each position in the motif, any amino acid type tha
+t is non-zero in the weight matrix is counted as a match and scores a value 1. See the chapter on searching protein sequences for patterns.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R.  1988.  Methods to define and locate patterns of motifs in sequences.  {\i CABIOS} {\b 4(1)}\:53-60.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Staden, R. 1990. Searching for patterns in protein and nucleic acid sequences.  (in) {\i Methods in Enzymology} R.F. Doolittle (ed.),  {\b 183}\:193-211  (Academic Press, New York).\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 13.  Using Patterns to Analyse Protein Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 1.1\tab Introduction to the PROSITE motif library\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Creating a pattern file containing a weight matrix motif and a membership of a set motif.\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.2\tab Searching a sequence using a pattern file\par 
+2.3\tab Comparing a sequence against a library of patterns including PROSITE\par 
+2.4\tab Searching libraries for patterns\par 
+2.5\tab Preparing the PROSITE motif library for use by the programs\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Here we describe one of the most powerful facilities provided by the program PIP\: the ability to d
+efine and search sequences or libraries of sequences for complex patterns of motifs. In  another chapter we give details of seaching for individual motifs but here we show how to create individual patterns and libraries of patterns and to use them to searc
+h sequences. Once a pattern has been defined and stored in a file it can used to search any sequence. In addition if users want to routinely screen sequences against libraries of patterns this can be achieved by use of files of file names. For example, the
+ program can use the PROSITE protein motif library. The program can produce several alternative forms of output. It will display the segment of sequence matching each individual motif in the pattern, display all the sequence between and including the two o
+utermost motifs, produce a description of the match in the form of a SWISSPROT feature table, or draw a simple graphical plot.\par 
+\pard \s4\qj\sa120\sl280 Towards the end of the chapter we describe how a related program PIPL is used to search libraries of sequences to find patterns. This program can produce alignments of sequence families.\par 
+\pard \s4\qj\sa120\sl280 
+Patterns are defined as sets of motifs with variable spacing. Each motif in a pattern can be defined using any of several methods, and their positions relative to one other are defined in terms of minimum and maximum separations. In addition, by the use of
+ logical operators, each motif can be declared to be essential (the AND operator), optional (the OR operator), or forbidden (the NOT operator). The following methods (termed "classes" by the program) for defining motifs are provided\:
+ 1) exact match to a short sequence; 2) percentage match to a short sequence; 3) match to a short sequence using a score matrix and cutoff score; 4) match to a weight matrix; 5) direct repeat; 6) membership of a set. \par 
+\pard \s4\qj\sa120\sl280 
+The motifs in a pattern are numbered sequentially and motif spacing is defined in the following way. When a new motif is added to a pattern the user specifies the "Reference motif" by its number and then a "Relative start position". The "Relative start pos
+iti
+on" is defined by taking the first base of the "Reference motif" as position 1, the next as 2, and so on. Then the user defines the allowed variation in the spacing by specifying the "Number of extra positions". Notice that the position of a motif can be d
+efined relative to any other motif, and that a negative "Relative start position" declares the motif to be to the left of its "Reference motif".\par 
+\pard \s4\qj\sa120\sl280 The probability of finding each individual motif in the current sequence, the product of the probabilities for 
+all the motifs in a pattern "Probability of finding pattern", and the "Expected number of matches" is calculated and displayed by the program. In addition to the cutoffs used for the individual motifs, users can apply two pattern cutoffs\:
+ "Maximum pattern probability" and "Minimum pattern score".\par 
+\pard \s4\qj\sa120\sl280 Below we describe\: how to create a pattern;  how to use a pattern file to search a sequence; how to use a "File of pattern file names" to search a sequence for a whole library of patterns; how to use a pattern file
+ to search a whole library of sequences; how to reformat the PROSITE motif library into a form compatible with these search programs. To describe how to create a pattern file we first show all the steps to make one containing two motifs, and then, to save 
+space, the parts specific to the individual motif types are sketched in the notes section.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 1.1\tab Introduction to the PROSITE motif library\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 A library of protein motifs (in our terminology, because they include variable gaps, many would be called patterns) has
+ recently become available from Amos Bairoch, Departement de Biochimie Medicale, University of Geneva. Currently it contains over 500 patterns/motifs and arrives on tape or cdrom in two files\:
+ a .DAT file and a .DOC file. There is also a user documentation file PROSITE.USR. Here we outline the library structure and what is required to prepare the PROSITE library for use by our programs. A typical entry in the .DAT file is shown in figure 13.1.
+\par 
+\pard \s4\qj\sa120\sl280 Each entry has an accession number (in figure 13.1 PS00197), a pattern definition (in figure 13.1 C-x(1,2)-[STA]-x(2)-C-[STA]-\{P\}-C) and a documentation file cross reference (in figure 13.1 PDOC00175). This pattern means\:
+ C, gap of 1 or 2, any of STA, gap of 2, C, any of STA, not P, C.\par 
+\pard \s4\qj\sa120\sl280 
+We need to convert all of these patterns into our pattern definitions (as membership of a set, with the appropriate gap ranges) and write each into a separate pattern file with corresponding "membership of a set" weight matrices. After the conversion each 
+pattern file is named accession_number.pat (here PS00197.PAT). The corresponding matrix files are accession_number.wtsa, accession_number.wtsb, etc for however many are needed (here PS00197.WTSA and PS00197.WTSB)\:
+ two are needed because of the variable gap.\par 
+
+n addition we can optionally split the .DAT and .DOC files into separate files, one for each entry, with names accession_number.dat and accession_number.doc. Also we create an index for the library which gives a one line description of each pattern, and en
+ds with the pattern file and do
+cumentation file numbers. The start of the file is shown in figure 13.2. So, refering to figure 13.2, the name of the pattern file for Glycosaminoglycan attachment site is PS00002.PAT, and for the documentation file PDOC00002.DOC\par 
+\pard \s4\qj\sa120\sl280 
+Finally we create a file of file names for all the patterns in the library. If this file of file names is PROSITE.NAM then to use the complete PROSITE library from program PIP, users select "pattern searcher" and choose the option "use file of pattern file
+ names", and give the file name PROSITE.NAM. For any matches found, the accession number and pattern title will be displayed.\par 
+\pard\plain \li360\ri440\sl220\pagebb\box\brsp40\brdrth \f4\fs16 ID   2FE2S_FERREDOXIN; PATTERN.\par 
+\pard \li360\ri440\sl220\box\brsp40\brdrth AC   PS00197;\par 
+DT   APR-1990 (CREATED); APR-1990 (DATA UPDATE); APR-1990 (INFO UPDATE).\par 
+DE   2Fe-2S ferredoxins, iron-sulfur binding region signature.\par 
+PA   C-x(1,2)-[STA]-x(2)-C-[STA]-\{P\}-C.\par 
+NR   /RELEASE=14,15409;\par 
+NR   /TOTAL=69(69); /POSITIVE=63(63); /UNKNOWN=0(0); /FALSE_POS=6(6);\par 
+NR   /FALSE_NEG=5(5);\par 
+CC   /TAXO-RANGE=A?EP?; /MAX-REPEAT=1;\par 
+CC   /SITE=1,iron_sulfur; /SITE=5,iron_sulfur; /SITE=8,iron_sulfur;\par 
+DR   P15788, FER$APHHA , T; P00250, FER$APHSA , T; P00223, FER$ARCLA , T;\par 
+DR   P00227, FER$BRANA , T; P07838, FER$BRYMA , T; P13106, FER$BUMFI , T;\par 
+DR   P00247, FER$CHLFR , T; P07839, FER$CHLRE , T; P00222, FER$COLES , T;\par 
+DO   PDOC00175;\par 
+\pard \li360\ri440\sl220\keepn\box\brsp40\brdrth //\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 13.1\tab A typical entry from the PROSITE library\par 
+\pard\plain \li440\ri480\sb300\sl220\box\brsp100\brdrth \f4\fs16 IN-glycosylation site.                                          00001,00001\par 
+\pard \li440\ri480\sl220\box\brsp100\brdrth Glycosaminoglycan attachment site.                              00002,00002\par 
+Tyrosine sulfatation site.                                      00003,00003\par 
+\pard \li440\ri480\sl220\keepn\box\brsp100\brdrth cAMP- and cGMP-dependent protein kinase phosphorylation site.   00004,00004\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 13.2\tab The start of the index created by the conversion program\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+In order to make the PROSITE library useable by the search programs it is only necessary to run a program named SPLITP3. Two other programs, SPLITP1 and SPLITP2, only make the original files marginally easier to manage and produce an index.   SPLITP1 split
+s the PROSITE.DAT file to create a separate file for each entry. Each file is automatically named PSentry_number.DAT. In addition it creates an index for the library (see above).\par 
+\pard \s4\qj\sa120\sl280 SPLITP2 performs the same operation for the PROSITE.DOC file, except that no index is created. Files are named PSentry_number.DOC.\par 
+\pard \s4\qj\sa120\sl280 
+SPLITP3 creates a separate pattern file and weight matrix files for each PROSITE entry from the file PROSITE.DAT. Pattern files are named PSentry_number.PAT, weight matrix files PSentry_number.WTSA, PSentry_number.WTSB, etc. The pattern title is the one li
+ne description of the motif. SPLITP3 also creates a file of file names. Notice that it will ask for a path name so that the path can be included in the file of file names. This is the path to the directory in which the pattern files are stored\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\fi-560\li560\sb240\sa60\sl280\tx560\tx920 \b\f20 2.1\tab Creating a pattern file containing a weight matrix motif and a membership of a set motif.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher"\par 
+2.\tab Select "Pattern definition mode" as "Use keyboard".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Select "Results display mode" as "Inclusive". The alternatives are listed in the introduction.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Select "Motif definition mode" as "Weight matrix"\par 
+5.\tab Define "Motif name". Each motif can be given an 8 character name\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Define "Weight matrix file name". Type in the name of the file containing the weight matrix. The program will display the probability of finding the motif.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 7.\tab Select "Motif definition mode" as "Membership of a set".\par 
+8.\tab Define "Motif name".\par 
+9.\tab Select "Logical operator" as "AND". The alternatives are "OR" and "NOT".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Select "Number of reference motif". At this stage the only choice is 1 and this is the default.\par 
+11.\tab Define "Relative start position". The base position relative to the "Reference motif". See the introduction.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 12.\tab Define "Number of extra positions".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 13.\tab Select input mode as "Keyboard". The alternative is an existing file in the form of a weight matrix.\par 
+14.\tab Define "String". Type in the sets of allowed residue types using the one letter code. See note 1\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 15.\tab Define the "Minimum matches". This is the number of positions within the motif that must match. The default is that 
+all positions must match but users may want to allow some flexibility by giving a lower score.\par 
+\tab The program now cycles round to step 7 and all subsequent passes round the loop to add further motifs to the pattern would differ only in the details for the different motif "classes".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 16.\tab Select "Pattern complete"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 17.\tab Accept "Save pattern in a file". The alternative does not save the pattern and so it can only be used once on the current sequence.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 18.\tab Define "Pattern definition file". Give a name for the new file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 19.\tab "Define "Pattern title". All patterns can have a 60 character title that can be displayed when the pattern file is read and the sequence searched.\par 
+20.\tab Define "Weight matrix file name". The membership of a set motifs are stored in the form of weight matrices, and so the program needs the user to define a file name.\par 
+21.\tab Define "Title". Type in a title for the weight matrix like file. The title will be displayed when the file is read.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab The program will now display a detailed textual description of the pattern, the "Probability of finding the pattern" and the "Expected number of matches" (see figure 13.3).\par 
+22.\tab Define "Maximum pattern probability". Yes maximum\: any match with a greater probability of being found will be rejected. If no value is specified the search will be quicker (see notes).\par 
+\pard\plain \li1240\ri1360\sl220\pagebb\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 Pattern searcher\par 
+\pard \li1240\ri1360\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Select pattern definition mode\par 
+X  1 Use keyboard\par 
+   2 Use pattern file\par 
+   3 Use file of pattern file names\par 
+? Selection  (1-3) (1) =1\par 
+Select results display mode\par 
+X  1 Motif by motif\par 
+   2 Inclusive\par 
+   3 Graphical\par 
+   4 SWISSPROT feature table\par 
+? Selection  (1-4) (1) =2\par 
+Select motif definition mode\par 
+X  1 Exact match\par 
+   2 Percentage match\par 
+   3 Cut-off score and score matrix\par 
+   4 Cut-off score and weight matrix\par 
+   5 Direct repeat\par 
+   6 Membership of set\par 
+   7 Pattern complete\par 
+? Selection  (1-7) (1) =4\par 
+? Motif name=atp\par 
+? Weight matrix file name=atpbinding.wts\par 
+ ATP binding\par 
+Probability of score   -47.8010 = 0.302E-04\par 
+Select motif definition mode\par 
+   1 Exact match\par 
+   2 Percentage match\par 
+   3 Cut-off score and score matrix\par 
+X  4 Cut-off score and weight matrix\par 
+   5 Direct repeat\par 
+   6 Membership of set\par 
+   7 Pattern complete\par 
+? Selection  (1-7) (4) =6\par 
+? Motif name=hydro\par 
+Select logical operator\par 
+X  1 And\par 
+   2 Or\par 
+   3 Not\par 
+? Selection  (1-3) (1) =\par 
+? Number of reference motif (1-1) (1) =\par 
+? Relative start position (-1000-1000) (20) =22\par 
+? Number of extra positions (0-1000) (0) =5\par 
+Select input mode\par 
+X  1 Keyboard\par 
+   2 File\par 
+? Selection  (1-2) (1) =\par 
+Separate sets with commas\par 
+? String=ivl,ivl,,,rkhde\par 
+? Minimum matches (1.00-5.00) (3.00) =\par 
+Probability of score     3.000 = 0.145E-01\par 
+Select motif definition mode\par 
+   1 Exact match\par 
+   2 Percentage match\par 
+   3 Cut-off score and score matrix\par 
+   4 Cut-off score and weight matrix\par 
+   5 Direct repeat\par 
+X  6 Membership of set\par 
+   7 Pattern complete\par 
+? Selection  (1-7) (6) =7\par 
+? Save pattern in a file (y/n) (y) =\par 
+? Pattern definition file=_paper.pat\par 
+? Pattern title=atpbinding plus\par 
+? Weight matrix file name=_hydro.wts\par 
+Weight matrix needs a title\par 
+? Title=hydrophobic and + spot\par 
+Pattern description\par 
+atpbinding plus\par 
+Motif  1 named atp      is of class    4\par 
+Which is a match to a weight matrix with score -47.801\par 
+Motif  2 named hydro    is of class    6\par 
+Which is membership of a set with score   3.000\par 
+It is anded with the previous motif.\par 
+Probability of finding pattern = 0.4368E-06\par 
+Expected number of matches  = 0.1350E-02\par 
+? Maximum pattern probability (0.00-1.00) (1.00) =\par 
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =\par 
+{\f22\fs18     162\par 
+}      GQRELIIGDRQTGKTALAIDAIINQR\par 
+Total matches found      1\par 
+\pard \li1240\ri1360\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth Minimum and maximum observed scores      -38.35      -38.35\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa300\sl240\tx1140 \f21\fs20 Figure 13.3\tab The creation and use of a pattern containing a weight matrix motif and a membership of a set motif.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 23.\tab 
+Define "Minimum pattern score". A minimum pattern score only makes sense if all the motifs in the pattern are defined with compatible scoring methods. For example membership of a set motifs and weight matrices using sums of logs are incompatible. Searc
+hing will now commence and any matches displayed using the chosen method. In figure 13.3 we show a typical run i
+n which a pattern containing a weight matrix and a membership of a set motif is created and stored on disk. Figure 13.4 shows the contents of the pattern file. \par 
+\pard\plain \li2260\ri2380\sb200\sl220\box\brsp100\brdrth \f4\fs16  atpbinding plus                        \par 
+\pard \li2260\ri2380\sl220\box\brsp100\brdrth  A4          atp      Class \par 
+atpbinding.wts                          \par 
+ A6          hydro    Class \par 
+      1      Relative motif\par 
+     22      Relative start position\par 
+      5      Number of extra positions\par 
+\pard \li2260\ri2380\sl220\keepn\box\brsp100\brdrth _hydro.wts                              \par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa40\sl240\tx1140 \f21\fs20 Figure 13.4\tab The pattern file created in the worked example shown in figure 13.3.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Searching a sequence using a pattern file\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher"\par 
+2.\tab Select "Pattern definition mode" as "Use pattern file".\par 
+3.\tab Select "Results display mode" as "Inclusive"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Define "Pattern definition file". Type the name of the file containing the pattern. The program will read the file then display its title, a detailed textual description of the pattern, the "Probability of finding the pattern", and the "Expected number 
+of matches".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Define "Maximum pattern probability". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab D
+efine "Minimum pattern score". Searching will now commence and any matches displayed using the chosen method. Figure 13.5 shows a typical run using a pattern file and output in the form of a SWISSPROT feature table.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Comparing a sequence against a library of patterns including PROSITE\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This mode of operation allows a sequence to be searched, in turn, for any number of patterns each stored in a separate pattern file. The names of the files containing the individual patterns must be stored in a simple text 
+file. This file is called "a file of pattern file names" and its name is the only user input required to define the search. The file of file names could contain references to entries in the PROSITE motif library and also include the names of other patterns
+.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Pattern searcher"\par 
+2.\tab Select "Pattern definition mode" as "Use file of pattern file names".\par 
+3.\tab Select "Results display mode" as "Inclusive"\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "File of pattern file names". Type the name of the file containing the list of pattern file na
+mes. The program will read the file and then, in turn, all the pattern files it names. Each of these patterns will be compared against the current sequence but only those that give matches will produce any output. The pattern title and each match will be d
+isplayed.\par 
+\pard\plain \li1240\ri1360\sb320\sl220\box\brsp40\brdrth \f4\fs16  Pattern searcher\par 
+\pard \li1240\ri1360\sl220\box\brsp40\brdrth Select pattern definition mode\par 
+X  1 Use keyboard\par 
+   2 Use pattern file\par 
+   3 Use file of pattern file names\par 
+? Selection  (1-3) (1) =2\par 
+? Pattern definition file=_paper.pat\par 
+Select results display mode\par 
+X  1 Motif by motif\par 
+   2 Inclusive\par 
+   3 Graphical\par 
+   4 SWISSPROT feature table\par 
+? Selection  (1-4) (1) =4\par 
+ ATP binding sequences\par 
+Probability of score   -47.8010 = 0.302E-04\par 
+ hydrophobic and + spot\par 
+Probability of score     3.0000 = 0.145E-01\par 
+\par 
+Pattern description\par 
+\par 
+ atpbinding plus\par 
+Motif  1 named atp      is of class    4\par 
+Which is a match to a weight matrix with score -47.801\par 
+Motif  2 named hydro    is of class    6\par 
+Which is membership of a set with score   3.000\par 
+It is anded with the previous motif.\par 
+Probability of finding pattern = 0.4368E-06\par 
+Expected number of matches  = 0.1350E-02\par 
+? Maximum pattern probability (0.00-1.00) (1.00) =\par 
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =\par 
+\par 
+FT   atp         162    187       Program\par 
+\par 
+Total matches found      1\par 
+\pard \li1240\ri1360\sl220\keepn\box\brsp40\brdrth Minimum and maximum observed scores      -38.35      -38.35\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 13.5\tab Worked example of using a pattern file to search a sequence, and writing the results in the form of a SWISSPROT feature table.\par 
+\pard\plain \s6\sa60\sl280\tx560\tx860 \b\f20 \page 2.4\tab Searching libraries for patterns\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The program PIPL can be used to search whole sequence
+ libraries for patterns. Its use is similar to the pattern search routine described above, except that it does not have the facility for creating pattern files, so they must be created beforehand using PIP. In addition to its obvious application of finding
+ new occurrences of patterns or checking on their frequency it is a useful way of obtaining sequence alignments. It can restrict its search to a list of named entries or can search all but those on a list of entries. It can restrict its output to showing t
+he highest scoring match in each sequence, but by default it will show all matches.\par 
+\pard \s4\qj\sa120\sl280 
+Of its modes of output two require further description. The first "Padded sections" creates a new file for each match. The file will contain the sequence between and including the two outermost motifs in the pattern. It will be gapped to the furthest exten
+t defined by the pattern, which means that if all the files were subsequently written one above the other all the motifs in the pattern would be exactly aligned, with the s
+ections between them containing the requisite numbers of padding characters. The second such mode of output is called "Complete padded sequences". Here the user must know the maximum distance between the leftmost motif and the start of all the sequences th
+at match. A trial run in which only the positions of matches are reported is usually required. The user gives this maximum distance to the program. The program then writes a new file containing the full length of all matching sequences, again maximally gap
+ped (including their left ends) so that they would all align if written above one another. For both of these modes of output the files created are named "entryname" where "entryname" is the name given to the sequence in the sequence library. These modes ar
+e best used with the option "Report all matches" rejected, so that only the best match for each sequence is reported. The sequences can be lined up using the sequence assembly program SAP.\par 
+\pard \s4\qj\sa120\sl280 The searches, which have recently been recoded, are very rapid. For
+ example a search of the current SWISSPROT library for a pattern defining the globin family as 6 weight matrices with widely varying gaps, finds only globins and takes less than 4 minutes using a single processor on an Alliant FX2800. This time includes re
+ading in the whole library as stored in EMBL CDROM format.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select PIPL.\par 
+2.\tab Define "Name for results file."\par 
+3.\tab Select a library.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Select "Search whole library". The alternatives are "Search only a list of entries" and "Search all but a list of entries"
+. The files containing the list of entries should contain one entry name per line, left justified.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab Select "Results display mode" as "Inclusive". The alternatives include "Motif by motif", "Scores only", "Complete padded sequences" and "Padded sections".\par 
+6.\tab Accept "Report all matches". The alternative only shows the best match for each sequence.\par 
+7.\tab Define "Pattern definition file". The name of the file containing the pattern created using PIP. \par 
+\tab The program displays a textual description of the pattern and the expected number of matches per 1000 residues assuming an average amino acid composition.\par 
+8.\tab Define "Maximum pattern probability". The program will run much more quickly if none is given.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab Define "Minimum pattern score".\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The search will start.\par 
+A typical run is shown in figure 13.6\par 
+\pard\plain \li1120\ri1280\sb200\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16 PIPL (Protein interpretation program (library)) V4.1 Jul 1991\par 
+\pard \li1120\ri1280\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth Author\: Rodger Staden\par 
+Searches protein libraries for patterns of motifs\par 
+\par 
+? Name for results file=globin.res\par 
+Select a library\par 
+   1 EMBL nucleotide library                           \par 
+X  2 SWISSPROT protein library                         \par 
+   3 Personal file in PIR format                       \par 
+   4 Personal file in FASTA format                     \par 
+? Selection  (1-4) (2) =\par 
+Library is in EMBL format with indexes\par 
+Select a task\par 
+X  1 Search whole library                              \par 
+   2 Search only a list of entries                     \par 
+   3 Search all but a list of entries                  \par 
+? Selection  (1-3) (1) =\par 
+Select results display mode\par 
+X  1 Motif by motif           \par 
+   2 Inclusive                \par 
+   3 Scores only              \par 
+   4 Complete padded sequences\par 
+   5 Padded sections          \par 
+? Selection  (1-5) (1) =5\par 
+? (y/n) (y) Report all matches n\par 
+? Pattern definition file=globin.pat\par 
+ globin 1                                        \par 
+Probability of score   -34.5300 = 0.197E-02\par 
+ globin 2                                        \par 
+Probability of score   -44.6000 = 0.409E-02\par 
+ globin 3                                        \par 
+Probability of score   -75.1000 = 0.293E-01\par 
+ globin 4                                        \par 
+Probability of score   -36.1000 = 0.147E-01\par 
+ globin 5                                        \par 
+Probability of score   -73.7000 = 0.375E-01\par 
+ globin 6                                        \par 
+Probability of score   -55.9000 = 0.483E-01\par 
+\par 
+Pattern description\par 
+ Globin pattern file                          \par 
+Motif  1 named g1       is of class    4\par 
+Which is a match to a weight matrix with score -34.530\par 
+Motif  2 named g2       is of class    4\par 
+Which is a match to a weight matrix with score -44.600\par 
+and the N-terminal residue can take positions     17 to    22\par 
+relative to the N-terminal end of motif   1\par 
+It is anded with the previous motif.\par 
+Motif  3 named g3       is of class    4\par 
+Which is a match to a weight matrix with score -75.100\par 
+and the N-terminal residue can take positions     27 to    35\par 
+relative to the N-terminal end of motif   2\par 
+It is anded with the previous motif.\par 
+Motif  4 named g4       is of class    4\par 
+Which is a match to a weight matrix with score -36.100\par 
+and the N-terminal residue can take positions     29 to    53\par 
+relative to the N-terminal end of motif   3\par 
+It is anded with the previous motif.\par 
+Motif  5 named g5       is of class    4\par 
+Which is a match to a weight matrix with score -73.700\par 
+and the N-terminal residue can take positions     12 to    16\par 
+relative to the N-terminal end of motif   4\par 
+It is anded with the previous motif.\par 
+Motif  6 named g6       is of class    4\par 
+Which is a match to a weight matrix with score -55.900\par 
+and the N-terminal residue can take positions     29 to    33\par 
+relative to the N-terminal end of motif   5\par 
+It is anded with the previous motif.\par 
+Probability of finding pattern = 0.6273E-11\par 
+Expected number of matches per 1000 residues = 0.2119E-03\par 
+? Maximum pattern probability (0.00-1.00) (1.00) =\par 
+\pard \li1120\ri1280\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth ? Minimum pattern score (-9999.00-9999.00) (-9999.00) =\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa240\sl240\tx1140 \f21\fs20 Figure 13.6\tab A typical run of PIPL using a pattern of 6 weight matrices to search the SWISSPROT library.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Preparing the PROSITE motif library for use by the programs\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 Only the program SPLITP3 is essential for preparing the PROSITE library for use by our programs. \par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select SPLITP3\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Prosite library file". Type the name of the file containing the prosite library (usually PROSITE.DAT).\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Define "Name for file of pattern file names". This is the file of file names that users will employ to search the whole library. It will be convenient for them if an environment variable is defined for this file name.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Path name of motif directory". This is the full path name, including the final /, to the directory in which the converted library will be stored.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+The "exact match" motif class requires a consensus sequence. The "percentage match" motif class requires a consensus sequence and a cutoff score. The "score matrix" motif class uses the MDM78 matrix and requires a consensus sequence and a cutoff score. 
+ The "weight matrix" search only requires the name of the file containing the matrix. The "direct repeat" motif class requires a repeat length, the minimum and maximum gap between the t
+wo occurrences of the repeat, and a minimum score. The "membership of a set" motif class defines sets of residue types that are allowed at each position in the motif. When they are first entered into the pattern they are normally typed on the keyboard, but
+ when they are stored in a file, they are written in the same format as a weight matrix. To enter them on the keyboard use the following format. Type the one letter codes for the set of residue types allowed at each position terminated by a comma (,). For 
+positions where any residue type is allowed simply type an extra comma. For example VLI,FY,,,DE means any of Valine, Leucine or Isoleucine in the first position, either Phenylalanine or Tyrosine in the next position, anything in the next two positions, and
+ Aspartic acid or Glutamic acid in the next. When the pattern is stored on the disk the program will request a name for the file and a title for the motif.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab The details of the probabilty calculations are outside the scope of this article. They are quite 
+rapid and are essential both for assessing the statistical significance of any matches found and for allowing meaningful cutoffs to be applied to patterns. Obviously, in general, cutoff scores are inappropriate for patterns containing a mixture of motif cl
+asses.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+The program calculates the "Probability of finding the pattern" and the "Expected number of matches". The first figure is actually the product of the individual motif probabilities but the latter figure is more useful because it takes into accoun
+t the allowed variation in spacing between motifs and the length of the current sequence. In both cases the composition of the current sequence is also used so that different probabilities would be calculated for other sequences.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+The pattern definition system is very flexible. Assume that a laboratory has a large library of patterns stored in its computer. Different groups or users may want to screen their sequences against different subsets of a pattern library. Each group ther
+efore uses its own "File o
+f pattern file names" which contains only the names of the pattern files that are relevant to their sequences. Of course a pattern may contain only one motif. Hence a library of patterns can include both simple and complex patterns. In the same way a labor
+atory may have a large library of weight matrices defining different motifs and different users may want to combine them in different ways to produce their own patterns.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab Also, of course, a library does not have to be used solely for performing mass screenings\:
+ each individual entry can be used as a single pattern by giving the name of its pattern file - eg pathname/PS00002.PAT.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab 
+Note that 5 of the PROSITE motifs contains the symbols > or < which means that the motifs must appear exactly at the N or C termini of the sequences. Currently our methods have no mechanism for such definitions and, for example KDEL motifs, will be perm
+itted to occur anywhere throughout a sequence.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Staden, R.  1988.  Methods to define and locate patterns of motifs in sequences.  {\i CABIOS} {\b 4(1)}\:53-60.\par 
+2.\tab Staden, R.  1989.  Methods for calculating the probabilities of finding patterns in sequences.  {\i CABIOS} {\b 5(2)}\:89-96.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Staden, R. 1990. Searching for patterns in protein and nucleic acid sequences.  (in) {\i Methods in Enzymology} R.F. Doolittle (ed.),  {\b 183}\:193-211  (Academic Press, New York).\par 
+\pard\plain \s2\qc\sa200\sl480 \b\f20\fs36 \page 14.  Comparing Sequences\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 Table of contents\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Introduction\par 
+2.\tab Methods\par 
+\pard \s7\qj\fi-560\li1700\sa120\sl280\tx1700 2.1\tab Producing a dot matrix plot (or list) of exact matches\par 
+2.2\tab Producing a dot matrix plot using the proportional algorithm\par 
+2.3\tab Producing a dot matrix plot using the quick scan algorithm\par 
+2.4\tab Producing a list of all matching segments using the proportional algorithm\par 
+2.5\tab Calculating the expected scores for the proportional algorithm\par 
+2.6\tab Calculating the observed scores for the proportional algorithm\par 
+2.7\tab Producing an optimal alignment\par 
+2.8\tab Comparing a sequence against a library of sequences\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab Notes\par 
+4.\tab References\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 1.\tab Introduction\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 In this chapter we describe methods for comparing and aligning pairs of nucleic acid or protein 
+sequences.  The program described (SIP), the original version of which was first described in 1982 (1), is based around several methods for producing "dot matrix" plots and includes routines for assessing the statistical significance of the plots, plus a d
+ynamic programming algorithm for finding optimal alignments.  At the end of the chapter we describe a program SIPL that is used for comparing a single sequence against a whole library of sequences.\par 
+\pard \s4\qj\sa120\sl280 We assume the reader is familiar with the general principl
+e of dot matrix diagrams. The program uses a number of different algorithms to calculate the score for each point in a dot matrix and the user defines a minimum score so that only those points in the diagram for which the score is at least this value will 
+be marked with a dot.  The first scoring method finds uninterrupted sections of perfect identity i.e. those that contain no mismatches, insertions or deletions. Generally this method, termed "the identities algorithm" is of limited value, but runs very qui
+ckly. \par 
+\pard \s4\qj\sa120\sl280 
+The second method looks for sections where a proportion of the characters in the sequence are similar, again allowing no insertions or deletions. For a thorough analysis this method, termed "the proportional algorithm", is the best.  The original method, o
+f this type was first described by McLachlan (2) and involves calculating a score for each position in the matrix by summing points found when looking forwards and backwards along a diagonal line of a given length (the window). The algorithm does no
+t simply look for identity but uses a score matrix that contains scores for every possible pair of characters. For comparing amino acid sequences we usually use the score matrix MDM78 (3) which is shown in figure 14.1..  It is also possible to use other ma
+trices, including an identity matrix for proteins.  For nucleic acids we usually use an identity matrix.\par 
+\pard\plain \li220\ri280\sl220\box\brsp100\brdrth \f4\fs16    C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ? \par 
+\pard \li220\ri280\sl220\box\brsp100\brdrth C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10\par 
+S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10\par 
+T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10\par 
+P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10\par 
+A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10\par 
+G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10\par 
+N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10\par 
+D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10\par 
+E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10\par 
+Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10\par 
+B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10\par 
+Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10\par 
+H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10\par 
+R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10\par 
+K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10\par 
+M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10\par 
+I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10\par 
+L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10\par 
+V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10\par 
+F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10\par 
+Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10\par 
+W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10\par 
+- 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+\pard \li220\ri280\sl220\keepn\box\brsp100\brdrth   10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 14.1\tab The amino acid score matrix MDM78.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+For the proportional method plotting dots at the centres of windows that reach the cutoff leads to a persistence effect that, to some extent, can be mitigated by a variation on the method. If, for example, all the high scoring amino acids are clustered at 
+the left end of a particular diagonal segment, dots will continue to be plotted to their right until the window score drops below the cutoff. Instead of plotting a single point for each window that reaches the cutoff score, the variant method plots p
+oints for all the identities that lie in windows that reach the cutoff. Obviously the persistence effect can be more pronounced for long windows and low cutoff scores, but note that the variant method will plot nothing if there are no identities present, a
+nd so similar regions could be missed!  A further variant, useful for comparing a sequence against itself, ignores the main diagonal.\par 
+\pard \s4\qj\sa120\sl280 The third comparison method called "quick scan" is really a combination of the first two, and is similar to the FASTP prog
+ram of Lipman and Pearson (4), but produces a dot matrix diagram. The algorithm is as follows. The dot matrix positions are found for all words of some minimum length (obviously length 1 is most sensitive) that are common to both sequences. Imagine a diago
+nal line running from corner to corner of the diagram, at right angles to the diagonals in the dot matrix, The scores for the common words (according to the current score matrix, e.g. MDM78) are accummulated at the appropriate positions on that imaginary l
+ine, hence producing a histogram. The histogram is analysed to find its mean and standard deviation. The diagonals that lie above some cutoff score (defined in standard deviation units), are rescanned using the proportional algorithm, and a diagram produce
+d. The method is very fast, and is also employed by the library comparison program (see below).\par 
+\pard \s4\qj\sa120\sl280 \par 
+\pard \s4\qj\sa120\sl280 The dynamic programming alignment algorithm contained in the program is based on that of Myers and Miller (5). It guarantees to produce alignments with the opt
+imum score given a score matrix, a gap start penalty, and a gap extension penalty. It is very useful to have the dot matrix methods and the alignment routine together in the same program because it allows users to produce a dot matrix diagram to help selec
+t which regions of the sequence they wish to align. Selection is made by use of the crosshair. The crosshair is positioned first at the bottom left hand end of the segment to be aligned and then at the top right of the segment. When the alignment routine i
+s selected the segment will be aligned. The alignment can replace the original segment of the sequence. By repeated plotting of dot matrices, followed by alignment, very long sequences can easily be aligned. \par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 2.\tab Methods\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.1\tab Producing a dot matrix plot (or list) of exact matches\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method is relatively fast and can be useful for very similar sequences. It marks the position of every exact match of some minimum length with a dot or lists out the matching segments.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Apply identities algorithm".\par 
+2.\tab Define "Identity score". \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.\tab 
+Select "Plot or List". The plot will appear as in figure 14.2, which shows a comparison of two protein sequences using a score of 2. Listed output displays the matching segments and defines their positions. \par 
+\pard\plain \li1700\sb300\sl220\keepn \f4\fs16 {{\pict\macpict\picw283\pich299 
+112800000000012b011b001102ff0c00fffe0000003cb4bc003cb4bc0000000000fc00ef000000000001000a0000000000fc00ef0098801e0000000000fc00ef0000000000000000003cb4bc003cb4bc00000001000100010000000000000000000000000048c23f000000010000ffffffffffff0001000000000000000000
+0000fc00ef0000000000fc00ef000002e30006003fe5ff00f80f0020f6000020f8000020fc000104080d0020fa000302000004f0000048060020e50000081b0020fe00042000000802fb0002100002fe000040fd00031000000817012008fc000004fd000020fe00011001f9000301000008100020fb000001f600018080fa
+00011008150020fe00014002f5000308000004fd000304000008060020e5000008110020f2000312010008fe000020fc0000080e0020f3000020fe000040f8000008060020e50000080d0320000020f3000001f7000008130020fb000040fa000080f90005200000100008110320000080f700042800000440f70000080b04
+2002000001ea00010808110020f7000080f90002012048fc000102080f0020fc000001f60000a0f8000102080a0020fb000020ec0000481c05200010000001fe000010fd000008fc0002010004fe0003800000080c012001f800010202f10000080b0020ea000002fe00010408140320000020f6000302000004fb000008fe
+00000816022010c0f9000020fc00040200100008fc0002200008130020fc000002fe000080fa000010f800010808150320000010fe000002f9000048fd000020fa000008160020fd0005100000040080fa000008fa0003040080080c0020f20002040080f7000008140020fe00010104f600010440fb000001fe0000081200
+20f700041000808010f8000010fe0000080a0020ef000008f80000080a0020fa000010ed000008110020f3000040fb000710100000080000080a0020ee000010f90000080c012080fa00010802ef000008110020f6000040f8000780008000080000080e0020f5000002fe000010f60000080e0020f5000002fe000030f600
+0008180620200000100020fd00041081808010f8000010fe000008110020f6000020f800072000000400000408060020e50000080c0020f60002200004f300000814012410f60002200040fb000010fe000308000008100022f9000008f40006400400100010080a0020fd000080ea0000080c0020fd00010888ec00010108
+110320800004fe00040400001810f0000008130020f900010404fe000001fe000001f7000008060020e5000008100022f9000008f4000640060010000408080020e700020200080e0320800008fc00010802ef0000080a0020f5000008f2000008150020fd000308000040f6000080fd000408000001280d02200020fd0001
+0108ed000008150020fd000001fe000010fd000008f6000380000008160020f8000340000042fe00011002fe000002fb0000080a0020f3000080f40000080a0020f6000010f1000008080020e70002020008190020fe00014002fe0002100008f900044004000010fd000008140620200000100020fc000081f5000004fe00
+00080c0020fb000080ee0002040008100020fb000080f200010240fe00010108150020fa00018001fc0002040080fe000008fa0000080a0030f7000002f0000008100020fe000080fe0002010001ef000088100020fb000080f20006024000000401080c0020fa000021ef0002200008160020fe00042000000802fb000010
+fc0000c0fa0000080d0020f7000320000001f3000008150020fc000088fd000080f90002012008fc000102081a0320000020fa00070400400002000004fd0002080008fe0000080d0320800004fe000004ec000008130020fa00018001fc000004fd000020f9000008140020f30002400080fe00010810fe00030800000814
+0620200000100020fc000081f5000004fe0000080e0020f5000080f8000001fc000028120020f8000008fe000004f9000002fc000008100020fb000080f200010240fe000101080d012008ec0002080002fe000008060020e5000008100020f5000080fa0002800001fc000028060020e50000080a0020ee000080f9000008
+0a0020f0000004f7000008120020fe000002f70000c0fa000006fc00000811042000020002f1000080fd000320000008160020fd000004fc000020fc0002020004fa0002200008060020e5000008180020fe0004200000080afb000030fe0002400040fa00000816042000800040fe000040f5000010fe000020fe0000080a
+0020ee000080f90000080f0020f00002080080fc000304000008140020fd00051000000400c0fa000018f800018008120030f7000002fb000080fe000080fb000008060020e50000080f05208010000008f3000080f9000008160020fe000080fe000080f9000020fa000401000200080e0020f5000008f6000004fe000008
+11072000200008000001fb000040f30000080d0320000080f9000020f10000081c042000040001fc000320000804fe000004fd000702400001404000080a0020fc000010eb0000080e0020fa000010f7000004f80000080d0020fa000004f0000380004008160020fe00046002000802fb000010fc000044fa000008120020
+fb000010fc000010fc000010f80000080a0020f5000001f2000008150020fd000411c8000020fe000302000020f5000008150020fc000080f90002010040fb00050400008000081e0320000002fe00071000002000081082fe00040210000002fd000280000812052010c0000004f5000340100008fa0000080b012802eb00
+0004fd0000081b042000040001fc000320000804fc000a40020002400000404000080a0020fc000010eb000008090320000008e8000008120020f700042000000108fa000080fc000008060020e5000008100020fa000021f7000008fa0002200008180020fc000080f90005010040000004fe0005040400800008110020f2
+000304000020fc00040420000008140020fe000080fc000002fe000010f50002100008140920020000011140000020fb000020f600010808140020fe000080fc000002fe000010f500021010080a0020f4000010f3000008060020e5000008120020fe000001f5000004fa000001fe0000081002200004fe000080f6000004
+f7000008140020fd0005100000040080fa000008f8000180081b0320000002fb00042000081082fe000006fe000002fd00028000081c0020fc000040fd000610008080100010fe000004fe000010fe0000080b0020f200018080f6000008090320000010e8000008150320000010fe000002f9000048fd000020fa0000080a
+0020f2000040f500000807012004e6000008140020f90002100008f9000040fe000010fd000008060020e5000008140320000020f6000302000004fb00000cfe0000080e022010c0f10002100008fa0000080e0020fd000040ef000008fd0000081402200020fe00040201000010f9000001f800000810012020f800010180
+fb000002f8000008140020fe000080fe0002010001f8000080f9000088060020e5000008090320000080e80000080a0020f2000002f5000008070020e6000120080b042002001001ea00010808120020f9000008fb000020fb000081fc000008190020fe000080fe000080f9000020fd000304000001fe0000081103200000
+80f7000028fe000040f700000811072000200040020001f4000004fa000008100020ef000640020000080002fe0000081a042000800040fe000040fa000004fd000010fe000020fe0000080d0020fa000004f0000380004008120020f8000001fc000001fc000010fb000008130020fb000001fd000010fb000080f9000110
+08110320000004fe00040400001810f00000080d0320800004fe000004ec0000080e02200010f00002010004fb000008140022f900000cfd000002f9000640040010000008060020e5000008060020e5000008160020fc000020fe000040fb00040810000040fa000008130020fc000002fe000080fa000010f80001080811
+072000200040020001f4000004fa000008160020fe0002200008fc000008fb0002400202fa0000080c0020ed00010208fc000140081402200004fe000080f4000004fd000004fe0000080f0020fe00014002f2000004fa000008180020fd00014002fe000080fa000010fc000008fe00010808130020fc000001f60004a000
+000208fc000142080c0020eb0002080002fe000008060020e50000080c0020f4000040f50002040008180020fe0002020002fe000010fd000040fa000004fc0000081002200020fd000001f6000001f8000008120021fd000040fc000080f5000008fd000008190020fd000610000004008080fb00040800012008fc000182
+081605200000010004fb000020fd000340008018f9000008150020fc000080f90002010040fb0005040000800008100022f9000008f40006500400100200080a0030f7000001f0000008110020f3000040fb000010fe000308000008130022f9000308000020f7000640040010000008060020e50000080f0020fc00010402
+f000044000080008160020f5000010fe000020fe00080100200002010000080a0020f1000020f6000008110320000804fe00040400001810f0000008060020e50000081a0020fe000080fe000080fe000020fd000020fa000001fe000008130320000080f90002200028fe000040f70000080f0028f90002220008f3000304
+021008110320000080f7000028fe000040f7000008140020fd00018001f60000a0fe000040fc000102080c0020fa000021ef0002200008110020fd000302000010fd000001f2000008140620000080000088fa000028fe000040f70000080a0020e9000004fe00000812012020f7000080fb000302000040fb0000080b0520
+8010000008ea000008120020fd000411c8000020fb000020f5000008120020fc000010fb000080f8000001fc000028140020fc0005020002008004fb000010f800010848160020fe00044002000010fd000001fa000004fa0000080e0020fd000004f4000004f8000008120020fd0002040001f700014080f9000110081100
+20fc000304000040f9000001f70000081605200000010004fb000030fd000340000010f9000008120020fe000002f7000040fa000004fc000008140020fd000040f7000304002020fb0003200000081302200004fe00018080f200010240fe00010108150020fd000001fe000010fd000008f60003800000080a0020f20000
+80f50000081102200044fe00018220f5000040f90000080c02200040ed000004fc0000080d0020f0000008fa0003040000080e0020fe000008f8000040f3000008160320810004fe000004fe00010202f8000004fb0000080a0020f6000040f100000813012001f800018002fa0002012008fc000102080c0020ed00010208
+fc000140080e0020fc000088f0000040fd0000080a0020e9000008fe0000080e0020fa000008f1000008fe0000081605200000010004fb000020fd000340400010f90000080a0020e9000004fe000008160020fe000080fe000080f90000a0fa000001fe000008090320040010e8000008060020e50000080a0020fa000008
+ed0000081605200000010004fb000020fd000340400010f90000081c0320000002fb00042800081086fe000002fe00010202fe00028000080a0020f3000080f4000008120020fc000040f7000010fe000004fa0000080e0020fd000004f4000004f80000081b0020fe000620000008020008fe00010410fc0002400002fc00
+00080a0020f2000008f5000008060020e5000008140020fb000008f9000304020020fb0003300000081408202000001000200021fe000081f30002200008190020fe00042000080802fe000308000010fc000042fa0000080f0020f90002220008f30003040000081002200020fd000001f6000001f80000080f012001fb00
+0008fe000002f1000008190020fc000002fe000080fa000010fe000080fe0003080008081b0020fd0008100000040080000080fd000008fd000001fd0001802806003fe5ff00f80000ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb80\sa400\sl240\tx1140 \f21\fs20 Figure 14.2\tab A dot-matrix for two related protein sequences using the "Identities algorithm" and a score of 2. Notice that the similarity is not apparent. \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.2\tab Producing a dot matrix plot using the proportional algorithm\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This method gives the most thorough analysis.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Apply proportional algorithm".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Odd window length". The size of window over which the scores for each point are summed.\par 
+3.\tab Define "Proportional score". All points achieving at least this score will be marked with a dot in the diagram.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 14.3.\par 
+\pard\plain \qj\li1700\sb300\sl480\keepn \f4\fs16 {{\pict\macpict\picw283\pich301 
+08a200000000012d011b001102ff0c00fffe0000003c32b0003c32b00000000000fc00ed000000000001000a0000000000fc00ed0098801e0000000000fc00ed0000000000000000003c32b0003c32b000000001000100010000000000000000000000000048ae57000000010000ffffffffffff0001000000000000000000
+0000fc00ed0000000000fc00ed000002e30006007fe5ff00f0060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100b0040f200010180f60000100a0040f2000003f50000100a0040f2000006f50000100a0040f2000004f50000100d0340000020f5000008f5000010090340000020e800
+0010060040e5000010090340000080e80000100802400001e70000100802400003e7000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000
+10060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f2000060f50000100a0040f2000040f5000010060040e50000100a0040ea000040fd0000100c0040ec0002040080fd000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040
+e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040eb000080fc0000100a0040ec000001fb0000100a0040ec000002fb000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000
+10060040e50000100a0040f9000010ee0000100a0040f9000030ee0000100a0040f9000060ee0000100a0040f90000c0ee0000100e0040f9000080fc000020f40000100a0040eb000040fc000010060040e5000010060040e5000010060040e5000010060040e50000100a0040ee000004f9000010060040e5000010060040
+e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f0000002f7000010060040e5000010
+060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010070040e600018010060040e5000010060040e50000100b0040fd000101
+80eb000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f4000002f30000100a0040f4000006f30000100a0040f400000cf30000100a0040f4000008f30000100a0040f4000010f30000100a0040f4
+000030f30000100a0040f4000060f30000100a0040f4000040f3000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010090040e8000301000010060040e5
+000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f7000004f00000100a0040f7000004f0000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5
+000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f9000008ee000010060040e5000010060040e50000100a0040f9000020ee0000100a0040f9
+000040ee0000100a0040f9000080ee0000100a0040fa000001ed0000100a0040fa000002ed0000100a0040fa000004ed0000100a0040fa000004ed0000100a0040fa000008ed000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500
+0010060040e5000010060040e5000010060040e5000010060040e50000100a0040fb000040ec0000100a0040fb000080ec0000100a0040fc000001eb0000100a0040fc000002eb0000100a0040fc000006eb0000100a0040fc000004eb0000100a0040fc000008eb0000100a0040fc000010eb0000100a0040fc000020eb00
+00100a0040fc000060eb0000100a0040fc000080eb0000100b0040fd00010180eb0000100a0040fd000001ea0000100a0040fd000002ea0000100a0040fd000004ea0000100a0040fd000008ea0000100a0040fd000008ea0000100a0040fd000010ea0000100a0040fd000020ea0000100a0040fd000040ea000010060040
+e50000100a0040fe000001e9000010060040e50000100a0040fe000002e90000100a0040fe000004e90000100a0040fe000008e9000010060040e50000100e0040fe000010f0000040fb000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500001006
+0040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100d0040fd000303000020ed000010060040e5000010060040e5000010060040e50000100a0040fc00000ceb0000100a0040fc00
+0008eb000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500001006007fe5ff00f00000ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa240\sl240\tx1140 \f21\fs20 Figure 14.3\tab 
+A dot-matrix for the two related protein sequences shown in figure 14.2, but here using the "Proportional algorithm" with a window of 21 and a score of 240. Notice that the similarity is now apparent. \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.3\tab Producing a dot matrix plot using the quick scan algorithm\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This method is very fast. Using the current score matrix it accumulates the scores for all the exact matches that lie on each diagonal. The mean diagonal score and its standard deviation is calculated, and those diagonals that have scores more than a chose
+n number of standard deviations above the mean are rescanned using the proportional algorithm and the points above the proportional algorithms cutoff are plotted.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Apply quick scan algorithm".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Identity score". The minimum number of consecutive identical sequence symbols that count as a match.\par 
+3.\tab Define "Odd window length". The size of window over which the scores for each point are summed when the proportional algorithm is applied to the best diagonals.\par 
+4.\tab Define "Proportional score". For the best diagonals all points achieving at least this score will be marked with a dot in the diagram.\par 
+5.\tab Define "Number of s.d. above mean". Diagonals with scores above the minimum number of standard deviations are rescanned using the proportional algorithm.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The plot will appear as in figure 14.4.\par 
+\pard\plain \qj\li1720\sb300\sl480\keepn \f4\fs16 {{\pict\macpict\picw283\pich301 
+07fa00000000012d011b001102ff0c00fffe0000003c32b0003c32b00000000000fc00ed000000000001000a0000000000fc00ed0098801e0000000000fc00ed0000000000000000003c32b0003c32b0000000010001000100000000000000000000000000491cbd000000010000ffffffffffff0001000000000000000000
+0000fc00ed0000000000fc00ed000002e30006007fe5ff00f0060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5
+000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500
+0010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000
+10060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010
+060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500001006
+0040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100600
+40e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040
+e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5
+000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e500
+0010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000100a0040f9000008ee000010060040e5000010060040e50000100a0040f9000020ee0000100a0040f9000040ee0000100a0040f9000080ee0000100a0040fa
+000001ed0000100a0040fa000002ed0000100a0040fa000004ed0000100a0040fa000004ed0000100a0040fa000008ed000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000
+10060040e50000100a0040fb000040ec0000100a0040fb000080ec0000100a0040fc000001eb0000100a0040fc000002eb0000100a0040fc000006eb0000100a0040fc000004eb0000100a0040fc000008eb0000100a0040fc000010eb0000100a0040fc000020eb0000100a0040fc000060eb0000100a0040fc000080eb00
+00100b0040fd00010180eb0000100a0040fd000001ea0000100a0040fd000002ea0000100a0040fd000004ea0000100a0040fd000008ea0000100a0040fd000008ea0000100a0040fd000010ea0000100a0040fd000020ea0000100a0040fd000040ea000010060040e50000100a0040fe000001e9000010060040e5000010
+0a0040fe000002e90000100a0040fe000004e90000100a0040fe000008e9000010060040e50000100a0040fe000010e9000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e50000
+10060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010060040e5000010
+06007fe5ff00f00000ff}}\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa240\sl240\tx1140 \f21\fs20 Figure 14.4\tab 
+A dot-matrix for the two related protein sequences shown in figures 14.2 and 14.3, but here using the "Quick scan algorithm" with an identity score of 1 and a window of 21 and a score of 240 for the proportional algorithm. Notice that the simil
+arity is now apparent but the absence of background "noise" is misleading.\par 
+\pard\plain \s6\fi-540\li560\sb240\sa60\sl280\tx860 \b\f20 2.4\tab Producing a list of all matching segments using the proportional algorithm\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "List matching segments".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Define "Odd window length". The size of window over which the scores for each point are summed.\par 
+3.\tab Define "Proportional score". All segments achieving at least this score will be listed out with the two sequences written one above the other. See figure 14.5.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.5\tab Calculating the expected scores for the proportional algorithm\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 This function calculates the probability of achieving each possible score using the proportional algorithm. Hence it provides a method of setting
+ cutoff scores and assessing the statistical significance of the scores found. The algorithm calculates the "Double matching probability" described by McLachlan (2) which is defined as the probability of finding the scores in two infinitely long sequences 
+of the same composition as the pair being compared. It is very much faster than the alternative of repeatedly  scrambling and recomparing the sequences. The program offers three ways for the user to see the results of the calculation\:
+ the user can type a \par 
+\pard\plain \li2320\ri2720\sl220\box\brsp100\brdrth \f4\fs16 List matching segments\par 
+\pard \li2320\ri2720\sl220\box\brsp100\brdrth ? Odd window length (1-401) (11) =\par 
+? Proportional score (1-567) (252) =\par 
+Working\par 
+     62\par 
+GLRRGLDVKDLEHPIEVPVGK\par 
+DLAEGMKVKCTGRILEVPVGR\par 
+     81\par 
+     63\par 
+LRRGLDVKDLEHPIEVPVGKA\par 
+LAEGMKVKCTGRILEVPVGRG\par 
+     82\par 
+     65\par 
+RGLDVKDLEHPIEVPVGKATL\par 
+EGMKVKCTGRILEVPVGRGLL\par 
+     84\par 
+     66\par 
+GLDVKDLEHPIEVPVGKATLG\par 
+GMKVKCTGRILEVPVGRGLLG\par 
+     85\par 
+     67\par 
+LDVKDLEHPIEVPVGKATLGR\par 
+MKVKCTGRILEVPVGRGLLGR\par 
+\pard \li2320\ri2720\sl220\keepn\box\brsp100\brdrth      86\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb60\sa400\sl240\tx1140 \f21\fs20 Figure 14.5\tab A typical run of "List matching segments.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 score and the program will display its probability; the user can type a probability and the program will display the corresponding score, alternatively the program will list the full range of scores and probabilities.
+\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Calculate expected scores".\par 
+2.\tab Define "Odd window length".\par 
+\tab The calculation takes a noticeable time.\par 
+3.\tab Select "List scores and probabilities".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Number of steps between scores". This allows, say, every fifth score to be listed if the user defines the number of steps to be 5. The list will appear as in figure 14.6.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.6\tab Calculating the observed scores for the proportional algorithm\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+This function applies the proportional algorithm, but instead of producing a dot matrix it accumulates the scores and their frequencies of occurrence. It provides a method of setting cutoff scores and assessing the statistical significance of the scores fo
+und. The program offers three ways for the user to see the results of the calculation\: the user can type a score and the program will display its frequency; the user can type a frequency and the progra
+m will display the corresponding score, alternatively the program will list the full range of scores and frequencies. The frequencies are expressed as percentages.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Calculate observed scores".\par 
+2.\tab Define "Odd window length".\par 
+\tab The calculation takes a noticeable time.\par 
+\pard\plain \li1320\ri1300\sl220\box\brsp100\brdrth \f4\fs16 Calculate expected proportional scores\par 
+\pard \li1320\ri1300\sl220\box\brsp100\brdrth ? Odd window length (1-401) (21) =\par 
+Working\par 
+Average score=   196.99062\par 
+Select probability display mode\par 
+   1 Show probability for a score\par 
+X  2 Show score for a probability\par 
+   3 List scores and probabilities\par 
+? Selection  (1-3) (2) =3\par 
+? Number of steps between scores (1-10) (5) =\par 
+\par 
+     5  0.10000E+01    200  0.40004E+00    395  0.00000E+00\par 
+    10  0.10000E+01    205  0.24037E+00    400  0.00000E+00\par 
+    15  0.10000E+01    210  0.12555E+00    405  0.00000E+00\par 
+    20  0.10000E+01    215  0.56905E-01    410  0.00000E+00\par 
+    25  0.10000E+01    220  0.22402E-01    415  0.00000E+00\par 
+    30  0.10000E+01    225  0.76821E-02    420  0.00000E+00\par 
+    35  0.10000E+01    230  0.23031E-02    425  0.00000E+00\par 
+    40  0.10000E+01    235  0.60614E-03    430  0.00000E+00\par 
+    45  0.10000E+01    240  0.14064E-03    435  0.00000E+00\par 
+    50  0.10000E+01    245  0.28888E-04    440  0.00000E+00\par 
+    55  0.10000E+01    250  0.52741E-05    445  0.00000E+00\par 
+    60  0.10000E+01    255  0.85917E-06    450  0.00000E+00\par 
+    65  0.10000E+01    260  0.12534E-06    455  0.00000E+00\par 
+    70  0.10000E+01    265  0.16433E-07    460  0.00000E+00\par 
+    75  0.10000E+01    270  0.19425E-08    465  0.00000E+00\par 
+    80  0.10000E+01    275  0.20772E-09    470  0.00000E+00\par 
+    85  0.10000E+01    280  0.20155E-10    475  0.00000E+00\par 
+    90  0.10000E+01    285  0.17801E-11    480  0.00000E+00\par 
+    95  0.10000E+01    290  0.14353E-12    485  0.00000E+00\par 
+   100  0.10000E+01    295  0.10599E-13    490  0.00000E+00\par 
+   105  0.10000E+01    300  0.71886E-15    495  0.00000E+00\par 
+   110  0.10000E+01    305  0.44920E-16    500  0.00000E+00\par 
+   115  0.10000E+01    310  0.25938E-17    505  0.00000E+00\par 
+\pard \li1320\ri1300\sl220\keepn\box\brsp100\brdrth    120  0.10000E+01    315  0.13881E-18    510  0.00000E+00\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa500\sl240\tx1140 \f21\fs20 Figure 14.6\tab A typical run of "Calculate expected proportional scores." The scores are listed in three columns alongside their probabilities. e.g. score 250 has a probability 0.527x10
+{\up6 -5}{\plain \b\f20 .}{\up6 \par 
+}\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 3.\tab Select "List scores and percentages".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Number of steps between scores". This allows, say, every fifth score to be listed if the user defines the number of steps to be 5. The list will appear as in figure 14.7.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.7\tab Producing an optimal alignment\par 
+\pard\plain \s7\qj\sa120\sl280\tx0 \f20 This function produces an optimal alignment for any segments of the two sequences 
+using the algorithm of Myers and Miller (5). It guarantees to produce alignments with the optimum score, given a score matrix, a "gap start penalty" and a "gap extension penalty". That is starting a gap costs a fixed penalty F and each residue added to the
+ gap costs a further penalty E, so for \par 
+\pard\plain \li1980\ri2060\sb400\sl220\box\brsp100\brdrth \f4\fs16 Calculate observed proportional scores\par 
+\pard \li1980\ri2060\sl220\box\brsp100\brdrth ? Odd window length (1-401) (21) =\par 
+Working\par 
+Maximum observed score is    285\par 
+Select score display mode\par 
+X  1 Show percentage reaching a score\par 
+   2 Show score for a percentage\par 
+   3 List scores and percentages\par 
+? Selection  (1-3) (1) =3\par 
+ ? Number of steps between scores (1-10) (5) =\par 
+  156   236949  0.99998E+02\par 
+  161   236938  0.99993E+02\par 
+  166   236792  0.99932E+02\par 
+  171   235882  0.99548E+02\par 
+  176   232582  0.98155E+02\par 
+  181   222875  0.94058E+02\par 
+  186   203232  0.85769E+02\par 
+  191   171507  0.72380E+02\par 
+  196   131216  0.55376E+02\par 
+  201    89194  0.37642E+02\par 
+  206    52791  0.22279E+02\par 
+  211    27315  0.11528E+02\par 
+  216    12117  0.51137E+01\par 
+  221     4890  0.20637E+01\par 
+  226     1774  0.74867E+00\par 
+  231      656  0.27685E+00\par 
+  236      263  0.11099E+00\par 
+  241      111  0.46845E-01\par 
+  246       66  0.27854E-01\par 
+  251       36  0.15193E-01\par 
+  256       23  0.97065E-02\par 
+  261       16  0.67524E-02\par 
+  266       15  0.63303E-02\par 
+  271       10  0.42202E-02\par 
+  276        6  0.25321E-02\par 
+\pard \li1980\ri2060\sl220\box\brsp100\brdrth   281        2  0.84405E-03\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa400\sl240\tx1140 \f21\fs20 Figure 14.7\tab 
+A typical run of "Calculate observed scores."  The scores are followed by their observed number of occurrences expressed both absolutely and as a percentage of the total number of points.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+gap of length K residues the penalty is F + KE. Gaps at the ends of sequences incur no penalty. The size of the segments of sequence that can be aligned at once is limited to 5000 characters. The user can select the start and end of the segments by use of 
+the crosshair simply by clicking on any dot matrix plot. After the alignment has been produce the user can elect to have it replace the original sequence segments. By alternate use of dot matrix plotting and alignment, very long sequences can be aligned.
+\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select "Align sequences". The crosshair will appear in the graphics window. \par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 2.\tab Position the crosshair on the bottom left of the segment to be aligned and hit the space bar on the keyboard. The bell will ring.\par 
+3.\tab Position the crosshair on the top right of the segment to be aligned and hit the space bar on the keyboard. The bell will ring.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab Define "Penalty for starting each gap".\par 
+5.\tab Define "penalty for each residue in gap".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 \tab A noticeable time will elapse before the alignment is displayed on the screen. A typical alignment is shown in figure 14.8.\par 
+6.\tab Reject "Keep alignment". If the alignment is "kept" the padded sequences from the alignment will replace the original sequences in the active region.\par 
+\pard\plain \li480\ri540\sl220\box\brdrth \f4\fs16 Align the sequences\par 
+\pard \li480\ri540\sl220\box\brdrth Aligning region  1 to  461\par 
+   with region  1 to  514\tab \tab Working\par 
+   V     1         11         21         31         41         51\par 
+         MA--TGKIVQ VIGA------ VVDVEFPQDA VPRVYDALEV QNG------N ERLVL-----\par 
+         *      *    *         **            * *       *        *   *\par 
+         MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY\par 
+   H     1         11         21         31         41         51\par 
+   V    61         71         81         91        101        111\par 
+         EVQQQLGGGI VRTIAMGSSD GLRRGLDVKD LEHPIEVPVG KATLGRIMNV LGEPVDMKGE\par 
+              *     *    **     *  *  **       *****    ***  *  ** * * **\par 
+         AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP\par 
+   H    61         71         81         91        101        111\par 
+   V   121        131        141        151        161        171\par 
+         IGEEERWAIH RAAPSYEELS NSQELLETGI KVIDLMCPFA KGGKVGLFGG AGVGKTVNMM\par 
+                *     **   *          **  *  * * *    *      *     ***\par 
+         LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI\par 
+   H   121        131        141        151        161        171\par 
+   V   181        191        201        211        221        231\par 
+         ELIRNIAIEH SGYS-VFAGV GERTREGNDF YHEMTDSNVI DKVSLVYGQM NEPPGNRLRV\par 
+           *  *     **         *                          *      *\par 
+         DAI--INQRD SGIKCIYVAI GQKASTISNV VRKLEEHGAL ANTIVVVATA SESAALQYLA\par 
+   H   181        191        201        211        221        231\par 
+   V   241        251        261        271        281        291\par 
+         ALTGLTMAEK FRDEGRDVLL FVDNIYRYTL AGTEVSALLG RMPSAVGYQP TLAEEMGVLQ\par 
+               * *  *** * * *    *        *    * **  * *                *\par 
+         RMPVALMGEY FRDRGEDALI IYDDLSKQAV AYRQISLLLR RPPGREAFPG DVFYLHSRLL\par 
+   H   241        251        261        271        281        291\par 
+   V   301        311        321        331        341        351\par 
+         ERITST---- ---------- -KTGSITSVQ AVYVPADDLT DPSPATTFAH LDATVVLSRQ\par 
+         **                     **** *         * *      *        *    *\par 
+         ERAARVNAEY VEAFTKGEVK GKTGSLTALP IIETQAGDVS AFVPTNVISI TDGQIFLETN\par 
+   H   301        311        321        331        341        351\par 
+   V   361        371        381        391        401        411\par 
+         IASLGIYPAV DPLDSTSRQL DPLVVGQEHY DTAR----GV QSILQRYQEL KDIIAILGMD\par 
+             ** ***  *  * **      * *             *     *  * **\par 
+         LFNAGIRPAV NPGISVSR-- ---VGGAAQT KIMKKLSGGI RTALAQYREL AAFSQFAS--\par 
+   H   361        371        381        391        401        411\par 
+   V   421        431        441        451        461        471\par 
+         ELSEEDKLVV ARARKIQRFL SQ----PFFV AE----VFTG SPGKYVSLKD --TIRGFKGI\par 
+          *             *    *  *    *  * *      *     * *         *  *\par 
+         DLDDATRKQL DHGQKVTELL KQKQYAPMSV AQQSLVLFAA ERG-YLADVE LSKIGSFEAA\par 
+   H   421        431        441        451        461        471\par 
+   V   481        491        501        511        521\par 
+         MEG--EYDHL P-EQAFYMVG SIEEAVE--- --------KA KKL*\par 
+                **  *  *     *       *                  *\par 
+         LLAYVDRDHA PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW*\par 
+   H   481        491        501        511        521\par 
+Conservation  22.5%\par 
+\pard \li480\ri540\sl220\keepn\box\brdrth Number of padding characters inserted    63 and    10\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb60\sa300\sl240\tx1140 \f21\fs20 Figure 14.8\tab A typical output from "Align the sequences". The horizontal and vertical sequences are labelled H and V.\par 
+\pard\plain \s6\sb240\sa60\sl280\tx560\tx860 \b\f20 2.8\tab Comparing a sequence against a library of sequences\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 
+The program SIPL is used for comparing a probe sequence against a whole library of sequences. The searches are very fast and use the "Quick scan" algorithm described above to produce a list of matching sequences sorted in score order, and optionally, this 
+is followed by the production of optimal alignments using the Myers and Miller (5) algorithm. The program will search the whole of a library or restrict its search using a list of entry names. The list of
+ entry names can be used either as a list of sequences to search or conversely as a list of sequences to exclude from a search.\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab Select SIPL.\par 
+2.\tab Select "Personal file".\par 
+3.\tab Select "Format".\par 
+4.\tab Define "Name of sequence file". The name of the file containing the probe sequence.\par 
+5.\tab Define "Name of results file".\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 6.\tab Accept "Display alignments". The alternative will stop after producing a list of the best matching sequences.\par 
+7.\tab Define "Minimum library sequence length". This permits the search to skip sequences that are too short to be of interest.\par 
+8.\tab Define "Maximum number of scores to list". The maximum number of sequences that will be included in the results file.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 9.\tab 
+Define "Identity score". This is the minimum number of consecutive sequence characters that will be counted as a match. Only matches of at least this length will be included in the overall score. For proteins maximum sensitivity is gained using a value 
+of 1, but for nucleic acids values of 4 or 6 are necessary to achieve reasonable speed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 10.\tab Define "Number of sd above mean". This means the number of standard deviations above the mean that a diagonal must score in order for it to be scanned using the proportional algorithm.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 11.\tab Define "Odd window length". This is the window size for the rescanning of high scoring diagonals using the proportional algorithm.\par 
+12.\tab Define "Proportional score". The score used by the proportional algorithm. It depends on the window length and the score matrix.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 13.\tab Define "Minimum global score". This is the total score achieved using the proportional algorithm when all the diagonals scoring the defined number of standard deviations above the mean, are rescanned.
+\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 14.\tab Define "Penalty for starting a gap". This is for the alignment algorithm.\par 
+15.\tab Define "Penalty for each residue in gap". See above.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 16.\tab Select a library to search. The default library will reflect the composition of the probe sequence. That is, a probe sequence that is less than 85% acgt will be guessed to be a protein.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 17.\tab Select "Search whole library". The alternatives allow the search to be restricted using a list of entry names.\par 
+\pard\plain \s4\qj\sa120\sl280 \f20 The search will start. A large number of parameters are required but for normal use the default value can be taken for them all. A worked example is shown in figure 14.9.\par 
+\pard\plain \li220\ri240\sl220\brdrt\brsp100\brdrth \brdrl\brsp100\brdrth \brdrr\brsp100\brdrth \f4\fs16  SIPL (Similarity investigation program (Library)) V3.0 June 1991\par 
+\pard \li220\ri240\sl220\brdrl\brsp100\brdrth \brdrr\brsp100\brdrth  Author\: Rodger Staden\par 
+ Compares a probe protein or nucleic acid\par 
+ sequence against a library of sequences\par 
+\par 
+Select probe sequence\par 
+ Select sequence source\par 
+ X  1 Personal file   \par 
+    2 Sequence library\par 
+ ? Selection  (1-2) (1) =2\par 
+ Select a library\par 
+    1 EMBL nucleotide library                           \par 
+ X  2 SWISSPROT protein library\par 
+    3 PIR protein library                         \par 
+ ? Selection  (1-3) (2) =\par 
+Library is in EMBL format with indexes\par 
+ Select a task\par 
+ X  1 Get a sequence                                    \par 
+    2 Get annotations                                   \par 
+    3 Get entry names from accession numbers            \par 
+    4 Search titles for keywords                        \par 
+    5 Search keyword index for keywords                 \par 
+ ? Selection  (1-5) (1) =\par 
+ ? Entry name=bacr$halha\par 
+DE   BACTERIORHODOPSIN PRECURSOR (BR) (GENE NAME\: BOP).             \par 
+ Sequence length=   262\par 
+ Sequence composition\par 
+ A   C     S     T     P     A     G     N     D     E     Q     B     Z     H\par 
+ N   0.   14.   19.   12.   30.   26.    3.   10.   11.    4.    0.    0.    0.\par 
+ %   0.0   5.3   7.3   4.6  11.5   9.9   1.1   3.8   4.2   1.5   0.0   0.0  0.0\par 
+ W    0. 1219. 1921. 1165. 2132. 1483.  342. 1151. 1420.  513.    0.    0.   0.\par 
+\par 
+A   R     K     M     I     L     V     F     Y     W     -     X     ?      \par 
+N   7.    7.   10.   15.   39.   23.   13.   11.    8.    0.    0.    0.    0.\par 
+%   2.7   2.7   3.8   5.7  14.9   8.8   5.0   4.2   3.1   0.0   0.0   0.0   0.0\par 
+W 1093.  897. 1312. 1697. 4413. 2280. 1913. 1795. 1490.    0.    0.    0.    0.\par 
+Total molecular weight=   28256.254\par 
+? Results file=sipl.res\par 
+? Display alignments (y/n) (y) =\par 
+? Minimum library sequence length (10-20000) (209) =\par 
+? Maximum number of scores to list (1-10000) (20) =10\par 
+? Identity score (1-3) (1) =\par 
+? Number of sd above mean (0.00-10.00) (3.00) =\par 
+? Odd window length (1-31) (11) =\par 
+? Proportional score (1-297) (132) =\par 
+? Minimum global score (1-69168) (1729) =\par 
+? Penalty for starting a gap (1-100) (10) =\par 
+? Penalty for each residue in gap (1-100) (10) =\par 
+Select a library\par 
+   1 EMBL nucleotide library                           \par 
+X  2 SWISSPROT protein library\par 
+   3 PIR protein library                       \par 
+   4 Personal file in PIR format                       \par 
+? Selection  (1-4) (2) =\par 
+Library is in EMBL format with indexes\par 
+Select a task\par 
+X  1 Search whole library                              \par 
+   2 Search only a list of entries                     \par 
+   3 Search all but a list of entries                  \par 
+? Selection  (1-3) (1) =3\par 
+? File of entry names=skip.nam\par 
+  21794 entries processed,    25 above cutoff, sorting now\par 
+Entries exceeding sd cutoff=  4439\par 
+Mean number of diagonals above span cutoff    1.32012\par 
+List in score order\par 
+  31007 BACA$HALSA DE   ARCHAERHODOPSIN PRECURSOR (AR).                        \par 
+  12177 BACH$NATPH DE   HALORHODOPSIN PRECURSOR (HR) (GENE NAME\: HOP).         \par 
+  10999 BACH$HALSP DE   HALORHODOPSIN PRECURSOR (HR) (GENE NAME\: HOP).         \par 
+   3999 HYAC$ECOLI DE   HYPOTHETICAL 27.6 KD PROTEIN IN HYAB 3'REGION (GENE NAM\par 
+   2670 OPS4$DROME DE   OPSIN RH4 (INNER R7 PHOTORECEPTOR CELLS OPSIN) (GENE NA\par 
+   2573 PYR1$MESAU DE   CAD PROTEIN (CONTAINS\: GLUTAMINE-DEPENDENT CARBAMOYL-PH\par 
+   2328 PFLA$ECOLI DE   PYRUVATE FORMATE-LYASE ACTIVATING ENZYME.   \par 
+   2194 DCOP$CANAL DE   OROTIDINE 5'-PHOSPHATE DECARBOXYLASE (EC 4.1.1.23) (OMP\par 
+   2145 BCM1$HUMAN DE   LYMPHOCYTE ACTIVATION MARKER BLAST-1 PRECURSOR (BCM1 SU\par 
+   2103 LAG3$HUMAN DE   LAG-3 PROTEIN PRECURSOR (FDC PROTEIN) (GENE NAME\: LAG3 \par 
+ BACA$HALSA DE   ARCHAERHODOPSIN PRECURSOR (AR).                        \par 
+    V     1         11         21         31         41         51\par 
+          MLELLPTAVE GVSQAQITGR PEWIWLALGT ALMGLGTLYF LVKGMGVSDP DAKKFYAITT\par 
+          *                  ** **  **  **  **  ** **  *** ** *   *   * ** \par 
+          M-DPIALTAA VGADLLGDGR PETLWLGIGT LLMLIGTFYF IVKGWGVTDK EAREYYSITI\par 
+    H     1         11         21         31         41         51\par 
+    V    61         71         81         91        101        111\par 
+          LVPAIAFTMY LSMLLGYGLT MVPFGGEQNP IYWARYADWL FTTPLLLLDL ALLVDADQGT\par 
+          *** **   * ***  * ***  *  * *    ** ******* ********** ***   *   \par 
+          LVPGIASAAY LSMFFGIGLT EVQVGSEMLD IYYARYADWL FTTPLLLLDL ALLAKVDRVS\par 
+    H    61         71         81         91        101        111\par 
+    V   121        131        141        151        161        171\par 
+          ILALVGADGI MIGTGLVGAL TKVYSYRFVW WAISTAAMLY ILYVLFFGFT SKAESMRPEV\par 
+          *  *** *   ** *******       *  * *  **  *    ** *        *    ***\par 
+          IGTLVGVDAL MIVTGLVGAL SHTPLARYTW WLFSTICMIV VLYFLATSLR AAAKERGPEV\par 
+    H   121        131        141        151        161        171\par 
+    V   181        191        201        211        221        231\par 
+          ASTFKVLRNV TVVLWSAYPV VWLIGSEGAG IVPLNIETLL FMVLDVSAKV GFGLILLRSR\par 
+          ****  *      *** ***   * ** ****  * * ***** ****** *** *** ******\par 
+          ASTFNTLTAL VLVLWTAYPI LWIIGTEGAG VVGLGIETLL FMVLDVTAKV GFGFILLRSR\par 
+    H   181        191        201        211        221        231\par 
+    V   241        251        261\par 
+          AIFGEAEAPE PSAGDGAAAT SD\par 
+          ** *  **** ****   *    *\par 
+          AILGDTEAPE PSAG-AEASA AD\par 
+    H   241        251        261\par 
+Conservation  56.1%\par 
+\pard \li220\ri240\sl220\keepn\brdrl\brsp100\brdrth \brdrb\brsp100\brdrth \brdrr\brsp100\brdrth Number of padding characters inserted     0 and     2\par 
+\pard\plain \s8\qj\fi-1140\li1140\sb120\sa120\sl240\tx1140 \f21\fs20 Figure 14.9\tab A run of SIPL using an entry from a sequence library and a file of entries to be excluded from the search.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 3.\tab Notes\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.\tab 
+The variants on the proportional algorithm are selected by setting parameters using a special menu. This includes the facility to switch off the main diagonal for all options, which is useful when comparing a sequence against itself.\par 
+2.\tab For nucleotide sequences the program also has a function to complement a sequence. If the sequence on one axis is the complement of that on the other, the plots will show possible base pairing.\par 
+3.\tab When the cross hair is being employed, in addition to the standard special keys, the letter m will produce a display showing  all the identical sequence characters around the cross hair position. The display is in the form of a matrix.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.\tab 
+Users should not be misled by the "Quick scan" algorithm. Its function is to perform rapid comparisons. The plots it produces may look quite striking because they will contain almost no background, however such plots tell nothing about the significance 
+of the similarities displayed.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 5.\tab By using the "Reposition plots" function users can display several dot matrix plots on the screen at the same time. In this way plots from several pairs of sequence comparisons can be viewed together.
+\par 
+6.\tab The library search program SIPL is of limited use for searching the nucleic acid libraries because it does not deal properly with sequences longer than 20,000 characters, but simply truncates them.\par 
+\pard\plain \s5\sb400\sa60\sl320\tx560 \b\f20\fs28 4.\tab References\par 
+\pard\plain \s7\qj\fi-560\li560\sa120\sl280\tx560 \f20 1.  Staden, R. 1982. An interactive graphics program for comparing and aligning nucleic acid and amino acid sequences. {\i Nucl. Acids  Res}. {\b 10(9)}\:2951-2961.\par 
+2.  McLachlan, A.D.  1971.  Test for comparing related amino acid sequences.  {\i J. Mol. Biol.} {\b 61}\:409-424.\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 3.  Schwartz, R.M. and Dayhoff, M.O.  1978.  Matrices for detecting distant relationships.  (in) {\i Atlas of Protein Sequence and Structure,} {\b 5 suppl. 3}\:353-358, Nat. Biomed. Res. Found., Washington D.C.
+\par 
+\pard \s7\qj\fi-560\li560\sa120\sl280\tx560 4.  Lipman, D.J. and Pearson, W.R. 1985. Rapid and sensitive protein similarity searches. {\i Science} {\b 227}\:1435-1441.\par 
+5.\tab Myers, E.W. and Miller, W. 1988. Optimal alignments in linear space. {\i Comput. Applic. Biosci}., {\b 4}, 11-17.\par 
+}
diff --git a/doc/ted.PS b/doc/ted.PS
new file mode 100644
index 0000000..88ef25a
--- /dev/null
+++ b/doc/ted.PS
@@ -0,0 +1,3033 @@
+%!              for use by dvi2ps Version 2.00
+% $Header: tex.ps,v 2.0 88/06/07 15:12:32 peterd Rel2 $
+% a start (Ha!) at a TeX mode for PostScript.
+% The following defines procedures assumed and used by program "dvi2ps"
+% and must be downloaded or sent as a header file for all TeX jobs.
+
+% By:  Neal Holtz, Carleton University, Ottawa, Canada
+%      <holtz@cascade.carleton.cdn>
+%      <holtz%cascade.carleton.cdn@ubc.csnet>
+%      June, 1985
+%      Last Modified: Aug 25/85
+% oystr 12-Feb-1986
+%   Changed @dc macro to check for a badly formed bits in character
+%   definitions.  Can get a <> bit map if a character is not actually
+%   in the font file.  This is absolutely guaranteed to drive the
+%   printer nuts - it will appear that you can no longer define a
+%   new font, although the built-ins will still be there.
+% mackay 4-Jan-1988
+%   Changed size of character array to reflect gf usage (256 characters)
+
+% To convert this file into a downloaded file instead of a header
+% file, uncomment all of the lines beginning with %-%
+
+%-%0000000 			% Server loop exit password
+%-%serverdict begin exitserver
+%-%  systemdict /statusdict known
+%-%  {statusdict begin 9 0 3 setsccinteractive /waittimeout 300 def end}
+%-% if
+
+/TeXDict 200 dict def   % define a working dictionary
+TeXDict begin           % start using it.
+
+                        % units are in "dots" (300/inch)
+/Resolution 300 def
+/Inch  {Resolution mul} def  % converts inches to internal units
+
+/Mtrx 6 array def
+
+%%%%%%%%%%%%%%%%%%%%% Page setup (user) options %%%%%%%%%%%%%%%%%%%%%%%%
+
+% dvi2ps will output coordinates in the TeX system ([0,0] 1" down and in
+% from top left, with y +ive downward).  The default PostScript system
+% is [0,0] at bottom left, y +ive up.  The Many Matrix Machinations in
+% the following code are an attempt to reconcile that. The intent is to
+% specify the scaling as 1 and have only translations in the matrix to
+% properly position the text.  Caution: the default device matrices are
+% *not* the same in all PostScript devices; that should not matter in most 
+% of the code below (except for lanscape mode -- in that, rotations of
+% -90 degrees resulted in the the rotation matrix [ e 1 ]
+%                                                 [ 1 e ]
+% where the "e"s were almost exactly but not quite unlike zeros.
+
+/@letter
+  { letter initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    310 -3005 translate      % move origin to top (these are not exactly 1"
+    Mtrx currentmatrix pop   % and -10" because margins aren't set exactly right)
+  } def
+        % note mode is like letter, except it uses less VM
+/@note
+  { note initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    310 -3005 translate                      % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@landscape
+  { letter initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+%    -90 rotate                              % it would be nice to be able to do this
+    Mtrx currentmatrix 0 0.0 put             % but instead we have to do things like this because what
+    Mtrx 1 -1.0 put                          % should be zero terms aren't (and text comes out wobbly)
+    Mtrx 2 1.0 put                           % Fie!  This likely will not work on QMS printers
+    Mtrx 3 0.0 put                           % (nor on others where the device matrix is not like
+    Mtrx  setmatrix                          %  like it is on the LaserWriter).
+    300 310  translate                       % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@legal
+  { legal initmatrix
+    72 Resolution div dup neg scale          % set scaling to 1.
+    295 -3880 translate                      % move origin to top
+    Mtrx currentmatrix pop
+  } def
+
+/@manualfeed
+   { statusdict /manualfeed true put
+     statusdict /manualfeedtimeout 300 put  % 5 minutes
+   } def
+        % n @copies -   set number of copies
+/@copies
+   { /#copies exch def
+   } def
+
+%%%%%%%%%%%%%%%%%%%% Procedure Defintions %%%%%%%%%%%%%%%%%%%%%%%%%%
+
+/@newfont       % id @newfont -         -- initialize a new font dictionary
+  { /newname exch def
+    pop
+    newname 7 dict def          % allocate new font dictionary
+    newname load begin
+        /FontType 3 def
+        /FontMatrix [1 0 0 -1 0 0] def
+        /FontBBox [0 0 1 1] def
+% mackay 4-Jan-1987 changed size of array from 128 to 256 for gf fonts
+        /BitMaps 256 array def
+        /BuildChar {CharBuilder} def
+        /Encoding 256 array def
+        0 1 255 {Encoding exch /.undef put} for
+        end
+    newname newname load definefont pop
+  } def
+
+
+% the following is the only character builder we need.  it looks up the
+% char data in the BitMaps array, and paints the character if possible.
+% char data  -- a bitmap descriptor -- is an array of length 6, of 
+%          which the various slots are:
+
+/ch-image {ch-data 0 get} def   % the hex string image
+/ch-width {ch-data 1 get} def   % the number of pixels across
+/ch-height {ch-data 2 get} def  % the number of pixels tall
+/ch-xoff  {ch-data 3 get} def   % number of pixels below origin
+/ch-yoff  {ch-data 4 get} def   % number of pixels to left of origin
+/ch-tfmw  {ch-data 5 get} def   % spacing to next character
+
+/CharBuilder    % fontdict ch Charbuilder -     -- image one character
+  { /ch-code exch def           % save the char code
+    /font-dict exch def         % and the font dict.
+    /ch-data font-dict /BitMaps get ch-code get def     % get the bitmap descriptor for char
+    ch-data null eq not
+      { ch-tfmw   0   ch-xoff neg   ch-yoff neg   ch-width ch-xoff sub   ch-height ch-yoff sub
+            setcachedevice
+        ch-width ch-height true [1 0  0 1  ch-xoff ch-yoff]
+            {ch-image} imagemask
+      }
+    if
+  } def
+
+
+/@sf            % fontdict @sf -        -- make that the current font
+  { setfont() pop
+  } def
+
+                % in the following, the font-cacheing mechanism requires that
+                % a name unique in the particular font be generated
+
+/@dc            % char-data ch @dc -    -- define a new character bitmap in current font
+  { /ch-code exch def
+% ++oystr 12-Feb-86++
+    dup 0 get
+    length 2 lt
+      { pop [ <00> 1 1 0 0 8.00 ] } % replace <> with null
+    if
+% --oystr 12-Feb-86--
+    /ch-data exch def
+    currentfont /BitMaps get ch-code ch-data put
+    currentfont /Encoding get ch-code 
+       dup (   ) cvs cvn   % generate a unique name simply from the character code
+       put
+  } def
+
+/@bop0           % n @bop0 -              -- begin the char def section of a new page
+  { 
+  } def
+
+/@bop1           % n @bop1 -              -- begin a brand new page
+  { pop
+    erasepage initgraphics 
+    Mtrx setmatrix
+    /SaveImage save def() pop
+  } def
+
+%--  tjh sept. 87: if this page has a mac drawing on it, we have to
+%--  use showpage in the md dictionary.
+/@eop           % - @eop -              -- end a page
+  {
+    userdict /md known {
+	userdict /md get type /dicttype eq {
+	    md /MacDrwgs known {
+		md begin showpage end
+	    }{
+		showpage
+	    } ifelse
+	}{
+	    showpage
+	} ifelse
+    }{
+	showpage
+    } ifelse
+    SaveImage restore() pop
+  } def
+
+/@start         % - @start -            -- start everything
+  { @letter                             % (there is not much to do)
+  } def
+
+/@end           % - @end -              -- done the whole shebang
+  { end
+  } def
+
+/p              % x y p -               -- move to position
+  { moveto
+  } def
+
+/r              % x r -                 -- move right
+  { 0 rmoveto
+  } def
+
+/s              % string s -            -- show the string
+  { show
+  } def
+
+/c              % ch c -                -- show the character (code given)
+  { c-string exch 0 exch put
+    c-string show
+  } def
+
+/c-string ( ) def
+
+/ru             % dx dy ru -   -- set a rule (rectangle)
+  { /dy exch neg def    % because dy is height up from bottom
+    /dx exch def
+    /x currentpoint /y exch def def   % remember current point
+    newpath x y moveto
+    dx 0 rlineto
+    0 dy rlineto
+    dx neg 0 rlineto
+    closepath fill
+    x y moveto
+  } def
+
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%%     the \special command junk
+%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
+%   The structure of the PostScript produced by dvi2ps for \special is:
+%         @beginspecial
+%           - any number of @hsize, @hoffset, @hscale, etc., commands
+%         @setspecial
+%           - the users file of PostScript commands
+%         @endspecial
+
+% The @beginspecial command recognizes whether the Macintosh Laserprep
+% has been loaded or not, and redfines some Mac commands if so.
+% The @setspecial handles the users shifting, scaling, clipping commands
+
+%-- tjh sept. 87: made changes to allow postscript and macdrawing to
+%-- to be inserted with version 65 of the md dictionary.  Many bugs
+%-- were fixed:
+%--	vo changed to vof, name conflict with md
+%--	vs changed to vsz, name conflict with md
+%--	substantially changed @setspecial and @MacSetUp
+%-- Also, made changes to allow users to specify offsets
+%-- and clip rectangles in inches.
+
+% The following are user settable options from the \special command.
+
+/@SpecialDefaults
+  { /hs 8.5 72 mul def
+    /vsz 11 72 mul def
+    /ho 0 def
+    /vof 0 def
+    /hsc 1 def
+    /vsc 1 def
+    /CLIP false def
+  } def
+
+%       d @hsize -      specify a horizontal clipping dimension
+%                       these 2 are executed before the MacDraw initializations
+/@hsize {72 mul /hs exch def /CLIP true def} def
+/@vsize {72 mul /vsz exch def /CLIP true def} def
+
+%       d @hoffset -    specify a shift for the drwgs
+/@hoffset {72 mul /ho exch def} def
+/@voffset {72 mul /vof exch def} def
+
+%       s @hscale -     set scale factor
+/@hscale {/hsc exch def} def
+/@vscale {/vsc exch def} def
+
+/@setclipper
+  { hsc vsc scale
+    CLIP
+      { newpath 0 0 moveto hs 0 rlineto 0 vsz rlineto hs neg 0 rlineto closepath clip }
+    if
+  } def
+
+% this will be invoked as the result of a \special command (for the
+% inclusion of PostScript graphics).  The basic idea is to change all
+% scaling and graphics back to defaults, but to shift the origin
+% to the current position on the page.  Due to TeXnical difficulties,
+% we only set the y-origin.  The x-origin is set at the left edge of
+% the page.
+
+/@beginspecial
+  { gsave /SpecialSave save def
+          % the following magic incantation establishes the current point as
+          % the users origin, and reverts back to default scalings, rotations
+    currentpoint transform initgraphics itransform translate
+    @SpecialDefaults    % setup default offsets, scales, sizes
+    @MacSetUp           % fix up Mac stuff 
+  } def
+
+
+%--  tjh: assume this is raw postscript, but save some state in case its not.
+/@setspecial
+  {
+    /specmtrx matrix currentmatrix def
+    ho vof translate @setclipper
+  } def
+
+
+/@endspecial
+  { SpecialSave restore
+    grestore
+  } def
+
+
+% - @MacSetUp -   turn-off/fix-up all the MacDraw stuff that might hurt us
+                % we depend on 'psu' being the first procedure executed
+                % by a Mac document.  We redefine 'psu' to adjust page
+                % translations, and to do all other the fixups required.
+                % This stuff will not harm other included PS files
+/@MacSetUp
+  { userdict /md known  % if md is defined
+      { userdict /md get type /dicttype eq      % and if it is a dictionary
+         {
+           md begin                             % then redefine some stuff
+              /psu                              % redfine psu to set origins, etc.
+                /psu load
+                        % this procedure contains almost all the fixup code
+                {
+%		  /letter {} def    % it is bad manners to execute the real
+%		  /note {} def      %  versions of these (clears page image, etc.)
+%		  /legal {} def
+		  /MacDrwgs true def
+		  specmtrx setmatrix % restore pre-@setspecial state.
+		  initclip	     % ditto
+		  % change smalls to prevent page clearing.
+		  /smalls [ lnop lnop lnop lnop lnop lnop lnop lnop lnop ] def
+		  0 0 0 0 ppr astore pop  % prevents origin translation.
+		  % redifine cp, do the showpage later, see @eop
+                  /cp {
+			pop
+			pop
+			pm restore
+		  } def 		 	% no printing of pages
+                }
+                concatprocs 
+              def
+              /od
+		% redefine od to translate and scale.
+		% redfine load to set clipping region.
+                /od load
+                {
+		  ho vof translate
+		  hsc vsc scale
+		  CLIP {
+		      /nc
+			/nc load
+			{ newpath 0 0 moveto hs 0 rlineto 0 vsz rlineto
+			hs neg 0 rlineto closepath clip }
+			concatprocs
+		      def
+		  } if
+		}
+                concatprocs
+              def
+           end }
+        if }
+    if    
+  } def
+
+%       p1 p2 concatprocs p       - concatenate procedures
+/concatprocs
+  { /p2 exch cvlit def 
+    /p1 exch cvlit def
+    /p p1 length p2 length add array def
+    p 0 p1 putinterval 
+    p p1 length p2 putinterval
+    p cvx
+  } def
+
+end                     % revert to previous dictionary
+TeXDict begin @start
+%%Title: ted.dvi
+%%Creator: dvi2ps
+%%EndProlog
+5 @bop0
+[ 300 ] /cmr12.300 @newfont
+cmr12.300 @sf
+[<FFFFC07FFFC03FFFC030004018006008002004002002002001000001800000C000006000003000001800001C00000E000007
+  000007800003C00003C00003E02003E0F801E0F801E0F801E0F003E08003E04003C04003C02007801007000C1C0003F000> 24 33 -2 0 24.387] 50 @dc
+[<70F8F8F870> 8 5 -4 0 13.548] 46 @dc
+[<FFFFF8000F801E0007800700078003C0078001E0078000E0078000F007800078078000780780007C0780003C0780003C0780
+  003E0780003E0780003E0780003E0780003E0780003E0780003E0780003E0780003E0780003C0780003C0780003C0780007C
+  0780007807800078078000F0078000E0078001C0078003C0078007000F801E00FFFFF000> 32 34 -2 0 37.249] 68 @dc
+[<00FC000703000E00801C0040380020780020700000F00000F00000F00000F00000F00000FFFFE0F000E07000E07801E03801
+  C01C01C00C038007070001FC00> 24 21 -1 0 21.677] 101 @dc
+[<0FC1E03C2390781708F00F08F00708F00708F007087007007807003C07001E070007C70000FF000007000007000007001807
+  003C0E003C0C001838000FE000> 24 21 -2 0 24.387] 97 @dc
+[<FFF0000F00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000F00000F06
+  000F0F001E8F00FE46000E3C00> 24 21 -1 0 18.968] 114 @dc
+[<4020101008080404040474FCFCF870> 8 15 -4 10 13.548] 44 @dc
+[<81FC00C60700C80180F000C0E000C0C00060C000608000708000708000708000700000700000F00000F00001E00007E0003F
+  C003FF800FFF001FFE003FF0007F0000780000F00000F00000E00020E00020E00020E00060E000606000607000E03001E018
+  02600C0C6003F020> 24 36 -3 1 27.097] 83 @dc
+[<FFE7FF000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E00
+  70000E0070000E0070000E0070000F0070000F0070001E80E000FE60C0000E1F8000> 32 21 -1 0 27.097] 110 @dc
+[<01F0FE070CF00C02E01801E03800E07800E07000E0F000E0F000E0F000E0F000E0F000E0F000E0F000E07000E07800E03800
+  E01C01E00C02E00704E001F8E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00001E000
+  0FE00000E0> 24 35 -2 0 27.097] 100 @dc
+[<01F0030807080E040E040E040E040E040E040E000E000E000E000E000E000E000E000E000E000E00FFF83E001E000E000600
+  060006000200020002000200> 16 31 -1 0 18.968] 116 @dc
+[<0000007C00FFFC01E2000FC003C100078007C08007800FC08007800F808007800F800007800F800007800F800007800F0000
+  07800F000007800F000007800F000007800E000007801E000007801C00000780380000078070000007FFE0000007803C0000
+  07800E00000780078000078007C000078003C000078003E000078003E000078003E000078003E000078003E000078003C000
+  078007C000078007800007800E00000F803C0000FFFFE00000> 40 35 -2 1 35.894] 82 @dc
+[<00200040008001000300060004000C000C00180018003000300030007000600060006000E000E000E000E000E000E000E000
+  E000E000E000E000E000E000E0006000600060007000300030003000180018000C000C000400060003000100008000400020> 16 50 -4 13 18.968] 40 @dc
+[<FFFE07C003800380038003800380038003800380038003800380038003800380038003800380038003800380038003800380
+  0380038003800380F3800F8003800080> 16 33 -4 0 24.387] 49 @dc
+[<0FC000103000201800700C007806007807003003000003800003800001C00001C00001C003E1E00619E00C05E01805E03803
+  E07003E07001E0F001E0F001E0F001E0F001E0F001E0F001C0F001C0F001C07003807003803803801807000C0600060C0001
+  F000> 24 34 -2 1 24.387] 57 @dc
+[<800040002000100018000C000400060006000300030001800180018001C000C000C000C000E000E000E000E000E000E000E0
+  00E000E000E000E000E000E000E000C000C000C001C0018001800180030003000600060004000C0018001000200040008000> 16 50 -3 13 18.968] 41 @dc
+[<FFE00020001F000060000E00006000040000E000040001E000040001E000040003E000040003E000040007A00004000F2000
+  04000F200004001E200004003E200004003C200004007820000400F820000400F020000401E020000401E020000403C02000
+  04078020000407802000040F002000041F002000041E002000043C002000047C002000047800200004F000200004F0002000
+  05E000200007C000700007C000F800FF8007FF00> 40 34 -2 0 36.563] 78 @dc
+[<00FC7F0003827800060170000E00F0000E00F0000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E00
+  70000E0070000E0070000E0070000E0070000E0070001E00F000FE07F0000E007000> 32 21 -1 0 27.097] 117 @dc
+[<01F8000706000C01001C0080380040780040700000F00000F00000F00000F00000F00000F00000F000007000007800003803
+  001C07800C078007030001FE00> 24 21 -2 0 21.677] 99 @dc
+[<FF800FFF003E0001F8000C0000F0000C0000F000040001E000040001E000040003E000020003C000020003C000030007C000
+  0100078000010007800000FFFF000000800F000000800F000000401E000000401E000000401E000000203C000000203C0000
+  00203C0000001078000000107800000010F800000008F000000008F00000000DF000000005E000000005E000000003C00000
+  0003C000000003C00000000180000000018000000001800000> 40 35 -2 0 36.563] 65 @dc
+[<FFC00E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E001E00FE000E000000000000000000
+  00000000000000001C001E003E001E001C00> 16 34 -1 0 13.548] 105 @dc
+[<8FC0D030E018C008C00C800C800C801C003C01F80FF03FE07F80F000E008C008C008C018601830780F88> 16 21 -2 0 19.239] 115 @dc
+[<FFE0000E00000E00000E00000E00000E00000E00000E00000E00000E00000E3F000E41C00E80E00F00700E00380E003C0E00
+  3C0E001E0E001E0E001E0E001E0E001E0E001E0E001E0E001C0E003C0E00380F00700E8060FE61C00E1F00> 24 31 -1 10 27.097] 112 @dc
+[<03F0000C1C00100F002007804007804003C0F003C0F803E0F803E07003E02003E00003E00003C00003C0000780000780000F
+  00001C0003F000003800000E00000F000007000007800007803807C07807C07803C07807C04007C02007801007000C1E0003
+  F800> 24 34 -2 1 24.387] 51 @dc
+[<FFFC3FFF000FC003F000078001E000078001E000078001E000078001E000078001E000078001E000078001E000078001E000
+  078001E000078001E000078001E000078001E000078001E000078001E000078001E00007FFFFE000078001E000078001E000
+  078001E000078001E000078001E000078001E000078001E000078001E000078001E000078001E000078001E000078001E000
+  078001E000078001E0000FC003F000FFFC3FFF00> 40 34 -2 0 36.563] 72 @dc
+[<FFE00E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E00
+  0E000E000E000E000E000E000E001E00FE000E00> 16 35 -1 0 13.548] 108 @dc
+[<FFFFFF000F803F0007800F000780030007800300078001000780018007800180078000800780008007800080078000800780
+  0000078000000780000007800000078000000780000007800000078000000780000007800000078000000780000007800000
+  078000000780000007800000078000000780000007800000078000000FC00000FFFE0000> 32 34 -2 0 30.475] 76 @dc
+[<0007F00000003C0C080000E003180001C000B800038000B80007000078000F000078001E000078001E000078003C00007800
+  3C000078007C000078007800007800780000F800F8001FFF00F800000000F800000000F800000000F800000000F800000000
+  F800000000F800000000780000080078000008007C000008003C000018003C000018001E000018001E000038000F00003800
+  0700007800038000F80001C001B80000E0021800003C0C18000007F00800> 40 36 -3 1 38.270] 71 @dc
+[<FFFC00000FC00000078000000780000007800000078000000780000007800000078000000780000007800000078000000780
+  000007800000078000000780000007FFF00007803C0007800F0007800780078007C0078003C0078003E0078003E0078003E0
+  078003E0078003E0078003E0078003C0078007C00780078007800F000F803C00FFFFF000> 32 34 -2 0 33.185] 80 @dc
+[<083F000C41C00C80600F00700E00380E003C0E001C0E001E0E001E0E001E0E001E0E001E0E001E0E001E0E001C0E003C0E00
+  380F00300E80600E61C00E1F000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00001E0000FE
+  00000E0000> 24 35 -1 0 27.097] 98 @dc
+[<FFE3FF8FFE0E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00E003800E0
+  0E003800E00E003800E00E003800E00E003800E00E003800E00E003800E00F003C00E00F003C00E01E807201C0FE60E18380
+  0E1FC07F00> 40 21 -1 0 40.645] 109 @dc
+[ 300 ] /cmbx12.300 @newfont
+cmbx12.300 @sf
+[<FFFFE000FFFFE00007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0180007F0180007F0
+  180007F0180007F0380007F0780007FFF80007FFF80007F0780007F0380007F0180007F0180007F0180C07F0180C07F0000C
+  07F0000C07F0001C07F0001C07F0001807F0003807F0007807F001F8FFFFFFF8FFFFFFF8> 32 34 -2 0 35.226] 70 @dc
+[<FFE0FFE01F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F00FF00FF00000000000000
+  00000000000000001C003F007F007F007F003F001C00> 16 36 -2 0 15.566] 105 @dc
+[<01FF00000FFFE0003F01F8007C007C0078003C00F0001E00F0001E00F0001E00F0001E0070003E003800FC001FFFFC000FFF
+  F8001FFFF0003FFF800038000000300000003000000013FC00001FFF00001F0F80003E07C0003C03C0007C03E0007C03E000
+  7C03E0007C03E0007C03E0003C03C0003E07CF001F0F8F000FFF7F0003FC1E00> 32 33 -2 11 28.019] 103 @dc
+[<01FC3FC007FF3FC00F81BE001F00FE001F007E001F003E001F003E001F003E001F003E001F003E001F003E001F003E001F00
+  3E001F003E001F003E001F003E001F003E001F003E001F003E001F003E00FF01FE00FF01FE00> 32 22 -2 0 31.133] 117 @dc
+[<FFF000FFF0001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F00001F83C01E87
+  E01E87E01EC7E01E67E0FE3FC0FE1F00> 24 22 -2 0 22.888] 114 @dc
+[<00FF0003FFC00FC0701F00303E00187E00007C00007C0000FC0000FC0000FC0000FFFFF8FFFFF8FC00F8FC00F87C00F87C00
+  F03E01F01E01E00F87C007FF8000FE00> 24 22 -2 0 25.569] 101 @dc
+[<7FFFE07FFFE001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F8
+  0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F800FFF800FFF80001F800007800001800> 24 32 -4 0 28.019] 49 @dc
+[<FFE0FFE01F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F001F00
+  1F001F001F001F001F001F001F001F00FF00FF00> 16 35 -2 0 15.566] 108 @dc
+[<FFE1FFC0FFE1FFC01F003E001F003E001F003E001F003E001F003E001F003E001F003E001F003E001F003E001F003E001F00
+  3E001F003E001F003E001F003E001F803E001F803E001F403C001F307C00FF1FF800FF07E000> 32 22 -2 0 31.133] 110 @dc
+[<01FC3FC007FF3FC00F83BE001E00FE003E007E007C003E007C003E00FC003E00FC003E00FC003E00FC003E00FC003E00FC00
+  3E00FC003E00FC003E007C003E007E003E003E003E001F007E000F81FE0007FFBE0001FC3E0000003E0000003E0000003E00
+  00003E0000003E0000003E0000003E0000003E0000003E0000003E0000003E000001FE000001FE00> 32 35 -2 0 31.133] 100 @dc
+[<387CFEFEFE7C38> 8 7 -4 0 15.566] 46 @dc
+cmr12.300 @sf
+[<FFFE00000FC00000078000000780000007800000078000000780000007800000078000000780000007800000078020000780
+  20000780200007802000078060000780E00007FFE0000780E000078060000780200007802000078020000780202007800020
+  0780002007800020078000600780004007800040078000C0078001C00F8007C0FFFFFFC0> 32 34 -2 0 31.830] 70 @dc
+[<03FE000E03803800E0600030600030C00018C00018C000184000186000303800F00FFFE00FFFC01FFE001800001800001000
+  0010000019F0000F1C000E0E001C07001C07003C07803C07803C07803C07801C07001C07000E0E18071E1801F198000070> 24 33 -1 11 24.387] 103 @dc
+[<FFE7FF000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E0070000E00
+  70000E0070000E0070000E0070000F0070000F0070000E80E0000E60C0000E1F80000E0000000E0000000E0000000E000000
+  0E0000000E0000000E0000000E0000000E0000000E0000000E0000001E000000FE0000000E000000> 32 35 -1 0 27.097] 104 @dc
+[<01FC000707000E03801C01C03800E07800F0700070F00078F00078F00078F00078F00078F00078F000787000707000703800
+  E01800C00C018007070001FC00> 24 21 -1 0 24.387] 111 @dc
+[<00600600000060060000006006000000F00F000000F00F000000F00D000001C81C800001C81C800001C81880000384384000
+  038438400003843040000702702000070270200007026020000E01E010000E01E010000E01C018001C01C018001E01E03C00
+  FF8FF8FF00> 40 21 -1 0 35.225] 119 @dc
+[<381C7C3EFC7EFC7EB85C8040804080408040402040202010201010080804> 16 15 -6 -20 24.387] 92 @dc
+[<4020201010081008080408040402040204020402743AFC7EFC7EF87C7038> 16 15 -2 -20 24.387] 34 @dc
+[<7FF8000780000700000700000700000700000700000700000700000700000700000700000700000700000700000700000700
+  00070000070000070000FFF800070000070000070000070000070000070000070000070000070000030F00038F00018F0000
+  C600003C00> 24 35 0 0 14.903] 102 @dc
+[<03FFFF00000FC000000780000007800000078000000780000007800000078000000780000007800000078000000780000007
+  8000000780000007800000078000000780000007800000078000000780000007800000078000000780008007800480078004
+  8007800480078004C007800C40078008400780084007800860078018780780787FFFFFF8> 32 34 -2 0 35.225] 84 @dc
+[<FFE0FFE0> 16 2 -1 -10 16.258] 45 @dc
+[<3C0000430000F18000F08000F0400000400000200000200000200000100000100000380000380000380000740000740000E2
+  0000E20000E20001C10001C1000380800380800380800700400700400E00200E00200E00301E0078FFC1FE> 24 31 -1 10 25.742] 121 @dc
+[<FFFF807007807801803800801C00800E00C00F004007004003804001C00001E00000E000007000403800403C00401C00600E
+  002007003007803803803FFFC0> 24 21 -1 0 21.677] 122 @dc
+[<00100000380000380000380000740000740000E20000E20000E20001C10001C1000380800380800380800700400700400E00
+  200E00200E00301E0078FFC1FE> 24 21 -1 0 25.742] 118 @dc
+[<000FFE0000E00000E00000E00000E00000E00000E00000E00000E00000E001F0E0070CE00C02E01C01E03801E07800E07000
+  E0F000E0F000E0F000E0F000E0F000E0F000E0F000E07800E07800E03801E01C01600E026007046001F820> 24 31 -2 10 25.742] 113 @dc
+[<4020101008080404040474FCFCF870> 8 15 -4 -20 13.548] 39 @dc
+[<7FE3FF8007007800070070000700700007007000070070000700700007007000070070000700700007007000070070000700
+  700007007000070070000700700007007000070070000700700007007000FFFFFFC007007000070070000700700007007000
+  07007000070070000700700007007000070070000380F0780180F87800C07C7800706E30001F83E0> 32 35 0 0 28.451] 11 @dc
+[<7FE1FF8007003800070038000700380007003800070038000700380007003800070038000700380007003800070038000700
+  380007003800070038000700380007003800070038000700380007007800FFFFF80007000000070000000700000007000000
+  0700000007000000070000000700300007007800038078000180380000C0100000702000001FC000> 32 35 0 0 27.097] 12 @dc
+[<3E006180F180F0C060E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E000E0
+  00E000E000E001E00FE001E00000000000000000000000000000000001C003E003E003E001C0> 16 44 2 10 14.903] 106 @dc
+[<FFFFFC000F800F00078007C0078003E0078001F0078001F0078000F8078000F8078000F8078000F8078000F8078000F80780
+  00F0078001F0078001E0078003C00780078007FFFE0007803E0007800F80078007C0078003E0078001E0078001F0078001F0
+  078001F0078001F0078001F0078001E0078003E0078003C0078007800F800E00FFFFF800> 32 34 -2 0 34.539] 66 @dc
+[<03F0000C1C001006002007004003804003C08001C0E001C0F001E0F001E07001E00001E00001E00001E00001E00001C00001
+  C0100380180380140700130E0010F80010000010000010000010000010000010000013E0001FF8001FFE001FFF001E070010
+  0080> 24 34 -2 1 24.387] 53 @dc
+5 @bop1
+cmr12.300 @sf
+237 307 p (2.) s
+22 r (Dear,) s
+16 r (S.) s
+16 r (and) s
+17 r (Staden,) s
+16 r (R.) s
+16 r (\(1991\)) s
+16 r (Nuc.) s
+22 r (Acids) s
+16 r (Res.,) s
+17 r (in) s
+16 r (press.) s
+237 367 p (3.) s
+22 r (Hillier,) s
+16 r (L.) s
+16 r (and) s
+17 r (Green,) s
+16 r 80 c
+-3 r 46 c
+15 r (\(1991\)) s
+16 r (submitted.) s
+cmbx12.300 @sf
+237 428 p (Figure) s
+19 r 49 c
+18 r (legend.) s
+cmr12.300 @sf
+237 488 p (Figure) s
+17 r 49 c
+17 r (sho) s
+0 r (ws) s
+15 r 97 c
+17 r (\\screen) s
+17 r (dump") s
+17 r (of) s
+17 r (the) s
+17 r (ted) s
+17 r (graphical) s
+17 r (in) s
+-1 r (terface.) s
+23 r (The) s
+17 r (dis-) s
+164 548 p (pla) s
+0 r 121 c
+22 r (consists) s
+24 r (of) s
+23 r (the) s
+24 r (con) s
+-1 r (trol) s
+23 r (panel) s
+23 r (and) s
+24 r (the) s
+23 r (sync) s
+0 r (hronized) s
+22 r (view) s
+24 r (of) s
+23 r (the) s
+24 r (base) s
+164 608 p 112 c
+1 r (osition) s
+19 r (information,) s
+19 r (original) s
+18 r (and) s
+19 r (edited) s
+18 r (sequence) s
+19 r (data,) s
+19 r (and) s
+18 r (graphical) s
+19 r (rep-) s
+164 668 p (resen) s
+0 r (tation) s
+15 r (of) s
+16 r (the) s
+16 r (trace) s
+16 r (\(with) s
+16 r (eac) s
+0 r 104 c
+15 r 110 c
+-1 r (ucleotide's) s
+15 r (trace) s
+16 r 98 c
+2 r (eing) s
+16 r (represen) s
+-1 r (ted) s
+15 r 98 c
+0 r 121 c
+15 r 97 c
+164 729 p (di\013eren) s
+0 r 116 c
+16 r (color\).) s
+24 r (The) s
+17 r (con) s
+0 r (trol) s
+16 r (panel) s
+17 r (allo) s
+0 r (ws) s
+16 r (the) s
+18 r (user) s
+17 r (to) s
+17 r (read) s
+17 r (in) s
+17 r (new) s
+17 r (trace) s
+18 r (\014les) s
+164 789 p (\(in) s
+17 r (either) s
+16 r 98 c
+1 r (ottom) s
+17 r (or) s
+16 r (top) s
+17 r (strand) s
+17 r (orien) s
+-1 r (tation\)) s
+16 r (as) s
+16 r 119 c
+0 r (ell) s
+15 r (as) s
+17 r (to) s
+17 r (searc) s
+-1 r 104 c
+16 r (for) s
+16 r 97 c
+17 r (string) s
+164 849 p (of) s
+15 r 110 c
+0 r (ucleotides) s
+15 r (or) s
+15 r 97 c
+15 r (certain) s
+16 r (base) s
+15 r 112 c
+2 r (osition.) s
+21 r (Scroll) s
+16 r (bars) s
+15 r (allo) s
+0 r 119 c
+14 r (the) s
+16 r (user) s
+15 r (to) s
+15 r (adjust) s
+164 909 p (the) s
+15 r (magni\014cation) s
+15 r (of) s
+15 r (or) s
+14 r (scroll) s
+15 r (through) s
+15 r (the) s
+15 r (sequence) s
+15 r (and) s
+15 r (trace) s
+15 r (data.) s
+21 r (The) s
+15 r (user) s
+164 969 p (ma) s
+0 r 121 c
+15 r (also) s
+16 r 99 c
+-1 r (ho) s
+1 r (ose) s
+16 r (to) s
+16 r 99 c
+-1 r (hange) s
+15 r (the) s
+16 r 118 c
+0 r (ertical) s
+15 r (magni\014cation) s
+16 r (of) s
+16 r (the) s
+16 r (trace) s
+16 r (data.) s
+22 r 70 c
+-3 r (ur-) s
+164 1029 p (ther,) s
+17 r (sequence) s
+17 r (on) s
+17 r (the) s
+18 r (head) s
+17 r (\(v) s
+-1 r (ector\)) s
+16 r (or) s
+17 r (tail) s
+18 r (\(uncertain) s
+17 r (data\)) s
+17 r (of) s
+17 r (the) s
+17 r (sequence) s
+164 1090 p (ma) s
+0 r 121 c
+19 r 98 c
+1 r 101 c
+21 r (\\cuto\013) s
+3 r 34 c
+20 r (using) s
+21 r (the) s
+20 r (adjust) s
+20 r (left) s
+21 r (and) s
+20 r (righ) s
+0 r 116 c
+19 r (cuto\013) s
+20 r (buttons.) s
+34 r (Bases) s
+20 r (can) s
+164 1150 p 98 c
+1 r 101 c
+17 r (inserted,) s
+16 r (deleted,) s
+17 r (or) s
+16 r (replaced) s
+17 r (as) s
+16 r (with) s
+17 r (an) s
+0 r 121 c
+15 r (ordinary) s
+17 r 119 c
+-1 r (ord-pro) s
+1 r (cessor) s
+16 r (in) s
+17 r (the) s
+164 1210 p (sequence) s
+17 r (data) s
+16 r (windo) s
+0 r (w.) s
+22 r (Finally) s
+-3 r 44 c
+16 r (the) s
+17 r (sequence) s
+16 r (ma) s
+0 r 121 c
+16 r 98 c
+1 r 101 c
+17 r (written) s
+16 r (to) s
+17 r (an) s
+17 r (ascii) s
+17 r (\014le) s
+164 1270 p (using) s
+16 r (the) s
+16 r (output) s
+17 r (button) s
+16 r (on) s
+16 r (the) s
+17 r (con) s
+-1 r (trol) s
+15 r (panel.) s
+961 2599 p 53 c
+@eop
+4 @bop0
+cmbx12.300 @sf
+[<FFE00FFFF8FFE00FFFF80600007F000600007F00030000FE00030000FE00038001FE00018001FC00018001FC0000FFFFF800
+  00FFFFF80000E007F800006007F00000600FF00000300FE00000300FE00000381FE00000181FC00000183FC000000C3F8000
+  000C3F8000000E7F800000067F00000006FF00000003FE00000003FE00000003FE00000001FC00000001FC00000000F80000
+  0000F800000000F8000000007000000000700000> 40 34 -2 0 42.317] 65 @dc
+[<FFFF800000FFFF80000007F000000007F000000007F000000007F000000007F000000007F000000007F000000007F0000000
+  07F000000007F000000007F000000007F000000007F000000007FFFF000007FFFFE00007F007F00007F001FC0007F000FC00
+  07F0007E0007F0007E0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007E0007F0007E0007F000FC00
+  07F001FC0007F007F000FFFFFFE000FFFFFF0000> 40 34 -2 0 38.281] 80 @dc
+[<FFFFFFF0FFFFFFF007F007F007F001F007F000F007F0007007F0007007F0003807F0003807F0001807F0001807F0001807F0
+  001807F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F0000007F00000
+  07F0000007F0000007F0000007F0000007F0000007F0000007F00000FFFFE000FFFFE000> 32 34 -2 0 33.669] 76 @dc
+[<FFFFE0FFFFE003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F8
+  0003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F80003F800FFFFE0FF
+  FFE0> 24 34 -1 0 20.870] 73 @dc
+[<0003FE0000001FFFC00000FF00F00001F800380003F0000C0007C00006000F800003001F800003003F000003803F00000180
+  7F000001807E000001807E00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000
+  FE000000007E000001807E000001807F000001803F000003803F000003801F800007800F8000078007C0000F8003E0001F80
+  01F8003F8000FF01E380001FFF81800003FE0080> 40 34 -3 0 40.472] 67 @dc
+[<03FFFFF80003FFFFF8000003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F80000
+  0003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F800000003F80000
+  0003F800000003F80000C003F800C0C003F800C0C003F800C0C003F800C0E003F801C0E003F801C06003F801807003F80380
+  7803F807807E03F80F807FFFFFFF807FFFFFFF80> 40 34 -2 0 38.973] 84 @dc
+[<0007FC0000003FFF800000FC07E00003F001F80007E000FC000FC0007E001F80003F003F80003F803F00001F807F00001FC0
+  7F00001FC07E00000FC0FE00000FE0FE00000FE0FE00000FE0FE00000FE0FE00000FE0FE00000FE0FE00000FE0FE00000FE0
+  FE00000FE07E00000FC07E00000FC07F00001FC03F00001F803F00001F801F80003F001F80003F000FC0007E0007E000FC00
+  03F001F80000FC07E000003FFF80000007FC0000> 40 34 -3 0 42.086] 79 @dc
+[<FFF00000C0FFF00001C006000003C006000003C006000007C00600000FC00600001FC00600003FC00600007FC00600007FC0
+  060000FFC0060001FEC0060003FCC0060007F8C0060007F0C006000FF0C006001FE0C006003FC0C006007F80C00600FF00C0
+  0600FE00C00601FE00C00603FC00C00607F800C0060FF000C0060FE000C0061FE000C0063FC000C0067F8000C006FF0000C0
+  07FE0000C007FC0000C0FFFC001FFEFFF8001FFE> 40 34 -2 0 43.816] 78 @dc
+[<80FF8000C7FFE000FF00F800FC003C00F0003C00E0001E00E0001E00C0001F00C0001F00C0001F0000003F0000003F000000
+  7F000003FF00003FFE0003FFFE000FFFFC001FFFF8003FFFF0007FFFC0007FFF0000FFE00000FF000000FC000000FC000C00
+  F8000C00F8000C0078001C0078001C007C003C003C007C001F03FC0007FF8C0001FC0400> 32 34 -3 0 31.133] 83 @dc
+[<FFFFFF8000FFFFFFF00007F003FC0007F0007E0007F0003F0007F0001F8007F0000FC007F00007E007F00007E007F00003F0
+  07F00003F007F00003F007F00003F807F00003F807F00003F807F00003F807F00003F807F00003F807F00003F807F00003F8
+  07F00003F807F00003F007F00003F007F00003F007F00007F007F00007E007F00007E007F0000FC007F0001F8007F0003F00
+  07F0007E0007F003FC00FFFFFFF000FFFFFF8000> 40 34 -2 0 42.951] 68 @dc
+[<0001FF0000000FFFE000003F80F800007E001C0000FC000E0001F800060003F800030003F000030007F000018007F0000180
+  07F000018007F000018007F000018007F000018007F000018007F000018007F000018007F000018007F000018007F0000180
+  07F000018007F000018007F000018007F000018007F000018007F000018007F000018007F000018007F000018007F0000180
+  07F000018007F0000180FFFF803FFCFFFF803FFC> 40 34 -2 0 43.067] 85 @dc
+cmr12.300 @sf
+[<FFFC0FC007800780078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  07800780078007800780078007800FC0FFFC> 16 34 -2 0 17.595] 73 @dc
+[<0007E00000381C0000E0020001C0010003800080070000400E0000401E0000201C0000203C0000103C0000107C0000107800
+  001078000000F8000000F8000000F8000000F8000000F8000000F8000000F8000000F800000078000010780000107C000010
+  3C0000303C0000301C0000301E0000700E000070070000F0038000F001C0017000E00630003818300007E010> 32 36 -3 1 35.225] 67 @dc
+[<FFE3FE0E00F80E00F00E01E00E01C00E03C00E07800E07000E0E000E1E000F1C000EF8000E38000E10000E08000E04000E02
+  000E01800E01C00E01F00E03FC0E00000E00000E00000E00000E00000E00000E00000E00000E00000E00000E00001E0000FE
+  00000E0000> 24 35 -1 0 25.742] 107 @dc
+[<0003F00000001C0800000030060000006001000000E000800001C000800003C000400003C000400003800040000780002000
+  0780002000078000200007800020000780002000078000200007800020000780002000078000200007800020000780002000
+  0780002000078000200007800020000780002000078000200007800020000780002000078000200007800020000780002000
+  0780002000078000200007800070000FC000F800FFFC07FF00> 40 35 -2 1 36.563] 85 @dc
+[<FF03FE3E01F00C00E00401E00201C0010380018700008E00004E00003C0000380000780000780000E40001C2000383000381
+  000700800E00C01F01F0FF83FE> 24 21 -1 0 25.742] 120 @dc
+[<FEFEC0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0C0FEFE> 8 49 -5 12 13.548] 91 @dc
+[<FEFE060606060606060606060606060606060606060606060606060606060606060606060606060606060606060606FEFE> 8 49 -1 12 13.548] 93 @dc
+cmbx12.300 @sf
+[<0000380000000038000000007C000000007C00000000FE00000000FE00000000FE00000001FF00000001FF00000003FD8000
+  0003F980000007F9C0000007F0C0000007F0C000000FF06000000FE06000001FE03000001FC03000003FC03800003F801800
+  003F801800007F800C00007F000C0000FF00060000FE00060001FE00070001FC00030001FC00030003F800018003F8000180
+  07F80000C007F00000C0FFFF800FFEFFFF800FFE> 40 34 -1 0 42.317] 86 @dc
+[<FFFFFFC000FFFFFFF80007F001FE0007F0007F0007F0003F8007F0003F8007F0001FC007F0001FC007F0001FC007F0001FC0
+  07F0001FC007F0001FC007F0001F8007F0003F8007F0003F0007F0007E0007F001FC0007FFFFF00007FFFFF00007F003F800
+  07F000FC0007F000FE0007F0007E0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007E0007F0007E00
+  07F000FC0007F001F800FFFFFFE000FFFFFF8000> 40 34 -2 0 39.838] 66 @dc
+[<001FFFF000001FFFF0000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE00000000FE0000
+  0000FE00000000FE00000000FE00000000FE00000001FE00000001FF00000003FF80000003FD80000007F8C000000FF0E000
+  000FF06000001FE07000003FC03000003FC01800007F801C0000FF000C0000FF00060001FE00070001FE00030003FC000180
+  07F80001C007F80000C0FFFF800FFEFFFF800FFE> 40 34 -1 0 42.317] 89 @dc
+cmr12.300 @sf
+[<FFE0203FFF001F007003F0000E007001E00004007001E0000400F801E0000400F801E0000400F801E0000401E401E0000401
+  E401E0000401E401E0000403C201E0000403C201E00004078101E00004078101E00004078101E000040F0081E000040F0081
+  E000040F0081E000041E0041E000041E0041E000043C0021E000043C0021E000043C0021E00004780011E00004780011E000
+  04780011E00004F00009E00004F00009E00004F00009E00005E00005E00005E00005E00007C00003E0000FC00003F000FFC0
+  0003FF00> 48 34 -2 0 44.692] 77 @dc
+[<FFFC07FF800FC000FC00078000F800078000F000078001E000078001E000078003C000078007C000078007800007800F0000
+  07800F000007801E000007803C000007C03C000007A07800000790F000000788F000000789E000000787E000000783C00000
+  0781800000078080000007804000000780200000078010000007800800000780040000078002000007800100000780008000
+  078000400007800060000FC000F800FFFC03FF00> 40 34 -2 0 37.918] 75 @dc
+[<000001E0000003F8000007F8000007FC000007FC00000F0E00000E0600000C0200000C02000FEC02007C3C0200E80E0003C8
+  1780078813C00F0801E00E0420E01E0380F03C0000783C0000787C00007C7C00007C7800003CF800003EF800003EF800003E
+  F800003EF800003EF800003EF800003EF800003EF800003E7800003C7800003C7C00007C7C00007C3C0000783E0000F81E00
+  00F00E0000E00F0001E0078003C003C0078000E00E0000783C00000FE000> 32 45 -3 10 37.935] 81 @dc
+[<FFE007FFC01F8001FC00070000F800030000F000010001F000008001E00000C003E000004007C000002007800000300F8000
+  00101F000000081E0000000C3E000000047C000000027800000003F800000001F000000001E000000003E000000007C00000
+  0007A00000000FB00000001F100000001E080000003E0C0000007C040000007802000000F803000000F001000001E0008000
+  03E000C00003C000E00007E001F8007FF807FF00> 40 34 -1 0 36.563] 88 @dc
+[<000FE00000783C0000E00E0003C00780078003C00F0001E00F0001E01E0000F03E0000F83C0000787C00007C7C00007C7800
+  003CF800003EF800003EF800003EF800003EF800003EF800003EF800003EF800003EF800003E7800003C7800003C7C00007C
+  7C00007C3C0000783C0000781E0000F00E0000E00F0001E0078003C003C0078000E00E0000783C00000FE000> 32 36 -3 1 37.935] 79 @dc
+cmbx12.300 @sf
+[<FFFF83FFFCFFFF83FFFC07F0003FC007F0007F8007F0007F0007F000FF0007F001FE0007F001FC0007F003F80007F007F800
+  07F007F00007F00FE00007F01FE00007F01FC00007F83F800007FE7F800007F77F000007F3FE000007F1FE000007F0FC0000
+  07F038000007F018000007F00E000007F007000007F003800007F001C00007F000E00007F000300007F000180007F0000E00
+  07F000070007F0000380FFFF803FFCFFFF803FFC> 40 34 -2 0 43.874] 75 @dc
+[<0000E0000E00000000E0000E00000000F0001E00000001F0001F00000001F0001F00000003F8003F80000003F8003F800000
+  03FC007F80000007FC007FC0000007FC007FC000000FF600FFE000000FE600FE6000000FE600FE6000001FE301FC3000001F
+  C301FC3000001FC383FC3000003F8183F81800003F8183F81800007F80C7F81C00007F00C7F00C00007F00C7F00C0000FF00
+  6FE0060000FE006FE0060000FE007FE0060001FC003FC0030001FC003FC0030003FC003F80038003F8003F80018003F8007F
+  80018007F0007F0000C007F0007F0000C00FF000FF0000E0FFFF0FFFF01FFEFFFF0FFFF01FFE> 56 34 -1 0 57.883] 87 @dc
+[<FFFFFFFC00FFFFFFFC0007F001FC0007F0003E0007F0001E0007F0000E0007F0000E0007F000060007F000060007F0000600
+  07F000030007F018030007F018030007F018000007F018000007F038000007F078000007FFF8000007FFF8000007F0780000
+  07F038000007F018000007F018060007F018060007F018060007F000060007F0000E0007F0000E0007F0000C0007F0001C00
+  07F0003C0007F000FC00FFFFFFFC00FFFFFFFC00> 40 34 -2 0 36.782] 69 @dc
+[<0003FF00C0001FFFC3C0007F80E7C001FC003FC003F0001FC007E0001FC00FC0001FC01F80001FC03F00001FC03F00001FC0
+  7F00001FC07F00001FC07E000FFFFCFE000FFFFCFE00000000FE00000000FE00000000FE00000000FE00000000FE00000000
+  FE000000007E000000C07E000000C07F000000C03F000001C03F000001C01F800003C00FC00003C007C00007C003F0000FC0
+  01F8003FC0007F00F1C0001FFFC0C00003FE0040> 40 34 -3 0 44.047] 71 @dc
+[<FFF00600FFFF00FFF00F00FFFF0006000F0007E00006001F8007E00006001F8007E00006001F8007E00006003FC007E00006
+  003FC007E00006007E6007E00006007E6007E0000600FC3007E0000600FC3007E0000601F81807E0000601F81807E0000601
+  F81807E0000603F00C07E0000603F00C07E0000607E00607E0000607E00607E000060FC00307E000060FC00307E000060FC0
+  0307E000061F800187E000061F800187E000063F0000C7E000063F0000C7E000067E000067E000067E000067E00006FC0000
+  37E00006FC000037E00006FC000037E00007F800001FE000FFF800001FFF00FFF000000FFF00> 56 34 -2 0 53.156] 77 @dc
+cmr12.300 @sf
+[<70F8F8F870000000000000000000000070F8F8F870> 8 21 -4 0 13.548] 58 @dc
+[<07C000187000203800401C00F01E00F80E00F80F00F80F00700F00000F00000F00000F00000F00000F00000F00000F00000F
+  00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F0000
+  1F0003FFF0> 24 35 -2 1 25.056] 74 @dc
+[<00020000800000030001800000070001C00000070001C00000070001C000000F8003E000000F8003E000000F8003E000001E
+  40079000001E40079000001E40079000003C200F0800003C200F0800003C200F0800007C101E04000078101E04000078101E
+  040000F8183E060000F0083C020000F0083C020000F0083C020001E00478010001E00478010001E00478010003C002F00080
+  03C002F0008003C002F00080078001E00040078001E00040078001E000400F0003C000200F0003C000200F0003C000701F80
+  07E000F8FFF03FFC03FE> 48 35 -1 1 50.111] 87 @dc
+[<01F000071C000C06001C07003803803803807803C07001C07001C07001C0F001E0F001E0F001E0F001E0F001E0F001E0F001
+  E0F001E0F001E0F001E0F001E0F001E0F001E0F001E07001C07001C07001C07001C03803803803801803000C0600071C0001
+  F000> 24 34 -2 1 24.387] 48 @dc
+[<01F000070C000C06001C03001803803801C03801C07001E07001E07001E0F001E0F001E0F001E0F001E0F001E0F001C0F801
+  C0F80380F40300F40600F30C00F0F8007000007000007800003800003800001801801C03C00E03C00601C003008001C10000
+  7E00> 24 34 -2 1 24.387] 54 @dc
+cmbx12.300 @sf
+[<FFFF800FF0FFFF803FF807F000FF0C07F000FE0607F001FC0607F001FC0007F001FC0007F001FC0007F001FC0007F001FC00
+  07F001FC0007F001FC0007F001FC0007F003F80007F007F00007F00FE00007FFFF800007FFFFC00007F007F00007F001F800
+  07F000FC0007F0007E0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007F0007E0007F000FC00
+  07F001F80007F007F000FFFFFFC000FFFFFE0000> 40 34 -2 0 41.798] 82 @dc
+cmr12.300 @sf
+[<01FFF0001F00000E00000E00000E00000E00000E00000E00000E00FFFFF8800E00400E00200E00200E00100E00100E00080E
+  00040E00040E00020E00020E00010E00008E00008E00004E00004E00002E00001E00001E00000E00000E00000600000200> 24 33 -1 0 24.387] 52 @dc
+4 @bop1
+cmbx12.300 @sf
+164 307 p (APPLICA) s
+-4 r (TIONS) s
+18 r (AND) s
+19 r (CONCLUSIONS) s
+cmr12.300 @sf
+164 400 p (In) s
+18 r (the) s
+18 r (C.) s
+19 r (elegans) s
+18 r (genome) s
+18 r (sequencing) s
+19 r (pro) s
+2 r (ject,) s
+19 r (data) s
+18 r (from) s
+18 r (the) s
+19 r (ABI) s
+18 r (or) s
+18 r (A.L.F.) s
+164 460 p (sequencing) s
+20 r (mac) s
+0 r (hines') s
+19 r (computers) s
+20 r (are) s
+20 r (transferred) s
+20 r (to) s
+20 r (Sun) s
+21 r 119 c
+-1 r (orkstations.) s
+33 r (The) s
+164 520 p (user) s
+18 r (in) s
+0 r 118 c
+-2 r (ok) s
+-1 r (es) s
+17 r 97 c
+18 r (Unix) s
+18 r (shell) s
+18 r (script) s
+18 r (that) s
+18 r (calls) s
+18 r (ted) s
+18 r (systematically) s
+18 r (on) s
+18 r (eac) s
+-1 r 104 c
+17 r (of) s
+18 r (the) s
+164 580 p (new) s
+16 r (set) s
+16 r (of) s
+17 r (trace) s
+16 r (\014les) s
+16 r (creating) s
+16 r 97 c
+16 r (set) s
+17 r (of) s
+16 r (sequence) s
+16 r (\014les.) s
+22 r (The) s
+16 r (sequence) s
+16 r (\014les) s
+16 r (that) s
+164 640 p (are) s
+20 r (deemed) s
+20 r (to) s
+20 r 98 c
+1 r 101 c
+20 r (of) s
+20 r (acceptable) s
+20 r (qualit) s
+0 r 121 c
+19 r (are) s
+20 r (then) s
+20 r (en) s
+-1 r (tered) s
+19 r (in) s
+0 r (to) s
+19 r (the) s
+20 r (sequence) s
+164 700 p (assem) s
+0 r (bly) s
+16 r (program) s
+17 r (xdap) s
+18 r ([2]) s
+17 r (where) s
+18 r (the) s
+17 r (sequences) s
+17 r (are) s
+18 r (assem) s
+0 r (bled) s
+16 r (in) s
+0 r (to) s
+16 r (con) s
+0 r (tigs.) s
+164 761 p 80 c
+0 r (ortions) s
+14 r (of) s
+15 r (the) s
+16 r (ted) s
+15 r (trace-editor) s
+16 r (ha) s
+-1 r 118 c
+-1 r 101 c
+14 r 98 c
+2 r (een) s
+15 r (incorp) s
+2 r (orated) s
+15 r (in) s
+0 r (to) s
+14 r (the) s
+15 r (xdap) s
+16 r (\\trace) s
+164 821 p (manager,") s
+19 r (whic) s
+0 r 104 c
+18 r (is) s
+18 r (used) s
+19 r (in) s
+19 r (conjunction) s
+18 r (with) s
+19 r (the) s
+19 r (con) s
+-1 r (tig) s
+18 r (editor) s
+19 r (to) s
+19 r (view) s
+18 r (sets) s
+164 881 p (of) s
+16 r (aligned) s
+16 r (traces) s
+17 r (at) s
+16 r (sites) s
+16 r (of) s
+17 r (discrepancies) s
+16 r (in) s
+16 r (the) s
+16 r (aligned) s
+17 r (sequences.) s
+237 941 p 84 c
+-3 r (ed) s
+16 r (is) s
+17 r (also) s
+16 r (used) s
+17 r (at) s
+17 r (the) s
+17 r (stage) s
+16 r (of) s
+17 r 99 c
+0 r (ho) s
+0 r (osing) s
+17 r (oligo) s
+17 r (primers) s
+16 r (for) s
+17 r (the) s
+17 r (\\w) s
+-1 r (alking") s
+164 1001 p (stage) s
+21 r (of) s
+20 r (the) s
+21 r (sequencing) s
+20 r (pro) s
+3 r (ject.) s
+35 r (It) s
+20 r (can) s
+21 r 98 c
+1 r 101 c
+21 r (in) s
+-1 r 118 c
+-1 r (ok) s
+-1 r (ed) s
+19 r (directly) s
+21 r (from) s
+20 r (the) s
+21 r (oligo) s
+164 1062 p (selection) s
+23 r (program,) s
+24 r (osp) s
+23 r ([3],) s
+25 r (to) s
+23 r (allo) s
+0 r 119 c
+21 r (examination) s
+23 r (of) s
+23 r (the) s
+23 r (trace) s
+23 r (data) s
+23 r (in) s
+23 r (the) s
+164 1122 p (region) s
+16 r (of) s
+16 r (the) s
+17 r (primers) s
+16 r (so) s
+16 r (that) s
+17 r (in) s
+-1 r (tegrit) s
+-1 r 121 c
+15 r (of) s
+16 r (the) s
+17 r (sequence) s
+16 r (data) s
+16 r (can) s
+16 r 98 c
+2 r 101 c
+16 r 118 c
+0 r (eri\014ed.) s
+237 1182 p (Curren) s
+0 r (tly) s
+-4 r 44 c
+20 r (no) s
+20 r (other) s
+20 r (programs) s
+20 r (are) s
+20 r (kno) s
+-1 r (wn) s
+19 r (to) s
+20 r 98 c
+2 r 101 c
+20 r 97 c
+-1 r 118 c
+-2 r (ailable) s
+19 r (whic) s
+-1 r 104 c
+19 r (supp) s
+2 r (ort) s
+164 1242 p (editing) s
+18 r (of) s
+18 r (the) s
+18 r (ABI) s
+18 r (trace) s
+18 r (data.) s
+26 r 70 c
+-3 r (urther,) s
+18 r (the) s
+18 r (mo) s
+1 r (dular) s
+18 r (design) s
+18 r (of) s
+18 r (the) s
+18 r (program) s
+164 1302 p (should) s
+18 r (allo) s
+0 r 119 c
+17 r (supp) s
+2 r (ort) s
+18 r (for) s
+18 r (new) s
+18 r 116 c
+0 r (yp) s
+0 r (es) s
+19 r (of) s
+18 r (sequencing) s
+18 r (mac) s
+0 r (hines,) s
+18 r (with) s
+18 r (new) s
+18 r (data) s
+164 1363 p (formats,) s
+16 r (to) s
+16 r 98 c
+2 r 101 c
+16 r (implemen) s
+0 r (ted) s
+15 r (in) s
+16 r 97 c
+17 r (straigh) s
+-1 r (tforw) s
+-1 r (ard) s
+15 r (fashion.) s
+cmbx12.300 @sf
+164 1492 p 65 c
+-5 r 86 c
+-6 r (AILABILITY) s
+cmr12.300 @sf
+164 1585 p 84 c
+-3 r (ed) s
+18 r (is) s
+18 r (freely) s
+19 r 97 c
+0 r 118 c
+-3 r (ailable) s
+18 r (from) s
+19 r (the) s
+18 r (authors) s
+19 r (or) s
+19 r (from) s
+19 r (Ro) s
+1 r (dger) s
+19 r (Staden) s
+18 r (and) s
+19 r (Simon) s
+164 1645 p (Dear) s
+19 r (\(MR) s
+-1 r 67 c
+18 r (Lab) s
+1 r (oratory) s
+19 r (of) s
+19 r (Molecular) s
+18 r (Biology) s
+-3 r 44 c
+18 r (Hills) s
+19 r (Road,) s
+19 r (Cam) s
+0 r (bridge,) s
+18 r (UK,) s
+164 1705 p (CB2) s
+16 r (2QH\)) s
+16 r (for) s
+17 r (use) s
+16 r (on) s
+16 r (Sun) s
+17 r 119 c
+-1 r (orkstations) s
+15 r (running) s
+17 r (X-windo) s
+-1 r (ws) s
+16 r (\(or) s
+16 r (Op) s
+1 r (enLo) s
+2 r (ok\).) s
+cmbx12.300 @sf
+164 1835 p 65 c
+-1 r (CKNO) s
+-1 r (WLEDGMENTS) s
+cmr12.300 @sf
+164 1927 p (The) s
+19 r (authors) s
+19 r 119 c
+0 r (ould) s
+18 r (lik) s
+0 r 101 c
+18 r (to) s
+19 r (thank) s
+19 r (all) s
+20 r (mem) s
+-1 r 98 c
+1 r (ers) s
+19 r (of) s
+19 r (the) s
+19 r (C.) s
+20 r (elegans) s
+19 r (sequencing) s
+164 1988 p (pro) s
+3 r (ject) s
+16 r (with) s
+17 r (sp) s
+2 r (ecial) s
+16 r (thanks) s
+17 r (to) s
+17 r (the) s
+16 r (follo) s
+0 r (wing) s
+16 r 112 c
+1 r (eople:) s
+23 r (John) s
+17 r (Sulston,) s
+16 r (Bob) s
+17 r 87 c
+-3 r (a-) s
+164 2048 p (terston,) s
+16 r (Phil) s
+15 r (Green,) s
+16 r (Ric) s
+0 r 107 c
+15 r (Wilson,) s
+15 r (Ric) s
+0 r (hard) s
+15 r (Durbin,) s
+16 r (Simon) s
+15 r (Dear,) s
+16 r (and) s
+16 r (Ro) s
+1 r (dger) s
+164 2108 p (Staden) s
+13 r (for) s
+12 r (their) s
+13 r (helpful) s
+13 r (suggestions) s
+12 r (for) s
+13 r (impro) s
+0 r 118 c
+-2 r (emen) s
+-1 r (ts) s
+12 r (in) s
+12 r (the) s
+13 r (ted) s
+13 r (in) s
+0 r (terface) s
+11 r (and) s
+164 2168 p (for) s
+18 r (their) s
+19 r (parts) s
+18 r (in) s
+18 r (the) s
+19 r (dev) s
+-1 r (elopmen) s
+-1 r 116 c
+17 r (of) s
+19 r (ted.) s
+28 r (This) s
+18 r 119 c
+0 r (ork) s
+17 r 119 c
+0 r (as) s
+17 r (supp) s
+1 r (orted) s
+19 r 98 c
+-1 r 121 c
+18 r (the) s
+164 2228 p (Medical) s
+16 r (Researc) s
+0 r 104 c
+15 r (Council) s
+16 r (and) s
+17 r (NIH) s
+16 r (gran) s
+0 r 116 c
+15 r (R01-HG00136.) s
+cmbx12.300 @sf
+164 2358 p (REFERENCES) s
+cmr12.300 @sf
+164 2451 p (1.) s
+22 r 87 c
+-3 r (aterston,) s
+15 r (R.,) s
+16 r (Sulston,) s
+16 r (J.,) s
+17 r (et) s
+16 r (al.) s
+22 r (\(1991\),) s
+16 r (in) s
+16 r (preparation.) s
+961 2599 p 52 c
+@eop
+3 @bop0
+cmr12.300 @sf
+[<4040201010100808080878F8F8F870000000000000000000000070F8F8F870> 8 31 -4 10 13.548] 59 @dc
+[<FFFFFFE00F8003E0078000E00780007007800030078000300780001007800010078000100780000807800008078020080780
+  20000780200007802000078060000780E00007FFE0000780E000078060000780200007802000078020000780202007800020
+  0780002007800020078000600780004007800040078000C0078001C00F8007C0FFFFFFC0> 32 34 -2 0 33.185] 69 @dc
+[<7FF1FFCFFE07001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E0
+  07001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C00E007001C01E0
+  FFFFFFFFE007001C000007001C000007001C000007001C000007001C000007001C000007001C000007001C00C007003C01E0
+  03803E01E001801E00E000E00B0040007031C080000FC07F00> 40 35 0 0 40.645] 14 @dc
+3 @bop1
+cmr12.300 @sf
+164 307 p (program) s
+15 r (has) s
+15 r 98 c
+1 r (een) s
+15 r (in) s
+0 r 118 c
+-2 r (ok) s
+-1 r (ed.) s
+20 r (Other) s
+15 r (parameters) s
+15 r (whic) s
+0 r 104 c
+14 r (the) s
+15 r (user) s
+15 r (ma) s
+-1 r 121 c
+14 r (sp) s
+2 r (ecify) s
+15 r (on) s
+164 367 p (the) s
+15 r (command) s
+15 r (line) s
+15 r (include:) s
+21 r (the) s
+15 r (output) s
+15 r (\014le) s
+16 r (name;) s
+15 r 97 c
+15 r (base) s
+15 r 112 c
+2 r (osition) s
+15 r (or) s
+15 r (sequence) s
+164 428 p (string) s
+17 r (on) s
+18 r (whic) s
+0 r 104 c
+16 r (the) s
+17 r (trace) s
+18 r (is) s
+17 r (to) s
+18 r 98 c
+1 r 101 c
+18 r (cen) s
+-1 r (tered;) s
+17 r 97 c
+18 r (default) s
+17 r (trace) s
+18 r (magni\014cation;) s
+18 r 97 c
+164 488 p (5') s
+17 r 118 c
+-1 r (ector) s
+16 r (sequence) s
+16 r (for) s
+17 r (automated) s
+17 r (elimination) s
+16 r (of) s
+17 r (the) s
+16 r (sequence) s
+17 r (head) s
+17 r (\(v) s
+-1 r (ector\);) s
+164 548 p (top) s
+16 r (or) s
+17 r 98 c
+1 r (ottom) s
+17 r (strand) s
+16 r (orien) s
+0 r (tation;) s
+15 r (or) s
+17 r (an) s
+-1 r 121 c
+16 r (of) s
+16 r (the) s
+16 r (usual) s
+17 r (X-windo) s
+-1 r 119 c
+16 r (parameters) s
+164 608 p (\(e.g.) s
+22 r (displa) s
+-1 r 121 c
+-4 r 44 c
+15 r (geometry) s
+-3 r (...\).) s
+237 668 p (The) s
+21 r (graphics) s
+21 r (displa) s
+-1 r 121 c
+20 r (\(Figure) s
+21 r (1\)) s
+21 r (consists) s
+21 r (of) s
+21 r (the) s
+21 r (con) s
+-1 r (trol) s
+20 r (panel,) s
+22 r (the) s
+21 r (base) s
+164 729 p 112 c
+1 r (osition) s
+12 r (information,) s
+12 r (the) s
+11 r (original) s
+12 r (and) s
+11 r (edited) s
+12 r (sequence) s
+11 r (data,) s
+12 r (and) s
+12 r (the) s
+11 r (graphical) s
+164 789 p (represen) s
+0 r (tation) s
+16 r (of) s
+18 r (the) s
+18 r (trace.) s
+27 r (The) s
+17 r (user) s
+18 r (ma) s
+0 r 121 c
+17 r 98 c
+1 r (egin) s
+18 r 98 c
+0 r 121 c
+16 r (using) s
+18 r (the) s
+18 r (con) s
+0 r (trol) s
+17 r (panel) s
+164 849 p (INPUT) s
+11 r (button) s
+12 r (to) s
+11 r (input) s
+12 r 97 c
+11 r (new) s
+11 r (trace) s
+12 r (\014le) s
+11 r (at) s
+12 r (whic) s
+-1 r 104 c
+11 r (time) s
+11 r (the) s
+11 r (user) s
+12 r (selects) s
+11 r (whether) s
+164 909 p (to) s
+21 r (view) s
+21 r (the) s
+21 r (sequence) s
+21 r (and) s
+21 r (trace) s
+21 r (in) s
+21 r (top) s
+21 r (or) s
+21 r 98 c
+1 r (ottom) s
+21 r (strand) s
+21 r (orien) s
+-1 r (tation.) s
+35 r (The) s
+164 969 p (trace) s
+18 r (\014le) s
+18 r (is) s
+18 r (displa) s
+0 r 121 c
+-2 r (ed) s
+17 r (and,) s
+19 r (if) s
+18 r 97 c
+18 r (5') s
+18 r 118 c
+0 r (ector) s
+17 r (sequence) s
+18 r (has) s
+18 r 98 c
+1 r (een) s
+18 r (sp) s
+2 r (eci\014ed) s
+18 r (on) s
+18 r (the) s
+164 1029 p (command) s
+17 r (line,) s
+18 r (the) s
+17 r (program) s
+18 r (attempts) s
+17 r (to) s
+17 r (select) s
+18 r 97 c
+17 r (cuto\013) s
+17 r 112 c
+2 r (oin) s
+-1 r 116 c
+17 r (corresp) s
+1 r (onding) s
+164 1090 p (to) s
+16 r (the) s
+16 r 118 c
+-1 r (ector) s
+15 r (sequence) s
+16 r (at) s
+16 r (the) s
+15 r (\\head") s
+16 r (of) s
+16 r (the) s
+16 r (trace) s
+15 r (\014le.) s
+22 r (The) s
+16 r (bases) s
+15 r 98 c
+2 r (ey) s
+-1 r (ond) s
+15 r (the) s
+164 1150 p (\\cuto\013) s
+3 r 34 c
+17 r 112 c
+1 r (oin) s
+0 r 116 c
+15 r (are) s
+16 r (displa) s
+-1 r 121 c
+-1 r (ed) s
+15 r (on) s
+16 r 97 c
+16 r (shaded) s
+16 r (bac) s
+0 r (kground.) s
+21 r (The) s
+16 r (user) s
+16 r (ma) s
+-1 r 121 c
+16 r (mo) s
+1 r (dify) s
+164 1210 p (the) s
+18 r (cuto\013) s
+18 r 112 c
+1 r (osition) s
+18 r 98 c
+0 r 121 c
+17 r (clic) s
+-1 r (king) s
+17 r (on) s
+18 r (the) s
+18 r (\\Adj) s
+18 r (left) s
+18 r (cut") s
+18 r (button) s
+17 r (and) s
+18 r (clic) s
+0 r (king) s
+17 r (on) s
+164 1270 p (the) s
+19 r 112 c
+2 r (osition) s
+19 r (of) s
+20 r (the) s
+19 r (desired) s
+19 r (cuto\013.) s
+31 r (Similarly) s
+-3 r 44 c
+20 r (the) s
+19 r (user) s
+19 r (ma) s
+0 r 121 c
+18 r (adjust) s
+20 r (the) s
+19 r (righ) s
+0 r 116 c
+164 1330 p (cuto\013) s
+17 r (of) s
+16 r (the) s
+17 r (sequence) s
+17 r (\(c) s
+-1 r (hosen) s
+16 r 98 c
+0 r 121 c
+15 r (starting) s
+17 r (at) s
+16 r (the) s
+17 r (5') s
+17 r (end) s
+16 r (of) s
+17 r (the) s
+17 r (sequence) s
+16 r (and) s
+164 1391 p (lo) s
+1 r (oking) s
+20 r (for) s
+19 r (the) s
+19 r (\014rst) s
+20 r 111 c
+1 r (ccurrence) s
+20 r (when) s
+19 r 50 c
+19 r (out) s
+20 r (of) s
+19 r 53 c
+19 r (bases) s
+20 r (are) s
+19 r ('N'\)) s
+19 r 98 c
+0 r 121 c
+18 r (scrolling) s
+164 1451 p (along) s
+22 r (the) s
+21 r (sequence) s
+22 r (to) s
+22 r (that) s
+21 r 112 c
+2 r (oin) s
+-1 r (t,) s
+22 r (clic) s
+0 r (king) s
+20 r (on) s
+22 r (the) s
+22 r (\\Adj) s
+21 r (righ) s
+0 r 116 c
+21 r (cut") s
+21 r (button,) s
+164 1511 p (and) s
+16 r (clic) s
+0 r (king) s
+15 r (on) s
+17 r (the) s
+16 r (appropriate) s
+17 r (base.) s
+22 r (Automation) s
+16 r (of) s
+17 r (the) s
+16 r (\\cuto\013) s
+4 r 34 c
+16 r (pro) s
+1 r (cess) s
+17 r (is) s
+164 1571 p (optional;) s
+16 r (the) s
+16 r (user) s
+17 r (ma) s
+-1 r 121 c
+16 r (compile) s
+16 r (the) s
+16 r (program) s
+16 r (with) s
+17 r (that) s
+16 r (feature) s
+16 r (turned) s
+16 r (\\o\013.") s
+237 1631 p (Clic) s
+0 r (king) s
+13 r (on) s
+15 r (the) s
+14 r (\\Edit) s
+15 r (seq") s
+15 r (button) s
+14 r (allo) s
+0 r (ws) s
+13 r (the) s
+15 r (user) s
+15 r (to) s
+14 r (en) s
+0 r (ter) s
+13 r (the) s
+15 r (edit) s
+14 r (mo) s
+2 r (de.) s
+164 1692 p (The) s
+14 r (\\Searc) s
+0 r (h") s
+14 r (button) s
+14 r (can) s
+15 r 98 c
+1 r 101 c
+15 r (used) s
+14 r (to) s
+15 r (skip) s
+14 r (from) s
+15 r (\\problem") s
+14 r (to) s
+15 r (\\problem") s
+14 r (\(i.e.,) s
+164 1752 p (am) s
+0 r (biguit) s
+-2 r 121 c
+17 r (to) s
+17 r (am) s
+-1 r (biguit) s
+-1 r (y\)) s
+16 r (or) s
+18 r (to) s
+17 r (lo) s
+1 r (ok) s
+18 r (for) s
+17 r (runs) s
+17 r (of) s
+17 r (iden) s
+0 r (tical) s
+16 r (bases) s
+18 r (\(e.g.,) s
+17 r (TTTT\)) s
+164 1812 p (whic) s
+0 r 104 c
+15 r (are) s
+16 r (often) s
+16 r (mis-called) s
+17 r 98 c
+-1 r 121 c
+16 r (the) s
+16 r (mac) s
+-1 r (hine) s
+16 r (soft) s
+-1 r 119 c
+-1 r (are.) s
+237 1872 p (Bases) s
+20 r (can) s
+20 r 98 c
+1 r 101 c
+20 r (inserted,) s
+21 r (deleted,) s
+21 r (or) s
+20 r (replaced) s
+20 r (as) s
+20 r (with) s
+20 r (an) s
+0 r 121 c
+19 r (ordinary) s
+20 r 119 c
+-1 r (ord-) s
+164 1932 p (pro) s
+1 r (cessor.) s
+26 r (In) s
+18 r (di\016cult-to-read) s
+17 r (areas,) s
+18 r (the) s
+18 r (trace) s
+18 r (ma) s
+-1 r 121 c
+17 r 98 c
+1 r 101 c
+18 r 118 c
+-1 r (ertically) s
+17 r (or) s
+18 r (horizon-) s
+164 1992 p (tally) s
+23 r (scaled) s
+22 r 98 c
+0 r 121 c
+22 r (dragging) s
+23 r (or) s
+22 r (clic) s
+0 r (king) s
+22 r (on) s
+23 r (the) s
+22 r (magni\014cation) s
+23 r (scroll) s
+23 r (bar) s
+23 r (or) s
+22 r 98 c
+0 r 121 c
+164 2053 p (clic) s
+0 r (king) s
+17 r (on) s
+18 r (the) s
+18 r 118 c
+0 r (ertical) s
+17 r (scaling) s
+18 r (buttons) s
+18 r (\(\\Scale) s
+18 r (do) s
+0 r (wn",) s
+17 r (\\Scale) s
+18 r (up"\),) s
+19 r (resp) s
+1 r (ec-) s
+164 2113 p (tiv) s
+0 r (ely) s
+-4 r 46 c
+19 r (Finally) s
+-3 r 44 c
+12 r (the) s
+12 r (edited) s
+12 r (sequence) s
+12 r (is) s
+12 r (sa) s
+0 r 118 c
+-2 r (ed) s
+11 r (to) s
+12 r (an) s
+12 r (ascii) s
+12 r (\014le) s
+12 r (using) s
+13 r (the) s
+12 r (\\Output") s
+164 2173 p (button.) s
+21 r 65 c
+16 r (history) s
+16 r (of) s
+16 r (the) s
+15 r (editing) s
+16 r (session) s
+16 r (can) s
+15 r (also) s
+16 r 98 c
+1 r 101 c
+16 r (sa) s
+0 r 118 c
+-2 r (ed) s
+15 r (along) s
+16 r (with) s
+16 r (the) s
+15 r (se-) s
+164 2233 p (quence.) s
+30 r (The) s
+19 r (\\Quit") s
+18 r (button) s
+19 r (is) s
+19 r (used) s
+19 r (to) s
+19 r (exit) s
+19 r (the) s
+19 r (program.) s
+30 r (When) s
+19 r (rein) s
+-1 r 118 c
+-1 r (oking) s
+164 2293 p (ted) s
+16 r (on) s
+16 r (an) s
+15 r (edited) s
+16 r (trace) s
+16 r (\014le) s
+16 r (the) s
+15 r (edited) s
+16 r (base) s
+16 r (sequence,) s
+16 r (rather) s
+16 r (than) s
+15 r (the) s
+16 r (original) s
+164 2354 p (sequence,) s
+18 r (is) s
+17 r (sho) s
+0 r (wn) s
+16 r (in) s
+18 r (the) s
+17 r (edited) s
+18 r (base) s
+17 r (windo) s
+0 r (w.) s
+24 r (The) s
+18 r (user) s
+17 r (ma) s
+0 r 121 c
+17 r (in) s
+-1 r 118 c
+-1 r (ok) s
+-2 r 101 c
+17 r (ted) s
+17 r 98 c
+0 r 121 c
+164 2414 p (calling) s
+16 r (in) s
+16 r (an) s
+0 r 121 c
+15 r (one) s
+17 r (of) s
+16 r (the) s
+16 r (previous) s
+16 r (editing) s
+17 r (sessions.) s
+961 2599 p 51 c
+@eop
+2 @bop0
+cmbx12.300 @sf
+[<FFFF83FFFEFFFF83FFFE07F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC0
+  07F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007FFFFFFC007FFFFFFC007F0001FC0
+  07F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC007F0001FC0
+  07F0001FC007F0001FC0FFFF83FFFEFFFF83FFFE> 40 34 -2 0 43.816] 72 @dc
+[ 300 ] /cmti12.300 @newfont
+cmti12.300 @sf
+[<00FE0000000381C0000006003000001C000800001800040000380002000070000100007000008000F000008000E000004000
+  E000004000E000002000E000002000E000000000F000000000F000000000F000000000F000000000F0000000007800000000
+  780000000078000000003C000008003C000004001E000004000E000004000F000004000700000E000380000E0001C0000E00
+  00E0000E000070001F000038002700000E006300000380810000007F0080> 40 36 -7 1 34.869] 67 @dc
+[<07C000187000301800700C00700E00700700F00780F00380F003C0F003C07801E07801E07801E03801E03C01E01C01E00E01
+  C00701C003818001C300007E00> 24 21 -5 0 24.906] 111 @dc
+[<3003001E00700700310038038030803803807080380380704038038038401C01C038201C01C01C001C01C01C001C01C01C00
+  0E00E00E000E00E00E000E00E00E000E00E00E00870070070087007007008780780700878078070047606606002610C10C00
+  1C0F80F800> 40 21 -5 0 39.850] 109 @dc
+[<FFC0001C00001C00001C00000E00000E00000E00000E0000070000070000071E0007238003C1C003C0E003807003807001C0
+  7801C03C01C03C01C03C00E01E00E01E00E01E00E01E08701E08700E08700E08780C04741C02621801C1F0> 24 31 -1 10 24.906] 112 @dc
+[<07C3C00C26201C1E201C0E10180E101C0E101C07081C07001C07001C07000E03800E03800E03800703808701C08701C08381
+  C04381C04380E02300E01E0060> 24 21 -5 0 26.152] 117 @dc
+[<1E003100708070407020702038103800380038001C001C001C001C000E000E000E000E0007000700FFF80700038003800380
+  038001C001C001C001C000C0> 16 31 -4 0 16.189] 116 @dc
+[<1C00320071007080708070803840380038001C001C001C000E000E008700870087004300430023001C000000000000000000
+  000000000000000001C001C001E000C0> 16 33 -5 0 14.944] 105 @dc
+[<3003C0700620380610380E10380E083807081C07041C03801C03801C03800E01C00E01C00E01C00E01C08700E08700E08780
+  E08780E04740C02631C01C0F00> 24 21 -5 0 27.397] 110 @dc
+[<3F800060E000F03000F01800701C00000E00000E00000E0000070000070001E700061700060B800E07800E03801E03801E01
+  C01E01C01E01C01E01C00F00E00F00E00F00E00700E007807003807001C07001C07000E0B80030B8001F18> 24 31 -2 10 22.416] 103 @dc
+[<FFFFF000000F801C00000780070000078001C000078000E000078000700003C000380003C0003C0003C0001C0003C0000E00
+  01E0000F0001E000070001E000078001E000078000F00003C000F00003C000F00003C000F00003C000780001E000780001E0
+  00780001E000780001E0003C0001E0003C0001E0003C0001E0003C0001C0001E0001C0001E0001C0001E000380001E000380
+  000F000700000F000E00000F003C0000FFFFF000> 40 34 -3 0 36.783] 68 @dc
+[<0F80306070186004E002E002E000E000E000E000F000F000FFE0F018780438023C021C020E02038400F8> 16 21 -6 0 22.416] 101 @dc
+[<1FC000203000400800E00400F00600F00600700700000700000F00003E0003FE0007FC000FF0000F00000C00000C03000C03
+  8004018002008001830000FC00> 24 21 -3 0 19.925] 115 @dc
+[<0F0780308C40305C40703C20701C20F01C20F00E10F00E00F00E00F00E007807007807007807003807003C03801C03800E03
+  800E03800705C00185C000F8C0> 24 21 -5 0 24.906] 97 @dc
+[<0F0780308C40305C40703C20701C20F01C20F00E10F00E00F00E00F00E007807007807007807003807003C03801C03800E03
+  800E03800705C00185C000F9C00001C00000E00000E00000E00000E000007000007000007000007000003800003800003800
+  03F800003C> 24 35 -5 0 24.906] 100 @dc
+[<FFF8000F800007800007800007800007800003C00003C00003C00003C00001E00001E00001E00001E00000F00000F00000F0
+  0000F000007800007800007800007800003C00003C00003C00003C00001E00001E00001E00001E00000F00000F00000F0000
+  FFF8> 24 34 -3 0 18.772] 73 @dc
+[<38006400E200E200E200E200710070007000700038003800380038001C001C001C001C000E000E000E000E00070007000700
+  0700038003800380038001C001C001C01FC001E0> 16 35 -4 0 12.453] 108 @dc
+[<E0F0F8F870> 8 5 -6 0 14.944] 46 @dc
+[<00FE0000000381C1000006002300001C0013800018000F800038000780007000078000700003C000F00003C000E00003C000
+  E00003C000E00001E000E00001E000E00001E000F0003FFC00F000000000F000000000F000000000F0000000007800000000
+  780000000078000000003C000008003C000004001E000004000E000004000F000004000700000E000380000E0001C0000E00
+  00E0000E000070001F000038002700000E006300000380810000007F0080> 40 36 -7 1 37.694] 71 @dc
+[<3000007000003800003800003800003800001C00001C00001C00001C00000E00000E00000E00000E00008700008701808703
+  C08783C04741C02620801C1F00> 24 21 -5 0 20.548] 114 @dc
+[<6003C0E00620700610700E10700E087007083807043803803803803803801C01C01C01C01C01C01C01C00E00E00E00E00F00
+  E00F00E007C0C0072180071F0007000003800003800003800003800001C00001C00001C00001C00000E00000E00000E0000F
+  E00000F000> 24 35 -3 0 24.906] 104 @dc
+[<0FC000183000300C00700200700100F00100F00000F00000F00000F000007800007800007800003800003C00001C07800E07
+  8007038003018001C100007E00> 24 21 -5 0 22.416] 99 @dc
+[<3C00000062000000F3000000F18000007180000001C0000001C0000000C0000000E0000000E0000000E0000000E000000070
+  0000007000000070000000700000007000000038000000380000003800000038000000380000001C0000001C0000001C0000
+  001C0000001C0000000E0000000E0000000E000000FFF000000E000000070000000700000007000000070000000700000003
+  800000038000000380000001860000018F000000CF000000470000003E00> 32 45 2 10 14.944] 102 @dc
+cmr12.300 @sf
+[<7FF3FF8007003800070038000700380007003800070038000700380007003800070038000700380007003800070038000700
+  380007003800070038000700380007003800070038000700380007003800FFFFF80007003800070038000700380007003800
+  0700380007003800070038000700380007003800038078000180780000C0780000703800001FD800> 32 35 0 0 27.097] 13 @dc
+[<01800003C00003C00003C00003C00003C00003C00003C00003C00001C00001C00001C00001C00000C00000C00000E0000060
+  00006000006000002000003000001000000800000800000400800200800200800100C001004000807FFFC07FFFC07FFFE060
+  0000400000> 24 35 -3 1 24.387] 55 @dc
+2 @bop1
+cmr12.300 @sf
+164 307 p (in) s
+0 r 118 c
+-3 r (aluable.) s
+35 r 84 c
+-3 r (ed) s
+20 r (\(a) s
+21 r 84 c
+-3 r (race-EDitor\)) s
+21 r 119 c
+-1 r (as) s
+20 r (dev) s
+0 r (elop) s
+0 r (ed) s
+21 r (to) s
+22 r (\014ll) s
+21 r (this) s
+21 r (role) s
+21 r (in) s
+21 r (the) s
+21 r (C.) s
+164 367 p (elegans) s
+16 r (genome) s
+16 r (sequencing) s
+17 r (pro) s
+2 r (ject) s
+17 r ([1].) s
+cmbx12.300 @sf
+164 497 p (METHODS) s
+cmti12.300 @sf
+164 590 p (Computing) s
+15 r (Design) s
+14 r (and) s
+15 r (Implementation.) s
+cmr12.300 @sf
+21 r (When) s
+13 r (designing) s
+13 r (ted,) s
+14 r 119 c
+0 r 101 c
+12 r (had) s
+13 r 97 c
+14 r 110 c
+-1 r (um-) s
+164 650 p 98 c
+1 r (er) s
+11 r (of) s
+11 r (sp) s
+2 r (eci\014c) s
+11 r (computing) s
+11 r (goals) s
+11 r (in) s
+11 r (mind) s
+11 r (including) s
+11 r 112 c
+1 r (ortabilit) s
+0 r 121 c
+10 r (and) s
+11 r (adaptabilit) s
+0 r 121 c
+-4 r 46 c
+164 710 p 70 c
+-3 r (or) s
+14 r 112 c
+1 r (ortabilit) s
+0 r 121 c
+-4 r 44 c
+14 r 119 c
+-1 r 101 c
+14 r 99 c
+0 r (hose) s
+14 r (to) s
+15 r (write) s
+15 r (ted) s
+14 r (in) s
+15 r (ANSI) s
+15 r 67 c
+15 r (using) s
+15 r (the) s
+15 r 88 c
+14 r (windo) s
+0 r (wing) s
+14 r (sys-) s
+164 770 p (tem) s
+17 r (and) s
+17 r (the) s
+17 r (Xa) s
+-1 r 119 c
+16 r (to) s
+1 r (olkit.) s
+24 r 88 c
+17 r (pro) s
+-1 r (vides) s
+16 r (basic) s
+17 r (capabilities) s
+17 r (for) s
+17 r (the) s
+17 r (creation) s
+17 r (and) s
+164 830 p (use) s
+18 r (of) s
+18 r (windo) s
+0 r (ws,) s
+17 r (and) s
+18 r (the) s
+18 r (to) s
+1 r (olkit) s
+18 r (con) s
+0 r (tains) s
+17 r 97 c
+18 r 110 c
+0 r (um) s
+-2 r 98 c
+1 r (er) s
+18 r (of) s
+17 r (pre-pac) s
+0 r 107 c
+-3 r (aged) s
+17 r (comp) s
+2 r (o-) s
+164 891 p (nen) s
+0 r (ts,) s
+19 r (suc) s
+-1 r 104 c
+18 r (as) s
+20 r (the) s
+19 r (\\sliders") s
+19 r (used) s
+19 r (for) s
+19 r (scrolling.) s
+31 r 88 c
+19 r (also) s
+19 r (allo) s
+0 r (ws) s
+18 r (site,) s
+20 r (user) s
+19 r (and) s
+164 951 p 112 c
+1 r (er-run) s
+19 r (defaults) s
+18 r (to) s
+19 r 98 c
+1 r 101 c
+19 r (set.) s
+28 r (Adaptabilit) s
+0 r 121 c
+17 r (is) s
+19 r (also) s
+18 r (an) s
+18 r (imp) s
+2 r (ortan) s
+-1 r 116 c
+18 r (goal) s
+18 r (since) s
+19 r 119 c
+0 r 101 c
+164 1011 p (are) s
+18 r (pro) s
+0 r (viding) s
+17 r 97 c
+18 r (new) s
+18 r (function) s
+18 r (to) s
+18 r (researc) s
+-1 r 104 c
+17 r (groups) s
+18 r (who) s
+18 r (are) s
+18 r (constan) s
+0 r (tly) s
+17 r (adding) s
+164 1071 p (new) s
+16 r (requiremen) s
+0 r (ts.) s
+237 1131 p (St) s
+0 r (ylistically) s
+-4 r 44 c
+21 r 119 c
+0 r 101 c
+20 r (ha) s
+0 r 118 c
+-2 r 101 c
+20 r (follo) s
+0 r 119 c
+-1 r (ed) s
+20 r (an) s
+21 r (\\Abstract) s
+21 r (Data) s
+21 r 84 c
+0 r (yp) s
+0 r (e") s
+22 r (discipline.) s
+36 r (In) s
+164 1192 p (this) s
+20 r (discipline,) s
+20 r 97 c
+20 r (program) s
+20 r (is) s
+19 r (split) s
+20 r (in) s
+0 r (to) s
+18 r 97 c
+20 r 110 c
+0 r (um) s
+-2 r 98 c
+1 r (er) s
+19 r (of) s
+20 r (mo) s
+1 r (dules) s
+20 r (whic) s
+0 r 104 c
+18 r (pro) s
+0 r (vide) s
+164 1252 p (separate,) s
+15 r 119 c
+0 r (ell-de\014ned) s
+14 r (functions.) s
+22 r 87 c
+-3 r 101 c
+14 r (separate) s
+15 r (the) s
+15 r (in) s
+0 r (terface) s
+14 r (of) s
+16 r 97 c
+15 r (mo) s
+1 r (dule) s
+15 r (from) s
+164 1312 p (its) s
+15 r (implemen) s
+0 r (tation.) s
+20 r 70 c
+-3 r (or) s
+15 r (example,) s
+15 r 97 c
+16 r (uni\014ed) s
+15 r (in) s
+0 r (ternal) s
+15 r (sequence) s
+15 r (format) s
+15 r (is) s
+16 r (used.) s
+164 1372 p (This) s
+19 r (can) s
+20 r (store) s
+19 r 97 c
+19 r 118 c
+-1 r (arying) s
+18 r (amoun) s
+0 r 116 c
+18 r (of) s
+19 r (information.) s
+31 r (Ho) s
+0 r 119 c
+-2 r (ev) s
+-1 r (er,) s
+19 r (there) s
+19 r (is) s
+20 r 97 c
+19 r (clear) s
+164 1432 p (and) s
+17 r (simple) s
+16 r (in) s
+0 r (terface) s
+15 r 98 c
+0 r 121 c
+16 r (whic) s
+-1 r 104 c
+16 r (the) s
+16 r (rest) s
+17 r (of) s
+16 r (the) s
+17 r (program) s
+17 r (accesses) s
+16 r (this) s
+17 r (mo) s
+1 r (dule.) s
+164 1492 p (Suc) s
+0 r 104 c
+22 r 97 c
+23 r (st) s
+-1 r (yle) s
+23 r (is) s
+23 r (not) s
+23 r 119 c
+-1 r (ell) s
+23 r (supp) s
+1 r (orted) s
+23 r 98 c
+0 r 121 c
+22 r (C,) s
+23 r (but) s
+23 r (its) s
+23 r (adoption) s
+23 r (has) s
+23 r 98 c
+2 r (een) s
+23 r 118 c
+0 r (ery) s
+164 1553 p (successful.) s
+21 r (The) s
+15 r (addition) s
+15 r (of) s
+14 r (new) s
+15 r (sequencing) s
+15 r (mac) s
+-1 r (hines,) s
+14 r (and) s
+15 r (th) s
+0 r (us) s
+14 r (new) s
+14 r (external) s
+164 1613 p (data) s
+18 r (formats,) s
+17 r (ma) s
+0 r 121 c
+17 r (cause) s
+17 r (some) s
+18 r 99 c
+-1 r (hanges) s
+17 r (in) s
+18 r (the) s
+17 r (in) s
+0 r (ternal) s
+16 r (represen) s
+0 r (tation) s
+17 r (of) s
+17 r (the) s
+164 1673 p (sequence) s
+16 r (but) s
+16 r (should) s
+17 r (not) s
+16 r (a\013ect) s
+16 r (the) s
+17 r (rest) s
+16 r (of) s
+16 r (the) s
+16 r (program.) s
+237 1733 p 84 c
+-3 r (ed) s
+17 r (accepts) s
+17 r 97 c
+18 r (large) s
+18 r 110 c
+-1 r (um) s
+-1 r 98 c
+0 r (er) s
+18 r (of) s
+18 r (optional) s
+17 r (command) s
+18 r (line) s
+18 r (argumen) s
+-1 r (ts,) s
+17 r (man) s
+0 r 121 c
+164 1793 p (of) s
+18 r (whic) s
+0 r 104 c
+17 r (can) s
+18 r (also) s
+18 r 98 c
+1 r 101 c
+18 r (sp) s
+2 r (eci\014ed) s
+18 r (as) s
+18 r (system) s
+18 r (defaults.) s
+27 r (This) s
+18 r (supp) s
+1 r (orts) s
+19 r 97 c
+18 r (mo) s
+1 r (de) s
+18 r (of) s
+164 1854 p 119 c
+0 r (orking) s
+20 r (whereb) s
+-1 r 121 c
+20 r (ted) s
+22 r (is) s
+21 r (in) s
+-1 r 118 c
+-1 r (ok) s
+-1 r (ed) s
+20 r (not) s
+21 r (directly) s
+21 r 98 c
+-1 r 121 c
+21 r (the) s
+21 r (user) s
+21 r (but) s
+21 r (instead) s
+21 r 98 c
+-1 r 121 c
+21 r 97 c
+164 1914 p (script) s
+21 r (or) s
+21 r (another) s
+20 r (application) s
+21 r (whic) s
+0 r 104 c
+20 r (supplies) s
+20 r (argumen) s
+0 r (ts) s
+20 r (appropriate) s
+21 r (to) s
+20 r (the) s
+164 1974 p (editing) s
+16 r (task.) s
+cmti12.300 @sf
+237 2034 p (Gr) s
+-1 r (aphic) s
+-3 r (al) s
+22 r (Interfac) s
+-2 r (e.) s
+cmr12.300 @sf
+37 r 84 c
+-3 r (ed) s
+21 r (curren) s
+0 r (tly) s
+21 r (accepts) s
+21 r (data) s
+22 r (from) s
+22 r 116 c
+0 r 119 c
+-2 r 111 c
+21 r (\015uorescence) s
+164 2094 p (based) s
+18 r (sequencing) s
+19 r (mac) s
+-1 r (hines,) s
+18 r (the) s
+19 r (Pharmacia) s
+18 r (A.L.F.) s
+19 r (and) s
+18 r (the) s
+19 r (ABI) s
+18 r (373A.) s
+18 r (The) s
+164 2155 p (sequencing) s
+12 r (mac) s
+0 r (hine) s
+12 r (data) s
+12 r (consists) s
+13 r (of) s
+12 r (four) s
+13 r (traces) s
+12 r (of) s
+13 r (\015uorescence) s
+12 r (lev) s
+0 r (els) s
+12 r (together) s
+164 2215 p (with) s
+14 r (the) s
+14 r (mac) s
+0 r (hine's) s
+13 r (in) s
+-1 r (terpretation,) s
+14 r (whic) s
+0 r 104 c
+13 r (is) s
+14 r 97 c
+14 r (sequence) s
+14 r (of) s
+14 r (bases.) s
+21 r 84 c
+-3 r (ed) s
+13 r (displa) s
+-1 r (ys) s
+164 2275 p (the) s
+16 r (traces) s
+15 r (and) s
+16 r (the) s
+16 r (mac) s
+-1 r (hine-generated) s
+15 r (base) s
+16 r (list.) s
+21 r 65 c
+16 r (second,) s
+16 r (initially) s
+15 r (iden) s
+0 r (tical,) s
+164 2335 p (list) s
+16 r (of) s
+16 r (bases) s
+17 r (is) s
+16 r (pro) s
+0 r (vided) s
+15 r (for) s
+16 r (correction) s
+16 r 98 c
+0 r 121 c
+15 r (the) s
+17 r (user.) s
+237 2395 p 84 c
+-3 r (ed) s
+13 r (has) s
+13 r (an) s
+14 r 88 c
+13 r (windo) s
+0 r (ws) s
+12 r (based) s
+14 r (graphical) s
+14 r (in) s
+-1 r (terface.) s
+20 r (The) s
+13 r (trace) s
+14 r (\014le) s
+14 r (can) s
+13 r (either) s
+164 2455 p 98 c
+1 r 101 c
+12 r (input) s
+12 r (from) s
+11 r (the) s
+12 r (command) s
+12 r (line) s
+11 r (or) s
+12 r 98 c
+0 r 121 c
+10 r (clic) s
+0 r (king) s
+11 r (on) s
+11 r (the) s
+12 r (INPUT) s
+11 r (button) s
+12 r (after) s
+12 r (the) s
+961 2599 p 50 c
+@eop
+1 @bop0
+[ 622 ] /cmr10.622 @newfont
+cmr10.622 @sf
+[<FFFFC0003FFFFFC0FFFFC0003FFFFFC0FFFFC0003FFFFFC00FFC000001FFE00003F80000007FC00000F00000007FC0000060
+  0000007F800000600000007F80000070000000FF80000030000000FF00000030000000FF00000018000001FE000000180000
+  01FE00000018000001FE0000000C000003FC0000000C000003FC0000000C000007FC00000006000007F800000006000007F8
+  00000007FFFFFFF800000003FFFFFFF000000003FFFFFFF00000000380001FF00000000180001FE00000000180001FE00000
+  0001C0003FE000000000C0003FC000000000C0003FC000000000E0007F800000000060007F800000000060007F8000000000
+  3000FF00000000003000FF00000000003000FF00000000001801FE00000000001801FE00000000001803FE00000000000C03
+  FC00000000000C03FC00000000000C07FC00000000000607F800000000000607F80000000000070FF80000000000030FF000
+  00000000030FF00000000000039FF00000000000019FE00000000000019FE0000000000000FFC0000000000000FFC0000000
+  000000FFC00000000000007F800000000000007F800000000000007F800000000000003F000000000000003F000000000000
+  003F000000000000001E000000000000001E000000000000001E000000000000000C00000000> 64 61 -3 0 64.569] 65 @dc
+[<0003F000000FF800001F0C00003E0600007C030000FC030000F8030000F8018001F8018001F8018001F8018001F8018001F8
+  018001F8018001F8018001F8018001F8018001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F80000
+  01F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F80000FFFFFF00FFFFFF001FFF
+  FF0007F8000003F8000001F8000000F8000000F8000000780000007800000038000000380000003800000038000000180000
+  00180000001800000018000000180000> 32 54 -2 1 33.480] 116 @dc
+[<FFFFF800FFFFF800FFFFF80003FC000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8
+  000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F8000001F80000
+  01FC000001FC000001FC000001FC000001F6004001F601F001F603F801F303F807F183F8FFF0C1F8FFF070F0FFF03FE003F0
+  0F80> 32 38 -2 0 33.719] 114 @dc
+[<007F800F8003FFE01FE00FE0703FF01F80187E383F000C7C187F0006F80C7E0006F80CFE0003F80CFE0003F80CFE0001F80C
+  FE0001F80CFE0001F80C7E0001F80C7F0001F8007F0001F8003F8001F8001F8001F8000FC001F80007F001F80003F801F800
+  00FF01F800003FF9F8000007FFF80000001FF800000001F800000001F800000001F800000001F8000F0001F8001F8001F800
+  3FC001F0003FC003F0003FC003F0003FC003E0001F8007C0001E000F80000E001F000007C07E000001FFF80000003FC00000> 40 40 -4 1 43.046] 97 @dc
+[<0003FC0000001FFF0000007E03C00000F800E00001F000300003E000180007C0000C000FC00006001F800006001F80000300
+  3F000003003F000003007F000000007F000000007E00000000FE00000000FE00000000FE00000000FE00000000FE00000000
+  FE00000000FE00000000FE00000000FE000000007E000000007E000000007F000000007F000000003F000078003F0000FC00
+  1F8001FE000F8001FE000FC001FE0007C001FE0003E000FC0001F0003C0000F8003800003E01F000001FFFC0000003FE0000> 40 40 -3 1 38.263] 99 @dc
+[<0001FE0000000FFF8000003F01E000007800700001F000180003E0000C0007C00006000FC00003000F800003001F80000180
+  3F000001803F000001807F000000007F000000007E000000007E00000000FE00000000FE00000000FE00000000FE00000000
+  FE00000000FFFFFFFF80FFFFFFFF80FE00001F80FE00001F807E00001F807E00001F807F00001F003F00001F003F00001F00
+  1F00003F001F80003E000F80003E0007C0007C0003E000780001E000F00000F801E000007E07C000001FFF00000003FC0000> 40 40 -2 1 38.263] 101 @dc
+[<0003FC07E000001FFF07FFC0003E0387FFC000F800E7FFC001F00077F80003E0001FE00007C0001FE0000F80000FE0001F80
+  000FE0001F000007E0003F000007E0003F000007E0007F000007E0007E000007E0007E000007E000FE000007E000FE000007
+  E000FE000007E000FE000007E000FE000007E000FE000007E000FE000007E000FE000007E000FE000007E0007E000007E000
+  7E000007E0007F000007E0003F000007E0003F000007E0001F800007E0000F800007E0000FC0000FE00007C0000FE00003E0
+  001FE00001F00037E00000FC0067E000003F03C7E000000FFF87E0000001FC07E00000000007E00000000007E00000000007
+  E00000000007E00000000007E00000000007E00000000007E00000000007E00000000007E00000000007E00000000007E000
+  00000007E00000000007E00000000007E00000000007E00000000007E00000000007E0000000001FE000000003FFE0000000
+  03FFE000000003FFE00000000007E000> 48 61 -3 1 47.829] 100 @dc
+[<FFFFE0FFFFE0FFFFE003FC0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F8
+  0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80007
+  F8007FF8007FF8007FF80001F80000000000000000000000000000000000000000000000000000000000000000000001C000
+  03F00007F8000FF8000FF8000FF80007F80003F00001C000> 24 58 -1 0 23.914] 105 @dc
+[<C03FC000C0FFF800E3C03C00F7000E00FC000700FC000380F80003C0F00001C0F00001E0E00001E0E00001E0E00001E0E000
+  01E0C00003E0C00007E040000FC000001FC00000FF80001FFF8001FFFF0003FFFE000FFFFC001FFFF0003FFFC0007FFC0000
+  7F800000FE000000F8000180F8000180F0000180F0000180F0000180700003807000038038000780380007801E001F800F80
+  7F8003FFE380007F8180> 32 40 -3 1 33.958] 115 @dc
+[<FFFFF0000000FFFFF0000000FFFFF000000003FC0000000001F80000000001F80000000001F80000000001F80000000001F8
+  0000000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F80FE0
+  000001F83FFC000001F8F03F000001F9C00FC00001FB0007E00001FE0003F00001FC0001F80001FC0000FC0001F80000FC00
+  01F800007E0001F800007F0001F800003F0001F800003F8001F800003F8001F800001F8001F800001FC001F800001FC001F8
+  00001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001F8001F80000
+  3F8001F800003F8001F800003F0001F800007F0001F800007E0001F800007E0001FC0000FC0001FC0000F80001FE0001F000
+  03FB0003E000FFF9C007C000FFF8F01F0000FFF83FFE000001F80FF00000> 48 55 -2 17 47.829] 112 @dc
+[<FFFFF0FFFFF0FFFFF003FC0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F8
+  0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001
+  F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F800
+  01F80001F80001F80001F80001F80007F800FFF800FFF800FFF80001F800> 24 60 -1 0 23.914] 108 @dc
+[<07C0000000001FF000000000383800000000700C000000007C0600000000FE0700000000FE0300000000FE0180000000FE01
+  800000007C00C00000001000C00000000000C000000000006000000000006000000000003000000000003000000000003000
+  000000001800000000001800000000003C00000000003C00000000003C00000000007E00000000007E0000000000FF000000
+  0000FF0000000000FF0000000001F98000000001F98000000003F9C000000003F0C000000003F0C000000007E06000000007
+  E0600000000FE0700000000FC0300000000FC0300000001F80180000001F80180000003F801C0000003F000C0000003F000C
+  0000007E00060000007E0006000000FE0007000000FC0003000000FC0003000001F80001800001F80001800003F80001C000
+  03F80003E00007FC0007F800FFFF801FFF00FFFF801FFF00FFFF801FFF00> 48 55 -2 18 45.437] 121 @dc
+[<FFFFF1FFFFE0FFFFF1FFFFE0FFFFF1FFFFE003FC0007F80001F80003F00001F80003F00001F80003F00001F80003F00001F8
+  0003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003
+  F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F000
+  01F80003F00001F80003F00001FC0003F00001FC0003F00001FC0003F00001FE0003F00001FB0003E00001FB0007E00007F9
+  8007C000FFF8E00F8000FFF8781F0000FFF81FFE000001F807F80000> 48 38 -2 0 47.829] 110 @dc
+[<0007FF8000003FFFF00001FC00FE0003E0001F000F800007C01F000003E03E000001F07C000000F87C000000F8F80000007C
+  F80000007CF80000007CF80000007CF80000007C780000007C7C000000F83C000000F83E000001F81F000007F00780003FE0
+  03FFFFFFC000FFFFFF8001FFFFFF0003FFFFFC0007FFFFC00007C00000000F000000000F000000000E000000000E00000000
+  0E0000000006000000000607F00000073FFE0000037C1F000001F007800001E003C00003C001E00007C001F0000FC001F800
+  0F8000F8001F8000FC001F8000FC001F8000FC001F8000FC001F8000FC001F8000FC001F8000FC000F8000F8000FC001F800
+  07C001F00003C001E00801E003E01C00F007F03E007C1F1E3E003FFE0FFC0007F001F8> 40 57 -2 19 43.046] 103 @dc
+[<0001FE0000000FFFC000003F03F00000F8007C0001F0003E0003E0001F0007C0000F800F800007C01F800007E01F000003E0
+  3F000003F03F000003F07F000003F87E000001F87E000001F8FE000001FCFE000001FCFE000001FCFE000001FCFE000001FC
+  FE000001FCFE000001FCFE000001FCFE000001FC7E000001F87E000001F87E000001F83F000003F03F000003F03F000003F0
+  1F000003E00F800007C00F800007C007C0000F8003E0001F0001F0003E0000F8007C00003F03F000000FFFC0000001FE0000> 40 40 -2 1 43.046] 111 @dc
+[<FFFFF0FFFFF0FFFFF0FFFFF0FFFFF0FFFFF0FFFFF0FFFFF0FFFFF003FC0003FC0003FC0001F80001F80001F80001F80001F8
+  0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001
+  F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F800
+  01F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F8
+  0001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001F80001FC0001FC0001
+  F80001FC0001FC0001F80001FE0001FE0001F80001FE0001FE0001F80001FB0001F30001F00001FB0003F30003F00007F9C0
+  03E1C003E000FFF8E007C0E007C000FFF83C0F803C0F8000FFF81FFF001FFF000001F803F80003F80000> 72 38 -2 0 71.743] 109 @dc
+[<7FFFFC007FFFFC007FFFFC0001FE000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC
+  000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC0000
+  00FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC0000FFFFFC00FFFFFC00FFFFFC0000FC
+  000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC000000FC0000
+  00FC0000007C01E0007E03F0007E07F8003E07F8001F07F8000F83F8000781F00003E0E00001FFC000003F00> 32 61 -1 0 26.306] 102 @dc
+[<7FFFF8FFFFF07FFFF8FFFFF07FFFF8FFFFF001FE0003FC0000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC
+  0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001
+  F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F800
+  00FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC
+  0001F800FFFFFFFFF800FFFFFFFFF800FFFFFFFFF80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001
+  F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F80000FC0001F800
+  00FC0001F800007E0001F800007E0003F800003F0007F800001F0007F800000F8007F8000007C007F8000003F003F8000000
+  FC01F80000003FFFB800000007FE0000> 48 61 -1 0 47.829] 13 @dc
+[<0001FE03F000000FFF83FFE0001F81E3FFE0003E0073FFE0007C001BFC0000FC001BF00000F8000FF00000F8000FF00001F8
+  0007F00001F80007F00001F80007F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003
+  F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F000
+  01F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F8
+  0003F00007F8000FF000FFF801FFF000FFF801FFF000FFF801FFF00001F80003F000> 48 39 -2 1 47.829] 117 @dc
+[<00000FE0000001803FFC000001C0F03F000001E1C00FC00001E30003E00001F60001F00001FC0000F80001FC0000FC0001F8
+  00007C0001F800007E0001F800003F0001F800003F0001F800003F8001F800001F8001F800001F8001F800001FC001F80000
+  1FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001FC001F800001F80
+  01F800001F8001F800003F8001F800003F0001F800003F0001F800003E0001F800007E0001FC00007C0001FC0000F80001FE
+  0001F00001FB0003E00001F9C007C00001F8F01F000001F83FFE000001F80FF0000001F80000000001F80000000001F80000
+  000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F800000000
+  01F80000000001F80000000001F80000000001F80000000001F80000000001F80000000007F800000000FFF800000000FFF8
+  00000000FFF80000000001F800000000> 48 61 -2 1 47.829] 98 @dc
+[<000007FFFF80000007FFFF80000007FFFF800000001FE0000000000FC0000000000FC0000000000FC0000000000FC0000000
+  000FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000FC0000003F80F
+  C000001FFE0FC000007E070FC00000F801CFC00001F000CFC00003E0006FC00007C0003FC0000FC0001FC0001F80001FC000
+  1F80000FC0003F00000FC0003F00000FC0007F00000FC0007F00000FC0007E00000FC000FE00000FC000FE00000FC000FE00
+  000FC000FE00000FC000FE00000FC000FE00000FC000FE00000FC000FE00000FC000FE00000FC0007E00000FC0007F00000F
+  C0007F00000FC0003F00000FC0003F80000FC0001F80001FC0001F80001FC0000FC0001FC00007E00037C00003E00067C000
+  01F00063C00000FC00C3C000003F0381C000000FFF01C0000001FC00C000> 48 55 -3 17 45.437] 113 @dc
+[<FFFFF1FFFFE0FFFFF1FFFFE0FFFFF1FFFFE003FC0007F80001F80003F00001F80003F00001F80003F00001F80003F00001F8
+  0003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003
+  F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F00001F80003F000
+  01F80003F00001F80003F00001FC0003F00001FC0003F00001FC0003F00001FE0003F00001FB0003E00001FB0007E00001F9
+  8007C00001F8E00F800001F8781F000001F81FFE000001F807F8000001F80000000001F80000000001F80000000001F80000
+  000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F80000000001F800000000
+  01F80000000001F80000000001F80000000001F80000000001F80000000007F800000000FFF800000000FFF800000000FFF8
+  0000000001F800000000> 48 60 -2 0 47.829] 104 @dc
+[ 432 ] /cmr10.432 @newfont
+cmr10.432 @sf
+[<00FFFFFE0000FFFFFE000000FE000000007C000000007C000000007C000000007C000000007C000000007C000000007C0000
+  00007C000000007C000000007C000000007C000000007C000000007C000000007C000000007C000000007C000000007C0000
+  00007C000000007C000000007C000000007C000000007C000000007C000000007C000080007C000480007C000480007C0004
+  80007C000480007C0004C0007C000CC0007C000C40007C000840007C000860007C001870007C00387C007C00F87FFFFFFFF8
+  7FFFFFFFF8> 40 41 -2 0 43.171] 84 @dc
+[<FFF8FFF8078007800780078007800780078007800780078007800780078007800780078007800780078007800F807F807F80
+  078000000000000000000000000000000000000007000F801F801F800F800700> 16 41 -1 0 16.604] 105 @dc
+[<FFFC7FFE3FFF00FFFC7FFE3FFF00078003C001E000078003C001E000078003C001E000078003C001E000078003C001E00007
+  8003C001E000078003C001E000078003C001E000078003C001E000078003C001E000078003C001E000078003C001E0000780
+  03C001E000078003C001E000078003C001E000078003C001E000078003C001E00007C003E001E00007C003E001E00007A003
+  D001E0000FA0079003C000FF98070C038000FF860E030700000781F800FC0000> 56 26 -1 0 49.812] 109 @dc
+[<007F000001C1C000070070000E0038001E003C003C001E003C001E0078000F0078000F00F8000F80F8000F80F8000F80F800
+  0F80F8000F80F8000F80F8000F80F8000F8078000F0078000F003C001E003C001E001C001C000E0038000700700001C1C000
+  007F0000> 32 26 -2 0 29.887] 111 @dc
+[<003E0000E10001C08003C0800780400780400780400780400780400780400780400780000780000780000780000780000780
+  00078000078000078000078000078000078000078000FFFF801FFF800F800007800003800003800001800001800001800000
+  8000008000008000008000> 24 37 -1 0 23.246] 116 @dc
+[<FFFCFFFCFFFCFFFC078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  078007800780078007800780078007800780078007800780078007C0078007C0078007A0078007A0070007900E00078C1C00
+  0783F80007800000078000000780000007800000078000000780000007800000078000000780000007800000078000000780
+  00000F800000FF800000FF80000007800000> 32 42 -1 0 33.208] 104 @dc
+[<1F00000060800000F0400000F8200000F8100000F81000007008000000080000000400000004000000040000000200000002
+  0000000700000007000000070000000F8000000F8000001E4000001E4000003E6000003C2000003C20000078100000781000
+  00F8180000F0080000F0080001E0040001E0040003E0020003C0020003C0020007800100078003800F8003C0FFF00FF8FFF0
+  0FF8> 32 38 -1 12 31.548] 121 @dc
+[<0000FF8000000007FFE01000001FC0383000003E000C700000F80002F00001F00002F00003E00001F00007C00001F0000F80
+  0001F0000F800001F0001F000001F0003F000001F0003E000001F0003E000001F0007E000001F0007E000003F0007C0000FF
+  FF00FC0000FFFF00FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000FC0000000000
+  FC00000000007C00000000007E00000010007E00000010003E00000010003E00000030003F00000030001F00000030000F80
+  000070000F800000700007C00000F00003E00001F00001F00003F00000F80006F000003E000C7000001FC07830000007FFE0
+  30000000FF001000> 48 43 -3 1 46.906] 71 @dc
+[<FFFCFFFC07800780078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  07800780078007800780078007800780078007800780078007800F80FF80FF800780> 16 42 -1 0 16.604] 108 @dc
+[<007F0001C0C00700200E00101E00083C00043C00047C0000780000F80000F80000F80000F80000F80000FFFFFCF8003CF800
+  3C78003C78003C3C00383C00781C00700E00F00700E003C380007E00> 24 26 -2 0 26.566] 101 @dc
+[<83F800C40700F80180F001C0E000C0C000E0C000E0C000E08001E08001E00007C0003FC003FF800FFF003FFE007FF0007E00
+  00F80000F00040E00040E000406000C06000C03001C01C06C007F840> 24 26 -2 0 23.578] 115 @dc
+[<FFFCFFFCFFFCFFFC078007800780078007800780078007800780078007800780078007800780078007800780078007800780
+  078007800780078007800780078007800780078007800780078007C0078007C0078007A007800FA00700FF900E00FF8C1C00
+  0783F800> 32 26 -1 0 33.208] 110 @dc
+[<FFFFFFFC00FFFFFFFC0007E001FC0003E0007C0003E0001E0003E0000E0003E000060003E000060003E000020003E0000200
+  03E000020003E000020003E000010003E000010003E000010003E000010003E000000003E000000003E000000003E0000000
+  03E000000003E000000003E000000003E000000003E000000003E000000003E000000003E000000003E000000003E0000000
+  03E000000003E000000003E000000003E000000003E000000003E000000003E000000003E000000007F0000000FFFFE00000
+  FFFFE00000> 40 41 -2 0 37.359] 76 @dc
+[<07F80F001F063FC03C013C407C00F820F800F820F8007820F8007820F8007820780078207C0078003E0078001F0078000F80
+  780003E07800007FF800000078000000780000007800080078001C0078003E0078003E00F0003C00E0001001C0000E078000
+  01FC0000> 32 26 -2 0 29.887] 97 @dc
+[<FFFFFFC00000FFFFFFF8000007E000FE000003E0001F000003E0000F800003E00007C00003E00003E00003E00001F00003E0
+  0000F80003E00000F80003E000007C0003E000007C0003E000007E0003E000003E0003E000003E0003E000003F0003E00000
+  3F0003E000003F0003E000003F0003E000003F0003E000003F0003E000003F0003E000003F0003E000003F0003E000003F00
+  03E000003E0003E000003E0003E000003E0003E000007C0003E000007C0003E000007C0003E00000F80003E00000F00003E0
+  0001F00003E00003E00003E00007C00003E0000F800003E0003F000007E000FC0000FFFFFFF80000FFFFFFC00000> 48 41 -2 0 45.661] 68 @dc
+[<FFFF81FFFF00FFFF81FFFF0007F0000FE00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E0
+  0007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007
+  C00003E00007C00003E00007C00003E00007C00003FFFFFFC00003FFFFFFC00003E00007C00003E00007C00003E00007C000
+  03E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E00007C00003E0
+  0007C00003E00007C00003E00007C00003E00007C00003E00007C00007F0000FE000FFFF81FFFF00FFFF81FFFF00> 48 41 -2 0 44.831] 72 @dc
+[<FFFE00FFFE0007C0000780000780000780000780000780000780000780000780000780000780000780000780000780000780
+  0007800007800007C00007C0E007C1F00FA1F0FF91F0FF98E00787C0> 24 26 -1 0 23.412] 114 @dc
+[<0001FF0000000F01E000003C0078000078003C0000F0001E0003E0000F8007C00007C007800003C00F800003E01F000001F0
+  1F000001F03F000001F83E000000F87E000000FC7E000000FC7E000000FC7C0000007CFC0000007EFC0000007EFC0000007E
+  FC0000007EFC0000007EFC0000007EFC0000007EFC0000007EFC0000007E7C0000007C7C0000007C7E000000FC7E000000FC
+  3E000000F83E000000F81F000001F01F000001F00F800003E007800003C003C000078001E0000F0000E0000E000078003C00
+  003C007800000F01E0000001FF0000> 40 43 -3 1 46.491] 79 @dc
+[<007F0001C0C00780200F00101E00083C00043C00047C0000780000F80000F80000F80000F80000F80000F80000F80000F800
+  007800007C00103C00383C007C1E007C0F003C07800801C070007F80> 24 26 -2 0 26.566] 99 @dc
+[<0407E00006181C0007200E000740078007C003C0078001C0078001E0078001F0078000F0078000F0078000F8078000F80780
+  00F8078000F8078000F8078000F8078000F8078000F0078000F0078001E0078001E007C003C007A0038007B00700078C1C00
+  0783F00007800000078000000780000007800000078000000780000007800000078000000780000007800000078000000780
+  00000F800000FF800000FF80000007800000> 32 42 -1 0 33.208] 98 @dc
+[<03F800000FFE00001C0780001801C0003C00E0003E0070003E0078001C003C0000003C0000001E0000001E0000001E000000
+  0F0000000F0000000F00007E0F8003810F800700CF800E004F801C002F803C001F803C001F8078001F8078000F80F8000F80
+  F8000F80F8000F80F8000F80F8000F00F8000F00F8000F00F8000F0078001E0078001E003C001C001C003C001E0038000F00
+  700007C1E00001FFC000007F0000> 32 41 -2 1 29.887] 57 @dc
+[<4020101008040404020202027AFEFEFCFC78> 8 18 -5 12 16.604] 44 @dc
+[<7FFFE07FFFE001F80000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F0
+  0000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000F00000
+  F000F0F000FEF0000FF00001F000007000001000> 24 40 -5 0 29.887] 49 @dc
+cmti12.300 @sf
+[<FFFF000003E0000001E0000001E0000001E0000001E0000000F0000000F0000000F0000000F0000000780000007800000078
+  000000780000003C0000003C0000003C0000003C0000001E0000001E0000001E0000001E0000000F0000800F0010800F0010
+  400F0010400780104007801020078018200780183003C0181803C0181E03C0381FFFFFF8> 32 34 -9 0 34.869] 84 @dc
+1 @bop1
+cmr10.622 @sf
+222 508 p 65 c
+28 r (trace) s
+29 r (displa) s
+-1 r 121 c
+27 r (and) s
+29 r (editing) s
+29 r (program) s
+29 r (for) s
+214 612 p (data) s
+29 r (from) s
+29 r (\015uorescence) s
+28 r (based) s
+29 r (sequencing) s
+802 716 p (mac) s
+-1 r (hines) s
+cmr10.432 @sf
+464 864 p (Timoth) s
+-1 r 121 c
+19 r (Gleeson) s
+157 r (LaDeana) s
+19 r (Hillier) s
+765 981 p (Octob) s
+2 r (er) s
+20 r (9,) s
+19 r (1991) s
+cmbx12.300 @sf
+164 1381 p (ABSTRA) s
+-1 r (CT) s
+cmr12.300 @sf
+164 1473 p (\\T) s
+-3 r (ed") s
+16 r 40 c
+cmti12.300 @sf
+0 r 84 c
+cmr12.300 @sf
+0 r (race) s
+cmti12.300 @sf
+18 r 101 c
+-1 r 100 c
+cmr12.300 @sf
+0 r (itor\)) s
+16 r (is) s
+18 r 97 c
+17 r (graphical) s
+18 r (editor) s
+17 r (for) s
+18 r (sequence) s
+18 r (and) s
+17 r (trace) s
+18 r (data) s
+17 r (from) s
+164 1534 p (automated) s
+17 r (\015uorescence) s
+17 r (sequencing) s
+17 r (mac) s
+0 r (hines.) s
+23 r (It) s
+17 r (pro) s
+0 r (vides) s
+16 r (facilities) s
+17 r (for) s
+17 r (view-) s
+164 1594 p (ing) s
+13 r (sequence) s
+13 r (and) s
+13 r (trace) s
+12 r (data) s
+13 r (\(in) s
+13 r (top) s
+13 r (or) s
+13 r 98 c
+1 r (ottom) s
+13 r (strand) s
+13 r (orien) s
+0 r (tation\),) s
+12 r (for) s
+13 r (editing) s
+164 1654 p (the) s
+18 r (base) s
+18 r (sequence,) s
+19 r (for) s
+18 r (automated) s
+19 r (or) s
+18 r (man) s
+0 r (ual) s
+17 r (trimming) s
+18 r (of) s
+18 r (the) s
+19 r (head) s
+18 r (\(v) s
+-1 r (ector\)) s
+164 1714 p (and) s
+14 r (tail) s
+15 r (\(uncertain) s
+14 r (data\)) s
+14 r (from) s
+15 r (the) s
+14 r (sequence,) s
+15 r (for) s
+14 r 118 c
+0 r (ertical) s
+13 r (and) s
+14 r (horizon) s
+0 r (tal) s
+13 r (trace) s
+164 1774 p (scaling,) s
+13 r (for) s
+13 r 107 c
+0 r (eeping) s
+12 r 97 c
+13 r (history) s
+13 r (of) s
+12 r (sequence) s
+13 r (editing,) s
+14 r (and) s
+13 r (for) s
+12 r (output) s
+13 r (of) s
+13 r (the) s
+13 r (edited) s
+164 1835 p (sequence.) s
+20 r 84 c
+-3 r (ed) s
+12 r (has) s
+12 r 98 c
+1 r (een) s
+12 r (used) s
+13 r (extensiv) s
+-1 r (ely) s
+12 r (in) s
+12 r (the) s
+12 r (C.) s
+13 r (elegans) s
+12 r (genome) s
+12 r (sequencing) s
+164 1895 p (pro) s
+3 r (ject,) s
+20 r 98 c
+2 r (oth) s
+19 r (as) s
+20 r 97 c
+20 r (stand-alone) s
+19 r (program) s
+20 r (and) s
+20 r (in) s
+-1 r (tegrated) s
+19 r (in) s
+0 r (to) s
+18 r (the) s
+20 r (Staden) s
+20 r (se-) s
+164 1955 p (quence) s
+11 r (assem) s
+0 r (bly) s
+11 r (pac) s
+-1 r 107 c
+-2 r (age,) s
+11 r (and) s
+12 r (has) s
+11 r (greatly) s
+12 r (aided) s
+11 r (in) s
+12 r (the) s
+11 r (e\016ciency) s
+12 r (and) s
+11 r (accuracy) s
+164 2015 p (of) s
+16 r (sequence) s
+17 r (editing.) s
+21 r (It) s
+16 r (runs) s
+17 r (in) s
+16 r (the) s
+16 r 88 c
+17 r (windo) s
+-1 r (ws) s
+15 r (en) s
+0 r (vironmen) s
+-1 r 116 c
+15 r (on) s
+16 r (Sun) s
+17 r 119 c
+-1 r (orksta-) s
+164 2075 p (tions) s
+14 r (and) s
+14 r (is) s
+14 r 97 c
+-1 r 118 c
+-3 r (ailable) s
+13 r (from) s
+14 r (the) s
+14 r (authors.) s
+21 r 84 c
+-3 r (ed) s
+13 r (curren) s
+-1 r (tly) s
+13 r (supp) s
+2 r (orts) s
+13 r (sequence) s
+14 r (and) s
+164 2136 p (trace) s
+16 r (data) s
+16 r (from) s
+17 r (the) s
+16 r (ABI) s
+16 r (373A) s
+17 r (and) s
+16 r (Pharmacia) s
+16 r (A.L.F.) s
+16 r (sequencers.) s
+cmbx12.300 @sf
+164 2261 p (INTR) s
+-1 r (ODUCTION) s
+cmr12.300 @sf
+164 2354 p (Time) s
+23 r (in) s
+0 r 118 c
+-1 r (olv) s
+-2 r (ed) s
+22 r (in) s
+24 r (sequence) s
+23 r (editing) s
+23 r (is) s
+24 r (extensiv) s
+-1 r (e,) s
+24 r (and) s
+24 r (an) s
+-1 r (ything) s
+22 r (easing) s
+24 r (that) s
+164 2414 p (burden) s
+21 r (will) s
+21 r (impro) s
+0 r 118 c
+-1 r 101 c
+20 r (the) s
+21 r (e\016ciency) s
+21 r (of) s
+21 r (an) s
+0 r 121 c
+20 r (ma) s
+3 r (jor) s
+21 r (sequencing) s
+21 r (pro) s
+3 r (ject.) s
+37 r (Ha) s
+-1 r (v-) s
+164 2474 p (ing) s
+19 r (sequence) s
+19 r (and) s
+19 r (trace) s
+19 r (data) s
+20 r 97 c
+-1 r 118 c
+-2 r (ailable) s
+18 r (online) s
+19 r (in) s
+19 r (easily-) s
+19 r (manipulable) s
+19 r (form) s
+19 r (is) s
+961 2599 p 49 c
+@eop
+@end
diff --git a/doc/ted.tex b/doc/ted.tex
new file mode 100644
index 0000000..0a0b291
--- /dev/null
+++ b/doc/ted.tex
@@ -0,0 +1,213 @@
+\documentstyle[12pt]{article}
+
+\title{A trace display and editing program for data from fluorescence based 
+sequencing machines}
+\author{Timothy Gleeson \and LaDeana Hillier}
+
+\begin{document}
+\maketitle
+\section*{}
+\subsection*{}
+\subsubsection*{ABSTRACT}
+
+``Ted'' ({\em T}race {\em ed}itor) 
+is a graphical editor for sequence and trace data from automated 
+fluorescence sequencing machines.  It provides facilities 
+for viewing sequence and trace data (in top or bottom strand 
+orientation), for editing the base sequence,  for 
+automated or manual trimming of the head (vector) and tail 
+(uncertain data) from the sequence, for vertical and horizontal trace 
+scaling, for keeping a history of sequence editing, and for output of 
+the edited sequence.  Ted has been used extensively in the C. 
+elegans genome sequencing project,
+both as a stand-alone program and integrated into 
+the Staden sequence assembly package, and  has 
+greatly aided in the efficiency 
+and accuracy of sequence editing.  It runs in the X 
+windows environment on Sun workstations and is available from the 
+authors.  Ted currently supports sequence and trace data from the ABI 
+373A and Pharmacia A.L.F. sequencers.
+
+\subsubsection*{INTRODUCTION}
+	Time involved in sequence editing is extensive, and anything easing 
+that burden will improve the efficiency of any major sequencing 
+project.  Having sequence and trace data available online in easily-
+manipulable form is invaluable. Ted (a Trace-EDitor) was developed to 
+fill this role in the C. elegans genome 
+sequencing project [1]. 
+
+\subsubsection*{METHODS}
+
+{\em Computing Design and Implementation.}
+When designing ted, we had a number of specific computing goals 
+in mind including portability and adaptability.  For portability, we 
+chose to write ted in ANSI C using the X windowing system and the 
+Xaw toolkit.  X provides basic capabilities for the creation and use 
+of windows, and the toolkit contains a number of pre-packaged 
+components, such as the ``sliders'' used for scrolling. X also allows 
+site, user and per-run defaults to be set.  Adaptability is also an 
+important goal since we are providing a new function to 
+research groups who are constantly adding new requirements.  
+
+	Stylistically, we have followed an ``Abstract Data Type''
+discipline.  In this discipline, a program is split into a number of 
+modules which provide separate, well-defined functions.  We 
+separate the interface of a module from its implementation.  For 
+example, a unified internal sequence format is used.  This can store 
+a varying amount of information.  However, there is a clear and 
+simple interface by which the rest of the program accesses this 
+module.  Such a style is not well supported by C, but its adoption has 
+been very successful.  The addition of new sequencing machines, and 
+thus new external data formats, may cause some changes in the 
+internal representation of the sequence but should not affect  
+the rest of the program.
+
+	Ted accepts a large number of optional command line arguments,
+many of which can also be specified as system defaults. This
+supports a mode of working whereby ted is invoked not directly by the
+user but instead by a script or another application which supplies
+arguments appropriate to the editing task.
+
+
+{\em Graphical Interface.}
+Ted currently accepts data from two fluorescence based sequencing
+machines, the Pharmacia A.L.F. and the ABI 373A.
+The sequencing machine data consists of 
+four traces of fluorescence levels together with the machine's 
+interpretation, which is a sequence of bases.  
+Ted displays 
+the traces and the machine-generated base list.  
+A second, initially identical, list of bases is provided for correction 
+by the user.
+
+	Ted has an X windows based 
+graphical interface. The trace file
+can either be input from the command line or by 
+clicking on the INPUT button after the program has been invoked.  
+Other parameters which the user may specify on the
+command line include: the output 
+file name; a base position or sequence string on which the trace is 
+to be centered;  a default trace magnification;  a 5' vector sequence 
+for automated elimination of the sequence head (vector); top or 
+bottom strand orientation; or any of the usual X-window parameters (e.g. 
+display, geometry...).
+
+	The graphics display (Figure 1) consists of the control 
+panel, the base position information, the original and edited sequence 
+data, and the graphical representation of the trace.  The user may 
+begin by using the control panel INPUT button to input a new trace 
+file at which time the user selects whether to view the sequence
+and trace in top or bottom strand orientation.
+The trace file is displayed and, if a 5' vector sequence has been 
+specified on the command line, the program attempts to select a 
+cutoff point corresponding to the vector sequence at the ``head'' of the 
+trace file.  The bases beyond the ``cutoff'' point are  
+displayed on a shaded background.  The user may modify the cutoff 
+position by clicking on the ``Adj left cut'' button and clicking on the 
+position of the desired cutoff.  Similarly, the user may adjust the 
+right cutoff of the sequence (chosen by starting at the 5' end of the 
+sequence and looking for the first occurrence when 2 out of 5 bases 
+are 'N') by scrolling along the sequence to that point, clicking on the 
+``Adj right cut'' button, and clicking on the appropriate base.  
+Automation of the ``cutoff'' process is optional; the user may compile 
+the program with that feature turned ``off.'' 
+
+	Clicking on the ``Edit seq'' button allows the user to enter the edit 
+mode.  The ``Search'' button can be used to skip from ``problem'' to 
+``problem'' (i.e., ambiguity to ambiguity) or to look for runs of 
+identical bases (e.g., TTTT) which are often mis-called by
+the machine software.
+
+  Bases can be inserted, deleted, or replaced as with
+any ordinary word-processor.  In difficult-to-read areas,  
+the trace may be vertically or horizontally scaled by dragging or 
+clicking on the magnification scroll bar or by clicking on the 
+vertical scaling buttons (``Scale down'', ``Scale up''), respectively.  
+Finally, the edited sequence is saved to an ascii file using the 
+``Output'' button.  A history of the editing session can also be saved
+along with the sequence. 
+The ``Quit'' button is used 
+to exit the program.  When reinvoking ted on an edited trace file the 
+edited base sequence, rather than the original sequence, is shown in 
+the edited base window.  The user may invoke ted by calling in any one 
+of the previous editing sessions.   
+
+
+\subsubsection*{APPLICATIONS AND CONCLUSIONS}
+
+	In the C. elegans genome sequencing project, data from the ABI or 
+A.L.F. sequencing machines' computers are transferred to Sun 
+workstations.  
+The user invokes a Unix shell script that calls ted systematically 
+on each of the new set of trace files creating a set of sequence files.
+The sequence files that are deemed to be of acceptable quality
+are then entered into the sequence 
+assembly program xdap [2] where the sequences are assembled into 
+contigs.  Portions of the ted trace-editor have been incorporated 
+into the xdap ``trace manager,''  which is used in 
+conjunction with the contig editor to view sets of aligned traces 
+at sites of discrepancies in the aligned sequences.  
+
+	Ted is also used at the stage of choosing oligo primers for the 
+``walking'' stage of the sequencing project.  It can be invoked directly 
+from the oligo selection program, osp [3], to allow examination
+of the trace data in the region of the primers so that  
+integrity of the sequence data can be verified.
+
+	Currently, no other programs are known to be available 
+which support editing of the ABI trace data. 
+Further, the modular design of the program should allow
+support for new types of sequencing machines, with new data 
+formats, to be implemented in a straightforward fashion.  
+
+
+\subsubsection*{AVAILABILITY}
+	Ted is freely available from the authors or from Rodger Staden and
+Simon Dear (MRC Laboratory of Molecular Biology, Hills Road, Cambridge,
+UK, CB2 2QH) for use on Sun workstations running X-windows (or OpenLook).
+
+
+\subsubsection*{ACKNOWLEDGMENTS}
+	The authors would like to thank all members of the C. elegans
+sequencing project with special thanks to the following people:
+John Sulston, Bob Waterston,  
+Phil Green, Rick Wilson, Richard Durbin, Simon Dear, and Rodger Staden 
+for their helpful suggestions for improvements in the ted interface 
+and for their parts in the development of ted.  This work was 
+supported by the Medical Research Council and NIH grant R01-HG00136.
+
+\subsubsection*{REFERENCES}
+
+1. Waterston, R., Sulston, J., et al. (1991), in preparation.
+
+2. Dear, S. and Staden, R. (1991) Nuc. Acids Res.,  in press.
+
+3. Hillier, L. and Green, P. (1991) submitted.
+
+
+{\bf Figure 1 legend.}
+
+Figure 1 shows a ``screen dump'' of the ted graphical interface.  
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+ being represented
+by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.
+
+\end{document}
+
+
+
diff --git a/help/BAP.RNO b/help/BAP.RNO
new file mode 100644
index 0000000..731c8c9
--- /dev/null
+++ b/help/BAP.RNO
@@ -0,0 +1,2722 @@
+.npa
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. TX  0 @Modification
+.sp
+@0.  TX -1 @BAP
+.left margin2
+.PARA
+This is an interactive program whose primary use is
+for managing shotgun sequencing projects, but it can also be used for 
+handling alignments of other sequences, including those of proteins. 
+Currently the maximum 'gel reading' length is set to 4096 characters. 
+Almost all of the information below describes the use of the program for 
+shotgun projects, but those using the programs for handling other 
+sequence 
+alignments should interpret it accordingly.
+The data for such a project is stored in a special type of database. The 
+program
+ contains the tools that are required to screen gel readings
+against vector sequences and restriction sites, and to assemble
+new gel 
+readings into the database (automatically comparing and aligning
+them). In addition it contains editors and functions to examine the quality 
+of the aligned sequences.
+.para
+ There are three main menus: "general", "screen" and  "modification", 
+and some functions have submenus.
+.left margin2
+.lit
+  The general menu contains the following options:
+
+       Open a database
+       Display a contig
+       List a text file
+       Direct output
+       Calculate a consensus
+       Screen against restriction enzymes
+       Screen against vector
+       Check logical consistency
+       Copy database
+       Show relationships
+       set parameters
+       Highlight disagreements
+       Examine quality
+       Check Assembly
+       Find read pairs
+
+The graphics menu contains:
+
+       Clear graphics
+       Clear text
+       Draw ruler
+       Use cross hair
+       Change margins
+       Label diagram
+       Plot map
+       Plot single contig
+       Plot all contigs
+
+
+The modification menu contains:
+
+       Edit contig
+       Auto assemble
+       Join contigs
+       Complement a contig
+       Alter relationships
+       Extract gel readings
+       Find internal joins
+       Disassemble readings
+       Shuffle pads
+       Auto-select oligos
+       Double strand
+
+The alter relationships menu contains:
+
+       Cancel
+       Line change
+       Check logical consistency
+       Remove contig
+       Shift
+       Move gel reading
+       Rename gel reading
+       Break a contig
+       Remove a gel reading
+       Alter raw data parameters
+
+.END LIT
+.SK1
+.para
+Overview of the methodology
+.para
+The shotgun sequencing strategy
+.para
+               In the shotgun sequencing procedure
+the sequence to be determined is randomly broken into fragments of 
+about 
+1000 nucleotides in length. These fragments are cloned and then 
+selected randomly and their
+
+          sequences   determined.    The  relationship  between  any  pair  of
+
+          fragments is not known beforehand
+but is found by comparing their  sequences.
+
+          If  the  sequence  of  one found to be wholly or partially contained
+
+          within that of another  for  sufficient  length  to  distinguish  an
+
+          overlap  from  a repeat then those two fragments can be joined.  
+The
+
+          process of select, sequence and compare is continued until the 
+whole
+
+          of  the  DNA  to  be  sequenced is in one continuous well 
+determined
+
+          piece.
+
+.para
+          Definition of a contig
+
+.para
+               A CONTIG is a set of gel  readings  that  are  related  to  one
+          another  by  overlap of their sequences.  All gel readings belong to
+          a contig and each contig contains at  least  one  gel
+          reading.   The  gel  readings in a contig can be summed to produce 
+a continuous consensus sequence and the length of this sequence is 
+the length  of the contig.  The rules used to perform this summation are
+          given under "the consensus algorithm".
+          At any stage
+          of  a  sequencing project the data will comprise a number of 
+contigs;
+when  a  project  is
+
+          complete  there  should be only one contig and its consensus will be
+          the finished sequence. Note that since being introduced and 
+defined as above the word "contig" has been taken up by those involved in 
+genomic mapping. In that context the consensus with a  precise length is,
+of course, not 
+defined.
+
+.SK1
+.LEFT MARGIN2
+Introduction to the computer method
+.LEFT margin2
+.PARA
+It is useful to consider the objectives of a sequencing project before 
+outlining how we use the computer to help achieve them. The aim of a 
+shotgun sequencing project is to 
+produce an accurate consensus sequence from many overlapping gel 
+readings.
+It is necessary to know, particularly at the latter
+stages of the project, how accurate the 
+consensus sequence is. This enables us to know which regions of the
+ sequence require further work and also to know when the project is 
+finished.
+To show the quality of the consensus, the programs described here
+produce displays like that shown below.
+.sk1
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.para
+                This is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gel readings 
+numbered 6, 3, 5, 17 and 12;
+6, 3 and 5 
+are in reverse orientation to their original reading (denoted by a minus  
+sign). Each gel reading also has a name (eg HINW.010). It can be seen that 
+in a number of places the sequences contain characters other than A,C,G 
+and 
+T. Some of these extra characters have been used by the sequencer to 
+indicate regions of uncertainty in the initial interpretation of the gel 
+reading, but the asterisks (*) have been inserted by the automatic 
+assembly function in order to align the sequences. Underneath each 50 
+character block of gel reading sequences is the consensus derived from 
+the 
+sequences aligned above (the line labelled CONSENSUS). For most of its 
+length the consensus has a definite nucleotide assignment but in a few 
+positions there is insufficient agreement between the gel readings and
+so a dash (-) appears in the sequence. This display contains all the 
+evidence needed to assess the quality of the consensus: the number of 
+times 
+the sequence has been determined on each strand of the DNA, and the 
+individual nucleotide assignments given for each gel reading.
+.para
+So the aim is to produce the consensus sequence and, equally important, 
+a display of the experimental results from which it was derived.
+.para
+In order to achieve this the following operations need to be performed:
+.left margin2
+1) Put individual gel readings into the computer.
+This might involved the manual interpretation of autoradiographs
+or the transfer and process of machine-readable files from fluorescent
+sequencing machines.
+.left margin2
+2) Check each gel reading to make sure it is not simply part of one of the 
+vectors used to clone the sequence.
+.left margin2
+3) Check each gel reading to make sure that those fragments that span 
+the 
+ligation point used prior to sonication are not assembled as single 
+sequences.
+.left margin2
+4) Compare all the remaining gel readings with one another to assemble 
+them 
+to produce the consensus sequence.
+.left margin2
+5) Check the quality of the consensus and edit the sequences.
+.left margin2
+6) When all the consensus is sufficiently well determined, produce a copy 
+of 
+it for processing by other analysis programs.
+.para
+It is very unlikely that this procedure will only be passed through once.
+Usually steps 1 to 5 are cycled through repeatedly, with step 4 just 
+adding 
+new sequences to those already assembled. Generally step 6 is also used 
+in 
+order to analyse imperfect sequence to check if it is the one the project 
+intended to sequence, or to look for interesting features. Analysis of 
+the consensus, such as 
+searches for protein coding regions,
+can also help to find errors in the sequence. The display of the 
+overlapping gel readings shown above can be used to indicate, not only 
+the 
+poorly determined regions, but also which clones should be resequenced 
+to 
+resolve ambiguities, or those which can usefully be extended or 
+sequenced 
+in the reverse direction, to cover 
+difficult regions.
+
+.PARA
+The original
+individual gel readings for a sequencing project are each stored in 
+separate files. As the gel readings are entered into the computer
+(usually in batches, say 10 
+from a film), the file names they are given are stored in 
+a further file, called a file of file names. Files of file names  
+enable gel readings to be processed in batches. 
+.para
+For each sequencing project 
+we start a project database. This database has a structure specifically 
+designed for
+dealing with shotgun sequence data. 
+In order to arrive at the final consensus sequence many operations will 
+be 
+performed on the sequence data. Individual fragments must be 
+sequenced and 
+compared in both senses (i.e. both orientations) with all the other 
+sequences. When an overlap between a new gel reading and a contig are 
+found 
+they must be aligned and the new gel reading added to the contig. If a 
+new 
+gel reading overlaps two contigs they must be aligned and joined. Before 
+the two contigs are joined one of them may need to be turned around 
+(reversed and complemented) so they are both in in the same orientation. 
+.para
+Clearly, keeping track of all these manipulations is quite complicated,
+and to be able to perform the operations 
+quickly requires careful choice of data 
+structure and algorithms. For these reasons it is not practicable to store 
+the gel readings aligned as shown in the display above. Rather, it is more 
+convenient to store the sequences unassembled, and to record sufficient 
+information for programs to assemble them during processing. The 
+data used to assemble the sequences is called relational information. 
+.left margin2
+.PARA
+ The database comprises five files and they are described under the 
+section entitled "open database".
+.PARA
+Before entry into the project database 
+each new gel reading must be compared to look for overlaps 
+with all the data already contained 
+within the database. This last point is 
+important: all searching for overlaps is between individual new gel 
+readings and the data already in the database. There is no searching for 
+overlaps between sequences within the database; overlaps must be found 
+before new gel readings are entered into the database.
+.para
+Below I give an introduction to how the sequences are processed by 
+being 
+passed from one function to the next.
+.para
+This program is used to start a 
+database for the project and 
+then the following procedure is used.
+.para
+Data in the form of individual gel readings are entered into the computer
+ 
+and stored in separate files (possibly using either the digitizer 
+
+program GIP). Batches 
+of these gel readings 
+are passed to the screening functions in this program to search for overlaps 
+
+with vector sequences (see VEP and "screen against vector") or for matches to 
+
+restriction enzyme sites  that should not be
+ 
+present ("screen against enzymes"). 
+Each run of these screening functions passes on only those gel
+ 
+readings that do not contain unwanted sequences. Sequences are passed 
+
+via 
+files of file names and eventually are processed by the automatic 
+assembly function ("auto assemble"). This function compares each gel 
+reading with a consensus of all the previous gel readings 
+stored in the database.
+If it finds any 
+overlaps
+ it aligns the overlapping sequences by inserting padding characters, 
+and then adds the new gel reading to the database. 
+Gels that overlap are added to existing contigs and gels that do not 
+overlap any data in the database start 
+new contigs. If a new gel overlaps two contigs they are joined. 
+Any gel readings that appear to overlap but which
+cannot be aligned sufficiently well are not entered and have 
+their names written to a file of failed gel reading names.
+.PARA
+Generally data is entered 
+into the database in batches as just described. The program
+ is also used to examine
+
+the data in the database, to enter gel readings that the automatic
+ 
+assembly function cannot align ("auto assemble"),
+
+ and to make final edits. Edits to whole contigs 
+
+can  be made using a
+ mouse-driven editor ("edit contig").
+
+.PARA
+Editing the sequences is obviously an essential part of managing a
+
+sequencing project. 
+Editing is required when new 
+
+sequences are added, when contigs are joined, and when sequences are 
+
+corrected.
+A basic part of the strategy
+
+used here is that new
+ 
+gel readings should be correctly aligned throughout their whole length
+
+when 
+they are entered into the database, and that when contigs are joined they
+ 
+are edited so that they are well aligned in the region of overlap.
+
+ Alignment can be achieved by
+
+adding padding characters to the sequences, and this is the way "auto
+ 
+assemble"  
+operates when adding new sequences to the database.
+
+.para
+In order to search 
+for overlaps that may have been missed or may be hidden in the "unused data"
+the function "find internal joins" can be used.
+
+.para
+Generally the users need not concern themselves with how the relational 
+information is used by the program, but it is necessary to know
+how contigs are identified. Because contigs are constantly being changed and 
+reordered the program identifies them by the numbers of the gel readings 
+they contain. Whenever users need to identify a contig they need only 
+know 
+the number or name of one of the gel readings it contains. Whenever the 
+program asks users to identify a contig or gel reading they can type its 
+number or its archive name. If they type its archive name they must precede 
+the name by a slash "/" symbol to denote that it is a name rather than a 
+number. E.g if the  archive
+name is fred.gel with number 99, users should
+type /fred.gel or 99 when asked to identify the contig. Generally,
+ when it asks for the gel reading to be identified,
+the program will offer the user a default name,
+ and if the user types only return, that 
+contig will be accessed. When a database is opened the default contig will 
+be the longest one, but if another is accessed, it will subsequently  become 
+the current default.
+.para
+Further information is located in the following places. 
+The database files are described under "open database". The format 
+for 
+vector and consensus sequences is given under "calculate a consensus", as are 
+the 
+uncertainty codes used in gel readings. 
+.left margin2
+.para
+         The digitiser program
+is used for the initial input of gel readings
+and for writing a file of file names. The program
+uses a digitizer for data entry.
+A digitizer is
+  a  two  dimensional  surface such as a light box
+which is such that if a special pen is pressed onto it, the pens 
+coordinates are recorded by a computer.
+These coordinates
+          can be interpreted by a program.
+.para
+               In order to read an autoradiograph placed on the light box
+the user  need  only  define the bottom of
+the four sequencing lanes and the bases
+          to which they correspond and then use  the  pen  to  point  to  each
+          successive  band progressing up the gel.  The program examines 
+the
+          coordinates of each pen position to see in which of the  four  
+lanes
+          it  lies  and  assigns  the  corresponding  base to be stored in the
+          computer.  Each time the pen tip is depressed to point to a position
+          on  the  surface of the digitizer the program sounds the bell on the
+          terminal to indicate to the user that a point has been recorded.  As
+          the  sequence  is read the program displays it on the screen.
+
+.left margin1
+@17. TX 1 @Screen against enzymes
+.left margin2
+.PARA
+Used to compare gel readings against any restriction enzyme recognition
+ 
+sequences that may have been used  during cloning and which should not 
+
+be present in the data. Works on single gel readings or processes batches
+ 
+accessed through files of file names. The algorithm looks for exact 
+
+matches to recognition sequences stored in a file. 
+
+.para
+The file containing the recognition sequences must be identified. The
+user 
+must choose between employing a file of file names, or typing in the
+
+ 
+names of individual gel reading files. If a file of file names is used the
+
+
+program will also create a new file of file names. When the option has
+ 
+finished operating this new file will contain the names of all those gel
+ 
+readings that did not match any of the recognition sequences. Hence it
+ can 
+be used for further processing of the batch. The recognition sequences 
+
+should be stored in a simple text file with one recognition sequence per 
+
+line.
+.left margin1
+@18. TX 1 @Screen against vector
+.left margin2
+.PARA
+Used to compare gel readings against any vector sequences that may have 
+
+been picked up during cloning and which have not been removed by vep. 
+It Works on single gel readings or processes 
+
+batches accessed through files of file names. The algorithm looks for
+exact 
+matches of length "minimum match length" and displays the overlapping 
+
+sequences.
+.para
+The file containing the vector sequence must be identified. The user must
+
+choose between employing a file of file names, or typing in the names of
+ 
+individual gel reading files. If a file of file names is used the program 
+will 
+also create a new file of file names. When the option has finished 
+
+operating  this new file will contain the names of all those gel readings
+
+that did not match the vector sequence. Hence it can be used for further 
+
+processing of the batch. The vector sequence should be stored in a simple
+ 
+text file with up to 80 characters of data per line. More than one vector
+ 
+can be stored in a single file. If so each should be preceded by a 20 
+
+character title of the form <---m13mp8.0001----> where the < and >
+ signs 
+and the number like .0001 are obligatory. The number must be preceded 
+
+by a dot (.) and be 4 digits long. The total sequence in the file must be <
+
+500,001 characters in length.
+
+.left margin1
+@20. TX 3 @Auto assemble
+.left margin2
+.PARA
+Compares gel readings against the current contents of the database and 
+
+produces  alignments. In its normal mode of operation 
+("entry permitted"), the function
+will automatically enter the gel readings into the database. 
+.para
+New assembly suboption.
+However
+if entry is not permitted the reads won't be entered but the program
+will produce alignments and (optionally) save each reading name and its best 
+alignment score (percentage mismatch) in a file. When used in
+this mode, the program will include in the alignment the poor quality data 
+for each reading. These files of names can then be sorted into score order 
+and then used for assembly, hence forcing the readings that align best to
+be entered into the database first.
+End of new suboption.
+.para
+The routine works on 
+
+single gel readings or processes batches of gel readings accessed through 
+
+files of file names. It is the only way to enter data into the database.
+
+.para
+The function will check the database for logical consistency and will
+ only 
+proceed if it is OK. Choose if gel readings should be entered into the
+ 
+database, or if they should only be compared. Choose between using a file 
+
+of file names or typing file names on the keyboard. If so selected, supply 
+
+the file of file names. Also supply a file of file names to contain the names of
+ 
+all the gel readings that fail to get entered. 
+Select the entry mode. Normal assembly is appropriate for all but special 
+cases, as is "permit joins". Uses for the other modes are not documented 
+here.
+Define a minimum initial
+ 
+match length. 
+Define the maximum number
+ 
+of padding characters allowed to be used in each gel reading to help
+
+achieve alignment, and the same for the number allowed in the contig for
+ 
+each gel reading. Finally define the maximum percentage mismatch to
+be allowed for any gel reading to be entered into the database. If
+ 
+for any gel reading, either of these last three values is exceeded the gel
+ 
+reading will not be entered into the database.
+
+.para
+In operation the function takes a batch of gel readings (probably  passed
+
+          on  as  a file of file names from one of the screening routines) and 
+enters them into a
+          database for a sequencing project.  It takes each  gel reading
+  in  turn,
+          compares  it with the current consensus for the database, it then
+          produces an  alignment  for  any  regions  of  the  consensus  it
+          overlaps;   if  this alignment is sufficiently good it then edits
+          both the new gel reading and the sequences it overlaps  and  adds  
+the
+          new  gel reading to the database.  The program then updates the 
+consensus
+          accordingly and carries on to the next  gel  reading.
+.para
+          All alignments are displayed and any gel readings 
+that do match but  that
+
+          cannot be aligned sufficiently well have their names written to a
+          file of failed gel reading names.  The function works without  any
+
+          user intervention and can process any number of gel readings in a
+          single run.  Those gel readings that fail can be recompared using
+
+          the same function (to find the current overlap position) and  the
+  
+user  can enter them into the database
+
+        using  the   "put all readings in new contigs" 
+assembly option and then joined using "join contigs".
+.para
+Typical dialogue and output from the function is shown below. (Note that 
+output for gel readings 2 - 9 has been deleted to save space).
+.lit
+Automatic sequence assembler
+Database is logically consistent
+? (y/n) (y) Permit entry 
+? (y/n) (y) Use file of file names 
+? File of gel reading names=demo.nam
+? File for names of failures=demo.fail
+Select entry mode
+X  1 Perform normal shotgun assembly 
+   2 Put all sequences in one contig 
+   3 Put all sequences in new contigs
+? Selection  (1-3) (1) =
+? (y/n) (y) Permit joins 
+? Minimum initial match (12-4097) (15) =
+? Maximum pads per gel (0-25) (8) =
+? Maximum pads per gel in contig (0-25) (8) =
+? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+  >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+  Processing           1 in batch
+  Gel reading name=HINW.004                                
+  Gel reading length=   283
+  Searching for overlaps
+  Strand     1
+  Strand     2
+  No matches found
+  Total matches found           1
+  Padding in contig=    0 and in gel=    1
+  Percentage mismatch after alignment =  1.8
+  Best alignment found
+         1         11         21         31         41         51
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         ********** ********** ********** ********** ********** **********
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         1         11         21         31         41         51
+        61         71         81         91        101        111
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+         ********** ********** ********** ********** ********** ***** ****
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+        61         71         81         91        101        111
+       121        131        141        151        161        171
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         ********** ********** ********** ********** ********** **********
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+       121        131        141        151        161        171
+       181        191        201        211        221        231
+         GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         ********** ********** * ******** ********** ********** **********
+         GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+       181        191        201        211        221        231
+       241        251        261        271        281
+         CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+         ****  **** ********** ** * ***** ********** ****
+         CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+       241        251        261        271        281
+          Batch finished
+          9 sequences processed
+          0 sequences entered into database
+          0 joins made
+
+.end lit
+
+.para
+Note that "auto assemble" cannot align protein sequences.
+.left margin1
+@28. TX 1 @Highlight disagreements
+.left margin2
+.para
+Used in the latter stages of a project
+to highlight disagreements between individual gel readings
+and their consensus sequences. This display is also availbale in the 
+contig editor.
+Characters that agree with the
+ 
+consensus are shown as : symbols for the plus strand and . for the minus
+ 
+strand. Characters that disagree with the consensus are left unchanged 
+
+and so stand out clearly. The results of this analysis are written to a 
+file.
+
+.para
+Before selecting this option create a file of the display of the contig to 
+be 
+"highlighted". The option will ask for the name of this file. Select
+ symbols 
+to denote "agreeing" characters on each strand, the defaults are : and ., 
+
+but any others can be used. Supply the name of a file in which to put
+ 
+the output.
+.para
+The display file needed as input for this option is created by selecting 
+
+"Redirect output",  followed immediately by  "display contig", and then 
+"Redirect output" again. The 
+
+cutoff score used in the consensus calculation can be set by option "set
+
+display parameters". Note that for the highlight function
+there is a limit of 50 for the number of gel 
+readings that are aligned at any position - ie the contig must be less 
+than 51 gel readings deep at its thickest point. I hope that those performing
+shotgun sequencing never reach this limit, but those using the program for
+comparing sequence families might.
+.para
+Typical output from this function is shown below.
+.lit
+                                                                     
+                          210       220       230       240       250
+    1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+    7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+   -4  HINW.017                                 ...............AC....
+                   G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+                                                                     
+                          260       270       280       290       300
+    1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+    7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+   -4  HINW.017    ..............................................A...
+    3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+   -6  HINW.028                            ......................A...
+                   AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+                                                                     
+.end lit
+.left margin1
+@32. TX 3 @Extract gel readings
+.left margin2
+.para
+Used to make copies of the aligned gel readings in a database,
+to write them into separate files, and to write a 
+
+corresponding file of file names. It operates in two modes: either all gel
+ 
+readings are extracted, or only those at the ends of contigs. 
+
+.para
+Choose which mode of operation is required and supply a file of file 
+
+names. 
+.para
+The gel readings are given their original 
+
+names. 
+.para
+If the option is used to extract all the gel readings from a database, a 
+
+subsequent run of "auto assemble" can reconstitute a database which has
+
+been corrupted. This  rarely occurs and is usually necessitated by a
+ 
+user  employing "alter relationships"  incorrectly without first having
+ 
+made a copy. 
+.left margin1
+@1. TX 0 @Help
+.left margin2
+.PARA
+Help is available on the following topics :
+
+.LEFT MARGIN1
+@2. TX 0 @Quit
+.LEFT MARGIN2
+.PARA
+This command stops the program and is the only safe way to terminate a 
+
+run 
+of the program that has altered the contents of the database in any way.
+
+.left margin1
+@3. TX 1 @Open a database
+.LEFT MARGIN2
+.PARA
+Opens existing databases or allows new ones to be started. The function
+ is 
+automatically called into operation 
+when the program is started but can also be selected 
+
+from the general menu.
+.para
+Choose to open an existing database or start a new one, or if ! is typed 
+when the program is first started, enter the program without opening a 
+database. Supply a project
+ 
+database name, and if it already exists, the "version". If starting a new
+
+database define the database size and if it is for DNA or protein sequences.
+The database size is an initial size for the database. It can be increased 
+later during the project. It is the sum of the number of gel
+readings plus the number of contigs. The current maximum size is 8000.
+.para
+Database names can have from one to 12 letters and must not include full
+ 
+stop (.). The database is made from five separate files. If the database
+ is 
+called FRED then version 0 of database FRED comprises files FRED.AR0, 
+
+FRED.RL0, FRED.SQ0, FRED.TG0 and FRED.CC0. The version is the last symbol in the file names.
+
+Only this program
+ can read these files. If the "copy database" option is used it
+ 
+will ask the user to define a new "version". 
+.para
+For normal use the maximum gel reading length is set to 512 characters,
+
+but when a database is started the user may choose lengths of either
+
+ 512, 
+1024, 1536..., 4096. Normally the program is used to handle DNA 
+
+sequences but many of the functions also work on protein sequences. The
+ 
+choice of sequence type is made when the database is started.
+
+.para
+The contigs are not stored on the disk as the user sees them displayed on
+
+the screen. Each gel reading is stored with sufficient information about
+ 
+how it overlaps other gel readings so that the program can work out how
+ 
+to 
+present them aligned on the screen. We refer to this extra data as "the 
+relationships" and it is explained below.
+ 
+The database comprises 5 separate files.
+
+.left margin2
+          1.  a working version of each gel reading.  This is the  version  of
+          the  gel  reading
+that is in the database and initially it is an exact copy of
+          the original sequence (known as the archive)
+ but it is edited and manipulated to align  it
+          with other gel readings.
+
+.left margin2
+          2.  the file of  relationships.   This  file  contains  all  of  the
+
+          information  that  is required to assemble the working versions 
+into
+
+          contigs during processing;  any manipulations on the data  use  this
+
+          file   and  it  is  automatically  updated  at  any  time  that  the
+
+          relationships are changed.  The  information  in  this  file  is  as
+
+          follows:
+.left margin2
+          (A) Facts about  each  gel reading  and  its  relationship  to  
+others  
+("gel
+
+          descriptor lines"):
+
+.left margin2
+             (a) the number of the gel 
+reading   (each gel reading   is given a number  as  it  is
+
+          entered into the database)
+
+.left margin2
+             (b) the length of the sequence from this gel reading  
+
+.left margin2
+             (c) the position of the left end of this gel 
+reading   relative to the left
+
+          end of the contig of which it is a member
+
+.left margin2
+             (d) the number of the next gel 
+reading   to the left of this gel reading  
+
+.left margin2
+             (e) the number of the next gel reading   to the right
+
+.left margin2
+             (f) the relative strandedness of this gel 
+reading  , ie whether  it  is  in
+
+          the same sense or the complementary sense as its archive.
+
+.left margin2
+          (B) Facts about each contig ("contig descriptor lines"):
+
+.left margin2
+             (a) the length of this contig
+
+.left margin2
+             (b) the number of the leftmost gel 
+reading   of this contig
+
+.left margin2
+             (c) the number of the rightmost gel reading   of this contig.
+
+.left margin2
+          (C) General facts:
+
+.left margin2
+             (a) the number of gel readings in the database
+
+.left margin2
+             (b) the number of contigs in the database.
+
+.left margin2
+          3.  the file of archive names.  This is simply a list of  the  names
+
+          of each of the archive files in the database.
+
+.left margin2
+	  4. the file of tags (annotation). 
+This consists of linked lists of tag information for each sequences in the
+database.
+Tags are created by the user as annotation, or by xdap as records of edits or
+for storing cutoff information.
+As the number of tags can grow without limit, so can this file.
+For each gel there is a header record, which contains the record number of
+the start of the linked list for that gel. On line IDBSIZ there is a record
+containing information about the file such as its present length and if there
+are any free "tag" slots to be reused in the file.
+
+	  5. the file of comments (annotation).
+This consists of linked lists of comment fragments.
+Comments are created by the user as a message attached to annotation,
+or by the system to store cutoff information.
+Comments are character strings of any length.
+Comments longer than 40 characters are broken up into fragments, each 40
+characters long, and are chained together in a link list.
+As the number of comments can grow without limit, so can this file.
+
+.para
+          Structure of the database files
+.para
+          1.  The file of relationships
+.para
+	      The file contains IDBSIZ lines of data:
+          the general data are stored on line IDBSIZ;   data about  gel 
+readings  are
+          stored  from  line 1 downwards;  data about contigs are stored from
+          line IDBSIZ-1 upwards. A database of 500 lines containing 25 gel 
+readings and 4 contigs would have a file 
+          of relationships as is shown below.
+.lit
+
+
+                  ---------------------------------------------
+                     0  Info about the database size
+                     1  Gel descriptor record
+                     2   "      "       "
+                     3   "      "       "
+                     4   "      "       "
+                     5   "      "       "
+                     '   '      '       '
+                     '   '      '       '
+                    25   "      "       "
+                    26  Empty record
+                     '    '     '
+
+                     '    '     '
+                   495    '     '
+                   496  Contig descriptor record
+                   497    "        "        "   
+                   498    "        "        "
+                   499    "        "        "
+                   500   Number of gel readings=25, Number of contigs=4
+                  ---------------------------------------------
+
+          The arrangement of the data in the file of relationships
+
+.end lit
+As each new gel reading   is added into the database a new line is  added
+          to  the  end  of  the  list  of gel descriptor
+ lines.  If this new gel  reading  does not
+          overlap with any gel readings
+ already in the database a new contig  line  is
+          added  to  the top of the list of contig lines.  If it overlaps with
+          one contig then no new contig line need be added but if it  overlaps
+          with  two  contigs  then  these  two  contigs must be joined and the
+          number of contig lines will be reduced by one. Then the list of 
+contig
+          lines is compressed  to  leave  the empty line at the top of the list.
+          Initially the two types of line will move towards  one  another  but
+          eventually,  as  contigs  are joined, the contig descriptor lines will
+          move in the same direction as the  gel descriptor
+ lines.   At  the  end  of  a
+          project  there should  be only one contig line.  The database is thus
+          capable of handling a project of 998 gels.
+.para
+          2.  Structure of the working versions file
+.para    
+        The working versions of gel readings are stored in  a  file  of
+        NGELS lines each containing MAXGEL characters.  Gel reading 
+number 1 is stored on line
+          1, gel reading number  2 on line 2 and so on. NGELS is the
+current number of readings and MAXGEL the maximum reading length.
+.para
+          3.  Structure of the archive names file
+.para
+          This file has NGELS lines of 16 characters.
+
+.para
+          4.  Structure of the tag file
+.para
+This file initially starts with IDBSIZ lines, and is expanded as new tags are
+created.
+Information about the length of the file, and which tag records are reusable
+is stored on line IDBSIZ.
+A database of 500 lines would have a file of tags as shown below.
+.lit
+
+                  ---------------------------------------------
+                     1  Tag descriptor record
+                     2   "      "       "
+                     3   "      "       "
+                     4   "      "       "
+                     5   "      "       "
+                     '   '      '       '
+                     '   '      '       '
+                   497   "      "       "   
+                   498   "      "       "
+                   499   "      "       "
+                   500   Length of file=N, Free list=0
+		   501  Tag record
+		   502   "   "
+		   503   "   "
+		     '   '   '
+		     '   '   '
+		   N-2   "   "
+		   N-1   "   "
+		     N  Tag record
+                  ---------------------------------------------
+
+          The arrangement of the data in the tag file
+
+.end lit
+As each new tag is added to the database, a check is made in the
+file descriptor record at line IDBSIZ. If the list of reusable records is 0,
+the file is extended by one line. Otherwise the new tag is assigned to
+record at the head of the freelist.
+When tags are deleted, they are added to the free list in the file descriptor
+record.
+.para
+          5.  Structure of the comment file
+.para
+This file initially starts with 1 line, and is expanded as new annotation is
+created.
+Information about the length of the file, and which comment records are reusable
+is stored on the first line.
+.lit
+
+                  ---------------------------------------------
+                     1  Length of file=N, Free list=0
+                     2  Comment fragment
+                     3   "       "
+                     4   "       "
+                     '   '       '
+                     '   '       '
+		   N-2   "       "
+		   N-1   "       "
+		     N  Comment fragment
+                  ---------------------------------------------
+
+          The arrangement of the data in the comment file
+
+.end lit
+As each new comment is added to the database, a check is made in the file
+descriptor record at line 1. If the list of reusable records is 0,
+the file is extended to hold the new comment. Otherwise the new comments is
+assigned to records starting with the head of the freelist.
+When comments are deleted, the discarded records are added to the free list in
+the file descriptor record.
+.para
+  There  are  various  checks  within  the  programs  to
+          protect users from themselves:-
+.left margin2
+               1.  All user input is checked for errors - e.g.   reference  to
+               non-existent  gel 
+readings or  contigs,  incorrect  positions in the
+               contig or gel readings.
+.left margin2
+               2.  Before entering a gel reading the system checks to see if a
+               file of the same name has already been entered.
+.left margin2
+               3.  Join will not allow the circularising of a contig.
+
+.left margin2
+5. Users may escape from any point in the program.
+.left margin2
+6. Help is available from all points in the program.
+.SK2
+.LEFT MARGIN2
+IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS 
+DOING 
+ANYTHING THAT INVOLVES CHANGING THE CONTENTS OF THE 
+DATABASE. I.E DURING AUTO ASSEMBLE,
+COMPLETE JOIN, COMPLEMENT CONTIG, SAVE EDIT CONTIG.
+
+This could 
+corrupt the database so badly that it is impossible to fix. The program 
+should always be left using the QUIT option.
+
+.left margin1
+@4. TX 3 @Edit contig
+.LEFT MARGIN2
+.PARA
+The Contig Editor is a mouse-driven editor that can insert,
+delete and change gel reading sequences.
+.para
+The Contig Editor allows scrolling from one end of a contig to the other
+using the scroll bar and scroll buttons. Action of mouse button presses
+when the mouse pointer is in the scroll bar:
+.sk1
+.lit
+    Middle Mouse Button      Set editor position
+    Left   Mouse Button      Scroll forward one screenful
+    Right  Mouse Button      Scroll backwards one screenful
+.end lit
+.sk1
+The four scroll buttons operate as follows:
+.sk1
+.lit
+    "<<"                     Scroll left half a screenful
+    "<"                      Scroll left one character
+    ">"                      Scroll right one character
+    ">>"                     Scroll right half a screenful
+.end lit
+.para
+The Editor cursor can be positioned anywhere in the edit window by
+moving the mouse pointer over the character of interest, then pressing the
+left mouse button. The Editor cursor can also be moved by using the
+direction arrow keys.
+.para
+The editor operates in two main edit modes - Replace and Insert. Replace allows
+a character to be replaced by another. Insert allows characters to be
+inserted into a gel reading sequence. Characters are entered by typing
+them from the keyboard. Only valid characters are permitted.
+Characters can be deleted by positioning the cursor one character to the right,
+then pressing the delete key.
+Normally Insert and Delete apply to the consensus line of the contig ONLY. 
+This restraint can be overridden by using the "Super Edit" mode of
+operation, THOUGH IT IS NOT RECOMMENDED.
+.para
+Edits can also be performed on the consensus, though they are
+restricted to insertion and deletion of padding characters ("*").
+These edits also have special meanings.
+A deletion will delete ALL characters at the position to the left
+of the cursor in the contig, and move the relative positions of all
+sequences starting to the right of the cursor position left one
+character.
+An insertion will insert the character typed ("*") into ALL gel
+reading sequences at the cursors position in the contig, and move the
+relative positions of all sequences starting to the right of the cursor
+position right one character.
+.para
+The effect of the last edit can be undone by pressing the "Undo" button
+at the top of the editor window.
+.para
+The cursor will automatically be positioned at the next problem when the
+"Find Next Problem" button is selected. The next problem is where the
+consensus shows either an ambiguity ("-") or a pad ("*") character.
+.para
+The edits to the contig can be saved by pressing the "Leave Editor"
+button and replying "Yes" to the prompt to "Save changes?". As no changes
+are made to the working copy of your database til this point it
+is possible to abort the editor if
+the edit session ends up in an unsatisfactory state (ie if you've
+stuffed it up!)
+.left margin1
+.sk3
+Displaying Traces
+.left margin2
+.para
+The original data from which the gel reading sequences where derived can
+be seen by double clicking (two quick clicks) with the middle mouse button
+on the area of interest. The trace will be displayed with the point
+clicked at the centre of the trace viewport.
+.para
+All traces that are displayed are maintained in one window, called the Trace
+Manager. The Trace Manager will only display four traces maximum. When four
+traces are already being managed and a new one is requested, the one at the top
+of the Trace Manager is removed and the new one is added to the bottom.
+Traces can be removed individually by using the "quit" button in the panel next
+to the trace.
+.left margin1
+.sk3
+Extending Reads Using Cutoff Information
+.left margin2
+.para
+Sequence data read in from Automated Fluorescent sequencing machines
+trace files processed through the program ted
+will have the discarded sequence (vector at start and poor read at
+end) available to the contig editor. To display the cutoff
+information, press the "Display Cutoff" button at the top of the
+editor window.
+The cutoff sequence appears in grey. This sequence can be incorporated
+into the editable sequence, by moving the cutoff position. This is
+done by positioning the cursor at the end of the gel sequence, and
+using Meta-Left-Arrow and Meta-Right-Arrow to adjust the point of cutoff.
+The Meta key is a diamond on the Sun keyboard.
+.left margin1
+.sk3
+Pop-up menu
+.left margin2
+.para
+A pop-up menu is revealed by depressing the "Control" key on the keyboard
+and at the same time pressing the left mouse button. The menu has the following
+functions:
+.lit
+
+    Search
+    Highlight Disagreements
+    Save Contig
+    Create Tag
+    Edit Tag
+    Delete Tag
+    Select Oligo
+
+.end lit
+.left margin2
+"Highlight Disaggreements" simply toggles between the normal display showing
+the current base assignments and one in which only those assignments that
+differ from the consensus are shown.
+
+.left margin2
+"Save Contig" is described above.
+Searching and operations on tags are described below.
+.left margin2
+.sk3
+Searching
+.left margin2
+.para
+Selecting "Search" brings up a
+window which can remain present during normal editor operation. The
+window allows the user to select the direction of search, the type of
+search and a value to search on.  The value is entered into the value
+text window. Then pressing the "search" button
+performs the search. If successful, the cursor is positioned and
+centred accordingly. An audible tone indicates failure.  Pressing the
+"ok" button removes the search window. The search window is
+automatically removed when the contig editor is exited.
+.sk1
+There are seven different search modes:
+.sk1
+1. Search by position
+.sk1
+This positions the cursor at the numeric position specified in the
+value text window. Eg a value of "1234" causes the cursor to be placed
+at base number 1234 in the contig. Positioning withing a gel reading is
+achieved by prefixing the number with the "@" character, eg "@123"
+positions the cursor at base 123 of the sequence in which the cursor
+lies. Relative positions can be specified by prefixing the number with
+a plus or minus character. Eg "+1234" will advance the cursor 1234
+bases. If possible, the cursor is positioned within the same sequence.
+The direction buttons have no effect on the operation of "search
+by position".
+.sk1
+2. Search by reading name
+.sk1
+This positions the cursor at the left end of the gel reading specified
+in the value text window. If the value is prefixed with a slash is is
+assumed to be a gel reading name. Otherwise it is assumed to be a gel
+reading number. Eg "123" positions the cursor at the left end of gel
+reading number 123. "/a16a12.s1" positions at the start of reading
+a16a12.s1. If the value was "/a16" the cursor is positioned at the
+first reading which starts with "a16".  The direction buttons have no
+effect on the operation of "search by position".
+.sk1
+3. Search by tag type.
+.sk1
+This positions the cursor at the start of the next tag which has the
+the same type as specified by the type value menu. To change the type,
+select off the menu that pops up when the mouse is clicked on the
+button labeled "Type:". The search can be performed either forwards
+or backwards of the current cursor position. To find all tags, use
+"search by annotation", with a null text value string.
+.sk1
+4. Search by annotation.
+.sk1
+This positions the cursor at the start of the next tag which has a
+comment containing the string specified in the value text window. The
+search performed is a regular expression search, and certain
+characters have special meaning. Be careful when your value string
+contains ".", "*", "[", "^" or "$". The search can be performed either
+forwards or backwards from the current cursor position.
+.sk1
+5. Search by sequence.
+.sk1
+This positions the cursor at the start of the next piece of sequence
+that matches the value specified in the text value window. The search
+is for an exact match, which means the case of value string is
+important. The search is performed on the gel readings themselves,
+rather than the consensus sequence. The search can be performed either
+forwards or backwards from the current cursor position.
+.sk1
+6. Search by problem.
+.sk1
+This positions the cursor at the next place in the consensus sequence
+which is not an "A", "C", "G" or "T". The search can be performed
+either forwards or backwards from the current cursor position.
+.sk1
+7. Search by quality
+.sk1
+This positions the cursor at the next place in the consensus sequence
+where the consensus calculation for each strand disagrees. When only
+sequences on one strand is present, the search will stop at every
+base. The search can be performed either forwards or backwards from the
+current cursor position.
+.left margin1
+.sk3
+Annotation
+.left margin2
+.para
+Parts of a sequence can be annotated, to record the positions of primers used
+for walking, or to mark sites, such as compressions that have caused problems
+during sequencing.
+The consensus sequence CANNOT be annotated.
+.para
+To annotate a piece of sequence first select the part of sequence
+using the mouse buttons. Use the left mouse button to position the start of the
+selection, and while this button is being held down, move the mouse to extend.
+The selection can be extended further using the right mouse button.
+.para
+To create annotation, invoke the pop-up menu, and select the "Create Tag"
+function. A small "tag editor" will appear which
+allows you to select the type of the
+annotation from a pull-down menu, and specify a comment if desired.
+To select a new type pull down the Type menu, and select the entry desired.
+To enter a comment, simply type into the text window in the tag editor.
+The annotation is created when the "Leave" button on the tag editor,
+and is displayed in the colour defined in the tag database file (TAGDB).
+.para
+To edit existing annotation,
+position the cursor with the left mouse button
+on the tag, and select the
+"Edit Tag"
+off the pop-up menu.
+This invokes the tag editor, and changes to the type and comment of the
+annotation can be made. The tag is updated when the "Leave" button is pressed.
+.para
+To delete an existing annotation,
+position the cursor with the left mouse button
+on the tag, and select the
+"Delete Tag"
+off the pop-up menu.
+.left margin1
+.sk3
+NOTE:
+.left margin2
+.para
+As the Contig Editor is a very powerful tool, it is possible that the alignment
+of the gel reading sequences has unexpectedly been disrupted.
+This can easily happen to parts of the contig that lie to the right
+of the screen if excessive use has been made of the "Super Edit" facility.
+Until familiar with "Super Edit" it would benefit the sequencer to quickly
+scan through the contig after editing to check that bad alignments have not
+been created.
+.sp
+.left margin2
+Selecting Oligos
+----------------
+.sk1
+.left margin2
+1. Open the oligo selection window, by selecting "Select Oligo" from
+the contig editor popup menu.
+
+.left margin2
+2. Position the cursor to where you want the oligo to be chosen. While
+the oligo selection window is visible, you will still have complete
+control over positioning and editing within the contig editor.
+
+.left margin2
+3. Indicate the strand for which you require an oligo. This is done by
+toggling the direction arrow ("----->" or "<------"), if necessary.
+
+.left margin2
+3. Press the "Find Oligos" button to find all suitable oligos (See
+"Oligo selection" below.)  Information for the closest oligo to the
+cursor position is given in the output text window. In the contig
+editor the position of the oligo is marked by a temporary tag on the
+consensus. The window is recentered if the oligo is off the screen.
+Selecting "Display Selection Information" will print a short report on
+the numbers of oligos considered and rejected during oligo selection.
+
+.left margin2
+4. If this oligo is not suitable (it may have been previously chosen,
+and found to be unsuitable by experimentation, say), the next closest
+oligo can be viewed by pressing "Select Next".
+
+.left margin2
+5. Suitable templates are automatically identified for the currently
+displayed oligo (See "Template selection" below.) By default, the
+template is that closest to the oligo site. If the choice is not
+suitable (it may be known to be a poor quality template, say) another
+can be chosen from the "Choose Template for this Oligo" menu.
+Templates that do not appear on the menu can be specified by selecting
+"other". However, the template must be on the correct strand and be
+upstream of the oligo.
+
+.left margin2
+6. A tag can be created for the current oligo by pressing the button
+"Create a tag for this oligo". The annotation for this tag holds the
+name of the template and the oligo primer sequence.  There are fields
+to allow the user to specify their own primer name ("serial#") and
+comments ("flags") for this tag. An example of oligo tag annotation:
+.lit
+	serial#=
+	template=a16a9.s1  
+	sequence=CGTTATGACCTATATTTTGTATG
+	flags=
+
+.end lit
+.left margin2
+7. The oligo selection window is closed when "Create a tag for this
+oligo" or "Quit" is selected.
+
+
+.left margin2
+Oligo selection:
+.left margin2
+----------------
+
+.left margin2
+The oligo selection engine is the one used in the program OSP. It is
+described in some detail in:
+
+.left margin2
+	Hillier, L., and Green, P. (1991). "OSP: an oligonucleotide
+	selection program," PCR Methods and Applications, 1:124-128.
+
+.left margin2
+The parameters controlling the selection of oligos can be changed in
+the "Oligo Selection Parameters" window. The weights controlling the
+scoring of selected oligos can be changed in the "Oligo Selection
+Weights" window.
+
+.left margin2
+By default, the oligos are selected from a window that extends 40
+bases either side of the cursor. The size and location of this window
+relative to the cursor position can be changed in the "Parameters"
+window.
+
+.left margin2
+In xbap oligos are ranked according to their proximity to the cursor
+position, rather than by their scores.
+
+
+.left margin2
+Template selection:
+.left margin2
+-------------------
+
+.left margin2
+For simplicity, each reading is considered to represent a template. In
+practise, many readings can be made of the same template. Suitable
+templates that are identified are those that:
+.lit
+
+    1. are in the appropriate sense,
+    2. have 5' ends that start upstream of the oligo,
+and 3. are sufficiently close to the oligo to be useful.
+
+.end lit
+.left margin2
+
+This last criterion relates to the insert size for the subclones used
+for sequencing and the average reading length. A template is
+considered useful if a full reading can be made from it, taking into
+account both of these factors. The default insert size is 1000 bases,
+and the default average reading length is 400 bases. These values can
+be changed in the "Parameters" window.
+
+.left margin1
+@5. TX 1 @Display a contig
+.LEFT MARGIN2
+.para
+Used to show the aligned gel readings for any part of a contig. The 
+
+number, name and strandedness of each gel reading is shown and the 
+
+consensus is written below.
+.para
+If required identify the contig,  and then the start and end points of the
+ 
+region to display.
+.para
+The display can be directed to a disk file using "direct output to disk".
+ 
+.para
+                Below is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gels 6,3,5,17and 12;
+6, 3 and 5 
+are in reverse orientation to their archives (denoted by a minus  sign)
+          There are a few uncertainty codes and a few padding
+          characters in the working versions, but the consensus  (shown  
+below
+          each page width) has a definite assignment for almost every 
+position.
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.PARA
+This option allows users to list text files on the screen. It can be used 
+to read a file containing notes, for checking files written to disk etc. The 
+user is asked to type the name of the file to list.
+.left margin1
+@8. TX 1 @Calculate a consensus
+.LEFT MARGIN2
+.para
+          Calculates a consensus sequence  either for  the whole database or 
+
+for selected contigs. The consensus is written to a file named by the
+ user.
+.left margin2
+Supply a file name, choose between  whole database or selected contigs.
+.para
+          Symbols for uncertainty in gel readings
+.para
+In  order  to  record uncertainties when reading gels the codes shown
+ 
+below can  be  used. Use  of these codes permits us to extract the
+
+maximum amount of data from each gel and yet record any doubts  by
+ 
+choice  of  code.   The program can deal with all of these codes and any 
+ 
+other  characters  in  a  sequence  are  treated  as  dash  (-) characters.
+
+
+.lit
+
+       SYMBOL                  MEANING
+
+         1             PROBABLY        C
+         2                "            T
+         3                "            A
+         4                "            G
+         D                "            C       POSSIBLY        CC
+         V                "            T          "            TT
+         B                "            A          "            AA
+         H                "            G          "            GG
+         K                "            C          "            C-
+         L                "            T          "            T-
+         M                "            A          "            A-
+         N                "            G          "            G-
+         R             A OR G
+         Y             C OR T
+         5             A OR C
+         6             G OR T
+         7             A OR T
+         8             G OR C
+         -             A OR G OR C OR T
+         a             A 
+         c             C 
+         g             G 
+         t             T 
+         *             padding character placed by auto assembler 
+          else = -
+
+.end lit
+
+.LEFT MARGIN2
+                           The DNA consensus algorithm
+.para
+The "calculate consensus" function, the "display  contig" routine and the
+ 
+"show quality" option use  the rules  outlined  here  to  calculate  a 
+ 
+consensus  from aligned gel readings.  Note that "display contig" 
+calculates 
+a consensus for  each  page  width  it displays  (it  does  not use the
+
+consensus sequence file calculated by the consensus function). 
+
+.LEFT MARGIN2
+.para
+We have 6 possible symbols in the consensus sequence: A,C,G,T,* and -. The 
+last symbols is assigned if none of the others makes up a sufficient 
+proportion of the aligned characters at any position in the contig. The 
+following calculation is used to decide which symbol to place in the 
+consensus at each position.
+.para
+Each uncertainty code contributes a score
+to one of A,C,G,T,*  and also to the total at each point. Symbols like R
+and Y which don't correspond to a single base type contribute only to the
+total at each point. The scores are shown below.
+.lit
+              definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+              probable assignments ie 1,2,3,4 = 0.75
+
+              other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+.end lit
+.para
+A cutoff score of 51% to 100% is supplied by the user. (When the program 
+starts this is set to 75%. See "set display parameters").
+At each position in the contig we calculate the total score for each of 
+the 5 symbols 
+A,C,G,T and * (denote these by Xi, where i=A,C,G,T or *), 
+and also the sum of these totals 
+(denote this by S). Then if 100 Xi / S > the cutoff for any i, symbol i is 
+placed in the consensus; otherwise - is assigned.
+.para
+Notice that S does not equal the number of times the sequence has been 
+determined, but is the score total, and hence we are less likely to put a - 
+in the consensus. For the "examine quality" algorithm each strand is
+treated separately but the calculation is the same. (It was originally
+different).
+.para
+Format of the consensus sequence ( and vector sequences).
+.para
+A consensus sequence file may contain the consensus for several contigs
+ 
+and so we identify each of them by preceding them by a 20 character
+ 
+title. The title is of the form <---LAMBDA.0076----> ( where LAMBDA is 
+
+the project name and gel reading number 
+
+
+          76 is the leftmost gel 
+reading to contribute to  this  consensus  sequence).
+
+
+          The  angle  brackets  <>  and the 4 digit number precede by a . 
+
+are important to some processing programs.
+.left margin1
+@25. TX 1 @Show relationships 
+.LEFT MARGIN2
+.para
+   Used to show the relationships of the gel readings in the database in 
+
+three ways -
+.LEFT MARGIN2
+               (a) All contig descriptor lines followed by all gel  descriptor
+               lines.
+.LEFT MARGIN2
+               (b) All contigs one after the  other  sorted,  i.e.   for  each
+               contig  show its  contig descriptor line followed by all its
+               gel descriptor lines sorted on position from left to right
+.LEFT MARGIN2
+               (c) Selected contigs:  show the contig  line  and,  in  order,
+               those gel readings that cover a user-defined region.
+               Note that this output can be directed to a disk file by 
+               prior selection of "redirect output".
+.LEFT MARGIN2
+.para
+                Below is an example showing a contig from position
+          1 to 689.  The left gel reading  is number 6 and has archive 
+name HINW.010, the 
+rightmost gel  reading is number 2 and is has archive name HINW.004.
+On each gel  descriptor  line  is  shown:
+          the name of the archive version, the gel number, the position of the
+          left end of the gel reading  relative to the left  end  of  the  contig,  the
+          length  of  the gel 
+reading  (if this is negative it means that the gel reading  is in
+          the opposite orientation to its archive), the number of the  gel
+reading   to
+          the left and the number of the gel reading  to the right.
+.lit
+
+
+ CONTIG LINES
+ CONTIG      LINE  LENGTH               ENDS
+                                     LEFT   RIGHT
+               48     689               6       2
+ GEL LINES
+ NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                     LEFT   RIGHT
+ HINW.010       6        1   -279       0       3
+ HINW.007       3       91   -265       6       5
+ HINW.009       5      137   -299       3      17
+ HINW.999      17      140    273       5      12
+ HINW.017      12      193    265      17      18
+ HINW.031      18      385   -245      12       2
+ HINW.004       2      401   -289      18       0
+
+.end lit
+.left margin1
+@23. TX 3 @Complement a contig
+.LEFT MARGIN2
+.PARA
+               This function will complement and reverse all of the gel 
+readings in  a
+          contig.    It   automatically  reverses  and  complements  each  gel
+          reading sequence, reorders left and right neighbours, recalculates  
+relative
+          positions and changes each strandedness.
+.PARA
+               The only user  input  required  is  to  identify the  contig  to
+          complement  by  the  number or name of a gel reading it contains.
+DO NOT KILL THE 
+PROGRAM DURING THIS STEP!
+.left margin1
+@22. TX 3 @          Join contigs
+.LEFT MARGIN2
+.PARA
+This function joins contigs interactively using a mouse driven editor.
+The operation of this editor is very similar to the Contig Editor
+described in "Edit".
+
+.para
+It allows the
+user  to align the ends of the two contigs by editing each
+contig separately.  It is important that the alignment  achieved  is
+correct because once the join is completed the alignment is fixed.
+The program needs to know which two contigs to join.
+.para
+First specify which two contigs are to be joined.
+The user should identify the two 
+contigs. 
+The program checks that the two contig numbers are different (it will not 
+allow circles to be formed!)
+.para
+The Join Editor consists of two Contig Editors in between which is sandwiched
+a disagreement box. This disagreement box shows exclamation marks to
+denote mismatches between the two consensuses.
+.para
+For example, the display will look something like this:
+.lit
+
+                         1460      1470      1480      1490      1500
+   56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+   33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+  -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+   19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+       MISMATCH                         !                      !!!!!! 
+                           10        20        30        40        50
+   -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+   -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+   -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+
+.END LIT
+.para
+The overlap must be of at least one character.
+Use the scroll bar and the scroll buttons (`<<',`<',`>',and`>>')
+for positioning the relative positions of the two contigs.
+.para
+The join position can be fixed in position
+by pressing the `lock' button at the top of the Join Editor.
+Locking allows the two contigs to be scrolled as one when using the scroll bar
+and buttons, the left ends always in the same position relative to each
+other.
+.para
+Once locked, it is best to proceed to the right along the contigs, inserting
+padding characters (`*') into the consensuses to minimise the
+disagreements.
+.para
+It is essential that the user aligns the two contigs throughout the whole 
+region of overlap before completing the join because it is only at this 
+stage that the two contigs can be edited independently. Once the join is 
+completed the alignment can only be altered using the routines supplied 
+by "alter relationships".
+.para
+The join can be completed by pressing the `Leave Editor' button. The
+percentage mismatch is displayed, and the user is required to confirm that
+they want to perform the join.
+.left margin1
+@24. TX 1 @               Copy the database
+.LEFT MARGIN2
+.PARA
+Used to make a copy of the database. If required the database size can be 
+
+altered using this option. The "version" of a database is  encoded as the
+ 
+last letter in the names of the five files that contain the database.
+
+.para
+Supply a "version" number (the default is version 1), and if required
+
+select a new size for the database. The size of a database is the number
+ of 
+lines of information it can hold. It needs a line for each gel reading and
+ 
+another for each contig.
+.left margin1
+@19. TX 1 @               Check database
+.LEFT MARGIN2
+.para
+Used to perform a check on  the  logical  consistency  of  the
+          database. No user intervention is required. If selected "with
+dialogue" the program also checks for any sections of the consensus that
+contain 15 dashes in 20 characters.
+.para
+  The following relationships are checked:
+.LEFT MARGIN2
+               1.       If gel reading A thinks gel reading B is its left
+ neighbour
+ 
+does B think A  is
+               its right neighbour?
+                The error message is
+.left margin2
+"Hand holding problem for gel reading A"
+.left margin2
+followed by  the
+               gel descriptor lines for gel readings A and B.
+.LEFT MARGIN2
+               2.       Are there any contig lines with no left or right
+end gel readings?
+                The error message is
+.left margin2
+"Bad contig line number A"
+.LEFT MARGIN2
+               3.       Do the gel readings that are described as left ends on  
+contig
+               lines agree that they are left ends?
+                The error message is
+.left margin2
+"The end gel readings of contig A have outward neighbours"
+.LEFT MARGIN2
+               4.       Are there gel readings that are in more than one contig?
+                The error message is
+.left margin2
+" Gel number A is used N times"
+.LEFT MARGIN2
+               5.       Are there gel readings that are not in any contig?
+                The error message is
+.left margin2
+" Gel number A is not used"
+.LEFT MARGIN2
+               6.       Do the relative positions of  gel readings  agree  with  
+their
+               position as defined by left and right neighbourliness?
+                The error message is
+.left margin2
+" Gel number A with position X is left neighbour of  gel  number  B  with 
+position Y"
+.LEFT MARGIN2
+               7.       Are there any loops in  contigs?   If  so  no  further
+               checking is done.
+                The error message is
+.left margin2
+" Loop in contig n no further checking done, but gel reading numbers follow"
+.left margin2
+   The
+               program  then  prints the gel reading numbers in the looped 
+contig up 
+to
+               the start of the loop.
+.LEFT MARGIN2
+8. Are there any contigs of length <1? The error message is
+.left margin2
+" The contig on line 
+number x has zero length"
+.LEFT MARGIN2
+9. Are there any gel readings (used in only one contig) that have zero 
+
+length? The error 
+message is
+.left margin2
+" Gel number N has zero length"
+.left margin2
+Note that "auto assemble"  also uses this logical consistency check and
+ will 
+only tolerate a "Gel number N
+ is not used" error. Any other error will cause it to 
+
+give up.
+
+.left margin1
+@29. TX 1 @               Examine quality 
+.LEFT MARGIN2
+.para
+Analyses the quality of the data in a contig. It reports on the proportion
+ 
+of the consensus that is "well determined" and will display a sequence of
+ 
+symbols that indicate the quality of the consensus at each position.
+
+.para
+Identify the contig to analyse, and the section of interest. The current
+ 
+consensus calculation cutoff score will be used to decide if each position 
+is 
+"well determined". In general the quality of a reading deteriorates along
+the length of the gel and so it is also possible to use a length cutoff for
+the quality calculation. Only the data from the first section of each reading
+will be included in the quality calculation. The length is altered under
+"set parameters" and is initially set to the maximum reading length.
+A summary showing the percentage of the consensus
+that falls into each category of quality is shown. Choose whether or not to 
+have the quality codes for each position of the consensus displayed. 
+They can be displayed as either graphics or text.
+.para
+The quality of the data depends on the number of times it has been
+
+sequenced and the particular uncertainty codes  used  in each  gel
+
+reading.  This function divides the data into five categories, assigning
+
+each 
+a symbol or code: 
+.LEFT MARGIN2
+                1.  Well determined on both strands and they agree.  code=0
+.LEFT MARGIN2
+                2.  Well determined on the plus strand only.  code=1
+.LEFT MARGIN2
+                3.  Well determined on the minus strand only.  code=2
+.LEFT MARGIN2
+                4.  Not well determined on either strand.  code=3
+.LEFT MARGIN2
+                5.  Well determined on both strands but they disagree.  code=4
+.LEFT MARGIN2
+ A position is "well determined" if it is assigned one of the symbols 
+A,C,G,T when the algorithm described in the section "calculate a 
+consensus".
+The calculation is performed
+separately for each strand.
+.para
+If the user chooses to have the data displayed graphically the following 
+scheme is used. A rectangular box is drawn so that the x coordinate
+represents the length of the contig. The box is notionally
+divided vertically into 
+5 possible levels which are given the y values: -2,-1,0,1,2.
+The quality codes attributed to each base position are plotted as 
+rectangles.
+Each rectangle represents a region in 
+which the quality codes are identical, so a single base having a different 
+code from its immediate neighbours will appear as a very narrow rectangle.
+.lit
+  
+  Rectangle bottom and top y values
+
+     Quality 0 rectangle from 0 to 0
+     Quality 1 rectangle from 0 to 1
+     Quality 2 rectangle from 0 to -1
+     Quality 3 rectangle from -1 to 1
+     Quality 4 rectangle from -2 to 2
+.end lit
+.para
+Obviously a single line at the midheight shows a perfect sequence.
+.para
+Typical dialogue is shown below.
+.lit
+
+   41.47% OK on both strands and they agree(0)
+   55.48% OK on plus strand only(1)
+    2.08% OK on minus strand only(2)
+    0.97% Bad on both strands(3)
+    0.00% OK on both strands but they disagree(4)
+  ? (y/n) (y) Show sequence of codes 
+
+           10         20         30         40         50
+   1111111111 1111111111 1111111111 1111111111 1111111111
+
+           60         70         80         90        100
+   1111111111 1111111111 1111111111 3111111111 1111111111
+
+          110        120        130        140        150
+   1111111111 1111131111 1111111111 1111111111 1111111111
+
+          160        170        180        190        200
+   1111111111 1111111111 1111111111 1111111111 1111111133
+
+          210        220        230        240        250
+   1311111111 1111111111 1111111110 0000000000 0000220000
+
+          260        270        280        290        300
+   0000000000 0020000000 2200000202 0002000000 0000222200
+
+.end lit
+.left margin1
+@26. TX 3 @              Alter relationships
+.LEFT MARGIN2
+.para
+Used to make what are normally illegal changes to the database. That is
+
+the normal checks are not done and any item in the database can be 
+changed independently of all others. Users need to know what they are
+ 
+doing because it is very easy to make a horrible mess. Always start by
+ 
+making a copy!
+.para
+By using the options here users can 
+move one section of a contig relative to another, break contigs, remove 
+contigs, remove gel readings, etc. To give flexibility most
+ of the commands do only one thing. This means that several commands 
+may 
+have to be executed to complete any change. 
+.para
+The following options are offered:
+.lit
+
+   Cancel
+   Line change
+   Check logical consistency
+   Remove contig
+   Shift
+   Move gel reading
+   Rename gel reading
+   Break a contig
+   Remove a gel reading
+   Alter raw data parameters
+
+.end lit
+.left margin2
+1. QUIT returns to the main options of BAP.
+.left margin2
+
+3. Line change 
+.left margin2
+            allows the user to change the contents  of  any line in the
+                
+file of relationships.  The line is selected by number, the
+               program prints the current line and prompts for the new  line.
+
+.left margin2
+4.  Check logical consistency
+.left margin2
+5. Remove a contig
+.left margin2
+This function removes a contig and all its gel readings. The user specifies
+any reading in the contig.
+.left margin2
+6.  Shift   
+.left margin2
+   allows the user to change all the relative  positions of  a  
+               set  of  neighbouring gel 
+readings by some fixed value, i.e.  it will
+               shift related gel readings
+ either left or right.  It can therefore
+               be  used  to  change the alignment of the gel 
+readings in a contig.
+It prompts for the number of the first gel 
+reading to 
+               shift and  then  for the  distance  to  move  them (Note a 
+               negative value will move the gel readings
+ left and a positive value 
+               right).   It  then chains rightwards (ie follows right 
+               neighbours) and shifts each gel 
+reading,  in  turn,  up to the  end  
+               of the contig.  (This means that only those gel readings
+ from the first
+               to shift to the rightmost are moved). It updates the length of 
+               the contig accordingly.
+
+.left margin2
+7. Move gel reading
+.left margin2
+    is a function to renumber a gel reading. It moves all the information 
+               about a gel 
+reading on to another line. The user must specify the 
+number
+               of the gel  reading 
+to move and the number of the line to place it. It 
+               takes care of all the relationships. Of course gel 
+readings must not be
+               moved to lines occupied by other gel 
+readings! 
+
+.left margin2
+8.  Rename gel reading
+.left margin2
+ is a function that is used to  rename  the archive  names  of  
+               gel 
+readings  in the database;  it only changes the name in the .ARN 
+               file of the  database.
+
+.sk1
+.LEFT MARGIN2
+9. Break contig
+.LEFT MARGIN2
+.PARA
+Occasionally it is necessary to break a contig into two parts and this can be 
+achieved using this option. The program needs only the number of a gel 
+reading. This is the gel  reading that will become a left end after the 
+break. That 
+is, the break is made between this gel 
+reading and its left neighbour. A new contig 
+line is created so ensure that there is sufficient space in the database.
+.left margin2
+10. Removing gel readings from contigs
+.left margin2
+.PARA
+Gel 
+readings can be removed from contigs. If they are essential for holding the 
+contig together (ie are the only gel reading covering a particular region),
+the program will create a new contig.
+.sk1
+.LEFT MARGIN2
+11. Alter raw data parameters
+.LEFT MARGIN2
+.PARA
+Allows the user to edit the individual raw data parameters, such as
+the left and right cutoff lengths and the name of the machine readable trace
+file.
+The user must specify the gel line to modify, and provide new values for
+the length of the raw sequence including cutoff lengths, the left cutoff position, the length of the original working sequence, the machine type, and the name
+of the raw data file, where these values change.
+.left margin1
+@27. TX 1 @  Set display parameters
+.LEFT MARGIN2
+.para
+Used to redefine the parameters that control the cutoff employed by the
+
+consensus calculation and quality examiner, the maximum length of each
+reading to include in the quality calculation, the line length used by
+
+the display function, the text window length used by the graphics 
+options, and the graphics window length used by the graphics options.
+.para
+The default cutoff score is 75%. The default line length is 50 characters. 
+For protein sequences the cutoff is always 100%.
+.para
+The text window used by the graphics options controls the amount of 
+sequence listed at the crosshair position. The graphics window controls the 
+"zoom" function. Both these windows are defined as the number of bases that 
+should be shown, to both left and right of the crosshair.
+.left margin1
+@30. TX 3 @  Shuffle pads
+.left margin2
+.para
+One weakness of the alignment strategy used is that padding
+characters are not always aligned by the assembly routine. This function
+attempts to align padding characters using a very simply strategy. It
+does not solve all pad alignment problems but is a useful first step during
+cleaning-up operations.
+.LEFT MARGIN1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears graphics from the screen.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN1
+.para
+ Clears  text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler.
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first base to be marked (for example if the active 
+region is 1501 to 8000, the user might wish to mark every 1000th base 
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@14. TX 2 @Reposition plots
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "ANALMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+Note that, unlike all the other programs, the boxes used to contain 
+analytical results (eg plot quality) should not be made to overlap one 
+another, as the function of the crosshair routine depends on which box the 
+crosshair is in!
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=15
+Type label then drive cross hair to left or right end
+of label position then hit  "L"  to  write label left
+justified or  "R"  to  write label right justified or
+the space bar to quit
+ 
+ 
+? Label=delta gene
+
+ missing graphics 
+
+? Label=
+ 
+.end lit
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+This is disabled!
+.left margin1
+@7. TX 1 @Redirect output
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file and
+to create postscript output.
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@13. TX 2 @Use crosshair
+.left margin2
+.para
+This option puts a steerable cross on the screen which the user 
+drives around 
+by using the arrow keys (or mouse). When the crosshair is 
+visible a number of options are available if the user types one of a 
+set of special keyboard characters. Any other characters will cause 
+an exit from the crosshair option. The special keys are:
+.lit
+
+    I = Identify the nearest gel reading
+    Z = Zoom in
+    Q = plot Quality
+    S = display the aligned Sequences at the crosshair position
+    N = list the Names and Numbers of the sequences at the crosshair
+.end lit
+.para
+In order for any of these special keys to operate, the crosshair 
+must be in an appropriate display box, and the precise function of 
+the keys will also depend on which box the crosshair is in.
+.para
+ If the 
+crosshair is in the "plot all contigs" box, Z will cause a new box to 
+appear showing all the readings for the nearest contig; Q will give 
+the same as Z but will also produce an extra box showing the 
+"quality" plot.
+.para
+ If Z is hit in the "plot single contig" box, the contig will be zoomed 
+to the current graphics window size. The zoom will be roughly 
+centred on the crosshair position. Because of this it is possible to 
+step along a contig by repeatedly zooming with the crosshair near 
+to one end of the single contig display box. If I is hit the crosshair 
+must be close to a gel reading line. If Q is hit, the quality plot will 
+be produced for the region shown in the plot single contig box. In 
+all cases when the "plot all contigs" box is shown, a vertical line will 
+bisect the line the represents the relevant contig, at the current 
+position.
+.para
+If the crosshair is in the plot quality box only the character "s" will operate 
+as a special symbol.
+.para
+The number of bases shown in the N and S options is controlled by 
+the current graphics text window size, and the size of the zoom 
+window by the current graphics window size. Both are set by the 
+parameter setting function of the general menu.
+.left margin1
+@33. TX 2 @Plot single contig
+.left margin2
+.para
+This option produces a schematic of a selected region of a single 
+contig by drawing a horizontal line to represent each of its gel 
+readings. The lines show the relative positions of each reading and 
+also their sense. The plot is divided vertically into two sections by 
+a line that is identified by an asterisk drawn at each end. All lines  
+that lie above this line represent readings that are in their original 
+sense, all lines below show readings that are in the 
+complementary sense to their original. By use of the crosshair 
+function the plot can be stepped through and examined in more 
+detail. See help on crosshair.
+.left margin1
+@34. TX 2 @Plot all contigs
+.left margin2
+.para
+This option produces a schematic of all the contigs in a database. It 
+does this by drawing a horizontal line to represent each of them. 
+In order to show the ends of each contig it draws the lines for 
+contigs at alternate heights: the first at height one, the 
+second at height two, the third at height one, etc. The order of the 
+contigs in the display is the same as their order in the database. 
+By use of the crosshair function the plot can be stepped 
+through and examined in more detail. See help on crosshair.
+.left margin1
+@31. TX 3 @ Disassemble readings
+.left margin2
+.para
+This function is used to remove a list of readings from a database, or
+to create a new contig from a single reading moved from an existing contig.
+This latter mode is useful for repositioning a reading in a repeat:
+once separated it can be placed in the join editor and scrolled by the
+other copies.
+Removal of sets of readings works in two modes: 
+1. A set of adjacent readings in a
+contig can be removed by the user naming the two end ones; or 2. A batch
+of readings from any number of contigs can be defined by the user naming
+a file containing a list of reading names. The program cleans up the
+database by moving data to fill up any holes made in the files.
+.para
+For both modes of operation the program will ask for a file of file names.
+If users create their own file (ie mode 2) each reading NAME must be on
+a separate line. For mode 1 the user types the NAMES of the leftmost
+and rightmost readings to be removed. They and all intervening readings 
+will be removed. Note that the routine operates on reading names - not 
+numbers. For both modes, if necessary, new contigs will be created.
+.left margin1
+@35. TX 1 3 @Find internal joins
+.left margin2
+.para
+The purpose of this function is to use data already in the database to
+find possible joins between contigs.
+Joins may have been missed due to poor data or may have not been made
+due to repeated sequences. Where appropriate, it may be 
+possible to find potential
+joins by using the "unused data" derived from sequencing machines.
+.left margin2
+For all overlaps found when the X version is used,
+ the contig editor (in join mode) will be
+called up with the two contigs aligned.
+.left margin2
+The database is checked for logical consistency. Supply a minimum initial
+match length, a minimum alignment block, the maximum pads per sequence,
+the maximum percent mismatch after alignment, the probe length. Choose
+if clipped data is to be used, if so define the window size for finding good
+data and the number of dashes allowed in the window. Processing will commence.
+Most of these values are used in an identical way in the autoassemble 
+function. The others are defined below. 
+.left margin2
+The program strategy
+.left margin2
+Take the first contig and calculate its consensus. If clipped data is being
+used examine all readings that
+are in the complementary orientation, and sufficiently near to the contigs left
+end, to see if they have good clipped sequence which if present, would 
+protrude 
+from the left end of the contig. If found add the longest such sequence to the 
+left end of the consensus. Do the same for the right end by examining  
+readings that are in their
+original orientation. If any are found add the longest extension to the 
+right end of
+the consensus. Repeat the consensus calculations and extensions 
+for all contigs hence producing an extended consensus. If clipped data is not
+being used simply calculate the consensus for the whole database. Now
+look for possible joins by processing the extended consensus in the following
+way. Take the last, say 100, bases (termed the "probe length" by the program)
+of the rightmost consensus, compare it both
+orientations with the extended consensus of all the other contigs. Display
+any sufficiently good alignments. Repeat with the left end of the rightmost
+contig. Do the same for the ends of all the entended contigs, always only
+comparing with the contigs to their left, so that the same matches do not 
+appear twice.
+.left margin2
+Good cliped data is defined by sliding a window of "Window size for good data
+scan" bases outwards
+along the sequence and stopping when "Maximum number of dashes in scan window"
+ or more dashes appear in the window.
+Note that
+it is advisable to have some sort of cutoff because if we simply take all the
+data it might be so full of rubbish that we wont find any good matches. For
+the same reason it is worth trying the procedure with different cutoffs. An
+initial run using no clipped data is also recommended.
+Sufficiently good
+alignments are defined by criteria equivalent to those used in autoassemble,
+however here we only display alignments that pass all tests.
+.left margin2
+Bugs
+.left margin2
+If a small contig is wholly contained within a larger one, such that its
+ends are further than ("Probe length" - "Minimum initial match length")
+from the ends of the larger contig, and the consensus for the small 
+contig lies to the left
+of the consensus for large contig, the overlap will not be discovered. (See
+the search stratgey).
+.left margin2
+ All numbering is
+relative to base number one in the contig: matches to the left (i.e. in
+the clipped data) have negative
+positions, matches off the right end of the contig (i.e. in the clipped
+data) have positions 
+greater than that of the contig length.
+The convention for reporting the positions of overlaps is as follows: if neither
+contig needs to be complemented the positions are as shown. If the program says
+"contig x in the - sense" then the positions shown assume contig x has been 
+complemented. For example in the results given below the positions for the 
+first overlap are as reported, but those for the second assume that the contig
+in the minus sense (i.e. 443) has been complemented.
+.lit
+
+
+ Possible join between contig   445 in the + sense and contig   405
+ Percentage mismatch after alignment =  4.9
+        412        422        432        442        452        462
+     405  TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAG,TT AGCTCACTCA
+           ********* * ********  ***** *** ********** ********** **********
+     445  -TTCCCGACT G,AAAGCGGG TAGTGA,CGC AACGCAATTA ATGTGAG-TT AGCTCACTCA
+       -127       -117       -107        -97        -87        -77
+        472        482        492        502        512
+     405  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+          ********** ********** ********** ********** **
+     445  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+        -67        -57        -47        -37        -27
+ Possible join between contig   443 in the - sense and contig   423
+ Percentage mismatch after alignment = 10.4
+         64         74         84         94        104        114
+     423  ATCGAAGAAA GAAAAGGAGG AGAAGATGAT TTTAAAAATG AAACG-CGAT GTCAGATGGG
+          **** ***** ********** ********** ******  ** ***** **** ********* 
+     443  ATCG,AGAAA GAAAAGGAGG AGAAGATGAT TTTAAA,,TG AAACGACGAT GTCAGATGG,
+       3610       3620       3630       3640       3650       3660
+        124        134        144        154        164
+     423  TTG-ATGAAG TAGAAGTAGG AG-AGGTGGA AGAGAAGAGA GTGGGA
+          *** ****** ********** ** *******  *** ***** ** ** 
+     443  TTGGATGAAG TAGAAGTAGG AGGAGGTGGA ,GAG,AGAGA GTTGG-
+       3670       3680       3690       3700       3710
+
+
+.end lit
+.left margin1
+@36. TX 3 @Double strand
+.left margin2
+.para
+PLEASE MAKE A COPY OF THE DATABASE BEFORE USING THIS OPTION AS IT HAS
+CURRENTLY HAD VERY LITTLE TESTING.
+.para
+Uses the cutoff data to change single stranded sections of a contig into
+double stranded sections. Data is used carefully to try and minimise the
+number of data disagreements created. However it must be noted that an overall
+slight degradation in quality will still occur.
+.para
+When using this option you will be prompted for a contig and a region within
+that contig. The default region is the entire contig. The option will then
+search through the region for areas of good data on one strand and cutoff data
+on the opposite strand, extending the cutoff data. The criteria for evaluating
+the amount of cutoff data to be used is based upon a maximum number of
+mismatches and a score (derived by accumulating points for mismatches (-8),
+matches(+1) and insertions (-5) over the length of an alignment). The defaults
+are:
+.lit
+
+maximum mismatches      :  6
+
+score for mismatch      : -8
+score for correct match : +1
+score for insertion     : -5
+.end lit
+.para
+Note that with successive calls to this option it is possible to double strand
+more and more data. Naturally however the quality of the data generated will
+diminish each time.
+.left margin1
+@37. TX 3 @Auto-select oligos
+.left margin2
+.para
+PLEASE MAKE A COPY OF THE DATABASE BEFORE USING THIS OPTION AS IT HAS
+CURRENTLY HAD VERY LITTLE TESTING.
+.para
+Generates a file (default "primers") of suggested primers to use for covering
+a single stranded section or for walking off the end of a contig. The file
+generated contains the gel reading name, the primer sequence, it's offset in
+the contig and the orientation. An example file would be :
+.lit
+
+c81d12.s1 TTGTCTGTAAGCGGATG (@ 6449 ) +
+c98a10.s1 ATTATCACTTTACGGGTC (@ 6959 ) +
+c81c1.s1 CAAGAAGGCGATAGAAG (@ 7643 ) +
+c76a10.s1 CCTCATCCTGTCTCTTG (@ 8441 ) +
+c81g4.s1 ATGAAACCTGGGCGTTG (@ 16156 ) +
+c91e6.s1 GTTTTCAGATGTCGGAG (@ 18249 ) +
+c81e12.s1 GCTACCGTAAAACACTTC (@ 18737 ) +
+c93h11.s1 GCTGCTTTTTGTTTTATCC (@ 19158 ) +
+c81h6.s1 CTTCCACTTCTTTCTTATC (@ 21210 ) +
+c86a12.s1 CGAATGATAAAGACAAATCAG (@ 22122 ) +
+c98b1.s1 GCCACTTTATCCGAGAC (@ 3048 ) -
+c97c5.s1 GTGTTTTGGGTATATTGTG (@ 3371 ) -
+c83d2.s1 CTACACAGAATGAACCC (@ 3768 ) -
+c78h10.s1 GGCGGTGAAGATTGAAG (@ 4200 ) -
+c98h9.s2dt CTCGTTTAAATTTCAAACTTCC (@ 7419 ) -
+c95a9.s1 ATTGGAAGGAAGGAGGG (@ 22996 ) -
+c82b4.s1 TGTAGCCGAAATCTTCC (@ 23369 ) -
+.end lit
+.para
+This is best employed after having previously used the 'Double strand' option.
+When selecting the option you will be asked for the contig, a region within
+this contig and the file to write the list of primers to. For each primer
+suggested a tag is automatically created containing details of the gel reading
+name and the sequence. Preferably the tag will be created on the gel reading
+from which the primer was selected. However this is not always possible so
+failing that the tag will be on another sequence overlapping the primer
+position.
+.para
+When invoked with the dialogue option you will be asked a couple more
+questions relating to the position and size of the consensus checked for
+suitable oligos. You will be prompted for the start and end of a region
+(default 40-140) at a relative position to the left of our initial region.
+.para
+For example:
+.lit
+
+? Menu or option number=d37
+ Auto-select oligos
+ Default Contig identfier=/e97f2.s1
+ ? Contig identfier=
+ ? Start position in contig (1-20942) (1) =10000
+ ? End position in contig (10000-20942) (20942) =11000
+ Default Name of file for primers=primers
+ ? Name of file for primers=
+ ? Start of oligo choice region (1-1024) (40) =50
+ ? End of oligo choice region (50-1024) (150) =150
+
+.end lit
+.para
+This implies that we are going to look for oligos to use as primers covering
+the region 10000 to 11000. For each single stranded section in this region we
+search for the oligos at between 50 and 150 to the left. So if we had a single
+stranded section from 10121 to 10295 we would search for oligos in the region
+9971 to 10071.
+.left margin1
+@38. TX 1 @Check assembly
+.left margin2
+.para
+This new function is used for checking the positioning of assembled readings.
+It is useful for checking sequences that contain repeats
+of length similar to that of a single gel reading. It takes the poor 
+quality data for each reading and compares it to the segment of the consensus
+to which it should align. 
+If the extension of the
+read does not match the consensus then the read (or its neighbours) has 
+probably been assembled into the wrong place. 
+The program displays the bad alignments.
+The quality of an alignment is defined by the percentage mismatch.
+Naturally the user should select a value that takes into account 
+the poor quality of the data being aligned.
+.para
+When the routine is used from the X version the
+user is offered the editor to examine poor alignments. 
+  If alignments are reported as poor, but on inspection are OK, the user
+can set a tag so that the poor quality data is ignored on subsequent passes
+through the routine. Note however such data will then also be ignored by
+the automatic double stranding routine!
+.para
+ The user defines the percentage mismatch; the window size and number of
+dashes allowed in the window used for selecting the amount of the poor data
+to be employed; can choose to save the names of the poorly aligned reads
+in a file; can select an individual contig or scan the whole database.
+The file containing the names of the poorly aligned reads can be used by 
+the disassembly routine to remove them from the database, and then can be used
+to reassemble them. Note that the routine complements each contig twice 
+during processing.
+
+.left margin1
+@39. TX 1 @Find read pairs
+.left margin2
+.para
+This new function is used to check the positions of readings taken from each
+end of the same template. For each forward read it searches for a corresponding
+reverse reading. The search can be over the whole database or over a single contig.
+The results can be presented graphically for single contig searches and the crosshair
+function can be used to identify the readings displayed.
+.para
+Note that at present the function only knows that two reads are from the same template
+by comparing reading names. For our local projects we use the following naming 
+convention: forward reads are named abcdefgh.s1 and reverse reads abcdefgh.r1. The 
+program expects this naming convention and so if it finds read fred.s1 and fred.r1 it
+assumes they are the forward and reverse reads for template fred. In the future we
+will make the routine more general!
+.para
+If a single contig is selected and the output is listed the program displays two
+lines for each pair: the first line shows the reading name, its position and length,
+and the distance between the extremeties of the two reads; the second line shows the
+other read name, its position and length. If there are pairs that are in separate contigs
+or are facing away from one another they are listed after the pairs that face inwards.
+Is this true?
+.para
+If the results are plotted the full length of the template is drawn with arrows 
+indicating the direction of reads and the extent of each reading. Those reads that have
+their partner in another contig are marked by asterisks.
+.para
+Typical dialogue is shown below.
+.lit
+
+ ? Select contigs (y/n) (y) = 
+ Default Contig identifier=/i55d8.s1
+ ? Contig identifier=
+ ? Start position in contig (1-15227) (1) =
+ ? End position in contig (1-15227) (15227) =
+ ? Plot results (y/n) (y) = n
+    852 k23a1.r1            249   238  1615
+    806 k23a1.s1           1529  -335
+    238 i68e6.s1            422   193  1632
+    868 i68e6.r1           1756  -298
+    576 k17a2.s1           2370   213  1676
+    885 k17a2.r1           3790  -256
+     84 k27g6.s1           3456   291  1777
+    867 k27g6.r1           4905  -328
+    453 k01g10.s1          5805   142  1251
+    881 k01g10.r1          6909  -147
+    781 i98b8.r1           6754   338  1079
+     10 i98b8.s1           7653  -180
+    883 k02d11.r1          7327   276  1597
+    283 k02d11.s1          8726  -198
+    269 i68f9.s1           8191   169  1055
+    777 i68f9.r1           8891  -355
+    710 i91c6.s1           8245    95  1516
+    780 i91c6.r1           9403  -358
+    596 k27d12.s1           136   329  -329
+    219 k27d12.r1             1  -116
+    159 k27d11.r1          1830  -263  -263
+    317 k27d11.s1          2902   343
+    886 k17g11.r1          7107  -123  -123
+    647 k17g11.s1          1867   265
+    851 i69g10.r1          8045  -137  -137
+    277 i69g10.s1          4658   174
+.end lit
+.para
+If contigs are not selected the pairs are sorted on their separations.
+.lit
+
+ ? Select contigs (y/n) (y) = n
+ i68f2.s1            27  1781  1777
+ i68f2.r1           776   111  1777
+ k17f6.s1           601    60  1706
+ k17f6.r1           856  1405  1706
+ k17a2.s1           576  2370  1676
+ k17a2.r1           885  3790  1676
+ k27g3.s1           177 14985  1664
+ k27g3.r1           889 13564  1664
+.
+.
+ k27b12.s1          764     1  1086
+ k27b12.r1          857   932  1086
+ i98b8.s1            10  7653  1079
+ i98b8.r1           781  6754  1079
+ k16a3.s1           748  1276  1070
+ k16a3.r1           784   472  1070
+ k17b7.r1           786 14937 18942*
+ k17b7.s1           787  3601 18942*
+ k27d12.r1          219     1 15208*
+ k27d12.s1          596   136 15208*
+ k01g2.s1           502    87 14754*
+ k01g2.r1           782  9224 14754*
+
+.end lit
+
+.left margin1
+@ end of help
diff --git a/help/DAP.RNO b/help/DAP.RNO
new file mode 100644
index 0000000..7bcfa2d
--- /dev/null
+++ b/help/DAP.RNO
@@ -0,0 +1,2724 @@
+.npa
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. TX  0 @Modification
+.sp
+@0.  TX -1 @SAP
+.left margin2
+.PARA
+This is help information for the X Windows version of SAP.
+Currently it is being brought up to date with the new features in XDAP.
+The accuracy of this help should therefore not be assumed.
+.PARA
+This is an interactive program whose primary use is
+for managing shotgun sequencing projects, but it can also be used for 
+handling alignments of other sequences, including those of proteins. 
+Currently the maximum 'gel reading' length is set to 4096 characters. 
+Almost all of the information below describes the use of the program for 
+shotgun projects, but those using the programs for handling other 
+sequence 
+alignments should interpret it accordingly.
+The data for such a project is stored in a special type of database. The 
+program
+ contains the tools that are required to type in gel readings,
+screen them against vector sequences and restriction sites; 
+enter new gel 
+readings into the database (automatically comparing and aligning
+them). In addition it contains editors and functions to examine the quality 
+of the aligned sequences.
+.para
+ There are three main menus: "general", "screen" and  "modification", 
+and some functions have submenus.
+.left margin2
+.lit
+  The general menu contains the following options:
+
+       Open a database
+       Display a contig
+       List a text file
+       Direct output
+       Calculate a consensus
+       Screen against restriction enzymes
+       Screen against vector
+       Check database
+       Copy database
+       Show relationships
+       set parameters
+       Highlight disagreements
+       Examine quality
+       Find internal joins
+
+The graphics menu contains:
+
+       Clear graphics
+       Clear text
+       Draw ruler
+       Use cross hair
+       Change margins
+       Label diagram
+       Plot map
+       Plot single contig
+       Plot all contigs
+
+
+The modification menu contains:
+
+       Edit contig
+       Auto assemble
+       Join contigs
+       Complement a contig
+       Alter relationships
+       Extract gel readings
+
+
+The alter relationships menu contains:
+
+       Cancel
+       Line change
+       Edit single gel reading
+       Delete contig
+       Shift
+       Move gel reading
+       Rename gel reading
+       Break contig
+       Alter raw data parameters
+
+.END LIT
+.SK1
+.para
+Overview of the methodology
+.para
+The shotgun sequencing strategy
+.para
+               In the shotgun sequencing procedure
+the sequence to be determined is randomly broken into fragments of 
+about 
+400 nucleotides in length. These fragments are cloned and then 
+selected randomly and their
+
+          sequences   determined.    The  relationship  between  any  pair  of
+
+          fragments is not known beforehand
+but is found by comparing their  sequences.
+
+          If  the  sequence  of  one found to be wholly or partially contained
+
+          within that of another  for  sufficient  length  to  distinguish  an
+
+          overlap  from  a repeat then those two fragments can be joined.  
+The
+
+          process of select, sequence and compare is continued until the 
+whole
+
+          of  the  DNA  to  be  sequenced is in one continuous well 
+determined
+
+          piece.
+
+.para
+          Definition of a contig
+
+.para
+               A CONTIG is a set of gel  readings  that  are  related  to  one
+          another  by  overlap of their sequences.  All gel readings belong to
+          a contig and each contig contains at  least  one  gel
+          reading.   The  gel  readings in a contig can be summed to produce 
+a continuous consensus sequence and the length of this sequence is 
+the length  of the contig.  The rules used to perform this summation are
+          given under "the consensus algorithm".
+          At any stage
+          of  a  sequencing project the data will comprise a number of 
+contigs;
+when  a  project  is
+
+          complete  there  should be only one contig and its consensus will be
+          the finished sequence. Note that since being introduced and 
+defined as above the word "contig" has been taken up by those involved in 
+genomic mapping. In that context the consensus with a  precise length is not 
+defined.
+
+.SK1
+.LEFT MARGIN2
+Introduction to the computer method
+.LEFT margin2
+.PARA
+It is useful to consider the objectives of a sequencing project before 
+outlining how we use the computer to help achieve them. The aim of a 
+shotgun sequencing project is to 
+produce an accurate consensus sequence from many overlapping gel 
+readings.
+It is necessary to know, particularly at the latter
+stages of the project, how accurate the 
+consensus sequence is. This enables us to know which regions of the
+ sequence require further work and also to know when the project is 
+finished.
+To show the quality of the consensus, the programs described here
+produce displays like that shown below.
+.sk1
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.para
+                This is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gel readings 
+numbered 6, 3, 5, 17 and 12;
+6, 3 and 5 
+are in reverse orientation to their original reading (denoted by a minus  
+sign). Each gel reading also has a name (eg HINW.010). It can be seen that 
+in a number of places the sequences contain characters other than A,C,G 
+and 
+T. Some of these extra characters have been used by the sequencer to 
+indicate regions of uncertainty in the initial interpretation of the gel 
+reading, but the asterisks (*) have been inserted by the automatic 
+assembly function in order to align the sequences. Underneath each 50 
+character block of gel reading sequences is the consensus derived from 
+the 
+sequences aligned above (the line labelled CONSENSUS). For most of its 
+length the consensus has a definite nucleotide assignment but in a few 
+positions there is insufficient agreement between the gel readings and
+so a dash (-) appears in the sequence. This display contains all the 
+evidence needed to assess the quality of the consensus: the number of 
+times 
+the sequence has been determined on each strand of the DNA, and the 
+individual nucleotide assignments given for each gel reading.
+.para
+So the aim is to produce the consensus sequence and, equally important, 
+a display of the experimental results from which it was derived.
+.para
+In order to achieve this the following operations need to be performed:
+.left margin2
+1) Put individual gel readings into the computer.
+This might involved the manual interpretation of autoradiographs
+or the transfer and process of machine-readable files from fluorescent
+sequencing machines.
+.left margin2
+2) Check each gel reading to make sure it is not simply part of one of the 
+vectors used to clone the sequence.
+.left margin2
+3) Check each gel reading to make sure that those fragments that span 
+the 
+ligation point used prior to sonication are not assembled as single 
+sequences.
+.left margin2
+4) Compare all the remaining gel readings with one another to assemble 
+them 
+to produce the consensus sequence.
+.left margin2
+5) Check the quality of the consensus and edit the sequences.
+.left margin2
+6) When all the consensus is sufficiently well determined, produce a copy 
+of 
+it for processing by other analysis programs.
+.para
+It is very unlikely that this procedure will only be passed through once.
+Usually steps 1 to 5 are cycled through repeatedly, with step 4 just 
+adding 
+new sequences to those already assembled. Generally step 6 is also used 
+in 
+order to analyse imperfect sequence to check if it is the one the project 
+intended to sequence, or to look for interesting features. Analysis of 
+the consensus, such as 
+searches for protein coding regions,
+can also help to find errors in the sequence. The display of the 
+overlapping gel readings shown above can be used to indicate, not only 
+the 
+poorly determined regions, but also which clones should be resequenced 
+to 
+resolve ambiguities, or those which can usefully be extended or 
+sequenced 
+in the reverse direction, to cover 
+difficult regions.
+
+.PARA
+The original
+individual gel readings for a sequencing project are each stored in 
+separate files. As the gel readings are entered into the computer
+(usually in batches, say 10 
+from a film), the file names they are given are stored in 
+a further file, called a file of file names. Files of file names  
+enable gel readings to be processed in batches. 
+.para
+For each sequencing project 
+we start a project database. This database has a structure specifically 
+designed for
+dealing with shotgun sequence data. 
+In order to arrive at the final consensus sequence many operations will 
+be 
+performed on the sequence data. Individual fragments must be 
+sequenced and 
+compared in both senses (i.e. both orientations) with all the other 
+sequences. When an overlap between a new gel reading and a contig are 
+found 
+they must be aligned and the new gel reading added to the contig. If a 
+new 
+gel reading overlaps two contigs they must be aligned and joined. Before 
+the two contigs are joined one of them may need to be turned around 
+(reversed and complemented) so they are both in in the same orientation. 
+.para
+Clearly, keeping track of all these manipulations is quite complicated,
+and to be able to perform the operations 
+quickly requires careful choice of data 
+structure and algorithms. For these reasons it is not practicable to store 
+the gel readings aligned as shown in the display above. Rather, it is more 
+convenient to store the sequences unassembled, and to record sufficient 
+information for programs to assemble them during processing. The 
+data used to assemble the sequences is called relational information. 
+.left margin2
+.PARA
+ The database comprises five files and they are described under the 
+section entitled "open database".
+.PARA
+Before entry into the project database 
+each new gel reading must be compared to look for overlaps 
+with all the data already contained 
+within the database. This last point is 
+important: all searching for overlaps is between individual new gel 
+readings and the data already in the database. There is no searching for 
+overlaps between sequences within the database; overlaps must be found 
+before new gel readings are entered into the database.
+.para
+Below I give an introduction to how the sequences are processed by 
+being 
+passed from one function to the next.
+.para
+This program is used to start a 
+database for the project and 
+then the following procedure is used.
+.para
+Data in the form of individual gel readings are entered into the computer
+ 
+and stored in separate files using either program this program or the digitizer 
+
+program. Batches 
+of these gel readings 
+are passed to the screening functions in this program to search for overlaps 
+
+with vector sequences ("screen against vector") or for matches to 
+
+restriction enzyme sites  that should not be
+ 
+present ("screen against enzymes"). 
+Each run of these screening functions passes on only those gel
+ 
+readings that do not contain unwanted sequences. Sequences are passed 
+
+via 
+files of file names and eventually are processed by the automatic 
+assembly function ("auto assemble"). This function compares each gel 
+reading with a consensus of all the previous gel readings 
+stored in the database.
+If it finds any 
+overlaps
+ it aligns the overlapping sequences by inserting padding characters, 
+and then adds the new gel reading to the database. 
+Gels that overlap are added to existing contigs and gels that do not 
+overlap any data in the database start 
+new contigs. If a new gel overlaps two contigs they are joined. 
+Any gel readings that appear to overlap but which
+cannot be aligned sufficiently well are not entered and have 
+their names written to a file of failed gel reading names.
+.PARA
+Generally data is entered 
+into the database in batches as just described. The program
+ is also used to examine
+
+the data in the database, to enter gel readings that the automatic
+ 
+assembly function cannot align ("auto assemble"),
+
+ and to make final edits. Edits to whole contigs 
+
+can  be made in several ways.
+A mouse-driven editor ("edit contig") is used to perform all edits manually.
+Disagreements between gel readings 
+
+in contigs and their consensus 
+
+sequences can be highlighted by use of the function "highlight
+ 
+disagreements". 
+.PARA
+Editing the sequences is obviously an essential part of managing a
+
+sequencing project. 
+Editing is required when new 
+
+sequences are added, when contigs are joined, and when sequences are 
+
+corrected.
+A basic part of the strategy
+
+used here is that new
+ 
+gel readings should be correctly aligned throughout their whole length
+
+when 
+they are entered into the database, and that when contigs are joined they
+ 
+are edited so that they are well aligned in the region of overlap.
+
+ Alignment can be achieved by
+
+adding padding characters to the sequences, and this is the way "auto
+ 
+assemble"  
+operates when adding new sequences to the database.
+
+.para
+In order to search 
+for overlaps that may have been missed due to errors in 
+
+the gel readings, the function "extract gel readings" can be used to take 
+
+copies of the gel
+ 
+readings at the ends of contigs, and write them out as separate files.
+ 
+These can then be compared with the database consensus using the "auto 
+
+assemble" function in a mode that forbids entry of data into the 
+database,
+and any gel reading matching two contigs will indicate a join that has 
+
+been 
+missed. The joins can then be made interactively using "join contigs". 
+
+Missed matches can be 
+
+found at this stage because the errors in the sequences may have been 
+
+corrected by new data.
+
+.para
+Generally the users need not concern themselves with how the relational 
+information is used by the program, but it is necessary to know
+how contigs are identified. Because contigs are constantly being changed and 
+reordered the program identifies them by the numbers of the gel readings 
+they contain. Whenever users need to identify a contig they need only 
+know 
+the number or name of one of the gel readings it contains. Whenever the 
+program asks users to identify a contig or gel reading they can type its 
+number or its archive name. If they type its archive name they must precede 
+the name by a slash "/" symbol to denote that it is a name rather than a 
+number. E.g if the  archive
+name is fred.gel with number 99, users should
+type /fred.gel or 99 when asked to identify the contig. Generally,
+ when it asks for the gel reading to be identified,
+the program will offer the user a default name,
+ and if the user types only return, that 
+contig will be accessed. When a database is opened the default contig will 
+be the longest one, but if another is accessed, it will subsequently  become 
+the current default.
+.para
+Further information is located in the following places. 
+The database files are described under "open database". The format 
+for 
+vector and consensus sequences is given under "calculate a consensus", as are 
+the 
+uncertainty codes used in gel readings. 
+.left margin2
+.para
+There are two programs,
+other than this, relevant to sequencing are the digitizer 
+program  and the trace editor program, both is outlined briefly below.
+.para
+         The digitiser program
+is used for the initial input of gel readings
+and for writing a file of file names. The program
+uses a digitizer for data entry.
+A digitizer is
+  a  two  dimensional  surface such as a light box
+which is such that if a special pen is pressed onto it, the pens 
+coordinates are recorded by a computer.
+These coordinates
+          can be interpreted by a program.
+.para
+               In order to read an autoradiograph placed on the light box
+the user  need  only  define the bottom of
+the four sequencing lanes and the bases
+          to which they correspond and then use  the  pen  to  point  to  each
+          successive  band progressing up the gel.  The program examines 
+the
+          coordinates of each pen position to see in which of the  four  
+lanes
+          it  lies  and  assigns  the  corresponding  base to be stored in the
+          computer.  Each time the pen tip is depressed to point to a position
+          on  the  surface of the digitizer the program sounds the bell on the
+          terminal to indicate to the user that a point has been recorded.  As
+          the  sequence  is read the program displays it on the screen.
+.para
+	The trace editor program
+is used for the initial processing of data obtained from
+fluorescent sequencing machines. It allows the user to visually
+select left and right cutoff positions to denote the start and end of good
+data. Users may also edit the sequence at this point.
+Output from ted is a sequence file in Staden format with headers that
+describe to xdap the cutoff information.
+
+.left margin1
+@17. TX 1 @Screen against enzymes
+.left margin2
+.PARA
+Used to compare gel readings against any restriction enzyme recognition
+ 
+sequences that may have been used  during cloning and which should not 
+
+be present in the data. Works on single gel readings or processes batches
+ 
+accessed through files of file names. The algorithm looks for exact 
+
+matches to recognition sequences stored in a file. 
+
+.para
+The file containing the recognition sequences must be identified. The
+user 
+must choose between employing a file of file names, or typing in the
+
+ 
+names of individual gel reading files. If a file of file names is used the
+
+
+program will also create a new file of file names. When the option has
+ 
+finished operating this new file will contain the names of all those gel
+ 
+readings that did not match any of the recognition sequences. Hence it
+ can 
+be used for further processing of the batch. The recognition sequences 
+
+should be stored in a simple text file with one recognition sequence per 
+
+line.
+.left margin1
+@18. TX 1 @Screen against vector
+.left margin2
+.PARA
+Used to compare gel readings against any vector sequences that may have 
+
+been picked up during cloning. Works on single gel readings or processes 
+
+batches accessed through files of file names. The algorithm looks for
+exact 
+matches of length "minimum match length" and displays the overlapping 
+
+sequences.
+.para
+The file containing the vector sequence must be identified. The user must
+
+choose between employing a file of file names, or typing in the names of
+ 
+individual gel reading files. If a file of file names is used the program 
+will 
+also create a new file of file names. When the option has finished 
+
+operating  this new file will contain the names of all those gel readings
+
+that did not match the vector sequence. Hence it can be used for further 
+
+processing of the batch. The vector sequence should be stored in a simple
+ 
+text file with up to 80 characters of data per line. More than one vector
+ 
+can be stored in a single file. If so each should be preceded by a 20 
+
+character title of the form <---m13mp8.001-----> where the < and >
+ signs 
+and the number like .001 are obligatory. The number must be preceded 
+
+by a dot (.) and be 3 digits long. The total sequence in the file must be <
+
+50,001 characters in length.
+
+.left margin1
+@20. TX 3 @Auto assemble
+.left margin2
+.PARA
+Compares gel readings against the current contents of the database and 
+
+produces  alignments. In its normal mode of operation 
+("entry permitted"), the function
+will automatically enter the gel readings into the database, but if entry 
+is not permitted it will only produce alignments. It works on 
+
+single gel readings or processes batches of gel readings accessed through 
+
+files of file names. It is the usual way to enter data into the database.
+
+.para
+The function will check the database for logical consistency and will
+ only 
+proceed if it is OK. Choose if gel readings should be entered into the
+ 
+database, or if they should only be compared. Choose between using a file 
+
+of file names or typing file names on the keyboard. If so selected, supply 
+
+the file of file names. Also supply a file of file names to contain the names of
+ 
+all the gel readings that fail to get entered. 
+Select the entry mode. Normal assembly is appropriate for all but special 
+cases, as is "permit joins". Uses for the other modes are not documented 
+here.
+Define a minimum initial
+ 
+match length. Define a minimum alignment block (the default value is
+ 
+taken in all but exceptional circumstances). Define the maximum number
+ 
+of padding characters allowed to be used in each gel reading to help
+
+achieve alignment, and the same for the number allowed in the contig for
+ 
+each gel reading. Finally define the maximum percentage mismatch to
+be allowed for any gel reading to be entered into the database. If
+ 
+for any gel reading, either of these last three values is exceeded the gel
+ 
+reading will not be entered into the database.
+
+.para
+In operation the function takes a batch of gel readings (probably  passed
+
+          on  as  a file of file names from one of the screening routines) and 
+enters them into a
+          database for a sequencing project.  It takes each  gel reading
+  in  turn,
+          compares  it with the current consensus for the database, it then
+          produces an  alignment  for  any  regions  of  the  consensus  it
+          overlaps;   if  this alignment is sufficiently good it then edits
+          both the new gel reading and the sequences it overlaps  and  adds  
+the
+          new  gel reading to the database.  The program then updates the 
+consensus
+          accordingly and carries on to the next  gel  reading.
+.para
+          All alignments are displayed and any gel readings 
+that do match but  that
+
+          cannot be aligned sufficiently well have their names written to a
+          file of failed gel reading names.  The function works without  any
+
+          user intervention and can process any number of gel readings in a
+          single run.  Those gel readings that fail can be recompared using
+
+          the same function (to find the current overlap position) and  the
+  
+user  can enter them into the database
+
+          manually  using  the   "enter new gel reading" option.
+.para
+Typical dialogue and output from the function is shown below. (Note that 
+output for gel readings 2 - 9 has been deleted to save space).
+.lit
+Automatic sequence assembler
+Database is logically consistent
+? (y/n) (y) Permit entry 
+? (y/n) (y) Use file of file names 
+? File of gel reading names=demo.nam
+? File for names of failures=demo.fail
+Select entry mode
+X  1 Perform normal shotgun assembly 
+   2 Put all sequences in one contig 
+   3 Put all sequences in new contigs
+? Selection  (1-3) (1) =
+? (y/n) (y) Permit joins 
+? Minimum initial match (12-4097) (15) =
+? Minimum alignment block (2-5) (3) =
+? Maximum pads per gel (0-25) (8) =
+? Maximum pads per gel in contig (0-25) (8) =
+? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+  >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+  Processing           1 in batch
+  Gel reading name=HINW.004                                
+  Gel reading length=   283
+  Searching for overlaps
+  Strand     1
+  Strand     2
+  No matches found
+  Total matches found           1
+  Padding in contig=    0 and in gel=    1
+  Percentage mismatch after alignment =  1.8
+  Best alignment found
+         1         11         21         31         41         51
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         ********** ********** ********** ********** ********** **********
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         1         11         21         31         41         51
+        61         71         81         91        101        111
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+         ********** ********** ********** ********** ********** ***** ****
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+        61         71         81         91        101        111
+       121        131        141        151        161        171
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         ********** ********** ********** ********** ********** **********
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+       121        131        141        151        161        171
+       181        191        201        211        221        231
+         GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         ********** ********** * ******** ********** ********** **********
+         GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+       181        191        201        211        221        231
+       241        251        261        271        281
+         CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+         ****  **** ********** ** * ***** ********** ****
+         CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+       241        251        261        271        281
+          Batch finished
+          9 sequences processed
+          0 sequences entered into database
+          0 joins made
+
+.end lit
+
+.para
+Note that "auto assemble" cannot align protein sequences.
+.left margin1
+@28. TX 1 @Highlight disagreements
+.left margin2
+.para
+Used in the latter stages of a project
+to highlight disagreements between individual gel readings
+and their consensus sequences. Characters that agree with the
+ 
+consensus are shown as : symbols for the plus strand and . for the minus
+ 
+strand. Characters that disagree with the consensus are left unchanged 
+
+and so stand out clearly. The results of this analysis are written to a 
+file.
+
+.para
+Before selecting this option create a file of the display of the contig to 
+be 
+"highlighted". The option will ask for the name of this file. Select
+ symbols 
+to denote "agreeing" characters on each strand, the defaults are : and ., 
+
+but any others can be used. Supply the name of a file in which to put
+ 
+the output.
+.para
+The display file needed as input for this option is created by selecting 
+
+"Redirect output",  followed immediately by  "display contig", and then 
+"Redirect output" again. The 
+
+cutoff score used in the consensus calculation can be set by option "set
+
+display parameters". Note that for the highlight function
+there is a limit of 50 for the number of gel 
+readings that are aligned at any position - ie the contig must be less 
+than 51 gel readings deep at its thickest point. I hope that those performing
+shotgun sequencing never reach this limit, but those using the program for
+comparing sequence families might.
+.para
+Typical output from this function is shown below.
+.lit
+                                                                     
+                          210       220       230       240       250
+    1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+    7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+   -4  HINW.017                                 ...............AC....
+                   G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+                                                                     
+                          260       270       280       290       300
+    1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+    7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+   -4  HINW.017    ..............................................A...
+    3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+   -6  HINW.028                            ......................A...
+                   AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+                                                                     
+.end lit
+.left margin1
+@32. TX 3 @Extract gel readings
+.left margin2
+.para
+Used to make copies of the aligned gel readings in a database,
+to write them into separate files, and to write a 
+
+corresponding file of file names. It operates in two modes: either all gel
+ 
+readings are extracted, or only those at the ends of contigs. 
+
+.para
+Choose which mode of operation is required and supply a file of file 
+
+names. 
+.para
+The gel readings are given their original 
+
+names. 
+If used to extract the gel readings from the ends of contigs the function
+ is 
+useful for checking for missed contig joins: the file of file names can be 
+
+used with the auto assemble function to recompare these gel readings, 
+
+and each should only overlap one contig. Any that overlap two contigs
+
+will identify possible joins.
+.para
+If the option is used to extract all the gel readings from a database, a 
+
+subsequent run of "auto assemble" can reconstitute a database which has
+
+been corrupted. This  rarely occurs and is usually necessitated by a
+ 
+user  employing "alter relationships"  incorrectly without first having
+ 
+made a copy. 
+.left margin1
+@1. TX 0 @Help
+.left margin2
+.PARA
+Help is available on the following topics :
+
+.LEFT MARGIN1
+@2. TX 0 @Quit
+.LEFT MARGIN2
+.PARA
+This command stops the program and is the only safe way to terminate a 
+
+run 
+of the program that has altered the contents of the database in any way.
+
+.left margin1
+@3. TX 1 @Open a database
+.LEFT MARGIN2
+.PARA
+Opens existing databases or allows new ones to be started. The function
+ is 
+automatically called into operation 
+when the program is started but can also be selected 
+
+from the general menu.
+.para
+Choose to open an existing database or start a new one, or if ! is typed 
+when the program is first started, enter the program without opening a 
+database. Supply a project
+ 
+database name, and if it already exists, the "version". If starting a new
+
+database define the database size and if it is for DNA or protein sequences.
+The database size is an initial size for the database. It can be increased 
+later during the project. It is the sum of the number of gel
+readings plus the number of contigs.
+.para
+Database names can have from one to 12 letters and must not include full
+ 
+stop (.). The database is made from five separate files. If the database
+ is 
+called FRED then version 0 of database FRED comprises files FRED.AR0, 
+
+FRED.RL0, FRED.SQ0, FRED.TG0 and FRED.CC0. The version is the last symbol in the file names.
+
+Only this program
+ can read these files. If the "copy database" option is used it
+ 
+will ask the user to define a new "version". 
+.para
+For normal use the maximum gel reading length is set to 512 characters,
+
+but when a database is started the user may choose lengths of either
+
+ 512, 
+1024, 1536..., 4096. Normally the program is used to handle DNA 
+
+sequences but many of the functions also work on protein sequences. The
+ 
+choice of sequence type is made when the database is started.
+
+.para
+The contigs are not stored on the disk as the user sees them displayed on
+
+the screen. Each gel reading is stored with sufficient information about
+ 
+how it overlaps other gel readings so that the program can work out how
+ 
+to 
+present them aligned on the screen. We refer to this extra data as "the 
+relationships" and it is explained below.
+ 
+The database comprises 5 separate files.
+
+.left margin2
+          1.  a working version of each gel reading.  This is the  version  of
+          the  gel  reading
+that is in the database and initially it is an exact copy of
+          the original sequence (known as the archive)
+ but it is edited and manipulated to align  it
+          with other gel readings.
+
+.left margin2
+          2.  the file of  relationships.   This  file  contains  all  of  the
+
+          information  that  is required to assemble the working versions 
+into
+
+          contigs during processing;  any manipulations on the data  use  this
+
+          file   and  it  is  automatically  updated  at  any  time  that  the
+
+          relationships are changed.  The  information  in  this  file  is  as
+
+          follows:
+.left margin2
+          (A) Facts about  each  gel reading  and  its  relationship  to  
+others  
+("gel
+
+          descriptor lines"):
+
+.left margin2
+             (a) the number of the gel 
+reading   (each gel reading   is given a number  as  it  is
+
+          entered into the database)
+
+.left margin2
+             (b) the length of the sequence from this gel reading  
+
+.left margin2
+             (c) the position of the left end of this gel 
+reading   relative to the left
+
+          end of the contig of which it is a member
+
+.left margin2
+             (d) the number of the next gel 
+reading   to the left of this gel reading  
+
+.left margin2
+             (e) the number of the next gel reading   to the right
+
+.left margin2
+             (f) the relative strandedness of this gel 
+reading  , ie whether  it  is  in
+
+          the same sense or the complementary sense as its archive.
+
+.left margin2
+          (B) Facts about each contig ("contig descriptor lines"):
+
+.left margin2
+             (a) the length of this contig
+
+.left margin2
+             (b) the number of the leftmost gel 
+reading   of this contig
+
+.left margin2
+             (c) the number of the rightmost gel reading   of this contig.
+
+.left margin2
+          (C) General facts:
+
+.left margin2
+             (a) the number of gel readings in the database
+
+.left margin2
+             (b) the number of contigs in the database.
+
+.left margin2
+          3.  the file of archive names.  This is simply a list of  the  names
+
+          of each of the archive files in the database but on line number 
+
+          1000 we also store the size of the database. ie the number of lines 
+
+          of information allowed in the database files. This file always has 
+
+          1000 lines but the length of the file of relationships and the file 
+
+          of working versions can be set by the user when creating a 
+database
+
+          or when copying from one to another.
+.left margin2
+	  4. the file of tags (annotation). 
+This consists of linked lists of tag information for each sequences in the
+database.
+Tags are created by the user as annotation, or by xdap as records of edits or
+for storing cutoff information.
+As the number of tags can grow without limit, so can this file.
+For each gel there is a header record, which contains the record number of
+the start of the linked list for that gel. On line IDBSIZ there is a record
+containing information about the file such as its present length and if there
+are any free "tag" slots to be reused in the file.
+
+	  5. the file of comments (annotation).
+This consists of linked lists of comment fragments.
+Comments are created by the user as a message attached to annotation,
+or by the system to store cutoff information.
+Comments are character strings of any length.
+Comments longer than 40 characters are broken up into fragments, each 40
+characters long, and are chained together in a link list.
+As the number of comments can grow without limit, so can this file.
+
+.para
+          Structure of the database files
+.para
+          1.  The file of relationships
+.para
+	      The file contains IDBSIZ lines of data:
+          the general data are stored on line IDBSIZ;   data about  gel 
+readings  are
+          stored  from  line 1 downwards;  data about contigs are stored from
+          line IDBSIZ-1 upwards. A database of 500 lines containing 25 gel 
+readings and 4 contigs would have a file 
+          of relationships as is shown below.
+.lit
+
+
+                  ---------------------------------------------
+                     1  Gel descriptor record
+                     2   "      "       "
+                     3   "      "       "
+                     4   "      "       "
+                     5   "      "       "
+                     '   '      '       '
+                     '   '      '       '
+                    25   "      "       "
+                    26  Empty record
+                     '    '     '
+
+                     '    '     '
+                   495    '     '
+                   496  Contig descriptor record
+                   497    "        "        "   
+                   498    "        "        "
+                   499    "        "        "
+                   500   Number of gel readings=25, Number of contigs=4
+                  ---------------------------------------------
+
+          The arrangement of the data in the file of relationships
+
+.end lit
+As each new gel reading   is added into the database a new line is  added
+          to  the  end  of  the  list  of gel descriptor
+ lines.  If this new gel  reading  does not
+          overlap with any gel readings
+ already in the database a new contig  line  is
+          added  to  the top of the list of contig lines.  If it overlaps with
+          one contig then no new contig line need be added but if it  overlaps
+          with  two  contigs  then  these  two  contigs must be joined and the
+          number of contig lines will be reduced by one. Then the list of 
+contig
+          lines is compressed  to  leave  the empty line at the top of the list.
+          Initially the two types of line will move towards  one  another  but
+          eventually,  as  contigs  are joined, the contig descriptor lines will
+          move in the same direction as the  gel descriptor
+ lines.   At  the  end  of  a
+          project  there should  be only one contig line.  The database is thus
+          capable of handling a project of 998 gels.
+.para
+          2.  Structure of the working versions file
+.para    
+        The working versions of gel readings are stored in  a  file  of
+          IDBSIZ lines each containing 512 characters.  Gel reading 
+number 1 is stored on line
+          1, gel reading number  2 on line 2 and so on.
+.para
+          3.  Structure of the archive names file
+.para
+          This file, unlike the others, always has 1000 lines each 10
+          characters in length. Its length is fixed because line 1000 is used 
+          to store IDBSIZ the database size and the programs need a definite
+          location from which to read this number.
+.para
+          4.  Structure of the tag file
+.para
+This file initially starts with IDBSIZ lines, and is expanded as new tags are
+created.
+Information about the length of the file, and which tag records are reusable
+is stored on line IDBSIZ.
+A database of 500 lines would have a file of tags as shown below.
+.lit
+
+                  ---------------------------------------------
+                     1  Tag descriptor record
+                     2   "      "       "
+                     3   "      "       "
+                     4   "      "       "
+                     5   "      "       "
+                     '   '      '       '
+                     '   '      '       '
+                   497   "      "       "   
+                   498   "      "       "
+                   499   "      "       "
+                   500   Length of file=N, Free list=0
+		   501  Tag record
+		   502   "   "
+		   503   "   "
+		     '   '   '
+		     '   '   '
+		   N-2   "   "
+		   N-1   "   "
+		     N  Tag record
+                  ---------------------------------------------
+
+          The arrangement of the data in the file of relationships
+
+.end lit
+As each new tag is added to the database, a check is made in the
+file descriptor record at line IDBSIZ. If the list of reusable records is 0,
+the file is extended by one line. Otherwise the new tag is assigned to
+record at the head of the freelist.
+When tags are deleted, they are added to the free list in the file descriptor
+record.
+.para
+          5.  Structure of the comment file
+.para
+This file initially starts with 1 line, and is expanded as new annotation is
+created.
+Information about the length of the file, and which comment records are reusable
+is stored on the first line.
+.lit
+
+                  ---------------------------------------------
+                     1  Length of file=N, Free list=0
+                     2  Comment fragment
+                     3   "       "
+                     4   "       "
+                     '   '       '
+                     '   '       '
+		   N-2   "       "
+		   N-1   "       "
+		     N  Comment fragment
+                  ---------------------------------------------
+
+          The arrangement of the data in the file of relationships
+
+.end lit
+As each new comment is added to the database, a check is made in the file
+descriptor record at line 1. If the list of reusable records is 0,
+the file is extended to hold the new comment. Otherwise the new comments is
+assigned to records starting with the head of the freelist.
+When comments are deleted, the discarded records are added to the free list in
+the file descriptor record.
+.para
+  There  are  various  checks  within  the  programs  to
+          protect users from themselves:-
+.left margin2
+               1.  All user input is checked for errors - e.g.   reference  to
+               non-existent  gel 
+readings or  contigs,  incorrect  positions in the
+               contig or gel readings.
+.left margin2
+               2.  Before entering a gel reading the system checks to see if a
+               file of the same name has already been entered.
+.left margin2
+               3.  Join will not allow the circularising of a contig.
+.left margin2
+               4.        Both enter and join  functions  restrict  the  region
+               that  the  user  is  allowed to edit (using edit contig) to the
+               region of overlap.
+.left margin2
+5. Users may escape from any point in the program.
+.left margin2
+6. Help is available from all points in the program.
+.SK2
+.LEFT MARGIN2
+IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS 
+DOING 
+ANYTHING THAT INVOLVES CHANGING THE CONTENTS OF THE 
+DATABASE. I.E DURING AUTO ASSEMBLE,
+COMPLETE ENTRY, COMPLETE JOIN, COMPLEMENT CONTIG, EDIT CONTIG, AND SCREEN 
+EDIT.
+This could 
+corrupt the database so badly that it is impossible to fix. The program 
+should always be left using the QUIT option.
+
+.left margin1
+@4. TX 3 @Edit contig
+.LEFT MARGIN2
+.PARA
+The Contig Editor is a mouse-driven editor that can insert,
+delete and change gel reading sequences.
+.para
+The Contig Editor allows scrolling from one end of a contig to the other
+using the scroll bar and scroll buttons. Action of mouse button presses
+when the mouse pointer is in the scroll bar:
+.sk1
+.lit
+    Middle Mouse Button      Set editor position
+    Left   Mouse Button      Scroll forward one screenful
+    Right  Mouse Button      Scroll backwards one screenful
+.end lit
+.sk1
+The four scroll buttons operate as follows:
+.sk1
+.lit
+    "<<"                     Scroll left half a screenful
+    "<"                      Scroll left one character
+    ">"                      Scroll right one character
+    ">>"                     Scroll right half a screenful
+.end lit
+.para
+The Editor cursor can be positioned anywhere in the edit window by
+moving the mouse pointer over the character of interest, then pressing the
+left mouse button. The Editor cursor can also be moved by using the
+direction arrow keys.
+.para
+The editor operates in two main edit modes - Replace and Insert. Replace allows
+a character to be replaced by another. Insert allows characters to be
+inserted into a gel reading sequence. Characters are entered by typing
+them from the keyboard. Only valid characters are permitted.
+Characters can be deleted by positioning the cursor one character to the right,
+then pressing the delete key.
+Normally Insert and Delete apply to the consensus line of the contig ONLY. 
+This restraint can be overridden by using the "Super Edit" mode of
+operation, THOUGH IT IS NOT RECOMMENDED.
+.para
+Edits can also be performed on the consensus, though they are
+restricted to insertion and deletion of padding characters ("*").
+These edits also have special meanings.
+A deletion will delete ALL characters at the position to the left
+of the cursor in the contig, and move the relative positions of all
+sequences starting to the right of the cursor position left one
+character.
+An insertion will insert the character typed ("*") into ALL gel
+reading sequences at the cursors position in the contig, and move the
+relative positions of all sequences starting to the right of the cursor
+position right one character.
+.para
+The effect of the last edit can be undone by pressing the "Undo" button
+at the top of the editor window.
+.para
+The cursor will automatically be positioned at the next problem when the
+"Find Next Problem" button is selected. The next problem is where the
+consensus shows either an ambiguity ("-") or a pad ("*") character.
+.para
+The edits to the contig can be saved by pressing the "Leave Editor"
+button and replying "Yes" to the prompt to "Save changes?". As no changes
+are made to the working copy of your database til this point it
+is possible to abort the editor if
+the edit session ends up in an unsatisfactory state (ie if you've
+stuffed it up!)
+.left margin1
+.sk3
+Displaying Traces
+.left margin2
+.para
+The original data from which the gel reading sequences where derived can
+be seen by double clicking (two quick clicks) with the middle mouse button
+on the area of interest. The trace will be displayed with the point
+clicked at the centre of the trace viewport.
+.para
+All traces that are displayed are maintained in one window, called the Trace
+Manager. The Trace Manager will only display four traces maximum. When four
+traces are already being managed and a new one is requested, the one at the top
+of the Trace Manager is removed and the new one is added to the bottom.
+Traces can be removed individually by using the "quit" button in the panel next
+to the trace.
+.left margin1
+.sk3
+Extending Reads Using Cutoff Information
+.left margin2
+.para
+Sequence data read in from Automated Fluorescent sequencing machines
+trace files processed through the program ted
+will have the discarded sequence (vector at start and poor read at
+end) available to the contig editor. To display the cutoff
+information, press the "Display Cutoff" button at the top of the
+editor window.
+The cutoff sequence appears in grey. This sequence can be incorporated
+into the editable sequence, by moving the cutoff position. This is
+done by positioning the cursor at the end of the gel sequence, and
+using Meta-Left-Arrow and Meta-Right-Arrow to adjust the point of cutoff.
+The Meta key is a diamond on the Sun keyboard.
+.left margin1
+.sk3
+Pop-up menu
+.left margin2
+.para
+A pop-up menu is revealed by depressing the "Control" key on the keyboard
+and at the same time pressing the left mouse button. The menu has the following
+functions:
+.lit
+
+    Search
+    Save Contig
+    Create Tag
+    Edit Tag
+    Delete Tag
+
+.end lit
+"Save Contig" is described above.
+Searching and operations on tags are described below.
+.left margin1
+.sk3
+Searching
+.left margin2
+.para
+Selecting "Search" brings up a
+window which can remain present during normal editor operation. The
+window allows the user to select the direction of search, the type of
+search and a value to search on.  The value is entered into the value
+text window. Then pressing the "search" button
+performs the search. If successful, the cursor is positioned and
+centred accordingly. An audible tone indicates failure.  Pressing the
+"ok" button removes the search window. The search window is
+automatically removed when the contig editor is exited.
+.sk1
+There are seven different search modes:
+.sk1
+1. Search by position
+.sk1
+This positions the cursor at the numeric position specified in the
+value text window. Eg a value of "1234" causes the cursor to be placed
+at base number 1234 in the contig. Positioning withing a gel reading is
+achieved by prefixing the number with the "@" character, eg "@123"
+positions the cursor at base 123 of the sequence in which the cursor
+lies. Relative positions can be specified by prefixing the number with
+a plus or minus character. Eg "+1234" will advance the cursor 1234
+bases. If possible, the cursor is positioned within the same sequence.
+The direction buttons have no effect on the operation of "search
+by position".
+.sk1
+2. Search by reading name
+.sk1
+This positions the cursor at the left end of the gel reading specified
+in the value text window. If the value is prefixed with a slash is is
+assumed to be a gel reading name. Otherwise it is assumed to be a gel
+reading number. Eg "123" positions the cursor at the left end of gel
+reading number 123. "/a16a12.s1" positions at the start of reading
+a16a12.s1. If the value was "/a16" the cursor is positioned at the
+first reading which starts with "a16".  The direction buttons have no
+effect on the operation of "search by position".
+.sk1
+3. Search by tag type.
+.sk1
+This positions the cursor at the start of the next tag which has the
+the same type as specified by the type value menu. To change the type,
+select off the menu that pops up when the mouse is clicked on the
+button labeled "Type:". The search can be performed either forwards
+or backwards of the current cursor position. To find all tags, use
+"search by annotation", with a null text value string.
+.sk1
+4. Search by annotation.
+.sk1
+This positions the cursor at the start of the next tag which has a
+comment containing the string specified in the value text window. The
+search performed is a regular expression search, and certain
+characters have special meaning. Be careful when your value string
+contains ".", "*", "[", "^" or "$". The search can be performed either
+forwards or backwards from the current cursor position.
+.sk1
+5. Search by sequence.
+.sk1
+This positions the cursor at the start of the next piece of sequence
+that matches the value specified in the text value window. The search
+is for an exact match, which means the case of value string is
+important. The search is performed on the gel readings themselves,
+rather than the consensus sequence. The search can be performed either
+forwards or backwards from the current cursor position.
+.sk1
+6. Search by problem.
+.sk1
+This positions the cursor at the next place in the consensus sequence
+which is not an "A", "C", "G" or "T". The search can be performed
+either forwards or backwards from the current cursor position.
+.sk1
+7. Search by quality
+.sk1
+This positions the cursor at the next place in the consensus sequence
+where the consensus calculation for each strand disagrees. When only
+sequences on one strand is present, the search will stop at every
+base. The search can be performed either forwards or backwards from the
+current cursor position.
+.left margin1
+.sk3
+Annotation
+.left margin2
+.para
+Parts of a sequence can be annotated, to record the positions of primers used
+for walking, or to mark sites, such as compressions that have caused problems
+during sequencing.
+The consensus sequence CANNOT be annotated.
+.para
+To annotate a piece of sequence first select the part of sequence
+using the mouse buttons. Use the left mouse button to position the start of the
+selection, and while this button is being held down, move the mouse to extend.
+The selection can be extended further using the right mouse button.
+.para
+To create annotation, invoke the pop-up menu, and select the "Create Tag"
+function. A small "tag editor" will appear which
+allows you to select the type of the
+annotation from a pull-down menu, and specify a comment if desired.
+To select a new type pull down the Type menu, and select the entry desired.
+To enter a comment, simply type into the text window in the tag editor.
+The annotation is created when the "Leave" button on the tag editor,
+and is displayed in the colour defined in the tag database file (TAGDB).
+.para
+To edit existing annotation,
+position the cursor with the left mouse button
+on the tag, and select the
+"Edit Tag"
+off the pop-up menu.
+This invokes the tag editor, and changes to the type and comment of the
+annotation can be made. The tag is updated when the "Leave" button is pressed.
+.para
+To delete an existing annotation,
+position the cursor with the left mouse button
+on the tag, and select the
+"Delete Tag"
+off the pop-up menu.
+.left margin1
+.sk3
+NOTE:
+.left margin2
+.para
+As the Contig Editor is a very powerful tool, it is possible that the alignment
+of the gel reading sequences has unexpectedly been disrupted.
+This can easily happen to parts of the contig that lie to the right
+of the screen if excessive use has been made of the "Super Edit" facility.
+Until familiar with "Super Edit" it would benefit the sequencer to quickly
+scan through the contig after editing to check that bad alignments have not
+been created.
+.left margin1
+@9. T 3 @Screen edit
+.LEFT MARGIN2
+.para
+THIS OPTION IS NO LONGER AVAILABLE IN XDAP. USE EDIT CONTIG
+.para
+Gives access to the system editor on the machine (for example EDT on a VAX) 
+and allows users to edit contigs. The contigs are presented as for
+"display contig" and the program will 
+reconstitute the contig's sequences and relationships  when the editor is 
+exited.
+.para
+To screen edit a contig set the line length to 50 characters,
+select the contig to edit, and supply the name of a temporary file in which 
+the editing will be performed.
+After a short pause the system 
+editor will present the first page of the file. Edit the file obeying the 
+rules given below. Exit from the editor and affirm the intention of 
+returning the contig to the database. The program will put the contig 
+back into the database.
+.para
+Rules for screen editing
+.para
+There are some limitations on the changes that can be made to the contigs 
+when using the screen editor. Users are unlikely to want to break the 
+rules 
+in order  to achieve changes to contigs, but nevertheless the 
+constraints need to be defined and they are given below.
+.para
+Alignments must be maintained during editing.
+Whole lines of sequence should not be deleted or added unless the 
+order 
+of the gel readings in the contig is preserved.
+Each line in the 
+contig display consists of gel reading numbers, their names and 50 
+character sections of sequence. Insertions are limited in the following 
+way.
+No line of sequence can be extended rightwards more than 10 characters
+beyond the end of a full length line (a full length line is 50 characters 
+long). Only one character can be added to the left end of full length 
+lines, but sections of sequence beginning further into a line
+ can be extended leftwards up to an equivalent position. Do not delete any 
+non-sequence lines in the file.
+.para
+Before returning the contig to the database the program checks that the 
+rules have been obeyed. If an error is found the number of the erroneous 
+line in the 
+file is displayed and the contig will not be changed.
+.left margin1
+@5. TX 1 @Display a contig
+.LEFT MARGIN2
+.para
+Used to show the aligned gel readings for any part of a contig. The 
+
+number, name and strandedness of each gel reading is shown and the 
+
+consensus is written below.
+.para
+If required identify the contig,  and then the start and end points of the
+ 
+region to display.
+.para
+The display can be directed to a disk file using "direct output to disk".
+ 
+These files are required by options: "screen edit" and "highlight 
+
+disagreements", and printed copies of them
+are very useful for marking corrections prior to 
+
+using the editors.
+.para
+                Below is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gels 6,3,5,17and 12;
+6, 3 and 5 
+are in reverse orientation to their archives (denoted by a minus  sign)
+          There are a few uncertainty codes and a few padding
+          characters in the working versions, but the consensus  (shown  
+below
+          each page width) has a definite assignment for almost every 
+position.
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.PARA
+This option allows users to list text files on the screen. It can be used 
+to read a file containing notes, for checking files written to disk etc. The 
+user is asked to type the name of the file to list.
+.left margin1
+@8. TX 1 @Calculate a consensus
+.LEFT MARGIN2
+.para
+          Calculates a consensus sequence  either for  the whole database or 
+
+for selected contigs. The consensus is written to a file named by the
+ user.
+.left margin2
+Supply a file name, choose between  whole database or selected contigs.
+.para
+          Symbols for uncertainty in gel readings
+.para
+In  order  to  record uncertainties when reading gels the codes shown
+ 
+below can  be  used. Use  of these codes permits us to extract the
+
+maximum amount of data from each gel and yet record any doubts  by
+ 
+choice  of  code.   The program can deal with all of these codes and any 
+ 
+other  characters  in  a  sequence  are  treated  as  dash  (-) characters.
+
+
+.lit
+
+       SYMBOL                  MEANING
+
+         1             PROBABLY        C
+         2                "            T
+         3                "            A
+         4                "            G
+         D                "            C       POSSIBLY        CC
+         V                "            T          "            TT
+         B                "            A          "            AA
+         H                "            G          "            GG
+         K                "            C          "            C-
+         L                "            T          "            T-
+         M                "            A          "            A-
+         N                "            G          "            G-
+         R             A OR G
+         Y             C OR T
+         5             A OR C
+         6             G OR T
+         7             A OR T
+         8             G OR C
+         -             A OR G OR C OR T
+         a             A set by auto edit
+         c             C set by auto edit
+         g             G set by auto edit
+         t             T set by auto edit
+         *             padding character placed by auto assembler 
+          else = -
+
+.end lit
+
+.LEFT MARGIN2
+                           The DNA consensus algorithm
+.para
+The "calculate consensus" function, the "display  contig" routine and the
+ 
+"show quality" option use  the rules  outlined  here  to  calculate  a 
+ 
+consensus  from aligned gel readings.  Note that "display contig" 
+calculates 
+a consensus for  each  page  width  it displays  (it  does  not use the
+
+consensus sequence file calculated by the consensus function). 
+
+.LEFT MARGIN2
+.para
+We have 6 possible symbols in the consensus sequence: A,C,G,T,* and -. The 
+last symbols is assigned if none of the others makes up a sufficient 
+proportion of the aligned characters at any position in the contig. The 
+following calculation is used to decide which symbol to place in the 
+consensus at each position.
+.para
+Each uncertainty code contributes a score
+to one of A,C,G,T,*  and also to the total at each point. Symbols like R
+and Y which don't correspond to a single base type contribute only to the
+total at each point. The scores are shown below.
+.lit
+              definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+              probable assignments ie 1,2,3,4 = 0.75
+
+              other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+.end lit
+.para
+A cutoff score of 51% to 100% is supplied by the user. (When the program 
+starts this is set to 75%. See "set display parameters").
+At each position in the contig we calculate the total score for each of 
+the 5 symbols 
+A,C,G,T and * (denote these by Xi, where i=A,C,G,T or *), 
+and also the sum of these totals 
+(denote this by S). Then if 100 Xi / S > the cutoff for any i, symbol i is 
+placed in the consensus; otherwise - is assigned.
+.para
+Notice that S does not equal the number of times the sequence has been 
+determined, but is the score total, and hence we are less likely to put a - 
+in the consensus. For the "examine quality" algorithm each strand is
+treated separately but the calculation is the same. (It was originally
+different).
+.para
+Format of the consensus sequence ( and vector sequences).
+.para
+A consensus sequence file may contain the consensus for several contigs
+ 
+and so we identify each of them by preceding them by a 20 character
+ 
+title. The title is of the form <---LAMBDA.076-----> ( where LAMBDA is 
+
+the project name and gel reading number 
+
+
+          76 is the leftmost gel 
+reading to contribute to  this  consensus  sequence).
+
+
+          The  angle  brackets  <>  and the three digit number precede by a . 
+
+are important to some processing programs.
+.left margin1
+@25. TX 1 @Show relationships 
+.LEFT MARGIN2
+.para
+   Used to show the relationships of the gel readings in the database in 
+
+three ways -
+.LEFT MARGIN2
+               (a) All contig descriptor lines followed by all gel  descriptor
+               lines.
+.LEFT MARGIN2
+               (b) All contigs one after the  other  sorted,  i.e.   for  each
+               contig  show its  contig descriptor line followed by all its
+               gel descriptor lines sorted on position from left to right
+.LEFT MARGIN2
+               (c) Selected contigs:  show the contig  line  and,  in  order,
+               those gel readings that cover a user-defined region.
+               Note that this output can be directed to a disk file by 
+               prior selection of "disk output".
+.LEFT MARGIN2
+.para
+                Below is an example showing a contig from position
+          1 to 689.  The left gel reading  is number 6 and has archive 
+name HINW.010, the 
+rightmost gel  reading is number 2 and is has archive name HINW.004.
+On each gel  descriptor  line  is  shown:
+          the name of the archive version, the gel number, the position of the
+          left end of the gel reading  relative to the left  end  of  the  contig,  the
+          length  of  the gel 
+reading  (if this is negative it means that the gel reading  is in
+          the opposite orientation to its archive), the number of the  gel
+reading   to
+          the left and the number of the gel reading  to the right.
+.lit
+
+
+ CONTIG LINES
+ CONTIG      LINE  LENGTH               ENDS
+                                     LEFT   RIGHT
+               48     689               6       2
+ GEL LINES
+ NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                     LEFT   RIGHT
+ HINW.010       6        1   -279       0       3
+ HINW.007       3       91   -265       6       5
+ HINW.009       5      137   -299       3      17
+ HINW.999      17      140    273       5      12
+ HINW.017      12      193    265      17      18
+ HINW.031      18      385   -245      12       2
+ HINW.004       2      401   -289      18       0
+
+.end lit
+.left margin1
+@21.  TX 3 @Enter new gel reading 
+.LEFT MARGIN2
+.para
+THIS OPTION IS NO LONGER AVAILABLE IN XDAP. USE AUTO ASSEMBLE
+.para
+Used to enter new gel readings into the  
+database. The new gel reading must have previously been compared with 
+the 
+contents of the database by use of " auto assemble"  in order to ascertain 
+if it overlaps any previously entered data. 
+.para
+The user is expected to know: if 
+the gel reading overlaps; if so which contig it overlaps; if so where it 
+overlaps. The program takes the user through a series of question to 
+establish the nature of the overlap and then displays the overlap. The 
+user 
+is then offered a number of options, including editors for the new gel 
+reading and the contig, to enable the correct alignment of the gel reading 
+throughout its whole length.
+.left margin2
+
+Supply the name of the gel reading file.
+If the gel 
+reading  has  been entered before the program will  not permit 
+
+entry. 
+The program gives the gel reading a unique number and asks if the 
+
+sequence overlaps any data already in the database (reported by "auto 
+
+assemble").
+
+If it does not, entry is complete.
+If it does overlap the
+ 
+dialogue 
+continues with the program asking if the gel readings overlaps "in the
+ 
+normal sense", if not it will automatically complement the sequence.
+
+Then supply the number of the contig the gel reading overlaps (as
+ 
+reported by "auto assemble").
+.para
+Overlaps are divided into two types: those for which the new gel reading
+ 
+protrudes from the left end of the contig it overlaps, and those for which
+ 
+it does not. The program asks about this with the question "Left end of
+gel 
+reading is inside contig". If this is true the program will go on to ask for
+ 
+the position in the contig of the left end of the new gel reading. If it is
+ not 
+true the program will ask for the position in the new gel reading of the
+
+left end of the contig. 
+.para
+Once this is completed the program will display the first 50 bases of
+ 
+the overlap. 
+The gel readings in the contig and their consensus are displayed with the 
+
+new gel reading underneath. The mismatches are shown by *'s on the 
+next 
+line down. 
+For example:
+.lit
+
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACACGTC
+       NEWGEL      CACAAGCGAGCGAGAGGGGCACCGTGACGTGGTCACGCCGGGGACACGTC
+       MISMATCH                  *                         * *       
+                           10        20        30        40        50
+
+.end lit
+.para
+The program then needs to know if the position of the  left end of the 
+overlap is correct.
+
+If it is the user should type return, if not, 1 and the program will ask for 
+the 
+new position and display it.
+
+.LEFT MARGIN2
+The program now offers a number of  options  to  allow  the
+               user  to align the new gel reading 
+correctly over its whole length with
+               the  data  already  in  the  contig.   It  is  important   that
+               sufficient  edits  are  made  to the new gel reading 
+or the sequences in the
+               contig at this stage to get the alignment correct, because  once
+               entry  is completed, the alignment is fixed and cannot easily be
+               changed (see "alter relationships"). 
+  Alignment  can  be  achieved  
+by  making
+               insertions  or  deletions  but  deletion  of  data requires the
+               original gels to be checked.   For  this  reason  at  entry  we
+               usually make only insertions to achieve alignment.  We use X or
+               asterisks (*) as padding characters to achieve alignment  and 
+ so  can, if required,
+               distinguish  padding  characters  from characters assigned from
+               reading gels.  
+.LEFT MARGIN2
+.para
+The options available are:
+.lit
+   ? = HELP
+   ! = Give up
+   3 = Complete entry
+   4 = Edit contig
+   5 = Display overlap
+   6 = Edit new gel reading
+
+.end lit
+
+.sk1
+.para
+1. HELP gives this information.
+.para
+2. Give up allows users to change their minds about entering the new gel 
+reading. The program will ask the user to 
+confirm this choice.
+.para
+3. Complete entry is the command to add the new gel reading to the 
+contig. The 
+program updates the relationships accordingly. The user is asked to 
+confirm 
+this command.
+.para
+4. Edit contig gives the user access to a simple editor that allows 
+insertions, deletions and changes to be made to the contig. The editor 
+maintains alignments by making the same number of insertions or 
+deletions 
+in all sequences covering the edit position.
+The  program
+               protects the user          by  allowing edits only within
+               the region of overlap.
+.para 
+5. Display allows display of the region of overlap only.  This
+               is defined by the relative positions in the contig. The 
+               default is the whole of the region of overlap.
+.para
+6. Edit new gel reading allows the new gel reading to be edited using a 
+simple editor. 
+.left margin1
+@23. TX 3 @         Complement a contig
+.LEFT MARGIN2
+.PARA
+               This function will complement and reverse all of the gel 
+readings in  a
+          contig.    It   automatically  reverses  and  complements  each  gel
+          reading sequence, reorders left and right neighbours, recalculates  
+relative
+          positions and changes each strandedness.
+.PARA
+               The only user  input  required  is  to  identify the  contig  to
+          complement  by  the  number or name of a gel reading it contains.
+DO NOT KILL THE 
+PROGRAM DURING THIS STEP!
+.left margin1
+@22. TX 3 @          Join contigs
+.LEFT MARGIN2
+.PARA
+This function joins contigs interactively using a mouse driven editor.
+The operation of this editor is very similar to the Contig Editor
+described in "@4 Edit".
+
+.para
+It allows the
+user  to align the ends of the two contigs by editing each
+contig separately.  It is important that the alignment  achieved  is
+correct because once the join is completed the alignment is fixed.
+The program needs to know which two contigs to join.
+.para
+First specify which two contigs are to be joined.
+The user should identify the two 
+contigs. First the left contig and then the right.
+The program checks that the two contig numbers are different (it will not 
+allow circles to be formed!)
+.para
+The Join Editor consists of two Contig Editors in between which is sandwiched
+a disagreement box. This disagreement box shows exclamation marks to
+denote mismatches between the two consensuses.
+.para
+For example, the display will look something like this:
+.lit
+
+                         1460      1470      1480      1490      1500
+   56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+   33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+  -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+   19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+       MISMATCH                         !                      !!!!!! 
+                           10        20        30        40        50
+   -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+   -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+   -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+
+.END LIT
+.para
+.para
+The best strategy for joining is to
+identify the exact position of overlap. This is defined as
+the position in the left contig that the leftmost character of the right
+contig overlaps.
+The overlap must be of at least one character.
+Use the scroll bar and the scroll buttons (`<<',`<',`>',and`>>')
+for positioning the relative positions of the two contigs.
+.para
+The join position can be fixed in position
+by pressing the `lock' button at the top of the Join Editor.
+Locking allows the two contigs to be scrolled as one when using the scroll bar
+and buttons, the left ends always in the same position relative to each
+other.
+.para
+Once locked, it is best to proceed to the right along the contigs, inserting
+padding characters (`*') into the consensuses to minimise the
+disagreements.
+.para
+It is essential that the user aligns the two contigs throughout the whole 
+region of overlap before completing the join because it is only at this 
+stage that the two contigs can be edited independently. Once the join is 
+completed the alignment can only be altered using the routines supplied 
+by "alter relationships".
+.para
+The join can be completed by pressing the `Leave Editor' button. The
+percentage mismatch is displayed, and the user is required to confirm that
+they want to perform the join.
+.left margin1
+@24. TX 1 @               Copy the database
+.LEFT MARGIN2
+.PARA
+Used to make a copy of the database. If required the database size can be 
+
+altered using this option. The "version" of a database is  encoded as the
+ 
+last letter in the names of the five files that contain the database.
+
+.para
+Supply a "version" number (the default is version 1), and if required
+
+select a new size for the database. The size of a database is the number
+ of 
+lines of information it can hold. It needs a line for each gel reading and
+ 
+another for each contig.
+.left margin1
+@19. TX 1 @               Check database
+.LEFT MARGIN2
+.para
+Used to perform a check on  the  logical  consistency  of  the
+          database. No user intervention is required.
+.para
+  The following relationships are checked:
+.LEFT MARGIN2
+               1.       If gel reading A thinks gel reading B is its left
+ neighbour
+ 
+does B think A  is
+               its right neighbour?
+                The error message is
+.left margin2
+"Hand holding problem for gel reading A"
+.left margin2
+followed by  the
+               gel descriptor lines for gel readings A and B.
+.LEFT MARGIN2
+               2.       Are there any contig lines with no left or right
+end gel readings?
+                The error message is
+.left margin2
+"Bad contig line number A"
+.LEFT MARGIN2
+               3.       Do the gel readings that are described as left ends on  
+contig
+               lines agree that they are left ends?
+                The error message is
+.left margin2
+"The end gel readings of contig A have outward neighbours"
+.LEFT MARGIN2
+               4.       Are there gel readings that are in more than one contig?
+                The error message is
+.left margin2
+" Gel number A is used N times"
+.LEFT MARGIN2
+               5.       Are there gel readings that are not in any contig?
+                The error message is
+.left margin2
+" Gel number A is not used"
+.LEFT MARGIN2
+               6.       Do the relative positions of  gel readings  agree  with  
+their
+               position as defined by left and right neighbourliness?
+                The error message is
+.left margin2
+" Gel number A with position X is left neighbour of  gel  number  B  with 
+position Y"
+.LEFT MARGIN2
+               7.       Are there any loops in  contigs?   If  so  no  further
+               checking is done.
+                The error message is
+.left margin2
+" Loop in contig n no further checking done, but gel reading numbers follow"
+.left margin2
+   The
+               program  then  prints the gel reading numbers in the looped 
+contig up 
+to
+               the start of the loop.
+.LEFT MARGIN2
+8. Are there any contigs of length <1? The error message is
+.left margin2
+" The contig on line 
+number x has zero length"
+.LEFT MARGIN2
+9. Are there any gel readings (used in only one contig) that have zero 
+
+length? The error 
+message is
+.left margin2
+" Gel number N has zero length"
+.left margin2
+Note that "auto assemble"  also uses this logical consistency check and
+ will 
+only tolerate a "Gel number N
+ is not used" error. Any other error will cause it to 
+
+give up.
+
+.left margin1
+@29. TX 1 @               Examine quality 
+.LEFT MARGIN2
+.para
+Analyses the quality of the data in a contig. It reports on the proportion
+ 
+of the consensus that is "well determined" and will display a sequence of
+ 
+symbols that indicate the quality of the consensus at each position.
+
+.para
+Identify the contig to analyse, and the section of interest. The current
+ 
+consensus calculation cutoff score will be used to decide if each position 
+is 
+"well determined". In general the quality of a reading deteriorates along
+the length of the gel and so it is also possible to use a length cutoff for
+the quality calculation. Only the data from the first section of each reading
+will be included in the quality calcualtion. The length is altered under
+"set parameters" and is initially set to the maximum reading length.
+A summary showing the percentage of the consensus
+that falls into each category of quality is shown. Choose whether or not to 
+have the quality codes for each position of the consensus displayed. 
+They can be displayed as either graphics or text.
+.para
+The quality of the data depends on the number of times it has been
+
+sequenced and the particular uncertainty codes  used  in each  gel
+
+reading.  This function divides the data into five categories, assigning
+
+each 
+a symbol or code: 
+.LEFT MARGIN2
+                1.  Well determined on both strands and they agree.  code=0
+.LEFT MARGIN2
+                2.  Well determined on the plus strand only.  code=1
+.LEFT MARGIN2
+                3.  Well determined on the minus strand only.  code=2
+.LEFT MARGIN2
+                4.  Not well determined on either strand.  code=3
+.LEFT MARGIN2
+                5.  Well determined on both strands but they disagree.  code=4
+.LEFT MARGIN2
+ A position is "well determined" if it is assigned one of the symbols 
+A,C,G,T when the algorithm described in the section "calculate a 
+consensus".
+The calculation is performed
+separately for each strand.
+.para
+If the user chooses to have the data displayed graphically the following 
+scheme is used. A rectangular box is drawn so that the x coordinate
+represents the length of the contig. The box is notionally
+divided vertically into 
+5 possible levels which are given the y values: -2,-1,0,1,2.
+The quality codes attributed to each base position are plotted as 
+rectangles.
+Each rectangle represents a region in 
+which the quality codes are identical, so a single base having a different 
+code from its immediate neighbours will appear as a very narrow rectangle.
+.lit
+  
+  Rectangle bottom and top y values
+
+     Quality 0 rectangle from 0 to 0
+     Quality 1 rectangle from 0 to 1
+     Quality 2 rectangle from 0 to -1
+     Quality 3 rectangle from -1 to 1
+     Quality 4 rectangle from -2 to 2
+.end lit
+.para
+Obviously a single line at the midheight shows a perfect sequence.
+.para
+Typical dialogue is shown below.
+.lit
+
+   41.47% OK on both strands and they agree(0)
+   55.48% OK on plus strand only(1)
+    2.08% OK on minus strand only(2)
+    0.97% Bad on both strands(3)
+    0.00% OK on both strands but they disagree(4)
+  ? (y/n) (y) Show sequence of codes 
+
+           10         20         30         40         50
+   1111111111 1111111111 1111111111 1111111111 1111111111
+
+           60         70         80         90        100
+   1111111111 1111111111 1111111111 3111111111 1111111111
+
+          110        120        130        140        150
+   1111111111 1111131111 1111111111 1111111111 1111111111
+
+          160        170        180        190        200
+   1111111111 1111111111 1111111111 1111111111 1111111133
+
+          210        220        230        240        250
+   1311111111 1111111111 1111111110 0000000000 0000220000
+
+          260        270        280        290        300
+   0000000000 0020000000 2200000202 0002000000 0000222200
+
+.end lit
+.left margin1
+@26. TX 3 @              Alter relationships
+.LEFT MARGIN2
+.para
+Used to make what are normally illegal changes to the database. That is
+
+the normal checks are not done and any item in the database can be 
+changed independently of all others. Users need to know what they are
+ 
+doing because it is very easy to make a horrible mess. Always start by
+ 
+making a copy!
+.para
+By using the options here users can edit individual gel readings in contigs, 
+move one section of a contig relative to another, break contigs, remove 
+contigs, remove gel readings, etc. To give flexibility most
+ of the commands do only one thing. This means that several commands 
+may 
+have to be executed to complete any change. At the end of this help 
+section 
+there are notes on removing gel readings from the database.
+.para
+The following options are offered:
+.lit
+
+   Cancel
+   Line change
+   Edit single gel reading
+   Delete contig
+   Shift
+   Move gel reading
+   Rename gel reading
+   Break a contig
+   Alter raw data parameters
+
+.end lit
+.left margin2
+1. QUIT returns to the main options of SAP.
+.left margin2
+
+2. Line change 
+.left margin2
+            allows the user to change the contents  of  any line in the
+                
+file of relationships.  The line is selected by number, the
+ 
+               program prints the current line and prompts for the new  line.
+
+.left margin2
+3.   Edit      
+.left margin2
+allows  the  user  to   edit   an   individual   gel reading
+               independently of any others it may be related to. The edit 
+positions are relative to 
+               the contig. The effect of this editing on the length of the
+               gel reading is taken care of but, if it changes the length of
+ a contig,
+               or its relationship to others, this must be accounted for (if 
+               necessary) by use of the "line change" function.
+
+.left margin2
+4.  Delete  contig 
+.left margin2
+is a function that deletes a contig line  by moving down  
+               all  the  contig lines above by one position.  It prompts only 
+               for the line to delete.  It does not  delete  any  of  the gel 
+readings
+               or gel reading 
+lines for the deleted contig but it does reduce the 
+               number of contigs on line IDBSIZ by 1.
+
+.left margin2
+5.  Shift   
+.left margin2
+   allows the user to change all the relative  positions of  a  
+               set  of  neighbouring gel 
+readings by some fixed value, i.e.  it will
+               shift related gel readings
+ either left or right.  It can therefore
+               be  used  to  change the alignment of the gel 
+readings in a contig 
+               or as part of the process of breaking a contig into two parts 
+               (see below).  It prompts for the number of the first gel 
+reading to 
+               shift and  then  for the  distance  to  move  them (Note a 
+               negative value will move the gel readings
+ left and a positive value 
+               right).   It  then chains rightwards (ie follows right 
+               neighbours) and shifts each gel 
+reading,  in  turn,  up to the  end  
+               of the contig.  (This means that only those gel readings
+ from the first
+               to shift to the rightmost are moved). It updates the length of 
+               the contig accordingly.
+
+.left margin2
+6. Move gel reading
+.left margin2
+    is a function to renumber a gel reading. It moves all the information 
+               about a gel 
+reading on to another line. The user must specify the 
+number
+               of the gel  reading 
+to move and the number of the line to place it. It 
+               takes care of all the relationships. Of course gel 
+readings must not be
+               moved to lines occupied by other gel 
+readings! It can be used as part 
+               of the process of removing a gel 
+reading from the database (see below).
+
+.left margin2
+7.  Rename gel reading
+.left margin2
+ is a function that is used to  rename  the archive  names  of  
+               gel 
+readings  in the database;  it only changes the name in the .ARN 
+               file of the  database.
+
+.sk1
+.LEFT MARGIN2
+8. Break contig
+.LEFT MARGIN2
+.PARA
+Occasionally it is necessary to break a contig into two parts and this can be 
+achieved using this option. The program needs only the number of a gel 
+reading. This is the gel  reading that will become a left end after the 
+break. That 
+is, the break is made between this gel 
+reading and its left neighbour. A new contig 
+line is created so ensure that there is sufficient space in the database.
+.left margin2
+Removing gel readings from contigs
+.left margin2
+.PARA
+Gel 
+readings can be removed from contigs if they are not essential for holding the 
+contig together (ie are not the only gel reading covering a particular region). 
+Suppose the gel reading to remove is gel number 
+b with left neighbour a and right 
+neighbour c.
+Using "line change" change the right neighbour of a to c, and the left 
+neighbour of c to a. To tidy things up: suppose there are x gel 
+readings in the 
+database; then, using "move gel reading" move gel x to line b; then, using 
+"line change" 
+decrease the number of gel 
+readings in the database (stored in the last line) by 1.
+.sk1
+.LEFT MARGIN2
+8. Alter raw data parameters
+.LEFT MARGIN2
+.PARA
+Allows the user to edit the individual raw data parameters, such as
+the left and right cutoff lengths and the name of the machine readable trace
+file.
+The user must specify the gel line to modify, and provide new values for
+the length of the raw sequence including cutoff lengths, the left cutoff position, the length of the original working sequence, the machine type, and the name
+of the raw data file, where these values change.
+.left margin1
+@27. TX 1 @  Set display parameters
+.LEFT MARGIN2
+.para
+Used to redefine the parameters that control the cutoff employed by the
+
+consensus calculation and quality examiner, the maximum length of each
+reading to include in the quality calculation, the line length used by
+
+the display function, the text window length used by the graphics 
+options, and the graphics window length used by the graphics options.
+.para
+The default cutoff score is 75%. The default line length is 50 characters. 
+For protein sequences the cutoff is always 100%.
+.para
+The text window used by the graphics options controls the amount of 
+sequence listed at the crosshair position. The graphics window controls the 
+"zoom" function. Both these windows are defined as the number of bases that 
+should be shown, to both left and right of the crosshair.
+.left margin1
+@30. TX 3 @  Auto edit a contig
+.left margin2
+.para
+This function automatically changes characters in gel readings to make 
+
+them agree with the consensus sequence. If employed as is intended, use 
+
+of this function is not a criminal activity but a method that saves a large 
+
+amount of work. All characters changed by the auto editor will appear in 
+
+the gel readings as lowercase letters. The current consensus calculation 
+cutoff score is used.
+.para
+Identify the contig and the section to edit. The program will display a 
+
+summary of changes made. Note that it is important to understand both 
+
+what the auto editor does and the order in which it does it. Before 
+
+employing the auto editor users should note all the corrections that they 
+require, so that  after it has been used the corrections can be checked. 
+
+.para
+ The 
+general strategy employed when collecting shotgun sequence data is to let 
+the contigs get fairly deep, to get a printout of a contig, 
+check problems against the 
+films, note corrections on the printout, and
+make the changes using an interactive editor.  
+In general the consensus is correct except for places where padding 
+characters have been used to accommodate a single gel with an extra 
+character, or where the consensus is dash. The important point for the 
+auto  
+editor is that
+most edits simply make the 
+gel readings conform to the consensus, or remove columns of pads.
+.para
+The new editor does the following.
+.para
+1) calculates a consensus for the contig (or part of a contig) to be 
+edited, and then uses this consensus to direct the editing of the contig
+in 3 stages
+.para
+2) stage 1: find and correct all places where, if the order of two adjacent 
+characters is swapped, they will both agree with the consensus (given 
+that 
+they did not match the consensus before). These corrections are termed 
+"transpositions"
+.para
+3)  stage 2: find and correct all places where there is a definite consensus 
+but the gel reading has a different character. These corrections are 
+termed 
+"changes".
+.para
+4) stage 3: delete all positions in which padding is the consensus. These 
+corrections are termed "deletions".
+.para
+All changed characters are shown in lowercase letters so it will be 
+obvious which 
+characters have been assigned by the program (except for deletions). The 
+number of each type of correction will be displayed.
+
+.LEFT MARGIN1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears graphics from the screen.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN1
+.para
+ Clears  text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler.
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first base to be marked (for example if the active 
+region is 1501 to 8000, the user might wish to mark every 1000th base 
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@14. TX 2 @Reposition plots
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "ANALMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+Note that, unlike all the other programs, the boxes used to contain 
+analytical results (eg plot quality) should not be made to overlap one 
+another, as the function of the crosshair routine depends on which box the 
+crosshair is in!
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=15
+Type label then drive cross hair to left or right end
+of label position then hit  "L"  to  write label left
+justified or  "R"  to  write label right justified or
+the space bar to quit
+ 
+ 
+? Label=delta gene
+
+ missing graphics 
+
+? Label=
+ 
+.end lit
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+This draws a map 
+of any sequence features selected by the user.
+These features may be protein coding regions (CDS), tRNA genes (TRNA), 
+promoter positions (PRM), etc. Users may define their own feature table 
+key 
+names. For example I find it convenient to split CDS lines into CDS1, 
+CDS2 
+and CDS3 each of which contains only those sequences that code in the 
+reading frames 1, 2 or 3. Then I can plot them at different heights on 
+the screen ( suitable heights can be determined by using the cross-hair).
+The coordinates must be stored in a file in the format of an EMBL feature 
+table. 
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=16
+ Display a map using an EMBL feature table file
+? map file name=hsegl1.ft
+? feature code(e.g. CDS) =CDS
+X 1 + strand
+  2 - strand
+  3 both strands
+? 0,1,2,3 =
+? level (0-9480) (256) =4000
+
+ missing graphics 
+ 
+? feature code(e.g. CDS) =
+
+.end lit
+.left margin1
+@7. TX 1 @Redirect output
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@13. TX 2 @Use crosshair
+.left margin2
+.para
+This option puts a steerable cross on the screen which the user 
+drives around 
+by using the arrow keys (or mouse). When the crosshair is 
+visible a number of options are available if the user types one of a 
+set of special keyboard characters. Any other characters will cause 
+an exit from the crosshair option. The special keys are:
+.lit
+
+    I = Identify the nearest gel reading
+    Z = Zoom in
+    Q = plot Quality
+    S = display the aligned Sequences at the crosshair position
+    N = list the Names and Numbers of the sequences at the crosshair
+.end lit
+.para
+In order for any of these special keys to operate, the crosshair 
+must be in an appropriate display box, and the precise function of 
+the keys will also depend on which box the crosshair is in.
+.para
+ If the 
+crosshair is in the "plot all contigs" box, Z will cause a new box to 
+appear showing all the readings for the nearest contig; Q will give 
+the same as Z but will also produce an extra box showing the 
+"quality" plot.
+.para
+ If Z is hit in the "plot single contig" box, the contig will be zoomed 
+to the current graphics window size. The zoom will be roughly 
+centred on the crosshair position. Because of this it is possible to 
+step along a contig by repeatedly zooming with the crosshair near 
+to one end of the single contig display box. If I is hit the crosshair 
+must be close to a gel reading line. If Q is hit, the quality plot will 
+be produced for the region shown in the plot single contig box. In 
+all cases when the "plot all contigs" box is shown, a vertical line will 
+bisect the line the represents the relevant contig, at the current 
+position.
+.para
+If the crosshair is in the plot quality box only the character "s" will operate 
+as a special symbol.
+.para
+The number of bases shown in the N and S options is controlled by 
+the current graphics text window size, and the size of the zoom 
+window by the current graphics window size. Both are set by the 
+parameter setting function of the general menu.
+.left margin1
+@33. TX 2 @Plot single contig
+.left margin2
+.para
+This option produces a schematic of a selected region of a single 
+contig by drawing a horizontal line to represent each of its gel 
+readings. The lines show the relative positions of each reading and 
+also their sense. The plot is divided vertically into two sections by 
+a line that is identified by an asterisk drawn at each end. All lines  
+that lie above this line represent readings that are in their original 
+sense, all lines below show readings that are in the 
+complementary sense to their original. By use of the crosshair 
+function the plot can be stepped through and examined in more 
+detail. See help on crosshair.
+.left margin1
+@34. TX 2 @Plot all contigs
+.left margin2
+.para
+This option produces a schematic of all the contigs in a database. It 
+does this by drawing a horizontal line to represent each of them. 
+In order to show the ends of each contig it draws the lines for 
+contigs at alternate heights: the first at height one, the 
+second at height two, the third at height one, etc. The order of the 
+contigs in the display is the same as their order in the database. 
+By use of the crosshair function the plot can be stepped 
+through and examined in more detail. See help on crosshair.
+.left margin1
+@31. TX 3 @ Type in gel readings
+.left margin2
+.para
+THIS OPTION IS NO LONGER AVAILABLE IN XDAP.
+.para
+This option allows gel readings to be typed in at the keyboard. It creates 
+a separate file for each gel reading and a file of file names for the 
+batch. The sequences from each batch may be listed when they have all been 
+entered. Users may choose to employ special keys to identify the 4 bases 
+A,C,G and T. By default these special keys are N M , . but any other four 
+characters may be used. If special keys are used the characters are 
+automatically translated to A C G T before being stored on the disk.
+
+.left margin1
+@35. TX 1 3 @Find internal joins
+.left margin2
+.para
+The purpose of this function is to use data already in the database to
+find possible joins between contigs.
+Joins may have been missed due to poor data or may have not been made
+due to repeated sequences. Where appropriate, it may be 
+possible to find potential
+joins by using the data clipped off readings prior to their entry into the
+database.
+.left margin2
+The database is checked for logical consistency. Supply a minimum initial
+match length, a minimum alignment block, the maximum pads per sequence,
+the maximum percent mismatch after alignment, the probe length. Choose
+if clipped data is to be used, if so define the window size for finding good
+data and the number of dashes allowed in the window. Processing will commence.
+Most of these values are used in an identical way in the autoassemble 
+function. The others are defined below. 
+.left margin2
+The program strategy
+.left margin2
+Take the first contig and calculate its consensus. If clipped data is being
+used examine all readings that
+are in the complementary orientation, and sufficiently near to the contigs left
+end, to see if they have good clipped sequence which if present, would 
+protrude 
+from the left end of the contig. If found add the longest such sequence to the 
+left end of the consensus. Do the same for the right end by examining  
+readings that are in their
+original orientation. If any are found add the longest extension to the 
+right end of
+the consensus. Repeat the consensus calculations and extensions 
+for all contigs hence producing an extended consensus. If clipped data is not
+being used simply calculate the consensus for the whole database. Now
+look for possible joins by processing the extended consensus in the following
+way. Take the last, say 100, bases (termed the "probe length" by the program)
+of the rightmost consensus, compare it both
+orientations with the extended consensus of all the other contigs. Display
+any sufficiently good alignments. Repeat with the left end of the rightmost
+contig. Do the same for the ends of all the entended contigs, always only
+comparing with the contigs to their left, so that the same matches do not 
+appear twice.
+.left margin2
+Good cliped data is defined by sliding a window of "Window size for good data
+scan" bases outwards
+along the sequence and stopping when "Maximum number of dashes in scan window"
+ or more dashes appear in the window.
+Note that
+it is advisable to have some sort of cutoff because if we simply take all the
+data it might be so full of rubbish that we wont find any good matches. For
+the same reason it is worth trying the procedure with different cutoffs. An
+initial run using no clipped data is also recommended.
+Sufficiently good
+alignments are defined by criteria equivalent to those used in autoassemble,
+however here we only display alignments that pass all tests.
+.left margin2
+Bugs
+.left margin2
+If a small contig is wholly contained within a larger one, such that its
+ends are further than ("Probe length" - "Minimum initial match length")
+from the ends of the larger contig, and the consensus for the small 
+contig lies to the left
+of the consensus for large contig, the overlap will not be discovered. (See
+the search stratgey).
+.left margin2
+ All numbering is
+relative to base number one in the contig: matches to the left (i.e. in
+the clipped data) have negative
+positions, matches off the right end of the contig (i.e. in the clipped
+data) have positions 
+greater than that of the contig length.
+The convention for reporting the positions of overlaps is as follows: if neither
+contig needs to be complemented the positions are as shown. If the program says
+"contig x in the - sense" then the positions shown assume contig x has been 
+complemented. For example in the results given below the positions for the 
+first overlap are as reported, but those for the second assume that the contig
+in the minus sense (i.e. 443) has been complemented.
+.lit
+
+
+ Possible join between contig   445 in the + sense and contig   405
+ Percentage mismatch after alignment =  4.9
+        412        422        432        442        452        462
+     405  TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAG,TT AGCTCACTCA
+           ********* * ********  ***** *** ********** ********** **********
+     445  -TTCCCGACT G,AAAGCGGG TAGTGA,CGC AACGCAATTA ATGTGAG-TT AGCTCACTCA
+       -127       -117       -107        -97        -87        -77
+        472        482        492        502        512
+     405  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+          ********** ********** ********** ********** **
+     445  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+        -67        -57        -47        -37        -27
+ Possible join between contig   443 in the - sense and contig   423
+ Percentage mismatch after alignment = 10.4
+         64         74         84         94        104        114
+     423  ATCGAAGAAA GAAAAGGAGG AGAAGATGAT TTTAAAAATG AAACG-CGAT GTCAGATGGG
+          **** ***** ********** ********** ******  ** ***** **** ********* 
+     443  ATCG,AGAAA GAAAAGGAGG AGAAGATGAT TTTAAA,,TG AAACGACGAT GTCAGATGG,
+       3610       3620       3630       3640       3650       3660
+        124        134        144        154        164
+     423  TTG-ATGAAG TAGAAGTAGG AG-AGGTGGA AGAGAAGAGA GTGGGA
+          *** ****** ********** ** *******  *** ***** ** ** 
+     443  TTGGATGAAG TAGAAGTAGG AGGAGGTGGA ,GAG,AGAGA GTTGG-
+       3670       3680       3690       3700       3710
+
+
+.end lit
+.left margin1
+@ end of help
diff --git a/help/GIP.RNO b/help/GIP.RNO
new file mode 100644
index 0000000..d72dd68
--- /dev/null
+++ b/help/GIP.RNO
@@ -0,0 +1,205 @@
+.NPA
+.left margin1
+.CENTER
+GIP
+.LEFT MARGIN1
+.PARA
+A digitizer is
+  a  two  dimensional  surface
+which is such that if a special pen is pressed onto it, the pens 
+coordinates can be recorded by a computer.
+These coordinates
+          can be interpreted by a program. 
+.para
+The digitizing device we use works by the pen emitting a high frequency 
+sound which is picked up by two microphones positioned at the rear of the 
+working area. The pen position is determined by triangulation and the 
+digitizing device sends the coordinates to the computer. As no special 
+surface is required the device can conveniently be positioned on a light 
+box giving the sequencer an unobscured view of the autoradiographs.
+.LEFT MARGIN1
+The digitizer
+             is called a GRAPHBAR MODEL GP7 made by
+             Science Accessories Corp,
+             970 Kings Highway West,
+             Southport,
+             Connecticut 06490,
+             USA.
+
+.para
+               The program uses a menu to allow the user to select commands or
+          to  enter  the  uncertainty  codes  for  areas  of  the gel that are
+          difficult to interpret.  A menu is simply a series of boxes drawn on
+          the  digitizing surface  that  each  contain  a  command or
+          uncertainty code.  When the user puts the pen down in these  special
+          regions  the program interprets the coordinates as commands and acts
+          appropriately. A copy of the menu should have been sent to you.
+It should be stuck down on the surface of the 
+light box in the digitizing area. For convenience it is best to position it 
+to the right of the digitizing area, but in practice as long as 
+its top 
+edge is parallel to the digitizer box, it can be put anywhere in the active 
+region.
+.sk1
+.left margin1
+ Entering gel readings using a digitizer
+.left margin1
+.para
+The autoradiograph should be stuck down on the light box with the lanes 
+running, as near is as 
+possible, at right angles to the digitizer. To read
+an autoradiograph placed on the light box
+the user  need  only  define the positions of
+the four sequencing lanes and the bases
+          to which they correspond and then use  the  pen  to  point  to  each
+          successive  band progressing up the gel.  The program examines the
+          coordinates of each pen position to see in which of the  four  
+lanes
+          it  lies  and  assigns  the  corresponding  base to be stored in the
+          computer.  Each time the pen tip is depressed to point to a position
+          on  the  surface of the digitizer the program sounds the bell on the
+          terminal (a different sound for each of the four bases on the 
+microcomputer version of the program)
+ to indicate to the user that a point has been recorded.  As
+          the  sequence  is read the program displays it on the screen.
+
+
+.para
+               The program uses a menu 
+to allow the user to select commands or
+          to  enter  the  uncertainty  codes  for  areas  of  the gel that are
+          difficult to interpret.  A menu is simply a series of boxes drawn on
+          the  digitizing surface  that  each  contain  a  command or
+          uncertainty code.  When the user puts the pen down in these  special
+          regions  the program interprets the coordinates as commands and acts
+          appropriately.     As    well    as    the     uncertainty     codes
+          A,C,G,T,1,2,3,4,B,D,H,V,R,Y,X,-,5,6,7,8  the  following commands are
+          included in the menu:  DELETE removes the last character  from 
+the  sequence;
+          RESET allows the lane centres to be redefined;
+START  means  begin  the  next
+          stage  of  the  procedure;  STOP means stop the current stage in the
+          procedure;  CONFIRM means confirm that the last command  or  set  of
+          coordinates  are  correct.
+.para
+The digitizing device also has a menu of its own. This lies in a two inch wide 
+strip immediately in front of the digitizing box. Pen positions within this 
+two inch strip are interpretted as commands to the digitizer and are not 
+sent to the GIP program. In general the only time users will need to use 
+the device menu is when they tell GIP where the program menu lies in the 
+digitizing area. This is done by first hitting ORIGIN in the device menu 
+and then hitting the bottom left hand corner of the program menu. The 
+program menu can hence be positioned anywhere in the active region but 
+should be arranged parallel to the digitizer.
+.para
+The user should try to hit the bands as near as possible to the centre of 
+the lanes because the program tracks the lanes up the film using the pen 
+positions. By using this tracking strategy the user only has to define the 
+centres of the bottom of the lanes before starting to read the film. The 
+program can correctly follow quite curved lanes and constantly checks that 
+its lane centre coordinates look sensible. If the lane centres appear to be 
+getting too close the program stops responding to the pen positions of 
+bands and hence does not ring the bell. If this occurs users must hit the
+reset box in the menu and the program will request them to redefine the 
+lane centres at the current reading position. Then they can continue 
+reading. As a further safeguard the program will only respond to pen 
+positions either in the menu or very close to the current reading position.
+.sk1
+.left margin1
+ Running the gel reading program
+.left margin1
+The autoradiograph should be firmly stuck down on the light box and the 
+program started by typing GIP. It will ask the first question.
+.left margin2
+" ? FILE OF FILE NAMES="
+.left margin2
+Type the name for the file of file names and then follow the instructions.
+.left margin2
+" HIT DIGITIZER MENU ORIGIN"
+.left margin2
+" THEN PROGRAM MENU ORIGIN"
+.left margin2
+" THEN HIT START IN PROGRAM MENU"
+.left margin2
+If the bell does not sound after you hit start try hitting metric in the 
+device menu (the program uses metric units, and some digitizers are set to 
+default to use inches; hitting metric switches between the two).
+.left margin2
+After the bell has sounded the program will give the default lane order.
+.left margin2
+" LANE ORDER IS T C A G"
+.left margin2
+" IF CORRECT HIT CONFIRM, ELSE HIT RESET"
+.left margin2
+If the lane order, reading from left to right is correct hit confirm in the 
+program menu. If you are using a different order hit reset and you will be 
+asked to define the lane order from left to right using the program menu 
+(as follows).
+.left margin2
+" DEFINE LANE ORDER (LEFT TO RIGHT) USING MENU"
+.left margin2
+Hit the boxes in the menu that contain the symbols A,C,G,T in the 
+left-right order of the lanes. The program will respond with the lane order 
+as above and ask for confirmation. When this is received, the next task is 
+to define the start positions of the next four lanes.
+.left margin2
+" HIT START, THEN HIT (LEFT TO RIGHT)"
+.left margin2
+" THE START POSITIONS FOR THE NEXT FOUR LANES"
+.left margin2
+Hit the centres of the four lanes at a height level with the first band 
+that is going to be read. The program will report the mean lane separations
+and asks for confirmation that they are correct.
+.left margin2
+" MEAN LANE SEPARATION IS XX"
+.left margin2
+" HIT CONFIRM TO CONTINUE"
+.left margin2
+Users will become familiar with the values from their films and will spot 
+any unusual numbers.
+Asking for confirmation allows users to try again if they had made a 
+mistake, but generally the lane separation values can be ignored.
+Hit confirm, and the program will give the message
+.left margin2
+" HIT START WHEN READY TO BEGIN READING"
+.left margin2
+Hit start and the program will give the message
+.left margin2
+" HIT BANDS, UNCERTAINTY CODES, RESET OR STOP"
+.left margin2
+Hit the bands, interpretting the sequence progressing 
+up the film. If necessary use the uncertainty codes. If the pen stops 
+responding hit reset and follow the instructions as above. When the 
+sequence becomes unreadable hit stop and the program will ask for a file 
+name for the gel reading just read.
+.left margin2
+" ? FILE NAME FOR THIS GEL READING="
+.left margin2
+Type the file name observing the rules about legal gel readings names.
+The program will ask if you wish 
+to read another sequence.
+.left margin2
+" TO ENTER ANOTHER GEL READING TYPE 1"
+.left margin2
+To enter another type 1 and you will be back to the step of defining the 
+lane order. Typing anything else will stop the program.
+.left margin1
+.sk1
+Running the microcomputer version of the gel reading program
+.left margin1
+The microcomputer version of GIP is slightly different and is called 
+GIPB. The BBC micro 
+does not have the capacity to process the gel readings beyond the reading 
+stage.
+This means that users of this program
+would need to transfer their gel readings from the micro to another machine 
+using a terminal emmulator. Transferring many files is tedious and so the 
+microcomputer version of the gel reading program stores all the gel 
+readings for each run of the program in a single file. This special 
+file contains both sequences and file names and can be moved in a single 
+transfer to another machine. Once on the other machine the single file must 
+be split into separate gel reading files and a file of file names. This is 
+done using the program BSPLIT. As far as using the microcomputer version
+of GIP, the only difference is that the first file name the program 
+requests is not a file of file names, but a name for the single file to 
+contain all the gel readings and their names.
diff --git a/help/MEP.RNO b/help/MEP.RNO
new file mode 100644
index 0000000..b3d575e
--- /dev/null
+++ b/help/MEP.RNO
@@ -0,0 +1,859 @@
+.NPA
+.SP 1
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. TX  0 @Dictionary analysis
+.sp
+@0. TX  -1 @MEP
+.left margin2
+.para
+This is a program  for analysing families of nucleotide sequences in order 
+to find common motifs and potential binding sites.
+The ideas in this program were described in Staden, R.  "Methods 
+for discovering novel motifs in nucleic acid sequences". 
+Computer Applications in the Biosciences, 5, 293-298, (1989).
+.PARA
+The program can read 
+sequences stored in either of two formats: 1) all sequences aligned in a 
+single file; 2) all sequences in separate files and accessed through a file 
+of file names.
+.PARA
+The program contains functions that can answer several questions 
+about a set of sequences:
+.SK1
+.left margin2
+ Which words are most common?
+.left margin2
+ Which words occur in the most sequences?
+.left margin2
+ Which words contain the most information?
+.left margin2
+ Which words occur in equivalent positions in the sequences?
+.left margin2
+ Which words are inverted repeats?
+.left margin2
+ Which words occur on both strands of the sequences?
+.left margin2
+ Where are the inverted repeats?
+.left margin2
+ Where are the fuzzy words?
+.para
+ Most of the program is 
+concerned with analysing 
+what it terms "fuzzy 
+words" within the set of sequences. The analysis is explained 
+below. Note that the standard version of the programs is limited 
+to words of maximum length 8 letters, and a maximum fuzziness 
+of 2.
+.para
+The following analyses (preceded by their option numbers) are included:
+.lit
+  ? = Help
+  ! = Quit
+  3 = Read new sequences
+  4 = Redefine active region
+  5 = List the sequences
+  6 = List text file
+  7 = Direct output to disk
+ 10 = Clear graphics
+ 11 = Clear text
+ 12 = Draw ruler
+ 13 = Use cross hair
+ 14 = Reset margins
+ 15 = Label diagram
+ 16 = Draw map
+ 17 = Search for strings
+ 18 = Set strand
+ 19 = Set composition
+ 20 = Set word length
+ 21 = Set number of mismatches
+ 22 = Show settings
+ 23 = Make dictionary Dw
+ 24 = Make dictionary Ds
+ 25 = Make fuzzy dictionary Dm from Dw
+ 26 = Make fuzzy dictionary Dm from Ds
+ 27 = Make fuzzy dictionary Dh from Dm
+ 28 = Examine fuzzy dictionary Dm
+ 29 = Examine fuzzy dictionary Dh
+ 30 = Examine words in Dm
+ 31 = Examine words in Dh
+ 32 = Save or restore a dictionary
+ 33 = Find inverted repeats
+.end lit
+.para
+Some of these methods produce graphical 
+ results 
+and so the 
+program is generally used from a graphics terminal (a vdu on which lines 
+and points can be drawn as well as characters). 
+.para
+.LEFT MARGIN2
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "MEPMARG" when the 
+program is started. Users can have their own file if required.
+.para
+The options for the program are accessed from 3 main menus: general, screen 
+control and dictionary analylsis.
+Both menus and options are selected by number.
+.para
+The most important and novel part of the program is its use of "fuzzy 
+dictionaries" and an information theory measure, to help show the most 
+interesting motifs.
+
+  Central to the method is the idea of a fuzzy dictionary of word 
+frequencies. A dictionary of word frequencies is an ordered list of 
+all the words in the sequences and a count of the number of times 
+that they occur. A fuzzy dictionary is an equivalent list but which 
+contains instead, for each word, a count of the number of times 
+similar words occur in the sequences. We term words that are 
+similar "relations". The fuzziness is defined by the number of 
+letters in a word that are allowed to be different. So if we had a 
+fuzziness of 1 we allow 1 letter to be different. For example, with 
+a fuzziness of 1, the entry in the fuzzy dictionary for the word 
+TTTTTT would contain a count of the numbers of times TTTTTT 
+occured plus the number of times all words differing by exactly 
+one letter from TTTTTT occured.   
+.para
+   Once the fuzzy dictionary has been created we can examine it in 
+several ways to find candidate control sequences. The simplest 
+question we can ask is which word in the dictionary is the most 
+common.  Sometimes this simple criterion of "most common" may 
+be adequate to discover a new motif but in general we would not 
+expect it to be sufficient. For example some words will be common 
+simply because of a base composition bias in the sequences being 
+analysed. In addition a word can be the most frequent and yet not 
+be "well defined". This last point is best explained by an example.
+.para
+   Suppose we were looking at  two letter words and allowing one 
+mismatch, and that there were 10 occurences of TT and 5 of AC. 
+We could align the 10 words that were one letter different from TT 
+and the 5 that were  related to AC. Then we could count the 
+number of times each base occured in each position for each of 
+these two sets of words. Suppose we got the two base frequency 
+tables shown below.
+.lit
+   TT                  AC
+       T 6 4               T 1 0
+       C 1 3               C 0 4
+       A 1 2               A 4 1
+       G 2 1               G 0 0
+
+.end lit
+These tables show that although TT occurs (with one letter 
+mismatch) more often than AC, the ratio of base frequencies for 
+AC at 4/5, 4/5 is higher than those for TT at 6/10, 4/10. Hence we 
+would say that AC was better defined than TT.
+Expressing this another way we would say that the definition of AC 
+contained more information than that for TT. The program 
+calculates the information content in a way that takes into account 
+both the sequence composition and the level of definition of the 
+motif.
+.para
+Definitions
+
+.para
+Here we deal only with the dictionary analysis.
+Suppose we are dealing with a set of 
+sequences and are examining them for words that are six 
+characters in length.
+
+.para
+Dictionary Dw contains a count of the number of times each word 
+occurs in the set of sequences. For example the entry for TTTTTT 
+contains a value equal to the number of times the word TTTTTT 
+occurs in the set of sequences.
+
+.para
+Dictionary Ds contains a count of the number of different sequences in 
+which each word occurs. For example if the entry for word TTTTTT 
+contains the value 10, it denotes that the word TTTTTT occurs in ten 
+different sequences. Unlike Dw it only counts words once for each 
+sequence. For example if we had a set of 100 sequences, the maximum 
+possible value that Ds could take is 100, and this would only happen if 
+a word occurred in every sequence. However for the same set of 
+sequences, Dw could contain values greater than 100, and this would 
+show that a word had occurred more than once in at least one 
+sequence.
+
+.para
+From either of the two dictionaries Dw or Ds we can calculate a fuzzy 
+dictionary Dm. For each word, the entry in the fuzzy dictionary Dm 
+contains the sum of the dictionary values (taken from either Dw or Ds) 
+for all words that differ from it by up to m letters. For example if m=2 
+the entry for TTTTTT contains the number of times that TTTTTT 
+occurs in the dictionary, plus the counts for all words that differ from 
+TTTTTT by 1 or 2 letters. 
+Obviously the interpretation of the values in Dm depends on which of 
+the two dictionaries Dw or Ds they were derived from. When derived 
+from Dw the entry for any word in Dm gives the total number of 
+times it, and its relations, occur in the set of sequences. When derived 
+from Ds the entry for any word in Dm gives the total number of 
+different sequences that contain a word and each of its relations.
+
+.para
+Finally, from fuzzy dictionary Dm we can derive fuzzy dictionary Dh. 
+All entries in Dh are zero except for the word(s), within each set of 
+relations, that are most frequent. For example if TTTTTT occurred 20 
+times but had a relation that occurred more often, then the entry for 
+TTTTTT would be zero. However if TTTTTT did not have a more 
+frequently occurring relation, then the entry for TTTTTT would 
+contain the value 20. 
+
+.LEFT MARGIN1
+@1. T 0 @Help
+.LEFT MARGIN2
+.para
+This option gives online help. The user should select option numbers and
+the current documentation will be given. Note that option 0 gives an
+introduction to the program, and that ? will get help from anywhere in 
+the 
+program.
+The following analyses (preceded by their option numbers) are included:
+.lit
+  ? = Help
+  ! = Quit
+  3 = Read new sequences
+  4 = Redefine active region
+  5 = List the sequences
+  6 = List text file
+  7 = Direct output to disk
+ 10 = Clear graphics
+ 11 = Clear text
+ 12 = Draw ruler
+ 13 = Use cross hair
+ 14 = Reset margins
+ 15 = Label diagram
+ 16 = Draw map
+ 17 = Search for strings
+ 18 = Set strand
+ 19 = Set composition
+ 20 = Set word length
+ 21 = Set number of mismatches
+ 22 = Show settings
+ 23 = Make dictionary Dw
+ 24 = Make dictionary Ds
+ 25 = Make fuzzy dictionary Dm from Dw
+ 26 = Make fuzzy dictionary Dm from Ds
+ 27 = Make fuzzy dictionary Dh from Dm
+ 28 = Examine fuzzy dictionary Dm
+ 29 = Examine fuzzy dictionary Dh
+ 30 = Examine words in Dm
+ 31 = Examine words in Dh
+ 32 = Save or restore a dictionary
+ 33 = Find inverted repeats
+.end lit
+.left margin1
+@2. T 0 @Quit
+.left margin2
+.para
+This function stops the program.
+.left margin1
+@3. TX 1 @Read a new sequence
+.LEFT MARGIN2
+.para
+It can read 
+sequences stored in either of two formats: 1) all sequences aligned in a 
+single file; 2) all sequences in separate files and accessed through a file 
+of file names. Typical dialogue follows:
+.lit
+ 
+X 1 Read file of aligned sequences
+  2 Use file of file names
+? 0,1,2 =
+ 
+? File of aligned sequences=F1
+Number of files           88
+
+.end lit
+.left margin1
+@4. TX 1 @Define active region
+.LEFT MARGIN2
+.para
+For its analytic functions 
+the program always works on a region of the sequence called the active 
+region. When  new sequences are read into the program the active region is 
+automatically set to start at the beginning of the sequences and go
+up to the end of the longest one.
+.left margin1
+@5. TX 1 @List a sequence
+.LEFT MARGIN2
+.para
+The sequence can be listed with line lengths of 50 bases with each sequence 
+numbered in the order in which they were read.
+Output can be directed to a disk file by 
+first selecting disk output. Typical dialogue follows.
+.lit
+
+? Menu or option number=5
+
+              10        20        30        40        50
+   1  TAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCA
+   2  CAAATAATCAATGTGGACTTTTCTGCCGTGATTATAGACACTTTTGTTAC
+   3  TAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTATT
+   4  ACTAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTA
+   5  AGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGA
+   6  TAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGC
+   7  ACACCATCGAATGGCGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCG
+   8  GGGGCAAGGAGGATGGAAAGAGGTTGCCGTATAAAGAAACTAGAGTCCGT
+   9  AGGGGGTGGAGGATTTAAGCCATCTCCTGATGACGCATAGTCAGCCCATC
+  10  AAAACGTCATCGCTTGCATTAGAAAGGTTTCTGGCCGACCTTATAACCAT
+
+              60
+   1  TACCCGTTTTT
+   2  GCGTTTTTGT
+   3  TCATACCATAAG
+   4  TTTCATACC
+   5  ATTGTGAGC
+   6  TTCCGGCTCG
+   7  GAAGAGAGT
+   8  TCAGGTGT
+   9  ATGAATG
+  10  TAATTACG
+.end lit
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.para
+Allows the user to have a text file displayed on the screen. It will appear 
+one page at a time.
+.left margin1
+@7. TX 1 @Direct output to disk
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears the screen of both text and graphics.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN2
+.para
+ Clears only text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first amino acid to be marked (for example if the 
+active 
+region is 1501 to 8000, the user might wish to mark every 1000th amino 
+acid
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@13. TX 2 @Use crosshair
+.LEFT MARGIN2
+.para
+This function puts
+a steerable cross on the screen that can be used to find the 
+coordinates of points in the sequence. The user can move the cross 
+around using the directional keys; when he hits the space bar the 
+program will print out the coordinates of the cross in sequence units and 
+the option will be exited.
+.para
+If instead, 
+you hit a , the position will be displayed but the cross will remain on 
+the screen.
+.para
+If a letter s is hit the sequence around the cross hair is displayed and 
+the cross remains on the screen.
+.left margin1
+@14. TX 2 @Reposition plots
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "MEPMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+The cross-hair can be used to choose suitable heights.
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+ the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+It is often convenient to plot a map alongside graphed analysis in order 
+to 
+indicate features within the sequence. This function allows users to 
+draw 
+maps using files arranged in the form of EMBL feature tables. Of course 
+the 
+EMBL table are usually only used for nucleic acid sequence annotation 
+but, 
+as long as the features are written in the correct format, they can be 
+employed by this routine. The map is composed of a line representing the 
+sequence and then further lines denoting the endpoints of each feature 
+the 
+user identifies. The user is asked to define height at which the line 
+representing the sequence should be drawn; then for the feature height; 
+then for the features to plot.
+.left margin1
+@17. TX 1 @Search for strings
+.left margin2
+.para
+Search for strings
+perfoms searches of all the sequences for selected words and 
+shows which sequences they are found in. The user types in a word and 
+defines the allowed number of mismatches. The results are listed or 
+plotted. If listed the display includes the sequence number, the position 
+in the sequence and the matching string.
+The results are plotted in the 
+following way. The x axis of the plot represents the length of the aligned
+sequences and the y direction is divided into sufficient strips to accommodate
+each sequence. So if a match is found in the 3rd sequence at a position
+equivalent to halfway along the longest of the sequences then a short 
+vertical line will be drawn at the midpoint of the 3rd strip. If the sequences
+are aligned it can be useful if the motifs happen to appear in 
+related positions. For example see the original publication. Typical 
+dialogue follows.
+.lit
+
+? Menu or option number=17
+X 1 Plot match positions
+  2 Plot histogram of matches
+? 0,1,2 =
+? Word to search for=TTGACA
+? Minimum match (0-6) (6) =5
+? (y/n) (y) Plot results N
+     2    35 TAGACA
+     5    14 TTTACA
+     6    37 TTTACA
+    11    14 TAGACA
+    14    14 TTGACA
+    17    14 GTGACA
+    17    22 TTAACA
+    20     1 TTGACA
+.end lit
+.left margin1
+@18. TX 3 @Set strand
+.left margin2
+.para
+Set strand allows the user to define which strand(s) of the sequences to 
+analyse: input stand, complement of input, or both.
+.left margin1
+@19. TX 3 @Set composition
+.left margin2
+.para
+Set composition gives the user three choices for setting the composition 
+of the sequences for use in the calculation of the information content of
+words. The user can select the overall composition of the sequences as read,
+an even composition, or can type in any other 4 values.
+.left margin1
+@20. TX 3 @Set word length
+.left margin2
+.para
+Set word length sets the length of word for which dictionaries will be made.
+.left margin1
+@21. TX 3 @Set number of mismatches
+.left margin2
+.para
+Set number of mismatches sets the level of fuzziness for the creation of 
+dictionary Dm. 
+.left margin1
+@22. TX 3 @Show settings
+.left margin2
+.para
+Show settings show the current settings for all parameters associated with 
+dictionary analysis. A typical diaplsy follows:
+.lit
+ ? Menu or option number=22
+ Current word length  =   6
+ Number of mismatches =   1
+ Start position       =     1
+ End position         =    63
+ Input strand only
+ Observed composition
+ Dictionary Dw unmade
+ Dictionary Ds unmade
+ Dictionary Dm unmade
+ Dictionary Dh unmade
+.end lit
+.left margin1
+@23. TX 3 @Make dictionary Dw
+.left margin2
+.para
+Make dictionary Dw creates a dictionary that contains a count  of the
+frequency of occurrence of each word in the collected sequences.
+.left margin1
+@24. TX 3 @Make dictionary Ds
+.left margin2
+.para
+Make dictionary Ds creates a dictionary that contains a count of the
+number of different sequences that contain each word.
+.left margin1
+@25. TX 3 @Make dictionary Dm from Dw
+.left margin2
+.para
+Make dictionary Dm  from Dw creates a dictionary from dictionary Dw that
+contains the frequency of occurrence of each word (say X) in Dw plus the
+frequency of occurrence of each word in Dw that differs from X by up to m 
+letters. Dm is called a fuzzy dictionary as it contains the frequencies of
+occurrence of all words plus the frequencies of all the words that are 
+similar to them.
+.left margin1
+@26. TX 3 @Make dictionary Dm from Ds
+.left margin2
+.para
+Make dictionary Dm  from Ds creates a dictionary from dictionary Ds that
+contains the frequency of occurrence of each word (say X) in Ds plus the
+frequency of occurrence of each word in Ds that differs from X by up to m 
+letters. Dm is called a fuzzy dictionary as it contains the frequencies of
+occurrence of all words plus the frequencies of all the words that are 
+similar to them.
+.left margin1
+@27. TX 3 @Make dictionary Dh from Dm
+.left margin2
+.para
+Make dictionary Dh  creates a dictionary from dictionary Dm and whose
+entries are zero except for those words in any set of related words that
+are most frequent. It finds the dominant words in each set of relations 
+and stores their counts.
+.left margin1
+@28. TX 3 @Examine fuzzy dictionary Dm
+.left margin2
+.para
+Examine dictionary Dm allows users to analyse the contents of dictionary
+Dm to find the most common words or those words that contain the most 
+information. The user supplies a frequency or information cutoff and chooses
+to have the results sorted on either value. The program will find the top 100
+words that achieve the cutoff values and present them to the user sorted
+as selected. The information content will be calcutated from either Dw or Ds 
+depending which was used to create Dm, and using the current composition 
+setting. Typical dialogue follows:
+.lit
+
+? Menu or option number=28
+Looking for highest scoring words
+The highest word score =          115
+? Minimum word score (0-115) (0) =60
+? Minimum information (0.00-1.00) (0.00) =.62
+X 1 Sort on information
+  2 Sort on word score
+? 0,1,2 =
+ 
+? Maximum number to list (0-100) (100) =
+ 
+The words are
+ Total words=           9 Maximum information=  0.7385326
+TTGACA      60   0.73850
+AAAAAC      64   0.66460
+AAAAAA      90   0.64880
+GTTTTT      66   0.64300
+TTTTTG      73   0.64070
+TTTTGT      63   0.63820
+TTTTTC      65   0.63810
+AAAATA      63   0.62670
+TATAAT      65   0.62510
+The highest word score =          115
+? Minimum word score (0-115) (0) =60
+? Minimum information (0.00-1.00) (0.00) =.62
+X 1 Sort on information
+  2 Sort on word score
+? 0,1,2 =2
+? Maximum number to list (0-100) (100) =
+ 
+The words are
+ Total words=           9 Maximum information=  0.7385326
+AAAAAA      90   0.64880
+TTTTTG      73   0.64070
+GTTTTT      66   0.64300
+TTTTTC      65   0.63810
+TATAAT      65   0.62510
+AAAAAC      64   0.66460
+TTTTGT      63   0.63820
+AAAATA      63   0.62670
+TTGACA      60   0.73850
+The highest word score =          115
+? Minimum word score (0-115) (0) =!
+
+.end lit
+.left margin1
+@29. TX 3 @Examine fuzzy dictionary Dh
+.left margin2
+.para
+Examine dictionary Dh allows users to analyse the contents of dictionary  Dh
+to find the most common words or those words that contain the most 
+information. The user supplies a frequency or information cutoff and chooses 
+to have the results sorted on either value. The program will find the top 100
+words that achieve the cutoff values and present them to the user sorted as
+selected. The information content will be calcutated from either Dw or Ds 
+depending which was used to create Dh and using the current composition 
+setting. Typical dialogue follows:
+.lit
+
+? Menu or option number=29
+Looking for highest scoring words
+The highest word score =          115
+? Minimum word score (0-115) (0) =60
+? Minimum information (0.00-1.00) (0.00) =.6
+X 1 Sort on information
+  2 Sort on word score
+? 0,1,2 =
+ 
+? Maximum number to list (0-100) (100) =
+ 
+The words are
+ Total words=           4 Maximum information=  0.7385326
+TTGACA      60   0.73850
+AAAAAA      90   0.64880
+TATAAT      65   0.62510
+TTTTTT     115   0.60630
+The highest word score =          115
+? Minimum word score (0-115) (0) =50
+? Minimum information (0.00-1.00) (0.00) =.5
+X 1 Sort on information
+  2 Sort on word score
+? 0,1,2 =
+ 
+? Maximum number to list (0-100) (100) =
+ 
+The words are
+ Total words=           8 Maximum information=  0.7385326
+TTGACA      60   0.73850
+TCTTGA      54   0.66080
+AAAAAA      90   0.64880
+TATAAT      65   0.62510
+ACTTTA      57   0.61960
+TTTTTT     115   0.60630
+AGTATA      51   0.60540
+TTATAA      55   0.59300
+The highest word score =          115
+? Minimum word score (0-115) (0) =50
+? Minimum information (0.00-1.00) (0.00) =
+ 
+X 1 Sort on information
+  2 Sort on word score
+? 0,1,2 =
+ 
+? Maximum number to list (0-100) (100) =
+ 
+The words are
+ Total words=           8 Maximum information=  0.7385326
+TTGACA      60   0.73850
+TCTTGA      54   0.66080
+AAAAAA      90   0.64880
+TATAAT      65   0.62510
+ACTTTA      57   0.61960
+TTTTTT     115   0.60630
+AGTATA      51   0.60540
+TTATAA      55   0.59300
+The highest word score =          115
+? Minimum word score (0-115) (0) =!
+
+.end lit
+.left margin1
+@30. TX 3 @Examine words in Dm
+.left margin2
+.para
+Examine words in Dm allows users to analyse the contents of dictonary Dm at the
+level of individual words to find their frequency, information content, and to
+see their base frequency table. The user types in a word to examine and the
+program displays the values and table. The information content will be 
+calcutated from either Dw or Ds depending which was used to create Dm,
+and using the current composition setting. Typical dialogue follows:
+.lit
+? Menu or option number=30
+? Word to examine=TTGACA
+TtgacA            60  0.7385326
+    56    56     6     7     5    11
+     4     3     2     1    52     1
+     1     4     2    53     3    48
+     3     1    54     3     4     4
+TTGACA
+? Word to examine=TATAAT
+taTAat            65  0.6251902
+    56     3    53     4     4    60
+     6     1     5     5     5     3
+     3    60     5    57    57     4
+     4     5     6     3     3     2
+TATAAT
+? Word to examine=
+
+.end lit
+.left margin1
+@31. TX 3 @Examine words in Dh
+.left margin2
+.para
+Examine words in Dh allows users to analyse the contents of dictonary Dh at the
+level of individual words to find their frequency, information content, and to
+see their base frequency table. The user types in a word to examine and the
+program displays the values and table. The information content will be 
+calcutated from either Dw or Ds depending which was used to create Dm,
+and using the current composition setting. Typical dialogue follows:
+.lit
+
+ ? Menu or option number=31
+? Word to examine=TTGACA
+TtgacA            60  0.7385326
+    56    56     6     7     5    11
+     4     3     2     1    52     1
+     1     4     2    53     3    48
+     3     1    54     3     4     4
+TTGACA
+? Word to examine=TATAAT
+taTAat            65  0.6251902
+    56     3    53     4     4    60
+     6     1     5     5     5     3
+     3    60     5    57    57     4
+     4     5     6     3     3     2
+TATAAT
+? Word to examine=GGGGGG
+gggggg             0  0.6199890
+     3     1     1     2     3     4
+     1     3     1     2     2     1
+     2     1     1     1     1     1
+    11    12    14    12    11    11
+GGGGGG
+? Word to examine=
+
+.end lit
+.left margin1
+@32. TX 3 @Save or restore a dictionary
+.left margin2
+.para
+Save or restore dictionary allows users to write or read any dictionary to 
+and from disk files. The user is asked te define the dictionary and file. The
+function is useful if the machine being used is very slow at calculating 
+because the files can be handled quickly. However note that the files 
+cannot be processed by any other program.
+.left margin1
+@33. TX 1 @Find inverted repeats
+.left margin2
+.para
+Find inverted repeats performs searches for simple inverted repeat sequences 
+in each sequence. They are defined by a range of loop sizes and a minimum 
+number of potential basepairs. The results can be plotted or listed. The x 
+axis of the plot represents the length of the aligned sequences and the y 
+direction is divided into sufficient strips to accommodate each sequence. 
+So if an inverted repeat is found in the 3rd sequence at a position equivalent
+to halfway along the longest of the sequences then a short vertical line will 
+be drawn at the midpoint of the 3rd strip. Alternatively, if the results are
+listed, the potential hairpin loops are drawn out, with the sequence number 
+and the position of the loop. Typical dialogue follows.
+.lit
+
+? Menu or option number=33
+Define the range of loop sizes
+? Minimum loop size (0-10) (3) =0
+? Maximum loop size (1-20) (3) =
+? Minimum number of basepairs (1-20) (6) =
+? (y/n) (y) Plot results N
+ Searching
+
+Sequence     3    34
+           C       
+          G.T      
+          T-A      
+          A-T      
+          T.G      
+          T.G      
+          G.T      
+     ATCTTT TATTTCA
+         33
+
+Sequence     5    35
+           T       
+          G.T      
+          T.G      
+          A-T      
+          T.G      
+          G.T      
+          C-G      
+          T.G      
+     TCCGGC AATTGTG
+         34
+.end lit
+.left margin1
+@ End of help
diff --git a/help/NIP.RNO b/help/NIP.RNO
new file mode 100644
index 0000000..6db18e6
--- /dev/null
+++ b/help/NIP.RNO
@@ -0,0 +1,5116 @@
+.NPA
+.SP 1
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. T   0 @Statistical analysis of content
+.sp
+@-3. X   0 @Statistics
+.sp
+@-4. T   0 @Structures and repeats
+.sp
+@-4. X   0 @Structures
+.sp
+@-5. TX  0 @Translation and codons
+.sp
+@-6. TX  0 @Gene search by content
+.sp
+@-7. TX  0 @General signals
+.sp
+@-8. TX  0 @Specific signals
+.sp
+@0.  TX  -1 @NIP
+.PARA
+.para
+This is a program  for analysing individual nucleotide sequences. It can 
+read sequences stored in many of the most commonly used formats, and 
+performs all of the usual simple analyses. However the main purpose of 
+the program is to provide  methods for finding  the function of each 
+section of a sequence. In general no single method can  give an 
+unequivecal interpretation of a sequence so we need to use many 
+techniques together and to combine  their results. For this reason the 
+program  present many of its  results graphically. 
+.para
+General information is contained in the user interface. Online 
+documentation for any function follows a consistent pattern: summary, 
+list of inputs, list of outputs, details, example.
+.LEFT MARGIN1
+@1. TX 0 @ Help
+.LEFT MARGIN2
+.para
+This option gives online help. The user should select option numbers and
+the current documentation will be given. Note that option 0 gives an
+introduction to the program, and that ? will get help from anywhere in 
+the 
+program.
+The following functions are included:
+.left margin1
+@2. TX 0 @ Quit
+.left margin2
+.para
+This function stops the program.
+.left margin1
+@3. TX 1 @ Read a new sequence
+.LEFT MARGIN2
+.para
+This option allows users to read in new sequences, browse through annotations,
+ or search sequence 
+libraries for keywords. Sequences can be read from "personal" 
+sequence files or from sequence libraries. These are referred to as the 
+sequence "source". Personal files can be stored in several formats:
+Staden, PIR, EMBL, GENBANK and GCG.
+At LMB we use "Staden" format for sequencing and all 
+the 
+libraries are stored in their original formats. Note, however, that libraries
+such as EMBL or GenBank that are divided into several files (eg GenBank has
+13 separate files) are indexed as a whole. This means that users do not need
+to know which file contains an entry, only which library.
+When the user selects to read in a sequence the program first asks for the 
+sequence "source". 
+.para
+If the user selects "personal" the program will ask for 
+the format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of 
+the file. For PIR format the user will also be required to know the entry 
+name of the sequence as the file can contain several. For the other formats
+only a single entry is expected. The file will be read, its length and
+composition will be displayed and the option left.
+.para
+If the user selects "library" as the sequence source the program will display a
+list of available libraries. The programs are capable of handling all current
+libraries but which ones are available will vary from site to site. At LMB we
+have several libraries and also weekly updates of data gathered between releases.
+The program will ask users to select a library and then give a list of options:
+.lit
+
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entrynames from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+
+.end lit
+If get a sequence or get annotations is selected users will be asked to 
+type the entry name. The option will be left when a sequence is selected or 
+! is typed. The composition and length will be displayed.
+.para
+The text index contains all words from feature tables, reference titles,
+definition lines, keywords lists and comments, so the text index search
+is most useful. It is also the fastest. Up to 5 words can be searched for
+at once. The words should be typed separated by spaces, for example
+.lit
+ ? Keywords=P53 mouse murine tumo
+
+.end lit
+will search for all entries that contain words starting with p53, mouse,
+murine and tumo. Only the unique entries that contain ALL words will be 
+listed. Before listing the matching entries
+the program will show the number of 'hits' for each word and ring the bell.
+Escape is possible at this point, or after each screenfull of entries.
+In addition to the entry names the text search displays the primary accession 
+number, the sequence length and up to 80 characters of description.
+(The search of 'titles' is now redundant because the full text index
+contains all the title words and the search is much faster. It will probably
+be removed from the program.)
+All searches are independent of case. Where
+possible the program will offer default entry names.
+.para
+Typical dialogue follows.
+.lit
+Select sequence source
+X  1 Personal file
+   2 Sequence library
+? Selection  (1-2) (1) =
+Select sequence file format
+X  1 Staden
+   2 EMBL
+   3 GenBank
+   4 PIR
+   5 GCG
+? Selection  (1-5) (1) =
+? Sequence file name=M13MP7.SEQ
+ Contig title removed
+Sequence length=  7238
+ Sequence composition
+          T          C          A          G          -
+      2405.      1539.      1765.      1527.         2.
+        33.2%      21.3%      24.4%      21.1%       0.0%
+  .
+  .
+  .
+
+
+ Select sequence source
+ X  1 Personal file
+    2 Sequence library
+ ? Selection  (1-2) (1) =2
+ Select a library
+ X  1 EMBL 29 nucleotide library Dec 91
+    2 SWISSPROT 20 protein library Nov 91
+    3 PIR 31 protein library Dec 91
+    4 NRL3D 58 From Brookhaven protein library Dec 91
+    5 GenBank
+ ? Selection  (1-5) (1) =
+Library is in EMBL format with indexes
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =5
+ Search for keywords
+ ? Keywords=P53 mouse
+P53 hits  68
+MOUSE hits  8180
+
+ MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+ MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+ MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+ MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+ MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+ MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+ MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+ MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+ MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+ MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+ MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+ MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+ MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+ MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+ MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+ MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+ MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+ MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+ MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+ MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+ MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+ MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+      22 different entries found
+
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =4
+ Search for keywords
+ ? Keywords=alpha
+ Searching for alpha
+ AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+ AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+ AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+ AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+ ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+ ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+ ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+ ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+ AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+ AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+ AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+ AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+ AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+ AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+ AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+ AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =3
+ ? Accession number=v00636
+Entry name LAMBDA
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =2
+ Default Entry name=LAMBDA
+ ? Entry name=
+ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+XX
+AC   V00636; J02459; M17233; X00906;
+XX
+DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+DT   09-JUN-1982 (Rel. 1, Created)
+XX
+DE   Genome of the bacteriophage lambda (Styloviridae).
+XX
+KW   circular; coat protein; DNA binding protein; genome;
+KW   origin of replication.
+XX
+OS   Bacteriophage lambda
+OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+XX
+RN   [1]
+RP   1-48502
+RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+RT   "Nucleotide sequence of bacteriophage lambda DNA";
+RL   J. Mol. Biol. 162:729-773(1982).
+XX
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =
+ Default Entry name=LAMBDA
+ ? Entry name=
+DE   Genome of the bacteriophage lambda (Styloviridae).
+ Sequence length  48502
+ Sequence composition
+           T          C          A          G          -
+      11988.     11360.     12336.     12818.         0.
+         24.7%      23.4%      25.4%      26.4%       0.0%
+
+.end lit
+.left margin1
+@4. TX 1 @ Define active region
+.LEFT MARGIN2
+.para
+For its analytic functions 
+the program always works on a region of the sequence called the "active 
+region". This function allows the start and end points of the active region 
+to be reset. 
+.para
+Define  the required start and end points.
+.para
+When a new sequence is read into the program the active region is 
+automatically set to start at the beginning of the sequence and extend  to 
+the 
+maximum the program can 
+handle. On most machines this will be to the end of the sequence. The 
+positions are shown on the screen.
+ Note that for 
+convenience, in the 
+listing and translation functions, the user is given access to regions 
+outside the active region.
+.left margin1
+@5. TX 1 @ List a sequence
+.LEFT MARGIN2
+.para
+The sequence can be listed single or double stranded with line lengths 
+from 
+10 to 120 in multiples of 10.
+.para
+Define the region to list, the line length required and choose between a 
+single or double stranded display.
+The output looks like:
+.lit
+
+  GTTAATGTAG CTTAATAACA AAGCAAAGCA CTGAAAATGC TTAGATGGAT
+  CAATTACATC GAATTATTGT TTCGTTTCGT GACTTTTACG AATCTACCTA
+          10         20         30         40         50
+ 
+  AATTGTATCC CATAAACACA AAGGTTTGGT CCTGGCCTTA TAATTAATTA
+  TTAACATAGG GTATTTGTGT TTCCAAACCA GGACCGGAAT ATTAATTAAT
+          60         70         80         90        100
+ 
+  GAGGTAAAAT TACACATGCA AACCTCCATA GACCGGTGTA AAATCCCTTA
+  CTCCATTTTA ATGTGTACGT TTGGAGGTAT CTGGCCACAT TTTAGGGAAT
+         110        120        130        140        150
+ 
+  AACATTTACT TAAAATTTAA GGAGAGGGTA TCAAGCACAT TAAAATAGCT
+  TTGTAAATGA ATTTTAAATT CCTCTCCCAT AGTTCGTGTA ATTTTATCGA
+         160        170        180        190        200
+ 
+.end lit
+.left margin1
+@6. TX 1 @ List a text file.
+.LEFT MARGIN2
+.para
+Allows the user to have a text file displayed on the screen. It will appear 
+one page at a time.
+.para
+Supply the name of the file to be displayed.
+.left margin1
+@7. TX 1 @ Direct output to disk
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@8. TX 1 @ Write active region to disk
+.LEFT MARGIN2
+.para
+Used to write the current active section of sequence to a disk file in 
+"Staden format".
+.para
+Supply a file name and an optional title.
+.para
+The program has the capability of reading sequences stored in several 
+formats and so, in conjunction with this option, can be used to reformat 
+them. 
+.left margin1
+@9. TX 1 @ Edit the sequence
+.LEFT MARGIN2
+.para
+Used to edit sequences or any other files by giving access to the 
+computers system editor. For editing sequences the input file should  
+have already been created using one of the listing functions such as "list 
+sequence", "list translation" or "list restriction sites above the 
+sequence".
+.para
+Supply the name of the file to edit. Wait while the system editor is made 
+ready (can take awhile on a vax). Use the editor. Exit from the editor. If a 
+sequence has been edited, and you want to process it, affirm that the 
+sequence should be "made active". The edited sequence will replace the 
+original sequence. 
+.para
+This editing method is designed to give users access to an editor with 
+which they are familiar - i.e. the one on their machine, and yet to allow 
+them to edit a sequence which contains all the landmarks they need in 
+order to know where they are. Users can create files containing simple 
+listings (single stranded) with numbering, using "list the sequence", and 
+then edit them with their system editor, using the numbering to know 
+where they are within the sequence. When the edits are complete they 
+exit from the editor and the program "analyses" the edited file to extract 
+only the sequence characters. Similarly a file containing a three phase 
+tranlslation can be edited, or a file containing a sequence plus its three 
+phase translation, plus its restriction sites marked above the sequence. 
+In order to be able to "analyse" such complicated listings and correctly 
+extract the sequence the following simple rule is used: all lines in the 
+file that contain a character that is not A,C,T,G or U are deleted. It is 
+obviously important to be aware of this rule and its implications.
+.left margin1
+@10. TX 2 @ Clear graphics
+.LEFT MARGIN1
+.para
+ Clears graphics from the screen.
+.left margin1
+@11. TX 2 @ Clear text
+.LEFT MARGIN1
+.para
+ Clears  text from the screen.
+.left margin1
+@12. TX 2 @ Draw a ruler
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first base to be marked (for example if the active 
+region is 1501 to 8000, the user might wish to mark every 1000th base 
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@13. TX 2 @ Use crosshair
+.LEFT MARGIN2
+.para
+This function puts
+a steerable cross on the screen that can be used to find the 
+coordinates of points in the sequence. The user can move the cross 
+around using the directional keys; when he hits the space bar the 
+program will print out the coordinates of the cross in sequence units and 
+the option will be exited.
+.PARA
+If instead, 
+you hit a , the position will be displayed but the cross will remain on 
+the screen.
+.PARA
+If a letter s is hit the program will display the sequence around the 
+crosshair 
+position, and leave the cross on the screen.
+.left margin1
+@14. TX 2 @ Reposition plots
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "NIPMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+The cross-hair can be used to choose suitable heights.
+.LEFT MARGIN1
+@15. TX 2 @ Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=15
+Type label then drive cross hair to left or right end
+of label position then hit  "L"  to  write label left
+justified or  "R"  to  write label right justified or
+the space bar to quit
+ 
+ 
+? Label=delta gene
+
+ missing graphics 
+
+? Label=
+ 
+.end lit
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+This draws a map 
+of any sequence features selected by the user.
+These features may be protein coding regions (CDS), tRNA genes (TRNA), 
+promoter positions (PRM), etc. Users may define their own feature table 
+key 
+names. For example I find it convenient to split CDS lines into CDS1, 
+CDS2 
+and CDS3 each of which contains only those sequences that code in the 
+reading frames 1, 2 or 3. Then I can plot them at different heights on 
+the screen ( suitable heights can be determined by using the cross-hair).
+.para
+The coordinates must be stored in a file in the format of an EMBL or GenBank
+feature table. Note that this means that the file must include either EMBL
+or GenBank headers, and a suitable "tail". The simplest header is the word
+FEATURES starting in column 1 of the first line of the file. The simplest
+tail is 2 empty lines at the end of the file. These lines are not included
+when nip writes out results in feature table format.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=16
+ Display a map using an EMBL feature table file
+? map file name=hsegl1.ft
+? feature code(e.g. CDS) =CDS
+X 1 + strand
+  2 - strand
+  3 both strands
+? 0,1,2,3 =
+? level (0-9480) (256) =4000
+
+ missing graphics 
+ 
+? feature code(e.g. CDS) =
+
+.end lit
+.left margin1
+@17. TX 1 @ Search for restriction enzymes
+.LEFT MARGIN2
+.para
+This routine is used to search for short sequences, like restriction 
+enzyme 
+recognition sequences, 
+and can either list  the results or present them graphically. Listings can 
+take several forms and can include the sequence and its translation. 
+Examples are given below. The program will also display the names of 
+enzymes that cut the sequence infrequently. Users can select from sets 
+of enzymes stored in files or can enter them from the keyboard. 
+.para
+The short 
+sequences (strings) and their names need to be arranged in a particular 
+way. See below. Select to search, list an enzyme file or clear the screen. 
+Choose either a file of enzymes or to enter their recognition sequences at the 
+keyboard. Choose to search for all the enzymes in the list or to select 
+from the list. Select a mode of output. Define the sequence as circular or 
+linear. Select to search for "definite" or "possible" matches. The search 
+starts, and after the results have been displayed, further searches can be 
+performed.
+.para
+When the enzymes and their recognition sequences are stored in a file 
+they must be defined in the following way. We 
+call the recognition sequences "strings".
+The format is as follows: each string or set of strings must be 
+preceded by a name, each string must be preceded and 
+terminated with a slash (/), and 
+each set of strings by 2 slashes. 
+For example 
+AATII/GACGT'C// defines the name AATII, its recognition sequence 
+GACGTC 
+and its cut site with the ' symbol; ACCI/GT'MKAC// defines the name 
+ACCI 
+and its recognition sequence includes IUB symbols for incompletely 
+defined 
+symbols in nucleic acid sequences; 
+BBVI/GCAGCNNNNNNNN'/'NNNNNNNNNNNNGCTGC// 
+defines the name BBVI and this time two recognition sequences and cut 
+sites 
+are specified in order to correctly show the cutting position relative to 
+the recognition sequence. If no cut site is included the first base of the 
+recognition sequence is displayed as being on the 3' side of the 
+recognition sequence. 
+.para
+These collections of strings and their 
+names can be read from disk or entered from the keyboard.
+When names and strings are entered from the keyboard the program will ask 
+for the name and then the string(s). If more than one string is typed per 
+name they must be separated by slash (/) characters. See the "Typical 
+dialogue" below.
+ Three files 
+containing restriction enzyme recognition sequences are currently 
+available. The "all enzymes" file contains the Rich Roberts REBASE 
+restriction enzyme database, which is updated monthly.
+.para
+The user can select strings 
+by name from these collections. If so the program will prompt for the 
+names, one at a time. The user can continue to select names until a blank 
+name is entered (by the user typing only return).
+.para
+ Listed output can be displayed in several ways: it 
+can be ordered enzyme by enzyme, or on cut positions, or with enzyme 
+names 
+written above a listing of the sequence. This last listing can also include 
+a three phase translation of the sequence. In addition the program will 
+display only infrequent cutters (the user defines the minimum number of 
+cuts), or can plot the positions of matches.
+.para
+Listings sorted "enzyme by enzyme" have the following form:
+.lit
+
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 AATII                 GACGT'C                  112     111     111
+                                                            912     912
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 ACCI                  GT'CGAC                  112     111     111
+   2 ACCI                  GT'AGAC                  420     308     308
+                                                            604     604
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 AHAII                 GA'CGTC                  109     108      90
+   2 AHAII                 GG'CGTC                  199      90     108
+                                                            825     825
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 AVAII                 G'GACC                    84      83      51
+   2 AVAII                 G'GTCC                   973     889      83
+                                                             51     889
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BALI                  TGG'CCA                  258     257     257
+                                                            766     766
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BAMHI                 G'GATCC                   92      91      91
+
+   ......   etc
+ 
+Listings sorted on cut position have the following form:
+
+ Searching
+     Name                  Sequence            Position  Fragment lengths
+   1 ECORI                 G'AATTC                    2       1
+   2 BANI                  G'GTGCC                   26      24
+   3 BSP1286               GTGCC'C                   31       5
+   4 BBVI                  'TACTGCGCCGCAGCTGC        38       7
+   5 NSPBII                CAG'CTG                   51      13
+   6 PVUII                 CAG'CTG                   51       0
+   7 BBVI                  GCAGCTGCTGGTG'            60       9
+   8 HINCII                GTC'AAC                   80      20
+   9 AVAII                 G'GACC                    84       4
+  10 BINI                  'CCAGGGATCC               87       3
+  11 BSTNI                 CC'AGG                    89       2
+  12 BAMHI                 G'GATCC                   92       3
+  13 XHOII                 G'GATCC                   92       0
+  14 NSPBII                CCG'CTG                   98       6
+  15 BINI                  GGATCCGCT'               100       2
+  16 AHAII                 GA'CGTC                  109       9
+  17 SALI                  G'TCGAC                  111       2
+  18 AATII                 GACGT'C                  112       1
+  19 ACCI                  GT'CGAC                  112       0
+  20 HINCII                GTC'GAC                  113       1
+  21 BBVI                  GCAGCGACTGATT'           166      53
+  22 BINI                  'ACTCAGATCC              178      12
+  23 XHOII                 A'GATCC                  183       5
+  24 HGAI                  'GGCGGCGGAGGCGTC         188       5
+
+  .....etc
+
+Lists of infrequent cutters have the following form:
+
+     0 AFLII
+     0 AFLIII
+     0 APAI
+     0 APALI
+     0 ASUII
+     0 AVAI
+     0 AVRII
+     0 BCLI
+     0 BGLI
+     0 BGLII
+     0 BSMI
+     0 BSPMII
+     0 BSTEII
+  ...... etc
+ 
+ Listings showing names above the sequence, and a translation have the 
+following form:
+
+
+ ECORI                   BANI BSP1286
+ .                       .    .      BBVI         NSPBII
+ .                       .    .      .            PVUII    BBVI
+GAATTCGGTTTGGGCTTGGTGTGAGGTGCCCAGAGATTACTGCGCCGCAGCTGCTG
+GTGC
+        10        20        30        40        50        60
+ E  F  G  L  G  L  V  *  G  A  Q  R  L  L  R  R  S  C  W  C
+  N  S  V  W  A  W  C  E  V  P  R  D  Y  C  A  A  A  A  G  A
+   I  R  F  G  L  G  V  R  C  P  E  I  T  A  P  Q  L  L  V  L
+ 
+                   HINCII
+                   .   AVAII
+                   .   .  BINI
+                   .   .  . BSTNI
+                   .   .  . .  BAMHI
+                   .   .  . .  XHOII NSPBII
+                   .   .  . .  .     . BINI     AHAII
+                   .   .  . .  .     . .        . SALI
+                   .   .  . .  .     . .        . .AATII
+                   .   .  . .  .     . .        . .ACCI
+                   .   .  . .  .     . .        . ..HINCII
+TGGCGGTGCGGAGGTCGTCAACGGACCCAGGGATCCGCTGGACGAGGACGTCGACG
+ACGA
+        70        80        90       100       110       120
+ W  R  C  G  G  R  Q  R  T  Q  G  S  A  G  R  G  R  R  R  R
+  G  G  A  E  V  V  N  G  P  R  D  P  L  D  E  D  V  D  D  E
+   A  V  R  R  S  S  T  D  P  G  I  R  W  T  R  T  S  T  T  R
+ 
+                                             BBVI        BINI
+GGAGGAGGTGGATAGCGCATTGCTGGTGGCTGGCAGCGACTGATTTGAGTTCTGAC
+CACT
+       130       140       150       160       170       180
+ G  G  G  G  *  R  I  A  G  G  W  Q  R  L  I  *  V  L  T  T
+  E  E  V  D  S  A  L  L  V  A  G  S  D  *  F  E  F  *  P  L
+   R  R  W  I  A  H  C  W  W  L  A  A  T  D  L  S  S  D  H  S
+ 
+  XHOII
+  .    HGAI       AHAII                      PFIMI
+  .    .          .                          .   BBVI
+CAGATCCGGCGGCGGAGGCGTCGAGGCTCCCGAAACTCCCAGTGGCTGGCCTGCTA
+GATT
+       190       200       210       220       230       240
+ Q  I  R  R  R  R  R  R  G  S  R  N  S  Q  W  L  A  C  *  I
+  R  S  G  G  G  G  V  E  A  P  E  T  P  S  G  W  P  A  R  F
+   D  P  A  A  E  A  S  R  L  P  K  L  P  V  A  G  L  L  D  S
+
+   .........etc
+ 
+.end lit
+.para
+The terms "possible" and "definite" matches are important only for back 
+translations of protein into DNA, and which include IUB redundancy codes.
+Those matches that the program terms "definite matches" and are ones in 
+which the specification of the recognition sequence corresponds 
+exactly to that of the back translation, and consequently are definitely in 
+the DNA sequence. The program will also find what it 
+terms 'possible matches' which are ones that depend on the particular 
+codons
+chosen for each amino acid.
+These are sites at which recognition 
+sequences could be engineered to produce a cut in the DNA 
+without changing the amino 
+acid, but which are not 
+necessarily found in the original sequence.
+.para
+The routine will handle both linear and circular sequences, and 
+so finds cutsites spanning the "ends" of circular sequences.
+ The program will only find cutsites spanning the 
+ends of sequences if the sequence is declared as circular.
+This includes sites for 
+recognition sequences containing leading or trailing N symbols, in which 
+the actual recognition sequence does not span the join. For example if the 
+recognition sequence was 'NNNNACGT and the first 4 characters in the 
+sequence were ACGT, then the match would only be found if the sequence 
+was 
+declared as circular. If the sequence is linear then the first fragment 
+starts at base number 1, and the last ends at the last base. If the 
+sequence is circular then the length of the first fragment is the 
+clockwise
+distance from the last cut to the first.
+.para
+Graphical output marks the position of each string by a 
+short vertical line and gives the name of the enzyme at the left end of 
+the 
+line. If the top of the screen is reached the program gives the user the 
+oportunity to take a hard copy and then will clear the screen and restart
+plotting results at the original start position.
+.para
+Below is an edited piece of dialogue from use of the search option:
+.lit
+? Menu or option number=17
+ 
+Search for restriction enzyme sites
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 = 2
+ 
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =
+ 
+AATII/GACGT'C//
+ACCI/GT'MKAC//
+AFLII/C'TTAAG//
+AFLIII/A'CRYGT//
+AHAII/GR'CGYC//
+APAI/GGGCC'C//
+APALI/G'TGCAC//
+ASUII/TT'CGAA//
+AVAI/C'YCGRG//
+AVAII/G'GWCC//
+AVRII/C'CTAGG//
+BALI/TGG'CCA//
+BAMHI/G'GATCC//
+BANI/G'GYRCC//
+BANII/GRGCY'C//
+BBVI/GCAGCNNNNNNNN'/'NNNNNNNNNNNNGCTGC//
+BCLI/T'GATCA//
+BGLI/GCCNNNN'NGGC//
+BGLII/A'GATCT//
+BINI/GGATCNNNN'/'NNNNNGATCC//
+BSMI/GAATGCN'/NG'CATTC//
+BSP1286/GDGCH'C//
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =
+? (y/n) (y) Search for all names
+X 1 Order results enzyme by enzyme
+  2 Order results by position
+  3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 =
+? (y/n) (y) List matches
+? (y/n) (y) The sequence is linear
+? (y/n) (y) Search for definite matches
+ 
+ Searching
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 AATII                 GACGT'C                  112     111     111
+                                                            912     912
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 ACCI                  GT'CGAC                  112     111     111
+   2 ACCI                  GT'AGAC                  420     308     308
+                                                            604     604
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 AHAII                 GA'CGTC                  109     108      90
+   2 AHAII                 GG'CGTC                  199      90     108
+                                                            825     825
+ Matches found=     2
+     Name                  Sequence            Position  Fragment lengths
+   1 AVAII                 G'GACC                    84      83      51
+   2 AVAII                 G'GTCC                   973     889      83
+                                                             51     889
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BALI                  TGG'CCA                  258     257     257
+                                                            766     766
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BAMHI                 G'GATCC                   92      91      91
+                                                            932     932
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BANI                  G'GTGCC                   26      25      25
+                                                            998     998
+ Matches found=     1
+     Name                  Sequence            Position  Fragment lengths
+   1 BANII                 GAGCC'C                  490     489     489
+                                                            534     534
+ Matches found=    11
+     Name                  Sequence            Position  Fragment lengths
+   1 BBVI                  'TACTGCGCCGCAGCTGC        38      37       3
+   2 BBVI                  GCAGCTGCTGGTG'            60      22      22
+   3 BBVI                  GCAGCGACTGATT'           166     106      28
+   4 BBVI                  'CCTGCTAGATTCGCTGC       230      64      37
+   5 BBVI                  GCAGCGGTACGTA'           452     222      50
+   6 BBVI                  'CTCGCCAACGTTGCTGC       502      50      55
+   7 BBVI                  GCAGCCTTCAACT'           606     104      64
+   8 BBVI                  'GAGGTATTCCTGGCTGC       634      28      97
+   9 BBVI                  'CTGGCCGCCGCCGCTGC       869     235     104
+  10 BBVI                  'GCCGCCGCCGCTGCTGC       872       3     106
+  11 BBVI                  GCAGCGATGAGGA'           927      55     222
+
+  ....etc
+
+ X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =
+ 
+? (y/n) (y) Search for all names
+ 
+X 1 Order results enzyme by enzyme
+  2 Order results by position
+  3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 = 2
+ 
+? (y/n) (y) List matches
+? (y/n) (y) The sequence is linear
+? (y/n) (y) Search for definite matches
+ 
+ Searching
+     Name                  Sequence            Position  Fragment lengths
+   1 ECORI                 G'AATTC                    2       1
+   2 BANI                  G'GTGCC                   26      24
+   3 BSP1286               GTGCC'C                   31       5
+   4 BBVI                  'TACTGCGCCGCAGCTGC        38       7
+   5 NSPBII                CAG'CTG                   51      13
+   6 PVUII                 CAG'CTG                   51       0
+   7 BBVI                  GCAGCTGCTGGTG'            60       9
+   8 HINCII                GTC'AAC                   80      20
+   9 AVAII                 G'GACC                    84       4
+  10 BINI                  'CCAGGGATCC               87       3
+  11 BSTNI                 CC'AGG                    89       2
+  12 BAMHI                 G'GATCC                   92       3
+  13 XHOII                 G'GATCC                   92       0
+  14 NSPBII                CCG'CTG                   98       6
+  15 BINI                  GGATCCGCT'               100       2
+  16 AHAII                 GA'CGTC                  109       9
+  17 SALI                  G'TCGAC                  111       2
+  18 AATII                 GACGT'C                  112       1
+  19 ACCI                  GT'CGAC                  112       0
+  20 HINCII                GTC'GAC                  113       1
+
+  .....etc 
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+ 
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =
+ 
+? (y/n) (y) Search for all names
+ 
+  1 Order results enzyme by enzyme
+X 2 Order results by position
+  3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 =3
+? Maximum number of cuts (0-100) (0) =
+? (y/n) (y) The sequence is linear
+? (y/n) (y) Search for definite matches
+ 
+ Searching
+     0 AFLII
+     0 AFLIII
+     0 APAI
+     0 APALI
+     0 ASUII
+     0 AVAI
+     0 AVRII
+     0 BCLI
+     0 BGLI
+     0 BGLII
+     0 BSMI
+     0 BSPMII
+     0 BSTEII
+     0 CLAI
+     0 DRAI
+     0 DRAII
+     0 ECOB
+     0 ECOK
+     0 ECORV
+     0 ESPI
+
+   ......etc 
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+ 
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =
+ 
+? (y/n) (y) Search for all names
+ 
+  1 Order results enzyme by enzyme
+  2 Order results by position
+X 3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 =4
+? (y/n) (y) Hide translation n
+? (y/n) (y) Use 1 letter codes
+? Line length (30-90) (60) =
+? (y/n) (y) The sequence is linear
+? (y/n) (y) Search for definite matches
+ 
+ Searching
+ ECORI                   BANI BSP1286
+ .                       .    .      BBVI         NSPBII
+ .                       .    .      .            PVUII    BBVI
+GAATTCGGTTTGGGCTTGGTGTGAGGTGCCCAGAGATTACTGCGCCGCAGCTGCTG
+GTGC
+        10        20        30        40        50        60
+ E  F  G  L  G  L  V  *  G  A  Q  R  L  L  R  R  S  C  W  C
+  N  S  V  W  A  W  C  E  V  P  R  D  Y  C  A  A  A  A  G  A
+   I  R  F  G  L  G  V  R  C  P  E  I  T  A  P  Q  L  L  V  L
+ 
+                   HINCII
+                   .   AVAII
+                   .   .  BINI
+                   .   .  . BSTNI
+                   .   .  . .  BAMHI
+                   .   .  . .  XHOII NSPBII
+                   .   .  . .  .     . BINI     AHAII
+                   .   .  . .  .     . .        . SALI
+                   .   .  . .  .     . .        . .AATII
+                   .   .  . .  .     . .        . .ACCI
+                   .   .  . .  .     . .        . ..HINCII
+TGGCGGTGCGGAGGTCGTCAACGGACCCAGGGATCCGCTGGACGAGGACGTCGACG
+ACGA
+        70        80        90       100       110       120
+ W  R  C  G  G  R  Q  R  T  Q  G  S  A  G  R  G  R  R  R  R
+  G  G  A  E  V  V  N  G  P  R  D  P  L  D  E  D  V  D  D  E
+   A  V  R  R  S  S  T  D  P  G  I  R  W  T  R  T  S  T  T  R
+ 
+                                             BBVI        BINI
+GGAGGAGGTGGATAGCGCATTGCTGGTGGCTGGCAGCGACTGATTTGAGTTCTGAC
+CACT
+       130       140       150       160       170       180
+ G  G  G  G  *  R  I  A  G  G  W  Q  R  L  I  *  V  L  T  T
+  E  E  V  D  S  A  L  L  V  A  G  S  D  *  F  E  F  *  P  L
+   R  R  W  I  A  H  C  W  W  L  A  A  T  D  L  S  S  D  H  S
+
+ .......etc
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =5
+Define search strings by typing a string name
+followed by the string(s)
+? Name=FRED
+? String(s)=AAAAAA/TTTTTT
+? Name=MARY
+? String(s)=CCCC/GGGG/GCGCT
+? Name=
+? (y/n) (y) Search for all names 
+X 1 Order results enzyme by enzyme
+  2 Order results by position
+  3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 =
+? (y/n) (y) List matches 
+? (y/n) (y) The sequence is linear 
+? (y/n) (y) Search for definite matches 
+ Searching
+ Matches found=     9
+     Name                  Sequence            Position  Fragment lengths
+   1 FRED                  'TTTTTT                 1557    1556       1
+   2 FRED                  'TTTTTT                 1558       1       1
+   3 FRED                  'TTTTTT                 1559       1       1
+   4 FRED                  'TTTTTT                 1560       1      22
+   5 FRED                  'AAAAAA                 1582      22     529
+   6 FRED                  'AAAAAA                 3160    1578    1019
+   7 FRED                  'AAAAAA                 4204    1044    1044
+   8 FRED                  'AAAAAA                 5691    1487    1487
+   9 FRED                  'AAAAAA                 6710    1019    1556
+                                                            529    1578
+ Matches found=    36
+     Name                  Sequence            Position  Fragment lengths
+   1 MARY                  'CCCC                     47      46       1
+   2 MARY                  'GGGG                    486     439       1
+   3 MARY                  'GGGG                    487       1       1
+   4 MARY                  'CCCC                    557      70       1
+   5 MARY                  'CCCC                    558       1       1
+   6 MARY                  'GCGCT                  1177     619       1
+
+  ... etc
+
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+  1 All enzymes
+X 2 Six cutters
+  3 Four cutters
+  4 Personal file
+  5 Keyboard
+? 0,1,2,3,4,5 =5
+Define search strings by typing a string name
+followed by the string(s)
+? Name=JANE
+? String(s)=A'TTTT/CC'GGG
+? Name=
+? (y/n) (y) Search for all names 
+X 1 Order results enzyme by enzyme
+  2 Order results by position
+  3 Show only infrequent cutters
+  4 Show names above the sequence
+? 0,1,2,3,4 =
+? (y/n) (y) List matches 
+? (y/n) (y) The sequence is linear 
+? (y/n) (y) Search for definite matches 
+ Searching
+ Matches found=    30
+     Name                  Sequence            Position  Fragment lengths
+   1 JANE                  A'TTTT                   437     436       6
+   2 JANE                  A'TTTT                   546     109      33
+   3 JANE                  A'TTTT                   597      51      43
+   4 JANE                  A'TTTT                   777     180      51
+   5 JANE                  A'TTTT                  1274     497      60
+   6 JANE                  A'TTTT                  1571     297      62
+   7 JANE                  CC'GGG                  1926     355      75
+   8 JANE                  A'TTTT                  2403     477      81
+   9 JANE                  A'TTTT                  2586     183      82
+  10 JANE                  A'TTTT                  2731     145     101
+  11 JANE                  A'TTTT                  2812      81     103
+
+ ... etc
+
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =!
+.end lit
+ 
+.left margin1
+@18. TX 1 7 @ Compare a short sequence
+.LEFT MARGIN2
+.para
+This routine slides a short sequence along the current sequence and finds 
+all positions at which a given percentage of the bases match.
+Output is in both graphical and listed forms. 
+.para
+If  users call for dialogue when the routine is selected they will be given 
+the choice of keyboard or file input. Define the string, select the "sense" 
+to use and the percentage match. Matches will be plotted out and then the 
+user can select to have them listed. Then the routine cycles around.
+.para
+ The routine slides the search string 
+along the  sequence and marks the positions at which a minimum 
+percentage score is reached. The graphical output draws a vertical line at 
+the match position; the height of the line represents the percentage 
+score, 
+so that if the line reaches the top of the box the score is 100%.
+The NC-IUB symbols may be used in the search sequence to encode 
+uncertain 
+characters. Any other symbols will not match.
+.LIT
+
+
+            NC-IUB SYMBOLS
+
+      A,C,G,T
+      R        (A,G)        'puRine'
+      Y        (T,C)        'pYrimidine'
+      W        (A,T)        'Weak'
+      S        (C,G)        'Strong'
+      M        (A,C)        'aMino'
+      K        (G,T)        'Keto'
+      H        (A,T,C)      'not G'
+      B        (G,C,T)      'not A'
+      V        (G,A,C)      'not T'
+      D        (G,A,T)      'not C'
+      N        (G,A,C,T)    'aNy'
+
+ Typical dialogue is shown below.
+
+
+? Menu or option number=18
+ Find percentage matches
+? (y/n) (y) Keep picture
+? String=AAATTTCCC
+STRING=AAATTTCCC
+? (y/n) (y) This sense
+? Percent match (1.00-100.00) (70.00) =
+ 
+ Missing graphics display here
+ 
+Total scoring positions above 70.000 percent =   7
+Scores         7      6      6      6      6      6      6
+Positions    365    212    213    292    311    358    627
+? Display (0-7) (0) =3
+ 
+       365
+         ACATTTCGC
+         * ***** *
+         AAATTTCCC
+         1
+ 
+       212
+         GAAACTCCC
+          **  ****
+         AAATTTCCC
+         1
+ 
+       213
+         AAACTCCCA
+         *** * **
+         AAATTTCCC
+         1
+? (y/n) (y) Keep picture
+Default String=AAATTTCCC
+? String=
+STRING=AAATTTCCC
+? (y/n) (y) This sense n
+STRING=GGGAAATTT
+? Percent match (1.00-100.00) (70.00) =
+ 
+ Missing graphics display here
+
+Total scoring positions above 70.000 percent =   7
+Scores         6      6      6      6      6      6      6
+Positions    269    270    271    288    354    624    853
+? Display (0-7) (0) =3
+ 
+       269
+         GAGGGATTT
+         * *  ****
+         GGGAAATTT
+         1
+ 
+       270
+         AGGGATTTT
+          ** * ***
+         GGGAAATTT
+         1
+ 
+       271
+         GGGATTTTC
+         ****  **
+         GGGAAATTT
+         1
+? (y/n) (y) Keep picture !
+
+.end lit
+.left margin1
+@19. TX 7 @ Compare a short sequence using a score matrix
+.LEFT MARGIN2
+.para
+This routine slides a short sequence along the current sequence and finds 
+all positions at which a given level of similarity (a cutoff score) is 
+reached. The score is defined by use of a score matrix. Output is in both 
+graphical and listed forms. 
+.para
+If  users call for dialogue when the routine is selected they will be given 
+the choice of keyboard or file input. Define the string, select the "sense" 
+to use and the cutoff score. Matches will be plotted out and then the user 
+can select to have them listed. Then the routine cycles around.
+.para
+ The routine slides the search string 
+along the  sequence and marks the positions at which a the cutoff score 
+is achieved. The graphical output draws a vertical line at 
+the match position; the height of the line represents the  score, 
+so that if the line reaches the top of the box the score is the maximum 
+possible.
+The NC-IUB symbols may be used in the search sequence to encode 
+uncertain 
+characters.
+.para 
+ The score matrix reflects the level of 
+redundancy in the probe sequence and hence will put more emphasis on 
+those 
+characters that are better defined. The score matrix is:
+.lit
+             DNA SCORE MATRIX USING IUB SYMBOLS
+
+        T  C  A  G  -  R  Y  W  S  M  K  H  B  V  D  N  ?
+
+   T   36  0  0  0  9  0 18 18  0  0 18 12 12  0 12  9  0 
+   C    0 36  0  0  9  0 18  0 18 18  0 12 12 12  0  9  0 
+   A    0  0 36  0  9 18  0 18  0 18  0 12  0 12 12  9  0 
+   G    0  0  0 36  9 18  0  0 18  0 18  0 12 12 12  9  0 
+   -    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0 
+   R    0  0 18 18 18 36  0  9  9  9  9  6  6 12 12 18  0 
+   Y   18 18  0  0 18  0 36  9  9  9  9 12 12  6  6 18  0 
+   W   18  0 18  0 18  9  9 36  0  9  9 12  6  6 12 18  0 
+   S    0 18  0 18 18  9  9  0 36  9  9  6 12 12  6 18  0 
+   M    0 18 18  0 18  9  9  9  9 36  0 12  6 12  6 18  0 
+   K   18  0  0 18 18  9  9  9  9  0 36  6 12  6 12 18  0 
+   H   12 12 12  0 27  6 12 12  6 12  6 36  8  8  8 27  0 
+   B   12 12  0 12 27  6 12  6 12  6 12  8 36  8  8 27  0 
+   V    0 12 12 12 27 12  6  6 12 12  6  8  8 36  8 27  0 
+   D   12  0 12 12 27 12  6 12  6  6 12  8  8  8 36 27  0 
+   N    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0 
+   ?    0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0 
+
+  ? is any unrecognised character.
+
+  Typical dialogue is shown below.
+
+? Menu or option number=19
+ Find matches using a score matrix
+? (y/n) (y) Keep picture
+? String=AAATTTCCC
+STRING=AAATTTCCC
+? (y/n) (y) This sense
+Minimum score=     0 Maximum score=   324
+? Score (0-324) (280) =250
+
+ Missing graphics display here
+ 
+For score   250 the number of matches=     1
+Scores       252
+Positions    365
+? Display (0-1) (0) =1
+ 
+       365
+         ACATTTCGC
+         * ***** *
+         AAATTTCCC
+         1
+? (y/n) (y) Keep picture
+Default String=AAATTTCCC
+? String=
+STRING=AAATTTCCC
+? (y/n) (y) This sense n
+STRING=GGGAAATTT
+Minimum score=     0 Maximum score=   324
+? Score (0-324) (222) = 200
+
+ Missing graphics display here
+
+For score   200 the number of matches=     7
+Scores       216    216    216    216    216    216    216
+Positions    269    270    271    288    354    624    853
+? Display (0-7) (0) =3
+ 
+       269
+         GAGGGATTT
+         * *  ****
+         GGGAAATTT
+         1
+ 
+       270
+         AGGGATTTT
+          ** * ***
+         GGGAAATTT
+         1
+ 
+       271
+         GGGATTTTC
+         ****  **
+         GGGAAATTT
+         1
+? (y/n) (y) Keep picture !
+ 
+.end lit
+.left margin1
+@20. TX 7 @ Search for a motif using a weight matrix
+.LEFT MARGIN2
+.para
+This function performs searches for short sequence
+motifs using an appropriate  weight matrix. In addition it can be used to 
+create or modify weight matrices. In order to perform a search the only 
+input 
+required is the name of the file containing the weight matrix.
+The results can be presented graphically or listed. The graphical 
+presentation will draw line at the position of any matches found; the 
+height of the line is proportional to the score.
+.para
+For a search, select "use weight matrix", supply the name of the file 
+containing the weight matrix, and choose between having results plotted 
+or listed. If dialogue is requested when the function is selected users can 
+alter the cutoff score employed.
+.para
+To create a weight matrix several steps are involved. A file containing an 
+alignment of known motifs is required. (This file must be created before 
+the current option is selected. The format is a follows: each sequence is 
+written on a separate line with at least one space at the beginning; each 
+sequence is terminated by a space character, and can be followed by a 
+name. The sequences must be aligned.) Supply the name of the file of 
+aligned sequences. The program reads and displays the sequences. Choose 
+between "summing logs of weights" or summing weights (i.e. whether to 
+multiply or add weights). If logs are used all scores will be negative. 
+Choose if all positions in the set of aligned sequences should be used or 
+if a mask should be applied. If so selected, define a mask as a string of 
+symbols, in which symbol - means ignore and any other symbol means 
+use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+.para
+The program will calculate weights as the frequencies of each base at 
+each unmasked position in the set of aligned sequences. These weights 
+are then applied to the set of aligned sequences to give a range  of 
+"observed" scores. The mean and standard deviation of these scores is 
+displayed. The user is asked to supply several values to be used when the 
+weight matrix is applied to other sequences: a cutoff score (by default, 
+the mean minus 3 standard deviations), a top score for scaling graphical 
+results (by default, the mean plus 3 standard deviations), and a position 
+to identify (this means that if a particular base within the motif is used 
+as a "landmark", such as the A of the AG in splice acceptor sites, then its 
+position will be marked in plots). All these values are stored along with 
+the weight matrix. Finally supply the name of a file to contain the weight 
+matrix.
+.para
+Weight matrices can be "rescaled" using a set of aligned sequences in 
+much the same ways as a matrix is created. The purpose is to redefine 
+the cutoff scores, and rescaling does not alter any other values in the 
+weight matrix file.
+.para
+The methods have changed considerably but were first outlined in
+Staden, R. Nucl. Acid Res. 12 505-519 1984, and
+Staden, R. Genetic 
+engineering: principles and methods vol 7, Edited by J.K. Setlow and A. 
+Hollaender, Plenum publishing corp., 1985.
+.para
+ The methods have always had to deal with the problem of zeroes in the 
+matrices. The current versions 
+employ "Laplaces Law of Succession" in which 1 is 
+added to each term.
+.para
+It is now possible to apply a mask to a set of aligned sequences in 
+order to give weight to selected positions only.
+Sequences have superimposed functions: some parts may be of general 
+structural 
+importance and give rise to an overall framework, and other parts give 
+specificity and hence are not common; we may want to use a set of 
+aligned 
+sequences to define a motif, but want to use only the framework 
+positions.
+ Alternatively we may want to pick out 
+only those parts of a set of aligned sequences that give a particular 
+property, and to ignore other similarities that are due to some other 
+property
+and which could obscure the pattern 
+we are interested in. The ability to define a mask allows certain 
+positions 
+to be used in the motif and others to be ignored, and yet still permits the 
+use of a set of aligned sequences to calculate weights. 
+.para
+Typical dialogue is shown below.
+.lit
+
+? Menu or option number=20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =2
+? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+ 
+     1 AGCGTGACTCTTCCCGGAA HIS1
+     2 GAGGTGACTCACTTGGAAG HIS1
+     3 CGGATGACTCTTTTTTTTT HIS3
+     4 ACAGTGACTCACGTTTTTT HIS4
+     5 GTCGTGACTCATATGCTTT ARG3
+     6 TGAATGACTCACTTTTTGG ARG4
+     7 TTCTTGACTCGTCTTTTCT CPA1
+     8 CGAATGACTCTTATTGATG CPA2
+     9 AGAATGACTAATTTTACTA TRP5
+    10 TCGTTGACTCATTCTAATC TRP3
+    11 TTGCTGACTCATTACGATT TRP2
+    12 GAGATGACTCTTTTTCTTT IV1
+    13 GCGATGATTCATTTCTCTG IV2
+    14 TAGATGACTCAGTTTAGTC LEU1
+    15 TAAGTGACTCAGTTCTTTC LEU4
+    16 ATGATGACTCTTAAGCATG ILS1
+Length of motif    19
+? (y/n) (y) Sum logs of weights
+ 
+? (y/n) (y) Use all motif positions n
+x means use, - means ignore
+e.g. xx-x---x-x means use positions 1,2,4,8,10
+? Mask=----XXXXXXXX
+ Applying weights to input sequences
+   1      -27.979 AGCGTGACTCTTCCCGGAA
+   2      -24.543 GAGGTGACTCACTTGGAAG
+   3      -20.890 CGGATGACTCTTTTTTTTT
+   4      -23.087 ACAGTGACTCACGTTTTTT
+   5      -22.771 GTCGTGACTCATATGCTTT
+   6      -23.408 TGAATGACTCACTTTTTGG
+   7      -25.159 TTCTTGACTCGTCTTTTCT
+   8      -22.679 CGAATGACTCTTATTGATG
+   9      -24.751 AGAATGACTAATTTTACTA
+  10      -23.157 TCGTTGACTCATTCTAATC
+  11      -23.067 TTGCTGACTCATTACGATT
+  12      -21.449 GAGATGACTCTTTTTCTTT
+  13      -24.191 GCGATGATTCATTTCTCTG
+  14      -23.770 TAGATGACTCAGTTTAGTC
+  15      -22.923 TAAGTGACTCAGTTCTTTC
+  16      -25.285 ATGATGACTCTTAAGCATG
+Top score     -20.890  Bottom score     -27.979
+Mean     -23.694  Standard deviation       1.613
+Mean minus 3.sd     -28.534  Mean plus 3.sd     -18.854
+? Cutoff score (-999.00-9999.00) (-28.53) =
+? Top score for scaling plots (-28.53-999.00) (-18.85) =
+? Position to identify (0-19) (1) =
+? Title=GCN4 SEQUENCES
+? Name for new weight matrix file=1.WTS
+ 
+ 
+? Menu or option number=20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =3
+? Name of existing weight matrix file=1.WTS
+ GCN4 SEQUENCES
+? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+Length of motif    19
+? (y/n) (y) Sum logs of weights n
+? (y/n) (y) Use all motif positions
+ 
+ Applying weights to input sequences
+   1      128.000 AGCGTGACTCTTCCCGGAA
+   2      148.000 GAGGTGACTCACTTGGAAG
+   3      172.000 CGGATGACTCTTTTTTTTT
+   4      160.000 ACAGTGACTCACGTTTTTT
+   5      161.000 GTCGTGACTCATATGCTTT
+   6      157.000 TGAATGACTCACTTTTTGG
+   7      149.000 TTCTTGACTCGTCTTTTCT
+   8      160.000 CGAATGACTCTTATTGATG
+   9      151.000 AGAATGACTAATTTTACTA
+  10      159.000 TCGTTGACTCATTCTAATC
+  11      158.000 TTGCTGACTCATTACGATT
+  12      169.000 GAGATGACTCTTTTTCTTT
+  13      152.000 GCGATGATTCATTTCTCTG
+  14      157.000 TAGATGACTCAGTTTAGTC
+  15      160.000 TAAGTGACTCAGTTCTTTC
+  16      143.000 ATGATGACTCTTAAGCATG
+Top score     172.000  Bottom score     128.000
+Mean     155.250  Standard deviation      10.034
+Mean minus 3.sd     125.147  Mean plus 3.sd     185.353
+? Cutoff score (-999.00-9999.00) (125.15) =
+? Top score for scaling plots (125.15-999.00) (185.35) =
+? Position to identify (0-19) (1) =
+? Title=GCN4 SEQUENCES
+? Name for new weight matrix file=2.WTS
+ 
+ 
+? Menu or option number=20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =
+? Motif weight matrix file=1.WTS
+ GCN4 SEQUENCES
+? (y/n) (y) Plot results n
+ 
+    153    -22.61 GCAGCGACTGATTTGAGTT
+    169    -28.53 GTTCTGACCACTCAGATCC
+    172    -27.27 CTGACCACTCAGATCCGGC
+    219    -27.35 CCAGTGGCTGGCCTGCTAG
+    268    -27.82 CGAGGGATTTTCGATCTTG
+    274    -26.99 ATTTTCGATCTTGTGGATG
+    283    -25.79 CTTGTGGATGATTTTCACG
+    287    -27.50 TGGATGATTTTCACGTGCG
+    298    -28.17 CACGTGCGCCGTCATATTG
+    332    -28.27 TCTTTGAAGCAGAAGGGAC
+    351    -28.27 AGGGGTACACTTTCACATT
+    357    -25.05 ACACTTTCACATTTCGCTT
+    364    -28.51 CACATTTCGCTTATGGGAG
+    400    -23.77 GAAGTTACTAATGTGCGTG
+    451    -26.22 ATGCTCGCCCTCTTTGGTG
+    476    -28.00 TCCCTCACTGAGCCCTCCG
+    480    -28.33 TCACTGAGCCCTCCGCCTC
+    517    -23.46 GCTAAGATTCAGCTTGGTT
+    556    -27.27 TCCAGCACTCAGGTTCGGC
+    602    -27.01 AACTTGAATCCATCGTTGC
+    648    -28.45 TGCTAAACACAGCCGGTTT
+    679    -28.18 CTGTTTGCCCAGTTTGGGC
+    691    -28.51 TTTGGGCCGCTTCTGGACG
+    713    -27.67 GGCTTGACCGTGGCTGTGG
+    803    -25.47 ATGCTGACCATGCTTTTCA
+    848    -28.11 ATAATGTTAAGTTTGATTC
+    857    -25.97 AGTTTGATTCCGCTGGCCG
+    879    -27.85 CCGCTGCTGCTGTTTCCAC
+    917    -27.77 GCGATGAGGAAGGCTTGTT
+    931    -27.81 TTGTTGGCGCGCCTGCTCG
+    952    -23.52 GAGGTGACTACCATCCGTG
+    977    -28.40 TGCGTGGGTGAGCTGTTGT
+ 
+ 
+ 
+ 
+? Menu or option number=6
+Page through text files
+? Name of file to read=1.WTS
+ GCN4 SEQUENCES
+     19     1   -28.534   -18.854
+ P   0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15  16  17  18
+ N  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  
+16
+ T   0   0   0   0  16   0   0   1  16   0   5  11  10  12   9   6   7  12   6
+ C   0   0   0   0   0   0   0  15   0  15   0   3   2   2   4   3   2   1   3
+ A   0   0   0   0   0   0  16   0   0   1  10   0   3   2   0   3   5   2   2
+ G   0   0   0   0   0  16   0   0   0   0   1   2   1   0   3   4   2   1   5
+End of file
+
+.end lit
+
+
+.left margin1
+@21. TX 3 @ Count base composition
+.LEFT MARGIN2
+.para
+This routine 
+calculates the base composition of the 
+active region of the sequence as both totals and percentages.
+.left margin1
+@22. TX 3 @ Count dinucleotide frequencies
+.LEFT MARGIN2
+.para
+This routine simply counts dinucleotide frequencies for the currently 
+active region of the sequence. It also calculates an expected distribution 
+based on the base composition.
+The output looks like:
+.LIT
+              T             C             A             G
+        obs  expected obs  expected obs  expected obs  expected 
+
+     T   8.44   8.25   6.67   7.01  10.35   9.92   3.27   3.54
+     C   7.49   7.01   6.76   5.95   8.39   8.43   1.76   3.01
+     A  10.13   9.92   7.78   8.43  11.74  11.93   4.89   4.26
+     G   2.67   3.54   3.19   3.01   4.06   4.26   2.42   1.52
+
+.END LIT
+.left margin1
+@23. TX 3 5 @ Count codons and amino acids
+.LEFT MARGIN2
+.para
+This function
+counts codons, amino acid composition, protein molecular weights, and 
+base 
+composition. Users select the segments of the sequence that the program 
+should analyse.
+.para
+Choose between being shown observed counts or counts normalised so 
+that the totals for each amino acid sum to 100. Select to define 
+segments using either the keyboard or an EMBL feature table.
+Define the segments to count over. Select strand for each segment. Stop 
+selecting segments by typing a zero for "Count from ()". The results are 
+displayed a screenful at a time, and the bell is sounded to show there is 
+more to come. A zero start position,  or the end of an EMBL feature table, 
+signals 
+the routine to print out totals for all values.
+
+.para
+The counts are broken down into several figures.
+ Base 
+composition by position in codon expressed as a percentage of each bases 
+own frequency;  base composition by position in codon expressed as a 
+percentage of the overall base composition of the section; base 
+composition 
+expected for this amino acid composition if there was no codon 
+preference; 
+percentage deviations of the observed amino acid composition from an 
+average amino acid composition. 
+.para
+The output looks like:
+.LIT
+
+      ===========================================
+      F TTT   1. S TCT   2. Y TAT   2. C TGT   1.
+      F TTC   1. S TCC   1. Y TAC   3. C TGC   2.
+      L TTA   7. S TCA   4. * TAA   9. * TGA   1.
+      L TTG   2. S TCG   1. * TAG   2. W TGG   2.
+      ===========================================
+      L CTT   3. P CCT   2. H CAT   4. R CGT   1.
+      L CTC   2. P CCC   3. H CAC   1. R CGC   0.
+      L CTA   3. P CCA   2. Q CAA   4. R CGA   0.
+      L CTG   2. P CCG   2. Q CAG   1. R CGG   2.
+      ===========================================
+      I ATT   9. T ACT   1. N AAT   7. S AGT   3.
+      I ATC   2. T ACC   2. N AAC   4. S AGC   2.
+      I ATA   4. T ACA   5. K AAA  13. R AGA   5.
+      M ATG   1. T ACG   2. K AAG   4. R AGG   1.
+      ===========================================
+      V GTT   2. A GCT   2. D GAT   1. G GGT   3.
+      V GTC   2. A GCC   2. D GAC   1. G GGC   1.
+      V GTA   4. A GCA   3. E GAA   2. G GGA   1.
+      V GTG   2. A GCG   0. E GAG   1. G GGG   1.
+      ===========================================
+  total codons=      166.
+          T          C          A          G
+
+  1     31.06      33.68      34.03      35.00
+  2     35.61      35.79      30.89      32.50
+  3     33.33      30.53      35.08      32.50
+
+  1     24.70      19.28      39.16      16.87
+  2     28.31      20.48      35.54      15.66
+  3     26.51      17.47      40.36      15.66
+  %     26.51      19.08      38.35      16.06  observed, overall totals
+  %     25.00      22.26      33.10      19.65  expected, even codons per acid
+
+          A    C    D    E    F    G    H    I    K    L
+          7.   3.   2.   3.   2.   6.   5.  15.  17.  19.
+ o-e %  -47. -33. -76. -68. -64. -54.  62. 116.  67.  67.
+
+          M    N    P    Q    R    S    T    V    W    Y
+          1.  11.   9.   5.   9.  13.  10.  10.   2.   5.
+ o-e %  -62.  66.  12. -17.  19.  21.   6.  -2.   0.  -5.
+ total acids=  154. molecular weight=    17421.
+
+ Typical dialogue follows.
+
+? Menu or option number=23
+ Calculate codon usage, base composition
+ and amino acid composition
+? (y/n) (y) Show observed counts
+? (y/n) (y) Define segments using keyboard
+? Count from (0-1023) (0) =1
+? Count to (1-1023) (1023) =1000
+? (y/n) (y) + strand
+ 
+     ===========================================
+     F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+     F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+     L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+     L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+     ===========================================
+     L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+     L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+     L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+     L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+     ===========================================
+     I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+     I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+     I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+     M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+     ===========================================
+     V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+     V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+     V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+     V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+     ===========================================
+ 
+ 
+ Total codons=      333.
+         T          C          A          G
+ 
+ 1     23.32      37.69      28.99      40.06
+ 2     37.15      22.31      38.46      36.59
+ 3     39.53      40.00      32.54      23.34
+       -----      -----      -----      -----
+ =     100%       100%       100%       100%
+ 
+ 1     17.72      29.43      14.71      38.14  = 100%
+ 2     28.23      17.42      19.52      34.83  = 100%
+ 3     30.03      31.23      16.52      22.22  = 100%
+ %     25.33      26.03      16.92      31.73  Observed, overall totals
+ %     24.44      22.31      20.90      32.35  Expected, even codons per acid
+ 
+         A    C    D    E    F    G    H    I    K    L
+        33.  10.  15.   9.  17.  40.  10.  12.   5.  33.
+O-E %   22.  81. -13. -55.  34.  71.  40. -29. -73.  13.
+ 
+         M    N    P    Q    R    S    T    V    W    Y
+         2.   7.   2.  13.  49.  20.  11.  30.   5.   2.
+O-E %  -74. -51. -88.   0. 165. -11. -42.  40.  18. -81.
+Total acids=  325. Molecular weight=    35831. Hydrophobicity= -17.8
+ 
+ 
+? Count from (0-1023) (0) =
+ 
+    Codon totals over all genes
+     ===========================================
+     F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+     F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+     L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+     L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+     ===========================================
+     L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+     L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+     L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+     L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+     ===========================================
+     I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+     I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+     I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+     M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+     ===========================================
+     V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+     V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+     V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+     V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+     ===========================================
+ 
+ 
+ Total codons=      333.
+         T          C          A          G
+ 
+ 1     23.32      37.69      28.99      40.06
+ 2     37.15      22.31      38.46      36.59
+ 3     39.53      40.00      32.54      23.34
+       -----      -----      -----      -----
+ =     100%       100%       100%       100%
+ 
+ 1     17.72      29.43      14.71      38.14  = 100%
+ 2     28.23      17.42      19.52      34.83  = 100%
+ 3     30.03      31.23      16.52      22.22  = 100%
+ %     25.33      26.03      16.92      31.73  Observed, overall totals
+ %     24.44      22.31      20.90      32.35  Expected, even codons per acid
+ 
+         A    C    D    E    F    G    H    I    K    L
+        33.  10.  15.   9.  17.  40.  10.  12.   5.  33.
+O-E %   22.  81. -13. -55.  34.  71.  40. -29. -73.  13.
+ 
+         M    N    P    Q    R    S    T    V    W    Y
+         2.   7.   2.  13.  49.  20.  11.  30.   5.   2.
+O-E %  -74. -51. -88.   0. 165. -11. -42.  40.  18. -81.
+Total acids=  325. Molecular weight=    35831. Hydrophobicity= -17.8
+ 
+.END LIT
+.LEFT MARGIN1
+@24. TX 3 @ Plot base composition
+.LEFT MARGIN2
+.para
+This option plots the base composition of the sequence. The counts for 
+any combination of bases can be plotted.
+.para
+If dialogue is requested the user is presented with a check box for 
+selecting which bases should be counted, and then allowed to define a 
+window length, and a "plot interval". Otherwise, the AT composition is 
+plotted with a window of 101 and a plot interval of 5.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d24
+ Plot base composition
+ 
+checkbox: those set are marked X
+X 1 T
+  2 C
+X 3 A
+  4 G
+? 0,1,2,3,4 =1
+ 
+checkbox: those set are marked X
+  1 T
+  2 C
+X 3 A
+  4 G
+? 0,1,2,3,4 =3
+ 
+checkbox: those set are marked X
+  1 T
+  2 C
+  3 A
+  4 G
+? 0,1,2,3,4 =2
+ 
+checkbox: those set are marked X
+  1 T
+X 2 C
+  3 A
+  4 G
+? 0,1,2,3,4 =4
+ 
+checkbox: those set are marked X
+  1 T
+X 2 C
+  3 A
+X 4 G
+? 0,1,2,3,4 =
+ 
+? odd span length (1-201) (31) =
+? plot interval (1-11) (5) =
+
+ missing graphics
+
+
+
+.end lit
+.left margIN1
+@25. TX 3 @ Plot local deviations in base composition
+.LEFT MARGIN2
+.para
+The "local deviation" routines are designed to indicate the  similarity of 
+the compositions of different parts of the sequence. The composition of 
+every segment of the sequence is compared with  a standard composition. 
+The levels of similarity are plotted as a chi squared values. The standard 
+can be the composition of the whole sequence, or alternatively that of a 
+small segment defined by the user.
+.para
+If dialogue is forced define the standard region, the window length and 
+the plot interval. Otherwise the composition of the whole sequence is 
+taken as a standard. The maximum and minimum observed value of the chi 
+squared calculation is displayed, and plots will always exactly fill the 
+available box. Any unusual regions will show as peaks.
+.para
+The following measure is used: for each window position
+calculate (sum((obs-exp)*(obs-exp))/(exp*exp)) 
+where obs is the observed composition 
+and exp is the expected composition (the composition of the standard).
+ The calculation is performed once to find out the range of values and is
+then repeated and 
+plotted so that the plot exactly fills the allocated screen space.
+.left margIN1
+@26. TX 3 @ Plot local deviations from dinucleotide composition
+.LEFT MARGIN2
+.para
+The "local deviation" routines are designed to indicate the  similarity of 
+the compositions of different parts of the sequence. The dinucleotide 
+composition of every segment of the sequence is compared with  a 
+standard composition. The levels of similarity are plotted as a chi 
+squared values. The standard can be the composition of the whole 
+sequence, or alternatively that of a small segment defined by the user.
+.para
+If dialogue is forced define the standard region, the window length and 
+the plot interval. Otherwise the composition of the whole sequence is 
+taken as a standard. The maximum and minimum observed value of the chi 
+squared calculation is displayed, and plots will always exactly fill the 
+available box. Any unusual regions will show as peaks.
+.para
+The following measure is used: for each window position
+calculate (sum((obs-exp)*(obs-exp))/(exp*exp)) 
+where obs is the observed composition 
+and exp is the expected composition (the composition of the standard).
+ The calculation is performed once to find out the range of values and is
+then repeated and 
+plotted so that the plot exactly fills the allocated screen space.
+.left margin1
+@27. TX 3 @ Plot local deviations from trinucleotide composition
+.LEFT MARGIN2
+.para
+The "local deviation" routines are designed to indicate the  similarity of 
+the compositions of different parts of the sequence. The trinucleotide 
+composition of every segment of the sequence is compared with  a 
+standard composition. The levels of similarity are plotted as a chi 
+squared values. The standard can be the composition of the whole 
+sequence, or alternatively that of a small segment defined by the user.
+.para
+If dialogue is forced define the standard region, the window length and 
+the plot interval. Otherwise the composition of the whole sequence is 
+taken as a standard. The maximum and minimum observed value of the chi 
+squared calculation is displayed, and plots will always exactly fill the 
+available box. Any unusual regions will show as peaks.
+.para
+The following measure is used: for each window position
+calculate (sum((obs-exp)*(obs-exp))/(exp*exp)) 
+where obs is the observed composition 
+and exp is the expected composition (the composition of the standard).
+ The calculation is performed once to find out the range of values and is
+then repeated and 
+plotted so that the plot exactly fills the allocated screen space.
+.left margin1
+@28. TX 5 @ Calculate codon constraint
+.left margin2
+.para
+The purpose of this option (which is somewhat specialised) is to measure 
+the level of constraint imposed on the sequence by coding for a protein of 
+the observed composition. It measures the strength of the codon bias 
+averaged over windows of 99 codons and displays the values observed.
+.para
+Select between defining segments at the keyboard or using an EMBL 
+feature table. Finish selecting segments by typing a zero start. The value 
+for each segment is displayed:
+.para
+ Mean (W-EW) / EWD, window 99      10.5
+.para
+The codon constraint is the 
+difference between the observed codon improbability and the mean 
+improbabilty for 
+a sequence of the same composition.   See McLachlan, Staden and Boswell 
+Nucl. Acid Res. 1984
+
+.left margin1
+@59. TX 3 @ Plot negentropy
+.LEFT MARGIN2
+.para
+This routine is designed to show regions of the sequence that differ in 
+composition from others, and hence is like the "plot deviation.." routines.
+.para
+Negentropy or information is defined in the following way: let Pi be the 
+probability of observing base i, where i = A,C,G or T, then the average 
+information per base is 
+I=-sum(Pi.Log(Pi))   (sum over all i). This routine calculates Pi by 
+calculating the overall composition for the sequence and then plots I for 
+windows of length defined by the user. 
+.left margin1
+@30. TX 4 @ Search for hairpin loops
+.LEFT MARGIN2
+.para
+Used to find simple inverted repeats or potential hairpin loops
+ The loops are defined by a range of sizes for 
+the loop and a minimum number of consecutive base pairs in the stem. 
+The results can be presented graphically or listed. 
+A-T, G-C and G-T basepairs are counted. 
+.para
+Define the range of loop sizes and the minimum number of consecutive 
+basepairs required. Choose between plotted or listed results.
+.para
+The loops found are plotted as blips on a 
+horizontal line that represents the sequence, the heights of the lines are
+proportional to the number of basepairs in the stems. Note that only 
+uninterrupted stems are found - i.e. all basepairs must be made. To look 
+for stems with some unpaired bases (or for palindromes) use the inverted 
+repeat motif class in the pattern searching option.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=30
+ Search for hairpin loops
+Define the range of loop sizes
+? Minimum loop size (1-30) (1) =
+? Maximum loop size (3-60) (3) =
+? Minimum number of basepairs (2-20) (6) =
+? (y/n) (y) Plot results n
+ Searching
+ 
+          T.G
+          G-C
+          G.T
+          T.G
+          C-G
+          G-C
+          T.G
+          C-G
+          G.T
+     GCCGCA GCGGAGG
+         49
+ 
+           G
+          G-C
+          T.G
+          C-G
+          G.T
+          T.G
+          G-C
+     CTGCTG GGAGGTC
+         56
+ 
+ 
+           G
+          T.G
+          G-C
+          G.T
+          T.G
+          C-G
+          G-C
+          T-A
+          T.G
+     AGCGCA CGACTGA
+        139
+ 
+          A C
+          G.T
+          C-G
+          G.T
+          C-G
+          C-G
+          G-C
+     TTCGCT CAACGCC
+        244
+ 
+.end lit
+.LEFT MARGIN1
+@31. TX 4 @ Search for long range inverted repeats
+.LEFT MARGIN2
+.para
+Searches for inverted repeats. The repeats found are exact matches of at 
+least 6 consecutive bases. Results can be presented graphically or listed.
+Plotted results show the end points of repeats joined by rectangular 
+lines.
+.para
+If dialogue is not requested the defaults will be taken. Otherwise choose 
+between plotted or listed results. If required select to analyse a 
+restricted segment of the currently active region. Choose a repeat length.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=D31
+ Plot long-range inverted repeats
+? (y/n) (y) Plot results n
+Define restricted region
+? start (1-1023) (1) =
+? end (2-1023) (1023) =
+? Minimum inverted repeat (6-30) (12) =10
+ Searching
+    27     909      10  TGCCCAGAGA
+ 
+.end lit
+.LEFT MARGIN1
+@32. TX 4 @ Search for repeats
+.LEFT MARGIN2
+.para
+Searches for direct repeats. The repeats found are exact matches of at 
+least 6 consecutive bases. Results can be presented graphically or listed.
+Plotted results show the end points of repeats joined by rectangular 
+lines.
+.para
+If dialogue is not requested the defaults will be taken. Otherwise choose 
+between plotted or listed results. If required select to analyse a 
+restricted segment of the currently active region. Choose a repeat length.
+.para
+Typical dialogue follows.
+
+.lit
+ ? Menu or option number=D32
+ Plot repeats
+? (y/n) (y) Plot results n
+Define restricted region
+? start (1-1023) (1) =
+? end (2-1023) (1023) =
+? Minimum repeat (6-30) (12) =8
+ Searching
+   619     988       8  GCTGTTGT
+   514     646       8  GCTGCTAA
+    94     865       8  TCCGCTGG
+   146     222       9  GTGGCTGGC
+   455     497       8  TCGCCCTC
+   454     496       9  CTCGCCCTC
+   872     875       8  GCCGCCGC
+   510     615       8  CGTTGCTG
+   152     913       8  GGCAGCGA
+   199     265       8  CGTCGAGG
+   689     794       8  AGTTTGGG
+   147     223       8  TGGCTGGC
+   101     116       8  GACGAGGA
+     8     690       8  GTTTGGGC
+    52     141       8  TGCTGGTG
+ 
+.end lit
+.left margin1
+@33. TX 4 @ Search for z dna (total ry, yr)
+.LEFT MARGIN2
+.para
+Searches for segments of the sequence that might form Z DNA. A window 
+length is chosen and the number of RY and YR dinucleotides within each 
+window is plotted. The top of the box corresponds to all RY or YR, the 
+bottom to zero RY or YR.
+.para
+If dialogue is requested, select a window length and plot interval. 
+Otherwise the defaults will be used.
+.para
+The program contains three 
+separate ways of doing this (options 33,34,35). 
+.left margin1
+@34. TX 4 @ Search for z dna (runs of ry, yr)
+.LEFT MARGIN2
+.para
+Searches for segments of the sequence that might form Z DNA. Results 
+are plotted.
+.para
+If dialogue is requested define a window length and plot interval. 
+Otherwise the defaults will be used.
+ The routine 
+counts the number of R in positions 1,3,5 etc =R1, the number of Y in 
+positions 2,4,6 etc =Y1, the number of Y in positions 1,3,5 etc =Y2 and 
+the 
+number of R in positions 2,4,6 etc =R2 for a window length. It plots the 
+maximum of R1+Y1 and R2+Y2 relative to a minimum of (window 
+length)/2 and a 
+maximum of (window length). (see 33,35). 
+.LEFT MARGIN1
+@35. TX 4 @ Search for z dna (best phased value)
+.LEFT MARGIN2
+.para
+Searches for segments of the sequence that might form Z DNA. Results 
+are plotted.
+.para
+If dialogue is requested define a window length and a plot interval. 
+Ohterwise the defaults values will be used.
+.para
+ The routine 
+counts the number of consecutive RY or YR dinucleotides in phase. It 
+moves 
+through the sequence counting the number of RY or YR dinucleotides; when 
+the next dinucleotide is not of the correct type the score is set back to 
+zero and the search restarted using the current base to set the phase. The 
+plots are done relative to a minimum of zero and a maximum defined by 
+the 
+user. (See 33,34).
+.LEFT MARGIN1
+@36. TX 4 @ Local similarity or complementarity search
+.LEFT MARGIN2
+.PARA
+This function is designed to find segments of 
+local similarity or complementarity. It is therefore like performing 
+a DIAGON 
+plot that is
+restricted to regions near the main diagonal.  Results can be presented 
+graphically or listed.
+.para
+Users define 
+a region to search through,
+a span length, a range for searching through and a cut-off score. The 
+program takes all sections of sequence 
+of length span within the defined region
+ and compares them to 
+all other sequences within the region and
+range specified. 
+If a match above the cutoff is found we 
+need to show the position 
+of the two sections of sequence and the score, and we do it in the 
+following way.
+If we have a 70%
+match between 
+a sequence that starts at p1 and a sequence that starts at p2
+the program draws a 
+diagonal line that starts at p1 with height 70% of the box and which 
+finishes at p2 with 
+height 0.
+The matches can also be listed. 
+.para
+Here I define the terms range, region, and span and what is compared.
+Suppose we have a defined region j1 to j2, a range of i1 to i2 and a span 
+of 
+s; the program will take, in turn, all sections of sequence of length s
+within j1 and j2 and compare them to all sequences that start a distance 
+i1+s-1
+to i2+s-1 away from them. First it will take the sequence of length s 
+starting 
+at j1 and compare it 
+with the sequence of length s starting at
+j1+s-1+i1, then j1+s-1+i1+1, etc up to j1+s-1+i2; then it will take the 
+sequence of length s starting at j1+1 and compare it with the sequence 
+starting at j1+s-1+1+i1 etc. This continues until we hit
+ the right hand end of the 
+sequence as defined by j2. Note 1)that sequences are not compared with 
+themselves: the nearest sequence compared to a span s starting at j 
+starts 
+at j+s; 2) ranges i1 and i2 are ranges of start positions; 3) by choosing a 
+range greater than the length of the sequence this routine will do a full 
+DIAGON analysis except for those points within a distance span of
+ the main diagonal (see note 1).
+.para
+Typical dialog follows.
+.lit
+
+? Menu or option number=36
+ Search for local similarity or complementarity
+? (y/n) (y) Find direct repeats
+? (y/n) (y) Keep picture n
+? Span (5-200) (15) =
+Define restricted region
+? start (0-1023) (1) =
+? end (2-1023) (1023) =
+? Percent match (1.00-100.00) (70.00) =
+? Range start (1-50) (1) =
+? Range end (1-50) (1) =5
+? (y/n) (y) Plot results n
+ Working
+ 
+ 
+       118        128
+         CGAGGAGGAG GTGGA
+          ** *****  ** **
+         GGACGAGGAC GTCGA
+       100        110
+ 
+ 
+       119        129
+         GAGGAGGAGG TGGAT
+         ** ***** * * **
+         GACGAGGACG TCGAC
+       101        111
+? (y/n) (y) Find direct repeats n
+? (y/n) (y) Keep picture
+? Span (5-200) (15) =
+Define restricted region
+? start (0-1023) (1) =
+? end (2-1023) (1023) =
+? Percent match (1.00-100.00) (70.00) =
+? Range start (1-50) (1) =
+? Range end (1-50) (5) =8
+? (y/n) (y) List results
+ 
+ Working
+ 
+ 
+       178        188
+         ACTCAGATCC GGCGG
+         ***** ***  * **
+         ACTCAAATCA GTCGC
+       156        166
+ 
+ 
+       177        187
+         CACTCAGATC CGGCG
+          ***** ***  * **
+         AACTCAAATC AGTCG
+       157        167
+? (y/n) (y) Find inverted repeats !
+.end lit
+
+.left margin1
+@37. TX 5 @ Set genetic code
+.LEFT MARGIN2
+.para
+This function allows the user to change the current active genetic code 
+for 
+all the options. The user may select: the standard code, the mammalian 
+mitochondrial code, the yeast mitochondrial code or a personal code 
+(define 
+your own). 
+.para
+Select code. If personal, define a codon and select an amino acid. When all 
+codons have been reset define a blank codon.
+.para
+The code differences are:
+.lit
+          Mammalian        Yeast
+  Codon  Mitochondrial  Mitochondrial  Standard
+   UGA       W              W            STOP
+   AUA       M              M             I
+   CUA       L              T             L
+   AGA      STOP            R             R
+   AGG      STOP            R             R
+.END LIT
+.para
+Typical dialogue follows.
+
+.lit
+? Menu or option number=37
+X 1 Standard code
+  2 Mammalian mitochondrial code
+  3 Yeast mitochondrial code
+  4 Personal code
+? 0,1,2,3,4 =2
+ 
+? Menu or option number=37
+X 1 Standard code
+  2 Mammalian mitochondrial code
+  3 Yeast mitochondrial code
+  4 Personal code
+? 0,1,2,3,4 =4
+Define genetic code by typing a codon
+followed by a 1 letter amino acid symbol
+? Codon=TTT
+Default Amino acid symbol=F
+? Amino acid symbol=W
+? Codon=
+.end lit
+
+.left margin1
+@38. T 3 4 @ Examine repeats
+.left margin2
+.para
+This function can be used to examine the frequencies of repeated words
+within a sequence. It finds all words that occur more than once. The
+user selects a minimum word length and the program finds all words of that
+length that occur more than once; then it "follows" each repeated word until it
+becomes unique. For each word length it can report the number of different
+repeated words, the number of occurrences of each word, and their actual
+positions and sequences. 
+.para
+It is possible that the algorithm may run out of memory, paticularly if a short
+mimimum word length is chosen, or if the sequence is very long or very 
+repetitive. If this occurs the longest reported word length will not
+necessarily be the longest in the sequence: the memory will have been consumed
+before the longest word is found.
+.lit
+Typical dialogue and output is shown below.
+
+ Expected length of longest repeat    14
+ ? Minumim word length (1-6) (6) =6
+ Working
+ ? Show repeat frequencies for words of at least length (6-15) (15) =10
+ For length    10 the number of different repeated words is  2035
+ For length    11 the number of different repeated words is   613
+ For length    12 the number of different repeated words is   161
+ For length    13 the number of different repeated words is    37
+ For length    14 the number of different repeated words is    10
+ For length    15 the number of different repeated words is     1
+ ? Show repeats for words of length (6-15) (15) =14
+ ? Show repeats for words occuring with frequency (2-9999) (2) =2
+
+ ggtgctcatgccca
+ occurs at  21611
+ occurs at  21851
+ ttatccggtgatga
+ occurs at   4604
+ occurs at   8806
+ agcaccacgctgac
+ occurs at   5954
+ occurs at   9486
+ catgacggaggatg
+ occurs at  10480
+ occurs at  19925
+ aaagacgggaaaat
+ occurs at  11820
+ occurs at  43157
+ tacaaaaccaattt
+ occurs at  26797
+ occurs at  31369
+ cgagaaagagtgcg
+ occurs at   4260
+ occurs at  44305
+ gccggatgatggcg
+ occurs at   7893
+ occurs at  16638
+ atgacggaggatga
+ occurs at  10481
+ occurs at  19926
+ gcggcgaacgaggc
+ occurs at  11352
+ occurs at  18718
+ ? Show repeats for words of length (6-15) (15) =!
+
+Example of not enough memory
+----------------------------
+
+ Expected length of longest repeat    14
+ ? Minumim word length (1-6) (6) =1
+ Working
+ Not enough memory
+ Memory used in bytes 1125996. Length of longest repeat     5
+ ? Show repeat frequencies for words of at least length (1-5) (5) =!
+
+.end lit
+.left margin1
+@39. TX 5 @ Translate and list in upto six phases
+.LEFT MARGIN2
+.para
+This is a general listing function that will perform translations and 
+produce several forms of output. The possibilities are:
+.lit
+1) no translation, list one or two strands, two ways of numbering the 
+sequence.
+2) translation, one or two strands, one or three letter codes.
+ Positions defined by:
+  a) open reading frames of some minimum length l, l can be 0, hence giving 
+a complete six phase translation.
+  b) positions typed on keyboard, again 1 to 6 phases, translations appearing 
+above and below the dna.
+  c) positions read from a feature table.
+
+It should be used in preference to option 5. For publication 
+without a translation, the option to number ends of lines is more compact 
+than option 5. Some examples and typical dialogue are given below. Note the 
+requirement for d39.
+
+? Menu or option number=D39
+Find open reading frames, translate and list
+? (y/n) (y) Show translation
+ 
+The segments to translate can be
+   1 Typed on the keyboard
+   2 Read from a feature table
+X  3 Open reading frames
+? 1,2,3 =
+? Minimum open frame in amino acids (0-7238) (30) =
+? (y/n) (y) Use 1 letter codes
+Define section of DNA to display
+? start (1-7238) (1) =
+? end (2-7238) (7238) =300
+? Line length (30-120) (60) =
+Which strands should be shown
+X  1 + strand only
+   2 - strand only
+   3 Both strands
+? 1,2,3 =3
+? (y/n) (y) Number ends of lines
+ 
+ 
+    N  A  T  T  I  S  R  I  D  A  T  F  S  A  R  A  P  N  E  N
+   AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+                                        *  S  A  G  W  I  F  I
+      A  V  V  I  L  L  I  S  A  V  K  E  A  R  A  G  F  S  F
+ 
+    I  A  K  Q  V  I  D  H  L  R  N  V  S  N  G  Q  T  K  S  T
+        L  N  R  L  L  T  I  C  E  M  Y  L  M  V  K  L  N  L  L
+   ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+    Y  S  F  L  N  N  V  M  Q  S  I  Y  R  I  T  L  S  F  R  S
+   I  A  L  C  T  I  S  W  K  R  F  T  D  L  P  *  V  L  D  V
+ 
+    R  S  Q  N  W  E  S  T  V  T  W  N  E  T  S  R  H  R  T  L
+     V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+   CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+       .    :    .    :    .    :    .    :    .    :    .    :
+   GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+    T  R  L  I  P  F
+   R  E  C  F  Q  S  D  V  T  V  H  F  S  V  E  L  C  R  V  K
+ 
+    V  A  Y  L  K  H  V  E  L  Q  H  Q  I  Q  Q  L  S  S  K  P
+   GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+       .    :    .    :    .    :    .    :    .    :    .    :
+   CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+   T  A  Y  K  F  C  T  S  S  C  C  W  I
+ 
+    S  A  K  M  T  S  Y  Q  K  E  Q  L  K  V  L  S  N  P  D  L
+   TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+       .    :    .    :    .    :    .    :    .    :    .    :
+   AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+ 
+ 
+? Menu or option number=D39
+Find open reading frames, translate and list
+? (y/n) (y) Show translation N
+Define section of DNA to display
+? start (1-7238) (1) =
+? end (2-7238) (7238) =300
+? Line length (30-120) (60) =
+Which strands should be shown
+X  1 + strand only
+   2 - strand only
+   3 Both strands
+? 1,2,3 =
+? (y/n) (y) Number ends of lines
+ 
+ 
+   AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+ 
+   ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+ 
+   CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+ 
+   GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+ 
+   TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+ 
+ 
+? Menu or option number=D39
+Find open reading frames, translate and list
+? (y/n) (y) Show translation
+The segments to translate can be
+   1 Typed on the keyboard
+   2 Read from a feature table
+X  3 Open reading frames
+? 1,2,3 =
+? Minimum open frame in amino acids (0-7238) (30) =0
+? (y/n) (y) Use 1 letter codes N
+Define section of DNA to display
+? start (1-7238) (1) =
+? end (2-7238) (7238) =300
+? Line length (30-120) (60) =
+Which strands should be shown
+X  1 + strand only
+   2 - strand only
+   3 Both strands
+? 1,2,3 =3
+? (y/n) (y) Number ends of lines
+ 
+ 
+   AsnAlaThrThrIleSerArgIleAspAlaThrPheSerAlaArgAlaProAsnGluAsn
+    ThrLeuLeuLeuLeuValGluLeuMetProProPheGlnLeuAlaProGlnMetLysIle
+     ArgTyrTyrTyr******Asn***CysHisLeuPheSerSerArgProLys***Lys
+   AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+   ValSerSerSerAsnThrSerAsnIleGlyGlyLys***SerAlaGlyTrpIlePheIle
+    Arg************TyrPheGlnHisTrpArgLysLeuGluArgGlyLeuHisPheTyr
+     AlaValValIleLeuLeuIleSerAlaValLysGluAlaArgAlaGlyPheSerPhe
+ 
+   IleAlaLysGlnValIleAspHisLeuArgAsnValSerAsnGlyGlnThrLysSerThr
+    ***LeuAsnArgLeuLeuThrIleCysGluMetTyrLeuMetValLysLeuAsnLeuLeu
+  TyrSer***ThrGlyTyr***ProPheAlaLysCysIle***TrpSerAsn***IleTyr
+   ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+   TyrSerPheLeuAsnAsnValMetGlnSerIleTyrArgIleThrLeuSerPheArgSer
+    Leu***ValPro***GlnGlyAsnAlaPheHisIle***HisAspPhe***Ile***Glu
+  IleAlaLeuCysThrIleSerTrpLysArgPheThrAspLeuPro***ValLeuAspVal
+ 
+   ArgSerGlnAsnTrpGluSerThrValThrTrpAsnGluThrSerArgHisArgThrLeu
+    ValArgArgIleGlyAsnGlnLeuLeuHisGlyMetLysLeuProAspThrValLeu***
+  SerPheAlaGluLeuGlyIleAsnCysTyrMetGlu***AsnPheGlnThrProTyrPhe
+   CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+       .    :    .    :    .    :    .    :    .    :    .    :
+   GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+   ThrArgLeuIleProPhe***SerAsnCysProIlePheSerGlySerValThrSer***
+    AsnAlaSerAsnProIleLeuGln***MetSerHisPheLysTrpValGlyTyrLysLeu
+  ArgGluCysPheGlnSerAspValThrValHisPheSerValGluLeuCysArgValLys
+ 
+   ValAlaTyrLeuLysHisValGluLeuGlnHisGlnIleGlnGlnLeuSerSerLysPro
+    LeuHisIle***AsnMetLeuSerTyrSerThrArgPheSerAsn***AlaLeuSerHis
+  SerCysIlePheLysThrCys***AlaThrAlaProAspSerAlaIleLysLeu***Ala
+   GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+       .    :    .    :    .    :    .    :    .    :    .    :
+   CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+   AsnCysIle***PheMetAsnLeu***LeuValLeuAsnLeuLeu***AlaArgLeuTrp
+    GlnMetAsnLeuValHisGlnAlaValAlaGlySerGluAlaIleLeuSer***AlaMet
+  ThrAlaTyrLysPheCysThrSerSerCysCysTrpIle***CysAsnLeuGluLeuGly
+ 
+   SerAlaLysMetThrSerTyrGlnLysGluGlnLeuLysValLeuSerAsnProAspLeu
+    ProGlnLys***ProLeuIleLysArgSerAsn***ArgTyrSerLeuIleLeuThrCys
+  IleArgLysAsnAspLeuLeuSerLysGlyAlaIleLysGlyThrLeu***Ser***Pro
+   TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+       .    :    .    :    .    :    .    :    .    :    .    :
+   AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+   GlyCysPheHisGlyArgIleLeuLeuLeuLeu***LeuTyrGluArgIleArgValGln
+    ArgLeuPheSerArgLysAspPheProAlaIleLeuProValArg***AspGlnGlyThr
+  AspAlaPheIleValGlu******PheSerCysAsnPheThrSerGluLeuGlySerArg
+ 
+ 
+? Menu or option number=D39
+Find open reading frames, translate and list
+? (y/n) (y) Show translation
+The segments to translate can be
+   1 Typed on the keyboard
+   2 Read from a feature table
+X  3 Open reading frames
+? 1,2,3 =1
+? (y/n) (y) Use 1 letter codes
+Define section of DNA to display
+? start (1-7238) (1) =
+? end (2-7238) (7238) =300
+? Line length (30-120) (60) =
+Which strands should be shown
+X  1 + strand only
+   2 - strand only
+   3 Both strands
+? 1,2,3 =
+? (y/n) (y) Number ends of lines N
+Translate
+? From (0-300) (0) =101
+? To (1-300) (300) =300
+Translate
+? From (0-300) (0) =102
+? To (1-300) (300) =200
+Translate
+? From (0-300) (0) =
+ 
+ 
+   AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT
+           10        20        30        40        50        60
+ 
+                                            M  V  K  L  N  L  L
+                                             W  S  N  *  I  Y
+   ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT
+           70        80        90       100       110       120
+ 
+     V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+   S  F  A  E  L  G  I  N  C  Y  M  E  *  N  F  Q  T  P  Y  F
+   CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA
+          130       140       150       160       170       180
+ 
+     L  H  I  *  N  M  L  S  Y  S  T  R  F  S  N  *  A  L  S  H
+   S  C  I  F  K  T  C
+   GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA
+          190       200       210       220       230       240
+ 
+     P  Q  K  *  P  L  I  K  R  S  N  *  R  Y  S  L  I  L  T  C
+   TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG
+          250       260       270       280       290       300
+ 
+ 
+? Menu or option number=D39
+Find open reading frames, translate and list
+? (y/n) (y) Show translation
+The segments to translate can be
+   1 Typed on the keyboard
+   2 Read from a feature table
+X  3 Open reading frames
+? 1,2,3 =2
+? Embl feature table file=1.FT
+? (y/n) (y) Use 1 letter codes
+Define section of DNA to display
+? start (1-7238) (1) = 
+? end (2-7238) (7238) =300
+? Line length (30-120) (60) =
+Which strands should be shown
+X  1 + strand only
+   2 - strand only
+   3 Both strands
+? 1,2,3 =3
+? (y/n) (y) Number ends of lines
+ 
+ 
+    N  A  T  T  I  S  R  I  D  A  T  F  S  A  R  A  P  N  E  N
+   AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+                                        *  S  A  G  W  I  F  I
+      A  V  V  I  L  L  I  S  A  V  K  E  A  R  A  G  F  S  F
+ 
+    I  A  K  Q  V  I  D  H  L  R  N  V  S  N  G  Q  T  K  S  T
+        L  N  R  L  L  T  I  C  E  M  Y  L  M  V  K  L  N  L  L
+   ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+       .    :    .    :    .    :    .    :    .    :    .    :
+   TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+    Y  S  F  L  N  N  V  M  Q  S  I  Y  R  I  T  L  S  F  R  S
+   I  A  L  C  T  I  S  W  K  R  F  T  D  L  P  *  V  L  D  V
+ 
+    R  S  Q  N  W  E  S  T  V  T  W  N  E  T  S  R  H  R  T  L
+     V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+   CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+       .    :    .    :    .    :    .    :    .    :    .    :
+   GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+    T  R  L  I  P  F
+   R  E  C  F  Q  S  D  V  T  V  H  F  S  V  E  L  C  R  V  K
+ 
+    V  A  Y  L  K  H  V  E  L  Q  H  Q  I  Q  Q  L  S  S  K  P
+   GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+       .    :    .    :    .    :    .    :    .    :    .    :
+   CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+   T  A  Y  K  F  C  T  S  S  C  C  W  I
+ 
+    S  A  K  M  T  S  Y  Q  K  E  Q  L  K  V  L  S  N  P  D  L
+   TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+       .    :    .    :    .    :    .    :    .    :    .    :
+   AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+                                     *  L  Y  E  R  I  R  V  Q
+                        *  F  S  C  N  F  T  S  E  L  G  S  R
+.end lit
+.left margin1
+@40. TX 5 @ Translate and write the protein sequence to disk
+.LEFT MARGIN2
+.para
+This routine allows the user to translate sections of the sequence into 
+the 
+1 letter amino acid codes and store the resulting amino acid sequences in
+a disk file.
+Two modes of use are possible. Either all open reading frames of at least 
+some minimum length will 
+automatically be found and translated, or the user can specify that 
+particular segments be translated.
+.para
+Mode 1: the user selects to to translate all open reading frames.
+.para
+Either, or both, strands can be 
+translated.
+ The output file is in the same format as a PIR .seq file. 
+Each protein segment is given an entry name that is its start base in 
+the DNA, and a title that includes its end position, 
+reading frame and strand (+ for plus, - for minus). 
+Each segment is terminated by * whether or not 
+there is a stop codon in the DNA. The file is therefore suitable for input 
+to FASTA, ALIGNL and ANALYSEPL.
+.para
+Mode 2: the user selects to identify the segments to translate.
+.para
+Either, or both, strands can be 
+translated.
+If multiple coding regions 
+are translated each will be separated from the previous one by a gap of 5 
+dashes (-----).
+The sections to translate can be 
+defined from the keyboard or by supplying the name of the appropriate 
+EMBL
+library feature table.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=40
+ Translate and write protein sequence to disk
+? (y/n) (y) Translate selected regions
+? (y/n) (y) Define segments using keyboard
+Translate
+? From (0-1023) (0) =1
+? To (1-1023) (1023) =111
+? (y/n) (y) + strand
+Translate
+? From (0-1023) (0) =
+? Output file name=1.OUT
+ 
+ ? Menu or option number=40
+ Translate and write protein sequence to disk
+? (y/n) (y) Translate selected regions n
+? Minimum open frame in amino acids (5-1000) (30) =
+ 
+X 1 + strand only
+  2 - strand only
+  3 Both strands
+? 0,1,2,3 =3
+? File name for translation=1.OUT
+ 
+? Menu or option number=6
+Page through text files
+? Name of file to read=1.OUT
+>P1;    25
+    135     1 +
+ GAQRLLRRSCWCWRCGGRQRTQGSAGRGRRRRGGGG*
+>P1;   238
+    486     1 +
+ IRCRDCGQRRRGIFDLVDDFHVRRHIVLARKLFEAEGTGVHFHISLMGGNIVTAEVTNVR
+ VDAGADFAAVRMLALFGAVVPH*
+>P1;   556
+    795     1 +
+ 
+ SSTQVRRASAQTSSLQLESIVAVVNVEVFLAAKHSRFYIAVLFAQFGPLLDARLDRGCGK
+ GAGRRDQWRGGGVDLANGR*
+>P1;   796
+    987     1 +
+ 
+ FGYADHAFHLRSTSRHSDNVKFDSAGRRRCCCFHLVFSLGSDEEGLLARLLVEVTTIRVV
+ LRG*
+>P1;     2
+    163     2 +
+ NSVWAWCEVPRDYCAAAAGAGGAEVVNGPRDPLDEDVDDEEEVDSALLVAGSD*
+>P1;   176
+    391     2 +
+ PLRSGGGGVEAPETPSGWPARFAAATVANAVEGFSILWMIFTCAVILSLRVNSLKQKGQG
+ YTFTFRLWEVT*
+>P1;   476
+    628     2 +
+ SLTEPSASPSPTLLLRFSLVLTEGVPNPALRFGVLPLRPAAFNLNPSLLL*
+>P1;   629
+    958     2 +
+ MSRYSWLLNTAGFTSPFCLPSLGRFWTRGLTVAVEKEPAGETNGVEAALTLPMGVSLGML
+ TMLFTCAPPAAIPIMLSLIPLAAAAAAVSTWCFLWAAMRKACWRACSLR*
+>P1;     3
+    293     3 +
+ IRFGLGVRCPEITAPQLLVLAVRRSSTDPGIRWTRTSTTRRRWIAHCWWLAATDLSSDHS
+ DPAAEASRLPKLPVAGLLDSLPRLWPTPSRDFRSCG*
+>P1;   411
+    521     3 +
+ CACRRGSRLCSGTYARPLWCSSPSLSPPPRPRQRCC*
+>P1;  1020
+     37     1 -
+ EFGKYNPLTDNSSPTQDHTDGSHLNEQARQQAFLIAAQRKHQVETAAAAAASGIKLNIIG 
+ MAAGGAQVKSMVSIPKLTPIGKVNAASTPLVSPAGSFSTATVKPRVQKRPKLGKQNGDVK
+ PAVFSSQEYLDIYNSNDGFKLKAAGLSGSTPNLSAGLGTPSVKTKLNLSSNVGEGEAEGS
+ VRDYCTKEGEHTYRCKVCSRVYTHISNFCRHYVTSHKRNVKVYPCPFCFKEFTRKDNMTA
+ HVKIIHKIENPSTALATVAAANLAGQPLGVSGASTPPPPDLSGQNSNQSLPATSNALSTS
+ SSSSTSSSSGSLGPLTTSAPPAPAAAAQ*
+>P1;   373
+     -1     2 -
+ AKCESVPLSLLLQRVYAQGQYDGARENHPQDRKSLDGVGHSRGSESSRPATGSFGSLDAS
+ AAGSEWSELKSVAASHQQCAIHLLLVVDVLVQRIPGSVDDLRTASTSSCGAVISGHLTPS
+ PNRI*
+>P1;   517
+    407     2 -
+ QQRWRGRGGGLSEGLLHQRGRAYVPLQSLLPRLHAH*
+>P1;   649
+    518     2 -
+ QPGIPRHLQQQRWIQVEGCWSERKHAEPECWIRNSLCQNQAES*
+>P1;   853
+    650     2 -
+ HYRNGGWWSAGEKHGQHTQTNAHWQGQRRLHAIGLACRLLFHSHGQAARPEAAQTQTER
+ RCKTGCV*
+>P1;   958
+    854     2 -
+ SPQRAGAPTSLPHRCPEKTPGGNSSSGGGQRNQT*
+>P1;   179
+     78     3 -
+ VVRTQISRCQPPAMRYPPPPRRRRPRPADPWVR*
+>P1;   479
+    363     3 -
+ GTTAPKRASIRTAAKSAPASTRTLVTSAVTMLPPISEM*
+>P1;   791
+    666     3 -
+ RPLARSTPPPRHWSRLPAPFPQPRSSRASRSGPNWANRTAM*
+>P1;  1022
+    819     3 -
+ SNSASTTRSPTTAHPRRTTRMVVTSTSRRANKPSSSLPRENTRWKQQQRRRPAESNLTLS
+ EWRLVERR*
+End of file
+.end lit 
+
+.LEFT MARGIN1
+@41. TX 5 @ Calculate and write codon table to disk
+.LEFT MARGIN2
+.para
+This routine calculates codon usage tables
+for sections of the sequence
+and stores the resulting tables on disk.
+The sections to translate can be 
+defined from the keyboard or by supplying the name of the appropriate 
+EMBL
+library feature table.
+.para
+If required users can add to an existing codon table stored as a disk file. 
+Choose between storing observed counts or having them normalised so 
+that the totals for each amino acid sum to 100. Select between defining 
+segments at the keyboard or using an EMBL feature table. Define 
+segments. Signal completion with a zero start. Supply a file name. For 
+each segment the program will display the counts, at the end it will 
+display the accumulated totals.
+.lit
+
+ Typical dialogue follows.
+? Menu or option number=41
+ Calculate and write codon table to disk
+? (y/n) (y) Start with empty table
+? (y/n) (y) Show observed counts
+? (y/n) (y) Define segments using keyboard
+? Count from (0-1023) (0) =1
+? Count to (1-1023) (1023) =111
+? (y/n) (y) + strand
+ 
+     ===========================================
+     F TTT   0. S TCT   0. Y TAT   0. C TGT   0.
+     F TTC   1. S TCC   1. Y TAC   0. C TGC   3.
+     L TTA   1. S TCA   0. * TAA   0. * TGA   1.
+     L TTG   2. S TCG   0. * TAG   0. W TGG   2.
+     ===========================================
+     L CTT   0. P CCT   0. H CAT   0. R CGT   2.
+     L CTC   0. P CCC   0. H CAC   0. R CGC   2.
+     L CTA   0. P CCA   0. Q CAA   1. R CGA   1.
+     L CTG   1. P CCG   0. Q CAG   2. R CGG   2.
+     ===========================================
+     I ATT   0. T ACT   0. N AAT   0. S AGT   0.
+     I ATC   0. T ACC   1. N AAC   0. S AGC   1.
+     I ATA   0. T ACA   0. K AAA   0. R AGA   1.
+     M ATG   0. T ACG   0. K AAG   0. R AGG   0.
+     ===========================================
+     V GTT   0. A GCT   1. D GAT   0. G GGT   3.
+     V GTC   0. A GCC   1. D GAC   0. G GGC   1.
+     V GTA   0. A GCA   0. E GAA   1. G GGA   4.
+     V GTG   1. A GCG   0. E GAG   0. G GGG   0.
+     ===========================================
+? Count from (0-1023) (0) =
+ 
+    Codon totals over all genes
+     ===========================================
+     F TTT   0. S TCT   0. Y TAT   0. C TGT   0.
+     F TTC   1. S TCC   1. Y TAC   0. C TGC   3.
+     L TTA   1. S TCA   0. * TAA   0. * TGA   1.
+     L TTG   2. S TCG   0. * TAG   0. W TGG   2.
+     ===========================================
+     L CTT   0. P CCT   0. H CAT   0. R CGT   2.
+     L CTC   0. P CCC   0. H CAC   0. R CGC   2.
+     L CTA   0. P CCA   0. Q CAA   1. R CGA   1.
+     L CTG   1. P CCG   0. Q CAG   2. R CGG   2.
+     ===========================================
+     I ATT   0. T ACT   0. N AAT   0. S AGT   0.
+     I ATC   0. T ACC   1. N AAC   0. S AGC   1.
+     I ATA   0. T ACA   0. K AAA   0. R AGA   1.
+     M ATG   0. T ACG   0. K AAG   0. R AGG   0.
+     ===========================================
+     V GTT   0. A GCT   1. D GAT   0. G GGT   3.
+     V GTC   0. A GCC   1. D GAC   0. G GGC   1.
+     V GTA   0. A GCA   0. E GAA   1. G GGA   4.
+     V GTG   1. A GCG   0. E GAG   0. G GGG   0.
+     ===========================================
+? (y/n) (y) Save table in a file n
+.end lit
+
+.left margin1
+@42. TX 6 @ Codon usage method
+.LEFT MARGIN2
+.para
+Used to find protein coding regions. For each window length of the 
+sequence the routine measures the closeness to an expected codon usage. 
+Results are plotted for each of the three reading frames. Stop and start 
+codons are also marked on the plots. Has the highest resolution of all 
+such methods, but makes the strongest assumption, i.e. that the codon 
+usage is known. The latest version is described in Methods in Enzymology 
+183, 193-211.
+.para
+Choose whether to use an internal standard (i.e. part of the current 
+sequence known to code for a protein). If so define its end points, and 
+those of any others. Otherwise supply the name of a disk file containing a 
+table of codon usage. Tables are listed. Choose between using the 
+observed counts, or two types of normalisation: normalised to give an 
+average amino acid composition; normalised to no amino acid bias. The 
+first normalisation is clearly often sensible, but the second removes 
+valuable information and is only made availabe for special 
+circumstances. The final table will be displayed, followed by the 
+expected scores for window lengths 21, 31 and 41 codons. The scores for 
+each of the three reading frames are shown (they are logarithmic values) 
+to help users choose a window length for the analysis. Define a window 
+length and plot interval. Plotting will start.
+.para
+The method was first described in
+Staden and McLachlan Nucl. Acid Res. 10 141-156 (1982) and the 
+following is a summary of the initial ideas.
+The method makes the following main assumptions: the codon 
+preferences 
+of all the 
+genes in the sequence we are examining are similar to those of the 
+standard; 
+the sequence is coding 
+throughout its whole length in only one reading frame; in the coding 
+frame 
+the frequency of codon abc has a definite value Fabc 
+.LEFT MARGIN2
+If we select a sequence  a1b1c1a2b2c2a3b3c3,...,anbncnan+1bn+1cn+1 
+then the 
+probability of selecting it in each of the three frames is:
+.left margin15
+frame 1: p1=Fa1b1c1.Fa2b2c2....Fanbncn
+.left margin15
+frame 2: p2=Fb1c1a2.Fb2c2a3...Fbncnan+1
+.left margin15
+frame 3: p3=Fc1a2b2.Fc2a3b3...Fcnan+1bn+1
+.LEFT MARGIN2
+The probability that selection of a particular sequence was "caused" by it 
+being a coding sequence is:
+.LEFT MARGIN2
+P1=p1/(p1+p2+p3), P2=p2/(p1+p2+p3), P3=p3/(p1+p2+p3).
+.LEFT MARGIN2
+The program calculates these values for the given window length but 
+plots 
+Log(P/(1-P)) for each of the three frames. At each point along the 
+sequence 
+that the program has a 
+point to plot it finds which of the three values is highest and places a 
+single point at the 50% level for the corresponding frame. These single 
+points will join to form a solid line if one frame is consistently the 
+highest scoring. In addition stop codons are shown as short vertical lines 
+that bisect the 50% 
+level of probability. When looking for coding regions 
+the user should look for solid horizontal lines at the 
+50% level that are not interrupted by these short vertical lines.
+.para
+Changes.
+ Two normalisations are offered: 1) to remove all amino acid 
+compositional components from the tables, hence leaving only the codon 
+preference components. In general this is not recommended as the amino 
+acid 
+component alone is often sufficient to choose correctly between frames, 
+but 
+may be useful in special circumstances. 2) to change the amino acid 
+composition components to give an average amino acid composition 
+rather the 
+the one contained in the standard (this leaves the codon preference 
+components unchanged). In general this should be useful as the average 
+amino acid composition is likely to be closer to the composition of the 
+genes being hunted, than is that of the standard table of codon 
+preferences. 
+The average composition 
+is that recently published by Argos, not the Dayhoff one that we have 
+used 
+before.
+.para
+Typical dialogue follows.
+.lit
+ 
+? Menu or option number=42
+Staden and McLachlan codon usage method
+Codon tables for standards may be read from disk
+or calculated from parts of the current sequence
+? (y/n) (y) Define internal standard
+Define standard
+? start (0-1023) (0) =1
+? end (2-1023) (1023) =1000
+     ===========================================
+     F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+     F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+     L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+     L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+     ===========================================
+     L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+     L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+     L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+     L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+     ===========================================
+     I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+     I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+     I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+     M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+     ===========================================
+     V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+     V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+     V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+     V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+     ===========================================
+Define standard
+? start (0-1023) (0) =
+Total codons in standard=     333.
+X 1 Use observed frequencies
+  2 Normalize to average amino acid composition
+  3 Normalize to no amino acid bias
+? 0,1,2,3 =2
+     ===========================================
+     F TTT  19. S TCT   2. Y TAT  10. C TGT   3.
+     F TTC   6. S TCC  22. Y TAC  10. C TGC   8.
+     L TTA   2. S TCA   0. * TAA   0. * TGA   0.
+     L TTG   7. S TCG   2. * TAG   0. W TGG   8.
+     ===========================================
+     L CTT  16. P CCT  16. H CAT   4. R CGT  10.
+     L CTC  12. P CCC   0. H CAC  10. R CGC  10.
+     L CTA   0. P CCA   0. Q CAA   8. R CGA   7.
+     L CTG  21. P CCG  16. Q CAG  18. R CGG   6.
+     ===========================================
+     I ATT  19. T ACT  13. N AAT  16. S AGT   2.
+     I ATC  11. T ACC  17. N AAC  12. S AGC  15.
+     I ATA   3. T ACA   3. K AAA  22. R AGA   1.
+     M ATG  15. T ACG   3. K AAG  15. R AGG   1.
+     ===========================================
+     V GTT  15. A GCT  21. D GAT  14. G GGT  10.
+     V GTC   7. A GCC  16. D GAC  20. G GGC  13.
+     V GTA   8. A GCA   8. E GAA  26. G GGA  14.
+     V GTG  11. A GCG   8. E GAG  13. G GGG   9.
+     ===========================================
+Span length  21 expected mean values:   4.8  -5.7  -4.8
+Span length  31 expected mean values:   7.1  -8.4  -7.2
+Span length  41 expected mean values:   9.5 -11.1  -9.5
+? odd span length (11-101) (25) =41
+? plot interval (1-11) (5) =
+ 
+ Missing graphics display here
+
+.end lit
+
+.left margin1
+@43. TX 6 @ Positional base preference method. 
+.LEFT MARGIN2
+.para
+Used to find protein coding regions. For each window length of the 
+sequence the routine measures the closeness to an expected pattern of 
+base frequencies . Results are plotted for each of the three reading 
+frames. Stop and start codons are also marked on the plots.  The method 
+is particularly useful for showing which reading frame is the most likely 
+to be coding. The latest version is described in a forthcoming issue of 
+Methods in Enzymology, but the original ideas were given in
+Staden, R. Nucl. Acid Res. 12 551-567 (1984).
+.para
+If dialogue is requested the following inputs are needed, otherwise the 
+standard analysis is performed. Choose between a "global" standard, or a 
+selected one. If the global standard is selected the 
+expected scores are displayed and the user asked to define a span length 
+and a plot interval. Then users choose between plotting relative or 
+absolute scores, and can reset the scaling values employed for plotting.
+If the global standard is not selected users must define a region of the 
+sequence to use as a standard, or they can read in a codon table from which 
+the
+program will calculate one. Then they can either, use the values 
+observed in this standard, or they can combine its values for the third 
+positions in codons, with those from the global standard. Next they can 
+give different weightings to each of the three positions in codons.
+.para
+In its original form the method
+ took advantage of the
+uneven 
+use of amino acids by proteins and the structure of the genetic code table
+and assumed that there is a typical ("global")
+amino acid composition 
+and no codon preference. The typical amino acid composition is the 
+average 
+composition found by Argos (see below).
+ This composition and no codon preference 
+determines the frequency of each of the four bases in each of the three 
+codon positions. This 3x4 frequency table shows unequal use of the bases 
+and in particular a marked use of G in position 1 and of A in position 2 
+(at the expence of G). The routine slides a window along the sequence and 
+calculates a score for each of the three reading 
+frames at each window position. It assumes the sequence is coding 
+throughout its whole length and calcualtes the probability that it is 
+coding in each of the three frames. 
+When tested against all the E. coli sequences in the EMBL sequence 
+library 
+it correctly identified the coding frame for 91% of window positions.
+(The E. coli 
+sequences were chosen only for technical reasons: I have no reason to 
+think 
+the method would work less well on other organisms with roughly even 
+base composition.)
+The routine can plot either absolute or relative values: ie absolute values 
+are the values found by summing the scores for each frame (say p1, p2 
+and 
+p3), and the relative values are then p1/(p1+p2+p3), p2/(p1+p2+p3) and 
+p3/(p1+p2+p3). 
+.para
+At each point along the sequence 
+that the program has a 
+point to plot it finds which of the three values is highest and places a 
+single point at the 50% level for the corresponding frame. These single 
+points will join to form a solid line if one frame is consistently the 
+highest scoring. In addition stop codons are shown as short vertical lines 
+that bisect the 50% 
+level of probability. When looking for coding regions 
+the user should look for solid horizontal lines at the 
+50% level that are not interrupted by these short vertical lines.
+
+The absolute mean
+values expected on the complement of 
+the coding strand (and in the same frame) 
+are 5% lower than those on the coding strand but the relative values 
+are the same on both strands. Although the 
+relative values give smoother plots and tend to emphasize the coding 
+frame
+they therefore, cannot be used to decide which strand is coding. The 
+absolute values plot should be used for this purpose but bearing in mind 
+the fact the the differences between strands are quite small. 
+.para
+The method has been improved in two overall ways: first it now allows 
+users to 
+define their own typical amino acid composition by selecting a standard 
+sequence from within the sequence they are analysing or from a codon table;
+secondly it allows the inclusion of third position preferences. 
+Again these third position preferences are defined by the use of an 
+internal standard sequence. Not only can users define their own standards 
+but they can also give weights to each of the three positions in codons. 
+This allows different emphasis to be used for each of the three positions. 
+As an example of its use, by giving, in turn, weights of 1.0, 0.0, 0.0, and 
+0.0, 1.0, 0.0, and finally 0.0, 0.0, 1.0, you could see the separate 
+contribution made by each of the three positions. It is also possible to 
+use the third position preferences with the values for the first two 
+positions taken from the "global"  amino acid composition. 
+ In all cases users may choose to plot 
+absolute or relative values. The expected scores are displayed before 
+each 
+analysis and scales are drawn on the plots.
+At present this method does not give probabilities of coding; it has only 
+been tested for its ability to choose the correct reading frame (see 
+above). It could be used to give probabilities of coding if was applied to 
+all known coding and non-coding sequences in the way that the uneven 
+positional base frequencies method was. It is designed to be used in 
+conjunction with this method. Note that the average amino composition 
+used 
+to derive the base frequencies was changed on 17-11-1988, to be
+ the new average given by McCaldon and Argos in Proteins 4 99-122 
+(1988).
+A further change is to allow users to select their own scales for 
+producing the plots. It can be helpful if they want to emphasise or 
+diminish 
+certain features.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=D43
+Positional base preferences method to find protein genes
+Select standard source
+X  1 Use global standard         
+   2 Use internal standard       
+   3 Use codon usage table       
+? Selection  (1-3) (1) =2
+Define region for standard
+? start (0-8134) (0) =3171
+? end (3172-8134) (8134) =4700
+Select normalisation
+X  1 Use observed frequencies    
+   2 Combine with global standard
+? Selection  (1-2) (1) =1
+          T      C      A      G      Range
+      1  0.125  0.249  0.230  0.397  0.272
+      2  0.298  0.245  0.292  0.165  0.132
+      3  0.288  0.313  0.169  0.230  0.144
+? (y/n) (y) Use 1.0 for positional weights 
+Give weights between 0.0 and 1.0
+to each of the 3 codon positions
+? Position 1 (0.00-1.00) (1.00) =
+? Position 2 (0.00-1.00) (1.00) =
+? Position 3 (0.00-1.00) (1.00) =
+Expected scores per codon in each frame
+       0.136     0.122     0.123
+? odd span length (31-101) (67) =
+? plot interval (1-11) (5) =
+? (y/n) (y) Plot relative scores 
+Scaling values:
+   Minimum  maximum    range
+    0.3121   0.3656   0.0382
+? (y/n) (y) Leave scaling values unchanged 
+
+  Graphics not shown
+
+? Menu or option number=D43
+Positional base preferences method to find protein genes
+Select standard source
+X  1 Use global standard         
+   2 Use internal standard       
+   3 Use codon usage table       
+? Selection  (1-3) (1) =3
+? File name of standard=atpase.cods
+     ===========================================
+     F TTT  21. S TCT  33. Y TAT  15. C TGT   5.
+     F TTC  55. S TCC  40. Y TAC  40. C TGC   4.
+     L TTA   8. S TCA   7. * TAA   8. * TGA   0.
+     L TTG  19. S TCG  12. * TAG   1. W TGG  17.
+     ===========================================
+     L CTT  22. P CCT  17. H CAT   6. R CGT  73.
+     L CTC  21. P CCC   4. H CAC  30. R CGC  23.
+     L CTA   1. P CCA  10. Q CAA  19. R CGA   5.
+     L CTG 168. P CCG  48. Q CAG  80. R CGG   3.
+     ===========================================
+     I ATT  47. T ACT  14. N AAT  17. S AGT   8.
+     I ATC  98. T ACC  54. N AAC  52. S AGC  26.
+     I ATA   6. T ACA   7. K AAA  85. R AGA   0.
+     M ATG  75. T ACG  13. K AAG  28. R AGG   0.
+     ===========================================
+     V GTT  67. A GCT  56. D GAT  41. G GGT  90.
+     V GTC  29. A GCC  53. D GAC  66. G GGC  66.
+     V GTA  49. A GCA  59. E GAA 101. G GGA   5.
+     V GTG  57. A GCG  64. E GAG  41. G GGG   8.
+     ===========================================
+Select normalisation
+X  1 Use observed frequencies    
+   2 Combine with global standard
+? Selection  (1-2) (1) =2
+          T      C      A      G      Range
+      1  0.177  0.211  0.277  0.336  0.159
+      2  0.271  0.238  0.310  0.182  0.128
+      3  0.242  0.301  0.168  0.289  0.132
+? (y/n) (y) Use 1.0 for positional weights 
+Expected scores per codon in each frame
+       0.785     0.736     0.736
+? odd span length (31-101) (67) =
+? plot interval (1-11) (5) =
+? (y/n) (y) Plot relative scores 
+Scaling values:
+   Minimum  maximum    range
+    0.3219   0.3519   0.0214
+? (y/n) (y) Leave scaling values unchanged 
+
+  Graphics not shown
+.end lit
+.left margIN1
+@44. TX 6 @ Uneven positional base frequencies.
+.LEFT MARGIN2
+.para
+Used to find regions of a sequence that might be coding for a protein. The 
+method looks for sections of the sequence in which the frequency at 
+which each of the four bases occupies the three positions in codons is 
+nonrandom. The level of nonrandomness is plotted on a scale that shows 
+the probability that the sequence is coding. At each position along a 
+sequence the calculation gives the same value for all six possible reading 
+frames, so only one value is plotted.
+.para
+Define the window length and plot interval.
+.para
+The results are plotted in a box divided by a horizontal line marked "76%". 
+76% of coding regions achieve values above this line and 76% of 
+noncoding regions achieve scores below the line.
+.para
+This method, first described in  Staden R. Nucl. Acid Res. 12 551-567 
+1984,
+looks for uneven positional 
+usage of bases in codons.
+It looks through the sequence in one fixed 
+phase and counts the number of times each base apears in each of the 
+three 
+codon positions: for each window position it counts A1,A2,A3 and 
+C1,C2,C3 
+and G1,G2,G3 and T1,T2,T3 and calculates AMEAN=(A1+A2+A3)/3, and 
+similarly 
+CMEAN, GMEAN 
+and TMEAN; it then calculates 
+ADIF=abs(A1-AMEAN)+abs(A2-AMEAN)+abs(A3-AMEAN) and similarly 
+CDIF, GDIF and 
+TDIF to measure the differences between an even base usage for all 
+positions in the codons and the observed usage. The routine then 
+calculates 
+the sum ADIF+CDIF+GDIF+TDIF and plots this value on the following scale: 
+the base level is such that no known window in a coding region has a 
+lower 
+value, whereas 14% of windows in noncoding sequences score below it. 
+The
+top of the scale is not achieved by any known noncoding
+region, but is reached by 16% of known coding regions. 
+The bar drawn across the 
+plot corresponds to a level that is exceeded by 76% of windows in known 
+coding regions
+but is reached by only 24% of windows in known noncoding regions. ie 
+76% of 
+coding windows score above and 76% of noncoding windows score below.
+This is similar to Ficketts method but without 
+the probabilities and weightings from the Los Alamos sequence library: it 
+is therefore unbiased but may well give very similar results.
+.left margin1
+@45. TX 6 @ Codon improbability on base composition
+.LEFT MARGIN2
+.para
+Used to find regions of a sequence that might code for a protein.
+.para
+If dialogue is requested define a window length and plot interval.
+.para
+ The idea of the method is, that of all sequence features 
+that we know, it is only 
+coding regions that will give rise to codon biases well above those 
+expected 
+from the base composition.
+If a region of sequence shows sufficiently strong
+codon bias then we conclude that it is coding for a protein.
+ Using the multinomial distribution we
+have derived a function to measure the improbability of observing a 
+set of codons from a sequence of the given composition. Using the 
+Poisson 
+distribution we have worked out the distribution 
+of the improbability. The program plots the observed improbability minus 
+the expected improbability (the mean as calculated from the Poisson 
+distribution). The plots are presented against a scale of units of standard 
+deviation as measured from the Poisson distribution. As with the other 
+Staden and McLachlan method the program puts an extra point at a fixed 
+level for the highest of the three probabilities; for this function this 
+point is placed at six standard deviations above the mean expected level. 
+The top of each plot corresponds to 12 standard deviations above the 
+expected level and the bottom corresponds to the expected value.
+.para
+Analysis of the application 
+of the method to the EMBL sequence library indicates that the method 
+does 
+work for most sequences and that the levels of improbability roughly 
+correlate with levels of expression. 
+Coding regions will show high peaks in all three frames making 
+interpretation more difficult than for some of the other methods.
+.left margin1
+@46. TX 6 @ Codon improbability on amino acid composition
+.LEFT MARGIN2
+.para
+Used to finds regions of a sequence that might code for a protein.
+.para
+If dialogue is requested define a window length and a plot interval.
+.para
+The idea of the method is, that of all sequence features 
+that we know, it is only 
+coding regions that will give rise to codon biases such that, for each 
+amino acid, some codons are used far more frequently than others. The 
+method is independent of what the bias actually is, requiring only that it 
+is present.
+If a region of sequence shows sufficiently strong
+codon bias then we conclude that it is coding for a protein.
+ Using the multinomial distribution we
+have derived a function to measure the improbability of observing a 
+set of codons from a sequence of the given composition. Using the 
+Poisson 
+distribution we have worked out the distribution 
+of the improbability. The program plots the observed improbability minus 
+the expected improbability (the mean as calculated from the Poisson 
+distribution). The plots are presented against a scale of units of standard 
+deviation as measured from the Poisson distribution. As with the other 
+Staden and McLachlan method the program puts an extra point at a fixed 
+level for the highest of the three probabilities; for this function this 
+point is placed at six standard deviations above the mean expected level. 
+The top of each plot corresponds to 12 standard deviations above the 
+expected level and the bottom corresponds to the expected value.
+.left margin1
+@47. TX 6 @ Shepherd RNY preference method
+.LEFT MARGIN2
+.para
+Used to find regions of a sequence that might code for a protein. Based on 
+the method of Shepherd
+(PNAS 78 1596-1600, 1981). 
+.para
+If dialogue is requested define a window length and plot interval.
+.para
+Shepherd has found that 
+many genes have a preference for the use of codons of the form RNY 
+where 
+R=purine, Y=pyrimidine and N=any base. He has attributed this to being 
+due 
+to remants of a primitive genetic code. The calculation is similar to that 
+for the Staden and McLachlan method, the p1's being simply the number of 
+RNY codons found in frame 1 etc and the P's being p/(p1+p2+p3).
+.left margIN1
+@48. TX 6 @ Ficketts method
+.LEFT MARGIN2
+.para
+Used to find regions of a sequence that might code for a protein. Based on 
+the method of Fickett
+(Nucl. Acid Res.10 
+1982), but plots values for fixed window lengths rather than over the 
+whole of open reading frames.
+.para
+If dialogue is requested define a window length and plot interval. The 
+results are plotted in a box divided into three horizontal strips.
+.para
+Sections of the sequence with values plotted in the top strip of the box 
+are adjudged to be coding, those in the middle strip "no decision", and 
+those in the bottom "not coding".
+.para
+The program performs the following calculations: let A1 = the number of 
+occurences of base A in position 1 of codons, A2 for position 2 etc. 
+Similarly for bases C,G and T. For each window position calculate 
+Apos=max(A1,A2,A3)/min(A1,A2,A3)+1. Similarly for C,G and T to give 4 
+positional values. Also count the base composition for the window to 
+give 
+Acomp, Ccomp etc. Fickett  tested each of these 8 parameters singly as 
+to 
+their ability to distinguish coding from noncoding regions and arived at 
+probabilities of coding for the range of values each can take = Pcod. He 
+also measured their relative abilities and given weightings to each of 
+the 8 parameters = Pw. To calculate the "TESTCODE" for a window we 
+first lookup the Pcod for each of the calculated compositional and 
+positional values and then calculate TESTCODE=sum(Pcod*Pw). TESTCODE 
+is 
+plotted relative to three levels of decision: the top division="coding",
+the middle="no opinion" and the bottom division="non coding".
+.left margin1
+@49. TX 6 @ tRNA gene search. 
+.LEFT MARGIN2
+.para
+Used to find segments of a sequence that might code for tRNAs. Looks for 
+potential cloverleaf forming structures and then for the presence of 
+expected conserved bases. Presents results graphically or draws out the 
+cloverleafs.
+.para
+If dialogue is requested a large number of parameters need to be given 
+values, including some loop lengths, scores for each of the four stems, 
+and scores for the conserved bases.
+.para
+The program was first described in 
+Staden Nucl. Acid Res 817-825 (1980). 
+                The tRNA's  that  have
+          been  sequenced  so far have two characteristics that can be used 
+to
+          locate their genes within long DNA sequences.  Firstly they  have  a
+          common   secondary  structure  -  the  cloverleaf  -  and  secondly,
+          particular bases almost always appear at certain  positions  in  
+the
+          cloverleaf.   The  cloverleaf  is composed of four base-paired 
+stems
+          and four loops.  Three of the stems are  of  fixed  length  but  the
+          fourth,  the  dhu  stem which usually has four base pairs, 
+sometimes
+          has only three.  All of the loops can vary in size.   The  following
+          relationships between the stems in the cloverleaf are assumed in 
+the
+          program:  (a) there are no bases between one end  of  the  
+aminoacyl
+          stem  and  the  adjoining tuc stem;  (b) there are two bases 
+between
+          the aminoacyl stem and the dhu stem;  (c) there is one base  
+between
+          the  dhu  stem and the anticodon stem;  (d) there are at least three
+          bases between the anticodon stem and the tuc stem.
+                The program looks first for cloverleaf structure and then,  if
+          required,  for  conserved bases.  The sizes of the loops, the number
+          of basepairs in the stems and the required conserved bases  may  
+all
+          be  specified  by the user.  The process of looking for the presence
+          of conserved bases can reduce the  number  of  potential  
+structures
+          found considerably.
+ The
+          user may also specify that an intron may be present in the 
+anticodon
+          loop.
+.para
+The user may define a minimum number of 
+base pairs for each stem using the scoring system G-C, A-T=2 and G-T=1 
+and 
+scores for each of the conserved bases. Recommended values for the stem 
+scores are given by the prompts and the percentage conservation of the 
+conserved bases as found in the Nucl. Acid Res 1979  paper Gauss, Gruter
+ and Sprinzl  are also given,
+but the user must decide which bases are most 
+likely to be conserved for the sequence being examined.
+The output shows the position of the possible gene in the sequence by a 
+vertical line the height of which shows the number of basepairs made in 
+the 
+stems. The cloverleaf structure is also drawn but will scroll up off the 
+screen. Output of the cloverleafs will look like:
+.lit
+
+       6942
+                    A              
+                  A-U              
+                  A-U              
+                  G-C              
+                  A-U              
+                  U-A              
+                  A-U              
+                  U-A      AAU      
+                  U   UAUCU         
+          AA    A    !!!!!         
+            AAUG     AUAGA   A     
+         U  !!!!     U    UCA      
+         C  UUAC      U            
+          AA    A                  
+                 U-AA A            
+                 A-U               
+                 A-U               
+                 C-G               
+                 U-A               
+                U   A              
+                U   A               
+                 GUC               
+
+ Typical dialogue follows.
+ 
+? Menu or option number=D49
+ tRNA search
+? Maximum trna length (70-130) (92) =
+? Aminoacyl stem score (0-14) (11) =
+? Tu stem score (0-10) (8) =
+? Anticodon stem score (0-10) (8) =
+? D stem score (0-8) (3) =
+? Minimum base pairing total (30-32) (32) =
+? Minimum intron length (0-30) (0) =
+? Minimum length for TU loop (4-12) (6) =
+? Maximum length for TU loop (6-12) (9) =
+? (y/n) (y) Skip search for conserved bases n
+Give a score for each base, then a minimum total at the end
+? Base  8, T is 100% conserved. Score (0-100) (0) =
+? Base 10, G is  95% conserved. Score (0-100) (0) =
+? Base 11, Y is  96% conserved. Score (0-100) (0) =
+? Base 14, A is 100% conserved. Score (0-100) (0) =
+? Base 15, R is 100% conserved. Score (0-100) (0) =
+? Base 21, A is  97% conserved. Score (0-100) (0) =
+? Base 32, Y is 100% conserved. Score (0-100) (0) =
+? Base 33, T is  98% conserved. Score (0-100) (0) =
+? Base 37, A is  91% conserved. Score (0-100) (0) =
+? Base 48, Y is 100% conserved. Score (0-100) (0) =
+? Base 53, G is 100% conserved. Score (0-100) (0) =
+? Base 54, T is  95% conserved. Score (0-100) (0) =
+? Base 55, T is  97% conserved. Score (0-100) (0) =
+? Base 56, C is 100% conserved. Score (0-100) (0) =
+? Base 57, R is 100% conserved. Score (0-100) (0) =
+? Base 58, A is 100% conserved. Score (0-100) (0) =
+? Base 60, Y is  92% conserved. Score (0-100) (0) =
+? Base 61, C is 100% conserved. Score (0-100) (0) =
+? Minimum total conserved base score (0-0) (0) =
+? (y/n) (y) Plot results n
+ 
+ Searching
+ 
+       306
+                   C
+                 C-G
+                 C-G
+                 G-C
+                 T-A
+                 C-G
+                 A-T
+                 T+G     AT
+                A   ATACA
+        TTC    T    !!!!   G
+           CTGT     TATGG  G
+       G    ! !     T    GA
+       C   TAAA      C
+        GCG    C      G
+                T+GA   C
+                C-G C   T
+                T+G  A   T
+                T-A   G   T
+                T-A    G   A
+               G   G    G   C
+               A   A     G   A
+                AGC       T   C
+                           A   T
+                            C   T
+                             A
+                              C T
+ 
+
+.end lit
+.left margIN1
+.left margIN1
+@50. TX 7 @ Plot start codons
+.left margin2
+.para
+This function plots the positions of all start codons for each of the three 
+reading frames.
+.left margin1
+@51. TX 7 @ Plot stop codons
+.left margin2
+.para
+This function plots the positions of all stop codons for each of the three 
+reading frames.
+.left margIN1
+@52. TX 7 @ Plot stop codons on the complementary strand
+.left margin2
+.para
+This function plots the positions of all stop codons for each of the three 
+reading frames on the complementary strand.
+.left margin1
+@53. TX 7 @ Plot stop codons on both strands
+.left margin2
+.para
+This function plots the positions of all stop codons for each of the three 
+reading frames on both strands.
+.left margin1
+@54. TX 5 @ Search for longest open reading frames
+.left margin2
+.para
+This function will report the positons of the ends of
+all sections of sequence that contain no stop codons. All six reading 
+frames are examined. Results are presented in the form of an EMBL feature
+table. Hence if the results are stored in a file by use of "direct output 
+to disk", the file
+ can be used to translate the 
+open reading frames in a sequence.
+Note that in order for the file to be used as a feature table it
+must include either EMBL
+or GenBank headers, and a suitable "tail". The simplest header is the word
+FEATURES starting in column 1 of the first line of the file. The simplest
+tail is 2 empty lines at the end of the file. These lines are not included
+when nip writes out results in feature table format.
+.para
+Define the minimum length of open reading frame to report (in amino 
+acids).
+Choose to search either or both strands. The program displays the end 
+points, the reading frame and strand.
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=D54
+ Find open reading frames
+? Minimum open frame in amino acids (5-1000) (30) =100
+ 
+X 1 + strand only
+  2 - strand only
+  3 Both strands
+? 0,1,2,3 =3
+
+FT   CDS           1    831       1    831
+FT   CDS        1540   2853       1   1314
+FT   CDS        3130   4242       1   1113
+FT   CDS        5761   6114       1    354
+FT   CDS        6187   6711       1    525
+FT   CDS        1766   2077       2    312
+FT   CDS        2078   2446       2    369
+FT   CDS        4136   5500       2   1365
+FT   CDS        1335   1637       3    303
+FT   CDS        2844   3194       3    351
+FT   CDS        6819   7238       3    420
+FT   CDS        2073   1711  C    1    363
+FT   CDS        2469   2149  C    1    321
+FT   CDS        6542   6144  C    3    399
+
+.end lit
+.left margin1
+@55. TX 8 @ Search for E. coli promoter (general)
+.LEFT MARGIN2
+.para
+Searches for E coli promoter like sequences using a standard weight 
+matrix. The positions of the matches are plotted. No dialogue is required.
+.para
+The method was first described in
+ Staden R. Nucl. Acid Res. 12 505-519 1984.
+This search uses a weight matrix taken from the frequency tables 
+contained 
+in Hawley, D. K. and McClure, R., nar 11 2237-2255 (1983).
+ The weight matrix is 
+divided into 3 sections that are separated by varying sizes of gap: the -
+35 
+region, the -10 and the +1 region.
+The algorithm first looks for a sufficiently good -35 region, then for the 
+best -10 region within range and then for the best +1 region within range 
+of the -10; each separate region must score above the lowest known 
+score 
+for the corresponding section. The gap penalty is then applied and two 
+plots
+produced: one with gap penalties, one without.
+ Scaling is such that no 
+known promoter scores below the bottom level and no known promoter 
+scores 
+above the top level when the weight matrix is applied.
+.para
+Two other functions also look for E. coli promoters: 92 looks for sites on 
+the complementary strand and 93 looks for individual -35 and -10 
+regions 
+and plots them on a scale such the top is the highest known value +10% 
+and 
+the bottom is the lowest known -10%
+.LEFT MARGIN1
+.lit
+weights for E. coli promoters 
+-35 region:
+P -50-49-48-47-46-45-44-43-42-41-40-39-38-37-36-35-34-33-32-31-30-29-28-27-26
+  
+107109109110110110110110110111111110111112112112112112112112112112112112112
+T  41 33 32 25 34 22 35 35 42 27 32 42 47 14 92 94 11 19 15 37 46 34 38 48 34
+C  22 27 18 29 20 14 20 12 22 23 16 25 10 43  7  6 11 18 60  8 25 23 23 17 20
+A  28 38 30 37 35 56 42 42 37 42 39 18 25 26  2  6  2 72 26 50 26 34 25 26 31
+G  16 11 29 19 21 18 13 21  9 19 24 26 29 29 11  6 88  3 11 17 15 21 26 21 27
+-10 region:
+P -23-22-21-20-19-18-17-16-15-14-13-12-11-10 -9 -8 -7 -6 -5
+  112112112112112112112112112112112112112112112112112112112
+T  35 28 28 27 39 51 34 43 26 31 89  3 49 15 19108 31 29 21
+C  34 21 24 27 12 25 20 25 20 27 10  2 16 14 22  3 13 16 30
+A  20 39 33 33 39 23 29 16 23 19  2106 29 66 57  1 35 23 31
+G  23 24 27 25 22 13 29 28 43 35 11  1 18 17 14  0 33 24 30
++ region:
+P -2 -1  1  2  3  4  5  6  7  8  9 10
+  86 88 85 88 88 88 88 88 88 88 88 88
+T 16 22  2 42 27 23 20 25 27 15 16 29
+C 29 49  4 25 25 13 18 22 17 17 16 17
+A 20  9 45 16 24 25 28 24 24 32 35 26
+G 21  8 37  5 12 27 22 17 20 24 21 16
+.end lit
+Notes:
+E. coli promoters have been shown to contain 2 regions of conserved 
+sequence
+located about 10 and 35 bases upstream of the transcription startsite. 
+These
+are TATAAT and TTGACA with an allowed spacing of 15 to 21 bases 
+between. The
+spacing with maximum efficiency was 17 bases and all but 12 of the 112 
+sequences could be aligned with a separation of 17 +or-1 bases. The 
+standard
+promoter has spacing 7 and 17 bases between the startsite and the -10 
+region,
+and the -10 and -35 regions, respectively. The spacing between the -10 
+region
+and the startsite is usually 6 or 7 bases but varies between 4 and 8 
+bases.
+There is an AT rich region of 8 to 10 bases upstream of the -35 region.
+Iniation with a purine is highly prefered with G being used if A is not
+present.
+.lit
+Gap penalties:
+	15 0.02   (only exists as mutant)
+	16 0.2
+	17 1.0
+	18 0.2
+	19 0.05   (guess)
+	20 0.02   (guess)
+	21 0.01   (guess)
+.end lit
+.left margin1
+@56. TX  8 @ Search for E. coli promoter (general)
+strand
+.LEFT MARGIN2
+.para
+This function searches for E. Coli promoters on the complementary strand 
+of 
+the sequence. See the notes on option 55.
+.left margin1
+@57. TX 8 @ Search for E. coli promoter sequences. (-35 and -10) 
+.LEFT MARGIN
+.para
+This function searches separately for the -35 and -10 sequences of an E. 
+coli promoter. See the notes on option 55.
+.left margIN1
+@58. TX 8 @ Search for procaryotic ribosome binding sites
+.LEFT MARGIN2
+.para
+This function searches for the 5' ends of prokaryotic genes using an 
+unusual weight matrix. The search is relatively slow because the matrix 
+is 101 bases in length. No dialogue is required.
+.para
+The method was first described in
+ Staden Nucl. Acid Res. 12 505-519 1984. This actually looks for more 
+than 
+a ribosome binding site as is explained below.  This uses their weight 
+matrix w101 of Stormo and 
+Schneider (NAR 10 2971-3024, 1982)
+which with a value of 2 finds all gene starts in their library. 
+.LEFT MARGIN1
+.lit
+ P-60-59-58-57-56-55-54-53-52-51-50-49-48-47-46-45-44-43-42-41-40-39-38-37-36
+ T  5  1 -3  9-14  7 15 -5  3-16-17  4 18  5 -3 -1  2  4  5 -5  7  8 -5-15  6
+ C-21 -6-11-21  0  8 -7-12 -1  1  0-19 12 -3 -1 10  2 -8 -5-11  8  1 23  6 -5
+ A  7 -2 13 -2 -8-13-18  5  0 -5 13  8-15  9 -4 -7  9  0 -8-11-10 -6 -7 -5 -6
+ G -6 -9 -7  0  8-16 -4 -2-16  1 -4  8-14  5 11-13-24  3  7 22-11 -9-15 10 -4
+
+ P-35-34-33-32-31-30-29-28-27-26-25-24-23-22-21-20-19-18-17-16-15-14-13-12-11
+ T  3  4 16 -4  7 11 -4 -1 12  8 10 -1  1  8  2-10-16 11  1 -3 16 -3-36 -8-27
+ C  2-14 -3 -8-10-21  2  0 -2 -1-11 -3 -1  5-11 -4  7  0-14  6 -8-20 -7-36-44
+ A-12 -1-27 -3 -6  0-12 -3 -4 -7 14 -2 -4 -6  0 12  5 -9  0-11-11 10  8  2  8
+ G  4 -5 -6 -3 -1 -4 -1 -4-15  0-14  3 10-19 -3-10 -7 -7  7  1 -8 -6 15 21 42
+
+ P-10 -9 -8 -7 -6 -5 -4 -3 -2 -1  0  1  2  3  4  5  6  7  8  9 10 11 12 13 14
+ T-53-27-26-23  2 -7-14-40-28  0-53 75-62-20-40-10-35 -5-12 -1  4 14-23  7 -2
+ C-15-50-43-35-38-29-29  1 -9  1-87-55-64-45 11-22-14-20-15-15-10-22 -5  2  6
+ A  0 -3 -5  4-20-11  5  6 -2-15 66-69-52 -5 -4  6  8-24 -7-10 -7 13 14 -9-18
+ G 35 22 16 -6 -5-15-25-33-28-53-36-50107 -5-37-44-27-15-23-16-29-47-17-29-15
+
+ P 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39
+ T-26  1  4 -7  3 -4  0-10  8-18  7-22-21  8  4 -3 -6  7 -8  1 -5-16-16  7 -6
+ C  6 -8 19 -7  9 -3 17 -2  3 -9  5 22 22  8 -1  1 18  6 11-10 -8  7 10  0  7
+ A 14-12-42  1 -5 -4-32 12-10 20 -6 -1  3 -4  4-10 -1 -2-14 11 14 -3  2-13  5
+ G-23 -7 -1 -6-17 -4  0-15-14 -4-17-10 -5-13 -8 10-13-13  9 -4 -3 10  2  4 -8
+
+ P 40
+ T  0
+ C 14
+ A  5
+ G-21
+.END LIT
+These come from w101 of Stormo, Schneider, Gold and Ehrenfeucht Nucl. 
+Acid Res. 10 2997-
+3011, 1982. They report that this matrix gives a score of at least 2 for 
+all
+gene starts in their library whereas all other sequences score 1 or less. 
+.left margin1
+@29. TX 1 @ Reverse and complement the sequence
+.LEFT MARGIN2
+.para
+Reverses and complements the current active region of the sequence.
+.left margin1
+@60. TX 7 @ Search using a dinucleotide weight matrix
+.LEFT MARGIN2
+.para
+This function performs searches for short sequence
+motifs using an appropriate  dinucleotide weight matrix. In addition it 
+can be used to create or modify weight matrices. In order to perform a 
+search the only input 
+required is the name of the file containing the weight matrix.
+The results can be presented graphically or listed. The graphical 
+presentation will draw line at the position of any matches found; the 
+height of the line is proportional to the score. The method is identical to 
+that using weight matrices derived from nucleotide frequencies, except 
+that here we use the frequencies of dinucleotides.
+.para
+For a search, select "use weight matrix", supply the name of the file 
+containing the weight matrix, and choose between having results plotted 
+or listed. If dialogue is requested when the function is selected users can 
+alter the cutoff score employed.
+.para
+To create a weight matrix several steps are involved. A file containing an 
+alignment of known motifs is required. (This file must be created before 
+the current option is selected. The format is a follows: each sequence is 
+written on a separate line with at least one space at the beginning; each 
+sequence is terminated by a space character, and can be followed by a 
+name. The sequences must be aligned.) Supply the name of the file of 
+aligned sequences. The program reads and displays the sequences. Choose 
+between "summing logs of weights" or summing weights (i.e. whether to 
+multiply or add weights). If logs are used all scores will be negative. 
+Choose if all positions in the set of aligned sequences should be used or 
+if a mask should be applied. If so selected, define a mask as a string of 
+symbols, in which symbol - means ignore and any other symbol means 
+use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+.para
+The program will calculate weights as the frequencies of the 
+dinucleotides at each unmasked position in the set of aligned sequences. 
+These weights are then applied to the set of aligned sequences to give a 
+range  of "observed" scores. The mean and standard deviation of these 
+scores is displayed. The user is asked to supply several values to be used 
+when the weight matrix is applied to other sequences: a cutoff score (by 
+default, the mean minus 3 standard deviations), a top score for scaling 
+graphical results (by default, the mean plus 3 standard deviations), and a 
+position to identify (this means that if a particular base within the 
+motif is used as a "landmark", such as the A of the AG in splice acceptor 
+sites, then its position will be marked in plots). All these values are 
+stored along with the weight matrix. Finally supply the name of a file to 
+contain the weight matrix.
+.para
+Weight matrices can be "rescaled" using a set of aligned sequences in 
+much the same ways as a matrix is created. The purpose is to redefine 
+the cutoff scores, and rescaling does not alter any other values in the 
+weight matrix file.
+.para
+ The methods have always had to deal with the problem of zeroes in the 
+matrices. The current versions 
+employ "Laplaces Law of Succession" in which 1 is 
+added to each term.
+
+.lit
+Typical dialogue follows.
+ 
+? Menu or option number=D60
+ 
+ Motif search using dinucleotide weight matrix
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 = 2
+? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+ 
+ 
+     1 AGCGTGACTCTTCCCGGAA HIS1
+     2 GAGGTGACTCACTTGGAAG HIS1
+     3 CGGATGACTCTTTTTTTTT HIS3
+     4 ACAGTGACTCACGTTTTTT HIS4
+     5 GTCGTGACTCATATGCTTT ARG3
+     6 TGAATGACTCACTTTTTGG ARG4
+     7 TTCTTGACTCGTCTTTTCT CPA1
+     8 CGAATGACTCTTATTGATG CPA2
+     9 AGAATGACTAATTTTACTA TRP5
+    10 TCGTTGACTCATTCTAATC TRP3
+    11 TTGCTGACTCATTACGATT TRP2
+    12 GAGATGACTCTTTTTCTTT IV1
+    13 GCGATGATTCATTTCTCTG IV2
+    14 TAGATGACTCAGTTTAGTC LEU1
+    15 TAAGTGACTCAGTTCTTTC LEU4
+    16 ATGATGACTCTTAAGCATG ILS1
+Length of motif    18
+? (y/n) (y) Sum logs of weights n
+? (y/n) (y) Use all motif positions n
+x means use, - means ignore
+e.g. xx-x---x-x means use positions 1,2,4,8,10
+? Mask=----XXXXXXXX--------
+ Applying weights to input sequences
+   1       89.000 AGCGTGACTCTTCCCGGA
+   2       91.000 GAGGTGACTCACTTGGAA
+   3       93.000 CGGATGACTCTTTTTTTT
+   4       90.000 ACAGTGACTCACGTTTTT
+   5       94.000 GTCGTGACTCATATGCTT
+   6       91.000 TGAATGACTCACTTTTTG
+   7       81.000 TTCTTGACTCGTCTTTTC
+   8       90.000 CGAATGACTCTTATTGAT
+   9       75.000 AGAATGACTAATTTTACT
+  10       97.000 TCGTTGACTCATTCTAAT
+  11       97.000 TTGCTGACTCATTACGAT
+  12       93.000 GAGATGACTCTTTTTCTT
+  13       69.000 GCGATGATTCATTTCTCT
+  14       90.000 TAGATGACTCAGTTTAGT
+  15       90.000 TAAGTGACTCAGTTCTTT
+  16       90.000 ATGATGACTCTTAAGCAT
+Top score      97.000  Bottom score      69.000
+Mean      88.750  Standard deviation       7.319
+Mean minus 3.sd      66.794  Mean plus 3.sd     110.706
+? Cutoff score (-999.00-9999.00) (66.79) =
+? Top score for scaling plots (66.79-999.00) (110.71) =
+? Position to identify (0-18) (1) =
+? Title=GCN4 DI WTS
+? Name for new weight matrix file=3.WTS
+  
+? Menu or option number=D60
+ Motif search using dinucleotide weight matrix
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =
+? Motif weight matrix file=3.WTS
+ GCN4 DI WTS
+? Cutoff score (-9999.00-9999.00) (66.79) =40
+? (y/n) (y) Plot results n
+     15     42.00 CAACCCGCTCACCGACAA
+     29     42.00 ACAACAGCTCACCCACGC
+     93     46.00 AGCCTTCCTCATCGCTGC
+    153     40.00 CAGCGGAATCAAACTTAA
+    408     42.00 CGATGGATTCAAGTTGAA
+    469     47.00 TTAGGAACTCCCTCTGTC
+    493     60.00 AAGCTGAATCTTAGCAGC
+    530     43.00 CGGAGGGCTCAGTGAGGG
+    542     47.00 TGAGGGACTACTGCACCA
+    678     41.00 CTTCTGCTTCAAAGAGTT
+    709     47.00 AATATGACGGCGCACGTG
+    848     54.00 GTCAGAACTCAAATCAGT
+    940     49.00 CCGTTGACGACCTCCGCA
+    992     42.00 TGGGCACCTCACACCAAG
+ 
+
+.end lit
+.left margIN1
+@61. TX 8 @ Search for eukaryotic ribosome binding sites
+.LEFT MARGIN2
+.para
+Searches for eukaryotic ribosome binding sites using weightings derived 
+from
+ Sargan,Gregory,Butterworth febs let 147 133-136 1982.  No dialogue is 
+required. First described in Staden Nucl. Acid Res. 12 505-519 1984.
+
+.LEFT MARGIN1
+.lit
+mRNA WTS FOR EUKARYOTES SARGAN,GREGORY,BUTTERWORTH FEBS LET 
+147 133-136 1982
+P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+  102102102102102102102102102102
+T  19 24 31 12  0 18  5  0102  0
+C  20 15 32 65  5 42 52  0  0  0
+A  50 27 27 19 86 36 34102  0  0
+G   6 29 12  6 11  6 11  0  0102
+VIRAL ONLY
+P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+   41 41 41 41 41 41 41 41 41 41
+T  14 12 16  4  2 13  9  0 41  0
+C   7  3 13 17  7  9 14  0  0  0
+A  15 10  6 10 27 15  9 41  0  0
+G   5 16  6 10  5  4  9  0  0 41
+.END LIT
+The Sargan et al paper puts forward the hypothesis that there is an 
+interaction between
+some mRNA leader sequences and a highly conserved structure in the 18S 
+rRNA
+of eukaryotic ribosomes. The attempt to substantiate the hypothesis 
+includes
+a table of base frequencies for sequences immediately 5' to start codons.
+They examined 102 sequences and I have used the base frequencies they 
+found
+as a weight matrix for searching for eukaryotic gene starts. I don't yet 
+know how good this method is. The viral sequences were found to be 
+slightly
+different but the separate table shown here is not used in the program.
+.left margin1
+@62. TX 8 @ Search for splice junctions
+.LEFT MARGIN2
+.para
+Used to search for mRNA splice junctions using a weight matrix. The 
+default weight matrix is still that derived from the paper of Mount (Nucl. 
+Acids Res. 10, 459-472). However users may employ their own tables.
+By default the positions of possible junctions will 
+be plotted rather than listed.
+ The diagram splits the donor plot into 3 horizontal boxes
+ so that all the 
+sites marked in any box are from the same reading frame. The acceptor 
+plot appears above the donor plot and is split in an equivalent way. So 
+sites marked as donors and acceptors in equivalent boxes are compatible. 
+i.e. donors from donor box 1 are compatible with acceptors from acceptor box 
+1, etc. Of course it is the combination of reading frame and splice sites 
+that really matters, and donors from box 1 can be compatible with acceptors 
+in box 3 if the reading frame switches.
+.para
+If dialogue is selected users can employ their own file of weights (see 
+below for the format), can change the cutoff scores, and can elect to have 
+the results listed rather than plotted. Listed results show the position 
+(of the last or first base in the exon), the frame and the matching sequence.
+The frequency table shown below is used as a default
+weight matrix and AG and GT are obligatory at the appropriate positions.
+The plots are scaled so that the top of scale is the highest value achieved 
+by 
+a junction sequence in the set used to compile the frequency table, and 
+the 
+bottom of the scale is the lowest value achieved by a junction sequence 
+in 
+the set used to compile the frequency table.
+.para
+In the light of current knowledge it would be sensible for users to use 
+the weight matrix search option (20)
+to create matrices that define  more specific splice junctions. If so it is 
+important that the positions "marked" are the last base in the donor exon and 
+the first base in the acceptor exon. To make a weight matrix suitable for 
+use with this function follow the instructions for option 20 and create 
+files for both donor and acceptor sites. Then concatenate the two matrix files 
+with the donor file first.
+Note that any positions in the weight matrix that are 
+100% conserved will be made obligatory (normally the AG and GT).
+.LEFT MARGIN1
+.lit
+
+ Mount donors redone 16-4-91                                 
+     12     3   -16.085    -7.500
+ P  -2  -1   0   1   2   3   4   5   6   7   8   9
+ N 136 136 136 136 136 136 136 136 136 136 136 136
+ T  28   8  15  17   0 136   9  16   7  84  30  36
+ C  41  60  16   7   0   0   3  13   3  17  28  39
+ A  40  56  89  12   0   0  83  91  12  23  53  33
+ G  27  12  16 100 136   0  41  16 114  12  25  28
+ Mount acceptors redone 16-4-91                              
+     18    15   -26.142   -14.400
+ P -14 -13 -12 -11 -10  -9  -8  -7  -6  -5  -4  -3  -2  -1   0   1   2   3
+ N 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113
+ T  58  50  57  59  67  56  58  49  47  66  64  31  34   0   0  11  41  31
+ C  21  28  34  25  29  33  35  32  42  40  33  25  74   0   0  23  28  41
+ A  17  11  11  18   7  17  12  23  15   3  10  29   5 113   0  24  21  21
+ G  17  24  11  11  10   7   8   9   9   4   6  28   0   0 113  55  23  20
+.END LIT
+
+.left margIN1
+@63. TX 7 @ Search using a weight matrix (complementary)
+.LEFT MARGIN2
+.para
+This function searches the complementary strand of the sequence  using 
+a weight matrix. Many 
+motifs can bind to either strand of the DNA and this function allows 
+users to 
+search the complementary strand without having to change the
+orientation of the sequence. See option 20 for more details.
+.left margin1
+@64. TX 3 @ Plot observed-expected word frequencies
+.LEFT MARGIN2
+.PARA
+This  option is designed to examine the abundances of short 
+words in a sequence to see if particular ones are either under or over
+represented. It compares the observed and expected frequencies and 
+plots them along the sequence. There has been some work on the relative 
+amounts of CG dinucleotides in eukaryotic sequences (eg Bird, Nature 
+321, 
+209-213 (1986)) and this new routine can be used to examine such 
+biases, or 
+any others that might be interesting.
+.para
+The user selects a word - say CG -, a window length, and a maximum and 
+mininum scale for plotting the results. The 
+program examines each sucessive window length along the sequence, 
+with each 
+window overlapping the previous one by windowlength-1. 
+The program counts the base frequencies in each window, and the number 
+of 
+occurrences of the chosen word within the window. Using the base 
+frequencies it calculates an expected number of occurrences for the 
+chosen 
+word (simply by multiplying the relevant frequencies). It plots 
+observed-expected, and hence will show regions that are rich or depleted 
+in 
+the chosen word. The longest allowed word is 9 characters, but the 
+calculation of the expected frequencies becomes less appropriate as the 
+word 
+length increases above 2.
+.para
+Typical dialogue follows.
+.lit
+ 
+? Menu or option number=D64
+Plot composition differences (obs-exp))
+Default String=CG
+? String=
+? odd span length (3-401) (101) =
+? plot interval (1-20) (5) =
+? Maximum plot value (-6.31-25.25) (6.31) =
+? Minimum plot value (-25.25-6.31) (-6.31) =
+ 
+ Missing graphics display here
+
+.end lit
+.left margIN1
+@65. TX 9 @ Search for polya sites
+.LEFT MARGIN2
+.para
+Simply searches for the sequence AATAAA
+ (Proudfoot and Brownlee Nature 263, 211-214,
+ 1982) and marks it with a short vertical line.
+.left margin1
+@66. TX 1 @ Interconvert t and u
+.LEFT MARGIN2
+.para
+This function interconverts T and U characters in the active sequence i.e 
+between DNA and RNA.
+.LEFT MARGIN1
+@67. TX 7 @ Search for patterns of motifs
+.left margin2
+.para
+This option searches for patterns of motifs. Patterns can be defined 
+interactively or read from files. Results can be displayed in several ways 
+in both graphical and textual form. Used to create pattern files for 
+searching libraries. The option is extremely flexible and consequently the 
+following documentation is quite lengthy. However the routine is capable 
+of searching for almost any known pattern. In addition the flexibility 
+does not necessitate difficulty of use, and the userinterface has been 
+simplified considerably since the methods were first published.
+.para
+Users should refer to the "typical dialogue" shown below for the most 
+helpful information on using the program.
+.para
+There are currently 
+four ways to display the matching patterns: 1=each individual
+motif and its position is listed; 2=all the sequence between, and 
+including the two 
+outermost motifs is listed; 3=graphical, with a vertical line marking the 
+position 
+of the leftmost motif; 4 = EMBL feature table format, where the KEYNAM 
+field if the motif name, the FROM and TO fields denote the ends of the 
+match, and the DESCRIPTION field is "Program".
+.para
+When it is defined for the first time a pattern must be entered 
+interactively at the keyboard, but the pattern description 
+can be saved to a file. 
+This file can be used for all subsequent searches.
+.para
+When defining a pattern interactively
+select a motif class and the program will request the required inputs. 
+.para
+The program gives each motif an identifying name and number.
+For motifs other than the first, a range of allowed positions must be 
+defined (Note that sets of motifs included using the OR operator will all 
+be given the same range, and so the program will only request range 
+values 
+for the first motif in any such set).
+To specify the allowed range for a motif the user must supply the 
+following: the 
+identifying number of the motif, relative to which the current motifs 
+positions are to be defined (termed the "reference motif"); a "relative start 
+position" and a range. The relative start position can be negative or positive. 
+A negative start position means that although the reference motif 
+is searched for first, the current motif can be found to its left.
+A zero relative start position means their left ends are superimposed. The 
+default start position is to butt-joint the motif to righthand end of the 
+"reference motif". The range is "the number of extra positions" that the 
+motif can take.
+.para
+The program will display the probability of finding each motif. These 
+values are presented in the following form: .1234E-5 means 0.1234 times 
+10 
+to the power -5.
+.para
+After the pattern has been defined, the program will type a description 
+of 
+it on the screen. It will then allow the user to give an overall cutoff 
+score and overall probability cutoff.
+.para
+Typical dialogue for all the different motif classes is displayed below.
+.lit
+
+? Menu or option number=67
+  Pattern searcher
+? (y/n) (y) Read pattern from keyboard 
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =
+? Motif name=Ematch
+? String=AA
+Probability of score     2.0000 = 0.595E-01
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =2
+? Motif name=AAA
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-1) (1) =
+? Relative start position (-1000-1000) (3) =
+? Number of extra positions (0-1000) (0) =
+? string=AAA
+? Minimum matches (1.00-3.00) (3.00) =2
+Probability of score     2.0000 = 0.149E+00
+  1 Exact match
+X 2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =3
+? Motif name=T'S
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-2) (2) =
+? Relative start position (-1000-1000) (4) =
+? Number of extra positions (0-1000) (0) =
+? String=TTT
+? Minimum score (0.00-108.00) (108.00) =72
+Probability of score    72.0000 = 0.258E+00
+  1 Exact match
+  2 Percentage match
+X 3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =4
+? Motif name=GCN4
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-3) (3) =
+? Relative start position (-1000-1000) (4) =
+? Number of extra positions (0-1000) (0) =
+? Weight matrix file name=GCN4
+ GCN4 FROM WEIGHTS 17-11-87                                                    
+Probability of score   -22.0020 = 0.139E-02
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+X 4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =5
+? Motif name=GCN4
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-4) (4) =
+? Relative start position (-1000-1000) (20) =
+? Number of extra positions (0-1000) (0) =
+? Weight matrix file name=GCN4
+ GCN4 FROM WEIGHTS 17-11-87                                                    
+Probability of score   -22.0020 = 0.606E-03
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+X 5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =6
+? Motif name=LOOP
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-5) (5) =
+? Relative start position (-1000-1000) (20) =
+? Number of extra positions (0-1000) (0) =
+? Stem length (1-60) (6) =
+? Minimum loop length (-6-60) (0) =
+? Maximum loop length (0-60) (0) =5
+? Minimum score (1.00-12.00) (12.00) =10
+Probability of score    10.0000 = 0.598E-02
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+X 6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =7
+? Motif name=Tstep
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-6) (6) =
+? (y/n) (y) Relative to 5 prime end 
+? Relative start position (-1000-1000) (1) =
+? Number of extra positions (0-1000) (0) =
+? String=TTT
+? Step (1-20) (3) =
+Probability of score     3.0000 = 0.367E-01
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+X 7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =8
+? Motif name=REPEAT
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-7) (7) =
+? Relative start position (-1000-1000) (4) =
+? Number of extra positions (0-1000) (0) =2
+? Repeat length (1-60) (6) =
+? Minimum gap (0-60) (0) =
+? Maximum gap (0-60) (0) =4
+? Minimum score (1.00-6.00) (6.00) =5
+Probability of score     5.0000 = 0.554E-02
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+X 8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =9
+? (y/n) (y) Save pattern in a file N
+
+Pattern description
+
+Motif  1 named Ematch   is of class    1
+Which is an exact match to the string
+AA
+Motif  2 named AAA      is of class    2
+which is a match of score     2. to the string
+AAA
+and the 5 prime base can take positions      3 to       3
+relative to the 5 prime end of motif   1
+It is anded with the previous motif.
+Motif  3 named T'S      is of class    3
+which is a match of score    72. to the string
+TTT
+and the 5 prime base can take positions      4 to       4
+relative to the 5 prime end of motif   2
+It is anded with the previous motif.
+Motif  4 named GCN4     is of class    4
+Which is a match to a weight matrix with score -22.002
+and the 5 prime base can take positions      4 to       4
+relative to the 5 prime end of motif   3
+It is anded with the previous motif.
+Motif  5 named GCN4     is of class    5
+Which is a match to the complement of a weight matrix with score -22.002
+and the 5 prime base can take positions     20 to      20
+relative to the 5 prime end of motif   4
+It is anded with the previous motif.
+Motif  6 named LOOP     is of class    6
+Which is a stem-loop structure with stem length    6 and score    10.
+The loop can have sizes      0 to      5
+and the 5 prime base can take positions     20 to      20
+relative to the 5 prime end of motif   5
+It is anded with the previous motif.
+Motif  7 named Tstep    is of class    7
+Which is an exact match to the string
+TTT
+with a step size of     3
+and the 5 prime base can take positions      1 to       1
+relative to the 5 prime end of motif   6
+It is anded with the previous motif.
+Motif  8 named REPEAT   is of class    8
+Which is a repeat with repeat length    6 and score     5.
+The loop-out can have sizes      0 to      4
+and the 5 prime base can take positions      4 to       6
+relative to the 5 prime end of motif   7
+It is anded with the previous motif.
+Probability of finding pattern = 0.2348E-14
+Expected number of matches  = 0.5100E-09
+? Maximum pattern probability (0.00-1.00) (1.00) =
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+ Select display mode
+X 1 Motif by motif
+  2 Inclusive
+  3 Graphical
+  4 EMBL feature table
+? 0,1,2,3,4 =4
+ Searching
+
+
+Total matches found      0
+
+Menus and their numbers are 
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structures and repeats
+m5 = Translation and codons
+m6 = Gene search by content
+m7 = Prokaryotic signal search
+m8 = Eukaryotic signal search
+ ? = Help
+ ! = Quit
+? Menu or option number=67
+  Pattern searcher
+? (y/n) (y) Read pattern from keyboard 
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =
+? Motif name=Arun
+? String=AAAAAA
+Probability of score     6.0000 = 0.210E-03
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Complement of weight matrix
+  6 Inverted repeat or stem-loop
+  7 Exact match, defined step
+  8 Direct repeat
+  9 Pattern complete
+? 0,1,2,3,4,5,6,7,8,9 =9
+? (y/n) (y) Save pattern in a file N
+
+Pattern description
+
+Motif  1 named Arun     is of class    1
+Which is an exact match to the string
+AAAAAA
+Probability of finding pattern = 0.2103E-03
+Expected number of matches  = 0.1522E+01
+? Maximum pattern probability (0.00-1.00) (1.00) =
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+ Select display mode
+X 1 Motif by motif
+  2 Inclusive
+  3 Graphical
+  4 EMBL feature table
+? 0,1,2,3,4 =4
+ Searching
+
+
+FT   Arun       1582   1587       Program
+FT   Arun       3160   3165       Program
+FT   Arun       4204   4209       Program
+FT   Arun       5691   5696       Program
+FT   Arun       6710   6715       Program
+Total matches found      5
+Minimum and maximum observed scores        6.00        6.00
+ 
+.end lit
+.para
+These methods allow users to define and search for
+complex patterns of motifs defined as single objects.
+The programs allow individual DNA motifs to be defined in eight 
+different
+ways, and protein motifs in six. Motifs are combined, using the logical 
+operators AND, OR and NOT, to describe a pattern. The pattern also 
+specifies the ranges of allowed relative separations of the individual 
+motifs. 
+.para
+First some definitions.
+.para
+A MOTIF is a contiguous subsequence of fixed length.
+At its simplest 
+it could be a single definite base or amino acid; a more complex motif 
+might be better represented as a consensus or a weight matrix; 
+two more-abstract types of 
+motif are direct and inverted repeats. 
+.para
+A PATTERN is a higher order of structure defined by a list of motifs. The 
+motifs in a pattern are combined using the logical operators AND, OR and 
+NOT. The list also defines the allowed relative separations of the 
+motifs. In the current versions of the programs up
+ to 50 motifs can be combined into a single pattern. So using these 
+definitions there are two 
+differences between motifs and patterns: 1) the distances between all 
+elements of a motif are fixed, but 
+the separations of parts of patterns can vary;
+ 2) all characters in a motif are defined 
+using the same method (class), but different parts of a pattern can be 
+defined in completely different ways.
+.para
+Each motif 
+can be represented in 9 ways (known as the motif class):
+.sk1
+.lit
+           MOTIF CLASSES
+CLASS           DESCRIPTION
+ 1       Exact match to a short defined sequence. The IUB symbols
+         can be used for DNA sequences.
+ 2       Percentage match to a defined short sequence. In nucleic acids, 
+         the IUB symbols can be used.
+ 3       Match to a defined sequence, using a score matrix and cutoff
+         score. The DNA matrix (see option 18) gives scores to IUB symbols 
+         depending on their level of redundancy. MDM78 is used for proteins.
+ 4       Match to a weight matrix with cutoff score.
+ 5       As class 4 but on the complementary strand.
+ 6       Inverted repeat or stem-loop. Fixed stem length, range of 
+         loop sizes, and cutoff score using A-T, G-C=2; G-T=1.
+ 7       Exact match to short sequence but with a defined step size.
+ 8       Direct repeat. Fixed repeat length, range of loop-out sizes,
+         cutoff score, and score matrix (for protein sequences MDM78 and
+         for nucleic acids an identity matrix).
+ 9       Membership of a set. A list of sets of allowed amino acids for 
+         each position in the motif. The sets are separated by commas(,).
+         For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino 
+         acids in which one of I,V or L must be found in the first position, 
+         then anything in the next two positions, D,E,K or R in the fourth 
+         position and F,Y,W,I,L,V or M in the fifth. This class only applies
+         to protein sequences because for nucleic acids "membership of a 
+set"
+         can be achieved using IUB symbols.
+
+    Classes 1 - 4, 8 and 9 apply to protein sequences, and classes 1-8 to 
+    nucleic acids.
+
+.end lit
+.para
+Class 1: exact match.
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+ may include IUB symbols. All symbols must match.
+.para
+Class 2: percentage match
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+may include IUB symbols. The minimum number of matching characters 
+must 
+also be specified.
+.para
+Class 3: match using a score matrix
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+may include IUB symbols. The motif is not compared directly with the 
+sequence to count the number of matching characters. Instead a matrix is 
+used to provide a score for all possible pairs of characters. The motif 
+score for 
+any position along the sequence is the sum of the scores found by 
+looking-up the scores for each pair of aligned characters. A match is 
+declared if some minimum score is achieved.
+.para
+Class 4: weight matrix
+.para
+The motif is defined by a table of values (called weights or scores). The 
+table gives a score for finding each possible character at each position 
+along the length of the motif. It therefore 
+has dimension motif-length x character-set-size, and allows us to give 
+different scores for each character at each position. It is equivalent to 
+having a different score matrix for each position along the motif, and 
+provides the most flexible and specific method of defining motifs. The 
+weight matrices are created by program NIP option 20 and 
+stored as files. The file contains the values
+for each position, as well as an overall minimum score. 
+There are two ways in which these values can be used to calculate an 
+overall 
+score for any section of the sequence. The simplest way is to add the 
+values in the file. (This means that the highest possible score
+can be calculated by adding the top value at each column 
+position, and the lowest 
+by adding the bottom value.)
+ The normal way of using the values in the file is as 
+follows. 
+First the programs divide the values in each column by the column total 
+so 
+that they sum to 1.0
+Then the natural 
+logs of these values are used as scores. When the matrix is applied to a 
+sequence these logarithmic values are summed (which is of course 
+equivalent 
+to multiplying the frequencies).
+Note that using the natural logs of the frequencies as 
+weights and 
+adding them means that the overall cutoff score must be less than zero, 
+whereas if the original
+values in the weight matrix file are added, the cutoff score will be 
+greater than zero. The search routines therefore decide whether the user 
+wants to add values or multiply frequencies
+by examining the value of the cutoff score: it will add if the cutoff 
+is 
+greater than zero and add logs of frequencies if it is less than zero.
+ Hence we effectively get two 
+motif classes in one. The program NIP, when creating weight matrix 
+files, will ask the user whether the scores should be added or multiplied. 
+ If the values in the table have been defined 
+without using a set of aligned sequences
+it is easier for the user to 
+choose a cutoff score if the values are added.
+.para
+Class 5: complement of weight matrix
+.para
+The motif is defined by a weight matrix, but the program searches for its 
+complement.
+.para
+Class 6: inverted repeat, or stem-loop
+.para
+The motif is defined by a repeat length, a minimum score
+ and a range of loop sizes. The scores are A-T=2, G-C=2, G-T=1, else=0.
+The loop sizes are defined by a minimum 
+and maximum distance from the 3' end of the stem.
+For a stem-loop these will be positive numbers. For example to 
+define a stem of length 8 and loop sizes varying from 3 to 5, the stem 
+would be set to 8, the minimum start distance to 3 and the maximum 
+to 5. To define an 
+inverted repeat the minimum distance will be negative. For example stem 
+length=9,
+minimum distance=-9, and maximum distance=-8 will find 
+inverted repeats of lengths 9 and 10. 
+E.g. AAAAATTTT and AAAAATTTTT would be found, the first having a base 
+at 
+its centre, the second having none.
+.para
+Class 7: exact match, defined step size.
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+ may include IUB symbols. All symbols must match. The class differs 
+from 
+class 1 in that searches will move in steps of some given size. For 
+example 
+we could search for a certain codon and use a step size of 3 and hence
+ keep in a 
+single reading frame.
+.para
+Class 8: direct repeat
+.para
+The motif is defined by a repeat length, a minimum score
+ and a range of loop sizes. The scores are defined using MDM78 for protein 
+sequences and an identity matrix for nucleic acids.
+The loop sizes are defined by a minimum 
+and maximum distance from the 3' end of the stem.
+.para
+Class 9: membership of a set
+.para
+This motif class is for protein sequences. It is defined by lists of 
+allowed amino acids for each position in the motif, and a cut-off score.
+Positions at which any amino acid can occur are left blank.
+All allowed amino acids for each position give a score of 1.
+The motifs can be defined in two ways: either typed at the keyboard or 
+read 
+in as a weight-matrix-like file.
+When the motif is defined at the keyboard the sets of allowed amino 
+acids
+are separated by commas(,).
+         For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino 
+         acids in which one of I,V or L must be found in the first position, 
+         then anything in the next two positions, D,E,K or R in the fourth 
+         position and F,Y,W,I,L,V or M in the fifth.  To specify that the 
+whole motif must match a score of 3 would be required (i.e. one of the 
+allowed amino acids must be found for each of the three defined 
+positions).
+If the motif is read from a file the file must have been written by 
+program 
+NIP, or have been saved by the pattern searching routines. If the 
+user 
+elects to save a pattern, and it includes class 9 motifs typed at the 
+keyboard, then the program will save the class 9 motifs as weight matrix 
+files. Therefore it will request file names for each motif of this class. 
+If the motif given above as an example were saved the weight matrix file
+would have 5 columns.
+The first column 
+would contain zeroes except for the I, V and L rows 
+which would be set to 1; the next two columns would all be zero; the next 
+would be zero except for the D,E,K and R rows which would be 1; the final 
+column would contain 1's in rows F,Y,W,I,L,V and M, with 
+the rest zero.
+.para
+
+The logical operator (AND, OR or NOT) used to add each motif to the 
+pattern
+is specified by preceding 
+the class number by the letters A, O or N. A = AND, O = OR, N = NOT.
+The default is A, so N2 means include, using the NOT operator, a class 2 
+motif; O2 means include, using the OR operator, a class 2 motif; both A2 
+and 
+2 mean include, using the AND operator, a class 2 motif.
+
+.para
+Range setting.
+.para
+The motifs in a pattern are numbered according to their order in the list. 
+Apart from the first motif in a pattern all motifs are given a range 
+of allowed positions relative to a motif further up the list. 
+For example
+suppose we have a pattern defined by A AND B AND C AND D.
+Motif A can occur anywhere, but B must have its range of allowed 
+positions defined relative to the position of motif A, and C's positions 
+can be defined relative to either A or B, depending on which is most 
+convenient, and likewise D's positions can be relative to A or B or C.
+.para
+Notice that the positions of motifs can be defined relative to more than 
+one motif. Suppose we have a pattern consisting of 
+motifs A, B and C, and that B occurs 5-10 residues right of A, C occurs 5-
+10 
+residues right of B, and also C is never more than 15 residues from A. 
+Then 
+it is quite consistent with the methods to include motif C into the 
+pattern 
+twice using the AND operator: once relative to A and once relative to B. 
+This will define the relative spacing and the ORDER of the motifs in the 
+pattern. (If we simply defined the position of C relative to A it could be 
+found to the left of B).
+.para
+Motifs combined together using the OR operator are all given the same 
+range. For example suppose we had a pattern A AND (B OR C) AND (D OR E),
+ then B and C each have the same range, and D and E also have 
+the same range as one another. The range for D and E can be relative to 
+A or to B.
+.para
+Motifs cannot have their ranges defined relative to motifs that are 
+included using the NOT operator. For example if we had the pattern A NOT 
+B 
+AND C, then the range for C can only be defined relative to motif A.
+.para
+Speed can be gained by arranging the order 
+of the motifs so that those higher up the list are of types that can be 
+searched for rapidly and that are also unlikely to be found.
+.para
+Motifs combined by the OR operator are alternatives: if any one of a set 
+of motifs 
+combined by the OR operator is found, then a match is declared. All
+alternatives will be reported. For example if we had a pattern defined by 
+A 
+AND (B OR C), then all places where A occurs and B is found within range, 
+and all places where A is found and C is found within range will be 
+reported. A typical use would be where we might allow a motif to appear 
+on 
+either strand of the DNA sequence. For example a weight matrix 
+representing 
+the heatshock element could be used in a pattern which included 
+heatshock 
+as a motif class 4 combined using the OR operator 
+with heatshock as a motif class 5.
+.para
+The probability calculations are performed for each motif as it is 
+defined. 
+If an overall probability cut-off is given the calculation is repeated for 
+each match found. To achieve maximum searching speed do not give an 
+overall 
+probability cut-off. Overall cut-off scores should only be used if the 
+motif 
+classes used are compatible.
+.para
+There are currently 
+several ways to display the matches: 1 = each 
+motif and its position is listed; 2 = all the sequence between the two 
+outermost motifs is listed; 3 = graphical, with a spike marking the 
+position 
+of the leftmost motif. The library versions also give entry names, and a 
+one 
+line title; in addition they can be used to produce aligned families of 
+sequences. When this mode of output is selected the program will write a 
+separate file for each match. The files will be called ENTRYNAME.DAT 
+where 
+ENTRYNAME is the name of the entry in the library. The matching 
+sequence 
+will be written out so that the spacing between motifs is constant, and 
+set to the maximum allowed by the pattern definition. Any gaps will be 
+filled with dashes (-). If the individual sequences were subsequently 
+written one above the other
+they should line up so that all motifs are in register. There two types of 
+output of this sort: one, option 4, writes out whole sequences, the other, 
+option 5, writes out only the sequences between the two outermost 
+motifs.
+If the individual sequences were subsequently 
+written one above the other
+they should line up so that all motifs are in register. There two types of 
+output of this sort: one, option 4, writes out whole sequences, the other, 
+option 5, writes out only the sequences between the two outermost 
+motifs.
+Note that for option 4 users are asked to type the position of the 
+first motif, and the reason for
+this is explained below. 
+Consider a pattern found in several sequences. Consider only
+the first motif in 
+the pattern and suppose that it was found in different positions in these 
+sequences. 
+Say that of these positions the one furthest from the left end was 
+position 100. Then, in order to ensure that all the sequences would align, 
+we must specify that motif 1 must start at position 100. 
+Any sequences in which motif 1 started 
+nearer to the left end than position 100 would be padded accordingly.
+These modes of output 
+should only be used when the position of each motif is defined relative to 
+its 
+immediate neighbour.
+.para
+The pattern descriptions can be saved to files. These files 
+can be used instead of typing definitions again at the keyboard. As the 
+files are annotated,
+they can easily 
+be changed using system editors, and the modified versions used to 
+define the variant patterns for the programs.
+.para
+Use of lists of entry names 
+.para
+The two programs that operate on libraries have the ability to 
+restrict their searches to subsets of the libraries. This does not require 
+sublibraries to be created but instead is achieved by using files 
+containing a list of the entry names of sequences. The user may choose to 
+search only those entries on the list or, alternatively to search all but 
+those on the list (i.e. in the latter case
+the list contains the names of those to be excluded).
+ The programs can search libraries that have indexes and those that 
+do not.
+ If a list of names for inclusion is used,
+then the search will be faster if the index is present. In all other 
+circumstances the whole library will be read. 
+The list must be in library order except when it is used
+to include entries, and an index is available.
+The list must contain each entry name on a separate line, with the name 
+starting in column 1 of the line. ie there must be no spaces at the start 
+of the line.
+The list of entry names
+can be produced by the keyword searches of nip, pip, etc as long 
+as the listings produced have a space character separating the entry name 
+from the entry description. This will depend on how well the library 
+reformatting programs work. For example swissprot entry names tend to run 
+into the beginning of the descriptions, but other libraries are generally 
+OK.
+.para
+One use of the programs is to look for patterns that we already know 
+about, but in new sequences. However it is hoped that they will also be 
+useful for finding new motifs. For example
+several known control regions in 
+nucleic acid 
+sequences consist of particular direct or inverted repeats;
+the inclusion of
+direct and inverted repeats as motif classes
+makes it possible to 
+find previously unknown
+motifs of these types. 
+Using these new programs we can 
+ask questions like: "are there any inverted or direct repeats near to 
+sections of sequence that contain both a
+CCAAT box and a TATA box?"; and to search for such things throughout 
+the 
+libraries. In addition, the mode of output in which all the sequence 
+between 
+the two outermost motifs found is printed out, allows us to extract 
+sequences and examine them in more detail for further common 
+subsequences. 
+For example we might want to collect together all the sequences 
+between 
+putative CCAAT and TATA boxes.
+.para
+A further use of the inverted repeat motif class is the following. If a 
+regulatory sequence in DNA is poorly defined but also an inverted repeat, 
+then it might be an advantage to specify it both as a consensus sequence 
+and 
+a superimposed inverted repeat. In this way two weak definitions can be 
+combined to produce a stronger pattern.
+.para
+Given only a few examples of a motif it 
+should be possible to perform initial searches using a 
+class 3 motif, and then, using plausible matching sequences, create a 
+more 
+specific weight matrix for the same motif.
+.para
+If motifs are combined with the first motif using the OR operator
+they will be ignored until all 
+permutations that include the first motif have been looked for. 
+The whole search will then be repeated, in 
+turn, for each of 
+those motifs that are combined with the first motif using the OR 
+operator.
+An interesting consequence of this is that the program can be used, 
+without 
+change, to compare any newly determined sequence with all known 
+individual 
+motifs. We achieve this by having a pattern in which all known relevant 
+motifs are combined using the OR operator.
+If we ask to use this pattern with 
+a sequence, the program will automatically compare each individual 
+motif in 
+the pattern with the whole length of the 
+sequence. As the number of known 
+motifs grows this should become an increasingly useful standard 
+procedure.
+.para
+The NOT operator is obviously 
+useful for making sure particular motifs are not present, but it can also 
+be used to bracket the levels of matches found. We may want a degree of 
+match that lies between two limits - binding should occur, but not too 
+strongly; or base-pairs should form, but not too many. We can specify 
+this 
+by asking for a match with a low score, in combination with a match and 
+a 
+high score, both for the same motif, but with the high score included 
+using 
+the NOT operator.
+.para
+The algorithm is designed to find all sections of a sequence that satisfy 
+the pattern rather than only the best match. 
+Particularly if some of the motifs in a pattern are less well defined than 
+others, this can often result in the same region of a sequence being 
+reported as having several matches, but which only vary in the 
+positions of the weakest motifs.
+.para
+General remarks on motif searching
+.para
+Generally motifs are short subsequences that are thought to be 
+associated with 
+particular functions in some known sequences. Often 
+we search for them to try to 
+understand or interpret other sequences. Sometimes we search for 
+motifs and
+patterns to 
+test a hypothesis about their role: are they found in the expected 
+positions in the expected sequences. In doing so we should remember 
+that, in both proteins and nucleic acids,
+ what we are really looking for is a particular 
+three dimensional structure with certain affinities for other structures, 
+and that we are assuming that the sequence of the motif alone
+defines the 3D structure we searching for. 
+ The overall structure 
+may be completely different to those in which the motif is functional, 
+and 
+hence the motif may have a different shape or be inaccessible. 
+We should be aware of the 
+importance of the context in which a motif is found. Where does it lie 
+relative to the overall structure, is it accessible, is the three 
+dimensional spacing between 
+it and other motifs correct? For example, is it on the same side of the 
+double helix, and the correct distance from some other motif? How does 
+context affect our assessment of the significance of finding a motif? 
+Finding false mammalian mRNA splice junctions in non-coding sequences 
+is 
+far less important than finding false sites in pre-mRNA sequences, but 
+finding them in the correct places is most important! In other words, it 
+is 
+often the case that when we are searching for a motif that is known to 
+be  
+necessary for some function, then a positive result in the form of a 
+match 
+in the required position, is more important than a high background of 
+matches in the wrong positions. Being 
+ able to write 
+down the probability of finding a motif in a random sequence tells us how 
+well it is defined. 
+In nucleic 
+acids the DNA may contain many superimposed types of information such 
+as 
+those concerned with histone phasing, protein coding or mRNA secondary 
+structure. These overlapping "codes" may interfere with one another 
+causing 
+matches to motifs to be poorer than expected.
+In general we will only have a limited number of examples of the 
+motif and we do not know how representative they are.
+.para
+Sequences have superimposed functions: some parts may be of general 
+structural 
+importance and give rise to an overall framework, and other parts give 
+specificity and hence are not common; we may want to use a set of 
+aligned 
+sequences to define a motif, but want to use only the framework 
+positions.
+ Alternatively we may want to pick out 
+only those parts of a set of aligned sequences that give a particular 
+property, and to ignore other similarities that are due to some other 
+property
+and which could obscure the pattern 
+we are interested in.
+It is possible to apply a mask to a set of aligned sequences in 
+order to give weight to selected positions only.
+ The ability to define a mask allows certain positions 
+to be used in the motif and others to be ignored, and yet still permits the 
+use of a set of aligned sequences to calculate weights. The mask is 
+requested and applied 
+by the program and results in the masked positions being zero 
+in 
+the weight matrix. The mask is defined in the following way. 
+Suppose we had a motif of length 15, then the mask 
+x--x--xx-x will give zero weights to positions 2,3,5,6 and 9 (note it is 
+the dashes (-) that are significant and that positions 
+1,4,7,8,10,11,12,13,14 and 15 
+will be non-zero). Of course 
+the same set of sequences could be used with several alternative masks 
+in 
+order to extract different features and create corresponding weight 
+matrices.
+.para
+The programs are described in Staden,R. 
+CABIOS 4, 53-60, 1988; Staden,R.
+ CABIOS 5, 89-96, 1989, and Methods in Enzymology 183, 193-211 (1990).
+.left margin1
+@ end of help
diff --git a/help/NIPF.RNO b/help/NIPF.RNO
new file mode 100644
index 0000000..ccc4bad
--- /dev/null
+++ b/help/NIPF.RNO
@@ -0,0 +1,88 @@
+.NPA
+.SP 1
+.left margin1
+@-1. TX   0 @General
+.sp
+@-2. TX   0 @Screen control
+.sp
+@-3. TX   0 @Statistical analysis
+.sp
+@-1. TX   0 @General
+.sp
+@-2. TX   0 @Screen control
+.sp
+@-3. TX   0 @Statistical analysis
+.sp
+@0.  TX  -1 @NIPF
+.sp
+@1.  TX 1 @ Help
+.sp
+@2.  TX 1 @ Quit
+.sp
+@3.  TX 1 @ Read new sequence
+.sp
+@4.  TX 1 @ Redefine active region
+.sp
+@5.  TX 1 @ List the sequence
+.sp
+@6.  TX 1 @ List a text file
+.sp
+@7.  TX 1 @ Direct output to disk
+.sp
+@8.  TX 1 @ Write active sequence to disk
+.sp
+@9.  TX 1 @ List a translation
+.sp
+@32. TX 1 @ List showing base differences
+.sp
+@37. TX 1 @ List showing translation
+.sp
+@33. TX 1 @ List showing amino acid differences
+.sp
+@10. TX 2 @ Clear graphics
+.sp
+@11. TX 2 @ Clear text
+.sp
+@12. TX 2 @ Draw a ruler
+.sp
+@13. TX 2 @ Use cross hair
+.sp
+@14. TX 2 @ Reset margins
+.sp
+@15. TX 2 @ Label diagram
+.sp
+@16. TX 2 @ Display a map
+.sp
+@17. TX 3 @ Set comparison mode
+.sp
+@18. TX 3 @ Set sort mode
+.sp
+@21. TX 3 @ Count base changes
+.sp
+@22. TX 3 @ Count codon changes
+.sp
+@23. TX 3 @ Count genetic events
+.sp
+@24. TX 3 @ Show table of base changes
+.sp
+@36. TX 3 @ Show table of expressed base changes
+.sp
+@39. TX 3 @ Show table of silent base changes
+.sp
+@38. TX 3 @ Estimate mutation rate
+.sp
+@25. TX 3 @ Plot base changes
+.sp
+@26. TX 3 @ Plot expressed changes per base
+.sp
+@27. TX 3 @ Plot silent changes per base
+.sp
+@28. TX 3 @ Count expressed changes per base
+.sp
+@29. TX 3 @ Count silent changes per base
+.sp
+@30. TX 3 @ Count changed amino acids
+.sp
+@31. TX 3 @ Plot amino acid variability
+.sp
+@ end of help
diff --git a/help/PIP.RNO b/help/PIP.RNO
new file mode 100644
index 0000000..24e42a2
--- /dev/null
+++ b/help/PIP.RNO
@@ -0,0 +1,2469 @@
+.NPA
+.SP 1
+.left margin1
+@-1. TX 0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. T   0 @Statistical analysis of content
+.sp
+@-3. X   0 @Statistics
+.sp
+@-4. T   0 @Structures and repeats
+.sp
+@-4. X   0 @Structures
+.sp
+@-5. TX  0 @Search
+.sp
+@0.  TX -1 @PIP
+.para
+This is a program  for analysing individual protein sequences. It can read 
+sequences stored in many of the most commonly used formats, and 
+performs all of the usual simple analyses. In addition it has very flexible 
+search procedures  and   presents many of its  results graphically. 
+.PARA
+The following analyses (preceded by their option numbers) are included:
+.lit
+ ? = Help
+ ! = Quit
+ 3 = read a new sequence
+ 4 = define active region
+ 5 = list the sequence
+ 6 = list a text file
+ 7 = direct output to disk
+ 8 = write active sequence to disk
+ 9 = edit the sequence
+10 = clear graphics screen
+11 = clear text screen
+12 = draw a ruler
+13 = use cross hair
+14 = reposition plots
+15 = label diagram
+16 = display a map
+17 = search for short sequences
+18 = compare a sequence
+19 = compare a sequence using a score matrix
+20 = search for a sequence using a weight matrix
+21 = calculate amino acid composition
+22 = plot hydrophobicity
+23 = plot charge
+24 = plot Robson prediction
+25 = plot hydrophobic moment
+26 = draw helix wheel
+27 = back translate
+28 = search for patterns of motifs
+.end lit
+.para
+Some of these methods produce graphical 
+ results 
+and so the 
+program is generally used from a graphics terminal (a vdu on which lines 
+and points can be drawn as well as characters). 
+.para
+For users of VT640's or their equivalents the 
+terminal must be set nowrap (type NOWRAP) prior to running the program. 
+.LEFT MARGIN2
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "ANALPMRG" when the 
+program is started. Users can have their own file if required.
+.para
+The program can handle sequences stored in several formats: 
+Staden, EMBL, GENBANK, PIR (also known as NBRF) and GCG and they are described 
+in 
+the help for 'READ NEW SEQUENCE'.
+.para
+The options for the program are accessed from 5 main menus: general,
+screen control, statistical analysis of content, structure, search.
+Both menus and options are selected by number.
+.LEFT MARGIN1
+@1. TX 0 @Help
+.LEFT MARGIN2
+.para
+This option gives online help. The user should select option numbers and
+the current documentation will be given. Note that option 0 gives an
+introduction to the program, and that ? will get help from anywhere in 
+the 
+program.
+The following analyses (preceded by their option numbers) are included:
+.sp
+.left margin1
+@2. TX 0 @Quit
+.left margin2
+.para
+This function stops the program.
+.left margin1
+@3. TX 1 @Read a new sequence
+.LEFT MARGIN2
+.para
+This option allows users to read in new sequences, browse through annotations,
+ or search sequence 
+libraries for keywords. Sequences can be read from "personal" 
+sequence files or from sequence libraries. These are referred to as the 
+sequence "source". Personal files can be stored in several formats:
+Staden, PIR, EMBL, GENBANK and GCG.
+At LMB we use "Staden" format for sequencing and all 
+the 
+libraries are stored in their original formats. Note, however, that libraries
+such as EMBL or GenBank that are divided into several files (eg GenBank has
+13 separate files) are indexed as a whole. This means that users do not need
+to know which file contains an entry, only which library.
+When the user selects to read in a sequence the program first asks for the 
+sequence "source". 
+.para
+If the user selects "personal" the program will ask for 
+the format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of 
+the file. For PIR format the user will also be required to know the entry 
+name of the sequence as the file can contain several. For the other formats
+only a single entry is expected. The file will be read, its length and
+composition will be displayed and the option left.
+.para
+If the user selects "library" as the sequence source the program will display a
+list of available libraries. The programs are capable of handling all current
+libraries but which ones are available will vary from site to site. At LMB we
+have several libraries and also weekly updates of data gathered between releases.
+The program will ask users to select a library and then give a list of options:
+.lit
+
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entrynames from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+
+.end lit
+If get a sequence or get annotations is selected users will be asked to 
+type the entry name. The option will be left when a sequence is selected or 
+! is typed. The composition and length will be displayed.
+.para
+The text index contains all words from feature tables, reference titles,
+definition lines, keywords lists and comments, so the text index search
+is most useful. It is also the fastest. Up to 5 words can be searched for
+at once. The words should be typed separated by spaces, for example
+.lit
+ ? Keywords=P53 mouse murine tumo
+
+.end lit
+will search for all entries that contain words starting with p53, mouse,
+murine and tumo. Only the unique entries that contain ALL words will be 
+listed. Before listing the matching entries
+the program will show the number of 'hits' for each word and ring the bell.
+Escape is possible at this point, or after each screenfull of entries.
+In addition to the entry names the text search displays the primary accession 
+number, the sequence length and up to 80 characters of description.
+(The search of 'titles' is now redundant because the full text index
+contains all the title words and the search is much faster. It will probably
+be removed from the program.)
+All searches are independent of case. Where
+possible the program will offer default entry names.
+.para
+Typical dialogue follows.
+.lit
+Select sequence source
+X  1 Personal file
+   2 Sequence library
+? Selection  (1-2) (1) =
+Select sequence file format
+X  1 Staden
+   2 EMBL
+   3 GenBank
+   4 PIR
+   5 GCG
+? Selection  (1-5) (1) =
+? Sequence file name=M13MP7.SEQ
+ Contig title removed
+Sequence length=  7238
+ Sequence composition
+          T          C          A          G          -
+      2405.      1539.      1765.      1527.         2.
+        33.2%      21.3%      24.4%      21.1%       0.0%
+  .
+  .
+  .
+
+
+ Select sequence source
+ X  1 Personal file
+    2 Sequence library
+ ? Selection  (1-2) (1) =2
+ Select a library
+ X  1 EMBL 29 nucleotide library Dec 91
+    2 SWISSPROT 20 protein library Nov 91
+    3 PIR 31 protein library Dec 91
+    4 NRL3D 58 From Brookhaven protein library Dec 91
+    5 GenBank
+ ? Selection  (1-5) (1) =
+Library is in EMBL format with indexes
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =5
+ Search for keywords
+ ? Keywords=P53 mouse
+P53 hits  68
+MOUSE hits  8180
+
+ MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+ MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+ MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+ MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+ MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+ MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+ MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+ MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+ MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+ MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+ MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+ MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+ MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+ MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+ MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+ MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+ MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+ MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+ MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+ MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+ MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+ MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+      22 different entries found
+
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =4
+ Search for keywords
+ ? Keywords=alpha
+ Searching for alpha
+ AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+ AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+ AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+ AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+ ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+ ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+ ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+ ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+ AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+ AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+ AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+ AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+ AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+ AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+ AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+ AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =3
+ ? Accession number=v00636
+Entry name LAMBDA
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =2
+ Default Entry name=LAMBDA
+ ? Entry name=
+ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+XX
+AC   V00636; J02459; M17233; X00906;
+XX
+DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+DT   09-JUN-1982 (Rel. 1, Created)
+XX
+DE   Genome of the bacteriophage lambda (Styloviridae).
+XX
+KW   circular; coat protein; DNA binding protein; genome;
+KW   origin of replication.
+XX
+OS   Bacteriophage lambda
+OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+XX
+RN   [1]
+RP   1-48502
+RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+RT   "Nucleotide sequence of bacteriophage lambda DNA";
+RL   J. Mol. Biol. 162:729-773(1982).
+XX
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =
+ Default Entry name=LAMBDA
+ ? Entry name=
+DE   Genome of the bacteriophage lambda (Styloviridae).
+ Sequence length  48502
+ Sequence composition
+           T          C          A          G          -
+      11988.     11360.     12336.     12818.         0.
+         24.7%      23.4%      25.4%      26.4%       0.0%
+
+.end lit
+.left margin1
+@4. TX 1 @Redefine active region
+.LEFT MARGIN2
+.para
+For its analytic functions 
+the program always works on a region of the sequence called the active 
+region. When a new sequence is read into the program the active region is 
+automatically set to start at the beginning of the sequence and go
+up to the 
+maximum allowed size of active region the version of the program can 
+handle. The positions are shown on the screen.
+On most machines this will be to the end of the sequence.
+This option allows the user define a different region. Note that for 
+convenience in the 
+listing and translation functions the user is given access to regions 
+outside the active region.
+.left margin1
+@5. TX 1 @List a sequence
+.LEFT MARGIN2
+.para
+The sequence can be listed with line lengths from 
+10 to 120 in multiples of 10. Output can be directed to a disk file by 
+first selecting disk output. The output looks like:
+.lit
+
+          10         20         30         40         50         60
+  MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY
+
+          70         80         90        100        110        120
+  AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
+
+         130        140        150        160        170        180
+  LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
+
+         190        200        210        220        230        240
+  DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLARM
+
+         250        260        270        280        290        300
+  PVALMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
+
+         310        320        330        340        350        360
+  AARVNAEYVE AFTKGEVKGK TGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
+
+         370        380        390        400        410        420
+  NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
+
+         430        440        450        460        470        480
+  LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
+
+         490        500        510        520        530        540
+  PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW*
+
+.end lit
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.para
+Allows the user to have a text file displayed on the screen. It will appear 
+one page at a time.
+.left margin1
+@7. TX 1 @Direct output to disk
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@8. TX 1 @Write active region to disk
+.LEFT MARGIN2
+.para
+The program has the capability of reading in EMBL, GENBANK, NBRF, GCG 
+and Staden formats 
+and of reversing and complementing sequences. This option allows users 
+to 
+write the current active sequence to a disk file in Staden format. Hence 
+it 
+allows format conversion and crude sequence cutting.
+.left margin1
+@9. TX 1 @Edit the sequence
+.LEFT MARGIN2
+.para
+Used to edit sequences or any other files by giving access to the 
+computers system editor. For editing sequences the input file should  
+have already been created using the listing function "list 
+sequence".
+.para
+Supply the name of the file to edit. Wait while the system editor is made 
+ready (can take awhile on a vax). Use the editor. Exit from the editor. If a 
+sequence has been edited, and you want to process it, affirm that the 
+sequence should be "made active". The edited sequence will replace the 
+original sequence. 
+.para
+This editing method is designed to give users access to an editor with 
+which they are familiar - i.e. the one on their machine, and yet to allow 
+them to edit a sequence which contains the landmarks they need in 
+order to know where they are. Users can create files containing simple 
+listings with numbering, using "list the sequence", and 
+then edit them with their system editor, using the numbering to know 
+where they are within the sequence. When the edits are complete they 
+exit from the editor and the program "analyses" the edited file to extract 
+only the sequence characters. Define the permitted set of characters to be:
+ACDEFGHIKLMNPQRSTVWXYZ-acdefghiklmnpqrstvwxyz. All permitted characters 
+found in the file will become part of the sequence, all others removed.
+.left margin1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears the screen of both text and graphics.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN2
+.para
+ Clears only text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first amino acid to be marked (for example if the 
+active 
+region is 1501 to 8000, the user might wish to mark every 1000th amino 
+acid
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@13. TX 2 @Use cross hair
+.LEFT MARGIN2
+.para
+This function puts
+a steerable cross on the screen that can be used to find the 
+coordinates of points in the sequence. The user can move the cross 
+around using the directional keys; when he hits the space bar the 
+program will print out the coordinates of the cross in sequence units and 
+the option will be exited.
+.para
+If instead, 
+you hit a , the position will be displayed but the cross will remain on 
+the screen.
+.para
+If a letter s is hit the sequence around the cross hair is displayed and 
+the cross remains on the screen.
+.left margin1
+@14. TX 2 @Reset margins
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "ANALMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+The cross-hair can be used to choose suitable heights.
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+ the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+It is often convenient to plot a map alongside graphed analysis in order 
+to 
+indicate features within the sequence. This function allows users to 
+draw 
+maps using files arranged in the form of EMBL feature tables. Of course 
+the 
+EMBL table are usually only used for nucleic acid sequence annotation 
+but, 
+as long as the features are written in the correct format, they can be 
+employed by this routine. The map is composed of a line representing the 
+sequence and then further lines denoting the endpoints of each feature 
+the 
+user identifies. The user is asked to define height at which the line 
+representing the sequence should be drawn; then for the feature height; 
+then for the features to plot.
+.left margin1
+@17. TX 1 5 @Short sequence search
+.LEFT MARGIN2
+.para
+This routine is used to search for exact matches to short sequences. It is 
+equivalent to the restriction enzyme search in program NIP. It and can 
+either list matches 
+or present the results graphically. 
+.PARA
+Select from searching, screen clearing or file listing. Choose a file of 
+strings and the mode of output required.
+.para
+The files of short 
+sequences (strings) and their names
+need to be arranged in a particular way. For example
+.lit
+ACID/D/E//
+BASIC/R/K/H//
+HYDRO/F/L/I/V/Y//
+GLYCO/N-S/N-T//
++/R/K/H//
+-/D/E//
+.end lit
+defines various groups of amino acids.
+Each string or set of strings must be 
+preceded by a name, each string must be preceded and 
+terminated with a slash (/), and 
+each set of strings by 2 slashes. These collections of strings and their 
+names can be read from disk or entered from the keyboard. Two files
+containing sequences are currently 
+available. One contains named groups of amino acids. The other simply 
+contains the names of all amino acids and gives a convenient way of 
+producing a plot of the positions of all the different
+amino acids in the sequence.
+The user can select strings 
+by name from these collections. Results can be displayed  name by name 
+or all 
+together. 
+Strings entered from the keyboard need to be separated by slash 
+characters(/).
+For the name by name search the output looks like:
+.lit
+  MATCHES=    12
+ NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+ ACID                  E                          7       7       1
+ ACID                  E                         10       3       1
+ ACID                  E                         24      14       1
+ ACID                  E                         28       4       1
+ ACID                  D                         36       8       1
+ ACID                  D                         46      10       2
+ ACID                  E                         51       5       2
+ ACID                  E                         67      16       2
+ ACID                  D                         69       2       2
+ ACID                  D                         81      12       2
+ ACID                  E                         84       3       2
+ ACID                  E                         96      12       3
+  MATCHES=    10
+ NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+ BASIC                 K                         13      13       1
+ BASIC                 R                         15       2       1
+ BASIC                 H                         26      11       1
+ BASIC                 R                         40      14       1
+ BASIC                 H                         42       2       2
+ BASIC                 R                         59      17       2
+ BASIC                 R                         68       9       2
+ BASIC                 K                         87      19       2
+ BASIC                 K                         89       2       2
+ BASIC                 R                         93       4       2
+  MATCHES=     1
+ NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+ GLYCO                 NST                        4       4       3
+
+ or when the results are ordered only on position the output looks like:
+
+ NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+ GLYCO                 NST                        4       3
+ ACID                  E                          7       3
+ ACID                  E                         10       3
+ BASIC                 K                         13       3
+ BASIC                 R                         15       2
+ ACID                  E                         24       9
+ BASIC                 H                         26       2
+ ACID                  E                         28       2
+ ACID                  D                         36       8
+ BASIC                 R                         40       4
+ BASIC                 H                         42       2
+ ACID                  D                         46       4
+ ACID                  E                         51       5
+ BASIC                 R                         59       8
+.end lit
+.LEFT MARGIN2
+Graphical output marks the position of each string by a 
+short vertical line and gives its name at the left end of the 
+line. If the top of the screen is reached the program gives the user the 
+oportunity to take a hard copy and then will clear the screen and restart
+plotting results at the original start position.
+Note that any character in the string
+that is not a recognisable protein symbol will be treated as a 
+wild card character will match with all 
+characters in the searched sequence.
+.para
+.lit
+Typical dialogue follows.
+
+Menus and their numbers are
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=17
+ Search for short sequences
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =2
+  1 All acids
+X 2 Named groups
+  3 Personal file
+  4 Keyboard
+? 0,1,2,3,4 =
+ 
+ACID/D/E//
+BASIC/R/K/H//
+HYDRO/F/L/I/V/Y//
+GLYCO/N-S/N-T//
++/R/K/H//
+-/D/E//
+DIBASIC/RR/KK/RK/KR//
+TURN/N/D/G/P/S//
+BLOCK/A/Q/E/I/L/M/F/W/V//
+INDIF/R/C/H/K/T/Y//
+End of file
+ 
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+ 
+  1 All acids
+X 2 Named groups
+  3 Personal file
+  4 Keyboard
+? 0,1,2,3,4 =
+ 
+? (y/n) (y) All names n
+? Name=acid
+? Name=basic
+? Name=glyco
+? Name=
+ 
+? (y/n) (y) Show results name by name
+? (y/n) (y) List matches
+ 
+ searching
+ matches=    59
+NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+ACID                  E                          7       7       1
+ACID                  E                         10       3       1
+ACID                  E                         24      14       1
+ACID                  E                         28       4       1
+ACID                  D                         36       8       1
+ACID                  D                         46      10       2
+ACID                  E                         51       5       2
+ACID                  E                         67      16       2
+ACID                  D                         69       2       2
+ACID                  D                         81      12       2
+ACID                  E                         84       3       2
+ACID                  E                         96      12       3
+ACID                  D                        116      20       3
+... etc
+ matches=    61
+NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+BASIC                 K                         13      13       1
+BASIC                 R                         15       2       1
+BASIC                 H                         26      11       1
+BASIC                 R                         40      14       1
+BASIC                 H                         42       2       2
+BASIC                 R                         59      17       2
+ ...etc
+ matches=     2
+NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+GLYCO                 NST                        4       4       3
+GLYCO                 NQT                      487     483      28
+                                                        28     483
+ 
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+ 
+  1 All acids
+X 2 Named groups
+  3 Personal file
+  4 Keyboard
+? 0,1,2,3,4 =
+ 
+? (y/n) (y) Selected names
+ 
+? Name=basic
+? Name=glyco
+? Name=
+ 
+? (y/n) (y) Show results name by name n
+? (y/n) (y) List matches
+
+ searching
+NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+GLYCO                 NST                        4       3
+BASIC                 K                         13       9
+BASIC                 R                         15       2
+BASIC                 H                         26      11
+BASIC                 R                         40      14
+BASIC                 H                         42       2
+BASIC                 R                         59      17
+BASIC                 R                         68       9
+BASIC                 K                         87      19
+ ...etc
+BASIC                 R                        477      14
+BASIC                 H                        479       2
+GLYCO                 NQT                      487       8
+BASIC                 K                        499      12
+BASIC                 K                        501       2
+BASIC                 K                        508       7
+                                                         7
+ 
+X 1 Search
+  2 List enzyme file
+  3 Clear text
+  4 Clear graphics
+? 0,1,2,3,4 =
+  1 All acids
+X 2 Named groups
+  3 Personal file
+  4 Keyboard
+? 0,1,2,3,4 =4
+Define search strings by typing a string name
+followed by the string(s)
+? Name=MARY
+? String(s)=AL/VI
+? Name=
+? (y/n) (y) All names 
+? (y/n) (y) Show results name by name 
+? (y/n) (y) List matches 
+
+ searching
+ matches=    12
+NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+MARY                  VI                        38      38      10
+MARY                  AL                        63      25      13
+MARY                  VI                       136      73      16
+MARY                  AL                       177      41      19
+MARY                  AL                       217      40      25
+MARY                  AL                       233      16      37
+MARY                  AL                       243      10      40
+MARY                  AL                       256      13      41
+MARY                  AL                       326      70      45
+MARY                  VI                       345      19      51
+MARY                  AL                       396      51      70
+MARY                  AL                       470      74      73
+
+
+.END LIT
+
+.left margin1
+@18. TX 1 5 @Compare a sequence
+.LEFT MARGIN2
+.para
+This routine slides a short sequence along the current sequence and finds 
+all positions at which a given percentage of the amino acids match.
+Output is in both graphical and listed forms. 
+.para
+If  users call for dialogue when the routine is selected they will be given 
+the choice of keyboard or file input. Define the string, and the percentage 
+match. Matches will be plotted out and then the user can select to have 
+them listed. Then the routine cycles around.
+.para
+ The routine slides the search string 
+along the  sequence and marks the positions at which a minimum 
+percentage score is reached. The graphical output draws a vertical line at 
+the match position; the height of the line represents the percentage 
+score, 
+so that if the line reaches the top of the box the score is 100%.
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=18
+ Find percentage matches
+? (y/n) (y) Keep picture
+ 
+? String=aaa
+? Percent match (1.00-100.00) (70.00) =
+ 
+ missing graphics 
+ 
+Total scoring positions above 70.000 percent =  19
+Scores          2      2      2      2      2      2      2      2      2      2
+Positions      61    131    177    217    226    231    232    267    300    301
+ 
+? Number to list (0-19) (0) =3
+ 
+        61
+         AIA
+         * *
+         aaa
+         1
+ 
+       131
+         AIA
+         * *
+         aaa
+         1
+ 
+       177
+         ALA
+         * *
+         aaa
+         1
+? (y/n) (y) Keep picture n
+ 
+Default String=aaa
+? String=!
+
+.end lit
+ 
+.left margin1
+@19. TX 1 5 @Compare a sequence using a score matrix
+.LEFT MARGIN2
+.para
+This routine slides a short sequence along the current sequence and finds 
+all positions at which a given level of similarity (a cutoff score) is 
+reached. The score is defined by use of a score matrix (MDM78). Output is 
+in both graphical and listed forms. 
+.para
+If  users call for dialogue when the routine is selected they will be given 
+the choice of keyboard or file input. Define the string and the cutoff 
+score. Matches will be plotted out and then the user can select to have 
+them listed. Then the routine cycles around.
+.para
+ The routine slides the search string 
+along the  sequence and marks the positions at which a the cutoff score 
+is achieved. The graphical output draws a vertical line at 
+the match position; the height of the line represents the  score, 
+so that if the line reaches the top of the box the score is the maximum 
+possible.
+.para
+Typical dialogue follows.
+.lit
+ 
+Menus and their numbers are
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=19
+ Find matches using a score matrix
+? (y/n) (y) Keep picture
+ 
+? String=aaa
+Minimum score=    12 Maximum score=    36
+? Score (12-36) (36) =
+
+ missing graphics
+ 
+For score    24 the number of matches=   507
+scores         35     35     35     34     34     34     34     34     34     34
+positions     226    231    379    112    133    202    227    267    378    
+380
+ 
+? Number to list (0-507) (0) =3
+ 
+       226
+         ATA
+         * *
+         aaa
+         1
+ 
+       231
+         SAA
+           **
+         aaa
+         1
+ 
+       379
+         GAA
+          **
+         aaa
+         1
+? (y/n) (y) Keep picture n
+ 
+Default String=aaa
+? String=!
+.end lit
+.left margin1
+@20. TX 5 @Search for a motif using a weight matrix
+.LEFT MARGIN2
+.para
+This function performs searches for short sequence
+motifs using an appropriate  weight matrix. In addition it can be used to 
+create or modify weight matrices. In order to perform a search the only 
+input 
+required is the name of the file containing the weight matrix.
+The results can be presented graphically or listed. The graphical 
+presentation will draw line at the position of any matches found; the 
+height of the line is proportional to the score.
+.para
+For a search, select "use weight matrix", supply the name of the file 
+containing the weight matrix, and choose between having results plotted 
+or listed. If dialogue is requested when the function is selected users can 
+alter the cutoff score employed.
+.para
+To create a weight matrix several steps are involved. A file containing an 
+alignment of known motifs is required. (This file must be created before 
+the current option is selected. The format is a follows: each sequence is 
+written on a separate line with at least one space at the beginning; each 
+sequence is terminated by a space character, and can be followed by a 
+name. The sequences must be aligned.) Supply the name of the file of 
+aligned sequences. The program reads and displays the sequences. Choose 
+between "summing logs of weights" or summing weights (i.e. whether to 
+multiply or add weights). If logs are used all scores will be negative. 
+Choose if all positions in the set of aligned sequences should be used or 
+if a mask should be applied. If so selected, define a mask as a string of 
+symbols, in which symbol - means ignore and any other symbol means 
+use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+.para
+The program will calculate weights as the frequencies of each amino 
+acid at each unmasked position in the set of aligned sequences. These 
+weights are then applied to the set of aligned sequences to give a range  
+of "observed" scores. The mean and standard deviation of these scores is 
+displayed. The user is asked to supply several values to be used when the 
+weight matrix is applied to other sequences: a cutoff score (by default, 
+the mean minus 3 standard deviations), a top score for scaling graphical 
+results (by default, the mean plus 3 standard deviations), and a position 
+to identify (this means that if a particular amino acid within the motif 
+is used as a "landmark", such as the G of the helix-turn-helix motif, then 
+its position will be marked in plots). All these values are stored along 
+with the weight matrix. Finally supply the name of a file to contain the 
+weight matrix.
+.para
+Weight matrices can be "rescaled" using a set of aligned sequences in 
+much the same ways as a matrix is created. The purpose is to redefine 
+the cutoff scores, and rescaling does not alter any other values in the 
+weight matrix file.
+.para
+The methods have changed considerably but were first outlined in
+Staden, R. Nucl. Acid Res. 12 505-519 1984, and
+Staden, R. Genetic 
+engineering: principles and methods vol 7, Edited by J.K. Setlow and A. 
+Hollaender, Plenum publishing corp., 1985.
+.para
+ The methods have always had to deal with the problem of zeroes in the 
+matrices. The current versions 
+employ "Laplaces Law of Succession" in which 1 is 
+added to each term.
+.para
+It is now possible to apply a mask to a set of aligned sequences in 
+order to give weight to selected positions only.
+Sequences have superimposed functions: some parts may be of general 
+structural 
+importance and give rise to an overall framework, and other parts give 
+specificity and hence are not common; we may want to use a set of 
+aligned 
+sequences to define a motif, but want to use only the framework 
+positions.
+ Alternatively we may want to pick out 
+only those parts of a set of aligned sequences that give a particular 
+property, and to ignore other similarities that are due to some other 
+property
+and which could obscure the pattern 
+we are interested in. The ability to define a mask allows certain 
+positions 
+to be used in the motif and others to be ignored, and yet still permits the 
+use of a set of aligned sequences to calculate weights. 
+.para
+Typical dialogue is shown below.
+.lit
+? Menu or option number=20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =2
+? Name of aligned sequences file=[rs.motifs]hth.seq
+     1 QESVADKMGMGQSGVGALFN LAMBDA.REP
+     2 QTKTAKDLGVYQSAINKAIH LAMBDA.CRO
+     3 QAALGKMVGVSNVAISQWQR P22.REP
+     4 QRAVAKALGISDAAVSQWKE P22.CRO
+     5 QAELAQKVGTTQQSIEQLEN 434.REP
+     6 QTELATKAGVKQQSIQLIEA 434.CRO
+     7 RQEIGQIVGCSRETVGRILK CAP
+     8 RGDIGNYLGLTVETISRLLG Fnr
+     9 LYDVAEYAGVSYQTVSRVVN LAC.R
+    10 IKDVARLAGVSVATVSRVIN GAL.R
+    11 TEKTAEAVGVDKSQISRWKR LAMBDA.CII
+    12 QRKVADALGINESQISRWKG P22.CI
+    13 KEEVAKKCGITPLQVRVWCN MAT.ALPHA
+    14 TRKLAQKLGVEQPTLYWHVK TETR.TN10
+    15 TRRLAERLGVQQPALYWHFK TETR.pSC1
+    16 QRELKNELGAGIATITRGSN TRP.REP
+    17 RQQLAIIFGIGVSTLYRYFP H-INVERSN
+    18 ATEIAHQLSIARSTVYKILE TN3.RESOL
+    19 ASHISKTMNIARSTVYKVIN GD.RESOLV
+    20 IASVAQHVCLSPSRLSHLFR ARA.C
+    21 RAEIAQRLGFRSPNAAEEHL LEX.R
+Length of motif    20
+? (y/n) (y) Sum logs of weights
+? (y/n) (y) Use all motif positions n
+x means use, - means ignore
+e.g. xx-x---x-x means use positions 1,2,4,8,10
+? Mask=--xxxxxxxxxxxx------
+ Applying weights to input sequences
+   1      -57.143 QESVADKMGMGQSGVGALFN
+   2      -55.087 QTKTAKDLGVYQSAINKAIH
+   3      -58.079 QAALGKMVGVSNVAISQWQR
+   4      -54.986 QRAVAKALGISDAAVSQWKE
+   5      -55.181 QAELAQKVGTTQQSIEQLEN
+   6      -55.874 QTELATKAGVKQQSIQLIEA
+   7      -56.692 RQEIGQIVGCSRETVGRILK
+   8      -57.722 RGDIGNYLGLTVETISRLLG
+   9      -55.363 LYDVAEYAGVSYQTVSRVVN
+  10      -55.769 IKDVARLAGVSVATVSRVIN
+  11      -56.786 TEKTAEAVGVDKSQISRWKR
+  12      -55.833 QRKVADALGINESQISRWKG
+  13      -56.279 KEEVAKKCGITPLQVRVWCN
+  14      -53.125 TRKLAQKLGVEQPTLYWHVK
+  15      -55.833 TRRLAERLGVQQPALYWHFK
+  16      -58.651 QRELKNELGAGIATITRGSN
+  17      -56.749 RQQLAIIFGIGVSTLYRYFP
+  18      -56.986 ATEIAHQLSIARSTVYKILE
+  19      -60.618 ASHISKTMNIARSTVYKVIN
+  20      -58.988 IASVAQHVCLSPSRLSHLFR
+  21      -58.002 RAEIAQRLGFRSPNAAEEHL
+Top score     -53.125  Bottom score     -60.618
+Mean     -56.655  Standard deviation       1.617
+Mean minus 3.sd     -61.505  Mean plus 3.sd     -51.804
+? Cutoff score (-999.00-9999.00) (-61.51) =
+? Top score for scaling plots (-61.51-999.00) (-51.80) =
+? Position to identify (0-20) (1) =9
+? Title=hth
+? Name for new weight matrix file=1.wts
+ 
+Menus and their numbers are
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =
+ 
+? Motif weight matrix file=1.wts
+ hth
+? (y/n) (y) Use frequencies as weights
+? (y/n) (y) Plot results n
+      5    -61.46 STEISELIKQRIAQFNVVSE
+     13    -58.93 KQRIAQFNVVSEAHNEGTIV
+     21    -60.42 VVSEAHNEGTIVSVSDGVIR
+     57    -59.39 GNRYAIALNLERDSVGAVVM
+     59    -61.47 RYAIALNLERDSVGAVVMGP
+     79    -59.90 YADLAEGMKVKCTGRILEVP
+     88    -61.41 VKCTGRILEVPVGRGLLGRV
+    104    -60.38 LGRVVNTLGAPIDGKGPLDH
+    127    -60.13 SAVEAIAPGVIERQSVDQPV
+    129    -59.91 VEAIAPGVIERQSVDQPVQT
+    133    -60.79 APGVIERQSVDQPVQTGYKA
+    139    -61.12 RQSVDQPVQTGYKAVDSMIP
+    175    -58.90 KTALAIDAIINQRDSGIKCI
+    191    -60.95 IKCIYVAIGQKASTISNVVR
+    195    -60.94 YVAIGQKASTISNVVRKLEE
+    215    -60.66 HGALANTIVVVATASESAAL
+    254    -60.56 EDALIIYDDLSKQAVAYRQI
+    260    -60.08 YDDLSKQAVAYRQISLLLRR
+    297    -61.00 LLERAARVNAEYVEAFTKGE
+    314    -61.29 KGEVKGKTGSLTALPIIETQ
+    330    -60.49 IETQAGDVSAFVPTNVISIT
+    363    -57.63 GIRPAVNPGISVSRVGGAAQ
+    365    -61.48 RPAVNPGISVSRVGGAAQTK
+    371    -61.02 GISVSRVGGAAQTKIMKKLS
+    382    -57.90 QTKIMKKLSGGIRTALAQYR
+    394    -60.07 RTALAQYRELAAFSQFASDL
+    424    -59.95 GQKVTELLKQKQYAPMSVAQ
+    430    -58.89 LLKQKQYAPMSVAQQSLVLF
+    432    -61.14 KQKQYAPMSVAQQSLVLFAA
+    438    -58.58 PMSVAQQSLVLFAAERGYLA
+    458    -61.06 DVELSKIGSFEAALLAYVDR
+    466    -61.00 SFEAALLAYVDRDHAPLMQE
+    483    -60.48 MQEINQTGGYNDEIEGKLKG
+    494    -60.61 DEIEGKLKGILDSFKATQSW
+ 
+Menus and their numbers are
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=d20
+X 1 Use weight matrix
+  2 Make weight matrix
+  3 Rescale weight matrix
+? 0,1,2,3 =
+ 
+? Motif weight matrix file=1.wts
+ hth
+? (y/n) (y) Use frequencies as weights
+? Cutoff score (-9999.00-9999.00) (-61.51) =-56.
+? (y/n) (y) Plot results n
+ 
+
+.end lit
+.left margin1
+@21. TX 3 @Calculate amino acid composition
+.LEFT MARGIN2
+.para
+This function calculates the amino acid composition and molecular 
+weight 
+for the active region.
+.lit
+? Menu or option number=21
+ Sequence composition
+ 
+A   C     S     T     P     A     G     N     D     E     Q     B     Z     H
+N   3.   32.   23.   18.   57.   47.   16.   28.   31.   28.    0.    0.    7.
+%   0.6   6.2   4.5   3.5  11.1   9.1   3.1   5.4   6.0   5.4   0.0   0.0   1.4
+W  309. 2786. 2325. 1748. 4051. 2682. 1826. 3222. 4003. 3588.    0.    0.  
+960.
+ 
+A   R     K     M     I     L     V     F     Y     W     -     X     ?
+N  30.   24.   11.   40.   47.   41.   14.   15.    1.    0.    0.    0.    1.
+%   5.8   4.7   2.1   7.8   9.1   8.0   2.7   2.9   0.2   0.0   0.0   0.0   0.2
+W 4686. 3076. 1443. 4527. 5319. 4065. 2060. 2448.  186.    0.    0.    0.    
+0.
+Total molecular weight=    55328.
+ 
+.end lit
+.left margin1
+@22. TX 3 4 @Plot hydrophobicity
+.LEFT MARGIN2
+.para
+This routine plots the hydrophobicity of each section of the sequence 
+using 
+the hydrophobicity 
+values of Kyte and Doolittle (J. Mol. Biol. 157, 105-132 (1982)).
+A window of size span is slid along the sequence and a sum calculated 
+for 
+each position.
+.para
+If dialogue is requested select a span length and a plot interval.
+.para
+The diagrams are  on the same scale as Fig. 6 of the Kyte and Doolittle 
+paper and values of + and - 50 could be assigned to the top and bottom of 
+the diagram with corresponding values in between (-40,-20,0,20,40 are 
+shown 
+in the paper).
+.lit
+? Menu or option number=d22
+ Plot hydrophobicity
+? odd span length (1-101) (11) =
+? plot interval (1-101) (3) =
+
+ missing graphics
+.end lit
+.LEFT MARGIN1
+@23. TX 3 4 @Plot charge
+.LEFT MARGIN2
+.para
+This routine plots the charge of each section of the sequence.
+A window of size span is slid along the sequence and a sum calculated 
+for 
+each position. Amino acids are assigned charges of 1, -1 or 0.
+.para
+If dialogue is requested select a span length and a plot interval.
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=d23
+ Plot charge
+? odd span length (1-101) (11) =
+? plot interval (1-101) (3) =
+
+ missing graphics
+
+.end lit
+.LEFT MARGIN1
+@24. TX 4 @Plot robson prediction
+.LEFT MARGIN2
+.para
+This routine uses the method of Garnier J, Osguthorpe D J, and Robson B. 
+(1978) J. Mol. Biol. 120, 97-120 to predict secondary structures. The 
+method divides protein secondary structures into 4 classes: helix, 
+extended 
+(usually referred to as sheet), turn and coil. The routine calculates the 
+likelihood that each segment of the sequence lies in each of these 
+classes. Results are presented graphically or listed.
+.para
+If dialogue is requested choose between plotted or listed output.
+.para
+ Each residue
+has a 
+certain probability of being found in each of the 4 classes. This 
+probability
+depends both on its own amino acid type and also the 8 
+amino acids found to either side along the protein chain. Four tables of 
+weights, each 20 by 17 elements are used to calculate the likelihood that 
+each residue along the chain falls into one of the four classes of 
+structure. The most likely structure at each point 
+is the one with the highest score.
+The four values are plotted in strips labelled H, E, T and C.
+Below, a strip labelled D for decision is divided into four levels, each 
+corresponding to one of the four structure types. Their top to bottom 
+order 
+is the same as that for the strips above, i.e C, T, E, and H. For each 
+residue the program measures which of the four likelhoods is highest. It
+places a single dot at the 
+ mid-point of the corresponding strip, and 
+also at the
+appropriate level in the strip labelled D.
+.PARA
+It should be noted that the method, when tested by Kabsch W and Sander 
+C, 
+(1983) Febs. Lett. 155 (179-182), although one of the better ones, was 
+correct for only about 56% of residues.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d24
+ Plot Robson secondary structure predictions
+? (y/n) (y) Plot results n
+
+     9 S   217   -7  -39   15
+    10 E   226    5  -27  -39
+    11 L   233   -7  -26  -15
+    12 I   229  -23    9    4
+    13 K   214   -8   10   -8
+    14 Q   178   42   19    5
+    15 R   131   54   16    3
+    16 I    86   42  -31  -23
+    17 A    55   52  -30  -15
+    18 Q    15   67    4   25
+    19 F   -34   86   47   74
+    20 N   -41   74   17  106
+    21 V   -16  118   -5  100
+    22 V    64   88    5  115
+    23 S    96   38   26  155
+    24 E   133  -25   13   96
+    25 A   118  -98   25  100
+    26 H   110 -150   37   86
+    27 N    57 -201   37   66
+    28 E    51 -140   11   -4
+    29 G     2  -77   37    9
+    30 T     2   28   28    7
+    31 I   -11  117  -21   22
+    32 V   -23  178  -55    5
+    33 S   -54  193  -14   35
+    34 V   -46  123    5   30
+    35 S   -54   53   51   80
+    36 D   -60    1   86   55
+    37 G   -66    8   57   49
+    38 V    -1  128  -30   -5
+    39 I    11  212  -56  -33
+    40 R    16  204  -44  -57
+ ...etc
+
+.end lit
+.LEFT MARGIN1
+@26. TX 4 @Draw a helix wheel
+.LEFT MARGIN2
+.para
+A helical representation of segments of the sequence is shown. The 
+display 
+includes a schematic of the helix showing the links between residues, 
+with 
+each vertex numbered according to position; the sequence element at 
+each 
+vertex; a symbol denoting a classification as hydrophobic(.), positively 
+charged(+), negatively charged(-), or otherwise( ). The 
+residue number of the first sequence element in 
+the current window is displayed at the top-left-hand 
+corner of the diagram. Also at the top-left corner the sequence in the 
+current window is listed. Below this is the total hydrophobicity and 
+hydrophobic moment for the window calculated according to Eisenberg et 
+al 
+J. Mol. Biol. 179, 125-142 (1984).
+.para
+If dialogue is requested the user is asked for the angle to define the turn 
+between residues as seen 
+looking along the helix, and a window length. The window length can be up 
+to 60, with default 18, and the angle has a default of 100 degrees. Note 
+that 18 x 100 is 5 turns. When the option is selected the first segment in 
+the current active region is displayed then the bell rings. If the user 
+types only return, the display will click on by one residue; if another 
+number is typed, say N, then the display will click forwards (or 
+backwards 
+if N is negative) by N residues. If the wheel runs off either end of the 
+sequence the option will be exited.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d26
+? Angle (1-130) (100) =
+? Window (1-60) (18) =
+
+ missing graphics
+
+.end lit
+.left margin1
+@25. TX 3 4 @Plot hydrophobic moment
+.LEFT MARGIN2
+.para
+This routine plots hydrophobic moment and hydrophobicity according to 
+Eisenberg et al
+J. Mol. Biol. 179, 125-142 (1984). The mean hydrophobicity per residue in 
+the window is plotted on a scale -1.0 to 1.5, and the mean hydrophobic 
+moment per residue on a scale 0.0 to 1.5. 
+The hydrophobicity is shown in the top frame with the 
+hydrophobic moment below.
+The plot is arranged so that the 
+value shown at position x represents the mean value for residues x-
+window+1 
+to x, where window is the window length. 
+.para
+If dialogue is requested the user can select a window 
+length, and the  angle used for the hydrophobic moment 
+calculation.
+.para
+Note that according to Eisenberg et al, in transmembrane proteins an 
+"initiator" is required. This is either a very hydrophobic single helix 
+with <H> >=0.68, or a moderately hydrophobic pair of helices whose <H> 
+sum 
+to >= 1.1. Other helices are then accepted as transmembrane if their <H> 
+>= 
+0.42
+.para
+The following rules are claimed: if <H> < 0.51 and points lie below the 
+line <M> = -0.392 + 0.603x <H> they are "globular", if they lie above this 
+line they are "surface". If <H> > 0.51 and they lie above the line <M> = 
+0.6 - 0.342x<H> they are "monomeric", if above "multimeric".
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=d25
+? Angle (1-130) (100) =
+? Window (1-60) (18) =
+? Plot interval (1-101) (3) =
+
+ missing graphics
+
+
+.end lit
+.left margin1
+@27. TX 1 @Back translate to dna
+.LEFT MARGIN2
+.para
+This routine back translates protein sequences into DNA using the 
+standard 
+genetic code. The level of redundancy can be plotted and the 
+backtranslation saved to a file.
+.para
+The translation can use either the IUB symbols shown below, or a set of 
+codon 
+preferences. If a set of codon preferences are used they must conform to 
+the format of codon tables produced by the nucleotide analysis 
+program, and the back 
+translation 
+will contain the favoured codons. If there is no favoured codon
+the IUB symbols will be employed. The window length for 
+plotting the redundancy is in codons.
+.para
+The program will plot the redundancy along the sequence and hence can 
+be 
+used to find the best sequences to use as primers. Note that the program 
+plots the inverse, and so the higher the 
+plot the LESS redundant the sequence. For primers look for peaks rather 
+than 
+troughs.
+.para
+The DNA sequence can be saved to a file and analysed using the nucleotide 
+analysis program.
+Depending on the application it is often useful to produce a back 
+translation using both a table of codon preferences and one using the IUB 
+symbols. This is because the restriction enzyme search program can 
+distinguish between definite and possible cuts in the sequence.
+These matches are what the program terms "definite matches" and are 
+ones in 
+which the specification of the recognition sequence corresponds 
+exactly to that of the back translation. The program will also find what 
+it 
+terms "possible matches" which are ones that depend on the particular 
+codons
+chosen for each amino acid.
+These are sites at which recognition 
+sequences could be engineered to produce a cut in the DNA 
+without changing the amino 
+acid, but which are not 
+necessarily found in the original sequence.
+.LIT
+
+
+            NC-IUB SYMBOLS
+
+      A,C,G,T
+      R        (A,R)        'puRine'
+      Y        (T,C)        'pYrimidine'
+      W        (A,T)        'Weak'
+      S        (C,G)        'Strong'
+      M        (A,C)        'aMino'
+      K        (G,T)        'Keto'
+      H        (A,T,C)      'not G'
+      B        (G,C,T)      'not A'
+      V        (G,A,C)      'not T'
+      D        (G,A,T)      'not C'
+      N        (G,A,C,T)    'aNy'
+
+ Typical dialogue follows.
+
+? Menu or option number=d27
+ Back translate
+? (y/n) (y) No codon preference
+? (y/n) (y) Plot redundancy n
+? (y/n) (y) Save DNA to disk
+? File name for DNA sequence=tt:
+ATGCARYTNAAYWSNACNGARATHWSNGARYTNATHAARCARMGNATHGCNCARTTYAAY
+GTNGTNWSNGARGCNCAYAAYGARGGNACNATHGTNWSNGTNWSNGAYGGNGTNATHMGN
+ATHCAYGGNYTNGCNGAYTGYATGCARGGNGARATGATHWSNYTNCCNGGNAAYMGNTAY
+GCNATHGCNYTNAAYYTNGARMGNGAYWSNGTNGGNGCNGTNGTNATGGGNCCNTAYGCN
+GAYYTNGCNGARGGNATGAARGTNAARTGYACNGGNMGNATHYTNGARGTNCCNGTNGGN
+MGNGGNYTNYTNGGNMGNGTNGTNAAYACNYTNGGNGCNCCNATHGAYGGNAARGGNCCN
+YTNGAYCAYGAYGGNTTYWSNGCNGTNGARGCNATHGCNCCNGGNGTNATHGARMGNCAR
+WSNGTNGAYCARCCNGTNCARACNGGNTAYAARGCNGTNGAYWSNATGATHCCNATHGGN
+MGNGGNCARMGNGARYTNATHATHGGNGAYMGNCARACNGGNAARACNGCNYTNGCNATH
+GAYGCNATHATHAAYCARMGNGAYWSNGGNATHAARTGYATHTAYGTNGCNATHGGNCAR
+AARGCNWSNACNATHWSNAAYGTNGTNMGNAARYTNGARGARCAYGGNGCNYTNGCNAAY
+ACNATHGTNGTNGTNGCNACNGCNWSNGARWSNGCNGCNYTNCARTAYYTNGCNMGNATG
+CCNGTNGCNYTNATGGGNGARTAYTTYMGNGAYMGNGGNGARGAYGCNYTNATHATHTAY
+GAYGAYYTNWSNAARCARGCNGTNGCNTAYMGNCARATHWSNYTNYTNYTNMGNMGNCCN
+CCNGGNMGNGARGCNTTYCCNGGNGAYGTNTTYTAYYTNCAYWSNMGNYTNYTNGARMGN
+GCNGCNMGNGTNAAYGCNGARTAYGTNGARGCNTTYACNAARGGNGARGTNAARGGNAAR
+ACNGGNWSNYTNACNGCNYTNCCNATHATHGARACNCARGCNGGNGAYGTNWSNGCNTTY
+GTNCCNACNAAYGTNATHWSNATHACNGAYGGNCARATHTTYYTNGARACNAAYYTNTTY
+AAYGCNGGNATHMGNCCNGCNGTNAAYCCNGGNATHWSNGTNWSNMGNGTNGGNGGNGCN
+GCNCARACNAARATHATGAARAARYTNWSNGGNGGNATHMGNACNGCNYTNGCNCARTAY
+MGNGARYTNGCNGCNTTYWSNCARTTYGCNWSNGAYYTNGAYGAYGCNACNMGNAARCAR
+YTNGAYCAYGGNCARAARGTNACNGARYTNYTNAARCARAARCARTAYGCNCCNATGWSN
+GTNGCNCARCARWSNYTNGTNYTNTTYGCNGCNGARMGNGGNTAYYTNGCNGAYGTNGAR
+YTNWSNAARATHGGNWSNTTYGARGCNGCNYTNYTNGCNTAYGTNGAYMGNGAYCAYGCN
+CCNYTNATGCARGARATHAAYCARACNGGNGGNTAYAAYGAYGARATHGARGGNAARYTN
+AARGGNATHYTNGAYWSNTTYAARGCNACNCARWSNTGG---
+ 
+
+.end lit
+
+.LEFT MARGIN1
+@28. TX 5 @Search for patterns of motifs
+.left margin2
+.para
+This option searches for patterns of motifs. Patterns can be defined 
+interactively or read from files. Results can be displayed in several ways 
+in both graphical and textual form. Used to create pattern files for 
+searching libraries. The option is extremely flexible and consequently the 
+following documentation is quite lengthy. However the routine is capable 
+of searching for almost any known pattern. In addition the flexibility 
+does not necessitate difficulty of use, and the userinterface has been 
+simplified considerably since the methods were first published.
+.para
+Users should refer to the "typical dialogue" shown below for the most 
+helpful information on using the program.
+.para
+There are currently 
+four ways to display the matching patterns: 1=each individual
+motif and its position is listed; 2=all the sequence between, and 
+including the two 
+outermost motifs is listed; 3=graphical, with a vertical line marking the 
+position 
+of the leftmost motif; 4 = EMBL feature table format, where the KEYNAM 
+field is the motif name, the FROM and TO fields denote the ends of the 
+match, and the DESCRIPTION field is "Program".
+.para
+When it is defined for the first time a pattern must be entered 
+interactively at the keyboard, but the pattern description 
+can be saved to a file. 
+This file can be used for all subsequent searches.
+.para
+When defining a pattern interactively
+select a motif class and the program will request the required inputs. 
+.para
+The program gives each motif an identifying name and number.
+For motifs other than the first, a range of allowed positions must be 
+defined (Note that sets of motifs included using the OR operator will all 
+be given the same range, and so the program will only request range 
+values 
+for the first motif in any such set).
+To specify the allowed range for a motif the user must supply the 
+following: the 
+identifying number of the motif, relative to which the current motifs 
+positions are to be defined (termed the "reference motif"); a "relative start 
+position" and a range. The relative start position can be negative or positive. 
+A negative start position means that although the reference motif 
+is searched for first, the current motif can be found to its left.
+A zero relative start position means their left ends are superimposed. The 
+default start position is to butt-joint the motif to righthand end of the 
+"reference motif". The range is "the number of extra positions" that the 
+motif can take.
+.para
+The program will display the probability of finding each motif. These 
+values are presented in the following form: .1234E-5 means 0.1234 times 
+10 
+to the power -5.
+.para
+After the pattern has been defined, the program will type a description 
+of 
+it on the screen. It will then allow the user to give an overall cutoff 
+score and overall probability cutoff.
+.para
+Typical dialogue for all the different motif classes is displayed below.
+.lit
+
+? Menu or option number=28
+  Pattern searcher
+? (y/n) (y) Read pattern from keyboard 
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =
+? Motif name=aa
+? String=aa
+Probability of score     2.0000 = 0.123E-01
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =2
+? Motif name=pmatch
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-1) (1) =
+? Relative start position (-1000-1000) (3) =
+? Number of extra positions (0-1000) (0) =
+? String=qqq
+? Minimum matches (1.00-3.00) (3.00) =2
+Probability of score     2.0000 = 0.858E-02
+  1 Exact match
+X 2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =3
+? Motif name=sm
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-2) (2) =
+? Relative start position (-1000-1000) (4) =
+? Number of extra positions (0-1000) (0) =
+? String=wqa
+? Minimum score (11.00-53.00) (53.00) =36
+Probability of score    36.0000 = 0.531E-02
+  1 Exact match
+  2 Percentage match
+X 3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =4
+? Motif name=hth
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-3) (3) =
+? Relative start position (-1000-1000) (4) =
+? Number of extra positions (0-1000) (0) =
+? Weight matrix file name=hth
+ HELIX TURN HELIX PABO SAUER WEIGHTS 17-11-87                                  
+Probability of score   -51.5860 = 0.230E-04
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+X 4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =5
+? Motif name=repeat
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-4) (4) =
+? Relative start position (-1000-1000) (21) =
+? Number of extra positions (0-1000) (0) =3
+? Repeat length (1-60) (6) =3
+? Minimum gap (0-60) (0) =
+? Maximum gap (0-60) (0) =2
+? Minimum score (11.00-60.00) (36.00) =
+Probability of score    36.0000 = 0.445E-01
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+X 5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =6
+? Motif name=mset
+X 1 And
+  2 Or
+  3 Not
+? 0,1,2,3 =
+? Number of reference motif (1-5) (5) =
+? Relative start position (-1000-1000) (1) =
+? Number of extra positions (0-1000) (0) =
+X 1 Keyboard input
+  2 File input
+? 0,1,2 =
+Separate sets with commas
+? String=AVL,AST,,WYRF
+? Minimum matches (1.00-4.00) (4.00) =3
+Probability of score     3.0000 = 0.718E-02
+  1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+X 6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =7
+? (y/n) (y) Save pattern in a file 
+? Pattern definition file=EXAM.PAT
+Motif  6 needs a file name to store set as a weight matrix
+? Weight matrix file name=DEMO.WTS
+Weight matrix needs a title
+? Title=Demonstration class 6 weight matrix
+
+Pattern description
+
+Motif  1 named aa       is of class    1
+Which is an exact match to the string
+aa
+Motif  2 named pmatch   is of class    2
+which is a match of score     2. to the string
+qqq
+and the N-terminal residue can take positions      3 to       3
+relative to the N-terminal end of motif   1
+It is anded with the previous motif.
+Motif  3 named sm       is of class    3
+which is a match of score    36. to the string
+wqa
+and the N-terminal residue can take positions      4 to       4
+relative to the N-terminal end of motif   2
+It is anded with the previous motif.
+Motif  4 named hth      is of class    4
+Which is a match to a weight matrix with score -51.586
+and the N-terminal residue can take positions      4 to       4
+relative to the N-terminal end of motif   3
+It is anded with the previous motif.
+Motif  5 named repeat   is of class    5
+Which is a repeat with repeat length    3 and score    36.
+The loop-out can have sizes      0 to      2
+and the N-terminal residue can take positions     21 to      24
+relative to the N-terminal end of motif   4
+It is anded with the previous motif.
+Motif  6 named mset     is of class    6
+Which is membership of a set with score   3.000
+It is anded with the previous motif.
+Probability of finding pattern = 0.4109E-14
+Expected number of matches  = 0.2539E-10
+? Maximum pattern probability (0.00-1.00) (1.00) =
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+ Select display mode
+X 1 Motif by motif
+  2 Inclusive
+  3 Graphical
+  4 EMBL feature table
+? 0,1,2,3,4 =
+ Searching
+
+Total matches found      0
+Menus and their numbers are 
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=6
+Page through text files
+? Name of file to read=exam.pat
+ A1          aa       Class
+ aa
+ @ End of string
+ A2          pmatch   Class
+      1      Relative motif
+      3      Relative start position
+      0      Number of extra positions
+ qqq
+ @ End of string
+   2.00000   Cutoff
+ A3          sm       Class
+      2      Relative motif
+      4      Relative start position
+      0      Number of extra positions
+ wqa
+ @ End of string
+  36.00000   Cutoff
+ A4          hth      Class
+      3      Relative motif
+      4      Relative start position
+      0      Number of extra positions
+hth                                      File name
+ A5          repeat   Class
+      4      Relative motif
+     21      Relative start position
+      3      Number of extra positions
+      3      Length
+      0      Minimum loop
+      2      Maximum loop
+  36.00000   Cutoff
+ A6          mset     Class
+      5      Relative motif
+      1      Relative start position
+      0      Number of extra positions
+DEMO.WTS                                 File name
+End of file
+Menus and their numbers are 
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=6
+Page through text files
+? Name of file to read=demo.wts
+ Demonstration class 6 weight matrix
+      4     0     3.000     4.000
+ P   1   2   3   4
+ N   0   0   0   0
+ C   0   0   0   0
+ S   0   1   0   0
+ T   0   1   0   0
+ P   0   0   0   0
+ A   1   1   0   0
+ G   0   0   0   0
+ N   0   0   0   0
+ D   0   0   0   0
+ E   0   0   0   0
+ Q   0   0   0   0
+ B   0   0   0   0
+ Z   0   0   0   0
+ H   0   0   0   0
+ R   0   0   0   1
+ K   0   0   0   0
+ M   0   0   0   0
+ I   0   0   0   0
+ L   1   0   0   0
+ V   1   0   0   0
+ F   0   0   0   1
+ Y   0   0   0   1
+ W   0   0   0   1
+End of file
+Menus and their numbers are 
+m0 = This menu
+m1 = General
+m2 = Screen control
+m3 = Statistical analysis of content
+m4 = Structure
+m5 = Search
+ ? = Help
+ ! = Quit
+? Menu or option number=28
+  Pattern searcher
+? (y/n) (y) Read pattern from keyboard 
+X 1 Exact match
+  2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =2
+? Motif name=avlst
+? String=avlst
+? Minimum matches (1.00-5.00) (5.00) =3
+Probability of score     3.0000 = 0.394E-02
+  1 Exact match
+X 2 Percentage match
+  3 Cut-off score and score matrix
+  4 Cut-off score and weight matrix
+  5 Direct repeat
+  6 Membership of set
+  7 Pattern complete
+? 0,1,2,3,4,5,6,7 =7
+? (y/n) (y) Save pattern in a file n
+
+Pattern description
+
+Motif  1 named avlst    is of class    2
+which is a match of score     3. to the string
+avlst
+Probability of finding pattern = 0.3941E-02
+Expected number of matches  = 0.2030E+01
+? Maximum pattern probability (0.00-1.00) (1.00) =
+? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+ Select display mode
+X 1 Motif by motif
+  2 Inclusive
+  3 Graphical
+  4 EMBL feature table
+? 0,1,2,3,4 =4
+ Searching
+
+FT   avlst       152    156       Program
+Total matches found      1
+Minimum and maximum observed scores        3.00        3.00
+ 
+.end lit
+.para
+General notes
+.para
+These methods allow users to define and search for
+complex patterns of motifs defined as single objects.
+The programs allow individual DNA motifs to be defined in eight 
+different
+ways, and protein motifs in six. Motifs are combined, using the logical 
+operators AND, OR and NOT, to describe a pattern. The pattern also 
+specifies the ranges of allowed relative separations of the individual 
+motifs. 
+.para
+First some definitions.
+.para
+A MOTIF is a contiguous subsequence of fixed length.
+At its simplest 
+it could be a single definite base or amino acid; a more complex motif 
+might be better represented as a consensus or a weight matrix; 
+two more-abstract types of 
+motif are direct and inverted repeats. 
+.para
+A PATTERN is a higher order of structure defined by a list of motifs. The 
+motifs in a pattern are combined using the logical operators AND, OR and 
+NOT. The list also defines the allowed relative separations of the 
+motifs. In the current versions of the programs up
+ to 50 motifs can be combined into a single pattern. So using these 
+definitions there are two 
+differences between motifs and patterns: 1) the distances between all 
+elements of a motif are fixed, but 
+the separations of parts of patterns can vary;
+ 2) all characters in a motif are defined 
+using the same method (class), but different parts of a pattern can be 
+defined in completely different ways.
+.para
+Each motif 
+can be represented in 9 ways (known as the motif class):
+.sk1
+.lit
+           MOTIF CLASSES
+CLASS           DESCRIPTION
+ 1       Exact match to a short defined sequence. The IUB symbols
+         can be used for DNA sequences.
+ 2       Percentage match to a defined short sequence. In nucleic acids, 
+         the IUB symbols can be used.
+ 3       Match to a defined sequence, using a score matrix and cutoff
+         score. The DNA matrix (see option 18) gives scores to IUB symbols 
+         depending on their level of redundancy. MDM78 is used for proteins.
+ 4       Match to a weight matrix with cutoff score.
+ 5       As class 4 but on the complementary strand.
+ 6       Inverted repeat or stem-loop. Fixed stem length, range of 
+         loop sizes, and cutoff score using A-T, G-C=2; G-T=1.
+ 7       Exact match to short sequence but with a defined step size.
+ 8       Direct repeat. Fixed repeat length, range of loop-out sizes,
+         cutoff score, and score matrix (for protein sequences MDM78 and
+         for nucleic acids an identity matrix).
+ 9       Membership of a set. A list of sets of allowed amino acids for 
+         each position in the motif. The sets are separated by commas(,).
+         For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino 
+         acids in which one of I,V or L must be found in the first position, 
+         then anything in the next two positions, D,E,K or R in the fourth 
+         position and F,Y,W,I,L,V or M in the fifth. This class only applies
+         to protein sequences because for nucleic acids "membership of a 
+set"
+         can be achieved using IUB symbols.
+
+    Classes 1 - 4, 8 and 9 apply to protein sequences, and classes 1-8 to 
+    nucleic acids.
+
+.end lit
+.para
+Class 1: exact match.
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+ may include IUB symbols. All symbols must match.
+.para
+Class 2: percentage match
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+may include IUB symbols. The minimum number of matching characters 
+must 
+also be specified.
+.para
+Class 3: match using a score matrix
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+may include IUB symbols. The motif is not compared directly with the 
+sequence to count the number of matching characters. Instead a matrix is 
+used to provide a score for all possible pairs of characters. The motif 
+score for 
+any position along the sequence is the sum of the scores found by 
+looking-up the scores for each pair of aligned characters. A match is 
+declared if some minimum score is achieved.
+.para
+Class 4: weight matrix
+.para
+The motif is defined by a table of values (called weights or scores). The 
+table gives a score for finding each possible character at each position 
+along the length of the motif. It therefore 
+has dimension motif-length x character-set-size, and allows us to give 
+different scores for each character at each position. It is equivalent to 
+having a different score matrix for each position along the motif, and 
+provides the most flexible and specific method of defining motifs. The 
+weight matrices are created by program PIP option 20 and 
+stored as files. The file contains the values
+for each position, as well as an overall minimum score. 
+There are two ways in which these values can be used to calculate an 
+overall 
+score for any section of the sequence. The simplest way is to add the 
+values in the file. (This means that the highest possible score
+can be calculated by adding the top value at each column 
+position, and the lowest 
+by adding the bottom value.)
+ The normal way of using the values in the file is as 
+follows. 
+First the programs divide the values in each column by the column total 
+so 
+that they sum to 1.0
+Then the natural 
+logs of these values are used as scores. When the matrix is applied to a 
+sequence these logarithmic values are summed (which is of course 
+equivalent 
+to multiplying the frequencies).
+Note that using the natural logs of the frequencies as 
+weights and 
+adding them means that the overall cutoff score must be less than zero, 
+whereas if the original
+values in the weight matrix file are added, the cutoff score will be 
+greater than zero. The search routines therefore decide whether the user 
+wants to add values or multiply frequencies
+by examining the value of the cutoff score: it will add if the cutoff 
+is 
+greater than zero and add logs of frequencies if it is less than zero.
+ Hence we effectively get two 
+motif classes in one. The program PIP, when creating weight matrix 
+files, will ask the user whether the scores should be added or multiplied. 
+ If the values in the table have been defined 
+without using a set of aligned sequences
+it is easier for the user to 
+choose a cutoff score if the values are added.
+.para
+Class 5: complement of weight matrix
+.para
+The motif is defined by a weight matrix, but the program searches for its 
+complement.
+.para
+Class 6: inverted repeat, or stem-loop
+.para
+The motif is defined by a repeat length, a minimum score
+ and a range of loop sizes. The scores are A-T=2, G-C=2, G-T=1, else=0.
+The loop sizes are defined by a minimum 
+and maximum distance from the 3' end of the stem.
+For a stem-loop these will be positive numbers. For example to 
+define a stem of length 8 and loop sizes varying from 3 to 5, the stem 
+would be set to 8, the minimum start distance to 3 and the maximum 
+to 5. To define an 
+inverted repeat the minimum distance will be negative. For example stem 
+length=9,
+minimum distance=-9, and maximum distance=-8 will find 
+inverted repeats of lengths 9 and 10. 
+E.g. AAAAATTTT and AAAAATTTTT would be found, the first having a base 
+at 
+its centre, the second having none.
+.para
+Class 7: exact match, defined step size.
+.para
+The motif is defined by a short sequence, which for nucleic acids,
+ may include IUB symbols. All symbols must match. The class differs 
+from 
+class 1 in that searches will move in steps of some given size. For 
+example 
+we could search for a certain codon and use a step size of 3 and hence
+ keep in a 
+single reading frame.
+.para
+Class 8: direct repeat
+.para
+The motif is defined by a repeat length, a minimum score
+ and a range of loop sizes. The scores are defined using MDM78 for protein 
+sequences and an identity matrix for nucleic acids.
+The loop sizes are defined by a minimum 
+and maximum distance from the 3' end of the stem.
+.para
+Class 9: membership of a set
+.para
+This motif class is for protein sequences. It is defined by lists of 
+allowed amino acids for each position in the motif, and a cut-off score.
+Positions at which any amino acid can occur are left blank.
+All allowed amino acids for each position give a score of 1.
+The motifs can be defined in two ways: either typed at the keyboard or 
+read 
+in as a weight-matrix-like file.
+When the motif is defined at the keyboard the sets of allowed amino 
+acids
+are separated by commas(,).
+         For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino 
+         acids in which one of I,V or L must be found in the first position, 
+         then anything in the next two positions, D,E,K or R in the fourth 
+         position and F,Y,W,I,L,V or M in the fifth.  To specify that the 
+whole motif must match a score of 3 would be required (i.e. one of the 
+allowed amino acids must be found for each of the three defined 
+positions).
+If the motif is read from a file the file must have been written by 
+program 
+PIP, or have been saved by the pattern searching routines. If the 
+user 
+elects to save a pattern, and it includes class 9 motifs typed at the 
+keyboard, then the program will save the class 9 motifs as weight matrix 
+files. Therefore it will request file names for each motif of this class. 
+If the motif given above as an example were saved the weight matrix file
+would have 5 columns.
+The first column 
+would contain zeroes except for the I, V and L rows 
+which would be set to 1; the next two columns would all be zero; the next 
+would be zero except for the D,E,K and R rows which would be 1; the final 
+column would contain 1's in rows F,Y,W,I,L,V and M, with 
+the rest zero.
+.para
+
+The logical operator (AND, OR or NOT) used to add each motif to the 
+pattern
+is specified by preceding 
+the class number by the letters A, O or N. A = AND, O = OR, N = NOT.
+The default is A, so N2 means include, using the NOT operator, a class 2 
+motif; O2 means include, using the OR operator, a class 2 motif; both A2 
+and 
+2 mean include, using the AND operator, a class 2 motif.
+
+.para
+Range setting.
+.para
+The motifs in a pattern are numbered according to their order in the list. 
+Apart from the first motif in a pattern all motifs are given a range 
+of allowed positions relative to a motif further up the list. 
+For example
+suppose we have a pattern defined by A AND B AND C AND D.
+Motif A can occur anywhere, but B must have its range of allowed 
+positions defined relative to the position of motif A, and C's positions 
+can be defined relative to either A or B, depending on which is most 
+convenient, and likewise D's positions can be relative to A or B or C.
+.para
+Notice that the positions of motifs can be defined relative to more than 
+one motif. Suppose we have a pattern consisting of 
+motifs A, B and C, and that B occurs 5-10 residues right of A, C occurs 5-
+10 
+residues right of B, and also C is never more than 15 residues from A. 
+Then 
+it is quite consistent with the methods to include motif C into the 
+pattern 
+twice using the AND operator: once relative to A and once relative to B. 
+This will define the relative spacing and the ORDER of the motifs in the 
+pattern. (If we simply defined the position of C relative to A it could be 
+found to the left of B).
+.para
+Motifs combined together using the OR operator are all given the same 
+range. For example suppose we had a pattern A AND (B OR C) AND (D OR E),
+ then B and C each have the same range, and D and E also have 
+the same range as one another. The range for D and E can be relative to 
+A or to B.
+.para
+Motifs cannot have their ranges defined relative to motifs that are 
+included using the NOT operator. For example if we had the pattern A NOT 
+B 
+AND C, then the range for C can only be defined relative to motif A.
+.para
+Speed can be gained by arranging the order 
+of the motifs so that those higher up the list are of types that can be 
+searched for rapidly and that are also unlikely to be found.
+.para
+Motifs combined by the OR operator are alternatives: if any one of a set 
+of motifs 
+combined by the OR operator is found, then a match is declared. All
+alternatives will be reported. For example if we had a pattern defined by 
+A 
+AND (B OR C), then all places where A occurs and B is found within range, 
+and all places where A is found and C is found within range will be 
+reported. A typical use would be where we might allow a motif to appear 
+on 
+either strand of the DNA sequence. For example a weight matrix 
+representing 
+the heatshock element could be used in a pattern which included 
+heatshock 
+as a motif class 4 combined using the OR operator 
+with heatshock as a motif class 5.
+.para
+The probability calculations are performed for each motif as it is 
+defined. 
+If an overall probability cut-off is given the calculation is repeated for 
+each match found. To achieve maximum searching speed do not give an 
+overall 
+probability cut-off. Overall cut-off scores should only be used if the 
+motif 
+classes used are compatible.
+.para
+There are currently 
+several ways to display the matches: 1 = each 
+motif and its position is listed; 2 = all the sequence between the two 
+outermost motifs is listed; 3 = graphical, with a spike marking the 
+position 
+of the leftmost motif. The library versions also give entry names, and a 
+one 
+line title; in addition they can be used to produce aligned families of 
+sequences. When this mode of output is selected the program will write a 
+separate file for each match. The files will be called ENTRYNAME.DAT 
+where 
+ENTRYNAME is the name of the entry in the library. The matching 
+sequence 
+will be written out so that the spacing between motifs is constant, and 
+set to the maximum allowed by the pattern definition. Any gaps will be 
+filled with dashes (-). If the individual sequences were subsequently 
+written one above the other
+they should line up so that all motifs are in register. There two types of 
+output of this sort: one, option 4, writes out whole sequences, the other, 
+option 5, writes out only the sequences between the two outermost 
+motifs.
+If the individual sequences were subsequently 
+written one above the other
+they should line up so that all motifs are in register. There two types of 
+output of this sort: one, option 4, writes out whole sequences, the other, 
+option 5, writes out only the sequences between the two outermost 
+motifs.
+Note that for option 4 users are asked to type the position of the 
+first motif, and the reason for
+this is explained below. 
+Consider a pattern found in several sequences. Consider only
+the first motif in 
+the pattern and suppose that it was found in different positions in these 
+sequences. 
+Say that of these positions the one furthest from the left end was 
+position 100. Then, in order to ensure that all the sequences would align, 
+we must specify that motif 1 must start at position 100. 
+Any sequences in which motif 1 started 
+nearer to the left end than position 100 would be padded accordingly.
+These modes of output 
+should only be used when the position of each motif is defined relative to 
+its 
+immediate neighbour.
+.para
+The pattern descriptions can be saved to files. These files 
+can be used instead of typing definitions again at the keyboard. As the 
+files are annotated,
+they can easily 
+be changed using system editors, and the modified versions used to 
+define the variant patterns for the programs.
+.para 
+.para
+Use of lists of entry names 
+.para
+The two programs that operate on libraries have the ability to 
+restrict their searches to subsets of the libraries. This does not require 
+sublibraries to be created but instead is achieved by using files 
+containing a list of the entry names of sequences. The user may choose to 
+search only those entries on the list or, alternatively to search all but 
+those on the list (i.e. in the latter case
+the list contains the names of those to be excluded).
+ The programs can search libraries that have indexes and those that 
+do not.
+ If a list of names for inclusion is used,
+then the search will be faster if the index is present. In all other 
+circumstances the whole library will be read. 
+The list must be in library order except when it is used
+to include entries, and an index is available.
+The list must contain each entry name on a separate line, with the name 
+starting in column 1 of the line. ie there must be no spaces at the start 
+of the line.
+The list of entry names
+can be produced by the keyword searches of nip, pip, etc as long 
+as the listings produced have a space character separating the entry name 
+from the entry description. This will depend on how well the library 
+reformatting programs work. For example swissprot entry names tend to run 
+into the beginning of the descriptions, but other libraries are generally 
+OK.
+
+.para
+One use of the programs is to look for patterns that we already know 
+about, but in new sequences. However it is hoped that they will also be 
+useful for finding new motifs. For example
+several known control regions in 
+nucleic acid 
+sequences consist of particular direct or inverted repeats;
+the inclusion of
+direct and inverted repeats as motif classes
+makes it possible to 
+find previously unknown
+motifs of these types. 
+Using these new programs we can 
+ask questions like: "are there any inverted or direct repeats near to 
+sections of sequence that contain both a
+CCAAT box and a TATA box?"; and to search for such things throughout 
+the 
+libraries. In addition, the mode of output in which all the sequence 
+between 
+the two outermost motifs found is printed out, allows us to extract 
+sequences and examine them in more detail for further common 
+subsequences. 
+For example we might want to collect together all the sequences 
+between 
+putative CCAAT and TATA boxes.
+.para
+A further use of the inverted repeat motif class is the following. If a 
+regulatory sequence in DNA is poorly defined but also an inverted repeat, 
+then it might be an advantage to specify it both as a consensus sequence 
+and 
+a superimposed inverted repeat. In this way two weak definitions can be 
+combined to produce a stronger pattern.
+.para
+Given only a few examples of a motif it 
+should be possible to perform initial searches using a 
+class 3 motif, and then, using plausible matching sequences, create a 
+more 
+specific weight matrix for the same motif.
+.para
+If motifs are combined with the first motif using the OR operator
+they will be ignored until all 
+permutations that include the first motif have been looked for. 
+The whole search will then be repeated, in 
+turn, for each of 
+those motifs that are combined with the first motif using the OR 
+operator.
+An interesting consequence of this is that the program can be used, 
+without 
+change, to compare any newly determined sequence with all known 
+individual 
+motifs. We achieve this by having a pattern in which all known relevant 
+motifs are combined using the OR operator.
+If we ask to use this pattern with 
+a sequence, the program will automatically compare each individual 
+motif in 
+the pattern with the whole length of the 
+sequence. As the number of known 
+motifs grows this should become an increasingly useful standard 
+procedure.
+.para
+The NOT operator is obviously 
+useful for making sure particular motifs are not present, but it can also 
+be used to bracket the levels of matches found. We may want a degree of 
+match that lies between two limits - binding should occur, but not too 
+strongly; or base-pairs should form, but not too many. We can specify 
+this 
+by asking for a match with a low score, in combination with a match and 
+a 
+high score, both for the same motif, but with the high score included 
+using 
+the NOT operator.
+.para
+The algorithm is designed to find all sections of a sequence that satisfy 
+the pattern rather than only the best match. 
+Particularly if some of the motifs in a pattern are less well defined than 
+others, this can often result in the same region of a sequence being 
+reported as having several matches, but which only vary in the 
+positions of the weakest motifs.
+.para
+General remarks on motif searching
+.para
+Generally motifs are short subsequences that are thought to be 
+associated with 
+particular functions in some known sequences. Often 
+we search for them to try to 
+understand or interpret other sequences. Sometimes we search for 
+motifs and
+patterns to 
+test a hypothesis about their role: are they found in the expected 
+positions in the expected sequences. In doing so we should remember 
+that, in both proteins and nucleic acids,
+ what we are really looking for is a particular 
+three dimensional structure with certain affinities for other structures, 
+and that we are assuming that the sequence of the motif alone
+defines the 3D structure we searching for. 
+ The overall structure 
+may be completely different to those in which the motif is functional, 
+and 
+hence the motif may have a different shape or be inaccessible. 
+We should be aware of the 
+importance of the context in which a motif is found. Where does it lie 
+relative to the overall structure, is it accessible, is the three 
+dimensional spacing between 
+it and other motifs correct? For example, is it on the same side of the 
+double helix, and the correct distance from some other motif? How does 
+context affect our assessment of the significance of finding a motif? 
+Finding false mammalian mRNA splice junctions in non-coding sequences 
+is 
+far less important than finding false sites in pre-mRNA sequences, but 
+finding them in the correct places is most important! In other words, it 
+is 
+often the case that when we are searching for a motif that is known to 
+be  
+necessary for some function, then a positive result in the form of a 
+match 
+in the required position, is more important than a high background of 
+matches in the wrong positions. Being 
+ able to write 
+down the probability of finding a motif in a random sequence tells us how 
+well it is defined. 
+In nucleic 
+acids the DNA may contain many superimposed types of information such 
+as 
+those concerned with histone phasing, protein coding or mRNA secondary 
+structure. These overlapping "codes" may interfere with one another 
+causing 
+matches to motifs to be poorer than expected.
+In general we will only have a limited number of examples of the 
+motif and we do not know how representative they are.
+.para
+Sequences have superimposed functions: some parts may be of general 
+structural 
+importance and give rise to an overall framework, and other parts give 
+specificity and hence are not common; we may want to use a set of 
+aligned 
+sequences to define a motif, but want to use only the framework 
+positions.
+ Alternatively we may want to pick out 
+only those parts of a set of aligned sequences that give a particular 
+property, and to ignore other similarities that are due to some other 
+property
+and which could obscure the pattern 
+we are interested in.
+It is possible to apply a mask to a set of aligned sequences in 
+order to give weight to selected positions only.
+ The ability to define a mask allows certain positions 
+to be used in the motif and others to be ignored, and yet still permits the 
+use of a set of aligned sequences to calculate weights. The mask is 
+requested and applied 
+by the program and results in the masked positions being zero 
+in 
+the weight matrix. The mask is defined in the following way. 
+Suppose we had a motif of length 15, then the mask 
+x--x--xx-x will give zero weights to positions 2,3,5,6 and 9 (note it is 
+the dashes (-) that are significant and that positions 
+1,4,7,8,10,11,12,13,14 and 15 
+will be non-zero). Of course 
+the same set of sequences could be used with several alternative masks 
+in 
+order to extract different features and create corresponding weight 
+matrices.
+.para
+The programs are described in Staden,R. 
+CABIOS 4, 53-60, 1988; Staden,R.
+ CABIOS 5, 89-96, 1989, anf a forthcoming Methods in 
+Enzymology.
+.left margin1
+@ end of help
diff --git a/help/README b/help/README
new file mode 100644
index 0000000..b662de2
--- /dev/null
+++ b/help/README
@@ -0,0 +1,38 @@
+    README file for help directory of staden package
+    -----------------------------------------------
+
+Should contain (at least) ProgramName_help where ProgramName is each of
+bap, dap, gip, mem, mep, nip, nipf, pip, sap, sip and also staden_help
+and stadenp_help.
+
+There are 3 main formats of file in this directory:
+
+PROGRAM.RNO:
+	This is the unformatted (runoff/nroff style) help for PROGRAM.
+	Any changes to the help should be performed on this file.
+
+program_help:
+	This is the online formatted help used by PROGRAM. It can also
+	be printed to produce hardcopy documentation.
+
+program_menu:
+	This is a file that describes the menus used in PROGRAM,
+	together with an index into the program_help file for the
+	online help. The format for each line is:
+	
+	<option number> <menu number> <program_help offset> <no. of
+	lines of help> <program type T(ext) or (X)windows> <option name>
+
+
+Exceptions to these are for the staden_help, stadenp_help, and
+splitp_help which do not have the relevant .RNO or _menu files. The
+file staden_help gives an introduction to the xterm user interface
+(written for vax and vms and so is out of date with the Unix
+versions).
+
+See the file splitp_help for information about the reformatting of the
+PROSITE motif library.
+
+Rebuild help files with the Unix command "make all". Ensure that the utility
+program sethelp is compiled and in the executables search path. The sources
+for the program sethelp are found in $STADENROOT/staden.
diff --git a/help/SAP.RNO b/help/SAP.RNO
new file mode 100644
index 0000000..b84eae5
--- /dev/null
+++ b/help/SAP.RNO
@@ -0,0 +1,2523 @@
+.npa
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. TX  0 @Modification
+.sp
+@0.  TX -1 @SAP
+.left margin2
+.PARA
+This is an interactive program whose primary use is
+for managing shotgun sequencing projects, but it can also be used for 
+handling alignments of other sequences, including those of proteins. 
+Currently the maximum 'gel reading' length is set to 4096 characters. 
+Almost all of the information below describes the use of the program for 
+shotgun projects, but those using the programs for handling other 
+sequence 
+alignments should interpret it accordingly.
+The data for such a project is stored in a special type of database. The 
+program
+ contains the tools that are required to type in gel readings,
+screen them against vector sequences and restriction sites; 
+enter new gel 
+readings into the database (automatically comparing and aligning
+them). In addition it contains editors and functions to examine the quality 
+of the aligned sequences.
+.para
+ There are three main menus: "general", "graphics" and  "modification", 
+and some functions have submenus.
+.left margin2
+.lit
+  The general menu contains the following options:
+
+       0 = List of menus
+       ? = Help
+       ! = Quit
+       3 = Open a database
+       4 = Edit contig
+       5 = Display a contig
+       6 = List a text file
+       7 = Direct output to disk
+       8 = Calculate a consensus
+      17 = Screen against restriction enzymes
+      18 = Screen against vector
+      19 = Check consistency
+      25 = Show relationships
+      27 = set parameters
+      28 = Highlight disagreements
+      29 = Examine quality
+
+The graphics menu contains:
+
+       0 = List of menus
+       ? = Help
+       ! = Quit
+      10 = Clear graphics
+      11 = Clear text
+      12 = Draw ruler
+      13 = Use cross hair
+      14 = Change margins
+      15 = Label diagram
+      16 = Plot map
+      33 = Plot single contig
+      34 = Plot all contigs
+
+
+The modification menu contains:
+
+       0 = List of menus
+       ? = Help
+       ! = Quit
+       4 = Edit a contig
+       9 = Screen edit
+      20 = Auto assemble
+      21 = Enter new gel reading
+      22 = Join contigs
+      23 = Complement a contig
+      24 = Copy database
+      26 = Alter relationships
+      30 = Auto edit a contig
+      31 = Type in gel readings
+      32 = Extract gel readings
+
+  The enter new gel reading menu contains:
+
+       ? = Help
+       ! = Quit
+       3 = Complete entry
+       4 = Edit contig...
+       5 = Display overlap
+       6 = Edit new gel reading...
+
+   The join contig menu contains:
+
+       ? = Help
+       ! = Quit
+       3 = Complete join
+       4 = Edit left contig...
+       5 = Display joint
+       6 = Edit right contig...
+       7 = Move join
+
+   The alter relationships menu contains:
+
+       ? = Help
+       ! = Quit
+       3 = Line change
+       4 = Edit single gel reading...
+       5 = Delete contig
+       6 = Shift
+       7 = Move gel reading
+       8 = Rename gel reading
+       9 = Break contig
+
+   The edit menu contains:
+  
+       ? = Help
+       ! = Quit
+       3 = Insert
+       4 = Delete
+       5 = Change
+
+.END LIT
+.SK1
+.para
+Overview of the methodology
+.para
+The shotgun sequencing strategy
+.para
+               In the shotgun sequencing procedure
+the sequence to be determined is randomly broken into fragments of 
+about 
+400 nucleotides in length. These fragments are cloned and then 
+selected randomly and their
+
+          sequences   determined.    The  relationship  between  any  pair  of
+
+          fragments is not known beforehand
+but is found by comparing their  sequences.
+
+          If  the  sequence  of  one found to be wholly or partially contained
+
+          within that of another  for  sufficient  length  to  distinguish  an
+
+          overlap  from  a repeat then those two fragments can be joined.  
+The
+
+          process of select, sequence and compare is continued until the 
+whole
+
+          of  the  DNA  to  be  sequenced is in one continuous well 
+determined
+
+          piece.
+
+.para
+          Definition of a contig
+
+.para
+               A CONTIG is a set of gel  readings  that  are  related  to  one
+          another  by  overlap of their sequences.  All gel readings belong to
+          a contig and each contig contains at  least  one  gel
+          reading.   The  gel  readings in a contig can be summed to produce 
+a continuous consensus sequence and the length of this sequence is 
+the length  of the contig.  The rules used to perform this summation are
+          given under "the consensus algorithm".
+          At any stage
+          of  a  sequencing project the data will comprise a number of 
+contigs;
+when  a  project  is
+
+          complete  there  should be only one contig and its consensus will be
+          the finished sequence. Note that since being introduced and 
+defined as above the word "contig" has been taken up by those involved in 
+genomic mapping. In that context the consensus with a  precise length is not 
+defined.
+
+.SK1
+.LEFT MARGIN2
+Introduction to the computer method
+.LEFT margin2
+.PARA
+It is useful to consider the objectives of a sequencing project before 
+outlining how we use the computer to help achieve them. The aim of a 
+shotgun sequencing project is to 
+produce an accurate consensus sequence from many overlapping gel 
+readings.
+It is necessary to know, particularly at the latter
+stages of the project, how accurate the 
+consensus sequence is. This enables us to know which regions of the
+ sequence require further work and also to know when the project is 
+finished.
+To show the quality of the consensus, the programs described here
+produce displays like that shown below.
+.sk1
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.para
+                This is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gel readings 
+numbered 6, 3, 5, 17 and 12;
+6, 3 and 5 
+are in reverse orientation to their original reading (denoted by a minus  
+sign). Each gel reading also has a name (eg HINW.010). It can be seen that 
+in a number of places the sequences contain characters other than A,C,G 
+and 
+T. Some of these extra characters have been used by the sequencer to 
+indicate regions of uncertainty in the initial interpretation of the gel 
+reading, but the asterisks (*) have been inserted by the automatic 
+assembly function in order to align the sequences. Underneath each 50 
+character block of gel reading sequences is the consensus derived from 
+the 
+sequences aligned above (the line labelled CONSENSUS). For most of its 
+length the consensus has a definite nucleotide assignment but in a few 
+positions there is insufficient agreement between the gel readings and
+so a dash (-) appears in the sequence. This display contains all the 
+evidence needed to assess the quality of the consensus: the number of 
+times 
+the sequence has been determined on each strand of the DNA, and the 
+individual nucleotide assignments given for each gel reading.
+.para
+So the aim is to produce the consensus sequence and, equally important, 
+a display of the experimental results from which it was derived.
+.para
+In order to achieve this the following operations need to be performed:
+.left margin2
+1) Interpret autoradiographs and put individual gel readings into the 
+computer.
+.left margin2
+2) Check each gel reading to make sure it is not simply part of one of the 
+vectors used to clone the sequence.
+.left margin2
+3) Check each gel reading to make sure that those fragments that span 
+the 
+ligation point used prior to sonication are not assembled as single 
+sequences.
+.left margin2
+4) Compare all the remaining gel readings with one another to assemble 
+them 
+to produce the consensus sequence.
+.left margin2
+5) Check the quality of the consensus and edit the sequences.
+.left margin2
+6) When all the consensus is sufficiently well determined, produce a copy 
+of 
+it for processing by other analysis programs.
+.para
+It is very unlikely that this procedure will only be passed through once.
+Usually steps 1 to 5 are cycled through repeatedly, with step 4 just 
+adding 
+new sequences to those already assembled. Generally step 6 is also used 
+in 
+order to analyse imperfect sequence to check if it is the one the project 
+intended to sequence, or to look for interesting features. Analysis of 
+the consensus, such as 
+searches for protein coding regions,
+can also help to find errors in the sequence. The display of the 
+overlapping gel readings shown above can be used to indicate, not only 
+the 
+poorly determined regions, but also which clones should be resequenced 
+to 
+resolve ambiguities, or those which can usefully be extended or 
+sequenced 
+in the reverse direction, to cover 
+difficult regions.
+
+.PARA
+The original
+individual gel readings for a sequencing project are each stored in 
+separate files. As the gel readings are entered into the computer
+(usually in batches, say 10 
+from a film), the file names they are given are stored in 
+a further file, called a file of file names. Files of file names  
+enable gel readings to be processed in batches. 
+.para
+For each sequencing project 
+we start a project database. This database has a structure specifically 
+designed for
+dealing with shotgun sequence data. 
+In order to arrive at the final consensus sequence many operations will 
+be 
+performed on the sequence data. Individual fragments must be 
+sequenced and 
+compared in both senses (i.e. both orientations) with all the other 
+sequences. When an overlap between a new gel reading and a contig are 
+found 
+they must be aligned and the new gel reading added to the contig. If a 
+new 
+gel reading overlaps two contigs they must be aligned and joined. Before 
+the two contigs are joined one of them may need to be turned around 
+(reversed and complemented) so they are both in in the same orientation. 
+.para
+Clearly, keeping track of all these manipulations is quite complicated,
+and to be able to perform the operations 
+quickly requires careful choice of data 
+structure and algorithms. For these reasons it is not practicable to store 
+the gel readings aligned as shown in the display above. Rather, it is more 
+convenient to store the sequences unassembled, and to record sufficient 
+information for programs to assemble them during processing. The 
+data used to assemble the sequences is called relational information. 
+.left margin2
+.PARA
+ The database comprises three files and they are described under the 
+section entitled "open database".
+.PARA
+Before entry into the project database 
+each new gel reading must be compared to look for overlaps 
+with all the data already contained 
+within the database. This last point is 
+important: all searching for overlaps is between individual new gel 
+readings and the data already in the database. There is no searching for 
+overlaps between sequences within the database; overlaps must be found 
+before new gel readings are entered into the database.
+.para
+Below I give an introduction to how the sequencess are processed by 
+being 
+passed from one function to the next.
+.para
+This program is used to start a 
+database for the project and 
+then the following procedure is used.
+.para
+Data in the form of individual gel readings are entered into the computer
+ 
+and stored in separate files using either program this program or the digitizer 
+
+program. Batches 
+of these gel readings 
+are passed to the screening functions in this program to search for overlaps 
+
+with vector sequences ("screen against vector") or for matches to 
+
+restriction enzyme sites  that should not be
+ 
+present ("screen against enzymes"). 
+Each run of these screening functions passes on only those gel
+ 
+readings that do not contain unwanted sequences. Sequences are passed 
+
+via 
+files of file names and eventually are processed by the automatic 
+assembly function ("auto assemble"). This function compares each gel 
+reading with a consensus of all the previous gel readings 
+stored in the database.
+If it finds any 
+overlaps
+ it aligns the overlapping sequences by inserting padding characters, 
+and then adds the new gel reading to the database. 
+Gels that overlap are added to existing contigs and gels that do not 
+overlap any data in the database start 
+new contigs. If a new gel overlaps two contigs they are joined. 
+Any gel readings that appear to overlap but which
+cannot be aligned sufficiently well are not entered and have 
+their names written to a file of failed gel reading names.
+.PARA
+Generally data is entered 
+into the database in batches as just described. The program
+ is also used to examine
+
+the data in the database, to enter gel readings that the automatic
+ 
+assembly function cannot align ("enter new gel reading"),
+
+ and to make final edits. Edits to whole contigs 
+
+can  be made in several ways. An automatic editor ("auto edit") will
+ 
+perform almost all edits without any user intervention, but the program 
+
+also gives access to the system editor (EDT on the VAX), through the 
+
+function "screen edit", and to simple command driven editors ("edit 
+
+contig" and "edit new gel reading").  Disagreements between gel readings 
+
+in contigs and their consensus 
+
+sequences can be highlighted by use of the function "highlight
+ 
+disagreements". 
+.PARA
+Editing the sequences is obviously an essential part of managing a
+
+sequencing project. 
+Editing is required when new 
+
+sequences are added, when contigs are joined, and when sequences are 
+
+corrected.
+A basic part of the strategy
+
+used here is that new
+ 
+gel readings should be correctly aligned throughout their whole length
+
+when 
+they are entered into the database, and that when contigs are joined they
+ 
+are edited so that they are well aligned in the region of overlap.
+
+ Alignment can be achieved by
+
+adding padding characters to the sequences, and this is the way "auto
+ 
+assemble"  
+operates when adding new sequences to the database.
+
+.para
+In order to search 
+for overlaps that may have been missed due to errors in 
+
+the gel readings, the function "extract gel readings" can be used to take 
+
+copies of the gel
+ 
+readings at the ends of contigs, and write them out as separate files.
+ 
+These can then be compared with the database consensus using the "auto 
+
+assemble" function in a mode that forbids entry of data into the 
+database,
+and any gel reading matching two contigs will indicate a join that has 
+
+been 
+missed. The joins can then be made interactively using "join contigs". 
+
+Missed matches can be 
+
+found at this stage because the errors in the sequences may have been 
+
+corrected by new data.
+
+.para
+Generally the users need not concern themselves with how the relational 
+information is used by the program, but it is necessary to know
+how contigs are identified. Because contigs are constantly being changed and 
+reordered the program identifies them by the numbers of the gel readings 
+they contain. Whenever users need to identify a contig they need only 
+know 
+the number or name of one of the gel readings it contains. Whenever the 
+program asks users to identify a contig or gel reading they can type its 
+number or its archive name. If they type its archive name they must precede 
+the name by a slash "/" symbol to denote that it is a name rather than a 
+number. E.g if the  archive
+name is fred.gel with number 99, users should
+type /fred.gel or 99 when asked to identify the contig. Generally,
+ when it asks for the gel reading to be identified,
+the program will offer the user a default name,
+ and if the user types only return, that 
+contig will be accessed. When a database is opened the default contig will 
+be the longest one, but if another is accessed, it will subsequently  become 
+the current default.
+.para
+Further information is located in the following places. 
+The database files are described under "open database". The format 
+for 
+vector and consensus sequences is given under "calculate a consensus", as are 
+the 
+uncertainty codes used in gel readings. 
+.left margin2
+.para
+The only program, other than this, relevant to sequencing is the digitizer 
+program  and it is outlined briefly below.
+.para
+         The digitiser program
+is used for the initial input of gel readings
+and for writing a file of file names. The program
+uses a digitizer for data entry.
+A digitizer is
+  a  two  dimensional  surface such as a light box
+which is such that if a special pen is pressed onto it, the pens 
+coordinates are recorded by a computer.
+These coordinates
+          can be interpreted by a program.
+.para
+               In order to read an autoradiograph placed on the light box
+the user  need  only  define the bottom of
+the four sequencing lanes and the bases
+          to which they correspond and then use  the  pen  to  point  to  each
+          successive  band progressing up the gel.  The program examines 
+the
+          coordinates of each pen position to see in which of the  four  
+lanes
+          it  lies  and  assigns  the  corresponding  base to be stored in the
+          computer.  Each time the pen tip is depressed to point to a position
+          on  the  surface of the digitizer the program sounds the bell on the
+          terminal to indicate to the user that a point has been recorded.  As
+          the  sequence  is read the program displays it on the screen.
+
+.left margin1
+@17. TX 1 @Screen against restriction enzymes
+.left margin2
+.PARA
+Used to compare gel readings against any restriction enzyme recognition
+ 
+sequences that may have been used  during cloning and which should not 
+
+be present in the data. Works on single gel readings or processes batches
+ 
+accessed through files of file names. The algorithm looks for exact 
+
+matches to recognition sequences stored in a file. 
+
+.para
+The file containing the recognition sequences must be identified. The
+user 
+must choose between employing a file of file names, or typing in the
+
+ 
+names of individual gel reading files. If a file of file names is used the
+
+
+program will also create a new file of file names. When the option has
+ 
+finished operating this new file will contain the names of all those gel
+ 
+readings that did not match any of the recognition sequences. Hence it
+ can 
+be used for further processing of the batch. The recognition sequences 
+
+should be stored in a simple text file with one recognition sequence per 
+
+line.
+.left margin1
+@18. TX 1 @Screen against vector
+.left margin2
+.PARA
+Used to compare gel readings against any vector sequences that may have 
+
+been picked up during cloning. Works on single gel readings or processes 
+
+batches accessed through files of file names. The algorithm looks for
+exact 
+matches of length "minimum match length" and displays the overlapping 
+
+sequences.
+.para
+The file containing the vector sequence must be identified. The user must
+
+choose between employing a file of file names, or typing in the names of
+ 
+individual gel reading files. If a file of file names is used the program 
+will 
+also create a new file of file names. When the option has finished 
+
+operating  this new file will contain the names of all those gel readings
+
+that did not match the vector sequence. Hence it can be used for further 
+
+processing of the batch.The vector sequence should be stored in a simple
+ 
+text file with up to 80 characters of data per line. More than one vector
+ 
+can be stored in a single file. If so each should be preceded by a 20 
+
+character title of the form <---m13mp8.001-----> where the < and >
+ signs 
+and the number like .001 are obligatory. The number must be preceded 
+
+by a dot (.) and be 3 digits long. The total sequence in the file must be <
+
+50,001 characters in length.
+
+.left margin1
+@20. TX 2 @Auto assemble
+.left margin2
+.PARA
+Compares gel readings against the current contents of the database and 
+
+produces  alignments. In its normal mode of operation 
+("entry permitted"), the function
+will automatically enter the gel readings into the database, but if entry 
+is not permitted it will only produce alignments. It works on 
+
+single gel readings or processes batches of gel readings accessed through 
+
+files of file names. It is the usual way to enter data into the database.
+
+.para
+The function will check the database for logical consistency and will
+ only 
+procede if it is OK. Choose if gel readings should be entered into the
+ 
+database, or if they should only be compared. Choose between using a file 
+
+of file names or typing file names on the keyboard. If so selected, supply 
+
+the file of file names. Also supply a file of file names to contain the names of
+ 
+all the gel readings that fail to get entered. 
+Select the entry mode. Normal assembly is appropriate for all but special 
+cases, as is "permit joins". Uses for the other modes are not documented 
+here.
+Define a minimum initial
+ 
+match length. Define a minimum alignment block (the default value is
+ 
+taken in all but exceptional circumstances). Define the maximum number
+ 
+of paddding characters allowed to be used in each gel reading to help
+
+achieve alignment, and the same for the number allowed in the contig for
+ 
+each gel reading. Finally define the maximum percentage mismatch to
+be allowed for any gel reading to be entered into the database. If
+ 
+for any gel reading, either of these last three values is exceeded the gel
+ 
+reading will not be entered into the database.
+
+.para
+In operation the function takes a batch of gel readings (probably  passed
+
+          on  as  a file of file names from one of the screening routines) and 
+enters them into a
+          database for a sequencing project.  It takes each  gel reading
+  in  turn,
+          compares  it with the current consensus for the database, it then
+          produces an  alignment  for  any  regions  of  the  consensus  it
+          overlaps;   if  this alignment is sufficiently good it then edits
+          both the new gel reading and the sequences it overlaps  and  adds  
+the
+          new  gel reading to the database.  The program then updates the 
+consensus
+          accordingly and carries on to the next  gel  reading.
+.para
+          All alignments are displayed and any gel readings 
+that do match but  that
+
+          cannot be aligned sufficiently well have their names written to a
+          file of failed gel reading names.  The function works without  any
+
+          user intervention and can process any number of gel readings in a
+          single run.  Those gel readings that fail can be recompared using
+
+          the same function (to find the current overlap position) and  the
+  
+user  can enter them into the database
+
+          manually  using  the   "enter new gel reading" option.
+.para
+Typical dialogue and output from the function is shown below. (Note that 
+output for gel readings 2 - 9 has been deleted to save space).
+.lit
+Automatic sequence assembler
+Database is logically consistent
+? (y/n) (y) Permit entry 
+? (y/n) (y) Use file of file names 
+? File of gel reading names=demo.nam
+? File for names of failures=demo.fail
+Select entry mode
+X  1 Perform normal shotgun assembly 
+   2 Put all sequences in one contig 
+   3 Put all sequences in new contigs
+? Selection  (1-3) (1) =
+? (y/n) (y) Permit joins 
+? Minimum initial match (12-4097) (15) =
+? Minimum alignment block (2-5) (3) =
+? Maximum pads per gel (0-25) (8) =
+? Maximum pads per gel in contig (0-25) (8) =
+? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+  >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+  Processing           1 in batch
+  Gel reading name=HINW.004                                
+  Gel reading length=   283
+  Searching for overlaps
+  Strand     1
+  Strand     2
+  No matches found
+  Total matches found           1
+  Padding in contig=    0 and in gel=    1
+  Percentage mismatch after alignment =  1.8
+  Best alignment found
+         1         11         21         31         41         51
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         ********** ********** ********** ********** ********** **********
+         TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+         1         11         21         31         41         51
+        61         71         81         91        101        111
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+         ********** ********** ********** ********** ********** ***** ****
+         TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+        61         71         81         91        101        111
+       121        131        141        151        161        171
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         ********** ********** ********** ********** ********** **********
+         CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+       121        131        141        151        161        171
+       181        191        201        211        221        231
+         GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         ********** ********** * ******** ********** ********** **********
+         GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+       181        191        201        211        221        231
+       241        251        261        271        281
+         CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+         ****  **** ********** ** * ***** ********** ****
+         CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+       241        251        261        271        281
+          Batch finished
+          9 sequences processed
+          0 sequences entered into database
+          0 joins made
+
+.end lit
+
+.para
+Note that "auto assemble" cannot align protein sequences.
+.left margin1
+@28. TX 1 @Highlight disagreements
+.left margin2
+.para
+Used in the latter stages of a project
+to highlight disagreements between individual gel readings
+and their consensus sequences. Characters that agree with the
+ 
+consensus are shown as : symbols for the plus strand and . for the minus
+ 
+strand. Characters that disagree with the consensus are left unchanged 
+
+and so stand out clearly. The results of this analysis are written to a 
+file.
+
+.para
+Before selecting this option create a file of the display of the contig to 
+be 
+"highlighted". The option will ask for the name of this file. Select
+ symbols 
+to denote "agreeing" characters on each strand, the defaults are : and ., 
+
+but any others can be used. Supply the name of a file in which to put
+ 
+the output.
+.para
+The display file needed as input for this option is created by selecting 
+
+"Redirect output",  followed immediately by  "display contig", and then 
+"Redirect output" again. The 
+
+cutoff score used in the consensus calculation can be set by option "set
+
+display parameters". Note that for the highlight function
+there is a limit of 50 for the number of gel 
+readings that are aligned at any position - ie the contig must be less 
+than 51 gel readings deep at its thickest point. I hope that those performing
+shotgun sequencing never reach this limit, but those using the program for
+comparing sequence families might.
+.para
+Typical output from this function is shown below.
+.lit
+                                                                     
+                          210       220       230       240       250
+    1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+    7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+   -4  HINW.017                                 ...............AC....
+                   G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+                                                                     
+                          260       270       280       290       300
+    1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+    7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+   -4  HINW.017    ..............................................A...
+    3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+   -6  HINW.028                            ......................A...
+                   AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+                                                                     
+.end lit
+.left margin1
+@32. TX 3 @Extract gel readings
+.left margin2
+.para
+Used to make copies of the aligned gel readings in a database,
+to write them into separate files, and to write a 
+
+corresponding file of file names. It operates in two modes: either all gel
+ 
+readings are extracted, or only those at the ends of contigs. 
+
+.para
+Choose which mode of operation is required and supply a file of file 
+
+names. 
+.para
+The gel readings are given their original 
+
+names. 
+If used to extract the gel readings from the ends of contigs the function
+ is 
+useful for checking for missed contig joins: the file of file names can be 
+
+used with the auto assemble function to recompare these gel readings, 
+
+and each should only overlap one contig. Any that overlap two contigs
+
+will identify possible joins.
+.para
+If the option is used to extract all the gel readings from a database, a 
+
+subsequent run of "auto assemble" can reconstitute a database which has
+
+been corrupted. This  rarely occurs and is usually necessesitated by a
+ 
+user  employing "alter relationships"  incorrectly without first having
+ 
+made a copy. 
+.left margin1
+@1. TX 0 @Help
+.left margin2
+.PARA
+Help is available on the following topics :
+
+.LEFT MARGIN1
+@2. TX 0 Quit
+.LEFT MARGIN2
+.PARA
+This command stops the program and is the only safe way to terminate a 
+
+run 
+of the program that has altered the contents of the database in any way.
+
+.left margin1
+@3. TX 1 @Open a database
+.LEFT MARGIN2
+.PARA
+Opens existing databases or allows new ones to be started. The function
+ is 
+automatically called into operation 
+when the program is started but can also be selected 
+
+from the general menu.
+.para
+Choose to open an existing database or start a new one, or if ! is typed 
+when the program is first started, enter the program without opening a 
+database. Supply a project
+ 
+database name, and if it already exists, the "version". If starting a new
+
+database define the database size and if it is for DNA or protein sequences.
+The database size is an initial size for the database. It can be increased 
+later during the project. It is the sum of the number of gel
+readings plus the number of contigs.
+.para
+Database names can have from one to 12 letters and must not include full
+ 
+stop (.). The database is made from three separate files. If the database
+ is 
+called FRED then version 0 of database FRED comprises files FRED.AR0, 
+
+FRED.RL0 and FRED.SQ0. The version is the last symbol in the file names.
+
+Only this program
+ can read these files. If the "copy database" option is used it
+ 
+will ask the user to define a new "version". 
+.para
+For normal use the maximum gel reading length is set to 512 characters,
+
+but when a database is started the user may choose lengths of either
+
+ 512, 
+1024, 1536..., 4096. Normally the program is used to handle DNA 
+
+sequences but many of the functions also work on protein sequences. The
+ 
+choice of sequence type is made when the database is started.
+
+.para
+The contigs are not stored on the disk as the user sees them displayed on
+
+the screen. Each gel reading is stored with sufficient information about
+ 
+how it overlaps other gel readings so that the program can work out how
+ 
+to 
+present them aligned on the screen. We refer to this extra data as "the 
+relationships" and it is explained below.
+ 
+The database comprises 3 separate files.
+
+.left margin2
+          1.  a working version of each gel reading.  This is the  version  of
+          the  gel  reading
+that is in the database and initially it is an exact copy of
+          the original sequence (known as the archive)
+ but it is edited and manipulated to align  it
+          with other gel readings.
+
+.left margin2
+          2.  the file of  relationships.   This  file  contains  all  of  the
+
+          information  that  is required to assemble the working versions 
+into
+
+          contigs during processing;  any manipulations on the data  use  this
+
+          file   and  it  is  automatically  updated  at  any  time  that  the
+
+          relationships are changed.  The  information  in  this  file  is  as
+
+          follows:
+.left margin2
+          (A) Facts about  each  gel reading  and  its  relationship  to  
+others  
+("gel
+
+          descriptor lines"):
+
+.left margin2
+             (a) the number of the gel 
+reading   (each gel reading   is given a number  as  it  is
+
+          entered into the database)
+
+.left margin2
+             (b) the length of the sequence from this gel reading  
+
+.left margin2
+             (c) the position of the left end of this gel 
+reading   relative to the left
+
+          end of the contig of which it is a member
+
+.left margin2
+             (d) the number of the next gel 
+reading   to the left of this gel reading  
+
+.left margin2
+             (e) the number of the next gel reading   to the right
+
+.left margin2
+             (f) the relative strandedness of this gel 
+reading  , ie whether  it  is  in
+
+          the same sense or the complementary sense as its archive.
+
+.left margin2
+          (B) Facts about each contig ("contig descriptor lines"):
+
+.left margin2
+             (a) the length of this contig
+
+.left margin2
+             (b) the number of the leftmost gel 
+reading   of this contig
+
+.left margin2
+             (c) the number of the rightmost gel reading   of this contig.
+
+.left margin2
+          (C) General facts:
+
+.left margin2
+             (a) the number of gel readings in the database
+
+.left margin2
+             (b) the number of contigs in the database.
+
+.left margin2
+          3.  the file of archive names.  This is simply a list of  the  names
+
+          of each of the archive files in the database but on line number 
+
+          1000 we also store the size of the database. ie the number of lines 
+
+          of information allowed in the database files. This file always has 
+
+          1000 lines but the length of the file of relationships and the file 
+
+          of working versions can be set by the user when creating a 
+database
+
+          or when copying from one to another.
+
+
+
+
+.para
+          Structure of the database files
+.para
+          1.  The file of relationships
+.para
+	      The file contains IDBSIZ lines of data:
+          the general data are stored on line IDBSIZ;   data about  gel 
+readings  are
+          stored  from  line 1 downwards;  data about contigs are stored from
+          line IDBSIZ-1 upwards. A database of 500 lines containing 25 gel 
+readings and 4 contigs would have a file 
+          of relationships as is shown below.
+.lit
+
+
+                  ---------------------------------------------
+                     1  Gel descriptor record
+                     2   "      "       "
+                     3   "      "       "
+                     4   "      "       "
+                     5   "      "       "
+                     '   '      '       '
+                     '   '      '       '
+                    25   "      "       "
+                    26  Empty record
+                     '    '     '
+
+                     '    '     '
+                   495    '     '
+                   496  Contig descriptor record
+                   497    "        "        "   
+                   498    "        "        "
+                   499    "        "        "
+                   500   Number of gel readings=25, Number of contigs=4
+                  ---------------------------------------------
+
+          The arrangement of the data in the file of relationships
+
+.end lit
+As each new gel reading   is added into the database a new line is  added
+          to  the  end  of  the  list  of gel descriptor
+ lines.  If this new gel  reading  does not
+          overlap with any gel readings
+ already in the database a new contig  line  is
+          added  to  the top of the list of contig lines.  If it overlaps with
+          one contig then no new contig line need be added but if it  overlaps
+          with  two  contigs  then  these  two  contigs must be joined and the
+          number of contig lines will be reduced by one. Then the list of 
+contig
+          lines is compressed  to  leave  the empty line at the top of the list.
+          Initially the two types of line will move towards  one  another  but
+          eventually,  as  contigs  are joined, the contig descriptor lines will
+          move in the same direction as the  gel descriptor
+ lines.   At  the  end  of  a
+          project  there should  be only one contig line.  The database is thus
+          capable of handling a project of 998 gels.
+.para
+          Structure of the working versions file
+.para    
+        The working versions of gel readings are stored in  a  file  of
+          IDBSIZ lines each containing 512 characters.  Gel reading 
+number 1 is stored on line
+          1, gel reading number  2 on line 2 and so on.
+.para
+          Structure of the archive names file
+.para
+          This file, unlike the others, always has 1000 lines each 10
+          characters in length. Its length is fixed because line 1000 is used 
+          to store IDBSIZ the database size and the programs need a definite
+          location from which to read this number.
+.para
+                           Safeguarding the database
+.para
+                It is advisable to copy regularly (using the copy function  of
+          DS) from say copy 0 to copy 1 in case of errors.
+.para
+I also recommend setting the protection codes on copy 0 of each database 
+so 
+that users cannot delete the files without first resetting the protection 
+codes. This will protect you from accidently deleting the files. Users at 
+LMB can use the PROTECT command for this purpose.
+.para
+               The give-up options allow users to change their minds
+          about  entering  a  new  gel  reading or joining two contigs without
+          affecting the file of relationships.  BUT if the  edit
+          contig  option  from  either  of  these two functions has been 
+used
+the edits will
+          remain even though the user has "given up".  To leave the files 
+completely
+          unaffected  the  user  could,  if  required,  undo  any edits before
+          "giving up".
+.para
+  There  are  various  checks  within  the  programs  to
+          protect users from themselves:-
+.left margin2
+               1.  All user input is checked for errors - e.g.   reference  to
+               non-existent  gel 
+readings or  contigs,  incorrect  positions in the
+               contig or gel readings.
+.left margin2
+               2.  Before entering a gel reading the system checks to see if a
+               file of the same name has already been entered.
+.left margin2
+               3.  Join will not allow the circularising of a contig.
+.left margin2
+               4.        Both enter and join  functions  restrict  the  region
+               that  the  user  is  allowed to edit (using edit contig) to the
+               region of overlap.
+.left margin2
+5. Users may escape from any point in the program.
+.left margin2
+6. Help is available from all points in the program.
+.SK2
+.LEFT MARGIN2
+IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS 
+DOING 
+ANYTHING THAT INVOLVES CHANGING THE CONTENTS OF THE 
+DATABASE. I.E DURING AUTO ASSEMBLE,
+COMPLETE ENTRY, COMPLETE JOIN, COMPLEMENT CONTIG, EDIT CONTIG, AND SCREEN 
+EDIT.
+This could 
+corrupt the database so badly that it is impossible to fix. The program 
+should always be left using the QUIT option.
+
+.left margin1
+@4. TX 3 @Edit
+.LEFT MARGIN2
+.PARA
+ A simple commnd driven editor that can insert, delete and change gel 
+reading sequences.
+Insert, delete and change commands will request the position at which
+ 
+the edit is required and the number of characters to insert, delete or
+ 
+change. The default character for insertions is *.
+.para
+There are three modes of editing offered by this editor depending
+
+where it is selected from.  New gel readings
+
+can be edited as they are 
+being entered into the database, contigs can be edited with alignments
+ 
+being automatically maintained, or gel readings in contigs can be edited
+
+without the maintenance of alignments.
+.LEFT MARGIN2
+The following commands can be used.
+.lit
+  
+   ? = Help
+   ! = Quit
+   3 = Insert
+   4 = Delete
+   5 = Change
+
+.end lit
+.para
+All commands request the position at which the edit should be made.
+(Note that the position refers to the position in the contig for 
+gel readings in the database, but to the position in the gel 
+reading if you are editing a new gel reading while entering it into the 
+database.)
+.para
+All commands request the number of characters to operate on.
+               (Note that
+if you are editing a contig the program will ask for the 
+characters to insert into each separate gel reading, hence allowing 
+different changes to be made to each. Also the default character is 
+asterisk (*) - i.e if you include a space in the string it will be replaced 
+by an asterisk, or if you simply type return the whole string inserted will 
+be asterisks.)
+.LEFT MARGIN2
+"Change" allows  characters  in  individual  gel readings  to  be
+               replaced.
+If the user is not editing a new gel reading during "enter new gel reading" 
+the program will request the numer of the gel reading to edit.
+               (When editing gel readings in contigs
+the program responds with the relative position and  length  of
+               the  selected  gel reading 
+ in  case  the  the user only knows the edit
+               position relative to the  gel reading. 
+  (The  edit  position  must  be
+               relative to the contig.))
+.left margin2
+Further notes on editing
+.PARA
+         When you are editing a contig
+the program maintains the alignments of the gel readings
+          by always making the same number of insertions or deletions  in  
+all
+          the gels.  Note that these edits are immediately carried out and the
+          "Quit" options of "enter new gel reading" and "join contigs"
+do not undo them.  Users must
+          undo them themselves. Note that if this option has been entered 
+from
+          either "enter new gel reading" or "join contigs"
+the program will restrict edits  to
+          the  region  of  overlap.
+DO NOT KILL THE PROGRAM DURING EDIT CONTIG!
+.para
+When editing a single gel reading in a contig from "alter relationships"
+ (which you should not normally
+need to do) the program will 
+correct the length of the individual gel reading, but it will not update 
+the length of the contig if it has changed.
+.para
+The program contains better methods than this simple command driven editor,
+ for making 
+multiple edits to contigs. "Screen edit",
+gives access to the system editor on your machine, and "auto edit" will 
+edit a whole contig automatically.
+.left margin1
+@9. TX 3 @Screen edit
+.LEFT MARGIN2
+.para
+Gives access to the system editor on the machine (for example EDT on a VAX) 
+and allows users to edit contigs. The contigs are presented as for
+"display contig" and the program will 
+reconstitute the contig's sequences and relationships  when the editor is 
+exited.
+.para
+To screen edit a contig set the line length to 50 characters,
+select the contig to edit, and supply the name of a temporary file in which 
+the editing will be performed.
+After a short pause the system 
+editor will present the first page of the file. Edit the file obeying the 
+rules given below. Exit from the editor and affirm the intention of 
+returning the contig to the database. The program will put the contig 
+back into the database.
+.para
+Rules for screen editing
+.para
+There are some limitations on the changes that can be made to the contigs 
+when using the screen editor. Users are unlikely to want to break the 
+rules 
+in order  to achieve changes to contigs, but nevertheless the 
+constraints need to be defined and they are given below.
+.para
+Alignments must be maintained during editing.
+Whole lines of sequence should not be deleted or added unless the 
+order 
+of the gel readings in the contig is preserved.
+Each line in the 
+contig display consists of gel reading numbers, their names and 50 
+character sections of sequence. Insertions are limited in the following 
+way.
+No line of sequence can be extended rightwards more than 10 characters
+beyond the end of a full length line (a full length line is 50 characters 
+long). Only one character can be added to the left end of full length 
+lines, but sections of sequence beginning further into a line
+ can be extended leftwards up to an equivalent position. Do not delete any 
+non-sequence lines in the file.
+.para
+Before returning the contig to the database the program checks that the 
+rules have been obeyed. If an error is found the number of the erroneous 
+line in the 
+file is displayed and the contig will not be changed.
+.left margin1
+@5. TX 1 @Display a contig
+.LEFT MARGIN2
+.para
+Used to show the aligned gel readings for any part of a contig. The 
+
+number, name and strandedness of each gel reading is shown and the 
+
+consensus is written below.
+.para
+If required identify the contig,  and then the start and end points of the
+ 
+region to display.
+.para
+The display can be directed to a disk file using "direct output to disk".
+ 
+These files are required by options: "screen edit" and "highlight 
+
+disagreements", and printed copies of them
+are very useful for marking corrections prior to 
+
+using the editors.
+.para
+                Below is an example showing the left  end  of  a  contig  from
+          position  1 to 200.  Overlapping this region are gels 6,3,5,17and 12;
+6, 3 and 5 
+are in reverse orientation to their archives (denoted by a minus  sign)
+          There are a few uncertainty codes and a few padding
+          characters in the working versions, but the consensus  (shown  
+below
+          each page width) has a definite assignment for almost every 
+position.
+.lit
+
+                           10        20        30        40        50
+   -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+       CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                          110       120       130       140       150
+   -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+   -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+   -5  HINW.009                                        GGCAGCGACCGTCG
+   17  HINW.999                                           AGCGACCGTCG
+       CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                          160       170       180       190       200
+   -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+   -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+   17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+   12  HINW.017                                              GTAGAGC*
+       CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+.END LIT
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.PARA
+This option allows users to list text files on the screen. It can be used 
+to read a file containing notes, for checking files written to disk etc. The 
+user is asked to type the name of the file to list.
+.left margin1
+@8. TX 1 @Calculate a consensus
+.LEFT MARGIN2
+.para
+          Calculates a consensus sequence  either for  the whole database or 
+
+for selected contigs. The consensus is written to a file named by the
+ user.
+.left margin2
+Supply a file name, choose between  whole database or selected contigs.
+.para
+          Symbols for uncertainty in gel readings
+.para
+In  order  to  record uncertainties when reading gels the codes shown
+ 
+below can  be  used. Use  of these codes permits us to extract the
+
+maximum amount of data from each gel and yet record any doubts  by
+ 
+choice  of  code.   The program can deal with all of these codes and any 
+ 
+other  characters  in  a  sequence  are  treated  as  dash  (-) characters.
+
+
+.lit
+
+       SYMBOL                  MEANING
+
+         1             PROBABLY        C
+         2                "            T
+         3                "            A
+         4                "            G
+         D                "            C       POSSIBLY        CC
+         V                "            T          "            TT
+         B                "            A          "            AA
+         H                "            G          "            GG
+         K                "            C          "            C-
+         L                "            T          "            T-
+         M                "            A          "            A-
+         N                "            G          "            G-
+         R             A OR G
+         Y             C OR T
+         5             A OR C
+         6             G OR T
+         7             A OR T
+         8             G OR C
+         -             A OR G OR C OR T
+         a             A set by auto edit
+         c             C set by auto edit
+         g             G set by auto edit
+         t             T set by auto edit
+         *             padding character placed by auto assembler 
+          else = -
+
+.end lit
+
+.LEFT MARGIN2
+                           The DNA consensus algorithm
+.para
+The "calculate consensus" function, the "display  contig" routine and the
+ 
+"show quality" option use  the rules  outlined  here  to  calculate  a 
+ 
+consensus  from aligned gel readings.  Note that "display contig" 
+calculates 
+a consensus for  each  page  width  it displays  (it  does  not use the
+
+consensus sequence file calculated by the consensus function). 
+
+.LEFT MARGIN2
+.para
+We have 6 possble symbols in the consensus sequence: A,C,G,T,* and -. The 
+last symbols is assigned if none of the others makes up a sufficient 
+proportion of the aligned characters at any position in the contig. The 
+following calculation is used to decide which symbol to place in the 
+consensus at each position.
+.para
+Each uncertainty code contributes a score
+to one of A,C,G,T,*  and also to the total at each point. Symbols like R 
+and Y which don't correspond to a single base type contribute only to the 
+total at each point. The scores are shown below.
+.lit
+              definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+              probable assignments ie 1,2,3,4 = 0.75
+
+              other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+
+.end lit
+.para
+A cutoff score of 51% to 100% is supplied by the user. (When the program 
+starts this is set to 75%. See "set display parameters").
+At each position in the contig we calculate the total score for each of 
+the 5 symbols 
+A,C,G,T and * (denote these by Xi, where i=A,C,G,T or *), 
+and also the sum of these totals 
+(denote this by S). Then if 100 Xi / S > the cutoff for any i, symbol i is 
+placed in the consensus; otherwise - is assigned.
+.para
+Notice that S does not equal the number of times the sequence has been 
+determined, but is the score total, and hence we are less likely to put a - 
+in the consensus. For the "examine quality" algorithm each strand is 
+treated separately but the calculation is the same. (It was originally 
+different).
+.para
+Format of the consensus sequence ( and vector sequences).
+.para
+A consensus sequence file may contain the consensus for several contigs
+ 
+and so we identify each of them by preceding them by a 20 character
+ 
+title. The title is of the form <---LAMBDA.076-----> ( where LAMBDA is 
+
+the project name and gel reading number 
+
+
+          76 is the leftmost gel 
+reading to contribute to  this  consensus  sequence).
+
+
+          The  angle  brackets  <>  and the three digit number precede by a . 
+
+are important to some processing programs.
+.left margin1
+@25. TX 1 @Show relationships 
+.LEFT MARGIN2
+.para
+   Used to show the relationships of the gel readings in the database in 
+
+three ways -
+.LEFT MARGIN2
+               (a) All contig descriptor lines followed by all gel  descriptor
+               lines.
+.LEFT MARGIN2
+               (b) All contigs one after the  other  sorted,  i.e.   for  each
+               contig  show its  contig descriptor line followed by all its
+               gel descriptor lines sorted on position from left to right
+.LEFT MARGIN2
+               (c) Selected contigs:  show the contig  line  and,  in  order,
+               those gel readings that cover a user-defined region.
+               Note that this output can be directed to a disk file by 
+               prior selection of "disk output".
+.LEFT MARGIN2
+.para
+                Below is an example showing a contig from position
+          1 to 689.  The left gel reading  is number 6 and has archive 
+name HINW.010, the 
+rightmost gel  reading is number 2 and is has archive name HINW.004.
+On each gel  descriptor  line  is  shown:
+          the name of the archive version, the gel number, the position of the
+          left end of the gel reading  relative to the left  end  of  the  contig,  the
+          length  of  the gel 
+reading  (if this is negative it means that the gel reading  is in
+          the opposite orientation to its archive), the number of the  gel
+reading   to
+          the left and the number of the gel reading  to the right.
+.lit
+
+
+ CONTIG LINES
+ CONTIG      LINE  LENGTH               ENDS
+                                     LEFT   RIGHT
+               48     689               6       2
+ GEL LINES
+ NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                     LEFT   RIGHT
+ HINW.010       6        1   -279       0       3
+ HINW.007       3       91   -265       6       5
+ HINW.009       5      137   -299       3      17
+ HINW.999      17      140    273       5      12
+ HINW.017      12      193    265      17      18
+ HINW.031      18      385   -245      12       2
+ HINW.004       2      401   -289      18       0
+
+.end lit
+.left margin1
+@21. TX 3 @Enter new gel reading 
+.LEFT MARGIN2
+.para
+Used to enter new gel readings into the  
+database. The new gel reading must have previously been compared with 
+the 
+contents of the database by use of " auto assemble"  in order to ascertain 
+if it overlaps any previously entered data. 
+.para
+The user is expected to know: if 
+the gel reading overlaps; if so which contig it overlaps; if so where it 
+overlaps. The program takes the user through a series of question to 
+establish the nature of the overlap and then displays the overlap. The 
+user 
+is then offered a number of options, including editors for the new gel 
+reading and the contig, to enable the correct alignment of the gel reading 
+throughout its whole length.
+.left margin2
+
+Supply the name of the gel reading file.
+If the gel 
+reading  has  been entered before the program will  not permit 
+
+entry. 
+The program gives the gel reading a unique number and asks if the 
+
+sequence overlaps any data already in the database (reported by "auto 
+
+assemble").
+
+If it does not, entry is complete.
+If it does overlap the
+ 
+dialogue 
+continues with the program asking if the gel readings overlaps "in the
+ 
+normal sense", if not it will automatically complement the sequence.
+
+Then supply the number of the contig the gel reading overlaps (as
+ 
+reported by "auto assemble").
+.para
+Overlaps are divided into two types: those for which the new gel reading
+ 
+protrudes from the left end of the contig it overlaps, and those for which
+ 
+it does not. The program asks about this with the question "Left end of
+gel 
+reading is inside contig". If this is true the program will go on to ask for
+ 
+the position in the contig of the left end of the new gel reading. If it is
+ not 
+true the program will ask for the position in the new gel reading of the
+
+left end of the contig. 
+.para
+Once this is completed the program will display the first 50 bases of
+ 
+the overlap. 
+The gel readings in the contig and their consensus are displayed with the 
+
+new gel reading underneath. The mismatches are shown by *'s on the 
+next 
+line down. 
+For example:
+.lit
+
+
+                           60        70        80        90       100
+   -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+   -3  HINW.007                                            GGCACA*GTC
+       CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACACGTC
+       NEWGEL      CACAAGCGAGCGAGAGGGGCACCGTGACGTGGTCACGCCGGGGACACGTC
+       MISMATCH                  *                         * *       
+                           10        20        30        40        50
+
+.end lit
+.para
+The program then needs to know if the position of the  left end of the 
+overlap is correct.
+
+If it is the user should type return, if not, 1 and the program will ask for 
+the 
+new position and display it.
+
+.LEFT MARGIN2
+The program now offers a number of  options  to  allow  the
+               user  to align the new gel reading 
+correctly over its whole length with
+               the  data  already  in  the  contig.   It  is  important   that
+               sufficient  edits  are  made  to the new gel reading 
+or the sequences in the
+               contig at this stage to get the alignment correct, because  once
+               entry  is completed, the alignment is fixed and cannot easily be
+               changed (see "alter relationships"). 
+  Alignment  can  be  achieved  
+by  making
+               insertions  or  deletions  but  deletion  of  data requires the
+               original gels to be checked.   For  this  reason  at  entry  we
+               usually make only insertions to achieve alignment.  We use X or
+               asterisks (*) as padding characters to achieve alignment  and 
+ so  can, if required,
+               distinguish  padding  characters  from characters assigned from
+               reading gels.  
+.LEFT MARGIN2
+.para
+The options available are:
+.lit
+   ? = HELP
+   ! = Give up
+   3 = Complete entry
+   4 = Edit contig
+   5 = Display overlap
+   6 = Edit new gel reading
+
+.end lit
+
+.sk1
+.para
+1. HELP gives this information.
+.para
+2. Give up allows users to change their minds about entering the new gel 
+reading. The program will ask the user to 
+confirm this choice.
+.para
+3. Complete entry is the command to add the new gel reading to the 
+contig. The 
+program updates the relationships accordingly. The user is asked to 
+confirm 
+this command.
+.para
+4. Edit contig gives the user access to a simple editor that allows 
+insertions, deletions and changes to be made to the contig. The editor 
+maintains alignments by making the same number of insertions or 
+deletions 
+in all sequences covering the edit position.
+The  program
+               protects the user          by  allowing edits only within
+               the region of overlap.
+.para 
+5. Display allows display of the region of overlap only.  This
+               is defined by the relative positions in the contig. The 
+               default is the whole of the region of overlap.
+.para
+6. Edit new gel reading allows the new gel reading to be edited using a 
+simple editor. 
+.left margin1
+@23. TX 3 @Complement a contig
+.LEFT MARGIN2
+.PARA
+               This function will complement and reverse all of the gel 
+readings in  a
+          contig.    It   automatically  reverses  and  complements  each  gel
+          reading sequence, reorders left and right neighbours, recalculates  
+relative
+          positions and changes each strandedness.
+.PARA
+               The only user  input  required  is  to  identify the  contig  to
+          complement  by  the  number or name of a gel reading it contains.
+DO NOT KILL THE 
+PROGRAM DURING THIS STEP!
+.left margin1
+@22. TX 3 @Join contigs
+.LEFT MARGIN2
+.PARA
+               This function joins contigs interactively.  It allows the
+          user  to align the ends of the two contigs by editing each
+          contig separately.  It is important that the alignment  achieved  is
+          correct because once the join is completed the alignment is fixed.
+The program needs to know which two contigs to join and where they 
+overlap.
+.para
+First which two contigs are to be joined.
+
+The user should identify the two 
+contigs. First the left contig and then the right.
+The program checks that the two contig numbers are different (it will not 
+allow circles to be formed!)
+.para
+Now identify the exact position of overlap. This is defined as
+ 
+the position in the left contig that the leftmost character of the right
+ 
+contig overlaps. Normally the position is established by employing the
+ 
+end gel reading for option "auto assemble".
+ 
+    The overlap must be of at least one character.     The program then
+ 
+displays the join  showing  all  the  gel readings overlapping
+ 
+the join from the left contig, their consensus, all the gel readings 
+from the right
+ 
+contig that  overlap  the  join,  their consensus  and  then asterisks to
+ 
+denote mismatches between the two consensuses. For example:
+
+.lit
+
+                         1460      1470      1480      1490      1500
+   56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+   33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+  -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+   19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+   -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+   -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+   -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+       CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+       MISMATCH                         *                      ****** 
+                           10        20        30        40        50
+
+.END LIT
+.para
+It is essential that the user aligns the two contigs throughout the whole 
+region of overlap before completing the join because it is only at this 
+stage that the two contigs can be edited independently. Once the join is 
+completed the alignment can only be altered using the routines supplied 
+by 
+"alter relationships". The program offers the user options to facilitate the 
+alignment of the two contigs.  These options are:-
+.LEFT MARGIN2
+.lit
+
+   ? = Help
+   ! = Give up
+   3 = Complete join
+   4 = Edit left contig
+   5 = Display joint
+   6 = Edit right contig
+   7 = Move join
+
+.end lit
+.LEFT MARGIN2
+1. Help gives this information.
+.LEFT MARGIN2
+
+2. Give up allows the user to return to the main options without 
+completing the join. Note any edits made will remain.
+.LEFT MARGIN2
+
+3. Complete join instructs the program to update the relationships so
+ that 
+the two contigs are joined. DO NOT KILL THE PROGRAM DURING COMPLETE 
+JOIN!
+.LEFT MARGIN2
+
+4. Edit left contig and edit right contig give access to a simple editor that 
+allows insertions, 
+
+deletions and changes to be made to the  contigs. Help is available on 
+editing once the editing option is selected. The user is only allowed to 
+edit within the region of overlap and should make sure that the positions 
+used correspond to the correct contig.
+
+.LEFT MARGIN2
+5. Display join displays the joint as shown above.
+
+.LEFT MARGIN2
+6. See above.
+
+.LEFT MARGIN2
+7. Move join allows the position of the joint to be changed.
+
+.left margin1
+@24. TX 1 @               Copy the database
+.LEFT MARGIN2
+.PARA
+Used to make a copy of the database. If required the database size can be 
+
+altered using this option. The "version" of a database is  encoded as the
+ 
+last letter in the names of the three files that contain the database.
+
+.para
+Supply a "version" number (the default is version 1), and if required
+
+select a new size for the database. The size of a database is the number
+ of 
+lines of information it can hold. It needs a line for each gel reading and
+ 
+another for each contig.
+.left margin1
+@19. TX 1 @               Check database
+.LEFT MARGIN2
+.para
+Used to perform a check on  the  logical  consistency  of  the
+          database. No user intervention is required.
+.para
+  The following relationships are checked:
+.LEFT MARGIN2
+               1.       If gel reading A thinks gel reading B is its left
+ neighbour
+ 
+does B think A  is
+               its right neighbour?
+                The error message is
+.left margin2
+"Hand holding problem for gel reading A"
+.left margin2
+followed by  the
+               gel descriptor lines for gel readings A and B.
+.LEFT MARGIN2
+               2.       Are there any contig lines with no left or right
+end gel readings?
+                The error message is
+.left margin2
+"Bad contig line number A"
+.LEFT MARGIN2
+               3.       Do the gel readings that are described as left ends on  
+contig
+               lines agree that they are left ends?
+                The error message is
+.left margin2
+"The end gel readings of contig A have outward neighbours"
+.LEFT MARGIN2
+               4.       Are there gel readings that are in more than one contig?
+                The error message is
+.left margin2
+" Gel number A is used N times"
+.LEFT MARGIN2
+               5.       Are there gel readings that are not in any contig?
+                The error message is
+.left margin2
+" Gel number A is not used"
+.LEFT MARGIN2
+               6.       Do the relative positions of  gel readings  agree  with  
+their
+               position as defined by left and right neighbourliness?
+                The error message is
+.left margin2
+" Gel number A with position X is left neighbour of  gel  number  B  with 
+position Y"
+.LEFT MARGIN2
+               7.       Are there any loops in  contigs?   If  so  no  further
+               checking is done.
+                The error mesage is
+.left margin2
+" Loop in contig n no further checking done, but gel reading numbers follow"
+.left margin2
+   The
+               program  then  prints the gel reading numbers in the looped 
+contig up 
+to
+               the start of the loop.
+.LEFT MARGIN2
+8. Are there any contigs of length <1? The error message is
+.left margin2
+" The contig on line 
+number x has zero length"
+.LEFT MARGIN2
+9. Are there any gel readings (used in only one contig) that have zero 
+
+length? The error 
+message is
+.left margin2
+" Gel number N has zero length"
+.left margin2
+Note that "auto assemble"  also uses this logical consistency check and
+ will 
+only tolerate a "Gel number N
+ is not used" error. Any other error will cause it to 
+
+give up.
+
+.left margin1
+@29. TX 1 @               Examine quality 
+.LEFT MARGIN2
+.para
+Analyses the quality of the data in a contig. It reports on the proportion
+ 
+of the consensus that is "well determined" and will display a sequence of
+ 
+symbols that indicate the quality of the consensus at each position.
+
+.para
+Identify the contig to analyse, and the section of interest. The current
+ 
+consensus calculation cutoff score will be used to decide if each position 
+is 
+"well determined". In general the quality of a reading deteriorates along 
+the length of the gel and so it is also possible to use a length cutoff for 
+the quality calculation. Only the data from the first section of each reading 
+will be included in the quality calcualtion. The length is altered under 
+"set parameters" and is initially set to the maximum reading length.
+A summary showing the percentage of the consensus 
+that falls into each category of quality is shown. Choose whether or not to 
+have the quality codes for each position of the consensus displayed. 
+They can be displayed as either graphics or text.
+.para
+The quality of the data depends on the number of times it has been
+
+sequenced and the particular uncertainty codes  used  in each  gel
+
+reading.  This function divides the data into five categories, assigning
+
+each 
+a symbol or code: 
+.LEFT MARGIN2
+                1.  Well determined on both strands and they agree.  code=0
+.LEFT MARGIN2
+                2.  Well determined on the plus strand only.  code=1
+.LEFT MARGIN2
+                3.  Well determined on the minus strand only.  code=2
+.LEFT MARGIN2
+                4.  Not well determined on either strand.  code=3
+.LEFT MARGIN2
+                5.  Well determined on both strands but they disagree.  code=4
+.LEFT MARGIN2
+ A position is "well determined" if it is assigned one of the symbols 
+A,C,G,T when the algorithm described in the section "calculate a 
+consensus".
+The calculation is performed
+separately for each strand.
+.para
+If the user chooses to have the data displayed graphically the following 
+scheme is used. A rectangular box is drawn so that the x coordinate
+represents the length of the contig. The box is notionally
+divided vertically into 
+5 possible levels which are given the y values: -2,-1,0,1,2.
+The quality codes attributed to each base position are plotted as 
+rectangles.
+Each rectangle represents a region in 
+which the quality codes are identical, so a single base having a different 
+code from its immediate neighbours will appear as a very narrow rectangle.
+.lit
+  
+  Rectangle bottom and top y values
+
+     Quality 0 rectangle from 0 to 0
+     Quality 1 rectangle from 0 to 1
+     Quality 2 rectangle from 0 to -1
+     Quality 3 rectangle from -1 to 1
+     Quality 4 rectangle from -2 to 2
+.end lit
+.para
+Obviously a single line at the midheight shows a perfect sequence.
+.para
+Typical dialogue is shown below.
+.lit
+
+   41.47% OK on both strands and they agree(0)
+   55.48% OK on plus strand only(1)
+    2.08% OK on minus strand only(2)
+    0.97% Bad on both strands(3)
+    0.00% OK on both strands but they disagree(4)
+  ? (y/n) (y) Show sequence of codes 
+
+           10         20         30         40         50
+   1111111111 1111111111 1111111111 1111111111 1111111111
+
+           60         70         80         90        100
+   1111111111 1111111111 1111111111 3111111111 1111111111
+
+          110        120        130        140        150
+   1111111111 1111131111 1111111111 1111111111 1111111111
+
+          160        170        180        190        200
+   1111111111 1111111111 1111111111 1111111111 1111111133
+
+          210        220        230        240        250
+   1311111111 1111111111 1111111110 0000000000 0000220000
+
+          260        270        280        290        300
+   0000000000 0020000000 2200000202 0002000000 0000222200
+
+.end lit
+.left margin1
+@26. TX 3 @               Alter relationships
+.LEFT MARGIN2
+.para
+Used to make what are normally illegal changes to the database. That is
+
+the normal checks are not done and any item in the database can be 
+changed independently of all others. Users need to know what they are
+ 
+doing because it is very easy to make a horrible mess. Always start by
+ 
+making a copy!
+.para
+By using the options here users can edit individual gel readings in contigs, 
+move one section of a contig relative to another, break contigs, remove 
+contigs, remove gel readings, etc. To give flexibility most
+ of the commands do only one thing. This means that several commands 
+may 
+have to be executed to complete any change. At the end of this help 
+section 
+there are notes on removing gel readings from the database.
+.para
+The following options are offered:
+.lit
+
+   ? = HELP
+   ! = QUIT
+   3 = Line change
+   4 = Edit single gel reading
+   5 = Delete contig
+   6 = Shift
+   7 = Move gel reading
+   8 = Rename gel reading
+   9 = Break a contig
+
+.end lit
+.left margin2
+1. HELP gives this information.
+.left margin2
+
+2. QUIT returns to the main options of SAP.
+.left margin2
+
+3. Line change 
+.left margin2
+            allows the user to change the contents  of  any line in the
+                
+file of relationships.  The line is selected by number, the
+ 
+               program prints the current line and prompts for the new  line.
+
+.left margin2
+4.   Edit      
+.left margin2
+allows  the  user  to   edit   an   individual   gel reading
+               independently of any others it may be related to. The edit 
+positions are relative to 
+               the contig. The effect of this editing on the length of the
+               gel reading is taken care of but, if it changes the length of
+ a contig,
+               or its relationship to others, this must be accounted for (if 
+               necessary) by use of the "line change" function.
+
+.left margin2
+5.  Delete  contig 
+.left margin2
+is a function that deletes a contig line  by moving down  
+               all  the  contig lines above by one position.  It prompts only 
+               for the line to delete.  It does not  delete  any  of  the gel 
+readings
+               or gel reading 
+lines for the deleted contig but it does reduce the 
+               number of contigs on line IDBSIZ by 1.
+
+.left margin2
+6.  Shift   
+.left margin2
+   allows the user to change all the relative  positions of  a  
+               set  of  neighbouring gel 
+readings by some fixed value, i.e.  it will
+               shift related gel readings
+ either left or right.  It can therefore
+               be  used  to  change the alignment of the gel 
+readings in a contig 
+               or as part of the process of breaking a contig into two parts 
+               (see below).  It prompts for the number of the first gel 
+reading to 
+               shift and  then  for the  distance  to  move  them (Note a 
+               negative value will move the gel readings
+ left and a positive value 
+               right).   It  then chains rightwards (ie follows right 
+               neighbours) and shifts each gel 
+reading,  in  turn,  up to the  end  
+               of the contig.  (This means that only those gel readings
+ from the first
+               to shift to the rightmost are moved). It updates the length of 
+               the contig accordingly.
+
+.left margin2
+7. Move gel reading
+.left margin2
+    is a function to renumber a gel reading. It moves all the information 
+               about a gel 
+reading on to another line. The user must specify the 
+number
+               of the gel  reading 
+to move and the number of the line to place it. It 
+               takes care of all the relationships. Of course gel 
+readings must not be
+               moved to lines occupied by other gel 
+readings! It can be used as part 
+               of the process of removing a gel 
+reading from the database (see below).
+
+.left margin2
+8.  Rename gel reading
+.left margin2
+ is a function that is used to  rename  the archive  names  of  
+               gel 
+readings  in the database;  it only changes the name in the .ARN 
+               file of the  database.
+
+.sk1
+.LEFT MARGIN2
+ 9. Break contig
+.LEFT MARGIN2
+.PARA
+Occasionaly it is necessary to break a contig into two parts and this can be 
+achieved using this option. The program needs only the number of a gel 
+reading. This is the gel  reading that will become a left end after the 
+break. That 
+is, the break is made between this gel 
+reading and its left neighbour. A new contig 
+line is created so ensure that there is sufficient space in the database.
+.left margin2
+Removing gel readings from contigs
+.left margin2
+.PARA
+Gel 
+readings can be removed from contigs if they are not essential for holding the 
+contig together (ie are not the only gel reading covering a particular region). 
+Suppose the gel reading to remove is gel number 
+b with left neighbour a and right 
+neighbour c.
+Using "line change" change the right neighbour of a to c, and the left 
+neighbour of c to a. To tidy things up: suppose there are x gel 
+readings in the 
+database; then, using "move gel reading" move gel x to line b; then, using 
+"line change" 
+decrease the number of gel 
+readings in the database (stored in the last line) by 1.
+.left margin1
+@27. TX 1 @  Set display parameters
+.LEFT MARGIN2
+.para
+Used to redefine the parameters that control the cutoff employed by the
+ 
+consensus calculation and quality examiner, the maximum length of each 
+reading to include in the quality calculation, the line length used by
+ 
+the display function, the text window length used by the graphics 
+options, and the graphics window length used by the graphics options.
+.para
+The default cutoff score is 75%. The default line length is 50 characters. 
+For protein sequences the cutoff is always 100%.
+.para
+The text window used by the graphics options controls the amount of 
+sequence listed at the crosshair position. The graphics window controls the 
+"zoom" function. Both these windows are defined as the number of bases that 
+should be shown, to both left and right of the crosshair.
+.left margin1
+@30. TX 3 @  Auto edit a contig
+.left margin2
+.para
+This function automatically changes characters in gel readings to make 
+
+them agree with the consensus sequence. If employed as is intended, use 
+
+of this function is not a criminal activity but a method that saves a large 
+
+amount of work. All characters changed by the auto editor will appear in 
+
+the gel readings as lowercase letters. The current consensus calculation 
+cutoff score is used.
+.para
+Identify the contig and the section to edit. The program will display a 
+
+summary of changes made. Note that it is important to understand both 
+
+what the auto editor does and the order in which it does it. Before 
+
+employing the auto editor users should note all the corrections that they 
+require, so that  after it has been used the corrections can be checked. 
+
+.para
+ The 
+general strategy employed when collecting shotgun sequence data is to let 
+the contigs get fairly deep, to get a printout of a contig, 
+check problems against the 
+films, note corrections on the printout, and
+make the changes using an interactive editor.  
+In general the consensus is correct except for places where padding 
+characters have been used to accommodate a single gel with an extra 
+character, or where the consensus is dash. The important point for the 
+auto  
+editor is that
+most edits simply make the 
+gel readings conform to the consensus, or remove columns of pads.
+.para
+The new editor does the following.
+.para
+1) calculates a consensus for the contig (or part of a contig) to be 
+edited, and then uses this consensus to direct the editing of the contig
+in 3 stages
+.para
+2) stage 1: find and correct all places where, if the order of two adjacent 
+characters is swapped, they will both agree with the consensus (given 
+that 
+they did not match the consensus before). These corrections are termed 
+"transpositions"
+.para
+3)  stage 2: find and correct all places where there is a definite consensus 
+but the gel reading has a different character. These corrections are 
+termed 
+"changes".
+.para
+4) stage 3: delete all positions in which padding is the consensus. These 
+corrections are termed "deletions".
+.para
+All changed characters are shown in lowercase letters so it will be 
+obvious which 
+characters have been assigned by the program (except for deletions). The 
+number of each type of correction will be displayed.
+
+.LEFT MARGIN1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears graphics from the screen.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN1
+.para
+ Clears  text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler.
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the x axis of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first base to be marked (for example if the active 
+region is 1501 to 8000, the user might wish to mark every 1000th base 
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.left margin1
+@14. TX 2 @Reposition plots
+.LEFT MARGIN2
+.para
+The positions of each of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots for
+each option are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "ANALMARG" when the 
+program is started. Users can have their own file if required.
+As all the plots start 
+at the same position in x and have the same width, x0 and xlength are the 
+same for all options. Generally users will only want to change the start 
+level of the window y0 and its height ylength. 
+ This option 
+allows users to change window positions whilst running the program.
+The routine prompts first for the number of the option that the users 
+wishes 
+to reposition; then for the y start and height; then for the x start and 
+length. Note that changes to the x values affect all options. If the user 
+types only carriage return for any value it will remain unchanged. 
+Note that, unlike all the other programs, the boxes used to contain 
+analytical results (eg plot quality) should not be made to overlap one 
+another, as the function of the crosshair routine depends on which box the 
+crosshair is in!
+overlap
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written on the 
+diagram with its left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=15
+Type label then drive cross hair to left or right end
+of label position then hit  "L"  to  write label left
+justified or  "R"  to  write label right justified or
+the space bar to quit
+ 
+ 
+? Label=delta gene
+
+ missing graphics 
+
+? Label=
+ 
+.end lit
+.left margin1
+@16. TX 2 @Display a map.
+.LEFT MARGIN2
+.para
+This draws a map 
+of any sequence features selected by the user.
+These features may be protein coding regions (CDS), tRNA genes (TRNA), 
+promoter positions (PRM), etc. Users may define their own feature table 
+key 
+names. For example I find it convenient to split CDS lines into CDS1, 
+CDS2 
+and CDS3 each of which contains only those sequences that code in the 
+reading frames 1, 2 or 3. Then I can plot them at different heights on 
+the screen ( suitable heights can be determined by using the cross-hair).
+The coordinates must be stored in a file in the format of an EMBL feature 
+table. 
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=16
+ Display a map using an EMBL feature table file
+? map file name=hsegl1.ft
+? feature code(e.g. CDS) =CDS
+X 1 + strand
+  2 - strand
+  3 both strands
+? 0,1,2,3 =
+? level (0-9480) (256) =4000
+
+ missing graphics 
+ 
+? feature code(e.g. CDS) =
+
+.end lit
+.left margin1
+@7. TX 1 @Redirect output
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@13. TX 2 @Use crosshair
+.left margin2
+This option puts a steerable cross on the screen which the user 
+drives around 
+by using the arrow keys (or mouse). When the crosshair is 
+visible a number of options are available if the user types one of a 
+set of special keyboard characters. Any other characters will cause 
+an exit from the crosshair option. The special keys are:
+.lit
+
+    I = Identify the nearest gel reading
+    Z = Zoom in
+    Q = plot Quality
+    S = display the aligned Sequences at the crosshair position
+    N = list the Names and Numbers of the sequences at the crosshair
+.end lit
+.para
+In order for any of these special keys to operate, the crosshair 
+must be in an appropriate display box, and the precise function of 
+the keys will also depend on which box the crosshair is in.
+.para
+ If the 
+crosshair is in the "plot all contigs" box, Z will cause a new box to 
+appear showing all the readings for the nearest contig; Q will give 
+the same as Z but will also produce an extra box showing the 
+"quality" plot.
+.para
+ If Z is hit in the "plot single contig" box, the contig will be zoomed 
+to the current graphics window size. The zoom will be roughly 
+centred on the crosshair position. Because of this it is possible to 
+step along a contig by repeatedly zooming with the crosshair near 
+to one end of the single contig display box. If I is hit the crosshair 
+must be close to a gel reading line. If Q is hit, the quality plot will 
+be produced for the region shown in the plot single contig box. In 
+all cases when the "plot all contigs" box is shown, a vertical line will 
+bisect the line the represents the relevent contig, at the current 
+position.
+.para
+If the crosshair is in the plot quality box only the character "s" will operate 
+as a special symbol.
+.para
+The number of bases shown in the N and S options is controlled by 
+the current graphics text window size, and the size of the zoom 
+window by the current graphics window size. Both are set by the 
+parameter setting function of the general menu.
+.left margin1
+@33. TX 2 @Plot single contig
+.left margin2
+This option produces a schematic of a selected region of a single 
+contig by drawing a horizontal line to represent each of its gel 
+readings. The lines show the relative positions of each reading and 
+also their sense. The plot is divided vertically into two sections by 
+a line that is identified by an asterisk drawn at each end. All lines  
+that lie above this line represent readings that are in their original 
+sense, all lines below show readings that are in the 
+complementary sense to their original. By use of the crosshair 
+function the plot can be stepped through and examined in more 
+detail. See help on crosshair.
+.left margin1
+@34. TX 2 @Plot all contigs
+.left margin2
+This option produces a schematic of all the contigs in a database. It 
+does this by drawing a horizontal line to represent each of them. 
+In order to show the ends of each contig it draws the lines for 
+contigs at alternate heights: the first at height one, the 
+second at height two, the third at height one, etc. The order of the 
+contigs in the display is the same as their order in the database. 
+By use of the crosshair function the plot can be stepped 
+through and examined in more detail. See help on crosshair.
+.left margin1
+@31. TX 3 @ Type in gel readings
+.left margin2
+This option allows gel readings to be typed in at the keyboard. It creates 
+a separate file for each gel reading and a file of file names for the 
+batch. The sequences from each batch may be listed when they have all been 
+entered. Users may choose to employ special keys to identify the 4 bases 
+A,C,G and T. By default these special keys are N M , . but any other four 
+characters may be used. If special keys are used the characters are 
+automatically translated to A C G T before being stored on the disk.
+.left margin1
+@35. TX 1 @ Find internal joins
+.left margin2
+The purpose of this function is to use data already in the database to
+find possible joins between contigs.
+Joins may have been missed due to poor data or may have not been made
+due to repeated sequences. Where appropriate, it may be 
+possible to find potential
+joins by using the data clipped off readings prior to their entry into the
+database.
+.left margin2
+The database is checked for logical consistency. Supply a minimum initial
+match length, a minimum alignment block, the maximum pads per sequence,
+the maximum percent mismatch after alignment, the probe length. Choose
+if clipped data is to be used, if so define the window size for finding good
+data and the number of dashes allowed in the window. Processing will commence.
+Most of these values are used in an identical way in the autoassemble 
+function. The others are defined below. 
+.left margin2
+The program strategy
+.left margin2
+Take the first contig and calculate its consensus. If clipped data is being
+used examine all readings that
+are in the complementary orientation, and sufficiently near to the contigs left
+end, to see if they have good clipped sequence which if present, would 
+protrude 
+from the left end of the contig. If found add the longest such sequence to the 
+left end of the consensus. Do the same for the right end by examining  
+readings that are in their
+original orientation. If any are found add the longest extension to the 
+right end of
+the consensus. Repeat the consensus calculations and extensions 
+for all contigs hence producing an extended consensus. If clipped data is not
+being used simply calculate the consensus for the whole database. Now
+look for possible joins by processing the extended consensus in the following
+way. Take the last, say 100, bases (termed the "probe length" by the program)
+of the rightmost consensus, compare it both
+orientations with the extended consensus of all the other contigs. Display
+any sufficiently good alignments. Repeat with the left end of the rightmost
+contig. Do the same for the ends of all the entended contigs, always only
+comparing with the contigs to their left, so that the same matches do not 
+appear twice.
+.left margin2
+Good cliped data is defined by sliding a window of "Window size for good data
+scan" bases outwards
+along the sequence and stopping when "Maximum number of dashes in scan window"
+ or more dashes appear in the window.
+Note that
+it is advisable to have some sort of cutoff because if we simply take all the
+data it might be so full of rubbish that we wont find any good matches. For
+the same reason it is worth trying the procedure with different cutoffs. An
+initial run using no clipped data is also recommended.
+Sufficiently good
+alignments are defined by criteria equivalent to those used in autoassemble,
+however here we only display alignments that pass all tests.
+.left margin2
+Bugs
+.left margin2
+If a small contig is wholly contained within a larger one, such that its
+ends are further than ("Probe length" - "Minimum initial match length")
+from the ends of the larger contig, and the consensus for the small 
+contig lies to the left
+of the consensus for large contig, the overlap will not be discovered. (See
+the search stratgey).
+.left margin2
+ All numbering is
+relative to base number one in the contig: matches to the left (i.e. in
+the clipped data) have negative
+positions, matches off the right end of the contig (i.e. in the clipped
+data) have positions 
+greater than that of the contig length. A typical result is shown below.
+.lit
+
+ Right end of contig   22 in the - sense  and contig   96
+ Percentage mismatch after alignment =  3.0
+        628        638        648        658        668        678
+          GTGAGATGAG CATATTTAAA ATGAACCGAG CAGTTAGGAG ATATGTTGGG AGGACAAGAA
+           ********* ********** ********** ********** ********** **********
+          -TGAGATGAG CATATTTAAA ATGAACCGAG CAGTTAGGAG ATATGTTGGG AGGACAAGAA
+        -86        -76        -66        -56        -46        -36
+        688        698        708        718
+          ACATCCGGGA TACAGTCAAT AAATGAAAAA TTAATGAATT
+          ********** ********** ****** *** ***** ****
+          ACATCCGGGA TACAGTCAAT AAATGA-AAA TTAATTAATT
+        -26        -16         -6          4
+
+.end lit
+
diff --git a/help/SIP.RNO b/help/SIP.RNO
new file mode 100644
index 0000000..8c5838a
--- /dev/null
+++ b/help/SIP.RNO
@@ -0,0 +1,1431 @@
+.NPA
+.SP 1
+.left margin1
+@-1. TX  0 @General
+.sp
+@-2. T   0 @Screen control
+.sp
+@-2. X   0 @Screen
+.sp
+@-3. TX  0 @Set parameters
+.sp
+@-4. TX  0 @Comparison
+.sp
+@0.  TX -1 @SIP
+.PARA
+This is program for comparing and aligning nucleic acid or protein 
+sequences. It can produce optimal alignments using a dynamic 
+programming algorithm, and has several ways of producing "dot matrix" 
+diagrams. 
+.PARA
+The following analyses (preceded by their option numbers) are included:
+.sp
+.para
+                The program is  used  on  a  simple  graphics  terminal  ie  a
+          keyboard  with a screen on which points and lines can be drawn.  
+The
+          user  works  at  the  terminal  and  produces  plots   for   various
+          combinations  of values for the span length and minimum scores. 
+However large or small a region  the  user
+          elects  to  compare  the program expands or contracts the diagram 
+so
+          that the plot always fills the screen.  This allows the user to gain
+          an  overall  impression  or  to  "home-in" on particular regions and
+          examine them in more detail.   Having  found  a  region  that  looks
+          interesting  the  user  can  determine  its  coordinates in terms of
+          sequence positions by use of a crosshair facility.
+.para
+                The program has two  statistical  options  to  help  the  user
+          choose  score  levels for plotting and to assess the significance of
+          any similarity found.  It can  produce  a  cumulative  histogram  of
+          observed  scores  for  the current span length and region and it can
+          calculate the "double matching probability" of McLachlan. 
+The
+          "double   matching   probability"  is  the  probability  of  finding
+          particular  scores  given  two  infinitely  long  sequences  of  the
+          composition  of  those  being compared, with the current span 
+length
+          and score matrix.  By using these options the  user  can  choose  to
+          plot   all   the  matches  for  which  the  score  exceeds  a  given
+          significance  level  (such  as  1%),  using  either   empirical   or
+          theoretical  probability values.  Generally it is best to begin at a
+          low level to avoid an overcrowded diagram.
+.para
+               If the user finds that the two sequences do  contain  stretches
+          of  homology  he will often want to align the sequences by 
+inserting
+          padding characters at deletion points.  The program has a  
+selection
+          of  options  for this purpose:  it contains an alignment routine; it 
+can display on the screen the two
+          sequences, one above the other, with asterisks  marking  
+identities,
+          it  has inbuilt editing functions and can save the aligned sequences
+          on disk files.  
+.para
+               The basic principle of dot matrices was  first
+          described  by Gibbs and McIntyre and involves producing a diagram
+          that contains a representation of all the matches between a pair  
+of
+          sequences.   This  diagram  is  then  scanned  by  eye and the human
+          ability to recognise patterns used to detect any  similarities  that
+          might be present.  The diagram consists of a two dimensional plot 
+in
+          which the x axis represents one sequence (A)  and  the  y  axis  the
+          other  (B).   Every point (i,j) on the plane x,y is assigned a score
+          which corresponds  to  the  level  of  similarity  between  sequence
+          characters A(i) and B(j).  In the simplest use of the method a score
+          of 1 could be assigned to every point (i,j) where A(i) = B(j), and a
+          score  of  0  to  every other point.  If a plot of the points in the
+          plane was made in which all scores of 1 were marked with a  dot  
+and
+          all  those  of 0 left blank then regions of identity would appear as
+          diagonal lines.  With the comparison  displayed  in  this  form  the
+          human eye is very good at detecting regions of homology even if 
+they
+          are imperfect.  The effects of mismatches, insertions  or  
+deletions
+          can  be  seen:   matches interrupted by insertions or deletions will
+          appear as parallel diagonals, and matches  interrupted  by  the  odd
+          mismatching  pair  of  characters  will  appear  as broken collinear
+          diagonal lines. This diagram is  a  very  useful  representation  but
+          simply  placing a dot for every identity is of limited value for the
+          following reasons.
+.para
+                For nucleic acid sequences around 25% of the plot will contain
+          points   and   it  will  often  be  very  difficult  to  distinguish
+          significant homologies  from  chance  matches.   For  proteins  
+many
+          significant alignments of sequences contain almost no identities 
+but
+          are formed from chemically and structurally similar amino  acids  
+so
+          that  simply  looking  for  identity would be insufficient.  What is
+          required is to first find those points  that  correspond  to  fairly
+          strong  local  similarities  and  then  to  use the diagram of these
+          points so that the human eye can be used to look  for  larger  scale
+          homologies.   The program uses a number of different algorithms to 
+calculate the
+          score for each point and the user defines a minimum  score  so  
+that
+          only  those  points  in  the diagram for which the score is at least
+          this value will be marked with a dot. 
+.para
+ The first scoring  method  finds  the  longest uninterrupted sections of 
+perfect identity i.e.
+          those that contain no  mismatches,  insertions  or  deletions. 
+Generally this method, termed "the identities algorithm" is of little 
+value, but runs very quickly.
+.para
+  The
+          second   method  looks  for  sections  where  a  proportion  of  the
+          characters in the sequence are similar, again allowing no 
+insertions
+          or deletions. For a thorough analysis this  method, termed "the 
+proportional algorithm", is the best.        
+.para
+The original method, of this type was  first
+          described  by  McLachlan  and involves calculating a score for
+          each position in the matrix by summing  points  found  when  
+looking
+          forwards  and  backwards  along  a  diagonal line of a given length.
+          This length, called the span, must be an odd number so that the dot 
+marking matches can be precisely placed at its centre.
+The algorithm does not simply look for identity  but  uses  a
+          score  matrix  that  contains  scores  for  every  possible  pair of
+          characters.  For comparing amino acid sequences  we  usually
+use  the  score
+          matrix  shown  below which was calculated by adding 10 (to make
+          every term >0) to each term of the relatedness odds matrix MDM78  
+of
+          Dayhoff.  This matrix MDM78 was calculated by looking at accepted
+          point mutations in 71 families of closely related proteins  and,  of
+          those  tested  by  Dayhoff,  was found to be the most powerful 
+score
+          matrix  for  finding  distant  relationships  between   amino   acid
+          sequences.
+.left margin1
+.lit
+
+                           AMINO ACID SCORE MATRIX
+                           -----------------------
+
+   C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ?  
+C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10
+S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10
+T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10
+P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10
+A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10
+G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10
+N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10
+D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10
+E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10
+Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10
+B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10
+Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10
+H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10
+R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10
+K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10
+M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10
+I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10
+L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10
+V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10
+F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10
+Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10
+W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10
+- 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+  10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+.end lit
+.para
+It is also possible to use other matrices, including an identity matrix for 
+proteins. For nucleic acids we usually use the matrix shown below.
+.lit
+
+         DNA SCORE MATRIX
+
+             A C G T X 
+           A 1 0 0 0 0 
+           C 0 1 0 0 0 
+           G 0 0 1 0 0 
+           T 0 0 0 1 0 
+           X 0 0 0 0 0 
+.end lit
+.left margin2
+.para
+Plotting dots at the centres of spans that reach the cutoff leads to a 
+persistence effect that, to some extent, can be mitigated by a  variation 
+on the method. If, for example, all the high scoring amino acids are 
+clustered at the left end of a particular diagonal segment, dots will 
+continue to be plotted to their right until the span score drops below the 
+cutoff. Instead of plotting a single point for each span that reaches the 
+cutoff score, the variant  method  plots points for all the identities that 
+lie in spans that reach the cutoff. Obviously the persistence effect can be 
+more pronounced for long spans and low cutoff scores, but note that the 
+variant method will not plot anything if there are no identities present, 
+and so similar regions could be missed!
+.para
+A further variant, useful for comparing a sequence against itself, ignores 
+the main diagonal.
+.para
+The third comparison method called "quick scan"  is really a combination 
+of the first two, and is similar to the FASTP program of Lipman and 
+Pearson, but produces a dot matrix diagram. The algorithm is as follows. 
+The dot matrix positions are found for all words of some minimum length 
+(obviously length 1 is most sensitive) that are common to both 
+sequences. Imagine a diagonal line running from corner to corner of the 
+diagram, at right angles to the diagonals in the dotmatrix,  The scores 
+for the common words (according to the current score matrix, e.g. 
+MDM78) are accummulated at the appropriate positions on
+that imaginary line, hence  producing a 
+histogram. The histogram is analysed to find its mean and standard 
+deviation. The diagonals that lie above some cutoff score (defined in 
+standard deviation units), are rescanned using the proportional 
+algorithm, and a diagram produced. The method is very fast, and is also 
+employed  by the library comparison program.
+.para
+The dynamic programming alignment algorithm contained in the program 
+is based on that of Miller and Myers (). It guarantees to produce 
+alignments with the optimum score given a score matrix, a gap start 
+penalty, and a gap extension penalty. That is, starting a gap costs a fixed 
+penalty (IG) and each residue added to the gap incurs a further penalty 
+(IH) so that for each gap of length K residues the penalty is IG + k*IH. 
+Gaps at the ends of sequences incur no penalty.
+.para
+It is very useful to have the dot matrix methods and the alignment 
+routine together in the same program because it allows users to produce 
+a dot matrix diagram to help select which regions of the sequence they 
+wish to align. Selection is made by use of the crosshair. First the 
+crosshair is positioned at the bottom left hand end of the segment to be 
+aligned. The crosshair function is quit and immediately selected again, 
+the crosshair positioned at the top right of the segment, and the 
+crosshair function quit. When the alignment routine is selected the 
+segment will be aligned. 
+.para
+The alignment can replace the original segment of the sequence. By 
+repeated plotting of dot matrices, followed by alignment, very long 
+sequences can easily be aligned.
+.LEFT MARGIN1
+@1. TX 0 @Help
+.LEFT MARGIN2
+.para
+This option gives online help. The user should select option numbers and
+the current documentation will be given. 
+.PARA
+The following analyses (preceded by their option numbers) are included:
+.lit
+ ? = Help
+ ! = Quit
+ 3 = read a new sequence
+ 4 = define active region
+ 5 = list the sequence
+ 6 = list a text file
+ 7 = direct output to disk
+ 8 = write active sequence to disk
+ 9 = edit the sequences
+10 = clear graphics screen
+11 = clear text screen
+12 = draw a ruler
+13 = use cross hair
+14 = reposition plots
+15 = label diagram
+16 = display a map
+17 = apply identities algorithm
+18 = apply proportional algorithm
+19 = list matching spans
+20 = set span length
+21 = set proportional score
+22 = set identities score
+23 = calculate expected scores
+24 = calculate observed scores
+25 = show current parameter settings
+26 = quick scan
+27 = draw a /
+28 = align the sequences
+29 = complement the sequences
+30 = switch main diagonal
+31 = switch identities
+32 = change score matrix
+.end lit
+.left margin1
+@2. TX 0 @Quit
+.left margin2
+.para
+This function stops the program.
+.left margin1
+@3. TX 1 @Read a new sequence
+.LEFT MARGIN2
+.para
+This option allows users to read in new sequences, browse through annotations,
+ or search sequence 
+libraries for keywords. Sequences can be read from "personal" 
+sequence files or from sequence libraries. These are referred to as the 
+sequence "source". Personal files can be stored in several formats:
+Staden, PIR, EMBL, GENBANK and GCG.
+At LMB we use "Staden" format for sequencing and all 
+the 
+libraries are stored in their original formats. Note, however, that libraries
+such as EMBL or GenBank that are divided into several files (eg GenBank has
+13 separate files) are indexed as a whole. This means that users do not need
+to know which file contains an entry, only which library.
+When the user selects to read in a sequence the program first asks for the 
+sequence "source". 
+.para
+If the user selects "personal" the program will ask for 
+the format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of 
+the file. For PIR format the user will also be required to know the entry 
+name of the sequence as the file can contain several. For the other formats
+only a single entry is expected. The file will be read, its length and
+composition will be displayed and the option left.
+.para
+If the user selects "library" as the sequence source the program will display a
+list of available libraries. The programs are capable of handling all current
+libraries but which ones are available will vary from site to site. At LMB we
+have several libraries and also weekly updates of data gathered between releases.
+The program will ask users to select a library and then give a list of options:
+.lit
+
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entrynames from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+
+.end lit
+If get a sequence or get annotations is selected users will be asked to 
+type the entry name. The option will be left when a sequence is selected or 
+! is typed. The composition and length will be displayed.
+.para
+The text index contains all words from feature tables, reference titles,
+definition lines, keywords lists and comments, so the text index search
+is most useful. It is also the fastest. Up to 5 words can be searched for
+at once. The words should be typed separated by spaces, for example
+.lit
+ ? Keywords=P53 mouse murine tumo
+
+.end lit
+will search for all entries that contain words starting with p53, mouse,
+murine and tumo. Only the unique entries that contain ALL words will be 
+listed. Before listing the matching entries
+the program will show the number of 'hits' for each word and ring the bell.
+Escape is possible at this point, or after each screenfull of entries.
+In addition to the entry names the text search displays the primary accession 
+number, the sequence length and up to 80 characters of description.
+(The search of 'titles' is now redundant because the full text index
+contains all the title words and the search is much faster. It will probably
+be removed from the program.)
+All searches are independent of case. Where
+possible the program will offer default entry names.
+.para
+Typical dialogue follows.
+.lit
+Select sequence source
+X  1 Personal file
+   2 Sequence library
+? Selection  (1-2) (1) =
+Select sequence file format
+X  1 Staden
+   2 EMBL
+   3 GenBank
+   4 PIR
+   5 GCG
+? Selection  (1-5) (1) =
+? Sequence file name=M13MP7.SEQ
+ Contig title removed
+Sequence length=  7238
+ Sequence composition
+          T          C          A          G          -
+      2405.      1539.      1765.      1527.         2.
+        33.2%      21.3%      24.4%      21.1%       0.0%
+  .
+  .
+  .
+
+
+ Select sequence source
+ X  1 Personal file
+    2 Sequence library
+ ? Selection  (1-2) (1) =2
+ Select a library
+ X  1 EMBL 29 nucleotide library Dec 91
+    2 SWISSPROT 20 protein library Nov 91
+    3 PIR 31 protein library Dec 91
+    4 NRL3D 58 From Brookhaven protein library Dec 91
+    5 GenBank
+ ? Selection  (1-5) (1) =
+Library is in EMBL format with indexes
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =5
+ Search for keywords
+ ? Keywords=P53 mouse
+P53 hits  68
+MOUSE hits  8180
+
+ MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+ MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+ MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+ MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+ MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+ MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+ MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+ MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+ MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+ MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+ MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+ MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+ MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+ MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+ MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+ MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+ MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+ MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+ MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+ MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+ MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+ MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+      22 different entries found
+
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =4
+ Search for keywords
+ ? Keywords=alpha
+ Searching for alpha
+ AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+ AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+ AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+ AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+ ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+ ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+ ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+ ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+ AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+ AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+ AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+ AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+ AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+ AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+ AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+ AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+ AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =3
+ ? Accession number=v00636
+Entry name LAMBDA
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =2
+ Default Entry name=LAMBDA
+ ? Entry name=
+ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+XX
+AC   V00636; J02459; M17233; X00906;
+XX
+DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+DT   09-JUN-1982 (Rel. 1, Created)
+XX
+DE   Genome of the bacteriophage lambda (Styloviridae).
+XX
+KW   circular; coat protein; DNA binding protein; genome;
+KW   origin of replication.
+XX
+OS   Bacteriophage lambda
+OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+XX
+RN   [1]
+RP   1-48502
+RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+RT   "Nucleotide sequence of bacteriophage lambda DNA";
+RL   J. Mol. Biol. 162:729-773(1982).
+XX
+!
+ Select a task
+ X  1 Get a sequence
+    2 Get annotations
+    3 Get entry names from accession numbers
+    4 Search titles for keywords
+    5 Search text index for keywords
+ ? Selection  (1-5) (1) =
+ Default Entry name=LAMBDA
+ ? Entry name=
+DE   Genome of the bacteriophage lambda (Styloviridae).
+ Sequence length  48502
+ Sequence composition
+           T          C          A          G          -
+      11988.     11360.     12336.     12818.         0.
+         24.7%      23.4%      25.4%      26.4%       0.0%
+
+.end lit
+.left margin1
+@4. TX 1 @Define active region
+.LEFT MARGIN2
+.para
+For its analytic functions 
+the program always works on a region of the sequence called the active 
+region. When a new sequence is read into the program the active region is 
+automatically set to start at the beginning of the sequence and go
+up to the 
+maximum allowed size of active region the  program can 
+handle. The positions are shown on the screen.
+On most machines this will be to the end of the sequence.
+This option allows the user define a different region.
+.left margin1
+@5. TX 1 @List a sequence
+.LEFT MARGIN2
+.para
+The sequence can be listed with line lengths from 
+10 to 120 in multiples of 10.  The output looks like:
+.lit
+
+    87         97        107        117        127        137
+     KVKCTGRILE VPVGRGLLGR VVNTLGAPID GKGPLDHDGF SAVEAIAPGV IERQSVDQPV
+      **      * ****   ***   * ** * *  **         *    **    *        
+     DVKDLEHPIE VPVGKATLGR IMNVLGEPVD MKGEIGEEER WAIHRAAPSY EELSNSQELL
+    68         78         88         98        108        118
+   147        157        167        177        187        197
+     QTGYKAVDSM IPIGRGQREL IIGDRQTGKT ALAIDAIINQ RDSGIKCIYV AIGQ
+      ** *  * *  *   *       *    ***       * *             *   
+     ETGIKVIDLM CPFAKGGKVG LFGGAGVGKT VNMMELIRNI AIEHSGYSVF AGVG
+   128        138        148        158        168        178
+
+.end lit
+.left margin1
+@6. TX 1 @List a text file
+.LEFT MARGIN2
+.para
+Allows the user to have a text file displayed on the screen. It will appear 
+one page at a time.
+.left margin1
+@7. TX 1 @Direct output to disk
+.LEFT MARGIN2
+.para
+Used to direct output that would normally appear on the screen to a file. 
+.para
+Select redirection of either text or graphics, and 
+supply the name of the file that the output should be written to.
+.para
+ The results from the next options selected will not appear on the screen 
+but will be written to the file. When option 7 is selected again
+the file will be 
+closed and output will again appear on the screen.
+.left margin1
+@8. TX 1 @Write active region to disk
+.LEFT MARGIN2
+.para
+This option allows users to 
+write the current active sequence to a disk file in Staden format. 
+.left margin1
+@9. TX 1 @Edit the sequences
+.LEFT MARGIN2
+.para
+This function allows the user to insert or delete parts of either sequence 
+to help align them. The inserted characters are dashes.
+.left margin1
+@10. TX 2 @Clear graphics
+.LEFT MARGIN2
+.para
+ Clears the screen of both text and graphics.
+.left margin1
+@11. TX 2 @Clear text
+.LEFT MARGIN2
+.para
+ Clears only text from the screen.
+.left margin1
+@12. TX 2 @Draw a ruler
+.LEFT MARGIN2
+.para
+This option
+allows the user to draw a ruler or scale along the axes of the screen to 
+help identify the coordinates of points of interest. The user can define 
+the position of the first sequence element to be marked
+ (for example if the active 
+region is 1501 to 8000, the user might wish to mark every 1000th 
+element 
+starting at either 1501 or 2000 - it depends if the user wishes to treat 
+the active region as an independent unit with its own numbering starting 
+at 
+its left edge, or as part of the whole sequence). The user can also define 
+the separation of the ticks on the scale and their height. If required the 
+labelling routine can be used to add numbers to the ticks.
+.PARA
+To escape type !
+.left margin1
+@13. TX 2 @Use cross hair
+.LEFT MARGIN2
+.para
+This function puts
+a steerable cross on the screen that can be used to find the 
+coordinates of points in the sequence. The user can move the cross 
+around using the directional keys; when he hits the space bar the 
+program will write out the coordinates of the cross in sequence units and 
+the option will be exited.
+.para
+If instead, 
+the user hits a , the position will be displayed but the cross will remain on 
+the screen.
+.para
+If a letter s is hit the sequences around the cross hair are displayed as a 
+short alignment (as shown below) and the cross remains on the screen.
+.lit
+        97        107
+         VPVGRGLLGR VVNTLGAPID
+         ****   ***   * ** * *
+         VPVGKATLGR IMNVLGEPVD
+        78         88
+
+.end lit
+.PARA
+If a letter m is hit the sequences around the cross hair are displayed in 
+the form of a matrix (as shown below) and the cross remains on the screen.
+
+.lit
+
+   VPVGKATLGRIMNVLGEPVD
+  D...................DD
+  I..........I.........I
+  P.P...............P..P
+  A.....A..............A
+  G...G....G......G....G
+  L.......L......L.....L
+  T......T.............T
+  N............N.......N
+  VV.V..........V....V.V
+  VV.V..........V....V.V
+  R.........R..........R
+  G...G....G......G....G
+  L.......L......L.....L
+  L.......L......L.....L
+  G...G....G......G....G
+  R.........R..........R
+  G...G....G......G....G
+  VV.V..........V....V.V
+  P.P...............P..P
+  VV.V..........V....V.V
+   VPVGKATLGRIMNVLGEPVD
+
+.end lit
+.para
+The function is also used prior to "align sequences" in order to delineate the 
+region to be aligned. The crosshair is positioned at the bottom left of the 
+region, the crosshair option quit. Then the crosshair option is selected 
+again, and the crosshair moved to the top right of the region to be 
+aligned. 
+.left margin1
+@14. TX 2 @Reposition plots
+.LEFT MARGIN2
+.para
+The position of the plots is defined relative to a users drawing 
+board which has size 1-10,000 in x and 1-10,000 in y.
+Plots 
+are drawn in a window defined by x0,y0 and xlength,ylength. 
+Where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+All values are in drawing board units (i.e. 1-10,000, 1-10,000).
+The default window positions are read from a file "DIAGMARG" when the 
+program is started. Users can have their own file if required.
+ This option 
+allows users to change window positions whilst running the program.
+If the user 
+types only carriage return for any value it will remain unchanged. 
+The cross-hair can be used to choose suitable heights.
+.LEFT MARGIN1
+@15. TX 2 @Label a diagram
+.LEFT MARGIN2
+.para
+This routine allows users to label any diagrams they have produced. They 
+are asked to type in a label. When the user types carriage return to finish 
+typing the label the cross-hair appears on the screen. The user can 
+position it anywhere on the screen. If the user types R (for right justify)
+ the label will be 
+written on the diagram with its right end at the cross-hair position. 
+If the user types L (for left justify) the label will be written with its 
+left end at the cross hair position.
+The 
+cross-hair will then immediately reappear. The user may put the same 
+label 
+on another part of the diagram as before or if he hits the space bar he 
+will be asked if he wishes to type in another label.
+.left margin1
+@16. TX 2 @Display a map
+.LEFT MARGIN2
+.para
+NOT AVAILABLE.
+This draws a map 
+of any sequence features selected by the user.
+These features may be protein coding regions (CDS), tRNA genes (TRNA), 
+promoter positions (PRM), etc. Users may define their own feature table 
+key 
+names. 
+The coordinates must be stored in a file in the format of an EMBL feature 
+table. 
+.left margin1
+@17. TX 4 @Apply identities algorithm
+.LEFT MARGIN2
+.para
+                The identities algorithm finds runs of identical characters 
+in the sequence. Its main value is speed, being 100's of times faster than 
+the proportional algorithm. It is of course not very sensitive, and should 
+only be used for a quick scan. The cutoff score is the minimum number of 
+consecutive matching characters.
+All runs of identical characters that are at least as long as the cutoff 
+score will produce a dot on the screen.
+.para
+See also quick scan.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d17
+? Identity score (1-20) (2) =3
+Working
+
+ missing graphics
+
+.end lit
+.left margin1
+@18. TX 4 @Apply proportional algorithm
+.para
+                        This method, generally  the  most  useful,  was  first
+          described  by  McLachlan  and involves calculating a score for
+          each position in the matrix by summing  points  found  when  
+looking
+          forwards  and  backwards  along  a  diagonal line of a given length.
+          This length, called the span, must be an odd number.
+The algorithm does not simply look for identity  but  uses  a
+          score  matrix  that  contains  scores  for  every  possible  pair of
+          characters.  At each point that a threshold score is achieved the 
+program marks the screen in one of two ways. It will either place a 
+single 
+dot at the position corresponding to the centre of the matching span, or 
+it 
+will plot a dot for each identical residue within each matching span.
+Alternatively, the "list matching spans" 
+option will list the segments that match.
+.para
+For comparing amino acid sequences  we  usually use  the  score
+          matrix  shown  below which was calculated by adding 10 (to make
+          every term >0) to each term of the relatedness odds matrix MDM78  
+of
+          Dayhoff.  This matrix MDM78 was calculated by looking at accepted
+          point mutations in 71 families of closely related proteins  and,  of
+          those  tested  by  Dayhoff,  was found to be the most powerful 
+score
+          matrix  for  finding  distant  relationships  between   amino   acid
+          sequences. 
+.left margin1
+.lit
+
+                           AMINO ACID SCORE MATRIX
+                           -----------------------
+
+   C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ?  
+C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10
+S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10
+T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10
+P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10
+A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10
+G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10
+N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10
+D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10
+E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10
+Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10
+B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10
+Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10
+H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10
+R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10
+K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10
+M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10
+I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10
+L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10
+V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10
+F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10
+Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10
+W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10
+- 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+  10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+.end lit
+One alternative for proteins is to use an identity matrix.
+For comparing nucleic acids we usually use the matrix shown below.
+.lit
+
+         DNA SCORE MATRIX
+
+             A C G T X 
+           A 1 0 0 0 0 
+           C 0 1 0 0 0 
+           G 0 0 1 0 0 
+           T 0 0 0 1 0 
+           X 0 0 0 0 0 
+.end lit
+See option 32 for how to change the score matrices.
+.para
+When a sequence is compared against itselt to look for repeats it is 
+possible to use the proportional algorithm in a mode such that the main 
+diagonal is not shown. See option 30.
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=d18
+? Odd span length (1-401) (11) =
+? Proportional score (1-297) (132) =
+Working
+
+ missing graphics
+
+.end lit
+.left margin1
+@19. TX 4 @List matching spans
+.LEFT MARGIN2
+This option applies the proportional algorithm using the current span and 
+cut-off score, but instead of drawing a dot matrix it lists all the 
+matching spans. When a sequence is compared against itselt to look for 
+repeats it is 
+possible to use this algorithm in a mode such that the main 
+diagonal is not listed. See option 30.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d19
+? Odd span length (1-401) (11) =
+? Proportional score (1-297) (132) =148
+List matching spans
+Working
+     76
+IEVPVGKATLG
+LEVPVGRGLLG
+     95
+     77
+EVPVGKATLGR
+EVPVGRGLLGR
+     96
+     78
+VPVGKATLGRI
+VPVGRGLLGRV
+     97
+     79
+PVGKATLGRIM
+PVGRGLLGRVV
+     98
+     85
+LGRIMNVLGEP
+LGRVVNTLGAP
+    104
+     86
+GRIMNVLGEPV
+GRVVNTLGAPI
+    105
+     87
+RIMNVLGEPVD
+RVVNTLGAPID
+    106
+
+.end lit
+.left margin1
+@20. TX 3 @Set span length
+.para
+                        The proportional algorithm
+calculates a score for
+          each position in the matrix by summing  the 
+points  found  when  looking
+          forwards  and  backwards  along  a  diagonal line of a given length.
+          This length, called the span, should be an odd number  so  that  the
+          score  for  any  point  is correctly positioned at the centre of the
+          span.  This option allows the user to define the span length. It 
+should be noted that short spans can produce noisy diagrams, but are less 
+affected by insertions and deletions than are long spans. However long 
+spans can detect more  distant relationships. Long spans can suffer from 
+a 
+persistence problem by plotting dots when all the "signal" is to one side 
+of the spans central position. To help avoid this, the option that plots 
+the position of all matching residues within a matching span, can be 
+tried. 
+This is most useful if an identity matrix is being used.
+.left margin1
+@21. TX 3 @Set proportional score
+.LEFT MARGIN2
+.para
+                        The proportional algorithm
+calculates a score for
+          each position in the matrix by summing  the 
+scores for the individual amino acids found  when  looking
+          forwards  and  backwards  along  a  diagonal line of a given length.
+All points at which the proportional score is achieved will produce a dot 
+on the diagram. (The same score is used for the 'LIST MATCHING SPANS' 
+option.)
+.para
+Before chosing a score the user can apply the routine that will calculate 
+the expected score, or can calculate a histogram of observed scores. It is 
+best to start with a high score to avoid an overcrowded diagram.
+.left margin1
+@22. TX 3 @Set identities score
+.LEFT MARGIN2
+.para
+The identities algorithm is of limited value as it only finds runs of 
+matching characters, however it has the virtue of being very fast.
+ This option allows the user to set the minimum length 
+of run that will produce a dot on the screen.
+.left margin1
+@23. TX 3 @Calculate expected scores
+.left margin2
+.para
+This function calculates the "double matching probability" of McLachlan.
+The
+          "double   matching   probability"  is  the  probability  of  finding
+          particular  scores  given  two  infinitely  long  sequences  of  the
+          composition  of  those  being compared, with the current span 
+length
+          and score matrix.  By using this option the  user  can  choose  to
+          plot   all   the  matches  for  which  the  score  exceeds  a  given
+          significance  level  (such  as  1%). 
+Generally it is best to begin at a
+          low level to avoid an overcrowded diagram.
+.para   
+When the calculation of the expected scores
+is  finished  the program offers
+               the user 3 ways of examining the results:
+.LEFT MARGIN2   
+                "Show probability for a score" allows the user  to  type  in  a  
+score  and  the
+                 program responds with the probability of achieving that level
+                 of score.
+.LEFT MARGIN2   
+                "Show score for a probability" allows the user to type in a 
+probability value and
+                 the program types the score that corresponds to that level of
+                 probability.
+.LEFT MARGIN2   
+                "List scores and probabilities" is the command to list out the  
+scores  and  their
+                 corresponding  probabilities.   The user is asked to supply a
+                 further parameter, the "number of steps between scores", and 
+the program only lists
+                 every stepsize point.  e.g a stepsize of 5 will get every 5th
+                 score listed.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=d23
+? Odd span length (1-401) (11) =
+? Proportional score (1-297) (132) =
+ 
+Working
+Average score=   103.18557
+RMS deviation=     7.85276
+X 1 Show probability for a score
+  2 Show score for a probability
+  3 List scores and probabilities
+? 0,1,2,3 =
+ 
+? Show probability for score (1-165) (134) =160
+Probability of score    160 is 0.0000000008
+X 1 Show probability for a score
+  2 Show score for a probability
+  3 List scores and probabilities
+? 0,1,2,3 =2
+? Show score for probability (0.0000000001-1.) (0.00001) =0.0000001
+Score for probability 0.0000001000 is   153
+  1 Show probability for a score
+X 2 Show score for a probability
+  3 List scores and probabilities
+? 0,1,2,3 =3
+? Number of steps between scores (1-10) (5) =
+ 
+     0  0.10000E+01    100  0.67232E+00    200  0.18977E-20
+     5  0.10000E+01    105  0.42119E+00    205  0.42561E-22
+    10  0.10000E+01    110  0.20671E+00    210  0.87767E-24
+    15  0.10000E+01    115  0.78860E-01    215  0.16651E-25
+    20  0.10000E+01    120  0.23515E-01    220  0.27300E-27
+    25  0.10000E+01    125  0.55406E-02    225  0.00000E+00
+    30  0.10000E+01    130  0.10443E-02    230  0.00000E+00
+    35  0.10000E+01    135  0.15935E-03    235  0.00000E+00
+    40  0.10000E+01    140  0.19906E-04    240  0.00000E+00
+    45  0.10000E+01    145  0.20569E-05    245  0.00000E+00
+    50  0.10000E+01    150  0.17758E-06    250  0.00000E+00
+    55  0.10000E+01    155  0.12938E-07    255  0.00000E+00
+    60  0.10000E+01    160  0.80360E-09    260  0.00000E+00
+    65  0.10000E+01    165  0.43009E-10    265  0.00000E+00
+    70  0.10000E+01    170  0.20049E-11    270  0.00000E+00
+    75  0.99997E+00    175  0.82263E-13    275  0.00000E+00
+    80  0.99949E+00    180  0.29998E-14    280  0.00000E+00
+    85  0.99448E+00    185  0.98050E-16    285  0.00000E+00
+    90  0.96543E+00    190  0.28934E-17    290  0.00000E+00
+    95  0.86836E+00    195  0.77556E-19    295  0.00000E+00
+  1 Show probability for a score
+  2 Show score for a probability
+X 3 List scores and probabilities
+? 0,1,2,3 =!
+ 
+
+.end lit
+.left margin1
+@24. TX 3 @Calculate observed scores
+.left margin2
+.para
+This option applies the proportional algorithm to the currently active 
+sequence but instead of producing a 
+dot matrix it calculates a histogram of observed scores.
+             The speed of this calculation
+               of course depends on the size of the active 
+regions, but  when  it
+               is  completed  the  program offers the user 3 ways of examining
+               the results:
+.para   
+ "Show percentage for score" allows the user to type in a score and the 
+program
+                 responds  with  the  percentage  of  points that achieve this
+                 value.
+.para
+ "Show percentage for score" allows the user to type in a percentage and  
+the
+                 program  responds  with  the  corresponding score.  Values of
+                 this  score  and  above  are  only  achieved  by  the   given
+                 percentage of points.
+.para
+ "List scores and percentages" is the command to  list  out  the  scores  
+and  the
+                 percentage of points achieving them.
+ .para
+Typical dialogue follows.
+.lit
+? Menu or option number=24
+Working
+Maximum observed score is    152
+X 1 Show percentage reaching a score
+  2 Show score for a percentage
+  3 List scores and percentages
+? 0,1,2,3 =
+ 
+? Show percentage for score (1-152) (114) =144
+Percentage of points with score    144 is   0.005486297
+X 1 Show percentage reaching a score
+  2 Show score for a percentage
+  3 List scores and percentages
+? 0,1,2,3 =2
+ 
+? Show score for percentage (0.00001-1.) (0.001) =0.01
+Score for percentage   0.010000000 is   143
+  1 Show percentage reaching a score
+X 2 Show score for a percentage
+  3 List scores and percentages
+? 0,1,2,3 =
+ 
+? Show score for percentage (0.00001-1.) (0.001) =1.
+Score for percentage   1.000000000 is   124
+  1 Show percentage reaching a score
+X 2 Show score for a percentage
+  3 List scores and percentages
+? 0,1,2,3 =3
+? Number of steps between scores (1-10) (5) =1
+ 
+   73   236953  0.10000E+03
+   74   236951  0.99999E+02
+   75   236951  0.99999E+02
+   76   236950  0.99998E+02
+   77   236945  0.99996E+02
+   78   236942  0.99995E+02
+   79   236929  0.99989E+02
+   80   236900  0.99977E+02
+  
+  missing data here
+
+  130      384  0.16206E+00
+  131      307  0.12956E+00
+  132      239  0.10086E+00
+  133      180  0.75964E-01
+  134      134  0.56551E-01
+  135      103  0.43468E-01
+  136       78  0.32918E-01
+  137       67  0.28276E-01
+  138       46  0.19413E-01
+  139       40  0.16881E-01
+  140       33  0.13927E-01
+  141       29  0.12239E-01
+  142       24  0.10129E-01
+  143       19  0.80184E-02
+  144       13  0.54863E-02
+  145       10  0.42202E-02
+  146        8  0.33762E-02
+  147        7  0.29542E-02
+  148        7  0.29542E-02
+  149        6  0.25321E-02
+  150        5  0.21101E-02
+  151        3  0.12661E-02
+  152        3  0.12661E-02
+  1 Show percentage reaching a score
+  2 Show score for a percentage
+X 3 List scores and percentages
+? 0,1,2,3 =!
+ 
+.end lit
+.left margin1
+@25. TX 3 @Show current parameter settings
+.LEFT MARGIN2
+.para
+This function lists the names of the current sequences, their total
+lengths, the start 
+and end points of the active sequence and the current values of span and 
+cut-off scores. It also shows if the main diagonal will be shown, or if 
+the 
+proportional algorithm will mark all identities in matching spans.
+.para
+Typical dialogue follows.
+.lit
+? Menu or option number=25
+Horizontal sequence
+ALPHA.PRT
+Positions
+     1 TO    514
+Vertical sequence
+BETA.PRT
+Positions
+     1 TO    461
+Span length=    11
+Scores
+Proportional=   132
+Identities=     3
+Identites off
+Main diagonal shown
+
+
+.end lit
+.left margin1
+@27. TX 2 @Draw a /
+.left margin2
+.para
+This option simply draws a diagonal line from the bottom left of the 
+diagram to the top right. it can be an aid when trying to align the 
+sequences.
+.left margin1
+@26. TX 4 @Quick scan
+.left margin2
+.para
+The algorithm is as follows. The dot matrix positions are found for all 
+words of some minimum length (obviously length 1 is most sensitive) 
+that are common to both sequences. Imagine a diagonal line running from 
+corner to corner of the diagram, at right angles to the diagonals in the 
+dotmatrix,  The scores for the common words (according to the current 
+score matrix, e.g. MDM78) are accummulated at the appropriate positions
+on that imaginary line, hence  
+producing a histogram. The histogram is analysed to find its mean and 
+standard deviation. The diagonals that lie above some cutoff score 
+(defined in standard deviation units), are rescanned using the 
+proportional algorithm, and a diagram produced. The method is very fast, 
+and is also employed  by the library comparison program.
+.para
+Typical dialogue follows.
+.lit
+
+? Menu or option number=d26
+? Identity score (1-20) (3) =
+? Odd span length (1-401) (11) =
+? Proportional score (1-297) (132) =
+? Number of sd above mean (0.00-10.00) (5.00) =
+
+ missing graphics
+ 
+
+.end lit
+.left margin2
+.para
+SIPL the library searching version of SIP
+.para
+This program compares a probe sequence against a library of sequences using 
+the quick scan algorithm, sorts the matches into descending order of score, 
+and produces optimal alignments of the best scores using the Myers and 
+Miller method. It is very rapid.
+.para
+Use of lists of entry names 
+.para
+SIPL has the ability to 
+restrict searches to subsets of the libraries. This does not require 
+sublibraries to be created but instead is achieved by using files 
+containing a list of the entry names of sequences. The user may choose to 
+search only those entries on the list or, alternatively to search all but 
+those on the list (i.e. in the latter case
+the list contains the names of those to be excluded).
+ The programs can search libraries that have indexes and those that 
+do not.
+ If a list of names for inclusion is used,
+then the search will be faster if the index is present. In all other 
+circumstances the whole library will be read. 
+The list must be in library order except when it is used
+to include entries, and an index is available.
+The list must contain each entry name on a separate line, with the name 
+starting in column 1 of the line. ie there must be no spaces at the start 
+of the line.
+The list of entry names
+can be produced by the keyword searches of nip, pip, sip, etc as long 
+as the listings produced have a space character separating the entry name 
+from the entry description. This will depend on how well the library 
+reformatting programs work. For example swissprot entry names tend to run 
+into the beginning of the descriptions, but other libraries are generally 
+OK.
+
+.left margin1
+@28. TX 4 @Align sequences
+.left margin2
+.para
+This function will produce an optimal alignment of two segments of the 
+sequence. 
+The dynamic programming alignment algorithm is based on that of Miller 
+and Myers (). It guarantees to produce alignments with the optimum score 
+given a score matrix, a gap start penalty, and a gap extension penalty. 
+That is, starting a gap costs a fixed penalty (F) and each residue added 
+to the gap incurs a further penalty (E) so that for each gap of length K 
+residues the penalty is F + K*E. Gaps at the ends of sequences incur no 
+penalty.
+                                                                               
+.para
+The routine can only handle segments of sequence of maximum 
+length 5000 residues. When the sequences are read in the alignment 
+segment 
+will be set to the first 5000 residues. A different segment can be 
+selected by prefixing the option number by the letter D, in which case the 
+cross hair can be used to identify the two ends. The cross hair will 
+appear.
+First position the 
+crosshair at 
+the bottom left of the 
+segment and type a character other than s 
+or m or ",". When the crosshair reappears, position it a the top right 
+of the segment, and type a keyboard character.
+The aligned sequences will replace the active sequence if the user 
+confirms "keep alignment". By alternate use of the 
+plotting and alignment routines it is possible to rapidly produce an 
+alignment of quite long sequences.
+.para
+Typical dialogue follows.
+.lit
+
+28 = Align sequences
+? Menu or option number=d28
+Define the region to align using the cross-hair.
+First identify the bottom left position and exit
+the cross-hair routine. Then the top right.
+
+(Bell rings, type return, cross hair appears)
+
+? Penalty for starting a gap (1-100) (10) =
+? Penalty for each residue in gap (1-100) (10) =
+ 
+Aligning region           1 to         461
+    with region           1 to         514
+         1         11         21         31         41         51
+         MA--TGKIVQ VIGA------ VVDVEFPQDA VPRVYDALEV QNG------N ERLVL-----
+         *      *    *         **            * *       *        *   *
+         MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY
+         1         11         21         31         41         51
+        61         71         81         91        101        111
+         EVQQQLGGGI VRTIAMGSSD GLRRGLDVKD LEHPIEVPVG KATLGRIMNV LGEPVDMKGE
+              *     *    **     *  *  **       *****    ***  *  ** * * **
+         AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
+        61         71         81         91        101        111
+       121        131        141        151        161        171
+         IGEEERWAIH RAAPSYEELS NSQELLETGI KVIDLMCPFA KGGKVGLFGG AGVGKTVNMM
+                *     **   *          **  *  * * *    *      *     ***
+         LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
+       121        131        141        151        161        171
+       181        191        201        211        221        231
+         ELIRNIAIEH SGYS-VFAGV GERTREGNDF YHEMTDSNVI DKVSLVYGQM NEPPGNRLRV
+           *  *     **         *                          *      *
+         DAI--INQRD SGIKCIYVAI GQKASTISNV VRKLEEHGAL ANTIVVVATA SESAALQYLA
+       181        191        201        211        221        231
+       241        251        261        271        281        291
+         ALTGLTMAEK FRDEGRDVLL FVDNIYRYTL AGTEVSALLG RMPSAVGYQP TLAEEMGVLQ
+               * *  *** * * *    *        *    * **  * *                *
+         RMPVALMGEY FRDRGEDALI IYDDLSKQAV AYRQISLLLR RPPGREAFPG DVFYLHSRLL
+       241        251        261        271        281        291
+       301        311        321        331        341        351
+         ERITST---- ---------- -KTGSITSVQ AVYVPADDLT DPSPATTFAH LDATVVLSRQ
+         **                     **** *         * *      *        *    *
+         ERAARVNAEY VEAFTKGEVK GKTGSLTALP IIETQAGDVS AFVPTNVISI TDGQIFLETN
+       301        311        321        331        341        351
+       361        371        381        391        401        411
+         IASLGIYPAV DPLDSTSRQL DPLVVGQEHY DTAR----GV QSILQRYQEL KDIIAILGMD
+             ** ***  *  * **      * *             *     *  * **
+         LFNAGIRPAV NPGISVSR-- ---VGGAAQT KIMKKLSGGI RTALAQYREL AAFSQFAS--
+       361        371        381        391        401        411
+       421        431        441        451        461        471
+         ELSEEDKLVV ARARKIQRFL SQ----PFFV AE----VFTG SPGKYVSLKD --TIRGFKGI
+          *             *    *  *    *  * *      *     * *         *  *
+         DLDDATRKQL DHGQKVTELL KQKQYAPMSV AQQSLVLFAA ERG-YLADVE LSKIGSFEAA
+       421        431        441        451        461        471
+       481        491        501        511        521
+         MEG--EYDHL P-EQAFYMVG SIEEAVE--- --------KA KKL*
+                **  *  *     *       *                  *
+         LLAYVDRDHA PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW*
+       481        491        501        511        521
+Conservation  22.5%
+Number of padding characters inserted    63 and    10
+? (y/n) (y) Keep alignment n
+ 
+
+.end lit
+.left margin1
+@29. TX 1 @Complement the sequences
+.left margin2
+.para
+This function allows users to reverse and complement nucleic acid 
+sequences.
+.left margin1
+@30. TX 3 @Switch main diagonal
+.left margin2
+.para
+If a sequence is being compared against itself to look for repeats it is
+sometimes convenient if the main diagonal is not included in the 
+comparison. This function allows users to set a switch that determines 
+whether or not to include the main 
+diagonal for all the comparison methods.
+If the switch is set, and the active regions for both sequences have 
+the same start position, then the main diagonal will not be compared. 
+.left margin1
+@31. TX 3 @Switch identities
+.left margin2
+.para
+This function allows a switch to be set or unset. The switch determines 
+which of two forms of plot will be produced by the proportional 
+algorithm. 
+One form of output (the original method) plots a dot at the centre of each 
+span that reaches the threshold score; whereas the other form will plot
+dots for all matching residues that lie within spans that reach the 
+threshold.
+.left margin1
+@32. TX 3 @change score matrix
+.left margin2
+.para
+This option allows users to select their 
+own score matrix for use with the proportional algorithm. The choices 
+are:
+.lit
+
+ 1 = MDM78
+ 2 = identity
+ 3 = your own matrix
+
+.end lit
+.para
+MDM78 is the standard matrix that is used for proteins and an 
+identity matrix is the default matrix for nucleic acids. However an 
+identity 
+matrix is also useful for protein comparisons. "Your own matrix" allows 
+users to apply any other matrix, as long as the matrix file is in the same
+format as MDM78.
+For comparisons of DNA it might be useful to try one that gave say 3 for 
+exact matches and 1 for R-R or Y-Y, else=0.
+.left margin1
+@33. TX 3 @Set number of sd's for Quickscan
+.left margin2
+.para
+The quickscan 
+algorithm is as follows. The dot matrix positions are found for all 
+words of some minimum length (obviously length 1 is most sensitive) 
+that are common to both sequences. Imagine a diagonal line running from 
+corner to corner of the diagram, at right angles to the diagonals in the 
+dotmatrix,  The scores for the common words (according to the current 
+score matrix, e.g. MDM78) are accummulated at the appropriate positions
+on that imaginary line, hence  
+producing a histogram. The histogram is analysed to find its mean and 
+standard deviation. The diagonals that lie above some cutoff score 
+(defined in standard deviation units), are rescanned using the 
+proportional algorithm, and a diagram produced.
+.para
+This option allows the number of sd's to be set.
+.left margin1
+@34. TX 3 @Set gap penalities
+.left margin2
+.para
+The alignment 
+function will produce an optimal alignment of two segments of the 
+sequence. 
+The dynamic programming alignment algorithm is based on that of Miller 
+and Myers (). It guarantees to produce alignments with the optimum score 
+given a score matrix, a gap start penalty, and a gap extension penalty. 
+That is, starting a gap costs a fixed penalty (F) and each residue added 
+to the gap incurs a further penalty (E) so that for each gap of length K 
+residues the penalty is F + K*E. Gaps at the ends of sequences incur no 
+penalty.
+.para
+This option allows the gap penalties to be set.
+.left margin1
+@ end of help
\ No newline at end of file
diff --git a/help/SPLITP.RNO b/help/SPLITP.RNO
new file mode 100644
index 0000000..a0ff5ae
--- /dev/null
+++ b/help/SPLITP.RNO
@@ -0,0 +1,125 @@
+.para
+Preparing the PROSITE protein motif library for use by the Staden programs
+.para
+Introduction
+.para
+A library of protein motifs (in our terminology, because they include 
+variable gaps, some would be called patterns) has recently become available 
+from Amos Bairoch,Departement de Biochimie Medicale,University of Geneva
+Currently it contains 317 patterns/motifs and arrives on tape or cdrom
+in two files: 
+a .dat file and a .doc file. There is also a user documentation file 
+prosite.usr. Here I outline what is required to prepare the PROSITE library for
+use by our programs.
+.para
+Three programs need to be run SPLITP1, SPLITP2, and SPLITP3.
+.PARA
+Outline of the PROSITE files
+.para
+ A typical entry in the .dat file is shown below.
+.lit
+
+ID   2FE2S_FERREDOXIN; PATTERN.
+AC   PS00197;
+DT   APR-1990 (CREATED); APR-1990 (DATA UPDATE); APR-1990 (INFO UPDATE).
+DE   2Fe-2S ferredoxins, iron-sulfur binding region signature.
+PA   C-x(1,2)-[STA]-x(2)-C-[STA]-{P}-C.
+NR   /RELEASE=14,15409;
+NR   /TOTAL=69(69); /POSITIVE=63(63); /UNKNOWN=0(0); /FALSE_POS=6(6);
+NR   /FALSE_NEG=5(5);
+CC   /TAXO-RANGE=A?EP?; /MAX-REPEAT=1;
+CC   /SITE=1,iron_sulfur; /SITE=5,iron_sulfur; /SITE=8,iron_sulfur;
+DR   P15788, FER$APHHA , T; P00250, FER$APHSA , T; P00223, FER$ARCLA , T;
+DR   P00227, FER$BRANA , T; P07838, FER$BRYMA , T; P13106, FER$BUMFI , T;
+DR   P00247, FER$CHLFR , T; P07839, FER$CHLRE , T; P00222, FER$COLES , T;
+DO   PDOC00175;
+//
+.end lit
+.para
+Each entry has an accession number (here PS00197), a pattern definition 
+(here C-x(1,2)-[STA]-x(2)-C-[STA]-{P}-C) and a documentation file 
+cross reference (here PDOC00175).
+This pattern means: C, gap of 1 or 2, any of STA, gap of 2, C, any of STA, 
+not P, C.
+.para
+  We need to convert all of these patterns into our pattern definitions 
+(as membership of a set, with the appopriate gap ranges) and write each 
+into a separate pattern file with corresponding "membership of a set" 
+weight matrices. Each 
+pattern file is named accession_number.pat (here PS00197.PAT). The 
+corresponding matrix files are accession_number.wtsa, 
+accession_number.wtsb, etc for however many are needed (here PS00197.WTSA 
+and PS00197.WTSB): two are needed because of the variable gap. 
+.para
+In addition we can optionally
+split the .dat and .doc files into separate files, one for each 
+entry, with names accession_number.dat and accession_number.doc. Also we
+create an index for the library prosite.lis, which 
+gives a one line description of each pattern, and ends with the pattern 
+file and documentation file numbers. The start of the file is shown below.
+.lit
+
+N-glycosylation site.                                                00001,00001
+Glycosaminoglycan attachment site.                                   00002,00002
+Tyrosine sulfatation site.                                           00003,00003
+cAMP- and cGMP-dependent protein kinase phosphorylation site.        00004,00004
+
+.end lit
+So the name of the pattern file for Glycosaminoglycan attachment site is 
+PS00002.PAT, and for the documentation file PDOC00002.DOC
+.para
+Finally we
+create a file of file names for all the patterns in the library.
+.para
+To use the complete PROSITE library from program pip, select "pattern searcher"
+and choose the 
+option "use file of pattern file names", and give the file name 
+prosite.nam). For any matches found, the accession number and pattern title
+will be 
+displayed.
+
+.para
+Running the conversion programs
+.para
+
+Only SPLITP3 is necessary for using the library. The others programs
+ only make the
+original files marginally easier to browse through and produce an index.
+.para
+SPLITP1 splits the prosite.dat file to create a separate file for each 
+entry. Each file is automatically named PSentry_number.dat. In addition it 
+creates an index for the library (see above).
+.para
+SPLITP2 performs the same operation for the Prosite.doc file, except that 
+no index is created. Files are named PSentry_number.doc.
+.para
+SPLITP3 creates a separate pattern file and weight matrix files for each 
+prosite entry from the file prosite.dat. Pattern files are named 
+PSentry_number.pat, weight matrix files PSentry_number.wtsa, 
+Psentry_number.wtsb, etc. The pattern title is the one line description
+of the motif. SPLITP3 also creates a file of file names. Notice that it
+will ask for a path name so that the path can be included in the file of
+file names. This is the path to the directory in which the pattern files
+are stored.
+.para
+Notes
+.para
+Obviously the use of files of file names is a general solution, and anybody 
+could now create their own set of interesting patterns for screening, or a 
+subset of prosite.nam, etc.
+.para
+   Note that 5 of the bairoch motifs contained the symbols > or < which 
+means that the motifs must appear exactly at the N or C termini of the 
+sequences. Currently our methods have no mechanism for such definitions and, 
+for example KDEL motifs, will be permitted to occur anywhere throughout 
+a sequence.
+
+.para
+Also, of course, the library does not have to be used solely for performing 
+mass screenings: each individual entry can be used as a single pattern by 
+giving the name of its .pat file - eg pathname/ps00002.pat
+In addition more sophisticated users will wish to copy pattern files and 
+weight matrices into their own directories and modify them. For example the 
+cutoff scores are probably chosen to be quite high in order to reduce the 
+number of false positives, and some users might wish to lower them.
+
diff --git a/help/STADEN.RNO b/help/STADEN.RNO
new file mode 100644
index 0000000..5d1db9d
--- /dev/null
+++ b/help/STADEN.RNO
@@ -0,0 +1,354 @@
+.npa
+.left margin2
+.para
+Introduction to the Staden sequence analysis package and its user interface
+.PARA
+The package contains the following programs:
+.lit
+
+  GIP     Gel input program
+  SAP     Sequence assemble program
+  NIP     Nucleotide interpretation program
+  PIP     Protein interpretation program
+  SIP     Similarity investigation program
+  MEP     Motif exploration program
+  NIPL    Nucleotide interpretation program (library)
+  PIPL    Protein interpretation program (library)
+  SIPL    Similarity investigation program (library)
+
+.end lit
+.left margin2
+GIP uses a digitiser for entry of DNA sequences from 
+autoradiographs.
+.left margin2
+SAP handles everything relating to assembling gel 
+readings in order to produce a consensus sequence. It can also deal with 
+families of protein sequences.
+.left margin2
+NIP provides functions for analysing and interpretting
+individual nucleotide sequences.
+.left margin2
+PIP provides functions for analysing and interpretting
+individual protein sequences.
+.left margin2
+MEP analyses families of nucleotide sequences to help discover new motifs.
+.left margin2
+NIPL performs pattern searches on nucleotide sequence libraries.
+.left margin2
+PIPL performs pattern searches on protein sequence libraries.
+.left margin2
+SIP provides functions for comparing and aligning 
+pairs of protein or nucleotide sequences.
+.left margin2
+SIPL searches nucleotide and protein sequence
+libraries for entries similar to probe sequences.
+.left margin2
+.sk1
+.para
+Documentation
+.para
+As is explained below, the 
+programs SAP, NIP, PIP, SIP and MEP have online help,
+and the help files have the names: HELPSAP, HELPNIP, HELPPIP, HELPSIP, 
+HELPMEP. These 
+files can be displayed on the screen or printed using the appropriate 
+commands. Currently the help for the other programs is also contained in 
+these files. For example help for NIPL is in HELPNIP. This file is called 
+HELPSTADEN.
+.para
+Sequence formats
+.para
+ The shotgun sequencing program SAP deals only with simple 
+text files for gel readings, and is a self-contained system.
+However as there is still no single agreed format
+ for finished sequences or for libraries of sequences,
+the other programs in the package can read data that is stored in several ways.
+.para
+The analytical programs can read individual sequences stored in the following
+formats: 
+Staden, EMBL, Genbank, PIR (also known as NBRF), and GCG, but for storing whole 
+libraries we use only PIR format. In addition 
+these programs can perform a number of 
+simple operations using libraries stored in this format. They can extract 
+entries by entry name, can search titles for keywords, can search the whole 
+of the annotation files for keywords, and can extract annotations for any 
+named entry.
+We reformat all sequence libraries into PIR format. Currently we 
+have NBRF, EMBL, SWISSPROT and VECBASE libraries in PIR format.
+.para
+The library searching programs operate only
+on sequences stored in PIR format.
+.para
+The analytical programs
+will operate with uppercase or lowercase sequence
+characters. In addition T and U are equivalent. SAP uses uppercase letters 
+for original gel readings and lowercase letters for characters that are 
+corrected by the automatic editor.
+Programs NIP  and PIP use IUB symbols for redundancy in back translations 
+and for sequence searches.
+The symbols are shown below.
+.LIT
+
+
+            NC-IUB SYMBOLS
+
+      A,C,G,T
+      R        (A,G)        'puRine'
+      Y        (T,C)        'pYrimidine'
+      W        (A,T)        'Weak'
+      S        (C,G)        'Strong'
+      M        (A,C)        'aMino'
+      K        (G,T)        'Keto'
+      H        (A,T,C)      'not G'
+      B        (G,C,T)      'not A'
+      V        (G,A,C)      'not T'
+      D        (G,A,T)      'not C'
+      N        (G,A,C,T)    'aNy'
+
+.end lit
+.PARA
+The user interface
+.PARA
+The user interface is common to all programs. 
+It consists of a set of menus and a uniform way
+ of presenting choices and obtaining input 
+from the user. This section describes: the 
+menu system; how options are selected and  other choices made; how values
+ are supplied to the program;  how help is obtained, and 
+how to escape from any part of a program. In addition it gives information 
+about saving results in files and the use of graphics for presenting 
+results.
+.para
+Menus
+.para
+Each program has several menus and numerous options. 
+Each menu or option has a unique number that is used to 
+identify it. Menu numbers are distinguished from 
+option numbers by being preceded by the letter 
+m (or M, all programs make no distinction between 
+upper and lower case letters). With the exception of 
+some parts of program SAP, the menus are not hierachical, 
+rather the options they each contain are simply lists of 
+related functions and their identifying numbers. 
+Therefore options can be selected independently
+ of the menu that is currently being shown on the 
+screen,  and the menus are simply memory aides. 
+All options and menus are selected by typing their 
+option number when the programs present the prompt 
+.para
+  "? Menu or option number =". 
+.para
+ To select a menu type its number preceded by 
+the letter M. To select an option type its number.
+If you type only "return" you will get menu m0 
+which is simply a list of menus. If you select an 
+option you will return to the current menu after the function is completed.
+.para
+ When you select an option, in many cases the 
+program will immediately perform the operation 
+selected without further dialogue.  If you precede an option 
+number by the letter d (e.g. D17), you 
+will force the program to offer dialogue about the selected option 
+before the function operates, 
+hence allowing you to change the value of any of its parameters.  If 
+you precede an option number by the symbol ? (e.g. ?17), 
+you will be given help on the option (here 17). 
+.para
+Where possible, equivalent or identical options have been given the same
+numbers in all programs, and so users quickly learn the numbers for 
+the functions they employ most often.
+.para
+Help
+.para
+As mentioned above, help about each option can be obtained by 
+preceding the option number  by the symbol ? when you are presented 
+with the prompt "? Menu or option number", but there are two further 
+ways of obtaining help. Whenever the program asks a question 
+you can respond by typing the symbol ? and you will receive information 
+about the current option. In addition, option number 1 
+in all the programs will give help on all of a programs functions.
+.para
+Quitting
+.para
+To exit from any point in a program you type ! for quit. 
+If a menu is on the screen this will stop the program, otherwise 
+you will be returned to the last menu.
+.Para
+Other interactions
+.para
+Questions are  presented in a few restricted ways. 
+In all cases typing only "return" in response to a question means 
+yes, and typing N or n means no.
+.para
+Obvious opposites such as "clear screen" and "keep picture" 
+are presented with only the default shown. For example 
+in this case the default is generally "keep picture" so the 
+program will display: 
+.para
+"(y/n) (y) Keep picture"
+.para
+and the picture will be retained if the user types anything other than N or 
+n, (in which case the screen will be cleared).
+.para
+Where there are choices that are not obvious opposites, or 
+there are more than two choices, two further conventions are used: 
+"radio buttons" and "check boxes".
+.para
+
+Radio buttons are used when only one of a number of choices can be 
+made at any one time. The choices are presented arranged one above the 
+other, each choice with a number for its selection, and the default 
+choice marked with an X. For example in the restriction 
+enzyme search routine the following choices are offered:
+.para
+.lit
+
+   Select output mode
+   1 order results enzyme by enzyme
+   2 order results by positon
+ X 3 show only infrequent cutters
+   4 show names above the sequence
+ ? Selection (1-4) (3) =
+
+.end lit
+Any single option can be selected by typing the option number, 
+and the default option, (here shown as 3), is also obtained by 
+typing only "return". Again help can be obtained by typing ? and 
+you can quit by typing !.
+.para
+Check boxes are used when any number of a set of choices can be 
+made (i.e. the choices are not exclusive). Choices are 
+made by typing choice numbers.  Each choice can be considered 
+as a switch whose setting is reversed when it is selected. Choices that are 
+currently switched on are marked with an X. 
+The user quits from making selections by typing only 
+"return". For example in the routine that plots base composition 
+you can plot the frequencies of any combination of bases, e.g. only 
+A, or A+T, or A+T+G  etc. 
+The following check box is offered to the user:
+.lit
+
+  X 1 T
+    2 C
+  X 3 A
+    4 G
+  ? Selection (1-4) () =
+
+.END LIT
+As shown this will plot the A+T composition. To switch off  T 
+you select 1, to switch on  C you select 2, etc, to quit, 
+having set the bases required you type only "return".
+.para
+Input of numerical values
+.para
+All input of integer or decimal numbers is presented in a 
+standard way with the allowed range shown in brackets and the default 
+value also in brackets. For example:
+.para
+ ? span (5-31) (11) =
+.para
+In this example you could type any number between 5 and 31, 
+or "return" only, or ! or ? (see above). Any other input will cause the 
+program to ask the question again. Typing only "return" gives the default 
+value (here 11).
+.para
+Use of the bell
+.para
+The programs use the bell to indicate that a task is completed. 
+This allows users to read textual results before they are scrolled up off 
+the screen, or to look at a plot before it is scrolled over by the menus. 
+When the bell sounds, the programs will wait
+ until return is typed. You can quit from these points by typing ! but 
+no help is available.
+.para
+Printing and saving results in files
+.para
+A few of the functions in the programs automatically  write their textual
+results 
+to disk files,  but for most functions you can choose whether results
+appear on the terminal screen or go to a file. This applies to both text 
+and graphical results.
+For these functions 
+the normal, or default, place for results to 
+appear is on the screen, and users need to decide before the 
+function is selected if they want to redirect the results to a file. 
+In all programs, option number 7, "Direct output to disk" gives control 
+over whether results appear on the screen or go to a file. When a program 
+is started results will be sent to the screen. If option 7 is selected 
+users will be given the choice of redirecting either text or graphics to a 
+file. The program will then ask users to supply a file name. From that 
+point on all results will be sent to the file until option 7 is selected again, 
+in which case the "redirection file" will be closed, and results will start 
+to appear on the screen.
+.para
+If these files contain textual results they can be looked at 
+from within the programs 
+by using option 6, "List a text file". Once you leave the program 
+you can use an appropriate system command to print the files. 
+There is no function within the programs to direct files to a printer.
+.para
+The converse of the above is also possible. That 
+is, it is possible to redirect results that would normally go to file, 
+so that they appear instead on the screen. This is often useful as a way 
+of checking results before saving them in a file. On a VAX using 
+VMS you do this by typing TT: for the name of the file that the 
+program would create. TT: is what VMS calls the screen.
+.para
+Use of graphics
+.para
+The analytical programs including NIP, PIP and SIP present the results of 
+many of their analyses graphically. The position at which the results for 
+any function appear on the screen is defined relative to a notional users 
+"drawing board" of dimension 10,000 by 10,000. This drawing board fills the 
+screen and results are drawn in windows defined using symbols x0,yo and 
+xlength,ylength, 
+where x0,y0 is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the 
+height of the window.
+.lit
+
+   --------------------------------------------------------- 10,000
+   1                                                       1
+   1       --------------------------------------   ^      1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       1                                    1 ylength  1
+   1       1                                    1   1      1
+   1       1                                    1   1      1
+   1       --------------------------------------   v      1
+   1  x0,y0^                                               1
+   1       <---------------xlength-------------->          1
+   ---------------------------------------------------------      1
+   1                                                   10,000
+
+.end lit
+.para
+ The window positions for each option are read from a file 
+when a program is started. If required individual users could have their
+own set of plot positions, and also the positions
+ can be redefined from within the 
+programs using option number 14.
+.para
+For those analyses that draw continuous lines to represent results 
+(for example a plot of base composition) the user is asked to supply the 
+"Plot interval". All the analyses produce a value for every point along the 
+sequence but often it is unnecessary to actually plot the 
+values for all the points.
+The plot interval is simply the distance between the points 
+shown on the screen. If the user selects a plot interval of 1, every point 
+will be plotted; a plot interval of 3 will show every third point. It is a 
+way of speeding up the analyses.
+.para
+Saving graphics
+.para
+Many terminals are not capable of dumping their screen contents to a 
+file for subsequent printing. One convenient way of obtaining hard copy 
+of graphical results is to use a micro computer as a terminal. On 
+the Macintosh we use the terminal emulator versa
+termPro. This allows graphics to be saved as 
+Macintosh files that can be annotated and printed using 
+Macdraw and other painting programs. 
+.para
+Alternatively graphics can be redirected to a file and printed using a 
+laser printer with tektronix capability (see 
+"Printing and saving results in files").
diff --git a/help/bap_help b/help/bap_help
new file mode 100644
index 0000000..6d1ae3c
--- /dev/null
+++ b/help/bap_help
@@ -0,0 +1,2112 @@
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. TX  0 @Modification
+
+ @0.  TX -1 @BAP
+
+        This is an  interactive  program  whose  primary  use  is  for
+  managing  shotgun  sequencing  projects, but it can also be used for
+  handling alignments of other sequences, including those of proteins.
+  Currently   the   maximum  'gel  reading'  length  is  set  to  4096
+  characters. Almost all of the information below describes the use of
+  the  program  for shotgun projects, but those using the programs for
+  handling other sequence alignments should interpret it  accordingly.
+  The data for such a project is stored in a special type of database.
+  The program contains the tools  that  are  required  to  screen  gel
+  readings  against  vector  sequences  and  restriction sites, and to
+  assemble new gel readings into the database (automatically comparing
+  and aligning them). In addition it contains editors and functions to
+  examine the quality of the aligned sequences.
+
+        There  are  three  main   menus:   "general",   "screen"   and
+  "modification", and some functions have submenus.
+    The general menu contains the following options:
+
+         Open a database
+         Display a contig
+         List a text file
+         Direct output
+         Calculate a consensus
+         Screen against restriction enzymes
+         Screen against vector
+         Check logical consistency
+         Copy database
+         Show relationships
+         set parameters
+         Highlight disagreements
+         Examine quality
+         Check Assembly
+         Find read pairs
+
+  The graphics menu contains:
+
+         Clear graphics
+         Clear text
+         Draw ruler
+         Use cross hair
+         Change margins
+         Label diagram
+         Plot map
+         Plot single contig
+         Plot all contigs
+
+
+  The modification menu contains:
+
+         Edit contig
+         Auto assemble
+         Join contigs
+         Complement a contig
+         Alter relationships
+         Extract gel readings
+         Find internal joins
+         Disassemble readings
+         Shuffle pads
+         Auto-select oligos
+         Double strand
+
+  The alter relationships menu contains:
+
+         Cancel
+         Line change
+         Check logical consistency
+         Remove contig
+         Shift
+         Move gel reading
+         Rename gel reading
+         Break a contig
+         Remove a gel reading
+         Alter raw data parameters
+
+
+
+        Overview of the methodology
+
+        The shotgun sequencing strategy
+
+        In  the  shotgun  sequencing  procedure  the  sequence  to  be
+  determined   is   randomly  broken  into  fragments  of  about  1000
+  nucleotides in length. These fragments are cloned and then  selected
+  randomly  and  their  sequences    determined.     The  relationship
+  between  any  pair  of fragments is  not  known  beforehand  but  is
+  found  by  comparing  their   sequences.  If  the  sequence  of  one
+  found to be wholly or partially contained  within  that  of  another
+  for  sufficient  length  to  distinguish  an overlap  from  a repeat
+  then those two fragments can  be  joined.  The  process  of  select,
+  sequence  and  compare is continued until the whole of  the  DNA  to
+  be  sequenced is in one continuous well determined piece.
+
+        Definition of a contig
+
+        A CONTIG is a set of gel  readings   that   are   related   to
+  one  another   by   overlap  of  their  sequences.  All gel readings
+  belong to a contig and each contig  contains  at   least   one   gel
+  reading.   The  gel  readings in a contig can be summed to produce a
+  continuous consensus sequence and the length of this sequence is the
+  length  of the contig.  The rules used to perform this summation are
+  given  under  "the  consensus  algorithm".   At  any  stage  of    a
+  sequencing  project the data will comprise a number of contigs; when
+  a  project  is complete  there  should be only one  contig  and  its
+  consensus  will  be  the  finished  sequence.  Note that since being
+  introduced and defined as above the word "contig" has been taken  up
+  by  those involved in genomic mapping. In that context the consensus
+  with a  precise length is, of course, not defined.
+
+  Introduction to the computer method
+
+        It is useful  to  consider  the  objectives  of  a  sequencing
+  project  before  outlining  how  we use the computer to help achieve
+  them. The aim of a shotgun  sequencing  project  is  to  produce  an
+  accurate  consensus sequence from many overlapping gel readings.  It
+  is necessary to know, particularly  at  the  latter  stages  of  the
+  project,  how accurate the consensus sequence is. This enables us to
+  know which regions of the sequence require further work and also  to
+  know  when  the  project  is  finished.   To show the quality of the
+  consensus, the programs described here produce  displays  like  that
+  shown below.
+
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+
+        This is an example showing the left  end  of  a  contig   from
+  position   1  to  200.   Overlapping  this  region  are gel readings
+  numbered 6, 3, 5, 17 and 12; 6, 3 and 5 are in  reverse  orientation
+  to  their  original  reading  (denoted  by  a  minus sign). Each gel
+  reading also has a name (eg HINW.010). It can  be  seen  that  in  a
+  number  of  places the sequences contain characters other than A,C,G
+  and T. Some  of  these  extra  characters  have  been  used  by  the
+  sequencer   to  indicate  regions  of  uncertainty  in  the  initial
+  interpretation of the gel reading, but the asterisks (*)  have  been
+  inserted  by  the  automatic assembly function in order to align the
+  sequences.  Underneath  each  50  character  block  of  gel  reading
+  sequences  is the consensus derived from the sequences aligned above
+  (the line labelled CONSENSUS). For most of its length the  consensus
+  has a definite nucleotide assignment but in a few positions there is
+  insufficient agreement between the gel readings and so  a  dash  (-)
+  appears  in  the  sequence.  This  display contains all the evidence
+  needed to assess the quality of the consensus: the number  of  times
+  the  sequence has been determined on each strand of the DNA, and the
+  individual nucleotide assignments given for each gel reading.
+
+        So the aim is to produce the consensus sequence  and,  equally
+  important,  a  display of the experimental results from which it was
+  derived.
+
+        In order to achieve this the following operations need  to  be
+  performed:
+  1) Put individual  gel  readings  into  the  computer.   This  might
+  involved   the  manual  interpretation  of  autoradiographs  or  the
+  transfer and process  of  machine-readable  files  from  fluorescent
+  sequencing machines.
+  2) Check each gel reading to make sure it is not simply part of  one
+  of the vectors used to clone the sequence.
+  3) Check each gel reading to make sure  that  those  fragments  that
+  span  the  ligation point used prior to sonication are not assembled
+  as single sequences.
+  4) Compare all the  remaining  gel  readings  with  one  another  to
+  assemble them to produce the consensus sequence.
+  5) Check the quality of the consensus and edit the sequences.
+  6) When all the consensus is sufficiently well determined, produce a
+  copy of it for processing by other analysis programs.
+
+        It is very unlikely that this procedure will  only  be  passed
+  through  once.   Usually steps 1 to 5 are cycled through repeatedly,
+  with step 4 just adding new sequences to  those  already  assembled.
+  Generally step 6 is also used in order to analyse imperfect sequence
+  to check if it is the one the project intended to  sequence,  or  to
+  look  for  interesting  features. Analysis of the consensus, such as
+  searches for protein coding regions, can also help to find errors in
+  the  sequence.  The  display  of  the overlapping gel readings shown
+  above can be used  to  indicate,  not  only  the  poorly  determined
+  regions,  but  also  which  clones  should be resequenced to resolve
+  ambiguities, or those which can usefully be extended or sequenced in
+  the reverse direction, to cover difficult regions.
+
+        The original individual gel readings for a sequencing  project
+  are  each  stored in separate files. As the gel readings are entered
+  into the computer (usually in batches, say 10 from a film), the file
+  names  they are given are stored in a further file, called a file of
+  file names. Files of file names enable gel readings to be  processed
+  in batches.
+
+        For each sequencing project we start a project database.  This
+  database  has  a  structure  specifically  designed for dealing with
+  shotgun sequence data. In order to arrive  at  the  final  consensus
+  sequence  many  operations  will  be performed on the sequence data.
+  Individual fragments must be sequenced and compared in  both  senses
+  (i.e.  both  orientations)  with  all  the  other sequences. When an
+  overlap between a new gel reading and a contig are found  they  must
+  be aligned and the new gel reading added to the contig. If a new gel
+  reading overlaps two contigs they must be aligned and joined. Before
+  the  two contigs are joined one of them may need to be turned around
+  (reversed and  complemented)  so  they  are  both  in  in  the  same
+  orientation.
+
+        Clearly, keeping track of all  these  manipulations  is  quite
+  complicated,  and  to  be  able  to  perform  the operations quickly
+  requires careful choice of data structure and algorithms. For  these
+  reasons  it  is not practicable to store the gel readings aligned as
+  shown in the display above. Rather, it is more convenient  to  store
+  the  sequences unassembled, and to record sufficient information for
+  programs to assemble  them  during  processing.  The  data  used  to
+  assemble the sequences is called relational information.
+
+        The database comprises five files and they are described under
+  the section entitled "open database".
+
+        Before entry into the project database each  new  gel  reading
+  must  be  compared  to  look  for overlaps with all the data already
+  contained within the database. This last  point  is  important:  all
+  searching  for  overlaps  is between individual new gel readings and
+  the data already in the database. There is no searching for overlaps
+  between sequences within the database; overlaps must be found before
+  new gel readings are entered into the database.
+
+        Below  I  give  an  introduction  to  how  the  sequences  are
+  processed by being passed from one function to the next.
+
+        This program is used to start a database for the  project  and
+  then the following procedure is used.
+
+        Data in the form of individual gel readings are  entered  into
+  the computer and stored in separate files (possibly using either the
+  digitizer program GIP). Batches of these gel readings are passed  to
+  the  screening functions in this program to search for overlaps with
+  vector sequences (see  VEP  and  "screen  against  vector")  or  for
+  matches  to  restriction  enzyme  sites   that should not be present
+  ("screen against enzymes"). Each run of  these  screening  functions
+  passes  on  only  those  gel  readings  that do not contain unwanted
+  sequences.  Sequences  are  passed  via  files  of  file  names  and
+  eventually  are  processed by the automatic assembly function ("auto
+  assemble"). This function compares each gel reading with a consensus
+  of  all  the  previous  gel  readings stored in the database.  If it
+  finds any overlaps it aligns the overlapping sequences by  inserting
+  padding  characters,  and  then  adds  the  new  gel  reading to the
+  database. Gels that overlap are added to existing contigs  and  gels
+  that do not overlap any data in the database start new contigs. If a
+  new gel overlaps two contigs they are joined. Any gel readings  that
+  appear  to overlap but which cannot be aligned sufficiently well are
+  not entered and have their names written to a  file  of  failed  gel
+  reading names.
+
+        Generally data is entered into the database in batches as just
+  described.  The  program  is  also  used  to examine the data in the
+  database, to enter gel readings that the automatic assembly function
+  cannot  align  ("auto  assemble"), and to make final edits. Edits to
+  whole contigs can   be  made  using  a  mouse-driven  editor  ("edit
+  contig").
+
+        Editing the  sequences  is  obviously  an  essential  part  of
+  managing   a  sequencing  project.  Editing  is  required  when  new
+  sequences are added, when contigs are joined, and when sequences are
+  corrected.   A  basic part of the strategy used here is that new gel
+  readings should be correctly aligned throughout their  whole  length
+  when  they  are entered into the database, and that when contigs are
+  joined they are edited so that they are well aligned in  the  region
+  of  overlap.  Alignment can be achieved by adding padding characters
+  to the sequences, and this is the way "auto assemble" operates  when
+  adding new sequences to the database.
+
+        In order to search for overlaps that may have been  missed  or
+  may  be  hidden  in  the  "unused  data" the function "find internal
+  joins" can be used.
+
+        Generally the users need not concern themselves with  how  the
+  relational  information  is used by the program, but it is necessary
+  to know how contigs are identified. Because contigs  are  constantly
+  being  changed  and  reordered  the  program  identifies them by the
+  numbers of the gel readings they contain.  Whenever  users  need  to
+  identify  a  contig they need only know the number or name of one of
+  the gel readings it contains. Whenever the  program  asks  users  to
+  identify  a  contig  or  gel reading they can type its number or its
+  archive name. If they type its archive name they  must  precede  the
+  name by a slash "/" symbol to denote that it is a name rather than a
+  number. E.g if the  archive name is fred.gel with number  99,  users
+  should  type  /fred.gel  or  99  when  asked to identify the contig.
+  Generally, when it asks for the gel reading to  be  identified,  the
+  program  will  offer  the user a default name, and if the user types
+  only return, that contig will be accessed. When a database is opened
+  the  default  contig  will  be  the  longest  one, but if another is
+  accessed, it will subsequently  become the current default.
+
+        Further information is located in the  following  places.  The
+  database  files  are described under "open database". The format for
+  vector  and  consensus  sequences  is  given  under   "calculate   a
+  consensus", as are the uncertainty codes used in gel readings.
+
+        The digitiser program is used for the  initial  input  of  gel
+  readings  and  for  writing a file of file names. The program uses a
+  digitizer for data  entry.   A  digitizer  is  a   two   dimensional
+  surface  such  as a light box which is such that if a special pen is
+  pressed onto it, the pens coordinates are recorded  by  a  computer.
+  These coordinates can be interpreted by a program.
+
+        In order to read an autoradiograph placed on the light box the
+  user  need  only  define the bottom of the four sequencing lanes and
+  the bases to which they correspond and then use  the  pen  to  point
+  to   each  successive   band  progressing  up  the gel.  The program
+  examines the coordinates of each pen position to see in which of the
+  four  lanes  it   lies  and  assigns  the  corresponding  base to be
+  stored in the computer.  Each time the pen tip is depressed to point
+  to  a  position on  the  surface of the digitizer the program sounds
+  the bell on the terminal to indicate to the user that  a  point  has
+  been recorded.  As the  sequence  is read the program displays it on
+  the screen.
+ @17. TX 1 @Screen against enzymes
+
+        Used to compare gel readings against  any  restriction  enzyme
+  recognition  sequences  that  may have been used  during cloning and
+  which should not be  present  in  the  data.  Works  on  single  gel
+  readings  or processes batches accessed through files of file names.
+  The algorithm looks  for  exact  matches  to  recognition  sequences
+  stored in a file.
+
+        The  file  containing  the  recognition  sequences   must   be
+  identified.  The  user  must choose between employing a file of file
+  names, or typing in the names of individual gel reading files. If  a
+  file  of  file names is used the program will also create a new file
+  of file names. When the option has finished operating this new  file
+  will  contain the names of all those gel readings that did not match
+  any of the recognition sequences. Hence it can be used  for  further
+  processing  of the batch. The recognition sequences should be stored
+  in a simple text file with one recognition sequence per line.
+ @18. TX 1 @Screen against vector
+
+        Used to compare gel readings against any vector sequences that
+  may  have  been  picked  up  during  cloning and which have not been
+  removed by vep. It Works on single gel readings or processes batches
+  accessed  through files of file names. The algorithm looks for exact
+  matches  of  length  "minimum  match  length"   and   displays   the
+  overlapping sequences.
+
+        The file containing the vector sequence  must  be  identified.
+  The  user  must  choose  between  employing a file of file names, or
+  typing in the names of individual gel reading files. If  a  file  of
+  file  names  is used the program will also create a new file of file
+  names. When the option has finished operating  this  new  file  will
+  contain  the  names of all those gel readings that did not match the
+  vector sequence. Hence it can be used for further processing of  the
+  batch.  The  vector  sequence should be stored in a simple text file
+  with up to 80 characters of data per line. More than one vector  can
+  be  stored  in  a single file. If so each should be preceded by a 20
+  character title of the form <---m13mp8.0001----> where the <  and  >
+  signs  and  the number like .0001 are obligatory. The number must be
+  preceded by a dot (.) and be 4 digits long. The  total  sequence  in
+  the file must be < 500,001 characters in length.
+ @20. TX 3 @Auto assemble
+
+        Compares gel readings against  the  current  contents  of  the
+  database  and  produces  alignments. In its normal mode of operation
+  ("entry permitted"), the function will automatically enter  the  gel
+  readings into the database.
+
+        New assembly suboption.  However if entry is not permitted the
+  reads  won't  be entered but the program will produce alignments and
+  (optionally) save each reading name and  its  best  alignment  score
+  (percentage mismatch) in a file. When used in this mode, the program
+  will include in  the  alignment  the  poor  quality  data  for  each
+  reading.  These  files  of names can then be sorted into score order
+  and then used for assembly, hence forcing the  readings  that  align
+  best to be entered into the database first.  End of new suboption.
+
+        The routine works on single gel readings or processes  batches
+  of gel readings accessed through files of file names. It is the only
+  way to enter data into the database.
+
+        The function will check the database for  logical  consistency
+  and  will only proceed if it is OK. Choose if gel readings should be
+  entered into the database, or  if  they  should  only  be  compared.
+  Choose  between  using  a file of file names or typing file names on
+  the keyboard. If so selected, supply the file of  file  names.  Also
+  supply  a  file  of  file  names to contain the names of all the gel
+  readings that fail to get entered. Select  the  entry  mode.  Normal
+  assembly  is  appropriate  for  all but special cases, as is "permit
+  joins". Uses for the other modes are not documented here.  Define  a
+  minimum  initial  match length. Define the maximum number of padding
+  characters allowed to be used in each gel reading  to  help  achieve
+  alignment,  and  the  same  for the number allowed in the contig for
+  each gel reading. Finally define the maximum percentage mismatch  to
+  be  allowed  for any gel reading to be entered into the database. If
+  for any gel reading, either of these last three values  is  exceeded
+  the gel reading will not be entered into the database.
+
+        In operation the  function  takes  a  batch  of  gel  readings
+  (probably   passed  on   as   a  file  of file names from one of the
+  screening routines) and enters them into a database for a sequencing
+  project.  It takes each  gel reading in  turn, compares  it with the
+  current consensus for the database, it then produces  an   alignment
+  for   any   regions   of   the   consensus   it overlaps;   if  this
+  alignment is sufficiently good  it  then  edits  both  the  new  gel
+  reading  and  the  sequences  it  overlaps   and   adds the new  gel
+  reading to the database.  The program  then  updates  the  consensus
+  accordingly and carries on to the next  gel  reading.
+
+        All alignments are displayed and  any  gel  readings  that  do
+  match but  that cannot be aligned sufficiently well have their names
+  written to a file of failed gel reading names.  The  function  works
+  without   any  user  intervention  and can process any number of gel
+  readings in a single run.  Those  gel  readings  that  fail  can  be
+  recompared  using  the  same  function  (to find the current overlap
+  position) and  the user  can enter them into the database using  the
+  "put  all  readings  in new contigs" assembly option and then joined
+  using "join contigs".
+
+        Typical dialogue and output from the function is shown  below.
+  (Note  that  output  for gel readings 2 - 9 has been deleted to save
+  space).
+  Automatic sequence assembler
+  Database is logically consistent
+  ? (y/n) (y) Permit entry
+  ? (y/n) (y) Use file of file names
+  ? File of gel reading names=demo.nam
+  ? File for names of failures=demo.fail
+  Select entry mode
+  X  1 Perform normal shotgun assembly
+     2 Put all sequences in one contig
+     3 Put all sequences in new contigs
+  ? Selection  (1-3) (1) =
+  ? (y/n) (y) Permit joins
+  ? Minimum initial match (12-4097) (15) =
+  ? Maximum pads per gel (0-25) (8) =
+  ? Maximum pads per gel in contig (0-25) (8) =
+  ? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+    >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+    Processing           1 in batch
+    Gel reading name=HINW.004
+    Gel reading length=   283
+    Searching for overlaps
+    Strand     1
+    Strand     2
+    No matches found
+    Total matches found           1
+    Padding in contig=    0 and in gel=    1
+    Percentage mismatch after alignment =  1.8
+    Best alignment found
+           1         11         21         31         41         51
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           ********** ********** ********** ********** ********** **********
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           1         11         21         31         41         51
+          61         71         81         91        101        111
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+           ********** ********** ********** ********** ********** ***** ****
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+          61         71         81         91        101        111
+         121        131        141        151        161        171
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+           ********** ********** ********** ********** ********** **********
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         121        131        141        151        161        171
+         181        191        201        211        221        231
+           GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+           ********** ********** * ******** ********** ********** **********
+           GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         181        191        201        211        221        231
+         241        251        261        271        281
+           CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+           ****  **** ********** ** * ***** ********** ****
+           CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+         241        251        261        271        281
+            Batch finished
+            9 sequences processed
+            0 sequences entered into database
+            0 joins made
+
+
+        Note that "auto assemble" cannot align protein sequences.
+ @28. TX 1 @Highlight disagreements
+
+        Used  in  the  latter  stages  of  a  project   to   highlight
+  disagreements  between  individual  gel readings and their consensus
+  sequences. This display is also  availbale  in  the  contig  editor.
+  Characters  that agree with the consensus are shown as : symbols for
+  the plus strand and . for the minus strand. Characters that disagree
+  with  the consensus are left unchanged and so stand out clearly. The
+  results of this analysis are written to a file.
+
+        Before selecting this option create a file of the  display  of
+  the  contig to be "highlighted". The option will ask for the name of
+  this file. Select symbols to denote "agreeing"  characters  on  each
+  strand, the defaults are : and ., but any others can be used. Supply
+  the name of a file in which to put the output.
+
+        The display file needed as input for this option is created by
+  selecting  "Redirect  output",   followed  immediately  by  "display
+  contig", and then "Redirect output" again. The cutoff score used  in
+  the  consensus  calculation  can  be  set  by  option  "set  display
+  parameters". Note that for the highlight function there is  a  limit
+  of  50  for  the  number  of  gel  readings  that are aligned at any
+  position - ie the contig must be less than 51 gel readings  deep  at
+  its  thickest point. I hope that those performing shotgun sequencing
+  never reach this limit, but those using the  program  for  comparing
+  sequence families might.
+
+        Typical output from this function is shown below.
+
+                            210       220       230       240       250
+      1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+      7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+     -4  HINW.017                                 ...............AC....
+                     G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+
+                            260       270       280       290       300
+      1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+      7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+     -4  HINW.017    ..............................................A...
+      3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+     -6  HINW.028                            ......................A...
+                     AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+
+ @32. TX 3 @Extract gel readings
+
+        Used to make copies of the aligned gel readings in a database,
+  to write them into separate files, and to write a corresponding file
+  of file names. It operates in two modes: either all gel readings are
+  extracted, or only those at the ends of contigs.
+
+        Choose which mode of operation is required and supply  a  file
+  of file names.
+
+        The gel readings are given their original names.
+
+        If the option is used to extract all the gel readings  from  a
+  database,  a  subsequent  run  of "auto assemble" can reconstitute a
+  database which has  been  corrupted.  This   rarely  occurs  and  is
+  usually  necessitated  by  a  user   employing "alter relationships"
+  incorrectly without first having made a copy.
+ @1. TX 0 @Help
+
+        Help is available on the following topics :
+ @2. TX 0 @Quit
+
+        This command stops the program and is the  only  safe  way  to
+  terminate  a run of the program that has altered the contents of the
+  database in any way.
+ @3. TX 1 @Open a database
+
+        Opens existing databases or allows new ones to be started. The
+  function  is automatically called into operation when the program is
+  started but can also be selected from the general menu.
+
+        Choose to open an existing database or start a new one, or  if
+  !  is  typed  when  the  program is first started, enter the program
+  without opening a database. Supply a project database name,  and  if
+  it  already exists, the "version". If starting a new database define
+  the database size and if it is for DNA or  protein  sequences.   The
+  database  size  is  an  initial  size  for  the  database. It can be
+  increased later during the project. It is the sum of the  number  of
+  gel readings plus the number of contigs. The current maximum size is
+  8000.
+
+        Database names can have from one to 12 letters  and  must  not
+  include  full  stop  (.).  The  database  is made from five separate
+  files. If the database is called FRED then  version  0  of  database
+  FRED  comprises  files  FRED.AR0,  FRED.RL0,  FRED.SQ0, FRED.TG0 and
+  FRED.CC0. The version is the last symbol in the  file  names.   Only
+  this  program can read these files. If the "copy database" option is
+  used it will ask the user to define a new "version".
+
+        For normal use the maximum gel reading length is  set  to  512
+  characters,  but  when  a  database  is  started the user may choose
+  lengths of either 512, 1024, 1536..., 4096. Normally the program  is
+  used  to handle DNA sequences but many of the functions also work on
+  protein sequences. The choice of sequence  type  is  made  when  the
+  database is started.
+
+        The contigs are not stored on the disk as the user  sees  them
+  displayed  on the screen. Each gel reading is stored with sufficient
+  information about how it overlaps other gel  readings  so  that  the
+  program  can  work out how to present them aligned on the screen. We
+  refer to this extra data as "the relationships" and it is  explained
+  below.  The database comprises 5 separate files.
+  1.  a working version of each gel reading.  This is the  version  of
+  the   gel   reading  that  is in the database and initially it is an
+  exact copy of the original sequence (known as the archive) but it is
+  edited and manipulated to align  it with other gel readings.
+  2.  the file of  relationships.   This  file  contains  all  of  the
+  information  that  is required to assemble the working versions into
+  contigs during processing;  any manipulations on the data  use  this
+  file   and  it  is  automatically  updated  at  any  time  that  the
+  relationships are changed.  The  information  in  this  file  is  as
+  follows:
+  (A) Facts about  each   gel  reading   and   its   relationship   to
+  others ("gel descriptor lines"):
+  (a) the number of the gel reading   (each gel reading   is  given  a
+  number  as  it  is entered into the database)
+  (b) the length of the sequence from this gel reading
+  (c) the position of the left end of this gel reading    relative  to
+  the left end of the contig of which it is a member
+  (d) the number of the next gel reading   to the  left  of  this  gel
+  reading
+  (e) the number of the next gel reading   to the right
+  (f) the relative strandedness of this gel reading  , ie whether   it
+  is  in the same sense or the complementary sense as its archive.
+  (B) Facts about each contig ("contig descriptor lines"):
+  (a) the length of this contig
+  (b) the number of the leftmost gel reading   of this contig
+  (c) the number of the rightmost gel reading   of this contig.
+  (C) General facts:
+  (a) the number of gel readings in the database
+  (b) the number of contigs in the database.
+  3.  the file of archive names.  This is simply a list of  the  names
+  of each of the archive files in the database.
+  4. the file of tags (annotation). This consists of linked  lists  of
+  tag  information  for  each  sequences  in  the  database.  Tags are
+  created by the user as annotation, or by xdap as records of edits or
+  for  storing  cutoff  information.   As  the number of tags can grow
+  without limit, so can this file.  For each gel  there  is  a  header
+  record,  which contains the record number of the start of the linked
+  list for that gel. On line  IDBSIZ  there  is  a  record  containing
+  information  about  the file such as its present length and if there
+  are any free "tag" slots to be reused in the file.  5. the  file  of
+  comments  (annotation).   This  consists  of linked lists of comment
+  fragments.  Comments are created by the user as a  message  attached
+  to  annotation,  or  by  the  system  to  store  cutoff information.
+  Comments are character strings of any length.  Comments longer  than
+  40 characters are broken up into fragments, each 40 characters long,
+  and are chained together in a link list.  As the number of  comments
+  can grow without limit, so can this file.
+
+        Structure of the database files
+
+        1.  The file of relationships
+
+        The file contains IDBSIZ lines of data:  the general data  are
+  stored  on line IDBSIZ;   data about  gel readings  are stored  from
+  line 1 downwards;  data about contigs are stored from line  IDBSIZ-1
+  upwards.  A  database  of 500 lines containing 25 gel readings and 4
+  contigs would have a file of relationships as is shown below.
+
+
+                    ---------------------------------------------
+                       0  Info about the database size
+                       1  Gel descriptor record
+                       2   "      "       "
+                       3   "      "       "
+                       4   "      "       "
+                       5   "      "       "
+                       '   '      '       '
+                       '   '      '       '
+                      25   "      "       "
+                      26  Empty record
+                       '    '     '
+
+                       '    '     '
+                     495    '     '
+                     496  Contig descriptor record
+                     497    "        "        "
+                     498    "        "        "
+                     499    "        "        "
+                     500   Number of gel readings=25, Number of contigs=4
+                    ---------------------------------------------
+
+            The arrangement of the data in the file of relationships
+
+  As each new gel reading   is added into the database a new  line  is
+  added to  the  end  of  the  list  of gel descriptor lines.  If this
+  new gel  reading  does not overlap with any gel readings already  in
+  the  database  a new contig  line  is added  to  the top of the list
+  of contig lines.  If it overlaps with one contig then no new  contig
+  line  need  be  added  but  if it  overlaps with  two  contigs  then
+  these  two  contigs must be joined and the number  of  contig  lines
+  will  be reduced by one. Then the list of contig lines is compressed
+  to  leave  the empty line at the top of the list.  Initially the two
+  types  of  line will move towards  one  another  but eventually,  as
+  contigs  are joined, the contig descriptor lines will  move  in  the
+  same  direction  as the  gel descriptor lines.   At  the  end  of  a
+  project  there should  be only one contig  line.   The  database  is
+  thus capable of handling a project of 998 gels.
+
+        2.  Structure of the working versions file
+
+        The working versions of gel readings are stored  in   a   file
+  of  NGELS  lines  each  containing  MAXGEL  characters.  Gel reading
+  number 1 is stored on line 1, gel reading number  2 on line 2 and so
+  on.  NGELS  is the current number of readings and MAXGEL the maximum
+  reading length.
+
+        3.  Structure of the archive names file
+
+        This file has NGELS lines of 16 characters.
+
+        4.  Structure of the tag file
+
+        This file initially starts with IDBSIZ lines, and is  expanded
+  as  new tags are created.  Information about the length of the file,
+  and which tag records are reusable is  stored  on  line  IDBSIZ.   A
+  database of 500 lines would have a file of tags as shown below.
+
+                    ---------------------------------------------
+                       1  Tag descriptor record
+                       2   "      "       "
+                       3   "      "       "
+                       4   "      "       "
+                       5   "      "       "
+                       '   '      '       '
+                       '   '      '       '
+                     497   "      "       "
+                     498   "      "       "
+                     499   "      "       "
+                     500   Length of file=N, Free list=0
+                     501  Tag record
+                     502   "   "
+                     503   "   "
+                       '   '   '
+                       '   '   '
+                     N-2   "   "
+                     N-1   "   "
+                       N  Tag record
+                    ---------------------------------------------
+
+            The arrangement of the data in the tag file
+
+  As each new tag is added to the database, a check  is  made  in  the
+  file  descriptor  record  at  line  IDBSIZ.  If the list of reusable
+  records is 0, the file is extended by one line.  Otherwise  the  new
+  tag  is  assigned  to record at the head of the freelist.  When tags
+  are deleted, they are added to the free list in the file  descriptor
+  record.
+
+        5.  Structure of the comment file
+
+        This file initially starts with 1 line, and is expanded as new
+  annotation  is  created.   Information about the length of the file,
+  and which comment records are reusable is stored on the first line.
+
+                    ---------------------------------------------
+                       1  Length of file=N, Free list=0
+                       2  Comment fragment
+                       3   "       "
+                       4   "       "
+                       '   '       '
+                       '   '       '
+                     N-2   "       "
+                     N-1   "       "
+                       N  Comment fragment
+                    ---------------------------------------------
+
+            The arrangement of the data in the comment file
+
+  As each new comment is added to the database, a check is made in the
+  file descriptor record at line 1. If the list of reusable records is
+  0, the file is extended to hold the new comment. Otherwise  the  new
+  comments  is  assigned  to  records  starting  with  the head of the
+  freelist.  When comments are  deleted,  the  discarded  records  are
+  added to the free list in the file descriptor record.
+
+        There  are  various  checks  within  the  programs  to protect
+  users from themselves:-
+  1.  All user input is checked for  errors  -  e.g.    reference   to
+  non-existent  gel readings or  contigs,  incorrect  positions in the
+  contig or gel readings.
+  2.  Before entering a gel reading the system checks to see if a file
+  of the same name has already been entered.
+  3.  Join will not allow the circularising of a contig.
+  5. Users may escape from any point in the program.
+  6. Help is available from all points in the program.
+
+
+  IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS DOING
+  ANYTHING  THAT  INVOLVES  CHANGING THE CONTENTS OF THE DATABASE. I.E
+  DURING AUTO ASSEMBLE, COMPLETE JOIN, COMPLEMENT  CONTIG,  SAVE  EDIT
+  CONTIG.   This  could  corrupt  the  database  so  badly  that it is
+  impossible to fix. The program should always be left using the  QUIT
+  option.
+ @4. TX 3 @Edit contig
+
+        The Contig Editor is a mouse-driven editor  that  can  insert,
+  delete and change gel reading sequences.
+
+        The Contig Editor allows scrolling from one end of a contig to
+  the  other  using the scroll bar and scroll buttons. Action of mouse
+  button presses when the mouse pointer is in the scroll bar:
+
+      Middle Mouse Button      Set editor position
+      Left   Mouse Button      Scroll forward one screenful
+      Right  Mouse Button      Scroll backwards one screenful
+
+  The four scroll buttons operate as follows:
+
+      "<<"                     Scroll left half a screenful
+      "<"                      Scroll left one character
+      ">"                      Scroll right one character
+      ">>"                     Scroll right half a screenful
+
+        The Editor cursor can  be  positioned  anywhere  in  the  edit
+  window  by  moving the mouse pointer over the character of interest,
+  then pressing the left mouse button. The Editor cursor can  also  be
+  moved by using the direction arrow keys.
+
+        The editor operates in two  main  edit  modes  -  Replace  and
+  Insert. Replace allows a character to be replaced by another. Insert
+  allows characters to  be  inserted  into  a  gel  reading  sequence.
+  Characters  are entered by typing them from the keyboard. Only valid
+  characters are permitted.  Characters can be deleted by  positioning
+  the cursor one character to the right, then pressing the delete key.
+  Normally Insert and Delete apply to the consensus line of the contig
+  ONLY.  This  restraint  can  be overridden by using the "Super Edit"
+  mode of operation, THOUGH IT IS NOT RECOMMENDED.
+
+        Edits can also be performed on the consensus, though they  are
+  restricted  to  insertion  and deletion of padding characters ("*").
+  These edits also have special meanings.  A deletion will delete  ALL
+  characters  at the position to the left of the cursor in the contig,
+  and move the relative positions of all  sequences  starting  to  the
+  right  of the cursor position left one character.  An insertion will
+  insert the character typed ("*") into ALL gel reading  sequences  at
+  the  cursors position in the contig, and move the relative positions
+  of all sequences starting to the right of the cursor position  right
+  one character.
+
+        The effect of the last edit can  be  undone  by  pressing  the
+  "Undo" button at the top of the editor window.
+
+        The cursor  will  automatically  be  positioned  at  the  next
+  problem  when  the  "Find Next Problem" button is selected. The next
+  problem is where the consensus shows either an ambiguity ("-") or  a
+  pad ("*") character.
+
+        The edits to the contig can be saved by  pressing  the  "Leave
+  Editor"  button and replying "Yes" to the prompt to "Save changes?".
+  As no changes are made to the working copy of your database til this
+  point it is possible to abort the editor if the edit session ends up
+  in an unsatisfactory state (ie if you've stuffed it up!)
+
+
+
+ Displaying Traces
+
+        The original data from which the gel reading  sequences  where
+  derived  can  be seen by double clicking (two quick clicks) with the
+  middle mouse button on the area  of  interest.  The  trace  will  be
+  displayed  with  the  point  clicked  at  the  centre  of  the trace
+  viewport.
+
+        All traces that are displayed are maintained  in  one  window,
+  called  the  Trace Manager. The Trace Manager will only display four
+  traces maximum. When four traces are already being managed and a new
+  one is requested, the one at the top of the Trace Manager is removed
+  and the new one is added to  the  bottom.   Traces  can  be  removed
+  individually  by  using  the  "quit" button in the panel next to the
+  trace.
+
+
+
+ Extending Reads Using Cutoff Information
+
+        Sequence data read in from  Automated  Fluorescent  sequencing
+  machines trace files processed through the program ted will have the
+  discarded sequence (vector at start and poor read at end)  available
+  to  the  contig editor. To display the cutoff information, press the
+  "Display Cutoff" button at the top of the editor window.  The cutoff
+  sequence appears in grey. This sequence can be incorporated into the
+  editable sequence, by moving the cutoff position. This  is  done  by
+  positioning  the  cursor  at  the end of the gel sequence, and using
+  Meta-Left-Arrow and Meta-Right-Arrow to adjust the point of  cutoff.
+  The Meta key is a diamond on the Sun keyboard.
+
+
+
+ Pop-up menu
+
+        A pop-up menu is revealed by depressing the "Control"  key  on
+  the  keyboard  and  at the same time pressing the left mouse button.
+  The menu has the following functions:
+
+      Search
+      Highlight Disagreements
+      Save Contig
+      Create Tag
+      Edit Tag
+      Delete Tag
+      Select Oligo
+
+  "Highlight Disaggreements" simply toggles between the normal display
+  showing  the  current  base  assignments and one in which only those
+  assignments that differ from the consensus are shown.
+  "Save Contig" is described above.  Searching and operations on  tags
+  are described below.
+
+
+
+  Searching
+
+        Selecting "Search" brings up a window which can remain present
+  during normal editor operation. The window allows the user to select
+  the direction of search, the type of search and a  value  to  search
+  on.   The value is entered into the value text window. Then pressing
+  the "search" button performs the search. If successful,  the  cursor
+  is  positioned  and  centred  accordingly. An audible tone indicates
+  failure.  Pressing the "ok" button removes the  search  window.  The
+  search  window  is  automatically  removed when the contig editor is
+  exited.
+
+  There are seven different search modes:
+
+  1. Search by position
+
+  This positions the cursor at the numeric position specified  in  the
+  value  text  window.  Eg  a  value of "1234" causes the cursor to be
+  placed at base number 1234 in the contig. Positioning withing a  gel
+  reading  is achieved by prefixing the number with the "@" character,
+  eg "@123" positions the cursor at base 123 of the sequence in  which
+  the  cursor  lies.  Relative positions can be specified by prefixing
+  the number with a plus or minus character. Eg "+1234"  will  advance
+  the  cursor 1234 bases. If possible, the cursor is positioned within
+  the same sequence.  The direction buttons  have  no  effect  on  the
+  operation of "search by position".
+
+  2. Search by reading name
+
+  This positions the cursor  at  the  left  end  of  the  gel  reading
+  specified  in the value text window. If the value is prefixed with a
+  slash is is assumed to be  a  gel  reading  name.  Otherwise  it  is
+  assumed to be a gel reading number. Eg "123" positions the cursor at
+  the left end of gel reading number 123.  "/a16a12.s1"  positions  at
+  the  start  of reading a16a12.s1. If the value was "/a16" the cursor
+  is positioned at the first reading which  starts  with  "a16".   The
+  direction  buttons  have  no  effect  on the operation of "search by
+  position".
+
+  3. Search by tag type.
+
+  This positions the cursor at the start of the next tag which has the
+  the  same  type  as  specified by the type value menu. To change the
+  type, select off the menu that pops up when the mouse is clicked  on
+  the  button  labeled  "Type:".  The  search  can be performed either
+  forwards or backwards of the current cursor position.  To  find  all
+  tags, use "search by annotation", with a null text value string.
+
+  4. Search by annotation.
+
+  This positions the cursor at the start of the next tag which  has  a
+  comment  containing  the  string specified in the value text window.
+  The search performed is a regular  expression  search,  and  certain
+  characters  have  special meaning. Be careful when your value string
+  contains ".", "*", "[", "^" or "$".  The  search  can  be  performed
+  either forwards or backwards from the current cursor position.
+
+  5. Search by sequence.
+
+  This positions the cursor at the start of the next piece of sequence
+  that  matches  the  value  specified  in  the text value window. The
+  search is for an exact match, which means the case of  value  string
+  is   important.   The  search  is  performed  on  the  gel  readings
+  themselves, rather than the consensus sequence. The  search  can  be
+  performed  either  forwards  or  backwards  from  the current cursor
+  position.
+
+  6. Search by problem.
+
+  This positions the  cursor  at  the  next  place  in  the  consensus
+  sequence  which  is  not  an "A", "C", "G" or "T". The search can be
+  performed either forwards  or  backwards  from  the  current  cursor
+  position.
+
+  7. Search by quality
+
+  This positions the  cursor  at  the  next  place  in  the  consensus
+  sequence  where the consensus calculation for each strand disagrees.
+  When only sequences on one strand is present, the search  will  stop
+  at  every  base.  The  search  can  be  performed either forwards or
+  backwards from the current cursor position.
+
+
+
+ Annotation
+
+        Parts of a sequence can be annotated, to record the  positions
+  of  primers used for walking, or to mark sites, such as compressions
+  that have caused problems during sequencing.  The consensus sequence
+  CANNOT be annotated.
+
+        To annotate a piece of  sequence  first  select  the  part  of
+  sequence  using  the  mouse  buttons.  Use  the left mouse button to
+  position the start of the selection, and while this button is  being
+  held  down, move the mouse to extend.  The selection can be extended
+  further using the right mouse button.
+
+        To create annotation, invoke the pop-up menu, and  select  the
+  "Create Tag" function. A small "tag editor" will appear which allows
+  you to select the type of the annotation from a pull-down menu,  and
+  specify  a  comment  if desired.  To select a new type pull down the
+  Type menu, and select the entry desired.  To enter a comment, simply
+  type  into  the  text  window  in the tag editor.  The annotation is
+  created when the "Leave" button on the tag editor, and is  displayed
+  in the colour defined in the tag database file (TAGDB).
+
+        To edit existing annotation, position the cursor with the left
+  mouse  button  on  the tag, and select the "Edit Tag" off the pop-up
+  menu.  This invokes the tag editor, and  changes  to  the  type  and
+  comment  of  the annotation can be made. The tag is updated when the
+  "Leave" button is pressed.
+
+        To delete an existing annotation, position the cursor with the
+  left  mouse  button  on the tag, and select the "Delete Tag" off the
+  pop-up menu.
+
+
+
+ NOTE:
+
+        As the Contig Editor is a very powerful tool, it  is  possible
+  that  the  alignment  of  the gel reading sequences has unexpectedly
+  been disrupted.  This can easily happen to parts of the contig  that
+  lie to the right of the screen if excessive use has been made of the
+  "Super Edit" facility.  Until familiar with "Super  Edit"  it  would
+  benefit  the  sequencer  to  quickly  scan  through the contig after
+  editing to check that bad alignments have not been created.
+
+  Selecting Oligos ----------------
+
+  1. Open the oligo selection window, by selecting "Select Oligo" from
+  the contig editor popup menu.
+  2. Position the cursor to where you want the  oligo  to  be  chosen.
+  While  the  oligo  selection  window is visible, you will still have
+  complete control over positioning  and  editing  within  the  contig
+  editor.
+  3. Indicate the strand for which you require an oligo. This is  done
+  by   toggling  the  direction  arrow  ("----->"  or  "<------"),  if
+  necessary.
+  3. Press the "Find Oligos" button to find all suitable  oligos  (See
+  "Oligo  selection" below.)  Information for the closest oligo to the
+  cursor position is given in the output text window.  In  the  contig
+  editor the position of the oligo is marked by a temporary tag on the
+  consensus. The window is recentered if the oligo is off the  screen.
+  Selecting  "Display Selection Information" will print a short report
+  on the numbers  of  oligos  considered  and  rejected  during  oligo
+  selection.
+  4. If this oligo is  not  suitable  (it  may  have  been  previously
+  chosen,  and  found  to  be unsuitable by experimentation, say), the
+  next closest oligo can be viewed by pressing "Select Next".
+  5. Suitable templates are automatically identified for the currently
+  displayed  oligo  (See  "Template selection" below.) By default, the
+  template is that closest to the oligo site. If  the  choice  is  not
+  suitable  (it  may  be  known  to  be  a poor quality template, say)
+  another can be chosen from the  "Choose  Template  for  this  Oligo"
+  menu.   Templates that do not appear on the menu can be specified by
+  selecting "other". However, the template  must  be  on  the  correct
+  strand and be upstream of the oligo.
+  6. A tag can be created for the current oligo by pressing the button
+  "Create a tag for this oligo". The annotation for this tag holds the
+  name of the template and  the  oligo  primer  sequence.   There  are
+  fields   to  allow  the  user  to  specify  their  own  primer  name
+  ("serial#") and comments ("flags") for this tag. An example of oligo
+  tag annotation:
+          serial#=
+          template=a16a9.s1
+          sequence=CGTTATGACCTATATTTTGTATG
+          flags=
+
+  7. The oligo selection window is closed when "Create a tag for  this
+  oligo" or "Quit" is selected.
+  Oligo selection:
+  ----------------
+  The oligo selection engine is the one used in the program OSP. It is
+  described in some detail in:
+  Hillier,  L.,  and  Green,  P.  (1991).  "OSP:  an   oligonucleotide
+  selection program," PCR Methods and Applications, 1:124-128.
+  The parameters controlling the selection of oligos can be changed in
+  the "Oligo Selection Parameters" window. The weights controlling the
+  scoring of selected oligos can be changed in  the  "Oligo  Selection
+  Weights" window.
+  By default, the oligos are selected from a window  that  extends  40
+  bases  either  side  of  the  cursor.  The size and location of this
+  window relative to  the  cursor  position  can  be  changed  in  the
+  "Parameters" window.
+  In xbap oligos are ranked according to their proximity to the cursor
+  position, rather than by their scores.
+  Template selection:
+  -------------------
+  For simplicity, each reading is considered to represent a  template.
+  In  practise,  many  readings  can  be  made  of  the same template.
+  Suitable templates that are identified are those that:
+
+      1. are in the appropriate sense,
+      2. have 5' ends that start upstream of the oligo,
+  and 3. are sufficiently close to the oligo to be useful.
+
+  This last criterion relates to the insert  size  for  the  subclones
+  used  for  sequencing  and the average reading length. A template is
+  considered useful if a full reading can be made from it, taking into
+  account  both  of  these  factors.  The  default insert size is 1000
+  bases, and the default average reading length is  400  bases.  These
+  values can be changed in the "Parameters" window.
+ @5. TX 1 @Display a contig
+
+        Used to show the aligned  gel  readings  for  any  part  of  a
+  contig.  The  number,  name  and strandedness of each gel reading is
+  shown and the consensus is written below.
+
+        If required identify the contig,  and then the start  and  end
+  points of the region to display.
+
+        The display can be directed  to  a  disk  file  using  "direct
+  output to disk".
+
+        Below is an example showing the left  end  of  a  contig  from
+  position   1  to  200.  Overlapping this region are gels 6,3,5,17and
+  12; 6, 3 and 5 are in reverse orientation to their archives (denoted
+  by  a  minus   sign)  There  are  a  few uncertainty codes and a few
+  padding characters  in  the  working  versions,  but  the  consensus
+  (shown  below  each page width) has a definite assignment for almost
+  every position.
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+ @6. TX 1 @List a text file
+
+        This option allows users to list text files on the screen.  It
+  can  be  used  to  read  a file containing notes, for checking files
+  written to disk etc. The user is asked to type the name of the  file
+  to list.
+ @8. TX 1 @Calculate a consensus
+
+        Calculates  a  consensus  sequence   either  for   the   whole
+  database or for selected contigs. The consensus is written to a file
+  named by the user.
+  Supply a file name,  choose  between   whole  database  or  selected
+  contigs.
+
+        Symbols for uncertainty in gel readings
+
+        In  order  to  record  uncertainties  when  reading  gels  the
+  codes  shown  below can  be  used. Use  of these codes permits us to
+  extract the maximum amount of data from each gel and yet record  any
+  doubts   by  choice   of   code.    The program can deal with all of
+  these codes and any other  characters  in  a  sequence  are  treated
+  as  dash  (-) characters.
+
+         SYMBOL                  MEANING
+
+           1             PROBABLY        C
+           2                "            T
+           3                "            A
+           4                "            G
+           D                "            C       POSSIBLY        CC
+           V                "            T          "            TT
+           B                "            A          "            AA
+           H                "            G          "            GG
+           K                "            C          "            C-
+           L                "            T          "            T-
+           M                "            A          "            A-
+           N                "            G          "            G-
+           R             A OR G
+           Y             C OR T
+           5             A OR C
+           6             G OR T
+           7             A OR T
+           8             G OR C
+           -             A OR G OR C OR T
+           a             A
+           c             C
+           g             G
+           t             T
+           *             padding character placed by auto assembler
+            else = -
+
+  The DNA consensus algorithm
+
+        The "calculate  consensus"  function,  the  "display   contig"
+  routine and the "show quality" option use  the rules  outlined  here
+  to  calculate  a consensus  from aligned gel  readings.   Note  that
+  "display  contig"  calculates a consensus for  each  page  width  it
+  displays  (it  does  not use the consensus sequence file  calculated
+  by the consensus function).
+
+        We  have  6  possible  symbols  in  the  consensus   sequence:
+  A,C,G,T,*  and -. The last symbols is assigned if none of the others
+  makes up a sufficient proportion of the aligned  characters  at  any
+  position  in the contig. The following calculation is used to decide
+  which symbol to place in the consensus at each position.
+
+        Each uncertainty code contributes a score to one of  A,C,G,T,*
+  and  also  to  the  total  at each point. Symbols like R and Y which
+  don't correspond to a single base type contribute only to the  total
+  at each point. The scores are shown below.
+                definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+                probable assignments ie 1,2,3,4 = 0.75
+
+                other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+
+        A cutoff score of 51% to 100% is supplied by the  user.  (When
+  the   program   starts   this  is  set  to  75%.  See  "set  display
+  parameters").  At each position in the contig we calculate the total
+  score  for  each of the 5 symbols A,C,G,T and * (denote these by Xi,
+  where i=A,C,G,T or *), and also the sum of these totals (denote this
+  by S). Then if 100 Xi / S > the cutoff for any i, symbol i is placed
+  in the consensus; otherwise - is assigned.
+
+        Notice that S does not equal the number of times the  sequence
+  has  been  determined, but is the score total, and hence we are less
+  likely to put a -  in  the  consensus.  For  the  "examine  quality"
+  algorithm  each  strand is treated separately but the calculation is
+  the same. (It was originally different).
+
+        Format of the consensus sequence ( and vector sequences).
+
+        A consensus  sequence  file  may  contain  the  consensus  for
+  several contigs and so we identify each of them by preceding them by
+  a 20 character title. The title is of the form  <---LAMBDA.0076---->
+  (  where LAMBDA is the project name and gel reading number 76 is the
+  leftmost gel reading to contribute to  this   consensus   sequence).
+  The   angle  brackets  <>  and the 4 digit number precede by a . are
+  important to some processing programs.
+ @25. TX 1 @Show relationships
+
+        Used to show the relationships of  the  gel  readings  in  the
+  database in three ways -
+  (a) All contig descriptor lines  followed  by  all  gel   descriptor
+  lines.
+  (b) All contigs one after the   other   sorted,   i.e.    for   each
+  contig   show  its   contig  descriptor line followed by all its gel
+  descriptor lines sorted on position from left to right
+  (c) Selected contigs:  show the contig  line  and,  in  order, those
+  gel  readings  that  cover  a  user-defined  region.  Note that this
+  output can be  directed  to  a  disk  file  by  prior  selection  of
+  "redirect output".
+
+        Below is an example showing a contig from position 1  to  689.
+  The left gel reading  is number 6 and has archive name HINW.010, the
+  rightmost gel  reading is number 2 and is has archive name HINW.004.
+  On  each  gel  descriptor  line  is  shown:  the name of the archive
+  version, the gel number, the position of the left  end  of  the  gel
+  reading  relative to the left  end  of  the  contig,  the length  of
+  the gel reading  (if this is negative it means that the gel  reading
+  is  in  the  opposite orientation to its archive), the number of the
+  gel reading   to the left and the number of the gel reading  to  the
+  right.
+
+
+   CONTIG LINES
+   CONTIG      LINE  LENGTH               ENDS
+                                       LEFT   RIGHT
+                 48     689               6       2
+   GEL LINES
+   NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                       LEFT   RIGHT
+   HINW.010       6        1   -279       0       3
+   HINW.007       3       91   -265       6       5
+   HINW.009       5      137   -299       3      17
+   HINW.999      17      140    273       5      12
+   HINW.017      12      193    265      17      18
+   HINW.031      18      385   -245      12       2
+   HINW.004       2      401   -289      18       0
+
+ @23. TX 3 @Complement a contig
+
+        This function will complement  and  reverse  all  of  the  gel
+  readings   in    a  contig.     It    automatically   reverses   and
+  complements  each  gel reading sequence,  reorders  left  and  right
+  neighbours,   recalculates   relative  positions  and  changes  each
+  strandedness.
+
+        The only user  input  required  is  to  identify  the   contig
+  to complement  by  the  number or name of a gel reading it contains.
+  DO NOT KILL THE PROGRAM DURING THIS STEP!
+ @22. TX 3 @          Join contigs
+
+        This function joins contigs interactively using a mouse driven
+  editor.   The operation of this editor is very similar to the Contig
+  Editor described in "Edit".
+
+        It allows the user  to align the ends of the  two  contigs  by
+  editing  each contig separately.  It is important that the alignment
+  achieved   is  correct  because  once  the  join  is  completed  the
+  alignment  is fixed.  The program needs to know which two contigs to
+  join.
+
+        First specify which two contigs are to be  joined.   The  user
+  should  identify  the  two  contigs. The program checks that the two
+  contig numbers are different  (it  will  not  allow  circles  to  be
+  formed!)
+
+        The Join Editor consists of  two  Contig  Editors  in  between
+  which  is sandwiched a disagreement box. This disagreement box shows
+  exclamation marks to denote mismatches between the two consensuses.
+
+        For example, the display will look something like this:
+
+                           1460      1470      1480      1490      1500
+     56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+     33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+    -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+     19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+         MISMATCH                         !                      !!!!!!
+                             10        20        30        40        50
+     -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+     -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+     -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+
+
+        The overlap must be of at least one character.  Use the scroll
+  bar  and  the  scroll buttons (`<<',`<',`>',and`>>') for positioning
+  the relative positions of the two contigs.
+
+        The join position can be fixed in  position  by  pressing  the
+  `lock' button at the top of the Join Editor.  Locking allows the two
+  contigs to be scrolled as one when using the scroll bar and buttons,
+  the left ends always in the same position relative to each other.
+
+        Once locked, it is best to proceed  to  the  right  along  the
+  contigs,  inserting padding characters (`*') into the consensuses to
+  minimise the disagreements.
+
+        It  is  essential  that  the  user  aligns  the  two   contigs
+  throughout  the  whole  region of overlap before completing the join
+  because it is only at this stage that the two contigs can be  edited
+  independently.  Once the join is completed the alignment can only be
+  altered using the routines supplied by "alter relationships".
+
+        The join can be  completed  by  pressing  the  `Leave  Editor'
+  button.  The  percentage  mismatch  is  displayed,  and  the user is
+  required to confirm that they want to perform the join.
+ @24. TX 1 @               Copy the database
+
+        Used to make a copy of the database. If required the  database
+  size  can  be altered using this option. The "version" of a database
+  is  encoded as the last letter in the names of the five  files  that
+  contain the database.
+
+        Supply a "version" number (the default is version 1),  and  if
+  required  select a new size for the database. The size of a database
+  is the number of lines of information it can hold. It needs  a  line
+  for each gel reading and another for each contig.
+ @19. TX 1 @               Check database
+
+        Used to perform a check on  the  logical  consistency  of  the
+  database.  No  user  intervention  is  required.  If  selected "with
+  dialogue" the program also checks for any sections of the  consensus
+  that contain 15 dashes in 20 characters.
+
+        The following relationships are checked:
+  1.       If gel reading A thinks gel reading B is its left neighbour
+  does B think A  is its right neighbour?  The error message is
+  "Hand holding problem for gel reading A"
+  followed by  the gel descriptor lines for gel readings A and B.
+  2.       Are there any contig lines with no left or  right  end  gel
+  readings?  The error message is
+  "Bad contig line number A"
+  3.       Do the gel readings that are  described  as  left  ends  on
+  contig lines agree that they are left ends?  The error message is
+  "The end gel readings of contig A have outward neighbours"
+  4.       Are there gel readings that are in more  than  one  contig?
+  The error message is
+  " Gel number A is used N times"
+  5.       Are there gel readings that are not  in  any  contig?   The
+  error message is
+  " Gel number A is not used"
+  6.       Do the relative positions of   gel  readings   agree   with
+  their  position  as  defined by left and right neighbourliness?  The
+  error message is
+  " Gel number A with position X is left neighbour of  gel  number   B
+  with position Y"
+  7.       Are there any loops in   contigs?    If   so   no   further
+  checking is done.  The error message is
+  " Loop in contig n no further checking done, but gel reading numbers
+  follow"
+  The program  then  prints the gel  reading  numbers  in  the  looped
+  contig up to the start of the loop.
+  8. Are there any contigs of length <1? The error message is
+  " The contig on line number x has zero length"
+  9. Are there any gel readings (used in only one  contig)  that  have
+  zero length? The error message is
+  " Gel number N has zero length"
+  Note that "auto assemble"  also uses this logical consistency  check
+  and will only tolerate a "Gel number N is not used" error. Any other
+  error will cause it to give up.
+ @29. TX 1 @               Examine quality
+
+        Analyses the quality of the data in a contig.  It  reports  on
+  the  proportion  of the consensus that is "well determined" and will
+  display a sequence of symbols  that  indicate  the  quality  of  the
+  consensus at each position.
+
+        Identify the contig to analyse, and the section  of  interest.
+  The  current  consensus  calculation  cutoff  score  will be used to
+  decide if each position is "well determined". In general the quality
+  of  a  reading deteriorates along the length of the gel and so it is
+  also possible to use a length cutoff for  the  quality  calculation.
+  Only  the  data  from  the  first  section  of  each reading will be
+  included in the quality calculation. The  length  is  altered  under
+  "set parameters" and is initially set to the maximum reading length.
+  A summary showing the percentage of the consensus  that  falls  into
+  each category of quality is shown. Choose whether or not to have the
+  quality codes for each position of the consensus displayed. They can
+  be displayed as either graphics or text.
+
+        The quality of the data depends on the number of times it  has
+  been  sequenced  and the particular uncertainty codes  used  in each
+  gel reading.  This function divides the data into  five  categories,
+  assigning each a symbol or code:
+  1.  Well determined on both strands and they agree.  code=0
+  2.  Well determined on the plus strand only.  code=1
+  3.  Well determined on the minus strand only.  code=2
+  4.  Not well determined on either strand.  code=3
+  5.  Well determined on both strands but they disagree.  code=4
+  A position is "well determined" if it is assigned one of the symbols
+  A,C,G,T  when  the  algorithm  described in the section "calculate a
+  consensus".   The  calculation  is  performed  separately  for  each
+  strand.
+
+        If the user chooses to have the data displayed graphically the
+  following  scheme  is used. A rectangular box is drawn so that the x
+  coordinate  represents  the  length  of  the  contig.  The  box   is
+  notionally divided vertically into 5 possible levels which are given
+  the y values: -2,-1,0,1,2.  The quality  codes  attributed  to  each
+  base  position are plotted as rectangles.  Each rectangle represents
+  a region in which the quality codes are identical, so a single  base
+  having a different code from its immediate neighbours will appear as
+  a very narrow rectangle.
+
+    Rectangle bottom and top y values
+
+       Quality 0 rectangle from 0 to 0
+       Quality 1 rectangle from 0 to 1
+       Quality 2 rectangle from 0 to -1
+       Quality 3 rectangle from -1 to 1
+       Quality 4 rectangle from -2 to 2
+
+        Obviously a single line  at  the  midheight  shows  a  perfect
+  sequence.
+
+        Typical dialogue is shown below.
+
+     41.47% OK on both strands and they agree(0)
+     55.48% OK on plus strand only(1)
+      2.08% OK on minus strand only(2)
+      0.97% Bad on both strands(3)
+      0.00% OK on both strands but they disagree(4)
+    ? (y/n) (y) Show sequence of codes
+
+             10         20         30         40         50
+     1111111111 1111111111 1111111111 1111111111 1111111111
+
+             60         70         80         90        100
+     1111111111 1111111111 1111111111 3111111111 1111111111
+
+            110        120        130        140        150
+     1111111111 1111131111 1111111111 1111111111 1111111111
+
+            160        170        180        190        200
+     1111111111 1111111111 1111111111 1111111111 1111111133
+
+            210        220        230        240        250
+     1311111111 1111111111 1111111110 0000000000 0000220000
+
+            260        270        280        290        300
+     0000000000 0020000000 2200000202 0002000000 0000222200
+
+ @26. TX 3 @              Alter relationships
+
+        Used  to  make  what  are  normally  illegal  changes  to  the
+  database. That is the normal checks are not done and any item in the
+  database can be changed independently of all others. Users  need  to
+  know  what they are doing because it is very easy to make a horrible
+  mess. Always start by making a copy!
+
+        By using the options here users can  move  one  section  of  a
+  contig  relative  to  another, break contigs, remove contigs, remove
+  gel readings, etc. To give flexibility most of the commands do  only
+  one  thing. This means that several commands may have to be executed
+  to complete any change.
+
+        The following options are offered:
+
+     Cancel
+     Line change
+     Check logical consistency
+     Remove contig
+     Shift
+     Move gel reading
+     Rename gel reading
+     Break a contig
+     Remove a gel reading
+     Alter raw data parameters
+
+  1. QUIT returns to the main options of BAP.
+  3. Line change
+  allows the user to change the contents  of  any line in the file  of
+  relationships.   The  line is selected by number, the program prints
+  the current line and prompts for the new  line.
+  4.  Check logical consistency
+  5. Remove a contig
+  This function removes a contig and all its gel  readings.  The  user
+  specifies any reading in the contig.
+  6.  Shift
+  allows the user to change all the relative  positions of  a set   of
+  neighbouring  gel  readings by some fixed value, i.e.  it will shift
+  related gel readings either left or  right.   It  can  therefore  be
+  used   to  change the alignment of the gel readings in a contig.  It
+  prompts for the number of the first gel reading to shift  and   then
+  for  the   distance  to  move  them (Note a negative value will move
+  the gel readings left and a positive value right).   It  then chains
+  rightwards  (ie  follows  right  neighbours)  and  shifts  each  gel
+  reading,  in  turn,  up to the  end of the contig.  (This means that
+  only those gel readings from the first to shift to the rightmost are
+  moved). It updates the length of the contig accordingly.
+  7. Move gel reading
+  is  a  function  to  renumber  a  gel  reading.  It  moves  all  the
+  information  about  a  gel reading on to another line. The user must
+  specify the number of the gel  reading to move and the number of the
+  line  to place it. It takes care of all the relationships. Of course
+  gel readings must not be  moved  to  lines  occupied  by  other  gel
+  readings!
+  8.  Rename gel reading
+  is a function that is used to  rename  the archive   names   of  gel
+  readings   in  the  database;   it only changes the name in the .ARN
+  file of the  database.
+
+  9. Break contig
+
+        Occasionally it is necessary to break a contig into two  parts
+  and  this  can be achieved using this option. The program needs only
+  the number of a gel reading. This is  the  gel   reading  that  will
+  become  a  left  end  after  the  break.  That is, the break is made
+  between this gel reading and its left neighbour. A new  contig  line
+  is created so ensure that there is sufficient space in the database.
+  10. Removing gel readings from contigs
+
+        Gel  readings  can  be  removed  from  contigs.  If  they  are
+  essential  for  holding  the  contig  together  (ie are the only gel
+  reading covering a particular region), the program will create a new
+  contig.
+
+  11. Alter raw data parameters
+
+        Allows the user to edit the individual  raw  data  parameters,
+  such  as  the  left  and  right  cutoff  lengths and the name of the
+  machine readable trace file.  The user must specify the gel line  to
+  modify,  and  provide  new values for the length of the raw sequence
+  including cutoff lengths, the left cutoff position,  the  length  of
+  the original working sequence, the machine type, and the name of the
+  raw data file, where these values change.
+ @27. TX 1 @  Set display parameters
+
+        Used to  redefine  the  parameters  that  control  the  cutoff
+  employed  by  the  consensus  calculation  and quality examiner, the
+  maximum  length  of  each  reading  to  include   in   the   quality
+  calculation,  the line length used by the display function, the text
+  window length used by the graphics options, and the graphics  window
+  length used by the graphics options.
+
+        The default cutoff score is 75%. The default line length is 50
+  characters. For protein sequences the cutoff is always 100%.
+
+        The text window used by  the  graphics  options  controls  the
+  amount  of  sequence  listed at the crosshair position. The graphics
+  window controls the "zoom" function. Both these windows are  defined
+  as  the number of bases that should be shown, to both left and right
+  of the crosshair.
+ @30. TX 3 @  Shuffle pads
+
+        One weakness of the alignment strategy used  is  that  padding
+  characters  are  not  always  aligned  by the assembly routine. This
+  function attempts to align padding characters using  a  very  simply
+  strategy.  It  does  not  solve  all pad alignment problems but is a
+  useful first step during cleaning-up operations.
+ @10. TX 2 @Clear graphics
+
+        Clears graphics from the screen.
+ @11. TX 2 @Clear text
+
+        Clears  text from the screen.
+ @12. TX 2 @Draw a ruler.
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first base  to  be
+  marked  (for  example if the active region is 1501 to 8000, the user
+  might wish to mark every 1000th base starting at either 1501 or 2000
+  -  it  depends  if  the user wishes to treat the active region as an
+  independent unit with its own numbering starting at its  left  edge,
+  or  as  part  of  the  whole sequence). The user can also define the
+  separation of the ticks on the scale and their height.  If  required
+  the labelling routine can be used to add numbers to the ticks.
+ @14. TX 2 @Reposition plots
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "ANALMARG" when
+  the program is started. Users can have their own file  if  required.
+  As  all  the plots start at the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. Note that,
+  unlike all the other programs, the boxes used to contain  analytical
+  results (eg plot quality) should not be made to overlap one another,
+  as the function of the crosshair routine depends on  which  box  the
+  crosshair is in!
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+
+        Typical dialogue follows.
+  ? Menu or option number=15
+  Type label then drive cross hair to left or right end
+  of label position then hit  "L"  to  write label left
+  justified or  "R"  to  write label right justified or
+  the space bar to quit
+
+
+  ? Label=delta gene
+
+   missing graphics
+
+  ? Label=
+
+ @16. TX 2 @Display a map
+
+        This is disabled!
+ @7. TX 1 @Redirect output
+
+        Used to direct output that would normally appear on the screen
+  to a file and to create postscript output.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @13. TX 2 @Use crosshair
+
+        This option puts a steerable cross on  the  screen  which  the
+  user  drives  around  by  using  the arrow keys (or mouse). When the
+  crosshair is visible a number of options are available if  the  user
+  types  one  of  a  set  of  special  keyboard  characters. Any other
+  characters will cause an exit from the crosshair option. The special
+  keys are:
+
+      I = Identify the nearest gel reading
+      Z = Zoom in
+      Q = plot Quality
+      S = display the aligned Sequences at the crosshair position
+      N = list the Names and Numbers of the sequences at the crosshair
+
+        In order for  any  of  these  special  keys  to  operate,  the
+  crosshair  must  be  in  an appropriate display box, and the precise
+  function of the keys will also depend on which box the crosshair  is
+  in.
+
+        If the crosshair is in the "plot  all  contigs"  box,  Z  will
+  cause  a  new box to appear showing all the readings for the nearest
+  contig; Q will give the same as Z but will also produce an extra box
+  showing the "quality" plot.
+
+        If Z is hit in the "plot single contig" box, the  contig  will
+  be  zoomed  to  the  current  graphics window size. The zoom will be
+  roughly centred on the crosshair position. Because  of  this  it  is
+  possible  to  step  along  a  contig  by repeatedly zooming with the
+  crosshair near to one end of the single contig display box. If I  is
+  hit  the crosshair must be close to a gel reading line. If Q is hit,
+  the quality plot will be produced for the region shown in  the  plot
+  single  contig  box. In all cases when the "plot all contigs" box is
+  shown, a vertical line will  bisect  the  line  the  represents  the
+  relevant contig, at the current position.
+
+        If the crosshair is in the plot quality box only the character
+  "s" will operate as a special symbol.
+
+        The number of bases shown in the N and S options is controlled
+  by  the  current graphics text window size, and the size of the zoom
+  window by the current graphics window size.  Both  are  set  by  the
+  parameter setting function of the general menu.
+ @33. TX 2 @Plot single contig
+
+        This option produces a schematic of a  selected  region  of  a
+  single  contig by drawing a horizontal line to represent each of its
+  gel readings. The lines show the relative positions of each  reading
+  and  also  their  sense.  The  plot  is  divided vertically into two
+  sections by a line that is identified by an asterisk drawn  at  each
+  end.  All lines that lie above this line represent readings that are
+  in their original sense, all lines below show readings that  are  in
+  the  complementary  sense to their original. By use of the crosshair
+  function the plot can  be  stepped  through  and  examined  in  more
+  detail. See help on crosshair.
+ @34. TX 2 @Plot all contigs
+
+        This option produces a schematic  of  all  the  contigs  in  a
+  database.  It  does  this  by drawing a horizontal line to represent
+  each of them. In order to show the ends of each contig it draws  the
+  lines for contigs at alternate heights: the first at height one, the
+  second at height two, the third at height one, etc. The order of the
+  contigs  in  the display is the same as their order in the database.
+  By use of the crosshair function the plot can be stepped through and
+  examined in more detail. See help on crosshair.
+ @31. TX 3 @ Disassemble readings
+
+        This function is used to remove a  list  of  readings  from  a
+  database, or to create a new contig from a single reading moved from
+  an existing contig.  This latter mode is useful for repositioning  a
+  reading  in  a  repeat:  once separated it can be placed in the join
+  editor and scrolled  by  the  other  copies.   Removal  of  sets  of
+  readings  works  in  two  modes:  1. A set of adjacent readings in a
+  contig can be removed by the user naming the two end ones; or  2.  A
+  batch  of  readings from any number of contigs can be defined by the
+  user naming a file containing a list of reading names.  The  program
+  cleans  up  the database by moving data to fill up any holes made in
+  the files.
+
+        For both modes of operation the program will ask for a file of
+  file names.  If users create their own file (ie mode 2) each reading
+  NAME must be on a separate line. For mode 1 the user types the NAMES
+  of  the  leftmost and rightmost readings to be removed. They and all
+  intervening readings will be removed. Note that the routine operates
+  on  reading  names  - not numbers. For both modes, if necessary, new
+  contigs will be created.
+ @35. TX 1 3 @Find internal joins
+
+        The purpose of this function is to use  data  already  in  the
+  database  to  find  possible  joins between contigs.  Joins may have
+  been missed due to poor data or  may  have  not  been  made  due  to
+  repeated  sequences.  Where  appropriate, it may be possible to find
+  potential joins by using the "unused data" derived  from  sequencing
+  machines.
+  For all overlaps found when the X version is used, the contig editor
+  (in join mode) will be called up with the two contigs aligned.
+  The database is checked for logical consistency.  Supply  a  minimum
+  initial  match  length,  a minimum alignment block, the maximum pads
+  per sequence, the maximum  percent  mismatch  after  alignment,  the
+  probe length. Choose if clipped data is to be used, if so define the
+  window size for finding good data and the number of  dashes  allowed
+  in  the  window. Processing will commence.  Most of these values are
+  used in an identical way in the autoassemble  function.  The  others
+  are defined below.
+  The program strategy
+  Take the first contig and calculate its consensus. If  clipped  data
+  is  being  used  examine  all readings that are in the complementary
+  orientation, and sufficiently near to the contigs left end,  to  see
+  if  they have good clipped sequence which if present, would protrude
+  from the left end of the contig.  If  found  add  the  longest  such
+  sequence to the left end of the consensus. Do the same for the right
+  end by examining readings that are in their original orientation. If
+  any  are  found  add  the  longest extension to the right end of the
+  consensus. Repeat the consensus calculations and extensions for  all
+  contigs  hence  producing  an extended consensus. If clipped data is
+  not  being  used  simply  calculate  the  consensus  for  the  whole
+  database.  Now  look  for  possible joins by processing the extended
+  consensus in the following  way.  Take  the  last,  say  100,  bases
+  (termed  the  "probe  length"  by  the  program)  of  the  rightmost
+  consensus, compare it both orientations with the extended  consensus
+  of  all the other contigs. Display any sufficiently good alignments.
+  Repeat with the left end of the rightmost contig. Do  the  same  for
+  the ends of all the entended contigs, always only comparing with the
+  contigs to their left, so that the same matches do not appear twice.
+  Good cliped data is defined by sliding a window of "Window size  for
+  good  data scan" bases outwards along the sequence and stopping when
+  "Maximum number of dashes in scan window" or more dashes  appear  in
+  the  window.   Note that it is advisable to have some sort of cutoff
+  because if we simply take all the  data  it  might  be  so  full  of
+  rubbish  that  we wont find any good matches. For the same reason it
+  is worth trying the procedure with different cutoffs. An initial run
+  using  no  clipped  data  is  also  recommended.   Sufficiently good
+  alignments are defined by  criteria  equivalent  to  those  used  in
+  autoassemble,  however here we only display alignments that pass all
+  tests.
+  Bugs
+  If a small contig is wholly contained within a larger one, such that
+  its  ends  are further than ("Probe length" - "Minimum initial match
+  length") from the ends of the larger contig, and the  consensus  for
+  the small contig lies to the left of the consensus for large contig,
+  the overlap will not be discovered. (See the search stratgey).
+  All numbering is relative to base number one in the contig:  matches
+  to  the  left  (i.e.  in  the clipped data) have negative positions,
+  matches off the right end of the contig (i.e. in the  clipped  data)
+  have  positions  greater  than  that  of  the  contig  length.   The
+  convention for reporting the positions of overlaps is as follows: if
+  neither  contig needs to be complemented the positions are as shown.
+  If the program says "contig x in the -  sense"  then  the  positions
+  shown  assume  contig  x  has  been complemented. For example in the
+  results given below the positions  for  the  first  overlap  are  as
+  reported,  but  those  for  the second assume that the contig in the
+  minus sense (i.e. 443) has been complemented.
+
+
+   Possible join between contig   445 in the + sense and contig   405
+   Percentage mismatch after alignment =  4.9
+          412        422        432        442        452        462
+       405  TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAG,TT AGCTCACTCA
+             ********* * ********  ***** *** ********** ********** **********
+       445  -TTCCCGACT G,AAAGCGGG TAGTGA,CGC AACGCAATTA ATGTGAG-TT AGCTCACTCA
+         -127       -117       -107        -97        -87        -77
+          472        482        492        502        512
+       405  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+            ********** ********** ********** ********** **
+       445  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+          -67        -57        -47        -37        -27
+   Possible join between contig   443 in the - sense and contig   423
+   Percentage mismatch after alignment = 10.4
+           64         74         84         94        104        114
+       423  ATCGAAGAAA GAAAAGGAGG AGAAGATGAT TTTAAAAATG AAACG-CGAT GTCAGATGGG
+            **** ***** ********** ********** ******  ** ***** **** *********
+       443  ATCG,AGAAA GAAAAGGAGG AGAAGATGAT TTTAAA,,TG AAACGACGAT GTCAGATGG,
+         3610       3620       3630       3640       3650       3660
+          124        134        144        154        164
+       423  TTG-ATGAAG TAGAAGTAGG AG-AGGTGGA AGAGAAGAGA GTGGGA
+            *** ****** ********** ** *******  *** ***** ** **
+       443  TTGGATGAAG TAGAAGTAGG AGGAGGTGGA ,GAG,AGAGA GTTGG-
+         3670       3680       3690       3700       3710
+
+
+ @36. TX 3 @Double strand
+
+        PLEASE MAKE A COPY OF THE DATABASE BEFORE USING THIS OPTION AS
+  IT HAS CURRENTLY HAD VERY LITTLE TESTING.
+
+        Uses the cutoff data to change single stranded sections  of  a
+  contig  into double stranded sections. Data is used carefully to try
+  and minimise the number of data disagreements  created.  However  it
+  must  be  noted  that  an overall slight degradation in quality will
+  still occur.
+
+        When using this option you will be prompted for a contig and a
+  region  within that contig. The default region is the entire contig.
+  The option will then search through the region  for  areas  of  good
+  data on one strand and cutoff data on the opposite strand, extending
+  the cutoff data. The criteria for evaluating the  amount  of  cutoff
+  data  to  be used is based upon a maximum number of mismatches and a
+  score  (derived  by  accumulating  points   for   mismatches   (-8),
+  matches(+1)  and  insertions  (-5) over the length of an alignment).
+  The defaults are:
+
+  maximum mismatches      :  6
+
+  score for mismatch      : -8
+  score for correct match : +1
+  score for insertion     : -5
+
+        Note that with successive calls to this option it is  possible
+  to  double  strand more and more data. Naturally however the quality
+  of the data generated will diminish each time.
+ @37. TX 3 @Auto-select oligos
+
+        PLEASE MAKE A COPY OF THE DATABASE BEFORE USING THIS OPTION AS
+  IT HAS CURRENTLY HAD VERY LITTLE TESTING.
+
+        Generates a file (default "primers") of suggested  primers  to
+  use  for  covering  a single stranded section or for walking off the
+  end of a contig. The file generated contains the gel  reading  name,
+  the  primer sequence, it's offset in the contig and the orientation.
+  An example file would be :
+
+  c81d12.s1 TTGTCTGTAAGCGGATG (@ 6449 ) +
+  c98a10.s1 ATTATCACTTTACGGGTC (@ 6959 ) +
+  c81c1.s1 CAAGAAGGCGATAGAAG (@ 7643 ) +
+  c76a10.s1 CCTCATCCTGTCTCTTG (@ 8441 ) +
+  c81g4.s1 ATGAAACCTGGGCGTTG (@ 16156 ) +
+  c91e6.s1 GTTTTCAGATGTCGGAG (@ 18249 ) +
+  c81e12.s1 GCTACCGTAAAACACTTC (@ 18737 ) +
+  c93h11.s1 GCTGCTTTTTGTTTTATCC (@ 19158 ) +
+  c81h6.s1 CTTCCACTTCTTTCTTATC (@ 21210 ) +
+  c86a12.s1 CGAATGATAAAGACAAATCAG (@ 22122 ) +
+  c98b1.s1 GCCACTTTATCCGAGAC (@ 3048 ) -
+  c97c5.s1 GTGTTTTGGGTATATTGTG (@ 3371 ) -
+  c83d2.s1 CTACACAGAATGAACCC (@ 3768 ) -
+  c78h10.s1 GGCGGTGAAGATTGAAG (@ 4200 ) -
+  c98h9.s2dt CTCGTTTAAATTTCAAACTTCC (@ 7419 ) -
+  c95a9.s1 ATTGGAAGGAAGGAGGG (@ 22996 ) -
+  c82b4.s1 TGTAGCCGAAATCTTCC (@ 23369 ) -
+
+        This is best employed after having previously used the 'Double
+  strand' option.  When selecting the option you will be asked for the
+  contig, a region within this contig and the file to write  the  list
+  of  primers  to.  For  each  primer suggested a tag is automatically
+  created containing details of the gel reading name and the sequence.
+  Preferably the tag will be created on the gel reading from which the
+  primer was selected. However this is not always possible so  failing
+  that  the  tag  will  be  on another sequence overlapping the primer
+  position.
+
+        When invoked with the dialogue option  you  will  be  asked  a
+  couple  more  questions  relating  to  the  position and size of the
+  consensus checked for suitable oligos. You will be prompted for  the
+  start and end of a region (default 40-140) at a relative position to
+  the left of our initial region.
+
+        For example:
+
+  ? Menu or option number=d37
+   Auto-select oligos
+   Default Contig identfier=/e97f2.s1
+   ? Contig identfier=
+   ? Start position in contig (1-20942) (1) =10000
+   ? End position in contig (10000-20942) (20942) =11000
+   Default Name of file for primers=primers
+   ? Name of file for primers=
+   ? Start of oligo choice region (1-1024) (40) =50
+   ? End of oligo choice region (50-1024) (150) =150
+
+
+        This implies that we are going to look for oligos  to  use  as
+  primers covering the region 10000 to 11000. For each single stranded
+  section in this region we search for the oligos at  between  50  and
+  150  to  the left. So if we had a single stranded section from 10121
+  to 10295 we would search for oligos in the region 9971 to 10071.
+ @38. TX 1 @Check assembly
+
+        This new function is used  for  checking  the  positioning  of
+  assembled  readings.   It  is  useful  for  checking  sequences that
+  contain repeats of length similar to that of a single  gel  reading.
+  It  takes  the poor quality data for each reading and compares it to
+  the segment of the consensus  to  which  it  should  align.  If  the
+  extension of the read does not match the consensus then the read (or
+  its neighbours) has probably been assembled into  the  wrong  place.
+  The  program  displays  the  bad  alignments.   The  quality  of  an
+  alignment is defined by the percentage mismatch.  Naturally the user
+  should  select  a  value that takes into account the poor quality of
+  the data being aligned.
+
+        When the routine is used  from  the  X  version  the  user  is
+  offered  the  editor  to  examine poor alignments. If alignments are
+  reported as poor, but on inspection are OK, the user can set  a  tag
+  so  that  the  poor  quality  data  is  ignored on subsequent passes
+  through the routine. Note  however  such  data  will  then  also  be
+  ignored by the automatic double stranding routine!
+
+        The user defines the percentage mismatch; the window size  and
+  number of dashes allowed in the window used for selecting the amount
+  of the poor data to be employed; can choose to save the names of the
+  poorly  aligned  reads in a file; can select an individual contig or
+  scan the whole database.  The  file  containing  the  names  of  the
+  poorly  aligned  reads  can  be  used  by the disassembly routine to
+  remove them from the database, and then can be  used  to  reassemble
+  them.  Note  that  the  routine complements each contig twice during
+  processing.
+ @39. TX 1 @Find read pairs
+
+        This new function is used to check the positions  of  readings
+  taken  from  each end of the same template. For each forward read it
+  searches for a corresponding reverse reading. The search can be over
+  the  whole  database  or  over  a single contig.  The results can be
+  presented graphically for single contig searches and  the  crosshair
+  function can be used to identify the readings displayed.
+
+        Note that at present the function only knows  that  two  reads
+  are from the same template by comparing reading names. For our local
+  projects we use the following naming convention: forward  reads  are
+  named abcdefgh.s1 and reverse reads abcdefgh.r1. The program expects
+  this naming convention and so if it finds read fred.s1  and  fred.r1
+  it assumes they are the forward and reverse reads for template fred.
+  In the future we will make the routine more general!
+
+        If a single contig is selected and the output  is  listed  the
+  program  displays  two lines for each pair: the first line shows the
+  reading name, its position and length, and the distance between  the
+  extremeties  of  the two reads; the second line shows the other read
+  name, its position and length.  If  there  are  pairs  that  are  in
+  separate contigs or are facing away from one another they are listed
+  after the pairs that face inwards.  Is this true?
+
+        If the results are plotted the full length of the template  is
+  drawn  with  arrows indicating the direction of reads and the extent
+  of each reading. Those reads that  have  their  partner  in  another
+  contig are marked by asterisks.
+
+        Typical dialogue is shown below.
+
+   ? Select contigs (y/n) (y) =
+   Default Contig identifier=/i55d8.s1
+   ? Contig identifier=
+   ? Start position in contig (1-15227) (1) =
+   ? End position in contig (1-15227) (15227) =
+   ? Plot results (y/n) (y) = n
+      852 k23a1.r1            249   238  1615
+      806 k23a1.s1           1529  -335
+      238 i68e6.s1            422   193  1632
+      868 i68e6.r1           1756  -298
+      576 k17a2.s1           2370   213  1676
+      885 k17a2.r1           3790  -256
+       84 k27g6.s1           3456   291  1777
+      867 k27g6.r1           4905  -328
+      453 k01g10.s1          5805   142  1251
+      881 k01g10.r1          6909  -147
+      781 i98b8.r1           6754   338  1079
+       10 i98b8.s1           7653  -180
+      883 k02d11.r1          7327   276  1597
+      283 k02d11.s1          8726  -198
+      269 i68f9.s1           8191   169  1055
+      777 i68f9.r1           8891  -355
+      710 i91c6.s1           8245    95  1516
+      780 i91c6.r1           9403  -358
+      596 k27d12.s1           136   329  -329
+      219 k27d12.r1             1  -116
+      159 k27d11.r1          1830  -263  -263
+      317 k27d11.s1          2902   343
+      886 k17g11.r1          7107  -123  -123
+      647 k17g11.s1          1867   265
+      851 i69g10.r1          8045  -137  -137
+      277 i69g10.s1          4658   174
+
+        If contigs are not selected the  pairs  are  sorted  on  their
+  separations.
+
+   ? Select contigs (y/n) (y) = n
+   i68f2.s1            27  1781  1777
+   i68f2.r1           776   111  1777
+   k17f6.s1           601    60  1706
+   k17f6.r1           856  1405  1706
+   k17a2.s1           576  2370  1676
+   k17a2.r1           885  3790  1676
+   k27g3.s1           177 14985  1664
+   k27g3.r1           889 13564  1664
+   k27b12.s1          764     1  1086
+   k27b12.r1          857   932  1086
+   i98b8.s1            10  7653  1079
+   i98b8.r1           781  6754  1079
+   k16a3.s1           748  1276  1070
+   k16a3.r1           784   472  1070
+   k17b7.r1           786 14937 18942*
+   k17b7.s1           787  3601 18942*
+   k27d12.r1          219     1 15208*
+   k27d12.s1          596   136 15208*
+   k01g2.s1           502    87 14754*
+   k01g2.r1           782  9224 14754*
+
+ @ end of help
+
diff --git a/help/bap_menu b/help/bap_menu
new file mode 100644
index 0000000..35a1dd3
--- /dev/null
+++ b/help/bap_menu
@@ -0,0 +1,84 @@
+-1 0 21 2 T General
+-1 0 21 2 X General
+-2 0 50 2 T Screen control
+-2 0 71 2 X Screen
+-3 0 98 2 T Modification
+-3 0 98 2 X Modification
+0 -1 116 332 T BAP
+0 -1 116 332 X BAP
+17 1 17434 18 T Screen against enzymes
+17 1 17434 18 X Screen against enzymes
+18 1 18477 23 T Screen against vector
+18 1 18477 23 X Screen against vector
+20 3 19859 121 T Auto assemble
+20 3 19859 121 X Auto assemble
+28 1 26426 43 T Highlight disagreements
+28 1 26426 43 X Highlight disagreements
+32 3 28846 17 T Extract gel readings
+32 3 28846 17 X Extract gel readings
+1 0 29607 3 T Help
+1 0 29607 3 X Help
+2 0 29676 5 T Quit
+2 0 29676 5 X Quit
+3 1 29869 230 T Open a database
+3 1 29869 230 X Open a database
+4 3 41499 320 T Edit contig
+4 3 41499 320 X Edit contig
+5 1 56688 43 T Display a contig
+5 1 56688 43 X Display a contig
+6 1 58990 6 T List a text file
+6 1 58990 6 X List a text file
+8 1 59248 93 T Calculate a consensus
+8 1 59248 93 X Calculate a consensus
+25 1 63707 41 T Show relationships
+25 1 63707 41 X Show relationships
+23 3 65650 11 T Complement a contig
+23 3 65650 11 X Complement a contig
+22 3 66173 59 T Join contigs
+22 3 66173 59 X Join contigs
+24 1 69194 11 T Copy the database
+24 1 69194 11 X Copy the database
+19 1 69740 43 T Check database
+19 1 69740 43 X Check database
+29 1 71898 82 T Examine quality
+29 1 71898 82 X Examine quality
+26 3 75715 84 T Alter relationships
+26 3 75715 84 X Alter relationships
+27 1 79641 17 T Set display parameters
+27 1 79641 17 X Set display parameters
+30 3 80503 7 T Shuffle pads
+30 3 80503 7 X Shuffle pads
+10 2 80866 3 T Clear graphics
+10 2 80866 3 X Clear graphics
+11 2 80931 3 T Clear text
+11 2 80931 3 X Clear text
+12 2 80996 12 T Draw a ruler.
+12 2 80996 12 X Draw a ruler.
+14 2 81730 38 T Reposition plots
+14 2 81730 38 X Reposition plots
+15 2 84069 28 T Label a diagram
+15 2 84069 28 X Label a diagram
+16 2 85174 3 T Display a map
+16 2 85174 3 X Display a map
+7 1 85228 12 T Redirect output
+7 1 85228 12 X Redirect output
+13 2 85731 43 T Use crosshair
+13 2 85731 43 X Use crosshair
+33 2 87876 12 T Plot single contig
+33 2 87876 12 X Plot single contig
+34 2 88578 10 T Plot all contigs
+34 2 88578 10 X Plot all contigs
+31 3 89160 21 T Disassemble readings
+31 3 89160 21 X Disassemble readings
+35 3 90372 94 T Find internal joins
+35 1 90372 94 T Find internal joins
+35 3 90372 94 X Find internal joins
+35 1 90372 94 X Find internal joins
+36 3 96201 30 T Double strand
+36 3 96201 30 X Double strand
+37 3 97555 64 T Auto-select oligos
+37 3 97555 64 X Auto-select oligos
+38 1 100421 30 T Check assembly
+38 1 100421 30 X Check assembly
+39 1 102178 90 T Find read pairs
+39 1 102178 90 X Find read pairs
diff --git a/help/dap_help b/help/dap_help
new file mode 100644
index 0000000..3dd3ea1
--- /dev/null
+++ b/help/dap_help
@@ -0,0 +1,2112 @@
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. TX  0 @Modification
+
+ @0.  TX -1 @SAP
+
+        This is help information for the X  Windows  version  of  SAP.
+  Currently  it  is  being brought up to date with the new features in
+  XDAP.  The accuracy of this help should therefore not be assumed.
+
+        This is an  interactive  program  whose  primary  use  is  for
+  managing  shotgun  sequencing  projects, but it can also be used for
+  handling alignments of other sequences, including those of proteins.
+  Currently   the   maximum  'gel  reading'  length  is  set  to  4096
+  characters. Almost all of the information below describes the use of
+  the  program  for shotgun projects, but those using the programs for
+  handling other sequence alignments should interpret it  accordingly.
+  The data for such a project is stored in a special type of database.
+  The program contains the tools that are  required  to  type  in  gel
+  readings,  screen  them  against  vector  sequences  and restriction
+  sites; enter new  gel  readings  into  the  database  (automatically
+  comparing  and  aligning  them). In addition it contains editors and
+  functions to examine the quality of the aligned sequences.
+
+        There  are  three  main   menus:   "general",   "screen"   and
+  "modification", and some functions have submenus.
+    The general menu contains the following options:
+
+         Open a database
+         Display a contig
+         List a text file
+         Direct output
+         Calculate a consensus
+         Screen against restriction enzymes
+         Screen against vector
+         Check database
+         Copy database
+         Show relationships
+         set parameters
+         Highlight disagreements
+         Examine quality
+         Find internal joins
+
+  The graphics menu contains:
+
+         Clear graphics
+         Clear text
+         Draw ruler
+         Use cross hair
+         Change margins
+         Label diagram
+         Plot map
+         Plot single contig
+         Plot all contigs
+
+
+  The modification menu contains:
+
+         Edit contig
+         Auto assemble
+         Join contigs
+         Complement a contig
+         Alter relationships
+         Extract gel readings
+
+
+  The alter relationships menu contains:
+
+         Cancel
+         Line change
+         Edit single gel reading
+         Delete contig
+         Shift
+         Move gel reading
+         Rename gel reading
+         Break contig
+         Alter raw data parameters
+
+
+
+        Overview of the methodology
+
+        The shotgun sequencing strategy
+
+        In  the  shotgun  sequencing  procedure  the  sequence  to  be
+  determined   is   randomly   broken  into  fragments  of  about  400
+  nucleotides in length. These fragments are cloned and then  selected
+  randomly  and  their  sequences    determined.     The  relationship
+  between  any  pair  of fragments is  not  known  beforehand  but  is
+  found  by  comparing  their   sequences.  If  the  sequence  of  one
+  found to be wholly or partially contained  within  that  of  another
+  for  sufficient  length  to  distinguish  an overlap  from  a repeat
+  then those two fragments can  be  joined.  The  process  of  select,
+  sequence  and  compare is continued until the whole of  the  DNA  to
+  be  sequenced is in one continuous well determined piece.
+
+        Definition of a contig
+
+        A CONTIG is a set of gel  readings   that   are   related   to
+  one  another   by   overlap  of  their  sequences.  All gel readings
+  belong to a contig and each contig  contains  at   least   one   gel
+  reading.   The  gel  readings in a contig can be summed to produce a
+  continuous consensus sequence and the length of this sequence is the
+  length  of the contig.  The rules used to perform this summation are
+  given  under  "the  consensus  algorithm".   At  any  stage  of    a
+  sequencing  project the data will comprise a number of contigs; when
+  a  project  is complete  there  should be only one  contig  and  its
+  consensus  will  be  the  finished  sequence.  Note that since being
+  introduced and defined as above the word "contig" has been taken  up
+  by  those involved in genomic mapping. In that context the consensus
+  with a  precise length is not defined.
+
+  Introduction to the computer method
+
+        It is useful  to  consider  the  objectives  of  a  sequencing
+  project  before  outlining  how  we use the computer to help achieve
+  them. The aim of a shotgun  sequencing  project  is  to  produce  an
+  accurate  consensus sequence from many overlapping gel readings.  It
+  is necessary to know, particularly  at  the  latter  stages  of  the
+  project,  how accurate the consensus sequence is. This enables us to
+  know which regions of the sequence require further work and also  to
+  know  when  the  project  is  finished.   To show the quality of the
+  consensus, the programs described here produce  displays  like  that
+  shown below.
+
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+
+        This is an example showing the left  end  of  a  contig   from
+  position   1  to  200.   Overlapping  this  region  are gel readings
+  numbered 6, 3, 5, 17 and 12; 6, 3 and 5 are in  reverse  orientation
+  to  their  original  reading  (denoted  by  a  minus sign). Each gel
+  reading also has a name (eg HINW.010). It can  be  seen  that  in  a
+  number  of  places the sequences contain characters other than A,C,G
+  and T. Some  of  these  extra  characters  have  been  used  by  the
+  sequencer   to  indicate  regions  of  uncertainty  in  the  initial
+  interpretation of the gel reading, but the asterisks (*)  have  been
+  inserted  by  the  automatic assembly function in order to align the
+  sequences.  Underneath  each  50  character  block  of  gel  reading
+  sequences  is the consensus derived from the sequences aligned above
+  (the line labelled CONSENSUS). For most of its length the  consensus
+  has a definite nucleotide assignment but in a few positions there is
+  insufficient agreement between the gel readings and so  a  dash  (-)
+  appears  in  the  sequence.  This  display contains all the evidence
+  needed to assess the quality of the consensus: the number  of  times
+  the  sequence has been determined on each strand of the DNA, and the
+  individual nucleotide assignments given for each gel reading.
+
+        So the aim is to produce the consensus sequence  and,  equally
+  important,  a  display of the experimental results from which it was
+  derived.
+
+        In order to achieve this the following operations need  to  be
+  performed:
+  1) Put individual  gel  readings  into  the  computer.   This  might
+  involved   the  manual  interpretation  of  autoradiographs  or  the
+  transfer and process  of  machine-readable  files  from  fluorescent
+  sequencing machines.
+  2) Check each gel reading to make sure it is not simply part of  one
+  of the vectors used to clone the sequence.
+  3) Check each gel reading to make sure  that  those  fragments  that
+  span  the  ligation point used prior to sonication are not assembled
+  as single sequences.
+  4) Compare all the  remaining  gel  readings  with  one  another  to
+  assemble them to produce the consensus sequence.
+  5) Check the quality of the consensus and edit the sequences.
+  6) When all the consensus is sufficiently well determined, produce a
+  copy of it for processing by other analysis programs.
+
+        It is very unlikely that this procedure will  only  be  passed
+  through  once.   Usually steps 1 to 5 are cycled through repeatedly,
+  with step 4 just adding new sequences to  those  already  assembled.
+  Generally step 6 is also used in order to analyse imperfect sequence
+  to check if it is the one the project intended to  sequence,  or  to
+  look  for  interesting  features. Analysis of the consensus, such as
+  searches for protein coding regions, can also help to find errors in
+  the  sequence.  The  display  of  the overlapping gel readings shown
+  above can be used  to  indicate,  not  only  the  poorly  determined
+  regions,  but  also  which  clones  should be resequenced to resolve
+  ambiguities, or those which can usefully be extended or sequenced in
+  the reverse direction, to cover difficult regions.
+
+        The original individual gel readings for a sequencing  project
+  are  each  stored in separate files. As the gel readings are entered
+  into the computer (usually in batches, say 10 from a film), the file
+  names  they are given are stored in a further file, called a file of
+  file names. Files of file names enable gel readings to be  processed
+  in batches.
+
+        For each sequencing project we start a project database.  This
+  database  has  a  structure  specifically  designed for dealing with
+  shotgun sequence data. In order to arrive  at  the  final  consensus
+  sequence  many  operations  will  be performed on the sequence data.
+  Individual fragments must be sequenced and compared in  both  senses
+  (i.e.  both  orientations)  with  all  the  other sequences. When an
+  overlap between a new gel reading and a contig are found  they  must
+  be aligned and the new gel reading added to the contig. If a new gel
+  reading overlaps two contigs they must be aligned and joined. Before
+  the  two contigs are joined one of them may need to be turned around
+  (reversed and  complemented)  so  they  are  both  in  in  the  same
+  orientation.
+
+        Clearly, keeping track of all  these  manipulations  is  quite
+  complicated,  and  to  be  able  to  perform  the operations quickly
+  requires careful choice of data structure and algorithms. For  these
+  reasons  it  is not practicable to store the gel readings aligned as
+  shown in the display above. Rather, it is more convenient  to  store
+  the  sequences unassembled, and to record sufficient information for
+  programs to assemble  them  during  processing.  The  data  used  to
+  assemble the sequences is called relational information.
+
+        The database comprises five files and they are described under
+  the section entitled "open database".
+
+        Before entry into the project database each  new  gel  reading
+  must  be  compared  to  look  for overlaps with all the data already
+  contained within the database. This last  point  is  important:  all
+  searching  for  overlaps  is between individual new gel readings and
+  the data already in the database. There is no searching for overlaps
+  between sequences within the database; overlaps must be found before
+  new gel readings are entered into the database.
+
+        Below  I  give  an  introduction  to  how  the  sequences  are
+  processed by being passed from one function to the next.
+
+        This program is used to start a database for the  project  and
+  then the following procedure is used.
+
+        Data in the form of individual gel readings are  entered  into
+  the  computer and stored in separate files using either program this
+  program or the digitizer program. Batches of these gel readings  are
+  passed  to  the  screening  functions  in this program to search for
+  overlaps with vector sequences  ("screen  against  vector")  or  for
+  matches  to  restriction  enzyme  sites   that should not be present
+  ("screen against enzymes"). Each run of  these  screening  functions
+  passes  on  only  those  gel  readings  that do not contain unwanted
+  sequences.  Sequences  are  passed  via  files  of  file  names  and
+  eventually  are  processed by the automatic assembly function ("auto
+  assemble"). This function compares each gel reading with a consensus
+  of  all  the  previous  gel  readings stored in the database.  If it
+  finds any overlaps it aligns the overlapping sequences by  inserting
+  padding  characters,  and  then  adds  the  new  gel  reading to the
+  database. Gels that overlap are added to existing contigs  and  gels
+  that do not overlap any data in the database start new contigs. If a
+  new gel overlaps two contigs they are joined. Any gel readings  that
+  appear  to overlap but which cannot be aligned sufficiently well are
+  not entered and have their names written to a  file  of  failed  gel
+  reading names.
+
+        Generally data is entered into the database in batches as just
+  described.  The  program  is  also  used  to examine the data in the
+  database, to enter gel readings that the automatic assembly function
+  cannot  align  ("auto  assemble"), and to make final edits. Edits to
+  whole contigs can  be made in several ways.  A  mouse-driven  editor
+  ("edit   contig")   is   used   to   perform   all  edits  manually.
+  Disagreements between gel readings in contigs  and  their  consensus
+  sequences  can  be  highlighted  by  use  of the function "highlight
+  disagreements".
+
+        Editing the  sequences  is  obviously  an  essential  part  of
+  managing   a  sequencing  project.  Editing  is  required  when  new
+  sequences are added, when contigs are joined, and when sequences are
+  corrected.   A  basic part of the strategy used here is that new gel
+  readings should be correctly aligned throughout their  whole  length
+  when  they  are entered into the database, and that when contigs are
+  joined they are edited so that they are well aligned in  the  region
+  of  overlap.  Alignment can be achieved by adding padding characters
+  to the sequences, and this is the way "auto assemble" operates  when
+  adding new sequences to the database.
+
+        In order to search for overlaps that may have been missed  due
+  to  errors  in the gel readings, the function "extract gel readings"
+  can be used to take copies of  the  gel  readings  at  the  ends  of
+  contigs,  and  write  them out as separate files.  These can then be
+  compared with the  database  consensus  using  the  "auto  assemble"
+  function in a mode that forbids entry of data into the database, and
+  any gel reading matching two contigs will indicate a join  that  has
+  been  missed.  The  joins can then be made interactively using "join
+  contigs". Missed matches can be found  at  this  stage  because  the
+  errors in the sequences may have been corrected by new data.
+
+        Generally the users need not concern themselves with  how  the
+  relational  information  is used by the program, but it is necessary
+  to know how contigs are identified. Because contigs  are  constantly
+  being  changed  and  reordered  the  program  identifies them by the
+  numbers of the gel readings they contain.  Whenever  users  need  to
+  identify  a  contig they need only know the number or name of one of
+  the gel readings it contains. Whenever the  program  asks  users  to
+  identify  a  contig  or  gel reading they can type its number or its
+  archive name. If they type its archive name they  must  precede  the
+  name by a slash "/" symbol to denote that it is a name rather than a
+  number. E.g if the  archive name is fred.gel with number  99,  users
+  should  type  /fred.gel  or  99  when  asked to identify the contig.
+  Generally, when it asks for the gel reading to  be  identified,  the
+  program  will  offer  the user a default name, and if the user types
+  only return, that contig will be accessed. When a database is opened
+  the  default  contig  will  be  the  longest  one, but if another is
+  accessed, it will subsequently  become the current default.
+
+        Further information is located in the  following  places.  The
+  database  files  are described under "open database". The format for
+  vector  and  consensus  sequences  is  given  under   "calculate   a
+  consensus", as are the uncertainty codes used in gel readings.
+
+        There  are  two  programs,  other  than  this,   relevant   to
+  sequencing  are the digitizer program  and the trace editor program,
+  both is outlined briefly below.
+
+        The digitiser program is used for the  initial  input  of  gel
+  readings  and  for  writing a file of file names. The program uses a
+  digitizer for data  entry.   A  digitizer  is  a   two   dimensional
+  surface  such  as a light box which is such that if a special pen is
+  pressed onto it, the pens coordinates are recorded  by  a  computer.
+  These coordinates can be interpreted by a program.
+
+        In order to read an autoradiograph placed on the light box the
+  user  need  only  define the bottom of the four sequencing lanes and
+  the bases to which they correspond and then use  the  pen  to  point
+  to   each  successive   band  progressing  up  the gel.  The program
+  examines the coordinates of each pen position to see in which of the
+  four  lanes  it   lies  and  assigns  the  corresponding  base to be
+  stored in the computer.  Each time the pen tip is depressed to point
+  to  a  position on  the  surface of the digitizer the program sounds
+  the bell on the terminal to indicate to the user that  a  point  has
+  been recorded.  As the  sequence  is read the program displays it on
+  the screen.
+
+        The trace editor program is used for the initial processing of
+  data  obtained  from  fluorescent sequencing machines. It allows the
+  user to visually select left and right cutoff  positions  to  denote
+  the  start and end of good data. Users may also edit the sequence at
+  this point.  Output from ted is a sequence  file  in  Staden  format
+  with headers that describe to xdap the cutoff information.
+ @17. TX 1 @Screen against enzymes
+
+        Used to compare gel readings against  any  restriction  enzyme
+  recognition  sequences  that  may have been used  during cloning and
+  which should not be  present  in  the  data.  Works  on  single  gel
+  readings  or processes batches accessed through files of file names.
+  The algorithm looks  for  exact  matches  to  recognition  sequences
+  stored in a file.
+
+        The  file  containing  the  recognition  sequences   must   be
+  identified.  The  user  must choose between employing a file of file
+  names, or typing in the names of individual gel reading files. If  a
+  file  of  file names is used the program will also create a new file
+  of file names. When the option has finished operating this new  file
+  will  contain the names of all those gel readings that did not match
+  any of the recognition sequences. Hence it can be used  for  further
+  processing  of the batch. The recognition sequences should be stored
+  in a simple text file with one recognition sequence per line.
+ @18. TX 1 @Screen against vector
+
+        Used to compare gel readings against any vector sequences that
+  may have been picked up during cloning. Works on single gel readings
+  or processes batches accessed  through  files  of  file  names.  The
+  algorithm  looks  for exact matches of length "minimum match length"
+  and displays the overlapping sequences.
+
+        The file containing the vector sequence  must  be  identified.
+  The  user  must  choose  between  employing a file of file names, or
+  typing in the names of individual gel reading files. If  a  file  of
+  file  names  is used the program will also create a new file of file
+  names. When the option has finished operating  this  new  file  will
+  contain  the  names of all those gel readings that did not match the
+  vector sequence. Hence it can be used for further processing of  the
+  batch.  The  vector  sequence should be stored in a simple text file
+  with up to 80 characters of data per line. More than one vector  can
+  be  stored  in  a single file. If so each should be preceded by a 20
+  character title of the form <---m13mp8.001-----> where the <  and  >
+  signs  and  the  number like .001 are obligatory. The number must be
+  preceded by a dot (.) and be 3 digits long. The  total  sequence  in
+  the file must be < 50,001 characters in length.
+ @20. TX 3 @Auto assemble
+
+        Compares gel readings against  the  current  contents  of  the
+  database  and  produces  alignments. In its normal mode of operation
+  ("entry permitted"), the function will automatically enter  the  gel
+  readings  into  the  database, but if entry is not permitted it will
+  only  produce  alignments.  It  works  on  single  gel  readings  or
+  processes  batches  of  gel  readings accessed through files of file
+  names. It is the usual way to enter data into the database.
+
+        The function will check the database for  logical  consistency
+  and  will only proceed if it is OK. Choose if gel readings should be
+  entered into the database, or  if  they  should  only  be  compared.
+  Choose  between  using  a file of file names or typing file names on
+  the keyboard. If so selected, supply the file of  file  names.  Also
+  supply  a  file  of  file  names to contain the names of all the gel
+  readings that fail to get entered. Select  the  entry  mode.  Normal
+  assembly  is  appropriate  for  all but special cases, as is "permit
+  joins". Uses for the other modes are not documented here.  Define  a
+  minimum  initial match length. Define a minimum alignment block (the
+  default value is taken in all but exceptional circumstances). Define
+  the  maximum number of padding characters allowed to be used in each
+  gel reading to help achieve alignment, and the same for  the  number
+  allowed  in  the  contig  for  each  gel reading. Finally define the
+  maximum percentage mismatch to be allowed for any gel reading to  be
+  entered  into  the database. If for any gel reading, either of these
+  last three values is exceeded the gel reading will  not  be  entered
+  into the database.
+
+        In operation the  function  takes  a  batch  of  gel  readings
+  (probably   passed  on   as   a  file  of file names from one of the
+  screening routines) and enters them into a database for a sequencing
+  project.  It takes each  gel reading in  turn, compares  it with the
+  current consensus for the database, it then produces  an   alignment
+  for   any   regions   of   the   consensus   it overlaps;   if  this
+  alignment is sufficiently good  it  then  edits  both  the  new  gel
+  reading  and  the  sequences  it  overlaps   and   adds the new  gel
+  reading to the database.  The program  then  updates  the  consensus
+  accordingly and carries on to the next  gel  reading.
+
+        All alignments are displayed and  any  gel  readings  that  do
+  match but  that cannot be aligned sufficiently well have their names
+  written to a file of failed gel reading names.  The  function  works
+  without   any  user  intervention  and can process any number of gel
+  readings in a single run.  Those  gel  readings  that  fail  can  be
+  recompared  using  the  same  function  (to find the current overlap
+  position) and  the user  can enter them into the  database  manually
+  using  the   "enter new gel reading" option.
+
+        Typical dialogue and output from the function is shown  below.
+  (Note  that  output  for gel readings 2 - 9 has been deleted to save
+  space).
+  Automatic sequence assembler
+  Database is logically consistent
+  ? (y/n) (y) Permit entry
+  ? (y/n) (y) Use file of file names
+  ? File of gel reading names=demo.nam
+  ? File for names of failures=demo.fail
+  Select entry mode
+  X  1 Perform normal shotgun assembly
+     2 Put all sequences in one contig
+     3 Put all sequences in new contigs
+  ? Selection  (1-3) (1) =
+  ? (y/n) (y) Permit joins
+  ? Minimum initial match (12-4097) (15) =
+  ? Minimum alignment block (2-5) (3) =
+  ? Maximum pads per gel (0-25) (8) =
+  ? Maximum pads per gel in contig (0-25) (8) =
+  ? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+    >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+    Processing           1 in batch
+    Gel reading name=HINW.004
+    Gel reading length=   283
+    Searching for overlaps
+    Strand     1
+    Strand     2
+    No matches found
+    Total matches found           1
+    Padding in contig=    0 and in gel=    1
+    Percentage mismatch after alignment =  1.8
+    Best alignment found
+           1         11         21         31         41         51
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           ********** ********** ********** ********** ********** **********
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           1         11         21         31         41         51
+          61         71         81         91        101        111
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+           ********** ********** ********** ********** ********** ***** ****
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+          61         71         81         91        101        111
+         121        131        141        151        161        171
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+           ********** ********** ********** ********** ********** **********
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         121        131        141        151        161        171
+         181        191        201        211        221        231
+           GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+           ********** ********** * ******** ********** ********** **********
+           GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         181        191        201        211        221        231
+         241        251        261        271        281
+           CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+           ****  **** ********** ** * ***** ********** ****
+           CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+         241        251        261        271        281
+            Batch finished
+            9 sequences processed
+            0 sequences entered into database
+            0 joins made
+
+
+        Note that "auto assemble" cannot align protein sequences.
+ @28. TX 1 @Highlight disagreements
+
+        Used  in  the  latter  stages  of  a  project   to   highlight
+  disagreements  between  individual  gel readings and their consensus
+  sequences. Characters that agree with the consensus are shown  as  :
+  symbols  for  the plus strand and . for the minus strand. Characters
+  that disagree with the consensus are left unchanged and so stand out
+  clearly. The results of this analysis are written to a file.
+
+        Before selecting this option create a file of the  display  of
+  the  contig to be "highlighted". The option will ask for the name of
+  this file. Select symbols to denote "agreeing"  characters  on  each
+  strand, the defaults are : and ., but any others can be used. Supply
+  the name of a file in which to put the output.
+
+        The display file needed as input for this option is created by
+  selecting  "Redirect  output",   followed  immediately  by  "display
+  contig", and then "Redirect output" again. The cutoff score used  in
+  the  consensus  calculation  can  be  set  by  option  "set  display
+  parameters". Note that for the highlight function there is  a  limit
+  of  50  for  the  number  of  gel  readings  that are aligned at any
+  position - ie the contig must be less than 51 gel readings  deep  at
+  its  thickest point. I hope that those performing shotgun sequencing
+  never reach this limit, but those using the  program  for  comparing
+  sequence families might.
+
+        Typical output from this function is shown below.
+
+                            210       220       230       240       250
+      1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+      7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+     -4  HINW.017                                 ...............AC....
+                     G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+
+                            260       270       280       290       300
+      1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+      7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+     -4  HINW.017    ..............................................A...
+      3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+     -6  HINW.028                            ......................A...
+                     AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+
+ @32. TX 3 @Extract gel readings
+
+        Used to make copies of the aligned gel readings in a database,
+  to write them into separate files, and to write a corresponding file
+  of file names. It operates in two modes: either all gel readings are
+  extracted, or only those at the ends of contigs.
+
+        Choose which mode of operation is required and supply  a  file
+  of file names.
+
+        The gel readings are given their original names.  If  used  to
+  extract  the  gel  readings from the ends of contigs the function is
+  useful for checking for missed contig joins: the file of file  names
+  can  be  used with the auto assemble function to recompare these gel
+  readings, and each should only overlap one contig. Any that  overlap
+  two contigs will identify possible joins.
+
+        If the option is used to extract all the gel readings  from  a
+  database,  a  subsequent  run  of "auto assemble" can reconstitute a
+  database which has  been  corrupted.  This   rarely  occurs  and  is
+  usually  necessitated  by  a  user   employing "alter relationships"
+  incorrectly without first having made a copy.
+ @1. TX 0 @Help
+
+        Help is available on the following topics :
+ @2. TX 0 @Quit
+
+        This command stops the program and is the  only  safe  way  to
+  terminate  a run of the program that has altered the contents of the
+  database in any way.
+ @3. TX 1 @Open a database
+
+        Opens existing databases or allows new ones to be started. The
+  function  is automatically called into operation when the program is
+  started but can also be selected from the general menu.
+
+        Choose to open an existing database or start a new one, or  if
+  !  is  typed  when  the  program is first started, enter the program
+  without opening a database. Supply a project database name,  and  if
+  it  already exists, the "version". If starting a new database define
+  the database size and if it is for DNA or  protein  sequences.   The
+  database  size  is  an  initial  size  for  the  database. It can be
+  increased later during the project. It is the sum of the  number  of
+  gel readings plus the number of contigs.
+
+        Database names can have from one to 12 letters  and  must  not
+  include  full  stop  (.).  The  database  is made from five separate
+  files. If the database is called FRED then  version  0  of  database
+  FRED  comprises  files  FRED.AR0,  FRED.RL0,  FRED.SQ0, FRED.TG0 and
+  FRED.CC0. The version is the last symbol in the  file  names.   Only
+  this  program can read these files. If the "copy database" option is
+  used it will ask the user to define a new "version".
+
+        For normal use the maximum gel reading length is  set  to  512
+  characters,  but  when  a  database  is  started the user may choose
+  lengths of either 512, 1024, 1536..., 4096. Normally the program  is
+  used  to handle DNA sequences but many of the functions also work on
+  protein sequences. The choice of sequence  type  is  made  when  the
+  database is started.
+
+        The contigs are not stored on the disk as the user  sees  them
+  displayed  on the screen. Each gel reading is stored with sufficient
+  information about how it overlaps other gel  readings  so  that  the
+  program  can  work out how to present them aligned on the screen. We
+  refer to this extra data as "the relationships" and it is  explained
+  below.  The database comprises 5 separate files.
+  1.  a working version of each gel reading.  This is the  version  of
+  the   gel   reading  that  is in the database and initially it is an
+  exact copy of the original sequence (known as the archive) but it is
+  edited and manipulated to align  it with other gel readings.
+  2.  the file of  relationships.   This  file  contains  all  of  the
+  information  that  is required to assemble the working versions into
+  contigs during processing;  any manipulations on the data  use  this
+  file   and  it  is  automatically  updated  at  any  time  that  the
+  relationships are changed.  The  information  in  this  file  is  as
+  follows:
+  (A) Facts about  each   gel  reading   and   its   relationship   to
+  others ("gel descriptor lines"):
+  (a) the number of the gel reading   (each gel reading   is  given  a
+  number  as  it  is entered into the database)
+  (b) the length of the sequence from this gel reading
+  (c) the position of the left end of this gel reading    relative  to
+  the left end of the contig of which it is a member
+  (d) the number of the next gel reading   to the  left  of  this  gel
+  reading
+  (e) the number of the next gel reading   to the right
+  (f) the relative strandedness of this gel reading  , ie whether   it
+  is  in the same sense or the complementary sense as its archive.
+  (B) Facts about each contig ("contig descriptor lines"):
+  (a) the length of this contig
+  (b) the number of the leftmost gel reading   of this contig
+  (c) the number of the rightmost gel reading   of this contig.
+  (C) General facts:
+  (a) the number of gel readings in the database
+  (b) the number of contigs in the database.
+  3.  the file of archive names.  This is simply a list of  the  names
+  of each of the archive files in the database but on line number 1000
+  we also store the size of the database. ie the number  of  lines  of
+  information allowed in the database files. This file always has 1000
+  lines but the length of the file of relationships and  the  file  of
+  working  versions can be set by the user when creating a database or
+  when copying from one to another.
+  4. the file of tags (annotation). This consists of linked  lists  of
+  tag  information  for  each  sequences  in  the  database.  Tags are
+  created by the user as annotation, or by xdap as records of edits or
+  for  storing  cutoff  information.   As  the number of tags can grow
+  without limit, so can this file.  For each gel  there  is  a  header
+  record,  which contains the record number of the start of the linked
+  list for that gel. On line  IDBSIZ  there  is  a  record  containing
+  information  about  the file such as its present length and if there
+  are any free "tag" slots to be reused in the file.  5. the  file  of
+  comments  (annotation).   This  consists  of linked lists of comment
+  fragments.  Comments are created by the user as a  message  attached
+  to  annotation,  or  by  the  system  to  store  cutoff information.
+  Comments are character strings of any length.  Comments longer  than
+  40 characters are broken up into fragments, each 40 characters long,
+  and are chained together in a link list.  As the number of  comments
+  can grow without limit, so can this file.
+
+        Structure of the database files
+
+        1.  The file of relationships
+
+        The file contains IDBSIZ lines of data:  the general data  are
+  stored  on line IDBSIZ;   data about  gel readings  are stored  from
+  line 1 downwards;  data about contigs are stored from line  IDBSIZ-1
+  upwards.  A  database  of 500 lines containing 25 gel readings and 4
+  contigs would have a file of relationships as is shown below.
+
+
+                    ---------------------------------------------
+                       1  Gel descriptor record
+                       2   "      "       "
+                       3   "      "       "
+                       4   "      "       "
+                       5   "      "       "
+                       '   '      '       '
+                       '   '      '       '
+                      25   "      "       "
+                      26  Empty record
+                       '    '     '
+
+                       '    '     '
+                     495    '     '
+                     496  Contig descriptor record
+                     497    "        "        "
+                     498    "        "        "
+                     499    "        "        "
+                     500   Number of gel readings=25, Number of contigs=4
+                    ---------------------------------------------
+
+            The arrangement of the data in the file of relationships
+
+  As each new gel reading   is added into the database a new  line  is
+  added to  the  end  of  the  list  of gel descriptor lines.  If this
+  new gel  reading  does not overlap with any gel readings already  in
+  the  database  a new contig  line  is added  to  the top of the list
+  of contig lines.  If it overlaps with one contig then no new  contig
+  line  need  be  added  but  if it  overlaps with  two  contigs  then
+  these  two  contigs must be joined and the number  of  contig  lines
+  will  be reduced by one. Then the list of contig lines is compressed
+  to  leave  the empty line at the top of the list.  Initially the two
+  types  of  line will move towards  one  another  but eventually,  as
+  contigs  are joined, the contig descriptor lines will  move  in  the
+  same  direction  as the  gel descriptor lines.   At  the  end  of  a
+  project  there should  be only one contig  line.   The  database  is
+  thus capable of handling a project of 998 gels.
+
+        2.  Structure of the working versions file
+
+        The working versions of gel readings are stored  in   a   file
+  of  IDBSIZ lines each containing 512 characters.  Gel reading number
+  1 is stored on line 1, gel reading number  2 on line 2 and so on.
+
+        3.  Structure of the archive names file
+
+        This file, unlike the others, always has 1000  lines  each  10
+  characters  in length. Its length is fixed because line 1000 is used
+  to store IDBSIZ the database size and the programs need  a  definite
+  location from which to read this number.
+
+        4.  Structure of the tag file
+
+        This file initially starts with IDBSIZ lines, and is  expanded
+  as  new tags are created.  Information about the length of the file,
+  and which tag records are reusable is  stored  on  line  IDBSIZ.   A
+  database of 500 lines would have a file of tags as shown below.
+
+                    ---------------------------------------------
+                       1  Tag descriptor record
+                       2   "      "       "
+                       3   "      "       "
+                       4   "      "       "
+                       5   "      "       "
+                       '   '      '       '
+                       '   '      '       '
+                     497   "      "       "
+                     498   "      "       "
+                     499   "      "       "
+                     500   Length of file=N, Free list=0
+                     501  Tag record
+                     502   "   "
+                     503   "   "
+                       '   '   '
+                       '   '   '
+                     N-2   "   "
+                     N-1   "   "
+                       N  Tag record
+                    ---------------------------------------------
+
+            The arrangement of the data in the file of relationships
+
+  As each new tag is added to the database, a check  is  made  in  the
+  file  descriptor  record  at  line  IDBSIZ.  If the list of reusable
+  records is 0, the file is extended by one line.  Otherwise  the  new
+  tag  is  assigned  to record at the head of the freelist.  When tags
+  are deleted, they are added to the free list in the file  descriptor
+  record.
+
+        5.  Structure of the comment file
+
+        This file initially starts with 1 line, and is expanded as new
+  annotation  is  created.   Information about the length of the file,
+  and which comment records are reusable is stored on the first line.
+
+                    ---------------------------------------------
+                       1  Length of file=N, Free list=0
+                       2  Comment fragment
+                       3   "       "
+                       4   "       "
+                       '   '       '
+                       '   '       '
+                     N-2   "       "
+                     N-1   "       "
+                       N  Comment fragment
+                    ---------------------------------------------
+
+            The arrangement of the data in the file of relationships
+
+  As each new comment is added to the database, a check is made in the
+  file descriptor record at line 1. If the list of reusable records is
+  0, the file is extended to hold the new comment. Otherwise  the  new
+  comments  is  assigned  to  records  starting  with  the head of the
+  freelist.  When comments are  deleted,  the  discarded  records  are
+  added to the free list in the file descriptor record.
+
+        There  are  various  checks  within  the  programs  to protect
+  users from themselves:-
+  1.  All user input is checked for  errors  -  e.g.    reference   to
+  non-existent  gel readings or  contigs,  incorrect  positions in the
+  contig or gel readings.
+  2.  Before entering a gel reading the system checks to see if a file
+  of the same name has already been entered.
+  3.  Join will not allow the circularising of a contig.
+  4.        Both enter and join  functions  restrict  the  region that
+  the   user  is  allowed to edit (using edit contig) to the region of
+  overlap.
+  5. Users may escape from any point in the program.
+  6. Help is available from all points in the program.
+
+
+  IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS DOING
+  ANYTHING  THAT  INVOLVES  CHANGING THE CONTENTS OF THE DATABASE. I.E
+  DURING AUTO ASSEMBLE,  COMPLETE  ENTRY,  COMPLETE  JOIN,  COMPLEMENT
+  CONTIG,  EDIT  CONTIG,  AND  SCREEN  EDIT.   This  could corrupt the
+  database so badly that it is impossible to fix. The  program  should
+  always be left using the QUIT option.
+ @4. TX 3 @Edit contig
+
+        The Contig Editor is a mouse-driven editor  that  can  insert,
+  delete and change gel reading sequences.
+
+        The Contig Editor allows scrolling from one end of a contig to
+  the  other  using the scroll bar and scroll buttons. Action of mouse
+  button presses when the mouse pointer is in the scroll bar:
+
+      Middle Mouse Button      Set editor position
+      Left   Mouse Button      Scroll forward one screenful
+      Right  Mouse Button      Scroll backwards one screenful
+
+  The four scroll buttons operate as follows:
+
+      "<<"                     Scroll left half a screenful
+      "<"                      Scroll left one character
+      ">"                      Scroll right one character
+      ">>"                     Scroll right half a screenful
+
+        The Editor cursor can  be  positioned  anywhere  in  the  edit
+  window  by  moving the mouse pointer over the character of interest,
+  then pressing the left mouse button. The Editor cursor can  also  be
+  moved by using the direction arrow keys.
+
+        The editor operates in two  main  edit  modes  -  Replace  and
+  Insert. Replace allows a character to be replaced by another. Insert
+  allows characters to  be  inserted  into  a  gel  reading  sequence.
+  Characters  are entered by typing them from the keyboard. Only valid
+  characters are permitted.  Characters can be deleted by  positioning
+  the cursor one character to the right, then pressing the delete key.
+  Normally Insert and Delete apply to the consensus line of the contig
+  ONLY.  This  restraint  can  be overridden by using the "Super Edit"
+  mode of operation, THOUGH IT IS NOT RECOMMENDED.
+
+        Edits can also be performed on the consensus, though they  are
+  restricted  to  insertion  and deletion of padding characters ("*").
+  These edits also have special meanings.  A deletion will delete  ALL
+  characters  at the position to the left of the cursor in the contig,
+  and move the relative positions of all  sequences  starting  to  the
+  right  of the cursor position left one character.  An insertion will
+  insert the character typed ("*") into ALL gel reading  sequences  at
+  the  cursors position in the contig, and move the relative positions
+  of all sequences starting to the right of the cursor position  right
+  one character.
+
+        The effect of the last edit can  be  undone  by  pressing  the
+  "Undo" button at the top of the editor window.
+
+        The cursor  will  automatically  be  positioned  at  the  next
+  problem  when  the  "Find Next Problem" button is selected. The next
+  problem is where the consensus shows either an ambiguity ("-") or  a
+  pad ("*") character.
+
+        The edits to the contig can be saved by  pressing  the  "Leave
+  Editor"  button and replying "Yes" to the prompt to "Save changes?".
+  As no changes are made to the working copy of your database til this
+  point it is possible to abort the editor if the edit session ends up
+  in an unsatisfactory state (ie if you've stuffed it up!)
+
+
+
+ Displaying Traces
+
+        The original data from which the gel reading  sequences  where
+  derived  can  be seen by double clicking (two quick clicks) with the
+  middle mouse button on the area  of  interest.  The  trace  will  be
+  displayed  with  the  point  clicked  at  the  centre  of  the trace
+  viewport.
+
+        All traces that are displayed are maintained  in  one  window,
+  called  the  Trace Manager. The Trace Manager will only display four
+  traces maximum. When four traces are already being managed and a new
+  one is requested, the one at the top of the Trace Manager is removed
+  and the new one is added to  the  bottom.   Traces  can  be  removed
+  individually  by  using  the  "quit" button in the panel next to the
+  trace.
+
+
+
+ Extending Reads Using Cutoff Information
+
+        Sequence data read in from  Automated  Fluorescent  sequencing
+  machines trace files processed through the program ted will have the
+  discarded sequence (vector at start and poor read at end)  available
+  to  the  contig editor. To display the cutoff information, press the
+  "Display Cutoff" button at the top of the editor window.  The cutoff
+  sequence appears in grey. This sequence can be incorporated into the
+  editable sequence, by moving the cutoff position. This  is  done  by
+  positioning  the  cursor  at  the end of the gel sequence, and using
+  Meta-Left-Arrow and Meta-Right-Arrow to adjust the point of  cutoff.
+  The Meta key is a diamond on the Sun keyboard.
+
+
+
+ Pop-up menu
+
+        A pop-up menu is revealed by depressing the "Control"  key  on
+  the  keyboard  and  at the same time pressing the left mouse button.
+  The menu has the following functions:
+
+      Search
+      Save Contig
+      Create Tag
+      Edit Tag
+      Delete Tag
+
+  "Save Contig" is described above.  Searching and operations on  tags
+  are described below.
+
+
+
+ Searching
+
+        Selecting "Search" brings up a window which can remain present
+  during normal editor operation. The window allows the user to select
+  the direction of search, the type of search and a  value  to  search
+  on.   The value is entered into the value text window. Then pressing
+  the "search" button performs the search. If successful,  the  cursor
+  is  positioned  and  centred  accordingly. An audible tone indicates
+  failure.  Pressing the "ok" button removes the  search  window.  The
+  search  window  is  automatically  removed when the contig editor is
+  exited.
+
+  There are seven different search modes:
+
+  1. Search by position
+
+  This positions the cursor at the numeric position specified  in  the
+  value  text  window.  Eg  a  value of "1234" causes the cursor to be
+  placed at base number 1234 in the contig. Positioning withing a  gel
+  reading  is achieved by prefixing the number with the "@" character,
+  eg "@123" positions the cursor at base 123 of the sequence in  which
+  the  cursor  lies.  Relative positions can be specified by prefixing
+  the number with a plus or minus character. Eg "+1234"  will  advance
+  the  cursor 1234 bases. If possible, the cursor is positioned within
+  the same sequence.  The direction buttons  have  no  effect  on  the
+  operation of "search by position".
+
+  2. Search by reading name
+
+  This positions the cursor  at  the  left  end  of  the  gel  reading
+  specified  in the value text window. If the value is prefixed with a
+  slash is is assumed to be  a  gel  reading  name.  Otherwise  it  is
+  assumed to be a gel reading number. Eg "123" positions the cursor at
+  the left end of gel reading number 123.  "/a16a12.s1"  positions  at
+  the  start  of reading a16a12.s1. If the value was "/a16" the cursor
+  is positioned at the first reading which  starts  with  "a16".   The
+  direction  buttons  have  no  effect  on the operation of "search by
+  position".
+
+  3. Search by tag type.
+
+  This positions the cursor at the start of the next tag which has the
+  the  same  type  as  specified by the type value menu. To change the
+  type, select off the menu that pops up when the mouse is clicked  on
+  the  button  labeled  "Type:".  The  search  can be performed either
+  forwards or backwards of the current cursor position.  To  find  all
+  tags, use "search by annotation", with a null text value string.
+
+  4. Search by annotation.
+
+  This positions the cursor at the start of the next tag which  has  a
+  comment  containing  the  string specified in the value text window.
+  The search performed is a regular  expression  search,  and  certain
+  characters  have  special meaning. Be careful when your value string
+  contains ".", "*", "[", "^" or "$".  The  search  can  be  performed
+  either forwards or backwards from the current cursor position.
+
+  5. Search by sequence.
+
+  This positions the cursor at the start of the next piece of sequence
+  that  matches  the  value  specified  in  the text value window. The
+  search is for an exact match, which means the case of  value  string
+  is   important.   The  search  is  performed  on  the  gel  readings
+  themselves, rather than the consensus sequence. The  search  can  be
+  performed  either  forwards  or  backwards  from  the current cursor
+  position.
+
+  6. Search by problem.
+
+  This positions the  cursor  at  the  next  place  in  the  consensus
+  sequence  which  is  not  an "A", "C", "G" or "T". The search can be
+  performed either forwards  or  backwards  from  the  current  cursor
+  position.
+
+  7. Search by quality
+
+  This positions the  cursor  at  the  next  place  in  the  consensus
+  sequence  where the consensus calculation for each strand disagrees.
+  When only sequences on one strand is present, the search  will  stop
+  at  every  base.  The  search  can  be  performed either forwards or
+  backwards from the current cursor position.
+
+
+
+ Annotation
+
+        Parts of a sequence can be annotated, to record the  positions
+  of  primers used for walking, or to mark sites, such as compressions
+  that have caused problems during sequencing.  The consensus sequence
+  CANNOT be annotated.
+
+        To annotate a piece of  sequence  first  select  the  part  of
+  sequence  using  the  mouse  buttons.  Use  the left mouse button to
+  position the start of the selection, and while this button is  being
+  held  down, move the mouse to extend.  The selection can be extended
+  further using the right mouse button.
+
+        To create annotation, invoke the pop-up menu, and  select  the
+  "Create Tag" function. A small "tag editor" will appear which allows
+  you to select the type of the annotation from a pull-down menu,  and
+  specify  a  comment  if desired.  To select a new type pull down the
+  Type menu, and select the entry desired.  To enter a comment, simply
+  type  into  the  text  window  in the tag editor.  The annotation is
+  created when the "Leave" button on the tag editor, and is  displayed
+  in the colour defined in the tag database file (TAGDB).
+
+        To edit existing annotation, position the cursor with the left
+  mouse  button  on  the tag, and select the "Edit Tag" off the pop-up
+  menu.  This invokes the tag editor, and  changes  to  the  type  and
+  comment  of  the annotation can be made. The tag is updated when the
+  "Leave" button is pressed.
+
+        To delete an existing annotation, position the cursor with the
+  left  mouse  button  on the tag, and select the "Delete Tag" off the
+  pop-up menu.
+
+
+
+ NOTE:
+
+        As the Contig Editor is a very powerful tool, it  is  possible
+  that  the  alignment  of  the gel reading sequences has unexpectedly
+  been disrupted.  This can easily happen to parts of the contig  that
+  lie to the right of the screen if excessive use has been made of the
+  "Super Edit" facility.  Until familiar with "Super  Edit"  it  would
+  benefit  the  sequencer  to  quickly  scan  through the contig after
+  editing to check that bad alignments have not been created.
+ @9. T 3 @Screen edit
+
+        THIS OPTION IS NO LONGER AVAILABLE IN XDAP. USE EDIT CONTIG
+
+        Gives access to the system editor on the machine (for  example
+  EDT  on  a  VAX)  and  allows users to edit contigs. The contigs are
+  presented as for "display contig" and the program will  reconstitute
+  the contig's sequences and relationships  when the editor is exited.
+
+        To screen edit a contig set the line length to 50  characters,
+  select  the  contig to edit, and supply the name of a temporary file
+  in which the editing will be performed.  After  a  short  pause  the
+  system editor will present the first page of the file. Edit the file
+  obeying the rules given below. Exit from the editor and  affirm  the
+  intention  of returning the contig to the database. The program will
+  put the contig back into the database.
+
+        Rules for screen editing
+
+        There are some limitations on the changes that can be made  to
+  the contigs when using the screen editor. Users are unlikely to want
+  to break the rules in order  to  achieve  changes  to  contigs,  but
+  nevertheless  the  constraints need to be defined and they are given
+  below.
+
+        Alignments must be maintained during editing.  Whole lines  of
+  sequence  should not be deleted or added unless the order of the gel
+  readings in the contig  is  preserved.   Each  line  in  the  contig
+  display  consists  of  gel  reading  numbers,  their  names  and  50
+  character sections  of  sequence.  Insertions  are  limited  in  the
+  following  way.  No line of sequence can be extended rightwards more
+  than 10 characters beyond the end of a  full  length  line  (a  full
+  length  line is 50 characters long). Only one character can be added
+  to the left end of full  length  lines,  but  sections  of  sequence
+  beginning  further  into  a  line can be extended leftwards up to an
+  equivalent position. Do not delete any  non-sequence  lines  in  the
+  file.
+
+        Before returning the contig to the database the program checks
+  that  the rules have been obeyed. If an error is found the number of
+  the erroneous line in the file is displayed and the contig will  not
+  be changed.
+ @5. TX 1 @Display a contig
+
+        Used to show the aligned  gel  readings  for  any  part  of  a
+  contig.  The  number,  name  and strandedness of each gel reading is
+  shown and the consensus is written below.
+
+        If required identify the contig,  and then the start  and  end
+  points of the region to display.
+
+        The display can be directed  to  a  disk  file  using  "direct
+  output to disk".  These files are required by options: "screen edit"
+  and "highlight disagreements", and printed copies of them  are  very
+  useful for marking corrections prior to using the editors.
+
+        Below is an example showing the left  end  of  a  contig  from
+  position   1  to  200.  Overlapping this region are gels 6,3,5,17and
+  12; 6, 3 and 5 are in reverse orientation to their archives (denoted
+  by  a  minus   sign)  There  are  a  few uncertainty codes and a few
+  padding characters  in  the  working  versions,  but  the  consensus
+  (shown  below  each page width) has a definite assignment for almost
+  every position.
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+ @6. TX 1 @List a text file
+
+        This option allows users to list text files on the screen.  It
+  can  be  used  to  read  a file containing notes, for checking files
+  written to disk etc. The user is asked to type the name of the  file
+  to list.
+ @8. TX 1 @Calculate a consensus
+
+        Calculates  a  consensus  sequence   either  for   the   whole
+  database or for selected contigs. The consensus is written to a file
+  named by the user.
+  Supply a file name,  choose  between   whole  database  or  selected
+  contigs.
+
+        Symbols for uncertainty in gel readings
+
+        In  order  to  record  uncertainties  when  reading  gels  the
+  codes  shown  below can  be  used. Use  of these codes permits us to
+  extract the maximum amount of data from each gel and yet record  any
+  doubts   by  choice   of   code.    The program can deal with all of
+  these codes and any other  characters  in  a  sequence  are  treated
+  as  dash  (-) characters.
+
+         SYMBOL                  MEANING
+
+           1             PROBABLY        C
+           2                "            T
+           3                "            A
+           4                "            G
+           D                "            C       POSSIBLY        CC
+           V                "            T          "            TT
+           B                "            A          "            AA
+           H                "            G          "            GG
+           K                "            C          "            C-
+           L                "            T          "            T-
+           M                "            A          "            A-
+           N                "            G          "            G-
+           R             A OR G
+           Y             C OR T
+           5             A OR C
+           6             G OR T
+           7             A OR T
+           8             G OR C
+           -             A OR G OR C OR T
+           a             A set by auto edit
+           c             C set by auto edit
+           g             G set by auto edit
+           t             T set by auto edit
+           *             padding character placed by auto assembler
+            else = -
+
+  The DNA consensus algorithm
+
+        The "calculate  consensus"  function,  the  "display   contig"
+  routine and the "show quality" option use  the rules  outlined  here
+  to  calculate  a consensus  from aligned gel  readings.   Note  that
+  "display  contig"  calculates a consensus for  each  page  width  it
+  displays  (it  does  not use the consensus sequence file  calculated
+  by the consensus function).
+
+        We  have  6  possible  symbols  in  the  consensus   sequence:
+  A,C,G,T,*  and -. The last symbols is assigned if none of the others
+  makes up a sufficient proportion of the aligned  characters  at  any
+  position  in the contig. The following calculation is used to decide
+  which symbol to place in the consensus at each position.
+
+        Each uncertainty code contributes a score to one of  A,C,G,T,*
+  and  also  to  the  total  at each point. Symbols like R and Y which
+  don't correspond to a single base type contribute only to the  total
+  at each point. The scores are shown below.
+                definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+                probable assignments ie 1,2,3,4 = 0.75
+
+                other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+
+        A cutoff score of 51% to 100% is supplied by the  user.  (When
+  the   program   starts   this  is  set  to  75%.  See  "set  display
+  parameters").  At each position in the contig we calculate the total
+  score  for  each of the 5 symbols A,C,G,T and * (denote these by Xi,
+  where i=A,C,G,T or *), and also the sum of these totals (denote this
+  by S). Then if 100 Xi / S > the cutoff for any i, symbol i is placed
+  in the consensus; otherwise - is assigned.
+
+        Notice that S does not equal the number of times the  sequence
+  has  been  determined, but is the score total, and hence we are less
+  likely to put a -  in  the  consensus.  For  the  "examine  quality"
+  algorithm  each  strand is treated separately but the calculation is
+  the same. (It was originally different).
+
+        Format of the consensus sequence ( and vector sequences).
+
+        A consensus  sequence  file  may  contain  the  consensus  for
+  several contigs and so we identify each of them by preceding them by
+  a 20 character title. The title is of the form  <---LAMBDA.076----->
+  (  where LAMBDA is the project name and gel reading number 76 is the
+  leftmost gel reading to contribute to  this   consensus   sequence).
+  The   angle  brackets  <>  and the three digit number precede by a .
+  are important to some processing programs.
+ @25. TX 1 @Show relationships
+
+        Used to show the relationships of  the  gel  readings  in  the
+  database in three ways -
+  (a) All contig descriptor lines  followed  by  all  gel   descriptor
+  lines.
+  (b) All contigs one after the   other   sorted,   i.e.    for   each
+  contig   show  its   contig  descriptor line followed by all its gel
+  descriptor lines sorted on position from left to right
+  (c) Selected contigs:  show the contig  line  and,  in  order, those
+  gel  readings  that  cover  a  user-defined  region.  Note that this
+  output can be directed to a disk file by prior  selection  of  "disk
+  output".
+
+        Below is an example showing a contig from position 1  to  689.
+  The left gel reading  is number 6 and has archive name HINW.010, the
+  rightmost gel  reading is number 2 and is has archive name HINW.004.
+  On  each  gel  descriptor  line  is  shown:  the name of the archive
+  version, the gel number, the position of the left  end  of  the  gel
+  reading  relative to the left  end  of  the  contig,  the length  of
+  the gel reading  (if this is negative it means that the gel  reading
+  is  in  the  opposite orientation to its archive), the number of the
+  gel reading   to the left and the number of the gel reading  to  the
+  right.
+
+
+   CONTIG LINES
+   CONTIG      LINE  LENGTH               ENDS
+                                       LEFT   RIGHT
+                 48     689               6       2
+   GEL LINES
+   NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                       LEFT   RIGHT
+   HINW.010       6        1   -279       0       3
+   HINW.007       3       91   -265       6       5
+   HINW.009       5      137   -299       3      17
+   HINW.999      17      140    273       5      12
+   HINW.017      12      193    265      17      18
+   HINW.031      18      385   -245      12       2
+   HINW.004       2      401   -289      18       0
+
+ @21.  TX 3 @Enter new gel reading
+
+        THIS OPTION IS NO LONGER AVAILABLE IN XDAP. USE AUTO ASSEMBLE
+
+        Used to enter new gel readings into the database. The new  gel
+  reading  must have previously been compared with the contents of the
+  database by use of " auto assemble"  in order  to  ascertain  if  it
+  overlaps any previously entered data.
+
+        The user is expected to know: if the gel reading overlaps;  if
+  so  which  contig  it overlaps; if so where it overlaps. The program
+  takes the user through a series of question to establish the  nature
+  of  the  overlap  and  then  displays  the overlap. The user is then
+  offered a number of options,  including  editors  for  the  new  gel
+  reading  and  the contig, to enable the correct alignment of the gel
+  reading throughout its whole length.
+  Supply the name of the gel reading file.  If the  gel  reading   has
+  been  entered before the program will  not permit entry. The program
+  gives the gel reading a unique  number  and  asks  if  the  sequence
+  overlaps  any  data  already  in  the  database  (reported  by "auto
+  assemble").  If it does not, entry is complete.  If it does  overlap
+  the  dialogue  continues with the program asking if the gel readings
+  overlaps "in  the  normal  sense",  if  not  it  will  automatically
+  complement  the  sequence.  Then supply the number of the contig the
+  gel reading overlaps (as reported by "auto assemble").
+
+        Overlaps are divided into two types: those for which  the  new
+  gel  reading  protrudes from the left end of the contig it overlaps,
+  and those for which it does not. The program asks  about  this  with
+  the  question "Left end of gel reading is inside contig". If this is
+  true the program will go on to ask for the position in the contig of
+  the  left  end of the new gel reading. If it is not true the program
+  will ask for the position in the new gel reading of the left end  of
+  the contig.
+
+        Once this is completed the program will display the  first  50
+  bases  of  the  overlap.  The  gel  readings in the contig and their
+  consensus are displayed with the new  gel  reading  underneath.  The
+  mismatches are shown by *'s on the next line down. For example:
+
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACACGTC
+         NEWGEL      CACAAGCGAGCGAGAGGGGCACCGTGACGTGGTCACGCCGGGGACACGTC
+         MISMATCH                  *                         * *
+                             10        20        30        40        50
+
+
+        The program then needs to know if the position  of  the   left
+  end  of  the  overlap  is  correct.   If  it is the user should type
+  return, if not, 1 and the program will ask for the new position  and
+  display it.
+  The program now offers a number of  options  to  allow  the user  to
+  align  the  new gel reading correctly over its whole length with the
+  data   already   in   the   contig.    It   is    important     that
+  sufficient   edits   are   made   to  the  new  gel  reading  or the
+  sequences in the contig at this stage to get the alignment  correct,
+  because  once entry  is completed, the alignment is fixed and cannot
+  easily be changed (see "alter relationships").  Alignment   can   be
+  achieved  by   making  insertions   or  deletions  but  deletion  of
+  data requires the original gels to be checked.   For   this   reason
+  at  entry  we usually make only insertions to achieve alignment.  We
+  use X or asterisks (*) as padding characters  to  achieve  alignment
+  and  so   can,  if  required, distinguish  padding  characters  from
+  characters assigned from reading gels.
+
+        The options available are:
+     ? = HELP
+     ! = Give up
+     3 = Complete entry
+     4 = Edit contig
+     5 = Display overlap
+     6 = Edit new gel reading
+
+
+
+        1. HELP gives this information.
+
+        2. Give up allows users to change their minds  about  entering
+  the  new  gel reading. The program will ask the user to confirm this
+  choice.
+
+        3. Complete entry is the command to add the new gel reading to
+  the  contig.  The program updates the relationships accordingly. The
+  user is asked to confirm this command.
+
+        4. Edit contig gives the user access to a simple  editor  that
+  allows  insertions,  deletions and changes to be made to the contig.
+  The editor  maintains  alignments  by  making  the  same  number  of
+  insertions or deletions in all sequences covering the edit position.
+  The  program protects the  user           by   allowing  edits  only
+  within the region of overlap.
+
+        5. Display allows display of the region of overlap only.  This
+  is  defined  by the relative positions in the contig. The default is
+  the whole of the region of overlap.
+
+        6. Edit new gel reading allows  the  new  gel  reading  to  be
+  edited using a simple editor.
+ @23. TX 3 @         Complement a contig
+
+        This function will complement  and  reverse  all  of  the  gel
+  readings   in    a  contig.     It    automatically   reverses   and
+  complements  each  gel reading sequence,  reorders  left  and  right
+  neighbours,   recalculates   relative  positions  and  changes  each
+  strandedness.
+
+        The only user  input  required  is  to  identify  the   contig
+  to complement  by  the  number or name of a gel reading it contains.
+  DO NOT KILL THE PROGRAM DURING THIS STEP!
+ @22. TX 3 @          Join contigs
+
+        This function joins contigs interactively using a mouse driven
+  editor.   The operation of this editor is very similar to the Contig
+  Editor described in "@4 Edit".
+
+        It allows the user  to align the ends of the  two  contigs  by
+  editing  each contig separately.  It is important that the alignment
+  achieved   is  correct  because  once  the  join  is  completed  the
+  alignment  is fixed.  The program needs to know which two contigs to
+  join.
+
+        First specify which two contigs are to be  joined.   The  user
+  should  identify the two contigs. First the left contig and then the
+  right.  The program checks that the two contig numbers are different
+  (it will not allow circles to be formed!)
+
+        The Join Editor consists of  two  Contig  Editors  in  between
+  which  is sandwiched a disagreement box. This disagreement box shows
+  exclamation marks to denote mismatches between the two consensuses.
+
+        For example, the display will look something like this:
+
+                           1460      1470      1480      1490      1500
+     56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+     33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+    -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+     19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+         MISMATCH                         !                      !!!!!!
+                             10        20        30        40        50
+     -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+     -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+     -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+
+
+
+        The best  strategy  for  joining  is  to  identify  the  exact
+  position  of  overlap.  This  is defined as the position in the left
+  contig that the leftmost character of  the  right  contig  overlaps.
+  The  overlap  must be of at least one character.  Use the scroll bar
+  and the scroll buttons (`<<',`<',`>',and`>>')  for  positioning  the
+  relative positions of the two contigs.
+
+        The join position can be fixed in  position  by  pressing  the
+  `lock' button at the top of the Join Editor.  Locking allows the two
+  contigs to be scrolled as one when using the scroll bar and buttons,
+  the left ends always in the same position relative to each other.
+
+        Once locked, it is best to proceed  to  the  right  along  the
+  contigs,  inserting padding characters (`*') into the consensuses to
+  minimise the disagreements.
+
+        It  is  essential  that  the  user  aligns  the  two   contigs
+  throughout  the  whole  region of overlap before completing the join
+  because it is only at this stage that the two contigs can be  edited
+  independently.  Once the join is completed the alignment can only be
+  altered using the routines supplied by "alter relationships".
+
+        The join can be  completed  by  pressing  the  `Leave  Editor'
+  button.  The  percentage  mismatch  is  displayed,  and  the user is
+  required to confirm that they want to perform the join.
+ @24. TX 1 @               Copy the database
+
+        Used to make a copy of the database. If required the  database
+  size  can  be altered using this option. The "version" of a database
+  is  encoded as the last letter in the names of the five  files  that
+  contain the database.
+
+        Supply a "version" number (the default is version 1),  and  if
+  required  select a new size for the database. The size of a database
+  is the number of lines of information it can hold. It needs  a  line
+  for each gel reading and another for each contig.
+ @19. TX 1 @               Check database
+
+        Used to perform a check on  the  logical  consistency  of  the
+  database. No user intervention is required.
+
+        The following relationships are checked:
+  1.       If gel reading A thinks gel reading B is its left neighbour
+  does B think A  is its right neighbour?  The error message is
+  "Hand holding problem for gel reading A"
+  followed by  the gel descriptor lines for gel readings A and B.
+  2.       Are there any contig lines with no left or  right  end  gel
+  readings?  The error message is
+  "Bad contig line number A"
+  3.       Do the gel readings that are  described  as  left  ends  on
+  contig lines agree that they are left ends?  The error message is
+  "The end gel readings of contig A have outward neighbours"
+  4.       Are there gel readings that are in more  than  one  contig?
+  The error message is
+  " Gel number A is used N times"
+  5.       Are there gel readings that are not  in  any  contig?   The
+  error message is
+  " Gel number A is not used"
+  6.       Do the relative positions of   gel  readings   agree   with
+  their  position  as  defined by left and right neighbourliness?  The
+  error message is
+  " Gel number A with position X is left neighbour of  gel  number   B
+  with position Y"
+  7.       Are there any loops in   contigs?    If   so   no   further
+  checking is done.  The error message is
+  " Loop in contig n no further checking done, but gel reading numbers
+  follow"
+  The program  then  prints the gel  reading  numbers  in  the  looped
+  contig up to the start of the loop.
+  8. Are there any contigs of length <1? The error message is
+  " The contig on line number x has zero length"
+  9. Are there any gel readings (used in only one  contig)  that  have
+  zero length? The error message is
+  " Gel number N has zero length"
+  Note that "auto assemble"  also uses this logical consistency  check
+  and will only tolerate a "Gel number N is not used" error. Any other
+  error will cause it to give up.
+ @29. TX 1 @               Examine quality
+
+        Analyses the quality of the data in a contig.  It  reports  on
+  the  proportion  of the consensus that is "well determined" and will
+  display a sequence of symbols  that  indicate  the  quality  of  the
+  consensus at each position.
+
+        Identify the contig to analyse, and the section  of  interest.
+  The  current  consensus  calculation  cutoff  score  will be used to
+  decide if each position is "well determined". In general the quality
+  of  a  reading deteriorates along the length of the gel and so it is
+  also possible to use a length cutoff for  the  quality  calculation.
+  Only  the  data  from  the  first  section  of  each reading will be
+  included in the quality calcualtion. The  length  is  altered  under
+  "set parameters" and is initially set to the maximum reading length.
+  A summary showing the percentage of the consensus  that  falls  into
+  each category of quality is shown. Choose whether or not to have the
+  quality codes for each position of the consensus displayed. They can
+  be displayed as either graphics or text.
+
+        The quality of the data depends on the number of times it  has
+  been  sequenced  and the particular uncertainty codes  used  in each
+  gel reading.  This function divides the data into  five  categories,
+  assigning each a symbol or code:
+  1.  Well determined on both strands and they agree.  code=0
+  2.  Well determined on the plus strand only.  code=1
+  3.  Well determined on the minus strand only.  code=2
+  4.  Not well determined on either strand.  code=3
+  5.  Well determined on both strands but they disagree.  code=4
+  A position is "well determined" if it is assigned one of the symbols
+  A,C,G,T  when  the  algorithm  described in the section "calculate a
+  consensus".   The  calculation  is  performed  separately  for  each
+  strand.
+
+        If the user chooses to have the data displayed graphically the
+  following  scheme  is used. A rectangular box is drawn so that the x
+  coordinate  represents  the  length  of  the  contig.  The  box   is
+  notionally divided vertically into 5 possible levels which are given
+  the y values: -2,-1,0,1,2.  The quality  codes  attributed  to  each
+  base  position are plotted as rectangles.  Each rectangle represents
+  a region in which the quality codes are identical, so a single  base
+  having a different code from its immediate neighbours will appear as
+  a very narrow rectangle.
+
+    Rectangle bottom and top y values
+
+       Quality 0 rectangle from 0 to 0
+       Quality 1 rectangle from 0 to 1
+       Quality 2 rectangle from 0 to -1
+       Quality 3 rectangle from -1 to 1
+       Quality 4 rectangle from -2 to 2
+
+        Obviously a single line  at  the  midheight  shows  a  perfect
+  sequence.
+
+        Typical dialogue is shown below.
+
+     41.47% OK on both strands and they agree(0)
+     55.48% OK on plus strand only(1)
+      2.08% OK on minus strand only(2)
+      0.97% Bad on both strands(3)
+      0.00% OK on both strands but they disagree(4)
+    ? (y/n) (y) Show sequence of codes
+
+             10         20         30         40         50
+     1111111111 1111111111 1111111111 1111111111 1111111111
+
+             60         70         80         90        100
+     1111111111 1111111111 1111111111 3111111111 1111111111
+
+            110        120        130        140        150
+     1111111111 1111131111 1111111111 1111111111 1111111111
+
+            160        170        180        190        200
+     1111111111 1111111111 1111111111 1111111111 1111111133
+
+            210        220        230        240        250
+     1311111111 1111111111 1111111110 0000000000 0000220000
+
+            260        270        280        290        300
+     0000000000 0020000000 2200000202 0002000000 0000222200
+
+ @26. TX 3 @              Alter relationships
+
+        Used  to  make  what  are  normally  illegal  changes  to  the
+  database. That is the normal checks are not done and any item in the
+  database can be changed independently of all others. Users  need  to
+  know  what they are doing because it is very easy to make a horrible
+  mess. Always start by making a copy!
+
+        By using the  options  here  users  can  edit  individual  gel
+  readings  in  contigs,  move  one  section  of  a contig relative to
+  another, break contigs, remove contigs, remove gel readings, etc. To
+  give  flexibility most of the commands do only one thing. This means
+  that several commands may  have  to  be  executed  to  complete  any
+  change.  At the end of this help section there are notes on removing
+  gel readings from the database.
+
+        The following options are offered:
+
+     Cancel
+     Line change
+     Edit single gel reading
+     Delete contig
+     Shift
+     Move gel reading
+     Rename gel reading
+     Break a contig
+     Alter raw data parameters
+
+  1. QUIT returns to the main options of SAP.
+  2. Line change
+  allows the user to change the contents  of  any line in the file  of
+  relationships.   The  line is selected by number, the program prints
+  the current line and prompts for the new  line.
+  3.   Edit
+  allows   the   user   to    edit    an    individual    gel  reading
+  independently of any others it may be related to. The edit positions
+  are relative to the contig. The effect of this editing on the length
+  of the gel reading is taken care of but, if it changes the length of
+  a contig, or its relationship to others, this must be accounted  for
+  (if necessary) by use of the "line change" function.
+  4.  Delete  contig
+  is a function that deletes a contig line  by moving  down  all   the
+  contig lines above by one position.  It prompts only for the line to
+  delete.  It does not  delete  any   of   the  gel  readings  or  gel
+  reading  lines  for the deleted contig but it does reduce the number
+  of contigs on line IDBSIZ by 1.
+  5.  Shift
+  allows the user to change all the relative  positions of  a set   of
+  neighbouring  gel  readings by some fixed value, i.e.  it will shift
+  related gel readings either left or  right.   It  can  therefore  be
+  used  to  change the alignment of the gel readings in a contig or as
+  part of the process of breaking a contig into two parts (see below).
+  It  prompts  for  the  number  of the first gel reading to shift and
+  then  for the  distance  to  move  them (Note a negative value  will
+  move  the  gel readings left and a positive value right).   It  then
+  chains rightwards (ie follows right neighbours) and shifts each  gel
+  reading,  in  turn,  up to the  end of the contig.  (This means that
+  only those gel readings from the first to shift to the rightmost are
+  moved). It updates the length of the contig accordingly.
+  6. Move gel reading
+  is  a  function  to  renumber  a  gel  reading.  It  moves  all  the
+  information  about  a  gel reading on to another line. The user must
+  specify the number of the gel  reading to move and the number of the
+  line  to place it. It takes care of all the relationships. Of course
+  gel readings must not be  moved  to  lines  occupied  by  other  gel
+  readings!  It  can  be used as part of the process of removing a gel
+  reading from the database (see below).
+  7.  Rename gel reading
+  is a function that is used to  rename  the archive   names   of  gel
+  readings   in  the  database;   it only changes the name in the .ARN
+  file of the  database.
+
+  8. Break contig
+
+        Occasionally it is necessary to break a contig into two  parts
+  and  this  can be achieved using this option. The program needs only
+  the number of a gel reading. This is  the  gel   reading  that  will
+  become  a  left  end  after  the  break.  That is, the break is made
+  between this gel reading and its left neighbour. A new  contig  line
+  is created so ensure that there is sufficient space in the database.
+  Removing gel readings from contigs
+
+        Gel readings can be removed  from  contigs  if  they  are  not
+  essential  for  holding the contig together (ie are not the only gel
+  reading covering a particular region). Suppose the  gel  reading  to
+  remove  is gel number b with left neighbour a and right neighbour c.
+  Using "line change" change the right neighbour of a to  c,  and  the
+  left neighbour of c to a. To tidy things up: suppose there are x gel
+  readings in the database; then, using "move gel reading" move gel  x
+  to  line  b;  then,  using  "line change" decrease the number of gel
+  readings in the database (stored in the last line) by 1.
+
+  8. Alter raw data parameters
+
+        Allows the user to edit the individual  raw  data  parameters,
+  such  as  the  left  and  right  cutoff  lengths and the name of the
+  machine readable trace file.  The user must specify the gel line  to
+  modify,  and  provide  new values for the length of the raw sequence
+  including cutoff lengths, the left cutoff position,  the  length  of
+  the original working sequence, the machine type, and the name of the
+  raw data file, where these values change.
+ @27. TX 1 @  Set display parameters
+
+        Used to  redefine  the  parameters  that  control  the  cutoff
+  employed  by  the  consensus  calculation  and quality examiner, the
+  maximum  length  of  each  reading  to  include   in   the   quality
+  calculation,  the line length used by the display function, the text
+  window length used by the graphics options, and the graphics  window
+  length used by the graphics options.
+
+        The default cutoff score is 75%. The default line length is 50
+  characters. For protein sequences the cutoff is always 100%.
+
+        The text window used by  the  graphics  options  controls  the
+  amount  of  sequence  listed at the crosshair position. The graphics
+  window controls the "zoom" function. Both these windows are  defined
+  as  the number of bases that should be shown, to both left and right
+  of the crosshair.
+ @30. TX 3 @  Auto edit a contig
+
+        This function automatically changes characters in gel readings
+  to  make  them  agree with the consensus sequence. If employed as is
+  intended, use of this function is not  a  criminal  activity  but  a
+  method  that saves a large amount of work. All characters changed by
+  the auto editor  will  appear  in  the  gel  readings  as  lowercase
+  letters. The current consensus calculation cutoff score is used.
+
+        Identify the contig and the section to edit. The program  will
+  display  a  summary  of  changes  made. Note that it is important to
+  understand both what the auto editor does and the order in which  it
+  does  it. Before employing the auto editor users should note all the
+  corrections that they require, so that  after it has been  used  the
+  corrections can be checked.
+
+        The general strategy employed when collecting shotgun sequence
+  data  is  to let the contigs get fairly deep, to get a printout of a
+  contig, check problems against the films, note  corrections  on  the
+  printout,  and  make  the  changes  using  an interactive editor. In
+  general the consensus is correct except  for  places  where  padding
+  characters  have been used to accommodate a single gel with an extra
+  character, or where the consensus is dash. The important  point  for
+  the  auto  editor  is  that  most edits simply make the gel readings
+  conform to the consensus, or remove columns of pads.
+
+        The new editor does the following.
+
+        1) calculates a consensus for the contig (or part of a contig)
+  to  be edited, and then uses this consensus to direct the editing of
+  the contig in 3 stages
+
+        2) stage 1: find and correct all places where, if the order of
+  two  adjacent  characters  is swapped, they will both agree with the
+  consensus (given that they did  not  match  the  consensus  before).
+  These corrections are termed "transpositions"
+
+        3)  stage 2: find and correct all  places  where  there  is  a
+  definite  consensus  but  the gel reading has a different character.
+  These corrections are termed "changes".
+
+        4) stage 3: delete all  positions  in  which  padding  is  the
+  consensus. These corrections are termed "deletions".
+
+        All changed characters are shown in lowercase  letters  so  it
+  will  be  obvious which characters have been assigned by the program
+  (except for deletions). The number of each type of  correction  will
+  be displayed.
+ @10. TX 2 @Clear graphics
+
+        Clears graphics from the screen.
+ @11. TX 2 @Clear text
+
+        Clears  text from the screen.
+ @12. TX 2 @Draw a ruler.
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first base  to  be
+  marked  (for  example if the active region is 1501 to 8000, the user
+  might wish to mark every 1000th base starting at either 1501 or 2000
+  -  it  depends  if  the user wishes to treat the active region as an
+  independent unit with its own numbering starting at its  left  edge,
+  or  as  part  of  the  whole sequence). The user can also define the
+  separation of the ticks on the scale and their height.  If  required
+  the labelling routine can be used to add numbers to the ticks.
+ @14. TX 2 @Reposition plots
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "ANALMARG" when
+  the program is started. Users can have their own file  if  required.
+  As  all  the plots start at the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. Note that,
+  unlike all the other programs, the boxes used to contain  analytical
+  results (eg plot quality) should not be made to overlap one another,
+  as the function of the crosshair routine depends on  which  box  the
+  crosshair is in!
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+
+        Typical dialogue follows.
+  ? Menu or option number=15
+  Type label then drive cross hair to left or right end
+  of label position then hit  "L"  to  write label left
+  justified or  "R"  to  write label right justified or
+  the space bar to quit
+
+
+  ? Label=delta gene
+
+   missing graphics
+
+  ? Label=
+
+ @16. TX 2 @Display a map
+
+        This draws a map of any  sequence  features  selected  by  the
+  user.   These  features  may  be  protein coding regions (CDS), tRNA
+  genes (TRNA), promoter positions (PRM), etc. Users may define  their
+  own  feature  table  key  names. For example I find it convenient to
+  split CDS lines into CDS1, CDS2 and CDS3 each of which contains only
+  those  sequences  that  code in the reading frames 1, 2 or 3. Then I
+  can plot them at different heights on the screen ( suitable  heights
+  can be determined by using the cross-hair).  The coordinates must be
+  stored in a file in the format of an EMBL feature table.
+
+        Typical dialogue follows.
+  ? Menu or option number=16
+   Display a map using an EMBL feature table file
+  ? map file name=hsegl1.ft
+  ? feature code(e.g. CDS) =CDS
+  X 1 + strand
+    2 - strand
+    3 both strands
+  ? 0,1,2,3 =
+  ? level (0-9480) (256) =4000
+
+   missing graphics
+
+  ? feature code(e.g. CDS) =
+
+ @7. TX 1 @Redirect output
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @13. TX 2 @Use crosshair
+
+        This option puts a steerable cross on  the  screen  which  the
+  user  drives  around  by  using  the arrow keys (or mouse). When the
+  crosshair is visible a number of options are available if  the  user
+  types  one  of  a  set  of  special  keyboard  characters. Any other
+  characters will cause an exit from the crosshair option. The special
+  keys are:
+
+      I = Identify the nearest gel reading
+      Z = Zoom in
+      Q = plot Quality
+      S = display the aligned Sequences at the crosshair position
+      N = list the Names and Numbers of the sequences at the crosshair
+
+        In order for  any  of  these  special  keys  to  operate,  the
+  crosshair  must  be  in  an appropriate display box, and the precise
+  function of the keys will also depend on which box the crosshair  is
+  in.
+
+        If the crosshair is in the "plot  all  contigs"  box,  Z  will
+  cause  a  new box to appear showing all the readings for the nearest
+  contig; Q will give the same as Z but will also produce an extra box
+  showing the "quality" plot.
+
+        If Z is hit in the "plot single contig" box, the  contig  will
+  be  zoomed  to  the  current  graphics window size. The zoom will be
+  roughly centred on the crosshair position. Because  of  this  it  is
+  possible  to  step  along  a  contig  by repeatedly zooming with the
+  crosshair near to one end of the single contig display box. If I  is
+  hit  the crosshair must be close to a gel reading line. If Q is hit,
+  the quality plot will be produced for the region shown in  the  plot
+  single  contig  box. In all cases when the "plot all contigs" box is
+  shown, a vertical line will  bisect  the  line  the  represents  the
+  relevant contig, at the current position.
+
+        If the crosshair is in the plot quality box only the character
+  "s" will operate as a special symbol.
+
+        The number of bases shown in the N and S options is controlled
+  by  the  current graphics text window size, and the size of the zoom
+  window by the current graphics window size.  Both  are  set  by  the
+  parameter setting function of the general menu.
+ @33. TX 2 @Plot single contig
+
+        This option produces a schematic of a  selected  region  of  a
+  single  contig by drawing a horizontal line to represent each of its
+  gel readings. The lines show the relative positions of each  reading
+  and  also  their  sense.  The  plot  is  divided vertically into two
+  sections by a line that is identified by an asterisk drawn  at  each
+  end.  All lines that lie above this line represent readings that are
+  in their original sense, all lines below show readings that  are  in
+  the  complementary  sense to their original. By use of the crosshair
+  function the plot can  be  stepped  through  and  examined  in  more
+  detail. See help on crosshair.
+ @34. TX 2 @Plot all contigs
+
+        This option produces a schematic  of  all  the  contigs  in  a
+  database.  It  does  this  by drawing a horizontal line to represent
+  each of them. In order to show the ends of each contig it draws  the
+  lines for contigs at alternate heights: the first at height one, the
+  second at height two, the third at height one, etc. The order of the
+  contigs  in  the display is the same as their order in the database.
+  By use of the crosshair function the plot can be stepped through and
+  examined in more detail. See help on crosshair.
+ @31. TX 3 @ Type in gel readings
+
+        THIS OPTION IS NO LONGER AVAILABLE IN XDAP.
+
+        This option  allows  gel  readings  to  be  typed  in  at  the
+  keyboard. It creates a separate file for each gel reading and a file
+  of file names for the batch. The sequences from each  batch  may  be
+  listed  when  they have all been entered. Users may choose to employ
+  special keys to identify the 4 bases A,C,G and T. By  default  these
+  special  keys are N M , . but any other four characters may be used.
+  If special keys are used the characters are automatically translated
+  to A C G T before being stored on the disk.
+ @35. TX 1 3 @Find internal joins
+
+        The purpose of this function is to use  data  already  in  the
+  database  to  find  possible  joins between contigs.  Joins may have
+  been missed due to poor data or  may  have  not  been  made  due  to
+  repeated  sequences.  Where  appropriate, it may be possible to find
+  potential joins by using the data  clipped  off  readings  prior  to
+  their entry into the database.
+  The database is checked for logical consistency.  Supply  a  minimum
+  initial  match  length,  a minimum alignment block, the maximum pads
+  per sequence, the maximum  percent  mismatch  after  alignment,  the
+  probe length. Choose if clipped data is to be used, if so define the
+  window size for finding good data and the number of  dashes  allowed
+  in  the  window. Processing will commence.  Most of these values are
+  used in an identical way in the autoassemble  function.  The  others
+  are defined below.
+  The program strategy
+  Take the first contig and calculate its consensus. If  clipped  data
+  is  being  used  examine  all readings that are in the complementary
+  orientation, and sufficiently near to the contigs left end,  to  see
+  if  they have good clipped sequence which if present, would protrude
+  from the left end of the contig.  If  found  add  the  longest  such
+  sequence to the left end of the consensus. Do the same for the right
+  end by examining readings that are in their original orientation. If
+  any  are  found  add  the  longest extension to the right end of the
+  consensus. Repeat the consensus calculations and extensions for  all
+  contigs  hence  producing  an extended consensus. If clipped data is
+  not  being  used  simply  calculate  the  consensus  for  the  whole
+  database.  Now  look  for  possible joins by processing the extended
+  consensus in the following  way.  Take  the  last,  say  100,  bases
+  (termed  the  "probe  length"  by  the  program)  of  the  rightmost
+  consensus, compare it both orientations with the extended  consensus
+  of  all the other contigs. Display any sufficiently good alignments.
+  Repeat with the left end of the rightmost contig. Do  the  same  for
+  the ends of all the entended contigs, always only comparing with the
+  contigs to their left, so that the same matches do not appear twice.
+  Good cliped data is defined by sliding a window of "Window size  for
+  good  data scan" bases outwards along the sequence and stopping when
+  "Maximum number of dashes in scan window" or more dashes  appear  in
+  the  window.   Note that it is advisable to have some sort of cutoff
+  because if we simply take all the  data  it  might  be  so  full  of
+  rubbish  that  we wont find any good matches. For the same reason it
+  is worth trying the procedure with different cutoffs. An initial run
+  using  no  clipped  data  is  also  recommended.   Sufficiently good
+  alignments are defined by  criteria  equivalent  to  those  used  in
+  autoassemble,  however here we only display alignments that pass all
+  tests.
+  Bugs
+  If a small contig is wholly contained within a larger one, such that
+  its  ends  are further than ("Probe length" - "Minimum initial match
+  length") from the ends of the larger contig, and the  consensus  for
+  the small contig lies to the left of the consensus for large contig,
+  the overlap will not be discovered. (See the search stratgey).
+  All numbering is relative to base number one in the contig:  matches
+  to  the  left  (i.e.  in  the clipped data) have negative positions,
+  matches off the right end of the contig (i.e. in the  clipped  data)
+  have  positions  greater  than  that  of  the  contig  length.   The
+  convention for reporting the positions of overlaps is as follows: if
+  neither  contig needs to be complemented the positions are as shown.
+  If the program says "contig x in the -  sense"  then  the  positions
+  shown  assume  contig  x  has  been complemented. For example in the
+  results given below the positions  for  the  first  overlap  are  as
+  reported,  but  those  for  the second assume that the contig in the
+  minus sense (i.e. 443) has been complemented.
+
+
+   Possible join between contig   445 in the + sense and contig   405
+   Percentage mismatch after alignment =  4.9
+          412        422        432        442        452        462
+       405  TTTCCCGACT GGAAAGCGGG CAGTGAGCGC AACGCAATTA ATGTGAG,TT AGCTCACTCA
+             ********* * ********  ***** *** ********** ********** **********
+       445  -TTCCCGACT G,AAAGCGGG TAGTGA,CGC AACGCAATTA ATGTGAG-TT AGCTCACTCA
+         -127       -117       -107        -97        -87        -77
+          472        482        492        502        512
+       405  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+            ********** ********** ********** ********** **
+       445  TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT AT
+          -67        -57        -47        -37        -27
+   Possible join between contig   443 in the - sense and contig   423
+   Percentage mismatch after alignment = 10.4
+           64         74         84         94        104        114
+       423  ATCGAAGAAA GAAAAGGAGG AGAAGATGAT TTTAAAAATG AAACG-CGAT GTCAGATGGG
+            **** ***** ********** ********** ******  ** ***** **** *********
+       443  ATCG,AGAAA GAAAAGGAGG AGAAGATGAT TTTAAA,,TG AAACGACGAT GTCAGATGG,
+         3610       3620       3630       3640       3650       3660
+          124        134        144        154        164
+       423  TTG-ATGAAG TAGAAGTAGG AG-AGGTGGA AGAGAAGAGA GTGGGA
+            *** ****** ********** ** *******  *** ***** ** **
+       443  TTGGATGAAG TAGAAGTAGG AGGAGGTGGA ,GAG,AGAGA GTTGG-
+         3670       3680       3690       3700       3710
+
+
+ @ end of help
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/dap_menu b/help/dap_menu
new file mode 100644
index 0000000..c643ee2
--- /dev/null
+++ b/help/dap_menu
@@ -0,0 +1,79 @@
+-1 0 21 2 T General
+-1 0 21 2 X General
+-2 0 50 2 T Screen control
+-2 0 71 2 X Screen
+-3 0 98 2 T Modification
+-3 0 98 2 X Modification
+0 -1 116 351 T SAP
+0 -1 116 351 X SAP
+17 1 18801 18 T Screen against enzymes
+17 1 18801 18 X Screen against enzymes
+18 1 19844 22 T Screen against vector
+18 1 19844 22 X Screen against vector
+20 3 21171 113 T Auto assemble
+20 3 21171 113 X Auto assemble
+28 1 27332 42 T Highlight disagreements
+28 1 27332 42 X Highlight disagreements
+32 3 29694 22 T Extract gel readings
+32 3 29694 22 X Extract gel readings
+1 0 30797 3 T Help
+1 0 30797 3 X Help
+2 0 30866 5 T Quit
+2 0 30866 5 X Quit
+3 1 31059 237 T Open a database
+3 1 31059 237 X Open a database
+4 3 43258 239 T Edit contig
+4 3 43258 239 X Edit contig
+9 3 54180 42 T Screen edit
+5 1 56376 45 T Display a contig
+5 1 56376 45 X Display a contig
+6 1 58862 6 T List a text file
+6 1 58862 6 X List a text file
+8 1 59120 93 T Calculate a consensus
+8 1 59120 93 X Calculate a consensus
+25 1 63651 41 T Show relationships
+25 1 63651 41 X Show relationships
+21 3 65587 101 T Enter new gel reading
+21 3 65587 101 X Enter new gel reading
+23 3 70677 11 T Complement a contig
+23 3 70677 11 X Complement a contig
+22 3 71200 63 T Join contigs
+22 3 71200 63 X Join contigs
+24 1 74467 11 T Copy the database
+24 1 74467 11 X Copy the database
+19 1 75013 41 T Check database
+19 1 75013 41 X Check database
+29 1 77032 82 T Examine quality
+29 1 77032 82 X Examine quality
+26 3 80849 101 T Alter relationships
+26 3 80849 101 X Alter relationships
+27 1 86065 17 T Set display parameters
+27 1 86065 17 X Set display parameters
+30 3 86933 48 T Auto edit a contig
+30 3 86933 48 X Auto edit a contig
+10 2 89409 3 T Clear graphics
+10 2 89409 3 X Clear graphics
+11 2 89474 3 T Clear text
+11 2 89474 3 X Clear text
+12 2 89539 12 T Draw a ruler.
+12 2 89539 12 X Draw a ruler.
+14 2 90273 38 T Reposition plots
+14 2 90273 38 X Reposition plots
+15 2 92612 28 T Label a diagram
+15 2 92612 28 X Label a diagram
+16 2 93717 27 T Display a map
+16 2 93717 27 X Display a map
+7 1 94692 12 T Redirect output
+7 1 94692 12 X Redirect output
+13 2 95163 43 T Use crosshair
+13 2 95163 43 X Use crosshair
+33 2 97308 12 T Plot single contig
+33 2 97308 12 X Plot single contig
+34 2 98010 10 T Plot all contigs
+34 2 98010 10 X Plot all contigs
+31 3 98592 12 T Type in gel readings
+31 3 98592 12 X Type in gel readings
+35 3 99223 92 T Find internal joins
+35 1 99223 92 T Find internal joins
+35 3 99223 92 X Find internal joins
+35 1 99223 92 X Find internal joins
diff --git a/help/gip_help b/help/gip_help
new file mode 100644
index 0000000..618202d
--- /dev/null
+++ b/help/gip_help
@@ -0,0 +1,198 @@
+                                  GIP
+
+        A digitizer is a  two  dimensional  surface which is such that
+ if  a  special  pen  is  pressed onto it, the pens coordinates can be
+ recorded by a computer.  These coordinates can be  interpreted  by  a
+ program.
+
+        The digitizing device we use works by the pen emitting a  high
+ frequency  sound  which is picked up by two microphones positioned at
+ the rear of the working area.  The  pen  position  is  determined  by
+ triangulation  and the digitizing device sends the coordinates to the
+ computer.  As  no  special  surface  is  required  the   device   can
+ conveniently  be  positioned  on  a light box giving the sequencer an
+ unobscured view of the autoradiographs.
+ The digitizer  is  called  a  GRAPHBAR  MODEL  GP7  made  by  Science
+ Accessories  Corp,  970  Kings  Highway  West, Southport, Connecticut
+ 06490, USA.
+
+        The program uses a menu to allow the user to  select  commands
+ or  to   enter  the  uncertainty  codes  for  areas  of  the gel that
+ are difficult to interpret.  A menu is simply a series of boxes drawn
+ on  the   digitizing  surface   that   each   contain   a  command or
+ uncertainty code.  When the user puts the pen down in these   special
+ regions   the program interprets the coordinates as commands and acts
+ appropriately. A copy of the menu should have been sent to  you.   It
+ should  be  stuck  down  on  the  surface  of  the  light  box in the
+ digitizing area. For convenience it is best to  position  it  to  the
+ right of the digitizing area, but in practice as long as its top edge
+ is parallel to the digitizer box, it  can  be  put  anywhere  in  the
+ active region.
+
+ Entering gel readings using a digitizer
+
+        The autoradiograph should be stuck down on the light box  with
+ the  lanes  running,  as  near is as possible, at right angles to the
+ digitizer. To read an autoradiograph placed on the light box the user
+ need  only  define the positions of the four sequencing lanes and the
+ bases to which they correspond and then use  the  pen  to  point   to
+ each  successive   band progressing up the gel.  The program examines
+ the coordinates of each pen position to see in  which  of  the   four
+ lanes  it   lies  and  assigns  the  corresponding  base to be stored
+ in the computer.  Each time the pen tip is depressed to  point  to  a
+ position  on   the   surface  of the digitizer the program sounds the
+ bell on the terminal (a different sound for each of the four bases on
+ the  microcomputer  version  of  the program) to indicate to the user
+ that a point has been  recorded.   As  the   sequence   is  read  the
+ program displays it on the screen.
+
+        The program uses a menu to allow the user to  select  commands
+ or  to   enter  the  uncertainty  codes  for  areas  of  the gel that
+ are difficult to interpret.  A menu is simply a series of boxes drawn
+ on  the   digitizing  surface   that   each   contain   a  command or
+ uncertainty code.  When the user puts the pen down in these   special
+ regions   the program interprets the coordinates as commands and acts
+ appropriately.     As    well    as    the     uncertainty      codes
+ A,C,G,T,1,2,3,4,B,D,H,V,R,Y,X,-,5,6,7,8   the  following commands are
+ included in the menu:  DELETE removes the last  character   from  the
+ sequence; RESET allows the lane centres to be redefined; START  means
+ begin  the  next stage  of  the   procedure;   STOP  means  stop  the
+ current  stage in the procedure;  CONFIRM means confirm that the last
+ command  or  set  of coordinates  are  correct.
+
+        The digitizing device also has a menu of its own. This lies in
+ a two inch wide strip immediately in front of the digitizing box. Pen
+ positions within this two inch strip are interpretted as commands  to
+ the  digitizer  and  are  not sent to the GIP program. In general the
+ only time users will need to use the device menu is  when  they  tell
+ GIP  where the program menu lies in the digitizing area. This is done
+ by first hitting ORIGIN in the  device  menu  and  then  hitting  the
+ bottom  left  hand  corner  of the program menu. The program menu can
+ hence be positioned anywhere in  the  active  region  but  should  be
+ arranged parallel to the digitizer.
+
+        The user should try to hit the bands as near  as  possible  to
+ the  centre  of the lanes because the program tracks the lanes up the
+ film using the pen positions. By using  this  tracking  strategy  the
+ user only has to define the centres of the bottom of the lanes before
+ starting to read the film. The program  can  correctly  follow  quite
+ curved  lanes  and constantly checks that its lane centre coordinates
+ look sensible. If the lane centres appear to be getting too close the
+ program stops responding to the pen positions of bands and hence does
+ not ring the bell. If this occurs users must hit the reset box in the
+ menu  and  the program will request them to redefine the lane centres
+ at the current reading position. Then they can continue reading. As a
+ further  safeguard  the  program  will  only respond to pen positions
+ either in the menu or very close to the current reading position.
+
+ Running the gel reading program
+ The autoradiograph should be firmly stuck down on the light  box  and
+ the program started by typing GIP. It will ask the first question.
+  " ? FILE OF FILE NAMES="
+  Type the name for the  file  of  file  names  and  then  follow  the
+  instructions.
+  " HIT DIGITIZER MENU ORIGIN"
+  " THEN PROGRAM MENU ORIGIN"
+  " THEN HIT START IN PROGRAM MENU"
+  If the bell does not sound after you hit start try hitting metric in
+  the  device menu (the program uses metric units, and some digitizers
+  are set to default to use inches; hitting  metric  switches  between
+  the two).
+  After the bell has sounded the program will give  the  default  lane
+  order.
+  " LANE ORDER IS T C A G"
+  " IF CORRECT HIT CONFIRM, ELSE HIT RESET"
+  If the lane order, reading from left to right is correct hit confirm
+  in  the  program  menu. If you are using a different order hit reset
+  and you will be asked to define the lane order from  left  to  right
+  using the program menu (as follows).
+  " DEFINE LANE ORDER (LEFT TO RIGHT) USING MENU"
+  Hit the boxes in the menu that contain the symbols  A,C,G,T  in  the
+  left-right  order  of  the  lanes. The program will respond with the
+  lane order as above and ask for confirmation. When this is received,
+  the  next  task  is  to  define the start positions of the next four
+  lanes.
+  " HIT START, THEN HIT (LEFT TO RIGHT)"
+  " THE START POSITIONS FOR THE NEXT FOUR LANES"
+  Hit the centres of the four lanes at a height level with  the  first
+  band that is going to be read. The program will report the mean lane
+  separations and asks for confirmation that they are correct.
+  " MEAN LANE SEPARATION IS XX"
+  " HIT CONFIRM TO CONTINUE"
+  Users will become familiar with the values from their films and will
+  spot  any  unusual numbers.  Asking for confirmation allows users to
+  try again if they  had  made  a  mistake,  but  generally  the  lane
+  separation values can be ignored.  Hit confirm, and the program will
+  give the message
+  " HIT START WHEN READY TO BEGIN READING"
+  Hit start and the program will give the message
+  " HIT BANDS, UNCERTAINTY CODES, RESET OR STOP"
+  Hit the bands, interpretting the sequence progressing up  the  film.
+  If  necessary use the uncertainty codes. If the pen stops responding
+  hit reset and follow the instructions as above.  When  the  sequence
+  becomes unreadable hit stop and the program will ask for a file name
+  for the gel reading just read.
+  " ? FILE NAME FOR THIS GEL READING="
+  Type the file name observing the  rules  about  legal  gel  readings
+  names.  The program will ask if you wish to read another sequence.
+  " TO ENTER ANOTHER GEL READING TYPE 1"
+  To enter another type 1 and you will be back to the step of defining
+  the lane order. Typing anything else will stop the program.
+
+ Running the microcomputer version of the gel reading program
+ The microcomputer version of GIP is slightly different and is  called
+ GIPB.  The  BBC  micro  does not have the capacity to process the gel
+ readings beyond the reading stage.  This means  that  users  of  this
+ program  would  need to transfer their gel readings from the micro to
+ another machine using a terminal emmulator. Transferring  many  files
+ is  tedious  and  so  the  microcomputer  version  of the gel reading
+ program stores all the gel readings for each run of the program in  a
+ single file. This special file contains both sequences and file names
+ and can be moved in a single transfer to another machine. Once on the
+ other machine the single file must be split into separate gel reading
+ files and a file of file  names.  This  is  done  using  the  program
+ BSPLIT.  As  far  as using the microcomputer version of GIP, the only
+ difference is that the first file name the program requests is not  a
+ file of file names, but a name for the single file to contain all the
+ gel readings and their names.
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/gip_menu b/help/gip_menu
new file mode 100644
index 0000000..e69de29
diff --git a/help/makefile b/help/makefile
new file mode 100644
index 0000000..520b30f
--- /dev/null
+++ b/help/makefile
@@ -0,0 +1,48 @@
+#
+# Make file for help files - this requires gmake on some systems.
+#
+PROGS = bap       dap       gip       mep         nip      \
+	nipf      pip       sap       sip        #mem
+
+HELPS = bap_help  dap_help  gip_help  mep_help    nip_help \
+	nipf_help pip_help  sap_help  sip_help   #mem_help
+
+MENUS = bap_menu  dap_menu  gip_menu  mep_menu    nip_menu \
+	nipf_menu pip_menu  sap_menu  sip_menu   #mem_menu
+
+all:	$(PROGS)
+
+DOIT = rm -f $@_help $@_menu; ./runoff $?
+
+bap:	BAP.RNO
+	$(DOIT)
+
+dap:	DAP.RNO
+	$(DOIT)
+
+gip:	GIP.RNO
+	$(DOIT)
+
+#mem:	MEM.RNO
+#	$(DOIT)
+
+mep:	MEP.RNO
+	$(DOIT)
+
+nip:	NIP.RNO
+	$(DOIT)
+
+nipf:	NIPF.RNO
+	$(DOIT)
+
+pip:	PIP.RNO
+	$(DOIT)
+
+sap:	SAP.RNO
+	$(DOIT)
+
+sip:	SIP.RNO
+	$(DOIT)
+
+clean:
+	rm -f $(HELPS) $(MENUS)
diff --git a/help/mem_help b/help/mem_help
new file mode 100644
index 0000000..d44fdab
--- /dev/null
+++ b/help/mem_help
@@ -0,0 +1,698 @@
+
+ @0. B 1 @MEP
+  This is a program  for analysing families of nucleotide sequences in
+  order  to find common motifs and potential binding sites.  The ideas
+  in  this  program  were  described  in  Staden,  R.   "Methods   for
+  discovering  novel  motifs  in  nucleic  acid  sequences".  Computer
+  Applications in the Biosciences, 5, 293-298, (1989).
+
+        The program  can  read  sequences  stored  in  either  of  two
+  formats: 1) all sequences aligned in a single file; 2) all sequences
+  in separate files and accessed through a file of file names.
+
+        The  program  contains  functions  that  can  answer   several
+  questions about a set of sequences:
+
+  Which words are most common?
+  Which words occur in the most sequences?
+  Which words contain the most information?
+  Which words occur in equivalent positions in the sequences?
+  Which words are inverted repeats?
+  Which words occur on both strands of the sequences?
+  Where are the inverted repeats?
+  Where are the fuzzy words?
+
+        Most of the program is concerned with analysing what it  terms
+  "fuzzy words" within the set of sequences. The analysis is explained
+  below. Note that the standard version of the programs is limited  to
+  words of maximum length 8 letters, and a maximum fuzziness of 2.
+
+        The following analyses (preceded by their option numbers)  are
+  included:
+    ? = Help
+    ! = Quit
+    3 = Read new sequences
+    4 = Redefine active region
+    5 = List the sequences
+    6 = List text file
+    7 = Direct output to disk
+   10 = Clear graphics
+   11 = Clear text
+   12 = Draw ruler
+   13 = Use cross hair
+   14 = Reset margins
+   15 = Label diagram
+   16 = Draw map
+   17 = Search for strings
+   18 = Set strand
+   19 = Set composition
+   20 = Set word length
+   21 = Set number of mismatches
+   22 = Show settings
+   23 = Make dictionary Dw
+   24 = Make dictionary Ds
+   25 = Make fuzzy dictionary Dm from Dw
+   26 = Make fuzzy dictionary Dm from Ds
+   27 = Make fuzzy dictionary Dh from Dm
+   28 = Examine fuzzy dictionary Dm
+   29 = Examine fuzzy dictionary Dh
+   30 = Examine words in Dm
+   31 = Examine words in Dh
+   32 = Save or restore a dictionary
+   33 = Find inverted repeats
+
+        Some of these methods produce graphical  results  and  so  the
+  program  is  generally used from a graphics terminal (a vdu on which
+  lines and points can be drawn as well as characters).
+
+  The positions of each of the plots is defined relative  to  a  users
+  drawing board which has size 1-10,000 in x and 1-10,000 in y.  Plots
+  for each  option  are  drawn  in  a  window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The default window positions are read from a file "MEPMARG" when the
+  program is started. Users can have their own file if required.
+
+        The options for the program are accessed from  3  main  menus:
+  general,  screen  control  and dictionary analylsis.  Both menus and
+  options are selected by number.
+
+        The most important and novel part of the program is its use of
+  "fuzzy dictionaries" and an information theory measure, to help show
+  the most interesting motifs.  Central to the method is the idea of a
+  fuzzy   dictionary   of  word  frequencies.  A  dictionary  of  word
+  frequencies is an ordered list of all the words in the sequences and
+  a  count  of the number of times that they occur. A fuzzy dictionary
+  is an equivalent list but which contains instead, for each  word,  a
+  count  of  the number of times similar words occur in the sequences.
+  We term words that are similar "relations". The fuzziness is defined
+  by the number of letters in a word that are allowed to be different.
+  So if we had a fuzziness of 1 we allow 1 letter to be different. For
+  example,  with  a  fuzziness of 1, the entry in the fuzzy dictionary
+  for the word TTTTTT would contain a count of the  numbers  of  times
+  TTTTTT  occured  plus  the  number  of  times all words differing by
+  exactly one letter from TTTTTT occured.
+
+        Once the fuzzy dictionary has been created we can  examine  it
+  in  several  ways  to find candidate control sequences. The simplest
+  question we can ask is which word in  the  dictionary  is  the  most
+  common.   Sometimes  this  simple  criterion of "most common" may be
+  adequate to discover a new motif but in general we would not  expect
+  it  to  be  sufficient. For example some words will be common simply
+  because of a base composition bias in the sequences being  analysed.
+  In  addition  a  word  can be the most frequent and yet not be "well
+  defined". This last point is best explained by an example.
+
+        Suppose we were looking at  two letter words and allowing  one
+  mismatch,  and  that  there were 10 occurences of TT and 5 of AC. We
+  could align the 10 words that were one letter different from TT  and
+  the  5  that  were  related to AC. Then we could count the number of
+  times each base occured in each position for each of these two  sets
+  of words. Suppose we got the two base frequency tables shown below.
+     TT                  AC
+         T 6 4               T 1 0
+         C 1 3               C 0 4
+         A 1 2               A 4 1
+         G 2 1               G 0 0
+
+  These tables show that although TT occurs (with one letter mismatch)
+  more often than AC, the ratio of base frequencies for AC at 4/5, 4/5
+  is higher than those for TT at 6/10, 4/10. Hence we would  say  that
+  AC was better defined than TT.  Expressing this another way we would
+  say that the definition of AC contained more information  than  that
+  for TT. The program calculates the information content in a way that
+  takes into account both the sequence composition and  the  level  of
+  definition of the motif.
+
+        Definitions
+
+        Here we deal only with the dictionary  analysis.   Suppose  we
+  are dealing with a set of sequences and are examining them for words
+  that are six characters in length.
+
+        Dictionary Dw contains a count of the  number  of  times  each
+  word  occurs  in  the  set  of  sequences. For example the entry for
+  TTTTTT contains a value equal to the number of times the word TTTTTT
+  occurs in the set of sequences.
+
+        Dictionary Ds contains a count  of  the  number  of  different
+  sequences  in  which  each word occurs. For example if the entry for
+  word TTTTTT contains the value 10, it denotes that the  word  TTTTTT
+  occurs  in  ten  different sequences. Unlike Dw it only counts words
+  once for each  sequence.  For  example  if  we  had  a  set  of  100
+  sequences, the maximum possible value that Ds could take is 100, and
+  this would only happen if a word occurred in every sequence. However
+  for  the same set of sequences, Dw could contain values greater than
+  100, and this would show that a word had occurred more than once  in
+  at least one sequence.
+
+        From either of the two dictionaries Dw or Ds we can  calculate
+  a  fuzzy  dictionary  Dm.  For  each  word,  the  entry in the fuzzy
+  dictionary Dm contains the sum of the dictionary values (taken  from
+  either  Dw  or  Ds)  for  all  words  that differ from it by up to m
+  letters. For example if m=2 the entry for TTTTTT contains the number
+  of  times  that TTTTTT occurs in the dictionary, plus the counts for
+  all words that differ from TTTTTT by 1 or 2 letters.  Obviously  the
+  interpretation  of  the  values  in  Dm  depends on which of the two
+  dictionaries Dw or Ds they were derived from. When derived  from  Dw
+  the entry for any word in Dm gives the total number of times it, and
+  its relations, occur in the set of sequences. When derived  from  Ds
+  the  entry  for  any  word in Dm gives the total number of different
+  sequences that contain a word and each of its relations.
+
+        Finally,  from  fuzzy  dictionary  Dm  we  can  derive   fuzzy
+  dictionary  Dh.  All  entries in Dh are zero except for the word(s),
+  within each set of relations, that are most frequent. For example if
+  TTTTTT  occurred  20  times  but  had  a relation that occurred more
+  often, then the entry for TTTTTT would be zero.  However  if  TTTTTT
+  did  not  have  a more frequently occurring relation, then the entry
+  for TTTTTT would contain the value 20.
+ @1. B 1 @Help
+  This option gives online help. The user should select option numbers
+  and  the  current  documentation  will  be given. Note that option 0
+  gives an introduction to the program, and that ? will get help  from
+  anywhere  in the program.  The following analyses (preceded by their
+  option numbers) are included:
+    ? = Help
+    ! = Quit
+    3 = Read new sequences
+    4 = Redefine active region
+    5 = List the sequences
+    6 = List text file
+    7 = Direct output to disk
+   10 = Clear graphics
+   11 = Clear text
+   12 = Draw ruler
+   13 = Use cross hair
+   14 = Reset margins
+   15 = Label diagram
+   16 = Draw map
+   17 = Search for strings
+   18 = Set strand
+   19 = Set composition
+   20 = Set word length
+   21 = Set number of mismatches
+   22 = Show settings
+   23 = Make dictionary Dw
+   24 = Make dictionary Ds
+   25 = Make fuzzy dictionary Dm from Dw
+   26 = Make fuzzy dictionary Dm from Ds
+   27 = Make fuzzy dictionary Dh from Dm
+   28 = Examine fuzzy dictionary Dm
+   29 = Examine fuzzy dictionary Dh
+   30 = Examine words in Dm
+   31 = Examine words in Dh
+   32 = Save or restore a dictionary
+   33 = Find inverted repeats
+ @2. B 1 @Quit
+  This function stops the program.
+ @3. B 1 @Read a new sequence.
+
+        It can read sequences stored in either of two formats: 1)  all
+  sequences  aligned  in  a  single file; 2) all sequences in separate
+  files and accessed through a file of file  names.  Typical  dialogue
+  follows:
+
+  X 1 Read file of aligned sequences
+    2 Use file of file names
+  ? 0,1,2 =
+
+  ? File of aligned sequences=F1
+  Number of files           88
+
+ @4. B 1 @Define active region
+  For its analytic functions the program always works on a  region  of
+  the  sequence called the active region. When  new sequences are read
+  into the program the active region is automatically set to start  at
+  the  beginning  of the sequences and go up to the end of the longest
+  one.
+ @5. B 1 @List a sequence.
+  The sequence can be listed with line lengths of 50 bases  with  each
+  sequence  numbered in the order in which they were read.  Output can
+  be directed to a disk file by first selecting disk  output.  Typical
+  dialogue follows.
+
+  ? Menu or option number=5
+
+                10        20        30        40        50
+     1  TAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCA
+     2  CAAATAATCAATGTGGACTTTTCTGCCGTGATTATAGACACTTTTGTTAC
+     3  TAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTATT
+     4  ACTAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTA
+     5  AGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGA
+     6  TAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGC
+     7  ACACCATCGAATGGCGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCG
+     8  GGGGCAAGGAGGATGGAAAGAGGTTGCCGTATAAAGAAACTAGAGTCCGT
+     9  AGGGGGTGGAGGATTTAAGCCATCTCCTGATGACGCATAGTCAGCCCATC
+    10  AAAACGTCATCGCTTGCATTAGAAAGGTTTCTGGCCGACCTTATAACCAT
+
+                60
+     1  TACCCGTTTTT
+     2  GCGTTTTTGT
+     3  TCATACCATAAG
+     4  TTTCATACC
+     5  ATTGTGAGC
+     6  TTCCGGCTCG
+     7  GAAGAGAGT
+     8  TCAGGTGT
+     9  ATGAATG
+    10  TAATTACG
+ @6. B 1 @List a text file.
+  Allows the user to have a text file displayed on the screen. It will
+  appear one page at a time.
+ @7. B 1 @Direct output to disk
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @10. B 1 @Clear graphics
+  Clears the screen of both text and graphics.
+ @11. B 1 @Clear text
+  Clears only text from the screen.
+ @12. B 1 @Draw a ruler.
+  This option allows the user to draw a ruler or  scale  along  the  x
+  axis  of  the  screen  to help identify the coordinates of points of
+  interest. The user can define the position of the first  amino  acid
+  to  be marked (for example if the active region is 1501 to 8000, the
+  user might wish to mark every 1000th amino acid starting  at  either
+  1501  or  2000  -  it depends if the user wishes to treat the active
+  region as an independent unit with its own numbering starting at its
+  left  edge,  or  as  part  of the whole sequence). The user can also
+  define the separation of the ticks on the scale and their height. If
+  required  the  labelling  routine  can be used to add numbers to the
+  ticks.
+ @13. B 1 @Use crosshair.
+  This function puts a steerable cross on the screen that can be  used
+  to find the coordinates of points in the sequence. The user can move
+  the cross around using the directional keys; when he hits the  space
+  bar  the  program  will  print  out  the coordinates of the cross in
+  sequence units and the option will be exited.
+
+        If instead, you hit a , the position will be displayed but the
+  cross will remain on the screen.
+
+        If a letter s is hit the sequence around  the  cross  hair  is
+  displayed and the cross remains on the screen.
+ @14. B 1 @Reposition plots
+  The positions of each of the plots is defined relative  to  a  users
+  drawing board which has size 1-10,000 in x and 1-10,000 in y.  Plots
+  for each  option  are  drawn  in  a  window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The default window positions are read from a file "MEPMARG" when the
+  program is started. Users can have their own file if  required.   As
+  all  the  plots  start  at  the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. The cross-
+  hair can be used to choose suitable heights.
+ @15. B 1 @Label a diagram
+  This routine allows users to label any diagrams they have  produced.
+  They  are  asked  to  type  in a label. When the user types carriage
+  return to finish typing the label  the  cross-hair  appears  on  the
+  screen. The user can position it anywhere on the screen. If the user
+  types R (for right justify) the label will be written on the diagram
+  with  its  right end at the cross-hair position. If the user types L
+  (for left justify) the label will be written on the diagram with its
+  left  end  at  the  cross  hair  position.  The cross-hair will then
+  immediately reappear. The user may put the  same  label  on  another
+  part of the diagram as before or if he hits the space bar he will be
+  asked if he wishes to type in another label.
+ @16. B 1 @Display a map.
+  It is often convenient to plot a map alongside graphed  analysis  in
+  order to indicate features within the sequence. This function allows
+  users to draw maps using files arranged in the form of EMBL  feature
+  tables.  Of  course the EMBL table are usually only used for nucleic
+  acid sequence annotation but, as long as the features are written in
+  the correct format, they can be employed by this routine. The map is
+  composed of a line representing the sequence and then further  lines
+  denoting the endpoints of each feature the user identifies. The user
+  is asked to  define  height  at  which  the  line  representing  the
+  sequence  should be drawn; then for the feature height; then for the
+  features to plot.
+ @17. B 1 @Search for strings
+  Search for  strings  perfoms  searches  of  all  the  sequences  for
+  selected words and shows which sequences they are found in. The user
+  types in a word and defines the allowed number  of  mismatches.  The
+  results  are  listed  or plotted. If listed the display includes the
+  sequence number, the position  in  the  sequence  and  the  matching
+  string.  The results are plotted in the following way. The x axis of
+  the plot represents the length of the aligned sequences  and  the  y
+  direction  is  divided  into  sufficient  strips to accommodate each
+  sequence. So if a match is found in the 3rd sequence at  a  position
+  equivalent  to  halfway  along  the  longest of the sequences then a
+  short vertical line will be drawn at the midpoint of the 3rd  strip.
+  If  the  sequences are aligned it can be useful if the motifs happen
+  to appear  in  related  positions.  For  example  see  the  original
+  publication. Typical dialogue follows.
+
+  ? Menu or option number=17
+  X 1 Plot match positions
+    2 Plot histogram of matches
+  ? 0,1,2 =
+  ? Word to search for=TTGACA
+  ? Minimum match (0-6) (6) =5
+  ? (y/n) (y) Plot results N
+       2    35 TAGACA
+       5    14 TTTACA
+       6    37 TTTACA
+      11    14 TAGACA
+      14    14 TTGACA
+      17    14 GTGACA
+      17    22 TTAACA
+      20     1 TTGACA
+ @18. B 1 @Set strand
+  Set strand  allows  the  user  to  define  which  strand(s)  of  the
+  sequences to analyse: input stand, complement of input, or both.
+ @19. B 1 @Set composition
+  Set composition  gives  the  user  three  choices  for  setting  the
+  composition  of  the  sequences  for  use  in the calculation of the
+  information content of  words.  The  user  can  select  the  overall
+  composition  of  the  sequences as read, an even composition, or can
+  type in any other 4 values.
+ @20. B 1 @Set word length
+  Set word length sets the length of word for which dictionaries  will
+  be made.
+ @21. B 1 @Set number of mismatches
+  Set number of  mismatches  sets  the  level  of  fuzziness  for  the
+  creation of dictionary Dm.
+ @22. B 1 @Show settings
+  Show  settings  show  the  current  settings  for   all   parameters
+  associated with dictionary analysis. A typical diaplsy follows:
+   ? Menu or option number=22
+   Current word length  =   6
+   Number of mismatches =   1
+   Start position       =     1
+   End position         =    63
+   Input strand only
+   Observed composition
+   Dictionary Dw unmade
+   Dictionary Ds unmade
+   Dictionary Dm unmade
+   Dictionary Dh unmade
+ @23. B 1 @Make dictionary Dw
+  Make dictionary Dw creates a dictionary that contains  a  count   of
+  the frequency of occurrence of each word in the collected sequences.
+ @24. B 1 @Make dictionary Ds
+  Make dictionary Ds creates a dictionary that contains a count of the
+  number of different sequences that contain each word.
+ @25. B 1 @Make dictionary Dm from Dw
+  Make dictionary Dm  from Dw creates a dictionary from dictionary  Dw
+  that contains the frequency of occurrence of each word (say X) in Dw
+  plus the frequency of occurrence of each word  in  Dw  that  differs
+  from  X  by  up  to m letters. Dm is called a fuzzy dictionary as it
+  contains the  frequencies  of  occurrence  of  all  words  plus  the
+  frequencies of all the words that are similar to them.
+ @26. B 1 @Make dictionary Dm from Ds
+  Make dictionary Dm  from Ds creates a dictionary from dictionary  Ds
+  that contains the frequency of occurrence of each word (say X) in Ds
+  plus the frequency of occurrence of each word  in  Ds  that  differs
+  from  X  by  up  to m letters. Dm is called a fuzzy dictionary as it
+  contains the  frequencies  of  occurrence  of  all  words  plus  the
+  frequencies of all the words that are similar to them.
+ @27. B 1 @Make dictionary Dh from Dm
+  Make dictionary Dh  creates a  dictionary  from  dictionary  Dm  and
+  whose  entries are zero except for those words in any set of related
+  words that are most frequent. It finds the dominant  words  in  each
+  set of relations and stores their counts.
+ @28. B 1 @Examine dictionary Dm
+  Examine dictionary Dm  allows  users  to  analyse  the  contents  of
+  dictionary  Dm  to  find  the  most common words or those words that
+  contain the most information.  The  user  supplies  a  frequency  or
+  information  cutoff and chooses to have the results sorted on either
+  value. The program will find the top  100  words  that  achieve  the
+  cutoff  values  and present them to the user sorted as selected. The
+  information  content  will  be  calcutated  from  either  Dw  or  Ds
+  depending  which  was  used  to  create  Dm,  and  using the current
+  composition setting. Typical dialogue follows:
+
+  ? Menu or option number=28
+  Looking for highest scoring words
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.62
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           9 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  AAAAAC      64   0.66460
+  AAAAAA      90   0.64880
+  GTTTTT      66   0.64300
+  TTTTTG      73   0.64070
+  TTTTGT      63   0.63820
+  TTTTTC      65   0.63810
+  AAAATA      63   0.62670
+  TATAAT      65   0.62510
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.62
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =2
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           9 Maximum information=  0.7385326
+  AAAAAA      90   0.64880
+  TTTTTG      73   0.64070
+  GTTTTT      66   0.64300
+  TTTTTC      65   0.63810
+  TATAAT      65   0.62510
+  AAAAAC      64   0.66460
+  TTTTGT      63   0.63820
+  AAAATA      63   0.62670
+  TTGACA      60   0.73850
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =!
+
+ @29. B 1 @Examine dictionary Dh
+  Examine dictionary Dh  allows  users  to  analyse  the  contents  of
+  dictionary   Dh  to  find  the most common words or those words that
+  contain the most information.  The  user  supplies  a  frequency  or
+  information  cutoff and chooses to have the results sorted on either
+  value. The program will find the top  100  words  that  achieve  the
+  cutoff  values  and present them to the user sorted as selected. The
+  information  content  will  be  calcutated  from  either  Dw  or  Ds
+  depending  which  was  used  to  create  Dh  and  using  the current
+  composition setting. Typical dialogue follows:
+
+  ? Menu or option number=29
+  Looking for highest scoring words
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.6
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           4 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  TTTTTT     115   0.60630
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =50
+  ? Minimum information (0.00-1.00) (0.00) =.5
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           8 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  TCTTGA      54   0.66080
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  ACTTTA      57   0.61960
+  TTTTTT     115   0.60630
+  AGTATA      51   0.60540
+  TTATAA      55   0.59300
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =50
+  ? Minimum information (0.00-1.00) (0.00) =
+
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           8 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  TCTTGA      54   0.66080
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  ACTTTA      57   0.61960
+  TTTTTT     115   0.60630
+  AGTATA      51   0.60540
+  TTATAA      55   0.59300
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =!
+
+ @30. B 1 @Examine words in Dm
+  Examine words  in  Dm  allows  users  to  analyse  the  contents  of
+  dictonary  Dm  at  the  level  of  individual  words  to  find their
+  frequency, information content, and  to  see  their  base  frequency
+  table.  The user types in a word to examine and the program displays
+  the values and table. The information  content  will  be  calcutated
+  from  either  Dw  or  Ds  depending which was used to create Dm, and
+  using the current composition setting. Typical dialogue follows:
+  ? Menu or option number=30
+  ? Word to examine=TTGACA
+  TtgacA            60  0.7385326
+      56    56     6     7     5    11
+       4     3     2     1    52     1
+       1     4     2    53     3    48
+       3     1    54     3     4     4
+  TTGACA
+  ? Word to examine=TATAAT
+  taTAat            65  0.6251902
+      56     3    53     4     4    60
+       6     1     5     5     5     3
+       3    60     5    57    57     4
+       4     5     6     3     3     2
+  TATAAT
+  ? Word to examine=
+
+ @31. B 1 @Examine words in Dh
+  Examine words  in  Dh  allows  users  to  analyse  the  contents  of
+  dictonary  Dh  at  the  level  of  individual  words  to  find their
+  frequency, information content, and  to  see  their  base  frequency
+  table.  The user types in a word to examine and the program displays
+  the values and table. The information  content  will  be  calcutated
+  from  either  Dw  or  Ds  depending which was used to create Dm, and
+  using the current composition setting. Typical dialogue follows:
+
+   ? Menu or option number=31
+  ? Word to examine=TTGACA
+  TtgacA            60  0.7385326
+      56    56     6     7     5    11
+       4     3     2     1    52     1
+       1     4     2    53     3    48
+       3     1    54     3     4     4
+  TTGACA
+  ? Word to examine=TATAAT
+  taTAat            65  0.6251902
+      56     3    53     4     4    60
+       6     1     5     5     5     3
+       3    60     5    57    57     4
+       4     5     6     3     3     2
+  TATAAT
+  ? Word to examine=GGGGGG
+  gggggg             0  0.6199890
+       3     1     1     2     3     4
+       1     3     1     2     2     1
+       2     1     1     1     1     1
+      11    12    14    12    11    11
+  GGGGGG
+  ? Word to examine=
+
+ @32. B 1 @Save or restore a dictionary
+  Save or restore  dictionary  allows  users  to  write  or  read  any
+  dictionary  to  and from disk files. The user is asked te define the
+  dictionary and file. The function is useful  if  the  machine  being
+  used  is  very  slow at calculating because the files can be handled
+  quickly. However note that the files  cannot  be  processed  by  any
+  other program.
+ @33. B 1 @Find inverted repeats
+  Find inverted repeats performs searches for simple  inverted  repeat
+  sequences  in  each  sequence.  They  are defined by a range of loop
+  sizes and a minimum number of potential basepairs. The  results  can
+  be  plotted  or listed. The x axis of the plot represents the length
+  of the aligned  sequences  and  the  y  direction  is  divided  into
+  sufficient  strips  to  accommodate each sequence. So if an inverted
+  repeat is found in the 3rd sequence  at  a  position  equivalent  to
+  halfway  along  the  longest  of the sequences then a short vertical
+  line will be drawn at the midpoint of the 3rd strip.  Alternatively,
+  if  the  results  are  listed, the potential hairpin loops are drawn
+  out, with the sequence number and the position of the loop.  Typical
+  dialogue follows.
+
+  ? Menu or option number=33
+  Define the range of loop sizes
+  ? Minimum loop size (0-10) (3) =0
+  ? Maximum loop size (1-20) (3) =
+  ? Minimum number of basepairs (1-20) (6) =
+  ? (y/n) (y) Plot results N
+   Searching
+
+  Sequence     3    34
+             C
+            G.T
+            T-A
+            A-T
+            T.G
+            T.G
+            G.T
+       ATCTTT TATTTCA
+           33
+
+  Sequence     5    35
+             T
+            G.T
+            T.G
+            A-T
+            T.G
+            G.T
+            C-G
+            T.G
+       TCCGGC AATTGTG
+           34
+
+
+ @ End of help
diff --git a/help/mem_menu b/help/mem_menu
new file mode 100644
index 0000000..2508f7c
--- /dev/null
+++ b/help/mem_menu
@@ -0,0 +1,32 @@
+0 1 15 184 B MEP
+1 1 9304 37 B Help
+2 1 10465 2 B Quit
+3 1 10531 14 B Read a new sequence.
+4 1 10932 6 B Define active region
+5 1 11250 31 B List a sequence.
+6 1 12393 3 B List a text file.
+7 1 12525 12 B Direct output to disk
+10 1 12996 2 B Clear graphics
+11 1 13065 2 B Clear text
+12 1 13126 12 B Draw a ruler.
+13 1 13871 12 B Use crosshair.
+14 1 14459 34 B Reposition plots
+15 1 16611 12 B Label a diagram
+16 1 17394 12 B Display a map.
+17 1 18154 31 B Search for strings
+18 1 19507 3 B Set strand
+19 1 19672 6 B Set composition
+20 1 20013 3 B Set word length
+21 1 20131 3 B Set number of mismatches
+22 1 20256 14 B Show settings
+23 1 20718 3 B Make dictionary Dw
+24 1 20890 3 B Make dictionary Ds
+25 1 21055 7 B Make dictionary Dm from Dw
+26 1 21505 7 B Make dictionary Dm from Ds
+27 1 21955 5 B Make dictionary Dh from Dm
+28 1 22245 55 B Examine dictionary Dm
+29 1 24148 70 B Examine dictionary Dh
+30 1 26410 25 B Examine words in Dm
+31 1 27437 33 B Examine words in Dh
+32 1 28701 7 B Save or restore a dictionary
+33 1 29106 46 B Find inverted repeats
diff --git a/help/mep_help b/help/mep_help
new file mode 100644
index 0000000..e2337ec
--- /dev/null
+++ b/help/mep_help
@@ -0,0 +1,792 @@
+
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. TX  0 @Dictionary analysis
+
+ @0. TX  -1 @MEP
+
+        This is  a  program   for  analysing  families  of  nucleotide
+  sequences  in  order  to  find  common  motifs and potential binding
+  sites.  The ideas in this  program  were  described  in  Staden,  R.
+  "Methods  for  discovering  novel motifs in nucleic acid sequences".
+  Computer Applications in the Biosciences, 5, 293-298, (1989).
+
+        The program  can  read  sequences  stored  in  either  of  two
+  formats: 1) all sequences aligned in a single file; 2) all sequences
+  in separate files and accessed through a file of file names.
+
+        The  program  contains  functions  that  can  answer   several
+  questions about a set of sequences:
+
+  Which words are most common?
+  Which words occur in the most sequences?
+  Which words contain the most information?
+  Which words occur in equivalent positions in the sequences?
+  Which words are inverted repeats?
+  Which words occur on both strands of the sequences?
+  Where are the inverted repeats?
+  Where are the fuzzy words?
+
+        Most of the program is concerned with analysing what it  terms
+  "fuzzy words" within the set of sequences. The analysis is explained
+  below. Note that the standard version of the programs is limited  to
+  words of maximum length 8 letters, and a maximum fuzziness of 2.
+
+        The following analyses (preceded by their option numbers)  are
+  included:
+    ? = Help
+    ! = Quit
+    3 = Read new sequences
+    4 = Redefine active region
+    5 = List the sequences
+    6 = List text file
+    7 = Direct output to disk
+   10 = Clear graphics
+   11 = Clear text
+   12 = Draw ruler
+   13 = Use cross hair
+   14 = Reset margins
+   15 = Label diagram
+   16 = Draw map
+   17 = Search for strings
+   18 = Set strand
+   19 = Set composition
+   20 = Set word length
+   21 = Set number of mismatches
+   22 = Show settings
+   23 = Make dictionary Dw
+   24 = Make dictionary Ds
+   25 = Make fuzzy dictionary Dm from Dw
+   26 = Make fuzzy dictionary Dm from Ds
+   27 = Make fuzzy dictionary Dh from Dm
+   28 = Examine fuzzy dictionary Dm
+   29 = Examine fuzzy dictionary Dh
+   30 = Examine words in Dm
+   31 = Examine words in Dh
+   32 = Save or restore a dictionary
+   33 = Find inverted repeats
+
+        Some of these methods produce graphical  results  and  so  the
+  program  is  generally used from a graphics terminal (a vdu on which
+  lines and points can be drawn as well as characters).
+
+  The positions of each of the plots is defined relative  to  a  users
+  drawing board which has size 1-10,000 in x and 1-10,000 in y.  Plots
+  for each  option  are  drawn  in  a  window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The default window positions are read from a file "MEPMARG" when the
+  program is started. Users can have their own file if required.
+
+        The options for the program are accessed from  3  main  menus:
+  general,  screen  control  and dictionary analylsis.  Both menus and
+  options are selected by number.
+
+        The most important and novel part of the program is its use of
+  "fuzzy dictionaries" and an information theory measure, to help show
+  the most interesting motifs.  Central to the method is the idea of a
+  fuzzy   dictionary   of  word  frequencies.  A  dictionary  of  word
+  frequencies is an ordered list of all the words in the sequences and
+  a  count  of the number of times that they occur. A fuzzy dictionary
+  is an equivalent list but which contains instead, for each  word,  a
+  count  of  the number of times similar words occur in the sequences.
+  We term words that are similar "relations". The fuzziness is defined
+  by the number of letters in a word that are allowed to be different.
+  So if we had a fuzziness of 1 we allow 1 letter to be different. For
+  example,  with  a  fuzziness of 1, the entry in the fuzzy dictionary
+  for the word TTTTTT would contain a count of the  numbers  of  times
+  TTTTTT  occured  plus  the  number  of  times all words differing by
+  exactly one letter from TTTTTT occured.
+
+        Once the fuzzy dictionary has been created we can  examine  it
+  in  several  ways  to find candidate control sequences. The simplest
+  question we can ask is which word in  the  dictionary  is  the  most
+  common.   Sometimes  this  simple  criterion of "most common" may be
+  adequate to discover a new motif but in general we would not  expect
+  it  to  be  sufficient. For example some words will be common simply
+  because of a base composition bias in the sequences being  analysed.
+  In  addition  a  word  can be the most frequent and yet not be "well
+  defined". This last point is best explained by an example.
+
+        Suppose we were looking at  two letter words and allowing  one
+  mismatch,  and  that  there were 10 occurences of TT and 5 of AC. We
+  could align the 10 words that were one letter different from TT  and
+  the  5  that  were  related to AC. Then we could count the number of
+  times each base occured in each position for each of these two  sets
+  of words. Suppose we got the two base frequency tables shown below.
+     TT                  AC
+         T 6 4               T 1 0
+         C 1 3               C 0 4
+         A 1 2               A 4 1
+         G 2 1               G 0 0
+
+  These tables show that although TT occurs (with one letter mismatch)
+  more often than AC, the ratio of base frequencies for AC at 4/5, 4/5
+  is higher than those for TT at 6/10, 4/10. Hence we would  say  that
+  AC was better defined than TT.  Expressing this another way we would
+  say that the definition of AC contained more information  than  that
+  for TT. The program calculates the information content in a way that
+  takes into account both the sequence composition and  the  level  of
+  definition of the motif.
+
+        Definitions
+
+        Here we deal only with the dictionary  analysis.   Suppose  we
+  are dealing with a set of sequences and are examining them for words
+  that are six characters in length.
+
+        Dictionary Dw contains a count of the  number  of  times  each
+  word  occurs  in  the  set  of  sequences. For example the entry for
+  TTTTTT contains a value equal to the number of times the word TTTTTT
+  occurs in the set of sequences.
+
+        Dictionary Ds contains a count  of  the  number  of  different
+  sequences  in  which  each word occurs. For example if the entry for
+  word TTTTTT contains the value 10, it denotes that the  word  TTTTTT
+  occurs  in  ten  different sequences. Unlike Dw it only counts words
+  once for each  sequence.  For  example  if  we  had  a  set  of  100
+  sequences, the maximum possible value that Ds could take is 100, and
+  this would only happen if a word occurred in every sequence. However
+  for  the same set of sequences, Dw could contain values greater than
+  100, and this would show that a word had occurred more than once  in
+  at least one sequence.
+
+        From either of the two dictionaries Dw or Ds we can  calculate
+  a  fuzzy  dictionary  Dm.  For  each  word,  the  entry in the fuzzy
+  dictionary Dm contains the sum of the dictionary values (taken  from
+  either  Dw  or  Ds)  for  all  words  that differ from it by up to m
+  letters. For example if m=2 the entry for TTTTTT contains the number
+  of  times  that TTTTTT occurs in the dictionary, plus the counts for
+  all words that differ from TTTTTT by 1 or 2 letters.  Obviously  the
+  interpretation  of  the  values  in  Dm  depends on which of the two
+  dictionaries Dw or Ds they were derived from. When derived  from  Dw
+  the entry for any word in Dm gives the total number of times it, and
+  its relations, occur in the set of sequences. When derived  from  Ds
+  the  entry  for  any  word in Dm gives the total number of different
+  sequences that contain a word and each of its relations.
+
+        Finally,  from  fuzzy  dictionary  Dm  we  can  derive   fuzzy
+  dictionary  Dh.  All  entries in Dh are zero except for the word(s),
+  within each set of relations, that are most frequent. For example if
+  TTTTTT  occurred  20  times  but  had  a relation that occurred more
+  often, then the entry for TTTTTT would be zero.  However  if  TTTTTT
+  did  not  have  a more frequently occurring relation, then the entry
+  for TTTTTT would contain the value 20.
+ @1. T 0 @Help
+
+        This option gives online help. The user should  select  option
+  numbers  and  the  current  documentation  will  be given. Note that
+  option 0 gives an introduction to the program, and that ?  will  get
+  help from anywhere in the program.  The following analyses (preceded
+  by their option numbers) are included:
+    ? = Help
+    ! = Quit
+    3 = Read new sequences
+    4 = Redefine active region
+    5 = List the sequences
+    6 = List text file
+    7 = Direct output to disk
+   10 = Clear graphics
+   11 = Clear text
+   12 = Draw ruler
+   13 = Use cross hair
+   14 = Reset margins
+   15 = Label diagram
+   16 = Draw map
+   17 = Search for strings
+   18 = Set strand
+   19 = Set composition
+   20 = Set word length
+   21 = Set number of mismatches
+   22 = Show settings
+   23 = Make dictionary Dw
+   24 = Make dictionary Ds
+   25 = Make fuzzy dictionary Dm from Dw
+   26 = Make fuzzy dictionary Dm from Ds
+   27 = Make fuzzy dictionary Dh from Dm
+   28 = Examine fuzzy dictionary Dm
+   29 = Examine fuzzy dictionary Dh
+   30 = Examine words in Dm
+   31 = Examine words in Dh
+   32 = Save or restore a dictionary
+   33 = Find inverted repeats
+ @2. T 0 @Quit
+
+        This function stops the program.
+ @3. TX 1 @Read a new sequence
+
+        It can read sequences stored in either of two formats: 1)  all
+  sequences  aligned  in  a  single file; 2) all sequences in separate
+  files and accessed through a file of file  names.  Typical  dialogue
+  follows:
+
+  X 1 Read file of aligned sequences
+    2 Use file of file names
+  ? 0,1,2 =
+
+  ? File of aligned sequences=F1
+  Number of files           88
+
+ @4. TX 1 @Define active region
+
+        For its analytic functions  the  program  always  works  on  a
+  region of the sequence called the active region. When  new sequences
+  are read into the program the active region is automatically set  to
+  start  at the beginning of the sequences and go up to the end of the
+  longest one.
+ @5. TX 1 @List a sequence
+
+        The sequence can be listed with line lengths of 50 bases  with
+  each sequence numbered in the order in which they were read.  Output
+  can be directed to a disk  file  by  first  selecting  disk  output.
+  Typical dialogue follows.
+
+  ? Menu or option number=5
+
+                10        20        30        40        50
+     1  TAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCA
+     2  CAAATAATCAATGTGGACTTTTCTGCCGTGATTATAGACACTTTTGTTAC
+     3  TAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTATT
+     4  ACTAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTA
+     5  AGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGA
+     6  TAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGC
+     7  ACACCATCGAATGGCGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCG
+     8  GGGGCAAGGAGGATGGAAAGAGGTTGCCGTATAAAGAAACTAGAGTCCGT
+     9  AGGGGGTGGAGGATTTAAGCCATCTCCTGATGACGCATAGTCAGCCCATC
+    10  AAAACGTCATCGCTTGCATTAGAAAGGTTTCTGGCCGACCTTATAACCAT
+
+                60
+     1  TACCCGTTTTT
+     2  GCGTTTTTGT
+     3  TCATACCATAAG
+     4  TTTCATACC
+     5  ATTGTGAGC
+     6  TTCCGGCTCG
+     7  GAAGAGAGT
+     8  TCAGGTGT
+     9  ATGAATG
+    10  TAATTACG
+ @6. TX 1 @List a text file
+
+        Allows the user to have a text file displayed on  the  screen.
+  It will appear one page at a time.
+ @7. TX 1 @Direct output to disk
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @10. TX 2 @Clear graphics
+
+        Clears the screen of both text and graphics.
+ @11. TX 2 @Clear text
+
+        Clears only text from the screen.
+ @12. TX 2 @Draw a ruler
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first  amino  acid
+  to  be marked (for example if the active region is 1501 to 8000, the
+  user might wish to mark every 1000th amino acid starting  at  either
+  1501  or  2000  -  it depends if the user wishes to treat the active
+  region as an independent unit with its own numbering starting at its
+  left  edge,  or  as  part  of the whole sequence). The user can also
+  define the separation of the ticks on the scale and their height. If
+  required  the  labelling  routine  can be used to add numbers to the
+  ticks.
+ @13. TX 2 @Use crosshair
+
+        This function puts a steerable cross on the screen that can be
+  used to find the coordinates of points in the sequence. The user can
+  move the cross around using the directional keys; when he  hits  the
+  space bar the program will print out the coordinates of the cross in
+  sequence units and the option will be exited.
+
+        If instead, you hit a , the position will be displayed but the
+  cross will remain on the screen.
+
+        If a letter s is hit the sequence around  the  cross  hair  is
+  displayed and the cross remains on the screen.
+ @14. TX 2 @Reposition plots
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The default window positions are read from a file "MEPMARG" when the
+  program is started. Users can have their own file if  required.   As
+  all  the  plots  start  at  the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. The cross-
+  hair can be used to choose suitable heights.
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+ @16. TX 2 @Display a map
+
+        It is  often  convenient  to  plot  a  map  alongside  graphed
+  analysis  in  order  to  indicate features within the sequence. This
+  function allows users to draw maps using files arranged in the  form
+  of  EMBL  feature  tables. Of course the EMBL table are usually only
+  used for nucleic acid  sequence  annotation  but,  as  long  as  the
+  features  are written in the correct format, they can be employed by
+  this routine. The  map  is  composed  of  a  line  representing  the
+  sequence  and  then  further  lines  denoting  the endpoints of each
+  feature the user identifies. The user is asked to define  height  at
+  which  the  line representing the sequence should be drawn; then for
+  the feature height; then for the features to plot.
+ @17. TX 1 @Search for strings
+
+        Search for strings perfoms searches of all the  sequences  for
+  selected words and shows which sequences they are found in. The user
+  types in a word and defines the allowed number  of  mismatches.  The
+  results  are  listed  or plotted. If listed the display includes the
+  sequence number, the position  in  the  sequence  and  the  matching
+  string.  The results are plotted in the following way. The x axis of
+  the plot represents the length of the aligned sequences  and  the  y
+  direction  is  divided  into  sufficient  strips to accommodate each
+  sequence. So if a match is found in the 3rd sequence at  a  position
+  equivalent  to  halfway  along  the  longest of the sequences then a
+  short vertical line will be drawn at the midpoint of the 3rd  strip.
+  If  the  sequences are aligned it can be useful if the motifs happen
+  to appear  in  related  positions.  For  example  see  the  original
+  publication. Typical dialogue follows.
+
+  ? Menu or option number=17
+  X 1 Plot match positions
+    2 Plot histogram of matches
+  ? 0,1,2 =
+  ? Word to search for=TTGACA
+  ? Minimum match (0-6) (6) =5
+  ? (y/n) (y) Plot results N
+       2    35 TAGACA
+       5    14 TTTACA
+       6    37 TTTACA
+      11    14 TAGACA
+      14    14 TTGACA
+      17    14 GTGACA
+      17    22 TTAACA
+      20     1 TTGACA
+ @18. TX 3 @Set strand
+
+        Set strand allows the user to define which  strand(s)  of  the
+  sequences to analyse: input stand, complement of input, or both.
+ @19. TX 3 @Set composition
+
+        Set composition gives the user three choices for  setting  the
+  composition  of  the  sequences  for  use  in the calculation of the
+  information content of  words.  The  user  can  select  the  overall
+  composition  of  the  sequences as read, an even composition, or can
+  type in any other 4 values.
+ @20. TX 3 @Set word length
+
+        Set word length sets the length of word for which dictionaries
+  will be made.
+ @21. TX 3 @Set number of mismatches
+
+        Set number of mismatches sets the level of fuzziness  for  the
+  creation of dictionary Dm.
+ @22. TX 3 @Show settings
+
+        Show settings show the current  settings  for  all  parameters
+  associated with dictionary analysis. A typical diaplsy follows:
+   ? Menu or option number=22
+   Current word length  =   6
+   Number of mismatches =   1
+   Start position       =     1
+   End position         =    63
+   Input strand only
+   Observed composition
+   Dictionary Dw unmade
+   Dictionary Ds unmade
+   Dictionary Dm unmade
+   Dictionary Dh unmade
+ @23. TX 3 @Make dictionary Dw
+
+        Make dictionary Dw creates a dictionary that contains a  count
+  of  the  frequency  of  occurrence  of  each  word  in the collected
+  sequences.
+ @24. TX 3 @Make dictionary Ds
+
+        Make dictionary Ds creates a dictionary that contains a  count
+  of the number of different sequences that contain each word.
+ @25. TX 3 @Make dictionary Dm from Dw
+
+        Make  dictionary  Dm   from  Dw  creates  a  dictionary   from
+  dictionary Dw that contains the frequency of occurrence of each word
+  (say X) in Dw plus the frequency of occurrence of each  word  in  Dw
+  that  differs  from  X  by  up  to  m  letters. Dm is called a fuzzy
+  dictionary as it contains the frequencies of occurrence of all words
+  plus the frequencies of all the words that are similar to them.
+ @26. TX 3 @Make dictionary Dm from Ds
+
+        Make  dictionary  Dm   from  Ds  creates  a  dictionary   from
+  dictionary Ds that contains the frequency of occurrence of each word
+  (say X) in Ds plus the frequency of occurrence of each  word  in  Ds
+  that  differs  from  X  by  up  to  m  letters. Dm is called a fuzzy
+  dictionary as it contains the frequencies of occurrence of all words
+  plus the frequencies of all the words that are similar to them.
+ @27. TX 3 @Make dictionary Dh from Dm
+
+        Make dictionary Dh  creates a dictionary  from  dictionary  Dm
+  and  whose  entries  are  zero  except for those words in any set of
+  related words that are most frequent. It finds the dominant words in
+  each set of relations and stores their counts.
+ @28. TX 3 @Examine fuzzy dictionary Dm
+
+        Examine dictionary Dm allows users to analyse the contents  of
+  dictionary  Dm  to  find  the  most common words or those words that
+  contain the most information.  The  user  supplies  a  frequency  or
+  information  cutoff and chooses to have the results sorted on either
+  value. The program will find the top  100  words  that  achieve  the
+  cutoff  values  and present them to the user sorted as selected. The
+  information  content  will  be  calcutated  from  either  Dw  or  Ds
+  depending  which  was  used  to  create  Dm,  and  using the current
+  composition setting. Typical dialogue follows:
+
+  ? Menu or option number=28
+  Looking for highest scoring words
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.62
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           9 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  AAAAAC      64   0.66460
+  AAAAAA      90   0.64880
+  GTTTTT      66   0.64300
+  TTTTTG      73   0.64070
+  TTTTGT      63   0.63820
+  TTTTTC      65   0.63810
+  AAAATA      63   0.62670
+  TATAAT      65   0.62510
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.62
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =2
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           9 Maximum information=  0.7385326
+  AAAAAA      90   0.64880
+  TTTTTG      73   0.64070
+  GTTTTT      66   0.64300
+  TTTTTC      65   0.63810
+  TATAAT      65   0.62510
+  AAAAAC      64   0.66460
+  TTTTGT      63   0.63820
+  AAAATA      63   0.62670
+  TTGACA      60   0.73850
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =!
+
+ @29. TX 3 @Examine fuzzy dictionary Dh
+
+        Examine dictionary Dh allows users to analyse the contents  of
+  dictionary   Dh  to  find  the most common words or those words that
+  contain the most information.  The  user  supplies  a  frequency  or
+  information  cutoff and chooses to have the results sorted on either
+  value. The program will find the top  100  words  that  achieve  the
+  cutoff  values  and present them to the user sorted as selected. The
+  information  content  will  be  calcutated  from  either  Dw  or  Ds
+  depending  which  was  used  to  create  Dh  and  using  the current
+  composition setting. Typical dialogue follows:
+
+  ? Menu or option number=29
+  Looking for highest scoring words
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =60
+  ? Minimum information (0.00-1.00) (0.00) =.6
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           4 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  TTTTTT     115   0.60630
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =50
+  ? Minimum information (0.00-1.00) (0.00) =.5
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           8 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  TCTTGA      54   0.66080
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  ACTTTA      57   0.61960
+  TTTTTT     115   0.60630
+  AGTATA      51   0.60540
+  TTATAA      55   0.59300
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =50
+  ? Minimum information (0.00-1.00) (0.00) =
+
+  X 1 Sort on information
+    2 Sort on word score
+  ? 0,1,2 =
+
+  ? Maximum number to list (0-100) (100) =
+
+  The words are
+   Total words=           8 Maximum information=  0.7385326
+  TTGACA      60   0.73850
+  TCTTGA      54   0.66080
+  AAAAAA      90   0.64880
+  TATAAT      65   0.62510
+  ACTTTA      57   0.61960
+  TTTTTT     115   0.60630
+  AGTATA      51   0.60540
+  TTATAA      55   0.59300
+  The highest word score =          115
+  ? Minimum word score (0-115) (0) =!
+
+ @30. TX 3 @Examine words in Dm
+
+        Examine words in Dm allows users to analyse  the  contents  of
+  dictonary  Dm  at  the  level  of  individual  words  to  find their
+  frequency, information content, and  to  see  their  base  frequency
+  table.  The user types in a word to examine and the program displays
+  the values and table. The information  content  will  be  calcutated
+  from  either  Dw  or  Ds  depending which was used to create Dm, and
+  using the current composition setting. Typical dialogue follows:
+  ? Menu or option number=30
+  ? Word to examine=TTGACA
+  TtgacA            60  0.7385326
+      56    56     6     7     5    11
+       4     3     2     1    52     1
+       1     4     2    53     3    48
+       3     1    54     3     4     4
+  TTGACA
+  ? Word to examine=TATAAT
+  taTAat            65  0.6251902
+      56     3    53     4     4    60
+       6     1     5     5     5     3
+       3    60     5    57    57     4
+       4     5     6     3     3     2
+  TATAAT
+  ? Word to examine=
+
+ @31. TX 3 @Examine words in Dh
+
+        Examine words in Dh allows users to analyse  the  contents  of
+  dictonary  Dh  at  the  level  of  individual  words  to  find their
+  frequency, information content, and  to  see  their  base  frequency
+  table.  The user types in a word to examine and the program displays
+  the values and table. The information  content  will  be  calcutated
+  from  either  Dw  or  Ds  depending which was used to create Dm, and
+  using the current composition setting. Typical dialogue follows:
+
+   ? Menu or option number=31
+  ? Word to examine=TTGACA
+  TtgacA            60  0.7385326
+      56    56     6     7     5    11
+       4     3     2     1    52     1
+       1     4     2    53     3    48
+       3     1    54     3     4     4
+  TTGACA
+  ? Word to examine=TATAAT
+  taTAat            65  0.6251902
+      56     3    53     4     4    60
+       6     1     5     5     5     3
+       3    60     5    57    57     4
+       4     5     6     3     3     2
+  TATAAT
+  ? Word to examine=GGGGGG
+  gggggg             0  0.6199890
+       3     1     1     2     3     4
+       1     3     1     2     2     1
+       2     1     1     1     1     1
+      11    12    14    12    11    11
+  GGGGGG
+  ? Word to examine=
+
+ @32. TX 3 @Save or restore a dictionary
+
+        Save or restore dictionary allows users to write or  read  any
+  dictionary  to  and from disk files. The user is asked te define the
+  dictionary and file. The function is useful  if  the  machine  being
+  used  is  very  slow at calculating because the files can be handled
+  quickly. However note that the files  cannot  be  processed  by  any
+  other program.
+ @33. TX 1 @Find inverted repeats
+
+        Find inverted repeats performs searches  for  simple  inverted
+  repeat  sequences  in  each sequence. They are defined by a range of
+  loop sizes and a minimum number of potential basepairs. The  results
+  can  be  plotted  or  listed.  The x axis of the plot represents the
+  length of the aligned sequences and the y direction is divided  into
+  sufficient  strips  to  accommodate each sequence. So if an inverted
+  repeat is found in the 3rd sequence  at  a  position  equivalent  to
+  halfway  along  the  longest  of the sequences then a short vertical
+  line will be drawn at the midpoint of the 3rd strip.  Alternatively,
+  if  the  results  are  listed, the potential hairpin loops are drawn
+  out, with the sequence number and the position of the loop.  Typical
+  dialogue follows.
+
+  ? Menu or option number=33
+  Define the range of loop sizes
+  ? Minimum loop size (0-10) (3) =0
+  ? Maximum loop size (1-20) (3) =
+  ? Minimum number of basepairs (1-20) (6) =
+  ? (y/n) (y) Plot results N
+   Searching
+
+  Sequence     3    34
+             C
+            G.T
+            T-A
+            A-T
+            T.G
+            T.G
+            G.T
+       ATCTTT TATTTCA
+           33
+
+  Sequence     5    35
+             T
+            G.T
+            T.G
+            A-T
+            T.G
+            G.T
+            C-G
+            T.G
+       TCCGGC AATTGTG
+           34
+ @ End of help
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/mep_menu b/help/mep_menu
new file mode 100644
index 0000000..7d588ab
--- /dev/null
+++ b/help/mep_menu
@@ -0,0 +1,68 @@
+-1 0 22 2 T General
+-1 0 22 2 X General
+-2 0 51 2 T Screen control
+-2 0 72 2 X Screen
+-3 0 106 2 T Dictionary analysis
+-3 0 106 2 X Dictionary analysis
+0 -1 124 185 T MEP
+0 -1 124 185 X MEP
+1 0 9423 38 T Help
+2 0 10594 3 T Quit
+3 1 10667 14 T Read a new sequence
+3 1 10667 14 X Read a new sequence
+4 1 11069 7 T Define active region
+4 1 11069 7 X Define active region
+5 1 11396 32 T List a sequence
+5 1 11396 32 X List a sequence
+6 1 12548 4 T List a text file
+6 1 12548 4 X List a text file
+7 1 12690 12 T Direct output to disk
+7 1 12690 12 X Direct output to disk
+10 2 13162 3 T Clear graphics
+10 2 13162 3 X Clear graphics
+11 2 13239 3 T Clear text
+11 2 13239 3 X Clear text
+12 2 13307 13 T Draw a ruler
+12 2 13307 13 X Draw a ruler
+13 2 14053 13 T Use crosshair
+13 2 14053 13 X Use crosshair
+14 2 14643 35 T Reposition plots
+14 2 14643 35 X Reposition plots
+15 2 16797 13 T Label a diagram
+15 2 16797 13 X Label a diagram
+16 2 17589 13 T Display a map
+16 2 17589 13 X Display a map
+17 1 18384 32 T Search for strings
+17 1 18384 32 X Search for strings
+18 3 19739 4 T Set strand
+18 3 19739 4 X Set strand
+19 3 19906 7 T Set composition
+19 3 19906 7 X Set composition
+20 3 20249 4 T Set word length
+20 3 20249 4 X Set word length
+21 3 20374 4 T Set number of mismatches
+21 3 20374 4 X Set number of mismatches
+22 3 20501 15 T Show settings
+22 3 20501 15 X Show settings
+23 3 20965 5 T Make dictionary Dw
+23 3 20965 5 X Make dictionary Dw
+24 3 21152 4 T Make dictionary Ds
+24 3 21152 4 X Make dictionary Ds
+25 3 21326 8 T Make dictionary Dm from Dw
+25 3 21326 8 X Make dictionary Dm from Dw
+26 3 21787 8 T Make dictionary Dm from Ds
+26 3 21787 8 X Make dictionary Dm from Ds
+27 3 22248 6 T Make dictionary Dh from Dm
+27 3 22248 6 X Make dictionary Dh from Dm
+28 3 22551 56 T Examine fuzzy dictionary Dm
+28 3 22551 56 X Examine fuzzy dictionary Dm
+29 3 24462 71 T Examine fuzzy dictionary Dh
+29 3 24462 71 X Examine fuzzy dictionary Dh
+30 3 26726 26 T Examine words in Dm
+30 3 26726 26 X Examine words in Dm
+31 3 27755 34 T Examine words in Dh
+31 3 27755 34 X Examine words in Dh
+32 3 29021 8 T Save or restore a dictionary
+32 3 29021 8 X Save or restore a dictionary
+33 1 29428 45 T Find inverted repeats
+33 1 29428 45 X Find inverted repeats
diff --git a/help/nip_help b/help/nip_help
new file mode 100644
index 0000000..5acf9ce
--- /dev/null
+++ b/help/nip_help
@@ -0,0 +1,4620 @@
+
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. T   0 @Statistical analysis of content
+
+ @-3. X   0 @Statistics
+
+ @-4. T   0 @Structures and repeats
+
+ @-4. X   0 @Structures
+
+ @-5. TX  0 @Translation and codons
+
+ @-6. TX  0 @Gene search by content
+
+ @-7. TX  0 @General signals
+
+ @-8. TX  0 @Specific signals
+
+ @0.  TX  -1 @NIP
+
+
+        This  is  a  program   for  analysing  individual   nucleotide
+ sequences.  It can read sequences stored in many of the most commonly
+ used formats, and performs all of the usual simple analyses.  However
+ the  main  purpose  of the program is to provide  methods for finding
+ the function of each section of a  sequence.  In  general  no  single
+ method  can   give  an unequivecal interpretation of a sequence so we
+ need to use many techniques together and to combine   their  results.
+ For   this   reason   the  program   present  many  of  its   results
+ graphically.
+
+        General information is contained in the user interface. Online
+ documentation for any function follows a consistent pattern: summary,
+ list of inputs, list of outputs, details, example.
+ @1. TX 0 @ Help
+
+        This option gives online help. The user should  select  option
+  numbers  and  the  current  documentation  will  be given. Note that
+  option 0 gives an introduction to the program, and that ?  will  get
+  help  from  anywhere  in  the  program.  The following functions are
+  included:
+ @2. TX 0 @ Quit
+
+        This function stops the program.
+ @3. TX 1 @ Read a new sequence
+
+        This option allows users to  read  in  new  sequences,  browse
+  through  annotations,  or  search  sequence  libraries for keywords.
+  Sequences can  be  read  from  "personal"  sequence  files  or  from
+  sequence  libraries. These are referred to as the sequence "source".
+  Personal files can be stored in several formats:  Staden, PIR, EMBL,
+  GENBANK  and  GCG.  At LMB we use "Staden" format for sequencing and
+  all the libraries  are  stored  in  their  original  formats.  Note,
+  however,  that  libraries  such  as EMBL or GenBank that are divided
+  into several files (eg GenBank has 13 separate files) are indexed as
+  a  whole.  This  means  that  users  do  not need to know which file
+  contains an entry, only which library.  When  the  user  selects  to
+  read in a sequence the program first asks for the sequence "source".
+
+        If the user selects "personal" the program will  ask  for  the
+  format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of
+  the file. For PIR format the user will also be required to know  the
+  entry  name of the sequence as the file can contain several. For the
+  other formats only a single entry is  expected.  The  file  will  be
+  read,  its  length  and composition will be displayed and the option
+  left.
+
+        If the user selects  "library"  as  the  sequence  source  the
+  program will display a list of available libraries. The programs are
+  capable of  handling  all  current  libraries  but  which  ones  are
+  available  will  vary  from  site  to  site.  At LMB we have several
+  libraries and also weekly updates of data gathered between releases.
+  The  program will ask users to select a library and then give a list
+  of options:
+
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entrynames from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+
+  If get a sequence or get annotations is selected users will be asked
+  to  type  the entry name. The option will be left when a sequence is
+  selected  or  !  is  typed.  The  composition  and  length  will  be
+  displayed.
+
+        The  text  index  contains  all  words  from  feature  tables,
+  reference  titles, definition lines, keywords lists and comments, so
+  the text index search is most useful. It is also the fastest. Up  to
+  5  words  can  be  searched  for  at once. The words should be typed
+  separated by spaces, for example
+   ? Keywords=P53 mouse murine tumo
+
+  will search for all entries that contain words  starting  with  p53,
+  mouse,  murine  and  tumo.  Only the unique entries that contain ALL
+  words will be  listed.  Before  listing  the  matching  entries  the
+  program  will  show  the number of 'hits' for each word and ring the
+  bell.  Escape is possible at this point, or after each screenfull of
+  entries.   In  addition  to the entry names the text search displays
+  the primary accession number, the  sequence  length  and  up  to  80
+  characters of description.  (The search of 'titles' is now redundant
+  because the full text index contains all the  title  words  and  the
+  search  is  much  faster.  It  will  probably  be  removed  from the
+  program.)  All searches are independent of case. Where possible  the
+  program will offer default entry names.
+
+        Typical dialogue follows.
+  Select sequence source
+  X  1 Personal file
+     2 Sequence library
+  ? Selection  (1-2) (1) =
+  Select sequence file format
+  X  1 Staden
+     2 EMBL
+     3 GenBank
+     4 PIR
+     5 GCG
+  ? Selection  (1-5) (1) =
+  ? Sequence file name=M13MP7.SEQ
+   Contig title removed
+  Sequence length=  7238
+   Sequence composition
+            T          C          A          G          -
+        2405.      1539.      1765.      1527.         2.
+          33.2%      21.3%      24.4%      21.1%       0.0%
+    .
+    .
+    .
+
+
+   Select sequence source
+   X  1 Personal file
+      2 Sequence library
+   ? Selection  (1-2) (1) =2
+   Select a library
+   X  1 EMBL 29 nucleotide library Dec 91
+      2 SWISSPROT 20 protein library Nov 91
+      3 PIR 31 protein library Dec 91
+      4 NRL3D 58 From Brookhaven protein library Dec 91
+      5 GenBank
+   ? Selection  (1-5) (1) =
+  Library is in EMBL format with indexes
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =5
+   Search for keywords
+   ? Keywords=P53 mouse
+  P53 hits  68
+  MOUSE hits  8180
+
+   MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+   MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+   MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+   MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+   MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+   MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+   MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+   MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+   MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+   MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+   MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+   MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+   MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+   MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+   MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+   MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+   MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+   MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+   MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+   MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+   MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+   MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+        22 different entries found
+
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =4
+   Search for keywords
+   ? Keywords=alpha
+   Searching for alpha
+   AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+   AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+   AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+   AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+   ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+   ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+   ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+   ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+   AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+   AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+   AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+   AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+   AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+   AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+   AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+   AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =3
+   ? Accession number=v00636
+  Entry name LAMBDA
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =2
+   Default Entry name=LAMBDA
+   ? Entry name=
+  ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+  XX
+  AC   V00636; J02459; M17233; X00906;
+  XX
+  DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+  DT   09-JUN-1982 (Rel. 1, Created)
+  XX
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+  XX
+  KW   circular; coat protein; DNA binding protein; genome;
+  KW   origin of replication.
+  XX
+  OS   Bacteriophage lambda
+  OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+  XX
+  RN   [1]
+  RP   1-48502
+  RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+  RT   "Nucleotide sequence of bacteriophage lambda DNA";
+  RL   J. Mol. Biol. 162:729-773(1982).
+  XX
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =
+   Default Entry name=LAMBDA
+   ? Entry name=
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+   Sequence length  48502
+   Sequence composition
+             T          C          A          G          -
+        11988.     11360.     12336.     12818.         0.
+           24.7%      23.4%      25.4%      26.4%       0.0%
+
+ @4. TX 1 @ Define active region
+
+        For its analytic functions  the  program  always  works  on  a
+  region  of  the  sequence  called the "active region". This function
+  allows the start and end points of the active region to be reset.
+
+        Define  the required start and end points.
+
+        When a new sequence is read into the program the active region
+  is  automatically  set to start at the beginning of the sequence and
+  extend  to the maximum the program can handle. On most machines this
+  will  be  to the end of the sequence. The positions are shown on the
+  screen.  Note that for convenience, in the listing  and  translation
+  functions,  the  user  is given access to regions outside the active
+  region.
+ @5. TX 1 @ List a sequence
+
+        The sequence can be listed single or double stranded with line
+  lengths from 10 to 120 in multiples of 10.
+
+        Define the region to list, the line length required and choose
+  between a single or double stranded display.  The output looks like:
+
+    GTTAATGTAG CTTAATAACA AAGCAAAGCA CTGAAAATGC TTAGATGGAT
+    CAATTACATC GAATTATTGT TTCGTTTCGT GACTTTTACG AATCTACCTA
+            10         20         30         40         50
+
+    AATTGTATCC CATAAACACA AAGGTTTGGT CCTGGCCTTA TAATTAATTA
+    TTAACATAGG GTATTTGTGT TTCCAAACCA GGACCGGAAT ATTAATTAAT
+            60         70         80         90        100
+
+    GAGGTAAAAT TACACATGCA AACCTCCATA GACCGGTGTA AAATCCCTTA
+    CTCCATTTTA ATGTGTACGT TTGGAGGTAT CTGGCCACAT TTTAGGGAAT
+           110        120        130        140        150
+
+    AACATTTACT TAAAATTTAA GGAGAGGGTA TCAAGCACAT TAAAATAGCT
+    TTGTAAATGA ATTTTAAATT CCTCTCCCAT AGTTCGTGTA ATTTTATCGA
+           160        170        180        190        200
+
+ @6. TX 1 @ List a text file.
+
+        Allows the user to have a text file displayed on  the  screen.
+  It will appear one page at a time.
+
+        Supply the name of the file to be displayed.
+ @7. TX 1 @ Direct output to disk
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @8. TX 1 @ Write active region to disk
+
+        Used to write the current active section of sequence to a disk
+  file in "Staden format".
+
+        Supply a file name and an optional title.
+
+        The program has the capability of reading sequences stored  in
+  several formats and so, in conjunction with this option, can be used
+  to reformat them.
+ @9. TX 1 @ Edit the sequence
+
+        Used to edit sequences or any other files by giving access  to
+  the  computers  system  editor. For editing sequences the input file
+  should have already been created using one of the listing  functions
+  such  as  "list  sequence",  "list translation" or "list restriction
+  sites above the sequence".
+
+        Supply the name of the file to edit.  Wait  while  the  system
+  editor  is  made  ready  (can take awhile on a vax). Use the editor.
+  Exit from the editor. If a sequence has been edited, and you want to
+  process  it,  affirm  that the sequence should be "made active". The
+  edited sequence will replace the original sequence.
+
+        This editing method is designed to give  users  access  to  an
+  editor with which they are familiar - i.e. the one on their machine,
+  and yet to allow them to edit a  sequence  which  contains  all  the
+  landmarks  they  need  in  order  to  know where they are. Users can
+  create files  containing  simple  listings  (single  stranded)  with
+  numbering,  using "list the sequence", and then edit them with their
+  system editor, using the numbering to know where they are within the
+  sequence.  When the edits are complete they exit from the editor and
+  the program "analyses" the edited file to extract only the  sequence
+  characters.  Similarly  a file containing a three phase tranlslation
+  can be edited, or a file containing a sequence plus its three  phase
+  translation,  plus  its restriction sites marked above the sequence.
+  In order to be able  to  "analyse"  such  complicated  listings  and
+  correctly  extract  the  sequence the following simple rule is used:
+  all lines in the file that contain a character that is  not  A,C,T,G
+  or U are deleted. It is obviously important to be aware of this rule
+  and its implications.
+ @10. TX 2 @ Clear graphics
+
+        Clears graphics from the screen.
+ @11. TX 2 @ Clear text
+
+        Clears  text from the screen.
+ @12. TX 2 @ Draw a ruler
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first base  to  be
+  marked  (for  example if the active region is 1501 to 8000, the user
+  might wish to mark every 1000th base starting at either 1501 or 2000
+  -  it  depends  if  the user wishes to treat the active region as an
+  independent unit with its own numbering starting at its  left  edge,
+  or  as  part  of  the  whole sequence). The user can also define the
+  separation of the ticks on the scale and their height.  If  required
+  the labelling routine can be used to add numbers to the ticks.
+ @13. TX 2 @ Use crosshair
+
+        This function puts a steerable cross on the screen that can be
+  used to find the coordinates of points in the sequence. The user can
+  move the cross around using the directional keys; when he  hits  the
+  space bar the program will print out the coordinates of the cross in
+  sequence units and the option will be exited.
+
+        If instead, you hit a , the position will be displayed but the
+  cross will remain on the screen.
+
+        If a letter s is hit the program  will  display  the  sequence
+  around the crosshair position, and leave the cross on the screen.
+ @14. TX 2 @ Reposition plots
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The default window positions are read from a file "NIPMARG" when the
+  program is started. Users can have their own file if  required.   As
+  all  the  plots  start  at  the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. The cross-
+  hair can be used to choose suitable heights.
+ @15. TX 2 @ Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+
+        Typical dialogue follows.
+  ? Menu or option number=15
+  Type label then drive cross hair to left or right end
+  of label position then hit  "L"  to  write label left
+  justified or  "R"  to  write label right justified or
+  the space bar to quit
+
+
+  ? Label=delta gene
+
+   missing graphics
+
+  ? Label=
+
+ @16. TX 2 @Display a map
+
+        This draws a map of any  sequence  features  selected  by  the
+  user.   These  features  may  be  protein coding regions (CDS), tRNA
+  genes (TRNA), promoter positions (PRM), etc. Users may define  their
+  own  feature  table  key  names. For example I find it convenient to
+  split CDS lines into CDS1, CDS2 and CDS3 each of which contains only
+  those  sequences  that  code in the reading frames 1, 2 or 3. Then I
+  can plot them at different heights on the screen ( suitable  heights
+  can be determined by using the cross-hair).
+
+        The coordinates must be stored in a file in the format  of  an
+  EMBL  or  GenBank  feature table. Note that this means that the file
+  must include either EMBL or GenBank headers, and a suitable  "tail".
+  The simplest header is the word FEATURES starting in column 1 of the
+  first line of the file. The simplest tail is 2 empty  lines  at  the
+  end  of  the  file. These lines are not included when nip writes out
+  results in feature table format.
+
+        Typical dialogue follows.
+  ? Menu or option number=16
+   Display a map using an EMBL feature table file
+  ? map file name=hsegl1.ft
+  ? feature code(e.g. CDS) =CDS
+  X 1 + strand
+    2 - strand
+    3 both strands
+  ? 0,1,2,3 =
+  ? level (0-9480) (256) =4000
+
+   missing graphics
+
+  ? feature code(e.g. CDS) =
+
+ @17. TX 1 @ Search for restriction enzymes
+
+        This routine is used  to  search  for  short  sequences,  like
+  restriction  enzyme  recognition sequences, and can either list  the
+  results or present them graphically. Listings can take several forms
+  and can include the sequence and its translation. Examples are given
+  below. The program will also display the names of enzymes  that  cut
+  the  sequence  infrequently.  Users  can select from sets of enzymes
+  stored in files or can enter them from the keyboard.
+
+        The short sequences (strings)  and  their  names  need  to  be
+  arranged  in  a particular way. See below. Select to search, list an
+  enzyme file or clear the screen. Choose either a file of enzymes  or
+  to  enter  their  recognition  sequences  at the keyboard. Choose to
+  search for all the enzymes in the list or to select from  the  list.
+  Select  a mode of output. Define the sequence as circular or linear.
+  Select to search for "definite" or "possible"  matches.  The  search
+  starts,  and after the results have been displayed, further searches
+  can be performed.
+
+        When the enzymes and their recognition sequences are stored in
+  a  file  they  must  be  defined  in  the following way. We call the
+  recognition sequences "strings".  The format  is  as  follows:  each
+  string  or  set  of  strings must be preceded by a name, each string
+  must be preceded and terminated with a slash (/), and  each  set  of
+  strings  by  2 slashes. For example AATII/GACGT'C// defines the name
+  AATII, its recognition sequence GACGTC and its cut site with  the  '
+  symbol;  ACCI/GT'MKAC//  defines  the  name ACCI and its recognition
+  sequence includes IUB symbols for incompletely  defined  symbols  in
+  nucleic   acid  sequences;  BBVI/GCAGCNNNNNNNN'/'NNNNNNNNNNNNGCTGC//
+  defines the name BBVI and this time two  recognition  sequences  and
+  cut  sites  are  specified  in  order  to correctly show the cutting
+  position relative to the recognition sequence. If  no  cut  site  is
+  included  the first base of the recognition sequence is displayed as
+  being on the 3' side of the recognition sequence.
+
+        These collections of strings and their names can be read  from
+  disk  or  entered  from  the  keyboard.   When names and strings are
+  entered from the keyboard the program will ask for the name and then
+  the  string(s).  If more than one string is typed per name they must
+  be separated by slash (/) characters.  See  the  "Typical  dialogue"
+  below.    Three  files  containing  restriction  enzyme  recognition
+  sequences are currently available. The "all enzymes"  file  contains
+  the  Rich  Roberts  REBASE  restriction  enzyme  database,  which is
+  updated monthly.
+
+        The user can select strings by name from these collections. If
+  so  the  program  will prompt for the names, one at a time. The user
+  can continue to select names until a blank name is entered  (by  the
+  user typing only return).
+
+        Listed output can be displayed in  several  ways:  it  can  be
+  ordered  enzyme by enzyme, or on cut positions, or with enzyme names
+  written above a listing of the sequence. This last listing can  also
+  include  a  three phase translation of the sequence. In addition the
+  program will display only infrequent cutters (the user  defines  the
+  minimum number of cuts), or can plot the positions of matches.
+
+        Listings sorted "enzyme by enzyme" have the following form:
+
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 AATII                 GACGT'C                  112     111     111
+                                                              912     912
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 ACCI                  GT'CGAC                  112     111     111
+     2 ACCI                  GT'AGAC                  420     308     308
+                                                              604     604
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 AHAII                 GA'CGTC                  109     108      90
+     2 AHAII                 GG'CGTC                  199      90     108
+                                                              825     825
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 AVAII                 G'GACC                    84      83      51
+     2 AVAII                 G'GTCC                   973     889      83
+                                                               51     889
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BALI                  TGG'CCA                  258     257     257
+                                                              766     766
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BAMHI                 G'GATCC                   92      91      91
+
+     ......   etc
+
+  Listings sorted on cut position have the following form:
+
+   Searching
+       Name                  Sequence            Position  Fragment lengths
+     1 ECORI                 G'AATTC                    2       1
+     2 BANI                  G'GTGCC                   26      24
+     3 BSP1286               GTGCC'C                   31       5
+     4 BBVI                  'TACTGCGCCGCAGCTGC        38       7
+     5 NSPBII                CAG'CTG                   51      13
+     6 PVUII                 CAG'CTG                   51       0
+     7 BBVI                  GCAGCTGCTGGTG'            60       9
+     8 HINCII                GTC'AAC                   80      20
+     9 AVAII                 G'GACC                    84       4
+    10 BINI                  'CCAGGGATCC               87       3
+    11 BSTNI                 CC'AGG                    89       2
+    12 BAMHI                 G'GATCC                   92       3
+    13 XHOII                 G'GATCC                   92       0
+    14 NSPBII                CCG'CTG                   98       6
+    15 BINI                  GGATCCGCT'               100       2
+    16 AHAII                 GA'CGTC                  109       9
+    17 SALI                  G'TCGAC                  111       2
+    18 AATII                 GACGT'C                  112       1
+    19 ACCI                  GT'CGAC                  112       0
+    20 HINCII                GTC'GAC                  113       1
+    21 BBVI                  GCAGCGACTGATT'           166      53
+    22 BINI                  'ACTCAGATCC              178      12
+    23 XHOII                 A'GATCC                  183       5
+    24 HGAI                  'GGCGGCGGAGGCGTC         188       5
+
+    .....etc
+
+  Lists of infrequent cutters have the following form:
+
+       0 AFLII
+       0 AFLIII
+       0 APAI
+       0 APALI
+       0 ASUII
+       0 AVAI
+       0 AVRII
+       0 BCLI
+       0 BGLI
+       0 BGLII
+       0 BSMI
+       0 BSPMII
+       0 BSTEII
+    ...... etc
+
+   Listings showing names above the sequence, and a translation have the
+  following form:
+
+
+   ECORI                   BANI BSP1286
+   .                       .    .      BBVI         NSPBII
+   .                       .    .      .            PVUII    BBVI
+  GAATTCGGTTTGGGCTTGGTGTGAGGTGCCCAGAGATTACTGCGCCGCAGCTGCTG
+  GTGC
+          10        20        30        40        50        60
+   E  F  G  L  G  L  V  *  G  A  Q  R  L  L  R  R  S  C  W  C
+    N  S  V  W  A  W  C  E  V  P  R  D  Y  C  A  A  A  A  G  A
+     I  R  F  G  L  G  V  R  C  P  E  I  T  A  P  Q  L  L  V  L
+
+                     HINCII
+                     .   AVAII
+                     .   .  BINI
+                     .   .  . BSTNI
+                     .   .  . .  BAMHI
+                     .   .  . .  XHOII NSPBII
+                     .   .  . .  .     . BINI     AHAII
+                     .   .  . .  .     . .        . SALI
+                     .   .  . .  .     . .        . .AATII
+                     .   .  . .  .     . .        . .ACCI
+                     .   .  . .  .     . .        . ..HINCII
+  TGGCGGTGCGGAGGTCGTCAACGGACCCAGGGATCCGCTGGACGAGGACGTCGACG
+  ACGA
+          70        80        90       100       110       120
+   W  R  C  G  G  R  Q  R  T  Q  G  S  A  G  R  G  R  R  R  R
+    G  G  A  E  V  V  N  G  P  R  D  P  L  D  E  D  V  D  D  E
+     A  V  R  R  S  S  T  D  P  G  I  R  W  T  R  T  S  T  T  R
+
+                                               BBVI        BINI
+  GGAGGAGGTGGATAGCGCATTGCTGGTGGCTGGCAGCGACTGATTTGAGTTCTGAC
+  CACT
+         130       140       150       160       170       180
+   G  G  G  G  *  R  I  A  G  G  W  Q  R  L  I  *  V  L  T  T
+    E  E  V  D  S  A  L  L  V  A  G  S  D  *  F  E  F  *  P  L
+     R  R  W  I  A  H  C  W  W  L  A  A  T  D  L  S  S  D  H  S
+
+    XHOII
+    .    HGAI       AHAII                      PFIMI
+    .    .          .                          .   BBVI
+  CAGATCCGGCGGCGGAGGCGTCGAGGCTCCCGAAACTCCCAGTGGCTGGCCTGCTA
+  GATT
+         190       200       210       220       230       240
+   Q  I  R  R  R  R  R  R  G  S  R  N  S  Q  W  L  A  C  *  I
+    R  S  G  G  G  G  V  E  A  P  E  T  P  S  G  W  P  A  R  F
+     D  P  A  A  E  A  S  R  L  P  K  L  P  V  A  G  L  L  D  S
+
+     .........etc
+
+
+        The terms "possible" and "definite" matches are important only
+  for  back  translations  of  protein into DNA, and which include IUB
+  redundancy codes.  Those matches that the  program  terms  "definite
+  matches"  and are ones in which the specification of the recognition
+  sequence corresponds exactly to that of the  back  translation,  and
+  consequently  are  definitely  in the DNA sequence. The program will
+  also find what it terms  'possible  matches'  which  are  ones  that
+  depend  on  the particular codons chosen for each amino acid.  These
+  are sites at which recognition  sequences  could  be  engineered  to
+  produce  a cut in the DNA without changing the amino acid, but which
+  are not necessarily found in the original sequence.
+
+        The routine will handle both linear  and  circular  sequences,
+  and  so  finds  cutsites  spanning the "ends" of circular sequences.
+  The program will only find cutsites spanning the ends  of  sequences
+  if  the  sequence  is declared as circular.  This includes sites for
+  recognition sequences containing leading or trailing N  symbols,  in
+  which  the  actual  recognition sequence does not span the join. For
+  example if the recognition sequence was 'NNNNACGT and  the  first  4
+  characters  in  the sequence were ACGT, then the match would only be
+  found if the sequence was declared as circular. If the  sequence  is
+  linear then the first fragment starts at base number 1, and the last
+  ends at the last base. If the sequence is circular then  the  length
+  of the first fragment is the clockwise distance from the last cut to
+  the first.
+
+        Graphical output marks the position of each string by a  short
+  vertical  line  and  gives the name of the enzyme at the left end of
+  the line. If the top of the screen is reached the program gives  the
+  user  the  oportunity  to  take  a hard copy and then will clear the
+  screen and restart plotting results at the original start position.
+
+        Below is an edited piece of dialogue from use  of  the  search
+  option:
+  ? Menu or option number=17
+
+  Search for restriction enzyme sites
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 = 2
+
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =
+
+  AATII/GACGT'C//
+  ACCI/GT'MKAC//
+  AFLII/C'TTAAG//
+  AFLIII/A'CRYGT//
+  AHAII/GR'CGYC//
+  APAI/GGGCC'C//
+  APALI/G'TGCAC//
+  ASUII/TT'CGAA//
+  AVAI/C'YCGRG//
+  AVAII/G'GWCC//
+  AVRII/C'CTAGG//
+  BALI/TGG'CCA//
+  BAMHI/G'GATCC//
+  BANI/G'GYRCC//
+  BANII/GRGCY'C//
+  BBVI/GCAGCNNNNNNNN'/'NNNNNNNNNNNNGCTGC//
+  BCLI/T'GATCA//
+  BGLI/GCCNNNN'NGGC//
+  BGLII/A'GATCT//
+  BINI/GGATCNNNN'/'NNNNNGATCC//
+  BSMI/GAATGCN'/NG'CATTC//
+  BSP1286/GDGCH'C//
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =
+  ? (y/n) (y) Search for all names
+  X 1 Order results enzyme by enzyme
+    2 Order results by position
+    3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 =
+  ? (y/n) (y) List matches
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+
+   Searching
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 AATII                 GACGT'C                  112     111     111
+                                                              912     912
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 ACCI                  GT'CGAC                  112     111     111
+     2 ACCI                  GT'AGAC                  420     308     308
+                                                              604     604
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 AHAII                 GA'CGTC                  109     108      90
+     2 AHAII                 GG'CGTC                  199      90     108
+                                                              825     825
+   Matches found=     2
+       Name                  Sequence            Position  Fragment lengths
+     1 AVAII                 G'GACC                    84      83      51
+     2 AVAII                 G'GTCC                   973     889      83
+                                                               51     889
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BALI                  TGG'CCA                  258     257     257
+                                                              766     766
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BAMHI                 G'GATCC                   92      91      91
+                                                              932     932
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BANI                  G'GTGCC                   26      25      25
+                                                              998     998
+   Matches found=     1
+       Name                  Sequence            Position  Fragment lengths
+     1 BANII                 GAGCC'C                  490     489     489
+                                                              534     534
+   Matches found=    11
+       Name                  Sequence            Position  Fragment lengths
+     1 BBVI                  'TACTGCGCCGCAGCTGC        38      37       3
+     2 BBVI                  GCAGCTGCTGGTG'            60      22      22
+     3 BBVI                  GCAGCGACTGATT'           166     106      28
+     4 BBVI                  'CCTGCTAGATTCGCTGC       230      64      37
+     5 BBVI                  GCAGCGGTACGTA'           452     222      50
+     6 BBVI                  'CTCGCCAACGTTGCTGC       502      50      55
+     7 BBVI                  GCAGCCTTCAACT'           606     104      64
+     8 BBVI                  'GAGGTATTCCTGGCTGC       634      28      97
+     9 BBVI                  'CTGGCCGCCGCCGCTGC       869     235     104
+    10 BBVI                  'GCCGCCGCCGCTGCTGC       872       3     106
+    11 BBVI                  GCAGCGATGAGGA'           927      55     222
+
+    ....etc
+
+   X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =
+
+  ? (y/n) (y) Search for all names
+
+  X 1 Order results enzyme by enzyme
+    2 Order results by position
+    3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 = 2
+
+  ? (y/n) (y) List matches
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+
+   Searching
+       Name                  Sequence            Position  Fragment lengths
+     1 ECORI                 G'AATTC                    2       1
+     2 BANI                  G'GTGCC                   26      24
+     3 BSP1286               GTGCC'C                   31       5
+     4 BBVI                  'TACTGCGCCGCAGCTGC        38       7
+     5 NSPBII                CAG'CTG                   51      13
+     6 PVUII                 CAG'CTG                   51       0
+     7 BBVI                  GCAGCTGCTGGTG'            60       9
+     8 HINCII                GTC'AAC                   80      20
+     9 AVAII                 G'GACC                    84       4
+    10 BINI                  'CCAGGGATCC               87       3
+    11 BSTNI                 CC'AGG                    89       2
+    12 BAMHI                 G'GATCC                   92       3
+    13 XHOII                 G'GATCC                   92       0
+    14 NSPBII                CCG'CTG                   98       6
+    15 BINI                  GGATCCGCT'               100       2
+    16 AHAII                 GA'CGTC                  109       9
+    17 SALI                  G'TCGAC                  111       2
+    18 AATII                 GACGT'C                  112       1
+    19 ACCI                  GT'CGAC                  112       0
+    20 HINCII                GTC'GAC                  113       1
+
+    .....etc
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =
+
+  ? (y/n) (y) Search for all names
+
+    1 Order results enzyme by enzyme
+  X 2 Order results by position
+    3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 =3
+  ? Maximum number of cuts (0-100) (0) =
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+
+   Searching
+       0 AFLII
+       0 AFLIII
+       0 APAI
+       0 APALI
+       0 ASUII
+       0 AVAI
+       0 AVRII
+       0 BCLI
+       0 BGLI
+       0 BGLII
+       0 BSMI
+       0 BSPMII
+       0 BSTEII
+       0 CLAI
+       0 DRAI
+       0 DRAII
+       0 ECOB
+       0 ECOK
+       0 ECORV
+       0 ESPI
+
+     ......etc
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =
+
+  ? (y/n) (y) Search for all names
+
+    1 Order results enzyme by enzyme
+    2 Order results by position
+  X 3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 =4
+  ? (y/n) (y) Hide translation n
+  ? (y/n) (y) Use 1 letter codes
+  ? Line length (30-90) (60) =
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+
+   Searching
+   ECORI                   BANI BSP1286
+   .                       .    .      BBVI         NSPBII
+   .                       .    .      .            PVUII    BBVI
+  GAATTCGGTTTGGGCTTGGTGTGAGGTGCCCAGAGATTACTGCGCCGCAGCTGCTG
+  GTGC
+          10        20        30        40        50        60
+   E  F  G  L  G  L  V  *  G  A  Q  R  L  L  R  R  S  C  W  C
+    N  S  V  W  A  W  C  E  V  P  R  D  Y  C  A  A  A  A  G  A
+     I  R  F  G  L  G  V  R  C  P  E  I  T  A  P  Q  L  L  V  L
+
+                     HINCII
+                     .   AVAII
+                     .   .  BINI
+                     .   .  . BSTNI
+                     .   .  . .  BAMHI
+                     .   .  . .  XHOII NSPBII
+                     .   .  . .  .     . BINI     AHAII
+                     .   .  . .  .     . .        . SALI
+                     .   .  . .  .     . .        . .AATII
+                     .   .  . .  .     . .        . .ACCI
+                     .   .  . .  .     . .        . ..HINCII
+  TGGCGGTGCGGAGGTCGTCAACGGACCCAGGGATCCGCTGGACGAGGACGTCGACG
+  ACGA
+          70        80        90       100       110       120
+   W  R  C  G  G  R  Q  R  T  Q  G  S  A  G  R  G  R  R  R  R
+    G  G  A  E  V  V  N  G  P  R  D  P  L  D  E  D  V  D  D  E
+     A  V  R  R  S  S  T  D  P  G  I  R  W  T  R  T  S  T  T  R
+
+                                               BBVI        BINI
+  GGAGGAGGTGGATAGCGCATTGCTGGTGGCTGGCAGCGACTGATTTGAGTTCTGAC
+  CACT
+         130       140       150       160       170       180
+   G  G  G  G  *  R  I  A  G  G  W  Q  R  L  I  *  V  L  T  T
+    E  E  V  D  S  A  L  L  V  A  G  S  D  *  F  E  F  *  P  L
+     R  R  W  I  A  H  C  W  W  L  A  A  T  D  L  S  S  D  H  S
+
+   .......etc
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =5
+  Define search strings by typing a string name
+  followed by the string(s)
+  ? Name=FRED
+  ? String(s)=AAAAAA/TTTTTT
+  ? Name=MARY
+  ? String(s)=CCCC/GGGG/GCGCT
+  ? Name=
+  ? (y/n) (y) Search for all names
+  X 1 Order results enzyme by enzyme
+    2 Order results by position
+    3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 =
+  ? (y/n) (y) List matches
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+   Searching
+   Matches found=     9
+       Name                  Sequence            Position  Fragment lengths
+     1 FRED                  'TTTTTT                 1557    1556       1
+     2 FRED                  'TTTTTT                 1558       1       1
+     3 FRED                  'TTTTTT                 1559       1       1
+     4 FRED                  'TTTTTT                 1560       1      22
+     5 FRED                  'AAAAAA                 1582      22     529
+     6 FRED                  'AAAAAA                 3160    1578    1019
+     7 FRED                  'AAAAAA                 4204    1044    1044
+     8 FRED                  'AAAAAA                 5691    1487    1487
+     9 FRED                  'AAAAAA                 6710    1019    1556
+                                                              529    1578
+   Matches found=    36
+       Name                  Sequence            Position  Fragment lengths
+     1 MARY                  'CCCC                     47      46       1
+     2 MARY                  'GGGG                    486     439       1
+     3 MARY                  'GGGG                    487       1       1
+     4 MARY                  'CCCC                    557      70       1
+     5 MARY                  'CCCC                    558       1       1
+     6 MARY                  'GCGCT                  1177     619       1
+
+    ... etc
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+    1 All enzymes
+  X 2 Six cutters
+    3 Four cutters
+    4 Personal file
+    5 Keyboard
+  ? 0,1,2,3,4,5 =5
+  Define search strings by typing a string name
+  followed by the string(s)
+  ? Name=JANE
+  ? String(s)=A'TTTT/CC'GGG
+  ? Name=
+  ? (y/n) (y) Search for all names
+  X 1 Order results enzyme by enzyme
+    2 Order results by position
+    3 Show only infrequent cutters
+    4 Show names above the sequence
+  ? 0,1,2,3,4 =
+  ? (y/n) (y) List matches
+  ? (y/n) (y) The sequence is linear
+  ? (y/n) (y) Search for definite matches
+   Searching
+   Matches found=    30
+       Name                  Sequence            Position  Fragment lengths
+     1 JANE                  A'TTTT                   437     436       6
+     2 JANE                  A'TTTT                   546     109      33
+     3 JANE                  A'TTTT                   597      51      43
+     4 JANE                  A'TTTT                   777     180      51
+     5 JANE                  A'TTTT                  1274     497      60
+     6 JANE                  A'TTTT                  1571     297      62
+     7 JANE                  CC'GGG                  1926     355      75
+     8 JANE                  A'TTTT                  2403     477      81
+     9 JANE                  A'TTTT                  2586     183      82
+    10 JANE                  A'TTTT                  2731     145     101
+    11 JANE                  A'TTTT                  2812      81     103
+
+   ... etc
+
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =!
+ @18. TX 1 7 @ Compare a short sequence
+
+        This  routine  slides  a  short  sequence  along  the  current
+  sequence  and finds all positions at which a given percentage of the
+  bases match.  Output is in both graphical and listed forms.
+
+        If  users call for dialogue when the routine is selected  they
+  will  be  given  the  choice  of  keyboard or file input. Define the
+  string, select the "sense" to use and the percentage match.  Matches
+  will  be  plotted  out  and  then  the  user can select to have them
+  listed. Then the routine cycles around.
+
+        The routine slides the search string along the   sequence  and
+  marks  the positions at which a minimum percentage score is reached.
+  The graphical output draws a vertical line at  the  match  position;
+  the  height  of the line represents the percentage score, so that if
+  the line reaches the top of the box the score is 100%.   The  NC-IUB
+  symbols  may  be  used  in  the  search sequence to encode uncertain
+  characters. Any other symbols will not match.
+
+
+              NC-IUB SYMBOLS
+
+        A,C,G,T
+        R        (A,G)        'puRine'
+        Y        (T,C)        'pYrimidine'
+        W        (A,T)        'Weak'
+        S        (C,G)        'Strong'
+        M        (A,C)        'aMino'
+        K        (G,T)        'Keto'
+        H        (A,T,C)      'not G'
+        B        (G,C,T)      'not A'
+        V        (G,A,C)      'not T'
+        D        (G,A,T)      'not C'
+        N        (G,A,C,T)    'aNy'
+
+   Typical dialogue is shown below.
+
+
+  ? Menu or option number=18
+   Find percentage matches
+  ? (y/n) (y) Keep picture
+  ? String=AAATTTCCC
+  STRING=AAATTTCCC
+  ? (y/n) (y) This sense
+  ? Percent match (1.00-100.00) (70.00) =
+
+   Missing graphics display here
+
+  Total scoring positions above 70.000 percent =   7
+  Scores         7      6      6      6      6      6      6
+  Positions    365    212    213    292    311    358    627
+  ? Display (0-7) (0) =3
+
+         365
+           ACATTTCGC
+           * ***** *
+           AAATTTCCC
+           1
+
+         212
+           GAAACTCCC
+            **  ****
+           AAATTTCCC
+           1
+
+         213
+           AAACTCCCA
+           *** * **
+           AAATTTCCC
+           1
+  ? (y/n) (y) Keep picture
+  Default String=AAATTTCCC
+  ? String=
+  STRING=AAATTTCCC
+  ? (y/n) (y) This sense n
+  STRING=GGGAAATTT
+  ? Percent match (1.00-100.00) (70.00) =
+
+   Missing graphics display here
+
+  Total scoring positions above 70.000 percent =   7
+  Scores         6      6      6      6      6      6      6
+  Positions    269    270    271    288    354    624    853
+  ? Display (0-7) (0) =3
+
+         269
+           GAGGGATTT
+           * *  ****
+           GGGAAATTT
+           1
+
+         270
+           AGGGATTTT
+            ** * ***
+           GGGAAATTT
+           1
+
+         271
+           GGGATTTTC
+           ****  **
+           GGGAAATTT
+           1
+  ? (y/n) (y) Keep picture !
+
+ @19. TX 7 @ Compare a short sequence using a score matrix
+
+        This  routine  slides  a  short  sequence  along  the  current
+  sequence  and  finds  all  positions  at  which  a  given  level  of
+  similarity (a cutoff score) is reached. The score is defined by  use
+  of a score matrix. Output is in both graphical and listed forms.
+
+        If  users call for dialogue when the routine is selected  they
+  will  be  given  the  choice  of  keyboard or file input. Define the
+  string, select the "sense" to use and the cutoff score. Matches will
+  be  plotted  out  and  then the user can select to have them listed.
+  Then the routine cycles around.
+
+        The routine slides the search string along the   sequence  and
+  marks  the  positions  at  which a the cutoff score is achieved. The
+  graphical output draws a vertical line at the  match  position;  the
+  height  of  the  line  represents  the   score,  so that if the line
+  reaches the top of the box the score is the maximum  possible.   The
+  NC-IUB  symbols  may  be  used  in  the  search  sequence  to encode
+  uncertain characters.
+
+        The score matrix reflects the level of redundancy in the probe
+  sequence  and  hence will put more emphasis on those characters that
+  are better defined. The score matrix is:
+               DNA SCORE MATRIX USING IUB SYMBOLS
+
+          T  C  A  G  -  R  Y  W  S  M  K  H  B  V  D  N  ?
+
+     T   36  0  0  0  9  0 18 18  0  0 18 12 12  0 12  9  0
+     C    0 36  0  0  9  0 18  0 18 18  0 12 12 12  0  9  0
+     A    0  0 36  0  9 18  0 18  0 18  0 12  0 12 12  9  0
+     G    0  0  0 36  9 18  0  0 18  0 18  0 12 12 12  9  0
+     -    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0
+     R    0  0 18 18 18 36  0  9  9  9  9  6  6 12 12 18  0
+     Y   18 18  0  0 18  0 36  9  9  9  9 12 12  6  6 18  0
+     W   18  0 18  0 18  9  9 36  0  9  9 12  6  6 12 18  0
+     S    0 18  0 18 18  9  9  0 36  9  9  6 12 12  6 18  0
+     M    0 18 18  0 18  9  9  9  9 36  0 12  6 12  6 18  0
+     K   18  0  0 18 18  9  9  9  9  0 36  6 12  6 12 18  0
+     H   12 12 12  0 27  6 12 12  6 12  6 36  8  8  8 27  0
+     B   12 12  0 12 27  6 12  6 12  6 12  8 36  8  8 27  0
+     V    0 12 12 12 27 12  6  6 12 12  6  8  8 36  8 27  0
+     D   12  0 12 12 27 12  6 12  6  6 12  8  8  8 36 27  0
+     N    9  9  9  9 36 18 18 18 18 18 18 27 27 27 27 36  0
+     ?    0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0  0
+
+    ? is any unrecognised character.
+
+    Typical dialogue is shown below.
+
+  ? Menu or option number=19
+   Find matches using a score matrix
+  ? (y/n) (y) Keep picture
+  ? String=AAATTTCCC
+  STRING=AAATTTCCC
+  ? (y/n) (y) This sense
+  Minimum score=     0 Maximum score=   324
+  ? Score (0-324) (280) =250
+
+   Missing graphics display here
+
+  For score   250 the number of matches=     1
+  Scores       252
+  Positions    365
+  ? Display (0-1) (0) =1
+
+         365
+           ACATTTCGC
+           * ***** *
+           AAATTTCCC
+           1
+  ? (y/n) (y) Keep picture
+  Default String=AAATTTCCC
+  ? String=
+  STRING=AAATTTCCC
+  ? (y/n) (y) This sense n
+  STRING=GGGAAATTT
+  Minimum score=     0 Maximum score=   324
+  ? Score (0-324) (222) = 200
+
+   Missing graphics display here
+
+  For score   200 the number of matches=     7
+  Scores       216    216    216    216    216    216    216
+  Positions    269    270    271    288    354    624    853
+  ? Display (0-7) (0) =3
+
+         269
+           GAGGGATTT
+           * *  ****
+           GGGAAATTT
+           1
+
+         270
+           AGGGATTTT
+            ** * ***
+           GGGAAATTT
+           1
+
+         271
+           GGGATTTTC
+           ****  **
+           GGGAAATTT
+           1
+  ? (y/n) (y) Keep picture !
+
+ @20. TX 7 @ Search for a motif using a weight matrix
+
+        This function performs  searches  for  short  sequence  motifs
+  using  an  appropriate  weight matrix. In addition it can be used to
+  create or modify weight matrices. In order to perform a  search  the
+  only  input  required  is the name of the file containing the weight
+  matrix.  The results can be presented  graphically  or  listed.  The
+  graphical presentation will draw line at the position of any matches
+  found; the height of the line is proportional to the score.
+
+        For a search, select "use weight matrix", supply the  name  of
+  the  file  containing  the  weight matrix, and choose between having
+  results plotted  or  listed.  If  dialogue  is  requested  when  the
+  function is selected users can alter the cutoff score employed.
+
+        To create a weight matrix several steps are involved.  A  file
+  containing an alignment of known motifs is required. (This file must
+  be created before the current option is selected. The  format  is  a
+  follows:  each  sequence is written on a separate line with at least
+  one space at the beginning; each sequence is terminated by  a  space
+  character,  and  can  be  followed  by a name. The sequences must be
+  aligned.) Supply the name of the  file  of  aligned  sequences.  The
+  program  reads  and  displays the sequences. Choose between "summing
+  logs of weights" or summing weights (i.e. whether to multiply or add
+  weights).  If  logs  are used all scores will be negative. Choose if
+  all positions in the set of aligned sequences should be used or if a
+  mask should be applied. If so selected, define a mask as a string of
+  symbols, in which symbol - means ignore and any other  symbol  means
+  use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+
+        The program will calculate weights as the frequencies of  each
+  base  at  each  unmasked  position  in the set of aligned sequences.
+  These weights are then applied to the set of  aligned  sequences  to
+  give  a range  of "observed" scores. The mean and standard deviation
+  of these scores is displayed. The user is asked  to  supply  several
+  values  to  be  used  when  the  weight  matrix  is applied to other
+  sequences: a cutoff score (by default, the  mean  minus  3  standard
+  deviations),  a top score for scaling graphical results (by default,
+  the mean plus 3 standard deviations), and  a  position  to  identify
+  (this  means that if a particular base within the motif is used as a
+  "landmark", such as the A of the AG in splice acceptor  sites,  then
+  its  position  will be marked in plots). All these values are stored
+  along with the weight matrix. Finally supply the name of a  file  to
+  contain the weight matrix.
+
+        Weight matrices can be  "rescaled"  using  a  set  of  aligned
+  sequences  in much the same ways as a matrix is created. The purpose
+  is to redefine the cutoff scores, and rescaling does not  alter  any
+  other values in the weight matrix file.
+
+        The methods have changed considerably but were first  outlined
+  in  Staden,  R.  Nucl.  Acid  Res.  12  505-519 1984, and Staden, R.
+  Genetic engineering: principles and methods vol 7,  Edited  by  J.K.
+  Setlow and A. Hollaender, Plenum publishing corp., 1985.
+
+        The methods have always had to deal with the problem of zeroes
+  in  the  matrices.  The  current  versions  employ  "Laplaces Law of
+  Succession" in which 1 is added to each term.
+
+        It is now possible to  apply  a  mask  to  a  set  of  aligned
+  sequences  in  order  to  give  weight  to  selected positions only.
+  Sequences have superimposed functions: some parts may be of  general
+  structural  importance  and  give  rise to an overall framework, and
+  other parts give specificity and hence are not common; we  may  want
+  to use a set of aligned sequences to define a motif, but want to use
+  only the framework positions.  Alternatively we may want to pick out
+  only  those  parts  of  a  set  of  aligned  sequences  that  give a
+  particular property, and to ignore other similarities that  are  due
+  to  some  other  property and which could obscure the pattern we are
+  interested in. The ability to define a mask allows certain positions
+  to  be  used  in  the  motif and others to be ignored, and yet still
+  permits the use of a set of aligned sequences to calculate weights.
+
+        Typical dialogue is shown below.
+
+  ? Menu or option number=20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =2
+  ? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+
+       1 AGCGTGACTCTTCCCGGAA HIS1
+       2 GAGGTGACTCACTTGGAAG HIS1
+       3 CGGATGACTCTTTTTTTTT HIS3
+       4 ACAGTGACTCACGTTTTTT HIS4
+       5 GTCGTGACTCATATGCTTT ARG3
+       6 TGAATGACTCACTTTTTGG ARG4
+       7 TTCTTGACTCGTCTTTTCT CPA1
+       8 CGAATGACTCTTATTGATG CPA2
+       9 AGAATGACTAATTTTACTA TRP5
+      10 TCGTTGACTCATTCTAATC TRP3
+      11 TTGCTGACTCATTACGATT TRP2
+      12 GAGATGACTCTTTTTCTTT IV1
+      13 GCGATGATTCATTTCTCTG IV2
+      14 TAGATGACTCAGTTTAGTC LEU1
+      15 TAAGTGACTCAGTTCTTTC LEU4
+      16 ATGATGACTCTTAAGCATG ILS1
+  Length of motif    19
+  ? (y/n) (y) Sum logs of weights
+
+  ? (y/n) (y) Use all motif positions n
+  x means use, - means ignore
+  e.g. xx-x---x-x means use positions 1,2,4,8,10
+  ? Mask=----XXXXXXXX
+   Applying weights to input sequences
+     1      -27.979 AGCGTGACTCTTCCCGGAA
+     2      -24.543 GAGGTGACTCACTTGGAAG
+     3      -20.890 CGGATGACTCTTTTTTTTT
+     4      -23.087 ACAGTGACTCACGTTTTTT
+     5      -22.771 GTCGTGACTCATATGCTTT
+     6      -23.408 TGAATGACTCACTTTTTGG
+     7      -25.159 TTCTTGACTCGTCTTTTCT
+     8      -22.679 CGAATGACTCTTATTGATG
+     9      -24.751 AGAATGACTAATTTTACTA
+    10      -23.157 TCGTTGACTCATTCTAATC
+    11      -23.067 TTGCTGACTCATTACGATT
+    12      -21.449 GAGATGACTCTTTTTCTTT
+    13      -24.191 GCGATGATTCATTTCTCTG
+    14      -23.770 TAGATGACTCAGTTTAGTC
+    15      -22.923 TAAGTGACTCAGTTCTTTC
+    16      -25.285 ATGATGACTCTTAAGCATG
+  Top score     -20.890  Bottom score     -27.979
+  Mean     -23.694  Standard deviation       1.613
+  Mean minus 3.sd     -28.534  Mean plus 3.sd     -18.854
+  ? Cutoff score (-999.00-9999.00) (-28.53) =
+  ? Top score for scaling plots (-28.53-999.00) (-18.85) =
+  ? Position to identify (0-19) (1) =
+  ? Title=GCN4 SEQUENCES
+  ? Name for new weight matrix file=1.WTS
+
+
+  ? Menu or option number=20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =3
+  ? Name of existing weight matrix file=1.WTS
+   GCN4 SEQUENCES
+  ? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+  Length of motif    19
+  ? (y/n) (y) Sum logs of weights n
+  ? (y/n) (y) Use all motif positions
+
+   Applying weights to input sequences
+     1      128.000 AGCGTGACTCTTCCCGGAA
+     2      148.000 GAGGTGACTCACTTGGAAG
+     3      172.000 CGGATGACTCTTTTTTTTT
+     4      160.000 ACAGTGACTCACGTTTTTT
+     5      161.000 GTCGTGACTCATATGCTTT
+     6      157.000 TGAATGACTCACTTTTTGG
+     7      149.000 TTCTTGACTCGTCTTTTCT
+     8      160.000 CGAATGACTCTTATTGATG
+     9      151.000 AGAATGACTAATTTTACTA
+    10      159.000 TCGTTGACTCATTCTAATC
+    11      158.000 TTGCTGACTCATTACGATT
+    12      169.000 GAGATGACTCTTTTTCTTT
+    13      152.000 GCGATGATTCATTTCTCTG
+    14      157.000 TAGATGACTCAGTTTAGTC
+    15      160.000 TAAGTGACTCAGTTCTTTC
+    16      143.000 ATGATGACTCTTAAGCATG
+  Top score     172.000  Bottom score     128.000
+  Mean     155.250  Standard deviation      10.034
+  Mean minus 3.sd     125.147  Mean plus 3.sd     185.353
+  ? Cutoff score (-999.00-9999.00) (125.15) =
+  ? Top score for scaling plots (125.15-999.00) (185.35) =
+  ? Position to identify (0-19) (1) =
+  ? Title=GCN4 SEQUENCES
+  ? Name for new weight matrix file=2.WTS
+
+
+  ? Menu or option number=20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =
+  ? Motif weight matrix file=1.WTS
+   GCN4 SEQUENCES
+  ? (y/n) (y) Plot results n
+
+      153    -22.61 GCAGCGACTGATTTGAGTT
+      169    -28.53 GTTCTGACCACTCAGATCC
+      172    -27.27 CTGACCACTCAGATCCGGC
+      219    -27.35 CCAGTGGCTGGCCTGCTAG
+      268    -27.82 CGAGGGATTTTCGATCTTG
+      274    -26.99 ATTTTCGATCTTGTGGATG
+      283    -25.79 CTTGTGGATGATTTTCACG
+      287    -27.50 TGGATGATTTTCACGTGCG
+      298    -28.17 CACGTGCGCCGTCATATTG
+      332    -28.27 TCTTTGAAGCAGAAGGGAC
+      351    -28.27 AGGGGTACACTTTCACATT
+      357    -25.05 ACACTTTCACATTTCGCTT
+      364    -28.51 CACATTTCGCTTATGGGAG
+      400    -23.77 GAAGTTACTAATGTGCGTG
+      451    -26.22 ATGCTCGCCCTCTTTGGTG
+      476    -28.00 TCCCTCACTGAGCCCTCCG
+      480    -28.33 TCACTGAGCCCTCCGCCTC
+      517    -23.46 GCTAAGATTCAGCTTGGTT
+      556    -27.27 TCCAGCACTCAGGTTCGGC
+      602    -27.01 AACTTGAATCCATCGTTGC
+      648    -28.45 TGCTAAACACAGCCGGTTT
+      679    -28.18 CTGTTTGCCCAGTTTGGGC
+      691    -28.51 TTTGGGCCGCTTCTGGACG
+      713    -27.67 GGCTTGACCGTGGCTGTGG
+      803    -25.47 ATGCTGACCATGCTTTTCA
+      848    -28.11 ATAATGTTAAGTTTGATTC
+      857    -25.97 AGTTTGATTCCGCTGGCCG
+      879    -27.85 CCGCTGCTGCTGTTTCCAC
+      917    -27.77 GCGATGAGGAAGGCTTGTT
+      931    -27.81 TTGTTGGCGCGCCTGCTCG
+      952    -23.52 GAGGTGACTACCATCCGTG
+      977    -28.40 TGCGTGGGTGAGCTGTTGT
+
+
+
+
+  ? Menu or option number=6
+  Page through text files
+  ? Name of file to read=1.WTS
+   GCN4 SEQUENCES
+       19     1   -28.534   -18.854
+   P   0   1   2   3   4   5   6   7   8   9  10  11  12  13  14  15  16  17  18
+   N  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16  16
+  16
+   T   0   0   0   0  16   0   0   1  16   0   5  11  10  12   9   6   7  12   6
+   C   0   0   0   0   0   0   0  15   0  15   0   3   2   2   4   3   2   1   3
+   A   0   0   0   0   0   0  16   0   0   1  10   0   3   2   0   3   5   2   2
+   G   0   0   0   0   0  16   0   0   0   0   1   2   1   0   3   4   2   1   5
+  End of file
+
+ @21. TX 3 @ Count base composition
+
+        This routine calculates the base  composition  of  the  active
+  region of the sequence as both totals and percentages.
+ @22. TX 3 @ Count dinucleotide frequencies
+
+        This routine simply counts dinucleotide  frequencies  for  the
+  currently  active  region  of  the  sequence.  It also calculates an
+  expected distribution based on the  base  composition.   The  output
+  looks like:
+                T             C             A             G
+          obs  expected obs  expected obs  expected obs  expected
+
+       T   8.44   8.25   6.67   7.01  10.35   9.92   3.27   3.54
+       C   7.49   7.01   6.76   5.95   8.39   8.43   1.76   3.01
+       A  10.13   9.92   7.78   8.43  11.74  11.93   4.89   4.26
+       G   2.67   3.54   3.19   3.01   4.06   4.26   2.42   1.52
+
+ @23. TX 3 5 @ Count codons and amino acids
+
+        This function counts codons, amino acid  composition,  protein
+  molecular  weights,  and base composition. Users select the segments
+  of the sequence that the program should analyse.
+
+        Choose  between  being  shown  observed   counts   or   counts
+  normalised so that the totals for each amino acid sum to 100. Select
+  to define segments using either the  keyboard  or  an  EMBL  feature
+  table.   Define  the  segments to count over. Select strand for each
+  segment. Stop selecting segments by typing a zero  for  "Count  from
+  ()".  The  results are displayed a screenful at a time, and the bell
+  is sounded to show there is more to come. A zero start position,  or
+  the  end  of an EMBL feature table, signals the routine to print out
+  totals for all values.
+
+        The  counts  are  broken  down  into  several  figures.   Base
+  composition  by  position in codon expressed as a percentage of each
+  bases  own  frequency;   base  composition  by  position  in   codon
+  expressed  as  a  percentage  of the overall base composition of the
+  section; base composition expected for this amino  acid  composition
+  if  there  was  no  codon  preference;  percentage deviations of the
+  observed  amino  acid  composition  from  an  average   amino   acid
+  composition.
+
+        The output looks like:
+
+        ===========================================
+        F TTT   1. S TCT   2. Y TAT   2. C TGT   1.
+        F TTC   1. S TCC   1. Y TAC   3. C TGC   2.
+        L TTA   7. S TCA   4. * TAA   9. * TGA   1.
+        L TTG   2. S TCG   1. * TAG   2. W TGG   2.
+        ===========================================
+        L CTT   3. P CCT   2. H CAT   4. R CGT   1.
+        L CTC   2. P CCC   3. H CAC   1. R CGC   0.
+        L CTA   3. P CCA   2. Q CAA   4. R CGA   0.
+        L CTG   2. P CCG   2. Q CAG   1. R CGG   2.
+        ===========================================
+        I ATT   9. T ACT   1. N AAT   7. S AGT   3.
+        I ATC   2. T ACC   2. N AAC   4. S AGC   2.
+        I ATA   4. T ACA   5. K AAA  13. R AGA   5.
+        M ATG   1. T ACG   2. K AAG   4. R AGG   1.
+        ===========================================
+        V GTT   2. A GCT   2. D GAT   1. G GGT   3.
+        V GTC   2. A GCC   2. D GAC   1. G GGC   1.
+        V GTA   4. A GCA   3. E GAA   2. G GGA   1.
+        V GTG   2. A GCG   0. E GAG   1. G GGG   1.
+        ===========================================
+    total codons=      166.
+            T          C          A          G
+
+    1     31.06      33.68      34.03      35.00
+    2     35.61      35.79      30.89      32.50
+    3     33.33      30.53      35.08      32.50
+
+    1     24.70      19.28      39.16      16.87
+    2     28.31      20.48      35.54      15.66
+    3     26.51      17.47      40.36      15.66
+    %     26.51      19.08      38.35      16.06  observed, overall totals
+    %     25.00      22.26      33.10      19.65  expected, even codons per acid
+
+            A    C    D    E    F    G    H    I    K    L
+            7.   3.   2.   3.   2.   6.   5.  15.  17.  19.
+   o-e %  -47. -33. -76. -68. -64. -54.  62. 116.  67.  67.
+
+            M    N    P    Q    R    S    T    V    W    Y
+            1.  11.   9.   5.   9.  13.  10.  10.   2.   5.
+   o-e %  -62.  66.  12. -17.  19.  21.   6.  -2.   0.  -5.
+   total acids=  154. molecular weight=    17421.
+
+   Typical dialogue follows.
+
+  ? Menu or option number=23
+   Calculate codon usage, base composition
+   and amino acid composition
+  ? (y/n) (y) Show observed counts
+  ? (y/n) (y) Define segments using keyboard
+  ? Count from (0-1023) (0) =1
+  ? Count to (1-1023) (1023) =1000
+  ? (y/n) (y) + strand
+
+       ===========================================
+       F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+       F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+       L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+       L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+       ===========================================
+       L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+       L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+       L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+       L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+       ===========================================
+       I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+       I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+       I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+       M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+       ===========================================
+       V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+       V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+       V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+       V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+       ===========================================
+
+
+   Total codons=      333.
+           T          C          A          G
+
+   1     23.32      37.69      28.99      40.06
+   2     37.15      22.31      38.46      36.59
+   3     39.53      40.00      32.54      23.34
+         -----      -----      -----      -----
+   =     100%       100%       100%       100%
+
+   1     17.72      29.43      14.71      38.14  = 100%
+   2     28.23      17.42      19.52      34.83  = 100%
+   3     30.03      31.23      16.52      22.22  = 100%
+   %     25.33      26.03      16.92      31.73  Observed, overall totals
+   %     24.44      22.31      20.90      32.35  Expected, even codons per acid
+
+           A    C    D    E    F    G    H    I    K    L
+          33.  10.  15.   9.  17.  40.  10.  12.   5.  33.
+  O-E %   22.  81. -13. -55.  34.  71.  40. -29. -73.  13.
+
+           M    N    P    Q    R    S    T    V    W    Y
+           2.   7.   2.  13.  49.  20.  11.  30.   5.   2.
+  O-E %  -74. -51. -88.   0. 165. -11. -42.  40.  18. -81.
+  Total acids=  325. Molecular weight=    35831. Hydrophobicity= -17.8
+
+
+  ? Count from (0-1023) (0) =
+
+      Codon totals over all genes
+       ===========================================
+       F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+       F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+       L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+       L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+       ===========================================
+       L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+       L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+       L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+       L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+       ===========================================
+       I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+       I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+       I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+       M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+       ===========================================
+       V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+       V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+       V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+       V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+       ===========================================
+
+
+   Total codons=      333.
+           T          C          A          G
+
+   1     23.32      37.69      28.99      40.06
+   2     37.15      22.31      38.46      36.59
+   3     39.53      40.00      32.54      23.34
+         -----      -----      -----      -----
+   =     100%       100%       100%       100%
+
+   1     17.72      29.43      14.71      38.14  = 100%
+   2     28.23      17.42      19.52      34.83  = 100%
+   3     30.03      31.23      16.52      22.22  = 100%
+   %     25.33      26.03      16.92      31.73  Observed, overall totals
+   %     24.44      22.31      20.90      32.35  Expected, even codons per acid
+
+           A    C    D    E    F    G    H    I    K    L
+          33.  10.  15.   9.  17.  40.  10.  12.   5.  33.
+  O-E %   22.  81. -13. -55.  34.  71.  40. -29. -73.  13.
+
+           M    N    P    Q    R    S    T    V    W    Y
+           2.   7.   2.  13.  49.  20.  11.  30.   5.   2.
+  O-E %  -74. -51. -88.   0. 165. -11. -42.  40.  18. -81.
+  Total acids=  325. Molecular weight=    35831. Hydrophobicity= -17.8
+
+ @24. TX 3 @ Plot base composition
+
+        This option plots the base composition of  the  sequence.  The
+  counts for any combination of bases can be plotted.
+
+        If dialogue is requested the user is presented  with  a  check
+  box for selecting which bases should be counted, and then allowed to
+  define a window length, and a "plot  interval".  Otherwise,  the  AT
+  composition  is  plotted with a window of 101 and a plot interval of
+  5.
+
+        Typical dialogue follows.
+  ? Menu or option number=d24
+   Plot base composition
+
+  checkbox: those set are marked X
+  X 1 T
+    2 C
+  X 3 A
+    4 G
+  ? 0,1,2,3,4 =1
+
+  checkbox: those set are marked X
+    1 T
+    2 C
+  X 3 A
+    4 G
+  ? 0,1,2,3,4 =3
+
+  checkbox: those set are marked X
+    1 T
+    2 C
+    3 A
+    4 G
+  ? 0,1,2,3,4 =2
+
+  checkbox: those set are marked X
+    1 T
+  X 2 C
+    3 A
+    4 G
+  ? 0,1,2,3,4 =4
+
+  checkbox: those set are marked X
+    1 T
+  X 2 C
+    3 A
+  X 4 G
+  ? 0,1,2,3,4 =
+
+  ? odd span length (1-201) (31) =
+  ? plot interval (1-11) (5) =
+
+   missing graphics
+
+
+
+ @25. TX 3 @ Plot local deviations in base composition
+
+        The "local deviation" routines are designed  to  indicate  the
+  similarity  of  the compositions of different parts of the sequence.
+  The composition of every segment of the sequence is compared with  a
+  standard  composition. The levels of similarity are plotted as a chi
+  squared values. The standard can be the  composition  of  the  whole
+  sequence,  or  alternatively  that of a small segment defined by the
+  user.
+
+        If dialogue is forced define the standard region,  the  window
+  length and the plot interval. Otherwise the composition of the whole
+  sequence is taken as a standard. The maximum  and  minimum  observed
+  value  of  the  chi squared calculation is displayed, and plots will
+  always exactly fill the available box. Any unusual regions will show
+  as peaks.
+
+        The following  measure  is  used:  for  each  window  position
+  calculate  (sum((obs-exp)*(obs-exp))/(exp*exp))  where  obs  is  the
+  observed composition  and  exp  is  the  expected  composition  (the
+  composition  of the standard).  The calculation is performed once to
+  find out the range of values and is then  repeated  and  plotted  so
+  that the plot exactly fills the allocated screen space.
+ @26. TX 3 @ Plot local deviations from dinucleotide composition
+
+        The "local deviation" routines are designed  to  indicate  the
+  similarity  of  the compositions of different parts of the sequence.
+  The dinucleotide composition of every segment  of  the  sequence  is
+  compared  with  a standard composition. The levels of similarity are
+  plotted as a chi squared values. The standard can be the composition
+  of  the  whole  sequence,  or  alternatively that of a small segment
+  defined by the user.
+
+        If dialogue is forced define the standard region,  the  window
+  length and the plot interval. Otherwise the composition of the whole
+  sequence is taken as a standard. The maximum  and  minimum  observed
+  value  of  the  chi squared calculation is displayed, and plots will
+  always exactly fill the available box. Any unusual regions will show
+  as peaks.
+
+        The following  measure  is  used:  for  each  window  position
+  calculate  (sum((obs-exp)*(obs-exp))/(exp*exp))  where  obs  is  the
+  observed composition  and  exp  is  the  expected  composition  (the
+  composition  of the standard).  The calculation is performed once to
+  find out the range of values and is then  repeated  and  plotted  so
+  that the plot exactly fills the allocated screen space.
+ @27. TX 3 @ Plot local deviations from trinucleotide composition
+
+        The "local deviation" routines are designed  to  indicate  the
+  similarity  of  the compositions of different parts of the sequence.
+  The trinucleotide composition of every segment of  the  sequence  is
+  compared  with  a standard composition. The levels of similarity are
+  plotted as a chi squared values. The standard can be the composition
+  of  the  whole  sequence,  or  alternatively that of a small segment
+  defined by the user.
+
+        If dialogue is forced define the standard region,  the  window
+  length and the plot interval. Otherwise the composition of the whole
+  sequence is taken as a standard. The maximum  and  minimum  observed
+  value  of  the  chi squared calculation is displayed, and plots will
+  always exactly fill the available box. Any unusual regions will show
+  as peaks.
+
+        The following  measure  is  used:  for  each  window  position
+  calculate  (sum((obs-exp)*(obs-exp))/(exp*exp))  where  obs  is  the
+  observed composition  and  exp  is  the  expected  composition  (the
+  composition  of the standard).  The calculation is performed once to
+  find out the range of values and is then  repeated  and  plotted  so
+  that the plot exactly fills the allocated screen space.
+ @28. TX 5 @ Calculate codon constraint
+
+        The purpose of this option (which is somewhat specialised)  is
+  to measure the level of constraint imposed on the sequence by coding
+  for a protein of the observed composition. It measures the  strength
+  of  the  codon  bias averaged over windows of 99 codons and displays
+  the values observed.
+
+        Select between defining segments at the keyboard or  using  an
+  EMBL  feature  table.  Finish  selecting  segments  by typing a zero
+  start. The value for each segment is displayed:
+
+        Mean (W-EW) / EWD, window 99      10.5
+
+        The codon constraint is the difference  between  the  observed
+  codon  improbability and the mean improbabilty for a sequence of the
+  same composition.   See McLachlan, Staden  and  Boswell  Nucl.  Acid
+  Res. 1984
+ @59. TX 3 @ Plot negentropy
+
+        This routine is designed to show regions of the sequence  that
+  differ  in  composition  from  others,  and  hence is like the "plot
+  deviation.." routines.
+
+        Negentropy or information is defined in the following way: let
+  Pi  be  the  probability  of observing base i, where i = A,C,G or T,
+  then the average information per base is  I=-sum(Pi.Log(Pi))    (sum
+  over  all  i). This routine calculates Pi by calculating the overall
+  composition for the sequence and then plots I for windows of  length
+  defined by the user.
+ @30. TX 4 @ Search for hairpin loops
+
+        Used to find simple  inverted  repeats  or  potential  hairpin
+  loops  The  loops are defined by a range of sizes for the loop and a
+  minimum number of consecutive base pairs in the  stem.  The  results
+  can  be  presented graphically or listed. A-T, G-C and G-T basepairs
+  are counted.
+
+        Define the range of loop  sizes  and  the  minimum  number  of
+  consecutive  basepairs  required.  Choose  between plotted or listed
+  results.
+
+        The loops found are plotted as blips on a horizontal line that
+  represents  the  sequence, the heights of the lines are proportional
+  to  the  number  of  basepairs  in  the  stems.   Note   that   only
+  uninterrupted  stems are found - i.e. all basepairs must be made. To
+  look for stems with some unpaired bases (or for palindromes) use the
+  inverted repeat motif class in the pattern searching option.
+
+        Typical dialogue follows.
+  ? Menu or option number=30
+   Search for hairpin loops
+  Define the range of loop sizes
+  ? Minimum loop size (1-30) (1) =
+  ? Maximum loop size (3-60) (3) =
+  ? Minimum number of basepairs (2-20) (6) =
+  ? (y/n) (y) Plot results n
+   Searching
+
+            T.G
+            G-C
+            G.T
+            T.G
+            C-G
+            G-C
+            T.G
+            C-G
+            G.T
+       GCCGCA GCGGAGG
+           49
+
+             G
+            G-C
+            T.G
+            C-G
+            G.T
+            T.G
+            G-C
+       CTGCTG GGAGGTC
+           56
+
+
+             G
+            T.G
+            G-C
+            G.T
+            T.G
+            C-G
+            G-C
+            T-A
+            T.G
+       AGCGCA CGACTGA
+          139
+
+            A C
+            G.T
+            C-G
+            G.T
+            C-G
+            C-G
+            G-C
+       TTCGCT CAACGCC
+          244
+
+ @31. TX 4 @ Search for long range inverted repeats
+
+        Searches for inverted repeats. The  repeats  found  are  exact
+  matches  of  at  least 6 consecutive bases. Results can be presented
+  graphically or listed.  Plotted  results  show  the  end  points  of
+  repeats joined by rectangular lines.
+
+        If dialogue is not  requested  the  defaults  will  be  taken.
+  Otherwise  choose  between  plotted  or  listed results. If required
+  select to analyse a  restricted  segment  of  the  currently  active
+  region. Choose a repeat length.
+
+        Typical dialogue follows.
+  ? Menu or option number=D31
+   Plot long-range inverted repeats
+  ? (y/n) (y) Plot results n
+  Define restricted region
+  ? start (1-1023) (1) =
+  ? end (2-1023) (1023) =
+  ? Minimum inverted repeat (6-30) (12) =10
+   Searching
+      27     909      10  TGCCCAGAGA
+
+ @32. TX 4 @ Search for repeats
+
+        Searches for direct  repeats.  The  repeats  found  are  exact
+  matches  of  at  least 6 consecutive bases. Results can be presented
+  graphically or listed.  Plotted  results  show  the  end  points  of
+  repeats joined by rectangular lines.
+
+        If dialogue is not  requested  the  defaults  will  be  taken.
+  Otherwise  choose  between  plotted  or  listed results. If required
+  select to analyse a  restricted  segment  of  the  currently  active
+  region. Choose a repeat length.
+
+        Typical dialogue follows.
+   ? Menu or option number=D32
+   Plot repeats
+  ? (y/n) (y) Plot results n
+  Define restricted region
+  ? start (1-1023) (1) =
+  ? end (2-1023) (1023) =
+  ? Minimum repeat (6-30) (12) =8
+   Searching
+     619     988       8  GCTGTTGT
+     514     646       8  GCTGCTAA
+      94     865       8  TCCGCTGG
+     146     222       9  GTGGCTGGC
+     455     497       8  TCGCCCTC
+     454     496       9  CTCGCCCTC
+     872     875       8  GCCGCCGC
+     510     615       8  CGTTGCTG
+     152     913       8  GGCAGCGA
+     199     265       8  CGTCGAGG
+     689     794       8  AGTTTGGG
+     147     223       8  TGGCTGGC
+     101     116       8  GACGAGGA
+       8     690       8  GTTTGGGC
+      52     141       8  TGCTGGTG
+
+ @33. TX 4 @ Search for z dna (total ry, yr)
+
+        Searches for segments of the sequence that might form Z DNA. A
+  window  length  is  chosen and the number of RY and YR dinucleotides
+  within each window is plotted. The top of the box corresponds to all
+  RY or YR, the bottom to zero RY or YR.
+
+        If dialogue is requested, select  a  window  length  and  plot
+  interval. Otherwise the defaults will be used.
+
+        The  program  contains  three  separate  ways  of  doing  this
+  (options 33,34,35).
+ @34. TX 4 @ Search for z dna (runs of ry, yr)
+
+        Searches for segments of the sequence that might form  Z  DNA.
+  Results are plotted.
+
+        If dialogue is requested  define  a  window  length  and  plot
+  interval.  Otherwise  the defaults will be used.  The routine counts
+  the number of R in positions 1,3,5 etc  =R1,  the  number  of  Y  in
+  positions  2,4,6 etc =Y1, the number of Y in positions 1,3,5 etc =Y2
+  and the number of R in positions 2,4,6 etc =R2 for a window  length.
+  It  plots  the  maximum  of R1+Y1 and R2+Y2 relative to a minimum of
+  (window length)/2 and a maximum of (window length). (see 33,35).
+ @35. TX 4 @ Search for z dna (best phased value)
+
+        Searches for segments of the sequence that might form  Z  DNA.
+  Results are plotted.
+
+        If dialogue is requested define a window  length  and  a  plot
+  interval. Ohterwise the defaults values will be used.
+
+        The  routine  counts  the  number  of  consecutive  RY  or  YR
+  dinucleotides  in  phase. It moves through the sequence counting the
+  number of RY or YR dinucleotides; when the next dinucleotide is  not
+  of  the  correct  type  the score is set back to zero and the search
+  restarted using the current base to set the  phase.  The  plots  are
+  done  relative  to  a  minimum  of zero and a maximum defined by the
+  user. (See 33,34).
+ @36. TX 4 @ Local similarity or complementarity search
+
+        This function is designed to find segments of local similarity
+  or  complementarity.  It  is therefore like performing a DIAGON plot
+  that is restricted to regions near the main diagonal.   Results  can
+  be presented graphically or listed.
+
+        Users define a region to search  through,  a  span  length,  a
+  range  for  searching through and a cut-off score. The program takes
+  all sections of sequence of length span within  the  defined  region
+  and compares them to all other sequences within the region and range
+  specified. If a match above the cutoff is found we need to show  the
+  position of the two sections of sequence and the score, and we do it
+  in the following way.  If we have a 70%  match  between  a  sequence
+  that starts at p1 and a sequence that starts at p2 the program draws
+  a diagonal line that starts at p1 with height 70%  of  the  box  and
+  which finishes at p2 with height 0.  The matches can also be listed.
+
+        Here I define the terms range, region, and span  and  what  is
+  compared.   Suppose we have a defined region j1 to j2, a range of i1
+  to i2 and a span of s; the program will take, in turn, all  sections
+  of  sequence  of  length  s within j1 and j2 and compare them to all
+  sequences that start a distance i1+s-1 to  i2+s-1  away  from  them.
+  First  it  will  take  the  sequence  of length s starting at j1 and
+  compare it with the sequence of length s starting at j1+s-1+i1, then
+  j1+s-1+i1+1,  etc up to j1+s-1+i2; then it will take the sequence of
+  length s starting at j1+1 and compare it with the sequence  starting
+  at  j1+s-1+1+i1  etc. This continues until we hit the right hand end
+  of the sequence as defined by j2.  Note  1)that  sequences  are  not
+  compared  with themselves: the nearest sequence compared to a span s
+  starting at j starts at j+s; 2) ranges i1 and i2 are ranges of start
+  positions;  3)  by  choosing  a range greater than the length of the
+  sequence this routine will do a  full  DIAGON  analysis  except  for
+  those  points  within a distance span of the main diagonal (see note
+  1).
+
+        Typical dialog follows.
+
+  ? Menu or option number=36
+   Search for local similarity or complementarity
+  ? (y/n) (y) Find direct repeats
+  ? (y/n) (y) Keep picture n
+  ? Span (5-200) (15) =
+  Define restricted region
+  ? start (0-1023) (1) =
+  ? end (2-1023) (1023) =
+  ? Percent match (1.00-100.00) (70.00) =
+  ? Range start (1-50) (1) =
+  ? Range end (1-50) (1) =5
+  ? (y/n) (y) Plot results n
+   Working
+
+
+         118        128
+           CGAGGAGGAG GTGGA
+            ** *****  ** **
+           GGACGAGGAC GTCGA
+         100        110
+
+
+         119        129
+           GAGGAGGAGG TGGAT
+           ** ***** * * **
+           GACGAGGACG TCGAC
+         101        111
+  ? (y/n) (y) Find direct repeats n
+  ? (y/n) (y) Keep picture
+  ? Span (5-200) (15) =
+  Define restricted region
+  ? start (0-1023) (1) =
+  ? end (2-1023) (1023) =
+  ? Percent match (1.00-100.00) (70.00) =
+  ? Range start (1-50) (1) =
+  ? Range end (1-50) (5) =8
+  ? (y/n) (y) List results
+
+   Working
+
+
+         178        188
+           ACTCAGATCC GGCGG
+           ***** ***  * **
+           ACTCAAATCA GTCGC
+         156        166
+
+
+         177        187
+           CACTCAGATC CGGCG
+            ***** ***  * **
+           AACTCAAATC AGTCG
+         157        167
+  ? (y/n) (y) Find inverted repeats !
+ @37. TX 5 @ Set genetic code
+
+        This function allows the user to  change  the  current  active
+  genetic  code for all the options. The user may select: the standard
+  code, the mammalian mitochondrial code, the yeast mitochondrial code
+  or a personal code (define your own).
+
+        Select code. If personal, define a codon and select  an  amino
+  acid. When all codons have been reset define a blank codon.
+
+        The code differences are:
+            Mammalian        Yeast
+    Codon  Mitochondrial  Mitochondrial  Standard
+     UGA       W              W            STOP
+     AUA       M              M             I
+     CUA       L              T             L
+     AGA      STOP            R             R
+     AGG      STOP            R             R
+
+        Typical dialogue follows.
+  ? Menu or option number=37
+  X 1 Standard code
+    2 Mammalian mitochondrial code
+    3 Yeast mitochondrial code
+    4 Personal code
+  ? 0,1,2,3,4 =2
+
+  ? Menu or option number=37
+  X 1 Standard code
+    2 Mammalian mitochondrial code
+    3 Yeast mitochondrial code
+    4 Personal code
+  ? 0,1,2,3,4 =4
+  Define genetic code by typing a codon
+  followed by a 1 letter amino acid symbol
+  ? Codon=TTT
+  Default Amino acid symbol=F
+  ? Amino acid symbol=W
+  ? Codon=
+ @38. T 3 4 @ Examine repeats
+
+        This function can  be  used  to  examine  the  frequencies  of
+  repeated words within a sequence. It finds all words that occur more
+  than once. The user selects a minimum word length  and  the  program
+  finds  all  words  of that length that occur more than once; then it
+  "follows" each repeated word until it becomes unique. For each  word
+  length  it  can  report  the number of different repeated words, the
+  number of occurrences of each word, and their actual  positions  and
+  sequences.
+
+        It is possible that the  algorithm  may  run  out  of  memory,
+  paticularly  if  a  short  mimimum  word length is chosen, or if the
+  sequence is very long or very repetitive. If this occurs the longest
+  reported  word  length  will  not  necessarily be the longest in the
+  sequence: the memory will have been consumed before the longest word
+  is found.
+  Typical dialogue and output is shown below.
+
+   Expected length of longest repeat    14
+   ? Minumim word length (1-6) (6) =6
+   Working
+   ? Show repeat frequencies for words of at least length (6-15) (15) =10
+   For length    10 the number of different repeated words is  2035
+   For length    11 the number of different repeated words is   613
+   For length    12 the number of different repeated words is   161
+   For length    13 the number of different repeated words is    37
+   For length    14 the number of different repeated words is    10
+   For length    15 the number of different repeated words is     1
+   ? Show repeats for words of length (6-15) (15) =14
+   ? Show repeats for words occuring with frequency (2-9999) (2) =2
+
+   ggtgctcatgccca
+   occurs at  21611
+   occurs at  21851
+   ttatccggtgatga
+   occurs at   4604
+   occurs at   8806
+   agcaccacgctgac
+   occurs at   5954
+   occurs at   9486
+   catgacggaggatg
+   occurs at  10480
+   occurs at  19925
+   aaagacgggaaaat
+   occurs at  11820
+   occurs at  43157
+   tacaaaaccaattt
+   occurs at  26797
+   occurs at  31369
+   cgagaaagagtgcg
+   occurs at   4260
+   occurs at  44305
+   gccggatgatggcg
+   occurs at   7893
+   occurs at  16638
+   atgacggaggatga
+   occurs at  10481
+   occurs at  19926
+   gcggcgaacgaggc
+   occurs at  11352
+   occurs at  18718
+   ? Show repeats for words of length (6-15) (15) =!
+
+  Example of not enough memory
+  ----------------------------
+
+   Expected length of longest repeat    14
+   ? Minumim word length (1-6) (6) =1
+   Working
+   Not enough memory
+   Memory used in bytes 1125996. Length of longest repeat     5
+   ? Show repeat frequencies for words of at least length (1-5) (5) =!
+
+ @39. TX 5 @ Translate and list in upto six phases
+
+        This  is  a  general  listing  function  that   will   perform
+  translations  and produce several forms of output. The possibilities
+  are:
+  1) no translation, list one or two strands, two ways of numbering the
+  sequence.
+  2) translation, one or two strands, one or three letter codes.
+   Positions defined by:
+    a) open reading frames of some minimum length l, l can be 0, hence giving
+  a complete six phase translation.
+    b) positions typed on keyboard, again 1 to 6 phases, translations appearing
+  above and below the dna.
+    c) positions read from a feature table.
+
+  It should be used in preference to option 5. For publication
+  without a translation, the option to number ends of lines is more compact
+  than option 5. Some examples and typical dialogue are given below. Note the
+  requirement for d39.
+
+  ? Menu or option number=D39
+  Find open reading frames, translate and list
+  ? (y/n) (y) Show translation
+
+  The segments to translate can be
+     1 Typed on the keyboard
+     2 Read from a feature table
+  X  3 Open reading frames
+  ? 1,2,3 =
+  ? Minimum open frame in amino acids (0-7238) (30) =
+  ? (y/n) (y) Use 1 letter codes
+  Define section of DNA to display
+  ? start (1-7238) (1) =
+  ? end (2-7238) (7238) =300
+  ? Line length (30-120) (60) =
+  Which strands should be shown
+  X  1 + strand only
+     2 - strand only
+     3 Both strands
+  ? 1,2,3 =3
+  ? (y/n) (y) Number ends of lines
+
+
+      N  A  T  T  I  S  R  I  D  A  T  F  S  A  R  A  P  N  E  N
+     AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+                                          *  S  A  G  W  I  F  I
+        A  V  V  I  L  L  I  S  A  V  K  E  A  R  A  G  F  S  F
+
+      I  A  K  Q  V  I  D  H  L  R  N  V  S  N  G  Q  T  K  S  T
+          L  N  R  L  L  T  I  C  E  M  Y  L  M  V  K  L  N  L  L
+     ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+      Y  S  F  L  N  N  V  M  Q  S  I  Y  R  I  T  L  S  F  R  S
+     I  A  L  C  T  I  S  W  K  R  F  T  D  L  P  *  V  L  D  V
+
+      R  S  Q  N  W  E  S  T  V  T  W  N  E  T  S  R  H  R  T  L
+       V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+     CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+         .    :    .    :    .    :    .    :    .    :    .    :
+     GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+      T  R  L  I  P  F
+     R  E  C  F  Q  S  D  V  T  V  H  F  S  V  E  L  C  R  V  K
+
+      V  A  Y  L  K  H  V  E  L  Q  H  Q  I  Q  Q  L  S  S  K  P
+     GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+         .    :    .    :    .    :    .    :    .    :    .    :
+     CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+     T  A  Y  K  F  C  T  S  S  C  C  W  I
+
+      S  A  K  M  T  S  Y  Q  K  E  Q  L  K  V  L  S  N  P  D  L
+     TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+         .    :    .    :    .    :    .    :    .    :    .    :
+     AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+
+
+  ? Menu or option number=D39
+  Find open reading frames, translate and list
+  ? (y/n) (y) Show translation N
+  Define section of DNA to display
+  ? start (1-7238) (1) =
+  ? end (2-7238) (7238) =300
+  ? Line length (30-120) (60) =
+  Which strands should be shown
+  X  1 + strand only
+     2 - strand only
+     3 Both strands
+  ? 1,2,3 =
+  ? (y/n) (y) Number ends of lines
+
+
+     AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+
+     ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+
+     CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+
+     GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+
+     TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+
+
+  ? Menu or option number=D39
+  Find open reading frames, translate and list
+  ? (y/n) (y) Show translation
+  The segments to translate can be
+     1 Typed on the keyboard
+     2 Read from a feature table
+  X  3 Open reading frames
+  ? 1,2,3 =
+  ? Minimum open frame in amino acids (0-7238) (30) =0
+  ? (y/n) (y) Use 1 letter codes N
+  Define section of DNA to display
+  ? start (1-7238) (1) =
+  ? end (2-7238) (7238) =300
+  ? Line length (30-120) (60) =
+  Which strands should be shown
+  X  1 + strand only
+     2 - strand only
+     3 Both strands
+  ? 1,2,3 =3
+  ? (y/n) (y) Number ends of lines
+
+
+     AsnAlaThrThrIleSerArgIleAspAlaThrPheSerAlaArgAlaProAsnGluAsn
+      ThrLeuLeuLeuLeuValGluLeuMetProProPheGlnLeuAlaProGlnMetLysIle
+       ArgTyrTyrTyr******Asn***CysHisLeuPheSerSerArgProLys***Lys
+     AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+     ValSerSerSerAsnThrSerAsnIleGlyGlyLys***SerAlaGlyTrpIlePheIle
+      Arg************TyrPheGlnHisTrpArgLysLeuGluArgGlyLeuHisPheTyr
+       AlaValValIleLeuLeuIleSerAlaValLysGluAlaArgAlaGlyPheSerPhe
+
+     IleAlaLysGlnValIleAspHisLeuArgAsnValSerAsnGlyGlnThrLysSerThr
+      ***LeuAsnArgLeuLeuThrIleCysGluMetTyrLeuMetValLysLeuAsnLeuLeu
+    TyrSer***ThrGlyTyr***ProPheAlaLysCysIle***TrpSerAsn***IleTyr
+     ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+     TyrSerPheLeuAsnAsnValMetGlnSerIleTyrArgIleThrLeuSerPheArgSer
+      Leu***ValPro***GlnGlyAsnAlaPheHisIle***HisAspPhe***Ile***Glu
+    IleAlaLeuCysThrIleSerTrpLysArgPheThrAspLeuPro***ValLeuAspVal
+
+     ArgSerGlnAsnTrpGluSerThrValThrTrpAsnGluThrSerArgHisArgThrLeu
+      ValArgArgIleGlyAsnGlnLeuLeuHisGlyMetLysLeuProAspThrValLeu***
+    SerPheAlaGluLeuGlyIleAsnCysTyrMetGlu***AsnPheGlnThrProTyrPhe
+     CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+         .    :    .    :    .    :    .    :    .    :    .    :
+     GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+     ThrArgLeuIleProPhe***SerAsnCysProIlePheSerGlySerValThrSer***
+      AsnAlaSerAsnProIleLeuGln***MetSerHisPheLysTrpValGlyTyrLysLeu
+    ArgGluCysPheGlnSerAspValThrValHisPheSerValGluLeuCysArgValLys
+
+     ValAlaTyrLeuLysHisValGluLeuGlnHisGlnIleGlnGlnLeuSerSerLysPro
+      LeuHisIle***AsnMetLeuSerTyrSerThrArgPheSerAsn***AlaLeuSerHis
+    SerCysIlePheLysThrCys***AlaThrAlaProAspSerAlaIleLysLeu***Ala
+     GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+         .    :    .    :    .    :    .    :    .    :    .    :
+     CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+     AsnCysIle***PheMetAsnLeu***LeuValLeuAsnLeuLeu***AlaArgLeuTrp
+      GlnMetAsnLeuValHisGlnAlaValAlaGlySerGluAlaIleLeuSer***AlaMet
+    ThrAlaTyrLysPheCysThrSerSerCysCysTrpIle***CysAsnLeuGluLeuGly
+
+     SerAlaLysMetThrSerTyrGlnLysGluGlnLeuLysValLeuSerAsnProAspLeu
+      ProGlnLys***ProLeuIleLysArgSerAsn***ArgTyrSerLeuIleLeuThrCys
+    IleArgLysAsnAspLeuLeuSerLysGlyAlaIleLysGlyThrLeu***Ser***Pro
+     TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+         .    :    .    :    .    :    .    :    .    :    .    :
+     AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+     GlyCysPheHisGlyArgIleLeuLeuLeuLeu***LeuTyrGluArgIleArgValGln
+      ArgLeuPheSerArgLysAspPheProAlaIleLeuProValArg***AspGlnGlyThr
+    AspAlaPheIleValGlu******PheSerCysAsnPheThrSerGluLeuGlySerArg
+
+
+  ? Menu or option number=D39
+  Find open reading frames, translate and list
+  ? (y/n) (y) Show translation
+  The segments to translate can be
+     1 Typed on the keyboard
+     2 Read from a feature table
+  X  3 Open reading frames
+  ? 1,2,3 =1
+  ? (y/n) (y) Use 1 letter codes
+  Define section of DNA to display
+  ? start (1-7238) (1) =
+  ? end (2-7238) (7238) =300
+  ? Line length (30-120) (60) =
+  Which strands should be shown
+  X  1 + strand only
+     2 - strand only
+     3 Both strands
+  ? 1,2,3 =
+  ? (y/n) (y) Number ends of lines N
+  Translate
+  ? From (0-300) (0) =101
+  ? To (1-300) (300) =300
+  Translate
+  ? From (0-300) (0) =102
+  ? To (1-300) (300) =200
+  Translate
+  ? From (0-300) (0) =
+
+
+     AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT
+             10        20        30        40        50        60
+
+                                              M  V  K  L  N  L  L
+                                               W  S  N  *  I  Y
+     ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT
+             70        80        90       100       110       120
+
+       V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+     S  F  A  E  L  G  I  N  C  Y  M  E  *  N  F  Q  T  P  Y  F
+     CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA
+            130       140       150       160       170       180
+
+       L  H  I  *  N  M  L  S  Y  S  T  R  F  S  N  *  A  L  S  H
+     S  C  I  F  K  T  C
+     GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA
+            190       200       210       220       230       240
+
+       P  Q  K  *  P  L  I  K  R  S  N  *  R  Y  S  L  I  L  T  C
+     TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG
+            250       260       270       280       290       300
+
+
+  ? Menu or option number=D39
+  Find open reading frames, translate and list
+  ? (y/n) (y) Show translation
+  The segments to translate can be
+     1 Typed on the keyboard
+     2 Read from a feature table
+  X  3 Open reading frames
+  ? 1,2,3 =2
+  ? Embl feature table file=1.FT
+  ? (y/n) (y) Use 1 letter codes
+  Define section of DNA to display
+  ? start (1-7238) (1) =
+  ? end (2-7238) (7238) =300
+  ? Line length (30-120) (60) =
+  Which strands should be shown
+  X  1 + strand only
+     2 - strand only
+     3 Both strands
+  ? 1,2,3 =3
+  ? (y/n) (y) Number ends of lines
+
+
+      N  A  T  T  I  S  R  I  D  A  T  F  S  A  R  A  P  N  E  N
+     AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT      60
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TTGCGATGATGATAATCATCTTAACTACGGTGGAAAAGTCGAGCGCGGGGTTTACTTTTA
+                                          *  S  A  G  W  I  F  I
+        A  V  V  I  L  L  I  S  A  V  K  E  A  R  A  G  F  S  F
+
+      I  A  K  Q  V  I  D  H  L  R  N  V  S  N  G  Q  T  K  S  T
+          L  N  R  L  L  T  I  C  E  M  Y  L  M  V  K  L  N  L  L
+     ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT     120
+         .    :    .    :    .    :    .    :    .    :    .    :
+     TATCGATTTGTCCAATAACTGGTAAACGCTTTACATAGATTACCAGTTTGATTTAGATGA
+      Y  S  F  L  N  N  V  M  Q  S  I  Y  R  I  T  L  S  F  R  S
+     I  A  L  C  T  I  S  W  K  R  F  T  D  L  P  *  V  L  D  V
+
+      R  S  Q  N  W  E  S  T  V  T  W  N  E  T  S  R  H  R  T  L
+       V  R  R  I  G  N  Q  L  L  H  G  M  K  L  P  D  T  V  L  *
+     CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA     180
+         .    :    .    :    .    :    .    :    .    :    .    :
+     GCAAGCGTCTTAACCCTTAGTTGACAATGTACCTTACTTTGAAGGTCTGTGGCATGAAAT
+      T  R  L  I  P  F
+     R  E  C  F  Q  S  D  V  T  V  H  F  S  V  E  L  C  R  V  K
+
+      V  A  Y  L  K  H  V  E  L  Q  H  Q  I  Q  Q  L  S  S  K  P
+     GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA     240
+         .    :    .    :    .    :    .    :    .    :    .    :
+     CAACGTATAAATTTTGTACAACTCGATGTCGTGGTCTAAGTCGTTAATTCGAGATTCGGT
+     T  A  Y  K  F  C  T  S  S  C  C  W  I
+
+      S  A  K  M  T  S  Y  Q  K  E  Q  L  K  V  L  S  N  P  D  L
+     TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG     300
+         .    :    .    :    .    :    .    :    .    :    .    :
+     AGGCGTTTTTACTGGAGAATAGTTTTCCTCGTTAATTTCCATGAGAGATTAGGACTGGAC
+                                       *  L  Y  E  R  I  R  V  Q
+                          *  F  S  C  N  F  T  S  E  L  G  S  R
+ @40. TX 5 @ Translate and write the protein sequence to disk
+
+        This routine allows the user  to  translate  sections  of  the
+  sequence  into the 1 letter amino acid codes and store the resulting
+  amino acid sequences in a disk file.  Two modes of use are possible.
+  Either  all open reading frames of at least some minimum length will
+  automatically be found and translated, or the user can specify  that
+  particular segments be translated.
+
+        Mode 1: the user selects to  to  translate  all  open  reading
+  frames.
+
+        Either, or both, strands can be translated.  The  output  file
+  is  in  the  same format as a PIR .seq file. Each protein segment is
+  given an entry name that is its start base in the DNA, and  a  title
+  that  includes  its  end  position,  reading frame and strand (+ for
+  plus, - for minus). Each segment is terminated by * whether  or  not
+  there is a stop codon in the DNA. The file is therefore suitable for
+  input to FASTA, ALIGNL and ANALYSEPL.
+
+        Mode  2:  the  user  selects  to  identify  the  segments   to
+  translate.
+
+        Either, or both,  strands  can  be  translated.   If  multiple
+  coding  regions  are  translated  each  will  be  separated from the
+  previous one by  a  gap  of  5  dashes  (-----).   The  sections  to
+  translate  can be defined from the keyboard or by supplying the name
+  of the appropriate EMBL library feature table.
+
+        Typical dialogue follows.
+  ? Menu or option number=40
+   Translate and write protein sequence to disk
+  ? (y/n) (y) Translate selected regions
+  ? (y/n) (y) Define segments using keyboard
+  Translate
+  ? From (0-1023) (0) =1
+  ? To (1-1023) (1023) =111
+  ? (y/n) (y) + strand
+  Translate
+  ? From (0-1023) (0) =
+  ? Output file name=1.OUT
+
+   ? Menu or option number=40
+   Translate and write protein sequence to disk
+  ? (y/n) (y) Translate selected regions n
+  ? Minimum open frame in amino acids (5-1000) (30) =
+
+  X 1 + strand only
+    2 - strand only
+    3 Both strands
+  ? 0,1,2,3 =3
+  ? File name for translation=1.OUT
+
+  ? Menu or option number=6
+  Page through text files
+  ? Name of file to read=1.OUT
+  >P1;    25
+      135     1 +
+   GAQRLLRRSCWCWRCGGRQRTQGSAGRGRRRRGGGG*
+  >P1;   238
+      486     1 +
+   IRCRDCGQRRRGIFDLVDDFHVRRHIVLARKLFEAEGTGVHFHISLMGGNIVTAEVTNVR
+   VDAGADFAAVRMLALFGAVVPH*
+  >P1;   556
+      795     1 +
+
+   SSTQVRRASAQTSSLQLESIVAVVNVEVFLAAKHSRFYIAVLFAQFGPLLDARLDRGCGK
+   GAGRRDQWRGGGVDLANGR*
+  >P1;   796
+      987     1 +
+
+   FGYADHAFHLRSTSRHSDNVKFDSAGRRRCCCFHLVFSLGSDEEGLLARLLVEVTTIRVV
+   LRG*
+  >P1;     2
+      163     2 +
+   NSVWAWCEVPRDYCAAAAGAGGAEVVNGPRDPLDEDVDDEEEVDSALLVAGSD*
+  >P1;   176
+      391     2 +
+   PLRSGGGGVEAPETPSGWPARFAAATVANAVEGFSILWMIFTCAVILSLRVNSLKQKGQG
+   YTFTFRLWEVT*
+  >P1;   476
+      628     2 +
+   SLTEPSASPSPTLLLRFSLVLTEGVPNPALRFGVLPLRPAAFNLNPSLLL*
+  >P1;   629
+      958     2 +
+   MSRYSWLLNTAGFTSPFCLPSLGRFWTRGLTVAVEKEPAGETNGVEAALTLPMGVSLGML
+   TMLFTCAPPAAIPIMLSLIPLAAAAAAVSTWCFLWAAMRKACWRACSLR*
+  >P1;     3
+      293     3 +
+   IRFGLGVRCPEITAPQLLVLAVRRSSTDPGIRWTRTSTTRRRWIAHCWWLAATDLSSDHS
+   DPAAEASRLPKLPVAGLLDSLPRLWPTPSRDFRSCG*
+  >P1;   411
+      521     3 +
+   CACRRGSRLCSGTYARPLWCSSPSLSPPPRPRQRCC*
+  >P1;  1020
+       37     1 -
+   EFGKYNPLTDNSSPTQDHTDGSHLNEQARQQAFLIAAQRKHQVETAAAAAASGIKLNIIG
+   MAAGGAQVKSMVSIPKLTPIGKVNAASTPLVSPAGSFSTATVKPRVQKRPKLGKQNGDVK
+   PAVFSSQEYLDIYNSNDGFKLKAAGLSGSTPNLSAGLGTPSVKTKLNLSSNVGEGEAEGS
+   VRDYCTKEGEHTYRCKVCSRVYTHISNFCRHYVTSHKRNVKVYPCPFCFKEFTRKDNMTA
+   HVKIIHKIENPSTALATVAAANLAGQPLGVSGASTPPPPDLSGQNSNQSLPATSNALSTS
+   SSSSTSSSSGSLGPLTTSAPPAPAAAAQ*
+  >P1;   373
+       -1     2 -
+   AKCESVPLSLLLQRVYAQGQYDGARENHPQDRKSLDGVGHSRGSESSRPATGSFGSLDAS
+   AAGSEWSELKSVAASHQQCAIHLLLVVDVLVQRIPGSVDDLRTASTSSCGAVISGHLTPS
+   PNRI*
+  >P1;   517
+      407     2 -
+   QQRWRGRGGGLSEGLLHQRGRAYVPLQSLLPRLHAH*
+  >P1;   649
+      518     2 -
+   QPGIPRHLQQQRWIQVEGCWSERKHAEPECWIRNSLCQNQAES*
+  >P1;   853
+      650     2 -
+   HYRNGGWWSAGEKHGQHTQTNAHWQGQRRLHAIGLACRLLFHSHGQAARPEAAQTQTER
+   RCKTGCV*
+  >P1;   958
+      854     2 -
+   SPQRAGAPTSLPHRCPEKTPGGNSSSGGGQRNQT*
+  >P1;   179
+       78     3 -
+   VVRTQISRCQPPAMRYPPPPRRRRPRPADPWVR*
+  >P1;   479
+      363     3 -
+   GTTAPKRASIRTAAKSAPASTRTLVTSAVTMLPPISEM*
+  >P1;   791
+      666     3 -
+   RPLARSTPPPRHWSRLPAPFPQPRSSRASRSGPNWANRTAM*
+  >P1;  1022
+      819     3 -
+   SNSASTTRSPTTAHPRRTTRMVVTSTSRRANKPSSSLPRENTRWKQQQRRRPAESNLTLS
+   EWRLVERR*
+  End of file
+ @41. TX 5 @ Calculate and write codon table to disk
+
+        This routine calculates codon usage tables for sections of the
+  sequence  and  stores the resulting tables on disk.  The sections to
+  translate can be defined from the keyboard or by supplying the  name
+  of the appropriate EMBL library feature table.
+
+        If required users can add to an existing codon table stored as
+  a  disk  file. Choose between storing observed counts or having them
+  normalised so that the totals for each amino acid sum to 100. Select
+  between  defining  segments at the keyboard or using an EMBL feature
+  table. Define segments. Signal completion with a zero start.  Supply
+  a  file  name. For each segment the program will display the counts,
+  at the end it will display the accumulated totals.
+
+   Typical dialogue follows.
+  ? Menu or option number=41
+   Calculate and write codon table to disk
+  ? (y/n) (y) Start with empty table
+  ? (y/n) (y) Show observed counts
+  ? (y/n) (y) Define segments using keyboard
+  ? Count from (0-1023) (0) =1
+  ? Count to (1-1023) (1023) =111
+  ? (y/n) (y) + strand
+
+       ===========================================
+       F TTT   0. S TCT   0. Y TAT   0. C TGT   0.
+       F TTC   1. S TCC   1. Y TAC   0. C TGC   3.
+       L TTA   1. S TCA   0. * TAA   0. * TGA   1.
+       L TTG   2. S TCG   0. * TAG   0. W TGG   2.
+       ===========================================
+       L CTT   0. P CCT   0. H CAT   0. R CGT   2.
+       L CTC   0. P CCC   0. H CAC   0. R CGC   2.
+       L CTA   0. P CCA   0. Q CAA   1. R CGA   1.
+       L CTG   1. P CCG   0. Q CAG   2. R CGG   2.
+       ===========================================
+       I ATT   0. T ACT   0. N AAT   0. S AGT   0.
+       I ATC   0. T ACC   1. N AAC   0. S AGC   1.
+       I ATA   0. T ACA   0. K AAA   0. R AGA   1.
+       M ATG   0. T ACG   0. K AAG   0. R AGG   0.
+       ===========================================
+       V GTT   0. A GCT   1. D GAT   0. G GGT   3.
+       V GTC   0. A GCC   1. D GAC   0. G GGC   1.
+       V GTA   0. A GCA   0. E GAA   1. G GGA   4.
+       V GTG   1. A GCG   0. E GAG   0. G GGG   0.
+       ===========================================
+  ? Count from (0-1023) (0) =
+
+      Codon totals over all genes
+       ===========================================
+       F TTT   0. S TCT   0. Y TAT   0. C TGT   0.
+       F TTC   1. S TCC   1. Y TAC   0. C TGC   3.
+       L TTA   1. S TCA   0. * TAA   0. * TGA   1.
+       L TTG   2. S TCG   0. * TAG   0. W TGG   2.
+       ===========================================
+       L CTT   0. P CCT   0. H CAT   0. R CGT   2.
+       L CTC   0. P CCC   0. H CAC   0. R CGC   2.
+       L CTA   0. P CCA   0. Q CAA   1. R CGA   1.
+       L CTG   1. P CCG   0. Q CAG   2. R CGG   2.
+       ===========================================
+       I ATT   0. T ACT   0. N AAT   0. S AGT   0.
+       I ATC   0. T ACC   1. N AAC   0. S AGC   1.
+       I ATA   0. T ACA   0. K AAA   0. R AGA   1.
+       M ATG   0. T ACG   0. K AAG   0. R AGG   0.
+       ===========================================
+       V GTT   0. A GCT   1. D GAT   0. G GGT   3.
+       V GTC   0. A GCC   1. D GAC   0. G GGC   1.
+       V GTA   0. A GCA   0. E GAA   1. G GGA   4.
+       V GTG   1. A GCG   0. E GAG   0. G GGG   0.
+       ===========================================
+  ? (y/n) (y) Save table in a file n
+ @42. TX 6 @ Codon usage method
+
+        Used to find protein coding regions. For each window length of
+  the sequence the routine measures the closeness to an expected codon
+  usage. Results are plotted for each of  the  three  reading  frames.
+  Stop  and start codons are also marked on the plots. Has the highest
+  resolution of all such methods, but makes the strongest  assumption,
+  i.e.  that the codon usage is known. The latest version is described
+  in Methods in Enzymology 183, 193-211.
+
+        Choose whether to use an internal standard (i.e. part  of  the
+  current  sequence known to code for a protein). If so define its end
+  points, and those of any others. Otherwise supply the name of a disk
+  file  containing  a  table of codon usage. Tables are listed. Choose
+  between using the observed counts, or two  types  of  normalisation:
+  normalised  to give an average amino acid composition; normalised to
+  no amino  acid  bias.  The  first  normalisation  is  clearly  often
+  sensible,  but  the  second removes valuable information and is only
+  made availabe for special circumstances. The  final  table  will  be
+  displayed, followed by the expected scores for window lengths 21, 31
+  and 41 codons. The scores for each of the three reading  frames  are
+  shown  (they  are  logarithmic values) to help users choose a window
+  length for the analysis. Define a window length and  plot  interval.
+  Plotting will start.
+
+        The method was first described in Staden and  McLachlan  Nucl.
+  Acid  Res.  10  141-156 (1982) and the following is a summary of the
+  initial ideas.  The method makes the following main assumptions: the
+  codon  preferences of all the genes in the sequence we are examining
+  are similar to  those  of  the  standard;  the  sequence  is  coding
+  throughout its whole length in only one reading frame; in the coding
+  frame the frequency of codon abc has a definite value Fabc
+  If we select a  sequence   a1b1c1a2b2c2a3b3c3,...,anbncnan+1bn+1cn+1
+  then the probability of selecting it in each of the three frames is:
+               frame 1: p1=Fa1b1c1.Fa2b2c2....Fanbncn
+               frame 2: p2=Fb1c1a2.Fb2c2a3...Fbncnan+1
+               frame 3: p3=Fc1a2b2.Fc2a3b3...Fcnan+1bn+1
+  The probability that selection of a particular sequence was "caused"
+  by it being a coding sequence is:
+  P1=p1/(p1+p2+p3), P2=p2/(p1+p2+p3), P3=p3/(p1+p2+p3).
+  The program calculates these values for the given window length  but
+  plots Log(P/(1-P)) for each of the three frames. At each point along
+  the sequence that the program has a point to plot it finds which  of
+  the  three  values  is  highest and places a single point at the 50%
+  level for the corresponding frame. These single points will join  to
+  form  a solid line if one frame is consistently the highest scoring.
+  In addition stop codons are  shown  as  short  vertical  lines  that
+  bisect the 50% level of probability. When looking for coding regions
+  the user should look for solid horizontal lines  at  the  50%  level
+  that are not interrupted by these short vertical lines.
+
+        Changes.  Two normalisations are offered:  1)  to  remove  all
+  amino  acid  compositional components from the tables, hence leaving
+  only the  codon  preference  components.  In  general  this  is  not
+  recommended as the amino acid component alone is often sufficient to
+  choose correctly between  frames,  but  may  be  useful  in  special
+  circumstances. 2) to change the amino acid composition components to
+  give an average amino acid composition rather the the one  contained
+  in  the  standard  (this  leaves  the  codon  preference  components
+  unchanged). In general this should be useful as  the  average  amino
+  acid  composition  is  likely to be closer to the composition of the
+  genes being hunted, than is that of  the  standard  table  of  codon
+  preferences.  The  average composition is that recently published by
+  Argos, not the Dayhoff one that we have used before.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=42
+  Staden and McLachlan codon usage method
+  Codon tables for standards may be read from disk
+  or calculated from parts of the current sequence
+  ? (y/n) (y) Define internal standard
+  Define standard
+  ? start (0-1023) (0) =1
+  ? end (2-1023) (1023) =1000
+       ===========================================
+       F TTT  13. S TCT   1. Y TAT   1. C TGT   3.
+       F TTC   4. S TCC  10. Y TAC   1. C TGC   7.
+       L TTA   1. S TCA   0. * TAA   1. * TGA   4.
+       L TTG   4. S TCG   1. * TAG   3. W TGG   5.
+       ===========================================
+       L CTT   9. P CCT   1. H CAT   3. R CGT  14.
+       L CTC   7. P CCC   0. H CAC   7. R CGC  14.
+       L CTA   0. P CCA   0. Q CAA   4. R CGA   9.
+       L CTG  12. P CCG   1. Q CAG   9. R CGG   8.
+       ===========================================
+       I ATT   7. T ACT   4. N AAT   4. S AGT   1.
+       I ATC   4. T ACC   5. N AAC   3. S AGC   7.
+       I ATA   1. T ACA   1. K AAA   3. R AGA   2.
+       M ATG   2. T ACG   1. K AAG   2. R AGG   2.
+       ===========================================
+       V GTT  11. A GCT  13. D GAT   6. G GGT   9.
+       V GTC   5. A GCC  10. D GAC   9. G GGC  11.
+       V GTA   6. A GCA   5. E GAA   6. G GGA  12.
+       V GTG   8. A GCG   5. E GAG   3. G GGG   8.
+       ===========================================
+  Define standard
+  ? start (0-1023) (0) =
+  Total codons in standard=     333.
+  X 1 Use observed frequencies
+    2 Normalize to average amino acid composition
+    3 Normalize to no amino acid bias
+  ? 0,1,2,3 =2
+       ===========================================
+       F TTT  19. S TCT   2. Y TAT  10. C TGT   3.
+       F TTC   6. S TCC  22. Y TAC  10. C TGC   8.
+       L TTA   2. S TCA   0. * TAA   0. * TGA   0.
+       L TTG   7. S TCG   2. * TAG   0. W TGG   8.
+       ===========================================
+       L CTT  16. P CCT  16. H CAT   4. R CGT  10.
+       L CTC  12. P CCC   0. H CAC  10. R CGC  10.
+       L CTA   0. P CCA   0. Q CAA   8. R CGA   7.
+       L CTG  21. P CCG  16. Q CAG  18. R CGG   6.
+       ===========================================
+       I ATT  19. T ACT  13. N AAT  16. S AGT   2.
+       I ATC  11. T ACC  17. N AAC  12. S AGC  15.
+       I ATA   3. T ACA   3. K AAA  22. R AGA   1.
+       M ATG  15. T ACG   3. K AAG  15. R AGG   1.
+       ===========================================
+       V GTT  15. A GCT  21. D GAT  14. G GGT  10.
+       V GTC   7. A GCC  16. D GAC  20. G GGC  13.
+       V GTA   8. A GCA   8. E GAA  26. G GGA  14.
+       V GTG  11. A GCG   8. E GAG  13. G GGG   9.
+       ===========================================
+  Span length  21 expected mean values:   4.8  -5.7  -4.8
+  Span length  31 expected mean values:   7.1  -8.4  -7.2
+  Span length  41 expected mean values:   9.5 -11.1  -9.5
+  ? odd span length (11-101) (25) =41
+  ? plot interval (1-11) (5) =
+
+   Missing graphics display here
+
+ @43. TX 6 @ Positional base preference method.
+
+        Used to find protein coding regions. For each window length of
+  the  sequence  the  routine  measures  the  closeness to an expected
+  pattern of base frequencies . Results are plotted for  each  of  the
+  three  reading  frames. Stop and start codons are also marked on the
+  plots.  The method is particularly useful for showing which  reading
+  frame  is  the  most  likely  to  be  coding.  The latest version is
+  described in a forthcoming issue of Methods in Enzymology,  but  the
+  original  ideas  were given in Staden, R. Nucl. Acid Res. 12 551-567
+  (1984).
+
+        If dialogue is requested  the  following  inputs  are  needed,
+  otherwise  the  standard  analysis  is  performed.  Choose between a
+  "global" standard, or a selected one.  If  the  global  standard  is
+  selected  the  expected  scores  are displayed and the user asked to
+  define a span length and a plot interval. Then users choose  between
+  plotting  relative  or  absolute  scores,  and can reset the scaling
+  values employed  for  plotting.   If  the  global  standard  is  not
+  selected  users  must  define  a  region of the sequence to use as a
+  standard, or they can read in a codon table from which  the  program
+  will calculate one. Then they can either, use the values observed in
+  this standard,  or  they  can  combine  its  values  for  the  third
+  positions  in codons, with those from the global standard. Next they
+  can give different weightings to each  of  the  three  positions  in
+  codons.
+
+        In its original form the method took advantage of  the  uneven
+  use of amino acids by proteins and the structure of the genetic code
+  table and assumed that there is  a  typical  ("global")  amino  acid
+  composition   and  no  codon  preference.  The  typical  amino  acid
+  composition is the average composition found by Argos  (see  below).
+  This composition and no codon preference determines the frequency of
+  each of the four bases in each of the three  codon  positions.  This
+  3x4 frequency table shows unequal use of the bases and in particular
+  a marked use of G in position 1 and of  A  in  position  2  (at  the
+  expence  of  G).  The routine slides a window along the sequence and
+  calculates a score for each of the  three  reading  frames  at  each
+  window  position.  It  assumes the sequence is coding throughout its
+  whole length and calcualtes the probability that  it  is  coding  in
+  each  of  the  three  frames.  When  tested  against all the E. coli
+  sequences in the EMBL sequence library it correctly  identified  the
+  coding  frame  for  91% of window positions.  (The E. coli sequences
+  were chosen only for technical reasons: I have no  reason  to  think
+  the method would work less well on other organisms with roughly even
+  base composition.)  The routine can plot either absolute or relative
+  values:  ie  absolute  values  are  the  values found by summing the
+  scores for each frame (say p1, p2 and p3), and the  relative  values
+  are then p1/(p1+p2+p3), p2/(p1+p2+p3) and p3/(p1+p2+p3).
+
+        At each point along the sequence that the program has a  point
+  to  plot  it finds which of the three values is highest and places a
+  single point at the 50% level for  the  corresponding  frame.  These
+  single  points  will  join  to  form  a  solid  line if one frame is
+  consistently the highest scoring. In addition stop codons are  shown
+  as  short  vertical  lines that bisect the 50% level of probability.
+  When looking for coding regions  the  user  should  look  for  solid
+  horizontal  lines at the 50% level that are not interrupted by these
+  short vertical lines.  The absolute  mean  values  expected  on  the
+  complement of the coding strand (and in the same frame) are 5% lower
+  than those on the coding strand but the relative values are the same
+  on  both  strands.  Although the relative values give smoother plots
+  and tend to emphasize the coding frame  they  therefore,  cannot  be
+  used  to  decide  which  strand  is coding. The absolute values plot
+  should be used for this purpose but bearing in mind the fact the the
+  differences between strands are quite small.
+
+        The method has been improved in two overall ways: first it now
+  allows  users  to define their own typical amino acid composition by
+  selecting a standard sequence from  within  the  sequence  they  are
+  analysing or from a codon table; secondly it allows the inclusion of
+  third position preferences. Again these third  position  preferences
+  are  defined  by  the use of an internal standard sequence. Not only
+  can users define their own standards but they can also give  weights
+  to  each  of  the  three  positions in codons. This allows different
+  emphasis to be used for each of the three positions. As  an  example
+  of  its  use, by giving, in turn, weights of 1.0, 0.0, 0.0, and 0.0,
+  1.0, 0.0, and finally 0.0, 0.0, 1.0,  you  could  see  the  separate
+  contribution  made  by  each  of  the  three  positions.  It is also
+  possible to use the third position preferences with the  values  for
+  the  first  two  positions  taken  from  the  "global"   amino  acid
+  composition. In all cases users  may  choose  to  plot  absolute  or
+  relative  values.  The  expected  scores  are  displayed before each
+  analysis and scales are drawn on the plots.  At present this  method
+  does  not  give probabilities of coding; it has only been tested for
+  its ability to choose the correct  reading  frame  (see  above).  It
+  could  be used to give probabilities of coding if was applied to all
+  known coding and non-coding sequences in the  way  that  the  uneven
+  positional base frequencies method was. It is designed to be used in
+  conjunction  with  this  method.  Note  that   the   average   amino
+  composition  used  to derive the base frequencies was changed on 17-
+  11-1988, to be the new  average  given  by  McCaldon  and  Argos  in
+  Proteins  4  99-122  (1988).   A further change is to allow users to
+  select their own scales for producing the plots. It can  be  helpful
+  if they want to emphasise or diminish certain features.
+
+        Typical dialogue follows.
+  ? Menu or option number=D43
+  Positional base preferences method to find protein genes
+  Select standard source
+  X  1 Use global standard
+     2 Use internal standard
+     3 Use codon usage table
+  ? Selection  (1-3) (1) =2
+  Define region for standard
+  ? start (0-8134) (0) =3171
+  ? end (3172-8134) (8134) =4700
+  Select normalisation
+  X  1 Use observed frequencies
+     2 Combine with global standard
+  ? Selection  (1-2) (1) =1
+            T      C      A      G      Range
+        1  0.125  0.249  0.230  0.397  0.272
+        2  0.298  0.245  0.292  0.165  0.132
+        3  0.288  0.313  0.169  0.230  0.144
+  ? (y/n) (y) Use 1.0 for positional weights
+  Give weights between 0.0 and 1.0
+  to each of the 3 codon positions
+  ? Position 1 (0.00-1.00) (1.00) =
+  ? Position 2 (0.00-1.00) (1.00) =
+  ? Position 3 (0.00-1.00) (1.00) =
+  Expected scores per codon in each frame
+         0.136     0.122     0.123
+  ? odd span length (31-101) (67) =
+  ? plot interval (1-11) (5) =
+  ? (y/n) (y) Plot relative scores
+  Scaling values:
+     Minimum  maximum    range
+      0.3121   0.3656   0.0382
+  ? (y/n) (y) Leave scaling values unchanged
+
+    Graphics not shown
+
+  ? Menu or option number=D43
+  Positional base preferences method to find protein genes
+  Select standard source
+  X  1 Use global standard
+     2 Use internal standard
+     3 Use codon usage table
+  ? Selection  (1-3) (1) =3
+  ? File name of standard=atpase.cods
+       ===========================================
+       F TTT  21. S TCT  33. Y TAT  15. C TGT   5.
+       F TTC  55. S TCC  40. Y TAC  40. C TGC   4.
+       L TTA   8. S TCA   7. * TAA   8. * TGA   0.
+       L TTG  19. S TCG  12. * TAG   1. W TGG  17.
+       ===========================================
+       L CTT  22. P CCT  17. H CAT   6. R CGT  73.
+       L CTC  21. P CCC   4. H CAC  30. R CGC  23.
+       L CTA   1. P CCA  10. Q CAA  19. R CGA   5.
+       L CTG 168. P CCG  48. Q CAG  80. R CGG   3.
+       ===========================================
+       I ATT  47. T ACT  14. N AAT  17. S AGT   8.
+       I ATC  98. T ACC  54. N AAC  52. S AGC  26.
+       I ATA   6. T ACA   7. K AAA  85. R AGA   0.
+       M ATG  75. T ACG  13. K AAG  28. R AGG   0.
+       ===========================================
+       V GTT  67. A GCT  56. D GAT  41. G GGT  90.
+       V GTC  29. A GCC  53. D GAC  66. G GGC  66.
+       V GTA  49. A GCA  59. E GAA 101. G GGA   5.
+       V GTG  57. A GCG  64. E GAG  41. G GGG   8.
+       ===========================================
+  Select normalisation
+  X  1 Use observed frequencies
+     2 Combine with global standard
+  ? Selection  (1-2) (1) =2
+            T      C      A      G      Range
+        1  0.177  0.211  0.277  0.336  0.159
+        2  0.271  0.238  0.310  0.182  0.128
+        3  0.242  0.301  0.168  0.289  0.132
+  ? (y/n) (y) Use 1.0 for positional weights
+  Expected scores per codon in each frame
+         0.785     0.736     0.736
+  ? odd span length (31-101) (67) =
+  ? plot interval (1-11) (5) =
+  ? (y/n) (y) Plot relative scores
+  Scaling values:
+     Minimum  maximum    range
+      0.3219   0.3519   0.0214
+  ? (y/n) (y) Leave scaling values unchanged
+
+    Graphics not shown
+ @44. TX 6 @ Uneven positional base frequencies.
+
+        Used to find regions of a sequence that might be coding for  a
+  protein.  The method looks for sections of the sequence in which the
+  frequency at which  each  of  the  four  bases  occupies  the  three
+  positions  in  codons  is  nonrandom.  The level of nonrandomness is
+  plotted on a scale that shows the probability that the  sequence  is
+  coding.  At each position along a sequence the calculation gives the
+  same value for all six possible reading frames, so only one value is
+  plotted.
+
+        Define the window length and plot interval.
+
+        The results are plotted in a box divided by a horizontal  line
+  marked  "76%".  76% of coding regions achieve values above this line
+  and 76% of noncoding regions achieve scores below the line.
+
+        This method, first described in  Staden R. Nucl. Acid Res.  12
+  551-567  1984, looks for uneven positional usage of bases in codons.
+  It looks through the sequence in one  fixed  phase  and  counts  the
+  number  of  times  each  base  apears  in  each  of  the three codon
+  positions: for each window position it counts A1,A2,A3 and  C1,C2,C3
+  and  G1,G2,G3  and  T1,T2,T3  and calculates AMEAN=(A1+A2+A3)/3, and
+  similarly CMEAN, GMEAN and TMEAN; it  then  calculates  ADIF=abs(A1-
+  AMEAN)+abs(A2-AMEAN)+abs(A3-AMEAN) and similarly CDIF, GDIF and TDIF
+  to measure the differences  between  an  even  base  usage  for  all
+  positions  in  the  codons  and the observed usage. The routine then
+  calculates the sum ADIF+CDIF+GDIF+TDIF and plots this value  on  the
+  following  scale:  the  base level is such that no known window in a
+  coding region has a lower value, whereas 14% of windows in noncoding
+  sequences  score  below  it. The top of the scale is not achieved by
+  any known noncoding region, but is reached by 16%  of  known  coding
+  regions.  The  bar drawn across the plot corresponds to a level that
+  is exceeded by 76% of windows in known coding regions but is reached
+  by  only 24% of windows in known noncoding regions. ie 76% of coding
+  windows score above and 76% of noncoding windows score below.   This
+  is  similar  to  Ficketts  method  but without the probabilities and
+  weightings from the Los Alamos sequence  library:  it  is  therefore
+  unbiased but may well give very similar results.
+ @45. TX 6 @ Codon improbability on base composition
+
+        Used to find regions of a  sequence  that  might  code  for  a
+  protein.
+
+        If dialogue is requested  define  a  window  length  and  plot
+  interval.
+
+        The idea of the method is, that of all sequence features  that
+  we  know,  it  is  only  coding regions that will give rise to codon
+  biases well above those expected from the base  composition.   If  a
+  region  of  sequence  shows  sufficiently  strong codon bias then we
+  conclude that it is coding for a  protein.   Using  the  multinomial
+  distribution we have derived a function to measure the improbability
+  of  observing  a  set  of  codons  from  a  sequence  of  the  given
+  composition.  Using  the Poisson distribution we have worked out the
+  distribution of the improbability. The program  plots  the  observed
+  improbability   minus   the  expected  improbability  (the  mean  as
+  calculated from the Poisson distribution). The plots  are  presented
+  against  a scale of units of standard deviation as measured from the
+  Poisson distribution. As with the other Staden and McLachlan  method
+  the  program puts an extra point at a fixed level for the highest of
+  the three probabilities; for this function this point is  placed  at
+  six  standard  deviations  above the mean expected level. The top of
+  each plot corresponds to 12 standard deviations above  the  expected
+  level and the bottom corresponds to the expected value.
+
+        Analysis of the application of the method to the EMBL sequence
+  library  indicates  that the method does work for most sequences and
+  that the levels of improbability roughly correlate  with  levels  of
+  expression.  Coding regions will show high peaks in all three frames
+  making interpretation more difficult than  for  some  of  the  other
+  methods.
+ @46. TX 6 @ Codon improbability on amino acid composition
+
+        Used to finds regions of a sequence  that  might  code  for  a
+  protein.
+
+        If dialogue is requested define a window  length  and  a  plot
+  interval.
+
+        The idea of the method is, that of all sequence features  that
+  we  know,  it  is  only  coding regions that will give rise to codon
+  biases such that, for each amino acid, some codons are used far more
+  frequently  than  others. The method is independent of what the bias
+  actually is, requiring only that it is  present.   If  a  region  of
+  sequence  shows sufficiently strong codon bias then we conclude that
+  it is coding for a protein.  Using the multinomial  distribution  we
+  have  derived a function to measure the improbability of observing a
+  set of codons from a sequence of the given  composition.  Using  the
+  Poisson  distribution  we  have  worked  out the distribution of the
+  improbability. The program plots the  observed  improbability  minus
+  the  expected improbability (the mean as calculated from the Poisson
+  distribution). The plots are presented against a scale of  units  of
+  standard  deviation  as  measured  from the Poisson distribution. As
+  with the other Staden and McLachlan method the program puts an extra
+  point  at  a fixed level for the highest of the three probabilities;
+  for this function this point is placed at  six  standard  deviations
+  above  the  mean expected level. The top of each plot corresponds to
+  12 standard deviations above  the  expected  level  and  the  bottom
+  corresponds to the expected value.
+ @47. TX 6 @ Shepherd RNY preference method
+
+        Used to find regions of a  sequence  that  might  code  for  a
+  protein. Based on the method of Shepherd (PNAS 78 1596-1600, 1981).
+
+        If dialogue is requested  define  a  window  length  and  plot
+  interval.
+
+        Shepherd has found that many genes have a preference  for  the
+  use of codons of the form RNY where R=purine, Y=pyrimidine and N=any
+  base. He has attributed this to being due to remants of a  primitive
+  genetic  code. The calculation is similar to that for the Staden and
+  McLachlan method, the p1's being simply the  number  of  RNY  codons
+  found in frame 1 etc and the P's being p/(p1+p2+p3).
+ @48. TX 6 @ Ficketts method
+
+        Used to find regions of a  sequence  that  might  code  for  a
+  protein.  Based  on  the method of Fickett (Nucl. Acid Res.10 1982),
+  but plots values for fixed window lengths rather than over the whole
+  of open reading frames.
+
+        If dialogue is requested  define  a  window  length  and  plot
+  interval.  The  results  are  plotted  in  a  box divided into three
+  horizontal strips.
+
+        Sections of the sequence with values plotted in the top  strip
+  of  the box are adjudged to be coding, those in the middle strip "no
+  decision", and those in the bottom "not coding".
+
+        The program performs the following calculations: let A1 =  the
+  number  of  occurences  of  base  A  in position 1 of codons, A2 for
+  position 2 etc. Similarly for bases  C,G  and  T.  For  each  window
+  position calculate Apos=max(A1,A2,A3)/min(A1,A2,A3)+1. Similarly for
+  C,G and  T  to  give  4  positional  values.  Also  count  the  base
+  composition for the window to give Acomp, Ccomp etc. Fickett  tested
+  each of these 8 parameters singly as to their ability to distinguish
+  coding  from noncoding regions and arived at probabilities of coding
+  for the range of values each can take = Pcod. He also measured their
+  relative  abilities and given weightings to each of the 8 parameters
+  = Pw. To calculate the "TESTCODE" for a window we first  lookup  the
+  Pcod  for each of the calculated compositional and positional values
+  and  then  calculate  TESTCODE=sum(Pcod*Pw).  TESTCODE  is   plotted
+  relative to three levels of decision: the top division="coding", the
+  middle="no opinion" and the bottom division="non coding".
+ @49. TX 6 @ tRNA gene search.
+
+        Used to find segments of a sequence that might code for tRNAs.
+  Looks  for  potential cloverleaf forming structures and then for the
+  presence of expected conserved bases. Presents  results  graphically
+  or draws out the cloverleafs.
+
+        If dialogue is requested a large number of parameters need  to
+  be given values, including some loop lengths, scores for each of the
+  four stems, and scores for the conserved bases.
+
+        The program was first  described  in  Staden  Nucl.  Acid  Res
+  817-825  (1980). The tRNA's  that  have been  sequenced  so far have
+  two characteristics that can be used to locate  their  genes  within
+  long  DNA  sequences.   Firstly  they   have   a  common   secondary
+  structure  -  the  cloverleaf  -  and   secondly,  particular  bases
+  almost always appear at certain  positions  in the cloverleaf.   The
+  cloverleaf  is composed of four base-paired stems  and  four  loops.
+  Three  of  the  stems  are  of  fixed  length  but  the fourth,  the
+  dhu  stem which usually has four  base  pairs,  sometimes  has  only
+  three.   All  of  the  loops  can  vary  in  size.    The  following
+  relationships between the stems in the cloverleaf are assumed in the
+  program:   (a) there are no bases between one end  of  the aminoacyl
+  stem  and  the  adjoining tuc stem;  (b) there are two bases between
+  the  aminoacyl stem and the dhu stem;  (c) there is one base between
+  the  dhu  stem and the anticodon stem;  (d) there are at least three
+  bases  between  the  anticodon  stem  and the tuc stem.  The program
+  looks first for cloverleaf structure and then,   if  required,   for
+  conserved bases.  The sizes of the loops, the number of basepairs in
+  the stems and the required conserved bases  may  all  be   specified
+  by  the  user.  The process of looking for the presence of conserved
+  bases  can  reduce  the   number   of   potential  structures  found
+  considerably.   The  user  may  also  specify  that an intron may be
+  present in the anticodon loop.
+
+        The user may define a minimum number of base  pairs  for  each
+  stem  using  the  scoring system G-C, A-T=2 and G-T=1 and scores for
+  each of the conserved bases. Recommended values for the stem  scores
+  are  given  by  the  prompts  and the percentage conservation of the
+  conserved bases as found in the Nucl. Acid Res  1979   paper  Gauss,
+  Gruter  and  Sprinzl  are also given, but the user must decide which
+  bases are most  likely  to  be  conserved  for  the  sequence  being
+  examined.  The output shows the position of the possible gene in the
+  sequence by a vertical line the height of which shows the number  of
+  basepairs  made in the stems. The cloverleaf structure is also drawn
+  but will scroll up off the screen. Output of  the  cloverleafs  will
+  look like:
+
+         6942
+                      A
+                    A-U
+                    A-U
+                    G-C
+                    A-U
+                    U-A
+                    A-U
+                    U-A      AAU
+                    U   UAUCU
+            AA    A    !!!!!
+              AAUG     AUAGA   A
+           U  !!!!     U    UCA
+           C  UUAC      U
+            AA    A
+                   U-AA A
+                   A-U
+                   A-U
+                   C-G
+                   U-A
+                  U   A
+                  U   A
+                   GUC
+
+   Typical dialogue follows.
+
+  ? Menu or option number=D49
+   tRNA search
+  ? Maximum trna length (70-130) (92) =
+  ? Aminoacyl stem score (0-14) (11) =
+  ? Tu stem score (0-10) (8) =
+  ? Anticodon stem score (0-10) (8) =
+  ? D stem score (0-8) (3) =
+  ? Minimum base pairing total (30-32) (32) =
+  ? Minimum intron length (0-30) (0) =
+  ? Minimum length for TU loop (4-12) (6) =
+  ? Maximum length for TU loop (6-12) (9) =
+  ? (y/n) (y) Skip search for conserved bases n
+  Give a score for each base, then a minimum total at the end
+  ? Base  8, T is 100% conserved. Score (0-100) (0) =
+  ? Base 10, G is  95% conserved. Score (0-100) (0) =
+  ? Base 11, Y is  96% conserved. Score (0-100) (0) =
+  ? Base 14, A is 100% conserved. Score (0-100) (0) =
+  ? Base 15, R is 100% conserved. Score (0-100) (0) =
+  ? Base 21, A is  97% conserved. Score (0-100) (0) =
+  ? Base 32, Y is 100% conserved. Score (0-100) (0) =
+  ? Base 33, T is  98% conserved. Score (0-100) (0) =
+  ? Base 37, A is  91% conserved. Score (0-100) (0) =
+  ? Base 48, Y is 100% conserved. Score (0-100) (0) =
+  ? Base 53, G is 100% conserved. Score (0-100) (0) =
+  ? Base 54, T is  95% conserved. Score (0-100) (0) =
+  ? Base 55, T is  97% conserved. Score (0-100) (0) =
+  ? Base 56, C is 100% conserved. Score (0-100) (0) =
+  ? Base 57, R is 100% conserved. Score (0-100) (0) =
+  ? Base 58, A is 100% conserved. Score (0-100) (0) =
+  ? Base 60, Y is  92% conserved. Score (0-100) (0) =
+  ? Base 61, C is 100% conserved. Score (0-100) (0) =
+  ? Minimum total conserved base score (0-0) (0) =
+  ? (y/n) (y) Plot results n
+
+   Searching
+
+         306
+                     C
+                   C-G
+                   C-G
+                   G-C
+                   T-A
+                   C-G
+                   A-T
+                   T+G     AT
+                  A   ATACA
+          TTC    T    !!!!   G
+             CTGT     TATGG  G
+         G    ! !     T    GA
+         C   TAAA      C
+          GCG    C      G
+                  T+GA   C
+                  C-G C   T
+                  T+G  A   T
+                  T-A   G   T
+                  T-A    G   A
+                 G   G    G   C
+                 A   A     G   A
+                  AGC       T   C
+                             A   T
+                              C   T
+                               A
+                                C T
+
+
+ @50. TX 7 @ Plot start codons
+
+        This function plots the positions of all start codons for each
+  of the three reading frames.
+ @51. TX 7 @ Plot stop codons
+
+        This function plots the positions of all stop codons for  each
+  of the three reading frames.
+ @52. TX 7 @ Plot stop codons on the complementary strand
+
+        This function plots the positions of all stop codons for  each
+  of the three reading frames on the complementary strand.
+ @53. TX 7 @ Plot stop codons on both strands
+
+        This function plots the positions of all stop codons for  each
+  of the three reading frames on both strands.
+ @54. TX 5 @ Search for longest open reading frames
+
+        This function will report the positons  of  the  ends  of  all
+  sections  of  sequence  that contain no stop codons. All six reading
+  frames are examined. Results are presented in the form  of  an  EMBL
+  feature  table.  Hence if the results are stored in a file by use of
+  "direct output to disk", the file can be used to translate the  open
+  reading frames in a sequence.  Note that in order for the file to be
+  used as a feature table it  must  include  either  EMBL  or  GenBank
+  headers,  and  a  suitable  "tail".  The simplest header is the word
+  FEATURES starting in column 1 of the first line  of  the  file.  The
+  simplest  tail  is 2 empty lines at the end of the file. These lines
+  are not included when  nip  writes  out  results  in  feature  table
+  format.
+
+        Define the minimum length of open reading frame to report  (in
+  amino  acids).  Choose to search either or both strands. The program
+  displays the end points, the reading frame and strand.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=D54
+   Find open reading frames
+  ? Minimum open frame in amino acids (5-1000) (30) =100
+
+  X 1 + strand only
+    2 - strand only
+    3 Both strands
+  ? 0,1,2,3 =3
+
+  FT   CDS           1    831       1    831
+  FT   CDS        1540   2853       1   1314
+  FT   CDS        3130   4242       1   1113
+  FT   CDS        5761   6114       1    354
+  FT   CDS        6187   6711       1    525
+  FT   CDS        1766   2077       2    312
+  FT   CDS        2078   2446       2    369
+  FT   CDS        4136   5500       2   1365
+  FT   CDS        1335   1637       3    303
+  FT   CDS        2844   3194       3    351
+  FT   CDS        6819   7238       3    420
+  FT   CDS        2073   1711  C    1    363
+  FT   CDS        2469   2149  C    1    321
+  FT   CDS        6542   6144  C    3    399
+
+ @55. TX 8 @ Search for E. coli promoter (general)
+
+        Searches for E coli promoter like sequences using  a  standard
+  weight matrix. The positions of the matches are plotted. No dialogue
+  is required.
+
+        The method was first described in Staden R. Nucl. Acid Res. 12
+  505-519  1984.   This  search  uses  a  weight matrix taken from the
+  frequency tables contained in Hawley, D. K. and McClure, R., nar  11
+  2237-2255 (1983).  The weight matrix is divided into 3 sections that
+  are separated by varying sizes of gap: the - 35 region, the -10  and
+  the  +1  region.   The algorithm first looks for a sufficiently good
+  -35 region, then for the best -10 region within range and  then  for
+  the  best  +1  region  within range of the -10; each separate region
+  must score above  the  lowest  known  score  for  the  corresponding
+  section. The gap penalty is then applied and two plots produced: one
+  with gap penalties, one without.  Scaling  is  such  that  no  known
+  promoter  scores below the bottom level and no known promoter scores
+  above the top level when the weight matrix is applied.
+
+        Two other functions also look for E. coli promoters: 92  looks
+  for  sites  on  the complementary strand and 93 looks for individual
+  -35 and -10 regions and plots them on a scale such the  top  is  the
+  highest known value +10% and the bottom is the lowest known -10%
+ weights for E. coli promoters
+ -35 region:
+ P -50-49-48-47-46-45-44-43-42-41-40-39-38-37-36-35-34-33-32-31-30-29-28-27-26
+
+ 107109109110110110110110110111111110111112112112112112112112112112112112112
+ T  41 33 32 25 34 22 35 35 42 27 32 42 47 14 92 94 11 19 15 37 46 34 38 48 34
+ C  22 27 18 29 20 14 20 12 22 23 16 25 10 43  7  6 11 18 60  8 25 23 23 17 20
+ A  28 38 30 37 35 56 42 42 37 42 39 18 25 26  2  6  2 72 26 50 26 34 25 26 31
+ G  16 11 29 19 21 18 13 21  9 19 24 26 29 29 11  6 88  3 11 17 15 21 26 21 27
+ -10 region:
+ P -23-22-21-20-19-18-17-16-15-14-13-12-11-10 -9 -8 -7 -6 -5
+   112112112112112112112112112112112112112112112112112112112
+ T  35 28 28 27 39 51 34 43 26 31 89  3 49 15 19108 31 29 21
+ C  34 21 24 27 12 25 20 25 20 27 10  2 16 14 22  3 13 16 30
+ A  20 39 33 33 39 23 29 16 23 19  2106 29 66 57  1 35 23 31
+ G  23 24 27 25 22 13 29 28 43 35 11  1 18 17 14  0 33 24 30
+ + region:
+ P -2 -1  1  2  3  4  5  6  7  8  9 10
+   86 88 85 88 88 88 88 88 88 88 88 88
+ T 16 22  2 42 27 23 20 25 27 15 16 29
+ C 29 49  4 25 25 13 18 22 17 17 16 17
+ A 20  9 45 16 24 25 28 24 24 32 35 26
+ G 21  8 37  5 12 27 22 17 20 24 21 16
+ Notes:  E. coli promoters have been shown to  contain  2  regions  of
+ conserved  sequence  located  about  10  and 35 bases upstream of the
+ transcription startsite. These are TATAAT and TTGACA with an  allowed
+ spacing  of  15  to  21  bases  between.  The  spacing  with  maximum
+ efficiency was 17 bases and all but 12 of the 112 sequences could  be
+ aligned  with  a  separation of 17 +or-1 bases. The standard promoter
+ has spacing 7 and 17 bases between the startsite and the -10  region,
+ and  the  -10  and -35 regions, respectively. The spacing between the
+ -10 region and the startsite is usually  6  or  7  bases  but  varies
+ between  4  and 8 bases.  There is an AT rich region of 8 to 10 bases
+ upstream of the  -35  region.   Iniation  with  a  purine  is  highly
+ prefered with G being used if A is not present.
+ Gap penalties:
+         15 0.02   (only exists as mutant)
+         16 0.2
+         17 1.0
+         18 0.2
+         19 0.05   (guess)
+         20 0.02   (guess)
+         21 0.01   (guess)
+ @56. TX  8 @ Search for E. coli promoter (general) strand
+
+        This  function  searches  for  E.  Coli   promoters   on   the
+  complementary strand of the sequence. See the notes on option 55.
+ @57. TX 8 @ Search for E. coli promoter sequences. (-35 and -10)
+
+        This  function  searches  separately  for  the  -35  and   -10
+  sequences of an E. coli promoter. See the notes on option 55.
+ @58. TX 8 @ Search for procaryotic ribosome binding sites
+
+        This function searches for the 5' ends  of  prokaryotic  genes
+  using  an  unusual  weight  matrix.  The  search  is relatively slow
+  because the matrix is 101 bases in length. No dialogue is required.
+
+        The method was first described in Staden Nucl.  Acid  Res.  12
+  505-519  1984.  This actually looks for more than a ribosome binding
+  site as is explained below.  This uses their weight matrix  w101  of
+  Stormo  and Schneider (NAR 10 2971-3024, 1982) which with a value of
+  2 finds all gene starts in their library.
+  P-60-59-58-57-56-55-54-53-52-51-50-49-48-47-46-45-44-43-42-41-40-39-38-37-36
+  T  5  1 -3  9-14  7 15 -5  3-16-17  4 18  5 -3 -1  2  4  5 -5  7  8 -5-15  6
+  C-21 -6-11-21  0  8 -7-12 -1  1  0-19 12 -3 -1 10  2 -8 -5-11  8  1 23  6 -5
+  A  7 -2 13 -2 -8-13-18  5  0 -5 13  8-15  9 -4 -7  9  0 -8-11-10 -6 -7 -5 -6
+  G -6 -9 -7  0  8-16 -4 -2-16  1 -4  8-14  5 11-13-24  3  7 22-11 -9-15 10 -4
+
+  P-35-34-33-32-31-30-29-28-27-26-25-24-23-22-21-20-19-18-17-16-15-14-13-12-11
+  T  3  4 16 -4  7 11 -4 -1 12  8 10 -1  1  8  2-10-16 11  1 -3 16 -3-36 -8-27
+  C  2-14 -3 -8-10-21  2  0 -2 -1-11 -3 -1  5-11 -4  7  0-14  6 -8-20 -7-36-44
+  A-12 -1-27 -3 -6  0-12 -3 -4 -7 14 -2 -4 -6  0 12  5 -9  0-11-11 10  8  2  8
+  G  4 -5 -6 -3 -1 -4 -1 -4-15  0-14  3 10-19 -3-10 -7 -7  7  1 -8 -6 15 21 42
+
+  P-10 -9 -8 -7 -6 -5 -4 -3 -2 -1  0  1  2  3  4  5  6  7  8  9 10 11 12 13 14
+  T-53-27-26-23  2 -7-14-40-28  0-53 75-62-20-40-10-35 -5-12 -1  4 14-23  7 -2
+  C-15-50-43-35-38-29-29  1 -9  1-87-55-64-45 11-22-14-20-15-15-10-22 -5  2  6
+  A  0 -3 -5  4-20-11  5  6 -2-15 66-69-52 -5 -4  6  8-24 -7-10 -7 13 14 -9-18
+  G 35 22 16 -6 -5-15-25-33-28-53-36-50107 -5-37-44-27-15-23-16-29-47-17-29-15
+
+  P 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39
+  T-26  1  4 -7  3 -4  0-10  8-18  7-22-21  8  4 -3 -6  7 -8  1 -5-16-16  7 -6
+  C  6 -8 19 -7  9 -3 17 -2  3 -9  5 22 22  8 -1  1 18  6 11-10 -8  7 10  0  7
+  A 14-12-42  1 -5 -4-32 12-10 20 -6 -1  3 -4  4-10 -1 -2-14 11 14 -3  2-13  5
+  G-23 -7 -1 -6-17 -4  0-15-14 -4-17-10 -5-13 -8 10-13-13  9 -4 -3 10  2  4 -8
+
+  P 40
+  T  0
+  C 14
+  A  5
+  G-21
+ These come from w101 of Stormo, Schneider, Gold and Ehrenfeucht Nucl.
+ Acid  Res.  10 2997- 3011, 1982. They report that this matrix gives a
+ score of at least 2 for all gene starts in their library whereas  all
+ other sequences score 1 or less.
+ @29. TX 1 @ Reverse and complement the sequence
+
+        Reverses and complements the  current  active  region  of  the
+  sequence.
+ @60. TX 7 @ Search using a dinucleotide weight matrix
+
+        This function performs  searches  for  short  sequence  motifs
+  using an appropriate  dinucleotide weight matrix. In addition it can
+  be used to create or modify weight matrices. In order to  perform  a
+  search  the  only  input required is the name of the file containing
+  the weight matrix.  The results  can  be  presented  graphically  or
+  listed. The graphical presentation will draw line at the position of
+  any matches found; the height of the line  is  proportional  to  the
+  score. The method is identical to that using weight matrices derived
+  from nucleotide frequencies, except that here we use the frequencies
+  of dinucleotides.
+
+        For a search, select "use weight matrix", supply the  name  of
+  the  file  containing  the  weight matrix, and choose between having
+  results plotted  or  listed.  If  dialogue  is  requested  when  the
+  function is selected users can alter the cutoff score employed.
+
+        To create a weight matrix several steps are involved.  A  file
+  containing an alignment of known motifs is required. (This file must
+  be created before the current option is selected. The  format  is  a
+  follows:  each  sequence is written on a separate line with at least
+  one space at the beginning; each sequence is terminated by  a  space
+  character,  and  can  be  followed  by a name. The sequences must be
+  aligned.) Supply the name of the  file  of  aligned  sequences.  The
+  program  reads  and  displays the sequences. Choose between "summing
+  logs of weights" or summing weights (i.e. whether to multiply or add
+  weights).  If  logs  are used all scores will be negative. Choose if
+  all positions in the set of aligned sequences should be used or if a
+  mask should be applied. If so selected, define a mask as a string of
+  symbols, in which symbol - means ignore and any other  symbol  means
+  use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+
+        The program will calculate weights as the frequencies  of  the
+  dinucleotides  at  each  unmasked  position  in  the  set of aligned
+  sequences. These weights are then applied  to  the  set  of  aligned
+  sequences  to  give  a  range   of  "observed"  scores. The mean and
+  standard deviation of these scores is displayed. The user  is  asked
+  to  supply  several  values  to  be  used  when the weight matrix is
+  applied to other sequences: a cutoff score  (by  default,  the  mean
+  minus  3  standard  deviations),  a  top score for scaling graphical
+  results (by default, the mean plus 3  standard  deviations),  and  a
+  position  to  identify  (this means that if a particular base within
+  the motif is used as a "landmark", such as the A of the AG in splice
+  acceptor  sites,  then  its  position  will be marked in plots). All
+  these values are stored along with the weight matrix. Finally supply
+  the name of a file to contain the weight matrix.
+
+        Weight matrices can be  "rescaled"  using  a  set  of  aligned
+  sequences  in much the same ways as a matrix is created. The purpose
+  is to redefine the cutoff scores, and rescaling does not  alter  any
+  other values in the weight matrix file.
+
+        The methods have always had to deal with the problem of zeroes
+  in  the  matrices.  The  current  versions  employ  "Laplaces Law of
+  Succession" in which 1 is added to each term.
+  Typical dialogue follows.
+
+  ? Menu or option number=D60
+
+   Motif search using dinucleotide weight matrix
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 = 2
+  ? Name of aligned sequences file=[RS.MOTIFS]GCN4.SEQ
+
+
+       1 AGCGTGACTCTTCCCGGAA HIS1
+       2 GAGGTGACTCACTTGGAAG HIS1
+       3 CGGATGACTCTTTTTTTTT HIS3
+       4 ACAGTGACTCACGTTTTTT HIS4
+       5 GTCGTGACTCATATGCTTT ARG3
+       6 TGAATGACTCACTTTTTGG ARG4
+       7 TTCTTGACTCGTCTTTTCT CPA1
+       8 CGAATGACTCTTATTGATG CPA2
+       9 AGAATGACTAATTTTACTA TRP5
+      10 TCGTTGACTCATTCTAATC TRP3
+      11 TTGCTGACTCATTACGATT TRP2
+      12 GAGATGACTCTTTTTCTTT IV1
+      13 GCGATGATTCATTTCTCTG IV2
+      14 TAGATGACTCAGTTTAGTC LEU1
+      15 TAAGTGACTCAGTTCTTTC LEU4
+      16 ATGATGACTCTTAAGCATG ILS1
+  Length of motif    18
+  ? (y/n) (y) Sum logs of weights n
+  ? (y/n) (y) Use all motif positions n
+  x means use, - means ignore
+  e.g. xx-x---x-x means use positions 1,2,4,8,10
+  ? Mask=----XXXXXXXX--------
+   Applying weights to input sequences
+     1       89.000 AGCGTGACTCTTCCCGGA
+     2       91.000 GAGGTGACTCACTTGGAA
+     3       93.000 CGGATGACTCTTTTTTTT
+     4       90.000 ACAGTGACTCACGTTTTT
+     5       94.000 GTCGTGACTCATATGCTT
+     6       91.000 TGAATGACTCACTTTTTG
+     7       81.000 TTCTTGACTCGTCTTTTC
+     8       90.000 CGAATGACTCTTATTGAT
+     9       75.000 AGAATGACTAATTTTACT
+    10       97.000 TCGTTGACTCATTCTAAT
+    11       97.000 TTGCTGACTCATTACGAT
+    12       93.000 GAGATGACTCTTTTTCTT
+    13       69.000 GCGATGATTCATTTCTCT
+    14       90.000 TAGATGACTCAGTTTAGT
+    15       90.000 TAAGTGACTCAGTTCTTT
+    16       90.000 ATGATGACTCTTAAGCAT
+  Top score      97.000  Bottom score      69.000
+  Mean      88.750  Standard deviation       7.319
+  Mean minus 3.sd      66.794  Mean plus 3.sd     110.706
+  ? Cutoff score (-999.00-9999.00) (66.79) =
+  ? Top score for scaling plots (66.79-999.00) (110.71) =
+  ? Position to identify (0-18) (1) =
+  ? Title=GCN4 DI WTS
+  ? Name for new weight matrix file=3.WTS
+
+  ? Menu or option number=D60
+   Motif search using dinucleotide weight matrix
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =
+  ? Motif weight matrix file=3.WTS
+   GCN4 DI WTS
+  ? Cutoff score (-9999.00-9999.00) (66.79) =40
+  ? (y/n) (y) Plot results n
+       15     42.00 CAACCCGCTCACCGACAA
+       29     42.00 ACAACAGCTCACCCACGC
+       93     46.00 AGCCTTCCTCATCGCTGC
+      153     40.00 CAGCGGAATCAAACTTAA
+      408     42.00 CGATGGATTCAAGTTGAA
+      469     47.00 TTAGGAACTCCCTCTGTC
+      493     60.00 AAGCTGAATCTTAGCAGC
+      530     43.00 CGGAGGGCTCAGTGAGGG
+      542     47.00 TGAGGGACTACTGCACCA
+      678     41.00 CTTCTGCTTCAAAGAGTT
+      709     47.00 AATATGACGGCGCACGTG
+      848     54.00 GTCAGAACTCAAATCAGT
+      940     49.00 CCGTTGACGACCTCCGCA
+      992     42.00 TGGGCACCTCACACCAAG
+
+
+ @61. TX 8 @ Search for eukaryotic ribosome binding sites
+
+        Searches  for  eukaryotic   ribosome   binding   sites   using
+  weightings  derived  from  Sargan,Gregory,Butterworth  febs  let 147
+  133-136 1982.  No dialogue is required. First  described  in  Staden
+  Nucl. Acid Res. 12 505-519 1984.
+ mRNA WTS FOR EUKARYOTES SARGAN,GREGORY,BUTTERWORTH FEBS LET
+ 147 133-136 1982
+ P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+   102102102102102102102102102102
+ T  19 24 31 12  0 18  5  0102  0
+ C  20 15 32 65  5 42 52  0  0  0
+ A  50 27 27 19 86 36 34102  0  0
+ G   6 29 12  6 11  6 11  0  0102
+ VIRAL ONLY
+ P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+    41 41 41 41 41 41 41 41 41 41
+ T  14 12 16  4  2 13  9  0 41  0
+ C   7  3 13 17  7  9 14  0  0  0
+ A  15 10  6 10 27 15  9 41  0  0
+ G   5 16  6 10  5  4  9  0  0 41
+ The Sargan et al paper puts forward the hypothesis that there  is  an
+ interaction between some mRNA leader sequences and a highly conserved
+ structure in the 18S rRNA of eukaryotic  ribosomes.  The  attempt  to
+ substantiate  the hypothesis includes a table of base frequencies for
+ sequences  immediately  5'  to  start  codons.   They  examined   102
+ sequences and I have used the base frequencies they found as a weight
+ matrix for searching for eukaryotic gene starts. I don't yet know how
+ good  this  method  is. The viral sequences were found to be slightly
+ different but the separate table  shown  here  is  not  used  in  the
+ program.
+ @62. TX 8 @ Search for splice junctions
+
+        Used to search  for  mRNA  splice  junctions  using  a  weight
+  matrix.  The  default  weight  matrix is still that derived from the
+  paper of Mount (Nucl. Acids Res. 10,  459-472).  However  users  may
+  employ  their  own  tables.   By  default  the positions of possible
+  junctions will be plotted rather than listed.   The  diagram  splits
+  the  donor plot into 3 horizontal boxes so that all the sites marked
+  in any box are from  the  same  reading  frame.  The  acceptor  plot
+  appears  above  the donor plot and is split in an equivalent way. So
+  sites marked  as  donors  and  acceptors  in  equivalent  boxes  are
+  compatible.  i.e.  donors  from  donor  box  1  are  compatible with
+  acceptors from acceptor box 1, etc. Of course it is the  combination
+  of  reading  frame  and splice sites that really matters, and donors
+  from box 1 can be compatible with acceptors in box 3 if the  reading
+  frame switches.
+
+        If dialogue is selected users can employ  their  own  file  of
+  weights  (see  below  for the format), can change the cutoff scores,
+  and can elect to have the results listed rather than plotted. Listed
+  results  show  the position (of the last or first base in the exon),
+  the frame and the matching  sequence.   The  frequency  table  shown
+  below  is  used  as  a  default  weight  matrix  and  AG  and GT are
+  obligatory at the appropriate positions.  The plots  are  scaled  so
+  that  the  top  of scale is the highest value achieved by a junction
+  sequence in the set used to compile the  frequency  table,  and  the
+  bottom  of  the  scale  is  the  lowest value achieved by a junction
+  sequence in the set used to compile the frequency table.
+
+        In the light of current knowledge it  would  be  sensible  for
+  users to use the weight matrix search option (20) to create matrices
+  that define  more specific splice junctions. If so it  is  important
+  that  the positions "marked" are the last base in the donor exon and
+  the first base in  the  acceptor  exon.  To  make  a  weight  matrix
+  suitable  for  use  with  this  function follow the instructions for
+  option 20 and create files for both donor and acceptor  sites.  Then
+  concatenate  the  two  matrix files with the donor file first.  Note
+  that any positions in the weight matrix that are 100% conserved will
+  be made obligatory (normally the AG and GT).
+
+  Mount donors redone 16-4-91
+      12     3   -16.085    -7.500
+  P  -2  -1   0   1   2   3   4   5   6   7   8   9
+  N 136 136 136 136 136 136 136 136 136 136 136 136
+  T  28   8  15  17   0 136   9  16   7  84  30  36
+  C  41  60  16   7   0   0   3  13   3  17  28  39
+  A  40  56  89  12   0   0  83  91  12  23  53  33
+  G  27  12  16 100 136   0  41  16 114  12  25  28
+  Mount acceptors redone 16-4-91
+      18    15   -26.142   -14.400
+  P -14 -13 -12 -11 -10  -9  -8  -7  -6  -5  -4  -3  -2  -1   0   1   2   3
+  N 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113
+  T  58  50  57  59  67  56  58  49  47  66  64  31  34   0   0  11  41  31
+  C  21  28  34  25  29  33  35  32  42  40  33  25  74   0   0  23  28  41
+  A  17  11  11  18   7  17  12  23  15   3  10  29   5 113   0  24  21  21
+  G  17  24  11  11  10   7   8   9   9   4   6  28   0   0 113  55  23  20
+ @63. TX 7 @ Search using a weight matrix (complementary)
+
+        This  function  searches  the  complementary  strand  of   the
+  sequence   using  a  weight  matrix.  Many motifs can bind to either
+  strand of the DNA and this  function  allows  users  to  search  the
+  complementary strand without having to change the orientation of the
+  sequence. See option 20 for more details.
+ @64. TX 3 @ Plot observed-expected word frequencies
+
+        This  option is designed to examine the  abundances  of  short
+  words  in  a  sequence to see if particular ones are either under or
+  over represented. It compares the observed and expected  frequencies
+  and  plots  them along the sequence. There has been some work on the
+  relative amounts of CG dinucleotides  in  eukaryotic  sequences  (eg
+  Bird,  Nature  321, 209-213 (1986)) and this new routine can be used
+  to examine such biases, or any others that might be interesting.
+
+        The user selects a word - say CG -, a  window  length,  and  a
+  maximum  and  mininum  scale  for  plotting the results. The program
+  examines each sucessive window length along the sequence, with  each
+  window  overlapping  the previous one by windowlength-1. The program
+  counts the base frequencies  in  each  window,  and  the  number  of
+  occurrences  of  the  chosen  word within the window. Using the base
+  frequencies it calculates an expected number of occurrences for  the
+  chosen  word  (simply  by  multiplying the relevant frequencies). It
+  plots observed-expected, and hence will show regions that  are  rich
+  or  depleted  in  the  chosen  word.  The  longest allowed word is 9
+  characters, but the calculation of the expected frequencies  becomes
+  less appropriate as the word length increases above 2.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=D64
+  Plot composition differences (obs-exp))
+  Default String=CG
+  ? String=
+  ? odd span length (3-401) (101) =
+  ? plot interval (1-20) (5) =
+  ? Maximum plot value (-6.31-25.25) (6.31) =
+  ? Minimum plot value (-25.25-6.31) (-6.31) =
+
+   Missing graphics display here
+
+ @65. TX 9 @ Search for polya sites
+
+        Simply  searches  for  the  sequence  AATAAA  (Proudfoot   and
+  Brownlee  Nature  263,  211-214,  1982)  and  marks  it with a short
+  vertical line.
+ @66. TX 1 @ Interconvert t and u
+
+        This function interconverts T and U characters in  the  active
+  sequence i.e between DNA and RNA.
+ @67. TX 7 @ Search for patterns of motifs
+
+        This option searches for patterns of motifs. Patterns  can  be
+  defined  interactively  or read from files. Results can be displayed
+  in several ways in both graphical and textual form. Used  to  create
+  pattern  files  for  searching  libraries.  The  option is extremely
+  flexible and  consequently  the  following  documentation  is  quite
+  lengthy.  However the routine is capable of searching for almost any
+  known pattern. In addition  the  flexibility  does  not  necessitate
+  difficulty  of  use,  and  the  userinterface  has  been  simplified
+  considerably since the methods were first published.
+
+        Users should refer to the "typical dialogue" shown  below  for
+  the most helpful information on using the program.
+
+        There  are  currently  four  ways  to  display  the   matching
+  patterns:  1=each individual motif and its position is listed; 2=all
+  the sequence between, and including  the  two  outermost  motifs  is
+  listed;  3=graphical,  with  a vertical line marking the position of
+  the leftmost motif; 4 = EMBL feature table format, where the  KEYNAM
+  field  if  the motif name, the FROM and TO fields denote the ends of
+  the match, and the DESCRIPTION field is "Program".
+
+        When it is defined for  the  first  time  a  pattern  must  be
+  entered  interactively  at the keyboard, but the pattern description
+  can be saved to a file. This file can be  used  for  all  subsequent
+  searches.
+
+        When defining a pattern interactively select a motif class and
+  the program will request the required inputs.
+
+        The program gives each motif an identifying name  and  number.
+  For  motifs  other than the first, a range of allowed positions must
+  be defined (Note that sets of motifs included using the OR  operator
+  will  all  be  given  the  same  range, and so the program will only
+  request range values for the first  motif  in  any  such  set).   To
+  specify  the  allowed  range  for  a  motif the user must supply the
+  following: the identifying number of the motif,  relative  to  which
+  the   current  motifs  positions  are  to  be  defined  (termed  the
+  "reference motif"); a "relative start position"  and  a  range.  The
+  relative  start  position  can  be  negative or positive. A negative
+  start position means that although the reference motif  is  searched
+  for  first,  the  current  motif  can  be found to its left.  A zero
+  relative start position means their left ends are superimposed.  The
+  default  start  position is to butt-joint the motif to righthand end
+  of the  "reference  motif".  The  range  is  "the  number  of  extra
+  positions" that the motif can take.
+
+        The program will  display  the  probability  of  finding  each
+  motif.  These  values  are presented in the following form: .1234E-5
+  means 0.1234 times 10 to the power -5.
+
+        After the pattern has been defined, the program  will  type  a
+  description of it on the screen. It will then allow the user to give
+  an overall cutoff score and overall probability cutoff.
+
+        Typical dialogue  for  all  the  different  motif  classes  is
+  displayed below.
+
+  ? Menu or option number=67
+    Pattern searcher
+  ? (y/n) (y) Read pattern from keyboard
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =
+  ? Motif name=Ematch
+  ? String=AA
+  Probability of score     2.0000 = 0.595E-01
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =2
+  ? Motif name=AAA
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-1) (1) =
+  ? Relative start position (-1000-1000) (3) =
+  ? Number of extra positions (0-1000) (0) =
+  ? string=AAA
+  ? Minimum matches (1.00-3.00) (3.00) =2
+  Probability of score     2.0000 = 0.149E+00
+    1 Exact match
+  X 2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =3
+  ? Motif name=T'S
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-2) (2) =
+  ? Relative start position (-1000-1000) (4) =
+  ? Number of extra positions (0-1000) (0) =
+  ? String=TTT
+  ? Minimum score (0.00-108.00) (108.00) =72
+  Probability of score    72.0000 = 0.258E+00
+    1 Exact match
+    2 Percentage match
+  X 3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =4
+  ? Motif name=GCN4
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-3) (3) =
+  ? Relative start position (-1000-1000) (4) =
+  ? Number of extra positions (0-1000) (0) =
+  ? Weight matrix file name=GCN4
+   GCN4 FROM WEIGHTS 17-11-87
+  Probability of score   -22.0020 = 0.139E-02
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+  X 4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =5
+  ? Motif name=GCN4
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-4) (4) =
+  ? Relative start position (-1000-1000) (20) =
+  ? Number of extra positions (0-1000) (0) =
+  ? Weight matrix file name=GCN4
+   GCN4 FROM WEIGHTS 17-11-87
+  Probability of score   -22.0020 = 0.606E-03
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+  X 5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =6
+  ? Motif name=LOOP
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-5) (5) =
+  ? Relative start position (-1000-1000) (20) =
+  ? Number of extra positions (0-1000) (0) =
+  ? Stem length (1-60) (6) =
+  ? Minimum loop length (-6-60) (0) =
+  ? Maximum loop length (0-60) (0) =5
+  ? Minimum score (1.00-12.00) (12.00) =10
+  Probability of score    10.0000 = 0.598E-02
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+  X 6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =7
+  ? Motif name=Tstep
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-6) (6) =
+  ? (y/n) (y) Relative to 5 prime end
+  ? Relative start position (-1000-1000) (1) =
+  ? Number of extra positions (0-1000) (0) =
+  ? String=TTT
+  ? Step (1-20) (3) =
+  Probability of score     3.0000 = 0.367E-01
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+  X 7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =8
+  ? Motif name=REPEAT
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-7) (7) =
+  ? Relative start position (-1000-1000) (4) =
+  ? Number of extra positions (0-1000) (0) =2
+  ? Repeat length (1-60) (6) =
+  ? Minimum gap (0-60) (0) =
+  ? Maximum gap (0-60) (0) =4
+  ? Minimum score (1.00-6.00) (6.00) =5
+  Probability of score     5.0000 = 0.554E-02
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+  X 8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =9
+  ? (y/n) (y) Save pattern in a file N
+
+  Pattern description
+
+  Motif  1 named Ematch   is of class    1
+  Which is an exact match to the string
+  AA
+  Motif  2 named AAA      is of class    2
+  which is a match of score     2. to the string
+  AAA
+  and the 5 prime base can take positions      3 to       3
+  relative to the 5 prime end of motif   1
+  It is anded with the previous motif.
+  Motif  3 named T'S      is of class    3
+  which is a match of score    72. to the string
+  TTT
+  and the 5 prime base can take positions      4 to       4
+  relative to the 5 prime end of motif   2
+  It is anded with the previous motif.
+  Motif  4 named GCN4     is of class    4
+  Which is a match to a weight matrix with score -22.002
+  and the 5 prime base can take positions      4 to       4
+  relative to the 5 prime end of motif   3
+  It is anded with the previous motif.
+  Motif  5 named GCN4     is of class    5
+  Which is a match to the complement of a weight matrix with score -22.002
+  and the 5 prime base can take positions     20 to      20
+  relative to the 5 prime end of motif   4
+  It is anded with the previous motif.
+  Motif  6 named LOOP     is of class    6
+  Which is a stem-loop structure with stem length    6 and score    10.
+  The loop can have sizes      0 to      5
+  and the 5 prime base can take positions     20 to      20
+  relative to the 5 prime end of motif   5
+  It is anded with the previous motif.
+  Motif  7 named Tstep    is of class    7
+  Which is an exact match to the string
+  TTT
+  with a step size of     3
+  and the 5 prime base can take positions      1 to       1
+  relative to the 5 prime end of motif   6
+  It is anded with the previous motif.
+  Motif  8 named REPEAT   is of class    8
+  Which is a repeat with repeat length    6 and score     5.
+  The loop-out can have sizes      0 to      4
+  and the 5 prime base can take positions      4 to       6
+  relative to the 5 prime end of motif   7
+  It is anded with the previous motif.
+  Probability of finding pattern = 0.2348E-14
+  Expected number of matches  = 0.5100E-09
+  ? Maximum pattern probability (0.00-1.00) (1.00) =
+  ? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+   Select display mode
+  X 1 Motif by motif
+    2 Inclusive
+    3 Graphical
+    4 EMBL feature table
+  ? 0,1,2,3,4 =4
+   Searching
+
+
+  Total matches found      0
+
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structures and repeats
+  m5 = Translation and codons
+  m6 = Gene search by content
+  m7 = Prokaryotic signal search
+  m8 = Eukaryotic signal search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=67
+    Pattern searcher
+  ? (y/n) (y) Read pattern from keyboard
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =
+  ? Motif name=Arun
+  ? String=AAAAAA
+  Probability of score     6.0000 = 0.210E-03
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Complement of weight matrix
+    6 Inverted repeat or stem-loop
+    7 Exact match, defined step
+    8 Direct repeat
+    9 Pattern complete
+  ? 0,1,2,3,4,5,6,7,8,9 =9
+  ? (y/n) (y) Save pattern in a file N
+
+  Pattern description
+
+  Motif  1 named Arun     is of class    1
+  Which is an exact match to the string
+  AAAAAA
+  Probability of finding pattern = 0.2103E-03
+  Expected number of matches  = 0.1522E+01
+  ? Maximum pattern probability (0.00-1.00) (1.00) =
+  ? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+   Select display mode
+  X 1 Motif by motif
+    2 Inclusive
+    3 Graphical
+    4 EMBL feature table
+  ? 0,1,2,3,4 =4
+   Searching
+
+
+  FT   Arun       1582   1587       Program
+  FT   Arun       3160   3165       Program
+  FT   Arun       4204   4209       Program
+  FT   Arun       5691   5696       Program
+  FT   Arun       6710   6715       Program
+  Total matches found      5
+  Minimum and maximum observed scores        6.00        6.00
+
+
+        These methods allow users to define  and  search  for  complex
+  patterns  of  motifs  defined as single objects.  The programs allow
+  individual DNA motifs to be defined in  eight  different  ways,  and
+  protein  motifs  in  six.  Motifs  are  combined,  using the logical
+  operators AND, OR and NOT, to describe a pattern. The  pattern  also
+  specifies   the  ranges  of  allowed  relative  separations  of  the
+  individual motifs.
+
+        First some definitions.
+
+        A MOTIF is a contiguous subsequence of fixed length.   At  its
+  simplest  it  could  be a single definite base or amino acid; a more
+  complex motif might be better represented as a consensus or a weight
+  matrix;  two  more-abstract  types  of motif are direct and inverted
+  repeats.
+
+        A PATTERN is a higher order of structure defined by a list  of
+  motifs.  The  motifs  in  a  pattern  are combined using the logical
+  operators AND, OR  and  NOT.  The  list  also  defines  the  allowed
+  relative  separations  of the motifs. In the current versions of the
+  programs up to 50 motifs can be combined into a single  pattern.  So
+  using these definitions there are two differences between motifs and
+  patterns: 1) the distances between  all  elements  of  a  motif  are
+  fixed,  but  the  separations  of parts of patterns can vary; 2) all
+  characters in a motif are defined using the same method (class), but
+  different  parts of a pattern can be defined in completely different
+  ways.
+
+        Each motif can be represented in 9 ways (known  as  the  motif
+  class):
+
+             MOTIF CLASSES
+  CLASS           DESCRIPTION
+   1       Exact match to a short defined sequence. The IUB symbols
+           can be used for DNA sequences.
+   2       Percentage match to a defined short sequence. In nucleic acids,
+           the IUB symbols can be used.
+   3       Match to a defined sequence, using a score matrix and cutoff
+           score. The DNA matrix (see option 18) gives scores to IUB symbols
+           depending on their level of redundancy. MDM78 is used for proteins.
+   4       Match to a weight matrix with cutoff score.
+   5       As class 4 but on the complementary strand.
+   6       Inverted repeat or stem-loop. Fixed stem length, range of
+           loop sizes, and cutoff score using A-T, G-C=2; G-T=1.
+   7       Exact match to short sequence but with a defined step size.
+   8       Direct repeat. Fixed repeat length, range of loop-out sizes,
+           cutoff score, and score matrix (for protein sequences MDM78 and
+           for nucleic acids an identity matrix).
+   9       Membership of a set. A list of sets of allowed amino acids for
+           each position in the motif. The sets are separated by commas(,).
+           For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino
+           acids in which one of I,V or L must be found in the first position,
+           then anything in the next two positions, D,E,K or R in the fourth
+           position and F,Y,W,I,L,V or M in the fifth. This class only applies
+           to protein sequences because for nucleic acids "membership of a
+  set"
+           can be achieved using IUB symbols.
+
+      Classes 1 - 4, 8 and 9 apply to protein sequences, and classes 1-8 to
+      nucleic acids.
+
+
+        Class 1: exact match.
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids, may include IUB symbols. All symbols must match.
+
+        Class 2: percentage match
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include  IUB  symbols.  The  minimum number of matching
+  characters must also be specified.
+
+        Class 3: match using a score matrix
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include IUB symbols. The motif is not compared directly
+  with the sequence  to  count  the  number  of  matching  characters.
+  Instead  a  matrix is used to provide a score for all possible pairs
+  of characters. The motif score for any position along  the  sequence
+  is  the  sum  of  the scores found by looking-up the scores for each
+  pair of aligned characters. A match  is  declared  if  some  minimum
+  score is achieved.
+
+        Class 4: weight matrix
+
+        The motif is defined by a table of values (called  weights  or
+  scores). The table gives a score for finding each possible character
+  at each position along the length of the  motif.  It  therefore  has
+  dimension  motif-length  x character-set-size, and allows us to give
+  different  scores  for  each  character  at  each  position.  It  is
+  equivalent  to  having  a  different  score matrix for each position
+  along the motif, and provides the most flexible and specific  method
+  of  defining  motifs. The weight matrices are created by program NIP
+  option 20 and stored as files. The file contains the values for each
+  position, as well as an overall minimum score. There are two ways in
+  which these values can be used to calculate an overall score for any
+  section  of  the  sequence. The simplest way is to add the values in
+  the file. (This  means  that  the  highest  possible  score  can  be
+  calculated  by adding the top value at each column position, and the
+  lowest by adding the bottom value.)  The normal  way  of  using  the
+  values  in  the  file  is  as follows. First the programs divide the
+  values in each column by the column total so that they  sum  to  1.0
+  Then  the  natural logs of these values are used as scores. When the
+  matrix is applied to a sequence these logarithmic values are  summed
+  (which  is  of  course  equivalent  to multiplying the frequencies).
+  Note that using the natural logs of the frequencies as  weights  and
+  adding  them  means  that the overall cutoff score must be less than
+  zero, whereas if the original values in the weight matrix  file  are
+  added,  the  cutoff  score  will  be  greater  than zero. The search
+  routines therefore decide whether the user wants to  add  values  or
+  multiply  frequencies by examining the value of the cutoff score: it
+  will add if the  cutoff  is  greater  than  zero  and  add  logs  of
+  frequencies  if  it is less than zero.  Hence we effectively get two
+  motif classes in one. The program NIP, when creating  weight  matrix
+  files,  will  ask  the  user  whether  the scores should be added or
+  multiplied. If the values in the table  have  been  defined  without
+  using a set of aligned sequences it is easier for the user to choose
+  a cutoff score if the values are added.
+
+        Class 5: complement of weight matrix
+
+        The motif is defined by  a  weight  matrix,  but  the  program
+  searches for its complement.
+
+        Class 6: inverted repeat, or stem-loop
+
+        The motif is defined by a repeat length, a minimum score and a
+  range  of  loop  sizes.  The scores are A-T=2, G-C=2, G-T=1, else=0.
+  The loop sizes are defined by a minimum and  maximum  distance  from
+  the  3'  end  of  the  stem.  For a stem-loop these will be positive
+  numbers. For example to define a stem of length  8  and  loop  sizes
+  varying  from  3 to 5, the stem would be set to 8, the minimum start
+  distance to 3 and the maximum to 5. To define an inverted repeat the
+  minimum  distance  will  be  negative.  For  example  stem length=9,
+  minimum distance=-9, and  maximum  distance=-8  will  find  inverted
+  repeats  of lengths 9 and 10. E.g. AAAAATTTT and AAAAATTTTT would be
+  found, the first having a base at  its  centre,  the  second  having
+  none.
+
+        Class 7: exact match, defined step size.
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include  IUB symbols. All symbols must match. The class
+  differs from class 1 in that searches will move  in  steps  of  some
+  given  size. For example we could search for a certain codon and use
+  a step size of 3 and hence keep in a single reading frame.
+
+        Class 8: direct repeat
+
+        The motif is defined by a repeat length, a minimum score and a
+  range  of loop sizes. The scores are defined using MDM78 for protein
+  sequences and an identity matrix for nucleic acids.  The loop  sizes
+  are defined by a minimum and maximum distance from the 3' end of the
+  stem.
+
+        Class 9: membership of a set
+
+        This motif class is for protein sequences. It  is  defined  by
+  lists  of  allowed amino acids for each position in the motif, and a
+  cut-off score.  Positions at which any amino acid can occur are left
+  blank.  All allowed amino acids for each position give a score of 1.
+  The motifs can be defined in two ways: either typed at the  keyboard
+  or  read in as a weight-matrix-like file.  When the motif is defined
+  at the keyboard the sets of allowed amino  acids  are  separated  by
+  commas(,).  For example IVL,,,DEKR,FYWILVM defines a motif of length
+  5 amino acids in which one of I,V or L must be found  in  the  first
+  position, then anything in the next two positions, D,E,K or R in the
+  fourth position and F,Y,W,I,L,V or M in the fifth.  To specify  that
+  the  whole motif must match a score of 3 would be required (i.e. one
+  of the allowed amino acids must be  found  for  each  of  the  three
+  defined  positions).  If the motif is read from a file the file must
+  have been written by program NIP, or have been saved by the  pattern
+  searching  routines.  If  the  user elects to save a pattern, and it
+  includes class 9 motifs typed at the keyboard, then the program will
+  save  the  class  9 motifs as weight matrix files. Therefore it will
+  request file names for each motif of this class. If the motif  given
+  above  as  an example were saved the weight matrix file would have 5
+  columns.  The first column would contain zeroes except for the I,  V
+  and  L  rows which would be set to 1; the next two columns would all
+  be zero; the next would be zero except for  the  D,E,K  and  R  rows
+  which  would  be  1;  the  final  column  would  contain 1's in rows
+  F,Y,W,I,L,V and M, with the rest zero.
+
+        The logical operator (AND, OR or NOT) used to add  each  motif
+  to  the  pattern  is  specified by preceding the class number by the
+  letters A, O or N. A = AND, O = OR, N = NOT.  The default is  A,  so
+  N2  means include, using the NOT operator, a class 2 motif; O2 means
+  include, using the OR operator, a class 2 motif; both A2 and 2  mean
+  include, using the AND operator, a class 2 motif.
+
+        Range setting.
+
+        The motifs in a pattern are numbered according to their  order
+  in  the list. Apart from the first motif in a pattern all motifs are
+  given a range of allowed positions relative to a  motif  further  up
+  the  list.  For example suppose we have a pattern defined by A AND B
+  AND C AND D.  Motif A can occur anywhere, but B must have its  range
+  of  allowed  positions  defined relative to the position of motif A,
+  and C's positions  can  be  defined  relative  to  either  A  or  B,
+  depending  on  which  is most convenient, and likewise D's positions
+  can be relative to A or B or C.
+
+        Notice that the positions of motifs can be defined relative to
+  more  than one motif. Suppose we have a pattern consisting of motifs
+  A, B and C, and that B occurs 5-10 residues right of A, C occurs  5-
+  10  residues  right  of B, and also C is never more than 15 residues
+  from A. Then it is quite consistent  with  the  methods  to  include
+  motif C into the pattern twice using the AND operator: once relative
+  to A and once relative to B. This will define the  relative  spacing
+  and  the  ORDER  of the motifs in the pattern. (If we simply defined
+  the position of C relative to A it could be found to the left of B).
+
+        Motifs combined together using the OR operator are  all  given
+  the  same range. For example suppose we had a pattern A AND (B OR C)
+  AND (D OR E), then B and C each have the same range,  and  D  and  E
+  also  have  the same range as one another. The range for D and E can
+  be relative to A or to B.
+
+        Motifs cannot have their ranges  defined  relative  to  motifs
+  that  are included using the NOT operator. For example if we had the
+  pattern A NOT B AND C, then the range for  C  can  only  be  defined
+  relative to motif A.
+
+        Speed can be gained by arranging the order of  the  motifs  so
+  that  those higher up the list are of types that can be searched for
+  rapidly and that are also unlikely to be found.
+
+        Motifs combined by the OR operator are  alternatives:  if  any
+  one  of a set of motifs combined by the OR operator is found, then a
+  match is declared. All alternatives will be reported. For example if
+  we  had a pattern defined by A AND (B OR C), then all places where A
+  occurs and B is found within range, and all places where A is  found
+  and C is found within range will be reported. A typical use would be
+  where we might allow a motif to appear on either strand of  the  DNA
+  sequence.  For  example  a  weight matrix representing the heatshock
+  element could be used in a pattern which  included  heatshock  as  a
+  motif  class  4  combined  using the OR operator with heatshock as a
+  motif class 5.
+
+        The probability calculations are performed for each  motif  as
+  it  is  defined.  If  an  overall  probability  cut-off is given the
+  calculation is repeated for each match  found.  To  achieve  maximum
+  searching  speed do not give an overall probability cut-off. Overall
+  cut-off scores should only be used if the  motif  classes  used  are
+  compatible.
+
+        There are currently several ways to display the matches:  1  =
+  each  motif and its position is listed; 2 = all the sequence between
+  the two outermost motifs is listed; 3  =  graphical,  with  a  spike
+  marking  the  position  of  the leftmost motif. The library versions
+  also give entry names, and a one line title; in addition they can be
+  used  to  produce  aligned  families of sequences. When this mode of
+  output is selected the program will write a separate file  for  each
+  match. The files will be called ENTRYNAME.DAT where ENTRYNAME is the
+  name of the entry in the library.  The  matching  sequence  will  be
+  written  out so that the spacing between motifs is constant, and set
+  to the maximum allowed by the pattern definition. Any gaps  will  be
+  filled   with   dashes   (-).   If  the  individual  sequences  were
+  subsequently written one above the other they should line up so that
+  all  motifs are in register. There two types of output of this sort:
+  one, option 4, writes out whole  sequences,  the  other,  option  5,
+  writes  out only the sequences between the two outermost motifs.  If
+  the individual sequences were subsequently  written  one  above  the
+  other  they should line up so that all motifs are in register. There
+  two types of output of this sort: one, option 4,  writes  out  whole
+  sequences,  the  other,  option  5,  writes  out  only the sequences
+  between the two outermost motifs.  Note that for option 4 users  are
+  asked  to  type  the position of the first motif, and the reason for
+  this is  explained  below.  Consider  a  pattern  found  in  several
+  sequences.  Consider only the first motif in the pattern and suppose
+  that it was found in different positions  in  these  sequences.  Say
+  that  of  these  positions  the  one  furthest from the left end was
+  position 100. Then, in order to ensure that all the sequences  would
+  align,  we must specify that motif 1 must start at position 100. Any
+  sequences in which motif 1 started  nearer  to  the  left  end  than
+  position  100  would  be  padded accordingly.  These modes of output
+  should only be used when the  position  of  each  motif  is  defined
+  relative to its immediate neighbour.
+
+        The pattern descriptions can be saved to  files.  These  files
+  can  be used instead of typing definitions again at the keyboard. As
+  the files are annotated, they can easily  be  changed  using  system
+  editors,  and  the  modified  versions  used  to  define the variant
+  patterns for the programs.
+
+        Use of lists of entry names
+
+        The two programs that operate on libraries have the ability to
+  restrict  their  searches to subsets of the libraries. This does not
+  require sublibraries to be created but instead is achieved by  using
+  files  containing  a  list of the entry names of sequences. The user
+  may  choose  to  search  only  those  entries  on   the   list   or,
+  alternatively  to  search  all  but  those  on the list (i.e. in the
+  latter case the list contains the names of those  to  be  excluded).
+  The  programs  can search libraries that have indexes and those that
+  do not.  If a list of names for inclusion is used, then  the  search
+  will  be  faster if the index is present. In all other circumstances
+  the whole library will be read. The list must be  in  library  order
+  except  when  it  is  used  to  include  entries,  and  an  index is
+  available.  The list must contain each  entry  name  on  a  separate
+  line,  with the name starting in column 1 of the line. ie there must
+  be no spaces at the start of the line.  The list of entry names  can
+  be  produced by the keyword searches of nip, pip, etc as long as the
+  listings produced have a space character separating the  entry  name
+  from the entry description. This will depend on how well the library
+  reformatting programs work. For example swissprot entry  names  tend
+  to  run  into the beginning of the descriptions, but other libraries
+  are generally OK.
+
+        One use of the programs  is  to  look  for  patterns  that  we
+  already  know  about, but in new sequences. However it is hoped that
+  they will also be useful for finding new motifs. For example several
+  known   control   regions  in  nucleic  acid  sequences  consist  of
+  particular direct or inverted repeats; the inclusion of  direct  and
+  inverted  repeats  as  motif  classes  makes  it  possible  to  find
+  previously unknown motifs of these types. Using these  new  programs
+  we can ask questions like: "are there any inverted or direct repeats
+  near to sections of sequence that contain both a  CCAAT  box  and  a
+  TATA  box?"; and to search for such things throughout the libraries.
+  In addition, the mode of output in which all  the  sequence  between
+  the  two outermost motifs found is printed out, allows us to extract
+  sequences and  examine  them  in  more  detail  for  further  common
+  subsequences.  For example we might want to collect together all the
+  sequences between putative CCAAT and TATA boxes.
+
+        A further use of  the  inverted  repeat  motif  class  is  the
+  following.  If  a  regulatory  sequence in DNA is poorly defined but
+  also an inverted repeat, then it might be an advantage to specify it
+  both  as a consensus sequence and a superimposed inverted repeat. In
+  this way two weak definitions can be combined to produce a  stronger
+  pattern.
+
+        Given only a few examples of a motif it should be possible  to
+  perform  initial  searches  using  a  class 3 motif, and then, using
+  plausible matching sequences, create a more specific  weight  matrix
+  for the same motif.
+
+        If motifs are combined with  the  first  motif  using  the  OR
+  operator  they  will  be ignored until all permutations that include
+  the first motif have been looked for. The whole search will then  be
+  repeated,  in  turn, for each of those motifs that are combined with
+  the first motif using the OR operator.  An  interesting  consequence
+  of  this is that the program can be used, without change, to compare
+  any newly determined sequence with all known individual  motifs.  We
+  achieve  this by having a pattern in which all known relevant motifs
+  are combined using the OR operator.  If we ask to use  this  pattern
+  with  a  sequence,  the  program  will  automatically  compare  each
+  individual motif in  the  pattern  with  the  whole  length  of  the
+  sequence.  As the number of known motifs grows this should become an
+  increasingly useful standard procedure.
+
+        The  NOT  operator  is  obviously  useful  for   making   sure
+  particular  motifs  are  not  present,  but  it  can also be used to
+  bracket the levels of matches found. We may want a degree  of  match
+  that  lies  between  two  limits - binding should occur, but not too
+  strongly; or base-pairs should  form,  but  not  too  many.  We  can
+  specify  this by asking for a match with a low score, in combination
+  with a match and a high score, both for the same motif, but with the
+  high score included using the NOT operator.
+
+        The algorithm is designed to find all sections of  a  sequence
+  that   satisfy   the  pattern  rather  than  only  the  best  match.
+  Particularly if some of the  motifs  in  a  pattern  are  less  well
+  defined  than  others, this can often result in the same region of a
+  sequence being reported as having several matches,  but  which  only
+  vary in the positions of the weakest motifs.
+
+        General remarks on motif searching
+
+        Generally motifs are short subsequences that are thought to be
+  associated  with particular functions in some known sequences. Often
+  we  search  for  them  to  try  to  understand  or  interpret  other
+  sequences.  Sometimes  we  search  for motifs and patterns to test a
+  hypothesis  about  their  role:  are  they  found  in  the  expected
+  positions  in the expected sequences. In doing so we should remember
+  that, in both proteins and nucleic acids, what we are really looking
+  for  is  a  particular  three  dimensional  structure  with  certain
+  affinities for other structures, and that we are assuming  that  the
+  sequence  of  the  motif alone defines the 3D structure we searching
+  for. The overall structure may be completely different to  those  in
+  which  the  motif  is  functional,  and  hence  the motif may have a
+  different shape or be  inaccessible.  We  should  be  aware  of  the
+  importance  of  the context in which a motif is found. Where does it
+  lie relative to the overall structure,  is  it  accessible,  is  the
+  three  dimensional  spacing between it and other motifs correct? For
+  example, is it on the same side of the double helix, and the correct
+  distance  from  some  other  motif?  How  does  context  affect  our
+  assessment of the significance of finding  a  motif?  Finding  false
+  mammalian  mRNA splice junctions in non-coding sequences is far less
+  important than  finding  false  sites  in  pre-mRNA  sequences,  but
+  finding  them  in  the  correct  places  is most important! In other
+  words, it is often the case that when we are searching for  a  motif
+  that  is  known  to  be necessary for some function, then a positive
+  result in the form of a match in  the  required  position,  is  more
+  important  than a high background of matches in the wrong positions.
+  Being able to write down the probability of finding  a  motif  in  a
+  random  sequence  tells  us how well it is defined. In nucleic acids
+  the DNA may contain many superimposed types of information  such  as
+  those  concerned  with  histone  phasing,  protein  coding  or  mRNA
+  secondary structure. These overlapping "codes"  may  interfere  with
+  one  another  causing  matches to motifs to be poorer than expected.
+  In general we will only have a limited number  of  examples  of  the
+  motif and we do not know how representative they are.
+
+        Sequences have superimposed functions: some parts  may  be  of
+  general structural importance and give rise to an overall framework,
+  and other parts give specificity and hence are not  common;  we  may
+  want  to  use a set of aligned sequences to define a motif, but want
+  to use only the framework positions.  Alternatively we may  want  to
+  pick  out only those parts of a set of aligned sequences that give a
+  particular property, and to ignore other similarities that  are  due
+  to  some  other  property and which could obscure the pattern we are
+  interested in.  It is possible to apply a mask to a set  of  aligned
+  sequences  in  order to give weight to selected positions only.  The
+  ability to define a mask allows certain positions to be used in  the
+  motif  and  others to be ignored, and yet still permits the use of a
+  set of aligned sequences to calculate weights. The mask is requested
+  and applied by the program and results in the masked positions being
+  zero in the weight matrix. The mask is defined in the following way.
+  Suppose  we  had a motif of length 15, then the mask x--x--xx-x will
+  give zero weights to positions 2,3,5,6 and 9 (note it is the  dashes
+  (-)  that  are significant and that positions 1,4,7,8,10,11,12,13,14
+  and 15 will be non-zero). Of course the same set of sequences  could
+  be used with several alternative masks in order to extract different
+  features and create corresponding weight matrices.
+
+        The programs are described in Staden,R. CABIOS 4, 53-60, 1988;
+  Staden,R.   CABIOS  5,  89-96,  1989, and Methods in Enzymology 183,
+  193-211 (1990).
+ @ end of help
+
+
+
diff --git a/help/nip_menu b/help/nip_menu
new file mode 100644
index 0000000..6712ddb
--- /dev/null
+++ b/help/nip_menu
@@ -0,0 +1,156 @@
+-1 0 22 2 T General
+-1 0 22 2 X General
+-2 0 51 2 T Screen control
+-2 0 72 2 X Screen
+-3 0 118 2 T Statistical analysis of content
+-3 0 143 2 X Statistics
+-4 0 180 2 T Structures and repeats
+-4 0 205 2 X Structures
+-5 0 242 2 T Translation and codons
+-5 0 242 2 X Translation and codons
+-6 0 279 2 T Gene search by content
+-6 0 279 2 X Gene search by content
+-7 0 309 2 T General signals
+-7 0 309 2 X General signals
+-8 0 340 2 T Specific signals
+-8 0 340 2 X Specific signals
+0 -1 359 16 T NIP
+0 -1 359 16 X NIP
+1 0 1155 7 T Help
+1 0 1155 7 X Help
+2 0 1469 3 T Quit
+2 0 1469 3 X Quit
+3 1 1543 220 T Read a new sequence
+3 1 1543 220 X Read a new sequence
+4 1 11372 15 T Define active region
+4 1 11372 15 X Define active region
+5 1 12100 24 T List a sequence
+5 1 12100 24 X List a sequence
+6 1 13103 6 T List a text file.
+6 1 13103 6 X List a text file.
+7 1 13300 12 T Direct output to disk
+7 1 13300 12 X Direct output to disk
+8 1 13785 10 T Write active region to disk
+8 1 13785 10 X Write active region to disk
+9 1 14128 31 T Edit the sequence
+9 1 14128 31 X Edit the sequence
+10 2 15970 3 T Clear graphics
+10 2 15970 3 X Clear graphics
+11 2 16036 3 T Clear text
+11 2 16036 3 X Clear text
+12 2 16101 12 T Draw a ruler
+12 2 16101 12 X Draw a ruler
+13 2 16833 13 T Use crosshair
+13 2 16833 13 X Use crosshair
+14 2 17443 35 T Reposition plots
+14 2 17443 35 X Reposition plots
+15 2 19598 28 T Label a diagram
+15 2 19598 28 X Label a diagram
+16 2 20703 34 T Display a map
+16 2 20703 34 X Display a map
+17 1 22073 599 T Search for restriction enzymes
+17 1 22073 599 X Search for restriction enzymes
+18 7 46675 105 T Compare a short sequence
+18 1 46675 105 T Compare a short sequence
+18 7 46675 105 X Compare a short sequence
+18 1 46675 105 X Compare a short sequence
+19 7 49650 106 T Compare a short sequence using a score matrix
+19 7 49650 106 X Compare a short sequence using a score matrix
+20 7 53349 230 T Search for a motif using a weight matrix
+20 7 53349 230 X Search for a motif using a weight matrix
+21 3 63267 4 T Count base composition
+21 3 63267 4 X Count base composition
+22 3 63440 14 T Count dinucleotide frequencies
+22 3 63440 14 X Count dinucleotide frequencies
+23 5 64100 179 T Count codons and amino acids
+23 3 64100 179 T Count codons and amino acids
+23 5 64100 179 X Count codons and amino acids
+23 3 64100 179 X Count codons and amino acids
+24 3 72137 57 T Plot base composition
+24 3 72137 57 X Plot base composition
+25 3 73213 23 T Plot local deviations in base composition
+25 3 73213 23 X Plot local deviations in base composition
+26 3 74495 23 T Plot local deviations from dinucleotide composition
+26 3 74495 23 X Plot local deviations from dinucleotide composition
+27 3 75793 23 T Plot local deviations from trinucleotide composition
+27 3 75793 23 X Plot local deviations from trinucleotide composition
+28 5 77065 18 T Calculate codon constraint
+28 5 77065 18 X Calculate codon constraint
+59 3 77869 12 T Plot negentropy
+59 3 77869 12 X Plot negentropy
+30 4 78454 74 T Search for hairpin loops
+30 4 78454 74 X Search for hairpin loops
+31 4 80321 23 T Search for long range inverted repeats
+31 4 80321 23 X Search for long range inverted repeats
+32 4 81157 37 T Search for repeats
+32 4 81157 37 X Search for repeats
+33 4 82467 12 T Search for z dna (total ry, yr)
+33 4 82467 12 X Search for z dna (total ry, yr)
+34 4 82984 12 T Search for z dna (runs of ry, yr)
+34 4 82984 12 X Search for z dna (runs of ry, yr)
+35 4 83623 15 T Search for z dna (best phased value)
+35 4 83623 15 X Search for z dna (best phased value)
+36 4 84350 92 T Local similarity or complementarity search
+36 4 84350 92 X Local similarity or complementarity search
+37 5 87778 39 T Set genetic code
+37 5 87778 39 X Set genetic code
+38 4 89050 74 T Examine repeats
+38 3 89050 74 T Examine repeats
+39 5 91670 286 T Translate and list in upto six phases
+39 5 91670 286 X Translate and list in upto six phases
+40 5 103780 134 T Translate and write the protein sequence to disk
+40 5 103780 134 X Translate and write the protein sequence to disk
+41 5 108198 71 T Calculate and write codon table to disk
+41 5 108198 71 X Calculate and write codon table to disk
+42 6 111525 132 T Codon usage method
+42 6 111525 132 X Codon usage method
+43 6 118508 182 T Positional base preference method.
+43 6 118508 182 X Positional base preference method.
+44 6 127924 39 T Uneven positional base frequencies.
+44 6 127924 39 X Uneven positional base frequencies.
+45 6 130287 33 T Codon improbability on base composition
+45 6 130287 33 X Codon improbability on base composition
+46 6 132146 28 T Codon improbability on amino acid composition
+46 6 132146 28 X Codon improbability on amino acid composition
+47 6 133744 14 T Shepherd RNY preference method
+47 6 133744 14 X Shepherd RNY preference method
+48 6 134410 30 T Ficketts method
+48 6 134410 30 X Ficketts method
+49 6 136094 139 T tRNA gene search.
+49 6 136094 139 X tRNA gene search.
+50 7 141894 4 T Plot start codons
+50 7 141894 4 X Plot start codons
+51 7 142027 4 T Plot stop codons
+51 7 142027 4 X Plot stop codons
+52 7 142188 4 T Plot stop codons on the complementary strand
+52 7 142188 4 X Plot stop codons on the complementary strand
+53 7 142365 4 T Plot stop codons on both strands
+53 7 142365 4 X Plot stop codons on both strands
+54 5 142536 45 T Search for longest open reading frames
+54 5 142536 45 X Search for longest open reading frames
+55 8 144437 67 T Search for E. coli promoter (general)
+55 8 144437 67 X Search for E. coli promoter (general)
+56 8 148004 4 T Search for E. coli promoter (general) strand
+56 8 148004 4 X Search for E. coli promoter (general) strand
+57 8 148210 4 T Search for E. coli promoter sequences. (-35 and -10)
+57 8 148210 4 X Search for E. coli promoter sequences. (-35 and -10)
+58 8 148405 44 T Search for procaryotic ribosome binding sites
+58 8 148405 44 X Search for procaryotic ribosome binding sites
+29 1 150862 4 T Reverse and complement the sequence
+29 1 150862 4 X Reverse and complement the sequence
+60 7 151001 142 T Search using a dinucleotide weight matrix
+60 7 151001 142 X Search using a dinucleotide weight matrix
+61 8 157292 31 T Search for eukaryotic ribosome binding sites
+61 8 157292 31 X Search for eukaryotic ribosome binding sites
+62 8 158730 56 T Search for splice junctions
+62 8 158730 56 X Search for splice junctions
+63 7 162089 7 T Search using a weight matrix (complementary)
+63 7 162089 7 X Search using a weight matrix (complementary)
+64 3 162471 36 T Plot observed-expected word frequencies
+64 3 162471 36 X Plot observed-expected word frequencies
+65 9 164175 5 T Search for polya sites
+65 9 164175 5 X Search for polya sites
+66 1 164369 4 T Interconvert t and u
+66 1 164369 4 X Interconvert t and u
+67 7 164520 797 T Search for patterns of motifs
+67 7 164520 797 X Search for patterns of motifs
diff --git a/help/nipf_help b/help/nipf_help
new file mode 100644
index 0000000..44a7208
--- /dev/null
+++ b/help/nipf_help
@@ -0,0 +1,132 @@
+
+ @-1. TX   0 @General
+
+ @-2. TX   0 @Screen control
+
+ @-3. TX   0 @Statistical analysis
+
+ @-1. TX   0 @General
+
+ @-2. TX   0 @Screen control
+
+ @-3. TX   0 @Statistical analysis
+
+ @0.  TX  -1 @NIPF
+
+ @1.  TX 1 @ Help
+
+ @2.  TX 1 @ Quit
+
+ @3.  TX 1 @ Read new sequence
+
+ @4.  TX 1 @ Redefine active region
+
+ @5.  TX 1 @ List the sequence
+
+ @6.  TX 1 @ List a text file
+
+ @7.  TX 1 @ Direct output to disk
+
+ @8.  TX 1 @ Write active sequence to disk
+
+ @9.  TX 1 @ List a translation
+
+ @32. TX 1 @ List showing base differences
+
+ @37. TX 1 @ List showing translation
+
+ @33. TX 1 @ List showing amino acid differences
+
+ @10. TX 2 @ Clear graphics
+
+ @11. TX 2 @ Clear text
+
+ @12. TX 2 @ Draw a ruler
+
+ @13. TX 2 @ Use cross hair
+
+ @14. TX 2 @ Reset margins
+
+ @15. TX 2 @ Label diagram
+
+ @16. TX 2 @ Display a map
+
+ @17. TX 3 @ Set comparison mode
+
+ @18. TX 3 @ Set sort mode
+
+ @21. TX 3 @ Count base changes
+
+ @22. TX 3 @ Count codon changes
+
+ @23. TX 3 @ Count genetic events
+
+ @24. TX 3 @ Show table of base changes
+
+ @36. TX 3 @ Show table of expressed base changes
+
+ @39. TX 3 @ Show table of silent base changes
+
+ @38. TX 3 @ Estimate mutation rate
+
+ @25. TX 3 @ Plot base changes
+
+ @26. TX 3 @ Plot expressed changes per base
+
+ @27. TX 3 @ Plot silent changes per base
+
+ @28. TX 3 @ Count expressed changes per base
+
+ @29. TX 3 @ Count silent changes per base
+
+ @30. TX 3 @ Count changed amino acids
+
+ @31. TX 3 @ Plot amino acid variability
+
+ @ end of help
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/nipf_menu b/help/nipf_menu
new file mode 100644
index 0000000..ab0225b
--- /dev/null
+++ b/help/nipf_menu
@@ -0,0 +1,84 @@
+-1 0 23 2 T General
+-1 0 23 2 X General
+-2 0 53 2 T Screen control
+-2 0 53 2 X Screen control
+-3 0 89 2 T Statistical analysis
+-3 0 89 2 X Statistical analysis
+-1 0 112 2 T General
+-1 0 112 2 X General
+-2 0 142 2 T Screen control
+-2 0 142 2 X Screen control
+-3 0 178 2 T Statistical analysis
+-3 0 178 2 X Statistical analysis
+0 -1 198 2 T NIPF
+0 -1 198 2 X NIPF
+1 1 217 2 T Help
+1 1 217 2 X Help
+2 1 236 2 T Quit
+2 1 236 2 X Quit
+3 1 268 2 T Read new sequence
+3 1 268 2 X Read new sequence
+4 1 305 2 T Redefine active region
+4 1 305 2 X Redefine active region
+5 1 337 2 T List the sequence
+5 1 337 2 X List the sequence
+6 1 368 2 T List a text file
+6 1 368 2 X List a text file
+7 1 404 2 T Direct output to disk
+7 1 404 2 X Direct output to disk
+8 1 448 2 T Write active sequence to disk
+8 1 448 2 X Write active sequence to disk
+9 1 481 2 T List a translation
+9 1 481 2 X List a translation
+32 1 525 2 T List showing base differences
+32 1 525 2 X List showing base differences
+37 1 564 2 T List showing translation
+37 1 564 2 X List showing translation
+33 1 614 2 T List showing amino acid differences
+33 1 614 2 X List showing amino acid differences
+10 2 643 2 T Clear graphics
+10 2 643 2 X Clear graphics
+11 2 668 2 T Clear text
+11 2 668 2 X Clear text
+12 2 695 2 T Draw a ruler
+12 2 695 2 X Draw a ruler
+13 2 724 2 T Use cross hair
+13 2 724 2 X Use cross hair
+14 2 752 2 T Reset margins
+14 2 752 2 X Reset margins
+15 2 780 2 T Label diagram
+15 2 780 2 X Label diagram
+16 2 808 2 T Display a map
+16 2 808 2 X Display a map
+17 3 842 2 T Set comparison mode
+17 3 842 2 X Set comparison mode
+18 3 870 2 T Set sort mode
+18 3 870 2 X Set sort mode
+21 3 903 2 T Count base changes
+21 3 903 2 X Count base changes
+22 3 937 2 T Count codon changes
+22 3 937 2 X Count codon changes
+23 3 972 2 T Count genetic events
+23 3 972 2 X Count genetic events
+24 3 1013 2 T Show table of base changes
+24 3 1013 2 X Show table of base changes
+36 3 1064 2 T Show table of expressed base changes
+36 3 1064 2 X Show table of expressed base changes
+39 3 1112 2 T Show table of silent base changes
+39 3 1112 2 X Show table of silent base changes
+38 3 1149 2 T Estimate mutation rate
+38 3 1149 2 X Estimate mutation rate
+25 3 1181 2 T Plot base changes
+25 3 1181 2 X Plot base changes
+26 3 1227 2 T Plot expressed changes per base
+26 3 1227 2 X Plot expressed changes per base
+27 3 1270 2 T Plot silent changes per base
+27 3 1270 2 X Plot silent changes per base
+28 3 1317 2 T Count expressed changes per base
+28 3 1317 2 X Count expressed changes per base
+29 3 1361 2 T Count silent changes per base
+29 3 1361 2 X Count silent changes per base
+30 3 1401 2 T Count changed amino acids
+30 3 1401 2 X Count changed amino acids
+31 3 1443 2 T Plot amino acid variability
+31 3 1443 2 X Plot amino acid variability
diff --git a/help/pip_help b/help/pip_help
new file mode 100644
index 0000000..69cc664
--- /dev/null
+++ b/help/pip_help
@@ -0,0 +1,2244 @@
+
+ @-1. TX 0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. T   0 @Statistical analysis of content
+
+ @-3. X   0 @Statistics
+
+ @-4. T   0 @Structures and repeats
+
+ @-4. X   0 @Structures
+
+ @-5. TX  0 @Search
+
+ @0.  TX -1 @PIP
+
+        This is a program  for analysing individual protein sequences.
+ It  can  read  sequences  stored  in  many  of the most commonly used
+ formats, and performs all of the usual simple analyses.  In  addition
+ it  has  very  flexible search procedures  and   presents many of its
+ results graphically.
+
+        The following analyses (preceded by their option numbers)  are
+ included:
+  ? = Help
+  ! = Quit
+  3 = read a new sequence
+  4 = define active region
+  5 = list the sequence
+  6 = list a text file
+  7 = direct output to disk
+  8 = write active sequence to disk
+  9 = edit the sequence
+ 10 = clear graphics screen
+ 11 = clear text screen
+ 12 = draw a ruler
+ 13 = use cross hair
+ 14 = reposition plots
+ 15 = label diagram
+ 16 = display a map
+ 17 = search for short sequences
+ 18 = compare a sequence
+ 19 = compare a sequence using a score matrix
+ 20 = search for a sequence using a weight matrix
+ 21 = calculate amino acid composition
+ 22 = plot hydrophobicity
+ 23 = plot charge
+ 24 = plot Robson prediction
+ 25 = plot hydrophobic moment
+ 26 = draw helix wheel
+ 27 = back translate
+ 28 = search for patterns of motifs
+
+        Some of these methods produce graphical  results  and  so  the
+ program  is  generally  used from a graphics terminal (a vdu on which
+ lines and points can be drawn as well as characters).
+
+        For users of VT640's or their equivalents the terminal must be
+ set nowrap (type NOWRAP) prior to running the program.
+  The positions of each of the plots is defined relative  to  a  users
+  drawing board which has size 1-10,000 in x and 1-10,000 in y.  Plots
+  for each  option  are  drawn  in  a  window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "ANALPMRG" when
+  the program is started. Users can have their own file if required.
+
+        The program can handle sequences stored  in  several  formats:
+  Staden, EMBL, GENBANK, PIR (also known as NBRF) and GCG and they are
+  described in the help for 'READ NEW SEQUENCE'.
+
+        The options for the program are accessed from  5  main  menus:
+  general, screen control, statistical analysis of content, structure,
+  search.  Both menus and options are selected by number.
+ @1. TX 0 @Help
+
+        This option gives online help. The user should  select  option
+  numbers  and  the  current  documentation  will  be given. Note that
+  option 0 gives an introduction to the program, and that ?  will  get
+  help from anywhere in the program.  The following analyses (preceded
+  by their option numbers) are included:
+
+ @2. TX 0 @Quit
+
+        This function stops the program.
+ @3. TX 1 @Read a new sequence
+
+        This option allows users to  read  in  new  sequences,  browse
+  through  annotations,  or  search  sequence  libraries for keywords.
+  Sequences can  be  read  from  "personal"  sequence  files  or  from
+  sequence  libraries. These are referred to as the sequence "source".
+  Personal files can be stored in several formats:  Staden, PIR, EMBL,
+  GENBANK  and  GCG.  At LMB we use "Staden" format for sequencing and
+  all the libraries  are  stored  in  their  original  formats.  Note,
+  however,  that  libraries  such  as EMBL or GenBank that are divided
+  into several files (eg GenBank has 13 separate files) are indexed as
+  a  whole.  This  means  that  users  do  not need to know which file
+  contains an entry, only which library.  When  the  user  selects  to
+  read in a sequence the program first asks for the sequence "source".
+
+        If the user selects "personal" the program will  ask  for  the
+  format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of
+  the file. For PIR format the user will also be required to know  the
+  entry  name of the sequence as the file can contain several. For the
+  other formats only a single entry is  expected.  The  file  will  be
+  read,  its  length  and composition will be displayed and the option
+  left.
+
+        If the user selects  "library"  as  the  sequence  source  the
+  program will display a list of available libraries. The programs are
+  capable of  handling  all  current  libraries  but  which  ones  are
+  available  will  vary  from  site  to  site.  At LMB we have several
+  libraries and also weekly updates of data gathered between releases.
+  The  program will ask users to select a library and then give a list
+  of options:
+
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entrynames from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+
+  If get a sequence or get annotations is selected users will be asked
+  to  type  the entry name. The option will be left when a sequence is
+  selected  or  !  is  typed.  The  composition  and  length  will  be
+  displayed.
+
+        The  text  index  contains  all  words  from  feature  tables,
+  reference  titles, definition lines, keywords lists and comments, so
+  the text index search is most useful. It is also the fastest. Up  to
+  5  words  can  be  searched  for  at once. The words should be typed
+  separated by spaces, for example
+   ? Keywords=P53 mouse murine tumo
+
+  will search for all entries that contain words  starting  with  p53,
+  mouse,  murine  and  tumo.  Only the unique entries that contain ALL
+  words will be  listed.  Before  listing  the  matching  entries  the
+  program  will  show  the number of 'hits' for each word and ring the
+  bell.  Escape is possible at this point, or after each screenfull of
+  entries.   In  addition  to the entry names the text search displays
+  the primary accession number, the  sequence  length  and  up  to  80
+  characters of description.  (The search of 'titles' is now redundant
+  because the full text index contains all the  title  words  and  the
+  search  is  much  faster.  It  will  probably  be  removed  from the
+  program.)  All searches are independent of case. Where possible  the
+  program will offer default entry names.
+
+        Typical dialogue follows.
+  Select sequence source
+  X  1 Personal file
+     2 Sequence library
+  ? Selection  (1-2) (1) =
+  Select sequence file format
+  X  1 Staden
+     2 EMBL
+     3 GenBank
+     4 PIR
+     5 GCG
+  ? Selection  (1-5) (1) =
+  ? Sequence file name=M13MP7.SEQ
+   Contig title removed
+  Sequence length=  7238
+   Sequence composition
+            T          C          A          G          -
+        2405.      1539.      1765.      1527.         2.
+          33.2%      21.3%      24.4%      21.1%       0.0%
+    .
+    .
+    .
+
+
+   Select sequence source
+   X  1 Personal file
+      2 Sequence library
+   ? Selection  (1-2) (1) =2
+   Select a library
+   X  1 EMBL 29 nucleotide library Dec 91
+      2 SWISSPROT 20 protein library Nov 91
+      3 PIR 31 protein library Dec 91
+      4 NRL3D 58 From Brookhaven protein library Dec 91
+      5 GenBank
+   ? Selection  (1-5) (1) =
+  Library is in EMBL format with indexes
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =5
+   Search for keywords
+   ? Keywords=P53 mouse
+  P53 hits  68
+  MOUSE hits  8180
+
+   MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+   MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+   MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+   MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+   MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+   MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+   MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+   MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+   MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+   MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+   MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+   MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+   MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+   MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+   MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+   MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+   MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+   MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+   MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+   MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+   MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+   MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+        22 different entries found
+
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =4
+   Search for keywords
+   ? Keywords=alpha
+   Searching for alpha
+   AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+   AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+   AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+   AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+   ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+   ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+   ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+   ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+   AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+   AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+   AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+   AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+   AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+   AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+   AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+   AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =3
+   ? Accession number=v00636
+  Entry name LAMBDA
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =2
+   Default Entry name=LAMBDA
+   ? Entry name=
+  ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+  XX
+  AC   V00636; J02459; M17233; X00906;
+  XX
+  DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+  DT   09-JUN-1982 (Rel. 1, Created)
+  XX
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+  XX
+  KW   circular; coat protein; DNA binding protein; genome;
+  KW   origin of replication.
+  XX
+  OS   Bacteriophage lambda
+  OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+  XX
+  RN   [1]
+  RP   1-48502
+  RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+  RT   "Nucleotide sequence of bacteriophage lambda DNA";
+  RL   J. Mol. Biol. 162:729-773(1982).
+  XX
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =
+   Default Entry name=LAMBDA
+   ? Entry name=
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+   Sequence length  48502
+   Sequence composition
+             T          C          A          G          -
+        11988.     11360.     12336.     12818.         0.
+           24.7%      23.4%      25.4%      26.4%       0.0%
+
+ @4. TX 1 @Redefine active region
+
+        For its analytic functions  the  program  always  works  on  a
+  region of the sequence called the active region. When a new sequence
+  is read into the program the active region is automatically  set  to
+  start  at  the  beginning  of  the sequence and go up to the maximum
+  allowed size of active region the version of the program can handle.
+  The  positions  are shown on the screen.  On most machines this will
+  be to the end of the sequence.  This option allows the user define a
+  different  region.  Note  that  for  convenience  in the listing and
+  translation functions the user is given access  to  regions  outside
+  the active region.
+ @5. TX 1 @List a sequence
+
+        The sequence can be listed with line lengths from 10 to 120 in
+  multiples  of  10.  Output  can  be directed to a disk file by first
+  selecting disk output. The output looks like:
+
+            10         20         30         40         50         60
+    MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY
+
+            70         80         90        100        110        120
+    AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
+
+           130        140        150        160        170        180
+    LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
+
+           190        200        210        220        230        240
+    DAIINQRDSG IKCIYVAIGQ KASTISNVVR KLEEHGALAN TIVVVATASE SAALQYLARM
+
+           250        260        270        280        290        300
+    PVALMGEYFR DRGEDALIIY DDLSKQAVAY RQISLLLRRP PGREAFPGDV FYLHSRLLER
+
+           310        320        330        340        350        360
+    AARVNAEYVE AFTKGEVKGK TGSLTALPII ETQAGDVSAF VPTNVISITD GQIFLETNLF
+
+           370        380        390        400        410        420
+    NAGIRPAVNP GISVSRVGGA AQTKIMKKLS GGIRTALAQY RELAAFSQFA SDLDDATRKQ
+
+           430        440        450        460        470        480
+    LDHGQKVTEL LKQKQYAPMS VAQQSLVLFA AERGYLADVE LSKIGSFEAA LLAYVDRDHA
+
+           490        500        510        520        530        540
+    PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW*
+
+ @6. TX 1 @List a text file
+
+        Allows the user to have a text file displayed on  the  screen.
+  It will appear one page at a time.
+ @7. TX 1 @Direct output to disk
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @8. TX 1 @Write active region to disk
+
+        The program has the capability of reading  in  EMBL,  GENBANK,
+  NBRF,  GCG  and  Staden  formats  and of reversing and complementing
+  sequences. This option allows users  to  write  the  current  active
+  sequence  to  a  disk  file in Staden format. Hence it allows format
+  conversion and crude sequence cutting.
+ @9. TX 1 @Edit the sequence
+
+        Used to edit sequences or any other files by giving access  to
+  the  computers  system  editor. For editing sequences the input file
+  should have already been created using the  listing  function  "list
+  sequence".
+
+        Supply the name of the file to edit.  Wait  while  the  system
+  editor  is  made  ready  (can take awhile on a vax). Use the editor.
+  Exit from the editor. If a sequence has been edited, and you want to
+  process  it,  affirm  that the sequence should be "made active". The
+  edited sequence will replace the original sequence.
+
+        This editing method is designed to give  users  access  to  an
+  editor with which they are familiar - i.e. the one on their machine,
+  and yet to  allow  them  to  edit  a  sequence  which  contains  the
+  landmarks  they  need  in  order  to  know where they are. Users can
+  create files containing simple listings with numbering, using  "list
+  the  sequence",  and  then edit them with their system editor, using
+  the numbering to know where they are within the sequence.  When  the
+  edits  are  complete  they  exit  from  the  editor  and the program
+  "analyses" the edited file to extract only the sequence  characters.
+  Define     the     permitted    set    of    characters    to    be:
+  ACDEFGHIKLMNPQRSTVWXYZ-acdefghiklmnpqrstvwxyz.     All     permitted
+  characters  found  in the file will become part of the sequence, all
+  others removed.
+ @10. TX 2 @Clear graphics
+
+        Clears the screen of both text and graphics.
+ @11. TX 2 @Clear text
+
+        Clears only text from the screen.
+ @12. TX 2 @Draw a ruler
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first  amino  acid
+  to  be marked (for example if the active region is 1501 to 8000, the
+  user might wish to mark every 1000th amino acid starting  at  either
+  1501  or  2000  -  it depends if the user wishes to treat the active
+  region as an independent unit with its own numbering starting at its
+  left  edge,  or  as  part  of the whole sequence). The user can also
+  define the separation of the ticks on the scale and their height. If
+  required  the  labelling  routine  can be used to add numbers to the
+  ticks.
+ @13. TX 2 @Use cross hair
+
+        This function puts a steerable cross on the screen that can be
+  used to find the coordinates of points in the sequence. The user can
+  move the cross around using the directional keys; when he  hits  the
+  space bar the program will print out the coordinates of the cross in
+  sequence units and the option will be exited.
+
+        If instead, you hit a , the position will be displayed but the
+  cross will remain on the screen.
+
+        If a letter s is hit the sequence around  the  cross  hair  is
+  displayed and the cross remains on the screen.
+ @14. TX 2 @Reset margins
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "ANALMARG" when
+  the program is started. Users can have their own file  if  required.
+  As  all  the plots start at the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. The cross-
+  hair can be used to choose suitable heights.
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+ @16. TX 2 @Display a map
+
+        It is  often  convenient  to  plot  a  map  alongside  graphed
+  analysis  in  order  to  indicate features within the sequence. This
+  function allows users to draw maps using files arranged in the  form
+  of  EMBL  feature  tables. Of course the EMBL table are usually only
+  used for nucleic acid  sequence  annotation  but,  as  long  as  the
+  features  are written in the correct format, they can be employed by
+  this routine. The  map  is  composed  of  a  line  representing  the
+  sequence  and  then  further  lines  denoting  the endpoints of each
+  feature the user identifies. The user is asked to define  height  at
+  which  the  line representing the sequence should be drawn; then for
+  the feature height; then for the features to plot.
+ @17. TX 1 5 @Short sequence search
+
+        This routine is used to search  for  exact  matches  to  short
+  sequences.  It  is  equivalent  to  the restriction enzyme search in
+  program NIP. It and can either list matches or present  the  results
+  graphically.
+
+        Select from searching, screen clearing or file listing. Choose
+  a file of strings and the mode of output required.
+
+        The files of short sequences (strings) and their names need to
+  be arranged in a particular way. For example
+  ACID/D/E//
+  BASIC/R/K/H//
+  HYDRO/F/L/I/V/Y//
+  GLYCO/N-S/N-T//
+  +/R/K/H//
+  -/D/E//
+  defines various groups of  amino  acids.   Each  string  or  set  of
+  strings must be preceded by a name, each string must be preceded and
+  terminated with a slash (/), and each set of strings by  2  slashes.
+  These  collections  of strings and their names can be read from disk
+  or entered from the keyboard. Two  files  containing  sequences  are
+  currently  available.  One contains named groups of amino acids. The
+  other simply contains the names of  all  amino  acids  and  gives  a
+  convenient  way  of  producing  a  plot  of the positions of all the
+  different amino acids in the sequence.  The user can select  strings
+  by  name  from  these collections. Results can be displayed  name by
+  name or all together. Strings entered from the keyboard need  to  be
+  separated  by  slash characters(/).  For the name by name search the
+  output looks like:
+    MATCHES=    12
+   NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+   ACID                  E                          7       7       1
+   ACID                  E                         10       3       1
+   ACID                  E                         24      14       1
+   ACID                  E                         28       4       1
+   ACID                  D                         36       8       1
+   ACID                  D                         46      10       2
+   ACID                  E                         51       5       2
+   ACID                  E                         67      16       2
+   ACID                  D                         69       2       2
+   ACID                  D                         81      12       2
+   ACID                  E                         84       3       2
+   ACID                  E                         96      12       3
+    MATCHES=    10
+   NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+   BASIC                 K                         13      13       1
+   BASIC                 R                         15       2       1
+   BASIC                 H                         26      11       1
+   BASIC                 R                         40      14       1
+   BASIC                 H                         42       2       2
+   BASIC                 R                         59      17       2
+   BASIC                 R                         68       9       2
+   BASIC                 K                         87      19       2
+   BASIC                 K                         89       2       2
+   BASIC                 R                         93       4       2
+    MATCHES=     1
+   NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+   GLYCO                 NST                        4       4       3
+
+   or when the results are ordered only on position the output looks like:
+
+   NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+   GLYCO                 NST                        4       3
+   ACID                  E                          7       3
+   ACID                  E                         10       3
+   BASIC                 K                         13       3
+   BASIC                 R                         15       2
+   ACID                  E                         24       9
+   BASIC                 H                         26       2
+   ACID                  E                         28       2
+   ACID                  D                         36       8
+   BASIC                 R                         40       4
+   BASIC                 H                         42       2
+   ACID                  D                         46       4
+   ACID                  E                         51       5
+   BASIC                 R                         59       8
+  Graphical output marks the  position  of  each  string  by  a  short
+  vertical line and gives its name at the left end of the line. If the
+  top of the  screen  is  reached  the  program  gives  the  user  the
+  oportunity  to  take  a hard copy and then will clear the screen and
+  restart plotting results at the original start position.  Note  that
+  any  character  in  the  string  that  is not a recognisable protein
+  symbol will be treated as a wild card character will match with  all
+  characters in the searched sequence.
+
+        Typical dialogue follows.
+
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=17
+   Search for short sequences
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =2
+    1 All acids
+  X 2 Named groups
+    3 Personal file
+    4 Keyboard
+  ? 0,1,2,3,4 =
+
+  ACID/D/E//
+  BASIC/R/K/H//
+  HYDRO/F/L/I/V/Y//
+  GLYCO/N-S/N-T//
+  +/R/K/H//
+  -/D/E//
+  DIBASIC/RR/KK/RK/KR//
+  TURN/N/D/G/P/S//
+  BLOCK/A/Q/E/I/L/M/F/W/V//
+  INDIF/R/C/H/K/T/Y//
+  End of file
+
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All acids
+  X 2 Named groups
+    3 Personal file
+    4 Keyboard
+  ? 0,1,2,3,4 =
+
+  ? (y/n) (y) All names n
+  ? Name=acid
+  ? Name=basic
+  ? Name=glyco
+  ? Name=
+
+  ? (y/n) (y) Show results name by name
+  ? (y/n) (y) List matches
+
+   searching
+   matches=    59
+  NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+  ACID                  E                          7       7       1
+  ACID                  E                         10       3       1
+  ACID                  E                         24      14       1
+  ACID                  E                         28       4       1
+  ACID                  D                         36       8       1
+  ACID                  D                         46      10       2
+  ACID                  E                         51       5       2
+  ACID                  E                         67      16       2
+  ACID                  D                         69       2       2
+  ACID                  D                         81      12       2
+  ACID                  E                         84       3       2
+  ACID                  E                         96      12       3
+  ACID                  D                        116      20       3
+   matches=    61
+  NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+  BASIC                 K                         13      13       1
+  BASIC                 R                         15       2       1
+  BASIC                 H                         26      11       1
+  BASIC                 R                         40      14       1
+  BASIC                 H                         42       2       2
+  BASIC                 R                         59      17       2
+   ...etc
+   matches=     2
+  NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+  GLYCO                 NST                        4       4       3
+  GLYCO                 NQT                      487     483      28
+                                                          28     483
+
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+
+    1 All acids
+  X 2 Named groups
+    3 Personal file
+    4 Keyboard
+  ? 0,1,2,3,4 =
+
+  ? (y/n) (y) Selected names
+
+  ? Name=basic
+  ? Name=glyco
+  ? Name=
+
+  ? (y/n) (y) Show results name by name n
+  ? (y/n) (y) List matches
+
+   searching
+  NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+  GLYCO                 NST                        4       3
+  BASIC                 K                         13       9
+  BASIC                 R                         15       2
+  BASIC                 H                         26      11
+  BASIC                 R                         40      14
+  BASIC                 H                         42       2
+  BASIC                 R                         59      17
+  BASIC                 R                         68       9
+  BASIC                 K                         87      19
+   ...etc
+  BASIC                 R                        477      14
+  BASIC                 H                        479       2
+  GLYCO                 NQT                      487       8
+  BASIC                 K                        499      12
+  BASIC                 K                        501       2
+  BASIC                 K                        508       7
+                                                           7
+
+  X 1 Search
+    2 List enzyme file
+    3 Clear text
+    4 Clear graphics
+  ? 0,1,2,3,4 =
+    1 All acids
+  X 2 Named groups
+    3 Personal file
+    4 Keyboard
+  ? 0,1,2,3,4 =4
+  Define search strings by typing a string name
+  followed by the string(s)
+  ? Name=MARY
+  ? String(s)=AL/VI
+  ? Name=
+  ? (y/n) (y) All names
+  ? (y/n) (y) Show results name by name
+  ? (y/n) (y) List matches
+
+   searching
+   matches=    12
+  NAME                  SEQUENCE            POSITION  FRAGMENT LENGTHS
+  MARY                  VI                        38      38      10
+  MARY                  AL                        63      25      13
+  MARY                  VI                       136      73      16
+  MARY                  AL                       177      41      19
+  MARY                  AL                       217      40      25
+  MARY                  AL                       233      16      37
+  MARY                  AL                       243      10      40
+  MARY                  AL                       256      13      41
+  MARY                  AL                       326      70      45
+  MARY                  VI                       345      19      51
+  MARY                  AL                       396      51      70
+  MARY                  AL                       470      74      73
+
+
+ @18. TX 1 5 @Compare a sequence
+
+        This  routine  slides  a  short  sequence  along  the  current
+  sequence  and finds all positions at which a given percentage of the
+  amino acids match.  Output is in both graphical and listed forms.
+
+        If  users call for dialogue when the routine is selected  they
+  will  be  given  the  choice  of  keyboard or file input. Define the
+  string, and the percentage match. Matches will be  plotted  out  and
+  then  the  user  can  select  to  have them listed. Then the routine
+  cycles around.
+
+        The routine slides the search string along the   sequence  and
+  marks  the positions at which a minimum percentage score is reached.
+  The graphical output draws a vertical line at  the  match  position;
+  the  height  of the line represents the percentage score, so that if
+  the line reaches the top of the box the score is 100%.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=18
+   Find percentage matches
+  ? (y/n) (y) Keep picture
+
+  ? String=aaa
+  ? Percent match (1.00-100.00) (70.00) =
+
+   missing graphics
+
+  Total scoring positions above 70.000 percent =  19
+  Scores          2      2      2      2      2      2      2      2      2      2
+  Positions      61    131    177    217    226    231    232    267    300    301
+
+  ? Number to list (0-19) (0) =3
+
+          61
+           AIA
+           * *
+           aaa
+           1
+
+         131
+           AIA
+           * *
+           aaa
+           1
+
+         177
+           ALA
+           * *
+           aaa
+           1
+  ? (y/n) (y) Keep picture n
+
+  Default String=aaa
+  ? String=!
+
+ @19. TX 1 5 @Compare a sequence using a score matrix
+
+        This  routine  slides  a  short  sequence  along  the  current
+  sequence  and  finds  all  positions  at  which  a  given  level  of
+  similarity (a cutoff score) is reached. The score is defined by  use
+  of  a  score  matrix (MDM78). Output is in both graphical and listed
+  forms.
+
+        If  users call for dialogue when the routine is selected  they
+  will  be  given  the  choice  of  keyboard or file input. Define the
+  string and the cutoff score. Matches will be plotted  out  and  then
+  the  user  can  select  to have them listed. Then the routine cycles
+  around.
+
+        The routine slides the search string along the   sequence  and
+  marks  the  positions  at  which a the cutoff score is achieved. The
+  graphical output draws a vertical line at the  match  position;  the
+  height  of  the  line  represents  the   score,  so that if the line
+  reaches the top of the box the score is the maximum possible.
+
+        Typical dialogue follows.
+
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=19
+   Find matches using a score matrix
+  ? (y/n) (y) Keep picture
+
+  ? String=aaa
+  Minimum score=    12 Maximum score=    36
+  ? Score (12-36) (36) =
+
+   missing graphics
+
+  For score    24 the number of matches=   507
+  scores         35     35     35     34     34     34     34     34     34     34
+  positions     226    231    379    112    133    202    227    267    378
+  380
+
+  ? Number to list (0-507) (0) =3
+
+         226
+           ATA
+           * *
+           aaa
+           1
+
+         231
+           SAA
+             **
+           aaa
+           1
+
+         379
+           GAA
+            **
+           aaa
+           1
+  ? (y/n) (y) Keep picture n
+
+  Default String=aaa
+  ? String=!
+ @20. TX 5 @Search for a motif using a weight matrix
+
+        This function performs  searches  for  short  sequence  motifs
+  using  an  appropriate  weight matrix. In addition it can be used to
+  create or modify weight matrices. In order to perform a  search  the
+  only  input  required  is the name of the file containing the weight
+  matrix.  The results can be presented  graphically  or  listed.  The
+  graphical presentation will draw line at the position of any matches
+  found; the height of the line is proportional to the score.
+
+        For a search, select "use weight matrix", supply the  name  of
+  the  file  containing  the  weight matrix, and choose between having
+  results plotted  or  listed.  If  dialogue  is  requested  when  the
+  function is selected users can alter the cutoff score employed.
+
+        To create a weight matrix several steps are involved.  A  file
+  containing an alignment of known motifs is required. (This file must
+  be created before the current option is selected. The  format  is  a
+  follows:  each  sequence is written on a separate line with at least
+  one space at the beginning; each sequence is terminated by  a  space
+  character,  and  can  be  followed  by a name. The sequences must be
+  aligned.) Supply the name of the  file  of  aligned  sequences.  The
+  program  reads  and  displays the sequences. Choose between "summing
+  logs of weights" or summing weights (i.e. whether to multiply or add
+  weights).  If  logs  are used all scores will be negative. Choose if
+  all positions in the set of aligned sequences should be used or if a
+  mask should be applied. If so selected, define a mask as a string of
+  symbols, in which symbol - means ignore and any other  symbol  means
+  use. E.g. xx-x--abc means use all positions except 3,5 and 6.
+
+        The program will calculate weights as the frequencies of  each
+  amino  acid  at  each  unmasked  position  in  the  set  of  aligned
+  sequences. These weights are then applied  to  the  set  of  aligned
+  sequences  to  give  a  range  of  "observed"  scores.  The mean and
+  standard deviation of these scores is displayed. The user  is  asked
+  to  supply  several  values  to  be  used  when the weight matrix is
+  applied to other sequences: a cutoff score  (by  default,  the  mean
+  minus  3  standard  deviations),  a  top score for scaling graphical
+  results (by default, the mean plus 3  standard  deviations),  and  a
+  position  to  identify  (this  means that if a particular amino acid
+  within the motif is used as a "landmark",  such  as  the  G  of  the
+  helix-turn-helix  motif, then its position will be marked in plots).
+  All these values are stored along with the  weight  matrix.  Finally
+  supply the name of a file to contain the weight matrix.
+
+        Weight matrices can be  "rescaled"  using  a  set  of  aligned
+  sequences  in much the same ways as a matrix is created. The purpose
+  is to redefine the cutoff scores, and rescaling does not  alter  any
+  other values in the weight matrix file.
+
+        The methods have changed considerably but were first  outlined
+  in  Staden,  R.  Nucl.  Acid  Res.  12  505-519 1984, and Staden, R.
+  Genetic engineering: principles and methods vol 7,  Edited  by  J.K.
+  Setlow and A. Hollaender, Plenum publishing corp., 1985.
+
+        The methods have always had to deal with the problem of zeroes
+  in  the  matrices.  The  current  versions  employ  "Laplaces Law of
+  Succession" in which 1 is added to each term.
+
+        It is now possible to  apply  a  mask  to  a  set  of  aligned
+  sequences  in  order  to  give  weight  to  selected positions only.
+  Sequences have superimposed functions: some parts may be of  general
+  structural  importance  and  give  rise to an overall framework, and
+  other parts give specificity and hence are not common; we  may  want
+  to use a set of aligned sequences to define a motif, but want to use
+  only the framework positions.  Alternatively we may want to pick out
+  only  those  parts  of  a  set  of  aligned  sequences  that  give a
+  particular property, and to ignore other similarities that  are  due
+  to  some  other  property and which could obscure the pattern we are
+  interested in. The ability to define a mask allows certain positions
+  to  be  used  in  the  motif and others to be ignored, and yet still
+  permits the use of a set of aligned sequences to calculate weights.
+
+        Typical dialogue is shown below.
+  ? Menu or option number=20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =2
+  ? Name of aligned sequences file=[rs.motifs]hth.seq
+       1 QESVADKMGMGQSGVGALFN LAMBDA.REP
+       2 QTKTAKDLGVYQSAINKAIH LAMBDA.CRO
+       3 QAALGKMVGVSNVAISQWQR P22.REP
+       4 QRAVAKALGISDAAVSQWKE P22.CRO
+       5 QAELAQKVGTTQQSIEQLEN 434.REP
+       6 QTELATKAGVKQQSIQLIEA 434.CRO
+       7 RQEIGQIVGCSRETVGRILK CAP
+       8 RGDIGNYLGLTVETISRLLG Fnr
+       9 LYDVAEYAGVSYQTVSRVVN LAC.R
+      10 IKDVARLAGVSVATVSRVIN GAL.R
+      11 TEKTAEAVGVDKSQISRWKR LAMBDA.CII
+      12 QRKVADALGINESQISRWKG P22.CI
+      13 KEEVAKKCGITPLQVRVWCN MAT.ALPHA
+      14 TRKLAQKLGVEQPTLYWHVK TETR.TN10
+      15 TRRLAERLGVQQPALYWHFK TETR.pSC1
+      16 QRELKNELGAGIATITRGSN TRP.REP
+      17 RQQLAIIFGIGVSTLYRYFP H-INVERSN
+      18 ATEIAHQLSIARSTVYKILE TN3.RESOL
+      19 ASHISKTMNIARSTVYKVIN GD.RESOLV
+      20 IASVAQHVCLSPSRLSHLFR ARA.C
+      21 RAEIAQRLGFRSPNAAEEHL LEX.R
+  Length of motif    20
+  ? (y/n) (y) Sum logs of weights
+  ? (y/n) (y) Use all motif positions n
+  x means use, - means ignore
+  e.g. xx-x---x-x means use positions 1,2,4,8,10
+  ? Mask=--xxxxxxxxxxxx------
+   Applying weights to input sequences
+     1      -57.143 QESVADKMGMGQSGVGALFN
+     2      -55.087 QTKTAKDLGVYQSAINKAIH
+     3      -58.079 QAALGKMVGVSNVAISQWQR
+     4      -54.986 QRAVAKALGISDAAVSQWKE
+     5      -55.181 QAELAQKVGTTQQSIEQLEN
+     6      -55.874 QTELATKAGVKQQSIQLIEA
+     7      -56.692 RQEIGQIVGCSRETVGRILK
+     8      -57.722 RGDIGNYLGLTVETISRLLG
+     9      -55.363 LYDVAEYAGVSYQTVSRVVN
+    10      -55.769 IKDVARLAGVSVATVSRVIN
+    11      -56.786 TEKTAEAVGVDKSQISRWKR
+    12      -55.833 QRKVADALGINESQISRWKG
+    13      -56.279 KEEVAKKCGITPLQVRVWCN
+    14      -53.125 TRKLAQKLGVEQPTLYWHVK
+    15      -55.833 TRRLAERLGVQQPALYWHFK
+    16      -58.651 QRELKNELGAGIATITRGSN
+    17      -56.749 RQQLAIIFGIGVSTLYRYFP
+    18      -56.986 ATEIAHQLSIARSTVYKILE
+    19      -60.618 ASHISKTMNIARSTVYKVIN
+    20      -58.988 IASVAQHVCLSPSRLSHLFR
+    21      -58.002 RAEIAQRLGFRSPNAAEEHL
+  Top score     -53.125  Bottom score     -60.618
+  Mean     -56.655  Standard deviation       1.617
+  Mean minus 3.sd     -61.505  Mean plus 3.sd     -51.804
+  ? Cutoff score (-999.00-9999.00) (-61.51) =
+  ? Top score for scaling plots (-61.51-999.00) (-51.80) =
+  ? Position to identify (0-20) (1) =9
+  ? Title=hth
+  ? Name for new weight matrix file=1.wts
+
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =
+
+  ? Motif weight matrix file=1.wts
+   hth
+  ? (y/n) (y) Use frequencies as weights
+  ? (y/n) (y) Plot results n
+        5    -61.46 STEISELIKQRIAQFNVVSE
+       13    -58.93 KQRIAQFNVVSEAHNEGTIV
+       21    -60.42 VVSEAHNEGTIVSVSDGVIR
+       57    -59.39 GNRYAIALNLERDSVGAVVM
+       59    -61.47 RYAIALNLERDSVGAVVMGP
+       79    -59.90 YADLAEGMKVKCTGRILEVP
+       88    -61.41 VKCTGRILEVPVGRGLLGRV
+      104    -60.38 LGRVVNTLGAPIDGKGPLDH
+      127    -60.13 SAVEAIAPGVIERQSVDQPV
+      129    -59.91 VEAIAPGVIERQSVDQPVQT
+      133    -60.79 APGVIERQSVDQPVQTGYKA
+      139    -61.12 RQSVDQPVQTGYKAVDSMIP
+      175    -58.90 KTALAIDAIINQRDSGIKCI
+      191    -60.95 IKCIYVAIGQKASTISNVVR
+      195    -60.94 YVAIGQKASTISNVVRKLEE
+      215    -60.66 HGALANTIVVVATASESAAL
+      254    -60.56 EDALIIYDDLSKQAVAYRQI
+      260    -60.08 YDDLSKQAVAYRQISLLLRR
+      297    -61.00 LLERAARVNAEYVEAFTKGE
+      314    -61.29 KGEVKGKTGSLTALPIIETQ
+      330    -60.49 IETQAGDVSAFVPTNVISIT
+      363    -57.63 GIRPAVNPGISVSRVGGAAQ
+      365    -61.48 RPAVNPGISVSRVGGAAQTK
+      371    -61.02 GISVSRVGGAAQTKIMKKLS
+      382    -57.90 QTKIMKKLSGGIRTALAQYR
+      394    -60.07 RTALAQYRELAAFSQFASDL
+      424    -59.95 GQKVTELLKQKQYAPMSVAQ
+      430    -58.89 LLKQKQYAPMSVAQQSLVLF
+      432    -61.14 KQKQYAPMSVAQQSLVLFAA
+      438    -58.58 PMSVAQQSLVLFAAERGYLA
+      458    -61.06 DVELSKIGSFEAALLAYVDR
+      466    -61.00 SFEAALLAYVDRDHAPLMQE
+      483    -60.48 MQEINQTGGYNDEIEGKLKG
+      494    -60.61 DEIEGKLKGILDSFKATQSW
+
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=d20
+  X 1 Use weight matrix
+    2 Make weight matrix
+    3 Rescale weight matrix
+  ? 0,1,2,3 =
+
+  ? Motif weight matrix file=1.wts
+   hth
+  ? (y/n) (y) Use frequencies as weights
+  ? Cutoff score (-9999.00-9999.00) (-61.51) =-56.
+  ? (y/n) (y) Plot results n
+
+
+ @21. TX 3 @Calculate amino acid composition
+
+        This  function  calculates  the  amino  acid  composition  and
+  molecular weight for the active region.
+  ? Menu or option number=21
+   Sequence composition
+
+  A   C     S     T     P     A     G     N     D     E     Q     B     Z     H
+  N   3.   32.   23.   18.   57.   47.   16.   28.   31.   28.    0.    0.    7.
+  %   0.6   6.2   4.5   3.5  11.1   9.1   3.1   5.4   6.0   5.4   0.0   0.0   1.4
+  W  309. 2786. 2325. 1748. 4051. 2682. 1826. 3222. 4003. 3588.    0.    0.
+  960.
+
+  A   R     K     M     I     L     V     F     Y     W     -     X     ?
+  N  30.   24.   11.   40.   47.   41.   14.   15.    1.    0.    0.    0.    1.
+  %   5.8   4.7   2.1   7.8   9.1   8.0   2.7   2.9   0.2   0.0   0.0   0.0   0.2
+  W 4686. 3076. 1443. 4527. 5319. 4065. 2060. 2448.  186.    0.    0.    0.
+  0.
+  Total molecular weight=    55328.
+
+ @22. TX 3 4 @Plot hydrophobicity
+
+        This routine plots the hydrophobicity of each section  of  the
+  sequence  using  the hydrophobicity values of Kyte and Doolittle (J.
+  Mol. Biol. 157, 105-132 (1982)).  A window  of  size  span  is  slid
+  along the sequence and a sum calculated for each position.
+
+        If dialogue is requested select  a  span  length  and  a  plot
+  interval.
+
+        The diagrams are  on the same scale as Fig. 6 of the Kyte  and
+  Doolittle  paper  and  values of + and - 50 could be assigned to the
+  top and bottom of the diagram with corresponding values  in  between
+  (-40,-20,0,20,40 are shown in the paper).
+  ? Menu or option number=d22
+   Plot hydrophobicity
+  ? odd span length (1-101) (11) =
+  ? plot interval (1-101) (3) =
+
+   missing graphics
+ @23. TX 3 4 @Plot charge
+
+        This routine plots the charge of each section of the sequence.
+  A  window  of  size  span  is  slid  along  the  sequence  and a sum
+  calculated for each position. Amino acids are assigned charges of 1,
+  -1 or 0.
+
+        If dialogue is requested select  a  span  length  and  a  plot
+  interval.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=d23
+   Plot charge
+  ? odd span length (1-101) (11) =
+  ? plot interval (1-101) (3) =
+
+   missing graphics
+
+ @24. TX 4 @Plot robson prediction
+
+        This routine uses the method of Garnier J, Osguthorpe D J, and
+  Robson  B.  (1978)  J.  Mol.  Biol. 120, 97-120 to predict secondary
+  structures. The method divides protein secondary structures  into  4
+  classes:  helix,  extended  (usually referred to as sheet), turn and
+  coil. The routine calculates the likelihood that each segment of the
+  sequence  lies  in  each  of  these  classes.  Results are presented
+  graphically or listed.
+
+        If dialogue is requested  choose  between  plotted  or  listed
+  output.
+
+        Each residue has a certain probability of being found in  each
+  of  the  4  classes.  This probability depends both on its own amino
+  acid type and also the 8 amino acids found to either side along  the
+  protein  chain.  Four  tables of weights, each 20 by 17 elements are
+  used to calculate the likelihood that each residue along  the  chain
+  falls  into  one  of  the four classes of structure. The most likely
+  structure at each point is the one with the highest score.  The four
+  values are plotted in strips labelled H, E, T and C.  Below, a strip
+  labelled  D  for  decision  is  divided  into  four   levels,   each
+  corresponding  to  one  of  the  four  structure types. Their top to
+  bottom order is the same as that for the strips above, i.e C, T,  E,
+  and  H.  For  each  residue  the  program measures which of the four
+  likelhoods is highest. It places a single dot at  the  mid-point  of
+  the  corresponding  strip,  and also at the appropriate level in the
+  strip labelled D.
+
+        It should be noted that the method, when tested  by  Kabsch  W
+  and  Sander C, (1983) Febs. Lett. 155 (179-182), although one of the
+  better ones, was correct for only about 56% of residues.
+
+        Typical dialogue follows.
+  ? Menu or option number=d24
+   Plot Robson secondary structure predictions
+  ? (y/n) (y) Plot results n
+
+       9 S   217   -7  -39   15
+      10 E   226    5  -27  -39
+      11 L   233   -7  -26  -15
+      12 I   229  -23    9    4
+      13 K   214   -8   10   -8
+      14 Q   178   42   19    5
+      15 R   131   54   16    3
+      16 I    86   42  -31  -23
+      17 A    55   52  -30  -15
+      18 Q    15   67    4   25
+      19 F   -34   86   47   74
+      20 N   -41   74   17  106
+      21 V   -16  118   -5  100
+      22 V    64   88    5  115
+      23 S    96   38   26  155
+      24 E   133  -25   13   96
+      25 A   118  -98   25  100
+      26 H   110 -150   37   86
+      27 N    57 -201   37   66
+      28 E    51 -140   11   -4
+      29 G     2  -77   37    9
+      30 T     2   28   28    7
+      31 I   -11  117  -21   22
+      32 V   -23  178  -55    5
+      33 S   -54  193  -14   35
+      34 V   -46  123    5   30
+      35 S   -54   53   51   80
+      36 D   -60    1   86   55
+      37 G   -66    8   57   49
+      38 V    -1  128  -30   -5
+      39 I    11  212  -56  -33
+      40 R    16  204  -44  -57
+   ...etc
+
+ @26. TX 4 @Draw a helix wheel
+
+        A helical representation of segments of the sequence is shown.
+  The  display  includes  a  schematic  of the helix showing the links
+  between residues, with each vertex numbered according  to  position;
+  the   sequence   element   at  each  vertex;  a  symbol  denoting  a
+  classification as hydrophobic(.), positively charged(+),  negatively
+  charged(-),  or  otherwise(  ).  The  residue  number  of  the first
+  sequence element in the current window  is  displayed  at  the  top-
+  left-hand  corner  of  the  diagram. Also at the top-left corner the
+  sequence in the current window is listed. Below this  is  the  total
+  hydrophobicity  and  hydrophobic  moment  for  the window calculated
+  according to Eisenberg et al J. Mol. Biol. 179, 125-142 (1984).
+
+        If dialogue is requested the user is asked for  the  angle  to
+  define  the  turn  between residues as seen looking along the helix,
+  and a window length. The window length can be up to 60, with default
+  18,  and  the angle has a default of 100 degrees. Note that 18 x 100
+  is 5 turns. When the option is selected the  first  segment  in  the
+  current  active region is displayed then the bell rings. If the user
+  types only return, the display will click  on  by  one  residue;  if
+  another number is typed, say N, then the display will click forwards
+  (or backwards if N is negative) by N residues. If the wheel runs off
+  either end of the sequence the option will be exited.
+
+        Typical dialogue follows.
+  ? Menu or option number=d26
+  ? Angle (1-130) (100) =
+  ? Window (1-60) (18) =
+
+   missing graphics
+
+ @25. TX 3 4 @Plot hydrophobic moment
+
+        This  routine  plots  hydrophobic  moment  and  hydrophobicity
+  according  to Eisenberg et al J. Mol. Biol. 179, 125-142 (1984). The
+  mean hydrophobicity per residue in the window is plotted on a  scale
+  -1.0  to 1.5, and the mean hydrophobic moment per residue on a scale
+  0.0 to 1.5. The hydrophobicity is shown in the top  frame  with  the
+  hydrophobic  moment  below.   The plot is arranged so that the value
+  shown at position x  represents  the  mean  value  for  residues  x-
+  window+1 to x, where window is the window length.
+
+        If dialogue is requested the user can select a window  length,
+  and the  angle used for the hydrophobic moment calculation.
+
+        Note that according  to  Eisenberg  et  al,  in  transmembrane
+  proteins   an  "initiator"  is  required.  This  is  either  a  very
+  hydrophobic  single  helix  with  <H>  >=0.68,   or   a   moderately
+  hydrophobic  pair  of helices whose <H> sum to >= 1.1. Other helices
+  are then accepted as transmembrane if their <H> >= 0.42
+
+        The following rules are claimed: if <H> < 0.51 and points  lie
+  below  the  line  <M>  = -0.392 + 0.603x <H> they are "globular", if
+  they lie above this line they are "surface". If <H> > 0.51 and  they
+  lie  above  the  line <M> = 0.6 - 0.342x<H> they are "monomeric", if
+  above "multimeric".
+
+        Typical dialogue follows.
+
+  ? Menu or option number=d25
+  ? Angle (1-130) (100) =
+  ? Window (1-60) (18) =
+  ? Plot interval (1-101) (3) =
+
+   missing graphics
+
+
+ @27. TX 1 @Back translate to dna
+
+        This routine back translates protein sequences into DNA  using
+  the  standard  genetic  code. The level of redundancy can be plotted
+  and the backtranslation saved to a file.
+
+        The translation can use either the IUB symbols shown below, or
+  a  set  of codon preferences. If a set of codon preferences are used
+  they must conform to the format of  codon  tables  produced  by  the
+  nucleotide  analysis  program, and the back translation will contain
+  the favoured codons. If there is no favoured codon the  IUB  symbols
+  will  be  employed. The window length for plotting the redundancy is
+  in codons.
+
+        The program will plot the redundancy along  the  sequence  and
+  hence can be used to find the best sequences to use as primers. Note
+  that the program plots the inverse, and so the higher the  plot  the
+  LESS  redundant the sequence. For primers look for peaks rather than
+  troughs.
+
+        The DNA sequence can be saved to a file and analysed using the
+  nucleotide  analysis  program.   Depending  on the application it is
+  often useful to produce a back translation using  both  a  table  of
+  codon preferences and one using the IUB symbols. This is because the
+  restriction enzyme search program can distinguish  between  definite
+  and  possible  cuts  in  the  sequence.   These matches are what the
+  program  terms  "definite  matches"  and  are  ones  in  which   the
+  specification  of  the  recognition  sequence corresponds exactly to
+  that of the back translation. The program will  also  find  what  it
+  terms   "possible  matches"  which  are  ones  that  depend  on  the
+  particular codons chosen for each amino acid.  These  are  sites  at
+  which  recognition sequences could be engineered to produce a cut in
+  the  DNA  without  changing  the  amino  acid,  but  which  are  not
+  necessarily found in the original sequence.
+
+
+              NC-IUB SYMBOLS
+
+        A,C,G,T
+        R        (A,R)        'puRine'
+        Y        (T,C)        'pYrimidine'
+        W        (A,T)        'Weak'
+        S        (C,G)        'Strong'
+        M        (A,C)        'aMino'
+        K        (G,T)        'Keto'
+        H        (A,T,C)      'not G'
+        B        (G,C,T)      'not A'
+        V        (G,A,C)      'not T'
+        D        (G,A,T)      'not C'
+        N        (G,A,C,T)    'aNy'
+
+   Typical dialogue follows.
+
+  ? Menu or option number=d27
+   Back translate
+  ? (y/n) (y) No codon preference
+  ? (y/n) (y) Plot redundancy n
+  ? (y/n) (y) Save DNA to disk
+  ? File name for DNA sequence=tt:
+  ATGCARYTNAAYWSNACNGARATHWSNGARYTNATHAARCARMGNATHGCNCARTTYAAY
+  GTNGTNWSNGARGCNCAYAAYGARGGNACNATHGTNWSNGTNWSNGAYGGNGTNATHMGN
+  ATHCAYGGNYTNGCNGAYTGYATGCARGGNGARATGATHWSNYTNCCNGGNAAYMGNTAY
+  GCNATHGCNYTNAAYYTNGARMGNGAYWSNGTNGGNGCNGTNGTNATGGGNCCNTAYGCN
+  GAYYTNGCNGARGGNATGAARGTNAARTGYACNGGNMGNATHYTNGARGTNCCNGTNGGN
+  MGNGGNYTNYTNGGNMGNGTNGTNAAYACNYTNGGNGCNCCNATHGAYGGNAARGGNCCN
+  YTNGAYCAYGAYGGNTTYWSNGCNGTNGARGCNATHGCNCCNGGNGTNATHGARMGNCAR
+  WSNGTNGAYCARCCNGTNCARACNGGNTAYAARGCNGTNGAYWSNATGATHCCNATHGGN
+  MGNGGNCARMGNGARYTNATHATHGGNGAYMGNCARACNGGNAARACNGCNYTNGCNATH
+  GAYGCNATHATHAAYCARMGNGAYWSNGGNATHAARTGYATHTAYGTNGCNATHGGNCAR
+  AARGCNWSNACNATHWSNAAYGTNGTNMGNAARYTNGARGARCAYGGNGCNYTNGCNAAY
+  ACNATHGTNGTNGTNGCNACNGCNWSNGARWSNGCNGCNYTNCARTAYYTNGCNMGNATG
+  CCNGTNGCNYTNATGGGNGARTAYTTYMGNGAYMGNGGNGARGAYGCNYTNATHATHTAY
+  GAYGAYYTNWSNAARCARGCNGTNGCNTAYMGNCARATHWSNYTNYTNYTNMGNMGNCCN
+  CCNGGNMGNGARGCNTTYCCNGGNGAYGTNTTYTAYYTNCAYWSNMGNYTNYTNGARMGN
+  GCNGCNMGNGTNAAYGCNGARTAYGTNGARGCNTTYACNAARGGNGARGTNAARGGNAAR
+  ACNGGNWSNYTNACNGCNYTNCCNATHATHGARACNCARGCNGGNGAYGTNWSNGCNTTY
+  GTNCCNACNAAYGTNATHWSNATHACNGAYGGNCARATHTTYYTNGARACNAAYYTNTTY
+  AAYGCNGGNATHMGNCCNGCNGTNAAYCCNGGNATHWSNGTNWSNMGNGTNGGNGGNGCN
+  GCNCARACNAARATHATGAARAARYTNWSNGGNGGNATHMGNACNGCNYTNGCNCARTAY
+  MGNGARYTNGCNGCNTTYWSNCARTTYGCNWSNGAYYTNGAYGAYGCNACNMGNAARCAR
+  YTNGAYCAYGGNCARAARGTNACNGARYTNYTNAARCARAARCARTAYGCNCCNATGWSN
+  GTNGCNCARCARWSNYTNGTNYTNTTYGCNGCNGARMGNGGNTAYYTNGCNGAYGTNGAR
+  YTNWSNAARATHGGNWSNTTYGARGCNGCNYTNYTNGCNTAYGTNGAYMGNGAYCAYGCN
+  CCNYTNATGCARGARATHAAYCARACNGGNGGNTAYAAYGAYGARATHGARGGNAARYTN
+  AARGGNATHYTNGAYWSNTTYAARGCNACNCARWSNTGG---
+
+
+ @28. TX 5 @Search for patterns of motifs
+
+        This option searches for patterns of motifs. Patterns  can  be
+  defined  interactively  or read from files. Results can be displayed
+  in several ways in both graphical and textual form. Used  to  create
+  pattern  files  for  searching  libraries.  The  option is extremely
+  flexible and  consequently  the  following  documentation  is  quite
+  lengthy.  However the routine is capable of searching for almost any
+  known pattern. In addition  the  flexibility  does  not  necessitate
+  difficulty  of  use,  and  the  userinterface  has  been  simplified
+  considerably since the methods were first published.
+
+        Users should refer to the "typical dialogue" shown  below  for
+  the most helpful information on using the program.
+
+        There  are  currently  four  ways  to  display  the   matching
+  patterns:  1=each individual motif and its position is listed; 2=all
+  the sequence between, and including  the  two  outermost  motifs  is
+  listed;  3=graphical,  with  a vertical line marking the position of
+  the leftmost motif; 4 = EMBL feature table format, where the  KEYNAM
+  field  is  the motif name, the FROM and TO fields denote the ends of
+  the match, and the DESCRIPTION field is "Program".
+
+        When it is defined for  the  first  time  a  pattern  must  be
+  entered  interactively  at the keyboard, but the pattern description
+  can be saved to a file. This file can be  used  for  all  subsequent
+  searches.
+
+        When defining a pattern interactively select a motif class and
+  the program will request the required inputs.
+
+        The program gives each motif an identifying name  and  number.
+  For  motifs  other than the first, a range of allowed positions must
+  be defined (Note that sets of motifs included using the OR  operator
+  will  all  be  given  the  same  range, and so the program will only
+  request range values for the first  motif  in  any  such  set).   To
+  specify  the  allowed  range  for  a  motif the user must supply the
+  following: the identifying number of the motif,  relative  to  which
+  the   current  motifs  positions  are  to  be  defined  (termed  the
+  "reference motif"); a "relative start position"  and  a  range.  The
+  relative  start  position  can  be  negative or positive. A negative
+  start position means that although the reference motif  is  searched
+  for  first,  the  current  motif  can  be found to its left.  A zero
+  relative start position means their left ends are superimposed.  The
+  default  start  position is to butt-joint the motif to righthand end
+  of the  "reference  motif".  The  range  is  "the  number  of  extra
+  positions" that the motif can take.
+
+        The program will  display  the  probability  of  finding  each
+  motif.  These  values  are presented in the following form: .1234E-5
+  means 0.1234 times 10 to the power -5.
+
+        After the pattern has been defined, the program  will  type  a
+  description of it on the screen. It will then allow the user to give
+  an overall cutoff score and overall probability cutoff.
+
+        Typical dialogue  for  all  the  different  motif  classes  is
+  displayed below.
+
+  ? Menu or option number=28
+    Pattern searcher
+  ? (y/n) (y) Read pattern from keyboard
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =
+  ? Motif name=aa
+  ? String=aa
+  Probability of score     2.0000 = 0.123E-01
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =2
+  ? Motif name=pmatch
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-1) (1) =
+  ? Relative start position (-1000-1000) (3) =
+  ? Number of extra positions (0-1000) (0) =
+  ? String=qqq
+  ? Minimum matches (1.00-3.00) (3.00) =2
+  Probability of score     2.0000 = 0.858E-02
+    1 Exact match
+  X 2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =3
+  ? Motif name=sm
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-2) (2) =
+  ? Relative start position (-1000-1000) (4) =
+  ? Number of extra positions (0-1000) (0) =
+  ? String=wqa
+  ? Minimum score (11.00-53.00) (53.00) =36
+  Probability of score    36.0000 = 0.531E-02
+    1 Exact match
+    2 Percentage match
+  X 3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =4
+  ? Motif name=hth
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-3) (3) =
+  ? Relative start position (-1000-1000) (4) =
+  ? Number of extra positions (0-1000) (0) =
+  ? Weight matrix file name=hth
+   HELIX TURN HELIX PABO SAUER WEIGHTS 17-11-87
+  Probability of score   -51.5860 = 0.230E-04
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+  X 4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =5
+  ? Motif name=repeat
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-4) (4) =
+  ? Relative start position (-1000-1000) (21) =
+  ? Number of extra positions (0-1000) (0) =3
+  ? Repeat length (1-60) (6) =3
+  ? Minimum gap (0-60) (0) =
+  ? Maximum gap (0-60) (0) =2
+  ? Minimum score (11.00-60.00) (36.00) =
+  Probability of score    36.0000 = 0.445E-01
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+  X 5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =6
+  ? Motif name=mset
+  X 1 And
+    2 Or
+    3 Not
+  ? 0,1,2,3 =
+  ? Number of reference motif (1-5) (5) =
+  ? Relative start position (-1000-1000) (1) =
+  ? Number of extra positions (0-1000) (0) =
+  X 1 Keyboard input
+    2 File input
+  ? 0,1,2 =
+  Separate sets with commas
+  ? String=AVL,AST,,WYRF
+  ? Minimum matches (1.00-4.00) (4.00) =3
+  Probability of score     3.0000 = 0.718E-02
+    1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+  X 6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =7
+  ? (y/n) (y) Save pattern in a file
+  ? Pattern definition file=EXAM.PAT
+  Motif  6 needs a file name to store set as a weight matrix
+  ? Weight matrix file name=DEMO.WTS
+  Weight matrix needs a title
+  ? Title=Demonstration class 6 weight matrix
+
+  Pattern description
+
+  Motif  1 named aa       is of class    1
+  Which is an exact match to the string
+  aa
+  Motif  2 named pmatch   is of class    2
+  which is a match of score     2. to the string
+  qqq
+  and the N-terminal residue can take positions      3 to       3
+  relative to the N-terminal end of motif   1
+  It is anded with the previous motif.
+  Motif  3 named sm       is of class    3
+  which is a match of score    36. to the string
+  wqa
+  and the N-terminal residue can take positions      4 to       4
+  relative to the N-terminal end of motif   2
+  It is anded with the previous motif.
+  Motif  4 named hth      is of class    4
+  Which is a match to a weight matrix with score -51.586
+  and the N-terminal residue can take positions      4 to       4
+  relative to the N-terminal end of motif   3
+  It is anded with the previous motif.
+  Motif  5 named repeat   is of class    5
+  Which is a repeat with repeat length    3 and score    36.
+  The loop-out can have sizes      0 to      2
+  and the N-terminal residue can take positions     21 to      24
+  relative to the N-terminal end of motif   4
+  It is anded with the previous motif.
+  Motif  6 named mset     is of class    6
+  Which is membership of a set with score   3.000
+  It is anded with the previous motif.
+  Probability of finding pattern = 0.4109E-14
+  Expected number of matches  = 0.2539E-10
+  ? Maximum pattern probability (0.00-1.00) (1.00) =
+  ? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+   Select display mode
+  X 1 Motif by motif
+    2 Inclusive
+    3 Graphical
+    4 EMBL feature table
+  ? 0,1,2,3,4 =
+   Searching
+
+  Total matches found      0
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=6
+  Page through text files
+  ? Name of file to read=exam.pat
+   A1          aa       Class
+   aa
+   @ End of string
+   A2          pmatch   Class
+        1      Relative motif
+        3      Relative start position
+        0      Number of extra positions
+   qqq
+   @ End of string
+     2.00000   Cutoff
+   A3          sm       Class
+        2      Relative motif
+        4      Relative start position
+        0      Number of extra positions
+   wqa
+   @ End of string
+    36.00000   Cutoff
+   A4          hth      Class
+        3      Relative motif
+        4      Relative start position
+        0      Number of extra positions
+  hth                                      File name
+   A5          repeat   Class
+        4      Relative motif
+       21      Relative start position
+        3      Number of extra positions
+        3      Length
+        0      Minimum loop
+        2      Maximum loop
+    36.00000   Cutoff
+   A6          mset     Class
+        5      Relative motif
+        1      Relative start position
+        0      Number of extra positions
+  DEMO.WTS                                 File name
+  End of file
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=6
+  Page through text files
+  ? Name of file to read=demo.wts
+   Demonstration class 6 weight matrix
+        4     0     3.000     4.000
+   P   1   2   3   4
+   N   0   0   0   0
+   C   0   0   0   0
+   S   0   1   0   0
+   T   0   1   0   0
+   P   0   0   0   0
+   A   1   1   0   0
+   G   0   0   0   0
+   N   0   0   0   0
+   D   0   0   0   0
+   E   0   0   0   0
+   Q   0   0   0   0
+   B   0   0   0   0
+   Z   0   0   0   0
+   H   0   0   0   0
+   R   0   0   0   1
+   K   0   0   0   0
+   M   0   0   0   0
+   I   0   0   0   0
+   L   1   0   0   0
+   V   1   0   0   0
+   F   0   0   0   1
+   Y   0   0   0   1
+   W   0   0   0   1
+  End of file
+  Menus and their numbers are
+  m0 = This menu
+  m1 = General
+  m2 = Screen control
+  m3 = Statistical analysis of content
+  m4 = Structure
+  m5 = Search
+   ? = Help
+   ! = Quit
+  ? Menu or option number=28
+    Pattern searcher
+  ? (y/n) (y) Read pattern from keyboard
+  X 1 Exact match
+    2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =2
+  ? Motif name=avlst
+  ? String=avlst
+  ? Minimum matches (1.00-5.00) (5.00) =3
+  Probability of score     3.0000 = 0.394E-02
+    1 Exact match
+  X 2 Percentage match
+    3 Cut-off score and score matrix
+    4 Cut-off score and weight matrix
+    5 Direct repeat
+    6 Membership of set
+    7 Pattern complete
+  ? 0,1,2,3,4,5,6,7 =7
+  ? (y/n) (y) Save pattern in a file n
+
+  Pattern description
+
+  Motif  1 named avlst    is of class    2
+  which is a match of score     3. to the string
+  avlst
+  Probability of finding pattern = 0.3941E-02
+  Expected number of matches  = 0.2030E+01
+  ? Maximum pattern probability (0.00-1.00) (1.00) =
+  ? Minimum pattern score (-9999.00-9999.00) (-9999.00) =
+   Select display mode
+  X 1 Motif by motif
+    2 Inclusive
+    3 Graphical
+    4 EMBL feature table
+  ? 0,1,2,3,4 =4
+   Searching
+
+  FT   avlst       152    156       Program
+  Total matches found      1
+  Minimum and maximum observed scores        3.00        3.00
+
+
+        General notes
+
+        These methods allow users to define  and  search  for  complex
+  patterns  of  motifs  defined as single objects.  The programs allow
+  individual DNA motifs to be defined in  eight  different  ways,  and
+  protein  motifs  in  six.  Motifs  are  combined,  using the logical
+  operators AND, OR and NOT, to describe a pattern. The  pattern  also
+  specifies   the  ranges  of  allowed  relative  separations  of  the
+  individual motifs.
+
+        First some definitions.
+
+        A MOTIF is a contiguous subsequence of fixed length.   At  its
+  simplest  it  could  be a single definite base or amino acid; a more
+  complex motif might be better represented as a consensus or a weight
+  matrix;  two  more-abstract  types  of motif are direct and inverted
+  repeats.
+
+        A PATTERN is a higher order of structure defined by a list  of
+  motifs.  The  motifs  in  a  pattern  are combined using the logical
+  operators AND, OR  and  NOT.  The  list  also  defines  the  allowed
+  relative  separations  of the motifs. In the current versions of the
+  programs up to 50 motifs can be combined into a single  pattern.  So
+  using these definitions there are two differences between motifs and
+  patterns: 1) the distances between  all  elements  of  a  motif  are
+  fixed,  but  the  separations  of parts of patterns can vary; 2) all
+  characters in a motif are defined using the same method (class), but
+  different  parts of a pattern can be defined in completely different
+  ways.
+
+        Each motif can be represented in 9 ways (known  as  the  motif
+  class):
+
+             MOTIF CLASSES
+  CLASS           DESCRIPTION
+   1       Exact match to a short defined sequence. The IUB symbols
+           can be used for DNA sequences.
+   2       Percentage match to a defined short sequence. In nucleic acids,
+           the IUB symbols can be used.
+   3       Match to a defined sequence, using a score matrix and cutoff
+           score. The DNA matrix (see option 18) gives scores to IUB symbols
+           depending on their level of redundancy. MDM78 is used for proteins.
+   4       Match to a weight matrix with cutoff score.
+   5       As class 4 but on the complementary strand.
+   6       Inverted repeat or stem-loop. Fixed stem length, range of
+           loop sizes, and cutoff score using A-T, G-C=2; G-T=1.
+   7       Exact match to short sequence but with a defined step size.
+   8       Direct repeat. Fixed repeat length, range of loop-out sizes,
+           cutoff score, and score matrix (for protein sequences MDM78 and
+           for nucleic acids an identity matrix).
+   9       Membership of a set. A list of sets of allowed amino acids for
+           each position in the motif. The sets are separated by commas(,).
+           For example IVL,,,DEKR,FYWILVM defines a motif of length 5 amino
+           acids in which one of I,V or L must be found in the first position,
+           then anything in the next two positions, D,E,K or R in the fourth
+           position and F,Y,W,I,L,V or M in the fifth. This class only applies
+           to protein sequences because for nucleic acids "membership of a
+  set"
+           can be achieved using IUB symbols.
+
+      Classes 1 - 4, 8 and 9 apply to protein sequences, and classes 1-8 to
+      nucleic acids.
+
+
+        Class 1: exact match.
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids, may include IUB symbols. All symbols must match.
+
+        Class 2: percentage match
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include  IUB  symbols.  The  minimum number of matching
+  characters must also be specified.
+
+        Class 3: match using a score matrix
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include IUB symbols. The motif is not compared directly
+  with the sequence  to  count  the  number  of  matching  characters.
+  Instead  a  matrix is used to provide a score for all possible pairs
+  of characters. The motif score for any position along  the  sequence
+  is  the  sum  of  the scores found by looking-up the scores for each
+  pair of aligned characters. A match  is  declared  if  some  minimum
+  score is achieved.
+
+        Class 4: weight matrix
+
+        The motif is defined by a table of values (called  weights  or
+  scores). The table gives a score for finding each possible character
+  at each position along the length of the  motif.  It  therefore  has
+  dimension  motif-length  x character-set-size, and allows us to give
+  different  scores  for  each  character  at  each  position.  It  is
+  equivalent  to  having  a  different  score matrix for each position
+  along the motif, and provides the most flexible and specific  method
+  of  defining  motifs. The weight matrices are created by program PIP
+  option 20 and stored as files. The file contains the values for each
+  position, as well as an overall minimum score. There are two ways in
+  which these values can be used to calculate an overall score for any
+  section  of  the  sequence. The simplest way is to add the values in
+  the file. (This  means  that  the  highest  possible  score  can  be
+  calculated  by adding the top value at each column position, and the
+  lowest by adding the bottom value.)  The normal  way  of  using  the
+  values  in  the  file  is  as follows. First the programs divide the
+  values in each column by the column total so that they  sum  to  1.0
+  Then  the  natural logs of these values are used as scores. When the
+  matrix is applied to a sequence these logarithmic values are  summed
+  (which  is  of  course  equivalent  to multiplying the frequencies).
+  Note that using the natural logs of the frequencies as  weights  and
+  adding  them  means  that the overall cutoff score must be less than
+  zero, whereas if the original values in the weight matrix  file  are
+  added,  the  cutoff  score  will  be  greater  than zero. The search
+  routines therefore decide whether the user wants to  add  values  or
+  multiply  frequencies by examining the value of the cutoff score: it
+  will add if the  cutoff  is  greater  than  zero  and  add  logs  of
+  frequencies  if  it is less than zero.  Hence we effectively get two
+  motif classes in one. The program PIP, when creating  weight  matrix
+  files,  will  ask  the  user  whether  the scores should be added or
+  multiplied. If the values in the table  have  been  defined  without
+  using a set of aligned sequences it is easier for the user to choose
+  a cutoff score if the values are added.
+
+        Class 5: complement of weight matrix
+
+        The motif is defined by  a  weight  matrix,  but  the  program
+  searches for its complement.
+
+        Class 6: inverted repeat, or stem-loop
+
+        The motif is defined by a repeat length, a minimum score and a
+  range  of  loop  sizes.  The scores are A-T=2, G-C=2, G-T=1, else=0.
+  The loop sizes are defined by a minimum and  maximum  distance  from
+  the  3'  end  of  the  stem.  For a stem-loop these will be positive
+  numbers. For example to define a stem of length  8  and  loop  sizes
+  varying  from  3 to 5, the stem would be set to 8, the minimum start
+  distance to 3 and the maximum to 5. To define an inverted repeat the
+  minimum  distance  will  be  negative.  For  example  stem length=9,
+  minimum distance=-9, and  maximum  distance=-8  will  find  inverted
+  repeats  of lengths 9 and 10. E.g. AAAAATTTT and AAAAATTTTT would be
+  found, the first having a base at  its  centre,  the  second  having
+  none.
+
+        Class 7: exact match, defined step size.
+
+        The motif is defined by a short sequence,  which  for  nucleic
+  acids,  may  include  IUB symbols. All symbols must match. The class
+  differs from class 1 in that searches will move  in  steps  of  some
+  given  size. For example we could search for a certain codon and use
+  a step size of 3 and hence keep in a single reading frame.
+
+        Class 8: direct repeat
+
+        The motif is defined by a repeat length, a minimum score and a
+  range  of loop sizes. The scores are defined using MDM78 for protein
+  sequences and an identity matrix for nucleic acids.  The loop  sizes
+  are defined by a minimum and maximum distance from the 3' end of the
+  stem.
+
+        Class 9: membership of a set
+
+        This motif class is for protein sequences. It  is  defined  by
+  lists  of  allowed amino acids for each position in the motif, and a
+  cut-off score.  Positions at which any amino acid can occur are left
+  blank.  All allowed amino acids for each position give a score of 1.
+  The motifs can be defined in two ways: either typed at the  keyboard
+  or  read in as a weight-matrix-like file.  When the motif is defined
+  at the keyboard the sets of allowed amino  acids  are  separated  by
+  commas(,).  For example IVL,,,DEKR,FYWILVM defines a motif of length
+  5 amino acids in which one of I,V or L must be found  in  the  first
+  position, then anything in the next two positions, D,E,K or R in the
+  fourth position and F,Y,W,I,L,V or M in the fifth.  To specify  that
+  the  whole motif must match a score of 3 would be required (i.e. one
+  of the allowed amino acids must be  found  for  each  of  the  three
+  defined  positions).  If the motif is read from a file the file must
+  have been written by program PIP, or have been saved by the  pattern
+  searching  routines.  If  the  user elects to save a pattern, and it
+  includes class 9 motifs typed at the keyboard, then the program will
+  save  the  class  9 motifs as weight matrix files. Therefore it will
+  request file names for each motif of this class. If the motif  given
+  above  as  an example were saved the weight matrix file would have 5
+  columns.  The first column would contain zeroes except for the I,  V
+  and  L  rows which would be set to 1; the next two columns would all
+  be zero; the next would be zero except for  the  D,E,K  and  R  rows
+  which  would  be  1;  the  final  column  would  contain 1's in rows
+  F,Y,W,I,L,V and M, with the rest zero.
+
+        The logical operator (AND, OR or NOT) used to add  each  motif
+  to  the  pattern  is  specified by preceding the class number by the
+  letters A, O or N. A = AND, O = OR, N = NOT.  The default is  A,  so
+  N2  means include, using the NOT operator, a class 2 motif; O2 means
+  include, using the OR operator, a class 2 motif; both A2 and 2  mean
+  include, using the AND operator, a class 2 motif.
+
+        Range setting.
+
+        The motifs in a pattern are numbered according to their  order
+  in  the list. Apart from the first motif in a pattern all motifs are
+  given a range of allowed positions relative to a  motif  further  up
+  the  list.  For example suppose we have a pattern defined by A AND B
+  AND C AND D.  Motif A can occur anywhere, but B must have its  range
+  of  allowed  positions  defined relative to the position of motif A,
+  and C's positions  can  be  defined  relative  to  either  A  or  B,
+  depending  on  which  is most convenient, and likewise D's positions
+  can be relative to A or B or C.
+
+        Notice that the positions of motifs can be defined relative to
+  more  than one motif. Suppose we have a pattern consisting of motifs
+  A, B and C, and that B occurs 5-10 residues right of A, C occurs  5-
+  10  residues  right  of B, and also C is never more than 15 residues
+  from A. Then it is quite consistent  with  the  methods  to  include
+  motif C into the pattern twice using the AND operator: once relative
+  to A and once relative to B. This will define the  relative  spacing
+  and  the  ORDER  of the motifs in the pattern. (If we simply defined
+  the position of C relative to A it could be found to the left of B).
+
+        Motifs combined together using the OR operator are  all  given
+  the  same range. For example suppose we had a pattern A AND (B OR C)
+  AND (D OR E), then B and C each have the same range,  and  D  and  E
+  also  have  the same range as one another. The range for D and E can
+  be relative to A or to B.
+
+        Motifs cannot have their ranges  defined  relative  to  motifs
+  that  are included using the NOT operator. For example if we had the
+  pattern A NOT B AND C, then the range for  C  can  only  be  defined
+  relative to motif A.
+
+        Speed can be gained by arranging the order of  the  motifs  so
+  that  those higher up the list are of types that can be searched for
+  rapidly and that are also unlikely to be found.
+
+        Motifs combined by the OR operator are  alternatives:  if  any
+  one  of a set of motifs combined by the OR operator is found, then a
+  match is declared. All alternatives will be reported. For example if
+  we  had a pattern defined by A AND (B OR C), then all places where A
+  occurs and B is found within range, and all places where A is  found
+  and C is found within range will be reported. A typical use would be
+  where we might allow a motif to appear on either strand of  the  DNA
+  sequence.  For  example  a  weight matrix representing the heatshock
+  element could be used in a pattern which  included  heatshock  as  a
+  motif  class  4  combined  using the OR operator with heatshock as a
+  motif class 5.
+
+        The probability calculations are performed for each  motif  as
+  it  is  defined.  If  an  overall  probability  cut-off is given the
+  calculation is repeated for each match  found.  To  achieve  maximum
+  searching  speed do not give an overall probability cut-off. Overall
+  cut-off scores should only be used if the  motif  classes  used  are
+  compatible.
+
+        There are currently several ways to display the matches:  1  =
+  each  motif and its position is listed; 2 = all the sequence between
+  the two outermost motifs is listed; 3  =  graphical,  with  a  spike
+  marking  the  position  of  the leftmost motif. The library versions
+  also give entry names, and a one line title; in addition they can be
+  used  to  produce  aligned  families of sequences. When this mode of
+  output is selected the program will write a separate file  for  each
+  match. The files will be called ENTRYNAME.DAT where ENTRYNAME is the
+  name of the entry in the library.  The  matching  sequence  will  be
+  written  out so that the spacing between motifs is constant, and set
+  to the maximum allowed by the pattern definition. Any gaps  will  be
+  filled   with   dashes   (-).   If  the  individual  sequences  were
+  subsequently written one above the other they should line up so that
+  all  motifs are in register. There two types of output of this sort:
+  one, option 4, writes out whole  sequences,  the  other,  option  5,
+  writes  out only the sequences between the two outermost motifs.  If
+  the individual sequences were subsequently  written  one  above  the
+  other  they should line up so that all motifs are in register. There
+  two types of output of this sort: one, option 4,  writes  out  whole
+  sequences,  the  other,  option  5,  writes  out  only the sequences
+  between the two outermost motifs.  Note that for option 4 users  are
+  asked  to  type  the position of the first motif, and the reason for
+  this is  explained  below.  Consider  a  pattern  found  in  several
+  sequences.  Consider only the first motif in the pattern and suppose
+  that it was found in different positions  in  these  sequences.  Say
+  that  of  these  positions  the  one  furthest from the left end was
+  position 100. Then, in order to ensure that all the sequences  would
+  align,  we must specify that motif 1 must start at position 100. Any
+  sequences in which motif 1 started  nearer  to  the  left  end  than
+  position  100  would  be  padded accordingly.  These modes of output
+  should only be used when the  position  of  each  motif  is  defined
+  relative to its immediate neighbour.
+
+        The pattern descriptions can be saved to  files.  These  files
+  can  be used instead of typing definitions again at the keyboard. As
+  the files are annotated, they can easily  be  changed  using  system
+  editors,  and  the  modified  versions  used  to  define the variant
+  patterns for the programs.
+
+
+        Use of lists of entry names
+
+        The two programs that operate on libraries have the ability to
+  restrict  their  searches to subsets of the libraries. This does not
+  require sublibraries to be created but instead is achieved by  using
+  files  containing  a  list of the entry names of sequences. The user
+  may  choose  to  search  only  those  entries  on   the   list   or,
+  alternatively  to  search  all  but  those  on the list (i.e. in the
+  latter case the list contains the names of those  to  be  excluded).
+  The  programs  can search libraries that have indexes and those that
+  do not.  If a list of names for inclusion is used, then  the  search
+  will  be  faster if the index is present. In all other circumstances
+  the whole library will be read. The list must be  in  library  order
+  except  when  it  is  used  to  include  entries,  and  an  index is
+  available.  The list must contain each  entry  name  on  a  separate
+  line,  with the name starting in column 1 of the line. ie there must
+  be no spaces at the start of the line.  The list of entry names  can
+  be  produced by the keyword searches of nip, pip, etc as long as the
+  listings produced have a space character separating the  entry  name
+  from the entry description. This will depend on how well the library
+  reformatting programs work. For example swissprot entry  names  tend
+  to  run  into the beginning of the descriptions, but other libraries
+  are generally OK.
+
+        One use of the programs  is  to  look  for  patterns  that  we
+  already  know  about, but in new sequences. However it is hoped that
+  they will also be useful for finding new motifs. For example several
+  known   control   regions  in  nucleic  acid  sequences  consist  of
+  particular direct or inverted repeats; the inclusion of  direct  and
+  inverted  repeats  as  motif  classes  makes  it  possible  to  find
+  previously unknown motifs of these types. Using these  new  programs
+  we can ask questions like: "are there any inverted or direct repeats
+  near to sections of sequence that contain both a  CCAAT  box  and  a
+  TATA  box?"; and to search for such things throughout the libraries.
+  In addition, the mode of output in which all  the  sequence  between
+  the  two outermost motifs found is printed out, allows us to extract
+  sequences and  examine  them  in  more  detail  for  further  common
+  subsequences.  For example we might want to collect together all the
+  sequences between putative CCAAT and TATA boxes.
+
+        A further use of  the  inverted  repeat  motif  class  is  the
+  following.  If  a  regulatory  sequence in DNA is poorly defined but
+  also an inverted repeat, then it might be an advantage to specify it
+  both  as a consensus sequence and a superimposed inverted repeat. In
+  this way two weak definitions can be combined to produce a  stronger
+  pattern.
+
+        Given only a few examples of a motif it should be possible  to
+  perform  initial  searches  using  a  class 3 motif, and then, using
+  plausible matching sequences, create a more specific  weight  matrix
+  for the same motif.
+
+        If motifs are combined with  the  first  motif  using  the  OR
+  operator  they  will  be ignored until all permutations that include
+  the first motif have been looked for. The whole search will then  be
+  repeated,  in  turn, for each of those motifs that are combined with
+  the first motif using the OR operator.  An  interesting  consequence
+  of  this is that the program can be used, without change, to compare
+  any newly determined sequence with all known individual  motifs.  We
+  achieve  this by having a pattern in which all known relevant motifs
+  are combined using the OR operator.  If we ask to use  this  pattern
+  with  a  sequence,  the  program  will  automatically  compare  each
+  individual motif in  the  pattern  with  the  whole  length  of  the
+  sequence.  As the number of known motifs grows this should become an
+  increasingly useful standard procedure.
+
+        The  NOT  operator  is  obviously  useful  for   making   sure
+  particular  motifs  are  not  present,  but  it  can also be used to
+  bracket the levels of matches found. We may want a degree  of  match
+  that  lies  between  two  limits - binding should occur, but not too
+  strongly; or base-pairs should  form,  but  not  too  many.  We  can
+  specify  this by asking for a match with a low score, in combination
+  with a match and a high score, both for the same motif, but with the
+  high score included using the NOT operator.
+
+        The algorithm is designed to find all sections of  a  sequence
+  that   satisfy   the  pattern  rather  than  only  the  best  match.
+  Particularly if some of the  motifs  in  a  pattern  are  less  well
+  defined  than  others, this can often result in the same region of a
+  sequence being reported as having several matches,  but  which  only
+  vary in the positions of the weakest motifs.
+
+        General remarks on motif searching
+
+        Generally motifs are short subsequences that are thought to be
+  associated  with particular functions in some known sequences. Often
+  we  search  for  them  to  try  to  understand  or  interpret  other
+  sequences.  Sometimes  we  search  for motifs and patterns to test a
+  hypothesis  about  their  role:  are  they  found  in  the  expected
+  positions  in the expected sequences. In doing so we should remember
+  that, in both proteins and nucleic acids, what we are really looking
+  for  is  a  particular  three  dimensional  structure  with  certain
+  affinities for other structures, and that we are assuming  that  the
+  sequence  of  the  motif alone defines the 3D structure we searching
+  for. The overall structure may be completely different to  those  in
+  which  the  motif  is  functional,  and  hence  the motif may have a
+  different shape or be  inaccessible.  We  should  be  aware  of  the
+  importance  of  the context in which a motif is found. Where does it
+  lie relative to the overall structure,  is  it  accessible,  is  the
+  three  dimensional  spacing between it and other motifs correct? For
+  example, is it on the same side of the double helix, and the correct
+  distance  from  some  other  motif?  How  does  context  affect  our
+  assessment of the significance of finding  a  motif?  Finding  false
+  mammalian  mRNA splice junctions in non-coding sequences is far less
+  important than  finding  false  sites  in  pre-mRNA  sequences,  but
+  finding  them  in  the  correct  places  is most important! In other
+  words, it is often the case that when we are searching for  a  motif
+  that  is  known  to  be necessary for some function, then a positive
+  result in the form of a match in  the  required  position,  is  more
+  important  than a high background of matches in the wrong positions.
+  Being able to write down the probability of finding  a  motif  in  a
+  random  sequence  tells  us how well it is defined. In nucleic acids
+  the DNA may contain many superimposed types of information  such  as
+  those  concerned  with  histone  phasing,  protein  coding  or  mRNA
+  secondary structure. These overlapping "codes"  may  interfere  with
+  one  another  causing  matches to motifs to be poorer than expected.
+  In general we will only have a limited number  of  examples  of  the
+  motif and we do not know how representative they are.
+
+        Sequences have superimposed functions: some parts  may  be  of
+  general structural importance and give rise to an overall framework,
+  and other parts give specificity and hence are not  common;  we  may
+  want  to  use a set of aligned sequences to define a motif, but want
+  to use only the framework positions.  Alternatively we may  want  to
+  pick  out only those parts of a set of aligned sequences that give a
+  particular property, and to ignore other similarities that  are  due
+  to  some  other  property and which could obscure the pattern we are
+  interested in.  It is possible to apply a mask to a set  of  aligned
+  sequences  in  order to give weight to selected positions only.  The
+  ability to define a mask allows certain positions to be used in  the
+  motif  and  others to be ignored, and yet still permits the use of a
+  set of aligned sequences to calculate weights. The mask is requested
+  and applied by the program and results in the masked positions being
+  zero in the weight matrix. The mask is defined in the following way.
+  Suppose  we  had a motif of length 15, then the mask x--x--xx-x will
+  give zero weights to positions 2,3,5,6 and 9 (note it is the  dashes
+  (-)  that  are significant and that positions 1,4,7,8,10,11,12,13,14
+  and 15 will be non-zero). Of course the same set of sequences  could
+  be used with several alternative masks in order to extract different
+  features and create corresponding weight matrices.
+
+        The programs are described in Staden,R. CABIOS 4, 53-60, 1988;
+  Staden,R.   CABIOS  5,  89-96,  1989,  anf  a forthcoming Methods in
+  Enzymology.
+ @ end of help
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/pip_menu b/help/pip_menu
new file mode 100644
index 0000000..5a527d0
--- /dev/null
+++ b/help/pip_menu
@@ -0,0 +1,80 @@
+-1 0 21 2 T General
+-1 0 21 2 X General
+-2 0 50 2 T Screen control
+-2 0 71 2 X Screen
+-3 0 117 2 T Statistical analysis of content
+-3 0 142 2 X Statistics
+-4 0 179 2 T Structures and repeats
+-4 0 204 2 X Structures
+-5 0 225 2 T Search
+-5 0 225 2 X Search
+0 -1 243 76 T PIP
+0 -1 243 76 X PIP
+1 0 3546 8 T Help
+1 0 3546 8 X Help
+2 0 3889 3 T Quit
+2 0 3889 3 X Quit
+3 1 3962 220 T Read a new sequence
+3 1 3962 220 X Read a new sequence
+4 1 13792 12 T Redefine active region
+4 1 13792 12 X Redefine active region
+5 1 14480 33 T List a sequence
+5 1 14480 33 X List a sequence
+6 1 15941 4 T List a text file
+6 1 15941 4 X List a text file
+7 1 16083 12 T Direct output to disk
+7 1 16083 12 X Direct output to disk
+8 1 16567 7 T Write active region to disk
+8 1 16567 7 X Write active region to disk
+9 1 16922 26 T Edit the sequence
+9 1 16922 26 X Edit the sequence
+10 2 18386 3 T Clear graphics
+10 2 18386 3 X Clear graphics
+11 2 18463 3 T Clear text
+11 2 18463 3 X Clear text
+12 2 18531 13 T Draw a ruler
+12 2 18531 13 X Draw a ruler
+13 2 19278 13 T Use cross hair
+13 2 19278 13 X Use cross hair
+14 2 19865 35 T Reset margins
+14 2 19865 35 X Reset margins
+15 2 22019 13 T Label a diagram
+15 2 22019 13 X Label a diagram
+16 2 22811 13 T Display a map
+16 2 22811 13 X Display a map
+17 5 23611 254 T Short sequence search
+17 1 23611 254 T Short sequence search
+17 5 23611 254 X Short sequence search
+17 1 23611 254 X Short sequence search
+18 5 34012 57 T Compare a sequence
+18 1 34012 57 T Compare a sequence
+18 5 34012 57 X Compare a sequence
+18 1 34012 57 X Compare a sequence
+19 5 35654 69 T Compare a sequence using a score matrix
+19 1 35654 69 T Compare a sequence using a score matrix
+19 5 35654 69 X Compare a sequence using a score matrix
+19 1 35654 69 X Compare a sequence using a score matrix
+20 5 37587 214 T Search for a motif using a weight matrix
+20 5 37587 214 X Search for a motif using a weight matrix
+21 3 46771 20 T Calculate amino acid composition
+21 3 46771 20 X Calculate amino acid composition
+22 4 47655 20 T Plot hydrophobicity
+22 3 47655 20 T Plot hydrophobicity
+22 4 47655 20 X Plot hydrophobicity
+22 3 47655 20 X Plot hydrophobicity
+23 4 48439 19 T Plot charge
+23 3 48439 19 T Plot charge
+23 4 48439 19 X Plot charge
+23 3 48439 19 X Plot charge
+24 4 48953 72 T Plot robson prediction
+24 4 48953 72 X Plot robson prediction
+26 4 51912 32 T Draw a helix wheel
+26 4 51912 32 X Draw a helix wheel
+25 4 53561 36 T Plot hydrophobic moment
+25 3 53561 36 T Plot hydrophobic moment
+25 4 53561 36 X Plot hydrophobic moment
+25 3 53561 36 X Plot hydrophobic moment
+27 1 55101 87 T Back translate to dna
+27 1 55101 87 X Back translate to dna
+28 5 59337 809 T Search for patterns of motifs
+28 5 59337 809 X Search for patterns of motifs
diff --git a/help/sap_help b/help/sap_help
new file mode 100644
index 0000000..6effec5
--- /dev/null
+++ b/help/sap_help
@@ -0,0 +1,1848 @@
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. TX  0 @Modification
+
+ @0.  TX -1 @SAP
+
+        This is an  interactive  program  whose  primary  use  is  for
+  managing  shotgun  sequencing  projects, but it can also be used for
+  handling alignments of other sequences, including those of proteins.
+  Currently   the   maximum  'gel  reading'  length  is  set  to  4096
+  characters. Almost all of the information below describes the use of
+  the  program  for shotgun projects, but those using the programs for
+  handling other sequence alignments should interpret it  accordingly.
+  The data for such a project is stored in a special type of database.
+  The program contains the tools that are  required  to  type  in  gel
+  readings,  screen  them  against  vector  sequences  and restriction
+  sites; enter new  gel  readings  into  the  database  (automatically
+  comparing  and  aligning  them). In addition it contains editors and
+  functions to examine the quality of the aligned sequences.
+
+        There  are  three  main  menus:  "general",   "graphics"   and
+  "modification", and some functions have submenus.
+    The general menu contains the following options:
+
+         0 = List of menus
+         ? = Help
+         ! = Quit
+         3 = Open a database
+         4 = Edit contig
+         5 = Display a contig
+         6 = List a text file
+         7 = Direct output to disk
+         8 = Calculate a consensus
+        17 = Screen against restriction enzymes
+        18 = Screen against vector
+        19 = Check consistency
+        25 = Show relationships
+        27 = set parameters
+        28 = Highlight disagreements
+        29 = Examine quality
+
+  The graphics menu contains:
+
+         0 = List of menus
+         ? = Help
+         ! = Quit
+        10 = Clear graphics
+        11 = Clear text
+        12 = Draw ruler
+        13 = Use cross hair
+        14 = Change margins
+        15 = Label diagram
+        16 = Plot map
+        33 = Plot single contig
+        34 = Plot all contigs
+
+
+  The modification menu contains:
+
+         0 = List of menus
+         ? = Help
+         ! = Quit
+         4 = Edit a contig
+         9 = Screen edit
+        20 = Auto assemble
+        21 = Enter new gel reading
+        22 = Join contigs
+        23 = Complement a contig
+        24 = Copy database
+        26 = Alter relationships
+        30 = Auto edit a contig
+        31 = Type in gel readings
+        32 = Extract gel readings
+
+    The enter new gel reading menu contains:
+
+         ? = Help
+         ! = Quit
+         3 = Complete entry
+         4 = Edit contig...
+         5 = Display overlap
+         6 = Edit new gel reading...
+
+     The join contig menu contains:
+
+         ? = Help
+         ! = Quit
+         3 = Complete join
+         4 = Edit left contig...
+         5 = Display joint
+         6 = Edit right contig...
+         7 = Move join
+
+     The alter relationships menu contains:
+
+         ? = Help
+         ! = Quit
+         3 = Line change
+         4 = Edit single gel reading...
+         5 = Delete contig
+         6 = Shift
+         7 = Move gel reading
+         8 = Rename gel reading
+         9 = Break contig
+
+     The edit menu contains:
+
+         ? = Help
+         ! = Quit
+         3 = Insert
+         4 = Delete
+         5 = Change
+
+
+
+        Overview of the methodology
+
+        The shotgun sequencing strategy
+
+        In  the  shotgun  sequencing  procedure  the  sequence  to  be
+  determined   is   randomly   broken  into  fragments  of  about  400
+  nucleotides in length. These fragments are cloned and then  selected
+  randomly  and  their  sequences    determined.     The  relationship
+  between  any  pair  of fragments is  not  known  beforehand  but  is
+  found  by  comparing  their   sequences.  If  the  sequence  of  one
+  found to be wholly or partially contained  within  that  of  another
+  for  sufficient  length  to  distinguish  an overlap  from  a repeat
+  then those two fragments can  be  joined.  The  process  of  select,
+  sequence  and  compare is continued until the whole of  the  DNA  to
+  be  sequenced is in one continuous well determined piece.
+
+        Definition of a contig
+
+        A CONTIG is a set of gel  readings   that   are   related   to
+  one  another   by   overlap  of  their  sequences.  All gel readings
+  belong to a contig and each contig  contains  at   least   one   gel
+  reading.   The  gel  readings in a contig can be summed to produce a
+  continuous consensus sequence and the length of this sequence is the
+  length  of the contig.  The rules used to perform this summation are
+  given  under  "the  consensus  algorithm".   At  any  stage  of    a
+  sequencing  project the data will comprise a number of contigs; when
+  a  project  is complete  there  should be only one  contig  and  its
+  consensus  will  be  the  finished  sequence.  Note that since being
+  introduced and defined as above the word "contig" has been taken  up
+  by  those involved in genomic mapping. In that context the consensus
+  with a  precise length is not defined.
+
+  Introduction to the computer method
+
+        It is useful  to  consider  the  objectives  of  a  sequencing
+  project  before  outlining  how  we use the computer to help achieve
+  them. The aim of a shotgun  sequencing  project  is  to  produce  an
+  accurate  consensus sequence from many overlapping gel readings.  It
+  is necessary to know, particularly  at  the  latter  stages  of  the
+  project,  how accurate the consensus sequence is. This enables us to
+  know which regions of the sequence require further work and also  to
+  know  when  the  project  is  finished.   To show the quality of the
+  consensus, the programs described here produce  displays  like  that
+  shown below.
+
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+
+        This is an example showing the left  end  of  a  contig   from
+  position   1  to  200.   Overlapping  this  region  are gel readings
+  numbered 6, 3, 5, 17 and 12; 6, 3 and 5 are in  reverse  orientation
+  to  their  original  reading  (denoted  by  a  minus sign). Each gel
+  reading also has a name (eg HINW.010). It can  be  seen  that  in  a
+  number  of  places the sequences contain characters other than A,C,G
+  and T. Some  of  these  extra  characters  have  been  used  by  the
+  sequencer   to  indicate  regions  of  uncertainty  in  the  initial
+  interpretation of the gel reading, but the asterisks (*)  have  been
+  inserted  by  the  automatic assembly function in order to align the
+  sequences.  Underneath  each  50  character  block  of  gel  reading
+  sequences  is the consensus derived from the sequences aligned above
+  (the line labelled CONSENSUS). For most of its length the  consensus
+  has a definite nucleotide assignment but in a few positions there is
+  insufficient agreement between the gel readings and so  a  dash  (-)
+  appears  in  the  sequence.  This  display contains all the evidence
+  needed to assess the quality of the consensus: the number  of  times
+  the  sequence has been determined on each strand of the DNA, and the
+  individual nucleotide assignments given for each gel reading.
+
+        So the aim is to produce the consensus sequence  and,  equally
+  important,  a  display of the experimental results from which it was
+  derived.
+
+        In order to achieve this the following operations need  to  be
+  performed:
+  1) Interpret autoradiographs and put individual  gel  readings  into
+  the computer.
+  2) Check each gel reading to make sure it is not simply part of  one
+  of the vectors used to clone the sequence.
+  3) Check each gel reading to make sure  that  those  fragments  that
+  span  the  ligation point used prior to sonication are not assembled
+  as single sequences.
+  4) Compare all the  remaining  gel  readings  with  one  another  to
+  assemble them to produce the consensus sequence.
+  5) Check the quality of the consensus and edit the sequences.
+  6) When all the consensus is sufficiently well determined, produce a
+  copy of it for processing by other analysis programs.
+
+        It is very unlikely that this procedure will  only  be  passed
+  through  once.   Usually steps 1 to 5 are cycled through repeatedly,
+  with step 4 just adding new sequences to  those  already  assembled.
+  Generally step 6 is also used in order to analyse imperfect sequence
+  to check if it is the one the project intended to  sequence,  or  to
+  look  for  interesting  features. Analysis of the consensus, such as
+  searches for protein coding regions, can also help to find errors in
+  the  sequence.  The  display  of  the overlapping gel readings shown
+  above can be used  to  indicate,  not  only  the  poorly  determined
+  regions,  but  also  which  clones  should be resequenced to resolve
+  ambiguities, or those which can usefully be extended or sequenced in
+  the reverse direction, to cover difficult regions.
+
+        The original individual gel readings for a sequencing  project
+  are  each  stored in separate files. As the gel readings are entered
+  into the computer (usually in batches, say 10 from a film), the file
+  names  they are given are stored in a further file, called a file of
+  file names. Files of file names enable gel readings to be  processed
+  in batches.
+
+        For each sequencing project we start a project database.  This
+  database  has  a  structure  specifically  designed for dealing with
+  shotgun sequence data. In order to arrive  at  the  final  consensus
+  sequence  many  operations  will  be performed on the sequence data.
+  Individual fragments must be sequenced and compared in  both  senses
+  (i.e.  both  orientations)  with  all  the  other sequences. When an
+  overlap between a new gel reading and a contig are found  they  must
+  be aligned and the new gel reading added to the contig. If a new gel
+  reading overlaps two contigs they must be aligned and joined. Before
+  the  two contigs are joined one of them may need to be turned around
+  (reversed and  complemented)  so  they  are  both  in  in  the  same
+  orientation.
+
+        Clearly, keeping track of all  these  manipulations  is  quite
+  complicated,  and  to  be  able  to  perform  the operations quickly
+  requires careful choice of data structure and algorithms. For  these
+  reasons  it  is not practicable to store the gel readings aligned as
+  shown in the display above. Rather, it is more convenient  to  store
+  the  sequences unassembled, and to record sufficient information for
+  programs to assemble  them  during  processing.  The  data  used  to
+  assemble the sequences is called relational information.
+
+        The database comprises three  files  and  they  are  described
+  under the section entitled "open database".
+
+        Before entry into the project database each  new  gel  reading
+  must  be  compared  to  look  for overlaps with all the data already
+  contained within the database. This last  point  is  important:  all
+  searching  for  overlaps  is between individual new gel readings and
+  the data already in the database. There is no searching for overlaps
+  between sequences within the database; overlaps must be found before
+  new gel readings are entered into the database.
+
+        Below I  give  an  introduction  to  how  the  sequencess  are
+  processed by being passed from one function to the next.
+
+        This program is used to start a database for the  project  and
+  then the following procedure is used.
+
+        Data in the form of individual gel readings are  entered  into
+  the  computer and stored in separate files using either program this
+  program or the digitizer program. Batches of these gel readings  are
+  passed  to  the  screening  functions  in this program to search for
+  overlaps with vector sequences  ("screen  against  vector")  or  for
+  matches  to  restriction  enzyme  sites   that should not be present
+  ("screen against enzymes"). Each run of  these  screening  functions
+  passes  on  only  those  gel  readings  that do not contain unwanted
+  sequences.  Sequences  are  passed  via  files  of  file  names  and
+  eventually  are  processed by the automatic assembly function ("auto
+  assemble"). This function compares each gel reading with a consensus
+  of  all  the  previous  gel  readings stored in the database.  If it
+  finds any overlaps it aligns the overlapping sequences by  inserting
+  padding  characters,  and  then  adds  the  new  gel  reading to the
+  database. Gels that overlap are added to existing contigs  and  gels
+  that do not overlap any data in the database start new contigs. If a
+  new gel overlaps two contigs they are joined. Any gel readings  that
+  appear  to overlap but which cannot be aligned sufficiently well are
+  not entered and have their names written to a  file  of  failed  gel
+  reading names.
+
+        Generally data is entered into the database in batches as just
+  described.  The  program  is  also  used  to examine the data in the
+  database, to enter gel readings that the automatic assembly function
+  cannot  align  ("enter  new  gel reading"), and to make final edits.
+  Edits to whole contigs can  be made in several  ways.  An  automatic
+  editor  ("auto edit") will perform almost all edits without any user
+  intervention, but the program also gives access to the system editor
+  (EDT  on the VAX), through the function "screen edit", and to simple
+  command driven editors ("edit contig" and "edit new  gel  reading").
+  Disagreements  between  gel  readings in contigs and their consensus
+  sequences can be highlighted  by  use  of  the  function  "highlight
+  disagreements".
+
+        Editing the  sequences  is  obviously  an  essential  part  of
+  managing   a  sequencing  project.  Editing  is  required  when  new
+  sequences are added, when contigs are joined, and when sequences are
+  corrected.   A  basic part of the strategy used here is that new gel
+  readings should be correctly aligned throughout their  whole  length
+  when  they  are entered into the database, and that when contigs are
+  joined they are edited so that they are well aligned in  the  region
+  of  overlap.  Alignment can be achieved by adding padding characters
+  to the sequences, and this is the way "auto assemble" operates  when
+  adding new sequences to the database.
+
+        In order to search for overlaps that may have been missed  due
+  to  errors  in the gel readings, the function "extract gel readings"
+  can be used to take copies of  the  gel  readings  at  the  ends  of
+  contigs,  and  write  them out as separate files.  These can then be
+  compared with the  database  consensus  using  the  "auto  assemble"
+  function in a mode that forbids entry of data into the database, and
+  any gel reading matching two contigs will indicate a join  that  has
+  been  missed.  The  joins can then be made interactively using "join
+  contigs". Missed matches can be found  at  this  stage  because  the
+  errors in the sequences may have been corrected by new data.
+
+        Generally the users need not concern themselves with  how  the
+  relational  information  is used by the program, but it is necessary
+  to know how contigs are identified. Because contigs  are  constantly
+  being  changed  and  reordered  the  program  identifies them by the
+  numbers of the gel readings they contain.  Whenever  users  need  to
+  identify  a  contig they need only know the number or name of one of
+  the gel readings it contains. Whenever the  program  asks  users  to
+  identify  a  contig  or  gel reading they can type its number or its
+  archive name. If they type its archive name they  must  precede  the
+  name by a slash "/" symbol to denote that it is a name rather than a
+  number. E.g if the  archive name is fred.gel with number  99,  users
+  should  type  /fred.gel  or  99  when  asked to identify the contig.
+  Generally, when it asks for the gel reading to  be  identified,  the
+  program  will  offer  the user a default name, and if the user types
+  only return, that contig will be accessed. When a database is opened
+  the  default  contig  will  be  the  longest  one, but if another is
+  accessed, it will subsequently  become the current default.
+
+        Further information is located in the  following  places.  The
+  database  files  are described under "open database". The format for
+  vector  and  consensus  sequences  is  given  under   "calculate   a
+  consensus", as are the uncertainty codes used in gel readings.
+
+        The only program, other than this, relevant to  sequencing  is
+  the digitizer program  and it is outlined briefly below.
+
+        The digitiser program is used for the  initial  input  of  gel
+  readings  and  for  writing a file of file names. The program uses a
+  digitizer for data  entry.   A  digitizer  is  a   two   dimensional
+  surface  such  as a light box which is such that if a special pen is
+  pressed onto it, the pens coordinates are recorded  by  a  computer.
+  These coordinates can be interpreted by a program.
+
+        In order to read an autoradiograph placed on the light box the
+  user  need  only  define the bottom of the four sequencing lanes and
+  the bases to which they correspond and then use  the  pen  to  point
+  to   each  successive   band  progressing  up  the gel.  The program
+  examines the coordinates of each pen position to see in which of the
+  four  lanes  it   lies  and  assigns  the  corresponding  base to be
+  stored in the computer.  Each time the pen tip is depressed to point
+  to  a  position on  the  surface of the digitizer the program sounds
+  the bell on the terminal to indicate to the user that  a  point  has
+  been recorded.  As the  sequence  is read the program displays it on
+  the screen.
+ @17. TX 1 @Screen against restriction enzymes
+
+        Used to compare gel readings against  any  restriction  enzyme
+  recognition  sequences  that  may have been used  during cloning and
+  which should not be  present  in  the  data.  Works  on  single  gel
+  readings  or processes batches accessed through files of file names.
+  The algorithm looks  for  exact  matches  to  recognition  sequences
+  stored in a file.
+
+        The  file  containing  the  recognition  sequences   must   be
+  identified.  The  user  must choose between employing a file of file
+  names, or typing in the names of individual gel reading files. If  a
+  file  of  file names is used the program will also create a new file
+  of file names. When the option has finished operating this new  file
+  will  contain the names of all those gel readings that did not match
+  any of the recognition sequences. Hence it can be used  for  further
+  processing  of the batch. The recognition sequences should be stored
+  in a simple text file with one recognition sequence per line.
+ @18. TX 1 @Screen against vector
+
+        Used to compare gel readings against any vector sequences that
+  may have been picked up during cloning. Works on single gel readings
+  or processes batches accessed  through  files  of  file  names.  The
+  algorithm  looks  for exact matches of length "minimum match length"
+  and displays the overlapping sequences.
+
+        The file containing the vector sequence  must  be  identified.
+  The  user  must  choose  between  employing a file of file names, or
+  typing in the names of individual gel reading files. If  a  file  of
+  file  names  is used the program will also create a new file of file
+  names. When the option has finished operating  this  new  file  will
+  contain  the  names of all those gel readings that did not match the
+  vector sequence. Hence it can be used for further processing of  the
+  batch.The  vector  sequence  should  be stored in a simple text file
+  with up to 80 characters of data per line. More than one vector  can
+  be  stored  in  a single file. If so each should be preceded by a 20
+  character title of the form <---m13mp8.001-----> where the <  and  >
+  signs  and  the  number like .001 are obligatory. The number must be
+  preceded by a dot (.) and be 3 digits long. The  total  sequence  in
+  the file must be < 50,001 characters in length.
+ @20. TX 2 @Auto assemble
+
+        Compares gel readings against  the  current  contents  of  the
+  database  and  produces  alignments. In its normal mode of operation
+  ("entry permitted"), the function will automatically enter  the  gel
+  readings  into  the  database, but if entry is not permitted it will
+  only  produce  alignments.  It  works  on  single  gel  readings  or
+  processes  batches  of  gel  readings accessed through files of file
+  names. It is the usual way to enter data into the database.
+
+        The function will check the database for  logical  consistency
+  and  will only procede if it is OK. Choose if gel readings should be
+  entered into the database, or  if  they  should  only  be  compared.
+  Choose  between  using  a file of file names or typing file names on
+  the keyboard. If so selected, supply the file of  file  names.  Also
+  supply  a  file  of  file  names to contain the names of all the gel
+  readings that fail to get entered. Select  the  entry  mode.  Normal
+  assembly  is  appropriate  for  all but special cases, as is "permit
+  joins". Uses for the other modes are not documented here.  Define  a
+  minimum  initial match length. Define a minimum alignment block (the
+  default value is taken in all but exceptional circumstances). Define
+  the maximum number of paddding characters allowed to be used in each
+  gel reading to help achieve alignment, and the same for  the  number
+  allowed  in  the  contig  for  each  gel reading. Finally define the
+  maximum percentage mismatch to be allowed for any gel reading to  be
+  entered  into  the database. If for any gel reading, either of these
+  last three values is exceeded the gel reading will  not  be  entered
+  into the database.
+
+        In operation the  function  takes  a  batch  of  gel  readings
+  (probably   passed  on   as   a  file  of file names from one of the
+  screening routines) and enters them into a database for a sequencing
+  project.  It takes each  gel reading in  turn, compares  it with the
+  current consensus for the database, it then produces  an   alignment
+  for   any   regions   of   the   consensus   it overlaps;   if  this
+  alignment is sufficiently good  it  then  edits  both  the  new  gel
+  reading  and  the  sequences  it  overlaps   and   adds the new  gel
+  reading to the database.  The program  then  updates  the  consensus
+  accordingly and carries on to the next  gel  reading.
+
+        All alignments are displayed and  any  gel  readings  that  do
+  match but  that cannot be aligned sufficiently well have their names
+  written to a file of failed gel reading names.  The  function  works
+  without   any  user  intervention  and can process any number of gel
+  readings in a single run.  Those  gel  readings  that  fail  can  be
+  recompared  using  the  same  function  (to find the current overlap
+  position) and  the user  can enter them into the  database  manually
+  using  the   "enter new gel reading" option.
+
+        Typical dialogue and output from the function is shown  below.
+  (Note  that  output  for gel readings 2 - 9 has been deleted to save
+  space).
+  Automatic sequence assembler
+  Database is logically consistent
+  ? (y/n) (y) Permit entry
+  ? (y/n) (y) Use file of file names
+  ? File of gel reading names=demo.nam
+  ? File for names of failures=demo.fail
+  Select entry mode
+  X  1 Perform normal shotgun assembly
+     2 Put all sequences in one contig
+     3 Put all sequences in new contigs
+  ? Selection  (1-3) (1) =
+  ? (y/n) (y) Permit joins
+  ? Minimum initial match (12-4097) (15) =
+  ? Minimum alignment block (2-5) (3) =
+  ? Maximum pads per gel (0-25) (8) =
+  ? Maximum pads per gel in contig (0-25) (8) =
+  ? Maximum percent mismatch after alignment (0.00-15.00) (8.00) =
+    >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>
+    Processing           1 in batch
+    Gel reading name=HINW.004
+    Gel reading length=   283
+    Searching for overlaps
+    Strand     1
+    Strand     2
+    No matches found
+    Total matches found           1
+    Padding in contig=    0 and in gel=    1
+    Percentage mismatch after alignment =  1.8
+    Best alignment found
+           1         11         21         31         41         51
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           ********** ********** ********** ********** ********** **********
+           TTTTCCAGCG TGCGTCTGAC GCTGTCTTGC TTAATGATCT CCATCGTGTG CCTAGGTCTG
+           1         11         21         31         41         51
+          61         71         81         91        101        111
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACA-GTTG
+           ********** ********** ********** ********** ********** ***** ****
+           TTGCGTTGGG CCGAGCCCAA CTTTCCCAAA AACGTATGGA TCTTACTGAC GTACACGTTG
+          61         71         81         91        101        111
+         121        131        141        151        161        171
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+           ********** ********** ********** ********** ********** **********
+           CTTACCAGCG TGGCTGTCAC GGCGTCAGGC TTCCACTTTA GTCATCGTTC AGTCATTTAT
+         121        131        141        151        161        171
+         181        191        201        211        221        231
+           GCCATGGTGG CCACAGTGAC G-TATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+           ********** ********** * ******** ********** ********** **********
+           GCCATGGTGG CCACAGTGAC GCTATTTTGT TTCCTCACGC TCGCTACGTA TCTGTTTGCC
+         181        191        201        211        221        231
+         241        251        261        271        281
+           CGCG--GTGG AATTACAGCG TTCCCTATTG ACGGGCGCAT CCAC
+           ****  **** ********** ** * ***** ********** ****
+           CGCGACGTGG AATTACAGCG TT,CDTATTG ACGGGCGCAT CCAC
+         241        251        261        271        281
+            Batch finished
+            9 sequences processed
+            0 sequences entered into database
+            0 joins made
+
+
+        Note that "auto assemble" cannot align protein sequences.
+ @28. TX 1 @Highlight disagreements
+
+        Used  in  the  latter  stages  of  a  project   to   highlight
+  disagreements  between  individual  gel readings and their consensus
+  sequences. Characters that agree with the consensus are shown  as  :
+  symbols  for  the plus strand and . for the minus strand. Characters
+  that disagree with the consensus are left unchanged and so stand out
+  clearly. The results of this analysis are written to a file.
+
+        Before selecting this option create a file of the  display  of
+  the  contig to be "highlighted". The option will ask for the name of
+  this file. Select symbols to denote "agreeing"  characters  on  each
+  strand, the defaults are : and ., but any others can be used. Supply
+  the name of a file in which to put the output.
+
+        The display file needed as input for this option is created by
+  selecting  "Redirect  output",   followed  immediately  by  "display
+  contig", and then "Redirect output" again. The cutoff score used  in
+  the  consensus  calculation  can  be  set  by  option  "set  display
+  parameters". Note that for the highlight function there is  a  limit
+  of  50  for  the  number  of  gel  readings  that are aligned at any
+  position - ie the contig must be less than 51 gel readings  deep  at
+  its  thickest point. I hope that those performing shotgun sequencing
+  never reach this limit, but those using the  program  for  comparing
+  sequence families might.
+
+        Typical output from this function is shown below.
+
+                            210       220       230       240       250
+      1  HINW.004    :C::::::::::::::::::::::::::::::::::::::::::AC::::
+      7  HINW.018    :*::::::::::::::::::::::::::::::::::::::::::CA::::
+     -4  HINW.017                                 ...............AC....
+                     G-TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCCCGCG--GTGG
+
+                            260       270       280       290       300
+      1  HINW.004    ::::::::::::*:D:::::::::::::::::::
+      7  HINW.018    ::::::::::::::::::::CA:::::T:*:::*::::::::::::CA:
+     -4  HINW.017    ..............................................A...
+      3  HINW.009    :::::::::::::::V::::::::::::::::::::::::::::*AV:::
+     -6  HINW.028                            ......................A...
+                     AATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGATTCTCTT-CTG
+
+ @32. TX 3 @Extract gel readings
+
+        Used to make copies of the aligned gel readings in a database,
+  to write them into separate files, and to write a corresponding file
+  of file names. It operates in two modes: either all gel readings are
+  extracted, or only those at the ends of contigs.
+
+        Choose which mode of operation is required and supply  a  file
+  of file names.
+
+        The gel readings are given their original names.  If  used  to
+  extract  the  gel  readings from the ends of contigs the function is
+  useful for checking for missed contig joins: the file of file  names
+  can  be  used with the auto assemble function to recompare these gel
+  readings, and each should only overlap one contig. Any that  overlap
+  two contigs will identify possible joins.
+
+        If the option is used to extract all the gel readings  from  a
+  database,  a  subsequent  run  of "auto assemble" can reconstitute a
+  database which has  been  corrupted.  This   rarely  occurs  and  is
+  usually  necessesitated  by  a user  employing "alter relationships"
+  incorrectly without first having made a copy.
+ @1. TX 0 @Help
+
+        Help is available on the following topics :
+ @2. TX 0 Quit
+
+        This command stops the program and is the  only  safe  way  to
+  terminate  a run of the program that has altered the contents of the
+  database in any way.
+ @3. TX 1 @Open a database
+
+        Opens existing databases or allows new ones to be started. The
+  function  is automatically called into operation when the program is
+  started but can also be selected from the general menu.
+
+        Choose to open an existing database or start a new one, or  if
+  !  is  typed  when  the  program is first started, enter the program
+  without opening a database. Supply a project database name,  and  if
+  it  already exists, the "version". If starting a new database define
+  the database size and if it is for DNA or  protein  sequences.   The
+  database  size  is  an  initial  size  for  the  database. It can be
+  increased later during the project. It is the sum of the  number  of
+  gel readings plus the number of contigs.
+
+        Database names can have from one to 12 letters  and  must  not
+  include  full  stop  (.).  The  database is made from three separate
+  files. If the database is called FRED then  version  0  of  database
+  FRED comprises files FRED.AR0, FRED.RL0 and FRED.SQ0. The version is
+  the last symbol in the file names.  Only this program can read these
+  files. If the "copy database" option is used it will ask the user to
+  define a new "version".
+
+        For normal use the maximum gel reading length is  set  to  512
+  characters,  but  when  a  database  is  started the user may choose
+  lengths of either 512, 1024, 1536..., 4096. Normally the program  is
+  used  to handle DNA sequences but many of the functions also work on
+  protein sequences. The choice of sequence  type  is  made  when  the
+  database is started.
+
+        The contigs are not stored on the disk as the user  sees  them
+  displayed  on the screen. Each gel reading is stored with sufficient
+  information about how it overlaps other gel  readings  so  that  the
+  program  can  work out how to present them aligned on the screen. We
+  refer to this extra data as "the relationships" and it is  explained
+  below.  The database comprises 3 separate files.
+  1.  a working version of each gel reading.  This is the  version  of
+  the   gel   reading  that  is in the database and initially it is an
+  exact copy of the original sequence (known as the archive) but it is
+  edited and manipulated to align  it with other gel readings.
+  2.  the file of  relationships.   This  file  contains  all  of  the
+  information  that  is required to assemble the working versions into
+  contigs during processing;  any manipulations on the data  use  this
+  file   and  it  is  automatically  updated  at  any  time  that  the
+  relationships are changed.  The  information  in  this  file  is  as
+  follows:
+  (A) Facts about  each   gel  reading   and   its   relationship   to
+  others ("gel descriptor lines"):
+  (a) the number of the gel reading   (each gel reading   is  given  a
+  number  as  it  is entered into the database)
+  (b) the length of the sequence from this gel reading
+  (c) the position of the left end of this gel reading    relative  to
+  the left end of the contig of which it is a member
+  (d) the number of the next gel reading   to the  left  of  this  gel
+  reading
+  (e) the number of the next gel reading   to the right
+  (f) the relative strandedness of this gel reading  , ie whether   it
+  is  in the same sense or the complementary sense as its archive.
+  (B) Facts about each contig ("contig descriptor lines"):
+  (a) the length of this contig
+  (b) the number of the leftmost gel reading   of this contig
+  (c) the number of the rightmost gel reading   of this contig.
+  (C) General facts:
+  (a) the number of gel readings in the database
+  (b) the number of contigs in the database.
+  3.  the file of archive names.  This is simply a list of  the  names
+  of each of the archive files in the database but on line number 1000
+  we also store the size of the database. ie the number  of  lines  of
+  information allowed in the database files. This file always has 1000
+  lines but the length of the file of relationships and  the  file  of
+  working  versions can be set by the user when creating a database or
+  when copying from one to another.
+
+        Structure of the database files
+
+        1.  The file of relationships
+
+        The file contains IDBSIZ lines of data:  the general data  are
+  stored  on line IDBSIZ;   data about  gel readings  are stored  from
+  line 1 downwards;  data about contigs are stored from line  IDBSIZ-1
+  upwards.  A  database  of 500 lines containing 25 gel readings and 4
+  contigs would have a file of relationships as is shown below.
+
+
+                    ---------------------------------------------
+                       1  Gel descriptor record
+                       2   "      "       "
+                       3   "      "       "
+                       4   "      "       "
+                       5   "      "       "
+                       '   '      '       '
+                       '   '      '       '
+                      25   "      "       "
+                      26  Empty record
+                       '    '     '
+
+                       '    '     '
+                     495    '     '
+                     496  Contig descriptor record
+                     497    "        "        "
+                     498    "        "        "
+                     499    "        "        "
+                     500   Number of gel readings=25, Number of contigs=4
+                    ---------------------------------------------
+
+            The arrangement of the data in the file of relationships
+
+  As each new gel reading   is added into the database a new  line  is
+  added to  the  end  of  the  list  of gel descriptor lines.  If this
+  new gel  reading  does not overlap with any gel readings already  in
+  the  database  a new contig  line  is added  to  the top of the list
+  of contig lines.  If it overlaps with one contig then no new  contig
+  line  need  be  added  but  if it  overlaps with  two  contigs  then
+  these  two  contigs must be joined and the number  of  contig  lines
+  will  be reduced by one. Then the list of contig lines is compressed
+  to  leave  the empty line at the top of the list.  Initially the two
+  types  of  line will move towards  one  another  but eventually,  as
+  contigs  are joined, the contig descriptor lines will  move  in  the
+  same  direction  as the  gel descriptor lines.   At  the  end  of  a
+  project  there should  be only one contig  line.   The  database  is
+  thus capable of handling a project of 998 gels.
+
+        Structure of the working versions file
+
+        The working versions of gel readings are stored  in   a   file
+  of  IDBSIZ lines each containing 512 characters.  Gel reading number
+  1 is stored on line 1, gel reading number  2 on line 2 and so on.
+
+        Structure of the archive names file
+
+        This file, unlike the others, always has 1000  lines  each  10
+  characters  in length. Its length is fixed because line 1000 is used
+  to store IDBSIZ the database size and the programs need  a  definite
+  location from which to read this number.
+
+        Safeguarding the database
+
+        It is advisable to copy regularly (using the copy function  of
+  DS) from say copy 0 to copy 1 in case of errors.
+
+        I also recommend setting the protection codes  on  copy  0  of
+  each  database  so  that users cannot delete the files without first
+  resetting  the  protection  codes.  This  will  protect   you   from
+  accidently  deleting  the  files.  Users  at LMB can use the PROTECT
+  command for this purpose.
+
+        The give-up options allow users to change  their  minds  about
+  entering   a   new   gel   reading  or  joining  two contigs without
+  affecting the file  of  relationships.   BUT  if  the   edit  contig
+  option   from   either   of   these  two functions has been used the
+  edits will remain even though the user has "given up".  To leave the
+  files  completely unaffected  the  user  could,  if  required,  undo
+  any edits before "giving up".
+
+        There  are  various  checks  within  the  programs  to protect
+  users from themselves:-
+  1.  All user input is checked for  errors  -  e.g.    reference   to
+  non-existent  gel readings or  contigs,  incorrect  positions in the
+  contig or gel readings.
+  2.  Before entering a gel reading the system checks to see if a file
+  of the same name has already been entered.
+  3.  Join will not allow the circularising of a contig.
+  4.        Both enter and join  functions  restrict  the  region that
+  the   user  is  allowed to edit (using edit contig) to the region of
+  overlap.
+  5. Users may escape from any point in the program.
+  6. Help is available from all points in the program.
+
+
+  IT IS ESSENTIAL THAT USERS DO NOT KILL THE PROGRAM WHILE IT IS DOING
+  ANYTHING  THAT  INVOLVES  CHANGING THE CONTENTS OF THE DATABASE. I.E
+  DURING AUTO ASSEMBLE,  COMPLETE  ENTRY,  COMPLETE  JOIN,  COMPLEMENT
+  CONTIG,  EDIT  CONTIG,  AND  SCREEN  EDIT.   This  could corrupt the
+  database so badly that it is impossible to fix. The  program  should
+  always be left using the QUIT option.
+ @4. TX 3 @Edit
+
+        A simple commnd driven editor  that  can  insert,  delete  and
+  change  gel  reading  sequences.  Insert, delete and change commands
+  will request the position at which the  edit  is  required  and  the
+  number  of  characters  to  insert,  delete  or  change. The default
+  character for insertions is *.
+
+        There are three  modes  of  editing  offered  by  this  editor
+  depending where it is selected from.  New gel readings can be edited
+  as they are being entered into the database, contigs can  be  edited
+  with  alignments  being automatically maintained, or gel readings in
+  contigs can be edited without the maintenance of alignments.
+  The following commands can be used.
+
+     ? = Help
+     ! = Quit
+     3 = Insert
+     4 = Delete
+     5 = Change
+
+
+        All commands request the position at which the edit should  be
+  made.   (Note that the position refers to the position in the contig
+  for gel readings in the database, but to the  position  in  the  gel
+  reading  if you are editing a new gel reading while entering it into
+  the database.)
+
+        All commands request the number of characters to  operate  on.
+  (Note  that if you are editing a contig the program will ask for the
+  characters to insert into each separate gel reading, hence  allowing
+  different  changes to be made to each. Also the default character is
+  asterisk (*) - i.e if you include a space in the string it  will  be
+  replaced  by  an  asterisk,  or  if you simply type return the whole
+  string inserted will be asterisks.)
+  "Change" allows  characters  in  individual  gel  readings   to   be
+  replaced.   If  the  user  is  not  editing a new gel reading during
+  "enter new gel reading" the program will request the  numer  of  the
+  gel  reading  to  edit.   (When  editing gel readings in contigs the
+  program responds with the relative  position  and   length   of  the
+  selected   gel  reading  in  case  the  the user only knows the edit
+  position relative to the  gel reading. (The   edit   position   must
+  be relative to the contig.))
+  Further notes on editing
+
+        When you are  editing  a  contig  the  program  maintains  the
+  alignments  of  the gel readings by always making the same number of
+  insertions or deletions  in all the gels.  Note that these edits are
+  immediately  carried  out  and  the "Quit" options of "enter new gel
+  reading" and "join contigs" do not undo them.  Users must undo  them
+  themselves.  Note  that  if this option has been entered from either
+  "enter new gel reading" or "join contigs" the program will  restrict
+  edits   to the  region  of  overlap.  DO NOT KILL THE PROGRAM DURING
+  EDIT CONTIG!
+
+        When editing a single gel reading  in  a  contig  from  "alter
+  relationships"  (which  you  should  not  normally  need  to do) the
+  program will correct the length of the individual gel  reading,  but
+  it will not update the length of the contig if it has changed.
+
+        The program contains better methods than this  simple  command
+  driven  editor, for making multiple edits to contigs. "Screen edit",
+  gives access to the system editor on your machine, and  "auto  edit"
+  will edit a whole contig automatically.
+ @9. TX 3 @Screen edit
+
+        Gives access to the system editor on the machine (for  example
+  EDT  on  a  VAX)  and  allows users to edit contigs. The contigs are
+  presented as for "display contig" and the program will  reconstitute
+  the contig's sequences and relationships  when the editor is exited.
+
+        To screen edit a contig set the line length to 50  characters,
+  select  the  contig to edit, and supply the name of a temporary file
+  in which the editing will be performed.  After  a  short  pause  the
+  system editor will present the first page of the file. Edit the file
+  obeying the rules given below. Exit from the editor and  affirm  the
+  intention  of returning the contig to the database. The program will
+  put the contig back into the database.
+
+        Rules for screen editing
+
+        There are some limitations on the changes that can be made  to
+  the contigs when using the screen editor. Users are unlikely to want
+  to break the rules in order  to  achieve  changes  to  contigs,  but
+  nevertheless  the  constraints need to be defined and they are given
+  below.
+
+        Alignments must be maintained during editing.  Whole lines  of
+  sequence  should not be deleted or added unless the order of the gel
+  readings in the contig  is  preserved.   Each  line  in  the  contig
+  display  consists  of  gel  reading  numbers,  their  names  and  50
+  character sections  of  sequence.  Insertions  are  limited  in  the
+  following  way.  No line of sequence can be extended rightwards more
+  than 10 characters beyond the end of a  full  length  line  (a  full
+  length  line is 50 characters long). Only one character can be added
+  to the left end of full  length  lines,  but  sections  of  sequence
+  beginning  further  into  a  line can be extended leftwards up to an
+  equivalent position. Do not delete any  non-sequence  lines  in  the
+  file.
+
+        Before returning the contig to the database the program checks
+  that  the rules have been obeyed. If an error is found the number of
+  the erroneous line in the file is displayed and the contig will  not
+  be changed.
+ @5. TX 1 @Display a contig
+
+        Used to show the aligned  gel  readings  for  any  part  of  a
+  contig.  The  number,  name  and strandedness of each gel reading is
+  shown and the consensus is written below.
+
+        If required identify the contig,  and then the start  and  end
+  points of the region to display.
+
+        The display can be directed  to  a  disk  file  using  "direct
+  output to disk".  These files are required by options: "screen edit"
+  and "highlight disagreements", and printed copies of them  are  very
+  useful for marking corrections prior to using the editors.
+
+        Below is an example showing the left  end  of  a  contig  from
+  position   1  to  200.  Overlapping this region are gels 6,3,5,17and
+  12; 6, 3 and 5 are in reverse orientation to their archives (denoted
+  by  a  minus   sign)  There  are  a  few uncertainty codes and a few
+  padding characters  in  the  working  versions,  but  the  consensus
+  (shown  below  each page width) has a definite assignment for almost
+  every position.
+
+                             10        20        30        40        50
+     -6  HINW.010    GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+         CONSENSUS   GCGACGGTCTCGGCACAAAGCCGCTGCGGCGCACCTACCCTTCTCTTATA
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACA-GTC
+
+                            110       120       130       140       150
+     -6  HINW.010    GATTAGGAGACGAACTGGGGCG3CGCC*GCTGCTGTGGCAGCGACCGTCG
+     -3  HINW.007    GATTAG4AGACGAACTGGGGCGACGCCCG*TGCTGTGGCAGCGACCGTCG
+     -5  HINW.009                                        GGCAGCGACCGTCG
+     17  HINW.999                                           AGCGACCGTCG
+         CONSENSUS   GATTAGGAGACGAACTGGGGCGACGCC-G-TGCTGTGGCAGCGACCGTCG
+
+                            160       170       180       190       200
+     -6  HINW.010    TCT*GAGCAGTGTGGGCGCTG*CCGGGCTCGGAGGGCATGAAGTAGAGC*
+     -3  HINW.007    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     -5  HINW.009    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGGCATGAAGTAGAGC*
+     17  HINW.999    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+     12  HINW.017                                              GTAGAGC*
+         CONSENSUS   TCT*GAGCAGTGTGGGCGCTG-*CGGGCTCGGAGGGCATGAAGTAGAGC*
+ @6. TX 1 @List a text file
+
+        This option allows users to list text files on the screen.  It
+  can  be  used  to  read  a file containing notes, for checking files
+  written to disk etc. The user is asked to type the name of the  file
+  to list.
+ @8. TX 1 @Calculate a consensus
+
+        Calculates  a  consensus  sequence   either  for   the   whole
+  database or for selected contigs. The consensus is written to a file
+  named by the user.
+  Supply a file name,  choose  between   whole  database  or  selected
+  contigs.
+
+        Symbols for uncertainty in gel readings
+
+        In  order  to  record  uncertainties  when  reading  gels  the
+  codes  shown  below can  be  used. Use  of these codes permits us to
+  extract the maximum amount of data from each gel and yet record  any
+  doubts   by  choice   of   code.    The program can deal with all of
+  these codes and any other  characters  in  a  sequence  are  treated
+  as  dash  (-) characters.
+
+         SYMBOL                  MEANING
+
+           1             PROBABLY        C
+           2                "            T
+           3                "            A
+           4                "            G
+           D                "            C       POSSIBLY        CC
+           V                "            T          "            TT
+           B                "            A          "            AA
+           H                "            G          "            GG
+           K                "            C          "            C-
+           L                "            T          "            T-
+           M                "            A          "            A-
+           N                "            G          "            G-
+           R             A OR G
+           Y             C OR T
+           5             A OR C
+           6             G OR T
+           7             A OR T
+           8             G OR C
+           -             A OR G OR C OR T
+           a             A set by auto edit
+           c             C set by auto edit
+           g             G set by auto edit
+           t             T set by auto edit
+           *             padding character placed by auto assembler
+            else = -
+
+  The DNA consensus algorithm
+
+        The "calculate  consensus"  function,  the  "display   contig"
+  routine and the "show quality" option use  the rules  outlined  here
+  to  calculate  a consensus  from aligned gel  readings.   Note  that
+  "display  contig"  calculates a consensus for  each  page  width  it
+  displays  (it  does  not use the consensus sequence file  calculated
+  by the consensus function).
+
+        We have 6 possble symbols in the consensus sequence: A,C,G,T,*
+  and -. The last symbols is assigned if none of the others makes up a
+  sufficient proportion of the aligned characters at any  position  in
+  the contig. The following calculation is used to decide which symbol
+  to place in the consensus at each position.
+
+        Each uncertainty code contributes a score to one of  A,C,G,T,*
+  and  also  to  the  total  at each point. Symbols like R and Y which
+  don't correspond to a single base type contribute only to the  total
+  at each point. The scores are shown below.
+                definite assignments ie A,C,G,T,B,D,H,V,K,L,M,N,a,c,g,t,* =1
+
+                probable assignments ie 1,2,3,4 = 0.75
+
+                other uncertainty codes including R,Y,5,6,7,8,- = 0.1
+
+
+        A cutoff score of 51% to 100% is supplied by the  user.  (When
+  the   program   starts   this  is  set  to  75%.  See  "set  display
+  parameters").  At each position in the contig we calculate the total
+  score  for  each of the 5 symbols A,C,G,T and * (denote these by Xi,
+  where i=A,C,G,T or *), and also the sum of these totals (denote this
+  by S). Then if 100 Xi / S > the cutoff for any i, symbol i is placed
+  in the consensus; otherwise - is assigned.
+
+        Notice that S does not equal the number of times the  sequence
+  has  been  determined, but is the score total, and hence we are less
+  likely to put a -  in  the  consensus.  For  the  "examine  quality"
+  algorithm  each  strand is treated separately but the calculation is
+  the same. (It was originally different).
+
+        Format of the consensus sequence ( and vector sequences).
+
+        A consensus  sequence  file  may  contain  the  consensus  for
+  several contigs and so we identify each of them by preceding them by
+  a 20 character title. The title is of the form  <---LAMBDA.076----->
+  (  where LAMBDA is the project name and gel reading number 76 is the
+  leftmost gel reading to contribute to  this   consensus   sequence).
+  The   angle  brackets  <>  and the three digit number precede by a .
+  are important to some processing programs.
+ @25. TX 1 @Show relationships
+
+        Used to show the relationships of  the  gel  readings  in  the
+  database in three ways -
+  (a) All contig descriptor lines  followed  by  all  gel   descriptor
+  lines.
+  (b) All contigs one after the   other   sorted,   i.e.    for   each
+  contig   show  its   contig  descriptor line followed by all its gel
+  descriptor lines sorted on position from left to right
+  (c) Selected contigs:  show the contig  line  and,  in  order, those
+  gel  readings  that  cover  a  user-defined  region.  Note that this
+  output can be directed to a disk file by prior  selection  of  "disk
+  output".
+
+        Below is an example showing a contig from position 1  to  689.
+  The left gel reading  is number 6 and has archive name HINW.010, the
+  rightmost gel  reading is number 2 and is has archive name HINW.004.
+  On  each  gel  descriptor  line  is  shown:  the name of the archive
+  version, the gel number, the position of the left  end  of  the  gel
+  reading  relative to the left  end  of  the  contig,  the length  of
+  the gel reading  (if this is negative it means that the gel  reading
+  is  in  the  opposite orientation to its archive), the number of the
+  gel reading   to the left and the number of the gel reading  to  the
+  right.
+
+
+   CONTIG LINES
+   CONTIG      LINE  LENGTH               ENDS
+                                       LEFT   RIGHT
+                 48     689               6       2
+   GEL LINES
+   NAME      NUMBER POSITION LENGTH     NEIGHBOURS
+                                       LEFT   RIGHT
+   HINW.010       6        1   -279       0       3
+   HINW.007       3       91   -265       6       5
+   HINW.009       5      137   -299       3      17
+   HINW.999      17      140    273       5      12
+   HINW.017      12      193    265      17      18
+   HINW.031      18      385   -245      12       2
+   HINW.004       2      401   -289      18       0
+
+ @21. TX 3 @Enter new gel reading
+
+        Used to enter new gel readings into the database. The new  gel
+  reading  must have previously been compared with the contents of the
+  database by use of " auto assemble"  in order  to  ascertain  if  it
+  overlaps any previously entered data.
+
+        The user is expected to know: if the gel reading overlaps;  if
+  so  which  contig  it overlaps; if so where it overlaps. The program
+  takes the user through a series of question to establish the  nature
+  of  the  overlap  and  then  displays  the overlap. The user is then
+  offered a number of options,  including  editors  for  the  new  gel
+  reading  and  the contig, to enable the correct alignment of the gel
+  reading throughout its whole length.
+  Supply the name of the gel reading file.  If the  gel  reading   has
+  been  entered before the program will  not permit entry. The program
+  gives the gel reading a unique  number  and  asks  if  the  sequence
+  overlaps  any  data  already  in  the  database  (reported  by "auto
+  assemble").  If it does not, entry is complete.  If it does  overlap
+  the  dialogue  continues with the program asking if the gel readings
+  overlaps "in  the  normal  sense",  if  not  it  will  automatically
+  complement  the  sequence.  Then supply the number of the contig the
+  gel reading overlaps (as reported by "auto assemble").
+
+        Overlaps are divided into two types: those for which  the  new
+  gel  reading  protrudes from the left end of the contig it overlaps,
+  and those for which it does not. The program asks  about  this  with
+  the  question "Left end of gel reading is inside contig". If this is
+  true the program will go on to ask for the position in the contig of
+  the  left  end of the new gel reading. If it is not true the program
+  will ask for the position in the new gel reading of the left end  of
+  the contig.
+
+        Once this is completed the program will display the  first  50
+  bases  of  the  overlap.  The  gel  readings in the contig and their
+  consensus are displayed with the new  gel  reading  underneath.  The
+  mismatches are shown by *'s on the next line down. For example:
+
+
+                             60        70        80        90       100
+     -6  HINW.010    CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCGCGGACACGTC
+     -3  HINW.007                                            GGCACA*GTC
+         CONSENSUS   CACAAGCGAGCGAGTGGGGCACGGTGACGTGGTCACGCCG-G-ACACGTC
+         NEWGEL      CACAAGCGAGCGAGAGGGGCACCGTGACGTGGTCACGCCGGGGACACGTC
+         MISMATCH                  *                         * *
+                             10        20        30        40        50
+
+
+        The program then needs to know if the position  of  the   left
+  end  of  the  overlap  is  correct.   If  it is the user should type
+  return, if not, 1 and the program will ask for the new position  and
+  display it.
+  The program now offers a number of  options  to  allow  the user  to
+  align  the  new gel reading correctly over its whole length with the
+  data   already   in   the   contig.    It   is    important     that
+  sufficient   edits   are   made   to  the  new  gel  reading  or the
+  sequences in the contig at this stage to get the alignment  correct,
+  because  once entry  is completed, the alignment is fixed and cannot
+  easily be changed (see "alter relationships").  Alignment   can   be
+  achieved  by   making  insertions   or  deletions  but  deletion  of
+  data requires the original gels to be checked.   For   this   reason
+  at  entry  we usually make only insertions to achieve alignment.  We
+  use X or asterisks (*) as padding characters  to  achieve  alignment
+  and  so   can,  if  required, distinguish  padding  characters  from
+  characters assigned from reading gels.
+
+        The options available are:
+     ? = HELP
+     ! = Give up
+     3 = Complete entry
+     4 = Edit contig
+     5 = Display overlap
+     6 = Edit new gel reading
+
+
+
+        1. HELP gives this information.
+
+        2. Give up allows users to change their minds  about  entering
+  the  new  gel reading. The program will ask the user to confirm this
+  choice.
+
+        3. Complete entry is the command to add the new gel reading to
+  the  contig.  The program updates the relationships accordingly. The
+  user is asked to confirm this command.
+
+        4. Edit contig gives the user access to a simple  editor  that
+  allows  insertions,  deletions and changes to be made to the contig.
+  The editor  maintains  alignments  by  making  the  same  number  of
+  insertions or deletions in all sequences covering the edit position.
+  The  program protects the  user           by   allowing  edits  only
+  within the region of overlap.
+
+        5. Display allows display of the region of overlap only.  This
+  is  defined  by the relative positions in the contig. The default is
+  the whole of the region of overlap.
+
+        6. Edit new gel reading allows  the  new  gel  reading  to  be
+  edited using a simple editor.
+ @23. TX 3 @Complement a contig
+
+        This function will complement  and  reverse  all  of  the  gel
+  readings   in    a  contig.     It    automatically   reverses   and
+  complements  each  gel reading sequence,  reorders  left  and  right
+  neighbours,   recalculates   relative  positions  and  changes  each
+  strandedness.
+
+        The only user  input  required  is  to  identify  the   contig
+  to complement  by  the  number or name of a gel reading it contains.
+  DO NOT KILL THE PROGRAM DURING THIS STEP!
+ @22. TX 3 @Join contigs
+
+        This function joins contigs interactively.  It allows the user
+  to  align  the  ends  of  the  two  contigs  by  editing each contig
+  separately.  It  is  important  that  the  alignment   achieved   is
+  correct  because  once the join is completed the alignment is fixed.
+  The program needs to know which two contigs to join and  where  they
+  overlap.
+
+        First which two contigs are to be  joined.   The  user  should
+  identify  the two contigs. First the left contig and then the right.
+  The program checks that the two contig  numbers  are  different  (it
+  will not allow circles to be formed!)
+
+        Now identify the exact position of overlap. This is defined as
+  the  position  in the left contig that the leftmost character of the
+  right contig overlaps.  Normally  the  position  is  established  by
+  employing  the  end  gel  reading  for  option "auto assemble".  The
+  overlap must be of at least  one  character.      The  program  then
+  displays  the  join  showing  all  the  gel readings overlapping the
+  join from the left contig, their consensus,  all  the  gel  readings
+  from  the  right  contig  that  overlap  the  join,  their consensus
+  and   then  asterisks  to  denote   mismatches   between   the   two
+  consensuses. For example:
+
+                           1460      1470      1480      1490      1500
+     56  HINW.100    TCT*GAGCAGTGTGGGCGCTG*CCGG
+     33  HINW.300    TCT*GAGCAGTGTGGGCGCTGC*CGGGCTCGGAGGG
+    -25  HINW.090    TCT*GAGCAGTGTGGGCG*T*G*CGGGCTCGGAGGG
+     19  HINW.123    TCTCGAGCAGTGTGGGCGCTG**CGGGCTCGGAGGGCATGAAGTAGAGCG
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTG-CCGGGCTCGGAGGGCATGAAGTAGAGCG
+     -6  HINW.010    TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+     -3  HINW.007                TGGGCGCTGCCCGGGCTCGGAGGGCATGAAGT*AGAGC
+     -5  HINW.009                              GCTCGGAGGGCATGAAGT*AGAGC
+         CONSENSUS   TCTCGAGCAGTGTGGGCGCTGCCCGGGCTCGGAGGGCATGAAGTTAGAGC
+         MISMATCH                         *                      ******
+                             10        20        30        40        50
+
+
+        It  is  essential  that  the  user  aligns  the  two   contigs
+  throughout  the  whole  region of overlap before completing the join
+  because it is only at this stage that the two contigs can be  edited
+  independently.  Once the join is completed the alignment can only be
+  altered using the routines supplied by  "alter  relationships".  The
+  program  offers  the user options to facilitate the alignment of the
+  two contigs.  These options are:-
+
+     ? = Help
+     ! = Give up
+     3 = Complete join
+     4 = Edit left contig
+     5 = Display joint
+     6 = Edit right contig
+     7 = Move join
+
+  1. Help gives this information.
+  2. Give up allows the user to return to  the  main  options  without
+  completing the join. Note any edits made will remain.
+  3. Complete join instructs the program to update  the  relationships
+  so  that  the two contigs are joined. DO NOT KILL THE PROGRAM DURING
+  COMPLETE JOIN!
+  4. Edit left contig and edit right contig give access  to  a  simple
+  editor  that  allows insertions, deletions and changes to be made to
+  the  contigs. Help is available on editing once the  editing  option
+  is  selected.  The user is only allowed to edit within the region of
+  overlap and should make sure that the positions used  correspond  to
+  the correct contig.
+  5. Display join displays the joint as shown above.
+  6. See above.
+  7. Move join allows the position of the joint to be changed.
+ @24. TX 1 @               Copy the database
+
+        Used to make a copy of the database. If required the  database
+  size  can  be altered using this option. The "version" of a database
+  is  encoded as the last letter in the names of the three files  that
+  contain the database.
+
+        Supply a "version" number (the default is version 1),  and  if
+  required  select a new size for the database. The size of a database
+  is the number of lines of information it can hold. It needs  a  line
+  for each gel reading and another for each contig.
+ @19. TX 1 @               Check database
+
+        Used to perform a check on  the  logical  consistency  of  the
+  database. No user intervention is required.
+
+        The following relationships are checked:
+  1.       If gel reading A thinks gel reading B is its left neighbour
+  does B think A  is its right neighbour?  The error message is
+  "Hand holding problem for gel reading A"
+  followed by  the gel descriptor lines for gel readings A and B.
+  2.       Are there any contig lines with no left or  right  end  gel
+  readings?  The error message is
+  "Bad contig line number A"
+  3.       Do the gel readings that are  described  as  left  ends  on
+  contig lines agree that they are left ends?  The error message is
+  "The end gel readings of contig A have outward neighbours"
+  4.       Are there gel readings that are in more  than  one  contig?
+  The error message is
+  " Gel number A is used N times"
+  5.       Are there gel readings that are not  in  any  contig?   The
+  error message is
+  " Gel number A is not used"
+  6.       Do the relative positions of   gel  readings   agree   with
+  their  position  as  defined by left and right neighbourliness?  The
+  error message is
+  " Gel number A with position X is left neighbour of  gel  number   B
+  with position Y"
+  7.       Are there any loops in   contigs?    If   so   no   further
+  checking is done.  The error mesage is
+  " Loop in contig n no further checking done, but gel reading numbers
+  follow"
+  The program  then  prints the gel  reading  numbers  in  the  looped
+  contig up to the start of the loop.
+  8. Are there any contigs of length <1? The error message is
+  " The contig on line number x has zero length"
+  9. Are there any gel readings (used in only one  contig)  that  have
+  zero length? The error message is
+  " Gel number N has zero length"
+  Note that "auto assemble"  also uses this logical consistency  check
+  and will only tolerate a "Gel number N is not used" error. Any other
+  error will cause it to give up.
+ @29. TX 1 @               Examine quality
+
+        Analyses the quality of the data in a contig.  It  reports  on
+  the  proportion  of the consensus that is "well determined" and will
+  display a sequence of symbols  that  indicate  the  quality  of  the
+  consensus at each position.
+
+        Identify the contig to analyse, and the section  of  interest.
+  The  current  consensus  calculation  cutoff  score  will be used to
+  decide if each position is "well determined". In general the quality
+  of  a  reading deteriorates along the length of the gel and so it is
+  also possible to use a length cutoff for  the  quality  calculation.
+  Only  the  data  from  the  first  section  of  each reading will be
+  included in the quality calcualtion. The  length  is  altered  under
+  "set parameters" and is initially set to the maximum reading length.
+  A summary showing the percentage of the consensus  that  falls  into
+  each category of quality is shown. Choose whether or not to have the
+  quality codes for each position of the consensus displayed. They can
+  be displayed as either graphics or text.
+
+        The quality of the data depends on the number of times it  has
+  been  sequenced  and the particular uncertainty codes  used  in each
+  gel reading.  This function divides the data into  five  categories,
+  assigning each a symbol or code:
+  1.  Well determined on both strands and they agree.  code=0
+  2.  Well determined on the plus strand only.  code=1
+  3.  Well determined on the minus strand only.  code=2
+  4.  Not well determined on either strand.  code=3
+  5.  Well determined on both strands but they disagree.  code=4
+  A position is "well determined" if it is assigned one of the symbols
+  A,C,G,T  when  the  algorithm  described in the section "calculate a
+  consensus".   The  calculation  is  performed  separately  for  each
+  strand.
+
+        If the user chooses to have the data displayed graphically the
+  following  scheme  is used. A rectangular box is drawn so that the x
+  coordinate  represents  the  length  of  the  contig.  The  box   is
+  notionally divided vertically into 5 possible levels which are given
+  the y values: -2,-1,0,1,2.  The quality  codes  attributed  to  each
+  base  position are plotted as rectangles.  Each rectangle represents
+  a region in which the quality codes are identical, so a single  base
+  having a different code from its immediate neighbours will appear as
+  a very narrow rectangle.
+
+    Rectangle bottom and top y values
+
+       Quality 0 rectangle from 0 to 0
+       Quality 1 rectangle from 0 to 1
+       Quality 2 rectangle from 0 to -1
+       Quality 3 rectangle from -1 to 1
+       Quality 4 rectangle from -2 to 2
+
+        Obviously a single line  at  the  midheight  shows  a  perfect
+  sequence.
+
+        Typical dialogue is shown below.
+
+     41.47% OK on both strands and they agree(0)
+     55.48% OK on plus strand only(1)
+      2.08% OK on minus strand only(2)
+      0.97% Bad on both strands(3)
+      0.00% OK on both strands but they disagree(4)
+    ? (y/n) (y) Show sequence of codes
+
+             10         20         30         40         50
+     1111111111 1111111111 1111111111 1111111111 1111111111
+
+             60         70         80         90        100
+     1111111111 1111111111 1111111111 3111111111 1111111111
+
+            110        120        130        140        150
+     1111111111 1111131111 1111111111 1111111111 1111111111
+
+            160        170        180        190        200
+     1111111111 1111111111 1111111111 1111111111 1111111133
+
+            210        220        230        240        250
+     1311111111 1111111111 1111111110 0000000000 0000220000
+
+            260        270        280        290        300
+     0000000000 0020000000 2200000202 0002000000 0000222200
+
+ @26. TX 3 @               Alter relationships
+
+        Used  to  make  what  are  normally  illegal  changes  to  the
+  database. That is the normal checks are not done and any item in the
+  database can be changed independently of all others. Users  need  to
+  know  what they are doing because it is very easy to make a horrible
+  mess. Always start by making a copy!
+
+        By using the  options  here  users  can  edit  individual  gel
+  readings  in  contigs,  move  one  section  of  a contig relative to
+  another, break contigs, remove contigs, remove gel readings, etc. To
+  give  flexibility most of the commands do only one thing. This means
+  that several commands may  have  to  be  executed  to  complete  any
+  change.  At the end of this help section there are notes on removing
+  gel readings from the database.
+
+        The following options are offered:
+
+     ? = HELP
+     ! = QUIT
+     3 = Line change
+     4 = Edit single gel reading
+     5 = Delete contig
+     6 = Shift
+     7 = Move gel reading
+     8 = Rename gel reading
+     9 = Break a contig
+
+  1. HELP gives this information.
+  2. QUIT returns to the main options of SAP.
+  3. Line change
+  allows the user to change the contents  of  any line in the file  of
+  relationships.   The  line is selected by number, the program prints
+  the current line and prompts for the new  line.
+  4.   Edit
+  allows   the   user   to    edit    an    individual    gel  reading
+  independently of any others it may be related to. The edit positions
+  are relative to the contig. The effect of this editing on the length
+  of the gel reading is taken care of but, if it changes the length of
+  a contig, or its relationship to others, this must be accounted  for
+  (if necessary) by use of the "line change" function.
+  5.  Delete  contig
+  is a function that deletes a contig line  by moving  down  all   the
+  contig lines above by one position.  It prompts only for the line to
+  delete.  It does not  delete  any   of   the  gel  readings  or  gel
+  reading  lines  for the deleted contig but it does reduce the number
+  of contigs on line IDBSIZ by 1.
+  6.  Shift
+  allows the user to change all the relative  positions of  a set   of
+  neighbouring  gel  readings by some fixed value, i.e.  it will shift
+  related gel readings either left or  right.   It  can  therefore  be
+  used  to  change the alignment of the gel readings in a contig or as
+  part of the process of breaking a contig into two parts (see below).
+  It  prompts  for  the  number  of the first gel reading to shift and
+  then  for the  distance  to  move  them (Note a negative value  will
+  move  the  gel readings left and a positive value right).   It  then
+  chains rightwards (ie follows right neighbours) and shifts each  gel
+  reading,  in  turn,  up to the  end of the contig.  (This means that
+  only those gel readings from the first to shift to the rightmost are
+  moved). It updates the length of the contig accordingly.
+  7. Move gel reading
+  is  a  function  to  renumber  a  gel  reading.  It  moves  all  the
+  information  about  a  gel reading on to another line. The user must
+  specify the number of the gel  reading to move and the number of the
+  line  to place it. It takes care of all the relationships. Of course
+  gel readings must not be  moved  to  lines  occupied  by  other  gel
+  readings!  It  can  be used as part of the process of removing a gel
+  reading from the database (see below).
+  8.  Rename gel reading
+  is a function that is used to  rename  the archive   names   of  gel
+  readings   in  the  database;   it only changes the name in the .ARN
+  file of the  database.
+
+  9. Break contig
+
+        Occasionaly it is necessary to break a contig into  two  parts
+  and  this  can be achieved using this option. The program needs only
+  the number of a gel reading. This is  the  gel   reading  that  will
+  become  a  left  end  after  the  break.  That is, the break is made
+  between this gel reading and its left neighbour. A new  contig  line
+  is created so ensure that there is sufficient space in the database.
+  Removing gel readings from contigs
+
+        Gel readings can be removed  from  contigs  if  they  are  not
+  essential  for  holding the contig together (ie are not the only gel
+  reading covering a particular region). Suppose the  gel  reading  to
+  remove  is gel number b with left neighbour a and right neighbour c.
+  Using "line change" change the right neighbour of a to  c,  and  the
+  left neighbour of c to a. To tidy things up: suppose there are x gel
+  readings in the database; then, using "move gel reading" move gel  x
+  to  line  b;  then,  using  "line change" decrease the number of gel
+  readings in the database (stored in the last line) by 1.
+ @27. TX 1 @  Set display parameters
+
+        Used to  redefine  the  parameters  that  control  the  cutoff
+  employed  by  the  consensus  calculation  and quality examiner, the
+  maximum  length  of  each  reading  to  include   in   the   quality
+  calculation,  the line length used by the display function, the text
+  window length used by the graphics options, and the graphics  window
+  length used by the graphics options.
+
+        The default cutoff score is 75%. The default line length is 50
+  characters. For protein sequences the cutoff is always 100%.
+
+        The text window used by  the  graphics  options  controls  the
+  amount  of  sequence  listed at the crosshair position. The graphics
+  window controls the "zoom" function. Both these windows are  defined
+  as  the number of bases that should be shown, to both left and right
+  of the crosshair.
+ @30. TX 3 @  Auto edit a contig
+
+        This function automatically changes characters in gel readings
+  to  make  them  agree with the consensus sequence. If employed as is
+  intended, use of this function is not  a  criminal  activity  but  a
+  method  that saves a large amount of work. All characters changed by
+  the auto editor  will  appear  in  the  gel  readings  as  lowercase
+  letters. The current consensus calculation cutoff score is used.
+
+        Identify the contig and the section to edit. The program  will
+  display  a  summary  of  changes  made. Note that it is important to
+  understand both what the auto editor does and the order in which  it
+  does  it. Before employing the auto editor users should note all the
+  corrections that they require, so that  after it has been  used  the
+  corrections can be checked.
+
+        The general strategy employed when collecting shotgun sequence
+  data  is  to let the contigs get fairly deep, to get a printout of a
+  contig, check problems against the films, note  corrections  on  the
+  printout,  and  make  the  changes  using  an interactive editor. In
+  general the consensus is correct except  for  places  where  padding
+  characters  have been used to accommodate a single gel with an extra
+  character, or where the consensus is dash. The important  point  for
+  the  auto  editor  is  that  most edits simply make the gel readings
+  conform to the consensus, or remove columns of pads.
+
+        The new editor does the following.
+
+        1) calculates a consensus for the contig (or part of a contig)
+  to  be edited, and then uses this consensus to direct the editing of
+  the contig in 3 stages
+
+        2) stage 1: find and correct all places where, if the order of
+  two  adjacent  characters  is swapped, they will both agree with the
+  consensus (given that they did  not  match  the  consensus  before).
+  These corrections are termed "transpositions"
+
+        3)  stage 2: find and correct all  places  where  there  is  a
+  definite  consensus  but  the gel reading has a different character.
+  These corrections are termed "changes".
+
+        4) stage 3: delete all  positions  in  which  padding  is  the
+  consensus. These corrections are termed "deletions".
+
+        All changed characters are shown in lowercase  letters  so  it
+  will  be  obvious which characters have been assigned by the program
+  (except for deletions). The number of each type of  correction  will
+  be displayed.
+ @10. TX 2 @Clear graphics
+
+        Clears graphics from the screen.
+ @11. TX 2 @Clear text
+
+        Clears  text from the screen.
+ @12. TX 2 @Draw a ruler.
+
+        This option allows the user to draw a ruler or scale along the
+  x  axis  of the screen to help identify the coordinates of points of
+  interest. The user can define the position of the first base  to  be
+  marked  (for  example if the active region is 1501 to 8000, the user
+  might wish to mark every 1000th base starting at either 1501 or 2000
+  -  it  depends  if  the user wishes to treat the active region as an
+  independent unit with its own numbering starting at its  left  edge,
+  or  as  part  of  the  whole sequence). The user can also define the
+  separation of the ticks on the scale and their height.  If  required
+  the labelling routine can be used to add numbers to the ticks.
+ @14. TX 2 @Reposition plots
+
+        The positions of each of the plots is defined  relative  to  a
+  users  drawing board which has size 1-10,000 in x and 1-10,000 in y.
+  Plots for each option are drawn in a window  defined  by  x0,y0  and
+  xlength,ylength. Where x0,y0 is the position of the bottom left hand
+  corner of the window, and xlength is the width  of  the  window  and
+  ylength the height of the window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "ANALMARG" when
+  the program is started. Users can have their own file  if  required.
+  As  all  the plots start at the same position in x and have the same
+  width, x0 and xlength are the same for all options. Generally  users
+  will  only  want  to change the start level of the window y0 and its
+  height ylength. This option allows users to change window  positions
+  whilst  running  the  program.   The  routine  prompts first for the
+  number of the option that the users wishes to reposition;  then  for
+  the  y  start and height; then for the x start and length. Note that
+  changes to the x values affect all options. If the user  types  only
+  carriage  return  for any value it will remain unchanged. Note that,
+  unlike all the other programs, the boxes used to contain  analytical
+  results (eg plot quality) should not be made to overlap one another,
+  as the function of the crosshair routine depends on  which  box  the
+  crosshair is in!  overlap
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written on the  diagram
+  with  its  left end at the cross hair position.  The cross-hair will
+  then immediately reappear. The  user  may  put  the  same  label  on
+  another part of the diagram as before or if he hits the space bar he
+  will be asked if he wishes to type in another label.
+
+        Typical dialogue follows.
+  ? Menu or option number=15
+  Type label then drive cross hair to left or right end
+  of label position then hit  "L"  to  write label left
+  justified or  "R"  to  write label right justified or
+  the space bar to quit
+
+
+  ? Label=delta gene
+
+   missing graphics
+
+  ? Label=
+
+ @16. TX 2 @Display a map.
+
+        This draws a map of any  sequence  features  selected  by  the
+  user.   These  features  may  be  protein coding regions (CDS), tRNA
+  genes (TRNA), promoter positions (PRM), etc. Users may define  their
+  own  feature  table  key  names. For example I find it convenient to
+  split CDS lines into CDS1, CDS2 and CDS3 each of which contains only
+  those  sequences  that  code in the reading frames 1, 2 or 3. Then I
+  can plot them at different heights on the screen ( suitable  heights
+  can be determined by using the cross-hair).  The coordinates must be
+  stored in a file in the format of an EMBL feature table.
+
+        Typical dialogue follows.
+  ? Menu or option number=16
+   Display a map using an EMBL feature table file
+  ? map file name=hsegl1.ft
+  ? feature code(e.g. CDS) =CDS
+  X 1 + strand
+    2 - strand
+    3 both strands
+  ? 0,1,2,3 =
+  ? level (0-9480) (256) =4000
+
+   missing graphics
+
+  ? feature code(e.g. CDS) =
+
+ @7. TX 1 @Redirect output
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @13. TX 2 @Use crosshair
+  This option puts a steerable cross on  the  screen  which  the  user
+  drives around by using the arrow keys (or mouse). When the crosshair
+  is visible a number of options are available if the user  types  one
+  of  a  set of special keyboard characters. Any other characters will
+  cause an exit from the crosshair option. The special keys are:
+
+      I = Identify the nearest gel reading
+      Z = Zoom in
+      Q = plot Quality
+      S = display the aligned Sequences at the crosshair position
+      N = list the Names and Numbers of the sequences at the crosshair
+
+        In order for  any  of  these  special  keys  to  operate,  the
+  crosshair  must  be  in  an appropriate display box, and the precise
+  function of the keys will also depend on which box the crosshair  is
+  in.
+
+        If the crosshair is in the "plot  all  contigs"  box,  Z  will
+  cause  a  new box to appear showing all the readings for the nearest
+  contig; Q will give the same as Z but will also produce an extra box
+  showing the "quality" plot.
+
+        If Z is hit in the "plot single contig" box, the  contig  will
+  be  zoomed  to  the  current  graphics window size. The zoom will be
+  roughly centred on the crosshair position. Because  of  this  it  is
+  possible  to  step  along  a  contig  by repeatedly zooming with the
+  crosshair near to one end of the single contig display box. If I  is
+  hit  the crosshair must be close to a gel reading line. If Q is hit,
+  the quality plot will be produced for the region shown in  the  plot
+  single  contig  box. In all cases when the "plot all contigs" box is
+  shown, a vertical line will  bisect  the  line  the  represents  the
+  relevent contig, at the current position.
+
+        If the crosshair is in the plot quality box only the character
+  "s" will operate as a special symbol.
+
+        The number of bases shown in the N and S options is controlled
+  by  the  current graphics text window size, and the size of the zoom
+  window by the current graphics window size.  Both  are  set  by  the
+  parameter setting function of the general menu.
+ @33. TX 2 @Plot single contig
+  This option produces a schematic of a selected region  of  a  single
+  contig  by  drawing  a  horizontal line to represent each of its gel
+  readings. The lines show the relative positions of each reading  and
+  also  their  sense. The plot is divided vertically into two sections
+  by a line that is identified by an asterisk drawn at each  end.  All
+  lines  that lie above this line represent readings that are in their
+  original sense, all lines  below  show  readings  that  are  in  the
+  complementary  sense  to  their  original.  By  use of the crosshair
+  function the plot can  be  stepped  through  and  examined  in  more
+  detail. See help on crosshair.
+ @34. TX 2 @Plot all contigs
+  This option produces a schematic of all the contigs in  a  database.
+  It does this by drawing a horizontal line to represent each of them.
+  In order to show the ends of each contig  it  draws  the  lines  for
+  contigs at alternate heights: the first at height one, the second at
+  height two, the third at height one, etc. The order of  the  contigs
+  in the display is the same as their order in the database. By use of
+  the crosshair function the plot can be stepped through and  examined
+  in more detail. See help on crosshair.
+ @31. TX 3 @ Type in gel readings
+  This option allows gel readings to be typed in at the  keyboard.  It
+  creates  a  separate  file  for  each gel reading and a file of file
+  names for the batch. The sequences from each  batch  may  be  listed
+  when  they have all been entered. Users may choose to employ special
+  keys to identify the 4 bases A,C,G and T. By default  these  special
+  keys  are  N  M  ,  .  but any other four characters may be used. If
+  special keys are used the characters are automatically translated to
+  A C G T before being stored on the disk.
+ @35. TX 1 @ Find internal joins
+  The purpose of this function is to use data already in the  database
+  to  find possible joins between contigs.  Joins may have been missed
+  due to poor  data  or  may  have  not  been  made  due  to  repeated
+  sequences.  Where  appropriate, it may be possible to find potential
+  joins by using the data clipped off readings prior  to  their  entry
+  into the database.
+  The database is checked for logical consistency.  Supply  a  minimum
+  initial  match  length,  a minimum alignment block, the maximum pads
+  per sequence, the maximum  percent  mismatch  after  alignment,  the
+  probe length. Choose if clipped data is to be used, if so define the
+  window size for finding good data and the number of  dashes  allowed
+  in  the  window. Processing will commence.  Most of these values are
+  used in an identical way in the autoassemble  function.  The  others
+  are defined below.
+  The program strategy
+  Take the first contig and calculate its consensus. If  clipped  data
+  is  being  used  examine  all readings that are in the complementary
+  orientation, and sufficiently near to the contigs left end,  to  see
+  if  they have good clipped sequence which if present, would protrude
+  from the left end of the contig.  If  found  add  the  longest  such
+  sequence to the left end of the consensus. Do the same for the right
+  end by examining readings that are in their original orientation. If
+  any  are  found  add  the  longest extension to the right end of the
+  consensus. Repeat the consensus calculations and extensions for  all
+  contigs  hence  producing  an extended consensus. If clipped data is
+  not  being  used  simply  calculate  the  consensus  for  the  whole
+  database.  Now  look  for  possible joins by processing the extended
+  consensus in the following  way.  Take  the  last,  say  100,  bases
+  (termed  the  "probe  length"  by  the  program)  of  the  rightmost
+  consensus, compare it both orientations with the extended  consensus
+  of  all the other contigs. Display any sufficiently good alignments.
+  Repeat with the left end of the rightmost contig. Do  the  same  for
+  the ends of all the entended contigs, always only comparing with the
+  contigs to their left, so that the same matches do not appear twice.
+  Good cliped data is defined by sliding a window of "Window size  for
+  good  data scan" bases outwards along the sequence and stopping when
+  "Maximum number of dashes in scan window" or more dashes  appear  in
+  the  window.   Note that it is advisable to have some sort of cutoff
+  because if we simply take all the  data  it  might  be  so  full  of
+  rubbish  that  we wont find any good matches. For the same reason it
+  is worth trying the procedure with different cutoffs. An initial run
+  using  no  clipped  data  is  also  recommended.   Sufficiently good
+  alignments are defined by  criteria  equivalent  to  those  used  in
+  autoassemble,  however here we only display alignments that pass all
+  tests.
+  Bugs
+  If a small contig is wholly contained within a larger one, such that
+  its  ends  are further than ("Probe length" - "Minimum initial match
+  length") from the ends of the larger contig, and the  consensus  for
+  the small contig lies to the left of the consensus for large contig,
+  the overlap will not be discovered. (See the search stratgey).
+  All numbering is relative to base number one in the contig:  matches
+  to  the  left  (i.e.  in  the clipped data) have negative positions,
+  matches off the right end of the contig (i.e. in the  clipped  data)
+  have  positions  greater  than  that of the contig length. A typical
+  result is shown below.
+
+   Right end of contig   22 in the - sense  and contig   96
+   Percentage mismatch after alignment =  3.0
+          628        638        648        658        668        678
+            GTGAGATGAG CATATTTAAA ATGAACCGAG CAGTTAGGAG ATATGTTGGG AGGACAAGAA
+             ********* ********** ********** ********** ********** **********
+            -TGAGATGAG CATATTTAAA ATGAACCGAG CAGTTAGGAG ATATGTTGGG AGGACAAGAA
+          -86        -76        -66        -56        -46        -36
+          688        698        708        718
+            ACATCCGGGA TACAGTCAAT AAATGAAAAA TTAATGAATT
+            ********** ********** ****** *** ***** ****
+            ACATCCGGGA TACAGTCAAT AAATGA-AAA TTAATTAATT
+          -26        -16         -6          4
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/help/sap_menu b/help/sap_menu
new file mode 100644
index 0000000..edbbe82
--- /dev/null
+++ b/help/sap_menu
@@ -0,0 +1,76 @@
+-1 0 21 2 T General
+-1 0 21 2 X General
+-2 0 50 2 T Screen control
+-2 0 71 2 X Screen
+-3 0 98 2 T Modification
+-3 0 98 2 X Modification
+0 -1 116 379 T SAP
+0 -1 116 379 X SAP
+17 1 19213 18 T Screen against restriction enzymes
+17 1 19213 18 X Screen against restriction enzymes
+18 1 20256 22 T Screen against vector
+18 1 20256 22 X Screen against vector
+20 2 21583 113 T Auto assemble
+20 2 21583 113 X Auto assemble
+28 1 27744 42 T Highlight disagreements
+28 1 27744 42 X Highlight disagreements
+32 3 30106 22 T Extract gel readings
+32 3 30106 22 X Extract gel readings
+1 0 31209 3 T Help
+1 0 31209 3 X Help
+2 0 31277 5 T Help
+2 0 31277 5 X Help
+3 1 31470 175 T Open a database
+3 1 31470 175 X Open a database
+4 3 40550 64 T Edit
+4 3 40550 64 X Edit
+9 3 43796 40 T Screen edit
+9 3 43796 40 X Screen edit
+5 1 45923 45 T Display a contig
+5 1 45923 45 X Display a contig
+6 1 48409 6 T List a text file
+6 1 48409 6 X List a text file
+8 1 48667 94 T Calculate a consensus
+8 1 48667 94 X Calculate a consensus
+25 1 53186 41 T Show relationships
+25 1 53186 41 X Show relationships
+21 3 55121 99 T Enter new gel reading
+21 3 55121 99 X Enter new gel reading
+23 3 60131 11 T Complement a contig
+23 3 60131 11 X Complement a contig
+22 3 60644 70 T Join contigs
+22 3 60644 70 X Join contigs
+24 1 64235 11 T Copy the database
+24 1 64235 11 X Copy the database
+19 1 64781 41 T Check database
+19 1 64781 41 X Check database
+29 1 66799 82 T Examine quality
+29 1 66799 82 X Examine quality
+26 3 70617 92 T Alter relationships
+26 3 70617 92 X Alter relationships
+27 1 75377 17 T Set display parameters
+27 1 75377 17 X Set display parameters
+30 3 76245 48 T Auto edit a contig
+30 3 76245 48 X Auto edit a contig
+10 2 78721 3 T Clear graphics
+10 2 78721 3 X Clear graphics
+11 2 78786 3 T Clear text
+11 2 78786 3 X Clear text
+12 2 78851 12 T Draw a ruler.
+12 2 78851 12 X Draw a ruler.
+14 2 79585 38 T Reposition plots
+14 2 79585 38 X Reposition plots
+15 2 81933 28 T Label a diagram
+15 2 81933 28 X Label a diagram
+16 2 83039 27 T Display a map.
+16 2 83039 27 X Display a map.
+7 1 84014 12 T Redirect output
+7 1 84014 12 X Redirect output
+13 2 84485 41 T Use crosshair
+13 2 84485 41 X Use crosshair
+33 2 86611 11 T Plot single contig
+33 2 86611 11 X Plot single contig
+34 2 87312 9 T Plot all contigs
+34 2 87312 9 X Plot all contigs
+31 3 87884 9 T Type in gel readings
+31 3 87884 9 X Type in gel readings
diff --git a/help/sip_help b/help/sip_help
new file mode 100644
index 0000000..7ef56f9
--- /dev/null
+++ b/help/sip_help
@@ -0,0 +1,1254 @@
+
+ @-1. TX  0 @General
+
+ @-2. T   0 @Screen control
+
+ @-2. X   0 @Screen
+
+ @-3. TX  0 @Set parameters
+
+ @-4. TX  0 @Comparison
+
+ @0.  TX -1 @SIP
+
+        This is program for comparing and  aligning  nucleic  acid  or
+ protein  sequences. It can produce optimal alignments using a dynamic
+ programming algorithm, and has several ways of producing "dot matrix"
+ diagrams.
+
+        The following analyses (preceded by their option numbers)  are
+ included:
+
+
+        The program is  used  on  a  simple  graphics  terminal  ie  a
+ keyboard   with  a screen on which points and lines can be drawn. The
+ user  works  at  the  terminal  and  produces  plots   for    various
+ combinations   of  values  for  the  span  length and minimum scores.
+ However large or small a region  the  user elects  to   compare   the
+ program  expands  or  contracts  the  diagram so that the plot always
+ fills  the  screen.   This  allows  the  user  to  gain  an   overall
+ impression   or  to  "home-in" on particular regions and examine them
+ in more detail.   Having  found  a  region  that   looks  interesting
+ the   user   can   determine   its   coordinates in terms of sequence
+ positions by use of a crosshair facility.
+
+        The program has two  statistical  options  to  help  the  user
+ choose   score  levels for plotting and to assess the significance of
+ any similarity found.  It can  produce  a  cumulative  histogram   of
+ observed   scores  for  the current span length and region and it can
+ calculate the "double matching probability" of McLachlan. The "double
+ matching   probability"  is  the  probability  of  finding particular
+ scores  given  two  infinitely  long  sequences  of  the  composition
+ of   those   being  compared,  with the current span length and score
+ matrix.  By using these options the  user  can  choose  to plot   all
+ the   matches  for  which  the  score  exceeds  a  given significance
+ level  (such  as  1%),  using  either    empirical    or  theoretical
+ probability  values.  Generally it is best to begin at a low level to
+ avoid an overcrowded diagram.
+
+        If  the  user  finds  that  the  two  sequences  do    contain
+ stretches  of  homology  he will often want to align the sequences by
+ inserting padding characters at deletion points.  The program  has  a
+ selection  of   options   for this purpose:  it contains an alignment
+ routine; it can display on the screen the two  sequences,  one  above
+ the  other,  with  asterisks   marking  identities,  it   has inbuilt
+ editing functions and can save the aligned sequences on disk files.
+
+        The basic principle of dot matrices was  first  described   by
+ Gibbs  and  McIntyre and involves producing a diagram that contains a
+ representation of all the matches between a pair of sequences.   This
+ diagram   is   then   scanned   by   eye  and  the  human  ability to
+ recognise patterns used to detect any  similarities   that  might  be
+ present.  The diagram consists of a two dimensional plot in which the
+ x axis represents one sequence (A)  and   the   y   axis   the  other
+ (B).    Every  point (i,j) on the plane x,y is assigned a score which
+ corresponds   to   the   level   of   similarity   between   sequence
+ characters  A(i) and B(j).  In the simplest use of the method a score
+ of 1 could be assigned to every point (i,j) where A(i) = B(j), and  a
+ score   of  0  to  every other point.  If a plot of the points in the
+ plane was made in which all scores of 1 were marked with a   dot  and
+ all   those  of 0 left blank then regions of identity would appear as
+ diagonal lines.  With the comparison  displayed  in  this  form   the
+ human  eye is very good at detecting regions of homology even if they
+ are imperfect.  The effects of mismatches, insertions   or  deletions
+ can   be  seen:   matches interrupted by insertions or deletions will
+ appear as parallel diagonals, and matches  interrupted  by  the   odd
+ mismatching   pair  of  characters  will  appear  as broken collinear
+ diagonal lines. This diagram is  a  very  useful  representation  but
+ simply   placing a dot for every identity is of limited value for the
+ following reasons.
+
+        For nucleic acid sequences around 25% of the plot will contain
+ points    and   it  will  often  be  very  difficult  to  distinguish
+ significant homologies  from  chance  matches.   For   proteins  many
+ significant  alignments of sequences contain almost no identities but
+ are formed from chemically and structurally similar amino   acids  so
+ that   simply  looking  for  identity would be insufficient.  What is
+ required is to first find those points  that  correspond  to   fairly
+ strong   local  similarities  and  then  to  use the diagram of these
+ points so that the human eye can be used to look  for  larger   scale
+ homologies.    The  program  uses a number of different algorithms to
+ calculate the score for each point and the  user  defines  a  minimum
+ score   so  that  only   those  points  in  the diagram for which the
+ score is at least this value will be marked with a dot.
+
+        The first scoring  method  finds  the   longest  uninterrupted
+ sections of perfect identity i.e.  those that contain no  mismatches,
+ insertions   or   deletions.  Generally  this  method,  termed   "the
+ identities algorithm" is of little value, but runs very quickly.
+
+        The  second    method   looks    for    sections    where    a
+ proportion   of   the  characters  in the sequence are similar, again
+ allowing no insertions or deletions. For  a  thorough  analysis  this
+ method, termed "the proportional algorithm", is the best.
+
+        The original method, of this type  was   first  described   by
+ McLachlan   and involves calculating a score for each position in the
+ matrix  by  summing   points   found   when  looking  forwards    and
+ backwards   along   a  diagonal line of a given length.  This length,
+ called the span, must be an  odd  number  so  that  the  dot  marking
+ matches  can  be  precisely placed at its centre.  The algorithm does
+ not simply look for  identity   but   uses   a  score   matrix   that
+ contains   scores   for   every   possible   pair of characters.  For
+ comparing amino acid sequences  we  usually use   the   score  matrix
+ shown   below  which  was calculated by adding 10 (to make every term
+ >0) to each term of the relatedness odds  matrix  MDM78  of  Dayhoff.
+ This  matrix  MDM78  was  calculated  by  looking  at  accepted point
+ mutations in 71 families of closely related proteins  and,  of  those
+ tested   by  Dayhoff,  was found to be the most powerful score matrix
+ for   finding   distant   relationships    between     amino     acid
+ sequences.
+
+                            AMINO ACID SCORE MATRIX
+                            -----------------------
+
+    C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ?
+ C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10
+ S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10
+ T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10
+ P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10
+ A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10
+ G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10
+ N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10
+ D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10
+ E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10
+ Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10
+ B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10
+ Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10
+ H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10
+ R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10
+ K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10
+ M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10
+ I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10
+ L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10
+ V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10
+ F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10
+ Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10
+ W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10
+ - 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+ X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+ ? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+   10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+
+        It is also  possible  to  use  other  matrices,  including  an
+ identity  matrix  for  proteins. For nucleic acids we usually use the
+ matrix shown below.
+
+          DNA SCORE MATRIX
+
+              A C G T X
+            A 1 0 0 0 0
+            C 0 1 0 0 0
+            G 0 0 1 0 0
+            T 0 0 0 1 0
+            X 0 0 0 0 0
+
+        Plotting dots at the centres of spans that  reach  the  cutoff
+  leads to a persistence effect that, to some extent, can be mitigated
+  by a  variation on the method. If, for example, all the high scoring
+  amino  acids  are clustered at the left end of a particular diagonal
+  segment, dots will continue to be plotted to their right  until  the
+  span  score  drops  below  the  cutoff. Instead of plotting a single
+  point for each span that  reaches  the  cutoff  score,  the  variant
+  method   plots  points for all the identities that lie in spans that
+  reach the cutoff. Obviously  the  persistence  effect  can  be  more
+  pronounced  for  long spans and low cutoff scores, but note that the
+  variant method will not plot anything if  there  are  no  identities
+  present, and so similar regions could be missed!
+
+        A further variant, useful for  comparing  a  sequence  against
+  itself, ignores the main diagonal.
+
+        The third comparison method called "quick scan"  is  really  a
+  combination of the first two, and is similar to the FASTP program of
+  Lipman and Pearson, but produces a dot matrix diagram. The algorithm
+  is  as  follows. The dot matrix positions are found for all words of
+  some minimum length (obviously length 1 is most sensitive) that  are
+  common  to  both  sequences.  Imagine  a  diagonal line running from
+  corner to corner of the diagram, at right angles to the diagonals in
+  the  dotmatrix,   The  scores for the common words (according to the
+  current  score  matrix,  e.g.  MDM78)  are   accummulated   at   the
+  appropriate  positions  on  that  imaginary line, hence  producing a
+  histogram. The histogram is analysed to find its mean  and  standard
+  deviation.  The  diagonals that lie above some cutoff score (defined
+  in standard deviation units), are rescanned using  the  proportional
+  algorithm,  and  a diagram produced. The method is very fast, and is
+  also employed  by the library comparison program.
+
+        The dynamic programming alignment algorithm contained  in  the
+  program  is  based  on that of Miller and Myers (). It guarantees to
+  produce alignments with the optimum score given a  score  matrix,  a
+  gap  start penalty, and a gap extension penalty. That is, starting a
+  gap costs a fixed penalty (IG) and each residue  added  to  the  gap
+  incurs  a  further  penalty  (IH)  so  that for each gap of length K
+  residues the penalty is IG + k*IH. Gaps at  the  ends  of  sequences
+  incur no penalty.
+
+        It is very useful to have  the  dot  matrix  methods  and  the
+  alignment  routine  together  in  the same program because it allows
+  users to produce a dot matrix diagram to help select  which  regions
+  of  the sequence they wish to align. Selection is made by use of the
+  crosshair. First the crosshair is positioned at the bottom left hand
+  end of the segment to be aligned. The crosshair function is quit and
+  immediately selected again, the  crosshair  positioned  at  the  top
+  right  of  the  segment,  and  the crosshair function quit. When the
+  alignment routine is selected the segment will be aligned.
+
+        The  alignment  can  replace  the  original  segment  of   the
+  sequence.   By  repeated  plotting  of  dot  matrices,  followed  by
+  alignment, very long sequences can easily be aligned.
+ @1. TX 0 @Help
+
+        This option gives online help. The user should  select  option
+  numbers and the current documentation will be given.
+
+        The following analyses (preceded by their option numbers)  are
+  included:
+   ? = Help
+   ! = Quit
+   3 = read a new sequence
+   4 = define active region
+   5 = list the sequence
+   6 = list a text file
+   7 = direct output to disk
+   8 = write active sequence to disk
+   9 = edit the sequences
+  10 = clear graphics screen
+  11 = clear text screen
+  12 = draw a ruler
+  13 = use cross hair
+  14 = reposition plots
+  15 = label diagram
+  16 = display a map
+  17 = apply identities algorithm
+  18 = apply proportional algorithm
+  19 = list matching spans
+  20 = set span length
+  21 = set proportional score
+  22 = set identities score
+  23 = calculate expected scores
+  24 = calculate observed scores
+  25 = show current parameter settings
+  26 = quick scan
+  27 = draw a /
+  28 = align the sequences
+  29 = complement the sequences
+  30 = switch main diagonal
+  31 = switch identities
+  32 = change score matrix
+ @2. TX 0 @Quit
+
+        This function stops the program.
+ @3. TX 1 @Read a new sequence
+
+        This option allows users to  read  in  new  sequences,  browse
+  through  annotations,  or  search  sequence  libraries for keywords.
+  Sequences can  be  read  from  "personal"  sequence  files  or  from
+  sequence  libraries. These are referred to as the sequence "source".
+  Personal files can be stored in several formats:  Staden, PIR, EMBL,
+  GENBANK  and  GCG.  At LMB we use "Staden" format for sequencing and
+  all the libraries  are  stored  in  their  original  formats.  Note,
+  however,  that  libraries  such  as EMBL or GenBank that are divided
+  into several files (eg GenBank has 13 separate files) are indexed as
+  a  whole.  This  means  that  users  do  not need to know which file
+  contains an entry, only which library.  When  the  user  selects  to
+  read in a sequence the program first asks for the sequence "source".
+
+        If the user selects "personal" the program will  ask  for  the
+  format (Staden, PIR, EMBL, GENBANK or GCG), and then for the name of
+  the file. For PIR format the user will also be required to know  the
+  entry  name of the sequence as the file can contain several. For the
+  other formats only a single entry is  expected.  The  file  will  be
+  read,  its  length  and composition will be displayed and the option
+  left.
+
+        If the user selects  "library"  as  the  sequence  source  the
+  program will display a list of available libraries. The programs are
+  capable of  handling  all  current  libraries  but  which  ones  are
+  available  will  vary  from  site  to  site.  At LMB we have several
+  libraries and also weekly updates of data gathered between releases.
+  The  program will ask users to select a library and then give a list
+  of options:
+
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entrynames from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+
+  If get a sequence or get annotations is selected users will be asked
+  to  type  the entry name. The option will be left when a sequence is
+  selected  or  !  is  typed.  The  composition  and  length  will  be
+  displayed.
+
+        The  text  index  contains  all  words  from  feature  tables,
+  reference  titles, definition lines, keywords lists and comments, so
+  the text index search is most useful. It is also the fastest. Up  to
+  5  words  can  be  searched  for  at once. The words should be typed
+  separated by spaces, for example
+   ? Keywords=P53 mouse murine tumo
+
+  will search for all entries that contain words  starting  with  p53,
+  mouse,  murine  and  tumo.  Only the unique entries that contain ALL
+  words will be  listed.  Before  listing  the  matching  entries  the
+  program  will  show  the number of 'hits' for each word and ring the
+  bell.  Escape is possible at this point, or after each screenfull of
+  entries.   In  addition  to the entry names the text search displays
+  the primary accession number, the  sequence  length  and  up  to  80
+  characters of description.  (The search of 'titles' is now redundant
+  because the full text index contains all the  title  words  and  the
+  search  is  much  faster.  It  will  probably  be  removed  from the
+  program.)  All searches are independent of case. Where possible  the
+  program will offer default entry names.
+
+        Typical dialogue follows.
+  Select sequence source
+  X  1 Personal file
+     2 Sequence library
+  ? Selection  (1-2) (1) =
+  Select sequence file format
+  X  1 Staden
+     2 EMBL
+     3 GenBank
+     4 PIR
+     5 GCG
+  ? Selection  (1-5) (1) =
+  ? Sequence file name=M13MP7.SEQ
+   Contig title removed
+  Sequence length=  7238
+   Sequence composition
+            T          C          A          G          -
+        2405.      1539.      1765.      1527.         2.
+          33.2%      21.3%      24.4%      21.1%       0.0%
+    .
+    .
+    .
+
+
+   Select sequence source
+   X  1 Personal file
+      2 Sequence library
+   ? Selection  (1-2) (1) =2
+   Select a library
+   X  1 EMBL 29 nucleotide library Dec 91
+      2 SWISSPROT 20 protein library Nov 91
+      3 PIR 31 protein library Dec 91
+      4 NRL3D 58 From Brookhaven protein library Dec 91
+      5 GenBank
+   ? Selection  (1-5) (1) =
+  Library is in EMBL format with indexes
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =5
+   Search for keywords
+   ? Keywords=P53 mouse
+  P53 hits  68
+  MOUSE hits  8180
+
+   MMANT01    X00875         536 Murine gene fragment for cellular tumour antigen
+   MMANT02    X00876          83 Murine gene fragment for cellular tumour antigen
+   MMANT03    X00877          21 Murine gene fragment for cellular tumour antigen
+   MMANT04    X00878         261 Murine gene fragment for cellular tumour antigen
+   MMANT05    X00879         184 Murine gene fragment for cellular tumour antigen
+   MMANT06    X00880         113 Murine gene fragment for cellular tumour antigen
+   MMANT07    X00881         110 Murine gene fragment for cellular tumour antigen
+   MMANT08    X00882         137 Murine gene fragment for cellular tumour antigen
+   MMANT09    X00883          74 Murine gene fragment for cellular tumour antigen
+   MMANT10    X00884         107 Murine gene for cellular tumour antigen p53 (exon
+   MMANT11    X00885         562 Murine p53 gene 3' region with exon 11
+   MMANTP53   M26862         536 Mouse tumor antigen p53 gene, 5' end.
+   MMLYN      M64608        2044 Mouse lyn protein mRNA, complete cds.
+   MMP53      X00741        1377 Mouse mRNA for transformation associated protein
+   MMP53A     M13872        1285 Mouse p53 mRNA, complete cds, clone pcD53.
+   MMP53B     M13873        1241 Mouse p53 mRNA, complete cds, clone p53-m11.
+   MMP53C     M13874        1322 Mouse p53 mRNA, complete cds, clone p53-m8.
+   MMP53G1    X01235         554 Mouse genomic DNA for 5' region of cellular tumou
+   MMP53IN4   X60470         729 M.musculus p53 gene for p53 protein, intron 4
+   MMP53P     X01236        2132 Mouse pseudogene for cellular tumour antigen p53
+   MMP53R     X01237        1773 Mouse mRNA for cellular tumour antigen p53
+   MMRSB2P5   M64597         196 Mouse B2 repeat in the 3' flank of protein 53 (p5
+        22 different entries found
+
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =4
+   Search for keywords
+   ? Keywords=alpha
+   Searching for alpha
+   AAGHA          623 a.anguilla mrna for glycoprotein hormone alpha subunit precu
+   AAMALI        3338 a.aegypti mali gene encoding alpha 1-4 glucosidase, complete
+   AAMALIA       1659 a.aegypti maltase-like i (mali) gene encoding alpha-1,4-gluc
+   AAMALIB       1832 a.aegypti maltase-like i (mali) mrna encoding alpha-1,4-gluc
+   ACA13GT        371 alouatta caraya alpha-1,3gt gene, 3' flank.
+   ADHBADA1       102 duck alpha-d-globin gene, exon 1.
+   ADHBADA2      1145 duck alpha-a-globin gene and 5' flank
+   ADHBADWP       513 duck (white pekin) alpha ii (minor) globin mrna, complete co
+   AEACOXABC     5279 a.eutrophus protein x (acox), acetoin:dcpip oxidoreductase-a
+   AGA13GT        371 ateles geoffroyi alpha-1,3gt gene, 3' flank.
+   AGAAAGFP       282 c.tetragonoloba alpha-amylase/alpha-galactosidase fusion pro
+   AGAABL         138 b.subtilis alpha-amylase signal peptide gene e.coli beta-lac
+   AGAFAMYA        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYB        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFAMYC        57 synthetic b.stearothermophilus alpha amylase/s.cerevisiae ma
+   AGAFCOXA        98 synthetic alpha-factor/cox iv fusion gene signal peptide.
+   AGAGABA       7876 synthetic gossypium hirsutum (cotton) alpha globulin a and b
+   AGAMYLS        120 synthetic alpha-amylase gene, 5' end.
+   AGANPS          95 synthetic gene (jcnf-1) encoding alpha-factor pro-region/han
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =3
+   ? Accession number=v00636
+  Entry name LAMBDA
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =2
+   Default Entry name=LAMBDA
+   ? Entry name=
+  ID   LAMBDA     standard; DNA; PHG; 48502 BP.
+  XX
+  AC   V00636; J02459; M17233; X00906;
+  XX
+  DT   03-JUL-1991 (Rel. 28, Last updated, Version 3)
+  DT   09-JUN-1982 (Rel. 1, Created)
+  XX
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+  XX
+  KW   circular; coat protein; DNA binding protein; genome;
+  KW   origin of replication.
+  XX
+  OS   Bacteriophage lambda
+  OC   Viridae; ds-DNA nonenveloped viruses; Siphoviridae.
+  XX
+  RN   [1]
+  RP   1-48502
+  RA   Sanger F., Coulson A.R., Hong G.F., Hill D.F., Petersen G.B.;
+  RT   "Nucleotide sequence of bacteriophage lambda DNA";
+  RL   J. Mol. Biol. 162:729-773(1982).
+  XX
+  !
+   Select a task
+   X  1 Get a sequence
+      2 Get annotations
+      3 Get entry names from accession numbers
+      4 Search titles for keywords
+      5 Search text index for keywords
+   ? Selection  (1-5) (1) =
+   Default Entry name=LAMBDA
+   ? Entry name=
+  DE   Genome of the bacteriophage lambda (Styloviridae).
+   Sequence length  48502
+   Sequence composition
+             T          C          A          G          -
+        11988.     11360.     12336.     12818.         0.
+           24.7%      23.4%      25.4%      26.4%       0.0%
+
+ @4. TX 1 @Define active region
+
+        For its analytic functions  the  program  always  works  on  a
+  region of the sequence called the active region. When a new sequence
+  is read into the program the active region is automatically  set  to
+  start  at  the  beginning  of  the sequence and go up to the maximum
+  allowed size of active region the  program can handle. The positions
+  are  shown  on the screen.  On most machines this will be to the end
+  of the sequence.  This option allows the  user  define  a  different
+  region.
+ @5. TX 1 @List a sequence
+
+        The sequence can be listed with line lengths from 10 to 120 in
+  multiples of 10.  The output looks like:
+
+      87         97        107        117        127        137
+       KVKCTGRILE VPVGRGLLGR VVNTLGAPID GKGPLDHDGF SAVEAIAPGV IERQSVDQPV
+        **      * ****   ***   * ** * *  **         *    **    *
+       DVKDLEHPIE VPVGKATLGR IMNVLGEPVD MKGEIGEEER WAIHRAAPSY EELSNSQELL
+      68         78         88         98        108        118
+     147        157        167        177        187        197
+       QTGYKAVDSM IPIGRGQREL IIGDRQTGKT ALAIDAIINQ RDSGIKCIYV AIGQ
+        ** *  * *  *   *       *    ***       * *             *
+       ETGIKVIDLM CPFAKGGKVG LFGGAGVGKT VNMMELIRNI AIEHSGYSVF AGVG
+     128        138        148        158        168        178
+
+ @6. TX 1 @List a text file
+
+        Allows the user to have a text file displayed on  the  screen.
+  It will appear one page at a time.
+ @7. TX 1 @Direct output to disk
+
+        Used to direct output that would normally appear on the screen
+  to a file.
+
+        Select redirection of either text or graphics, and supply  the
+  name of the file that the output should be written to.
+
+        The results from the next options selected will not appear  on
+  the  screen  but  will  be  written  to  the  file. When option 7 is
+  selected again the file will be closed and output will again  appear
+  on the screen.
+ @8. TX 1 @Write active region to disk
+
+        This option allows users to write the current active  sequence
+  to a disk file in Staden format.
+ @9. TX 1 @Edit the sequences
+
+        This function allows the user to insert  or  delete  parts  of
+  either  sequence  to  help  align  them. The inserted characters are
+  dashes.
+ @10. TX 2 @Clear graphics
+
+        Clears the screen of both text and graphics.
+ @11. TX 2 @Clear text
+
+        Clears only text from the screen.
+ @12. TX 2 @Draw a ruler
+
+        This option allows the user to draw a ruler or scale along the
+  axes  of  the  screen  to help identify the coordinates of points of
+  interest. The user can define the position  of  the  first  sequence
+  element  to  be  marked (for example if the active region is 1501 to
+  8000, the user might wish to mark every 1000th element  starting  at
+  either  1501  or  2000  - it depends if the user wishes to treat the
+  active region as an independent unit with its own numbering starting
+  at  its  left  edge, or as part of the whole sequence). The user can
+  also define the separation of the  ticks  on  the  scale  and  their
+  height. If required the labelling routine can be used to add numbers
+  to the ticks.
+
+        To escape type !
+ @13. TX 2 @Use cross hair
+
+        This function puts a steerable cross on the screen that can be
+  used to find the coordinates of points in the sequence. The user can
+  move the cross around using the directional keys; when he  hits  the
+  space bar the program will write out the coordinates of the cross in
+  sequence units and the option will be exited.
+
+        If instead, the user hits a , the position will  be  displayed
+  but the cross will remain on the screen.
+
+        If a letter s is hit the sequences around the cross  hair  are
+  displayed  as  a  short  alignment  (as  shown  below) and the cross
+  remains on the screen.
+          97        107
+           VPVGRGLLGR VVNTLGAPID
+           ****   ***   * ** * *
+           VPVGKATLGR IMNVLGEPVD
+          78         88
+
+
+        If a letter m is hit the sequences around the cross  hair  are
+  displayed  in  the  form  of a matrix (as shown below) and the cross
+  remains on the screen.
+
+     VPVGKATLGRIMNVLGEPVD
+    D...................DD
+    I..........I.........I
+    P.P...............P..P
+    A.....A..............A
+    G...G....G......G....G
+    L.......L......L.....L
+    T......T.............T
+    N............N.......N
+    VV.V..........V....V.V
+    VV.V..........V....V.V
+    R.........R..........R
+    G...G....G......G....G
+    L.......L......L.....L
+    L.......L......L.....L
+    G...G....G......G....G
+    R.........R..........R
+    G...G....G......G....G
+    VV.V..........V....V.V
+    P.P...............P..P
+    VV.V..........V....V.V
+     VPVGKATLGRIMNVLGEPVD
+
+
+        The function is also used prior to "align sequences" in  order
+  to  delineate  the region to be aligned. The crosshair is positioned
+  at the bottom left of the region, the crosshair  option  quit.  Then
+  the  crosshair  option is selected again, and the crosshair moved to
+  the top right of the region to be aligned.
+ @14. TX 2 @Reposition plots
+
+        The position of the plots  is  defined  relative  to  a  users
+  drawing board which has size 1-10,000 in x and 1-10,000 in y.  Plots
+  are drawn in a window defined by x0,y0  and  xlength,ylength.  Where
+  x0,y0  is the position of the bottom left hand corner of the window,
+  and xlength is the width of the window and ylength the height of the
+  window.
+     --------------------------------------------------------- 10,000
+     1                                                       1
+     1       --------------------------------------   ^      1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       1                                    1 ylength  1
+     1       1                                    1   1      1
+     1       1                                    1   1      1
+     1       --------------------------------------   v      1
+     1  x0,y0^                                               1
+     1       <---------------xlength-------------->          1
+     ---------------------------------------------------------      1
+     1                                                   10,000
+
+  All values are in drawing board  units  (i.e.  1-10,000,  1-10,000).
+  The  default  window  positions are read from a file "DIAGMARG" when
+  the program is started. Users can have their own file  if  required.
+  This  option  allows users to change window positions whilst running
+  the program.  If the user types only carriage return for  any  value
+  it  will  remain  unchanged.  The  cross-hair  can be used to choose
+  suitable heights.
+ @15. TX 2 @Label a diagram
+
+        This routine allows users to  label  any  diagrams  they  have
+  produced.  They  are  asked  to type in a label. When the user types
+  carriage return to finish typing the label the cross-hair appears on
+  the  screen. The user can position it anywhere on the screen. If the
+  user types R (for right justify) the label will be  written  on  the
+  diagram  with  its right end at the cross-hair position. If the user
+  types L (for left justify) the label will be written with  its  left
+  end   at   the  cross  hair  position.   The  cross-hair  will  then
+  immediately reappear. The user may put the  same  label  on  another
+  part of the diagram as before or if he hits the space bar he will be
+  asked if he wishes to type in another label.
+ @16. TX 2 @Display a map
+
+        NOT AVAILABLE.  This draws a  map  of  any  sequence  features
+  selected  by the user.  These features may be protein coding regions
+  (CDS), tRNA genes (TRNA), promoter positions (PRM), etc.  Users  may
+  define  their  own  feature table key names. The coordinates must be
+  stored in a file in the format of an EMBL feature table.
+ @17. TX 4 @Apply identities algorithm
+
+        The identities algorithm finds runs of identical characters in
+  the  sequence.  Its main value is speed, being 100's of times faster
+  than the proportional algorithm. It is of course not very sensitive,
+  and  should  only  be used for a quick scan. The cutoff score is the
+  minimum number of consecutive  matching  characters.   All  runs  of
+  identical  characters  that are at least as long as the cutoff score
+  will produce a dot on the screen.
+
+        See also quick scan.
+
+        Typical dialogue follows.
+  ? Menu or option number=d17
+  ? Identity score (1-20) (2) =3
+  Working
+
+   missing graphics
+
+ @18. TX 4 @Apply proportional algorithm
+
+        This  method,  generally   the   most   useful,   was    first
+ described   by   McLachlan  and involves calculating a score for each
+ position in the  matrix  by  summing   points   found   when  looking
+ forwards   and  backwards  along  a  diagonal line of a given length.
+ This length, called the span, must be an odd number.   The  algorithm
+ does  not  simply look for identity  but  uses  a score  matrix  that
+ contains  scores  for  every  possible  pair of characters.  At  each
+ point that a threshold score is achieved the program marks the screen
+ in one of two ways. It will either place a single dot at the position
+ corresponding  to  the centre of the matching span, or it will plot a
+ dot  for  each  identical  residue   within   each   matching   span.
+ Alternatively,  the  "list  matching  spans"  option  will  list  the
+ segments that match.
+
+        For comparing amino  acid  sequences   we   usually  use   the
+ score matrix  shown  below which was calculated by adding 10 (to make
+ every term >0) to each term of the relatedness odds matrix  MDM78  of
+ Dayhoff.   This  matrix  MDM78  was calculated by looking at accepted
+ point mutations in 71 families of closely related proteins  and,   of
+ those   tested  by  Dayhoff,  was found to be the most powerful score
+ matrix  for  finding  distant  relationships  between   amino    acid
+ sequences.
+
+                            AMINO ACID SCORE MATRIX
+                            -----------------------
+
+    C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ?
+ C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10
+ S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10
+ T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10
+ P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10
+ A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10
+ G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10
+ N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10
+ D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10
+ E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10
+ Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10
+ B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10
+ Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10
+ H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10
+ R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10
+ K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10
+ M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10
+ I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10
+ L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10
+ V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10
+ F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10
+ Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10
+ W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10
+ - 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+ X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+ ? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+   10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+ One alternative for proteins is  to  use  an  identity  matrix.   For
+ comparing nucleic acids we usually use the matrix shown below.
+
+          DNA SCORE MATRIX
+
+              A C G T X
+            A 1 0 0 0 0
+            C 0 1 0 0 0
+            G 0 0 1 0 0
+            T 0 0 0 1 0
+            X 0 0 0 0 0
+ See option 32 for how to change the score matrices.
+
+        When a sequence is compared against itselt to look for repeats
+ it  is possible to use the proportional algorithm in a mode such that
+ the main diagonal is not shown. See option 30.
+
+        Typical dialogue follows.
+
+ ? Menu or option number=d18
+ ? Odd span length (1-401) (11) =
+ ? Proportional score (1-297) (132) =
+ Working
+
+  missing graphics
+
+ @19. TX 4 @List matching spans
+  This option applies the proportional  algorithm  using  the  current
+  span and cut-off score, but instead of drawing a dot matrix it lists
+  all the matching spans. When a sequence is compared  against  itselt
+  to  look  for repeats it is possible to use this algorithm in a mode
+  such that the main diagonal is not listed. See option 30.
+
+        Typical dialogue follows.
+  ? Menu or option number=d19
+  ? Odd span length (1-401) (11) =
+  ? Proportional score (1-297) (132) =148
+  List matching spans
+  Working
+       76
+  IEVPVGKATLG
+  LEVPVGRGLLG
+       95
+       77
+  EVPVGKATLGR
+  EVPVGRGLLGR
+       96
+       78
+  VPVGKATLGRI
+  VPVGRGLLGRV
+       97
+       79
+  PVGKATLGRIM
+  PVGRGLLGRVV
+       98
+       85
+  LGRIMNVLGEP
+  LGRVVNTLGAP
+      104
+       86
+  GRIMNVLGEPV
+  GRVVNTLGAPI
+      105
+       87
+  RIMNVLGEPVD
+  RVVNTLGAPID
+      106
+
+ @20. TX 3 @Set span length
+
+        The  proportional  algorithm  calculates  a  score  for   each
+ position  in  the matrix by summing  the points  found  when  looking
+ forwards  and  backwards  along  a  diagonal line of a given  length.
+ This  length, called the span, should be an odd number  so  that  the
+ score  for  any  point  is correctly positioned at the centre of  the
+ span.   This  option  allows  the  user to define the span length. It
+ should be noted that short spans can produce noisy diagrams, but  are
+ less  affected  by  insertions  and  deletions  than  are long spans.
+ However long spans can detect more  distant relationships. Long spans
+ can  suffer  from a persistence problem by plotting dots when all the
+ "signal" is to one side of the spans central position. To help  avoid
+ this,  the  option  that  plots the position of all matching residues
+ within a matching span, can be tried.  This  is  most  useful  if  an
+ identity matrix is being used.
+ @21. TX 3 @Set proportional score
+
+        The  proportional  algorithm  calculates  a  score  for   each
+  position  in  the  matrix  by summing  the scores for the individual
+  amino acids found  when  looking forwards  and  backwards  along   a
+  diagonal   line  of  a  given  length.   All  points  at  which  the
+  proportional score is achieved will produce a dot  on  the  diagram.
+  (The same score is used for the 'LIST MATCHING SPANS' option.)
+
+        Before chosing a score the user can  apply  the  routine  that
+  will  calculate  the expected score, or can calculate a histogram of
+  observed scores. It is best to start with a high score to  avoid  an
+  overcrowded diagram.
+ @22. TX 3 @Set identities score
+
+        The identities algorithm is of limited value as it only  finds
+  runs of matching characters, however it has the virtue of being very
+  fast.  This option allows the user to set the minimum length of  run
+  that will produce a dot on the screen.
+ @23. TX 3 @Calculate expected scores
+
+        This function calculates the "double matching probability"  of
+  McLachlan.    The   "double     matching     probability"   is   the
+  probability  of  finding particular  scores  given  two   infinitely
+  long   sequences   of   the  composition  of  those  being compared,
+  with the current span length and score matrix.  By using this option
+  the   user   can   choose   to plot   all   the  matches  for  which
+  the  score  exceeds  a  given significance  level  (such   as   1%).
+  Generally it is best to begin at a low level to avoid an overcrowded
+  diagram.
+
+        When the calculation of the expected scores is  finished   the
+  program offers the user 3 ways of examining the results:
+  "Show probability for a score" allows the  user   to   type   in   a
+  score   and   the program responds with the probability of achieving
+  that level of score.
+  "Show score for  a  probability"  allows  the  user  to  type  in  a
+  probability  value  and the program types the score that corresponds
+  to that level of probability.
+  "List scores and probabilities" is  the  command  to  list  out  the
+  scores  and  their corresponding  probabilities.   The user is asked
+  to supply a further parameter, the "number of steps between scores",
+  and  the program only lists every stepsize point.  e.g a stepsize of
+  5 will get every 5th score listed.
+
+        Typical dialogue follows.
+  ? Menu or option number=d23
+  ? Odd span length (1-401) (11) =
+  ? Proportional score (1-297) (132) =
+
+  Working
+  Average score=   103.18557
+  RMS deviation=     7.85276
+  X 1 Show probability for a score
+    2 Show score for a probability
+    3 List scores and probabilities
+  ? 0,1,2,3 =
+
+  ? Show probability for score (1-165) (134) =160
+  Probability of score    160 is 0.0000000008
+  X 1 Show probability for a score
+    2 Show score for a probability
+    3 List scores and probabilities
+  ? 0,1,2,3 =2
+  ? Show score for probability (0.0000000001-1.) (0.00001) =0.0000001
+  Score for probability 0.0000001000 is   153
+    1 Show probability for a score
+  X 2 Show score for a probability
+    3 List scores and probabilities
+  ? 0,1,2,3 =3
+  ? Number of steps between scores (1-10) (5) =
+
+       0  0.10000E+01    100  0.67232E+00    200  0.18977E-20
+       5  0.10000E+01    105  0.42119E+00    205  0.42561E-22
+      10  0.10000E+01    110  0.20671E+00    210  0.87767E-24
+      15  0.10000E+01    115  0.78860E-01    215  0.16651E-25
+      20  0.10000E+01    120  0.23515E-01    220  0.27300E-27
+      25  0.10000E+01    125  0.55406E-02    225  0.00000E+00
+      30  0.10000E+01    130  0.10443E-02    230  0.00000E+00
+      35  0.10000E+01    135  0.15935E-03    235  0.00000E+00
+      40  0.10000E+01    140  0.19906E-04    240  0.00000E+00
+      45  0.10000E+01    145  0.20569E-05    245  0.00000E+00
+      50  0.10000E+01    150  0.17758E-06    250  0.00000E+00
+      55  0.10000E+01    155  0.12938E-07    255  0.00000E+00
+      60  0.10000E+01    160  0.80360E-09    260  0.00000E+00
+      65  0.10000E+01    165  0.43009E-10    265  0.00000E+00
+      70  0.10000E+01    170  0.20049E-11    270  0.00000E+00
+      75  0.99997E+00    175  0.82263E-13    275  0.00000E+00
+      80  0.99949E+00    180  0.29998E-14    280  0.00000E+00
+      85  0.99448E+00    185  0.98050E-16    285  0.00000E+00
+      90  0.96543E+00    190  0.28934E-17    290  0.00000E+00
+      95  0.86836E+00    195  0.77556E-19    295  0.00000E+00
+    1 Show probability for a score
+    2 Show score for a probability
+  X 3 List scores and probabilities
+  ? 0,1,2,3 =!
+
+
+ @24. TX 3 @Calculate observed scores
+
+        This  option  applies  the  proportional  algorithm   to   the
+  currently  active  sequence but instead of producing a dot matrix it
+  calculates a histogram  of  observed  scores.   The  speed  of  this
+  calculation of course depends on the size of the active regions, but
+  when  it is  completed  the  program  offers  the  user  3  ways  of
+  examining the results:
+
+        "Show percentage for score" allows the user to type in a score
+  and  the  program  responds   with  the  percentage  of  points that
+  achieve this value.
+
+        "Show percentage for score" allows  the  user  to  type  in  a
+  percentage  and  the  program   responds   with   the  corresponding
+  score.  Values of this  score  and  above  are  only   achieved   by
+  the   given percentage of points.
+
+        "List scores and percentages" is the  command  to   list   out
+  the   scores  and  the percentage of points achieving them.  Typical
+  dialogue follows.
+  ? Menu or option number=24
+  Working
+  Maximum observed score is    152
+  X 1 Show percentage reaching a score
+    2 Show score for a percentage
+    3 List scores and percentages
+  ? 0,1,2,3 =
+
+  ? Show percentage for score (1-152) (114) =144
+  Percentage of points with score    144 is   0.005486297
+  X 1 Show percentage reaching a score
+    2 Show score for a percentage
+    3 List scores and percentages
+  ? 0,1,2,3 =2
+
+  ? Show score for percentage (0.00001-1.) (0.001) =0.01
+  Score for percentage   0.010000000 is   143
+    1 Show percentage reaching a score
+  X 2 Show score for a percentage
+    3 List scores and percentages
+  ? 0,1,2,3 =
+
+  ? Show score for percentage (0.00001-1.) (0.001) =1.
+  Score for percentage   1.000000000 is   124
+    1 Show percentage reaching a score
+  X 2 Show score for a percentage
+    3 List scores and percentages
+  ? 0,1,2,3 =3
+  ? Number of steps between scores (1-10) (5) =1
+
+     73   236953  0.10000E+03
+     74   236951  0.99999E+02
+     75   236951  0.99999E+02
+     76   236950  0.99998E+02
+     77   236945  0.99996E+02
+     78   236942  0.99995E+02
+     79   236929  0.99989E+02
+     80   236900  0.99977E+02
+
+    missing data here
+
+    130      384  0.16206E+00
+    131      307  0.12956E+00
+    132      239  0.10086E+00
+    133      180  0.75964E-01
+    134      134  0.56551E-01
+    135      103  0.43468E-01
+    136       78  0.32918E-01
+    137       67  0.28276E-01
+    138       46  0.19413E-01
+    139       40  0.16881E-01
+    140       33  0.13927E-01
+    141       29  0.12239E-01
+    142       24  0.10129E-01
+    143       19  0.80184E-02
+    144       13  0.54863E-02
+    145       10  0.42202E-02
+    146        8  0.33762E-02
+    147        7  0.29542E-02
+    148        7  0.29542E-02
+    149        6  0.25321E-02
+    150        5  0.21101E-02
+    151        3  0.12661E-02
+    152        3  0.12661E-02
+    1 Show percentage reaching a score
+    2 Show score for a percentage
+  X 3 List scores and percentages
+  ? 0,1,2,3 =!
+
+ @25. TX 3 @Show current parameter settings
+
+        This function lists the names of the current sequences,  their
+  total  lengths,  the start and end points of the active sequence and
+  the current values of span and cut-off scores. It also shows if  the
+  main  diagonal  will be shown, or if the proportional algorithm will
+  mark all identities in matching spans.
+
+        Typical dialogue follows.
+  ? Menu or option number=25
+  Horizontal sequence
+  ALPHA.PRT
+  Positions
+       1 TO    514
+  Vertical sequence
+  BETA.PRT
+  Positions
+       1 TO    461
+  Span length=    11
+  Scores
+  Proportional=   132
+  Identities=     3
+  Identites off
+  Main diagonal shown
+
+
+ @27. TX 2 @Draw a /
+
+        This option simply draws a diagonal line from the bottom  left
+  of  the  diagram  to  the top right. it can be an aid when trying to
+  align the sequences.
+ @26. TX 4 @Quick scan
+
+        The algorithm is as follows.  The  dot  matrix  positions  are
+  found  for  all  words of some minimum length (obviously length 1 is
+  most sensitive)  that  are  common  to  both  sequences.  Imagine  a
+  diagonal line running from corner to corner of the diagram, at right
+  angles to the diagonals in the dotmatrix,  The scores for the common
+  words  (according  to  the  current  score  matrix,  e.g. MDM78) are
+  accummulated at the appropriate positions on  that  imaginary  line,
+  hence  producing  a histogram. The histogram is analysed to find its
+  mean and standard deviation.  The  diagonals  that  lie  above  some
+  cutoff  score  (defined  in standard deviation units), are rescanned
+  using the proportional algorithm, and a diagram produced. The method
+  is  very  fast,  and  is  also  employed   by the library comparison
+  program.
+
+        Typical dialogue follows.
+
+  ? Menu or option number=d26
+  ? Identity score (1-20) (3) =
+  ? Odd span length (1-401) (11) =
+  ? Proportional score (1-297) (132) =
+  ? Number of sd above mean (0.00-10.00) (5.00) =
+
+   missing graphics
+
+
+
+        SIPL the library searching version of SIP
+
+        This program compares a probe sequence against  a  library  of
+  sequences  using  the  quick  scan algorithm, sorts the matches into
+  descending order of score, and produces optimal  alignments  of  the
+  best scores using the Myers and Miller method. It is very rapid.
+
+        Use of lists of entry names
+
+        SIPL has the ability to restrict searches to  subsets  of  the
+  libraries.  This  does  not  require  sublibraries to be created but
+  instead is achieved by using files containing a list  of  the  entry
+  names of sequences. The user may choose to search only those entries
+  on the list or, alternatively to search all but those  on  the  list
+  (i.e.  in the latter case the list contains the names of those to be
+  excluded).  The programs can search libraries that have indexes  and
+  those  that  do not.  If a list of names for inclusion is used, then
+  the search will be faster if the index  is  present.  In  all  other
+  circumstances  the  whole  library will be read. The list must be in
+  library order except when it is used  to  include  entries,  and  an
+  index  is  available.   The  list  must contain each entry name on a
+  separate line, with the name starting in column 1 of  the  line.  ie
+  there must be no spaces at the start of the line.  The list of entry
+  names can be produced by the keyword searches of nip, pip, sip,  etc
+  as  long  as the listings produced have a space character separating
+  the entry name from the entry description. This will depend  on  how
+  well  the  library reformatting programs work. For example swissprot
+  entry names tend to run into the beginning of the descriptions,  but
+  other libraries are generally OK.
+ @28. TX 4 @Align sequences
+
+        This  function  will  produce  an  optimal  alignment  of  two
+  segments   of   the  sequence.  The  dynamic  programming  alignment
+  algorithm is based on that of Miller and Myers (). It guarantees  to
+  produce  alignments  with  the optimum score given a score matrix, a
+  gap start penalty, and a gap extension penalty. That is, starting  a
+  gap  costs  a  fixed  penalty  (F) and each residue added to the gap
+  incurs a further penalty (E) so  that  for  each  gap  of  length  K
+  residues the penalty is F + K*E. Gaps at the ends of sequences incur
+  no penalty.
+
+        The routine can only handle segments of  sequence  of  maximum
+  length  5000  residues. When the sequences are read in the alignment
+  segment will be set to the first 5000 residues. A different  segment
+  can  be  selected by prefixing the option number by the letter D, in
+  which case the cross hair can be used to identify the two ends.  The
+  cross  hair will appear.  First position the crosshair at the bottom
+  left of the segment and type a character other than s or m  or  ",".
+  When  the  crosshair  reappears,  position it a the top right of the
+  segment, and type a keyboard character.  The aligned sequences  will
+  replace  the  active sequence if the user confirms "keep alignment".
+  By alternate use of  the  plotting  and  alignment  routines  it  is
+  possible to rapidly produce an alignment of quite long sequences.
+
+        Typical dialogue follows.
+
+  28 = Align sequences
+  ? Menu or option number=d28
+  Define the region to align using the cross-hair.
+  First identify the bottom left position and exit
+  the cross-hair routine. Then the top right.
+
+  (Bell rings, type return, cross hair appears)
+
+  ? Penalty for starting a gap (1-100) (10) =
+  ? Penalty for each residue in gap (1-100) (10) =
+
+  Aligning region           1 to         461
+      with region           1 to         514
+           1         11         21         31         41         51
+           MA--TGKIVQ VIGA------ VVDVEFPQDA VPRVYDALEV QNG------N ERLVL-----
+           *      *    *         **            * *       *        *   *
+           MQLNSTEISE LIKQRIAQFN VVSEAHNEGT IVSVSDGVIR IHGLADCMQG EMISLPGNRY
+           1         11         21         31         41         51
+          61         71         81         91        101        111
+           EVQQQLGGGI VRTIAMGSSD GLRRGLDVKD LEHPIEVPVG KATLGRIMNV LGEPVDMKGE
+                *     *    **     *  *  **       *****    ***  *  ** * * **
+           AIALNLERDS VGAVVMGPYA DLAEGMKVKC TGRILEVPVG RGLLGRVVNT LGAPIDGKGP
+          61         71         81         91        101        111
+         121        131        141        151        161        171
+           IGEEERWAIH RAAPSYEELS NSQELLETGI KVIDLMCPFA KGGKVGLFGG AGVGKTVNMM
+                  *     **   *          **  *  * * *    *      *     ***
+           LDHDGFSAVE AIAPGVIERQ SVDQPVQTGY KAVDSMIPIG RGQRELIIGD RQTGKTALAI
+         121        131        141        151        161        171
+         181        191        201        211        221        231
+           ELIRNIAIEH SGYS-VFAGV GERTREGNDF YHEMTDSNVI DKVSLVYGQM NEPPGNRLRV
+             *  *     **         *                          *      *
+           DAI--INQRD SGIKCIYVAI GQKASTISNV VRKLEEHGAL ANTIVVVATA SESAALQYLA
+         181        191        201        211        221        231
+         241        251        261        271        281        291
+           ALTGLTMAEK FRDEGRDVLL FVDNIYRYTL AGTEVSALLG RMPSAVGYQP TLAEEMGVLQ
+                 * *  *** * * *    *        *    * **  * *                *
+           RMPVALMGEY FRDRGEDALI IYDDLSKQAV AYRQISLLLR RPPGREAFPG DVFYLHSRLL
+         241        251        261        271        281        291
+         301        311        321        331        341        351
+           ERITST---- ---------- -KTGSITSVQ AVYVPADDLT DPSPATTFAH LDATVVLSRQ
+           **                     **** *         * *      *        *    *
+           ERAARVNAEY VEAFTKGEVK GKTGSLTALP IIETQAGDVS AFVPTNVISI TDGQIFLETN
+         301        311        321        331        341        351
+         361        371        381        391        401        411
+           IASLGIYPAV DPLDSTSRQL DPLVVGQEHY DTAR----GV QSILQRYQEL KDIIAILGMD
+               ** ***  *  * **      * *             *     *  * **
+           LFNAGIRPAV NPGISVSR-- ---VGGAAQT KIMKKLSGGI RTALAQYREL AAFSQFAS--
+         361        371        381        391        401        411
+         421        431        441        451        461        471
+           ELSEEDKLVV ARARKIQRFL SQ----PFFV AE----VFTG SPGKYVSLKD --TIRGFKGI
+            *             *    *  *    *  * *      *     * *         *  *
+           DLDDATRKQL DHGQKVTELL KQKQYAPMSV AQQSLVLFAA ERG-YLADVE LSKIGSFEAA
+         421        431        441        451        461        471
+         481        491        501        511        521
+           MEG--EYDHL P-EQAFYMVG SIEEAVE--- --------KA KKL*
+                  **  *  *     *       *                  *
+           LLAYVDRDHA PLMQEINQTG GYNDEIEGKL KGILDSFKAT QSW*
+         481        491        501        511        521
+  Conservation  22.5%
+  Number of padding characters inserted    63 and    10
+  ? (y/n) (y) Keep alignment n
+
+
+ @29. TX 1 @Complement the sequences
+
+        This function allows users to reverse and  complement  nucleic
+  acid sequences.
+ @30. TX 3 @Switch main diagonal
+
+        If a sequence is being compared against  itself  to  look  for
+  repeats  it  is  sometimes  convenient  if  the main diagonal is not
+  included in the comparison. This function  allows  users  to  set  a
+  switch  that  determines whether or not to include the main diagonal
+  for all the comparison methods.  If  the  switch  is  set,  and  the
+  active regions for both sequences have the same start position, then
+  the main diagonal will not be compared.
+ @31. TX 3 @Switch identities
+
+        This function allows a switch to be set or unset.  The  switch
+  determines  which  of  two  forms  of  plot  will be produced by the
+  proportional algorithm. One form of  output  (the  original  method)
+  plots  a  dot  at the centre of each span that reaches the threshold
+  score; whereas the other  form  will  plot  dots  for  all  matching
+  residues that lie within spans that reach the threshold.
+ @32. TX 3 @change score matrix
+
+        This option allows users to select their own score matrix  for
+  use with the proportional algorithm. The choices are:
+
+   1 = MDM78
+   2 = identity
+   3 = your own matrix
+
+
+        MDM78 is the standard matrix that is used for proteins and  an
+  identity  matrix is the default matrix for nucleic acids. However an
+  identity matrix is also useful for protein  comparisons.  "Your  own
+  matrix"  allows  users  to  apply  any  other matrix, as long as the
+  matrix file is in the same format as MDM78.  For comparisons of  DNA
+  it  might be useful to try one that gave say 3 for exact matches and
+  1 for R-R or Y-Y, else=0.
+ @33. TX 3 @Set number of sd's for Quickscan
+
+        The  quickscan  algorithm  is  as  follows.  The  dot   matrix
+  positions  are found for all words of some minimum length (obviously
+  length 1 is most sensitive)  that  are  common  to  both  sequences.
+  Imagine  a  diagonal  line  running  from  corner  to  corner of the
+  diagram, at right angles to the diagonals  in  the  dotmatrix,   The
+  scores  for the common words (according to the current score matrix,
+  e.g. MDM78) are accummulated at the appropriate  positions  on  that
+  imaginary  line,  hence  producing  a  histogram.  The  histogram is
+  analysed to find its mean and standard deviation. The diagonals that
+  lie  above  some cutoff score (defined in standard deviation units),
+  are rescanned  using  the  proportional  algorithm,  and  a  diagram
+  produced.
+
+        This option allows the number of sd's to be set.
+ @34. TX 3 @Set gap penalities
+
+        The alignment function will produce an  optimal  alignment  of
+  two  segments  of  the  sequence.  The dynamic programming alignment
+  algorithm is based on that of Miller and Myers (). It guarantees  to
+  produce  alignments  with  the optimum score given a score matrix, a
+  gap start penalty, and a gap extension penalty. That is, starting  a
+  gap  costs  a  fixed  penalty  (F) and each residue added to the gap
+  incurs a further penalty (E) so  that  for  each  gap  of  length  K
+  residues the penalty is F + K*E. Gaps at the ends of sequences incur
+  no penalty.
+
+        This option allows the gap penalties to be set.
+ @ end of help
+
+
+
+
+
+
diff --git a/help/sip_menu b/help/sip_menu
new file mode 100644
index 0000000..ca6b6db
--- /dev/null
+++ b/help/sip_menu
@@ -0,0 +1,78 @@
+-1 0 22 2 T General
+-1 0 22 2 X General
+-2 0 51 2 T Screen control
+-2 0 72 2 X Screen
+-3 0 101 2 T Set parameters
+-3 0 101 2 X Set parameters
+-4 0 126 2 T Comparison
+-4 0 126 2 X Comparison
+0 -1 144 208 T SIP
+0 -1 144 208 X SIP
+1 0 12690 39 T Help
+1 0 12690 39 X Help
+2 0 13755 3 T Quit
+2 0 13755 3 X Quit
+3 1 13828 220 T Read a new sequence
+3 1 13828 220 X Read a new sequence
+4 1 23656 10 T Define active region
+4 1 23656 10 X Define active region
+5 1 24191 16 T List a sequence
+5 1 24191 16 X List a sequence
+6 1 25001 4 T List a text file
+6 1 25001 4 X List a text file
+7 1 25143 12 T Direct output to disk
+7 1 25143 12 X Direct output to disk
+8 1 25627 4 T Write active region to disk
+8 1 25627 4 X Write active region to disk
+9 1 25764 5 T Edit the sequences
+9 1 25764 5 X Edit the sequences
+10 2 25944 3 T Clear graphics
+10 2 25944 3 X Clear graphics
+11 2 26021 3 T Clear text
+11 2 26021 3 X Clear text
+12 2 26089 15 T Draw a ruler
+12 2 26089 15 X Draw a ruler
+13 2 26869 54 T Use cross hair
+13 2 26869 54 X Use cross hair
+14 2 28754 29 T Reposition plots
+14 2 28754 29 X Reposition plots
+15 2 30429 13 T Label a diagram
+15 2 30429 13 X Label a diagram
+16 2 31213 7 T Display a map
+16 2 31213 7 X Display a map
+17 4 31596 19 T Apply identities algorithm
+17 4 31596 19 X Apply identities algorithm
+18 4 32260 81 T Apply proportional algorithm
+18 4 32260 81 X Apply proportional algorithm
+19 4 36686 42 T List matching spans
+19 4 36686 42 X List matching spans
+20 3 37569 16 T Set span length
+20 3 37569 16 X Set span length
+21 3 38560 13 T Set proportional score
+21 3 38560 13 X Set proportional score
+22 3 39251 6 T Set identities score
+22 3 39251 6 X Set identities score
+23 3 39544 79 T Calculate expected scores
+23 3 39544 79 X Calculate expected scores
+24 3 43148 90 T Calculate observed scores
+24 3 43148 90 X Calculate observed scores
+25 3 46152 26 T Show current parameter settings
+25 3 46152 26 X Show current parameter settings
+27 2 46802 5 T Draw a /
+27 2 46802 5 X Draw a /
+26 4 46991 57 T Quick scan
+26 4 46991 57 X Quick scan
+28 4 49883 90 T Align sequences
+28 4 49883 90 X Align sequences
+29 1 55133 4 T Complement the sequences
+29 1 55133 4 X Complement the sequences
+30 3 55256 9 T Switch main diagonal
+30 3 55256 9 X Switch main diagonal
+31 3 55755 8 T Switch identities
+31 3 55755 8 X Switch identities
+32 3 56202 17 T change score matrix
+32 3 56202 17 X change score matrix
+33 3 56884 16 T Set number of sd's for Quickscan
+33 3 56884 16 X Set number of sd's for Quickscan
+34 3 57767 13 T Set gap penalities
+34 3 57767 13 X Set gap penalities
diff --git a/help/splitp_help b/help/splitp_help
new file mode 100644
index 0000000..ee183b3
--- /dev/null
+++ b/help/splitp_help
@@ -0,0 +1,132 @@
+
+        Preparing the PROSITE protein motif library  for  use  by
+the Staden programs
+
+        Introduction
+
+        A library of protein motifs (in our terminology,  because
+they  include  variable  gaps, some would be called patterns) has
+recently  become  available  from  Amos  Bairoch,Departement   de
+Biochimie Medicale,University of Geneva Currently it contains 317
+patterns/motifs and arrives on tape or cdrom in two files: a .dat
+file  and  a  .doc  file. There is also a user documentation file
+prosite.usr. Here I outline  what  is  required  to  prepare  the
+PROSITE library for use by our programs.
+
+        Three programs need  to  be  run  SPLITP1,  SPLITP2,  and
+SPLITP3.
+
+        Outline of the PROSITE files
+
+        A typical entry in the .dat file is shown below.
+
+ID   2FE2S_FERREDOXIN; PATTERN.
+AC   PS00197;
+DT   APR-1990 (CREATED); APR-1990 (DATA UPDATE); APR-1990 (INFO UPDATE).
+DE   2Fe-2S ferredoxins, iron-sulfur binding region signature.
+PA   C-x(1,2)-[STA]-x(2)-C-[STA]-{P}-C.
+NR   /RELEASE=14,15409;
+NR   /TOTAL=69(69); /POSITIVE=63(63); /UNKNOWN=0(0); /FALSE_POS=6(6);
+NR   /FALSE_NEG=5(5);
+CC   /TAXO-RANGE=A?EP?; /MAX-REPEAT=1;
+CC   /SITE=1,iron_sulfur; /SITE=5,iron_sulfur; /SITE=8,iron_sulfur;
+DR   P15788, FER$APHHA , T; P00250, FER$APHSA , T; P00223, FER$ARCLA , T;
+DR   P00227, FER$BRANA , T; P07838, FER$BRYMA , T; P13106, FER$BUMFI , T;
+DR   P00247, FER$CHLFR , T; P07839, FER$CHLRE , T; P00222, FER$COLES , T;
+DO   PDOC00175;
+//
+
+        Each entry has an  accession  number  (here  PS00197),  a
+pattern definition (here C-x(1,2)-[STA]-x(2)-C-[STA]-{P}-C) and a
+documentation  file  cross  reference  (here  PDOC00175).    This
+pattern  means: C, gap of 1 or 2, any of STA, gap of 2, C, any of
+STA, not P, C.
+
+        We need to convert all of these patterns into our pattern
+definitions  (as  membership  of  a  set, with the appopriate gap
+ranges)  and  write  each  into  a  separate  pattern  file  with
+corresponding "membership of a set" weight matrices. Each pattern
+file  is  named  accession_number.pat  (here  PS00197.PAT).   The
+corresponding    matrix    files    are    accession_number.wtsa,
+accession_number.wtsb, etc for  however  many  are  needed  (here
+PS00197.WTSA  and  PS00197.WTSB):  two  are needed because of the
+variable gap.
+
+        In addition we can optionally split  the  .dat  and  .doc
+files  into  separate  files,  one  for  each  entry,  with names
+accession_number.dat and accession_number.doc. Also we create  an
+index  for  the  library  prosite.lis,  which  gives  a  one line
+description of each pattern, and ends with the pattern  file  and
+documentation file numbers. The start of the file is shown below.
+
+N-glycosylation site.                                                00001,00001
+Glycosaminoglycan attachment site.                                   00002,00002
+Tyrosine sulfatation site.                                           00003,00003
+cAMP- and cGMP-dependent protein kinase phosphorylation site.        00004,00004
+
+So the name of the pattern file for Glycosaminoglycan  attachment
+site is PS00002.PAT, and for the documentation file PDOC00002.DOC
+
+        Finally we create a  file  of  file  names  for  all  the
+patterns in the library.
+
+        To use the complete PROSITE  library  from  program  pip,
+select  "pattern  searcher"  and  choose  the option "use file of
+pattern file names", and give the file name prosite.nam). For any
+matches  found,  the  accession  number and pattern title will be
+displayed.
+
+        Running the conversion programs
+
+        Only SPLITP3 is necessary  for  using  the  library.  The
+others programs only make the original files marginally easier to
+browse through and produce an index.
+
+        SPLITP1 splits the prosite.dat file to create a  separate
+file   for   each   entry.   Each  file  is  automatically  named
+PSentry_number.dat. In addition  it  creates  an  index  for  the
+library (see above).
+
+        SPLITP2 performs the same operation for  the  Prosite.doc
+file,   except   that  no  index  is  created.  Files  are  named
+PSentry_number.doc.
+
+        SPLITP3 creates a separate pattern file and weight matrix
+files  for  each prosite entry from the file prosite.dat. Pattern
+files  are  named   PSentry_number.pat,   weight   matrix   files
+PSentry_number.wtsa,  Psentry_number.wtsb, etc. The pattern title
+is the one line description of the motif. SPLITP3 also creates  a
+file  of  file  names. Notice that it will ask for a path name so
+that the path can be included in the file of file names. This  is
+the path to the directory in which the pattern files are stored.
+
+        Notes
+
+        Obviously the use of files of file  names  is  a  general
+solution,   and  anybody  could  now  create  their  own  set  of
+interesting patterns for screening, or a subset  of  prosite.nam,
+etc.
+
+        Note that 5 of the bairoch motifs contained the symbols >
+or  < which means that the motifs must appear exactly at the N or
+C termini  of  the  sequences.  Currently  our  methods  have  no
+mechanism for such definitions and, for example KDEL motifs, will
+be permitted to occur anywhere throughout a sequence.
+
+        Also, of course, the library does not  have  to  be  used
+solely  for performing mass screenings: each individual entry can
+be used as a single pattern by giving the name of its .pat file -
+eg pathname/ps00002.pat In addition more sophisticated users will
+wish to copy pattern files and weight  matrices  into  their  own
+directories  and  modify  them. For example the cutoff scores are
+probably chosen to be quite high in order to reduce the number of
+false positives, and some users might wish to lower them.
+
+
+
+
+
+
+
+
+
diff --git a/help/staden.references b/help/staden.references
new file mode 100644
index 0000000..532c09b
--- /dev/null
+++ b/help/staden.references
@@ -0,0 +1,81 @@
+
+  References with further information about the methods
+ 
+            Staden, R.  Nucl.  Acid Res.  8, 817-825 (1980)
+                 A computer program to search for tRNA genes. (NIP)
+            Staden, R.  Nucl.  Acid Res.  8, 3673-3694 (1980)
+                 A new computer method for the storage and  manipulation
+                 of DNA gel reading data.  (SAP).
+            Staden, R.  Nucl.  Acid Res.  10, 2951-2961 (1982)
+                 An  interactive  graphics  program  for  comparing  and
+                 aligning   nucleic   acid  and  amino  acid  sequences.
+                 (SIP).
+            Staden, R.  Nucl.  Acid Res.  10, 4731-4751 (1982)
+                 Automation of the computer handling of gel reading data
+                 produced by the shotgun method of DNA sequencing.(SAP)
+            Staden, R.  and  McLachlan,  A.,D.   Nucl.   Acid  Res.   10
+                 141-156 (1982)
+                 Codon preference and its  use  in  identifying  protein
+                 coding regions in long DNA sequences. (NIP)
+            Staden, R.  Nucl.  Acid Res.  12, 499-503 (1984)
+                 A computer program to enter DNA gel reading data into a
+                 computer.  (GIP)
+            Staden, R.  Nucl.  Acid Res.  12, 551-567 (1984)
+                 Measurements of the effects that coding for  a  protein
+                 has  on  on  a  DNA  sequence and their use for finding
+                 genes.  (NIP:  positional base preferences, uneven
+                 positional base frequencies)
+            Staden, R.  Nucl.  Acid Res.  12, 505-519 (1984)
+                 Computer methods to  locate  signals  in  nucleic  acid
+                 sequences.    NIP:   promoters,  ribosome  binding
+                 sites, intron/exon junctions.
+            McLachlan A D, Staden R and Boswell D R,  Nucl.   Acid  Res.
+                 12, 9567-9575 (1984)
+                 Measure of strength of codon preference. (NIP)
+            Staden R, Computer methods to locate genes  and  signals  in
+                 nucleic acid sequences, Genetic Engineering: Principles
+                 and Methods Vol. 7, Edited  by  J. K. Setlow   and   A.
+                 Hollaender, Plenum Publishing Corp.  1985. (NIP)
+            Staden R Nucl. Acid. Res. 14, 217-231 (1986)
+                 The current status and portability of our sequence
+                 handling software. Summary for May 1985.
+            Staden R "Computer Handling of DNA sequencing projects" in
+                 Nucleic acid and protein sequence analysis, A practical
+                 approach, 173-217. Edited by M.J.Bishop and C.J.Rawlings,
+                 IRL press (1987). (SAP)
+            Staden R, Methods to define and locate patterns of motifs in
+                  sequences. CABIOS 4 53-60 (1988). (NIP, PIP,
+                  NIPL, PIPL)
+            Staden R, Methods for calculating the probabilities of finding
+                  patterns in sequences. CABIOS 5 89-96 (1989). (NIP, PIP,
+                  NIPL, PIPL)
+            Staden R,  "Methods for discovering novel motifs in nucleic acid
+                   sequences". CABIOS 5, 293-298, (1989). (MEP)
+            Staden R, Methods to search for patterns in protein and nucleic
+                  acid sequences. In Doolittle, R,R (ed), Methods in
+                  Enzymology, 183, Academic Press, San Diego, CA, 193-211.
+                  (1990) (NIP, NIPL, PIP, PIPL)
+            Staden R, Finding protein coding regions in genomic sequences.
+                  In Doolittle, R,R (ed), Methods in Enzymology, 183, 
+                  Academic Press, San Diego, CA, 163-180. (1990) (NIP)
+            Gleeson T J and Staden R, An X windows and UNIX implementation
+                  of our sequence analysis package. CABIOS 7 398 (1991)
+	    Staden R, Screening protein and nucleic acid sequences against
+                  libraries of patterns. DNA Sequence, in press (NIP, PIP,
+                  SPLITP1, SPLITP2, SPLITP3, PROSITE)
+            Dear S and Staden R, A sequence assembly and editing program for
+                   efficient management of large projects. Nucleic Acids
+                   Research 19 3907-3911 (1991) (XDAP)
+	    Staden R and Dear S, Indexing the sequence libraries: Software
+		   providing a common indexing system for all the standard
+		   sequence libraries. DNA Sequence 3, 99-105 (1992).
+            Dear S and Staden R, A standard file format for data from DNA
+		   sequencing instruments. DNA Sequence 3, 107-110 (1992)
+	    Gleeson T and Hillier L, A trace display and editing program
+		   for data from fluorescence based sequencing machines.
+		   Nucleic Acids Research 19 6481-6483 (1991) (TED)
+	    Staden R, Staden package update. Genome News 13 12-13 (1993)
+
+ 
+
+
diff --git a/help/staden_help b/help/staden_help
new file mode 100644
index 0000000..2a999da
--- /dev/null
+++ b/help/staden_help
@@ -0,0 +1,184 @@
+
+        Introduction to the Staden sequence analysis package  and  its
+  user interface
+
+        The package contains the following programs:
+
+    GIP     Gel input program
+    SAP     Sequence assemble program
+    NIP     Nucleotide interpretation program
+    PIP     Protein interpretation program
+    SIP     Similarity investigation program
+    MEP     Motif exploration program
+    NIPL    Nucleotide interpretation program (library)
+    PIPL    Protein interpretation program (library)
+    SIPL    Similarity investigation program (library)
+
+  GIP  uses  a   digitiser   for   entry   of   DNA   sequences   from
+  autoradiographs.
+  SAP handles everything relating to assembling gel readings in  order
+  to  produce  a consensus sequence. It can also deal with families of
+  protein sequences.
+  NIP provides functions for analysing  and  interpretting  individual
+  nucleotide sequences.
+  PIP provides functions for analysing  and  interpretting  individual
+  protein sequences.
+  MEP analyses families of nucleotide sequences to help  discover  new
+  motifs.
+  NIPL performs pattern searches on nucleotide sequence libraries.
+  PIPL performs pattern searches on protein sequence libraries.
+  SIP provides functions for comparing and aligning pairs  of  protein
+  or nucleotide sequences.
+  SIPL searches nucleotide and protein sequence libraries for  entries
+  similar to probe sequences.
+
+
+        Documentation
+
+        As is explained below, the programs SAP, NIP, PIP, SIP and MEP
+  have  online  help,  and  the  help  files  have the names: HELPSAP,
+  HELPNIP, HELPPIP, HELPSIP, HELPMEP. These files can be displayed  on
+  the  screen or printed using the appropriate commands. Currently the
+  help for the other programs is also contained in  these  files.  For
+  example help for NIPL is in HELPNIP. This file is called HELPSTADEN.
+
+        Sequence formats
+
+        The shotgun sequencing program SAP deals only with simple text
+  files  for gel readings, and is a self-contained system.  However as
+  there is still no single agreed format for finished sequences or for
+  libraries  of  sequences, the other programs in the package can read
+  data that is stored in several ways.
+
+        The analytical programs can read individual  sequences  stored
+  in  the following formats: Staden, EMBL, Genbank, PIR (also known as
+  NBRF), and GCG, but for storing whole  libraries  we  use  only  PIR
+  format.  In  addition  these programs can perform a number of simple
+  operations using libraries stored in this format. They  can  extract
+  entries  by  entry  name, can search titles for keywords, can search
+  the whole of the annotation files  for  keywords,  and  can  extract
+  annotations for any named entry.  We reformat all sequence libraries
+  into PIR format. Currently we have NBRF, EMBL, SWISSPROT and VECBASE
+  libraries in PIR format.
+
+        The library  searching  programs  operate  only  on  sequences
+  stored in PIR format.
+
+        The  analytical  programs  will  operate  with  uppercase   or
+  lowercase  sequence  characters. In addition T and U are equivalent.
+  SAP uses uppercase letters for original gel readings  and  lowercase
+  letters  for  characters that are corrected by the automatic editor.
+  Programs NIP  and  PIP  use  IUB  symbols  for  redundancy  in  back
+  translations  and  for  sequence  searches.   The  symbols are shown
+  below.
+
+
+              NC-IUB SYMBOLS
+
+        A,C,G,T
+        R        (A,G)        'puRine'
+        Y        (T,C)        'pYrimidine'
+        W        (A,T)        'Weak'
+        S        (C,G)        'Strong'
+        M        (A,C)        'aMino'
+        K        (G,T)        'Keto'
+        H        (A,T,C)      'not G'
+        B        (G,C,T)      'not A'
+        V        (G,A,C)      'not T'
+        D        (G,A,T)      'not C'
+        N        (G,A,C,T)    'aNy'
+
+
+        The user interface
+
+        The user interface is common to all programs. It consists of a
+  set  of  menus and a uniform way of presenting choices and obtaining
+  input from the user. This section describes: the  menu  system;  how
+  options  are  selected  and   other  choices  made;  how  values are
+  supplied to the program;  how help is obtained, and  how  to  escape
+  from  any  part of a program. In addition it gives information about
+  saving results in files and  the  use  of  graphics  for  presenting
+  results.
+
+        Menus
+
+        Each program has several menus and numerous options. Each menu
+  or  option  has  a  unique  number that is used to identify it. Menu
+  numbers are distinguished from option numbers by being  preceded  by
+  the  letter  m (or M, all programs make no distinction between upper
+  and lower case letters). With the exception of some parts of program
+  SAP,  the  menus  are  not hierachical, rather the options they each
+  contain are simply lists of related functions and their  identifying
+  numbers. Therefore options can be selected independently of the menu
+  that is currently being shown on the  screen,   and  the  menus  are
+  simply  memory  aides.  All options and menus are selected by typing
+  their option number when the programs present the prompt
+
+        "? Menu or option number =".
+
+        To select a menu type its number preceded by the letter M.  To
+  select  an  option  type  its number.  If you type only "return" you
+  will get menu m0 which is simply a list of menus. If you  select  an
+  option  you  will  return  to the current menu after the function is
+  completed.
+
+        When you select an option, in  many  cases  the  program  will
+  immediately perform the operation selected without further dialogue.
+  If you precede an option number by the letter d (e.g. D17), you will
+  force the program to offer dialogue about the selected option before
+  the function operates, hence allowing you to change the value of any
+  of  its parameters.  If you precede an option number by the symbol ?
+  (e.g. ?17), you will be given help on the option (here 17).
+
+        Where possible, equivalent  or  identical  options  have  been
+  given  the  same numbers in all programs, and so users quickly learn
+  the numbers for the functions they employ most often.
+
+        Help
+
+        As mentioned above, help about each option can be obtained  by
+  preceding  the option number  by the symbol ? when you are presented
+  with the prompt "? Menu or option number", but there are two further
+  ways of obtaining help. Whenever the program asks a question you can
+  respond by typing the symbol ?  and  you  will  receive  information
+  about  the  current  option. In addition, option number 1 in all the
+  programs will give help on all of a programs functions.
+
+        Quitting
+
+        To exit from any point in a program you type ! for quit. If  a
+  menu is on the screen this will stop the program, otherwise you will
+  be returned to the last menu.
+
+        Other interactions
+
+        Questions are  presented in a  few  restricted  ways.  In  all
+  cases  typing only "return" in response to a question means yes, and
+  typing N or n means no.
+
+        Obvious opposites such as "clear screen"  and  "keep  picture"
+  are  presented with only the default shown. For example in this case
+  the default is generally "keep picture" so the program will display:
+
+        "(y/n) (y) Keep picture"
+
+        and the picture will be retained if the  user  types  anything
+  other than N or n, (in which case the screen will be cleared).
+
+        Where there are choices that are  not  obvious  opposites,  or
+  there  are  more than two choices, two further conventions are used:
+  "radio buttons" and "check boxes".
+
+        Radio buttons are used when only one of a  number  of  choices
+  can  be made at any one time. The choices are presented arranged one
+  above the other, each choice with a number for  its  selection,  and
+  the  default choice marked with an X. For example in the restriction
+  enzyme search routine the following choices are offered:
+
+
+           Select output mode
+     1 order results enzyme by enzyme
+     2 order results by positon
+   X 3 show only infrequent cutters
+     4 show names above the sequence
+   
\ No newline at end of file
diff --git a/help/stadenp_help b/help/stadenp_help
new file mode 100644
index 0000000..76b57e0
--- /dev/null
+++ b/help/stadenp_help
@@ -0,0 +1,26 @@
+  Standard Staden Programs
+
+  gip        Gel input program
+  sap        Sequence assembly program
+  (x)dap     Sequence assembly program
+  (x)nip     Nucleotide interpretation program
+  (x)pip     Protein interpretation program
+  (x)sip     Similarity investigation program
+  (x)mep     Motif exploration program
+  nipl       Nucleotide interpretation program (library)
+  pipl       Protein interpretation program (library)
+  sipl       Similarity investigation program (library) 
+  Those with (x) have both tektronix (say nip) and x (say xnip) versions.
+  Environment variables for help files
+  HELPSAP    sap
+  HELPDAP    dap
+  HELPGIP    gip
+  HELPNIP    nip
+  HELPPIP    pip
+  HELPSIP    sip
+  HELPMEP    mep
+  HELPSTADEN Introduction and user interface
+  e.g. to read HELPSTADEN type  'more $HELPSTADEN'
+ 
+
+
diff --git a/help/ted.help b/help/ted.help
new file mode 100644
index 0000000..bbea1bf
--- /dev/null
+++ b/help/ted.help
@@ -0,0 +1,168 @@
+Trace Editor Help
+-----------------
+
+The ted trace editor is a prototype to allow the display and editing
+of traces from sequencing machines, and the simple editing of plain
+sequences. It runs under the X window system. It provides simultaneous
+display of traces and bases. The editing allows individual bases to be
+removed and new ones added, and also a range of bases at either end to
+be cutoff. Currently, only ABI result files and plain sequences are
+accepted.
+
+Only one trace can be edited at a time.
+
+
+Invocation
+----------
+
+ted can be run from the command line by simply typing:
+
+    ted
+
+It will come up with no sequence initially displayed. If provided with
+any arguments it does not understand, or invalid combinations of
+arguments, ted will exit with a message indicating its intended usage.
+ted accepts the standard X arguments allowing, for example, background
+colour or geometry to be specified. ted can accept an argument
+specifying an initial file to display. The key for this is the format
+of the file, for example:
+
+    ted -ABI {ABI format filename}
+    ted -plain {plain format filename}
+
+The file is then displayed at 50% magnification, with the caret 
+initially positioned at the first base.
+
+When an initial file is given, a base number of interest and/or a
+magnification can also be given, for example:
+
+    ted -ABI {ABI format file} -baseNum 280 -mag 30
+
+or the bottom strand may be specified:
+ 
+    ted -ABI {ABI format file} -baseNum 280 -mag 30 -bottom 1
+	or
+    ted -ABI {ABI format file} -bottom 1
+
+or a string of nucleotides on which the center the window:
+	
+    ted -ABI {ABI format file} -astring 1
+	or
+    ted -ABI {ABI format file} -astring 1 -mag 30 -bottom 1
+
+Options can be specified in any order.
+
+An output filename can be specified in a similar manner:
+
+    ted -ABI inputfilename -output outputfilename
+
+The default output filename is inputfilename.seq
+
+If you are running the program on a remote machine, you must
+specify a display parameter:
+
+    ted -display machine_name:0.2
+
+You can also specify the size of the opening window or
+other screen parameters by the following:
+
+    ted  -geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]
+	   [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]
+
+Displays
+--------
+
+When running, ted displays the name of the file it is currently
+operating on (if any) and the original number of bases.
+
+A so-called viewport presents four different synchronised views of
+part of the trace. The top one indicates the sequence indices - the
+first digit of the number if positioned over the base to which that 
+number corresponds.  Below this is a list of the bases as originally
+found in the file (this is the interpretation of the trace as made by
+the sequencing machine). Below this is the list of bases as edited by
+the user --- initially, if this file has not been edited in the past,
+this is identical to the list of original bases.  However, if in a 
+previous session the user has edited this sequence, the edited
+version of the sequence will appear in the edit window. 
+The final display is of the traces produced by the sequencing
+machine for the four respective bases.
+
+Two controls allow the view presented to be adjusted: both are
+horizontal sliders or scrollbars. The first affects the magnification
+at which the trace is viewed. The minimum magnification is such that
+the whole of the trace is visible within the viewport; when a trace is
+first input, this is the magnification used. The maximum magnification
+is such that bases are spaced out with several characters of space
+between them --- this should allow more than enough room for base
+insertions to be clearly visible. The second scrollbar is immediately
+above the viewport and allows the user to select which part of the
+trace is viewed. Both the sliders work in a similar way: the middle
+mouse button can be used to drag the thumb to any desired position,
+the left and right mouse buttons can be clicked within the scrollbar
+to indicate that paging up or down is desired. In the case of the
+viewport scrollbar, the amount of paging is determined by how far up
+the scrollbar the pointer is.
+
+The whole ted window can be expanded and contracted (to an extent) by
+dragging the "grow-region" provided by whatever window manager is
+running. The viewport takes up all of this change in size.
+
+Controls
+--------
+
+ted has four buttons. "Quit" exits the program after first checking
+whether there is a sequence which has been edited and not saved.
+
+"Help" pops up this window which has a scrollbar on the left allowing
+all the text to be viewed.
+
+"Input" presents a dialogue which asks for the format and name of a
+file to be processed. The bases and (if this is not a plain format
+file) traces are read in and displayed for editing. The only
+conversion performed on bases is from 'N' to '-'.
+
+"Output" presents a dialogue which asks for a filename into which the
+edited and clipped bases can be saved. The default value can be set
+on the command line using the "-output" keyword. No conversion of bases
+is performed on output.
+
+ted operates in one of three editing modes, one of which is selected
+from three "radio buttons". The currently selected mode is
+highlighted.
+
+Editing
+-------
+
+In "Edit sequence" mode, the (lower) list of editable bases can be
+edited in much the same way as a text editor operates. A "caret" which
+is visible in the display of edited bases can be moved left and right
+with the cursor keys (these are sometimes called arrow keys and often
+appear on numeric keypads). It can also be positioned by clicking any
+button while the pointer is pointing into either of the list of bases
+or the traces. The DELETE key deletes the base immediately to the left
+of the caret. Any printing character can be inserted to the right of
+the caret by simply typing it. Inserted characters are placed halfway
+between their neighbours, or if a space is left by the deletion of a
+base originally there, its position is used. A base can thus be
+changed by deleting it and entering the new base.
+
+Note that in the current version of ted the caret is not constrained
+to remain within the viewed part of the display and that editing can
+still continue while it is thus invisible. Such editing would probably
+only occur by accident.
+
+ted provides a facility to define a cutoff at either end of the trace.
+A number of the leftmost bases (corresponding to the vector) and the
+rightmost bases (corresponding to the point where the data become
+unreliable) can be defined by setting the editor into "Adjust left
+cutoff" or "Adjust right cutoff" mode. In either of these modes, the
+pointer and mouse buttons can be used to indicate the cutoff point,
+and the cursor keys can be used to adjust this leftwards or
+rightwards. Initially, the cutoff regions are both empty. The cutoff
+regions are clearly indicated on the list of edited bases display and
+on the traces display by being drawn with a dimmed background.
+
+When the sequence is written out, the list of edited bases, with both
+cutoff regions removed, is written. The output contains newlines
+for convenient formatting and always ends with one.
diff --git a/manl/staden.l b/manl/staden.l
new file mode 100644
index 0000000..f9381f2
--- /dev/null
+++ b/manl/staden.l
@@ -0,0 +1,102 @@
+.TH staden 1L "November 1991" "MRC LMB" "LOCAL"
+.SH NAME
+staden, xstaden \- sequence analysis suite
+.SH DESCRIPTION
+.I staden
+is a suite of programs for sequence analysis. Currently available are
+.I mep,
+.I nip,
+.I pip,
+.I sap,
+.I sip,
+.I nipl,
+.I pipl,
+.I and sipl.
+These all run under the SUN X11
+.I xterm
+Tektronics terminal emulator, but also work with the VT640 terminal
+and the VersaTermPro and MS-Kermit emulators if they login to a SUN.
+.PP
+.I xstaden
+is the same set of programs, named
+.I xmep,
+.I xnip,
+.I xpip,
+.I xsap,
+.I xdap,
+and
+.I xsip,
+which run directly under X providing a convenient user interface,
+including resizable output and pull-down menus. All these programs
+accept the standard X arguments. The library searching programs
+nipl, pipl and sipl are only available in xterm form.
+.PP
+Sequence library access is provided for the format as distributed
+on CDROM by EMBL. The CDROM contains the EMBL nucleotide library and
+the SWISSPROT protein library. The libraries can be left on the
+CDROM or transferred to hard disk.
+.PP
+The programs also provide an interface to the PROSITE protein motif
+library.
+.PP
+Some initialisation is required in order to use the package. csh users
+should insert the following in their .login files:
+.IP
+ setenv STADENROOT /home/BioSW/staden
+.IP
+ source $STADENROOT/staden.login
+.LP
+Users of the Bourne shell, sh, should insert the following in
+their .profile:
+.IP
+ STADENROOT=/home/BioSW/staden
+.IP
+ export STADENROOT
+.IP
+ . $STADENROOT/staden.profile
+.LP
+These initialisations will alter your shell's search path so
+that it can find the program binaries, and other files that are
+required.
+.SH ENVIRONMENT
+The following environment variables may be set in the
+user's \fI .login\fP or \fI .profile\fP file:
+.TP 20
+.BI STADENROOT= /home/BioSW/staden
+This must be set in the user's initialisation.
+.TP 20
+.BI SEQEDT= editor
+Set the editor to be used by the package.  The default is
+\fIemacs\fP.
+.SH FILES
+.PD 0
+.TP 30
+$STADENROOT/staden.login
+csh initialisation
+.TP 30
+$STADENROOT/staden.profile
+sh initialisation
+.TP 30
+$STADENROOT/tables
+Tables used by the programs
+.TP 30
+$STADENROOT/help
+Helpfiles used by the programs, documentation of the user interface
+and of each of the programs.
+.TP 30
+$STADENROOT/tables/SEQUENCELIBRARIES
+Defines the sequence libraries available, their file descriptors
+and the prompts to appear on the users screen.
+.SH AUTHOR
+Rodger Staden, MRC Laboratory of Molecular Biology, Hills Rd., Cambridge,
+CB2 2QH, UK.
+.SH BUGS
+.PP
+When using the xterm programs and in graphics input mode,
+a carriage return should not be
+entered on its own but should be preceded by some other character,
+such as SPACE, COMMA or K. If a carriage return is entered on its
+own, some garbage will (relatively) harmelssly appear on the plot.
+.PP
+General comments on the package can be sent to
+\fI<rs@uk.ac.cam.mrc-lmb>\fP 
diff --git a/manl/ted.l b/manl/ted.l
new file mode 100644
index 0000000..21201a9
--- /dev/null
+++ b/manl/ted.l
@@ -0,0 +1,107 @@
+.TH ted 1L "July 1991" "MRC LMB" "LOCAL"
+.SH NAME
+ted \- trace editor
+.SH SYNOPSIS
+.B ted
+[(
+.B -ABI\||\|-ALF\||\|-plain
+)
+.I tracefilename
+[
+.B -baseNum
+.I number
+]
+.B [
+.B -mag
+.I number
+( 1 to 100 )
+] 
+.B [
+.B -bottom
+.I number
+(1(true) or 0(false))
+.B ]
+.B [
+.B -astring
+.I nucleotide-string
+]] 
+.B [
+.B -enzyme
+.I  5' cutting sequence
+] 
+.B [
+.B -raw
+.I  filename
+(to be placed at head of xdap compatible .seq file)
+.B ]
+[
+.B -output
+.I outputfilename
+]
+
+.SH DESCRIPTION
+.B ted
+is a simple prototype editor for traces produced from automatic
+sequencing machines. It allows the traces (from the ABI
+or ALF sequencing machines) produced to be
+displayed along with the machines interpretation of these into
+bases and an initially identical sequence which can be edited
+by the user. A cutoff region can be defined at both ends. The
+edited and clipped list of bases can then be written out.
+.LP
+When initially run,
+.B ted
+displays the trace file
+.I tracefilename
+(if given) of the specified format centered on the base number
+.I baseNum
+(if given). If no file is provided,
+.B ted
+initially displays nothing.
+.LP
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+being represented by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+The information button brings up signal strength and average spacing for
+ABI files.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.  The output filename is specified in a dialogue, 
+but a default value of inputfilename.seq is provided or the default value
+can be given with the
+.I outputfilename
+argument.
+.LP
+A simple help system is provided.
+.SH FILES
+.PD 0
+.TP 20
+.B ted.help
+Text provided in the help window.
+.TP
+.B /usr/lib/X11/app-defaults/Xted
+Default application resources.
+.SH ENVIRONMENT
+.TP 20
+.SB XFILESEARCHPATH
+Specifies the locations where
+.B ted.help
+is sought.
+If this is not defined,
+.B ted.help
+must be in the
+.B /usr/lib/X11/app-defaults
+directory.
+.SH AUTHORS
+Tim Gleeson, LaDeana Hillier, Simon Dear.
diff --git a/src/Misc/README b/src/Misc/README
new file mode 100644
index 0000000..07f2d99
--- /dev/null
+++ b/src/Misc/README
@@ -0,0 +1,7 @@
+Miscellaneous Routines                Simon Dear, 14 April 1992
+---------------------------------------------------------------
+
+The source modules in this directory are for commonly used
+routines. The archive misc.a should be made before any
+other programs supplied on this tape.
+
diff --git a/src/Misc/crash.c b/src/Misc/crash.c
new file mode 100644
index 0000000..2a540c7
--- /dev/null
+++ b/src/Misc/crash.c
@@ -0,0 +1,15 @@
+#include "misc.h"
+#include <stdio.h>
+#include <stdlib.h>
+#include <stdarg.h>  /* varargs needed for v*printf() prototypes */
+
+void crash (char* format,...)
+{
+    va_list args ;
+
+    va_start (args,format) ;
+    vfprintf (stderr,format,args) ;
+    va_end (args) ;
+
+    exit (1) ;
+}
diff --git a/src/Misc/date.c b/src/Misc/date.c
new file mode 100644
index 0000000..37b51ce
--- /dev/null
+++ b/src/Misc/date.c
@@ -0,0 +1,14 @@
+#include "misc.h"
+#include <stdio.h>
+
+/******************************************************************************/
+/*
+** Time and date calculations
+*/
+#include <time.h>
+char *date_str()
+{
+    time_t clock;
+    clock = time(NULL);
+    return ctime(&clock);
+}
diff --git a/src/Misc/filenames.c b/src/Misc/filenames.c
new file mode 100644
index 0000000..2b95cfb
--- /dev/null
+++ b/src/Misc/filenames.c
@@ -0,0 +1,39 @@
+#include "misc.h"
+#include <string.h>
+
+char *fn_tail(char *fn)
+/*
+** Return file part (:t) of
+** directory path
+*/
+{
+    int len;
+    char *s;
+
+    len = strlen(fn);
+    for(s=fn+len-1;len && *s != '/'; len--, s--) ;
+    s++;
+
+    return s;
+}
+
+
+void fn_toupper (char *s)
+/*
+** Convert file to upper case
+** ignoring directory path head
+*/
+{
+    str_toupper(fn_tail(s));
+}
+
+
+
+void fn_tolower (char *s)
+/*
+** Convert file to lower case
+** ignoring directory path head
+*/
+{
+    str_tolower(fn_tail(s));
+}
diff --git a/src/Misc/files.c b/src/Misc/files.c
new file mode 100644
index 0000000..89056ec
--- /dev/null
+++ b/src/Misc/files.c
@@ -0,0 +1,41 @@
+#include "misc.h"
+
+#include <sys/types.h>
+#include <sys/stat.h>
+/* Alliant's Concentrix <sys/stat.h> is hugely deficient */
+/* Define things we require in this program              */
+/* Methinks S_IFMT and S_IFDIR aren't defined in POSIX   */
+#ifndef S_ISDIR
+#define S_ISDIR(m)      (((m)&S_IFMT) == S_IFDIR)
+#endif /*!S_ISDIR*/
+#ifndef S_ISREG
+#define S_ISREG(m)      (((m)&S_IFMT) == S_IFREG)
+#endif /*!S_ISREG*/
+
+int is_directory(char * fn)
+{
+    struct stat buf;
+    if ( stat(fn,&buf) ) return 0;
+    return S_ISDIR(buf.st_mode);
+}
+
+int is_file(char * fn)
+{
+    struct stat buf;
+    if ( stat(fn,&buf) ) return 0;
+    return S_ISREG(buf.st_mode);
+}
+
+int file_exists(char * fn)
+{
+    struct stat buf;
+    return ( stat(fn,&buf) == 0);
+}
+
+int file_size(char * fn)
+{
+    struct stat buf;
+    if ( stat(fn,&buf) != 0) return 0;
+    return buf.st_size;
+}
+
diff --git a/src/Misc/find.c b/src/Misc/find.c
new file mode 100644
index 0000000..4847096
--- /dev/null
+++ b/src/Misc/find.c
@@ -0,0 +1,39 @@
+#include "misc.h"
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+char *myfind(char *file, char* searchpath, int (*found) (char *) )
+{
+    static char wholePath[1024];
+    char *path;
+    char *delimiters=":";
+    char *f;
+
+    f = NULL;
+    if (found(file)) {
+	strcpy(wholePath,file);
+	f = wholePath;
+    } else if (searchpath != NULL) {
+	char *paths;
+
+	paths = (char *) malloc(strlen(searchpath)+1);
+	strcpy(paths,searchpath);
+
+	path = (char *) strtok(paths,delimiters);
+	while (path!= NULL) {
+
+	    (void) strcpy(wholePath,path);
+	    (void) strcat(wholePath,"/");
+	    (void) strcat(wholePath,file);
+	    if (found(wholePath)) {
+		f = wholePath;
+		break;
+	    }
+	    path = (char *) strtok((char *)NULL,delimiters);
+	}
+	free(paths);
+    }
+
+    return f;;
+}
diff --git a/src/Misc/makefile-alpha b/src/Misc/makefile-alpha
new file mode 100644
index 0000000..7b8cbdb
--- /dev/null
+++ b/src/Misc/makefile-alpha
@@ -0,0 +1,52 @@
+#
+# Makefile for Miscellaneous routines - Alpha version 
+
+#
+# Objects made by this makefile
+#
+LIBS = misc.a
+
+#
+# Where the wild things are
+#
+SRC = .
+INCLUDES = -I$(SRC)
+
+#
+# Alpha Version
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact $(INCLUDES)
+ARFLAGS = rv
+
+#
+# Objects
+#
+OBJS = \
+	crash.o \
+	date.o \
+	filenames.o \
+	files.o \
+	find.o \
+	shell.o \
+	strings.o
+
+#
+# Main dependency
+#
+$(LIBS) : $(OBJS)
+	/bin/rm -f $(LIBS) ;\
+	$(AR) $(ARFLAGS) $(LIBS) $(OBJS) ;\
+	ranlib $(LIBS)
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all : $(LIBS)
+
+clean :
+	/bin/rm -f $(OBJS) $(LIBS)
diff --git a/src/Misc/makefile-dec b/src/Misc/makefile-dec
new file mode 100644
index 0000000..ddb6f22
--- /dev/null
+++ b/src/Misc/makefile-dec
@@ -0,0 +1,53 @@
+#
+# Makefile for Miscellaneous routines - DEC Ultrix (mips) version 
+
+#
+#
+# Objects made by this makefile
+#
+LIBS = misc.a
+
+#
+# Where the wild things are
+#
+SRC = .
+INCLUDES = -I$(SRC)
+
+#
+# DEC Version
+#
+CC = c89
+CFLAGS = -g -std -w0 -DNOSTRDUP $(INCLUDES)
+
+
+#
+# Objects
+#
+OBJS = \
+	crash.o \
+	date.o \
+	filenames.o \
+	files.o \
+	find.o \
+	shell.o \
+	strings.o
+
+#
+# Main dependency
+#
+$(LIBS) : $(OBJS)
+	/bin/rm -f $(LIBS) ;\
+	$(AR) $(ARFLAGS) $(LIBS) $(OBJS) ;\
+	ranlib $(LIBS)
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all : $(LIBS)
+
+clean :
+	/bin/rm -f $(OBJS) $(LIBS)
diff --git a/src/Misc/makefile-sgi b/src/Misc/makefile-sgi
new file mode 100644
index 0000000..c7b502c
--- /dev/null
+++ b/src/Misc/makefile-sgi
@@ -0,0 +1,59 @@
+#
+# Makefile for Miscellaneous routines - Iris version 
+
+#
+# Shell to perform make commands
+#
+SHELL=/bin/sh
+
+#
+#
+# Objects made by this makefile
+#
+LIBS = misc.a
+
+#
+# Where the wild things are
+#
+SRC = .
+INCLUDES = -I$(SRC)
+
+#
+# Iris Version
+#
+CC = cc
+# -xansi needed to define __EXTENSION__ (eg - popen())
+CFLAGS = -g -xansi -prototypes $(INCLUDES)
+ARFLAGS = rv
+
+#
+# Objects
+#
+OBJS = \
+	crash.o \
+	date.o \
+	filenames.o \
+	files.o \
+	find.o \
+	shell.o \
+	strings.o
+
+#
+# Main dependency
+#
+$(LIBS) : $(OBJS)
+	/bin/rm -f $(LIBS) ;\
+	$(AR) $(ARFLAGS) $(LIBS) $(OBJS) ;
+
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all : $(LIBS)
+
+clean :
+	/bin/rm -f $(OBJS) $(LIBS)
diff --git a/src/Misc/makefile-solaris b/src/Misc/makefile-solaris
new file mode 100644
index 0000000..687160a
--- /dev/null
+++ b/src/Misc/makefile-solaris
@@ -0,0 +1,53 @@
+#
+# Makefile for Miscellaneous routines (SunOS Version)
+#
+
+#
+# Objects made by this makefile
+#
+LIBS = misc.a
+
+#
+# Where the wild things are
+#
+SRC = .
+INCLUDES = -I$(SRC)
+
+#
+# Sun Version
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+CFLAGS = -ansi -g $(GCCWARNINGS) $(INCLUDES)
+
+
+#
+# Objects
+#
+OBJS = \
+	crash.o \
+	date.o \
+	filenames.o \
+	files.o \
+	find.o \
+	shell.o \
+	strings.o
+
+#
+# Main dependency
+#
+$(LIBS) : $(OBJS)
+	/bin/rm -f $(LIBS) ;\
+	$(AR) $(ARFLAGS) $(LIBS) $(OBJS) ;
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all : $(LIBS)
+
+clean :
+	/bin/rm -f $(OBJS) $(LIBS)
diff --git a/src/Misc/makefile-sun b/src/Misc/makefile-sun
new file mode 100644
index 0000000..873559e
--- /dev/null
+++ b/src/Misc/makefile-sun
@@ -0,0 +1,55 @@
+#
+# Makefile for Miscellaneous routines (SunOS Version)
+#
+
+#
+# Objects made by this makefile
+#
+LIBS = misc.a
+
+#
+# Where the wild things are
+#
+SRC = .
+INCLUDES = -I$(SRC)
+
+#
+# Sun Version
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -ansi -g $(GCCWARNINGS) $(INCLUDES)
+
+
+#
+# Objects
+#
+OBJS = \
+	crash.o \
+	date.o \
+	filenames.o \
+	files.o \
+	find.o \
+	shell.o \
+	strings.o
+
+#
+# Main dependency
+#
+$(LIBS) : $(OBJS)
+	/bin/rm -f $(LIBS) ;\
+	$(AR) $(ARFLAGS) $(LIBS) $(OBJS) ;\
+	ranlib $(LIBS)
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all : $(LIBS)
+
+clean :
+	/bin/rm -f $(OBJS) $(LIBS)
diff --git a/src/Misc/memmove.c b/src/Misc/memmove.c
new file mode 100644
index 0000000..f7a0895
--- /dev/null
+++ b/src/Misc/memmove.c
@@ -0,0 +1,7 @@
+/*
+ * memmove() does not exist on SunOS, despite being an ANSI library call.
+ */
+void *memmove(void *to, const void *from, size_t len) {
+    bcopy(from, to, len);
+    return to;
+}
diff --git a/src/Misc/misc.h b/src/Misc/misc.h
new file mode 100644
index 0000000..162633f
--- /dev/null
+++ b/src/Misc/misc.h
@@ -0,0 +1,44 @@
+#ifndef _misc_h
+#define _misc_h
+
+#include <stdio.h>
+#include <stdarg.h>  /* varargs needed for v*printf() prototypes */
+
+
+extern int is_directory(char * fn);
+extern int is_file(char * fn);
+extern int file_exists(char * fn);
+extern int file_size(char * fn);
+extern int fstrlen(char *f, int max_f);
+extern void f2cstr(char *f, int max_f, char *c, int max_c);
+extern void c2fstr(char *c, int max_c, char *f, int max_f);
+extern char *mystrtok(char *s, char *ct);
+extern char *myfind(char *file, char* searchpath, int (*found) (char *) );
+extern void crash (char* format,...);
+extern void str_tolower (char *s);
+extern void str_toupper (char *s);
+extern char *fn_tail (char *s);
+extern void fn_tolower (char *s);
+extern void fn_toupper (char *s);
+extern void shell_call(char *command, char *output, int len);
+extern char *date_str();
+#ifdef NOSTRDUP
+extern char *strdup(char *s);
+#endif
+#ifdef NOSTRSTR
+extern char *strstr(char *cs, char *ct);
+#endif
+#ifdef NOMEMMOVE
+extern void *memmove(void *to, const void *from, size_t len);
+#endif
+
+#define findfile(F,S) myfind((F),(S),file_exists)
+/*is_file fails for symbolic links*/
+/*#define findfile(F,S) myfind((F),(S),is_file)*/
+
+#define min(A,B) ( ( (A) < (B) ) ? (A) : (B) )
+#define max(A,B) ( ( (A) > (B) ) ? (A) : (B) )
+#define sgn(A) ( (A) ? ( ( (A) < 0 ) ? -1 : 1 ) : 0 )
+
+
+#endif /*_misc_h*/
diff --git a/src/Misc/shell.c b/src/Misc/shell.c
new file mode 100644
index 0000000..ab012fb
--- /dev/null
+++ b/src/Misc/shell.c
@@ -0,0 +1,17 @@
+#include "misc.h"
+#include <stdio.h>
+
+void shell_call(char *command, char *output, int len)
+{
+    FILE *pipe;
+    char *a;
+
+    output[0]='\0';
+    pipe = popen(command,"r");
+    fgets(output,len,pipe);
+    pclose(pipe);
+
+    /* clobber last new line */
+    for (a=output;*a && *a != '\n'; a++);
+    *a = '\0';
+}
diff --git a/src/Misc/stdarg-sun.h b/src/Misc/stdarg-sun.h
new file mode 100644
index 0000000..fd79ec0
--- /dev/null
+++ b/src/Misc/stdarg-sun.h
@@ -0,0 +1,28 @@
+#ifndef _STDARG_H
+#define _STDARG_H
+
+typedef char *va_list;
+
+/* Amount of space required in an argument list for an arg of type TYPE.
+   TYPE may alternatively be an expression whose type is used.  */
+
+#define __va_rounded_size(TYPE)  \
+  (((sizeof (TYPE) + sizeof (int) - 1) / sizeof (int)) * sizeof (int))
+
+#ifndef __sparc__
+#define va_start(AP, LASTARG) 						\
+ (AP = ((char *) &(LASTARG) + __va_rounded_size (LASTARG)))
+#else
+#define va_start(AP, LASTARG) 						\
+ (__builtin_saveregs (),						\
+  AP = ((char *) &(LASTARG) + __va_rounded_size (LASTARG)))
+#endif
+
+void va_end (va_list);		/* Defined in gnulib */
+#define va_end(AP)
+
+#define va_arg(AP, TYPE)						\
+ (AP += __va_rounded_size (TYPE),					\
+  *((TYPE *) (AP - __va_rounded_size (TYPE))))
+
+#endif /* _STDARG_H */
diff --git a/src/Misc/strings.c b/src/Misc/strings.c
new file mode 100644
index 0000000..bd6f88a
--- /dev/null
+++ b/src/Misc/strings.c
@@ -0,0 +1,141 @@
+#include "misc.h"
+#include <stdlib.h>
+#include <ctype.h>
+#include <string.h>
+
+int fstrlen(char *f, int max_f)
+{
+    for (; max_f > 0 && (isspace(f[max_f-1]) || f[max_f-1]=='\0'); max_f--);
+    return max_f;
+}
+
+
+
+
+
+void f2cstr(char *f, int max_f, char *c, int max_c)
+{
+    int i;
+
+    i = min(fstrlen(f,max_f),max_c);
+    strncpy(c,f,i);
+    c[i]='\0';
+}
+
+
+void c2fstr(char *c, int max_c, char *f, int max_f)
+{
+    int i;
+    i = min(strlen(c),max_f);
+    strncpy(f,c,i);
+    for( ; i<max_f; i++) f[i]=' ';
+
+}
+
+
+
+
+char *mystrtok(char *s, char *ct)
+/*
+** When strtok isn't good enough
+*/
+{
+    char *this;
+    static char *look;
+    static int last;
+
+    if (s == NULL) {
+	if (last) return NULL;
+    } else {
+	look = s;
+	last = 0;
+    }
+    this = look;
+
+    for ( ; *look && strchr(ct,*look)==NULL; look++ ) ;
+    last = (! *look);
+    *look++ = '\0';
+    
+    return this;
+}
+
+
+void str_tolower (char *s)
+/*
+** Convert string to lower case
+*/
+{
+    if (!s) return;
+    for ( ; *s ; s++ )
+	if (isupper(*s))
+	    *s = tolower(*s);
+}
+
+void str_toupper (char *s)
+/*
+** Convert string to upper case
+*/
+{
+    if (!s) return;
+    for ( ; *s ; s++ )
+	if (islower(*s))
+	    *s = toupper(*s);
+}
+
+#ifdef NOSTRSTR
+/*
+** My routines for nice sun ones.
+*/
+char *strstr(char *cs, char *ct)
+/*
+** ANSI C has the function strstr().
+**
+**     strstr() returns a pointer to the first  occurrence  of  the
+**     pattern  string  s2  in  s1.   For example, if s1 is "string
+**     thing" and s2 is "ing", strstr() returns "ing thing".  If s2
+**     does not occur in s1, strstr() returns NULL.
+**
+** It's not always implemented. Here's my cludge:
+*/
+{
+    int i;
+    int len_ct;
+    int end;
+    len_ct = strlen(ct);
+    end = strlen(cs) - len_ct;
+    for (i=0;i<=end;i++)
+      if (strncmp(&cs[i],ct,len_ct)==0)
+	return &cs[i];
+
+    return NULL;
+}
+#endif
+
+#ifdef NOSTRDUP
+char *strdup(char *str)
+/*
+** SunOS has a nice strdup() function.
+**
+**     strdup() returns a pointer to a new string which is a dupli-
+**     cate  of the string pointed to by s1.  The space for the new
+**     string is obtained using malloc(3V).  If the new string can-
+**     not be created, a NULL pointer is returned.
+**
+** Other ANSI C libraries don't have this. Here is my kludge:
+*/
+{
+    char *newstr;
+    int i = strlen(str);
+
+    if ((newstr = (char *)malloc((unsigned int)(i+1))) == NULL)
+        return NULL;
+
+    for (; i>=0; i--)
+        newstr[i] = str[i];
+
+    return newstr;
+}
+#endif
+
+
+
diff --git a/src/Misc/usleep-bsd.c b/src/Misc/usleep-bsd.c
new file mode 100644
index 0000000..296bd0a
--- /dev/null
+++ b/src/Misc/usleep-bsd.c
@@ -0,0 +1,75 @@
+#include <stdio.h>
+#include <signal.h>
+#include <sys/time.h>
+
+static int rung_alarm;
+static void wakeup() {
+    rung_alarm++;
+}
+
+/*
+ * This sleeps for usecond microseconds. The implementation uses an itimer to
+ * set an interrupt and wait for it. Complications arise where there is
+ * already one set. Here we subtract our wait from the existing one (when
+ * appropriate).
+ */
+int usleep(unsigned int useconds) {
+    struct itimerval value, ovalue;
+    struct sigvec    vec,   ovec;
+    int old_mask;
+
+    if (!useconds)
+	return 0;
+    
+    /* Initialise our itimer to zero time */
+    timerclear(&value.it_interval);
+    timerclear(&value.it_value);
+    
+    /* Fetch the old itimer */
+    if (-1 == setitimer(ITIMER_REAL, &value, &ovalue))
+	return -1;
+
+    /* Set up our timer structure */
+    value.it_value.tv_sec  = useconds / 100000;
+    value.it_value.tv_usec = useconds % 100000;
+    
+    /*
+     * If it's set and is set for a time further in the future than ours then
+     * we subtract our time from its. (Yuk this is a horrid macro). Otherwise
+     * we set our timer to the old one (so we wake up in time ;)
+     */
+    if (timercmp( &ovalue.it_value, &value.it_value, > )) {
+	ovalue.it_value.tv_usec -= value.it_value.tv_usec;
+	ovalue.it_value.tv_sec  -= value.it_value.tv_sec;
+	if (ovalue.it_value.tv_usec < 0) {
+	    ovalue.it_value.tv_usec += 1000000;
+	    ovalue.it_value.tv_sec--;
+	} else {
+	    value.it_value = ovalue.it_value;
+	    ovalue.it_value.tv_sec = 0;
+	    ovalue.it_value.tv_usec = 50000; /* HACK arbitrary minimal pause */
+	}
+    }
+
+    /* Initialise our sigvec structure */
+    vec.sv_handler = wakeup;
+    vec.sv_mask = 0;
+    vec.sv_onstack = 0;
+
+    /* Create an alarm */
+    (void)sigvec(SIGALRM, &vec, NULL);
+
+    /* Wait for the alarm to go off */
+    old_mask = sigblock(sigmask(SIGALRM));
+    rung_alarm = 0;
+    while (!rung_alarm)
+	/* wait for a SIGALRM - but also allow other signals to be caught */
+	sigpause(old_mask &~ sigmask(SIGALRM));
+
+    /* tidy up */
+    (void)sigvec(SIGALRM, &ovec, NULL);
+    (void)sigsetmask(old_mask);
+    (void)setitimer(ITIMER_REAL, &ovalue, NULL);
+
+    return 0;
+}
diff --git a/src/Misc/usleep-gen.c b/src/Misc/usleep-gen.c
new file mode 100644
index 0000000..3928f64
--- /dev/null
+++ b/src/Misc/usleep-gen.c
@@ -0,0 +1,29 @@
+#include <stdio.h>
+#include <sys/types.h>
+#include <sys/time.h>
+
+/*
+ * This sleeps for `usecond' microseconds. This implementation uses select to
+ * wait for input on no file descriptors with `useconds' timeout. However it
+ * does not wake early if it gets hit by a signal.
+ */
+int usleep(unsigned int useconds) {
+    struct timeval tv;
+
+    tv.tv_sec  = useconds / 1000000;
+    tv.tv_usec = useconds % 1000000;
+
+    if (-1 == select(0, NULL, NULL, NULL, &tv))
+	return -1;
+    else
+	return 0;
+}
+
+typedef int int_f;
+
+/*
+ * FORTRAN interface
+ */
+void usleep_(int_f *useconds) {
+    (void)usleep((unsigned int)*useconds);
+}
diff --git a/src/abi/2rs b/src/abi/2rs
new file mode 100644
index 0000000..3d38a26
Binary files /dev/null and b/src/abi/2rs differ
diff --git a/src/abi/2rs.c b/src/abi/2rs.c
new file mode 100644
index 0000000..3d0f7d1
--- /dev/null
+++ b/src/abi/2rs.c
@@ -0,0 +1,54 @@
+#include <stdio.h>
+#include <ctype.h>
+/*
+** SD 2 August 1991
+**   Changes way of interpreting uncertainty codes so that
+**   we now only generate C A G T and -
+*/
+main (int argc, char **argv)
+{
+    char c ;
+    int i = 0 ;
+
+    if (argc != 2 || *argv[1] != '-') {
+	fprintf (stderr,"Usage: '2rs -form' : form is abi or alf; a filter\n") ;
+	exit (1) ;
+    }
+
+    if (!strcmp (&argv[1][1],"abi"))
+	while ((c = getc (stdin)) != EOF) {
+	    switch (c) {
+		case 'N' : c = '-' ; break ;
+	    }
+	    putc (c,stdout) ;
+	    if (!(++i%50))
+		putc ('\n',stdout) ;
+	}
+      else if (! strcmp (&argv[1][1],"alf"))
+    	/* ALF lower case uncertainty codes mean that the base may be missing.
+	   RS uncertain length codes mean that there may be an extra base.
+	   So I have to delay output by one character. Use oldc for this.
+	*/
+	while ((c = getc (stdin)) != EOF) { 
+	    switch (c) {
+	    case 'A':
+	    case 'C':
+	    case 'G':
+	    case 'T':
+		break;
+	    default:
+		if (isupper(c))
+		    c = '-';
+		else
+		    c = '\0';
+		break;
+	    }
+	    if (c) {
+		putc (c,stdout);
+		if (!(++i%50))
+		    putc ('\n',stdout) ;
+	    }
+	}
+
+    putc ('\n',stdout) ;
+}
diff --git a/src/abi/README b/src/abi/README
new file mode 100644
index 0000000..3db698a
--- /dev/null
+++ b/src/abi/README
@@ -0,0 +1,25 @@
+abiprocess Scripts and Filters
+------------------------------
+The scripts and filters here were developed at the MRC-LMB for INTERNAL
+USE ONLY. As we are aware that people elsewhere will want to develop
+similar software we include them in the Staden package as EXAMPLES of
+what can be achieved.
+
+THESE SCRIPTS WILL NOT WORK ON YOUR SYSTEM WITHOUT MODIFICATION.
+
+This directory contains:
+    2rs.c		- filter to convert abi and alf uncertainty codes to
+			  staden uncertainty codes
+    abibackup		- script to backup a folder and transfer to a project
+			  database
+    abiprocess		- script to process a batch of ABI data.
+    abirestore		- script to transfer abi data files back from a
+    			- project directory to the Mac-files-{user name} region.
+    abiscripts.tex	- Documentation for abi scripts
+    getABISampleName.c	- Extract the Sample name from and ABI trace file
+    makefile		- To compile c programs
+
+
+Happy hacking,
+
+    sd@uk.ac.cam.mrc-lmb
diff --git a/src/abi/abi2sun b/src/abi/abi2sun
new file mode 100644
index 0000000..8985946
--- /dev/null
+++ b/src/abi/abi2sun
@@ -0,0 +1,88 @@
+#! /bin/sh
+# shell script to process a batch of ABI data.
+# it assumes that the raw data is in ~/Mac-files-$USER/'Results date',
+# where date is e.g. 10_28_90.
+# after preliminary checks it runs ted on each file
+# then forces the user to back up on tape
+# then copies everything across to the project directory
+# (latter two steps are done in abibackup, so they can be executed on a
+#  remote machine - cele in our case)
+#
+# RMD 6/12/90, based on an original of JES
+#
+# define machine on which qic drive resides
+QIC_MACH=dumpy
+
+if test ! -n "$DISPLAY"
+  then
+    echo "Sorry - you must run abiprocess under X windows"
+    exit 1
+  fi
+
+case $# in
+  2) ;;
+  *) echo 'Usage: abiprocess machine_letter date' 1>&2; exit 2
+esac
+
+if test ! -d $HOME/Mac-files-$USER/Results?$1?$2
+  then
+    echo "Bad folder machine or date"
+    exit 1
+  fi
+cd $HOME/Mac-files-$USER/Results?$1?$2
+
+echo "abiprocess: runs ted, backs up on tape, and transfers to project"
+echo "  To abort, type CTRL/C in the xterm window while in ted"
+echo "  If you do this, or you don't backup, just rerun abiprocess; you"
+echo "    will not have to run ted again on traces you have already processed."
+echo "  You can not transfer to your project without backing up."
+
+# next loop runs ted on each eligible file
+
+filenames=${2}${1}fn	# file of filenames
+
+for i in 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 
+  do
+    oldname=Sample?$i
+    if test ! -f $oldname
+      then continue
+    fi
+# sed in next line replaces spaces in sample name by  '_', since
+# spaces mess things up badly (they shouldn't be there!)
+    newname=`echo \`getABISampleName $oldname\` | sed 's/ /_/g'`
+    echo $i $newname
+    if test $newname 
+      then if test -f $oldname.Seq
+        then 
+          ted -ABI $oldname -output $newname -raw $newname"RES"
+	  if test ! $? = 0
+	    then 
+	      echo "ted failed - am exiting without changing that clone"
+	      exit 1
+	    fi
+#          2rs -abi < "Sample $i.Seq" > $newname"MCH"
+	  rm $oldname.Seq
+          if test -f $newname
+	    then
+	      mv $oldname $newname"RES"
+	      cat >> $filenames << EOF
+$newname
+EOF
+	    else 
+	      mv $oldname $newname"FLD"
+          fi
+        else mv $oldname $newname"FLD"
+        fi
+# if they didn't attach a name to their clone then destroy the data
+      else 
+	if test -f "Sample $i.Seq"
+          then
+	    echo "Can't find name for sample $i - get help!"
+	    exit 1 
+          else
+	    rm $oldname
+	fi
+    fi
+  done
+
+rsh $QIC_MACH /usr/local/bin/abibackup  "\"${1}?${2}\""
diff --git a/src/abi/abibackup b/src/abi/abibackup
new file mode 100644
index 0000000..97508c8
--- /dev/null
+++ b/src/abi/abibackup
@@ -0,0 +1,68 @@
+#! /bin/sh
+# shell script to be run on cele to back up a folder and transfer to project
+# define machine on which QIC drive resides
+cd $HOME/Mac-files-$USER
+
+# next line needed to prevent mt from rewinding at the end of each operation
+unset TAPE
+echo "${1}"
+
+echo
+echo "Which tape drive are you using, the Xabyte (small video tape) or the large"
+echo "format QIC drive (large tape labelled DC 6150)?"
+echo
+echo -n "Enter \"large\" for large tape or \"small\" for small tape "
+read tapestring
+case "$tapestring" in
+	l*) tapenumber=0 ;;
+	s*) tapenumber=1 ;;
+esac
+
+while test ! "$tapenumber"
+ do
+	echo "You must enter either \"large\" or \"small\" for the tape size"
+	read tapestring
+	case "$tapestring" in
+		l*) tapenumber=0 ;;
+		s*) tapenumber=1 ;;
+	esac
+ done
+
+
+echo -n "load tape - how many data sets are already on this tape? "
+read nmark
+while test ! "$nmark"
+  do
+    echo -n "You must give a number of previous files. How many? "
+    read nmark
+  done
+
+mt -f /dev/nrst$tapenumber fsf $nmark
+if test ! $? = 0
+  then
+    echo "tape positioning failed - is tape loaded? or file number correct?"
+    exit 1
+  fi
+
+tar cvef /dev/rst$tapenumber Results?${1}
+if test ! $? = 0
+  then
+    echo "tape archive failed - perhaps tape is write protected"
+    exit 1
+  fi
+
+echo -n "Give project name to move data to (blank for no move): "
+read project
+while test "$project"
+  do 
+    if test -d $HOME/$project
+      then
+        mv -i Results?${1}/* $HOME/$project
+        rm -r Results?${1}
+	exit 0
+      else
+        echo -n "Project directory not found - please retype: "
+	read project
+      fi
+  done
+
diff --git a/src/abi/abiget b/src/abi/abiget
new file mode 100644
index 0000000..62028e8
--- /dev/null
+++ b/src/abi/abiget
@@ -0,0 +1,38 @@
+#! /bin/csh
+# shell script to get a batch of ABI data via NCSA telnet
+# it transfers the data into ~/Mac-files-$USER/'Results date',
+# where date is e.g. 10_28_90.
+# Currently I assume that the data are on abi2mac (2nd ABI machine)
+#
+# RMD 29/6/91
+#
+
+switch ($#argv)
+  case "1": 
+	breaksw
+  default: 
+	echo 'Usage: abiget date' ; exit 2
+endsw
+
+set putdir = $HOME/Mac-files-$USER/"Results $1" 
+if (-d "$putdir") then
+  echo -n "Results directory already exits - proceed? "
+  set answer = $<
+  if ($answer == "n" || $answer == "N") then
+    exit 1
+  endif
+else
+  mkdir "$putdir"
+endif
+
+cd "$putdir"
+#sed s/date/$1/g < /nfs/cele/home2/public/BioSW/abi/ftprun > ./ftprun
+cp /nfs/cele/home2/public/BioSW/abi/ftprun .
+echo "Use the 'Select Transfer Directory...' option of the File Menu"
+echo "on the Mac to select the data folder (go to it then click on"
+echo -n "'Set Directory').  Then hit return in this window."
+set answer = $<
+ftp -in abi2mac < ftprun
+\rm ftprun
+ls -xsF
+exit 0
diff --git a/src/abi/abiprocess b/src/abi/abiprocess
new file mode 100644
index 0000000..8e5e696
--- /dev/null
+++ b/src/abi/abiprocess
@@ -0,0 +1,81 @@
+#! /bin/sh
+# shell script to process a batch of ABI data.
+# it assumes that the raw data is in ~/Mac-files-$USER/'Results date',
+# where date is e.g. 10_28_90.
+# after preliminary checks it runs ted on each file
+# then forces the user to back up on tape
+# then copies everything across to the project directory
+# (latter two steps are done in abibackup, so they can be executed on a
+#  remote machine - cele in our case)
+#
+# RMD 6/12/90, based on an original of JES
+#
+# define machine on which qic drive resides
+QIC_MACH=dumpy
+
+if test ! -n "$DISPLAY"
+  then
+    echo "Sorry - you must run abiprocess under X windows"
+    exit 1
+  fi
+
+case $# in
+  1) ;;
+  *) echo 'Usage: abiprocess date' 1>&2; exit 2
+esac
+
+if test ! -d $HOME/Mac-files-$USER/Results?$1
+  then
+    echo "Bad folder date"
+    exit 1
+  fi
+cd $HOME/Mac-files-$USER/Results?$1
+
+echo "abiprocess: runs ted, backs up on tape, and transfers to project"
+echo "  To abort, type CTRL/C in the xterm window while in ted"
+echo "  If you do this, or you don't backup, just rerun abiprocess; you"
+echo "    will not have to run ted again on traces you have already processed."
+echo "  You can not transfer to your project without backing up."
+
+# next loop runs ted on each eligible file
+
+filenames=${1}fn	# file of filenames
+
+for i in 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24
+  do
+    oldname=Sample?$i
+    if test ! -f $oldname
+      then continue
+    fi
+# sed in next line replaces spaces in sample name by  '_', since
+# spaces mess things up badly (they shouldn't be there!)
+    newname=`echo \`getABISampleName $oldname\` | sed 's/ /_/g'`
+    echo $i $newname
+    if test $newname 
+      then if test -f $oldname.Seq
+        then 
+          ted -ABI $oldname -output $newname -raw $newname"RES"
+	  if test ! $? = 0
+	    then 
+	      echo "ted failed - am exiting without changing that clone"
+	      exit 1
+	    fi
+          2rs -abi < "Sample $i.Seq" > $newname"MCH"
+	  rm $oldname.Seq
+          if test -f $newname
+	    then
+	      mv $oldname $newname"RES"
+	      cat >> $filenames << EOF
+$newname
+EOF
+	    else 
+	      mv $oldname $newname"FLD"
+          fi
+        else mv $oldname $newname"FLD"
+        fi
+# if they didn't attach a name to their clone then destroy the data
+      else rm $oldname
+    fi
+  done
+
+rsh $QIC_MACH /usr/local/bin/abibackup  "\"${1}\""
diff --git a/src/abi/abirestore b/src/abi/abirestore
new file mode 100644
index 0000000..0b1a2b5
--- /dev/null
+++ b/src/abi/abirestore
@@ -0,0 +1,45 @@
+#!  /bin/sh
+#
+# shell script to transfer abi data files back from a project directory
+# to the Mac-files-{user name} region.
+#
+
+case $# in
+  2) ;;
+  *) echo 'Usage: abirestore project date' 1>&2; exit 2
+esac
+
+if test ! -d $HOME/$1
+  then echo "Bad project" ; exit 1
+  fi
+cd $HOME/$1
+
+fnfile=${2}fn
+if  test ! -f $fnfile
+  then echo "Bad date" ; exit 1
+  fi
+
+dir=$HOME/Mac-files-$USER/restored.$2
+if test -d $dir
+  then
+    echo -n "Directory $dir already exists.  Shall I overwite it (y/n): " 
+    read answer
+    if test answer = "n"
+      then exit 1
+      fi
+  else
+    mkdir $dir
+    mkdir $dir/.rsrc
+  fi
+
+for i in `cat $fnfile`
+  do
+    if test -f ${i}RES
+      then cp ${i}RES $dir ; 
+	cp /home/BioSW/abi/big1.rsrc $dir/.rsrc/${i}RES
+      fi
+  done
+
+exit 0
+
+
diff --git a/src/abi/abiscripts.tex b/src/abi/abiscripts.tex
new file mode 100644
index 0000000..5e8fa6d
--- /dev/null
+++ b/src/abi/abiscripts.tex
@@ -0,0 +1,221 @@
+\documentstyle[12pt]{article}
+
+\oddsidemargin  0.0truein
+\evensidemargin 0.0truein
+\textwidth      6.5truein
+\marginparwidth 0.0truein
+\marginparsep   0.0truein
+\marginparpush  0.0truein
+
+\topmargin      0.0truein
+\textheight     9.0truein
+\headheight     0.0truein
+\headsep        0.0truein
+
+\setlength{\parskip}{0.25\baselineskip}
+
+\title{Processing ABI data}
+
+\author{Richard from a draft by John}
+
+\begin{document}
+
+\maketitle
+
+Before you start you need a user name and password on the Sun, and two
+subdirectories in your Sun home directories.  One of these is called
+{\sf Mac-files-{\em \{your user name\}}}, and is visible from the Mac.
+This is where the raw results folder gets transferred to.  The other
+is a subdirectory for your project, in which you will keep all the
+data for one cosmid and run {\sf sap}.  This is called by the name of
+the cosmid (e.g. ZK637). I can help set these up.
+
+\section{At the Mac}
+
+Settings for ABI run: 14 hours, analyse all samples ({\em N.B. you
+will lose all data on the Sun for samples that were not analysed on
+the Mac}). Fill in sample names according to the following rules:
+
+\begin{verse}
+Each microtitre plate has a letter plus a two digit number.  This
+identifier is unique to you and to a library made from a particular
+cosmid. (Fill in the index sheet on the wall to avoid conflicts.)\\
+The 96 clones in the microtitre plate are numbered a1, a2 .. h12.\\
+The sequence sample name is formed by adding a full stop, a character
+and a number.  The characters in use at the present time are: s
+(single stranded), f (double stranded, forward) and r (double
+stranded, reverse).   The number indicates which read  this one is,
+in a series of primer walks. Thus the first, shotgun, read always ends
+in 1.\\
+e.g. a09c8.s1 is plate a09, row c, column 8, single strand, first
+(shotgun) read.\\
+If you extract a second sequence file from the same raw data file, you
+should add another letter to the name (e.g. a09c8.s1a).  Another
+reaction and raw data read would be called by a different number (e.g.
+109c8.s2).
+It's a matter of personal taste whether you use upper or lower case
+letters, but I strongly recommend lower case ones for ease of UNIX typing.\\
+NB  If you want to process the data on the Sun you must give the
+sample a name, even if is a test sample.  Also, any spaces in the name
+will be turned into underscores by {\sf abiprocess}.\\
+\end{verse}
+
+After the ABI has run and processed the samples, use TOPS to transfer
+the results folder to the Sun.  The results folder is called ``Results
+{\em folder\_date}'' , where the date is at the time of {\em
+termination}.
+\begin{verse}
+Select {\bf TOPS} from the pulldown desk accessory menu (apple icon in top left).\\
+Select {\bf cele} from the list of machines on the right side of the
+TOPS application.\\
+Click on {\bf open} in the central panel (or double click on {\bf cele} in previous step).\\
+Give your Sun user name and password.\\
+Click on {\bf Mac-files-{\em you}}.\\
+Select {\bf mount} from the central panel - this creates a published
+volume icon, looking a bit like a reading stand, in the place where the disk 
+icons go at the right hand edge of the screen.\\
+Drag the icon for your results folder over to the published volume icon.\\
+Wait while your files get copied.\\
+Close TOPS and trash the published volume icon.\\
+\end{verse}
+
+Leave the results folder on the Mac until the results have been
+backed up onto tape.
+
+\section{At the Sun}
+
+Log on to a Sun (any one).  Put a tape in the tape drive (currently next to
+cele at the far end of room 5036).  You have to back up your data
+before {\sf abiprocess} will transfer the sequences to your project.
+
+Type {\sf abiprocess {\em folder\_date}}, where {\em folder\_date} is
+the date as given for the ABI results folder (i.e. with underscores
+and American ordering).  You do not need to {\sf cd} anywhere before
+doing this.  It will then run {\sf ted} for each sample
+for which there was a {\sf .Seq} file, i.e. for each one that was
+analysed on the Mac by the ABI software.
+
+For each ted run (i.e. each sequence) do the following:
+\begin{verse}
+Move the outline of the ted window to a suitable place on the screen and
+click to start (all clicks are with the left mouse button).\\
+Set the cutoffs, and edit as required.  Don't waste a lot of time at
+this stage on editing.  Make the right cut between 400 and 500
+usually.  The additional data will be saved, and can be used later.\\
+If you want to use the sequence from this run, click on {\bf Output},
+then on {\bf OK} in the pop-up output window.\\
+Click on {\bf Quit}.
+\end{verse}
+
+The files you should end up with are: one file of file names {\sf {\em
+folder\_date}fn}; for each sample name in this file a sap input format
+sequence file {\em sample\_name} and a raw data file renamed to {\sf
+{\em sample\_name}RES}; and for each sample not in this list (i.e.
+those for which you didn't output a sequence in ted) a raw data file
+renamed to {\sf {\em sample\_name}FLD}.
+
+After processing all the samples {\sf abiprocess} requires you to back
+up the data on tape.  You must have a tape loaded in the drive, with
+the door switch pulled across to the right.  The tape must be
+writable, i.e. the little button at top left must be rotated so that
+the arrow points away from SAFE.  You will be asked how many data sets
+have already been stored on the tape.  It is possible to store many
+folders, but you have to keep track of them yourself. If you give too
+small a number you will lose data and mess things up.
+
+If you don't have a tape set up when you hit Return to start the
+backup, then {\sf abiprocess} will abort without giving you the chance
+to copy the data to your projet directory.  However you can come back
+at any time in the future and run {\sf abiprocess} again with the same
+date to do the transfer.  This will {\em not} require you to run all
+the {\sf ted}'s again.  
+
+Similarly, you can yourself abort from {\sf abiprocess} by putting the
+cursor is in the xterm window from which you first ran {\sf
+abiprocess} and typing CTRL/C (holding down the Control key while
+pressing the C key).  To be sure that you leave things in a sensible
+state, do this when inside ted, but before you output a sequence.
+Again, if you do this, you can restart at a later date by running {\sf
+abiprocess} again.  You will restart with the sample that you were
+editing when you aborted.
+
+After backing up the results directory on tape, {\sf abiprocess} asks
+whether you want to transfer the processed data to a project, and if
+so, prompts for a project directory name.  When you give this all the
+files are transferred to the project directory, and the results folder
+subdirectory of {\sf Mac-files-{\em you}} is destroyed.  
+
+\subsection{Restoring data to the Mac from your project directory}
+
+If you want to restore some data to the Mac (for example to plot a
+trace) then you can use {\sf abirestore}.  Run {\sf abirestore {\em
+project folder\_date}}.  This creates a subdirectory {\sf
+restored.{\em folder\_date}} of your {\sf Mac\_files} directory that
+you can see from TOPS on the Mac.  When you mount it all the ABI data
+files will be on top of one another in the folder window (due to the
+way I fiddled the Mac resources files -- they all get the same one!),
+so it looks like there is only one.  To see them all use the finder
+options to either clean up the window or view by name.
+
+This only restores the successful reads (the ones you output sequence
+from in ted).  
+
+\subsection{Restoring data from tape - and more about tape usage}
+
+I haven't written a script to do this yet.  The following should work
+(anything after a \# sign is a comment that you don't need to type).
+\begin{verse}
+\# load the tape\\
+rlogin cele \hspace{2cm}  \# if you are not already on cele\\
+cd ~/Mac-files-{\em \{your login name\}}\\
+unsetenv TAPE \hspace{2cm} \# this is a fix for something I don't understand\\
+mt fsf {\em nskip} \hspace{2cm} \# where {\em nskip} is the number of data sets to skip\\
+\hspace{3in} \# (i.e. 0 to retrieve the first folder)\\
+tar xvf /dev/rst0\\
+\# unload the tape
+\end{verse}
+
+You should in principle be able to get about 30 complete abi run
+folders on a tape, but there are several reasons not to go up that
+high.  The first is that it will take rather a long time to wind
+forward to the right place when writing (or reading) the later ones.
+The second is that you should be backing up your entire project from
+time to time.  This is done by saving everything in the project
+directory.  In this case we do not bother to try to store more than
+one archive set on a tape, and so don't need the ``mt fsf'' command
+(and associated ``unsetenv TAPE'').  So the following should work:
+\begin{verse}
+\# load tape and rlogin to cele if necessary
+cd ~\\
+tar cvf /dev/rst0 {\em project\_name}
+\end{verse}
+You can restore it if need be with:
+\begin{quote}
+tar xvf /dev/rst0
+\end{quote}
+or just get a listing of what it contains with
+\begin{quote}
+tar tvf /dev/rst0
+\end{quote}
+
+Once you have backed up the project, that tape will contain all the
+raw data files as well as the assembly data, so you can erase the tape
+you were using to store the original folders.  You do this with the
+command
+\begin{quote}
+mt erase
+\end{quote}
+
+\subsection{For those you want to know a bit more}
+
+{\sf abiprocess} and {\sf abirestore} are both shell scripts,
+containing a mixture of UNIX commands and control statements.  They
+can be found in {\sf /usr/local/bin} along with most other public
+local programs, such as {\sf ted}.  If you want to see what they do
+you can look at them with {\sf cat} or {\sf more} or your favourite
+editor, or print them out with {\sf lpr}.  In fact {\sf abiprocess}
+runs a remote shell script on cele to do the tape backup and transfer
+to project.  This is called {\sf abibackup} and is also found in {\sf
+/usr/local/bin}.
+\end{document}
+
diff --git a/src/abi/big1.rsrc b/src/abi/big1.rsrc
new file mode 100644
index 0000000..19bc2ed
Binary files /dev/null and b/src/abi/big1.rsrc differ
diff --git a/src/abi/big1.rsrc.dump b/src/abi/big1.rsrc.dump
new file mode 100644
index 0000000..565e1ba
--- /dev/null
+++ b/src/abi/big1.rsrc.dump
@@ -0,0 +1,33 @@
+ 0    5   16    7    0    1    0    0   4d M 61 a 63 c 69 i 6e n 74 t 6f o 73 s 
+68 h 20   20   20   20   20   20   20    0    3    0    0    0    7    0    0   
+ 0   4c L  0    0    0   20    0    0    0    8    0    0    0   6c l  0    0   
+ 0    c    0    0    0    2    0    0    2    0    0    0    2    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0   41 A 42 B 49 I 31 1 
+41 A 42 B 49 I 31 1  1    0    0    0    0   40 @  0    0   14   71 q  0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0   74 t a0   66 f 29 ) 
+74 t a0   6a j 19   27 ' 43 C 32 2 4e N  0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ 0    0    0    0    0    0    0    0    0    0    0    0    0    0    0    0   
+ff   
\ No newline at end of file
diff --git a/src/abi/ftprun b/src/abi/ftprun
new file mode 100644
index 0000000..eac1a20
--- /dev/null
+++ b/src/abi/ftprun
@@ -0,0 +1,4 @@
+binary
+verbose
+mget *
+quit
diff --git a/src/abi/getABISampleName b/src/abi/getABISampleName
new file mode 100644
index 0000000..76e7f37
Binary files /dev/null and b/src/abi/getABISampleName differ
diff --git a/src/abi/getABISampleName.c b/src/abi/getABISampleName.c
new file mode 100644
index 0000000..7840e5f
--- /dev/null
+++ b/src/abi/getABISampleName.c
@@ -0,0 +1,161 @@
+/*
+    Title: 	 getABISampleName
+
+    Purpose:	 Extract the Sample name from and ABI trace file
+    Last update: Fri Oct 25 1991
+
+    Change log:
+
+	25-Oct-1991 SD Machine independant I/O
+*/
+
+
+/*
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include <stdlib.h>
+#include <sys/types.h>
+#include "mach-io.h"
+
+
+
+/* ---- Constants ---- */
+
+
+/*
+    The ABI trace file structure is based around an `index'
+    which is located towards the end of the file. It's location
+    is given by a longword at a fixed place.
+*/
+#define IndexPO ((off_t)26)
+
+/*
+    The index consists of a number of entries.
+    Each of these is a label of 4 characters followed by 6 long words.
+*/
+#define LW1 4
+#define LW2 8
+#define LW3 12
+#define LW4 16
+#define LW5 20
+#define LW6 24
+#define IndexEntryLength 28
+
+/*
+    Here is the label we will be looking for (SMPL), packed
+    into a long word.
+*/
+#define SampleNameEntryLabel  ((int_4) ((((('S'<<8)+'M')<<8)+'P')<<8)+'L')
+
+
+
+/* ---- Internal functions ---- */
+
+
+static int_4 readABIInt4(FILE *fp)
+{
+    unsigned char buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(int_4), 1, fp) != 1) {
+	fprintf(stderr,"Fread fail\n");
+	exit(1);
+    }
+    return (int_4) 
+	   (((int_4)buf[3]) +
+	    ((int_4)buf[2]<<8) +
+	    ((int_4)buf[1]<<16) +
+	    ((int_4)buf[0]<<24));
+
+}
+
+
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{   FILE *fp = NULL;
+    int_4 indexO, sampleNameO;
+    int sampleNameLen;
+    int_4 val;
+    int entryNum;
+
+    if (argc != 2)
+    {   fprintf(stderr, "%s: usage\n", argv[0]);
+	exit(1);
+    }
+
+
+    /* Open the gel file: read, binary */
+    if ((fp = fopen(argv[1], "rb")) == NULL)
+    {   fprintf(stderr, "%s: Unable to open file %s\n", argv[0], argv[1]);
+	exit(1);
+    }
+
+
+    /* Get the index offset */
+    if (fseek(fp, IndexPO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    indexO = readABIInt4(fp);
+
+
+    /* Find the sample name entry by searching for the correct label */
+    entryNum = 0;
+    (void) fseek(fp, indexO+(entryNum*IndexEntryLength), 0);
+    while ((val = readABIInt4(fp)) != SampleNameEntryLabel)
+    {   entryNum++;
+	(void) fseek(fp, indexO+(entryNum*IndexEntryLength), 0);
+    }
+
+    /* Get the offset of the actual sample name */
+    (void) fseek(fp, indexO+(entryNum*IndexEntryLength)+LW4, 0);
+    if (readABIInt4(fp) <= 4) {
+	sampleNameO = indexO+(entryNum*IndexEntryLength)+LW5;
+    } else {
+	/* Get the offset of the actual sample name */
+	(void) fseek(fp, indexO+(entryNum*IndexEntryLength)+LW5, 0);
+	sampleNameO = readABIInt4(fp);
+    }
+
+    /* If this is 0, there is The Sample With No Name */
+    if (sampleNameO == (int_4)0)
+    {   fclose(fp);
+	exit(0);
+    }
+
+
+    /*
+        Write the sample name out on the standard output
+	The name is in a BCPL like format -
+	the first byte holds the string length
+    */
+    if (fseek(fp, sampleNameO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    sampleNameLen = (int) fgetc(fp);
+    while (sampleNameLen > 0)
+    {   fputc(fgetc(fp), stdout);
+	sampleNameLen--;
+    }
+
+
+    fclose(fp);
+    exit(0);
+}
+
+
+
diff --git a/src/abi/mach-io.h b/src/abi/mach-io.h
new file mode 100644
index 0000000..5783df3
--- /dev/null
+++ b/src/abi/mach-io.h
@@ -0,0 +1,115 @@
+#ifndef _mach_io_h
+#define _mach_io_h
+/*
+** Machine independant io
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value successfully read or written
+*/
+
+#include <stdio.h>
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+extern int be_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a big-endian int_1
+*/
+
+extern
+int be_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a big-endian int_2
+*/
+
+extern
+int be_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a big-endian int_4
+*/
+
+extern
+int be_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a big-endian int_1
+*/
+
+extern
+int be_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a big-endian int_2
+*/
+
+extern
+int be_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a big-endian int_4
+*/
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+extern
+int le_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a little-endian int_1
+*/
+
+extern
+int le_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a little-endian int_2
+*/
+
+extern
+int le_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a little-endian int_4
+*/
+
+extern
+int le_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a little-endian int_1
+*/
+
+extern
+int le_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a little-endian int_2
+*/
+
+extern
+int le_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a little-endian int_4
+*/
+
+
+#endif /* _mach_io_h */
diff --git a/src/abi/makefile-alpha b/src/abi/makefile-alpha
new file mode 100644
index 0000000..f200d15
--- /dev/null
+++ b/src/abi/makefile-alpha
@@ -0,0 +1,40 @@
+#
+# Makefile for abi - Alpha version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = getABISampleName
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Alpha
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact -I$(SRC)/../ted
+LD = cc
+LDFLAGS = -g -non_shared
+
+#
+# Make the programs
+#
+getABISampleName: getABISampleName.o
+	$(LD) $(LDFLAGS) -o getABISampleName getABISampleName.o
+
+
+#
+# Special dependencies
+#
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
diff --git a/src/abi/makefile-dec b/src/abi/makefile-dec
new file mode 100644
index 0000000..cc744e2
--- /dev/null
+++ b/src/abi/makefile-dec
@@ -0,0 +1,34 @@
+#
+# Makefile for abi - DEC Ultrix (mips) version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = getABISampleName
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sun
+#
+CC = c89
+CFLAGS = -g -std -w0 -I$(SRC)/../ted
+
+#
+# Make the programs
+#
+getABISampleName: $(SRC)/getABISampleName.c
+	$(CC) $(CFLAGS) -o getABISampleName $(SRC)/getABISampleName.c
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
diff --git a/src/abi/makefile-sgi b/src/abi/makefile-sgi
new file mode 100644
index 0000000..629dac4
--- /dev/null
+++ b/src/abi/makefile-sgi
@@ -0,0 +1,36 @@
+#
+# Makefile for abi - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs made by this makefile
+#
+PROGS = getABISampleName
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sgi
+#
+CC = cc
+CFLAGS = -ansi -g -prototypes -I$(SRC)/../ted
+
+#
+# Make the programs
+#
+getABISampleName: $(SRC)/getABISampleName.c
+	$(CC) $(CFLAGS) -o getABISampleName $(SRC)/getABISampleName.c
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
diff --git a/src/abi/makefile-solaris b/src/abi/makefile-solaris
new file mode 100644
index 0000000..3211a5d
--- /dev/null
+++ b/src/abi/makefile-solaris
@@ -0,0 +1,37 @@
+#
+# Makefile for abi - Sun version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = getABISampleName
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC=$(SRC)/../ted
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -ansi -g -pedantic -I$(TEDSRC) $(GCCWARNINGS)
+
+#
+# Make the programs
+#
+getABISampleName: $(SRC)/getABISampleName.c
+	$(CC) $(CFLAGS) -static -o getABISampleName $(SRC)/getABISampleName.c
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
diff --git a/src/abi/makefile-sun b/src/abi/makefile-sun
new file mode 100644
index 0000000..64850a4
--- /dev/null
+++ b/src/abi/makefile-sun
@@ -0,0 +1,36 @@
+#
+# Makefile for abi - Sun version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = getABISampleName
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC=$(SRC)/../ted
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -ansi -g -pedantic -I$(TEDSRC) $(GCCWARNINGS)
+
+#
+# Make the programs
+#
+getABISampleName: $(SRC)/getABISampleName.c
+	$(CC) $(CFLAGS) -static -o getABISampleName $(SRC)/getABISampleName.c
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
diff --git a/src/alf/2rs.c b/src/alf/2rs.c
new file mode 100644
index 0000000..3d0f7d1
--- /dev/null
+++ b/src/alf/2rs.c
@@ -0,0 +1,54 @@
+#include <stdio.h>
+#include <ctype.h>
+/*
+** SD 2 August 1991
+**   Changes way of interpreting uncertainty codes so that
+**   we now only generate C A G T and -
+*/
+main (int argc, char **argv)
+{
+    char c ;
+    int i = 0 ;
+
+    if (argc != 2 || *argv[1] != '-') {
+	fprintf (stderr,"Usage: '2rs -form' : form is abi or alf; a filter\n") ;
+	exit (1) ;
+    }
+
+    if (!strcmp (&argv[1][1],"abi"))
+	while ((c = getc (stdin)) != EOF) {
+	    switch (c) {
+		case 'N' : c = '-' ; break ;
+	    }
+	    putc (c,stdout) ;
+	    if (!(++i%50))
+		putc ('\n',stdout) ;
+	}
+      else if (! strcmp (&argv[1][1],"alf"))
+    	/* ALF lower case uncertainty codes mean that the base may be missing.
+	   RS uncertain length codes mean that there may be an extra base.
+	   So I have to delay output by one character. Use oldc for this.
+	*/
+	while ((c = getc (stdin)) != EOF) { 
+	    switch (c) {
+	    case 'A':
+	    case 'C':
+	    case 'G':
+	    case 'T':
+		break;
+	    default:
+		if (isupper(c))
+		    c = '-';
+		else
+		    c = '\0';
+		break;
+	    }
+	    if (c) {
+		putc (c,stdout);
+		if (!(++i%50))
+		    putc ('\n',stdout) ;
+	    }
+	}
+
+    putc ('\n',stdout) ;
+}
diff --git a/src/alf/README b/src/alf/README
new file mode 100644
index 0000000..fe98c91
--- /dev/null
+++ b/src/alf/README
@@ -0,0 +1,24 @@
+alfprocess Scripts and Filters
+------------------------------
+The scripts and filters here were developed at the MRC-LMB for INTERNAL
+USE ONLY. As we are aware that people elsewhere will want to develop
+similar software we include them in the Staden package as EXAMPLES of
+what can be achieved.
+
+THESE SCRIPTS WILL NOT WORK ON YOUR SYSTEM WITHOUT MODIFICATION.
+
+This directory contains:
+    2rs.c		- filter to convert abi and alf uncertainty codes to
+			  staden uncertainty codes
+    alfbackup.sh        - script to be back up a folder and transfer to project
+    alfprocess		- script to process a batch of alf data
+    alfprocess2		- script to process a batch of alf data
+    alfprocessDoc	- documentation on alfprocess
+    alfsplit.c		- Takes big combined alf results file, and splits it
+			  into separate files for each clone.
+    alftransfer		- script to transfer alf files into a results directory
+    makefile		- To compile c programs
+
+Happy hacking,
+
+    sd@uk.ac.cam.mrc-lmb
diff --git a/src/alf/alfbackup.sh b/src/alf/alfbackup.sh
new file mode 100644
index 0000000..fe99868
--- /dev/null
+++ b/src/alf/alfbackup.sh
@@ -0,0 +1,64 @@
+#! /bin/sh
+# shell script to be run on cele to back up a folder and transfer to project
+# first argument specifies working directory second gives date
+cd ${1}
+# now move up one directory
+# target directory is current dir, useful to save name for tar
+tardir=`echo ${1} |  sed 's/\// /g' | awk '{print $NF;}'`
+cd ..
+
+# next line needed to prevent mt from rewinding at the end of each operation
+# May not need this now because now mt -f /dev/nrst0 -CWL
+unset TAPE
+
+echo -n "load tape - how many data sets are already on this tape? "
+read nmark
+while test ! "$nmark"
+  do
+    echo -n "You must give a number of previous files. How many? "
+    read nmark
+  done
+
+mt -f /dev/nrst0 fsf $nmark
+if test ! $? = 0
+  then
+    echo "tape positioning failed - is tape loaded? or file number correct?"
+    exit 1
+  fi
+# tar with rewind after done
+tar cvef /dev/rst0 $tardir		 # temp modification
+if test ! $? = 0
+  then
+    echo "tape archive failed - perhaps tape is write protected"
+    exit 1
+  fi
+
+# now move back down
+cd $tardir 
+echo "Give project name to move data to (blank for no move) "
+echo -n "and give path name starting from your home directory:"
+read project
+
+while test "$project"
+  do 
+    if test -d $HOME/$project
+      then
+	for i in `cat ${2}fn` ;
+	   do
+		mv -i $i $HOME/$project ;
+		mv -i $i"ALF" $HOME/$project
+	done
+	mv -i ${2}fn $HOME/$project ;
+#       rm -r Results${2} # temp change -cwl
+	exit 0
+      else
+        echo -n "Project directory not found - please retype: "
+	read project
+      fi
+  done
+
+
+
+
+
+
diff --git a/src/alf/alfprocess b/src/alf/alfprocess
new file mode 100644
index 0000000..fa3a505
--- /dev/null
+++ b/src/alf/alfprocess
@@ -0,0 +1,78 @@
+#! /bin/csh -f
+# alfprocess
+# shell script to process a batch of alf data
+# from abiprocess
+#
+# It assumes that raw data is in ~/alf_e/'Results'$date
+# where date is in the form {yr}{month}{day}{letter}: eg: 910227a
+# for february 27, 1991, run "a"
+# thus in addition to the date, the run letter accounts for more than one
+# run a day:
+# 910227a  for first run of day
+# 910227b  for second run of day, etc.
+# 
+# It then runs alfsplit on that file producing separate files
+# for each clone in the format {clonename}ALF (eg a02b9.s2ALF)
+# get the file names that alfsplit outputs
+# For clone names it is essential that a real clone name has no spaces
+#   unnamed clones are ignored when the file names are gotten
+# 
+# It then invokes ted on each one of these files
+# (ie. ted -ALF $oldname -output $newname -raw $newname"RES")
+
+# FoFfile is the file of filenames
+
+if ( $#argv < 1) then
+	echo "Usage: alfprocess <date> (Example: alfprocess 910228a)" 
+	echo "       alfprocess presumes that data are in ~/alf_e/Results<date>"
+	echo "       (eg. ~/alf_e/Results910228a)"
+	echo
+	exit
+endif
+
+if ( ! -d $HOME/alf_e/Results${1} ) then
+	echo "Directory ~/alf_e/Results${1} doesn't exist"
+	echo "Check if date is correct and that the directory been created"
+	echo "Then run alfprocess again"
+	exit
+endif
+
+if ( $#DISPLAY == 0 ) then
+	echo "Must be running under X windows for alfprocess to run"
+	exit
+endif
+
+##descriptiong
+echo "Running alfprocess: looking for directory ~/alf_e/Results${1}"
+echo "Will run alfsplit on ${1}.alf, process clones through ted, and then"
+echo "backup directory to tape.  The accepted clones will be transferred"
+echo "to your project directory as well as the file of file names"
+
+cd $HOME/alf_e/"Results"${1}
+
+set FoFfile = ${1}fn
+set clones = `alfsplit ${1}.alf | awk '/^Clone/&& $4 ~ /-/ {print $3;}' `
+echo Clones: $clones
+
+if (-e $FoFfile) then
+	echo "Warning, file of filenames already exists ($FoFfile)"
+	echo "Press return to continue, Ctrl-c to exit"
+	$<
+endif
+	
+foreach clonename ( $clones )
+	set alfsplitname = $clonename"ALF"
+	echo -n "Processing clone: $clonename"
+	ted -ALF $clonename"ALF" -output $clonename -raw $clonename"ALF"
+	if ( ! -e $clonename ) then
+		echo " failed"
+		mv $clonename"ALF" $clonename"ALF.fld"
+	else
+		echo " accepted"
+		echo $clonename >> $FoFfile 
+	endif
+end
+
+rsh cele /usr/local/bin/alfbackup.sh $cwd ${1} 
+
+
diff --git a/src/alf/alfprocess2 b/src/alf/alfprocess2
new file mode 100644
index 0000000..d28c09e
--- /dev/null
+++ b/src/alf/alfprocess2
@@ -0,0 +1,41 @@
+#! /bin/csh -f
+# alftransfer
+# shell script to process a batch of alf data
+# this takes a date and creates a Results<date> directory in
+# ~/alf_e/ if it doesn't already exist then copies data there
+# and runs alfprocess (1) on it
+# It does not assume that raw data is in ~/alf_e/'Results'$date
+# where date is in the form {yr}{month}{day}{letter}: eg: 910227a
+# for february 27, 1991, run "a"
+# thus in addition to the date, the run letter accounts for more than one
+# run a day:
+# 910227a  for first run of day
+# 910227b  for second run of day, etc.
+
+
+if ( $#argv < 1) then
+	echo "alftransfer--a script to transfer alf files into your
+	echo "		   results directory"
+	echo 
+	echo "Usage: alfprocess2 <date>"
+	echo "       alfprocess moves data to  ~/alf_e/Results<date>"
+	echo "       creating the directory if necessary"
+	echo "Example:  alfprocess 910228a"
+	echo "          creates  ~/alf_e/Results910228a"
+	echo
+	exit
+endif
+
+if ( -e ${1}.alf ) then 
+	if ( ! -d $HOME/alf_e/Results${1} ) then
+		echo "Results directory did not exist, making $HOME/alf_e/Results${1}"
+		mkdir $HOME/alf_e/Results${1}
+	endif
+	cp -i ${1}.alf $HOME/alf_e/Results${1}
+endif
+
+cd $HOME/alf_e/Results${1}
+echo "Copying complete, now invoking alfprocess"
+alfprocess {$1}
+
+
diff --git a/src/alf/alfprocessDoc b/src/alf/alfprocessDoc
new file mode 100644
index 0000000..a0898dd
--- /dev/null
+++ b/src/alf/alfprocessDoc
@@ -0,0 +1,39 @@
+		       ALFPROCESS DOCUMENTATION
+
+alfprocess and alfprocess2 are scripts that take an alf
+trace file, break it down into smaller pieces corresponding to
+individual clones, and run ted, the trace editor on each one of the
+clones.  Afterwards, alfprocess will backup files to tape and move the accepted
+files into your project directory.
+
+I. alfprocess
+
+alfprocess is modeled directly on abiprocess, it requires that:
+
+(i) your raw data from the alf is held in the directory ~/alf_e/Results<date>
+    where <date> is written 910322a for March 22, 1991 run "a"
+    of the day
+
+(ii) it also requires that your project directory is given
+     relative to your home directory when it asks you for this name
+
+example usage:
+
+% alfprocess 910322a
+
+II. alfprocess2
+
+alfprocess2 allows you to move into any directory containing your raw
+data file (not necessarily ~/alf_e) to process your data.  It will
+organize this data file correctly--creating the proper "Results"
+directory if necessary.  This is useful if, for example, you have
+transferred the file 910322a.alf into the scratch directory
+/home/alf_d.  You can then type:
+
+% cd /home/alf_d
+% alfprocess2 910322a
+
+And the script will copy this file over to your home directory
+~/home/alf_e/Results910322a creating the directory if necessary.  Then
+it will do all the normal alfprocess activities.
+
diff --git a/src/alf/alfsplit.c b/src/alf/alfsplit.c
new file mode 100644
index 0000000..cb88d76
--- /dev/null
+++ b/src/alf/alfsplit.c
@@ -0,0 +1,383 @@
+/* alfsplit.c  
+   Written by Richard Durbin 12/28/90.
+   Takes big combined alf results file, and splits it into
+   separate files for each clone.
+   Only keep processed data, sequence data and experimental notes.
+   Although the format of the small files is based on that of an
+   ALF file officially split on the PC, they are unfortunately
+   not reaadable by ALFManager software on the PC.
+   */
+
+/* first give full function prototypes for system functions
+   these are incomplete in the Sun /usr/include/...
+   
+   Modified by Simon Dear 21 August 1991.
+   Ignore value of s3 in readIndexEntry in check on sensible values
+   This value is DirEntry.fType in Pharmacia documentation
+   
+   Modified by Simon Dear 25 October 1991.
+   Machine independant I/O
+
+   24 August 1992 [Simon Dear]
+   MAJOR HACK - to allow for readings which have no clone name
+   */
+
+typedef int mysize_t;
+typedef long mytime_t;
+
+#include <stdio.h>   /* because need FILE definition for prototypes */
+#include <sys/types.h>
+#include <stdarg.h>  /* varargs needed for v*printf() prototypes */
+#include "mach-io.h"
+
+/* stdio function prototypes */
+int       fclose   (FILE *stream);
+int       fgetc    (FILE *stream);
+int       _filbuf  (FILE *stream) ;
+/* Causes Alliant gyp
+int       _flsbuf  (unsigned char x, FILE *stream) ;
+*/
+FILE    * fopen    (const char *path, const char *mode);
+/* incompat. with Alpha
+int       fprintf  (FILE *stream, const char *format, ...);
+*/
+/* Causes Alliant gyp
+mysize_t  fread    (void *ptr, mysize_t size, mysize_t n, FILE *stream);
+*/
+int       fseek    (FILE *stream, long offset, int whence);
+long      ftell    (FILE *stream);
+/* Causes Alliant gyp
+mysize_t  fwrite   (const void *ptr, mysize_t size, mysize_t n,
+FILE *stream);
+*/
+/* incompat. with Alpha
+int       printf   (const char *format, ...);
+*/
+/*  sprintf has incompatible declarations in different SUN OS releases!
+void      sprintf  (char *buffer, const char *format, ...);
+*/
+/*
+int       vfprintf (FILE *stream, const char *format, va_list arglist);
+*/
+
+/* allocation prototypes that we use */
+/* void   * malloc    (mysize_t size);*/
+
+/* string.h prototypes that we use */
+int       strcmp   (const char *s1, const char *s2);
+int       strncmp  (const char *s1, const char *s2, mysize_t maxlen);
+int       atoi (const char *s) ;
+
+/* system prototypes */
+void      exit      (int status);
+
+#include <ctype.h>
+
+typedef int BOOL ;
+#define TRUE 1
+#define FALSE 0
+
+/********** routines to read and write Index entries ***********/
+
+static char junk[512] ;	/* for when we want to read/write junk */
+
+/***** architecture independant reads ******/
+static int_4 read_int_4(FILE *fp)
+
+{
+    unsigned char buf[sizeof(int_4)];
+    
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return 0;
+    return (int_4)
+        (((uint_4)buf[0]) +
+         ((uint_4)buf[1]<<8) +
+         ((uint_4)buf[2]<<16) +
+         ((uint_4)buf[3]<<24));
+}
+
+static int_2 read_int_2(FILE *fp)
+
+{
+    unsigned char buf[sizeof(int_2)];
+    
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return 0;
+    return (int_2)
+        (((uint_2)buf[0]) +
+         ((uint_2)buf[1]<<8));
+}
+
+static void write_int_4(FILE *fp, int_4 l)
+{
+    unsigned char buf[sizeof(int_4)];
+    
+    buf[0] = (unsigned char)(l&255);
+    buf[1] = (unsigned char)(l>>8&255);
+    buf[2] = (unsigned char)(l>>16&255);
+    buf[3] = (unsigned char)(l>>24&255);
+    
+    fwrite(buf, sizeof(buf), 1, fp);
+}
+
+static void write_int_2(FILE *fp, int_2 l)
+{
+    unsigned char buf[sizeof(int_2)];
+    
+    buf[0] = (unsigned char)(l&255);
+    buf[1] = (unsigned char)(l>>8&255);
+    
+    fwrite(buf, sizeof(buf), 1, fp);
+}
+
+
+
+
+
+
+
+typedef struct IndexEntryStruct
+{ int_4 isTraces ;
+  char  label[40] ;
+  int_4 dataLen ;
+  int_4 blockLen ;
+  int_4 offset ;
+} *IndexEntry ;
+
+static BOOL readIndexEntry (FILE *fil, IndexEntry ent)
+{
+    short s1,s2,s3 ;
+    
+#define readInt() (read_int_4(fil))
+#define readShort() (read_int_2(fil))
+    
+    clearerr (fil) ;
+    
+    s1 = readShort() ;
+    s2 = readShort() ;
+    s3 = readShort() ;
+    /* Was this, but s3 can be 0 now
+       if ((s1 != 1 || s3 != 1) && (s1 || s2 || s3))
+       */
+    if ((s1 != 1) && (s1 || s2))
+	return FALSE ;
+    ent->isTraces = (s2 == 4) ;
+    
+    fread (ent->label,40,1,fil) ;
+    ent->dataLen = readInt() ;
+    ent->blockLen = readInt() ;
+    ent->offset = readInt() ;
+    
+    fread (junk,70,1,fil) ;
+    
+    return !ferror (fil) ;
+}
+
+static BOOL writeIndexEntry (FILE *fil, IndexEntry ent)
+{
+    
+#define writeInt(xx) (write_int_4(fil,xx))
+#define writeShort(xx) (write_int_2(fil,xx))
+    
+    clearerr (fil) ;
+    
+    writeShort(1) ;
+    if (ent->isTraces)
+	writeShort(4) ;
+    else
+	writeShort(2) ;
+    writeShort(1) ;
+    
+    fwrite (ent->label,40,1,fil) ;
+    writeInt(ent->dataLen) ;
+    writeInt(ent->blockLen) ;
+    writeInt(ent->offset) ;
+    
+    fwrite (junk,70,1,fil) ;
+    
+    return !ferror (fil) ;
+}
+
+/************************************************************/
+
+void crash (char* format,...)
+{
+    va_list args ;
+    
+    va_start (args,format) ;
+    vfprintf (stderr,format,args) ;
+    va_end (args) ;
+    
+    exit (1) ;
+}
+
+/*****************/
+
+static void readLine (FILE *fil, char* cp)
+{
+    while ((*cp = fgetc(fil)) && *cp != EOF && *cp != '\n')
+	++cp ;
+    *cp = 0 ;
+}
+
+/*****************/
+#define MAXCLONES 10
+void main (int argc, char* *argv)
+{
+    FILE *inEnt, *inData, *outEnt[MAXCLONES], *outData[MAXCLONES] ;
+    /* open two pointers in each file - index and data */
+    IndexEntry EN ;
+    IndexEntry ent ;
+    char expLine[4][20],name[MAXCLONES][20],note[MAXCLONES][80],fname[25];
+    off_t seqOffset[MAXCLONES], dataOffset[MAXCLONES];
+    int_4 seqDataLen[MAXCLONES], dataDataLen[MAXCLONES];
+    int_4 seqBlockLen[MAXCLONES], dataBlockLen[MAXCLONES];
+    char buf[512] ;
+    int i,j,len ;
+    size_t lastDot,lastSlash;
+    
+    if (argc != 2)
+	crash ("Usage: alfsplit rawfilename\n") ;
+    
+    inData = fopen (argv[1],"r") ;
+    if (!(inEnt = fopen (argv[1],"r")))
+	crash ("Could not open file '%s'\n",argv[1]) ;
+    
+    /* first find the experimental notes entry and extract file names */
+    
+    ent = (IndexEntry) malloc (sizeof (struct IndexEntryStruct)) ;
+    EN = (IndexEntry) malloc (sizeof (struct IndexEntryStruct)) ;
+    if (fseek (inEnt,(off_t)512,0))
+	crash ("Could not seek to index in raw file\n") ;
+    while (TRUE)
+	{ if (!readIndexEntry (inEnt,EN))
+	      crash ("Can't find Experimental Notes index entry\n") ;
+	  if (!strcmp (EN->label,"ALF Experimental notes"))
+	      break ;
+      }
+    
+    if (fseek (inData,(off_t)EN->offset,0))
+	crash ("Can't seek to Experimental notes\n") ;
+    for (i = 0 ; i < 4 ; ++i)
+	readLine (inData,expLine[i]) ;
+    
+    /* determine default root name from argv[1]:
+     ** I assume this has the format {path}/{name}.alf
+     ** Default names will be {name}.1, {name}.2, ..., {name}.MAXCLONES
+     */
+    lastDot = (size_t)0;
+    for (i = strlen(argv[1])-1;i>=0 && argv[1][i] != '/'; i--)
+	if (lastDot==0 && argv[1][i] == '.') lastDot = (size_t)i;
+    if (lastDot==0) lastDot = strlen(argv[1]);
+    lastSlash = (size_t)i;
+    
+    for (i = 0 ; i < MAXCLONES ; ++i) {
+	readLine (inData,name[i]) ;
+	if (!strcmp (name[i],"blank") || *name[i]==0 )    /* clone not named */
+	    /* assume default */
+	    sprintf(name[i],
+		    "%.*s.%d",
+		    (int)(lastDot-lastSlash-1),
+		    argv[1]+lastSlash+1,
+		    (i>10)?i:(i+1)%10);
+    }
+    for (i = 0 ; i < MAXCLONES ; ++i)
+	readLine (inData,note[i]) ;
+
+    for (i = 0 ; i < MAXCLONES ; ++i)
+	seqOffset[i] = dataOffset[i] = 0;
+
+    /* gather offset information */
+    fseek (inEnt,(off_t)512,0) ;
+    while (readIndexEntry (inEnt,ent)) {
+	printf ("%s: %d\n",ent->label,ent->offset/512) ;
+	if (!strncmp (ent->label,"ALF Sequence data Clone ",24))
+	    len = 24 ;
+	else if (!strncmp (ent->label,"ALF Processed data Clone ",25))
+	    len = 25 ;
+	else
+	    continue ;
+	/* fall through to here if sequence or processed */
+	i = atoi (&ent->label[len]) - 1 ;
+	
+	if (len == 24) {
+	    seqOffset[i] = (off_t)ent->offset;
+	    seqBlockLen[i] = ent->blockLen;
+	    seqDataLen[i] = ent->dataLen;
+	} else {
+	    dataOffset[i] = (off_t)ent->offset;
+	    dataBlockLen[i] = ent->blockLen;
+	    dataDataLen[i] = ent->dataLen;
+	}
+    }
+
+    /* initialise output files for clones */
+    for (i = 0 ; i < MAXCLONES ; ++i) {
+	if (seqOffset[i]==0 && dataOffset[i]==0) {
+	    /* we are missing sequence and/or trace data */
+	    printf ("Clone %d: %s - NOT MAKING BECAUSE THERE IS NO TRACE AND SEQUENCE DATA\n",i+1,name[i]);
+	} else if (seqOffset[i]==0) {
+	    /* we are missing sequence and/or trace data */
+	    printf ("Clone %d: %s - NOT MAKING BECAUSE THERE IS NO SEQUENCE DATA\n",i+1,name[i]);
+	} else if (dataOffset[i]==0) {
+	    /* we are missing sequence and/or trace data */
+	    printf ("Clone %d: %s - NOT MAKING BECAUSE THERE IS NO TRACE DATA\n",i+1,name[i]);
+	} else  {
+	    printf ("Clone %d: %s - %s\n",i+1,name[i],note[i]) ;
+	    /* create the file and write the notes */
+	    sprintf (fname,"%sALF",name[i]) ;
+	    outData[i] = fopen (fname,"w") ;
+	    if(!fwrite (junk,512,1,outData[i]) )
+		fprintf(stderr, "could not write file: %s\n", fname);
+	    if(!fwrite (junk,512,1,outData[i]) )
+		fprintf(stderr, "could not write file: %s\n", fname);
+	    len = 0 ;
+	    for (j = 0 ; j < 4 ; ++j)
+		len += fprintf (outData[i],"%s\n",expLine[j]) ;
+	    len += fprintf (outData[i],"%s\n\n\n\n\n\n\n\n\n\n",name[i]) ;
+	    len += fprintf (outData[i],"%s\n\n\n\n\n\n\n\n\n\n",note[i]) ;
+	    fwrite (junk,512-len,1,outData[i]) ;
+	    /* now write the index entry */
+	    if (!(outEnt[i] = fopen (fname,"a")))
+		crash ("Couldn't open output file %s\n",fname) ;
+	    fseek (outEnt[i],(off_t)512,0) ;
+	    EN->offset = 1024 ;
+	    EN->dataLen = len ;
+	    writeIndexEntry (outEnt[i],EN) ;
+
+	    /*
+            ** Copy sequence and trace data
+	    */
+	    /* trace data */
+	    strcpy(ent->label,"ALF Processed data Clone 1");
+	    ent->isTraces = 1;
+	    ent->offset = (int_4)ftell (outData[i]) ;
+	    ent->blockLen = dataBlockLen[i];
+	    ent->dataLen = dataDataLen[i];
+	    
+	    fseek (inData,dataOffset[i],0) ;
+	    len = ent->blockLen/512 ;
+	    for (j = 0 ; j < len ; ++j) {
+		fread (buf,512,1,inData) ;
+		fwrite (buf,512,1,outData[i]) ;
+	    }
+	    
+	    writeIndexEntry (outEnt[i],ent) ;
+
+	    /* sequence data*/
+	    strcpy(ent->label,"ALF Sequence data Clone 1");
+	    ent->isTraces = 0;
+	    ent->offset = (int_4)ftell (outData[i]) ;
+	    ent->blockLen = seqBlockLen[i];
+	    ent->dataLen = seqDataLen[i];
+
+	    fseek (inData,seqOffset[i],0) ;
+	    len = ent->blockLen/512 ;
+	    for (j = 0 ; j < len ; ++j) {
+		fread (buf,512,1,inData) ;
+		fwrite (buf,512,1,outData[i]) ;
+	    }
+	    
+	    writeIndexEntry (outEnt[i],ent) ;
+
+	}
+    }
+}
diff --git a/src/alf/alftransfer b/src/alf/alftransfer
new file mode 100644
index 0000000..71be024
--- /dev/null
+++ b/src/alf/alftransfer
@@ -0,0 +1,43 @@
+#! /bin/csh -f
+# alftransfer
+# shell script to process a batch of alf data
+# this takes a date and creates a Results<date> directory in
+# ~/alf_e/ if it doesn't already exist then copies data there
+# and runs alfprocess (1) on it
+# It does not assume that raw data is in ~/alf_e/'Results'$date
+# where date is in the form {yr}{month}{day}{letter}: eg: 910227a
+# for february 27, 1991, run "a"
+# thus in addition to the date, the run letter accounts for more than one
+# run a day:
+# 910227a  for first run of day
+# 910227b  for second run of day, etc.
+
+
+if ( $#argv < 1) then
+	echo "alftransfer--a script to transfer alf files into your
+	echo "		   results directory"
+	echo 
+	echo "Usage: alfprocess2 <date>"
+	echo "       alfprocess moves data to  ~/alf_e/Results<date>"
+	echo "       creating the directory if necessary"
+	echo "Example:  alfprocess 910228a"
+	echo "          creates  ~/alf_e/Results910228a and moves"
+	echo "		910228a.alf to that directory"
+	exit
+endif
+
+if ( -e ${1}.alf ) then
+	if ( $cwd != $HOME/alf_e/Results${1} ) then 
+		if ( ! -d $HOME/alf_e/Results${1} ) then
+			echo "Results directory did not exist, making $HOME/alf_e/Results${1}"
+			mkdir $HOME/alf_e/Results${1}
+		endif
+		cp -i ${1}.alf $HOME/alf_e/Results${1}
+	endif
+endif
+
+cd $HOME/alf_e/Results${1}
+echo "Copying complete, now invoking alfprocess"
+alfprocess {$1}
+
+
diff --git a/src/alf/makefile-alpha b/src/alf/makefile-alpha
new file mode 100644
index 0000000..7df578e
--- /dev/null
+++ b/src/alf/makefile-alpha
@@ -0,0 +1,43 @@
+#
+# Makefile for alf - Alpha version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = alfsplit
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Alpha
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact -I$(SRC)/../ted
+LD = cc
+LDFLAGS = -g -non_shared
+
+
+#
+# Make the programs
+#
+alfsplit: alfsplit.o
+	$(LD) $(LDFLAGS) -o alfsplit alfsplit.o $(LIBS)
+
+2rs: 2rs.o
+	$(LD) $(LDFLAGS) -o 2rs 2rs.o $(LIBS)
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/alf/makefile-dec b/src/alf/makefile-dec
new file mode 100644
index 0000000..161936e
--- /dev/null
+++ b/src/alf/makefile-dec
@@ -0,0 +1,42 @@
+#
+# Makefile for alf - DEC Ultrix (mips) version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = alfsplit
+
+#
+# Where the wild things are
+#
+SRC = .
+LIBS =
+
+#
+# Sun
+#
+CC = c89
+CFLAGS = -g -std -w0 -I$(SRC)/../ted
+
+
+#
+# Make the programs
+#
+alfsplit: alfsplit.o
+	$(CC) -o alfsplit alfsplit.o $(LIBS)
+
+2rs: 2rs.o
+	$(CC) -o 2rs 2rs.o $(LIBS)
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/alf/makefile-sgi b/src/alf/makefile-sgi
new file mode 100644
index 0000000..3c90438
--- /dev/null
+++ b/src/alf/makefile-sgi
@@ -0,0 +1,43 @@
+#
+# Makefile for alf - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs made by this makefile
+#
+PROGS = alfsplit
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sun
+#
+CC = cc
+CFLAGS = -g -ansi -prototypes -I$(SRC)/../ted
+
+
+#
+# Make the programs
+#
+alfsplit: alfsplit.o
+	$(CC) -o alfsplit alfsplit.o $(LIBS)
+
+2rs: 2rs.o
+	$(CC) -o 2rs 2rs.o $(LIBS)
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/alf/makefile-solaris b/src/alf/makefile-solaris
new file mode 100644
index 0000000..790bddb
--- /dev/null
+++ b/src/alf/makefile-solaris
@@ -0,0 +1,46 @@
+#
+# Makefile for alf - Sun version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = alfsplit
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+# not needed for gcc2.1
+#LIBS = /usr/local/lib/gcc-gnulib
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS) -I$(TEDSRC)
+
+
+#
+# Make the programs
+#
+alfsplit: alfsplit.o
+	$(CC) -o alfsplit alfsplit.o -static $(LIBS)
+
+2rs: 2rs.o
+	$(CC) -o 2rs 2rs.o $(LIBS)
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/alf/makefile-sun b/src/alf/makefile-sun
new file mode 100644
index 0000000..986ff5a
--- /dev/null
+++ b/src/alf/makefile-sun
@@ -0,0 +1,47 @@
+#
+# Makefile for alf - Sun version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = alfsplit
+
+#
+# Where the wild things are
+#
+SRC = .
+# not needed for gcc2.1
+#LIBS = /usr/local/lib/gcc-gnulib
+
+#
+# Sun
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS) -I$(TEDSRC)
+
+
+#
+# Make the programs
+#
+alfsplit: alfsplit.o
+	$(CC) -o alfsplit alfsplit.o -static $(LIBS)
+
+2rs: 2rs.o
+	$(CC) -o 2rs 2rs.o $(LIBS)
+
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS) *.o *~
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/bap/actf.c b/src/bap/actf.c
new file mode 100644
index 0000000..67b2444
--- /dev/null
+++ b/src/bap/actf.c
@@ -0,0 +1,116 @@
+#include <string.h>
+#include <stdio.h>
+#include <stdlib.h>
+#include <fcntl.h>
+#include <sys/types.h>
+#include <sys/stat.h>
+#include "mcspec.h"
+#include "fort.h"
+
+/* static error messages for use in actf_() */
+static char *actferrlist[] = {
+    "Error! - stack underflow",
+    "Error! - unknown JOB to ACTF()",
+    "Error creating busy file",
+    "Error deleting busy file",
+    "Sorry, database busy"
+    };
+
+/* global variables - exported in actf.h */
+int actfptr = 0;	/* stack pointer - points to first free location */
+
+/* types */
+typedef struct {
+    char *filename;
+} actfstack_t;
+
+/* local variables */
+static actfstack_t actfstack[10];
+
+/*
+ * Display an error either via ERROM() if we've got a KBOUT or to stderr
+ * otherwise.
+ */
+static void actferr(int errnum, int_f *KBOUT) {
+    if (KBOUT)
+	errom_(KBOUT, actferrlist[errnum-1], strlen(actferrlist[errnum-1]));
+    else 
+	fprintf(stderr, "%s\n", actferrlist[errnum-1]);
+    return;
+}
+
+/*
+ * in FORTRAN as INTEGER FUNCTION ACTF(JOB,FILNAM,FILNAMLEN,COPYNUM,KBOUT)
+ *
+ * JOB = 1 : lock FILNAM (push filename and copynumber onto stack)
+ *
+ * JOB = 2 : unlock last locked file (pull off stack)
+ *           (in this case only trust COPYNUM and KBOUT to contain legal
+ *            and relevant data.)
+ *
+ * Locking mechanism done by creating a 'busy' file of format 'FILE_busy'.
+ */
+int_f actf_(int_f *JOB_p,
+	   char *FILNAM_p,
+	   int_f *FILNAMLEN_p,
+	   char *COPYNUM_p,
+	   int_f *KBOUT,
+	   int_fl  FILNAM_l,
+	   int_fl  COPYNUM_l)
+{
+    char fname[256], *cptr;
+    struct stat statbuf;
+    int fd, i;
+
+    if (*JOB_p == 1) {
+	/* char buf[10]; */
+
+	/* copy correct amount of FILNAM_p */
+	for (i=0; i<*FILNAMLEN_p; i++) {
+	    fname[i] = FILNAM_p[i];
+	}
+	strcpy(&fname[i], "_BUSYx");
+	fname[i+5] = *COPYNUM_p;
+
+	/* add data to our stack */
+/*	actfstack[actfptr].filename = strdup(fname); */
+	i = strlen(fname);
+	cptr = actfstack[actfptr].filename = (char *)malloc(i+1);
+	for (; i>=0; i--)
+	    cptr[i] = fname[i];
+	/* do the locking */
+	if (stat(fname, &statbuf) != -1) {
+	    /* lock already exists */
+	    actferr(5, KBOUT);
+	    return 5;
+	}
+	if ((fd = open(fname, O_CREAT, 0600)) == -1) {
+	    actferr(3, KBOUT);
+	    return 3;
+	}
+/*	fprintf(buf, "%d", getuid()); - wrong... ADDRESS of getuid()
+	write(fd, buf, sizeof(buf));*/
+	close(fd);
+	/* inc stack pointer */
+	actfptr++;
+	return 0;
+    } else if (*JOB_p == 2) {
+	/* dec stack pointer */
+	actfptr--;
+	if (actfptr < 0) {
+	    actferr(1, KBOUT);
+	    return 1;
+	}
+	/* do the unlocking */
+	if (unlink(actfstack[actfptr].filename) == -1) {
+	    actferr(4, KBOUT);
+	    return 4;
+	}
+	return 0;
+    } else {
+	/* unknown job */
+	actferr(2, KBOUT);
+	return 2;
+    }
+}
+
diff --git a/src/bap/actf.h b/src/bap/actf.h
new file mode 100644
index 0000000..2c611c9
--- /dev/null
+++ b/src/bap/actf.h
@@ -0,0 +1,9 @@
+extern int actfptr;
+
+int_f actf_(int_f *JOB_p,
+	   char *FILNAM_p,
+	   int_f *FILNAMLEN_p,
+	   char *COPYNUM_p,
+	   int_f *KBOUT,
+	   int_fl  FILNAM_l,
+	   int_fl  COPYNUM_l);
diff --git a/src/bap/bap.f b/src/bap/bap.f
new file mode 100644
index 0000000..d752d40
--- /dev/null
+++ b/src/bap/bap.f
@@ -0,0 +1,787 @@
+C     BAP (Sequence assembly program)
+C     AUTHOR and copyright RODGER STADEN
+C  23-1-93 new consensus routine
+C  23-10-92 tidy up for distribution
+C  25-8-92 Added disassembly routine (31)
+C    22-7-92 Added busy database checks
+C    14-7-92 added delete contig to fix
+C    4-6-92 Added padshifting under auto edit function
+C     9-1-90  sent idev instead of kbout to join and enter
+C     16-3-90 removed escape after bpause
+C     2-5-90 Many changes related to introducing graphics
+C     4-5-90 Allowed escape from getreg
+C     9-5-90 Introduced a default gel reading
+C    23-8-90 Added calls to SHOWFU
+C    28-11-90 See dbsys89
+C    3-12-90 Add invocation of contig editor
+C    25-02-91 tag and comment files
+C    28-2-91 changed maxsav from 1000 to maxglm
+C    30-4-91 changes help references to DAP*
+C    28-7-91 added extra parameter for quality calc: maxgood, only the parts
+C    of reads upto this length are used
+C    21-8-91 Added an option to find internal joins: AUTOJ
+C    27-8-91 Added buffer flushing for database files
+C    13-6-92 Upped MAXSEQ to 400000
+C    2-9-91  BIG CHANGE: reading names increased to 16 chars and
+C            max readings can now be set at compile up to 999999 and
+C            database organisation changed (record 1 of relationships
+C            now used for file info, last record of names not used
+C            for file size info and database start does not waste
+C            space in working version and names files: they are
+C            increased as we go along
+C    12-11-91 Changed autoedit so that it takes strands into account: needs
+C             a new consensus type calculation sumss and a call from main.
+C    17-3-92 Changed maxcon from 500 to maxdb/2
+      SUBROUTINE FMAIN()
+      INTEGER CHRSIZ,ECHRSZ
+      PARAMETER (
+     +           NAMLEN = 40,
+     +           MAXDEV = 12,
+     +           MAXSEQ=500000,
+     +           MAXGLM=4096,
+     +           MAXGL2=MAXGLM*2,
+     +           CHRSIZ=4,
+     +           ECHRSZ = 6,
+     +           IDT1=ECHRSZ*MAXGL2,
+     +           IDT2=2*CHRSIZ*MAXGL2,
+     +           IDT3=2*MAXGL2,
+     +           IDT4=2*ECHRSZ*MAXGL2,
+     +           MAXDB=8000)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      INTEGER RELPG(MAXDB),LREG,RREG,ANS
+      INTEGER TEMP1(IDT1),TEMP2(IDT2),TEMP3(IDT4)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILNAM,NAMPRO,FILE,NAMARC,FILMAR
+      CHARACTER*(NAMLEN) SHONAM
+      CHARACTER GEL(MAXGLM)
+      CHARACTER GEL2(MAXGLM),GEL3(MAXGLM),GEL4(MAXGLM)
+      PARAMETER (BOTOPT=0,TOPOPT=39)
+      PARAMETER (MINMEN=-3)
+      PARAMETER (HELPF='BAPHELP')
+      PARAMETER (POINTF='BAPHPNT')
+      PARAMETER (FILMAR='BAPMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARGB(TOPOPT),MARGT(TOPOPT)
+      CHARACTER SEQ1(MAXSEQ), SEQ2(MAXSEQ)
+C     DBAUTO TEMP3A = TEMP1, SEQ3A = GEL3, SEQ2A = GEL2
+C     MATCHA = GEL, SEQ4A = GEL4
+      PARAMETER (LENGTH = 7,
+     +           MAXCON = MAXDB/2,
+     +           MAXSAV = MAXGLM,
+     +           LPOWRC = CHRSIZ**LENGTH)
+      INTEGER CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ5(MAXGLM),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      INTEGER WINDOW,ACTF,CONOK
+      INTEGER OLILEN,OLIBAK,TEMNUM,OLINUM
+      EXTERNAL ACTF,CONOK
+      PARAMETER (LENRNM = 16)
+      CHARACTER*(LENRNM) RNAMES(MAXDB)
+      EQUIVALENCE (SEQ2,RNAMES)
+C     DBAUTO
+      EQUIVALENCE (TEMP2,TEMP1)
+      EQUIVALENCE (TEMP1,TEMP3)
+      COMMON /DEVILS/ IDEVT,IDEVC,IDBSIZ,RELPG
+      IDM = 5
+      CALL INITS
+      CALL INITLU(IDM)
+      CALL INTHLP('bap', TOPOPT)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(10))
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' BAP (Development assembly program) V12.1 May 1993',/,
+     +' Copyright: MRC Laboratory of Molecular Biology')
+      IGORT = 0
+      IDEV = KBOUT
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +TOPOPT,DEVNOS(5),FILMAR)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(5),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,DEVNOS(5))
+      IOPEN=1
+      LINLEN=50
+      PERCD=0.75
+      WINDOW = 25
+      MXGOOD = MAXGLM
+      IGWIND = 1000
+      FILE = ' '
+      FILNAM = ' '
+      MAXGEL = MAXGLM
+      IDEVR = DEVNOS(1)
+      IDEVW = DEVNOS(2)
+      IDEVN = DEVNOS(3)
+      IDEVT  = DEVNOS(11)
+      IDEVC  = DEVNOS(12)
+      IDEVH = DEVNOS(5)
+      IDBSIZ=MAXDB
+      IDBSIS=IDBSIZ
+      LREG = 0
+      RREG = 0
+      SHONAM = ' '
+      CALL SHOWFI(KBOUT,SHONAM)
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(ANS.EQ.1)THEN
+C          CALL SHOWFU(KBOUT,'Start new database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +    IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          LLINO = 0
+          IF(IERR.EQ.0) THEN
+            IOPEN=0
+            SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+            CALL SHOWFI(KBOUT,SHONAM)
+          END IF
+          CALL INITLU(IDM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C        CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    IDEVT,IDEVC,MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      MENUN = 0
+      KOPT = 0
+10    CONTINUE
+C
+C Flush the database buffers, just in case!
+C
+      IF (IOPEN.EQ.0) THEN
+        CALL FLUSHL(DEVNOS(1))
+        CALL FLUSHL(DEVNOS(2))
+        CALL FLUSHL(DEVNOS(3))
+        CALL FLUSHL(IDEVT)
+        CALL FLUSHL(IDEVC)
+      END IF
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      CALL SHOWFU(KBOUT,' ')
+C      IF(IOK.NE.0) GO TO 9999
+C      CALL DBMENT(MENU,NOPT,KOPT,
+C     +TOPOPT,HELPS(0),HELPE(0),HELPF,DEVNOS(5),
+C     +KBIN,KBOUT)
+      CALL MENU(NOPT, KOPT, MENUN, TOPOPT, MINMEN, KBIN, KBOUT,
+     +     HELPS(0), HELPE(0), HELPF, DEVNOS(5))
+      IF(NOPT.EQ.3)THEN
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          CLOSE(UNIT=IDEVC)
+          CLOSE(UNIT=IDEVT)
+          IOPEN=1
+          IOK = ACTF(2,NAMPRO,0,' ',KBOUT)
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+C          CALL SHOWFU(KBOUT,'Start a database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +    IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          IF(IERR.EQ.0)IOPEN=0
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C          CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    IDEVT,IDEVC,MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      END IF
+C  STOP
+      IF(NOPT.EQ.2)THEN
+        IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBOUT)
+        GO TO 9999
+      END IF
+
+
+C   clear all
+C
+      IF(NOPT.EQ.10)THEN
+C        CALL SHOWFU(KBOUT,'Clear graphics')
+        CALL CLEARG
+        GO TO 10
+      END IF
+C   clear VT100
+C
+      IF(NOPT.EQ.11)THEN
+C        CALL SHOWFU(KBOUT,'Clear text')
+        CALL CLEARV
+        GO TO 10
+      END IF
+C
+C   xhairs
+C
+      IF((NOPT.EQ.13).AND.(IOPEN.EQ.0)) THEN
+C          CALL SHOWFU(KBOUT,'Use xhair')
+          CALL XHGAP(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,NCONTS,LLINO,LINCON,LREG,RREG,
+     +    WINDOW,IGWIND,LENCON,IDEPTH,JDEPTH,
+     +    MARGL,MARGR,MARGB,MARGT,TOPOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +    KBOUT,GEL,GEL2,DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM,
+     +    SEQ1,MAXSEQ,NGELS,TEMP3,ECHRSZ,MAXGL2,GEL3,GEL4,33,34,29,39,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MXGOOD,RNAMES,
+     +    SAV1,SAV2,SAV3)
+        GO TO 10
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(NOPT.EQ.14)THEN
+C        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),TOPOPT,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  ruler
+C
+      IF(NOPT.EQ.12) THEN
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.1) THEN
+C          CALL SHOWFU(KBOUT,'Show ruler')
+          CALL RULER(LREG,RREG,MARGL,MARGR,
+     +    MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          GO TO 10
+        END IF
+      END IF
+C  labler
+      IF(NOPT.EQ.15)THEN
+C        CALL SHOWFU(KBOUT,'Label diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.16) THEN
+C        CALL SHOWFU(KBOUT,'Plot map')
+C  map
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.1) THEN
+          CALL PLTMAP(DEVNOS(4),FILNAM,LENCON,MARGL,MARGR,MARGB(NOPT),
+     +    MARGT(NOPT),ISXMAX,ISYMAX,LREG,RREG,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.25).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Show relationships')
+C       PRINT DB
+        CALL DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IDEV,
+     +  KBIN,KBOUT,DEVNOS(3),LLINO,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.5).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Display contig')
+C       DISPLAY
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  'Contig identifier',
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+          IF(IERR.EQ.0)THEN
+          CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +    GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +    DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+200   CONTINUE
+      IF((NOPT.EQ.23).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Complement contig')
+C       COMPLEMENT
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  'Contig identifier',
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,LLINO,GEL,IDBSIZ,KBOUT,DEVNOS(1),DEVNOS(2),
+     +    MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  PLOT SINGLE CONTIG
+      IF((NOPT.EQ.33).AND.(IOPEN.EQ.0))THEN
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  'Contig identifier',
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0)THEN
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.26).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Alter relationships')
+C       FIX
+        CALL DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),
+     +  HELPS(4),HELPE(4),HELPF,DEVNOS(5),MAXGEL,MAXGLM,IDEVT,IDEVC,
+     +  TEMP1)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.24).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Copy database')
+C       COPY
+        CALL DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +  NAMPRO,DEVNOS(4),IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),
+     +  DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,MAXDB,IDM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.19).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Check database for consistency')
+C       CHECK
+        ANS = 0
+        IF (KOPT.EQ.1) THEN
+          CALL YESNO(ANS,'Only check relationships',
+     +    HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+          IF(ANS.LT.0)GO TO 10
+        END IF
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP1,IERR,KBOUT)
+        IF (ANS.EQ.1) THEN
+          ICOCK =  CONOK(
+     +    RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +    SEQ1,MAXSEQ,GEL,IDBSIZ,TEMP1,
+     +    ECHRSZ,MAXGL2,KBOUT,IDEVW,IDEV,
+     +    MAXGEL,IDM,PERCD,CENDS,NENDS,MAXCON)
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.29).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Examine quality')
+C       SCAN
+        IDIM1=MAXSEQ
+        CALL DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),IDEV,LINLEN,PERCD,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,GEL3,GEL4,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEVNOS(3),
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.8).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Make consensus')
+C       CONSENSUS
+        CALL NEWCON(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +  NGELS,NCONTS,MAXGEL,LLINO,LINCON,KOPT,
+     +  TEMP1,SEQ1,MAXSEQ,GEL2,GEL3,
+     +  MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,
+     +  CENDS,NENDS,MAXCON,
+     +  KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  DEVNOS(4),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +  PERCD,IDM,IDEVC,IDEVT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+C        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(10),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),KOPT)
+        GO TO 10
+      END IF
+C Double stranding
+      IF((NOPT.EQ.36).AND.(IOPEN.EQ.0)) THEN
+        IDIM1=MAXSEQ
+        CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +       KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +       HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +       LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,0,IERR,LSTRT,LEND)
+        IF (IERR.NE.0) GO TO 10
+        LLINO = LNBR(LINCON)
+        CALL DBLINT(IERR,MAXMIS,MISSC,MTCHSC,PADSC,KOPT)
+        IF (IERR.EQ.-1) GO TO 10
+        LSTRT = LREG - MAXGEL
+        IF (LSTRT.LT.1) LSTRT = 1
+        LEND = RREG + MAXGEL
+        CALL BUSY()
+        CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +       KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +       HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +       LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,1,IERR,LSTRT,LEND)
+        IF (IERR.NE.0) GO TO 10
+        CALL DSTRND(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       SEQ2,IDIM1,LREG,RREG, LLINO, LINCON, MAXGEL, KBOUT,
+     +       IDBSIZ, GEL, IDEVR, IDEVW, IDEVN, 0, MAXMIS,MISSC,MTCHSC,
+     +       PADSC)
+        IDIM1=MAXSEQ
+        CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +       KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +       HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +       LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,2,IERR,LSTRT,LEND)
+        CALL DSTRND(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       SEQ2,IDIM1,LREG,RREG, LLINO, LINCON, MAXGEL, KBOUT,
+     +       IDBSIZ, GEL, IDEVR, IDEVW, IDEVN, 1, MAXMIS,MISSC,MTCHSC,
+     +       PADSC)
+        IDIM1=MAXSEQ
+        CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +       IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +       KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +       HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +       LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,2,IERR,LSTRT,LEND)
+        GO TO 10
+      END IF
+C Auto-create oligos
+      IF((NOPT.EQ.37).AND.(IOPEN.EQ.0)) THEN
+         LL = INDEX(NAMPRO,'.')-1
+         IDIM1=MAXSEQ
+         CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +        IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +        KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +        HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +        LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,0,IERR,LSTRT,LEND)
+C     LLINO returned from DBLSTR is the left most gel in the region.
+C     We need the left most one covering the region which is often not
+C     the same. So for simplicity we just use the left most of this
+C     contig.
+         IF (IERR.NE.0) GO TO 10
+         LLINO = LNBR(LINCON)
+         CALL OLINIT(IERR, OLILEN,OLIBAK,KOPT,MAXGEL,TEMNUM,OLINUM)
+         IF (IERR.EQ.-1) GO TO 10
+         LSTRT = LREG - (OLILEN+OLIBAK)
+         IF (LSTRT.LT.1) LSTRT = 1
+         LEND = RREG
+         CALL BUSY()
+         CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +        IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +        KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +        HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +        LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,1,IERR,LSTRT,LEND)
+         IF (IERR.NE.0) GO TO 10
+         CALL OLISEL(RELPG,LNGTHG,LNBR,RNBR,SEQ1,SEQ2,LLINO,
+     +        LINCON,LREG,RREG,IDEVN, '+',OLILEN,OLIBAK,LSTRT,TEMNUM,
+     +        NAMPRO,OLINUM,LL)
+         IDIM1=MAXSEQ
+         CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +        IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +        KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +        HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +        LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,2,IERR,LSTRT,LEND)
+         CALL OLISEL(RELPG,LNGTHG,LNBR,RNBR,SEQ1,SEQ2,LLINO,
+     +        LINCON,LREG,RREG,IDEVN, '-',OLILEN,OLIBAK,LSTRT,TEMNUM,
+     +        NAMPRO,OLINUM,LL)
+         IDIM1=MAXSEQ
+         CALL DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +        IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +        KBIN,KBOUT,DEVNOS(2),DEVNOS(1),IDEV,LINLEN,PERCD,
+     +        HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,DEVNOS(3),
+     +        LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,2,IERR,LSTRT,LEND)
+         GO TO 10
+      END IF
+C  HELP
+      IF(NOPT.EQ.1)THEN
+C        CALL SHOWFU(KBOUT,'Help')
+C       HELP
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,
+     +  HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.27)THEN
+C        CALL SHOWFU(KBOUT,'Set parameters')
+C  SET DISPLAY PARAMETERS
+        MN = 10
+        MX = 100
+        CALL GETINT(MN,MX,LINLEN,'Display line length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) LINLEN = IVAL
+        IPCMIN = 51
+        IPCMAX = 100
+        IPCD = INT(PERCD*100.)
+        CALL GETINT(IPCMIN,IPCMAX,IPCD,
+     +  'Percentage score for consensus',
+     +  IPVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) THEN
+          PERCD = REAL(IPVAL)/100.
+        END IF
+        MN = 1
+        MX = MAXGLM
+        CALL GETINT(MN,MX,MXGOOD,
+     +  'Maximum read length for quality analysis',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) MXGOOD = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,WINDOW,'Text window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) WINDOW = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,IGWIND,'Graphics window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) IGWIND = IVAL
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+C        CALL SHOWFU(KBOUT,'Display text file')
+       CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     + HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.28) THEN
+C        CALL SHOWFU(KBOUT,'Highlight disagreements')
+        CALL HIGHLT(SEQ1,SEQ1(6001),SEQ1(10001),KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(7),
+     +  DEVNOS(8),FILE,IOK)
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.34).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Plot contigs')
+C  PLOT ALL CONTIGS
+        CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,0,0,
+     +  NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+CC  AUTO EDIT
+      IF((NOPT.EQ.30).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Shuffle pads')
+C        WRITE(KBOUT,*)'Make a copy first!'
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  'Contig identifier',
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0) THEN
+        IDIM1=MAXSEQ
+C
+C  note pinching a few arrays and dimensions here so beware of changes
+C
+        CALL PADSHF(RELPG,LNGTHG,LNBR,RNBR,SEQ1,
+     +  IDIM1,LREG,RREG,LLINO,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  DEVNOS(2),MAXGEL,WORDP,WORDN,GELN,SAV1,MAXGLM,POSNS,MAXSEQ,
+     +  KBOUT)
+C        IDIM1=MAXSEQ
+C        CALL SUMSS(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,IDIM1,
+C     +  GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+C     +  DEVNOS(2),MAXGEL,GEL3,GEL4)
+C        CALL AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,LINCON,
+C     +  GEL,MAXGEL,SEQ1,IDIM1,DEVNOS(2),DEVNOS(1),LREG,RREG,
+C     +  KBOUT)
+      END IF
+      END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.4).AND.(IOPEN.EQ.0))THEN
+C       CALL UP CONTIG EDITOR
+        CALL CONEDT(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(9),HELPE(9),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  LINLEN,FILNAM,LLINO,IOK,
+     +  IERR,TEMP1)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.22).AND.(IOPEN.EQ.0))THEN
+C       CALL UP JOIN EDITOR
+        CALL JOINED(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(9),HELPE(9),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  LINLEN,FILNAM,LLINO,IOK,
+     +  IERR,TEMP1)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17) THEN
+C        CALL SHOWFU(KBOUT,'Screen for restriction sites')
+        CALL SCRENR(GEL2,MAXGEL,GEL3,NAMARC,FILE,
+     +  DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.18) THEN
+C        CALL SHOWFU(KBOUT,'Screen against vector')
+        CALL SCRENV(MAXGEL,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +  SEQ1,MAXSEQ,GEL2,GEL3,GEL4,LENGTH,
+     +  SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +  KBIN,KBOUT,DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,FILE,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.20).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+C        CALL SHOWFU(KBOUT,'Auto assemble sequences')
+      CALL DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,IDEVC,IDEVT,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.35).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+C        CALL SHOWFU(KBOUT,'Find internal joins')
+      CALL AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK,IDEVC,IDEVT)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.32).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Extract gel readings')
+        CALL GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +  NCONTS,GEL,MAXGEL,DEVNOS(2),DEVNOS(3),DEVNOS(4),DEVNOS(7),
+     +  KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),FILE)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF (NOPT.EQ.31) THEN
+C        CALL SHOWFU(KBOUT,'Disassemble readings')
+        CALL REMGBD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,DEVNOS(7),FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+      END IF
+      IF((NOPT.EQ.38).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+        CALL AUTOM(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +  NGELS,NCONTS,LLINO,LINCON,MAXGEL,
+     +  TEMP1,SEQ1,MAXSEQ,GEL2,SEQ5,SEQC2,SEQG2,
+     +  MAXGLM,MAXGL2,ECHRSZ,SAV1,SAV2,SAV3,MAXSAV,
+     +  KBIN,KBOUT,IDEVR,IDEVW,IDEVN,DEVNOS(8),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +  PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK,IDEVC,IDEVT)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.39).AND.(IOPEN.EQ.0))THEN
+        CALL YESNO(ISEL,'Select contigs',
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF (ISEL.LT.0) GO TO 10
+        IF (ISEL.EQ.0) THEN
+          CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +    LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +    'Contig identifier',
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(IERR.EQ.0)THEN
+            CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +      1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +      HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+            IF(IERR.EQ.0)THEN
+              CALL CHKREW(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,
+     +        IDEVN,SAV1,SAV2,SAV3,RNAMES,LLINO,NMATES)
+              CALL YESNO(IP,'Plot results',
+     +        HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+              IF(IP.LT.0) GO TO 10
+              IF(IP.EQ.0) THEN
+                CALL GDEPTH(RELPG,LNGTHG,
+     +          IDBSIZ,LREG,RREG,SAV1,SAV2,SAV3,NMATES,
+     +          MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+              ELSE
+                CALL LMATES(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,
+     +          IDEV,SAV1,SAV2,SAV3,RNAMES,NMATES)
+              END IF
+            END IF
+          END IF
+        ELSE
+          CALL CHKREV(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +    IDEVN,IDEV,KBIN,KBOUT,TEMP1,SAV1,SAV2,SAV3,RNAMES)
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.GT.3).AND.(IOPEN.NE.0))THEN
+        WRITE(KBOUT,1012)
+1012    FORMAT(' You have not opened a database!')
+        GO TO 10
+      END IF
+      GO TO 10
+9999  CONTINUE
+      IF (IOPEN.EQ.0) THEN
+        IOK = ACTF(2,NAMPRO,0,' ',KBOUT)
+      END IF
+      CALL SHUTD
+      END
diff --git a/src/bap/bapmen.f b/src/bap/bapmen.f
new file mode 100644
index 0000000..be56b57
--- /dev/null
+++ b/src/bap/bapmen.f
@@ -0,0 +1,120 @@
+      SUBROUTINE DBMENT(MENU,NOPT,KOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+1     CONTINUE
+      IF(MENU.EQ.0) THEN
+        WRITE(KBOUT,5000)
+5000    FORMAT(' Menus and their numbers are',/,
+     +  ' m0 = This menu',/,
+     +  ' m1 = General',/,
+     +  ' m2 = Screen control',/,
+     +  ' m3 = Modification',/,
+     +  '  ? = Help',/,
+     +  '  ! = Quit')
+      ELSE IF(MENU.EQ.-1) THEN
+      WRITE(KBOUT,1001)
+1001  FORMAT(' General menu',/,
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Open a database',/,
+     +'  5 = Display a contig',/,
+     +'  6 = List a text file',/,
+     +'  7 = Redirect output',/,
+     +'  8 = Calculate a consensus',/,
+     +' 17 = Screen against restriction enzymes',/,
+     +' 18 = Screen against vector',/,
+     +' 19 = Check database',/,
+     +' 24 = Copy database',/,
+     +' 25 = Show relationships',/,
+     +' 27 = Set parameters',/,
+     +' 28 = Highlight disagreements',/,
+     +' 29 = Examine quality')
+      ELSE IF(MENU.EQ.-2) THEN
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Screen control menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +' 10 = Clear graphics',/,
+     +' 11 = Clear text',/,
+     +' 12 = Draw ruler',/,
+     +' 13 = Use cross hair',/,
+     +' 14 = Change margins',/,
+     +' 15 = Label diagram',/,
+     +' 16 = Plot map',/,
+     +' 33 = Plot single contig',/,
+     +' 34 = Plot all contigs')
+      ELSE IF(MENU.EQ.-3) THEN
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Modification menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +' 20 = Auto assemble',/,
+     +' 23 = Complement a contig',/,
+     +' 26 = Alter relationships',/,
+     +' 30 = Shuffle pads',/,
+     +' 31 = Disassemble readings',/,
+     +' 32 = Extract gel readings',/,
+     +' 35 = Find internal joins')
+      END IF
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      IF((NOPT.LT.-3).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS(NOPT),IHELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.0) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF(NOPT.LT.0) THEN
+        MENU = NOPT
+        GO TO 1
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      RETURN
+      END
+      SUBROUTINE DBMENU(MENU,NOPT,KOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+1     CONTINUE
+      IF(MENU.EQ.4)THEN
+40    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Alter relationships menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Line change',/,
+     +'  4 = Check logical consistency',/,
+     +'  5 = Remove a contig',/,
+     +'  6 = Shift',/,
+     +'  7 = Move gel reading',/,
+     +'  8 = Rename gel reading',/,
+     +'  9 = Break a contig',/,
+     +' 10 = Remove a gel reading',/,
+     +' 11 = Alter raw data parameters')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 10
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 40
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 40
+      END IF
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      IF(NOPT.LE.0) GO TO 1
+      END
diff --git a/src/bap/baponly.c b/src/bap/baponly.c
new file mode 100644
index 0000000..4dea875
--- /dev/null
+++ b/src/bap/baponly.c
@@ -0,0 +1,6 @@
+/*
+ * Null functions supplied for use in bap (not xbap) so that various pieces
+ * of xbap code do not scream in agony.
+ */
+void UpdateTextOutput() {
+}
diff --git a/src/bap/contigEditor.c b/src/bap/contigEditor.c
new file mode 100644
index 0000000..3228b94
--- /dev/null
+++ b/src/bap/contigEditor.c
@@ -0,0 +1,1251 @@
+/*
+    Title: 	 contigEditor
+
+    File: 	 contigEditor.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: 5 April 1993
+
+*/
+
+
+/*
+*/
+
+#define CHKPNT(W,S) \
+    { XSync(XtDisplay(W),False); fprintf(stderr,"CHKPNT: %s\n",S); }
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>    /* IMPORT: freopen, stdout, stderr, fprintf */
+/*#include <string.h>*/
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+#include <X11/Xaw/SmeLine.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/Toggle.h>
+
+#include "Sheet.h"
+#include "contigEditor.h"
+#include "locks.h"
+#include "main.h"
+#include "tagUtils.h"
+#include "edUtils.h"
+#include "select.h"
+#include "extend.h"
+#include "undo.h"
+#include "fort.h"
+#include "oligo.h"
+
+
+/* ---- Static variables ---- */
+static Widget editorShellWid;	/* main editor shell */
+static Widget mainFormWid;	/* main editor form */
+
+static Widget buttonsWid;
+static Widget exitWid;		/* Leave Editor button */
+static Widget undoWid;		/* Undo last edit command button */
+static Widget supermanWid;	/* switch for Superuser class of edit commands */
+static Widget revealWid;	/* switch for revealing hidden cutoff information */
+#ifdef nana
+static Widget saveWid;	        /* save command button */
+#endif
+static Widget lockWid;	        /* lock button for when joining */
+#ifdef nana
+static Widget joinWid;	        /* lock button for when joining */
+#endif
+static Widget labelWid;	/* label displaying contig name */
+static Widget insertWid;	/* edit mode - insertion */
+static Widget replaceWid;	/* edit mode - replacement */
+static Widget nextWid;          /* search for next problem in consensus */
+
+static Widget disagreeFormWid;	/* for displaying disagreements when joining */
+static Widget dummyWid;
+Widget disagreeWid;
+
+float pcCut;              /* cut off for consensus calculation */
+int editorMode;
+enum States {StateDown=0,StateUp} editorState = StateDown;
+static int lockOffset;
+
+/* save/lock state */
+int_f save_state;
+
+SaveStruct saveState;
+
+DefColours defColours;
+
+
+
+/* ---- My translation tables ---- */
+/* static void tedSetUp(); */
+static char translationTable[] =
+    "\
+	 Ctrl<Btn1Down>:   XawPositionSimpleMenu(edMenu) XtMenuPopup(edMenu)\n\
+	 <Btn1Down>:    StartHighlight() buttonDown()\n\
+	 <Btn1Motion>:  ExtendHighlight()\n\
+	 <Btn1Up>:      MakeSelection()\n\
+	 <Btn2Down>(2): buttonDown() invokeTrace()\n\
+	 <Btn2Down>:    buttonDown()\n\
+	 <Btn3Down>:    ExtendHighlight()\n\
+	 <Btn3Motion>:  ExtendHighlight()\n\
+	 <Btn3Up>:      ExtendHighlight()\n\
+	 Meta<Key>Left:        MetaLeft()\n\
+	 Meta<Key>Right:       MetaRight()\n\
+	 <Key>Right:        caretRight()\n\
+         <Key>Left:         caretLeft()\n\
+         <Key>Down:         caretDown()\n\
+         <Key>Up:           caretUp()\n\
+         <Key>Delete:       deleteKey()\n\
+         <Key>:             keyPress()";
+/*
+  <Btn2Down>:    SelectTag()\n\
+  Meta<Key>Up:          MetaUp()\n\
+  */
+static XtActionsRec actionTable[] = {
+    {"caretRight",      caretRight},
+    {"caretLeft",       caretLeft},
+    {"caretDown",       caretDown},
+    {"caretUp",         caretUp},
+    {"deleteKey",       deleteKey},
+    {"keyPress",        keyPress},
+    {"buttonDown",      buttonDown},
+    {"invokeTrace",     invokeTrace},
+    {"StartHighlight",  start_highlight},
+    {"ExtendHighlight", extend_highlight},
+    {"MakeSelection",   make_selection},
+    {"SelectTag",       select_tag},
+    {"MetaLeft",        meta_left},
+    {"MetaRight",       meta_right},
+    {"MetaUp",          meta_up},
+};
+XtTranslations parsedTTable;
+
+
+static char translationTable2[] =
+    "<Btn1Down>:    selectRead()";
+
+static XtActionsRec actionTable2[] = {
+    {"selectRead",      selectRead},
+};
+XtTranslations parsedTTable2;
+
+
+
+
+
+
+
+
+
+/* ---- Private Functions ---- */
+static int LeaveAllowed;
+
+static void save()
+/*
+** Save Changes 
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    saveDB(
+	xx,
+	saveState.idevr,
+	saveState.idevw,
+	saveState.idevn,
+	saveState.relpg,
+	saveState.lngthg,
+	saveState.lnbr,
+	saveState.rnbr,
+	saveState.maxgel
+    );
+}
+
+static void join()
+/*
+** Join request made
+*/
+{
+    joinDB(
+        saveState.idevr,
+        saveState.idevw,
+        saveState.idevn,
+        saveState.relpg,
+        saveState.lngthg,
+        saveState.lnbr,
+        saveState.rnbr,
+        saveState.maxgel,
+        saveState.llinol,
+        saveState.lnconl,
+        saveState.llinor,
+        saveState.lnconr,
+        saveState.ngels,
+        saveState.nconts,
+        saveState.idbsiz
+    );
+
+}
+
+static void warningYesCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A yes response to the
+** warning about data not saved
+*/
+{
+    Widget popupWid = (Widget) client_data;
+
+    if (editorMode == EDITMODE)
+	save();
+    else
+	join();
+
+    LeaveAllowed = 1;
+    XtDestroyWidget(popupWid);
+}
+
+static void warningNoCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A No response to the
+** warning about data not saved
+*/
+{   Widget popupWid = (Widget) client_data;
+
+    LeaveAllowed = 1;
+    XtDestroyWidget(popupWid);
+}
+
+ static void warningCancelCallback(Widget w,
+				  XtPointer client_data, XtPointer call_data)
+/*
+** A Cancel response to the
+** warning about data not saved
+*/
+{   Widget popupWid = (Widget) client_data;
+
+     LeaveAllowed = 0;
+     XtDestroyWidget(popupWid);
+}
+
+static void createLabelsForBox(Widget parentWid, char *warning)
+{
+    Arg	args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, warning); nargs++;
+    (void) XtCreateManagedWidget("warnPrompt", labelWidgetClass,
+				 parentWid, args, nargs);
+    
+}
+
+static int warnNotSaved(Widget parentWid, String warning)
+/*
+** Warn that changes have been made but contig hasn't been saved.
+*/
+{   Widget warningWid, warnFormWid, warnPromptBox;
+    Widget yesWid, noWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) 0, (Position) height,
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    warningWid = XtCreatePopupShell("warning", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    warnFormWid = XtCreateManagedWidget("warnForm", formWidgetClass,
+				      warningWid, args, nargs);
+
+    nargs = 0;
+    warnPromptBox = XtCreateManagedWidget("warnBox", boxWidgetClass,
+					warnFormWid, args, nargs);
+
+    createLabelsForBox(warnPromptBox,warning);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    yesWid = XtCreateManagedWidget("Yes",  commandWidgetClass,
+				  warnFormWid, args, nargs);
+    XtAddCallback(yesWid, XtNcallback, warningYesCallback, (XtPointer) warningWid);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, yesWid); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    noWid = XtCreateManagedWidget("No",  commandWidgetClass,
+				  warnFormWid, args, nargs);
+    XtAddCallback(noWid, XtNcallback, warningNoCallback, (XtPointer) warningWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, noWid);           nargs++;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      warnFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback, 
+		  warningCancelCallback, (XtPointer) warningWid);
+
+    /* This is a modal dialogue */
+    LeaveAllowed = -1;
+    XtPopup(warningWid, XtGrabNonexclusive);
+
+    while (LeaveAllowed==-1)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(parentWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+
+    return LeaveAllowed;
+}
+
+
+
+static void editNextCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Next Problem button has been pushed
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    if (editorState == StateDown) return;
+
+    findNextProblem(xx);
+}
+
+
+static void editRevealCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Next Problem button has been pushed
+*/
+{
+    EdStruct *xx;
+    Arg args[10];
+    int nargs;
+    Boolean state;
+    int i;
+
+    if (editorState == StateDown) return;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, &state); nargs++;
+    XtGetValues(w, args, nargs);
+
+    for (i=0;i<editorMode;i++) {
+	xx = intToEdStruct(i);
+	xx->reveal_cutoffs=state;
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+    }
+}
+
+static void editLockCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Editor has been locked
+*/
+{
+/*    Boolean currMode; */
+    EdStruct *xx0 = intToEdStruct(0);
+    EdStruct *xx1 = intToEdStruct(1);
+
+    /*
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+    XtGetValues(lockWid, args, nargs);
+
+    if (currMode)
+    */
+	lockOffset = xx1->displayPos - xx0->displayPos;
+
+    redisplaySequences (xx0,xx0->namesWid, xx0->sequencesWid, xx0->displayPos, xx0->displayWidth);
+    redisplaySequences (xx1,xx1->namesWid, xx1->sequencesWid, xx1->displayPos, xx1->displayWidth);
+}
+
+static void editUndoCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Undo last command request made
+*/
+{
+    undoLastCommand();
+}
+
+static void editExitCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Request made to leave the editor
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    if (editorState == StateDown) return;
+
+    switch (editorMode) {
+	case EDITMODE:
+            if (editsMade(xx)) {
+                if (warnNotSaved(w,"Save changes?"))
+                {
+                    XtPopdown(editorShellWid);
+                    editorState = StateDown;
+                    semaphoreRelease(activeLock);
+                }
+	    } else {
+                XtPopdown(editorShellWid);
+                editorState = StateDown;
+                semaphoreRelease(activeLock);
+	    }
+	    break;
+        case JOINMODE: {
+	    int overlapLength, wingeCount;
+	    char warning[200];
+
+	    countDisagreements(&overlapLength, &wingeCount);
+	    if (overlapLength<=0) {
+		sprintf(warning,
+		    "Contigs do not overlap\nPerform Join?\n");
+	    } else {
+		sprintf(warning,
+		    "Percentage Mismatch:\n   %5.2f%%\nPerform Join?\n",
+		    (float)(100*wingeCount)/(float)overlapLength);
+	    }
+
+            if (warnNotSaved(w,warning))
+            {
+                XtPopdown(editorShellWid);
+                editorState = StateDown;
+                semaphoreRelease(activeLock);
+            }
+            break;
+	}
+        default:
+            XtPopdown(editorShellWid);
+            editorState = StateDown;
+            semaphoreRelease(activeLock);
+    }
+    /*
+    ** We must shut down the search window as well
+    */
+    if (editorState == StateDown) {
+	destroyOligo();
+	destroySearchWindow();
+    }
+}
+
+static void sliderScrollCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** left or right mouse button has caused the
+** slider to move
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(wid));
+    int position = (int) call_data;
+
+    float     topOfThumb;
+    int nargs;
+    Arg args[10];
+
+    if (editorState == StateDown) return;
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtGetValues(wid, args, nargs);
+
+    /* Move the thumb by one screenful in the appropriate direction */
+    if (position>0)
+	incDisplayPos(xx,D_screen);
+    else
+	decDisplayPos(xx,D_screen);
+}
+
+static void leftleftCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump left a long way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    decDisplayPos (xx,D_halfScreen);
+}
+
+static void leftCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump left a short way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    decDisplayPos (xx,D_character);
+}
+
+static void rightCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump right a short way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    incDisplayPos (xx,D_character);
+}
+
+static void rightrightCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump right a long way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    incDisplayPos (xx,D_halfScreen);
+}
+
+
+static void sliderJumpCallback(Widget wid,
+                            XtPointer client_data, XtPointer call_data)
+/*
+** Middle mouse button used to reposition scroll bar
+*/
+{
+    float percent = *((float *) call_data);
+    EdStruct *xx = widgetToEdStruct(XtParent(wid));
+
+    if (editorState == StateDown) return;
+
+    setDisplayPosPercent(xx,percent);
+}
+
+static void getFontDetails(int *width, int *height)
+/*
+** Get the font width and height
+** for the contig editor window
+** ASSUME a single font is used!!
+*/
+{
+    Arg args[10];
+    int nargs;
+    XFontStruct *font;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfont, &font); nargs++;
+    XtGetValues(labelWid, args, nargs);
+
+    *width = font->max_bounds.width;
+    *height = font->max_bounds.ascent + font->max_bounds.descent;
+}
+
+static void setInitialStates(int reveal_state)
+/*
+** Set the initial states of togglewidgets
+*/
+{
+    Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, False); nargs++;
+    XtSetValues(lockWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, reveal_state); nargs++;
+    XtSetValues(revealWid, args, nargs);
+
+}
+
+static void tweakGeometry1()
+/*
+** Some geometry tweaking must be done before widgets are realised
+*/
+{
+    EdStruct *xx;
+
+    xx = intToEdStruct(1);
+    if (editorMode==EDITMODE) {
+	XtManageChild(nextWid);
+	XtUnmanageChild(lockWid);
+#ifdef nana
+	XtUnmanageChild(joinWid);
+#endif
+	XtUnmanageChild(disagreeFormWid);
+	XtUnmanageChild(xx->edWid);
+	XawFormDoLayout(mainFormWid,True);
+    } else {
+	XtUnmanageChild(nextWid);
+	XtManageChild(lockWid);
+#ifdef nana
+	XtManageChild(joinWid);
+#endif
+	XtManageChild(disagreeFormWid);
+	XtManageChild(xx->edWid);
+	XawFormDoLayout(mainFormWid,True);
+    }
+    
+}
+
+static void tweakGeometry2()
+/*
+** Some geometry tweaking must be done after widgets are realised
+*/
+{
+    int i;
+    Arg args[10];
+    int nargs;
+
+    EdStruct *xx;
+
+    /*
+    ** scroll button wids
+    */
+    for (i=0;i<editorMode;i++) {
+	xx = intToEdStruct(i);
+	nargs = 0;
+	XtSetArg(args[nargs], XtNwidth, xx->fontWidth*NAMELEN+10); nargs++;
+	XtSetValues(xx->scrollButtonsWid, args, nargs);
+
+	/*
+	Dimension width;
+	xx = intToEdStruct(i);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, &width); nargs++;
+	XtGetValues(xx->namesWid, args, nargs);
+	fprintf(stderr,"width.a = %d\n",(int)width);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, width); nargs++;
+        XtSetValues(xx->scrollButtonsWid, args, nargs);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, &width); nargs++;
+	XtGetValues(xx->sequencesWid, args, nargs);
+	fprintf(stderr,"width.b = %d\n",(int)width);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, width); nargs++;
+        XtSetValues(xx->sliderWid, args, nargs);
+	*/
+    }
+
+
+}
+
+
+/* ---- Exported functions ---- */
+
+
+void jxedit_(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *idevt,    /* unit number for tag information */
+	int_f *idevc,    /* unit number for comment file */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *lnconl,	/* left contig to join */
+	int_f *llinol,	/* left-most gel in contig of left contig */
+	int_f *lnconr,	/* right contig for join */
+	int_f *llinor,	/* left-most gel in contig of right contig */
+	int_f *igell,    /* left gel specified on entry */
+	int_f *igellpos, /* position in left-most gel */
+	int_f *igelr,    /* right gel specified on entry */
+	int_f *igelrpos, /* position in left-most gel */
+	int_f *perced,	/* cutoff for consensus calculation */
+	int_f *ngels,	/* number of gels in database */
+	int_f *nconts,	/* number of contigs in database */
+	int_f *idm,      /* database type */
+	int_f *rcstate,  /* reveal cutoff state */
+	int_f *iok       /* returns 0-saved (1) | 1-saved (2) | joined (4) */
+	)
+/*
+** Interface to FORTRAN for Join editor
+*/
+{
+    EdStruct *xx[2];
+    int reveal_state;
+    int i;
+
+    semaphoreGrab(activeLock);
+    editorState = StateUp;
+    editorMode = JOINMODE;
+    save_state = (int_f)0;
+
+    reveal_state = (*rcstate != 0);
+    setInitialStates(reveal_state);
+
+    /*
+    ** Save arguments for later use
+    */
+    saveState.idevr  = idevr;
+    saveState.idevw  = idevw;
+    saveState.idevn  = idevn;
+    saveState.idevt  = idevt;
+    saveState.idevc  = idevc;
+    saveState.relpg  = relpg;
+    saveState.lngthg = lngthg;
+    saveState.lnbr   = lnbr;
+    saveState.rnbr   = rnbr;
+    saveState.maxgel = maxgel;
+    saveState.idbsiz = idbsiz;
+    saveState.llinol = llinol;
+    saveState.lnconl = lnconl;
+    saveState.llinor = llinor;
+    saveState.lnconr = lnconr;
+    saveState.perced = perced;
+    saveState.ngels  = ngels;
+    saveState.nconts = nconts;
+    saveState.idm    = idm;
+    pcCut = * (float *) perced;
+
+    for (i=0; i<2; i++) {
+	int_f *llino = (i==0)?llinol:llinor;
+
+        /*
+        ** Set up data structures
+        */
+        xx[i] = intToEdStruct(i);
+	xx[i]->reveal_cutoffs = reveal_state;
+        if (initialiseDB(xx[i],idevr,idevw,idevn,relpg,lngthg,lnbr,rnbr,maxgel,idbsiz,llino)) {
+	    if (i==1) freeDB(xx[0]);
+	    return;
+        }
+
+        /*
+        ** Set up display
+        */
+        if (createEdDisplay(xx[i],
+			    xx[i]->namesWid,
+			    xx[i]->sequencesWid,
+			    (i)?*igelr:*igell,
+			    (i)?*igelrpos:*igellpos)) {
+	    freeDB(xx[i]);
+	    if (i==1) freeDB(xx[0]);
+	    return;
+        }
+
+    }
+
+    tweakGeometry1();
+    XtPopup(editorShellWid,   XtGrabNone);
+    tweakGeometry2();
+
+    while (editorState != StateDown)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(xx[0]->edWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    for (i=0;i<2;i++) {
+	disown_selection(xx[i]);
+        freeDB(xx[i]);
+        cleanUpAllStacks();
+    }
+
+    *iok = save_state;
+}
+
+void cxedit_(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *idevt,    /* unit number for tag information */
+	int_f *idevc,    /* unit number for comment file */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *lincon,	/* current contig to edit */
+	int_f *llino,	/* left-most gel in contig */
+	int_f *igelno,   /* gel number specified on entry */
+	int_f *igelpos,  /* position in gel */
+	int_f *perced,	/* cutoff for consensus calculation */
+	int_f *idm,      /* database type */
+	int_f *rcstate,  /* reveal cutoff state */
+	int_f *iok       /* returns 0-saved (1) | 1-saved (2) | joined (4) */
+	)
+/*
+** Interface to FORTRAN for Contig Editor
+*/
+{
+
+    int reveal_state;
+
+    EdStruct *xx = intToEdStruct(0);
+    semaphoreGrab(activeLock);
+    editorState = StateUp;
+    editorMode = EDITMODE;
+    save_state = (int_f)0;
+
+    reveal_state = (*rcstate != 0);
+    setInitialStates(reveal_state);
+
+    /*
+    ** Save arguments for later use
+    */
+    saveState.idevr  = idevr;
+    saveState.idevw  = idevw;
+    saveState.idevn  = idevn;
+    saveState.idevt  = idevt;
+    saveState.idevc  = idevc;
+    saveState.relpg  = relpg;
+    saveState.lngthg = lngthg;
+    saveState.lnbr   = lnbr;
+    saveState.rnbr   = rnbr;
+    saveState.maxgel = maxgel;
+    saveState.idbsiz = idbsiz;
+    saveState.lincon = lincon;
+    saveState.llino  = llino;
+    saveState.perced = perced;
+    saveState.idm    = idm;
+
+    /*
+    ** Set up data structures
+    */
+    xx->reveal_cutoffs = reveal_state;
+    if (initialiseDB(xx,idevr,idevw,idevn,relpg,lngthg,lnbr,rnbr,maxgel,idbsiz,llino))
+	return;
+
+    /*
+    ** Set up display
+    */
+    pcCut = * (float *) perced;
+    if (createEdDisplay(xx,xx->namesWid,xx->sequencesWid,*igelno,*igelpos)) {
+	freeDB(xx);
+	return;
+    }
+
+    tweakGeometry1();
+    XtPopup(editorShellWid,   XtGrabNone);
+    tweakGeometry2();
+
+    while (editorState != StateDown)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(xx->namesWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    disown_selection(xx);
+    freeDB(xx);
+    cleanUpAllStacks();
+
+    *iok = save_state;
+}
+
+Widget createEditWid(Widget parentWid, Widget fromVertWid, int closeToVert)
+/*
+** Create an form with all the trappings of an editor window
+*/
+{
+    EdStruct *xx;
+    Widget llWid, lWid, rWid, rrWid;
+    Arg args[10];
+    int nargs;
+
+    xx = getFreeEdStruct();
+    if (xx == NULL) return NULL;
+
+    getFontDetails(&xx->fontWidth,&xx->fontHeight);
+
+    /*
+    ** create main form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  fromVertWid); nargs++;
+    if (closeToVert) {
+        XtSetArg(args[nargs], XtNvertDistance,  0); nargs++;
+    }
+    xx->edWid = XtCreateManagedWidget("edForm",  formWidgetClass,
+                                  parentWid, args, nargs);
+
+    /*
+    ** Scrollers
+    */
+    nargs = 0;
+    xx->scrollButtonsWid = XtCreateManagedWidget("scrollButtons", formWidgetClass,
+                                   xx->edWid, args, nargs);
+    nargs = 0;
+    llWid = XtCreateManagedWidget("leftleft",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(llWid, XtNcallback, leftleftCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  llWid); nargs++;
+    lWid = XtCreateManagedWidget("left",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(lWid, XtNcallback, leftCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  lWid); nargs++;
+    rWid = XtCreateManagedWidget("right",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(rWid, XtNcallback, rightCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  rWid); nargs++;
+    rrWid = XtCreateManagedWidget("rightright",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(rrWid, XtNcallback, rightrightCallback, NULL );
+
+
+
+    /*
+    ** A Slider
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  xx->scrollButtonsWid); nargs++;
+    XtSetArg(args[nargs], XtNhorizDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation,  XtorientHorizontal); nargs++;
+    XtSetArg(args[nargs], XtNheight, xx->fontHeight+4); nargs++;
+    XtSetArg(args[nargs], XtNwidth, xx->fontWidth*(xx->displayWidth)+8); nargs++;
+
+    xx->sliderWid = XtCreateManagedWidget("slider", scrollbarWidgetClass,
+                                   xx->edWid, args, nargs);
+    XtAddCallback(xx->sliderWid, XtNjumpProc,   sliderJumpCallback, NULL);
+    XtAddCallback(xx->sliderWid, XtNscrollProc, sliderScrollCallback, NULL);
+
+
+
+
+    /*
+    ** ... and a text widget or two or three
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, xx->sliderWid); nargs++;
+    XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, NAMELEN); nargs++;
+    xx->namesWid = XtCreateManagedWidget("names", sheetWidgetClass,
+                             xx->edWid, args, nargs);
+    XtUninstallTranslations(xx->namesWid);
+    /* Add string to function bindings for our application actions */
+    XtAppAddActions(XtWidgetToApplicationContext(xx->edWid),
+                    actionTable2, XtNumber(actionTable2));
+    parsedTTable2 = XtParseTranslationTable(translationTable2);
+    XtUninstallTranslations(xx->namesWid);
+    XtAugmentTranslations(xx->namesWid,parsedTTable2);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, xx->sliderWid); nargs++;
+    XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, xx->namesWid); nargs++;
+    XtSetArg(args[nargs], XtNhorizDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, DEFAULT_DISPLAY_WIDTH); nargs++;
+    xx->sequencesWid = XtCreateManagedWidget("sequences", sheetWidgetClass,
+                             xx->edWid, args, nargs);
+
+    /* Add string to function bindings for our application actions */
+    XtAppAddActions(XtWidgetToApplicationContext(xx->edWid),
+                    actionTable, XtNumber(actionTable));
+    parsedTTable = XtParseTranslationTable(translationTable);
+    XtUninstallTranslations(xx->sequencesWid);
+
+    XtAugmentTranslations(xx->sequencesWid,parsedTTable);
+
+    /*
+    ** Pop-up menu
+    */
+    createEdMenu(xx->sequencesWid);
+
+    return xx->edWid;
+}
+
+Widget createDisagreeWid(Widget parentWid, Widget fromVertWid, int closeToVert)
+/*
+** Create a form widget with all the trappings of a
+** disagreement window
+*/
+{
+    Arg args[10];
+    int nargs;
+    int fontWidth, fontHeight;
+
+    getFontDetails(&fontWidth, &fontHeight);
+
+    /*
+    ** create main form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  fromVertWid); nargs++;
+    if (closeToVert) {
+        XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    }
+    disagreeFormWid = XtCreateManagedWidget("disagreeForm",  formWidgetClass,
+                                  parentWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, 1); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, NAMELEN); nargs++;
+    dummyWid = XtCreateManagedWidget("dummy", sheetWidgetClass,
+                             disagreeFormWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, 1); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, DEFAULT_DISPLAY_WIDTH); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, dummyWid); nargs++;
+    disagreeWid = XtCreateManagedWidget("disagreements", sheetWidgetClass,
+                             disagreeFormWid, args, nargs);
+
+    XtUninstallTranslations(dummyWid);
+    XtUninstallTranslations(disagreeWid);
+
+    return disagreeFormWid;
+}
+
+
+Widget CreateEditorShell(Widget parentWid)
+/*
+** Create all components of the
+**	Contig Editor
+** and
+**	Join Editor
+*/
+{
+    Arg args[10];
+    int nargs;
+    Widget ed1Wid;
+    Widget ed2Wid;
+    Widget dw;
+
+    /*
+    ** Create the contig editor shell
+    */
+    editorShellWid = XtCreatePopupShell ("Editor",
+				      topLevelShellWidgetClass,
+				      parentWid,
+				      NULL, (Cardinal) 0);
+
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+					editorShellWid,
+                                        NULL, (Cardinal) 0);
+
+
+    /*
+    **  The form widget holds a label
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, exitWid); nargs++;
+    labelWid = XtCreateManagedWidget("title", labelWidgetClass,
+                                     mainFormWid, args, nargs);
+
+
+    /*
+    ** Button box for contig/join editor
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    buttonsWid = XtCreateManagedWidget("buttons", boxWidgetClass,
+                                   mainFormWid, args, nargs);
+
+    /*
+    ** Toggle widgets for insert/replace edit modes
+    */
+    nargs = 0;
+    insertWid = XtCreateManagedWidget("insert", toggleWidgetClass,
+                                   buttonsWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, insertWid); nargs++;
+    replaceWid = XtCreateManagedWidget("replace", toggleWidgetClass,
+                                   buttonsWid, args, nargs);
+
+    /*
+    ** Superman toggle widget (for super-duper editing)
+    */
+    nargs = 0;
+    supermanWid = XtCreateManagedWidget("superman",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+
+    /*
+    ** Reveal cutoffs
+    */
+    nargs = 0;
+    revealWid = XtCreateManagedWidget("reveal",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(revealWid, XtNcallback, editRevealCallback, NULL );
+
+    /*
+    ** Undo command button
+    */
+    nargs = 0;
+    undoWid = XtCreateManagedWidget("undo",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(undoWid, XtNcallback, editUndoCallback, NULL );
+
+    /*
+    ** Next (problem) command button
+    */
+    nargs = 0;
+    nextWid = XtCreateManagedWidget("next",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(nextWid, XtNcallback, editNextCallback, NULL );
+
+#ifdef nana
+    /*
+    ** Save command button
+    */
+    nargs = 0;
+    saveWid = XtCreateManagedWidget("save",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(saveWid, XtNcallback, editSaveCallback, NULL );
+#endif
+
+    /*
+    ** Lock command button
+    */
+    nargs = 0;
+    lockWid = XtCreateManagedWidget("lock",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(lockWid, XtNcallback, editLockCallback, NULL );
+
+#ifdef nana
+    /*
+    ** Join command button
+    */
+    nargs = 0;
+    joinWid = XtCreateManagedWidget("join",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(joinWid, XtNcallback, editJoinCallback, NULL );
+#endif
+
+    /*
+    ** and an exit button
+    */
+    nargs = 0;
+    exitWid = XtCreateManagedWidget("exit",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(exitWid, XtNcallback, editExitCallback, NULL );
+
+    /*
+    ** create editor
+    */
+    ed1Wid = createEditWid(mainFormWid,buttonsWid,0);
+    dw = createDisagreeWid(mainFormWid,ed1Wid,1);
+    ed2Wid = createEditWid(mainFormWid,dw,1);
+
+    /*
+    ** Create tag editor
+    */
+    (void) createTagEditor(parentWid);
+
+
+    /*
+    ** Create search diologue
+    */
+    (void) createSearchWidget(parentWid);
+
+    /*
+    ** Create select oligo diologue
+    */
+    (void) createOligoWidget(parentWid);
+
+    return editorShellWid;
+
+}
+
+
+int editModeIsInsert()
+/*
+** Find out if editor is in insert mode rather than replace mode
+*/
+{
+    String currMode;
+
+    currMode = (String) XawToggleGetCurrent(insertWid);
+    if (currMode == NULL)
+	return 0;
+    else 
+        return (strcmp (currMode,"insert") == 0);
+}
+
+int editModeIsSuperman()
+/*
+** Find out if we are entitled to use privilaged edit commands
+*/
+{
+    Boolean currMode;
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+    XtGetValues(supermanWid, args, nargs);
+
+    return currMode;
+}
+
+int inJoinMode()
+/*
+** Find out if the editor is in join mode
+*/
+{
+    return editorMode==JOINMODE;
+}
+
+int editorLocked()
+/*
+** Find out if the editor is locked (and in join mode of course)
+*/
+{
+    Boolean currMode;
+    int nargs;
+    Arg args[10];
+
+    if (inJoinMode()) {
+        nargs=0;
+        XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+        XtGetValues(lockWid, args, nargs);
+
+        return (currMode);
+    } else
+	return 0;
+}
+
+int editorLockedPos(int force)
+/*
+** Find out the locked position
+*/
+{
+    if (force) {
+        EdStruct *xx0 = intToEdStruct(0);
+        EdStruct *xx1 = intToEdStruct(1);
+
+        return xx1->displayPos - xx0->displayPos;
+    } else
+        return (lockOffset);
+}
diff --git a/src/bap/dbsyscommon.f b/src/bap/dbsyscommon.f
new file mode 100644
index 0000000..5e3c6a0
--- /dev/null
+++ b/src/bap/dbsyscommon.f
@@ -0,0 +1,5739 @@
+C  DBSYS ROUTINES COMMON TO PRE AND POST .RD PROGRAMS
+C
+C 25-8-92 NOTE at the end of the file are new versions of getln2 (getln3)
+C              and gelid (gelidn) and a new routine nameno, that should
+C              replace getln2 and gelid.
+C
+C  Further sap routines are in dbsysold.f (pre .rd) and dbsysnew.f (post)
+C  the split was made by rs 23-1-91
+C 15-6-92 added fasta format output from consen
+C 15-4-92 added in all the speedup changes ive been making and made 
+C         screnv compatible
+C 13-4-92 changed autocn to use a new hashing routine encof and inite
+C  2-4-92 Added new dbauto related routines and changed gtconc
+C  2-4-92 Added filnam = ' ' and brought uptodate with dap
+C  4-5-90 Change to getreg to allow escape
+C  4-5-90 addition of graphics routines and changes to menus
+C  9-5-90 added default gel reading: many changes
+C  17-5-90 Fixed 3 bugs in screen editing: 1) rightjustified names
+C          caused problems; 2) beginnings of sequences starting at far
+C          right of lines where not seen; 3) lines with no numbers at
+C          the end of a contig (i.e. with <10 chars) were flagged
+C          as errors. Changes to ltype for 1, linlen for 2, dsplay for 3
+C  9-7-90  removed menu routines
+C  20-8-90 changed gelid to add / to reading name because xsap did
+C          not return the INFLAG = 3 for the default
+C  23-8-90 Changes to dbauto and autocn to deal with failures better
+C          Plus addition of calls to BUSY
+C  9-11-90 Replaced call to radio with call to radion
+C  19-11-90 Changed max match length in dbauto to maxglm+1  (was 50)
+C  25-11-90 Very important bug fix in tpchek. Old versions could
+C           duplicate bits of working versions.
+C  28-11-90 Modified slider to receive maxpg and maxpc and to allow exactly
+C           the requested number of matches at each end of the two 
+C           sequences. 
+C           Added two new options to dbauto: all gels to new contigs, all 
+C           gels to contig 1; plus resurrected forbidding joins to allow
+C           sequences to be entered only into the contig the overlap best.
+C           Changed autocn to sort overlaps into order based on % mismatch
+C           (previously it saved the best two in any order)
+C           Minor change to dbstar
+C 3-1-91   Discovered bug in dbopen: incorrect call to getint when the
+C          database is very old and needs values for the current format
+C 21-1-91  GELID allowed illegal gel numbers to be returned! Fixed it.
+C 22-1-91  Modified autocn, adism4,adism3 to give more info about
+C          overlaps, and to allow 10 overlaps. Modified dbopen to
+C          return version number, ditto dbstar
+C 23-1-91  Split into dbsyscommon, dbsysold, dbsysnew
+C 26-2-91  Improved overflow check in padcop
+C 28-7-91 added extra parameter to quality calc: mxgood is the maximum
+C         reading length in which we have confidence, so only add this
+C         many chars from the start of each reading. Also changed the
+C         quality calc to make it the same as the consensus one. Made
+C         all characters have nonzero score and made lowercase = 100
+C 21-8-91 Changed arrfil to arrfim which does not display comments
+C 22-8-91 Added routine to find contig line number given left gel (CLINNO)
+C 
+C
+C 12-11-91 BIG CHANGE: made database handle 99,999 readings and 16 char names
+C
+C
+C          Also added routine to make aedit take strandedness into account
+C          (SUMSS).
+C          Also added fmt4lp which is used by find internal joins and
+C          could be used to advantage by others that call fmt4ln.
+C 18-11-91 New routine GETLN2 with returns gel number specified
+C
+C     enconn
+C   routine to store positions of words in posns and first occurences
+C   in wordp and number of occurences in wordn
+C   each number is a value representing one of the le4 possible
+C   words of length length made up of 4 characters
+C   words in posns are numbers from 1 to 4**length
+      SUBROUTINE ENCONN(POSNS,IDIM,WORDP,WORDN,LE4,LENGTH,START)
+C   AUTHOR: RODGER STADEN
+      INTEGER WORDP(LE4),POSNS(IDIM)
+      INTEGER WORDN(LE4),START
+C   number of words of length length
+      IDIM1 = IDIM - (LENGTH-1)
+      IF (START.EQ.1) THEN
+        DO 10 I=1,LE4
+          WORDN(I) = 0
+10      CONTINUE
+      END IF
+C   loop for each word
+      DO 100 I=START,IDIM1
+        N = POSNS(I)
+        IF(N.NE.0) THEN
+          NW = WORDN(N)
+C   is their already an entry for this word?
+          IF(NW.EQ.0) THEN
+C  first entry, put in wordp
+            WORDP(N) = I
+            WORDN(N) = NW + 1
+          ELSE
+            WORDN(N) = NW + 1
+            POSNS(I) = WORDP(N)
+            WORDP(N) = I
+          END IF
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE ENCOF(SEQ,IDSEQ,CONST,CSTART,LENGTH,POSNS)
+      CHARACTER SEQ(IDSEQ)
+      INTEGER CONST(LENGTH),CSTART,POSNS(IDSEQ),HASH
+      INTEGER CTONUM,CONSTL
+      EXTERNAL CTONUM
+C
+C new hashing routine. hash = k1 + k2
+C
+C hash = k1.c1 + k2.c2 + ... + kn.cn - cstart
+C now c1=1, c2=4*c1, c3=4*c2,...
+C
+C find length bases in a row, then do first word base by base,
+C for rest only change what is necessary
+C
+      DO 1 I=1,IDSEQ
+        POSNS(I) = 0
+ 1    CONTINUE
+      CONSTL = CONST(LENGTH)
+      LM1 = LENGTH - 1
+      IDSQML = IDSEQ - LENGTH
+      IS = 1
+      I  = 1
+      IP = 1
+      HASH = 0
+ 10   CONTINUE
+C
+C end approaching ?
+C
+      IF (IS.GT.IDSQML) RETURN
+C
+C at least a words length of characters left
+C
+ 11   CONTINUE
+      K = CTONUM(SEQ(I))
+      IF (K.EQ.5) THEN
+C
+C start a new word
+C
+        IS = I + 1
+        I = IS
+        IP = 1
+        HASH = 0
+        GO TO 10
+      END IF
+      HASH = HASH + CONST(IP) * K
+      IF (IP.NE.LENGTH) THEN
+        I = I + 1
+        IP = IP + 1
+        GO TO 11
+      END IF
+C
+C word finished
+C
+C save the hash value and the 
+C
+ 20   CONTINUE
+      POSNS(IS) = HASH + CSTART
+C      K1 = CONST(1) * CTONUM(SEQ(IS)) note const(1) = 1
+      K1 = CTONUM(SEQ(IS))
+      K2 = (HASH - K1) / 4
+      IS = IS + 1
+      IF (IS.GT.IDSQML) RETURN
+      K = CTONUM(SEQ(IS+LM1))
+      IF (K.EQ.5) THEN
+        IS = IS + 1
+        I = IS
+        IP = 1
+        HASH = 0
+        GO TO 10
+      END IF
+C      HASH = K2 + K * CONST(LENGTH) note this is a constant constant
+      HASH = K2 + K * CONSTL
+      GO TO 20
+      END
+      SUBROUTINE INITE(CONST,CSTART,LENGTH)
+      INTEGER CONST(LENGTH),CSTART
+      CSTART = 1
+      DO 1 I=1,LENGTH
+C      WRITE(*,*)I
+        CONST(I) = 4**(I-1)
+        CSTART = CSTART - CONST(I)
+ 1    CONTINUE
+      END
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+C      ABEDIN
+C
+C   ROUTINE TO EDIT THE DB USING A PADDED SEQ
+C   HAVE AN ARRAY SEQC2 LENGTH IDC OF PADDED SECTION OF CONTIG LINCON
+C  THE LEFT END OF THE PADDED CONTIG STARTS AT X
+C   THERE ARE ITOTPC PADS TO MAKE
+C
+      SUBROUTINE ABEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,POSN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQC2(IDC),GEL(MAXGEL),P
+      SAVE P
+      DATA P/','/
+C
+C   POINT TO CONTIG
+      POSN=X-1
+C   POINT TO SEQC2
+      IAT=0
+C   COUNT PADS DONE
+      IDONE=0
+C   LOOP FOR ALL SEQC2
+      DO 100 J=1,IDC
+      POSN=POSN+1
+      IAT=IAT+1
+      IPAD=0
+C   IS THIS A PADDING CHAR?
+      IF(SEQC2(IAT).NE.P)GO TO 100
+50    CONTINUE
+C   COUNT PADS
+      IPAD=IPAD+1
+      IAT=IAT+1
+      IF(SEQC2(IAT).EQ.P)GO TO 50
+C   END OF THIS STRETCH OF PADS,DO INSERT
+C   HAVE IPAD INSERTS TO MAKE AT POSN
+      CALL PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,IPAD,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   MOVE POINTER TO CONTIG
+      POSN=POSN+IPAD
+C   COUNT PADS DONE
+      IDONE=IDONE+IPAD
+C   ANY MORE TO DO?
+      IF(IDONE.EQ.ITOTPC)GO TO 101
+100   CONTINUE
+C   ERROR SHOULD HAVE DONE ALL PADS
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Problem: some pads were not done!')
+101   CONTINUE
+      END
+      SUBROUTINE ADDTIT(SEQ1,NAMPRO,NGELS,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(20),NAMPRO*(*)
+      CHARACTER NUMS(10)
+C Set maximum number of digits in reading number
+      PARAMETER (MAXGD = 5)
+      SAVE NUMS
+      DATA NUMS/'0','1','2','3','4','5','6','7','8','9'/
+      CALL FILLC(SEQ1(2),18,'-')
+      SEQ1(1) = '<'
+      SEQ1(20) = '>'
+      IEND = INDEX(NAMPRO,'.')
+      N=NGELS
+      K=IEND+MAXGD
+      DO 10 J=1,MAXGD
+        N=MOD(N,10)+1
+        NAMPRO(K:K)=NUMS(N)
+        N=NGELS/(10**J)
+        K=K-1
+10    CONTINUE
+      K = 18-IEND
+      K=K/2
+      DO 20 I=1,IEND+MAXGD
+        SEQ1(K)=NAMPRO(I:I)
+        K=K+1
+20    CONTINUE
+      IDIM1=IDIM1+20
+      END
+      SUBROUTINE ADISM1(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,
+     +ISTRAN,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C  NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCCC
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C
+C   EDITED 07-02-83 TO ALLOW FOR CASE WHERE A GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL. USE PARM THAT CONTAINS
+C   THE POSITION OF THE LEFT END OF THE NEXT CONTIG. SET TO VERY HIGH
+C   VALUE TO START
+      NEXTC=IDIM+1
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPERATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C  COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     1SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C   COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     1SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+C
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       ADISM2
+C   ROUTINE TO DISPLAY MATCHES
+      SUBROUTINE ADISM2(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C   NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCC
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC. SEE ADISM1.
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with contig number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   UPDATE END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.GT.2)RETURN
+      ILEFTS(IMATC)=CENDS(JJ)+20
+      ILC(IMATC)=LCL+LCR+1
+      IPOSC(IMATC)=LCL+1
+      IPOSG(IMATC)=SAVPG
+      LLINO(IMATC)=NENDS(JJ)
+      ISENSE(IMATC)=1
+      IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      RETURN
+      END
+      SUBROUTINE ADISM3(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+        WRITE(KBOUT,1000)LLINO(IMATC),IPOSC(IMATC),ISTRAN,
+     +  IPOSG(IMATC)
+ 1000   FORMAT
+     +  (' Contig',I5,' position',I6,' matches strand',I2,
+     +  ' at position',I5)
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+      SUBROUTINE ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      RETURN
+      END
+      SUBROUTINE AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +GEL,MAXGEL,CON,IDC,IDEVW,IDEVR,LREG,RREG,KBOUT)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),CON(IDC)
+      INTEGER RREG,PC,PCA,PG
+      NG = LGEL
+      PG = RELPG(NG)
+      PC = LREG
+      NTT = 0
+      NCT = 0
+      NDT = 0
+10    CONTINUE
+C      WRITE(*,*)'GEL',NG
+      CALL READW(IDEVW,NG,GEL,MAXGEL)
+      LG = ABS(LNGTHG(NG))
+      IF(PC.LT.LREG) PC = LREG
+      PCA = PC - LREG + 1
+      IG = PC - PG + 1
+      LC = MIN(LG,RREG-PC+1)
+C      WRITE(*,*)'PC,PG,IG,LG,PCA,LC',PC,PG,IG,LG,PCA,LC
+      CALL ET(GEL(IG),LG,CON(PCA),LC,NE)
+      NTT = NTT + NE
+      CALL EC(GEL(IG),LG,CON(PCA),LC,NE)
+      NCT = NCT + NE
+      CALL ED(GEL(IG),LG,CON(PCA),LC,ND)
+      NDT = NDT + ND
+      CALL WRITEW(IDEVW,NG,GEL,MAXGEL)
+      IF(ND.GT.0) THEN
+        K = LNGTHG(NG)
+        LNGTHG(NG) = ABS(LNGTHG(NG)) - ND
+        LNGTHG(NG) = SIGN(LNGTHG(NG),K)
+        CALL WRITER(IDEVR,NG,RELPG(NG),LNGTHG(NG),LNBR(NG),RNBR(NG))
+      END IF
+      IF(RNBR(NG).NE.0) THEN
+        NG = RNBR(NG)
+        PG = RELPG(NG)
+        PC = PG
+        IF(PG.LE.RREG) GO TO 10
+      END IF
+      CALL EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      WRITE(KBOUT,1000)NTT
+1000  FORMAT(' Number of transpositions=',I6)
+      WRITE(KBOUT,1001)NCT
+1001  FORMAT(' Number of changes       =',I6)
+      WRITE(KBOUT,1002)NDT
+1002  FORMAT(' Number of deletions     =',I6)
+      END
+C    AJOIN2
+C   COMPLETES JOIN AND RETURNS LENGTH OF NEW CONTIG IN LLINOR
+      SUBROUTINE AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +RELX,LLINOL,LLINOR,LNCONL,LNCONR,IDEVR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNBR(IDBSIZ),RNBR(IDBSIZ),LNGTHG(IDBSIZ)
+      INTEGER RELX
+C   RELX IS THE POSITION OF THE JOINT
+C   LLINOL IS THE LEFT GEL NUMBER OF THE LEFT CONTIG
+C   LLINOR IS THE LEFT GEL OF THE RIGHT CONTIG
+C   LNCONL IS THE LEFT CONTIG LINE NUMBER
+C   LNCONR IS THE RIGHT CONTIG LINE NUMBER
+C
+C   ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+      N=LLINOR
+      RELPG(N)=RELX
+50    CONTINUE
+      IF(RNBR(N).EQ.0)GO TO 60
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+RELX-1
+      GO TO 50
+60    CONTINUE
+C
+C   FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+      LNBR(LLINOR)=RNBR(LNCONL)
+C   FIX UP RIGHT GEL OF LEFT CONTIG
+      N=RNBR(LNCONL)
+      RNBR(N)=LLINOR
+C   MERGE WILL SORT OUT THE CORRECT NEIGHBOURS
+C
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C   MERGE DOES NOT WRITE TO DISK
+      N=LNBR(LNCONL)
+65    CONTINUE
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N))
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 65
+C   CONTIG LINES
+      X=RELPG(LNCONR)+RELX-1
+C   LENGTH MAY NOT HAVE INCREASED!
+      IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+C   SAVE LENGTH OF NEW CONTIG
+      RELX=RELPG(LNCONL)
+C      WRITE(IDEVR,REC=LNCONL)RELPG(LNCONL),LNGTHG(LNCONL),LNBR(LNCONL),
+C     1RNBR(LNCONL)
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C
+C   NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+      N=IDBSIZ-NCONTS
+      M=LNCONR-N
+      IF(M.EQ.0)GO TO 80
+      K=LNCONR
+      J=LNCONR-1
+      DO 70 I=1,M
+      RELPG(K)=RELPG(J)
+      LNGTHG(K)=LNGTHG(J)
+      LNBR(K)=LNBR(J)
+      RNBR(K)=RNBR(J)
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K))
+      K=K-1
+      J=J-1
+70    CONTINUE
+80    CONTINUE
+      NCONTS=NCONTS-1
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+90    CONTINUE
+      RETURN
+      END
+C     SUBROUTINE AJOIN3
+      SUBROUTINE AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,IDIM22,
+     +KLASS,IOVER,KBOUT,PL,PR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LINCON(2),IDIM22(2)
+      INTEGER ITYPE(2),ISENSE(2),JOINT(2),PL(2),PR(2)
+C
+C   CALC POSITIONS OF CONTIGS RELATIVE TO FIXED GEL
+      DO 20 I=1,2
+C   R+
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.1))GO TO 11
+      PL(I)=-1*JOINT(I)+2
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   L+
+11    CONTINUE
+      IF((ITYPE(I).NE.1).OR.(ISENSE(I).NE.1))GO TO 12
+      PL(I)=JOINT(I)
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   R-
+12    CONTINUE
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.-1))GO TO 13
+      PR(I)=JOINT(I)+IDIM22(I)-1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+      GO TO 20
+C   L-
+13    CONTINUE
+      PR(I)=IDIM22(I)-JOINT(I)+1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+20    CONTINUE
+C  LENGTH OF OVERLAP
+      IOVER=MIN(PR(1),PR(2))-MAX(PL(1),PL(2))+1
+      WRITE(KBOUT,1002)IOVER
+1002  FORMAT(' Length of overlap between the contigs=',I6)
+C
+C  CLASS NUMBER 1-16
+      KLASS=1
+      IF(ITYPE(1).EQ.1)KLASS=KLASS+8
+      IF(ISENSE(1).EQ.-1)KLASS=KLASS+4
+      IF(ITYPE(2).EQ.1)KLASS=KLASS+2
+      IF(ISENSE(2).EQ.-1)KLASS=KLASS+1
+C      WRITE(KBOUT,1001)KLASS
+C1001  FORMAT(' CLASS OF JOIN=',I6)
+      RETURN
+      END
+C      ALINE
+C
+C    ROUTINE TO LINE UP 2 SEQS.
+C   IT SLIDES,REMOVES OVERLAPPING MATCHES,
+C   SORTS MATCHES INTO ASCENDING ORDER, THEN DOES DOES A TOPOLOGICAL
+C   CHECK, AND THEN PRODUCES 2 LINED UP SEQS WITH PADDING CHARS
+C   VARIABLES
+C       SEQ1 CONSENSUS
+C       SEQ2 GEL ORIGINAL IN CORRECT ORIENTATION
+C       SEQG2 ALIGNED GEL
+C       SEQC2 ALIGNED CONSENSUS
+C       SEQ3 SAVED GEL RAW DATA
+C       ISAV1,2,3 STORE MATCHES AND POSITIONS
+C       IDSAV NUMBER ISAV'S
+C       IDC LENGTH OF INPUT SEQ1
+C       IDIM2 LENGTH OF INPUT SEQ2
+C       IDOUT LENGTH OF OUTPUT ALIGNED SEQ1
+C       IDIM2 LENGTH OF SEQ2 ON OUTPUT AFTER ALIGNMENT
+C       MINSLI MIN MATCH FOR SLIDING
+C       IFAIL FLAG TO SHOW IF ALIGNMENT FAILED DUE TO TOO
+C   MANY MISMATCHES OR TOPOLIGICAL CHECK OR TOO MANY OR TOO MANY
+C   PADDING CHARS. 1=FAIL,0=PASS
+C
+      SUBROUTINE ALINE(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM,LENO,ISHOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+C
+C added next routine 27-2-93
+C
+      CALL UPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+      CALL DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL,LENO,PERMAX,ISHOW)
+      IF(IFAIL.NE.0)RETURN
+      IF(ISHOW.EQ.1) THEN
+        WRITE(KBOUT,1052)PERCM,ITOTPC,ITOTPG
+1052    FORMAT(' Percent mismatch=',F4.1,', pads in contig=',I3,
+     +  ', pads in gel=',I3)
+      END IF
+      END
+      SUBROUTINE ARCSER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*16,NAME2*16
+10    CONTINUE
+      L = 0
+      CALL GTSTR('Archive name',' ',NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(L.EQ.0) RETURN
+      CALL CCASE(NAME1,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      IF(NAME1(1:1).EQ.' ') RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      DO 100 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        IF(NAME1.EQ.NAME2) THEN
+          WRITE(KBOUT,1003)NAME2,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1003      FORMAT(' ',A,2X,I4,2X,I6,2X,I6,2X,I6,2X,I6/)
+          GO TO 10
+        END IF
+100   CONTINUE
+      WRITE(KBOUT,1004)NAME1
+1004  FORMAT(' ',A,' Not in database')
+      GO TO 10
+      END
+      SUBROUTINE AUTOCN(SEQ1,IDIM,GEL,IDIMG,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,CSTART,
+     +JOBC,PERMIS,LENO,ISHOW)
+C   AUTHOR: RODGER STADEN
+C   changed 29-11-90 to make first in list of alignments the best
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIM),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIM),GEL(MAXGLM)
+C
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 100)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),CSTART,START
+      REAL PERMIS(2)
+C
+C jobc tells how to update the hash tables:
+C 0 means dont do anything because the consensus hasnt changed
+C 1 means add the last contig because a new one has been stuck on the end
+C 2 means do the whole consensus
+C
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+C  23-8-90 Need to deal with failures in a better way. Problem is
+C          case where overlaps are found but fail to align. In future
+C          signal them with new variable KFAIL which will be nonzero
+C          if any alignment fails.
+C  29-11-90 Changed sorting of overlaps so that the best is first in the 
+C           list returned to caller.
+C   SAVE GEL
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+C  COUNT NUMBER OF CONTIGS THAT MATCH
+      IMATC=0
+      IDCEND=MAXCON
+      CALL BUSY(KBOUT)
+      CALL FNDCON(SEQ1,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      IF (JOBC.NE.0) THEN
+        START = 1
+        IF(JOBC.EQ.1) START = CENDS(IDCEND)
+        CALL ENCOF(SEQ1(START),IDIM-START+1,CONST,CSTART,LENGTH,
+     +POSNS(START))
+        CALL ENCONN(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH,START)
+      END IF
+1     CONTINUE
+      ISTRAN=1
+2     CONTINUE
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCOF(GEL,IDIMG,CONST,CSTART,LENGTH,GELN)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0)RETURN
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.EQ.0) THEN
+        IFAIL(1) = 0
+        RETURN
+      END IF
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINE(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS,LENO,
+     +  ISHOW)
+        IF(JFAIL.EQ.0) THEN
+          JMATC = JMATC + 1
+          IF(JMATC.EQ.1) THEN
+C    Save in elements 1
+             CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            IFAIL(1) = 0
+          ELSE IF(JMATC.EQ.2) THEN
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+              IFAIL(2) = 0
+            END IF
+          ELSE
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE IF(PERMS.LT.PERMIS(2)) THEN
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+            END IF
+          END IF
+        ELSE
+          KFAIL = 1
+        END IF
+100   CONTINUE
+      IMATC = MIN(2,JMATC)
+      END
+      SUBROUTINE BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CHAINL,GCLIN
+      EXTERNAL CHAINL,GCLIN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTR = IDBSIZ - NCONTS - 1
+      IF(NCONTR.LE.NGELS) THEN
+        WRITE(KBOUT,*)'Insufficient space for new contig line.'
+        WRITE(KBOUT,*)'Increase database size with copy'
+        RETURN
+      END IF
+10    CONTINUE
+        MN = 0
+        MX = NGELS
+        IR  = 0
+        CALL GETINT(MN,MX,IR,
+     +  'Number of gel reading that will become a left end',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.LT.1) RETURN
+        IR = IVAL
+      IL = LNBR(IR)
+      IF(IL.EQ.0)THEN
+        WRITE(KBOUT,*)'Gel number',IR,' is already a left end'
+        GO TO 10
+      END IF
+      ILO = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IR)
+      IF(ILO.EQ.0)THEN
+        WRITE(KBOUT,*)
+     +'Problem with this contig. Check logical consistency'
+        WRITE(KBOUT,*)'of database. Break not made'
+        RETURN
+      END IF
+      NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(NCONTO.EQ.0)THEN
+        WRITE(KBOUT,*)'No contig line for this contig. Check logical'
+        WRITE(KBOUT,*)'consistency of database. Break not made'
+        RETURN
+      END IF
+      LCONTO = RELPG(NCONTO)
+      IF(LCONTO.LT.1)THEN
+        WRITE(KBOUT,*)'Contig has zero length. Break not made'
+        RETURN
+      END IF
+      CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+      END
+      SUBROUTINE CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN
+      EXTERNAL CLEN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTS = NCONTS + 1
+C  WRITE LAST LINE OF DB
+      WRITE(KBOUT,*)'Increasing number of contigs by 1'
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C  MAKE NEW CONTIG A COPY OF OLD
+      RELPG(NCONTR) = RELPG(NCONTO)
+      LNGTHG(NCONTR) = LNGTHG(NCONTO)
+      LNBR(NCONTR) = IR
+      RNBR(NCONTR) = RNBR(NCONTO)
+      WRITE(KBOUT,*)'Writing new right contig line'
+      CALL WRITER(IDEVR,NCONTR,RELPG(NCONTR),LNGTHG(NCONTR),
+     +LNBR(NCONTR),RNBR(NCONTR))
+C  NEED LENGTH FOR OLD LEFT CONTIG
+      RNBR(IL) = 0
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(L.LT.1)THEN
+        WRITE(KBOUT,*)
+     +  'New left contig has zero length. Break not made'
+        RETURN
+      END IF
+      RELPG(NCONTO) = L
+      RNBR(NCONTO) = IL
+C  DO CONTIG LINE FOR NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new left contig line'
+      WRITE(KBOUT,*)'New length=',RELPG(NCONTO)
+      WRITE(KBOUT,*)'New right gel=',RNBR(NCONTO)
+      CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +LNBR(NCONTO),RNBR(NCONTO))
+C  DO GEL LINE FOR RIGHT GEL OF NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new right gel of left contig'
+      WRITE(KBOUT,*)'Gel number=',IL
+      CALL WRITER(IDEVR,IL,RELPG(IL),LNGTHG(IL),
+     +LNBR(IL),RNBR(IL))
+C  DO GEL LINE FOR NEW RIGHT CONTIG
+      LNBR(IR) = 0
+      WRITE(KBOUT,*)'Writing new left gel of right contig'
+      WRITE(KBOUT,*)'Gel number=',IR
+      CALL WRITER(IDEVR,IR,RELPG(IR),LNGTHG(IR),
+     +LNBR(IR),RNBR(IR))
+C  NOW SHIFT
+      I = 1 - RELPG(IR)
+      WRITE(KBOUT,*)'Shifting gels in right contig by distance=',I
+      CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IR,NCONTR,I)
+      WRITE(KBOUT,*)'Right contig length=',RELPG(NCONTR)
+      WRITE(KBOUT,*)'Break completed'
+      IOK = 0
+      END
+C     BUBBL3
+C   SUBROUTINE TO SORT INTEGER ARRAY (LIST) INTO ASCENDING  ORDER
+C
+      SUBROUTINE BUBBL3(LIST,LISTEL,LISTAL,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LISTEL(IDIM),LISTAL(IDIM)
+C
+C   SET POINTERS TO ZERO
+      I=0
+      J=0
+C
+10    CONTINUE
+C
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+C
+C   INCREMENT POINTER TO NEXT ELEMENT
+      I=I+1
+C   TEST FOR END OF ARRAY
+      IF(I.EQ.IDIM)RETURN
+C
+20    CONTINUE
+C
+C   COMPARE ADJACENT ELEMENTS
+      IF(LIST(I).GE.LIST(I+1))GO TO 10
+C
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+C
+C   EXCHANGE ADJACENT ELEMENTS
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+C
+      ITEMP=LISTEL(I)
+      LISTEL(I)=LISTEL(I+1)
+      LISTEL(I+1)=ITEMP
+      ITEMP=LISTAL(I)
+      LISTAL(I)=LISTAL(I+1)
+      LISTAL(I+1)=ITEMP
+C
+C
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+C
+      GO TO 20
+      END
+      SUBROUTINE CCTA(SEQ,ID)
+      CHARACTER SEQ(ID),COM,AS
+      SAVE COM,AS
+      DATA COM/','/,AS/'*'/
+      DO 10 I = 1,ID
+        IF(SEQ(I).EQ.COM) SEQ(I) = AS
+10    CONTINUE
+      END
+C
+C     CFGEL  new version 15-4-92
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C go thru the gel reading looking at words to see if they exist in the consensus
+      IDSAVM=IDSAV
+      IDSAV=0
+C length of a pair of words is LX2
+      LX2=2*LENGTH
+C number of pairs of words in gel reading is NW
+      NW=IDIMG-(LX2-1)
+C loop for each words start point
+      DO 200 I=1,NW
+C       IS THIS WORD A ZERO?
+        W1=GELN(I)
+        IF(W1.EQ.0)GO TO 200
+C       POINT TO NEXT WORD OF PAIR
+        W2=GELN(I+LENGTH)
+        IF(W2.EQ.0)GO TO 200
+C       DOES W1 EXIST IN SEQ?
+        N1S1=WORDN(W1)
+        IF(N1S1.EQ.0)GO TO 200
+        N2S1=WORDN(W2)
+        IF(N2S1.EQ.0)GO TO 200
+C       BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+        PS1W1=WORDP(W1)+LENGTH
+C       LOOP FOR ALL PAIRS
+C there are N1S1 occurrences of word 1 and N2S1 of word 2 in consensus
+C compare the positions of all pairs to see if they are LENGTH apart
+        DO 50 J=1,N1S1
+C         POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+          PS1W2=WORDP(W2)
+C         LOOP FOR THESE
+          DO 40 K=1,N2S1
+C           ARE THIS PAIR LENGTH APART?
+            N=PS1W1-PS1W2
+            IF(N.NE.0)GO TO 20
+C           THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+            LMAT=LX2
+C
+C new code
+C
+            IPC = PS1W2 + LENGTH - 1
+            IPG = I + LX2 - 1
+ 16         CONTINUE
+            IF (LMAT.LT.MINMAT) THEN
+              IPC = IPC + 1
+              IPG = IPG + 1
+              IF(IPG.GT.IDIMG)GO TO 20
+              IF(IPC.GT.IDIM)GO TO 20
+              IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 20
+              LMAT=LMAT+1
+              GO TO 16
+            END IF
+C
+C match found, is it an extension of a previous one ?
+C
+C            WRITE(*,*)I,PS1W1-LENGTH
+            IF (IDSAV.GT.0) THEN
+              IF (I-SAVPG(IDSAV).EQ.PS1W1-LENGTH-SAVPS(IDSAV)) GO TO 20
+            END IF
+            IDSAV = IDSAV + 1
+            IF (IDSAV.GT.IDSAVM) THEN
+              WRITE(KBOUT,1000)IDSAVM
+1000          FORMAT(' More than ',I6,' matches. Search aborted')
+              IFAIL = 1
+              RETURN
+            END IF
+C            WRITE(*,*)IDSAV
+            SAVPG(IDSAV) = I
+            SAVPS(IDSAV) = PS1W1 - LENGTH
+ 20         CONTINUE
+C           POINT TO NEXT W2
+            PS1W2=POSNS(PS1W2)
+40        CONTINUE
+C         ALL TRIED THIS PS1W1, TRY NEXT
+          PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50      CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+C
+C     CFGEL old version (before 15-4-92)
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGELO(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDSAVM=IDSAV
+      IDSAV=0
+C   LENGTH OF PAIR OF WORDS
+      LX2=2*LENGTH
+C   NUMBER OF PAIRS OF WORDS OF LENGTH LENGTH IN GEL
+      NW=IDIMG-(LX2-1)
+C   LOOP FOR EACH START POINT
+      DO 200 I=1,NW
+C   IS THIS WORD A ZERO?
+      W1=GELN(I)
+      IF(W1.EQ.0)GO TO 200
+C   POINT TO NEXT WORD OF PAIR
+      W2=GELN(I+LENGTH)
+      IF(W2.EQ.0)GO TO 200
+C   DOES W1 EXIST IN SEQ?
+      N1S1=WORDN(W1)
+      IF(N1S1.EQ.0)GO TO 200
+      N2S1=WORDN(W2)
+      IF(N2S1.EQ.0)GO TO 200
+C   BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+      PS1W1=WORDP(W1)+LENGTH
+C   LOOP FOR ALL PAIRS
+      DO 50 J=1,N1S1
+C   POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+      PS1W2=WORDP(W2)
+C   LOOP FOR THESE
+      DO 40 K=1,N2S1
+C   ARE THIS PAIR LENGTH APART?
+      N=PS1W1-PS1W2
+      IF(N.NE.0)GO TO 20
+C   THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+      LMAT=LX2
+      IF(MINMAT.EQ.LX2)GO TO 15
+      IPC=PS1W2+LENGTH
+      IPG=I+LX2
+16    CONTINUE
+      IF(IPG.GT.IDIMG)GO TO 15
+      IF(IPC.GT.IDIM)GO TO 15
+C
+      IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 15
+      LMAT=LMAT+1
+      IPC=IPC+1
+      IPG=IPG+1
+      GO TO 16
+15    CONTINUE
+C  IS MATCH LONG ENOUGH?
+      IF(LMAT.LT.MINMAT)GO TO 20
+      IDSAV=IDSAV+1
+      IF(IDSAV.LE.IDSAVM)GO TO 18
+      WRITE(KBOUT,1000)IDSAVM
+1000  FORMAT(' More than ',I6,' matches. Search aborted')
+      IFAIL=1
+      RETURN
+18    CONTINUE
+      SAVL(IDSAV)=LMAT
+      SAVPG(IDSAV)=I
+      SAVPS(IDSAV)=PS1W1-LENGTH
+20    CONTINUE
+C   POINT TO NEXT W2
+      PS1W2=POSNS(PS1W2)
+40    CONTINUE
+C   ALL TRIED THIS PS1W1, TRY NEXT
+      PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50    CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+      INTEGER FUNCTION CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINL = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = LNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINL = J
+      END
+C
+C      CHANGE
+C
+C   ROUTINE TO EXCHANGE ALL THE CHARS IN A CHARACTER ARRAY USING
+C   A PAIR OF LOOKUP ARRAYS SENT BY CALLING PROG
+C
+C
+      SUBROUTINE CHANGE(SEQ,IDIM1,CHAR1,CHAR2,IDIM2,ELSE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      CHARACTER CHAR1(IDIM2),CHAR2(IDIM2)
+      CHARACTER ELSE
+      DO 100 I=1,IDIM1
+C
+      DO 50 J=1,IDIM2
+C
+      IF(SEQ(I).NE.CHAR1(J))GO TO 50
+C   MATCH SO EXCHANGE CHARS
+      SEQ(I)=CHAR2(J)
+      GO TO 100
+50    CONTINUE
+      SEQ(I)=ELSE
+100   CONTINUE
+C
+      RETURN
+      END
+      CHARACTER*1 FUNCTION CHARSL(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'ctag*-'/
+      CHARSL = C(I:I)
+      END
+      CHARACTER*1 FUNCTION CHARSU(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'CTAG*-'/
+      CHARSU = C(I:I)
+      END
+      INTEGER FUNCTION CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      CLEN= 0
+      LEN = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        LEN = MAX(LEN,(RELPG(I) + ABS(LNGTHG(I)) - 1))
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CLEN = LEN
+      END
+      SUBROUTINE CLIST(GELNO1,LINNO1,IGEL1,GELNO2,LINNO2,
+     +IGEL2,GELNOS,GELSTR,GELEND,IUNIQ1,IUNIQ,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER GELNO1(IGEL1),GELNO2(IGEL2),GELNOS(IUNIQ1)
+      INTEGER GELSTR(IUNIQ1),GELEND(IUNIQ1)
+      INTEGER LINNO1(IGEL1),LINNO2(IGEL2)
+      EXTERNAL INLIST
+C   GELNOS === GEL NUMBERS (GELNOS)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   GELNO  === GEL NUMBERS PER STRIP
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C
+C     WHICH GELS IN GELNO2 DO NOT APPEAR IN GELNO1
+C     IE HAVE STARTED IN GELNO2
+        DO 20 I=1,IGEL2
+        MATCH=INLIST(GELNO1,IGEL1,GELNO2(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH SO NEW
+C         PUT IN GELSTR
+          IUNIQ=IUNIQ+1
+          GELNOS(IUNIQ)=GELNO2(I)
+          GELSTR(IUNIQ)=LINNO2(I)
+        END IF
+20    CONTINUE
+C     WHICH GELS IN GELNO1 DO NOT APPEAR IN GELNO2
+C     IE WHICH HAVE ENDED IN GELNO1
+      DO 10 I=1,IGEL1
+        MATCH=INLIST(GELNO2,IGEL2,GELNO1(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH  SO MUST HAVE ENDED
+C         WHERE IS IT STORED IN GELNOS?
+          MATCH=INLIST(GELNOS,IUNIQ,GELNO1(I))
+          IF(MATCH.NE.0)THEN
+            GELEND(MATCH)=LINNO1(I)
+            GO TO 10
+          END IF
+C         ERROR
+          WRITE(KBOUT,1000)GELNO1(I)
+1000      FORMAT( ' Error: gel number ',I5,
+     +    ' expected but not found in list')
+          IOK = 1
+          RETURN
+        END IF
+10    CONTINUE
+      IOK = 0
+      RETURN
+      END
+C
+C      CMPLMT
+C
+C   SUBROUTINE TO REVERSE AND COMPLEMENT GELS AND DATA BASE
+C   THE POSITIONS OF THE RIGHT ENDS OF GELS ARE FIRST STORED
+C   IN RELPG THEN WE DO A BUBBLE SORT ON THESE POSITIONS
+C   UPDATING RELATIONSHIPS AS WE GO
+C   ALSO SEQUENCES ARE COMPLEMENTED, SIGNS OF LENGTH ARE
+C   MULTIPLIED BY -1 AND THE CONTIG LINE IS ALTERED
+      SUBROUTINE CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,GEL,IDBSIZ,KBOUT,IDEVR,IDEVW,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER X
+C
+      WRITE(KBOUT,1000)LLINO
+1000  FORMAT( ' Complementing contig',I6)
+C   CHAIN THRU AND PUT RIGHT ENDS IN RELPG
+      N=LLINO
+10    CONTINUE
+      RELPG(N)=RELPG(N)+(ABS(LNGTHG(N)))-1
+      IF(RNBR(N).EQ.0)GO TO 20
+      N=RNBR(N)
+      GO TO 10
+20    CONTINUE
+C
+C   NOW EFFECTIVELY BUBBLE SORT ON RELPG
+      N=RNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+      N=NL
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+      NL=LNBR(N)
+      IF(NL.EQ.0)GO TO 30
+      I1=0
+23    CONTINUE
+      IF(RELPG(N).GE.RELPG(NL))GO TO 21
+C   NOT IN CORRECT ORDER SO CHAIN ALONG UNTIL CORRECT,THEN COME
+C   BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE THIS LINE SET POINTER TO CURRENT POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C
+C   EXCHANGE NEIGHBOURS. CURRENTLY LOOKING AT N AND ITS LEFT
+C   NBR, AND THE LEFT NBR IS FURTHER RIGHT THAN N
+C   FIX UP POINTERS TO LEFT AND RIGHT OF THESE TWO
+      M=LNBR(NL)
+      IF(M.NE.0)RNBR(M)=N
+      M=RNBR(N)
+      IF(M.NE.0)LNBR(M)=NL
+      LNBR(N)=LNBR(NL)
+      LNBR(NL)=N
+      RNBR(NL)=RNBR(N)
+      RNBR(N)=NL
+C   CHAIN BACK THRU LIST WITH THIS LINE
+      N=RNBR(NL)
+      IF(N.EQ.0)GO TO 21
+C   IE END MET
+      GO TO 23
+30    CONTINUE
+C   FINISH WITH LEFT END IN N
+40    CONTINUE
+C   NOW REVERSE NBRS SO CHAIN BACK RIGHT
+      NL=RNBR(N)
+      IF(NL.EQ.0)GO TO 50
+      RNBR(N)=LNBR(N)
+      LNBR(N)=NL
+      N=NL
+      GO TO 40
+50    CONTINUE
+C   NEED TO FIX UP NEW LEFT END
+      RNBR(N)=LNBR(N)
+      LNBR(N)=0
+C   ALL POINTERS FIXED NOW DO RELATIVE POSITION
+C   FINISH WITH LEFT END IN N
+C   SO CHAIN BACK RIGHT
+C   SAVE RIGHT LINE NUMBER
+      NL=N
+      X=RELPG(N)
+60    CONTINUE
+      RELPG(N)=1+(-1*(RELPG(N)-X))
+      IF(RNBR(N).EQ.0)GO TO 70
+      N=RNBR(N)
+      GO TO 60
+70    CONTINUE
+C   NOW FIX CONTIG LINE
+      LNBR(LINCON)=NL
+      RNBR(LINCON)=N
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+C   NOW REVERSE AND COMPLEMENT GELS
+      N=NL
+80    CONTINUE
+C      READ(IDEVW,REC=N)GEL
+      CALL READW(IDEVW,N,GEL,MAXGEL)
+      M=ABS(LNGTHG(N))
+      CALL SQREV(GEL,M)
+      CALL SQCOM(GEL,M)
+      CALL WRITEW(IDEVW,N,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=N)GEL
+C   CHANGE SIGNS
+      LNGTHG(N)=-1*LNGTHG(N)
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   ANY MORE?
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 80
+C   NO MORE
+      RETURN
+      END
+C      CONSEN
+C   CALCULATES A CONSENSUS USING THE RULES OUTLINED IN THE DOCUMENTATION
+C   AND SUBROUTINE SUMMER
+C   UNIT IDEV IS USED FOR OUTPUT
+      SUBROUTINE CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +KBIN,KBOUT,IDEVW,IDEV,NAMCON,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,IDM,PERCD,IDEVN,LLINO)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CHARACTER NAMCON*(*)
+100   CONTINUE
+      ISTART=1
+      NAMCON = ' '
+      CALL OPENF1(IDEV,NAMCON,1,IOK,KBIN,KBOUT,
+     +'Name for consensus file',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(ANS,'Make consensus for whole database',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      CALL BUSY(KBOUT)
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded,',/,
+     +    ' calculation aborted')
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+Y
+110   CONTINUE
+      ISTART=ISTART-1
+      CALL YESNO(ANS,'Staden format',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.0) RETURN
+      IF(ANS.EQ.0) THEN
+        CALL FMTDK(IDEV,SEQ1,ISTART)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,ISTART,NAMPRO,KBOUT,IOK)
+      END IF
+      RETURN
+150   CONTINUE
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)GO TO 400
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)GO TO 400
+      IDIM2=RREG-LREG+1
+      IF((ISTART+19+IDIM2).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         RETURN
+      END IF
+      CALL BUSY(KBOUT)
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1(ISTART),IDIM2,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM2
+300   CONTINUE
+      CALL YESNO(ANS,'Select another contig',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0) GO TO 150
+      ISTART=ISTART-1
+      CALL YESNO(ANS,'Staden format',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.0) RETURN
+      IF(ANS.EQ.0) THEN
+        CALL FMTDK(IDEV,SEQ1,ISTART)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,ISTART,NAMPRO,KBOUT,IOK)
+      END IF
+400   CONTINUE
+      CALL YESNO(ANS,'Make another consensus',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0)GO TO 100
+      END
+      SUBROUTINE WRITCF(IDEV,SEQ,IDSEQ,NAMPRO,KBOUT,IOK)
+      CHARACTER SEQ(IDSEQ),TITLE*10,ENAME*10,NAMPRO*(*)
+      CHARACTER NL
+      PARAMETER (MAXDG = 5)
+      EXTERNAL INDEXA
+C
+C write out in fasta format
+C also change -,* to N
+C currently name is project name for single contig, but left gel number
+C for multiple contigs, title is always left gel number
+C
+      NC = 0
+      DO 1 I=1,IDSEQ
+        IF (SEQ(I).EQ.'>') NC = NC + 1
+ 1    CONTINUE
+      NL = CHAR(10)
+      ENAME = ' '
+      I = INDEX(NAMPRO,'.')
+      ENAME(1:I-1) = NAMPRO(1:I-1)
+      IAT = 1
+ 10   CONTINUE
+      IF (IAT.GT.IDSEQ) THEN
+        CLOSE(UNIT=IDEV)
+        IOK =0
+        RETURN
+      END IF
+      IF(SEQ(IAT).NE.'<') THEN
+        CALL ERROM(KBOUT,'Missing header in consensus')
+        IOK = 1
+        RETURN
+      END IF
+      INAMES = INDEXA(SEQ(IAT),20,'.')
+      IF (INAMES.EQ.0) THEN
+        CALL ERROM(KBOUT,'Missing dot in header')
+        IOK = 1
+        RETURN
+      END IF
+      INAMES = IAT + INAMES
+      INAMEE = INAMES + MAXDG - 1
+      TITLE = ' '
+      K = 0
+      DO 20 I=INAMES,INAMEE
+        K = K + 1
+        TITLE(K:K) = SEQ(I)
+ 20     CONTINUE
+      IF (NC.GT.1) ENAME = TITLE
+      IAT = IAT + 20
+      IDT = IDSEQ-IAT+2
+      IDSQ = INDEXA(SEQ(IAT),IDT,'<')
+      IF (IDSQ.EQ.0) IDSQ = IDT
+      IDSQ = IDSQ - 1
+      CALL SETCCS(SEQ(IAT),IDSQ)
+      CALL WRITFF(IDEV,SEQ(IAT),IDSQ,ENAME,TITLE)
+      IAT = IAT + IDSQ
+      GO TO 10
+      END
+      SUBROUTINE SETCCS(SEQ,IDSEQ)
+      CHARACTER SEQ(IDSEQ),TO(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE TO
+      DATA TO/'t','c','a','g','n'/
+C
+C change chars in array seq of type found to type to
+C
+      DO 10 I=1,IDSEQ
+        K = CTONUM(SEQ(I))
+        SEQ(I) = TO(K)
+ 10     CONTINUE
+      END
+      SUBROUTINE COPYM(JLEFTS,ILEFTS,JLC,ILC,
+     +JPOSC,IPOSC,JSENSE,ISENSE,JLLINO,LLINO,
+     +JJOINT,JOINT,JTOTPC,ITOTPC,JTOTPG,ITOTPG,
+     +JTYPE,ITYPE,JDOUT,IDOUT,JDIM22,IDIM22,
+     +SEQG3,SEQG2,SEQC3,SEQC2,PERMS,PERMIS)
+      CHARACTER SEQG3(JDIM22),SEQG2(JDIM22),SEQC3(JDOUT),SEQC2(JDOUT)
+      ILEFTS = JLEFTS
+      ILC = JLC
+      IPOSC = JPOSC
+      ISENSE = JSENSE
+      LLINO = JLLINO
+      JOINT = JJOINT
+      ITOTPC = JTOTPC
+      ITOTPG = JTOTPG
+      ITYPE = JTYPE
+      IDOUT = JDOUT
+      IDIM22 = JDIM22
+      CALL SQCOPY(SEQG3,SEQG2,JDIM22)
+      CALL SQCOPY(SEQC3,SEQC2,JDOUT)
+      PERMIS = PERMS
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL,LO,PERMAX,ISHOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IFAIL = 1
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        CALL ERROM(KBOUT,'DALIGN: matching region too long')
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF (X.GT.PERMAX) RETURN
+      IF (ISHOW.EQ.1) THEN
+        WRITE(KBOUT,1002)
+1002    FORMAT(' Best alignment found')
+        CALL SQMTCH(SEQC2(1),SEQG2(IENDG),SEQ3,LO)
+        CALL FMT4LN(SEQC2(1),SEQG2(IENDG),SEQ3,LO,IENDC,IENDG,KBOUT)
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP,IERR,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP(IDBSIZ)
+      EXTERNAL LCCHEK,NCCHEK
+C to check the logical consistency of a database
+C
+C 1. are all nbrs holding hands
+C 2. are all gels in exactly 1 contig
+C 3. are there loops in contigs
+C 4. do the gels designated left or right ends have outward neighbours
+C 5. are the relative positions in same order as hand holding
+C 6. are there gels of zero length
+C 7. are there contigs of length < 1
+C 8. does the designated length of the contigs agree with the gel positions
+C 9. if i chain left thru a contig do i reach the gel designated as the left end
+C10.if i chain right thru a contig do i reach the gel designated as the right end
+C
+C return error code 2 for all errors except where only error is "gel not used"
+C for which we return 1
+      IERR=0
+C   hand holding OK?
+      DO 100 I=1,NGELS
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 50
+        IF(RNBR(K).EQ.I)GO TO 50
+        WRITE(KBOUT,1000)I
+1000    FORMAT(' Hand holding problem for gel reading',I6)
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+1001    FORMAT(' ',5I6)
+        IERR=2
+50      CONTINUE
+        K=RNBR(I)
+        IF(K.EQ.0)GO TO 100
+        IF(LNBR(K).EQ.I)GO TO 100
+        WRITE(KBOUT,1000)I
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+        IERR=2
+100   CONTINUE
+C
+C   are all gels in exactly 1 contig
+C
+      CALL FILLI(TEMP,IDBSIZ,0)
+      N=IDBSIZ-NCONTS
+C
+C count the number of times thru loop 320 to 300: if this exceeds the
+C database size a loop has been found
+C
+      ICOUNT=0
+      DO 300 I=N,IDBSIZ-1
+        II=I
+        K=LNBR(I)
+        J=RNBR(I)
+        IF((K.NE.0).AND.(J.NE.0))GO TO 310
+C
+C This contig points to zero gel number as a left or right end
+C
+        WRITE(KBOUT,1002)I
+1002    FORMAT(' Contig',I4,' has gel numbers of zero')
+        IERR=2
+        GO TO 290
+310     CONTINUE
+        IF((LNBR(K).EQ.0).AND.(RNBR(J).EQ.0))GO TO 290
+C
+C These ends reads have outward neighbours
+C
+        WRITE(KBOUT,1004)I
+1004  FORMAT(' The end gels of contig',I4,' have outward neighbours')
+        IERR=2
+290     CONTINUE
+C
+C Does the contig have nonzero length?
+C
+        IF(RELPG(I).GT.0)GO TO 320
+        WRITE(KBOUT,1010)I
+1010    FORMAT(' The contig on line number',I4,' has zero length')
+        IERR=2
+320     CONTINUE
+        TEMP(K)=TEMP(K)+1
+        ICOUNT=ICOUNT+1
+        IF(ICOUNT.GT.IDBSIZ)GO TO 601
+        K=RNBR(K)
+        IF(K.NE.0)GO TO 320
+300   CONTINUE
+      DO 400 I=1,NGELS
+        IF(TEMP(I).EQ.1)GO TO 390
+        IF(TEMP(I).EQ.0)GO TO 410
+        WRITE(KBOUT,1005)I,TEMP(I)
+1005    FORMAT(' Gel number ',I6,' is used ',I6,' times')
+        IERR=2
+        GO TO 400
+390     CONTINUE
+C
+C does the gel have nonzero length (only check those used once)
+C
+        IF(LNGTHG(I).NE.0)GO TO 400
+        WRITE(KBOUT,1011)I
+1011    FORMAT(' Gel number',I6,' has zero length')
+        IERR=2
+        GO TO 400
+410     CONTINUE
+        WRITE(KBOUT,1006)I
+1006    FORMAT(' Gel number ',I6,' is not used')
+C
+C need to increase the error count (dont reset to lower value)
+C
+        IF(IERR.LT.2)IERR=1
+400   CONTINUE
+C
+C all relative positions ok?
+C
+      N=IDBSIZ-NCONTS
+      DO 500 I=N,IDBSIZ-1
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 500
+510     CONTINUE
+        J=RNBR(K)
+        IF(J.EQ.0)GO TO 500
+        IF(RELPG(K).GT.RELPG(J))GO TO 520
+        K=J
+        GO TO 510
+520     CONTINUE
+        WRITE(KBOUT,1007)K,RELPG(K),J,RELPG(J)
+1007    FORMAT(' Gel number',I6,' with position',I6,
+     +  ' is the left neighbour of',
+     +  /,' gel number',I6,' with position',I6)
+        K=J
+        IERR=2
+        GO TO 510
+500   CONTINUE
+      IOK = LCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,KBOUT)
+      IF (IOK.NE.0) IERR = 2
+      IOK = NCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,KBOUT)
+      IF (IOK.NE.0) IERR = 2
+      IF(IERR.EQ.0) WRITE(KBOUT,1013)
+1013  FORMAT(' Database is logically consistent')
+      RETURN
+601   CONTINUE
+      IERR=2
+      WRITE(KBOUT,1008)II
+1008  FORMAT(' Loop in contig',I6,/,
+     +' No further checking done but gel numbers follow')
+      CALL FILLI(TEMP,IDBSIZ,0)
+      K=LNBR(II)
+710   CONTINUE
+      TEMP(K)=TEMP(K)+1
+      WRITE(KBOUT,1009)K
+1009  FORMAT(' ',I6)
+      IF(TEMP(K).GT.1)RETURN
+      K=RNBR(K)
+      GO TO 710
+      END
+      INTEGER FUNCTION LCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CLEN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      EXTERNAL CLEN
+      LCCHEK = 0
+      DO 10 I=IDBSIZ-NCONTS,IDBSIZ-1
+        IL = LNBR(I)
+        L1 = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = RELPG(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1000)I,L2,L1
+ 1000     FORMAT(
+     +' Contig line',I6,' records length',I6,' but actual length is',I6)
+        LCCHEK = LCCHEK + 1
+        END IF
+ 10   CONTINUE
+      END
+      INTEGER FUNCTION NCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHAINL,CHAINR
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      EXTERNAL CHAINL,CHAINR
+      NCCHEK = 0
+      DO 10 I=IDBSIZ-NCONTS,IDBSIZ-1
+        IL = LNBR(I)
+        L1 = CHAINR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = RNBR(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1000)I,L2,L1
+ 1000     FORMAT(
+     +' Contig line',I6,' records right neighbour as',I6,
+     +' but left to right chaining gives',I6)
+        NCCHEK = NCCHEK + 1
+        END IF
+        IL = RNBR(I)
+        L1 = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = LNBR(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1001)I,L2,L1
+ 1001     FORMAT(
+     +' Contig line',I6,' records left neighbour as',I6,
+     +' but right to left chaining gives',I6)
+        NCCHEK = NCCHEK + 1
+        END IF
+ 10   CONTINUE
+      END
+      INTEGER FUNCTION CHAINR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG RIGHT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINR = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINR = J
+      END
+C   DBPRNT
+C   PRINTS A DATABASE. IE ITS RELATIONSHIPS
+      SUBROUTINE DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +IDEV,KBIN,KBOUT,IDEVN,LLINO,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LREG,RREG,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMARC*16
+      IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +NCONTS,IDBSIZ,IDEV)
+      WRITE(IDEV,10011)NGELS,NCONTS
+10011 FORMAT(' Number of gel readings ',I6,' Number of contigs ',I6)
+20    CONTINUE
+      CALL YESNO(ANS,'Select contigs',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0) GO TO 45
+      N=IDBSIZ-NCONTS
+25    CONTINUE
+      CALL YESNO(ANS,'Show gel readings in positional order',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)GO TO 41
+      WRITE(IDEV,1009)
+1009  FORMAT(' CONTIG LINES')
+      WRITE(IDEV,1000)
+1000  FORMAT(
+     +' CONTIG              LINE  LENGTH               ENDS'/
+     +'                                             LEFT   RIGHT')
+      DO 30 I=N,IDBSIZ-1
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+30    CONTINUE
+1007  FORMAT( ' ',18X,I6,2X,I7,9X,I6,2X,I6)
+      WRITE(IDEV,1008)
+1008  FORMAT(' GEL LINES')
+      WRITE(IDEV,1001)
+ 1001 FORMAT(
+     +' NAME              NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                             LEFT   RIGHT')
+      DO 40 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        WRITE(IDEV,1006)NAMARC,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1006    FORMAT( ' ',A,2X,I6,2X,I7,2X,I5,2X,I6,2X,I6)
+40    CONTINUE
+      RETURN
+C
+41    CONTINUE
+C
+C   SORTED DATA
+      DO 43 I=N,IDBSIZ-1
+        WRITE(IDEV,1021)
+1021    FORMAT( )
+        WRITE(IDEV,1000)
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+        J=LNBR(I)
+        WRITE(IDEV,1001)
+42      CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV,1006)NAMARC,J,RELPG(J),LNGTHG(J),LNBR(J),RNBR(J)
+        J=RNBR(J)
+        IF(J.NE.0)GO TO 42
+43    CONTINUE
+      RETURN
+45    CONTINUE
+C   SELECTED CONTIGS ONLY
+C
+C   GET GEL NUMBER AND CONTIG NUMBER
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1009)
+      WRITE(IDEV,1000)
+      WRITE(IDEV,1007)LINCON,RELPG(LINCON),LNBR(LINCON),RNBR(LINCON)
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1008)
+      N=LLINO
+      WRITE(IDEV,1001)
+46    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).EQ.0)GO TO 48
+      N=RNBR(N)
+      IF(RELPG(N).GT.RREG)GO TO 48
+      GO TO 46
+48    CONTINUE
+      GO TO 45
+      END
+      SUBROUTINE DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,KBIN,KBOUT,IDEVW,
+     +IDEV,LINLEN,PERCD,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,LINOU1,LINOU2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEVN,
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CHARACTER FILEH*(*)
+      PARAMETER (MAXPRM = 10)
+      CHARACTER PROMPT(2)*(MAXPRM)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0) RETURN
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0) RETURN
+      IDIM2=RREG-LREG+1
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL DBSCSM(SEQ1(LREG),IDIM2,KBOUT)
+160   CONTINUE
+      ANS = 1
+      PROMPT(1) = 'List codes'
+      PROMPT(2) = 'Plot codes'
+      CALL RADION('Select results display mode',PROMPT,2,ANS,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.1) RETURN
+      IF(ANS.EQ.1) THEN
+        CALL FMTDB(SEQ1,IDIM1,LREG,RREG,LINLEN,IDEV)
+        RETURN
+      ELSE
+        CALL PLTQ(SEQ1(LREG),IDIM2,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      END
+      SUBROUTINE DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,LLINO,
+     +PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      IDIM2=RREG-LREG+1
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL PLTQ(SEQ1(LREG),IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE DBSCSM(SEQ1,IDIM1,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER CODES(5)
+      REAL X(5)
+      SAVE CODES
+      DATA CODES/'0','1','2','3','4'/
+      DO 50 J=1,5
+        X(J)=0.
+50    CONTINUE
+      DO 100 I=1,IDIM1
+        DO 60 J=1,5
+          IF(SEQ1(I).NE.CODES(J))GO TO 60
+          X(J)=X(J)+1.
+          GO TO 61
+60      CONTINUE
+61      CONTINUE
+100   CONTINUE
+      SUM=0.
+      DO 130 J=1,5
+        SUM=SUM+X(J)
+130   CONTINUE
+      DO 140 J=1,5
+        IF(SUM.NE.0)X(J)=X(J)*100./SUM
+140   CONTINUE
+      WRITE(KBOUT,1001)X(1)
+1001  FORMAT(' ',F6.2,'% OK on both strands and they agree(0)')
+      WRITE(KBOUT,1002)X(2)
+1002  FORMAT(' ',F6.2,'% OK on plus strand only(1)')
+      WRITE(KBOUT,1003)X(3)
+1003  FORMAT(' ',F6.2,'% OK on minus strand only(2)')
+      WRITE(KBOUT,1004)X(4)
+1004  FORMAT(' ',F6.2,'% Bad on both strands(3)')
+      WRITE(KBOUT,1005)X(5)
+1005  FORMAT(' ',F6.2,'% OK on both strands but they disagree(4)')
+      RETURN
+      END
+C     DBSTAT
+      SUBROUTINE DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      N=IDBSIZ-NCONTS
+      SUM=0.
+      DO 20 I=N,IDBSIZ-1
+        SUM=SUM+RELPG(I)
+20    CONTINUE
+      AV=SUM/NCONTS
+      WRITE(KBOUT,1020)SUM,AV
+1020  FORMAT( ' Total contig length ',F10.0,'  Average',
+     +' length ',F10.1)
+      SUMG=0.
+      DO 30 I=1,NGELS
+        SUMG=SUMG+FLOAT(ABS(LNGTHG(I)))
+30    CONTINUE
+      AV=SUMG/SUM
+      WRITE(KBOUT,1021)SUMG
+1021  FORMAT( ' Total characters in gel readings ',F10.0)
+      WRITE(KBOUT,1022)AV
+1022  FORMAT
+     +( ' Average gel characters per consensus character ',F10.2)
+99    CONTINUE
+      RETURN
+      END
+C     DELCON
+C
+C   DELETES CONTIG FROM CONSENSUS SEQUENCE
+      SUBROUTINE DELCON(SEQ1,ILEFT,ILC,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+C   FIRST CHAR TO REPLACE
+      I1=ILEFT-20
+C   FIRST CHAR TO MOVE
+      I2=ILEFT+ILC
+C   IS THIS RIGHTMOST CONTIG ANYWAY?
+      IF(I2.GT.IDIM1)GO TO 10
+C   NUMBER TO MOVE
+      ID=IDIM1-I2+1
+C   MOVE
+      CALL SQCOPY(SEQ1(I2),SEQ1(I1),ID)
+C   RESET LENGTH
+      IDIM1=I1+ID-1
+      RETURN
+10    CONTINUE
+C   RIGHTMOST CONTIG SO DONT MOVE
+      IDIM1=I1-1
+C
+      RETURN
+      END
+      SUBROUTINE DISMAT(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C   EDIT 07-02-83 TO CHECK FOR CASE WHEN GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL!  DONE BY HAVING A
+C   PARAMETER THAT STORES THE POSITION OF THE LEFT END OF THE
+C   NEXT CONTIG (IE THE ONE AFTER THE ONE THE CURRENT GEL OVERLAPS)
+C   SET IT TO A VERY LARGE VALUE INITIALLY
+      NEXTC=99999
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPARATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C      WRITE(KBOUT,1000)SAVPG(1),SAVPS(1)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     +SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C      WRITE(KBOUT,1000)SAVPG(I),SAVPS(I)
+C1000  FORMAT(' ',2I6)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     +SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       DISMAU
+C   ROUTINE TO DISPLAY MATCHES
+C   EDITED 17-12-81 TO NOT SUBTRACT 1 FROM LCL AND LGR
+      SUBROUTINE DISMAU(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC (SEE DISMAT)
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with vector number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   SAVE POSN OF END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      RETURN
+      END
+      SUBROUTINE DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINOO,LINCON,LREG,RREG,GEL2,I1,IDIM,NOPT,
+     +LLINOR,IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,
+     +MAXGEL,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER CHRSIZ
+      PARAMETER (CHRSIZ = 6)
+      PARAMETER (IDC1 = CHRSIZ*100)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,X,XLS2,XLS1,XRS2,XRS1,RREG2
+      CHARACTER MATCH(100)
+      INTEGER CHARS(CHRSIZ,100),CHARS1(IDC1)
+      CHARACTER NAMARC*16
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      INTEGER RELPOS(10),RELPO2(10)
+      INTEGER GELC
+      INTEGER RP
+      INTEGER LSEQNO,RSEQNO
+      CHARACTER LINOUT(100)
+      CHARACTER MUNOTP
+      CHARACTER GTCONC
+      EXTERNAL GTCONC
+      EQUIVALENCE (CHARS1,CHARS)
+      CALL FILLI(CHARS1,IDC1,0)
+C   SET CONTIG NUMBER
+      ICON=1
+      LLINO=LLINOO
+      NLEN=LINLEN/10
+      LSEQNO=LREG
+      X=LINLEN+LSEQNO-1
+      RSEQNO=MIN(RREG,X)
+C   SET LEFT GEL NUMBER FOR RIGHT CONTIG
+      LN2=LLINOR
+C   FIRST GEL NO IS LLINOO
+C   SET RREG FOR RIGHT CONTIG
+      RREG2=IDIM
+C   SET UP LSEQNO,RSEQNO FOR FOR NOPT=3
+      XLS2=I1
+      XRS2=RSEQNO-LSEQNO+XLS2
+9     CONTINUE
+C   IF RIGHT CONTIG SKIP NUMBER PRINTING
+      IF(ICON.EQ.2)GO TO 8
+C   NEED TO KEEP LONGEST LINE LENGTH FOR OUTPUT OF CONSENSUS
+      IE=0
+C   SETUP AND WRITE NUMBERS
+      RELPOS(1)=LSEQNO+9
+      DO 5 I=2,NLEN
+        RELPOS(I)=RELPOS(I-1)+10
+5     CONTINUE
+      WRITE(IDEV,1023)
+     +(RELPOS(K),K=1,MIN(NLEN,MAX(1,(RSEQNO-LSEQNO+1)/10)))
+1023  FORMAT( ' ',25X,10(I9,1X))
+C   SET CURRENT LINE NUMBER
+8     CONTINUE
+      GELC=LLINO
+10    CONTINUE
+C   IS LEFT END OF CURRENT GEL >RREG
+      IF(RELPG(GELC).GT.RSEQNO)GO TO 200
+C   ALSO NEED TO KNOW IF RIGHT END  ON THIS LINE (IF .LT. NO DATA
+C   TO DISPLAY)
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      IF(X.LT.LSEQNO)GO TO 190
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+      CALL FILLC(LINOUT,LINLEN,' ')
+      CALL READN(IDEVN,GELC,NAMARC)
+C
+C   NEED TO KNOW HOW MANY CHARS TO COPY OVER TO OUTPUT LINE
+C   AND WHERE IN LINE TO PUT THEM
+C   CURRENT LINE LEFT END IS LSEQNO,RIGHT END RSEQNO
+C   SO LEFT START CHAR IS
+      X=MAX(LSEQNO,RELPG(GELC))
+C   POSITION IN ARRAY LINE
+      LP=X-LSEQNO+1
+C   RIGHT END CHAR IS
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      X=MIN(RSEQNO,X)
+C   POSITION IN ARRAY LINE
+      RP=X-LSEQNO+1
+C   LOOK FOR LONGEST LINE
+      IF(RP.GT.IE)IE=RP
+C   NEED LEFT START IN GEL
+      K=LSEQNO-RELPG(GELC)+1
+      IF(K.LT.1)K=1
+      NCOP=RP-LP+1
+      IF(NCOP.GT.0)CALL SQCOPY(GEL(K),LINOUT(LP),NCOP)
+      N=LP+NCOP-1
+      II=K-1
+      IF(IDM.EQ.26)THEN
+        DO 50 I = LP,N
+          II = II + 1
+          CALL PCON1(GEL(II),CHARS(1,I))
+50      CONTINUE
+      ELSE
+        DO 70 I=LP,N
+          II=II+1
+          JJ = INDEXS(GEL(II),JSCORE)
+          CHARS(JJ,I) = CHARS(JJ,I) + JSCORE
+C          CHARS(CHRSIZ,I) = CHARS(CHRSIZ,I) + JSCORE
+70      CONTINUE
+      END IF
+      I=SIGN(GELC,LNGTHG(GELC))
+      WRITE(IDEV,1020)I,NAMARC,(LINOUT(K),K=1,RP)
+C1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+1020  FORMAT( ' ',I6,1X,A,1X,100A1)
+C
+190   CONTINUE
+C   NOW GET NEXT GEL TO RIGHT
+      GELC=RNBR(GELC)
+      IF(GELC.NE.0)GO TO 10
+200   CONTINUE
+C   CALC CONSENSUS AND WRITE IT
+      IF(IDM.EQ.26)THEN
+        DO 49 I = 1,LINLEN
+          LINOUT(I) = MUNOTP(CHARS(1,I))
+          CHARS(1,I) = 0
+49      CONTINUE
+      ELSE
+        DO 230 I=1,LINLEN
+          LINOUT(I) = GTCONC(CHARS(1,I),CHRSIZ,PERCD)
+          CALL FILLI(CHARS(1,I),CHRSIZ,0)
+230     CONTINUE
+      END IF
+      WRITE(IDEV,1019)(LINOUT(K),K=1,IE)
+C   IF REQUIRED WRITE COMPARISON GEL
+C   WHICH OPTION IN OPERATION?
+      IF(NOPT.EQ.2)GO TO 52
+      IF(NOPT.NE.3)GO TO 250
+53    CONTINUE
+C   ALREADY DONE THIS LINE CONTIG2?
+      IF(ICON.EQ.2)GO TO 54
+      ICON=2
+C   NEED TO SAVE CONSENSUS FROM LEFT CONTIG
+      CALL SQCOPY(LINOUT,GEL2,IE)
+C   SAVE VALUES FROM LEFT CONTIG
+      XLS1=LSEQNO
+      XRS1=RSEQNO
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN1=LLINO
+C   SET UP VALUES FOR RIGHT CONTIG
+      LSEQNO=XLS2
+      RSEQNO=XRS2
+C   SET LEFT GEL NUMBER
+      LLINO=LN2
+C   GET NEXT GEL
+      GO TO 150
+54    CONTINUE
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN2=LLINO
+C   SET VALUES FOR RIGHT CONTIG NEXT PASS
+      XLS2=XRS2+1
+      XRS2=XLS2+LINLEN-1
+      IF(XRS2.GT.RREG2)XRS2=RREG2
+C   SET UP VALUES FOR LEFT CONTIG
+      LLINO=LN1
+      ICON=1
+      LSEQNO=XLS1
+      RSEQNO=XRS1
+C   SET DECREMENT FOR POINTER TO GEL2
+      MMM=I1-1
+52    CONTINUE
+C1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+1017  FORMAT('        NEWGEL           ',100A1)
+1018  FORMAT('        MISMATCH         ',100A1)
+1019  FORMAT('        CONSENSUS        ',100A1)
+1022  FORMAT( ' ',26X,100A1)
+      I2=I1+LINLEN-1
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(NOPT.EQ.2)WRITE(IDEV,1017)(GEL2(K),K=I1,I2)
+C   SET DECREMENT
+      IF(NOPT.EQ.2)MMM=0
+55    CONTINUE
+      CALL FILLC(MATCH,LINLEN,'*')
+      K=0
+      DO 667 J=I1,I2
+        K=K+1
+        IF(GEL2(J-MMM).EQ.LINOUT(K))MATCH(K) = ' '
+667   CONTINUE
+      WRITE(IDEV,1018)(MATCH(K),K=1,IE)
+      RELPO2(1)=(I1)+9
+      DO 240 I=2,NLEN
+        RELPO2(I)=RELPO2(I-1)+10
+240   CONTINUE
+      WRITE(IDEV,1023)(RELPO2(K),K=1,NLEN)
+      I1=I2+1
+      I2=I2+LINLEN
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(I1.GT.I2)RETURN
+250   CONTINUE
+C
+      WRITE(IDEV,1021)
+1021  FORMAT( )
+C   NEXT LINE LENGTH
+C   NEXT LENGTH IS OLD RIGHT +1
+      LSEQNO=RSEQNO+1
+C   NEW RIGHT IS LEFT +LENGTH
+      RSEQNO=LSEQNO+(LINLEN)-1
+C   ARE WE OVER END OF REGION
+      IF(RSEQNO.GT.RREG)RSEQNO=RREG
+C   HAVE WE FINISHED REGION COMPLETELY
+      IF(RSEQNO.LT.LSEQNO) RETURN
+C   NOT FINISHED SO NEED TO FIND CURRENT LEFT GEL NO
+C   CURRENT LEFT GEL IS LLINO
+C
+150   CONTINUE
+C   NEED TO KNOW IF CURRENT LEFT GELS RIGHT END IS INSIDE REGION
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.LSEQNO)GO TO 9
+C   LOOK AT NEXT GEL TO RIGHT
+      LLINO=RNBR(LLINO)
+C   MAY HAVE GONE OVER END OF CONTIG?????
+      IF(LLINO.GT.0)GO TO 150
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE EC(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      PARAMETER (IDASH = 6)
+      K = 0
+      DO 10 I = 1,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.IDASH) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            GEL(I) = CHARSL(JC)
+            K = K + 1
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS CORRECTED=',K
+      END
+      SUBROUTINE ED(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = MIN(IDC,IDG),1,-1
+        JC = INDEXS(CON(I),J)
+        IF(JC.EQ.5) THEN
+          IF(I.LT.IDG) CALL SQCOPY(GEL(I+1),GEL(I),IDG-I)
+          K = K + 1
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS DELETED=',K
+      END
+      SUBROUTINE EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER CON(IDC)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C  CHANGE RELATIVE POSITIONS FOR AE
+      ND = 0
+      DO 10 I = IDC,1,-1
+        IF(CON(I).EQ.'*') THEN
+          ND = ND + 1
+          K = I + LREG - 1
+          J = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,K)
+          IF(J.NE.0) THEN
+            CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,IDUM,JDUM,IDEVR,
+     +      IDBSIZ,J,NCONT,-1)
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF DELETIONS=',ND
+      END
+      SUBROUTINE ET(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = 2,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.6) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            JNG = INDEXS(GEL(I-1),J)
+            JNC = INDEXS(CON(I-1),J)
+            IF(JNC.NE.JNG) THEN
+              IF((JNG.EQ.JC).AND.(JNC.EQ.JG)) THEN
+                GEL(I) = CHARSL(JNG)
+                GEL(I-1) = CHARSL(JG)
+                K = K + 1
+              END IF
+            END IF
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF CHARS TRANSPOSED=',K
+      END
+      SUBROUTINE FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND
+      STRAND = 1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHP)
+      IF(DEPTHP.LT.0) RETURN
+      STRAND = -1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHM)
+      IF(DEPTHM.LT.0) RETURN
+      CALL PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      END
+      SUBROUTINE FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTH,STRAND
+      EXTERNAL NCDEP
+C  LREG = left contig position
+C  RREG = right ''      ''
+C  LENCON = RREG-LREG+1
+      I = LGEL
+      DEPTH = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      ELSE
+        DEPTH = -1
+        RETURN
+      END IF
+C      WRITE(*,*)'LGEL',LGEL
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            K = RELPG(I) + ABS(LNGTHG(I)) -1
+            DEPTH = MAX(NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,
+     +      STRAND,K),DEPTH)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+C      WRITE(*,*)'DEPTH',DEPTH
+      END
+C      FIND
+C
+C   SUBROUTINE TO FIND THE FIRST OCCURENCE OF A GIVEN STRING
+C   IN A GIVEN ARRAY
+C
+      SUBROUTINE FIND(SEQ,IDIM1,STRING,IDIM2,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2),DASH
+      INTEGER PSEQ,PSTR
+      SAVE DASH
+      DATA DASH/'-'/
+      PSEQ=0
+      PSTR=1
+      IMATCH=0
+C
+100   CONTINUE
+C
+C   PUT PSEQ TO WHERE THIS FAILED MATCH STARTED
+      PSEQ=PSEQ+1-PSTR
+C
+400   CONTINUE
+C
+      PSTR=0
+C
+500   CONTINUE
+C
+C   POINT TO NEXT SEQ CHAR
+      PSEQ=PSEQ+1
+C   TEST FOR END
+      IF(PSEQ.GT.IDIM1)GO TO 300
+C   POINT TO NEXT STRING CHAR
+      PSTR=PSTR+1
+C   TEST FOR DASH IN STRING
+      IF(STRING(PSTR).EQ.DASH)GO TO 450
+C   TEST FOR DASH IN SEQ
+      IF(SEQ(PSEQ).EQ.DASH)GO TO 400
+C   TEST FOR MATCH
+      IF(SEQ(PSEQ).NE.STRING(PSTR))GO TO 100
+C
+450   CONTINUE
+C
+C   TEST FOR END OF STRING IE. WHOLE STRING MATCH
+      IF(PSTR.LT.IDIM2)GO TO 500
+C   HAVE MATCH. GET POINTER TO WHERE IT STARTED
+      IMATCH=PSEQ-IDIM2+1
+C
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE FMT4LP(SEQ1,SEQ2,IDIM,ISW,ISX,IDEV,NAME1,NAME2)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(60),NAME1*(*),NAME2*(*)
+      INTEGER KL(6)
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+10    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+      N=IE-IS+1
+      N=1+(N-1)/10
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,N
+        KL(J)=ISWW
+        ISWW=ISWW+10
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+      WRITE(IDEV,1002)NAME1,(SEQ1(K),K=IS,IE)
+      IL = IE - IS + 1
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1003)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)NAME2,(SEQ2(K),K=IS,IE)
+1002  FORMAT(2X,A,2X,6(10A1,1X))
+ 1003 FORMAT(10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,N
+        KL(J)=ISXX
+        ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+1001  FORMAT( 5X,6(I6,5X))
+      IF(IE.LT.IDIM) GO TO 10
+      END
+      SUBROUTINE FMTDB(SEQ1,IDIM,ISW,ISE,LINLEN,IDEV)
+C   NOTE SAME AS FMTSEP!
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM)
+      INTEGER KL(12)
+      ISWW=ISW-1
+      IE=ISW-1
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.ISE)IE=ISE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,MIN(IE-IS+1,LINLEN)/10)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.ISE)RETURN
+      GO TO 1
+      END
+      SUBROUTINE FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   AUTHOR: RODGER STADEN
+C   STORES THEIR POSITIONS IN CENDS AND THEIR LEFT LINE NUMBERS IN NENDS
+      PARAMETER (MAXDG = 5)
+      CHARACTER SEQ(IDIM),DC(MAXDG)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      EXTERNAL IFROMC,INDEXA
+      IDCEND=0
+      DO 10 I=1,IDIM
+        IF(SEQ(I).NE.'<')GO TO 10
+        IDCEND=IDCEND+1
+C       PUT POSITION OF LEFT END OF CONTIG IN CENDS
+        CENDS(IDCEND)=I
+        K = INDEXA(SEQ(I),20,'.')
+        IF(K.EQ.0) THEN
+           WRITE(KBOUT,*)'Error in contig title: no dot!'
+           IDCEND = 0
+           RETURN
+         END IF
+        K = K + I
+C        K=I+11
+        DO 5 J=1,MAXDG
+          DC(J)=SEQ(K)
+          K=K+1
+5       CONTINUE
+        NENDS(IDCEND)=IFROMC(DC,MAXDG,KBOUT)
+10    CONTINUE
+C     STORE POSITION OF LAST CHAR +1 TO SIMPLIFY DISPLAY ROUTINES
+      CENDS(IDCEND+1)=IDIM+1
+      RETURN
+      END
+      INTEGER FUNCTION GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      GCLIN = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          GCLIN = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      INTEGER FUNCTION GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*17,NAME2*16,NAME3*17,NFLAG
+      PARAMETER (NFLAG='/')
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELID = 0      
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 17
+      CALL GTSTR('Contig identfier',NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELID = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        CALL CCASE(NAME1,1)
+        DO 20 I=1,NGELS
+          CALL READN(IDEVN,I,NAME2)
+          CALL CCASE(NAME2,1)
+          IF(NAME1(2:17).EQ.NAME2) THEN
+            GELID = I
+            RETURN
+          END IF
+20     CONTINUE
+        WRITE(KBOUT,1004)NAME1(2:)
+1004    FORMAT(' ',A,' is not in the database!')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELID
+1001    FORMAT(I17)
+        IF((GELID.LT.1).OR.(GELID.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +NCONTS,GEL,MAXGEL,IDEV3,IDEV4,IDEV5,IDEV1,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER  GEL(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER NAMARC*16
+      FILNAM = ' '
+      CALL OPENF1(IDEV5,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for names of extracted gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(I,'Extract ends of contigs only',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.0) RETURN
+      IF(I.EQ.0) GO TO 15
+      DO 10 I=1,NGELS
+        L=ABS(LNGTHG(I))
+        IF(L.GT.0)THEN
+          CALL READN(IDEV4,I,NAMARC)
+          WRITE(KBOUT,1002)NAMARC
+1002      FORMAT(' ',A)
+          WRITE(IDEV5,1003)NAMARC
+1003      FORMAT(A)
+          FILNAM = NAMARC
+          CALL OPENRS(IDEV1,FILNAM,IOK,LRECL,1)
+          IF(IOK.NE.0) GO TO 100
+          CALL READW(IDEV3,I,GEL,MAXGEL)
+          IF(LNGTHG(I).LT.0)THEN
+            CALL SQREV(GEL,L)
+            CALL SQCOM(GEL,L)
+          END IF
+          CALL FMTDKN(IDEV1,GEL,L)
+          CLOSE(UNIT=IDEV1)
+      END IF
+10    CONTINUE
+      RETURN
+15    CONTINUE
+C   NUMBER OF LINES TO PROCESS
+      N=IDBSIZ-NCONTS
+      DO 20 I=N,IDBSIZ-1
+        JL=LNBR(I)
+        JR=RNBR(I)
+        CALL READN(IDEV4,JL,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        FILNAM = NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JL,GEL,MAXGEL)
+        L=ABS(LNGTHG(JL))
+        IF(LNGTHG(JL).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+        IF(JR.EQ.JL)GO TO 20
+        CALL READN(IDEV4,JR,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JR,GEL,MAXGEL)
+        L=ABS(LNGTHG(JR))
+        IF(LNGTHG(JR).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+20    CONTINUE
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error opening file for extracted gel reading'
+      RETURN
+      END
+      SUBROUTINE GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELID
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*)
+      EXTERNAL GELID
+      IERR = 1
+      NCONTC = GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' No contig line for this gel! Fix the database')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      SUBROUTINE GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      END
+      SUBROUTINE GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LEFTMN,RIGHTM,LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,RIGHTM
+      CHARACTER FILEH*(*)
+40    CONTINUE
+      MN = LEFTMN
+      MX = RIGHTM
+      LREG = MN
+      CALL GETINT(MN,MX,LREG,
+     +'Start position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LREG = IVAL
+      MN = LREG
+      MX = RIGHTM
+      RREG = MX
+      CALL GETINT(MN,MX,RREG,
+     +'End position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      RREG = IVAL
+C   NOW FIND FIRST GEL THAT OVER LAPS
+50    CONTINUE
+      X=RELPG(LLINO)+(ABS(LNGTHG(LLINO)))-1
+      IF(X.GE.LREG)GO TO 60
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 50
+60    CONTINUE
+      RETURN
+      END
+      SUBROUTINE GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+      CHARACTER T
+        IF(T.EQ.'0') THEN
+          YF = Y0
+          YT = Y0
+        ELSE IF(T.EQ.'1') THEN
+          YF = Y0
+          YT = YM1
+        ELSE IF(T.EQ.'2') THEN
+          YF = Y0
+          YT = YP1
+        ELSE IF(T.EQ.'3') THEN
+          YF = YP1
+          YT = YM1
+        ELSE IF(T.EQ.'4') THEN
+          YF = YP2
+          YT = YM2
+        END IF
+      END
+      CHARACTER*1 FUNCTION GTCONC(COUNTS,IDM,CUT)
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU
+C 30-3-92 made this routine sum counts
+      GTCONC = '-'
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5    CONTINUE
+      IF(ISUM.EQ.0) RETURN
+      Y = ISUM
+      DO 10 I = 1,IDM - 1
+        X = REAL(COUNTS(I))/Y
+        IF(X.GE.CUT) THEN
+          GTCONC = CHARSU(I)
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE HIGHLT(GELSAV,NAMSAV,NUMSAV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV1,IDEV2,
+     +FILNAM,IOK)
+      CHARACTER LINEIN*132,CONSEN*100
+      CHARACTER GELNO*6,GEL*100,GELSAV*100,GELNAM*16
+      CHARACTER NAMSAV*16,NUMSAV*6
+      CHARACTER FILNAM*(*),HELPF*(*)
+      DIMENSION GELSAV(50),NAMSAV(50),NUMSAV(50)
+      CHARACTER PLUS*4,MINUS*4
+      EQUIVALENCE (LINEIN(2:2),GELNO),(LINEIN(9:9),GELNAM)
+      EQUIVALENCE (LINEIN(26:26),GEL)
+      EXTERNAL NOTIRL
+      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File containing contig display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for problem display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+C   FORMAT:
+C
+C12345678901234567890       10        20         30    ETC
+C   12  GELNAM0000  CAGACGCGCGCGCGCGCGGATATAGTCTCTCCGCTCT
+C  100  GELNAM0000       TGATACGCTCGCTCTCTCTCTCTCTCTCTTTC
+C                   AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C                           70        80       ETC
+C   12  GELNAM0000     AAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C
+      LIN = 1
+      CALL GTSTR('plus strand symbol',':',PLUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) PLUS = ':'
+      LIN = 1
+      CALL GTSTR('minus strand symbol','.',MINUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) MINUS = '.'
+C   COUNT LINE NUMBERS
+      LINNO=0
+10    CONTINUE
+C   READ LINE OF NOS
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+1003  FORMAT(A)
+C   WRITE IT OUT AGAIN
+      WRITE(IDEV2,1003)LINEIN
+C   ZERO GEL COUNT FOR THIS STRIP
+      IGEL=0
+20    CONTINUE
+C
+C   READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+C       LINEIN=' '
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+C   WHAT SORT OF LINE?  ONLY A GEL WILL HAVE NON BLANK CHARS AT THE LEFT END
+      IF(LINEIN(2:7).NE.' ')THEN
+C   GEL LINE SO SAVE
+      IGEL=IGEL+1
+      GELSAV(IGEL)=GEL
+      NAMSAV(IGEL)=GELNAM
+      NUMSAV(IGEL)=GELNO
+      GO TO 20
+      END IF
+C   MUST BE CONSENSUS
+      CONSEN=GEL
+C   PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+      DO 50 I=1,IGEL
+C       WHERE DOES DATA START AND END?
+        IFIRST=1
+40      CONTINUE
+        IF(GELSAV(I)(IFIRST:IFIRST).NE.' ')GO TO 45
+        IFIRST=IFIRST+1
+        IF(IFIRST.LE.100)GO TO 40
+C       ERROR --- NO DATA FOUND
+        WRITE(KBOUT,1004)LINNO
+1004    FORMAT(' Error on line',I6,' of file')
+        RETURN
+45      CONTINUE
+C       NOW WHERE DOES IT END
+        ILAST=NOTIRL(GELSAV(I),100,' ')
+C       COMPARE WITH CONSENSUS
+        READ(NUMSAV(I),1001,ERR=900)INTEG
+ 1001   FORMAT(I6)
+        IF(INTEG.GE.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,PLUS)
+        IF(INTEG.LT.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,MINUS)
+        WRITE(IDEV2,1008)NUMSAV(I),NAMSAV(I),GELSAV(I)(1:ILAST)
+1008    FORMAT(' ',A,1X,A,1X,A)
+50    CONTINUE
+      WRITE(IDEV2,1009)CONSEN
+1009  FORMAT(' ',24X,A)
+1006  FORMAT( )
+C     READ A BLANK LINE
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+      WRITE(IDEV2,1003)LINEIN
+C     NO GO BACK FOR THE NEXT LINE OF NUMBERS
+      GO TO 10
+100   CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished')
+      RETURN
+900   WRITE(KBOUT,*)'Error reading gel number'
+      END
+      SUBROUTINE IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +NCONTS,IX,IY,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +DBTDUX,DBTDUY,NCONT,IGEL,IS)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER CHNRP1
+      EXTERNAL CWORLD,CHNRP1
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      XX = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+      YX = CWORLD(IY,MARGB,MARGT,YMIN,YMAX)
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        IF((XX.GT.XF).AND.(XX.LT.XT)) THEN
+          IS = NINT(((XX-XF)/(XT-XF)) * RELPG(I))
+          JGEL = LNBR(I)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,JGEL,IS)
+          NCONT = I
+          RETURN
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      IGEL = 0
+      NCONT = 0
+      END
+      SUBROUTINE IDTOD(TOPLIN,GEL,IFIRST,ILAST,SYMBOL)
+      CHARACTER TOPLIN*100,GEL*100,SYMBOL*4
+      DO 10 I=IFIRST,ILAST
+        IF(GEL(I:I).EQ.TOPLIN(I:I))GEL(I:I)=SYMBOL(1:1)
+10    CONTINUE
+      END
+      INTEGER FUNCTION INDEXS(C,S)
+      PARAMETER (IDM = 29)
+      CHARACTER C
+      INTEGER POINTS(0:255),SCORES(IDM),IND(IDM),S
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      SAVE SCORES,IND
+      DATA 
+     +IND/1,2,3,4,1,2,3,4,1,2,3,4,1,2,3,4,6,6,6,6,6,6,1,2,3,4,5,5,6/
+C      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  changed 28-7-91 to give 10 to old zeroes and 100 to lowercase
+      DATA SCORES/
+     +100,100,100,100,
+     +75,75,75,75,
+     +100,100,100,100,
+     +100,100,100,100,
+     +10,10,10,10,10,10,
+     +100,100,100,100,100,100,10/
+      I = ICHAR(C)
+      I = POINTS(I)
+      S = SCORES(I)
+      INDEXS = IND(I)
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP FOR SHOTGUN SEQUENCING
+      SUBROUTINE INITS
+C  AUTHOR RODGER STADEN
+      INTEGER POINTS(0:255)
+      PARAMETER (IDM = 29)
+      CHARACTER DUP*29
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-4 FOR ACGT
+C                 acgt
+C                 1234
+C                 BDHV
+C                 KLMN
+C      5 FOR      *
+C      6 FOR      5678- AND ELSE
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 6
+C
+        DO 30 I = 0,255
+          POINTS(I) = IDM
+30      CONTINUE
+        DO 35 I = 1,IDM
+          J = ICHAR(DUP(I:I))
+          POINTS(J) = I
+35      CONTINUE
+      END
+      FUNCTION INLIST(LIST,IDLIST,ITEM)
+C   AUTHOR: RODGER STADEN
+C   SENT LIST LIST, AND ITEM ITEM. IF IN LIST RETURNS ELEMENT NUMBER, ELSE 0
+      INTEGER LIST(IDLIST)
+      INLIST=0
+      DO 1 I=1,IDLIST
+        IF(LIST(I).NE.ITEM)GO TO 1
+        INLIST=I
+      RETURN
+1     CONTINUE
+      RETURN
+      END
+      SUBROUTINE IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH,X,Y,KBOUT,
+     +IGEL,IOK)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      IOK = 1
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      YINCO2 = STRAND*YINC/2.
+      I = LGEL
+      IGEL = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            IF((X.GE.XF).AND.(X.LE.XT)) THEN
+              IGEL = I
+              IF((Y.GE.YF-YINCO2).AND.(Y.LE.YF+YINCO2)) THEN
+                IOK = 0
+                RETURN
+              END IF
+            END IF
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+C     LINEUP
+C
+C   TAKES 2 SEQS SET OF MATCHES AND PRODUCES LINED UP SEQS
+C   FINDS IF WE HAVE A LEFT OVERLAP
+C   RETURNS POSITION OF JOINT. THIS IS RELATIVE TO THE CONTIG
+C   FOR MOST MATCHES BUT I RELATIVE TO THE GEL FOR A LEFT OVERLAP
+      SUBROUTINE LINEUP(SEQG,SEQC,SEQG2,SEQC2,IDC,IDG,IDOUT,
+     1MATG,MATC,MATL,IP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQG(IDG),SEQC(IDC),SEQG2(IDOUT),SEQC2(IDOUT),PAD
+      INTEGER MATG(IP),MATC(IP),MATL(IP)
+      SAVE PAD
+      DATA PAD/','/
+      IFAIL=0
+C   ZERO PADDING CHARS IN CONTIG (GEL DONE AT END BY DIFFERENCE
+C   IN INPUT AND OUTPUT LENGTHS)
+      ITOTPC=0
+C   FILL OUTPUT WITH PADDING
+      DO 10 I=1,IDOUT
+        SEQG2(I)=PAD
+        SEQC2(I)=PAD
+10    CONTINUE
+      NMTCH=0
+C   SET INITIAL POINTERS TO OUTPUT
+C   CONSENSUS
+      IS1=1
+C   GEL
+      IS2=1
+C   FIND DISTANCE FROM LEFT MATCH IN GEL TO LEFT OF GEL
+      IG2=MATG(1)-1
+      IF(IG2.EQ.0)THEN
+C       THE LEFT END OF THE GEL MATCHES SO THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET JOINT
+        JOINT=MATC(1)
+C       SKIP NEXT SECTION
+        GO TO 50
+      END IF
+C   FIND DISTANCE FROM LEFT MATCH IN CONTIG TO LEFT OF CONTIG
+      IC2=MATC(1)-1
+C   GET DISTANCE FROM FIRST MATCH IN CONTIG TO FIRST MATCH IN GEL.
+C   IF THIS DISTANCE <0 THEN WE HAVE A LEFT OVERLAP
+      IC1=IC2-IG2+1
+      IF(IC1.GT.0)THEN
+C       THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET LEFT END
+        JOINT=IC1
+C       COPY THE GEL UPTO THE FIRST MATCH, INTO THE OUTPUT ARRAY
+C       CHECK FOR OVERFLOW
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+C       COPY THE CONTIG FOR THE SAME REGION
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(IC1),SEQC2(1),IG2)
+        IS1=IS1+IG2
+        IS2=IS2+IG2
+        GO TO 50
+      END IF
+C   MUST BE LEFT END OVERLAP
+C   SET TYPE
+      ITYPE=1
+C   SET POSITION OF JOINT RELATIVE TO GEL
+      JOINT=ABS(IC1)+2
+C   COPY OVER THE GEL UPTO THE JOINT
+C   CHECK FOR OVERFLOW
+      IF(IG2.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+      IS2=IS2+IG2
+C   WE MAY ALSO HAVE MISMATCHING
+C   DATA AT THE JOIN SO DEAL WITH THAT NOW
+C   IF IC2 >0 THE LEFT END OF THE CONTIG MATCHES THE GEL BUT OTHERWISE
+C   WE HAVE SOME MISMATCHED DATA TO DEAL WITH - WE NEED TO TRANSFER
+C   THE MISMATCHED REGION OF THE CONTIG TO THE OUTPUT ARRAY
+      IF(IC2.GT.0)THEN
+        IF(IC2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(1),SEQC2(1),IC2)
+        IS1=IS1+IC2
+      END IF
+C   WHEN WE GET HERE WE HAVE SORTED OUT THE LEFT ENDS FOR LEFT OVERLAP
+C   AND MISMATCHED LEFT ENDS, WE NOW DEAL WITH THE REST OF THE SEQUENCE
+C   STARTING WITH THE FIRST BLOCK OF IDENTITY
+C
+C IG1 POSITION IN INPUT GEL
+C IS2 POSITION IN OUTPUT GEL
+C IC1 POSITION IN INPUT CONTIG
+C IS1 POSITION IN OUTPUT CONTIG
+C LG1 POSITION OF END OF CURRENT MATCH IN OUTPUT GEL
+C LC1 POSITION OF END OF CURRENT MATCH IN OUTPUT CONTIG
+C LG2 DISTANCE FROM CURRENT MATCH IN INPUT GEL TO NEXT MATCH
+C LC2 DISTANCE FROM CURRENT MATCH IN INPUT CONTIG TO NEXT MATCH
+C
+50    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   COPY NEXT MATCH
+      IG1=MATG(NMTCH)
+      IC1=MATC(NMTCH)
+      L=MATL(NMTCH)
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(IG1),SEQG2(IS2),L)
+C   CHECK FOR OVERFLOW
+      IF(IS1+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQC(IC1),SEQC2(IS1),L)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+L
+      IS2=IS2+L
+C   END OF CURRENT MATCH
+      LG1=IG1+L
+      LC1=IC1+L
+C   ANY MORE MATCHES
+      IF(NMTCH.EQ.IP)GO TO 500
+      K=NMTCH+1
+      LG2=MATG(K)-LG1
+      LC2=MATC(K)-LC1
+C   ANY DIFFERENCE IN LENGTH? IF SO WE HAVE TO PAD SO THEY BECOME THE SAME
+      L5=ABS(LG2-LC2)
+C   COUNT PADDING CHARS IN CONTIG
+      IF(LG2.GT.LC2)ITOTPC=ITOTPC+L5
+C   IF DIFFERENCE INCREMENT SHORTER
+      IF(LG2.GT.LC2)IS1=IS1+L5
+C   IF GEL NEEDS PADDING TRY TO PUT PADS NEXT TO DOUBLE CODES
+      IF(LC2.GT.LG2)CALL PADCOP(SEQG,SEQG2,
+     +LG1,MATG(K),L5,IS2,LG2,MAXGEL,IFAIL,KBOUT,SEQC,LC1)
+C   CHECK FOR OVERFLOW
+      IF(IFAIL.EQ.1)GO TO 700
+C   NOW COPY MISSMATCHED REGION
+C   CHECK FOR OVERFLOW
+      IF(IS2+LG2-1.GT.MAXGEL)GO TO 700
+      IF(LG2.GT.0)CALL SQCOPY(SEQG(LG1),SEQG2(IS2),LG2)
+C   CHECK FOR OVERFLOW
+      IF(IS1+LC2-1.GT.MAXGEL)GO TO 700
+      IF(LC2.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),LC2)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+LC2
+      IS2=IS2+LG2
+C   GET NEXT MATCH
+      GO TO 50
+500   CONTINUE
+C
+C   FINISH RIGHT ENDS
+C   ONLY COPY TO END OF GEL IN GEL AND TO THE SAME RELATIVE POSITION
+C   IN THE CONTIG FOR DISPLAY PURPOSES AND FOR COUNTING MISMATCH
+C   CURRENT ENDS AT LG1,LC1
+C   HOW FAR TO END OF GEL?
+C   SET M
+      M=0
+      L=IDG-LG1+1
+      IF(L.LT.1)GO TO 600
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(LG1),SEQG2(IS2),L)
+C   NEED TO COPY TO END OF GEL IN CONTIG FOR DISPLAY
+C   POINT TO POSN IN CONTIG LEVEL WITH END OF GEL
+      M=LC1+L-1
+C   IS THIS OVER END OF CONTIG?
+      IF(M.GT.IDC)M=IDC
+C   NUMBER TO COPY
+      M=M-LC1+1
+C   CHECK FOR OVERFLOW
+      IF(IS1+M-1.GT.MAXGEL)GO TO 700
+      IF(M.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),M)
+600   CONTINUE
+C   COUNT PADDING IN GEL
+      ITOTPG=IS2+L-1-IDG
+C   SET NEW LENGTHS FOR RETURN TO CALLING ROUTINE
+      IDOUT=IS1+M-1
+      IDG=IS2+L-1
+      IFAIL=0
+      RETURN
+700   CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Matching region too long for routine lineup,',
+     +' alignment aborted')
+      IFAIL=1
+      RETURN
+      END
+      SUBROUTINE LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,
+     +RREG,IDEV,IDEVN,NAMARC)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER RREG
+      CHARACTER NAMARC*(*)
+      N = LLINO
+      WRITE(IDEV,1001)
+10    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).NE.0) THEN
+        N = RNBR(N)
+        IF(RELPG(N).LE.RREG) GO TO 10
+      END IF
+1001  FORMAT(
+     +' NAME              NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                             LEFT   RIGHT')
+1006    FORMAT( ' ',A,2X,I6,2X,I7,2X,I5,2X,I6,2X,I6)
+C1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+      END
+C12345678901234567890
+C                          710       720       730       740       750
+C   -1  HINW.004    CGTCAGACGCACGCTGGAAAA
+      INTEGER FUNCTION LTYPE(LINE,LL,J1,J2,N,MAXDB,KBOUT)
+      PARAMETER (MAXDG = 5)
+      CHARACTER LINE*(*),NUM*(MAXDG),SPACE
+      EXTERNAL NOTRL,NOTLR
+      PARAMETER (SPACE= ' ')
+      J1 = NOTLR(LINE,LL,SPACE)
+      IF(J1.EQ.0) THEN
+C       BLANK LINE
+        LTYPE = 1
+        RETURN
+      END IF
+      IF(J1.GT.20) THEN
+C       LINE OF NUMBERS
+        LTYPE = 2
+        RETURN
+      END IF
+      IF(J1.GT.MAXDG+2) THEN
+C       CONSENSUS LINE
+        LTYPE = 3
+        RETURN
+      END IF
+C       SHOULD BE A SEQUENCE LINE
+      J = INDEX(LINE(J1:),SPACE)
+      NUM = SPACE
+      NUM = LINE(J1:J1+J-2)
+      CALL RJST(NUM)
+ 1001 FORMAT(I6)
+      READ(NUM,1001,ERR=10) N
+      IF(N.GT.MAXDB-2) GO TO 10
+C     NUMBER ENDS AT J1+J-2
+      J1 = J1 + J - 1
+C     LOOK FOR BEGINNING OF NAME
+      J = NOTLR(LINE(J1:),LL-J1+1,SPACE)
+      N1 = J1 + J - 1
+C     LOOK FOR END OF NAME
+      J = INDEX(LINE(N1:),SPACE)
+      N2 = N1 + J - 2
+C     LOOK FOR BEGINNING OF SEQ
+      J = NOTLR(LINE(N2+1:),LL-N2,SPACE)
+      J1 = N2 + J
+      LTYPE = 4
+C     LOOK FOR END OF SEQ
+      J2 = NOTRL(LINE,LL,SPACE)
+      IF(J2.GT.N2) RETURN
+10    CONTINUE
+      LTYPE = 0
+      END
+      INTEGER FUNCTION LWRAPS(I,J)
+      K = MOD(I,J)
+      IF(K.EQ.0) K = J
+      LWRAPS = K
+      END
+C     MERGE
+C
+C   ROUTINE SENT CONTIG WHOSE GELS MAY BE OUT OF ORDER
+C   REORDERS GELS ON POSITION OF LEFT ENDS AND SETS LEFT
+C   GEL NUMBER FOR THE REORDERED CONTIG
+C
+      SUBROUTINE MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C   START AT LEFT END
+      N=LNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT IN LIST
+      N=NR
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT
+      NR=RNBR(N)
+      IF(NR.EQ.0)GO TO 30
+C   HAVENT REACHED END YET
+      I1=0
+23    CONTINUE
+C   ARE THESE 2 IN CORRECT ORDER IE N<=NR ?
+      IF(RELPG(N).LE.RELPG(NR))GO TO 21
+C   NOT IN ORDER SO CHAIN LEFT UNTIL CORRECTLY POSITIONED
+C   THEN COME BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE SAVE POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C   EXCHANGE NEIGHBOURS
+      M=RNBR(NR)
+      IF(M.NE.0)LNBR(M)=N
+      M=LNBR(N)
+      IF(M.NE.0)RNBR(M)=NR
+      RNBR(N)=RNBR(NR)
+      RNBR(NR)=N
+      LNBR(NR)=LNBR(N)
+      LNBR(N)=NR
+C   CHAIN BACK THRU LIST
+      N=LNBR(NR)
+      IF(N.EQ.0)GO TO 21
+C   END NOT REACHED
+      GO TO 23
+30    CONTINUE
+C  ALL DONE POINTER AT RIGHT GEL
+      RNBR(LINCON)=N
+      RETURN
+      END
+      SUBROUTINE MINCOM(SEQ1,IDIM1,SEQ2,IDIM2,SAV1,SAV2,SAV3,
+     +IP,MINM,KBOUT)
+C   AUTHOR: RODGER STADEN
+C
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER SAV1(IP),SAV2(IP),SAV3(IP)
+C
+      IP1=IP
+      IP=0
+C
+C   SITUATION 1
+      NT1=IDIM2-MINM
+      IES1=MINM-1
+      ISS2=NT1+1
+C
+      DO 100 I=1,NT1
+C
+C   POINT TO FIRST CHAR-1 OF SEQ2
+      ISS2=ISS2-1
+C   POINT TO LAST CHAR SEQ1
+      IES1=IES1+1
+C
+      N=0
+C
+      DO 200 J=1,IES1
+C   STORE POINTER
+      JJ=J
+C
+C   POINT TO SEQ2
+      K=ISS2+J
+C   TEST FOR EQUALITY
+      IF(SEQ1(J).NE.SEQ2(K))GO TO 220
+C   INCREMENT N
+      N=N+1
+      GO TO 200
+220   CONTINUE
+C   TEST FOR SUFFICENTLY LARGE N
+      IF(N.GE.MINM)CALL SAVIT(N,J,K,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C   RESET N TO ZERO
+      N=0
+200   CONTINUE
+C
+C   GOOD SCORE AT END?
+C   NEED TO INCREMENT POINTERS AS SAVIT EXPECTS TO BE POINTING AT NEXT
+C   MISMATCH
+      JJ=JJ+1
+      KK=K+1
+      IF(N.GE.MINM)CALL SAVIT(N,JJ,KK,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C
+100   CONTINUE
+C
+C
+C   SITUATION 2
+      NT2=IDIM1-IDIM2+1
+C
+      DO 300 I=1,NT2
+      N=0
+C
+      DO 400 J=1,IDIM2
+C   SAVE POINTER
+      JJ=J
+C
+C   SET POINTER TO SEQ1
+      L=I+J-1
+      IF(SEQ1(L).NE.SEQ2(J))GO TO 420
+      N=N+1
+      GO TO 400
+420   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+400   CONTINUE
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+300   CONTINUE
+C
+C
+C   SITUATION 3
+      ISS1=IDIM1-IDIM2
+C
+      DO 500 I=1,NT1
+C
+C   POINT TO FIRST CHAR SEQ1
+      K=ISS1+I
+      IES2=IDIM2-I
+      N=0
+C
+      DO 600 J=1,IES2
+C   SAVE POINTER
+      JJ=J
+C
+C   POINT TO SEQ1
+      L=K+J
+      IF(SEQ1(L).NE.SEQ2(J))GO TO  620
+      N=N+1
+      GO TO 600
+620   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C    TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+600   CONTINUE
+C
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+500   CONTINUE
+C
+      RETURN
+5000  CONTINUE
+C   OVERFLOW
+C
+      WRITE(KBOUT,1000)IP1
+1000  FORMAT(/' TOO MANY MATCHES. LIMIT = ',I6)
+      RETURN
+      END
+      SUBROUTINE ML(PC,PG,L,N,J)
+      INTEGER PC(N),PG(N),L(N)
+      DO 10 I = J,N-1
+        PC(I) = PC(I+1)
+        PG(I) = PG(I+1)
+        L(I) = L(I+1)
+10    CONTINUE
+      END
+      SUBROUTINE MSTLKL(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU,INDEXS
+      DO 100 I=1,IDIM
+        J = INDEXS(SEQ(I),K)
+        SEQ(I) = CHARSU(J)
+100   CONTINUE
+      END
+      CHARACTER FUNCTION MUNOTP(IP)
+C     AUTHOR RODGER STADEN
+      CHARACTER PUP*26
+      SAVE PUP
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      MUNOTP = '-'
+      IF((IP.GT.0).AND.(IP.LT.23))MUNOTP = PUP(IP:IP)
+      END
+      INTEGER FUNCTION NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +STRAND,RREG)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+      NCDEP = 0
+      N = 0
+      I = IGEL
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) N = N + 1
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      NCDEP = N
+      END
+      SUBROUTINE PADCOP(SEQG,SEQG2,LG1,MG,L5,IS2,LG2,MAXGEL,IFAIL,
+     +KBOUT,SEQC,IC1)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (NDUBL = 4)
+      CHARACTER SEQG(MAXGEL),SEQG2(MAXGEL),DUBBL(NDUBL),SEQC(MAXGEL)
+      SAVE DUBBL
+      DATA DUBBL/'D','B','V','H'/
+      JC1 = IC1
+C Make seqg2 from seqg placing L5 padding chars before position MG
+C which is the start of the next block of identity. Try to put the
+C padding either in line with consensus pads, or next to double
+C codes. The positions in seqg are LG1 to MG-1. seqg2 needs to be long
+C enough to be extended from IS2 to IS2 + L5 -1 + MGM1-LG1 +1
+C ie we add L5 pads, plus the chars between and including  LG1 and MGM1
+      IDONE=0
+C   POINT TO END OF MISMATCH
+      MGM1=MG-1
+C   MAY BE NO CHARS TO COPY
+      IF(MGM1.LT.LG1)GO TO 111
+C  Next check added 26-2-91
+      MAXREQ = IS2 + L5 - 1 + MGM1 - LG1 + 1
+      IF((MGM1.GT.MAXGEL).OR.(MAXREQ.GT.MAXGEL)) THEN
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Matching region too large for routine padcop,',
+     +  ' alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 110 J=LG1,MGM1
+        IF(IDONE.LT.L5) THEN
+          IF((JC1.GT.0).AND.(JC1.LT.MAXGEL)) THEN
+          IF(SEQC(JC1).EQ.'*') THEN
+            IS2 = IS2 + 1
+            JC1 = JC1 + 1
+            IDONE = IDONE + 1
+            GO TO 109
+          END IF
+          END IF
+          DO 108 M=1,NDUBL
+            IF(SEQG(J).EQ.DUBBL(M)) THEN
+              IS2 = IS2 + 1
+              JC1 = JC1 + 1
+              IDONE = IDONE + 1
+              GO TO 109
+            END IF
+108       CONTINUE
+109       CONTINUE
+        END IF
+        SEQG2(IS2) = SEQG(J)
+        IS2 = IS2 + 1
+        JC1 = JC1 + 1
+110   CONTINUE
+111   CONTINUE
+C   ALL CHARS COPIED. ENOUGH PADDING?
+      IF(IDONE.LT.L5)IS2=IS2+L5-IDONE
+C   IS2 SHOULD NOW BE POINTING AT NEXT CHAR
+C   ZERO LG2 TO SHOW CALLING ROUTINE COPYING DONE
+      LG2=0
+      IFAIL=0
+      END
+      SUBROUTINE PCON1(CHAR,CHRSUM)
+C  AUTHOR RODGER STADEN
+C  PART OF PROTEIN 'CONSENSUS' CALCULATION
+      CHARACTER CHAR
+      INTEGER CHRSUM
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      K = CTONUM(CHAR)
+      IF(K.NE.26)THEN
+        IF(CHRSUM.EQ.0)THEN
+          CHRSUM = K
+        ELSE
+          IF(K.NE.CHRSUM)CHRSUM = -1
+        END IF
+      END IF
+      END
+      SUBROUTINE PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +NCONTS,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      YMIN = 0.
+      YMAX = ISYMAX
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        CALL LINE(XF,XT,Y,Y,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        IF((IGEL.NE.0).AND.(I.EQ.LINCON)) THEN
+          XZ = XF + RELPG(IGEL) + ABS(LNGTHG(IGEL))/2
+          CALL LINE(XZ,XZ,YMAX,YMIN,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      I = LGEL
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            CALL LINE(XF,XT,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      INTEGER DEPTHP,DEPTHM
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+C  have window size margt starting at margb
+C  depths depthp, depthm
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = LREG
+      XMAX = RREG
+      RINC = YMAX / (DEPTHP + DEPTHM + 2)
+      RMID =(DEPTHM+1) * RINC
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX,RMID,RMID,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMAX,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      STRAND = 1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHP)
+      STRAND = -1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHM)
+      CALL VT100M
+      END
+      SUBROUTINE PLTQ(SEQ,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CHARACTER SEQ(IDIM2),T
+      PARAMETER (Y0  = 0.,
+     +           YP1 = 1.,
+     +           YP2 = 2.,
+     +           YM1 = -1.,
+     +           YM2 = -2.)
+      XMIN = 0.
+      XMAX = IDIM2
+      YMIN = YM2
+      YMAX = YP2
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XIN,XMAX,Y0,Y0,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      I = 1
+10    CONTINUE
+      XF = I
+      T = SEQ(I)
+20    CONTINUE
+      IF(SEQ(I).NE.T) THEN
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I - 1
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+      I = I + 1
+      IF(I.LT.IDIM2) GO TO 20
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      END
+      SUBROUTINE READN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      READ(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE READR(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      READ(IDEVR,REC=N+1)RELPG,LNGTHG,LNBR,RNBR
+      RELPG = SWAPBO(RELPG)
+      LNGTHG = SWAPBO(LNGTHG)
+      LNBR = SWAPBO(LNBR)
+      RNBR = SWAPBO(RNBR)
+      RETURN
+      END
+      SUBROUTINE READW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      READ(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE REMOVL(MATC,MATG,MATL,IP)
+C   AUTHOR: RODGER STADEN
+      INTEGER MATC(IP),MATG(IP),MATL(IP)
+C
+C   SET POINTER TO FIRST MATCH
+      NMTCH=0
+10    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   SORT MATCHES ON LENGTH
+      IPP=IP-NMTCH+1
+      CALL BUBBL3(MATL(NMTCH),MATG(NMTCH),MATC(NMTCH),IPP)
+C   LOOK FOR END OF POSITIVES
+      DO 20 I=NMTCH,IP
+      J=I
+20    IF(MATL(I).LT.1)GO TO 30
+      J=J+1
+30    CONTINUE
+      IP=J-1
+C   END OF POSITIVES AT IP
+      IF(NMTCH.GE.IP)RETURN
+      K1=MATC(NMTCH)
+      K2=K1+MATL(NMTCH)-1
+      K3=MATG(NMTCH)
+      K4=K3+MATL(NMTCH)-1
+C   POINT TO FIRST MATCH TO TEST
+      K6=NMTCH+1
+      DO 200 I=K6,IP
+C   DO CONSENSUS FIRST
+C   OVERLAP?
+      IF(MATC(I).GT.K2)GO TO 100
+      K5=MATC(I)+MATL(I)-1
+      IF(K5.LT.K1)GO TO 100
+C   DOES OVERLAP
+C   WHICH END
+      IF(K5.LE.K2)GO TO 80
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K2-MATC(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C  MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 100
+80    CONTINUE
+C   LENGTH
+      MATL(I)=K1-MATC(I)
+100   CONTINUE
+C   NOW LOOK FOR OVERLAPS WITH GEL
+C   OVERLAP?
+      IF(MATG(I).GT.K4)GO TO 200
+      K5=MATG(I)+MATL(I)-1
+      IF(K5.LT.K3)GO TO 200
+C   DOES OVERLAP
+C   WHICH END?
+      IF(K5.LE.K4)GO TO 180
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K4-MATG(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C   MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 200
+180   CONTINUE
+C   LENGTH
+      MATL(I)=K3-MATG(I)
+200   CONTINUE
+      GO TO 10
+      END
+C     SAVIT
+C
+      SUBROUTINE SAVIT(N,J,K,IP,S1,S2,S3,IP1)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1(IP1),S2(IP1),S3(IP1)
+C
+      IP=IP+1
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)RETURN
+      S1(IP)=N
+      S2(IP)=J-N
+      S3(IP)=K-N
+C
+      RETURN
+      END
+      SUBROUTINE SCRENR(GEL,MAXGEL,STRING,NAME,FILNAM,
+     +IDEV1,IDEV2,IDEV3,IDEV4,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER NAME*(*),FILNAM*(*),HELPF*(*)
+      CHARACTER GEL(MAXGEL),STRING(60)
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of sequences that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of recognition sequences',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+      JGEL = 0
+      IGEL = 0
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+31      CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 31
+        END IF
+      ELSE
+        IOK = GNFFOF(IDEV1,NAME)
+        IF(IOK.EQ.1) GO TO 100
+        IF(IOK.NE.0) GO TO 1
+      END IF
+1002  FORMAT(A)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing', JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV4,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(KBOUT,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV4)
+2     CONTINUE
+      IF(IDIMG.LT.1)THEN
+        WRITE(KBOUT,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL MSTLKL(GEL,IDIMG)
+3     CONTINUE
+      READ(IDEV3,1005,END=6)STRING
+1005  FORMAT(60A1)
+C   FIND LENGTH OF STRING ASSUMING NO SPACES
+      DO 4 I=1,60
+        II=I
+        IF(STRING(I).EQ.' ')GO TO 5
+4     CONTINUE
+5     CONTINUE
+      II=II-1
+      IF(II.GT.0)CALL FIND(GEL,IDIMG,STRING,II,JMATCH)
+      IF(JMATCH.EQ.0)GO TO 3
+C   A MATCH
+      WRITE(IDEV,1007)JMATCH,(STRING(K),K=1,II)
+1007  FORMAT(' Match at',I6,' with ',60A1)
+      REWIND IDEV3
+      GO TO 1
+C   NO MATCH SO SAVE
+6     CONTINUE
+      WRITE(IDEV2,1002)NAME
+      IGEL = IGEL + 1
+      REWIND IDEV3
+      GO TO 1
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGEL,' passed'
+      RETURN
+      END
+      SUBROUTINE SCRENV(MAXGEL,
+     +WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ,MAXSEQ,GEL,GELCOP,MATCH,
+     +LENGTH,
+     +SAVPS,SAVPG,SAVL,MAXMAT,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM,NAME,IOK)
+      INTEGER POSNS(MAXSEQ),GELN(MAXGEL),WORDP(LPOWRC),SAVPS(MAXMAT)
+      INTEGER SAVPG(MAXMAT),SAVL(MAXMAT)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER FILNAM*(*),NAME*(*),HELPF*(*)
+      CHARACTER GELCOP(MAXGEL)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER CONST(LENGTH),CSTART
+      CHARACTER SEQ(MAXSEQ),GEL(MAXGEL),MATCH(MAXGEL)
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      JGEL = 0
+      IGELS = 0
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of gel readings that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV4,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of vector sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      IDIM = MAXSEQ
+      CALL ARRFIM(IDEV4,SEQ,IDIM,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      MN = LENGTH*2
+      MX = 50
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   IS THE VECTOR SEQUENCE IN THE CORRECT FORMAT WITH A TITLE AT THE FRONT?
+      IF(IDCEND.EQ.0)THEN
+        CENDS(1) = -19
+        NENDS(1) = 1
+        CENDS(2) = IDIM + 1
+        IDCEND = 1
+      END IF
+C      WRITE(KBOUT,9999)
+C9999  FORMAT(' VECTOR SEQUENCE REQUIRES A TITLE EG ',
+C     1' <---M13MP7.001----->')
+C      RETURN
+C      END IF
+      CALL BUSY(KBOUT)
+C
+C init hashing routines
+C
+      CALL INITE(CONST,CSTART,LENGTH)
+      CALL ENCOF(SEQ,IDIM,CONST,CSTART,LENGTH,POSNS)
+      CALL ENCONN(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH,1)
+C
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        IOK = GNFFOF(IDEV1,NAME)
+        IF(IOK.EQ.1) GO TO 100
+        IF(IOK.NE.0) GO TO 1
+      END IF
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV3,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Gel reading file not found'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV3,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV3)
+C  LONG ENOUGH ?
+      IF(IDIMG.LT.MINMAT)THEN
+        WRITE(IDEV,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+      IMATCH=0
+2     CONTINUE
+      CALL BUSY(KBOUT)
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      WRITE(IDEV,1009)ISTRAN
+1009  FORMAT(' Searching strand',I6)
+      IDSAV=MAXMAT
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ,GELCOP,MINMAT,IFAIL,KBOUT)
+      IF(IDSAV.GT.0) THEN
+        IMATCH=1
+        CALL DISMAT(SEQ,IDIM,GELCOP,IDIMG,SAVPS,SAVPG,IDSAV,
+     +  CENDS,NENDS,IDCEND,MAXCON,IDEV,MATCH)
+      END IF
+      IF(ISTRAN.EQ.1) THEN
+        CALL SQREV(GELCOP,IDIMG)
+        CALL SQCOM(GELCOP,IDIMG)
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        ISTRAN = 2
+        GO TO 2
+      END IF
+      IF(IMATCH.EQ.0) THEN
+        WRITE(IDEV2,1010)NAME
+        IGELS = IGELS + 1
+      END IF
+      GO TO 1
+1010  FORMAT(A)
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGELS,' passed'
+      RETURN
+      END
+      SUBROUTINE SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IGN,NCONT,DIST)
+C  AUTHOR: RODGER STADEN
+C  SHIFTS PART OF A CONTIG FORM GEL IGN TO RIGHT END
+C  CONTIG LINE NUMBER IF NCONT
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER DIST,CLEN
+      EXTERNAL CLEN
+      I = IGN
+10    CONTINUE
+      IF(I.NE.0)THEN
+        RELPG(I) = RELPG(I) + DIST
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        I = RNBR(I)
+        GO TO 10
+      END IF
+C  UPDATE CONTIG LENGTH
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IGN)
+      RELPG(NCONT) = L
+      CALL WRITER(IDEVR,NCONT,RELPG(NCONT),LNGTHG(NCONT),
+     +LNBR(NCONT),RNBR(NCONT))
+      END
+      SUBROUTINE SLIDER(SEQ1,IDC,SEQ2,IDIM2,MS1,MS2,MAXPG,MAXPC,MINSLI,
+     +MATL,MATC,MATG,IP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2)
+      INTEGER MATL(IP),MATC(IP),MATG(IP),P1S,P1,P2
+      IP1 = IP
+      IP = 0
+C   LEFT END S2 RELATIVE S1 - MAX PADS -2 READY FOR LOOP
+      P1S = MS1 - MS2 - MAXPC - 1
+C   TRY NSLIDE START POSNS FOR SEQ2
+      DO 100 I=1,MAXPG+MAXPC+1
+C       POINT TO SEQ1 START
+        P1S = P1S + 1
+C       POINT TO CURRENT SEQ1 POSN
+        P1 = P1S
+        N = 0
+C       COMPARE WHOLE LENGTH OF SEQ2 (IF P1 WITHIN RANGE)
+        DO 50 J=1,IDIM2
+          P2 = J
+          P1 = P1 + 1
+          IF(P1.LT.1)GO TO 50
+C         OFF RIGHT END? IF SO MAY HAVE BEEN A MATCH
+          IF(P1.GT.IDC)GO TO 40
+          IF(SEQ1(P1).EQ.SEQ2(P2))GO TO 45
+40        CONTINUE
+          IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+          N = 0
+          GO TO 50
+45        CONTINUE
+          N = N + 1
+50      CONTINUE
+C       GOOD SCORE AT END? NEED TO INCREMENT POINTERS FOR SAVIT
+        P1 = P1 + 1
+        P2 = P2 + 1
+        IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+100   CONTINUE
+      END
+      SUBROUTINE SUBS(SEQ,IDIMS,FROM,TO)
+      CHARACTER SEQ(IDIMS),FROM,TO
+C   AUTHOR RODGER STADEN
+      DO 10 I = 1,IDIMS
+        IF(SEQ(I).EQ.FROM) SEQ(I) = TO
+10    CONTINUE
+      END
+      SUBROUTINE SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C 28-7-91 added extra parameter: mxgood is the maximum reading
+C length for which we have confidence, so only the parts of
+C reads 1 to mxgood will be included in the calculation
+C   SET INITIAL VALUES
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        IREND = RELPG(GELC) - LNGTHG(GELC) + 1
+        IF (MXGOOD.LT.MAXGEL) THEN
+          LSEQNO = IREND - MXGOOD + 1
+        ELSE
+          LSEQNO = RELPG(GELC)
+        END IF
+        LSEQNO = MAX(LSEQNO,LREG)
+        IS = LSEQNO - RELPG(GELC) + 1
+        N  = ABS(LNGTHG(GELC))
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   COMPARE STRANDS
+C
+      DO 500 I=1,LINLEN
+C        WRITE(*,*)I,LINOU1(I),LINOU2(I)
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          IF(LINOU1(I).EQ.'-') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF(LINOU1(I).EQ.'*') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          SEQ1(POSN1) = '0'
+        ELSE
+          IF((LINOU1(I).EQ.'*').AND.(LINOU2(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU2(I).EQ.'*').AND.(LINOU1(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*')) THEN
+            SEQ1(POSN1) = '1'
+            IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) 
+     +      SEQ1(POSN1) = '4'
+              GO TO 500
+          END IF
+          IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) THEN
+            SEQ1(POSN1) = '2'
+            IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*'))
+     +      SEQ1(POSN1) = '4'
+            GO TO 500
+          END IF
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+C      SUMMER
+C
+C   SUBROUTINE TO PRODUCE A CONSENSUS FROM LINED UP GEL READINGS
+      SUBROUTINE SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1,IDIM1,GEL,LREG,RREG,IGELC,IDBSIZ,CHARS,CHRSIZ,MAXGL2,
+     +IDEVW,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,LSEQNO,POSN,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER GELC
+      CHARACTER SEQ1(IDIM1)
+      INTEGER CHARS(CHRSIZ,MAXGL2)
+      CHARACTER MUNOTP,GTCONC
+      EXTERNAL MUNOTP,INDEXS,GTCONC,LWRAPS
+C
+C   SET INITIAL VALUES
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      IPSEQ1=0
+C
+C   ZERO ARRAY
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,CHRSIZ
+      CHARS(J,I)=0
+40    CONTINUE
+50    CONTINUE
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C   LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=(LSEQNO-RELPG(GELC))+1
+****************************
+      IF(IDM.EQ.26)THEN
+        DO 51 I = IS,N
+          JJJ=(MOD(LSEQNO,MAXGL2))
+          IF(JJJ.EQ.0)JJJ=MAXGL2
+          CALL PCON1(GEL(I),CHARS(1,JJJ))
+          LSEQNO = LSEQNO + 1
+51      CONTINUE
+      ELSE
+****************************
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,JJJ) = CHARS(JJ,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      END IF
+C
+C   LOOK AT NEXT GEL TO RIGHT
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+C   RESET LSEQNO
+      LSEQNO=RELPG(GELC)
+C   IS THIS OVER END?
+      IF(LSEQNO.GT.RREG)GO TO 200
+C   ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C   SET FLAG TO SHOW END REACHED
+      IEND=1
+C   NEED TO SUM AND OUTPUT
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SUM NEXT SECTION OF CHARS
+      IF(IDM.EQ.26)THEN
+        DO 211 I = 1,LINLEN
+          IPSEQ1 = IPSEQ1 + 1
+          SEQ1(IPSEQ1) = '-'
+          JJJ = MOD(POSN,MAXGL2)
+          IF(JJJ.EQ.0)JJJ = MAXGL2
+          SEQ1(IPSEQ1) = MUNOTP(CHARS(1,JJJ))
+          CHARS(1,JJJ) = 0
+          POSN = POSN + 1
+211     CONTINUE
+      ELSE
+      DO 230 I=1,LINLEN
+        IPSEQ1=IPSEQ1+1
+        ISUM=0
+        JJJ = LWRAPS(POSN,MAXGL2)
+        SEQ1(IPSEQ1) = GTCONC(CHARS(1,JJJ),CHRSIZ,PERCD)
+        CALL FILLI(CHARS(1,JJJ),CHRSIZ,0)
+        POSN = POSN + 1
+230   CONTINUE
+      END IF
+C
+C
+C   ANY MORE TO OUTPUT?
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGLEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE SUMSS(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C
+C Routine to calculate a consensus. Only if the two strands agree
+C is a non dash character assigned.
+C
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=LSEQNO-RELPG(GELC)+1
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   Compare the strands. If they the same then set the consensus
+C   accordingly, otherwise set it to - so no edits are made.
+C
+      DO 500 I=1,LINLEN
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          SEQ1(POSN1) = LINOU1(I)
+        ELSE
+          SEQ1(POSN1) = '-'
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE TPCHEK(PC,PG,L,N)
+      INTEGER PC(N),PG(N),L(N)
+C     AUTHOR RODGER STADEN
+C     IF OVERLAPPING BLOCKS ARE FOUND REMOVE THE SHORTER ONE
+C     THEN REMOVE LARGE GAPS AT ENDS (THOSE AS LARGE AS THE END BLOCK)
+      K1 = 2
+1     CONTINUE
+      DO 10 I = K1,N
+        J1 = I
+        IF(PC(I).LE.PC(I-1)) GO TO 20
+        IF(PG(I).LE.PG(I-1)) GO TO 20
+10    CONTINUE
+C     REMOVE LARGE GAPS FROM ENDS
+C     THIS RULE OF THUMB COULD BE CHANGED TO USE A DIFFERENCE
+C     BETWEEN THE NUMBERS OF MISMATCHING CHARACTERS
+      IF(N.GT.1) THEN
+        K1 = PC(2) - PC(1) - L(1) 
+        J1 = PG(2) - PG(1) - L(1)
+        IF(MAX(K1,J1).GT.L(1)) THEN
+          CALL ML(PC,PG,L,N,1)
+          N = N - 1
+        END IF
+        IF(N.GT.1) THEN
+          K1 = PC(N) - PC(N-1) - L(N-1)
+          J1 = PG(N) - PG(N-1) - L(N-1)
+          IF(MAX(K1,J1).GT.L(N)) THEN
+            CALL ML(PC,PG,L,N,N)
+            N = N - 1
+          END IF
+        END IF
+      END IF
+      RETURN
+20    CONTINUE
+      IF(L(J1-1).GT.L(J1)) THEN
+        CALL ML(PC,PG,L,N,J1)
+      ELSE
+        CALL ML(PC,PG,L,N,J1-1)
+      END IF
+C  Until 25-11-90 next line was k1=j1 but this does not deal with all 
+C  cases: when a line is deleted we must compare it with the previous
+C  one before dealing with the rest, because it could be left of that
+C   one as well!
+      K1 = MAX(2,J1-1)
+      N = N - 1
+      GO TO 1
+      END
+      SUBROUTINE WRITEN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      WRITE(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE WRITER(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      WRITE(IDEVR,REC=N+1)SWAPBO(RELPG),SWAPBO(LNGTHG),
+     +SWAPBO(LNBR),SWAPBO(RNBR)
+      RETURN
+      END
+      SUBROUTINE WRITEW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      WRITE(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NCONTS,LLINOI,LINCNI,LREG,RREG,
+     +WINDOW,GWIND,LENCON,DEPTHP,DEPTHM,
+     +MARGL,MARGR,MARGB,MARGT,MAXOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +KBOUT,GEL,GEL2,IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM,
+     +SEQ1,IDIM1,NGELS,TEMP3,CHRSIZ,MAXGL2,LINOU1,LINOU2,
+     +NOPT1,NOPT2,NOPT3,
+     +IHELPS,IHELPE,HELPF,IDEVH,MXGOOD)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),WINDOW,CHRSIZ,GWIND
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND,CHNRP1,HQN
+      INTEGER TEMP3(2,CHRSIZ,MAXGL2)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER TERM,TUPPER,NAMARC*16,HELPF*(*)
+      CHARACTER SEQ1(IDIM1),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      EXTERNAL NOPWIN,CWORLD,TUPPER,CHNRP1,HQN
+C  nopt1 = single contig
+C  nopt2 = all contigs
+C  nopt3 = scan
+10    CONTINUE
+      LLINO = LLINOI
+      LINCON = LINCNI
+      LOCLR = 0
+      LOCRR = 0
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL CLEARV
+      CALL XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CALL VT100M
+      INFLAG = HQN(TERM)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      NOPT = NOPWIN(IY,MARGB,MARGT,MAXOPT)
+      TERM = TUPPER(TERM)
+      IF(NOPT.EQ.0) RETURN
+      IF(NOPT.EQ.NOPT3) THEN
+        IF(TERM.EQ.'S') THEN
+          XMIN = LREG
+          XMAX = RREG
+          X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF((TERM.EQ.'N').OR.(TERM.EQ.'Z').OR.(TERM.EQ.'I')) GO TO 10
+      END IF
+      IF(NOPT.EQ.NOPT1) THEN
+        STRAND = 1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHP)
+        IF(DEPTHP.LT.0) RETURN
+        STRAND = -1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHM)
+        IF(DEPTHM.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        RINC = ISYMAX / (DEPTHP + DEPTHM + 2)
+        RMID =(DEPTHM+1) * RINC
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+        IF(TERM.EQ.'I') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(ICLOSE.EQ.1) GO TO 10
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+1006      FORMAT
+     +    ( ' Name ',A,' Number ',I6,' Rel. Posn. ',I7,' Length ',I5)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Z') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(IGEL.EQ.0) GO TO 10
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT2),MARGT(NOPT2),ISXMAX,ISYMAX)
+          LREG = MAX(1,RELPG(IGEL)-GWIND)
+          RREG = MIN(RELPG(LINCON),RELPG(IGEL)+GWIND)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +    LLINO,LREG)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT2) THEN
+        CALL IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  NCONTS,IX,IY,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +  ISXMAX,ISYMAX,DBTDUX,DBTDUY,
+     +  LINCON,IGEL,IS)
+        IF(IGEL.EQ.0) RETURN
+        IF(TERM.EQ.'Z') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'I') THEN
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION CLINNO(LNBR,IDBSIZ,NCONTS,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER LNBR(IDBSIZ)
+      CLINNO = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          CLINNO = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE UPDCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +SEQ,MAXSEQ,IDIM1,CSTART,CLENO,LINCON,NAMPRO,SEQ2,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV,IDEV2,IFAIL,MAXGEL,IDM,PERCD)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CSTART,CLENO,S1,B1,ECHRSZ,RREG
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      CHARACTER SEQ(MAXSEQ),SEQ2(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C cstart consensus start point (before new reading)
+C cleno consensus length (before new reading)
+C lincon element number of contig
+C s1 number of first reading to shift
+C b1 number of first base to shift (in overall consensus positioning)
+C
+C there are 2 tasks: 1. make space for the new and altered region
+C                    2. calculate the new consensus and put it in the space
+C we do not have to make space if:
+C a. we are dealing with the last contig in the consensus and there are no
+C    readings starting to the right of the new data
+C b. the contig has not been padded
+C
+C New code to update the consensus only for the region affected by the
+C new reading. Find the next reading to the right of the new one, which
+C the new one does not overlap (might not be one!). Make a consensus from
+C start of new reading to here. Prior to this make space for it by moving
+C the consensus right (only if the contig is longer (padding or extra data
+C at its ends). Let s1 be the first reading to shift. We shift from its
+C left end to the end of the contig - where is this in the overall consensus?
+C The distance of the left end of s1 to the right end of the contig is
+C unchanged. This means that the new relpg(s1) is the same distance from
+C the right end of the old consensus as the old relpg(s1) was from the right
+C end of the old consensus. So from this we can calculate the position of the
+C the first base to move. 
+C Let L be the position in the overall consensus of  the last base in this contig
+C            L = cstart - cleno - 1
+C Let D = distance to end of contig 
+C            D = RELPG(LINCON) - relpg(s1) + 1.
+C First base to shift B1 = L - D + 1
+C Last base to shift is idim1
+C Distance to move to right is relpg(lincon) - cleno ie the number of extra bases
+C make consensus from relpg(ngels) to relpg(s1) - 1
+C put it at cstart + relpg(ngels) - 1
+C
+C Potential problems:
+C 1) reading at right end of contig
+C the search for the first nonoverlapping read to the right will return 0
+C shift al the next contig: ie cstart + cleno onwards
+C make consensus from relpg(ngels) to end of contig
+C put it at cstart + relpg(ngels) -1
+C
+C 2) reading at left end of contig
+C shift whole contig ie cstart - 20
+C add new title
+C shift consensus relpg(lincon) - cleno to the right
+C
+C 3) new reading contains contig - cases 1 and 2 combined
+C the search for the first nonoverlapping read to the right will return 0
+C shift whole of next contig and make consensus from relpg(ngels) to end of
+C contig.
+C
+C 4) Might not be a next contig to shift
+C
+C get number of first reading to shift
+C
+      S1 = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +RELPG(NGELS)+ABS(LNGTHG(NGELS))-1)
+C      WRITE(*,*)'S1',S1
+C
+C is the altered region longer than the original: only then do we need to shift
+C
+C           WRITE(*,*)'IDIM1',IDIM1
+C           WRITE(*,*)'RELPG(LINCON)',RELPG(LINCON)
+C           WRITE(*,*)'CSTART,CLENO',CSTART,CLENO
+      IF (RELPG(LINCON) - CLENO.GT.0) THEN
+C
+C it is longer so we probably need to shift
+C
+        IF (S1.EQ.0) THEN
+C
+C no readings start to the right of the new data
+C
+          IF (CSTART+CLENO-1.LT.IDIM1) THEN
+C
+C there are other contigs to the right
+C
+C           WRITE(*,*)'CSTART,CLENO',CSTART,CLENO
+            B1 = CSTART + CLENO
+C            WRITE(*,*)'B1',B1
+            CALL MAKHCA(SEQ,MAXSEQ,B1,RELPG(LINCON)-CLENO,IDIM1)
+          ELSE
+C
+C there are no contigs to the right and no readings start to the right of
+C the new one so nothing to shift
+C
+          END IF
+        ELSE
+C
+C there are readings starting to the right of the new one
+C
+C shift from start of next reading to right
+C
+           L = CSTART + CLENO - 1
+C           WRITE(*,*)'CSTART,CLENO,L',CSTART,CLENO,L
+           LD = RELPG(LINCON) - RELPG(S1) + 1
+C           WRITE(*,*)'LD',LD
+           B1 = L - LD + 1
+C            WRITE(*,*)'B1',B1
+           CALL MAKHCA(SEQ,MAXSEQ,B1,RELPG(LINCON)-CLENO,IDIM1)
+        END IF
+      END IF
+C
+C now make new consensus (where do we put it,  do we need
+C to give it a header, and what region do we make it for ?
+C in the simplest case make it for relpg(ngels) to relpg(s1) -1
+C if s1=0 make it for relpg(ngels) to end of contig (relpg(lincon))
+C we give it a header if it is at the left end of the contig ie lnbr(ngels)=0
+C
+C we always start at the left end of the new reading
+C
+      LREG = RELPG(NGELS)
+C
+C we end at the next reading to the right or the end of the contig
+C
+      IF (S1.NE.0) THEN
+        RREG = RELPG(S1) - 1
+      ELSE
+        RREG = RELPG(LINCON)
+      END IF
+C
+C where do we put the new consensus ?
+C
+      B1 = CSTART + RELPG(NGELS) - 1
+C      WRITE(*,*)'LREG,RREG',LREG,RREG
+C            WRITE(*,*)'B1',B1
+C
+C do we need to add a title
+C
+      IF (LNBR(NGELS).EQ.0) THEN
+        B1 = CSTART - 20
+C        WRITE(*,*)'ADD NEW TIT AT',B1
+        CALL ADDTIT(SEQ(B1),NAMPRO,NGELS,B1)
+      END IF
+      IGELC = LNBR(LINCON)
+C
+C note aconsn will chain along until it find the first useful reading
+C
+      JOB = 2
+      CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,B1,JOB,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL,MAXGEL,IDM,PERCD)
+      IF(IFAIL.NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        RETURN
+      END IF
+C
+C before we leave we must make the overall consensus length correct
+C  so add on the extra length (if any) which is the new length - old length
+C
+C      WRITE(*,*)'OLD IDIM1',IDIM1
+      IDIM1 = IDIM1 + RELPG(LINCON) - CLENO
+      IDIM2 = IDIM1 + RELPG(LINCON) - CLENO
+C      WRITE(*,*)'NEW IDIM1/2',IDIM1
+      END
+      SUBROUTINE MAKHCA(STRING,MAXAR,FROM,HSIZE,ASIZE)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROM,HSIZE,ASIZE
+C
+C make a hole of size hsize in character array size asize
+C
+      J = ASIZE + HSIZE
+      DO 10 I=ASIZE,FROM,-1
+        STRING(J) = STRING(I)
+        J = J - 1
+ 10     CONTINUE
+      END
+      INTEGER FUNCTION CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C find first reading starting past lreg (0=none found)
+C
+      I = LGEL
+      CHNRP = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +LGEL,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C find first reading with data covering or past lreg (0=none found)
+C
+      I = LGEL
+      CHNRP1 = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I)+ABS(LNGTHG(I))-1.LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP1 = I
+        RETURN
+      END IF
+      END
+C      ACONSN
+      SUBROUTINE ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,JOB,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      INTEGER CHNRP1
+      EXTERNAL CHNRP1
+C
+C new consensus calculating routine (could replace acons if we check values of job)
+C
+C job = 0 do it for whole db
+C     = 2 for selected contig only
+C     = 1 for selected contig, adding a header
+C     note jobs 0 and 1 update istart (it always points to end of overall 
+C     consensus), but job=2 does not
+C
+      CALL BUSY(KBOUT)
+      IFAIL=0
+      IF(JOB.EQ.1) THEN
+C
+C do it for a selected contig, adding title
+C
+        ISTART=ISTART+1
+        IDIM11=RREG-LREG+1
+        IF((ISTART+19+IDIM11).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+           IFAIL=1
+           RETURN
+        END IF
+C
+C allow summer to be dumb, and find first relevant reading number
+C
+        LLINO1 =  CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LLINO,LREG)
+        IF(LLINO1.EQ.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Error in ACONSN: no data found for consensus')
+          IFAIL = 1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO1,IDBSIZ,TEMP,
+     +  CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+IDIM11-1
+        RETURN
+      END IF
+      IF(JOB.EQ.2) THEN
+C
+C do it for a selected contig
+C
+        IDIM11=RREG-LREG+1
+        IF((ISTART+IDIM11-1).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+           IFAIL=1
+           RETURN
+        END IF
+C
+C allow summer to be dumb, and find first relevant reading number
+C
+        LLINO1 =  CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LLINO,LREG)
+        IF(LLINO1.EQ.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Error in ACONSN: no data found for consensus')
+          IFAIL = 1
+          RETURN
+        END IF
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO1,IDBSIZ,TEMP,
+     +  CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        RETURN
+      END IF
+C
+C  do it for all contigs
+C
+      N=IDBSIZ-NCONTS
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        ISTART=ISTART+1
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+1009       FORMAT(
+     +  ' Database maximum consensus length(',I6,') exceeded',/,
+     +  ' calculation aborted')
+           IFAIL=1
+           RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +  IDM,PERCD)
+        ISTART=ISTART+Y-1
+110   CONTINUE
+      END
+      SUBROUTINE AERROR(IDEVS,IDEVF,NAME,IERR)
+      CHARACTER NAME*(*)
+C
+C handle errors for assembly
+C
+C errors are:
+C 0 file not found
+C 1 read too short
+C 2 failed to align and not entered
+C 3 failed on entry
+C 4 failed to align but entered
+      WRITE(IDEVF,1000)NAME(1:INDEX(NAME,' ')),IERR
+ 1000 FORMAT(A,I2)
+      CALL ERROM(IDEVS,'Failed reading written to error file')
+      END
+      SUBROUTINE SHFTLA(STRING,MAXAR,FROMS,TO,FROME)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROMS,TO,FROME
+C
+C shift an array left from froms to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J) = STRING(I)
+        J = J + 1
+ 10   CONTINUE
+      END
+      SUBROUTINE GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELIDN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*),PROMPT*(*)
+      EXTERNAL GELIDN
+      IERR = 1
+      NCONTC = GELIDN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      CALL ERROM(KBOUT,'No contig line for this reading!')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      INTEGER FUNCTION GELIDN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*),PROMPT*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*17,NAME3*17,NFLAG
+      PARAMETER (NFLAG='/')
+      EXTERNAL NAMENO
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELIDN = 0
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 17
+      CALL GTSTR(PROMPT,NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELIDN = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        GELIDN = NAMENO(NAME1(2:),NGELS,IDEVN)
+        IF (GELIDN.EQ.0) CALL ERROM(KBOUT,'Reading name not found')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELIDN
+1001    FORMAT(I17)
+        IF((GELIDN.LT.1).OR.(GELIDN.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION NAMENO(NAME,NGELS,IDEVN)
+      CHARACTER NAME*(*)
+      CHARACTER*16 NAME1,NAME2
+      NAME1 = NAME
+      CALL CCASE(NAME1,1)
+      DO 10 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        CALL CCASE(NAME2,1)
+        IF (NAME1.EQ.NAME2) THEN
+          NAMENO = I
+          RETURN
+        END IF
+ 10   CONTINUE
+      NAMENO = 0
+      END
+      SUBROUTINE REMGBD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,IDEV2,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL),NAMARC*16,FILNAM*(*)
+      INTEGER REMME,GCLIN,CHAINL,GNFFOF
+      PARAMETER (MAXPRM = 35)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      EXTERNAL GCLIN,CHAINL,NAMENO,GNFFOF
+C assumes db is logical consistent
+      FILNAM = ' '
+      PROMPT(1) = 'Define a region by reading names'
+      PROMPT(2) = 'Use a file of reading names'
+      PROMPT(3) = 'Move a reading to a separate contig'
+      PROMPT(4) = 'Make a list of unattached readings'
+      IOPT = 1
+      CALL RADION('Select list definition mode',PROMPT,4,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.4) THEN
+C
+C here we find all contigs with single readings and write their
+C names to a file
+C
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        DO 5 I=IDBSIZ-NCONTS,IDBSIZ-1
+          IF (LNBR(I).EQ.RNBR(I)) THEN
+            IF (LNBR(I).NE.0) THEN
+              CALL READN(IDEVN,LNBR(I),NAMARC)
+              WRITE(IDEV2,1000)NAMARC
+            END IF
+          END IF
+ 5      CONTINUE
+        CLOSE(UNIT=IDEV2)
+        RETURN
+      END IF
+      IF(IOPT.EQ.3) THEN
+C
+C here we start a new contig with the selected reading
+C
+C we get the reading number igelno and move a copy of it
+C to ngels+1. Then we use the remove reading routine to delete
+C the original copy and move the new one to fill the hole. The 
+C reason for this convoluted route is that remgel cleans up
+C all the mess. We must write a new contig line and check the
+C orientation.
+C
+C
+        IF(NGELS+3.GE.IDBSIZ-NCONTS) THEN
+          CALL ERROM(KBOUT,'Insufficient space for new contig')
+          RETURN
+        END IF
+        NGELST = NGELS + 1
+        NCONTT = NCONTS + 1
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOL,LLINOL,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Reading to disconnect',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        CALL READW(IDEVW,IGELNO,GEL,MAXGEL)
+C
+C move reading info over end of gel list
+C        
+        CALL READN(IDEVN,IGELNO,NAMARC)
+        CALL WRITEN(IDEVN,NGELST,NAMARC)
+        LNBR(NGELST) = 0
+        RNBR(NGELST) = 0
+C
+C leave orientation the same
+C
+        LNGTHG(NGELST) = LNGTHG(IGELNO)
+        RELPG(NGELST) = 1
+        CALL WRITER(IDEVR,NGELST,RELPG(NGELST),LNGTHG(NGELST),
+     +  LNBR(NGELST),RNBR(NGELST))
+        CALL WRITEW(IDEVW,NGELST,GEL,MAXGEL)
+        CALL MOVTAG(IGELNO,NGELST)
+C
+C start a new contig
+C
+        I = IDBSIZ - NCONTT
+        LNBR(I) = NGELST
+        RNBR(I) = NGELST
+        LNGTHG(I) = 0
+        RELPG(I) = ABS(LNGTHG(NGELST))
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+        NGELS = NGELST
+        NCONTS = NCONTT
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,0,0)
+        IGLNO = IGELNO
+        CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  IGLNO,LINCOL,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+        RETURN
+      END IF
+      IF(IOPT.EQ.1) THEN
+ 10     CONTINUE
+        LLINOL = 0
+        LLINOR = 0
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOL,LLINOL,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Leftmost reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOR,LLINOR,JGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Rightmost reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        IF (LLINOL.NE.LLINOR) THEN
+          CALL ERROM(KBOUT,
+     +    'For this mode readings must be in the same contig')
+          GO TO 10
+        END IF
+        IF (RELPG(IGELNO).GT.RELPG(JGELNO)) THEN
+          CALL ERROM(KBOUT,
+     +    'For this mode readings must be in left to right order')
+          GO TO 10
+        END IF
+C
+C IGELNO is first read to remove, JGELNO the last
+C
+C we must make a list of reads because removal changes numbers
+C
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for temporary file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+C
+C write out their names
+C
+        J = IGELNO
+ 20     CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV2,1000)NAMARC
+        WRITE(KBOUT,1001)NAMARC
+ 1000   FORMAT(A)
+ 1001   FORMAT(' ',A)
+        IF (J.NE.JGELNO) THEN
+          IF (J.NE.0) THEN
+            J=RNBR(J)
+            GO TO 20
+          END IF
+        END IF
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF (IOK.NE.0) THEN
+          CLOSE(UNIT=IDEV2)
+          RETURN
+        END IF
+        REWIND(UNIT=IDEV2)
+      ELSE IF(IOPT.EQ.2) THEN
+C
+C here we start from a file of file names
+C 
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Name of file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+
+      ELSE
+        CALL ERROM(KBOUT,'How the hell did we get here?')
+        RETURN
+      END IF
+ 30   CONTINUE
+      IOK = GNFFOF(IDEV2,NAMARC)
+      IF(IOK.EQ.1) GO TO 100
+      IF(IOK.NE.0) GO TO 30
+      REMME = NAMENO(NAMARC,NGELS,IDEVN)
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        CALL ERROM(KBOUT,'No contig line for this reading')
+        IOK = 1
+        GO TO 100
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      GO TO 30
+ 100  CONTINUE
+      CLOSE(UNIT=IDEV2)
+      END
diff --git a/src/bap/dbsyscommon.f~ b/src/bap/dbsyscommon.f~
new file mode 100644
index 0000000..b741588
--- /dev/null
+++ b/src/bap/dbsyscommon.f~
@@ -0,0 +1,5742 @@
+C  DBSYS ROUTINES COMMON TO PRE AND POST .RD PROGRAMS
+C
+C 25-8-92 NOTE at the end of the file are new versions of getln2 (getln3)
+C              and gelid (gelidn) and a new routine nameno, that should
+C              replace getln2 and gelid.
+C
+C  Further sap routines are in dbsysold.f (pre .rd) and dbsysnew.f (post)
+C  the split was made by rs 23-1-91
+C 15-6-92 added fasta format output from consen
+C 15-4-92 added in all the speedup changes ive been making and made 
+C         screnv compatible
+C 13-4-92 changed autocn to use a new hashing routine encof and inite
+C  2-4-92 Added new dbauto related routines and changed gtconc
+C  2-4-92 Added filnam = ' ' and brought uptodate with dap
+C  4-5-90 Change to getreg to allow escape
+C  4-5-90 addition of graphics routines and changes to menus
+C  9-5-90 added default gel reading: many changes
+C  17-5-90 Fixed 3 bugs in screen editing: 1) rightjustified names
+C          caused problems; 2) beginnings of sequences starting at far
+C          right of lines where not seen; 3) lines with no numbers at
+C          the end of a contig (i.e. with <10 chars) were flagged
+C          as errors. Changes to ltype for 1, linlen for 2, dsplay for 3
+C  9-7-90  removed menu routines
+C  20-8-90 changed gelid to add / to reading name because xsap did
+C          not return the INFLAG = 3 for the default
+C  23-8-90 Changes to dbauto and autocn to deal with failures better
+C          Plus addition of calls to BUSY
+C  9-11-90 Replaced call to radio with call to radion
+C  19-11-90 Changed max match length in dbauto to maxglm+1  (was 50)
+C  25-11-90 Very important bug fix in tpchek. Old versions could
+C           duplicate bits of working versions.
+C  28-11-90 Modified slider to receive maxpg and maxpc and to allow exactly
+C           the requested number of matches at each end of the two 
+C           sequences. 
+C           Added two new options to dbauto: all gels to new contigs, all 
+C           gels to contig 1; plus resurrected forbidding joins to allow
+C           sequences to be entered only into the contig the overlap best.
+C           Changed autocn to sort overlaps into order based on % mismatch
+C           (previously it saved the best two in any order)
+C           Minor change to dbstar
+C 3-1-91   Discovered bug in dbopen: incorrect call to getint when the
+C          database is very old and needs values for the current format
+C 21-1-91  GELID allowed illegal gel numbers to be returned! Fixed it.
+C 22-1-91  Modified autocn, adism4,adism3 to give more info about
+C          overlaps, and to allow 10 overlaps. Modified dbopen to
+C          return version number, ditto dbstar
+C 23-1-91  Split into dbsyscommon, dbsysold, dbsysnew
+C 26-2-91  Improved overflow check in padcop
+C 28-7-91 added extra parameter to quality calc: mxgood is the maximum
+C         reading length in which we have confidence, so only add this
+C         many chars from the start of each reading. Also changed the
+C         quality calc to make it the same as the consensus one. Made
+C         all characters have nonzero score and made lowercase = 100
+C 21-8-91 Changed arrfil to arrfim which does not display comments
+C 22-8-91 Added routine to find contig line number given left gel (CLINNO)
+C 
+C
+C 12-11-91 BIG CHANGE: made database handle 99,999 readings and 16 char names
+C
+C
+C          Also added routine to make aedit take strandedness into account
+C          (SUMSS).
+C          Also added fmt4lp which is used by find internal joins and
+C          could be used to advantage by others that call fmt4ln.
+C 18-11-91 New routine GETLN2 with returns gel number specified
+C
+C     enconn
+C   routine to store positions of words in posns and first occurences
+C   in wordp and number of occurences in wordn
+C   each number is a value representing one of the le4 possible
+C   words of length length made up of 4 characters
+C   words in posns are numbers from 1 to 4**length
+      SUBROUTINE ENCONN(POSNS,IDIM,WORDP,WORDN,LE4,LENGTH,START)
+C   AUTHOR: RODGER STADEN
+      INTEGER WORDP(LE4),POSNS(IDIM)
+      INTEGER WORDN(LE4),START
+C   number of words of length length
+      IDIM1 = IDIM - (LENGTH-1)
+      IF (START.EQ.1) THEN
+        DO 10 I=1,LE4
+          WORDN(I) = 0
+10      CONTINUE
+      END IF
+C   loop for each word
+      DO 100 I=START,IDIM1
+        N = POSNS(I)
+        IF(N.NE.0) THEN
+          NW = WORDN(N)
+C   is their already an entry for this word?
+          IF(NW.EQ.0) THEN
+C  first entry, put in wordp
+            WORDP(N) = I
+            WORDN(N) = NW + 1
+          ELSE
+            WORDN(N) = NW + 1
+            POSNS(I) = WORDP(N)
+            WORDP(N) = I
+          END IF
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE ENCOF(SEQ,IDSEQ,CONST,CSTART,LENGTH,POSNS)
+      CHARACTER SEQ(IDSEQ)
+      INTEGER CONST(LENGTH),CSTART,POSNS(IDSEQ),HASH
+      INTEGER CTONUM,CONSTL
+      EXTERNAL CTONUM
+C
+C new hashing routine. hash = k1 + k2
+C
+C hash = k1.c1 + k2.c2 + ... + kn.cn - cstart
+C now c1=1, c2=4*c1, c3=4*c2,...
+C
+C find length bases in a row, then do first word base by base,
+C for rest only change what is necessary
+C
+      DO 1 I=1,IDSEQ
+        POSNS(I) = 0
+ 1    CONTINUE
+      CONSTL = CONST(LENGTH)
+      LM1 = LENGTH - 1
+      IDSQML = IDSEQ - LENGTH
+      IS = 1
+      I  = 1
+      IP = 1
+      HASH = 0
+ 10   CONTINUE
+C
+C end approaching ?
+C
+      IF (IS.GT.IDSQML) RETURN
+C
+C at least a words length of characters left
+C
+ 11   CONTINUE
+      K = CTONUM(SEQ(I))
+      IF (K.EQ.5) THEN
+C
+C start a new word
+C
+        IS = I + 1
+        I = IS
+        IP = 1
+        HASH = 0
+        GO TO 10
+      END IF
+      HASH = HASH + CONST(IP) * K
+      IF (IP.NE.LENGTH) THEN
+        I = I + 1
+        IP = IP + 1
+        GO TO 11
+      END IF
+C
+C word finished
+C
+C save the hash value and the 
+C
+ 20   CONTINUE
+      POSNS(IS) = HASH + CSTART
+C      K1 = CONST(1) * CTONUM(SEQ(IS)) note const(1) = 1
+      K1 = CTONUM(SEQ(IS))
+      K2 = (HASH - K1) / 4
+      IS = IS + 1
+      IF (IS.GT.IDSQML) RETURN
+      K = CTONUM(SEQ(IS+LM1))
+      IF (K.EQ.5) THEN
+        IS = IS + 1
+        I = IS
+        IP = 1
+        HASH = 0
+        GO TO 10
+      END IF
+C      HASH = K2 + K * CONST(LENGTH) note this is a constant constant
+      HASH = K2 + K * CONSTL
+      GO TO 20
+      END
+      SUBROUTINE INITE(CONST,CSTART,LENGTH)
+      INTEGER CONST(LENGTH),CSTART
+      CSTART = 1
+      DO 1 I=1,LENGTH
+C      WRITE(*,*)I
+        CONST(I) = 4**(I-1)
+        CSTART = CSTART - CONST(I)
+ 1    CONTINUE
+      END
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+C      ABEDIN
+C
+C   ROUTINE TO EDIT THE DB USING A PADDED SEQ
+C   HAVE AN ARRAY SEQC2 LENGTH IDC OF PADDED SECTION OF CONTIG LINCON
+C  THE LEFT END OF THE PADDED CONTIG STARTS AT X
+C   THERE ARE ITOTPC PADS TO MAKE
+C
+      SUBROUTINE ABEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,POSN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQC2(IDC),GEL(MAXGEL),P
+      SAVE P
+      DATA P/','/
+C
+C   POINT TO CONTIG
+      POSN=X-1
+C   POINT TO SEQC2
+      IAT=0
+C   COUNT PADS DONE
+      IDONE=0
+C   LOOP FOR ALL SEQC2
+      DO 100 J=1,IDC
+      POSN=POSN+1
+      IAT=IAT+1
+      IPAD=0
+C   IS THIS A PADDING CHAR?
+      IF(SEQC2(IAT).NE.P)GO TO 100
+50    CONTINUE
+C   COUNT PADS
+      IPAD=IPAD+1
+      IAT=IAT+1
+      IF(SEQC2(IAT).EQ.P)GO TO 50
+C   END OF THIS STRETCH OF PADS,DO INSERT
+C   HAVE IPAD INSERTS TO MAKE AT POSN
+      CALL PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,IPAD,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   MOVE POINTER TO CONTIG
+      POSN=POSN+IPAD
+C   COUNT PADS DONE
+      IDONE=IDONE+IPAD
+C   ANY MORE TO DO?
+      IF(IDONE.EQ.ITOTPC)GO TO 101
+100   CONTINUE
+C   ERROR SHOULD HAVE DONE ALL PADS
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Problem: some pads were not done!')
+101   CONTINUE
+      END
+      SUBROUTINE ADDTIT(SEQ1,NAMPRO,NGELS,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(20),NAMPRO*(*)
+      CHARACTER NUMS(10)
+C Set maximum number of digits in reading number
+      PARAMETER (MAXGD = 5)
+      SAVE NUMS
+      DATA NUMS/'0','1','2','3','4','5','6','7','8','9'/
+      CALL FILLC(SEQ1(2),18,'-')
+      SEQ1(1) = '<'
+      SEQ1(20) = '>'
+      IEND = INDEX(NAMPRO,'.')
+      N=NGELS
+      K=IEND+MAXGD
+      DO 10 J=1,MAXGD
+        N=MOD(N,10)+1
+        NAMPRO(K:K)=NUMS(N)
+        N=NGELS/(10**J)
+        K=K-1
+10    CONTINUE
+      K = 18-IEND
+      K=K/2
+      DO 20 I=1,IEND+MAXGD
+        SEQ1(K)=NAMPRO(I:I)
+        K=K+1
+20    CONTINUE
+      IDIM1=IDIM1+20
+      END
+      SUBROUTINE ADISM1(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,
+     +ISTRAN,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C  NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCCC
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C
+C   EDITED 07-02-83 TO ALLOW FOR CASE WHERE A GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL. USE PARM THAT CONTAINS
+C   THE POSITION OF THE LEFT END OF THE NEXT CONTIG. SET TO VERY HIGH
+C   VALUE TO START
+      NEXTC=IDIM+1
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPERATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C  COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     1SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C   COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     1SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+C
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       ADISM2
+C   ROUTINE TO DISPLAY MATCHES
+      SUBROUTINE ADISM2(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C   NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCC
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC. SEE ADISM1.
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with contig number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   UPDATE END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.GT.2)RETURN
+      ILEFTS(IMATC)=CENDS(JJ)+20
+      ILC(IMATC)=LCL+LCR+1
+      IPOSC(IMATC)=LCL+1
+      IPOSG(IMATC)=SAVPG
+      LLINO(IMATC)=NENDS(JJ)
+      ISENSE(IMATC)=1
+      IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      RETURN
+      END
+      SUBROUTINE ADISM3(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+        WRITE(KBOUT,1000)LLINO(IMATC),IPOSC(IMATC),ISTRAN,
+     +  IPOSG(IMATC)
+ 1000   FORMAT
+     +  (' Contig',I5,' position',I6,' matches strand',I2,
+     +  ' at position',I5)
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+      SUBROUTINE ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      RETURN
+      END
+      SUBROUTINE AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +GEL,MAXGEL,CON,IDC,IDEVW,IDEVR,LREG,RREG,KBOUT)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),CON(IDC)
+      INTEGER RREG,PC,PCA,PG
+      NG = LGEL
+      PG = RELPG(NG)
+      PC = LREG
+      NTT = 0
+      NCT = 0
+      NDT = 0
+10    CONTINUE
+C      WRITE(*,*)'GEL',NG
+      CALL READW(IDEVW,NG,GEL,MAXGEL)
+      LG = ABS(LNGTHG(NG))
+      IF(PC.LT.LREG) PC = LREG
+      PCA = PC - LREG + 1
+      IG = PC - PG + 1
+      LC = MIN(LG,RREG-PC+1)
+C      WRITE(*,*)'PC,PG,IG,LG,PCA,LC',PC,PG,IG,LG,PCA,LC
+      CALL ET(GEL(IG),LG,CON(PCA),LC,NE)
+      NTT = NTT + NE
+      CALL EC(GEL(IG),LG,CON(PCA),LC,NE)
+      NCT = NCT + NE
+      CALL ED(GEL(IG),LG,CON(PCA),LC,ND)
+      NDT = NDT + ND
+      CALL WRITEW(IDEVW,NG,GEL,MAXGEL)
+      IF(ND.GT.0) THEN
+        K = LNGTHG(NG)
+        LNGTHG(NG) = ABS(LNGTHG(NG)) - ND
+        LNGTHG(NG) = SIGN(LNGTHG(NG),K)
+        CALL WRITER(IDEVR,NG,RELPG(NG),LNGTHG(NG),LNBR(NG),RNBR(NG))
+      END IF
+      IF(RNBR(NG).NE.0) THEN
+        NG = RNBR(NG)
+        PG = RELPG(NG)
+        PC = PG
+        IF(PG.LE.RREG) GO TO 10
+      END IF
+      CALL EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      WRITE(KBOUT,1000)NTT
+1000  FORMAT(' Number of transpositions=',I6)
+      WRITE(KBOUT,1001)NCT
+1001  FORMAT(' Number of changes       =',I6)
+      WRITE(KBOUT,1002)NDT
+1002  FORMAT(' Number of deletions     =',I6)
+      END
+C    AJOIN2
+C   COMPLETES JOIN AND RETURNS LENGTH OF NEW CONTIG IN LLINOR
+      SUBROUTINE AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +RELX,LLINOL,LLINOR,LNCONL,LNCONR,IDEVR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNBR(IDBSIZ),RNBR(IDBSIZ),LNGTHG(IDBSIZ)
+      INTEGER RELX
+C   RELX IS THE POSITION OF THE JOINT
+C   LLINOL IS THE LEFT GEL NUMBER OF THE LEFT CONTIG
+C   LLINOR IS THE LEFT GEL OF THE RIGHT CONTIG
+C   LNCONL IS THE LEFT CONTIG LINE NUMBER
+C   LNCONR IS THE RIGHT CONTIG LINE NUMBER
+C
+C   ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+      N=LLINOR
+      RELPG(N)=RELX
+50    CONTINUE
+      IF(RNBR(N).EQ.0)GO TO 60
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+RELX-1
+      GO TO 50
+60    CONTINUE
+C
+C   FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+      LNBR(LLINOR)=RNBR(LNCONL)
+C   FIX UP RIGHT GEL OF LEFT CONTIG
+      N=RNBR(LNCONL)
+      RNBR(N)=LLINOR
+C   MERGE WILL SORT OUT THE CORRECT NEIGHBOURS
+C
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C   MERGE DOES NOT WRITE TO DISK
+      N=LNBR(LNCONL)
+65    CONTINUE
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N))
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 65
+C   CONTIG LINES
+      X=RELPG(LNCONR)+RELX-1
+C   LENGTH MAY NOT HAVE INCREASED!
+      IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+C   SAVE LENGTH OF NEW CONTIG
+      RELX=RELPG(LNCONL)
+C      WRITE(IDEVR,REC=LNCONL)RELPG(LNCONL),LNGTHG(LNCONL),LNBR(LNCONL),
+C     1RNBR(LNCONL)
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C
+C   NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+      N=IDBSIZ-NCONTS
+      M=LNCONR-N
+      IF(M.EQ.0)GO TO 80
+      K=LNCONR
+      J=LNCONR-1
+      DO 70 I=1,M
+      RELPG(K)=RELPG(J)
+      LNGTHG(K)=LNGTHG(J)
+      LNBR(K)=LNBR(J)
+      RNBR(K)=RNBR(J)
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K))
+      K=K-1
+      J=J-1
+70    CONTINUE
+80    CONTINUE
+      NCONTS=NCONTS-1
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+90    CONTINUE
+      RETURN
+      END
+C     SUBROUTINE AJOIN3
+      SUBROUTINE AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,IDIM22,
+     +KLASS,IOVER,KBOUT,PL,PR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LINCON(2),IDIM22(2)
+      INTEGER ITYPE(2),ISENSE(2),JOINT(2),PL(2),PR(2)
+C
+C   CALC POSITIONS OF CONTIGS RELATIVE TO FIXED GEL
+      DO 20 I=1,2
+C   R+
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.1))GO TO 11
+      PL(I)=-1*JOINT(I)+2
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   L+
+11    CONTINUE
+      IF((ITYPE(I).NE.1).OR.(ISENSE(I).NE.1))GO TO 12
+      PL(I)=JOINT(I)
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   R-
+12    CONTINUE
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.-1))GO TO 13
+      PR(I)=JOINT(I)+IDIM22(I)-1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+      GO TO 20
+C   L-
+13    CONTINUE
+      PR(I)=IDIM22(I)-JOINT(I)+1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+20    CONTINUE
+C  LENGTH OF OVERLAP
+      IOVER=MIN(PR(1),PR(2))-MAX(PL(1),PL(2))+1
+      WRITE(KBOUT,1002)IOVER
+1002  FORMAT(' Length of overlap between the contigs=',I6)
+C
+C  CLASS NUMBER 1-16
+      KLASS=1
+      IF(ITYPE(1).EQ.1)KLASS=KLASS+8
+      IF(ISENSE(1).EQ.-1)KLASS=KLASS+4
+      IF(ITYPE(2).EQ.1)KLASS=KLASS+2
+      IF(ISENSE(2).EQ.-1)KLASS=KLASS+1
+C      WRITE(KBOUT,1001)KLASS
+C1001  FORMAT(' CLASS OF JOIN=',I6)
+      RETURN
+      END
+C      ALINE
+C
+C    ROUTINE TO LINE UP 2 SEQS.
+C   IT SLIDES,REMOVES OVERLAPPING MATCHES,
+C   SORTS MATCHES INTO ASCENDING ORDER, THEN DOES DOES A TOPOLOGICAL
+C   CHECK, AND THEN PRODUCES 2 LINED UP SEQS WITH PADDING CHARS
+C   VARIABLES
+C       SEQ1 CONSENSUS
+C       SEQ2 GEL ORIGINAL IN CORRECT ORIENTATION
+C       SEQG2 ALIGNED GEL
+C       SEQC2 ALIGNED CONSENSUS
+C       SEQ3 SAVED GEL RAW DATA
+C       ISAV1,2,3 STORE MATCHES AND POSITIONS
+C       IDSAV NUMBER ISAV'S
+C       IDC LENGTH OF INPUT SEQ1
+C       IDIM2 LENGTH OF INPUT SEQ2
+C       IDOUT LENGTH OF OUTPUT ALIGNED SEQ1
+C       IDIM2 LENGTH OF SEQ2 ON OUTPUT AFTER ALIGNMENT
+C       MINSLI MIN MATCH FOR SLIDING
+C       IFAIL FLAG TO SHOW IF ALIGNMENT FAILED DUE TO TOO
+C   MANY MISMATCHES OR TOPOLIGICAL CHECK OR TOO MANY OR TOO MANY
+C   PADDING CHARS. 1=FAIL,0=PASS
+C
+      SUBROUTINE ALINE(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM,LENO,ISHOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+C
+C added next routine 27-2-93
+C
+      CALL UPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+      CALL DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL,LENO,PERMAX,ISHOW)
+      IF(IFAIL.NE.0)RETURN
+      IF(ISHOW.EQ.1) THEN
+        WRITE(KBOUT,1052)PERCM,ITOTPC,ITOTPG
+1052    FORMAT(' Percent mismatch=',F4.1,', pads in contig=',I3,
+     +  ', pads in gel=',I3)
+      END IF
+      END
+      SUBROUTINE ARCSER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*16,NAME2*16
+10    CONTINUE
+      L = 0
+      CALL GTSTR('Archive name',' ',NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(L.EQ.0) RETURN
+      CALL CCASE(NAME1,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      IF(NAME1(1:1).EQ.' ') RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      DO 100 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        IF(NAME1.EQ.NAME2) THEN
+          WRITE(KBOUT,1003)NAME2,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1003      FORMAT(' ',A,2X,I4,2X,I6,2X,I6,2X,I6,2X,I6/)
+          GO TO 10
+        END IF
+100   CONTINUE
+      WRITE(KBOUT,1004)NAME1
+1004  FORMAT(' ',A,' Not in database')
+      GO TO 10
+      END
+      SUBROUTINE AUTOCN(SEQ1,IDIM,GEL,IDIMG,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,CSTART,
+     +JOBC,PERMIS,LENO,ISHOW)
+C   AUTHOR: RODGER STADEN
+C   changed 29-11-90 to make first in list of alignments the best
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIM),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIM),GEL(MAXGLM)
+C
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 100)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),CSTART,START
+      REAL PERMIS(2)
+C
+C jobc tells how to update the hash tables:
+C 0 means dont do anything because the consensus hasnt changed
+C 1 means add the last contig because a new one has been stuck on the end
+C 2 means do the whole consensus
+C
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+C  23-8-90 Need to deal with failures in a better way. Problem is
+C          case where overlaps are found but fail to align. In future
+C          signal them with new variable KFAIL which will be nonzero
+C          if any alignment fails.
+C  29-11-90 Changed sorting of overlaps so that the best is first in the 
+C           list returned to caller.
+C   SAVE GEL
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+C  COUNT NUMBER OF CONTIGS THAT MATCH
+      IMATC=0
+      IDCEND=MAXCON
+      CALL BUSY(KBOUT)
+      CALL FNDCON(SEQ1,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      IF (JOBC.NE.0) THEN
+        START = 1
+        IF(JOBC.EQ.1) START = CENDS(IDCEND)
+        CALL ENCOF(SEQ1(START),IDIM-START+1,CONST,CSTART,LENGTH,
+     +POSNS(START))
+        CALL ENCONN(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH,START)
+      END IF
+1     CONTINUE
+      ISTRAN=1
+2     CONTINUE
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCOF(GEL,IDIMG,CONST,CSTART,LENGTH,GELN)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0)RETURN
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.EQ.0) THEN
+        IFAIL(1) = 0
+        RETURN
+      END IF
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINE(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS,LENO,
+     +  ISHOW)
+        IF(JFAIL.EQ.0) THEN
+          JMATC = JMATC + 1
+          IF(JMATC.EQ.1) THEN
+C    Save in elements 1
+             CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            IFAIL(1) = 0
+          ELSE IF(JMATC.EQ.2) THEN
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+              IFAIL(2) = 0
+            END IF
+          ELSE
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE IF(PERMS.LT.PERMIS(2)) THEN
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+            END IF
+          END IF
+        ELSE
+          KFAIL = 1
+        END IF
+100   CONTINUE
+      IMATC = MIN(2,JMATC)
+      END
+      SUBROUTINE BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CHAINL,GCLIN
+      EXTERNAL CHAINL,GCLIN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTR = IDBSIZ - NCONTS - 1
+      IF(NCONTR.LE.NGELS) THEN
+        WRITE(KBOUT,*)'Insufficient space for new contig line.'
+        WRITE(KBOUT,*)'Increase database size with copy'
+        RETURN
+      END IF
+10    CONTINUE
+        MN = 0
+        MX = NGELS
+        IR  = 0
+        CALL GETINT(MN,MX,IR,
+     +  'Number of gel reading that will become a left end',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.LT.1) RETURN
+        IR = IVAL
+      IL = LNBR(IR)
+      IF(IL.EQ.0)THEN
+        WRITE(KBOUT,*)'Gel number',IR,' is already a left end'
+        GO TO 10
+      END IF
+      ILO = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IR)
+      IF(ILO.EQ.0)THEN
+        WRITE(KBOUT,*)
+     +'Problem with this contig. Check logical consistency'
+        WRITE(KBOUT,*)'of database. Break not made'
+        RETURN
+      END IF
+      NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(NCONTO.EQ.0)THEN
+        WRITE(KBOUT,*)'No contig line for this contig. Check logical'
+        WRITE(KBOUT,*)'consistency of database. Break not made'
+        RETURN
+      END IF
+      LCONTO = RELPG(NCONTO)
+      IF(LCONTO.LT.1)THEN
+        WRITE(KBOUT,*)'Contig has zero length. Break not made'
+        RETURN
+      END IF
+      CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+      END
+      SUBROUTINE CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN
+      EXTERNAL CLEN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTS = NCONTS + 1
+C  WRITE LAST LINE OF DB
+      WRITE(KBOUT,*)'Increasing number of contigs by 1'
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C  MAKE NEW CONTIG A COPY OF OLD
+      RELPG(NCONTR) = RELPG(NCONTO)
+      LNGTHG(NCONTR) = LNGTHG(NCONTO)
+      LNBR(NCONTR) = IR
+      RNBR(NCONTR) = RNBR(NCONTO)
+      WRITE(KBOUT,*)'Writing new right contig line'
+      CALL WRITER(IDEVR,NCONTR,RELPG(NCONTR),LNGTHG(NCONTR),
+     +LNBR(NCONTR),RNBR(NCONTR))
+C  NEED LENGTH FOR OLD LEFT CONTIG
+      RNBR(IL) = 0
+C      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +IDBSIZ,IL)
+C  Change 24/6/93 jkb
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(L.LT.1)THEN
+        WRITE(KBOUT,*)
+     +  'New left contig has zero length. Break not made'
+        RETURN
+      END IF
+      RELPG(NCONTO) = L
+      RNBR(NCONTO) = IL
+C  DO CONTIG LINE FOR NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new left contig line'
+      WRITE(KBOUT,*)'New length=',RELPG(NCONTO)
+      WRITE(KBOUT,*)'New right gel=',RNBR(NCONTO)
+      CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +LNBR(NCONTO),RNBR(NCONTO))
+C  DO GEL LINE FOR RIGHT GEL OF NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new right gel of left contig'
+      WRITE(KBOUT,*)'Gel number=',IL
+      CALL WRITER(IDEVR,IL,RELPG(IL),LNGTHG(IL),
+     +LNBR(IL),RNBR(IL))
+C  DO GEL LINE FOR NEW RIGHT CONTIG
+      LNBR(IR) = 0
+      WRITE(KBOUT,*)'Writing new left gel of right contig'
+      WRITE(KBOUT,*)'Gel number=',IR
+      CALL WRITER(IDEVR,IR,RELPG(IR),LNGTHG(IR),
+     +LNBR(IR),RNBR(IR))
+C  NOW SHIFT
+      I = 1 - RELPG(IR)
+      WRITE(KBOUT,*)'Shifting gels in right contig by distance=',I
+      CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IR,NCONTR,I)
+      WRITE(KBOUT,*)'Right contig length=',RELPG(NCONTR)
+      WRITE(KBOUT,*)'Break completed'
+      IOK = 0
+      END
+C     BUBBL3
+C   SUBROUTINE TO SORT INTEGER ARRAY (LIST) INTO ASCENDING  ORDER
+C
+      SUBROUTINE BUBBL3(LIST,LISTEL,LISTAL,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LISTEL(IDIM),LISTAL(IDIM)
+C
+C   SET POINTERS TO ZERO
+      I=0
+      J=0
+C
+10    CONTINUE
+C
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+C
+C   INCREMENT POINTER TO NEXT ELEMENT
+      I=I+1
+C   TEST FOR END OF ARRAY
+      IF(I.EQ.IDIM)RETURN
+C
+20    CONTINUE
+C
+C   COMPARE ADJACENT ELEMENTS
+      IF(LIST(I).GE.LIST(I+1))GO TO 10
+C
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+C
+C   EXCHANGE ADJACENT ELEMENTS
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+C
+      ITEMP=LISTEL(I)
+      LISTEL(I)=LISTEL(I+1)
+      LISTEL(I+1)=ITEMP
+      ITEMP=LISTAL(I)
+      LISTAL(I)=LISTAL(I+1)
+      LISTAL(I+1)=ITEMP
+C
+C
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+C
+      GO TO 20
+      END
+      SUBROUTINE CCTA(SEQ,ID)
+      CHARACTER SEQ(ID),COM,AS
+      SAVE COM,AS
+      DATA COM/','/,AS/'*'/
+      DO 10 I = 1,ID
+        IF(SEQ(I).EQ.COM) SEQ(I) = AS
+10    CONTINUE
+      END
+C
+C     CFGEL  new version 15-4-92
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C go thru the gel reading looking at words to see if they exist in the consensus
+      IDSAVM=IDSAV
+      IDSAV=0
+C length of a pair of words is LX2
+      LX2=2*LENGTH
+C number of pairs of words in gel reading is NW
+      NW=IDIMG-(LX2-1)
+C loop for each words start point
+      DO 200 I=1,NW
+C       IS THIS WORD A ZERO?
+        W1=GELN(I)
+        IF(W1.EQ.0)GO TO 200
+C       POINT TO NEXT WORD OF PAIR
+        W2=GELN(I+LENGTH)
+        IF(W2.EQ.0)GO TO 200
+C       DOES W1 EXIST IN SEQ?
+        N1S1=WORDN(W1)
+        IF(N1S1.EQ.0)GO TO 200
+        N2S1=WORDN(W2)
+        IF(N2S1.EQ.0)GO TO 200
+C       BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+        PS1W1=WORDP(W1)+LENGTH
+C       LOOP FOR ALL PAIRS
+C there are N1S1 occurrences of word 1 and N2S1 of word 2 in consensus
+C compare the positions of all pairs to see if they are LENGTH apart
+        DO 50 J=1,N1S1
+C         POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+          PS1W2=WORDP(W2)
+C         LOOP FOR THESE
+          DO 40 K=1,N2S1
+C           ARE THIS PAIR LENGTH APART?
+            N=PS1W1-PS1W2
+            IF(N.NE.0)GO TO 20
+C           THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+            LMAT=LX2
+C
+C new code
+C
+            IPC = PS1W2 + LENGTH - 1
+            IPG = I + LX2 - 1
+ 16         CONTINUE
+            IF (LMAT.LT.MINMAT) THEN
+              IPC = IPC + 1
+              IPG = IPG + 1
+              IF(IPG.GT.IDIMG)GO TO 20
+              IF(IPC.GT.IDIM)GO TO 20
+              IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 20
+              LMAT=LMAT+1
+              GO TO 16
+            END IF
+C
+C match found, is it an extension of a previous one ?
+C
+C            WRITE(*,*)I,PS1W1-LENGTH
+            IF (IDSAV.GT.0) THEN
+              IF (I-SAVPG(IDSAV).EQ.PS1W1-LENGTH-SAVPS(IDSAV)) GO TO 20
+            END IF
+            IDSAV = IDSAV + 1
+            IF (IDSAV.GT.IDSAVM) THEN
+              WRITE(KBOUT,1000)IDSAVM
+1000          FORMAT(' More than ',I6,' matches. Search aborted')
+              IFAIL = 1
+              RETURN
+            END IF
+C            WRITE(*,*)IDSAV
+            SAVPG(IDSAV) = I
+            SAVPS(IDSAV) = PS1W1 - LENGTH
+ 20         CONTINUE
+C           POINT TO NEXT W2
+            PS1W2=POSNS(PS1W2)
+40        CONTINUE
+C         ALL TRIED THIS PS1W1, TRY NEXT
+          PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50      CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+C
+C     CFGEL old version (before 15-4-92)
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGELO(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDSAVM=IDSAV
+      IDSAV=0
+C   LENGTH OF PAIR OF WORDS
+      LX2=2*LENGTH
+C   NUMBER OF PAIRS OF WORDS OF LENGTH LENGTH IN GEL
+      NW=IDIMG-(LX2-1)
+C   LOOP FOR EACH START POINT
+      DO 200 I=1,NW
+C   IS THIS WORD A ZERO?
+      W1=GELN(I)
+      IF(W1.EQ.0)GO TO 200
+C   POINT TO NEXT WORD OF PAIR
+      W2=GELN(I+LENGTH)
+      IF(W2.EQ.0)GO TO 200
+C   DOES W1 EXIST IN SEQ?
+      N1S1=WORDN(W1)
+      IF(N1S1.EQ.0)GO TO 200
+      N2S1=WORDN(W2)
+      IF(N2S1.EQ.0)GO TO 200
+C   BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+      PS1W1=WORDP(W1)+LENGTH
+C   LOOP FOR ALL PAIRS
+      DO 50 J=1,N1S1
+C   POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+      PS1W2=WORDP(W2)
+C   LOOP FOR THESE
+      DO 40 K=1,N2S1
+C   ARE THIS PAIR LENGTH APART?
+      N=PS1W1-PS1W2
+      IF(N.NE.0)GO TO 20
+C   THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+      LMAT=LX2
+      IF(MINMAT.EQ.LX2)GO TO 15
+      IPC=PS1W2+LENGTH
+      IPG=I+LX2
+16    CONTINUE
+      IF(IPG.GT.IDIMG)GO TO 15
+      IF(IPC.GT.IDIM)GO TO 15
+C
+      IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 15
+      LMAT=LMAT+1
+      IPC=IPC+1
+      IPG=IPG+1
+      GO TO 16
+15    CONTINUE
+C  IS MATCH LONG ENOUGH?
+      IF(LMAT.LT.MINMAT)GO TO 20
+      IDSAV=IDSAV+1
+      IF(IDSAV.LE.IDSAVM)GO TO 18
+      WRITE(KBOUT,1000)IDSAVM
+1000  FORMAT(' More than ',I6,' matches. Search aborted')
+      IFAIL=1
+      RETURN
+18    CONTINUE
+      SAVL(IDSAV)=LMAT
+      SAVPG(IDSAV)=I
+      SAVPS(IDSAV)=PS1W1-LENGTH
+20    CONTINUE
+C   POINT TO NEXT W2
+      PS1W2=POSNS(PS1W2)
+40    CONTINUE
+C   ALL TRIED THIS PS1W1, TRY NEXT
+      PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50    CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+      INTEGER FUNCTION CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINL = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = LNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINL = J
+      END
+C
+C      CHANGE
+C
+C   ROUTINE TO EXCHANGE ALL THE CHARS IN A CHARACTER ARRAY USING
+C   A PAIR OF LOOKUP ARRAYS SENT BY CALLING PROG
+C
+C
+      SUBROUTINE CHANGE(SEQ,IDIM1,CHAR1,CHAR2,IDIM2,ELSE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      CHARACTER CHAR1(IDIM2),CHAR2(IDIM2)
+      CHARACTER ELSE
+      DO 100 I=1,IDIM1
+C
+      DO 50 J=1,IDIM2
+C
+      IF(SEQ(I).NE.CHAR1(J))GO TO 50
+C   MATCH SO EXCHANGE CHARS
+      SEQ(I)=CHAR2(J)
+      GO TO 100
+50    CONTINUE
+      SEQ(I)=ELSE
+100   CONTINUE
+C
+      RETURN
+      END
+      CHARACTER*1 FUNCTION CHARSL(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'ctag*-'/
+      CHARSL = C(I:I)
+      END
+      CHARACTER*1 FUNCTION CHARSU(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'CTAG*-'/
+      CHARSU = C(I:I)
+      END
+      INTEGER FUNCTION CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      CLEN= 0
+      LEN = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        LEN = MAX(LEN,(RELPG(I) + ABS(LNGTHG(I)) - 1))
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CLEN = LEN
+      END
+      SUBROUTINE CLIST(GELNO1,LINNO1,IGEL1,GELNO2,LINNO2,
+     +IGEL2,GELNOS,GELSTR,GELEND,IUNIQ1,IUNIQ,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER GELNO1(IGEL1),GELNO2(IGEL2),GELNOS(IUNIQ1)
+      INTEGER GELSTR(IUNIQ1),GELEND(IUNIQ1)
+      INTEGER LINNO1(IGEL1),LINNO2(IGEL2)
+      EXTERNAL INLIST
+C   GELNOS === GEL NUMBERS (GELNOS)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   GELNO  === GEL NUMBERS PER STRIP
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C
+C     WHICH GELS IN GELNO2 DO NOT APPEAR IN GELNO1
+C     IE HAVE STARTED IN GELNO2
+        DO 20 I=1,IGEL2
+        MATCH=INLIST(GELNO1,IGEL1,GELNO2(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH SO NEW
+C         PUT IN GELSTR
+          IUNIQ=IUNIQ+1
+          GELNOS(IUNIQ)=GELNO2(I)
+          GELSTR(IUNIQ)=LINNO2(I)
+        END IF
+20    CONTINUE
+C     WHICH GELS IN GELNO1 DO NOT APPEAR IN GELNO2
+C     IE WHICH HAVE ENDED IN GELNO1
+      DO 10 I=1,IGEL1
+        MATCH=INLIST(GELNO2,IGEL2,GELNO1(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH  SO MUST HAVE ENDED
+C         WHERE IS IT STORED IN GELNOS?
+          MATCH=INLIST(GELNOS,IUNIQ,GELNO1(I))
+          IF(MATCH.NE.0)THEN
+            GELEND(MATCH)=LINNO1(I)
+            GO TO 10
+          END IF
+C         ERROR
+          WRITE(KBOUT,1000)GELNO1(I)
+1000      FORMAT( ' Error: gel number ',I5,
+     +    ' expected but not found in list')
+          IOK = 1
+          RETURN
+        END IF
+10    CONTINUE
+      IOK = 0
+      RETURN
+      END
+C
+C      CMPLMT
+C
+C   SUBROUTINE TO REVERSE AND COMPLEMENT GELS AND DATA BASE
+C   THE POSITIONS OF THE RIGHT ENDS OF GELS ARE FIRST STORED
+C   IN RELPG THEN WE DO A BUBBLE SORT ON THESE POSITIONS
+C   UPDATING RELATIONSHIPS AS WE GO
+C   ALSO SEQUENCES ARE COMPLEMENTED, SIGNS OF LENGTH ARE
+C   MULTIPLIED BY -1 AND THE CONTIG LINE IS ALTERED
+      SUBROUTINE CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,GEL,IDBSIZ,KBOUT,IDEVR,IDEVW,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER X
+C
+      WRITE(KBOUT,1000)LLINO
+1000  FORMAT( ' Complementing contig',I6)
+C   CHAIN THRU AND PUT RIGHT ENDS IN RELPG
+      N=LLINO
+10    CONTINUE
+      RELPG(N)=RELPG(N)+(ABS(LNGTHG(N)))-1
+      IF(RNBR(N).EQ.0)GO TO 20
+      N=RNBR(N)
+      GO TO 10
+20    CONTINUE
+C
+C   NOW EFFECTIVELY BUBBLE SORT ON RELPG
+      N=RNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+      N=NL
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+      NL=LNBR(N)
+      IF(NL.EQ.0)GO TO 30
+      I1=0
+23    CONTINUE
+      IF(RELPG(N).GE.RELPG(NL))GO TO 21
+C   NOT IN CORRECT ORDER SO CHAIN ALONG UNTIL CORRECT,THEN COME
+C   BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE THIS LINE SET POINTER TO CURRENT POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C
+C   EXCHANGE NEIGHBOURS. CURRENTLY LOOKING AT N AND ITS LEFT
+C   NBR, AND THE LEFT NBR IS FURTHER RIGHT THAN N
+C   FIX UP POINTERS TO LEFT AND RIGHT OF THESE TWO
+      M=LNBR(NL)
+      IF(M.NE.0)RNBR(M)=N
+      M=RNBR(N)
+      IF(M.NE.0)LNBR(M)=NL
+      LNBR(N)=LNBR(NL)
+      LNBR(NL)=N
+      RNBR(NL)=RNBR(N)
+      RNBR(N)=NL
+C   CHAIN BACK THRU LIST WITH THIS LINE
+      N=RNBR(NL)
+      IF(N.EQ.0)GO TO 21
+C   IE END MET
+      GO TO 23
+30    CONTINUE
+C   FINISH WITH LEFT END IN N
+40    CONTINUE
+C   NOW REVERSE NBRS SO CHAIN BACK RIGHT
+      NL=RNBR(N)
+      IF(NL.EQ.0)GO TO 50
+      RNBR(N)=LNBR(N)
+      LNBR(N)=NL
+      N=NL
+      GO TO 40
+50    CONTINUE
+C   NEED TO FIX UP NEW LEFT END
+      RNBR(N)=LNBR(N)
+      LNBR(N)=0
+C   ALL POINTERS FIXED NOW DO RELATIVE POSITION
+C   FINISH WITH LEFT END IN N
+C   SO CHAIN BACK RIGHT
+C   SAVE RIGHT LINE NUMBER
+      NL=N
+      X=RELPG(N)
+60    CONTINUE
+      RELPG(N)=1+(-1*(RELPG(N)-X))
+      IF(RNBR(N).EQ.0)GO TO 70
+      N=RNBR(N)
+      GO TO 60
+70    CONTINUE
+C   NOW FIX CONTIG LINE
+      LNBR(LINCON)=NL
+      RNBR(LINCON)=N
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+C   NOW REVERSE AND COMPLEMENT GELS
+      N=NL
+80    CONTINUE
+C      READ(IDEVW,REC=N)GEL
+      CALL READW(IDEVW,N,GEL,MAXGEL)
+      M=ABS(LNGTHG(N))
+      CALL SQREV(GEL,M)
+      CALL SQCOM(GEL,M)
+      CALL WRITEW(IDEVW,N,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=N)GEL
+C   CHANGE SIGNS
+      LNGTHG(N)=-1*LNGTHG(N)
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   ANY MORE?
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 80
+C   NO MORE
+      RETURN
+      END
+C      CONSEN
+C   CALCULATES A CONSENSUS USING THE RULES OUTLINED IN THE DOCUMENTATION
+C   AND SUBROUTINE SUMMER
+C   UNIT IDEV IS USED FOR OUTPUT
+      SUBROUTINE CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +KBIN,KBOUT,IDEVW,IDEV,NAMCON,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,IDM,PERCD,IDEVN,LLINO)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CHARACTER NAMCON*(*)
+100   CONTINUE
+      ISTART=1
+      NAMCON = ' '
+      CALL OPENF1(IDEV,NAMCON,1,IOK,KBIN,KBOUT,
+     +'Name for consensus file',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(ANS,'Make consensus for whole database',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      CALL BUSY(KBOUT)
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded,',/,
+     +    ' calculation aborted')
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+Y
+110   CONTINUE
+      ISTART=ISTART-1
+      CALL YESNO(ANS,'Staden format',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.0) RETURN
+      IF(ANS.EQ.0) THEN
+        CALL FMTDK(IDEV,SEQ1,ISTART)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,ISTART,NAMPRO,KBOUT,IOK)
+      END IF
+      RETURN
+150   CONTINUE
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)GO TO 400
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)GO TO 400
+      IDIM2=RREG-LREG+1
+      IF((ISTART+19+IDIM2).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         RETURN
+      END IF
+      CALL BUSY(KBOUT)
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1(ISTART),IDIM2,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM2
+300   CONTINUE
+      CALL YESNO(ANS,'Select another contig',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0) GO TO 150
+      ISTART=ISTART-1
+      CALL YESNO(ANS,'Staden format',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.0) RETURN
+      IF(ANS.EQ.0) THEN
+        CALL FMTDK(IDEV,SEQ1,ISTART)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,ISTART,NAMPRO,KBOUT,IOK)
+      END IF
+400   CONTINUE
+      CALL YESNO(ANS,'Make another consensus',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0)GO TO 100
+      END
+      SUBROUTINE WRITCF(IDEV,SEQ,IDSEQ,NAMPRO,KBOUT,IOK)
+      CHARACTER SEQ(IDSEQ),TITLE*10,ENAME*10,NAMPRO*(*)
+      CHARACTER NL
+      PARAMETER (MAXDG = 5)
+      EXTERNAL INDEXA
+C
+C write out in fasta format
+C also change -,* to N
+C currently name is project name for single contig, but left gel number
+C for multiple contigs, title is always left gel number
+C
+      NC = 0
+      DO 1 I=1,IDSEQ
+        IF (SEQ(I).EQ.'>') NC = NC + 1
+ 1    CONTINUE
+      NL = CHAR(10)
+      ENAME = ' '
+      I = INDEX(NAMPRO,'.')
+      ENAME(1:I-1) = NAMPRO(1:I-1)
+      IAT = 1
+ 10   CONTINUE
+      IF (IAT.GT.IDSEQ) THEN
+        CLOSE(UNIT=IDEV)
+        IOK =0
+        RETURN
+      END IF
+      IF(SEQ(IAT).NE.'<') THEN
+        CALL ERROM(KBOUT,'Missing header in consensus')
+        IOK = 1
+        RETURN
+      END IF
+      INAMES = INDEXA(SEQ(IAT),20,'.')
+      IF (INAMES.EQ.0) THEN
+        CALL ERROM(KBOUT,'Missing dot in header')
+        IOK = 1
+        RETURN
+      END IF
+      INAMES = IAT + INAMES
+      INAMEE = INAMES + MAXDG - 1
+      TITLE = ' '
+      K = 0
+      DO 20 I=INAMES,INAMEE
+        K = K + 1
+        TITLE(K:K) = SEQ(I)
+ 20     CONTINUE
+      IF (NC.GT.1) ENAME = TITLE
+      IAT = IAT + 20
+      IDT = IDSEQ-IAT+2
+      IDSQ = INDEXA(SEQ(IAT),IDT,'<')
+      IF (IDSQ.EQ.0) IDSQ = IDT
+      IDSQ = IDSQ - 1
+      CALL SETCCS(SEQ(IAT),IDSQ)
+      CALL WRITFF(IDEV,SEQ(IAT),IDSQ,ENAME,TITLE)
+      IAT = IAT + IDSQ
+      GO TO 10
+      END
+      SUBROUTINE SETCCS(SEQ,IDSEQ)
+      CHARACTER SEQ(IDSEQ),TO(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE TO
+      DATA TO/'t','c','a','g','n'/
+C
+C change chars in array seq of type found to type to
+C
+      DO 10 I=1,IDSEQ
+        K = CTONUM(SEQ(I))
+        SEQ(I) = TO(K)
+ 10     CONTINUE
+      END
+      SUBROUTINE COPYM(JLEFTS,ILEFTS,JLC,ILC,
+     +JPOSC,IPOSC,JSENSE,ISENSE,JLLINO,LLINO,
+     +JJOINT,JOINT,JTOTPC,ITOTPC,JTOTPG,ITOTPG,
+     +JTYPE,ITYPE,JDOUT,IDOUT,JDIM22,IDIM22,
+     +SEQG3,SEQG2,SEQC3,SEQC2,PERMS,PERMIS)
+      CHARACTER SEQG3(JDIM22),SEQG2(JDIM22),SEQC3(JDOUT),SEQC2(JDOUT)
+      ILEFTS = JLEFTS
+      ILC = JLC
+      IPOSC = JPOSC
+      ISENSE = JSENSE
+      LLINO = JLLINO
+      JOINT = JJOINT
+      ITOTPC = JTOTPC
+      ITOTPG = JTOTPG
+      ITYPE = JTYPE
+      IDOUT = JDOUT
+      IDIM22 = JDIM22
+      CALL SQCOPY(SEQG3,SEQG2,JDIM22)
+      CALL SQCOPY(SEQC3,SEQC2,JDOUT)
+      PERMIS = PERMS
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL,LO,PERMAX,ISHOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IFAIL = 1
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        CALL ERROM(KBOUT,'DALIGN: matching region too long')
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF (X.GT.PERMAX) RETURN
+      IF (ISHOW.EQ.1) THEN
+        WRITE(KBOUT,1002)
+1002    FORMAT(' Best alignment found')
+        CALL SQMTCH(SEQC2(1),SEQG2(IENDG),SEQ3,LO)
+        CALL FMT4LN(SEQC2(1),SEQG2(IENDG),SEQ3,LO,IENDC,IENDG,KBOUT)
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP,IERR,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP(IDBSIZ)
+      EXTERNAL LCCHEK,NCCHEK
+C to check the logical consistency of a database
+C
+C 1. are all nbrs holding hands
+C 2. are all gels in exactly 1 contig
+C 3. are there loops in contigs
+C 4. do the gels designated left or right ends have outward neighbours
+C 5. are the relative positions in same order as hand holding
+C 6. are there gels of zero length
+C 7. are there contigs of length < 1
+C 8. does the designated length of the contigs agree with the gel positions
+C 9. if i chain left thru a contig do i reach the gel designated as the left end
+C10.if i chain right thru a contig do i reach the gel designated as the right end
+C
+C return error code 2 for all errors except where only error is "gel not used"
+C for which we return 1
+      IERR=0
+C   hand holding OK?
+      DO 100 I=1,NGELS
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 50
+        IF(RNBR(K).EQ.I)GO TO 50
+        WRITE(KBOUT,1000)I
+1000    FORMAT(' Hand holding problem for gel reading',I6)
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+1001    FORMAT(' ',5I6)
+        IERR=2
+50      CONTINUE
+        K=RNBR(I)
+        IF(K.EQ.0)GO TO 100
+        IF(LNBR(K).EQ.I)GO TO 100
+        WRITE(KBOUT,1000)I
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+        IERR=2
+100   CONTINUE
+C
+C   are all gels in exactly 1 contig
+C
+      CALL FILLI(TEMP,IDBSIZ,0)
+      N=IDBSIZ-NCONTS
+C
+C count the number of times thru loop 320 to 300: if this exceeds the
+C database size a loop has been found
+C
+      ICOUNT=0
+      DO 300 I=N,IDBSIZ-1
+        II=I
+        K=LNBR(I)
+        J=RNBR(I)
+        IF((K.NE.0).AND.(J.NE.0))GO TO 310
+C
+C This contig points to zero gel number as a left or right end
+C
+        WRITE(KBOUT,1002)I
+1002    FORMAT(' Contig',I4,' has gel numbers of zero')
+        IERR=2
+        GO TO 290
+310     CONTINUE
+        IF((LNBR(K).EQ.0).AND.(RNBR(J).EQ.0))GO TO 290
+C
+C These ends reads have outward neighbours
+C
+        WRITE(KBOUT,1004)I
+1004  FORMAT(' The end gels of contig',I4,' have outward neighbours')
+        IERR=2
+290     CONTINUE
+C
+C Does the contig have nonzero length?
+C
+        IF(RELPG(I).GT.0)GO TO 320
+        WRITE(KBOUT,1010)I
+1010    FORMAT(' The contig on line number',I4,' has zero length')
+        IERR=2
+320     CONTINUE
+        TEMP(K)=TEMP(K)+1
+        ICOUNT=ICOUNT+1
+        IF(ICOUNT.GT.IDBSIZ)GO TO 601
+        K=RNBR(K)
+        IF(K.NE.0)GO TO 320
+300   CONTINUE
+      DO 400 I=1,NGELS
+        IF(TEMP(I).EQ.1)GO TO 390
+        IF(TEMP(I).EQ.0)GO TO 410
+        WRITE(KBOUT,1005)I,TEMP(I)
+1005    FORMAT(' Gel number ',I6,' is used ',I6,' times')
+        IERR=2
+        GO TO 400
+390     CONTINUE
+C
+C does the gel have nonzero length (only check those used once)
+C
+        IF(LNGTHG(I).NE.0)GO TO 400
+        WRITE(KBOUT,1011)I
+1011    FORMAT(' Gel number',I6,' has zero length')
+        IERR=2
+        GO TO 400
+410     CONTINUE
+        WRITE(KBOUT,1006)I
+1006    FORMAT(' Gel number ',I6,' is not used')
+C
+C need to increase the error count (dont reset to lower value)
+C
+        IF(IERR.LT.2)IERR=1
+400   CONTINUE
+C
+C all relative positions ok?
+C
+      N=IDBSIZ-NCONTS
+      DO 500 I=N,IDBSIZ-1
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 500
+510     CONTINUE
+        J=RNBR(K)
+        IF(J.EQ.0)GO TO 500
+        IF(RELPG(K).GT.RELPG(J))GO TO 520
+        K=J
+        GO TO 510
+520     CONTINUE
+        WRITE(KBOUT,1007)K,RELPG(K),J,RELPG(J)
+1007    FORMAT(' Gel number',I6,' with position',I6,
+     +  ' is the left neighbour of',
+     +  /,' gel number',I6,' with position',I6)
+        K=J
+        IERR=2
+        GO TO 510
+500   CONTINUE
+      IOK = LCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,KBOUT)
+      IF (IOK.NE.0) IERR = 2
+      IOK = NCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,KBOUT)
+      IF (IOK.NE.0) IERR = 2
+      IF(IERR.EQ.0) WRITE(KBOUT,1013)
+1013  FORMAT(' Database is logically consistent')
+      RETURN
+601   CONTINUE
+      IERR=2
+      WRITE(KBOUT,1008)II
+1008  FORMAT(' Loop in contig',I6,/,
+     +' No further checking done but gel numbers follow')
+      CALL FILLI(TEMP,IDBSIZ,0)
+      K=LNBR(II)
+710   CONTINUE
+      TEMP(K)=TEMP(K)+1
+      WRITE(KBOUT,1009)K
+1009  FORMAT(' ',I6)
+      IF(TEMP(K).GT.1)RETURN
+      K=RNBR(K)
+      GO TO 710
+      END
+      INTEGER FUNCTION LCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CLEN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      EXTERNAL CLEN
+      LCCHEK = 0
+      DO 10 I=IDBSIZ-NCONTS,IDBSIZ-1
+        IL = LNBR(I)
+        L1 = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = RELPG(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1000)I,L2,L1
+ 1000     FORMAT(
+     +' Contig line',I6,' records length',I6,' but actual length is',I6)
+        LCCHEK = LCCHEK + 1
+        END IF
+ 10   CONTINUE
+      END
+      INTEGER FUNCTION NCCHEK(RELPG,LNGTHG,LNBR,RNBR,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHAINL,CHAINR
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      EXTERNAL CHAINL,CHAINR
+      NCCHEK = 0
+      DO 10 I=IDBSIZ-NCONTS,IDBSIZ-1
+        IL = LNBR(I)
+        L1 = CHAINR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = RNBR(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1000)I,L2,L1
+ 1000     FORMAT(
+     +' Contig line',I6,' records right neighbour as',I6,
+     +' but left to right chaining gives',I6)
+        NCCHEK = NCCHEK + 1
+        END IF
+        IL = RNBR(I)
+        L1 = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,IL)
+        L2 = LNBR(I)
+        IF (L1.NE.L2) THEN
+          WRITE(KBOUT,1001)I,L2,L1
+ 1001     FORMAT(
+     +' Contig line',I6,' records left neighbour as',I6,
+     +' but right to left chaining gives',I6)
+        NCCHEK = NCCHEK + 1
+        END IF
+ 10   CONTINUE
+      END
+      INTEGER FUNCTION CHAINR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG RIGHT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINR = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINR = J
+      END
+C   DBPRNT
+C   PRINTS A DATABASE. IE ITS RELATIONSHIPS
+      SUBROUTINE DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +IDEV,KBIN,KBOUT,IDEVN,LLINO,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LREG,RREG,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMARC*16
+      IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +NCONTS,IDBSIZ,IDEV)
+      WRITE(IDEV,10011)NGELS,NCONTS
+10011 FORMAT(' Number of gel readings ',I6,' Number of contigs ',I6)
+20    CONTINUE
+      CALL YESNO(ANS,'Select contigs',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0) GO TO 45
+      N=IDBSIZ-NCONTS
+25    CONTINUE
+      CALL YESNO(ANS,'Show gel readings in positional order',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)GO TO 41
+      WRITE(IDEV,1009)
+1009  FORMAT(' CONTIG LINES')
+      WRITE(IDEV,1000)
+1000  FORMAT(
+     +' CONTIG              LINE  LENGTH               ENDS'/
+     +'                                             LEFT   RIGHT')
+      DO 30 I=N,IDBSIZ-1
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+30    CONTINUE
+1007  FORMAT( ' ',18X,I6,2X,I7,9X,I6,2X,I6)
+      WRITE(IDEV,1008)
+1008  FORMAT(' GEL LINES')
+      WRITE(IDEV,1001)
+ 1001 FORMAT(
+     +' NAME              NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                             LEFT   RIGHT')
+      DO 40 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        WRITE(IDEV,1006)NAMARC,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1006    FORMAT( ' ',A,2X,I6,2X,I7,2X,I5,2X,I6,2X,I6)
+40    CONTINUE
+      RETURN
+C
+41    CONTINUE
+C
+C   SORTED DATA
+      DO 43 I=N,IDBSIZ-1
+        WRITE(IDEV,1021)
+1021    FORMAT( )
+        WRITE(IDEV,1000)
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+        J=LNBR(I)
+        WRITE(IDEV,1001)
+42      CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV,1006)NAMARC,J,RELPG(J),LNGTHG(J),LNBR(J),RNBR(J)
+        J=RNBR(J)
+        IF(J.NE.0)GO TO 42
+43    CONTINUE
+      RETURN
+45    CONTINUE
+C   SELECTED CONTIGS ONLY
+C
+C   GET GEL NUMBER AND CONTIG NUMBER
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1009)
+      WRITE(IDEV,1000)
+      WRITE(IDEV,1007)LINCON,RELPG(LINCON),LNBR(LINCON),RNBR(LINCON)
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1008)
+      N=LLINO
+      WRITE(IDEV,1001)
+46    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).EQ.0)GO TO 48
+      N=RNBR(N)
+      IF(RELPG(N).GT.RREG)GO TO 48
+      GO TO 46
+48    CONTINUE
+      GO TO 45
+      END
+      SUBROUTINE DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,KBIN,KBOUT,IDEVW,
+     +IDEV,LINLEN,PERCD,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,LINOU1,LINOU2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEVN,
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CHARACTER FILEH*(*)
+      PARAMETER (MAXPRM = 10)
+      CHARACTER PROMPT(2)*(MAXPRM)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0) RETURN
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0) RETURN
+      IDIM2=RREG-LREG+1
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL DBSCSM(SEQ1(LREG),IDIM2,KBOUT)
+160   CONTINUE
+      ANS = 1
+      PROMPT(1) = 'List codes'
+      PROMPT(2) = 'Plot codes'
+      CALL RADION('Select results display mode',PROMPT,2,ANS,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.1) RETURN
+      IF(ANS.EQ.1) THEN
+        CALL FMTDB(SEQ1,IDIM1,LREG,RREG,LINLEN,IDEV)
+        RETURN
+      ELSE
+        CALL PLTQ(SEQ1(LREG),IDIM2,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      END
+      SUBROUTINE DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,LLINO,
+     +PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      IDIM2=RREG-LREG+1
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL PLTQ(SEQ1(LREG),IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE DBSCSM(SEQ1,IDIM1,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER CODES(5)
+      REAL X(5)
+      SAVE CODES
+      DATA CODES/'0','1','2','3','4'/
+      DO 50 J=1,5
+        X(J)=0.
+50    CONTINUE
+      DO 100 I=1,IDIM1
+        DO 60 J=1,5
+          IF(SEQ1(I).NE.CODES(J))GO TO 60
+          X(J)=X(J)+1.
+          GO TO 61
+60      CONTINUE
+61      CONTINUE
+100   CONTINUE
+      SUM=0.
+      DO 130 J=1,5
+        SUM=SUM+X(J)
+130   CONTINUE
+      DO 140 J=1,5
+        IF(SUM.NE.0)X(J)=X(J)*100./SUM
+140   CONTINUE
+      WRITE(KBOUT,1001)X(1)
+1001  FORMAT(' ',F6.2,'% OK on both strands and they agree(0)')
+      WRITE(KBOUT,1002)X(2)
+1002  FORMAT(' ',F6.2,'% OK on plus strand only(1)')
+      WRITE(KBOUT,1003)X(3)
+1003  FORMAT(' ',F6.2,'% OK on minus strand only(2)')
+      WRITE(KBOUT,1004)X(4)
+1004  FORMAT(' ',F6.2,'% Bad on both strands(3)')
+      WRITE(KBOUT,1005)X(5)
+1005  FORMAT(' ',F6.2,'% OK on both strands but they disagree(4)')
+      RETURN
+      END
+C     DBSTAT
+      SUBROUTINE DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      N=IDBSIZ-NCONTS
+      SUM=0.
+      DO 20 I=N,IDBSIZ-1
+        SUM=SUM+RELPG(I)
+20    CONTINUE
+      AV=SUM/NCONTS
+      WRITE(KBOUT,1020)SUM,AV
+1020  FORMAT( ' Total contig length ',F10.0,'  Average',
+     +' length ',F10.1)
+      SUMG=0.
+      DO 30 I=1,NGELS
+        SUMG=SUMG+FLOAT(ABS(LNGTHG(I)))
+30    CONTINUE
+      AV=SUMG/SUM
+      WRITE(KBOUT,1021)SUMG
+1021  FORMAT( ' Total characters in gel readings ',F10.0)
+      WRITE(KBOUT,1022)AV
+1022  FORMAT
+     +( ' Average gel characters per consensus character ',F10.2)
+99    CONTINUE
+      RETURN
+      END
+C     DELCON
+C
+C   DELETES CONTIG FROM CONSENSUS SEQUENCE
+      SUBROUTINE DELCON(SEQ1,ILEFT,ILC,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+C   FIRST CHAR TO REPLACE
+      I1=ILEFT-20
+C   FIRST CHAR TO MOVE
+      I2=ILEFT+ILC
+C   IS THIS RIGHTMOST CONTIG ANYWAY?
+      IF(I2.GT.IDIM1)GO TO 10
+C   NUMBER TO MOVE
+      ID=IDIM1-I2+1
+C   MOVE
+      CALL SQCOPY(SEQ1(I2),SEQ1(I1),ID)
+C   RESET LENGTH
+      IDIM1=I1+ID-1
+      RETURN
+10    CONTINUE
+C   RIGHTMOST CONTIG SO DONT MOVE
+      IDIM1=I1-1
+C
+      RETURN
+      END
+      SUBROUTINE DISMAT(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C   EDIT 07-02-83 TO CHECK FOR CASE WHEN GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL!  DONE BY HAVING A
+C   PARAMETER THAT STORES THE POSITION OF THE LEFT END OF THE
+C   NEXT CONTIG (IE THE ONE AFTER THE ONE THE CURRENT GEL OVERLAPS)
+C   SET IT TO A VERY LARGE VALUE INITIALLY
+      NEXTC=99999
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPARATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C      WRITE(KBOUT,1000)SAVPG(1),SAVPS(1)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     +SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C      WRITE(KBOUT,1000)SAVPG(I),SAVPS(I)
+C1000  FORMAT(' ',2I6)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     +SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       DISMAU
+C   ROUTINE TO DISPLAY MATCHES
+C   EDITED 17-12-81 TO NOT SUBTRACT 1 FROM LCL AND LGR
+      SUBROUTINE DISMAU(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC (SEE DISMAT)
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with vector number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   SAVE POSN OF END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      RETURN
+      END
+      SUBROUTINE DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINOO,LINCON,LREG,RREG,GEL2,I1,IDIM,NOPT,
+     +LLINOR,IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,
+     +MAXGEL,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER CHRSIZ
+      PARAMETER (CHRSIZ = 6)
+      PARAMETER (IDC1 = CHRSIZ*100)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,X,XLS2,XLS1,XRS2,XRS1,RREG2
+      CHARACTER MATCH(100)
+      INTEGER CHARS(CHRSIZ,100),CHARS1(IDC1)
+      CHARACTER NAMARC*16
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      INTEGER RELPOS(10),RELPO2(10)
+      INTEGER GELC
+      INTEGER RP
+      INTEGER LSEQNO,RSEQNO
+      CHARACTER LINOUT(100)
+      CHARACTER MUNOTP
+      CHARACTER GTCONC
+      EXTERNAL GTCONC
+      EQUIVALENCE (CHARS1,CHARS)
+      CALL FILLI(CHARS1,IDC1,0)
+C   SET CONTIG NUMBER
+      ICON=1
+      LLINO=LLINOO
+      NLEN=LINLEN/10
+      LSEQNO=LREG
+      X=LINLEN+LSEQNO-1
+      RSEQNO=MIN(RREG,X)
+C   SET LEFT GEL NUMBER FOR RIGHT CONTIG
+      LN2=LLINOR
+C   FIRST GEL NO IS LLINOO
+C   SET RREG FOR RIGHT CONTIG
+      RREG2=IDIM
+C   SET UP LSEQNO,RSEQNO FOR FOR NOPT=3
+      XLS2=I1
+      XRS2=RSEQNO-LSEQNO+XLS2
+9     CONTINUE
+C   IF RIGHT CONTIG SKIP NUMBER PRINTING
+      IF(ICON.EQ.2)GO TO 8
+C   NEED TO KEEP LONGEST LINE LENGTH FOR OUTPUT OF CONSENSUS
+      IE=0
+C   SETUP AND WRITE NUMBERS
+      RELPOS(1)=LSEQNO+9
+      DO 5 I=2,NLEN
+        RELPOS(I)=RELPOS(I-1)+10
+5     CONTINUE
+      WRITE(IDEV,1023)
+     +(RELPOS(K),K=1,MIN(NLEN,MAX(1,(RSEQNO-LSEQNO+1)/10)))
+1023  FORMAT( ' ',25X,10(I9,1X))
+C   SET CURRENT LINE NUMBER
+8     CONTINUE
+      GELC=LLINO
+10    CONTINUE
+C   IS LEFT END OF CURRENT GEL >RREG
+      IF(RELPG(GELC).GT.RSEQNO)GO TO 200
+C   ALSO NEED TO KNOW IF RIGHT END  ON THIS LINE (IF .LT. NO DATA
+C   TO DISPLAY)
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      IF(X.LT.LSEQNO)GO TO 190
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+      CALL FILLC(LINOUT,LINLEN,' ')
+      CALL READN(IDEVN,GELC,NAMARC)
+C
+C   NEED TO KNOW HOW MANY CHARS TO COPY OVER TO OUTPUT LINE
+C   AND WHERE IN LINE TO PUT THEM
+C   CURRENT LINE LEFT END IS LSEQNO,RIGHT END RSEQNO
+C   SO LEFT START CHAR IS
+      X=MAX(LSEQNO,RELPG(GELC))
+C   POSITION IN ARRAY LINE
+      LP=X-LSEQNO+1
+C   RIGHT END CHAR IS
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      X=MIN(RSEQNO,X)
+C   POSITION IN ARRAY LINE
+      RP=X-LSEQNO+1
+C   LOOK FOR LONGEST LINE
+      IF(RP.GT.IE)IE=RP
+C   NEED LEFT START IN GEL
+      K=LSEQNO-RELPG(GELC)+1
+      IF(K.LT.1)K=1
+      NCOP=RP-LP+1
+      IF(NCOP.GT.0)CALL SQCOPY(GEL(K),LINOUT(LP),NCOP)
+      N=LP+NCOP-1
+      II=K-1
+      IF(IDM.EQ.26)THEN
+        DO 50 I = LP,N
+          II = II + 1
+          CALL PCON1(GEL(II),CHARS(1,I))
+50      CONTINUE
+      ELSE
+        DO 70 I=LP,N
+          II=II+1
+          JJ = INDEXS(GEL(II),JSCORE)
+          CHARS(JJ,I) = CHARS(JJ,I) + JSCORE
+C          CHARS(CHRSIZ,I) = CHARS(CHRSIZ,I) + JSCORE
+70      CONTINUE
+      END IF
+      I=SIGN(GELC,LNGTHG(GELC))
+      WRITE(IDEV,1020)I,NAMARC,(LINOUT(K),K=1,RP)
+C1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+1020  FORMAT( ' ',I6,1X,A,1X,100A1)
+C
+190   CONTINUE
+C   NOW GET NEXT GEL TO RIGHT
+      GELC=RNBR(GELC)
+      IF(GELC.NE.0)GO TO 10
+200   CONTINUE
+C   CALC CONSENSUS AND WRITE IT
+      IF(IDM.EQ.26)THEN
+        DO 49 I = 1,LINLEN
+          LINOUT(I) = MUNOTP(CHARS(1,I))
+          CHARS(1,I) = 0
+49      CONTINUE
+      ELSE
+        DO 230 I=1,LINLEN
+          LINOUT(I) = GTCONC(CHARS(1,I),CHRSIZ,PERCD)
+          CALL FILLI(CHARS(1,I),CHRSIZ,0)
+230     CONTINUE
+      END IF
+      WRITE(IDEV,1019)(LINOUT(K),K=1,IE)
+C   IF REQUIRED WRITE COMPARISON GEL
+C   WHICH OPTION IN OPERATION?
+      IF(NOPT.EQ.2)GO TO 52
+      IF(NOPT.NE.3)GO TO 250
+53    CONTINUE
+C   ALREADY DONE THIS LINE CONTIG2?
+      IF(ICON.EQ.2)GO TO 54
+      ICON=2
+C   NEED TO SAVE CONSENSUS FROM LEFT CONTIG
+      CALL SQCOPY(LINOUT,GEL2,IE)
+C   SAVE VALUES FROM LEFT CONTIG
+      XLS1=LSEQNO
+      XRS1=RSEQNO
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN1=LLINO
+C   SET UP VALUES FOR RIGHT CONTIG
+      LSEQNO=XLS2
+      RSEQNO=XRS2
+C   SET LEFT GEL NUMBER
+      LLINO=LN2
+C   GET NEXT GEL
+      GO TO 150
+54    CONTINUE
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN2=LLINO
+C   SET VALUES FOR RIGHT CONTIG NEXT PASS
+      XLS2=XRS2+1
+      XRS2=XLS2+LINLEN-1
+      IF(XRS2.GT.RREG2)XRS2=RREG2
+C   SET UP VALUES FOR LEFT CONTIG
+      LLINO=LN1
+      ICON=1
+      LSEQNO=XLS1
+      RSEQNO=XRS1
+C   SET DECREMENT FOR POINTER TO GEL2
+      MMM=I1-1
+52    CONTINUE
+C1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+1017  FORMAT('        NEWGEL           ',100A1)
+1018  FORMAT('        MISMATCH         ',100A1)
+1019  FORMAT('        CONSENSUS        ',100A1)
+1022  FORMAT( ' ',26X,100A1)
+      I2=I1+LINLEN-1
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(NOPT.EQ.2)WRITE(IDEV,1017)(GEL2(K),K=I1,I2)
+C   SET DECREMENT
+      IF(NOPT.EQ.2)MMM=0
+55    CONTINUE
+      CALL FILLC(MATCH,LINLEN,'*')
+      K=0
+      DO 667 J=I1,I2
+        K=K+1
+        IF(GEL2(J-MMM).EQ.LINOUT(K))MATCH(K) = ' '
+667   CONTINUE
+      WRITE(IDEV,1018)(MATCH(K),K=1,IE)
+      RELPO2(1)=(I1)+9
+      DO 240 I=2,NLEN
+        RELPO2(I)=RELPO2(I-1)+10
+240   CONTINUE
+      WRITE(IDEV,1023)(RELPO2(K),K=1,NLEN)
+      I1=I2+1
+      I2=I2+LINLEN
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(I1.GT.I2)RETURN
+250   CONTINUE
+C
+      WRITE(IDEV,1021)
+1021  FORMAT( )
+C   NEXT LINE LENGTH
+C   NEXT LENGTH IS OLD RIGHT +1
+      LSEQNO=RSEQNO+1
+C   NEW RIGHT IS LEFT +LENGTH
+      RSEQNO=LSEQNO+(LINLEN)-1
+C   ARE WE OVER END OF REGION
+      IF(RSEQNO.GT.RREG)RSEQNO=RREG
+C   HAVE WE FINISHED REGION COMPLETELY
+      IF(RSEQNO.LT.LSEQNO) RETURN
+C   NOT FINISHED SO NEED TO FIND CURRENT LEFT GEL NO
+C   CURRENT LEFT GEL IS LLINO
+C
+150   CONTINUE
+C   NEED TO KNOW IF CURRENT LEFT GELS RIGHT END IS INSIDE REGION
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.LSEQNO)GO TO 9
+C   LOOK AT NEXT GEL TO RIGHT
+      LLINO=RNBR(LLINO)
+C   MAY HAVE GONE OVER END OF CONTIG?????
+      IF(LLINO.GT.0)GO TO 150
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE EC(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      PARAMETER (IDASH = 6)
+      K = 0
+      DO 10 I = 1,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.IDASH) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            GEL(I) = CHARSL(JC)
+            K = K + 1
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS CORRECTED=',K
+      END
+      SUBROUTINE ED(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = MIN(IDC,IDG),1,-1
+        JC = INDEXS(CON(I),J)
+        IF(JC.EQ.5) THEN
+          IF(I.LT.IDG) CALL SQCOPY(GEL(I+1),GEL(I),IDG-I)
+          K = K + 1
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS DELETED=',K
+      END
+      SUBROUTINE EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER CON(IDC)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C  CHANGE RELATIVE POSITIONS FOR AE
+      ND = 0
+      DO 10 I = IDC,1,-1
+        IF(CON(I).EQ.'*') THEN
+          ND = ND + 1
+          K = I + LREG - 1
+          J = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,K)
+          IF(J.NE.0) THEN
+            CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,IDUM,JDUM,IDEVR,
+     +      IDBSIZ,J,NCONT,-1)
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF DELETIONS=',ND
+      END
+      SUBROUTINE ET(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = 2,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.6) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            JNG = INDEXS(GEL(I-1),J)
+            JNC = INDEXS(CON(I-1),J)
+            IF(JNC.NE.JNG) THEN
+              IF((JNG.EQ.JC).AND.(JNC.EQ.JG)) THEN
+                GEL(I) = CHARSL(JNG)
+                GEL(I-1) = CHARSL(JG)
+                K = K + 1
+              END IF
+            END IF
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF CHARS TRANSPOSED=',K
+      END
+      SUBROUTINE FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND
+      STRAND = 1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHP)
+      IF(DEPTHP.LT.0) RETURN
+      STRAND = -1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHM)
+      IF(DEPTHM.LT.0) RETURN
+      CALL PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      END
+      SUBROUTINE FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTH,STRAND
+      EXTERNAL NCDEP
+C  LREG = left contig position
+C  RREG = right ''      ''
+C  LENCON = RREG-LREG+1
+      I = LGEL
+      DEPTH = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      ELSE
+        DEPTH = -1
+        RETURN
+      END IF
+C      WRITE(*,*)'LGEL',LGEL
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            K = RELPG(I) + ABS(LNGTHG(I)) -1
+            DEPTH = MAX(NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,
+     +      STRAND,K),DEPTH)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+C      WRITE(*,*)'DEPTH',DEPTH
+      END
+C      FIND
+C
+C   SUBROUTINE TO FIND THE FIRST OCCURENCE OF A GIVEN STRING
+C   IN A GIVEN ARRAY
+C
+      SUBROUTINE FIND(SEQ,IDIM1,STRING,IDIM2,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2),DASH
+      INTEGER PSEQ,PSTR
+      SAVE DASH
+      DATA DASH/'-'/
+      PSEQ=0
+      PSTR=1
+      IMATCH=0
+C
+100   CONTINUE
+C
+C   PUT PSEQ TO WHERE THIS FAILED MATCH STARTED
+      PSEQ=PSEQ+1-PSTR
+C
+400   CONTINUE
+C
+      PSTR=0
+C
+500   CONTINUE
+C
+C   POINT TO NEXT SEQ CHAR
+      PSEQ=PSEQ+1
+C   TEST FOR END
+      IF(PSEQ.GT.IDIM1)GO TO 300
+C   POINT TO NEXT STRING CHAR
+      PSTR=PSTR+1
+C   TEST FOR DASH IN STRING
+      IF(STRING(PSTR).EQ.DASH)GO TO 450
+C   TEST FOR DASH IN SEQ
+      IF(SEQ(PSEQ).EQ.DASH)GO TO 400
+C   TEST FOR MATCH
+      IF(SEQ(PSEQ).NE.STRING(PSTR))GO TO 100
+C
+450   CONTINUE
+C
+C   TEST FOR END OF STRING IE. WHOLE STRING MATCH
+      IF(PSTR.LT.IDIM2)GO TO 500
+C   HAVE MATCH. GET POINTER TO WHERE IT STARTED
+      IMATCH=PSEQ-IDIM2+1
+C
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE FMT4LP(SEQ1,SEQ2,IDIM,ISW,ISX,IDEV,NAME1,NAME2)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(60),NAME1*(*),NAME2*(*)
+      INTEGER KL(6)
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+10    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+      N=IE-IS+1
+      N=1+(N-1)/10
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,N
+        KL(J)=ISWW
+        ISWW=ISWW+10
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+      WRITE(IDEV,1002)NAME1,(SEQ1(K),K=IS,IE)
+      IL = IE - IS + 1
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1003)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)NAME2,(SEQ2(K),K=IS,IE)
+1002  FORMAT(2X,A,2X,6(10A1,1X))
+ 1003 FORMAT(10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,N
+        KL(J)=ISXX
+        ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+1001  FORMAT( 5X,6(I6,5X))
+      IF(IE.LT.IDIM) GO TO 10
+      END
+      SUBROUTINE FMTDB(SEQ1,IDIM,ISW,ISE,LINLEN,IDEV)
+C   NOTE SAME AS FMTSEP!
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM)
+      INTEGER KL(12)
+      ISWW=ISW-1
+      IE=ISW-1
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.ISE)IE=ISE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,MIN(IE-IS+1,LINLEN)/10)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.ISE)RETURN
+      GO TO 1
+      END
+      SUBROUTINE FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   AUTHOR: RODGER STADEN
+C   STORES THEIR POSITIONS IN CENDS AND THEIR LEFT LINE NUMBERS IN NENDS
+      PARAMETER (MAXDG = 5)
+      CHARACTER SEQ(IDIM),DC(MAXDG)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      EXTERNAL IFROMC,INDEXA
+      IDCEND=0
+      DO 10 I=1,IDIM
+        IF(SEQ(I).NE.'<')GO TO 10
+        IDCEND=IDCEND+1
+C       PUT POSITION OF LEFT END OF CONTIG IN CENDS
+        CENDS(IDCEND)=I
+        K = INDEXA(SEQ(I),20,'.')
+        IF(K.EQ.0) THEN
+           WRITE(KBOUT,*)'Error in contig title: no dot!'
+           IDCEND = 0
+           RETURN
+         END IF
+        K = K + I
+C        K=I+11
+        DO 5 J=1,MAXDG
+          DC(J)=SEQ(K)
+          K=K+1
+5       CONTINUE
+        NENDS(IDCEND)=IFROMC(DC,MAXDG,KBOUT)
+10    CONTINUE
+C     STORE POSITION OF LAST CHAR +1 TO SIMPLIFY DISPLAY ROUTINES
+      CENDS(IDCEND+1)=IDIM+1
+      RETURN
+      END
+      INTEGER FUNCTION GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      GCLIN = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          GCLIN = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      INTEGER FUNCTION GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*17,NAME2*16,NAME3*17,NFLAG
+      PARAMETER (NFLAG='/')
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELID = 0      
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 17
+      CALL GTSTR('Contig identfier',NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELID = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        CALL CCASE(NAME1,1)
+        DO 20 I=1,NGELS
+          CALL READN(IDEVN,I,NAME2)
+          CALL CCASE(NAME2,1)
+          IF(NAME1(2:17).EQ.NAME2) THEN
+            GELID = I
+            RETURN
+          END IF
+20     CONTINUE
+        WRITE(KBOUT,1004)NAME1(2:)
+1004    FORMAT(' ',A,' is not in the database!')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELID
+1001    FORMAT(I17)
+        IF((GELID.LT.1).OR.(GELID.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +NCONTS,GEL,MAXGEL,IDEV3,IDEV4,IDEV5,IDEV1,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER  GEL(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER NAMARC*16
+      FILNAM = ' '
+      CALL OPENF1(IDEV5,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for names of extracted gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(I,'Extract ends of contigs only',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.0) RETURN
+      IF(I.EQ.0) GO TO 15
+      DO 10 I=1,NGELS
+        L=ABS(LNGTHG(I))
+        IF(L.GT.0)THEN
+          CALL READN(IDEV4,I,NAMARC)
+          WRITE(KBOUT,1002)NAMARC
+1002      FORMAT(' ',A)
+          WRITE(IDEV5,1003)NAMARC
+1003      FORMAT(A)
+          FILNAM = NAMARC
+          CALL OPENRS(IDEV1,FILNAM,IOK,LRECL,1)
+          IF(IOK.NE.0) GO TO 100
+          CALL READW(IDEV3,I,GEL,MAXGEL)
+          IF(LNGTHG(I).LT.0)THEN
+            CALL SQREV(GEL,L)
+            CALL SQCOM(GEL,L)
+          END IF
+          CALL FMTDKN(IDEV1,GEL,L)
+          CLOSE(UNIT=IDEV1)
+      END IF
+10    CONTINUE
+      RETURN
+15    CONTINUE
+C   NUMBER OF LINES TO PROCESS
+      N=IDBSIZ-NCONTS
+      DO 20 I=N,IDBSIZ-1
+        JL=LNBR(I)
+        JR=RNBR(I)
+        CALL READN(IDEV4,JL,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        FILNAM = NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JL,GEL,MAXGEL)
+        L=ABS(LNGTHG(JL))
+        IF(LNGTHG(JL).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+        IF(JR.EQ.JL)GO TO 20
+        CALL READN(IDEV4,JR,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JR,GEL,MAXGEL)
+        L=ABS(LNGTHG(JR))
+        IF(LNGTHG(JR).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+20    CONTINUE
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error opening file for extracted gel reading'
+      RETURN
+      END
+      SUBROUTINE GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELID
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*)
+      EXTERNAL GELID
+      IERR = 1
+      NCONTC = GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' No contig line for this gel! Fix the database')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      SUBROUTINE GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      END
+      SUBROUTINE GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LEFTMN,RIGHTM,LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,RIGHTM
+      CHARACTER FILEH*(*)
+40    CONTINUE
+      MN = LEFTMN
+      MX = RIGHTM
+      LREG = MN
+      CALL GETINT(MN,MX,LREG,
+     +'Start position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LREG = IVAL
+      MN = LREG
+      MX = RIGHTM
+      RREG = MX
+      CALL GETINT(MN,MX,RREG,
+     +'End position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      RREG = IVAL
+C   NOW FIND FIRST GEL THAT OVER LAPS
+50    CONTINUE
+      X=RELPG(LLINO)+(ABS(LNGTHG(LLINO)))-1
+      IF(X.GE.LREG)GO TO 60
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 50
+60    CONTINUE
+      RETURN
+      END
+      SUBROUTINE GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+      CHARACTER T
+        IF(T.EQ.'0') THEN
+          YF = Y0
+          YT = Y0
+        ELSE IF(T.EQ.'1') THEN
+          YF = Y0
+          YT = YM1
+        ELSE IF(T.EQ.'2') THEN
+          YF = Y0
+          YT = YP1
+        ELSE IF(T.EQ.'3') THEN
+          YF = YP1
+          YT = YM1
+        ELSE IF(T.EQ.'4') THEN
+          YF = YP2
+          YT = YM2
+        END IF
+      END
+      CHARACTER*1 FUNCTION GTCONC(COUNTS,IDM,CUT)
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU
+C 30-3-92 made this routine sum counts
+      GTCONC = '-'
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5    CONTINUE
+      IF(ISUM.EQ.0) RETURN
+      Y = ISUM
+      DO 10 I = 1,IDM - 1
+        X = REAL(COUNTS(I))/Y
+        IF(X.GE.CUT) THEN
+          GTCONC = CHARSU(I)
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE HIGHLT(GELSAV,NAMSAV,NUMSAV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV1,IDEV2,
+     +FILNAM,IOK)
+      CHARACTER LINEIN*132,CONSEN*100
+      CHARACTER GELNO*6,GEL*100,GELSAV*100,GELNAM*16
+      CHARACTER NAMSAV*16,NUMSAV*6
+      CHARACTER FILNAM*(*),HELPF*(*)
+      DIMENSION GELSAV(50),NAMSAV(50),NUMSAV(50)
+      CHARACTER PLUS*4,MINUS*4
+      EQUIVALENCE (LINEIN(2:2),GELNO),(LINEIN(9:9),GELNAM)
+      EQUIVALENCE (LINEIN(26:26),GEL)
+      EXTERNAL NOTIRL
+      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File containing contig display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for problem display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+C   FORMAT:
+C
+C12345678901234567890       10        20         30    ETC
+C   12  GELNAM0000  CAGACGCGCGCGCGCGCGGATATAGTCTCTCCGCTCT
+C  100  GELNAM0000       TGATACGCTCGCTCTCTCTCTCTCTCTCTTTC
+C                   AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C                           70        80       ETC
+C   12  GELNAM0000     AAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C
+      LIN = 1
+      CALL GTSTR('plus strand symbol',':',PLUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) PLUS = ':'
+      LIN = 1
+      CALL GTSTR('minus strand symbol','.',MINUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) MINUS = '.'
+C   COUNT LINE NUMBERS
+      LINNO=0
+10    CONTINUE
+C   READ LINE OF NOS
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+1003  FORMAT(A)
+C   WRITE IT OUT AGAIN
+      WRITE(IDEV2,1003)LINEIN
+C   ZERO GEL COUNT FOR THIS STRIP
+      IGEL=0
+20    CONTINUE
+C
+C   READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+C       LINEIN=' '
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+C   WHAT SORT OF LINE?  ONLY A GEL WILL HAVE NON BLANK CHARS AT THE LEFT END
+      IF(LINEIN(2:7).NE.' ')THEN
+C   GEL LINE SO SAVE
+      IGEL=IGEL+1
+      GELSAV(IGEL)=GEL
+      NAMSAV(IGEL)=GELNAM
+      NUMSAV(IGEL)=GELNO
+      GO TO 20
+      END IF
+C   MUST BE CONSENSUS
+      CONSEN=GEL
+C   PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+      DO 50 I=1,IGEL
+C       WHERE DOES DATA START AND END?
+        IFIRST=1
+40      CONTINUE
+        IF(GELSAV(I)(IFIRST:IFIRST).NE.' ')GO TO 45
+        IFIRST=IFIRST+1
+        IF(IFIRST.LE.100)GO TO 40
+C       ERROR --- NO DATA FOUND
+        WRITE(KBOUT,1004)LINNO
+1004    FORMAT(' Error on line',I6,' of file')
+        RETURN
+45      CONTINUE
+C       NOW WHERE DOES IT END
+        ILAST=NOTIRL(GELSAV(I),100,' ')
+C       COMPARE WITH CONSENSUS
+        READ(NUMSAV(I),1001,ERR=900)INTEG
+ 1001   FORMAT(I6)
+        IF(INTEG.GE.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,PLUS)
+        IF(INTEG.LT.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,MINUS)
+        WRITE(IDEV2,1008)NUMSAV(I),NAMSAV(I),GELSAV(I)(1:ILAST)
+1008    FORMAT(' ',A,1X,A,1X,A)
+50    CONTINUE
+      WRITE(IDEV2,1009)CONSEN
+1009  FORMAT(' ',24X,A)
+1006  FORMAT( )
+C     READ A BLANK LINE
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+      WRITE(IDEV2,1003)LINEIN
+C     NO GO BACK FOR THE NEXT LINE OF NUMBERS
+      GO TO 10
+100   CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished')
+      RETURN
+900   WRITE(KBOUT,*)'Error reading gel number'
+      END
+      SUBROUTINE IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +NCONTS,IX,IY,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +DBTDUX,DBTDUY,NCONT,IGEL,IS)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER CHNRP1
+      EXTERNAL CWORLD,CHNRP1
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      XX = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+      YX = CWORLD(IY,MARGB,MARGT,YMIN,YMAX)
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        IF((XX.GT.XF).AND.(XX.LT.XT)) THEN
+          IS = NINT(((XX-XF)/(XT-XF)) * RELPG(I))
+          JGEL = LNBR(I)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,JGEL,IS)
+          NCONT = I
+          RETURN
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      IGEL = 0
+      NCONT = 0
+      END
+      SUBROUTINE IDTOD(TOPLIN,GEL,IFIRST,ILAST,SYMBOL)
+      CHARACTER TOPLIN*100,GEL*100,SYMBOL*4
+      DO 10 I=IFIRST,ILAST
+        IF(GEL(I:I).EQ.TOPLIN(I:I))GEL(I:I)=SYMBOL(1:1)
+10    CONTINUE
+      END
+      INTEGER FUNCTION INDEXS(C,S)
+      PARAMETER (IDM = 29)
+      CHARACTER C
+      INTEGER POINTS(0:255),SCORES(IDM),IND(IDM),S
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      SAVE SCORES,IND
+      DATA 
+     +IND/1,2,3,4,1,2,3,4,1,2,3,4,1,2,3,4,6,6,6,6,6,6,1,2,3,4,5,5,6/
+C      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  changed 28-7-91 to give 10 to old zeroes and 100 to lowercase
+      DATA SCORES/
+     +100,100,100,100,
+     +75,75,75,75,
+     +100,100,100,100,
+     +100,100,100,100,
+     +10,10,10,10,10,10,
+     +100,100,100,100,100,100,10/
+      I = ICHAR(C)
+      I = POINTS(I)
+      S = SCORES(I)
+      INDEXS = IND(I)
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP FOR SHOTGUN SEQUENCING
+      SUBROUTINE INITS
+C  AUTHOR RODGER STADEN
+      INTEGER POINTS(0:255)
+      PARAMETER (IDM = 29)
+      CHARACTER DUP*29
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-4 FOR ACGT
+C                 acgt
+C                 1234
+C                 BDHV
+C                 KLMN
+C      5 FOR      *
+C      6 FOR      5678- AND ELSE
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 6
+C
+        DO 30 I = 0,255
+          POINTS(I) = IDM
+30      CONTINUE
+        DO 35 I = 1,IDM
+          J = ICHAR(DUP(I:I))
+          POINTS(J) = I
+35      CONTINUE
+      END
+      FUNCTION INLIST(LIST,IDLIST,ITEM)
+C   AUTHOR: RODGER STADEN
+C   SENT LIST LIST, AND ITEM ITEM. IF IN LIST RETURNS ELEMENT NUMBER, ELSE 0
+      INTEGER LIST(IDLIST)
+      INLIST=0
+      DO 1 I=1,IDLIST
+        IF(LIST(I).NE.ITEM)GO TO 1
+        INLIST=I
+      RETURN
+1     CONTINUE
+      RETURN
+      END
+      SUBROUTINE IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH,X,Y,KBOUT,
+     +IGEL,IOK)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      IOK = 1
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      YINCO2 = STRAND*YINC/2.
+      I = LGEL
+      IGEL = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            IF((X.GE.XF).AND.(X.LE.XT)) THEN
+              IGEL = I
+              IF((Y.GE.YF-YINCO2).AND.(Y.LE.YF+YINCO2)) THEN
+                IOK = 0
+                RETURN
+              END IF
+            END IF
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+C     LINEUP
+C
+C   TAKES 2 SEQS SET OF MATCHES AND PRODUCES LINED UP SEQS
+C   FINDS IF WE HAVE A LEFT OVERLAP
+C   RETURNS POSITION OF JOINT. THIS IS RELATIVE TO THE CONTIG
+C   FOR MOST MATCHES BUT I RELATIVE TO THE GEL FOR A LEFT OVERLAP
+      SUBROUTINE LINEUP(SEQG,SEQC,SEQG2,SEQC2,IDC,IDG,IDOUT,
+     1MATG,MATC,MATL,IP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQG(IDG),SEQC(IDC),SEQG2(IDOUT),SEQC2(IDOUT),PAD
+      INTEGER MATG(IP),MATC(IP),MATL(IP)
+      SAVE PAD
+      DATA PAD/','/
+      IFAIL=0
+C   ZERO PADDING CHARS IN CONTIG (GEL DONE AT END BY DIFFERENCE
+C   IN INPUT AND OUTPUT LENGTHS)
+      ITOTPC=0
+C   FILL OUTPUT WITH PADDING
+      DO 10 I=1,IDOUT
+        SEQG2(I)=PAD
+        SEQC2(I)=PAD
+10    CONTINUE
+      NMTCH=0
+C   SET INITIAL POINTERS TO OUTPUT
+C   CONSENSUS
+      IS1=1
+C   GEL
+      IS2=1
+C   FIND DISTANCE FROM LEFT MATCH IN GEL TO LEFT OF GEL
+      IG2=MATG(1)-1
+      IF(IG2.EQ.0)THEN
+C       THE LEFT END OF THE GEL MATCHES SO THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET JOINT
+        JOINT=MATC(1)
+C       SKIP NEXT SECTION
+        GO TO 50
+      END IF
+C   FIND DISTANCE FROM LEFT MATCH IN CONTIG TO LEFT OF CONTIG
+      IC2=MATC(1)-1
+C   GET DISTANCE FROM FIRST MATCH IN CONTIG TO FIRST MATCH IN GEL.
+C   IF THIS DISTANCE <0 THEN WE HAVE A LEFT OVERLAP
+      IC1=IC2-IG2+1
+      IF(IC1.GT.0)THEN
+C       THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET LEFT END
+        JOINT=IC1
+C       COPY THE GEL UPTO THE FIRST MATCH, INTO THE OUTPUT ARRAY
+C       CHECK FOR OVERFLOW
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+C       COPY THE CONTIG FOR THE SAME REGION
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(IC1),SEQC2(1),IG2)
+        IS1=IS1+IG2
+        IS2=IS2+IG2
+        GO TO 50
+      END IF
+C   MUST BE LEFT END OVERLAP
+C   SET TYPE
+      ITYPE=1
+C   SET POSITION OF JOINT RELATIVE TO GEL
+      JOINT=ABS(IC1)+2
+C   COPY OVER THE GEL UPTO THE JOINT
+C   CHECK FOR OVERFLOW
+      IF(IG2.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+      IS2=IS2+IG2
+C   WE MAY ALSO HAVE MISMATCHING
+C   DATA AT THE JOIN SO DEAL WITH THAT NOW
+C   IF IC2 >0 THE LEFT END OF THE CONTIG MATCHES THE GEL BUT OTHERWISE
+C   WE HAVE SOME MISMATCHED DATA TO DEAL WITH - WE NEED TO TRANSFER
+C   THE MISMATCHED REGION OF THE CONTIG TO THE OUTPUT ARRAY
+      IF(IC2.GT.0)THEN
+        IF(IC2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(1),SEQC2(1),IC2)
+        IS1=IS1+IC2
+      END IF
+C   WHEN WE GET HERE WE HAVE SORTED OUT THE LEFT ENDS FOR LEFT OVERLAP
+C   AND MISMATCHED LEFT ENDS, WE NOW DEAL WITH THE REST OF THE SEQUENCE
+C   STARTING WITH THE FIRST BLOCK OF IDENTITY
+C
+C IG1 POSITION IN INPUT GEL
+C IS2 POSITION IN OUTPUT GEL
+C IC1 POSITION IN INPUT CONTIG
+C IS1 POSITION IN OUTPUT CONTIG
+C LG1 POSITION OF END OF CURRENT MATCH IN OUTPUT GEL
+C LC1 POSITION OF END OF CURRENT MATCH IN OUTPUT CONTIG
+C LG2 DISTANCE FROM CURRENT MATCH IN INPUT GEL TO NEXT MATCH
+C LC2 DISTANCE FROM CURRENT MATCH IN INPUT CONTIG TO NEXT MATCH
+C
+50    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   COPY NEXT MATCH
+      IG1=MATG(NMTCH)
+      IC1=MATC(NMTCH)
+      L=MATL(NMTCH)
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(IG1),SEQG2(IS2),L)
+C   CHECK FOR OVERFLOW
+      IF(IS1+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQC(IC1),SEQC2(IS1),L)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+L
+      IS2=IS2+L
+C   END OF CURRENT MATCH
+      LG1=IG1+L
+      LC1=IC1+L
+C   ANY MORE MATCHES
+      IF(NMTCH.EQ.IP)GO TO 500
+      K=NMTCH+1
+      LG2=MATG(K)-LG1
+      LC2=MATC(K)-LC1
+C   ANY DIFFERENCE IN LENGTH? IF SO WE HAVE TO PAD SO THEY BECOME THE SAME
+      L5=ABS(LG2-LC2)
+C   COUNT PADDING CHARS IN CONTIG
+      IF(LG2.GT.LC2)ITOTPC=ITOTPC+L5
+C   IF DIFFERENCE INCREMENT SHORTER
+      IF(LG2.GT.LC2)IS1=IS1+L5
+C   IF GEL NEEDS PADDING TRY TO PUT PADS NEXT TO DOUBLE CODES
+      IF(LC2.GT.LG2)CALL PADCOP(SEQG,SEQG2,
+     +LG1,MATG(K),L5,IS2,LG2,MAXGEL,IFAIL,KBOUT,SEQC,LC1)
+C   CHECK FOR OVERFLOW
+      IF(IFAIL.EQ.1)GO TO 700
+C   NOW COPY MISSMATCHED REGION
+C   CHECK FOR OVERFLOW
+      IF(IS2+LG2-1.GT.MAXGEL)GO TO 700
+      IF(LG2.GT.0)CALL SQCOPY(SEQG(LG1),SEQG2(IS2),LG2)
+C   CHECK FOR OVERFLOW
+      IF(IS1+LC2-1.GT.MAXGEL)GO TO 700
+      IF(LC2.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),LC2)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+LC2
+      IS2=IS2+LG2
+C   GET NEXT MATCH
+      GO TO 50
+500   CONTINUE
+C
+C   FINISH RIGHT ENDS
+C   ONLY COPY TO END OF GEL IN GEL AND TO THE SAME RELATIVE POSITION
+C   IN THE CONTIG FOR DISPLAY PURPOSES AND FOR COUNTING MISMATCH
+C   CURRENT ENDS AT LG1,LC1
+C   HOW FAR TO END OF GEL?
+C   SET M
+      M=0
+      L=IDG-LG1+1
+      IF(L.LT.1)GO TO 600
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(LG1),SEQG2(IS2),L)
+C   NEED TO COPY TO END OF GEL IN CONTIG FOR DISPLAY
+C   POINT TO POSN IN CONTIG LEVEL WITH END OF GEL
+      M=LC1+L-1
+C   IS THIS OVER END OF CONTIG?
+      IF(M.GT.IDC)M=IDC
+C   NUMBER TO COPY
+      M=M-LC1+1
+C   CHECK FOR OVERFLOW
+      IF(IS1+M-1.GT.MAXGEL)GO TO 700
+      IF(M.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),M)
+600   CONTINUE
+C   COUNT PADDING IN GEL
+      ITOTPG=IS2+L-1-IDG
+C   SET NEW LENGTHS FOR RETURN TO CALLING ROUTINE
+      IDOUT=IS1+M-1
+      IDG=IS2+L-1
+      IFAIL=0
+      RETURN
+700   CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Matching region too long for routine lineup,',
+     +' alignment aborted')
+      IFAIL=1
+      RETURN
+      END
+      SUBROUTINE LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,
+     +RREG,IDEV,IDEVN,NAMARC)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER RREG
+      CHARACTER NAMARC*(*)
+      N = LLINO
+      WRITE(IDEV,1001)
+10    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).NE.0) THEN
+        N = RNBR(N)
+        IF(RELPG(N).LE.RREG) GO TO 10
+      END IF
+1001  FORMAT(
+     +' NAME              NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                             LEFT   RIGHT')
+1006    FORMAT( ' ',A,2X,I6,2X,I7,2X,I5,2X,I6,2X,I6)
+C1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+      END
+C12345678901234567890
+C                          710       720       730       740       750
+C   -1  HINW.004    CGTCAGACGCACGCTGGAAAA
+      INTEGER FUNCTION LTYPE(LINE,LL,J1,J2,N,MAXDB,KBOUT)
+      PARAMETER (MAXDG = 5)
+      CHARACTER LINE*(*),NUM*(MAXDG),SPACE
+      EXTERNAL NOTRL,NOTLR
+      PARAMETER (SPACE= ' ')
+      J1 = NOTLR(LINE,LL,SPACE)
+      IF(J1.EQ.0) THEN
+C       BLANK LINE
+        LTYPE = 1
+        RETURN
+      END IF
+      IF(J1.GT.20) THEN
+C       LINE OF NUMBERS
+        LTYPE = 2
+        RETURN
+      END IF
+      IF(J1.GT.MAXDG+2) THEN
+C       CONSENSUS LINE
+        LTYPE = 3
+        RETURN
+      END IF
+C       SHOULD BE A SEQUENCE LINE
+      J = INDEX(LINE(J1:),SPACE)
+      NUM = SPACE
+      NUM = LINE(J1:J1+J-2)
+      CALL RJST(NUM)
+ 1001 FORMAT(I6)
+      READ(NUM,1001,ERR=10) N
+      IF(N.GT.MAXDB-2) GO TO 10
+C     NUMBER ENDS AT J1+J-2
+      J1 = J1 + J - 1
+C     LOOK FOR BEGINNING OF NAME
+      J = NOTLR(LINE(J1:),LL-J1+1,SPACE)
+      N1 = J1 + J - 1
+C     LOOK FOR END OF NAME
+      J = INDEX(LINE(N1:),SPACE)
+      N2 = N1 + J - 2
+C     LOOK FOR BEGINNING OF SEQ
+      J = NOTLR(LINE(N2+1:),LL-N2,SPACE)
+      J1 = N2 + J
+      LTYPE = 4
+C     LOOK FOR END OF SEQ
+      J2 = NOTRL(LINE,LL,SPACE)
+      IF(J2.GT.N2) RETURN
+10    CONTINUE
+      LTYPE = 0
+      END
+      INTEGER FUNCTION LWRAPS(I,J)
+      K = MOD(I,J)
+      IF(K.EQ.0) K = J
+      LWRAPS = K
+      END
+C     MERGE
+C
+C   ROUTINE SENT CONTIG WHOSE GELS MAY BE OUT OF ORDER
+C   REORDERS GELS ON POSITION OF LEFT ENDS AND SETS LEFT
+C   GEL NUMBER FOR THE REORDERED CONTIG
+C
+      SUBROUTINE MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C   START AT LEFT END
+      N=LNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT IN LIST
+      N=NR
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT
+      NR=RNBR(N)
+      IF(NR.EQ.0)GO TO 30
+C   HAVENT REACHED END YET
+      I1=0
+23    CONTINUE
+C   ARE THESE 2 IN CORRECT ORDER IE N<=NR ?
+      IF(RELPG(N).LE.RELPG(NR))GO TO 21
+C   NOT IN ORDER SO CHAIN LEFT UNTIL CORRECTLY POSITIONED
+C   THEN COME BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE SAVE POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C   EXCHANGE NEIGHBOURS
+      M=RNBR(NR)
+      IF(M.NE.0)LNBR(M)=N
+      M=LNBR(N)
+      IF(M.NE.0)RNBR(M)=NR
+      RNBR(N)=RNBR(NR)
+      RNBR(NR)=N
+      LNBR(NR)=LNBR(N)
+      LNBR(N)=NR
+C   CHAIN BACK THRU LIST
+      N=LNBR(NR)
+      IF(N.EQ.0)GO TO 21
+C   END NOT REACHED
+      GO TO 23
+30    CONTINUE
+C  ALL DONE POINTER AT RIGHT GEL
+      RNBR(LINCON)=N
+      RETURN
+      END
+      SUBROUTINE MINCOM(SEQ1,IDIM1,SEQ2,IDIM2,SAV1,SAV2,SAV3,
+     +IP,MINM,KBOUT)
+C   AUTHOR: RODGER STADEN
+C
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER SAV1(IP),SAV2(IP),SAV3(IP)
+C
+      IP1=IP
+      IP=0
+C
+C   SITUATION 1
+      NT1=IDIM2-MINM
+      IES1=MINM-1
+      ISS2=NT1+1
+C
+      DO 100 I=1,NT1
+C
+C   POINT TO FIRST CHAR-1 OF SEQ2
+      ISS2=ISS2-1
+C   POINT TO LAST CHAR SEQ1
+      IES1=IES1+1
+C
+      N=0
+C
+      DO 200 J=1,IES1
+C   STORE POINTER
+      JJ=J
+C
+C   POINT TO SEQ2
+      K=ISS2+J
+C   TEST FOR EQUALITY
+      IF(SEQ1(J).NE.SEQ2(K))GO TO 220
+C   INCREMENT N
+      N=N+1
+      GO TO 200
+220   CONTINUE
+C   TEST FOR SUFFICENTLY LARGE N
+      IF(N.GE.MINM)CALL SAVIT(N,J,K,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C   RESET N TO ZERO
+      N=0
+200   CONTINUE
+C
+C   GOOD SCORE AT END?
+C   NEED TO INCREMENT POINTERS AS SAVIT EXPECTS TO BE POINTING AT NEXT
+C   MISMATCH
+      JJ=JJ+1
+      KK=K+1
+      IF(N.GE.MINM)CALL SAVIT(N,JJ,KK,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C
+100   CONTINUE
+C
+C
+C   SITUATION 2
+      NT2=IDIM1-IDIM2+1
+C
+      DO 300 I=1,NT2
+      N=0
+C
+      DO 400 J=1,IDIM2
+C   SAVE POINTER
+      JJ=J
+C
+C   SET POINTER TO SEQ1
+      L=I+J-1
+      IF(SEQ1(L).NE.SEQ2(J))GO TO 420
+      N=N+1
+      GO TO 400
+420   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+400   CONTINUE
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+300   CONTINUE
+C
+C
+C   SITUATION 3
+      ISS1=IDIM1-IDIM2
+C
+      DO 500 I=1,NT1
+C
+C   POINT TO FIRST CHAR SEQ1
+      K=ISS1+I
+      IES2=IDIM2-I
+      N=0
+C
+      DO 600 J=1,IES2
+C   SAVE POINTER
+      JJ=J
+C
+C   POINT TO SEQ1
+      L=K+J
+      IF(SEQ1(L).NE.SEQ2(J))GO TO  620
+      N=N+1
+      GO TO 600
+620   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C    TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+600   CONTINUE
+C
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+500   CONTINUE
+C
+      RETURN
+5000  CONTINUE
+C   OVERFLOW
+C
+      WRITE(KBOUT,1000)IP1
+1000  FORMAT(/' TOO MANY MATCHES. LIMIT = ',I6)
+      RETURN
+      END
+      SUBROUTINE ML(PC,PG,L,N,J)
+      INTEGER PC(N),PG(N),L(N)
+      DO 10 I = J,N-1
+        PC(I) = PC(I+1)
+        PG(I) = PG(I+1)
+        L(I) = L(I+1)
+10    CONTINUE
+      END
+      SUBROUTINE MSTLKL(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU,INDEXS
+      DO 100 I=1,IDIM
+        J = INDEXS(SEQ(I),K)
+        SEQ(I) = CHARSU(J)
+100   CONTINUE
+      END
+      CHARACTER FUNCTION MUNOTP(IP)
+C     AUTHOR RODGER STADEN
+      CHARACTER PUP*26
+      SAVE PUP
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      MUNOTP = '-'
+      IF((IP.GT.0).AND.(IP.LT.23))MUNOTP = PUP(IP:IP)
+      END
+      INTEGER FUNCTION NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +STRAND,RREG)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+      NCDEP = 0
+      N = 0
+      I = IGEL
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) N = N + 1
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      NCDEP = N
+      END
+      SUBROUTINE PADCOP(SEQG,SEQG2,LG1,MG,L5,IS2,LG2,MAXGEL,IFAIL,
+     +KBOUT,SEQC,IC1)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (NDUBL = 4)
+      CHARACTER SEQG(MAXGEL),SEQG2(MAXGEL),DUBBL(NDUBL),SEQC(MAXGEL)
+      SAVE DUBBL
+      DATA DUBBL/'D','B','V','H'/
+      JC1 = IC1
+C Make seqg2 from seqg placing L5 padding chars before position MG
+C which is the start of the next block of identity. Try to put the
+C padding either in line with consensus pads, or next to double
+C codes. The positions in seqg are LG1 to MG-1. seqg2 needs to be long
+C enough to be extended from IS2 to IS2 + L5 -1 + MGM1-LG1 +1
+C ie we add L5 pads, plus the chars between and including  LG1 and MGM1
+      IDONE=0
+C   POINT TO END OF MISMATCH
+      MGM1=MG-1
+C   MAY BE NO CHARS TO COPY
+      IF(MGM1.LT.LG1)GO TO 111
+C  Next check added 26-2-91
+      MAXREQ = IS2 + L5 - 1 + MGM1 - LG1 + 1
+      IF((MGM1.GT.MAXGEL).OR.(MAXREQ.GT.MAXGEL)) THEN
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Matching region too large for routine padcop,',
+     +  ' alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 110 J=LG1,MGM1
+        IF(IDONE.LT.L5) THEN
+          IF((JC1.GT.0).AND.(JC1.LT.MAXGEL)) THEN
+          IF(SEQC(JC1).EQ.'*') THEN
+            IS2 = IS2 + 1
+            JC1 = JC1 + 1
+            IDONE = IDONE + 1
+            GO TO 109
+          END IF
+          END IF
+          DO 108 M=1,NDUBL
+            IF(SEQG(J).EQ.DUBBL(M)) THEN
+              IS2 = IS2 + 1
+              JC1 = JC1 + 1
+              IDONE = IDONE + 1
+              GO TO 109
+            END IF
+108       CONTINUE
+109       CONTINUE
+        END IF
+        SEQG2(IS2) = SEQG(J)
+        IS2 = IS2 + 1
+        JC1 = JC1 + 1
+110   CONTINUE
+111   CONTINUE
+C   ALL CHARS COPIED. ENOUGH PADDING?
+      IF(IDONE.LT.L5)IS2=IS2+L5-IDONE
+C   IS2 SHOULD NOW BE POINTING AT NEXT CHAR
+C   ZERO LG2 TO SHOW CALLING ROUTINE COPYING DONE
+      LG2=0
+      IFAIL=0
+      END
+      SUBROUTINE PCON1(CHAR,CHRSUM)
+C  AUTHOR RODGER STADEN
+C  PART OF PROTEIN 'CONSENSUS' CALCULATION
+      CHARACTER CHAR
+      INTEGER CHRSUM
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      K = CTONUM(CHAR)
+      IF(K.NE.26)THEN
+        IF(CHRSUM.EQ.0)THEN
+          CHRSUM = K
+        ELSE
+          IF(K.NE.CHRSUM)CHRSUM = -1
+        END IF
+      END IF
+      END
+      SUBROUTINE PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +NCONTS,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      YMIN = 0.
+      YMAX = ISYMAX
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        CALL LINE(XF,XT,Y,Y,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        IF((IGEL.NE.0).AND.(I.EQ.LINCON)) THEN
+          XZ = XF + RELPG(IGEL) + ABS(LNGTHG(IGEL))/2
+          CALL LINE(XZ,XZ,YMAX,YMIN,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      I = LGEL
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            CALL LINE(XF,XT,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      INTEGER DEPTHP,DEPTHM
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+C  have window size margt starting at margb
+C  depths depthp, depthm
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = LREG
+      XMAX = RREG
+      RINC = YMAX / (DEPTHP + DEPTHM + 2)
+      RMID =(DEPTHM+1) * RINC
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX,RMID,RMID,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMAX,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      STRAND = 1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHP)
+      STRAND = -1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHM)
+      CALL VT100M
+      END
+      SUBROUTINE PLTQ(SEQ,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CHARACTER SEQ(IDIM2),T
+      PARAMETER (Y0  = 0.,
+     +           YP1 = 1.,
+     +           YP2 = 2.,
+     +           YM1 = -1.,
+     +           YM2 = -2.)
+      XMIN = 0.
+      XMAX = IDIM2
+      YMIN = YM2
+      YMAX = YP2
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XIN,XMAX,Y0,Y0,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      I = 1
+10    CONTINUE
+      XF = I
+      T = SEQ(I)
+20    CONTINUE
+      IF(SEQ(I).NE.T) THEN
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I - 1
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+      I = I + 1
+      IF(I.LT.IDIM2) GO TO 20
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      END
+      SUBROUTINE READN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      READ(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE READR(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      READ(IDEVR,REC=N+1)RELPG,LNGTHG,LNBR,RNBR
+      RELPG = SWAPBO(RELPG)
+      LNGTHG = SWAPBO(LNGTHG)
+      LNBR = SWAPBO(LNBR)
+      RNBR = SWAPBO(RNBR)
+      RETURN
+      END
+      SUBROUTINE READW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      READ(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE REMOVL(MATC,MATG,MATL,IP)
+C   AUTHOR: RODGER STADEN
+      INTEGER MATC(IP),MATG(IP),MATL(IP)
+C
+C   SET POINTER TO FIRST MATCH
+      NMTCH=0
+10    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   SORT MATCHES ON LENGTH
+      IPP=IP-NMTCH+1
+      CALL BUBBL3(MATL(NMTCH),MATG(NMTCH),MATC(NMTCH),IPP)
+C   LOOK FOR END OF POSITIVES
+      DO 20 I=NMTCH,IP
+      J=I
+20    IF(MATL(I).LT.1)GO TO 30
+      J=J+1
+30    CONTINUE
+      IP=J-1
+C   END OF POSITIVES AT IP
+      IF(NMTCH.GE.IP)RETURN
+      K1=MATC(NMTCH)
+      K2=K1+MATL(NMTCH)-1
+      K3=MATG(NMTCH)
+      K4=K3+MATL(NMTCH)-1
+C   POINT TO FIRST MATCH TO TEST
+      K6=NMTCH+1
+      DO 200 I=K6,IP
+C   DO CONSENSUS FIRST
+C   OVERLAP?
+      IF(MATC(I).GT.K2)GO TO 100
+      K5=MATC(I)+MATL(I)-1
+      IF(K5.LT.K1)GO TO 100
+C   DOES OVERLAP
+C   WHICH END
+      IF(K5.LE.K2)GO TO 80
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K2-MATC(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C  MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 100
+80    CONTINUE
+C   LENGTH
+      MATL(I)=K1-MATC(I)
+100   CONTINUE
+C   NOW LOOK FOR OVERLAPS WITH GEL
+C   OVERLAP?
+      IF(MATG(I).GT.K4)GO TO 200
+      K5=MATG(I)+MATL(I)-1
+      IF(K5.LT.K3)GO TO 200
+C   DOES OVERLAP
+C   WHICH END?
+      IF(K5.LE.K4)GO TO 180
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K4-MATG(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C   MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 200
+180   CONTINUE
+C   LENGTH
+      MATL(I)=K3-MATG(I)
+200   CONTINUE
+      GO TO 10
+      END
+C     SAVIT
+C
+      SUBROUTINE SAVIT(N,J,K,IP,S1,S2,S3,IP1)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1(IP1),S2(IP1),S3(IP1)
+C
+      IP=IP+1
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)RETURN
+      S1(IP)=N
+      S2(IP)=J-N
+      S3(IP)=K-N
+C
+      RETURN
+      END
+      SUBROUTINE SCRENR(GEL,MAXGEL,STRING,NAME,FILNAM,
+     +IDEV1,IDEV2,IDEV3,IDEV4,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER NAME*(*),FILNAM*(*),HELPF*(*)
+      CHARACTER GEL(MAXGEL),STRING(60)
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of sequences that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of recognition sequences',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+      JGEL = 0
+      IGEL = 0
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+31      CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 31
+        END IF
+      ELSE
+        IOK = GNFFOF(IDEV1,NAME)
+        IF(IOK.EQ.1) GO TO 100
+        IF(IOK.NE.0) GO TO 1
+      END IF
+1002  FORMAT(A)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing', JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV4,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(KBOUT,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV4)
+2     CONTINUE
+      IF(IDIMG.LT.1)THEN
+        WRITE(KBOUT,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL MSTLKL(GEL,IDIMG)
+3     CONTINUE
+      READ(IDEV3,1005,END=6)STRING
+1005  FORMAT(60A1)
+C   FIND LENGTH OF STRING ASSUMING NO SPACES
+      DO 4 I=1,60
+        II=I
+        IF(STRING(I).EQ.' ')GO TO 5
+4     CONTINUE
+5     CONTINUE
+      II=II-1
+      IF(II.GT.0)CALL FIND(GEL,IDIMG,STRING,II,JMATCH)
+      IF(JMATCH.EQ.0)GO TO 3
+C   A MATCH
+      WRITE(IDEV,1007)JMATCH,(STRING(K),K=1,II)
+1007  FORMAT(' Match at',I6,' with ',60A1)
+      REWIND IDEV3
+      GO TO 1
+C   NO MATCH SO SAVE
+6     CONTINUE
+      WRITE(IDEV2,1002)NAME
+      IGEL = IGEL + 1
+      REWIND IDEV3
+      GO TO 1
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGEL,' passed'
+      RETURN
+      END
+      SUBROUTINE SCRENV(MAXGEL,
+     +WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ,MAXSEQ,GEL,GELCOP,MATCH,
+     +LENGTH,
+     +SAVPS,SAVPG,SAVL,MAXMAT,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM,NAME,IOK)
+      INTEGER POSNS(MAXSEQ),GELN(MAXGEL),WORDP(LPOWRC),SAVPS(MAXMAT)
+      INTEGER SAVPG(MAXMAT),SAVL(MAXMAT)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER FILNAM*(*),NAME*(*),HELPF*(*)
+      CHARACTER GELCOP(MAXGEL)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER CONST(LENGTH),CSTART
+      CHARACTER SEQ(MAXSEQ),GEL(MAXGEL),MATCH(MAXGEL)
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      JGEL = 0
+      IGELS = 0
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of gel readings that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV4,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of vector sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      IDIM = MAXSEQ
+      CALL ARRFIM(IDEV4,SEQ,IDIM,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      MN = LENGTH*2
+      MX = 50
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   IS THE VECTOR SEQUENCE IN THE CORRECT FORMAT WITH A TITLE AT THE FRONT?
+      IF(IDCEND.EQ.0)THEN
+        CENDS(1) = -19
+        NENDS(1) = 1
+        CENDS(2) = IDIM + 1
+        IDCEND = 1
+      END IF
+C      WRITE(KBOUT,9999)
+C9999  FORMAT(' VECTOR SEQUENCE REQUIRES A TITLE EG ',
+C     1' <---M13MP7.001----->')
+C      RETURN
+C      END IF
+      CALL BUSY(KBOUT)
+C
+C init hashing routines
+C
+      CALL INITE(CONST,CSTART,LENGTH)
+      CALL ENCOF(SEQ,IDIM,CONST,CSTART,LENGTH,POSNS)
+      CALL ENCONN(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH,1)
+C
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        IOK = GNFFOF(IDEV1,NAME)
+        IF(IOK.EQ.1) GO TO 100
+        IF(IOK.NE.0) GO TO 1
+      END IF
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV3,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Gel reading file not found'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV3,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV3)
+C  LONG ENOUGH ?
+      IF(IDIMG.LT.MINMAT)THEN
+        WRITE(IDEV,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+      IMATCH=0
+2     CONTINUE
+      CALL BUSY(KBOUT)
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      WRITE(IDEV,1009)ISTRAN
+1009  FORMAT(' Searching strand',I6)
+      IDSAV=MAXMAT
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ,GELCOP,MINMAT,IFAIL,KBOUT)
+      IF(IDSAV.GT.0) THEN
+        IMATCH=1
+        CALL DISMAT(SEQ,IDIM,GELCOP,IDIMG,SAVPS,SAVPG,IDSAV,
+     +  CENDS,NENDS,IDCEND,MAXCON,IDEV,MATCH)
+      END IF
+      IF(ISTRAN.EQ.1) THEN
+        CALL SQREV(GELCOP,IDIMG)
+        CALL SQCOM(GELCOP,IDIMG)
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        ISTRAN = 2
+        GO TO 2
+      END IF
+      IF(IMATCH.EQ.0) THEN
+        WRITE(IDEV2,1010)NAME
+        IGELS = IGELS + 1
+      END IF
+      GO TO 1
+1010  FORMAT(A)
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGELS,' passed'
+      RETURN
+      END
+      SUBROUTINE SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IGN,NCONT,DIST)
+C  AUTHOR: RODGER STADEN
+C  SHIFTS PART OF A CONTIG FORM GEL IGN TO RIGHT END
+C  CONTIG LINE NUMBER IF NCONT
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER DIST,CLEN
+      EXTERNAL CLEN
+      I = IGN
+10    CONTINUE
+      IF(I.NE.0)THEN
+        RELPG(I) = RELPG(I) + DIST
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        I = RNBR(I)
+        GO TO 10
+      END IF
+C  UPDATE CONTIG LENGTH
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IGN)
+      RELPG(NCONT) = L
+      CALL WRITER(IDEVR,NCONT,RELPG(NCONT),LNGTHG(NCONT),
+     +LNBR(NCONT),RNBR(NCONT))
+      END
+      SUBROUTINE SLIDER(SEQ1,IDC,SEQ2,IDIM2,MS1,MS2,MAXPG,MAXPC,MINSLI,
+     +MATL,MATC,MATG,IP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2)
+      INTEGER MATL(IP),MATC(IP),MATG(IP),P1S,P1,P2
+      IP1 = IP
+      IP = 0
+C   LEFT END S2 RELATIVE S1 - MAX PADS -2 READY FOR LOOP
+      P1S = MS1 - MS2 - MAXPC - 1
+C   TRY NSLIDE START POSNS FOR SEQ2
+      DO 100 I=1,MAXPG+MAXPC+1
+C       POINT TO SEQ1 START
+        P1S = P1S + 1
+C       POINT TO CURRENT SEQ1 POSN
+        P1 = P1S
+        N = 0
+C       COMPARE WHOLE LENGTH OF SEQ2 (IF P1 WITHIN RANGE)
+        DO 50 J=1,IDIM2
+          P2 = J
+          P1 = P1 + 1
+          IF(P1.LT.1)GO TO 50
+C         OFF RIGHT END? IF SO MAY HAVE BEEN A MATCH
+          IF(P1.GT.IDC)GO TO 40
+          IF(SEQ1(P1).EQ.SEQ2(P2))GO TO 45
+40        CONTINUE
+          IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+          N = 0
+          GO TO 50
+45        CONTINUE
+          N = N + 1
+50      CONTINUE
+C       GOOD SCORE AT END? NEED TO INCREMENT POINTERS FOR SAVIT
+        P1 = P1 + 1
+        P2 = P2 + 1
+        IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+100   CONTINUE
+      END
+      SUBROUTINE SUBS(SEQ,IDIMS,FROM,TO)
+      CHARACTER SEQ(IDIMS),FROM,TO
+C   AUTHOR RODGER STADEN
+      DO 10 I = 1,IDIMS
+        IF(SEQ(I).EQ.FROM) SEQ(I) = TO
+10    CONTINUE
+      END
+      SUBROUTINE SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C 28-7-91 added extra parameter: mxgood is the maximum reading
+C length for which we have confidence, so only the parts of
+C reads 1 to mxgood will be included in the calculation
+C   SET INITIAL VALUES
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        IREND = RELPG(GELC) - LNGTHG(GELC) + 1
+        IF (MXGOOD.LT.MAXGEL) THEN
+          LSEQNO = IREND - MXGOOD + 1
+        ELSE
+          LSEQNO = RELPG(GELC)
+        END IF
+        LSEQNO = MAX(LSEQNO,LREG)
+        IS = LSEQNO - RELPG(GELC) + 1
+        N  = ABS(LNGTHG(GELC))
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   COMPARE STRANDS
+C
+      DO 500 I=1,LINLEN
+C        WRITE(*,*)I,LINOU1(I),LINOU2(I)
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          IF(LINOU1(I).EQ.'-') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF(LINOU1(I).EQ.'*') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          SEQ1(POSN1) = '0'
+        ELSE
+          IF((LINOU1(I).EQ.'*').AND.(LINOU2(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU2(I).EQ.'*').AND.(LINOU1(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*')) THEN
+            SEQ1(POSN1) = '1'
+            IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) 
+     +      SEQ1(POSN1) = '4'
+              GO TO 500
+          END IF
+          IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) THEN
+            SEQ1(POSN1) = '2'
+            IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*'))
+     +      SEQ1(POSN1) = '4'
+            GO TO 500
+          END IF
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+C      SUMMER
+C
+C   SUBROUTINE TO PRODUCE A CONSENSUS FROM LINED UP GEL READINGS
+      SUBROUTINE SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1,IDIM1,GEL,LREG,RREG,IGELC,IDBSIZ,CHARS,CHRSIZ,MAXGL2,
+     +IDEVW,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,LSEQNO,POSN,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER GELC
+      CHARACTER SEQ1(IDIM1)
+      INTEGER CHARS(CHRSIZ,MAXGL2)
+      CHARACTER MUNOTP,GTCONC
+      EXTERNAL MUNOTP,INDEXS,GTCONC,LWRAPS
+C
+C   SET INITIAL VALUES
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      IPSEQ1=0
+C
+C   ZERO ARRAY
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,CHRSIZ
+      CHARS(J,I)=0
+40    CONTINUE
+50    CONTINUE
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C   LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=(LSEQNO-RELPG(GELC))+1
+****************************
+      IF(IDM.EQ.26)THEN
+        DO 51 I = IS,N
+          JJJ=(MOD(LSEQNO,MAXGL2))
+          IF(JJJ.EQ.0)JJJ=MAXGL2
+          CALL PCON1(GEL(I),CHARS(1,JJJ))
+          LSEQNO = LSEQNO + 1
+51      CONTINUE
+      ELSE
+****************************
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,JJJ) = CHARS(JJ,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      END IF
+C
+C   LOOK AT NEXT GEL TO RIGHT
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+C   RESET LSEQNO
+      LSEQNO=RELPG(GELC)
+C   IS THIS OVER END?
+      IF(LSEQNO.GT.RREG)GO TO 200
+C   ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C   SET FLAG TO SHOW END REACHED
+      IEND=1
+C   NEED TO SUM AND OUTPUT
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SUM NEXT SECTION OF CHARS
+      IF(IDM.EQ.26)THEN
+        DO 211 I = 1,LINLEN
+          IPSEQ1 = IPSEQ1 + 1
+          SEQ1(IPSEQ1) = '-'
+          JJJ = MOD(POSN,MAXGL2)
+          IF(JJJ.EQ.0)JJJ = MAXGL2
+          SEQ1(IPSEQ1) = MUNOTP(CHARS(1,JJJ))
+          CHARS(1,JJJ) = 0
+          POSN = POSN + 1
+211     CONTINUE
+      ELSE
+      DO 230 I=1,LINLEN
+        IPSEQ1=IPSEQ1+1
+        ISUM=0
+        JJJ = LWRAPS(POSN,MAXGL2)
+        SEQ1(IPSEQ1) = GTCONC(CHARS(1,JJJ),CHRSIZ,PERCD)
+        CALL FILLI(CHARS(1,JJJ),CHRSIZ,0)
+        POSN = POSN + 1
+230   CONTINUE
+      END IF
+C
+C
+C   ANY MORE TO OUTPUT?
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGLEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE SUMSS(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C
+C Routine to calculate a consensus. Only if the two strands agree
+C is a non dash character assigned.
+C
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=LSEQNO-RELPG(GELC)+1
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   Compare the strands. If they the same then set the consensus
+C   accordingly, otherwise set it to - so no edits are made.
+C
+      DO 500 I=1,LINLEN
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          SEQ1(POSN1) = LINOU1(I)
+        ELSE
+          SEQ1(POSN1) = '-'
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE TPCHEK(PC,PG,L,N)
+      INTEGER PC(N),PG(N),L(N)
+C     AUTHOR RODGER STADEN
+C     IF OVERLAPPING BLOCKS ARE FOUND REMOVE THE SHORTER ONE
+C     THEN REMOVE LARGE GAPS AT ENDS (THOSE AS LARGE AS THE END BLOCK)
+      K1 = 2
+1     CONTINUE
+      DO 10 I = K1,N
+        J1 = I
+        IF(PC(I).LE.PC(I-1)) GO TO 20
+        IF(PG(I).LE.PG(I-1)) GO TO 20
+10    CONTINUE
+C     REMOVE LARGE GAPS FROM ENDS
+C     THIS RULE OF THUMB COULD BE CHANGED TO USE A DIFFERENCE
+C     BETWEEN THE NUMBERS OF MISMATCHING CHARACTERS
+      IF(N.GT.1) THEN
+        K1 = PC(2) - PC(1) - L(1) 
+        J1 = PG(2) - PG(1) - L(1)
+        IF(MAX(K1,J1).GT.L(1)) THEN
+          CALL ML(PC,PG,L,N,1)
+          N = N - 1
+        END IF
+        IF(N.GT.1) THEN
+          K1 = PC(N) - PC(N-1) - L(N-1)
+          J1 = PG(N) - PG(N-1) - L(N-1)
+          IF(MAX(K1,J1).GT.L(N)) THEN
+            CALL ML(PC,PG,L,N,N)
+            N = N - 1
+          END IF
+        END IF
+      END IF
+      RETURN
+20    CONTINUE
+      IF(L(J1-1).GT.L(J1)) THEN
+        CALL ML(PC,PG,L,N,J1)
+      ELSE
+        CALL ML(PC,PG,L,N,J1-1)
+      END IF
+C  Until 25-11-90 next line was k1=j1 but this does not deal with all 
+C  cases: when a line is deleted we must compare it with the previous
+C  one before dealing with the rest, because it could be left of that
+C   one as well!
+      K1 = MAX(2,J1-1)
+      N = N - 1
+      GO TO 1
+      END
+      SUBROUTINE WRITEN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      WRITE(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE WRITER(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      WRITE(IDEVR,REC=N+1)SWAPBO(RELPG),SWAPBO(LNGTHG),
+     +SWAPBO(LNBR),SWAPBO(RNBR)
+      RETURN
+      END
+      SUBROUTINE WRITEW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      WRITE(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NCONTS,LLINOI,LINCNI,LREG,RREG,
+     +WINDOW,GWIND,LENCON,DEPTHP,DEPTHM,
+     +MARGL,MARGR,MARGB,MARGT,MAXOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +KBOUT,GEL,GEL2,IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM,
+     +SEQ1,IDIM1,NGELS,TEMP3,CHRSIZ,MAXGL2,LINOU1,LINOU2,
+     +NOPT1,NOPT2,NOPT3,
+     +IHELPS,IHELPE,HELPF,IDEVH,MXGOOD)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),WINDOW,CHRSIZ,GWIND
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND,CHNRP1,HQN
+      INTEGER TEMP3(2,CHRSIZ,MAXGL2)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER TERM,TUPPER,NAMARC*16,HELPF*(*)
+      CHARACTER SEQ1(IDIM1),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      EXTERNAL NOPWIN,CWORLD,TUPPER,CHNRP1,HQN
+C  nopt1 = single contig
+C  nopt2 = all contigs
+C  nopt3 = scan
+10    CONTINUE
+      LLINO = LLINOI
+      LINCON = LINCNI
+      LOCLR = 0
+      LOCRR = 0
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL CLEARV
+      CALL XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CALL VT100M
+      INFLAG = HQN(TERM)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      NOPT = NOPWIN(IY,MARGB,MARGT,MAXOPT)
+      TERM = TUPPER(TERM)
+      IF(NOPT.EQ.0) RETURN
+      IF(NOPT.EQ.NOPT3) THEN
+        IF(TERM.EQ.'S') THEN
+          XMIN = LREG
+          XMAX = RREG
+          X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF((TERM.EQ.'N').OR.(TERM.EQ.'Z').OR.(TERM.EQ.'I')) GO TO 10
+      END IF
+      IF(NOPT.EQ.NOPT1) THEN
+        STRAND = 1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHP)
+        IF(DEPTHP.LT.0) RETURN
+        STRAND = -1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHM)
+        IF(DEPTHM.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        RINC = ISYMAX / (DEPTHP + DEPTHM + 2)
+        RMID =(DEPTHM+1) * RINC
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+        IF(TERM.EQ.'I') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(ICLOSE.EQ.1) GO TO 10
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+1006      FORMAT
+     +    ( ' Name ',A,' Number ',I6,' Rel. Posn. ',I7,' Length ',I5)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Z') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(IGEL.EQ.0) GO TO 10
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT2),MARGT(NOPT2),ISXMAX,ISYMAX)
+          LREG = MAX(1,RELPG(IGEL)-GWIND)
+          RREG = MIN(RELPG(LINCON),RELPG(IGEL)+GWIND)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +    LLINO,LREG)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT2) THEN
+        CALL IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  NCONTS,IX,IY,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +  ISXMAX,ISYMAX,DBTDUX,DBTDUY,
+     +  LINCON,IGEL,IS)
+        IF(IGEL.EQ.0) RETURN
+        IF(TERM.EQ.'Z') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'I') THEN
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION CLINNO(LNBR,IDBSIZ,NCONTS,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER LNBR(IDBSIZ)
+      CLINNO = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          CLINNO = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE UPDCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +SEQ,MAXSEQ,IDIM1,CSTART,CLENO,LINCON,NAMPRO,SEQ2,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV,IDEV2,IFAIL,MAXGEL,IDM,PERCD)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CSTART,CLENO,S1,B1,ECHRSZ,RREG
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      CHARACTER SEQ(MAXSEQ),SEQ2(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C cstart consensus start point (before new reading)
+C cleno consensus length (before new reading)
+C lincon element number of contig
+C s1 number of first reading to shift
+C b1 number of first base to shift (in overall consensus positioning)
+C
+C there are 2 tasks: 1. make space for the new and altered region
+C                    2. calculate the new consensus and put it in the space
+C we do not have to make space if:
+C a. we are dealing with the last contig in the consensus and there are no
+C    readings starting to the right of the new data
+C b. the contig has not been padded
+C
+C New code to update the consensus only for the region affected by the
+C new reading. Find the next reading to the right of the new one, which
+C the new one does not overlap (might not be one!). Make a consensus from
+C start of new reading to here. Prior to this make space for it by moving
+C the consensus right (only if the contig is longer (padding or extra data
+C at its ends). Let s1 be the first reading to shift. We shift from its
+C left end to the end of the contig - where is this in the overall consensus?
+C The distance of the left end of s1 to the right end of the contig is
+C unchanged. This means that the new relpg(s1) is the same distance from
+C the right end of the old consensus as the old relpg(s1) was from the right
+C end of the old consensus. So from this we can calculate the position of the
+C the first base to move. 
+C Let L be the position in the overall consensus of  the last base in this contig
+C            L = cstart - cleno - 1
+C Let D = distance to end of contig 
+C            D = RELPG(LINCON) - relpg(s1) + 1.
+C First base to shift B1 = L - D + 1
+C Last base to shift is idim1
+C Distance to move to right is relpg(lincon) - cleno ie the number of extra bases
+C make consensus from relpg(ngels) to relpg(s1) - 1
+C put it at cstart + relpg(ngels) - 1
+C
+C Potential problems:
+C 1) reading at right end of contig
+C the search for the first nonoverlapping read to the right will return 0
+C shift al the next contig: ie cstart + cleno onwards
+C make consensus from relpg(ngels) to end of contig
+C put it at cstart + relpg(ngels) -1
+C
+C 2) reading at left end of contig
+C shift whole contig ie cstart - 20
+C add new title
+C shift consensus relpg(lincon) - cleno to the right
+C
+C 3) new reading contains contig - cases 1 and 2 combined
+C the search for the first nonoverlapping read to the right will return 0
+C shift whole of next contig and make consensus from relpg(ngels) to end of
+C contig.
+C
+C 4) Might not be a next contig to shift
+C
+C get number of first reading to shift
+C
+      S1 = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +RELPG(NGELS)+ABS(LNGTHG(NGELS))-1)
+C      WRITE(*,*)'S1',S1
+C
+C is the altered region longer than the original: only then do we need to shift
+C
+C           WRITE(*,*)'IDIM1',IDIM1
+C           WRITE(*,*)'RELPG(LINCON)',RELPG(LINCON)
+C           WRITE(*,*)'CSTART,CLENO',CSTART,CLENO
+      IF (RELPG(LINCON) - CLENO.GT.0) THEN
+C
+C it is longer so we probably need to shift
+C
+        IF (S1.EQ.0) THEN
+C
+C no readings start to the right of the new data
+C
+          IF (CSTART+CLENO-1.LT.IDIM1) THEN
+C
+C there are other contigs to the right
+C
+C           WRITE(*,*)'CSTART,CLENO',CSTART,CLENO
+            B1 = CSTART + CLENO
+C            WRITE(*,*)'B1',B1
+            CALL MAKHCA(SEQ,MAXSEQ,B1,RELPG(LINCON)-CLENO,IDIM1)
+          ELSE
+C
+C there are no contigs to the right and no readings start to the right of
+C the new one so nothing to shift
+C
+          END IF
+        ELSE
+C
+C there are readings starting to the right of the new one
+C
+C shift from start of next reading to right
+C
+           L = CSTART + CLENO - 1
+C           WRITE(*,*)'CSTART,CLENO,L',CSTART,CLENO,L
+           LD = RELPG(LINCON) - RELPG(S1) + 1
+C           WRITE(*,*)'LD',LD
+           B1 = L - LD + 1
+C            WRITE(*,*)'B1',B1
+           CALL MAKHCA(SEQ,MAXSEQ,B1,RELPG(LINCON)-CLENO,IDIM1)
+        END IF
+      END IF
+C
+C now make new consensus (where do we put it,  do we need
+C to give it a header, and what region do we make it for ?
+C in the simplest case make it for relpg(ngels) to relpg(s1) -1
+C if s1=0 make it for relpg(ngels) to end of contig (relpg(lincon))
+C we give it a header if it is at the left end of the contig ie lnbr(ngels)=0
+C
+C we always start at the left end of the new reading
+C
+      LREG = RELPG(NGELS)
+C
+C we end at the next reading to the right or the end of the contig
+C
+      IF (S1.NE.0) THEN
+        RREG = RELPG(S1) - 1
+      ELSE
+        RREG = RELPG(LINCON)
+      END IF
+C
+C where do we put the new consensus ?
+C
+      B1 = CSTART + RELPG(NGELS) - 1
+C      WRITE(*,*)'LREG,RREG',LREG,RREG
+C            WRITE(*,*)'B1',B1
+C
+C do we need to add a title
+C
+      IF (LNBR(NGELS).EQ.0) THEN
+        B1 = CSTART - 20
+C        WRITE(*,*)'ADD NEW TIT AT',B1
+        CALL ADDTIT(SEQ(B1),NAMPRO,NGELS,B1)
+      END IF
+      IGELC = LNBR(LINCON)
+C
+C note aconsn will chain along until it find the first useful reading
+C
+      JOB = 2
+      CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,B1,JOB,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL,MAXGEL,IDM,PERCD)
+      IF(IFAIL.NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        RETURN
+      END IF
+C
+C before we leave we must make the overall consensus length correct
+C  so add on the extra length (if any) which is the new length - old length
+C
+C      WRITE(*,*)'OLD IDIM1',IDIM1
+      IDIM1 = IDIM1 + RELPG(LINCON) - CLENO
+      IDIM2 = IDIM1 + RELPG(LINCON) - CLENO
+C      WRITE(*,*)'NEW IDIM1/2',IDIM1
+      END
+      SUBROUTINE MAKHCA(STRING,MAXAR,FROM,HSIZE,ASIZE)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROM,HSIZE,ASIZE
+C
+C make a hole of size hsize in character array size asize
+C
+      J = ASIZE + HSIZE
+      DO 10 I=ASIZE,FROM,-1
+        STRING(J) = STRING(I)
+        J = J - 1
+ 10     CONTINUE
+      END
+      INTEGER FUNCTION CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C find first reading starting past lreg (0=none found)
+C
+      I = LGEL
+      CHNRP = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +LGEL,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C find first reading with data covering or past lreg (0=none found)
+C
+      I = LGEL
+      CHNRP1 = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I)+ABS(LNGTHG(I))-1.LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP1 = I
+        RETURN
+      END IF
+      END
+C      ACONSN
+      SUBROUTINE ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,JOB,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      INTEGER CHNRP1
+      EXTERNAL CHNRP1
+C
+C new consensus calculating routine (could replace acons if we check values of job)
+C
+C job = 0 do it for whole db
+C     = 2 for selected contig only
+C     = 1 for selected contig, adding a header
+C     note jobs 0 and 1 update istart (it always points to end of overall 
+C     consensus), but job=2 does not
+C
+      CALL BUSY(KBOUT)
+      IFAIL=0
+      IF(JOB.EQ.1) THEN
+C
+C do it for a selected contig, adding title
+C
+        ISTART=ISTART+1
+        IDIM11=RREG-LREG+1
+        IF((ISTART+19+IDIM11).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+           IFAIL=1
+           RETURN
+        END IF
+C
+C allow summer to be dumb, and find first relevant reading number
+C
+        LLINO1 =  CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LLINO,LREG)
+        IF(LLINO1.EQ.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Error in ACONSN: no data found for consensus')
+          IFAIL = 1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO1,IDBSIZ,TEMP,
+     +  CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+IDIM11-1
+        RETURN
+      END IF
+      IF(JOB.EQ.2) THEN
+C
+C do it for a selected contig
+C
+        IDIM11=RREG-LREG+1
+        IF((ISTART+IDIM11-1).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+           IFAIL=1
+           RETURN
+        END IF
+C
+C allow summer to be dumb, and find first relevant reading number
+C
+        LLINO1 =  CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LLINO,LREG)
+        IF(LLINO1.EQ.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Error in ACONSN: no data found for consensus')
+          IFAIL = 1
+          RETURN
+        END IF
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO1,IDBSIZ,TEMP,
+     +  CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        RETURN
+      END IF
+C
+C  do it for all contigs
+C
+      N=IDBSIZ-NCONTS
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        ISTART=ISTART+1
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+           WRITE(KBOUT,1009)IDIM1
+1009       FORMAT(
+     +  ' Database maximum consensus length(',I6,') exceeded',/,
+     +  ' calculation aborted')
+           IFAIL=1
+           RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +  IDM,PERCD)
+        ISTART=ISTART+Y-1
+110   CONTINUE
+      END
+      SUBROUTINE AERROR(IDEVS,IDEVF,NAME,IERR)
+      CHARACTER NAME*(*)
+C
+C handle errors for assembly
+C
+C errors are:
+C 0 file not found
+C 1 read too short
+C 2 failed to align and not entered
+C 3 failed on entry
+C 4 failed to align but entered
+      WRITE(IDEVF,1000)NAME(1:INDEX(NAME,' ')),IERR
+ 1000 FORMAT(A,I2)
+      CALL ERROM(IDEVS,'Failed reading written to error file')
+      END
+      SUBROUTINE SHFTLA(STRING,MAXAR,FROMS,TO,FROME)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROMS,TO,FROME
+C
+C shift an array left from froms to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J) = STRING(I)
+        J = J + 1
+ 10   CONTINUE
+      END
+      SUBROUTINE GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELIDN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*),PROMPT*(*)
+      EXTERNAL GELIDN
+      IERR = 1
+      NCONTC = GELIDN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      CALL ERROM(KBOUT,'No contig line for this reading!')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      INTEGER FUNCTION GELIDN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,PROMPT,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*),PROMPT*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*17,NAME3*17,NFLAG
+      PARAMETER (NFLAG='/')
+      EXTERNAL NAMENO
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELIDN = 0
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 17
+      CALL GTSTR(PROMPT,NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELIDN = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        GELIDN = NAMENO(NAME1(2:),NGELS,IDEVN)
+        IF (GELIDN.EQ.0) CALL ERROM(KBOUT,'Reading name not found')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELIDN
+1001    FORMAT(I17)
+        IF((GELIDN.LT.1).OR.(GELIDN.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION NAMENO(NAME,NGELS,IDEVN)
+      CHARACTER NAME*(*)
+      CHARACTER*16 NAME1,NAME2
+      NAME1 = NAME
+      CALL CCASE(NAME1,1)
+      DO 10 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        CALL CCASE(NAME2,1)
+        IF (NAME1.EQ.NAME2) THEN
+          NAMENO = I
+          RETURN
+        END IF
+ 10   CONTINUE
+      NAMENO = 0
+      END
+      SUBROUTINE REMGBD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,IDEV2,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL),NAMARC*16,FILNAM*(*)
+      INTEGER REMME,GCLIN,CHAINL,GNFFOF
+      PARAMETER (MAXPRM = 35)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      EXTERNAL GCLIN,CHAINL,NAMENO,GNFFOF
+C assumes db is logical consistent
+      FILNAM = ' '
+      PROMPT(1) = 'Define a region by reading names'
+      PROMPT(2) = 'Use a file of reading names'
+      PROMPT(3) = 'Move a reading to a separate contig'
+      PROMPT(4) = 'Make a list of unattached readings'
+      IOPT = 1
+      CALL RADION('Select list definition mode',PROMPT,4,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.4) THEN
+C
+C here we find all contigs with single readings and write their
+C names to a file
+C
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        DO 5 I=IDBSIZ-NCONTS,IDBSIZ-1
+          IF (LNBR(I).EQ.RNBR(I)) THEN
+            IF (LNBR(I).NE.0) THEN
+              CALL READN(IDEVN,LNBR(I),NAMARC)
+              WRITE(IDEV2,1000)NAMARC
+            END IF
+          END IF
+ 5      CONTINUE
+        CLOSE(UNIT=IDEV2)
+        RETURN
+      END IF
+      IF(IOPT.EQ.3) THEN
+C
+C here we start a new contig with the selected reading
+C
+C we get the reading number igelno and move a copy of it
+C to ngels+1. Then we use the remove reading routine to delete
+C the original copy and move the new one to fill the hole. The 
+C reason for this convoluted route is that remgel cleans up
+C all the mess. We must write a new contig line and check the
+C orientation.
+C
+C
+        IF(NGELS+3.GE.IDBSIZ-NCONTS) THEN
+          CALL ERROM(KBOUT,'Insufficient space for new contig')
+          RETURN
+        END IF
+        NGELST = NGELS + 1
+        NCONTT = NCONTS + 1
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOL,LLINOL,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Reading to disconnect',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        CALL READW(IDEVW,IGELNO,GEL,MAXGEL)
+C
+C move reading info over end of gel list
+C        
+        CALL READN(IDEVN,IGELNO,NAMARC)
+        CALL WRITEN(IDEVN,NGELST,NAMARC)
+        LNBR(NGELST) = 0
+        RNBR(NGELST) = 0
+C
+C leave orientation the same
+C
+        LNGTHG(NGELST) = LNGTHG(IGELNO)
+        RELPG(NGELST) = 1
+        CALL WRITER(IDEVR,NGELST,RELPG(NGELST),LNGTHG(NGELST),
+     +  LNBR(NGELST),RNBR(NGELST))
+        CALL WRITEW(IDEVW,NGELST,GEL,MAXGEL)
+        CALL MOVTAG(IGELNO,NGELST)
+C
+C start a new contig
+C
+        I = IDBSIZ - NCONTT
+        LNBR(I) = NGELST
+        RNBR(I) = NGELST
+        LNGTHG(I) = 0
+        RELPG(I) = ABS(LNGTHG(NGELST))
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+        NGELS = NGELST
+        NCONTS = NCONTT
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,0,0)
+        IGLNO = IGELNO
+        CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  IGLNO,LINCOL,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+        RETURN
+      END IF
+      IF(IOPT.EQ.1) THEN
+ 10     CONTINUE
+        LLINOL = 0
+        LLINOR = 0
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOL,LLINOL,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Leftmost reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCOR,LLINOR,JGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +  'Rightmost reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IERR.NE.0) RETURN
+        IF (LLINOL.NE.LLINOR) THEN
+          CALL ERROM(KBOUT,
+     +    'For this mode readings must be in the same contig')
+          GO TO 10
+        END IF
+        IF (RELPG(IGELNO).GT.RELPG(JGELNO)) THEN
+          CALL ERROM(KBOUT,
+     +    'For this mode readings must be in left to right order')
+          GO TO 10
+        END IF
+C
+C IGELNO is first read to remove, JGELNO the last
+C
+C we must make a list of reads because removal changes numbers
+C
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for temporary file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+C
+C write out their names
+C
+        J = IGELNO
+ 20     CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV2,1000)NAMARC
+        WRITE(KBOUT,1001)NAMARC
+ 1000   FORMAT(A)
+ 1001   FORMAT(' ',A)
+        IF (J.NE.JGELNO) THEN
+          IF (J.NE.0) THEN
+            J=RNBR(J)
+            GO TO 20
+          END IF
+        END IF
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF (IOK.NE.0) THEN
+          CLOSE(UNIT=IDEV2)
+          RETURN
+        END IF
+        REWIND(UNIT=IDEV2)
+      ELSE IF(IOPT.EQ.2) THEN
+C
+C here we start from a file of file names
+C 
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Name of file of reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+
+      ELSE
+        CALL ERROM(KBOUT,'How the hell did we get here?')
+        RETURN
+      END IF
+ 30   CONTINUE
+      IOK = GNFFOF(IDEV2,NAMARC)
+      IF(IOK.EQ.1) GO TO 100
+      IF(IOK.NE.0) GO TO 30
+      REMME = NAMENO(NAMARC,NGELS,IDEVN)
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        CALL ERROM(KBOUT,'No contig line for this reading')
+        IOK = 1
+        GO TO 100
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      GO TO 30
+ 100  CONTINUE
+      CLOSE(UNIT=IDEV2)
+      END
diff --git a/src/bap/dbsysnew.f b/src/bap/dbsysnew.f
new file mode 100644
index 0000000..5d0d6f7
--- /dev/null
+++ b/src/bap/dbsysnew.f
@@ -0,0 +1,5040 @@
+C 26-1-93 added new consensus routine
+C  added file busy check
+C  9-7-92  Changed remove contig line to remove all readings as well
+C   4-6-92 Added pad shifting routine
+C   6-4-92 fixed a bug in autocn revealed by new update method:
+C          for some cases where more than 2 overlaps were being found
+C          i was moving the wrong arrays around!
+C   2-4-92 added new dbauto with new update consensus method
+C          and new consensus calculation
+C   2-4-92 brought uptodate with dap
+C   29.05.91  IMPLEMENTED  REMOVE GEL READING
+C   21-8-91 Added routines to find internal overlaps
+C    2-9-91 Fixed bug in copytg
+C   8-11-91 fixed bugs in "find internal joins"
+C   23-Jun-92 COPYTAG - params to READCC and WRITCC in wrong order
+C
+      SUBROUTINE NEWCON(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,LLINO,LINCON,KOPT,
+     +TEMP3,SEQ1,MAXSEQ,SEQ2,SEQ3,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,
+     +ILADD,IRADD,MAXCON,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IDM,IDEVC,IDEVT)
+      INTEGER CHRSIZ,ECHRSZ,RREG
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER TEMP3(ECHRSZ,MAXGL2),ANS
+      INTEGER ILADD(MAXCON),IRADD(MAXCON)
+      CHARACTER SEQ3(MAXGLM)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*),HELPF*(*)
+      PARAMETER (MAXPRM = 6)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      IF(NGELS.LT.1) RETURN
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      PROMPT(1) = 'Staden'
+      PROMPT(2) = 'FASTA'
+C
+C idim is current consensus length
+C
+      IDIM = 0
+      ANS = 0
+      IWING = 0
+      FILE = ' '
+      CALL OPENF1(IDEV,FILE,1,IOK,KBIN,KBOUT,
+     +'Name for consensus file',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF (KOPT.EQ.1) THEN
+       CALL YESNO(I,'Use clipped data',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+       IF(I.LT.0) GO TO 400
+       IF(I.EQ.0) THEN
+         MN = 1
+         MX = MAXGEL
+         IWING = 100
+         CALL GETINT(MN,MX,IWING,
+     +   'Window size for good data scan',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) GO TO 400
+         IWING = IVAL
+         MN = 1
+         MX = MIN(100,IWING)
+         NBAD = MIN(IWING,5)
+         CALL GETINT(MN,MX,NBAD,
+     +   'Maximum number of dashes in scan window',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) GO TO 400
+         NBAD = IVAL
+       END IF
+      END IF
+      CALL YESNO(ANS,'Make consensus for whole database',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) GO TO 400
+      IF(ANS.EQ.1)GO TO 150
+      IDIM2=MAXGEL
+C
+C calc the consensus and add the unused data if required
+C save the lengths of the unused data in iladd, iradd
+C (we dont use them here!)
+C
+      CALL FILLI(ILADD,NCONTS,0)
+      CALL FILLI(IRADD,NCONTS,0)
+      JOB = 0
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM,JOB,KDUMM,KDUMM,KDUMM,KDUMM,
+     +TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL,MAXGEL,IDM,PERCD,SEQ3,
+     +ILADD,IRADD,MAXCON,KDUMM,IWING,NBAD)
+      IF(IFAIL.NE.0) GO TO 400
+      ANS = 1
+      CALL RADION('Select output format',PROMPT,2,ANS,
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.1) GO TO 400
+      IF(ANS.EQ.1) THEN
+        CALL FMTDK(IDEV,SEQ1,IDIM)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,IDIM,NAMPRO,KBOUT,IOK)
+      END IF
+      RETURN
+150   CONTINUE
+      CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,NULGEL,IERR,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +'Contig identifier',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)GO TO 400
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)GO TO 400
+      IDIM2=RREG-LREG+1
+      IF((IDIM+20+IDIM2).GT.MAXSEQ)THEN
+         CALL ERROM(KBOUT,'Maximum consensus length exceeded')
+         CLOSE(UNIT=IDEV)
+         RETURN
+      END IF
+      JOB = 1
+      JCON = 1
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +    SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM,JOB,LLINO,LINCON,LREG,RREG,
+     +    TEMP3,
+     +    ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL,MAXGEL,IDM,PERCD,SEQ3,
+     +    ILADD,IRADD,MAXCON,JCON,IWING,NBAD)
+      IF (IFAIL.NE.0) GO TO 400
+300   CONTINUE
+      CALL YESNO(ANS,'Select another contig',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) GO TO 400
+      IF(ANS.EQ.0) GO TO 150
+      ANS = 1
+      CALL RADION('Select output format',PROMPT,2,ANS,
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF (ANS.LT.1) GO TO 400
+      IF(ANS.EQ.1) THEN
+        CALL FMTDK(IDEV,SEQ1,IDIM)
+      ELSE
+        CALL WRITCF(IDEV,SEQ1,IDIM,NAMPRO,KBOUT,IOK)
+      END IF
+      RETURN
+ 400  CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+C   SUBROUTINE TO ENTER NEW GEL SEQUENCES INTO DATA BASE.
+C   IT READS IN AN ARCHIVE VERSION AND WRITES OUT A WORKING VERSION.
+C   IT ALSO SETS UP ANY RELATIONSHIPS WITH OTHER DATA IN THE DATABASE
+C   BOTH BY POSITION IN A CONTIG AND POINTERS TO LEFT AND RIGHT
+C   NEIGHBOURS.
+      SUBROUTINE AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,NAMARC,X,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM,IDC,NCONTC,LINCON,IFAIL,IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IDEVT,IDEVC,IDEVG,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER SEQC2(IDC)
+      CHARACTER NAMARK*16
+C      WRITE(*,*)'X,ITYPE,ISENSE,IDIM,IDC'
+C      WRITE(*,*)X,ITYPE,ISENSE,IDIM,IDC
+C   SET FAIL FLAG
+      IFAIL=0
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' TRYING TO ENTER NEW GEL READING INTO DATABASE')
+C   IS THERE SPACE?
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+C   FULL
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Database full, current size=',I6,' Extend with copy')
+      IFAIL=7
+      RETURN
+5     CONTINUE
+C   NEED TO CHECK TO SEE IF GEL ALREADY IN DB
+C   LOOK THRU ARC FILE
+      DO 10 J=1,NGELS
+      CALL READN(IDEVN,J,NAMARK)
+      IF(NAMARK.NE.NAMARC(1:16))GO TO 10
+C   FOUND
+      WRITE(KBOUT,1013)J
+1013  FORMAT(' New gel already in database with number',I6,
+     +' Entry aborted')
+      IFAIL=6
+      RETURN
+10    CONTINUE
+C   INCREMENT NUMBER OF GELS
+      NGELS=NGELS+1
+C   SET LENGTH THIS GEL
+      LNGTHG(NGELS)=IDIM*ISENSE
+C   WRITE NAME OF ARCHIVE TO LIST OF ARCHIVES
+C   NAMPRO,ARC
+      NAMARK=NAMARC(1:16)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT(' This gel reading has been given the number ',I6)
+C   WRITE GEL TO WORKING VERSION
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      IF(IDEVT.GT.0) CALL ENTRD(IDEVG,IDEVT,IDEVC,NAMARC,NGELS,IOK)
+C   CREATE TAGS FOR THIS NASTY
+      CALL TAGGEL(NGELS,LNGTHG(NGELS),GEL)
+C   SET UP RELATIONSHIPS
+C   DOES THIS GEL OVERLAP?
+      IF(ITYPE.NE.0)GO TO 100
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+C   SET LEFT AND RIGHT POINTERS TO ZERO,RELPG TO 1
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C
+C   SET CONTIG POINTERS AND GENERAL VALUES
+C   INCREMENT NUMBER OF CONTIGS
+      NCONTS=NCONTS+1
+C   POINTER TO THIS CONTIG
+      N=IDBSIZ-NCONTS
+C   POINTER TO LEFT GEL THIS CONTIG
+      LNBR(N)=NGELS
+C   POINTER TO RIGHT GEL THIS CONTIG
+      RNBR(N)=NGELS
+C   LENGTH OF CONTIG
+      RELPG(N)=IDIM
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   WRITE DB DESCRIPTOR
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+100   CONTINUE
+C
+C
+C   DOES OVERLAP
+150   CONTINUE
+C
+C   LEFT END OR RIGHT OVERLAP?
+      IF(ITYPE.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+160   CONTINUE
+C   NEED TO SEARCH THRU THIS CONTIG TO FIND LEFT AND RIGHT
+C   NEIGHBOURS FOR THIS NEW GEL
+C   LINE NUMBER OF LEFT END OF CONTIG
+      N=NCONTC
+C   LOOK THRU UNTIL CURRENT IS >= THEN IT MUST BE THE PREVIOUS ONE
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+C   IS THIS THE LAST GEL IN CONTIG?
+      IF(RNBR(N).EQ.0)GO TO 350
+C   NO SO LOOK AT NEXT
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS IN NEW GEL
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   SET POINTERS  IN LEFT AND RIGHT NEIGHBOURS
+      K=LNBR(N)
+      RNBR(K)=NGELS
+C      RNBR(LNBR(N))=NGELS
+C   WRITE LEFT AND RIGHT NEIGHBOURS
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   WRITE NGELS NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   ITS LINE NUMBER IS LINCON
+C   NEED TO UPDATE IDIM IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS FOR NEW GEL
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   OLD RIGHT END
+      RNBR(N)=NGELS
+C   WRITE NEW RIGHT LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   RESET RIGHT NAME IN CONTIG
+C   ITS LINE NUMBER IS LINCON
+      RNBR(LINCON)=NGELS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   NEED TO UPDATE LENGTH OF GEL IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+C   WRITE HERE
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+410   CONTINUE
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,1,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+420   CONTINUE
+C   SET POINTERS IN NEW GEL
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   SET POINTERS IN OLD LEFT END
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+C   WRITE NEW LEFT END
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+C   NEW LENGTH OF CONTIG
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+C   MAY HAVE JUST ADDED A GEL LONGER THAN CONTIG
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+C   NEW NAME OF LEFT END OF CONTIG
+      LNBR(LINCON)=NGELS
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C   NOW GO THRU AND CHANGE ALL RELATIVE POSITIONS
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      GO TO 440
+      END
+      SUBROUTINE COPYTG(IDEVT,IDEV,IOK,IDBSIZ,NEWSIZ,NGELS)
+C Read tag details
+      IDIFF = NEWSIZ - IDBSIZ
+      CALL READTG(IDEVT,IDBSIZ,ICNT,LLEN,LCOM,LTYPE,NEXT)
+      IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+      CALL WRITTG(IDEV,NEWSIZ,ICNT+IDIFF,LLEN,LCOM,LTYPE,NEXT)
+C Copy headers for each gels
+      DO 10 I = 1,NGELS
+        CALL READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+        CALL WRITTG(IDEV,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+ 10   CONTINUE
+C Copy rest of tags
+      DO 20 I = IDBSIZ+1, ICNT
+        CALL READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+        CALL WRITTG(IDEV,I+IDIFF,LPOS,LLEN,LCOM,LTYPE,NEXT)
+ 20   CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE COPYCC(IDEVC,IDEV,IOK)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      CALL READCC(IDEVC,1,ICNT,NEXT,NOTE)
+      CALL WRITCC(IDEV,1,ICNT,NEXT,NOTE)
+      DO 10 I = 2,ICNT
+        CALL READCC(IDEVC,I,ICNT,NEXT,NOTE)
+        CALL WRITCC(IDEV,I,ICNT,NEXT,NOTE)
+ 10   CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,IDEVT,IDEVC,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB),PL(2),PR(2),RMOST
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LINCON(2),LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER LREG,RREG,X,ANSJOK
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV),CSTART
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER GET,SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      REAL PERMIS(2)
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      SAVE GET
+      DATA GET/'>'/
+      WRITE(KBOUT,*)' Automatic sequence assembler'
+C
+C set flag for saving alignment scores
+C
+      IREPSC = 1
+      ICRAP = 1
+      IFAIL(1) = 0
+      IEMPTY=0
+      IF(NGELS.LT.1)IEMPTY=1
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      CALL YESNO(IOKENT,'Permit entry',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOKENT.LT.0) RETURN
+      CALL YESNO(ISHOW,'Hide alignments',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ISHOW.LT.0) RETURN
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV7,FILE,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(IOKENT.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +  'File for names of failures',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      ELSE
+        CALL YESNO(IREPSC,'Save alignment scores in a file',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IREPSC.LT.0) RETURN
+        IF(IREPSC.EQ.0) THEN
+          FILE = ' '
+          CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +    'File for names and scores',
+     +    IHELPS,IHELPE,HELPF,IDEVH)
+          IF(IOK.NE.0) RETURN
+          CALL YESNO(ICRAP,'Use poor data',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(ICRAP.LT.0) RETURN
+        END IF
+      END IF
+      PROMPT(1) = 'Perform normal shotgun assembly'
+      PROMPT(2) = 'Put all sequences in one contig'
+      PROMPT(3) = 'Put all sequences in new contigs'
+      IOPT = 1
+      CALL RADION('Select entry mode',PROMPT,3,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.1) THEN
+C  parameters for normal assembly
+        ANSJOK = 0
+        CALL YESNO(ANSJOK,'Permit joins',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(ANSJOK.LT.0) RETURN
+        MN = LENGTH*2
+        MX = MAXGLM + 1
+        MINMAT = MAX(15,MN)
+        CALL GETINT(MN,MX,MINMAT,
+     +  'Minimum initial match',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        MINMAT = IVAL
+        MINSLI = 3
+        MN = 0
+        MX = 25
+        MAXPG = 8
+        CALL GETINT(MN,MX,MAXPG,
+     +  'Maximum pads per gel',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        MAXPG = IVAL
+        MN = 0
+        MX = 25
+        MAXPC = 8
+        CALL GETINT(MN,MX,MAXPC,
+     +  'Maximum pads per gel in contig',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        MAXPC = IVAL
+C        IF((IOKENT.EQ.0).OR.(IREPSC.EQ.0)) THEN
+          RMN = 0.
+          RMX = 100.
+          PERMAX = 8.
+          CALL GETRL(RMN,RMX,PERMAX,
+     +    'Maximum percent mismatch after alignment',
+     +    VAL,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+          PERMAX = VAL
+C        END IF
+        IDIM1=0
+        MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+        JGEL = 0
+        JNGEL = 0
+        JNJOIN = 0
+        JOINF = 0
+        IMATC = 0
+        IF(IEMPTY.EQ.0) THEN
+          JOB = 0
+          CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +    SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,JOB,KDUMM,KDUMM,KDUMM,TEMP3,
+     +    ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+          IF(IFAIL(1).NE.0) THEN
+            CALL ERROM(KBOUT,'Error calculating consensus')
+            GO TO 900
+          END IF
+        END IF
+C
+C init hashing constants
+C
+        CALL INITE(CONST,CSTART,LENGTH)
+      END IF
+C
+C set intitial values for contig count and number of readings
+C just to get thru the first consensus calc
+      NGELSL = NGELS + 2
+      NCONTL = NCONTS +1
+C
+C
+C                          MAIN LOOP
+C
+C
+1     CONTINUE
+C      IF(IDIM1.GT.0)CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+C
+C
+      IDIM2=MAXGEL
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAMARC,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        IOK = GNFFOF(IDEV7,NAMARC)
+        IF(IOK.EQ.1) GO TO 900
+        IF(IOK.NE.0) GO TO 1
+      END IF
+      DO 77 MM=1,80
+        MATCH(MM)=GET
+77    CONTINUE
+      WRITE(IDEV,1077)(MATCH(KK),KK=1,79)
+1077  FORMAT(' ',79A1)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1007  FORMAT(' Gel reading name=',A)
+      WRITE(IDEV,1007)NAMARC
+      CALL OPENRS(IDEV4,NAMARC,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+C        IF(INF.EQ.1) RETURN
+        CALL AERROR(IDEV,IDEV8,NAMARC,0)
+        GO TO 1
+      END IF
+      CALL ARRFIN(IDEV4,SEQ2,IDIM2,KBOUT,ICRAP)
+      CLOSE(UNIT=IDEV4)
+      WRITE(IDEV,1800)IDIM2
+1800  FORMAT(' Gel reading length=',I6)
+C
+C
+C
+      IF(IOPT.NE.1) THEN
+        CALL DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +  IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +  IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL,IMATC,IEMPTY,IOPT)
+        IF(IFAIL(1).NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,0)
+        ELSE
+          JNGEL = JNGEL + 1
+        END IF  
+        GO TO 1
+      END IF
+      IF(IDIM2.LT.MINMAT)THEN
+        CALL AERROR(IDEV,IDEV8,NAMARC,1)
+        GO TO 1
+      END IF
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      IFCOMP=0
+      IMATC=0
+      JOBC = 2
+      IF ((NGELSL.LT.NGELS).AND.(NCONTL.LT.NCONTS)) THEN
+        JOBC = 1
+      ELSE IF (NGELSL.EQ.NGELS) THEN
+        JOBC = 0
+      END IF
+      NGELSL = NGELS
+      NCONTL = NCONTS
+      IF(IEMPTY.EQ.0)
+     +CALL AUTOCN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,CSTART,
+     +JOBC,PERMIS,LENO,ISHOW)
+      IF(IREPSC.EQ.0) THEN
+        IF(IFCOMP.NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,2)
+          GO TO 1
+        END IF
+        IF(IMATC.GT.0) THEN
+          WRITE(IDEV8,1022)NAMARC,PERMIS(1),IDIM2,LENO
+        ELSE
+          PERMIS(1) = 0.
+          LENO = 0
+          WRITE(IDEV8,1022)NAMARC,PERMIS(1),IDIM2,LENO
+        END IF
+      END IF
+ 1022 FORMAT(' ',A,F5.1,2I6)
+      IF(IOKENT.NE.0) GO TO 1
+C     THIS RETURNS THE FOLLOWING:
+C     ILEFTS  POSITION IN CONSENSUS OF LEFT END OF MATCHING CONTIGS
+C     ILC     LENGTHS OF MATCHING CONTIGS
+C     IPOSC   POSITION OF MATCH RELATIVE TO CONTIG
+C     IPOSG   POSITION OF MATCH RELATIVE TO NEW GEL
+C     ISENSE  SENSE OF NEW GEL
+C     LLINO   LEFT GEL NUMBER IN MATCHING CONTIGS
+C     IMATC   THE NUMBER OF MATCHING CONTIGS (>2 IS ERROR!)
+C     IFCOMP  ERROR FLAG FOR COMPARISON (COMPARISON ARRAYS OVERFLOWED)
+      IF(IFCOMP.NE.0) THEN
+        CALL AERROR(IDEV,IDEV8,NAMARC,2)
+        GO TO 1
+      END IF
+      CALL SQCOPY(SEQ3,SEQ2,IDIM2)
+      IF(IMATC.EQ.0) THEN
+C
+C                     NO OVERLAP NEW CONTIG
+C
+C     ITYPE 0 = NO OVERLAP
+C     ISENSE 1 = SAME SENSE AS ARCHIVE
+        IF(IFAIL(1).NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,2)
+          GO TO 1
+        END IF
+        ITYPE(1)=0
+        ISENSE(1)=1
+        IDOUT(1)=MAXGEL
+        WRITE(IDEV,1015)
+1015    FORMAT(' New gel reading does not overlap: start a new contig')
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ2,NAMARC,X,ITYPE,ISENSE,SEQC2(1,1),ITOTPC(1),
+     +  IDIM2,IDOUT(1),LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +  IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+        IF(IFAIL(1).NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,3)
+          GO TO 1
+        END IF
+        IEMPTY=0
+        IDIM1=IDIM1+1
+        IF((IDIM1+19+IDIM2).GT.MAXSEQ)THEN
+          WRITE(IDEV,1021)MAXSEQ
+1021      FORMAT(' Database maximum consensus length (',I6,') exceeded')
+          GO TO 900
+        END IF
+        CALL ADDTIT(SEQ1(IDIM1),NAMPRO,NGELS,IDIM1)
+        CALL MSTLKL(SEQ2,IDIM2)
+        CALL SQCOPY(SEQ2,SEQ1(IDIM1),IDIM2)
+        IDIM1=IDIM1+IDIM2-1
+        JNGEL = JNGEL + 1
+        GO TO 1
+      END IF
+C
+C
+C                     OVERLAP SO TRY TO ALIGN THE SEQUENCES
+C
+C
+      DO 100 I=1,IMATC
+        N=IDBSIZ-NCONTS
+        DO 99 J=N,IDBSIZ-1
+          IF(LNBR(J).NE.LLINO(I))GO TO 99
+          LINCON(I)=J
+          GO TO 100
+99      CONTINUE
+        WRITE(IDEV,10077)LLINO(I)
+10077   FORMAT(' Contig line for contig',I6,' not found!')
+        GO TO 900
+100   CONTINUE
+C
+      IF (IMATC.EQ.1) THEN
+C
+C
+C                     SINGLE OVERLAP
+C
+C
+C
+        WRITE(IDEV,1014)LLINO(1)
+1014    FORMAT(' New gel reading overlaps contig',I6)
+        IF(ITOTPG(1).GT.0) CALL CCTA(SEQG2(1,1),IDIM22(1))
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQG2(1,1),NAMARC,JOINT(1),ITYPE(1),ISENSE(1),
+     +  SEQC2(1,1),
+     +  ITOTPC(1),IDIM22(1),IDOUT(1),LLINO(1),LINCON(1),
+     +  IFAIL(1),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+        IF(IFAIL(1).NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,3)
+          GO TO 1
+        END IF
+        JNGEL = JNGEL + 1
+        CALL UPDCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +  SEQ1,MAXSEQ,IDIM1,ILEFTS(1),ILC(1),LINCON(1),NAMPRO,SEQ2,TEMP3,
+     +  ECHRSZ,MAXGL2,KBOUT,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+        IF(IFAIL(1).NE.0) THEN
+          CALL ERROM(KBOUT,'Error calculating consensus')
+          GO TO 900
+        END IF
+        IF(KFAIL.NE.0) THEN
+C          CALL AERROR(IDEV,IDEV8,NAMARC,4)
+        END IF
+        GO TO 1
+      END IF
+C
+C
+C                     DOUBLE OVERLAP
+C
+C
+      WRITE(IDEV,1013)LLINO
+1013  FORMAT(' Overlap between contigs',I6,' and',I6)
+      IF(ANSJOK.NE.0) GO TO 1
+      IF(LLINO(1).EQ.LLINO(2))THEN
+        WRITE(IDEV,*)' Trying to form loop in contig',LLINO(1)
+        WRITE(IDEV,*)' Gel not entered'
+        CALL AERROR(IDEV,IDEV8,NAMARC,3)
+        GO TO 1
+      END IF
+      CALL AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,
+     +IDIM22,KLASS,IOVER,IDEV,PL,PR)
+      IF(IOVER.GT.MAXOVR)THEN
+        WRITE(IDEV,*)' Overlap too large: entry only'
+C
+C cannot align the two contigs, so try to enter the reading into one of them
+C
+        IFAIL(2)=1
+        IGOOD=0
+        IF(IFAIL(1).EQ.0)IGOOD=1
+        IF(IFAIL(2).EQ.0)IGOOD=2
+        IF(IGOOD.EQ.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,2)
+          JOINF = JOINF + 1
+          GO TO 1
+        END IF
+        IF(ITOTPG(IGOOD).GT.0) CALL CCTA(SEQG2(1,IGOOD),IDIM22(IGOOD))
+        WRITE(IDEV,1012)LLINO(IGOOD)
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQG2(1,IGOOD),NAMARC,JOINT(IGOOD),ITYPE(IGOOD),
+     +  ISENSE(IGOOD),SEQC2(1,IGOOD),ITOTPC(IGOOD),
+     +  IDIM22(IGOOD),IDOUT(IGOOD),LLINO(IGOOD),LINCON(IGOOD),
+     +  IFAIL(IGOOD),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,
+     +  MAXGEL)
+        IF(IFAIL(IGOOD).NE.0) THEN
+          CALL AERROR(IDEV,IDEV8,NAMARC,3)
+          JOINF = JOINF + 1
+          GO TO 1
+        END IF
+        JNGEL = JNGEL + 1
+        CALL UPDCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +  SEQ1,MAXSEQ,IDIM1,ILEFTS(IGOOD),ILC(IGOOD),LINCON(IGOOD),
+     +  NAMPRO,SEQ2,TEMP3,
+     +  ECHRSZ,MAXGL2,KBOUT,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+        IF(IFAIL(1).NE.0) THEN
+          CALL ERROM(KBOUT,'Error calculating consensus')
+          GO TO 900
+        END IF
+        WRITE(IDEV,1020)LLINO,LLINO(IGOOD)
+ 1020   FORMAT(' Could not join contigs',I4,' and',I4,' but the gel',
+     +  ' has been entered into contig',I4,/,
+     +  ' If required do the join manually.')
+        JOINF = JOINF + 1
+        GO TO 1
+      END IF
+C   WHICH CONTIG IS LEFTMOST?
+      LMOST=1
+      RMOST=2
+      IF(PL(1).GT.PL(2))THEN
+        LMOST=2
+        RMOST=1
+      END IF
+C   SAVE LENGTH OF RMOST CONTIG FOR DELETION STEP LATER
+      ILCR=ILC(RMOST)
+      IF(ITOTPG(LMOST).GT.0) CALL CCTA(SEQG2(1,LMOST),IDIM22(LMOST))
+      WRITE(IDEV,1012)LLINO(LMOST)
+1012  FORMAT(' Entering the new gel reading into contig',I6)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,LMOST),NAMARC,JOINT(LMOST),ITYPE(LMOST),
+     +ISENSE(LMOST),SEQC2(1,LMOST),ITOTPC(LMOST),
+     +IDIM22(LMOST),IDOUT(LMOST),LLINO(LMOST),LINCON(LMOST),
+     +IFAIL(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,
+     +MAXGEL)
+      IF(IFAIL(LMOST).NE.0) THEN
+        CALL AERROR(IDEV,IDEV8,NAMARC,3)
+        JOINF = JOINF + 1
+        GO TO 1
+      END IF
+      JNGEL = JNGEL + 1
+      CALL UPDCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +  SEQ1,MAXSEQ,IDIM1,ILEFTS(LMOST),ILC(LMOST),LINCON(LMOST),
+     +  NAMPRO,SEQ2,TEMP3,
+     +  ECHRSZ,MAXGL2,KBOUT,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        GO TO 900
+      END IF
+      IF(ITYPE(LMOST).EQ.1)LLINO(LMOST)=NGELS
+      IF(ILEFTS(LMOST).LT.ILEFTS(RMOST))THEN
+        ILEFTS(RMOST)=ILEFTS(RMOST) + 
+     +  (RELPG(LINCON(LMOST)) - ILC(LMOST))
+      END IF
+      ILC(LMOST) =  RELPG(LINCON(LMOST))
+      DO 500 I=1,2
+        IF(ISENSE(I).EQ.-1)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON(I),
+     +    LLINO(I),SEQ2,IDBSIZ,IDEV,IDEV1,IDEV2,MAXGEL)
+          CALL SQREV(SEQ1(ILEFTS(I)),ILC(I))
+          CALL SQCOM(SEQ1(ILEFTS(I)),ILC(I))
+          KT=IDIM1
+          CALL ADDTIT(SEQ1((ILEFTS(I)-20)),NAMPRO,LNBR(LINCON(I)),KT)
+        END IF
+500   CONTINUE
+C   NEED TO KNOW POSITION OF OVERLAP RELATIVE TO CONTIG, TO CONSENSUS
+C   WHICH BITS TO SEND TO ALIGNMENT ROUTINES
+C   SET UP FOR ALINE (NOTE RMOST IS EQUIVALENT TO THE GEL READING AND
+C   SO IS SLID ALONG THE LMOST CONTIG. THE SECTION SENT TO ALINE MUST 
+C   BE OF LENGTH < MAXGEL-2*MAX(MAXPC,MAXPG)
+C   IT MUST START AT POSITION 1 IN THE RMOST CONTIG AND EXTEND
+      IPOSC(LMOST)=PL(RMOST)+RELPG(NGELS)-1
+      ILCT = RELPG(LINCON(LMOST)) - RELPG(NGELS) - PL(RMOST) + 2
+      ILC(RMOST)=MIN(ILCT,ILC(RMOST))
+      IPOSC(RMOST)=1
+      IDOUT(LMOST)=MAXGEL
+      IDOUT(RMOST)=MAXGEL
+      IDSAV=MAXSAV
+C  ON INPUT TO ALINE ILC(RMOST) CONTAINS THE OVERLAP LENGTH
+C  ON OUTPUT IT CONTAINS THE LENGTH OF THE ALIGNED SECTION (IE INCLUDING 
+C  PADS)
+      WRITE(IDEV,1009)
+1009  FORMAT(' Trying to align the two contigs')
+      CALL ALINE(SEQ1(ILEFTS(LMOST)),SEQ1(ILEFTS(RMOST)),
+     +SEQC2(1,RMOST),SEQC2(1,LMOST),SAV1,SAV2,SAV3,IDSAV,
+     +ILC(LMOST),ILC(RMOST),IDOUT(LMOST),IPOSC(LMOST),IPOSC(RMOST),
+     +MINSLI,JOINT(LMOST),ITOTPC(LMOST),ITOTPC(RMOST),IFAIL(1),
+     +ITYPE(1),MAXPC,MAXPC,PERMAX,IDEV,SEQ4,MAXGEL,Z,LENO,ISHOW)
+C SEQC2(1,LMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE LMOST CONTIG
+C SEQC2(1,RMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE RMOST CONTIG
+C ILC(RMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE RMOST CONTIG
+C IDOUT(LMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE LMOST CONTIG
+C JOINT(LMOST)  IS THE POSITION OF THE JOIN RLETIVE TO THE LMOST CONTIG
+C ITYPE IS TYPE OF OVERLAP (-1 = RIGHT END OR INTERNAL, 1 = LEFT END)
+C  NB SHOULD ALWAYS BE -1
+C  IF THIS HAS BEEN DONE OK WE CAN EDIT THE TWO CONTIGS THEN JOIN
+      IF(IFAIL(1).NE.0)THEN
+        WRITE(IDEV,*)' Failed to align the two overlapping contigs'
+C        CALL AERROR(IDEV,IDEV8,NAMARC,4)
+        JOINF = JOINF + 1
+        GO TO 1
+      END IF
+      IF(ITOTPC(LMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(LMOST)
+1017    FORMAT(' Editing contig',I6)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(LMOST),JOINT(LMOST),SEQC2(1,LMOST),
+     +  ITOTPC(LMOST),IDOUT(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      JOINT(RMOST)=1
+      IDOUT(RMOST)=ILC(RMOST)
+      IF(ITOTPC(RMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(RMOST)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(RMOST),JOINT(RMOST),SEQC2(1,RMOST),
+     +  ITOTPC(RMOST),IDOUT(RMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      ILC(RMOST)=ILCR
+      LTL=LNBR(LINCON(LMOST))
+      LTR=LNBR(LINCON(RMOST))
+      WRITE(IDEV,1018)LNBR(LINCON(LMOST)),LNBR(LINCON(RMOST))
+1018  FORMAT(' Completing the join between contigs',I6,' and',I6)
+      CALL AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +JOINT(LMOST),LTL,LTR,LINCON(LMOST),LINCON(RMOST),IDEV1)
+      LLINO(1)=LTL
+      IF(ILEFTS(LMOST).GT.ILEFTS(RMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+      END IF
+      IF(ILEFTS(RMOST).GE.ILEFTS(LMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      END IF
+      LREG=1
+      RREG=JOINT(LMOST)
+      IGELC=LLINO(1)
+      JOB = 1
+      CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,JOB,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        GO TO 900
+      END IF
+C      CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+      JNJOIN = JNJOIN + 1
+      IF(KFAIL.NE.0) THEN
+C        CALL AERROR(IDEV,IDEV8,NAMARC,4)
+C        JOINF = JOINF + 1
+      END IF
+      GO TO 1
+900   CONTINUE
+C      CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' sequences processed'
+      WRITE(KBOUT,*)JNGEL,' sequences entered into database'
+      WRITE(KBOUT,*)JNJOIN,' joins made'
+      WRITE(KBOUT,*)JOINF, ' joins failed'
+      END
+      SUBROUTINE DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL,IMATC,IEMPTY,IOPT)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER SEQ2(MAXGEL),SEQC2(MAXGEL)
+      CHARACTER NAMARC*(*)
+C  deals with entering all readings into contig 1 (IOPT=2)
+C  or all readings into new contigs (IOPT=3)
+      IF(IOPT.EQ.2) THEN
+        IF(IMATC.EQ.0) THEN
+          ITYPE=0
+          ISENSE=1
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+          IEMPTY=0
+          IMATC = 1
+        ELSE
+          ITYPE= - 1
+          ISENSE=1
+          JOINT = 1
+          LLINO = 1
+          LINCON = IDBSIZ - 1
+          ITOTPC = 0
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVT,IDEV4,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+        END IF
+      ELSE IF(IOPT.EQ.3) THEN
+        ITYPE=0
+        ISENSE=1
+        IDOUT=MAXGEL
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+        IF(IFAIL.NE.0) RETURN
+      END IF
+      END
+      SUBROUTINE DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +NAMPRO,IDEV,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IDEVT,IDEVC,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,MAXDB,IDM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMPRO*(*),NAMARC*16,GEL(MAXGEL)
+      CHARACTER V2,V1,VIN
+      INTEGER IWORD,DELDB,ACTF
+      PARAMETER (IWORD=4)
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PERR(2)*(MAXPRM)
+      EXTERNAL DELDB,ACTF
+      IERR=1
+      LL = INDEX(NAMPRO,'.')
+C
+C save incoming version
+C
+      VIN = NAMPRO(LL+1:LL+1)
+1     CONTINUE
+      L = 1
+      V1='1'
+      CALL GTSTR('Make version',V1,V2,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(V2,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) V2 = V1
+C
+C check if file is open
+C
+      NAMPRO(LL+1:LL+1) = V2
+      LM1 = LL - 1
+      IOK = ACTF(1,NAMPRO,LM1,V2,KBOUT)
+      IF (IOK.NE.0) THEN
+        NAMPRO(LL+1:LL+1) = VIN
+        RETURN
+      END IF
+      MN = NGELS + NCONTS + 1
+      MX = MAXDB
+      NEWSIZ = IDBSIZ
+      CALL GETINT(MN,MX,NEWSIZ,
+     +'New database size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 40
+      CALL BUSY(KBOUT)
+      NEWSIZ = IVAL
+C     WHERE SHOULD CHARS BE CHANGED ?
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//V2
+      CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0) THEN
+C problem opening file
+        IF(IOK.EQ.2) THEN
+          CALL ERROM(KBOUT,'File already exists')
+          PERR(1) = 'Retype version number'
+          PERR(2) = 'Replace database'
+          IDO = 1
+          CALL RADION('Select action',PERR,2,IDO,
+     +    IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          IF(IDO.LT.1) GO TO 40
+          IF(IDO.EQ.1) THEN
+C
+C close busy file (note we dont care what we send for job=2!!!)
+C
+            NAMPRO(LL+1:LL+1) = V2
+            IOK = ACTF(2,NAMPRO,LLL,V2,KBOUT)
+            IF (IOK.NE.0) THEN
+C should never get here
+              NAMPRO(LL+1:LL+1) = VIN
+              RETURN
+            END IF
+            GO TO 1
+          END IF
+          IF(IDO.EQ.2) THEN
+            IOK =  DELDB(NAMPRO,V2,IDEV,MAXGEL)
+            IF(IOK.EQ.0) THEN
+              LLL = INDEX(NAMPRO,'.') + 1
+              NAMPRO(LLL:)='RL'//V2
+              CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+              IF(IOK.EQ.0) GO TO 2
+            ELSE
+              CALL ERROM(KBOUT,'File delete failed')
+            END IF
+          END IF
+        END IF
+        GO TO 100
+      END IF
+ 2    CONTINUE
+      CALL WRITER(IDEV,0,MAXDB,NEWSIZ,MAXGEL,IDM)
+      CALL WRITER(IDEV,NEWSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      DO 10 I=1,NGELS
+        CALL WRITER(IDEV,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      M=NEWSIZ-NCONTS
+      N=IDBSIZ-NCONTS
+      DO 15 I=N,IDBSIZ-1
+        CALL WRITER(IDEV,M,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        M=M+1
+15    CONTINUE
+      CLOSE(UNIT=IDEV)
+C  DO SEQUENCES
+      NAMPRO(LLL:)='SQ'//V2
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 20 I=1,NGELS
+        CALL READW(IDEVW,I,GEL,MAXGEL)
+        CALL WRITEW(IDEV,I,GEL,MAXGEL)
+20    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   DO ARCHIVE NAMES
+      NAMPRO(LLL:)='AR'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 30 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        CALL WRITEN(IDEV,I,NAMARC)
+30    CONTINUE
+      CLOSE(UNIT=IDEV)
+      IF(IDEVT.GT.0.AND.IDEVC.GT.0) THEN
+        NAMPRO(LLL:)='TG'//V2
+        CALL OPENRS(IDEV,NAMPRO,IOK,5,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL COPYTG(IDEVT,IDEV,IOK,IDBSIZ,NEWSIZ,NGELS)
+        CLOSE(UNIT=IDEV)
+        NAMPRO(LLL:)='CC'//V2
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+        CALL OPENRS(IDEV,NAMPRO,IOK,11,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL COPYCC(IDEVC,IDEV,IOK)
+        CLOSE(UNIT=IDEV)
+      ENDIF
+      IERR=0
+ 40   CONTINUE
+C
+C close busy file (note we dont care what we send for job=2!!!)
+C
+      LL = INDEX(NAMPRO,'.')
+      NAMPRO(LL+1:LL+1) = V2
+      IOK = ACTF(2,NAMPRO,LL,V2,KBOUT)
+      NAMPRO(LL+1:LL+1) = VIN
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error opening new database, copy aborted')
+C
+C close busy file
+C
+      LL = INDEX(NAMPRO,'.')
+      NAMPRO(LL+1:LL+1) = V2
+      IOK = ACTF(2,NAMPRO,LL,V2,KBOUT)
+      NAMPRO(LL+1:LL+1) = VIN
+      END
+      INTEGER FUNCTION DELDB(NAMPRO,VERSN,IDEV,MAXGEL)
+      CHARACTER NAMPRO*(*),VERSN
+      INTEGER DELF
+      EXTERNAL DELF
+C
+C delete an xdap database (bap): names are 16 characters!
+C
+C assume relationships are 4 words, names are 4 and seqs are maxgel
+C all recls in BYTES
+C
+      DELDB = 1
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//VERSN
+      IF(DELF(NAMPRO,IDEV,16,4).NE.0) RETURN
+      NAMPRO(LLL:)='AR'//VERSN
+      IF(DELF(NAMPRO,IDEV,16,4).NE.0) RETURN
+      NAMPRO(LLL:)='SQ'//VERSN
+      IF(DELF(NAMPRO,IDEV,MAXGEL,4).NE.0) RETURN
+      NAMPRO(LLL:)='CC'//VERSN
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+      IF(DELF(NAMPRO,IDEV,44,4).NE.0) RETURN
+      NAMPRO(LLL:)='TG'//VERSN
+      IF(DELF(NAMPRO,IDEV,20,4).NE.0) RETURN
+      DELDB = 0
+      END
+      SUBROUTINE DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,MAXGEL,MAXGLM,
+     +IDEVT,IDEVC,TEMP)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C  12-12-90 Added function to change raw data parameter file
+C  and changed menu routines accordingly
+      INTEGER RELPG(IDBSIZ),X,TEMP(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAME*16,NEWNAM*16
+      INTEGER L,M,N
+      CHARACTER GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(9)*(MAXPRM)
+      INTEGER GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' Warning:',
+     +' make a copy first, and check logical consistency after use')
+10    CONTINUE
+C
+C   SELECT OPTION
+C      CALL BELL(1,KBOUT)
+C     DBMENU now defunct for bap - so we use RADION instead
+C      CALL DBMENU(4,NOPT,KOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PROMPT(1) = 'Line change'
+      PROMPT(2) = 'Check logical consistency'
+      PROMPT(3) = 'Delete contig line'
+      PROMPT(4) = 'Shift'
+      PROMPT(5) = 'Move gel reading'
+      PROMPT(6) = 'Rename gel reading'
+      PROMPT(7) = 'Break a contig'
+      PROMPT(8) = 'Remove a gel reading'
+      PROMPT(9) = 'Alter raw data parameters'
+      NOPT = 1
+      CALL RADION('Alter relationships', PROMPT, 9, NOPT, IHELPS,
+     +  IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.LT.1)RETURN
+      IF(NOPT.EQ.1)THEN
+C       LINE CHANGE
+        MN = 0
+        MX = IDBSIZ
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.EQ.0) GO TO 18
+        LNO = IVAL
+        IF(LNO.EQ.IDBSIZ)GO TO 19
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)RELPG(LNO),LNGTHG(LNO),LNBR(LNO),RNBR(LNO)
+1001    FORMAT(' ',4I6)
+        MN = 0
+        MX = 99999
+        X  = RELPG(LNO)
+        CALL GETINT(MN,MX,X,
+     +  'Relative position',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = -MAXGEL
+        MX = 999999
+        L  = LNGTHG(LNO)
+        CALL GETINT(MN,MX,L,
+     +  'Length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = IDBSIZ
+        M  = LNBR(LNO)
+        CALL GETINT(MN,MX,M,
+     +  'Left neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        MN = 0
+        MX = IDBSIZ
+        N  = RNBR(LNO)
+        CALL GETINT(MN,MX,N,
+     +  'Right neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        N = IVAL
+        CALL WRITER(IDEVR,LNO,X,L,M,N)
+        RELPG(LNO)=X
+        LNGTHG(LNO)=L
+        LNBR(LNO)=M
+        RNBR(LNO)=N
+        GO TO 10
+ 18     CONTINUE
+C
+C deal with record 1/0 which contains: maxdb,actualdbsiz,maxgel,idm
+C
+        WRITE(KBOUT,*)
+     +  'Extreme caution: after this record is changed the program'
+        WRITE(KBOUT,*)
+     +  'should be restarted, and could malfunction!'
+        CALL READR(IDEVR,0,IMSIZ,IASIZ,MXG,IDA)
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)IMSIZ,IASIZ,MXG,IDA
+        MN = 0
+        MX = 999999
+        X  = IMSIZ
+        CALL GETINT(MN,MX,X,
+     +  'Maximum database size',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = 0
+        MX = X
+        L  = IASIZ
+        CALL GETINT(MN,MX,L,
+     +  'Actual database size',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = MAXGLM
+        M  = MXG
+        CALL GETINT(MN,MX,M,
+     +  'Maximum reading length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        ID = 0
+        IF(IDA.EQ.26) ID = 1
+        CALL YESONO(ID,'Database is for DNA',
+     +  'Database is for protein',
+     +  IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ID.LT.0) GO TO 10
+        N = 5
+        IF(ID.EQ.0)N = 5
+        IF(ID.EQ.1)N = 26
+        CALL WRITER(IDEVR,0,X,L,M,N)
+        GO TO 10
+19      CONTINUE
+C       NCONTS NGELS LINES
+        MN = 0
+        MX = IDBSIZ
+        LL = NGELS
+        CALL GETINT(MN,MX,LL,
+     +  'Number of gel readings',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LL = IVAL
+        MN = 0
+        MX = IDBSIZ
+        MM  = NCONTS
+        CALL GETINT(MN,MX,MM,
+     +  'Number of contigs',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        MM = IVAL
+        CALL WRITER(IDEVR,IDBSIZ,LL,MM,LL,MM)
+        NGELS=LL
+        NCONTS=MM
+        GO TO 10
+      END IF
+C
+C
+      IF(NOPT.EQ.4)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of first gel reading to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNO)
+        IF(I.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'Problem with this gel reading. Check logical consistency'
+          WRITE(KBOUT,*)'of database. Shift not done'
+          GO TO 10
+        END IF
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'No contig line for this contig. Check logical'
+          WRITE(KBOUT,*)'consistency of database. Shift not done'
+          GO TO 10
+        END IF
+        MN = 1 - RELPG(LNO)
+        MX  = RELPG(NCONTO) - RELPG(LNO)
+        X = MN
+        CALL GETINT(MN,MX,X,
+     +  'Distance to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        X = IVAL
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,LNO,NCONTO,X)
+        WRITE(KBOUT,*)'Shift complete'
+        GO TO 10
+      END IF
+C
+      IF (NOPT.EQ.2) THEN
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP,IERR,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.3)THEN
+        WRITE(KBOUT,*)'Remove a contig'
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of a reading in the contig to delete',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.EQ.0) GO TO 10
+        LNO = IVAL
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNO)
+        ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+        IF(ICONT.EQ.0) THEN
+          WRITE(KBOUT,*)'No contig line for this reading'
+          GO TO 10
+        END IF
+        CALL REMCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,GEL,I,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT,TEMP)
+      END IF
+      IF(NOPT.EQ.6)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to rename',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        CALL READN(IDEVN,LNO,NAME)
+        L = 16
+        CALL GTSTR('name for gel reading',
+     +  NAME,NEWNAM,L,KBOUT,KBIN,INFLAG)
+        IF(L.GT.0)CALL WRITEN(IDEVN,LNO,NEWNAM)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.5)THEN
+        MN = 0
+        MX = NGELS
+        IFROM  = 0
+        CALL GETINT(MN,MX,IFROM,
+     +  'Number of gel reading to move',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        IFROM = IVAL
+        MN = 0
+        MX = NGELS
+        ITO  = 0
+        CALL GETINT(MN,MX,ITO,
+     +  'New number for gel reading',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        ITO = IVAL
+        CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  GEL,IFROM,ITO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        CALL BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+        GO TO 10        
+      END IF
+      IF(NOPT.EQ.8) THEN
+        CALL REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.9) THEN
+        CALL FIXRD(IDEVT,IDEVC,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH)
+        GO TO 10
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE REMCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +ICONT,GEL,LLINO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT,TEMP)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),TEMP(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+C
+C problem is to remove a contig. Strategy is to find all its read numbers
+C and store them in temp. Then go thru and replace them by reads from the
+C end of the list of reads. An earlier version tried to chain thru but
+C ran into complications about replacing reads by reads they pointed to!
+C
+      DO 10 I=1,IDBSIZ
+        TEMP(I) = 0
+ 10     CONTINUE
+C
+      NDEL = 0
+      I = LLINO
+ 20   CONTINUE
+      TEMP(I) = 1
+      NDEL = NDEL + 1
+      IF (RNBR(I).NE.0) THEN
+        I = RNBR(I)
+        GO TO 20
+      END IF
+C
+C now in temp all the reads in the contig are 1, the rest of temp is zero
+C let i be the read to move and j the last read left in the list of reads
+C so if temp(i) is 1 and temp(j) is zero move read j to i and set j = j - 1
+C then deal with the next i.
+C if temp(i) is 1 and temp(j) is also 1 simply set j = j - 1 and try to move
+C that one
+C we stop when weve gone so far along the list that the next read to
+C delete is equal to the number of reads left in the list
+C
+C what are the difficult cases? 
+C 1. only one contig
+C 2. all reads to right of llino are are near the end of the list of reads
+C
+      I = 1
+      J = NGELS
+ 30   CONTINUE
+      IF (TEMP(I).EQ.1) THEN
+        IF (TEMP(J).EQ.0) THEN
+          WRITE(*,*)'MOVE ',J,' TO ',I
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,J,NCONTS,IDBSIZ,
+     +    GEL,J,I,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+          J = J - 1
+        ELSE
+          J = J - 1
+          IF (I.LT.J) GO TO 30
+        END IF
+      END IF
+      IF (I.LT.J) THEN
+        I = I + 1
+        GO TO 30
+      END IF
+C
+C fix up number of reads and the contig record
+C
+      NGELS = NGELS - NDEL
+      CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +ICONT,IDEVR)
+      END
+      SUBROUTINE REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL)
+      INTEGER REMME,GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+C assumes db is logical consistent
+      WRITE(KBOUT,*)'Remove reading from database'
+      REMME = NGELS
+      CALL GETINT(1,NGELS,REMME,
+     +'Number of reading to remove',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      REMME = IVAL
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        WRITE(KBOUT,*)'No contig line for this reading'
+        IOK = 1
+        RETURN
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      END
+      SUBROUTINE REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+C Routine to remove a reading from a database
+C Cases: 1 left end
+C        2 right end
+C        3 internal and dispensible
+C        4 internal and indispensible
+C if 1 change contig lnbr, contig length, lnbr of rnbr of remme, relpgs
+C if 2 change contig rnbr, contig length, rnbr of lnbr of remme
+C if 3 change contig length, lnbr of rnbr of remme rnbr of lnbr of remme
+C if 4 need to break contig, then as for 1
+C if 1 and 2 then also remove contig line
+C for all cases move gel ngels to remme (if remme/=ngels)
+C and update line idbsiz
+C
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME,CLEN
+      LOGICAL LEFTE,RIGHTE,DISPEN
+      CHARACTER GEL(MAXGEL)
+      EXTERNAL CLEN
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+      DISPEN = .FALSE.
+C
+C Left end ?
+C
+      IF(LNBR(REMME).EQ.0) LEFTE = .TRUE.
+C
+C Right end ?
+C
+      IF(RNBR(REMME).EQ.0) RIGHTE = .TRUE.
+C
+C If both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading and contig'
+        IFROM = NGELS
+        NGELS = NGELS - 1
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,IDEVR)
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+      ELSE IF(LEFTE) THEN
+        WRITE(KBOUT,*)'Removing reading from left end of contig'
+        LNBR(ICONT) = RNBR(REMME)
+        I = 1 - RELPG(RNBR(REMME))
+        WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,RNBR(REMME),ICONT,I)
+        I = LNBR(ICONT)
+        LNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE IF(RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading from right end of contig'
+        RNBR(ICONT) = LNBR(REMME)
+        I = RNBR(ICONT)
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNBR(ICONT))
+        CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +  LNBR(ICONT),RNBR(ICONT))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE
+C Is remme indispensible ?
+        NSTART = RELPG(RNBR(REMME))
+        I = REMME
+10      CONTINUE
+        I = LNBR(I)
+        IF(I.NE.0) THEN
+          IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.NSTART) GO TO 10
+          DISPEN = .TRUE.
+        END IF
+        IF(DISPEN) THEN
+          WRITE(KBOUT,*)
+     +    'Removing dispensible reading from middle of contig'
+          I = LNBR(REMME)
+          RNBR(I) = RNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          I = RNBR(REMME)
+          LNBR(I) = LNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNBR(ICONT))
+          CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +    LNBR(ICONT),RNBR(ICONT))
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        ELSE
+          WRITE(KBOUT,*)
+     +    'Removing indispensible reading from middle of contig'
+          WRITE(KBOUT,*)'So breaking contig first'
+          IR = REMME
+          IL = LNBR(REMME)
+          ILO = LNBR(ICONT)
+          NCONTO = ICONT
+          NCONTR = IDBSIZ - NCONTS - 1
+          CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+          IF(IOK.NE.0) RETURN
+          WRITE(KBOUT,*)'Removing reading from left end of contig'
+          ICONT = IDBSIZ - NCONTS
+          LNBR(ICONT) = RNBR(REMME)
+          I = 1 - RELPG(RNBR(REMME))
+          WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+          CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +    IDBSIZ,RNBR(REMME),ICONT,I)
+          I = LNBR(ICONT)
+          LNBR(I) = 0
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        END IF
+      END IF
+      END
+      SUBROUTINE REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,IDEVR)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME
+C Routine to remove a contig line from a db
+C Loop deals with case of remove top contig
+C Move down all lines from above
+      DO 10 I = REMME,IDBSIZ-NCONTS+1,-1
+        RELPG(I) = RELPG(I-1)
+        LNGTHG(I) = LNGTHG(I-1)
+        LNBR(I) = LNBR(I-1)
+        RNBR(I) = RNBR(I-1)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      NCONTS = NCONTS - 1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      END
+      SUBROUTINE MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +GEL,FROM,TO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+C   Subroutine to move a gel from line from to line to
+C   Extended 22-5-91
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),FROM,TO
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMGEL*16,GEL(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      LOGICAL LEFTE,RIGHTE
+      EXTERNAL GCLIN,CHAINL
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+C
+C left end ?
+C
+      IF(LNBR(FROM).EQ.0) LEFTE = .TRUE.
+C
+C right end ?
+C
+      IF(RNBR(FROM).EQ.0) RIGHTE = .TRUE.
+C
+C if both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          RNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(LEFTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(RIGHTE) THEN
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no right neighbour and does not'
+          WRITE(KBOUT,*)'appear in a contig!'
+        ELSE
+          IF(RNBR(NCONTO).NE.FROM)THEN
+            WRITE(KBOUT,*)
+     +        'This gel has no right neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+          ELSE
+            RNBR(NCONTO) = TO
+            CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +        LNBR(NCONTO),RNBR(NCONTO))
+          END IF
+        END IF
+      END IF
+      RELPG(TO)=RELPG(FROM)
+      LNGTHG(TO)=LNGTHG(FROM)
+      LNBR(TO)=LNBR(FROM)
+      RNBR(TO)=RNBR(FROM)
+      CALL READW(IDEVW,FROM,GEL,MAXGEL)
+      CALL WRITEW(IDEVW,TO,GEL,MAXGEL)
+      CALL READN(IDEVN,FROM,NAMGEL)
+      CALL WRITEN(IDEVN,TO,NAMGEL)
+      CALL WRITER(IDEVR,TO,RELPG(TO),LNGTHG(TO),
+     +LNBR(TO),RNBR(TO))
+C   Do neighbours
+      IF(LNBR(FROM).NE.0) THEN
+        I=LNBR(FROM)
+        RNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      IF(RNBR(FROM).NE.0) THEN
+        I=RNBR(FROM)
+        LNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      CALL MOVTAG(FROM,TO)
+      END
+      SUBROUTINE DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +IDBSIS,IDBSIZ,IERR,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,IDEVT,IDEVC,
+     +MAXGEL,MAXGLM,LLINO,
+     +IDM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER GEL(MAXGLM)
+      INTEGER RELPG(IDBSIS)
+      INTEGER LNGTHG(IDBSIS),LNBR(IDBSIS),RNBR(IDBSIS)
+      CHARACTER NAMPRO*(*),COPYNO*4,HELPF*(*)
+      INTEGER IWORD,ACTF
+      EXTERNAL ACTF
+      PARAMETER (IWORD=4)
+C  NOTE THIS IS THE MACHINES WORD LENGTH IE HOW MANY CHARS PER WORD
+      CALL FILLI(RELPG,IDBSIS,0)
+      CALL FILLI(LNGTHG,IDBSIS,0)
+      CALL FILLI(LNBR,IDBSIS,0)
+      CALL FILLI(RNBR,IDBSIS,0)
+      NAMPRO(1:)=' '
+      IERR=1
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Project name',' ',NAMPRO,L,KBOUT,KBIN,INFLAG)
+      IF(L.LT.1) RETURN
+      LL = L
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      L = 1
+      CALL GTSTR('Version','0',COPYNO,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(COPYNO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) COPYNO(1:1) = '0'
+C
+C check if file is open
+C
+      NAMPRO(LL+1:LL+2) = '.'//COPYNO(1:1)
+      IOK = ACTF(1,NAMPRO,LL,COPYNO,KBOUT)
+      IF (IOK.NE.0) RETURN
+      NAMPRO(LL+1:LL+3)='.AR'
+      NAMPRO(LL+4:LL+4)=COPYNO(1:1)
+       CALL OPENRS(IDEVN,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      NAMPRO(LL+2:LL+3)='RL'
+       CALL OPENRS(IDEVR,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      CALL READR(IDEVR,0,IDBST,IDBSIZ,MAXGEL,IDM)
+C
+C Do I really need to look at this value? I could simply compare
+C IDBSIS with IDBSIZ which is what really counts. Cannot remember
+C why IDBST is stored!!!!!!!
+C
+      IF(IDBST.GT.IDBSIS) THEN
+        CALL ERROM(KBOUT,
+     +  'Fatal error: database size too large for program')
+        GO TO 100
+      END IF
+      NAMPRO(LL+2:LL+3)='SQ'
+C  DEFINE RECORD LENGTH IN TERMS OF NUMBER OF CHARS PER WORD (4 ON VAX)
+C  AND MAXGEL SIZE 
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEVW,NAMPRO,IOK,IREC,4)
+      IF(IOK.NE.0)GO TO 100
+      NAMPRO(LL+2:LL+3) = 'TG'
+      CALL OPENRS(IDEVT,NAMPRO,IOK,5,4)
+      IF(IOK.NE.0) IDEVT = -1
+      NAMPRO(LL+2:LL+3) = 'CC'
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+      CALL OPENRS(IDEVC,NAMPRO,IOK,11,4)
+      IF(IOK.NE.0) IDEVC = -1
+C   READ A LINE FOR LUCK
+      CALL READW(IDEVW,1,GEL,MAXGEL)
+      CALL READR(IDEVR,IDBSIZ,NGELS,NCONTS,IDUM1,IDUM2)
+      WRITE(KBOUT,10011)NGELS,NCONTS,IDBSIZ,MAXGEL
+10011 FORMAT(' Number of gel readings ',I6,' Number of contigs ',I6,/,
+     +' Database size ',I6,' Maximum gel reading length=',I4)
+      LLINO = 0
+      IF(NGELS.LT.1)GO TO 5
+      DO 3 I=1,NGELS
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+3     CONTINUE
+      N=IDBSIZ-NCONTS
+      MXT = 0
+      DO 4 I=N,IDBSIZ-1
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        IF(RELPG(I).GT.MXT) THEN
+          MXT = RELPG(I)
+          LLINO = LNBR(I)
+        END IF
+4     CONTINUE
+5     CONTINUE
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      IERR=0
+      RETURN
+100   CONTINUE
+      IOK = ACTF(2,NAMPRO,LL,COPYNO,KBOUT)
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error encountered opening database files')
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      END
+      SUBROUTINE DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,
+     +IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +IERR,IHELPS,IHELPE,IDEVH,FILEH,
+     +MAXGEL,MAXGLM,IDM)
+      CHARACTER GEL(MAXGLM),FILEH*(*)
+      CHARACTER NAMPRO*(*)
+      INTEGER IWORD,ACTF
+      PARAMETER (IWORD=4)
+      EXTERNAL ACTF
+      IERR=1
+3     CONTINUE
+      NAMPRO = ' '
+      MN = 0
+      CALL GTSTR('New project name',' ',NAMPRO,MN,KBOUT,KBIN,INFLAG)
+      IF(MN.LT.1) RETURN
+      LL = MIN(12,MN)
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 3
+      END IF
+      NAMPRO(LL+1:LL+2) = '.0'
+      IOK = ACTF(1,NAMPRO,LL,'0',KBOUT)
+      IF (IOK.NE.0) RETURN
+      MN = 10
+      MX = IDBSIS
+      IDBSIZ = 50
+      CALL GETINT(MN,MX,IDBSIZ,
+     +'Database size',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 40
+      IDBSIZ = IVAL
+5     CONTINUE
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MIN(512,MAXGEL)
+      MAXIN = 1024
+      CALL GETINT(MN,MX,MAXIN,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 40
+      MAXGEL = IVAL
+      IF(MOD(IVAL,MAXIN1).NE.0) THEN
+        MAXGEL = 512 + (IVAL/512)*512
+        WRITE(KBOUT,*)'Maximum set to',MAXGEL
+      END IF
+      CALL YESNO(IDM,'Database is for DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(IDM.LT.0) GO TO 40
+      IF(IDM.EQ.0)IDM = 5
+      IF(IDM.EQ.1)IDM = 26
+      NAMPRO(LL+1:LL+4)='.RL0'
+      CALL OPENRS(IDEV1,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Writing new database files')
+      J = 0
+      K = 0
+      N = 0
+      M = 0
+C
+C Put maximum possible database size, actual size, max read length and type
+C in record 0 of relationships (actually record 1, as writer adds 1).
+C
+      CALL WRITER(IDEV1,0,IDBSIS,IDBSIZ,MAXGEL,IDM)
+C
+C Put zeroes in al other records
+C
+      DO 10 I=1,IDBSIZ
+        CALL WRITER(IDEV1,I,J,K,M,N)
+10    CONTINUE      
+      NAMPRO(LL+2:LL+3)='SQ'
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV2,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL FILLC(GEL,MAXGEL,' ')
+C      CALL WRITEW(IDEV2,IDBSIZ,GEL,MAXGEL)
+C
+C  write only the first record into the working versions
+C  and assume others will be added when required
+C
+      CALL WRITEW(IDEV2,1,GEL,MAXGEL)
+      NAMPRO(LL+2:LL+3)='AR'
+      CALL OPENRS(IDEV3,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+C
+C  write only the first record into the names
+C  and assume others will be added when required
+C
+      CALL WRITEN(IDEV3,1,'                ')
+C  CREATE TAG FILES (TAGS AND COMMENTS)
+        NAMPRO(LL+2:LL+3)='TG'
+        CALL OPENRS(IDEVT,NAMPRO,IOK,5,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL WRITTG(IDEVT,IDBSIZ,IDBSIZ,0,0,0,0)
+        NAMPRO(LL+2:LL+3)='CC'
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+        CALL OPENRS(IDEVC,NAMPRO,IOK,11,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL WRITCC(IDEVC,1,1,0,' ')
+      WRITE(KBOUT,1003)NAMPRO(1:LL),IDBSIZ
+1003  FORMAT(' Database ',A,' version 0, size ',I6,
+     +' successfully started')
+      IERR=0
+      NAMPRO(LL+2:LL+2) = '0'
+      RETURN
+ 40   CONTINUE
+      NAMPRO(LL+1:LL+2) = '.0'
+      IOK = ACTF(2,NAMPRO,LL,'0',KBOUT)
+      RETURN
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error writing database files')
+      NAMPRO(LL+1:LL+2) = '.0'
+      IOK = ACTF(2,NAMPRO,LL,'0',KBOUT)
+      END
+      SUBROUTINE FIXRD(IDEVT,IDEVC,IDBSIZ,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C FILE_NAME_LENGTH
+      CHARACTER NAMFIL*18,NEWNAM*18,MTYPE*4,NEWMT*4,FILEH*(*)
+      IF(IDEVRD.LT.0) THEN
+        WRITE(KBOUT,*)'No raw data file!'
+        RETURN
+      END IF
+ 10   CONTINUE
+C  Change raw data record
+        MN = 0
+        MX = IDBSIZ-1
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.EQ.0) RETURN
+        LNO = IVAL
+        CALL READRD(IDEVT,IDEVC,LNO,LENR,LCUT,LENW,MTYPE,NAMFIL)
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)LENR,LCUT,LENW,MTYPE,NAMFIL
+1001    FORMAT(' ',3I6,' ',A,' ',A)
+        MN = 1
+        MX = 9999
+        LX  = LENR
+        CALL GETINT(MN,MX,LX,
+     +  'Length raw sequence',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        LX = IVAL
+        MN = 1
+        MX = LX
+        L  = LCUT
+        CALL GETINT(MN,MX,L,
+     +  'Left cutoff',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 1
+        MX = LX
+        M  = LENW
+        CALL GETINT(MN,MX,M,
+     +  'Length of original working sequence',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        M = IVAL
+ 20     CONTINUE
+        LNAM = 4
+        CALL GTSTR('Machine type',
+     +  MTYPE,NEWMT,LNAM,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 20
+        END IF
+        IF(INFLAG.EQ.3) NEWMT = MTYPE
+ 30     CONTINUE
+C FILE_NAME_LENGTH
+        LNAM = 18
+        CALL GTSTR('Name for raw data file',
+     +  NAMFIL,NEWNAM,LNAM,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 30
+        END IF
+        IF(INFLAG.EQ.3) NEWNAM = NAMFIL
+        WRITE(KBOUT,1001)LX,L,M,NEWMT,NEWNAM
+        CALL WRITRD(IDEVT,IDEVC,LNO,LX,L,M,NEWMT,NEWNAM)
+        WRITE(KBOUT,*)'New record written to disk'
+        GO TO 10
+        END
+      SUBROUTINE READRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+      CHARACTER MTYPE*(*),NAMFIL*(*)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      IF(IDEVT.GT.0)THEN
+        CALL READTG(IDEVT,NGEL,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        CALL READCC(IDEVC,LCOM,ICNT,NEXT,NOTE)
+        READ(NOTE,1001,ERR=100)LENR,LCUT,LENW,MTYPE,NAMFIL
+ 1001   FORMAT(3I6,A,A)
+      ENDIF
+      RETURN
+ 100  CONTINUE
+      LENR = 0
+      LCUT = 0
+      LENW = 0
+      MTYPE = ' '
+      NAMFIL = ' '
+      END
+      SUBROUTINE WRITRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+      CHARACTER MTYPE*(*),NAMFIL*(*)
+      INTEGER FREECC
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      IF(IDEVT.GT.0)THEN
+        CALL READTG(IDEVT,NGEL,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF(LCOM.EQ.0)THEN
+          LCOM = FREECC(IDEVC)
+        ENDIF
+        WRITE(NOTE,1001,ERR=100)LENR,LCUT,LENW,MTYPE,NAMFIL
+ 1001   FORMAT(3I6,A,A)
+        NEXT = 0
+        CALL WRITCC(IDEVC,LCOM,ICNT,NEXT,NOTE)
+      ENDIF
+      RETURN
+ 100  CONTINUE
+      END
+      SUBROUTINE READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      IF(IDEVT.GT.0) THEN
+        READ(IDEVT,REC=I)LPOS,LLEN,LCOM,LTYPE,NEXT
+        LPOS = SWAPBO(LPOS)
+        LLEN = SWAPBO(LLEN)
+        LCOM = SWAPBO(LCOM)
+        NEXT = SWAPBO(NEXT)
+      ENDIF
+      END
+      SUBROUTINE WRITTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      IF (IDEVT.GT.0) THEN
+        WRITE(IDEVT,REC=I)SWAPBO(LPOS),SWAPBO(LLEN),
+     +SWAPBO(LCOM),LTYPE,SWAPBO(NEXT)
+      ENDIF
+      END
+      SUBROUTINE READCC(IDEVC,I,ICNT,NEXT,NOTE)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+C  COMMENT_LENGTH - 4
+      CHARACTER DUMM*36
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      IF(IDEVC.GT.0)THEN
+        READ(IDEVC,REC=1)NEXT,ICNT,DUMM
+        NEXT = SWAPBO(NEXT)
+        ICNT = SWAPBO(ICNT)
+        IF(I.EQ.0.OR.I.GT.ICNT)THEN
+          NEXT = 0
+          NOTE = ' '
+        ELSE
+          READ(IDEVC,REC=I)NEXT,NOTE
+          NEXT = SWAPBO(NEXT)
+        ENDIF
+      ENDIF
+      END
+      SUBROUTINE WRITCC(IDEVC,I,ICNT,NEXT,NOTE)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+C  COMMENT_LENGTH - 4
+      CHARACTER DUMM*36
+      INTEGER SWAPBO
+      EXTERNAL SWAPBO
+      IF(IDEVC.GT.0)THEN
+        IF(I.EQ.1) THEN
+          WRITE(IDEVC,REC=1)SWAPBO(NEXT),SWAPBO(ICNT),DUMM
+        ELSE
+          READ(IDEVC,REC=1)IDUM,ICNT,DUMM
+          ICNT = SWAPBO(ICNT)
+          IF(I.LE.ICNT) THEN
+             WRITE(IDEVC,REC=I)SWAPBO(NEXT),NOTE
+          ENDIF
+        ENDIF
+      ENDIF
+      END
+      SUBROUTINE PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,NC,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),POSN,X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+C   NOW GET THIS GEL FROM DISK
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      X=POSN-RELPG(LLINO)+1
+      K=X
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+C   CHECK FOR OVER END OF ARRAY
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,1000)LLINO
+1000    FORMAT(
+     +' Data pushed off end of gel',I4,' during padding')
+        M=M-(N-MAXGEL)
+        N=MAXGEL
+        LNGTHG(LLINO)=SIGN(MAXGEL,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        GEL(I)=PAD
+60    CONTINUE
+C   WRITE BACK TO DISK
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+C   NOW UPDATE TAG FILES ACCORDINGLY
+      CALL PADTAG(LLINO,K,NC,LNGTHG(LLINO))
+65    CONTINUE
+C   NOW GET NEXT GEL
+      LLINO=RNBR(LLINO)
+C   LAST GEL?
+      IF(LLINO.EQ.0)GO TO 70
+C   DOES IT HAVE DATA IN REGION?
+C   IE DO RELPG  AND RELPG+LNGTHG-1 LIE EITHER SIDE OF POSN?
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+C  WITHIN
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+NC
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 76
+90    CONTINUE
+C   NEED TO INCREMENT CONTIG LINE
+      RELPG(LINCON)=RELPG(LINCON)+NC
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      END
+      SUBROUTINE AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK,IDEVC,IDEVT)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LLINO(2),ITYPE(2),IFAIL
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER WINDOW
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      IFAIL = 0
+      IF(NGELS.LT.1) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MINSLI = 3
+      MN = 0
+      MX = 50
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per sequence',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MAXPC = IVAL
+        RMN = 0.
+        RMX = 100.
+        PERMAX = 75.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      MN = MINMAT
+      MX = MAXGEL
+      WINDOW = 100
+      CALL GETINT(MN,MX,WINDOW,
+     +'Probe length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+       IOK = 1
+       I = 0
+       CALL YESNO(I,'Use clipped data',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+       IF(I.LT.0) RETURN
+       IWING = 0
+       IF(I.EQ.0) THEN
+         MN = 1
+         MX = MAXGEL
+         IWING = 100
+         CALL GETINT(MN,MX,IWING,
+     +   'Window size for good data scan',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) RETURN
+         IWING = IVAL
+         MN = 1
+         MX = MIN(100,IWING)
+         NBAD = MIN(IWING,5)
+         CALL GETINT(MN,MX,NBAD,
+     +   'Maximum number of dashes in scan window',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) RETURN
+         NBAD = IVAL
+       END IF
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      IDIM2=MAXGEL
+C
+C Note I am doing something horrible here to save space:
+C temp3 is used for cends, nends and as temp3. I can do this
+C because i call fndcon to reinstate cends ,nends after each
+C use of temp3 for the consensus calc. However iladd and iradd
+C need the same amount of storage as cends and nends so i use
+C the arrays cends and nends for them. iladd and iradd are not
+C updated after use of temp3. The switch in array use is made 
+C by the following subroutine call
+C
+C replaced following line 25-9-92
+C     +SAV1,SAV2,SAV3,MAXSAV,TEMP3,TEMP3(MAXCON+1,1),MAXCON,
+C
+      CALL AUTOJN(SEQ1,MAXSEQ,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,TEMP3,TEMP3(1,1+MAXCON/2),MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,CENDS,NENDS,RELPG,LNBR,IDBSIZ,NGELS,NCONTS,
+     +LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,PERCD,IWING,NBAD,
+     +ECHRSZ,SEQ5,TEMP3,MAXGL2,IDM,NAMPRO,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      END
+C
+      SUBROUTINE AUTOJN(SEQ1,MAXSEQ,GEL,IDIMGI,ILEFTS,
+     +ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,ILADD,IRADD,RELPG,LNBR,IDBSIZ,NGELS,NCONTS,
+     +LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,PERCD,IWING,NBAD,
+     +ECHRSZ,SEQ5,TEMP3,MAXGL2,IDM,NAMPRO,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER ECHRSZ
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(MAXSEQ),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM),NAMPRO*(*),HELPF*(*)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON),ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(MAXSEQ),GEL(MAXGLM),SEQ5(MAXGLM)
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL,ITYPE(2)
+      PARAMETER (MAXC = 100)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC),MCON(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),WINDOW,CSTART
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ),LNGTHG(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      INTEGER CLINNO,CHAINL,XVERSN
+      EXTERNAL CLINNO,CHAINL,XVERSN
+      IEXT = 0
+      IF (IWING.NE.0) IEXT = 1
+      IFAIL = 1
+      KFAIL = 0
+C
+C calc the consensus and add the unused data if required
+C save the lengths of the unused data in iladd, iradd
+C
+      CALL FILLI(ILADD,NCONTS,0)
+      CALL FILLI(IRADD,NCONTS,0)
+      IDIM = 0
+      JOB = 0
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,GEL,IDBSIZ,IDIM,JOB,KDUMM,KDUMM,KDUMM,KDUMM,
+     +TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL,MAXGEL,IDM,PERCD,SEQ5,
+     +ILADD,IRADD,MAXCON,KDUMM,IWING,NBAD)
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ1,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C      WRITE(*,*)'IDIM',IDIM
+C      WRITE(*,*)(CENDS(K),NENDS(K),K=1,IDCEND)
+C
+C find possible missed joins
+C we have consensus in seq1 in order first contig,second contig etc
+C compare the ends in reverse order, simultaneously shortening the consensus
+C the total consensus length is denoted by idimin
+C but we compare the last contig to the rest so denote the length
+C of the rest by idim
+C
+      NFOUND = 0
+      NMADE = 0
+C
+C init hashing constants
+C
+      CALL INITE(CONST,CSTART,LENGTH)
+C
+C idim is the length up to the end of the last but 1 consensus
+C (we get from the start of the last contig)
+C jcon is the last contig (ie the one we are currently comparing)
+C
+      JCON = IDCEND + 1
+C
+C come here for the next contig
+C
+ 10   CONTINUE
+C
+C
+      IDIMIN = IDIM
+C        WRITE(*,*)'IDIM,IDIMIN',IDIM,IDIMIN
+      JCON = JCON - 1
+C
+C come here if weve just made a join
+C
+ 20   CONTINUE
+C
+C
+C      WRITE(*,*)'*********COMPARING CONTIG',JCON
+      IF (JCON.GT.1) THEN
+C        WRITE(*,*)'IDCEND,JCON'
+C        WRITE(*,*)IDCEND,JCON
+C          WRITE(*,1222)
+C     + (K,NENDS(K),CENDS(K),ILADD(K),IRADD(K),K=1,JCON)
+C 1222     FORMAT(5I6)
+C          WRITE(*,*)(ILADD(K),IRADD(K),K=1,JCON)
+        IDIM = CENDS(JCON) - 1
+C        WRITE(*,*)'IDIM,IDIMIN',IDIM,IDIMIN
+        CALL ENCOF(SEQ1,IDIM,CONST,CSTART,LENGTH,POSNS)
+        CALL ENCONN(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH,1)
+C
+C point to probes at the ends of the current contig
+C
+        JS = CENDS(JCON) + 20
+        JE = CENDS(JCON+1) - 1
+        IEND = 1
+        IDIMG = MIN(WINDOW,JE-JS+1)
+C
+C check for case where contig is shorter than probe (window)
+C in which case only compare the left hand end
+C
+        IF(JE-JS+1.LE.WINDOW) IEND = 2
+C
+C come here for left end of contig
+C
+ 30     CONTINUE
+C
+C
+C        WRITE(*,*)'JCON,JS,JE,IDIMG'
+C        WRITE(*,*)JCON,JS,JE,IDIMG
+        IMATC = 0
+        IF(IEND.EQ.1) THEN
+          CALL SQCOPY(SEQ1(JE-IDIMG+1),GEL,IDIMG)
+        ELSE
+          CALL SQCOPY(SEQ1(JS),GEL,IDIMG)
+        END IF
+        CALL SQCOPY(GEL,GELCOP,IDIMG)
+        ISTRAN=1
+C
+C come here for strand 2
+C
+ 40     CONTINUE
+C
+C
+        CALL ENCOF(GEL,IDIMG,CONST,CSTART,LENGTH,GELN)
+        IDSAV=MAXSAV
+        CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +  SAVPG,SAVPS,SAVL,
+     +  IDSAV,SEQ1,GEL,MINMAT,IFCOMP,IDEV)
+        IF(IFCOMP.NE.0) THEN
+           CALL ERROM(KBOUT,'Error in CFGEL')
+           RETURN
+        END IF
+        IF(IDSAV.NE.0)THEN
+          CALL ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +    IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +    IMATC,ISTRAN,MAXC,KBOUT,MCON)
+        END IF
+        ISTRAN=ISTRAN+1
+C
+C have we done both strands for this probe ?
+C
+        IF(ISTRAN.EQ.2) THEN
+          CALL SQCOPY(GELCOP,GEL,IDIMG)
+          CALL SQREV(GEL,IDIMG)
+          CALL SQCOM(GEL,IDIMG)
+          GO TO 40
+        END IF
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        KSENSE = 0
+C        WRITE(KBOUT,*)'Total matches found',IMATC
+C        CALL FMTDB(SEQ1,IDIMIN,1,IDIMIN,60,KBOUT)
+        IF(IMATC.NE.0) THEN
+          JMATC = 0
+C
+C loop for all matches on both strands
+C
+          DO 100 I = 1,IMATC
+            IF(JSENSE(I).EQ.-1) THEN
+              IF(KSENSE.EQ.0) THEN 
+                CALL SQREV(GEL,IDIMG)
+                CALL SQCOM(GEL,IDIMG)
+                KSENSE = 1
+              END IF
+            END IF
+            JDIM22 = IDIMG
+            JDOUT = MAXGEL
+            IDSAV = MAXSAV
+            CALL ALINEJ(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +      SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +      JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +      JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,IDEV,
+     +      SEQ4,MAXGEL,PERMS,
+     +      NENDS(JCON),IEND,JLLINO(I),JSENSE(I),ILADD,IRADD,MAXCON,
+     +      MCON(I),
+     +      JCON,RELPG,LNBR,IDBSIZ,NCONTS,NFOUND,IENDCT,IENDGT)
+            IF (JFAIL.NE.0) GO TO 100
+C
+C if this is the x version and the results are being shown on the  screen
+C we can offer the join editor and make the join
+C
+            IF ((XVERSN().NE.0).AND.(IDEV.EQ.KBOUT)) THEN
+C
+C another horrible thing here: sending posns for use by dbchek after
+C a join is made (only used then)
+C
+              CALL USJED(
+     +        SEQ1,MAXSEQ,IDIMIN,IDIM,GEL,KBIN,KBOUT,MAXGEL,MAXGLM,
+     +        CENDS,NENDS,MAXCON,IFAIL,MCON(I),JCON,NAMPRO,
+     +        JSENSE(I), JLLINO(I),
+     +        ILADD,IRADD,RELPG,LNBR,IDBSIZ,NGELS,
+     +        NCONTS,
+     +        LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +        PERCD,IWING,NBAD,
+     +        IENDGT,IENDCT,SEQ4,IDCEND,IEXT,
+     +        ECHRSZ,SEQ5,TEMP3,MAXGL2,IDM,
+     +        IHELPS,IHELPE,HELPF,IDEVH,POSNS,NMADE,IOK)
+              IF (IFAIL.NE.0) GO TO 200
+              IF (NCONTS.EQ.1) GO TO 200
+              IF (IOK.EQ.2) GO TO 200
+C
+C if join has been made, assume everything is tidy and do this contig again
+C
+              IF (IOK.EQ.0) GO TO 20
+            END IF
+100       CONTINUE
+        END IF
+        IEND = IEND + 1
+C
+C have we done both ends of this contig ?
+C
+        IF(IEND.EQ.2) GO TO 30
+C
+C do the next contig
+C
+        GO TO 10
+      END IF
+C
+C jump here if only one contig left ! or user wants to escape
+C
+ 200  CONTINUE
+      WRITE(KBOUT,*)'Number of potential joins found',NFOUND
+      WRITE(KBOUT,*)'Number of joins made',NMADE
+      END
+      SUBROUTINE USJED(
+     +SEQ1,MAXSEQ,IDIMIN,IDIM,GEL,KBIN,KBOUT,MAXGEL,MAXGLM,
+     +CENDS,NENDS,MAXCON,IFAIL,MCON,JCON,NAMPRO,JSENSE,JLLINO,
+     +ILADD,IRADD,RELPG,LNBR,IDBSIZ,NGELS,NCONTS,
+     +LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,PERCD,IWING,NBAD,
+     +IENDGT,IENDCT,SEQ2,IDCEND,IEXT,
+     +ECHRSZ,SEQ5,TEMP3,MAXGL2,IDM,IHELPS,IHELPE,HELPF,IDEVH,
+     +TEMP1,NMADE,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER ECHRSZ
+      INTEGER CENDS(MAXCON),NENDS(MAXCON),ILADD(MAXCON),IRADD(MAXCON)
+      CHARACTER SEQ1(MAXSEQ),GEL(MAXGLM),SEQ5(MAXGLM),SEQ2(MAXGEL)
+      INTEGER JSENSE,JLLINO,MCON,TEMP1(IDBSIZ)
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ),LNGTHG(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      INTEGER CLINNO,CHAINL
+      CHARACTER HELPF*(*),NAMPRO*(*)
+      EXTERNAL CLINNO,CHAINL
+C      WRITE(*,*)'CURRENT MAX CONSENSUS LENGTH',MAXSEQ
+C      WRITE(*,*)'CURRENT CONSENSUS LENGTH',IDIM
+C      WRITE(*,*)'MAXGEL,MAXGLM,MAXCON',MAXGEL,MAXGLM,MAXCON
+C      WRITE(*,*)'NGELS,NCONTS',NGELS,NCONTS
+C
+C routine to offer use of join editor and if join is made tidy up
+C
+      IFAIL = 0
+      IOK = 2
+      IJOIN = 0
+      CALL YESNO(IJOIN,'Use the editor',
+     +     IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IJOIN.LT.0) RETURN
+      IOK = 1
+      IF (IJOIN.EQ.0) THEN
+C
+C weve called the editor
+C
+C
+C the contig names are nends(jcon) and jllino(i)
+C if jsense is -1 the contig to complement is nends(jcon)
+C the position of the overlap is iendct in jllino(i) and iendgt in nends(jcon)
+C we are going to need to know the endpoints of the consensuses to delete
+C the element number in nends of one contig is mcon(i) and the other is jcon
+C cends(i) stores the position relative to the consensus of the left end 
+C of contig i
+C
+C        WRITE(*,*)'NENDS(JCON)',NENDS(JCON)
+C        WRITE(*,*)'IDBSIZ,NCONTS',IDBSIZ,NCONTS
+        IF (JSENSE.EQ.-1) THEN
+          LINCON = CLINNO(LNBR,IDBSIZ,NCONTS,NENDS(JCON))
+          IF (LINCON.EQ.0) THEN
+C
+C major cockup here: weve lost the contig number
+C
+            CALL ERROM(KBOUT,
+     +      'Error: contig line not found for complementing')
+            IFAIL = 1
+            RETURN
+          END IF
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,NENDS(JCON),GEL,IDBSIZ,KBOUT,IDEV1,IDEV2,MAXGEL)
+          NENDS(JCON) = LNBR(LINCON)
+        END IF
+C
+C note the number of contigs
+C
+        NCIN = NCONTS
+C        WRITE(*,*)'JCON,NENDS(JCON)',JCON,NENDS(JCON)
+        LNCONL = CLINNO(LNBR,IDBSIZ,NCONTS,NENDS(JCON))
+        IF (LNCONL.EQ.0) THEN
+C
+C major cockup here: weve lost the contig number
+C
+          CALL ERROM(KBOUT,
+     +    'Error: contig line not found for left contig')
+          IFAIL = 1
+          RETURN
+        END IF
+C        WRITE(*,*)'I,MCON,NENDS(MCON)',I,MCON,NENDS(MCON)
+        LNCONR = CLINNO(LNBR,IDBSIZ,NCONTS,NENDS(MCON))
+        IF (LNCONR.EQ.0) THEN
+C
+C major cockup here: weve lost the contig number
+C
+          CALL ERROM(KBOUT,
+     +    'Error: contig line not found for right contig')
+          IFAIL = 1
+          RETURN
+        END IF
+C        WRITE(*,*)IDBSIZ,LNCONL,NENDS(JCON),LNCONR,NENDS(MCON)
+C        WRITE(*,*)IENDGT,IENDCT,PERCD,NGELS,NCONTS
+C
+C call the editor
+C
+        CALL JXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +  RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +  IDBSIZ,LNCONR,NENDS(MCON),LNCONL,NENDS(JCON),
+     +  0,IENDCT,0,IENDGT,PERCD,NGELS,NCONTS,IDM,IEXT,JOK)
+C
+C jok 0 = nothing done
+C     1 = left contig saved
+C     2 = right contig saved
+C     3 = both contigs saved
+C     7 = join made
+C we could use these values but we dont: we only count contigs.
+C if a contig is saved its consensus may be different. For now dont worry!!!!
+C
+C
+C note we may not have joined but may have either
+C complemented a contig or edited one! We shall ignore the edited
+C contig, but must uncomplement the contig. See below.
+C Remove the 2 consensus sequences
+C Add the new one to the right end
+C Update nends, cends
+C Update iladd iradd, and add the new data to the end of the list
+C to update iladd, iradd move all data left one element
+C
+        IF (NCIN.GT.NCONTS) THEN
+C
+C weve made a join
+C
+          NMADE = NMADE + 1
+C
+C check for consistency
+C
+          CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    TEMP1,IFAIL,KBOUT)
+          IF (IFAIL.NE.0) RETURN
+C
+C sometimes there may only be one contig now!
+C
+          IF (NCONTS.EQ.1) RETURN
+C
+C get contig left gel number
+C
+          LLINOC = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,NENDS(JCON))
+          IF (LLINOC.EQ.0) THEN
+            CALL ERROM(KBOUT,'Gel number not in a contig')
+            IFAIL = 1
+            RETURN
+          END IF
+C
+C get contig line number
+C
+          LINCON = CLINNO(LNBR,IDBSIZ,NCONTS,LLINOC)
+          IF (LINCON.EQ.0) THEN
+            CALL ERROM(KBOUT,'Error: contig line not found')
+            IFAIL = 1
+            RETURN
+          END IF
+C          WRITE(*,*)'DELETE CONSENSUS FROM',CENDS(MCON)
+C          WRITE(*,*)'TO',CENDS(MCON+1)
+C          WRITE(*,*)'SHORTEN THE CONSENSUS BY'
+C          WRITE(*,*)CENDS(MCON+1)-CENDS(MCON)
+C          WRITE(*,*)CENDS(JCON+1)-CENDS(JCON)
+C
+C shift the consensus to delete the internal consensus then
+C reduce idim,idimin accordingly (for both contigs). Finally
+C move the extension sizes into the appropriate elements.
+C
+          CALL SHFTLA(SEQ1,MAXSEQ,CENDS(MCON+1),CENDS(MCON),
+     +    IDIMIN)
+          IDIM = IDIM - (CENDS(JCON+1)-CENDS(JCON))
+          IDIMIN = IDIMIN - (CENDS(JCON+1)-CENDS(JCON))
+          IDIM = IDIM - (CENDS(MCON+1)-CENDS(MCON))
+          IDIMIN = IDIMIN - (CENDS(MCON+1)-CENDS(MCON))
+          DO 50 J=MCON,NCONTS-1
+            ILADD(J) = ILADD(J+1)
+            IRADD(J) = IRADD(J+1)
+ 50       CONTINUE
+C
+C make a consensus for the joined contig, putting it at the end
+C
+          JOB = 1
+          JCON = JCON - 1
+          CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +    SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIMIN,JOB,LLINOC,LINCON,1,
+     +    RELPG(LINCON),TEMP3,
+     +    ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL,MAXGEL,IDM,PERCD,SEQ5,
+     +    ILADD,IRADD,MAXCON,JCON,IWING,NBAD)
+          IF (IFAIL.NE.0) RETURN
+C
+C now restore the contig end positions because jcons has overwritten them
+C (they have changed anyway)
+C
+          CALL FNDCON(SEQ1,IDIMIN,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+          IOK = 0
+C
+C   end of join made
+C
+        ELSE
+C
+C   if we get here we have called the editor but not made a join
+C   so we might need to tidy up
+C
+          IF (JSENSE.EQ.-1) THEN
+            LINCON = CLINNO(LNBR,IDBSIZ,NCONTS,NENDS(JCON))
+            IF (LINCON.EQ.0) THEN
+C
+C major cockup here: weve lost the contig number
+C
+              CALL ERROM(KBOUT,
+     +        'Error: contig line not found for complementing')
+              IFAIL = 1
+              RETURN
+            END IF
+            CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +      LINCON,NENDS(JCON),GEL,IDBSIZ,KBOUT,IDEV1,IDEV2,
+     +      MAXGEL)
+            NENDS(JCON) = LNBR(LINCON)
+          END IF
+        END IF
+      END IF
+      END
+      SUBROUTINE ALINEJ(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,IDEV,
+     +SEQ3,MAXGEL,
+     +PERCM,JCONN,IEND,NCON,JSENSE,ILADD,IRADD,MAXCON,MCON,JCON,
+     +RELPG,LNBR,IDBSIZ,NCONTS,NFOUND,IENDCT,IENDGT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      INTEGER ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ),WINDOW
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+C
+C save idim2 as window
+C
+      WINDOW = IDIM2
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+      CALL JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IDEV,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW,
+     +NFOUND,IENDCT,IENDGT)
+      IF (ITOTPC.GT.MAXPC) IFAIL = 1
+      IF (ITOTPG.GT.MAXPG) IFAIL = 1
+      IF (PERCM.GT.PERMAX) IFAIL = 1
+      END
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IDEV,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW,
+     +NFOUND,IENDCT,IENDGT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH,STRAND,NAME1*6,NAME2*6
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      INTEGER CLINNO,WINDOW
+      EXTERNAL CLINNO
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+C
+C  where are the overlaps?
+C
+C  if ITYPE is 1 the overlap starts within the reading at JOINT
+C  else it starts at the left end of the reading at JOINT in the contig
+C
+C
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      X=REAL(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        CALL ERROM(KBOUT,'Matching region too long in JALIGN')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF(X.LT.PERMAX) THEN
+        IC = CLINNO(LNBR,IDBSIZ,NCONTS,JCONN)
+        IF(IC.EQ.0) THEN
+          CALL ERROM(KBOUT,'Error: contig line not found')
+          IFAIL = 99
+          RETURN
+        END IF
+        IF(JSENSE.EQ.-1) THEN
+          STRAND = '-'
+          IF(IEND.EQ.1) THEN
+C
+C  probe is complement of right hand end of contig. Give posns assuming
+C  this contig is complemented.
+C
+            IENDGT = IENDG - IRADD(JCON)
+C
+C Next line gives posns relative to original orientation
+C            IENDGT = RELPG(IC) + IRADD(JCON) - IENDG + 1
+          ELSE
+C
+C  probe is complement of left end of contig. Give posns assuming this
+C  contig is going to be complemented.
+C
+            IENDGT = RELPG(IC) - WINDOW + ILADD(JCON) + IENDG
+          END IF
+        ELSE
+          STRAND = '+'
+          IF(IEND.EQ.2) THEN
+C
+C  probe is left hand end of contig in original sense
+C
+            IENDGT = IENDG - ILADD(JCON)
+          ELSE
+C
+C  probe is right hand end of contig in original sense
+C
+            IENDGT = RELPG(IC) + IRADD(JCON) - WINDOW + IENDG
+          END IF
+        END IF
+        IENDCT = IENDC - ILADD(MCON)
+          WRITE(IDEV,*)
+     +    ' Possible join between contig ',JCONN,' in the ',
+     +    STRAND,' sense and contig ',NCON
+        WRITE(IDEV,1000)X
+1000    FORMAT(' Percentage mismatch after alignment = ',F4.1)
+        WRITE(NAME1,1002)JCONN
+        WRITE(NAME2,1002)NCON
+ 1002   FORMAT(I6)
+        CALL FMT4LP(SEQC2(1),SEQG2(IENDG),LO,IENDCT,IENDGT,IDEV,
+     +  NAME2,NAME1)
+        NFOUND = NFOUND + 1
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      END
+      SUBROUTINE ADISM6(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C new bit need to save contig number for alinej
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        MCON(IMATC) = JJ
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+C      JCONS
+      SUBROUTINE JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,JOB,LLINO,LINCON,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD,TGEL,ILADD,IRADD,MAXCON,JCON,
+     +IWIN,NBAD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ,ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),TGEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CALL BUSY(KBOUT)
+C      WRITE(*,*)'NGELS,NCONTS,IDIM1,ISTART,LREG,RREG'
+C      WRITE(*,*)NGELS,NCONTS,IDIM1,ISTART,LREG,RREG
+C JOB = 0 do all contigs, = 1 do one contig llino,lincon
+      IFAIL=0
+      IF (JOB.EQ.0) THEN
+        N=IDBSIZ-NCONTS
+        NCONS = 0
+        DO 110 I=N,IDBSIZ-1
+          J=LNBR(I)
+          X=1
+          Y=RELPG(I)
+          ISTART=ISTART+1
+          IF((ISTART+19+Y+2*MAXGEL).GT.IDIM1)THEN
+            WRITE(KBOUT,1009)IDIM1
+1009        FORMAT(
+     +      ' Maximum consensus length(',I6,') exceeded',/,
+     +      ' calculation aborted')
+            IFAIL=1
+            RETURN
+          END IF
+          CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+          NCONS = NCONS + 1
+          IDIN = 0
+          IF(IWIN.GT.0) THEN
+            IDIN = MAXGEL
+            CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,1,
+     +      GEL,TGEL,IDIN,IWIN,NBAD)
+            IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART),IDIN)
+          END IF
+          ILADD(NCONS) = IDIN
+          ISTART = ISTART + IDIN
+          CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,
+     +    MAXGEL,IDM,PERCD)
+          ISTART=ISTART+Y-1
+          IDIN = 0
+          IF(IWIN.GT.0) THEN
+            IDIN = MAXGEL
+            CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,2,
+     +      GEL,TGEL,IDIN,IWIN,NBAD)
+            IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART+1),IDIN)
+          END IF
+          IRADD(NCONS) = IDIN
+          ISTART = ISTART + IDIN
+110     CONTINUE
+      ELSE
+C
+C do it for one contig
+C
+        J = LLINO
+        I = LINCON
+        X = LREG
+        Y = RREG
+        ISTART = ISTART + 1
+C        WRITE(*,*)
+C     +'LLINO,LINCON,NCONTS,ISTART',LLINO,LINCON,NCONTS,ISTART
+        IF((ISTART+19+Y+2*MAXGEL).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+          IFAIL=1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        NCONS = NCONTS
+        IDIN = 0
+        IF(IWIN.GT.0) THEN
+          IDIN = MAXGEL
+          CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,1,
+     +      GEL,TGEL,IDIN,IWIN,NBAD)
+          IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART),IDIN)
+        END IF
+C        WRITE(*,*)'ILADD(JCON)',IDIN
+        ILADD(JCON) = IDIN
+        ISTART = ISTART + IDIN
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +    IDEVW,MAXGEL,IDM,PERCD)
+C repleced by next line 26-1-93        ISTART = ISTART + Y - 1
+        ISTART = ISTART + RREG - LREG
+        IDIN = 0
+        IF(IWIN.GT.0) THEN
+          IDIN = MAXGEL
+          CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,2,
+     +      GEL,TGEL,IDIN,IWIN,NBAD)
+          IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART+1),IDIN)
+        END IF
+C        WRITE(*,*)'IRADD(JCON)',IDIN
+        IRADD(JCON) = IDIN
+        ISTART = ISTART + IDIN
+      END IF
+      END
+      SUBROUTINE GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,ICONT,IEND,
+     +GEL,GELT,ID,IWIN,NBAD)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ),RELPG(IDBSIZ)
+      CHARACTER GEL(ID),GELT(ID)
+      EXTERNAL NOK2
+      MAXGEL = ID
+      IDT = 0
+C routine to find a possible extension to a contig by looking in a tag file
+C contig ICONT end IEND = 1 =left 2=right
+C return data in GEL, of length ID
+C The worst aspect of this is that if we dont choose the very end reading
+C we dont know where it lies relative to the consensus. For now just assume
+C there are no length difference between the extension and the consensus
+C and just add it on the end
+      IF(IEND.EQ.1) THEN
+        LMOST = 1
+        IGEL = LNBR(ICONT)
+ 10     CONTINUE
+        IF(IGEL.EQ.0) GO TO 100
+        IF(RELPG(IGEL).GT.MAXGEL) GO TO 100
+        IF(LNGTHG(IGEL).LT.0) THEN
+          ID = MAXGEL
+          CALL GETEXT(IGEL,GELT,ID,IOK)
+          IF(IOK.EQ.0) THEN
+C  look for number of n's here and change id accordingly
+            K =  NOK2(GELT,ID,IWIN,NBAD)
+            LT = MIN(LMOST,RELPG(IGEL)-K)
+            IF(LT.LT.LMOST) THEN
+              IS = RELPG(IGEL)
+              N = K - IS + 1
+              CALL SQCOPY(GELT(IS),GEL,N)
+              CALL SQREV(GEL,N)
+              CALL SQCOM(GEL,N)
+              IDT = N
+              LMOST = LT
+            END IF
+          ELSE
+C            WRITE(*,*)'COCKUP IN GETEXT, gel',IGEL
+          END IF
+        END IF
+        IGEL = RNBR(IGEL)
+        GO TO 10
+      ELSE
+        IGEL = RNBR(ICONT)
+        LMOST = RELPG(ICONT)
+        IDC = RELPG(ICONT)
+ 20     CONTINUE
+        IF(IGEL.EQ.0) GO TO 100
+        IF(LMOST-RELPG(IGEL).GT.MAXGEL) GO TO 100
+        IF(LNGTHG(IGEL).GT.0) THEN
+          ID = MAXGEL
+          CALL GETEXT(IGEL,GELT,ID,IOK)
+          IF(IOK.EQ.0) THEN
+            K =  NOK2(GELT,ID,IWIN,NBAD)
+            LT = MAX(LMOST,RELPG(IGEL)+LNGTHG(IGEL)+K-1)
+            IF(LT.GT.LMOST) THEN
+              IS = RELPG(ICONT) - (RELPG(IGEL) + LNGTHG(IGEL)) + 2
+              N = K - IS + 1
+              CALL SQCOPY(GELT(IS),GEL,N)
+              IDT = N
+              LMOST = LT
+            END IF
+          ELSE
+C            WRITE(*,*)'COCKUP IN GETEXT, GEL',IGEL
+          END IF
+        END IF
+        IGEL = LNBR(IGEL)
+        GO TO 20
+      END IF
+ 100  CONTINUE
+      ID = IDT
+      END
+      INTEGER FUNCTION NOK2(GEL,ID,IWIN,NBADIN)
+      CHARACTER GEL(ID)
+      PARAMETER (MAXPOS = 101)
+      INTEGER POSNS(MAXPOS),R
+      EXTERNAL KWRAP
+C count N's over a window of iwin, return position
+C when over NBAD
+C INIT
+      NBAD = NBADIN + 1
+      I = 0
+      N = 0
+      NOK2 = ID
+      L = 1
+      R = 0
+      IF(NBAD.GT.MAXPOS)THEN
+        WRITE(*,*)'Scream: nok2 not happy'
+        RETURN
+      END IF
+ 10   CONTINUE
+      I = I + 1
+      IF(I.GT.ID) RETURN
+      IF(GEL(I).EQ.'-') THEN
+        N = N + 1
+        R = KWRAP(R,NBAD)
+        POSNS(R) = I
+        IF(N.GE.NBAD) THEN
+          IF(POSNS(R)-POSNS(L)+1.LT.IWIN) THEN
+            NOK2 = POSNS(L)
+            RETURN
+          END IF
+          L = KWRAP(L,NBAD)
+        END IF
+      END IF
+      GO TO 10
+      END
+      INTEGER FUNCTION KWRAP(I,J)
+      IT = I + 1
+      IF(IT.GT.J) IT = 1
+      KWRAP = IT
+      END
+      SUBROUTINE PADSHF(RELPG,LNGTHG,LNBR,RNBR,
+     +READS,IDIM1,LREG,RREG,LLINO,IDBSIZ,FREQS,CHRSIZ,MAXGL2,
+     +IDEVW,MAXGEL,READPS,READNS,PSTART,NPADS,MXG,PADPOS,MAXP,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER READS(IDIM1),PAD,DASH
+C      PARAMETER (MXG = 1000)
+C      PARAMETER (MAXP = MXG * 20)
+      INTEGER FREQS(CHRSIZ,MAXGL2),READPS(MXG),READNS(MXG)
+      INTEGER CR,SHIFTF,READST,PCF,WINS,WINE,PG,PC,PCR,CHNRP1
+      INTEGER PADPOS(MAXP),PSTART(MXG),NPADS(MXG),PADST,PGE
+      LOGICAL FIRSTP,PASS1
+      EXTERNAL INDEXS,CHNRP1,NSAMPL
+      SAVE PAD,DASH
+      DATA PAD/'*'/,DASH/'-'/
+C
+C region to process is lreg to rreg
+C current active region is wins to wine
+C current reading is cr
+C we process all the readings covering wins to wine,
+C write them to disk, then move wins and wine, and get the first
+C reading that covers them, etc
+C we save readings in reads and their numbers in readns
+C and their start positions in reads in readps
+C we accumulate frequencies in freqs
+C
+      NEDITS = 0
+      NSWAPT = 0
+      INDPAD = INDEXS(PAD,JUNK)
+      WINS = LREG
+      WINE = MIN(WINS + MAXGL2 - 1,RREG)
+1     CONTINUE
+      CR =  CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LLINO,WINS)
+      IF (CR.EQ.0) RETURN
+      DO 10 I = 1,MAXGL2
+        DO 10 J = 1,CHRSIZ
+           FREQS(J,I) = 0
+10    CONTINUE
+      NREADS = 0
+      READST = 1
+      PADST  = 1
+      PASS1  = .TRUE.
+20    CONTINUE
+      CALL READW(IDEVW,CR,READS(READST),MAXGEL)
+      IF (NREADS.EQ.MXG) THEN
+        CALL ERROM(KBOUT,'Too many reads for buffer')
+        RETURN
+      END IF
+      NREADS = NREADS + 1
+      READNS(NREADS) = CR
+      READPS(NREADS) = READST
+      READST = READST + ABS(LNGTHG(CR))
+      SHIFTF = RELPG(CR) - WINS
+      DO 30 I = MAX(1,WINS-RELPG(CR) + 1),
+     +          MIN(ABS(LNGTHG(CR)),WINE-RELPG(CR)+1)
+        JJ = INDEXS(READS(READPS(NREADS)+I-1),JSCORE)
+        JJJ = SHIFTF + I
+        FREQS(JJ,JJJ) = FREQS(JJ,JJJ) + 1
+30    CONTINUE
+      IF (RNBR(CR).NE.0) THEN
+        CR = RNBR(CR)
+        IF (RELPG(CR).LE.WINE) GO TO 20
+      END IF
+      DO 35 I=1,NREADS
+        NPADS(I) = 0
+ 35   CONTINUE
+40    CONTINUE
+      NSWAP = 0
+C pg is position in reading ir
+C PCR is position in reads
+C pc is position in contig
+C pcf is position in freqs
+C
+C we swap if chars to left and right are not already pads and
+C theres at least one of the char to the left in the current column
+C or the char to the left is a dash (not always beneficial) and
+C the pads are not already aligned with only 1 other symbol
+C
+C we put the element number of the first pad for each reading at padst
+C we count the number of pads in npads
+C we store the pad positions in padpos
+C
+      DO 60 IR = 1,NREADS
+        SHIFTF = WINS - 1
+        FIRSTP = .TRUE.
+        DO 50 PG = MAX(2,WINS-RELPG(READNS(IR))+2),
+     +  MIN(ABS(LNGTHG(READNS(IR))),WINE-RELPG(READNS(IR))+1)
+          PCR = PG + READPS(IR) - 1
+C
+C horrible exception for case where base 1 in read is pad
+C we wont see it this pass but will when we write out tags
+C so make sure we do see it now or we cockup later
+C
+          IF (PASS1) THEN
+            IF (PG.EQ.2) THEN
+              IF (READS(PCR-1).EQ.PAD) THEN
+                PSTART(IR) = PADST
+                NPADS(IR) = NPADS(IR) + 1
+                PADPOS(PADST) = PG - 1
+                IF (PADST.EQ.MAXP) THEN
+                  CALL ERROM(KBOUT,'Too many pads for buffer')
+                  RETURN
+                END IF
+                PADST = PADST + 1
+                FIRSTP = .FALSE.
+              END IF
+            END IF
+          END IF
+          IF (READS(PCR).EQ.PAD) THEN
+            IF (PASS1) THEN
+              IF (FIRSTP) PSTART(IR) = PADST
+              NPADS(IR) = NPADS(IR) + 1
+              PADPOS(PADST) = PG
+              IF (PADST.EQ.MAXP) THEN
+                CALL ERROM(KBOUT,'Too many pads for buffer')
+                RETURN
+              END IF
+              PADST = PADST + 1
+              FIRSTP = .FALSE.
+            END IF
+            PC = PG + RELPG(READNS(IR)) - 1
+            INPCRM = INDEXS(READS(PCR-1),JSCORM)
+            IF (INPCRM.NE.INDPAD) THEN
+              PCF = PC - SHIFTF
+              IF (((FREQS(INPCRM,PCF).NE.0).OR.
+     +        (READS(PCR-1).EQ.DASH)).AND.
+     +        (FREQS(INDPAD,PCF).LT.(NSAMPL(FREQS(1,PCF),CHRSIZ)-1)))
+     +        THEN
+C
+C swap
+C
+                READS(PCR) = READS(PCR-1)
+                READS(PCR-1) = PAD
+                INPCR = INDEXS(READS(PCR),JSCORE)
+                FREQS(INPCRM,PCF-1) = FREQS(INPCRM,PCF-1) - 1
+                FREQS(INPCR,PCF)    = FREQS(INPCR,PCF)    - 1
+                FREQS(INPCR,PCF-1)  = FREQS(INPCR,PCF-1)  + 1
+                FREQS(INPCRM,PCF)   = FREQS(INPCRM,PCF)   + 1
+                NSWAP = NSWAP + 1
+              END IF
+            END IF
+          END IF
+50      CONTINUE
+60    CONTINUE
+C      WRITE(*,*)'NUMBER OF SWAPS',NSWAP
+      NSWAPT = NSWAPT + NSWAP
+      PASS1 = .FALSE.
+      IF (NSWAP.GT.0) GO TO 40
+C
+C no more movement so write back wot weve changed so far
+C
+      DO 70 IR = 1,NREADS
+        K = 0
+        NPAD = 0
+        DO 65 PG = MAX(1,WINS-RELPG(READNS(IR))+1),
+     +  MIN(ABS(LNGTHG(READNS(IR))),WINE-RELPG(READNS(IR))+1)
+          PCR = PG + READPS(IR) - 1
+          IF (READS(PCR).EQ.PAD) THEN
+            L = PADPOS(PSTART(IR)+NPAD)
+            IF (PG.LT.L) THEN
+              PC = PG + RELPG(READNS(IR)) - 1
+              LE = L
+              PGE = PG
+              IF (LNGTHG(READNS(IR)).LT.0) THEN
+                LE = 1 - LNGTHG(READNS(IR)) - LE
+                PGE = 1 - LNGTHG(READNS(IR)) - PGE
+C         WRITE(*,*)READNS(IR),' insert at',PGE,PC,' del at',LE
+                CALL DELEDT(READNS(IR),LE,'*')
+                CALL INSEDT(READNS(IR),PGE,'*')
+              ELSE
+C         WRITE(*,*)READNS(IR),' del at',LE,' insert at',PGE,PC
+                CALL DELEDT(READNS(IR),LE,'*')
+                CALL INSEDT(READNS(IR),PGE,'*')
+              END IF
+              NEDITS = NEDITS + 1
+            END IF
+            NPAD = NPAD + 1
+          END IF
+ 65     CONTINUE
+        CALL WRITEW(IDEVW,READNS(IR),READS(READPS(IR)),MAXGEL)
+70    CONTINUE
+C
+C we allow the next chunk to overlap the previous one by 20
+C in case some pads at the right edge of it could still move
+C further left. Alternatively we could look for 2 adjacent
+C positions which have 100% consensus and are different
+C
+      IF (WINE.NE.RREG) THEN
+        WINS = MAX(LREG,WINE-19)
+        WINE = MIN(WINS + MAXGL2 - 1,RREG)
+        GO TO 1
+      END IF
+C      WRITE(*,*)'Total swaps',NSWAPT
+      WRITE(KBOUT,*)'Number of pads moved',NEDITS
+      END
+      INTEGER FUNCTION NSAMPL(COUNTS,IDM)
+C
+C count the number of reads covering this position
+C
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5    CONTINUE
+      NSAMPL = ISUM
+      END
+      INTEGER FUNCTION CONOK(
+     +RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEVW,IDEV,
+     +MAXGEL,IDM,PERCD,CENDS,NENDS,MAXCON)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER ECHRSZ,TEMP3(ECHRSZ,MAXGL2),CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ(MAXSEQ),SEQ2(MAXGEL),NAMPRO*(*)
+      EXTERNAL NOK2
+C
+C routine to calc a consensus and look for ibad dashes in iwin bases
+C
+C returns number of cocked-up contigs or -1 for error in consensus calc
+C for each contig it stops where it find the first problem.
+C
+      CONOK = 0
+      JOB = 0
+      IDIM1 = 0
+      CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,IDIM1,JOB,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+      IF(IFAIL.NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        CONOK = -1
+        RETURN
+      END IF
+      CALL FNDCON(SEQ,IDIM1,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      IWIN = 20
+      NBAD = 15
+      DO 10 I=1,IDCEND
+        J = CENDS(I) + 20
+        IDIM = CENDS(I+1) - J
+        K =  NOK2(SEQ(J),IDIM,IWIN,NBAD)
+        IF (K.LT.IDIM) THEN
+          WRITE(IDEV,*)'Problem at position',K,' In contig',NENDS(I)
+          CONOK = CONOK + 1
+        END IF
+ 10   CONTINUE
+      IF (CONOK.EQ.0) THEN
+        WRITE(KBOUT,*)
+     +' Consensus has no segments with',NBAD,' dashes in',IWIN
+      ELSE
+        WRITE(KBOUT,*)
+     +  CONOK,' contigs have segments with',NBAD,' dashes in',IWIN
+      END IF
+      END
+      SUBROUTINE AUTOM(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,LLINO,LINCON,MAXGEL,
+     +TEMP3,SEQ1,MAXSEQ,SEQ2,SEQ5,SEQC2,SEQG2,
+     +MAXGLM,MAXGL2,ECHRSZ,
+     +SAV1,SAV2,SAV3,MAXSAV,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK,IDEVC,IDEVT)
+      INTEGER ECHRSZ,SELCON
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+C
+C take the poor data from reads to see if it aligns with the consensus
+C
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      IFAIL = 0
+      IF(NGELS.LT.1) RETURN
+C      MN = 0
+C      MX = 50
+C      MAXPG = 20
+C      CALL GETINT(MN,MX,MAXPG,
+C     +'Maximum pads per sequence',
+C     +IVAL,KBIN,KBOUT,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MAXPC = IVAL
+      RMN = 0.
+      RMX = 100.
+      PERMAX = 13.
+      CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PERMAX = VAL
+      MN = 1
+      MX = MAXGEL
+      IWING = 100
+      CALL GETINT(MN,MX,IWING,
+     +   'Window size for good data scan',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IWING = IVAL
+      MN = 1
+      MX = MIN(100,IWING)
+      NBAD = MIN(IWING,5)
+      CALL GETINT(MN,MX,NBAD,
+     +   'Maximum number of dashes in scan window',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      NBAD = IVAL
+      CALL YESNO(IREPN,'Save failed names in a file',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF (IREPN.LT.0) RETURN
+      IF (IREPN.EQ.0) THEN
+        CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +  'File for names of failed reads',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF (IOK.NE.0) RETURN
+      END IF
+      CALL YESNO(SELCON,'Select contigs',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(SELCON.LT.0) GO TO 100
+      IF(SELCON.EQ.0) THEN
+        CALL GETLN3(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,NULGEL,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +  'Contig identifier',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) GO TO 100
+        CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 100
+      END IF
+       IDIM2=MAXGEL
+      CALL AUTOMN(SEQ1,MAXSEQ,SEQ2,IDIM2,
+     +LLINO,LINCON,IFCOMP,LREG,RREG,
+     +IDEV,MAXGEL,MAXGLM,
+     +SAV1,SAV2,SAV3,MAXSAV,
+     +SEQG2,SEQC2,
+     +PERMAX,SEQG3,SEQC3,
+     +RELPG,LNBR,IDBSIZ,NGELS,NCONTS,
+     +LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,PERCD,IWING,NBAD,
+     +ECHRSZ,TEMP3,MAXGL2,IDM,NAMPRO,KBIN,KBOUT,SELCON,IREPN,IDEV8,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+ 100  CONTINUE
+      IF(IREPN.EQ.0) CLOSE(UNIT=IDEV8)
+      END
+      SUBROUTINE AUTOMN(SEQ1,MAXSEQ,GEL,IDIMGI,
+     +LLINO,LINCON,IFCOMP,LREG,RREG,
+     +IDEV,MAXGEL,MAXGLM,
+     +SAVPS,SAVPG,SAVL,MAXSAV,
+     +SEQG2,SEQC2,
+     +PERMAX,SEQG3,SEQC3,
+     +RELPG,LNBR,IDBSIZ,NGELS,NCONTS,
+     +LNGTHG,RNBR,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,PERCD,IWING,NBAD,
+     +ECHRSZ,TEMP3,MAXGL2,IDM,NAMPRO,KBIN,KBOUT,SELCON,IREPN,IDEV8,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER ECHRSZ
+      INTEGER SAVPS(MAXSAV),SAVPG(MAXSAV),SAVL(MAXSAV)
+      CHARACTER NAMPRO*(*),HELPF*(*)
+      CHARACTER SEQ1(MAXSEQ),GEL(MAXGLM)
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      CHARACTER NAMT*16
+      PARAMETER (MAXCON = 1,MING = 10,MAXG = 500)
+      INTEGER ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER RREG,FIRSTR,SELCON,FIRSTC
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ),LNGTHG(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP3(ECHRSZ,MAXGL2)
+      INTEGER CHNRP1,XVERSN
+      EXTERNAL CHNRP1,XVERSN,NOK2
+      JOB = 1
+      JCON =  1
+      NPROBS = 0
+      IF (SELCON.EQ.0) THEN
+        FIRSTC = LINCON
+        LASTC  = LINCON
+      ELSE IF (SELCON.EQ.1) THEN
+        FIRSTC = IDBSIZ-NCONTS
+        LASTC  = IDBSIZ-1
+      ELSE
+        CALL ERROM(KBOUT,'Error in AUTOM: unexpected option')
+        RETURN
+      END IF
+      DO 50 ICONT = FIRSTC,LASTC
+        IF (SELCON.EQ.1) THEN
+          LINCON = ICONT
+          LLINO = LNBR(LINCON)
+          LREG = 1
+          RREG = RELPG(LINCON)
+        END IF          
+        IDIM = 0
+        CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +   SEQ1,MAXSEQ,GEL,IDBSIZ,IDIM,JOB,LLINO,LINCON,LREG,RREG,
+     +   TEMP3,
+     +    ECHRSZ,MAXGL2,KBOUT,IDEV2,IOK,MAXGEL,IDM,PERCD,SEQG3,
+     +    ILADD,IRADD,MAXCON,JCON,0,NBAD)
+        IF (IOK.NE.0) RETURN
+C      CALL FMTDB(SEQ1,IDIM,1,IDIM,60,KBOUT)
+        LENCON = RREG - LREG + 1
+        ISTRAN = 1
+ 5      CONTINUE
+C
+C main loop
+C
+        FIRSTR = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  LNBR(LINCON),LREG)
+        IF(FIRSTR.EQ.0) GO TO 50
+        IRNO = FIRSTR
+ 10     CONTINUE
+        IF (LNGTHG(IRNO).GT.0) THEN
+          IDIN = MIN(MAXG-LNGTHG(IRNO)+1,MAXGLM)
+          IF(IDIN.GT.MING) THEN
+            CALL GETEXT(IRNO,GEL,IDIN,IOK)
+            IF(IOK.EQ.0) THEN
+              K = NOK2(GEL,IDIN,IWING,NBAD)
+              IDIN = K
+            ELSE
+              IDIN = 0
+            END IF
+          ELSE
+            IDIN = 0
+          END IF
+C
+C fudge idin (it seems to return 1 less char than its value!)
+C
+          IDIN = IDIN - 2
+C        WRITE(*,*)'IDIN',IDIN
+          IF(IDIN.GT.MING) THEN
+C
+C trim gel if it goes off the end of the consensus!
+C
+            IDIN = MIN((LENCON-(RELPG(IRNO)+LNGTHG(IRNO)-1)),IDIN)
+C          CALL FMTDB(GEL,IDIN,1,IDIN,60,KBOUT)
+            JDIM22 = IDIN
+            JDOUT = MAXGEL
+            IDSAV = MAXSAV
+            JPOSC = RELPG(IRNO)+LNGTHG(IRNO) - LREG + 1
+            LC = LENCON - JPOSC + 1
+            IPC = RELPG(IRNO) + LNGTHG(IRNO) - 1
+C
+C fudge the consensus position as we dont seem to get alignments for
+C cases where a deletion is required at the start of the consensus
+C
+C          CALL ALINEM(SEQ1(20+JPOSC),GEL,SEQG3,SEQC3,
+            CALL ALINEM(SEQ1(18+JPOSC),GEL,SEQG3,SEQC3,
+     +      SAVPS,SAVPG,SAVL,IDSAV,LC,JDIM22,JDOUT,
+     +      1,1,
+     +      JFAIL,PERMAX,KBOUT,IDEV,
+     +      MAXGEL,PERMS,IPC,IRNO,LNBR(LINCON),IFAIL)
+            IF (IFAIL.EQ.0) THEN
+              IF (JFAIL.NE.0) NPROBS = NPROBS + 1
+              IF ((JFAIL.NE.0).AND.(IREPN.EQ.0)) THEN
+                CALL READN(IDEV3,IRNO,NAMT)
+                WRITE(IDEV8,1000)NAMT,IRNO,LLINO,IPC
+ 1000           FORMAT(' ',A,' ',3I6)
+              END IF
+              IF((JFAIL.NE.0).AND.(XVERSN().NE.0).AND.(IDEV.EQ.KBOUT)) 
+     +          THEN
+                CALL BPAUSE(KBIN,KBOUT,IOK)
+                IF (IOK.NE.0) GO TO 100
+                CALL YESNO(IJOIN,'Use the editor',
+     +          IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+                IF(IJOIN.LT.0) GO TO 100
+                IF (IJOIN.EQ.0) THEN
+                  CALL CXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +            RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +            IDBSIZ,LINCON,LNBR(LINCON),IRNO,LNGTHG(IRNO),
+     +            PERCD,IDM,1,IOK)
+                END IF
+                CALL BUSY(KBOUT)
+              END IF
+            END IF
+          END IF
+        END IF
+        IRNO = RNBR(IRNO)
+        IF(IRNO.NE.0) THEN
+          IF((RELPG(IRNO)+LNGTHG(IRNO)).LT.RREG) GO TO 10
+        END IF
+        IF (ISTRAN.EQ.1) THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +    LNBR(LINCON),GEL,IDBSIZ,KBOUT,IDEV1,IDEV2,MAXGEL)
+          LREG = RELPG(LINCON) - RREG + 1
+          RREG = LREG + LENCON - 1
+          CALL SQREV(SEQ1(21),LENCON)
+          CALL SQCOM(SEQ1(21),LENCON)
+          ISTRAN = 2
+          GO TO 5
+        END IF
+        CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LNBR(LINCON),GEL,IDBSIZ,KBOUT,IDEV1,IDEV2,MAXGEL)
+        ISTRAN  = 3
+ 50   CONTINUE
+ 100  CONTINUE
+      WRITE(KBOUT,*)'Number of possible problems ',NPROBS
+C
+C if required return the contig to original sense
+C
+      IF (ISTRAN.EQ.2) THEN
+        CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LNBR(LINCON),GEL,IDBSIZ,KBOUT,IDEV1,IDEV2,MAXGEL)
+      END IF
+      END
+      SUBROUTINE ALINEM(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,
+     +IFAIL,PERMAX,KBOUT,IDEV,MAXGEL,PERCM,IPC,IRNO,LLINO,JFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+C      INTEGER PATH(5000)
+      CHARACTER NAME1*6,NAME2*6
+      PARAMETER (MINOVR = 10, MINSLT = 3)
+      EXTERNAL PMIS
+C
+C the crap must be at least minovr bases in length or we dont bother
+C
+      JFAIL = 1
+      IF((IDC.LT.MINOVR).OR.(IDIM2.LT.MINOVR)) RETURN
+      WRITE(NAME1,1002)LLINO
+      WRITE(NAME2,1002)IRNO
+1002  FORMAT(I6)
+      NAME1(1:1) = 'C'
+      NAME2(1:1) = 'R'
+      IDC1 = MIN(IDC,IDIM2)
+      X = MAX(1.0,REAL(IDC1)/100.)
+C
+C allowing 10% and 7% of overlap to be pads: consensus and read respectively
+C
+      MAXPC = NINT(10.0*X)
+      MAXPG = NINT(7.0*X)
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ2,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF((IPP.GT.IDSAV).OR.(IPP.LT.1)) THEN
+        CALL ERROM(KBOUT,'Warning: slider failed')
+        CALL ERROM(KBOUT,NAME2)
+        RETURN
+      END IF
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      CALL UPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,JFAIL)
+      IF(JFAIL.NE.0) THEN
+        CALL ERROM(KBOUT,'Warning: lineup failed')
+        CALL ERROM(KBOUT,NAME2)
+        RETURN
+      END IF
+C      IDC1 = MIN(IDC,IDIM2+MAX(MAXPG,MAXPC))
+C      IDC1 = MIN(IDC,IDIM2)
+C      WRITE(*,*)'IDC,IDIM2',IDC,IDIM2
+C      MMAXPC = -1*(MAXPC+MAXPC/2)
+C      X = MAX(1.0,REAL(IDC1)/100.)
+C      MMAXPC = NINT(-10.*X)
+C      MMAXPG = NINT(7.*X)
+C      MAXPC = -MMAXPC
+C      MAXPG = MMAXPG
+C      IDIM3 = IDC1
+C      WRITE(*,*)'PADS ALLOWED',MAXPC,MAXPG
+C      CALL GALIGN(SEQ1,IDC1,SEQ2,IDIM3,MMAXPC,MMAXPG,PATH)
+C      CALL DISP(SEQ1,IDC1,SEQ2,IDIM3,PATH)
+C      CALL EXPAND(SEQ1,IDC1,SEQ2,IDIM3,SEQC2,IDC2,SEQG2,IDG2,PATH)
+C      LO = MIN(IDC2,IDG2)
+C      WRITE(*,*)'IDC2,IDG2',IDC2,IDG2
+C      LO = MIN(NOTIRL(SEQC2,IDC2,','),NOTIRL(SEQG2,IDG2,','))
+C      WRITE(*,*)'LO',LO
+C      END IF
+      LO = MIN(IDOUT,IDIM2)
+      PERCM = PMIS(SEQC2,SEQG2,LO)
+      ITOTPC = NSYM(SEQC2,LO,',')
+      ITOTPG = NSYM(SEQG2,LO,',')
+      IFAIL = 0
+      IF (ITOTPC.GT.MAXPC) IFAIL = 1
+      IF (ITOTPG.GT.MAXPG) IFAIL = 1
+      IF (PERCM.GT.PERMAX) IFAIL = 1
+      IF(IFAIL.NE.0) THEN
+        WRITE(IDEV,1000)PERCM,ITOTPC,ITOTPG
+1000    FORMAT(' Percentage mismatch ',F4.1,', Pads',I4,I4)
+        CALL FMT4LP(SEQC2(1),SEQG2,LO,IPC,1,IDEV,
+     +  NAME1,NAME2)
+        CALL BUSY(KBOUT)
+      END IF
+      END
+      INTEGER FUNCTION NSYM(SEQ,ID,SYM)
+      CHARACTER SEQ(ID),SYM
+      N = 0
+      DO 10 I=1,ID
+        IF(SEQ(I).EQ.SYM) N = N + 1
+ 10   CONTINUE
+      NSYM = N
+      END
+      REAL FUNCTION PMIS(SEQ1,SEQ2,ID)
+      CHARACTER SEQ1(ID),SEQ2(ID),PAD
+      PARAMETER (PAD  = ',')
+C
+C count mismatch. padding in consensus ignored
+C
+      J = 0
+      DO 10 I=1,ID
+        IF(SEQ1(I).NE.SEQ2(I)) THEN
+          IF(SEQ1(I).NE.PAD) J = J + 1
+        END IF
+C        IF(SEQ1(I).NE.SEQ2(I)) J = J + 1
+ 10   CONTINUE
+      PMIS = 100. * REAL(J)/REAL(ID)
+      END
+      SUBROUTINE UPCHEK(PC,PG,L,N)
+      INTEGER PC(N),PG(N),L(N),DC,DG
+C     AUTHOR RODGER STADEN
+C
+C only allow gaps that are shorter than the next block of identity
+C
+      K1 = 2
+1     CONTINUE
+      DO 10 I = K1,N
+        J1 = I
+        DC = PC(I) - PC(I-1) - L(I-1)
+        DG = PG(I) - PG(I-1) - L(I-1)
+        IF(ABS(DC-DG).GE.L(I)) GO TO 20
+10    CONTINUE
+      RETURN
+20    CONTINUE
+C      WRITE(*,*)'REMOVING!!'
+        CALL ML(PC,PG,L,N,J1)
+C      IF(L(J1-1).GT.L(J1)) THEN
+C        CALL ML(PC,PG,L,N,J1)
+C      ELSE
+C        CALL ML(PC,PG,L,N,J1-1)
+C      END IF
+      K1 = MAX(2,J1-1)
+      N = N - 1
+      GO TO 1
+      END
+      SUBROUTINE ARRFIN(IDEV,SEQNCE,J,KBOUT,ICRAP)
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,TITCHR,GT
+      EXTERNAL NOK2
+      SAVE SPACE,TITCHR,GT
+      PARAMETER (IWIN = 100, NBAD = 5)
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      DATA GT/'>'/
+C
+C routine to read reads, including their poor data if required
+C allowing 5 n's in 100 bases
+C
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=80)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR) GO TO 1
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(J.EQ.IDMX)THEN
+            CALL ERROM(KBOUT,'Error in arrfin: too much data')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+C      WRITE(*,*)' GOOD',J
+      IF(ICRAP.NE.0) RETURN
+C
+C read in the crap
+C
+      REWIND(IDEV)
+      JG = J
+ 50   CONTINUE
+      READ(IDEV,1001,END=70,ERR=80)TEMP
+      IF(TEMP(1).NE.TITCHR) GO TO 50
+      IF(TEMP(2).EQ.GT) THEN
+        DO 60 I=3,80
+          IF(TEMP(I).NE.SPACE)THEN
+            IF(J.EQ.IDMX)THEN
+              CALL ERROM(KBOUT,'Error in arrfin: too much data')
+              GO TO 70
+            END IF
+            J=J+1
+            SEQNCE(J)=TEMP(I)
+          END IF
+60      CONTINUE
+      END IF
+      GO TO 50
+70    CONTINUE
+C      WRITE(*,*)' GOOD+BAD',J
+      K = NOK2(SEQNCE(JG+1),J-JG+1,IWIN,NBAD)
+      J = JG + K
+C      WRITE(*,*)' GOOD+BAD-CRAP',J
+C      WRITE(*,*)(SEQNCE(K),K=1,J)
+      RETURN
+ 80   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file in arrfin')
+      J = 0
+      END
+      SUBROUTINE DBLSTR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,KBIN,KBOUT,IDEVW,
+     +IDEVR,IDEV,LINLEN,PERCD,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,IDEVN,
+     +  LLINO,LINCON,LREG,RREG,MXGOOD,SEQ2,IDM,JOB,IERR,LSTRT,LEND)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM1)
+C     DBLSTR
+C     JOB 0 = Ask initial questions
+C     JOB 1 = Generate quality information
+C     JOB 2 = Complement contig and region info.
+C
+      IF (JOB.EQ.0) THEN
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0) RETURN
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0) RETURN
+      ELSE IF (JOB.EQ.1) THEN
+      CALL SUMMAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LSTRT,LEND,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,MXGOOD)
+      IDIM1=LEND-LSTRT+1
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2(LSTRT),IDIM1,GEL,LSTRT,LEND,LLINO,IDBSIZ,TEMP3,
+     +CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      ELSE IF (JOB.EQ.2) THEN
+C
+C for complementary strand need to juggle the active region
+C
+         LSTRT = LREG-LSTRT
+         LEND  = LREG-LEND
+         IREG  = RREG
+         RREG  = RELPG(LINCON) - LREG + 1
+         LREG  = RELPG(LINCON) - IREG + 1
+         LSTRT = LREG-LSTRT
+         LEND  = LREG-LEND
+         IF (LSTRT.LT.1) LSTRT=1
+         IF (LEND.LT.1) LEND=1
+C
+         CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +        LINCON,LNBR(LINCON),GEL,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +        MAXGEL)
+         LLINO = LNBR(LINCON)
+         CALL SUMMAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +        SEQ1,IDIM1,GEL,LSTRT,LEND,LLINO,PERCD,IDBSIZ,
+     +        TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +        MAXGEL,MXGOOD)
+         CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +        SEQ2(LSTRT),IDIM1,GEL,LSTRT,LEND,LLINO,IDBSIZ,TEMP3,
+     +        CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      END IF
+      END
+      SUBROUTINE SUMMAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      CHARACTER SYMS(7,7)
+      EXTERNAL INDEXS,LWRAPS,IGTCON
+      SAVE SYMS
+      DATA SYMS/
+     +'0','4','4','4','5','5','1',
+     +'4','0','4','4','5','5','1',
+     +'4','4','0','4','5','5','1',
+     +'4','4','4','0','5','5','1',
+     +'6','6','6','6','3','5','7',
+     +'6','6','6','6','6','3','7',
+     +'2','2','2','2','8','8','9'/
+C
+C codes are: using g = good consensus, b = bad consensus, n = no data
+C
+C               + -
+C           0   g g =
+C           1   g n
+C           2   n g
+C           3   b b
+C           4   g g !=
+C           5   g b
+C           6   b g
+C           7   b n
+C           8   n b
+C           9   n n      (should never occur!!!)
+C
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+ 40         CONTINUE
+ 50             CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        IREND = RELPG(GELC) - LNGTHG(GELC) + 1
+        IF (MXGOOD.LT.MAXGEL) THEN
+          LSEQNO = IREND - MXGOOD + 1
+        ELSE
+          LSEQNO = RELPG(GELC)
+        END IF
+        LSEQNO = MAX(LSEQNO,LREG)
+        IS = LSEQNO - RELPG(GELC) + 1
+        N  = ABS(LNGTHG(GELC))
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+ 70         CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+ 200     CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+ 210     CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        L1 = IGTCON(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        L2 = IGTCON(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        POSN1 = POSN1 + 1
+        SEQ1(POSN1) = SYMS(L2,L1)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+ 250           CONTINUE
+        POSN=POSN+1
+ 230       CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      INTEGER FUNCTION IGTCON(COUNTS,IDM,CUT)
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU
+C
+C returns values 1 to 7: 7 means no data, 1-5 means a,c,g,t,*
+C 6 means no consensus. if we change gtconc we should change this too
+C
+      IGTCON = 7
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5          CONTINUE
+      IF(ISUM.EQ.0) RETURN
+      Y = ISUM
+      DO 10 I = 1,IDM - 1
+        X = REAL(COUNTS(I))/Y
+        IF(X.GE.CUT) THEN
+          IGTCON = I
+          RETURN
+        END IF
+ 10         CONTINUE
+      IGTCON = 6
+      END
+      SUBROUTINE CHKREV(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,
+     +IDEVN,IDEV,KBIN,KBOUT,DONEIT,MATES,IGELS,DIST,RNAMES)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER DONEIT(IDBSIZ),MATES(IDBSIZ),IGELS(IDBSIZ),DIST(IDBSIZ)
+      INTEGER GCLIN,DTOEND,CHAINL
+      CHARACTER*(*) RNAMES(IDBSIZ)
+      LOGICAL SAMEC
+      EXTERNAL GCLIN,DTOEND,NMATE,SAMEC,CHAINL
+      PARAMETER (MAXD = 99999)
+C
+C need to know which ones have already been mated
+C
+      CALL FILLI(DONEIT,NGELS,0)
+      CALL FILLI(IGELS,NGELS,0)
+      CALL FILLI(MATES,NGELS,0)
+      CALL GNAMES(IDEVN,RNAMES,NGELS)
+      IGEL = 0
+      IDONE = 0
+ 10   CONTINUE
+      IGEL = IGEL + 1
+      IF (IGEL.EQ.NGELS) THEN
+        IF (IDONE.GT.0) THEN
+          CALL SHOWP(DIST,IGELS,MATES,IDONE,RNAMES,RELPG,IDBSIZ,IDEV)
+        ELSE
+          WRITE(KBOUT,*)'None found'
+        END IF
+        RETURN
+      END IF
+      IF (DONEIT(IGEL).EQ.1) GO TO 10
+      MATE = NMATE(IGEL,NGELS,RNAMES,2)
+      IF (MATE.EQ.0) GO TO 10
+      DONEIT(MATE) = 1
+      IDONE = IDONE + 1
+C
+C we have a pair igel and mate. deal separately with cases of 1 or 2 contigs
+C
+      IF (SAMEC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,IGEL,MATE))
+     +  THEN
+C
+C same contig
+C        
+        IF (((LNGTHG(IGEL).GT.0).AND.(LNGTHG(MATE).GT.0)).OR.
+     +  ((LNGTHG(IGEL).LT.0).AND.(LNGTHG(MATE).LT.0))) THEN
+C
+C >>>>>>>          >>>>>>>>> or <<<<<<<<      <<<<<<<<<<
+C
+          IGELS(IDONE) = IGEL
+          MATES(IDONE) = MATE
+          DIST(IDONE)  = MAXD
+C          WRITE(KBOUT,*)IGEL,MATE,
+C     +    ' are in the same contig face the same way'
+          GO TO 10
+        END IF
+        IF (LNGTHG(IGEL).GT.0) THEN
+          IF (RELPG(IGEL).GT.RELPG(MATE)+ABS(LNGTHG(MATE)) -1 ) THEN
+C
+C <<<<<<<222   111>>>>>>>>
+C
+            IGELS(IDONE) = IGEL
+            MATES(IDONE) = MATE
+            DIST(IDONE)  = MAXD
+C            WRITE(KBOUT,*)IGEL,MATE,
+C     +      ' are in the same contig and face away'
+          ELSE
+C
+C 111>>>>> <<<<<<222
+C
+            IGELS(IDONE) = IGEL
+            MATES(IDONE) = MATE
+            DIST(IDONE)  = RELPG(MATE)+ABS(LNGTHG(MATE))-RELPG(IGEL)
+C            WRITE(KBOUT,*)IGEL,RELPG(IGEL),MATE,RELPG(MATE)
+          END IF
+        ELSE
+C
+C <<<<<<111
+C
+          IF (RELPG(MATE).GT.RELPG(IGEL)+ABS(LNGTHG(IGEL)) -1 ) THEN
+C
+C <<<<<111 222>>>>>
+C
+            IGELS(IDONE) = IGEL
+            MATES(IDONE) = MATE
+            DIST(IDONE)  = MAXD
+C            WRITE(KBOUT,*)IGEL,MATE,
+C     +      ' are in the same contig and face away'
+          ELSE
+C
+C >>>>>222  <<<<<111
+C
+            IGELS(IDONE) = IGEL
+            MATES(IDONE) = MATE
+            DIST(IDONE)  = RELPG(IGEL)+ABS(LNGTHG(IGEL))-RELPG(MATE)
+C            WRITE(KBOUT,*)'same ',IGEL,RELPG(IGEL),MATE,RELPG(MATE)
+          END IF
+        END IF
+      ELSE
+C
+C different contigs
+C
+        ICONTL = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IGEL)
+        ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,ICONTL)
+        MCONTL = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,MATE)
+        MCONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,MCONTL)
+        LENC1 = RELPG(ICONT)
+        LENC2 = RELPG(MCONT)
+        LENI = DTOEND(RELPG(IGEL),LNGTHG(IGEL),LENC1)
+        LENM = DTOEND(RELPG(MATE),LNGTHG(MATE),LENC2)
+        LENIM = LENI + LENM
+        IGELS(IDONE) = -IGEL
+        MATES(IDONE) = MATE
+        DIST(IDONE)  = LENIM
+C        WRITE(KBOUT,*)'different contigs',IGEL,MATE,' separation',LENIM
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE SHOWP(DIST,IGELS,MATES,IDONE,RNAMES,RELPG,IDBSIZ,IDEV)
+      INTEGER DIST(IDONE),IGELS(IDONE),MATES(IDONE),RELPG(IDBSIZ)
+      CHARACTER*(*) RNAMES(IDBSIZ)
+      CALL BUBBL3(DIST,IGELS,MATES,IDONE)
+      DO 10 I=1,IDONE
+        IF (IGELS(I).GT.0) THEN
+          J = ABS(IGELS(I))
+          WRITE(IDEV,1000)RNAMES(J),J,RELPG(J),DIST(I)
+          J = MATES(I)
+          WRITE(IDEV,1000)RNAMES(J),J,RELPG(J),DIST(I)
+        END IF
+ 10   CONTINUE
+      DO 20 I=1,IDONE
+        IF (IGELS(I).LT.1) THEN
+          J = ABS(IGELS(I))
+          WRITE(IDEV,1000)RNAMES(J),J,RELPG(J),DIST(I),'*'
+          J = MATES(I)
+          WRITE(IDEV,1000)RNAMES(J),J,RELPG(J),DIST(I),'*'
+        END IF
+ 20   CONTINUE
+ 1000 FORMAT(' ',A,3I6,A)
+      END
+      INTEGER FUNCTION DTOEND(RELPG,LNGTHG,LENCON)
+      INTEGER RELPG
+      IF (LNGTHG.GT.0) THEN
+        DTOEND = LENCON - RELPG + 1
+      ELSE
+        DTOEND = RELPG + ABS(LNGTHG) - 1
+      END IF
+      END
+      LOGICAL FUNCTION SAMEC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +NGELS,NCONTS,IGEL,JGEL)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CHAINL
+      EXTERNAL CHAINL
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IGEL)
+      J = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,JGEL)
+      SAMEC = .FALSE.
+      IF (I.EQ.J) SAMEC = .TRUE.
+      END
+      SUBROUTINE GNAMES(IDEVN,RNAMES,NGELS)
+      CHARACTER*(*) RNAMES(NGELS)
+      DO 10 I=1,NGELS
+        CALL READN(IDEVN,I,RNAMES(I))
+ 10   CONTINUE
+      END
+      INTEGER FUNCTION  NMATE(IGEL,NGELS,RNAMES,JOB)
+      PARAMETER (NAMLEN = 16)
+      CHARACTER*(NAMLEN) NAME,NAMEM,RNAMES(NGELS)
+C
+C assume s1 and r1 are mates
+C
+C if job = 1 we look at all the names
+C if job = 2 we look only at the ones with higher numbers
+      NMATE = 0
+      IF(JOB.EQ.1) THEN
+        JGEL = 1
+      ELSE IF(JOB.EQ.2) THEN
+        JGEL = IGEL +1
+      ELSE
+        WRITE(*,*)'SCREAM: NMATE'
+        RETURN
+      END IF
+      NAME = RNAMES(IGEL)
+      I = INDEX(NAME,'.')
+      IF (NAME(I+1:I+1).EQ.'s') THEN
+        NAME(I+1:I+1) = 'r'
+      ELSE
+        NAME(I+1:I+1) = 's'
+      END IF
+      DO 10 I=JGEL,NGELS
+        IF (I.NE.IGEL) THEN
+          NAMEM = RNAMES(I)
+          IF (NAME.EQ.NAMEM) THEN
+            NMATE = I
+            RETURN
+          END IF
+        END IF
+ 10   CONTINUE
+      END
+      SUBROUTINE LMATES(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,
+     +IDEV,IGELS,MATES,DIST,RNAMES,NMATES)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MATES(IDBSIZ),IGELS(IDBSIZ),DIST(IDBSIZ)
+      CHARACTER*(*) RNAMES(IDBSIZ)
+C
+C list mates
+C
+C      WRITE(*,*)NMATES
+      DO 10 I=1,NMATES
+        J = IGELS(I)
+        K = MATES(I)
+        IF(DIST(I).GT.0) THEN
+          WRITE(IDEV,1000)J,RNAMES(J),RELPG(J),LNGTHG(J),DIST(I)
+          WRITE(IDEV,1000)K,RNAMES(K),RELPG(K),LNGTHG(K)
+ 1000     FORMAT(' ',I6,' ',A,' ',3I6)
+        END IF
+ 10   CONTINUE
+      DO 20 I=1,NMATES
+        J = IGELS(I)
+        K = MATES(I)
+        IF(DIST(I).LT.0) THEN
+          WRITE(IDEV,1000)J,RNAMES(J),RELPG(J),LNGTHG(J),DIST(I)
+          WRITE(IDEV,1000)K,RNAMES(K),RELPG(K),LNGTHG(K)
+        END IF
+ 20   CONTINUE
+      END
+      SUBROUTINE CHKREW(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,
+     +IDEVN,IGELS,MATES,DIST,RNAMES,LLINO,NMATES)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MATES(IDBSIZ),IGELS(IDBSIZ),DIST(IDBSIZ)
+      CHARACTER*(*) RNAMES(IDBSIZ)
+      LOGICAL SAMEC
+      EXTERNAL NMATE,SAMEC
+C
+C need to know which ones have already been mated
+C
+      CALL FILLI(IGELS,NGELS,0)
+      CALL FILLI(MATES,NGELS,0)
+      CALL GNAMES(IDEVN,RNAMES,NGELS)
+      IGEL = LLINO
+      NMATES = 0
+ 10   CONTINUE
+      MATE = NMATE(IGEL,NGELS,RNAMES,1)
+      IF (MATE.NE.0) THEN
+        NMATES = NMATES + 1
+        IGELS(NMATES) = IGEL
+        MATES(NMATES) = MATE
+C        WRITE(*,*)IGELS(NMATES),MATES(NMATES)
+C
+C we have a pair igel and mate. deal separately with cases of 1 or 2 contigs
+C
+        IF (SAMEC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,NCONTS,IGEL,MATE))
+     +  THEN
+C
+C same contig
+C        
+          DIST(NMATES) = RELPG(MATE) + ABS(LNGTHG(MATE)) - RELPG(IGEL)
+          IF(RELPG(MATE).LT.RELPG(IGEL)) THEN
+C
+C seen it before so skip it
+C
+            NMATES = NMATES - 1
+C            IGELS(NMATES) = MATE
+C            MATES(NMATES) = IGEL
+          END IF
+C          WRITE(*,*)IGELS(NMATES),MATES(NMATES)
+C          WRITE(*,*)DIST(NMATES)
+        ELSE
+C
+C different contigs
+C
+          DIST(NMATES) = -ABS(LNGTHG(IGELS(NMATES)))
+C          WRITE(*,*)DIST(NMATES)
+        END IF
+      END IF
+      IF (RNBR(IGEL).NE.0) THEN
+        IGEL = RNBR(IGEL)
+        GO TO 10
+      END IF
+      END
+      SUBROUTINE GDEPTH(RELPG,LNGTHG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),IGELS(NMATES),MATES(NMATES),DIST(NMATES)
+      INTEGER LNGTHG(IDBSIZ)
+      INTEGER RREG,DEPTH
+C      WRITE(*,*)(IGELS(K),MATES(K),DIST(K),K=1,NMATES)
+      CALL GDPTH(RELPG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,DEPTH)
+      IF(NMATES.LT.1) RETURN
+      CALL PLTCOG(RELPG,LNGTHG,IDBSIZ,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,LREG,RREG,DEPTH)
+      END
+      SUBROUTINE GDPTH(RELPG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),MATES(NMATES),IGELS(NMATES),DIST(NMATES)
+      INTEGER RREG,DEPTH
+      EXTERNAL NCDEPM
+C  LREG = left contig position
+C  RREG = right ''      ''
+C
+C have nmates reading numbers in igels and mates, their spacings are dist
+C dist <0 means they are in different contigs
+C the igels numbers are in left to right order
+C
+      I = 1
+5     CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF((RELPG(IGELS(I))+ABS(DIST(I))-1).LE.LREG) THEN
+          I = I + 1
+          GO TO 5
+        END IF
+      END IF
+      DEPTH = 0
+10    CONTINUE
+      IF(I.LT.NMATES) THEN
+        IF(RELPG(IGELS(I)).LE.RREG) THEN
+            K = RELPG(IGELS(I)) + ABS(DIST(I))
+            DEPTH = MAX(NCDEPM(RELPG,IDBSIZ,I,IGELS,NMATES,K),DEPTH)
+          I = I + 1
+          GO TO 10
+        END IF
+      END IF
+C      WRITE(*,*)'DEPTH',DEPTH
+      END
+      SUBROUTINE PLTCOG(RELPG,LNGTHG,IDBSIZ,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,LREG,RREG,DEPTH)
+      INTEGER DEPTH
+      INTEGER RELPG(IDBSIZ),IGELS(NMATES),MATES(NMATES),DIST(NMATES)
+      INTEGER LNGTHG(IDBSIZ)
+      INTEGER RREG
+C  have window size margt starting at margb
+C  depths depthp, depthm
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL PLTCG(RELPG,LNGTHG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,DEPTH)
+      CALL VT100M
+      END
+      SUBROUTINE PLTCG(RELPG,LNGTHG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),IGELS(NMATES),MATES(NMATES),DIST(NMATES)
+      INTEGER LNGTHG(IDBSIZ)
+      INTEGER RREG,DEPTH
+      CHARACTER SYM
+      LOGICAL MATEY
+      EXTERNAL MATEY
+      YINC = REAL(ISYMAX) / (DEPTH + 1)
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      XSLOPE = REAL(MARGR - MARGL)/100.
+      YSLOPE = YINC/10.
+C      WRITE(*,*)XMIN,XMAX
+      I = 1
+5     CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF((RELPG(IGELS(I))+ABS(DIST(I))-1).LT.LREG) THEN
+          I = I + 1
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF(RELPG(IGELS(I)).LE.RREG) THEN
+            XF = MAX(RELPG(IGELS(I)),LREG)
+            XT = MIN(ABS(DIST(I))+RELPG(IGELS(I))-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = N * YINC
+C            WRITE(*,*)I,IGELS(I),MATES(I),DIST(I)
+C            WRITE(*,*)RELPG(IGELS(I)),LNGTHG(IGELS(I))
+C            WRITE(*,*)RELPG(MATES(I)),LNGTHG(MATES(I))
+C            WRITE(*,*)I,XF,XT
+            CALL LINE(XT,XF,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            IF (.NOT.(MATEY(IGELS(I),MATES(I),
+     +      RELPG,LNGTHG,IDBSIZ)).AND.(DIST(I).GT.0)) THEN
+              XF1 = XF + (XT-XF)/2.
+              YT1 = YF - YSLOPE
+              YT2 = YF + YSLOPE
+              CALL LINE(XF1,XF1,YT1,YT2,XMAX,XMIN,YMAX,YMIN,
+     +        MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            END IF
+            IF (LNGTHG(IGELS(I)).GT.0) THEN
+              XT = RELPG(IGELS(I))+LNGTHG(IGELS(I))
+              CALL RARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +        MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            ELSE
+              XT = MAX(RELPG(IGELS(I)),LREG)
+              CALL LARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +        MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            END IF
+            IF (DIST(I).GT.0) THEN
+              IF (LNGTHG(MATES(I)).GT.0) THEN
+                XT = RELPG(MATES(I))+LNGTHG(MATES(I))
+                CALL RARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +          MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+              ELSE
+                XT = MIN(RELPG(MATES(I)),RREG)
+                CALL LARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +          MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+              END IF
+            ELSE
+              SYM = '*'
+              CALL TEXT(XT,YF,SYM,1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +        MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            END IF
+          I = I + 1
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE LARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      XF1 = XT + XSLOPE
+      YT1 = YF + YSLOPE
+      XF2 = XT + XSLOPE
+      YT2 = YF - YSLOPE
+      CALL LINE(XT,XF1,YF,YT1,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XT,XF2,YF,YT2,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE RARROW(XT,YF,XSLOPE,YSLOPE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      XF1 = XT - XSLOPE
+      YT1 = YF + YSLOPE
+      XF2 = XT - XSLOPE
+      YT2 = YF - YSLOPE
+      CALL LINE(XT,XF1,YF,YT1,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XT,XF2,YF,YT2,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      LOGICAL FUNCTION MATEY(IGEL,MATE,RELPG,LNGTHG,NGELS)
+      INTEGER RELPG(NGELS),LNGTHG(NGELS)
+      MATEY = .FALSE.
+      IF (RELPG(IGEL).LT.RELPG(MATE)) THEN
+        IF ((LNGTHG(IGEL).GT.0).AND.(LNGTHG(MATE).LT.0)) MATEY = .TRUE.
+      ELSE
+        IF ((LNGTHG(MATE).GT.0).AND.(LNGTHG(IGEL).LT.0)) MATEY = .TRUE.
+      END IF
+      END
+      INTEGER FUNCTION NCDEPM(RELPG,IDBSIZ,IGEL,
+     +IGELS,NMATES,RREG)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER IGELS(NMATES)
+      INTEGER RREG
+      NCDEPM = 0
+      N = 0
+      I = IGEL
+10    CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF(RELPG(IGELS(I)).LE.RREG) THEN
+          N = N + 1
+          I = I + 1
+          GO TO 10
+        END IF
+      END IF
+      NCDEPM = N
+      END
+      SUBROUTINE XHGAP(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NCONTS,LLINOI,LINCNI,LREG,RREG,
+     +WINDOW,GWIND,LENCON,DEPTHP,DEPTHM,
+     +MARGL,MARGR,MARGB,MARGT,MAXOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +KBOUT,GEL,GEL2,IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM,
+     +SEQ1,IDIM1,NGELS,TEMP3,CHRSIZ,MAXGL2,LINOU1,LINOU2,
+     +NOPT1,NOPT2,NOPT3,NOPT4,
+     +IHELPS,IHELPE,HELPF,IDEVH,MXGOOD,RNAMES,IGELS,MATES,DIST)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),WINDOW,CHRSIZ,GWIND
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND,CHNRP1,HQN
+      INTEGER TEMP3(2,CHRSIZ,MAXGL2)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER TERM,TUPPER,NAMARC*16,HELPF*(*)
+      CHARACTER SEQ1(IDIM1),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER*(*) RNAMES(IDBSIZ)
+      INTEGER MATES(IDBSIZ),IGELS(IDBSIZ),DIST(IDBSIZ),DEPTHR
+      EXTERNAL NOPWIN,CWORLD,TUPPER,CHNRP1,HQN
+C  nopt1 = single contig
+C  nopt2 = all contigs
+C  nopt3 = scan
+C  nopt4 = plot templates with pairs of reads
+      CALL CHKREW(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,NGELS,
+     +IDEV3,IGELS,MATES,DIST,RNAMES,LLINOI,NMATES)
+10    CONTINUE
+      LLINO = LLINOI
+      LINCON = LINCNI
+      LOCLR = 0
+      LOCRR = 0
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL CLEARV
+      CALL XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CALL VT100M
+      INFLAG = HQN(TERM)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      NOPT = NOPWIN(IY,MARGB,MARGT,MAXOPT)
+C      WRITE(*,*)NOPT
+      TERM = TUPPER(TERM)
+      IF(NOPT.EQ.0) RETURN
+      IF(NOPT.EQ.NOPT3) THEN
+        IF(TERM.EQ.'S') THEN
+          XMIN = LREG
+          XMAX = RREG
+          X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF((TERM.EQ.'N').OR.(TERM.EQ.'Z').OR.(TERM.EQ.'I')) GO TO 10
+      END IF
+      IF(NOPT.EQ.NOPT1) THEN
+        STRAND = 1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHP)
+        IF(DEPTHP.LT.0) RETURN
+        STRAND = -1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHM)
+        IF(DEPTHM.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        RINC = ISYMAX / (DEPTHP + DEPTHM + 2)
+        RMID =(DEPTHM+1) * RINC
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+C        WRITE(*,*)X,Y,IX,IY
+        IF(TERM.EQ.'I') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(ICLOSE.EQ.1) GO TO 10
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+1006      FORMAT
+     +    ( ' Name ',A,' Number ',I6,' Rel. Posn. ',I7,' Length ',I5)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Z') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(IGEL.EQ.0) GO TO 10
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT2),MARGT(NOPT2),ISXMAX,ISYMAX)
+          LREG = MAX(1,RELPG(IGEL)-GWIND)
+          RREG = MIN(RELPG(LINCON),RELPG(IGEL)+GWIND)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +    LLINO,LREG)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT2) THEN
+        CALL IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  NCONTS,IX,IY,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +  ISXMAX,ISYMAX,DBTDUX,DBTDUY,
+     +  LINCON,IGEL,IS)
+        IF(IGEL.EQ.0) RETURN
+        IF(TERM.EQ.'Z') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'I') THEN
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT4) THEN
+        CALL GDPTH(RELPG,
+     +  IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,DEPTHR)
+C     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        IF(DEPTHR.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+C        WRITE(*,*)X,Y,IX,IY
+        IF(TERM.EQ.'I') THEN
+          CALL IPLTCG(RELPG,LNGTHG,
+     +    IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEPTHR,X,Y,
+     +    IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) GO TO 10
+C          CALL READN(IDEV3,IGEL,NAMARC)
+          J = IGELS(IGEL)
+          WRITE(IDEV,1007)RNAMES(J),J,RELPG(J),LNGTHG(J)
+          J = MATES(IGEL)
+          WRITE(IDEV,1007)RNAMES(J),J,RELPG(J),LNGTHG(J)
+ 1007     FORMAT(' 'A,3I6)
+C          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE IPLTCG(RELPG,LNGTHG,
+     +IDBSIZ,LREG,RREG,IGELS,MATES,DIST,NMATES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,DEPTH,X,Y,IGEL,IOK)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),IGELS(NMATES),MATES(NMATES),DIST(NMATES)
+      INTEGER LNGTHG(IDBSIZ)
+      INTEGER RREG,DEPTH
+      LOGICAL MATEY
+      EXTERNAL MATEY
+      YINC = REAL(ISYMAX) / (DEPTH + 1)
+      YINCO2 = YINC/2.
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+C      WRITE(*,*)X,Y
+C      WRITE(*,*)XMIN,XMAX,YMIN,YMAX,YINC
+      I = 1
+5     CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF((RELPG(IGELS(I))+ABS(DIST(I))-1).LT.LREG) THEN
+          I = I + 1
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.LE.NMATES) THEN
+        IF(RELPG(IGELS(I)).LE.RREG) THEN
+            XF = MAX(RELPG(IGELS(I)),LREG)
+            XT = MIN(ABS(DIST(I))+RELPG(IGELS(I))-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = N * YINC
+C            WRITE(*,*)XF,XT,YF
+C            WRITE(*,*)I,IGELS(I),MATES(I),DIST(I)
+C            WRITE(*,*)RELPG(IGELS(I)),LNGTHG(IGELS(I))
+C            WRITE(*,*)RELPG(MATES(I)),LNGTHG(MATES(I))
+C            WRITE(*,*)I,XF,XT
+            IF((X.GE.XF).AND.(X.LE.XT)) THEN
+              IF((Y.GE.YF-YINCO2).AND.(Y.LE.YF+YINCO2)) THEN
+                IOK = 0
+                IGEL = I
+                RETURN
+              END IF
+            END IF
+          I = I + 1
+          GO TO 10
+        END IF
+      END IF
+      IOK = 1
+      END
diff --git a/src/bap/dbsysold.f b/src/bap/dbsysold.f
new file mode 100644
index 0000000..21f89c1
--- /dev/null
+++ b/src/bap/dbsysold.f
@@ -0,0 +1,2440 @@
+C   dbsys routines that were changed when starting to use .rd files
+C         these versions are the pre .rd ones and afre used by sap and xsap
+C         post .rd equivalents are in dbsysnew.f
+C 22-5-91 added new remove gel reading function. Changed movgel, breakc
+C   21-8-91 Added routines to find internal overlaps
+C
+C   SUBROUTINE TO ENTER NEW GEL SEQUENCES INTO DATA BASE.
+C   IT READS IN AN ARCHIVE VERSION AND WRITES OUT A WORKING VERSION.
+C   IT ALSO SETS UP ANY RELATIONSHIPS WITH OTHER DATA IN THE DATABASE
+C   BOTH BY POSITION IN A CONTIG AND POINTERS TO LEFT AND RIGHT
+C   NEIGHBOURS.
+      SUBROUTINE AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,NAMARC,X,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM,IDC,NCONTC,LINCON,IFAIL,IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER SEQC2(IDC)
+      CHARACTER NAMARK*16
+C      WRITE(*,*)'X,ITYPE,ISENSE,IDIM,IDC'
+C      WRITE(*,*)X,ITYPE,ISENSE,IDIM,IDC
+C   SET FAIL FLAG
+      IFAIL=0
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' TRYING TO ENTER NEW GEL READING INTO DATABASE')
+C   IS THERE SPACE?
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+C   FULL
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Database full, current size=',I6,' Extend with copy')
+      IFAIL=7
+      RETURN
+5     CONTINUE
+C   NEED TO CHECK TO SEE IF GEL ALREADY IN DB
+C   LOOK THRU ARC FILE
+      DO 10 J=1,NGELS
+      CALL READN(IDEVN,J,NAMARK)
+      IF(NAMARK.NE.NAMARC(1:16))GO TO 10
+C   FOUND
+      WRITE(KBOUT,1013)J
+1013  FORMAT(' New gel already in database with number',I6,
+     +' Entry aborted')
+      IFAIL=6
+      RETURN
+10    CONTINUE
+C   INCREMENT NUMBER OF GELS
+      NGELS=NGELS+1
+C   SET LENGTH THIS GEL
+      LNGTHG(NGELS)=IDIM*ISENSE
+C   WRITE NAME OF ARCHIVE TO LIST OF ARCHIVES
+C   NAMPRO,ARC
+      NAMARK=NAMARC(1:16)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT(' This gel reading has been given the number ',I6)
+C   WRITE GEL TO WORKING VERSION
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+C
+C
+C   SET UP RELATIONSHIPS
+C   DOES THIS GEL OVERLAP?
+      IF(ITYPE.NE.0)GO TO 100
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+C   SET LEFT AND RIGHT POINTERS TO ZERO,RELPG TO 1
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C
+C   SET CONTIG POINTERS AND GENERAL VALUES
+C   INCREMENT NUMBER OF CONTIGS
+      NCONTS=NCONTS+1
+C   POINTER TO THIS CONTIG
+      N=IDBSIZ-NCONTS
+C   POINTER TO LEFT GEL THIS CONTIG
+      LNBR(N)=NGELS
+C   POINTER TO RIGHT GEL THIS CONTIG
+      RNBR(N)=NGELS
+C   LENGTH OF CONTIG
+      RELPG(N)=IDIM
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   WRITE DB DESCRIPTOR
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+100   CONTINUE
+C
+C
+C   DOES OVERLAP
+150   CONTINUE
+C
+C   LEFT END OR RIGHT OVERLAP?
+      IF(ITYPE.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+160   CONTINUE
+C   NEED TO SEARCH THRU THIS CONTIG TO FIND LEFT AND RIGHT
+C   NEIGHBOURS FOR THIS NEW GEL
+C   LINE NUMBER OF LEFT END OF CONTIG
+      N=NCONTC
+C   LOOK THRU UNTIL CURRENT IS >= THEN IT MUST BE THE PREVIOUS ONE
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+C   IS THIS THE LAST GEL IN CONTIG?
+      IF(RNBR(N).EQ.0)GO TO 350
+C   NO SO LOOK AT NEXT
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS IN NEW GEL
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   SET POINTERS  IN LEFT AND RIGHT NEIGHBOURS
+      K=LNBR(N)
+      RNBR(K)=NGELS
+C      RNBR(LNBR(N))=NGELS
+C   WRITE LEFT AND RIGHT NEIGHBOURS
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   WRITE NGELS NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   ITS LINE NUMBER IS LINCON
+C   NEED TO UPDATE IDIM IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS FOR NEW GEL
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   OLD RIGHT END
+      RNBR(N)=NGELS
+C   WRITE NEW RIGHT LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C   RESET RIGHT NAME IN CONTIG
+C   ITS LINE NUMBER IS LINCON
+      RNBR(LINCON)=NGELS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   NEED TO UPDATE LENGTH OF GEL IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+C   WRITE HERE
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+410   CONTINUE
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,1,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+420   CONTINUE
+C   SET POINTERS IN NEW GEL
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C   SET POINTERS IN OLD LEFT END
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+C   WRITE NEW LEFT END
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+C   NEW LENGTH OF CONTIG
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+C   MAY HAVE JUST ADDED A GEL LONGER THAN CONTIG
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+C   NEW NAME OF LEFT END OF CONTIG
+      LNBR(LINCON)=NGELS
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C   NOW GO THRU AND CHANGE ALL RELATIVE POSITIONS
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      GO TO 440
+      END
+      SUBROUTINE DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB),PL(2),PR(2),RMOST
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LINCON(2),LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER LREG,RREG,X,ANS,ANSJOK
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER GET,SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      SAVE GET
+      DATA GET/'>'/
+      WRITE(KBOUT,*)' Automatic sequence assembler'
+      IFAIL(1) = 0
+      IEMPTY=0
+      IF(NGELS.LT.1)IEMPTY=1
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      CALL YESNO(IOKENT,'Permit entry',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOKENT.LT.0) RETURN
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        CALL OPENF1(IDEV7,FILE,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(IOKENT.EQ.0) THEN
+        CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +  'File for names of failures',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      PROMPT(1) = 'Perform normal shotgun assembly'
+      PROMPT(2) = 'Put all sequences in one contig'
+      PROMPT(3) = 'Put all sequences in new contigs'
+      IOPT = 1
+      CALL RADION('Select entry mode',PROMPT,3,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.1) THEN
+C  parameters for normal assembly
+      ANSJOK = 0
+      CALL YESNO(ANSJOK,'Permit joins',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSJOK.LT.0) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MN = 2
+      MX = 5
+      MINSLI = MAX(3,MN)
+      CALL GETINT(MN,MX,MINSLI,
+     +'Minimum alignment block',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINSLI = IVAL
+      MN = 0
+      MX = 25
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per gel',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MN = 0
+      MX = 25
+      MAXPC = 8
+      CALL GETINT(MN,MX,MAXPC,
+     +'Maximum pads per gel in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPC = IVAL
+      IF(IOKENT.EQ.0) THEN
+        RMN = 0.
+        RMX = 15.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      END IF
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      IF(IEMPTY.EQ.0)
+     +CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      END IF
+      JGEL = 0
+      JNGEL = 0
+      JNJOIN = 0
+      IMATC = 0
+      IF(IFAIL(1).NE.0)GO TO 900
+C
+C
+1     CONTINUE
+C
+C
+      IDIM2=MAXGEL
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAMARC,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        READ(IDEV7,1002,END=900)NAMARC
+      END IF
+1002  FORMAT( A)
+      DO 77 MM=1,80
+        MATCH(MM)=GET
+77    CONTINUE
+      WRITE(IDEV,1077)(MATCH(KK),KK=1,79)
+1077  FORMAT(' ',79A1)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1007  FORMAT(' Gel reading name=',A)
+      WRITE(IDEV,1007)NAMARC
+      CALL OPENRS(IDEV4,NAMARC,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,SEQ2,IDIM2,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      WRITE(IDEV,1800)IDIM2
+1800  FORMAT(' Gel reading length=',I6)
+      IF(IOPT.NE.1) THEN
+        CALL DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +  IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +  IDEV1,IDEV2,IDEV3,MAXGEL,IMATC,IEMPTY,IOPT)
+        IF(IFAIL(1).NE.0) GO TO 800
+        JNGEL = JNGEL + 1
+        GO TO 1
+      END IF
+      IF(IDIM2.LT.MINMAT)THEN
+        WRITE(IDEV,*)
+     +' Reading shorter than minimum match, so not processed'
+        GO TO 1
+      END IF
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      IFCOMP=0
+      IMATC=0
+      IF(IEMPTY.EQ.0)
+     +CALL AUTOCN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL)
+      IF(IOKENT.NE.0) GO TO 1
+C     THIS RETURNS THE FOLLOWING:
+C     ILEFTS  POSITION IN CONSENSUS OF LEFT END OF MATCHING CONTIGS
+C     ILC     LENGTHS OF MATCHING CONTIGS
+C     IPOSC   POSITION OF MATCH RELATIVE TO CONTIG
+C     IPOSG   POSITION OF MATCH RELATIVE TO NEW GEL
+C     ISENSE  SENSE OF NEW GEL
+C     LLINO   LEFT GEL NUMBER IN MATCHING CONTIGS
+C     IMATC   THE NUMBER OF MATCHING CONTIGS (>2 IS ERROR!)
+C     IFCOMP  ERROR FLAG FOR COMPARISON (COMPARISON ARRAYS OVERFLOWED)
+      IF(IFCOMP.NE.0)GO TO 800
+      CALL SQCOPY(SEQ3,SEQ2,IDIM2)
+      IF(IMATC.GT.0)GO TO 200
+C
+C                         NO OVERLAP NEW CONTIG
+C
+C     ITYPE 0 = NO OVERLAP
+C     ISENSE 1 = SAME SENSE AS ARCHIVE
+      IF(IFAIL(1).NE.0) GO TO 800
+      ITYPE(1)=0
+      ISENSE(1)=1
+      IDOUT(1)=MAXGEL
+      WRITE(IDEV,1015)
+1015  FORMAT(' New gel reading does not overlap: start a new contig')
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,X,ITYPE,ISENSE,SEQC2(1,1),ITOTPC(1),
+     +IDIM2,IDOUT(1),LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      IEMPTY=0
+      IDIM1=IDIM1+1
+      IF((IDIM1+19+IDIM2).GT.MAXSEQ)THEN
+        WRITE(IDEV,1021)MAXSEQ
+1021    FORMAT(' Database maximum consensus length (',I6,') exceeded')
+        GO TO 900
+      END IF
+      CALL ADDTIT(SEQ1(IDIM1),NAMPRO,NGELS,IDIM1)
+      CALL MSTLKL(SEQ2,IDIM2)
+      CALL SQCOPY(SEQ2,SEQ1(IDIM1),IDIM2)
+      IDIM1=IDIM1+IDIM2-1
+      JNGEL = JNGEL + 1
+      GO TO 1
+C
+C
+C
+200   CONTINUE
+C
+C
+C   OVERLAP SO TRY TO ALIGN THE SEQUENCES
+C
+C
+      DO 100 I=1,IMATC
+        N=IDBSIZ-NCONTS
+        DO 99 J=N,IDBSIZ-1
+          IF(LNBR(J).NE.LLINO(I))GO TO 99
+          LINCON(I)=J
+          GO TO 100
+99      CONTINUE
+        WRITE(IDEV,10077)LLINO(I)
+10077   FORMAT(' Contig line for contig',I6,' not found!')
+        GO TO 800
+100   CONTINUE
+C
+      IF((IMATC.EQ.2).AND.(ANSJOK.EQ.0))GO TO 400
+C
+C
+C                           SINGLE OVERLAP
+C
+C
+C
+      WRITE(IDEV,1014)LLINO(1)
+1014  FORMAT(' New gel reading overlaps contig',I6)
+      IF(ITOTPG(1).GT.0) CALL CCTA(SEQG2(1,1),IDIM22(1))
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,1),NAMARC,JOINT(1),ITYPE(1),ISENSE(1),
+     +SEQC2(1,1),
+     +ITOTPC(1),IDIM22(1),IDOUT(1),LLINO(1),LINCON(1),
+     +IFAIL(1),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      CALL DELCON(SEQ1,ILEFTS(1),ILC(1),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(1))
+      IGELC=LNBR(LINCON(1))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      JNGEL = JNGEL + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C                        DOUBLE OVERLAP
+C
+C
+400   CONTINUE
+      WRITE(IDEV,1013)LLINO
+1013  FORMAT(' Overlap between contigs',I6,' and',I6)
+      IF(ANSJOK.NE.0)GO TO 800
+      IF(LLINO(1).EQ.LLINO(2))THEN
+        WRITE(IDEV,*)' Trying to form loop in contig',LLINO(1)
+        WRITE(IDEV,*)' Gel not entered'
+        GO TO 800
+      END IF
+      CALL AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,
+     +IDIM22,KLASS,IOVER,IDEV,PL,PR)
+      IF(IOVER.GT.MAXOVR)THEN
+        WRITE(IDEV,*)' Overlap too large: entry only'
+        IFAIL(2)=1
+        GO TO 600
+       END IF
+C   WHICH CONTIG IS LEFTMOST?
+      LMOST=1
+      RMOST=2
+      IF(PL(1).GT.PL(2))THEN
+        LMOST=2
+        RMOST=1
+      END IF
+C   SAVE LENGTH OF RMOST CONTIG FOR DELETION STEP LATER
+      ILCR=ILC(RMOST)
+      IF(ITOTPG(LMOST).GT.0) CALL CCTA(SEQG2(1,LMOST),IDIM22(LMOST))
+      WRITE(IDEV,1012)LLINO(LMOST)
+1012  FORMAT(' Entering the new gel reading into contig',I6)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,LMOST),NAMARC,JOINT(LMOST),ITYPE(LMOST),
+     +ISENSE(LMOST),SEQC2(1,LMOST),ITOTPC(LMOST),
+     +IDIM22(LMOST),IDOUT(LMOST),LLINO(LMOST),LINCON(LMOST),
+     +IFAIL(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,
+     +MAXGEL)
+      IF(IFAIL(LMOST).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      IF(ITYPE(LMOST).EQ.1)LLINO(LMOST)=NGELS
+      IF(ILEFTS(LMOST).LT.ILEFTS(RMOST))THEN
+        ILEFTS(RMOST)=ILEFTS(RMOST)-ILC(LMOST)-20
+      END IF
+      CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(LMOST))
+      IGELC=LNBR(LINCON(LMOST))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      ILC(LMOST)=RREG
+      ILEFTS(LMOST)=IDIM1-RREG+1
+      DO 500 I=1,2
+        IF(ISENSE(I).EQ.-1)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON(I),
+     +    LLINO(I),SEQ2,IDBSIZ,IDEV,IDEV1,IDEV2,MAXGEL)
+          CALL SQREV(SEQ1(ILEFTS(I)),ILC(I))
+          CALL SQCOM(SEQ1(ILEFTS(I)),ILC(I))
+          KT=IDIM1
+          CALL ADDTIT(SEQ1((ILEFTS(I)-20)),NAMPRO,LNBR(LINCON(I)),KT)
+        END IF
+500   CONTINUE
+C   NEED TO KNOW POSITION OF OVERLAP RELATIVE TO CONTIG, TO CONSENSUS
+C   WHICH BITS TO SEND TO ALIGNMENT ROUTINES
+C   SET UP FOR ALINE (NOTE RMOST IS EQUIVALENT TO THE GEL READING AND
+C   SO IS SLID ALONG THE LMOST CONTIG. THE SECTION SENT TO ALINE MUST 
+C   BE OF LENGTH < MAXGEL-2*MAX(MAXPC,MAXPG)
+C   IT MUST START AT POSITION 1 IN THE RMOST CONTIG AND EXTEND
+      IPOSC(LMOST)=PL(RMOST)+RELPG(NGELS)-1
+      ILCT = RELPG(LINCON(LMOST)) - RELPG(NGELS) - PL(RMOST) + 2
+      ILC(RMOST)=MIN(ILCT,ILC(RMOST))
+      IPOSC(RMOST)=1
+      IDOUT(LMOST)=MAXGEL
+      IDOUT(RMOST)=MAXGEL
+      IDSAV=MAXSAV
+C  ON INPUT TO ALINE ILC(RMOST) CONTAINS THE OVERLAP LENGTH
+C  ON OUTPUT IT CONTAINS THE LENGTH OF THE ALIGNED SECTION (IE INCLUDING 
+C  PADS)
+      WRITE(IDEV,1009)
+1009  FORMAT(' Trying to align the two contigs')
+      CALL ALINE(SEQ1(ILEFTS(LMOST)),SEQ1(ILEFTS(RMOST)),
+     +SEQC2(1,RMOST),SEQC2(1,LMOST),SAV1,SAV2,SAV3,IDSAV,
+     +ILC(LMOST),ILC(RMOST),IDOUT(LMOST),IPOSC(LMOST),IPOSC(RMOST),
+     +MINSLI,JOINT(LMOST),ITOTPC(LMOST),ITOTPC(RMOST),IFAIL(1),
+     +ITYPE(1),MAXPC,MAXPC,PERMAX,IDEV,SEQ4,MAXGEL,Z)
+C SEQC2(1,LMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE LMOST CONTIG
+C SEQC2(1,RMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE RMOST CONTIG
+C ILC(RMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE RMOST CONTIG
+C IDOUT(LMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE LMOST CONTIG
+C JOINT(LMOST)  IS THE POSITION OF THE JOIN RLETIVE TO THE LMOST CONTIG
+C ITYPE IS TYPE OF OVERLAP (-1 = RIGHT END OR INTERNAL, 1 = LEFT END)
+C  NB SHOULD ALWAYS BE -1
+C  IF THIS HAS BEEN DONE OK WE CAN EDIT THE TWO CONTIGS THEN JOIN
+      IF(IFAIL(1).NE.0)THEN
+        WRITE(IDEV,*)' Failed to align the two overlapping contigs'
+        GO TO 800
+      END IF
+      IF(ITOTPC(LMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(LMOST)
+1017    FORMAT(' Editing contig',I6)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(LMOST),JOINT(LMOST),SEQC2(1,LMOST),
+     +  ITOTPC(LMOST),IDOUT(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      JOINT(RMOST)=1
+      IDOUT(RMOST)=ILC(RMOST)
+      IF(ITOTPC(RMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(RMOST)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(RMOST),JOINT(RMOST),SEQC2(1,RMOST),
+     +  ITOTPC(RMOST),IDOUT(RMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      ILC(RMOST)=ILCR
+      LTL=LNBR(LINCON(LMOST))
+      LTR=LNBR(LINCON(RMOST))
+      WRITE(IDEV,1018)LNBR(LINCON(LMOST)),LNBR(LINCON(RMOST))
+1018  FORMAT(' Completing the join between contigs',I6,' and',I6)
+      CALL AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +JOINT(LMOST),LTL,LTR,LINCON(LMOST),LINCON(RMOST),IDEV1)
+      LLINO(1)=LTL
+      IF(ILEFTS(LMOST).GT.ILEFTS(RMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+      END IF
+      IF(ILEFTS(RMOST).GE.ILEFTS(LMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      END IF
+      LREG=1
+      RREG=JOINT(LMOST)
+      IGELC=LLINO(1)
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+C      CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+      JNJOIN = JNJOIN + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C             JOINS THAT FAIL INITIAL ALIGNMENT CRITERIA
+C
+C
+600   CONTINUE
+C   COME HERE FOR JOINS THAT FAIL: WE MIGHT BE ABLE TO ENTER THE GEL 
+C   INTO A SINGLE CONTIG
+      IGOOD=0
+      IF(IFAIL(1).EQ.0)IGOOD=1
+      IF(IFAIL(2).EQ.0)IGOOD=2
+      IF(IGOOD.EQ.0)GO TO 800
+      IF(ITOTPG(IGOOD).GT.0) CALL CCTA(SEQG2(1,IGOOD),IDIM22(IGOOD))
+      WRITE(IDEV,1012)LLINO(IGOOD)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,IGOOD),NAMARC,JOINT(IGOOD),ITYPE(IGOOD),
+     +ISENSE(IGOOD),SEQC2(1,IGOOD),ITOTPC(IGOOD),
+     +IDIM22(IGOOD),IDOUT(IGOOD),LLINO(IGOOD),LINCON(IGOOD),
+     +IFAIL(IGOOD),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,
+     +MAXGEL)
+      IF(IFAIL(IGOOD).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      CALL DELCON(SEQ1,ILEFTS(IGOOD),ILC(IGOOD),IDIM1)
+      ANS=1
+      IGELC=LNBR(LINCON(IGOOD))
+      LREG=1
+      RREG=RELPG(LINCON(IGOOD))
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      WRITE(IDEV,1020)LLINO,LLINO(IGOOD)
+1020  FORMAT(' Could not join contigs',I4,' and',I4,' but the gel',
+     +' has been entered into contig',I4,/,
+     +' If required do the join manually.')
+C
+C
+C                      ALL FAILURES
+C
+C
+C
+800   CONTINUE
+      WRITE(IDEV,10888)
+10888 FORMAT(' Failed gel name written to error file')
+      WRITE(IDEV8,1002)NAMARC
+      GO TO 1
+900   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' sequences processed'
+      WRITE(KBOUT,*)JNGEL,' sequences entered into database'
+      WRITE(KBOUT,*)JNJOIN,' joins made'
+      RETURN
+      END
+      SUBROUTINE DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXGEL,IMATC,IEMPTY,IOPT)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER SEQ2(MAXGEL),SEQC2(MAXGEL)
+      CHARACTER NAMARC*(*)
+C  deals with entering all readings into contig 1 (IOPT=2)
+C  or all readings into new contigs (IOPT=3)
+      IF(IOPT.EQ.2) THEN
+        IF(IMATC.EQ.0) THEN
+          ITYPE=0
+          ISENSE=1
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+          IEMPTY=0
+          IMATC = 1
+        ELSE
+          ITYPE= - 1
+          ISENSE=1
+          JOINT = 1
+          LLINO = 1
+          LINCON = IDBSIZ - 1
+          ITOTPC = 0
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+        END IF
+      ELSE IF(IOPT.EQ.3) THEN
+        ITYPE=0
+        ISENSE=1
+        IDOUT=MAXGEL
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+        IF(IFAIL.NE.0) RETURN
+      END IF
+      END
+      SUBROUTINE DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +NAMPRO,IDEV,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,MAXDB,IDM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMPRO*(*),NAMARC*16,GEL(MAXGEL)
+      CHARACTER V2,V1
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+      IERR=1
+1     CONTINUE
+      L = 1
+      V1='1'
+      CALL GTSTR('Make version',V1,V2,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(V2,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) V2 = V1
+      MN = NGELS + NCONTS + 1
+      MX = MAXDB
+      NEWSIZ = IDBSIZ
+      CALL GETINT(MN,MX,NEWSIZ,
+     +'New database size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL BUSY(KBOUT)
+      NEWSIZ = IVAL
+C     WHERE SHOULD CHARS BE CHANGED ?
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL WRITER(IDEV,0,MAXDB,NEWSIZ,MAXGEL,IDM)
+      CALL WRITER(IDEV,NEWSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      DO 10 I=1,NGELS
+        CALL WRITER(IDEV,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      M=NEWSIZ-NCONTS
+      N=IDBSIZ-NCONTS
+      DO 15 I=N,IDBSIZ-1
+        CALL WRITER(IDEV,M,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        M=M+1
+15    CONTINUE
+      CLOSE(UNIT=IDEV)
+C  DO SEQUENCES
+      NAMPRO(LLL:)='SQ'//V2
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 20 I=1,NGELS
+        CALL READW(IDEVW,I,GEL,MAXGEL)
+        CALL WRITEW(IDEV,I,GEL,MAXGEL)
+20    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   DO ARCHIVE NAMES
+      NAMPRO(LLL:)='AR'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 30 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        CALL WRITEN(IDEV,I,NAMARC)
+30    CONTINUE
+      CLOSE(UNIT=IDEV)
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error opening new database, copy aborted')
+      RETURN
+      END
+      SUBROUTINE DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,MAXGEL,MAXGLM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAME*16,NEWNAM*16
+      INTEGER L,M,N
+      CHARACTER GEL2(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' Warning:',
+     +' make a copy first, and check logical consistency after use')
+10    CONTINUE
+C
+C   SELECT OPTION
+C      CALL BELL(1,KBOUT)
+      CALL DBMENU(4,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+     +KBIN,KBOUT)
+      IF(NOPT.LT.1)RETURN
+      IF(NOPT.EQ.2)RETURN
+      IF(NOPT.EQ.3)THEN
+C       LINE CHANGE
+        MN = 0
+        MX = IDBSIZ
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.EQ.0) GO TO 18
+        LNO = IVAL
+        IF(LNO.EQ.IDBSIZ)GO TO 19
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)RELPG(LNO),LNGTHG(LNO),LNBR(LNO),RNBR(LNO)
+1001    FORMAT(' ',4I6)
+        MN = 0
+        MX = 99999
+        X  = RELPG(LNO)
+        CALL GETINT(MN,MX,X,
+     +  'Relative position',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = -MAXGEL
+        MX = 99999
+        L  = LNGTHG(LNO)
+        CALL GETINT(MN,MX,L,
+     +  'Length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = IDBSIZ
+        M  = LNBR(LNO)
+        CALL GETINT(MN,MX,M,
+     +  'Left neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        MN = 0
+        MX = IDBSIZ
+        N  = RNBR(LNO)
+        CALL GETINT(MN,MX,N,
+     +  'Right neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        N = IVAL
+        CALL WRITER(IDEVR,LNO,X,L,M,N)
+        RELPG(LNO)=X
+        LNGTHG(LNO)=L
+        LNBR(LNO)=M
+        RNBR(LNO)=N
+        GO TO 10
+ 18     CONTINUE
+C
+C deal with record 1/0 which contains: maxdb,actualdbsiz,maxgel,idm
+C
+        WRITE(KBOUT,*)
+     +  'Extreme caution: after this record is changed the program'
+        WRITE(KBOUT,*)
+     +  'should be restarted, and could malfunction!'
+        CALL READR(IDEVR,0,IMSIZ,IASIZ,MXG,IDA)
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)IMSIZ,IASIZ,MXG,IDA
+        MN = 0
+        MX = 9999999
+        X  = IMSIZ
+        CALL GETINT(MN,MX,X,
+     +  'Maximum database size',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = 0
+        MX = X
+        L  = IASIZ
+        CALL GETINT(MN,MX,L,
+     +  'Actual database size',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = MAXGLM
+        M  = MXG
+        CALL GETINT(MN,MX,M,
+     +  'Maximum reading length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        ID = 0
+        IF(IDA.EQ.26) ID = 1
+        CALL YESONO(ID,'Database is for DNA',
+     +  'Database is for protein',
+     +  IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ID.LT.0) GO TO 10
+        N = 5
+        IF(ID.EQ.0)N = 5
+        IF(ID.EQ.1)N = 26
+        CALL WRITER(IDEVR,0,X,L,M,N)
+        GO TO 10
+19      CONTINUE
+C       NCONTS NGELS LINES
+        MN = 0
+        MX = IDBSIZ
+        LL = NGELS
+        CALL GETINT(MN,MX,LL,
+     +  'Number of gel readings',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LL = IVAL
+        MN = 0
+        MX = IDBSIZ
+        MM  = NCONTS
+        CALL GETINT(MN,MX,MM,
+     +  'Number of contigs',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        MM = IVAL
+        CALL WRITER(IDEVR,IDBSIZ,LL,MM,LL,MM)
+        NGELS=LL
+        NCONTS=MM
+        GO TO 10
+      END IF
+C
+C
+      IF(NOPT.EQ.6)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of first gel reading to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNO)
+        IF(I.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'Problem with this gel reading. Check logical consistency'
+          WRITE(KBOUT,*)'of database. Shift not done'
+          GO TO 10
+        END IF
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'No contig line for this contig. Check logical'
+          WRITE(KBOUT,*)'consistency of database. Shift not done'
+          GO TO 10
+        END IF
+        MN = 1 - RELPG(LNO)
+        MX  = RELPG(NCONTO) - RELPG(LNO)
+        X = MN
+        CALL GETINT(MN,MX,X,
+     +  'Distance to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        X = IVAL
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,LNO,NCONTO,X)
+        WRITE(KBOUT,*)'Shift complete'
+        GO TO 10
+      END IF
+C
+      IF(NOPT.EQ.4)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to edit',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        IDIM=ABS(LNGTHG(LNO))
+C       ALLOW EDITING TO BE RELATIVE TO CONTIG POSITION
+        ISTART=RELPG(LNO)
+        CALL READW(IDEVW,LNO,GEL,MAXGEL)
+        CALL EDITGL(GEL,GEL2,IDIM,KBIN,KBOUT,MAXGEL,
+     +  IHELP1,IHELP2,FILEH,IDEVH,ISTART)
+        CALL WRITEW(IDEVW,LNO,GEL,MAXGEL)
+        LNGTHG(LNO)=SIGN(IDIM,LNGTHG(LNO))
+        CALL WRITER(IDEVR,LNO,RELPG(LNO),LNGTHG(LNO),
+     +  LNBR(LNO),RNBR(LNO))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.5)THEN
+        MN=IDBSIZ-NCONTS
+        MX = IDBSIZ - 1
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of contig line to delete',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LNO = IVAL
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  LNO,IDEVR)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.8)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to rename',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        CALL READN(IDEVN,LNO,NAME)
+        L = 16
+        CALL GTSTR('name for gel reading',
+     +  NAME,NEWNAM,L,KBOUT,KBIN,INFLAG)
+        IF(L.GT.0)CALL WRITEN(IDEVN,LNO,NEWNAM)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        MN = 0
+        MX = NGELS
+        IFROM  = 0
+        CALL GETINT(MN,MX,IFROM,
+     +  'Number of gel reading to move',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        IFROM = IVAL
+        MN = 0
+        MX = NGELS
+        ITO  = 0
+        CALL GETINT(MN,MX,ITO,
+     +  'New number for gel reading',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        ITO = IVAL
+        CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  GEL,IFROM,ITO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.9)THEN
+        CALL BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+        GO TO 10        
+      END IF
+      IF(NOPT.EQ.10) THEN
+        CALL REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        GO TO 10
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL)
+      INTEGER REMME,GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+C assumes db is logical consistent
+      WRITE(KBOUT,*)'Remove reading from database'
+      REMME = NGELS
+      CALL GETINT(1,NGELS,REMME,
+     +'Number of reading to remove',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      REMME = IVAL
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        WRITE(KBOUT,*)'No contig line for this reading'
+        IOK = 1
+        RETURN
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      END
+      SUBROUTINE REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+C Routine to remove a reading from a database
+C Cases: 1 left end
+C        2 right end
+C        3 internal and dispensible
+C        4 internal and indispensible
+C if 1 change contig lnbr, contig length, lnbr of rnbr of remme, relpgs
+C if 2 change contig rnbr, contig length, rnbr of lnbr of remme
+C if 3 change contig length, lnbr of rnbr of remme rnbr of lnbr of remme
+C if 4 need to break contig, then as for 1
+C if 1 and 2 then also remove contig line
+C for all cases move gel ngels to remme (if remme/=ngels)
+C and update line idbsiz
+C
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME,CLEN
+      LOGICAL LEFTE,RIGHTE,DISPEN
+      CHARACTER GEL(MAXGEL)
+      EXTERNAL CLEN
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+      DISPEN = .FALSE.
+C
+C Left end ?
+C
+      IF(LNBR(REMME).EQ.0) LEFTE = .TRUE.
+C
+C Right end ?
+C
+      IF(RNBR(REMME).EQ.0) RIGHTE = .TRUE.
+C
+C If both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading and contig'
+        IFROM = NGELS
+        NGELS = NGELS - 1
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,IDEVR)
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+      ELSE IF(LEFTE) THEN
+        WRITE(KBOUT,*)'Removing reading from left end of contig'
+        LNBR(ICONT) = RNBR(REMME)
+        I = 1 - RELPG(RNBR(REMME))
+        WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,RNBR(REMME),ICONT,I)
+        I = LNBR(ICONT)
+        LNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE IF(RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading from right end of contig'
+        RNBR(ICONT) = LNBR(REMME)
+        I = RNBR(ICONT)
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNBR(ICONT))
+        CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +  LNBR(ICONT),RNBR(ICONT))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE
+C Is remme indispensible ?
+        NSTART = RELPG(RNBR(REMME))
+        I = REMME
+10      CONTINUE
+        I = LNBR(I)
+        IF(I.NE.0) THEN
+          IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.NSTART) GO TO 10
+          DISPEN = .TRUE.
+        END IF
+        IF(DISPEN) THEN
+          WRITE(KBOUT,*)
+     +    'Removing dispensible reading from middle of contig'
+          I = LNBR(REMME)
+          RNBR(I) = RNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          I = RNBR(REMME)
+          LNBR(I) = LNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNBR(ICONT))
+          CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +    LNBR(ICONT),RNBR(ICONT))
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        ELSE
+          WRITE(KBOUT,*)
+     +    'Removing indispensible reading from middle of contig'
+          WRITE(KBOUT,*)'So breaking contig first'
+          IR = REMME
+          IL = LNBR(REMME)
+          ILO = LNBR(ICONT)
+          NCONTO = ICONT
+          NCONTR = IDBSIZ - NCONTS - 1
+          CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+          IF(IOK.NE.0) RETURN
+          WRITE(KBOUT,*)'Removing reading from left end of contig'
+          ICONT = IDBSIZ - NCONTS
+          LNBR(ICONT) = RNBR(REMME)
+          I = 1 - RELPG(RNBR(REMME))
+          WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+          CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +    IDBSIZ,RNBR(REMME),ICONT,I)
+          I = LNBR(ICONT)
+          LNBR(I) = 0
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        END IF
+      END IF
+      END
+      SUBROUTINE REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,IDEVR)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME
+C Routine to remove a contig line from a db
+C Loop deals with case of remove top contig
+C Move down all lines from above
+      DO 10 I = REMME,IDBSIZ-NCONTS+1,-1
+        RELPG(I) = RELPG(I-1)
+        LNGTHG(I) = LNGTHG(I-1)
+        LNBR(I) = LNBR(I-1)
+        RNBR(I) = RNBR(I-1)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      NCONTS = NCONTS - 1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      END
+      SUBROUTINE MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +GEL,FROM,TO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+C   Subroutine to move a gel from line from to line to
+C   Extended 22-5-91
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),FROM,TO
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMGEL*16,GEL(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      LOGICAL LEFTE,RIGHTE
+      EXTERNAL GCLIN,CHAINL
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+C
+C left end ?
+C
+      IF(LNBR(FROM).EQ.0) LEFTE = .TRUE.
+C
+C right end ?
+C
+      IF(RNBR(FROM).EQ.0) RIGHTE = .TRUE.
+C
+C if both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          RNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(LEFTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(RIGHTE) THEN
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no right neighbour and does not'
+          WRITE(KBOUT,*)'appear in a contig!'
+        ELSE
+          IF(RNBR(NCONTO).NE.FROM)THEN
+            WRITE(KBOUT,*)
+     +        'This gel has no right neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+          ELSE
+            RNBR(NCONTO) = TO
+            CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +        LNBR(NCONTO),RNBR(NCONTO))
+          END IF
+        END IF
+      END IF
+      RELPG(TO)=RELPG(FROM)
+      LNGTHG(TO)=LNGTHG(FROM)
+      LNBR(TO)=LNBR(FROM)
+      RNBR(TO)=RNBR(FROM)
+      CALL READW(IDEVW,FROM,GEL,MAXGEL)
+      CALL WRITEW(IDEVW,TO,GEL,MAXGEL)
+      CALL READN(IDEVN,FROM,NAMGEL)
+      CALL WRITEN(IDEVN,TO,NAMGEL)
+      CALL WRITER(IDEVR,TO,RELPG(TO),LNGTHG(TO),
+     +LNBR(TO),RNBR(TO))
+C   Do neighbours
+      IF(LNBR(FROM).NE.0) THEN
+        I=LNBR(FROM)
+        RNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      IF(RNBR(FROM).NE.0) THEN
+        I=RNBR(FROM)
+        LNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      END
+      SUBROUTINE DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +IDBSIS,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,MAXGEL,MAXGLM,
+     +LLINO,
+     +IDM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER GEL(MAXGLM)
+      INTEGER RELPG(IDBSIS)
+      INTEGER LNGTHG(IDBSIS),LNBR(IDBSIS),RNBR(IDBSIS)
+      CHARACTER NAMPRO*(*),COPYNO*4,HELPF*(*)
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+C  NOTE THIS IS THE MACHINES WORD LENGTH IE HOW MANY CHARS PER WORD
+      CALL FILLI(RELPG,IDBSIS,0)
+      CALL FILLI(LNGTHG,IDBSIS,0)
+      CALL FILLI(LNBR,IDBSIS,0)
+      CALL FILLI(RNBR,IDBSIS,0)
+      NAMPRO(1:)=' '
+      IERR=1
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Project name',' ',NAMPRO,L,KBOUT,KBIN,INFLAG)
+      IF(L.LT.1) RETURN
+      LL = L
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      L = 1
+      CALL GTSTR('Version','0',COPYNO,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(COPYNO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) COPYNO(1:1) = '0'
+C   GET .ARN FIRST AS IT CONTAINS LENGTH OF OTHER FILES
+      NAMPRO(LL+1:LL+1)='.'
+      NAMPRO(LL+2:LL+2)='A'
+      NAMPRO(LL+3:LL+3)='R'
+      NAMPRO(LL+4:LL+4)=COPYNO(1:1)
+       CALL OPENRS(IDEVN,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      NAMPRO(LL+2:LL+2)='R'
+      NAMPRO(LL+3:LL+3)='L'
+       CALL OPENRS(IDEVR,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      CALL READR(IDEVR,0,IDBST,IDBSIZ,MAXGEL,IDM)
+C
+C Do I really need to look at this value? I could simply compare
+C IDBSIS with IDBSIZ which is what really counts. Cannot remember
+C why IDBST is stored!!!!!!!
+C
+      IF(IDBST.GT.IDBSIS) THEN
+        CALL ERROM(KBOUT,
+     +  'Fatal error: database size too large for program')
+        GO TO 100
+      END IF
+      NAMPRO(LL+2:LL+2)='S'
+      NAMPRO(LL+3:LL+3)='Q'
+C  DEFINE RECORD LENGTH IN TERMS OF NUMBER OF CHARS PER WORD (4 ON VAX)
+C  AND MAXGEL SIZE 
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEVW,NAMPRO,IOK,IREC,4)
+      IF(IOK.NE.0)GO TO 100
+C   READ A LINE FOR LUCK
+      CALL READW(IDEVW,1,GEL,MAXGEL)
+      CALL READR(IDEVR,IDBSIZ,NGELS,NCONTS,IDUM1,IDUM2)
+      WRITE(KBOUT,10011)NGELS,NCONTS,IDBSIZ,MAXGEL
+10011 FORMAT(' Number of gel readings ',I6,' Number of contigs ',I6,/,
+     +' Database size ',I6,' Maximum gel reading length ',I4)
+      LLINO = 0
+      IF(NGELS.LT.1)GO TO 5
+      DO 3 I=1,NGELS
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+3     CONTINUE
+      N=IDBSIZ-NCONTS
+      MXT = 0
+      DO 4 I=N,IDBSIZ-1
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        IF(RELPG(I).GT.MXT) THEN
+          MXT = RELPG(I)
+          LLINO = LNBR(I)
+        END IF
+4     CONTINUE
+5     CONTINUE
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error encountered opening database files')
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      END
+      SUBROUTINE DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,
+     +IDEV1,IDEV2,IDEV3,IERR,IHELPS,IHELPE,IDEVH,FILEH,MAXGEL,
+     +MAXGLM,IDM)
+      CHARACTER GEL(MAXGLM),FILEH*(*)
+      CHARACTER NAMPRO*(*)
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+      IERR=1
+3     CONTINUE
+      MN = 0
+      CALL GTSTR('New project name',' ',NAMPRO,MN,KBOUT,KBIN,INFLAG)
+      IF(MN.LT.1) RETURN
+      LL = MIN(12,MN)
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 3
+      END IF
+      MN = 10
+      MX = IDBSIS
+      IDBSIZ = 50
+      CALL GETINT(MN,MX,IDBSIZ,
+     +'Database size',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IDBSIZ = IVAL
+5     CONTINUE
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MIN(512,MAXGEL)
+      CALL GETINT(MN,MX,MAXIN1,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(IVAL,MAXIN1).NE.0) THEN
+        MAXGEL = 512 + (IVAL/512)*512
+        WRITE(KBOUT,*)'Maximum set to',MAXGEL
+      END IF
+      CALL YESNO(IDM,'Database is for DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(IDM.LT.0) RETURN
+      IF(IDM.EQ.0)IDM = 5
+      IF(IDM.EQ.1)IDM = 26
+      NAMPRO(LL+1:LL+1)='.'
+      NAMPRO(LL+2:LL+2)='R'
+      NAMPRO(LL+3:LL+3)='L'
+      NAMPRO(LL+4:LL+4)='0'
+      CALL OPENRS(IDEV1,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Writing new database files')
+      J = 0
+      K = 0
+      N = 0
+      M = 0
+C
+C Put maximum possible database size, actual size, max read length and type
+C in record 0 of relationships (actually record 1, as writer adds 1).
+C
+      CALL WRITER(IDEV1,0,IDBSIS,IDBSIZ,MAXGEL,IDM)
+C
+C Put zeroes in al other records
+C
+      DO 10 I=1,IDBSIZ
+        CALL WRITER(IDEV1,I,J,K,M,N)
+10    CONTINUE      
+      NAMPRO(LL+2:LL+2)='S'
+      NAMPRO(LL+3:LL+3)='Q'
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV2,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL FILLC(GEL,MAXGEL,' ')
+C      CALL WRITEW(IDEV2,IDBSIZ,GEL,MAXGEL)
+C
+C  write only the first record into the working versions
+C  and assume others will be added when required
+C
+      CALL WRITEW(IDEV2,1,GEL,MAXGEL)
+      NAMPRO(LL+2:LL+2)='A'
+      NAMPRO(LL+3:LL+3)='R'
+      CALL OPENRS(IDEV3,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+C
+C  write only the first record into the names
+C  and assume others will be added when required
+C
+      CALL WRITEN(IDEV3,1,'                ')
+      WRITE(KBOUT,1003)NAMPRO(1:LL),IDBSIZ
+1003  FORMAT(' Database ',A,' version 0, size ',I6,
+     +' successfully started')
+      WRITE(KBOUT,1006)NAMPRO(1:LL)
+1006  FORMAT(
+     +' Note the 3 database files  are named ',A,'.AR0, .SQ0, .RL0')
+      IERR=0
+      NAMPRO(LL+2:LL+2) = '0'
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error when trying to open database,',
+     +' database not started')
+      RETURN
+      END
+      SUBROUTINE ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,IDEVG,
+     +LINLEN,PERCD,NAMARC,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM,NCONTC)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPS2,HELPE1,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER X,Y,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER NAMARK*16
+      CHARACTER GEL2(MAXGEL)
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(
+C     +' Enter a new gel reading into the database.',//,
+C     +' Note that before a gel reading is entered into the',/,
+C     +' database it must be compared  with the database to ',/,
+C     +' search for overlaps using a function such as auto assemble',/,
+C     +' The replies to all the questions that follow are given',/,
+C     +' by these searches',//)
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Current database size=',I5,' extend with copy')
+      RETURN
+5     CONTINUE
+C      WRITE(KBOUT,1018)
+C1018  FORMAT(
+C     +' We need the name of the file containing the gel reading')
+      CALL OPENF1(IDEVG,NAMARC,0,IOK,KBIN,KBOUT,
+     +'File name of gel reading to enter',
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(NGELS.GT.0) THEN
+        DO 10 J=1,NGELS
+          CALL READN(IDEVN,J,NAMARK)
+          IF(NAMARC(1:16).NE.NAMARK)GO TO 10
+          WRITE(KBOUT,1013)J
+1013      FORMAT( ' Gel reading already in database (number,',I6,
+     +    '), entry aborted')
+          RETURN
+10      CONTINUE
+      END IF
+      IDIM=MAXGEL
+      CALL ARRFIM(IDEVG,GEL,IDIM,KBOUT)
+      CLOSE(UNIT=IDEVG)
+      NGELS=NGELS+1
+      LNGTHG(NGELS)=IDIM
+      NAMARK=NAMARC(1:16)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT( ' This gel reading has been given the number ',I6)
+      ANS=0
+      IF(NGELS.GT.1)THEN
+C        WRITE(KBOUT,1017)
+C1017    FORMAT(
+C     +' We need to know if the gel reading overlaps data already in',
+C     +' the database')
+        CALL YESNO(ANS,'New gel overlaps a contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.LT.0) THEN
+          NGELS = NGELS - 1
+          RETURN
+        END IF
+        IF(ANS.EQ.0)GO TO 100
+      END IF
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      NCONTS=NCONTS+1
+      N=IDBSIZ-NCONTS
+      LNBR(N)=NGELS
+      RNBR(N)=NGELS
+      RELPG(N)=(IDIM)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RETURN
+C
+100   CONTINUE
+C      WRITE(KBOUT,1019)
+C1019  FORMAT(
+C     +' We need to know if the new gel reading overlaps in the',/,
+C     +' normal or complementary sense')
+C   COMPLEMENT GEL IF REQUIRED
+      CALL YESNO(ANS,'Gel overlaps in normal sense',
+     +HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      IF(ANS.EQ.1) THEN
+        CALL SQREV(GEL,IDIM)
+        CALL SQCOM(GEL,IDIM)
+        LNGTHG(NGELS)=-1*LNGTHG(NGELS)
+      END IF
+C      WRITE(KBOUT,1020)
+C1020  FORMAT(
+C     +' We need to know which contig the new gel reading overlaps')
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,NCONTC,
+     +IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IERR.NE.0) THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+C      WRITE(KBOUT,1014)
+C1014  FORMAT(
+C     +' We need information about the position of the overlap.',/,
+C     +' First there are two types: those that extend the contig',/,
+C     +' leftwards and those that start internally.')
+        CALL YESNO(ANS,'Left end of gel reading is inside contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0)THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+      IF(ANS.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+C      WRITE(KBOUT,1015)
+C1015  FORMAT(' Now we need to know the exact position of the overlap')
+160   CONTINUE
+      MN = 1
+      MX = RELPG(LINCON)
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in contig of left end of gel reading',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 160
+      N=NCONTC
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+      IF(RNBR(N).EQ.0)GO TO 350
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   DISPLAY JOINT
+      NOPT = 1
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+C   CHECK ON RESULT
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 260
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+260   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(LNBR(N))=NGELS
+      K=LNBR(N)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   DSPLAY JOINT
+      NOPT = 1
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 360
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+360   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      RNBR(LINCON)=NGELS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+C      WRITE(KBOUT,1015)
+410   CONTINUE
+      MN = 1
+      MX = IDIM
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in new gel reading of left end of contig',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 410
+C   SET FLAG TO SHOW LEFT END OVERLAP
+      NOPT=2
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 400
+      IF(NOPT.EQ.3)GO TO 420
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+420   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+      LNBR(LINCON)=NGELS
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      GO TO 440
+      END
+      SUBROUTINE PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,NC,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),POSN,X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+C   NOW GET THIS GEL FROM DISK
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      X=POSN-RELPG(LLINO)+1
+      K=X
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+C   CHECK FOR OVER END OF ARRAY
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,1000)LLINO
+1000    FORMAT(
+     +' Data pushed off end of gel',I4,' during padding')
+        M=M-(N-MAXGEL)
+        N=MAXGEL
+        LNGTHG(LLINO)=SIGN(MAXGEL,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        GEL(I)=PAD
+60    CONTINUE
+C   WRITE BACK TO DISK
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+65    CONTINUE
+C   NOW GET NEXT GEL
+      LLINO=RNBR(LLINO)
+C   LAST GEL?
+      IF(LLINO.EQ.0)GO TO 70
+C   DOES IT HAVE DATA IN REGION?
+C   IE DO RELPG  AND RELPG+LNGTHG-1 LIE EITHER SIDE OF POSN?
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+C  WITHIN
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+NC
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 76
+90    CONTINUE
+C   NEED TO INCREMENT CONTIG LINE
+      RELPG(LINCON)=RELPG(LINCON)+NC
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      END
+      SUBROUTINE AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER ANS,WINDOW
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      IFAIL(1) = 0
+      IF(NGELS.LT.1) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MN = 2
+      MX = 5
+      MINSLI = MAX(3,MN)
+      CALL GETINT(MN,MX,MINSLI,
+     +'Minimum alignment block',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINSLI = IVAL
+      MN = 0
+      MX = 25
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per sequence',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MAXPC = IVAL
+        RMN = 0.
+        RMX = 20.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      MN = MINMAT
+      MX = MAXGEL
+      WINDOW = 100
+      CALL GETINT(MN,MX,WINDOW,
+     +'Probe length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+       IWING = 0
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD,SEQ5,
+     +CENDS,NENDS,MAXCON,IWING,NBAD)
+      IDIM2=MAXGEL
+C
+C Note I am doing something horrible here to save space:
+C sending cends and nends to jcons to get the extension lengths
+C then swapping them with temp3 by sending temp to ajoin and
+C receiving them as cends and nends, and sending cends nends
+C and receiving them as iladd,iradd
+      CALL AUTOJN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,TEMP3,TEMP3(MAXCON+1,1),MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,CENDS,NENDS,RELPG,LNBR,IDBSIZ,NCONTS)
+      END
+      SUBROUTINE AUTOJN(SEQ1,IDIMIN,GEL,IDIMGI,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,ILADD,IRADD,RELPG,LNBR,IDBSIZ,NCONTS)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIMIN),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON),ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIMIN),GEL(MAXGLM)
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 10)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC),MCON(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),WINDOW
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ)
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ1,IDIMIN,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C find possible missed joins
+C we have consensus in seq1 in order first contig,second contig etc
+C compare the ends in reverse order, simultaneously shortening the consensus
+      IDIM = IDIMIN
+      DO 200 JCON = IDCEND,2,-1
+        IDIM = CENDS(JCON) - 1
+        CALL ENCO(SEQ1,IDIM,POSNS,CONST,LENGTH)
+        CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+        JS = CENDS(JCON) + 20
+        JE = CENDS(JCON+1) - 1
+        IEND = 1
+        IDIMG = MIN(WINDOW,JE-JS+1)
+C        WRITE(*,*)JCON,JS,JE,IDIMG
+1       CONTINUE
+        IMATC = 0
+        IF(IEND.EQ.1) THEN
+          CALL SQCOPY(SEQ1(JE-IDIMG+1),GEL,IDIMG)
+        ELSE
+          CALL SQCOPY(SEQ1(JS),GEL,IDIMG)
+        END IF
+        CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+2     CONTINUE
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0) THEN
+         CALL ERROM(KBOUT,'Error in CFGEL')
+         RETURN
+      END IF
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT,MCON)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+C      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.NE.0) THEN
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+C        IF(IEND.EQ.1) WRITE(KBOUT,*)'Left end of Contig',NENDS(JCON)
+C        IF(IEND.EQ.2) WRITE(KBOUT,*)'Right end of Contig',NENDS(JCON)
+C        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINEJ(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS,
+     +  NENDS(JCON),IEND,JLLINO(I),JSENSE(I),ILADD,IRADD,MAXCON,MCON(I),
+     +  JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+100   CONTINUE
+      END IF
+      IEND = IEND + 1
+      IF(IEND.EQ.2) GO TO 1
+      IDIM = CENDS(JCON)-1
+ 200  CONTINUE
+      END
+      SUBROUTINE ALINEJ(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM,JCONN,IEND,NCON,JSENSE,ILADD,IRADD,MAXCON,MCON,JCON,
+     +RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV),WINDOW
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+C      WRITE(KBOUT,1052)ITOTPC,ITOTPG
+C1052  FORMAT(' Padding in contig= ',I4,' and in gel= ',I4)
+      CALL JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   ARE ALL CHECKS OK?
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(PERCM.GT.PERMAX)IFAIL=1
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH,STRAND,NAME1*6,NAME2*6
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      INTEGER CLINNO,WINDOW
+      EXTERNAL CLINNO
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(
+     +' Matching region too long for routine dalign. Alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF(X.LT.PERMAX) THEN
+        IC = CLINNO(LNBR,IDBSIZ,NCONTS,JCONN)
+        IF(IC.EQ.0) THEN
+          IFAIL = 99
+          RETURN
+        END IF
+        IF(JSENSE.EQ.-1) THEN
+          STRAND = '-'
+          IF(IEND.EQ.1) THEN
+C
+C  probe is complement of right hand end of contig. Give posns assuming
+C  this contig is complemented.
+C
+            IENDGT = IENDG - IRADD(JCON)
+C
+C Next line gives posns relative to original orientation
+C            IENDGT = RELPG(IC) + IRADD(JCON) - IENDG + 1
+          ELSE
+C
+C  probe is complement of left end of contig. Give posns assuming this
+C  contig is going to be complemented.
+C
+            IENDGT = RELPG(IC) - WINDOW + ILADD(JCON) + IENDG
+          END IF
+        ELSE
+          STRAND = '+'
+          IF(IEND.EQ.2) THEN
+C
+C  probe is left hand end of contig in original sense
+C
+            IENDGT = IENDG - ILADD(JCON)
+          ELSE
+C
+C  probe is right hand end of contig in original sense
+C
+            IENDGT = RELPG(IC) + IRADD(JCON) - WINDOW + IENDG
+          END IF
+        END IF
+        IENDCT = IENDC - ILADD(MCON)
+          WRITE(KBOUT,*)
+     +    ' Possible join between contig ',JCONN,' in the ',
+     +    STRAND,' sense and contig ',NCON
+        WRITE(KBOUT,1000)X
+1000    FORMAT(' Percentage mismatch after alignment = ',F4.1)
+        WRITE(NAME1,1002)JCONN
+        WRITE(NAME2,1002)NCON
+ 1002   FORMAT(I6)
+        CALL FMT4LP(SEQC2(1),SEQG2(IENDG),LO,IENDCT,IENDGT,KBOUT,
+     +  NAME2,NAME1)
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      END
+      SUBROUTINE ADISM6(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C new bit need to save contig number for alinej
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        MCON(IMATC) = JJ
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+C      JCONS
+      SUBROUTINE JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,ANS,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD,TGEL,ILADD,IRADD,MAXCON,
+     +IWIN,NBAD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ,ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),TGEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CALL BUSY(KBOUT)
+      IF(IWIN.GT.0) PERC = REAL(NBAD)/REAL(IWIN)
+      IEND = 1
+      IFAIL=0
+      N=IDBSIZ-NCONTS
+      NCONS = 0
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        ISTART=ISTART+1
+        IF((ISTART+19+Y+2*MAXGEL).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded',/,
+     +    ' calculation aborted')
+          IFAIL=1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        NCONS = NCONS + 1
+        IDIN = 0
+        ILADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +  IDM,PERCD)
+        ISTART=ISTART+Y-1
+        IDIN = 0
+        IRADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+110   CONTINUE
+      END
diff --git a/src/bap/dstrand.c b/src/bap/dstrand.c
new file mode 100644
index 0000000..9fb2617
--- /dev/null
+++ b/src/bap/dstrand.c
@@ -0,0 +1,780 @@
+/*#define DEBUG_DSTRAND 1*/
+
+#include <stdio.h>
+#include <string.h>
+#include <stdlib.h>
+#include "dstrand.h"
+#include "guk.h"
+#include "edUtils.h"
+#include "fort.h"
+#include "misc.h"
+
+/* Static global variables */
+static int_f last_gel;	/* remembers where we last were for		*/
+    			/* for efficiency (defaults leftmost)		*/
+
+static int countdb;	/* how many bases have been double stranded	*/
+static int consins;	/* how many insertions into consensus		*/
+
+/*
+ * SunOS doesn't have a memmove() function, so we use bcopy() instead.
+ */
+#ifdef NOMEMMOVE
+#    define memmove(a,b,c) bcopy(b,a,c)
+#endif
+
+#ifdef notdef
+/*----------------------------------------------------------------------*/
+
+#define MAXINS (3)	/* maximum number of neighbouring inserts	*/
+#define INSFREQ (15)	/* lowest len/inserts frequency allowed		*/
+
+/*
+ * Evaluates how good an alignment is.
+ * The integer returned is how much of the alignment to trust (the length
+ * of this trusted part).
+ */
+static int evalal(char *seq1, char *seq2, int *albuf, int_f seqlen) {
+    int continserts, freqinserts, scan;
+    
+    /* display the alignment as it stands */
+#if DEBUG_DSTRAND
+    dispmm_(seq1, &seqlen, seq2, &seqlen, albuf, 0, 0);
+#endif
+
+    /* end if more than MAXINS inserts or len/inserts < INSFREQ */
+    continserts = 0;
+    freqinserts = -1;
+    for (scan = 0; scan < seqlen; scan++) {
+	if (albuf[scan] != 0) {
+	    continserts++;
+	    freqinserts++;
+	    if (continserts >= MAXINS) {
+		scan-=continserts;
+		break;
+	    }
+	    if (freqinserts && scan/freqinserts <= INSFREQ) {
+		scan--;
+		break;
+	    }
+	} else
+	    continserts = 0;
+    }
+#if DEBUG_DSTRAND
+    fprintf(DSTR, "scan:%d, conti:%d, freqi:%d\n",
+	    scan, continserts, freqinserts);
+#endif
+
+    return scan;
+}
+#endif
+
+#ifdef notdef
+/*----------------------------------------------------------------------*/
+
+/*
+ * Evaluates how good an alignment is.
+ * The integer returned is how much of the alignment to trust (the length
+ * of this trusted part).
+ */
+static int evalal(char *seq1, char *seq2, int *albuf, int_f seqlen) {
+    int op = 0, scan = 0, score = 20;
+    float mismatch = 0, cutgaps = 0, congaps = 0;
+    int *alptr = albuf;
+    char *s1ptr = seq1, *s2ptr = seq2;
+    
+    /* display the alignment as it stands */
+#if DEBUG_DSTRAND
+    dispmm_(seq1, &seqlen, seq2, &seqlen, albuf, 0, 0);
+#endif
+
+    /*
+     * Need to add max_score - cut the score off when it gets too high so
+     * we cannot accumalate good data and hence resistance to bad data.
+     */
+    while (scan++ < seqlen) {
+	mismatch*=.95;
+	cutgaps*=.95;
+	congaps*=.95;
+	if (op == 0 && *alptr == 0) {
+	    if (*s1ptr == *s2ptr) {
+#if DEBUG_DSTRAND == 2
+		fprintf(DSTR, "'%c' = '%c'", *s1ptr, *s2ptr);
+#endif
+		score++;
+	    } else {
+#if DEBUG_DSTRAND == 2
+		fprintf(DSTR, "'%c' ! '%c'", *s1ptr, *s2ptr);
+#endif
+		score-=11;
+		mismatch++;
+	    }
+	    alptr++;
+	    s1ptr++;
+	    s2ptr++;
+	} else {
+	    if (op == 0) {
+		op = *alptr++;
+	    }
+
+	    if (op > 0) {
+#if DEBUG_DSTRAND == 2
+		fprintf(DSTR, "' ' - '%c'", *s2ptr);
+#endif
+		op--;
+		s2ptr++;
+		cutgaps++;
+		mismatch++;
+		score-=(cutgaps*2);
+	    } else /* op < 0 */ {
+#if DEBUG_DSTRAND == 2
+		fprintf(DSTR, "'%c' - ' '", *s1ptr);
+#endif
+		op++;
+		s1ptr++;
+		congaps++;
+		mismatch++;
+		score-=(congaps*2);
+	    }
+	}
+#if DEBUG_DSTRAND == 2
+	fprintf(DSTR," score=%4d, mismatch=%2.2f, gaps(cut)=%2.2f, gaps(con)=%2.2f, off=%d\n",
+	score, mismatch, cutgaps, congaps, scan); 
+#endif
+	
+	if (mismatch > 3.5 || cutgaps > 3 || congaps > 1)
+	    break;
+    }
+    return scan -1;
+}
+#endif
+
+/*----------------------------------------------------------------------*/
+
+/*
+ * Evaluates how good an alignment is.
+ * The integer returned is how much of the alignment to trust (the length
+ * of this trusted part).
+ */
+static int evalal(char *seq1, char *seq2, int *albuf, int_f seqlen,
+		  int_f maxmis, int_f missc, int_f matchsc, int_f padsc) {
+    int score = 0, op = 0, scan = 0, miscount = 0, bestscan = 0, bestscore = 0;
+    int *alptr = albuf;
+    char *s1ptr = seq1, *s2ptr = seq2;
+
+    /* display the alignment as it stands */
+#if DEBUG_DSTRAND
+    dispmm_(seq1, &seqlen, seq2, &seqlen, albuf, 0, 0);
+#endif
+
+    while (scan < seqlen && miscount <= maxmis) {
+	if (score >= bestscore) {
+	    bestscore = score;
+	    bestscan = scan;
+	}
+	if (op == 0 && *alptr == 0) {
+	    if (*s1ptr == *s2ptr) {
+		/* correct match */
+		/*
+		 * Note *FAILURE* here to do reasonable things if both the
+		 * aligned sequences happen to contain a '-'. It will simply
+		 * assume that both the sequences agree with each other and
+		 * give them a positive score. However for speeds sake it's
+		 * one less thing to check and the end result is that very
+		 * ocassionally we'll double strand something we should have.
+		 * This will show up in the quality checks so it'll still
+		 * get dealt with.
+		 */
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "'%c' = '%c'", *s1ptr, *s2ptr);
+#endif
+		score += matchsc;
+	    } else {
+		/* incorrect match */
+		/*
+		 * Should we count less for failing to match our new sequence
+		 * against the existing one (consensus in this case) then the
+		 * existing one happens to be a '-' (ie unknown). In this case
+		 * if the consensus is unknown should we penalise the sequence
+		 * regardless? Currently we do....
+		 */
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "'%c' ! '%c'", *s1ptr, *s2ptr);
+#endif
+		score += missc;
+		miscount++;
+	    }
+	    alptr++;
+	    s1ptr++;
+	    s2ptr++;
+	} else {
+	    miscount++;
+	    if (op == 0) {
+		op = *alptr++;
+	    }
+	    if (op > 0) {
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "' ' - '%c'", *s2ptr);
+#endif
+		/* pad in sequence */
+		score += padsc;
+		op--;
+		s2ptr++;
+	    } else {
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "'%c' - ' '", *s1ptr);
+#endif
+		/* pad in consensus */
+		score += padsc;
+		op++;
+		s1ptr++;
+	    }
+	}
+#if DEBUG_DSTRAND
+	fprintf(DSTR, " score=%4d, off=%3d, miscount=%d\n",
+		score, scan, miscount);
+#endif
+	scan++;
+    }
+
+    return bestscan;
+}
+
+/*----------------------------------------------------------------------*/
+
+/*
+ * Generates two buffers from an alignment. Each buffer represents a sequence
+ * of edits to perform on the contig in an easily parsable fashion.
+ */
+static void dstrform(char *seq1, int *albuf, int_f seqlen, char *new1,
+		     char *new2) {
+    int_f scan = 0;
+    int op = 0;
+    char *ptr1 = new1, *ptr2 = new2, *seq1ptr = seq1;
+    int *alptr = albuf;
+
+    while (scan++ < seqlen) {
+	*ptr1 = '.';
+	*ptr2 = '.';
+	if (op == 0 && *alptr == 0) {
+	    alptr++;
+	    *ptr1 = *seq1ptr++;
+	} else {
+	    if (op == 0)
+		op = *alptr++;
+	    if (op > 0) {
+		*ptr1 = '*';
+		op--;
+	    } else {
+		*ptr2 = '*';
+		*ptr1 = *seq1ptr++;
+		op++;
+	    }
+	}
+	ptr1++;
+	ptr2++;
+    }
+    *ptr1 = '\0';
+    *ptr2 = '\0';
+}
+
+/*----------------------------------------------------------------------*/
+
+/*
+ * patch() will attempt to 'patch' up a single stranded section into a double
+ * stranded section. The integer returned is the actual length we managed to
+ * double strand.
+ */
+static int patch(
+     int_f *relpg_p,	/* int array - relative position (gel reading)	*/
+			/*        or - length of contig 		*/
+     int_f *lngthg_p,	/* int array - length of gel 			*/
+			/*        or - empty for contig 		*/
+     int_f *lnbr_p,	/* int array - left nodes			*/
+     int_f *rnbr_p,	/* int array - right nodes			*/
+     int_f  maxgel,	/* maximum gel length				*/
+     int_f  off,	/* current relative offset in contig		*/
+     int_f  plen,	/* length of hole to patch			*/
+     char  *cons_p,	/* the consensus				*/
+     int_f *ngels_p,	/* int - passed back to fortran			*/
+     int_f *kbout_p,	/* int - passed back to fortran (output stream)	*/
+     int_f *idbsiz_p,	/* int - passed back to fortran			*/
+     char  *gel_p,	/* char array - passed back to fortran (tmp buf)*/
+     int_f *idevr_p,	/* int - passed back to fortran (relations db)	*/
+     int_f *idevw_p,	/* int - passed back to fortran (working db)	*/
+     int_f *idevn_p,	/* int - passed back to fortran (name db)	*/
+     int_f *lincon_p,	/* int - passed back to fortran			*/
+     int_f *nconts_p,	/* int - passed back to fortran			*/
+     int_f *cons_e,	/* int - extension of cons_p 			*/
+     int_fl gel_l,	/* int - length of gel_p 			*/
+     int_f  rreg,	/* length of consensus buffer (?!?)		*/
+     int_f  maxmis,	/* int - maximum number of mismatches		*/
+     int_f  missc,	/* int - score for mismatch			*/
+     int_f  matchsc,	/* int - score of match				*/
+     int_f  padsc,	/* int - score for pad				*/
+     int_f  sense	/* int - direction of double stranding		*/
+     )
+{
+    /* Local variables */
+    int sc;		/* a score for how good the alignment is 	*/
+    int_f cur_gel;	/* record of which gel we're looking at 	*/
+    int_f bestgeln = 0;	/* the best gel num found so far		*/
+    int bestlen = 0;	/* longest 'patch' coverage found so far	*/
+    int bestuse = 0;	/* longest needed gel extension found so far	*/
+    int *albuf = (int *)malloc(maxgel * 2 * sizeof(int));
+			/* buffer for aligned sequences			*/
+    int_f fail;		/* does getext() succeed?			*/
+    char *cutbuf = (char *)malloc(maxgel);
+			/* buffer for cutoff data			*/
+    int_f cutlen;	/* length of cutoff data			*/
+    int_f tgelend, gelend;
+			/* end of current gel (t = including cutoff)	*/
+    int tmp, len, len2, one = 1;
+    int_f tmp_f, posn;
+    char *newb1, *newb2;
+    char gelname[DB_NAMELEN + 1];
+
+    /*
+     * We're going to try to 'patch' this single strand section.
+     * So we need to find which gels (in the positive direction)
+     * exist at the start of the 'hole'.
+     */
+    
+    cur_gel = last_gel;
+    do {
+	/* positive only */
+	if (lngthg_p[cur_gel] >= 0) {
+	    /* are we two far past? */
+	    if (relpg_p[cur_gel] >= off)
+		break;
+	    /*
+	     * If the end of the gel covers this gap (ie NOT the
+	     * cutoff data) then we've obviously found a bit like so:
+	     * (unlikely though it is).
+	     * -------AGAAT------->
+	     * -------GAGCC------->
+	     * <-------------------
+	     */
+	    gelend = relpg_p[cur_gel] + lngthg_p[cur_gel];
+	    /*
+	     * if (gelend > off)
+	     *     return plen;
+	     */
+
+	    if (relpg_p[cur_gel] + maxgel > off) {
+		/* get cutoff data so we can compute total length */
+		fail = 0;
+		cutlen = maxgel;
+		getext_(&cur_gel, cutbuf, &cutlen, &fail,
+			(int_fl)sizeof(cutbuf));
+		tgelend = gelend + cutlen;
+	    } else 
+		fail = 1;
+
+	    if (!fail && tgelend >= off) {
+
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "covered by gel %d\n", cur_gel);
+#endif
+		/*
+		 * now do a quick(?) alignment check to see if better than
+		 * any we've found before.
+		 *
+		 * we need to only align with as little as possible. That is
+		 * do not align further than 10 bases past the end of the
+		 * 'hole'. (10 is some arbitrary amount to account for the
+		 * insertion of padding characters).
+		 */
+		if (tgelend >= (tmp_f = off + plen + ALEXTRA)) {
+		    cutlen = tmp_f - gelend + 1;
+		    tgelend = gelend + cutlen;
+		}
+		if (tgelend > (tmp_f = off + plen)) {
+		    cutlen -= tgelend - tmp_f -1;
+		    tgelend = gelend + cutlen;		    
+		}
+
+		/*
+		 * align maximum possible and 'evaluate' alignment (walking
+		 * left to right along albuf[]).
+		 */
+		sc = mmalign(cutbuf, cutlen, &cons_p[gelend], cutlen,
+			     albuf);
+		len = evalal(cutbuf, &cons_p[gelend], albuf, cutlen,
+			     maxmis, missc, matchsc, padsc);
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "aligned %d bases @ %d = %d, eval = %d\n",
+			cutlen, gelend, sc, len);
+#endif
+		len2 = len + (int)gelend - (int)off;
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "overlap of %d (/%d)\n", len2, bestlen);
+#endif
+		/*
+		 * Pick best coverage length.
+		 * If equal pick gel with shortest extension.
+		 */
+		if (len2 > bestlen ||
+		    (len2 == bestlen && len < bestuse)) {
+		    bestlen = len2;
+		    bestuse = len;
+		    bestgeln = cur_gel;
+		    /* bestgels = ... */
+#if DEBUG_DSTRAND
+		    fprintf(DSTR, "New best gel! %d(%d)\n", bestgeln, bestlen);
+#endif
+		}
+	    }
+	    last_gel = cur_gel;
+	}
+	/* jump to next element in list */
+	cur_gel = rnbr_p[cur_gel];
+    } while (cur_gel != 0);
+
+    /*
+     * When we've got here, bestlen is the best overlap (upto max 10 more
+     * than the hole length) and bestuse is the amount of data needed to
+     * be extended for the gel (bestgeln).
+     */
+
+    /*
+     * find best gel to use - only bother if the extra data would save an
+     * experiment. This is when we either totally double strand the section
+     * or if we extend by approx the average gel reading length.
+     */
+    /*
+     * Currently cheating - use if over half the hold or longer than 20 bases.
+     */
+/*    if (bestgeln && (bestlen >= 20 || plen/bestlen < 2)) {*/
+#if DEBUG_DSTRAND
+    fprintf(DSTR, "bestgeln=%d, bestlen=%d, plen=%d\n",
+	    bestgeln, bestlen, plen);
+#endif
+    if (bestgeln && (bestlen >= 20 || bestlen >= plen)) {
+#if DEBUG_DSTRAND
+	fprintf(DSTR, "Shall use gel no. %d (len %d x=%d)\n",
+		bestgeln, bestlen, bestuse);
+#endif
+
+	if (bestgeln != cur_gel) {
+	    /* get cutoff data so we can compute total length */
+	    cutlen = maxgel;
+	    /* do we need to get 'cutlen' amount? why not 'bestuse'? */
+	    getext_(&bestgeln, cutbuf, &cutlen, &fail, (int_fl)sizeof(cutbuf));
+	}
+
+	/* align an extra couple chars just to allow for a tidy ending */
+	gelend = relpg_p[bestgeln] + lngthg_p[bestgeln];
+	mmalign(cutbuf, bestuse+2, &cons_p[gelend], bestuse+2, albuf);
+
+	/* temporary */
+#if DEBUG_DSTRAND
+	dispmm_(cutbuf, &bestuse, &cons_p[gelend], &bestuse, albuf, 0, 0);
+#endif
+
+	/* format data in a fashion that is easy to use */
+	newb1 = (char *)malloc(bestuse * 2 +1);
+	newb2 = (char *)malloc(bestuse * 2 +1);
+	dstrform(cutbuf, albuf, bestuse, newb1, newb2);
+#if DEBUG_DSTRAND
+	fprintf(DSTR, "'%s'\n'%s'\n\n", newb1, newb2);
+#endif
+
+/*
+	countdb += bestlen>plen?plen:bestlen;
+	printf("Double stranded %d bases at offset %d\n",
+	       bestlen>plen?plen:bestlen, off);
+*/
+	countdb += bestuse;
+	readn_(idevn_p, &bestgeln, gelname, (int_fl)(DB_NAMELEN));
+	Fstr2Cstr(gelname, sizeof(gelname), gelname, (int_fl)sizeof(gelname));
+
+	{
+	    int_f offset = relpg_p[bestgeln] + lngthg_p[bestgeln];
+
+	    if (sense == 1) /* negative direction */
+		offset = relpg_p[*lincon_p] - offset + 1;
+
+#if DEBUG_DSTRAND
+	    printf("Double stranded %s by %d base%s at offset %d (was %d)\n",
+		   gelname, bestuse, bestuse==1 ? "" : "s",
+		   offset, offset - consins);
+#else
+	    printf("Double stranded %s by %d base%s at offset %d\n",
+		   gelname, bestuse, bestuse==1 ? "" : "s",
+		   offset);
+#endif
+	    updout_();
+	}
+	bestlen = bestuse; /* Yuk! bestlen changes meaning here :-( */
+	for (tmp = 0; tmp<bestuse; tmp++) {
+	    if (newb2[tmp] == '*') {
+		/* pad in consensus */
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "Inserting pad (c) at offset %d\n", gelend+tmp);
+#endif
+		posn = gelend+tmp;
+		padcon_(&relpg_p[1], &lngthg_p[1], &lnbr_p[1], &rnbr_p[1],
+			ngels_p, nconts_p, gel_p, lincon_p, &posn, &one,
+			idbsiz_p, idevr_p, idevw_p, &maxgel, kbout_p, gel_l);
+		memmove(&cons_p[posn+1], &cons_p[posn], rreg-posn-1);
+
+		(*cons_e)++;
+	    }
+	    /*
+	     * At the same time we're computing how much extra to leave on
+	     * the cutoff data. (This occurs when we pad out the sequence
+	     * and hence have a longer sequence than before.
+	     */
+	    if (newb1[tmp] == '*')
+		bestlen--;
+	}
+
+	/* shrink the cutoff data - should ideally check the return. */
+	(void)modext((int)bestgeln, bestlen);
+
+	/*
+	 * Add our new end of sequence onto the existing one.
+	 * This requires reading the sequence ('w'orking version), 
+	 * adding onto the end, and writing it back. Similarly for
+	 * 'r'elationships.
+	 */
+	readw_(idevw_p, &bestgeln, gel_p, &maxgel, maxgel);
+	strncpy(gel_p + lngthg_p[bestgeln], newb1, (size_t)bestuse);
+	writew_(idevw_p, &bestgeln, gel_p, &maxgel, maxgel);
+	lngthg_p[bestgeln] += (int_f)bestuse;
+	writer_(idevr_p, &bestgeln, &relpg_p[bestgeln], &lngthg_p[bestgeln],
+		&lnbr_p[bestgeln], &rnbr_p[bestgeln]);
+
+	/* create the necessary tags for any pads in our extension. */
+	for (tmp = 0; tmp<bestuse; tmp++) {
+	    if (newb1[tmp] == '*') {
+#if DEBUG_DSTRAND
+		fprintf(DSTR, "Inserting pad (g) at offset %d\n", gelend+tmp);
+#endif
+		posn = gelend+tmp;
+		padtag_(&bestgeln, &posn, &one, &lngthg_p[bestgeln]);
+		countdb++;
+	    }
+	}
+
+	free(newb1);
+	free(newb2);
+    } else
+#if DEBUG_DSTRAND
+	fprintf(DSTR, "No suitable gel.\n");
+#endif
+
+    /* tidy up memory */
+    free(albuf);
+    free(cutbuf);
+
+    /*
+     * We could either return the length of the original single stranded
+     * section, or we could return the length of the amount we managed
+     * to double strand. (if any). In this case must make sure we do not
+     * return 0 for a failed patch and hence get into infinite loops.
+     * Taking the easy solution currently...
+     */
+    return plen;
+}
+
+/*----------------------------------------------------------------------*/
+
+#ifdef notdef
+/*
+ * Calculate the average length of 'used' data in the gel readings.
+ */
+int avggellen(int_f *lngthg_p, int_f *ngels_p) {
+    int_f i, len = 0;
+
+    for (i=0; i<*ngels_p; i++)
+	len += lngthg_p[i];
+    
+    return (int)(len / *ngels_p);
+}
+#endif
+
+/*----------------------------------------------------------------------*/
+
+/*
+ * dstrand_x() - attempts to double strand the single stranded segments of
+ * sequence within a contig. This is performed by analysing the 'cutoff' data
+ * and comparing it with the opposite strand. If a close match is found then
+ * we extend as far as possible.
+ */
+void dstrnd_(
+     int_f *relpg_p,	/* int array - relative position (gel reading) */
+			/*        or - length of contig */
+     int_f *lngthg_p,	/* int array - length of gel */
+			/*        or - empty for contig */
+     int_f *lnbr_p,	/* int array - left nodes */
+     int_f *rnbr_p,	/* int array - right nodes */
+     int_f *ngels_p,	/* int - number of gels */
+     int_f *nconts_p,	/* int - number of contigs */
+     char  *qual_p,	/* char array - quality of contig */
+     char  *cons_p,	/* char array - consensus of contig */
+     int_f *arr_l,	/* int - length of arrays */
+     int_f *lreg_p,	/* int - left start in qual */
+     int_f *rreg_p,	/* int - right start in qual */
+     int_f *llino_p,	/* int - left gel in contig */
+     int_f *lincon_p,	/* int - record no. of contig */
+     int_f *maxgel_p,	/* int - max length of a gel */
+     int_f *kbout_p,	/* int - passed back to fortran (output stream) */
+     int_f *idbsiz_p,	/* int - passed back to fortran */
+     char  *gel_p,	/* char array - passed back to fortran (tmp buf) */
+     int_f *idevr_p,	/* int - passed back to fortran (relations db) */
+     int_f *idevw_p,	/* int - passed back to fortran (working db) */
+     int_f *idevn_p,	/* int - passed back to fortran (name db) */
+     int_f *sense_p,	/* int - direction of contig (0=pos, 1=neg) */
+	     		/* following four used in alignment evaluation */
+     int_f *maxmis_p,	/* int - maximum number of mismatches */
+     int_f *missc_p,	/* int - score for mismatch */
+     int_f *matchsc_p,	/* int - score of match */
+     int_f *padsc_p,	/* int - score for pad */
+     int_fl qual_l,	/* int - length of qual_p */
+     int_fl cons_l,	/* int - length of cons_p */
+     int_fl gel_l	/* int - length of gel_p */
+     )
+{
+    register int_f i, j;		/* loop variables */
+    int_f rreg = *rreg_p;	/* faster copy of *rreg_p */
+    int cons_e;
+    static int countdbt, consinst;
+
+    if (*sense_p == 0)
+	countdbt =  consinst = 0;
+    countdb = consins = 0;
+
+    /*
+     * Reset array pointers so our indexes correspond to FORTRANs ones
+     */
+    relpg_p--;
+    lngthg_p--;
+    lnbr_p--;
+    rnbr_p--;
+    qual_p--;
+    cons_p--;
+
+#if DEBUG_DSTRAND
+    fprintf(DSTR,"\nllreg = %d, rreg = %d\n", *lreg_p, rreg);
+#endif
+    /* initialise remembered last gel to left most one of this contig */
+    last_gel = *llino_p; 
+
+    /* scan through quality buffer */
+    for (i = *lreg_p; i<=rreg; i++) {
+
+	/* strong negative strand, but no positive strand */
+	if (qual_p[i] == '2' || qual_p[i] == '8') {
+
+	    /* find length of single stranded section */
+	    j = i;
+	    while(qual_p[j] == '2' || qual_p[j] == '8')
+		j++;
+
+	    if (j > rreg)
+		j = rreg+1;
+
+#ifdef ndef
+	    /*
+	     * We have a minimum length of single strand to patch.
+	     * This saves us having to do too many checks on strands
+	     * with padding characters in etc.
+	     */
+	    if ((j-i) < MINHOLELEN) {
+		i = j-1;
+		continue;
+	    }
+#endif
+#if DEBUG_DSTRAND
+	    fprintf(DSTR, "Single strand at %d - %d, (was %d - %d) len %d\n",
+		    i, j, i-consins, j-consins, j-i);
+#endif
+
+	    /*
+	     * Perform the 'operation' on the contig.
+	     * And leap forward to next potential problem.
+	     */
+	    cons_e = 0;
+	    /*
+	     * We pass over 'j-i+1' as the length due to an (as yet) unfound
+	     * 'feature'. For some reason it appears that the actual length
+	     * double stranded is not always the same as the amount we asked
+	     * for. So just to make sure we don't leave any 1 base gaps we
+	     * cheat a bit. Note: this bug appears to be indeterminate in that
+	     * runnin double strand twice on the same data with the same args
+	     * doesn't always do the same thing! (jkb 23/12/92)
+	     */
+	    j = patch(relpg_p, lngthg_p, lnbr_p, rnbr_p, *maxgel_p, i, j-i+1,
+		      cons_p, ngels_p, kbout_p, idbsiz_p, gel_p, idevr_p,
+		      idevw_p, idevn_p, lincon_p, nconts_p, &cons_e, gel_l,
+		      rreg, *maxmis_p, *missc_p, *matchsc_p, *padsc_p,
+		      *sense_p);
+	    /* move back quality buffer to ensure alignment with consensus */
+#if DEBUG_DSTRAND
+	    fprintf(DSTR, "Inserted %d pads into consensus\n", cons_e);
+#endif
+	    
+	    /*
+	     * Take into account number of additional consensus entries.
+	     * This involves shifting our consensus buffer along to the
+	     * right by a bit, and changing our right margin (rreg) for the
+	     * region.
+	     */
+	    /* qual_p -= cons_e;*/
+	    memmove(&qual_p[i+cons_e], &qual_p[i], rreg-i);
+	    rreg += cons_e;
+	    consins += cons_e;
+
+	    /* Skip over the hole we just patched (& take account of cons_e) */
+	    i += j + cons_e;
+	}
+    }
+
+    countdbt += countdb;
+    consinst += consins;
+    printf("%s strand : double stranded %d base%s with %d insert%s into consensus\n",
+	   *sense_p?"Negative":"Positive",
+	   countdb, countdb==1 ? "" : "s",
+	   consins, consins==1 ? "" : "s");
+    if (*sense_p)
+	printf("Total : double stranded %d bases with %d inserts\n",
+	       countdbt, consinst);
+    updout_();
+}
+
+void dblint_(int_f *status_p,  int_f *maxmis_p, int_f *missc_p,
+	     int_f *matchsc_p, int_f *padsc_p,  int_f *dialogue_p) {
+
+    if (*dialogue_p) {
+	*maxmis_p = getint(0, 99, 5, "Maximum number of mismatches", status_p);
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+	
+	*missc_p = getint(-100, 0, -8, "Score for mismatch", status_p);
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+	
+	*matchsc_p = getint(0, 100, 1, "Score for correct match", status_p);
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+	
+	*padsc_p = getint(-100, 0, -5, "Score for insertion", status_p);
+	if (*status_p < 0)
+	    *status_p = -1;
+
+	*status_p = 0;
+	return;
+    } else {
+	*maxmis_p = 6;
+	*missc_p = -8;
+	*matchsc_p = 1;
+	*padsc_p = -5;
+	*status_p = 0;
+	return;
+    }
+}
diff --git a/src/bap/dstrand.h b/src/bap/dstrand.h
new file mode 100644
index 0000000..376d166
--- /dev/null
+++ b/src/bap/dstrand.h
@@ -0,0 +1,4 @@
+#define MINHOLELEN (5)	/* minimum single stranded size to bother with	*/
+#define ALEXTRA (10)	/* no. extra bases to align with past hole end	*/
+#define DSTR stderr
+
diff --git a/src/bap/edMenu.c b/src/bap/edMenu.c
new file mode 100644
index 0000000..ba35f65
--- /dev/null
+++ b/src/bap/edMenu.c
@@ -0,0 +1,145 @@
+/*
+** Menu driver for xdap contig editor
+**
+**
+** To add new menu entries:
+**    1. Add a new option name to menuOptions.
+**    2. Create a new ed_menu_option line, providing the string to appear on
+**       menu, and menu option.
+**    3. Add a call to your interface routine to the switch statement in
+**       MenuSelectCallBack.
+**    4. That's all
+*/
+
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+#include <X11/Xaw/SmeLine.h>
+
+#include "edMenu.h"
+#include "edUtils.h"
+#include "contigEditor.h"
+
+/* ---- Types ---- */
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+enum menuOptions {
+    OptLine,
+    OptSearchGeneric,
+    OptSaveContig,
+    OptCreateTag,
+    OptEditTag,
+    OptDeleteTag,
+    OptShowDifferences,
+    OptSelectOligo,
+    OptDumpContig
+};
+
+static MenuData ed_menu_options[] =
+{
+    {"Search",                                 OptSearchGeneric},
+    {"Highlight Disagreements",                OptShowDifferences},
+    {"",                                       OptLine},
+    {"Save Contig",                            OptSaveContig},
+    {"Dump Contig to File",                    OptDumpContig},
+    {"",                                       OptLine},
+    {"Create Tag",                             OptCreateTag},
+    {"Edit Tag",                               OptEditTag},
+    {"Delete Tag",                             OptDeleteTag},
+    {"Select Oligo",                           OptSelectOligo},
+};
+
+
+
+
+
+
+
+
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(XtParent(w))));
+
+    switch ( (int) i ) {
+        case OptSaveContig: 
+	    saveDB(
+		xx,
+		saveState.idevr,
+		saveState.idevw,
+		saveState.idevn,
+		saveState.relpg,
+		saveState.lngthg,
+		saveState.lnbr,
+		saveState.rnbr,
+		saveState.maxgel
+	    );
+	    break;
+        case OptCreateTag:
+            createTag(xx);
+	    break;
+        case OptEditTag:
+            editTag(xx);
+	    break;
+        case OptDeleteTag:
+            deleteTag(xx);
+	    break;
+        case OptSearchGeneric:
+	    invokeSearchGeneric(xx);
+	    break;
+	case OptShowDifferences:
+	    xx->showDifferences ^= 1;
+	    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	    break;
+	case OptSelectOligo:
+	    invokeOligo(xx);
+	    break;
+	case OptDumpContig:
+	    dumpContig(xx);
+	    break;
+	default:
+	    break;
+    }
+}
+
+
+Widget createEdMenu(Widget parentWid)
+{
+    Widget menuWid;
+    int i;
+
+    Cardinal num_md = XtNumber(ed_menu_options);
+    /*
+      Create the menu parent widget
+    */
+    menuWid = XtCreatePopupShell("edMenu", simpleMenuWidgetClass, parentWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {
+        if (ed_menu_options[i].number == OptLine) {
+            (void) XtCreateManagedWidget("line",  smeLineObjectClass, menuWid, NULL, (Cardinal)0);
+	} else {
+            Widget entryWid = XtCreateManagedWidget(ed_menu_options[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	    XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) ed_menu_options[i].number);
+	}
+    }
+
+    return menuWid;
+
+}
+
diff --git a/src/bap/edUtils.c b/src/bap/edUtils.c
new file mode 100644
index 0000000..33b6e0d
--- /dev/null
+++ b/src/bap/edUtils.c
@@ -0,0 +1,2285 @@
+/*
+    Title: 	 edUtils
+
+    File: 	 edUtils.c
+    Purpose:	 C language utilities for contig editor
+    Last update: 5 April 1993
+*/
+
+
+/*
+  9/7/91  SD Added Trace Title field to call manageTrace
+  26/7/91 SD Tightened up editing options:
+             1. Allow only insertion and deletion of asterisks from consensus
+	        This means no replacement. This CANNOT be overridden
+	     2. Do not allow edits when cursor is not visible on the screen
+  28/7/91 SD Use FILE_NAME_LENGTH for trace file name length in showTrace
+  1/10/91 SD Removed static from calculateConsensusLength
+             Removed LLINOL and LLINOR from call to dojoin_
+
+  28/4/92 SD General Speedup
+  20/8/92 SD save state variable for returning what happened during edit/join
+  01/03/93 SD New dump contig command
+  05/04/93 JKB Check for editorState (disable edits after 'Leave Editor').
+
+*/
+
+
+#define CHKPNT(W,S) \
+    { XSync(XtDisplay(W),False); fprintf(stderr,"CHKPNT: %s\n",S); }
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>
+#include <ctype.h>
+#include <stdlib.h>
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/AsciiSrc.h>
+#include <X11/Xlib.h>   /* IMPORT: XBell */
+#include "Sheet.h"
+
+#include "edUtils.h"
+#include "contigEditor.h"
+#include "tman_main.h"
+#include "main.h"
+#include "select.h"
+#include "tagUtils.h"
+#include "undo.h"
+#include "extend.h"
+#include "fortran.h"
+#include "fort.h"
+#include "misc.h"
+
+
+/* ------ External variables ------- */
+extern int_f save_state;
+extern enum States {StateDown=0,StateUp} editorState;
+
+/*
+** Static variables
+** defining the state of the contig editor
+*/
+#define MAXEDSTATES 2
+static int EdStructCount = 0;
+static EdStruct edstate[MAXEDSTATES] = {
+    {
+	/* oldMax             */ 0,
+	/* DB_flags           */ DB_ACCESS_UPDATE |
+				 DB_DELAYED_READ |
+				 DB_DATA_TYPE_DNA,
+	/* DB_gelCount        */ 0,
+	/* DB_contigNum       */ 0,
+	/* DBlist             */ NULL,
+	/* DBorder            */ NULL,
+	/* DB                 */ NULL,
+	/* displayPos         */ 1,
+	/* displayWidth       */ DEFAULT_DISPLAY_WIDTH,
+	/* displayHeight      */ 0,
+	/* cursorPos          */ 1,
+	/* cursorSeq          */ 0,
+	/* rulerDisplayed     */ 1,
+	/* consensusDisplayed */ 1,
+	/* fontWidth          */ 0,
+	/* fontHeight         */ 0,
+        /* edWid              */ NULL,
+	/* namesWid           */ NULL,
+	/* sequencesWid       */ NULL,
+	/* scrollButtonsWid   */ NULL,
+	/* sliderWid          */ NULL,
+	/* displayedConsensus */ "",
+	/* select_made        */ 0,
+	/* select_seq         */ 0,
+	/* select_start_pos   */ 0,
+	/* select_end_pos     */ 0,
+	/* select_tag         */ NULL,
+	/* reveal_cutoffs     */ 0,
+	/* showDifferences    */ 0
+    },
+    {
+	/* oldMax             */ 0,
+	/* DB_flags           */ DB_ACCESS_UPDATE |
+				 DB_DELAYED_READ |
+				 DB_DATA_TYPE_DNA,
+	/* DB_gelCount        */ 0,
+	/* DB_contigNum       */ 0,
+	/* DBlist             */ NULL,
+	/* DBorder            */ NULL,
+	/* DB                 */ NULL,
+	/* displayPos         */ 1,
+	/* displayWidth       */ DEFAULT_DISPLAY_WIDTH,
+	/* displayHeight      */ 0,
+	/* cursorPos          */ 1,
+	/* cursorSeq          */ 0,
+	/* rulerDisplayed     */ 1,
+	/* consensusDisplayed */ 1,
+	/* fontWidth          */ 0,
+	/* fontHeight         */ 0,
+        /* edWid              */ NULL,
+	/* namesWid           */ NULL,
+	/* sequencesWid       */ NULL,
+	/* scrollButtonsWid   */ NULL,
+	/* sliderWid          */ NULL,
+	/* displayedConsensus */ "",
+	/* select_made        */ 0,
+	/* select_seq         */ 0,
+	/* select_start_pos   */ 0,
+	/* select_end_pos     */ 0,
+	/* select_tag         */ NULL,
+	/* reveal_cutoffs     */ 0,
+	/* showDifferences    */ 0
+    }
+};
+
+static void bell()
+/*
+** Make a silly beep
+*/
+{
+    XBell (GetDisplay(),100);
+}
+
+
+
+void calculateConsensusLength(EdStruct *xx)
+/*
+** Calculate dynamic consensus length
+*/
+{
+    int sequenceEnd,i;
+
+    DBsetLength(xx,0,0);
+    for (i=1; i <= xx->DB_gelCount; i++) {
+        sequenceEnd = DBgetRelPos(xx,i) + DBgetLength(xx,i) -1;
+	if (DBgetLength(xx,0) < sequenceEnd) DBsetLength(xx,0,sequenceEnd);
+    }
+}
+
+EdStructPtr getFreeEdStruct()
+/*
+** Get the next free EdStruct
+*/
+{
+    if (EdStructCount == MAXEDSTATES)
+	return NULL;
+    else
+	return &edstate[EdStructCount++];
+}
+
+EdStructPtr intToEdStruct(int i)
+/*
+** Given an integer,
+** return the corresponding edit structure
+*/
+{
+    return &edstate[i];
+}
+
+EdStructPtr widgetToEdStruct(Widget w)
+/*
+** Given a widget,
+** return the corresponding edit structure
+*/
+{
+    int i;
+    if (w==NULL)
+	return NULL;
+    else
+	for (i=0;i<EdStructCount;i++)
+	    if (w==edstate[i].edWid)
+		return &edstate[i];
+    return NULL;
+}
+
+/* ---- Private Functions ---- */
+
+/************************************************/
+
+
+
+
+char *DBgetSeq(EdStruct *xx, int seq)
+/*
+** Force reading in the sequence for seq
+*/
+{
+    int i;
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_SEQ_IN_MEMORY)
+	return DB_Seq(xx,seq);
+
+    /* allocate memory */
+    if ((DBsetSeq(xx,seq,(char *) malloc((int)*saveState.maxgel)))==NULL)
+	return NULL;
+
+    /* force reading */
+    i = DBgetNumber(xx,seq);
+    readw_(saveState.idevw,&i,DB_Seq(xx,seq),saveState.maxgel,*saveState.maxgel);
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_IN_MEMORY);
+
+    return DB_Seq(xx,seq);
+}
+
+
+
+
+
+tagStruct *DBgetTags (EdStruct *xx, int seq)
+/*
+** Force tags into memory
+*/
+{
+    int i;
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_TAG_IN_MEMORY)
+	return (tagStruct *) DB_Tags(xx,seq);
+
+    /* read in tag list */
+    i = DBgetNumber(xx,seq);
+    DBsetTags(xx,seq,readTagList(i));
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_IN_MEMORY);
+
+    return (tagStruct *) DB_Tags(xx,seq);
+}
+
+
+
+char *DBgetName(EdStruct *xx, int seq)
+/*
+** Force reading in the sequence for seq
+*/
+{
+    int i;
+    char buf[NAMELEN+1];
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_NAME_IN_MEMORY)
+	return DB_Name(xx,seq);
+
+    /* allocate memory */
+    if ((DBsetName(xx,seq,(char *) malloc(sizeof(char)*(NAMELEN+1))))==NULL)
+	return NULL;
+
+    /* force reading */
+    i = DBgetNumber(xx,seq);
+    readn_(saveState.idevn,&i,buf,DB_NAMELEN); buf[DB_NAMELEN]='\0';
+    sprintf(DB_Name(xx,seq),"%*d %-*s",
+	    DB_GELNOLEN,(DBgetComp(xx,seq)==COMPLEMENTED)?-DBgetNumber(xx,seq):DBgetNumber(xx,seq),
+	    DB_NAMELEN, buf);
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_NAME_IN_MEMORY);
+
+    return DB_Name(xx,seq);
+}
+
+
+
+
+int initialiseDB(
+	EdStruct *xx,
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *llino	/* left-most gel in contig */
+	)
+/*
+** Create an internal database and
+** read all relevant data into it
+*/
+{
+    int i,c;
+
+    /*
+    ** Reset things
+    */
+    xx->showDifferences = 0;
+
+    /*
+    ** determine gel number
+    */
+    {
+	int dummy,numContigs;
+	int firstGel;
+
+	readr_(idevr,idbsiz,&dummy,&numContigs,&dummy,&dummy);
+	xx->DB_contigNum = (int)*idbsiz-numContigs;
+	readr_(idevr,&xx->DB_contigNum,&dummy,&dummy,&firstGel,&dummy);
+	while (xx->DB_contigNum < (int)*idbsiz && firstGel!=(int)*llino) {
+	    xx->DB_contigNum++;
+	    readr_(idevr,&xx->DB_contigNum,&dummy,&dummy,&firstGel,&dummy);
+	}
+
+    }
+
+    /*
+    ** count number of gels in contig
+    */
+    for (xx->DB_gelCount=1,i=(int) *llino;
+         xx->DB_gelCount<(int)*idbsiz && (int)rnbr[i-1];
+         xx->DB_gelCount++,i=(int)rnbr[i-1]);
+
+    if ((xx->DB = (DBStruct *) calloc (xx->DB_gelCount+1, sizeof(DBStruct)))==NULL)
+	goto disaster;
+    if ((xx->DBlist = (int *)  calloc (xx->DB_gelCount+1, sizeof(int) ))==NULL)
+	goto disaster;
+    if ((xx->DBorder = (int *) calloc (xx->DB_gelCount+1, sizeof(int) ))==NULL)
+	goto disaster;
+
+    /*
+    ** read information into local database
+    */
+    for (c=1,i=(int) *llino;
+         c<(int)*idbsiz && i;
+         c++,i=(int)rnbr[i-1]) {
+
+        DBsetRelPos(xx,c,relpg[i-1]);
+	DBsetLength(xx,c,abs(lngthg[i-1]));
+	DBsetComp(xx,c,(lngthg[i-1]<0)?-1:1);
+	DBsetNumber(xx,c,i);
+        DBsetFlags(xx,c,DB_FLAG_NONE);
+
+	xx->DBorder[c] = c;
+
+	if (xx->DB_flags & DB_STORAGE_INTERNAL) {
+	    if (DBgetSeq(xx,c)==NULL) goto disaster;
+	    if (DBgetName(xx,c)==NULL) goto disaster;
+	    (void)DBgetTags(xx,c);
+	}
+
+    }
+    /*
+    ** Set up consensus
+    */
+    {
+        DBsetRelPos(xx,0,1);
+	DBsetComp(xx,0,UNCOMPLEMENTED);
+	calculateConsensusLength(xx);
+        if ((DBsetSeq(xx,0,(char *) malloc(MAX_DISPLAY_WIDTH)))==NULL)
+	    goto disaster;
+	if ((DBsetName(xx,0,(char *) malloc(sizeof(char)*(NAMELEN+1))))==NULL)
+	    goto disaster;
+	sprintf(DB_Name(xx,0),"%*s %-*s",
+                  DB_GELNOLEN," ",
+                  DB_NAMELEN, "CONSENSUS");
+        xx->DBorder[0] = 0;
+    }
+
+    return 0;
+
+disaster:
+
+    freeDB(xx);
+    return 1;
+}
+
+void freeDB(EdStruct *xx)
+/*
+** Free an internal database
+*/
+{
+    int i;
+
+    if (xx->DB != NULL)
+        for (i=0; i <= xx->DB_gelCount; i++) {
+	    free(DB_Name(xx,i));
+            free(DB_Seq(xx,i));
+	    destroyTagList(DB_Tags(xx,i));
+        }
+
+    free(xx->DB);
+    free(xx->DBlist);
+    free(xx->DBorder);
+
+    xx->DB = NULL;
+    xx->DBlist = NULL;
+    xx->DBorder = NULL;
+}
+
+
+void saveDB(
+	EdStruct *xx,
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel	/* maximum length of gel */
+    )
+/*
+** Save an internal database
+*/
+{
+    int i;
+    int N,leftN,rightN;
+    int flag;
+
+    for (i=1; i<=xx->DB_gelCount; i++) {
+	/*
+	** update relationships
+	*/
+	N = DBgetNumber(xx,xx->DBorder[i]);
+	relpg[N-1]  = DBgetRelPos(xx,xx->DBorder[i]);
+	lngthg[N-1] = (DBgetComp(xx,xx->DBorder[i])==COMPLEMENTED)?-DBgetLength(xx,xx->DBorder[i]):DBgetLength(xx,xx->DBorder[i]);
+	if (i==1)
+	    leftN = 0;
+	else
+	    leftN = DBgetNumber(xx,xx->DBorder[i-1]);
+	if (i==xx->DB_gelCount)
+	    rightN = 0;
+	else
+	    rightN = DBgetNumber(xx,xx->DBorder[i+1]);
+	lnbr[N-1] = leftN;
+	rnbr[N-1] = rightN;
+	writer_(idevr,&N,&relpg[N-1],&lngthg[N-1],&lnbr[N-1],&rnbr[N-1]);
+
+	flag = DBgetFlags(xx,xx->DBorder[i]);
+	/*
+	** update working versions
+	*/
+	if (flag&(DB_FLAG_SEQ_IN_MEMORY|DB_FLAG_SEQ_MODIFIED))
+	    writew_(idevw,&N,DB_Seq(xx,xx->DBorder[i]),maxgel,*maxgel);
+	/*
+	** update tag list
+	*/
+	if (flag&(DB_FLAG_TAG_IN_MEMORY|DB_FLAG_TAG_MODIFIED)) {
+            writeTagList(xx, xx->DBorder[i]);
+            destroyTagList(DB_Tags(xx,xx->DBorder[i]));
+            DBsetTags(xx,xx->DBorder[i],readTagList(DBgetNumber(xx,xx->DBorder[i])));
+        }
+
+        /*
+        ** Preserve only ``sequence in memory flag''
+        */
+	DBsetFlags(xx,xx->DBorder[i],
+		   flag&(DB_FLAG_SEQ_IN_MEMORY ||
+			 DB_FLAG_NAME_IN_MEMORY ||
+			 DB_FLAG_TAG_IN_MEMORY));
+
+    }
+
+    /*
+    ** update contig relationships
+    */
+    calculateConsensusLength(xx);
+    relpg[xx->DB_contigNum-1]  = DBgetLength(xx,0);
+    lngthg[xx->DB_contigNum-1] = 0;
+    lnbr[xx->DB_contigNum-1] = DBgetNumber(xx,xx->DBorder[1]);
+    rnbr[xx->DB_contigNum-1] = DBgetNumber(xx,xx->DBorder[xx->DB_gelCount]);
+    writer_(idevr,
+	    &xx->DB_contigNum,
+	    &relpg[xx->DB_contigNum-1],
+	    &lngthg[xx->DB_contigNum-1],
+	    &lnbr[xx->DB_contigNum-1],
+	    &rnbr[xx->DB_contigNum-1]
+	    );
+    flushl_(idevr);
+    flushl_(idevw);
+    flushl_(idevn); /* not needed really */
+    flushl_(&devils_.idevt);
+    flushl_(&devils_.idevc);
+    
+    cleanUpStack(&bucketStack);
+
+    /* update save state */
+    if(xx == intToEdStruct(0))
+	save_state |= (int_f)1;
+    else
+	save_state |= (int_f)2;
+
+
+}
+
+void joinDB(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *llinol,
+	int_f *lnconl,
+	int_f *llinor,
+	int_f *lnconr,
+	int_f *ngels,
+	int_f *nconts,
+	int_f *idbsiz
+    )
+{
+    int_f relx;
+    int i;
+
+    /*
+    ** Save an internal databases
+    */
+    for (i=0;i<2;i++)
+        saveDB(
+	    intToEdStruct(i),
+	    idevr,
+	    idevw,
+	    idevn,
+	    relpg,
+	    lngthg,
+	    lnbr,
+	    rnbr,
+	    maxgel
+        );
+    relx = editorLockedPos(1/*force*/);
+    if (relx<0) {
+	relx = 1-relx;
+        dojoin_(relpg,lngthg,lnbr,rnbr,ngels,nconts,lnconl,lnconr,
+                idbsiz,idevr,idevw,&relx);
+    } else {
+	relx = 1+relx;
+        dojoin_(relpg,lngthg,lnbr,rnbr,ngels,nconts,lnconr,lnconl,
+                idbsiz,idevr,idevw,&relx);
+    }
+
+    save_state |= (int_f)4;
+
+}
+
+
+static int linesInRegion(EdStruct *xx, int pos, int width)
+/*
+** Return number of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ;
+	 i<=xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i])<(pos+width) ;
+	 i++) {
+        if (DBgetRelPos(xx,xx->DBorder[i])+DBgetLength(xx,xx->DBorder[i])>pos &&
+	    DBgetLength(xx,xx->DBorder[i])) count++;
+    }
+    count += xx->consensusDisplayed;
+
+    return count;
+
+}
+
+
+static int linesOnScreen (EdStruct *xx, int pos, int width)
+/*
+** Return number of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount ; i++) {
+	int relPos, length;
+	int len_lcut, len_rcut;
+
+	if (xx->reveal_cutoffs) {
+	    len_lcut = lenLCut(xx,xx->DBorder[i]);
+	    len_rcut = lenRCut(xx,xx->DBorder[i]);
+	} else
+	    len_lcut = len_rcut = 0;
+	relPos = DBgetRelPos(xx,xx->DBorder[i]) - len_lcut;
+	length = DBgetLength(xx,xx->DBorder[i]) + len_lcut + len_rcut;
+        if (relPos < pos+width && relPos+length>pos)
+	    count++;
+    }
+    count += xx->consensusDisplayed;
+
+    return count;
+}
+
+
+
+#ifdef notdef
+static int maxLinesOnScreen_x (EdStruct *xx, int width)
+/*
+** Given width, return maximum number of lines on screen
+*/
+{
+    int i,max,linesAtX,X;
+
+    max = linesOnScreen(xx,1,width);
+    for (i=1 ; i <= xx->DB_gelCount ; i++) {
+        X=DBgetRelPos(xx,i)-width+1;
+	if (X<=1) continue;
+	linesAtX=linesOnScreen(xx,X,width);
+	if (linesAtX>max) max = linesAtX;
+    }
+    max += xx->consensusDisplayed;
+
+    return max;
+}
+#endif
+
+int *sequencesInRegion(EdStruct *xx,int pos, int width)
+/*
+** Return a pointer to list of sequences in region of contig
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i])<(pos+width) ; i++) {
+        if (DBgetRelPos(xx,xx->DBorder[i])+DBgetLength(xx,xx->DBorder[i])>pos && DBgetLength(xx,xx->DBorder[i]))
+	    xx->DBlist[count++]=xx->DBorder[i];
+    }
+    if (xx->consensusDisplayed) xx->DBlist[count++] = 0;
+
+    return xx->DBlist;
+
+}
+
+
+int *sequencesOnScreen(EdStruct *xx,int pos, int width)
+/*
+** Return a pointer to list of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount ; i++) {
+	int relPos, length;
+	int len_lcut, len_rcut;
+
+	if (xx->reveal_cutoffs) {
+	    len_lcut = lenLCut(xx,xx->DBorder[i]);
+	    len_rcut = lenRCut(xx,xx->DBorder[i]);
+	} else
+	    len_lcut = len_rcut = 0;
+	relPos = DBgetRelPos(xx,xx->DBorder[i]) - len_lcut;
+	length = DBgetLength(xx,xx->DBorder[i]) + len_lcut + len_rcut;
+        if (relPos < pos+width && relPos+length>pos)
+	    xx->DBlist[count++]=xx->DBorder[i];
+    }
+    if (xx->consensusDisplayed) xx->DBlist[count++] = 0;
+
+    return xx->DBlist;
+
+}
+
+void DBgetSequence(EdStruct *xx, int seq, int pos, int width, char *str)
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+{
+    char *src;
+    int length = DBgetLength(xx,seq);
+    int i;
+
+    src = DBgetSeq(xx,seq);
+
+    /* Lefthand cut off */
+    if (pos<0) {
+	i = (width<-pos)?width:-pos;
+	getLCut(xx,seq, -pos, i, str);
+    } else
+	i=0;
+    
+    /*copy sequence*/
+    for (;
+	 i<width && (pos+i)<length;
+	 i++) {
+	str[i]=src[pos+i]; 
+    }
+    
+    /* Righthand cut off */
+    if (i<width) {
+	getRCut(xx,seq, pos+i-length, width-i, &str[i]);
+    }
+    
+    str[width]='\0';
+
+}
+
+
+
+void DBcalcConsensus (EdStruct *xx,int pos, int width, char *str, int mode)
+/*
+** calculate the consensus for position `pos' in contig,
+** for `width' characters
+*/
+{
+    /*
+    ** When calculating the consensus,
+    **   handle DNA differently from protein
+    ** switch (xx->DB_flags & DB_DATA_TYPE) {
+    **     case DB_DATA_TYPE_DNA:     do dna stuff
+    **     case DB_DATA_TYPE_PROTEIN: do protein stuff
+    **     default : do dna stuff???
+    ** }
+    ** BASES = 6 for dna, 26 for protein (specified by fortran variable IDM?)
+    */
+    int_f scores[MAX_DISPLAY_WIDTH][BASES];
+    int_f index,score;
+    int_f i,j,k;
+    int_f seven=BASES;
+    int jlow,klow,jhigh /* ,khigh */;
+    int chunk; /* do in chucks of MAX_DISPLAY_WIDTH */
+    int cwidth;
+
+    for (chunk=0; chunk<width; chunk+=MAX_DISPLAY_WIDTH) {
+	cwidth = width - chunk;
+	if (cwidth>MAX_DISPLAY_WIDTH) cwidth=MAX_DISPLAY_WIDTH;
+	/*now do for pos+chunk to pos+chunk+cwidth*/
+
+	for (i=0;i<MAX_DISPLAY_WIDTH;i++)
+	    for (j=0;j<BASES;j++) scores[i][j] = 0;
+
+	for (i=1 ;
+	    i<=xx->DB_gelCount &&
+	    DBgetRelPos(xx,xx->DBorder[i])<(pos+chunk+cwidth) ;
+	    i++)
+	{
+	    if (DBgetRelPos(xx,xx->DBorder[i]) +
+		DBgetLength(xx,xx->DBorder[i]) > (pos+chunk) &&
+		DBgetLength(xx,xx->DBorder[i]) &&
+		(mode == BOTH_STRANDS ||
+		 mode == DBgetComp(xx,xx->DBorder[i])))
+	    {
+		if (DBgetRelPos(xx,xx->DBorder[i]) < (pos+chunk))
+		{
+		    klow = (pos+chunk) - DBgetRelPos(xx,xx->DBorder[i]);
+		    jlow = 0;
+		}
+		else
+		{
+		    klow = 0;
+		    jlow = DBgetRelPos(xx,xx->DBorder[i]) - (pos+chunk);
+		}
+
+		if (DBgetRelPos(xx,xx->DBorder[i]) +
+		    DBgetLength(xx,xx->DBorder[i]) > pos+chunk+cwidth)
+		{
+		    /*khigh = klow + cwidth;*/
+		    jhigh = cwidth;
+		}
+		else
+		{
+		    /*khigh = DBgetLength(xx,xx->DBorder[i]);*/
+		    jhigh = jlow + DBgetLength(xx,xx->DBorder[i]) - klow;
+		}
+
+		{
+		    char s[MAX_DISPLAY_WIDTH+1];
+		    DBgetSequence(xx,xx->DBorder[i],klow,jhigh-jlow,s);
+		    for (j=jlow,k=0; j<jhigh; j++,k++)
+		    {
+			if (*saveState.idm == 26) {
+			    pcon1_(&s[k],&scores[j][0]);
+			} else {
+			    index = indexs_(&s[k],&score);
+			    scores[j][index-1] += score;
+			    /* No longer needed
+			    scores[j][BASES-1] += score;
+			    */
+			}
+		    }
+		}
+
+	    }
+	}
+
+	for (i=0;i<cwidth;i++){
+	    if (*saveState.idm == 26) 
+		munotp_(&str[chunk+i],(int_fl)1,&scores[i][0]);
+	    else
+		gtconc_(&str[chunk+i],(int_fl)1,&scores[i][0],&seven,&pcCut);
+	}
+    }
+    str[width]='\0';
+    
+}
+
+
+
+int positionInContig(EdStruct *xx, int seq, int pos)
+/*
+** returns relative position in a sequence as an 
+** absolute position in the contig
+*/
+{
+    return DBgetRelPos(xx,seq) + pos - 1;
+}
+
+static int onScreen (EdStruct *xx, int seq, int pos)
+/*
+** returns true if base in `seq' at position `pos' is currently
+** being displayed on screen 
+*/
+{
+    int posInContig;
+
+    posInContig = positionInContig(xx,seq,pos);
+    return (posInContig >= xx->displayPos &&
+            posInContig < xx->displayPos + xx->displayWidth );
+}
+
+
+#ifdef notdef
+static void getJoinedData(EdStruct *xx, int *leftPos, int* rightPos)
+/*
+** get information about relative positions of two joined contigs
+*/
+{
+    int offset = editorLockedPos(0/*don't force recalculation*/);
+    EdStruct *otherxx;
+
+    otherxx = intToEdStruct(0);
+    if (otherxx==xx) {
+	otherxx = intToEdStruct(1);
+        *leftPos = min(1,1-offset);
+	*rightPos = max(DBgetLength(xx,0),DBgetLength(otherxx,0)-offset);
+    } else {
+        *leftPos = min(1,1+offset);
+	*rightPos = max(DBgetLength(xx,0),DBgetLength(otherxx,0)+offset);
+    }
+}
+#endif /*notdef*/
+
+#ifdef notdef
+static int getJoinedLength(EdStruct *xx)
+/*
+** Get the combined length of the two joined contigs
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return rightPos - leftPos;
+}
+
+static int getJoinedLeftPos(EdStruct *xx)
+/*
+** Get the position (relative to xx) of the leftmost base in
+** the joined contig
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return leftPos;
+}
+
+static int getJoinedRightPos(EdStruct *xx)
+/*
+** Get the position (relative to xx) of the rightmost base in
+** the joined contig
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return rightPos;
+}
+#endif
+
+static void extents(EdStruct *xx, int *left, int *right)
+/*
+** Get maximum extents of sequence, taking into account cutoffs.
+*/
+{
+
+    if (xx->reveal_cutoffs) {
+	int eleft, eright;
+	int i;
+
+	eleft = eright = 0;
+	
+	for (i=1; i<=xx->DB_gelCount ; i++) {
+	    int thisleft,thisright;
+
+	    thisleft = DBgetRelPos(xx,xx->DBorder[i]) - lenLCut(xx,xx->DBorder[i]);
+	    thisright =  DBgetRelPos(xx,xx->DBorder[i]) + DBgetLength(xx,xx->DBorder[i]) + lenRCut(xx,xx->DBorder[i]) - 1;
+
+	    if (eleft > thisleft)
+		eleft = thisleft;
+	    if (eright < thisright)
+		eright = thisright;
+	}
+
+	*left = eleft;
+	*right = eright;
+
+    } else {
+	*left = 1;
+	*right = DBgetLength(xx,0);
+    }
+
+}
+
+
+
+
+static void joinedExtents(EdStruct *xx, int *leftPos, int* rightPos)
+/*
+** get information about relative positions of two joined contigs
+*/
+{
+    int offset = editorLockedPos(0/*don't force recalculation*/);
+    EdStruct *otherxx;
+    int left,right;
+    int otherleft,otherright;
+
+    otherxx = intToEdStruct(0);
+    extents(xx,&left,&right);
+    if (otherxx==xx) {
+	otherxx = intToEdStruct(1);
+	extents(otherxx,&otherleft,&otherright);
+        *leftPos = min(left,otherleft-offset);
+	*rightPos = max(right,otherright-offset);
+    } else {
+	extents(otherxx,&otherleft,&otherright);
+        *leftPos = min(left,otherleft+offset);
+	*rightPos = max(right,otherright+offset);
+    }
+}
+
+
+
+
+
+
+static void getExtents(EdStruct *xx, int *left, int *right)
+{
+    if (inJoinMode() && editorLocked())
+	joinedExtents(xx,left,right);
+    else {
+	extents(xx,left,right);
+	if (inJoinMode()  && !editorLocked()) {
+	    *right += xx->displayWidth - 2;
+	    *left  -= xx->displayWidth - 1;
+	}
+    }
+
+
+}
+
+
+
+/***************************************************/
+/**/
+/**/
+
+
+
+
+
+static void setSliderPos (EdStruct *xx, int pos)
+/*
+** set slider as a position in contig
+*/
+{
+    float percentSet;
+    float percentShown;
+    int left, right;
+    getExtents(xx,&left,&right);
+    percentSet = (float)(pos-left)/(float)(right-left+1+1-xx->displayWidth);
+    percentShown = (float)xx->displayWidth/(float)(right-left+1+1);
+
+    XawScrollbarSetThumb(xx->sliderWid,percentSet,percentShown);
+}
+
+
+
+
+
+
+static void incDisplayPosP (EdStruct *xx, int distance)
+/*
+** Increase the leftmost base position on the screen by a symbolic amount
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    switch (distance) {
+        case D_screen     : xx->displayPos += xx->displayWidth; break;
+        case D_halfScreen : xx->displayPos += xx->displayWidth/2; break;
+        case D_character  : xx->displayPos += 1; break;
+    }
+    getExtents(xx,&left,&right);
+    if (xx->displayPos > right + 2 - xx->displayWidth)
+	xx->displayPos = right + 2 - xx->displayWidth;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+static void decDisplayPosP (EdStruct *xx, int distance)
+/*
+** Decrease the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    switch (distance) {
+        case D_screen     : xx->displayPos -= xx->displayWidth; break;
+        case D_halfScreen : xx->displayPos -= xx->displayWidth/2; break;
+        case D_character  : xx->displayPos -= 1; break;
+    }
+    getExtents(xx,&left,&right);
+    if (xx->displayPos < left)
+	xx->displayPos = left;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+
+static void setDisplayPosPercentP (EdStruct *xx, float percent)
+/*
+** Set the leftmost base position on the screen given a percentage
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    getExtents(xx,&left,&right);
+    xx->displayPos  = (int)((float) (right-left+1+1-xx->displayWidth) * percent) + left;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+static void positionCursor(EdStruct *xx, Widget w, int seq, int pos)
+/*
+** position the cursor int sequence seq at position pos
+** and indicate cursor if on screen
+*/
+{
+    if (onScreen(xx,seq,pos)) {
+        int screenRow,screenColumn;
+        int *seqList;
+
+	screenColumn = positionInContig(xx,seq,pos) - xx->displayPos;
+        seqList = sequencesOnScreen(xx,xx->displayPos, xx->displayWidth);
+	for(screenRow=0;
+	    screenRow<xx->displayHeight && seqList[screenRow] != seq;
+	    screenRow++);
+	XawSheetDisplayCursor(w,True);
+	XawSheetPositionCursor(w,screenColumn,screenRow+xx->rulerDisplayed);
+    } else
+	XawSheetDisplayCursor(w,False);
+}
+
+static void redisplayDisagreement()
+/*
+** Recalculate and redisplay all disagreements
+*/
+{
+    char spare[MAX_DISPLAY_WIDTH];
+    EdStruct *xx[2];
+    int i;
+
+    if (inJoinMode()) {
+        xx[0] = intToEdStruct(0);
+        xx[1] = intToEdStruct(1);
+        for (i=0;i<xx[0]->displayWidth;i++)
+	    spare[i]=(xx[0]->displayedConsensus[i] ==
+	              xx[1]->displayedConsensus[i])?' ':'!';
+	XawSheetPutText(disagreeWid,0,0,xx[0]->displayWidth,spare);
+    }
+}
+
+
+void redisplaySequences (EdStruct *xx, Widget namesWid, Widget sequencesWid,
+                      int pos, int width)
+/*
+** Redisplay the whole sequence display
+*/
+{
+    Arg args[10];
+    int nargs;
+    int *seqList;
+
+    setSliderPos(xx,pos);
+
+    /*
+    ** Set Up Text Window sizes
+    */
+    xx->displayHeight = linesOnScreen(xx,pos,width);
+    seqList = sequencesOnScreen(xx,pos, width);
+
+    /*
+    ** Update names list
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, xx->displayHeight+xx->rulerDisplayed); nargs++;
+    XtSetValues(namesWid, args, nargs);
+    {
+	int i;
+	XawSheetInk splodge[NAMELEN];
+	for (i=0 ; i < NAMELEN ; i++) splodge[i].sh = sh_inverse;
+	for (i=0 ; i < xx->displayHeight ; i++ ) {
+	    if (DBgetFlags(xx,seqList[i]) & DB_FLAG_SELECTED)
+		XawSheetPutJazzyText(namesWid,0,(i+xx->rulerDisplayed),NAMELEN,DBgetName(xx,seqList[i]),splodge);
+	    else
+		XawSheetPutText(namesWid,0,(i+xx->rulerDisplayed),NAMELEN,DBgetName(xx,seqList[i]));
+	}
+    }
+
+    /*
+    ** Update sequence list
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, xx->displayHeight+xx->rulerDisplayed); nargs++;
+    XtSetValues(sequencesWid, args, nargs);
+    {
+	int i;
+	char spare[MAX_DISPLAY_WIDTH];
+	if (xx->rulerDisplayed) {
+	    char *k;
+	    int j,lower,times;
+	    lower = (pos - pos%10);
+	    times = width/10 + 2;
+	    for (j=0,k=spare;j<times;j++,k+=10,lower+=10) sprintf(k,"%10d",lower);
+	    XawSheetPutText(sequencesWid,0,0,width,&spare[9+pos%10]);
+	}
+	
+	DBcalcConsensus(xx,pos,width,xx->displayedConsensus,BOTH_STRANDS);
+	for (i=0 ; i < xx->displayHeight ; i++ ) {
+	    char * ptr;
+	    XawSheetInk splodge[MAX_DISPLAY_WIDTH];
+	    
+	    if (seqList[i]==0){
+		ptr      = xx->displayedConsensus;
+	    }else{
+		DBgetSequence(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,spare);
+		ptr      = spare;
+	    }
+	    if (xx->showDifferences) {
+		int j;
+		for (j=0;j<width;j++) if (spare[j]==xx->displayedConsensus[j])
+		    spare[j]='.';
+	    }
+	    getTagSplodge(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,splodge);
+	    XawSheetPutJazzyText(sequencesWid,0,(i+xx->rulerDisplayed),width,ptr,splodge);
+	}
+	
+    }
+
+    positionCursor(xx,sequencesWid,xx->cursorSeq,xx->cursorPos);
+    redisplaySelection(xx);
+
+}
+
+
+
+
+static void dumpSequence(EdStruct *xx, int seq, int pos, int width, char *str)
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+{
+    char *src;
+    int length = DBgetLength(xx,seq);
+    int i,j;
+
+    src = DBgetSeq(xx,seq);
+
+    /* Lefthand cut off */
+    if (pos<0) {
+	i = (width<-pos)?width:-pos;
+	getLCut(xx,seq, -pos, i, str);
+	for(j=0;j<i;j++) if (isupper(str[j])) str[j] = tolower(str[j]);
+    } else
+	i=0;
+    
+    /*copy sequence*/
+    for (;
+	 i<width && (pos+i)<length;
+	 i++) {
+	str[i]=src[pos+i]; 
+    }
+    
+    /* Righthand cut off */
+    if (i<width) {
+	getRCut(xx,seq, pos+i-length, width-i, &str[i]);
+	for(j=i;j<width;j++) if (isupper(str[j])) str[j] = tolower(str[j]);
+    }
+    
+    str[width]='\0';
+
+}
+
+void dumpLine(EdStruct *xx, FILE *fp, int pos, int width)
+/*
+** Print out a section
+*/
+{
+    int *seqList;
+    int i;
+    char spare[MAX_DISPLAY_WIDTH];
+    char consensus[MAX_DISPLAY_WIDTH];
+    int displayHeight;
+
+    displayHeight = linesOnScreen(xx,pos,width);
+    seqList = sequencesOnScreen(xx,pos, width);
+
+
+    if (xx->rulerDisplayed) {
+	char *k;
+	int j,lower,times;
+	lower = (pos - pos%10);
+	times = width/10 + 2;
+	for (j=0,k=spare;j<times;j++,k+=10,lower+=10) sprintf(k,"%10d",lower);
+	fprintf(fp,"%*.*s   %*.*s\n",
+		NAMELEN,NAMELEN," ",
+		width,width,&spare[9+pos%10]);
+    }
+    DBcalcConsensus(xx,pos,width,consensus,BOTH_STRANDS);
+    
+    for (i=0 ; i < displayHeight ; i++ ) {
+	char * ptr;
+	
+	if (DBgetFlags(xx,seqList[i]) & DB_FLAG_SELECTED)
+	    fprintf(fp,"%*.*s * ",NAMELEN,NAMELEN, DBgetName(xx,seqList[i]));
+	else
+	    fprintf(fp,"%*.*s   ",NAMELEN,NAMELEN, DBgetName(xx,seqList[i]));
+	
+	if (seqList[i]==0){
+	    ptr = consensus;
+	}else{
+	    dumpSequence(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,spare);
+	    ptr = spare;
+	}
+	if (xx->showDifferences) {
+	    int j;
+	    for (j=0;j<width;j++) if (spare[j]==consensus[j])
+		spare[j]='.';
+	}
+	fprintf(fp,"%*.*s\n",width,width,ptr);
+    }
+    
+    fprintf(fp,"\n");
+
+}
+
+
+void dumpRegion(EdStruct *xx, FILE *fp, int start, int end, int width)
+{
+    for(;start<=end;start+=width)
+	dumpLine(xx, fp, start, (end-start+1<width)?end-start+1:width);
+}
+
+
+
+
+
+
+void dumpContig(EdStruct *xx)
+{
+    int left,right;
+
+    static int i = 0;
+    char fn[1024];
+    FILE *fp;
+
+    i++;
+    sprintf(fn,"dump.%d.%d",getpid(),i);
+
+    if ( (fp = fopen(fn,"w")) != NULL ) {
+	extents(xx, &left, &right);
+	bell();
+	dumpRegion(xx,fp,left,right,60);
+	bell();
+	fclose(fp);
+    }
+}
+
+
+
+int createEdDisplay(EdStruct *xx, Widget namesWid, Widget sequencesWid, int seq, int pos)
+/*
+** Create the initial sequence display
+*/
+{
+    int i;
+
+    /*
+    ** Initial position on screen
+    */
+    xx->cursorSeq = 0;
+    xx->cursorPos = pos;
+
+    for (i=1; i<=xx->DB_gelCount; i++) {
+	if (DBgetNumber(xx,i) == seq) {
+	    xx->cursorSeq = i;
+	    xx->cursorPos = pos;
+	    break;
+	}
+    }
+
+    /*
+    ** Display.
+    ** set xx->displayPos to force repositioning of cursor
+    */
+    xx->displayPos = positionInContig(xx,xx->cursorSeq,xx->cursorPos) +
+	2*xx->displayWidth;
+    redisplayWithCursor(xx);
+    return 0;
+}
+
+
+void incDisplayPos (EdStruct *xx, int distance)
+/*
+** Increase the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    if (editorLocked()) {
+	incDisplayPosP(intToEdStruct(0), distance);
+	incDisplayPosP(intToEdStruct(1), distance);
+    } else
+	incDisplayPosP(xx, distance);
+
+    redisplayDisagreement();
+}
+
+
+void decDisplayPos (EdStruct *xx, int distance)
+/*
+** Decrease the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    if (editorLocked()) {
+	decDisplayPosP(intToEdStruct(0), distance);
+	decDisplayPosP(intToEdStruct(1), distance);
+    } else
+	decDisplayPosP(xx, distance);
+
+    redisplayDisagreement();
+}
+
+
+
+
+
+void setDisplayPosPercent (EdStruct *xx, float percent)
+/*
+** Set the leftmost base position on the screen given a percentage
+*/
+{
+    if (editorLocked()) {
+	setDisplayPosPercentP(intToEdStruct(0),percent);
+	setDisplayPosPercentP(intToEdStruct(1),percent);
+    } else
+	setDisplayPosPercentP(xx,percent);
+
+    redisplayDisagreement();
+}
+
+void setDisplayPos(EdStruct *xx, int pos)
+/*
+** centralise pos on screen
+*/
+{
+    if (pos==1) pos = 2; /*if pos==1 no redraw is done. So force*/
+
+    if (editorLocked()) {
+        int offset = editorLockedPos(0/*don't force recalculation*/);
+	EdStruct *otherxx;
+	otherxx = intToEdStruct(0);
+	if (otherxx == xx) {
+	    otherxx = intToEdStruct(1);
+	    otherxx->displayPos = pos + offset;
+	} else {
+	    otherxx->displayPos = pos - offset;
+	}
+    }
+
+    xx->displayPos = pos;
+    decDisplayPos(xx,D_halfScreen);
+
+}
+
+static void showCursor(EdStruct *xx, int seq, int pos)
+/*
+** ensure that the cursor is visible on the screen
+*/
+{
+    if (onScreen(xx,seq,pos))
+        positionCursor(xx,xx->sequencesWid,seq,pos);
+    else {
+	setDisplayPos(xx,positionInContig(xx,seq,pos));
+/*
+	if (xx->displayPos > positionInContig(xx,seq,pos))
+	    decDisplayPos(xx,D_halfScreen);
+	else
+	    incDisplayPos(xx,D_halfScreen);
+	xx->displayPos = positionInContig(xx,seq,pos) - (xx->displayWidth/2);
+	if (xx->displayPos < 1) xx->displayPos = 1;
+	if (xx->displayPos > (DBgetLength(xx,0)+2 - xx->displayWidth))
+	    xx->displayPos = (DBgetLength(xx,0)+2 - xx->displayWidth);
+	redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+*/
+    }
+}
+
+void redisplayWithCursor(EdStruct *xx)
+/*
+** Redisplay screen, ensuring cursor display
+*/
+{
+    if (onScreen(xx,xx->cursorSeq,xx->cursorPos)) {
+	redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	redisplayDisagreement();
+    } else
+	showCursor(xx,xx->cursorSeq,xx->cursorPos);
+}
+
+void caretRight (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor right
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    if (xx->cursorPos<=DBgetLength(xx,xx->cursorSeq)) {
+	xx->cursorPos++;
+        showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    } else
+	bell();
+
+}
+
+
+void caretLeft (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor left
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+
+    if (xx->cursorPos>1) {
+	xx->cursorPos--;
+        showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    } else
+	bell();
+
+}
+
+void caretDown (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor down,
+** cycle if necessary
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int *seqList,seqCount;
+    int posInContig;
+    int i;
+
+    posInContig = positionInContig(xx,xx->cursorSeq,xx->cursorPos);
+    seqList = sequencesInRegion(xx,posInContig-1,2);
+    seqCount = linesInRegion(xx,posInContig-1,2);
+    for(i=0;
+	i<seqCount && seqList[i]!=xx->cursorSeq;
+	i++);
+
+    i++;
+    if (i == seqCount) i = 0;
+
+    xx->cursorSeq = seqList[i];
+    xx->cursorPos = posInContig - DBgetRelPos(xx,xx->cursorSeq) + 1;
+    if (xx->cursorPos<1)
+	xx->cursorPos = 1;
+    else
+	if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+	    xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    
+}
+
+void caretUp (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor up,
+** cycle if necessary
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int *seqList,seqCount;
+    int posInContig;
+    int i;
+
+    posInContig = positionInContig(xx,xx->cursorSeq,xx->cursorPos);
+    seqList = sequencesInRegion(xx,posInContig-1,2);
+    seqCount = linesInRegion(xx,posInContig-1,2);
+    for(i=0;
+	i<seqCount && seqList[i]!=xx->cursorSeq;
+	i++);
+
+    if (i==0) i = seqCount;
+    i--;
+
+    xx->cursorSeq = seqList[i];
+    xx->cursorPos = posInContig - DBgetRelPos(xx,xx->cursorSeq) + 1;
+    if (xx->cursorPos<1)
+	xx->cursorPos = 1;
+    else
+	if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+	    xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    
+}
+
+
+static void deleteBase (EdStruct *xx, int seq, int pos)
+/*
+** delete a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq && pos) {
+	int j;
+	char *s;
+	char deletedBase;
+	s=DBgetSeq(xx,seq);
+	deletedBase = s[pos-1];
+        for (j = pos; j < DBgetLength(xx,seq); j++)
+            s[j-1] = s[j];
+	DBsetLength(xx,seq,DBgetLength(xx,seq)-1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	tagDeleteBase(xx,seq,pos,&deletedBase);
+        selectDeleteBase(xx, seq, pos);
+    }
+    else if (seq) {
+	/*
+	** deletion at position 0 - shift left sequence
+	*/
+	int i,j;	
+
+        if (DBgetRelPos(xx,seq)==1) {
+	    for (i=1; i <= xx->DB_gelCount; i++) {
+		if (seq == xx->DBorder[i]) {
+		    for (j=i;j>1;j--)
+			xx->DBorder[j] = xx->DBorder[j-1];
+		    xx->DBorder[1] = seq;
+		} else
+		    DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])+1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)-1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; DBgetRelPos(xx,xx->DBorder[j-1]) > DBgetRelPos(xx,seq) ;j--)
+		xx->DBorder[j] = xx->DBorder[j-1];
+	    xx->DBorder[j] = seq;
+	}
+
+    }
+
+}
+
+void deleteKey (Widget widget, XEvent *event, String *params,
+		Cardinal *num_params)
+/*
+** delete a character
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int i;
+    char ch;
+
+    if (!(xx->DB_flags & DB_ACCESS_UPDATE)) return;
+    if (! onScreen(xx, xx->cursorSeq, xx->cursorPos)) {
+	/*
+	** When cursor is currently displayed on the screen,
+	** give a warning tone, and recentre screen on cursor
+	*/
+	showCursor(xx,xx->cursorSeq, xx->cursorPos);
+	bell();
+    } else if (positionInContig(xx,xx->cursorSeq,xx->cursorPos)==1 && !xx->cursorSeq) {
+	bell();
+    } else if (xx->cursorSeq) {
+	if (editModeIsSuperman()) {
+	    if (xx->cursorPos==1)
+		ch = ' ';
+	    else {
+	        char *s;
+	        s = DBgetSeq(xx,xx->cursorSeq);
+	        ch = s[xx->cursorPos-2];
+	    }
+
+	    recordEdit(&bucketStack,xx,undoDelete,xx->cursorSeq,xx->cursorPos-1,ch,0);
+
+	    deleteBase (xx,xx->cursorSeq, xx->cursorPos-1);
+
+	    calculateConsensusLength(xx);
+
+	    if (xx->cursorPos>1) xx->cursorPos--;
+	    redisplayWithCursor(xx);
+
+	} else bell();
+    } else {
+        char consensus[2];
+	DBcalcConsensus(xx,xx->cursorPos-1,1,consensus,BOTH_STRANDS);
+	if (*consensus != '*') {
+	    /*
+	    ** Only allow deletion from consensus when it's an asterisk
+	    */
+	    bell();
+	} else {
+	    recordEdit(&bucketStack,xx,undoMark,0,xx->cursorPos,'\0',0);
+            for (i=1; i<=xx->DB_gelCount; i++) {
+                if (DBgetRelPos(xx,i) + DBgetLength(xx,i) <= xx->cursorPos) ;
+		else if (DBgetRelPos(xx,i) >= xx->cursorPos) {
+		    recordEdit(&bucketStack,xx,undoDelete,i,0,' ',0);
+		    deleteBase (xx,i, 0);
+		}
+		else {
+	            char *s;
+		    s=DBgetSeq(xx,i);
+	            ch = s[xx->cursorPos-DBgetRelPos(xx,i)-1];
+		    recordEdit(&bucketStack,xx,undoDelete,i,xx->cursorPos-DBgetRelPos(xx,i),ch,0);
+		    deleteBase (xx,i, xx->cursorPos-DBgetRelPos(xx,i));
+		}
+
+            }
+	    recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+
+	    DBsetLength(xx,0,DBgetLength(xx,0)-1);
+	    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	    redisplayDisagreement();
+	    xx->cursorPos--;
+	    redisplayWithCursor(xx);
+	}
+    }
+    
+}
+
+static void insertBase (EdStruct *xx, int seq, int pos, char base)
+/*
+** insert a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq && base!=' ') {
+	int j;
+	char *s;
+	int len = DBgetLength(xx,seq);
+	/* don't let user overrun maximum gel length */
+	if (len<(int)*saveState.maxgel) {
+	    s=DBgetSeq(xx,seq);
+	    for (j = len+1; j > pos; j--)
+		s[j-1] = s[j-2];
+	    s[pos-1] = base;
+	    DBsetLength(xx,seq,len+1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	    tagInsertBase(xx,seq,pos,&base);
+	    selectInsertBase(xx, seq, pos);
+	}
+    }
+    else if (seq) {
+	/*
+	** space inserted at start: shift sequence right
+	*/
+	int i,j;	
+
+	if (DBgetRelPos(xx,seq)==1)
+	    for (i=1; i<xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i+1])==1;i++);
+	else
+	    i=0;
+        if (i==1) {
+	    for (i=2; i <= xx->DB_gelCount; i++) {
+		DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])-1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)+1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; j < xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[j+1]) < DBgetRelPos(xx,seq) ; j++)
+		xx->DBorder[j] = xx->DBorder[j+1];
+	    xx->DBorder[j] = seq;
+	}
+
+    }
+
+}
+
+static void insertChar (EdStruct *xx,int seq, int pos, char key)
+/*
+** insert a character
+*/
+{
+    int i;
+
+    /* Insert the base to the right of the caret */
+    /* perform a check to see that sequences aren't too long */
+    if (seq) {
+	/* don't let user overrun maximum gel length */
+	if (editModeIsSuperman() &&
+	    DBgetLength(xx,seq)<(int)*saveState.maxgel) {
+
+	    recordEdit(&bucketStack,xx,undoInsert,seq,pos,key,0);
+
+	    insertBase(xx, seq, pos, key);
+
+	    calculateConsensusLength(xx);
+
+	    if (key!=' ') xx->cursorPos = ++pos;
+	    redisplayWithCursor(xx);
+	} else bell();
+    } else if (key == '*') {
+	/*
+	** Only allow insertion of asterisks from consensus
+	*/
+	recordEdit(&bucketStack,xx,undoMark,0,pos,'\0',0);
+	for (i=1; i<=xx->DB_gelCount; i++) {
+	    int len = DBgetLength(xx,i);
+	    int relPos = DBgetRelPos(xx,i);
+	    if (relPos + len < pos ||
+		len>=(int)*saveState.maxgel) ;
+	    else if (relPos > pos) {
+		recordEdit(&bucketStack,xx,undoInsert,i,0,key,0);
+		insertBase(xx, i,0,' ');
+	    }
+	    else if (len) {
+		recordEdit(&bucketStack,xx,undoInsert,i, pos-relPos+1,key,0);
+	        insertBase(xx, i, pos-relPos+1, key);
+	    }
+	}
+	recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+
+	DBsetLength(xx,0,DBgetLength(xx,0)+1);
+ 
+	xx->cursorPos = ++pos;
+	redisplayWithCursor(xx);
+    } else {
+	bell();
+    }
+}
+
+static void replaceBase (EdStruct *xx, int seq, int pos, char base)
+/*
+** replace a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq) {
+	char *s;
+	char deletedBase;
+	s=DBgetSeq(xx,seq);
+	deletedBase=s[pos-1];
+        s[pos-1] = base;
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED);
+	tagReplaceBase(xx,seq,pos,&deletedBase, &base);
+    }
+
+}
+
+static void replaceChar (EdStruct *xx, int seq, int pos, char key)
+/*
+** replace a character
+*/
+{
+    /* int i,j; */
+    char ch;
+
+    /*
+    ** Don't allow replacing past end of sequence
+    */
+    if (DBgetLength(xx,seq) < pos) {
+	bell();
+	return;
+    }
+    if (seq) {
+	char *s;
+	s=DBgetSeq(xx,seq);
+
+	ch = s[pos-1];
+	recordEdit(&bucketStack,xx,undoReplace,seq,pos,ch,0);
+
+	replaceBase(xx,seq, pos, key);
+
+	xx->cursorPos = ++pos;
+	redisplayWithCursor(xx);
+
+    } else {
+	bell();
+	/*
+	** Don't allow replacements on consensus at ANY stage
+	**
+	** recordEdit(&bucketStack,xx,undoMark,0,pos,'\0',0);
+	** for (i=1; i<=xx->DB_gelCount; i++) {
+	**     if (DBgetRelPos(xx,i) + DBgetLength(xx,i) <= pos) ;
+	**     else if (DBgetRelPos(xx,i) > pos) ;
+	**     else if (DBgetLength(xx,i)) {
+	** 	char *s;
+	** 	s=DBgetSeq(xx,i);
+	** 	ch = s[pos-DBgetRelPos(xx,i)];
+	** 	recordEdit(&bucketStack,xx,undoReplace,i,pos-DBgetRelPos(xx,i)+1,ch,0);
+	**         replaceBase(xx,i, pos-DBgetRelPos(xx,i)+1, key);
+	**     }
+	** }
+	** recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+	**
+	**
+	** xx->cursorPos = ++pos;
+        ** redisplayWithCursor(xx);
+	*/
+    }
+}
+
+static int validKey(char key)
+{
+    static char validKeys[] = "CcTtAaGg1234DVBHKLMNRY5678-*";
+    if (*saveState.idm == 26)
+	return (key!=' ');
+    else
+	return (strchr(validKeys,key) != NULL);
+}
+
+void keyPress (Widget widget, XEvent *event, String *params,
+	       Cardinal *num_params)
+/*
+** Handle a key press
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    char keys[8];
+    char key;
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    /*
+    ** validkeys is different when database is for DNA and when
+    ** database is for protein
+    **
+    ** switch (xx->DB_flags & DB_DATA_TYPE) {
+    **     case DB_DATA_TYPE_DNA:     do dna stuff
+    **     case DB_DATA_TYPE_PROTEIN: do protein stuff
+    **     default : do dna stuff???
+    ** }
+    ** BASES = 6 for dna, 26 for protein (specified by fortran variable IDM?)
+    static char validKeys[] = "CcTtAaGg1234DVBHKLMNRY5678-*";
+    */
+
+    if (!(xx->DB_flags & DB_ACCESS_UPDATE)) return;
+
+    charCount = XLookupString((XKeyEvent *)event, keys,
+			      (int)sizeof(keys), &keySym,&compose);
+    key = keys[0];
+    if (charCount==1) {
+	if (! onScreen(xx, xx->cursorSeq, xx->cursorPos)) {
+	    /*
+	    ** When cursor is currently displayed on the screen,
+	    ** give a warning tone, and recentre screen on cursor
+	    */
+	    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+	    bell();
+	} else if (validKey(key)) {
+	    if (editModeIsInsert())
+	        insertChar (xx,xx->cursorSeq, xx->cursorPos, key);
+	    else
+	        replaceChar (xx,xx->cursorSeq, xx->cursorPos, key);
+        }
+	else {
+	    if (xx->cursorSeq && xx->cursorPos==1 && key == ' ' && editModeIsInsert())
+	        insertChar (xx,xx->cursorSeq, xx->cursorPos, key);
+	    else bell();
+	}
+    }
+}
+
+void buttonDown (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** An attempt has been made to reposition the cursor
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+#define left_margin 4
+#define top_margin 4
+
+    int x,y;
+    int *seqList;
+
+    if (editorState == StateDown) return;
+
+    if (widget != xx->sequencesWid) return;
+
+    x = (event->xbutton.x-left_margin) / xx->fontWidth;
+    y = (event->xbutton.y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+
+
+    if (y>=0 && y < xx->displayHeight && x>=0 && x < xx->displayWidth) {
+        seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+        xx->cursorSeq = seqList[y];
+        xx->cursorPos = xx->displayPos - DBgetRelPos(xx,xx->cursorSeq) + x + 1;
+	if (xx->cursorPos<1)
+	    xx->cursorPos = 1;
+	else
+	    if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+		xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+        positionCursor(xx,widget,xx->cursorSeq,xx->cursorPos);
+    }
+
+}
+
+static void undoLastEdit (EdStruct *xx, int command, int seq, int pos, char ch, int t)
+/*
+** Undo a command
+*/
+{
+    switch (command) {
+	case undoDelete:
+	    insertBase(xx, seq,pos,ch);
+	    break;
+	case undoInsert:
+	    deleteBase(xx,seq,(ch==' ')?0:pos);
+	    break;
+	case undoReplace:
+	    replaceBase(xx,seq,pos,ch);
+	    break;
+	case undoExtend:
+	    unextend(xx,seq,ch);
+	    break;
+	case undoUnextend:
+	    undo_unextend(xx,seq,ch,t);
+	    break;
+	default:
+	    break;
+	}
+}
+
+void undoLastCommand()
+/*
+** Undo last keypress that modified contig
+*/
+{
+    EdStruct *xx;
+    int command;
+    int seq;
+    int pos;
+    char ch;
+    int t;
+
+    retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+    if (command == undoNullCommand) {
+	bell();
+	return;
+    }
+    else if (command == undoMark) {
+
+        retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+	while (command != undoMark) {
+	    undoLastEdit(xx,command,seq,pos,ch,t);
+            retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+	}
+
+        xx->cursorSeq = 0;
+        xx->cursorPos = pos;
+
+    } else {
+
+	undoLastEdit(xx,command,seq,pos,ch,t);
+
+        xx->cursorSeq = seq;
+        xx->cursorPos = pos+(command==undoDelete);
+
+    }
+
+    calculateConsensusLength(xx);
+    redisplayWithCursor(xx);
+}
+
+static int showTrace(EdStruct *xx, int seq, int pos, int baseSpacing)
+{
+
+    char fileName[200];
+    tagStruct *t;
+    /*
+    int traceNo;
+    */
+    int t_len;        /* number of bases in trace */
+    int t_lcut;       /* left cut off of usable data */
+    int t_ulen;       /* length of usable data */
+    char t_type[5];   /* type of trace */
+    char t_fname[FILE_NAME_LENGTH+1]; /* file name of trace */
+    char *rawData;
+    int baseNum;
+
+    t = (tagStruct *) DBgetTags(xx,seq);
+
+    force_comment(t);
+
+    if (! t->newcommentlen) return 1;
+    sscanf(t->newcomment,"%6d%6d%6d%*s",&t_len,&t_lcut,&t_ulen);
+    strncpy(t_type,&t->newcomment[18],4);
+    strncpy(t_fname,&t->newcomment[22],FILE_NAME_LENGTH);
+
+    t_type[4] = '\0';
+    {   /* convert fortran string to c string */
+        int i;
+        for (i=FILE_NAME_LENGTH-1;i>=0 && (!t_fname[i] || isspace(t_fname[i]));i--) ;
+        t_fname[++i] = '\0';
+    }
+
+    /* skip if no raw data file for trace */
+    if (t_fname[0] == '\0') return 1;
+
+    /* check trace file exists */
+    if ( ! file_exists(t_fname) ) {
+	char *s;
+        /* try in rawData */
+	/*
+	** get environment details
+	*/
+	rawData = (char *)getenv ("RAWDATA");
+        if (rawData == NULL) return 1;
+	/*
+        sprintf (fileName,"%s/%s",rawData,t_fname);
+	*/
+	if ((s=findfile(t_fname,rawData))==NULL) return 1;
+	strcpy (fileName,s);
+    } else
+        /* ok */
+        strcpy (fileName,t_fname);
+
+    if (DBgetComp(xx,seq) == UNCOMPLEMENTED) {
+        baseNum = origpos(xx, seq, pos) + t_lcut;
+	manageTrace(t_type, fileName, baseNum, t_lcut, t_ulen, /*not complemented*/0, baseSpacing, DBgetName(xx,seq));
+    } else {
+        baseNum = t_len - t_lcut - origpos(xx, seq, normalisePos(xx,seq,pos,1));
+	manageTrace(t_type, fileName, baseNum, t_lcut, t_ulen, /*complemented*/1, baseSpacing, DBgetName(xx,seq));
+    }
+
+
+    return 0;
+}
+
+void invokeTrace (Widget widget, XEvent *event, String *params,
+		  Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+    int baseSpacing = xx->fontWidth * 2;
+
+    if (xx->cursorSeq) {
+	showTrace(
+	    xx,
+	    xx->cursorSeq,
+	    xx->cursorPos,
+	    baseSpacing);
+    } else {
+	int *seqList;
+	int i;
+        seqList = sequencesOnScreen(xx,xx->cursorPos,1);
+	for (i=0; i<xx->displayHeight && seqList[i]; i++)
+	    showTrace(
+		xx,
+		seqList[i],
+		xx->cursorPos-DBgetRelPos(xx,seqList[i])+1,
+		baseSpacing);
+
+    }
+}
+
+void countDisagreements(int *overlapLength, int *wingeCount)
+{
+    int left0,right0;
+    int left1/*,right1*/;
+    int length0,length1;
+    int offset = editorLockedPos(1/*force recalculation*/);
+    int i;
+    EdStruct *xx[2];
+    char *ol0,*ol1;
+
+    xx[0] = intToEdStruct(0);
+    xx[1] = intToEdStruct(1);
+
+    if (offset < 0) {
+	left0 = 1-offset;
+	left1 = 1;
+    } else {
+	left0 = 1;
+	left1 = 1+offset;
+    }
+    length0 = DBgetLength(xx[0],0);
+    length1 = DBgetLength(xx[1],0);
+    if (offset+length0 < length1) {
+	right0 = length0;
+	/*
+	right1 = length0-offset;
+	*/
+    } else {
+	right0 = length1-offset;
+	/*
+	right1 = length1;
+	*/
+    }
+    *overlapLength = right0 - left0+1;
+    *wingeCount  = 0;
+
+    if (*overlapLength > 0) {
+	ol0 = (char *) malloc(*overlapLength+1);
+	ol1 = (char *) malloc(*overlapLength+1);
+	DBcalcConsensus(xx[0],left0,*overlapLength,ol0,BOTH_STRANDS);
+	DBcalcConsensus(xx[1],left1,*overlapLength,ol1,BOTH_STRANDS);
+	for (i=0;i<*overlapLength;i++) if(ol0[i]!=ol1[i])(*wingeCount)++;
+	free(ol0);
+	free(ol1);
+    }
+}
+
+
+void selectRead (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** A gel reading name has been selected - toggle highlighting
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+#define left_margin 4
+#define top_margin 4
+
+    int y;
+    int *seqList;
+
+    if (editorState == StateDown) return;
+
+    if (widget != xx->namesWid) return;
+
+    y = (event->xbutton.y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+
+    if (y>=0 && y < xx->displayHeight) {
+	int flag;
+        seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+	flag = DBgetFlags(xx,seqList[y]);
+	flag ^= DB_FLAG_SELECTED;
+	DBsetFlags(xx,seqList[y],flag);
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+    }
+
+}
+
+
+
+
+/*************************************************************
+**
+** Fast accessing routines... utilising binary search
+**
+*************************************************************/
+
+int posToIndex(EdStruct *xx, int pos)
+/*
+** Find the first sequence that starts at or to the right of a
+** given position
+*/
+{
+    int Min, Max, Mid;
+
+    /* binary search */
+    /* Min, Max, Mid refer to pairs of numbers: ie MAX --> [MAX-1],[MAX] */
+    Min = 1;
+    Max = xx->DB_gelCount + 1;
+    
+    do {
+	int r1,r2;
+
+	Mid = (Max+Min)/2;
+
+	/* compare */
+	r1 = (Mid==1)?(pos-1):DBgetRelPos(xx,xx->DBorder[Mid-1]);
+	r2 = (Mid==xx->DB_gelCount+1)?(pos+1):DBgetRelPos(xx,xx->DBorder[Mid]);
+
+	if (r1 < pos && r2 >= pos) 
+	    return (Mid==xx->DB_gelCount+1)?(0):Mid;
+
+	if (r1 < pos)
+	    Min = Mid+1;
+	else
+	    Max = Mid-1;
+
+    } while (Max>=Min);
+
+    return 0;
+
+}
+
+
+int posToSeq(EdStruct *xx, int pos)
+/*
+** Find the first sequence that starts at or to the right of a
+** given position
+*/
+{
+    int ind;
+
+    ind = posToIndex(xx,pos);
+    if (ind)
+	return xx->DBorder[ind];
+    else
+	return 0;
+
+}
+
+
+int seqToIndex(EdStruct *xx, int seq)
+/*
+**
+*/
+{
+    int i;
+    int ind;
+
+    ind = posToIndex(xx,DBgetRelPos(xx,seq));
+
+    if (ind) {
+	for (i=ind; i<=xx->DB_gelCount && xx->DBorder[i]!=seq;i++) ;
+
+	if (i<=xx->DB_gelCount) return i;
+    }
+
+    return 0;
+	
+}
+
+
+
+
diff --git a/src/bap/edUtils.h b/src/bap/edUtils.h
new file mode 100644
index 0000000..fe421df
--- /dev/null
+++ b/src/bap/edUtils.h
@@ -0,0 +1,257 @@
+#include <X11/Intrinsic.h>
+#include "tagUtils.h"
+#include "fort.h"
+
+#ifndef _edUtils_h
+#define _edUtils_h
+
+/*
+** constant definitions
+*/
+#define MAX_DISPLAY_WIDTH 200
+#define DEFAULT_DISPLAY_WIDTH 80
+#define DB_NAMELEN 16  /* size of records in AR file */
+#define DB_GELNOLEN 5
+#define NAMELEN (DB_NAMELEN + DB_GELNOLEN + 1)
+#define BASES 6
+
+/*
+** Useful distances
+** (treat as symbolic rather than actual distances)
+*/
+#define D_screen     80
+#define D_halfScreen 40
+#define D_character   1
+
+/*
+** Flags for the internal database
+*/
+/* for whole database */
+#define DB_ACCESS            (1L<<0)
+#define DB_DATA_TYPE         (1L<<1)
+#define DB_STORAGE           (1L<<2)
+#define DB_ACCESS_READONLY   (0L)
+#define DB_ACCESS_UPDATE     (DB_ACCESS)
+#define DB_DATA_TYPE_DNA     (0L)
+#define DB_DATA_TYPE_PROTEIN (DB_DATA_TYPE)
+#define DB_STORAGE_DISK      (0L)
+#define DB_STORAGE_INTERNAL  (DB_STORAGE)
+#define DB_DELAYED_READ      (0L)
+/* for each sequence */
+#define DB_FLAG_NONE             (0L)
+#define DB_FLAG_IN_MEMORY        (1L<<0)
+#define DB_FLAG_SEQ_MODIFIED     (1L<<1)
+#define DB_FLAG_REL_MODIFIED     (1L<<2)
+#define DB_FLAG_TAG_MODIFIED     (1L<<3)
+#define DB_FLAG_SELECTED         (1L<<4)
+#define DB_FLAG_TAG_IN_MEMORY    (1L<<5)
+#define DB_FLAG_SEQ_IN_MEMORY    (1L<<0)
+#define DB_FLAG_NAME_IN_MEMORY   (1L<<6)
+/*
+** type definitions
+*/
+typedef struct {
+        int relPos;
+        int length;
+        int number;
+        int complemented;
+        char *name;
+        char *sequence;
+	long flags;
+	tagStruct *tagList;
+        } DBStruct, *DBptr;
+
+
+/* REMEMBER TO: update structure initialisation in edUtils.c */
+typedef struct {
+        int oldMax;
+	long DB_flags;
+        int DB_gelCount;
+        int DB_contigNum;
+        int *DBlist;
+        int *DBorder;
+        DBStruct *DB;
+        int displayPos ;
+        int displayWidth;
+        int displayHeight;
+        int cursorPos;
+        int cursorSeq;
+        int rulerDisplayed;
+        int consensusDisplayed;
+        int fontWidth;
+        int fontHeight;
+	Widget edWid;
+        Widget namesWid;
+	Widget sequencesWid;
+	Widget scrollButtonsWid;
+	Widget sliderWid;
+        char displayedConsensus[ MAX_DISPLAY_WIDTH ];
+	int select_made;
+	int select_seq;
+	int select_start_pos;
+	int select_end_pos;
+	tagStruct *select_tag;
+	int reveal_cutoffs;
+	int showDifferences;
+        } EdStruct, *EdStructPtr;
+/* REMEMBER TO: update structure initialisation in edUtils.c */
+
+
+/*
+** Useful macros concerning internal database
+*/
+#define DBgetRelPos(X,A)   ((X)->DB[(A)].relPos)
+#define DBgetLength(X,A)   ((X)->DB[(A)].length)
+#define DBgetNumber(X,A)   ((X)->DB[(A)].number)
+#define DBgetComp(X,A)     ((X)->DB[(A)].complemented)
+#define DBgetFlags(X,A)    ((X)->DB[(A)].flags)
+extern char *DBgetSeq();
+extern tagStruct *DBgetTags();
+extern char *DBgetName();
+
+/****************/
+#define DB_RelPos(X,A)   ((X)->DB[(A)].relPos)
+#define DB_Length(X,A)   ((X)->DB[(A)].length)
+#define DB_Number(X,A)   ((X)->DB[(A)].number)
+#define DB_Comp(X,A)     ((X)->DB[(A)].complemented)
+#define DB_Name(X,A)     ((X)->DB[(A)].name)
+#define DB_Seq(X,A)      ((X)->DB[(A)].sequence)
+#define DB_Flags(X,A)    ((X)->DB[(A)].flags)
+#define DB_Tags(X,A)     ((X)->DB[(A)].tagList)
+/****************/
+#define COMPLEMENTED -1
+#define BOTH_STRANDS 0
+#define UNCOMPLEMENTED 1
+#define DBgetGelName(xx,i) ( &( DBgetName(xx,i) )[DB_GELNOLEN+1] )
+
+#define DBsetRelPos(X,A,B) (X)->DB[(A)].relPos = (B)
+#define DBsetLength(X,A,B) (X)->DB[(A)].length = (B)
+#define DBsetNumber(X,A,B) (X)->DB[(A)].number = (B)
+#define DBsetComp(X,A,B)   (X)->DB[(A)].complemented = (B)
+#define DBsetName(X,A,B)   (X)->DB[(A)].name = (B)
+#define DBsetSeq(X,A,B)    (X)->DB[(A)].sequence = (B)
+#define DBsetFlags(X,A,B)  (X)->DB[(A)].flags = (B)
+#define DBsetTags(X,A,B)   (X)->DB[(A)].tagList = (B)
+
+/*
+** Useful macros
+*/
+#define normalisePos(X,S,P,L) \
+    ( (DBgetComp((X),(S))==UNCOMPLEMENTED) ? (P) : (DBgetLength((X),(S)) - (P) - (L) + 2) )
+
+
+/*
+** External definitions
+*/
+extern int initialiseDB(
+	EdStruct *xx,
+        int_f *idevr,    /* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *llino	/* left-most gel in contig */
+	);
+
+extern void saveDB(
+	EdStruct *xx,
+        int_f *idevr,    /* unit number for relationships */
+        int_f *idevw,    /* unit number for working versions of sequences */
+        int_f *idevn,    /* unit number for sequence names */
+        int_f *relpg,    /* relative positions of gels in sequences */
+        int_f *lngthg,   /* lengths of sequences */
+        int_f *lnbr,     /* left neighbours */
+        int_f *rnbr,     /* right neighbours */
+        int_f *maxgel    /* maximum length of gel */
+    );
+
+extern void joinDB(
+        int_f *idevr,    /* unit number for relationships */
+        int_f *idevw,    /* unit number for working versions of sequences */
+        int_f *idevn,    /* unit number for sequence names */
+        int_f *relpg,    /* relative positions of gels in sequences */
+        int_f *lngthg,   /* lengths of sequences */
+        int_f *lnbr,     /* left neighbours */
+        int_f *rnbr,     /* right neighbours */
+        int_f *maxgel,   /* maximum length of gel */
+        int_f *llinol,
+        int_f *lnconl,
+        int_f *llinor,
+        int_f *lnconr,
+        int_f *ngels,
+        int_f *nconts,
+        int_f *idbsiz
+    );
+
+
+extern void freeDB(EdStruct *xx);
+extern void countDisagreements(int *overlapLength, int *wingeCount);
+
+extern int  createEdDisplay(EdStruct *xx,Widget namesWid, Widget sequencesWid,int seq,int pos);
+
+extern void incDisplayPos(EdStruct *xx, int distance);
+extern void decDisplayPos(EdStruct *xx, int distance);
+extern void setDisplayPos(EdStruct *xx, int pos);
+extern void setDisplayPosPercent(EdStruct *xx, float percent);
+
+extern int posToIndex(EdStruct *xx, int pos);
+
+extern void undoLastCommand();
+extern void saveDatabase(EdStruct *xx);
+
+extern void caretRight(Widget, XEvent *, String *, Cardinal *);
+extern void caretLeft(Widget, XEvent *, String *, Cardinal *);
+extern void caretDown(Widget, XEvent *, String *, Cardinal *);
+extern void caretUp(Widget, XEvent *, String *, Cardinal *);
+extern void deleteKey(Widget, XEvent *, String *, Cardinal *);
+extern void keyPress(Widget, XEvent *, String *, Cardinal *);
+extern void buttonDown(Widget, XEvent *, String *, Cardinal *);
+extern void invokeTrace(Widget, XEvent *, String *, Cardinal *);
+extern void selectRead(Widget, XEvent *, String *, Cardinal *);
+
+extern EdStructPtr intToEdStruct(int i);
+extern EdStructPtr widgetToEdStruct(Widget w);
+extern EdStructPtr getFreeEdStruct();
+
+extern void DBgetSequence(EdStruct *xx, int seq, int pos, int width, char *str);
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+extern int *sequencesInRegion(EdStruct *xx,int pos, int width);
+/*
+** Return a pointer to list of sequences in region of contig
+*/
+extern int *sequencesOnScreen(EdStruct *xx,int pos, int width);
+/*
+** Return a pointer to list of sequences on screen
+*/
+extern int positionInContig(EdStruct *xx, int seq, int pos);
+/*
+** returns relative position in a sequence as an 
+** absolute position in the contig
+*/
+extern void DBcalcConsensus (EdStruct *xx,int pos, int width, char *str, int strand);
+/*
+** calculate the consensus for position `pos' in contig,
+** for `width' characters. Take into accound readings on `strand' only:
+**
+*/
+extern void redisplaySequences (EdStruct *xx, Widget namesWid, Widget sequencesWid, int pos, int width);
+/*
+** Redisplay the whole sequence display
+*/
+extern void redisplayWithCursor(EdStruct *xx);
+/*
+** Redisplay screen, ensuring cursor display
+*/
+void calculateConsensusLength(EdStruct *xx);
+/*
+** Calculate dynamic consensus length
+*/
+
+#endif /* _edUtils_h */
diff --git a/src/bap/guk.c b/src/bap/guk.c
new file mode 100644
index 0000000..683a98c
--- /dev/null
+++ b/src/bap/guk.c
@@ -0,0 +1,92 @@
+#include "fort.h"
+#include "upam.gbl"
+#include "uascii.gbl"
+#include "llin.h"
+    
+    static int init_done = 0;
+
+static void initpam2()
+     /*
+      ** Initialise alignment routine
+      */
+{
+    int i, j, k;
+    
+    pam = npam;
+    nsq = naa;
+    
+    k=0;
+    for (i=0; i<nsq; i++)
+	for (j=0; j<=i; j++)
+	    pam2[j][i] = pam2[i][j] = -pam[k++];
+    
+    init_done++;
+}
+
+int mmalign(
+	    char *seq1,
+	    int   length1,
+	    char *seq2,
+	    int   length2,
+	    int  *res
+	    )
+     /*
+      ** int res[length1+length2];
+      **
+      ** Returns: alignment score
+      */
+{
+    if (! init_done) initpam2();
+    
+    return DIFF(seq1-1,seq2-1,
+		length1,length2,
+		pam2, -gdelval, -ggapval, res);
+    
+}
+
+int_f mm_(
+	 char  *seq1,
+	 int_f *length1,
+	 char  *seq2,
+	 int_f *length2,
+	 int_f *res,
+	 int_fl seq_l,
+	 int_fl seq2_l
+	 )
+/*
+** FORTRAN interface
+*/
+{
+    if (! init_done) initpam2();
+    
+    return DIFF(
+		seq1-1,
+		seq2-1,
+		(int)*length1,
+		(int)*length2,
+		pam2, -gdelval, -ggapval,
+		(int *)res
+		);
+}		 
+
+
+int_f dispmm_(
+	 char  *seq1,
+	 int_f *length1,
+	 char  *seq2,
+	 int_f *length2,
+	 int_f *res,
+	 int_fl seq1_l,
+	 int_fl seq2_l
+	 )
+/*
+** FORTRAN interface
+*/
+{
+    (void) DISPLAY(seq1-1, seq2-1,
+		   (int)*length1, (int)*length2,
+		   (int *)res);
+    printf("\n\n");
+    return 0;
+}
+
diff --git a/src/bap/guk.h b/src/bap/guk.h
new file mode 100644
index 0000000..9aa7d20
--- /dev/null
+++ b/src/bap/guk.h
@@ -0,0 +1,10 @@
+#include "fort.h"
+
+extern int mmalign(char *seq1, int length1, char *seq2, int length2,
+		   int *res);
+
+extern int_f mm_(char *seq1, int_f *length1, char *seq2, int_f *length2,
+		 int_f *res, int_fl seq_l, int_fl seq2_l);
+
+extern int_f dispmm_(char *seq1, int_f *length1, char *seq2, int_f *length2,
+		     int_f *res, int_fl seq1_l, int_fl seq2_l);
diff --git a/src/bap/llin.c b/src/bap/llin.c
new file mode 100644
index 0000000..86703a6
--- /dev/null
+++ b/src/bap/llin.c
@@ -0,0 +1,228 @@
+/* A PACKAGE FOR SEQUENCE COMPARISON WITH AFFINE WEIGHTS:
+     See include file "linear.h" for complete interface information. */
+
+#include "llin.h"
+#define XTERNAL
+#include "upam.gbl"
+#include "uascii.gbl"
+#define translate(A) (nascii[A])
+
+/* Globally passed params and macros */
+
+static int (*w)[32];				/* w = W */
+static int g, h, m;				/* g = G, h = H, m = g+h */
+
+#define gap(k)  ((k) <= 0 ? 0 : g+h*(k))	/* k-symbol indel cost */
+
+static int *sapp;				/* Current script append ptr */
+static int  last;				/* Last script op appended */
+
+						/* Append "Delete k" op */
+#define DEL(k)				{ if (last < 0)				    last = *(sapp-1) -= (k);		  else					    last = *sapp++ = -(k);		}
+						/* Append "Insert k" op */
+#define INS(k)				{ if (last < 0)				    { *(sapp-1) = (k); *sapp++ = last; }	  else					    last = *sapp++ = (k);		}
+
+#define REP { last = *sapp++ = 0; }		/* Append "Replace" op */
+
+
+/* diff(A,B,M,N,tb,te) returns the cost of an optimum conversion between
+   A[1..M] and B[1..N] that begins(ends) with a delete if tb(te) is zero
+   and appends such a conversion to the current script.                   */
+
+static int diff(A,B,M,N,tb,te) char *A, *B; int M, N; int tb, te;
+
+{ static int CC[NMAX+1], DD[NMAX+1];	/* Forward cost-only vectors */
+  static int RR[NMAX+1], SS[NMAX+1];	/* Reverse cost-only vectors */
+         int   midi, midj, type;	/* Midpoint, type, and cost */
+         int midc;
+
+{ register int   i, j;
+  register int c, e, d, s;
+           int t, *wa;
+
+/* Boundary cases: M <= 1 or N == 0 */
+
+  if (N <= 0)
+    { if (M > 0) DEL(M)
+      return gap(M);
+    }
+  if (M <= 1)
+    { if (M <= 0)
+        { INS(N);
+          return gap(N);
+        }
+      if (tb > te) tb = te;
+      midc = (tb+h) + gap(N);
+      midj = 0;
+      wa = w[translate(A[1])];
+      for (j = 1; j <= N; j++)
+        { c = gap(j-1) + wa[translate(B[j])] + gap(N-j);
+          if (c < midc)
+            { midc = c;
+              midj = j;
+            }
+        }
+      if (midj == 0)
+        { INS(N) DEL(1) }
+      else
+        { if (midj > 1) INS(midj-1)
+          REP
+          if (midj < N) INS(N-midj)
+        }
+      return midc;
+    }
+
+/* Divide: Find optimum midpoint (midi,midj) of cost midc */
+
+  midi = M/2;			/* Forward phase:                          */
+  CC[0] = 0;			/*   Compute C(M/2,k) & D(M/2,k) for all k */
+  t = g;
+  for (j = 1; j <= N; j++)
+    { CC[j] = t = t+h;
+      DD[j] = t+g;
+    }
+  t = tb;
+  for (i = 1; i <= midi; i++)
+    { s = CC[0];
+      CC[0] = c = t = t+h;
+      e = t+g;
+      wa = w[translate(A[i])];
+      for (j = 1; j <= N; j++)
+        { if ((c =   c   + m) < (e =   e   + h)) e = c;
+          if ((c = CC[j] + m) < (d = DD[j] + h)) d = c;
+          c = s + wa[translate(B[j])];
+          if (e < c) c = e;
+          if (d < c) c = d;
+          s = CC[j];
+          CC[j] = c;
+          DD[j] = d;
+        }
+    }
+  DD[0] = CC[0];
+
+  RR[N] = 0;			/* Reverse phase:                          */
+  t = g;			/*   Compute R(M/2,k) & S(M/2,k) for all k */
+  for (j = N-1; j >= 0; j--)
+    { RR[j] = t = t+h;
+      SS[j] = t+g;
+    }
+  t = te;
+  for (i = M-1; i >= midi; i--)
+    { s = RR[N];
+      RR[N] = c = t = t+h;
+      e = t+g;
+      wa = w[translate(A[i+1])];
+      for (j = N-1; j >= 0; j--)
+        { if ((c =   c   + m) < (e =   e   + h)) e = c;
+          if ((c = RR[j] + m) < (d = SS[j] + h)) d = c;
+          c = s + wa[translate(B[j+1])];
+          if (e < c) c = e;
+          if (d < c) c = d;
+          s = RR[j];
+          RR[j] = c;
+          SS[j] = d;
+        }
+    }
+  SS[N] = RR[N];
+
+  midc = CC[0]+RR[0];		/* Find optimal midpoint */
+  midj = 0;
+  type = 1;
+  for (j = 0; j <= N; j++)
+    if ((c = CC[j] + RR[j]) <= midc)
+      if (c < midc || CC[j] != DD[j] && RR[j] == SS[j])
+        { midc = c;
+          midj = j;
+        }
+  for (j = N; j >= 0; j--)
+    if ((c = DD[j] + SS[j] - g) < midc)
+      { midc = c;
+        midj = j;
+        type = 2;
+      }
+}
+
+/* Conquer: recursively around midpoint */
+
+  if (type == 1)
+    { diff(A,B,midi,midj,tb,g);
+      diff(A+midi,B+midj,M-midi,N-midj,g,te);
+    }
+  else
+    { diff(A,B,midi-1,midj,tb,0);
+      DEL(2);
+      diff(A+midi+1,B+midj,M-midi-1,N-midj,0,te);
+    }
+  return midc;
+}
+
+
+/* Interface and top level of comparator */
+
+int DIFF(A,B,M,N,W,G,H,S) char A[],B[]; int M,N; int W[][32],G,H; int S[];
+
+{ if (N > NMAX) return -1;	/* Error check */
+
+  w = W;			/* Setup global parameters */
+  g = G;
+  h = H;
+  m = g+h;
+  sapp = S;
+  last = 0;
+
+  return diff(A,B,M,N,g,g);	/* OK, do it */
+}
+
+
+/* Alignment display routine */
+
+static char ALINE[51], BLINE[51], CLINE[51];
+
+int DISPLAY(A,B,M,N,S) char A[], B[]; int M, N; int S[];
+{ register char *a, *b, *c;
+  register int   i,  j, op;
+           int   lines;
+
+  i = j = op = lines = 0;
+  a = ALINE;
+  b = BLINE;
+  c = CLINE;
+  while (i < M || j < N)
+    { if (op == 0 && *S == 0)
+        { op = *S++;
+          *a = nt[translate(A[++i])];
+          *b = nt[translate(B[++j])];
+          *c++ = (*a++ == *b++) ? ':' : ' ';
+        }
+      else
+        { if (op == 0)
+            op = *S++;
+          if (op > 0)
+            { *a++ = ' ';
+              *b++ = nt[translate(B[++j])];
+              op--;
+            }
+          else
+            { *a++ = nt[translate(A[++i])];
+              *b++ = ' ';
+              op++;
+            }
+          *c++ = '-';
+        }
+      if (a >= ALINE+50 || i >= M && j >= N)
+        { *a = *b = *c = '\0';
+          printf("\n%5d",50*lines++);
+          for (b = ALINE+10; b <= a; b += 10)
+            printf("    .    :");
+          if (b <= a+5)
+            printf("    .");
+          printf("\n     %s\n     %s\n     %s\n",ALINE,CLINE,BLINE);
+          a = ALINE;
+          b = BLINE;
+          c = CLINE;
+        }
+    }
+
+    return 0;
+}
+
diff --git a/src/bap/llin.h b/src/bap/llin.h
new file mode 100644
index 0000000..3b29774
--- /dev/null
+++ b/src/bap/llin.h
@@ -0,0 +1,62 @@
+/* A PACKAGE FOR SEQUENCE COMPARISON WITH AFFINE WEIGHTS:
+     Gene Myers, Dept. of Computer Science, U. of Arizona 85721 (10/3/87)
+
+#define NMAX <integer>
+
+    NMAX is a compilation constant giving the maximum input sequence length.
+    It is to be adjusted according to available memory.
+
+
+int DIFF(A,B,M,N,W,G,H,S) int M,N; char A[],B[]; int W[][128],G,H; int S[];
+
+    DIFF compares sequence A[1..M] with sequence B[1..N] and returns the
+    minimum conversion cost.  Costs are determined by the parameters W, G,
+    and H.  W[128][128] is an array giving replacement costs for each pair of
+    ASCII characters, e.g. W['a']['b'] is the cost of replacing 'a' by 'b'.
+    Be sure to set W['a']['a'] to zero if exact matches are to accrue no cost.
+    The cost of a k-symbol indel is the affine function G+Hk.
+
+    DIFF also has the side-effect of placing an encoding of an optimal
+    conversion in an integer array S[0..M+N-1] supplied by the caller.
+    The sequence of integers S[0], S[1], S[2], ... gives the editing
+    operations in a left-to-right conversion where integers encode
+    operations as follows:
+         0 => replace
+        -k => delete k symbols
+        +k => insert k symbols.
+    The script is guaranteed to have the properties:
+      (1)  Inserts are never followed by inserts.
+      (2)  Deletes are never followed by deletes or inserts.
+      (3)  A replacement followed by a k-gap is always preferred
+           to a k-gap followed by a replacement in the event that
+           both have the same cost.
+
+    DIFF returns -1.0 if NMAX isn't large enough.
+
+
+int DISPLAY(A,B,M,N,S) int M,N; char A[],B[]; int S[];
+
+    DISPLAY places on the standard output a display of the alignment
+    implied by the conversion S computed in the call DIFF(A,B,M,N,?,?,?,S).
+    For example:
+
+              0    .    :    .    :    .    :    .    :    .    :
+               ggcgtttcataccggcgagga  ctagagatcccagatgcagcctcgata
+               !-!!!!||||!!!!!!!!!!|--!!!!!|!!|!!||||!!-!!!!!!!!!
+               g cgttcataaccggcgaggtacctagacattcccagagc gcctcgata
+          
+             50    .    :    .    :    .
+               taggaagaa tc agcaacgatcggcatg
+               !|!||!!!!-!!-!!!!!!!!-!!|!-!!
+               tggacagaaatcgagcaacga cgac tg
+*/
+
+#ifdef BIGMEM
+#define NMAX 30000
+#else
+#define NMAX  3000
+#endif
+
+extern int   DIFF();
+extern int   DISPLAY();
+
diff --git a/src/bap/makefile-alpha b/src/bap/makefile-alpha
new file mode 100644
index 0000000..7cf135b
--- /dev/null
+++ b/src/bap/makefile-alpha
@@ -0,0 +1,577 @@
+# Makefile for staden X programs (Alpha version)
+# ---------------------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   bap  sap  sip   splitp1 splitp2 splitp3 sethelp convert_project
+LPROGS=           nipl  pip             sipl
+XPROGS=     xmep xnip  xpip  xbap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC = .
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)/alpha-binaries
+TEDSRC=$(SRC)/../ted
+TEDBIN=$(STADENBIN)
+OSPSRC=$(SRC)/osp-bits
+OSPBIN=$(OSPSRC)/alpha-binaries
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)/alpha-binaries
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(SRC) -I$(STADENSRC) -I$(TEDSRC) -I$(OSPSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+CC= cc
+CFLAGS= -g -std1 -ieee_with_inexact $(INCLUDES)
+F77= f77
+FFLAGS= -g -C
+
+FLIBS= -lfor -lutil -lUfor -lots -lFutil -lm -lfor -lutil -lUfor -lots -lFutil -ldnet_stub
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(XLIBS) $(FLIBS)
+
+LD = f77
+XLD = cc
+LDFLAGS = -g -non_shared
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+$(TEDBIN)/%.o : $(TEDSRC)/%.f
+$(TEDBIN)/%.o : $(TEDSRC)/%.c
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	$(STADENBIN)/seqlibsubs.o
+
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	$(STADENBIN)/postscript.o\
+	$(STADENBIN)/Graph.o\
+	$(STADENBIN)/plotLog.o\
+	$(STADENBIN)/help.o\
+	$(STADENBIN)/dialogues.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/xmenu.o\
+	$(STADENBIN)/locks.o\
+	$(STADENBIN)/mcspec.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o
+
+COBJS=\
+	$(STADENBIN)/main.o\
+	$(CCORE)
+
+COBJS2=\
+	xbapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	$(STADENBIN)/fmain.o\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	$(STADENBIN)/postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS)
+
+XCOMMONOBJS2=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS2)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	$(STADENBIN)/gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	$(STADENBIN)/mep.o\
+	$(STADENBIN)/mepsub.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/xmepSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	$(STADENBIN)/nip.o\
+	$(STADENBIN)/patternn.a\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/xnipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	$(STADENBIN)/nipl.o\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+
+PIP=\
+	$(STADENBIN)/pip.o\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/patternp.a\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/xpipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	$(STADENBIN)/pipl.o\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	$(TEDBIN)/seq.o\
+	$(TEDBIN)/seqIOABI.o\
+	$(TEDBIN)/seqIOALF.o\
+	$(TEDBIN)/seqIOSCF.o\
+	$(TEDBIN)/seqIOPlain.o\
+	$(TEDBIN)/opp.o\
+	$(TEDBIN)/fpoint.o\
+	$(TEDBIN)/mach-io.o
+
+TMANOBJS=\
+	$(STADENBIN)/tman_main.o\
+	$(STADENBIN)/tman_display.o\
+	$(STADENBIN)/tman_traceDisplay.o\
+	$(STADENBIN)/tman_basesDisplay.o\
+	$(STADENBIN)/tman_context.o\
+	$(STADENBIN)/tman_gadgets.o\
+	$(STDTEDFILES)
+
+OSPOBJS=\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/paramIOX.o\
+	$(OSPBIN)/paramIO.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	$(GNULIB)
+
+XHACK=\
+	$(STADENBIN)/SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	$(STADENBIN)/xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	$(STADENBIN)/undo.o\
+	$(STADENBIN)/Sheet.o\
+	$(STADENBIN)/select.o\
+	$(STADENBIN)/extend.o\
+	$(STADENBIN)/searchUtils.o\
+	edMenu.o\
+	oligo.o\
+	oligocom.o\
+	subclone.o\
+	myparams.o\
+	$(STADENBIN)/trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	$(STADENBIN)/tagEditor.o\
+	$(STADENBIN)/tagdbparse.o\
+	tagU2.o\
+	$(STADENBIN)/tagU1.o
+
+BAP=\
+	bap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	actf.o\
+	dstrand.o\
+	oligo_sel.o\
+	llin.o\
+	guk.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+BAPOBJS=\
+	$(BAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o\
+	$(STADENBIN)/conEdit.o\
+	tagU2.o\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	baponly.o\
+	oligocom.o
+
+XBAPOBJS=\
+	$(BAP)\
+	$(XCOMMONOBJS2)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)\
+	$(OSPOBJS)
+
+bap:	$(BAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(BAPOBJS) $(LFLAGS)
+
+xbap:	$(XBAPOBJS)
+	$(XLD) $(LDFLAGS) -o $@ $(XBAPOBJS) $(XLFLAGS)
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/xsapSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	$(STADENBIN)/sapf.o\
+	$(STADENBIN)/dbsyscommon.o\
+	$(STADENBIN)/dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	$(STADENBIN)/sip.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/xsipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	$(STADENBIN)/sipl.o\
+	$(STADENBIN)/dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	$(STADENBIN)/sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(SCOMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	$(STADENBIN)/splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	$(STADENBIN)/splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	$(STADENBIN)/splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(NPROGS) $(XPROGS) $(LPROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
diff --git a/src/bap/makefile-dec b/src/bap/makefile-dec
new file mode 100644
index 0000000..644db35
--- /dev/null
+++ b/src/bap/makefile-dec
@@ -0,0 +1,576 @@
+# Makefile for staden X programs (Ultrix mips version)
+# ------------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   bap  sap  sip   splitp1 splitp2 splitp3 sethelp convert_project
+LPROGS=           nipl  pip             sipl
+XPROGS=     xmep xnip  xpip  xbap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+TEDSRC=$(SRC)/../ted
+TEDBIN=$(STADENBIN)
+OSPSRC=$(SRC)/osp-bits
+OSPBIN=$(OSPSRC)
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I/usr/include/mit -I$(SRC) -I$(STADENSRC) -I$(TEDSRC) -I$(OSPSRC) -I$(MISCSRC)
+
+#
+# Compilers and flags
+#
+# debug on
+CC= c89
+CFLAGS= -common $(INCLUDES)
+F77= f77
+FFLAGS= -g -C
+LD= c89
+LDFLAGS= -g
+
+FLIBS= -lfor -lutil -lUfor -li -lots -lm
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS) 
+
+# For stdargs on the Sun
+#GNULIB = /usr/local/lib/gcc-gnulib
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+$(TEDBIN)/%.o : $(TEDSRC)/%.f
+$(TEDBIN)/%.o : $(TEDSRC)/%.c
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	$(STADENBIN)/seqlibsubs.o
+
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	$(STADENBIN)/postscript.o\
+	$(STADENBIN)/Graph.o\
+	$(STADENBIN)/plotLog.o\
+	$(STADENBIN)/help.o\
+	$(STADENBIN)/dialogues.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/xmenu.o\
+	$(STADENBIN)/locks.o\
+	$(STADENBIN)/mcspec.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o
+
+COBJS=\
+	$(STADENBIN)/main.o\
+	$(CCORE)
+
+COBJS2=\
+	xbapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	$(STADENBIN)/fmain.o\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	$(STADENBIN)/postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS)
+
+XCOMMONOBJS2=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS2)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	$(STADENBIN)/gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	$(STADENBIN)/mep.o\
+	$(STADENBIN)/mepsub.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	$(STADENBIN)/nip.o\
+	$(STADENBIN)/patternn.a\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	$(STADENBIN)/nipl.o\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+
+PIP=\
+	$(STADENBIN)/pip.o\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/patternp.a\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	$(STADENBIN)/pipl.o\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	$(TEDBIN)/seq.o\
+	$(TEDBIN)/seqIOABI.o\
+	$(TEDBIN)/seqIOALF.o\
+	$(TEDBIN)/seqIOSCF.o\
+	$(TEDBIN)/seqIOPlain.o\
+	$(TEDBIN)/opp.o\
+	$(TEDBIN)/fpoint.o\
+	$(TEDBIN)/mach-io.o
+
+TMANOBJS=\
+	$(STADENBIN)/tman_main.o\
+	$(STADENBIN)/tman_display.o\
+	$(STADENBIN)/tman_traceDisplay.o\
+	$(STADENBIN)/tman_basesDisplay.o\
+	$(STADENBIN)/tman_context.o\
+	$(STADENBIN)/tman_gadgets.o\
+	$(STDTEDFILES)
+
+OSPOBJS=\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/paramIOX.o\
+	$(OSPBIN)/paramIO.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o
+
+XHACK=\
+	$(STADENBIN)/SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	$(STADENBIN)/xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	$(STADENBIN)/undo.o\
+	$(STADENBIN)/Sheet.o\
+	$(STADENBIN)/select.o\
+	$(STADENBIN)/extend.o\
+	$(STADENBIN)/searchUtils.o\
+	edMenu.o\
+	oligo.o\
+	oligocom.o\
+	subclone.o\
+	myparams.o\
+	$(STADENBIN)/trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	$(STADENBIN)/tagEditor.o\
+	$(STADENBIN)/tagdbparse.o\
+	tagU2.o\
+	$(STADENBIN)/tagU1.o
+
+BAP=\
+	bap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	actf.o\
+	dstrand.o\
+	oligo_sel.o\
+	llin.o\
+	guk.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+BAPOBJS=\
+	$(BAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o\
+	$(STADENBIN)/conEdit.o\
+	tagU2.o\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	baponly.o\
+	oligocom.o\
+	$(MISCBIN)/misc.a
+
+XBAPOBJS=\
+	$(BAP)\
+	$(XCOMMONOBJS2)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)\
+	$(OSPOBJS)
+
+bap: $(BAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(BAPOBJS) $(LFLAGS)
+
+xbap: $(XBAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XBAPOBJS) $(XLFLAGS)
+
+convert_project: $(STADENBIN)/cvt.o
+	$(CC) $(CFLAGS) -o $@ $(STADENBIN)/cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	$(STADENBIN)/sapf.o\
+	$(STADENBIN)/dbsyscommon.o\
+	$(STADENBIN)/dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	$(STADENBIN)/sip.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	$(STADENBIN)/sipl.o\
+	$(STADENBIN)/dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	$(STADENBIN)/sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(SCOMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	$(STADENBIN)/splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	$(STADENBIN)/splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	$(STADENBIN)/splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: bap xbap
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
diff --git a/src/bap/makefile-sgi b/src/bap/makefile-sgi
new file mode 100644
index 0000000..1721c90
--- /dev/null
+++ b/src/bap/makefile-sgi
@@ -0,0 +1,582 @@
+# Makefile for staden X programs (Silicon Graphics version)
+# ---------------------------------------------------------
+
+SHELL=/bin/sh
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   bap  sap  sip   splitp1 splitp2 splitp3 sethelp convert_project
+LPROGS=           nipl  pip             sipl
+XPROGS=     xmep xnip  xpip  xbap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+TEDSRC=$(SRC)/../ted
+TEDBIN=$(STADENBIN)
+OSPSRC=$(SRC)/osp-bits
+OSPBIN=$(OSPSRC)
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(SRC) -I$(STADENSRC) -I$(TEDSRC) -I$(OSPSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+CC= cc
+CFLAGS= -g -ansi -prototypes $(INCLUDES)
+F77= f77
+FFLAGS= -g -c -old_rl
+
+FLIBS= -lF77 -lU77 -lm -lI77 -lisam
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(XLIBS) $(FLIBS)
+LDFLAGS= -g
+
+LD = cc
+LDFLAGS = -g 
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+$(TEDBIN)/%.o : $(TEDSRC)/%.f
+$(TEDBIN)/%.o : $(TEDSRC)/%.c
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	$(STADENBIN)/seqlibsubs.o
+
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	$(STADENBIN)/postscript.o\
+	$(STADENBIN)/Graph.o\
+	$(STADENBIN)/plotLog.o\
+	$(STADENBIN)/help.o\
+	$(STADENBIN)/dialogues.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/xmenu.o\
+	$(STADENBIN)/locks.o\
+	$(STADENBIN)/mcspec.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o
+
+COBJS=\
+	$(STADENBIN)/main.o\
+	$(CCORE)
+
+COBJS2=\
+	xbapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	$(STADENBIN)/fmain.o\
+	$(STADENBIN)/seeme-sg.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	$(STADENBIN)/postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	$(STADENBIN)/seeme-sg.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS)
+
+XCOMMONOBJS2=\
+	$(STADENBIN)/seeme-sg.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS2)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	$(STADENBIN)/gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	$(STADENBIN)/mep.o\
+	$(STADENBIN)/mepsub.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/xmepSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	$(STADENBIN)/nip.o\
+	$(STADENBIN)/patternn.a\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/xnipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	$(STADENBIN)/nipl.o\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+
+PIP=\
+	$(STADENBIN)/pip.o\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/patternp.a\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/xpipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	$(STADENBIN)/pipl.o\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	$(TEDBIN)/seq.o\
+	$(TEDBIN)/seqIOABI.o\
+	$(TEDBIN)/seqIOALF.o\
+	$(TEDBIN)/seqIOSCF.o\
+	$(TEDBIN)/seqIOPlain.o\
+	$(TEDBIN)/opp.o\
+	$(TEDBIN)/fpoint.o\
+	$(TEDBIN)/mach-io.o
+
+TMANOBJS=\
+	$(STADENBIN)/tman_main.o\
+	$(STADENBIN)/tman_display.o\
+	$(STADENBIN)/tman_traceDisplay.o\
+	$(STADENBIN)/tman_basesDisplay.o\
+	$(STADENBIN)/tman_context.o\
+	$(STADENBIN)/tman_gadgets.o\
+	$(STDTEDFILES)
+
+OSPOBJS=\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/paramIOX.o\
+	$(OSPBIN)/paramIO.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	$(GNULIB)
+
+XHACK=\
+	$(STADENBIN)/SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	$(STADENBIN)/xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	$(STADENBIN)/undo.o\
+	$(STADENBIN)/Sheet.o\
+	$(STADENBIN)/select.o\
+	$(STADENBIN)/extend.o\
+	$(STADENBIN)/searchUtils.o\
+	edMenu.o\
+	oligo.o\
+	oligocom.o\
+	subclone.o\
+	myparams.o\
+	$(STADENBIN)/trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	$(STADENBIN)/tagEditor.o\
+	$(STADENBIN)/tagdbparse.o\
+	tagU2.o\
+	$(STADENBIN)/tagU1.o
+
+BAP=\
+	bap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	actf.o\
+	dstrand.o\
+	oligo_sel.o\
+	llin.o\
+	guk.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+BAPOBJS=\
+	$(BAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o\
+	$(STADENBIN)/conEdit.o\
+	tagU2.o\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	baponly.o\
+	oligocom.o
+
+XBAPOBJS=\
+	$(BAP)\
+	$(XCOMMONOBJS2)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)\
+	$(OSPOBJS)
+
+bap:	$(BAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(BAPOBJS) $(LFLAGS)
+
+xbap:	$(XBAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XBAPOBJS) $(XLFLAGS)
+
+convert_project: $(STADENBIN)/cvt.o
+	$(CC) $(CFLAGS) -o $@ $(STADENBIN)/cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/xsapSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	$(STADENBIN)/sapf.o\
+	$(STADENBIN)/dbsyscommon.o\
+	$(STADENBIN)/dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	$(STADENBIN)/sip.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/xsipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	$(STADENBIN)/sipl.o\
+	$(STADENBIN)/dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	$(STADENBIN)/sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(SCOMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	$(STADENBIN)/splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	$(STADENBIN)/splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	$(STADENBIN)/splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(NPROGS) $(XPROGS) $(LPROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
diff --git a/src/bap/makefile-solaris b/src/bap/makefile-solaris
new file mode 100644
index 0000000..2c7bc8a
--- /dev/null
+++ b/src/bap/makefile-solaris
@@ -0,0 +1,592 @@
+# Makefile for staden X programs (Sun version)
+# --------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   bap  sap  sip   splitp1 splitp2 splitp3 sethelp convert_project
+LPROGS=           nipl  pip             sipl
+XPROGS=     xmep xnip  xpip  xbap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+TEDSRC=$(SRC)/../ted
+TEDBIN=$(STADENBIN)
+OSPSRC=$(SRC)/osp-bits
+OSPBIN=$(OSPSRC)
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(SRC) -I$(STADENSRC) -I$(TEDSRC) -I$(OSPSRC) -I$(MISCSRC) -I/usr/openwin/include
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(INCLUDES)
+F77= f77
+FFLAGS= -ansi -g -Nx500 #-C
+
+LD	= f77
+SPRO	= /opt/SUNWspro/SC2.0.1
+LDFLAGS	= -Bstatic -g \
+	-L/opt/cygnus-sol2-1.1/lib/gcc-lib/sparc-sun-solaris2/cygnus-2.0.2 \
+	-L$(SPRO) -L/usr/ccs/lib -L/usr/ucblib -L/usr/openwin/lib
+LIBS	= -lm -lucb -lelf -Bdynamic -ldl -Bstatic -lsocket -lnsl -lintl -lgcc
+FLIBS	= -lV77 -lF77 -lM77
+XLIBS	= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS	= $(FLIBS) $(LIBS)
+XLFLAGS	= $(FLIBS) $(XLIBS) $(LIBS)
+
+# For stdargs on the Sun when not using gcc
+#GNULIB = /usr/local/lib/gcc-gnulib
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+$(TEDBIN)/%.o : $(TEDSRC)/%.f
+$(TEDBIN)/%.o : $(TEDSRC)/%.c
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	$(STADENBIN)/seqlibsubs.o
+
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	$(STADENBIN)/postscript.o\
+	$(STADENBIN)/Graph.o\
+	$(STADENBIN)/plotLog.o\
+	$(STADENBIN)/help.o\
+	$(STADENBIN)/dialogues.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/xmenu.o\
+	$(STADENBIN)/locks.o\
+	$(STADENBIN)/mcspec.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o
+
+COBJS=\
+	$(STADENBIN)/main.o\
+	$(CCORE)
+
+COBJS2=\
+	xbapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	$(STADENBIN)/fmain.o\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	$(STADENBIN)/postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS)
+
+XCOMMONOBJS2=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS2)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	$(STADENBIN)/gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	$(STADENBIN)/mep.o\
+	$(STADENBIN)/mepsub.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/mepmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/xmepSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	$(STADENBIN)/nip.o\
+	$(STADENBIN)/patternn.a\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/nipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/xnipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	$(STADENBIN)/nipl.o\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+
+PIP=\
+	$(STADENBIN)/pip.o\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/patternp.a\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/pipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/xpipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	$(STADENBIN)/pipl.o\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	$(TEDBIN)/seq.o\
+	$(TEDBIN)/seqIOABI.o\
+	$(TEDBIN)/seqIOALF.o\
+	$(TEDBIN)/seqIOSCF.o\
+	$(TEDBIN)/seqIOPlain.o\
+	$(TEDBIN)/opp.o\
+	$(TEDBIN)/fpoint.o\
+	$(TEDBIN)/mach-io.o
+
+TMANOBJS=\
+	$(STADENBIN)/tman_main.o\
+	$(STADENBIN)/tman_display.o\
+	$(STADENBIN)/tman_traceDisplay.o\
+	$(STADENBIN)/tman_basesDisplay.o\
+	$(STADENBIN)/tman_context.o\
+	$(STADENBIN)/tman_gadgets.o\
+	$(STDTEDFILES)
+
+OSPOBJS=\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/paramIOX.o\
+	$(OSPBIN)/paramIO.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	$(GNULIB)
+
+XHACK=\
+	$(STADENBIN)/SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	$(STADENBIN)/xsapConEdit.o\
+	contigEditor.o\
+	$(STADENBIN)/edUtils.o\
+	$(STADENBIN)/undo.o\
+	$(STADENBIN)/Sheet.o\
+	$(STADENBIN)/select.o\
+	$(STADENBIN)/extend.o\
+	$(STADENBIN)/searchUtils.o\
+	edMenu.o\
+	oligo.o\
+	oligocom.o\
+	subclone.o\
+	myparams.o\
+	$(STADENBIN)/trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	$(STADENBIN)/tagEditor.o\
+	$(STADENBIN)/tagdbparse.o\
+	tagU2.o\
+	$(STADENBIN)/tagU1.o
+
+BAP=\
+	bap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	actf.o\
+	dstrand.o\
+	oligo_sel.o\
+	llin.o\
+	guk.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+BAPOBJS=\
+	$(BAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o\
+	$(STADENBIN)/conEdit.o\
+	tagU2.o\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	baponly.o\
+	oligocom.o
+
+XBAPOBJS=\
+	$(BAP)\
+	$(XCOMMONOBJS2)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)\
+	$(OSPOBJS)
+
+bap: $(BAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(BAPOBJS) $(LFLAGS)
+
+xbap: $(XBAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XBAPOBJS) $(XLFLAGS)
+
+convert_project: $(STADENBIN)/cvt.o
+	$(CC) $(CFLAGS) -o $@ $(STADENBIN)/cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/xsapSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	$(STADENBIN)/sapf.o\
+	$(STADENBIN)/dbsyscommon.o\
+	$(STADENBIN)/dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	$(STADENBIN)/sip.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/sipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/xsipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	$(STADENBIN)/sipl.o\
+	$(STADENBIN)/dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	$(STADENBIN)/sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(SCOMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	$(STADENBIN)/splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	$(STADENBIN)/splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	$(STADENBIN)/splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(NPROGS) $(XPROGS) $(LPROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
diff --git a/src/bap/makefile-sun b/src/bap/makefile-sun
new file mode 100644
index 0000000..a7e1ae1
--- /dev/null
+++ b/src/bap/makefile-sun
@@ -0,0 +1,594 @@
+# Makefile for staden X programs (Sun version)
+# --------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   bap  sap  sip   splitp1 splitp2 splitp3 sethelp convert_project
+LPROGS=           nipl  pip             sipl
+XPROGS=     xmep xnip  xpip  xbap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+TEDSRC=$(SRC)/../ted
+TEDBIN=$(STADENBIN)
+OSPSRC=$(SRC)/osp-bits
+OSPBIN=$(OSPSRC)
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(SRC) -I$(STADENSRC) -I$(TEDSRC) -I$(OSPSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -DNOMEMMOVE $(GCCWARNINGS) $(INCLUDES)
+# KFLAGS - for compiling non ANSI C modules
+KFLAGS= -g $(INCLUDES)
+F77= f77
+FFLAGS= -ansi -g -C -Nx500
+#FFLAGS= -ansi -O -Nx500
+# if using gcc-2.x then link with it
+LD= gcc
+LDFLAGS= -g
+
+FLIBS= -lF77 -lI77 -lU77 -lm
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+# cc -Bstatic, gcc-2.x -static, otherwise nothing
+LFLAGS= -static $(FLIBS)
+XLFLAGS= $(LFLAGS) $(XLIBS)
+
+# For stdargs on the Sun when not using gcc
+#GNULIB = /usr/local/lib/gcc-gnulib
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+$(TEDBIN)/%.o : $(TEDSRC)/%.f
+$(TEDBIN)/%.o : $(TEDSRC)/%.c
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	$(STADENBIN)/seqlibsubs.o
+
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	$(STADENBIN)/postscript.o\
+	$(STADENBIN)/Graph.o\
+	$(STADENBIN)/plotLog.o\
+	$(STADENBIN)/help.o\
+	$(STADENBIN)/dialogues.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/xmenu.o\
+	$(STADENBIN)/locks.o\
+	$(STADENBIN)/mcspec.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o
+
+COBJS=\
+	$(STADENBIN)/main.o\
+	$(CCORE)
+
+COBJS2=\
+	xbapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	$(STADENBIN)/fmain.o\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	$(STADENBIN)/postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS)
+
+XCOMMONOBJS2=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/xspec.o\
+	$(STADENBIN)/subs89.a\
+	$(COBJS2)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	$(STADENBIN)/gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	$(STADENBIN)/mep.o\
+	$(STADENBIN)/mepsub.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/mepmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(STADENBIN)/xmepSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	$(STADENBIN)/nip.o\
+	$(STADENBIN)/patternn.a\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/nipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(STADENBIN)/xnipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	$(STADENBIN)/nipl.o\
+	$(STADENBIN)/patternnc.a\
+	$(STADENBIN)/anals89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+
+PIP=\
+	$(STADENBIN)/pip.o\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/patternp.a\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/pipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(STADENBIN)/xpipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	$(STADENBIN)/pipl.o\
+	$(STADENBIN)/patternpc.a\
+	$(STADENBIN)/analps89.a\
+	$(STADENBIN)/asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	$(TEDBIN)/seq.o\
+	$(TEDBIN)/seqIOABI.o\
+	$(TEDBIN)/seqIOALF.o\
+	$(TEDBIN)/seqIOSCF.o\
+	$(TEDBIN)/seqIOPlain.o\
+	$(TEDBIN)/opp.o\
+	$(TEDBIN)/fpoint.o\
+	$(TEDBIN)/mach-io.o
+
+TMANOBJS=\
+	$(STADENBIN)/tman_main.o\
+	$(STADENBIN)/tman_display.o\
+	$(STADENBIN)/tman_traceDisplay.o\
+	$(STADENBIN)/tman_basesDisplay.o\
+	$(STADENBIN)/tman_context.o\
+	$(STADENBIN)/tman_gadgets.o\
+	$(STDTEDFILES)
+
+OSPOBJS=\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/paramIOX.o\
+	$(OSPBIN)/paramIO.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	$(GNULIB)
+
+XHACK=\
+	$(STADENBIN)/SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	$(STADENBIN)/xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	$(STADENBIN)/undo.o\
+	$(STADENBIN)/Sheet.o\
+	$(STADENBIN)/select.o\
+	$(STADENBIN)/extend.o\
+	$(STADENBIN)/searchUtils.o\
+	edMenu.o\
+	oligo.o\
+	oligocom.o\
+	subclone.o\
+	myparams.o\
+	$(STADENBIN)/trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	$(STADENBIN)/tagEditor.o\
+	$(STADENBIN)/tagdbparse.o\
+	tagU2.o\
+	$(STADENBIN)/tagU1.o
+
+BAP=\
+	bap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	actf.o\
+	dstrand.o\
+	oligo_sel.o\
+	llin.o\
+	guk.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+BAPOBJS=\
+	$(BAP)\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o\
+	$(STADENBIN)/conEdit.o\
+	tagU2.o\
+	$(OSPBIN)/analysis.o\
+	$(OSPBIN)/our_allo.o\
+	$(OSPBIN)/get_scores.o\
+	$(OSPBIN)/utils.o\
+	mess.o\
+	baponly.o\
+	oligocom.o
+
+XBAPOBJS=\
+	$(BAP)\
+	$(XCOMMONOBJS2)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)\
+	$(OSPOBJS)
+
+bap: $(BAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(BAPOBJS) $(LFLAGS)
+
+xbap: $(XBAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XBAPOBJS) $(XLFLAGS)
+
+convert_project: $(STADENBIN)/cvt.o
+	$(CC) $(CFLAGS) -o $@ $(STADENBIN)/cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(STADENBIN)/xsapSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	$(STADENBIN)/sapf.o\
+	$(STADENBIN)/dbsyscommon.o\
+	$(STADENBIN)/dbsysold.o\
+	$(STADENBIN)/asubs89.a\
+	$(STADENBIN)/plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	$(STADENBIN)/sapmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	$(STADENBIN)/sip.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/sipmen.o\
+	$(COMMONOBJS)\
+	$(STADENBIN)/pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(STADENBIN)/xsipSpec.o\
+	$(XCOMMONOBJS)\
+	$(STADENBIN)/plX.o\
+	$(STADENBIN)/textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	$(STADENBIN)/sipl.o\
+	$(STADENBIN)/dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	$(STADENBIN)/sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(SCOMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	$(STADENBIN)/splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	$(STADENBIN)/splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	$(STADENBIN)/splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(NPROGS) $(XPROGS) $(LPROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
diff --git a/src/bap/mess.c b/src/bap/mess.c
new file mode 100644
index 0000000..27f295e
--- /dev/null
+++ b/src/bap/mess.c
@@ -0,0 +1,61 @@
+/* 
+  Program Name: mess
+  File name: mess.c
+  Purpose: put user information to the screen for the text version
+         of osp--maybe error messages or any kind of user information
+	 or question.
+
+  Last Update: Tuesday 13 August 1991
+
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Change Log:
+
+  Modified to work with xdap output window
+*/
+
+/* --- includes --- */
+#include "defn.h" /* IMPORT: stdio.h */
+#include "textOutput.h" /* IMPORT: UpdateTextOutput */
+#include <stdio.h>
+#include <stdarg.h>
+/* int       vfprintf (FILE *stream, const char *format, va_list arglist); */
+
+/* ---- Exports ---- */
+void messagef(char *format, ...)
+{
+    va_list args;
+    va_start (args,format);
+    vfprintf (stdout, format, args);
+    va_end(args);
+    UpdateTextOutput();
+}
+
+void message(char *message)
+{
+  /* prints the message to the screen*/
+  fprintf(stdout,"%s",message);
+  UpdateTextOutput();
+  return;
+}
+
+void popupMessage(char *message)
+{
+  fprintf(stdout,"%s",message);
+  UpdateTextOutput();
+  return;
+}
+
+
+void popUpErrorMessage()
+{
+  fprintf(stdout,"ERROR: Memory allocation problem.\nPlease exit and restart this program\n");
+/* since this is the text version I want the program to exit if this
+   memory allocation error occurs*/
+  UpdateTextOutput();
+  sleep(10);
+
+  exit();
+
+}
+
diff --git a/src/bap/movgel.f b/src/bap/movgel.f
new file mode 100644
index 0000000..11c73d5
--- /dev/null
+++ b/src/bap/movgel.f
@@ -0,0 +1,95 @@
+      SUBROUTINE MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +GEL,FROM,TO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+C   Subroutine to move a gel from line from to line to
+C   Extended 22-5-91
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),FROM,TO
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMGEL*16,GEL(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      LOGICAL LEFTE,RIGHTE
+      EXTERNAL GCLIN,CHAINL
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+C
+C left end ?
+C
+      IF(LNBR(FROM).EQ.0) LEFTE = .TRUE.
+C
+C right end ?
+C
+      IF(RNBR(FROM).EQ.0) RIGHTE = .TRUE.
+C
+C if both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          RNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(LEFTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(RIGHTE) THEN
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no right neighbour and does not'
+          WRITE(KBOUT,*)'appear in a contig!'
+        ELSE
+          IF(RNBR(NCONTO).NE.FROM)THEN
+            WRITE(KBOUT,*)
+     +        'This gel has no right neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+          ELSE
+            RNBR(NCONTO) = TO
+            CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +        LNBR(NCONTO),RNBR(NCONTO))
+          END IF
+        END IF
+      END IF
+      RELPG(TO)=RELPG(FROM)
+      LNGTHG(TO)=LNGTHG(FROM)
+      LNBR(TO)=LNBR(FROM)
+      RNBR(TO)=RNBR(FROM)
+      CALL READW(IDEVW,FROM,GEL,MAXGEL)
+      CALL WRITEW(IDEVW,TO,GEL,MAXGEL)
+      CALL READN(IDEVN,FROM,NAMGEL)
+      CALL WRITEN(IDEVN,TO,NAMGEL)
+      CALL WRITER(IDEVR,TO,RELPG(TO),LNGTHG(TO),
+     +LNBR(TO),RNBR(TO))
+C   Do neighbours
+      IF(LNBR(FROM).NE.0) THEN
+        I=LNBR(FROM)
+        RNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      IF(RNBR(FROM).NE.0) THEN
+        I=RNBR(FROM)
+        LNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      CALL MOVTAG(FROM,TO)
+      END
diff --git a/src/bap/myparams.c b/src/bap/myparams.c
new file mode 100644
index 0000000..08fa7be
--- /dev/null
+++ b/src/bap/myparams.c
@@ -0,0 +1,232 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+
+#include "myparams.h"
+#define MAXWIDTH 50
+
+
+
+static void okCallback(Widget w, XtPointer status, XtPointer call_data)
+{
+    * (int *) status = 1;
+}
+static void cancelCallback(Widget w, XtPointer status, XtPointer call_data)
+{
+    * (int *) status = 2;
+}
+
+static void SourceChanged(Widget w, XtPointer i, XtPointer junk)
+{
+    XtRemoveAllCallbacks(w, XtNcallback);
+    *(int *)i = 1;
+}
+
+
+
+
+static Widget create_window(Widget parentWid, char *title, Field_entry *field_list, int field_entries,int *modified,Widget *fields, int *status)
+/*
+** This creates a popup widget that allows you to change the values of
+** fields
+*/
+{
+    Widget wid;
+    Widget form;
+    Widget label;
+    Widget box,    ok, cancel;
+    int maxlen; /* maximum length of label */
+    Cardinal nargs;
+    Arg args[10];
+    int i;
+    Position	x, y; 	 /* top-left hand corner of new widget */
+    Dimension	height;  /* height of parent widget */
+
+    /*
+    ** Determine the position of up and comming widget
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid, (Position) 0, (Position) height, &x, &y);
+  
+    y+=3; /* a suitable gap between widgets */
+
+    /*
+    ** Create popup shell
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    wid = XtCreatePopupShell("changeparam", transientShellWidgetClass, parentWid, args, nargs);
+
+    /*
+    ** Create main form
+    */
+    nargs = 0;
+    form = XtCreateManagedWidget("form", formWidgetClass, wid, args, nargs);
+
+    /*
+    ** Create title for form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, (title != NULL) ? title : "Change Parameters..."); nargs++;
+    label = XtCreateManagedWidget("label", labelWidgetClass, form, args, nargs);
+
+    maxlen = 0;
+    for (i=0; i < field_entries; i++) {
+	int len = strlen(field_list[i].field_name);
+	if (maxlen < len) maxlen = len;
+    }
+
+    /*
+    ** Create labels and buttons for each field entry
+    */
+    for (i=0; i < field_entries; i++) {
+	Widget fromVert = label;
+
+	char init_string[MAXWIDTH+1];
+
+	if (maxlen>MAXWIDTH) maxlen = MAXWIDTH;
+	sprintf(init_string,"%-*s",maxlen,field_list[i].field_name);
+	nargs = 0;
+	XtSetArg(args[nargs], XtNfromVert, fromVert); nargs++;
+	XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+	XtSetArg(args[nargs], XtNlabel, init_string); nargs++;
+	label = XtCreateManagedWidget("label", labelWidgetClass, form, args, nargs);
+
+	switch (field_list[i].field_type) {
+	case t_int :
+	    sprintf(init_string,"%d", * (int *) field_list[i].field_value);
+	    break;
+	case t_float :
+	    sprintf(init_string,"%f", * (float *) field_list[i].field_value);
+	    break;
+	case t_char:
+	    strncpy(init_string, field_list[i].field_value, MAXWIDTH);
+	    break;
+	default:
+	    strcpy(init_string, "** Unknown Type **");
+	    break;
+	}
+
+	nargs = 0;
+	XtSetArg(args[nargs], XtNfromHoriz, label); nargs++;
+	XtSetArg(args[nargs], XtNfromVert, fromVert); nargs++;
+        XtSetArg(args[nargs], XtNstring, init_string);     nargs++;
+	XtSetArg(args[nargs], XtNeditType, XawtextEdit);        nargs++;
+	XtSetArg(args[nargs], XtNwidth, 300);        nargs++;
+    	fields[i] = XtCreateManagedWidget("text",asciiTextWidgetClass, form, args, nargs);
+        XtAddCallback(XawTextGetSource(fields[i]), XtNcallback, SourceChanged, (XtPointer) &modified[i]);
+	my_translations(fields[i]);
+	XawTextSetInsertionPoint(fields[i], strlen(init_string));
+	modified[i] = 0;
+    }
+
+    /*
+    ** Create Exit Action Buttons
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, label); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+#define XtOrientHorizontal "horizontal"
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    box = XtCreateManagedWidget("box", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    ok = XtCreateManagedWidget("ok", commandWidgetClass, box, args, nargs);
+    XtAddCallback(ok, XtNcallback, okCallback, (XtPointer) status);
+    nargs = 0;
+    cancel = XtCreateManagedWidget("cancel", commandWidgetClass, box, args, nargs);
+    XtAddCallback(cancel, XtNcallback, cancelCallback, (XtPointer) status);
+    
+
+    return wid;
+}
+
+
+
+static void update_params(Widget *fields, Field_entry *field_list, int field_entries, int *modified)
+{
+    int i;
+    Arg args[10];
+    Cardinal nargs;
+
+    for (i=0; i < field_entries; i++) {
+	if (modified[i]) {
+	    char *new_value;
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNstring, &new_value); nargs++;
+	    XtGetValues(fields[i],args,nargs);
+	    
+	    switch(field_list[i].field_type) {
+	    case t_int:
+		* (int *) field_list[i].field_value = atoi(new_value);
+		break;
+	    case t_float:
+		* (float *) field_list[i].field_value = (float) atof(new_value);
+		break;
+	    case t_char:
+		strncpy(field_list[i].field_value, new_value, field_list[i].field_size-1);
+		field_list[i].field_value[field_list[i].field_size-1]='\0';
+		break;
+	    default:
+		break;
+	    }
+	}
+    }
+}
+
+void change_params(Widget parentWid, char *title, Field_entry *field_list, int field_entries)
+{
+
+    Widget wid;
+    int status;
+    Widget *fields;
+    int *modified;
+
+    /*
+    ** Allocate space for data
+    */
+    fields = (Widget *) malloc(field_entries * sizeof(Widget));
+    modified = (int *) malloc(field_entries * sizeof(int));
+
+    wid = create_window(parentWid,title,field_list,field_entries,modified,fields,&status);
+
+    XtPopup(wid,   XtGrabExclusive);
+
+    status = 0;
+
+    while (!status)
+    {
+        XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(wid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    XtPopdown(wid);
+
+    if (status == 1) update_params(fields, field_list, field_entries, modified);
+
+    free (fields);
+    free (modified);
+    XtDestroyWidget(wid);
+}
+
+
+
+
+
+
+
+
+
+
diff --git a/src/bap/myparams.h b/src/bap/myparams.h
new file mode 100644
index 0000000..f7e16b3
--- /dev/null
+++ b/src/bap/myparams.h
@@ -0,0 +1,26 @@
+#ifndef _myparams_h
+#define _myparams_h
+
+#include <sys/types.h>
+
+enum types {
+    t_unknown, t_int, t_char, t_float
+};
+
+
+typedef struct _field_entry{
+    char *field_name;
+    char *field_value;
+    int  field_type;
+    size_t  field_size;
+} Field_entry;
+
+
+
+
+extern void change_params(Widget parentWid, char *title, Field_entry *field_list, int field_entries);
+
+
+
+
+#endif /* _myparams_h */
diff --git a/src/bap/oligo.c b/src/bap/oligo.c
new file mode 100644
index 0000000..6238f75
--- /dev/null
+++ b/src/bap/oligo.c
@@ -0,0 +1,1699 @@
+/*
+ * File: oligo.c
+ * Version:
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *	   Hills Road
+ *	   Cambridge CB2 2QH
+ *	   United Kingdom
+ *
+ * Description: oligo selection module
+ *
+ * Created: 1991
+ * Updated: 6 November 1992
+ *	
+ * 6 November 1992
+ *	Changes for distribution
+ */
+#include <stdio.h>
+#include <stdlib.h> /* getenv & malloc */
+#include <string.h>
+
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+
+#include "edUtils.h"
+#include "myparams.h"
+#include "defn.h"
+#include "struct.h"
+#include "tagUtils.h"
+#include "oligo.h"
+#include "misc.h"
+#include "oligocom.h"
+
+/*
+ * Compilation modes:
+ *
+ * NEMATODE - Make some features switchable by the environment variable SUBCLONES
+ * VERBOSENESS - Allow for varying degress of verbose output, including debugging information
+ */
+#define NEMATODE
+#define VERBOSENESS
+
+/* nematode - the drink when you're not having a drink */
+int nematode;
+
+
+#include "subclone.h"
+
+/*
+ * Useful #defines
+ */
+#define FORWARDS  True
+#define BACKWARDS False
+extern void messagef(char *format, ...);
+
+/*
+ * Widgets and things
+ */
+static EdStruct *thisxx;   /* the current EdStruct */
+static int up = 0;
+static Widget oldFogieWid;
+static Widget oligoWid = NULL;
+static Widget form;
+static Widget label;
+
+static Widget bbox,    strand;
+static int strand_state = FORWARDS;
+
+static Widget cbox,    change;
+static Widget dbox,    find, next;
+static Widget template;
+static Widget ebox,    ok, quit;
+
+/*
+ * Current state of selection
+ */
+static int p;                          /* cursor position for selection */
+static int l,r;                        /* position of left and right ends of selection region */
+static int num_oligos;                 /* number of oligo selected last time */
+static int curr_oligo;                 /* number of current oligo being considered */
+static Boolean oligo_sense;            /* status of sense buttons when find-oligo button pressed */
+static int template_index;             /* gel number of current selected template */
+static char template_name[DB_NAMELEN+1]; /* gel name of current selected template */
+static char *consensus = NULL; 	       /* consensus for region used to select oligo*/
+
+/*
+ * Parameters for template selection
+ */
+#ifdef VERBOSENESS
+static int verbose = 1;			/* verbose output is required */
+char verbosity[10];                     /* space for string form of verbose */
+#define verbose_debug (verbose==2 || verbose==3)
+#define verbose_panic (verbose==3)
+#endif /*VERBOSENESS*/
+
+static int fwd_width = 40;		/* how far ahead search window should stretch */
+static int bkwd_width = 40;		/* how far back search window should stretch */
+
+static int def_insert_size = 1000;            /* How far from start of template oligo can be */
+static char filter[100] = "\\.[sSfFrR]1[^a-z]"; /* filter out templates from gel names */
+static char clonelib[100]; /* library of subclone information: initialised in initialise()*/
+static int ave_read_len = 400;         /* average read length */
+
+
+/*
+ * A few necessary forward declarations
+ */
+static int findOligos(EdStruct *xx, int sense);
+static void nextOligo(EdStruct *xx, int oligo, int sense);
+static int create_new_oligo_tag(EdStruct *xx, int oligo, int pos, int len, int sense);
+static void destroy_oligo_popup();
+static void destroy_temporary_tag(EdStruct *xx);
+static void display_template_details();
+
+
+/*********************************************************************************/
+/*
+ * Start of code proper
+ */
+
+
+/*
+ * Callback routines
+ */
+
+
+static void set_strand_state(Widget w, int strand_state)
+{
+    Arg args[2];
+    Cardinal nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, (strand_state==FORWARDS)?"------>":"<------"); nargs++;
+    XtSetValues(w,args,nargs);
+    
+}
+
+
+static void strandCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Change strand
+ */
+{
+    strand_state = (strand_state == FORWARDS)?BACKWARDS:FORWARDS;
+    set_strand_state(w,strand_state);
+
+    XtCallCallbacks(find, XtNcallback, (XtPointer) NULL);
+}
+
+
+
+
+
+
+/*
+ * Move data[] outside scope of following function - non ANSI to perform
+ * aggregate definitions inside a function
+ */
+Field_entry data_1[] = {
+    {"Search window bases ahead",  (char *)&fwd_width,    t_int,  sizeof(fwd_width)},
+    {"Search window bases back",   (char *)&bkwd_width,   t_int,  sizeof(bkwd_width)},
+    {"Default insert size",        (char *)&def_insert_size,   t_int,  sizeof(def_insert_size)},
+    {"Average read length",        (char *)&ave_read_len, t_int,  sizeof(ave_read_len)},
+    {"Filter to select templates", (char *)filter,        t_char, sizeof(filter)},
+    {"Cosmid subclone library",    (char *)clonelib,      t_char, sizeof(clonelib)},
+#ifdef VERBOSENESS
+    {"Verbose output?",            (char *)verbosity,     t_char, sizeof(verbosity)}
+#endif /*VERBOSENESS*/
+};
+
+static void changeMineCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Change oligo selection parameters
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Change my parameters\n");
+
+    /* Verbose output can be "yes", "no", "debug", or "panic"!!!! */
+    switch (verbose) {
+    case 0: strcpy(verbosity,"No"); break;
+    case 1: strcpy(verbosity,"Yes"); break;
+    case 2: strcpy(verbosity,"Debug"); break;
+    case 3: strcpy(verbosity,"Panic"); break;
+    }
+#endif /*VERBOSENESS*/
+
+    change_params((Widget)w,NULL,data_1,nematode?XtNumber(data_1):4);
+
+#ifdef VERBOSENESS
+    switch ( verbosity[0] ) {
+    case 'p': case 'P':
+	verbose = 3;
+	break;
+    case 'd': case 'D':
+	verbose = 2;
+	break;
+    case 'y': case 'Y':
+	verbose = 1;
+	break;
+    default:
+	verbose = 0;
+	break;
+    }
+#endif /*VERBOSENESS*/
+
+    XtCallCallbacks(find, XtNcallback, (XtPointer) NULL);
+}
+
+
+
+
+
+static void changeParamsCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Change oligo selection parameters
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Change selection parameters\n");
+#endif /*VERBOSENESS*/
+    osp_change_parameters(w,&prm,0/*params*/);
+
+    XtCallCallbacks(find, XtNcallback, (XtPointer) NULL);
+}
+
+
+
+
+
+static void changeWeightsCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Change oligo selection parameters
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Change selection weights\n");
+#endif /*VERBOSENESS*/
+    osp_change_parameters(w,&prm,1/*weights*/);
+
+    XtCallCallbacks(find, XtNcallback, (XtPointer) NULL);
+}
+
+
+
+
+
+
+static void informationCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Change oligo selection parameters
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Display information\n");
+#endif /*VERBOSENESS*/
+    messagef("%s\n",score_info);
+
+}
+
+
+
+
+
+
+static void findCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Search for oligo
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Find oligos\n");
+#endif /*VERBOSENESS*/
+
+    oligo_sense = strand_state;
+
+    (void) findOligos(thisxx,oligo_sense);
+
+    /*
+     * Reveal buttons depending on result
+     */
+    XawFormDoLayout(form,False);
+    if (num_oligos>0) {
+	XtManageChild(ok);
+	XtManageChild(template);
+    } else {
+	XtUnmanageChild(ok);
+	XtUnmanageChild(template);
+    }
+    if (num_oligos > 1)
+	XtManageChild(next);
+    else
+	XtUnmanageChild(next);
+    XawFormDoLayout(form,True);
+
+    curr_oligo = 0;
+
+    if (num_oligos>0)
+	nextOligo(thisxx,curr_oligo,oligo_sense);
+    else
+	destroy_temporary_tag(thisxx);
+}
+
+
+
+
+
+static void nextCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Search for oligo
+ */
+{
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Select next oligo\n");
+#endif /*VERBOSENESS*/
+    curr_oligo++;
+    if (curr_oligo+1 == num_oligos) {
+	XawFormDoLayout(form,False);
+	XtUnmanageChild(next);
+	XawFormDoLayout(form,True);
+    }
+
+    nextOligo(thisxx,curr_oligo,oligo_sense);
+}
+
+
+
+
+
+
+static void okCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Select current oligo,
+ * Create a tag for it and everything else...
+ */
+{
+    int i = curr_oligo;
+
+
+    /*
+     * To create a new oligo a valid template must have been specified
+     */
+    if (!template_index) {
+	message("A valid template has not been specified\n");
+	return;
+    }
+    if (oligo_sense == BACKWARDS) {
+	(void) create_new_oligo_tag(thisxx,i,
+				    r-OSP_RESULTS[i].end_position,
+				    OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1,
+				    oligo_sense);
+    } else {
+	(void) create_new_oligo_tag(thisxx,i,
+				    l+OSP_RESULTS[i].start_position,
+				    OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1,
+				    oligo_sense);
+    }
+
+    up = 0;
+
+    destroy_oligo_popup();
+
+}
+
+
+
+
+
+static void quitCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+ * Leave without selecting an oligo
+ */
+{
+    up = 0;
+
+    destroy_oligo_popup();
+}
+
+
+
+
+
+/*****************************************************************************************/
+
+static void initialise()
+/*
+ * Initialise miscellaneous variables
+ */
+{
+    char *subclones;
+
+    /* clonelib = subclones:$SUBCLONES */
+    if (is_file("subclones"))
+	strcpy(clonelib,"subclones");
+    else {
+	subclones = (char *) getenv("SUBCLONES");
+	if (subclones == NULL)
+	    clonelib[0] = '\0';
+	else if (is_file(subclones))
+	    strcpy(clonelib,subclones);
+	else
+	    clonelib[0] = '\0';
+    }
+#ifdef NEMATODE
+    nematode=(getenv("SUBCLONES")!=NULL);
+#else /*NEMATODE*/
+    nematode=1;
+#endif /*NEMATODE*/
+
+
+}
+
+
+
+static void destroy_temporary_tag(EdStruct *xx)
+/*
+ * Flag temporary tag as deleted
+ */
+{
+    if (DBgetTags(xx,0) != NULL) {
+	_delete_tag(xx, 0/*consensus*/, DBgetTags(xx,0));
+    }
+}
+
+
+
+
+
+static void create_temporary_tag(EdStruct *xx,int pos, int len)
+/*
+ * Create a temporary tag in the consensus to show position of oligo under consideration
+ */
+{
+    char *defComment = "*** Temporary Annotation ***\n";
+
+    tagStruct *tempTag;
+    tempTag = DBgetTags(xx,0);
+
+    /*
+     * Create a new tag and insert it with comment into bimbo falix
+     */
+    if (tempTag == NULL) {
+	tempTag = newTag();
+	strncpy(tempTag->tagrec.type.c,"OLIG",4);
+	tempTag->newcomment = (char *)malloc(strlen(defComment)+1);
+	strcpy(tempTag->newcomment,defComment);
+	tempTag->flags =
+	    TAG_INSERTED |
+            TAG_LENGTH_CHANGED |
+	    TAG_POSITION_CHANGED |
+	    TAG_TYPE_CHANGED |
+	    TAG_COMMENT_IN_MEMORY;
+	insertTag(xx,0/*consensus*/,tempTag);
+    }
+
+    tempTag->tagrec.position = pos;
+    tempTag->tagrec.length = len;
+
+    /*
+     * Jiggle about if tag is off screen
+     */
+    if (xx->displayPos > pos ||	xx->displayPos + xx->displayWidth < pos + len ) 
+	xx->displayPos = (pos+pos+len-xx->displayWidth)/2;
+
+    redisplaySequences (xx,
+			xx->namesWid,
+			xx->sequencesWid,
+			xx->displayPos,
+			xx->displayWidth);
+
+}
+
+
+
+
+static int gel_ok(EdStruct *xx, int pos, int len, int seq)
+/*
+ * If gel overlaps region in contig,
+ * return true
+ */
+{
+    return (pos >= DBgetRelPos(xx,seq) && pos+len <= DBgetRelPos(xx,seq)+DBgetLength(xx,seq));
+}
+
+
+
+
+
+static int find_gel_for_oligo(EdStruct *xx, int pos, int len, int sense, int trySeq)
+/*
+ * Find a gel number for this oligo. Try
+ * (A) gel trySeq first, (if valid)
+ * (B) then any in the correct sense,
+ * (C) otherwise any at all??
+ * The position here is a position in the contig
+ */
+{
+    int i;
+#ifdef VERBOSENESS
+    if (verbose_debug) messagef("Trying to find gel for oligo: pos=%d, len=%d, sense=%d\n",pos,len,sense);
+#endif /*VERBOSENESS*/
+
+    /**A**/
+    if (trySeq > 0) {
+#ifdef VERBOSENESS
+	if (verbose_debug) messagef("find_gel_for_oligo: Trying gel %s (%d)...\n",DBgetName(xx,trySeq),trySeq);
+#endif /*VERBOSENESS*/
+	if ( gel_ok(xx,pos,len,trySeq) ) {
+#ifdef VERBOSENESS
+	    if (verbose_debug) messagef("Using gel %s (%d)...\n",DBgetName(xx,trySeq),trySeq);
+#endif /*VERBOSENESS*/
+	    return trySeq;
+	}
+    }
+
+    /**B**/
+#ifdef VERBOSENESS
+    if (verbose_debug) message("find_gel_for_oligo: Trying gels in correct sense\n");
+#endif /*VERBOSENESS*/
+    for (i=1; i<=xx->DB_gelCount ; i++) {
+	if (DBgetComp(xx,i) == COMPLEMENTED && sense == BACKWARDS ||
+	    DBgetComp(xx,i) == UNCOMPLEMENTED && sense == FORWARDS) {
+	    if (gel_ok(xx,pos,len,i)) {
+#ifdef VERBOSENESS
+		if (verbose_debug) messagef("Using gel %s (%d)...\n",DBgetName(xx,trySeq),trySeq);
+#endif /*VERBOSENESS*/
+		return i;
+	    }
+	}
+    }
+
+    /**C**/
+#ifdef VERBOSENESS
+    if (verbose_debug) message("find_gel_for_oligo: Trying any gel\n");
+#endif /*VERBOSENESS*/
+    for (i=1; i<=xx->DB_gelCount ; i++) {
+	    if (gel_ok(xx,pos,len,i)) {
+#ifdef VERBOSENESS
+		if (verbose_debug) messagef("Using gel %s (%d)...\n",DBgetName(xx,trySeq),trySeq);
+#endif /*VERBOSENESS*/
+		return i;
+	    }
+    }
+
+#ifdef VERBOSENESS
+    if (verbose_debug) message("find_gel_for_oligo: Failed to find any suitable gel\n");
+#endif /*VERBOSENESS*/
+    return 0;
+
+}	 
+
+
+
+
+
+static char *generate_oligo_comment(int oligo)
+{
+    char s[200];
+    char seq[100];
+    char *c;
+
+    int pos,len;
+#ifdef VERBOSENESS
+    if (verbose_debug) message("creating comment for oligo:\n");
+#endif /*VERBOSENESS*/
+    pos = OSP_RESULTS[oligo].start_position;
+    len = OSP_RESULTS[oligo].end_position - pos + 1;
+    strncpy(seq,&consensus[pos],len);
+    seq[len]='\0';
+
+    
+    sprintf(s,"serial#=\ntemplate=%s\nsequence=%s\nflags=\n",template_name,seq);
+#ifdef VERBOSENESS
+    if (verbose_debug) messagef("(%s)\n",s);
+#endif /*VERBOSENESS*/
+
+    c = TAG_MALLOC(strlen(s)+1);
+    strcpy(c,s);
+    return c;
+}
+
+
+
+
+
+static int create_new_oligo_tag(EdStruct *xx, int oligo, int pos, int len, int sense)
+/*
+ * This routine creates a new oligo tag, prior to leaving the
+ * oligo selection window
+ */
+{
+    tagStruct *new_oligo;
+    int seq;
+
+    seq = find_gel_for_oligo(xx,pos,len,sense,template_index);
+
+    if (! seq) {
+	messagef("NO SUITABLE GEL FOR THIS OLIGO TAG POSITION %d LENGTH %d\n",pos,len);
+	return 1;
+    }
+
+    /*
+     * Create a new tag and insert it with comment into bimbo falix
+     */
+    new_oligo = newTag();
+    strncpy(new_oligo->tagrec.type.c,"OLIG",4);
+    new_oligo->tagrec.position = normalisePos(xx,seq,pos-DBgetRelPos(xx,seq)+1,len);
+    new_oligo->tagrec.length = len;
+    new_oligo->flags = TAG_INSERTED | TAG_LENGTH_CHANGED | TAG_POSITION_CHANGED | TAG_TYPE_CHANGED;
+    insertTag(xx,seq,new_oligo);
+
+    new_oligo->flags |= TAG_COMMENT_IN_MEMORY;
+    new_oligo->newcomment = generate_oligo_comment(oligo);
+
+    /*set modified tag flag */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+
+    return 0;
+}
+
+
+
+
+
+int oligo_comp_pos(const void *pa, const void *pb)
+/*
+ * we need to sort the oligos according to how far they are from the
+ * cursor position (p)
+ */
+{
+    const OSP_Results *a = (OSP_Results *)pa;
+    const OSP_Results *b = (OSP_Results *)pb;
+#define absval(A) ( ( (A)<0 ) ? (-(A)) : (A) )
+    return absval((a->start_position+a->end_position)/2 - p) >
+	absval((b->start_position+b->end_position)/2 - p);
+
+}
+
+
+
+
+
+
+
+static int findOligos(EdStruct *xx, int sense)
+/*
+ * Find suitable oligos using OSP with current parameter settings.
+ * Return ok status
+ */
+{
+    int seq = xx->cursorSeq;
+    int pos = xx->cursorPos;
+    int position; /* in contig */
+    int contigLength; /* length of contig */
+    int consensusLength;
+    int ok; /* return status from osp_analyse */
+    int i;
+
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Finding oligos...\n");
+#endif /*VERBOSENESS*/
+
+    position =  positionInContig(xx,seq,pos);
+    contigLength = DBgetLength(xx,0/*consensus*/);
+
+
+
+
+    /*
+     * Conceptually we select off consensus.
+     * Determine consensus around point...
+     *    the oligo will be selected from this region
+     */
+    if (sense == FORWARDS) {
+	l = (position > bkwd_width)
+	    ? position - bkwd_width : 1;
+	r = (position + fwd_width < contigLength)
+	    ? position + fwd_width : contigLength;
+    } else {
+	l = (position > fwd_width)
+	    ? position - fwd_width : 1;
+	r = (position + bkwd_width < contigLength)
+	    ? position + bkwd_width : contigLength;
+    }
+    p = bkwd_width; /* save position of cursor in consensus fragment */
+
+    consensusLength = (r-l)+1;
+
+    /* allocate space for consensus */
+    if (consensus != NULL) free(consensus);
+    consensus = (char *) malloc ( consensusLength );
+
+    DBcalcConsensus (xx,l, consensusLength, consensus, BOTH_STRANDS);
+    if (sense == BACKWARDS) {
+	/* we need to complement the consensus */
+	/*
+	 * Use Rodger's routines in subs89.f
+	 */
+	sqcom_(consensus, &consensusLength, consensusLength);
+	sqrev_(consensus, &consensusLength, consensusLength);
+    }
+
+#ifdef VERBOSENESS
+    if (verbose_debug) messagef("Cursor position = %d\n",pos);
+    if (verbose_debug) messagef("Seqence = %s (%d)\n",DBgetName(xx,seq),seq);
+    if (verbose_debug) messagef("Consensus for region %d-%d = (%s)\n",l,r,consensus);
+    if (verbose_debug) messagef("Sense = %s\n", (sense==FORWARDS)?"forward":"reverse");
+#endif /*VERBOSENESS*/
+
+    /*
+     * A cludge to get around bug in osp_analyse
+     */
+    for (i=0;i<MAX_NUM_OLIGOS; i++) OSP_RESULTS[i].score = 0.0;
+
+    ok=osp_analyse(OSP_RESULTS,consensus,&prm,screens,score_info);
+
+#ifdef VERBOSENESS
+    if (verbose) messagef("osp_analyse returned with status %d\n", ok);
+#endif /*VERBOSENESS*/
+
+    /*
+     * Determine number of suitable oligos
+     */
+    if (ok)
+	for (num_oligos=0; OSP_RESULTS[num_oligos].score>0.0; num_oligos++);
+    else
+	num_oligos=0;
+#ifdef VERBOSENESS
+    if (verbose) messagef("%d suitable oligos found\n",num_oligos);
+#endif /*VERBOSENESS*/
+
+    /* sort? */
+    if (num_oligos>1) {
+	/* yes - sort by position relative to cursor */
+	qsort(OSP_RESULTS,num_oligos,sizeof(OSP_Results),oligo_comp_pos);
+    }
+
+    return (ok);
+}
+
+
+
+
+
+static int check_sense(EdStruct *xx, int i/*template number*/, int sense)
+/*
+ * templates that are in the wrong sense should fail
+ * returns 0 - ok,   1 - wrong sense
+ */
+{
+    return (DBgetComp(xx,i) == COMPLEMENTED && sense == FORWARDS ||
+	    DBgetComp(xx,i) == UNCOMPLEMENTED && sense == BACKWARDS);
+}
+
+
+
+
+static int check_5prime(EdStruct *xx, int i/*template number*/, int sense, int pos, int len)
+/*
+ * templates that have their 5' end after the oligo position are not usable
+ */
+{
+    int relpos = DBgetRelPos(xx,i);
+    int length = DBgetLength(xx,i);
+
+    if (sense == FORWARDS)
+	return (relpos > pos);
+    else
+	return (relpos+length < pos + len);
+
+}
+
+
+
+
+
+
+
+
+
+static int check_template_suitability(EdStruct *xx, int i, int sense, int pos, int len)
+/*
+ *
+ */
+{
+    int relpos = DBgetRelPos(xx,i);
+    int length = DBgetLength(xx,i);
+    int near_dist;
+    CloneInfo info;
+    char mtdname[4];
+
+    /*
+     * Get mtd name from gel reading name
+     */
+    strncpy(mtdname, DBgetGelName(xx,i), 3);
+    mtdname[3] = '\0';
+
+    /*
+     * Get size information from subclones file
+     */
+    if (read_subclone_info(clonelib,mtdname,&info))
+	near_dist = def_insert_size;
+    else
+	near_dist = info.range_from;
+
+
+    /* reject ones that are not near the required interval */
+    if (sense == FORWARDS)
+	return (pos - relpos > (near_dist-ave_read_len));
+    else
+	return (relpos+length-pos-len > (near_dist-ave_read_len));
+
+}
+
+
+
+
+
+
+
+
+
+static int check_template_for_oligo(EdStruct *xx,int pos, int len, int sense, int i/*template no*/)
+/*
+ * The template number here is actually the number of an existing gel for
+ * that template
+ *
+ * The template:
+ *   1. must be in the appropriate sense.
+ *   2. must exists in our template library.
+ *   3. must be found "near" the interval required.
+ *   4. need not have a sequenced gel over the interval required,
+ *	but must be past the 5' end.
+ */
+{
+#define reject_wrong_sense    1
+#define reject_mapped_after   2
+#define reject_not_close      3
+#define reject_not_in_library 4
+
+    /* reject ones that have a sense reverse to the one required */
+    if (check_sense(xx,i,sense)) {
+#ifdef VERBOSENESS
+	if (verbose_panic) messagef(" %s rejected because it is in the wrong sense\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+	return reject_wrong_sense;
+    }
+
+
+    /* reject ones with 5' end after our oligo position */
+    if (check_5prime(xx,i,sense,pos,len)){
+#ifdef VERBOSENESS
+	if (verbose_panic) messagef(" %s rejected because template starts after oligo primer\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+	return reject_mapped_after;
+    }
+
+
+    /*
+     * Check suitability of template according to position
+     */
+    if (check_template_suitability(xx,i,sense,pos,len)){
+#ifdef VERBOSENESS
+	if (verbose_panic) messagef(" %s rejected because template isn't near oligo primer position\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+	return reject_not_close;
+    }
+
+
+    /*
+     * check that template exists in our template library
+     */
+    if (0 /* ?? */) {
+#ifdef VERBOSENESS
+	if (verbose_panic) messagef(" %s rejected because template not in template library\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+	return reject_not_in_library;
+    }
+
+#ifdef VERBOSENESS
+    if (verbose_debug) messagef(" %s selected\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+
+    return 0;
+}
+
+
+
+
+
+
+
+static int filter_template(char *template_name)
+/*
+ * We are interested in templates rather than simply template gel sequences
+ *	Reject all but *.s1 *.f1 on this basis
+ *	We are lucky in that we have a rigid nomenclature for templates
+ *	Other people will do things differently.
+ */
+{
+    char *error_message;
+
+    error_message = (char *) re_comp(filter);
+    if ( error_message != NULL ) {
+	messagef("  Error using re_comp: %s\n",error_message);
+	return 0;
+    }
+    return (re_exec(template_name) == 1);
+}
+
+
+
+
+
+
+
+
+static int *find_templates_for_oligo (EdStruct *xx, int pos, int len, int sense)
+/*
+ * Once an oligo has been found for the consensus at position pos, length len,
+ * search for a suitable template within the contig.
+ *
+ * Use existing gel readings as a basis to find a suitable template.
+ * As there can be several readings from one template, filter out only the
+ * initial reads for consideration
+ */
+{
+    static int *templateList = NULL;
+    int i;
+    int count;
+
+    if (templateList == NULL)
+	templateList = (int *) malloc ( xx->DB_gelCount * sizeof(int));
+
+#ifdef VERBOSENESS
+    if (verbose_debug) message("Finding template for oligo:\n");
+    if (verbose_debug) messagef("position = %d, length = %d, forward sense=%d\n",pos,len,sense);
+#endif /*VERBOSENESS*/
+
+    count = 0;
+    if (sense==BACKWARDS) {
+	for(i=1;i<=xx->DB_gelCount;i++) {
+	    char *name;
+	    name = DBgetGelName(xx,i);
+
+	    /* only nematode extensions */
+	    if (nematode && ! filter_template(name)) {
+#ifdef VERBOSENESS
+		if (verbose_debug) messagef(" %s rejected because template doesn't match filter\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+		continue;
+	    }
+	    
+	    
+	    if (! check_template_for_oligo(xx,pos,len,sense,i))
+		templateList[count++] = i;
+	    
+	}
+    } else {
+	int ind;
+	ind=posToIndex(xx,pos); /* optimise a bit */
+	if(!ind) ind=xx->DB_gelCount;
+	for(;ind>0;ind--) {
+	    char *name;
+	    i = xx->DBorder[ind];
+	    name = DBgetGelName(xx,i);
+	    
+	    if (nematode && ! filter_template(name)) {
+#ifdef VERBOSENESS
+		if (verbose_debug) messagef(" %s rejected because template doesn't match filter\n",DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+		continue;
+	    }
+	    
+	    
+	    if (! check_template_for_oligo(xx,pos,len,sense,i))
+		templateList[count++] = i;
+	    
+	}
+    }
+    
+    templateList[count] = 0;
+
+    return templateList;
+}
+
+
+
+
+
+
+
+#ifdef OBSELETE
+static void trim_suffix(char * name)
+/*
+ * Remove tail from first dot onwards
+ */
+{
+    char *suffix;
+
+    /* truncate at suffix */
+    suffix = strchr(name,'.');
+    if (suffix != NULL)
+	*suffix = '\0';
+}
+#endif /*OBSELETE*/
+
+
+
+
+
+
+
+static void get_template_name(char *name, EdStruct *xx, int i)
+/*
+ * Get the template name for gel number 'i'
+ */
+{
+    (void) strcpy(name, DBgetGelName(xx,i) );
+#ifdef OBSELETE
+    trim_suffix(name);
+#endif /*OBSELETE*/
+}
+
+
+
+
+
+
+
+
+#ifdef OBSELETE
+static int score_reading_quality(EdStruct *xx, int seq)
+/*
+ * Returns a score for the quality of the read, higher the better.
+ * Ideally it would look at the traces, but for now
+ * will assume length of read is a good meter for this
+ */
+{
+    return DBgetLength(xx,seq);
+}
+#endif /*OBSELETE*/
+
+
+
+static int score_template(EdStruct *xx, int seq)
+/*
+ * Score this template
+ */
+{
+#ifdef OBSELETE
+    return score_reading_quality(xx,seq);
+#else /*OBSELETE*/
+    return 1; /* this should force the first one to be chosen */
+#endif /*OBSELETE*/
+}
+
+
+
+
+
+
+
+static void set_default_template(EdStruct *xx, int *templateList)
+/*
+ * Pick a default template from the list of available templates
+ */
+{
+    template_index = 0;
+    template_name[0] = '\0';
+    if (templateList[0]) {
+	int i;
+	int score, high_score;
+	high_score = 0;
+	/* search */
+	for (i=0; templateList[i]; i++) {
+	    score = score_template(xx, templateList[i]);
+	    if (high_score < score) {
+		high_score = score;
+		template_index = templateList[i];
+	    }
+	}
+	/* set template name */
+	get_template_name(template_name, xx, template_index);
+    }
+
+}
+
+
+
+
+
+#ifdef OBSELETE
+static int strcmp_ignorecase(char *a, char *b)
+{
+    for ( ; tolower(*a) == tolower(*b); a++, b++)
+	if (*a == '\0') return 0;
+    return tolower(*a) - tolower(*b);
+}
+#endif
+
+
+
+static int strncmp_ignorecase(char *a, char *b, int n)
+{
+    for ( ; n && tolower(*a) == tolower(*b); a++, b++, n--)
+	if (*a == '\0') return 0;
+    if (!n)
+	return 0;
+    else
+	return tolower(*a) - tolower(*b);
+}
+
+
+
+
+
+static void check_template_name(EdStruct *xx, char *template_name, int pos, int len, int sense)
+/*
+ * Check the template name is valid
+ *   + that it exists
+ *   + that is in the correct sense etc etc
+ * If it's not valid, set template_index to be 0
+ */
+{
+    int i;
+    int found_index;
+    int template_len;
+    int found;
+    char this[DB_NAMELEN];
+
+    strcpy(this,template_name);
+#ifdef OBSELETE
+    trim_suffix(this);
+#endif /*OBSELETE*/
+
+    /*
+     * Check template_name exists
+     */
+    found = 0;
+    template_len= strlen(template_name);
+
+    for (i=1; i<=xx->DB_gelCount ; i++) {
+	char *name;
+
+	name = DBgetGelName(xx,i);
+
+	if (nematode && ! filter_template(name))
+	    continue;
+
+	if (strncmp_ignorecase(template_name, DBgetGelName(xx,i), template_len) == 0) {
+	    found++;
+	    found_index = i;
+#ifdef VERBOSENESS
+	    if (verbose_debug) messagef("%s matches %s\n",template_name,DBgetName(xx,i));
+#endif /*VERBOSENESS*/
+	}
+
+    }
+
+
+    template_index = 0;
+
+    if (! found)
+	messagef("template %s not found\n", template_name);
+    else {
+	if (found > 1) {
+#ifdef VERBOSENESS
+	    if (verbose_debug) messagef("template %s found, but is not unique\n",
+					template_name);
+#endif /*VERBOSENESS*/
+	} else {
+	    if (check_sense(xx, found_index, sense)) {
+		messagef("template %s in the wrong sense\n", template_name);
+		return;
+	    }
+
+	    if (check_5prime(xx,found_index,sense,pos,len)) {
+		messagef("template %s starts after oligo position\n", template_name);
+		return;
+	    }
+
+	    template_index = found_index;
+	}
+    }
+
+
+}
+
+
+
+
+
+
+
+/*
+ * Move data[] outside scope of following function - non ANSI to perform
+ * aggregate definitions inside a function
+ */
+Field_entry data_2[] = {
+    {"Template",  (char *)template_name,  t_char,  sizeof(template_name)},
+};
+
+static void menuSelectCallback(Widget w, XtPointer p_i, XtPointer junk)
+/*
+ * A template has been selected off the menu
+ * Deal with it
+ */
+{
+    int i = (int) p_i;
+
+    /*
+     * i is either -1 (other) or the gel number corresponding to a template
+     */
+    if (i<0) {
+#ifdef VERBOSENESS
+	if (verbose_debug) message("Menu: Other selected\n");
+#endif /*VERBOSENESS*/
+
+	template_name[0]='\0';
+
+	change_params((Widget)oligoWid,"Please specify...",data_2,
+		      XtNumber(data_2));
+	template_index = 0;
+	if (oligo_sense == BACKWARDS) {
+	    check_template_name(thisxx, template_name,
+				r-OSP_RESULTS[curr_oligo].end_position,
+				OSP_RESULTS[curr_oligo].end_position-OSP_RESULTS[curr_oligo].start_position+1,
+				oligo_sense);
+	} else {
+	    check_template_name(thisxx, template_name,
+				l+OSP_RESULTS[curr_oligo].start_position,
+				OSP_RESULTS[curr_oligo].end_position-OSP_RESULTS[curr_oligo].start_position+1,
+				oligo_sense);
+	}
+
+    } else {
+	template_index = i;
+	get_template_name(template_name,thisxx,i);
+#ifdef VERBOSENESS
+	if (verbose_debug) messagef("Menu: %s selected\n", template_name);
+#endif /*VERBOSENESS*/
+    }
+
+    display_template_details();
+
+}
+
+
+
+
+
+
+static void create_template_menu(EdStruct *xx, int *list)
+/*
+ * Create new list of available templates
+ */
+{
+    static Widget menuWid = NULL;
+    int i;
+    Widget entryWid;
+    Arg args[2];
+    Cardinal nargs;
+
+    if (menuWid != NULL)
+	XtDestroyWidget(menuWid);
+
+
+    /*
+     * Create the menu parent widget
+     */
+    menuWid = XtCreatePopupShell("templateMenu", simpleMenuWidgetClass, template,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; list[i]; i++)
+    {
+	char name[DB_NAMELEN];
+
+#ifdef VERBOSENESS
+	if (verbose_debug) messagef("Creating menu for %s\n",DBgetName(xx,list[i]));
+#endif /*VERBOSENESS*/
+
+	/*
+	 * Prepare clone name
+	 */
+	get_template_name(name, xx, list[i]);
+
+	nargs = 0;
+	XtSetArg(args[nargs], XtNlabel, name); nargs++;
+	entryWid = XtCreateManagedWidget("entry", smeBSBObjectClass,
+					 menuWid, args, nargs);
+	XtAddCallback(entryWid, XtNcallback, menuSelectCallback,
+		      (XtPointer) list[i]);
+    }
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Other"); nargs++;
+    entryWid = XtCreateManagedWidget("entry", smeBSBObjectClass,
+				     menuWid, args, nargs);
+    XtAddCallback(entryWid, XtNcallback, menuSelectCallback,
+		  (XtPointer) -1);
+
+}
+
+
+
+
+
+
+static void display_template_details()
+/*
+ * Display importand template details
+ */
+{
+    if (template_index > 0) {
+	messagef("Template: %s\n", template_name);
+    } else {
+	message("Template: none chosen\n");
+    }
+
+}
+
+
+
+
+
+static void display_oligo_details(EdStruct *xx, int sense, int oligo)
+/*
+ * Display important oligo details
+ *   a. sequence
+ *   b. position, length
+ *   c. osp score, gc, tm
+ */
+{
+    char seq[100];
+    int pos,len;
+    int i;
+    char *a;
+    int consensus_start, consensus_end;
+
+    pos = OSP_RESULTS[oligo].start_position;
+    len = OSP_RESULTS[oligo].end_position - pos + 1;
+
+    /*
+     * Get consensus sequence with context
+     */
+    a = seq;
+    for (i=3;i;i--) *a++ = '.';
+    for (i=(pos>2)?2:pos;i;i--)
+	*a++ = tolower(consensus[pos-i]);
+    strncpy(a,&consensus[pos],len); a+=len;
+    for (i=0;i<2&&consensus[pos+len+i];i++)
+	*a++ = tolower(consensus[pos+len+i]);
+    for (i=3;i;i--) *a++ = '.';
+    *a++ = '\0';
+
+
+    /*
+     * Positions in contig
+     */
+    if (sense == BACKWARDS) {
+	consensus_start = r-OSP_RESULTS[oligo].end_position;
+	consensus_end   = consensus_start + len - 1;
+    } else {
+	consensus_start = l+OSP_RESULTS[oligo].start_position;
+	consensus_end   = consensus_start + len - 1;
+    }
+
+    messagef("Oligo: %s\n",seq);
+    messagef("\
+Primer # %2d                                      PRIMER-SELF      PRIMER-OTHER\n\
+5' end   3' end    length  Score G+C(%%)  Tm      3'  Internal     3'   Internal\n\
+%6d   %6d      %4d    %4.1f  %4.1f  %4.1f    %4.1f    %4.1f     %4.1f    %4.1f\n",
+	    oligo+1,
+	    consensus_start,
+	    consensus_end,
+	    len,
+	    OSP_RESULTS[oligo].score,
+	    OSP_RESULTS[oligo].gc * 100.0,
+	    OSP_RESULTS[oligo].tm,
+	    OSP_RESULTS[oligo].psI_score,
+	    OSP_RESULTS[oligo].ps3_score,
+	    OSP_RESULTS[oligo].poI_score,
+	    OSP_RESULTS[oligo].po3_score);
+
+}
+
+
+
+
+
+
+static void nextOligo(EdStruct *xx, int oligo, int sense)
+/*
+ * We cycle through the oligo list
+ * curr-oligo gives the current oligo entry under consideration
+ */
+{
+    /*
+     * Hilight position of next oligo in contig editor temporarily
+     */
+    int *templateList;
+    int i = oligo;
+
+    /*
+     * Print out information on oligos
+     */
+#ifdef VERBOSENESS
+    if (verbose_debug) messagef("************** %d ************\n",i);
+    if (verbose_debug) messagef("stp %d, endp %d, score %f,  gc %f, tm %f, psI %f ps3 %f poI  %f po3 %f\n",
+	    OSP_RESULTS[i].start_position,
+	    OSP_RESULTS[i].end_position,
+	    OSP_RESULTS[i].score,
+	    OSP_RESULTS[i].gc,
+	    OSP_RESULTS[i].tm,
+	    OSP_RESULTS[i].psI_score,
+	    OSP_RESULTS[i].ps3_score,
+	    OSP_RESULTS[i].poI_score,
+	    OSP_RESULTS[i].po3_score);
+#endif /*VERBOSENESS*/
+    display_oligo_details(xx,sense,i);
+
+    if (sense == BACKWARDS) {
+	/*
+	 * Convert position returned from oligo selection to
+	 * position in contig
+	 */
+	templateList =
+	    find_templates_for_oligo(xx, 
+				    r-OSP_RESULTS[i].end_position,
+				    OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1,
+				    sense);
+	create_temporary_tag(xx,
+			     r-OSP_RESULTS[i].end_position,
+			     OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1);
+    } else {
+	/*
+	 * Convert position returned from oligo selection to
+	 * position in contig
+	 */
+	templateList =
+	    find_templates_for_oligo(xx, 
+				    l+OSP_RESULTS[i].start_position,
+				    OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1,
+				    sense);
+	create_temporary_tag(xx,
+			     l+OSP_RESULTS[i].start_position,
+			     OSP_RESULTS[i].end_position-OSP_RESULTS[i].start_position+1);
+    }
+
+
+    set_default_template(xx,templateList);
+    create_template_menu(xx, templateList);
+
+    display_template_details();
+
+}
+
+
+/*****************************************************************************************/
+/*
+ * External routines
+ */
+
+
+
+void createOligoWidget(Widget parentWid)
+/*
+ * Prtend to create it now
+ */
+{
+    oldFogieWid = parentWid;
+}
+
+
+
+
+
+static void create_oligo_wid(Widget parentWid)
+/*
+ * Create the window for oligo selection
+ * This routine should be called just once, in the initialisation phase of xdap
+ */
+{
+    Cardinal nargs;
+    Arg args[10];
+    Position	x, y; 	 /* top-left hand corner of new widget */
+    Dimension	height;  /* height of parent widget */
+
+    /*
+     * Determine the position on the screen for this widget
+     */
+#define fromVertWid (thisxx->edWid)
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(fromVertWid, args, nargs);
+    XtTranslateCoords(fromVertWid, (Position) 0, (Position) height, &x, &y);
+
+    /*
+     * Create popup shell
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    oligoWid = XtCreatePopupShell("oligo", transientShellWidgetClass, parentWid, args, nargs);
+
+    /*
+     * Create main form
+     */
+    nargs = 0;
+    form = XtCreateManagedWidget("form", formWidgetClass, oligoWid, args, nargs);
+
+    /*
+     * Create title for form
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Select Oligos and Templates"); nargs++;
+    label = XtCreateManagedWidget("label", labelWidgetClass, form, args, nargs);
+
+#define XtOrientHorizontal "horizontal"
+
+    /*
+     * Create buttons for oligo sense
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, label); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    bbox = XtCreateManagedWidget("bbox", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Direction:"); nargs++;
+    (void) XtCreateManagedWidget("label", labelWidgetClass, bbox, args, nargs);
+    nargs = 0;
+    strand = XtCreateManagedWidget("strand", commandWidgetClass,bbox,args,nargs);
+    XtAddCallback(strand, XtNcallback, strandCallback, (XtPointer) NULL);
+    set_strand_state(strand,strand_state);
+
+
+    /*
+     * Create Change Parameter Buttons
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, bbox); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    cbox = XtCreateManagedWidget("cbox", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Parameters"); nargs++;
+    change = XtCreateManagedWidget("change", commandWidgetClass, cbox, args, nargs);
+    XtAddCallback(change, XtNcallback, changeMineCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Oligo Selection Parameters"); nargs++;
+    change = XtCreateManagedWidget("change", commandWidgetClass, cbox, args, nargs);
+    XtAddCallback(change, XtNcallback, changeParamsCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Oligo Selection Weights"); nargs++;
+    change = XtCreateManagedWidget("change", commandWidgetClass, cbox, args, nargs);
+    XtAddCallback(change, XtNcallback, changeWeightsCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Display Selection Information"); nargs++;
+    change = XtCreateManagedWidget("information", commandWidgetClass, cbox, args, nargs);
+    XtAddCallback(change, XtNcallback, informationCallback, (XtPointer) NULL);
+
+    /*
+     * Create action button
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, cbox); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    XtSetArg(args[nargs], XtNresizable, True); nargs++;
+    dbox = XtCreateManagedWidget("dbox", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Find Oligos"); nargs++;
+    find = XtCreateManagedWidget("find", commandWidgetClass, dbox, args, nargs);
+    XtAddCallback(find, XtNcallback, findCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Select Next"); nargs++;
+    next = XtCreateManagedWidget("next", commandWidgetClass, dbox, args, nargs);
+    XtAddCallback(next, XtNcallback, nextCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Choose Template for this Oligo"); nargs++;
+    XtSetArg(args[nargs], XtNmenuName, "templateMenu"); nargs++;
+    template = XtCreateManagedWidget("template", menuButtonWidgetClass, dbox, args, nargs);
+
+
+    /*
+     * Exit action
+     */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, dbox); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    XtSetArg(args[nargs], XtNresizable, True); nargs++;
+    ebox = XtCreateManagedWidget("ebox", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Create Tag for this Oligo"); nargs++;
+    ok = XtCreateManagedWidget("ok", commandWidgetClass, ebox, args, nargs);
+    XtAddCallback(ok, XtNcallback, okCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Quit"); nargs++;
+    quit = XtCreateManagedWidget("quit", commandWidgetClass, ebox, args, nargs);
+    XtAddCallback(quit, XtNcallback, quitCallback, (XtPointer) NULL);
+}
+
+
+
+
+
+int invokeOligo(EdStruct *xx)
+/*
+ * Pop up the oligo selection window
+ */
+{
+    if (up) return 0;
+    thisxx = xx;
+
+    if (oligoWid == NULL) {
+	initialise();
+	osp_initialise();
+	create_oligo_wid(oldFogieWid);
+    }
+
+    XtUnmanageChild(ok);
+    XtUnmanageChild(next);
+    XtUnmanageChild(template);
+
+    XtPopup(oligoWid,   XtGrabNone);
+
+    up = 1;
+
+    /* find oligos */
+    XtCallCallbacks(find, XtNcallback, (XtPointer) NULL);
+
+    return 0;
+
+}
+
+
+
+
+
+
+
+int destroyOligo()
+/*
+ * Shut this baby down
+ */
+{
+    if (up)
+	XtCallCallbacks(quit, XtNcallback, (XtPointer) NULL);
+
+    return 0;
+}
+
+
+
+
+
+static void destroy_oligo_popup()
+/*
+ * Popdown the oligo popup window,
+ * free a few variables
+ * then relax
+ */
+{
+    EdStruct *xx = thisxx;
+
+    XtPopdown(oligoWid);
+
+    destroy_temporary_tag(xx);
+    free (consensus);
+    consensus = NULL;
+
+    redisplaySequences (xx,
+			xx->namesWid,
+			xx->sequencesWid,
+			xx->displayPos,
+			xx->displayWidth);
+}
+
+
diff --git a/src/bap/oligo.h b/src/bap/oligo.h
new file mode 100644
index 0000000..49539b2
--- /dev/null
+++ b/src/bap/oligo.h
@@ -0,0 +1,15 @@
+#ifndef _oligo_h
+#define _oligo_h
+
+#include "edUtils.h"
+#include <X11/Intrinsic.h>
+
+extern void createOligoWidget(Widget parentWid);
+extern int invokeOligo(EdStruct *xx);
+extern int destroyOligo();
+
+
+
+
+
+#endif /* _oligo_h */
diff --git a/src/bap/oligo_sel.c b/src/bap/oligo_sel.c
new file mode 100644
index 0000000..d00e1a4
--- /dev/null
+++ b/src/bap/oligo_sel.c
@@ -0,0 +1,378 @@
+/* #define DEBUG_OLIGO_SEL */
+
+#define SUBVERSION
+
+#include <stdio.h>
+#include <stdlib.h>
+#include "oligocom.h"
+#include "defn.h"
+#include "tagUtils.h"
+#include "edUtils.h"
+#include "fort.h"
+
+#define DSTR stdout
+
+#define MAXCOMLEN (100)
+
+/* Static global variables */
+static int_f last_gel;	/* remembers where we last were for		*/
+    			/* for efficiency (defaults leftmost)		*/
+static char primfilename[100];
+
+/* --- hooks --- */
+
+int insert_size() {
+    return 1000;
+}
+
+int avg_read_len() {
+    return 400;
+}
+
+/*
+ * Do the actual oligo selection.
+ * Returns the new offset to search from. (This takes into account the
+ * average gel reading length so that we do not finish we lots of suggested
+ * oligos next to each other.)
+ */
+int find_oligos(int_f offset,   int_f *relpg_p,  int_f *lngthg_p,
+		int_f *rnbr_p,	char *cons_p,    char  *comment,
+		char *oligo,    char *consensus, int_f *idevn_p,
+		char  sense,    FILE *outfile,   int_f olilen,
+		int_f olibak,   int_f *temnum,   char *datnam,
+		int_f olinum,	int_f lincon)
+{
+    int i, cur_gel, oligostart, choice, oligolen, oligoend;
+    int bestscore = 0, bestgel = 0, template = 0, score, ok, olis;
+    char gelname[DB_NAMELEN + 1];
+
+    /*
+     * Find oligo based on this position.
+     * We choose a region starting OLISTART back from here, extending
+     * for OLILEN bases.
+     */
+    olis = offset - (olilen + olibak);
+    if (olis < 1)
+	olis = 1;
+    memcpy(consensus, &cons_p[olis], olilen);
+    consensus[olilen+1] = '\0';
+    
+    for (i=0;i<MAX_NUM_OLIGOS; i++) OSP_RESULTS[i].score = 0.0;
+    ok = osp_analyse(OSP_RESULTS,consensus,&prm,screens,score_info);
+
+    if (ok == 0) {
+	return -1;
+    }
+
+#ifdef DEBUG_OLIGO_SEL
+    fprintf(DSTR, "osp_analyse() = %d\n", ok);
+#endif
+
+#ifdef DEBUG_OLIGO_SEL
+    i = 0;
+    while (OSP_RESULTS[i].score != 0) {
+	fprintf(DSTR, "Oligo found at %d, score = %f, '%.*s'\n",
+	       OSP_RESULTS[i].start_position + offset - (olilen+olibak),
+	       OSP_RESULTS[i].score,
+	       OSP_RESULTS[i].end_position -
+	       OSP_RESULTS[i].start_position + 1,
+	       &consensus[OSP_RESULTS[i].start_position]);
+	i++;
+    }
+#endif
+
+    /*
+     * Choose the first in the list (for the time being). This should be
+     * the oligo with the best score.
+     */
+
+    /* choice = 0; */
+    
+    oligostart = 0;
+    /* Choose the first 'olinum' oligos from the list */
+    for (choice = 0; choice < olinum; choice++) {
+
+	/*
+	 * In case we don't have as many oligos to chose from as the user
+	 * requested
+	 */
+	if (OSP_RESULTS[choice].score == 0) 
+	    return oligostart + avg_read_len();
+
+	memset(oligo, 0, olilen);
+	memcpy(oligo, &consensus[OSP_RESULTS[choice].start_position],
+	       OSP_RESULTS[choice].end_position - OSP_RESULTS[choice].
+	       start_position + 1);
+
+#ifdef DEBUG_OLIGO_SEL
+    fprintf(DSTR, "Choosing oligo '%s'\n", oligo);
+#endif
+
+	/* find sequences that could yield a valid primer */
+	cur_gel = last_gel;
+	oligostart = OSP_RESULTS[choice].start_position + offset -
+	    (olilen+olibak);
+	oligolen = OSP_RESULTS[choice].end_position
+	    - OSP_RESULTS[choice].start_position;
+
+#ifdef DEBUG_OLIGO_SEL
+	fprintf(DSTR, "oligo starts %d, ends %d\n",
+		oligostart, oligostart + oligolen);
+#endif
+
+	do {
+	    /* only look for positive directing sequences */
+	    if (lngthg_p[cur_gel] >= 0) {
+		/* stop when we've gone past our oligo */
+		if (relpg_p[cur_gel] >= oligostart)
+		    break;
+		/*
+		 * If the sequence ends past end of oligo then we have a
+		 * candidate template.
+		 */
+		if (relpg_p[cur_gel] + insert_size() - avg_read_len() 
+		    >= oligostart + oligolen) {
+		    /* remember possible template */
+		    if (relpg_p[cur_gel] + lngthg_p[cur_gel] >
+			oligostart + oligolen)
+			template = cur_gel;
+		    /*
+		     * Choose the gel that is furthest right (nearest to the
+		     * oligo), but that also has a large amount of data to
+		     * the right of the oligo. So if the right most gel is
+		     * a short reading (maybe due to bad template) then we
+		     * choose the next most right gel instead.
+		     */
+
+#ifdef DEBUG_OLIGO_SEL
+		    fprintf(DSTR, "Covered by gel no. %d (ends %d)\n",
+			    cur_gel, relpg_p[cur_gel] + lngthg_p[cur_gel]);
+#endif
+
+		    /* Base 'score' on nearness to oligo, and length */
+		    score = avg_read_len() - (oligostart - relpg_p[cur_gel])
+			+ 0.5 * lngthg_p[cur_gel];
+		    if (score > bestscore)
+			bestgel = cur_gel, bestscore = score;
+		}
+	    }
+	    cur_gel = rnbr_p[cur_gel];
+	} while (cur_gel != 0);
+	
+	if (bestgel == 0)
+	    return -1;
+#ifdef DEBUG_OLIGO_SEL
+	fprintf(DSTR, "Choosing gel no. %d for template\n", bestgel);
+#endif
+
+	/*
+	 * Create tag - far more convoluted than it needs to be. We cannot tag
+	 * the consensus so we have to tag a sequence instead. Preferably we
+	 * use the template, failing that a sequence in the same direction, or
+	 * failing that any sequence that at least covers the region.
+	 */
+	oligoend = oligostart + oligolen;
+	if (bestgel && relpg_p[bestgel] + lngthg_p[bestgel] >= oligoend)
+	    template = bestgel;
+	else if (template == 0) {
+	    /* no template in this direction */
+	    cur_gel = last_gel;
+	    while (cur_gel != 0) {
+		if ((relpg_p[cur_gel] + 
+		     (lngthg_p[cur_gel]>0?lngthg_p[cur_gel]:-lngthg_p[cur_gel]
+		      )) > oligoend) {
+		    template = cur_gel;
+		    break;
+		}
+		cur_gel = rnbr_p[cur_gel];
+	    }
+	}
+	if (template) {
+	    /*
+	     * In the comment we use 'bestgel' as the template even if we
+	     * are tagging a different sequence.
+	     */
+	    readn_(idevn_p, &bestgel, gelname, (int_fl)(DB_NAMELEN));
+	    Fstr2Cstr(gelname, DB_NAMELEN, gelname, DB_NAMELEN);
+
+#ifdef DEBUG_OLIGO_SEL
+	    fprintf(DSTR, "Using gel no. %d for template\n", template);
+#endif
+	    sprintf(comment, "Template=%s\nName=%s.%d\nSequence=%s\n",
+		    gelname, datnam, *temnum, oligo);
+
+	    insert_NEW_tag(template, oligostart - relpg_p[template] + 1,
+			   oligolen + 1, "OLIG", comment); 
+
+#ifdef DEBUG_OLIGO_SEL
+	    puts(comment);
+#endif
+	    ok = fprintf(outfile, "%s.%d %s %s (@ %d ) %c\n",
+			 datnam, *temnum, gelname, oligo, oligostart, sense);
+	    if (ok < 0 || ok == EOF) {
+		puts("*** Failed to write to output file");
+		return -1;
+	    }
+	    fprintf(stdout, "At %d - template %s, primer %s, number %d\n",
+		    (sense == '+') ? offset : relpg_p[lincon] - offset +1,
+		    gelname, oligo,	(*temnum)++);
+	    fflush(stdout);
+	    updout_();
+	} else {
+	    fprintf(stdout, "At %d - no suitable oligos found\n",
+		    (sense == '+') ? offset : relpg_p[lincon] - offset +1);
+	    return -1;
+	}
+	updout_();
+    }
+    return oligostart + avg_read_len();
+}
+
+/*
+ * Find oligos for a contig region. Output a list of suggested experiments
+ * to extend into problem areas (such as single stranded sections or the
+ * ends of the contig
+ */
+void olisel_(
+     int_f *relpg_p,	/* int array - relative position (gel reading) */
+			/*        or - length of contig */
+     int_f *lngthg_p,	/* int array - length of gel */
+			/*        or - empty for contig */
+     int_f *lnbr_p,	/* int array - left nodes */
+     int_f *rnbr_p,	/* int array - right nodes */
+     char  *qual_p,	/* char array - quality of contig */
+     char  *cons_p,	/* char array - consensus of contig */
+     int_f *llino_p,	/* int - left gel in contig */
+     int_f *lincon_p,	/* int - record no. of contig */
+     int_f *lreg_p,	/* int - left start in qual */
+     int_f *rreg_p,	/* int - right start in qual */
+     int_f *idevn_p,	/* int - stream of name file */
+     char  *sense,	/* char array - actually single char */
+     int_f *olilen_p,	/* int - length of oligo selection region */
+     int_f *olibak_p, 	/* int - offset bak of start of oli sel region */
+     int_f *lstrt_p,	/* int - start (not offset) of oli sel region */
+     int_f *temnum_p,	/* int - start template number */
+     char  *datnam_p,	/* string - name of database */
+     int_f *olinum_p,	/* int - number of oligos per template */
+     int_f *datnaml,	/* int length - of database name */
+     int_fl datnam_l	/* length - of elements in datnam (==1)*/
+     )
+{
+    register int_f i, j, l;
+    int_f rreg = *rreg_p;	/* faster copy of *rreg_p */
+    int_f lreg = *lreg_p;
+    char *consensus, comment[MAXCOMLEN], *oligo;
+    static FILE *outfile;
+    char fname[256];
+
+    strncpy(fname, datnam_p, *datnaml);
+    fname[*datnaml] = '\0';
+    oligo = (char *)malloc(*olilen_p+1);
+    consensus = (char *)malloc(*olilen_p+1);
+    last_gel = *llino_p;
+
+    osp_initialise();
+
+    if (*sense == '+' && (outfile = fopen(primfilename, "w")) == NULL) {
+	fprintf(stdout, "Failed to open 'oligo_sel.out'\n");
+	return;
+    }
+
+    relpg_p--;
+    lngthg_p--;
+    lnbr_p--;
+    rnbr_p--;
+    qual_p--;
+    cons_p--;
+
+    /* scan through quality buffer */
+    /* Do not look at the very left hand end. */
+    if (lreg < (*olilen_p + *olibak_p + 1))
+	lreg = (*olilen_p + *olibak_p + 1);
+    for (i = lreg; i<=rreg; i++) {
+
+	/* strong negative strand, but no positive strand */
+	if (qual_p[i] == '2' || qual_p[i] == '8') {
+
+	    /* find length of single stranded section */
+	    j = i;
+	    while(qual_p[j] == '2' || qual_p[j] == '8')
+		j++;
+
+#ifdef DEBUG_OLIGO_SEL
+	    fprintf(DSTR, "Single strand at %d - %d, len %d\n", i, j, j-i);
+#endif
+/*
+	    fprintf(stdout, "At %d - ", (*sense == '+') ? i :
+		    relpg_p[*lincon_p] - i + 1);
+*/
+	    l = find_oligos(i,         relpg_p,  lngthg_p,  rnbr_p,
+			    cons_p,    comment,  oligo,     consensus,
+			    idevn_p,   *sense,   outfile,   *olilen_p,
+			    *olibak_p, temnum_p, fname,     *olinum_p,
+			    *lincon_p);
+	    if (l != -1)
+		i = (int_f)l;
+/*
+	    else
+		puts("No suitable oligos found");
+*/
+
+	    if (i < j)
+		i = j;
+	}
+    }
+    
+    free(oligo);
+    free(consensus);
+    osp_cleanup();
+    if (*sense == '-' && outfile) {
+	fclose(outfile);
+    }
+}
+
+/*
+ * Initialises the olisel_() function by asking the required questions to
+ * the user. The only question here is the filename - the others are currently
+ * written in FORTRAN.
+ */
+void olinit_(int_f *status_p,   int_f *olilen_p, int_f *olibak_p,
+	     int_f *dialogue_p, int_f *maxgel_p, int_f *temnum_p,
+	     int_f *numoli_p) {
+    if (gtstr("Name of file for primers", "primers", primfilename,
+	      sizeof(primfilename)) == -1) {
+	*status_p = -1;
+	return;
+    }
+
+    *temnum_p = getint(1, 9999, 1, "Start oligo number", status_p);
+    if (*status_p < 0) {
+	*status_p = -1; return;
+    }
+
+    if (*dialogue_p) {
+	*olibak_p = getint(1, *maxgel_p, 20, 
+			   "Start of oligo choice region", status_p);
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+
+	*olilen_p = getint(*olibak_p, *maxgel_p, *olibak_p+40, 
+			   "End of oligo choice region", status_p)-*olibak_p;
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+	
+	*numoli_p = getint(1, 99, 2, "Number of oligos per region", status_p);
+	if (*status_p < 0) {
+	    *status_p = -1; return;
+	}
+	
+	*status_p = 0;
+	return;
+    } else {
+	*olibak_p = 20;
+	*olilen_p = 60;
+	*numoli_p = 2;
+    }
+}
diff --git a/src/bap/oligocom.c b/src/bap/oligocom.c
new file mode 100644
index 0000000..2921407
--- /dev/null
+++ b/src/bap/oligocom.c
@@ -0,0 +1,151 @@
+#include "defn.h"
+#include "struct.h"
+
+/* NEM_DEFS - Preferred Nematode defaults */
+#define NEM_DEFS
+
+/*
+ * OSP requires these variables to be set?????
+ */
+static int init_done = 0;
+extern int program_option;
+extern int program_version;
+extern Prm prm;
+char *score_info;
+char **screens;
+OSP_Results *OSP_RESULTS;/* table of results from call to osp_analyse */
+
+
+static void param_init()
+{
+  
+  
+  if (program_option==3 || program_version==3 || program_version==1) {
+
+#ifdef NEM_DEFS    
+    prm.prod_len_low=0; /* old len_low; lower bound on product length */
+    /* For sequencing primers only, this is distance
+       from the end of the sequence including the primer
+       itself */
+    prm.prod_len_high=200;  /* upper bound on product length */
+#else /*NEMDEFS*/
+    prm.prod_len_low=0; /* old len_low; lower bound on product length */
+    /* For sequencing primers only, this is distance
+       from the end of the sequence including the primer
+       itself */
+    prm.prod_len_high=100;  /* upper bound on product length */
+#endif /*NEMDEFS*/
+
+#ifdef NEM_DEFS
+    prm.min_prim_len=17, prm.max_prim_len=23; /* minimum and maximum primer length */
+#else /*NEM_DEFS*/
+    prm.min_prim_len=17, prm.max_prim_len=18; /* minimum and maximum primer length */
+#endif /*NEM_DEFS*/
+  }
+  else {
+    
+    prm.prod_len_low=100; /* old len_low; lower bound on product length */
+    /* For sequencing primers only, this is distance
+       from the end of the sequence including the primer
+       itself */
+    prm.prod_len_high=300;  /* upper bound on product length */
+    prm.min_prim_len=18, prm.max_prim_len=22; /* minimum and maximum primer length */
+  }
+  prm.prod_tm_low=70.0;    /*lower bound on melting temperature, degrees C */
+  prm.prod_tm_high=90.0; /* upper bound on melting temperature in degrees C */
+  
+#ifdef NEM_DEFS
+  prm.prim_gc_low=.30; /* lower bound on percentage G+C content, primer */
+  prm.prim_gc_high=.70; /* upper bound on percentage G+C content, primer*/
+#else /*NEM_DEFS*/
+  prm.prim_gc_low=.40; /* lower bound on percentage G+C content, primer */
+  prm.prim_gc_high=.55; /* upper bound on percentage G+C content, primer*/
+#endif /*NEM_DEFS*/
+  prm.prod_gc_low=.40; /* lower bound on percentage G+C content, primer */
+  prm.prod_gc_high=.55; /* upper bound on percentage G+C content, primer*/
+  prm.prim_tm_low=50;/* lower bound on primer melting temperature in degrees C */
+  prm.prim_tm_high=55; /* upper bound on primer melting temperature in degrees C */
+  prm.self3_hmlg_cut=8,prm.selfI_hmlg_cut=14; /*homology cutoff scores primer-self*/
+  prm.pp3_hmlg_cut=8,prm.ppI_hmlg_cut=14; /*  homology cutoff scores 
+					      primer-primer (not used in nemo)*/
+  prm.primprodI_hmlg_cut=0, prm.primprod3_hmlg_cut=0;  /*primer-product homology 
+							 score cutoffs*/
+  prm.delta_tm_cut=2.0; /* cutoff for difference in tm between primer1 
+			 and tm of primer2*/
+  prm.primotherI_hmlg_cut=0.0,prm.primother3_hmlg_cut=0.0; /*homology cutoff scores for 
+							 the primer versus the
+							 other sequence file */
+  /* the other sequence file contains vector, cosmid, repeated sequences, etc */
+  /*allocations*/
+  sprintf(prm.end_nucs,"S");   /* final nucleotides of the sequence */
+  
+  prm.AT_score=2.0; /* used in the calculation of annealing score, each time
+		     an A-T align, the score would be prm.AT_score */
+  prm.CG_score=4.0; /* used in the calculation of annealing score, each time
+		     an C-G align, the score would be prm.CG_score */
+  sprintf(prm.wt_ambig,"avg"); /* either full or average, determines 
+       calculation of annealing scores when
+       ambiguous nucleotides are being considered;  if avg, then take
+       an average of the possible nucleotide scores; if full, then
+       give it the maximum possible score. e.g.
+       a C on one strand and an R on the other, the full score
+       would be prm.CG_score; the avg score would be 1/2*prm.CG_score
+       The table is set up in analysis.c in the function 
+       get_weight_matrix*/
+	     
+
+  
+  /* scoring weights */
+  prm.wt_prod_len=0;
+  prm.wt_prod_tm=0;
+  prm.wt_prod_gc=0;
+  prm.wt_prim_a_len=0,prm.wt_prim_s_len=0;
+  prm.wt_prim_a_gc=0,prm.wt_prim_s_gc=0;
+  prm.wt_prim_a_tm=0,prm.wt_prim_s_tm=0;
+  prm.wt_self3_hmlg_cut=2,prm.wt_selfI_hmlg_cut=1;
+  prm.wt_pp3_hmlg_cut=2,prm.wt_ppI_hmlg_cut=1;
+  prm.wt_primprodI_hmlg_cut=0,prm.wt_primprod3_hmlg_cut=0;
+  prm.wt_primotherI_hmlg_cut=0,prm.wt_primother3_hmlg_cut=0;
+  prm.wt_delta_tm_cut=0;
+} 
+
+void osp_initialise()
+{ int i;
+
+
+  if (init_done) return;
+  init_done++;
+
+  screens=(char **)our_alloc(10 * sizeof(char *));
+  for (i=0; i<10; i++) screens[i]=(char *)our_alloc(MAX_SEQ_LEN * sizeof(char));
+  score_info=(char *)our_alloc(3000 * sizeof(char));
+  prm.end_nucs = (char *)our_alloc(MAX_NAME_SIZE *sizeof(char));
+  prm.wt_ambig = (char *)our_alloc(6 *sizeof(char));
+  OSP_RESULTS  = (OSP_Results *)our_alloc(MAX_NUM_OLIGOS * sizeof(OSP_Results));
+  /*end allocations*/
+
+  program_option=3;
+  program_version=3;
+
+  param_init();
+  
+}
+
+void osp_cleanup() {
+    int i;
+
+    if (init_done == 0)
+	return;
+    init_done = 0;
+
+    if (screens) {
+	for (i=0; i<10; i++)
+	    if (screens[i])
+		our_free(screens[i]);
+	our_free(screens);
+    }
+    score_info   && our_free(score_info);
+    prm.end_nucs && our_free(prm.end_nucs);
+    prm.wt_ambig && our_free(prm.wt_ambig);
+    OSP_RESULTS  && our_free(OSP_RESULTS);
+}
diff --git a/src/bap/oligocom.h b/src/bap/oligocom.h
new file mode 100644
index 0000000..acc279e
--- /dev/null
+++ b/src/bap/oligocom.h
@@ -0,0 +1,7 @@
+#include "struct.h"
+
+/* Needed for oligo_sel.c */
+extern Prm prm;
+extern char *score_info;
+extern char **screens;
+extern OSP_Results *OSP_RESULTS;
diff --git a/src/bap/osp-bits/README b/src/bap/osp-bits/README
new file mode 100644
index 0000000..dac015e
--- /dev/null
+++ b/src/bap/osp-bits/README
@@ -0,0 +1,16 @@
+Bits and pieces of OSP used by xbap   (C) LaDeana Hillier and Philip Green
+
+People wanting to obtain the program OSP should contact:
+
+    LaDeana Hillier (lfw@elegans.wustl.edu)
+    Department of Genetics
+    Washington University School of Medicine
+    4566 Scott Avenue, Box 8232
+    St. Louis, MO 63110
+    USA
+
+Reference:
+
+Hillier, L., and Green, P. (1991). "OSP: an oligonucleotide selection
+program," PCR Methods and Applications, 1:124-128.
+
diff --git a/src/bap/osp-bits/Xincludes.h b/src/bap/osp-bits/Xincludes.h
new file mode 100644
index 0000000..eb743f7
--- /dev/null
+++ b/src/bap/osp-bits/Xincludes.h
@@ -0,0 +1,44 @@
+/* 
+  File: Xincludes.h 
+
+  Copyright 1991: LaDeana Hillier and Philip Green
+   
+*/
+
+#ifndef _Xincludes_h
+#define _Xincludes_h
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/AsciiSrc.h>
+#include <X11/Xaw/Viewport.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Dialog.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Paned.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/TextSink.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/extensions/shape.h>
+/* #include <X11/extensions/MITMisc.h> */
+#include <X11/Shell.h>
+#include <X11/Xaw/Cardinals.h>
+
+
+/* global X variables */
+ extern Display  *theDisplay;
+ extern int      theScreen;
+ extern Colormap theColormap;
+ extern XFontStruct *theDrawFont;
+ extern Pixmap graphResultsPixmap;
+ extern GC graphResultsGC;
+ extern XGCValues graphResultsGC_values;
+ extern Dimension graphWidth,graphHeight;
+ extern Dimension charWidth; /* font character width*/
+ extern Widget resultsWid;
+ extern Widget graphResultsWid;
+
+
+#endif /* _Xincludes_h */
diff --git a/src/bap/osp-bits/Xmess.h b/src/bap/osp-bits/Xmess.h
new file mode 100644
index 0000000..80a7f03
--- /dev/null
+++ b/src/bap/osp-bits/Xmess.h
@@ -0,0 +1,37 @@
+#ifndef _Xmess_h
+#define _Xmess_h
+
+/* 
+  Program Name: Xmess 
+  File name: Xmess.h
+  Purpose: send any message to either a popupWidget or the resultsWid
+     from any of the other program modules -- can be warnings, or
+     information or errors
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Last Update: Fri Mar 22 1991
+
+  Change Log:
+*/
+
+/* ---- Exports ---- */
+extern void message();
+/* void message(message)
+ String message;
+  put the String message in the results window */
+
+
+
+extern void popupMessage();
+/* void popupMessage(message_str)
+ char *message_str;
+  put the message_str in the popupMessage window */
+
+extern void popUpErrorMessage();
+/* pops up an error message if there has been overwriting
+   of any variable in our_alloc, i.e. if I have not allocated
+   enough space for some variable*/
+
+#endif /* _Xmess_h */
+
+
diff --git a/src/bap/osp-bits/analysis.c b/src/bap/osp-bits/analysis.c
new file mode 100644
index 0000000..0caad5b
--- /dev/null
+++ b/src/bap/osp-bits/analysis.c
@@ -0,0 +1,2881 @@
+/* 
+  Program Name: analysis
+  File name: analysis.c
+  Purpose: all sequence analysis is done in this module, except
+  when the user only asks for scores without looking for primers 
+  In this program all of the primer and product evaluation
+  takes place, from selection of valid primers to selection of valid
+  primer pairs 
+  
+  Last Update: Tuesday April 16 1991
+  
+  Change Log:
+  
+  Copyright 1991: LaDeana Hillier and Philip Green
+  */
+
+
+
+/* ---- Includes ---- */
+
+#include <stdlib.h>
+#include <string.h>
+
+#include "our_allo.h"
+#include "defn.h" /* macros and stdio */
+
+#include "struct.h" /* global structure defns */
+
+#if defined(TEXTVERSION) || defined(XVERSION)
+#include "extern_var.h" /* external variables */
+#include "utils.h" /* IMPORT: text_to_output*/
+#include "primerGraphics.h" /* IMPORT drawPrimers, drawflankingPrimers */
+#include "Xmess.h"  /* IMPORT: message, popupMessage, popUpErrorMessage */
+OSP_Results *OSP_RESULTS;
+#endif
+
+#include "analysis.h" 
+
+
+extern char tedType[4];
+
+#ifdef SUBVERSION
+extern void message();
+extern void popupMessage();
+extern void popUpErrorMessage();
+extern void text_to_output();
+extern void drawPrimers();
+extern void drawflankingPrimers();
+extern void writeTedShell();
+extern void get_scores();
+int strmatch();
+
+extern char *score_info;
+/*extern Prm prm;*/
+extern char *seq;
+extern OSP_Results *OSP_RESULTS;
+
+/* extern_var.h extractions */
+p_bank *a_bank,*s_bank;
+score_bank *product;
+otherSeqBank *otherSeq;
+Prm prm;
+/* not used at all in this file */
+char *output_fn;
+
+/* not used but go ahead and allocate a little space */
+char *seq_two;
+/******/
+
+int score_info_len=3000;
+static char *opp;
+int num_valid_primers=0;
+int buftext=0;
+int seq_len;
+int *ndx;
+int one_primer=1;
+int num_seq=1;
+int orient=1; /* I assume xdap will always send it to me in top strand form */
+int num_other_seqs;
+int program_option=3; /* search for a single primer in one sequence, may want 4 too?,
+			 4 is output scores for a single primer which you supply*/
+int program_version=3; /* subroutine version=3, text=2, osp=0, nemo=1 */
+int double_stranded=1; /* do primer-product calcs versus both strands*/
+int otherRight=0; /* look on the top strand only */
+int otherDouble_stranded=1; /* look at both strands of the other sequence for homologies */
+
+
+
+
+
+
+void interface_init(score_info)
+    char *score_info;
+{
+    int i;
+    
+    seq_two = (char *)our_alloc(MAX_SEQ_LEN * sizeof(char));
+    opp = (char *)our_alloc(125 * sizeof(char));
+    a_bank = (p_bank *)our_alloc(MAX_NUM_OLIGOS *sizeof(p_bank));
+    s_bank = (p_bank *)our_alloc(MAX_NUM_OLIGOS *sizeof(p_bank));
+    product = (score_bank *)our_alloc(MAX_NUM_OLIGOS *sizeof(score_bank));
+    ndx = (int *)our_alloc(MAX_NUM_OLIGOS*sizeof(int));
+    
+    /* initialize values of global variables */
+    for (i = 0; i <= 125; i++) {
+	opp[i]='N'; /* for now everything non-AGCT is made into
+		       an N in the complementary sequence */
+    }
+    
+    /* this array is used when getting complementary sequences */
+    opp['A'] = 'T';
+    opp['G'] = 'C';
+    opp['T'] = 'A';
+    opp['U'] = 'A';
+    opp['C'] = 'G';  
+    opp['K'] = 'M';
+    opp['5'] = 'K';
+    opp['6'] = 'M';
+    opp['7'] = 'W';
+    opp['8'] = 'S';
+    opp['Y'] = 'R';
+    opp['R'] = 'Y';
+    opp['W'] = 'W';
+    opp['S'] = 'S';
+    opp['M'] = 'K'; 
+    opp['V'] = 'B';  
+    opp['B'] = 'V';
+    opp['D'] = 'H';
+    opp['H'] = 'D';
+    opp['N'] = 'N';
+    opp['X'] = 'N';
+    
+    *score_info = '\0';
+    num_other_seqs=-1; /* -1 if there are no other seqs */
+    *seq_two = '\0';
+    
+    return;
+    
+}
+
+
+void clean_up()
+{ int i;
+  our_free(ndx);
+  our_free(opp);
+  our_free(seq_two);
+  our_free(s_bank);
+  our_free(a_bank);
+  our_free(product);
+  if (num_other_seqs!=-1) {
+      for (i=0; i<=num_other_seqs; i++)
+	  our_free(otherSeq[i].seq);
+      our_free(otherSeq);
+  }
+  return;
+}
+
+void primer_to_xdap(num_save,OSP_RESULTS) 
+    int num_save;
+    OSP_Results *OSP_RESULTS;
+{ int primer_number;
+  /* osp_results from 1 to num_save */
+  
+  for (primer_number=1; primer_number <= num_save; primer_number++) {
+      /* len is product length, or distance from end , not length of primer */
+      OSP_RESULTS[primer_number-1].start_position=s_bank[product[ndx[primer_number]].s_primer].stp;
+      OSP_RESULTS[primer_number-1].end_position=s_bank[product[ndx[primer_number]].s_primer].endp;
+      OSP_RESULTS[primer_number-1].score=product[ndx[primer_number]].sum_score;
+      OSP_RESULTS[primer_number-1].gc=s_bank[product[ndx[primer_number]].s_primer].gc;
+      OSP_RESULTS[primer_number-1].tm=s_bank[product[ndx[primer_number]].s_primer].tm;
+      OSP_RESULTS[primer_number-1].psI_score=s_bank[product[ndx[primer_number]].s_primer].score;
+      OSP_RESULTS[primer_number-1].ps3_score=s_bank[product[ndx[primer_number]].s_primer].end_score;
+      OSP_RESULTS[primer_number-1].poI_score=s_bank[product[ndx[primer_number]].s_primer].primotherI;
+      OSP_RESULTS[primer_number-1].po3_score=s_bank[product[ndx[primer_number]].s_primer].primother3;
+  }
+  if (num_save==0) primer_number=0;
+  OSP_RESULTS[primer_number].start_position=0;
+  OSP_RESULTS[primer_number].end_position=0;
+  OSP_RESULTS[primer_number].score=0;
+  OSP_RESULTS[primer_number].gc=0;
+  OSP_RESULTS[primer_number].tm=0;
+  OSP_RESULTS[primer_number].psI_score=0;
+  OSP_RESULTS[primer_number].ps3_score=0;
+  OSP_RESULTS[primer_number].poI_score=0;
+  OSP_RESULTS[primer_number].po3_score=0;
+  return;
+}
+
+int osp_analyse(OSP_RESULTS, seq, params, screens, score_info)
+    OSP_Results *OSP_RESULTS; /* results structure see struct.h */
+    char *seq; /* sequence from which to choose oligo */
+    Prm *params; /* parameters, NULL means use default */
+    char **screens; /* sequences to screen against */
+    char *score_info; /* rejection information */
+    
+{ int stp,endp,stp2,endp2;
+  int analysis_ok;
+  int i,j;
+  int primer_number;
+  
+  
+  prm = *params;
+  
+  for (i=0; seq[i]; i++)  {
+      if (!isupper(seq[i])) toupper(seq[i]);
+  }
+  
+  interface_init(score_info);
+  
+  
+  /* put your other sequences into my other sequence structure */
+  /* how many other sequences are there? */
+  for (i=0; screens[i][0]!='\0'; i++);
+  num_other_seqs = i-1;
+  
+  
+  /* allocate for other seqs */
+  if (num_other_seqs != -1) {
+      otherSeq = (otherSeqBank *)our_alloc((num_other_seqs + 1) *
+					   sizeof(otherSeqBank));
+      for (i=0; i<=num_other_seqs; i++)
+	  otherSeq[i].seq = (char *)our_alloc(OTHER_SEQ_MAX_SIZE*sizeof(char));
+      
+      /* stuff xdaps screens into my otherseqs */
+      for (i=0; screens[i][0]!='\0'; i++) {
+	  for  (j=0; screens[i][j]; j++)
+	      otherSeq[i].seq[j]=toupper(screens[i][j]);
+	  otherSeq[i].len=strlen(otherSeq[i].seq);
+	  otherSeq[i].strand=1;
+      }
+  } else {
+      otherSeq = NULL;
+  }
+  
+  /* go call analyze */  
+  stp=0; endp=strlen(seq)-1;
+  seq_len=endp;
+  stp2=stp; endp2=endp;
+  
+  analysis_ok=analysis(seq,stp,endp,stp2,endp2,OSP_RESULTS,score_info);
+  
+  /* stick my primer structures back into xdaps, already done in analysis() 
+     unless it didn't find any primers; make sure that all the osp_results
+     for 0 are 0 */
+  if (analysis_ok==0 || num_valid_primers==0) {
+      primer_number=0;
+      OSP_RESULTS[primer_number].start_position=0;
+      OSP_RESULTS[primer_number].end_position=0;
+      OSP_RESULTS[primer_number].score=0;
+      OSP_RESULTS[primer_number].gc=0;
+      OSP_RESULTS[primer_number].tm=0;
+      OSP_RESULTS[primer_number].psI_score=0;
+      OSP_RESULTS[primer_number].ps3_score=0;
+      OSP_RESULTS[primer_number].poI_score=0;
+      OSP_RESULTS[primer_number].po3_score=0;
+  }
+  
+  
+  /* clean up my allocations */
+  clean_up();
+  
+  /* return */
+  return(analysis_ok);
+}
+
+void osp_get_score(OSP_RESULTS, seq, primer_start, primer_end, screens)
+    OSP_Results *OSP_RESULTS; /* results structure see struct.h */
+    char *seq; /* sequence from which to choose oligo */
+    char **screens; /* sequences to screen against */
+    int primer_start,primer_end; /* starting and ending point for primer
+				    within sequence */
+{ int stp,endp;
+  int i,j;
+  
+  
+  for (i=0; seq[i]; i++)  {
+      if (!isupper(seq[i])) toupper(seq[i]);
+  }
+  
+  num_other_seqs=-1; /* -1 if there are no other seqs */
+  
+  /* put your other sequences into my other sequence structure */
+  /* how many other sequences are there? */
+  for (i=0; screens[i][0]!='\0'; i++);
+  num_other_seqs = i-1;
+  
+  
+  /* allocate for other seqs */
+  if (num_other_seqs != -1) {
+      otherSeq = (otherSeqBank *)our_alloc((num_other_seqs + 1)*
+					   sizeof(otherSeqBank));
+      for (i=0; i<=num_other_seqs; i++)
+	  otherSeq[i].seq = (char *)our_alloc(OTHER_SEQ_MAX_SIZE*sizeof(char));
+  
+      /* stuff xdaps screens into my otherseqs */
+      for (i=0; screens[i][0]!='\0'; i++) {
+	  for  (j=0; screens[i][j]; j++)
+	      otherSeq[i].seq[j]=toupper(screens[i][j]);
+	  otherSeq[i].len=strlen(otherSeq[i].seq);
+	  otherSeq[i].strand=1;
+      }
+  } else {
+      otherSeq = NULL;
+  }
+  
+  /* go call get_scores */  
+  stp=0; endp=strlen(seq)-1;
+  seq_len=endp;
+  
+  get_scores(seq,seq,primer_start,primer_end,stp,endp,1,orient,OSP_RESULTS);
+  
+  /* stick my primer structures back into xdaps, already done in get_scores*/
+  
+  /* return */
+  return;
+}
+
+
+#endif /* if defined subversion */
+
+
+
+/* ---- Internal Functions ---- */
+
+
+void melt_temp(len,pgc,tm,num_ambigs)
+    /* calculates melting temperatures for a given 
+       length, and percent gc */
+    float pgc; /* INPUT: prcnt gc content */
+    int len; /* INPUT: length of the primer or product */
+    float *tm; /* OUTPUT: Tm given that percent gc and length */
+    int num_ambigs; /* INPUT: number of ambiguities within this
+		       length of sequence */
+    
+{  
+  /* calculate the tm based on the maniatis formula, see below */
+  
+  len-=(float)num_ambigs;
+  
+  *tm = 62.3 + 0.41*pgc*100.0 - (float)(500.0/(float)len);
+  
+  /*  Tm = 62.3 + 0.41*(%G+C) - 500/N where N is the length of the sequence*/
+  
+  return;
+}
+
+
+void get_gc_array(theseq,gc_array)
+    char *theseq; /* INPUT: the sequence for which you wish to get 
+		     the gc_array */
+    int *gc_array; /* OUTPUT: the gc content for each position in
+		      theseq */
+    
+    /* calculate the gc_content at every position in a seq and
+       output it in gc_array */
+    
+{ int i;
+  int *v;
+  int totalGC = 0;
+  
+  v = (int *)our_alloc(125 * sizeof(int));
+  
+  /* initializations, use array lookup */
+  for (i=0; i<125; i++)  v[i]=0;
+  v['c'] = 1; v['C']=1;
+  v['g'] = 1; v['G']=1;
+  
+  for (i=0; theseq[i]; i++) {
+      /*make sure I have allocated enought for this gc_array*/
+      if (i>MAX_SEQ_LEN) {
+	  message("Input sequence longer than MAX_SEQ_LEN") ;
+	  popUpErrorMessage();
+      }
+      totalGC += v[theseq[i]];
+      gc_array[i] = totalGC;
+  }
+  
+  our_free(v);
+  return;
+  
+}
+
+
+
+
+int is_n(vec,stp,endp,is_nuc) 
+    char *vec; /* INPUT: sequence */
+    int stp,endp; /* INPUT: starting point to look for N's */
+    int *is_nuc; /* INPUT: is_nuc, array indicating whether or
+		    not a given nucleotide should be condsidered
+		    ambiguous, 1 if not ambiguous */
+    
+    /* returns a 1 if there is a non AGCT in the sequence
+       between stp and endp, returns a 0 if the whole sequence
+       is AGCT */
+    
+{ int i;
+  
+  for (i=stp; i <= endp; i++) 
+      if (!is_nuc[vec[i]]) return(1);
+  
+  return(0);
+}
+
+
+int find_end_nucs(vec,stp,endp,end_nucs)
+    char *vec;		/*INPUT: sequence */
+    int stp,endp;	/* INPUT: starting and ending points in that vector */
+    char *end_nucs;	/* INPUT: the nucleotide string you wish to find */
+    
+    /* returns the position of the first end_nucs sequence it finds,
+       returns a -1 if no end_nuc sequence was found */
+{ /* int i,j; */
+  int eql[125][125];
+  int pos;
+  
+  /*	for (i = 0; i < 125; i++) {
+	for (j = 0; j < 125; j++) {
+	eql[i][j]=0;
+	}
+	}*/
+  
+  eql['a']['N'] = 1;
+  eql['A']['N'] = 1;
+  eql['g']['N'] = 1;
+  eql['G']['N'] = 1;
+  eql['c']['N'] = 1;
+  eql['C']['N'] = 1;
+  eql['t']['N'] = 1;
+  eql['T']['N'] = 1;
+  eql['T']['U'] = 1;
+  eql['U']['T'] = 1;
+  eql['a']['n'] = 1;
+  eql['A']['n'] = 1;
+  eql['g']['n'] = 1;
+  eql['G']['n'] = 1;
+  eql['c']['n'] = 1;
+  eql['C']['n'] = 1;
+  eql['t']['n'] = 1;
+  eql['T']['n'] = 1;
+  eql['T']['n'] = 1;
+  
+  eql['a']['a'] = 1;
+  eql['A']['A'] = 1;
+  eql['g']['g'] = 1;
+  eql['G']['G'] = 1;
+  eql['c']['c'] = 1;
+  eql['C']['C'] = 1;
+  eql['t']['t'] = 1;
+  eql['T']['T'] = 1;
+  eql['T']['W'] = 1;
+  eql['U']['W'] = 1;
+  eql['t']['W'] = 1;
+  eql['A']['W'] = 1;
+  eql['a']['W'] = 1;
+  eql['T']['w'] = 1;
+  eql['t']['w'] = 1;
+  eql['A']['w'] = 1;
+  eql['a']['w'] = 1;
+  eql['C']['S'] = 1;
+  eql['c']['S'] = 1;
+  eql['G']['S'] = 1;
+  eql['g']['S'] = 1;
+  eql['C']['s'] = 1;
+  eql['c']['s'] = 1;
+  eql['G']['s'] = 1;
+  eql['g']['s'] = 1;
+  
+  pos = strmatch(vec,end_nucs,stp,endp,eql);
+  return(pos);
+}
+
+
+
+int strmatch(vec,field,stp,endp,eql)
+    char *vec; /* INPUT: character string of information*/
+    char *field; /* INPUT: character string for which you
+		    wish to find a match in vec */
+    int stp,endp; /*INPUT: starting and ending points within
+		    the vector to use for searching*/
+    int eql[125][125]; /*INPUT: array of which characters
+			 are considered equal */
+    
+    /* this function takes an input vec from the indices vec[stp] to 
+       vec[endp] and searches for amatch for the given input field 
+       string, field.  If it finds a match it returns the value of the 
+       starting array indices of that match. Else it returns -1 */
+    
+{ int i,j,k;
+  int startm = -1;
+  
+  for (i = stp; i <= endp; i++) {
+      j = 0;
+      k = i;
+      
+      while(field[j]) 
+	  if (eql[vec[k++]][field[j++]]!=1) break;
+      
+      if ((eql[vec[k-1]][field[j-1]]==1) && (!field[j])) {
+	  startm = i;
+	  break;
+      }
+  }
+  
+  return(startm);
+}
+
+
+int add_to_bank(bstp,bendp,ascore,aend_score,oscore,oend_score,gccon,tm,bank)
+    float oscore,oend_score; /* other sequence homology scores, internal and
+				three prime*/
+    float ascore,aend_score;  /*homology and end_homology scores */
+    float gccon,tm;  /*gc content and tm*/
+    int bstp,bendp;  /*starting and ending point of candidate */
+    p_bank *bank;  /*could be passed the sense bank list*/
+    /*or the antisense bank list */
+    
+    /* add primers to the list of candidate primers, bank[0] contains
+       the total number of banks input so far, bank[1] contains the stp 
+       for the first candidate, bank[2] contains the endp for that
+       candidate, then bank[3] and bank[4] contain the next stp and endp, etc.
+       
+       returns a 0 if the total number of oligos exceeded MAX_NUM_OLIGOS*/
+    
+{ 
+    char *str;
+    
+    bank[0].num += 1;
+    if (bank[0].num > MAX_NUM_OLIGOS) {
+	str = (char *)our_alloc(500 * sizeof(char));
+	sprintf(str,"The program has found more than %d candidate oligos, the maximum number of oligos allowed.\nPlease tighten the constraints on your search and analyze again.\n",MAX_NUM_OLIGOS);
+	if (strlen(str)>500) popUpErrorMessage();
+	message(str);
+	our_free(str);
+	return(0);
+    }
+    bank[bank[0].num].stp = bstp;
+    bank[bank[0].num].endp  = bendp;
+    bank[bank[0].num].score = ascore;
+    bank[bank[0].num].end_score = aend_score;
+    bank[bank[0].num].primotherI = oscore;
+    bank[bank[0].num].primother3 = oend_score;
+    bank[bank[0].num].gc = gccon;
+    bank[bank[0].num].tm = tm;
+    
+    return(1);
+}
+
+void indexx(n, arrin,indx)
+    int n, indx[];
+    float arrin[];
+    /* indexes an array arrin[1..n], i.e. outputs the array indx[1..n]
+       such that arrin[indx[j]] is in ascending order for j=1,2,...,N.
+       The input quatitities n and arrin are not changed */
+{
+    int l,j,ir,indxt,i;
+    float q;
+    
+    for (j=1; j<=n; j++) indx[j]=j;
+    
+    l = (n >> 1) + 1;
+    
+    ir = n;
+    for (;;) {
+	if (l > 1)
+	    q=arrin[(indxt=indx[--l])];
+	else {
+	    q=arrin[(indxt=indx[ir])];
+	    indx[ir]=indx[1];
+	    if (--ir == 1) {
+		indx[1]=indxt;
+		return;
+	    }
+	}
+	i=l;
+	j=l << 1;
+	while (j <= ir) {
+	    if (j < ir && arrin[indx[j]] < arrin[indx[j+1]]) j++;
+	    if (q < arrin[indx[j]]) {
+		indx[i]=indx[j];
+		j+=(i=j);
+	    }
+	    else j = ir+1;
+	}
+	indx[i]=indxt;
+    }
+}
+
+
+
+int sort_products(product, num)
+    score_bank *product; /*product bank to sort*/
+    int num; /* number of valid products */
+    
+    /* sorts a product bank by their sum_score 
+       returns 0 if there were too many products*/
+{ int i,j;
+  float *score;
+  char *str;
+  
+  
+  score = (float *)our_alloc(MAX_NUM_OLIGOS* sizeof(float));
+  
+  if (num > MAX_NUM_OLIGOS) {
+      str = (char *)our_alloc(500 * sizeof(char));
+      sprintf(str,"Number of products found, %d, exceeds ",num);
+      j=strlen(str);
+      sprintf(str+j,"MAX_NUM_OLIGOS, %d, allowed.\n",MAX_NUM_OLIGOS);
+      if (strlen(str)>500) popUpErrorMessage();
+      message(str);
+      popUpErrorMessage();
+      our_free(str);
+      our_free(score);
+      return(0);
+  }
+  
+  for (i = 1; i <= num; i++) 
+      score[i] = product[i].sum_score;
+  
+  
+  indexx(num,score,ndx);
+  /* indexes an array arrin[1..n], i.e. outputs the array indx[1..n]
+     such that arrin[indx[j]] is in ascending order for j=1,2,...,N.
+     The input quantities n and arrin are not changed */
+  
+  our_free(score);
+  return(1);
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+/* calculates internal homology scores, returns the score*/
+float homology(seq1,stp1,endp1,seq2,stp2,endp2,weight)
+    char *seq1,*seq2;  /*INPUT: vectors containing the sequence stored as follows:
+			 A is 0, T is 1, C is 2, G is 3 */
+    float **weight;  /*INPUT: weights: are determined by user-input parameters*/ 
+    int stp1,stp2,endp1,endp2; /* INPUT: starting and ending 
+				  positions within seq1 and seq2 for the search */
+    
+{ int i,j,i1,j1; 
+  float score,max_score;
+  
+  /* compare all possible alignments of the two sequences:
+     seq1 5' __________________ 3'
+     seq2         3' __________________ 5'  (opposite order, but
+     do not take the opposite
+     of the nt)
+     Start counting the score when you come to a pair of complementary
+     nucleotides, stop counting the score (governed by the weights)
+     when you come to two that are not complementary.
+     Output the maximum score.
+     */
+  
+  max_score = 0;
+  
+  for (i=endp1; i >= stp1; i--) { /* start seq1 at 3prime end */  
+      for (j=endp2; j >= stp2; j--) { /* start seq2 at the 3prime end */
+	  if (i < endp1) j = stp2;
+	  /* counting mechanism here is
+	     (len1,len2), (len1,len2-1), (len1,len2-2) ... (len1, 0),
+	     (len1 - 1, 0), (len1 - 2, 0)... (0,0) */
+	  i1 = i;
+	  j1 = j;
+	  while (j1 <= endp2) {
+	      score = 0;
+	      while (weight[seq1[i1]][seq2[j1]]!=0) {  /*start scoring */
+		  score += weight[seq1[i1]][seq2[j1]];
+		  if (score > max_score) max_score = score;	
+		  if (i1>stp1) i1--;
+		  else break;
+		  if (j1<endp2) j1++;
+		  else break;
+	      } /* while weight */
+	      if (i1>stp1) i1--;
+	      else break;
+	      if (j1<endp2) j1++;
+	      else break;
+	      
+	  } /* while j1 < endp2 */
+      }
+  }
+  
+  return(max_score);
+}	  
+
+
+float end_hom(seq1,stp1,endp1,seq2,stp2,endp2,weight)
+    char *seq1,*seq2;/* INPUT: vectors containing the sequence stored as follows:
+			A is 0, T is 1, C is 2, G is 3 */
+    
+    int stp1,stp2,endp1,endp2;  /*INPUT: starting and ending positions */
+    float **weight; /* INPUT: user-specified weights for a-t and c-g matches */
+    
+{ int i,j,i1,j1; 
+  /* int len1,len2; */
+  float score,max_score;
+  
+  /* compare all possible alignments of the two sequences:
+     seq1 5' __________________ 3'
+     seq2         3' __________________ 5'  (opposite order, but
+     do not take the opposite
+     of the nt)
+     Start counting the score when you come to a pair of complementary
+     nucleotides, stop counting the score (governed by the weights)
+     when you come to two that are not complementary.
+     Output the maximum score.
+     */
+  
+  max_score = 0;
+  
+  i = endp1;
+  for (j=endp2; j >= stp2; j--) { /* start seq2 at the 3prime end */
+      if (i < endp1) j = stp2;
+      /* counting mechanism here is
+	 (len1,len2), (len1,len2-1), (len1,len2-2) ... (len1, 0),
+	 (len1 - 1, 0), (len1 - 2, 0)... (0,0) */
+      i1 = i;
+      j1 = j;
+      while (j1 <= endp2) {
+	  score = 0;
+	  while (weight[seq1[i1]][seq2[j1]]!=0) { /* start scoring */
+	      score += weight[seq1[i1]][seq2[j1]];
+	      if (score > max_score) max_score = score;	
+	      if (i1>stp1) i1--;
+	      else break;
+	      if (j1<endp2) j1++;
+	      else break;
+	  } /* while weight */
+	  break;
+      } /* while j1 < endp2 */
+  }
+  
+  return(max_score);
+}	  
+
+
+/* this function determines annealing scores for any nucleotide
+   versus any other nucleotide, e.g. for a A on one strand
+   and T on other, score is prm.AT_score.  For an A on one 
+   strand and C on other, score is 0 */
+
+void get_weight_matrix(wt,wt_ambig,AT,CG)
+    float **wt;
+    char *wt_ambig;/* prm.wt_ambig=avg, weight the score; 
+		      if prm.wt_ambig=full
+		      then give each alignment its maximum
+		      possible score */
+    float AT,CG;/* AT and CG score */
+{ int i,j;
+  int avg;
+  float MAX; /* greater of the AT or CG scores */
+  
+  /*this matrix conforms to the Staden ambiguities as 
+    well as the genbank ambiguities tables */
+  
+  if (!strcmp(wt_ambig,"full")) avg=0;
+  else avg=1;
+  
+  for (i=0; i<125; i++) {
+      for (j=0; j<125; j++) wt[i][j]=0;
+  }
+  
+  
+  if (avg==1) {
+      wt['A']['A']=0;
+      wt['A']['C']=0;
+      wt['A']['G']=0;
+      wt['A']['T']=AT;
+      wt['A']['U']=AT;
+      wt['A']['5']=0;
+      wt['A']['M']=0;
+      wt['A']['R']=0;
+      wt['A']['7']=AT/2.0;
+      wt['A']['W']=AT/2.0;
+      wt['A']['8']=0;
+      wt['A']['S']=0;
+      wt['A']['Y']=AT/2.0;
+      wt['A']['6']=AT/2.0;
+      wt['A']['K']=AT/2.0;
+      wt['A']['V']=0;
+      wt['A']['H']=AT/3.0;
+      wt['A']['D']=AT/3.0;
+      wt['A']['B']=AT/3.0;
+      wt['A']['X']=AT/4.0;
+      wt['A']['N']=AT/4.0;
+      wt['C']['A']=0;
+      wt['G']['A']=0;
+      wt['T']['A']=AT;
+      wt['U']['A']=AT;
+      wt['5']['A']=0;
+      wt['M']['A']=0;
+      wt['R']['A']=0;
+      wt['7']['A']=AT/2.0;
+      wt['W']['A']=AT/2.0;
+      wt['8']['A']=0;
+      wt['S']['A']=0;
+      wt['Y']['A']=AT/2.0;
+      wt['6']['A']=AT/2.0;
+      wt['K']['A']=AT/2.0;
+      wt['V']['A']=0;
+      wt['H']['A']=AT/3.0;
+      wt['D']['A']=AT/3.0;
+      wt['B']['A']=AT/3.0;
+      wt['X']['A']=AT/4.0;
+      wt['N']['A']=AT/4.0;
+      
+      
+      wt['C']['C']=0;
+      wt['C']['G']=CG;
+      wt['C']['T']=0;
+      wt['C']['U']=0;
+      wt['C']['5']=0;
+      wt['C']['M']=0;
+      wt['C']['R']=CG/2.0;
+      wt['C']['7']=0;
+      wt['C']['W']=0;
+      wt['C']['8']=CG/2.0;
+      wt['C']['S']=CG/2.0;
+      wt['C']['Y']=0;
+      wt['C']['6']=CG/2.0;
+      wt['C']['K']=CG/2.0;
+      wt['C']['V']=CG/3.0;
+      wt['C']['H']=0;
+      wt['C']['D']=CG/3.0;
+      wt['C']['B']=CG/3.0;
+      wt['C']['X']=CG/4.0;
+      wt['C']['N']=CG/4.0;
+      wt['G']['C']=CG;
+      wt['T']['C']=0;
+      wt['U']['C']=0;
+      wt['5']['C']=0;
+      wt['M']['C']=0;
+      wt['R']['C']=CG/2.0;
+      wt['7']['C']=0;
+      wt['W']['C']=0;
+      wt['8']['C']=CG/2.0;
+      wt['S']['C']=CG/2.0;
+      wt['Y']['C']=0;
+      wt['6']['C']=CG/2.0;
+      wt['K']['C']=CG/2.0;
+      wt['V']['C']=CG/3.0;
+      wt['H']['C']=0;
+      wt['D']['C']=CG/3.0;
+      wt['B']['C']=CG/3.0;
+      wt['X']['C']=CG/4.0;
+      wt['N']['C']=CG/4.0;
+      
+      wt['G']['G']=0;
+      wt['G']['T']=0;
+      wt['G']['U']=0;
+      wt['G']['5']=CG/2.0;
+      wt['G']['M']=CG/2.0;
+      wt['G']['R']=0;
+      wt['G']['7']=0;
+      wt['G']['W']=0;
+      wt['G']['8']=CG/2.0;
+      wt['G']['S']=CG/2.0;
+      wt['G']['Y']=CG/2.0;
+      wt['G']['6']=0;
+      wt['G']['K']=0;
+      wt['G']['V']=CG/3.0;
+      wt['G']['H']=CG/3.0;
+      wt['G']['D']=0;
+      wt['G']['B']=CG/3.0;
+      wt['G']['X']=CG/4.0;
+      wt['G']['N']=CG/4.0;
+      wt['T']['G']=0;
+      wt['U']['G']=0;
+      wt['5']['G']=CG/2.0;
+      wt['M']['G']=CG/2.0;
+      wt['R']['G']=0;
+      wt['7']['G']=0;
+      wt['W']['G']=0;
+      wt['8']['G']=CG/2.0;
+      wt['S']['G']=CG/2.0;
+      wt['Y']['G']=CG/2.0;
+      wt['6']['G']=0;
+      wt['K']['G']=0;
+      wt['V']['G']=CG/3.0;
+      wt['H']['G']=CG/3.0;
+      wt['D']['G']=0;
+      wt['B']['G']=CG/3.0;
+      wt['X']['G']=CG/4.0;
+      wt['N']['G']=CG/4.0;
+      
+      wt['T']['T']=0;
+      wt['T']['U']=0;
+      wt['T']['5']=AT/2.0;
+      wt['T']['M']=AT/2.0;
+      wt['T']['R']=AT/2.0;
+      wt['T']['7']=AT/2.0;
+      wt['T']['W']=AT/2.0;
+      wt['T']['8']=0;
+      wt['T']['S']=0;
+      wt['T']['Y']=0;
+      wt['T']['6']=0;
+      wt['T']['K']=0;
+      wt['T']['V']=AT/3.0;
+      wt['T']['H']=AT/3.0;
+      wt['T']['D']=AT/3.0;
+      wt['T']['B']=0;
+      wt['T']['X']=AT/4.0;
+      wt['T']['N']=AT/4.0;
+      wt['U']['T']=0;
+      wt['5']['T']=AT/2.0;
+      wt['M']['T']=AT/2.0;
+      wt['R']['T']=AT/2.0;
+      wt['7']['T']=AT/2.0;
+      wt['W']['T']=AT/2.0;
+      wt['8']['T']=0;
+      wt['S']['T']=0;
+      wt['Y']['T']=0;
+      wt['6']['T']=0;
+      wt['K']['T']=0;
+      wt['V']['T']=AT/3.0;
+      wt['H']['T']=AT/3.0;
+      wt['D']['T']=AT/3.0;
+      wt['B']['T']=0;
+      wt['X']['T']=AT/4.0;
+      wt['N']['T']=AT/4.0;
+      
+      wt['U']['U']=0;
+      wt['U']['5']=0;
+      wt['U']['M']=AT/2.0;
+      wt['U']['R']=AT/2.0;
+      wt['U']['7']=AT/2.0;
+      wt['U']['W']=AT/2.0;
+      wt['U']['8']=AT/2.0;
+      wt['U']['S']=0;
+      wt['U']['Y']=0;
+      wt['U']['6']=0;
+      wt['U']['K']=0;
+      wt['U']['V']=0;
+      wt['U']['H']=AT/3.0;
+      wt['U']['D']=AT/3.0;
+      wt['U']['B']=0;
+      wt['U']['X']=AT/4.0;
+      wt['U']['N']=AT/4.0;
+      wt['5']['U']=0;
+      wt['M']['U']=AT/2.0;
+      wt['R']['U']=AT/2.0;
+      wt['7']['U']=AT/2.0;
+      wt['W']['U']=AT/2.0;
+      wt['8']['U']=AT/2.0;
+      wt['S']['U']=0;
+      wt['Y']['U']=0;
+      wt['6']['U']=0;
+      wt['K']['U']=0;
+      wt['V']['U']=0;
+      wt['H']['U']=AT/3.0;
+      wt['D']['U']=AT/3.0;
+      wt['B']['U']=0;
+      wt['X']['U']=AT/4.0;
+      wt['N']['U']=AT/4.0;
+      
+      
+      wt['5']['5']=0;
+      wt['5']['M']=0;
+      wt['5']['R']=CG/4.0;
+      wt['5']['7']=AT/4.0;
+      wt['5']['W']=AT/4.0;
+      wt['5']['8']=CG/4.0;
+      wt['5']['S']=CG/4.0;
+      wt['5']['Y']=AT/4.0;
+      wt['5']['6']=(AT+CG)/4.0;
+      wt['5']['K']=(AT+CG)/4.0;
+      wt['5']['V']=CG/6.0;
+      wt['5']['H']=AT/6.0;
+      wt['5']['D']=(AT+CG)/6.0;
+      wt['5']['B']=(AT+CG)/6.0;
+      wt['5']['X']=(AT+CG)/8.0;
+      wt['5']['N']=(AT+CG)/8.0;
+      wt['M']['5']=0;
+      wt['R']['5']=CG/4.0;
+      wt['7']['5']=AT/4.0;
+      wt['W']['5']=AT/4.0;
+      wt['8']['5']=CG/4.0;
+      wt['S']['5']=CG/4.0;
+      wt['Y']['5']=AT/4.0;
+      wt['6']['5']=(AT+CG)/4.0;
+      wt['K']['5']=(AT+CG)/4.0;
+      wt['V']['5']=CG/6.0;
+      wt['H']['5']=AT/6.0;
+      wt['D']['5']=(AT+CG)/6.0;
+      wt['B']['5']=(AT+CG)/6.0;
+      wt['X']['5']=(AT+CG)/8.0;
+      wt['N']['5']=(AT+CG)/8.0;
+      
+      
+      wt['M']['M']=0;
+      wt['M']['R']=CG/4.0;
+      wt['M']['7']=AT/4.0;
+      wt['M']['W']=AT/4.0;
+      wt['M']['8']=CG/4.0;
+      wt['M']['S']=CG/4.0;
+      wt['M']['Y']=AT/4.0;
+      wt['M']['6']=(AT+CG)/4.0;
+      wt['M']['K']=(AT+CG)/4.0;
+      wt['M']['V']=CG/6.0;
+      wt['M']['H']=AT/6.0;
+      wt['M']['D']=(AT+CG)/6.0;
+      wt['M']['B']=(AT+CG)/6.0;
+      wt['M']['X']=(AT+CG)/8.0;
+      wt['M']['N']=(AT+CG)/8.0;
+      wt['R']['M']=CG/4.0;
+      wt['7']['M']=AT/4.0;
+      wt['W']['M']=AT/4.0;
+      wt['8']['M']=CG/4.0;
+      wt['S']['M']=CG/4.0;
+      wt['Y']['M']=AT/4.0;
+      wt['6']['M']=(AT+CG)/4.0;
+      wt['K']['M']=(AT+CG)/4.0;
+      wt['V']['M']=CG/6.0;
+      wt['H']['M']=AT/6.0;
+      wt['D']['M']=(AT+CG)/6.0;
+      wt['B']['M']=(AT+CG)/6.0;
+      wt['X']['M']=(AT+CG)/8.0;
+      wt['N']['M']=(AT+CG)/8.0;
+      
+      wt['R']['R']=0;
+      wt['R']['7']=AT/4.0;
+      wt['R']['W']=AT/4.0;
+      wt['R']['8']=CG/4.0;
+      wt['R']['S']=CG/4.0;
+      wt['R']['Y']=(AT+CG)/4.0;
+      wt['R']['6']=AT/4.0;
+      wt['R']['K']=AT/4.0;
+      wt['R']['V']=CG/6.0;
+      wt['R']['H']=(AT+CG)/6.0;
+      wt['R']['D']=AT/6.0;
+      wt['R']['B']=(AT+CG)/6.0;
+      wt['R']['X']=(AT+CG)/8.0;
+      wt['R']['N']=(AT+CG)/8.0;
+      wt['7']['R']=AT/4.0;
+      wt['W']['R']=AT/4.0;
+      wt['8']['R']=CG/4.0;
+      wt['S']['R']=CG/4.0;
+      wt['Y']['R']=(AT+CG)/4.0;
+      wt['6']['R']=AT/4.0;
+      wt['K']['R']=AT/4.0;
+      wt['V']['R']=CG/6.0;
+      wt['H']['R']=(AT+CG)/6.0;
+      wt['D']['R']=AT/6.0;
+      wt['B']['R']=(AT+CG)/6.0;
+      wt['X']['R']=(AT+CG)/8.0;
+      wt['N']['R']=(AT+CG)/8.0;
+      
+      wt['7']['7']=AT/2.0;
+      wt['7']['W']=AT/2.0;
+      wt['7']['8']=0;
+      wt['7']['S']=0;
+      wt['7']['Y']=AT/4.0;
+      wt['7']['6']=AT/4.0;
+      wt['7']['K']=AT/4.0;
+      wt['7']['V']=AT/6.0;
+      wt['7']['H']=AT/3.0;
+      wt['7']['D']=AT/3.0;
+      wt['7']['B']=AT/6.0;
+      wt['7']['X']=AT/4.0;
+      wt['7']['N']=AT/4.0;
+      wt['W']['7']=AT/2.0;
+      wt['8']['7']=0;
+      wt['S']['7']=0;
+      wt['Y']['7']=AT/4.0;
+      wt['6']['7']=AT/4.0;
+      wt['K']['7']=AT/4.0;
+      wt['V']['7']=AT/6.0;
+      wt['H']['7']=AT/3.0;
+      wt['D']['7']=AT/3.0;
+      wt['B']['7']=AT/6.0;
+      wt['X']['7']=AT/4.0;
+      wt['N']['7']=AT/4.0;
+      
+      wt['W']['W']=AT/2.0;
+      wt['W']['8']=0;
+      wt['W']['S']=0;
+      wt['W']['Y']=AT/4.0;
+      wt['W']['6']=AT/4.0;
+      wt['W']['K']=AT/4.0;
+      wt['W']['V']=AT/6.0;
+      wt['W']['H']=AT/3.0;
+      wt['W']['D']=AT/3.0;
+      wt['W']['B']=AT/6.0;
+      wt['W']['X']=AT/4.0;
+      wt['W']['N']=AT/4.0;
+      wt['8']['W']=0;
+      wt['S']['W']=0;
+      wt['Y']['W']=AT/4.0;
+      wt['6']['W']=AT/4.0;
+      wt['K']['W']=AT/4.0;
+      wt['V']['W']=AT/6.0;
+      wt['H']['W']=AT/3.0;
+      wt['D']['W']=AT/3.0;
+      wt['B']['W']=AT/6.0;
+      wt['X']['W']=AT/4.0;
+      wt['N']['W']=AT/4.0;
+      
+      wt['8']['8']=CG/2.0;
+      wt['8']['S']=CG/2.0;
+      wt['8']['Y']=CG/4.0;
+      wt['8']['6']=CG/4.0;
+      wt['8']['K']=CG/4.0;
+      wt['8']['V']=CG/3.0;
+      wt['8']['H']=CG/6.0;
+      wt['8']['D']=CG/6.0;
+      wt['8']['B']=CG/3.0;
+      wt['8']['X']=CG/4.0;
+      wt['8']['N']=CG/4.0;
+      wt['S']['8']=CG/2.0;
+      wt['Y']['8']=CG/4.0;
+      wt['6']['8']=CG/4.0;
+      wt['K']['8']=CG/4.0;
+      wt['V']['8']=CG/3.0;
+      wt['H']['8']=CG/6.0;
+      wt['D']['8']=CG/6.0;
+      wt['B']['8']=CG/3.0;
+      wt['X']['8']=CG/4.0;
+      wt['N']['8']=CG/4.0;
+      
+      wt['S']['S']=CG/2.0;
+      wt['S']['Y']=CG/4.0;
+      wt['S']['6']=CG/4.0;
+      wt['S']['K']=CG/4.0;
+      wt['S']['V']=CG/3.0;
+      wt['S']['H']=CG/6.0;
+      wt['S']['D']=CG/6.0;
+      wt['S']['B']=CG/3.0;
+      wt['S']['X']=CG/4.0;
+      wt['S']['N']=CG/4.0;
+      wt['Y']['S']=CG/4.0;
+      wt['6']['S']=CG/4.0;
+      wt['K']['S']=CG/4.0;
+      wt['V']['S']=CG/3.0;
+      wt['H']['S']=CG/6.0;
+      wt['D']['S']=CG/6.0;
+      wt['B']['S']=CG/3.0;
+      wt['X']['S']=CG/4.0;
+      wt['N']['S']=CG/4.0;
+      
+      wt['Y']['Y']=0.0;
+      wt['Y']['6']=CG/4.0;
+      wt['Y']['K']=CG/4.0;
+      wt['Y']['V']=(AT+CG)/6.0;
+      wt['Y']['H']=AT/6.0;
+      wt['Y']['D']=(AT+CG)/6.0;
+      wt['Y']['B']=CG/6.0;
+      wt['Y']['X']=(AT+CG)/8.0;
+      wt['Y']['N']=(AT+CG)/8.0;
+      wt['6']['Y']=CG/4.0;
+      wt['K']['Y']=CG/4.0;
+      wt['V']['Y']=(AT+CG)/6.0;
+      wt['H']['Y']=AT/6.0;
+      wt['D']['Y']=(AT+CG)/6.0;
+      wt['B']['Y']=CG/6.0;
+      wt['X']['Y']=(AT+CG)/8.0;
+      wt['N']['Y']=(AT+CG)/8.0;
+      
+      wt['6']['6']=0;
+      wt['6']['K']=0;
+      wt['6']['V']=(AT+CG)/6.0;
+      wt['6']['H']=(AT+CG)/6.0;
+      wt['6']['D']=AT/6.0;
+      wt['6']['B']=CG/6.0;
+      wt['6']['X']=(AT+CG)/8.0;
+      wt['6']['N']=(AT+CG)/8.0;
+      wt['K']['6']=0;
+      wt['V']['6']=(AT+CG)/6.0;
+      wt['H']['6']=(AT+CG)/6.0;
+      wt['D']['6']=AT/6.0;
+      wt['B']['6']=CG/6.0;
+      wt['X']['6']=(AT+CG)/8.0;
+      wt['N']['6']=(AT+CG)/8.0;
+      
+      wt['K']['K']=0;
+      wt['K']['V']=(AT+CG)/6.0;
+      wt['K']['H']=(AT+CG)/6.0;
+      wt['K']['D']=AT/6.0;
+      wt['K']['B']=CG/6.0;
+      wt['K']['X']=(AT+CG)/8.0;
+      wt['K']['N']=(AT+CG)/8.0;
+      wt['V']['K']=(AT+CG)/6.0;
+      wt['H']['K']=(AT+CG)/6.0;
+      wt['D']['K']=AT/6.0;
+      wt['B']['K']=CG/6.0;
+      wt['X']['K']=(AT+CG)/8.0;
+      wt['N']['K']=(AT+CG)/8.0;
+      
+      wt['V']['V']=2.0*CG/9.0;
+      wt['V']['H']=(AT+CG)/9.0;
+      wt['V']['D']=(AT+CG)/9.0;
+      wt['V']['B']=(AT+2.0*CG)/9.0;
+      wt['V']['X']=(AT+2.0*CG)/12.0;
+      wt['V']['N']=(AT+2.0*CG)/12.0;
+      wt['H']['V']=(AT+CG)/9.0;
+      wt['D']['V']=(AT+CG)/9.0;
+      wt['B']['V']=(AT+2.0*CG)/9.0;
+      wt['X']['V']=(AT+2.0*CG)/12.0;
+      wt['N']['V']=(AT+2.0*CG)/12.0;
+      
+      
+      wt['H']['H']=2.0*AT/9.0;
+      wt['H']['D']=(2.0*AT+CG)/9.0;
+      wt['H']['B']=(AT+CG)/9.0;
+      wt['H']['X']=(2.0*AT+CG)/12.0;
+      wt['H']['N']=(2.0*AT+CG)/12.0;
+      wt['D']['H']=(2.0*AT+CG)/9.0;
+      wt['B']['H']=(AT+CG)/9.0;
+      wt['X']['H']=(2.0*AT+CG)/12.0;
+      wt['N']['H']=(2.0*AT+CG)/12.0;
+      
+      
+      wt['D']['D']=2.0*AT/9.0;
+      wt['D']['B']=(AT+CG)/9.0;
+      wt['D']['X']=(2.0*AT+CG)/12.0;
+      wt['D']['N']=(2.0*AT+CG)/12.0;
+      wt['B']['D']=(AT+CG)/9.0;
+      wt['X']['D']=(2.0*AT+CG)/12.0;
+      wt['N']['D']=(2.0*AT+CG)/12.0;
+      
+      wt['B']['B']=2*CG/9.0;
+      wt['B']['X']=(AT+2.0*CG)/12.0;
+      wt['B']['N']=(AT+2.0*CG)/12.0;
+      wt['X']['B']=(AT+2.0*CG)/12.0;
+      wt['N']['B']=(AT+2.0*CG)/12.0;
+      
+      wt['X']['N']=(AT+CG)/8.0;
+      wt['X']['X']=(AT+CG)/8.0;
+      wt['N']['N']=(AT+CG)/8.0;
+      wt['N']['X']=(AT+CG)/8.0;
+      
+      
+  }
+  
+  else  {
+      if (AT>CG) MAX=AT;
+      else MAX=CG;
+      
+      wt['A']['A']=0;
+      wt['A']['C']=0;
+      wt['A']['G']=0;
+      wt['A']['T']=AT;
+      wt['A']['U']=AT;
+      wt['A']['5']=0;
+      wt['A']['M']=0;
+      wt['A']['R']=0;
+      wt['A']['7']=AT;
+      wt['A']['W']=AT;
+      wt['A']['8']=0;
+      wt['A']['S']=0;
+      wt['A']['Y']=AT;
+      wt['A']['6']=AT;
+      wt['A']['K']=AT;
+      wt['A']['V']=0;
+      wt['A']['H']=AT;
+      wt['A']['D']=AT;
+      wt['A']['B']=AT;
+      wt['A']['X']=AT;
+      wt['A']['N']=AT;
+      wt['C']['A']=0;
+      wt['G']['A']=0;
+      wt['T']['A']=AT;
+      wt['U']['A']=AT;
+      wt['5']['A']=0;
+      wt['M']['A']=0;
+      wt['R']['A']=0;
+      wt['7']['A']=AT;
+      wt['W']['A']=AT;
+      wt['8']['A']=0;
+      wt['S']['A']=0;
+      wt['Y']['A']=AT;
+      wt['6']['A']=AT;
+      wt['K']['A']=AT;
+      wt['V']['A']=0;
+      wt['H']['A']=AT;
+      wt['D']['A']=AT;
+      wt['B']['A']=AT;
+      wt['X']['A']=AT;
+      wt['N']['A']=AT;
+      
+      
+      wt['C']['C']=0;
+      wt['C']['G']=CG;
+      wt['C']['T']=0;
+      wt['C']['U']=0;
+      wt['C']['5']=0;
+      wt['C']['M']=0;
+      wt['C']['R']=CG;
+      wt['C']['7']=0;
+      wt['C']['W']=0;
+      wt['C']['8']=CG;
+      wt['C']['S']=CG;
+      wt['C']['Y']=0;
+      wt['C']['6']=CG;
+      wt['C']['K']=CG;
+      wt['C']['V']=CG;
+      wt['C']['H']=0;
+      wt['C']['D']=CG;
+      wt['C']['B']=CG;
+      wt['C']['X']=CG;
+      wt['C']['N']=CG;
+      wt['G']['C']=CG;
+      wt['T']['C']=0;
+      wt['U']['C']=0;
+      wt['5']['C']=0;
+      wt['M']['C']=0;
+      wt['R']['C']=CG;
+      wt['7']['C']=0;
+      wt['W']['C']=0;
+      wt['8']['C']=CG;
+      wt['S']['C']=CG;
+      wt['Y']['C']=0;
+      wt['6']['C']=CG;
+      wt['K']['C']=CG;
+      wt['V']['C']=CG;
+      wt['H']['C']=0;
+      wt['D']['C']=CG;
+      wt['B']['C']=CG;
+      wt['X']['C']=CG;
+      wt['N']['C']=CG;
+      
+      wt['G']['G']=0;
+      wt['G']['T']=0;
+      wt['G']['U']=0;
+      wt['G']['5']=CG;
+      wt['G']['M']=CG;
+      wt['G']['R']=0;
+      wt['G']['7']=0;
+      wt['G']['W']=0;
+      wt['G']['8']=CG;
+      wt['G']['S']=CG;
+      wt['G']['Y']=CG;
+      wt['G']['6']=0;
+      wt['G']['K']=0;
+      wt['G']['V']=CG;
+      wt['G']['H']=CG;
+      wt['G']['D']=0;
+      wt['G']['B']=CG;
+      wt['G']['X']=CG;
+      wt['G']['N']=CG;
+      wt['T']['G']=0;
+      wt['U']['G']=0;
+      wt['5']['G']=CG;
+      wt['M']['G']=CG;
+      wt['R']['G']=0;
+      wt['7']['G']=0;
+      wt['W']['G']=0;
+      wt['8']['G']=CG;
+      wt['S']['G']=CG;
+      wt['Y']['G']=CG;
+      wt['6']['G']=0;
+      wt['K']['G']=0;
+      wt['V']['G']=CG;
+      wt['H']['G']=CG;
+      wt['D']['G']=0;
+      wt['B']['G']=CG;
+      wt['X']['G']=CG;
+      wt['N']['G']=CG;
+      
+      wt['T']['T']=AT;
+      wt['T']['U']=AT;
+      wt['T']['5']=AT;
+      wt['T']['M']=AT;
+      wt['T']['R']=AT;
+      wt['T']['7']=AT;
+      wt['T']['W']=AT;
+      wt['T']['8']=0;
+      wt['T']['S']=0;
+      wt['T']['Y']=0;
+      wt['T']['6']=0;
+      wt['T']['K']=0;
+      wt['T']['V']=AT;
+      wt['T']['H']=AT;
+      wt['T']['D']=AT;
+      wt['T']['B']=0;
+      wt['T']['X']=AT;
+      wt['T']['N']=AT;
+      wt['U']['T']=AT;
+      wt['5']['T']=AT;
+      wt['M']['T']=AT;
+      wt['R']['T']=AT;
+      wt['7']['T']=AT;
+      wt['W']['T']=AT;
+      wt['8']['T']=0;
+      wt['S']['T']=0;
+      wt['Y']['T']=0;
+      wt['6']['T']=0;
+      wt['K']['T']=0;
+      wt['V']['T']=AT;
+      wt['H']['T']=AT;
+      wt['D']['T']=AT;
+      wt['B']['T']=0;
+      wt['X']['T']=AT;
+      wt['N']['T']=AT;
+      
+      wt['U']['U']=0;
+      wt['U']['5']=AT;
+      wt['U']['M']=AT;
+      wt['U']['R']=AT;
+      wt['U']['7']=AT;
+      wt['U']['W']=AT;
+      wt['U']['8']=0;
+      wt['U']['S']=0;
+      wt['U']['Y']=0;
+      wt['U']['6']=0;
+      wt['U']['K']=0;
+      wt['U']['V']=AT;
+      wt['U']['H']=AT;
+      wt['U']['D']=AT;
+      wt['U']['B']=0;
+      wt['U']['X']=AT;
+      wt['U']['N']=AT;
+      wt['5']['U']=AT;
+      wt['M']['U']=AT;
+      wt['R']['U']=AT;
+      wt['7']['U']=AT;
+      wt['W']['U']=AT;
+      wt['8']['U']=0;
+      wt['S']['U']=0;
+      wt['Y']['U']=0;
+      wt['6']['U']=0;
+      wt['K']['U']=0;
+      wt['V']['U']=AT;
+      wt['H']['U']=AT;
+      wt['D']['U']=AT;
+      wt['B']['U']=0;
+      wt['X']['U']=AT;
+      wt['N']['U']=AT;
+      
+      
+      wt['5']['5']=0;
+      wt['5']['M']=0;
+      wt['5']['R']=CG;
+      wt['5']['7']=AT;
+      wt['5']['W']=AT;
+      wt['5']['8']=CG;
+      wt['5']['S']=CG;
+      wt['5']['Y']=AT;
+      wt['5']['6']=MAX;
+      wt['5']['K']=MAX;
+      wt['5']['V']=CG;
+      wt['5']['H']=AT;
+      wt['5']['D']=MAX;
+      wt['5']['B']=MAX;
+      wt['5']['X']=MAX;
+      wt['5']['N']=MAX;
+      wt['M']['5']=0;
+      wt['R']['5']=CG;
+      wt['7']['5']=AT;
+      wt['W']['5']=AT;
+      wt['8']['5']=CG;
+      wt['S']['5']=CG;
+      wt['Y']['5']=AT;
+      wt['6']['5']=MAX;
+      wt['K']['5']=MAX;
+      wt['V']['5']=CG;
+      wt['H']['5']=AT;
+      wt['D']['5']=MAX;
+      wt['B']['5']=MAX;
+      wt['X']['5']=MAX;
+      wt['N']['5']=MAX;
+      
+      
+      wt['M']['M']=0;
+      wt['M']['R']=CG;
+      wt['M']['7']=AT;
+      wt['M']['W']=AT;
+      wt['M']['8']=CG;
+      wt['M']['S']=CG;
+      wt['M']['Y']=AT;
+      wt['M']['6']=MAX;
+      wt['M']['K']=MAX;
+      wt['M']['V']=CG;
+      wt['M']['H']=AT;
+      wt['M']['D']=MAX;
+      wt['M']['B']=MAX;
+      wt['M']['X']=MAX;
+      wt['M']['N']=MAX;
+      wt['R']['M']=CG;
+      wt['7']['M']=AT;
+      wt['W']['M']=AT;
+      wt['8']['M']=CG;
+      wt['S']['M']=CG;
+      wt['Y']['M']=AT;
+      wt['6']['M']=MAX;
+      wt['K']['M']=MAX;
+      wt['V']['M']=CG;
+      wt['H']['M']=AT;
+      wt['D']['M']=MAX;
+      wt['B']['M']=MAX;
+      wt['X']['M']=MAX;
+      wt['N']['M']=MAX;
+      
+      wt['R']['R']=0;
+      wt['R']['7']=AT;
+      wt['R']['W']=AT;
+      wt['R']['8']=CG;
+      wt['R']['S']=CG;
+      wt['R']['Y']=MAX;
+      wt['R']['6']=AT;
+      wt['R']['K']=AT;
+      wt['R']['V']=CG;
+      wt['R']['H']=MAX;
+      wt['R']['D']=AT;
+      wt['R']['B']=MAX;
+      wt['R']['X']=MAX;
+      wt['R']['N']=MAX;
+      wt['7']['R']=AT;
+      wt['W']['R']=AT;
+      wt['8']['R']=CG;
+      wt['S']['R']=CG;
+      wt['Y']['R']=MAX;
+      wt['6']['R']=AT;
+      wt['K']['R']=AT;
+      wt['V']['R']=CG;
+      wt['H']['R']=MAX;
+      wt['D']['R']=AT;
+      wt['B']['R']=MAX;
+      wt['X']['R']=MAX;
+      wt['N']['R']=MAX;
+      
+      wt['7']['7']=AT;
+      wt['7']['W']=AT;
+      wt['7']['8']=0;
+      wt['7']['S']=0;
+      wt['7']['Y']=AT;
+      wt['7']['6']=AT;
+      wt['7']['K']=AT;
+      wt['7']['V']=AT;
+      wt['7']['H']=AT;
+      wt['7']['D']=AT;
+      wt['7']['B']=AT;
+      wt['7']['X']=AT;
+      wt['7']['N']=AT;
+      wt['W']['7']=AT;
+      wt['8']['7']=0;
+      wt['S']['7']=0;
+      wt['Y']['7']=AT;
+      wt['6']['7']=AT;
+      wt['K']['7']=AT;
+      wt['V']['7']=AT;
+      wt['H']['7']=AT;
+      wt['D']['7']=AT;
+      wt['B']['7']=AT;
+      wt['X']['7']=AT;
+      wt['N']['7']=AT;
+      
+      wt['W']['W']=AT;
+      wt['W']['8']=0;
+      wt['W']['S']=0;
+      wt['W']['Y']=AT;
+      wt['W']['6']=AT;
+      wt['W']['K']=AT;
+      wt['W']['V']=AT;
+      wt['W']['H']=AT;
+      wt['W']['D']=AT;
+      wt['W']['B']=AT;
+      wt['W']['X']=AT;
+      wt['W']['N']=AT;
+      wt['8']['W']=0;
+      wt['S']['W']=0;
+      wt['Y']['W']=AT;
+      wt['6']['W']=AT;
+      wt['K']['W']=AT;
+      wt['V']['W']=AT;
+      wt['H']['W']=AT;
+      wt['D']['W']=AT;
+      wt['B']['W']=AT;
+      wt['X']['W']=AT;
+      wt['N']['W']=AT;
+      
+      wt['8']['8']=CG;
+      wt['8']['S']=CG;
+      wt['8']['Y']=CG;
+      wt['8']['6']=CG;
+      wt['8']['K']=CG;
+      wt['8']['V']=CG;
+      wt['8']['H']=CG;
+      wt['8']['D']=CG;
+      wt['8']['B']=CG;
+      wt['8']['X']=CG;
+      wt['8']['N']=CG;
+      wt['S']['8']=CG;
+      wt['Y']['8']=CG;
+      wt['6']['8']=CG;
+      wt['K']['8']=CG;
+      wt['V']['8']=CG;
+      wt['H']['8']=CG;
+      wt['D']['8']=CG;
+      wt['B']['8']=CG;
+      wt['X']['8']=CG;
+      wt['N']['8']=CG;
+      
+      wt['S']['S']=CG;
+      wt['S']['Y']=CG;
+      wt['S']['6']=CG;
+      wt['S']['K']=CG;
+      wt['S']['V']=CG;
+      wt['S']['H']=CG;
+      wt['S']['D']=CG;
+      wt['S']['B']=CG;
+      wt['S']['X']=CG;
+      wt['S']['N']=CG;
+      wt['Y']['S']=CG;
+      wt['6']['S']=CG;
+      wt['K']['S']=CG;
+      wt['V']['S']=CG;
+      wt['H']['S']=CG;
+      wt['D']['S']=CG;
+      wt['B']['S']=CG;
+      wt['X']['S']=CG;
+      wt['N']['S']=CG;
+      
+      wt['Y']['Y']=0;
+      wt['Y']['6']=CG;
+      wt['Y']['K']=CG;
+      wt['Y']['V']=MAX;
+      wt['Y']['H']=AT;
+      wt['Y']['D']=MAX;
+      wt['Y']['B']=CG;
+      wt['Y']['X']=MAX;
+      wt['Y']['N']=MAX;
+      wt['6']['Y']=CG;
+      wt['K']['Y']=CG;
+      wt['V']['Y']=MAX;
+      wt['H']['Y']=AT;
+      wt['D']['Y']=MAX;
+      wt['B']['Y']=CG;
+      wt['X']['Y']=MAX;
+      wt['N']['Y']=MAX;
+      
+      
+      wt['6']['6']=0;
+      wt['6']['K']=0;
+      wt['6']['V']=MAX;
+      wt['6']['H']=MAX;
+      wt['6']['D']=AT;
+      wt['6']['B']=CG;
+      wt['6']['X']=MAX;
+      wt['6']['N']=MAX;
+      wt['K']['6']=0;
+      wt['V']['6']=MAX;
+      wt['H']['6']=MAX;
+      wt['D']['6']=AT;
+      wt['B']['6']=CG;
+      wt['X']['6']=MAX;
+      wt['N']['6']=MAX;
+      
+      wt['K']['K']=0;
+      wt['K']['V']=MAX;
+      wt['K']['H']=MAX;
+      wt['K']['D']=AT;
+      wt['K']['B']=CG;
+      wt['K']['X']=MAX;
+      wt['K']['N']=MAX;
+      wt['V']['K']=MAX;
+      wt['H']['K']=MAX;
+      wt['D']['K']=AT;
+      wt['B']['K']=CG;
+      wt['X']['K']=MAX;
+      wt['N']['K']=MAX;
+      
+      wt['V']['V']=CG;
+      wt['V']['H']=MAX;
+      wt['V']['D']=MAX;
+      wt['V']['B']=MAX;
+      wt['V']['X']=MAX;
+      wt['V']['N']=MAX;
+      wt['H']['V']=MAX;
+      wt['D']['V']=MAX;
+      wt['B']['V']=MAX;
+      wt['X']['V']=MAX;
+      wt['N']['V']=MAX;
+      
+      
+      wt['H']['H']=AT;
+      wt['H']['D']=MAX;
+      wt['H']['B']=MAX;
+      wt['H']['X']=MAX;
+      wt['H']['N']=MAX;
+      wt['D']['H']=MAX;
+      wt['B']['H']=MAX;
+      wt['X']['H']=MAX;
+      wt['N']['H']=MAX;
+      
+      
+      wt['D']['D']=AT;
+      wt['D']['B']=MAX;
+      wt['D']['X']=MAX;
+      wt['D']['N']=MAX;
+      wt['B']['D']=MAX;
+      wt['X']['D']=MAX;
+      wt['N']['D']=MAX;
+      
+      wt['B']['B']=CG;
+      wt['B']['X']=MAX;
+      wt['B']['N']=MAX;
+      wt['X']['B']=MAX;
+      wt['N']['B']=MAX;
+      
+      wt['X']['N']=MAX;
+      wt['N']['X']=MAX;
+      wt['N']['N']=MAX;
+      wt['X']['X']=MAX;
+      
+  }
+  
+  
+}
+
+
+/* this function is the heart analysis.c -- it chooses the
+   candidate primers and products */
+#ifdef SUBVERSION
+int  analysis(seq,stp,endp,stp2,endp2,OSP_RESULTS,score_info)
+    char *seq;
+    OSP_Results *OSP_RESULTS;
+    char *score_info;
+#endif
+#if defined(TEXTVERSION) || defined(XVERSION)
+    int  analysis(stp,endp,stp2,endp2)
+#endif
+    
+    int stp,endp;  /* starting and ending indices of sequence defining 
+		      the analysis region */
+    int stp2,endp2; /* defines the starting and ending points on the
+		       sequence used to find antisense primers*/
+    /* returns a 1 if all the analysis went okay */
+    
+{ int i,j,k,iii; /*indexes*/
+  int analysis_ok=1; /*turns to 0 if no products were found*/
+  int pos1,pos2;
+  int sense; /*0 if looking at the anti-sense candidates, and 1 if the sense */
+  int starti,endi;
+  float prcnt_gc; /* prcnt G+C content */
+  float prod_prcnt_gc; /* prcnt GC content for a given product */
+/*  int cmpl; /* 1 if the antisense primer is to be transformed to its 
+	       complimentary form before looking at primer-primer
+	       homologies */
+  float hmlg,hmlga,end_hmlg,end_hmlga; /* homology score and end-homology score */
+  float ohmlg=0,oend_hmlg=0;        /* homology score and end-homology score 
+					   for primer versus other sequence */
+  char *c_seq;   /*complementary sequence */
+  int len; /* amplified product length */
+  float tm; /* melting temperature */
+  float **weight;  /* weights: A-T = 2, T-A = 2, C-G = 4, G-C = 4 
+		      e.g. w['A']['T']=prm.AT_score
+		      These weights are used to assign scores to primer-
+		      primer homologies */
+  int num,ii,jjj;
+  int num_delta_tm; /* number of products rejected based on difference in Tm between the
+		       two primers*/
+  int  num_prod_cons; /* number of products considered */
+  int num_anti,num_sense; /* number of (anti)sense primer candidates considered */
+  int num_ident_ends; /* number rejected because of identical endpoints, only shortest one is displayed */
+  int num_a_n,num_s_n; /* number of primer candidates having an N, thus rejected */
+  float a_pgc; /* prcnt gc for antisense primer */
+  float s_pgc; /* prcnt gc for sense primer */
+  int num_gcs=0; /* number of end_nucs-type sequences found */
+  int num_prod_len; /* number of projects rejected based on length */
+  int s_gc_rej=0,a_gc_rej=0; /* number of sense and antisense  primers rejected based on gc content */
+  int s_hmlg_rej=0,a_hmlg_rej=0; /* number of sense and anti primers rejected based on self-homology */
+  int s_other_hmlg_rej=0,a_other_hmlg_rej=0; /* number of sense and anti 
+						primers rejected based on self-homology */
+  int p_hmlg_rej=0; /* number of products rejected based on 
+		       primer-primer homology */
+  int p_primprod_hmlg_rej=0; /* number of products rejected based on 
+				primer-product homology */
+  int p_tm_rej = 0; /* number of products rejected based on melting temperature */
+  int prod_gc_rej = 0; /* number of products rejected based on percent gc content */
+  int s_prim_tm_rej = 0; /* number of sense primers rejected based on tm */
+  int a_prim_tm_rej = 0; /* number of antisense primers rejected based on tm */
+  float delta_tm; /* difference between tms of primer1 and 2 */
+  int *is_nuc; /* 1 if the letter in the array is AGC or T and 0 if not */
+  int accepted; /* keeps track of whether the preceding product considered was accepted */
+  int ident; /* if a sense primer with the same endp, and an identical antisense primer
+		has already been accepted ident = 1, and the product is thrown away */
+  float p1p_hmlg,p2p_hmlg; /* primer 1 or 2 vs product homology, calculated as greater
+			      of the scores when the primer is paired against both the
+			      top and bottom strand */
+  float p1p_hmlg_c,p2p_hmlg_c;  /* primer1 or 2 vs product homology against c_seq */
+  float p1p_end_hmlg,p2p_end_hmlg; /* primer 1 or 2 vs product 3' homology, calculated as greater
+				      of the scores when the primer is paired against both the
+				      top and bottom strand */
+  float p1p_end_hmlg_c,p2p_end_hmlg_c;  /* primer1 or 2 vs product 3' homology against c_seq */
+  float curr_ohmlg; /* used for temporary storage of homology vs other sequence scores*/
+  float curr_oend_hmlg;  /* used for temporary storage of homology vs other sequence scores*/
+/*  float pseq_hmlg_c, pseq_end_hmlg_c; /*used for temp storage of homology scores */
+  float pp_hmlg,pp_end_hmlg; /*used for temp storage of homology scores */
+  
+  float prim_tm; /* primer melting temperature */
+  int *a_gc_array; /* array which store the number of GorC in the string up to that indices
+		      for the antisense strand */
+  int *s_gc_array; /* array which store the number of GorC in the string up to that indices 
+		      for the sense strand */
+  
+  int passed; /* used as a boolean to indicate whether or not the primer
+		 has passed a certain criterion */
+/*  int num_to_compare; /* number of other sequences to for primer-other 
+			 homologies.  it will equal num_other_seqs if the
+			 search is single stranded; it will equal num_other_seq
+			 times 2 if double stranded search */
+  char *str;
+  int add_okay;
+  int num_ambigs; /* in calculation of melting temperature, I need
+		     to be able to ignore any ambiguous nucs, therefore
+		     I calculate the number of ambiguities in a
+		     stretch of sequence and subtract from the length
+		     before calculating melting temperature*/
+  
+  /* allocations */
+  a_gc_array = (int *)our_alloc(MAX_SEQ_LEN * sizeof(int));
+  s_gc_array = (int *)our_alloc(MAX_SEQ_LEN * sizeof(int));
+  c_seq = (char *)our_alloc(MAX_SEQ_LEN * sizeof(char));
+  str = (char *)our_alloc(1000 * sizeof(char));
+  /* allocate for 125 because z is 122 in ascii*/
+  weight = (float **)our_alloc(125 * sizeof(float *));
+  for (i=0; i<125; i++) weight[i]=(float *)our_alloc(125 * sizeof(float));
+  is_nuc = (int *)our_alloc(125 * sizeof(int));
+  /* end allocations */
+  
+  
+  /* the weight array is used to determine homology
+     scoring.  When the homology finds two nucleotides
+     that are opposites, weight > 0, then
+     it uses weight to determine the score for that
+     current nucleotide*/
+  
+  get_weight_matrix(weight,prm.wt_ambig,prm.AT_score,prm.CG_score);
+  
+  /* the is_nuc array is used just to check
+     whether a primer has any ambiguous nucleotides;
+     if it is in the list it is considered non-ambiguous */
+  for (i = 0; i < 125; i++) 
+      is_nuc[i] = 0;
+  
+  is_nuc['A']=1;
+  is_nuc['G']=1;
+  is_nuc['C']=1;
+  is_nuc['T']=1;
+  is_nuc['a']=1;
+  is_nuc['t']=1;
+  is_nuc['c']=1;
+  is_nuc['g']=1;
+  is_nuc['U']=1;
+  is_nuc['u']=1;
+  is_nuc['-']=0;
+  is_nuc['N']=0;
+  is_nuc['n']=0;
+  
+  /* set the number of primer candidates found for the
+     antisense and sense bank to 0 */
+  a_bank[0].num = 0;
+  s_bank[0].num = 0;
+  num = 0;
+  
+  
+  /* if only one sequence, then go get the complementary sequence to use
+     to look for the antisense primer */
+  if ((program_option==1) || (((program_option==0) && strlen(seq_two)<10)) || (one_primer)) get_compl_seq(c_seq,seq,0,seq_len,seq_len);
+  else { /* two sequences must have been entered, so check orientation 
+	    to see if you need to 
+	    get_compl_seq or not */
+      /* if orient==1 sequence  is in its top strand form when entered, so
+	 get the complementary sequence to look for the antisense strand on*/
+      if (orient == 1)
+	  get_compl_seq(c_seq,seq_two,stp2,endp2,seq_len);
+      else { /* if they are already in the opposite orientation
+		they just go ahead and use the sequence when looking
+		for an antisense primer */
+	  for (i = 0; i <=seq_len; i++)
+	      c_seq[i]=seq_two[i];
+	  
+      }
+  } /* two sequences must have been entered */
+  
+  /* get gc_content array, so you can look up GC content for any position
+     rather than having to calculate it each time - this should speed things
+     up considerably */
+  get_gc_array(seq,s_gc_array);
+  get_gc_array(c_seq,a_gc_array);
+  
+  
+  /* sequence two should be the antisense flanking region, it should
+     have been input in its top strand form if orient==1, bottom strand
+     if orient==2*/
+  
+  /*initialize counters to 0 */
+  num_sense = 0;
+  num_anti = 0;
+  num_prod_len = 0;
+  num_a_n=0;
+  num_s_n=0;
+  num_delta_tm=0;
+  num_ident_ends=0;
+  
+  
+  /* Build AntiSense Primer Candidates in first round and
+     sense candidates in second*/
+  
+  if (program_version!=3) message("......building primer candidates\n");
+  
+  for (sense = 0; sense < 2; sense++) {
+      
+      if (one_primer) sense = 1; /* do not need to look for any antisense primers
+				    if we're just looking for one primer */
+      /* each time you start looking for a new primer candidate, set
+	 other internal homology and  other 3 primer homology to zero,
+	 because those variables are used to keep the max scores
+	 found */
+      ohmlg=0;
+      oend_hmlg=0;
+      
+      
+      if (sense) {
+	  starti = stp + prm.min_prim_len - strlen(prm.end_nucs); /*- 2; */
+	  /*  because you're going to be concatenating the previous MAX_PRIM_LEN - 2
+	      bases on the 5' side when you find your first GC and you must
+	      have a primer that is at least MIN_PRIM_LEN */
+	  endi = endp;
+      }
+      else {
+	  starti = stp2 + prm.min_prim_len - strlen(prm.end_nucs); /* remember here that
+								      stp2 and endp2 refer to indices on the complementary sequence */
+	  endi = endp2;
+      }
+      /* start at 5prime end */
+      
+      for (i = starti; i <= endi; i++) {
+	  /* go find the next occurrence of prm.end_nucs */
+	  if (sense) 
+	      pos1 = find_end_nucs(seq,i,endp,prm.end_nucs);
+	  else 
+	      pos1 = find_end_nucs(c_seq,i,endp2,prm.end_nucs); 
+	  
+	  i = pos1;
+	  
+	  
+	  if (pos1 == -1) break; /* no gc-type sequences are left */
+	  else 
+	      num_gcs++; /*number of end_nucs it has found*/
+	  
+	  /* concatenate the next max_prim_len-1 bases lying on
+	     the 3' side for the antisense primer and the
+	     previous max_prim_len-2 bases lying on the 5' side
+	     for the sense primer: n
+	     you want the GC-type end at the 3' end,
+	     */
+	  
+	  pos2 = pos1 + strlen(prm.end_nucs) -1 ;
+	  pos1 = pos2 - (prm.min_prim_len - 1);
+	  
+	  /* do not start searching any earlier than stp or stp2 */
+	  if (sense) {
+	      if (pos1 < stp) pos1 = stp; 
+	  }
+	  else {
+	      if (pos1 < stp2) pos1 = stp2;
+	  }
+	  
+	  
+	  /* check each possible primer from min_prim_len to max_prim_len, 
+	     do not just stop once the min_prim_len one works */
+	  for (; pos2-pos1 <= prm.max_prim_len - 1; pos1--) {
+	      /*make sure you are not searching earlier than stp or stp2 */
+	      if (pos1 < stp && sense) break;
+	      if (pos1 < stp2 && !sense) break;
+	      
+	      /* if the primers are  closer than prm.prod_len_min to the
+		 end of the sequence, then their product length will never
+		 be long enough, so you might as well just throw them away,
+		 but only if !one primer, if one_primer they will be thrown
+		 away as product length too short -- but can only do this
+		 when you are looking for primers on one sequence, and when
+		 you are not looking for primers in two flanking regions*/
+	      
+	      if (program_option != 0 && !one_primer &&  ((sense && (endp-pos1+1<prm.prod_len_low)) ||
+							  (!sense && (endp2-pos1+1<prm.prod_len_low)))) break;
+	      
+	      /*  new code as of version 1.1 */
+	      
+	      /* if you are looking for primers in two flanking regions
+		 and those two flanking regions are in one sequence, then
+		 you can check product lengths to make sure you are not
+		 looking for primers which are never going to have
+		 long enough product lengths
+		 
+		 note that when I have input two flanking regions...e.g.
+		 start1=1 end1=50 start2=100 end2=145
+		 
+		 then stp is 0, endp is 49, stp2 is 0 and endp2 is 45*/
+	      
+	      if ((program_option==0 && num_seq==1 && sense && (seq_len-pos1+1<prm.prod_len_low || (seq_len-endp2)-pos1+1>prm.prod_len_high)) ||
+		  (program_option==0 && num_seq==1 && !sense  && (seq_len-endp2+pos1+1-stp<prm.prod_len_low || seq_len-endp2+pos1-endp+1>prm.prod_len_high)))
+		  break;
+	      /*  end of new code as of version 1.1 */
+	      
+	      if (sense) num_sense++;
+	      else num_anti++;
+	      
+	      
+	      /* ---if only one primer 
+		 check the product length before making any calculations
+		 where product length = start of primer to end of sequence--- */
+	      
+	      if (one_primer) {  
+		  len = seq_len - pos1;
+		  accepted = 1;
+		  if ((len < prm.prod_len_low && prm.prod_len_low != 0) || 
+		      (len > prm.prod_len_high && prm.prod_len_high != 0)
+		      || len < prm.min_prim_len)
+		  { accepted = 0; num_prod_len++; continue;} 
+		  if (accepted == 0) continue;
+	      }
+	      
+	      /* ---check if there is an N or n or any other non-A,G,C,T in the 
+		 primer sequence, if there is throw out this primer
+		 and do not consider any more with this starting point --- */
+	      if (sense)
+		  if (is_n(seq,pos1,pos2,is_nuc)) { num_s_n++; continue; }
+	      if (!sense)
+		  if (is_n(c_seq,pos1,pos2,is_nuc))   { num_a_n++; continue; }
+	      
+	      
+	      /* ---check the primer gc_content--- */
+	      if (sense)  {
+		  prcnt_gc = (float)(s_gc_array[pos2]-s_gc_array[pos1-1])/(float)(pos2-pos1+1);
+		  s_pgc = prcnt_gc;
+	      }
+	      else{
+		  prcnt_gc = (float)(a_gc_array[pos2]-a_gc_array[pos1-1])/(float)(pos2-pos1+1);
+		  a_pgc = prcnt_gc;
+	      }
+	      
+	      if ((prcnt_gc < prm.prim_gc_low && prm.prim_gc_low != 0) || (prcnt_gc > prm.prim_gc_high && prm.prim_gc_high != 0))
+	      {
+		  /* did not pass*/
+		  if (sense) s_gc_rej++;
+		  else a_gc_rej++;
+		  continue;
+	      }
+	      
+	      
+	      /* ---check the primer melting temperature--- */
+	      num_ambigs=0;
+	      if (sense) 
+		  melt_temp(pos2-pos1+1,s_pgc,&prim_tm,num_ambigs);
+	      else 
+		  melt_temp(pos2-pos1+1,a_pgc,&prim_tm,num_ambigs);
+	      
+	      
+	      if ((prim_tm < prm.prim_tm_low && prm.prim_tm_low != 0) || 
+		  (prim_tm > prm.prim_tm_high && prm.prim_tm_high != 0))
+	      {
+		  /* did not pass */
+		  if (sense) s_prim_tm_rej++;
+		  else a_prim_tm_rej++;
+		  continue;
+	      }
+	      
+	      
+	      /* ---check for primer-self homology-- */
+	      if (sense) {
+		  hmlg = homology(seq,pos1,pos2,seq,pos1,pos2,weight);
+		  end_hmlg = end_hom(seq,pos1,pos2,seq,pos1,pos2,weight);
+	      }
+	      else {
+		  hmlg = homology(c_seq,pos1,pos2,c_seq,pos1,pos2,weight);
+		  end_hmlg = end_hom(c_seq,pos1,pos2,c_seq,pos1,pos2,weight);
+	      }
+	      
+	      if ((hmlg > prm.selfI_hmlg_cut && prm.selfI_hmlg_cut != 0) || 
+		  (end_hmlg > prm.self3_hmlg_cut && prm.self3_hmlg_cut != 0))
+	      {
+		  /* did not pass primer-self homology test*/
+		  if (sense) s_hmlg_rej++;
+		  else a_hmlg_rej++;
+		  continue;
+	      }
+	      
+	      /* ---check primer-other homologies--- */
+	      /* primer-other includes the primer with the rest of the sequence */
+	      
+	      ohmlg=0;
+	      oend_hmlg=0;
+	      
+
+	      if (sense) {
+		  if (!otherSeq)
+		      goto no_others1;
+
+		  for (ii=0; ii<=num_other_seqs; ii++) {
+		      /* primer versus other sequence  */
+		      if ((otherDouble_stranded==0) && 
+			  ((otherSeq[ii].strand==1 && otherRight==0) || 
+			   (otherSeq[ii].strand==0 && otherRight==1)))
+			  continue;
+		      
+		      curr_ohmlg = homology(seq,pos1,pos2,otherSeq[ii].seq,
+					    0,otherSeq[ii].len,weight);
+		      curr_oend_hmlg=end_hom(seq,pos1,pos2,otherSeq[ii].seq,
+					     0,otherSeq[ii].len,weight);
+		      
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		  } /* end of for (ii=0; ii<num_other_seqs; ii++) */
+
+	      no_others1:
+		  /* primer versus rest of input sequence */
+		  /*check the primer versus the part of the sequence
+		    before the primer*/
+		  curr_ohmlg = homology(seq,pos1,pos2,seq,0,pos1,weight);
+		  curr_oend_hmlg=end_hom(seq,pos1,pos2,seq,0,pos1,weight);
+		  
+		  /* if this primer vs rest of input sequence
+		     has a greater hmlg, go ahead and
+		     hold on to the greater score for as other homology score 
+		     for output */
+		  if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		  if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		  
+		  /* if there are two sequences entered
+		     also included in primer-other should be seq versus top
+		     strand of seq_two ? */
+		  if (num_seq==2) {
+		      if (orient==1) {
+			  /* if orient was 1 sequence 2 must have been in its top
+			     strand form already, so compare primer 1 to the
+			     entire top strand of the other sequence */
+			  curr_ohmlg = homology(seq,pos1,pos2,seq_two,0,seq_len,weight);
+			  curr_oend_hmlg=end_hom(seq,pos1,pos2,seq_two,0,seq_len,weight);
+		      }
+		      else {
+			  /* it must have been entered in its bottom strand form,
+			     so we want to compare it to that top strand equivalent
+			     which was calculated above*/
+			  /* also calculate the top_strand equivalent so that when I am
+			     calculating primer versus rest of sequence annealing, I can
+			     look at primer 1 versus the top strand of the entire piece 
+			     you are getting primer 2 from */
+			  char *top_seq;
+			  top_seq = (char *)our_alloc(MAX_SEQ_LEN * sizeof(char));
+			  get_compl_seq(top_seq,seq_two,0,seq_len,seq_len);
+			  curr_ohmlg = homology(seq,pos1,pos2,top_seq,0,seq_len,weight);
+			  curr_oend_hmlg=end_hom(seq,pos1,pos2,top_seq,0,seq_len,weight);
+			  our_free(top_seq);
+		      }
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		  }
+		  
+		  
+		  /*primer versus part of sequence after the primer*/
+		  /* use strlen(seq)-1, not seq_len here, since if there are
+		     two sequences entered, seq_len holds the length of seq_two,
+		     not of seq*/
+		  curr_ohmlg = homology(seq,pos1,pos2,seq,pos2,strlen(seq)-1,weight);
+		  curr_oend_hmlg=end_hom(seq,pos1,pos2,seq,pos2,strlen(seq)-1,weight);
+		  /* if either of these are greater, hold on to the largest
+		     scores*/
+		  if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		  if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		  
+		  
+		  if (double_stranded) {
+		      /* check the other strand  as well */
+		      
+		      if (num_seq==2) {
+			  /* if 2 sequences, then primer versus whole other sequence
+			     is part of primer-other*/
+			  curr_ohmlg = homology(seq,pos1,pos2,c_seq,0,seq_len,weight);
+			  curr_oend_hmlg = end_hom(seq,pos1,pos2,c_seq,0,seq_len,weight);
+		      }
+		      else {
+			  /*if 1 seq, primer versus bottom strand sequence before primer*/
+			  curr_ohmlg = homology(seq,pos1,pos2,c_seq,seq_len-pos1,seq_len,weight);
+			  curr_oend_hmlg = end_hom(seq,pos1,pos2,c_seq,seq_len-pos1,seq_len,weight);
+		      }
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      
+		      /*primer versus bottom strand sequence after primer*/
+		      if (num_seq != 2) {
+			  curr_ohmlg = homology(seq,pos1,pos2,c_seq,0,seq_len-pos2,weight);
+			  curr_oend_hmlg = end_hom(seq,pos1,pos2,c_seq,0,seq_len-pos2,weight);
+			  if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+			  
+			  if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      }
+		  } /* end of if double stranded*/
+		  
+		  
+	      } /* end of if sense */
+	      else { /* must be antisense */
+		  
+		  if (!otherSeq)
+		      goto no_others2;
+		  
+		  for (ii=0; ii<=num_other_seqs; ii++) {
+		      /* primer versus other sequence */
+		      if ((otherDouble_stranded==0) && 
+			  ((otherSeq[ii].strand==1 && otherRight==0) || 
+			   (otherSeq[ii].strand==0 && otherRight==1)))
+			  continue;
+		      
+		      curr_ohmlg = homology(c_seq,pos1,pos2,otherSeq[ii].seq,
+					    0,otherSeq[ii].len,weight);
+		      curr_oend_hmlg=end_hom(c_seq,pos1,pos2,otherSeq[ii].seq,
+					     0,otherSeq[ii].len,weight);
+		      
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      
+		  } /* end of for (ii=0; ii<num_other_seqs; ii++) */
+		  
+	      no_others2:
+		  if (double_stranded) {
+		      
+		      /* primer versus rest of input sequence,bottom strand */
+		      /*check the primer versus the part of the sequence
+			before the primer, same strand*/
+		      curr_ohmlg = homology(c_seq,pos1,pos2,c_seq,0,pos1,weight);
+		      curr_oend_hmlg=end_hom(c_seq,pos1,pos2,c_seq,0,pos1,weight);
+		      /*hold on to the greater score for as other 
+			homology score for output */
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      /*primer versus part of sequence after the primer, bottom strand*/
+		      curr_ohmlg = homology(c_seq,pos1,pos2,c_seq,pos2,seq_len,weight);
+		      curr_oend_hmlg=end_hom(c_seq,pos1,pos2,c_seq,pos2,seq_len,weight);
+		      /* if either of these are greater, hold on to the largest
+			 scores*/
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      if (num_seq==2) {
+			  /*also calculate the primer versus the bottom strand of seq, or
+			    bottom of the sense strand if there are two sequences*/
+			  char *bot_seq;
+			  int lng; /* length of sequence one*/
+			  bot_seq = (char *)our_alloc(MAX_SEQ_LEN * sizeof(char));
+			  lng=strlen(seq)-1;
+			  get_compl_seq(bot_seq,seq,stp,endp,lng);
+			  curr_ohmlg=homology(c_seq,pos1,pos2,bot_seq,0,lng,weight);
+			  curr_oend_hmlg=end_hom(c_seq,pos1,pos2,bot_seq,0,lng,weight);
+			  /* if either of these are greater, hold on to the largest
+			     scores*/
+			  if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+			  if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+			  our_free(bot_seq);
+		      }
+		      
+		  } /* end if double_stranded*/
+		  
+		  /* check the other strand  primer-versus-top strand 
+		     no matter what */
+		  {
+		      int seq_len1;
+		      seq_len1 = strlen(seq)-1;
+		      
+		      /*primer versus top strand sequence before primer*/
+		      curr_ohmlg = homology(c_seq,pos1,pos2,seq,seq_len1-pos1,seq_len1,weight);
+		      curr_oend_hmlg = end_hom(c_seq,pos1,pos2,seq,seq_len1-pos1,seq_len1,weight);
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      /*primer versus top strand sequence after primer*/
+		      curr_ohmlg=homology(c_seq,pos1,pos2,seq,0,seq_len1-pos2,weight);
+		      curr_oend_hmlg=end_hom(c_seq,pos1,pos2,seq,0,seq_len1-pos2,weight);
+		      if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+		      if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      
+		      if (num_seq==2) {
+			  /* if two sequences, check the antisense primer versus
+			     the top strand or sense sequence*/
+			  curr_ohmlg=homology(c_seq,pos1,pos2,seq,0,seq_len1,weight);
+			  curr_oend_hmlg=end_hom(c_seq,pos1,pos2,seq,0,seq_len1,weight);
+			  if (curr_ohmlg > ohmlg) ohmlg = curr_ohmlg;
+			  if (curr_oend_hmlg > oend_hmlg) oend_hmlg = curr_oend_hmlg;
+		      }
+		      
+		  } /* end of check on these no matter what block*/
+		  
+	      } 
+	      
+	      if ((ohmlg > prm.primotherI_hmlg_cut && prm.primotherI_hmlg_cut != 0) ||
+		  (oend_hmlg > prm.primother3_hmlg_cut && prm.primother3_hmlg_cut != 0))
+	      {
+		  /* did not pass primer-other homology cutoffs*/
+		  if (sense) s_other_hmlg_rej++;
+		  else a_other_hmlg_rej++;
+		  continue;
+	      }
+	      
+	      
+	      /* ---if it has passed the above criteria, add it to the correct
+		 candidate bank--- */
+	      
+	      if (sense) {
+		  if (one_primer) { /* check to make sure you only keep the short-mer if
+				       multiple primers with the same 3' ending point are
+				       accepted */
+		      if (pos2 != s_bank[s_bank[0].num].endp) {
+			  if (!add_to_bank(pos1,pos2,hmlg,end_hmlg,ohmlg,oend_hmlg,prcnt_gc,prim_tm,s_bank)) {analysis_ok=0; goto freeVars;}
+		      }
+		      else num_ident_ends++;
+		  }
+		  else {
+		      add_okay=add_to_bank(pos1,pos2,hmlg,end_hmlg,ohmlg,oend_hmlg,prcnt_gc,prim_tm,s_bank);
+		      if (!add_okay) {analysis_ok=0; goto freeVars;}
+		  }
+	      } /* if sense */
+	      else {
+		  add_okay=add_to_bank(pos1,pos2,hmlg,end_hmlg,ohmlg,oend_hmlg,prcnt_gc,prim_tm,a_bank);
+		  if (!add_okay) {analysis_ok=0;goto freeVars;}
+	      }
+	  }
+      }    /* for pos2-pos1... */
+      if (s_bank[0].num>=MAX_NUM_OLIGOS || a_bank[0].num>=MAX_NUM_OLIGOS) {
+	  message("ERROR: Number of primers found exceeds the maximum number allowed");
+	  popUpErrorMessage();
+	  analysis_ok=0;
+	  goto freeVars;
+      }
+  } /* for sense = 0... */
+  
+  
+  /* From 5' to 3' begin comparing each sense primer versus each
+     antisense primer (you must now look at the transformed 
+     (complementary) version of the antisense primer) */
+  
+  num_prod_cons = 0;
+  /* We want to eliminate products which are identical 
+     except for the length of the two primers --
+     we want to go ahead and just take the shortest primers, 
+     this keeps track of whether
+     the product considered just before this one was accepted 
+     (accepted =0 means the primer just considered was not accepted) */
+  
+  if (one_primer==0) 
+      message(".....getting products\n");
+  
+  accepted = 0;
+  if (one_primer == 0) {
+      for (i = 1; i <= s_bank[0].num; i++) {
+	  for (j = 1; j<=a_bank[0].num; j++)  {
+	      num_prod_cons++;
+	      
+	      
+	      if ((a_bank[j].endp == a_bank[j+1].endp)&&(accepted)) {
+		  /* throw out the longer primers with identical antisense 
+		     endpoints, this program
+		     only displays the shorter antisense if the 
+		     18,19,20,21 and 22 were all accepted 
+		     it will only display the 18 */
+		  num_ident_ends++;
+		  accepted = 1;
+		  continue;
+	      }
+	      
+	      /* ------check that the length of the amplified product is within
+		 the specified range------ */
+	      /* if two flanking regions were input, then length
+		 constraints cannot be used */
+	      if ((program_option==1) || ((program_option==0) && strlen(seq_two)<10)) {
+		  if ((seq_len - a_bank[j].stp) < s_bank[i].stp) {
+		      num_prod_len++; /* number products rejected based on length */
+		      accepted = 0;
+		      continue;
+		  }
+		  len = (seq_len - a_bank[j].stp) - s_bank[i].stp + 1; 
+		  
+		  
+		  if (len < prm.prod_len_low && prm.prod_len_low !=0 || len < 2*prm.min_prim_len) { accepted = 0; num_prod_len++; continue; }
+		  if (len > prm.prod_len_high && prm.prod_len_high != 0) { accepted = 0; num_prod_len++; continue;} 
+	      }
+	      
+	      
+	      /* ---Calculate the gc for the amplified region (including primers), but
+		 only for those program_options which allow product calculations */
+	      if ((program_option==1) || ((program_option==0) && strlen(seq_two)<10)) {
+		  prod_prcnt_gc  = (float)(s_gc_array[seq_len-a_bank[j].stp]-s_gc_array[s_bank[i].stp-1])/
+		      (float)(seq_len-a_bank[j].stp-s_bank[i].stp+1);
+		  if ((prod_prcnt_gc < prm.prod_gc_low && prm.prod_gc_low != 0) || 
+		      (prod_prcnt_gc > prm.prod_gc_high && prm.prod_gc_high != 0)) { 
+		      accepted = 0; 
+		      prod_gc_rej++;
+		      continue;
+		  }
+	      }
+	      
+	      
+	      /* ---check difference in melting temperatures between the two primers--- */
+	      
+	      delta_tm = s_bank[i].tm - a_bank[j].tm;
+	      if (delta_tm < 0) delta_tm *= -1; /* error checking */
+	      if (delta_tm > prm.delta_tm_cut && prm.delta_tm_cut != 0) 
+	      { 
+		  accepted = 0; 
+		  num_delta_tm++; 
+		  continue; 
+	      }
+	      
+	      /* ---check primer-primer homologies--- */
+	      hmlg = homology(seq,s_bank[i].stp,s_bank[i].endp,c_seq,
+			      a_bank[j].stp,a_bank[j].endp,weight);
+	      hmlga = homology(c_seq,a_bank[j].stp,a_bank[j].endp,seq,
+			       s_bank[i].stp,s_bank[i].endp,weight);
+	      end_hmlg = end_hom(seq,s_bank[i].stp,s_bank[i].endp,c_seq,
+				 a_bank[j].stp,a_bank[j].endp,weight);
+	      end_hmlga = end_hom(c_seq,a_bank[j].stp,a_bank[j].endp,seq,
+				  s_bank[i].stp,s_bank[i].endp,weight);
+	      
+	      if (end_hmlga > end_hmlg) end_hmlg = end_hmlga;
+	      if (hmlga > hmlg) hmlg = hmlga;
+	      
+	      
+	      if ((hmlg > prm.ppI_hmlg_cut && prm.ppI_hmlg_cut != 0) || 
+		  (end_hmlg > prm.pp3_hmlg_cut && prm.pp3_hmlg_cut != 0))
+	      {
+		  p_hmlg_rej++;
+		  accepted = 0;
+		  continue;
+	      }
+	      
+	      /* ----check primer-product homologies -- only look at the amplified
+		 region, not including primers--- */
+	      /* if two flanking regions in two separate sequences 
+		 were input,  or if only looking for one primer
+		 then primer-product homologies should not be calculated
+		 */
+	      if (program_option==1 || ((program_option==0) && strlen(seq_two)<10)) {
+		  /* primer 1 versus current strand -- internal homology */
+		  p1p_hmlg = homology(seq,s_bank[i].stp,s_bank[i].endp,seq, 
+				      s_bank[i].endp,seq_len-a_bank[j].endp,weight);
+		  /* primer 1 versus other strand -- internal homology */
+		  if (double_stranded) {
+		      p1p_hmlg_c = homology(seq,s_bank[i].stp,s_bank[i].endp,c_seq, 
+					    a_bank[j].endp,seq_len-s_bank[i].endp,weight);
+		      if (p1p_hmlg_c > p1p_hmlg) p1p_hmlg = p1p_hmlg_c;
+		  }
+		  
+		  /* primer 2 versus current strand */
+		  p2p_hmlg = homology(c_seq,a_bank[j].stp,a_bank[j].endp,seq,
+				      s_bank[i].endp,seq_len-a_bank[j].endp,weight);
+		  /* primer 2 versus other strand */
+		  if (double_stranded) {
+		      p2p_hmlg_c = homology(c_seq,a_bank[j].stp,a_bank[j].endp,c_seq, 
+					    a_bank[j].endp,seq_len-s_bank[i].endp,weight);
+		      if (p2p_hmlg_c > p2p_hmlg) p2p_hmlg = p2p_hmlg_c;
+		  }
+		  
+		  /* primer 1 versus current strand -- three prime homology */
+		  p1p_end_hmlg = end_hom(seq,s_bank[i].stp,s_bank[i].endp,seq, 
+					 s_bank[i].endp,seq_len-a_bank[j].endp,weight);
+		  /* primer 1 versus other strand -- three prime homology */
+		  if (double_stranded) {
+		      p1p_end_hmlg_c = end_hom(seq,s_bank[i].stp,s_bank[i].endp,c_seq, 
+					       a_bank[j].endp,seq_len-s_bank[i].endp,weight);
+		      if (p1p_end_hmlg_c > p1p_end_hmlg) p1p_end_hmlg = p1p_end_hmlg_c;
+		  }
+		  
+		  /* primer 2 versus current strand -- three prime homology */
+		  p2p_end_hmlg = end_hom(c_seq,a_bank[j].stp,a_bank[j].endp,seq,
+					 s_bank[i].endp,seq_len-a_bank[j].endp,weight);
+		  /* primer 2 versus other strand -- three prime homology */
+		  if (double_stranded) {
+		      p2p_end_hmlg_c = end_hom(c_seq,a_bank[j].stp,a_bank[j].endp,c_seq, 
+					       a_bank[j].endp,seq_len-s_bank[i].endp,weight);
+		      if (p2p_end_hmlg_c > p2p_end_hmlg) p2p_end_hmlg = p2p_end_hmlg_c;
+		  }
+		  
+		  /*check parameters */
+		  if ((p1p_hmlg > prm.primprodI_hmlg_cut && prm.primprodI_hmlg_cut != 0)||
+		      (p1p_end_hmlg > prm.primprod3_hmlg_cut && prm.primprod3_hmlg_cut!=0)||
+		      (p2p_hmlg > prm.primprodI_hmlg_cut && prm.primprodI_hmlg_cut != 0)||
+		      (p2p_end_hmlg > prm.primprod3_hmlg_cut && prm.primprod3_hmlg_cut!=0))
+		  {
+		      /* did not pass the primer-product homology cutoffs */
+		      p_primprod_hmlg_rej++;
+		      accepted = 0;
+		      continue;
+		  }
+		  
+	      }  /* end if program_option=1 || 0 and there is not  a seq_two */
+	      
+	      
+	      
+	      
+	      /* ---Calculate the Tm for the amplified region (including primers), but
+		 only for those program_options which allow product calculations--- */
+	      num_ambigs=0;
+	      if ((program_option==1) || ((program_option==0) && strlen(seq_two)<10))
+	      {
+		  len = (seq_len - a_bank[j].stp) - s_bank[i].stp + 1; 
+		  /*calculate the number of ambiguties in the product so
+		    that we do not include those in the melting temperature
+		    calculation*/
+		  for (iii=s_bank[i].stp; iii<=(seq_len-a_bank[j].stp); iii++)
+		      if (!is_nuc[seq[iii]]) num_ambigs++;
+		  melt_temp(len,prod_prcnt_gc,&tm,num_ambigs);
+	      }
+	      else tm = 0; /*because if you are looking for only one primer 
+			     or you have two separate sequenceces input you 
+			     (a) do not care about product tm or (b) cannot
+			     calculate it anyway*/
+	      
+	      ident = 0;
+	      
+	      passed=1;
+	      if ((program_option==1) || ((program_option==0) && strlen(seq_two)<10))
+	      {/*only check if the tm is correct if you can actually calculate it*/
+		  if ((tm < prm.prod_tm_low && prm.prod_tm_low != 0) || 
+		      (tm > prm.prod_tm_high && prm.prod_tm_high != 0)) passed = 0;
+	      }
+	      
+	      if (passed) {
+		  accepted = 1; /* this primer was accepted */
+		  /* check that this sense primer endpoint has not already 
+		     been used with an identical
+		     antisense primer */
+		  for (jjj = 1; jjj <= num; jjj++) {
+		      if ((s_bank[i].endp == s_bank[product[jjj].s_primer].endp) &&
+			  /*	      (a_bank[j].stp == a_bank[product[jjj].a_primer].stp) && */
+			  (a_bank[j].endp == a_bank[product[jjj].a_primer].endp))  {
+			  num_ident_ends++;
+			  ident = 1;
+			  break;
+		      }
+		  }
+	      }
+	      if (ident) continue;
+	      
+	      if (passed) {	  
+		  num++; /* keep track of how many primer-pairs that have passed the tests */
+		  if (num>MAX_NUM_OLIGOS) {
+		      message("ERROR: Number of products found exceeds the maximum number allowed");
+		      popUpErrorMessage();
+		      analysis_ok=0;
+		      goto freeVars;
+		  }
+		  product[num].s_primer = i;
+		  product[num].a_primer = j;
+		  product[num].score = hmlg;
+		  product[num].end_score = end_hmlg;
+		  if ((program_option==1) || ((program_option==0) && strlen(seq_two)<10)) {
+		      /* these product scores cannot be calculated
+			 when we're looking at flanking sequences only OR
+			 if we're only looking for one sequence */
+		      product[num].gc = prod_prcnt_gc; 
+		      product[num].tm = tm; 
+		      product[num].p1hom = p1p_hmlg;
+		      product[num].p2hom = p2p_hmlg;
+		      product[num].p1end = p1p_end_hmlg;
+		      product[num].p2end = p2p_end_hmlg;
+		  }
+		  else {
+		      product[num].p1hom = 0;
+		      product[num].p2hom = 0;
+		      product[num].p1end = 0;
+		      product[num].p2end = 0;
+		  }
+		  
+		  
+		  /* calculate the score, thus the ranking, of this product based on 
+		     the weights */
+		  
+		  /* primer-self total homology, internal and three-prime */
+		  hmlg=s_bank[i].score+a_bank[j].score;
+		  end_hmlg = s_bank[i].end_score + a_bank[j].end_score;
+		  
+		  /* primer-product total homology, internal and three-prime */
+		  pp_hmlg=product[num].p1hom + product[num].p2hom;
+		  pp_end_hmlg = product[num].p1end + product[num].p2end;
+		  
+		  /* primer-other total homology, internal and three-prime */
+		  ohmlg=s_bank[i].primotherI + a_bank[j].primotherI;
+		  oend_hmlg=s_bank[i].primother3 + a_bank[j].primother3;
+		  
+		  delta_tm = s_bank[i].tm - a_bank[j].tm;
+		  if (delta_tm < 0) delta_tm *= -1; /* error checking */
+		  
+		  
+		  /*calculate the total score for that product */
+		  product[num].sum_score = 
+		      prm.wt_prod_len*(float)len + 
+			  prm.wt_prod_gc*product[num].gc +
+			      prm.wt_prod_tm*product[num].tm + 
+				  prm.wt_prim_s_len*(float)(s_bank[i].endp-s_bank[i].stp+1) + 
+				      prm.wt_prim_a_len*(float)(a_bank[j].endp-a_bank[j].stp+1) + 
+					  prm.wt_prim_s_gc*s_bank[i].gc  +	    
+					      prm.wt_prim_a_gc*a_bank[j].gc  +	    
+						  prm.wt_prim_s_tm*s_bank[i].tm  +	    
+						      prm.wt_prim_a_tm*a_bank[j].tm  +	    
+							  prm.wt_self3_hmlg_cut*end_hmlg  +
+							      prm.wt_selfI_hmlg_cut*hmlg + 
+								  prm.wt_pp3_hmlg_cut*product[num].end_score +
+								      prm.wt_ppI_hmlg_cut*product[num].score + 
+									  prm.wt_primprod3_hmlg_cut*pp_end_hmlg + 
+									      prm.wt_primprodI_hmlg_cut*pp_hmlg + 
+										  prm.wt_primother3_hmlg_cut*oend_hmlg + 
+										      prm.wt_primotherI_hmlg_cut*ohmlg + 
+											  prm.wt_delta_tm_cut*delta_tm;
+		  
+	      } /* if passed tm */
+	      else { 
+		  accepted = 0; 
+		  p_tm_rej++;
+	      }
+	      
+	      
+	  } /* for j = 1... */
+      } /* for i = 1... */
+  } /* if !one_primer */
+  
+  /* Display user information to the screen or oligoInfoWidget */
+  
+  sprintf(score_info,"PRIMERS\n");
+  k=strlen(score_info); 
+  sprintf(score_info+k,"(Number of %s-type sequences found was %d)\n",prm.end_nucs,num_gcs);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"  Sense:\n");
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Total considered:                           %5d\n",num_sense);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Rejected based on ambiguity (N):            %5d\n",num_s_n);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Rejected based on gc_content:               %5d\n",s_gc_rej);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Rejected based on primer Tm:                %5d\n",s_prim_tm_rej);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Rejected based on self-annealing            %5d\n",s_hmlg_rej);
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Rejected based on other-annealing:          %5d\n",s_other_hmlg_rej);
+  if (one_primer) {
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on identical endpts:         %5d\n",num_ident_ends);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on product length:           %5d\n",num_prod_len);
+  }
+  k=strlen(score_info); 
+  sprintf(score_info+k,"    Number accepted:                            %5d\n",s_bank[0].num);
+  k=strlen(score_info); 
+  if (!one_primer) {
+      sprintf(score_info+k,"  Antisense:\n");
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Total considered:                           %5d\n",num_anti);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on ambiguity (N):            %5d\n",num_a_n);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on gc_content:               %5d\n",a_gc_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on primer Tm:                %5d\n",a_prim_tm_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on self-annealing:           %5d\n",a_hmlg_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on other-annealing:          %5d\n",a_other_hmlg_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Number accepted:                            %5d\n",a_bank[0].num);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"PRODUCTS\n");
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Total considered:                           %5d\n",a_bank[0].num*s_bank[0].num);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on identical endpts:         %5d\n",num_ident_ends);
+      k=strlen(score_info); 
+      /* have to include total-num_prod_cons, because I skip over a bunch of primers once I realize I'm in the range of
+	 products which will all be too long */
+      sprintf(score_info+k,"    Rejected based on length:                   %5d\n",num_prod_len + a_bank[0].num*s_bank[0].num - num_prod_cons);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on primer-primer annealing:  %5d\n",p_hmlg_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on primer-product annealing: %5d\n",p_primprod_hmlg_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on melting temperature:      %5d\n",p_tm_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on gc_content:               %5d\n",prod_gc_rej);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Rejected based on difference in Tm:         %5d\n",num_delta_tm);
+      k=strlen(score_info); 
+      sprintf(score_info+k,"    Number accepted:                            %5d\n",num);
+      k=strlen(score_info); 
+  } /* if !one_primer */
+  
+  
+  score_info_len = k;
+  sprintf(score_info+k,"*******************************************************\n");
+  k=strlen(score_info); if (k>MAX_INFO_LEN) popUpErrorMessage();
+  
+  if (!one_primer) {
+      /* sort the products based on total scores */
+      if (num > 0) {
+	  if (num>1) {
+	      analysis_ok=sort_products(product,num);
+	      /* if sort_products returns a 0, there
+		 was a problem, so quit this analysis */
+	      if (analysis_ok==0) goto freeVars;
+	  }
+	  else {
+	      /* if only one primer you do not need to sort, just assign*/
+	      ndx[1]=1;
+	  }
+	  num_valid_primers = num; 
+	  
+#ifdef SUBVERSION
+	  if (program_version==3) /* subroutine version */
+	  {
+	      primer_to_xdap(num_valid_primers,OSP_RESULTS);
+	      goto freeVars;
+	  }
+#endif
+#ifdef TEXTVERSION
+	  if (program_version==2) { /* text version */
+	      textVersionOutputPrimers(num_valid_primers,orient);
+	      goto freeVars;
+	  }
+#endif
+	  
+#ifdef XVERSION
+	  
+	  get_pix_array(num);
+	  for (i = 1; i <= num; i++) {
+	      if (num_seq==2)
+		  drawflankingPrimers(stp,endp,stp2,endp2,product[ndx[i]].sum_score,s_bank[product[ndx[i]].s_primer],a_bank[product[ndx[i]].a_primer],i,product[ndx[i]],seq_len,one_primer);     
+	      else 
+		  drawPrimers(stp,endp,stp2,endp2,product[ndx[i]].sum_score,s_bank[product[ndx[i]].s_primer],a_bank[product[ndx[i]].a_primer],i,product[ndx[i]],seq_len,one_primer);
+	      
+	  }
+	  sprintf(str,"Click with the left mouse button ON any primer to have score information displayed here\n");
+	  if (num_seq!=2 && (tedType[0]=='A' || tedType[0]=='P')  && buftext==0) 
+	      sprintf(str+strlen(str),"\nClick with the left mouse button ABOVE the sequence bar at the desired indices to open a ted window.");
+	  if (strlen(str) > 1000) popUpErrorMessage();
+	  message(str);
+#endif
+      }
+      else {
+	  
+	  /* if this is the text version just output the score information
+	     directly to the screen, even if there were no suitable
+	     products found*/
+	  
+#ifdef TEXTVERSION
+	  if (program_version==2) 
+	      text_to_output(score_info,0,k,0,"junk12545.");
+#endif
+	  
+	  if (program_version==3) /* subroutine version */
+	  {
+	      /* don't need to convert to osp_results becuase
+		 no products were found */
+	      goto freeVars;
+	  }
+	  
+	  /* tell the user there were no suitable products found*/
+	  message("No suitable products found\n");
+	  analysis_ok=0;
+      }
+  }
+  else { /* must be one_primer == 1 */
+      if (s_bank[0].num > 0) { /* do the product analysis */
+	  for (num=0; num <= s_bank[0].num; num++) {
+	      product[num].s_primer=num;
+	      product[num].sum_score = 
+		  prm.wt_prod_len*(float)(seq_len-s_bank[num].stp+1) + 
+		      prm.wt_prim_s_len*(float)(s_bank[num].endp-s_bank[num].stp+1) + 
+			  prm.wt_prim_s_gc*s_bank[num].gc  + 	    
+			      prm.wt_prim_s_tm*s_bank[num].tm  + 	    
+				  prm.wt_self3_hmlg_cut*s_bank[num].end_score  +
+				      prm.wt_selfI_hmlg_cut*s_bank[num].score + 
+					  prm.wt_primother3_hmlg_cut*s_bank[num].primother3 +
+					      prm.wt_primotherI_hmlg_cut*s_bank[num].primotherI;
+	  } /* for num=0...*/
+	  
+	  if (s_bank[0].num > 1) {
+	      analysis_ok=sort_products(product,s_bank[0].num);
+	      if (analysis_ok==0) goto freeVars;
+	  }
+	  else {
+	      /* if only one product, do not need to sort, just assign*/
+	      ndx[1]=1;
+	  }
+	  
+	  num_valid_primers = s_bank[0].num;
+	  /* num_valid_primers is used by the save oligo widget to
+	     know how may primers can be saved */
+#ifdef SUBVERSION
+	  if (program_version==3) { /* subroutine version */
+	      primer_to_xdap(num_valid_primers,OSP_RESULTS);
+	      goto freeVars;
+	  }
+#endif
+#ifdef TEXTVERSION      
+	  if (program_version==2) { /* text version */
+	      textVersionOutputPrimers(num_valid_primers,orient);
+	      goto freeVars;
+	  }
+#endif
+#ifdef XVERSION
+	  /* figure out where all the primers will be drawn to the screen */
+	  get_pix_array(s_bank[0].num);
+	  /* draw the primers one by one */
+	  for (i = 1; i <= s_bank[0].num; i++)  
+	      drawPrimers(stp,endp,stp2,endp2,product[ndx[i]].sum_score,s_bank[product[ndx[i]].s_primer],a_bank[product[ndx[i]].a_primer],i,product[ndx[i]],seq_len,one_primer);
+	  message("Click with the left mouse button on any primer to have score information displayed here.\n");
+#endif
+      } /* if s_bank[0].num > 0 */
+      else {
+	  
+	  if (program_version==3) { 	/* subroutine version */
+	      /* message("No suitable products found\n"); */
+	      analysis_ok = 0;
+	      goto freeVars;
+	  }
+	  
+#ifdef TEXTVERSION
+	  /* output score info to the user even if no suitable products found */
+	  if (program_version==2)  /* text version */
+	      text_to_output(score_info,0,k,0,"junk12545.");
+#endif
+	  message("No suitable products found\n");
+	  analysis_ok=0;
+      } /*the else corresponding to must be one_primer == 1*/
+  }
+  
+  
+ freeVars:;
+  our_free(is_nuc);
+  for (i=0; i<125; i++) our_free(weight[i]);
+  our_free(weight);
+  our_free(c_seq);
+  our_free(str);
+  our_free(a_gc_array);
+  our_free(s_gc_array);
+  
+  if (program_version==3) { /* subroutine version */
+      /* do a bunch more our_frees */
+  }
+  
+  /* analysis_ok indicates if any errors occurred */
+  return(analysis_ok);
+  
+}
+
+
+/* uses aseq to find the sequence that
+   would appear on the opposite strand and places
+   that sequence in ac_seq; seq_len is the total length
+   of the input sequence (if there are two input sequences,
+   it is the length of seq_two) */
+
+void get_compl_seq(ac_seq,aseq,stp,endp,len_of_seq)
+    char *ac_seq; /*OUPUT: complementary sequence */
+    char *aseq; /*INPUT: sequence */
+    int stp; /*INPUT starting and ending point for complement */
+    int endp;
+    int len_of_seq; /* INPUT: length of aseq */
+{ int i,j;
+  
+  /* make sure we have not under-allocated memory*/
+  if (strlen(aseq)>MAX_SEQ_LEN || strlen(ac_seq)>MAX_SEQ_LEN) {
+      message("Sequence length greater than MAX_SEQ_LEN\n");
+      popUpErrorMessage();
+  }
+  
+  /* get the complementary sequence from stp to endp */
+  j = endp;
+  for (i = stp; i <= endp; i++)
+      ac_seq[len_of_seq - i] = opp[aseq[i]];
+  
+  return;
+}
+
+
+
diff --git a/src/bap/osp-bits/analysis.h b/src/bap/osp-bits/analysis.h
new file mode 100644
index 0000000..0289bb1
--- /dev/null
+++ b/src/bap/osp-bits/analysis.h
@@ -0,0 +1,48 @@
+#ifndef _analysis_h
+#define _analysis_h
+
+/* 
+  Program Name: analysis
+  File name: analysis.h
+  Purpose: all sequence analysis is done in this module, except
+    when the user only asks for scores without looking for primers 
+        In this program all of the primer and product evaluation
+  takes place, from selection of valid primers to selection of valid
+  primer pairs 
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Last Update: Fri Mar 23 1991
+
+  Change Log:
+*/
+
+/* ---- Exports ----  */
+extern int analysis();
+/* this function is the heart analysis.c -- it chooses the
+   candidate primers and products
+      returns a 1 if all the analysis went okay */
+
+extern float homology();
+/* calculates internal homology scores, returns the max score*/
+
+extern float end_hom();
+/* calculates three prime homology scores, returns the max score*/
+
+extern void get_compl_seq();
+/*gets the complementary sequence of the input sequence
+ in the desired range */
+
+extern void get_weight_matrix();
+/* this function determines annealing scores for any nucleotide
+   versus any other nucleotide, e.g. for a A on one strand
+   and T on other, score is prm.AT_score.  For an A on one
+   strand and C on other, score is 0 */
+
+
+/* internal */
+void indexx();
+/* indexes an array arrin[1..n], i.e. outputs the array indx[1..n]
+such that arrin[indx[j]] is in ascending order for j=1,2,...,N.
+The input quatitities n and arrin are not changed */
+
+#endif /* _analysis_h */
diff --git a/src/bap/osp-bits/defn.h b/src/bap/osp-bits/defn.h
new file mode 100644
index 0000000..38cd4d2
--- /dev/null
+++ b/src/bap/osp-bits/defn.h
@@ -0,0 +1,49 @@
+#ifndef _defn_h
+#define _defn_h
+
+/*
+  Program Name: defn
+  File name: def.h
+  Purpose: assigns macros and stdio and global declaration of
+          our_alloc
+  Last Update: Fri Mar 23 1991
+  Copyright 1991: LaDeana Hillier and Philip Green
+  Change Log:
+*/
+
+#if vms
+#include stdio
+#include ctype
+#else
+#include <stdio.h>
+#include <ctype.h>
+#endif
+
+
+#define MAX_WORDS 10
+#define MAX_I  10
+#define MAX_WORD_SIZE 100
+#define MAX_NAME_SIZE 100
+#define MAX_PARAM_SIZE 10000
+#define MAX_NUM_OLIGOS 300
+#define MAX_SEQ_LEN 50000
+#define MAX_NUM_OTHER_SEQS 25
+#define OTHER_SEQ_MAX_SIZE 100000
+#define OTHER_SEQ_FILE_MAX_SIZE 100000
+#define MAX_GRAPH_RESULTS_WIDTH 1000
+#define MAX_GRAPH_RESULTS_HEIGHT 1000
+#define MAX_INFO_LEN 3000   /* score information, which is the
+			      information about how many primers
+			      were accepted and rejected and for
+			      what reasons*/
+
+#define OriginX 20
+#define OriginY 25
+#define oligoFont1 "6x13"
+#define Primer_Intvl 10 /* pixel interval between primers */
+
+char *our_alloc();
+
+
+
+#endif /* _defn_h */
diff --git a/src/bap/osp-bits/defs.h b/src/bap/osp-bits/defs.h
new file mode 100644
index 0000000..7366431
--- /dev/null
+++ b/src/bap/osp-bits/defs.h
@@ -0,0 +1,121 @@
+#ifndef _osp_defs_h
+#define _osp_defs_h
+
+/*  File: defs.h
+    Purpose: hold all definitions for primer program
+ Copyright 1991: LaDeana Hillier and Philip Green*/
+
+#define MAX_LEN_LOC_NAME 20
+#define MAX_LEN_PKU_NAME 80
+#define MAX_LEN_ORDS_NAME 80
+#define NUM_PRE_COMP     20
+
+typedef long ALLOC;
+
+/* if change type SHORT to anything larger than short need to read values
+   in scanf & fscanf into a dummy variable of type short, then assign the
+   dummy variable to the true variable.
+*/
+
+typedef short SHORT;
+typedef long ID;
+typedef long LINDEX;
+typedef int INT;
+
+struct loci_orders{
+     SHORT num_loci, num_orders;
+     SHORT **orders;
+     struct loci_orders *next_orders;	
+};
+
+
+struct loci_data{
+     SHORT n, num_types;
+     double ***data;
+};
+
+
+struct chrom_data{
+     SHORT num_loci, num_fams;
+     SHORT *num_chroms;  /* different number of chroms for each fam */
+     struct phase **phase_choices;
+     char ***chrom_array;
+     ID *fam_nums_array;
+     char **locus_names;
+};
+
+struct phase{
+     SHORT num_switches;
+     SHORT *locus_nums;
+     char **array;
+};
+
+
+struct int_array{
+       SHORT n;
+       struct intervals **list;
+};
+
+struct intervals {
+       SHORT chrom_num, i, j, k, index, sort_index;
+       unsigned rank : 4;
+       unsigned r : 1;
+  /* i = sex; j, k are starting, ending loci; r = reference recomb. count; index, sort_index give position in list */
+       struct intervals *next_interval, *prev_interval;
+};
+
+struct interval_list{
+       SHORT num_intervals;
+       struct intervals *first_interval;
+};
+
+struct interval_ptrs{
+       struct intervals *interval;
+       struct interval_ptrs *next_ptr, *prev_ptr;
+};
+
+struct tswitchs{
+       SHORT chrom_tswitch_index, num_intervals;
+       struct interval_ptrs *first_ptr;
+       struct tswitchs *next_tswitch, *prev_tswitch;
+};
+
+struct tswitch_list {
+       SHORT num_tswitchs;
+       struct tswitchs *first_tswitch;
+};       
+
+struct flank_tswitchs {
+/*   struct tswitchs *tswitch;   ELIMINATE -- AND MAKE CORRESPONDING CHANGES IN PROGRAM? */
+   LINDEX m_in_l_list, m_in_r_list;
+   unsigned affects_l_interval : 1; 
+   unsigned affects_r_interval : 1; 
+   unsigned in_l_list : 1; 
+   unsigned in_r_list : 1; 
+   struct flank_tswitchs *next_tswitch;
+};
+
+struct flank_list {
+   SHORT num_flanks;
+   SHORT *num_tswitchs;
+   LINDEX *m_left_off, *m_right_off;
+   struct flank_tswitchs **first_tswitch;
+};
+
+
+struct individual{
+     ID id, moth_id, fath_id, moth, fath;
+     char sex;
+     SHORT **a;
+};
+
+struct data{
+     SHORT num_fams, num_loci;
+     SHORT *num_mems;
+     ID *fam_id;
+     char **locus_names;
+     struct individual ***ind;
+};
+
+
+#endif
diff --git a/src/bap/osp-bits/get_scores.c b/src/bap/osp-bits/get_scores.c
new file mode 100644
index 0000000..f6449c1
--- /dev/null
+++ b/src/bap/osp-bits/get_scores.c
@@ -0,0 +1,799 @@
+/* 
+  Program Name:  get_scores
+  File name: get_scores.c
+  Purpose: to just calculate scores for given primers, does not
+       look for primers
+
+  Last Update:  Mon April 15 1991
+
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Change Log:
+*/
+
+
+
+/* ---- Includes ---- */
+#include <stdlib.h>
+#include <string.h>
+#include "our_allo.h"
+
+#if defined(XVERSION) || defined (TEXTVERSION)
+#include "Xmess.h" /* IMPORT: message, popupMessage */
+#endif
+
+#include "defn.h" /* macros and stdio */
+#include "struct.h" /* global structure defns */
+#include "analysis.h"  /*IMPORT: get_compl_seq, get_weight_matrix, homology,
+			           end_hom*/
+
+extern void text_to_output();
+
+/* external variables */
+extern Prm prm;
+extern otherSeqBank *otherSeq;
+extern int seq_len;
+extern int one_primer;
+extern int orient;
+extern int num_other_seqs;
+extern char *output_fn;
+extern int program_version;
+extern int double_stranded;
+extern int otherRight;
+extern int otherDouble_stranded;
+extern int buftext;
+
+#if defined(XVERSION) || defined (TEXTVERSION)
+extern char *seq_filename;
+extern char *seq2_fn;
+#endif
+
+extern OSP_Results *OSP_RESULTS;
+
+#ifdef SUBVERSION
+extern void message();
+extern void popupMessage();
+extern void popUpErrorMessage();
+
+static char seq_filename[MAX_NAME_SIZE];
+char seq2_fn[MAX_NAME_SIZE];
+
+ void score_to_xdap(stp,endp,gc,tm,end_h,hom,oend_hmlg,ohmlg,sum_score,OSP_RESULTS)
+int stp,endp;
+float gc,tm,end_h,hom,oend_hmlg,ohmlg,sum_score;
+OSP_Results *OSP_RESULTS; /* results structure see struct.h */
+
+{ int primer_number;
+
+  primer_number=0;
+  OSP_RESULTS[primer_number].start_position=stp;
+  OSP_RESULTS[primer_number].end_position=endp;
+  OSP_RESULTS[primer_number].score=sum_score;
+  OSP_RESULTS[primer_number].gc=gc;
+  OSP_RESULTS[primer_number].tm=tm;
+  OSP_RESULTS[primer_number].psI_score=hom;
+  OSP_RESULTS[primer_number].ps3_score=end_h;
+  OSP_RESULTS[primer_number].poI_score=ohmlg;
+  OSP_RESULTS[primer_number].po3_score=oend_hmlg;
+
+  primer_number=1;
+  OSP_RESULTS[primer_number].start_position=0;
+  OSP_RESULTS[primer_number].end_position=0;
+  OSP_RESULTS[primer_number].score=0;
+  OSP_RESULTS[primer_number].gc=0;
+  OSP_RESULTS[primer_number].tm=0;
+  OSP_RESULTS[primer_number].psI_score=0;
+  OSP_RESULTS[primer_number].ps3_score=0;
+  OSP_RESULTS[primer_number].poI_score=0;
+  OSP_RESULTS[primer_number].po3_score=0;
+  return;
+
+}
+
+#endif
+
+/* --- Internal Functions --- */
+
+/* calculate primer versus rest of sequence homologies for the given 
+   sequence */
+void calc_primer_rest_of_seq_hom(seq1,cseq2,stp1,endp1,weight,seq_len,double_stranded,the_hmlg,the_end_hmlg)
+     char *seq1; /* INPUT: sequence */
+     char *cseq2; /* INPUT: complementary sequence */
+     int stp1,endp1; /* INPUT: starting and ending indices of primer on seq1 */
+     float **weight; /* INPUT: weights, determines annealing scores */
+     int seq_len; /* INPUT: total length of sequence */
+     int double_stranded; /* INPUT: 1 if you should look on both strands for 
+			     annealing (seq1 and cseq2), 0 if only on the 
+			     current strand, which would be seq1 */
+     float *the_hmlg; /* OUTPUT: internal annealing score */
+     float *the_end_hmlg; /* OUTPUT: three prime annealing score */
+{ float ohmlg_c=0,oend_hmlg_c=0; /* internal and three-prime homologies
+			      versus complementary strand */
+  float ohmlg=0,oend_hmlg=0; /* internal and three-prime homologies */
+  float kkk; /* used as a temporary storage for homology scores */
+  
+  /*primer versus rest of sequence  - internal */
+  if (stp1 > 0)
+    ohmlg = homology(seq1,stp1,endp1,seq1,0,stp1,weight);
+  if (endp1 < seq_len)
+    kkk = homology(seq1,stp1,endp1,seq1,endp1,seq_len,weight);
+  if (kkk > ohmlg) ohmlg = kkk;
+  
+  if (double_stranded) {
+    if (stp1 > 0)
+      ohmlg_c = homology(seq1,stp1,endp1,cseq2, 
+			 seq_len-stp1,seq_len,weight);
+    if (endp1 < seq_len)
+      kkk = homology(seq1,stp1,endp1,cseq2, 
+		     0,seq_len-endp1,weight);
+    if (kkk > ohmlg_c) ohmlg_c = kkk;
+    
+    if (ohmlg_c >ohmlg) ohmlg=ohmlg_c;
+  }
+  
+  /*primer versus rest of sequence  - three prime  */
+  if (stp1 > 0)
+    oend_hmlg = end_hom(seq1,stp1,endp1,seq1, 
+			0,stp1,weight);
+  if (endp1 < seq_len)
+    kkk = end_hom(seq1,stp1,endp1,seq1, 
+		  endp1,seq_len,weight);
+  if (kkk > oend_hmlg) oend_hmlg = kkk;
+  
+  if (double_stranded) {
+    if (stp1 > 0)
+      oend_hmlg_c = end_hom(seq1,stp1,endp1,cseq2, 
+			    seq_len-stp1,seq_len,weight);
+    if (endp1 < seq_len)
+      kkk = end_hom(seq1,stp1,endp1,cseq2, 
+		    0,seq_len-endp1,weight);
+    if (kkk > oend_hmlg_c) oend_hmlg_c = kkk;
+    
+    if (oend_hmlg_c >oend_hmlg) oend_hmlg=oend_hmlg_c;
+  }/* end of if double_stranded*/
+  
+  /* return the maximum scores found*/
+  *the_end_hmlg = oend_hmlg;
+  *the_hmlg = ohmlg;
+}
+
+/* calculate primer versus other sequence homlogies */
+void calc_prim_other_hmlg(seq1,stp1,stp2,weight,num_to_compare,ohmlg,oend_hmlg)
+     char *seq1; /* INPUT: your primary sequence */
+     int stp1,stp2;/* INPUT: starting and ending indices of the primer on seq1 */
+     float **weight;/*INPUT: weight array determines annealing scores */
+     int num_to_compare; /*INPUT: number of other sequences */
+     float *ohmlg; /* OUTPUT: maximum annealing score of current primer versus
+		    other sequence that this subroutine finds -- internal */
+     float *oend_hmlg;/* OUTPUT: maximum annealing score of current primer versus
+		       other sequence that this subroutine finds -- three prime */
+{ float curr_ohmlg=0;
+  float curr_oend_hmlg=0;
+  int ii;
+  
+  for (ii=0; ii<=num_to_compare; ii++) {
+    /* primer versus other sequence*/
+    
+    /* check to see if you are analyzing the correct strand*/
+    if ((!otherDouble_stranded) && 
+	((otherSeq[ii].strand==1 && otherRight==0) || 
+	 (otherSeq[ii].strand==0 && otherRight==1)))
+      continue;
+    
+    curr_ohmlg = homology(seq1,stp1,stp2,otherSeq[ii].seq,
+			  0,otherSeq[ii].len,weight);
+    curr_oend_hmlg=end_hom(seq1,stp1,stp2,otherSeq[ii].seq,
+			   0,otherSeq[ii].len,weight);
+    
+    if (curr_ohmlg > *ohmlg) *ohmlg = curr_ohmlg;
+    if (curr_oend_hmlg > *oend_hmlg) *oend_hmlg = curr_oend_hmlg;
+    
+  } /* end of for (ii=0; ii<num_other_seqs; ii++) */
+  
+}
+
+void get_sngl(vec,gstp,gendp,sngl)
+     char *vec;/*INPUT: sequence*/
+     int gstp,gendp; /* INPUT: starting and ending indices */
+     int *sngl;/*OUTPUT: the array giving counts for each nucleotide*/
+     
+     /* outputs the nucleotide counts for each single nucleotide,
+	sngl['A']=number of A's etc. */
+     
+{ int i;
+  int *v;
+  
+  v = (int *)our_alloc(125 * sizeof(int));
+  
+  /* initializations */
+  v['a'] = 0; v['A']=0;
+  v['c'] = 1; v['C']=1;
+  v['g'] = 2; v['G']=2;
+  v['t'] = 3; v['T']=3;
+  sngl[0]=0;
+  sngl[1]=0;
+  sngl[2]=0;
+  sngl[3]=0;
+  
+  for (i=gstp; i<=gendp; i++) { 
+    if (isalpha(vec[i])) sngl[v[vec[i]]]+=1;
+  }
+  our_free(v);
+  return;
+}
+
+
+float get_gc_content(vec,gstp,gendp)
+     char *vec; /* INPUT: sequence */
+     int gstp,gendp; /* INPUT: starting and ending indices */
+     
+     /* returns percentage gc content for the region
+	vec[stp] through and including vec[endp] */
+     
+{
+  int *sngl;
+  float prcnt;
+  
+  sngl = (int *)our_alloc(4 * sizeof(int));
+  
+  get_sngl(vec,gstp,gendp,sngl);
+  
+  prcnt = (float)(sngl[2] + sngl[1]);
+  prcnt = prcnt/(prcnt +(float)(sngl[3]+sngl[0]));
+  
+  our_free(sngl);
+  return(prcnt);
+}
+
+
+
+
+void old_melt_temp(vec, tstp, tendp, len,pgc,tm)
+     char *vec; /* INPUT: sequence */
+     int tstp,tendp; /* INPUT: starting and ending indices on vec
+			for which to calculate tm */
+     int len; /* INPUT: length of region */
+     float *pgc; /*OUTPUT: percent gc for that sequence */
+     float *tm; /*OUTPUT: tm for that sequence */
+     /* calculates melting temperatures and returns percnt gc as well*/
+{ int i;
+  float gc_content;
+  
+  /* equation taken from: */
+  
+  gc_content = get_gc_content(vec,tstp,tendp);
+  *pgc = gc_content;
+  
+  *tm = 62.3 + 0.41*(gc_content*100.0) - 500.0/(float)len;
+  
+  /*  Tm = 62.3 + 0.41*(%G+C) - 500/N where N is the length of the sequence*/
+  
+  return;
+}
+
+
+
+
+/* ---- Exports ---- */
+/* calculates scores for primers which the user provides, not
+   primers that the program searhces for.  Only used in
+   response to options 2 and 4 where the user asks for
+   scores for their own primers*/
+
+void get_scores(seq1,seq2,stp1,endp1,stp2,endp2,same,orient,OSP_RESULTS)
+     OSP_Results *OSP_RESULTS; /* results structure see struct.h */
+     char *seq1;  /*sequence containing sense primer */
+     int stp1,endp1;  /*starting and ending indices in their top
+			strand version for sense primer */
+     char *seq2;  /*sequence containing antisense primer, but
+		    written in its top strand form */
+     int stp2,endp2;  /*starting and ending indices wrt
+			bottom strand for antisense primer */
+     int same;  /*used for get_scores routine, 0 if you're sending in
+		  one sequence with flanking region numbers, 1 if your
+		  sending in two separate sequences */
+     /* if same is 1, orient doesn't matter */
+     int orient;  /*if two sequence files are entered, orient = 1 if the sense
+		    and antisense files are in the same orientation, orient = 2
+		    if they are in complementary orientation */
+
+     
+     /* calculates scores for given primers */
+{ int i,j,k;
+/*  int n, kkk; */
+  float hom=0,end_h=0,hom1=0,hom2=0,end_h1=0,end_h2=0;
+  float ohmlg=0,oend_hmlg=0; /*other sequence vs primer homologies*/
+  float o2hmlg=0,o2end_hmlg=0;
+  int num_to_compare;
+  float pgc1,tm1,pgc2,tm2,pgcall,tmall;
+  char *cseq2;
+  char *str;
+  float **weight;  /* weights: A-T = 2, T-A = 2, C-G = 4, G-C = 4 
+		   therefore, w['A']['T']=2
+		   These weights are used to assign scores to primer-
+		   primer homologies */
+  FILE *gp;
+  float sum_score;
+  float p1p_hmlg=0,p2p_hmlg=0; /* primer 1 or 2 vs product homology, calculated as greater
+			    of the scores when the primer is paired against both the
+			    top and bottom strand */
+  float p1p_hmlg_c=0,p2p_hmlg_c=0;  /* primer1 or 2 vs product homology against c_seq */
+  float p1p_end_hmlg=0,p2p_end_hmlg=0; /* primer 1 or 2 vs product 3' homology, calculated as greater
+				    of the scores when the primer is paired against both the
+				    top and bottom strand */
+  float p1p_end_hmlg_c=0,p2p_end_hmlg_c=0;  /* primer1 or 2 vs product 3' homology against c_seq */
+  char *strg;
+  int prod_len; /*calculated product length when more two primers from 
+		  same sequence */
+  float delta_tm; /*calculate delta_tm */
+  
+  int max_display_len=5000; /* most characters in the primer
+			       information output (scores etc)*/
+  int max_oligo_len=500;
+
+  weight = (float **)our_alloc(125 * sizeof(float *));
+  for (i=0; i<125; i++) weight[i]=(float *)our_alloc(125 * sizeof(float));
+
+  j = strlen(seq2) + 10;
+  cseq2 = (char *)our_alloc(j * sizeof(char));
+  str = (char *)our_alloc(max_display_len * sizeof(char));
+  strg= (char *)our_alloc(max_oligo_len * sizeof(char));
+  
+  /* the weight array is used to determine homology
+     scoring.  When the homology finds two nucleotides
+     that are opposites, using the is_opp array, then
+     it uses weight to determine the score for that
+     current nucleotide*/
+
+  get_weight_matrix(weight,prm.wt_ambig,prm.AT_score,prm.CG_score);
+  
+  seq_len = strlen(seq2)-1;
+
+  
+  if (same) get_compl_seq(cseq2,seq1,0,strlen(seq1)-1,seq_len);
+  else { /*two sequences must have been entered; 
+	   not same, so check orientation to see if you need to 
+	   get_compl_seq or not */
+    /* If orient=1, then the second sequence was entered in its top strand
+       form, so get the complementary sequence to find the antisense strand
+       off of */
+    if (orient == 1)
+      get_compl_seq(cseq2,seq2,stp2,endp2,seq_len); 
+    else { /* the second sequence already was entered in its bottom strand
+	      form, so do not complement it before looking for the antisense
+	      primer*/
+      for (i = 0; i <=seq_len; i++)
+	cseq2[i]=seq2[i];
+    }
+  }
+  
+  /* primer-primer homologies*/
+  if (!one_primer) {
+    hom = homology(seq1,stp1,endp1,cseq2,stp2,endp2,weight);
+    end_h = end_hom(seq1,stp1,endp1,cseq2,stp2,endp2,weight);
+    end_h2 =end_hom(cseq2,stp2,endp2,seq1,stp1,endp1,weight);
+    if (end_h2 > end_h) end_h = end_h2;
+  }
+  else {
+    hom = 0;
+    end_h = 0;
+  }
+  
+  /* primer-self homologies */
+
+  /* for primer 1*/
+  hom1 = homology(seq1,stp1,endp1,seq1,stp1,endp1,weight);
+  end_h1 = end_hom(seq1,stp1,endp1,seq1,stp1,endp1,weight);
+  /* for primer 2*/
+  if (!one_primer) {
+    hom2 = homology(cseq2,stp2,endp2,cseq2,stp2,endp2,weight);
+    end_h2 = end_hom(cseq2,stp2,endp2,cseq2,stp2,endp2,weight);
+  }
+  else {
+    hom2 = 0;
+    end_h2 = 0;
+  }
+  
+  /* primer-product homologies */
+  if ((same) && (!one_primer)) {
+    
+    p1p_hmlg = homology(seq1,stp1,endp1,seq1,endp1,seq_len-endp2,weight);
+    if (double_stranded) {
+      p1p_hmlg_c = homology(seq1,stp1,endp1,cseq2,endp2,seq_len-endp1,weight);
+      if (p1p_hmlg_c > p1p_hmlg) p1p_hmlg = p1p_hmlg_c;
+    }
+    
+    p1p_end_hmlg = end_hom(seq1,stp1,endp1,seq1,endp1,seq_len-endp2,weight);
+    if (double_stranded) {
+      p1p_end_hmlg_c = end_hom(seq1,stp1,endp1,cseq2,endp2,seq_len-endp1,weight);
+      if (p1p_end_hmlg_c > p1p_end_hmlg) p1p_end_hmlg = p1p_end_hmlg_c;
+    }
+    
+    p2p_hmlg = homology(cseq2,stp2,endp2,seq1,endp1,seq_len-endp2,weight);
+    if (double_stranded) {
+      p2p_hmlg_c = homology(cseq2,stp2,endp2,cseq2,endp2,seq_len-endp1,weight);
+      if (p2p_hmlg_c > p2p_hmlg) p2p_hmlg = p2p_hmlg_c;
+    }
+    
+    p2p_end_hmlg = end_hom(cseq2,stp2,endp2,seq1,endp1,seq_len-endp2,weight);
+    if (double_stranded) {
+      p2p_end_hmlg_c = end_hom(cseq2,stp2,endp2,cseq2,endp2,seq_len-endp1,weight);
+      if (p2p_end_hmlg_c > p2p_end_hmlg) p2p_end_hmlg = p2p_end_hmlg_c;
+    }
+  } /* if (same) and  not one_primer*/
+  else {
+    p1p_hmlg_c = 0;
+    p2p_hmlg_c = 0;
+    p1p_end_hmlg_c = 0;
+    p2p_end_hmlg_c = 0;
+    p1p_hmlg = 0;
+    p2p_hmlg = 0;
+    p1p_end_hmlg = 0;
+    p2p_end_hmlg = 0;
+  }
+  
+  
+  if (one_primer) { 
+    /* primer-rest of sequence homologies 
+       which is subject to the same criteria
+       as primer-other sequence homologies */
+    calc_primer_rest_of_seq_hom(seq1,cseq2,stp1,endp1,weight,seq_len,double_stranded,&ohmlg,&oend_hmlg);
+  }/* end of if (one_primer) */
+  if (!one_primer && same) {
+    /* calculate primer-rest of sequence homology
+       for primer pairs when one sequence was input */
+    /* for primer 1*/
+    calc_primer_rest_of_seq_hom(seq1,cseq2,stp1,endp1,weight,seq_len,double_stranded,&ohmlg,&oend_hmlg);
+    /* for primer 2*/
+    calc_primer_rest_of_seq_hom(cseq2,seq1,stp2,endp2,weight,seq_len,double_stranded,&o2hmlg,&o2end_hmlg);
+  }
+/* I do not calculate primer-rest of sequence homology when you input
+   two sequences becuase I assume that if you have input two sequences,
+   then those two sequences are the primers themselves so that primer-other
+   would be the same as primer-primer*/
+  
+  
+  
+  
+  /*---primer-other homologies ----*/
+  
+  
+  num_to_compare=num_other_seqs;
+  
+  /* calculate primer1 vs file of other sequences homologies*/
+  calc_prim_other_hmlg(seq1,stp1,stp2,weight,num_to_compare,&ohmlg,&oend_hmlg);
+
+
+  /* calculate primer2 vs file of other sequences homologies*/
+  if (!one_primer) 
+    calc_prim_other_hmlg(seq1,stp1,stp2,weight,num_to_compare,&o2hmlg,&o2end_hmlg);
+  
+  /* calculate primer1-melting temp*/
+  old_melt_temp(seq1,stp1,endp1,endp1-stp1+1,&pgc1,&tm1);
+
+  /* calculate primer2-melting temp*/
+  if (!one_primer) old_melt_temp(cseq2,stp2,endp2,endp2-stp2+1,&pgc2,&tm2);
+
+  /*calculate product melting temp for two primers*/
+  if ((same) && (!one_primer))
+    old_melt_temp(seq1,stp1,seq_len-stp2,seq_len-stp2-stp1+1,&pgcall,&tmall);
+
+  /*calculate product melting temp for single primers*/
+  if (one_primer)
+    old_melt_temp(seq1,stp1,seq_len,seq_len-stp1+1,&pgcall,&tmall);
+  
+  if (!same) {
+    prod_len=0;
+    tmall=0;
+    pgcall=0;
+  }
+  
+  /*calculate product length*/
+  if (same && (!one_primer)) prod_len=seq_len-stp2-stp1+1;
+  
+  if (one_primer) prod_len=seq_len-stp1+1;
+  
+  if (!one_primer) {
+    
+    delta_tm=tm2-tm1;
+    if (delta_tm<0) delta_tm*=-1;
+    
+    /* calculate the sum score*/
+    sum_score = 
+      prm.wt_prod_len*(float)prod_len + 
+	prm.wt_prod_gc*pgcall +
+	  prm.wt_prod_tm*tmall+ 	    
+	    prm.wt_prim_s_len*(float)(endp1-stp1+1) + 
+	      prm.wt_prim_a_len*(float)(endp2-stp2+1) + 
+		prm.wt_prim_s_gc*pgc1  + 	    
+		  prm.wt_prim_a_gc*pgc2  + 	    
+		    prm.wt_prim_s_tm*tm1  +     
+		      prm.wt_prim_a_tm*tm2  + 	    
+			prm.wt_self3_hmlg_cut*(end_h1+end_h2)  +
+			  prm.wt_selfI_hmlg_cut*(hom1+hom2) + 
+			    prm.wt_pp3_hmlg_cut*end_h + 
+			      prm.wt_ppI_hmlg_cut*hom + 
+				prm.wt_primprod3_hmlg_cut*(p1p_end_hmlg+p2p_end_hmlg) + 
+				  prm.wt_primprodI_hmlg_cut*(p1p_hmlg+p2p_hmlg) + 
+				    prm.wt_primother3_hmlg_cut*(oend_hmlg+o2end_hmlg) + 
+				      prm.wt_primotherI_hmlg_cut*(ohmlg+o2hmlg) + 
+					prm.wt_delta_tm_cut*delta_tm;
+    
+  }
+  else {
+    sum_score = 
+      prm.wt_prod_len*(float)prod_len + 
+	prm.wt_prim_s_len*(float)(endp1-stp1+1) + 
+	  prm.wt_prim_s_gc*pgc1  + 	    
+	    prm.wt_prim_s_tm*tm1  + 	    
+	      prm.wt_self3_hmlg_cut*end_h1  +
+		prm.wt_selfI_hmlg_cut*hom1 + 
+		  prm.wt_primprod3_hmlg_cut*p1p_hmlg + 
+		    prm.wt_primprodI_hmlg_cut*p2p_hmlg + 
+		      prm.wt_primother3_hmlg_cut*oend_hmlg +
+			prm.wt_primotherI_hmlg_cut*ohmlg;
+  }
+  
+  sprintf(str,"Filename(s): %s  %s\n",seq_filename,seq2_fn);
+  
+  if (!one_primer) {
+    
+    if (program_version==2) {
+      
+      j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+      sprintf(str+j,"                                          5' end  3' end  length G+C(%%)  Tm \n");
+      
+      j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+      sprintf(str+j,"OLIGO1:  ");
+      j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+
+
+      /* make sure you have not under allocated for primer strg*/
+      if ((endp1-stp1+1) > max_oligo_len) popUpErrorMessage();
+      k=0;
+      for (i = stp1; i <= endp1; i++)
+	strg[k++]=seq1[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s ",strg);
+      j = strlen(str); 
+      sprintf(str + j," %6d  %6d  %6d  %4d   %4.1f\n",
+	      stp1+1,endp1+1,endp1-stp1+1,(int)(pgc1*100),tm1);
+      
+      j = strlen(str); 
+      sprintf(str+j,"OLIGO2:  ");
+      j = strlen(str); 
+      k=0;
+      /*make sure you have not under allocated for strg*/
+      if ((endp2-stp2+1) > max_oligo_len) popUpErrorMessage();
+      for (i = stp2; i <= endp2; i++)
+	strg[k++]=cseq2[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s ",strg);
+      
+      
+      j = strlen(str); 
+      sprintf(str + j," %6d  %6d  %6d  %4d   %4.1f\n",
+	      (seq_len-stp2)+1,(seq_len-endp2)+1,endp2-stp2+1,(int)(pgc2*100),tm2);
+      
+      
+      j = strlen(str); 
+      sprintf(str+j,"           PRIMER-SELF     PRIMER-PRIMER   PRIMER-PRODUCT   PRIMER-OTHER\n");
+      j = strlen(str); 
+      sprintf(str+j,"           3'   Internal   3'    Internal  3'    Internal   3'    Internal\n");
+      j = strlen(str); 
+      sprintf(str + j,"OLIGO1:  %4.1f    %4.1f                     %4.1f     %4.1f    %4.1f     %4.1f\n",
+	      end_h1,hom1,p1p_end_hmlg,p1p_hmlg,oend_hmlg,ohmlg);
+      
+      j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+      sprintf(str + j,"OLIGO2:  %4.1f    %4.1f    %4.1f     %4.1f ",end_h2,hom2,end_h,hom);
+      
+      j = strlen(str); 
+      sprintf(str + j,"   %4.1f     %4.1f    %4.1f     %4.1f\n",
+	      p2p_end_hmlg,p2p_hmlg,o2end_hmlg,o2hmlg);
+      
+      
+      if (same) {
+	j = strlen(str); 
+	sprintf(str+j,"        PRODUCT\n");
+	j = strlen(str); 
+	sprintf(str+j," Length  G+C(%%)   Tm                                 Total Score\n");
+	j = strlen(str); 
+	sprintf(str + j,"%6d    %3d    %4.1f                                  %6.1f\n",seq_len - stp2 - stp1 + 1,
+		(int)(pgcall*100),tmall,sum_score);
+      }
+      else {
+	j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+	sprintf(str+j,"Total Score: %6.1f\n",sum_score);
+      }
+      
+    } /* if program_version==2*/
+    else {
+      
+      j = strlen(str);
+      sprintf(str+j,"                                                                                 PRIMER-SELF   PRIMER-PRIMER   PRIMER-PRODUCT   PRIMER-OTHER");
+      if (same) {
+	j = strlen(str);
+	sprintf(str+j,"                     PRODUCT\n");
+      }
+      else {
+	j = strlen(str);
+	sprintf(str+j,"\n");
+      }
+      j = strlen(str); 
+      sprintf(str+j,"                                          5' end  3' end  length G+C(%%)  Tm     3'   Internal   3'    Internal  3'   Internal   3'   Internal");
+      
+      if (same) {
+	j = strlen(str);
+	sprintf(str+j,"  Score       Length  G+C(%%)   Tm  \n");
+      }
+      else {
+	j = strlen(str);
+	sprintf(str+j,"\n");
+      }
+      
+      j = strlen(str); 
+      sprintf(str+j,"OLIGO1:  ");
+      j = strlen(str); 
+      k=0;
+      /* make sure you have not under allocated for primer strg*/
+      if ((endp1-stp1+1) > max_oligo_len) popUpErrorMessage();
+      for (i = stp1; i <= endp1; i++)
+	strg[k++]=seq1[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s ",strg);
+      j = strlen(str); if (j>max_display_len) popUpErrorMessage();
+      sprintf(str + j," %6d  %6d  %6d  %4d   %4.1f  %4.1f    %4.1f                    %4.1f     %4.1f   %4.1f     %4.1f",
+	      stp1+1,endp1+1,endp1-stp1+1,(int)(pgc1*100),tm1,end_h1,hom1,p1p_end_hmlg,
+	      p1p_hmlg,oend_hmlg,ohmlg);
+      if (same) {
+	j = strlen(str); 
+	sprintf(str + j,"                 %6d    %3d    %4.1f\n",seq_len - stp2 - stp1 + 1,
+		(int)(pgcall*100),tmall);
+      }
+      else {
+	j = strlen(str);
+	sprintf(str+j,"\n");
+      }
+      
+      j = strlen(str); 
+      sprintf(str+j,"OLIGO2:  ");
+      j = strlen(str); 
+
+
+      /* make sure you have not under allocated for primer strg*/
+      if ((endp2-stp2+1) > max_oligo_len) popUpErrorMessage();
+
+      k=0;
+      for (i = stp2; i <= endp2; i++)
+	strg[k++]=cseq2[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s ",strg);
+      
+      
+      j = strlen(str); 
+      sprintf(str + j," %6d  %6d  %6d  %4d   %4.1f  %4.1f    %4.1f    %4.1f     %4.1f ",
+	      (seq_len-stp2)+1,(seq_len-endp2)+1,endp2-stp2+1,(int)(pgc2*100),
+	      tm2,end_h2,hom2,end_h,hom);
+      
+      j = strlen(str); 
+      sprintf(str + j,"  %4.1f     %4.1f   %4.1f     %4.1f   %7.1f",
+	      p2p_end_hmlg,p2p_hmlg,o2end_hmlg,o2hmlg,sum_score);
+    } /*else that means, must be program_version !=2*/
+  } /*if !one_primer*/
+  else { /* must be one_primer*/
+    j = strlen(str); 
+    /* if program_version==2, the score output must fit on the screen,
+       and no scrolling allowed */
+    if (program_version == 2) {
+      j = strlen(str); 
+      sprintf(str+j,"OLIGO1:  ");
+      j = strlen(str); 
+
+      /* make sure you have not under allocated for primer strg*/
+      if ((endp1-stp1+1) > max_oligo_len) popUpErrorMessage();
+      k=0;
+      for (i = stp1; i <= endp1; i++)
+	strg[k++]=seq1[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s \n",strg);
+    }
+    j = strlen(str); 
+    if (program_version!=2) sprintf(str+j,"                                        ");
+    j = strlen(str);
+    sprintf(str+j,"                                        PRIMER-SELF  PRIMER-OTHER\n");
+    j = strlen(str);
+    if (program_version!=2) sprintf(str+j,"                                         ");
+    j = strlen(str);
+    sprintf(str+j,"  5' end  3' end  length G+C(%%)  Tm    3'  Internal  3'   Internal\n");
+    if (program_version != 2) {
+      j = strlen(str);
+      sprintf(str+j,"OLIGO1:  ");
+      j = strlen(str);
+
+      /* make sure you have not under allocated for primer strg*/
+      if ((endp1-stp1+1) > max_oligo_len) popUpErrorMessage();
+      k=0;
+      for (i = stp1; i <= endp1; i++)
+	strg[k++]=seq1[i];
+      strg[k]='\0';
+      sprintf(str+j,"%-30s ",strg);
+    }
+    j = strlen(str); 
+    sprintf(str + j," %6d  %6d  %6d  %4d   %4.1f  %4.1f    %4.1f    %4.1f     %4.1f\n",
+	    stp1+1,endp1+1,endp1-stp1+1,(int)(pgc1*100),tm1,end_h1,hom1,oend_hmlg,
+	    ohmlg);
+    j = strlen(str); 
+    sprintf(str + j,"                                                                                                                SCORE: %7.1f",sum_score);
+
+
+
+  } /* must be one_primer*/
+  
+ if (j>max_display_len) popUpErrorMessage();
+
+#ifdef SUBVERSION
+    if (program_version==3)
+      score_to_xdap(stp1,endp1,pgc1,tm1,end_h1,hom1,oend_hmlg,ohmlg,sum_score,OSP_RESULTS);
+#endif
+
+  if (program_version!=3)
+      message(str); /* print the str to the results Widget or screen*/
+  
+  if (program_version == 2) {
+    char *ans;
+    ans = (char *)our_alloc(25*sizeof(char));
+    
+    printf("\nWould you like to output this information to a file? ");
+    scanf("%s",ans);
+    if (strlen(ans) > 25) {
+      message("Answer longer than 25 characters\n");
+      popUpErrorMessage();
+    }
+    if ((ans[0]=='y') || (ans[0]=='Y')) {
+      printf("Output filename? ");
+      scanf("%s",output_fn);
+      if (strlen(output_fn) > MAX_NAME_SIZE) 
+	printf("Output filename longer than MAX_NAME_SIZE, %d, allowed.\n Please use a shorter name\n",MAX_NAME_SIZE);
+
+
+      
+      if ((gp=fopen(output_fn,"r"))!=NULL) {
+	printf("File %s exists. Would you like to \n (a) delete existing information?\n (b) append to existing information\n (c) specify a new output filename\n",output_fn);
+	scanf("%s",ans);
+	if ((ans[0]=='a') || (ans[0]=='A')) {
+	  /* wipe out the file */
+	  if ((gp=fopen(output_fn,"w"))==NULL) exit(1);
+	}
+	else {
+	  if ((ans[0]=='C') || (ans[0]=='c')) {
+	    printf("\nOutput filename? ");
+	    scanf("%s",output_fn);
+	    if (strlen(output_fn) > MAX_NAME_SIZE) 
+	      printf("Output filename longer than MAX_NAME_SIZE, %d, allowed.\n Please use a shorter name\n",MAX_NAME_SIZE);
+
+	  }
+	}
+	
+      }
+      fclose(gp);
+#if defined(XVERSION) || defined(TEXTVERSION)     
+      text_to_output(str,0,strlen(str),1,output_fn);
+#endif
+      
+    } /* if ans=yes*/
+    our_free(ans);
+  }/*if program_version==2, i.e. text version */
+  
+  
+  buftext = 0;
+  our_free(cseq2);
+  our_free(str);
+  our_free(strg);
+  for (i=0; i<125; i++) our_free(weight[i]);
+  our_free(weight);
+  return;
+}
+
+
+
+
diff --git a/src/bap/osp-bits/makefile-alpha b/src/bap/osp-bits/makefile-alpha
new file mode 100644
index 0000000..a3ccd57
--- /dev/null
+++ b/src/bap/osp-bits/makefile-alpha
@@ -0,0 +1,50 @@
+#
+# Makefile for osp (oligo selection program) - Alpha version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subosp
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# X VERSION compilation flag
+#
+VERSION = SUBVERSION
+
+
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g -non_shared -std1 -ieee_with_inexact -D$(VERSION)
+
+
+#
+# Object files
+#
+XOSPOBJS= our_allo.o analysis.o get_scores.o paramIO.o paramIOX.o utils.o
+
+#
+# Linking
+#
+subosp: $(XOSPOBJS)
+
+
+#
+# Miscellaneous
+#
+all: $(PROGS)
+
+clean:
+	rm -f $(PROGS) $(OSPOBJS) *~ 
+ 
+%.o:	$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/bap/osp-bits/makefile-dec b/src/bap/osp-bits/makefile-dec
new file mode 100644
index 0000000..c2c5d75
--- /dev/null
+++ b/src/bap/osp-bits/makefile-dec
@@ -0,0 +1,50 @@
+#
+# Makefile for osp (oligo selection program) - DEC Ultrix (mips) version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subosp
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# X VERSION compilation flag
+#
+VERSION = SUBVERSION
+
+
+
+#
+# Compiler - debug on
+#
+CC= c89 
+CFLAGS= -g -common -D$(VERSION) -I/usr/include/mit
+
+
+#
+# Object files
+#
+XOSPOBJS= our_allo.o analysis.o get_scores.o paramIO.o paramIOX.o utils.o
+
+#
+# Linking
+#
+subosp: $(XOSPOBJS)
+
+
+#
+# Miscellaneous
+#
+all: $(PROGS)
+
+clean:
+	rm -f $(PROGS) $(OSPOBJS) *~ 
+ 
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/bap/osp-bits/makefile-sgi b/src/bap/osp-bits/makefile-sgi
new file mode 100644
index 0000000..37b2b0a
--- /dev/null
+++ b/src/bap/osp-bits/makefile-sgi
@@ -0,0 +1,52 @@
+#
+# Makefile for osp (oligo selection program) - SGI Iris version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs made by this makefile
+#
+PROGS = subosp
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# X VERSION compilation flag
+#
+VERSION = SUBVERSION
+
+
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g -ansi -prototypes -D$(VERSION)
+
+
+#
+# Object files
+#
+XOSPOBJS= our_allo.o analysis.o get_scores.o paramIO.o paramIOX.o utils.o
+
+#
+# Linking
+#
+subosp: $(XOSPOBJS)
+
+
+#
+# Miscellaneous
+#
+all: $(PROGS)
+
+clean:
+	rm -f $(PROGS) $(OSPOBJS) *~ 
+ 
+%.o:	$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/bap/osp-bits/makefile-solaris b/src/bap/osp-bits/makefile-solaris
new file mode 100644
index 0000000..fcd2853
--- /dev/null
+++ b/src/bap/osp-bits/makefile-solaris
@@ -0,0 +1,51 @@
+#
+# Makefile for osp (oligo selection program) - Sun Version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subosp
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# X VERSION compilation flag
+#
+VERSION = SUBVERSION
+
+
+
+#
+# Compiler - debug on
+#
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+CC= gcc  -D$(VERSION) -O
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) -I/usr/openwin/include
+
+
+#
+# Object files
+#
+XOSPOBJS= our_allo.o analysis.o  get_scores.o paramIO.o paramIOX.o utils.o
+
+#
+# Linking
+#
+subosp: $(XOSPOBJS)
+
+
+#
+# Miscellaneous
+#
+all: $(PROGS)
+
+clean:
+	rm -f $(PROGS) $(OSPOBJS) *~ 
+ 
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/bap/osp-bits/makefile-sun b/src/bap/osp-bits/makefile-sun
new file mode 100644
index 0000000..61bb937
--- /dev/null
+++ b/src/bap/osp-bits/makefile-sun
@@ -0,0 +1,52 @@
+#
+# Makefile for osp (oligo selection program) - Sun Version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subosp
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# X VERSION compilation flag
+#
+VERSION = SUBVERSION
+
+
+
+#
+# Compiler - debug on
+#
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CC= gcc  -D$(VERSION) -O
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) 
+
+
+#
+# Object files
+#
+XOSPOBJS= our_allo.o analysis.o  get_scores.o paramIO.o paramIOX.o utils.o
+
+#
+# Linking
+#
+subosp: $(XOSPOBJS)
+
+
+#
+# Miscellaneous
+#
+all: $(PROGS)
+
+clean:
+	rm -f $(PROGS) $(OSPOBJS) *~ 
+ 
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/bap/osp-bits/our_allo.c b/src/bap/osp-bits/our_allo.c
new file mode 100644
index 0000000..337394f
--- /dev/null
+++ b/src/bap/osp-bits/our_allo.c
@@ -0,0 +1,126 @@
+#if vms
+#include stdio
+#else
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#endif
+
+#include "defs.h"
+
+#if 0
+
+typedef double ALIGN;
+
+union header {
+	struct {
+		union header *ptr;
+		ALLOC size;
+	} s;
+	ALIGN  x;
+};
+
+typedef union header HEADER;
+
+static HEADER base;
+static HEADER *allocp = NULL;	
+
+#define  NALLOC  63000
+
+static HEADER *morecore(nu)
+        ALLOC nu;
+{
+/*	char *malloc();*/
+	char *sbrk();
+	char *cp;
+	HEADER *up;
+	ALLOC rnu;
+        SHORT our_free();
+
+	rnu = NALLOC * ((nu + NALLOC - 1) / NALLOC);
+/*        printf("\n%ld bytes allocated in morecore\n", rnu * sizeof(HEADER));*/
+/*	cp = malloc(rnu * sizeof(HEADER));*/
+	cp = sbrk(rnu * sizeof(HEADER));
+	if (!cp) {
+           printf("\n ERROR: ALLOCATION FAILED IN MORECORE\n");
+           exit();  
+        }
+	up = (HEADER *)cp;
+	up->s.size = rnu;
+	our_free((char *)(up + 1));
+	return(allocp);
+}
+
+char *our_alloc(nbytes)
+        ALLOC nbytes;
+{
+	HEADER *morecore();
+	HEADER *p;
+	HEADER *q;
+	ALLOC nunits;
+
+	nunits = 1 +(nbytes + sizeof(HEADER) - 1)/ sizeof(HEADER);
+	if ((q = allocp) == NULL) {
+		base.s.ptr = allocp = q = &base;
+		base.s.size = 0;
+	}
+	for (p = q->s.ptr;;q = p, p = p->s.ptr) {
+		if (p->s.size >= nunits){
+			if(p->s.size == nunits) q->s.ptr = p->s.ptr;
+			else {
+				p->s.size -= nunits;
+				p += p->s.size;
+				p->s.size = nunits;
+			}
+			allocp = q;
+			return ((char *)(p + 1));
+		}
+		if (p == allocp)
+			if ((p = morecore(nunits)) == NULL) return (NULL);
+	}
+}
+
+find_frags()
+{
+	SHORT num_frags;
+	HEADER *p;
+
+	for (p = allocp->s.ptr, num_frags = 0; p != allocp;
+            p = p->s.ptr, num_frags++);
+        printf("\n number of allocation fragments = %d \n", num_frags);
+}
+
+SHORT our_free(ap)
+char *ap;
+{
+	HEADER *p;
+	HEADER *q;
+	
+	p = (HEADER *)ap - 1;
+	for (q = allocp; !(p > q && p < q->s.ptr); q = q->s.ptr)
+		if (q>= q->s.ptr && (p > q || p < q->s.ptr)) break;
+
+	if ( p + p->s.size == q->s.ptr){
+		p->s.size += q->s.ptr->s.size;
+		p->s.ptr = q->s.ptr->s.ptr;
+	} else p->s.ptr = q->s.ptr;
+	if (q + q->s.size == p){
+		q->s.size += p->s.size;
+		q->s.ptr = p->s.ptr;
+	} else q->s.ptr = p;
+	allocp = q;
+        return(0);
+}
+ 
+#else
+
+char *our_alloc(int nbytes) {
+    return (char *)malloc((size_t)nbytes);
+}
+
+SHORT our_free(void *ap) {
+    free(ap);
+    return (SHORT)0;
+}
+
+#endif
diff --git a/src/bap/osp-bits/our_allo.h b/src/bap/osp-bits/our_allo.h
new file mode 100644
index 0000000..c7b244f
--- /dev/null
+++ b/src/bap/osp-bits/our_allo.h
@@ -0,0 +1,4 @@
+#include "defs.h"
+
+extern char *our_alloc(int nbytes);
+extern SHORT our_free(void *ap);
diff --git a/src/bap/osp-bits/paramIO.c b/src/bap/osp-bits/paramIO.c
new file mode 100644
index 0000000..c40f24e
--- /dev/null
+++ b/src/bap/osp-bits/paramIO.c
@@ -0,0 +1,617 @@
+/* 
+  Program Name: paramIO
+  File name: paramIO.c
+  Purpose: input and output of constraint information, reads constraints
+  from files or from the keyboard, prints constraints out to file or
+  screen, etc.
+  Last Update: Apr 15 1991
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Change Log:
+*/
+
+
+/* ---- Includes ---- */
+#include <stdlib.h>
+#include <string.h>
+#include "our_allo.h"
+#include "paramIO.h"
+#include "defn.h" /* macros and stdio */
+#include "struct.h"/* global structure defns */
+#include "Xmess.h" /* IMPORT: message, popupMessage */
+#include "utils.h" /* IMPORT: parse_chars, text_to_output, space_file_to_vec */
+
+ extern Prm prm;
+ extern int program_version;
+ extern int program_option;
+#if defined(XVERSION) || defined(TEXTVERSION)
+ extern char *other_seqfn;
+#endif
+
+ extern int otherRight;
+ extern int otherDouble_stranded;
+ extern int double_stranded;
+
+
+#ifdef SUBVERSION
+char other_seqfn[1];
+#endif
+
+
+/* ---- Exports ---- */
+void print_params(astr)
+/* print constraints to astr */
+ char *astr;
+ { int j;
+
+   j = strlen(astr);
+ sprintf(astr+j," PROD_LEN_MIN %d",prm.prod_len_low);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PROD_LEN_MAX %d ",prm.prod_len_high);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PROD_GC_MIN %3.0f",(prm.prod_gc_low*100.0));
+   j = strlen(astr);
+ sprintf(astr+j,"\n PROD_GC_MAX %3.0f ",(prm.prod_gc_high*100.0));
+   j = strlen(astr);
+ sprintf(astr+j,"\n PROD_TM_MIN %4.1f ",prm.prod_tm_low);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PROD_TM_MAX %4.1f",prm.prod_tm_high);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_GC_MIN %3.0f",(prm.prim_gc_low*100.0));
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_GC_MAX %3.0f",(prm.prim_gc_high*100.0));
+   j = strlen(astr);
+  sprintf(astr+j,"\n PRIM_LEN_MIN %d",prm.min_prim_len);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_LEN_MAX %d",prm.max_prim_len);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_TM_MIN %4.1f",prm.prim_tm_low);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_TM_MAX %4.1f",prm.prim_tm_high);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_TM_DIFF %4.1f",prm.delta_tm_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_NUCS %s",prm.end_nucs);
+   j = strlen(astr);
+ sprintf(astr+j,"\n AT_SCORE %4.1f",prm.AT_score);
+   j = strlen(astr);
+ sprintf(astr+j,"\n CG_SCORE %4.1f",prm.CG_score);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_AMBIG %5s",prm.wt_ambig);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_SELF_I_ANN %4.1f",prm.selfI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_SELF_3_ANN %4.1f ",prm.self3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_PRIM_I_ANN %4.1f",prm.ppI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_PRIM_3_ANN %4.1f ",prm.pp3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_PROD_I_ANN %4.1f",prm.primprodI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_PROD_3_ANN %4.1f ",prm.primprod3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_OTHER_I_ANN %4.1f",prm.primotherI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n PRIM_OTHER_3_ANN %4.1f ",prm.primother3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j," \n WT_PROD_LEN %5.2f",prm.wt_prod_len);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PROD_GC %5.2f",prm.wt_prod_gc);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PROD_TM %5.2f",prm.wt_prod_tm);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_1_GC %5.2f",prm.wt_prim_s_gc);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_2_GC %5.2f",prm.wt_prim_a_gc);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_1_LEN %5.2f",prm.wt_prim_s_len);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_2_LEN %5.2f",prm.wt_prim_a_len);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_1_TM %5.2f",prm.wt_prim_s_tm);
+   j = strlen(astr);
+   sprintf(astr+j,"\n WT_PRIM_2_TM %5.2f",prm.wt_prim_a_tm);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_TM_DIFF %5.2f",prm.wt_delta_tm_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_SELF_I_ANN %5.2f",prm.wt_selfI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_SELF_3_ANN %5.2f",prm.wt_self3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_PRIM_I_ANN %5.2f",prm.wt_ppI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_PRIM_3_ANN %5.2f",prm.wt_pp3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_PROD_I_ANN %5.2f",prm.wt_primprodI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_PROD_3_ANN %5.2f",prm.wt_primprod3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_OTHER_I_ANN %5.2f",prm.wt_primotherI_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n WT_PRIM_OTHER_3_ANN %5.2f",prm.wt_primother3_hmlg_cut);
+   j = strlen(astr);
+ sprintf(astr+j,"\n NUM_STRANDS %d",double_stranded+1);
+   j = strlen(astr);
+ sprintf(astr+j,"\n OTHER_SEQ_NAME %s",other_seqfn);
+   j = strlen(astr);
+ sprintf(astr+j,"\n OTHER_SEQ_NUM_STRAND %d",otherDouble_stranded+1);
+ j = strlen(astr);
+ if (otherRight==1) 
+   sprintf(astr+j,"\n OTHER_SEQ_STRAND top");
+ else
+   sprintf(astr+j,"\n OTHER_SEQ_STRAND bottom");
+   j = strlen(astr);
+  sprintf(astr+j,"\n *\n");
+   j = strlen(astr);
+ }
+
+
+
+int ReadDef(def_fn, Xup)
+char *def_fn;
+int Xup; /* 1 if X is up and running, 0 if this is
+	    the text version or if this is the X version
+	    and the default filename was a command line paramter;
+	    then I can not use the popupmessage or message to
+	    an x window because the x application has not
+	    started up yet*/
+/* read in the default file , returns
+ a 0 if there was a problem when reading in the parameter file*/
+{ 
+       int *intgrs;
+       char **alphas;
+       char *str;
+       int i;
+       FILE *gp;
+       int n,j;
+       int line_count,vec_st,max_n;
+       int readdef_okay=1;
+       char *mess_str;
+       int max_param_str_len=5000;
+       
+       
+       str = (char *)our_alloc(max_param_str_len * sizeof(char ));
+       mess_str = (char *)our_alloc(1000 * sizeof(char ));
+       intgrs = (int *)our_alloc(MAX_I * sizeof(int));
+       alphas = (char **)our_alloc(MAX_WORDS * sizeof(char *));
+       for (i=0; i < MAX_WORDS; i++) alphas[i] = (char *)our_alloc(MAX_WORD_SIZE*sizeof(char));
+       
+       max_n = 3000;
+       n=0;
+       
+  /* open the default parameters file to see if any old contents are there*/
+  if ((gp=fopen(def_fn,"r"))==NULL) {
+    if (!Xup)
+      printf("No file found with name %s\n original default parameters assumed\n",def_fn);
+    else {
+      sprintf(str,"No file found with name %s\n original default parameters assumed\n",def_fn);
+    message(str);
+    }
+    readdef_okay=0;
+  }
+  else {
+    if (getc(gp)==EOF) {
+      if (!Xup)
+	printf("File %s is empty, original default parameters assumed.\n",def_fn);
+      else {
+	sprintf(str,"File %s is empty, original default parameters assumed.\n",def_fn);
+	message(str);
+      }
+      readdef_okay=0;
+    }
+    else { /*read in old default parameters */
+      readdef_okay=space_file_to_vec(def_fn, str, &vec_st, max_n,mess_str);
+      if (!readdef_okay) {
+	message(mess_str);
+	goto freeVar;
+      }
+      if (strlen(str)>max_param_str_len) {
+	message("Parameter file larger than max size allowable");
+	popUpErrorMessage();
+	readdef_okay=0;
+	goto freeVar;
+      }
+
+      line_count=1;
+      for (i=0; str[i]; i++) {
+	line_count++;
+        /*get the first line and 
+	  put spaces in place of any equal signs*/
+	for (j=i; str[j] != '\n'; j++) if (str[j]=='=') str[j]=' ';
+	parse_chars(str,intgrs,i,j,alphas);
+	if (read_params(alphas[0],alphas[1],intgrs[0])==0) break;
+	j++;
+	i=j;
+      } /* for i = 0... */
+    } /* else*/
+  } /* else*/
+	fclose(gp);
+
+     freeVar:;
+       our_free(str);
+       our_free(mess_str);
+       our_free(intgrs);
+      for (i = 0; i < MAX_WORDS; i++) our_free(alphas[i]);
+       our_free(alphas);
+       return(readdef_okay);
+     }
+
+
+
+int read_params(var_name,curr_string,curr_int)
+char *var_name;
+char *curr_string;
+int curr_int;
+/*this fcn is used to interpret strings in .def file or from keyboard*/
+/*     var_name is the name of the variable, 
+       curr_string is the string value of the variable;
+       curr_int is the first integer on the line, usually the
+	value of the variable,
+	
+	this function returns a 0 if it is the last parameter
+	and a 1 if there must be more; it thinks its the last
+	 parameter if the var_name is '*' */
+{  int i;
+
+  if (!strcmp(var_name,"*")) return(0);
+  if (!strcmp(var_name,"PRIM_NUCS")) {
+    if (strlen(curr_string) > MAX_NAME_SIZE)
+      popupMessage("Length of PRIM_NUCS is larger than size, MAX_NAME_SIZE, allowable\nChoice of PRIM_NUCS ignored.");
+    else
+      strcpy(prm.end_nucs,curr_string);
+    return(1);
+  }
+
+  if (!strcmp(var_name,"PROD_LEN_MIN")) prm.prod_len_low = curr_int;
+  else if (!strcmp(var_name,"PROD_LEN_MAX")) prm.prod_len_high = curr_int;
+  else if (!strcmp(var_name,"PROD_GC_MIN")) {
+    sscanf(curr_string,"%f",&prm.prod_gc_low);
+    prm.prod_gc_low/=100.0;
+  }
+  else if (!strcmp(var_name,"PROD_GC_MAX")) {
+    sscanf(curr_string,"%f",&prm.prod_gc_high);
+    prm.prod_gc_high/=100.0;
+  }
+  else if (!strcmp(var_name,"PROD_TM_MIN")) 
+    sscanf(curr_string,"%f",&prm.prod_tm_low);
+  else if (!strcmp(var_name,"PROD_TM_MAX")) 
+    sscanf(curr_string,"%f",&prm.prod_tm_high);
+  else if (!strcmp(var_name,"PRIM_LEN_MIN")) prm.min_prim_len = curr_int;
+  else if (!strcmp(var_name,"PRIM_LEN_MAX")) prm.max_prim_len = curr_int;
+  else if (!strcmp(var_name,"PRIM_GC_MIN")) {
+    sscanf(curr_string,"%f",&prm.prim_gc_low);
+    prm.prim_gc_low/=100.0;
+  }
+  else if (!strcmp(var_name,"PRIM_GC_MAX")) {
+    sscanf(curr_string,"%f",&prm.prim_gc_high);
+    prm.prim_gc_high/=100.0;
+  }
+  else if (!strcmp(var_name,"PRIM_TM_MIN"))
+     sscanf(curr_string,"%f",&prm.prim_tm_low);
+  else if (!strcmp(var_name,"PRIM_TM_MAX")) 
+     sscanf(curr_string,"%f",&prm.prim_tm_high);
+  else if (!strcmp(var_name,"PRIM_SELF_I_ANN")) 
+     sscanf(curr_string,"%f",&prm.selfI_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_SELF_3_ANN")) 
+     sscanf(curr_string,"%f",&prm.self3_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_PRIM_I_ANN"))
+     sscanf(curr_string,"%f",&prm.ppI_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_PRIM_3_ANN"))
+     sscanf(curr_string,"%f",&prm.pp3_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_PROD_I_ANN"))
+    sscanf(curr_string,"%f",&prm.primprodI_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_PROD_3_ANN")) 
+    sscanf(curr_string,"%f",&prm.primprod3_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_OTHER_I_ANN"))
+    sscanf(curr_string,"%f",&prm.primotherI_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_OTHER_3_ANN"))
+    sscanf(curr_string,"%f",&prm.primother3_hmlg_cut);
+  else if (!strcmp(var_name,"PRIM_TM_DIFF"))
+     sscanf(curr_string,"%f",&prm.delta_tm_cut);
+  else if (!strcmp(var_name,"WT_PROD_LEN")) 
+    sscanf(curr_string,"%f",&prm.wt_prod_len);
+  else if (!strcmp(var_name,"WT_PROD_GC"))
+    sscanf(curr_string,"%f",&prm.wt_prod_gc);
+  else if (!strcmp(var_name,"WT_PROD_TM"))
+    sscanf(curr_string,"%f",&prm.wt_prod_tm);
+  else if (!strcmp(var_name,"WT_PRIM_1_LEN"))
+    sscanf(curr_string,"%f",&prm.wt_prim_s_len);
+  else if (!strcmp(var_name,"WT_PRIM_2_LEN"))
+    sscanf(curr_string,"%f",&prm.wt_prim_a_len);
+  else if (!strcmp(var_name,"WT_PRIM_1_GC"))
+    sscanf(curr_string,"%f",&prm.wt_prim_s_gc);
+  else if (!strcmp(var_name,"WT_PRIM_2_GC"))
+    sscanf(curr_string,"%f",&prm.wt_prim_a_gc);
+  else if (!strcmp(var_name,"WT_PRIM_1_TM"))
+    sscanf(curr_string,"%f",&prm.wt_prim_s_tm);
+  else if (!strcmp(var_name,"WT_PRIM_2_TM"))
+    sscanf(curr_string,"%f",&prm.wt_prim_a_tm);
+  else if (!strcmp(var_name,"WT_PRIM_TM_DIFF"))
+    sscanf(curr_string,"%f",&prm.wt_delta_tm_cut);
+  else if (!strcmp(var_name,"WT_PRIM_SELF_I_ANN"))
+    sscanf(curr_string,"%f",&prm.wt_selfI_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_SELF_3_ANN")) 
+    sscanf(curr_string,"%f",&prm.wt_self3_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_PRIM_I_ANN"))
+    sscanf(curr_string,"%f",&prm.wt_ppI_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_PRIM_3_ANN"))
+    sscanf(curr_string,"%f",&prm.wt_pp3_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_PROD_I_ANN")) 
+    sscanf(curr_string,"%f",&prm.wt_primprodI_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_PROD_3_ANN"))
+    sscanf(curr_string,"%f",&prm.wt_primprod3_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_OTHER_I_ANN"))
+    sscanf(curr_string,"%f",&prm.wt_primotherI_hmlg_cut);
+  else if (!strcmp(var_name,"WT_PRIM_OTHER_3_ANN")) 
+    sscanf(curr_string,"%f",&prm.wt_primother3_hmlg_cut);
+  else if (!strcmp(var_name,"AT_SCORE")) 
+    sscanf(curr_string,"%f",&prm.AT_score);
+  else if (!strcmp(var_name,"CG_SCORE")) 
+    sscanf(curr_string,"%f",&prm.CG_score);
+  else if (!strcmp(var_name,"WT_AMBIG")) {
+    if (strlen(curr_string)>6) 
+      popupMessage("WT_AMBIG should be full or avg\nValue for WT_AMBIG ignored\n");
+    else
+	strcpy(prm.wt_ambig,curr_string);
+  }
+  else if (!strcmp(var_name,"NUM_STRANDS")) 
+    double_stranded=curr_int-1;
+  else if (!strcmp(var_name,"OTHER_SEQ_NAME")) {
+    if (strlen(curr_string)>MAX_NAME_SIZE) 
+      popupMessage("OTHER_SEQ_NAME longer than MAX_NAME_SIZE\nPlease use a shorter name.\n");
+    else
+      sscanf(curr_string,"%s",other_seqfn);
+  }
+  else if (!strcmp(var_name,"OTHER_SEQ_STRAND")) {
+    for (i=0; curr_string[i]; i++) {
+      if (curr_string[i]==' ') i--;
+      else if (isupper(curr_string[i])) curr_string[i]=tolower(curr_string[i]);
+    }
+    if (!strcmp(curr_string,"bottom"))
+      otherRight=0;
+    else otherRight=1;
+  }
+  else if (!strcmp(var_name,"OTHER_SEQ_NUM_STRAND"))
+    otherDouble_stranded=curr_int-1;
+  else  printf("\n\n ERROR: parameter name %s unknown\n\n", var_name);
+
+  return(1);
+}
+
+int check_params()
+     /* check to see for each program option that the parameters
+	are what you expect, i.e. set to 0 things that shouldn't
+	matter for program_version==1, nemo.
+	*/
+{
+  char *str;
+  int temp_int;
+  float temp_float;
+  
+  str = (char *)our_alloc(1000 * sizeof(char));
+  
+  /* PRODUCT CONSTRAINTS */
+  /* product length constraints, and distance from end constraints */
+  if (prm.prod_len_low > prm.prod_len_high  && prm.prod_len_high != 0)  {
+    sprintf(str,"Product minimum length must be less\n  than product maximum length\n");
+    sprintf(str+strlen(str),"The minimum length now\n  equals the maximum and vice versa\n");
+    popupMessage(str);
+    temp_int = prm.prod_len_low;
+    prm.prod_len_low=prm.prod_len_high;
+    prm.prod_len_high = temp_int;
+  }
+  if (program_option != 3) {
+    /* product gc content*/
+    if (prm.prod_gc_low > prm.prod_gc_high && prm.prod_gc_high != 0 )  {
+      sprintf(str,"Product minimum gc content must be less\n  than product maximum gc content\n");
+      sprintf(str+strlen(str),"The minimum gc content now\n  equals the maximum and vice versa\n");
+      popupMessage(str);
+      temp_float = prm.prod_gc_low;
+      prm.prod_gc_low=prm.prod_gc_high;
+      prm.prod_gc_high = temp_float;
+    }
+    /* product upper and lower bound on melting temperature, degrees C */
+    if (prm.prod_tm_low > prm.prod_tm_high && prm.prod_tm_high != 0 )  {
+      sprintf(str,"Product minimum Tm must be less than\n  product maximum Tm\n");
+      sprintf(str+strlen(str),"The minimum Tm now equals\n  the maximum and vice versa\n");
+      popupMessage(str);
+      temp_float = prm.prod_tm_low;
+      prm.prod_tm_low=prm.prod_tm_high;
+      prm.prod_tm_high = temp_float;
+    }
+  }
+  
+  /* PRIMER CONSTRAINTS */
+  /* minimum and maximum primer length */
+  if (prm.min_prim_len < 2) {
+    if (prm.min_prim_len==0) sprintf(str,"CONSTRAINT ERROR:\nYou must choose a minimum primer length.\n0 is not acceptable\n");
+    else {
+      sprintf(str,"CONSTRAINT ERROR:\nYour minimum primer length should be 2 or greater\nIt has been reset to 2.\n");
+      sprintf(str+strlen(str),"Please reexamine your constraint settings\nand run the program again.\n");
+    }
+    popupMessage(str);
+    our_free(str);
+    return(0);
+  }
+  if (prm.min_prim_len > 1000) {
+    sprintf(str,"CONSTRAINT ERROR:\nYour minimum primer length should be\n  less than 1000\n");
+    sprintf(str+strlen(str),"min_prim_len has been reset\n");
+    sprintf(str+strlen(str),"Please reexamine your constraint settings\n  and run the program again.\n");
+    prm.min_prim_len=1000;
+    popupMessage(str);
+    our_free(str);
+    return(0);
+  }
+  if (prm.max_prim_len > 1000 || prm.max_prim_len == 0) {
+    if (prm.max_prim_len==0)
+      sprintf(str,"CONSTRAINT ERROR:\nYou must choose a maximum primer length;\n 0 is not acceptable\n");
+    else
+      sprintf(str,"CONSTRAINT ERROR:\nYour maximum primer length should be less than 1000\n");
+    sprintf(str+strlen(str),"max_prim_len has been reset to min_prim_len+4\n");
+    sprintf(str+strlen(str),"Please reexamine your constraint settings\n  and run the program again.\n");
+    prm.max_prim_len=prm.min_prim_len+4;
+    popupMessage(str);
+    our_free(str);
+    return(0);
+  }
+  if (prm.min_prim_len > prm.max_prim_len) {
+    sprintf(str,"CONSTRAINT ERROR:\nYour minimum primer length was set to greater than\n  your maximum primer length\n");
+    sprintf(str+strlen(str),"Your maximum primer length has\n  been reset to min_prim_len + 4\n");
+    sprintf(str+strlen(str),"Please reexamine your constraint\n  settings and run the program again.\n");
+    popupMessage(str);
+    prm.max_prim_len=prm.min_prim_len + 4;
+    our_free(str);
+    return(0);
+  }
+
+  if (prm.min_prim_len==prm.max_prim_len) 
+    prm.max_prim_len+=1;
+
+
+
+/* end nucleotides*/
+  if (prm.end_nucs[0]=='0' || (!strcmp(prm.end_nucs,""))) {
+    sprintf(str,"You must choose a primer-ending nucleotide.\n  S has been selected for you.\n");
+    popupMessage(str);
+    sprintf(prm.end_nucs,"S");
+  }
+
+  
+  /* primer gc content*/
+  if (prm.prim_gc_low > prm.prim_gc_high && prm.prim_gc_high != 0 )  {
+    sprintf(str,"Primer minimum gc content must be less\n  than primer maximum gc content\n");
+    sprintf(str+strlen(str),"The minimum gc content now\n  equals the maximum and vice versa\n");
+    popupMessage(str);
+    temp_float = prm.prim_gc_low;
+    prm.prim_gc_low=prm.prim_gc_high;
+    prm.prim_gc_high = temp_float;
+  }
+  
+  /* primer upper and lower bound on melting temperature, degrees C */
+  if (prm.prim_tm_low > prm.prim_tm_high && prm.prim_tm_high != 0 )  {
+    sprintf(str,"Primer minimum Tm must be less\n  than primer maximum Tm\n");
+    sprintf(str+strlen(str),"The minimum Tm now\n  equals the maximum and vice versa\n");
+    popupMessage(str);
+    temp_float = prm.prim_tm_low;
+    prm.prim_tm_low=prm.prim_tm_high;
+    prm.prim_tm_high = temp_float;
+  }
+  
+  
+  /* ANNEALING CONSTRAINTS, I will not put constraints on those */
+  /*homology cutoff scores primer-self*/
+  /*prm.self3_hmlg_cut;*/
+  /*prm.selfI_hmlg_cut;*/
+  /*prm.pp3_hmlg_cut,prm.ppI_hmlg_cut;*/ /*
+    homology cutoff scores primer-primer (not used in nemo)*/
+  /*prm.primprodI_hmlg_cut, prm.primprod3_hmlg_cut;*/  /*cutoff scores for primer
+    product annealing scores*/
+  /*prm.primotherI_hmlg_cut,prm.primother3_hmlg_cut;*/ /*homology cutoff scores for 
+    the primer versus the
+    other sequence file */
+  
+  /* the other sequence file contains vector, cosmid, repeated sequences, etc */
+  /*prm.delta_tm_cut;*/ /* cutoff for difference in tm between primer1 and tm of primer2*/
+  
+  if (program_version==1) { /*nemo*/
+    prm.wt_prod_gc=0;
+    prm.wt_prod_tm=0;
+    prm.wt_prim_a_len=0;
+    prm.wt_prim_a_gc=0;
+    prm.wt_prim_a_tm=0;
+    prm.wt_pp3_hmlg_cut=0;
+    prm.wt_ppI_hmlg_cut=0;
+    prm.wt_primprodI_hmlg_cut=0;
+    prm.wt_primprod3_hmlg_cut=0;
+    prm.delta_tm_cut=0;
+  }
+  
+  
+  /* scoring weights, used for ranking of primers chosen */
+  /*prm.wt_prod_len;
+    prm.wt_prod_gc;
+    prm.wt_prod_tm;
+    prm.wt_prim_s_len;
+    prm.wt_prim_a_len;
+    prm.wt_prim_s_gc;
+    prm.wt_prim_a_gc;
+    prm.wt_prim_s_tm;
+    prm.wt_prim_a_tm;
+    prm.wt_self3_hmlg_cut,prm.wt_selfI_hmlg_cut;
+    prm.wt_pp3_hmlg_cut,prm.wt_ppI_hmlg_cut;
+    prm.wt_primprodI_hmlg_cut,prm.wt_primprod3_hmlg_cut;
+    prm.wt_primotherI_hmlg_cut,prm.wt_primother3_hmlg_cut;
+    prm.wt_delta_tm_cut;*/
+  /*prm.AT_score;*/ /* used in the calculation of annealing score, each time
+    an A-T align, the score would be prm.AT_score */
+  
+  /*prm.CG_score;*/ 
+  /* used in the calculation of annealing score, each time
+     an C-G align, the score would be prm.CG_score */
+  
+  
+  if (strlen(prm.wt_ambig)>5) sprintf(prm.wt_ambig,"%5s",prm.wt_ambig);
+  
+  our_free(str);
+  return(1);
+}
+
+
+void write_params_file(fn)
+char *fn;
+{ FILE *gp;
+  int max_param_str_len=5000;
+  char *pstr; /* holds parameters string*/
+  char *astr; /* holds any error message*/
+
+
+  pstr = (char *)our_alloc(max_param_str_len * sizeof(char));
+  astr = (char *)our_alloc(200 *sizeof(char));
+
+
+    if ((gp=fopen(fn,"r"))!=NULL) { /* if it exists */
+      /* wipe out the file */
+      fclose(gp);
+      if ((gp=fopen(fn,"w"))==NULL) {
+        sprintf(astr,"ERROR: file %s cannot be written to.",fn);
+	message(astr);
+        fclose(gp);  
+        our_free(astr);
+        our_free(pstr);
+        return;
+      }
+    }
+
+    fclose(gp);
+    pstr[0] = '\0';
+    print_params(pstr);
+    if (strlen(pstr)>max_param_str_len) {
+        message("Parameter file larger than max size allowable");
+        popUpErrorMessage();
+      }
+    text_to_output(pstr,0,strlen(pstr),1,fn);
+
+  our_free(pstr);
+  our_free(astr);
+  return;
+}
+
+  
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/bap/osp-bits/paramIO.h b/src/bap/osp-bits/paramIO.h
new file mode 100644
index 0000000..b2a8f87
--- /dev/null
+++ b/src/bap/osp-bits/paramIO.h
@@ -0,0 +1,50 @@
+#ifndef _paramIO_h
+#define _paramIO_h
+
+/* 
+  Program Name: paramIO
+  File name: paramIO.c
+  Purpose: input and output of constraint information, reads constraints
+  from files or from the keyboard, prints constraints out to file or
+  screen, etc.
+  Last Update: Fri Mar 23 1991 
+
+  Copyright 1991: LaDeana Hillier and Philip Green
+  Change Log:
+*/
+
+/*  Description: This function gets new parameters when the
+           user changes constraints using the changeParamsPopup */
+   extern void GetParams();
+
+/* saves the values that Get Params has gotten from the constraints
+information to a file */
+   extern void SaveParams();
+
+/* reads in a constraints file */
+   extern int ReadDef();
+
+/* checks that all the parameters are within a reasonable range */
+   extern int check_params();
+
+/* reads the constraint file name off the of changeParamspopup*/
+   extern void inputParams();
+
+/* prints all constraints to an input string*/
+   extern void print_params();
+
+/*this fcn is used to interpret strings in .def file or from keyboard*/
+/*     this function returns a 0 if it is the last parameter
+        and a 1 if there must be more; it thinks its the last
+         parameter if the var_name is '*' */
+   extern int read_params();
+
+/*write the parameters out to a file*/
+extern void write_params_file();
+
+/* popup window allowing you to change parameters */
+extern void osp_change_parameters();
+
+#endif /* _paramIO_h */
+
+
diff --git a/src/bap/osp-bits/paramIOX.c b/src/bap/osp-bits/paramIOX.c
new file mode 100644
index 0000000..3666d51
--- /dev/null
+++ b/src/bap/osp-bits/paramIOX.c
@@ -0,0 +1,1782 @@
+/* 
+  Program Name: paramIOX
+  File name: paramIOX.c
+  Purpose: X windows portion of reading in and output constraints
+  Last Update: Aug 6 1991
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Change Log:
+*/
+
+
+/* ---- Includes ---- */
+#include "defn.h"  /* macros and stdio */
+#include "our_allo.h"
+#include "Xmess.h"
+#include "Xincludes.h"  /* global X variables and X libs etc */
+#include "struct.h" /* global structure defns */
+#include "paramIO.h" /* IMPORT ReadDef,write_params_file */
+
+
+/* --------External variables--------*/
+
+extern Prm prm;
+extern int program_option;
+extern int program_version;
+static Prm *orig_params;
+
+#ifdef XVERSION
+extern Widget resultsWid;
+extern int weights;
+extern char *def_fn; /* default file name */
+extern char *seq_filename;
+#endif
+
+extern void updateOtherFnWid();
+void set_text_def();
+
+#ifdef SUBVERSION
+int weights=0; /* he may want to ask me to change the weights, then what? */
+char def_fn[MAX_NAME_SIZE];
+static char seq_filename[MAX_NAME_SIZE];
+#endif
+
+
+void create_osp_change_parameters_popup();
+void check_param_validity();
+
+/* Widgets global to paramIO.c */
+Widget option1weight1Wid=NULL,option1weight0Wid=NULL,option3weight0Wid=NULL, option3weight1Wid=NULL;
+
+
+
+/* ---- Exports ---- */
+
+static void
+updateOtherSeqFn(w,client_data, call_data)
+/*callback when the user asks to input parameters from
+  file, this updates the other sequence filename input
+  box on the main form */
+Widget w;
+XtPointer client_data;
+XtPointer call_data;
+{
+  /*this function is in osp.c and just writes
+    the other fn to the main form widget*/
+#ifdef XVERSION
+  updateOtherFnWid();
+#endif
+  return;
+}
+
+
+
+static void changeWtAmbig(widget,client_data,call_data)
+Widget widget;
+XtPointer client_data;
+XtPointer call_data;
+{
+   Arg arg[10];
+   int n=0;
+   Widget WtAmbigWid = (Widget) client_data;
+   
+   if (!strcmp("avg",prm.wt_ambig)) {
+     sprintf(prm.wt_ambig,"full");
+     XtSetArg(arg[n], XtNlabel, "full"); 
+   }
+   else {
+     sprintf(prm.wt_ambig,"avg");
+     XtSetArg(arg[n], XtNlabel, "avg "); 
+   }
+        n=1;
+	XtSetValues(WtAmbigWid,arg,n);
+ }
+
+
+
+void osp_change_parameters(button,params,weight_or_param)
+Widget button;
+Prm *params;
+int weight_or_param; /* a 0 if you want me to popup the change
+			parameters window, a 1 if you want the window
+			to change weights instead. */
+
+{ int temp_weights;
+  int temp_option;
+
+    weights=weight_or_param;
+    orig_params = params;
+    prm=*params;
+
+/* if they have not been popped up before, go ahead and create
+   all the parameter and weight popups */
+
+  if (option1weight1Wid==NULL && option1weight0Wid==NULL && option3weight0Wid==NULL && option3weight1Wid==NULL)
+ {
+
+/* temporarily story these so you can change them */
+    temp_weights=weights;
+    temp_option=program_option;
+
+/* this creates all the necessary popup prompts */
+    program_option=3;
+    weights=0;
+    create_osp_change_parameters_popup(button,prm);
+    weights=1;
+    create_osp_change_parameters_popup(button,prm);
+  if (program_version!=1) {
+    program_option=1;
+    weights=0;
+    create_osp_change_parameters_popup(button,prm);
+    weights=1;
+    create_osp_change_parameters_popup(button,prm);
+  }
+
+/* reset them back to their original values */
+    weights=temp_weights;
+    program_option=temp_option;
+  }
+
+
+
+
+  if (program_option==3 || program_option==4) {
+    if (weights)
+      set_text_def(option3weight1Wid);
+    else
+      set_text_def(option3weight0Wid);
+  }
+  else {
+    if (weights)
+      set_text_def(option1weight1Wid);
+    else
+      set_text_def(option1weight0Wid);
+  }
+
+
+  if (program_option==3 || program_option==4) {
+    if (weights)
+      XtPopup(option3weight1Wid, XtGrabExclusive);
+    else
+      XtPopup(option3weight0Wid, XtGrabExclusive);
+  }
+  else {
+    if (weights)
+      XtPopup(option1weight1Wid, XtGrabExclusive);
+    else
+      XtPopup(option1weight0Wid, XtGrabExclusive);
+  }
+
+    return;
+ }
+
+void 
+popDownPopupPrompt(widget,client_data,call_data)
+Widget widget;
+XtPointer client_data;
+XtPointer call_data;
+{
+
+  if (program_option==3 || program_option==4) {
+    if (weights)
+      XtPopdown(option3weight1Wid);
+    else
+      XtPopdown(option3weight0Wid);
+  }
+  else {
+    if (weights)
+      XtPopdown(option1weight1Wid);
+    else
+      XtPopdown(option1weight0Wid);
+  }
+}
+
+void 
+destroyParamPopupPrompts()
+{
+/*
+  XtDestroyWidget(option3weight1Wid);
+  XtDestroyWidget(option3weight0Wid);
+  if (program_version!=3)  {
+    XtDestroyWidget(option1weight1Wid);
+    XtDestroyWidget(option1weight0Wid);
+  }
+*/
+}
+
+
+void create_osp_change_parameters_popup(button,prm)
+Widget button;
+Prm prm;
+{ 
+    Arg		args[5],arg[5];
+    Widget	popup;
+    Widget lprod_leng_min,lprod_leng_max,lprod_tm_min,lprod_tm_max,lprim_tm_max,lprim_tm_min;
+    Widget lprod_gc_min,lprod_gc_max;
+    Widget lprim_len_min,lprim_len_max,lprim_gc_min,lprim_gc_max;
+    Widget lprimprodWid,lprimprimWid,lprimotherWid;
+    Widget lprim_nucs,ldelta_tm;
+    Widget lprimselfWid,lann_hmlg;
+    Widget lATWeightWid,lCGWeightWid;
+    Widget lWtAmbigWid;
+    Widget cancelWid, confirmWid, defFnWid, saveDefWid,inputDefWid;
+    Widget ann_labl;
+    Position	x, y;
+    Dimension	width, height;
+    Cardinal	n;
+    String str;
+Widget paramForm,prod_form,prod_labl,prod_leng_min,prod_leng_max;
+Widget prod_tm_min,prod_tm_max,prim_form,prim_labl,prim_len_min;
+Widget prim_len_max,prim_gc_min,prim_gc_max,ann_hmlg_i,ann_hmlg_t;
+Widget prim_nucs,prim_tm_min,prim_tm_max,delta_tm;
+Widget ppann_hmlg_i,ppann_hmlg_t;
+Widget prod_gc_min,prod_gc_max;
+Widget primprodIWid,primprod3Wid,primotherIWid,primother3Wid;
+Widget ATWeightWid,CGWeightWid,WtAmbigWid;
+Widget ann_form;
+
+
+
+
+
+#ifdef XVERSION
+    button=resultsWid; /* this way the param popup will be most likely
+			  to fit on the screen */
+#endif
+
+    /*
+     * This will position the upper left hand corner of the popup at the
+     * center of the widget which invoked this callback, which will also
+     * become the parent of the popup. 
+     */
+
+    n = 0;
+    XtSetArg(args[0], XtNwidth, &width); n++;
+    XtSetArg(args[1], XtNheight, &height); n++;
+    XtGetValues(button, args, n);
+    XtTranslateCoords(button, (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    x -= 417; /*move it over by half of the param popups width, so its centered
+		over the results wid window */
+    y -= 240;
+    
+    n = 0;
+    XtSetArg(args[n], XtNx, x);				n++;
+    XtSetArg(args[n], XtNy, y);				n++;
+
+    if (program_option==3) {
+      if (weights) {
+	option3weight1Wid = XtCreatePopupShell("params", 
+		      transientShellWidgetClass, button, args, n);   
+	popup=option3weight1Wid;
+      }
+      else {
+	option3weight0Wid = XtCreatePopupShell("params", 
+			  transientShellWidgetClass, button, args, n);   
+	popup=option3weight0Wid;
+      } 
+    }
+    else {
+      if (weights) {
+	option1weight1Wid = XtCreatePopupShell("params", 
+			  transientShellWidgetClass, button, args, n);   
+	popup=option1weight1Wid;
+      }
+      else {
+	option1weight0Wid = XtCreatePopupShell("params", 
+			  transientShellWidgetClass, button, args, n);   
+	popup=option1weight0Wid;
+      }
+    }
+
+    paramForm = XtCreateManagedWidget("paramForm", formWidgetClass, popup ,NULL, 0);
+
+	prod_form = XtVaCreateManagedWidget("prod_form", formWidgetClass, 
+					    paramForm, 
+					NULL);
+if (program_option != 3) {
+	prod_labl = XtVaCreateManagedWidget("prod_labl",labelWidgetClass, 
+					    prod_form,
+					     XtNwidth, 280,
+					     XtNlabel, "PRODUCT CONSTRAINTS",
+					     NULL);
+
+
+	lprod_leng_min = XtVaCreateManagedWidget("lprod_leng_min",
+						 labelWidgetClass, prod_form,
+					     XtNwidth, 200,
+					     /* XtNjustify, XtJustifyLeft,*/
+					     XtNfromVert, prod_labl,
+					    NULL);
+
+      } 
+else {
+	prod_labl = XtVaCreateManagedWidget("prod_labl",labelWidgetClass, 
+					    prod_form,
+					     XtNwidth, 280,
+					     XtNlabel, "PRIMER LOCATION CONSTRAINTS",
+					     NULL);
+	lprod_leng_min = XtVaCreateManagedWidget("lprod_leng_min",
+						 labelWidgetClass, prod_form,
+					     XtNwidth, 200,
+					     /* XtNjustify, XtJustifyLeft,*/
+					     XtNfromVert, prod_labl,
+					    NULL);
+
+
+
+      }
+
+    n = 0;
+    if (weights) 
+      XtSetArg(args[n], XtNlabel,     "Length:                    "); 
+    else {
+      if (program_option==3 || program_option==4)
+	XtSetArg(args[n], XtNlabel,   "Length from end(bp):  Min ");
+      else
+	XtSetArg(args[n], XtNlabel,   "Product length(bp):   Min ");
+    }
+    n++;
+    XtSetValues(lprod_leng_min, args, n);
+
+	prod_leng_min = XtVaCreateManagedWidget("prod_leng_min",
+						asciiTextWidgetClass, prod_form,
+					     XtNwidth, 70,
+					     XtNeditType,  XawtextEdit,
+					     XtNfromVert, prod_labl,
+					     XtNfromHoriz, lprod_leng_min,
+					    NULL);
+
+
+    if (!weights) {
+	lprod_leng_max = XtVaCreateManagedWidget("lprod_leng_max",
+						 labelWidgetClass, prod_form,
+					     XtNwidth, 200,
+	       	                             XtNlabel,  "                    Max",
+					     XtNfromVert, prod_leng_min,
+					     /* XtNjustify, XtJustifyLeft,*/
+						NULL);
+	prod_leng_max = XtVaCreateManagedWidget("prod_leng_max",
+						asciiTextWidgetClass, prod_form,
+					     XtNwidth, 70,
+					     XtNeditType,  XawtextEdit,
+					     XtNfromVert, prod_leng_min,
+					     XtNfromHoriz, lprod_leng_max,
+						NULL);
+      }
+
+
+if (program_option != 3) {
+	lprod_gc_min = XtVaCreateManagedWidget("lprod_gc_min",
+                                               labelWidgetClass, prod_form,
+					       XtNwidth, 200,
+					       /* XtNjustify, XtJustifyLeft,*/
+					    NULL);
+
+    n = 0;
+    if (weights) {
+      XtSetArg(args[n], XtNfromVert, prod_leng_min); n++;
+      XtSetArg(args[n], XtNlabel, "G+C Content:               "); n++;
+    }
+    else {
+      XtSetArg(args[n], XtNfromVert, prod_leng_max); n++;
+      XtSetArg(args[n], XtNlabel, "G+C Content(%):       Min "); n++;	
+    }
+    XtSetValues(lprod_gc_min, args, n);
+
+
+	prod_gc_min = XtVaCreateManagedWidget("prod_gc_min",
+                                               asciiTextWidgetClass, prod_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromHoriz, lprod_gc_min,
+					    NULL);
+
+    n = 0;
+    if (weights)
+      XtSetArg(args[n], XtNfromVert, prod_leng_min); 
+    else
+      XtSetArg(args[n], XtNfromVert, prod_leng_max);
+    n++;
+    XtSetValues(prod_gc_min, args, n);
+
+
+    if (!weights) {
+	lprod_gc_max = XtVaCreateManagedWidget("lprod_gc_max",
+					       labelWidgetClass, prod_form,
+					       XtNwidth, 200,
+					       XtNlabel,"                    Max",
+					       XtNfromVert, prod_gc_min,
+					       /* XtNjustify, XtJustifyLeft,*/
+					    NULL);
+	prod_gc_max = XtVaCreateManagedWidget("prod_gc_max",
+					      asciiTextWidgetClass, prod_form,
+					       XtNwidth,  70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, prod_gc_min,
+					       XtNfromHoriz, lprod_gc_max,
+					    NULL);
+
+      }
+
+
+
+	lprod_tm_min = XtVaCreateManagedWidget("lprod_tm_min",
+	                                     labelWidgetClass, prod_form,
+					     XtNwidth, 200,
+					     XtNeditType,  XawtextEdit,
+					     /* XtNjustify, XtJustifyLeft,*/
+					    NULL);
+
+    n = 0;
+    if (weights) {
+      XtSetArg(args[n], XtNfromVert, lprod_gc_min); n++;
+      XtSetArg(args[n], XtNlabel, "Tm:                        "); n++;
+    }
+    else {
+      XtSetArg(args[n], XtNfromVert, lprod_gc_max); n++;
+      XtSetArg(args[n], XtNlabel, "Tm(Celsius):          Min "); n++;	
+    }
+    XtSetValues(lprod_tm_min, args, n);
+
+
+	prod_tm_min = XtVaCreateManagedWidget("prod_tm_min",
+	                                     asciiTextWidgetClass, prod_form,
+					     XtNwidth, 70,
+					     XtNeditType,  XawtextEdit,
+					     XtNfromHoriz, lprod_tm_min,
+					    NULL);
+
+    n = 0;
+    if (weights) 
+      XtSetArg(args[n], XtNfromVert, prod_gc_min); 
+    else 
+      XtSetArg(args[n], XtNfromVert, prod_gc_max);
+    n++;
+    XtSetValues(prod_tm_min, args, n);
+
+
+      if (!weights) {
+	lprod_tm_max = XtVaCreateManagedWidget("lprod_tm_max",
+	                                     labelWidgetClass, prod_form,
+					     XtNwidth, 200,
+	 				     XtNlabel,"                    Max",
+					     XtNfromVert, prod_tm_min,
+					     /* XtNjustify, XtJustifyLeft,*/
+					    NULL);
+	prod_tm_max = XtVaCreateManagedWidget("prod_tm_max",
+                                             asciiTextWidgetClass, prod_form,
+					     XtNwidth, 70,
+					     XtNeditType,  XawtextEdit,
+					     XtNfromVert, prod_tm_min,
+					     XtNfromHoriz, lprod_tm_max,
+					    NULL);
+      }
+      
+     }  /* if program_option != 3 */
+
+
+
+
+
+	prim_form = XtVaCreateManagedWidget("prim_form", 
+                                            formWidgetClass, paramForm, 
+					    XtNfromHoriz, prod_form,
+					    NULL);
+        prim_labl = XtVaCreateManagedWidget("prim_labl",
+                                            labelWidgetClass, prim_form,
+					    XtNlabel, "PRIMER CONSTRAINTS",
+					    XtNwidth, 305,
+					    NULL);
+					    
+	lprim_len_min = XtVaCreateManagedWidget("lprim_len_min",
+                                               labelWidgetClass, prim_form,
+					       XtNwidth, 225,
+						/* XtNjustify, XtJustifyLeft,*/
+					       XtNfromVert, prim_labl,
+					    NULL);
+
+
+    n = 0;
+    if (weights) 
+      XtSetArg(args[n], XtNlabel, "Primer 1 Length:               "); 
+    else 
+      XtSetArg(args[n], XtNlabel, "Length(bases):        Min      ");
+    n++;
+    XtSetValues(lprim_len_min, args, n);
+
+
+	prim_len_min = XtVaCreateManagedWidget("prim_len_min",
+                                               asciiTextWidgetClass, prim_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, prim_labl,
+					       XtNfromHoriz, lprim_len_min,
+					    NULL);
+	lprim_len_max = XtVaCreateManagedWidget("lprim_len_max",
+                                                labelWidgetClass, prim_form,
+					       XtNwidth, 225,
+					     /* XtNjustify, XtJustifyLeft,*/
+					       XtNfromVert, prim_len_min,
+					    NULL);
+
+
+	prim_len_max = XtVaCreateManagedWidget("prim_len_max",
+                                               asciiTextWidgetClass, prim_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromHoriz, lprim_len_max,
+					       XtNfromVert, prim_len_min,
+					    NULL);
+
+    n = 0;
+    if (weights) {
+      if (program_option==3) {
+	XtSetArg(args[n], XtNeditType, XawtextRead); n++;
+	XtSetArg(args[n], XtNborderWidth, 0); n++;
+	XtSetValues(prim_len_max, args, n);
+	n=0;
+	XtSetArg(args[n], XtNlabel, "---------"); n++;
+      }
+      else
+	XtSetArg(args[n], XtNlabel, "Primer 2 Length:               "); n++;
+      XtSetValues(lprim_len_max, args, n);
+    }
+    else {
+      XtSetArg(args[n], XtNlabel,"                      Max      "); n++;
+      XtSetValues(lprim_len_max, args, n);
+    }
+
+	lprim_gc_min = XtVaCreateManagedWidget("lprim_gc_min",
+                                               labelWidgetClass, prim_form,
+					       XtNwidth, 225,
+					     /* XtNjustify, XtJustifyLeft,*/
+					       XtNfromVert, prim_len_max,
+					    NULL);
+    n = 0;
+    if (weights) 
+      XtSetArg(args[n], XtNlabel, "Primer 1 G+C:                  "); 
+    else 
+      XtSetArg(args[n], XtNlabel, "G+C Content(%):       Min      ");
+    n++;
+    XtSetValues(lprim_gc_min, args, n);
+
+	prim_gc_min = XtVaCreateManagedWidget("prim_gc_min",
+                                               asciiTextWidgetClass, prim_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, prim_len_max,
+					       XtNfromHoriz, lprim_gc_min,
+					    NULL);
+	lprim_gc_max = XtVaCreateManagedWidget("lprim_gc_max",labelWidgetClass, 
+					       prim_form,
+					       XtNwidth, 225,
+					     /* XtNjustify, XtJustifyLeft,*/
+					       XtNfromVert, prim_gc_min,
+					    NULL);
+
+	prim_gc_max = XtVaCreateManagedWidget("prim_gc_max",
+					      asciiTextWidgetClass, prim_form,
+					       XtNwidth,  70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, prim_gc_min,
+					       XtNfromHoriz, lprim_gc_max,
+					    NULL);
+    n = 0;
+    if (weights) {
+      if (program_option==3) {
+	XtSetArg(args[n], XtNeditType, XawtextRead); n++;
+	XtSetArg(args[n], XtNborderWidth, 0); n++;
+	XtSetValues(prim_gc_max, args, n);
+	n=0;
+	XtSetArg(args[n], XtNlabel, "---------"); n++;
+      }
+      else
+	XtSetArg(args[n], XtNlabel, "Primer 2 G+C:                  "); n++;
+      XtSetValues(lprim_gc_max, args, n);
+    }
+    else {
+      XtSetArg(args[n], XtNlabel, "                      Max      ");    n++;
+      XtSetValues(lprim_gc_max, args, n);
+    }
+
+
+        lprim_tm_min = XtVaCreateManagedWidget("lprim_tm_min",
+					       labelWidgetClass,prim_form,
+                                             XtNwidth, 225,
+					     /* XtNjustify, XtJustifyLeft,*/
+                                             XtNeditType,  XawtextEdit,
+                                             XtNfromVert, lprim_gc_max,
+                                            NULL);
+
+    n = 0;
+    if (weights) 
+      XtSetArg(args[n], XtNlabel, "Primer 1 Tm:                   "); 
+    else 
+      XtSetArg(args[n], XtNlabel,"Tm(Celsius):          Min      ");
+    n++;
+    XtSetValues(lprim_tm_min, args, n);
+
+
+       prim_tm_min = XtVaCreateManagedWidget("prim_tm_min",asciiTextWidgetClass, prim_form,
+                                             XtNwidth, 70,
+                                             XtNeditType,  XawtextEdit,
+                                             XtNfromHoriz, lprim_tm_min,
+                                             XtNfromVert, prim_gc_max,
+                                            NULL);
+
+
+        lprim_tm_max = XtVaCreateManagedWidget("lprim_tm_max",labelWidgetClass,prim_form,
+                                             XtNwidth, 225,
+					     /* XtNjustify, XtJustifyLeft,*/
+                                             XtNfromVert, lprim_tm_min,
+                                            NULL);
+        prim_tm_max = XtVaCreateManagedWidget("prim_tm_max",asciiTextWidgetClass, prim_form,
+                                             XtNwidth, 70,
+                                             XtNeditType,  XawtextEdit,
+                                             XtNfromVert, prim_tm_min,
+                                             XtNfromHoriz, lprim_tm_max,
+                                            NULL);
+    n = 0;
+    if (weights) {
+      if (program_option==3) {
+	XtSetArg(args[n], XtNeditType, XawtextRead); n++;
+	XtSetArg(args[n], XtNborderWidth, 0); n++;
+	XtSetValues(prim_tm_max, args, n); 
+        n=0;
+	XtSetArg(args[n], XtNlabel, "---------"); n++;
+      }
+      else 
+	XtSetArg(args[n], XtNlabel, "Primer 2 Tm:                   "); n++;
+      XtSetValues(lprim_tm_max, args, n); 
+    }
+    else {
+      XtSetArg(args[n], XtNlabel,"                      Max      ");  n++;
+      XtSetValues(lprim_tm_max, args, n); 
+    }
+
+
+if (program_option != 3) {
+
+       ldelta_tm = XtVaCreateManagedWidget("ldelta_tm",labelWidgetClass, prim_form,
+					       XtNwidth, 225,
+					       XtNjustify, XtJustifyLeft,
+					       XtNlabel, "Difference in Tm cutoff:   ",
+					       XtNfromVert, prim_tm_max,
+					    NULL);
+       delta_tm = XtVaCreateManagedWidget("delta_tm",asciiTextWidgetClass, prim_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, prim_tm_max,
+					       XtNfromHoriz, ldelta_tm,	
+					    NULL);
+     }
+
+
+if (!weights) {
+
+       lprim_nucs = XtVaCreateManagedWidget("lprim_nucs",
+					    labelWidgetClass, prim_form,
+					       XtNwidth, 225,
+					       XtNjustify, XtJustifyLeft,
+                                   XtNlabel, "3' Nucleotides (S=G/C,W=A/T):",
+					       XtNfromVert, primotherIWid,
+					    NULL);
+
+    n = 0;
+    if (program_option==3) 
+      XtSetArg(args[n], XtNfromVert, lprim_tm_max); 
+    else 
+      XtSetArg(args[n], XtNfromVert, ldelta_tm); 
+    n++;
+    XtSetValues(lprim_nucs, args, n);
+
+       prim_nucs = XtVaCreateManagedWidget("prim_nucs",asciiTextWidgetClass, prim_form,
+					       XtNwidth, 70,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromHoriz, lprim_nucs,
+					    NULL);
+    n = 0;
+    if (program_option==3) 
+      XtSetArg(args[n], XtNfromVert, prim_tm_max); 
+    else 
+      XtSetArg(args[n], XtNfromVert, delta_tm); 
+    n++;
+    XtSetValues(prim_nucs, args, n);
+     }
+
+    /* form widget to hold all annealing constraints information */
+	ann_form = XtVaCreateManagedWidget("ann_form", 
+                                            formWidgetClass, paramForm, 
+					    XtNfromHoriz, prim_form,
+					    NULL);
+        ann_labl = XtVaCreateManagedWidget("ann_labl",
+                                            labelWidgetClass, ann_form,
+					    XtNlabel, "ANNEALING CONSTRAINTS",
+					    XtNwidth, 250,
+					    NULL);
+					    
+
+
+    /* big annealing homology title */
+	lann_hmlg = XtVaCreateManagedWidget("lann_hmlg",
+					    labelWidgetClass, ann_form,
+					       XtNwidth, 250,
+					       XtNlabel, "Annealing Scores:  Internal  3'",
+					       XtNfromVert, prim_labl,
+					     NULL);
+    /* label for primer-self annealing*/
+    lprimselfWid  = XtVaCreateManagedWidget("lprimself",
+					       labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "Primer-self",
+					       XtNfromVert, lann_hmlg,
+					     NULL);
+    if (program_option!=3) {
+      /*These are only needed when looking for primer pairs*/
+      /* primer-primer annealing label*/
+	lprimprimWid = XtVaCreateManagedWidget("lprimprim",
+						labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "Primer-primer",
+					       XtNfromVert, lprimselfWid,
+					     NULL);
+
+      /* primer-product annealing label*/	
+	lprimprodWid = XtVaCreateManagedWidget("lprimprod",
+					      labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "Primer-product",
+					       XtNfromVert, lprimprimWid,
+					    NULL);
+      } /*if program_option != 3*/
+
+    /* label for primer-other sequence annealing, other-sequence includes
+     rest of sequence, other than primer*/
+	lprimotherWid = XtVaCreateManagedWidget("lprimother",
+						labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "Primer-other",
+					     NULL);
+
+    if (program_option==3) {
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, lprimselfWid);  n++;
+        XtSetValues(lprimotherWid, args, n);
+      } 
+    else {
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, lprimprodWid);  n++;
+        XtSetValues(lprimotherWid, args, n);
+      }
+
+/* primer-self annealing, internal */
+	ann_hmlg_i = XtVaCreateManagedWidget("ann_hmlg_i",
+                                         asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, lann_hmlg,
+                                               XtNfromHoriz, lprimselfWid,
+					     NULL);
+
+if (program_option != 3) {
+  /*primer-primer internal annealing homology*/
+	ppann_hmlg_i = XtVaCreateManagedWidget("ppann_hmlg_i",
+                                         asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, ann_hmlg_i,
+                                               XtNfromHoriz, lprimprimWid,
+					     NULL);
+/* primer-primer three prime annealing homology*/
+	ppann_hmlg_t = XtVaCreateManagedWidget("ppann_hmlg_t",
+                                   asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, ann_hmlg_i,
+                                               XtNfromHoriz, ppann_hmlg_i,
+					    NULL);
+      }
+
+/* primer-self annealing, three-prime*/
+	ann_hmlg_t = XtVaCreateManagedWidget("ann_hmlg_t",
+                               asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, lann_hmlg,
+					     XtNfromHoriz, ann_hmlg_i,
+					    NULL);
+
+
+
+    if (program_option != 3) {
+      /*only look at these product annealings if looking for primer-pairs*/
+    /* primer vs product, internal annealing */
+	primprodIWid = XtVaCreateManagedWidget("primprodI",
+                                              asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, ppann_hmlg_i,
+					       XtNfromHoriz, lprimprodWid,
+					    NULL);
+
+
+    /* primer-product, three-prime annealing */
+	primprod3Wid = XtVaCreateManagedWidget("primprod3",
+                                              asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, ppann_hmlg_t,
+					       XtNfromHoriz, primprodIWid,
+					    NULL);
+      }
+    /* primer vs other sequence, internal annealing */
+	primotherIWid = XtVaCreateManagedWidget("primotherI",
+						asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromHoriz, lprimotherWid,
+					    NULL);
+    /* primer-other sequence, three-prime annealing */
+	primother3Wid = XtVaCreateManagedWidget("primother3",
+						asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromHoriz, primotherIWid,
+					    NULL);
+
+
+    if (program_option==3) {
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, ann_hmlg_i);  n++;
+        XtSetValues(primotherIWid, args, n);
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, ann_hmlg_t);  n++;
+        XtSetValues(primother3Wid, args, n);
+      }
+    else {
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, primprodIWid);  n++;
+        XtSetValues(primotherIWid, args, n);
+        n = 0;
+        XtSetArg(args[n], XtNfromVert, primprod3Wid);  n++;
+        XtSetValues(primother3Wid, args, n);
+      }
+
+
+if (!weights) {
+       lATWeightWid =  XtVaCreateManagedWidget("lATWeight",
+					       labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "A<->T Score",
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, lprimotherWid,
+					    NULL);
+
+       ATWeightWid =  XtVaCreateManagedWidget("ATWeight",
+					       asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, primother3Wid,
+					       XtNfromHoriz, lATWeightWid,
+					    NULL);
+
+       lCGWeightWid =  XtVaCreateManagedWidget("lCGWeight",
+					       labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "C<->G Score",
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, lATWeightWid,
+					    NULL);
+
+       CGWeightWid =  XtVaCreateManagedWidget("CGWeight",
+					       asciiTextWidgetClass, ann_form,
+					       XtNwidth, 45,
+					       XtNeditType,  XawtextEdit,
+					       XtNfromVert, ATWeightWid,
+					       XtNfromHoriz, lCGWeightWid,
+					    NULL);
+
+       lWtAmbigWid =  XtVaCreateManagedWidget("lWtAmbig",
+					       labelWidgetClass, ann_form,
+					       XtNwidth, 150,
+					       XtNlabel, "Wt Ambig",
+					       XtNfromVert, lCGWeightWid,
+					    NULL);
+
+       WtAmbigWid =  XtVaCreateManagedWidget("WtAmbig",
+					       commandWidgetClass, ann_form,
+					       XtNwidth, 49,
+					       XtNfromVert, CGWeightWid,
+					       XtNfromHoriz, lWtAmbigWid,
+					    NULL);
+     }
+
+
+        confirmWid = XtVaCreateManagedWidget("confirm", commandWidgetClass,paramForm,
+					 XtNfromVert, prim_form,
+                                         XtNlabel, "CONFIRM",
+                                      NULL);
+        cancelWid = XtVaCreateManagedWidget("cancel", 
+					 commandWidgetClass,paramForm,
+					 XtNfromHoriz, confirmWid,
+					 XtNfromVert, prim_form,
+                                      NULL);
+
+        inputDefWid = XtVaCreateManagedWidget("inputDef", commandWidgetClass,paramForm,
+					 XtNfromVert, prim_form,
+					 XtNfromHoriz, cancelWid,
+					 XtNborderColor, XtDefaultBackground,
+                                         XtNlabel, "INPUT CONSTRAINTS FROM FILE:",
+                                      NULL);
+
+        saveDefWid = XtVaCreateManagedWidget("saveDef", commandWidgetClass,paramForm,
+					 XtNfromVert, prim_form,
+					 XtNfromHoriz, inputDefWid,
+					 XtNborderColor, XtDefaultBackground,
+                                         XtNlabel, "SAVE CONSTRAINTS TO FILE:",
+                                      NULL);
+
+        defFnWid = XtVaCreateManagedWidget("defFn", 
+					 asciiTextWidgetClass,paramForm,
+				         XtNeditType, XawtextEdit,
+					 XtNfromVert, prim_form,
+					 XtNfromHoriz, saveDefWid,
+					 XtNwidth, 250,
+					 XtNborderColor, XtDefaultBackground,
+                                      NULL);
+
+    str = (char *)our_alloc(MAX_NAME_SIZE*sizeof(char));
+    if (strlen(seq_filename)>(MAX_NAME_SIZE-4)) {
+      message("Sequence Filename longer than MAX_NAME_SIZE\n");
+      popUpErrorMessage();
+    }
+    sprintf(str,"%s.def",seq_filename);
+    n=0;
+    XtSetArg(arg[n], XtNstring, str); n++;
+    XtSetValues(defFnWid,arg,n);
+
+
+
+/* weights lets me know whether the widget that was clicked on 
+   was for changing the actual parameters or changing the weights that
+   are attached to the parameters, weights=0 then parameters if weights=1
+   then its the weights I'm changing */
+
+   our_free(str);
+
+
+    XtAddCallback(saveDefWid, XtNcallback, GetParams, (XtPointer)popup);
+    XtAddCallback(saveDefWid, XtNcallback, check_param_validity, NULL);
+    XtAddCallback(saveDefWid, XtNcallback, SaveParams, (XtPointer)defFnWid);
+    XtAddCallback(saveDefWid, XtNcallback, popDownPopupPrompt,(XtPointer)paramForm);
+    XtAddCallback(inputDefWid, XtNcallback, inputParams, (XtPointer)defFnWid);
+    XtAddCallback(inputDefWid, XtNcallback, updateOtherSeqFn, NULL);
+    XtAddCallback(inputDefWid, XtNcallback, popDownPopupPrompt,(XtPointer)paramForm);
+    XtAddCallback(inputDefWid, XtNcallback, check_param_validity, NULL);
+    if (!weights)
+      XtAddCallback(WtAmbigWid, XtNcallback, changeWtAmbig, (XtPointer)WtAmbigWid);
+
+    XtAddCallback(confirmWid, XtNcallback, GetParams, (XtPointer)popup);
+    XtAddCallback(confirmWid, XtNcallback, check_param_validity, NULL);
+    XtAddCallback(confirmWid, XtNcallback, popDownPopupPrompt, (XtPointer)paramForm);
+    XtAddCallback(cancelWid, XtNcallback, popDownPopupPrompt, (XtPointer)paramForm);
+
+    
+}
+
+
+
+/* writes the values to the changeParamsPopup*/
+void set_text_def(w)
+Widget w;
+/* if weights == 1, then we are changing the parameter weights,
+   otherwise we are changing the parameters themselves */
+
+/* initializes and/or refreshes the change parameter input windows*/
+{ Arg arg[15];
+  int n;
+  char *temp;
+  float gc;
+  Widget paramForm,primForm,prodForm,annForm;
+  Widget prod_leng_max,prod_leng_min;
+  Widget prod_gc_min,prod_gc_max;
+  Widget prod_tm_min,prod_tm_max;
+  Widget prim_len_min,prim_len_max;
+  Widget prim_gc_min,prim_gc_max;
+  Widget prim_tm_min,prim_tm_max;
+  Widget delta_tm;
+  Widget prim_nucs;
+  Widget ann_hmlg_i,ann_hmlg_t; /* primer-self */
+  Widget ppann_hmlg_i, ppann_hmlg_t; /* primer-primer */
+  Widget primotherIWid,primother3Wid;
+  Widget primprodIWid, primprod3Wid;
+  Widget ATWeightWid,CGWeightWid,WtAmbigWid; 
+
+
+  /* get the names of all of the children of the current
+     parameter popup */
+
+  paramForm = XtNameToWidget(w,"paramForm");
+  primForm = XtNameToWidget(paramForm,"prim_form");
+  prodForm = XtNameToWidget(paramForm,"prod_form");
+  annForm = XtNameToWidget(paramForm,"ann_form");
+  prod_leng_min = XtNameToWidget(prodForm,"prod_leng_min");
+  prod_leng_max = XtNameToWidget(prodForm,"prod_leng_max");
+  prod_gc_min = XtNameToWidget(prodForm,"prod_gc_min");
+  prod_gc_max = XtNameToWidget(prodForm,"prod_gc_max");
+  prod_tm_min = XtNameToWidget(prodForm,"prod_tm_min");
+  prod_tm_max = XtNameToWidget(prodForm,"prod_tm_max");
+  prim_len_min = XtNameToWidget(primForm,"prim_len_min");
+  prim_len_max = XtNameToWidget(primForm,"prim_len_max");
+  prim_gc_min = XtNameToWidget(primForm,"prim_gc_min");
+  prim_gc_max = XtNameToWidget(primForm,"prim_gc_max");
+  prim_tm_min = XtNameToWidget(primForm,"prim_tm_min");
+  prim_tm_max = XtNameToWidget(primForm,"prim_tm_max");
+  delta_tm = XtNameToWidget(primForm,"delta_tm");
+  prim_nucs = XtNameToWidget(primForm,"prim_nucs");
+  ann_hmlg_i = XtNameToWidget(annForm,"ann_hmlg_i");
+  ann_hmlg_t = XtNameToWidget(annForm,"ann_hmlg_t");
+  ppann_hmlg_i = XtNameToWidget(annForm,"ppann_hmlg_i");
+  ppann_hmlg_t = XtNameToWidget(annForm,"ppann_hmlg_t");
+  primotherIWid = XtNameToWidget(annForm,"primotherI");
+  primother3Wid = XtNameToWidget(annForm,"primother3");
+  primprodIWid = XtNameToWidget(annForm,"primprodI");
+  primprod3Wid = XtNameToWidget(annForm,"primprod3");
+  ATWeightWid =  XtNameToWidget(annForm,"ATWeight");
+  CGWeightWid =  XtNameToWidget(annForm,"CGWeight");
+  WtAmbigWid =  XtNameToWidget(annForm,"WtAmbig");
+  
+
+
+
+  temp = (char *)our_alloc(MAX_WORD_SIZE * sizeof(char));
+
+  
+
+  if (weights)
+  sprintf(temp,"%5.2f",prm.wt_prod_len);
+  else
+  sprintf(temp,"%d",prm.prod_len_low);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_leng_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_leng_min,arg,n);
+
+
+  if (!weights) {
+  sprintf(temp,"%d",prm.prod_len_high);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_leng_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_leng_max,arg,n);
+  }
+
+  
+  if (program_option != 3) {
+
+  if (weights) {
+    gc = prm.wt_prod_gc;
+    sprintf(temp,"%5.2f",gc);
+  }
+  else {
+    if (prm.prod_gc_low < 1.0) gc = prm.prod_gc_low * 100.0;
+    else gc = prm.prod_gc_low;
+    sprintf(temp,"%2.0f",gc);
+  }
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_gc_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_gc_min,arg,n);
+
+
+  if (!weights)
+  {
+    if (prm.prod_gc_high < 1.0) gc = prm.prod_gc_high * 100.0;
+    else gc = prm.prod_gc_high;
+  sprintf(temp,"%2.0f",gc);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_gc_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_gc_max,arg,n);
+  }
+
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_prod_tm);
+  else
+    sprintf(temp,"%4.1f",prm.prod_tm_low);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_tm_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_tm_min,arg,n);
+
+  if (!weights) {
+  sprintf(temp,"%4.1f",prm.prod_tm_high);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prod_tm_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prod_tm_max,arg,n);
+ }
+} 
+  else {
+    if (weights) {
+      prm.wt_prod_tm=0.0;
+    }
+    else  {
+      prm.prod_tm_low=0.0;prm.prod_tm_high=0.0;
+    }
+  }
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_prim_s_len);
+  else
+    sprintf(temp,"%d",prm.min_prim_len);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_len_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_len_min,arg,n);
+
+  if (weights){
+    if (program_option==3)
+      sprintf(temp," ");
+    else
+      sprintf(temp,"%5.2f",prm.wt_prim_a_len);
+  }
+  else
+    sprintf(temp,"%d",prm.max_prim_len);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_len_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_len_max,arg,n);
+
+  if (weights) {
+    gc = prm.wt_prim_s_gc;
+    sprintf(temp,"%5.2f",gc);
+  }
+  else {
+    if (prm.prim_gc_low < 1.0) gc = prm.prim_gc_low * 100.0;
+    else gc = prm.prim_gc_low;
+    sprintf(temp,"%2.0f",gc);
+  }
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_gc_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_gc_min,arg,n);
+
+
+  if (weights) {
+    if (program_option==3) 
+      sprintf(temp," ");
+    else {
+      gc=prm.wt_prim_a_gc;
+      sprintf(temp,"%5.2f",gc);
+    }
+  }
+  else {
+    if (prm.prim_gc_high < 1.0) gc = prm.prim_gc_high * 100.0;
+    else gc = prm.prim_gc_high;
+    sprintf(temp,"%2.0f",gc);
+  }
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_gc_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_gc_max,arg,n);
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_selfI_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.selfI_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(ann_hmlg_i,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(ann_hmlg_i,arg,n);
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_self3_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.self3_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(ann_hmlg_t,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(ann_hmlg_t,arg,n);
+
+
+if (program_option != 3) {
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_ppI_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.ppI_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(ppann_hmlg_i,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(ppann_hmlg_i,arg,n);
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_pp3_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.pp3_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(ppann_hmlg_t,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(ppann_hmlg_t,arg,n);
+
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_primprodI_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.primprodI_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(primprodIWid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(primprodIWid,arg,n);
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_primprod3_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.primprod3_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(primprod3Wid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(primprod3Wid,arg,n);
+}
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_primotherI_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.primotherI_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(primotherIWid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(primotherIWid,arg,n);
+
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_primother3_hmlg_cut);
+  else
+    sprintf(temp,"%4.1f",prm.primother3_hmlg_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(primother3Wid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(primother3Wid,arg,n);
+
+
+  if (!weights) {
+    sprintf(temp,"%s",prm.end_nucs);
+    n=0;
+    XtSetArg(arg[n],XtNstring,temp); n++;
+    XtSetValues(prim_nucs,arg,n);
+    n=0;
+    XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+    XtSetValues(prim_nucs,arg,n);
+  }
+
+  
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_prim_s_tm);
+  else
+    sprintf(temp,"%4.1f",prm.prim_tm_low);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_tm_min,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_tm_min,arg,n);
+
+  if (weights) {
+    if (program_option==3) 
+      sprintf(temp," ");
+   else
+    sprintf(temp,"%5.2f",prm.wt_prim_a_tm);
+  }
+  else
+    sprintf(temp,"%4.1f",prm.prim_tm_high);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(prim_tm_max,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(prim_tm_max,arg,n);
+
+
+if (program_option != 3) {
+  if (weights)
+    sprintf(temp,"%5.2f",prm.wt_delta_tm_cut);
+  else
+    sprintf(temp,"%4.1f",prm.delta_tm_cut);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(delta_tm,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(delta_tm,arg,n);
+}
+else {
+  if (weights)
+    prm.wt_delta_tm_cut = 0;  
+  else
+    prm.delta_tm_cut = 0;  
+}
+
+
+  if (!weights) {
+  sprintf(temp,"%4.1f",prm.AT_score);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(ATWeightWid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(ATWeightWid,arg,n);
+  }
+
+  if (!weights) {
+  sprintf(temp,"%4.1f",prm.CG_score);
+  n=0;
+  XtSetArg(arg[n],XtNstring,temp); n++;
+  XtSetValues(CGWeightWid,arg,n);
+  n=0;
+  XtSetArg(arg[n],XtNinsertPosition, strlen(temp)); n++;
+  XtSetValues(CGWeightWid,arg,n);
+  }
+
+  if (!weights) {
+  sprintf(temp,"%s",prm.wt_ambig);
+  n=0;
+  XtSetArg(arg[n],XtNlabel,temp); n++;
+  XtSetValues(WtAmbigWid,arg,n);
+  }
+
+  /* if a parameter equals zero, it 
+     is interpreted as not being considered as a parameter -i.e.
+     they do not care about a particular parameter */
+
+  our_free(temp);
+  return;
+ }                                 
+
+
+void GetParams(w,client_data,call_data)
+Widget w;
+XtPointer client_data;
+XtPointer call_data;
+    {
+      Widget paramFormParent = (Widget) client_data;
+      Arg args[15];
+      String str;
+      int k,kk;
+      int temp_int;
+      float temp_float;
+      int stp,endp;
+/*      int o_stp1,o_endp1,o_stp2,o_endp2;
+      int endseq; /* last sequence indices */
+
+  Widget paramForm,primForm,prodForm,annForm;
+  Widget prod_leng_max,prod_leng_min;
+  Widget prod_gc_min,prod_gc_max;
+  Widget prod_tm_min,prod_tm_max;
+  Widget prim_len_min,prim_len_max;
+  Widget prim_gc_min,prim_gc_max;
+  Widget prim_tm_min,prim_tm_max;
+  Widget delta_tm;
+  Widget prim_nucs;
+  Widget ann_hmlg_i,ann_hmlg_t; /* primer-self */
+  Widget ppann_hmlg_i, ppann_hmlg_t; /* primer-primer */
+  Widget primotherIWid,primother3Wid;
+  Widget primprodIWid,primprod3Wid;
+  Widget ATWeightWid,CGWeightWid,WtAmbigWid; 
+
+
+  /* get the names of all of the children of the current
+     parameter popup */
+
+  paramForm = XtNameToWidget(paramFormParent,"paramForm");
+  primForm = XtNameToWidget(paramForm,"prim_form");
+  prodForm = XtNameToWidget(paramForm,"prod_form");
+  annForm = XtNameToWidget(paramForm,"ann_form");
+  prod_leng_min = XtNameToWidget(prodForm,"prod_leng_min");
+  prod_leng_max = XtNameToWidget(prodForm,"prod_leng_max");
+  prod_gc_min = XtNameToWidget(prodForm,"prod_gc_min");
+  prod_gc_max = XtNameToWidget(prodForm,"prod_gc_max");
+  prod_tm_min = XtNameToWidget(prodForm,"prod_tm_min");
+  prod_tm_max = XtNameToWidget(prodForm,"prod_tm_max");
+  prim_len_min = XtNameToWidget(primForm,"prim_len_min");
+  prim_len_max = XtNameToWidget(primForm,"prim_len_max");
+  prim_gc_min = XtNameToWidget(primForm,"prim_gc_min");
+  prim_gc_max = XtNameToWidget(primForm,"prim_gc_max");
+  prim_tm_min = XtNameToWidget(primForm,"prim_tm_min");
+  prim_tm_max = XtNameToWidget(primForm,"prim_tm_max");
+  delta_tm = XtNameToWidget(primForm,"delta_tm");
+  prim_nucs = XtNameToWidget(primForm,"prim_nucs");
+  ann_hmlg_i = XtNameToWidget(annForm,"ann_hmlg_i");
+  ann_hmlg_t = XtNameToWidget(annForm,"ann_hmlg_t");
+  ppann_hmlg_i = XtNameToWidget(annForm,"ppann_hmlg_i");
+  ppann_hmlg_t = XtNameToWidget(annForm,"ppann_hmlg_t");
+  primotherIWid = XtNameToWidget(annForm,"primotherI");
+  primother3Wid = XtNameToWidget(annForm,"primother3");
+  primprodIWid = XtNameToWidget(annForm,"primprodI");
+  primprod3Wid = XtNameToWidget(annForm,"primprod3");
+  ATWeightWid =  XtNameToWidget(annForm,"ATWeight");
+  CGWeightWid =  XtNameToWidget(annForm,"CGWeight");
+  WtAmbigWid =  XtNameToWidget(annForm,"WtAmbig");
+  
+
+
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prod_leng_min, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_prod_len);
+      else
+	sscanf(str,"%d",&prm.prod_len_low);
+
+      if (!weights) {
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prod_leng_max, args, ONE);
+      sscanf(str,"%d",&prm.prod_len_high);
+      }
+
+      if (program_option!=3) {
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prod_gc_min, args, ONE);
+	sscanf(str,"%f",&temp_float);      
+      if (weights)
+	prm.wt_prod_gc = temp_float;
+      else
+	prm.prod_gc_low = temp_float/100.0;
+
+      if (!weights) {
+       XtSetArg(args[0],XtNstring, &str);
+       XtGetValues(prod_gc_max, args, ONE);
+       sscanf(str,"%f",&temp_float);  
+       prm.prod_gc_high = temp_float/100.0;
+      }
+
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(prod_tm_min, args, ONE);
+	sscanf(str,"%f",&temp_float);
+	if (weights)
+	  prm.wt_prod_tm = temp_float;
+	else
+	  prm.prod_tm_low = temp_float;
+	
+      if (!weights) {
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(prod_tm_max, args, ONE);
+	sscanf(str,"%f",&temp_float);
+        prm.prod_tm_high = temp_float; 
+      }
+    } /* if (program_option!=3) */
+      else {
+	prm.prod_gc_low=0;
+	prm.wt_prod_gc=0;
+	prm.prod_gc_high=0;
+	prm.prod_tm_low=0;
+	prm.wt_prod_tm=0;
+	prm.prod_tm_high=0;
+      }
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_len_min, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_prim_s_len);
+      else
+	sscanf(str,"%d",&prm.min_prim_len);
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_len_max, args, ONE);
+      if (weights) {
+	if (program_option!=3)
+	  sscanf(str,"%f",&prm.wt_prim_a_len);
+      }
+      else
+	sscanf(str,"%d",&prm.max_prim_len);
+
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_gc_min, args, ONE);
+      sscanf(str,"%f",&temp_float);      
+      if (weights)
+	prm.wt_prim_s_gc = temp_float;
+      else
+	prm.prim_gc_low = temp_float/100.0;
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_gc_max, args, ONE);
+      sscanf(str,"%f",&temp_float);      
+      if (weights) {
+	if (program_option!=3)
+	  prm.wt_prim_a_gc = temp_float;
+      }
+      else 
+	prm.prim_gc_high = temp_float/100.0;
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(ann_hmlg_i, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_selfI_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.selfI_hmlg_cut);
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(ann_hmlg_t, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_self3_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.self3_hmlg_cut);
+
+      if (program_option==3) {
+	if (!weights) {
+	  prm.pp3_hmlg_cut = 0;
+	  prm.ppI_hmlg_cut = 0;
+	}
+	else {
+	  prm.wt_pp3_hmlg_cut = 0;
+	  prm.wt_ppI_hmlg_cut = 0;
+	}
+      }
+      else {
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(ppann_hmlg_i, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_ppI_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.ppI_hmlg_cut);
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(ppann_hmlg_t, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_pp3_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.pp3_hmlg_cut);
+
+     
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(primprodIWid, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_primprodI_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.primprodI_hmlg_cut);
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(primprod3Wid, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_primprod3_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.primprod3_hmlg_cut);
+    }
+
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(primotherIWid, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_primotherI_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.primotherI_hmlg_cut);
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(primother3Wid, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_primother3_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.primother3_hmlg_cut);
+
+
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(primotherIWid, args, ONE);
+      if (weights)
+	sscanf(str,"%f",&prm.wt_primotherI_hmlg_cut);
+      else
+	sscanf(str,"%f",&prm.primotherI_hmlg_cut);
+
+
+      if (!weights) {
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(ATWeightWid, args, ONE);
+	sscanf(str,"%f",&prm.AT_score);
+
+
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(CGWeightWid, args, ONE);
+	sscanf(str,"%f",&prm.CG_score);
+
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(prim_nucs, args, ONE);
+	if (strlen(str) > MAX_NAME_SIZE)
+	  popupMessage("Length of PRIM_NUCS is larger than size, MAX_NAME_SIZE, allowable\nChoice of PRIM_NUCS ignored.");
+	else
+	  {
+	    
+	    kk = 0;
+	    for (k = 0; str[k]; k++) {
+	      prm.end_nucs[kk]=str[k];
+	      if (!isupper(prm.end_nucs[kk])) toupper(prm.end_nucs[kk]);
+	      kk++;
+	    }
+	    prm.end_nucs[kk]='\0';
+	  }
+      }
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_tm_min, args, ONE);
+      sscanf(str,"%f",&temp_float);
+      if (weights)
+	prm.wt_prim_s_tm = temp_float;
+      else
+	prm.prim_tm_low = temp_float;
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(prim_tm_max, args, ONE);
+      sscanf(str,"%f",&temp_float);
+      if (weights) {
+	if (program_option!=3)
+	  prm.wt_prim_a_tm = temp_float; 
+      }
+      else
+	prm.prim_tm_high = temp_float; 
+
+      if (program_option != 3) {
+	XtSetArg(args[0],XtNstring, &str);
+	XtGetValues(delta_tm, args, ONE);
+	sscanf(str,"%f",&temp_float);      
+	if (weights) 
+	prm.wt_delta_tm_cut = temp_float;
+	else
+	prm.delta_tm_cut = temp_float;
+      }
+      else {
+	prm.delta_tm_cut=0; /* 0 meaning not applicable */
+	prm.wt_delta_tm_cut=0;
+      }
+
+
+       return;
+
+    }
+
+/* saves the values that Get Params has gotten from the constraints
+   information changes in changeParamsPopup to a file */
+void SaveParams(w,client_data,call_data)
+     Widget w;
+     XtPointer client_data;
+     XtPointer call_data;
+     
+{ String str;
+  char *fn;
+  Widget apane1 = (Widget) client_data;
+  Arg args[10];
+  int n;
+  FILE *gp;
+  
+
+  fn = (char *)our_alloc(100 *sizeof(char));
+  
+  XtSetArg(args[0],XtNstring, &str);
+  XtGetValues(apane1, args, ONE);
+  strcpy(fn,str);
+  if (strlen(fn) > 100) {
+    popupMessage("Filename too long\nFile could not be saved\n");
+  }
+  else {
+    write_params_file(fn);
+  }
+  
+  our_free(fn);
+  return;
+}
+
+void check_param_validity(w,client_data,call_data)
+     Widget w;
+     XtPointer client_data;
+     XtPointer call_data;
+{
+  check_params();
+  *orig_params = prm;
+
+  return;
+}
+
+
+/* reads the constraint file name from the changeParamsPopup */
+void inputParams(w,  client_data,call_data)
+Widget w;
+XtPointer client_data;
+XtPointer call_data;
+ 
+    { Widget defFnWid = (Widget) client_data;
+      Arg args[10];
+      String str;
+
+
+      XtSetArg(args[0],XtNstring, &str);
+      XtGetValues(defFnWid, args, ONE);
+
+      if (strlen(str) > MAX_NAME_SIZE) 
+	printf("ERROR: Parameter filename longer than MAX_NAME_SIZE, %d, allowed.\nPlease use a shorter filename.\nParameter file was not read in.",MAX_NAME_SIZE);
+      else {
+	strcpy(def_fn,str);
+	ReadDef(def_fn,1); /* the 1 indicates that X windows 
+			    is up and running when you make this
+			    call to the read def function*/
+      }
+
+/*      our_free(str);*/
+      return;
+    }
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/bap/osp-bits/struct.h b/src/bap/osp-bits/struct.h
new file mode 100644
index 0000000..99b8828
--- /dev/null
+++ b/src/bap/osp-bits/struct.h
@@ -0,0 +1,176 @@
+#ifndef _struct_h
+#define _struct_h
+
+/* File: struct.h
+   Purpose: holds structure definitions for osp
+  Copyright 1991: LaDeana Hillier and Philip Green
+*/
+
+
+/* PARAM INITIALIZATIONS and EXTERNAL VARIABLES */
+typedef struct {
+  int num;
+  int stp;
+  int endp;
+  float score; /* primer-self internal homology */
+  float end_score; /* primer-self 3prime homology */
+  float gc;
+  float tm;
+  float primotherI; /* primer-other internal homology */
+  float primother3; /* primer-other 3prime homology */
+} p_bank;
+
+typedef struct {
+  float sum_score; /* determines the ranking of the product, a total
+		    of all the parameters using the prm.wt_* */
+  int s_primer; /* sense_primer array indices, i.e. which element of
+		   the sense array is used for this product */
+  int a_primer; /* anti_sense primer indices */
+  float score;    /* primer-primer internal homology */
+  float end_score;     /* primer-primer three prime homology, the greater of 
+		   primer1 end vs. primer2 or primer2 end  vs primer1 */
+  float gc; 
+  float tm;
+  float p1hom; /* primer1-product internal homology */
+  float p2hom; /* primer2-product internal homology */
+  float p1end; /* primer1-product 3 prime homology */
+  float p2end; /* primer2-product 3 prime homology */
+} score_bank;
+
+
+typedef struct {
+
+/* PRODUCT CONSTRAINTS */
+int prod_len_low; /* old len_low; lower bound on product length */
+                 /* For sequencing primers only, this is distance
+		    from the end of the sequence including the primer
+		    itself */
+int prod_len_high;  /* OLD len_high upper bound on product length */
+float prod_gc_low; /* gc content of the product */
+float prod_gc_high;
+float prod_tm_low;    /*OLD
+			tm_low, lower bound on melting temperature, degrees C */
+float prod_tm_high; /* OLD
+		       tm_high,upper bound on melting temperature in degrees C */
+
+/* PRIMER CONSTRAINTS */
+int min_prim_len;
+int max_prim_len; /* minimum and maximum primer length */
+float prim_gc_low; /* lower bound on percentage G+C content, primer */
+float prim_gc_high; /* upper bound on percentage G+C content, primer*/
+float prim_tm_low;/* lower bound on primer melting temperature in degrees C */
+float prim_tm_high; /* upper bound on primer melting temperature in degrees C */
+
+/* ANNEALING CONSTRAINTS */
+float self3_hmlg_cut;
+float selfI_hmlg_cut; /* OLD hmlg_cut and end_hmgl_cut homology 
+				 cutoff scores primer-self*/
+float pp3_hmlg_cut,ppI_hmlg_cut; /* OLD pp_hmlg_cut and pp_end_hmlg_cut
+			      homology cutoff scores primer-primer (not used in nemo)*/
+float primprodI_hmlg_cut, primprod3_hmlg_cut;  /*cutoff scores for primer
+					      product annealing scores*/
+float primotherI_hmlg_cut,primother3_hmlg_cut; /*homology cutoff scores for 
+					   the primer versus the
+					   other sequence file */
+
+/* the other sequence file contains vector, cosmid, repeated sequences, etc */
+float delta_tm_cut; /* cutoff for difference in tm between primer1 and tm of primer2*/
+
+char *end_nucs;  
+
+/* scoring weights, used for ranking of primers chosen */
+float wt_prod_len;
+float wt_prod_gc;
+float wt_prod_tm;
+float wt_prim_s_len;
+float wt_prim_a_len;
+float wt_prim_s_gc;
+float wt_prim_a_gc;
+float wt_prim_s_tm;
+float wt_prim_a_tm;
+float wt_self3_hmlg_cut,wt_selfI_hmlg_cut;
+float wt_pp3_hmlg_cut,wt_ppI_hmlg_cut;
+float wt_primprodI_hmlg_cut,wt_primprod3_hmlg_cut;
+float wt_primotherI_hmlg_cut,wt_primother3_hmlg_cut;
+float wt_delta_tm_cut;
+float AT_score; /* used in the calculation of annealing score, each time
+                   an A-T align, the score would be prm.AT_score */
+
+float CG_score;/* used in the calculation of annealing score, each time
+                   an C-G align, the score would be prm.CG_score */
+char *wt_ambig; /*used in the calculation of annealing score; if wt_ambig==avg,
+		then use an average calculation of annealing scores when
+		looking at ambiguous nucleotides;i.e. if looking at 
+		W on one strand and an A on the other strand, if
+		wt_ambig=1 then score for W versus A is 1/2* AT_score.
+		W=A or T
+	      nuc1nuc2 score
+		A - A   0
+		T - A   AT_score
+		 ______________
+		 conservative score = 1/2 *AT_score
+
+		if wt_ambig=full, give it the maximum possible score for
+		that nucleotide; i.e. for W vs A, the maximum possible
+		score is AT_score if the W actually is a T.*/
+
+
+}  Prm;
+/* note that double and single stranded only affect the calculation
+   of primer-product and primer-other score calculations */
+
+/* structure for holding other sequences to search against */
+typedef struct {
+  char *seq; /* sequence string */
+  int len; /* length of the sequence string */
+  int strand;  /* if 1 then I assume it's the top strand, if 0 I assume
+		  it is the bottom strand.  When the user selects double
+		  stranded to search against, I go through the database
+		  and make extra entries for all the complementary strands.
+		  If the user selects single stranded then I just search
+		  the strand that they are on*/
+  }otherSeqBank;
+
+
+typedef struct _osp_results {
+	int start_position; /* stp */
+	int end_position; /* endp */
+	float score; /* total score used for ranking */
+	float psI_score; /* primer-self internal homology */
+	float ps3_score; /* primer-self 3prime homology */
+	float gc; /* gc content */
+	float tm; /* melting temperature of the primer */
+	float poI_score; /* primer-other internal homology */
+	float po3_score; /* primer-other 3prime homology */
+	int len; /* or you can calculate product length */
+} OSP_Results;
+
+
+
+/*
+** analyse is the oligo selection engine
+** Returns a list of possible oligos with scores etc
+*/
+/*extern int osp_analyse (
+	OSP_Results *OSP_RESULTS, 
+	char *sequence, 
+	Prm *params,    
+	char **screens  
+	);*/
+
+
+/*
+** pop up a window to allow user to change selection parameters
+*/
+/*extern osp_change_parameters (
+	Widget parentWidget,
+	Prm *params
+	);
+*/
+
+/*
+** As we experiment to tailor the oligo selection to our needs,
+** we'll need as much data on scoring and rejection as possible.
+*/
+
+#endif /* _struct_h */
diff --git a/src/bap/osp-bits/utils.c b/src/bap/osp-bits/utils.c
new file mode 100644
index 0000000..eae3db8
--- /dev/null
+++ b/src/bap/osp-bits/utils.c
@@ -0,0 +1,477 @@
+/* 
+  Program Name: utils
+  File name: utils.c
+  Purpose:  utilities, parsing lines, reading in files, sorting
+  Last Update: Apr 9, 1991
+  Copyright 1991: LaDeana Hillier and Philip Green
+
+  Change Log:
+*/
+
+
+/* ---- Includes ---- */
+#include <stdlib.h>
+#include "our_allo.h"
+#include "defn.h" /* macros */
+#include "Xmess.h" /*IMPORT: message */
+
+/* ---- Exports ---- */
+void text_to_output(vec,stp,endp,dvice,outfile)
+char *vec;
+int stp,endp;
+int dvice;
+char *outfile;
+/*text_to_output(vec,stp,endp,dvice,outfile)
+	input: char **vec,*outfile; int stp, endp, dvice;
+	this program output a specified portion of a genbank file
+(from vec[stp] to vec[endp]) to the screen (default), a file (dvice
+=1), or lpr (dvice =2).  Outfile is the input filename if you wish to
+output the information to a specified file. stp and endp are integers,
+not pointers. */
+
+
+/*char *vec,*outfile;
+int stp,endp,dvice; */
+
+/* this program puts a specified portion of a genbank file and outputs */
+ /* it to the screen (default), a file (dvice = 1), or the lpr */
+ /* (dvice = 2) */
+
+{
+  int i;
+  FILE *fopen(),*fp,*where;
+ 
+ 
+  if (dvice == 1) {
+    if ((fp = fopen(outfile,"a"))==NULL) {
+      printf ("\nERROR: can't open file %s\n",outfile);
+      return;
+    }
+    else where = fp;
+  }
+  else if (dvice == 2) {
+    if ((fp = fopen("junkfile.","a"))==NULL) {
+      printf ("\nERROR: can't open file junkfile. to output to the lpr\n");
+      return;
+    }
+    else where = fp;
+  }
+  else {
+    /*  default : */
+    where = stdout;
+  }
+  
+  for (i = stp; i < endp; i++)
+    putc(vec[i],where);
+  
+  if ((dvice == 1) || (dvice == 2)) fclose(fp);
+/*  if (dvice == 2) { */
+/*    fclose(fp); */
+/*    system("lpr junkfile."); */
+/*    system("rm junkfile.");   */
+/*  } */
+}
+
+
+
+int parse_chars(vec,intgrs,stp,endp,alphas)
+char *vec;
+char *alphas[];
+int *intgrs;
+int stp,endp;
+
+/*int parse_chars(vec,intgrs,stp,endp,alphas)
+	input: char *vec; int stp,endp;
+	output: int *intgrs; char **alphas;
+	parses vec[stp] to vec[endp] into strings !isspace held in the
+two dimensional character array alphas with intgrs holding the purely
+integer fields.  Returns the total number of fields parsed.
+*/
+
+
+{ char *atemp;
+  int i,j,k,icount,is_intgr,acount;
+
+  /* read the input field char by char until you come to a blank -- */
+  /* indicating a termination of that field.  Then test to see if that */
+  /* field is all digits.  If it is enter it into the integer array */
+  /* that will be passed back to the calling program, else continue */
+  /* reading the vec for the next string and blank. */
+  
+  atemp = (char *)our_alloc(MAX_WORD_SIZE * sizeof(char));
+  icount = 0;
+  acount = 0;
+
+  /* intialize alphas and intgrs arrays so old data doesn't stay around */
+  for (i = 0; i < MAX_WORDS; i++) alphas[i][0] = '\0';
+  for (i = 0; i < MAX_I;     i++) intgrs[i] = 0;
+
+  for (i=stp; i<endp;) {
+    k=0;
+    while((isspace(vec[i])) && (i<endp)) i++;
+    if (i==endp) return(0);
+    is_intgr = 1;
+    while((!isspace(vec[i])) && (i<endp)) 
+      if (!isdigit(atemp[k++]=vec[i++])) is_intgr = 0;
+    atemp[k] = '\0';
+
+    /* store it in the intrs array only if it is all intgrs */
+    if (is_intgr) {
+      sscanf(atemp,"%d",intgrs+icount);
+      icount++;
+      if (icount > MAX_I) 
+	printf("ERROR: Number of integers parsed was greater than MAX_I allowable\n");
+    }
+      if (k>MAX_WORD_SIZE) {
+	printf("ERROR: Word parsed was bigger than MAX_WORD_SIZE allowable\n");
+	exit(1);
+      }
+
+    /* store it in the alphas array no matter what */
+      for (j=0; atemp[j]; j++) alphas[acount][j] = atemp[j];
+      alphas[acount][j] = '\0';
+      acount++;
+      if (acount>MAX_WORDS) {
+	printf("ERROR: Number of words parsed greater than MAX_WORDS\n");
+	exit(1);
+      }
+  }
+
+  our_free(atemp);
+  return(acount);
+}
+
+
+
+
+
+int parse_intgrs (vec,intgrs,stp,endp)
+char *vec;
+int *intgrs;
+int stp,endp;
+
+/*int parse_intgrs(vec,intgrs,stp,endp)
+	input: char *vec; int stp,endp;
+	output: int *intgrs;
+	parses vec[stp] to vec[endp] into strings of integers --
+throws away char strings.  Saves the integers in the array intgrs.
+Returns the number of integer fields found. */
+
+/*char *vec;
+int *intgrs;
+int stp,endp;*/
+
+{ char *temp;
+  int i,k,icount,is_intgr;
+  int temp_size = 256;
+/* read the input field char by char until you come to a blank -- */
+ /* indicating a termination of that field.  Then test to see if that */
+ /* field is all digits.  If it is enter it into the integer array */
+ /* that will be passed back to the calling program, else continue */
+ /* reading the vec for the next string and blank. */
+
+temp = (char *)our_alloc(temp_size * sizeof(char));
+icount = 0;
+for (i=stp; i<endp;) {
+  k=0;
+  while((isspace(vec[i])) && (i<endp)) i++;
+  is_intgr = 1;
+  while((!isspace(vec[i])) && (i<endp)) 
+    if (!isdigit(temp[k++]=vec[i++])) is_intgr = 0;
+  if (k>temp_size) printf("ERROR: Integer parsed greater than maximum size integer allowed:\n  See  parse_intgrs subroutine\n");
+
+
+  if (is_intgr) {
+    temp[k] = '\0';
+    sscanf(temp,"%d",intgrs+icount);
+    icount++;
+  }
+}
+our_free(temp);
+return(icount-1);
+}
+
+
+
+
+
+
+
+
+
+
+int file_to_vec(f_name,vec,vec_st,max_n,str)
+char *f_name;
+char *vec; 
+int *vec_st;
+int max_n; 
+char *str;
+
+/* file_to_vec(f_name, vec, vec_st,max_n)
+	input: char *f_name; int max_n;
+	output: char *vec, int *vec_st;
+	reads the input file, f_name, to a vec array of chars of
+size vec_st with a maximum size of max_n 
+
+returns an error message in str, if there is one. Returns a 0 if there
+was some problem when reading in the file */
+
+{
+    FILE *fopen(), *fp;
+    int i;
+
+    if (NULL == (fp = fopen(f_name, "r"))) {
+	sprintf(str,"\nERROR: file %s does not exist.  See file_to_vec.c\n",f_name);
+	return(0);
+    }
+    for (i = 0; i < max_n; i++) 
+      if (EOF == (vec[i] = getc(fp))) {
+	vec[++i] = '\0';
+	*vec_st = i;
+	return(1);
+      }
+    sprintf (str,"vector size exceeded when reading file %s\n",f_name);
+    return(0);
+}	
+	
+
+int space_file_to_vec(f_name,vec,vec_st,max_n,str)
+char *f_name;
+char *vec;
+int *vec_st;
+int max_n;
+char *str;
+/* space_file_to_vec(f_name, vec, vec_st,max_n)
+	input: char *f_name; int max_n;
+	output: char *vec, int *vec_st;
+	reads the input file, f_name, to a vec array of chars of
+size vec_st with a maximum size of max_n , puts in a space after all
+new lines
+
+returns an error message in str, if there is one. Returns a 0 if there
+was some problem when reading in the file */
+
+{
+    FILE *fopen(), *fp;
+    int i;
+
+    if (NULL == (fp = fopen(f_name, "r"))) {
+	sprintf(str,"\nERROR: file %s does not exist.  See file_to_vec.c\n",f_name);
+	return(0);
+    }
+    for (i = 0; i < max_n; i++) {
+      vec[i]=getc(fp);
+      if (vec[i]==EOF) {
+	vec[++i] = '\0';
+	*vec_st = i;
+	return(1);
+      }
+      if (vec[i]=='\n') vec[++i]=' ';
+    }
+    sprintf (str,"vector size exceeded when reading file %s\n",f_name);
+    return(0);
+}	
+	
+#if defined(XVERSION) || defined(TEXTVERSION)
+/* variables relating to the opening of ted windows */
+extern int tedNedBases;
+extern int tedleftCutoff;
+extern int tedSeqLen; /* NedBases-rightCutoff-leftCutoff */
+extern char tedType[4]; /*ABI, PLN or ALF */
+extern char tedFileName[200]; /* usually just the sequence filename less the .seq
+ */
+#endif
+
+#ifdef SUBVERSION
+/* variables relating to the opening of ted windows */
+int tedNedBases;
+int tedleftCutoff;
+int tedSeqLen; /* NedBases-rightCutoff-leftCutoff */
+char tedType[4]; /*ABI, PLN or ALF */
+char tedFileName[200]; /* usually just the sequence filename less the .seq
+ */
+#endif
+
+
+int ted_nnl_file_to_vec(f_name,vec,max_n)
+char *f_name;
+char *vec;
+int max_n;
+/* int ted_nnl_file_to_vec(f_name, vec, max_n)
+	input: char *f_name; int max_n;
+	output: char *vec;
+	looks at the first char for a semicolon, if it is,
+	it knows the file has been input from ted, so it
+	gets the information about the sequence from the 
+	first line and gets the sequence from the rest 
+	
+	reads the input file, f_name, to a vec array of chars
+ with a maximum size of max_n; does not read in any of
+the end of line characters , returns the size of vec array.*/
+
+{
+    FILE *fopen(), *fp;
+    int i,j;
+    char line[500];
+    int lookBlank,lookBlank_i,newLine;
+    int MAXLINE=500;
+    int comment=0; /* keeps track of whether there was 
+		    a comment line or not */
+    int more_comments=0;
+
+
+    lookBlank=0;
+    newLine=1;
+    
+
+
+    if (NULL == (fp = fopen(f_name, "r"))) {
+      return(0);
+    }
+
+    /* go read the first line of the sequence to see if it is
+       a ted file or a comment line */
+    fgets(line, MAXLINE, fp); /* fgets only reads a line at a time*/
+
+    if (line[0]==';')  {
+    sscanf(line,";%6d%6d%6d%s%s\n",&tedNedBases,&tedleftCutoff,&tedSeqLen,tedType,tedFileName);
+    comment=1;
+  }
+    else { /* I check to make sure that tedType[0] is either A or P for ABI or PLN
+	      before I will actually try to run a ted window.  So if I blank out
+	      the value for tedType when the file wasn't ted file,  then the
+	      program will not be tempted to open a ted window on a non-ted
+	      file */
+      tedType[0]='\0';
+
+    /* also check to see if the first line is a comment, by checking
+     for greater than signs */
+      comment=0;
+      for (i=0; line[i]; i++) {
+
+	if (line[i]=='>') comment=1;
+	/* then this must be a comment line, just
+	   get the rest of the line and throw it away*/
+
+
+      }
+
+    }
+
+    
+    if (comment==1 || tedType[0]!='\0') more_comments=1;
+    /* if there were any comments go past any more lines that were comments */
+      /* fgets only reads a line at a time, returns a NULL when eof*/
+    while (fgets(line, MAXLINE, fp)!=NULL && more_comments==1)  {
+      for (i=0; line[i]; i++) {
+	if (line[i]=='>' || line[i]==';') { comment++; break;}
+	/* then this must be a comment line, just
+	   get the rest of the line and throw it away*/
+	more_comments=0;
+      }
+    }
+    
+
+    /* close the file because well start over reading it again after
+       we check the first line */
+    fclose(fp);
+
+
+    if (NULL == (fp = fopen(f_name, "r"))) {
+      return(0);
+    }
+
+    /* if it was a comment or ted file find the end of the first line*/
+    if (comment>0) {
+      /* then find the end of the first line, the comment */
+      for (j=0; j<comment; j++) 
+	if (fgets(line, MAXLINE, fp)==NULL) break;
+    }
+
+
+
+
+
+/* all the lookBlank and new line stuff is looking for any
+   groups of numbers starting a line if it finds after a 
+   newline a set of contiguous numbers then a blank before it finds an
+   alpha it assumes those were just nucleotide position
+   numbers*/
+
+    for (i = 0; i < max_n; i++) { 
+      vec[i] = getc(fp);
+      /* make sure you leave the check for vec[i] being EOF 
+	 as the first check*/
+      if (vec[i] == EOF) {
+	vec[i] = '\0';
+	return(i);
+      }
+
+      /*check if the sequence is numbered on its left end side*/
+      if (isdigit(vec[i]) && newLine==1) {
+	/* keep looking to see if you find a blank before any more alphas,
+	 which would indicate that there are numbers in the sequence */
+	lookBlank=1;
+	lookBlank_i = i;
+	newLine=0;
+      }
+
+      /* make sure that all '-' are 'N' and that
+	all characters are uppercase */
+      if (vec[i]=='-') vec[i]='N';
+      else if (isalpha(vec[i])) {
+	newLine=0;
+	lookBlank=0;
+	if (!isupper(vec[i])) vec[i]=toupper(vec[i]);
+      }
+      /*throw away any new lines*/
+      if (vec[i] == '\n') {
+	if (i>-1) i--;
+	lookBlank=0;
+        newLine=1;
+      }
+      /*throw away any blanks*/
+      if (vec[i] == ' ') {
+	if (i>-1) i--;
+	if (lookBlank==1) { /* go back and throw away all those
+			       digits between the \n and the blank,
+			       because they must be numbering of
+			       the sequence*/
+	   i=lookBlank_i-1;
+	 }
+      } /* if vec[i] is a blank*/
+
+      if (!isalpha(vec[i]) && !isdigit(vec[i]) && vec[i]!=' ' && i>-1) {
+	printf("WARNING: Your sequence contains a non-alphabetic character, %c.\nThis character was removed from the sequence.\n",vec[i]);
+	if (i>-1) i--;
+      }
+
+     }/* for i=0, i< max_n*/
+  
+    printf ("file_to_vec vector size exceeded when reading file %s\n",f_name);
+    return(0);
+   }
+	
+
+
+
+  
+
+	
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/bap/osp-bits/utils.h b/src/bap/osp-bits/utils.h
new file mode 100644
index 0000000..c6e1bd7
--- /dev/null
+++ b/src/bap/osp-bits/utils.h
@@ -0,0 +1,79 @@
+#ifndef _utils_h
+#define _utils_h
+
+/* 
+  Program Name: utils
+  File name: utils.h
+  Purpose:  utilities, parsing lines, reading in files, sorting
+  Last Update: Fri Mar 23 1991
+  Copyright 1991: LaDeana Hillier and Philip Green
+  Change Log:
+*/
+
+extern int space_file_to_vec();
+
+extern void text_to_output();
+
+/*text_to_output(vec,stp,endp,dvice,outfile)
+        input: char **vec,*outfile; int stp, endp, dvice;
+        this program output a specified portion of a genbank file
+(from vec[stp] to vec[endp]) to the screen (default), a file (dvice
+=1), or lpr (dvice =2).  Outfile is the input filename if you wish to
+output the information to a specified file. stp and endp are integers,
+not pointers. */
+
+
+extern int parse_chars();
+
+/*int parse_chars(vec,intgrs,stp,endp,alphas)
+        input: char *vec; int stp,endp;
+        output: int *intgrs; char **alphas;
+        parses vec[stp] to vec[endp] into strings !isspace held in the
+two dimensional character array alphas with intgrs holding the purely
+integer fields.  Returns the total number of fields parsed.
+*/
+
+
+
+extern int parse_intgrs();
+
+/*int parse_intgrs(vec,intgrs,stp,endp)
+        input: char *vec; int stp,endp;
+        output: int *intgrs;
+        parses vec[stp] to vec[endp] into strings of integers --
+throws away char strings.  Saves the integers in the array intgrs.
+Returns the number of integer fields found. */
+
+extern int file_to_vec();
+
+/* file_to_vec(f_name, vec, vec_st,max_n)
+	input: char *f_name; int max_n;
+	output: char *vec, int *vec_st;
+	reads the input file, f_name, to a vec array of chars of
+size vec_st with a maximum size of max_n
+
+returns any error messages in str.  returns a 0 if there was a problem, a 1 
+if there was not*/
+
+
+
+extern int ted_nnl_file_to_vec();
+
+/* int ted_nnl_file_to_vec(f_name, vec, max_n)
+	input: char *f_name; int max_n;
+	output: char *vec;
+	looks at the first char for a semicolon, if it is,
+	it knows the file has been input from ted, so it
+	gets the information about the sequence from the 
+	first line and gets the sequence from the rest 
+	
+	reads the input file, f_name, to a vec array of chars
+ with a maximum size of max_n; does not read in any of
+the end of line characters , returns the size of vec array.*/
+
+
+
+#endif /* _utils_h */
+
+
+
diff --git a/src/bap/remgel.f b/src/bap/remgel.f
new file mode 100644
index 0000000..8e32bfe
--- /dev/null
+++ b/src/bap/remgel.f
@@ -0,0 +1,152 @@
+      SUBROUTINE REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+C Routine to remove a reading from a database
+C Cases: 1 left end
+C        2 right end
+C        3 internal and dispensible
+C        4 internal and indispensible
+C if 1 change contig lnbr, contig length, lnbr of rnbr of remme, relpgs
+C if 2 change contig rnbr, contig length, rnbr of lnbr of remme
+C if 3 change contig length, lnbr of rnbr of remme rnbr of lnbr of remme
+C if 4 need to break contig, then as for 1
+C if 1 and 2 then also remove contig line
+C for all cases move gel ngels to remme (if remme/=ngels)
+C and update line idbsiz
+C
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME,CLEN
+      LOGICAL LEFTE,RIGHTE,DISPEN
+      CHARACTER GEL(MAXGEL)
+      EXTERNAL CLEN
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+      DISPEN = .FALSE.
+C
+C Left end ?
+C
+      IF(LNBR(REMME).EQ.0) LEFTE = .TRUE.
+C
+C Right end ?
+C
+      IF(RNBR(REMME).EQ.0) RIGHTE = .TRUE.
+C
+C If both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading and contig'
+        IFROM = NGELS
+        NGELS = NGELS - 1
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,IDEVR)
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+      ELSE IF(LEFTE) THEN
+        WRITE(KBOUT,*)'Removing reading from left end of contig'
+        LNBR(ICONT) = RNBR(REMME)
+        I = 1 - RELPG(RNBR(REMME))
+        WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,RNBR(REMME),ICONT,I)
+        I = LNBR(ICONT)
+        LNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE IF(RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading from right end of contig'
+        RNBR(ICONT) = LNBR(REMME)
+        I = RNBR(ICONT)
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNBR(ICONT))
+        CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +  LNBR(ICONT),RNBR(ICONT))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE
+C Is remme indispensible ?
+        NSTART = RELPG(RNBR(REMME))
+        I = REMME
+10      CONTINUE
+        I = LNBR(I)
+        IF(I.NE.0) THEN
+          IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.NSTART) GO TO 10
+          DISPEN = .TRUE.
+        END IF
+        IF(DISPEN) THEN
+          WRITE(KBOUT,*)
+     +    'Removing dispensible reading from middle of contig'
+          I = LNBR(REMME)
+          RNBR(I) = RNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          I = RNBR(REMME)
+          LNBR(I) = LNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNBR(ICONT))
+          CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +    LNBR(ICONT),RNBR(ICONT))
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        ELSE
+          WRITE(KBOUT,*)
+     +    'Removing indispensible reading from middle of contig'
+          WRITE(KBOUT,*)'So breaking contig first'
+          IR = REMME
+          IL = LNBR(REMME)
+          ILO = LNBR(ICONT)
+          NCONTO = ICONT
+          NCONTR = IDBSIZ - NCONTS - 1
+          CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+          IF(IOK.NE.0) RETURN
+          WRITE(KBOUT,*)'Removing reading from left end of contig'
+          ICONT = IDBSIZ - NCONTS
+          LNBR(ICONT) = RNBR(REMME)
+          I = 1 - RELPG(RNBR(REMME))
+          WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+          CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +    IDBSIZ,RNBR(REMME),ICONT,I)
+          I = LNBR(ICONT)
+          LNBR(I) = 0
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        END IF
+      END IF
+      END
diff --git a/src/bap/sap.f b/src/bap/sap.f
new file mode 100644
index 0000000..78e40e1
--- /dev/null
+++ b/src/bap/sap.f
@@ -0,0 +1,628 @@
+C     SAP (Sequence assembly program)
+C     AUTHOR RODGER STADEN
+C     9-1-90  sent idev instead of kbout to join and enter
+C     16-3-90 removed escape after bpause
+C     2-5-90 Many changes related to introducing graphics
+C     4-5-90 Allowed escape from getreg
+C     9-5-90 Introduced a default gel reading
+C    23-8-90 Added calls to SHOWFU
+C    28-11-90 See dbsys89
+C    22-1-91 Changed use of showfi to include version number
+C    27-1-91 Changed maxsav from 1000 to maxglm
+C     13-04-91 removed initial CLEARV
+C    28-7-91 added extra parameter for quality calc: mxgood, only the parts
+C    of reads upto this length are used
+C    21-8-91 Added function to find internal joins
+C    27-8-91 Added buffer flushing for database files
+C    2-9-91  BIG CHANGE: reading names increased to 16 chars and
+C            max readings can now be set at compile up to 999999 and
+C            database organisation changed (record 1 of relationships
+C            now used for file info, last record of names not used
+C            for file size info and database start does not waste
+C            space in working version and names files: they are
+C            increased as we go along
+C    12-11-91 Changed autoedit so that it takes strands into account: needs
+C             a new consensus type calculation sumss and a call from main.
+      SUBROUTINE FMAIN()
+      INTEGER CHRSIZ,ECHRSZ
+      PARAMETER (
+     +           NAMLEN = 40,
+     +           MAXDEV = 10,
+     +           MAXSEQ=50000,
+     +           MAXGLM=4096,
+     +           MAXGL2=MAXGLM*2,
+     +           CHRSIZ=4,
+     +           ECHRSZ = 6,
+     +           IDT1=ECHRSZ*MAXGL2,
+     +           IDT2=2*CHRSIZ*MAXGL2,
+     +           IDT3=2*MAXGL2,
+     +           IDT4=2*ECHRSZ*MAXGL2,
+     +           MAXDB=1000)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      INTEGER RELPG(MAXDB),LREG,RREG,ANS
+      INTEGER TEMP1(IDT1),TEMP2(IDT2),TEMP3(IDT4)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILNAM,NAMPRO,FILE,NAMARC,FILMAR
+      CHARACTER*(NAMLEN) SHONAM
+      CHARACTER GEL(MAXGLM)
+      CHARACTER GEL2(MAXGLM),GEL3(MAXGLM),GEL4(MAXGLM)
+      PARAMETER (BOTOPT=0,TOPOPT=35)
+      PARAMETER (HELPF='SAPHELP')
+      PARAMETER (POINTF='SAPHPNT')
+      PARAMETER (FILMAR='SAPMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARGB(TOPOPT),MARGT(TOPOPT)
+      CHARACTER SEQ1(MAXSEQ)
+C     DBAUTO TEMP3A = TEMP1, SEQ3A = GEL3, SEQ2A = GEL2
+C     MATCHA = GEL, SEQ4A = GEL4
+      PARAMETER (LENGTH = 6,
+     +           MAXCON = 500,
+     +           MAXSAV = MAXGLM,
+     +           LPOWRC = CHRSIZ**LENGTH)
+      INTEGER CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ5(MAXGLM),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      INTEGER WINDOW
+C     DBAUTO
+      EQUIVALENCE (TEMP2,TEMP1)
+      EQUIVALENCE (TEMP1,TEMP3)
+      IDM = 5
+      CALL INITS
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(10))
+      WRITE(KBOUT,1000)
+1000  FORMAT(/,
+     +' SAP (Sequence assembly program) V6.0 Nov 1991',/,
+     +' Author: Rodger Staden')
+      IGORT = 0
+      IDEV = KBOUT
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +TOPOPT,DEVNOS(5),FILMAR)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(5),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+      IOPEN=1
+      LINLEN=50
+      PERCD=0.75
+      WINDOW = 25
+      MXGOOD = MAXGLM
+      IGWIND = 1000
+      FILE = ' '
+      MAXGEL = MAXGLM
+      SHONAM = ' '
+      CALL SHOWFI(KBOUT,SHONAM)
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+        CALL SHOWFU(KBOUT,'Start new database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          LLINO = 0
+          IF(IERR.EQ.0) THEN
+            IOPEN=0
+            SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+            CALL SHOWFI(KBOUT,SHONAM)
+          END IF
+          CALL INITLU(IDM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+        CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      MENU = 0
+10    CONTINUE
+C
+C Flush the database buffers, just in case!
+C
+      CALL FLUSHL(DEVNOS(1))
+      CALL FLUSHL(DEVNOS(2))
+      CALL FLUSHL(DEVNOS(3))
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL SHOWFU(KBOUT,' ')
+C      IF(IOK.NE.0) GO TO 9999
+      CALL DBMENT(MENU,NOPT,TOPOPT,HELPS(0),HELPE(0),HELPF,DEVNOS(5),
+     +KBIN,KBOUT)
+      IF(NOPT.EQ.3)THEN
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+          CALL SHOWFU(KBOUT,'Start a database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          IF(IERR.EQ.0)IOPEN=0
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+          CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      END IF
+C  STOP
+      IF(NOPT.EQ.2)THEN
+        IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBOUT)
+        GO TO 9999
+      END IF
+
+
+C   clear all
+C
+      IF(NOPT.EQ.10)THEN
+        CALL SHOWFU(KBOUT,'Clear graphics')
+        CALL CLEARG
+        GO TO 10
+      END IF
+C   clear VT100
+C
+      IF(NOPT.EQ.11)THEN
+        CALL SHOWFU(KBOUT,'Clear text')
+        CALL CLEARV
+        GO TO 10
+      END IF
+C
+C   xhairs
+C
+      IF(NOPT.EQ.13) THEN
+          CALL SHOWFU(KBOUT,'Use xhair')
+          CALL XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,NCONTS,LLINO,LINCON,LREG,RREG,
+     +    WINDOW,IGWIND,LENCON,IDEPTH,JDEPTH,
+     +    MARGL,MARGR,MARGB,MARGT,TOPOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +    KBOUT,GEL,GEL2,DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM,
+     +    SEQ1,MAXSEQ,NGELS,TEMP3,ECHRSZ,MAXGL2,GEL3,GEL4,33,34,29,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MXGOOD)
+        GO TO 10
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(NOPT.EQ.14)THEN
+        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),TOPOPT,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  ruler
+C
+      IF(NOPT.EQ.12) THEN
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL SHOWFU(KBOUT,'Show ruler')
+          CALL RULER(LREG,RREG,MARGL,MARGR,
+     +    MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          GO TO 10
+        END IF
+      END IF
+C  labler
+      IF(NOPT.EQ.15)THEN
+        CALL SHOWFU(KBOUT,'Label diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.16) THEN
+        CALL SHOWFU(KBOUT,'Plot map')
+C  map
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL PLTMAP(DEVNOS(4),FILNAM,LENCON,MARGL,MARGR,MARGB(NOPT),
+     +    MARGT(NOPT),ISXMAX,ISYMAX,LREG,RREG,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.21).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Enter new gel reading')
+C       ENTER
+        CALL ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  DEVNOS(3),DEVNOS(4),LINLEN,PERCD,FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +  MAXGEL,IDM,LLINO)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.25).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Show relationships')
+C       PRINT DB
+        CALL DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IDEV,
+     +  KBIN,KBOUT,DEVNOS(3),LLINO,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.5).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Display contig')
+C       DISPLAY
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+          IF(IERR.EQ.0)THEN
+          CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +    GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +    DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+200   CONTINUE
+      IF((NOPT.EQ.22).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Join contigs')
+C       JOIN
+        WRITE(KBOUT,1006)
+1006    FORMAT(' Which contigs do you want to join ?')
+        WRITE(KBOUT,1017)
+1017    FORMAT( ' Left contig')
+        LLINOL = 0
+        LLINOR = 0
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONL,LLINOL,
+     +  IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          WRITE(KBOUT,1018)
+1018      FORMAT( ' Right contig')
+          CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LNCONR,LLINOR,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(IERR.EQ.0)THEN
+C           CHECK TO SEE IF MAKING A CIRCLE!
+            IF(LLINOL.EQ.LLINOR)THEN
+              WRITE(KBOUT,*)' Making circles is not allowed!'
+              GO TO 200
+            END IF
+            CALL JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +      LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+     +      IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +      DEVNOS(3),LINLEN,PERCD,
+     +      HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +      MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.23).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Complement contig')
+C       COMPLEMENT
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,LLINO,GEL,IDBSIZ,KBOUT,DEVNOS(1),DEVNOS(2),
+     +    MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  EDIT
+      IF((NOPT.EQ.4).AND.(IOPEN.EQ.0))THEN
+C       EDIT DB
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    GEL,LINCON,0,0,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  PLOT SINGLE CONTIG
+      IF((NOPT.EQ.33).AND.(IOPEN.EQ.0))THEN
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0)THEN
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.26).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Alter relationships')
+C       FIX
+        CALL DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),
+     +  HELPS(4),HELPE(4),HELPF,DEVNOS(5),MAXGEL,MAXGLM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.24).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Copy database')
+C       COPY
+        CALL DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +  NAMPRO,DEVNOS(4),IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),
+     +  DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,MAXDB,IDM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.19).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Check database for consistency')
+C       CHECK
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP1,IERR,KBOUT)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.29).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Examine quality')
+C       SCAN
+        IDIM1=MAXSEQ
+        CALL DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),IDEV,LINLEN,PERCD,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,GEL3,GEL4,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEVNOS(3),
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.8).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Make consensus')
+C       CONSENSUS
+        IDIM1=MAXSEQ
+        CALL CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),DEVNOS(4),FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,IDM,PERCD,
+     +  DEVNOS(3),LLINO)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(10),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+C  HELP
+      IF(NOPT.EQ.1)THEN
+        CALL SHOWFU(KBOUT,'Help')
+C       HELP
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,
+     +  HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.27)THEN
+        CALL SHOWFU(KBOUT,'Set parameters')
+C  SET DISPLAY PARAMETERS
+        MN = 10
+        MX = 100
+        CALL GETINT(MN,MX,LINLEN,'Display line length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) LINLEN = IVAL
+        IPCMIN = 51
+        IPCMAX = 100
+        IPCD = INT(PERCD*100.)
+        CALL GETINT(IPCMIN,IPCMAX,IPCD,
+     +  'Percentage score for consensus',
+     +  IPVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) THEN
+          PERCD = REAL(IPVAL)/100.
+        END IF
+        MN = 1
+        MX = MAXGLM
+        CALL GETINT(MN,MX,MXGOOD,
+     +  'Maximum read length for quality analysis',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) MXGOOD = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,WINDOW,'Text window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) WINDOW = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,IGWIND,'Graphics window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) IGWIND = IVAL
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+        CALL SHOWFU(KBOUT,'Display text file')
+       CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     + HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.28) THEN
+        CALL SHOWFU(KBOUT,'Highlight disagreements')
+        CALL HIGHLT(SEQ1,SEQ1(6001),SEQ1(10001),KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(7),
+     +  DEVNOS(8),FILE,IOK)
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.34).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Plot contigs')
+C  PLOT ALL CONTIGS
+        CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,0,0,
+     +  NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+C  AUTO EDIT
+      IF((NOPT.EQ.30).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'AutoEdit a contig')
+        WRITE(KBOUT,*)'Make a copy first!'
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0) THEN
+        IDIM1 = MAXSEQ
+        CALL SUMSS(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,IDIM1,
+     +  GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  DEVNOS(2),MAXGEL,GEL3,GEL4)
+        CALL AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,LINCON,
+     +  GEL,MAXGEL,SEQ1,IDIM1,DEVNOS(2),DEVNOS(1),LREG,RREG,
+     +  KBOUT)
+      END IF
+      END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.9).AND.(IOPEN.EQ.0)) THEN
+        CALL SHOWFU(KBOUT,'Screen Edit a contig')
+        CALL CEDT(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),LINLEN,FILNAM,LLINO,IOK)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17) THEN
+        CALL SHOWFU(KBOUT,'Screen for restriction sites')
+        CALL SCRENR(GEL2,MAXGEL,GEL3,NAMARC,FILE,
+     +  DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.18) THEN
+        CALL SHOWFU(KBOUT,'Screen against vector')
+        CALL SCRENV(MAXGEL,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +  SEQ1,MAXSEQ,GEL2,GEL3,GEL4,LENGTH,
+     +  SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +  KBIN,KBOUT,DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,FILE,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.20).AND.(IOPEN.EQ.0)) THEN
+        CALL SHOWFU(KBOUT,'Auto assemble sequences')
+      CALL DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.35).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+        CALL SHOWFU(KBOUT,'Find internal joins')
+      CALL AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.32).AND.(IOPEN.EQ.0)) THEN
+        CALL SHOWFU(KBOUT,'Extract gel readings')
+        CALL GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +  NCONTS,GEL,MAXGEL,DEVNOS(2),DEVNOS(3),DEVNOS(4),DEVNOS(7),
+     +  KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),FILE)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.31) THEN
+        CALL SHOWFU(KBOUT,'Type in gel readings')
+        CALL BATIN(SEQ1,MAXSEQ,KBIN,KBOUT,
+     +  DEVNOS(4),DEVNOS(7),IDEV,
+     +  FILNAM,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.GT.3).AND.(IOPEN.NE.0))THEN
+        WRITE(KBOUT,1012)
+1012    FORMAT(' You have not opened a database!')
+        GO TO 10
+      END IF
+      GO TO 10
+9999  CONTINUE
+      END
diff --git a/src/bap/sapmen.f b/src/bap/sapmen.f
new file mode 100644
index 0000000..c456223
--- /dev/null
+++ b/src/bap/sapmen.f
@@ -0,0 +1,188 @@
+      SUBROUTINE DBMENT(MENU,NOPT,KOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+1     CONTINUE
+      IF(MENU.EQ.0) THEN
+        WRITE(KBOUT,5000)
+5000    FORMAT(' Menus and their numbers are',/,
+     +  ' m0 = This menu',/,
+     +  ' m1 = General',/,
+     +  ' m2 = Screen control',/,
+     +  ' m3 = Modification',/,
+     +  '  ? = Help',/,
+     +  '  ! = Quit')
+      ELSE IF(MENU.EQ.-1) THEN
+      WRITE(KBOUT,1001)
+1001  FORMAT(' General menu',/,
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Open a database',/,
+     +'  5 = Display a contig',/,
+     +'  6 = List a text file',/,
+     +'  7 = Redirect output',/,
+     +'  8 = Calculate a consensus',/,
+     +' 17 = Screen against restriction enzymes',/,
+     +' 18 = Screen against vector',/,
+     +' 19 = Check database',/,
+     +' 24 = Copy database',/,
+     +' 25 = Show relationships',/,
+     +' 27 = Set parameters',/,
+     +' 28 = Highlight disagreements',/,
+     +' 29 = Examine quality',/,
+     +' 35 = Find internal joins')
+      ELSE IF(MENU.EQ.-2) THEN
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Screen control menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +' 10 = Clear graphics',/,
+     +' 11 = Clear text',/,
+     +' 12 = Draw ruler',/,
+     +' 13 = Use cross hair',/,
+     +' 14 = Change margins',/,
+     +' 15 = Label diagram',/,
+     +' 16 = Plot map',/,
+     +' 33 = Plot single contig',/,
+     +' 34 = Plot all contigs')
+      ELSE IF(MENU.EQ.-3) THEN
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Modification menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  4 = Edit contig',/,
+     +'  9 = Screen edit',/,
+     +' 20 = Auto assemble',/,
+     +' 21 = Enter new gel reading',/,
+     +' 22 = Join contigs',/,
+     +' 23 = Complement a contig',/,
+     +' 26 = Alter relationships',/,
+     +' 30 = Auto edit a contig',/,
+     +' 31 = Disassemble readings',/,
+     +' 32 = Extract gel readings')
+      END IF
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      IF((NOPT.LT.-3).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS(NOPT),IHELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.0) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF(NOPT.LT.0) THEN
+        MENU = NOPT
+        GO TO 1
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      RETURN
+      END
+      SUBROUTINE DBMENU(MENU,NOPT,KOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+1     CONTINUE
+      IF(MENU.EQ.2)THEN
+20    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1002)
+1002  FORMAT( ' Enter menu options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete entry',/,
+     +'  4 = Edit contig',/,
+     +'  5 = Display',/,
+     +'  6 = Edit new gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 6
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 20
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 20
+      END IF
+      ELSE IF(MENU.EQ.3)THEN
+30    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1003)
+1003  FORMAT( ' Join menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete join',/,
+     +'  4 = Edit left contig',/,
+     +'  5 = Display join',/,
+     +'  6 = Edit right contig',/,
+     +'  7 = Move join')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 7
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 30
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      ELSE IF(MENU.EQ.4)THEN
+40    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Alter relationships menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Line change',/,
+     +'  4 = Check logical consistency',/,
+     +'  5 = Remove a contig',/,
+     +'  6 = Shift',/,
+     +'  7 = Move gel reading',/,
+     +'  8 = Rename gel reading',/,
+     +'  9 = Break a contig',/,
+     +' 10 = Remove a gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 10
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 40
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 40
+      END IF
+      ELSE IF(MENU.EQ.5)THEN
+50    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1005)
+1005  FORMAT(
+     +' Edit options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Insert',/,
+     +'  4 = Delete',/,
+     +'  5 = Change')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 5
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 50
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      IF(NOPT.LE.0) GO TO 1
+      RETURN
+      END
diff --git a/src/bap/subclone.c b/src/bap/subclone.c
new file mode 100644
index 0000000..84c99d9
--- /dev/null
+++ b/src/bap/subclone.c
@@ -0,0 +1,211 @@
+#include <stdio.h>
+
+#include "subclone.h"
+
+static CloneInfo NULLCloneInfo = {
+    "",  /* mtd */
+    "",  /* cosmid */
+    1000, /* range_from */
+    1000, /* range_to */
+    "",  /* method */
+    "",  /* vector */
+};
+
+
+/*
+** Imports
+*/
+extern double atof(char *s);
+
+
+
+
+static void get_entry(char *fn, char *mtd, char *entry)
+{
+    FILE *f;
+    char in_line[256];
+
+    entry[0]='\0';
+    if ( (f = fopen(fn,"r")) == NULL) return;
+
+    while (fgets(in_line,(int)sizeof(in_line),f) != NULL) {
+	int l_mtd = strlen(mtd);
+	if (strncmp(in_line,mtd,l_mtd) == 0 && isspace(in_line[l_mtd])) {
+	    strcpy(entry,in_line);
+	    fclose(f);
+	    return;
+	}
+    }
+
+    fclose(f);
+}
+	
+
+
+
+
+
+
+#define MAX_CACHE_ENTRIES 5
+
+static struct {
+    char key[200];
+    int time;
+    CloneInfo info;
+} cache[MAX_CACHE_ENTRIES];
+
+static int cache_entries = 0;
+static int cache_time = 0;
+
+
+
+
+static int check_in_cache(char *fn, char *mtd, CloneInfo *info)
+{
+    int i;
+    char key[200];
+
+    if (! cache_entries) return 0;
+
+    strcpy(key,mtd);
+    strcat(key,fn);
+
+    for (i=0; i < cache_entries; i++) {
+	if (strcmp(key,cache[i].key)==0) {
+	    cache[i].time = cache_time++;
+	    *info = cache[i].info;
+	    return 1;
+	}
+    }
+
+    return 0;
+}
+
+
+
+
+
+
+
+static void store_in_cache(char *fn, char *mtd, CloneInfo *info)
+{
+    int slot;
+    char key[200];
+
+    strcpy(key,mtd);
+    strcat(key,fn);
+
+    if (cache_entries < MAX_CACHE_ENTRIES)
+	slot = cache_entries++;
+    else {
+	int i;
+	int slot_time = cache_time;
+
+	for (i = 0; i < cache_entries; i++) {
+	    if (slot_time > cache[i].time) {
+		slot_time = cache[i].time;
+		slot = i;
+	    }
+	}
+    }
+
+    strcpy(cache[slot].key, key);
+    cache[slot].info = *info;
+    cache[slot].time = cache_time++;
+}
+
+
+
+
+
+
+
+
+int read_subclone_info(char *fn, char *mtd, CloneInfo *info)
+/*
+** Use grep to get info from subclone file
+*/
+{
+    char temp[100];
+    char entry[256];
+#ifdef OLD_CLUNKY_FORMAT
+    float float_range_from, float_range_to;
+#endif /*OLD_CLUNKY_FORMAT*/
+
+    /*
+    ** Initialise all fields
+    */
+    *info = NULLCloneInfo;
+
+    /*
+    ** Check arguments are sensible
+    */
+    if (!strlen(fn) || !strlen(mtd) || info==NULL)
+	return 1;
+
+    /*
+    ** Check in cache
+    */
+    if ( check_in_cache(fn,mtd,info) ) return 0;
+
+    /*
+    ** Get subclone entry (as a string)
+    */
+    get_entry(fn,mtd,entry);
+
+    /* check that a sensible result was reached */
+    if (!entry[0])
+	return 1;
+
+    /*
+    ** Parse line
+    */
+    sscanf(entry, "%s %s %s %s %s",
+	   info->mtd,
+	   info->cosmid,
+	   temp,
+	   info->method,
+	   info->vector);
+
+#ifdef OLD_CLUNKY_FORMAT
+    /*
+    ** Size information is (currently) expressed as a range of two floating
+    ** point numbers representing size in kilobases.
+    **
+    ** Parse range information:
+    **     <range> ::= <float> | <null-float> - <null-float>
+    **     <null-float> ::= <float> | 
+    */
+    if (strchr(temp,'-') == NULL) {
+	float_range_from = float_range_to = (float) atof(temp) ;
+    } else {
+	sscanf(temp, "%f-%f", &float_range_from, &float_range_to);
+    }
+
+    info->range_from = (int) (float_range_from * 1000.0);
+    info->range_to = (int) (float_range_to * 1000.0);
+    
+#else /* OLD_CLUNKY_FORMAT */
+
+    /*
+    ** Size information is (currently) expressed as a range of two integer
+    ** numbers representing size in bases.
+    **
+    ** Parse range information:
+    **     <range> ::= <integer> | <null-integer> .. <null-integer>
+    **     <null-integer> ::= <integer> | 
+    */
+    if (strchr(temp,'.') == NULL) {
+	info->range_from = info->range_to = atoi(temp) ;
+    } else {
+	sscanf(temp, "%d..%d", &info->range_from, &info->range_to);
+    }
+
+#endif /* OLD_CLUNKY_FORMAT */
+
+    store_in_cache(fn,mtd,info);
+
+    return 0;
+    	
+}
+
diff --git a/src/bap/subclone.h b/src/bap/subclone.h
new file mode 100644
index 0000000..c13c2f7
--- /dev/null
+++ b/src/bap/subclone.h
@@ -0,0 +1,22 @@
+#ifndef _subclone_h
+#define _subclone_h
+
+typedef char Mtd[10];
+typedef char Cosmid[10];
+typedef char FragMethod[10];
+typedef char Vector[10];
+
+typedef struct _clone_info {
+    Mtd mtd;
+    Cosmid cosmid;
+    int range_from; /* in bases */
+    int range_to;   /* in bases */
+    FragMethod method;
+    Vector vector;
+} CloneInfo;
+
+
+
+extern int read_subclone_info(char *fn, char *mtd, CloneInfo *info);
+
+#endif /* _subclone_h */
diff --git a/src/bap/subs.f b/src/bap/subs.f
new file mode 100644
index 0000000..49122c7
--- /dev/null
+++ b/src/bap/subs.f
@@ -0,0 +1,228 @@
+      INTEGER FUNCTION CONOK(
+     +RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEVW,IDEV,
+     +MAXGEL,IDM,PERCD,CENDS,NENDS,MAXCON)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER ECHRSZ,TEMP3(ECHRSZ,MAXGL2),CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ(MAXSEQ),SEQ2(MAXGEL),NAMPRO*(*)
+      EXTERNAL NOK2
+      CONOK = 0
+      JOB = 0
+      IDIM1 = 0
+      CALL ACONSN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ,MAXSEQ,SEQ2,IDBSIZ,IDIM1,JOB,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+      IF(IFAIL.NE.0) THEN
+        CALL ERROM(KBOUT,'Error calculating consensus')
+        CONOK = -1
+        RETURN
+      END IF
+      CALL FNDCON(SEQ,IDIM1,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      IWIN = 20
+      NBAD = 15
+      DO 10 I=1,IDCEND
+        J = CENDS(I) + 20
+        IDIM = CENDS(I+1) - J
+        K =  NOK2(SEQ(J),IDIM,IWIN,NBAD)
+        IF (K.LT.IDIM) THEN
+          WRITE(IDEV,*)'Problem at position',K,' In contig',NENDS(I)
+          CONOK = CONOK + 1
+        END IF
+ 10   CONTINUE
+      IF (CONOK.EQ.0) THEN
+        WRITE(KBOUT,*)
+     +' Consensus has no segments with',NBAD,' dashes in',IWIN
+      ELSE
+        WRITE(KBOUT,*)
+     +  CONOK,' contigs have segments with',NBAD,' dashes in',IWIN
+      END IF
+      END
+      SUBROUTINE DBMENU(MENU,NOPT,KOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+1     CONTINUE
+      IF(MENU.EQ.0) THEN
+        WRITE(KBOUT,5000)
+5000    FORMAT(' Menus and their numbers are',/,
+     +  ' m0 = This menu',/,
+     +  ' m1 = General',/,
+     +  ' m2 = Screen control',/,
+     +  ' m3 = Modification',/,
+     +  '  ? = Help',/,
+     +  '  ! = Quit')
+      ELSE IF(MENU.EQ.-1) THEN
+      WRITE(KBOUT,1001)
+1001  FORMAT(' General menu',/,
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Open a database',/,
+     +'  5 = Display a contig',/,
+     +'  6 = List a text file',/,
+     +'  7 = Redirect output',/,
+     +'  8 = Calculate a consensus',/,
+     +' 17 = Screen against restriction enzymes',/,
+     +' 18 = Screen against vector',/,
+     +' 19 = Check database',/,
+     +' 24 = Copy database',/,
+     +' 25 = Show relationships',/,
+     +' 27 = Set parameters',/,
+     +' 28 = Highlight disagreements',/,
+     +' 29 = Examine quality',/,
+     +' 35 = Find internal joins')
+      ELSE IF(MENU.EQ.-2) THEN
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Screen control menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +' 10 = Clear graphics',/,
+     +' 11 = Clear text',/,
+     +' 12 = Draw ruler',/,
+     +' 13 = Use cross hair',/,
+     +' 14 = Change margins',/,
+     +' 15 = Label diagram',/,
+     +' 16 = Plot map',/,
+     +' 33 = Plot single contig',/,
+     +' 34 = Plot all contigs')
+      ELSE IF(MENU.EQ.-3) THEN
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Modification menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  4 = Edit contig',/,
+     +'  9 = Screen edit',/,
+     +' 20 = Auto assemble',/,
+     +' 21 = Enter new gel reading',/,
+     +' 22 = Join contigs',/,
+     +' 23 = Complement a contig',/,
+     +' 26 = Alter relationships',/,
+     +' 30 = Auto edit a contig',/,
+     +' 31 = Disassemble readings',/,
+     +' 32 = Extract gel readings')
+      END IF
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      IF((NOPT.LT.-3).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS(NOPT),IHELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.0) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF(NOPT.LT.0) THEN
+        MENU = NOPT
+        GO TO 1
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      RETURN
+      END
+      SUBROUTINE DBMENT(MENU,NOPT,KOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+1     CONTINUE
+      IF(MENU.EQ.2)THEN
+20    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1002)
+1002  FORMAT( ' Enter menu options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete entry',/,
+     +'  4 = Edit contig',/,
+     +'  5 = Display',/,
+     +'  6 = Edit new gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 6
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 20
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 20
+      END IF
+      ELSE IF(MENU.EQ.3)THEN
+30    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1003)
+1003  FORMAT( ' Join menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete join',/,
+     +'  4 = Edit left contig',/,
+     +'  5 = Display join',/,
+     +'  6 = Edit right contig',/,
+     +'  7 = Move join')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 7
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 30
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      ELSE IF(MENU.EQ.4)THEN
+40    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Alter relationships menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Line change',/,
+     +'  4 = Check logical consistency',/,
+     +'  5 = Remove a contig',/,
+     +'  6 = Shift',/,
+     +'  7 = Move gel reading',/,
+     +'  8 = Rename gel reading',/,
+     +'  9 = Break a contig',/,
+     +' 10 = Remove a gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 10
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 40
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 40
+      END IF
+      ELSE IF(MENU.EQ.5)THEN
+50    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1005)
+1005  FORMAT(
+     +' Edit options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Insert',/,
+     +'  4 = Delete',/,
+     +'  5 = Change')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 5
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 50
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      IF(NOPT.LE.0) GO TO 1
+      RETURN
+      END
diff --git a/src/bap/tagU2.c b/src/bap/tagU2.c
new file mode 100644
index 0000000..6b12b5c
--- /dev/null
+++ b/src/bap/tagU2.c
@@ -0,0 +1,896 @@
+/*
+** tagU2.c
+**
+** Tag IO routines required by non-X assembly program
+**
+** Changes:
+**
+** 8-Jul-92
+**      getext_() searchs for an IGN tag, to determine if cutoff should be
+**      ignored
+** 7-Aug-92
+**	now initial tags can be specified in the sequence file
+**      format is ";;%4s %6d %6d %s\n",type,position,length,comment
+**      The comment is optional
+** 27-Aug-92
+**      modext() modifies cutoff data
+**
+*/
+#include "contigEditor.h"
+#include <stdio.h>
+#include <stdlib.h>
+#include "tagUtils.h"
+#include "fort.h"
+
+#define COMMENT_HEAD_ID (1)
+#define TAG_HEAD_ID (devils_.idbsiz)
+
+int read_tag(tag_id n, tagRecord *t)
+{
+    readtg_(&devils_.idevt,&n,&t->position,&t->length,&t->comment,&t->type.i,&t->next);
+    return 0;
+}
+int write_tag(tag_id n, tagRecord t)
+{
+    writtg_(&devils_.idevt,&n,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    return 0;
+}
+int read_comment(comment_id n, int *cnt, commentStruct *c)
+{
+    readcc_(&devils_.idevc,&n,cnt,&c->next,&c->comment[0],40);
+    return 0;
+}
+int write_comment(comment_id n, int cnt, commentStruct c)
+{
+    writcc_(&devils_.idevc,&n,&cnt,&c.next,&c.comment[0],40);
+    return 0;
+}
+/* comment interface */
+tag_id get_free_tag()
+{
+    tagRecord head;
+    tagRecord freerec;
+    tag_id free_id;
+    (void) read_tag(TAG_HEAD_ID,&head);
+    if (head.next != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.next;
+	(void) read_tag(free_id,&freerec);
+	head.next = freerec.next;
+	(void) write_tag(TAG_HEAD_ID,head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.position;
+	(void) write_tag(TAG_HEAD_ID,head);
+	(void) write_tag(free_id,freerec);
+    }
+
+    return free_id;
+}
+
+
+void delete_comment (comment_id cp)
+/*
+** Put a comment on the free stack
+*/
+{
+    int dummy;
+    commentStruct head;
+    commentStruct freerec;
+    comment_id free_id;
+    if (!cp) return;
+    free_id = cp;
+    (void) read_comment(free_id,&dummy,&freerec);
+    while (freerec.next) {
+	free_id = freerec.next;
+        (void) read_comment(free_id,&dummy,&freerec);
+    }
+    (void) read_comment(COMMENT_HEAD_ID,&dummy,&head);
+    freerec.next = head.next;
+    (void) write_comment(free_id,dummy,freerec);
+    head.next = cp;
+    (void) write_comment(COMMENT_HEAD_ID,dummy,head);
+}
+
+
+
+
+
+void blank_tag_rec(tag_id t)
+/*
+** Blank out fields in tag record t
+*/
+{
+
+    tagRecord r;
+
+    (void) read_tag(t, &r);
+
+    r.position = 0;
+    r.length = 0;
+    r.type.i = 0x20202020;
+    r.comment = 0;
+    r.next = 0;
+    
+    (void) write_tag(t, r);
+
+}
+
+
+
+
+void delete_tag_rec(tag_id t)
+/*
+** remove t from file, discarding comment if necessary
+*/
+{
+    tagRecord head;
+    tagRecord freerec;
+
+    (void) read_tag(t,&freerec);
+
+    if (freerec.comment)
+	delete_comment(freerec.comment);
+
+    (void) read_tag(TAG_HEAD_ID,&head);
+    freerec.next = head.next;
+    (void) write_tag(t,freerec);
+    head.next = t;
+    (void) write_tag(TAG_HEAD_ID,head);
+    
+}
+
+static comment_id get_free_comment()
+{
+    commentStruct head;
+    commentStruct freerec;
+    comment_id free_id;
+    int count;
+    (void) read_comment(COMMENT_HEAD_ID,&count,&head);
+    if (head.next != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.next;
+	(void) read_comment(free_id,&count,&freerec);
+	head.next = freerec.next;
+	(void) write_comment(COMMENT_HEAD_ID,count,head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++count;
+	(void) write_comment(COMMENT_HEAD_ID,free_id,head);
+	(void) write_comment(free_id,0,freerec);
+    }
+
+    return free_id;
+}
+
+comment_id put_comment(char *c)
+{
+    commentStruct com;
+    comment_id cur,next,this_comment;
+    int clen = strlen(c);
+    int piece;
+
+    /* write out first block of COMMENT_LENGTH */
+    this_comment=cur=get_free_comment();
+    if (clen>COMMENT_LENGTH)
+	piece = COMMENT_LENGTH;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<COMMENT_LENGTH;i++)com.comment[i]=' ';}
+    strncpy(com.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment();
+	com.next = next;
+	write_comment(cur,0,com);
+	cur = next;
+	if (clen<COMMENT_LENGTH)
+	    piece = clen;
+
+	{int i; for(i=0;i<COMMENT_LENGTH;i++)com.comment[i]=' ';}
+	strncpy(com.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.next = 0;
+    if (piece!=COMMENT_LENGTH)
+	com.comment[piece]='\0';
+    write_comment(cur,0,com);
+
+    return this_comment;
+}
+
+static void add_RD_comment(tag_id t_id, char *comment)
+{
+    tagRecord t;
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    t.comment = put_comment(comment);
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+static void insert_CUT_tags(tag_id t_id, int pos, int length, char *type, char *comment)
+{
+    tagRecord t,new_t;
+    tag_id next_id,new_id;
+
+    new_t.length = 1;
+    strncpy(new_t.type.c,type,4);
+    new_t.comment = put_comment(comment);
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    next_id = t.next;
+    new_id = get_free_tag();
+    new_t.position = pos;
+    new_t.next = next_id;
+    writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+    next_id = new_id;
+    t.next=next_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+void insert_NEW_tag(tag_id t_id, int pos, int length, char *type, char *comment)
+{
+    tagRecord new_t;
+    tagRecord t, next_t;
+    tag_id next_id,new_id;
+
+    new_t.position = pos;
+    new_t.length = length;
+    strncpy(new_t.type.c,type,4);
+    if (comment)
+	new_t.comment = put_comment(comment);
+    else
+	new_t.comment = 0;
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    next_id = t.next;
+    if (next_id)
+	readtg_(&devils_.idevt,&next_id,&next_t.position,&next_t.length,&next_t.comment,&next_t.type.i,&next_t.next);
+    while(next_id && next_t.position <= pos) {
+	t_id = next_id;
+	t = next_t;
+	next_id = next_t.next;
+	if(next_id)
+	    readtg_(&devils_.idevt,&next_id,&next_t.position,&next_t.length,&next_t.comment,&next_t.type.i,&next_t.next);
+    }
+
+    new_id = get_free_tag();
+    new_t.next = next_id;
+    writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+    t.next = new_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+}
+
+static void insert_INS_tags(tag_id t_id, int pos, int length)
+{
+    tagRecord t,new_t;
+    tag_id next_id,new_id;
+    int i;
+
+    new_t.length = 1;
+    strncpy(new_t.type.c,TAG_TYPE_INSERT,3);
+    new_t.type.c[3]='*';
+    new_t.comment = 0;
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    next_id = t.next;
+    for (i=length-1;i>=0;i--) {
+	new_id = get_free_tag();
+	new_t.position = pos+i;
+	new_t.next = next_id;
+	writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+	next_id = new_id;
+    }
+    t.next=next_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+static void myFstr2Cstr(char *Fstr, int_fl Flen,
+	       char *Cstr, int_fl Clen)
+/*
+    Copy the significant characters of a blank padded Fortran string
+    to a '\0' terminated C string, ignoring excess characters.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+{   int_fl FsigLen, i;
+
+    /* Find the significant length of Fstr */
+    FsigLen=Flen;
+    while ((FsigLen > 0) && (Fstr[FsigLen-1] == ' '))
+    {   FsigLen--;
+    }
+
+    /* Copy up to (Clen-1) significant characters */
+    i=0;
+    while ((i < FsigLen) && (i < (Clen-1)))
+    {   Cstr[i] = Fstr[i];
+        i++;
+    }
+
+    Cstr[i] = '\0';
+}
+
+/* C version of ENTRD */
+void entrd_(int_f *IDEVG,int_f *IDEVT,int_f *IDEVC,char *NAMARC,int_f *NGEL,
+       int_f *IOK, int_fl l_NAMARC)
+{
+    int LENR,LCUT,LENW;
+    int RCUT;
+
+#define fn_len 100
+    char fn[fn_len];
+#define l_line 100
+    char line[l_line];
+    char *result;
+    char *cutbuff,*cb,*l;
+
+    FILE *fp;
+
+    /* blank tag rec */
+    blank_tag_rec((tag_id) *NGEL);
+
+    myFstr2Cstr(NAMARC,l_NAMARC,fn,(int_fl)fn_len);
+
+    *IOK = 0;
+    /* don't need this anymore
+    saveState.idevt  = (int_f *) &devils_.idevt;
+    saveState.idevc  = (int_f *) &devils_.idevc;
+    saveState.idbsiz = (int_f *) &devils_.idbsiz;
+    */
+    /* Enters raw data info into database */
+    if ( (fp = fopen(fn,"r")) != NULL ) {
+	result = fgets(line,l_line,fp);
+	if (result != NULL && line[0] == ';') { 
+	    /* process header */
+	    sscanf(line,";%6d%6d%6d%*s\n",&LENR,&LCUT,&LENW);
+	    RCUT = LENR - LCUT - LENW;
+            add_RD_comment(*NGEL, &line[1]);
+	    result = fgets(line,l_line,fp);
+	}
+
+	if (result != NULL && strlen(line)>1 && strncmp(line,";<",2)==0) {
+	    /* process left cut off */
+	    /* starting with this line we need to read LCUT ditties */
+	    cutbuff = (char *) TAG_MALLOC(LCUT+1);
+	    cb = cutbuff;
+	    while (result!=NULL && strlen(line)>1 && strncmp(line,";<",2)==0) {
+		for(l=&line[2];*l;l++)if(strchr(" \n\t",*l)==NULL)*cb++=*l;
+		result = fgets(line,l_line,fp);
+	    }
+	    *cb='\0';
+            insert_CUT_tags(*NGEL, 0, 0, TAG_TYPE_LCUT, cutbuff);
+	    TAG_FREE(cutbuff);
+	}
+
+	if (result != NULL && strlen(line)>1 && strncmp(line,";>",2)==0) {
+	    /* process right cut off */
+	    /* starting with this line we need to read RCUT ditties */
+	    cutbuff = (char *) TAG_MALLOC(RCUT+1);
+	    cb = cutbuff;
+	    while (result != NULL && strlen(line)>1 && strncmp(line,";>",2)==0) {
+		for(l=&line[2];*l;l++)if(strchr(" \n\t",*l)==NULL)*cb++=*l;
+		result = fgets(line,l_line,fp);
+	    }
+	    *cb='\0';
+            insert_CUT_tags(*NGEL, 0, 0, TAG_TYPE_RCUT, cutbuff);
+	    TAG_FREE(cutbuff);
+	}
+	while (result != NULL && strlen(line)>1 && strncmp(line,";;",2)==0) {
+	    int pos,len;
+	    char *comment;
+	    /* format ";;%4s %6d %6d %s\n",type,position,length,comment 
+	       --or-- ";;%4s %6d %6d\n",   type,position,length
+	    */
+	    if (strlen(line)>21)
+		comment = line+21;
+	    else
+		comment = NULL;
+	    sscanf(line+7,"%6d %6d",&pos,&len);
+            insert_NEW_tag((tag_id)*NGEL, pos, len, line+2, comment);
+	    result = fgets(line,l_line,fp);
+	}
+
+	fclose(fp);
+    }
+}
+
+int_f freecc_ (int_f *IDEVC)
+{
+    return get_free_comment();
+}
+
+void padtag_(int_f *llino_, int_f *k_, int_f *nc_, int_f *lngthg_)
+/*
+** Play around with tags when auto assembling
+*/
+{
+    int seq;
+    int pos;
+    int length;
+    int seq_length;
+    tagRecord t;
+    tag_id last,next;
+    int last_pos;
+    if (devils_.idevt>0) {
+	/* fudge */
+	/* don't need this anymore
+	saveState.idevt  = (int_f *) &devils_.idevt;
+	saveState.idbsiz = (int_f *) &devils_.idbsiz;
+	*/
+	seq = *llino_;
+	length = *nc_;
+	if (*lngthg_ < 0) {
+	    seq_length = - *lngthg_;
+	    pos = seq_length - *k_ - *nc_ + 2;
+	} else {
+	    seq_length = *lngthg_;
+	    pos = *k_;
+	}
+
+	next = seq;
+	last = 0;
+	last_pos = 0;
+	while (next) {
+	    readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    /*
+	    ** Move tags accordingly
+	    */
+	    if (t.position >= pos)
+	    {
+		t.position+=length;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+	    else if (t.position + t.length > pos)
+	    {
+		t.length+=length;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+
+	    /*
+	    ** Insert *INS tags
+	    */
+	    if (last_pos < pos && t.position >= pos)
+		/*
+		** Insert after last `length' tags
+		*/
+		insert_INS_tags(last,pos,length);
+	    else if (t.position < pos && t.next==0)
+		/*
+		** Insert after next `length' tags
+		*/
+		insert_INS_tags(next,pos,length);
+
+            last = next;
+	    last_pos = t.position;
+	    next = t.next;
+	}
+
+    }
+}
+
+void taggel_(int_f *ngels_, int_f *lngthg_, char *gel_, int_fl gel_length)
+{
+    int_f length = abs(*lngthg_);
+    int_f one=1;
+    int_f i,j;
+    
+    if (*lngthg_ > 0) {
+	for (i=0,j=1; i<length; i++,j++)
+	    if (gel_[i]=='*')
+		padtag_(ngels_,&j,&one,lngthg_);
+    } else {
+	for (i=length-1,j=length; i>=0; i--,j--)
+	    if (gel_[i]=='*')
+		padtag_(ngels_,&j,&one,lngthg_);
+    }
+
+}
+
+void movtag_ (int_f *from, int_f *to )
+/*
+** Move tag information of gel ``from'' to gel ``to'',
+** and perform garbage collection on old gel ``to''
+*/
+{
+
+    tagRecord freerec;
+    tag_id this,next;
+
+    /* don't do anything if tag files aren't open */
+    if ( devils_.idevc < 0 || devils_.idevt < 0 ) return;
+
+    /* Throw away ``to'' tag records */
+    (void) read_tag((tag_id) *to,&freerec);
+
+    if (freerec.comment)
+	delete_comment(freerec.comment);
+
+    next = freerec.next;
+    while ( next ) {
+        this = next;
+	(void) read_tag( this , &freerec );
+        next = freerec.next;
+        delete_tag_rec (this);
+    }
+
+    /* copy ``from'' record to ``to'' record */
+    (void) read_tag((tag_id) *from,&freerec);
+    (void) write_tag((tag_id) *to,freerec);
+
+    /* initialise the hole to blank */
+    blank_tag_rec(*from);
+
+}
+
+char *get_comment(comment_id cp)
+{
+    int dummy;
+    commentStruct c;
+    int count;
+    comment_id nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    read_comment(nc, &dummy, &c);
+    while (c.next != 0) {
+	nc = c.next;
+	count++;
+        read_comment(nc, &dummy, &c);
+    }
+
+    com = comptr = (char *)TAG_MALLOC(count * COMMENT_LENGTH+1);
+    nc=cp;
+    read_comment(nc, &dummy, &c);
+    strncpy(com,c.comment,COMMENT_LENGTH); com+=COMMENT_LENGTH;
+    while (c.next != 0) {
+	nc = c.next;
+	count++;
+        read_comment(nc, &dummy, &c);
+        strncpy(com,c.comment,COMMENT_LENGTH); com+=COMMENT_LENGTH;
+    }
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+void getext_(int_f *gel, char *cutoff, int_f *lcutoff, int_f *ok,
+	     int_fl l_cutoff)
+/*
+** Get right cutoff for lowly Fortran Users
+** If a tag TAG_TYPE_IGNORE exists return with ok=0
+*/
+{
+#define TAG_TYPE_IGNORE "IGN"
+
+    tagRecord rec;
+    tag_id next;
+
+    (void) read_tag((tag_id) *gel,&rec);
+
+    next = rec.next;
+
+    *ok = 1;
+    while ( next && ! rec.position && *ok == 1) {
+	(void) read_tag( next , &rec );
+        next = rec.next;
+
+	if (strncmp(rec.type.c,TAG_TYPE_RCUT,4) == 0) {
+	    if (rec.comment) {
+	        char *c;
+		int i;
+		/* use this */
+		c = get_comment(rec.comment);
+		/* copy to fortran array */
+		for (i=0; i<*lcutoff && c[i]; i++) cutoff[i] = c[i];
+		/* set length of returned string */
+		*lcutoff = i;
+		for (; i<*lcutoff; i++) cutoff[i] = ' ';
+
+		/* rec temporary string space */
+		TAG_FREE(c);
+		*ok = 0;
+	    } else {
+		return;
+	    }
+
+	} else if (strncmp(rec.type.c,TAG_TYPE_IGNORE,3) == 0) return;
+
+    }
+
+    if (*ok == 0) {
+	/* check to see we should ignore this reading */
+	/* assumption - IGN tag occurs pos >= 1 */
+	while ( next && *ok == 0) {
+	    (void) read_tag( next , &rec );
+	    next = rec.next;
+	    *ok = (strncmp(rec.type.c,TAG_TYPE_IGNORE,3) == 0);
+	}
+    }
+
+}
+
+
+
+
+
+
+
+
+
+
+
+/*************************************************************/
+/**      Routines for Rodger's pad shuffling                 */
+/*************************************************************/
+
+static void insert_edit_tag(tag_id t_id, int pos, char base, char *type)
+{
+    tagRecord t,new_t;
+    tag_id new_id;
+
+    /* read previous tag */
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,
+	    &t.type.i,&t.next);
+
+    /* set up new record */
+    strncpy(new_t.type.c,type,3);
+    new_t.type.c[3]=base;
+    new_t.length = 1;
+    new_t.comment = 0;
+    new_t.position = pos;
+    new_t.next = t.next;
+
+    /* write new record */
+    new_id = get_free_tag();
+    writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+
+    /* update previous tag */
+    t.next=new_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+
+
+void insedt_(int_f *seq_, int_f *pos_, char *base_, int_fl base_l)
+/*
+** Insert a tag for a base at position pos in gel
+*/
+{
+    int seq;
+    int pos;
+    char base;
+
+    tagRecord t;
+    tag_id last,next;
+
+    if (devils_.idevt>0) {
+
+	seq = (int) *seq_;
+	pos = (int) *pos_;
+	base = base_[0];
+
+
+	/* skip over header */
+	last = next = seq;
+	readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	next = t.next;
+	if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	/* adjust lengths of all tags staring before pos */
+	while (next && t.position < pos) {
+	    if (t.position + t.length - 1 >= pos &&
+		t.type.c[0] != '*') {
+		t.length++;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+            last = next;
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	}
+
+	/* insert a tag after last and before next */
+	insert_edit_tag(last,pos,base,TAG_TYPE_INSERT);
+
+	/* shift everything else right */
+	while (next) {
+	    t.position++;
+	    writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	}
+
+    }
+}
+
+
+
+
+
+void deledt_(int_f *seq_, int_f *pos_, char *base_, int_fl base_l)
+/*
+** Delete a base at position pos in gel
+*/
+{
+    int_f seq;
+    int_f pos;
+    char base;
+
+    tagRecord t;
+    tag_id last,next;
+
+    if (devils_.idevt>0) {
+
+	seq = *seq_;
+	pos = *pos_;
+	base = base_[0];
+
+
+	/* skip over header */
+	last = next = (tag_id)seq;
+	readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	next = t.next;
+	if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	/* adjust lengths of all tags staring before pos */
+	while (next && t.position <= pos) {
+	    if (t.position + t.length - 1 >= pos &&
+		t.type.c[0] != '*') {
+		t.length--;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+            last = next;
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	}
+
+	/* insert a tag after last and before next */
+	insert_edit_tag(last,pos,base,TAG_TYPE_DELETE);
+
+	/* shift everything else right */
+	while (next) {
+	    t.position--;
+	    writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	}
+
+    }
+}
+
+
+
+
+
+
+
+
+
+
+/**********************************************************************/
+
+
+int modext(int gel, int shorten_by)
+/*
+** shorted an extension by 'shorten_by' characters
+** Returns:
+**    0 - modification successful
+**  !=0 - an error has occurred occur
+*/
+{
+
+    tagRecord rec;
+    tag_id next;
+    int ok;
+    char *c;
+    tag_id hit;
+    comment_id cut_off;
+
+    (void) read_tag((tag_id) gel,&rec);
+    /* store cutoff comment id */
+    cut_off = rec.comment;
+
+    next = rec.next;
+
+    ok = 0;
+    while ( next && ! rec.position && ok == 0) {
+	(void) read_tag( next , &rec );
+
+	if (strncmp(rec.type.c,TAG_TYPE_RCUT,4) == 0) {
+	    if (rec.comment) {
+		hit = next;
+		c = get_comment(rec.comment);
+		ok = 1;
+	    } else {
+		ok = 2;
+	    }
+
+	} else if (strncmp(rec.type.c,TAG_TYPE_IGNORE,3) == 0) ok = 2;
+
+        next = rec.next;
+
+    }
+
+    
+    if (ok == 1) {
+	/* check to see we should ignore this reading */
+	/* assumption - IGN tag occurs pos >= 1 */
+	ok = 0;
+	while ( next && ok == 0) {
+	    (void) read_tag( next , &rec );
+	    next = rec.next;
+	    ok = (strncmp(rec.type.c,TAG_TYPE_IGNORE,3) == 0);
+	}
+	if (!ok) {
+	    /*
+	     * we have found a cutoff and not found an IGN tag!
+	     * we can now shorten
+	     */
+	    if (ok = (strlen(c) >= shorten_by)) {
+		/* must have enough sequence */
+		comment_id oldcid, newcid;
+		(void) read_tag( hit , &rec );
+		oldcid = rec.comment;
+		newcid = put_comment(c+shorten_by);
+		rec.comment = newcid;
+		(void) write_tag( hit , rec );
+		delete_comment(oldcid);
+
+		/* adjust cutoff */
+		if (cut_off){
+		    commentStruct c;
+		    int dummy;
+		    read_comment(cut_off, &dummy, &c);
+		    /*
+		     * parse comment and adjust current sequence length
+		     */
+		    {
+			int llen,lcut,wlen;
+			char type[5];
+			char name[FILE_NAME_LENGTH+1];
+			sscanf(c.comment,"%6d%6d%6d%*s",&llen,&lcut,&wlen);
+			strncpy(type,&c.comment[18],4);
+			strncpy(name,&c.comment[22],FILE_NAME_LENGTH);
+			wlen+=shorten_by;
+			sprintf(c.comment,"%6d%6d%6d",llen,lcut,wlen);
+			strncat(c.comment,type,4);
+			strncat(c.comment,name,FILE_NAME_LENGTH);
+		    }
+		    write_comment(cut_off, dummy, c);
+		    
+		}
+	    }
+	    
+	}
+
+	TAG_FREE(c);
+    }
+
+    return (ok!=1);
+
+}
+
diff --git a/src/bap/tagUtils.h b/src/bap/tagUtils.h
new file mode 100644
index 0000000..f26153f
--- /dev/null
+++ b/src/bap/tagUtils.h
@@ -0,0 +1,146 @@
+#ifndef _tagUtils_h
+#define _tagUtils_h
+
+#include "fortran.h"
+#include "fort.h"
+/* #include "Sheet.h" */
+/*
+** The following describe two database files:
+**     The tag list file
+**     The comment list file
+**
+** the tag list file consists of at least IDBSIZ records.
+**
+**     1       : header tag for sequence 1
+**     2       : header tag for sequence 2
+**     ...
+**     IDBSIZ-1: header tag for sequence IDBSIZ-1:
+**     IDBSIZ  : descriptor record defining MAX_TAG
+**     IDBSIZ+1: supplimentary tags
+**     ...
+**     MAX_TAG : supplimentary tags
+**
+** the comment list file consists of at least 1 record.
+**
+**     1       : descriptor record defining MAX_COM
+**     2       : supplimentary comment
+**     ...
+**     MAX_COM : supplimentary comment
+**     
+*/
+
+#define COMMENT_LENGTH 40
+/* COMMENT_LENGTH is hard coded in dbsysnew.f */
+/* grep for COMMENT_LENGTH                    */
+#define FILE_NAME_LENGTH 18
+/* FILE_NAME_LENGTH is hard coded in dbsysnew.f */
+/* grep for FILE_NAME_LENGTH                    */
+
+typedef int_f tag_id;
+typedef int_f comment_id;
+
+/*
+** element in tag list
+*/
+typedef union {
+    char c[4];
+    int_f i;
+    } tag_types;
+
+typedef struct _tagRecord{
+    int_f position;        /* position in sequence */
+    int_f length;          /* length of tag */
+    tag_types type;
+    comment_id comment;  /* index to comment */
+    tag_id next;         /* link to next in structure */
+} tagRecord;
+
+/*
+** comment chain
+*/
+typedef struct _commentStruct{
+    char comment[COMMENT_LENGTH];
+    comment_id next;
+} commentStruct;
+
+typedef struct _tagstruct{
+    /*
+    ** Data from the original file
+    */
+    tagRecord tagrec;
+    /*
+    ** Data for database management
+    */
+    tag_id original_tag_id;
+    char *newcomment;
+    int newcommentlen;
+    long flags;
+    int time;
+    struct _tagstruct *next;
+} tagStruct,*tagptr;
+
+#define TAG_UNCHANGED         (0)
+#define TAG_POSITION_CHANGED  (1L<<1)
+#define TAG_LENGTH_CHANGED    (1L<<2)
+#define TAG_TYPE_CHANGED      (1L<<3)
+#define TAG_COMMENT_CHANGED   (1L<<4)
+#define TAG_INSERTED          (1L<<5)
+#define TAG_DELETED           (1L<<6)
+#define TAG_COMMENT_IN_MEMORY (1L<<7)
+
+#define TAG_MALLOC(s) (char *)malloc(s)
+#define TAG_FREE(c)   free(c)
+
+#define TAG_TYPE_INSERT "*INS"
+#define TAG_TYPE_DELETE "*DEL"
+#define TAG_TYPE_LCUT   "*LC*"
+#define TAG_TYPE_RCUT   "*RC*"
+#define TAG_TYPE_RAWDATA "*RAW"
+
+#include "edUtils.h"
+
+/* define external routines */
+extern void force_comment(tagStruct *t);
+
+/* comment interface */
+extern tagStruct *readTagList(int seq);
+extern void writeTagList(EdStruct *xx, int seq);
+extern void destroyTagList(tagStruct *s);
+
+extern void createTag(EdStruct *xx);
+extern void editTag(EdStruct *xx);
+extern void deleteTag(EdStruct *xx);
+/* extern void getTagSplodge(EdStruct *xx, int seq, int pos, int width, XawSheetInk *ink); */
+
+extern void insertTag(EdStruct *xx, int seq, tagStruct *t);
+extern tagStruct *newTag();
+/* this gives me lots of problems...why?
+extern char normaliseBase(EdStruct *xx,int seq,char deletedBase);
+*/
+
+extern void tagInsertBase(EdStruct *xx,int seq,int pos,char *base);
+extern void tagDeleteBase(EdStruct *xx,int seq,int pos,char *base);
+extern void tagReplaceBase(EdStruct *xx,int seq,int pos,char *dbse,char *ibse);
+extern tagStruct *findTag(EdStruct *xx,int seq,int pos);
+extern int origpos(EdStruct *xx, int seq, int pos);
+extern void getLeftCutOff(EdStruct *xx,int seq, int width, char *str);
+extern void getRightCutOff(EdStruct *xx,int seq, int width, char *str);
+extern char *get_comment(comment_id cp);
+extern int lenLCut(EdStruct *xx, int seq);
+extern int lenRCut(EdStruct *xx, int seq);
+extern void getLCut(EdStruct *xx,int seq, int pos, int width, char *str);
+extern void getRCut(EdStruct *xx,int seq, int pos, int width, char *str);
+extern tagStruct *findTagByType(EdStruct *xx, int seq, char *type);
+extern char normaliseBase(EdStruct *xx,int seq,char deletedBase);
+extern void dump_tags(EdStruct *xx, int seq);
+extern void setButtonName(Widget w, char *c);
+extern void createTagTypeMenu(Widget parent, void (*call_back)() );
+extern void delete_comment (comment_id cp);
+extern comment_id put_comment(char *c);
+extern int read_tag(tag_id n, tagRecord *t);
+extern int invokeTagEditor(char *type_id, char *newType, char *tagComment, char **newComment);
+extern int write_tag(tag_id n, tagRecord t);
+extern tag_id get_free_tag();
+extern void delete_tag_rec(tag_id t);
+
+#endif /* _tagUtils_h */
diff --git a/src/bap/uascii.gbl b/src/bap/uascii.gbl
new file mode 100644
index 0000000..f765d9b
--- /dev/null
+++ b/src/bap/uascii.gbl
@@ -0,0 +1,46 @@
+/*	ascii.gbl	ascii translation to amino acids */
+/*	modified 10-Mar-1987 for B, Z	*/
+
+#define NA 124
+#define EL 125
+#define ES 126
+
+#ifndef XTERNAL
+/*      0  1  2  3  4  5  6  7  8  9  10 11 12 13 14 15	*/
+int aascii[]={
+	EL,NA,NA,NA,NA,NA,NA,NA,NA,NA,EL,NA,NA,EL,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,ES,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	ES, 0,20, 4, 3, 6,13, 7, 8, 9,NA,11,10,12, 2,NA,
+	14, 5, 1,15,16,NA,19,17,22,18,21,NA,NA,NA,NA,NA,
+	ES, 0,20, 4, 3, 6,13, 7, 8, 9,NA,11,10,12, 2,NA,
+	14, 5, 1,15,16,NA,19,17,22,18,21,NA,NA,NA,NA,NA};
+
+int *sascii;
+#define AAMASK 127
+
+int nascii[]={
+/*	 0  1  2  3  5  6  7  8  9 10 11 12 13 14 15 15
+	 @  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O
+	 P  Q  R  S  T  U  V  W  X  Y  Z		*/
+	EL,NA,NA,NA,NA,NA,NA,NA,NA,NA,EL,NA,NA,EL,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,16,NA,NA,15,NA,NA,
+	ES,ES,ES,NA,NA,NA,NA,NA,NA,NA,NA,ES,NA,NA,ES,NA,
+	ES, 0,14, 1,11,NA,NA, 2,12,NA,NA,10,NA, 7,15,NA,
+	 5, 6, 5, 9, 3, 4,13, 8,16, 6,NA,NA,NA,NA,NA,NA,
+	ES, 0,14, 1,11,NA,NA, 2,12,NA,NA,10,NA, 7,15,NA,
+	 5, 6, 5, 9, 3, 4,13, 8,16, 6,NA,NA,NA,NA,NA,NA};
+
+#else
+extern int aascii[];
+extern int nascii[];
+extern int *sascii;
+
+#define AAMASK 127
+
+extern int lascii[];
+
+#define LAMASK 15
+#endif
diff --git a/src/bap/upam.gbl b/src/bap/upam.gbl
new file mode 100644
index 0000000..fd5f1a2
--- /dev/null
+++ b/src/bap/upam.gbl
@@ -0,0 +1,127 @@
+/*	20-June-1986	universal pam file */
+/* Changed by Rodger Staden and Simon Dear
+** to suit local preferences 9-April-1992
+*/
+
+#ifndef XTERNAL
+
+int gdelval= -1;
+int ggapval= -4;
+
+extern int gdelval, ggapval;
+
+
+#define EOSEQ 127
+#define MAXSQ 32
+
+
+char sqnam[]={"aa"};
+char sqtype[]={"protein"};
+
+char *sq;
+char aa[MAXSQ] = {"ARNDCQEGHILKMFPSTWYVBZX"};
+
+int naa = 23;
+int nsq;
+
+int haa[MAXSQ] = {0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,2,6,0};
+int *hsq;
+
+int apam[450] = {
+ 2,
+-2, 6,
+ 0, 0, 2,
+ 0,-1, 2, 4,
+-2,-4,-4,-5,12,
+ 0, 1, 1, 2,-5, 4,
+ 0,-1, 1, 3,-5, 2, 4,
+ 1,-3, 0, 1,-3,-1, 0, 5,
+-1, 2, 2, 1,-3, 3, 1,-2, 6,
+-1,-2,-2,-2,-2,-2,-2,-3,-2, 5,
+-2,-3,-3,-4,-6,-2,-3,-4,-2, 2, 6,
+-1, 3, 1, 0,-5, 1, 0,-2, 0,-2,-3, 5,
+-1, 0,-2,-3,-5,-1,-2,-3,-2, 2, 4, 0, 6,
+-4,-4,-4,-6,-4,-5,-5,-5,-2, 1, 2,-5, 0, 9,
+ 1, 0,-1,-1,-3, 0,-1,-1, 0,-2,-3,-1,-2,-5, 6,
+ 1, 0, 1, 0, 0,-1, 0, 1,-1,-1,-3, 0,-2,-3, 1, 2,
+ 1,-1, 0, 0,-2,-1, 0, 0,-1, 0,-2, 0,-1,-3, 0, 1, 3,
+-6, 2,-4,-7,-8,-5,-7,-7,-3,-5,-2,-3,-4, 0,-6,-2,-5,17,
+-3,-4,-2,-4, 0,-4,-4,-5, 0,-1,-1,-4,-2, 7,-5,-3,-3, 0,10,
+ 0,-2,-2,-2,-2,-2,-2,-1,-2, 4, 2,-2, 2,-1,-1,-1, 0,-6,-2, 4,
+ 0,-1, 2, 3,-4, 1, 2, 0, 1,-2,-3, 1,-2,-5,-1, 0, 0,-5,-3,-2, 2,
+ 0, 0, 1, 3,-5, 3, 3,-1, 2,-2,-3, 0,-2,-5, 0, 0,-1,-6,-4,-2, 2, 3,
+ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0};
+
+/*	DNA alphabet
+
+	A, C, G, T U
+	R, Y
+	M (A or C)	6
+	W (A or T)	7
+	S (C or G)	8
+	K (G or T)	9
+	D (not C)	10
+	H (not G)	11
+	V (not T)	12
+	B (not A)	13
+	N X 		14
+*/
+
+char nt[MAXSQ]={"ACGTURYMWSKDHVB-*"};
+
+int nnt = 17;
+
+int hnt[MAXSQ] = {0,1,2,3,3,0,1,0,0,1,2,0,0,0,1,0,0};
+
+int npam[450] = {
+/*       A  C  G  T  U  R  Y  M  W  S  K  D  H  V  B  -  *  */
+	 4,						/* A */
+	-3, 4,						/* C */
+	-3,-3, 4,					/* G */
+	-3,-3,-3, 4,					/* T */
+	-3,-3,-3, 4, 4,					/* U */
+	 2,-1, 2,-1,-1, 2,				/* R (A G)*/
+	-1, 2,-1, 2, 2,-2, 2,				/* Y (C T)*/
+	 2, 2,-1,-1,-1, 0, 0, 2,			/* M (A C)*/
+	 2,-1,-1, 2, 2, 1, 1, 1, 2,			/* W (A T)*/
+	-1, 2, 2,-1,-1, 1, 1, 1,-1, 2,			/* S (C G)*/
+	-1,-1, 2, 2, 2, 1, 1,-1, 1, 1, 2,		/* K (G T)*/
+	 1,-2, 1, 1, 1, 1, 0, 0, 1, 0, 1, 1,		/* D (!C) */
+	 1, 1,-2, 1, 1, 0, 1, 1, 1, 0, 0, 0, 1,		/* H (!G) */
+	 1, 1, 1,-2,-1, 1, 0, 1, 0, 1, 0, 0, 0, 1,	/* V (!T) */
+	-2, 1, 1, 1, 1, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1,	/* B (!A) */
+	 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 3, /* - */
+	 3, 3, 3, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}; /* * */
+/*       A  C  G  T  U  R  Y  M  W  S  K  D  H  V  B  N  */
+
+int *pam;
+int pam2[MAXSQ][MAXSQ];
+int pamh1[MAXSQ];		/* used for kfact replacement */
+
+#else
+#define MAXSQ 32
+
+extern char sqnam[];
+extern char sqtype[];
+extern int gdelval, ggapval;
+extern	char aa[];
+extern	char nt[];
+extern 	char *sq;
+
+#define EOSEQ 127
+
+extern  int naa;
+extern  int nnt;
+extern	int nsq;
+
+extern  int hnt[];
+extern  int haa[];
+extern	int *hsq;
+
+extern  int apam[];
+extern	int npam[];
+extern	int *pam;
+
+extern  int pam2[MAXSQ][MAXSQ];
+extern	int pamh1[];
+#endif
diff --git a/src/bap/xbapSpec.c b/src/bap/xbapSpec.c
new file mode 100644
index 0000000..93e281d
--- /dev/null
+++ b/src/bap/xbapSpec.c
@@ -0,0 +1,397 @@
+/*
+    Title:       xdapSpec
+
+    File: 	 xdapSpec.c
+    Purpose:	 Definitions specific to the `xdap' program
+    Last update: 27 Aug 1992
+
+    Change log:
+
+       27-8-92 sd added argument kopt to dbmenu, dbment
+       25-8-92 rs added disassemble routine
+        4-6-92 rs Added auto edit menu function and moved find internal joins
+        3/11/90 SD  Added extra menu option
+	22 August 1991 SD Added new menu option (Find internal joins)
+	    and associated help structures
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xbap' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+#include "userface.h" /* IMPORT: menu_x */
+#include "fort.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+
+static Widget menubarWid = (Widget)NULL; /* Initialised by CreateProgMenus */
+
+    
+static MenuData general_menu_data[] =
+{   {"Open a database",	                   3},
+    {"Display a contig",		   5},
+    {"List a text file",		   6},
+    {"Redirect output",		           7}, 
+    {"Calculate a consensus",	           8}, 
+    {"Screen against restriction enzymes", 17}, 
+    {"Screen against vector",	           18}, 
+    {"Check database",		           19}, 
+    {"Copy database",		           24}, 
+    {"Show relationships",		   25}, 
+    {"Set parameters",		           27}, 
+    {"Highlight disagreements",	           28}, 
+    {"Examine quality",		           29}, 
+};
+
+
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",     10},
+    {"Clear text",	   11},
+    {"Draw ruler",	   12},
+    {"Use cross hair",	   13},
+    {"Change margins",	   14},
+    {"Label diagram",	   15},
+    {"Plot map",	   16},
+    {"Plot single contig", 33},
+    {"Plot all contigs",   34},
+};
+
+
+static MenuData modification_menu_data[] =
+{  {"Edit contig",           4},
+   {"Auto assemble",	     20},
+   {"Join contigs",	     22},
+   {"Complement a contig",   23},
+   {"Find internal joins",   35}, 
+   {"Alter relationships",   26},
+   {"Shuffle pads",          30}, 
+   {"Disassemble readings",  31},
+   {"Extract gel readings",  32},
+};
+
+
+static MenuData alter_menu_data[] =
+{   {"Cancel",             2},
+    {"Line change",        3},
+    {"Check logical consistency",   4},
+    {"Delete contig",      5},
+    {"Shift",              6},
+    {"Move gel reading",   7},
+    {"Rename gel reading", 8},
+    {"Break a contig",     9},
+    {"Remove a gel reading",     10},
+    {"Alter raw data parameters", 11},
+};
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button.
+	The button is initially unmanaged within its parent.
+    */
+    buttonWid = XtCreateWidget(menuButtonName, menuButtonWidgetClass,
+			       parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    They are initially all unmanaged.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+    menubarWid = parentWid;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Modification",
+	       modification_menu_data, XtNumber(modification_menu_data));
+    CreateMenu(parentWid, "Alter",
+	       alter_menu_data, XtNumber(alter_menu_data));
+}
+
+
+
+
+/*
+    Special menu functions for SAP.
+
+    The generic menu functions, found in 'dialogues' are only
+    used indirectly by SAP.
+
+      SUBROUTINE DBMENT(MENU,NOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+
+      SUBROUTINE DBMENU(MENU,NOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+
+    If this code were in 'dialogues', then we wouldn't need to
+    keep the reference to 'menubarWid' which is making a big
+    assumption anyway.
+*/
+
+#define dbment_x dbment_
+#define dbmenu_x dbmenu_
+
+
+void dbment_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *KOPT_p,
+	      int_f *MAXOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f MOPT, MINMEM; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the 'top level' menus.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    XtManageChild(XtNameToWidget(menubarWid, "General"));
+    XtManageChild(XtNameToWidget(menubarWid, "Screen"));
+    XtManageChild(XtNameToWidget(menubarWid, "Modification"));
+
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	    KOPT_p,
+	   &MOPT,
+	    MAXOPT_p,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+}
+
+
+void dbmenu_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *KOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f MOPT, MINMEM, MAXOPT; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the menu specified by 'MENU'.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    switch (*MENU_p)
+    {   case 4: XtManageChild(XtNameToWidget(menubarWid, "Alter"));
+                break;
+    }
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	    KOPT_p,
+	   &MOPT,
+	   &MAXOPT,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+
+}
+	      
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 35;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "DAPHELP";
+const char helpPtrsFN[] = "DAPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "SAP",
+    "Help",
+    "Quit",
+    "Open a database",
+    "Edit contig",
+    "Display a contig",
+    "List a text file",
+    "Direct output to disk",
+    "Calculate a consensus",
+    "Screen edit",
+    "Clear graphics",
+    "Clear text",
+    "Draw ruler",
+    "Use cross hair",
+    "Change margins",
+    "Plot map",
+    "Label diagram",
+    "Screen against restriction enzymes",
+    "Screen against vector",
+    "Check consistency",
+    "Auto assemble",
+    "Enter new gel reading",
+    "Join contigs",
+    "Complement a contig",
+    "Copy database",
+    "Show relationships",
+    "Alter relationships",
+    "set parameters",
+    "Highlight disagreements",
+    "Examine quality",
+    "Shuffle pads",
+    "Disassemble readings",
+    "Extract gel readings",
+    "Plot single contig",
+    "Plot all contigs",
+    "Find internal joins",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
+
+
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/bap/xbapmain.c b/src/bap/xbapmain.c
new file mode 100644
index 0000000..d1116c3
--- /dev/null
+++ b/src/bap/xbapmain.c
@@ -0,0 +1,542 @@
+/*
+    Title: 	 main
+
+    File: 	 main.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system. It provides the main event loop,
+    as far as X is concerned.
+
+    Three top-level shells are created: control, dialogue and help.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>    /* IMPORT: freopen, stdout, stderr, fprintf */
+#include "mystdlib.h" /* IMPORT: exit */
+
+#include "main.h"
+#include "dialogues.h"     /* IMPORT: CreateDialogueShell */
+#include "help.h"          /* IMPORT: CreateHelpShell */
+#include "plX.h"           /* IMPORT: CreateGraphicsOutput */
+#include "textOutput.h"    /* IMPORT: CreateTextOutput
+                                      UpdateTextOutput */
+#include "contigEditor.h"  /* IMPORT: CreateEditorShell */
+#include "locks.h"
+#include "mcspec.h"
+#include "actf.h"	   /* IMPORT: actfstack */
+#include "postscript.h"    /* IMPORT: closep_x() */
+#include "fort.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Types ---- */
+typedef enum {Alive, Dying, Dead} LifeForce;
+
+/* ---- Static variables ---- */
+
+static unsigned int global_argc;
+static char **global_argv;
+static XtAppContext app_con;
+static Display *display;
+static Widget toplevelWid;
+static Widget dialogueShellWid, helpShellWid, outputShellWid,graphicsShellWid;
+static LifeForce lifeForce = Alive;
+static int exitcode;
+/*
+static XrmOptionDescRec options[] =
+{
+};
+*/
+/* The following are used if Xdap cannot be found */
+static String fallback_resources[] = {
+    /* Dialogue shell */
+    /* -------------- */
+    "Xdap.Dialogue.allowShellResize: False",
+    "Xdap.Dialogue.width:	 550",
+    "Xdap.Dialogue.minWidth:  550",
+    "Xdap.Dialogue.maxWidth:  550",
+    "Xdap.Dialogue.height:    180",
+    "Xdap.Dialogue.minHeight: 180",
+    "Xdap.Dialogue.maxHeight: 180",
+    "Xdap.Dialogue.dialogueForm*translations: #override \\n\
+         <Key>Return: CRAction()",
+    /* The default chaining for all children is Top,Top,Left,Left */
+    "Xdap.Dialogue.dialogueForm*top:    chainTop",
+    "Xdap.Dialogue.dialogueForm*bottom: chainTop",
+    "Xdap.Dialogue.dialogueForm*left:   chainLeft",
+    "Xdap.Dialogue.dialogueForm*right:  chainLeft",
+    /* currFile */
+    "Xdap.Dialogue.dialogueForm.currFile.label: Current file:",
+    "Xdap.Dialogue.dialogueForm.currFile.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.currFile.resizable: True",
+    /* currFunc */
+    "Xdap.Dialogue.dialogueForm.currFunc.label: Current function:",
+    "Xdap.Dialogue.dialogueForm.currFunc.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.currFunc.resizable: True",
+    /* quit */
+    "Xdap.Dialogue.dialogueForm.quit.label: Quit application",
+    "Xdap.Dialogue.dialogueForm.quit.foreground:  red",
+    "Xdap.Dialogue.dialogueForm.quit.borderColor: red",
+
+    /* null */
+    "Xdap.Dialogue.dialogueForm.null.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.null.label: Working ... ",
+
+    /* nChoice */
+    "Xdap.Dialogue.dialogueForm.nChoice.resizable: True",
+    /* nChoice buttons */
+    "Xdap.Dialogue.dialogueForm.nChoice.buttons.resizable: True",
+    /* nChoice OK */
+    "Xdap.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3",
+
+    /* getopt */
+    "Xdap.Dialogue.dialogueForm.getopt.width: 540",
+    /* getopt execlabel */
+    "Xdap.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:",
+    "Xdap.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0",
+    /* getopt execcommand */
+    "Xdap.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue",
+    "Xdap.Dialogue.dialogueForm.getopt.execcommand.resizable: True",
+    /* getopt menubar */
+    "Xdap.Dialogue.dialogueForm.getopt.menubar.width:	530",
+    "Xdap.Dialogue.dialogueForm.getopt.menubar.height:	25",
+
+    /* getstring */
+    "Xdap.Dialogue.dialogueForm.getstring.label:",
+    "Xdap.Dialogue.dialogueForm.getstring.value:",
+    "Xdap.Dialogue.dialogueForm.getstring.resizable: True",
+    /* getstring label */
+    "Xdap.Dialogue.dialogueForm.getstring.label.resizable: True",
+    /* getstring value */
+    "Xdap.Dialogue.dialogueForm.getstring.value.width: 350",
+    /* getstring OK */
+    "Xdap.Dialogue.dialogueForm.getstring.OK.borderWidth: 3",
+
+    /* yesno */
+    "Xdap.Dialogue.dialogueForm.yesno.resizable: True",
+    /* yesno label */
+    "Xdap.Dialogue.dialogueForm.yesno.lab.label:",
+    "Xdap.Dialogue.dialogueForm.yesno.lab.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.yesno.lab.resizable: True",
+    "Xdap.Dialogue.dialogueForm.yesno.YES.borderWidth: 3",
+
+    /* Help shell */
+    /* ---------- */
+    "Xdap.Help.width:     550",
+    "Xdap.Help.minWidth:  550",
+    "Xdap.Help.height:    400",
+    "Xdap.Help.minHeight: 150",
+    "Xdap.Help*top:    chainTop",
+    "Xdap.Help*bottom: chainTop",
+    "Xdap.Help*left:   chainLeft",
+    "Xdap.Help*right:  chainLeft",
+
+    /* currDisp */
+    "Xdap.Help.helpForm.currDisp.resizable: True",
+    "Xdap.Help.helpForm.currDisp.borderWidth: 0",
+    /* rmHelp */
+    "Xdap.Help.helpForm.rmHelp.label: Remove help",
+    /* button */
+    "Xdap.Help.helpForm.button.label: List of topics",
+    /* topicsViewport */
+    "Xdap.Help.helpForm.topicsViewport.allowVert: True",
+    "Xdap.Help.helpForm.topicsViewport.bottom: chainBottom",
+    "Xdap.Help.helpForm.topicsViewport.right:  chainRight",
+    "Xdap.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1",
+    /* text */
+    "Xdap.Help.helpForm.text.editType: edit",
+    "Xdap.Help.helpForm.text.scrollVertical: always",
+    "Xdap.Help.helpForm.text.bottom: chainBottom",
+    "Xdap.Help.helpForm.text.right:  chainRight",
+
+    /* Output shell */
+    /* ------------ */
+    "Xdap.Graphics.graph.width:  550",
+    "Xdap.Graphics.graph.height: 400",
+    "Xdap.Output.text.width:   550",
+    "Xdap.Output.text.height:  100",
+
+    /* Editor shell */
+    /* ------------ */
+    "Xdap.Editor.allowShellResize: True",
+    "Xdap.Editor.mainForm.resizable: True",
+/*
+    "Xdap.Editor*font:  5x8",
+*/
+    "Xdap.Editor*font:  *Fixed-*-20-200-*",
+    /* The default chaining for all children is Top,Top,Left,Left */
+    "Xdap.Editor.mainForm*top:    chainTop",
+    "Xdap.Editor.mainForm*bottom: chainTop",
+    "Xdap.Editor.mainForm*left:   chainLeft",
+    "Xdap.Editor.mainForm*right:  chainLeft",
+    "Xdap.Editor.mainForm.title.borderWidth: 0",
+    "Xdap.Editor.mainForm.title.label: Contig Editor (V1.1)",
+    "Xdap.Editor.mainForm.buttons.borderWidth: 0",
+    "Xdap.Editor.mainForm.buttons.orientation: horizontal",
+    "Xdap.Editor.mainForm.buttons.resizable:  True",
+    "Xdap.Editor.mainForm.buttons*vertDistance: 0",
+    "Xdap.Editor.mainForm.buttons.insert.label: Insert",
+    "Xdap.Editor.mainForm.buttons.replace.label: Replace",
+    "Xdap.Editor.mainForm.buttons.replace.state: True",
+    "Xdap.Editor.mainForm.buttons.superman.label: Super Edit",
+    "Xdap.Editor.mainForm.buttons.superman.state: False",
+    "Xdap.Editor.mainForm.buttons.reveal.label: Reveal Cutoffs",
+    "Xdap.Editor.mainForm.buttons.reveal.state: False",
+    "Xdap.Editor.mainForm.buttons.undo.label: Undo",
+    "Xdap.Editor.mainForm.buttons.save.label: Save",
+    "Xdap.Editor.mainForm.buttons.exit.label: Leave Editor",
+    "Xdap.Editor.mainForm.buttons.exit.foreground:  red",
+    "Xdap.Editor.mainForm.buttons.exit.borderColor: red",
+    "Xdap.Editor.mainForm.buttons.lock.label: Lock",
+    "Xdap.Editor.mainForm.buttons.join.label: Join",
+    "Xdap.Editor.mainForm.buttons.next.label: Next Problem",
+    "Xdap.Editor.mainForm.edForm.resizable: True",
+    "Xdap.Editor.mainForm.edForm.defaultDistance: 0",
+    "Xdap.Editor.mainForm.edForm.borderWidth: 0",
+    "Xdap.Editor.mainForm.edForm.sequences.resizable: True",
+    "Xdap.Editor.mainForm.edForm.names.resizable: True",
+    "Xdap.Editor.mainForm.edForm.slider.length:  500",
+    "Xdap.Editor.mainForm.edForm.slider.orientation:  horizontal",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.defaultDistance:  0",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.borderWidth:  0",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.left:  ChainLeft",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.right:  ChainLeft",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.resizable:  True",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.resize:  True",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.left:  Rubber",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.right:  Rubber",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.leftleft.label:  <<",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.left.label:  <",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.right.label:  >",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.rightright.label:  >>",
+    "Xdap.Editor*warnPrompt.label: Save changes? ",
+    "Xdap.Editor*warnBox.borderWidth: 0",
+    "Xdap.Editor*warnBox.orientation: vertical",
+    "Xdap.Editor*warnPrompt.borderWidth: 0",
+    "Xdap.Editor.mainForm.disagreeForm.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.dummy.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.disagreements.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.borderWidth: 0",
+    "Xdap.Editor.mainForm.disagreeForm.defaultDistance: 0",
+
+    /* Trace Manager */
+    /* ------------- */
+    "*Traces.mainForm.buttons.borderWidth: 0",
+    "*Traces.mainForm.buttons.orientation: horizontal",
+    "*Traces.mainForm.buttons*top:    ChainTop",
+    "*Traces.mainForm.buttons*bottom: ChainTop",
+    "*Traces.mainForm.buttons*left:   ChainLeft",
+    "*Traces.mainForm.buttons*right:  ChainLeft",
+
+    "*Traces.allowShellResize: True",
+    "*Traces.mainForm.traceForm.resizable: True",
+    "*Traces.mainForm.traceForm.form.resizable: True",
+    "*Traces.mainForm.defaultDistance: 0",
+    "*Traces.mainForm.traceForm.form.defaultDistance: 1",
+    "*Traces.mainForm.traceForm.borderWidth: 0",
+    "*Traces.mainForm.traceForm.form.borderWidth: 0",
+
+    /* The main display: a viewport containing a form containing graphs */
+    "*Traces.mainForm.traceForm.form.top:    Rubber",
+    "*Traces.mainForm.traceForm.form.bottom: Rubber",
+    "*Traces.mainForm.traceForm.form.left: ChainLeft",
+    "*Traces.mainForm.traceForm.form.right: ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.top:    Rubber",
+    "*Traces.mainForm.traceForm.form.viewport.bottom: Rubber",
+    "*Traces.mainForm.traceForm.form.viewport.left:   ChainLeft",
+    "*Traces.mainForm.traceForm.form.viewport.right:  ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.allowHoriz: True",
+    "*Traces.mainForm.traceForm.form.viewport.forcebars:  True",
+    "*Traces.mainForm.traceForm.form.viewport.width:      700",
+    "*Traces.mainForm.traceForm.form.viewport.height: 150",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.defaultDistance: 0",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.resizable: True",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.top:    ChainTop",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.bottom: ChainTop",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.left:   ChainLeft",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.right:  ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.trace.top:    ChainTop",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.trace.bottom: ChainBottom",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.font: -misc-*-bold-*15-*",
+    /* The text graph heights are dynamically set to accomodate the font */
+    /* The graphs are dynamically initially set to fill the viewport */
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.dimBackground: grey",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour1:  green3",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour2:  purple",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour3:  black",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour4:  red",
+
+
+    /* Gadgets */
+    "*Traces.mainForm.traceForm.form.gadgets.borderWidth:    0",
+    "*Traces.mainForm.traceForm.form.gadgets.traceName.borderWidth:    0",
+    "*Traces.mainForm.traceForm.form.gadgets*top:    ChainTop",
+    "*Traces.mainForm.traceForm.form.gadgets*bottom: ChainTop",
+    "*Traces.mainForm.traceForm.form.gadgets*left:   ChainLeft",
+    "*Traces.mainForm.traceForm.form.gadgets*right:  ChainLeft",
+    "*Traces.mainForm.traceForm.form.gadgets.scaleUp.label:  Scale up",
+    "*Traces.mainForm.traceForm.form.gadgets.scaleDown.label:  Scale down",
+
+
+    /* Tag Editor */
+    /* ---------- */
+    "*Tag.mainForm.  defaultDistance: 0",
+    "*Tag.mainForm.buttons.borderWidth: 0",
+    "*Tag.mainForm.buttons*resize: True",
+    "*Tag.mainForm.buttons.orientation: horizontal",
+    "*Tag.mainForm.buttons*top:    ChainTop",
+    "*Tag.mainForm.buttons*bottom: ChainTop",
+    "*Tag.mainForm.buttons*left:   ChainLeft",
+    "*Tag.mainForm.buttons*right:  ChainLeft",
+    "*Tag.mainForm.tagText.width:  300",
+    "*Tag.mainForm.tagText.height: 100",
+    "*Tag.mainForm.tagText.left:   ChainLeft",
+    "*Tag.mainForm.tagText.right:  ChainRight",
+    "*Tag.mainForm.tagText.top:    ChainTop",
+    "*Tag.mainForm.tagText.bottom: ChainBottom",
+    "*Tag.mainForm.tagText.resize: True",
+
+    NULL,
+};
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void destroyCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   lifeForce = Dead;
+}
+
+
+
+
+/* ---- Other functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    fprintf(stderr, "%s: unknown options:", argv[0]);
+    for (i=1; i <argc; i++)
+	fprintf(stderr, " %s", argv[i]);
+    fprintf(stderr, "\n\n");
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr, 
+"       [-geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]]\n");
+    fprintf(stderr,
+"       [-display [{host}]:[{vs}]]\n");
+    fprintf(stderr,
+"       [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]\n");
+}
+
+
+
+
+static void finalxPhase2(void)
+/*
+    Recover all X resources and exit with `status'.
+    This function does not return.
+*/
+{
+    int_f job = 2;
+
+    while (actfptr != 0) {
+	actf_(&job, 0, 0, 0, 0, 0, 0);
+    }
+
+    /* close any opened postscript file. */
+    closep_x();
+
+    XtDestroyApplicationContext(app_con);
+
+    exit(exitcode);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+int main(unsigned int argc, char **argv)
+{
+    activeLock = semaphoreCreate(65535/*a big number for max*/);
+
+    global_argc = argc;
+    global_argv = argv;
+
+    fmain_();
+
+    finalx(0);
+    finalxPhase2();
+
+    return 0;
+}
+
+
+
+
+void initx_(void)
+/*
+    Initialise the X system, creating all necessary fixtures and
+    fittings for the application.
+    `stdout' may be redirected.
+*/
+{
+
+
+    toplevelWid = XtAppInitialize(&app_con, "Xdap",
+				  NULL, (Cardinal) 0,
+				  (int *)&global_argc, global_argv,
+				  fallback_resources,
+				  NULL, (Cardinal) 0);
+
+    XtAddCallback(toplevelWid, XtNdestroyCallback, destroyCallback, NULL);
+    display = XtDisplay(toplevelWid);
+
+    if (global_argc != 1)
+    {   usage(global_argc, global_argv);
+        finalx(1);
+    }
+
+    /*
+        Create the dialogue shell. Must be done before the others.
+    */
+    dialogueShellWid = CreateDialogueShell(toplevelWid);
+
+    /*
+        Create the graphics shell.
+    */
+    graphicsShellWid = XtCreatePopupShell("Graphics", topLevelShellWidgetClass,
+					 toplevelWid,
+					 NULL, (Cardinal) 0);
+
+    (void) CreateGraphicsOutput(graphicsShellWid);
+
+    /*
+        Create the output shell.
+    */
+    outputShellWid = XtCreatePopupShell("Output", topLevelShellWidgetClass,
+					 toplevelWid,
+					 NULL, (Cardinal) 0);
+
+    (void) CreateTextOutput(outputShellWid);
+
+    /*
+        Create the help shell.
+    */
+    helpShellWid = CreateHelpShell(toplevelWid);
+
+    /*
+    **  Create the editor shell
+    */
+    (void) CreateEditorShell(toplevelWid);
+
+    /*
+    **  Create the trace manager shell
+    */
+    CreateTraceManager(toplevelWid);
+
+    /*
+        The output, graphics and dialogue shells are initially displayed.
+    */
+    XtPopup(dialogueShellWid, XtGrabNone);
+    XtPopup(graphicsShellWid,   XtGrabNone);
+    XtPopup(outputShellWid,   XtGrabNone);
+
+}
+
+
+
+
+void finalx(int status)
+/*
+    Recover all X resources and exit with `status'.
+
+    Death under X is a two phase process. XtDestroyWidget is called on
+    the toplevel, but this may only take effect when we return to the
+    main loop. When it does take effect, the toplevel destroyCallback
+    will be called and set ``lifeForce'' to dead, which indicates that
+    the second phase can be called. This then calls exit.
+*/
+{   lifeForce = Dying;
+    exitcode = status;
+
+    XtDestroyWidget(toplevelWid);
+
+    if (lifeForce == Dead) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchEventsUntil(Boolean (*stopPred)(void))
+/*
+    X events are cycled until the stopping predicate returns true or
+    we are in the process of dying. If this is the case, then the
+    second phase of finalisation is carried out and this function
+    does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && !stopPred())
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchCurrentEvents()
+/*
+    All X events in the queue are cycled. If we are in the process of
+    dying, then the second phase of finalisation is carried out and
+    this function does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && (XtAppPending(app_con) != 0))
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+
+Display *GetDisplay(void)
+/*
+    Return the X display running.
+    This function can only be called after initx_().
+*/
+{   return(display);
+}
diff --git a/src/bap/xdapSpec.c b/src/bap/xdapSpec.c
new file mode 100644
index 0000000..ffc16a6
--- /dev/null
+++ b/src/bap/xdapSpec.c
@@ -0,0 +1,431 @@
+/*
+    Title:       xdapSpec
+
+    File: 	 xdapSpec.c
+    Purpose:	 Definitions specific to the `xdap' program
+    Last update: Fri Dec 14 1990
+
+    Change log:
+
+        3/11/90 SD  Added extra menu option
+	22 August 1991 SD Added new menu option (Find internal joins)
+	    and associated help structures
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xdap' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+#include "dialogues.h" /* IMPORT: menu_x */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+
+static Widget menubarWid = (Widget)NULL; /* Initialised by CreateProgMenus */
+
+    
+static MenuData general_menu_data[] =
+{   {"Open a database",	                   3},
+    {"Display a contig",		   5},
+    {"List a text file",		   6},
+    {"Redirect output",		           7}, 
+    {"Calculate a consensus",	           8}, 
+    {"Screen against restriction enzymes", 17}, 
+    {"Screen against vector",	           18}, 
+    {"Check database",		           19}, 
+    {"Copy database",		           24}, 
+    {"Show relationships",		   25}, 
+    {"Set parameters",		           27}, 
+    {"Highlight disagreements",	           28}, 
+    {"Examine quality",		           29}, 
+    {"Find internal joins",	           35}, 
+};
+
+
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",     10},
+    {"Clear text",	   11},
+    {"Draw ruler",	   12},
+    {"Use cross hair",	   13},
+    {"Change margins",	   14},
+    {"Label diagram",	   15},
+    {"Plot map",	   16},
+    {"Plot single contig", 33},
+    {"Plot all contigs",   34},
+};
+
+
+static MenuData modification_menu_data[] =
+{  {"Edit contig",           4},
+   {"Auto assemble",	     20},
+   {"Join contigs",	     22},
+   {"Complement a contig",   23},
+   {"Alter relationships",   26},
+   {"Extract gel readings",  32},
+};
+
+
+static MenuData enter_menu_data[] =
+{   {"Cancel",               2},
+    {"Complete entry",       3},
+    {"Edit contig",          4},
+    {"Display",              5},
+    {"Edit new gel reading", 6},
+};
+
+
+static MenuData join_menu_data[] =
+{   {"Cancel",            2},
+    {"Complete join",     3},
+    {"Edit left contig",  4},
+    {"Display join",      5},
+    {"Edit right contig", 6},
+    {"Move join",         7},
+};
+
+
+static MenuData alter_menu_data[] =
+{   {"Cancel",             2},
+    {"Line change",        3},
+    {"Edit gel reading",   4},
+    {"Delete contig",      5},
+    {"Shift",              6},
+    {"Move gel reading",   7},
+    {"Rename gel reading", 8},
+    {"Break a contig",     9},
+    {"Remove a gel reading",     10},
+    {"Alter raw data parameters", 11},
+};
+
+
+static MenuData edit_menu_data[] =
+{   {"Cancel",  2},
+    {"Insert",  3},
+    {"Delete",  4},
+    {"Change",  5},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button.
+	The button is initially unmanaged within its parent.
+    */
+    buttonWid = XtCreateWidget(menuButtonName, menuButtonWidgetClass,
+			       parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    They are initially all unmanaged.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+    menubarWid = parentWid;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Modification",
+	       modification_menu_data, XtNumber(modification_menu_data));
+    CreateMenu(parentWid, "Enter",
+	       enter_menu_data, XtNumber(enter_menu_data));
+    CreateMenu(parentWid, "Join",
+	       join_menu_data, XtNumber(join_menu_data));
+    CreateMenu(parentWid, "Alter",
+	       alter_menu_data, XtNumber(alter_menu_data));
+    CreateMenu(parentWid, "Edit",
+	       edit_menu_data, XtNumber(edit_menu_data));
+}
+
+
+
+
+/*
+    Special menu functions for SAP.
+
+    The generic menu functions, found in 'dialogues' are only
+    used indirectly by SAP.
+
+      SUBROUTINE DBMENT(MENU,NOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+
+      SUBROUTINE DBMENU(MENU,NOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+
+    If this code were in 'dialogues', then we wouldn't need to
+    keep the reference to 'menubarWid' which is making a big
+    assumption anyway.
+*/
+
+#define dbment_x dbment_
+#define dbmenu_x dbmenu_
+
+
+void dbment_x(long *MENU_p,
+	      long *NOPT_p,
+	      long *MAXOPT_p,
+	      long *IHELPS_p,
+	      long *IHELPE_p,
+	      char *HELPF_p,
+	      long *IDEVH_p,
+	      long *KBIN_p,
+	      long *KBOUT_p,
+	      long  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    long KOPT, MOPT, MINMEM; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the 'top level' menus.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    XtManageChild(XtNameToWidget(menubarWid, "General"));
+    XtManageChild(XtNameToWidget(menubarWid, "Screen"));
+    XtManageChild(XtNameToWidget(menubarWid, "Modification"));
+
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	    MAXOPT_p,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+}
+
+
+void dbmenu_x(long *MENU_p,
+	      long *NOPT_p,
+	      long *IHELPS_p,
+	      long *IHELPE_p,
+	      char *HELPF_p,
+	      long *IDEVH_p,
+	      long *KBIN_p,
+	      long *KBOUT_p,
+	      long  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    long KOPT, MOPT, MINMEM, MAXOPT; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the menu specified by 'MENU'.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    switch (*MENU_p)
+    {   case 2: XtManageChild(XtNameToWidget(menubarWid, "Enter"));
+                break;
+
+        case 3: XtManageChild(XtNameToWidget(menubarWid, "Join"));
+                break;
+      
+        case 4: XtManageChild(XtNameToWidget(menubarWid, "Alter"));
+                break;
+      
+        case 5: XtManageChild(XtNameToWidget(menubarWid, "Edit"));
+                break;
+    }
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	   &MAXOPT,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+
+}
+	      
+
+
+
+
+const long botHelpOpt = 0;
+const long topHelpOpt = 35;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "DAPHELP";
+const char helpPtrsFN[] = "DAPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "SAP",
+    "Help",
+    "Quit",
+    "Open a database",
+    "Edit contig",
+    "Display a contig",
+    "List a text file",
+    "Direct output to disk",
+    "Calculate a consensus",
+    "Screen edit",
+    "Clear graphics",
+    "Clear text",
+    "Draw ruler",
+    "Use cross hair",
+    "Change margins",
+    "Plot map",
+    "Label diagram",
+    "Screen against restriction enzymes",
+    "Screen against vector",
+    "Check consistency",
+    "Auto assemble",
+    "Enter new gel reading",
+    "Join contigs",
+    "Complement a contig",
+    "Copy database",
+    "Show relationships",
+    "Alter relationships",
+    "set parameters",
+    "Highlight disagreements",
+    "Examine quality",
+    "Auto edit a contig",
+    "Type in gel readings",
+    "Extract gel readings",
+    "Plot single contig",
+    "Plot all contigs",
+    "Find internal joins",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/convert/README b/src/convert/README
new file mode 100644
index 0000000..adb8626
--- /dev/null
+++ b/src/convert/README
@@ -0,0 +1,77 @@
+	     Converting xdap databases to xbap databases
+			      (and more)
+				   
+			      Simon Dear
+			   18 November 1992
+
+
+0. Introduction
+
+The Staden package (version 1992.3 or greater) now includes the
+sequence assembly program xbap. Xbap cannot be used with databases
+created by the sequence assembly program xdap. People wanting to
+switch to using the new software will need to convert their old
+databases. The program `convert' is provided for this purpose.
+
+1. The conversion program
+
+`convert' (version 1.0) is an interactive program for converting
+sequence assembly data between various formats. With it, it is
+possible to convert data in any of the formats:
+
+	text file (created by convert)
+	xdap database
+	xbap database
+
+to any of the following formats:
+
+	text file (created by convert)
+	xbap database
+
+
+2. Converting databases
+
+The program takes no command line arguments. First, you are prompted
+for the format, name and version of the database you wish to convert.
+If the source is an xdap or xbap database, ensure that the name and
+version are in uppercase. If the source is a text file, the version is
+prompted for but it is ignored. Second, you are prompted for the
+format, name and version of the database you wish to create. Ensure
+that names and versions are in the appropriate case and that the files
+do already exist.
+
+After this, the programs converts the database. This may take some
+time. The next message to appear on the terminal is a message to
+indicate the conversion has successfully completed.
+
+
+3. Example
+
+Here is a log of a typical conversion session:
+
+    Covert Project Database
+    Version 1.0, 13 November 1991
+    Please enter database to convert:
+
+    Available types are:
+    0. Flat file - created with this program
+    1. xdap database
+    2. xbap database
+
+    Database type? 1
+    Database name? ZK643
+    Database version? 0
+
+    Please enter database to create:
+
+    Available types are:
+    0. Flat file - created with this program
+    1. xbap database
+
+    Database type? 1
+    Database name? ZK643
+    Database version? 1
+
+    Conversion completed
+
+
diff --git a/src/convert/bapDB.c b/src/convert/bapDB.c
new file mode 100644
index 0000000..db9408b
--- /dev/null
+++ b/src/convert/bapDB.c
@@ -0,0 +1,838 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "process.h"
+#include "bapIO.h"
+#include "list.h"
+#include "misc.h"
+
+/*
+** Bap IO
+*/
+static BapIO io;
+
+
+static int cur_gel_index;
+static int cur_contig_index;
+
+
+
+static List *file_details()
+{
+    return
+	build_list(
+		   atom_str(db_files),
+		   build_list(
+			      atom_str(db_files_arch),
+			      atom_str(io.ar_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_rel),
+			      atom_str(io.rl_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_seq),
+			      atom_str(io.sq_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_tag),
+			      atom_str(io.tg_file),
+			      nil),
+
+		   build_list(
+			      atom_str(db_files_com),
+			      atom_str(io.cc_file),
+			      nil),
+		   nil
+	       );
+}
+
+
+
+
+static List *db_details()
+{
+    
+    return
+	build_list(
+		   build_list(
+			      atom_str(db_max_db_size),
+			      atom_int(io.max_db_size),
+			      nil),
+		   build_list(
+			      atom_str(db_max_gels),
+			      atom_int(io.max_gels),
+			      nil),
+		   build_list(
+			      atom_str(db_max_gel_length),
+			      atom_int(io.max_gel_length),
+			      nil),
+		   build_list(
+			      atom_str(db_data_class),
+			      atom_int(io.data_class),
+			      nil),
+		   build_list(
+			      atom_str(db_num_gels),
+			      atom_int(io.num_gels),
+			      nil),
+		   build_list(
+			      atom_str(db_num_contigs),
+			      atom_int(io.num_contigs),
+			      nil),
+		   nil);
+
+}
+
+
+
+
+void xdap_late_open_for_read(List *l)
+/*
+**
+*/
+{
+    char *name;
+    char *version;
+
+    name = assoc(l,db_name);
+    if (! name)	crash("No database name specified\n");
+
+    version = assoc(l,db_version);
+    if (! version) crash("No version specified\n");
+
+
+    bap_open_for_read(&io,name,version);
+
+    cur_gel_index = 1;
+    cur_contig_index = 1;
+}
+
+
+List *xdap_late_read_header()
+/*
+**
+*/
+{
+    List *files;
+    List *details;
+    
+    files = file_details();
+    details = db_details();
+
+    return
+	join_list(
+		  build_list(
+			     atom_str(db_from),
+			     nil),
+		  details,
+		  build_list(
+			     files,
+			     nil),
+		  nil);
+
+
+}
+
+
+
+void xdap_late_close(List *l)
+/*
+** Close all relevant files
+*/
+{
+
+    bap_close_files(&io);
+
+}
+
+
+
+
+
+List *xdap_late_read_gel_data()
+{
+    List *gel_details;
+
+    if (cur_gel_index > io.num_gels)
+	gel_details = nil;
+    else {
+	bap_ar_file_rec ar_line;
+	bap_rl_file_rec rl_line;
+	bap_sq_file_rec sq_line;
+	bap_tg_file_rec tg_line;
+
+	int index;
+	char name[17];
+	char *seq;
+	int length;
+	int comp;
+	int pos;
+	int l_nbr;
+	int r_nbr;
+
+	sq_line = (char *) malloc(io.max_gel_length+1);
+
+	bap_read_ar(&io,cur_gel_index,&ar_line);
+	bap_read_rl(&io,cur_gel_index,&rl_line);
+	bap_read_tg(&io,cur_gel_index,&tg_line);
+	bap_read_sq(&io,cur_gel_index,sq_line);
+
+
+	index = cur_gel_index;
+	f2cstr(ar_line.lines.name,BAP_FILE_NAME_LENGTH,name,
+	       (size_t)BAP_FILE_NAME_LENGTH);
+	length = abs(rl_line.lines.length);
+	comp = (rl_line.lines.length < 0);
+	seq = sq_line; seq[length] = '\0';
+	pos = rl_line.lines.rel_pos;
+	l_nbr = rl_line.lines.left_nbr;
+	r_nbr = rl_line.lines.right_nbr;
+
+	gel_details =
+	    build_list(
+		       atom_str(gel_rec),
+		       build_list(
+				  atom_str(gel_index),
+				  atom_int(index),
+				  nil),
+		       build_list(
+				  atom_str(gel_name),
+				  atom_str(name),
+				  nil),
+		       build_list(
+				  atom_str(gel_length),
+				  atom_int(length),
+				  nil),
+		       build_list(
+				  atom_str(gel_comp),
+				  atom_int(comp),
+				  nil),
+		       build_list(
+				  atom_str(gel_seq),
+				  atom_str(seq),
+				  nil),
+		       build_list(
+				  atom_str(gel_pos),
+				  atom_int(pos),
+				  nil),
+		       build_list(
+				  atom_str(gel_l_nbr),
+				  atom_int(l_nbr),
+				  nil),
+		       build_list(
+				  atom_str(gel_r_nbr),
+				  atom_int(r_nbr),
+				  nil),
+		       nil);
+
+
+
+	/* Get raw data details */
+	if (tg_line.lines.comment) {
+	    List *raw_data_details;
+
+	    char *rd;
+	    int rd_length;
+	    int rd_cut;
+	    int rd_ulen;
+	    char rd_type[5];
+	    char rd_file[19];
+	    
+	    rd = bap_read_comment(&io, tg_line.lines.comment);
+	    sscanf(rd,"%6d%6d%6d%*s",&rd_length, &rd_cut, &rd_ulen);
+	    f2cstr(&rd[18],4,rd_type,(size_t)4);
+	    f2cstr(&rd[22],18,rd_file,(size_t)18);
+
+
+	    raw_data_details =
+		build_list(
+			   build_list(
+				      atom_str(gel_rd_length),
+				      atom_int(rd_length),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_cut),
+				      atom_int(rd_cut),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_ulen),
+				      atom_int(rd_ulen),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_type),
+				      atom_str(rd_type),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_file),
+				      atom_str(rd_file),
+				      nil),
+			   nil);
+
+	    join_list (gel_details, raw_data_details, nil);
+
+	}
+
+
+
+	/*
+	** Process tags, maintaining separate lists for
+	** (a) special tags
+	** (b) annotation
+	** (c) edits
+	*/
+	{
+	    List *specials;
+	    List *notes;
+	    List *edits;
+
+	    int_4 next;
+	    specials = nil;
+	    notes =
+		build_list(
+			   atom_str(gel_annotation),
+			   nil);
+	    edits =
+		build_list(
+			   atom_str(gel_edits),
+			   nil);
+
+	    while (tg_line.lines.next) {
+		next = tg_line.lines.next;
+		bap_read_tg(&io,next,&tg_line);
+
+		if (strncmp(tg_line.lines.type.c,"*LC*",4)==0) {
+		    if (tg_line.lines.comment) {
+			List *lc;
+			lc = build_list(
+					atom_str(gel_l_cut_seq),
+					atom_str(bap_read_comment(&io,tg_line.lines.comment)),
+					nil);
+			if (isNil(specials))
+			    specials = build_list(lc,nil);
+			else
+			    specials = join_list(specials,build_list(lc,nil),nil);
+		    }
+		} else if (strncmp(tg_line.lines.type.c,"*RC*",4)==0) {
+		    if (tg_line.lines.comment) {
+			List *rc;
+			rc = build_list(
+					atom_str(gel_r_cut_seq),
+					atom_str(bap_read_comment(&io,tg_line.lines.comment)),
+					nil);
+			if (isNil(specials))
+			    specials = build_list(rc,nil);
+			else
+			    specials = join_list(specials,build_list(rc,nil),nil);
+		    }
+		} else if (strncmp(tg_line.lines.type.c,"*",1)==0) {
+		    List *ed;
+		    char base[2];
+		    base[0] = tg_line.lines.type.c[3];
+		    base[1] = '\0';
+		    ed = build_list(
+				    build_list(
+					       atom_str(gel_ed_op),
+					       atom_str( (strncmp(tg_line.lines.type.c,"*IN",3))==0
+							? gel_ed_insert : gel_ed_delete),
+					       nil),
+				    build_list(
+					       atom_str(gel_ed_base),
+					       atom_str(base),
+					       nil),
+				    build_list(
+					       atom_str(gel_ed_base_pos),
+					       atom_int( tg_line.lines.position ),
+					       nil),
+				    nil);
+		    edits = join_list(edits, build_list(ed,nil),nil);
+		} else {
+		    List *an;
+		    char type[5];
+		    char *com;
+		    strncpy(type,tg_line.lines.type.c,4);
+		    type[4]='\0';
+		    com = bap_read_comment(&io,tg_line.lines.comment);
+		    an = build_list(
+				    build_list(
+					       atom_str(gel_an_pos),
+					       atom_int(tg_line.lines.position),
+					       nil),
+				    build_list(
+					       atom_str(gel_an_len),
+					       atom_int(tg_line.lines.length),
+					       nil),
+				    build_list(
+					       atom_str(gel_an_type),
+					       atom_str(type),
+					       nil),
+				    (com == NULL) ? nil :
+				    build_list(
+					       atom_str(gel_an_comment),
+					       atom_str(com),
+					       nil),
+				    nil);
+		    notes = join_list(notes,build_list(an,nil),nil);
+
+		}
+	    }
+	    if (isNil(specials))
+		gel_details = join_list(gel_details,build_list(edits,nil),build_list(notes,nil),nil);
+	    else
+		gel_details = join_list(gel_details,specials,build_list(edits,nil),build_list(notes,nil),nil);
+
+	}
+
+
+
+	cur_gel_index++;
+	free(sq_line);
+
+    }
+
+    return gel_details;
+
+}
+
+
+
+
+
+List *xdap_late_read_contig_data()
+{
+    List *contig_details;
+
+    if (cur_contig_index > io.num_contigs)
+	contig_details = nil;
+    else {
+	bap_rl_file_rec rl_line;
+	int length;
+	int index;
+	int left_end;
+	int right_end;
+
+	index = io.max_gels-cur_contig_index;
+	bap_read_rl(&io,index,&rl_line);
+	length = rl_line.clines.length;
+	left_end = rl_line.clines.left_end;
+	right_end = rl_line.clines.right_end;
+
+	contig_details =
+	    build_list(
+		       atom_str(contig_rec),
+		       build_list(
+				  atom_str(contig_index),
+				  atom_int(index),
+				  nil),
+		       build_list(
+				  atom_str(contig_length),
+				  atom_int(length),
+				  nil),
+		       build_list(
+				  atom_str(contig_left_end),
+				  atom_int(left_end),
+				  nil),
+		       build_list(
+				  atom_str(contig_right_end),
+				  atom_int(right_end),
+				  nil),
+		       nil);
+
+	cur_contig_index++;
+    }
+
+    return contig_details;
+
+}
+
+
+
+
+
+void xdap_late_open_for_write(List *l)
+/*
+**
+*/
+{
+    char *name;
+    char *version;
+
+
+    name = assoc(l,db_name);
+    if (! name)	crash("No database name specified\n");
+
+    version = assoc(l,db_version);
+    if (! version) crash("No version specified\n");
+
+
+    bap_open_for_write(&io,name,version);
+
+    cur_gel_index = 1;
+    cur_contig_index = 1;
+
+}
+
+
+
+void xdap_late_write_header(List *l)
+{
+    char *a;
+    bap_rl_file_rec rl_dbheader;
+    bap_rl_file_rec rl_header;
+    bap_ar_file_rec ar_line;
+    bap_sq_file_rec sq_line;
+    bap_tg_file_rec tg_line;
+    bap_cc_file_rec cc_line;
+
+    if ( (a = assoc(l,db_data_class)) == NULL)
+	crash("Sequence type (DNA or protein) not specified\n");
+    else
+	io.data_class = atoi(a);
+
+    if ( (a = assoc(l,db_num_gels)) == NULL)
+	crash("Number of gels not specified\n");
+    else
+	io.num_gels = atoi(a);
+
+    if ( (a = assoc(l,db_num_contigs)) == NULL)
+	crash("Number of contigs not specified\n");
+    else
+	io.num_contigs = atoi(a);
+
+    if ( (a = assoc(l,db_max_gel_length)) == NULL)
+	crash("Maximum length of a gel reading not specified\n");
+    else
+	io.max_gel_length = atoi(a);
+
+    if ( (a = assoc(l,db_max_gels)) == NULL)
+	crash("Maximum number of gels not specified\n");
+    else
+	io.max_gels = atoi(a);
+
+    if ( (a = assoc(l,db_max_db_size)) == NULL ) {
+	io.max_db_size = 1000;
+    } else {
+	io.max_db_size = atoi(a);
+    }
+
+    rl_dbheader.dbheader.maxdb = io.max_db_size;
+    rl_dbheader.dbheader.idbsiz = io.max_gels;
+    rl_dbheader.dbheader.maxgel = io.max_gel_length;
+    rl_dbheader.dbheader.idm = io.data_class;
+    bap_write_rl(&io,bap_rl_dbheader_rec(&io),&rl_dbheader);
+
+
+    rl_header.header.num_gels = io.num_gels;
+    rl_header.header.num_contigs = io.num_contigs;
+    bap_write_rl(&io,bap_rl_header_rec(&io),&rl_header);
+
+    bap_write_ar(&io,1,&ar_line);
+
+    sq_line = (bap_sq_file_rec) malloc(io.max_gel_length);
+    bap_write_sq(&io,1,sq_line);
+    free(sq_line);
+
+
+    tg_line.header.free_list = 0;
+    tg_line.header.count = bap_tg_header_rec(&io);
+    bap_write_tg(&io,bap_tg_header_rec(&io),&tg_line);
+
+    cc_line.header.free_list = 0;
+    cc_line.header.count = bap_cc_header_rec(&io);
+    bap_write_cc(&io,bap_cc_header_rec(&io),&cc_line);
+}
+
+
+
+void xdap_late_write_gel_data(List *l)
+{
+
+    char *a;
+    int i;
+    bap_rl_file_rec rl_line;
+    bap_ar_file_rec ar_line;
+    bap_sq_file_rec sq_line;
+    bap_tg_file_rec tg_line;
+    List *edits;
+    List *notes;
+
+    sq_line = (bap_sq_file_rec) malloc(io.max_gel_length);
+
+    /*
+    ** Relationship line
+    */
+    if ( (a = assoc(l,gel_l_nbr)) == NULL)
+	crash("No left neighbour for gel %d\n",cur_gel_index);
+    else
+	rl_line.lines.left_nbr = atoi(a);
+
+    if ( (a = assoc(l,gel_r_nbr)) == NULL)
+	crash("No right neighbour for gel %d\n",cur_gel_index);
+    else
+	rl_line.lines.right_nbr = atoi(a);
+
+    if ( (a = assoc(l,gel_length)) == NULL)
+	crash("Length of gel reading not specified for gel %d\n", cur_gel_index);
+    else
+	rl_line.lines.length = atoi(a);
+
+    if ( (a = assoc(l,gel_comp)) == NULL)
+	crash("Not known if gel %d complemented\n", cur_gel_index);
+    else {
+	i = atoi(a);
+	if (i) rl_line.lines.length = -rl_line.lines.length;
+    }
+
+    if ( (a = assoc(l,gel_pos)) == NULL)
+	crash("No position in contig specified for gel %d\n", cur_gel_index);
+    else
+	rl_line.lines.rel_pos = atoi(a);
+
+    bap_write_rl(&io,cur_gel_index,&rl_line);
+
+
+
+    /*
+    ** Archive line
+    */
+    if ( (a = assoc(l,gel_name)) == NULL)
+	crash("No gel name specified for gel %d\n", cur_gel_index);
+    else
+	c2fstr(a,BAP_FILE_NAME_LENGTH,ar_line.lines.name,
+	       (size_t)BAP_FILE_NAME_LENGTH);
+	    
+    bap_write_ar(&io,cur_gel_index,&ar_line);
+
+
+
+    /*
+    ** Sequence
+    */
+    if ( (a = assoc(l,gel_seq)) == NULL)
+	crash("No sequence for gel %d\n", cur_gel_index);
+    else
+	c2fstr(a,strlen(a), sq_line, (size_t)io.max_gel_length);
+
+    bap_write_sq(&io,cur_gel_index,sq_line);
+
+
+    /*
+    ** Initialise tag fields
+    */
+    tg_line.lines.next = 0;
+    tg_line.lines.length = 0;
+    tg_line.lines.comment = 0;
+    tg_line.lines.position = 0;
+    bap_write_tg(&io, cur_gel_index, &tg_line);
+
+    /*
+    ** Raw data
+    */
+    if ( (a = assoc(l,gel_rd_length)) != NULL) {
+	int length;
+	int cut;
+	int ulen;
+	char type[4];
+	char file[18];
+	char s[41];
+
+	length = atoi(a);
+
+	if ( (a = assoc(l,gel_rd_cut)) == NULL)
+	    crash ("No raw data left cutoff specified for gel %d\n",cur_gel_index);
+	else
+	    cut = atoi(a);
+
+	if ( (a = assoc(l,gel_rd_ulen)) == NULL)
+	    crash ("No raw data length specified for gel %d\n",cur_gel_index);
+	else
+	    ulen = atoi(a);
+
+	if ( (a = assoc(l,gel_rd_type)) == NULL)
+	    crash ("No raw data file type specified for gel %d\n",cur_gel_index);
+	else
+	    c2fstr(a,strlen(a),type,sizeof(type));
+
+	if ( (a = assoc(l,gel_rd_file)) == NULL)
+	    crash ("No raw data file specified for gel %d\n",cur_gel_index);
+	else
+	    c2fstr(a,strlen(a),file,sizeof(file));
+
+	sprintf(s,"%6d%6d%6d%4.4s%18.18s",length,cut,ulen,type,file);
+
+	tg_line.lines.comment = bap_write_comment(&io,s);
+	bap_write_tg(&io, cur_gel_index, &tg_line);
+    }
+
+
+
+
+
+
+    /*
+    ** Edits
+    */
+    edits = index_list_by_str(l,gel_edits);
+    if ( ! isNil(edits)) {
+
+	bap_tg_file_rec tg_last_ed;
+	long last_ed_index;
+	bap_tg_file_rec tg_ed;
+	long ed_index;
+
+	tg_last_ed = tg_ed = tg_line;
+	last_ed_index = ed_index = cur_gel_index;
+
+	for( edits = cdr(edits); ! isNil(edits) ; edits = cdr(edits)) {
+	    List *e;
+	    int pos;
+	    char type[4];
+
+	    e = car(edits);
+
+	    if ( (a = assoc(e,gel_ed_op)) == NULL)
+		crash ("No edit operation specified, gel %d\n",cur_gel_index);
+	    else {
+		if (strcmp(a,gel_ed_delete)==0)
+		    strcpy(type,"*DE");
+		else
+		    strcpy(type,"*IN");
+	    }
+
+	    if ( (a = assoc(e,gel_ed_base)) == NULL)
+		crash ("No base specified for edit, gel %d\n",cur_gel_index);
+	    else
+		type[3] = a[0];
+
+	    if ( (a = assoc(e,gel_ed_base_pos)) == NULL)
+		crash ("No base position specified for edit, gel %d\n",cur_gel_index);
+	    else
+		pos = atoi(a);
+
+	    ed_index = bap_get_free_tag(&io);
+	    tg_last_ed.lines.next = ed_index;
+
+	    bap_write_tg(&io,last_ed_index,&tg_last_ed);
+	
+	    tg_ed.lines.position = pos;
+	    tg_ed.lines.comment = 0;
+	    tg_ed.lines.next = 0;
+	    strncpy(tg_ed.lines.type.c,type,4);
+
+	    tg_last_ed = tg_ed;
+	    last_ed_index = ed_index;
+
+	}
+
+	bap_write_tg(&io,ed_index,&tg_ed);
+
+    }
+
+    /*
+    ** Right cut offs
+    */
+    if ( (a = assoc(l, gel_r_cut_seq)) != NULL) {
+	bap_tg_file_rec rcut;
+	rcut.lines.position = 0;
+	rcut.lines.length = 0;
+	rcut.lines.next = 0;
+	strncpy(rcut.lines.type.c,"*RC*",4);
+	rcut.lines.comment = bap_write_comment(&io,a);
+	bap_insert_tag(&io,cur_gel_index,rcut);
+    }
+
+
+    /*
+    ** Left cut offs
+    */
+    if ( (a = assoc(l, gel_l_cut_seq)) != NULL) {
+	bap_tg_file_rec lcut;
+	lcut.lines.position = 0;
+	lcut.lines.length = 0;
+	lcut.lines.next = 0;
+	strncpy(lcut.lines.type.c,"*LC*",4);
+	lcut.lines.comment = bap_write_comment(&io,a);
+	bap_insert_tag(&io,cur_gel_index,lcut);
+    }
+
+
+    /*
+    ** Annotation
+    */
+    notes = index_list_by_str(l,gel_annotation);
+    if (!isNil(notes)){
+
+	for( notes = cdr(notes); ! isNil(notes) ; notes = cdr(notes)) {
+	    List *n;
+	    bap_tg_file_rec tg_rec;
+	    
+	    n = car(notes);
+
+	    if ( (a = assoc(n,gel_an_pos)) == NULL)
+		crash("No position for annotation, gel %d\n",cur_gel_index);
+	    else
+		tg_rec.lines.position = atoi(a);
+
+	    if ( (a = assoc(n,gel_an_len)) == NULL)
+		crash("No length for annotation, gel %d\n",cur_gel_index);
+	    else
+		tg_rec.lines.length = atoi(a);
+
+	    if ( (a = assoc(n,gel_an_type)) == NULL)
+		crash("No type for annotation, gel %d\n",cur_gel_index);
+	    else
+		c2fstr(a,strlen(a),tg_rec.lines.type.c,(size_t)4);
+
+	    if ( (a = assoc(n,gel_an_comment)) == NULL)
+		tg_rec.lines.comment = 0;
+	    else
+		tg_rec.lines.comment = bap_write_comment(&io,a);
+
+	    bap_insert_tag(&io,cur_gel_index,tg_rec);
+	}
+
+    }
+
+    cur_gel_index++;
+
+    free(sq_line);
+
+}
+
+
+
+
+
+void xdap_late_write_contig_data(List *l)
+{
+
+    char *a;
+    bap_rl_file_rec rl_line;
+    long index;
+
+    index = io.max_gels-cur_contig_index;
+    
+    if ( (a = assoc(l,contig_left_end)) == NULL)
+	crash("No left end for contig %d\n", cur_contig_index);
+    else
+	rl_line.clines.left_end = atoi(a);
+
+    if ( (a = assoc(l,contig_right_end)) == NULL)
+	crash("No right end for contig %d\n",cur_contig_index);
+    else
+	rl_line.clines.right_end = atoi(a);
+
+    if ( (a = assoc(l,contig_length)) == NULL)
+	crash("No length for contig %d\n",cur_contig_index);
+    else
+	rl_line.clines.length = atoi(a);
+
+
+    bap_write_rl(&io,index,&rl_line);
+
+    cur_contig_index++;
+
+}
+
+
+
diff --git a/src/convert/bapDB.h b/src/convert/bapDB.h
new file mode 100644
index 0000000..46de9e0
--- /dev/null
+++ b/src/convert/bapDB.h
@@ -0,0 +1,17 @@
+#ifndef _bapDB_h
+#define _bapDB_h
+
+#include "list.h"
+
+extern xdap_late_open_for_read(List *l);
+extern void xdap_late_close(List *l);
+extern void xdap_late_open_for_write(List *l);
+extern void xdap_late_write_header(List *l);
+extern void xdap_late_write_gel_data(List *l);
+extern void xdap_late_write_contig_data(List *l);
+extern List *xdap_late_read_header();
+extern List *xdap_late_read_gel_data();
+extern List *xdap_late_read_contig_data();
+
+#endif /* _bapDB_h */
+
diff --git a/src/convert/bapIO.c b/src/convert/bapIO.c
new file mode 100644
index 0000000..1d7cf56
--- /dev/null
+++ b/src/convert/bapIO.c
@@ -0,0 +1,505 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "bapIO.h"
+#include "misc.h"
+
+
+
+#define swap_int4(src, dst) \
+    ((char *)&(dst))[0] = ((char *) &(src))[3];\
+    ((char *)&(dst))[1] = ((char *) &(src))[2];\
+    ((char *)&(dst))[2] = ((char *) &(src))[1];\
+    ((char *)&(dst))[3] = ((char *) &(src))[0];
+
+
+/*
+** Tag File IO
+*/
+
+void swap_tg_file_rec(bap_tg_file_rec *src, bap_tg_file_rec *dst)
+{
+    int i=1;
+
+    if (*(char*)&i) {
+	swap_int4(src->lines.position,dst->lines.position);
+	swap_int4(src->lines.length,dst->lines.length);
+	swap_int4(src->lines.comment,dst->lines.comment);
+	dst->lines.type = src->lines.type;
+	swap_int4(src->lines.next,dst->lines.next);
+    } else {
+	dst->lines.position = src->lines.position;
+	dst->lines.length = src->lines.length;
+	dst->lines.comment = src->lines.comment;
+	dst->lines.type = src->lines.type;
+	dst->lines.next = src->lines.next;
+    }
+    
+}
+
+
+
+
+void bap_read_tg(BapIO *io, int_4 rec, bap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    bap_tg_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_tg_file_rec), 1, f) != 1)
+	crash("Read failure on tag file\n");
+
+    swap_tg_file_rec(&tbuf,t);
+}
+
+
+
+
+void bap_write_tg(BapIO *io, int rec, bap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    bap_tg_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+
+    swap_tg_file_rec(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_tg_file_rec), 1, f) != 1)
+	crash("Write failure on tag file\n");
+}
+
+
+
+/*
+** Archive File IO
+*/
+void bap_read_ar(BapIO *io, int rec, bap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)bap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fread(t, sizeof(bap_ar_file_rec), 1, f) != 1)
+	crash("Read failure on archive file\n");
+}
+
+void bap_write_ar(BapIO *io, int rec, bap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)bap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(bap_ar_file_rec), 1, f) != 1)
+	crash("Write failure on archive file\n");
+}
+
+
+
+
+/*
+** Relationship file IO
+*/
+
+void swap_rl_file_rec(bap_rl_file_rec *src, bap_rl_file_rec *dst)
+{
+    int i=1;
+
+    if (*(char*)&i) {
+	swap_int4(src->lines.rel_pos,dst->lines.rel_pos);
+	swap_int4(src->lines.length,dst->lines.length);
+	swap_int4(src->lines.left_nbr,dst->lines.left_nbr);
+	swap_int4(src->lines.right_nbr,dst->lines.right_nbr);
+    } else {
+	dst->lines.rel_pos = src->lines.rel_pos;
+	dst->lines.length = src->lines.length;
+	dst->lines.left_nbr = src->lines.left_nbr;
+	dst->lines.right_nbr = src->lines.right_nbr;
+    }
+    
+}
+
+
+
+void bap_read_rl(BapIO *io, int rec, bap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    bap_rl_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_rl_file_rec), 1, f) != 1)
+	crash("Read failure on relationships file\n");
+
+    swap_rl_file_rec(&tbuf,t);
+}
+
+void bap_write_rl(BapIO *io, int rec, bap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    bap_rl_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    swap_rl_file_rec(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_rl_file_rec), 1, f) != 1)
+	crash("Write failure on relationships file\n");
+}
+
+
+
+
+/*
+** Comment file IO
+*/
+
+
+void swap_cc_file_rec_header(bap_cc_file_rec *src, bap_cc_file_rec *dst)
+{
+    int i=1;
+    if (*(char*)&i) {
+	swap_int4(src->header.free_list,dst->header.free_list);
+	swap_int4(src->header.count,dst->header.count);
+    } else {
+	dst->header.free_list = src->header.free_list;
+	dst->header.count = src->header.count;
+    }
+    
+}
+
+
+
+
+void swap_cc_file_rec_lines(bap_cc_file_rec *src, bap_cc_file_rec *dst)
+{
+    int i=1;
+    if (*(char*)&i) {
+	swap_int4(src->lines.next,dst->lines.next);
+    } else {
+	dst->lines.next = src->lines.next;
+    }
+    memcpy(dst->lines.comment,src->lines.comment,BAP_COMMENT_SIZE);
+    
+}
+
+void bap_read_cc(BapIO *io, int rec, bap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    bap_cc_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_cc_file_rec), 1, f) != 1)
+	crash("Read failure on comment file, record %d\n",rec);
+
+    if (rec == bap_cc_header_rec(io))
+	swap_cc_file_rec_header(&tbuf,t);
+    else
+	swap_cc_file_rec_lines(&tbuf,t);
+
+}
+
+
+
+void bap_write_cc(BapIO *io, int rec, bap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    bap_cc_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if (rec == bap_cc_header_rec(io))
+	swap_cc_file_rec_header(t,&tbuf);
+    else
+	swap_cc_file_rec_lines(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_cc_file_rec), 1, f) != 1)
+	crash("Crash failure on comment file, record %d\n",rec);
+}
+
+
+
+/*
+** Sequence file IO
+*/
+void bap_read_sq(BapIO *io, int rec, bap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)bap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fread(t, io->max_gel_length, 1, f) != 1)
+	crash("Read failure on sequence file\n");
+}
+
+void bap_write_sq(BapIO *io, int rec, bap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)bap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fwrite(t, io->max_gel_length, 1, f) != 1)
+	crash("Write failure on sequence file\n");
+}
+
+
+
+/*
+** Comment IO - Strings
+*/
+char *bap_read_comment(BapIO *io, int_4 cp)
+{
+    bap_cc_file_rec c;
+    int count;
+    int_4 nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    bap_read_cc(io, nc, &c);
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        bap_read_cc(io, nc, &c);
+    }
+
+    com = comptr = (char *)malloc(count * BAP_COMMENT_SIZE+1);
+    nc=cp;
+    bap_read_cc(io, nc, &c);
+    strncpy(com,c.lines.comment,BAP_COMMENT_SIZE); com+=BAP_COMMENT_SIZE;
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        bap_read_cc(io, nc, &c);
+        strncpy(com,c.lines.comment,BAP_COMMENT_SIZE); com+=BAP_COMMENT_SIZE;
+    }
+
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+
+
+static int_4 get_free_comment(BapIO *io)
+{
+    bap_cc_file_rec head;
+    bap_cc_file_rec freerec;
+    int_4 free_id;
+    bap_read_cc(io,bap_cc_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	bap_read_cc(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	bap_write_cc(io,bap_cc_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	bap_write_cc(io,bap_cc_header_rec(io),&head);
+	bap_write_cc(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+int_4 bap_get_free_tag(BapIO *io)
+{
+    bap_tg_file_rec head;
+    bap_tg_file_rec freerec;
+    int_4 free_id;
+    bap_read_tg(io,bap_tg_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	bap_read_tg(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	bap_write_tg(io,bap_tg_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	bap_write_tg(io,bap_tg_header_rec(io),&head);
+	bap_write_tg(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+
+
+
+
+void bap_insert_tag(BapIO *io, int_4 gel, bap_tg_file_rec t)
+{
+    int_4 next, last;
+    int_4 free;
+    bap_tg_file_rec tg,last_tg;
+
+    last = gel;
+    bap_read_tg(io,last,&last_tg);
+
+    next = last_tg.lines.next;
+    if (next) bap_read_tg(io,next,&tg);
+
+    while (next && tg.lines.position <= t.lines.position) {
+	last = next;
+	last_tg = tg;
+	next = tg.lines.next;
+	if (next) bap_read_tg(io,next,&tg);
+    }
+
+    /* insert after last */
+    free = bap_get_free_tag(io);
+    t.lines.next = next;
+    last_tg.lines.next = free;
+    bap_write_tg(io,last,&last_tg);
+    bap_write_tg(io,free,&t);
+
+}
+
+
+int_4 bap_write_comment(BapIO *io, char *c)
+{
+    bap_cc_file_rec com;
+    int_4 cur,next,this_comment;
+    int clen = strlen(c);
+    int piece;
+
+    /* write out first block of BAP_COMMENT_SIZE */
+    this_comment=cur=get_free_comment(io);
+    if (clen>BAP_COMMENT_SIZE)
+	piece = BAP_COMMENT_SIZE;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<BAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+    strncpy(com.lines.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment(io);
+	com.lines.next = next;
+	bap_write_cc(io,cur,&com);
+	cur = next;
+	if (clen<BAP_COMMENT_SIZE)
+	    piece = clen;
+
+	{int i; for(i=0;i<BAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+	strncpy(com.lines.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.lines.next = 0;
+    if (piece!=BAP_COMMENT_SIZE)
+	com.lines.comment[piece]='\0';
+    bap_write_cc(io,cur,&com);
+
+    return this_comment;
+}
+
+
+
+
+
+
+
+
+
+
+static void set_file_names(BapIO *io, char *name, char *version)
+{
+    strcpy(io->ar_file,name); strcat(io->ar_file,".AR"); strcat(io->ar_file,version);
+    strcpy(io->rl_file,name); strcat(io->rl_file,".RL"); strcat(io->rl_file,version);
+    strcpy(io->sq_file,name); strcat(io->sq_file,".SQ"); strcat(io->sq_file,version);
+    strcpy(io->tg_file,name); strcat(io->tg_file,".TG"); strcat(io->tg_file,version);
+    strcpy(io->cc_file,name); strcat(io->cc_file,".CC"); strcat(io->cc_file,version);
+}
+
+static void bap_open_files(BapIO *io, char *name, char *version, char *mode)
+/*
+**
+*/
+{
+    /*
+    ** Create file names
+    */
+    set_file_names(io,name,version);
+
+    /*
+    ** Open files
+    */
+    if ( ( io->ar_fp = fopen(io->ar_file,mode) ) == NULL )
+	crash("Error opening archive file %s\n",io->ar_file);
+    if ( ( io->rl_fp = fopen(io->rl_file,mode) ) == NULL )
+	crash("Error opening relationships file %s\n",io->rl_file);
+    if ( ( io->sq_fp = fopen(io->sq_file,mode) ) == NULL )
+	crash("Error opening sequence file %s\n",io->sq_file);
+    if ( ( io->tg_fp = fopen(io->tg_file,mode) ) == NULL )
+	crash("Error opening tag file %s\n",io->tg_file);
+    if ( ( io->cc_fp = fopen(io->cc_file,mode) ) == NULL )
+	crash("Error opening tag-comment file %s\n",io->cc_file);
+
+}
+
+
+void bap_open_for_read(BapIO *io, char *name, char *version)
+{
+    bap_rl_file_rec rl_header;
+
+    bap_open_files(io,name,version,"rb");
+    
+    bap_read_rl(io,bap_rl_dbheader_rec(io),&rl_header);
+    io->max_gels = rl_header.dbheader.idbsiz;
+    io->max_gel_length = rl_header.dbheader.maxgel;
+    io->data_class = rl_header.dbheader.idm;
+
+    bap_read_rl(io,bap_rl_header_rec(io),&rl_header);
+    io->num_gels = rl_header.header.num_gels;
+    io->num_contigs = rl_header.header.num_contigs;
+}
+
+void bap_open_for_write(BapIO *io, char *name, char *version)
+{
+    bap_open_files(io,name,version,"w+b");
+}
+
+void bap_open_for_update(BapIO *io, char *name, char *version)
+{
+    bap_open_files(io,name,version,"r+b");
+}
+
+
+
+void bap_close_files(BapIO *io)
+/*
+** Close all relevant files
+*/
+{
+
+    fclose(io->ar_fp);
+    fclose(io->rl_fp);
+    fclose(io->sq_fp);
+    fclose(io->tg_fp);
+    fclose(io->cc_fp);
+
+}
+
diff --git a/src/convert/bapIO.h b/src/convert/bapIO.h
new file mode 100644
index 0000000..69e9f50
--- /dev/null
+++ b/src/convert/bapIO.h
@@ -0,0 +1,190 @@
+#ifndef _bapIO_h
+#define _bapIO_h
+
+#include "newtypes.h"
+
+/*
+** Definition of dap database files
+*/
+
+
+
+/*
+** Archive file (*.AR?)
+*/
+#define BAP_FILE_NAME_LENGTH 16
+typedef union {
+
+    struct _ar_lines {
+	char name[BAP_FILE_NAME_LENGTH];
+    } lines;
+
+} bap_ar_file_rec;
+
+#define bap_ar_byte_index(I,R) ( ((R)-1) *sizeof(bap_ar_file_rec) )
+
+
+
+
+
+
+
+/*
+** Relationships file (*.RL?)
+*/
+typedef union {
+
+    struct _rl_dbheader { 
+	int_4 maxdb;
+	int_4 idbsiz;
+	int_4 maxgel;
+	int_4 idm;
+    } dbheader;
+
+    struct _rl_header { 
+	int_4 num_gels;
+	int_4 num_contigs;
+	int_4 spare1;
+	int_4 spare2;
+    } header;
+
+    struct _rl_lines {
+	int_4 rel_pos;
+	int_4 length;
+	int_4 left_nbr;
+	int_4 right_nbr;
+    } lines;
+
+    struct _rl_clines {
+	int_4 length;
+	int_4 spare3;
+	int_4 left_end;
+	int_4 right_end;
+    } clines;
+
+} bap_rl_file_rec;
+
+#define bap_rl_header_rec(I) ( (I)->max_gels )
+#define bap_rl_byte_index(I,R) ( (R) * sizeof(bap_rl_file_rec) )
+#define bap_rl_dbheader_rec(I) (0)
+
+
+
+
+
+
+/*
+** Sequence file (*.SQ?)
+*/
+typedef char *bap_sq_file_rec;
+#define bap_sq_byte_index(I,R) ( ((R)-1) * ((I)->max_gel_length) )
+
+
+
+/*
+** Tag files (*.TG?)
+*/
+typedef union {
+    int i;
+    char c[4];
+} bap_tag_type;
+
+
+
+typedef union {
+
+    struct _tg_header {
+	int_4 count;
+	int_4 spare1;
+	int_4 spare2;
+	bap_tag_type spare3;
+	int_4 free_list;
+    } header;
+
+    struct _tg_lines {
+	int_4 position;
+	int_4 length;
+	int_4 comment;
+	bap_tag_type type;
+	int_4 next;
+    } lines;
+
+} bap_tg_file_rec;
+
+#define bap_tg_byte_index(I,R) ( ((R)-1) *sizeof(bap_tg_file_rec) )
+#define bap_tg_header_rec(I) ( (I)->max_gels )
+
+/*
+** Comment files (*.CC?)
+*/
+#define BAP_COMMENT_SIZE 40
+typedef union {
+
+    struct _cc_header {
+	int_4 free_list;
+	int_4 count;
+	char spare[BAP_COMMENT_SIZE - sizeof(int_4)];
+    } header;
+
+    struct _cc_lines {
+	int_4 next;
+	char comment[BAP_COMMENT_SIZE];
+    } lines;
+
+} bap_cc_file_rec;
+
+#define bap_cc_byte_index(I,R) ( ((R)-1) *sizeof(bap_cc_file_rec) )
+#define bap_cc_header_rec(I) (1)
+
+
+
+/*
+** Useful variables
+*/
+typedef char IOString[200];
+
+typedef struct {
+    int max_gels;
+    int num_gels;
+    int num_contigs;
+    int max_gel_length;
+    int data_class;
+    int max_db_size;
+    
+    FILE *ar_fp;
+    FILE *rl_fp;
+    FILE *sq_fp;
+    FILE *tg_fp;
+    FILE *cc_fp;
+
+    IOString ar_file;
+    IOString rl_file;
+    IOString sq_file;
+    IOString tg_file;
+    IOString cc_file;
+} BapIO;
+
+
+
+
+
+extern void bap_read_tg(BapIO *io, int rec, bap_tg_file_rec *t);
+extern void bap_write_tg(BapIO *io, int rec, bap_tg_file_rec *t);
+extern void bap_read_ar(BapIO *io, int rec, bap_ar_file_rec *t);
+extern void bap_write_ar(BapIO *io, int rec, bap_ar_file_rec *t);
+extern void bap_read_rl(BapIO *io, int rec, bap_rl_file_rec *t);
+extern void bap_write_rl(BapIO *io, int rec, bap_rl_file_rec *t);
+extern void bap_read_cc(BapIO *io, int rec, bap_cc_file_rec *t);
+extern void bap_write_cc(BapIO *io, int rec, bap_cc_file_rec *t);
+extern void bap_read_sq(BapIO *io, int rec, bap_sq_file_rec t);
+extern void bap_write_sq(BapIO *io, int rec, bap_sq_file_rec t);
+extern char *bap_read_comment(BapIO *io, int_4 cp);
+extern int_4 bap_write_comment(BapIO *io, char *c);
+extern int_4 bap_get_free_tag(BapIO *io);
+extern void bap_insert_tag(BapIO *io, int_4 gel, bap_tg_file_rec t);
+extern void bap_open_for_read(BapIO *io, char *name, char *version);
+extern void bap_open_for_write(BapIO *io, char *name, char *version);
+extern void bap_close_files(BapIO *io);
+
+
+#endif /* _bapIO_h */
diff --git a/src/convert/convert.doc b/src/convert/convert.doc
new file mode 100644
index 0000000..adb8626
--- /dev/null
+++ b/src/convert/convert.doc
@@ -0,0 +1,77 @@
+	     Converting xdap databases to xbap databases
+			      (and more)
+				   
+			      Simon Dear
+			   18 November 1992
+
+
+0. Introduction
+
+The Staden package (version 1992.3 or greater) now includes the
+sequence assembly program xbap. Xbap cannot be used with databases
+created by the sequence assembly program xdap. People wanting to
+switch to using the new software will need to convert their old
+databases. The program `convert' is provided for this purpose.
+
+1. The conversion program
+
+`convert' (version 1.0) is an interactive program for converting
+sequence assembly data between various formats. With it, it is
+possible to convert data in any of the formats:
+
+	text file (created by convert)
+	xdap database
+	xbap database
+
+to any of the following formats:
+
+	text file (created by convert)
+	xbap database
+
+
+2. Converting databases
+
+The program takes no command line arguments. First, you are prompted
+for the format, name and version of the database you wish to convert.
+If the source is an xdap or xbap database, ensure that the name and
+version are in uppercase. If the source is a text file, the version is
+prompted for but it is ignored. Second, you are prompted for the
+format, name and version of the database you wish to create. Ensure
+that names and versions are in the appropriate case and that the files
+do already exist.
+
+After this, the programs converts the database. This may take some
+time. The next message to appear on the terminal is a message to
+indicate the conversion has successfully completed.
+
+
+3. Example
+
+Here is a log of a typical conversion session:
+
+    Covert Project Database
+    Version 1.0, 13 November 1991
+    Please enter database to convert:
+
+    Available types are:
+    0. Flat file - created with this program
+    1. xdap database
+    2. xbap database
+
+    Database type? 1
+    Database name? ZK643
+    Database version? 0
+
+    Please enter database to create:
+
+    Available types are:
+    0. Flat file - created with this program
+    1. xbap database
+
+    Database type? 1
+    Database name? ZK643
+    Database version? 1
+
+    Conversion completed
+
+
diff --git a/src/convert/dapDB.c b/src/convert/dapDB.c
new file mode 100644
index 0000000..5487d2a
--- /dev/null
+++ b/src/convert/dapDB.c
@@ -0,0 +1,453 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "process.h"
+#include "list.h"
+#include "dapIO.h"
+#include "dapDB.h"
+#include "misc.h"
+
+/*
+** For dap io
+*/
+static DapIO io;
+
+
+
+
+
+
+
+
+int maxgel;
+
+
+static int cur_gel_index;
+static int cur_contig_index;
+
+
+
+static List *file_details()
+{
+    return
+	build_list(
+		   atom_str(db_files),
+		   build_list(
+			      atom_str(db_files_arch),
+			      atom_str(io.ar_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_rel),
+			      atom_str(io.rl_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_seq),
+			      atom_str(io.sq_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_tag),
+			      atom_str(io.tg_file),
+			      nil),
+		   build_list(
+			      atom_str(db_files_com),
+			      atom_str(io.cc_file),
+			      nil),
+		   nil
+	       );
+}
+
+
+
+
+static List *db_details()
+{
+    
+    return
+	build_list(
+		   build_list(
+			      atom_str(db_max_gels),
+			      atom_int(io.max_gels),
+			      nil),
+		   build_list(
+			      atom_str(db_max_gel_length),
+			      atom_int(io.max_gel_length),
+			      nil),
+		   build_list(
+			      atom_str(db_data_class),
+			      atom_int(io.data_class),
+			      nil),
+		   build_list(
+			      atom_str(db_num_gels),
+			      atom_int(io.num_gels),
+			      nil),
+		   build_list(
+			      atom_str(db_num_contigs),
+			      atom_int(io.num_contigs),
+			      nil),
+		   nil);
+
+}
+
+
+
+
+
+
+
+void xdap_middle_open_for_read(List *l)
+/*
+**
+*/
+{
+    char *name;
+    char *version;
+
+
+    name = assoc(l,db_name);
+    if (! name)	crash("No database name specified\n");
+
+    version = assoc(l,db_version);
+    if (! version) crash("No version specified\n");
+
+    dap_open_for_read(&io, name, version);
+
+    cur_gel_index = 1;
+    cur_contig_index = 1;
+}
+
+
+List *xdap_middle_read_header()
+/*
+**
+*/
+{
+    List *files;
+    List *details;
+    
+    files = file_details();
+    details = db_details();
+
+    return
+	join_list(
+		  build_list(
+			     atom_str(db_from),
+			     nil),
+		  details,
+		  build_list(
+			     files,
+			     nil),
+		  nil);
+
+
+}
+
+
+
+
+
+
+
+
+List *xdap_middle_read_gel_data()
+{
+    List *gel_details;
+
+    if (cur_gel_index > io.num_gels)
+	gel_details = nil;
+    else {
+	dap_ar_file_rec ar_line;
+	dap_rl_file_rec rl_line;
+	dap_sq_file_rec sq_line;
+	dap_tg_file_rec tg_line;
+
+	int index;
+	char name[13];
+	char *seq;
+	int length;
+	int comp;
+	int pos;
+	int l_nbr;
+	int r_nbr;
+
+	sq_line = (char *) malloc(io.max_gel_length+1);
+
+	dap_read_ar(&io,cur_gel_index,&ar_line);
+	dap_read_rl(&io,cur_gel_index,&rl_line);
+	dap_read_tg(&io,cur_gel_index,&tg_line);
+	dap_read_sq(&io,cur_gel_index,sq_line);
+
+
+	index = cur_gel_index;
+	f2cstr(ar_line.lines.name,10,name,12);
+	length = abs(rl_line.lines.length);
+	comp = (rl_line.lines.length < 0);
+	seq = sq_line; seq[length] = '\0';
+	pos = rl_line.lines.rel_pos;
+	l_nbr = rl_line.lines.left_nbr;
+	r_nbr = rl_line.lines.right_nbr;
+
+	gel_details =
+	    build_list(
+		       atom_str(gel_rec),
+		       build_list(
+				  atom_str(gel_index),
+				  atom_int(index),
+				  nil),
+		       build_list(
+				  atom_str(gel_name),
+				  atom_str(name),
+				  nil),
+		       build_list(
+				  atom_str(gel_length),
+				  atom_int(length),
+				  nil),
+		       build_list(
+				  atom_str(gel_comp),
+				  atom_int(comp),
+				  nil),
+		       build_list(
+				  atom_str(gel_seq),
+				  atom_str(seq),
+				  nil),
+		       build_list(
+				  atom_str(gel_pos),
+				  atom_int(pos),
+				  nil),
+		       build_list(
+				  atom_str(gel_l_nbr),
+				  atom_int(l_nbr),
+				  nil),
+		       build_list(
+				  atom_str(gel_r_nbr),
+				  atom_int(r_nbr),
+				  nil),
+		       nil);
+
+
+
+	/* Get raw data details */
+	if (tg_line.lines.comment) {
+	    List *raw_data_details;
+
+	    char *rd;
+	    int rd_length;
+	    int rd_cut;
+	    int rd_ulen;
+	    char rd_type[5];
+	    char rd_file[19];
+	    
+	    rd = dap_read_comment(&io, tg_line.lines.comment);
+	    sscanf(rd,"%6d%6d%6d%*s",&rd_length, &rd_cut, &rd_ulen);
+	    f2cstr(&rd[18],4,rd_type,4);
+	    f2cstr(&rd[22],18,rd_file,18);
+
+
+	    raw_data_details =
+		build_list(
+			   build_list(
+				      atom_str(gel_rd_length),
+				      atom_int(rd_length),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_cut),
+				      atom_int(rd_cut),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_ulen),
+				      atom_int(rd_ulen),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_type),
+				      atom_str(rd_type),
+				      nil),
+			   build_list(
+				      atom_str(gel_rd_file),
+				      atom_str(rd_file),
+				      nil),
+			   nil);
+
+	    join_list (gel_details, raw_data_details, nil);
+
+	}
+
+
+
+	/*
+	** Process tags, maintaining separate lists for
+	** (a) special tags
+	** (b) annotation
+	** (c) edits
+	*/
+	{
+	    List *specials;
+	    List *notes;
+	    List *edits;
+
+	    int_4 next;
+	    specials = nil;
+	    notes =
+		build_list(
+			   atom_str(gel_annotation),
+			   nil);
+	    edits =
+		build_list(
+			   atom_str(gel_edits),
+			   nil);
+
+	    while (tg_line.lines.next) {
+		next = tg_line.lines.next;
+		dap_read_tg(&io,next,&tg_line);
+
+		if (strncmp(tg_line.lines.type.c,"*LC*",4)==0) {
+		    if (tg_line.lines.comment) {
+			List *lc;
+			lc = build_list(
+					atom_str(gel_l_cut_seq),
+					atom_str(dap_read_comment(&io,tg_line.lines.comment)),
+					nil);
+			if (isNil(specials))
+			    specials = build_list(lc,nil);
+			else
+			    specials = join_list(specials,build_list(lc,nil),nil);
+		    }
+		} else if (strncmp(tg_line.lines.type.c,"*RC*",4)==0) {
+		    if (tg_line.lines.comment) {
+			List *rc;
+			rc = build_list(
+					atom_str(gel_r_cut_seq),
+					atom_str(dap_read_comment(&io,tg_line.lines.comment)),
+					nil);
+			if (isNil(specials))
+			    specials = build_list(rc,nil);
+			else
+			    specials = join_list(specials,build_list(rc,nil),nil);
+		    }
+		} else if (strncmp(tg_line.lines.type.c,"*",1)==0) {
+		    List *ed;
+		    char base[2];
+		    base[0] = tg_line.lines.type.c[3];
+		    base[1] = '\0';
+		    ed = build_list(
+				    build_list(
+					       atom_str(gel_ed_op),
+					       atom_str( (strncmp(tg_line.lines.type.c,"*IN",3))==0
+							? gel_ed_insert : gel_ed_delete),
+					       nil),
+				    build_list(
+					       atom_str(gel_ed_base),
+					       atom_str(base),
+					       nil),
+				    build_list(
+					       atom_str(gel_ed_base_pos),
+					       atom_int( tg_line.lines.position ),
+					       nil),
+				    nil);
+		    edits = join_list(edits, build_list(ed,nil),nil);
+		} else {
+		    List *an;
+		    char type[5];
+		    char *com;
+		    strncpy(type,tg_line.lines.type.c,4);
+		    type[4]='\0';
+		    com = dap_read_comment(&io,tg_line.lines.comment);
+		    an = build_list(
+				    build_list(
+					       atom_str(gel_an_pos),
+					       atom_int(tg_line.lines.position),
+					       nil),
+				    build_list(
+					       atom_str(gel_an_len),
+					       atom_int(tg_line.lines.length),
+					       nil),
+				    build_list(
+					       atom_str(gel_an_type),
+					       atom_str(type),
+					       nil),
+				    (com == NULL) ? nil :
+				    build_list(
+					       atom_str(gel_an_comment),
+					       atom_str(com),
+					       nil),
+				    nil);
+		    notes = join_list(notes,build_list(an,nil),nil);
+		}
+	    }
+
+	    if (isNil(specials))
+		gel_details = join_list(gel_details,build_list(edits,nil),build_list(notes,nil),nil);
+	    else
+		gel_details = join_list(gel_details,specials,build_list(edits,nil),build_list(notes,nil),nil);
+
+	}
+
+
+
+	cur_gel_index++;
+	free(sq_line);
+
+    }
+
+    return gel_details;
+
+}
+
+
+
+
+
+List *xdap_middle_read_contig_data()
+{
+    List *contig_details;
+
+    if (cur_contig_index > io.num_contigs)
+	contig_details = nil;
+    else {
+	dap_rl_file_rec rl_line;
+	int length;
+	int index;
+	int left_end;
+	int right_end;
+
+	index = io.max_gels-cur_contig_index;
+	dap_read_rl(&io,index,&rl_line);
+	length = rl_line.clines.length;
+	left_end = rl_line.clines.left_end;
+	right_end = rl_line.clines.right_end;
+
+	contig_details =
+	    build_list(
+		       atom_str(contig_rec),
+		       build_list(
+				  atom_str(contig_index),
+				  atom_int(index),
+				  nil),
+		       build_list(
+				  atom_str(contig_length),
+				  atom_int(length),
+				  nil),
+		       build_list(
+				  atom_str(contig_left_end),
+				  atom_int(left_end),
+				  nil),
+		       build_list(
+				  atom_str(contig_right_end),
+				  atom_int(right_end),
+				  nil),
+		       nil);
+
+	cur_contig_index++;
+    }
+
+    return contig_details;
+
+}
+
+
+void xdap_middle_close(List *l)
+/*
+** Close all relevant files
+*/
+{
+    dap_close_files(&io);
+}
diff --git a/src/convert/dapDB.h b/src/convert/dapDB.h
new file mode 100644
index 0000000..137b642
--- /dev/null
+++ b/src/convert/dapDB.h
@@ -0,0 +1,13 @@
+#ifndef _dapDB_h
+#define _dapDB_h
+
+#include "list.h"
+
+extern void xdap_middle_open_for_read(List *l);
+extern void xdap_middle_close(List *l);
+extern List *xdap_middle_read_header();
+extern List *xdap_middle_read_gel_data();
+extern List *xdap_middle_read_contig_data();
+
+#endif /* _dapDB_h */
+
diff --git a/src/convert/dapIO.c b/src/convert/dapIO.c
new file mode 100644
index 0000000..906f176
--- /dev/null
+++ b/src/convert/dapIO.c
@@ -0,0 +1,376 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "dapIO.h"
+#include "misc.h"
+
+/*
+** Tag File IO
+*/
+void dap_read_tg(DapIO *io, int rec, dap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    if ( fseek(f,(off_t)dap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_tg_file_rec), 1, f) != 1)
+	crash("Read failure on tag file\n");
+}
+
+void dap_write_tg(DapIO *io, int rec, dap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    if ( fseek(f,(off_t)dap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_tg_file_rec), 1, f) != 1)
+	crash("Write failure on tag file\n");
+}
+
+
+
+/*
+** Archive File IO
+*/
+void dap_read_ar(DapIO *io, int rec, dap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)dap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_ar_file_rec), 1, f) != 1)
+	crash("Read failure on archive file\n");
+}
+
+void dap_write_ar(DapIO *io, int rec, dap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)dap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_ar_file_rec), 1, f) != 1)
+	crash("Write failure on archive file\n");
+}
+
+
+
+
+/*
+** Relationship file IO
+*/
+void dap_read_rl(DapIO *io, int rec, dap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    if ( fseek(f,(off_t)dap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_rl_file_rec), 1, f) != 1)
+	crash("Read failure on relationships file\n");
+}
+
+void dap_write_rl(DapIO *io, int rec, dap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    if ( fseek(f,(off_t)dap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_rl_file_rec), 1, f) != 1)
+	crash("Write failure on relationships file\n");
+}
+
+
+
+
+
+/*
+** Comment file IO
+*/
+void dap_read_cc(DapIO *io, int rec, dap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    if ( fseek(f,(off_t)dap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_cc_file_rec), 1, f) != 1)
+	crash("Read failure on comment file, record %d\n",rec);
+}
+
+void dap_write_cc(DapIO *io, int rec, dap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    if ( fseek(f,(off_t)dap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_cc_file_rec), 1, f) != 1)
+	crash("Crash failure on comment file, record %d\n",rec);
+}
+
+
+
+/*
+** Sequence file IO
+*/
+void dap_read_sq(DapIO *io, int rec, dap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)dap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fread(t, io->max_gel_length, 1, f) != 1)
+	crash("Read failure on sequence file\n");
+}
+
+void dap_write_sq(DapIO *io, int rec, dap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)dap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fwrite(t, io->max_gel_length, 1, f) != 1)
+	crash("Write failure on sequence file\n");
+}
+
+
+
+/*
+** Comment IO - Strings
+*/
+char *dap_read_comment(DapIO *io, int_4 cp)
+{
+    dap_cc_file_rec c;
+    int count;
+    int_4 nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    dap_read_cc(io, nc, &c);
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        dap_read_cc(io, nc, &c);
+    }
+
+    com = comptr = (char *)malloc(count * DAP_COMMENT_SIZE+1);
+    nc=cp;
+    dap_read_cc(io, nc, &c);
+    strncpy(com,c.lines.comment,DAP_COMMENT_SIZE); com+=DAP_COMMENT_SIZE;
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        dap_read_cc(io, nc, &c);
+        strncpy(com,c.lines.comment,DAP_COMMENT_SIZE); com+=DAP_COMMENT_SIZE;
+    }
+
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+#ifdef nodef
+static int_4 get_free_comment(DapIO *io)
+{
+    dap_cc_file_rec head;
+    dap_cc_file_rec freerec;
+    int_4 free_id;
+    dap_read_cc(io,dap_cc_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	dap_read_cc(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	dap_write_cc(io,dap_cc_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	dap_write_cc(io,dap_cc_header_rec(io),&head);
+	dap_write_cc(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+static int_4 get_free_tag(DapIO *io)
+{
+    dap_tg_file_rec head;
+    dap_tg_file_rec freerec;
+    int_4 free_id;
+    dap_read_tg(io,dap_tg_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	dap_read_tg(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	dap_write_tg(io,dap_tg_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	dap_write_tg(io,dap_tg_header_rec(io),&head);
+	dap_write_tg(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+
+
+
+
+static void insert_tag(DapIO *io, int_4 gel, dap_tg_file_rec t)
+{
+    int_4 next, last;
+    int_4 free;
+    dap_tg_file_rec tg,last_tg;
+
+    last = gel;
+    dap_read_tg(io,last,&last_tg);
+
+    next = last_tg.lines.next;
+    if (next) dap_read_tg(io,next,&tg);
+
+    while (next && tg.lines.position <= t.lines.position) {
+	last = next;
+	last_tg = tg;
+	next = tg.lines.next;
+	if (next) dap_read_tg(io,next,&tg);
+    }
+
+    /* insert after last */
+    free = get_free_tag(io);
+    t.lines.next = next;
+    last_tg.lines.next = free;
+    dap_write_tg(io,last,&last_tg);
+    dap_write_tg(io,free,&t);
+
+}
+
+
+static int_4 write_comment(DapIO *io, char *c)
+{
+    dap_cc_file_rec com;
+    int_4 cur,next,this_comment;
+    int clen = strlen(c);
+    int piece;
+
+    /* write out first block of DAP_COMMENT_SIZE */
+    this_comment=cur=get_free_comment(io);
+    if (clen>DAP_COMMENT_SIZE)
+	piece = DAP_COMMENT_SIZE;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<DAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+    strncpy(com.lines.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment(io);
+	com.lines.next = next;
+	dap_write_cc(io,cur,&com);
+	cur = next;
+	if (clen<DAP_COMMENT_SIZE)
+	    piece = clen;
+
+	{int i; for(i=0;i<DAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+	strncpy(com.lines.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.lines.next = 0;
+    if (piece!=DAP_COMMENT_SIZE)
+	com.lines.comment[piece]='\0';
+    dap_write_cc(io,cur,&com);
+
+    return this_comment;
+}
+#endif /*nodef*/
+
+static void set_file_names(DapIO *io, char *name, char *version)
+{
+    strcpy(io->ar_file,name); strcat(io->ar_file,".AR"); strcat(io->ar_file,version);
+    strcpy(io->rl_file,name); strcat(io->rl_file,".RL"); strcat(io->rl_file,version);
+    strcpy(io->sq_file,name); strcat(io->sq_file,".SQ"); strcat(io->sq_file,version);
+    strcpy(io->tg_file,name); strcat(io->tg_file,".TG"); strcat(io->tg_file,version);
+    strcpy(io->cc_file,name); strcat(io->cc_file,".CC"); strcat(io->cc_file,version);
+}
+
+static void dap_open_files(DapIO *io, char *name, char *version, char *mode)
+/*
+**
+*/
+{
+
+    /*
+    ** Create file names
+    */
+    set_file_names(io,name,version);
+
+    /*
+    ** Open files
+    */
+    if ( ( io->ar_fp = fopen(io->ar_file,mode) ) == NULL )
+	crash("No archive file %s\n",io->ar_file);
+    if ( ( io->rl_fp = fopen(io->rl_file,mode) ) == NULL )
+	crash("No relationships file %s\n",io->rl_file);
+    if ( ( io->sq_fp = fopen(io->sq_file,mode) ) == NULL )
+	crash("No sequence file %s\n",io->sq_file);
+    if ( ( io->tg_fp = fopen(io->tg_file,mode) ) == NULL )
+	crash("No tag file %s\n",io->tg_file);
+    if ( ( io->cc_fp = fopen(io->cc_file,mode) ) == NULL )
+	crash("No tag-comment file %s\n",io->cc_file);
+
+}
+
+
+void dap_open_for_read(DapIO *io, char *name, char *version)
+{
+    dap_ar_file_rec ar_header;
+    dap_rl_file_rec rl_header;
+
+    dap_open_files(io,name,version,"rb");
+    
+    dap_read_ar(io,dap_ar_header_rec(io),&ar_header);
+    io->max_gels = ar_header.header.idbsiz;
+    io->max_gel_length = ar_header.header.maxgel;
+    io->data_class = ar_header.header.idm;
+
+    dap_read_rl(io,dap_rl_header_rec(io),&rl_header);
+    io->num_gels = rl_header.header.num_gels;
+    io->num_contigs = rl_header.header.num_contigs;
+}
+
+void dap_open_for_write(DapIO *io, char *name, char *version)
+{
+    dap_open_files(io,name,version,"r+b");
+}
+
+
+
+void dap_close_files(DapIO *io)
+/*
+** Close all relevant files
+*/
+{
+
+    fclose(io->ar_fp);
+    fclose(io->rl_fp);
+    fclose(io->sq_fp);
+    fclose(io->tg_fp);
+    fclose(io->cc_fp);
+
+}
+
diff --git a/src/convert/dapIO.h b/src/convert/dapIO.h
new file mode 100644
index 0000000..8d8dca5
--- /dev/null
+++ b/src/convert/dapIO.h
@@ -0,0 +1,184 @@
+#ifndef _dapIO_h
+#define _dapIO_h
+
+#include "newtypes.h"
+
+/*
+** Definition of dap database files
+*/
+
+
+
+/*
+** Archive file (*.AR?)
+*/
+#define DAP_FILE_NAME_LENGTH 12
+typedef union {
+
+    struct _ar_header {
+	int_4 idbsiz;
+	int_4 maxgel;
+	int_4 idm;
+    } header;
+
+    struct _ar_lines {
+	char name[DAP_FILE_NAME_LENGTH];
+    } lines;
+
+} dap_ar_file_rec;
+
+#define dap_ar_header_rec(I)  ( 1000 )
+#define dap_ar_byte_index(I,R) ( ((R)-1) * sizeof(dap_ar_file_rec) )
+
+
+
+
+
+
+
+/*
+** Relationships file (*.RL?)
+*/
+typedef union {
+
+    struct _rl_header { 
+	int_4 num_gels;
+	int_4 num_contigs;
+	int_4 spare1;
+	int_4 spare2;
+    } header;
+
+    struct _rl_lines {
+	int_4 rel_pos;
+	int_4 length;
+	int_4 left_nbr;
+	int_4 right_nbr;
+    } lines;
+
+    struct _rl_clines {
+	int_4 length;
+	int_4 spare3;
+	int_4 left_end;
+	int_4 right_end;
+    } clines;
+
+} dap_rl_file_rec;
+
+#define dap_rl_header_rec(I) ( (I)->max_gels )
+#define dap_rl_byte_index(I,R) ( ((R)-1) * sizeof(dap_rl_file_rec) )
+
+
+
+
+
+
+/*
+** Sequence file (*.SQ?)
+*/
+typedef char *dap_sq_file_rec;
+#define dap_sq_byte_index(I,R) ( ((R)-1) * ((I)->max_gel_length) )
+
+
+
+/*
+** Tag files (*.TG?)
+*/
+typedef union {
+    int i;
+    char c[4];
+} dap_tag_type;
+
+
+
+typedef union {
+
+    struct _tg_header {
+	int_4 count;
+	int_4 spare1;
+	int_4 spare2;
+	dap_tag_type spare3;
+	int_4 free_list;
+    } header;
+
+    struct _tg_lines {
+	int_4 position;
+	int_4 length;
+	int_4 comment;
+	dap_tag_type type;
+	int_4 next;
+    } lines;
+
+} dap_tg_file_rec;
+
+#define dap_tg_byte_index(I,R) ( ((R)-1) *sizeof(dap_tg_file_rec) )
+#define dap_tg_header_rec(I) ( (I)->max_gels )
+
+/*
+** Comment files (*.CC?)
+*/
+#define DAP_COMMENT_SIZE 40
+typedef union {
+
+    struct _cc_header {
+	int_4 free_list;
+	int_4 count;
+	char spare[DAP_COMMENT_SIZE - sizeof(int_4)];
+    } header;
+
+    struct _cc_lines {
+	int_4 next;
+	char comment[DAP_COMMENT_SIZE];
+    } lines;
+
+} dap_cc_file_rec;
+
+#define dap_cc_byte_index(I,R) ( ((R)-1) * sizeof(dap_cc_file_rec) )
+#define dap_cc_header_rec(I) (1)
+
+
+/*
+** Useful variables
+*/
+typedef char IOString[200];
+
+typedef struct {
+    int max_gels;
+    int num_gels;
+    int num_contigs;
+    int max_gel_length;
+    int data_class;
+    
+    FILE *ar_fp;
+    FILE *rl_fp;
+    FILE *sq_fp;
+    FILE *tg_fp;
+    FILE *cc_fp;
+
+    IOString ar_file;
+    IOString rl_file;
+    IOString sq_file;
+    IOString tg_file;
+    IOString cc_file;
+} DapIO;
+
+
+
+
+
+extern void dap_read_tg(DapIO *io, int rec, dap_tg_file_rec *t);
+extern void dap_write_tg(DapIO *io, int rec, dap_tg_file_rec *t);
+extern void dap_read_ar(DapIO *io, int rec, dap_ar_file_rec *t);
+extern void dap_write_ar(DapIO *io, int rec, dap_ar_file_rec *t);
+extern void dap_read_rl(DapIO *io, int rec, dap_rl_file_rec *t);
+extern void dap_write_rl(DapIO *io, int rec, dap_rl_file_rec *t);
+extern void dap_read_cc(DapIO *io, int rec, dap_cc_file_rec *t);
+extern void dap_write_cc(DapIO *io, int rec, dap_cc_file_rec *t);
+extern void dap_read_sq(DapIO *io, int rec, dap_sq_file_rec t);
+extern void dap_write_sq(DapIO *io, int rec, dap_sq_file_rec t);
+extern char *dap_read_comment(DapIO *io, int_4 cp);
+extern void dap_open_for_read(DapIO *io, char *name, char *version);
+extern void dap_open_for_write(DapIO *io, char *name, char *version);
+extern void dap_close_files(DapIO *io);
+
+
+#endif /* _dapIO_h */
diff --git a/src/convert/flat_sd.c b/src/convert/flat_sd.c
new file mode 100644
index 0000000..672a59a
--- /dev/null
+++ b/src/convert/flat_sd.c
@@ -0,0 +1,143 @@
+#include <string.h>
+#include "list.h"
+#include "process.h"
+#include "misc.h"
+
+typedef char IOString[200];
+
+static IOString flat_file;
+static FILE *flat_fp;
+static List *next_list;
+
+static void set_file_names(char *name)
+{
+   strcpy(flat_file,name); strcat(flat_file,".flat");
+}
+
+void flat_sd_open_for_write(List *l)
+{
+    char *name;
+
+    name = assoc(l,db_name);
+
+    /*
+    ** Create file name
+    */
+    set_file_names(name);
+
+
+    /*
+    ** Check for existance
+    */
+    if ( file_exists(flat_file) ) crash("Flat file %s already exists\n",flat_file);
+
+    /*
+    ** Open files
+    */
+    if ( (flat_fp = (fopen(flat_file,"w"))) == NULL) crash("Cannot open file %s\n",flat_file);
+
+}
+
+void flat_sd_write_header(List *l)
+{
+    fprint_list_f(flat_fp,l);
+}
+
+void flat_sd_write_gel_data(List *l)
+{
+    fprint_list_f(flat_fp,l);
+}
+
+void flat_sd_write_contig_data(List *l)
+{
+    fprint_list_f(flat_fp,l);
+}
+
+
+void flat_sd_close(List *l)
+{
+    fclose(flat_fp);
+}
+
+void flat_sd_open_for_read(List *l)
+{
+    char *name;
+
+    name = assoc(l,db_name);
+
+    /*
+    ** Create file name
+    */
+    set_file_names(name);
+
+    /*
+    ** Open files
+    */
+    if ( (flat_fp = (fopen(flat_file,"r"))) == NULL) crash("Cannot open file %s\n",flat_file);
+
+    next_list = nil;
+
+}
+
+List *flat_sd_read_header()
+{
+    List *l;
+
+    if (isNil(next_list))
+	next_list=read_list(flat_fp);
+
+    if (isNil(next_list))
+	l = nil;
+    else {
+	if (strcmp(db_from,atomVal(car(next_list))))
+	    l = nil;
+	else {
+	    l = next_list;
+	    next_list = nil;
+	}
+    }
+
+    return l;
+}
+
+List *flat_sd_read_gel_data()
+{
+    List *l;
+
+    if (isNil(next_list))
+	next_list=read_list(flat_fp);
+
+    if (isNil(next_list))
+	l = nil;
+    else {
+	if (strcmp(gel_rec,atomVal(car(next_list))))
+	    l = nil;
+	else {
+	    l = next_list;
+	    next_list = nil;
+	}
+    }
+
+    return l;
+}
+
+List *flat_sd_read_contig_data()
+{
+    List *l;
+
+    if (isNil(next_list))
+	next_list=read_list(flat_fp);
+
+    if (isNil(next_list))
+	l = nil;
+    else {
+	if (strcmp(contig_rec,atomVal(car(next_list))))
+	    l = nil;
+	else {
+	    l = next_list;
+	    next_list = nil;
+	}
+    }
+
+    return l;
+}
diff --git a/src/convert/flat_sd.h b/src/convert/flat_sd.h
new file mode 100644
index 0000000..97e3558
--- /dev/null
+++ b/src/convert/flat_sd.h
@@ -0,0 +1,16 @@
+#ifndef _flat_sd_h
+#define _flat_sd_h
+
+#include "list.h"
+
+extern void flat_sd_open_for_write(List *l);
+extern void flat_sd_close(List *l);
+extern void flat_sd_write_contig_data(List *l);
+extern void flat_sd_write_gel_data(List *l);
+extern void flat_sd_write_header(List *l);
+extern void flat_sd_open_for_read(List *l);
+extern List *flat_sd_read_header();
+extern List *flat_sd_read_contig_data();
+extern List *flat_sd_read_gel_data();
+
+#endif /* _flat_sd_h */
diff --git a/src/convert/list.c b/src/convert/list.c
new file mode 100644
index 0000000..e0055eb
--- /dev/null
+++ b/src/convert/list.c
@@ -0,0 +1,416 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <stdarg.h>
+#include <ctype.h>
+#include "list.h"
+
+static char *mystrdup(char *s)
+/*
+ * A quick implementation of strdup()
+ */
+{
+    char *copy;
+    if ( (copy = (char *)malloc(strlen(s)+1)) != NULL ) strcpy(copy,s);
+    return copy;
+}
+
+Node *nodeList = nil;
+
+void destroy_node_list()
+{
+    Node *n,*m;
+
+    for(n=nodeList;!isNil(n);) {
+	m = cdr(n);
+	free(n);
+	n = m;
+    }
+
+}
+
+static Node *create_node (int type)
+{
+    Node *n;
+
+    if (isNil(nodeList)) {
+	n = (Node *) malloc(sizeof(Node));
+    } else {
+	n = nodeList;
+	nodeList = cdr(nodeList);
+    }
+    if (! isNil(n)){
+	n->type = type;
+	n->val.list.head = nil;
+	n->val.list.tail = nil;
+    }
+
+    return n;
+}
+
+
+static void free_node (Node *n)
+{
+
+    if (isNil(n))
+	;
+    else {
+	n->type = Node_List; /* entries on the nodeList are ALWAYS lists */
+	n->val.list.head = nil;
+	n->val.list.tail = nodeList;
+	nodeList = n;
+    }
+
+}
+
+
+#define create_node_list() create_node(Node_List)
+#define create_node_atom() create_node(Node_Atom)
+
+void destroy_list (Node *n)
+{
+    
+    if (isNil(n))
+	;
+    else if (isAtom(n)){
+	free(atomVal(n)); /* free the atom here */
+        free_node(n);
+    } else {
+	destroy_list(car(n));
+	destroy_list(cdr(n));
+	free_node(n);
+    }
+}
+
+
+
+Node *copy_list(Node *n)
+{
+    Node *new;
+
+    if (isNil(n))
+	new = nil;
+    else if (isAtom(n)){
+	new = create_node_atom();
+	new->val.atom.ptr = mystrdup(atomVal(n));
+    } else {
+	new = create_node_list();
+	new->val.list.head = copy_list(car(n));
+	new->val.list.tail = copy_list(cdr(n));
+    }
+
+    return new;
+}
+
+
+Node *atom_str(char *s)
+{
+    Node *new;
+
+    new = create_node_atom();
+    new->val.atom.ptr = mystrdup(s);
+
+    return new;
+}
+
+
+Node *atom_int(int i)
+{
+    char buffer[200];
+
+    sprintf(buffer,"%d",i);
+    return atom_str(buffer);
+}
+
+
+
+Node *build_list(Node *n1, ...)
+{
+    Node *root;
+    Node *next;
+    Node *ROOT;
+
+    va_list ap;
+
+    va_start(ap, n1);
+
+    root = ROOT = create_node_list();
+    
+    /*
+     * no action if n1 is nil
+     * we are trying to construct '()'
+     */
+
+    if (! isNil(n1)) {
+	root->val.list.head = n1;
+	next = va_arg(ap, Node *);
+	while (! isNil(next)) {
+	    Node *new;
+	    new = create_node_list();
+	    new->val.list.head = next;
+	    root->val.list.tail = new;
+	    root = new;
+	    next = va_arg(ap, Node *);
+	}
+    }
+
+    va_end(ap);
+
+    return ROOT;
+}
+
+
+
+Node *tail_list(Node *n)
+{
+    Node *tail;
+
+    if (isNil(n))
+	tail = nil;
+    else if (isAtom(n))
+	tail = nil;
+    else {
+	for ( tail = n; !isNil(cdr(tail)); tail = cdr(tail)) ;
+    }
+
+    return tail;
+}
+
+
+Node *join_list(Node *n1,...)
+{
+    Node *tail;
+    Node *ROOT;
+    Node *next;
+
+    va_list ap;
+
+    va_start(ap, n1);
+
+    ROOT = n1;
+    tail = tail_list(ROOT);
+
+    next = va_arg(ap, Node *);
+    while (! isNil(next)) {
+	Node *newtail;
+	newtail = tail_list(next);
+	tail->val.list.tail = next;
+	tail = newtail;
+	next = va_arg(ap, Node *);
+    }
+
+    va_end(ap);
+
+    return ROOT;
+}
+
+
+
+static void _print_list (Node *n)
+{
+
+    if ( isNil(n) ) {
+	printf("nil ");
+    } else if (isAtom(n))
+	printf("%s ", atomVal(n));
+    else {
+	Node *next;
+	printf("( ");
+	for (next = n; ! isNil(next); next = cdr(next))
+	    _print_list(car(next));
+	printf(") ");
+    }
+
+}
+
+void print_list (Node *n)
+{
+    _print_list(n);
+    nl;
+}
+
+
+
+static void printAtom(FILE *f, Atom *n)
+{
+    char *val = atomVal(n);
+    char *q;
+
+    for (q=val; *q && !(isspace(*q)) && *q!='"' && *q!='(' && *q!=')'; q++);
+
+    if (*q) putc('"',f);
+
+    for (; *val; val++) {
+	if (*val=='"') putc('"',f);
+	putc(*val,f);
+    }
+
+    if (*q) putc('"',f);
+
+}
+
+
+static void _print_list_f (FILE *f,Node *n,int level)
+{
+
+    if ( isNil(n) ) {
+	fprintf(f,"nil ");
+    } else if (isAtom(n)) {
+	printAtom(f,n);
+	fprintf(f," ");
+    } else {
+	Node *next;
+	int i;
+	if (level) fprintf(f,"\n");
+	for (i=0;i<level;i++) fprintf(f,"  ");
+	fprintf(f,"( ");
+	for (next = n; ! isNil(next); next = cdr(next))
+	    _print_list_f(f,car(next),level+1);
+	fprintf(f,") ");
+    }
+
+}
+
+void print_list_f (Node *n)
+{
+    _print_list_f(stdout, n,0);
+    nl;
+}
+
+
+void fprint_list_f (FILE *f, Node *n)
+{
+    _print_list_f(f, n,0);
+    fnl(f);
+}
+
+
+Node *index_list(Node *n, Atom *a)
+{
+    Node *index;
+
+    if (! isList(n) || ! isAtom(a))
+	index = nil;
+    else {
+	Node *i;
+	int found = 0;
+	for (i = n; ! isNil(i) && !found; i=cdr(i)) {
+	    index = car(i);
+	    if (isList(index) && isAtom(car(index))) {
+		found = (strcmp(atomVal(car(index)),atomVal(a))==0);
+	    }
+	}
+	if (! found) index = nil;
+    }
+
+    return index;
+}
+
+
+    
+
+Node *index_list_by_str(Node *n, char *a)
+{
+    Atom *atom;
+    List *node;
+
+    atom = atom_str(a);
+
+    node = index_list(n,atom);
+
+    destroy_list(atom);
+
+    return node;
+}
+    
+
+
+char *assoc(Node *n, char *a)
+{
+    return atomVal(car(cdr(index_list_by_str(n,a))));
+
+}
+
+
+
+
+int read_string(FILE *fp, char **s)
+{
+    char buff[4096];
+    int c;
+    int len;
+    int ret;
+
+    /* skip over while space */
+    for (c = getc(fp);c != EOF && isspace(c); c= getc(fp));
+
+    len = 0;
+    ret = 0;
+    if (c == EOF)
+	*s = NULL;
+    else if (c == '"') {
+	int l;
+	for (c = getc(fp), l = getc(fp);
+	     c != EOF && !(c=='"' && (l == EOF ||  l!='"') );
+	     c=l,l = getc(fp)) {
+	    buff[len++] = c;
+	    if (c=='"' && l=='"') {
+		c = '\0';
+		l = getc(fp);
+	    }
+	}
+	ret = 1;
+	buff[len] = '\0';
+	*s = (char *)mystrdup(buff);
+    } else {
+	buff[len++] = c;
+	for (c = getc(fp); c != EOF && !isspace(c); c = getc(fp))
+	    buff[len++] = c;
+	buff[len] = '\0';
+	*s = (char *)mystrdup(buff);
+    }
+
+    return ret;
+}
+
+
+
+List *read_list(FILE *fp)
+{
+    List *l;
+    char *s;
+    int q;
+
+    q = read_string(fp,&s);
+
+    if (s == NULL)
+	l = nil;
+    else if ( strcmp(s,")") == 0 && !q) {
+	free(s);
+	l = nil;
+    } else if ( strcmp(s,"(") == 0 && !q) {
+	List *item, *t;
+	free(s);
+	l = nil;
+	t = nil;
+	for (item = read_list(fp); ! isNil(item); item = read_list(fp)) {
+	    List *m;
+	    m = create_node_list();
+	    m->val.list.head = item;
+	    if (isNil(l)) l = m;
+	    if (isNil(t))
+		t = m;
+	    else {
+		t->val.list.tail = m;
+		t = m;
+	    }
+	}
+	if (isNil(l)) l = create_node_list();
+    } else {
+	l = atom_str(s);
+    }
+
+    return l;
+
+}
+
diff --git a/src/convert/list.h b/src/convert/list.h
new file mode 100644
index 0000000..dbbd56d
--- /dev/null
+++ b/src/convert/list.h
@@ -0,0 +1,52 @@
+#ifndef _list_h
+#define _list_h
+#include <stdio.h>
+#include <stdarg.h>
+
+#define Node_Nil  0
+#define Node_List 1
+#define Node_Atom 2
+#define nl putchar('\n')
+#define fnl(F) putc('\n',F)
+
+
+#define nil NULL
+#define isAtom(N) ((N)==nil?0:(N)->type==Node_Atom)
+#define isList(N) ((N)==nil?0:(N)->type==Node_List)
+#define isNil(N)  ((N)==nil)
+#define car(N) ( isNil(N)||isAtom(N)?nil:(N)->val.list.head )
+#define cdr(N) ( isNil(N)||isAtom(N)?nil:(N)->val.list.tail )
+#define atomVal(N) ( isAtom(N)?(N)->val.atom.ptr:NULL )
+
+typedef struct _node{
+    int type;
+    union {
+	struct {
+	    struct _node *head;
+	    struct _node *tail;
+	} list;
+	struct {
+	    char *ptr;
+	    char *spare;
+	} atom;
+    }val;
+} Node,List,Atom;
+
+extern void destroy_list(Node *n);
+extern void destroy_node_list();
+extern Node *copy_list(Node *n);
+extern Node *atom_str(char *s);
+extern Node *atom_int(int i);
+extern Node *build_list(Node *n1, ...);
+extern Node *tail_list(Node *n);
+extern Node *join_list(Node *n1,...);
+extern Node *index_list(Node *n, Atom *a);
+extern void print_list (Node *n);
+extern Node *index_list_by_str(Node *n, char *s);
+extern char *assoc(Node *n, char *s);
+extern void print_list_l (Node *n);
+extern void fprint_list_l (FILE *f, Node *n);
+extern List *read_list(FILE *f);
+extern void fprint_list_f (FILE *f, Node *n);
+
+#endif /* _list_h */
diff --git a/src/convert/main.c b/src/convert/main.c
new file mode 100644
index 0000000..3471d30
--- /dev/null
+++ b/src/convert/main.c
@@ -0,0 +1,158 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "list.h"
+#include "process.h"
+#include "misc.h"
+
+char *types[] = {
+    db_type_RS_flat_file,
+    db_type_SD_flat_file,
+    db_type_sap,
+    db_type_early_xdap,
+    db_type_middle_xdap,
+    db_type_late_xdap,
+};
+
+char *vtypes[] = {
+    db_vtype_RS_flat_file,
+    db_vtype_SD_flat_file,
+    db_vtype_sap,
+    db_vtype_early_xdap,
+    db_vtype_middle_xdap,
+    db_vtype_late_xdap,
+};
+
+int all_types[] = {
+    0, /* db_type_RS_flat_file */
+    1, /* db_type_SD_flat_file */
+    2, /* db_type_sap */
+    3, /* db_type_early_xdap */
+    4, /* db_type_middle_xdap */
+    5, /* db_type_late_xdap */
+};
+
+int src_types[] = {
+    1, /* db_type_SD_flat_file */
+    4, /* db_type_middle_xdap */
+    5, /* db_type_late_xdap */
+};
+
+int tgt_types[] = {
+    1, /* db_type_SD_flat_file */
+    5, /* db_type_late_xdap */
+};
+
+
+typedef char IOString[200];
+
+
+
+static void get_db(char *prompt,
+		   int *use_types,
+		   int ntypes,
+		   char *name,
+		   char *version,
+		   char *type)
+{
+    int i;
+    IOString ctype;
+    int itype;
+
+    printf("%s\n",prompt);
+    printf("Available types are:\n");
+
+    for (i=0;i<ntypes;i++)
+	printf("%d. %s\n",i,vtypes[use_types[i]]);
+
+    printf("\n");
+    do {
+	printf("Database type? ");
+	if ( gets(ctype) == NULL )
+	    itype = -1;
+	else
+	    itype = atoi(ctype);
+    } while (itype<0 || itype>=ntypes);
+
+    strcpy(type,types[use_types[itype]]);
+
+    printf("Database name? ");
+    gets(name);
+
+    printf("Database version? ");
+    gets(version);
+
+    printf("\n");
+}
+
+
+
+
+
+int main ()
+{
+
+    List *f;
+    List *t;
+    IOString src_name;
+    IOString src_version;
+    IOString src_type;
+    IOString tgt_name;
+    IOString tgt_version;
+    IOString tgt_type;
+
+    printf("Covert Project Database\nVersion 1.0, 18 November 1992\n");
+
+#define NUMBER(A) ((int)(sizeof(A) / sizeof((A)[0])))
+    get_db("Please enter database to convert:\n",src_types, NUMBER(src_types), src_name, src_version, src_type);
+    get_db("Please enter database to create:\n",tgt_types, NUMBER(tgt_types), tgt_name, tgt_version, tgt_type);
+
+    if (strcmp(src_type,tgt_type)==0)
+	crash("Cannot convert a database to another of the same type\n");
+
+    f = build_list (
+		    atom_str(db_from),
+		    build_list(
+			       atom_str(db_name),
+			       atom_str(src_name),
+			       nil),
+		    build_list(
+			       atom_str(db_version),
+			       atom_str(src_version),
+			       nil),
+		    build_list(
+			       atom_str(db_type),
+			       atom_str(src_type),
+			       nil),
+		    nil);
+
+    t = build_list (
+		    atom_str(db_to),
+		    build_list(
+			       atom_str(db_name),
+			       atom_str(tgt_name),
+			       nil),
+		    build_list(
+			       atom_str(db_version),
+			       atom_str(tgt_version),
+			       nil),
+		    build_list(
+			       atom_str(db_type),
+			       atom_str(tgt_type),
+			       nil),
+		    nil);
+
+    process (f,t);
+
+    destroy_list(f);
+    destroy_list(t);
+
+    destroy_node_list(); /* garbage collect */
+
+    printf("\nConversion completed\n");
+
+    return 0;
+}
+
+
diff --git a/src/convert/makefile-alpha b/src/convert/makefile-alpha
new file mode 100644
index 0000000..30cf237
--- /dev/null
+++ b/src/convert/makefile-alpha
@@ -0,0 +1,45 @@
+#
+# Makefile for convert (Alpha version)
+#
+
+SRC = .
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)/alpha-binaries
+
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact -I$(MISCSRC)
+LD = cc
+LDFLAGS = -g -non_shared
+LIBS = $(MISCBIN)/misc.a
+
+#
+# Programs made by this makefile
+#
+PROGS = convert
+
+all : $(PROGS)
+
+#
+# Where the wild things are
+#
+
+OBJS = \
+	list.o \
+	main.o \
+	dapIO.o \
+	dapDB.o \
+	bapIO.o \
+	bapDB.o \
+	process.o \
+	flat_sd.o
+
+convert: $(OBJS)
+	$(LD) $(LDFLAGS) -o convert $(OBJS) $(LIBS) $(LDFLAGS)
+#
+# Special dependencies
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean :
+	/bin/rm -f $(PROGS) $(OBJS) *~
diff --git a/src/convert/makefile-dec b/src/convert/makefile-dec
new file mode 100644
index 0000000..dd289ef
--- /dev/null
+++ b/src/convert/makefile-dec
@@ -0,0 +1,48 @@
+#
+# Makefile for convert (DEC Ultrix version)
+# *** requires gnu make ***
+#
+
+SRC = .
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+
+CC = c89
+CFLAGS = -g -std -w0 -I$(MISCSRC)
+# -static is only necessary for gcc 2.1
+LDFLAGS =
+LIBS = $(MISCBIN)/misc.a
+
+#
+# Programs made by this makefile
+#
+PROGS = convert
+
+all : $(PROGS)
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+OBJS = \
+	list.o \
+	main.o \
+	dapIO.o \
+	dapDB.o \
+	bapIO.o \
+	bapDB.o \
+	process.o \
+	flat_sd.o
+
+convert: $(OBJS)
+	$(CC) $(CFLAGS) -o convert $(OBJS) $(LIBS) $(LDFLAGS)
+#
+# Special dependencies
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+clean :
+	/bin/rm -f $(PROGS) $(OBJS) *~
diff --git a/src/convert/makefile-sgi b/src/convert/makefile-sgi
new file mode 100644
index 0000000..27abca2
--- /dev/null
+++ b/src/convert/makefile-sgi
@@ -0,0 +1,46 @@
+#
+# Makefile for convert (SGI version)
+#
+
+SHELL=/bin/sh
+
+SRC = .
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+
+CC = cc
+CFLAGS = -g -ansi -prototypes -I$(MISCSRC)
+LDFLAGS = -g
+LIBS = $(MISCBIN)/misc.a
+
+#
+# Programs made by this makefile
+#
+PROGS = convert
+
+all : $(PROGS)
+
+#
+# Where the wild things are
+#
+
+OBJS = \
+	list.o \
+	main.o \
+	dapIO.o \
+	dapDB.o \
+	bapIO.o \
+	bapDB.o \
+	process.o \
+	flat_sd.o
+
+convert: $(OBJS)
+	$(CC) $(CFLAGS) -o convert $(OBJS) $(LIBS) $(LDFLAGS)
+#
+# Special dependencies
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean :
+	/bin/rm -f $(PROGS) $(OBJS) *~
diff --git a/src/convert/makefile-solaris b/src/convert/makefile-solaris
new file mode 100644
index 0000000..2bb558a
--- /dev/null
+++ b/src/convert/makefile-solaris
@@ -0,0 +1,49 @@
+#
+# Makefile for convert (SunOS version)
+#
+
+SRC = .
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS) -I$(MISCSRC)
+# -static is only necessary for gcc 2.1
+LDFLAGS = -static
+LIBS = $(MISCBIN)/misc.a
+
+#
+# Programs made by this makefile
+#
+PROGS = convert
+
+all : $(PROGS)
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+OBJS = \
+	list.o \
+	main.o \
+	dapIO.o \
+	dapDB.o \
+	bapIO.o \
+	bapDB.o \
+	process.o \
+	flat_sd.o
+
+convert: $(OBJS)
+	$(CC) $(CFLAGS) -o convert $(OBJS) $(LIBS) $(LDFLAGS)
+#
+# Special dependencies
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+clean :
+	/bin/rm -f $(PROGS) $(OBJS) *~
diff --git a/src/convert/makefile-sun b/src/convert/makefile-sun
new file mode 100644
index 0000000..c51d06e
--- /dev/null
+++ b/src/convert/makefile-sun
@@ -0,0 +1,49 @@
+#
+# Makefile for convert (SunOS version)
+#
+
+SRC = ..
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS) -I$(MISCSRC)
+# -static is only necessary for gcc 2.1
+LDFLAGS = -static
+LIBS = $(MISCBIN)/misc.a
+
+#
+# Programs made by this makefile
+#
+PROGS = convert
+
+all : $(PROGS)
+
+#
+# Where the wild things are
+#
+SRC = .
+
+OBJS = \
+	list.o \
+	main.o \
+	dapIO.o \
+	dapDB.o \
+	bapIO.o \
+	bapDB.o \
+	process.o \
+	flat_sd.o
+
+convert: $(OBJS)
+	$(CC) $(CFLAGS) -o convert $(OBJS) $(LIBS) $(LDFLAGS)
+#
+# Special dependencies
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+clean :
+	/bin/rm -f $(PROGS) $(OBJS) *~
diff --git a/src/convert/newtypes.h b/src/convert/newtypes.h
new file mode 100644
index 0000000..23ab37b
--- /dev/null
+++ b/src/convert/newtypes.h
@@ -0,0 +1,11 @@
+#ifndef _NEWTYPES_h
+#define _NEWTYPES_h
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+#endif
diff --git a/src/convert/process.c b/src/convert/process.c
new file mode 100644
index 0000000..d2cfc99
--- /dev/null
+++ b/src/convert/process.c
@@ -0,0 +1,264 @@
+#include <string.h>
+#include "list.h"
+#include "process.h"
+#include "dapDB.h"
+#include "bapDB.h"
+#include "flat_sd.h"
+#include "misc.h"
+
+void open_for_read(List *from)
+{
+    char *a;
+
+    if ( (a = assoc(from,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_open_for_read(from);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	xdap_middle_open_for_read(from);
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_open_for_read(from);
+    else
+	crash("Source database type not supported\n");
+}
+
+void open_for_write(List *to)
+{
+    char *a;
+
+    if ( (a = assoc(to,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_open_for_write(to);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_open_for_write(to);
+    else
+	crash("Source database type not supported\n");
+}
+
+void close_files(List *to)
+{
+    char *a;
+
+    if ( (a = assoc(to,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_close(to);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	xdap_middle_close(to);
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_close(to);
+    else
+	crash("Source database type not supported\n");
+}
+
+
+
+
+
+List *read_header(List *from)
+{
+    char *a;
+
+    if ( (a = assoc(from,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	return (List *)flat_sd_read_header();
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	return (List *)xdap_middle_read_header();
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	return (List *)xdap_late_read_header();
+    else
+	crash("Source database type not supported\n");
+    return (List *)0; /* stops warnings */
+}
+
+
+void write_header(List *to, List *l)
+{
+    char *a;
+
+    if ( (a = assoc(to,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_write_header(l);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_write_header(l);
+    else
+	crash("Source database type not supported\n");
+}
+
+
+List *read_gel_data(List *from)
+{
+    char *a;
+
+    if ( (a = assoc(from,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	return (List *)flat_sd_read_gel_data();
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	return (List *)xdap_middle_read_gel_data();
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	return (List *)xdap_late_read_gel_data();
+    else
+	crash("Source database type not supported\n");
+    return (List *)0; /* stops warnings */
+}
+
+void write_gel_data(List *to, List *l)
+{
+    char *a;
+
+    if ( (a = assoc(to,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_write_gel_data(l);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_write_gel_data(l);
+    else
+	crash("Source database type not supported\n");
+}
+
+List *read_contig_data(List *from)
+{
+    char *a;
+
+    if ( (a = assoc(from,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	return (List *)flat_sd_read_contig_data();
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	return (List *)xdap_middle_read_contig_data();
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	return (List *)xdap_late_read_contig_data();
+    else
+	crash("Source database type not supported\n");
+    return (List *)0; /* stops warnings */
+}
+
+void write_contig_data(List *to, List *l)
+{
+    char *a;
+
+    if ( (a = assoc(to,db_type)) == NULL)
+	crash("Type not specified for source database\n");
+    else if (strcmp(a,db_type_RS_flat_file) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_SD_flat_file) == 0)
+	flat_sd_write_contig_data(l);
+    else if (strcmp(a,db_type_sap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_early_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_middle_xdap) == 0)
+	crash("Source database type not supported\n");
+    else if (strcmp(a,db_type_late_xdap) == 0)
+	xdap_late_write_contig_data(l);
+    else
+	crash("Source database type not supported\n");
+}
+
+void process(List *from, List *to)
+{
+    List *l;
+
+
+    /*
+    ** Initialise read
+    */
+    open_for_read(from);
+    open_for_write(to);
+
+    l = read_header(from);
+    write_header(to,l);
+    destroy_list(l);
+
+    /*
+    ** Process Gels
+    */
+    for (l = read_gel_data(from);
+	 !isNil(l);
+	 l = read_gel_data(from)) {
+	write_gel_data(to,l);
+	destroy_list(l);
+    }
+
+    /*
+    ** Process Contigs
+    */
+    for (l = read_contig_data(from);
+	 !isNil(l);
+	 l = read_contig_data(from)) {
+	write_contig_data(to,l);
+	destroy_list(l);
+    }
+
+
+    /*
+    ** Tidy up read
+    */
+    close_files(from);
+    close_files(to);
+	
+}
+
+
+
+
+
+
diff --git a/src/convert/process.h b/src/convert/process.h
new file mode 100644
index 0000000..b9af024
--- /dev/null
+++ b/src/convert/process.h
@@ -0,0 +1,96 @@
+#ifndef _process_h
+#define _process_h
+
+#include "list.h"
+
+#define db_from "From"
+#define db_to "To"
+
+
+#define db_files "Files"
+#define db_files_seq "Sequence"
+#define db_files_arch "Archive"
+#define db_files_rel "Relationships"
+#define db_files_raw "Raw-Data"
+#define db_files_tag "Tag"
+#define db_files_com "Comment"
+#define db_files_flat "Flat-File"
+
+
+#define db_rec "Database"
+#define db_name "Name"
+#define db_version "Version"
+#define db_max_gels "Max-Gels"
+#define db_max_db_size "Max-Database-Size"
+#define db_max_gel_length "Max-Gel-Length"
+#define db_max_contigs "Max-Contigs"
+#define db_data_class "Data-Class"
+#define db_num_gels "Num-Gels"
+#define db_num_contigs "Num-Contigs"
+
+#define db_type "Type"
+#define db_type_RS_flat_file "RS-Flat-File"
+#define db_type_SD_flat_file "SD-Flat-File"
+#define db_type_sap "Sap"
+#define db_type_early_xdap "Early-Xdap"
+#define db_type_middle_xdap "Middle-Xdap"
+#define db_type_late_xdap "Late-Xdap"
+#define db_vtype_RS_flat_file "Flat file - created with sapf"
+#define db_vtype_SD_flat_file "Flat file - created with this program"
+#define db_vtype_sap "Sap database - the original format with three database files"
+#define db_vtype_early_xdap "Early xdap database - with the raw data (RD) file"
+#define db_vtype_middle_xdap "xdap database"
+#define db_vtype_late_xdap "xbap database"
+
+#define contig_rec "Contigs"
+#define contig_index "Contig-Index"
+#define contig_length "Length"
+#define contig_left_end "Left-End"
+#define contig_right_end "Right-End"
+
+#define gel_rec "Gels"
+#define gel_index "Gel-Index"
+#define gel_name "Name"
+#define gel_seq "Sequence"
+#define gel_pos "Pos-In-Contig"
+#define gel_length "Length"
+#define gel_comp "Complemented"
+#define gel_l_nbr "Left-Nbr"
+#define gel_r_nbr "Right-Nbr"
+#define gel_rd_length "RD-Length"
+#define gel_rd_cut "RD-Cut-Off-Position"
+#define gel_rd_ulen "RD-Usable-Length"
+#define gel_rd_type "Trace-File-Type"
+#define gel_rd_file "Trace-File-Name"
+#define gel_l_cut_seq "Left-Cutoff"
+#define gel_r_cut_seq "Right-Cutoff"
+#define gel_annotation "Annotation"
+#define gel_an_pos "Position"
+#define gel_an_len "Length"
+#define gel_an_type "Type"
+#define gel_an_comment "Comment"
+#define gel_edits "Edits"
+#define gel_ed_pos "Position"
+#define gel_ed_type "Type"
+#define gel_ed_char "Character"
+#define gel_ed_delete "Delete"
+#define gel_ed_insert "Insert"
+#define gel_ed_op "Op"
+#define gel_ed_base "Base"
+#define gel_ed_base_pos "Pos"
+
+#define tag_rec "TagTypes"
+#define tag_name "Name"
+#define tag_type "Type"
+#define tag_fg "Foreground-Colour"
+#define tag_bg "Background-Colour"
+#define tag_dt "Default-Comment"
+
+
+
+extern void write_header(List *to, List *l);
+extern void write_gel_data(List *to, List *l);
+extern void write_contig_data(List *to, List *l);
+extern void process(List *from, List *to);
+
+#endif /* _process_h */
diff --git a/src/cop/COP.GUIDE b/src/cop/COP.GUIDE
new file mode 100644
index 0000000..9b54859
--- /dev/null
+++ b/src/cop/COP.GUIDE
@@ -0,0 +1,79 @@
+		  Checking Xdap Databases For Errors
+			Using COP Version 1.1
+
+			      Simon Dear
+			    16 March 1992
+
+
+0. Introduction
+
+The program cop checks for editing errors in xdap project databases.
+It uses a robust method that can detect insertions, deletions and
+changes that have been inadvertently made. In later versions places
+where there is reliant on traces of insufficient quality will be
+detectable also.
+
+
+1. Usage
+
+The program allows the user to specify, the project name, the project
+version, the consensus calculation cutoff percentage and a search path
+for where traces are to be found:
+
+	cop [-p project]
+	    [-v version]
+	    [-c consensus_cutoff_percentage] 
+	    [-r raw_data_search_path]
+	    [-h]
+
+An example: cop can be run on F59B2.??0 with the command:
+
+	cop -p f59b2 -v 0 -r ~mmm/F59B2 -c 66
+
+If the project and/or version are not specified, the user is prompted
+for them. The default consensus cutoff percentage is 100%
+
+If a trace file cannot be found in the current working directory and
+the -r option is not used, the environment variable RAWDATA is used to
+find the file.
+
+
+2. How cop works
+
+Cop works on a problem exclusion principle. It ignores problem areas
+(places where there are insertions, deletions, changes, or where the
+trace quality is poor) and concentrates on identifying places where
+the coverage is good. It then reports regions where coverage is poor.
+Unfortunately it isn't possible to provide explanations using this
+approach.
+	     
+The algorithm is as follows, and is performed on each contig.
+
+a) The consensus for the contig is calculated and a "coverage"
+array (to record areas of good coverage) is initialised.
+
+b) Each gel reading in the contig is investigated. Information about
+the trace file (its name, and size of cutoffs) is read from the
+database. The trace file is read in.
+
+c) The consensus of the region in which the gel reading lies is
+aligned with the clipped trace sequence. If necessary, the consensus
+is complemented. The alignment is performed using Myers and Miller's
+algorithm [1], in the incarnation supplied in the fasta package.
+
+d) A map is made relating the bases in the raw sequence and the bases
+in the consensus. Places where trace quality is poor are removed from
+this map.  For each region in the consensus where there is perfect
+alignment (with no deletions, insertions, changes but are mapped) the
+coverage array is updated.  Each entry in this array represents a pairs
+of adjacent bases, and both must be adjacent in the alignment for the
+entry to be marked as covered.
+
+e) Once all the readings in the contig have been processed, all gaps
+in the coverage are reported.
+
+
+A. References
+
+[1] Myers, E.W. and Miller, W. 1988. Optimal alignments in linear
+space. CABIOS 4(1):11-17.
diff --git a/src/cop/README b/src/cop/README
new file mode 100644
index 0000000..076ffc7
--- /dev/null
+++ b/src/cop/README
@@ -0,0 +1,35 @@
+Guide to this directory                Simon Dear, 3 December 1991
+------------------------------------------------------------------
+
+In this directory there are all the sources required to make COP, plus
+documentation.
+
+COP.GUIDE
+
+	Guide to the design and use of cop
+
+makefile
+
+	Makefile for cop.
+
+cop.c
+
+	Main program module.
+
+misc.c misc.h
+
+	Useful everyday routines.
+
+xdap_db.c xdap_db.h
+
+	Routines to read xdap databases.
+
+getSeq.c fpoint.c fpoint.h opp.c opp.h seq.c seq.h seqIOABI.c
+seqIOABI.h seqIOALF.c seqIOALF.h seqIOEdit.h seqIOPlain.c
+seqIOPlain.h mystdlib.h
+
+	Routines to read in trace files. Taken from ted.
+
+align.c llin.c llin.h uascii.gbl upam.gbl
+	
+	Alignment routines. Taken from fasta.
diff --git a/src/cop/TODO b/src/cop/TODO
new file mode 100644
index 0000000..2037a64
--- /dev/null
+++ b/src/cop/TODO
@@ -0,0 +1,33 @@
+		     1001 Ways to Make COP Better
+				   
+			      Simon Dear
+			   3 December 1991
+				   
+
+1. Incorporate quality considerations.
+Certain problems can slip through the detetion net. An example are
+overcalled bases in the consensus. Currently the whole raw data read
+is used, and overcalling will agree with the consensus. Overcalled
+runs of bases are very symptomatic of poor trace quality. If these
+could be detected and excluded from the coverage map, the problem
+would be detected.
+
+2. Running the program on contigs and regions of contigs.
+Cop could then be run on partially completed project databases.
+
+3. Provide explanations as to why a region is problematic.
+Maybe this isn't worth the effort to implement, but would be useful.
+
+4. Take into account strandedness.
+This would help detect regions where the coverage by good quality
+traces is poor.
+
+5. Interface to contig editor.
+The contig editor could center on and highlight regions where problems
+exist, speeding up the process.
+
+6. Consensus coverage.
+Currently if 10 good readings disagree with a region and just 1
+mediocre reading agrees, no problem is detected.
+
+7-1001. Make it detect 995 times as many problems.
diff --git a/src/cop/align.c b/src/cop/align.c
new file mode 100644
index 0000000..e3feb94
--- /dev/null
+++ b/src/cop/align.c
@@ -0,0 +1,294 @@
+/*      align.c
+	protein driver for linear sequence comparison method
+*/
+
+#include <stdio.h>
+/*#include <ctype.h>*/
+
+#define TRUE 1
+#define FALSE 0
+
+#ifndef BIGMEM
+#define MAXTST 2000	/* longest test sequence */
+#define MAXLIB 10000
+#define MAXDIAG (MAXTST+MAXLIB)
+#else
+#define MAXTST 10000
+#define MAXLIB 50000
+#define MAXDIAG (MAXTST+MAXLIB)
+#endif
+
+FILE *outfd;		/* fd for output file */
+
+/* globals for matching */
+
+long lmark;		/* position in library file from ftell() */
+int nlib, onlib;
+long ntt, ontt;		/* number of library sequences, number of
+				residues scanned*/
+char libstr[21];	/* partial title from library sequence */
+char name0[11], name1[11];	/* for labeling output */
+int ixstat;		/* >0 if annotations displayed */
+
+char *aa0, *aa1;	/* amino acid sequence data */
+int *res;
+
+int nc, nd, gscore;
+char *seqc0, *seqc1;	/* aligned sequences */
+
+int dnaseq, lcont;
+int bktup, bkfact, scfact, bestoff, bestscale, histint, bestmax;
+
+int maxn, maxt;		/* max space for lib sequence */
+int n0, n1, nd, noff;	/* length of aa0, length of aa1, n0+n1,
+				diagonal offset */
+long loffset = 0l;		/* offset into sequence */
+
+/*  the following are defaults for values that are read by
+    pam.c from *.mat if SMATRIX is defined */
+
+int nshow; char rline[20],sline[20];
+
+/* output options */
+int showall,markx, llen;
+
+char ttitle[60], ltitle[60];
+int smark[4] = {-10000,-10000,-10000,-10000};
+int min0,min1,max0,max1;
+
+long tstart, tscan, tdone, stime();
+
+extern int optind;
+char *libenv, *aaenv, *smptr;
+char smstr[40];
+
+#include "upam.gbl"		/* includes pam array */
+
+main(argc, argv)
+        int argc; char **argv;
+{
+	char tname[40], lname[40], qline[40];
+	int itemp, iln, nln;
+	char *calloc(), *getenv(), *cptr, *bp, *strchr();
+	float percent;
+
+	initenv(argc,argv);
+
+	if ((aa0=calloc(MAXTST+MAXLIB,sizeof(char)))==0) {
+		fprintf(stderr," cannot allocate sequence array\n");
+		exit(1);
+		}
+	maxn = MAXTST+MAXLIB;
+
+        if (argc-optind < 3) {
+                printf(" align 1.0 [April, 1988] compares two sequences\n");
+	l1:	printf(" first sequence file name: ");
+		fflush(stdout);
+		fgets(tname,40,stdin);
+		if (tname[strlen(tname)-1]=='\n') tname[strlen(tname)-1]='\0';
+		if (tname[0]=='\0') goto l1;
+        	if ((n0=getseq(tname,aa0,maxn,&dnaseq))==0) {
+                  fprintf(stderr," %s : %s sequence not found\n",tname,sqtype);
+                  goto l1;
+                  }
+
+		resetp(dnaseq);
+			
+	l2:	printf(" second sequence file name: ");
+		fflush(stdout);
+		fgets(lname,40,stdin);
+		if (lname[strlen(lname)-1]=='\n') lname[strlen(lname)-1]='\0';
+		if (*lname==0) goto l2;
+		}
+	else {
+		strncpy(tname,argv[optind+1],40);
+        	if ((n0=getseq(tname,aa0,maxn,&dnaseq))==0) {
+                  fprintf(stderr," %s : %s sequence not found\n",tname,sqtype);
+                  exit(1);
+                  }
+		resetp(dnaseq);
+		strncpy(lname,argv[optind+2],40);
+		}
+
+	strncpy(name0,tname,6); name0[6]='\0';
+
+	fprintf(stderr," %s : %4d %-s\n",tname, n0, sqnam);
+
+	aa1 = aa0 + n0 + 2;
+	maxn -= n0 + 3;
+
+	openlib(lname,libenv);
+
+	tstart = stime();
+
+	n1=getlib(aa1,maxn,libstr,&lmark,&lcont);
+	strncpy(name1,libstr,6); 
+	if ((bp = strchr(name1,' '))!=NULL) *bp='\0'; name1[6]='\0';
+	gettitle(tname,ttitle,50);
+	gettitle(lname,ltitle,50);
+
+	initseq(n0+n1);
+
+	initpam2();	/* convert 1-d pam to 2-d pam2 */
+	gscore = -DIFF(aa0-1,aa1-1,n0,n1,pam2,-gdelval,-ggapval,res);
+
+	nc=calcons(aa0,n0,aa1,n1,res);
+	percent = (double)nd*100.0/(double)nc;
+
+	tdone = stime();
+
+	printf("%-50s %4d %s vs.\n%-50s %4d %s\n",ttitle,n0,sqnam,ltitle,n1,sqnam);
+	printf("%4.1f%% identity;\tOptimized score: %d\n",percent,gscore);
+	fixpam2();
+
+	outfd = stdout;
+	discons(seqc0,seqc1,nc);
+
+	printf("\nElapsed time: "); ptime(stdout,tdone-tstart); printf("\n");
+	}
+
+extern int *sascii, nascii[], aascii[];
+
+initenv(argc,argv)
+	int argc;
+	char **argv;
+{
+	char *cptr, *getenv();
+	int copt, getopt();
+	extern char *optarg;
+
+	libenv="\0";
+	aaenv="\0";
+
+	sascii = aascii;
+	pam = apam;
+	sq = aa;
+	hsq = haa;
+	nsq = naa;
+	dnaseq = 0;
+
+	showall = 1;
+
+	if ((cptr=getenv("LINLEN"))!=NULL) sscanf(cptr,"%d",&llen);
+	else llen = 60;
+	if (llen>=200) llen=200-1;
+	markx=0;
+	if ((cptr=getenv("MARKX"))==NULL) markx=0;
+	else sscanf(cptr,"%d",&markx);
+
+	while ((copt=getopt(argc,argv,"m:s:w:"))!=EOF)
+		switch(copt) {
+			case 'w': sscanf(optarg,"%d",&llen); break;
+			case 'm': sscanf(optarg,"%d",&markx); break;
+			case 's': strncpy(smstr,optarg,sizeof(smstr));
+				smptr = smstr;
+				if (initpam(smptr)) {
+					dnaseq= -1;
+					}
+				else smptr="\0";
+				break;
+			default : fprintf(stderr," illegal option -%c\n",copt);
+			}
+
+	optind--;
+
+	if (dnaseq>=0) {
+		if ((smptr=getenv("SMATRIX"))!=NULL && initpam(smptr)) {
+			dnaseq = -1;
+			}
+		else
+			smptr="\0";
+		}
+
+	if (strlen(smptr)>0) fprintf(stderr," using matrix file %s\n",smptr);
+	}
+
+resetp(dnaseq)
+	int dnaseq;
+{
+	if (dnaseq==1) pam = npam;
+	}
+
+initpam2()
+{
+	int i, j, k;
+
+	k=0;
+	for (i=0; i<nsq; i++)
+		for (j=0; j<=i; j++)
+			pam2[j][i] = pam2[i][j] = -pam[k++];
+	}
+
+fixpam2()
+{
+	int i, j;
+
+	for (i=0; i<nsq; i++)
+		for (j=0; j<nsq; j++)
+			pam2[i][j] = -pam2[i][j];
+	}
+
+int smin0, smin1, smins;	/* set bounds for discons */
+
+calcons(aa0,n0,aa1,n1,res)
+	char *aa0, *aa1;
+	int n0, n1;
+	int *res;
+{
+	int i0, i1;
+	int op, nc;
+	char *sp0, *sp1;
+	int *rp;
+
+	sp0 = seqc0;
+	sp1 = seqc1;
+	rp = res;
+	nc = nd = i0 = i1 = op = 0;
+	min0 = min1 = 0;
+
+	while (i0 < n0 || i1 < n1) {
+		if (op == 0 && *rp == 0) {
+			op = *rp++;
+			*sp0 = sq[aa0[i0++]];
+			*sp1 = sq[aa1[i1++]];
+			nc++;
+			if (*sp0++ == *sp1++) nd++;
+			}
+		else {
+			if (op==0) op = *rp++;
+			if (op>0) {
+				*sp0++ = '-';
+				*sp1++ = sq[aa1[i1++]];
+				op--;
+				nc++;
+				}
+			else {
+				*sp0++ = sq[aa0[i0++]];
+				*sp1++ = '-';
+				op++;
+				nc++;
+				}
+			}
+		}
+	max0 = max1 = nc;
+	return nc;
+	}
+
+initseq(seqsiz)		/* initialize arrays */
+	int seqsiz;
+{
+	char *calloc();
+
+	res = (int *)calloc(seqsiz,sizeof(int));
+	seqc0=calloc(seqsiz,sizeof(char));
+	seqc1=calloc(seqsiz,sizeof(char));
+	if (res==NULL || seqc0==NULL || seqc1==NULL)
+		{fprintf(stderr,"cannot allocate consensus arrays %d\n",seqsiz);
+		 exit();}
+	}
+
+freeseq()
+{
+	free(seqc0); free(seqc1);
+	}
+
diff --git a/src/cop/bapIO.c b/src/cop/bapIO.c
new file mode 100644
index 0000000..1d7cf56
--- /dev/null
+++ b/src/cop/bapIO.c
@@ -0,0 +1,505 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "bapIO.h"
+#include "misc.h"
+
+
+
+#define swap_int4(src, dst) \
+    ((char *)&(dst))[0] = ((char *) &(src))[3];\
+    ((char *)&(dst))[1] = ((char *) &(src))[2];\
+    ((char *)&(dst))[2] = ((char *) &(src))[1];\
+    ((char *)&(dst))[3] = ((char *) &(src))[0];
+
+
+/*
+** Tag File IO
+*/
+
+void swap_tg_file_rec(bap_tg_file_rec *src, bap_tg_file_rec *dst)
+{
+    int i=1;
+
+    if (*(char*)&i) {
+	swap_int4(src->lines.position,dst->lines.position);
+	swap_int4(src->lines.length,dst->lines.length);
+	swap_int4(src->lines.comment,dst->lines.comment);
+	dst->lines.type = src->lines.type;
+	swap_int4(src->lines.next,dst->lines.next);
+    } else {
+	dst->lines.position = src->lines.position;
+	dst->lines.length = src->lines.length;
+	dst->lines.comment = src->lines.comment;
+	dst->lines.type = src->lines.type;
+	dst->lines.next = src->lines.next;
+    }
+    
+}
+
+
+
+
+void bap_read_tg(BapIO *io, int_4 rec, bap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    bap_tg_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_tg_file_rec), 1, f) != 1)
+	crash("Read failure on tag file\n");
+
+    swap_tg_file_rec(&tbuf,t);
+}
+
+
+
+
+void bap_write_tg(BapIO *io, int rec, bap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    bap_tg_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+
+    swap_tg_file_rec(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_tg_file_rec), 1, f) != 1)
+	crash("Write failure on tag file\n");
+}
+
+
+
+/*
+** Archive File IO
+*/
+void bap_read_ar(BapIO *io, int rec, bap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)bap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fread(t, sizeof(bap_ar_file_rec), 1, f) != 1)
+	crash("Read failure on archive file\n");
+}
+
+void bap_write_ar(BapIO *io, int rec, bap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)bap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(bap_ar_file_rec), 1, f) != 1)
+	crash("Write failure on archive file\n");
+}
+
+
+
+
+/*
+** Relationship file IO
+*/
+
+void swap_rl_file_rec(bap_rl_file_rec *src, bap_rl_file_rec *dst)
+{
+    int i=1;
+
+    if (*(char*)&i) {
+	swap_int4(src->lines.rel_pos,dst->lines.rel_pos);
+	swap_int4(src->lines.length,dst->lines.length);
+	swap_int4(src->lines.left_nbr,dst->lines.left_nbr);
+	swap_int4(src->lines.right_nbr,dst->lines.right_nbr);
+    } else {
+	dst->lines.rel_pos = src->lines.rel_pos;
+	dst->lines.length = src->lines.length;
+	dst->lines.left_nbr = src->lines.left_nbr;
+	dst->lines.right_nbr = src->lines.right_nbr;
+    }
+    
+}
+
+
+
+void bap_read_rl(BapIO *io, int rec, bap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    bap_rl_file_rec tbuf;
+
+    if ( fseek(f,(off_t)bap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_rl_file_rec), 1, f) != 1)
+	crash("Read failure on relationships file\n");
+
+    swap_rl_file_rec(&tbuf,t);
+}
+
+void bap_write_rl(BapIO *io, int rec, bap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    bap_rl_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    swap_rl_file_rec(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_rl_file_rec), 1, f) != 1)
+	crash("Write failure on relationships file\n");
+}
+
+
+
+
+/*
+** Comment file IO
+*/
+
+
+void swap_cc_file_rec_header(bap_cc_file_rec *src, bap_cc_file_rec *dst)
+{
+    int i=1;
+    if (*(char*)&i) {
+	swap_int4(src->header.free_list,dst->header.free_list);
+	swap_int4(src->header.count,dst->header.count);
+    } else {
+	dst->header.free_list = src->header.free_list;
+	dst->header.count = src->header.count;
+    }
+    
+}
+
+
+
+
+void swap_cc_file_rec_lines(bap_cc_file_rec *src, bap_cc_file_rec *dst)
+{
+    int i=1;
+    if (*(char*)&i) {
+	swap_int4(src->lines.next,dst->lines.next);
+    } else {
+	dst->lines.next = src->lines.next;
+    }
+    memcpy(dst->lines.comment,src->lines.comment,BAP_COMMENT_SIZE);
+    
+}
+
+void bap_read_cc(BapIO *io, int rec, bap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    bap_cc_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fread(&tbuf, sizeof(bap_cc_file_rec), 1, f) != 1)
+	crash("Read failure on comment file, record %d\n",rec);
+
+    if (rec == bap_cc_header_rec(io))
+	swap_cc_file_rec_header(&tbuf,t);
+    else
+	swap_cc_file_rec_lines(&tbuf,t);
+
+}
+
+
+
+void bap_write_cc(BapIO *io, int rec, bap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    bap_cc_file_rec tbuf;
+    if ( fseek(f,(off_t)bap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if (rec == bap_cc_header_rec(io))
+	swap_cc_file_rec_header(t,&tbuf);
+    else
+	swap_cc_file_rec_lines(t,&tbuf);
+
+    if ( fwrite(&tbuf, sizeof(bap_cc_file_rec), 1, f) != 1)
+	crash("Crash failure on comment file, record %d\n",rec);
+}
+
+
+
+/*
+** Sequence file IO
+*/
+void bap_read_sq(BapIO *io, int rec, bap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)bap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fread(t, io->max_gel_length, 1, f) != 1)
+	crash("Read failure on sequence file\n");
+}
+
+void bap_write_sq(BapIO *io, int rec, bap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)bap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fwrite(t, io->max_gel_length, 1, f) != 1)
+	crash("Write failure on sequence file\n");
+}
+
+
+
+/*
+** Comment IO - Strings
+*/
+char *bap_read_comment(BapIO *io, int_4 cp)
+{
+    bap_cc_file_rec c;
+    int count;
+    int_4 nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    bap_read_cc(io, nc, &c);
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        bap_read_cc(io, nc, &c);
+    }
+
+    com = comptr = (char *)malloc(count * BAP_COMMENT_SIZE+1);
+    nc=cp;
+    bap_read_cc(io, nc, &c);
+    strncpy(com,c.lines.comment,BAP_COMMENT_SIZE); com+=BAP_COMMENT_SIZE;
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        bap_read_cc(io, nc, &c);
+        strncpy(com,c.lines.comment,BAP_COMMENT_SIZE); com+=BAP_COMMENT_SIZE;
+    }
+
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+
+
+static int_4 get_free_comment(BapIO *io)
+{
+    bap_cc_file_rec head;
+    bap_cc_file_rec freerec;
+    int_4 free_id;
+    bap_read_cc(io,bap_cc_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	bap_read_cc(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	bap_write_cc(io,bap_cc_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	bap_write_cc(io,bap_cc_header_rec(io),&head);
+	bap_write_cc(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+int_4 bap_get_free_tag(BapIO *io)
+{
+    bap_tg_file_rec head;
+    bap_tg_file_rec freerec;
+    int_4 free_id;
+    bap_read_tg(io,bap_tg_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	bap_read_tg(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	bap_write_tg(io,bap_tg_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	bap_write_tg(io,bap_tg_header_rec(io),&head);
+	bap_write_tg(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+
+
+
+
+void bap_insert_tag(BapIO *io, int_4 gel, bap_tg_file_rec t)
+{
+    int_4 next, last;
+    int_4 free;
+    bap_tg_file_rec tg,last_tg;
+
+    last = gel;
+    bap_read_tg(io,last,&last_tg);
+
+    next = last_tg.lines.next;
+    if (next) bap_read_tg(io,next,&tg);
+
+    while (next && tg.lines.position <= t.lines.position) {
+	last = next;
+	last_tg = tg;
+	next = tg.lines.next;
+	if (next) bap_read_tg(io,next,&tg);
+    }
+
+    /* insert after last */
+    free = bap_get_free_tag(io);
+    t.lines.next = next;
+    last_tg.lines.next = free;
+    bap_write_tg(io,last,&last_tg);
+    bap_write_tg(io,free,&t);
+
+}
+
+
+int_4 bap_write_comment(BapIO *io, char *c)
+{
+    bap_cc_file_rec com;
+    int_4 cur,next,this_comment;
+    int clen = strlen(c);
+    int piece;
+
+    /* write out first block of BAP_COMMENT_SIZE */
+    this_comment=cur=get_free_comment(io);
+    if (clen>BAP_COMMENT_SIZE)
+	piece = BAP_COMMENT_SIZE;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<BAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+    strncpy(com.lines.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment(io);
+	com.lines.next = next;
+	bap_write_cc(io,cur,&com);
+	cur = next;
+	if (clen<BAP_COMMENT_SIZE)
+	    piece = clen;
+
+	{int i; for(i=0;i<BAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+	strncpy(com.lines.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.lines.next = 0;
+    if (piece!=BAP_COMMENT_SIZE)
+	com.lines.comment[piece]='\0';
+    bap_write_cc(io,cur,&com);
+
+    return this_comment;
+}
+
+
+
+
+
+
+
+
+
+
+static void set_file_names(BapIO *io, char *name, char *version)
+{
+    strcpy(io->ar_file,name); strcat(io->ar_file,".AR"); strcat(io->ar_file,version);
+    strcpy(io->rl_file,name); strcat(io->rl_file,".RL"); strcat(io->rl_file,version);
+    strcpy(io->sq_file,name); strcat(io->sq_file,".SQ"); strcat(io->sq_file,version);
+    strcpy(io->tg_file,name); strcat(io->tg_file,".TG"); strcat(io->tg_file,version);
+    strcpy(io->cc_file,name); strcat(io->cc_file,".CC"); strcat(io->cc_file,version);
+}
+
+static void bap_open_files(BapIO *io, char *name, char *version, char *mode)
+/*
+**
+*/
+{
+    /*
+    ** Create file names
+    */
+    set_file_names(io,name,version);
+
+    /*
+    ** Open files
+    */
+    if ( ( io->ar_fp = fopen(io->ar_file,mode) ) == NULL )
+	crash("Error opening archive file %s\n",io->ar_file);
+    if ( ( io->rl_fp = fopen(io->rl_file,mode) ) == NULL )
+	crash("Error opening relationships file %s\n",io->rl_file);
+    if ( ( io->sq_fp = fopen(io->sq_file,mode) ) == NULL )
+	crash("Error opening sequence file %s\n",io->sq_file);
+    if ( ( io->tg_fp = fopen(io->tg_file,mode) ) == NULL )
+	crash("Error opening tag file %s\n",io->tg_file);
+    if ( ( io->cc_fp = fopen(io->cc_file,mode) ) == NULL )
+	crash("Error opening tag-comment file %s\n",io->cc_file);
+
+}
+
+
+void bap_open_for_read(BapIO *io, char *name, char *version)
+{
+    bap_rl_file_rec rl_header;
+
+    bap_open_files(io,name,version,"rb");
+    
+    bap_read_rl(io,bap_rl_dbheader_rec(io),&rl_header);
+    io->max_gels = rl_header.dbheader.idbsiz;
+    io->max_gel_length = rl_header.dbheader.maxgel;
+    io->data_class = rl_header.dbheader.idm;
+
+    bap_read_rl(io,bap_rl_header_rec(io),&rl_header);
+    io->num_gels = rl_header.header.num_gels;
+    io->num_contigs = rl_header.header.num_contigs;
+}
+
+void bap_open_for_write(BapIO *io, char *name, char *version)
+{
+    bap_open_files(io,name,version,"w+b");
+}
+
+void bap_open_for_update(BapIO *io, char *name, char *version)
+{
+    bap_open_files(io,name,version,"r+b");
+}
+
+
+
+void bap_close_files(BapIO *io)
+/*
+** Close all relevant files
+*/
+{
+
+    fclose(io->ar_fp);
+    fclose(io->rl_fp);
+    fclose(io->sq_fp);
+    fclose(io->tg_fp);
+    fclose(io->cc_fp);
+
+}
+
diff --git a/src/cop/bapIO.h b/src/cop/bapIO.h
new file mode 100644
index 0000000..69e9f50
--- /dev/null
+++ b/src/cop/bapIO.h
@@ -0,0 +1,190 @@
+#ifndef _bapIO_h
+#define _bapIO_h
+
+#include "newtypes.h"
+
+/*
+** Definition of dap database files
+*/
+
+
+
+/*
+** Archive file (*.AR?)
+*/
+#define BAP_FILE_NAME_LENGTH 16
+typedef union {
+
+    struct _ar_lines {
+	char name[BAP_FILE_NAME_LENGTH];
+    } lines;
+
+} bap_ar_file_rec;
+
+#define bap_ar_byte_index(I,R) ( ((R)-1) *sizeof(bap_ar_file_rec) )
+
+
+
+
+
+
+
+/*
+** Relationships file (*.RL?)
+*/
+typedef union {
+
+    struct _rl_dbheader { 
+	int_4 maxdb;
+	int_4 idbsiz;
+	int_4 maxgel;
+	int_4 idm;
+    } dbheader;
+
+    struct _rl_header { 
+	int_4 num_gels;
+	int_4 num_contigs;
+	int_4 spare1;
+	int_4 spare2;
+    } header;
+
+    struct _rl_lines {
+	int_4 rel_pos;
+	int_4 length;
+	int_4 left_nbr;
+	int_4 right_nbr;
+    } lines;
+
+    struct _rl_clines {
+	int_4 length;
+	int_4 spare3;
+	int_4 left_end;
+	int_4 right_end;
+    } clines;
+
+} bap_rl_file_rec;
+
+#define bap_rl_header_rec(I) ( (I)->max_gels )
+#define bap_rl_byte_index(I,R) ( (R) * sizeof(bap_rl_file_rec) )
+#define bap_rl_dbheader_rec(I) (0)
+
+
+
+
+
+
+/*
+** Sequence file (*.SQ?)
+*/
+typedef char *bap_sq_file_rec;
+#define bap_sq_byte_index(I,R) ( ((R)-1) * ((I)->max_gel_length) )
+
+
+
+/*
+** Tag files (*.TG?)
+*/
+typedef union {
+    int i;
+    char c[4];
+} bap_tag_type;
+
+
+
+typedef union {
+
+    struct _tg_header {
+	int_4 count;
+	int_4 spare1;
+	int_4 spare2;
+	bap_tag_type spare3;
+	int_4 free_list;
+    } header;
+
+    struct _tg_lines {
+	int_4 position;
+	int_4 length;
+	int_4 comment;
+	bap_tag_type type;
+	int_4 next;
+    } lines;
+
+} bap_tg_file_rec;
+
+#define bap_tg_byte_index(I,R) ( ((R)-1) *sizeof(bap_tg_file_rec) )
+#define bap_tg_header_rec(I) ( (I)->max_gels )
+
+/*
+** Comment files (*.CC?)
+*/
+#define BAP_COMMENT_SIZE 40
+typedef union {
+
+    struct _cc_header {
+	int_4 free_list;
+	int_4 count;
+	char spare[BAP_COMMENT_SIZE - sizeof(int_4)];
+    } header;
+
+    struct _cc_lines {
+	int_4 next;
+	char comment[BAP_COMMENT_SIZE];
+    } lines;
+
+} bap_cc_file_rec;
+
+#define bap_cc_byte_index(I,R) ( ((R)-1) *sizeof(bap_cc_file_rec) )
+#define bap_cc_header_rec(I) (1)
+
+
+
+/*
+** Useful variables
+*/
+typedef char IOString[200];
+
+typedef struct {
+    int max_gels;
+    int num_gels;
+    int num_contigs;
+    int max_gel_length;
+    int data_class;
+    int max_db_size;
+    
+    FILE *ar_fp;
+    FILE *rl_fp;
+    FILE *sq_fp;
+    FILE *tg_fp;
+    FILE *cc_fp;
+
+    IOString ar_file;
+    IOString rl_file;
+    IOString sq_file;
+    IOString tg_file;
+    IOString cc_file;
+} BapIO;
+
+
+
+
+
+extern void bap_read_tg(BapIO *io, int rec, bap_tg_file_rec *t);
+extern void bap_write_tg(BapIO *io, int rec, bap_tg_file_rec *t);
+extern void bap_read_ar(BapIO *io, int rec, bap_ar_file_rec *t);
+extern void bap_write_ar(BapIO *io, int rec, bap_ar_file_rec *t);
+extern void bap_read_rl(BapIO *io, int rec, bap_rl_file_rec *t);
+extern void bap_write_rl(BapIO *io, int rec, bap_rl_file_rec *t);
+extern void bap_read_cc(BapIO *io, int rec, bap_cc_file_rec *t);
+extern void bap_write_cc(BapIO *io, int rec, bap_cc_file_rec *t);
+extern void bap_read_sq(BapIO *io, int rec, bap_sq_file_rec t);
+extern void bap_write_sq(BapIO *io, int rec, bap_sq_file_rec t);
+extern char *bap_read_comment(BapIO *io, int_4 cp);
+extern int_4 bap_write_comment(BapIO *io, char *c);
+extern int_4 bap_get_free_tag(BapIO *io);
+extern void bap_insert_tag(BapIO *io, int_4 gel, bap_tg_file_rec t);
+extern void bap_open_for_read(BapIO *io, char *name, char *version);
+extern void bap_open_for_write(BapIO *io, char *name, char *version);
+extern void bap_close_files(BapIO *io);
+
+
+#endif /* _bapIO_h */
diff --git a/src/cop/cop.c b/src/cop/cop.c
new file mode 100644
index 0000000..a3f0323
--- /dev/null
+++ b/src/cop/cop.c
@@ -0,0 +1,1220 @@
+/*
+ * Check xdap database for errors
+ */
+
+#include <stdio.h>
+#include <ctype.h>    /* IMPORT : tolower */
+#include "misc.h"
+#include "seq.h"
+
+typedef char StringBuffer[200];
+
+
+
+
+
+
+
+
+
+
+/*
+ * Database io
+ */
+#ifdef BAP_VERSION
+/*
+ * BAP version
+ */
+#include "bapIO.h"
+BapIO io;
+#define FILE_NAME_LENGTH BAP_FILE_NAME_LENGTH
+#define open_for_read bap_open_for_read
+#define close_files bap_close_files
+#define read_rl bap_read_rl
+#define read_sq bap_read_sq
+#define read_ar bap_read_ar
+#define read_tg bap_read_tg
+#define read_comment bap_read_comment
+#define rl_file_rec bap_rl_file_rec
+#define ar_file_rec bap_ar_file_rec
+#define tg_file_rec bap_tg_file_rec
+#else
+/*
+ * DAP version
+ */
+#include "dapIO.h"
+DapIO io;
+#define FILE_NAME_LENGTH DAP_FILE_NAME_LENGTH
+#define open_for_read dap_open_for_read
+#define close_files dap_close_files
+#define read_rl dap_read_rl
+#define read_sq dap_read_sq
+#define read_ar dap_read_ar
+#define read_tg dap_read_tg
+#define read_comment dap_read_comment
+#define rl_file_rec dap_rl_file_rec
+#define ar_file_rec dap_ar_file_rec
+#define tg_file_rec dap_tg_file_rec
+#endif
+/*
+ * Prototypes
+ */
+extern void *malloc(size_t size);
+extern void *calloc(size_t nobj, size_t size);
+extern char *getenv(char *s);
+
+/*
+ * Global variables - to this file at least
+ */
+static int consensusCutoff;
+static int alignmentCutoff = 0; /* scores range from -2000 to +2000 */
+static int bridge=2;
+
+/* was -200 */
+static StringBuffer rawData;
+static StringBuffer contigName;
+static FILE *log_fp;
+
+
+#include "upam.gbl"
+#include "uascii.gbl"
+#include "llin.h"
+
+initpam2()
+/*
+ * Initialise alignment routine
+ */
+{
+    int i, j, k;
+    
+    pam = npam;
+    nsq = naa;
+    
+    k=0;
+    for (i=0; i<nsq; i++)
+	for (j=0; j<=i; j++)
+	    pam2[j][i] = pam2[i][j] = -pam[k++];
+}
+
+
+static void seqout(char *cseq)
+/*
+ * Print out a string in lines 50 characters long
+ */
+{
+    char *s;
+    int i;
+    
+    for (i = 0,s = cseq; *s; s++) {
+	putchar(*s);
+	i++;
+	if (i==50) {
+	    putchar('\n');
+	    i = 0;
+	}
+    }
+    if (i) putchar('\n');
+}
+
+
+
+
+static void print_seq(char *seq, int from, int to)
+/*
+ * Print a portion of sequence
+ */
+{
+    char *s;
+    int i;
+    
+    for (i = 0, s = seq+from-1; *s && s < seq+to; s++) {
+	putchar(*s);
+	i++;
+	if (! (i%50) ) {
+	    putchar('\n');
+	}
+    }
+    if (i%50) putchar('\n');
+}
+
+
+
+/*
+ * Consensus calculation routines
+ */
+
+static int base_scores[256];
+static int base_indexes[256];
+static char base_complement[256];
+
+typedef int Scores[7];
+
+static void inits()
+/*
+ * Initialise tables
+ * Based on Rodger Staden's INITS
+ */
+{
+    static char bases[]  = "CTAG1234DVBHKLMNRY5678ctag*,-";
+    static char cbases[] = "GATC4321HBVDNMLKYR6578gatc*,-";
+    static int ind[]  = {1,2,3,4,1,2,3,4,1,2,3,4,1,2,3,4,6,6,6,6,6,6,1,2,3,4,5,5,6};
+    static int scr[]  = {
+	100,100,100,100,
+	75,75,75,75,
+	100,100,100,100,
+	100,100,100,100,
+	10,10,10,10,10,10,
+	100,100,100,100,100,100,10};
+    
+    int i;
+    for (i=0;i<256;i++) {
+	base_scores[i] = base_indexes[i] = 0;
+	base_complement[i] = '-';
+    }
+    
+    for (i=0;i<sizeof(bases);i++) {
+	base_scores[bases[i]] = scr[i];
+	base_indexes[bases[i]] = ind[i];
+	base_complement[bases[i]] = cbases[i];
+    }
+    
+}
+
+
+
+
+
+
+static void seq_complement(char *seq, int len)
+/*
+ * Complement a sequence (but don't reverse)
+ */
+{
+    int i;
+    
+    for (i=0; i < len; i++) {
+	seq[i] = base_complement[seq[i]];
+    }
+}
+
+
+
+static void seq_reverse(char *seq, int len)
+/*
+ * Reverse a sequence
+ */
+{
+    char temp;
+    int i;
+    
+    for (i=0; i < len/2; i++) {
+	temp = seq[i];
+	seq[i] = seq[len-i-1];
+	seq[len-i-1] = temp;
+    }
+    
+}
+
+
+static void complement_seq(char *seq, int len)
+/*
+ * Complement a sequence
+ */
+{
+    seq_reverse(seq,len);
+    seq_complement(seq,len);
+}
+
+
+static void complement_zseq(char *seq)
+/*
+ * Complement a zero-terminated sequence
+ */
+{
+    complement_seq(seq, strlen(seq));
+}
+
+
+
+
+
+static int indexs(unsigned char c, int *score)
+/*
+ * Return inde and score of character c
+ * Based on Rodger Staden's INDEXS
+ */
+{
+    *score = base_scores[c];
+    return base_indexes[c];
+}
+
+
+
+
+static char charsu(int i)
+/*
+ * Return necleotide with index i
+ * Based on Rodger Staden's CHARSU
+ */
+{
+    static char c[] = "CTAG*-";
+    
+    return c[i-1];
+}
+
+
+
+
+static char gtconc(Scores scores, int idm, int cut)
+/*
+ * Returns the consensus of matrix scores
+ * 	cut is the percentage cutoff
+ *	idm should always be 7
+ * Based on Rodger Staden's GTCONC
+ */
+{
+    char c;
+    int i;
+    
+    if (!scores[idm-1]) return '-';
+    
+    for (i=0;i<idm;i++)
+	if ( scores[i] * 100 >= cut * scores[idm-1] ) return charsu(i);
+    
+    return '-';
+    
+}
+
+
+
+
+
+static void consensus_region(int contig, int start, int end, char *cseq, int cutoff)
+/*
+ * Determine the consensus of a region in a contig
+ * Adequate space must be allocated in cseq.
+ * It used a memory intensive algorithm! (The score maxtrix is 7xlength of region!)
+ */
+{
+    Scores *scores;
+    rl_file_rec cline;
+    rl_file_rec gline;
+    char *gseq;
+    int gel;
+    int region_len;
+    int i;
+    
+    region_len = end - start + 1;
+    scores = (Scores *)calloc(1,region_len * sizeof(Scores));
+    
+    read_rl(&io,contig,&cline);
+    gel = cline.clines.left_end;
+    
+    gseq = malloc(io.max_gel_length+1);
+    
+    /*
+     * find left-most gel in region
+     */
+    read_rl(&io,gel,&gline);
+    while (gel && gline.lines.rel_pos + abs(gline.lines.length) <= start) {
+	if (gel = gline.lines.right_nbr) read_rl(&io,gel,&gline);
+    }
+    
+    while (gel && gline.lines.rel_pos <= end) {
+	int gstart, gend, goffset;
+	char *s;
+	
+	read_sq(&io,gel,gseq);
+	gseq[abs(gline.lines.length)] = '\0';
+	
+	gstart = max(start, gline.lines.rel_pos) -
+	    gline.lines.rel_pos + 1;
+	gend = min(end, gline.lines.rel_pos + abs(gline.lines.length) - 1) -
+	    gline.lines.rel_pos + 1;
+	goffset = gline.lines.rel_pos - start - 1;
+	
+	for (s = &gseq[gstart-1];gstart<=gend;gstart++, s++) {
+	    int j,score;
+	    
+	    j = indexs(*s,&score);
+	    
+	    scores[gstart+goffset][j] += score;
+	    scores[gstart+goffset][6] += score;
+	}
+	
+	if (gel = gline.lines.right_nbr) read_rl(&io,gel,&gline);
+    }
+    
+    for (i=0;i<region_len;i++){
+	cseq[i] = gtconc(scores[i],7,cutoff);
+    }
+    cseq[region_len] = '\0';
+    
+    free(gseq);
+    free(scores);
+    
+}
+
+
+
+
+
+static void consensus_contig(int contig, char *cseq, int cutoff)
+/*
+ * Determine the consensus sequence of a contig
+ * It allocates just enough space and returns a pointer to the string.
+ * Remember to free it when you've finished with it!
+ */
+{
+    rl_file_rec cline;
+    
+    read_rl(&io, contig, &cline);
+    
+    consensus_region(contig, 1, cline.clines.length, cseq, cutoff);
+}
+
+
+
+static int isUncertain(char c)
+/*
+ * Return true if c not in "acgtACGT".
+ */
+{
+    switch (c) {
+    case 'a': case 'c': case 'g': case 't':
+    case 'A': case 'C': case 'G': case 'T':
+	return 0;
+    default:
+	return 1;
+    }
+}
+
+
+
+
+static void select_good_raw_bits(Seq trace,
+				 int rawLCut,
+				 int rawLength,
+				 char *rawSeq,
+				 int *map)
+/*
+ * Select only the good quality pieces in the raw sequence.
+ * This routine will use the quality measures developed by
+ * LaDeana, once they have been tested.`
+ * In the mean time - FUDGE IT!
+ */
+{
+    int rcut;
+    int score;
+    int i;
+    
+    
+    /*
+     * Determine a rough estimate of how much to use.
+     */
+    
+    /* MAXREADLEN bases max */
+#define MAXREADLEN rawLength
+    rcut = MAXREADLEN; 
+    
+    
+    /* MAXPERCENT of rawLength max */
+#define MAXPERCENT 100
+    
+    rcut = min(rcut, rawLength*MAXPERCENT/100);
+    
+    /* Up to where NN uncertainties in MM window */
+#define NN 0
+#define MM 5
+    if (NN>0) {
+	for(i=1-MM,score=0;i<rawLength+MM-1;i++) {
+	    if (i>=0) score -= isUncertain(rawSeq[i]);
+	    if (i+MM-1<rawLength) score += isUncertain(rawSeq[i+MM-1]);
+	    if (score >= NN) break;
+	}
+	rcut = min(rcut,max(i,0));
+    }
+    
+    /*
+     * Remove 3' crap from map
+     */
+    for (i=rcut;i<rawLength;i++) map[i] = ~map[i];
+    
+    /*
+     * Remove XX either side of "-" from map
+     */
+#define XX 1
+    for (i=0;i<rcut;i++) {
+	if (rawSeq[i] == '-') {
+	    int k;
+	    for (k=(i<XX)?0:i-XX; k<=i+XX; k++)
+		if (map[k]>0) map[k] = ~map[k];
+	}
+    }
+}
+
+
+
+static int pos_in_contig(int pos,
+			 int rel_pos,
+			 int length,
+			 int complemented)
+{
+    if (complemented)
+	return rel_pos + length - pos;
+    else
+	return rel_pos + pos - 1;
+}
+
+
+static int bases_equal(char a, char b)
+/*
+ * Return true if bases a and b are equal.
+ * This ignores the case of the bases
+ */
+{
+    /*
+     * Beware of non ANSI implementations of tolower()
+     * that only work when argument isupper().
+     */
+    if (isupper(a)) a = tolower(a);
+    if (isupper(b)) b = tolower(b);
+    
+    return (a == b);
+}
+
+
+
+
+static void region_covered(int start,
+			   int end,
+			   int *coverage)
+/*
+ * Mark as covered, the region from start to end in the map
+ */
+{
+    int i;
+    
+    for (i=start; i<=end+(1-bridge); i++) {
+	/*
+	 * Setting coverage at position x means bases between x and x+(1-bridge) (inclusive) are covered
+	 */
+	coverage[i-1]++;
+    }
+}
+
+
+
+
+static void compare_seq_update_coverage(int gel,
+					char *seq,
+					int length,
+					char *rawSeq,
+					int rawLength,
+					int rel_pos,
+					int complemented,
+					int *coverage,
+					int *map
+					)
+/*
+ * Report the differences, insertions and deletions
+ */
+{
+    int i;
+    int mstart,mend;
+    int gstart,gend;
+    int cstart,cend;
+    
+    i = 0;
+    while (i < rawLength) {
+	/*
+	 * Skip over regions:
+	 *    (a) not mapped because the quality of the trace was deemed too poor (*map < 0)
+	 * or (b) where deletions are indicated (*map == 0)
+	 * or (c) where bases disagree ( *rawSeq != seq[*map] )
+	 */
+	while (i < rawLength &&
+	       (map[i] <= 0 || 
+		! bases_equal(rawSeq[i],seq[map[i]-1]) ) )
+	    i++;
+	
+	/*
+	 * We have hit start of good coverage
+	 * Determine regions where there are no insertions or changes,
+	 * then mark them as covered
+	 */
+	if (i < rawLength) {
+	    int mstart = i;
+	    i++;
+	    while (i < rawLength && map[i] > 0) {
+		if (map[i] != map[i-1]+1) {
+		    /* insertion detected */
+		    break;
+		}
+		
+		if ( ! bases_equal(rawSeq[i],seq[map[i]-1]) ) {
+		    /* change detected */
+		    break;
+		}
+		i++;
+	    }
+	    
+	    mend = i-1;
+	    
+	    gstart = map[mstart];
+	    gend = map[mend];
+	    
+	    if (complemented) {
+		cstart = pos_in_contig(gend,rel_pos,length,complemented);
+		cend   = pos_in_contig(gstart,rel_pos,length,complemented);
+	    } else {
+		cstart = pos_in_contig(gstart,rel_pos,length,complemented);
+		cend   = pos_in_contig(gend,rel_pos,length,complemented);
+	    }
+	    
+	    region_covered(cstart,cend,coverage);
+	    
+	    i++;
+	    
+	}
+	
+    }
+    
+    
+    
+    
+    
+}
+
+
+
+static int make_alignment_map(int gel,
+			      char *seq,
+			      int length,
+			      char *rawSeq,
+			      int rawLength,
+			      int *map
+			      )
+/*
+ * Produce an alignment map which maps base positions in the raw sequence
+ * to positions in the gel reading
+ */
+{
+    int totalLength;
+    int *res; /* used by alignment program */
+    int score;
+    
+    totalLength = length+rawLength;
+    res = (int *) malloc(totalLength * sizeof(int));
+    
+    score = DIFF(rawSeq-1,seq-1,
+		 rawLength,length,
+		 pam2, -gdelval, -ggapval, res);
+    
+    if (score > alignmentCutoff) {
+	fprintf(log_fp,";Alignment for gel %d failed to meet cutoff score (%d/%d)\n",
+		gel,score,alignmentCutoff);
+	return 0;
+    }
+    
+    /*
+     * Use alignment results to produce an array
+     * mapping trace to consensus positions
+     */
+    
+    /*************************************************************************/
+    {
+	int mindex;
+	int mvalue;
+	int i, j, op;
+	int *S, M, N;
+	
+	S = res;
+	M = rawLength;
+	N = length;
+	i = j = op = 0;
+	mindex = 0;
+	mvalue = 1;
+	while (i < M || j < N) {
+	    if (op == 0 && *S == 0) {
+		map[mindex++] = mvalue;
+		mvalue++;
+		op = *S++;
+		i++; j++;
+	    } else {
+		if (op == 0)
+		    op = *S++;
+		if (op > 0) {
+		    mvalue++;
+		    op--;
+		    j++;
+		} else {
+		    map[mindex++] = 0;
+		    op++;
+		    i++;
+		}
+	    }
+	}
+    }
+    /*************************************************************************/
+    
+    /***********
+      {
+	(void) DISPLAY(rawSeq-1, seq-1,
+		       rawLength, length,
+		       res);
+	printf("\n\n");
+      }
+      ************/
+    
+    
+    free(res);
+    
+    return 1;
+}
+
+
+
+
+
+
+
+static void compare_sequences(/* This gel reading info */
+			      int gel,
+			      char *seq,
+			      int length,
+			      int rel_pos,
+			      int complemented,
+			      /* This trace info */
+			      Seq trace,
+			      int rawLCut,
+			      int rawLength,
+			      char *rawSeq,
+			      /* This cosmid info*/
+			      int *coverage
+			      )
+/*
+ * Perform an alignment between the consensus sequence and the raw data.
+ * Check only high quality bits and update coverage
+ */
+{
+    int *map;
+    
+    map = (int *)malloc(rawLength*sizeof(int));
+    
+    if (make_alignment_map(gel,seq,length,rawSeq,rawLength,map)) {
+	
+        select_good_raw_bits(trace,rawLCut,rawLength,rawSeq,map);
+	
+	compare_seq_update_coverage(gel,seq,length,rawSeq,rawLength,rel_pos,
+				    complemented,coverage,map);
+	
+    }
+    
+    free(map);
+    
+}
+
+
+
+
+
+
+
+
+static void process_gel_reading(int gel,
+				int length,
+				int complemented,
+				int rel_pos,
+				char *cseq,
+				int contig,
+				int *coverage
+				)
+/*
+ * Perform the quality check on a specific gel reading
+ */
+{
+    ar_file_rec ar_rec;
+    tg_file_rec tg_rec;
+    char *rd;
+    int rd_length;
+    int rd_cut;
+    int rd_ulen;
+    char rd_type[5];
+    char rd_file[19];
+    
+    /*
+     * Read Name
+     */
+    read_ar(&io,gel,&ar_rec);
+    
+    fprintf (log_fp,";%d %s\n",complemented?-gel:gel,ar_rec.lines.name);
+    
+    /*
+     * Read Raw data file name
+     */
+    read_tg(&io,gel,&tg_rec);
+    if (tg_rec.lines.comment) {
+	char *fullFileName;
+	rd = (char *) read_comment(&io, tg_rec.lines.comment);
+	sscanf(rd,"%6d%6d%6d%*s",&rd_length, &rd_cut, &rd_ulen);
+	f2cstr(&rd[22],18,rd_file,18);
+	f2cstr(&rd[18],4,rd_type,4);
+	if ( (fullFileName = findfile(rd_file,rawData)) == NULL) {
+	    /* Trace file specified but not found */
+	    fprintf(log_fp,";Trace file %s specified but not found for gel reading %d %s\n",
+		    rd_file,gel,ar_rec.lines.name);
+	} else {
+	    Seq trace;
+	    trace = (Seq) getSeq(fullFileName,rd_type);
+	    if (trace == NULLSeq) {
+		/* Trace file specified but not found */
+		fprintf(log_fp,";Error reading Trace file %s for gel reading %d %s\n",
+			fullFileName,gel,ar_rec.lines.name);
+	    } else {
+		char *seq;
+		char *rawSeq;
+		
+		rawSeq = (char *)getSequence(trace);
+		seq = (char *)malloc(length+1);
+		strncpy(seq, &cseq[rel_pos-1],length);
+		seq[length] = '\0';
+		
+		if (complemented) {
+		    /*
+		     * Complement consensus sequence
+		     */
+		    complement_zseq(seq);
+		}
+		
+		compare_sequences(gel,seq,length,rel_pos,complemented,
+				  trace,rd_cut,rd_ulen,&rawSeq[rd_cut],
+				  coverage);
+		
+		freeSeq(trace);
+		free(rawSeq);
+		free(seq);
+	    }
+	}
+	
+	
+	free(rd);
+    } else
+	/* Skip */
+	fprintf(log_fp,";No raw data tag for gel reading %d %s\n",gel,ar_rec.lines.name);
+}
+
+
+
+
+
+
+static void print_coverage_gaps(int *coverage, int length)
+/*
+ * Print out a list of places where there is not coverage
+ */
+{
+    int starti,endi;
+    int i;
+    
+    printf("Problem areas:\n");
+    fprintf(log_fp,"Problem areas:\n");
+    
+    for (i=0;i<length-(bridge-1);) {
+	if (coverage[i])
+	    i++;
+	else {
+	    starti = i+1;
+	    for(;!coverage[i] && i<length; i++);
+	    endi = i + bridge - 2;
+	    
+	    if (starti == endi) {
+		printf("  %d\n",starti);
+		fprintf(log_fp,"  %d\n",starti);
+	    } else {
+		printf("  %d-%d\n",starti,endi);
+		fprintf(log_fp,"  %d-%d\n",starti,endi);
+	    }
+	}
+    }
+    
+    
+}
+
+
+
+
+
+
+static void process_contig(int contig)
+/*
+ * Perform the error check on a specific contig
+ */
+{
+    char *cseq;
+    int *coverage;
+    rl_file_rec cline;
+    ar_file_rec ar_rec;
+    
+    /*
+     * Read Contig Details
+     */
+    read_rl(&io, contig, &cline);
+    read_ar(&io,cline.clines.left_end,&ar_rec);
+    
+    /*
+     * Write to log
+     */
+    printf("Checking contig %d: %s\n",cline.clines.left_end,ar_rec.lines.name);
+    fprintf(log_fp,"Checking contig %d: %s\n",cline.clines.left_end,ar_rec.lines.name);
+    
+    /*
+     * Determine consensus sequence
+     */
+    cseq = malloc(cline.clines.length + 1);
+    consensus_region(contig, 1, cline.clines.length, cseq, consensusCutoff);
+    
+    /*
+     * Allocate coverage arrays
+     */
+    coverage = (int *)calloc(1,cline.clines.length * sizeof(int));
+    
+    {
+	int gel;
+	rl_file_rec gline;
+	
+	/*
+	 * Print out a list of names and raw data files
+	 */
+	if (gel = cline.clines.left_end) read_rl(&io,gel,&gline);
+	while (gel) {
+	    process_gel_reading(gel,
+				abs(gline.lines.length),
+				gline.lines.length < 0,
+				gline.lines.rel_pos,
+				cseq,
+				contig,
+				coverage);
+	    if (gel = gline.lines.right_nbr) read_rl(&io,gel,&gline);
+	}
+	
+    }
+    
+    print_coverage_gaps(coverage,cline.clines.length);
+    printf("\n");
+    fprintf(log_fp,"\n");
+    
+    free(coverage);
+    free(cseq);
+    
+}
+
+
+
+
+static void process()
+/*
+ * Perform the error check on the whole database
+ */
+{
+    int i;
+    
+    for (i=0; i<io.num_contigs; i++)
+	process_contig(io.max_gels-i-1);
+}
+
+
+static void open_log_file(char *projectName, char *versionNumber)
+{
+    StringBuffer fn;
+    
+    sprintf(fn,"%s.%s.LOG",projectName,versionNumber);
+    
+    if (file_exists(fn)) {
+	StringBuffer fnold;
+	sprintf(fnold,"%s~",fn);
+	fprintf(stderr,"Previous log file renamed %s\n\n",fnold);
+	rename(fn,fnold);
+    }
+    
+    if ( (log_fp = fopen(fn,"w")) == NULL )
+	crash("Cannot open log file %s\n",fn);
+    
+    fprintf(log_fp,";Log started: %s\n",date_str());
+    
+}
+
+static void close_log_file()
+{
+    fprintf(log_fp,";Log stopped: %s\n",date_str());
+    fclose(log_fp);
+}
+
+
+
+
+static int read_from_name(char *contigName)
+/*
+ *
+ */
+{
+    int i,j;
+    ar_file_rec ar_rec;
+    char buf[FILE_NAME_LENGTH+1];
+    
+    for (i=1; i<io.num_gels;i++) {
+	/* read next name */
+	read_ar(&io,i,&ar_rec);
+	/* copy name to first space */
+	for(j=0;j<FILE_NAME_LENGTH && ar_rec.lines.name[j]!=' ';j++)
+	    buf[j] = ar_rec.lines.name[j];
+	buf[j] = '\0';
+	/* if a match, return i */
+	if (strcmp(buf,contigName)==0) return i;
+	
+    }
+    return 0;
+}
+
+
+static int valid_read_number(int readNumber)
+/*
+ *
+ */
+{
+    return (readNumber < 1) || (readNumber >io.num_gels);
+}
+
+
+int find_contig(int readNumber)
+/*
+ *
+ */
+{
+    int i;
+    rl_file_rec cline;
+    int contigNum;
+    
+    for (i=0; i<io.num_contigs; i++) {
+	contigNum = io.max_gels-i-1;
+	read_rl(&io,contigNum,&cline);
+	if (cline.clines.left_end == readNumber) return contigNum;
+    }
+    return 0;
+}
+
+
+int left_most_read(int readNumber)
+/*
+ *
+ */
+{
+    int lmr = readNumber;
+    rl_file_rec line;
+    
+    read_rl(&io,lmr,&line);
+    while (line.lines.left_nbr) {
+	lmr = line.lines.left_nbr;
+	read_rl(&io,lmr,&line);
+    }
+    return lmr;
+}
+
+
+
+
+static int contig_from_read(int readNumber)
+/*
+ */
+{
+    if (valid_read_number(readNumber)) return 0;
+    return find_contig(left_most_read(readNumber));
+}
+
+
+
+
+static int check_contig(char *contigName)
+/*
+ * contigName is
+ * (a) number of a reading
+ * (b) a reading name prefixed a "/"
+ */
+{
+    int readNumber;
+    if (*contigName=='/')
+	readNumber = read_from_name(contigName+1);
+    else
+	readNumber = atoi(contigName);
+    
+    return contig_from_read(readNumber);
+}
+
+
+
+
+static void usage()
+/*
+ * Print out usage
+ */
+{
+    printf("Usage: cop [options]\n  where options are:\n  -p project\n  -v version\n  -c consensus_cutoff_percentage\n  -r raw_data_search_path\n  -h\n");
+    
+}
+
+
+static void help()
+/*
+ * Print out help and advice
+ */
+{
+    printf("Usage: cop [options]\n  where options are:\n");
+    printf("  -p project\n     Database project name. Prompted for if not supplied. Case is not\n     important.\n");
+    printf("  -v version\n     Database version. Prompted for if not supplied.\n");
+    printf("  -c consensus_cutoff_percentage\n     Default is 100(%%)\n");
+    printf("  -r raw_data_search_path\n     Where to look if trace files are not found in the present working\n     directory. Default is that specified by environment variable\n     RAWDATA.\n");
+    printf("  -C contig_name\n     Only check a named contig\n");
+    printf("  -h\n     Print out this help\n");
+    printf("\nExample: cop -p f59b2 -v 0 -c 66 -r ~mmm/F59B2\n     Run cop on project F59B2 version 0 with consensus cutoff\n     percentage 66%, looking for trace files in ~mmm/F59B2.\n");
+}
+
+
+
+main(int argc, char**argv)
+/*
+ * COP - error checking program
+ *
+ * Usage: cop [options]
+ *   where options are:
+ *   -p project
+ *   -v version
+ *   -c consensus_cutoff_percentage
+ *   -r raw_data_search_path
+ *   -C contig
+ *   -h
+ *
+ * Example:
+ * 	cop -p f59b2 -v r -c 100
+ *
+ */
+{
+    StringBuffer projectName;
+    StringBuffer versionNumber;
+    
+    int c;
+    extern char *optarg;
+    extern int optint;
+    int contigNum;
+    
+    int p_opt=0, v_opt=0, c_opt=0, r_opt=0, h_opt=0, C_opt=0;
+    int err_opt=0;
+    
+    while ( (c = getopt(argc,argv,"hp:v:c:r:C:")) != -1 )
+	switch (c) {
+	case 'h':
+	    h_opt++;
+	    break;
+	case 'p':
+	    strcpy(projectName, optarg);
+	    p_opt++;
+	    break;
+	case 'v':
+	    strcpy(versionNumber, optarg);
+	    v_opt++;
+	    break;
+	case 'c':
+	    consensusCutoff = atoi(optarg);
+	    c_opt++;
+	    break;
+	case 'r':
+	    strcpy(rawData, optarg);
+	    r_opt++;
+	    break;
+	case 'C':
+	    strcpy(contigName, optarg);
+	    C_opt++;
+	    break;
+	case '?':
+	    err_opt++;
+	    break;
+	}
+    
+    if (err_opt) {
+	usage();
+	exit(1);
+    }
+    
+#ifdef BAP_VERSION
+    printf("COP v1.2: Check Out Project\nChecks xbap database for errors\n\n");
+#else
+    printf("COP v1.2: Check Out Project\nChecks xdap database for errors\n\n");
+#endif
+    if (h_opt) {
+	help();
+	exit(0);
+    }
+    
+    if (! p_opt) {
+	printf("Project name ? ");
+	gets(projectName);
+    }
+    
+    if (! v_opt) {
+	printf("Version ? ");
+	gets(versionNumber);
+    }
+    
+    if (! c_opt) {
+	consensusCutoff = 100;
+	printf("Consensus cutoff = %d%%\n",consensusCutoff);
+    }
+    
+    if (! r_opt) {
+	char *r;
+	r = getenv("RAWDATA");
+	if (r != NULL)
+	    strcpy(rawData,r);
+	else
+	    strcpy(rawData,".");
+	printf("Trace directory = %s\n",rawData);
+    }
+    
+    printf("\n");
+    
+    fn_toupper(versionNumber);
+    fn_toupper(projectName);
+    
+    /*
+     * Open files
+     */
+    open_for_read(&io,projectName,versionNumber);
+    open_log_file(projectName,versionNumber);
+    
+    if (io.data_class != 5)
+	crash ("Database must be for DNA only\n");
+    
+    /* initialisations */
+    inits();    /* for consensus calculations */
+    initpam2(); /* for alignments */
+    
+    if (!C_opt) {
+	printf("Check which contig? [all] ");
+	gets(contigName);
+	C_opt++;
+    }
+    
+    if (strlen(contigName)==0 || strcmp(contigName,"all")==0) C_opt=0;
+    
+    if (C_opt) {
+	contigNum = check_contig(contigName);
+	if (contigNum==0)
+	    crash("cop: Invalid contig number/name '%s'\n",contigName);
+	process_contig(contigNum);
+    } else
+	process();
+    
+    close_files(&io);
+    close_log_file();
+}
+
+
diff --git a/src/cop/dapIO.c b/src/cop/dapIO.c
new file mode 100644
index 0000000..906f176
--- /dev/null
+++ b/src/cop/dapIO.c
@@ -0,0 +1,376 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "dapIO.h"
+#include "misc.h"
+
+/*
+** Tag File IO
+*/
+void dap_read_tg(DapIO *io, int rec, dap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    if ( fseek(f,(off_t)dap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_tg_file_rec), 1, f) != 1)
+	crash("Read failure on tag file\n");
+}
+
+void dap_write_tg(DapIO *io, int rec, dap_tg_file_rec *t)
+{
+    FILE *f = io->tg_fp;
+    if ( fseek(f,(off_t)dap_tg_byte_index(io,rec),0) )
+	crash("Seek failure on tag file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_tg_file_rec), 1, f) != 1)
+	crash("Write failure on tag file\n");
+}
+
+
+
+/*
+** Archive File IO
+*/
+void dap_read_ar(DapIO *io, int rec, dap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)dap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_ar_file_rec), 1, f) != 1)
+	crash("Read failure on archive file\n");
+}
+
+void dap_write_ar(DapIO *io, int rec, dap_ar_file_rec *t)
+{
+    FILE *f = io->ar_fp;
+    if ( fseek(f,(off_t)dap_ar_byte_index(io,rec),0) )
+	crash("Seek failure on archive file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_ar_file_rec), 1, f) != 1)
+	crash("Write failure on archive file\n");
+}
+
+
+
+
+/*
+** Relationship file IO
+*/
+void dap_read_rl(DapIO *io, int rec, dap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    if ( fseek(f,(off_t)dap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_rl_file_rec), 1, f) != 1)
+	crash("Read failure on relationships file\n");
+}
+
+void dap_write_rl(DapIO *io, int rec, dap_rl_file_rec *t)
+{
+    FILE *f = io->rl_fp;
+    if ( fseek(f,(off_t)dap_rl_byte_index(io,rec),0) )
+	crash("Seek failure on relationships file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_rl_file_rec), 1, f) != 1)
+	crash("Write failure on relationships file\n");
+}
+
+
+
+
+
+/*
+** Comment file IO
+*/
+void dap_read_cc(DapIO *io, int rec, dap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    if ( fseek(f,(off_t)dap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fread(t, sizeof(dap_cc_file_rec), 1, f) != 1)
+	crash("Read failure on comment file, record %d\n",rec);
+}
+
+void dap_write_cc(DapIO *io, int rec, dap_cc_file_rec *t)
+{
+    FILE *f = io->cc_fp;
+    if ( fseek(f,(off_t)dap_cc_byte_index(io,rec),0) )
+	crash("Seek failure on comment file, record %d\n",rec);
+
+    if ( fwrite(t, sizeof(dap_cc_file_rec), 1, f) != 1)
+	crash("Crash failure on comment file, record %d\n",rec);
+}
+
+
+
+/*
+** Sequence file IO
+*/
+void dap_read_sq(DapIO *io, int rec, dap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)dap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fread(t, io->max_gel_length, 1, f) != 1)
+	crash("Read failure on sequence file\n");
+}
+
+void dap_write_sq(DapIO *io, int rec, dap_sq_file_rec t)
+{
+    FILE *f = io->sq_fp;
+    if ( fseek(f,(off_t)dap_sq_byte_index(io,rec),0) )
+	crash("Seek failure on sequence file, record %d\n",rec);
+
+    if ( fwrite(t, io->max_gel_length, 1, f) != 1)
+	crash("Write failure on sequence file\n");
+}
+
+
+
+/*
+** Comment IO - Strings
+*/
+char *dap_read_comment(DapIO *io, int_4 cp)
+{
+    dap_cc_file_rec c;
+    int count;
+    int_4 nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    dap_read_cc(io, nc, &c);
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        dap_read_cc(io, nc, &c);
+    }
+
+    com = comptr = (char *)malloc(count * DAP_COMMENT_SIZE+1);
+    nc=cp;
+    dap_read_cc(io, nc, &c);
+    strncpy(com,c.lines.comment,DAP_COMMENT_SIZE); com+=DAP_COMMENT_SIZE;
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        dap_read_cc(io, nc, &c);
+        strncpy(com,c.lines.comment,DAP_COMMENT_SIZE); com+=DAP_COMMENT_SIZE;
+    }
+
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+#ifdef nodef
+static int_4 get_free_comment(DapIO *io)
+{
+    dap_cc_file_rec head;
+    dap_cc_file_rec freerec;
+    int_4 free_id;
+    dap_read_cc(io,dap_cc_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	dap_read_cc(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	dap_write_cc(io,dap_cc_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	dap_write_cc(io,dap_cc_header_rec(io),&head);
+	dap_write_cc(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+static int_4 get_free_tag(DapIO *io)
+{
+    dap_tg_file_rec head;
+    dap_tg_file_rec freerec;
+    int_4 free_id;
+    dap_read_tg(io,dap_tg_header_rec(io),&head);
+    if (head.header.free_list != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.header.free_list;
+	dap_read_tg(io,free_id,&freerec);
+	head.header.free_list = freerec.lines.next;
+	dap_write_tg(io,dap_tg_header_rec(io),&head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.header.count;
+	dap_write_tg(io,dap_tg_header_rec(io),&head);
+	dap_write_tg(io,free_id,&freerec);
+    }
+
+    return free_id;
+}
+
+
+
+
+
+static void insert_tag(DapIO *io, int_4 gel, dap_tg_file_rec t)
+{
+    int_4 next, last;
+    int_4 free;
+    dap_tg_file_rec tg,last_tg;
+
+    last = gel;
+    dap_read_tg(io,last,&last_tg);
+
+    next = last_tg.lines.next;
+    if (next) dap_read_tg(io,next,&tg);
+
+    while (next && tg.lines.position <= t.lines.position) {
+	last = next;
+	last_tg = tg;
+	next = tg.lines.next;
+	if (next) dap_read_tg(io,next,&tg);
+    }
+
+    /* insert after last */
+    free = get_free_tag(io);
+    t.lines.next = next;
+    last_tg.lines.next = free;
+    dap_write_tg(io,last,&last_tg);
+    dap_write_tg(io,free,&t);
+
+}
+
+
+static int_4 write_comment(DapIO *io, char *c)
+{
+    dap_cc_file_rec com;
+    int_4 cur,next,this_comment;
+    int clen = strlen(c);
+    int piece;
+
+    /* write out first block of DAP_COMMENT_SIZE */
+    this_comment=cur=get_free_comment(io);
+    if (clen>DAP_COMMENT_SIZE)
+	piece = DAP_COMMENT_SIZE;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<DAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+    strncpy(com.lines.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment(io);
+	com.lines.next = next;
+	dap_write_cc(io,cur,&com);
+	cur = next;
+	if (clen<DAP_COMMENT_SIZE)
+	    piece = clen;
+
+	{int i; for(i=0;i<DAP_COMMENT_SIZE;i++)com.lines.comment[i]=' ';}
+	strncpy(com.lines.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.lines.next = 0;
+    if (piece!=DAP_COMMENT_SIZE)
+	com.lines.comment[piece]='\0';
+    dap_write_cc(io,cur,&com);
+
+    return this_comment;
+}
+#endif /*nodef*/
+
+static void set_file_names(DapIO *io, char *name, char *version)
+{
+    strcpy(io->ar_file,name); strcat(io->ar_file,".AR"); strcat(io->ar_file,version);
+    strcpy(io->rl_file,name); strcat(io->rl_file,".RL"); strcat(io->rl_file,version);
+    strcpy(io->sq_file,name); strcat(io->sq_file,".SQ"); strcat(io->sq_file,version);
+    strcpy(io->tg_file,name); strcat(io->tg_file,".TG"); strcat(io->tg_file,version);
+    strcpy(io->cc_file,name); strcat(io->cc_file,".CC"); strcat(io->cc_file,version);
+}
+
+static void dap_open_files(DapIO *io, char *name, char *version, char *mode)
+/*
+**
+*/
+{
+
+    /*
+    ** Create file names
+    */
+    set_file_names(io,name,version);
+
+    /*
+    ** Open files
+    */
+    if ( ( io->ar_fp = fopen(io->ar_file,mode) ) == NULL )
+	crash("No archive file %s\n",io->ar_file);
+    if ( ( io->rl_fp = fopen(io->rl_file,mode) ) == NULL )
+	crash("No relationships file %s\n",io->rl_file);
+    if ( ( io->sq_fp = fopen(io->sq_file,mode) ) == NULL )
+	crash("No sequence file %s\n",io->sq_file);
+    if ( ( io->tg_fp = fopen(io->tg_file,mode) ) == NULL )
+	crash("No tag file %s\n",io->tg_file);
+    if ( ( io->cc_fp = fopen(io->cc_file,mode) ) == NULL )
+	crash("No tag-comment file %s\n",io->cc_file);
+
+}
+
+
+void dap_open_for_read(DapIO *io, char *name, char *version)
+{
+    dap_ar_file_rec ar_header;
+    dap_rl_file_rec rl_header;
+
+    dap_open_files(io,name,version,"rb");
+    
+    dap_read_ar(io,dap_ar_header_rec(io),&ar_header);
+    io->max_gels = ar_header.header.idbsiz;
+    io->max_gel_length = ar_header.header.maxgel;
+    io->data_class = ar_header.header.idm;
+
+    dap_read_rl(io,dap_rl_header_rec(io),&rl_header);
+    io->num_gels = rl_header.header.num_gels;
+    io->num_contigs = rl_header.header.num_contigs;
+}
+
+void dap_open_for_write(DapIO *io, char *name, char *version)
+{
+    dap_open_files(io,name,version,"r+b");
+}
+
+
+
+void dap_close_files(DapIO *io)
+/*
+** Close all relevant files
+*/
+{
+
+    fclose(io->ar_fp);
+    fclose(io->rl_fp);
+    fclose(io->sq_fp);
+    fclose(io->tg_fp);
+    fclose(io->cc_fp);
+
+}
+
diff --git a/src/cop/dapIO.h b/src/cop/dapIO.h
new file mode 100644
index 0000000..8d8dca5
--- /dev/null
+++ b/src/cop/dapIO.h
@@ -0,0 +1,184 @@
+#ifndef _dapIO_h
+#define _dapIO_h
+
+#include "newtypes.h"
+
+/*
+** Definition of dap database files
+*/
+
+
+
+/*
+** Archive file (*.AR?)
+*/
+#define DAP_FILE_NAME_LENGTH 12
+typedef union {
+
+    struct _ar_header {
+	int_4 idbsiz;
+	int_4 maxgel;
+	int_4 idm;
+    } header;
+
+    struct _ar_lines {
+	char name[DAP_FILE_NAME_LENGTH];
+    } lines;
+
+} dap_ar_file_rec;
+
+#define dap_ar_header_rec(I)  ( 1000 )
+#define dap_ar_byte_index(I,R) ( ((R)-1) * sizeof(dap_ar_file_rec) )
+
+
+
+
+
+
+
+/*
+** Relationships file (*.RL?)
+*/
+typedef union {
+
+    struct _rl_header { 
+	int_4 num_gels;
+	int_4 num_contigs;
+	int_4 spare1;
+	int_4 spare2;
+    } header;
+
+    struct _rl_lines {
+	int_4 rel_pos;
+	int_4 length;
+	int_4 left_nbr;
+	int_4 right_nbr;
+    } lines;
+
+    struct _rl_clines {
+	int_4 length;
+	int_4 spare3;
+	int_4 left_end;
+	int_4 right_end;
+    } clines;
+
+} dap_rl_file_rec;
+
+#define dap_rl_header_rec(I) ( (I)->max_gels )
+#define dap_rl_byte_index(I,R) ( ((R)-1) * sizeof(dap_rl_file_rec) )
+
+
+
+
+
+
+/*
+** Sequence file (*.SQ?)
+*/
+typedef char *dap_sq_file_rec;
+#define dap_sq_byte_index(I,R) ( ((R)-1) * ((I)->max_gel_length) )
+
+
+
+/*
+** Tag files (*.TG?)
+*/
+typedef union {
+    int i;
+    char c[4];
+} dap_tag_type;
+
+
+
+typedef union {
+
+    struct _tg_header {
+	int_4 count;
+	int_4 spare1;
+	int_4 spare2;
+	dap_tag_type spare3;
+	int_4 free_list;
+    } header;
+
+    struct _tg_lines {
+	int_4 position;
+	int_4 length;
+	int_4 comment;
+	dap_tag_type type;
+	int_4 next;
+    } lines;
+
+} dap_tg_file_rec;
+
+#define dap_tg_byte_index(I,R) ( ((R)-1) *sizeof(dap_tg_file_rec) )
+#define dap_tg_header_rec(I) ( (I)->max_gels )
+
+/*
+** Comment files (*.CC?)
+*/
+#define DAP_COMMENT_SIZE 40
+typedef union {
+
+    struct _cc_header {
+	int_4 free_list;
+	int_4 count;
+	char spare[DAP_COMMENT_SIZE - sizeof(int_4)];
+    } header;
+
+    struct _cc_lines {
+	int_4 next;
+	char comment[DAP_COMMENT_SIZE];
+    } lines;
+
+} dap_cc_file_rec;
+
+#define dap_cc_byte_index(I,R) ( ((R)-1) * sizeof(dap_cc_file_rec) )
+#define dap_cc_header_rec(I) (1)
+
+
+/*
+** Useful variables
+*/
+typedef char IOString[200];
+
+typedef struct {
+    int max_gels;
+    int num_gels;
+    int num_contigs;
+    int max_gel_length;
+    int data_class;
+    
+    FILE *ar_fp;
+    FILE *rl_fp;
+    FILE *sq_fp;
+    FILE *tg_fp;
+    FILE *cc_fp;
+
+    IOString ar_file;
+    IOString rl_file;
+    IOString sq_file;
+    IOString tg_file;
+    IOString cc_file;
+} DapIO;
+
+
+
+
+
+extern void dap_read_tg(DapIO *io, int rec, dap_tg_file_rec *t);
+extern void dap_write_tg(DapIO *io, int rec, dap_tg_file_rec *t);
+extern void dap_read_ar(DapIO *io, int rec, dap_ar_file_rec *t);
+extern void dap_write_ar(DapIO *io, int rec, dap_ar_file_rec *t);
+extern void dap_read_rl(DapIO *io, int rec, dap_rl_file_rec *t);
+extern void dap_write_rl(DapIO *io, int rec, dap_rl_file_rec *t);
+extern void dap_read_cc(DapIO *io, int rec, dap_cc_file_rec *t);
+extern void dap_write_cc(DapIO *io, int rec, dap_cc_file_rec *t);
+extern void dap_read_sq(DapIO *io, int rec, dap_sq_file_rec t);
+extern void dap_write_sq(DapIO *io, int rec, dap_sq_file_rec t);
+extern char *dap_read_comment(DapIO *io, int_4 cp);
+extern void dap_open_for_read(DapIO *io, char *name, char *version);
+extern void dap_open_for_write(DapIO *io, char *name, char *version);
+extern void dap_close_files(DapIO *io);
+
+
+#endif /* _dapIO_h */
diff --git a/src/cop/getSeq.c b/src/cop/getSeq.c
new file mode 100644
index 0000000..3b73cbf
--- /dev/null
+++ b/src/cop/getSeq.c
@@ -0,0 +1,48 @@
+#include <stdio.h> /* IMPORT: stderr */
+#include "seq.h"
+#include "seqIOABI.h"
+#include "seqIOALF.h"
+#include "seqIOSCF.h"
+#include "seqIOPlain.h"
+
+extern void *malloc(size_t s);
+
+Seq getSeq(char *filename, char *type)
+{
+    Seq seq;
+
+    if (is_SCF(filename)) {
+        seq = readSeqSCF(filename);
+    } else {
+	if (strcmp(type, "ABI")   == 0)
+	    seq = readSeqABI(filename);
+	else if (strcmp(type, "ALF")   == 0)
+	    seq = readSeqALF(filename);
+	else if (strcmp(type, "SCF")   == 0)
+	    seq = readSeqSCF(filename);
+	else
+	    seq = readSeqPlain(filename);
+    }
+
+    if (seq == NULLSeq ) {
+	fprintf(stderr,"Error reading %s trace file %s\n",type,filename);
+	return NULL;
+    }
+
+    return seq;
+}
+
+char *getSequence(Seq seq)
+{
+    char *bases;
+
+    if (seq == NULLSeq ) {
+	return NULL;
+    }
+
+    bases = malloc(seq->NorigBases+1);
+    strncpy(bases,seq->base,seq->NorigBases);
+    bases[seq->NorigBases] = '\0';
+
+    return bases;
+}
diff --git a/src/cop/llin.c b/src/cop/llin.c
new file mode 100644
index 0000000..86703a6
--- /dev/null
+++ b/src/cop/llin.c
@@ -0,0 +1,228 @@
+/* A PACKAGE FOR SEQUENCE COMPARISON WITH AFFINE WEIGHTS:
+     See include file "linear.h" for complete interface information. */
+
+#include "llin.h"
+#define XTERNAL
+#include "upam.gbl"
+#include "uascii.gbl"
+#define translate(A) (nascii[A])
+
+/* Globally passed params and macros */
+
+static int (*w)[32];				/* w = W */
+static int g, h, m;				/* g = G, h = H, m = g+h */
+
+#define gap(k)  ((k) <= 0 ? 0 : g+h*(k))	/* k-symbol indel cost */
+
+static int *sapp;				/* Current script append ptr */
+static int  last;				/* Last script op appended */
+
+						/* Append "Delete k" op */
+#define DEL(k)				{ if (last < 0)				    last = *(sapp-1) -= (k);		  else					    last = *sapp++ = -(k);		}
+						/* Append "Insert k" op */
+#define INS(k)				{ if (last < 0)				    { *(sapp-1) = (k); *sapp++ = last; }	  else					    last = *sapp++ = (k);		}
+
+#define REP { last = *sapp++ = 0; }		/* Append "Replace" op */
+
+
+/* diff(A,B,M,N,tb,te) returns the cost of an optimum conversion between
+   A[1..M] and B[1..N] that begins(ends) with a delete if tb(te) is zero
+   and appends such a conversion to the current script.                   */
+
+static int diff(A,B,M,N,tb,te) char *A, *B; int M, N; int tb, te;
+
+{ static int CC[NMAX+1], DD[NMAX+1];	/* Forward cost-only vectors */
+  static int RR[NMAX+1], SS[NMAX+1];	/* Reverse cost-only vectors */
+         int   midi, midj, type;	/* Midpoint, type, and cost */
+         int midc;
+
+{ register int   i, j;
+  register int c, e, d, s;
+           int t, *wa;
+
+/* Boundary cases: M <= 1 or N == 0 */
+
+  if (N <= 0)
+    { if (M > 0) DEL(M)
+      return gap(M);
+    }
+  if (M <= 1)
+    { if (M <= 0)
+        { INS(N);
+          return gap(N);
+        }
+      if (tb > te) tb = te;
+      midc = (tb+h) + gap(N);
+      midj = 0;
+      wa = w[translate(A[1])];
+      for (j = 1; j <= N; j++)
+        { c = gap(j-1) + wa[translate(B[j])] + gap(N-j);
+          if (c < midc)
+            { midc = c;
+              midj = j;
+            }
+        }
+      if (midj == 0)
+        { INS(N) DEL(1) }
+      else
+        { if (midj > 1) INS(midj-1)
+          REP
+          if (midj < N) INS(N-midj)
+        }
+      return midc;
+    }
+
+/* Divide: Find optimum midpoint (midi,midj) of cost midc */
+
+  midi = M/2;			/* Forward phase:                          */
+  CC[0] = 0;			/*   Compute C(M/2,k) & D(M/2,k) for all k */
+  t = g;
+  for (j = 1; j <= N; j++)
+    { CC[j] = t = t+h;
+      DD[j] = t+g;
+    }
+  t = tb;
+  for (i = 1; i <= midi; i++)
+    { s = CC[0];
+      CC[0] = c = t = t+h;
+      e = t+g;
+      wa = w[translate(A[i])];
+      for (j = 1; j <= N; j++)
+        { if ((c =   c   + m) < (e =   e   + h)) e = c;
+          if ((c = CC[j] + m) < (d = DD[j] + h)) d = c;
+          c = s + wa[translate(B[j])];
+          if (e < c) c = e;
+          if (d < c) c = d;
+          s = CC[j];
+          CC[j] = c;
+          DD[j] = d;
+        }
+    }
+  DD[0] = CC[0];
+
+  RR[N] = 0;			/* Reverse phase:                          */
+  t = g;			/*   Compute R(M/2,k) & S(M/2,k) for all k */
+  for (j = N-1; j >= 0; j--)
+    { RR[j] = t = t+h;
+      SS[j] = t+g;
+    }
+  t = te;
+  for (i = M-1; i >= midi; i--)
+    { s = RR[N];
+      RR[N] = c = t = t+h;
+      e = t+g;
+      wa = w[translate(A[i+1])];
+      for (j = N-1; j >= 0; j--)
+        { if ((c =   c   + m) < (e =   e   + h)) e = c;
+          if ((c = RR[j] + m) < (d = SS[j] + h)) d = c;
+          c = s + wa[translate(B[j+1])];
+          if (e < c) c = e;
+          if (d < c) c = d;
+          s = RR[j];
+          RR[j] = c;
+          SS[j] = d;
+        }
+    }
+  SS[N] = RR[N];
+
+  midc = CC[0]+RR[0];		/* Find optimal midpoint */
+  midj = 0;
+  type = 1;
+  for (j = 0; j <= N; j++)
+    if ((c = CC[j] + RR[j]) <= midc)
+      if (c < midc || CC[j] != DD[j] && RR[j] == SS[j])
+        { midc = c;
+          midj = j;
+        }
+  for (j = N; j >= 0; j--)
+    if ((c = DD[j] + SS[j] - g) < midc)
+      { midc = c;
+        midj = j;
+        type = 2;
+      }
+}
+
+/* Conquer: recursively around midpoint */
+
+  if (type == 1)
+    { diff(A,B,midi,midj,tb,g);
+      diff(A+midi,B+midj,M-midi,N-midj,g,te);
+    }
+  else
+    { diff(A,B,midi-1,midj,tb,0);
+      DEL(2);
+      diff(A+midi+1,B+midj,M-midi-1,N-midj,0,te);
+    }
+  return midc;
+}
+
+
+/* Interface and top level of comparator */
+
+int DIFF(A,B,M,N,W,G,H,S) char A[],B[]; int M,N; int W[][32],G,H; int S[];
+
+{ if (N > NMAX) return -1;	/* Error check */
+
+  w = W;			/* Setup global parameters */
+  g = G;
+  h = H;
+  m = g+h;
+  sapp = S;
+  last = 0;
+
+  return diff(A,B,M,N,g,g);	/* OK, do it */
+}
+
+
+/* Alignment display routine */
+
+static char ALINE[51], BLINE[51], CLINE[51];
+
+int DISPLAY(A,B,M,N,S) char A[], B[]; int M, N; int S[];
+{ register char *a, *b, *c;
+  register int   i,  j, op;
+           int   lines;
+
+  i = j = op = lines = 0;
+  a = ALINE;
+  b = BLINE;
+  c = CLINE;
+  while (i < M || j < N)
+    { if (op == 0 && *S == 0)
+        { op = *S++;
+          *a = nt[translate(A[++i])];
+          *b = nt[translate(B[++j])];
+          *c++ = (*a++ == *b++) ? ':' : ' ';
+        }
+      else
+        { if (op == 0)
+            op = *S++;
+          if (op > 0)
+            { *a++ = ' ';
+              *b++ = nt[translate(B[++j])];
+              op--;
+            }
+          else
+            { *a++ = nt[translate(A[++i])];
+              *b++ = ' ';
+              op++;
+            }
+          *c++ = '-';
+        }
+      if (a >= ALINE+50 || i >= M && j >= N)
+        { *a = *b = *c = '\0';
+          printf("\n%5d",50*lines++);
+          for (b = ALINE+10; b <= a; b += 10)
+            printf("    .    :");
+          if (b <= a+5)
+            printf("    .");
+          printf("\n     %s\n     %s\n     %s\n",ALINE,CLINE,BLINE);
+          a = ALINE;
+          b = BLINE;
+          c = CLINE;
+        }
+    }
+
+    return 0;
+}
+
diff --git a/src/cop/llin.h b/src/cop/llin.h
new file mode 100644
index 0000000..3b29774
--- /dev/null
+++ b/src/cop/llin.h
@@ -0,0 +1,62 @@
+/* A PACKAGE FOR SEQUENCE COMPARISON WITH AFFINE WEIGHTS:
+     Gene Myers, Dept. of Computer Science, U. of Arizona 85721 (10/3/87)
+
+#define NMAX <integer>
+
+    NMAX is a compilation constant giving the maximum input sequence length.
+    It is to be adjusted according to available memory.
+
+
+int DIFF(A,B,M,N,W,G,H,S) int M,N; char A[],B[]; int W[][128],G,H; int S[];
+
+    DIFF compares sequence A[1..M] with sequence B[1..N] and returns the
+    minimum conversion cost.  Costs are determined by the parameters W, G,
+    and H.  W[128][128] is an array giving replacement costs for each pair of
+    ASCII characters, e.g. W['a']['b'] is the cost of replacing 'a' by 'b'.
+    Be sure to set W['a']['a'] to zero if exact matches are to accrue no cost.
+    The cost of a k-symbol indel is the affine function G+Hk.
+
+    DIFF also has the side-effect of placing an encoding of an optimal
+    conversion in an integer array S[0..M+N-1] supplied by the caller.
+    The sequence of integers S[0], S[1], S[2], ... gives the editing
+    operations in a left-to-right conversion where integers encode
+    operations as follows:
+         0 => replace
+        -k => delete k symbols
+        +k => insert k symbols.
+    The script is guaranteed to have the properties:
+      (1)  Inserts are never followed by inserts.
+      (2)  Deletes are never followed by deletes or inserts.
+      (3)  A replacement followed by a k-gap is always preferred
+           to a k-gap followed by a replacement in the event that
+           both have the same cost.
+
+    DIFF returns -1.0 if NMAX isn't large enough.
+
+
+int DISPLAY(A,B,M,N,S) int M,N; char A[],B[]; int S[];
+
+    DISPLAY places on the standard output a display of the alignment
+    implied by the conversion S computed in the call DIFF(A,B,M,N,?,?,?,S).
+    For example:
+
+              0    .    :    .    :    .    :    .    :    .    :
+               ggcgtttcataccggcgagga  ctagagatcccagatgcagcctcgata
+               !-!!!!||||!!!!!!!!!!|--!!!!!|!!|!!||||!!-!!!!!!!!!
+               g cgttcataaccggcgaggtacctagacattcccagagc gcctcgata
+          
+             50    .    :    .    :    .
+               taggaagaa tc agcaacgatcggcatg
+               !|!||!!!!-!!-!!!!!!!!-!!|!-!!
+               tggacagaaatcgagcaacga cgac tg
+*/
+
+#ifdef BIGMEM
+#define NMAX 30000
+#else
+#define NMAX  3000
+#endif
+
+extern int   DIFF();
+extern int   DISPLAY();
+
diff --git a/src/cop/makefile-alpha b/src/cop/makefile-alpha
new file mode 100644
index 0000000..26c2255
--- /dev/null
+++ b/src/cop/makefile-alpha
@@ -0,0 +1,114 @@
+#
+# Makefile for COP (Check Out Project) - Alpha version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = cop cop-bap
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)/alpha-binaries
+CONVERTSRC = $(SRC)/../convert
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(CONVERTSRC)
+
+#
+# Alpha
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact $(INCLUDES)
+LD = cc
+LDFLAGS = -g -non_shared
+LIBFLAGS = 
+
+#
+# Main and miscellaneous routines
+#
+COPOBJ = \
+	cop.o
+
+COPBAPOBJ = \
+	cop-bap.o
+
+LIBS = \
+	$(MISCBIN)/misc.a
+
+#
+# For reading in xdap database
+#
+XDAPIO = \
+	dapIO.o
+
+XBAPIO = \
+	bapIO.o
+
+#
+# For reading in trace files
+#
+TRACES = \
+	getSeq.o \
+	seqIOABI.o \
+	seqIOALF.o \
+	seqIOSCF.o \
+	seqIOPlain.o \
+	opp.o \
+	seq.o \
+	fpoint.o\
+	mach-io.o
+
+#
+# For aligning sequences
+#
+ALIGN = \
+	llin.o
+
+#
+# The whole lot
+#
+OBJ = \
+	$(COPOBJ) \
+	$(XDAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+OBJBAP = \
+	$(COPBAPOBJ) \
+	$(XBAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+
+#
+# Cop
+#
+cop : $(OBJ)
+	$(LD) $(LDFLAGS) -o cop $(OBJ) $(LIBS) $(LIBFLAGS)
+
+cop-bap : $(OBJBAP)
+	$(LD) $(LDFLAGS) -o cop-bap $(OBJBAP) $(LIBS) $(LIBFLAGS)
+
+
+
+#
+# Special dependencies
+#
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(OBJ) $(PROGS) *~
+
+cop-bap.o: $(SRC)/cop.c
+	$(CC) -DBAP_VERSION $(CFLAGS) -o cop-bap.o -c $(SRC)/cop.c
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/cop/makefile-dec b/src/cop/makefile-dec
new file mode 100644
index 0000000..43cf53c
--- /dev/null
+++ b/src/cop/makefile-dec
@@ -0,0 +1,111 @@
+#
+# Makefile for COP (Check Out Project) - DEC Ultrix (mips) version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = cop cop-bap
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+CONVERTSRC = $(SRC)/../convert
+INCLUDES = -I/usr/include/mit -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(CONVERTSRC)
+
+#
+# Alliant
+#
+CC = c89
+CFLAGS = -g -common $(INCLUDES)
+
+#
+# Main and miscellaneous routines
+#
+COPOBJ = \
+	cop.o
+
+COPBAPOBJ = \
+	cop-bap.o
+
+LIBS = \
+	$(MISCBIN)/misc.a
+
+#
+# For reading in xdap database
+#
+XDAPIO = \
+	dapIO.o
+
+XBAPIO = \
+	bapIO.o
+
+#
+# For reading in trace files
+#
+TRACES = \
+	getSeq.o \
+	seqIOABI.o \
+	seqIOALF.o \
+	seqIOSCF.o \
+	seqIOPlain.o \
+	opp.o \
+	seq.o \
+	fpoint.o\
+	mach-io.o
+
+#
+# For aligning sequences
+#
+ALIGN = \
+	llin.o
+
+#
+# The whole lot
+#
+OBJ = \
+	$(COPOBJ) \
+	$(XDAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+OBJBAP = \
+	$(COPBAPOBJ) \
+	$(XBAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+
+#
+# Cop
+#
+cop : $(OBJ)
+	$(CC) $(CFLAGS) -o cop $(OBJ) $(LIBS)
+
+cop-bap : $(OBJBAP)
+	$(CC) $(CFLAGS) -o cop-bap $(OBJBAP) $(LIBS)
+
+
+
+#
+# Special dependencies
+#
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(OBJ) $(PROGS) *~
+
+cop-bap.o: $(SRC)/cop.c
+	$(CC) -DBAP_VERSION $(CFLAGS) -o cop-bap.o -c $(SRC)/cop.c
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/cop/makefile-sgi b/src/cop/makefile-sgi
new file mode 100644
index 0000000..bc75b6f
--- /dev/null
+++ b/src/cop/makefile-sgi
@@ -0,0 +1,114 @@
+#
+# Makefile for COP (Check Out Project) - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs made by this makefile
+#
+PROGS = cop cop-bap
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+CONVERTSRC = $(SRC)/../convert
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(CONVERTSRC)
+
+#
+# Sgi
+#
+CC = cc
+CFLAGS = -g -ansi -prototypes $(INCLUDES)
+LIBFLAGS = 
+
+#
+# Main and miscellaneous routines
+#
+COPOBJ = \
+	cop.o
+
+COPBAPOBJ = \
+	cop-bap.o
+
+LIBS = \
+	$(MISCBIN)/misc.a
+
+#
+# For reading in xdap database
+#
+XDAPIO = \
+	dapIO.o
+
+XBAPIO = \
+	bapIO.o
+
+#
+# For reading in trace files
+#
+TRACES = \
+	getSeq.o \
+	seqIOABI.o \
+	seqIOALF.o \
+	seqIOSCF.o \
+	seqIOPlain.o \
+	opp.o \
+	seq.o \
+	fpoint.o\
+	mach-io.o
+
+#
+# For aligning sequences
+#
+ALIGN = \
+	llin.o
+
+#
+# The whole lot
+#
+OBJ = \
+	$(COPOBJ) \
+	$(XDAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+OBJBAP = \
+	$(COPBAPOBJ) \
+	$(XBAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+
+#
+# Cop
+#
+cop : $(OBJ)
+	$(CC) $(CFLAGS) -o cop $(OBJ) $(LIBS) $(LIBFLAGS)
+
+cop-bap : $(OBJBAP)
+	$(CC) $(CFLAGS) -o cop-bap $(OBJBAP) $(LIBS) $(LIBFLAGS)
+
+
+
+#
+# Special dependencies
+#
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(OBJ) $(PROGS) *~
+
+cop-bap.o: $(SRC)/cop.c
+	$(CC) -DBAP_VERSION $(CFLAGS) -o cop-bap.o -c $(SRC)/cop.c
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/cop/makefile-solaris b/src/cop/makefile-solaris
new file mode 100644
index 0000000..99e2c8b
--- /dev/null
+++ b/src/cop/makefile-solaris
@@ -0,0 +1,115 @@
+#
+# Makefile for COP (Check Out Project) - SunOS version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = cop cop-bap
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+CONVERTSRC = $(SRC)/../convert
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(CONVERTSRC) -I/usr/openwin/include
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(INCLUDES)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+#
+# Main and miscellaneous routines
+#
+COPOBJ = \
+	cop.o
+
+COPBAPOBJ = \
+	cop-bap.o
+
+LIBS = \
+	$(MISCBIN)/misc.a
+
+#
+# For reading in xdap database
+#
+XDAPIO = \
+	dapIO.o
+
+XBAPIO = \
+	bapIO.o
+
+#
+# For reading in trace files
+#
+TRACES = \
+	getSeq.o \
+	seqIOABI.o \
+	seqIOALF.o \
+	seqIOSCF.o \
+	seqIOPlain.o \
+	opp.o \
+	seq.o \
+	fpoint.o\
+	mach-io.o
+
+#
+# For aligning sequences
+#
+ALIGN = \
+	llin.o
+
+#
+# The whole lot
+#
+OBJ = \
+	$(COPOBJ) \
+	$(XDAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+OBJBAP = \
+	$(COPBAPOBJ) \
+	$(XBAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+
+#
+# Cop
+#
+cop : $(OBJ)
+	$(CC) $(CFLAGS) -o cop $(OBJ) $(LIBS) $(LIBFLAGS)
+
+cop-bap : $(OBJBAP)
+	$(CC) $(CFLAGS) -o cop-bap $(OBJBAP) $(LIBS) $(LIBFLAGS)
+
+
+
+#
+# Special dependencies
+#
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(OBJ) $(PROGS) *~
+
+cop-bap.o: $(SRC)/cop.c
+	$(CC) -DBAP_VERSION $(CFLAGS) -o cop-bap.o -c $(SRC)/cop.c
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/cop/makefile-sun b/src/cop/makefile-sun
new file mode 100644
index 0000000..e810f75
--- /dev/null
+++ b/src/cop/makefile-sun
@@ -0,0 +1,115 @@
+#
+# Makefile for COP (Check Out Project) - SunOS version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = cop cop-bap
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+CONVERTSRC = $(SRC)/../convert
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(CONVERTSRC)
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(INCLUDES)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+#
+# Main and miscellaneous routines
+#
+COPOBJ = \
+	cop.o
+
+COPBAPOBJ = \
+	cop-bap.o
+
+LIBS = \
+	$(MISCBIN)/misc.a
+
+#
+# For reading in xdap database
+#
+XDAPIO = \
+	dapIO.o
+
+XBAPIO = \
+	bapIO.o
+
+#
+# For reading in trace files
+#
+TRACES = \
+	getSeq.o \
+	seqIOABI.o \
+	seqIOALF.o \
+	seqIOSCF.o \
+	seqIOPlain.o \
+	opp.o \
+	seq.o \
+	fpoint.o\
+	mach-io.o
+
+#
+# For aligning sequences
+#
+ALIGN = \
+	llin.o
+
+#
+# The whole lot
+#
+OBJ = \
+	$(COPOBJ) \
+	$(XDAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+OBJBAP = \
+	$(COPBAPOBJ) \
+	$(XBAPIO) \
+	$(TRACES) \
+	$(ALIGN)
+
+
+#
+# Cop
+#
+cop : $(OBJ)
+	$(CC) $(CFLAGS) -o cop $(OBJ) $(LIBS) $(LIBFLAGS)
+
+cop-bap : $(OBJBAP)
+	$(CC) $(CFLAGS) -o cop-bap $(OBJBAP) $(LIBS) $(LIBFLAGS)
+
+
+
+#
+# Special dependencies
+#
+
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(OBJ) $(PROGS) *~
+
+cop-bap.o: $(SRC)/cop.c
+	$(CC) -DBAP_VERSION $(CFLAGS) -o cop-bap.o -c $(SRC)/cop.c
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
diff --git a/src/cop/uascii.gbl b/src/cop/uascii.gbl
new file mode 100644
index 0000000..f765d9b
--- /dev/null
+++ b/src/cop/uascii.gbl
@@ -0,0 +1,46 @@
+/*	ascii.gbl	ascii translation to amino acids */
+/*	modified 10-Mar-1987 for B, Z	*/
+
+#define NA 124
+#define EL 125
+#define ES 126
+
+#ifndef XTERNAL
+/*      0  1  2  3  4  5  6  7  8  9  10 11 12 13 14 15	*/
+int aascii[]={
+	EL,NA,NA,NA,NA,NA,NA,NA,NA,NA,EL,NA,NA,EL,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,ES,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	ES, 0,20, 4, 3, 6,13, 7, 8, 9,NA,11,10,12, 2,NA,
+	14, 5, 1,15,16,NA,19,17,22,18,21,NA,NA,NA,NA,NA,
+	ES, 0,20, 4, 3, 6,13, 7, 8, 9,NA,11,10,12, 2,NA,
+	14, 5, 1,15,16,NA,19,17,22,18,21,NA,NA,NA,NA,NA};
+
+int *sascii;
+#define AAMASK 127
+
+int nascii[]={
+/*	 0  1  2  3  5  6  7  8  9 10 11 12 13 14 15 15
+	 @  A  B  C  D  E  F  G  H  I  J  K  L  M  N  O
+	 P  Q  R  S  T  U  V  W  X  Y  Z		*/
+	EL,NA,NA,NA,NA,NA,NA,NA,NA,NA,EL,NA,NA,EL,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,
+	NA,NA,NA,NA,NA,NA,NA,NA,NA,NA,16,NA,NA,15,NA,NA,
+	ES,ES,ES,NA,NA,NA,NA,NA,NA,NA,NA,ES,NA,NA,ES,NA,
+	ES, 0,14, 1,11,NA,NA, 2,12,NA,NA,10,NA, 7,15,NA,
+	 5, 6, 5, 9, 3, 4,13, 8,16, 6,NA,NA,NA,NA,NA,NA,
+	ES, 0,14, 1,11,NA,NA, 2,12,NA,NA,10,NA, 7,15,NA,
+	 5, 6, 5, 9, 3, 4,13, 8,16, 6,NA,NA,NA,NA,NA,NA};
+
+#else
+extern int aascii[];
+extern int nascii[];
+extern int *sascii;
+
+#define AAMASK 127
+
+extern int lascii[];
+
+#define LAMASK 15
+#endif
diff --git a/src/cop/upam.gbl b/src/cop/upam.gbl
new file mode 100644
index 0000000..fd5f1a2
--- /dev/null
+++ b/src/cop/upam.gbl
@@ -0,0 +1,127 @@
+/*	20-June-1986	universal pam file */
+/* Changed by Rodger Staden and Simon Dear
+** to suit local preferences 9-April-1992
+*/
+
+#ifndef XTERNAL
+
+int gdelval= -1;
+int ggapval= -4;
+
+extern int gdelval, ggapval;
+
+
+#define EOSEQ 127
+#define MAXSQ 32
+
+
+char sqnam[]={"aa"};
+char sqtype[]={"protein"};
+
+char *sq;
+char aa[MAXSQ] = {"ARNDCQEGHILKMFPSTWYVBZX"};
+
+int naa = 23;
+int nsq;
+
+int haa[MAXSQ] = {0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,2,6,0};
+int *hsq;
+
+int apam[450] = {
+ 2,
+-2, 6,
+ 0, 0, 2,
+ 0,-1, 2, 4,
+-2,-4,-4,-5,12,
+ 0, 1, 1, 2,-5, 4,
+ 0,-1, 1, 3,-5, 2, 4,
+ 1,-3, 0, 1,-3,-1, 0, 5,
+-1, 2, 2, 1,-3, 3, 1,-2, 6,
+-1,-2,-2,-2,-2,-2,-2,-3,-2, 5,
+-2,-3,-3,-4,-6,-2,-3,-4,-2, 2, 6,
+-1, 3, 1, 0,-5, 1, 0,-2, 0,-2,-3, 5,
+-1, 0,-2,-3,-5,-1,-2,-3,-2, 2, 4, 0, 6,
+-4,-4,-4,-6,-4,-5,-5,-5,-2, 1, 2,-5, 0, 9,
+ 1, 0,-1,-1,-3, 0,-1,-1, 0,-2,-3,-1,-2,-5, 6,
+ 1, 0, 1, 0, 0,-1, 0, 1,-1,-1,-3, 0,-2,-3, 1, 2,
+ 1,-1, 0, 0,-2,-1, 0, 0,-1, 0,-2, 0,-1,-3, 0, 1, 3,
+-6, 2,-4,-7,-8,-5,-7,-7,-3,-5,-2,-3,-4, 0,-6,-2,-5,17,
+-3,-4,-2,-4, 0,-4,-4,-5, 0,-1,-1,-4,-2, 7,-5,-3,-3, 0,10,
+ 0,-2,-2,-2,-2,-2,-2,-1,-2, 4, 2,-2, 2,-1,-1,-1, 0,-6,-2, 4,
+ 0,-1, 2, 3,-4, 1, 2, 0, 1,-2,-3, 1,-2,-5,-1, 0, 0,-5,-3,-2, 2,
+ 0, 0, 1, 3,-5, 3, 3,-1, 2,-2,-3, 0,-2,-5, 0, 0,-1,-6,-4,-2, 2, 3,
+ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0};
+
+/*	DNA alphabet
+
+	A, C, G, T U
+	R, Y
+	M (A or C)	6
+	W (A or T)	7
+	S (C or G)	8
+	K (G or T)	9
+	D (not C)	10
+	H (not G)	11
+	V (not T)	12
+	B (not A)	13
+	N X 		14
+*/
+
+char nt[MAXSQ]={"ACGTURYMWSKDHVB-*"};
+
+int nnt = 17;
+
+int hnt[MAXSQ] = {0,1,2,3,3,0,1,0,0,1,2,0,0,0,1,0,0};
+
+int npam[450] = {
+/*       A  C  G  T  U  R  Y  M  W  S  K  D  H  V  B  -  *  */
+	 4,						/* A */
+	-3, 4,						/* C */
+	-3,-3, 4,					/* G */
+	-3,-3,-3, 4,					/* T */
+	-3,-3,-3, 4, 4,					/* U */
+	 2,-1, 2,-1,-1, 2,				/* R (A G)*/
+	-1, 2,-1, 2, 2,-2, 2,				/* Y (C T)*/
+	 2, 2,-1,-1,-1, 0, 0, 2,			/* M (A C)*/
+	 2,-1,-1, 2, 2, 1, 1, 1, 2,			/* W (A T)*/
+	-1, 2, 2,-1,-1, 1, 1, 1,-1, 2,			/* S (C G)*/
+	-1,-1, 2, 2, 2, 1, 1,-1, 1, 1, 2,		/* K (G T)*/
+	 1,-2, 1, 1, 1, 1, 0, 0, 1, 0, 1, 1,		/* D (!C) */
+	 1, 1,-2, 1, 1, 0, 1, 1, 1, 0, 0, 0, 1,		/* H (!G) */
+	 1, 1, 1,-2,-1, 1, 0, 1, 0, 1, 0, 0, 0, 1,	/* V (!T) */
+	-2, 1, 1, 1, 1, 0, 1, 0, 0, 1, 1, 0, 0, 0, 1,	/* B (!A) */
+	 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 3, /* - */
+	 3, 3, 3, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0}; /* * */
+/*       A  C  G  T  U  R  Y  M  W  S  K  D  H  V  B  N  */
+
+int *pam;
+int pam2[MAXSQ][MAXSQ];
+int pamh1[MAXSQ];		/* used for kfact replacement */
+
+#else
+#define MAXSQ 32
+
+extern char sqnam[];
+extern char sqtype[];
+extern int gdelval, ggapval;
+extern	char aa[];
+extern	char nt[];
+extern 	char *sq;
+
+#define EOSEQ 127
+
+extern  int naa;
+extern  int nnt;
+extern	int nsq;
+
+extern  int hnt[];
+extern  int haa[];
+extern	int *hsq;
+
+extern  int apam[];
+extern	int npam[];
+extern	int *pam;
+
+extern  int pam2[MAXSQ][MAXSQ];
+extern	int pamh1[];
+#endif
diff --git a/src/cop/xdap_db.c b/src/cop/xdap_db.c
new file mode 100644
index 0000000..bae98e0
--- /dev/null
+++ b/src/cop/xdap_db.c
@@ -0,0 +1,213 @@
+#include <stdio.h>
+#include "xdap_db.h"
+
+
+int maxgel;
+
+typedef char IOString[200];
+
+
+FILE *ar_fp;
+FILE *rl_fp;
+FILE *sq_fp;
+FILE *tg_fp;
+FILE *cc_fp;
+
+static IOString ar_file;
+static IOString rl_file;
+static IOString sq_file;
+static IOString tg_file;
+static IOString cc_file;
+
+int max_gels;
+int num_gels;
+int num_contigs;
+int max_gel_length;
+int data_class;
+
+static ar_file_rec ar_header;
+static rl_file_rec rl_header;
+
+
+read_tg(FILE *f, int rec, tg_file_rec *t)
+{
+
+    if ( fseek(f,tg_byte_index(rec),0) ) {
+	fprintf(stderr,"Seek failure on tag file, record %d\n",rec);
+	exit (1);
+    }
+
+    if ( fread(t, sizeof(tg_file_rec), 1, f) != 1) {
+	fprintf(stderr, "Read failure on tag file\n");
+	exit (1);
+    }
+
+}
+
+
+
+read_ar(FILE *f, int rec, ar_file_rec *t)
+{
+
+    if ( fseek(f,ar_byte_index(rec),0) ) {
+	fprintf(stderr,"Seek failure on archive file, record %d\n",rec);
+	exit (1);
+    }
+
+    if ( fread(t, sizeof(ar_file_rec), 1, f) != 1) {
+	fprintf(stderr, "Read failure on archive file\n");
+	exit (1);
+    }
+
+}
+
+
+
+
+read_rl(FILE *f, int rec, rl_file_rec *t)
+{
+
+    if ( fseek(f,rl_byte_index(rec),0) ) {
+	fprintf(stderr,"Seek failure on relationships file, record %d\n",rec);
+	exit (1);
+    }
+
+    if ( fread(t, sizeof(rl_file_rec), 1, f) != 1) {
+	fprintf(stderr, "Read failure on relationships file\n");
+	exit (1);
+    }
+
+}
+
+
+read_cc(FILE *f, int rec, cc_file_rec *t)
+{
+
+    if ( fseek(f,cc_byte_index(rec),0) ) {
+	fprintf(stderr,"Seek failure on comment file, record %d\n",rec);
+	exit (1);
+    }
+
+    if ( fread(t, sizeof(cc_file_rec), 1, f) != 1) {
+	fprintf(stderr, "Read failure on comment file, record %d\n",rec);
+	exit (1);
+    }
+
+}
+
+
+read_sq(FILE *f, int rec, sq_file_rec t)
+{
+#define sq_byte_index(R) ( ((R)-1) * max_gel_length )
+    if ( fseek(f,sq_byte_index(rec),0) ) {
+	fprintf(stderr,"Seek failure on sequence file, record %d\n",rec);
+	exit (1);
+    }
+
+    if ( fread(t, max_gel_length, 1, f) != 1) {
+	fprintf(stderr, "Read failure on sequence file\n");
+	exit (1);
+    }
+
+}
+
+
+
+char *read_comment(FILE *f, uint_4 cp)
+{
+    int dummy;
+    cc_file_rec c;
+    int count;
+    int i;
+    uint_4 nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    read_cc(f, nc, &c);
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        read_cc(f, nc, &c);
+    }
+
+    com = comptr = (char *)malloc(count * COMMENT_SIZE+1);
+    nc=cp;
+    read_cc(f, nc, &c);
+    strncpy(com,c.lines.comment,COMMENT_SIZE); com+=COMMENT_SIZE;
+    while (c.lines.next != 0) {
+	nc = c.lines.next;
+	count++;
+        read_cc(f, nc, &c);
+        strncpy(com,c.lines.comment,COMMENT_SIZE); com+=COMMENT_SIZE;
+    }
+
+    *com = '\0';
+
+    return comptr;
+    
+
+}
+
+
+
+static void set_file_names(char *name, char *version)
+{
+    strcpy(ar_file,name); strcat(ar_file,".AR"); strcat(ar_file,version);
+    strcpy(rl_file,name); strcat(rl_file,".RL"); strcat(rl_file,version);
+    strcpy(sq_file,name); strcat(sq_file,".SQ"); strcat(sq_file,version);
+    strcpy(tg_file,name); strcat(tg_file,".TG"); strcat(tg_file,version);
+    strcpy(cc_file,name); strcat(cc_file,".CC"); strcat(cc_file,version);
+}
+
+void open_for_read(char *name, char *version)
+/*
+**
+*/
+{
+    /*
+    ** Create file names
+    */
+    set_file_names(name,version);
+
+    /*
+    ** Open files
+    */
+    if ( ( ar_fp = fopen(ar_file,"r") ) == NULL )
+	crash("No archive file %s\n",ar_file);
+    if ( ( rl_fp = fopen(rl_file,"r") ) == NULL )
+	crash("No relationships file %s\n",rl_file);
+    if ( ( sq_fp = fopen(sq_file,"r") ) == NULL )
+	crash("No sequence file %s\n",sq_file);
+    if ( ( tg_fp = fopen(tg_file,"r") ) == NULL )
+	crash("No tag file %s\n",tg_file);
+    if ( ( cc_fp = fopen(cc_file,"r") ) == NULL )
+	crash("No tag-comment file %s\n",cc_file);
+    
+    read_ar(ar_fp,ar_header_rec(),&ar_header);
+#define rl_header_rec() (ar_header.header.idbsiz)
+    read_rl(rl_fp,rl_header_rec(),&rl_header);
+
+    max_gels = ar_header.header.idbsiz;
+    max_gel_length = ar_header.header.maxgel;
+    data_class = ar_header.header.idm;
+    num_gels = rl_header.header.num_gels;
+    num_contigs = rl_header.header.num_contigs;
+
+}
+
+void close_files()
+/*
+** Close all relevant files
+*/
+{
+
+    fclose(ar_fp);
+    fclose(rl_fp);
+    fclose(sq_fp);
+    fclose(tg_fp);
+    fclose(cc_fp);
+
+}
diff --git a/src/cop/xdap_db.h b/src/cop/xdap_db.h
new file mode 100644
index 0000000..684e31d
--- /dev/null
+++ b/src/cop/xdap_db.h
@@ -0,0 +1,156 @@
+#ifndef _xdap_db_h
+#define _xdap_db_h
+
+#include "mach-io.h"
+
+/*
+** Definition of xdap database files
+*/
+
+
+
+/*
+** Archive file (*.AR?)
+*/
+typedef union {
+
+    struct _ar_header {
+	uint_4 idbsiz;
+	uint_4 maxgel;
+	uint_4 idm;
+    } header;
+
+    struct _ar_lines {
+	char name[12];
+    } lines;
+
+} ar_file_rec;
+
+#define ar_header_rec()  ( 1000 )
+#define ar_byte_index(R) ( ((R)-1) *sizeof(ar_file_rec) )
+
+
+
+
+
+
+
+/*
+** Relationships file (*.RL?)
+*/
+typedef union {
+
+    struct _rl_header { 
+	uint_4 num_gels;
+	uint_4 num_contigs;
+	uint_4 spare1;
+	uint_4 spare2;
+    } header;
+
+    struct _rl_lines {
+	uint_4 rel_pos;
+	uint_4 length;
+	uint_4 left_nbr;
+	uint_4 right_nbr;
+    } lines;
+
+    struct _rl_clines {
+	uint_4 length;
+	uint_4 spare3;
+	uint_4 left_end;
+	uint_4 right_end;
+    } clines;
+
+} rl_file_rec;
+
+#define rl_byte_index(R) ( ((R)-1) * sizeof(rl_file_rec) )
+
+
+
+
+
+
+/*
+** Sequence file (*.SQ?)
+*/
+typedef char *sq_file_rec;
+
+
+
+/*
+** Tag files (*.TG?)
+*/
+typedef union {
+    int i;
+    char c[4];
+} TagType;
+
+
+
+typedef union {
+
+    struct _tg_header {
+	uint_4 count;
+	uint_4 spare1;
+	uint_4 spare2;
+	TagType spare3;
+	uint_4 free_list;
+    } header;
+
+    struct _tg_lines {
+	uint_4 position;
+	uint_4 length;
+	uint_4 comment;
+	TagType type;
+	uint_4 next;
+    } lines;
+
+} tg_file_rec;
+
+#define tg_byte_index(R) ( ((R)-1) *sizeof(tg_file_rec) )
+
+
+/*
+** Comment files (*.CC?)
+*/
+#define COMMENT_SIZE 40
+typedef union {
+
+    struct _cc_header {
+	uint_4 free_list;
+	uint_4 count;
+	char spare[COMMENT_SIZE - sizeof(uint_4)];
+    } header;
+
+    struct _cc_lines {
+	uint_4 next;
+	char comment[COMMENT_SIZE];
+    } lines;
+
+} cc_file_rec;
+
+#define cc_byte_index(R) ( ((R)-1) *sizeof(cc_file_rec) )
+
+
+
+/*
+** Useful variables
+*/
+extern int max_gels;
+extern int num_gels;
+extern int num_contigs;
+extern int max_gel_length;
+extern int data_class;
+
+extern FILE *ar_fp;
+extern FILE *rl_fp;
+extern FILE *sq_fp;
+extern FILE *tg_fp;
+extern FILE *cc_fp;
+
+
+
+
+
+#endif /* _xdap_db_h */
+
diff --git a/src/expGetSeq/README b/src/expGetSeq/README
new file mode 100644
index 0000000..6dc14a4
--- /dev/null
+++ b/src/expGetSeq/README
@@ -0,0 +1,34 @@
+expGetSeq
+---------
+
+This is the partial source of a program that reads a trace file, clips
+it, then outputs the sequence and clipped points in squirrel experiment
+file format.  The clipping is performed using LaDeana's quality
+measures. If a trace failes to meet the overall quality criterion, QL
+is 0 and QR is 1. Sources for this are in ../newted. Example of
+output:
+
+
+SQ
+     AG--TGCATG CCTGCAGGTC GACTCTAGAG G-TCCCGGGG TACCGAGCTC GAATTC-TAA
+     TCATGGTCAT AGCTGT-TCC TGTGTGAAAT T-TTATCCGC TCACAATTCC ACACAACATA
+     CGAGCCGGAA GCATAAAGTG TAAAGCCTGG GGTGCCTAAT GAGTGAGCTA ACTCACATTA
+     ATTGCGTTGC GCTCACTGCC CGCTTTCCAG TCGGGAAACC TGTCGTGCCA GCTGCATTAA
+     TGAATCGGCC AACGCGCGGG GAGAGGCGGT TTGCGTATTG GGCGCCAGGG TGGTTTTA-T
+     TTTCACCAGT GAGACGGGCA ACAGCTGATT GCCCTTCACC GCCTGGCCCT GAGAGGAGTT
+     GCAGCAAGCG GTCCACGCT- GTT-TGCCCC AGCAGGCG-A AATCCTGTTT TGATGGTTGG
+     TTCC-GAAAT CGGCAAAAAT CCCTTATAAA TTC-AAAGGA TAGG-CCGAG --TGGGG-T-
+     AGTTTTT-TT TCCGG-TTTG AACAAGT-GT TCCCCTGTT- AAAGGA-CGT GGG--TC-AA
+     CCGTT-AAA- GGG-G--AA- -CCGTTTT-T TTATGGG--A --GGC---TT -C--GTTCCG
+     TTA-CCTG-T T-AA-GTTTT TTTGGGGTT- TGG-G-CGTA AG-GCT-AAT -GGACC-TAA
+     TGG-GGCCC- GATTTTGGGT T-TGGGGGAA GCGGA--GAC GG-GG-TG-G TGGGTGTAT-
+     GGA-GTC--G -TTAGG-TGG GAAT--T-GG TG--T-GG-- ATCG-TT-G- -T--GGTTTT
+     T-GGT-T-TA GGGGTGGTT  --GTGTTTTT T-GTGT-TTG --TT--T
+//
+QL   8
+QR   329
+
+
+*** 28-Jul-92
+Overall quality checking has now been turned off.
+Reinvoke using the -DQUAL_CHECK compilation flag.
diff --git a/src/expGetSeq/getMCH.c b/src/expGetSeq/getMCH.c
new file mode 100644
index 0000000..84e0fce
--- /dev/null
+++ b/src/expGetSeq/getMCH.c
@@ -0,0 +1,259 @@
+/*
+    Title: 	 getMCH
+
+    File: 	 getMCH.c
+    Purpose:	 Main module
+    Last update: Thursday 11 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "seqRead.h"
+#include "mystdlib.h"  /* IMPORT: exit */
+#include "traceType.h"
+
+
+/*
+ * EMBL_STYLE
+ * Produce output in EMBL_STYLE output
+ */
+#define EMBL_STYLE
+
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   char * plain;
+    char * ABI;
+    char * ALF;
+    char * SCF;
+    char * output;
+    int bottom;
+    char * raw;
+    int header;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF) {tracefilename} [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       -output {outputfilename}\n");
+}
+
+
+
+/*
+** Hacked from sqeOutput.c
+*/
+#ifdef EMBL_STYLE
+#define BasesPerLine 60
+#else /*EMBL_STYLE*/
+#define BasesPerLine 50
+#endif /*EMBL_STYLE*/
+Boolean writeSeq(Seq seq, char *fn)
+     /*
+       Write the clipped, edited part of the ABI format sequence `seq'
+       into file `fn'. The result indicates success.
+       Currently, this just writes the bases out as text.
+       */
+{
+    FILE *fp;
+    int baseNum, lineLen;
+    
+    /* Open for writing, text */
+    if ((fp=fopen(fn, "w")) == NULL) return(False);
+    
+    lineLen = 0;
+    
+    /* write experiment file header */
+    fprintf(fp,"SQ\n");
+
+    /* write sequence */
+#ifdef EMBL_STYLE
+    for (baseNum=0; baseNum<seq->NedBases; baseNum++) {
+	if (lineLen==0) fputs("     ",fp);
+	fputc(getBase(seq, EdBases, baseNum), fp);
+	if (++lineLen == BasesPerLine) {
+	    fputc('\n', fp);
+	    lineLen = 0;
+	} else if ((lineLen % 10)==0) fputc(' ', fp);
+    }
+#else /*EMBL_STYLE*/
+    for (baseNum=0; baseNum<seq->NedBases; baseNum++) {
+	fputc(getBase(seq, EdBases, baseNum), fp);
+	if (++lineLen == BasesPerLine) {
+	    fputc('\n', fp);
+	    lineLen = 0;
+	}
+    }
+#endif /*EMBL_STYLE*/
+
+    if (lineLen != 0) fputc('\n', fp);
+
+    /* write experiment file header */
+    fprintf(fp,"//\n");
+
+    /* write cutoffs */
+#ifdef EMBL_STYLE
+    fprintf(fp,"CC   Quality clipping by newted May-1992\n");
+    fprintf(fp,"QL   %d\n",seq->leftCutoff);
+    fprintf(fp,"QR   %d\n",seq->NedBases-seq->rightCutoff+1);
+#else /*EMBL_STYLE*/
+    fprintf(fp,"CC Quality clipping by newted May-1992\n");
+    fprintf(fp,"QL %d\n",seq->leftCutoff);
+    fprintf(fp,"QR %d\n",seq->NedBases-seq->rightCutoff+1);
+#endif /*EMBL_STYLE*/
+    
+    (void) fclose(fp);
+    
+    return(True);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+void setScaleFactor(float f)
+/* a dummy routine */
+{ }
+
+void main(unsigned int argc, char **argv)
+{
+    Arguments args;
+    char *fn;
+    char *o_fn;
+    Seq currSeq;
+    int rc;
+    int count;
+    char *enzString = "TAGAGGCTCCCC";
+
+    /* parse command line arguments here */
+
+    count=1;
+
+    fn = NULL;
+    args.ABI = NULL;
+    args.ALF = NULL;
+    args.SCF = NULL;
+    args.output = NULL;
+    args.plain = NULL;
+
+    while (count < argc) {
+	char *a = argv[count];
+	if ( *a != '-') {
+            usage(argc, argv);
+fprintf(stderr,"not a - switch\n");
+	    exit(1);
+	}
+
+	if (strcmp(a,"-PLN") == 0) {
+	    args.plain = fn = argv[++count];
+	} else if (strcmp(a,"-ABI") == 0) {
+	    args.ABI = fn = argv[++count];
+	} else if (strcmp(a,"-ALF") == 0) {
+	    args.ALF = fn = argv[++count];
+	} else if (strcmp(a,"-SCF") == 0) {
+	    args.SCF = fn = argv[++count];
+	} else if (strcmp(a,"-output") == 0) {
+	    args.output = o_fn = argv[++count];
+	} else {
+            usage(argc, argv);
+fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+        count++;
+    }
+
+
+
+
+
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((args.plain==NULL &&
+	 args.ABI==NULL   &&
+	 args.ALF==NULL   &&
+	 args.SCF==NULL))
+    {   
+	usage(argc, argv);
+fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+    if ( args.output==NULL )
+    {   
+	usage(argc, argv);
+fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+
+    if (args.plain != NULL)
+	currSeq = readSeq(fn,0,"",trace_types[TT_PLN]);
+    else if (args.ABI != NULL)
+	currSeq = readSeq(fn,0,"",trace_types[TT_ABI]);
+    else if (args.ALF != NULL)
+	currSeq = readSeq(fn,0,"",trace_types[TT_ALF]);
+    else if (args.SCF != NULL)
+	currSeq = readSeq(fn,0,"",trace_types[TT_SCF]);
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",fn);
+	exit(1);
+    }
+
+    rc=writeSeq(currSeq, o_fn);
+
+    if (!rc) {
+	fprintf(stderr,"Error writing file %s\n",o_fn);
+	exit(1);
+    }
+
+
+
+    exit(0);
+
+}
+
+
diff --git a/src/expGetSeq/makefile-alpha b/src/expGetSeq/makefile-alpha
new file mode 100644
index 0000000..af3252c
--- /dev/null
+++ b/src/expGetSeq/makefile-alpha
@@ -0,0 +1,93 @@
+#
+# Makefile for expGetSeq - Alpha version
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE
+
+#
+# Programs made by this makefile
+#
+PROGS = expGetSeq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../newted
+INCLUDES = -I/usr/include/mit -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g -ieee_with_inexact $(SWITCHES) $(INCLUDES)
+LD= cc
+LDFLAGS= -g -non_shared
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o
+
+
+#
+# Linking
+#
+expGetSeq: $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(OBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/expGetSeq/makefile-dec b/src/expGetSeq/makefile-dec
new file mode 100644
index 0000000..6b0d401
--- /dev/null
+++ b/src/expGetSeq/makefile-dec
@@ -0,0 +1,91 @@
+#
+# Makefile for expGetSeq - DEC Ultrix (mips) fork
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE
+
+#
+# Programs made by this makefile
+#
+PROGS = expGetSeq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../newted
+INCLUDES = -I/usr/include/mit -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= c89
+CFLAGS= -g -common $(SWITCHES) $(INCLUDES)
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o
+
+
+#
+# Linking
+#
+expGetSeq: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/expGetSeq/makefile-sgi b/src/expGetSeq/makefile-sgi
new file mode 100644
index 0000000..f59faf2
--- /dev/null
+++ b/src/expGetSeq/makefile-sgi
@@ -0,0 +1,93 @@
+#
+# Makefile for getMCH - SGI fork
+#
+
+SHELL=/bin/sh
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE
+
+#
+# Programs made by this makefile
+#
+PROGS = expGetSeq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../newted
+INCLUDES = -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g $(SWITCHES) $(INCLUDES)
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o
+
+
+#
+# Linking
+#
+expGetSeq: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/expGetSeq/makefile-solaris b/src/expGetSeq/makefile-solaris
new file mode 100644
index 0000000..f553679
--- /dev/null
+++ b/src/expGetSeq/makefile-solaris
@@ -0,0 +1,91 @@
+#
+# Makefile for getMCH - Solaris version
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE
+
+#
+# Programs made by this makefile
+#
+PROGS = expGetSeq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../newted
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I/usr/openwin/include
+
+#
+# Compiler - debug on
+#
+CC= gcc
+CFLAGS= -g -ansi $(SWITCHES) $(INCLUDES)
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o
+
+
+#
+# Linking
+#
+expGetSeq: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) -static $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/expGetSeq/makefile-sun b/src/expGetSeq/makefile-sun
new file mode 100644
index 0000000..e26416e
--- /dev/null
+++ b/src/expGetSeq/makefile-sun
@@ -0,0 +1,91 @@
+#
+# Makefile for getMCH - SunOS fork
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE
+
+#
+# Programs made by this makefile
+#
+PROGS = expGetSeq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../newted
+INCLUDES = -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= gcc
+CFLAGS= -g -ansi $(SWITCHES) $(INCLUDES)
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o
+
+
+#
+# Linking
+#
+expGetSeq: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) -static $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/frog/README b/src/frog/README
new file mode 100644
index 0000000..bd0ebbd
--- /dev/null
+++ b/src/frog/README
@@ -0,0 +1,27 @@
+	  Frog: A Utility to Relabel Bases in ABI 373A Files
+			      Simon Dear
+			(sd@mrc-lmb.cam.ac.uk)
+			   20 November 1992
+
+
+Correcting ABI 373A traces that have had their bases labelled
+incorrectly is not a simple matter. The program "frog" is a utility to
+swap bases around.
+
+To use frog you will need to know the existing base lane order, and
+the correct base order. For example, the base order may currently be
+CAGT but examination of the sequence (eg a known cloning site) may
+reveal that it should gave been CATG. With this information, simply
+run frog as follows:
+
+    frog old new [ABI_files...]
+
+where "old" is the current (incorrect) lane ordering and "new" is the
+correct lane ordering.
+
+In the previous example we discovered that the T and G lanes have been
+inadvertantly switched. We could correct the file (trace.abi, say) by
+typing:
+
+    frog CAGT CATG trace.abi
+
diff --git a/src/frog/abi.c b/src/frog/abi.c
new file mode 100644
index 0000000..af95c2e
--- /dev/null
+++ b/src/frog/abi.c
@@ -0,0 +1,160 @@
+#include <stdio.h>
+#include "mach-io.h"
+
+typedef struct {
+	char id[12];
+	int_2 s1;
+	int_2 s2;
+	int_2 s3;
+	int_2 s4;
+	int_2 recs;
+	int_4 l1;
+	int_4 offset;
+	int_4 l2;
+} Header;
+
+typedef struct {
+	char id[4];
+	int_4 index;
+	int_2 type;
+	int_2 size;
+	int_4 N;
+	int_4 length;
+	int_4 ptr1;
+	int_4 ptr2;
+} Rec;
+
+/*
+** Meaningful ids: (Explanations c/o jes)
+**
+** GMBF       Gel type
+** DATA.1-4   Raw data block
+** DATA.5-8   Central (rather featureless) data block
+** DATA.9-12  Processed data block
+** FWO_       Base order (see seqIOABI.c)
+** PBAS.1-2   Base sequence
+** PLOC.1-2   Base positions
+** S/N%       Signal strengths (array of 4 shorts floats)
+** SMPL       Sample name
+** SPAC       Base spacing (float)
+**
+** It appears that if the value for an id occupies less than or equal four
+** bytes, it is shoe-horned into the ptr1 field. Otherwise, ptr1 holds
+** the byte offset in the file where the data can be found.
+** 
+**
+**
+**
+*/
+
+int read_header(FILE *fp, Header *h)
+{
+#define FREAD(R) \
+    if ( fread(&(h->R),sizeof(h->R),1,fp) == 0) \
+	return 1;
+
+    fseek(fp,0,0);
+    FREAD(id);
+    FREAD(s1);
+    FREAD(s2);
+    FREAD(s3);
+    FREAD(s4);
+    FREAD(recs);
+    FREAD(l1);
+    FREAD(offset);
+    FREAD(l2);
+    return 0;
+}
+
+int abi_dump(char *fn)
+{
+    FILE *fp;
+    Rec *recs;
+    Header header;
+    int i;
+    int reads;
+
+    if ( (fp=fopen(fn,"rb")) == NULL ) return 1;
+
+    if (read_header(fp,&header)) { fclose(fp); return 2;}
+
+    recs = (Rec *) malloc(header.recs * sizeof(Header));
+
+    fseek(fp,header.offset,0);
+    reads = fread(recs,sizeof(Rec),header.recs,fp);
+
+    if (reads!=header.recs) {
+	fclose(fp);
+	free(recs);
+	return 3;
+    }
+    
+    printf("file:%s\n",fn);
+    printf(" ID   indx type size   recs length pointer1 pointer2 Text...\n");
+    printf(" ---- ---- ---- ---- ------ ------ -------- -------- -------\n");
+    for (i=0; i<header.recs;i++) {
+	printf(" %4.4s %4d %04x %4d %6d %6d %08x %08x",
+	recs[i].id,
+	recs[i].index,
+	recs[i].type,
+	recs[i].size,
+	recs[i].N,
+	recs[i].length,
+	recs[i].ptr1,
+	recs[i].ptr2
+	);
+	switch(recs[i].type) {
+	case 0x12: /* string */
+	{
+	    char *s;
+	    char buf[32];
+	    int len;
+	    len = (recs[i].length > sizeof(buf)) ? sizeof(buf) : recs[i].length;
+	    if(recs[i].length<=4) {
+		/*
+		** stored in ptr1
+		*/
+		s = (char *) &recs[i].ptr1;
+	    } else {
+		/*
+		** stored in file at offset ptr1
+		** NOTE - byte at offset is string length
+		*/
+		fseek(fp,recs[i].ptr1,0);
+		if (len = fread(buf,1,len,fp)) len--;
+		s = buf+1;
+	    }
+	    printf(" %-32.*s\n",len,s,(len>32)?"...\n":"\n");
+	    break;
+	}
+	case 0x0c: /* character array? */
+	case 0x0d: /* bit array? */
+	case 0x04: /* integer array */
+	case 0x02: /* sequence ?*/
+	case 0x0a: /* date */ 
+	case 0x0b: /* time */ 
+	case 0x07: /* float */ 
+	default:
+	    printf("\n");
+	    break;
+
+	}
+    }
+
+    printf("\n");
+
+    free(recs);
+    fclose(fp);
+    return 0;
+}
+
+
+
+main(int argc, char **argv)
+{
+    while (--argc) {
+	int err;
+	argv++;
+	if (err=abi_dump(*argv)) printf("Couldn't read %s (%d)\n",*argv,err);
+    }
+}
diff --git a/src/frog/frog.c b/src/frog/frog.c
new file mode 100644
index 0000000..c5d16f5
--- /dev/null
+++ b/src/frog/frog.c
@@ -0,0 +1,272 @@
+/*
+ * File: frog.c
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *         Hills Road
+ *         Cambridge CB2 2QH
+ *         United Kingdom
+ *
+ * Description: utility to alter base lane ordering of an ABI file
+ *
+ * Created: 8 October 1992
+ * Updated:
+ *
+ */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+#include <sys/types.h>
+#include "mach-io.h"
+
+typedef struct {
+	char id[12];
+	uint_2 s1;
+	uint_2 s2;
+	uint_2 s3;
+	uint_2 s4;
+	uint_2 recs;
+	uint_4 l1;
+	uint_4 offset;
+	uint_4 l2;
+} Header;
+
+typedef struct {
+	char id[4];
+	uint_4 index;
+	uint_2 type;
+	uint_2 size;
+	uint_4 N;
+	uint_4 length;
+	uint_4 ptr1;
+	uint_4 ptr2;
+} Rec;
+
+/*
+** Meaningful ids: (Explanations c/o jes)
+**
+** GMBF       Gel type
+** DATA.1-4   Raw data block
+** DATA.5-8   Central (rather featureless) data block
+** DATA.9-12  Processed data block
+** FWO_       Base order (see seqIOABI.c)
+** PBAS.1-2   Base sequence
+** PLOC.1-2   Base positions
+** S/N%       Signal strengths (array of 4 shorts floats)
+** SMPL       Sample name
+** SPAC       Base spacing (float)
+**
+** It appears that if the value for an id occupies less than or equal four
+** bytes, it is shoe-horned into the ptr1 field. Otherwise, ptr1 holds
+** the byte offset in the file where the data can be found.
+** 
+**
+**
+**
+*/
+
+int read_header(FILE *fp, Header *h)
+{
+    if (fread(&h->id[0],12,1,fp) == 0) return 1;
+    if (be_read_int_2(fp,&h->s1)==0) return 1;
+    if (be_read_int_2(fp,&h->s2)==0) return 1;
+    if (be_read_int_2(fp,&h->s3)==0) return 1;
+    if (be_read_int_2(fp,&h->s4)==0) return 1;
+    if (be_read_int_2(fp,&h->recs)==0) return 1;
+    if (be_read_int_4(fp,&h->l1)==0) return 1;
+    if (be_read_int_4(fp,&h->offset)==0) return 1;
+    if (be_read_int_4(fp,&h->l2)==0) return 1;
+
+    return 0;
+}
+
+int read_record(FILE *fp, Rec *r)
+{
+    if (fread(&r->id[0],4,1,fp) == 0) return 1;
+    if (be_read_int_4(fp,&r->index)==0) return 1;
+    if (be_read_int_2(fp,&r->type)==0) return 1;
+    if (be_read_int_2(fp,&r->size)==0) return 1;
+    if (be_read_int_4(fp,&r->N)==0) return 1;
+    if (be_read_int_4(fp,&r->length)==0) return 1;
+    if (be_read_int_4(fp,&r->ptr1)==0) return 1;
+    if (be_read_int_4(fp,&r->ptr2)==0) return 1;
+
+    return 0;
+}
+
+int write_record(FILE *fp, Rec *r)
+{
+    if (fwrite(&r->id[0],4,1,fp) == 0) return 1;
+    if (be_write_int_4(fp,&r->index)==0) return 1;
+    if (be_write_int_2(fp,&r->type)==0) return 1;
+    if (be_write_int_2(fp,&r->size)==0) return 1;
+    if (be_write_int_4(fp,&r->N)==0) return 1;
+    if (be_write_int_4(fp,&r->length)==0) return 1;
+    if (be_write_int_4(fp,&r->ptr1)==0) return 1;
+    if (be_write_int_4(fp,&r->ptr2)==0) return 1;
+
+    return 0;
+}
+
+
+int fix_abi(char *fn,char *old, char *new)
+{
+    FILE *fp;
+    Header header;
+    Rec rec;
+    int i;
+    off_t pl,pn;
+
+    if ( (fp=fopen(fn,"r+b")) == NULL ) {
+        fprintf(stderr,"frog: %s: error opening file\n",fn);
+	return 1;
+    }
+
+    fseek(fp,0,0);
+    if (read_header(fp,&header)) {
+        fprintf(stderr,"frog: %s: error reading header\n",fn);
+        fclose(fp);
+        return 2;
+    }
+
+    pl = 0;
+    pn = header.offset;
+    for(i=0;i<header.recs;i++) {
+	pl = pn;
+	fseek(fp,pn,0);
+        if(read_record(fp,&rec)) {
+            fprintf(stderr,"frog: %s: error reading record\n",fn);
+            fclose(fp);
+            return 2;
+         }
+	pn = ftell(fp);
+
+        if(strncmp(rec.id,"FWO_",4)==0) {
+	    char map[4];
+	    char tmap[4];
+
+#define ind(base) ((base)=='C'?0:(base)=='A'?1:(base)=='G'?2:3)
+
+/*
+	    printf("Changing base to trace mapping (FWO_)\n");
+*/
+
+	    /*
+	     * map of lanes to old bases
+	     */
+	    tmap[ind('C')] = (rec.ptr1>>24)&255;
+	    tmap[ind('A')] = (rec.ptr1>>16)&255;
+	    tmap[ind('G')] = (rec.ptr1>> 8)&255;
+	    tmap[ind('T')] = rec.ptr1&255;
+	    /*
+	     * map of old bases to new bases
+	     */
+	    map[ind(old[0])] = new[0];
+	    map[ind(old[1])] = new[1];
+	    map[ind(old[2])] = new[2];
+	    map[ind(old[3])] = new[3];
+	    /*
+	     * map lanes to new bases
+	     */
+	    rec.ptr1 = (((map[ind(tmap[ind('C')])]<<8)+
+			  map[ind(tmap[ind('A')])]<<8)+
+			  map[ind(tmap[ind('G')])]<<8)+
+			  map[ind(tmap[ind('T')])];
+
+	    fseek(fp,pl,0);
+	    if (write_record(fp,&rec)) {
+                fprintf(stderr,"frog: %s: error writing record\n",fn);
+                fclose(fp);
+                return 2;
+	    }
+
+        } else if(strncmp(rec.id,"PBAS",4)==0) {
+	    char *seq;
+	    char map[4];
+	    int k;
+/*
+	    printf("Changing bases (PBAS)\n");
+*/
+	    seq = (char *) malloc(rec.length);
+	    /* read sequence */
+	    fseek(fp,rec.ptr1,0);
+	    if (fread(seq,rec.length,1,fp)!=1) {
+                fprintf(stderr,"frog: %s: error reading sequence\n",fn);
+                fclose(fp);
+                return 2;
+	    }
+	    /* swap bases */
+	    map[ind(old[0])] = new[0];
+	    map[ind(old[1])] = new[1];
+	    map[ind(old[2])] = new[2];
+	    map[ind(old[3])] = new[3];
+	    for (k=0;k<rec.length;k++) if (strchr("ACGT",seq[k])) seq[k] = map[ind(seq[k])];
+
+	    /* write back */
+	    fseek(fp,rec.ptr1,0);
+	    if (fwrite(seq,rec.length,1,fp)!=1) {
+                fprintf(stderr,"frog: %s: error writing sequence\n",fn);
+                fclose(fp);
+                return 2;
+	    }
+        }
+
+    }
+
+    fclose(fp);
+    return 0;
+}
+
+
+void usage()
+{
+    fprintf(stderr,"Usage: frog old new [ABI_files...]\n\n");
+    fprintf(stderr,"examples\n");
+    fprintf(stderr,"To swap all Gs with Ts and Ts with Gs\n    frog CAGT CATG trace.abi\n");
+    fprintf(stderr,"To swap all As with Cs, Cs with Gs and Gs with As\n    frog ACGT CGAT trace.abi\n");
+}
+
+int check(char *bases)
+{
+    int i;
+    int map[4];
+    if (strlen(bases)!=4) {
+	fprintf(stderr,"frog: should be four bases %s\n",bases);
+	return 1;
+    }
+
+    for(i=0;i<4;i++) map[i]=0;
+    for(i=0;i<4;i++) {
+	if (islower(bases[i])) bases[i]=toupper(bases[i]);
+        if(strchr("ACGT",bases[i])==NULL) {
+	    fprintf(stderr,"frog: invalid base '%c' in bases \"%s\"\n",bases[i],bases);
+	    return 1;
+	}
+	map[ind(bases[i])] = 1;
+    }
+    for(i=0;i<4;i++) {
+	if (map[i]==0) {
+	    fprintf(stderr,"frog: repeated bases in \"%s\"\n",bases);
+	    return 1;
+        }
+    }
+
+    return 0;
+}
+
+int main(int argc, char **argv)
+{
+    int i;
+
+    if (argc<3) {usage(); exit(1);}
+    if (check(argv[1])) exit(1);
+    if (check(argv[2])) exit(1);
+	
+    for(i=3;i<argc;i++) {
+	int err;
+	if (err=fix_abi(argv[i],argv[1],argv[2])) printf("Couldn't read %s (%d)\n",argv[i],err);
+    }
+    return 0;
+}
diff --git a/src/frog/mach-io.c b/src/frog/mach-io.c
new file mode 100644
index 0000000..9b78205
--- /dev/null
+++ b/src/frog/mach-io.c
@@ -0,0 +1,238 @@
+/*
+** Machine independant io:
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value suggessfully read or written
+*/
+
+#include <stdio.h>
+#include "mach-io.h"
+
+
+
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+int be_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a big-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[0] = (uint_1) (*i2>>8)&255;
+    buf[1] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(*i4)];
+
+
+    buf[0] = (uint_1) (*i4>>24)&255;
+    buf[1] = (uint_1) (*i4>>16)&255;
+    buf[2] = (uint_1) (*i4>>8)&255;
+    buf[3] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a big-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[1]) +
+         ((uint_2)buf[0]<<8));
+    return (1);
+}
+
+
+
+
+
+int be_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[3]) +
+         ((uint_4)buf[2]<<8) +
+         ((uint_4)buf[1]<<16) +
+         ((uint_4)buf[0]<<24));
+    return (1);
+}
+
+
+
+
+
+
+
+
+
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+int le_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a little-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[1] = (uint_1) (*i2>>8)&255;
+    buf[0] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    buf[3] = (uint_1) (*i4>>24)&255;
+    buf[2] = (uint_1) (*i4>>16)&255;
+    buf[1] = (uint_1) (*i4>>8)&255;
+    buf[0] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a little-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[0]) +
+         ((uint_2)buf[1]<<8));
+    return (1);
+}
+
+
+
+
+
+int le_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[0]) +
+         ((uint_4)buf[1]<<8) +
+         ((uint_4)buf[2]<<16) +
+         ((uint_4)buf[3]<<24));
+    return (1);
+}
diff --git a/src/frog/mach-io.h b/src/frog/mach-io.h
new file mode 100644
index 0000000..5783df3
--- /dev/null
+++ b/src/frog/mach-io.h
@@ -0,0 +1,115 @@
+#ifndef _mach_io_h
+#define _mach_io_h
+/*
+** Machine independant io
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value successfully read or written
+*/
+
+#include <stdio.h>
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+extern int be_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a big-endian int_1
+*/
+
+extern
+int be_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a big-endian int_2
+*/
+
+extern
+int be_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a big-endian int_4
+*/
+
+extern
+int be_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a big-endian int_1
+*/
+
+extern
+int be_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a big-endian int_2
+*/
+
+extern
+int be_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a big-endian int_4
+*/
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+extern
+int le_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a little-endian int_1
+*/
+
+extern
+int le_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a little-endian int_2
+*/
+
+extern
+int le_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a little-endian int_4
+*/
+
+extern
+int le_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a little-endian int_1
+*/
+
+extern
+int le_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a little-endian int_2
+*/
+
+extern
+int le_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a little-endian int_4
+*/
+
+
+#endif /* _mach_io_h */
diff --git a/src/frog/makefile-alpha b/src/frog/makefile-alpha
new file mode 100644
index 0000000..3d8bef9
--- /dev/null
+++ b/src/frog/makefile-alpha
@@ -0,0 +1,35 @@
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact
+LD = cc
+LDFLAGS = -g -non_shared
+LIBFLAGS =
+
+SRC = .
+PROGS = frog toad
+all: $(PROGS)
+
+FROGOBJ =\
+	frog.o\
+	mach-io.o
+
+frog : $(FROGOBJ)
+	$(LD) $(LDFLAGS) -o frog $(FROGOBJ) $(LIBFLAGS)
+
+
+TOADOBJ =\
+	toad.o\
+	scfIO.o\
+	mach-io.o
+
+toad : $(TOADOBJ)
+	$(LD) $(LDFLAGS) -o toad $(TOADOBJ) $(LIBFLAGS)
+
+
+#
+# Specials
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean:
+	/bin/rm -f $(PROGS) $(FROGOBJ)
diff --git a/src/frog/makefile-dec b/src/frog/makefile-dec
new file mode 100644
index 0000000..7a89354
--- /dev/null
+++ b/src/frog/makefile-dec
@@ -0,0 +1,32 @@
+CC = c89
+CFLAGS = -g -std -w0
+
+SRC = .
+PROGS = frog toad
+all: $(PROGS)
+
+FROGOBJ =\
+	frog.o\
+	mach-io.o
+
+frog : $(FROGOBJ)
+	$(CC) $(CFLAGS) -o frog $(FROGOBJ)
+
+
+TOADOBJ =\
+	toad.o\
+	scfIO.o\
+	mach-io.o
+
+toad : $(TOADOBJ)
+	$(CC) $(CFLAGS) -o toad $(TOADOBJ)
+
+
+#
+# Specials
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean:
+	/bin/rm -f $(PROGS) $(FROGOBJ)
diff --git a/src/frog/makefile-sgi b/src/frog/makefile-sgi
new file mode 100644
index 0000000..07a5b97
--- /dev/null
+++ b/src/frog/makefile-sgi
@@ -0,0 +1,36 @@
+
+SHELL=/bin/sh
+
+CC = cc
+CFLAGS = -g -ansi -prototypes
+LIBFLAGS =
+
+SRC = .
+PROGS = frog toad
+all: $(PROGS)
+
+FROGOBJ =\
+	frog.o\
+	mach-io.o
+
+frog : $(FROGOBJ)
+	$(CC) $(CFLAGS) -o frog $(FROGOBJ) $(LIBFLAGS)
+
+
+TOADOBJ =\
+	toad.o\
+	scfIO.o\
+	mach-io.o
+
+toad : $(TOADOBJ)
+	$(CC) $(CFLAGS) -o toad $(TOADOBJ) $(LIBFLAGS)
+
+
+#
+# Specials
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean:
+	/bin/rm -f $(PROGS) $(FROGOBJ)
diff --git a/src/frog/makefile-solaris b/src/frog/makefile-solaris
new file mode 100644
index 0000000..fc6bfea
--- /dev/null
+++ b/src/frog/makefile-solaris
@@ -0,0 +1,36 @@
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+SRC = .
+PROGS = frog toad
+all: $(PROGS)
+
+FROGOBJ =\
+	frog.o\
+	mach-io.o
+
+frog : $(FROGOBJ)
+	$(CC) $(CFLAGS) -o frog $(FROGOBJ) $(LIBFLAGS)
+
+
+TOADOBJ =\
+	toad.o\
+	scfIO.o\
+	mach-io.o
+
+toad : $(TOADOBJ)
+	$(CC) $(CFLAGS) -o toad $(TOADOBJ) $(LIBFLAGS)
+
+
+#
+# Specials
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean:
+	/bin/rm -f $(PROGS) $(FROGOBJ)
diff --git a/src/frog/makefile-sun b/src/frog/makefile-sun
new file mode 100644
index 0000000..fc6bfea
--- /dev/null
+++ b/src/frog/makefile-sun
@@ -0,0 +1,36 @@
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+SRC = .
+PROGS = frog toad
+all: $(PROGS)
+
+FROGOBJ =\
+	frog.o\
+	mach-io.o
+
+frog : $(FROGOBJ)
+	$(CC) $(CFLAGS) -o frog $(FROGOBJ) $(LIBFLAGS)
+
+
+TOADOBJ =\
+	toad.o\
+	scfIO.o\
+	mach-io.o
+
+toad : $(TOADOBJ)
+	$(CC) $(CFLAGS) -o toad $(TOADOBJ) $(LIBFLAGS)
+
+
+#
+# Specials
+#
+%.o:$(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+clean:
+	/bin/rm -f $(PROGS) $(FROGOBJ)
diff --git a/src/frog/scfIO.c b/src/frog/scfIO.c
new file mode 100644
index 0000000..5470e09
--- /dev/null
+++ b/src/frog/scfIO.c
@@ -0,0 +1,131 @@
+/*
+ * File: scfIO.c
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *         Hills Road
+ *         Cambridge CB2 2QH
+ *         United Kingdom
+ *
+ * Description: IO routines for reading and writing SCF files
+ *
+ * Created: 8 October 1992 (from seqIOSCF.c)
+ * Updated:
+ *
+ */
+
+#include <stdio.h>
+#include <sys/types.h>
+#include "mach-io.h"
+#include "scfIO.h"
+
+#define False 0
+#define True  1
+
+int read_scf_header(FILE *fp, Header *h)
+{
+    if (be_read_int_4(fp,&h->magic_number)==False) return (False);
+    if (be_read_int_4(fp,&h->samples)==False) return (False);
+    if (be_read_int_4(fp,&h->samples_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->bases)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_left_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_right_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_size)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_offset)==False) return (False);
+
+    return (True);
+}
+
+
+int read_scf_sample(FILE *fp, Samples *s)
+{
+    if (be_read_int_1(fp,&s->sample_A)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_C)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_G)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_T)==False) return (False);
+
+    return (True);
+}
+
+
+
+int read_scf_base(FILE *fp, Bases *b)
+{
+    if (be_read_int_4(fp,&b->peak_index)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_A)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_C)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_G)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_T)==False) return (False);
+    if (be_read_int_1(fp,(uint_1 *)&b->base)==False) return (False);
+    if (be_read_int_1(fp,&b->spare[0])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[1])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[2])==False) return (False);
+
+    return (True);
+}
+
+
+
+
+
+
+
+
+int write_scf_header(FILE *fp, Header *h)
+{
+    int i;
+
+    if (be_write_int_4(fp,&h->magic_number)==False) return (False);
+    if (be_write_int_4(fp,&h->samples)==False) return (False);
+    if (be_write_int_4(fp,&h->samples_offset)==False) return (False);
+    if (be_write_int_4(fp,&h->bases)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_left_clip)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_right_clip)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_offset)==False) return (False);
+    if (be_write_int_4(fp,&h->comments_size)==False) return (False);
+    if (be_write_int_4(fp,&h->comments_offset)==False) return (False);
+    for (i=0;i<23;i++)
+	if (be_write_int_4(fp,&h->spare[i])==False) return (False);
+
+    return (True);
+}
+
+
+int write_scf_sample(FILE *fp, Samples *s)
+{
+    if (be_write_int_1(fp,&s->sample_A)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_C)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_G)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_T)==False) return (False);
+
+    return (True);
+}
+
+
+
+
+
+int write_scf_base(FILE *fp, Bases *b)
+{
+    if (be_write_int_4(fp,&b->peak_index)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_A)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_C)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_G)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_T)==False) return (False);
+    if (be_write_int_1(fp,(uint_1 *)&b->base)==False) return (False);
+    if (be_write_int_1(fp,&b->spare[0])==False) return (False);
+    if (be_write_int_1(fp,&b->spare[1])==False) return (False);
+    if (be_write_int_1(fp,&b->spare[2])==False) return (False);
+
+    return (True);
+}
+
+
+
+int write_scf_comment(FILE *fp, Comments *c, size_t l)
+{
+    if (fwrite(c, l, 1, fp) !=1) return (False);
+    return (True);
+
+}
diff --git a/src/frog/scfIO.h b/src/frog/scfIO.h
new file mode 100644
index 0000000..225301d
--- /dev/null
+++ b/src/frog/scfIO.h
@@ -0,0 +1,81 @@
+/*
+ * File: scfIO.h
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *         Hills Road
+ *         Cambridge CB2 2QH
+ *         United Kingdom
+ *
+ * Description: header file for SCF file IO
+ *
+ * Created: 8 October 1992 (from seqIOSCF.c)
+ * Updated:
+ *
+ */
+
+#ifndef _SCFIO_H_
+#define _SCFIO_H_
+
+#include "mach-io.h"
+
+/* ---- Constants ---- */
+#define SCF_MAGIC (((((uint_4)'.'<<8)+(uint_4)'s'<<8)+(uint_4)'c'<<8)+(uint_4)'f')
+
+/*
+** Type definition for the Header structure
+*/
+typedef struct {
+	uint_4 magic_number;       /* Always ((('.'<<8)+'s'<<8)+'c'<<8)+'f'; */
+	uint_4 samples;            /* Number of elements in Samples matrix */
+	uint_4 samples_offset;     /* Byte offset from start of file */
+	uint_4 bases;              /* Number of bases in Bases matrix */
+	uint_4 bases_left_clip;    /* Number of bases in left clip (vector)*/
+	uint_4 bases_right_clip;   /* Number of bases in right clip (unreliable) */
+	uint_4 bases_offset;       /* Byte offset from start of file */
+	uint_4 comments_size;      /* Number of bytes in Comment section */
+	uint_4 comments_offset;    /* Byte offset from start of file */
+	uint_4 spare[23];          /* Unused */
+} Header;
+
+/*
+** Type definition for the Sample data
+*/
+typedef struct {
+	uint_1 sample_A;           /* Sample for A trace */
+	uint_1 sample_C;           /* Sample for C trace */
+	uint_1 sample_G;           /* Sample for G trace */
+	uint_1 sample_T;           /* Sample for T trace */
+} Samples;
+
+/*
+** Type definition for the sequence data
+*/
+typedef struct {
+	uint_4 peak_index;        /* Index into Samples matrix for base position */
+	uint_1 prob_A;            /* Probability of it being an A */
+	uint_1 prob_C;            /* Probability of it being an C */
+	uint_1 prob_G;            /* Probability of it being an G */
+	uint_1 prob_T;            /* Probability of it being an T */
+	uint_1 base;		  /* Base called */
+	uint_1 spare[3];          /* Spare */
+} Bases;
+
+
+/*
+** Type definition for the comments
+*/
+typedef char Comments;            /* Zero terminated list of \n separated entries */
+
+
+
+
+extern int read_scf_header(FILE *fp, Header *h);
+extern int read_scf_sample(FILE *fp, Samples *s);
+extern int read_scf_base(FILE *fp, Bases *b);
+extern int write_scf_header(FILE *fp, Header *h);
+extern int write_scf_sample(FILE *fp, Samples *s);
+extern int write_scf_base(FILE *fp, Bases *b);
+extern int write_scf_comment(FILE *fp, Comments *c, size_t l);
+
+#endif /*_SCFIO_H_*/
diff --git a/src/frog/toad.c b/src/frog/toad.c
new file mode 100644
index 0000000..0a5350e
--- /dev/null
+++ b/src/frog/toad.c
@@ -0,0 +1,202 @@
+/*
+ * File: toad.c
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *         Hills Road
+ *         Cambridge CB2 2QH
+ *         United Kingdom
+ *
+ * Description: utility to alter base lane ordering of an SCF file
+ *
+ * Created: 8 October 1992
+ * Updated:
+ *
+ */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+#include <sys/types.h>
+#include "scfIO.h"
+#include "mach-io.h"
+
+#define ind(base) ((base)=='C'?0:(base)=='A'?1:(base)=='G'?2:3)
+
+int fix_scf(char *fn,char *old, char *new)
+{
+    FILE *fp;
+    Header header;
+    Samples *samples;
+    Bases *bases;
+    int i;
+    char map[4];
+
+    if ( (fp=fopen(fn,"r+b")) == NULL ) {
+        fprintf(stderr,"toad: %s: error opening file\n",fn);
+	return 1;
+    }
+
+    fseek(fp,0,0);
+    if (read_scf_header(fp,&header)==0) {
+        fprintf(stderr,"toad: %s: error reading header\n",fn);
+        fclose(fp);
+        return 2;
+    }
+
+    /* swap samples */
+    samples = (Samples *)malloc(sizeof(Samples)*header.samples);
+    if (samples==NULL) {
+        fprintf(stderr,"toad: %s: out of memory\n",fn);
+        fclose(fp);
+        return 2;
+    }
+    fseek(fp,header.samples_offset,0);
+    for(i=0;i<header.samples;i++) {
+	if (read_scf_sample(fp,&samples[i])==0) {
+            fprintf(stderr,"toad: %s: error reading sample %d\n",fn,i);
+            fclose(fp);
+            return 2;
+        }
+    }
+    for(i=0;i<header.samples;i++) {
+	uint_1 smap[4];
+	uint_1 osmap[4];
+	osmap[ind('A')] = samples[i].sample_A;
+	osmap[ind('C')] = samples[i].sample_C;
+	osmap[ind('G')] = samples[i].sample_G;
+	osmap[ind('T')] = samples[i].sample_T;
+	smap[ind(new[0])] = osmap[ind(old[0])];
+	smap[ind(new[1])] = osmap[ind(old[1])];
+	smap[ind(new[2])] = osmap[ind(old[2])];
+	smap[ind(new[3])] = osmap[ind(old[3])];
+	samples[i].sample_A = smap[ind('A')];
+	samples[i].sample_C = smap[ind('C')];
+	samples[i].sample_G = smap[ind('G')];
+	samples[i].sample_T = smap[ind('T')];
+    }
+    fseek(fp,header.samples_offset,0);
+    for(i=0;i<header.samples;i++) {
+	if (write_scf_sample(fp,&samples[i])==0) {
+            fprintf(stderr,"toad: %s: error writing sample %d\n",fn,i);
+            fclose(fp);
+            return 2;
+        }
+    }
+    free(samples);
+
+
+    /* swap bases */
+    bases = (Bases *)malloc(sizeof(Bases)*header.bases);
+
+    if (bases==NULL) {
+        fprintf(stderr,"toad: %s: out of memory\n",fn);
+        fclose(fp);
+        return 2;
+    }
+    fseek(fp,header.bases_offset,0);
+    for(i=0;i<header.bases;i++) {
+	if (read_scf_base(fp,&bases[i])==0) {
+            fprintf(stderr,"toad: %s: error reading bases %d\n",fn,i);
+            fclose(fp);
+            return 2;
+        }
+    }
+    map[ind(old[0])] = new[0];
+    map[ind(old[1])] = new[1];
+    map[ind(old[2])] = new[2];
+    map[ind(old[3])] = new[3];
+    for(i=0;i<header.bases;i++) {
+	uint_1 smap[4];
+	uint_1 osmap[4];
+	osmap[ind('A')] = bases[i].prob_A;
+	osmap[ind('C')] = bases[i].prob_C;
+	osmap[ind('G')] = bases[i].prob_G;
+	osmap[ind('T')] = bases[i].prob_T;
+	smap[ind(new[0])] = osmap[ind(old[0])];
+	smap[ind(new[1])] = osmap[ind(old[1])];
+	smap[ind(new[2])] = osmap[ind(old[2])];
+	smap[ind(new[3])] = osmap[ind(old[3])];
+	bases[i].prob_A = smap[ind('A')];
+	bases[i].prob_C = smap[ind('C')];
+	bases[i].prob_G = smap[ind('G')];
+	bases[i].prob_T = smap[ind('T')];
+	if (strchr("ACGT",bases[i].base)) bases[i].base = map[ind(bases[i].base)];
+    }
+    fseek(fp,header.bases_offset,0);
+    for(i=0;i<header.bases;i++) {
+	if (write_scf_base(fp,&bases[i])==0) {
+            fprintf(stderr,"toad: %s: error writing base %d\n",fn,i);
+            fclose(fp);
+            return 2;
+        }
+    }
+    free(bases);
+
+
+
+
+
+
+
+    fclose(fp);
+    return 0;
+}
+
+
+void usage()
+{
+    fprintf(stderr,"Usage: toad old new [SCF_files...]\n\n");
+    fprintf(stderr,"examples\n");
+    fprintf(stderr,"To swap all Gs with Ts and Ts with Gs\n    toad CAGT CATG trace.scf\n");
+    fprintf(stderr,"To swap all As with Cs, Cs with Gs and Gs with As\n    toad ACGT CGAT trace.scf\n");
+}
+
+int check(char *bases)
+{
+    int i;
+    int map[4];
+    if (strlen(bases)!=4) {
+	fprintf(stderr,"toad: should be four bases %s\n",bases);
+	return 1;
+    }
+
+    for(i=0;i<4;i++) map[i]=0;
+    for(i=0;i<4;i++) {
+	if (islower(bases[i])) bases[i]=toupper(bases[i]);
+        if(strchr("ACGT",bases[i])==NULL) {
+	    fprintf(stderr,"toad: invalid base '%c' in bases \"%s\"\n",bases[i],bases);
+	    return 1;
+	}
+	map[ind(bases[i])] = 1;
+    }
+    for(i=0;i<4;i++) {
+	if (map[i]==0) {
+	    fprintf(stderr,"toad: repeated bases in \"%s\"\n",bases);
+	    return 1;
+        }
+    }
+
+    return 0;
+}
+
+int main(int argc, char **argv)
+{
+    int i;
+
+/*
+    fprintf(stderr,"Eeek! Not finished yet.\n");
+    return 1;
+*/
+
+    if (argc<3) {usage(); exit(1);}
+    if (check(argv[1])) exit(1);
+    if (check(argv[2])) exit(1);
+	
+    for(i=3;i<argc;i++) {
+	int err;
+	if (err=fix_scf(argv[i],argv[1],argv[2])) printf("Couldn't read %s (%d)\n",argv[i],err);
+    }
+    return 0;
+}
diff --git a/src/getMCH/README b/src/getMCH/README
new file mode 100644
index 0000000..60db9fd
--- /dev/null
+++ b/src/getMCH/README
@@ -0,0 +1,53 @@
+		Extracting Sequences from Trace Files
+			      Simon Dear
+			(sd@mrc-lmb.cam.ac.uk)
+			   19 November 1992
+
+
+To obtain the sequence from a trace file, the program 'trace2seq' can
+be used.  It can operate on several trace file sources: ABI 373A
+sample files, Pharmacia A.L.F. traces (created by 'alfsplit') and SCF
+traces (created by 'makeSCF' or from other sources).  The sequence is
+stored in a file. A header, required for assembly into the sequence
+assembly programs xdap and xbap, may optionally be produced.
+
+Usage is as follows:
+
+	trace2seq (-ABI | -ALF | -SCF) {tracefilename}
+		  -output {outputfilename}
+		  [-header (0|1)] [-raw filename]
+
+By default, a header is not produced. To be included, the -header
+option with argument 1, or the -raw option should be used. The -raw
+option allows the trace file name, that forms part of the header, to
+be explicitly specified. By default the trace file name in the header
+is that from from which the sequence came.
+
+An example:
+
+To output the sequence from the ABI 373A trace 'trace.abi', with the
+inclusion of a header, into a file called 'abi.seq':
+
+	trace2seq -ABI trace.abi -output abi.seq -header 1
+
+
+After the operation, 'abi.seq' will contain:
+
+	;   827     0   827ABI trace.abi
+	AG--TGCATGCCTGCAGGTCGACTCTAGAGG-TCCCGGGGTACCGAGCTC
+	GAATTC-TAATCATGGTCATAGCTGT-TCCTGTGTGAAATT-TTATCCGC
+	TCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGG
+	GGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCC
+	CGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCC
+	AACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCCAGGGTGGTTTTA-T
+	TTTCACCAGTGAGACGGGCAACAGCTGATTGCCCTTCACCGCCTGGCCCT
+	GAGAGGAGTTGCAGCAAGCGGTCCACGCT-GTT-TGCCCCAGCAGGCG-A
+	AATCCTGTTTTGATGGTTGGTTCC-GAAATCGGCAAAAATCCCTTATAAA
+	TTC-AAAGGATAGG-CCGAG--TGGGG-T-AGTTTTT-TTTCCGG-TTTG
+	AACAAGT-GTTCCCCTGTT-AAAGGA-CGTGGG--TC-AACCGTT-AAA-
+	GGG-G--AA--CCGTTTT-TTTATGGG--A--GGC---TT-C--GTTCCG
+	TTA-CCTG-TT-AA-GTTTTTTTGGGGTT-TGG-G-CGTAAG-GCT-AAT
+	-GGACC-TAATGG-GGCCC-GATTTTGGGTT-TGGGGGAAGCGGA--GAC
+	GG-GG-TG-GTGGGTGTAT-GGA-GTC--G-TTAGG-TGGGAAT--T-GG
+	TG--T-GG--ATCG-TT-G--T--GGTTTTT-GGT-T-TAGGGGTGGTTT
+	--GTGTTTTTT-GTGT-TTG--TT--T
diff --git a/src/getMCH/getMCH.c b/src/getMCH/getMCH.c
new file mode 100644
index 0000000..4df494d
--- /dev/null
+++ b/src/getMCH/getMCH.c
@@ -0,0 +1,178 @@
+/*
+    Title: 	 getMCH
+
+    File: 	 getMCH.c
+    Purpose:	 Main module
+    Last update: Thursday 11 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include <stdlib.h>
+#include "seqIOABI.h"
+#include "seqIOALF.h"
+#include "seqIOSCF.h"
+#include "seqIOPlain.h"
+#include "seqOutput.h"
+
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+#define trace_unknown 0
+#define trace_SCF     1
+#define trace_ALF     2
+#define trace_ABI     3
+#define trace_Plain   4
+
+typedef struct
+{
+    char * fn;
+    int type;
+    char * output;
+    char * raw;
+    int header;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage()
+{
+    fprintf(stderr, "usage: getMCH (-ABI | -ALF | -SCF) {tracefilename} -output {outputfilename} [-header (0|1)] [-raw filename]\n");
+}
+
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(int argc, char **argv)
+{
+    Arguments args;
+    Seq currSeq;
+    int in, out;
+    int i;
+
+    /* parse command line arguments here */
+
+    args.fn = NULL;
+    args.type = trace_unknown;
+    args.output = NULL;
+    args.raw = NULL;
+    args.header = 0;
+    in = out = 0;
+
+    for(argc--,argv++; argc>0 ; argc--,argv++) {
+
+	if (strcmp(*argv,"-ABI") == 0) {
+	    args.type = trace_ABI;
+	    args.fn = *++argv; argc--;
+	    in++;
+	} else if (strcmp(*argv,"-ALF") == 0) {
+	    args.type = trace_ALF;
+	    args.fn = *++argv;argc--;
+	    in++;
+	} else if (strcmp(*argv,"-SCF") == 0) {
+	    args.type = trace_SCF;
+	    args.fn = *++argv;argc--;
+	    in++;
+	} else if (strcmp(*argv,"-output") == 0) {
+	    args.output = *++argv; argc--;
+	    out++;
+	} else if (strcmp(*argv,"-raw") == 0) {
+	    args.header = 1;
+	    args.raw = *++argv; argc--;
+	} else if (strcmp(*argv,"-header") == 0) {
+	    args.header = atoi(*++argv); argc--;
+	} else {
+            usage();
+	    fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+
+    }
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if (in>1 || out>1) {
+	usage();
+	fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+    if ( args.output==NULL ) {
+	usage();
+	fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+
+    if ( args.fn==NULL ) {
+	usage();
+	fprintf(stderr,"no input specified\n");
+	exit(1);
+    }
+
+
+
+    switch(args.type) {
+    case trace_ABI:
+	currSeq = readSeqABI(args.fn); break;
+    case trace_ALF:
+	currSeq = readSeqALF(args.fn); break;
+    case trace_SCF:
+	currSeq = readSeqSCF(args.fn); break;
+    default:
+	usage();
+	fprintf(stderr,"Unknown trace file format\n");
+	exit(1);
+    }
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",args.fn);
+	exit(1);
+    }
+
+    /* All ed bases are the same are orig bases */
+    for (i=0;i<currSeq->NorigBases;i++) currSeq->edits[i] = i;
+    
+    /* The table of edits has all its entries free */
+    for (i=0;i<MaxEdits;i++) currSeq->edBasePos[i] = NULLPoint;
+
+    if ( ! writeSeq(currSeq, args.output, (args.raw == NULL)?args.fn:args.raw,args.header) ) {
+	fprintf(stderr,"Error writing file %s\n",args.output);
+	exit(1);
+    }
+
+    exit (0);
+
+}
+
diff --git a/src/getMCH/getMCHfof b/src/getMCH/getMCHfof
new file mode 100644
index 0000000..6644a0c
--- /dev/null
+++ b/src/getMCH/getMCHfof
@@ -0,0 +1,47 @@
+#! /bin/csh -f
+#
+# getMCHfof
+#    Run all the files in a file of file names through "getMCH -SCF"
+#
+#    This program was written for Bill Rawlinson, and is intended for
+#    his use only.
+#
+#    Usage: getMCHfof file_of_file_names
+#
+
+if ( $#argv != 1 ) then
+	echo "Usage: getMCHfof file_of_file_names"
+	exit 2
+endif
+
+if (! -e $1 ) then
+	echo "No such file of file names"
+	exit 2
+endif
+
+echo "Processing a file of file names through getMCH"
+echo ""
+
+# New fofn
+set seq_fofn = ${1:r}.fofn
+if (-e $seq_fofn) /bin/mv $seq_fofn ${seq_fofn}.$$
+touch $seq_fofn
+
+echo "Processing:"
+foreach x ( `cat $1` )
+	# This file of file name contains the sequence file names
+	# Determine chromatogram file
+	echo -n "   " $x
+	set trace = ${x}.scf
+	if (-e $x) /bin/mv $x ${x}.$$
+	getMCH -SCF $trace -output $x -header 1
+	if ($status != 0) then
+		echo " - error in getMCH\!"
+	else
+		echo " - done"
+		echo $x >> $seq_fofn
+	endif
+end
+
+echo
+echo "File of file names created:" $seq_fofn
diff --git a/src/getMCH/makefile-alpha b/src/getMCH/makefile-alpha
new file mode 100644
index 0000000..c8dcb14
--- /dev/null
+++ b/src/getMCH/makefile-alpha
@@ -0,0 +1,86 @@
+#
+# Makefile for getMCH - Alpha version
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Programs made by this makefile
+#
+PROGS = getMCH trace2seq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g -std1 -ieee_with_inexact $(SWITCHES) $(INCLUDES)
+LD= cc
+LDFLAGS= -g -non_shared
+LIBFLAGS =
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+trace2seq getMCH: $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/getMCH/makefile-dec b/src/getMCH/makefile-dec
new file mode 100644
index 0000000..532eca5
--- /dev/null
+++ b/src/getMCH/makefile-dec
@@ -0,0 +1,83 @@
+#
+# Makefile for getMCH - DEC Ultrix (mips) fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Programs made by this makefile
+#
+PROGS = getMCH trace2seq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I/usr/include/mit -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= c89
+CFLAGS= -g -std -w0 $(SWITCHES) $(INCLUDES)
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+trace2seq getMCH: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/getMCH/makefile-sgi b/src/getMCH/makefile-sgi
new file mode 100644
index 0000000..fa1691a
--- /dev/null
+++ b/src/getMCH/makefile-sgi
@@ -0,0 +1,86 @@
+#
+# Makefile for getMCH - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Programs made by this makefile
+#
+PROGS = getMCH trace2seq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= cc
+CFLAGS= -g -ansi -prototypes $(SWITCHES) $(INCLUDES)
+LIBFLAGS =
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+trace2seq getMCH: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/getMCH/makefile-solaris b/src/getMCH/makefile-solaris
new file mode 100644
index 0000000..3f0f05d
--- /dev/null
+++ b/src/getMCH/makefile-solaris
@@ -0,0 +1,87 @@
+#
+# Makefile for getMCH - SunOS fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Programs made by this makefile
+#
+PROGS = getMCH trace2seq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I$(SRC) -I$(TEDSRC) -I/usr/openwin/include
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(SWITCHES) $(INCLUDES)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+trace2seq getMCH: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/getMCH/makefile-sun b/src/getMCH/makefile-sun
new file mode 100644
index 0000000..ccf0d6e
--- /dev/null
+++ b/src/getMCH/makefile-sun
@@ -0,0 +1,87 @@
+#
+# Makefile for getMCH - SunOS fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Programs made by this makefile
+#
+PROGS = getMCH trace2seq
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I$(SRC) -I$(TEDSRC)
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(SWITCHES) $(INCLUDES)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+
+#
+# Object files
+#
+OBJS=\
+	getMCH.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+trace2seq getMCH: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : %.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f $(PROGS) $(OBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/getMCH/match.c b/src/getMCH/match.c
new file mode 100644
index 0000000..b8da367
--- /dev/null
+++ b/src/getMCH/match.c
@@ -0,0 +1,62 @@
+int string_match(seq1, n1, seq2, n2, nmiss, indices)
+
+/*
+  This function may be called once from initialDisplayedSeq if
+  the user has input a string they wish to search for in the
+  input sequence -- this subroutine returns *indices, the first
+  position at which the query sequence (app_resources.astring)
+  matched the input file sequence (currSeq). */
+
+/* a modified version of match.c 
+  finds alignments between a search sequence, seq1, and a target sequence, 
+  seq2, with no gaps (except possibly at ends) 
+   and at most nmiss mismatches (relative to seq1),
+   n1 is the length of seq1 where seq1 is assumed to start at indices 0,
+   n2 is the length of seq2 where seq2 is assumed to start at indices 0 */
+      char *seq1, *seq2;
+      int n1, n2, nmiss;
+      int *indices;
+{
+    int i, j, d, istart, iend, i_miss, n_match;
+    int mtable[100][5];  /* 100 would be, now the total number
+			    of matches the program may find between
+			    a given input string and the sequence */
+
+
+/* d = j - i is the "offset" between the two sequences */
+    if (n1 - n2 > nmiss) return (0);
+    n_match = 0;
+    for (d = -nmiss; d <= n2 + nmiss - n1; d++){
+	if (d < 0)  istart = i_miss = -d;
+	else istart = i_miss = 0;
+	if (d > n2 - n1) {
+	    iend = n2 - d;
+	    i_miss += n1 + d - n2;
+	}
+	else iend = n1;
+	for (i = istart, j = d + i; i < iend; i++, j++)
+	    if (seq1[i] != seq2[j] && ++i_miss > nmiss) goto nextd; 
+        mtable[n_match][0] = d + istart; /* indices (assuming they start at
+					    0) of starting nucleotide in the
+					    searched sequence */
+        mtable[n_match][1] = istart; /* nucleotide position in the query
+					sequence where match starts (assuming
+					query index starts with 0) */
+        mtable[n_match][2] = iend - istart; /* number of nucleotides in the
+					       match */
+	if (mtable[n_match][2]==n1) {
+	  indices[n_match] = mtable[n_match][0];
+	}
+	  
+        mtable[n_match][3] = i_miss; /* number of mismatches */
+	n_match++; /* number of matches */
+	if (n_match >= 100) return(n_match);
+
+    nextd:;
+    }
+
+    return (n_match);
+/*    return(0); /* return 0 if it got to this point; that means
+		 it must have not found an exact match, so 
+		 you want the baseNum to be 0 in that case*/
+}
diff --git a/src/getMCH/match.h b/src/getMCH/match.h
new file mode 100644
index 0000000..fc6b802
--- /dev/null
+++ b/src/getMCH/match.h
@@ -0,0 +1,19 @@
+#ifndef _match_h
+#define _match_h
+
+/* 
+    Title:       match
+
+    File: 	 match.h
+    Purpose:	 match
+    Last update: Thu Jan 03 1991
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+extern int string_match();
+
+#endif  _match_h
diff --git a/src/indexseqlibs/CHANGES b/src/indexseqlibs/CHANGES
new file mode 100644
index 0000000..af57dbd
--- /dev/null
+++ b/src/indexseqlibs/CHANGES
@@ -0,0 +1,153 @@
+Wed Feb 17 11:30:28 GMT 1993
+----------------------------
+freetext.c
+  PIR 35.0. Changes to format
+	One field identifier has changed in the PIR-International
+	databases. All "#Title" tags for submitted citations have been
+	converted to the new tag "#Description" which will not be
+	standardized. This information may be considered free text.
+  Changed code to reflect this.
+
+access4.c
+  The record size stored in acnum.hit header was 18. It should be
+  4.
+
+piraccession.script
+emblaccession.script
+genbaccession.script
+  The name of the accession number index files are now acnum.hit and
+  acnum.trg.
+
+
+Thu Jan 21 15:32:26 GMT 1993
+----------------------------
+genbentryname1.c
+pirentryname1.c
+  These programs now give the offset of the FIRST base in the
+  sequence. The entryname index previously being created was not
+  in accordance with the standard specification. This change
+  corresponds to changes to programs in the Staden package,
+  which are included in release 1993.0 of the package.
+
+
+Thu Jan 21 15:29:56 GMT 1993
+----------------------------
+genbentryname1.c
+  The sequence offsets created in the entryname index were
+  calculated wrongly. With the use with the Staden package
+  it caused the first line of the entry to be omitted.
+
+
+genbaccession.script
+genbauthor.script
+genbdivision.script
+genbentryname.script
+genbfreetext.script
+genbtitle.script
+  Genbank has 13 divisions
+
+division.c
+genbdivision.script
+pirdivision.script
+  Routines and scripts to create division lookup files.
+
+
+Thu Jul 16 17:27:43 BST 1992
+----------------------------
+freetext.c
+  Look for words in "OG" (EMBL/SWISSPROT) and "GN" (SWISSPROT)
+  lines.
+
+
+Tue Jun 16 16:56:09 BST 1992
+----------------------------
+
+freetext4.c
+hitNtrg.c
+  Creation of author and freetext indexes was in error. Each
+  occurrance of author/word in the final sorted list was being written
+  to the target file, rather than just once as it should have been.
+  This bug did not affect the functionality but only the performance
+  of the Staden programs that use the indexes.
+
+
+
+Wed May 20 10:43:56 BST 1992
+----------------------------
+
+title2.c
+entryname2.c
+  In the embl updates it is possible that an entry appears more
+  than once. These programs have been modified so that they ignore
+  all but the first occurrence of the entry name, so that the brief
+  and entryname index have the correct number of entries. This is
+  not a clean solution, as words, authors, and accession numbers
+  for the more recent entry won't appear in the annotation of the
+  entry.
+
+
+
+Wed May 13 17:22:09 BST 1992
+----------------------------
+
+author.c
+hitNtrg.c
+emblauthor.script
+pirauthor.script
+genbauthor.script
+swissauthor.script
+  Programs and scripts to create the new author indexes have been
+  written. They are based closely on the freetext index. The program
+  hitNtrg.c is almost identical to freetext4.c but takes the string
+  length to be written to the target file from the command line.
+  It is possible to write the accession number creation routines
+  in the same fashion.
+
+
+
+Wed Apr  1 16:33:11 BST 1992
+----------------------------
+
+freetext4.c Version 1.1
+  Words that were longer than target file field width were not being
+  truncated, thus corrupting the index. Fixed.
+
+
+embltitle1.c Version 1.1
+pirtitle1.c Version 1.1
+pirtitle2.c Version 1.1
+genbtitle1.c Version 1.1
+  From some sources, the sequence libraries end each line with a
+  carriage return followed by a new line character. The programs
+  were changed to filter out non-printable characters in the title
+  lines.
+
+Wed Apr  1 18:48:12 BST 1992
+----------------------------
+
+genbaccession.script Version 1.1
+piraccession.script Version 1.1
+  The second sort in these scripts was in error, causing the file
+  access.sorted2 to in fact no be sorted on accession number.  The
+  command "${SORT} +1 +0..." should have been "${SORT} -b +1...".
+
+
+Wed Apr 22 1992
+---------------
+
+freetext.c Version 1.1
+  The line offset for PIR should be 16 not 15. This would only affect
+  libraries where the 10th character of the entry name is significant
+  and excluding it would result in a different sort order.
+
+author.c Version 1.0
+  A new program for extracting author names from sequence libraries.
+  We have yet to see the EMBL CR-ROM author indexes, so this program
+  may change. No scripts written yet. Subsequence processing of output
+  file will include:
+	1) Sorting on entry name, removing duplicate entry-name/author
+	entries. (sort -u ...)
+	2) Assigning entry numbers, using freetext2.c
+	3) Sorting on author name. (sort -b +1 ...)
+	4) Creation of indexes with program similar to freetext4 (differing
+	only by the fact that the target string will be a different size.)
diff --git a/src/indexseqlibs/README b/src/indexseqlibs/README
new file mode 100644
index 0000000..310e4ab
--- /dev/null
+++ b/src/indexseqlibs/README
@@ -0,0 +1,112 @@
+Notes on indexing the sequence libraries
+========================================
+
+We handle EMBL, SwissProt, GenBank, PIR in codata form, NRL3D.
+
+Currently we produce entryname index, accession number index, brief
+index (brief index contains the entry name the primary accession number
+the sequence length and an 80 character description) and a freetext
+index for all but nrl3d (only entryname and brief).
+
+
+Naturally the libraries are all in different formats.
+
+To produce any of the indexes requires the creation of several intermediate
+files and the indexing programs are written so that the intermediate files
+are the same for all libraries. This means that only the programs that read
+the distributed form of each library need to be unique to that library, and
+all the other processing programs can be used for all libraries.
+
+
+However even the though the indexes have the same format, programs (like nip)
+that read the libraries need to treat each library separately because their
+actual contents are written differently.
+
+
+With the exception of the freetext index creation script all the
+procedures run quite quickly.
+
+Making the entry name index
+---------------------------
+
+Common program entryname2
+
+EMBL		emblentryname1
+SwissProt	emblentryname1
+
+GenBank		genbentryname1
+
+PIR		pirentryname1
+NRL3D		pirentryname1
+
+
+Making the accession number index
+---------------------------------
+
+Common programs access2 access4
+
+EMBL		emblaccess1
+SwissProt	emblaccess1
+
+GenBank		genbaccess1
+
+PIR		piraccess1 piraccess2 
+NRL3D		No accession numbers
+
+Making the brief index
+----------------------
+
+Common program title2
+
+EMBL		embltitle1
+SwissProt	embltitle1
+
+GenBank		genbtitle1
+
+PIR		pirtitle1 pirtitle2 (pir3 has no accession numbers)
+NRL3D		pirtitle2
+
+Making the freetext index
+-------------------------
+
+Common programs freetext2 freetext4
+
+EMBL		emblfreetext
+SwissProt	emblfreetext
+
+GenBank		genbfreetext
+
+PIR		pirfreetext
+NRL3D		not done
+
+Note the file stopwords is required.
+
+Scripts
+-------
+
+emblentryname.script
+emblaccession.script
+embltitle.script
+emblfreetext.script
+
+swissentryname.script
+swissaccession.script
+swisstitle.script
+emblfreetext.script
+
+
+genbentrynamescript
+genbaccession.script
+genbtitle.script
+genbfreetext.script
+
+pirentryname.script
+piraccession.script
+pirtitle.script
+pirfreetext.script
+
+makenrl3d.script
+
+
+
+
diff --git a/src/indexseqlibs/access2.c b/src/indexseqlibs/access2.c
new file mode 100644
index 0000000..3c410ff
--- /dev/null
+++ b/src/indexseqlibs/access2.c
@@ -0,0 +1,55 @@
+/*             access2             */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define MAXLINE 128
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    long entries = 0;
+    char last_entry_name[11];
+    char line[MAXLINE];
+
+    printf("access2 Version 1.0\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: access2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"access2: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"access2: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+
+    last_entry_name[0] = '\0';
+
+    while (fgets(line,MAXLINE,ifp)!=NULL) {
+	/*
+	  format of input line:
+	        %10s %10s\n     (entry name, accession number)
+	*/
+
+	if (strncmp(last_entry_name,line,10)!=0) {
+	    entries++;
+       	    strncpy(last_entry_name,line,10);
+	}
+	fprintf(ofp, "%-10.10s %-10.10s %10ld\n",line,line+11,entries);
+    }
+
+    printf(" Number of entries = %ld\n\n",entries);
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
diff --git a/src/indexseqlibs/access4.c b/src/indexseqlibs/access4.c
new file mode 100644
index 0000000..5f02be1
--- /dev/null
+++ b/src/indexseqlibs/access4.c
@@ -0,0 +1,146 @@
+/*            access4            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+#include "mach-io.h"
+#include "cdromheader.h"
+
+#define MAXLINE 256
+
+/* program to read in an accession number index and write it out in binary
+   with a header */
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *tofp;
+    FILE *hofp;
+    char last_accession_number[10];
+    char line[MAXLINE];
+    uint_4 hits,hitStart,entryNumber;
+    uint_4 hitRecords,tgtRecords;
+    
+    uint_2 tRecSize = 18;
+    uint_2 hRecSize = 4;
+
+    printf("access4 Version 1.0\n");
+    
+    if (argc != 4) {
+	fprintf(stderr,"Usage: access4 filein targetfileout hitfileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"access4: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((tofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"access4: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    if ((hofp = fopen(argv[3],"w")) == NULL) {
+	fprintf(stderr,"access4: Cannot open output file %s\n",argv[3]);
+	exit(1);
+    }
+    
+    if(cdromheader(tofp, tRecSize ) != 0 ) {
+	fprintf(stderr,"access4: Error writing target file header\n");
+        exit(1);
+    }
+    if(cdromheader(hofp, hRecSize ) != 0 ) {
+	fprintf(stderr,"access4: Error writing hit file header\n");
+        exit(1);
+    }
+    
+    /* get the first line */
+    if (fgets(line,MAXLINE,ifp) == NULL) {
+	fprintf(stderr,"access4: Error reading input file\n");
+	exit(1);
+    }
+    tgtRecords=0;
+    hitRecords=0;
+
+    /* get information */
+    strncpy(last_accession_number,line+11,10);
+    hitStart = 1;
+    hits = 1;
+
+    /* for every record write the entryNumber to the hits file */
+    hitRecords++;
+    entryNumber = atoi(line+22);
+    if(le_write_int_4(hofp, &entryNumber) == 0) {
+	fprintf(stderr,"access4: Error writing hit file\n");
+	exit(1);
+    }
+    
+    /*
+      hitStart is the first record in the hit file for an accession number
+      hits is the number fo hits for this accession number
+      entryNumber is the record number in the entryname index for the current 
+      entryname
+      */
+    
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+
+	/* if current accession number is different - update files */
+	if (strncmp(last_accession_number,line+11,10) != 0) {
+	    if(le_write_int_4(tofp, &hits) == 0) {
+		fprintf(stderr,"access4: Error writing hit file\n");
+		exit(1);
+	    }
+	    if(le_write_int_4(tofp, &hitStart) == 0) {
+		fprintf(stderr,"access4: Error writing hit file\n");
+		exit(1);
+	    }
+	    fwrite(last_accession_number, sizeof(last_accession_number), 1, tofp);
+	    tgtRecords++;
+	    /* get information */
+	    strncpy(last_accession_number,line+11,10);
+	    hitStart = hitRecords+1;
+	    hits = 0;
+	}
+	hits++;
+
+	/* for every record write the entryNumber to the hits file */
+	hitRecords++;
+	entryNumber = atoi(line+22);
+	if(le_write_int_4(hofp, &entryNumber) == 0) {
+	    fprintf(stderr,"access4: Error writing hit file\n");
+	    exit(1);
+	}
+
+    }
+
+
+
+    /* write the last target record */
+    if(le_write_int_4(tofp, &hits) == 0) {
+	fprintf(stderr,"access4: Error writing to hit file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, &hitStart) == 0) {
+	fprintf(stderr,"access4: Error writing to hit file\n");
+	exit(1);
+    }
+    fwrite(last_accession_number, sizeof(last_accession_number), 1, tofp);
+    tgtRecords++;
+    
+    /* add number of records to header */
+    if (fseek(tofp, (off_t)4,0)) {
+	fprintf(stderr,"access4: Error seeking on target file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, &tgtRecords) == 0) {
+	fprintf(stderr,"access4: Error writing to target file\n");
+	exit(1);
+    }
+
+    printf(" Records in target file %d\n", tgtRecords);
+    printf(" Records in hit file %d\n\n", hitRecords);
+
+    fclose(ifp);
+    fclose(tofp);
+    fclose(hofp);
+    return 0;
+}
diff --git a/src/indexseqlibs/addnl.c b/src/indexseqlibs/addnl.c
new file mode 100644
index 0000000..756ac9d
--- /dev/null
+++ b/src/indexseqlibs/addnl.c
@@ -0,0 +1,46 @@
+/*          addnl      */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define LINE_LEN 80    /* fold after every LINE_LEN characters */
+
+/* program to put newlines into pir and other libraries */
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[LINE_LEN+1];
+    long entries = 0;
+    
+    printf("addnl Version 1.0\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: addnl filein fileout\n");
+	exit(2);
+    }
+    
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"addnl: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"addnl: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    while (fgets(line,sizeof(line),ifp)!=NULL) {
+	fputs(line,ofp);
+	if(line[strlen(line)-1]!='\n') fputc('\n',ofp);
+	entries++;
+    }
+
+    fclose(ofp);
+    fclose(ifp);
+
+    printf(" Number of entries = %ld\n\n",entries); 
+
+    return 0;
+}
+
diff --git a/src/indexseqlibs/author.c b/src/indexseqlibs/author.c
new file mode 100644
index 0000000..2f7857d
--- /dev/null
+++ b/src/indexseqlibs/author.c
@@ -0,0 +1,247 @@
+/*
+** author.c
+**
+**    Source for:
+**	pirauthor
+**	emblauthor
+**	genbauthor
+*/
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 100     /* maximum input line length*/
+
+#ifdef PIR
+char *entryPATTERN  = "ENTRY";
+char *authorPATTERN = "   #Authors";
+char *contPATTERN   = "           ";
+#endif /*PIR*/
+#ifdef EMBL
+char *entryPATTERN  = "ID";
+char *authorPATTERN = "RA";
+char *contPATTERN   = "RA";
+#endif /*EMBL*/
+#ifdef GENBANK
+char *entryPATTERN  = "LOCUS";
+char *authorPATTERN = "  AUTHORS";
+char *contPATTERN   = "         ";
+#endif /*GENBANK*/
+
+char entryName[10];
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+#define Number(A) ( sizeof(A) / sizeof((A)[0]))
+
+
+
+static void parse(FILE *ofp, char *line)
+/*
+** Pick out all Surnames
+*/
+{
+    char *name_start, *surname_end, *name_end, *initials_end;
+    char *end; /* end of entry */
+    if (line != NULL && *line) {
+	int was_space;
+	name_start = line;
+	while (*name_start) {
+	    /* skip white space */
+	    for(;isspace(*name_start);name_start++);
+	    /* skip to first "," or "." */
+	    for(surname_end=name_start;
+		*surname_end && *surname_end!=',' && *surname_end!='.';
+		surname_end++);
+	    /* start looking for next surname */
+	    if (*surname_end) surname_end++;
+	    /* skip over initials [-a-zA-Z.]*/
+	    for(name_end=surname_end;
+		*name_end && (isalpha(*name_end) || *name_end=='-' || *name_end=='.');
+		name_end++);
+	    initials_end = name_end-1;
+	    /* skip over white space */
+	    for(;isspace(*name_end);name_end++);
+	    /* skip over name terminators "," ";" "and" */
+	    if (*name_end==',' || *name_end==';') name_end++;
+	    else if (strncmp(name_end,"and",3)==0) name_end+=3;
+	    /* skip over trailing white space */
+	    for(;isspace(*name_end);name_end++);
+	    
+	    /* nibble back on t to find end of Surname*/
+	    if (*surname_end=='.') surname_end-=3;
+	    else if (*surname_end==',') surname_end--;
+
+	    end = initials_end;
+	    if (end-name_start > 0) {
+		fprintf(ofp,"%-10.10s ",entryName);
+		was_space = 0;
+		for(;name_start<=end;name_start++) {
+		    if (isspace(*name_start)) {
+			if (! was_space) putc(' ',ofp);
+			was_space = 1;
+		    } else {
+			if (islower(*name_start))
+			    putc(toupper(*name_start),ofp);
+			else
+			    putc(*name_start,ofp);
+			was_space = 0;
+		    }
+		}
+		putc('\n',ofp);
+	    }
+	    name_start = name_end;
+	}
+    } else {
+	/* must force an entry */
+	fprintf(ofp,"%-10.10s %s\n",entryName,"BLOGGS");
+    }
+}
+
+
+
+
+
+
+/******************************************************/
+/* A small toolkit to support variable length strings */
+/******************************************************/
+
+typedef struct {
+    int length;
+    int allocated;
+    char *str;
+} STR;
+
+static void init_str(STR *s)
+/*
+** initialise the string
+*/
+{
+    s->length = 0;
+    if(s->allocated) s->str[0] = '\0';
+}
+
+static void free_str(STR *s)
+/*
+** Destroy string (reclaim memory)
+*/
+{
+    free(s->str);
+    free(s);
+}
+
+static void cat_str(STR *s, char *t)
+/*
+** Join string t to string *s
+*/
+{
+    int need;
+
+    need = s->length + strlen(t) + 1;
+    if (need > s->allocated) {
+	if ( s->allocated )
+	    /* create just that little bit more */
+	    s->str = (char *)realloc(s->str,need+need/2);
+	else {
+	    /* allocating for the first time */
+	    s->str = (char *)malloc(need+need/2);
+	    s->str[0] = '\0';
+	}
+	s->allocated = need+need/2;
+    }
+    s->length = need-1;
+    strcat(s->str,t);
+
+}
+
+
+static STR *create_str()
+/*
+** Create a new string
+*/
+{
+    STR *new;
+
+    new = (STR *) malloc(sizeof(STR));
+    new->length = 0;
+    new->allocated = 0;
+    new->str = NULL;
+
+    return new;
+}
+
+
+
+
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    int entries = 0;
+    char line[MAXLINE];
+    STR *authors;
+    int author_mode;
+
+#ifdef PIR
+    char *progname = "pirauthor";
+    int offset = 16; /* Entry name offset in Entry line */
+#endif /*PIR*/
+#ifdef EMBL
+    char *progname = "emblauthor";
+    int offset = 5; /* Entry name offset in Entry line */
+#endif /*EMBL*/
+#ifdef GENBANK
+    char *progname = "genbauthor";
+    int offset = 12; /* Entry name offset in Entry line */
+#endif /*GENBANK*/
+
+    printf("%s Version 1.0\n",progname);
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: %s filein fileout\n",progname);
+	exit(2);
+    }
+
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"%s: cannot open input file %s\n",progname,argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"%s: cannot open output file %s\n",progname,argv[2]);
+	exit(1);
+    }
+
+    authors = create_str();
+    author_mode = 0;
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+	/*
+	** Determine line type
+	*/
+	if (author_mode && patmatch(line,contPATTERN)) {
+	    cat_str(authors,line+strlen(contPATTERN));
+	} else if (patmatch(line,authorPATTERN)) {
+	    author_mode = 1;
+	    cat_str(authors,line+strlen(authorPATTERN));
+	} else {
+	    if (author_mode) parse(ofp,authors->str);
+	    author_mode = 0;
+	    if ( patmatch(line,entryPATTERN) ) {
+		strncpy(entryName,line+offset,10);
+		entries++;
+		init_str(authors);
+	    }
+	}
+	
+    }
+
+    free_str(authors);
+    printf(" Number of entries = %d\n\n",entries); 
+
+    fclose(ifp);
+    fclose(ofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/cdromheader.c b/src/indexseqlibs/cdromheader.c
new file mode 100644
index 0000000..fa495f2
--- /dev/null
+++ b/src/indexseqlibs/cdromheader.c
@@ -0,0 +1,44 @@
+/*       cdromheader     */
+
+#include <stdio.h>
+#include "mach-io.h"
+
+/*   adds a cdrom style header to file ofp */
+int cdromheader( FILE *ofp, uint_2 recordSize )
+{
+
+  int i;
+  uint_4 fSize = 0;
+  uint_4 nRecords = 0;
+  char dbName[20] = "PIR45678901234567890";
+  char dbRel[10] = "Release890";
+  char dt[4] = "1234";
+  char empty[256];
+
+/* header contains: 
+
+        ul fsize      
+        ul nRecords      (needs to be updated after rest of file is written)
+        us recordSize
+      [20] dbName
+      [10] dbRel
+      [4]  dt
+     [256] empty
+
+   all are constant except recordSize which hence must be passed
+
+*/
+
+  for (i=0;i<256;i++) empty[i] = ' ';
+  if(le_write_int_4(ofp, &fSize) == 0) return 4;
+  if(le_write_int_4(ofp, &nRecords) == 0) return 5;
+  if(le_write_int_2(ofp, &recordSize) == 0) return 6;
+
+  fwrite(dbName, sizeof(dbName), 1, ofp);
+  fwrite(dbRel, sizeof(dbRel), 1, ofp);
+  fwrite(dt, sizeof(dt), 1, ofp);
+  fwrite(empty, sizeof(empty), 1, ofp);
+
+  return 0;
+}
+
diff --git a/src/indexseqlibs/cdromheader.h b/src/indexseqlibs/cdromheader.h
new file mode 100644
index 0000000..5a46bcb
--- /dev/null
+++ b/src/indexseqlibs/cdromheader.h
@@ -0,0 +1,3 @@
+#include "mach-io.h"
+
+extern int cdromheader(FILE *fp, uint_2 recordSize);
diff --git a/src/indexseqlibs/data-flow.doc b/src/indexseqlibs/data-flow.doc
new file mode 100644
index 0000000..216ecea
--- /dev/null
+++ b/src/indexseqlibs/data-flow.doc
@@ -0,0 +1,165 @@
+Scripts and programs to create indexes for various sequence libraries
+---------------------------------------------------------------------
+
+We have written a suite of scripts for creating title, entry name,
+freetext and accession number indexes for EMBL, GenBank, PIR and
+SwissProt sequence libraries in EMBL CD-ROM format. Each script calls a
+series of programs and system command which extract, sort and process
+the data before the indexes are made. The first step of each script is
+to gather the data.  This program will be particular to the format of
+the sequence library.  All other programs in the script will be common
+to all formats.
+
+Below are the steps for in creating each index or pair of indexes. The
+programs particular to the step is given in parentheses.  Between
+steps the format of the intermediate files are given, and in parentheses
+the generic name of the file. For the format of the index file please
+refer to the EMBL CD-ROM format specification.
+
+
+Creating Accession Number Indexes
+---------------------------------
+
+Scripts:
+	emblaccession.script
+	genbaccession.script
+	piraccession.script
+	swissaccession.script
+
+Step 1: Gather entry name and accession number information
+(emblaccess1, genbaccess1, piraccess1)
+
+	"%10s %10s\n" entry_name accession_no			(*.list)
+
+Step 2: Sort on entry name
+
+	"%10s %10s\n" entry_name accession_no			(access.sorted)
+
+Step 3: Assign entry numbers (access2)
+
+	"%10s %10s %10d\n" entry_name accession_no entry_no 	(access.entry)
+
+Step 4: Sort on accession number
+
+	"%10s %10s %10d\n" entry_name accession_no entry_no 	(access.sorted2)
+
+Step 5: Create indexes (access4)
+
+	(acnum.hit, acnum.trg)
+
+
+
+Creating Title/Brief Directory Index
+------------------------------------
+
+Scripts:
+	embltitle.script
+	genbtitle.script
+	pirtitle.script
+	swisstitle.script
+
+Step 1: Gather entry name, accession number, sequence length and title
+information (embltitle1, genbtitle1, pirtitle1)
+
+	"%10s %10s %10d %80s\n" entry_name acc_no seq_len title (*.list)
+
+Step 2: Sort on entry name
+
+	"%10s %10s %10d %80s\n" entry_name acc_no seq_len title (title.sorted)
+
+Step 3: Generate index (title2)
+
+	(brief.idx)
+
+
+
+Creating Free Text Index
+------------------------
+
+Scripts:
+	emblfreetext.script
+	genbfreetext.script
+	pirfreetext.script
+	swissfreetext.script
+
+Step 1: Gather entry names and free text (emblfreetext, genbfreetext,
+pirfreetext)
+
+	"%10s %s\n" entry_name word (*.list)
+
+Step 2: Sort on word
+Step 3: Remove duplicate word/entry name entries
+Step 4: Remove stopwords (excludewords)
+Step 5: Sort on entry name
+
+	"%10s %s\n" entry_name word (freetext.sorted)
+
+Step 6: Assign entry numbers (freetext2)
+
+	"%10s %10d %s\n" entry_name entry_no word (freetext.entry)
+
+Step 7: Sort on word
+
+	"%10s %10d %s\n" entry_name entry_no word (freetext.sorted2)
+
+Step 8: Create indexes (freetext4)
+
+	(freetext.hit, freetext.trg)
+
+
+
+Creating Entry Name Index
+-------------------------
+
+Scripts:
+	emblentryname.script
+	genbentryname.script
+	pirentryname.script
+	swissentryname.script
+
+Step 1: Gather entry names, annotation offsets, sequence offsets and
+divisions (emblentryname1, genbentryname1, pirentryname1)
+
+	"%10s %10d %10d %5d" entry_name ann_offset seq_offset division (*.list)
+
+Step 2: Sort on entry names
+
+	"%10s %10d %10d %5d" entry_name ann_offset seq_offset division (entry.sorted)
+
+Step 3: Create index (entryname2)
+
+	(entryname.idx)
+	
+
+
+Creating Author Index
+------------------------
+NOTE: this is similar to free text index creation
+
+Scripts:
+	emblauthor.script
+	genbauthor.script
+	pirauthor.script
+	swissauthor.script
+
+Step 1: Gather entry names and authors (emblauthor, genbauthor,
+pirauthor)
+
+	"%10s %s\n" entry_name author (*.list)
+
+Step 2: Sort on entry name, removing duplicate entries
+
+	"%10s %s\n" entry_name author (author.sorted)
+
+Step 6: Assign entry numbers (freetext2)
+
+	"%10s %10d %s\n" entry_name entry_no author (author.entry)
+
+Step 7: Sort on author
+
+	"%10s %10d %s\n" entry_name entry_no author (author.sorted2)
+
+Step 8: Create indexes (hitNtrg)
+
+	(author.hit, author.trg)
+
diff --git a/src/indexseqlibs/division.c b/src/indexseqlibs/division.c
new file mode 100644
index 0000000..8ad9445
--- /dev/null
+++ b/src/indexseqlibs/division.c
@@ -0,0 +1,74 @@
+/*        division          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include "mach-io.h"
+
+/* program to read in a title file and write it out in binary  with a header */
+#define MAXLINE 128 /* > 80 + 10 + 10 + 10 */
+
+int main(int argc, char *argv[])
+{
+    int cdromheader( FILE *fp, unsigned short recordSize);
+    FILE *ifp;
+    FILE *tofp;
+    char line[MAXLINE];
+    
+    unsigned long nRecords = 0;
+    unsigned short recordSize = 14;
+
+
+    printf("division Version 1.1\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: division filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"division: Cannot open input file %s\n",argv[1]);
+	exit(1);
+	}
+    if ((tofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"division: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    if(cdromheader(tofp, recordSize ) != 0 ) {
+	fprintf(stderr,"division: Error writing target file header\n");
+	exit(1);
+    }
+
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+	/*
+	  Format of input line:
+	  "%12s %d\n"
+	     division name , division number
+	  */
+	int div;
+	div = atoi(line+13);    
+	if(le_write_int_2(tofp, (uint_2 *)&div) == 0) {
+	    fprintf(stderr,"division: Error writing to lookup file\n");
+	    exit(1);
+	}
+	fwrite(line,12,1,tofp);
+	nRecords++;;
+    }
+    
+    /* add number of records to header */
+    if (fseek(tofp, 4L,0)) {
+	fprintf(stderr,"division: Error seeking on target file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, (uint_4 *)&nRecords) == 0) {
+	fprintf(stderr,"division: Error writing to target file\n");
+	exit(1);
+    }
+
+    printf(" Records in division lookup file %d\n\n", nRecords);
+
+    fclose(ifp);
+    fclose(tofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/emblaccess1.c b/src/indexseqlibs/emblaccess1.c
new file mode 100644
index 0000000..964128b
--- /dev/null
+++ b/src/indexseqlibs/emblaccess1.c
@@ -0,0 +1,117 @@
+/*          emblaccess1           */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 82     /* maximum input line length*/
+
+/*
+ * program to create index for pir library in embl cdrom form
+ * this one gets the accession numbers and entry names and writes them to a
+ * file this file is sorted on entryname, then the next program processes the
+ * sorted file to add a number to each record that has a new entryname. This
+ * file is then sorted on accession number. This file is then processed to add
+ * the number of hits for each accession numnber: the first occurrence for
+ * each number is given 1, the next 2 and so on. This file is then processed
+ * to produce the final accession number target and hit filesin binary with
+ * the header
+ */
+
+int parseACCESSION(FILE *fp, char *, char *);
+
+char patternENTRY[] = "ID";
+char patternACCESSION[] = "AC";
+char patternCONTINUE[] =  "AC";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char eName[10];
+
+    long entries = 0;
+    int leftChar = 5;
+    
+    printf("emblaccess1 Version 1.0\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: emblaccess1 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"emblaccess1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"emblaccess1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    eName[0] = '\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL) {
+	
+	if (patmatch(line, patternENTRY)){
+	    
+	    if (*eName)
+		printf(" No accession lines for %-10.10s\n",eName);
+
+	    strncpy(eName,&line[leftChar],10);
+	    entries++;
+	    
+	} else if (patmatch(line, patternACCESSION)){
+	    
+	    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+		printf(" Empty accession line !!\n");
+
+	    /* Assuming we wont run into an ENTRY line !!!!!!!!!!  */
+	    while (fgets(line,MAXLINE,ifp)!=NULL){
+		
+		if (patmatch(line, patternCONTINUE)){
+		    
+		    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+			printf(" Empty accession line !!\n");
+		    
+		} else
+		    break;
+	    }
+	    eName[0] = '\0';
+	}
+    }
+    
+    printf(" Number of entries = %ld\n\n",entries); 
+    
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
+int parseACCESSION(FILE *fp, char *line, char *eName)
+{
+    char *s, *t;
+    int entries;
+    
+    entries = 0;
+    s = line;
+    while (*s) {
+	/* skip over white space etc */
+	for ( ; *s && !isalnum(*s) ; s++);
+	
+	/* find end of accession number if there is one*/
+	if (*s) {
+	    entries++;
+	    for (t=s; isalnum(*s) ; s++);
+	    fprintf(fp, "%-10.10s %-10.*s\n",eName,s-t,t);
+	}
+	
+    }
+    
+    return entries;
+    
+}	
diff --git a/src/indexseqlibs/emblaccession.script b/src/indexseqlibs/emblaccession.script
new file mode 100644
index 0000000..b099808
--- /dev/null
+++ b/src/indexseqlibs/emblaccession.script
@@ -0,0 +1,79 @@
+#! /bin/csh -f
+#
+# script to make an accession number index for the embl library newdata file
+#
+
+echo "emblaccession.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is what you've called the sequence library
+#
+
+# get entry names and accession numbers
+emblaccess1 bb.dat bb.list
+if ( $status ) goto abort
+emblaccess1 fun.dat fun.list
+if ( $status ) goto abort
+emblaccess1 inv.dat inv.list
+if ( $status ) goto abort
+emblaccess1 mam.dat mam.list
+if ( $status ) goto abort
+emblaccess1 org.dat org.list
+if ( $status ) goto abort
+emblaccess1 patent.dat patent.list
+if ( $status ) goto abort
+emblaccess1 phg.dat phg.list
+if ( $status ) goto abort
+emblaccess1 pln.dat pln.list
+if ( $status ) goto abort
+emblaccess1 pri.dat pri.list
+if ( $status ) goto abort
+emblaccess1 pro.dat pro.list
+if ( $status ) goto abort
+emblaccess1 rod.dat rod.list
+if ( $status ) goto abort
+emblaccess1 syn.dat syn.list
+if ( $status ) goto abort
+emblaccess1 una.dat una.list
+if ( $status ) goto abort
+emblaccess1 vrl.dat vrl.list
+if ( $status ) goto abort
+emblaccess1 vrt.dat vrt.list
+if ( $status ) goto abort
+
+
+# sort on entry name
+${SORT} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list >! access.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+# assign entry numbers to entry names
+access2 access.sorted access.entry
+if ( $status ) goto abort
+${RM} access.sorted
+
+# sort on accession numbers
+${SORT} -b +1 access.entry >! access.sorted2
+if ( $status ) goto abort
+${RM} access.entry
+
+# create index files
+access4 access.sorted2 acnum.trg acnum.hit
+if ( $status ) goto abort
+${RM} access.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblaccession.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/emblauthor.script b/src/indexseqlibs/emblauthor.script
new file mode 100644
index 0000000..eb1d49c
--- /dev/null
+++ b/src/indexseqlibs/emblauthor.script
@@ -0,0 +1,75 @@
+#! /bin/csh -f
+#
+# script to make an author index for the embl newdata library
+#
+
+echo "emblauthor.script Version 1.0"
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+#
+#
+
+# get authors
+emblauthor bb.dat bb.list
+if ( $status ) goto abort
+emblauthor fun.dat fun.list
+if ( $status ) goto abort
+emblauthor inv.dat inv.list
+if ( $status ) goto abort
+emblauthor mam.dat mam.list
+if ( $status ) goto abort
+emblauthor org.dat org.list
+if ( $status ) goto abort
+emblauthor patent.dat patent.list
+if ( $status ) goto abort
+emblauthor phg.dat phg.list
+if ( $status ) goto abort
+emblauthor pln.dat pln.list
+if ( $status ) goto abort
+emblauthor pri.dat pri.list
+if ( $status ) goto abort
+emblauthor pro.dat pro.list
+if ( $status ) goto abort
+emblauthor rod.dat rod.list
+if ( $status ) goto abort
+emblauthor syn.dat syn.list
+if ( $status ) goto abort
+emblauthor una.dat una.list
+if ( $status ) goto abort
+emblauthor vrl.dat vrl.list
+if ( $status ) goto abort
+emblauthor vrt.dat vrt.list
+if ( $status ) goto abort
+
+
+# sort on entry name
+${SORT} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list >! author.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+# include entry numbers in the file
+freetext2 author.sorted author.entry
+if ( $status ) goto abort
+${RM} author.sorted
+
+# sort on text
+${SORT} -b +2 author.entry >! author.sorted2
+if ( $status ) goto abort
+${RM} author.entry
+
+# create the index
+hitNtrg author.sorted2 author.trg author.hit 35
+if ( $status ) goto abort
+${RM} author.sorted2
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblauthor.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embldivision.script b/src/indexseqlibs/embldivision.script
new file mode 100644
index 0000000..5a26a56
--- /dev/null
+++ b/src/indexseqlibs/embldivision.script
@@ -0,0 +1,48 @@
+#! /bin/csh -f
+#
+# script to create an division index for the embl library in codata format
+#
+
+echo "embldivision.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+
+#
+#
+#
+
+
+awk '{printf("%-12.12s %d\n",$1,$2);}' << EOF >! division.list
+bb.dat 1
+fun.dat 2
+inv.dat 3
+mam.dat 4
+org.dat 5
+patent.dat 6
+phg.dat 7
+pln.dat 8
+pri.dat 9
+pro.dat 10
+rod.dat 11
+syn.dat 12
+una.dat 13
+vrl.dat 14
+vrt.dat 15
+EOF
+
+division division.list division.lkp
+if ( $status ) goto abort
+${RM} division.list
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "embldivision.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/emblentryname.script b/src/indexseqlibs/emblentryname.script
new file mode 100644
index 0000000..ce5aac2
--- /dev/null
+++ b/src/indexseqlibs/emblentryname.script
@@ -0,0 +1,69 @@
+#! /bin/csh -f
+#
+# script to create an entryname index for the embl newdata library
+#
+
+echo "emblentryname.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+#
+#
+
+emblentryname1 bb.dat bb.list 1
+if ( $status ) goto abort
+emblentryname1 fun.dat fun.list 2
+if ( $status ) goto abort
+emblentryname1 inv.dat inv.list 3
+if ( $status ) goto abort
+emblentryname1 mam.dat mam.list 4
+if ( $status ) goto abort
+emblentryname1 org.dat org.list 5
+if ( $status ) goto abort
+emblentryname1 patent.dat patent.list 6
+if ( $status ) goto abort
+emblentryname1 phg.dat phg.list 7
+if ( $status ) goto abort
+emblentryname1 pln.dat pln.list 8
+if ( $status ) goto abort
+emblentryname1 pri.dat pri.list 9
+if ( $status ) goto abort
+emblentryname1 pro.dat pro.list 10
+if ( $status ) goto abort
+emblentryname1 rod.dat rod.list 11
+if ( $status ) goto abort
+emblentryname1 syn.dat syn.list 12
+if ( $status ) goto abort
+emblentryname1 una.dat una.list 13
+if ( $status ) goto abort
+emblentryname1 vrl.dat vrl.list 14
+if ( $status ) goto abort
+emblentryname1 vrt.dat vrt.list 15
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list >! entry.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+
+
+# create index file
+entryname2 entry.sorted entrynam.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblentryname.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/emblentryname1.c b/src/indexseqlibs/emblentryname1.c
new file mode 100644
index 0000000..40bdcd2
--- /dev/null
+++ b/src/indexseqlibs/emblentryname1.c
@@ -0,0 +1,81 @@
+/*         emblentryname1           */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for embl library in embl cdrom form */
+/* this one gets the entry names and offsets and writes them to an ascii file 
+   this file is sorted, then the next program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "ID";
+char patternSEQUENCE[] = "SQ";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    off_t entry_offset = 0;
+    off_t sequence_offset = 0;
+    long entries_in = 0;
+    long entries_out = 0;
+    int leftChar = 5;
+
+    printf("emblentryname1 Version 1.0\n");
+    
+    if (argc != 4) {
+	fprintf(stderr,"Usage: emblentryname1 filein fileout division\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"emblentryname1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"emblentryname1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    entry_name[0]='\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL){
+
+	if (patmatch(line, patternENTRY)){
+
+	    if (*entry_name)
+		printf("No sequence for entry %-10.10s\n",entry_name);
+
+	    strncpy(entry_name,&line[leftChar],10);
+	    entry_offset = ftell(ifp) - strlen(line);
+	    entries_in++;
+
+	} else if (patmatch(line, patternSEQUENCE) && *entry_name) {
+
+	    sequence_offset = ftell(ifp) + 5;
+	    fprintf(ofp,"%-10.10s %10ld %10ld %-5.5s\n",entry_name,
+		    entry_offset,sequence_offset,argv[3]);
+	    entries_out++;
+	    entry_name[0]='\0';
+
+	}
+
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in); 
+    printf(" Number of entries written = %ld\n\n",entries_out); 
+
+    fclose(ifp);
+    fclose(ofp);
+    
+    return 0;
+}
+
+
+
diff --git a/src/indexseqlibs/emblfreetext.script b/src/indexseqlibs/emblfreetext.script
new file mode 100644
index 0000000..3d17f5c
--- /dev/null
+++ b/src/indexseqlibs/emblfreetext.script
@@ -0,0 +1,95 @@
+#! /bin/csh -f
+#
+# script to make a free text index for the embl newdata library
+#
+
+echo "emblfreetext.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+set UNIQ = uniq
+
+#
+#
+#
+
+set WORDS = stopwords
+
+
+
+emblfreetext bb.dat bb.list
+if ( $status ) goto abort
+emblfreetext fun.dat fun.list
+if ( $status ) goto abort
+emblfreetext inv.dat inv.list
+if ( $status ) goto abort
+emblfreetext mam.dat mam.list
+if ( $status ) goto abort
+emblfreetext org.dat org.list
+if ( $status ) goto abort
+emblfreetext patent.dat patent.list
+if ( $status ) goto abort
+emblfreetext phg.dat phg.list
+if ( $status ) goto abort
+emblfreetext pln.dat pln.list
+if ( $status ) goto abort
+emblfreetext pri.dat pri.list
+if ( $status ) goto abort
+emblfreetext pro.dat pro.list
+if ( $status ) goto abort
+emblfreetext rod.dat rod.list
+if ( $status ) goto abort
+emblfreetext syn.dat syn.list
+if ( $status ) goto abort
+emblfreetext una.dat una.list
+if ( $status ) goto abort
+emblfreetext vrl.dat vrl.list
+if ( $status ) goto abort
+emblfreetext vrt.dat vrt.list
+if ( $status ) goto abort
+
+
+# sort on entry name
+${SORT} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list >! freetext.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+
+
+
+
+
+
+
+# sort on words, remove dumplicates, remove stopwords, resort on entry name
+${SORT} -b +1 bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list | ${UNIQ} | excludewords ${WORDS} | ${SORT} >! freetext.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+# include entry numbers in the file
+freetext2 freetext.sorted freetext.entry
+if ( $status ) goto abort
+${RM} freetext.sorted
+
+# sort on text
+${SORT} -b +2 freetext.entry >! freetext.sorted2
+if ( $status ) goto abort
+${RM} freetext.entry
+
+# create the index
+freetext4 freetext.sorted2 freetext.trg freetext.hit
+if ( $status ) goto abort
+${RM} freetext.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblfreetext.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embltitle.script b/src/indexseqlibs/embltitle.script
new file mode 100644
index 0000000..11e47da
--- /dev/null
+++ b/src/indexseqlibs/embltitle.script
@@ -0,0 +1,69 @@
+#! /bin/csh -f
+#
+# script to make a brief index for the embl newdata library
+#
+
+echo "embltitle.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+#
+#
+
+embltitle1 bb.dat bb.list
+if ( $status ) goto abort
+embltitle1 fun.dat fun.list
+if ( $status ) goto abort
+embltitle1 inv.dat inv.list
+if ( $status ) goto abort
+embltitle1 mam.dat mam.list
+if ( $status ) goto abort
+embltitle1 org.dat org.list
+if ( $status ) goto abort
+embltitle1 patent.dat patent.list
+if ( $status ) goto abort
+embltitle1 phg.dat phg.list
+if ( $status ) goto abort
+embltitle1 pln.dat pln.list
+if ( $status ) goto abort
+embltitle1 pri.dat pri.list
+if ( $status ) goto abort
+embltitle1 pro.dat pro.list
+if ( $status ) goto abort
+embltitle1 rod.dat rod.list
+if ( $status ) goto abort
+embltitle1 syn.dat syn.list
+if ( $status ) goto abort
+embltitle1 una.dat una.list
+if ( $status ) goto abort
+embltitle1 vrl.dat vrl.list
+if ( $status ) goto abort
+embltitle1 vrt.dat vrt.list
+if ( $status ) goto abort
+
+
+# sort on entry name
+${SORT} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list >! title.sorted
+if ( $status ) goto abort
+${RM} bb.list fun.list inv.list mam.list org.list patent.list phg.list pln.list pri.list pro.list rod.list syn.list una.list vrl.list vrt.list
+
+
+# generate index
+title2 title.sorted brief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "embltitle.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embltitle1.c b/src/indexseqlibs/embltitle1.c
new file mode 100644
index 0000000..177011f
--- /dev/null
+++ b/src/indexseqlibs/embltitle1.c
@@ -0,0 +1,104 @@
+/*          embltitle1            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for pir library in embl cdrom form */
+/* this one gets the entry names and offsets and writes them to an ascii file 
+   this file is sorted, then the next program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "ID";
+char patternTITLE[] = "DE";
+char patternACCESSION[] = "AC";
+char patternSUMMARY[] = "SQ";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    char accession_number[10];
+    unsigned long sequence_length;
+    char title[80];
+
+    int i;
+    int leftChar = 5;
+    long entries_in = 0;
+    long entries_out = 0;
+
+
+    printf("embltitle1 Version 1.1\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: embltitle1 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"embltitle1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"embltitle1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    entry_name[0] = '\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL) {
+
+	if (patmatch(line, patternENTRY)){
+	    if (*entry_name) {
+		/* let's output gathered values for last entry*/
+		fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+			entry_name,
+			accession_number,
+			sequence_length,
+			title);
+		entries_out++;
+	    }
+	    /* initialise values */
+	    entry_name[0] = '\0';
+	    accession_number[0] = '\0';
+	    title[0] = '\0';
+	    sequence_length = 0L;
+
+	    entries_in++;
+	    strncpy(entry_name,&line[leftChar],10);
+	} else if (patmatch(line, patternTITLE) && *title=='\0'){
+	    for(i=0;isprint(line[leftChar+i]) && i<80;i++) title[i]=line[leftChar+i];
+	    for(;i<80;i++) title[i] = ' ';
+	} else if (patmatch(line, patternACCESSION) && *accession_number=='\0'){
+	    for(i=0;isalnum(line[leftChar+i]) && i<10;i++) accession_number[i]=line[leftChar+i];
+	    for(;i<10;i++) accession_number[i] = ' ';
+	} else if (patmatch(line, patternSUMMARY) && sequence_length==0){
+	    sequence_length = atol(&line[13]);
+	}
+    }
+    if (*entry_name) {
+	/* let's output gathered values for last entry*/
+	fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+		entry_name,
+		accession_number,
+		sequence_length,
+		title);
+	entries_out++;
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in);
+    printf(" Number of entries written = %ld\n\n",entries_out);
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
+
diff --git a/src/indexseqlibs/embluaccession.script b/src/indexseqlibs/embluaccession.script
new file mode 100644
index 0000000..bae5382
--- /dev/null
+++ b/src/indexseqlibs/embluaccession.script
@@ -0,0 +1,51 @@
+#! /bin/csh -f
+#
+# script to make an accession number index for the embl library newdata file
+#
+
+echo "emblaccession.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is what you've called the sequence library
+#
+set LIBFILE = emseq.all
+
+# get entry names and accession numbers
+emblaccess1 $LIBFILE access.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} access.list >! access.sorted
+if ( $status ) goto abort
+${RM} access.list
+
+# assign entry numbers to entry names
+access2 access.sorted access.entry
+if ( $status ) goto abort
+${RM} access.sorted
+
+# sort on accession numbers
+${SORT} -b +1 access.entry >! access.sorted2
+if ( $status ) goto abort
+${RM} access.entry
+
+# create index files
+access4 access.sorted2 acnum.trg acnum.hit
+if ( $status ) goto abort
+${RM} access.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblaccession.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embluauthor.script b/src/indexseqlibs/embluauthor.script
new file mode 100644
index 0000000..6c7a5b6
--- /dev/null
+++ b/src/indexseqlibs/embluauthor.script
@@ -0,0 +1,47 @@
+#! /bin/csh -f
+#
+# script to make an author index for the embl newdata library
+#
+
+echo "emblauthor.script Version 1.0"
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE = emseq.all
+
+# get authors
+emblauthor $LIBFILE author.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} -u author.list >! author.sorted
+if ( $status ) goto abort
+${RM} author.list
+
+# include entry numbers in the file
+freetext2 author.sorted author.entry
+if ( $status ) goto abort
+${RM} author.sorted
+
+# sort on text
+${SORT} -b +2 author.entry >! author.sorted2
+if ( $status ) goto abort
+${RM} author.entry
+
+# create the index
+hitNtrg author.sorted2 author.trg author.hit 35
+if ( $status ) goto abort
+${RM} author.sorted2
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblauthor.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embludivision.script b/src/indexseqlibs/embludivision.script
new file mode 100644
index 0000000..8cefeaf
--- /dev/null
+++ b/src/indexseqlibs/embludivision.script
@@ -0,0 +1,34 @@
+#! /bin/csh -f
+#
+# script to create an division index for the embl library in codata format
+#
+
+echo "embldivision.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+
+#
+#
+#
+
+
+awk '{printf("%-12.12s %d\n",$1,$2);}' << EOF >! division.list
+emseq.all 1
+EOF
+
+division division.list division.lkp
+if ( $status ) goto abort
+${RM} division.list
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbdivision.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/embluentryname.script b/src/indexseqlibs/embluentryname.script
new file mode 100644
index 0000000..65de1c2
--- /dev/null
+++ b/src/indexseqlibs/embluentryname.script
@@ -0,0 +1,41 @@
+#! /bin/csh -f
+#
+# script to create an entryname index for the embl newdata library
+#
+
+echo "emblentryname.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE = emseq.all
+
+# get entry names
+emblentryname1 $LIBFILE entry.list 1
+if ( $status ) goto abort
+
+# sort on entry names
+${SORT} entry.list >! entry.sorted
+if ( $status ) goto abort
+${RM} entry.list
+
+# create index file
+entryname2 entry.sorted entrynam.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblentryname.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/emblufreetext.script b/src/indexseqlibs/emblufreetext.script
new file mode 100644
index 0000000..e2fa0e0
--- /dev/null
+++ b/src/indexseqlibs/emblufreetext.script
@@ -0,0 +1,54 @@
+#! /bin/csh -f
+#
+# script to make a free text index for the embl newdata library
+#
+
+echo "emblfreetext.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+set UNIQ = uniq
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE = emseq.all
+
+set WORDS = stopwords
+
+# get free text
+emblfreetext $LIBFILE freetext.list
+if ( $status ) goto abort
+
+# sort on words, remove dumplicates, remove stopwords, resort on entry name
+${SORT} -b +1 freetext.list | ${UNIQ} | excludewords ${WORDS} | ${SORT} >! freetext.sorted
+if ( $status ) goto abort
+${RM} freetext.list
+
+# include entry numbers in the file
+freetext2 freetext.sorted freetext.entry
+if ( $status ) goto abort
+${RM} freetext.sorted
+
+# sort on text
+${SORT} -b +2 freetext.entry >! freetext.sorted2
+if ( $status ) goto abort
+${RM} freetext.entry
+
+# create the index
+freetext4 freetext.sorted2 freetext.trg freetext.hit
+if ( $status ) goto abort
+${RM} freetext.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "emblfreetext.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/emblutitle.script b/src/indexseqlibs/emblutitle.script
new file mode 100644
index 0000000..acc7a33
--- /dev/null
+++ b/src/indexseqlibs/emblutitle.script
@@ -0,0 +1,41 @@
+#! /bin/csh -f
+#
+# script to make a brief index for the embl newdata library
+#
+
+echo "embltitle.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE = emseq.all
+
+# create title list
+embltitle1 $LIBFILE title.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} title.list >! title.sorted
+if ( $status ) goto abort
+${RM} title.list
+
+# generate index
+title2 title.sorted brief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "embltitle.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/entryname2.c b/src/indexseqlibs/entryname2.c
new file mode 100644
index 0000000..d170441
--- /dev/null
+++ b/src/indexseqlibs/entryname2.c
@@ -0,0 +1,92 @@
+/*       entryname2     */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "mach-io.h"
+#include "cdromheader.h"
+
+#define MAXLINE 256
+
+/* program to read in a sorted index and write it out in binary  with a header */
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    uint_4 entryOffset;
+    uint_4 sequenceOffset;
+    uint_2 div;
+    uint_4 nRecords = 0;
+    uint_2 recordSize = 20;
+    char line[MAXLINE];
+    char last_entry[10];
+
+    printf("entryname2 Version 1.1\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: entryname2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"entryname2: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"entryname2: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    /* new    */
+    if(cdromheader(ofp, recordSize ) != 0 ) {
+	fprintf(stderr,"entryname2: Error writing index file header\n");
+	exit(1);
+    }
+    
+
+    last_entry[0] = '\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL){
+
+	/* beware of duplicate entries */
+	if (strncmp(last_entry,line,10) == 0) {
+	    fprintf(stderr,"entryname2: warning - entry %-10.10s appears twice\n", line);
+	} else {
+	    entryOffset = atoi(line+11);
+	    sequenceOffset = atoi(line+22);
+	    div = (int_2) atoi(line+33);
+	    
+	    fwrite(line, 10, 1, ofp);
+	    if(le_write_int_4(ofp, &entryOffset) == 0) {
+		fprintf(stderr,"entryname2: Error writing to index file\n");
+		exit(1);
+	    }
+	    if(le_write_int_4(ofp, &sequenceOffset) == 0) {
+		fprintf(stderr,"entryname2: Error writing to index file\n");
+		exit(1);
+	    }
+	    if(le_write_int_2(ofp, &div) == 0) {
+		fprintf(stderr,"entryname2: Error writing to index file\n");
+		exit(1);
+	    }
+	    
+	    nRecords++;
+	    strncpy(last_entry,line,10);
+	}
+    }
+
+    /* add number of records to header */
+    if (fseek(ofp, (off_t)4,0)) {
+	fprintf(stderr,"entryname2: Error seeking on index file\n");
+	exit(1);
+    }
+    if(le_write_int_4(ofp, &nRecords) == 0) {
+	fprintf(stderr,"entryname2: Error writing to index file\n");
+	exit(1);
+    }
+
+    printf(" Number of entries = %d\n\n",nRecords);
+
+    fclose(ifp);
+    fclose(ofp);
+    return 0;
+}
diff --git a/src/indexseqlibs/excludewords.c b/src/indexseqlibs/excludewords.c
new file mode 100644
index 0000000..affbea3
--- /dev/null
+++ b/src/indexseqlibs/excludewords.c
@@ -0,0 +1,103 @@
+/*--- excludewords ---*/
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define MAXLINE 100
+
+static FILE *ex;
+static FILE *in;
+static FILE *out;
+
+
+static int line_comp(char *ex, char *in, int l)
+/*
+** Compare exclude line with inline
+*/
+{
+    return strncmp(ex,in+11,l);
+}
+
+
+
+
+static void process()
+/*
+** Do the work
+*/
+{
+    char exline[MAXLINE];
+    char inline[MAXLINE];
+    char *exok, *inok;
+    int compare;
+
+
+    exok = fgets(exline,MAXLINE,ex);
+    inok = fgets(inline,MAXLINE,in);
+
+    while (exok && inok) {
+
+	compare = line_comp(exok,inok,MAXLINE);
+
+	if (compare < 0)
+	    exok = fgets(exline,MAXLINE,ex);
+	else if (compare > 0) {
+	    fprintf(out,"%s",inline);
+	    inok = fgets(inline,MAXLINE,in);
+	} else
+	    inok = fgets(inline,MAXLINE,in);
+
+    }
+
+    while (inok = fgets(inline,MAXLINE,in))
+	fprintf(out,"%s",inline);
+
+}
+
+
+
+
+
+
+int main(int argc, char *argv[])
+/*
+** Open files
+*/
+{
+
+    if (argc < 2 || argc > 4) {
+	fprintf(stderr,"Usage: excludewords word_file [file_in [file_out]]\n");
+	exit(2);
+    }
+
+    /* set default files */
+    in = stdin; out = stdout;
+    switch (argc) {
+    case 4:
+	if ( (out = fopen(argv[3],"w")) == NULL ) {
+	    fprintf(stderr,"excludewords: Cannot open file %s for output\n",argv[3]);
+	    exit(1);
+	}
+    case 3:
+	if ( (in = fopen(argv[2],"r")) == NULL) {
+	    fprintf(stderr,"excludewords: Cannot open file %s for input\n",argv[2]);
+	    exit(1);
+	}
+    case 2:
+	if ( (ex = fopen(argv[1],"r")) == NULL) {
+	    fprintf(stderr,"excludewords: Cannot open exclude file %s\n",argv[1]);
+	    exit(1);
+	}
+    }
+
+    process();
+
+    switch (argc) {
+    case 4: fclose(out);
+    case 3: fclose(in);
+    case 2: fclose(ex);
+    }
+
+    return 0;
+}
diff --git a/src/indexseqlibs/freetext.c b/src/indexseqlibs/freetext.c
new file mode 100644
index 0000000..6f6e4a9
--- /dev/null
+++ b/src/indexseqlibs/freetext.c
@@ -0,0 +1,226 @@
+/*
+ * freetext.c
+ *
+ *    Source for:
+ *	pirfreetext
+ *	emblfreetext
+ *	genbfreetext
+ */
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 100     /* maximum input line length*/
+
+#ifdef PIR
+char *PATTERNS[] = {
+    "ENTRY",     /* 1 Entry */
+    "FEATURES",  /* 2 Features - we're not interested in these */
+    "TITLE",     /* 3 Definition */
+    "KEYWORDS",  /* 4 Keyword */
+    "COMMENT",   /* 5 Comment */
+    "REFERENCE", /* 6 Title */
+    "   #Title", /* 7 Title */
+    "   #Description", /* 7 Description */
+    };
+#endif /*PIR*/
+#ifdef EMBL
+char *PATTERNS[] = {
+    "ID", /* 1 Entry */
+    "FT", /* 2 Features */
+    "DE", /* 3 Definition */
+    "KW", /* 4 Keyword */
+    "CC", /* 5 Comment */
+    "RT", /* 6 Title */
+    "OG", /* 7 Organelle */
+    "GN", /* 8 Gene Name */
+    };
+#endif /*EMBL*/
+#ifdef GENBANK
+char *PATTERNS[] = {
+    "LOCUS",      /* 1 Entry */
+    "FEATURES",   /* 2 Features */
+    "DEFINITION", /* 3 Definition */
+    "KEYWORDS",   /* 4 Keyword */
+    "COMMENT",    /* 5 Comment */
+    "  TITLE",    /* 6 Title */
+    };
+#endif /*GENBANK*/
+
+#ifndef PIR
+/*
+** The following entries in feature tables are considered
+** to have useful text in them
+*/
+char *FEATURES[] = {
+    "/product=",
+    "/gene=",
+    "/note=",
+    "/bound_moiety=",
+    "/rpt_family=",
+    "/function=",
+};
+#endif
+
+
+char entryName[10];
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+#define Number(A) ( sizeof(A) / sizeof((A)[0]))
+
+static int terminator(char c)
+/*
+** Free text terminator character
+*/
+{
+    return !(isgraph(c) && ! ispunct(c));
+}
+
+
+
+
+static void parse(FILE *ofp, char *line)
+/*
+** Pick out all interesting strings
+*/
+{
+    char *s,*t;
+
+    s=t=line;
+    while( 1 ) {
+	
+	if ( terminator(*t) ) {
+	    if (t-s) fprintf(ofp, "%-10.10s %-.*s\n",entryName,t-s,s);
+	    if (!*t) break;
+	    s = ++t;
+	} else {
+	    if ( islower(*t) ) *t = toupper(*t);
+	    t++;
+	}
+
+    }
+    
+}
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    int entries = 0;
+    char line[MAXLINE];
+    int type = 0;
+    int i,tt;
+#ifndef PIR
+    char  *s, *t; /* for parsing FEATURE lines */
+    int note = 0;
+#endif
+
+#ifdef PIR
+    char *progname = "pirfreetext";
+    char *continue_str = "     ";
+    int offset = 16; /* Entry name offset in Entry line */
+#endif /*PIR*/
+#ifdef EMBL
+    char *progname = "emblfreetext";
+    char *continue_str = " ";
+    int offset = 5; /* Entry name offset in Entry line */
+#endif /*EMBL*/
+#ifdef GENBANK
+    char *progname = "genbfreetext";
+    char *continue_str = "     ";
+    int offset = 12; /* Entry name offset in Entry line */
+#endif /*GENBANK*/
+
+    printf("%s Version 1.2\n",progname);
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: %s filein fileout\n",progname);
+	exit(2);
+    }
+
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"%s: cannot open input file %s\n",progname,argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"%s: cannot open output file %s\n",progname,argv[2]);
+	exit(1);
+    }
+
+
+
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+	/*
+	** Determine line type
+	*/
+	tt = 0;
+	for (i = 0; i< Number(PATTERNS); i++) {
+	    if (patmatch(line,PATTERNS[i])){
+		tt = i+1;
+		break;
+	    }
+	}
+	if (tt!=0 || !patmatch(line,continue_str)) type = tt;
+
+
+	switch(type) {
+	case 0: /* of no interest */
+#ifndef PIR
+	    note = 0;
+#endif
+	    break;
+	case 1: /* 1 Entry */
+	    strncpy(entryName,line+offset,10);
+	    entries++;
+	    break;
+	case 2: /* 2 Features */
+#ifndef PIR
+	    /* NOTE: Nothing useful in features of PIR */
+	    /* Look for /.*=" entries */
+	    s = t = line+offset;
+	    while (t) {
+		switch (note) {
+		case 0: /* not processing comment */
+		    t = strchr(s,'/'); /* get start of note */
+		    if (t==NULL) break; /* line dealt with */
+		    for(i=0;i<Number(FEATURES);i++) {
+			if (patmatch(t,FEATURES[i])){
+			    t += strlen(FEATURES[i]);
+			    note = 1;
+			    break;
+			}
+		    }
+		    s = ++t; /* set start */
+		    if (!note) break;
+		case 1: /* processing comment */
+		    t = strchr(s,'"'); /* get end of string */
+		    if (t==NULL) { /* no end this line */
+			parse(ofp, s); /* parse to end of line */
+			break; /* line dealt with */
+		    } else { 
+			*t = '\0'; /* mark end of comment */
+			parse(ofp, s); /* parse comment */
+			note = 0; /* set mode */
+			s = ++t; /* step over string */
+		    }
+		}
+	    }
+#endif /* PIR */
+	    break;
+	default: /* Anything else */
+	    parse(ofp, line+offset);
+	    break;
+	}
+	
+    }
+
+
+    printf(" Number of entries = %d\n\n",entries); 
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
diff --git a/src/indexseqlibs/freetext2.c b/src/indexseqlibs/freetext2.c
new file mode 100644
index 0000000..0700869
--- /dev/null
+++ b/src/indexseqlibs/freetext2.c
@@ -0,0 +1,50 @@
+/*             freetext2             */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define MAXLINE 100 /* > 80 + 11 */
+
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    long entries = 0;
+    char line[MAXLINE];
+    char eNamec[10];
+   
+    printf("freetext2 Version 1.0\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: freetext2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"freetext2: cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"freetext2: cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    eNamec[0] = '\0';
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+	
+	if (strncmp(eNamec, line,10)!=0) {
+	    entries++;
+	    strncpy(eNamec,line,10);
+	}
+	fprintf(ofp, "%-10.10s %10ld %s",line,entries,line+11);
+    }
+    printf(" Number of entries = %ld\n\n",entries);
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
diff --git a/src/indexseqlibs/freetext4.c b/src/indexseqlibs/freetext4.c
new file mode 100644
index 0000000..84d5322
--- /dev/null
+++ b/src/indexseqlibs/freetext4.c
@@ -0,0 +1,163 @@
+/*            freetext4            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+#include "mach-io.h"
+#include "cdromheader.h"
+
+#define MAXLINE 200 /* > 80 + 11 + 11 + 11 */
+
+/* program to read in a free text number index and write it out in binary
+   with a header */
+
+
+static void cpy2nl(char *a, char *b)
+/*
+** Copy string b to a, up to first newline
+**
+** YUK! no longer needed
+*/
+{
+    for( ; *b && *b != '\n'; a++, b++ ) *a = *b;
+    *a = '\0';
+}
+
+
+static void blatnl(char *a)
+/*
+** Remove the newline
+*/
+{
+    for ( ; *a && *a != '\n' ; a++) ;
+    *a = '\0';
+}
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *tofp;
+    FILE *hofp;
+    uint_4 tgtRecords,hitRecords,hitStart,hits;
+    uint_2 tRecSize = 55;
+    uint_2 hRecSize = 4;
+    uint_4 entryNumber;
+    char last_word[MAXLINE];
+    char line[MAXLINE];
+
+
+    printf("freetext4 Version 1.1\n");
+    
+
+    if (argc != 4) {
+	fprintf(stderr,"Usage: freetext4 filein targetfileout hitfileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"freetext4: cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((tofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"freetext4: cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    if ((hofp = fopen(argv[3],"w")) == NULL) {
+	fprintf(stderr,"freetext4: cannot open output file %s\n",argv[3]);
+	exit(1);
+    }
+    
+    if(cdromheader(tofp, tRecSize ) != 0 ) {
+	fprintf(stderr,"freetext: Error writing target file header\n");
+        exit(1);
+    }
+    if(cdromheader(hofp, hRecSize ) != 0 ) {
+	fprintf(stderr,"freetext: Error writing hit file header\n");
+        exit(1);
+    }
+
+    /* get the first line */
+    if (fgets(line,MAXLINE,ifp) == NULL) {
+	fprintf(stderr,"freetext4: Error reading input file\n");
+	exit(1);
+    }
+    tgtRecords = 0;
+    hitRecords = 0;
+
+    /* get information */
+    cpy2nl(last_word,line+22);
+    hitStart = 1;
+    hits = 1;
+
+    /* for every record write the entryNumber to the hits file */
+    hitRecords++;
+    entryNumber = atoi(line+11);
+    if (le_write_int_4(hofp,&entryNumber) == 0) {
+	fprintf(stderr,"freetext: Error writing hit file\n");
+	exit(1);
+    }
+
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+
+	/* if current word is different - update files */
+	blatnl(line+22);
+	if (strncmp(last_word,line+22,MAXLINE)!=0) {
+	    /* write to target file */
+	    if (le_write_int_4(tofp,&hits) == 0) {
+		fprintf(stderr,"freetext: Error writing target file\n");
+		exit(1);
+	    }
+	    if (le_write_int_4(tofp,&hitStart) == 0) {
+		fprintf(stderr,"freetext: Error writing target file\n");
+		exit(1);
+	    }
+	    fprintf(tofp,"%-47.47s",last_word);
+	    tgtRecords++;
+	    /* get information */
+	    cpy2nl(last_word,line+22);
+	    hitStart = hitRecords+1;
+	    hits = 0;
+	}
+	hits++;
+
+	/* for every record write the entryNumber to the hits file */
+	hitRecords++;
+	entryNumber = atoi(line+11);
+	if (le_write_int_4(hofp,&entryNumber) == 0) {
+	    fprintf(stderr,"freetext: Error writing hit file\n");
+	    exit(1);
+	}
+    }
+
+    /* write the last target file record */
+    if (le_write_int_4(tofp,&hits) == 0) {
+	fprintf(stderr,"freetext: Error writing target file\n");
+	exit(1);
+    }
+    if (le_write_int_4(tofp,&hitStart) == 0) {
+	fprintf(stderr,"freetext: Error writing target file\n");
+	exit(1);
+    }
+    fprintf(tofp,"%-47.47s",last_word);
+    tgtRecords++;
+    
+	
+    /* add number of records to header */
+    if (fseek(tofp, (off_t)4,0)) {
+	fprintf(stderr,"freetext: Error seeking on target file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, &tgtRecords) == 0) {
+	fprintf(stderr,"freetext: Error writing to target file\n");
+	exit(1);
+    }
+
+    printf(" Records in target file %d\n", tgtRecords);
+    printf(" Records in hit file %d\n\n", hitRecords);
+
+    fclose(ifp);
+    fclose(hofp);
+    fclose(tofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/genbaccess1.c b/src/indexseqlibs/genbaccess1.c
new file mode 100644
index 0000000..e93df66
--- /dev/null
+++ b/src/indexseqlibs/genbaccess1.c
@@ -0,0 +1,121 @@
+/*              genbaccess1             */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 82     /* maximum input line length*/
+
+/*
+ * Program to create index for genbank library in embl cdrom form
+ *
+ * this one gets the accession numbers and entry names and writes them to a
+ * file this file is sorted on entryname, then the next program processes the
+ * sorted file to add a number to each record that has a new entryname. This
+ * file is then sorted on accession number. This file is then processed to
+ * add the number of hits for each accession numnber: the first occurrence
+ * for each number is given 1, the next 2 and so on. This file is then
+ * processed to produce the final accession number target and hit filesin
+ * binary with the header
+ */
+
+int parseACCESSION(FILE *fp, char *, char *);
+
+char patternENTRY[] = "LOCUS";
+char patternACCESSION[] = "ACCESSION";
+char patternCONTINUE[] =  "ACCESSION";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char eName[10];
+
+    long entries = 0;
+    int leftChar = 12;
+
+
+    printf("genbaccess1 Version 1.0\n");
+
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: genbaccess1.c filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"genbaccess1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"genbaccess1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    eName[0] = '\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL) {
+	
+	if (patmatch(line, patternENTRY)){
+
+	    if (*eName)
+		printf(" No accession lines for %-10.10s\n",eName);
+		
+	    strncpy(eName,&line[leftChar],10);
+	    entries++;
+
+	} else if (patmatch(line, patternACCESSION)){
+	    
+	    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+		printf(" Empty accession line !!\n");
+	    
+	    /* Assuming we wont run into an ENTRY line !!!!!!!!!!  */
+	    while (fgets(line,MAXLINE,ifp)!=NULL){
+		
+		if (patmatch(line, patternCONTINUE)){
+		    
+		    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+			printf(" Empty accession line !!\n");
+
+		} else
+		    break;
+	    }
+	    eName[0] = '\0';
+	    
+	}
+    }
+
+    printf(" Number of entries = %ld\n\n",entries); 
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
+int parseACCESSION(FILE *fp, char *line, char *eName)
+{
+    char *s, *t;
+    int entries;
+
+    entries = 0;
+    s = line;
+    while (*s) {
+	/* skip over white space etc */
+	for ( ; *s && !isalnum(*s) ; s++);
+
+	/* find end of accession number if there is one*/
+	if (*s) {
+	    entries++;
+	    for (t=s; isalnum(*s) ; s++);
+	    fprintf(fp, "%-10.10s %-10.*s\n",eName,s-t,t);
+	}
+
+    }
+
+    return entries;
+
+}	
diff --git a/src/indexseqlibs/genbaccession.script b/src/indexseqlibs/genbaccession.script
new file mode 100644
index 0000000..0f1ab16
--- /dev/null
+++ b/src/indexseqlibs/genbaccession.script
@@ -0,0 +1,72 @@
+#! /bin/csh -f
+#
+# script to make an accession number index for the genbank library
+#
+# NOTE only active for primate file
+#
+
+echo "genbaccession.script Version 1.1"
+echo ""
+
+#set RM = "/bin/rm -f"
+set RM = "echo /bin/rm -f"
+set SORT = "sort -T tmp"
+
+#
+# 
+#
+
+genbaccess1 gbpri.seq gbpri.list
+if ( $status ) goto abort
+genbaccess1 gbrod.seq gbrod.list
+if ( $status ) goto abort
+genbaccess1 gbmam.seq gbmam.list
+if ( $status ) goto abort
+genbaccess1 gbvrt.seq gbvrt.list
+if ( $status ) goto abort
+genbaccess1 gbinv.seq gbinv.list
+if ( $status ) goto abort
+genbaccess1 gbpln.seq gbpln.list
+if ( $status ) goto abort
+genbaccess1 gborg.seq gborg.list
+if ( $status ) goto abort
+genbaccess1 gbbct.seq gbbct.list
+if ( $status ) goto abort
+genbaccess1 gbrna.seq gbrna.list
+if ( $status ) goto abort
+genbaccess1 gbvrl.seq gbvrl.list
+if ( $status ) goto abort
+genbaccess1 gbphg.seq gbphg.list
+if ( $status ) goto abort
+genbaccess1 gbsyn.seq gbsyn.list
+if ( $status ) goto abort
+genbaccess1 gbuna.seq gbuna.list
+if ( $status ) goto abort
+
+${SORT} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list >! access.sorted
+if ( $status ) goto abort
+${RM} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list 
+
+access2 access.sorted access.entry
+if ( $status ) goto abort
+${RM} access.sorted
+
+${SORT} -b +1 access.entry >! access.sorted2
+if ( $status ) goto abort
+${RM} access.entry
+
+access4 access.sorted2 acnum.trg acnum.hit
+if ( $status ) goto abort
+${RM} access.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbaccession.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbauthor.script b/src/indexseqlibs/genbauthor.script
new file mode 100644
index 0000000..1a188ce
--- /dev/null
+++ b/src/indexseqlibs/genbauthor.script
@@ -0,0 +1,76 @@
+#! /bin/csh -f
+#
+# script to make author index for the genbank library
+#
+# NOTE only active for primate file
+#
+
+echo "genbauthor.script Version 1.0"
+
+set RM = "echo /bin/rm -f"
+set SORT = "sort -T tmp"
+
+#
+#
+#
+set WORDS = $STADENROOT/seqman/stopwords
+
+genbauthor gbpri.seq gbpri.list
+if ( $status ) goto abort
+genbauthor gbrod.seq gbrod.list
+if ( $status ) goto abort
+genbauthor gbmam.seq gbmam.list
+if ( $status ) goto abort
+genbauthor gbvrt.seq gbvrt.list
+if ( $status ) goto abort
+genbauthor gbinv.seq gbinv.list
+if ( $status ) goto abort
+genbauthor gbpln.seq gbpln.list
+if ( $status ) goto abort
+genbauthor gborg.seq gborg.list
+if ( $status ) goto abort
+genbauthor gbbct.seq gbbct.list
+if ( $status ) goto abort
+genbauthor gbrna.seq gbrna.list
+if ( $status ) goto abort
+genbauthor gbvrl.seq gbvrl.list
+if ( $status ) goto abort
+genbauthor gbphg.seq gbphg.list
+if ( $status ) goto abort
+genbauthor gbsyn.seq gbsyn.list
+if ( $status ) goto abort
+genbauthor gbuna.seq gbuna.list
+if ( $status ) goto abort
+
+${SORT} -u gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list >! author.sorted
+if ( $status ) goto abort
+${RM} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list 
+
+# sort on entry name
+#${SORT} -u gbpri.list >! author.sorted
+#if ( $status ) goto abort
+#${RM} gbpri.list
+
+# include entry numbers in the file
+freetext2 author.sorted author.entry
+if ( $status ) goto abort
+${RM} author.sorted
+
+# sort on text
+${SORT} -b +2 author.entry >! author.sorted2
+if ( $status ) goto abort
+${RM} author.entry
+
+# create the index
+hitNtrg author.sorted2 author.trg author.hit 35
+if ( $status ) goto abort
+${RM} author.sorted2
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbauthor.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbdivision.script b/src/indexseqlibs/genbdivision.script
new file mode 100644
index 0000000..e0f578c
--- /dev/null
+++ b/src/indexseqlibs/genbdivision.script
@@ -0,0 +1,46 @@
+#! /bin/csh -f
+#
+# script to create an division index for the genbank library in codata format
+#
+
+echo "genbdivision.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+
+#
+#
+#
+
+
+awk '{printf("%-12.12s %d\n",$1,$2);}' << EOF >! division.list
+gbbct.seq 1
+gbinv.seq 2
+gbmam.seq 3
+gborg.seq 4
+gbphg.seq 5
+gbpln.seq 6
+gbpri.seq 7
+gbrna.seq 8
+gbrod.seq 9
+gbsyn.seq 10
+gbuna.seq 11
+gbvrl.seq 12
+gbvrt.seq 13
+EOF
+
+division division.list division.lkp
+if ( $status ) goto abort
+${RM} division.list
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbdivision.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbentryname.script b/src/indexseqlibs/genbentryname.script
new file mode 100644
index 0000000..3619834
--- /dev/null
+++ b/src/indexseqlibs/genbentryname.script
@@ -0,0 +1,63 @@
+#! /bin/csh -f
+#
+# script to create an entryname index for the genbank library
+#
+# NOTE only active for primate file
+#
+
+echo "genbentryname.script Version 1.0"
+echo ""
+
+set RM = "echo /bin/rm -f"
+set SORT = "sort -T tmp"
+
+#
+#
+#
+
+genbentryname1 gbbct.seq gbbct.list 1
+if ( $status ) goto abort
+genbentryname1 gbinv.seq gbinv.list 2
+if ( $status ) goto abort
+genbentryname1 gbmam.seq gbmam.list 3
+if ( $status ) goto abort
+genbentryname1 gborg.seq gborg.list 4
+if ( $status ) goto abort
+genbentryname1 gbphg.seq gbphg.list 5
+if ( $status ) goto abort
+genbentryname1 gbpln.seq gbpln.list 6
+if ( $status ) goto abort
+genbentryname1 gbpri.seq gbpri.list 7
+if ( $status ) goto abort
+genbentryname1 gbrna.seq gbrna.list 8
+if ( $status ) goto abort
+genbentryname1 gbrod.seq gbrod.list 9
+if ( $status ) goto abort
+genbentryname1 gbsyn.seq gbsyn.list 10
+if ( $status ) goto abort
+genbentryname1 gbuna.seq gbuna.list 11
+if ( $status ) goto abort
+genbentryname1 gbvrl.seq gbvrl.list 12
+if ( $status ) goto abort
+genbentryname1 gbvrt.seq gbvrt.list 13
+if ( $status ) goto abort
+
+${SORT} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list >! entry.sorted
+if ( $status ) goto abort
+${RM} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list 
+
+entryname2 entry.sorted entrynam.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbentryname.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbentryname1.c b/src/indexseqlibs/genbentryname1.c
new file mode 100644
index 0000000..be7a230
--- /dev/null
+++ b/src/indexseqlibs/genbentryname1.c
@@ -0,0 +1,82 @@
+/*       genbentryname1          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for genbank library in embl cdrom form */
+/* this one gets the entry names and offsets and writes them to an ascii file 
+   this file is sorted, then the next program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "LOCUS";
+char patternSEQUENCE[] = "ORIGIN";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    long entry_offset = 0;
+    long sequence_offset = 0;
+    long entries_in = 0;
+    long entries_out = 0;
+    int leftChar = 12;
+
+
+    printf("genbentryname1 Version 1.0\n");
+    
+
+    if (argc != 4) {
+	fprintf(stderr,"Usage: genbentryname1 filein fileout division\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"genbentryname1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"genbentryname1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    entry_name[0]='\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL){
+
+	if (patmatch(line, patternENTRY)){
+
+	    if (*entry_name)
+		printf("No sequence for entry %-10.10s\n",entry_name);
+
+	    strncpy(entry_name,&line[leftChar],10);
+	    entry_offset = ftell(ifp) - strlen(line);
+	    entries_in++;
+
+	} else if (patmatch(line, patternSEQUENCE) && *entry_name) {
+
+	    fgets(line, MAXLINE, ifp);
+
+	    sequence_offset = ftell(ifp) - strlen(line) + 10;
+	    /* 10 = magic offset to first character of sequence */
+
+	    fprintf(ofp,"%-10.10s %10d %10d %-5.5s\n",entry_name,entry_offset,sequence_offset,argv[3]);
+	    entries_out++;
+	    entry_name[0]='\0';
+
+	}
+
+    }
+
+    printf(" Number of entries read = %d\n",entries_in); 
+    printf(" Number of entries written = %d\n\n",entries_out); 
+
+    fclose(ifp);
+    fclose(ofp);
+    return 0;
+}
+
diff --git a/src/indexseqlibs/genbfreetext.script b/src/indexseqlibs/genbfreetext.script
new file mode 100644
index 0000000..4631276
--- /dev/null
+++ b/src/indexseqlibs/genbfreetext.script
@@ -0,0 +1,81 @@
+#! /bin/csh -f
+#
+# script to make a free text index for the genbank library
+#
+# NOTE only active for primate file
+#
+
+echo "genbfreetext.script Version 1.0"
+echo ""
+
+set RM = "echo /bin/rm -f"
+set SORT = "sort -T tmp"
+set UNIQ = uniq
+
+#
+#
+#
+set WORDS = $STADENROOT/seqman/stopwords
+
+genbfreetext gbpri.seq gbpri.list
+if ( $status ) goto abort
+genbfreetext gbrod.seq gbrod.list
+if ( $status ) goto abort
+genbfreetext gbmam.seq gbmam.list
+if ( $status ) goto abort
+genbfreetext gbvrt.seq gbvrt.list
+if ( $status ) goto abort
+genbfreetext gbinv.seq gbinv.list
+if ( $status ) goto abort
+genbfreetext gbpln.seq gbpln.list
+if ( $status ) goto abort
+genbfreetext gborg.seq gborg.list
+if ( $status ) goto abort
+genbfreetext gbbct.seq gbbct.list
+if ( $status ) goto abort
+genbfreetext gbrna.seq gbrna.list
+if ( $status ) goto abort
+genbfreetext gbvrl.seq gbvrl.list
+if ( $status ) goto abort
+genbfreetext gbphg.seq gbphg.list
+if ( $status ) goto abort
+genbfreetext gbsyn.seq gbsyn.list
+if ( $status ) goto abort
+genbfreetext gbuna.seq gbuna.list
+if ( $status ) goto abort
+
+${SORT} -b +1 gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list | ${UNIQ} | excludewords ${WORDS} | ${SORT} >! freetext.sorted
+if ( $status ) goto abort
+${RM} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list 
+
+# sort on words, remove dumplicates, remove stopwords, resort on entry name
+#${SORT} -b +1 gbpri.list | ${UNIQ} | excludewords ${WORDS} | ${SORT} >! freetext.sorted
+#if ( $status ) goto abort
+#${RM} gbpri.list
+
+# include entry numbers in the file
+freetext2 freetext.sorted freetext.entry
+if ( $status ) goto abort
+${RM} freetext.sorted
+
+# sort on text
+${SORT} -b +2 freetext.entry >! freetext.sorted2
+if ( $status ) goto abort
+${RM} freetext.entry
+
+# create the index
+freetext4 freetext.sorted2 freetext.trg freetext.hit
+if ( $status ) goto abort
+${RM} freetext.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbfreetext.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbtitle.script b/src/indexseqlibs/genbtitle.script
new file mode 100644
index 0000000..8f5f139
--- /dev/null
+++ b/src/indexseqlibs/genbtitle.script
@@ -0,0 +1,68 @@
+#! /bin/csh -f
+#
+# script to make a title or brief index for the genbank library
+#
+# NOTE only active for primate file
+#
+
+echo "genbtitle.script Version 1.0"
+echo ""
+
+set RM = "echo /bin/rm -f"
+set SORT = "sort -T tmp"
+
+#
+#
+#
+
+genbtitle1 gbpri.seq gbpri.list
+if ( $status ) goto abort
+genbtitle1 gbrod.seq gbrod.list
+if ( $status ) goto abort
+genbtitle1 gbmam.seq gbmam.list
+if ( $status ) goto abort
+genbtitle1 gbvrt.seq gbvrt.list
+if ( $status ) goto abort
+genbtitle1 gbinv.seq gbinv.list
+if ( $status ) goto abort
+genbtitle1 gbpln.seq gbpln.list
+if ( $status ) goto abort
+genbtitle1 gborg.seq gborg.list
+if ( $status ) goto abort
+genbtitle1 gbbct.seq gbbct.list
+if ( $status ) goto abort
+genbtitle1 gbrna.seq gbrna.list
+if ( $status ) goto abort
+genbtitle1 gbvrl.seq gbvrl.list
+if ( $status ) goto abort
+genbtitle1 gbphg.seq gbphg.list
+if ( $status ) goto abort
+genbtitle1 gbsyn.seq gbsyn.list
+if ( $status ) goto abort
+genbtitle1 gbuna.seq gbuna.list
+if ( $status ) goto abort
+
+${SORT} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list >! title.sorted
+if ( $status ) goto abort
+${RM} gbpri.list gbrod.list gbmam.list gbvrt.list gbinv.list gbpln.list gborg.list gbbct.list gbrna.list gbvrl.list gbphg.list gbsyn.list gbuna.list 
+
+#${SORT} gbpri.list  >! title.sorted
+#if ( $status ) goto abort
+#${RM} gbpri.list
+
+
+title2 title.sorted brief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "genbtitle.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/genbtitle1.c b/src/indexseqlibs/genbtitle1.c
new file mode 100644
index 0000000..e5c472e
--- /dev/null
+++ b/src/indexseqlibs/genbtitle1.c
@@ -0,0 +1,104 @@
+/*          genbtitle1          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for genbank library in embl cdrom form */
+/* this one gets the entry names, titles, accession numbers and lengths
+   and writes them to an ascii file 
+   this file is sorted, then the next 
+   program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "LOCUS";
+char patternTITLE[] = "DEFINITION";
+char patternACCESSION[] = "ACCESSION";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    char accession_number[10];
+    unsigned long sequence_length;
+    char title[80];
+
+    int i;
+    int leftChar = 12;
+    long entries_in = 0;
+    long entries_out = 0;
+
+    
+    printf("genbtitle1 Version 1.1\n");
+     
+
+    if (argc != 3) {
+	fprintf(stderr,"Usage: genbtitle1 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"genbtitle1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"genbtitle1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    entry_name[0] = '\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL){
+
+	if (patmatch(line, patternENTRY)){
+	    if (*entry_name) {
+		/* let's output gathered values for last entry*/
+		fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+			entry_name,
+			accession_number,
+			sequence_length,
+			title);
+		entries_out++;
+	    }
+	    /* initialise values */
+	    entry_name[0] = '\0';
+	    accession_number[0] = '\0';
+	    title[0] = '\0';
+	    sequence_length = 0;
+
+	    entries_in++;
+	    strncpy(entry_name,&line[leftChar],10);
+	    sequence_length = atol(&line[22]);
+	} else if (patmatch(line, patternTITLE) && *title=='\0'){
+	    for(i=0;isprint(line[leftChar+i]) && i<80;i++) title[i]=line[leftChar+i];
+	    for(;i<80;i++) title[i] = ' ';
+	} else if (patmatch(line, patternACCESSION) && *accession_number=='\0'){
+	    for(i=0;isalnum(line[leftChar+i]) && i<10;i++) accession_number[i]=line[leftChar+i];
+	    for(;i<10;i++) accession_number[i] = ' ';
+	}
+    }
+    if (*entry_name) {
+	/* let's output gathered values for last entry*/
+	fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+		entry_name,
+		accession_number,
+		sequence_length,
+		title);
+	entries_out++;
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in);
+    printf(" Number of entries written = %ld\n\n",entries_out);
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
diff --git a/src/indexseqlibs/getEMBLupdates.script b/src/indexseqlibs/getEMBLupdates.script
new file mode 100644
index 0000000..ebb8c2e
--- /dev/null
+++ b/src/indexseqlibs/getEMBLupdates.script
@@ -0,0 +1,186 @@
+#! /bin/sh
+#
+#	This script should be run by cron, once a week
+#
+#	This script will use anonymous ftp to genbak.bio.net to
+#	retrieve the latest updates to the embl library
+#
+#	Updates appear to be made every Monday mid afternoon PST
+#	We should transfer them at 6am Tuesday
+#	The crontab file should look like:
+#
+#	    # SunOS
+#	    0 6 * * 2 /nfs/al/pubseq/pubseq/seqlibs/getEMBLupdates
+#
+#	    # Alliant Concentrix
+#	    0 6 * * 2 pubseq /nfs/al/pubseq/pubseq/seqlibs/getEMBLupdates
+#
+#
+
+#
+# Set defaults
+#
+PROGS=/nfs/al/pubseq/pubseq/inhouse/staden/src/indexseqlibs
+NEWDIR=embl-updates-new
+OLDDIR=embl-updates-old
+CURDIR=embl-updates
+FILE=emseq.all.Z
+SEQLIBS=/nfs/al/pubseq/pubseq/seqlibs
+RECIPIENT=pubseq
+# test
+#CURDIR=embl-updates-test
+#FILE=em1230.seq.Z
+
+#
+# report
+# for interactive mode use: REPORT=/dev/stty
+#
+REPORT=${SEQLIBS}/EMBLupdate.$$
+
+echo "getEMBLupdates Version 1.0" >> ${REPORT}
+echo Started: `date` >> ${REPORT}
+
+cd $SEQLIBS
+
+#
+# Remove old directory
+#
+if test -d $OLDDIR; then
+	echo Removing old directory $OLDDIR >> ${REPORT}
+	/bin/rm -r $OLDDIR
+fi
+
+#
+# Build new directory with all the stuff we need
+#
+if test -d $NEWDIR; then
+	echo Removing existing new directory $NEWDIR >> ${REPORT}
+	/bin/rm -r $NEWDIR
+fi
+mkdir $NEWDIR
+cd $NEWDIR
+
+#
+# Get files via ftp
+#
+echo FTP-ing latest updates >> ${REPORT}
+ftp -v 134.172.1.160 << EOF >> ${REPORT}
+cd pub/db/embl-newdata
+type binary
+get README
+get $FILE emseq.all.Z
+EOF
+
+#
+# Report errors during ftp
+#
+if test $? -eq 1; then
+	echo Error: Transferring files >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# Check files transferred
+#
+if test ! -f emseq.all.Z ; then
+	echo Error: File not transferred >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# Uncompress
+#
+echo Uncompressing update file >> ${REPORT}
+uncompress emseq.all.Z
+if test $? -eq 1; then
+	echo Error: Cannot uncompressing file >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# Create indices
+#
+
+#
+# entry name index
+#
+echo Building entryname index >> ${REPORT}
+emblentryname.script >> ${REPORT}
+if test $? -ne 0; then
+	echo Error: Failed to make entry name index >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# accession number index
+#
+echo Building accession number index >> ${REPORT}
+emblaccession.script >> ${REPORT}
+if test $? -ne 0; then
+	echo Error: Failed to make accession number index >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# free text index
+#
+echo Building free text index >> ${REPORT}
+emblfreetext.script >> ${REPORT}
+if test $? -ne 0; then
+	echo Error: Failed to make free text index >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# Author name index
+#
+echo Building author index >> ${REPORT}
+emblauthor.script >> ${REPORT}
+if test $? -ne 0; then
+	echo Error: Failed to make author index >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+#
+# brief title index
+#
+echo Building brief title index >> ${REPORT}
+embltitle.script >> ${REPORT}
+if test $? -ne 0; then
+	echo Error: Failed to brief title index >> ${REPORT}
+	mail ${RECIPIENT} < ${REPORT}
+	exit 1
+fi
+
+
+#
+# switch databases
+#
+echo Switching databases >> ${REPORT}
+cd $SEQLIBS
+if test -d $CURDIR; then
+	mv $CURDIR $OLDDIR
+fi
+mv $NEWDIR $CURDIR
+
+#
+# finish up
+#
+echo "" >> ${REPORT}
+echo "===========================README==========================" >> ${REPORT}
+cat $CURDIR/README >> ${REPORT}
+echo "" >> ${REPORT}
+echo "---------------------------README--------------------------" >> ${REPORT}
+echo Update successful >> ${REPORT}
+echo Finished: `date` >> ${REPORT}
+
+mail ${RECIPIENT} < ${REPORT}
+exit 0
+#
diff --git a/src/indexseqlibs/getstopwords.script b/src/indexseqlibs/getstopwords.script
new file mode 100644
index 0000000..1d4e9e4
--- /dev/null
+++ b/src/indexseqlibs/getstopwords.script
@@ -0,0 +1,22 @@
+#! /bin/csh -f
+#
+# Create a file stopwords which is a reformatted version of
+# the file indices/embl/stopwords.lst on the EMBL CDROM
+#
+# NOTES:
+#     The first 300 characters for stopwords.lst are a header
+#     The words are in blocked format. The block size is specified in
+#     the header but here we assume it is 12.
+#
+set header = 300
+@ start = $header + 1
+set block_size = 12
+set lst_file = /nfs/gans/cdrom/indices/stopwords.lst
+set out_file = stopwords
+
+# Save old copy of converted file
+if (-e ${outfile}) then
+	/bin/mv ${outfile} ${outfile}~
+endif
+
+tail +${start}c ${lst_file} | dd cbs=${block_size} conv=unblock >! ${out_file}
diff --git a/src/indexseqlibs/hitNtrg.c b/src/indexseqlibs/hitNtrg.c
new file mode 100644
index 0000000..a50afc7
--- /dev/null
+++ b/src/indexseqlibs/hitNtrg.c
@@ -0,0 +1,169 @@
+/*            hitNtrg            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+#include "mach-io.h"
+#include "cdromheader.h"
+
+#define MAXLINE 200 /* > 80 + 11 + 11 + 11 */
+
+/* program to read in a character-mode index and write it out in binary
+   with a header */
+
+static void cpy2nl(char *a, char *b)
+/*
+** Copy string b to a, up to first newline
+**
+** YUK! no longer needed
+*/
+{
+    for( ; *b && *b != '\n'; a++, b++) *a = *b;
+    *a = '\0';
+}
+
+
+static void blatnl(char *a)
+/*
+** Remove the newline
+*/
+{
+    for ( ; *a && *a != '\n' ; a++) ;
+    *a = '\0';
+}
+
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *tofp;
+    FILE *hofp;
+    uint_4 tgtRecords,hitRecords,hitStart,hits;
+    uint_2 tRecSize;
+    uint_2 hRecSize = 4;
+    uint_2 stringSize;
+    uint_4 entryNumber;
+    char last_word[MAXLINE];
+    char line[MAXLINE];
+
+
+    printf("hitNtrg Version 1.0\n");
+    
+
+    if (argc != 5) {
+	fprintf(stderr,"Usage: hitNtrg filein targetfileout hitfileout string_size\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"hitNtrg cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((tofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"hitNtrg cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    if ((hofp = fopen(argv[3],"w")) == NULL) {
+	fprintf(stderr,"hitNtrg cannot open output file %s\n",argv[3]);
+	exit(1);
+    }
+    if ((stringSize = atoi(argv[4])) == 0) {
+	fprintf(stderr,"hitNtrg string size should be greater than 0\n");
+	exit(1);
+    }
+    tRecSize = stringSize + (uint_2) sizeof(uint_4) * 2;
+    
+    if(cdromheader(tofp, tRecSize ) != 0 ) {
+	fprintf(stderr,"hitNtrg: Error writing target file header\n");
+        exit(1);
+    }
+    if(cdromheader(hofp, hRecSize ) != 0 ) {
+	fprintf(stderr,"hitNtrg: Error writing hit file header\n");
+        exit(1);
+    }
+
+    /* get the first line */
+    if (fgets(line,MAXLINE,ifp) == NULL) {
+	fprintf(stderr,"hitNtrg Error reading input file\n");
+	exit(1);
+    }
+    tgtRecords = 0;
+    hitRecords = 0;
+
+    /* get information */
+    cpy2nl(last_word,line+22);
+    hitStart = 1;
+    hits = 1;
+
+    /* for every record write the entryNumber to the hits file */
+    hitRecords++;
+    entryNumber = atoi(line+11);
+    if (le_write_int_4(hofp,&entryNumber) == 0) {
+	fprintf(stderr,"hitNtrg: Error writing hit file\n");
+	exit(1);
+    }
+
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+
+	/* if current word is different - update files */
+	blatnl(line+22);
+	if (strncmp(last_word,line+22,MAXLINE)!=0) {
+	    /* write to target file */
+	    if (le_write_int_4(tofp,&hits) == 0) {
+		fprintf(stderr,"hitNtrg: Error writing target file\n");
+		exit(1);
+	    }
+	    if (le_write_int_4(tofp,&hitStart) == 0) {
+		fprintf(stderr,"hitNtrg: Error writing target file\n");
+		exit(1);
+	    }
+	    fprintf(tofp,"%-*.*s",stringSize,stringSize,last_word);
+	    tgtRecords++;
+	    /* get information */
+	    cpy2nl(last_word,line+22);
+	    hitStart = hitRecords+1;
+	    hits = 0;
+	}
+	hits++;
+
+	/* for every record write the entryNumber to the hits file */
+	hitRecords++;
+	entryNumber = atoi(line+11);
+	if (le_write_int_4(hofp,&entryNumber) == 0) {
+	    fprintf(stderr,"hitNtrg: Error writing hit file\n");
+	    exit(1);
+	}
+    }
+
+    /* write the last target file record */
+    if (le_write_int_4(tofp,&hits) == 0) {
+	fprintf(stderr,"hitNtrg: Error writing target file\n");
+	exit(1);
+    }
+    if (le_write_int_4(tofp,&hitStart) == 0) {
+	fprintf(stderr,"hitNtrg: Error writing target file\n");
+	exit(1);
+    }
+    fprintf(tofp,"%-*.*s",stringSize,stringSize,last_word);
+    tgtRecords++;
+    
+	
+    /* add number of records to header */
+    if (fseek(tofp, (off_t)4,0)) {
+	fprintf(stderr,"hitNtrg: Error seeking on target file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, &tgtRecords) == 0) {
+	fprintf(stderr,"hitNtrg: Error writing to target file\n");
+	exit(1);
+    }
+
+    printf(" Records in target file %d\n", tgtRecords);
+    printf(" Records in hit file %d\n\n", hitRecords);
+
+    fclose(ifp);
+    fclose(hofp);
+    fclose(tofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/mach-io.c b/src/indexseqlibs/mach-io.c
new file mode 100644
index 0000000..9b78205
--- /dev/null
+++ b/src/indexseqlibs/mach-io.c
@@ -0,0 +1,238 @@
+/*
+** Machine independant io:
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value suggessfully read or written
+*/
+
+#include <stdio.h>
+#include "mach-io.h"
+
+
+
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+int be_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a big-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[0] = (uint_1) (*i2>>8)&255;
+    buf[1] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(*i4)];
+
+
+    buf[0] = (uint_1) (*i4>>24)&255;
+    buf[1] = (uint_1) (*i4>>16)&255;
+    buf[2] = (uint_1) (*i4>>8)&255;
+    buf[3] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a big-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[1]) +
+         ((uint_2)buf[0]<<8));
+    return (1);
+}
+
+
+
+
+
+int be_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[3]) +
+         ((uint_4)buf[2]<<8) +
+         ((uint_4)buf[1]<<16) +
+         ((uint_4)buf[0]<<24));
+    return (1);
+}
+
+
+
+
+
+
+
+
+
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+int le_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a little-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[1] = (uint_1) (*i2>>8)&255;
+    buf[0] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    buf[3] = (uint_1) (*i4>>24)&255;
+    buf[2] = (uint_1) (*i4>>16)&255;
+    buf[1] = (uint_1) (*i4>>8)&255;
+    buf[0] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a little-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[0]) +
+         ((uint_2)buf[1]<<8));
+    return (1);
+}
+
+
+
+
+
+int le_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[0]) +
+         ((uint_4)buf[1]<<8) +
+         ((uint_4)buf[2]<<16) +
+         ((uint_4)buf[3]<<24));
+    return (1);
+}
diff --git a/src/indexseqlibs/mach-io.h b/src/indexseqlibs/mach-io.h
new file mode 100644
index 0000000..5783df3
--- /dev/null
+++ b/src/indexseqlibs/mach-io.h
@@ -0,0 +1,115 @@
+#ifndef _mach_io_h
+#define _mach_io_h
+/*
+** Machine independant io
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value successfully read or written
+*/
+
+#include <stdio.h>
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+extern int be_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a big-endian int_1
+*/
+
+extern
+int be_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a big-endian int_2
+*/
+
+extern
+int be_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a big-endian int_4
+*/
+
+extern
+int be_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a big-endian int_1
+*/
+
+extern
+int be_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a big-endian int_2
+*/
+
+extern
+int be_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a big-endian int_4
+*/
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+extern
+int le_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a little-endian int_1
+*/
+
+extern
+int le_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a little-endian int_2
+*/
+
+extern
+int le_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a little-endian int_4
+*/
+
+extern
+int le_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a little-endian int_1
+*/
+
+extern
+int le_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a little-endian int_2
+*/
+
+extern
+int le_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a little-endian int_4
+*/
+
+
+#endif /* _mach_io_h */
diff --git a/src/indexseqlibs/makefile-alpha b/src/indexseqlibs/makefile-alpha
new file mode 100644
index 0000000..f150f63
--- /dev/null
+++ b/src/indexseqlibs/makefile-alpha
@@ -0,0 +1,171 @@
+#
+# indexseqlibs - Alpha version
+#
+
+
+#
+# Programs need making
+#
+PROGS = \
+	addnl \
+	genbentryname1 \
+	entryname2 \
+	access4 \
+	access2 \
+	genbaccess1 \
+	title2 \
+	genbtitle1 \
+	emblentryname1 \
+	emblaccess1 \
+	embltitle1 \
+	pirentryname1 \
+	piraccess1 \
+	piraccess2 \
+	pirtitle1 \
+	pirtitle2 \
+	excludewords \
+	emblfreetext \
+	genbfreetext \
+	pirfreetext \
+	freetext2 \
+	freetext4 \
+	emblauthor \
+	genbauthor \
+	pirauthor \
+	hitNtrg \
+	division
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Alpha
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact
+LD = cc
+LDFLAGS = -g -non_shared
+
+#
+# Shared objects
+#
+OBJS = \
+	cdromheader.o \
+	mach-io.o
+
+#
+# Dependencies for programs
+#
+addnl : addnl.o
+	$(LD) $(LDFLAGS) -o $@ addnl.o
+
+genbentryname1 : genbentryname1.o
+	$(LD) $(LDFLAGS) -o $@ genbentryname1.o
+
+entryname2 : entryname2.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ entryname2.o $(OBJS)
+
+access4 : access4.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ access4.o $(OBJS)
+
+access2 : access2.o
+	$(LD) $(LDFLAGS) -o $@ access2.o
+
+genbaccess1 : genbaccess1.o
+	$(LD) $(LDFLAGS) -o $@ genbaccess1.o
+
+title2 : title2.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ title2.o $(OBJS)
+
+genbtitle1 : genbtitle1.o
+	$(LD) $(LDFLAGS) -o $@ genbtitle1.o
+
+emblentryname1 : emblentryname1.o
+	$(LD) $(LDFLAGS) -o $@ emblentryname1.o
+
+emblaccess1 : emblaccess1.o
+	$(LD) $(LDFLAGS) -o $@ emblaccess1.o
+
+embltitle1 : embltitle1.o
+	$(LD) $(LDFLAGS) -o $@ embltitle1.o
+
+pirentryname1 : pirentryname1.o
+	$(LD) $(LDFLAGS) -o $@ pirentryname1.o
+
+piraccess1 : piraccess1.o
+	$(LD) $(LDFLAGS) -o $@ piraccess1.o
+
+piraccess2 : piraccess2.o
+	$(LD) $(LDFLAGS) -o $@ piraccess2.o
+
+pirtitle1 : pirtitle1.o
+	$(LD) $(LDFLAGS) -o $@ pirtitle1.o
+
+pirtitle2 : pirtitle2.o
+	$(LD) $(LDFLAGS) -o $@ pirtitle2.o
+
+excludewords : excludewords.o
+	$(LD) $(LDFLAGS) -o $@ excludewords.o
+
+emblfreetext.o: $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -DEMBL -c -o $@ $<
+emblfreetext : emblfreetext.o
+	$(LD) $(LDFLAGS) -o $@ emblfreetext.o
+
+genbfreetext.o: $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -DGENBANK -c -o $@ $<
+genbfreetext : genbfreetext.o
+	$(LD) $(LDFLAGS) -o $@ genbfreetext.o
+
+pirfreetext.o: $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -DPIR -c -o $@ $<
+pirfreetext : pirfreetext.o
+	$(LD) $(LDFLAGS) -o $@ pirfreetext.o
+
+freetext2 : freetext2.o
+	$(LD) $(LDFLAGS) -o $@ freetext2.o
+
+freetext4 : freetext4.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ freetext4.o $(OBJS)
+
+emblauthor.o : $(SRC)/author.c
+	$(CC) $(CFLAGS) -DEMBL -c -o $@ $<
+emblauthor : emblauthor.o
+	$(LD) $(LDFLAGS) -o $@ emblauthor.o
+
+genbauthor.o : $(SRC)/author.c
+	$(CC) $(CFLAGS) -DGENBANK -c -o $@ $<
+genbauthor : genbauthor.o
+	$(LD) $(LDFLAGS) -o $@ genbauthor.o
+
+pirauthor.o : $(SRC)/author.c
+	$(CC) $(CFLAGS) -DPIR -c -o $@ $<
+pirauthor : pirauthor.o
+	$(LD) $(LDFLAGS) -o $@ pirauthor.o
+
+hitNtrg : hitNtrg.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ hitNtrg.o $(OBJS)
+
+division : division.o $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ division.o $(OBJS)
+
+
+
+
+#
+# Special dependencies
+#
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+all : $(PROGS)
+
+clean :
+	/bin/rm -f $(PROGS) *.o *~
+
+
diff --git a/src/indexseqlibs/makefile-dec b/src/indexseqlibs/makefile-dec
new file mode 100644
index 0000000..bc8be43
--- /dev/null
+++ b/src/indexseqlibs/makefile-dec
@@ -0,0 +1,156 @@
+#
+# indexseqlibs - DEC Ultrix (mips) fork
+#
+
+#
+# Programs need making
+#
+PROGS = \
+	addnl \
+	genbentryname1 \
+	entryname2 \
+	access4 \
+	access2 \
+	genbaccess1 \
+	title2 \
+	genbtitle1 \
+	emblentryname1 \
+	emblaccess1 \
+	embltitle1 \
+	pirentryname1 \
+	piraccess1 \
+	piraccess2 \
+	pirtitle1 \
+	pirtitle2 \
+	excludewords \
+	emblfreetext \
+	genbfreetext \
+	pirfreetext \
+	freetext2 \
+	freetext4 \
+	emblauthor \
+	genbauthor \
+	pirauthor \
+	hitNtrg \
+	division
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# DEC
+#
+CC = c89
+CFLAGS = -g -std -w0
+
+#
+# Shared objects
+#
+OBJS = \
+	cdromheader.o \
+	mach-io.o
+
+#
+# Dependencies for programs
+#
+cdromheader.o: $(SRC)/cdromheader.c
+	$(CC) $(CFLAGS) -c $(SRC)/cdromheader.c
+
+mach-io.o: $(SRC)/mach-io.c
+	$(CC) $(CFLAGS) -c $(SRC)/mach-io.c
+
+addnl : $(SRC)/addnl.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/addnl.c
+
+genbentryname1 : $(SRC)/genbentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbentryname1.c
+
+entryname2 : $(SRC)/entryname2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/entryname2.c $(OBJS)
+
+access4 : $(SRC)/access4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access4.c $(OBJS)
+
+access2 : $(SRC)/access2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access2.c
+
+genbaccess1 : $(SRC)/genbaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbaccess1.c
+
+title2 : $(SRC)/title2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/title2.c $(OBJS)
+
+genbtitle1 : $(SRC)/genbtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbtitle1.c
+
+emblentryname1 : $(SRC)/emblentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblentryname1.c
+
+emblaccess1 : $(SRC)/emblaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblaccess1.c
+
+embltitle1 : $(SRC)/embltitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/embltitle1.c
+
+pirentryname1 : $(SRC)/pirentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirentryname1.c
+
+piraccess1 : $(SRC)/piraccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess1.c
+
+piraccess2 : $(SRC)/piraccess2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess2.c
+
+pirtitle1 : $(SRC)/pirtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle1.c
+
+pirtitle2 : $(SRC)/pirtitle2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle2.c
+
+excludewords : $(SRC)/excludewords.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/excludewords.c
+
+emblfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/freetext.c
+
+genbfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/freetext.c
+
+pirfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/freetext.c
+
+freetext2 : $(SRC)/freetext2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext2.c
+
+freetext4 : $(SRC)/freetext4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext4.c $(OBJS)
+
+emblauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/author.c
+
+pirauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/author.c
+
+genbauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/author.c
+
+hitNtrg : $(SRC)/hitNtrg.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/hitNtrg.c $(OBJS)
+
+division : $(SRC)/division.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/division.c $(OBJS)
+
+
+
+
+#
+# Special dependencies
+#
+all : $(PROGS)
+
+clean :
+	/bin/rm -f $(PROGS) *.o *~
+
+
diff --git a/src/indexseqlibs/makefile-sgi b/src/indexseqlibs/makefile-sgi
new file mode 100644
index 0000000..26248b4
--- /dev/null
+++ b/src/indexseqlibs/makefile-sgi
@@ -0,0 +1,158 @@
+#
+# indexseqlibs - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs need making
+#
+PROGS = \
+	addnl \
+	genbentryname1 \
+	entryname2 \
+	access4 \
+	access2 \
+	genbaccess1 \
+	title2 \
+	genbtitle1 \
+	emblentryname1 \
+	emblaccess1 \
+	embltitle1 \
+	pirentryname1 \
+	piraccess1 \
+	piraccess2 \
+	pirtitle1 \
+	pirtitle2 \
+	excludewords \
+	emblfreetext \
+	genbfreetext \
+	pirfreetext \
+	freetext2 \
+	freetext4 \
+	emblauthor \
+	genbauthor \
+	pirauthor \
+	hitNtrg \
+	division
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sun
+#
+CC = cc
+CFLAGS = -g -ansi -prototypes
+
+#
+# Shared objects
+#
+OBJS = \
+	cdromheader.o \
+	mach-io.o
+
+#
+# Dependencies for programs
+#
+cdromheader.o: $(SRC)/cdromheader.c
+	$(CC) $(CFLAGS) -c $(SRC)/cdromheader.c
+
+mach-io.o: $(SRC)/mach-io.c
+	$(CC) $(CFLAGS) -c $(SRC)/mach-io.c
+
+addnl : $(SRC)/addnl.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/addnl.c
+
+genbentryname1 : $(SRC)/genbentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbentryname1.c
+
+entryname2 : $(SRC)/entryname2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/entryname2.c $(OBJS)
+
+access4 : $(SRC)/access4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access4.c $(OBJS)
+
+access2 : $(SRC)/access2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access2.c
+
+genbaccess1 : $(SRC)/genbaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbaccess1.c
+
+title2 : $(SRC)/title2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/title2.c $(OBJS)
+
+genbtitle1 : $(SRC)/genbtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbtitle1.c
+
+emblentryname1 : $(SRC)/emblentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblentryname1.c
+
+emblaccess1 : $(SRC)/emblaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblaccess1.c
+
+embltitle1 : $(SRC)/embltitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/embltitle1.c
+
+pirentryname1 : $(SRC)/pirentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirentryname1.c
+
+piraccess1 : $(SRC)/piraccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess1.c
+
+piraccess2 : $(SRC)/piraccess2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess2.c
+
+pirtitle1 : $(SRC)/pirtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle1.c
+
+pirtitle2 : $(SRC)/pirtitle2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle2.c
+
+excludewords : $(SRC)/excludewords.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/excludewords.c
+
+emblfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/freetext.c
+
+genbfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/freetext.c
+
+pirfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/freetext.c
+
+freetext2 : $(SRC)/freetext2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext2.c
+
+freetext4 : $(SRC)/freetext4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext4.c $(OBJS)
+
+emblauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/author.c
+
+pirauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/author.c
+
+genbauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/author.c
+
+hitNtrg : $(SRC)/hitNtrg.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/hitNtrg.c $(OBJS)
+
+division : $(SRC)/division.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/division.c $(OBJS)
+
+
+
+
+#
+# Special dependencies
+#
+all : $(PROGS)
+
+clean :
+	/bin/rm -f $(PROGS) *.o *~
+
+
diff --git a/src/indexseqlibs/makefile-solaris b/src/indexseqlibs/makefile-solaris
new file mode 100644
index 0000000..7e4abce
--- /dev/null
+++ b/src/indexseqlibs/makefile-solaris
@@ -0,0 +1,158 @@
+#
+# indexseqlibs - SunOS (sparc) fork
+#
+
+#
+# Programs need making
+#
+PROGS = \
+	addnl \
+	genbentryname1 \
+	entryname2 \
+	access4 \
+	access2 \
+	genbaccess1 \
+	title2 \
+	genbtitle1 \
+	emblentryname1 \
+	emblaccess1 \
+	embltitle1 \
+	pirentryname1 \
+	piraccess1 \
+	piraccess2 \
+	pirtitle1 \
+	pirtitle2 \
+	excludewords \
+	emblfreetext \
+	genbfreetext \
+	pirfreetext \
+	freetext2 \
+	freetext4 \
+	emblauthor \
+	genbauthor \
+	pirauthor \
+	hitNtrg \
+	division
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS)
+
+#
+# Shared objects
+#
+OBJS = \
+	cdromheader.o \
+	mach-io.o
+
+#
+# Dependencies for programs
+#
+cdromheader.o: $(SRC)/cdromheader.c
+	$(CC) $(CFLAGS) -c $(SRC)/cdromheader.c
+
+mach-io.o: $(SRC)/mach-io.c
+	$(CC) $(CFLAGS) -c $(SRC)/mach-io.c
+
+addnl : $(SRC)/addnl.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/addnl.c
+
+genbentryname1 : $(SRC)/genbentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbentryname1.c
+
+entryname2 : $(SRC)/entryname2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/entryname2.c $(OBJS)
+
+access4 : $(SRC)/access4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access4.c $(OBJS)
+
+access2 : $(SRC)/access2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access2.c
+
+genbaccess1 : $(SRC)/genbaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbaccess1.c
+
+title2 : $(SRC)/title2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/title2.c $(OBJS)
+
+genbtitle1 : $(SRC)/genbtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbtitle1.c
+
+emblentryname1 : $(SRC)/emblentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblentryname1.c
+
+emblaccess1 : $(SRC)/emblaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblaccess1.c
+
+embltitle1 : $(SRC)/embltitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/embltitle1.c
+
+pirentryname1 : $(SRC)/pirentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirentryname1.c
+
+piraccess1 : $(SRC)/piraccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess1.c
+
+piraccess2 : $(SRC)/piraccess2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess2.c
+
+pirtitle1 : $(SRC)/pirtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle1.c
+
+pirtitle2 : $(SRC)/pirtitle2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle2.c
+
+excludewords : $(SRC)/excludewords.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/excludewords.c
+
+emblfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/freetext.c
+
+genbfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/freetext.c
+
+pirfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/freetext.c
+
+freetext2 : $(SRC)/freetext2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext2.c
+
+freetext4 : $(SRC)/freetext4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext4.c $(OBJS)
+
+emblauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/author.c
+
+pirauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/author.c
+
+genbauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/author.c
+
+hitNtrg : $(SRC)/hitNtrg.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/hitNtrg.c $(OBJS)
+
+division : $(SRC)/division.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/division.c $(OBJS)
+
+
+
+
+#
+# Special dependencies
+#
+all : $(PROGS)
+
+clean :
+	/bin/rm -f $(PROGS) *.o *~
+
+
diff --git a/src/indexseqlibs/makefile-sun b/src/indexseqlibs/makefile-sun
new file mode 100644
index 0000000..7e4abce
--- /dev/null
+++ b/src/indexseqlibs/makefile-sun
@@ -0,0 +1,158 @@
+#
+# indexseqlibs - SunOS (sparc) fork
+#
+
+#
+# Programs need making
+#
+PROGS = \
+	addnl \
+	genbentryname1 \
+	entryname2 \
+	access4 \
+	access2 \
+	genbaccess1 \
+	title2 \
+	genbtitle1 \
+	emblentryname1 \
+	emblaccess1 \
+	embltitle1 \
+	pirentryname1 \
+	piraccess1 \
+	piraccess2 \
+	pirtitle1 \
+	pirtitle2 \
+	excludewords \
+	emblfreetext \
+	genbfreetext \
+	pirfreetext \
+	freetext2 \
+	freetext4 \
+	emblauthor \
+	genbauthor \
+	pirauthor \
+	hitNtrg \
+	division
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Sun
+#
+CC = gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS = -g -ansi -pedantic $(GCCWARNINGS)
+
+#
+# Shared objects
+#
+OBJS = \
+	cdromheader.o \
+	mach-io.o
+
+#
+# Dependencies for programs
+#
+cdromheader.o: $(SRC)/cdromheader.c
+	$(CC) $(CFLAGS) -c $(SRC)/cdromheader.c
+
+mach-io.o: $(SRC)/mach-io.c
+	$(CC) $(CFLAGS) -c $(SRC)/mach-io.c
+
+addnl : $(SRC)/addnl.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/addnl.c
+
+genbentryname1 : $(SRC)/genbentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbentryname1.c
+
+entryname2 : $(SRC)/entryname2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/entryname2.c $(OBJS)
+
+access4 : $(SRC)/access4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access4.c $(OBJS)
+
+access2 : $(SRC)/access2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/access2.c
+
+genbaccess1 : $(SRC)/genbaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbaccess1.c
+
+title2 : $(SRC)/title2.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/title2.c $(OBJS)
+
+genbtitle1 : $(SRC)/genbtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/genbtitle1.c
+
+emblentryname1 : $(SRC)/emblentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblentryname1.c
+
+emblaccess1 : $(SRC)/emblaccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/emblaccess1.c
+
+embltitle1 : $(SRC)/embltitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/embltitle1.c
+
+pirentryname1 : $(SRC)/pirentryname1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirentryname1.c
+
+piraccess1 : $(SRC)/piraccess1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess1.c
+
+piraccess2 : $(SRC)/piraccess2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/piraccess2.c
+
+pirtitle1 : $(SRC)/pirtitle1.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle1.c
+
+pirtitle2 : $(SRC)/pirtitle2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/pirtitle2.c
+
+excludewords : $(SRC)/excludewords.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/excludewords.c
+
+emblfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/freetext.c
+
+genbfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/freetext.c
+
+pirfreetext : $(SRC)/freetext.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/freetext.c
+
+freetext2 : $(SRC)/freetext2.c
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext2.c
+
+freetext4 : $(SRC)/freetext4.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/freetext4.c $(OBJS)
+
+emblauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DEMBL $(SRC)/author.c
+
+pirauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DPIR $(SRC)/author.c
+
+genbauthor : $(SRC)/author.c
+	$(CC) $(CFLAGS) -o $@ -DGENBANK $(SRC)/author.c
+
+hitNtrg : $(SRC)/hitNtrg.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/hitNtrg.c $(OBJS)
+
+division : $(SRC)/division.c $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(SRC)/division.c $(OBJS)
+
+
+
+
+#
+# Special dependencies
+#
+all : $(PROGS)
+
+clean :
+	/bin/rm -f $(PROGS) *.o *~
+
+
diff --git a/src/indexseqlibs/makenrl3d.script b/src/indexseqlibs/makenrl3d.script
new file mode 100644
index 0000000..f6d4fcc
--- /dev/null
+++ b/src/indexseqlibs/makenrl3d.script
@@ -0,0 +1,49 @@
+#! /bin/csh -f
+#
+# script to make entryname and brief indexes for nrl3d
+#
+
+echo "makenrl3d.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE = nrl3d.dat
+
+pirentryname1 $LIBFILE entry.list 1
+if ( $status ) goto abort
+
+${SORT} entry.list >! entry.sorted
+if ( $status ) goto abort
+${RM} entry.list
+
+entryname2 entry.sorted nrl3d.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+pirtitle2 $LIBFILE title.list
+if ( $status ) goto abort
+
+${SORT} title.list >! title.sorted
+if ( $status ) goto abort
+${RM} title.list
+
+title2 title.sorted nrl3dbrief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "makenrl3d.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/piraccess1.c b/src/indexseqlibs/piraccess1.c
new file mode 100644
index 0000000..adf93f7
--- /dev/null
+++ b/src/indexseqlibs/piraccess1.c
@@ -0,0 +1,118 @@
+/*       piraccess1           */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 82     /* maximum input line length*/
+
+/* Program to create index for pir library in embl cdrom form
+ *
+ * This one gets the accession numbers and entry names and writes them to a
+ * file this file is sorted on entryname, then the next program processes the
+ * sorted file to add a number to each record that has a new entryname. This
+ * file is then sorted on accession number. This file is then processed to add
+ * the number of hits for each accession numnber: the first occurrence for
+ * each number is given 1, the next 2 and so on. This file is then processed
+ * to produce the final accession number target and hit filesin binary with
+ * the header
+ */
+
+int parseACCESSION(FILE *fp, char *, char *);
+
+char patternENTRY[] = "ENTRY";
+char patternACCESSION[] = "ACCESSION";
+char patternCONTINUE[] =  "         ";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char eName[10];
+
+    long entries = 0;
+    int leftChar = 16;
+    
+
+    printf("piraccess1 Version 1.0\n");
+    
+
+    if (argc != 3) {
+	fprintf(stderr,"Usage: piraccess1 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"piraccess1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"piraccess1: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    eName[0] = '\0';
+    while (fgets(line, MAXLINE,ifp)!=NULL) {
+	
+	if (patmatch(line, patternENTRY)){
+	    
+	    if (*eName)
+		printf(" No accession lines for %-10.10s\n",eName);
+
+	    strncpy(eName,&line[leftChar],10);
+	    entries++;
+
+	} else if (patmatch(line, patternACCESSION)){
+	    
+	    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+		printf(" Empty accession line !!\n");
+	    
+	    /* Assuming we wont run into an ENTRY line !!!!!!!!!!  */
+	    while (fgets(line, MAXLINE,ifp)!=NULL) {
+		
+		if (patmatch(line, patternCONTINUE)){
+		    
+		    if (parseACCESSION( ofp, &line[leftChar], eName) == 0)
+			printf(" Empty accession line !!\n");
+		} else
+		    break;
+	    }
+	    eName[0] = '\0';
+	}
+    }
+
+    printf(" Number of entries = %ld\n\n",entries); 
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
+
+
+int parseACCESSION(FILE *fp, char *line, char *eName)
+{
+    char *s, *t;
+    int entries;
+
+    entries = 0;
+    s = line;
+    while (*s) {
+	/* skip over white space etc */
+	for ( ; *s && !isalnum(*s) ; s++);
+
+	/* find end of accession number if there is one*/
+	if (*s) {
+	    entries++;
+	    for (t=s; isalnum(*s) ; s++);
+	    fprintf(fp, "%-10.10s %-10.*s\n",eName,s-t,t);
+	}
+
+    }
+
+    return entries;
+
+}	
diff --git a/src/indexseqlibs/piraccess2.c b/src/indexseqlibs/piraccess2.c
new file mode 100644
index 0000000..800dcf5
--- /dev/null
+++ b/src/indexseqlibs/piraccess2.c
@@ -0,0 +1,57 @@
+/*      piraccess2          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for pir library in embl cdrom form */
+/* this one gets the entry names and offsets and writes them to an ascii file 
+   this file is sorted, then the next program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "ENTRY";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    long entries = 0;
+    int leftChar = 16;
+
+
+    printf("piraccess2 Version 1.0\n");
+
+
+    if (argc != 3) {
+	fprintf(stderr,"Usage: piraccess2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"piraccess2: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"piraccess2: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    while (fgets(line, MAXLINE,ifp)!=NULL) {
+	
+	if (patmatch(line, patternENTRY)){
+	    fprintf(ofp, "%-10.10s %-10.10s\n",&line[leftChar],&line[leftChar]);
+	    entries++;
+	}
+    }
+
+    printf(" Number of entries = %ld\n\n",entries);
+
+    fclose(ifp);
+    fclose(ofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/piraccession.script b/src/indexseqlibs/piraccession.script
new file mode 100644
index 0000000..46dde36
--- /dev/null
+++ b/src/indexseqlibs/piraccession.script
@@ -0,0 +1,53 @@
+#! /bin/csh -f
+#
+# script to make an accession number index for the pir library in codata format
+#
+
+echo "piraccession.script Version 1.1"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+
+piraccess1 $LIBFILE1 pir1.list
+if ( $status ) goto abort
+piraccess1 $LIBFILE2 pir2.list
+if ( $status ) goto abort
+piraccess2 $LIBFILE3 pir3.list
+if ( $status ) goto abort
+
+${SORT} pir1.list pir2.list pir3.list >! access.sorted
+if ( $status ) goto abort
+${RM} pir1.list pir2.list pir3.list
+
+
+access2 access.sorted access.entry
+if ( $status ) goto abort
+${RM} access.sorted
+
+${SORT} -b +1 access.entry >! access.sorted2
+if ( $status ) goto abort
+${RM} access.entry
+
+access4 access.sorted2 acnum.trg acnum.hit
+if ( $status ) goto abort
+${RM} access.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "piraccession.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirauthor.script b/src/indexseqlibs/pirauthor.script
new file mode 100644
index 0000000..c361665
--- /dev/null
+++ b/src/indexseqlibs/pirauthor.script
@@ -0,0 +1,52 @@
+#! /bin/csh -f
+#
+# script to make an author index for the pir library
+#
+
+echo "pirauthor.script Version 1.0"
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+
+pirauthor $LIBFILE1 pir1.list
+if ( $status ) goto abort
+pirauthor $LIBFILE2 pir2.list
+if ( $status ) goto abort
+pirauthor $LIBFILE3 pir3.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} -u pir1.list pir2.list pir3.list >! author.sorted
+if ( $status ) goto abort
+${RM} pir1.list pir2.list pir3.list
+
+# include entry numbers in the file
+freetext2 author.sorted author.entry
+if ( $status ) goto abort
+${RM} author.sorted
+
+# sort on text
+${SORT} -b +2 author.entry >! author.sorted2
+if ( $status ) goto abort
+${RM} author.entry
+
+# create the index
+hitNtrg author.sorted2 author.trg author.hit 35
+if ( $status ) goto abort
+${RM} author.sorted2
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "pirauthor.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirdivision.script b/src/indexseqlibs/pirdivision.script
new file mode 100644
index 0000000..e9cbe50
--- /dev/null
+++ b/src/indexseqlibs/pirdivision.script
@@ -0,0 +1,39 @@
+#! /bin/csh -f
+#
+# script to create an division index for the pir library in codata format
+#
+
+echo "pirdivision.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+
+
+awk '{printf("%-12.12s %d\n",$1,$2);}' << EOF >! division.list
+$LIBFILE1 1
+$LIBFILE2 2
+$LIBFILE3 3
+EOF
+
+division division.list division.lkp
+if ( $status ) goto abort
+${RM} division.list
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "pirdivision.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirentryname.script b/src/indexseqlibs/pirentryname.script
new file mode 100644
index 0000000..5362270
--- /dev/null
+++ b/src/indexseqlibs/pirentryname.script
@@ -0,0 +1,44 @@
+#! /bin/csh -f
+#
+# script to create an entryname index for the pir library in codata format
+#
+
+echo "pirentryname.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+
+pirentryname1 $LIBFILE1 pir1.list 1
+if ( $status ) goto abort
+pirentryname1 $LIBFILE2 pir2.list 2
+if ( $status ) goto abort
+pirentryname1 $LIBFILE3 pir3.list 3
+if ( $status ) goto abort
+
+${SORT} pir1.list pir2.list pir3.list >! entry.sorted
+if ( $status ) goto abort
+${RM} pir1.list pir2.list pir3.list
+
+entryname2 entry.sorted entrynam.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "pirentryname.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirentryname1.c b/src/indexseqlibs/pirentryname1.c
new file mode 100644
index 0000000..2538c1e
--- /dev/null
+++ b/src/indexseqlibs/pirentryname1.c
@@ -0,0 +1,84 @@
+/*           pirentryname1          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create index for pir library in embl cdrom form */
+/* this one gets the entry names and offsets and writes them to an ascii file 
+   this file is sorted, then the next program converts the sorted file to binary
+   and adds the header */
+
+char patternENTRY[] = "ENTRY";
+char patternSEQUENCE[] = "SEQUENCE";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    off_t entry_offset = 0;
+    off_t sequence_offset = 0;
+    long entries_in = 0;
+    long entries_out = 0;
+    int leftChar = 16;
+    
+
+    printf("pirentryname1 Version 1.0\n");
+
+
+    if (argc != 4) {
+	fprintf(stderr,"Usage: pirentryname1 filein fileout division\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"pirentryname1: cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"pirentryname1: cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    entry_name[0]='\0';
+    while (fgets(line,MAXLINE,ifp)!=NULL){
+
+	if (patmatch(line, patternENTRY)){
+
+	    if (*entry_name)
+		printf("No sequence for entry %-10.10s\n",entry_name);
+
+	    strncpy(entry_name,&line[leftChar],10);
+	    entry_offset = ftell(ifp) - strlen(line);
+	    entries_in++;
+
+	} else if (patmatch(line, patternSEQUENCE) && *entry_name) {
+
+	    fgets(line, MAXLINE, ifp);
+	    fgets(line, MAXLINE, ifp);
+
+	    sequence_offset = ftell(ifp) - strlen(line) + 8;
+	    /* 8 = magic offset to first character of sequence */
+
+	    fprintf(ofp,"%-10.10s %10ld %10ld %-5.5s\n",entry_name,entry_offset,sequence_offset,argv[3]);
+	    entries_out++;
+	    entry_name[0]='\0';
+
+	}
+
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in); 
+    printf(" Number of entries written = %ld\n\n",entries_out); 
+
+    fclose(ifp);
+    fclose(ofp);
+
+    return 0;
+}
diff --git a/src/indexseqlibs/pirfreetext.script b/src/indexseqlibs/pirfreetext.script
new file mode 100644
index 0000000..7df13bc
--- /dev/null
+++ b/src/indexseqlibs/pirfreetext.script
@@ -0,0 +1,69 @@
+#! /bin/csh -f
+#
+# script to make a free text index for the pir library
+#
+
+echo "pirfreetext.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+set UNIQ = uniq
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+set WORDS = stopwords
+
+pirfreetext $LIBFILE1 pir1.list
+if ( $status ) goto abort
+pirfreetext $LIBFILE2 pir2.list
+if ( $status ) goto abort
+pirfreetext $LIBFILE3 pir3.list
+if ( $status ) goto abort
+
+# sort on words, remove dumplicates, remove stopwords
+${SORT} -b +1 pir1.list | ${UNIQ} | excludewords ${WORDS} >!  freetext.presort
+if ( $status ) goto abort
+${RM} pir1.list
+${SORT} -b +1 pir2.list | ${UNIQ} | excludewords ${WORDS} >>  freetext.presort
+if ( $status ) goto abort
+${RM} pir2.list
+${SORT} -b +1 pir3.list | ${UNIQ} | excludewords ${WORDS} >>  freetext.presort
+if ( $status ) goto abort
+${RM} pir3.list
+
+# resort on entry name
+${SORT} freetext.presort >! freetext.sorted
+if ( $status ) goto abort
+${RM} freetext.presort
+
+# include entry numbers in the file
+freetext2 freetext.sorted freetext.entry
+if ( $status ) goto abort
+${RM} freetext.sorted
+
+# sort on text
+${SORT} -b +2 freetext.entry >! freetext.sorted2
+if ( $status ) goto abort
+${RM} freetext.entry
+
+# create the index
+freetext4 freetext.sorted2 freetext.trg freetext.hit
+if ( $status ) goto abort
+${RM} freetext.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "pirfreetext.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirtitle.script b/src/indexseqlibs/pirtitle.script
new file mode 100644
index 0000000..ed9c074
--- /dev/null
+++ b/src/indexseqlibs/pirtitle.script
@@ -0,0 +1,44 @@
+#! /bin/csh -f
+#
+# script to make a brief index for the pir library in codata format
+#
+
+echo "pirtitle.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name youve given to the library file
+#
+set LIBFILE1 = pir1.dat
+set LIBFILE2 = pir2.dat
+set LIBFILE3 = pir3.dat
+
+pirtitle1 $LIBFILE1 pir1.list
+if ( $status ) goto abort
+pirtitle1 $LIBFILE2 pir2.list
+if ( $status ) goto abort
+pirtitle2 $LIBFILE3 pir3.list
+if ( $status ) goto abort
+
+${SORT} pir1.list pir2.list pir3.list >! title.sorted
+if ( $status ) goto abort
+${RM} pir1.list pir2.list pir3.list
+
+title2 title.sorted brief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "pirtitle.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/pirtitle1.c b/src/indexseqlibs/pirtitle1.c
new file mode 100644
index 0000000..3b86ea3
--- /dev/null
+++ b/src/indexseqlibs/pirtitle1.c
@@ -0,0 +1,104 @@
+/*        pirtitle1            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <ctype.h>
+#include <string.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create short directory for pir library in embl cdrom form */
+
+char patternENTRY[] = "ENTRY";
+char patternTITLE[] = "TITLE";
+char patternACCESSION[] = "ACCESSION";
+char patternSUMMARY[] = "SUMMARY";
+char patternLength[] = "Length";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    char accession_number[10];
+    unsigned long sequence_length;
+    char title[80];
+    char *found_length;
+
+    int i;
+    int leftChar = 16;
+    long entries_in = 0;
+    long entries_out = 0;
+
+
+    printf("pirtitle1 Version 1.1\n");
+    
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: pirtitle1 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"pirtitle1: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"pirtitle1: cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+
+    entry_name[0] = '\0';
+    while (fgets(line, MAXLINE,ifp)!=NULL) {
+
+	if (patmatch(line, patternENTRY)){
+	    if (*entry_name) {
+		/* let's output gathered values for last entry*/
+		fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+			entry_name,
+			accession_number,
+			sequence_length,
+			title);
+		entries_out++;
+	    }
+	    /* initialise values */
+	    entry_name[0] = '\0';
+	    accession_number[0] = '\0';
+	    title[0] = '\0';
+	    sequence_length = 0;
+
+	    entries_in++;
+	    strncpy(entry_name,&line[leftChar],10);
+	} else if (patmatch(line, patternTITLE) && *title=='\0'){
+	    for(i=0;isprint(line[leftChar+i]) && i<80;i++) title[i]=line[leftChar+i];
+	    for(;i<80;i++) title[i] = ' ';
+	} else if (patmatch(line, patternACCESSION) && *accession_number=='\0'){
+	    for(i=0;isalnum(line[leftChar+i]) && i<10;i++) accession_number[i]=line[leftChar+i];
+	    for(;i<10;i++) accession_number[i] = ' ';
+	} else if (patmatch(line, patternSUMMARY) && sequence_length==0){
+	    found_length = strstr(line, patternLength);
+	    if (found_length) sequence_length = atoi(found_length+7);
+	}
+
+
+    }
+    if (*entry_name) {
+	/* let's output gathered values for last entry*/
+	fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+		entry_name,
+		accession_number,
+		sequence_length,
+		title);
+	entries_out++;
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in);
+    printf(" Number of entries written = %ld\n\n",entries_out);
+
+    fclose(ifp);
+    fclose(ofp);
+    
+    return 0;
+}
diff --git a/src/indexseqlibs/pirtitle2.c b/src/indexseqlibs/pirtitle2.c
new file mode 100644
index 0000000..c84a734
--- /dev/null
+++ b/src/indexseqlibs/pirtitle2.c
@@ -0,0 +1,100 @@
+/*        pirtitle2            */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <ctype.h>
+
+#define MAXLINE 81     /* maximum input line length*/
+
+/* program to create short directory for pir library in embl cdrom form */
+
+char patternENTRY[] = "ENTRY";
+char patternTITLE[] = "TITLE";
+char patternSUMMARY[] = "SUMMARY";
+char patternLength[] = "Length";
+
+#define patmatch(L,P) (strncmp((L),(P),strlen(P))==0)
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *ofp;
+    char line[MAXLINE];
+    char entry_name[10];
+    char accession_number[10];
+    unsigned long sequence_length;
+    char title[80];
+    char *found_length;
+
+    int i;
+    int leftChar = 16;
+    long entries_in = 0;
+    long entries_out = 0;
+
+    
+    printf("pirtitle2 Version 1.0\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: pirtitle2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"pirtitle2: Cannot open input file %s\n",argv[1]);
+	exit(1);
+    }
+    if ((ofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"pirtitle2: cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    entry_name[0] = '\0';
+    while (fgets(line, MAXLINE,ifp)!=NULL) {
+
+	if (patmatch(line, patternENTRY)){
+	    if (*entry_name) {
+		/* let's output gathered values for last entry*/
+		fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+			entry_name,
+			accession_number,
+			sequence_length,
+			title);
+		entries_out++;
+	    }
+	    /* initialise values */
+	    entry_name[0] = '\0';
+	    accession_number[0] = '\0';
+	    title[0] = '\0';
+	    sequence_length = 0;
+
+	    entries_in++;
+	    strncpy(entry_name,&line[leftChar],10);
+	    strncpy(accession_number,&line[leftChar],10);
+	} else if (patmatch(line, patternTITLE) && *title=='\0'){
+	    for(i=0;isprint(line[leftChar+i]) && i<80;i++) title[i]=line[leftChar+i];
+	    for(;i<80;i++) title[i] = ' ';
+	} else if (patmatch(line, patternSUMMARY) && sequence_length==0){
+	    found_length = strstr(line, patternLength);
+	    if (found_length) sequence_length = atoi(found_length+7);
+	}
+
+
+    }
+    if (*entry_name) {
+	/* let's output gathered values for last entry*/
+	fprintf(ofp,"%-10.10s %-10.10s %10lu %-80.80s\n",
+		entry_name,
+		accession_number,
+		sequence_length,
+		title);
+	entries_out++;
+    }
+
+    printf(" Number of entries read = %ld\n",entries_in);
+    printf(" Number of entries written = %ld\n\n",entries_out);
+
+    fclose(ifp);
+    fclose(ofp);
+   
+    return 0;
+}
diff --git a/src/indexseqlibs/stopwords b/src/indexseqlibs/stopwords
new file mode 100644
index 0000000..f804983
--- /dev/null
+++ b/src/indexseqlibs/stopwords
@@ -0,0 +1,415 @@
+&
+!
++
+-
+0
+01
+02
+03
+04
+05
+06
+07
+08
+09
+1
+2
+3
+4
+5
+6
+7
+8
+9
+A
+AA
+ABOUT
+ABOVE
+ACCORDING
+ACROSS
+ACTUALLY
+ADJ
+AFTER
+AFTERWARDS
+AGAIN
+AGAINST
+ALL
+ALMOST
+ALONE
+ALONG
+ALREADY
+ALSO
+ALTHOUGH
+ALWAYS
+AMONG
+AMONGST
+AN
+AND
+ANOTHER
+ANY
+ANYHOW
+ANYONE
+ANYTHING
+ANYWHERE
+ARE
+AREN'T
+AROUND
+AS
+AT
+B
+BE
+BECAME
+BECAUSE
+BECOME
+BECOMES
+BECOMING
+BEEN
+BEFORE
+BEFOREHAND
+BEGIN
+BEGINNING
+BEHIND
+BEING
+BELOW
+BESIDE
+BESIDES
+BETWEEN
+BEYOND
+BILLION
+BOTH
+BUT
+BY
+C
+CAN
+CAN'T
+CANNOT
+CAPTION
+CO
+CO.
+COULD
+COULDN'T
+CREATED
+D
+DID
+DIDN'T
+DO
+DOES
+DOESN'T
+DON'T
+DOWN
+DURING
+E
+EACH
+EG
+EIGHT
+EIGHTY
+EITHER
+ELSE
+ELSEWHERE
+END
+ENDING
+ENOUGH
+ETC
+EVEN
+EVER
+EVERY
+EVERYONE
+EVERYTHING
+EVERYWHERE
+EXCEPT
+F
+FEW
+FIFTY
+FIRST
+FIVE
+FOR
+FORMER
+FORMERLY
+FORTY
+FOUND
+FOUR
+FROM
+FURTHER
+G
+H
+HAD
+HAS
+HASN'T
+HAVE
+HAVEN'T
+HE
+HE'D
+HE'LL
+HE'S
+HENCE
+HER
+HERE
+HERE'S
+HEREAFTER
+HEREBY
+HEREIN
+HEREUPON
+HERS
+HERSELF
+HIM
+HIMSELF
+HIS
+HOW
+HOWEVER
+HUNDRED
+I
+I'D
+I'LL
+I'M
+I'VE
+IE
+IF
+IN
+INC.
+INDEED
+INSTEAD
+INTO
+IS
+ISN'T
+IT
+IT'S
+ITS
+ITSELF
+J
+K
+L
+LAST
+LATER
+LATTER
+LATTERLY
+LEAST
+LESS
+LET
+LET'S
+LIKE
+LIKELY
+LTD
+M
+MADE
+MAKE
+MAKES
+MANY
+MAYBE
+ME
+MEANTIME
+MEANWHILE
+MIGHT
+MILLION
+MISS
+MORE
+MOREOVER
+MOST
+MOSTLY
+MR
+MRS
+MUCH
+MUST
+MY
+MYSELF
+N
+NAME
+NAMELY
+NEITHER
+NEVER
+NEVERTHELESS
+NEXT
+NINE
+NINETY
+NO
+NOBODY
+NONE
+NONETHELESS
+NOONE
+NOR
+NOT
+NOTHING
+NOW
+NOWHERE
+O
+OF
+OFF
+OFTEN
+ON
+ONCE
+ONE
+ONE'S
+ONLY
+ONTO
+OR
+OTHER
+OTHERS
+OTHERWISE
+OUR
+OURS
+OURSELVES
+OUT
+OVER
+OVERALL
+OWN
+P
+PER
+PERHAPS
+Q
+R
+RATHER
+RECENT
+RECENTLY
+REL
+S
+SAME
+SEEM
+SEEMED
+SEEMING
+SEEMS
+SEVEN
+SEVENTY
+SEVERAL
+SHE
+SHE'D
+SHE'LL
+SHE'S
+SHOULD
+SHOULDN'T
+SINCE
+SIX
+SIXTY
+SO
+SOME
+SOMEHOW
+SOMEONE
+SOMETHING
+SOMETIME
+SOMETIMES
+SOMEWHERE
+STILL
+STOP
+SUCH
+T
+TAKING
+TEN
+THAN
+THAT
+THAT'LL
+THAT'S
+THAT'VE
+THE
+THEIR
+THEM
+THEMSELVES
+THEN
+THENCE
+THERE
+THERE'D
+THERE'LL
+THERE'RE
+THERE'S
+THERE'VE
+THEREAFTER
+THEREBY
+THEREFORE
+THEREIN
+THEREUPON
+THESE
+THEY
+THEY'D
+THEY'LL
+THEY'RE
+THEY'VE
+THIRTY
+THIS
+THOSE
+THOUGH
+THOUSAND
+THREE
+THROUGH
+THROUGHOUT
+THRU
+THUS
+TO
+TOGETHER
+TOO
+TOWARD
+TOWARDS
+TRILLION
+TWENTY
+TWO
+U
+UNDER
+UNLESS
+UNLIKE
+UNLIKELY
+UNTIL
+UP
+UPON
+US
+USED
+USING
+V
+VERY
+VIA
+W
+WAS
+WASN'T
+WE
+WE'D
+WE'LL
+WE'RE
+WE'VE
+WELL
+WERE
+WEREN'T
+WHAT
+WHAT'LL
+WHAT'S
+WHAT'VE
+WHATEVER
+WHEN
+WHENCE
+WHENEVER
+WHERE
+WHERE'S
+WHEREAFTER
+WHEREAS
+WHEREBY
+WHEREIN
+WHEREUPON
+WHEREVER
+WHETHER
+WHICH
+WHILE
+WHITHER
+WHO
+WHO'D
+WHO'LL
+WHO'S
+WHOEVER
+WHOLE
+WHOM
+WHOMEVER
+WHOSE
+WHY
+WILL
+WITH
+WITHIN
+WITHOUT
+WON'T
+WOULD
+WOULDN'T
+X
+Y
+YES
+YET
+YOU
+YOU'D
+YOU'LL
+YOU'RE
+YOU'VE
+YOUR
+YOURS
+YOURSELF
+YOURSELVES
diff --git a/src/indexseqlibs/swissaccession.script b/src/indexseqlibs/swissaccession.script
new file mode 100644
index 0000000..24e331e
--- /dev/null
+++ b/src/indexseqlibs/swissaccession.script
@@ -0,0 +1,52 @@
+#! /bin/csh -f
+#
+# script to make an accession number index for the swissprot library
+# newdata file
+#
+
+echo "swissaccession.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is what you've called the sequence library
+#
+set LIBFILE = seq.dat
+
+# get entry names and accession numbers
+emblaccess1 $LIBFILE access.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} access.list >! access.sorted
+if ( $status ) goto abort
+${RM} access.list
+
+# assign entry numbers to entry names
+access2 access.sorted access.entry
+if ( $status ) goto abort
+${RM} access.sorted
+
+# sort on accession numbers
+${SORT} -b +1 access.entry >! access.sorted2
+if ( $status ) goto abort
+${RM} access.entry
+
+# create index files
+access4 access.sorted2 acnum.trg acnum.hit
+if ( $status ) goto abort
+${RM} access.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "swissaccession.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/swissauthor.script b/src/indexseqlibs/swissauthor.script
new file mode 100644
index 0000000..e286ab7
--- /dev/null
+++ b/src/indexseqlibs/swissauthor.script
@@ -0,0 +1,53 @@
+#! /bin/csh -f
+#
+# script to make an author index for the swissprot newdata library
+#
+
+echo "swissauthor.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+set UNIQ = uniq
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE = seq.dat
+set WORDS = stopwords
+
+# get free text
+emblauthor $LIBFILE author.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} -u author.list >! author.sorted
+if ( $status ) goto abort
+${RM} author.list
+
+# include entry numbers in the file
+freetext2 author.sorted author.entry
+if ( $status ) goto abort
+${RM} author.sorted
+
+# sort on text
+${SORT} -b +2 author.entry >! author.sorted2
+if ( $status ) goto abort
+${RM} author.entry
+
+# create the index
+hitNtrg author.sorted2 author.trg author.hit 35
+if ( $status ) goto abort
+${RM} author.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "swissauthor.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/swissdivision.script b/src/indexseqlibs/swissdivision.script
new file mode 100644
index 0000000..a3eeeca
--- /dev/null
+++ b/src/indexseqlibs/swissdivision.script
@@ -0,0 +1,34 @@
+#! /bin/csh -f
+#
+# script to create an division index for the embl library in codata format
+#
+
+echo "embldivision.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+
+#
+#
+#
+
+
+awk '{printf("%-12.12s %d\n",$1,$2);}' << EOF >! division.list
+seq.dat 1
+EOF
+
+division division.list division.lkp
+if ( $status ) goto abort
+${RM} division.list
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "embldivision.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/swissentryname.script b/src/indexseqlibs/swissentryname.script
new file mode 100644
index 0000000..b10c032
--- /dev/null
+++ b/src/indexseqlibs/swissentryname.script
@@ -0,0 +1,41 @@
+#! /bin/csh -f
+#
+# script to create an entryname index for the swissprot newdata library
+#
+
+echo "swissentryname.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE = seq.dat
+
+# get entry names
+emblentryname1 $LIBFILE entry.list 1
+if ( $status ) goto abort
+
+# sort on entry names
+${SORT} entry.list >! entry.sorted
+if ( $status ) goto abort
+${RM} entry.list
+
+# create index file
+entryname2 entry.sorted entrynam.idx
+if ( $status ) goto abort
+${RM} entry.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "swissentryname.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/swissfreetext.script b/src/indexseqlibs/swissfreetext.script
new file mode 100644
index 0000000..e31a593
--- /dev/null
+++ b/src/indexseqlibs/swissfreetext.script
@@ -0,0 +1,53 @@
+#! /bin/csh -f
+#
+# script to make a free text index for the swissprot newdata library
+#
+
+echo "swissfreetext.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+set UNIQ = uniq
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE = seq.dat
+set WORDS = stopwords
+
+# get free text
+emblfreetext $LIBFILE freetext.list
+if ( $status ) goto abort
+
+# sort on words, remove dumplicates, remove stopwords, resort on entry name
+${SORT} -b +1 freetext.list | ${UNIQ} | excludewords ${WORDS} | ${SORT} >! freetext.sorted
+if ( $status ) goto abort
+${RM} freetext.list
+
+# include entry numbers in the file
+freetext2 freetext.sorted freetext.entry
+if ( $status ) goto abort
+${RM} freetext.sorted
+
+# sort on text
+${SORT} -b +2 freetext.entry >! freetext.sorted2
+if ( $status ) goto abort
+${RM} freetext.entry
+
+# create the index
+freetext4 freetext.sorted2 freetext.trg freetext.hit
+if ( $status ) goto abort
+${RM} freetext.sorted2
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "swissfreetext.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/swisstitle.script b/src/indexseqlibs/swisstitle.script
new file mode 100644
index 0000000..b109e2f
--- /dev/null
+++ b/src/indexseqlibs/swisstitle.script
@@ -0,0 +1,41 @@
+#! /bin/csh -f
+#
+# script to make a brief index for the swissprot newdata library
+#
+
+echo "swisstitle.script Version 1.0"
+echo ""
+
+set RM = "/bin/rm -f"
+set SORT = sort
+
+#
+# LIBFILE is the name you've given to the library file
+#
+set LIBFILE = seq.dat
+
+# create title list
+embltitle1 $LIBFILE title.list
+if ( $status ) goto abort
+
+# sort on entry name
+${SORT} title.list >! title.sorted
+if ( $status ) goto abort
+${RM} title.list
+
+# generate index
+title2 title.sorted brief.idx
+if ( $status ) goto abort
+${RM} title.sorted
+
+echo "Completion successful"
+echo ""
+
+exit 0
+
+#
+# Abort
+#
+abort:
+echo "swisstitle.script: aborting due to serious error"
+exit 2
diff --git a/src/indexseqlibs/title2.c b/src/indexseqlibs/title2.c
new file mode 100644
index 0000000..fe14b23
--- /dev/null
+++ b/src/indexseqlibs/title2.c
@@ -0,0 +1,88 @@
+/*        title2          */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include <string.h>
+#include "mach-io.h"
+#include "cdromheader.h"
+
+/* program to read in a title file and write it out in binary  with a header */
+#define MAXLINE 128 /* > 80 + 10 + 10 + 10 */
+
+int main(int argc, char *argv[])
+{
+    FILE *ifp;
+    FILE *tofp;
+    char line[MAXLINE];
+    char last_entry[10];
+    
+    uint_4 sequenceLength = 0;
+    uint_4 nRecords = 0;
+    uint_2 recordSize = 104;
+
+
+    printf("title2 Version 1.1\n");
+    
+    if (argc != 3) {
+	fprintf(stderr,"Usage: title2 filein fileout\n");
+	exit(2);
+    }
+    if ((ifp = fopen(argv[1],"r")) == NULL) {
+	fprintf(stderr,"title2: Cannot open input file %s\n",argv[1]);
+	exit(1);
+	}
+    if ((tofp = fopen(argv[2],"w")) == NULL) {
+	fprintf(stderr,"title2: Cannot open output file %s\n",argv[2]);
+	exit(1);
+    }
+    
+    if(cdromheader(tofp, recordSize ) != 0 ) {
+	fprintf(stderr,"title2: Error writing target file header\n");
+	exit(1);
+    }
+
+    last_entry[0]='\0';
+    while (fgets(line,MAXLINE,ifp) != NULL) {
+	/*
+	  Format of input line:
+	  "%10s %10s %10d %80s\n"
+	     entry name, accession number, sequenceLength, title
+	  */
+	
+	/* beware of duplicate entries */
+	if (strncmp(last_entry,line,10) == 0) {
+	    fprintf(stderr,"title2: warning - entry %-10.10s appears twice\n", line);
+	} else {
+	    sequenceLength = atoi(line+22);
+	    
+	    fwrite(line,10,1,tofp);
+	    fwrite(line+11,10,1,tofp);
+	    if(le_write_int_4(tofp, &sequenceLength) == 0) {
+		fprintf(stderr,"title2: Error writing to target file\n");
+		exit(1);
+	    }
+	    fwrite(line+33,80,1,tofp);
+
+	    nRecords++;;
+	    strncpy(last_entry,line,10);
+	}
+    }
+    
+    /* add number of records to header */
+    if (fseek(tofp, (off_t)4,0)) {
+	fprintf(stderr,"title2: Error seeking on target file\n");
+	exit(1);
+    }
+    if(le_write_int_4(tofp, &nRecords) == 0) {
+	fprintf(stderr,"title2: Error writing to target file\n");
+	exit(1);
+    }
+
+    printf(" Records in target file %d\n\n", nRecords);
+
+    fclose(ifp);
+    fclose(tofp);
+
+    return 0;
+}
diff --git a/src/newted/CRN b/src/newted/CRN
new file mode 100644
index 0000000..990a9c4
--- /dev/null
+++ b/src/newted/CRN
@@ -0,0 +1,11 @@
+ted - trace editor
+
+Copyright 1991 Timothy Gleeson, LaDeana Hillier and Simon Dear
+
+PLEASE NOTE:
+	373A sequence data file formats are the exclusive property
+	of Applied Biosystems, Inc.
+
+	ALF sequence data file formats are the exclusive property
+	of Pharmacia LKB Biotechnology, Inc.
+
diff --git a/src/newted/Graph.c b/src/newted/Graph.c
new file mode 100644
index 0000000..0e69bc7
--- /dev/null
+++ b/src/newted/Graph.c
@@ -0,0 +1,283 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Graph.c,v 1.2 88/10/25 17:40:25 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#include <X11/IntrinsicP.h>
+#include <X11/StringDefs.h>
+#include "GraphP.h"
+
+
+#include <stdio.h>
+
+/* ---- Forward declarations ---- */
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args);
+static void Destroy(Widget w);
+static void Redisplay(Widget w, XEvent *event, Region region);
+static void Resize(Widget w);
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params);
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args);
+
+
+
+
+/* ---- Private data ---- */
+
+
+static XtResource resources[] = {
+#define  offset(field) XtOffset(GraphWidget, graph.field)
+#define coffset(field) XtOffset(Widget,      core.field)
+    /* {name, class, type, size, offset, default_type, default_addr}, */
+    { XtNcallback, XtCCallback, XtRCallback, (Cardinal)sizeof(XtCallbackList),
+	  offset(input_callback), XtRCallback, NULL },
+    { XtNdimBackground, XtCBackground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(dimBackground), XtRString, "XtDefaultBackground" },
+    { XtNexposeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(expose_callback), XtRCallback, NULL },
+    { XtNfont, XtCFont, XtRFontStruct, (Cardinal)sizeof(XFontStruct*),
+	  offset(font), XtRString, XtDefaultFont },
+    { XtNgraphColour1, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour1), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour2, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour2), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour3, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour3), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour4, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour4), XtRString, "XtDefaultForeground" },
+    { XtNheight, XtCHeight, XtRDimension, (Cardinal)sizeof(Dimension),
+	  coffset(height), XtRImmediate, (caddr_t)1},
+    { XtNwidth, XtCWidth, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  coffset(width), XtRImmediate, (caddr_t)1},
+    { XtNresizeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(resize_callback), XtRCallback, NULL },
+#undef  offset
+#undef coffset
+};
+
+
+static XtActionsRec actions[] =
+{
+  /* {name, procedure}, */
+    {"input",	InputAction},
+};
+
+
+static char translations[] =
+"<Key>:		input()	\n\
+ <BtnDown>:     input() \
+";
+
+
+GraphClassRec graphClassRec = {
+  { /* core fields initial values */
+    /* superclass               */      (WidgetClass) &simpleClassRec,
+    /* class_name		*/	"Graph",
+    /* widget_size		*/	(Cardinal)sizeof(GraphRec),
+    /* class_initialize		*/	NULL,
+    /* class_part_initialize	*/	NULL,
+    /* class_inited		*/	FALSE,
+    /* initialize		*/	Initialize,
+    /* initialize_hook		*/	NULL,
+    /* realize			*/	XtInheritRealize,
+    /* actions			*/	actions,
+    /* num_actions		*/	XtNumber(actions),
+    /* resources		*/	resources,
+    /* num_resources		*/	XtNumber(resources),
+    /* xrm_class		*/	NULLQUARK,
+    /* compress_motion		*/	TRUE,
+    /* compress_exposure	*/	XtExposeCompressMaximal,
+    /* compress_enterleave	*/	TRUE,
+    /* visible_interest		*/	FALSE,
+    /* destroy			*/	Destroy,
+    /* resize			*/	Resize,
+    /* expose			*/	Redisplay,
+    /* set_values		*/	SetValues,
+    /* set_values_hook		*/	NULL,
+    /* set_values_almost	*/	XtInheritSetValuesAlmost,
+    /* get_values_hook		*/	NULL,
+    /* accept_focus		*/	NULL,
+    /* version			*/	XtVersion,
+    /* callback_private		*/	NULL,
+    /* tm_table			*/	translations,
+    /* query_geometry		*/	XtInheritQueryGeometry,
+    /* display_accelerator	*/	XtInheritDisplayAccelerator,
+    /* extension		*/	NULL
+  },
+  { /* Simple class initial values */
+    /* change_sensitive         */      XtInheritChangeSensitive
+  },
+  { /* Graph class initial values */
+    /* empty			*/	0
+  }
+};
+
+WidgetClass graphWidgetClass = (WidgetClass)&graphClassRec;
+
+
+
+
+/* ---- Private procedures ---- */
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args)
+{   GraphWidget    gw = (GraphWidget)new;
+    unsigned long  xGCMask;
+    XGCValues      xGCValues;
+    Display       *display;
+    int            screen;
+
+    display  = XtDisplay(request);
+    screen   = DefaultScreen(display);
+
+    if (DisplayPlanes(display,screen)==1)
+    {   /* We are on a one-plane monochrome display.
+           Use dashes to make lines distinguishable.
+        */
+        xGCMask = GCForeground | GCBackground | GCLineStyle;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineSolid;
+        gw->graph.graphGC1 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 1;
+        gw->graph.graphGC2 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 2;
+        gw->graph.graphGC3 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 4;
+        gw->graph.graphGC4 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+	gw->graph.graphColour1 = BlackPixel(display,screen);
+	gw->graph.graphColour2 = BlackPixel(display,screen);
+	gw->graph.graphColour3 = BlackPixel(display,screen);
+	gw->graph.graphColour4 = BlackPixel(display,screen);
+    }
+    else
+    {   /* Some sort of multi-plane display.
+        */
+        xGCMask = GCForeground | GCBackground;
+        xGCValues.background = gw->core.background_pixel;
+
+        xGCValues.foreground = gw->graph.graphColour1;
+        gw->graph.graphGC1 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour2;
+        gw->graph.graphGC2 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour3;
+        gw->graph.graphGC3 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour4;
+        gw->graph.graphGC4 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+    }
+}
+
+
+static void Destroy (Widget w)
+{    GraphWidget gw = (GraphWidget) w;
+
+     XtDestroyGC(gw->graph.graphGC1);
+     XtDestroyGC(gw->graph.graphGC2);
+     XtDestroyGC(gw->graph.graphGC3);
+     XtDestroyGC(gw->graph.graphGC4);
+}
+
+
+static void Redisplay(Widget w, XEvent *event, Region region)
+{   XtCallCallbacks(w, XtNexposeCallback, (caddr_t)region);
+}
+
+
+static void Resize(Widget w)
+{   XtCallCallbacks(w, XtNresizeCallback, 0);
+}
+
+
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params)
+{   XtCallCallbacks(w, XtNcallback, (caddr_t)event);
+}
+
+
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args)
+{   GraphWidget g_old = (GraphWidget) current;
+    GraphWidget g_new = (GraphWidget) new;
+
+    if	(DisplayPlanes(XtDisplay(new), DefaultScreen(XtDisplay(new))) == 1 ||
+	 !XtIsRealized(new))
+    {   /* Silly to change, or can't change.
+           Don't generate an expose.
+        */
+        return(False);
+    }
+    else
+    {   if (g_old->graph.graphColour1 != g_new->graph.graphColour1)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC1,
+			   g_new->graph.graphColour1);
+
+        if (g_old->graph.graphColour2 != g_new->graph.graphColour2)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC2,
+			   g_new->graph.graphColour2);
+
+        if (g_old->graph.graphColour3 != g_new->graph.graphColour3)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC3,
+			   g_new->graph.graphColour3);
+
+        if (g_old->graph.graphColour4 != g_new->graph.graphColour4)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC4,
+			   g_new->graph.graphColour4);
+
+        return(True);
+    }
+}
+
+
+
+
+/* ---- Exported procedures ---- */
+
+
+GC GraphGC1(Widget w)
+{ return((GraphWidget)w)->graph.graphGC1;
+}
+
+
+GC GraphGC2(Widget w)
+{ return((GraphWidget)w)->graph.graphGC2;
+}
+
+
+GC GraphGC3(Widget w)
+{ return((GraphWidget)w)->graph.graphGC3;
+}
+
+
+GC GraphGC4(Widget w)
+{ return((GraphWidget)w)->graph.graphGC4;
+}
diff --git a/src/newted/Graph.h b/src/newted/Graph.h
new file mode 100644
index 0000000..99fe869
--- /dev/null
+++ b/src/newted/Graph.h
@@ -0,0 +1,81 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Graph.h,v 1.2 88/10/25 17:22:09 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _Graph_h
+#define _Graph_h
+
+/****************************************************************
+ *
+ * Graph widget
+ *
+ ****************************************************************/
+
+#include <X11/Xaw/Simple.h>
+
+/* Resources:
+
+ Name		     Class		RepType		Default Value
+ ----		     -----		-------		-------------
+ background	     Background		Pixel		XtDefaultBackground
+ border		     BorderColor	Pixel		XtDefaultForeground
+ borderWidth	     BorderWidth	Dimension	1
+ callback            Callback           Callback        NULL
+ cursor              Cursor             Cursor          None
+ destroyCallback     Callback		Pointer		NULL
+ dimBackground	     Background		Pixel		XtDefaultBackground
+ exposeCallback      Callback           Callback        NULL
+ font                Font               XFontStruct*    XtDefaultFont
+ graphColour1        Foreground         Pixel           XtDefaultForeground
+ graphColour2        Foreground         Pixel           XtDefaultForeground
+ graphColour3        Foreground         Pixel           XtDefaultForeground
+ graphColour4        Foreground         Pixel           XtDefaultForeground
+ height		     Height		Dimension	1
+ insensitiveBorder   Insensitive	Pixmap		Gray
+ mappedWhenManaged   MappedWhenManaged	Boolean		True
+ resizeCallback      Callback           Callback        NULL
+ sensitive	     Sensitive		Boolean		True
+ width		     Width		Dimension	1
+ x		     Position		Position	0
+ y		     Position		Position	0
+
+*/
+
+/* define any special resource names here that are not in <X11/StringDefs.h> */
+
+#define XtNgraphResource		"graphResource"
+#define XtCGraphResource		"GraphResource"
+
+#define XtNdimBackground                "dimBackground"
+#define XtNexposeCallback               "exposeCallback"
+#define XtNgraphColour1                 "graphColour1"
+#define XtNgraphColour2                 "graphColour2"
+#define XtNgraphColour3                 "graphColour3"
+#define XtNgraphColour4                 "graphColour4"
+#define XtNresizeCallback               "resizeCallback"
+
+
+/* declare specific GraphWidget class and instance datatypes */
+
+typedef struct _GraphClassRec  *GraphWidgetClass;
+typedef struct _GraphRec       *GraphWidget;
+
+
+/* declare the class constant */
+
+extern WidgetClass graphWidgetClass;
+
+
+/* ---- Exported procedures ---- */
+
+extern Pixel GraphColour1(Widget w);
+extern Pixel GraphColour2(Widget w);
+extern Pixel GraphColour3(Widget w);
+extern Pixel GraphColour4(Widget w);
+extern GC    GraphGC1(Widget w);
+extern GC    GraphGC2(Widget w);
+extern GC    GraphGC3(Widget w);
+extern GC    GraphGC4(Widget w);
+
+#endif /* _Graph_h */
diff --git a/src/newted/GraphP.h b/src/newted/GraphP.h
new file mode 100644
index 0000000..07c673f
--- /dev/null
+++ b/src/newted/GraphP.h
@@ -0,0 +1,54 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: GraphP.h,v 1.2 88/10/25 17:37:59 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _GraphP_h
+#define _GraphP_h
+
+#include "Graph.h"
+/* include superclass private header file */
+#include <X11/Xaw/SimpleP.h>
+
+
+/* define unique representation types not found in <X11/StringDefs.h> */
+
+#define XtRGraphResource		"GraphResource"
+
+typedef struct {
+    int empty;
+} GraphClassPart;
+
+typedef struct _GraphClassRec {
+    CoreClassPart	core_class;
+    SimpleClassPart     simple_class;
+    GraphClassPart	graph_class;
+} GraphClassRec;
+
+extern GraphClassRec graphClassRec;
+
+typedef struct {
+    /* resources */
+    Pixel dimBackground;
+    XFontStruct* font;
+    Pixel graphColour1;
+    Pixel graphColour2;
+    Pixel graphColour3;
+    Pixel graphColour4;
+    XtCallbackList expose_callback;
+    XtCallbackList resize_callback;
+    XtCallbackList input_callback;
+    /* private state */
+    GC graphGC1;
+    GC graphGC2;
+    GC graphGC3;
+    GC graphGC4;
+} GraphPart;
+
+typedef struct _GraphRec {
+    CorePart	core;
+    SimplePart  simple;
+    GraphPart	graph;
+} GraphRec;
+
+#endif /* _GraphP_h */
diff --git a/src/newted/README b/src/newted/README
new file mode 100644
index 0000000..670da50
--- /dev/null
+++ b/src/newted/README
@@ -0,0 +1,32 @@
+ted - trace editor
+------------------
+This is the source directory for the ted trace editor.
+As well as the sources you should find a makefile,
+a manual page "ted.manpage", documentation  "ted.tex"
+(this preprint is still in review and should be treated
+as confidential until it has been accepted for publication)
+a help file "ted.help" and a default resources file "Xted".
+
+The "ted.help" and "Xted" files should be placed in the
+"/usr/lib/X11/app-defaults" directory. The "ted.manpage"
+file should be placed in "/usr/man/manl/ted.l", or wherever
+local manual pages are stored on your system.
+
+The "ted" application can be constructed by simply typing
+"make ted". The "ted" application can then be placed in
+a generally accessible directory of binaries, perhaps
+"/usr/local/bin".
+
+
+
+sample files
+------------
+
+A sample ABI file is also included in this directory, trace.abi, which
+ can be opened using the following command:
+		ted -ABI trace.abi
+
+A sample ALF file is also included in this directory, trace.alf, which
+ can be opened using the following command:
+		ted -ALF trace.alf
+
diff --git a/src/newted/Xted b/src/newted/Xted
new file mode 100644
index 0000000..85aa190
--- /dev/null
+++ b/src/newted/Xted
@@ -0,0 +1,173 @@
+!
+! Resources for the "ted" trace editor (Dec 1990)
+!
+
+! default magnification
+Xted*magnif: 30
+! default enzyme string to search for as a left cutoff
+Xted*enz: N
+
+! The application
+Xted.minWidth:  350
+Xted.minHeight: 250
+
+! Labels have no borders and are not resized
+Xted*Label.borderWidth: 0
+Xted*Label.top:         chainTop
+Xted*Label.bottom:      chainTop
+Xted*Label.left:        chainLeft
+Xted*Label.right:       chainLeft
+
+! Buttons are not resized
+Xted*Command.top:         chainTop
+Xted*Command.bottom:      chainTop
+Xted*Command.left:        chainLeft
+Xted*Command.right:       chainLeft
+
+! All the toggles are one-of-many
+Xted*Toggle.Translations: #override \n\
+         <EnterWindow>:        highlight(Always) \n\
+         <LeaveWindow>:        unhighlight()     \n\
+         <Btn1Down>,<Btn1Up>: set() notify()
+
+Xted.mainForm.title.label: Trace editor
+Xted.mainForm.version.label: \
+Dec 1990
+Xted.mainForm.seqNamePrompt.label: Input:
+Xted.mainForm.seqNamePrompt.height: 13
+Xted.mainForm.seqName.label:       NONE
+Xted.mainForm.seqName.resizable:   True
+Xted.mainForm.seqName.height: 13
+Xted.mainForm.search.label: Search
+Xted.mainForm.search.height: 13
+Xted.mainForm.NorigBasesPrompt.label: Bases:
+Xted.mainForm.NorigBases.label:
+Xted.mainForm.NorigBases.resizable:   True
+Xted.mainForm.strand.label: Strand: Top
+Xted.mainForm.strand.resizable: True
+
+! Display forms have no border
+Xted.mainForm.Form.borderWidth:     0
+
+ ! The edit mode toggles
+Xted.mainForm.Toggle.top:    chainTop
+Xted.mainForm.Toggle.bottom: chainTop
+Xted.mainForm.Toggle.left:   chainLeft
+Xted.mainForm.Toggle.right:  chainLeft
+Xted.mainForm.edmodelab.label: Mode:
+Xted.mainForm.adjustL.label:   Adj left cut
+Xted.mainForm.edSeq.label:     Edit seq
+Xted.mainForm.adjustR.label:   Adj right cut
+Xted.mainForm.edSeq.state:     True
+Xted.mainForm.adjustL.height:   13
+Xted.mainForm.edSeq.height:     13
+Xted.mainForm.adjustR.height:   13
+
+
+! The magnification slider
+Xted.mainForm.maglab.top:    chainTop
+Xted.mainForm.maglab.bottom: chainTop
+Xted.mainForm.maglab.left:   chainLeft
+Xted.mainForm.maglab.right:  chainLeft
+Xted.mainForm.maglab.label: Mag:
+Xted.mainForm.maglab.height: 13
+Xted.mainForm.magscr.top:    chainTop
+Xted.mainForm.magscr.bottom: chainTop
+Xted.mainForm.magscr.left:   chainLeft
+Xted.mainForm.magscr.right:  chainLeft
+Xted.mainForm.magscr.orientation: horizontal
+Xted.mainForm.magscr.length: 100
+Xted.mainForm.magscr.height: 13
+
+Xted.mainForm.scaleDown.label: Scale down
+Xted.mainForm.scaleUp.label: Scale up
+
+! The main display: a viewport containing a form containing graphs
+Xted.mainForm.viewport.top:    chainTop
+Xted.mainForm.viewport.bottom: chainBottom
+Xted.mainForm.viewport.left:   chainLeft
+Xted.mainForm.viewport.right:  chainRight
+Xted.mainForm.viewport.allowHoriz: True
+Xted.mainForm.viewport.forcebars:  True
+Xted.mainForm.viewport.width:      700
+Xted.mainForm.viewport.vpForm.Graph.resizable: True
+Xted.mainForm.viewport.vpForm.Graph.top:    chainTop
+Xted.mainForm.viewport.vpForm.Graph.bottom: chainTop
+Xted.mainForm.viewport.vpForm.Graph.left:   chainLeft
+Xted.mainForm.viewport.vpForm.Graph.right:  chainRight
+Xted.mainForm.viewport.vpForm.trace.top:    chainTop
+Xted.mainForm.viewport.vpForm.trace.bottom: chainBottom
+Xted.mainForm.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+! The text graph heights are dynamically set to accomodate the font
+Xted.mainForm.viewport.vpForm.trace.height: 150
+! The graphs are dynamically initially set to fill the viewport
+Xted.mainForm.viewport.vpForm.Graph.background: snow2
+Xted.mainForm.viewport.vpForm.Graph.dimBackground: grey
+Xted.mainForm.viewport.vpForm.Graph.graphColour1:  limegreen
+Xted.mainForm.viewport.vpForm.Graph.graphColour2:  purple
+Xted.mainForm.viewport.vpForm.Graph.graphColour3:  black
+Xted.mainForm.viewport.vpForm.Graph.graphColour4:  red
+
+! Dummy widget - we only care about its width
+Xted.mainForm.dummy.top:    chainTop
+Xted.mainForm.dummy.bottom: chainBottom
+Xted.mainForm.dummy.left:   chainLeft
+Xted.mainForm.dummy.right:  chainRight
+Xted.mainForm.dummy.width:  700
+
+
+! Dialogues
+Xted*search.ioForm.ioPrompt.label:  Search string or base number
+Xted*input.ioForm.ioPrompt.label:   Input a file
+Xted*output.ioForm.ioPrompt.label:  Save a file
+Xted*check.ioForm.ioPrompt.label:   \
+The existing sequence has been altered.\n\
+Do you really want to overwrite it?
+Xted*quit.ioForm.ioPrompt.label:    \
+The existing sequence has been altered.\n\
+Do you really want to quit?
+Xted*check.ioForm..translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*quit.ioForm..translations:    #override \n\
+         <Key>Return: CRAction()
+
+Xted*ioForm.namePrompt.label:       File name:
+Xted*ioForm.nameVal.width:          250
+Xted*ioForm.nameVal.translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*ioForm.next.label: 
+Xted*ioForm.next.resizable:         True
+Xted*ioForm.formatPrompt.label:     File format:
+Xted*ioForm.formatBox.borderWidth:  0
+Xted*ioForm.formatBox.orientation:  horizontal
+Xted*ioForm.formatBox.plainFmt.label: Plain
+Xted*ioForm.formatBox.abiFmt.label:   ABI
+Xted*ioForm.formatBox.abiFmt.state:   True
+Xted*ioForm.formatBox.alfFmt.label:   ALF
+Xted*ioForm.formatBox.alfFmt.state:   False
+Xted*ioForm.formatBox.bottomFmt.label: Bottom
+Xted*ioForm.formatBox.topFmt.label:   Top
+Xted*ioForm.formatBox.topFmt.state:   True
+
+! The `default' button for the dialogue has a wider border
+Xted*input.ioForm.OK.borderWidth:     3
+Xted*output.ioForm.OK.borderWidth:    3
+Xted*check.ioForm.Cancel.borderWidth: 3
+Xted*quit.ioForm.Cancel.borderWidth:  3
+
+! Help shell widget
+Xted.Help.width:  525
+Xted.Help.height: 200
+Xted.Help.minWidth: 525
+Xted.Help.maxWidth: 525
+Xted.Help.minHeight: 150
+Xted.Help.title: ted help
+Xted.Help.form.rmButton.label:  Remove help
+Xted.Help.form.rmButton.top:    chainTop
+Xted.Help.form.rmButton.bottom: chainTop
+Xted.Help.form.rmButton.left:   chainLeft
+Xted.Help.form.rmButton.right:  chainLeft
+Xted.Help.form.text.top:    chainTop
+Xted.Help.form.text.bottom: chainBottom
+Xted.Help.form.text.left:   chainLeft
+Xted.Help.form.text.right:  chainRight
diff --git a/src/newted/asciited.c b/src/newted/asciited.c
new file mode 100644
index 0000000..94df614
--- /dev/null
+++ b/src/newted/asciited.c
@@ -0,0 +1,364 @@
+/*
+    Title: 	 asciited
+
+    File: 	 asciited.c
+    Purpose:	 Main module
+    Last update: Friday Nov 8, 1991
+
+    Change log:
+
+*/
+
+
+/* 
+
+This program prints out in an ascii file information about a
+trace file...Each type of data in the file is delimited by *.
+(1) original base calls made by the software (0..numOrigBases-1)*
+(2) trace position at which each base was called (0..numOrigBases-1)*
+(3) trace positions (trace indices...which simply increment
+by 1 and go from 0 up to the total number of points in the trace, NPoints)
+(0...NPoints-1) *
+(4) trace data for each position for the A trace (0...NPoints-1)*
+(5) trace data for each position for the C trace (0...NPoints-1)*
+(6) trace data for each position for the G trace (0...NPoints-1)*
+(7) trace data for each position for the T trace (0...NPoints-1)*
+
+
+call the program using:
+asciited -ABI abi_filename     not using this right now -output output.filename
+
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "seqIOABI.h"
+#include "seqIOALF.h"
+#include "seqIOPlain.h"
+#include "mystdlib.h"  /* IMPORT: exit */
+
+#define BasesPerLine 50
+#define NumPerLine 15
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   char * plain;
+    char * ABI;
+    char * ALF;
+    char * SCF;
+    char * output;
+    char * enzyme;
+    int bottom;
+    char * raw;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF) {tracefilename} [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       -output {outputfilename}\n");
+}
+
+
+int writeInfo(Seq seq, char *fn)
+{   FILE *fp;
+    int baseNum, lineLen, point;
+    int to_screen=0;
+    
+    /* initialize the complement array */
+    oppInitialize();
+    
+    /* Open for writing, text */
+/*      if ((fp=fopen(fn, "w")) == NULL) return(False);*/
+
+    
+    lineLen = 0;
+    
+    /* assumes there is no -bottom option */
+    
+
+/* print out all the original software-called bases*/
+    for (baseNum=0;
+	 baseNum<seq->NorigBases;
+	 baseNum++)
+      { 
+	
+	fputc(seq->base[baseNum], stdout);
+	if (++lineLen == BasesPerLine)
+	  {   fputc('\n', stdout);
+	      lineLen = 0;
+	    }
+      }
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    lineLen = 0;
+    
+/* printout all the base position information for each base */
+    
+    for (baseNum=0;
+	 baseNum<seq->NorigBases;
+	 baseNum++)
+      {   fprintf(stdout,"%d ",seq->basePos[baseNum]);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    
+    lineLen = 0;
+
+/* print out trace pos indices */
+
+    for (point=0;
+	 point<seq->NPoints;
+	 point++)
+      {   fprintf(stdout,"%d ",point);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+
+
+    
+/* print out A trace for each trace point */
+    for (point=0;
+	 point<seq->NPoints;
+	 point++)
+      {   fprintf(stdout,"%d ",seq->traceA[point]);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    
+    lineLen = 0;
+    
+/* print out C trace for each trace point */
+    for (point=0;
+	 point<seq->NPoints;
+	 point++)
+      {   fprintf(stdout,"%d ",seq->traceC[point]);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    
+    lineLen = 0;
+
+/* print out G trace for each trace point */
+    
+    for (point=0;
+	 point<seq->NPoints;
+	 point++)
+      {   fprintf(stdout,"%d ",seq->traceG[point]);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    lineLen = 0;
+
+/* print out T trace for each trace point */
+    
+    for (point=0;
+	 point<seq->NPoints;
+	 point++)
+      {   fprintf(stdout,"%d ",seq->traceT[point]);
+	  if (++lineLen >= NumPerLine)
+	    {   fputc('\n', stdout);
+		lineLen = 0;
+	      }
+	}
+    
+    if (lineLen != 0) fputc('\n', stdout);
+    fputc('*', stdout);
+    fputc('\n', stdout);
+    lineLen = 0;
+    
+    
+    
+    
+    
+/*    (void) fclose(fp);*/
+    
+    return(True);
+    
+  }
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{
+    Arguments args;
+    char *fn;
+    char *o_fn;
+    Seq currSeq;
+    int rc;
+    int count;
+    char *enzString = "TAGAGGCTCCCC";
+
+    /* parse command line arguments here */
+
+    count=1;
+
+    fn = NULL;
+    args.ABI = NULL;
+    args.ALF = NULL;
+    args.SCF = NULL;
+    args.output = NULL;
+    args.plain = NULL;
+    args.bottom = 0;
+    args.raw = 0;
+    args.enzyme = NULL;
+
+    while (count < argc) {
+	char *a = argv[count];
+	if ( *a != '-') {
+            usage(argc, argv);
+fprintf(stderr,"not a - switch\n");
+	    exit(1);
+	}
+
+	if (strcmp(a,"-PLN") == 0) {
+	    args.plain = fn = argv[++count];
+	} else if (strcmp(a,"-ABI") == 0) {
+	    args.ABI = fn = argv[++count];
+	} else if (strcmp(a,"-ALF") == 0) {
+	    args.ALF = fn = argv[++count];
+	} else if (strcmp(a,"-SCF") == 0) {
+	    args.SCF = fn = argv[++count];
+	} else if (strcmp(a,"-output") == 0) {
+	    args.output = o_fn = argv[++count];
+	} else if (strcmp(a,"-bottom") == 0) {
+	    args.bottom = atoi(argv[++count]);
+	} else if (strcmp(a,"-raw") == 0) {
+	    args.raw = argv[++count];
+	} else if (strcmp(a,"-enzyme") == 0) {
+	    args.enzyme = enzString = argv[++count];
+	} else {
+            usage(argc, argv);
+fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+        count++;
+    }
+
+
+
+
+
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((args.plain==NULL &&
+	 args.ABI==NULL   &&
+	 args.SCF==NULL   &&
+	 args.ALF==NULL))
+    {   
+	usage(argc, argv);
+fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+/*    if ( args.output==NULL )
+    {   
+	usage(argc, argv);
+fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+*/
+    if (args.plain != NULL)
+	currSeq = readSeqPlain(fn,args.bottom,enzString);
+    else if (args.ABI != NULL)
+	currSeq = readSeqABI(fn,args.bottom,enzString);    
+    else if (args.ALF != NULL)
+	currSeq = readSeqALF(fn,args.bottom,enzString);
+    else if (args.SCF != NULL)
+	currSeq = readSeqSCF(fn,args.bottom,enzString);
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",fn);
+	exit(1);
+    }
+
+/*    findLeftCutoff(currSeq, enzString);
+
+    rc=writeSeq(currSeq, o_fn, (args.raw == NULL)?o_fn:args.raw,1);*/
+
+    rc=writeInfo(currSeq, o_fn);
+
+    if (!rc) {
+	fprintf(stderr,"Error writing file %s\n",o_fn);
+	exit(1);
+    }
+
+}
+
+
+
+
+
diff --git a/src/newted/autoted.c b/src/newted/autoted.c
new file mode 100644
index 0000000..d585b7e
--- /dev/null
+++ b/src/newted/autoted.c
@@ -0,0 +1,186 @@
+/*
+    Title: 	 autoted
+
+    File: 	 autoted.c
+    Purpose:	 Main module
+    Last update: Thursday 11 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "seqRead.h"
+#include "mystdlib.h"  /* IMPORT: exit */
+
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   char * plain;
+    char * ABI;
+    char * ALF;
+    char * SCF;
+    char * output;
+    char * enzyme;
+    int bottom;
+    char * raw;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF ) {tracefilename} [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       -output {outputfilename}\n");
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+void setScaleFactor(float f)
+/* to keep linking happy */
+{
+}
+
+void main(unsigned int argc, char **argv)
+{
+    Arguments args;
+    char *fn;
+    char *o_fn;
+    Seq currSeq;
+    int rc;
+    int count;
+    char *enzString = "N";
+
+    /* parse command line arguments here */
+
+    count=1;
+
+    fn = NULL;
+    args.ABI = NULL;
+    args.ALF = NULL;
+    args.SCF = NULL;
+    args.output = NULL;
+    args.plain = NULL;
+    args.bottom = 0;
+    args.raw = 0;
+    args.enzyme = NULL;
+
+    while (count < argc) {
+	char *a = argv[count];
+	if ( *a != '-') {
+            usage(argc, argv);
+fprintf(stderr,"not a - switch\n");
+	    exit(1);
+	}
+
+	if (strcmp(a,"-PLN") == 0) {
+	    args.plain = fn = argv[++count];
+	} else if (strcmp(a,"-ABI") == 0) {
+	    args.ABI = fn = argv[++count];
+	} else if (strcmp(a,"-ALF") == 0) {
+	    args.ALF = fn = argv[++count];
+	} else if (strcmp(a,"-SCF") == 0) {
+	    args.SCF = fn = argv[++count];
+	} else if (strcmp(a,"-output") == 0) {
+	    args.output = o_fn = argv[++count];
+	} else if (strcmp(a,"-bottom") == 0) {
+	    args.bottom = atoi(argv[++count]);
+	} else if (strcmp(a,"-raw") == 0) {
+	    args.raw = argv[++count];
+	} else if (strcmp(a,"-enzyme") == 0) {
+	    args.enzyme = enzString = argv[++count];
+	} else {
+            usage(argc, argv);
+fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+        count++;
+    }
+
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((args.plain==NULL &&
+	 args.ABI==NULL   &&
+	 args.ALF==NULL   &&
+	 args.SCF==NULL))
+    {   
+	usage(argc, argv);
+fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+    if ( args.output==NULL )
+    {   
+	usage(argc, argv);
+fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+
+    if (args.plain != NULL)
+	currSeq = readSeq(fn,args.bottom,enzString,"plainFmt");
+    else if (args.ABI != NULL)
+	currSeq = readSeq(fn,args.bottom,enzString,"abiFmt");
+    else if (args.ALF != NULL)
+	currSeq = readSeq(fn,args.bottom,enzString,"alfFmt");
+    else if (args.SCF != NULL)
+	currSeq = readSeq(fn,args.bottom,enzString,"scfFmt");
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",fn);
+	exit(1);
+    }
+
+
+    rc=writeSeq(currSeq, o_fn, (args.raw == NULL)?fn:args.raw,1);
+
+    if (!rc) {
+	fprintf(stderr,"Error writing file %s\n",o_fn);
+	exit(1);
+    }
+
+}
+
+
diff --git a/src/newted/basesDisplay.c b/src/newted/basesDisplay.c
new file mode 100644
index 0000000..79ada06
--- /dev/null
+++ b/src/newted/basesDisplay.c
@@ -0,0 +1,1014 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Wednesday 8 April 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    A caret is XOR drawn in the editable graph on top of (i.e. after)
+    all characters have been drawn.    
+
+    Changes to this module by lfw:
+        module displayEdBases and displayBases
+          added int baseNum as a parameter so that
+        the user could specify a baseNum via the
+        command line baseNum parameter or via
+        specifying a string to search for ( if
+        you specify a string to search for
+        ) and have that be where the caret
+        is originally positioned.  Before this
+        modification, the caret was always
+        initially positioned at base 0.
+    Changes made by sd:
+	leftCutoffPoint: handle NULLPoint
+    Changes made by SD:
+	Support monochrome displays when marking cutoffs
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "basesDisplay.h"
+#include "display.h"
+
+#include "Graph.h"
+#include "seq.h"     /* IMPORT: Seq, NULLSeq, getNPoints */
+
+#include <ctype.h>   /* IMPORT: toupper */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Viewport.h>
+
+
+
+
+/* ---- Constants ---- */
+
+/* Caret stuff */
+#define caretWidth  ((Dimension) 6)
+#define caretHeight ((Dimension) 3)
+static char caretBits[] = {0x0c, 0x1e, 0x33};
+
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static int NorigBases, NedBases;
+
+
+/* Font information */
+static Dimension charWidth;
+static Dimension halfCharWidth;
+static Dimension charVOffset; /* Of char baseline from top of graph=ascent+1 */
+
+/* The two graphs */
+static Widget origSeqWid;
+static Widget edSeqWid;
+/* the numbers above the two graphs*/
+static Widget baseNumWid;
+
+static Dimension graphWidth;     /* Current width of the graph */
+static Dimension graphHeight;    /* Height of the graph */
+static Dimension plotEdgeOffset;
+static Dimension plotWidth;      /* Current width of the plot */
+
+static int leftCutoff, rightCutoff;
+
+static GC Agc, Cgc, Ggc, Tgc;
+static Pixel normBackground, dimBackground;
+
+/* monochrome stippling */
+static GC greygc;
+static int planes;
+
+/* Caret stuff */
+/*
+    The caret's position is represented by a base number, caretBN, (the
+    point at which the corresponding base occurs is maintained as caretP)
+    and is drawn to the right of that base. To allow insertion at the left
+    end and deletion at the right end, it can range from -1..NBase-1.
+*/
+static GC caretGC;
+static Pixmap caretPixmap = (Pixmap) 0;
+static int caretBN;
+static int caretP;
+
+
+
+
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(int point)
+{   return(((point * plotWidth) / (NPoints-1)) + plotEdgeOffset);
+}
+
+
+int pixelToPoint(int pixel)
+{   return(((pixel-plotEdgeOffset) * (NPoints-1)) / plotWidth);
+}
+
+
+static int leftCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NedBases = getNBases(currSeq, EdBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum);
+    pR = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+static void writeBase(Widget wid, int baseNum, Widget bnwid)
+{   WhichBases which = (wid == origSeqWid) ? OrigBases : EdBases;
+    char       base  = getBase(currSeq, which, baseNum);
+    int        point = baseNumToPoint(currSeq, which, baseNum);
+    GC gc;
+    char lbl[10];
+
+    /* Get the correct GC */
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+        default:  gc=Ggc; break;
+    }
+
+    XDrawImageString(XtDisplay(wid),
+		     XtWindow(wid),
+		     gc,
+		     pointToPixel(point)-halfCharWidth, charVOffset,
+		     &base, 1);
+/*    if (currSeq->bottom)
+      sprintf(lbl,"%d",(currSeq->NorigBases-1)-baseNum);
+    else*/
+      sprintf(lbl,"%d",baseNum);
+    if ((wid == origSeqWid) &&  (baseNum%10 == 0)) {
+      XDrawString(XtDisplay(bnwid),
+		     XtWindow(bnwid),
+		     gc,
+		     pointToPixel(point)-halfCharWidth, charVOffset,
+		     lbl,strlen(lbl));
+    }
+}
+
+
+
+
+static void caretExpose(int *pix0, /* First exposed pixel */
+			int *pixN) /* Last exposed pixel */
+/*
+    Extend the expose region, if necessary, so that the caret
+    is either completely, or not at all exposed.
+*/
+{   int caretLwb, caretUpb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+    caretUpb = caretLwb + caretWidth;
+
+    /* Return if the caret is fully outside the exposed region */
+    if ((caretUpb<*pix0) || (caretLwb>*pixN)) return;
+
+    /* Return if the caret is fully inside the exposed region */
+    if ((caretLwb>=*pix0) && (caretUpb<=*pixN)) return;
+
+    /* Extend the exposed region */
+    *pix0 = caretLwb<*pix0 ? caretLwb : *pix0;
+    *pixN = caretUpb>*pixN ? caretUpb : *pixN;
+    XClearArea(XtDisplay(edSeqWid),
+	       XtWindow(edSeqWid),
+	       caretLwb, 0,
+	       caretWidth, graphHeight,
+	       False);
+}
+
+
+
+
+static void drawCaret()
+{   int caretLwb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+
+    XCopyPlane(XtDisplay(edSeqWid),
+	       caretPixmap, XtWindow(edSeqWid),
+	       caretGC,
+	       0, 0,
+	       (unsigned int) caretWidth, (unsigned int) caretHeight,
+	       caretLwb, charVOffset,
+	       1);
+}
+
+
+
+
+static void drawExposedCaret(int p0, /* First exposed pixel */
+			     int pN) /* Last exposed pixel */
+/*
+    Draw the caret, if it is within the exposed points.
+*/
+{   int caretLwb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+
+    if ((caretLwb<=pN) && (caretLwb>=p0))
+        drawCaret();
+    else
+        return;
+}
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN;    /* Affected region: pixels */
+    int p0, pN;    /* Affected region: points */
+    int b, b0, bN; /* Affected region: bases */
+    int leftCutoffP, rightCutoffP;
+    int NBases;
+    int whichBases;
+
+    if (currSeq == NULLSeq) return;
+
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+    leftCutoffP  = leftCutoffPoint(leftCutoff);
+    rightCutoffP = rightCutoffPoint(rightCutoff);
+    NBases     = (wid==origSeqWid) ? NorigBases : NedBases;
+    whichBases = (wid==origSeqWid) ? OrigBases  : EdBases;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+
+    /* For the editable sequence, ensure the caret was not half exposed */
+    if (wid == edSeqWid) caretExpose(&x0, &xN);
+
+
+    /* Convert to affected points after trimming to the plot proper */
+    p0=pixelToPoint((x0<plotEdgeOffset)          ?plotEdgeOffset          :x0);
+    pN=pixelToPoint((xN>plotWidth+plotEdgeOffset)?plotWidth+plotEdgeOffset:xN);
+    if (pN<NPoints-1) pN++;
+
+
+    if (p0<leftCutoffP)
+    /*
+        Some of the exposed region is in the left cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = p0;
+       int lastP  = (pN<leftCutoffP) ? pN : leftCutoffP;
+
+       if (whichBases == EdBases)
+       {   int pix0 = pointToPixel(firstP);
+	   int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, graphHeight);
+       } else {
+           XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				dimBackground);
+           XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		      pix0, 0,
+		      pixN-pix0, graphHeight,
+		      False);
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				normBackground);
+       }
+       }
+    }
+      
+
+    if (pN>rightCutoffP)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = (p0>rightCutoffP) ? p0 : rightCutoffP;
+       int lastP  = pN;
+
+       if (whichBases == EdBases)
+       {   int pix0 = pointToPixel(firstP);
+	   int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, graphHeight);
+       } else {
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				dimBackground);
+	   XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		      pix0, 0,
+		      pixN-pix0, graphHeight,
+		      False);
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				normBackground);
+       }
+       }
+    }
+
+
+    /*
+        Now draw the affected bases.
+    */
+
+    /* The first base is to the left of the exposed region */
+    b0 = pointToBaseNum(currSeq, whichBases, p0);
+    b0 =   (b0 == NULLBaseNum) ? NBases-1
+         : (b0>0)              ? b0-1
+	 :                       0;
+    /* The last base is to the right of the exposed region */
+    bN = pointToBaseNum(currSeq, whichBases, pN);
+    bN = (bN == NULLBaseNum) ? NBases-1 : bN;
+
+    if (b0<leftCutoff)
+    /*
+        Some of the bases are in the left cutoff area.
+	Draw them, the edited ones with the dim background.
+    */
+    {   if (whichBases == EdBases)
+	{   /* Set all GCs to have dim backgrounds */
+	    XSetBackground(XtDisplay(wid), Agc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Cgc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Ggc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Tgc, dimBackground);
+	}
+
+	for (b=b0; b<=leftCutoff-1; b++) writeBase(wid, b, baseNumWid);
+	
+	if (whichBases == EdBases)
+        {   XSetBackground(XtDisplay(wid), Agc, normBackground);
+	    XSetBackground(XtDisplay(wid), Cgc, normBackground);
+	    XSetBackground(XtDisplay(wid), Ggc, normBackground);
+	    XSetBackground(XtDisplay(wid), Tgc, normBackground);
+	}
+    }
+
+    if (bN>=leftCutoff && b0-NedBases-1<rightCutoff)
+    /*
+        Some of the exposed region is in the middle area.
+	Draw the bases.
+    */
+    {   int firstBase = (b0<leftCutoff) ? leftCutoff : b0;
+	int lastBase  =   (bN>NedBases-1-rightCutoff)
+	                 ? NedBases-1-rightCutoff
+			 : bN;
+
+	for (b=firstBase; b<=lastBase; b++) writeBase(wid, b, baseNumWid);
+    }
+
+    if (bN>NedBases-1-rightCutoff)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  if (whichBases == EdBases)
+       {   XSetBackground(XtDisplay(wid), Agc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Cgc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Ggc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Tgc, dimBackground);
+       }
+
+       /* Plot the affected bases */
+       for (b=NedBases-rightCutoff; b<=bN; b++) writeBase(wid, b, baseNumWid);
+
+       if (whichBases == EdBases)
+       {   XSetBackground(XtDisplay(wid), Agc, normBackground);
+	   XSetBackground(XtDisplay(wid), Cgc, normBackground);
+	   XSetBackground(XtDisplay(wid), Ggc, normBackground);
+	   XSetBackground(XtDisplay(wid), Tgc, normBackground);
+       }
+    }
+
+
+    /* If this is the editable sequence, display the caret */
+    if (wid == edSeqWid) drawExposedCaret(x0, xN);
+}
+
+
+
+
+static void userCallback(Widget wid,
+			 XtPointer client_data, XtPointer call_data)
+/*
+    Handles all XtNcallbacks, i.e., buttons and keys.
+*/
+{   XEvent *eventP = (XEvent *) call_data;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Callback to the `display' module.
+    */
+    userEvent(eventP);
+}
+
+
+
+
+/* ---- Exports ---- */
+void redisplayBases()
+{
+    /*
+    **  Clear the area and generate an expose.
+    */
+    XClearArea(XtDisplay(baseNumWid), XtWindow(baseNumWid),
+		0, 0,
+		0, 0,
+		True);
+    XClearArea(XtDisplay(origSeqWid), XtWindow(origSeqWid),
+		0, 0,
+		0, 0,
+		True);
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		0, 0,
+		0, 0,
+		True);
+}
+
+
+
+
+Widget createBasesDisplay(Widget parentWid, Widget fromVertWid,
+				 Dimension width)
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    currSeq    = NULLSeq;
+    graphWidth = width;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    graphWidth);  nargs++;
+    baseNumWid = XtCreateManagedWidget("baseNumWid", graphWidgetClass,
+				       parentWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, baseNumWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    graphWidth);  nargs++;
+    origSeqWid = XtCreateManagedWidget("origSeq", graphWidgetClass,
+				       parentWid, args, nargs);
+    XtAddCallback(origSeqWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(origSeqWid, XtNexposeCallback, exposeCallback, NULL);
+
+        /*
+	    Get font information.
+	    The text graphs are set to be two pixels higher than
+	    (ascent+descent). Characters are then plotted with
+	    a vertical offset one more than the ascent.
+	*/
+        { XFontStruct *fontStruct;
+
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNfont, &fontStruct); nargs++;
+	  XtGetValues(origSeqWid, args, nargs);
+	  charWidth      = fontStruct->max_bounds.width;
+	  halfCharWidth  = fontStruct->max_bounds.width/2;
+	  charVOffset    = fontStruct->ascent+1;
+	  plotEdgeOffset = halfCharWidth+2;
+	  plotWidth      = graphWidth-(2*plotEdgeOffset);
+	  graphHeight    = fontStruct->ascent + fontStruct->descent + 2;
+	  /* Make this graph tall enough to hold it */
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNheight, graphHeight); nargs++;
+	  XtSetValues(origSeqWid, args, nargs);
+	  /* Make this graph tall enough to hold it */
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNheight, graphHeight); nargs++;
+	  XtSetValues(baseNumWid, args, nargs);
+	}
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, origSeqWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,  graphWidth);   nargs++;
+    XtSetArg(args[nargs], XtNheight, graphHeight);  nargs++;
+    edSeqWid = XtCreateManagedWidget("edSeq", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(edSeqWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(edSeqWid, XtNexposeCallback, exposeCallback, NULL);
+
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(origSeqWid);
+    Cgc = GraphGC2(origSeqWid);
+    Ggc = GraphGC3(origSeqWid);
+    Tgc = GraphGC4(origSeqWid);
+
+    /*
+        Get the backgrounds
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNbackground,    &normBackground);    nargs++;
+    XtSetArg(args[nargs], XtNdimBackground, &dimBackground); nargs++;
+    XtGetValues(edSeqWid, args, nargs);
+
+    planes = DisplayPlanes(XtDisplay(edSeqWid),DefaultScreen(XtDisplay(edSeqWid)));
+    if (planes == 1) {
+	/* prepare for monochrome display */
+	XtGCMask valuemask = (GCStipple | GCFillStyle);
+	XGCValues values;
+	Pixmap grey_stipple;
+
+#define grey_width 4
+#define grey_height 4
+	static char grey_bits[] = {
+	   0x0e, 0x0b, 0x0e, 0x0b};
+
+	grey_stipple =
+	    XCreateBitmapFromData(XtDisplay(edSeqWid),
+				  RootWindowOfScreen(XtScreenOfObject(edSeqWid)),
+				  grey_bits,
+				  grey_width,
+				  grey_height);
+
+	values.stipple = grey_stipple;
+	values.fill_style = FillOpaqueStippled;
+	greygc = XCreateGC(XtDisplayOfObject(edSeqWid),
+			     RootWindowOfScreen(XtScreenOfObject(edSeqWid)),
+			     valuemask, &values);
+
+    }
+ 
+    return(edSeqWid);
+}
+
+
+
+
+void getBasesFontInfo(Dimension *offset, Dimension *chWidth)
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+{   *offset  = plotEdgeOffset;
+    *chWidth = charWidth;
+}
+
+
+
+
+void displayBases(Seq seq, int baseNum)
+/*
+    Display the bases of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NPoints     = getNPoints(currSeq);
+    NorigBases  = getNBases(currSeq, OrigBases);
+    NedBases    = getNBases(currSeq, EdBases); /* Should be the same */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+
+
+        /* 
+        Ensure the caret for the editable graph exists.
+	This is *not* the place to do this, but it has to be done
+	when we are sure the graph has been realised.
+	In either case, move it to the start.
+    */
+/*    if (caretPixmap == (Pixmap) 0)*/
+    if (caretPixmap == (Pixmap) 0)
+    {   XGCValues values;
+
+	caretPixmap = XCreateBitmapFromData(XtDisplay(edSeqWid),
+					    XtWindow(edSeqWid),
+					    caretBits,
+					    caretWidth,
+					    caretHeight);
+    
+	values.function = GXxor;
+	caretGC = XCreateGC(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+			    GCFunction, &values);
+	XCopyGC(XtDisplay(edSeqWid),
+		GraphGC3(edSeqWid),
+		(GCForeground | GCBackground | GCLineStyle | GCDashList),
+		caretGC);
+    }
+
+    /*
+        Initially, the caret is placed hard to the left.
+    
+    caretBN = -1;
+    caretP  = 0; */
+
+/* modification by lfw 10/24/90,
+   the following section added to move the caret to the position
+  the user requests for baseNum , either by specifying 
+   -baseNum on the command line or -astring */
+
+    caretBN = baseNum-1;
+/* baseNum-1 because it inserts the caret at the space after baseNum */
+
+    if (baseNum == -1)
+      caretP = 0;
+    else
+      caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+    moveCaretTo(baseNum-1);
+
+}
+
+
+void displayEdBases(Seq seq, int baseNum)
+/*
+    Display the bases of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NedBases    = getNBases(currSeq, EdBases);
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+
+
+    /* 
+        Ensure the caret for the editable graph exists.
+	This is *not* the place to do this, but it has to be done
+	when we are sure the graph has been realised.
+	In either case, move it to the start.
+    */
+/*    if (caretPixmap == (Pixmap) 0)*/
+    if (caretPixmap == (Pixmap) 0)
+    {   XGCValues values;
+
+	caretPixmap = XCreateBitmapFromData(XtDisplay(edSeqWid),
+					    XtWindow(edSeqWid),
+					    caretBits,
+					    caretWidth,
+					    caretHeight);
+    
+	values.function = GXxor;
+	caretGC = XCreateGC(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+			    GCFunction, &values);
+	XCopyGC(XtDisplay(edSeqWid),
+		GraphGC3(edSeqWid),
+		(GCForeground | GCBackground | GCLineStyle | GCDashList),
+		caretGC);
+    }
+
+    /*
+        Initially, the caret is placed hard to the left.
+    
+    caretBN = -1;
+    caretP  = 0; */
+
+/* modification by lfw 10/24/90,
+   the following section added to move the caret to the position
+  the user requests for baseNum , either by specifying 
+ -baseNum on the command line or -astring */
+
+    caretBN = baseNum-1;
+/* baseNum-1 because it inserts the caret at the space after baseNum */
+
+    if (baseNum == -1)
+      caretP = 0;
+    else
+      caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+    moveCaretTo(baseNum-1);
+	
+}
+
+
+
+
+void unDisplayBases()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+}
+
+
+
+
+void setBasesWidth(Dimension width)
+/*
+    Set the width of the current sequence bases display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(origSeqWid, args, nargs);
+    XtSetValues(edSeqWid,   args, nargs);
+    XtSetValues(baseNumWid, args, nargs);
+
+    graphWidth = width;
+    plotWidth  = graphWidth-(2*plotEdgeOffset);
+
+    /*
+        Clear the graphs.
+    */
+    if (XtIsRealized(origSeqWid))
+        XClearWindow(XtDisplay(origSeqWid), XtWindow(origSeqWid));
+    if (XtIsRealized(edSeqWid))
+        XClearWindow(XtDisplay(edSeqWid), XtWindow(edSeqWid));
+    if (XtIsRealized(baseNumWid))
+        XClearWindow(XtDisplay(baseNumWid), XtWindow(baseNumWid));
+}
+
+
+
+
+int getCaret()
+/*
+    Get the current position of the caret.
+*/
+{   return(caretBN);
+}
+
+
+
+
+void moveCaretLeft()
+/*
+    Move the caret left one base.
+*/
+{   drawCaret();
+    
+    caretBN = (caretBN==-1) ? -1 : caretBN-1;
+    caretP  = (caretBN==-1) ?  0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void moveCaretRight()
+/*
+    Move the caret right one base.
+*/
+{   drawCaret();
+
+    caretBN = (caretBN==NedBases-1) ? NedBases-1 : caretBN+1;
+    caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void moveCaretTo(int baseNum)
+/*
+    Move the caret to after base `baseNum'.
+*/
+{   drawCaret();
+
+    caretBN = baseNum;
+    caretP  = (caretBN==-1) ? 0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void baseInserted()
+/*
+    The editable sequence has changed by the insertion of a base
+    to the right of the base indicated by the caret. Update our display.
+*/
+{   int point, pix;
+
+    drawCaret();
+
+    NedBases++;
+    caretBN++;
+    caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+
+    /* Expose the new base */
+    point = baseNumToPoint(currSeq,EdBases,caretBN);
+    pix   = pointToPixel(point) - halfCharWidth;
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+	       pix, 0,
+	       charWidth, graphHeight,
+	       True);
+}
+
+
+
+
+void baseDeleted()
+/*
+    The editable sequence has changed by the deletion of a base
+    from the left of the caret. Update our display.
+*/
+{   int basePix = pointToPixel(caretP) - halfCharWidth;
+		    
+    drawCaret();
+
+    NedBases--;
+    caretBN--;;
+    caretP = (caretBN==-1) ?  0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+
+    /* Clear and expose the deleted base */
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+	       basePix, 0,
+	       charWidth, graphHeight,
+	       True);
+}
+
+
+
+
+void basesCutoffChange()
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+{   int newLeftCutoff, newRightCutoff;
+
+    getCutoffs(currSeq, &newLeftCutoff, &newRightCutoff);
+
+
+    if (newLeftCutoff != leftCutoff)
+    /*
+        The region between newLeftCutoff and leftCutoff has changed.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	*/
+	if (newLeftCutoff<leftCutoff)
+	{   b0 = newLeftCutoff;
+	    bN = leftCutoff;
+	}
+	else
+	{   b0 = leftCutoff;
+	    bN = newLeftCutoff;
+	}
+
+	p0 = leftCutoffPoint(b0);
+	pN = leftCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		   pix0, 0,
+		   pixN-pix0, graphHeight,
+		   True);
+    }
+
+
+    if (newRightCutoff != rightCutoff)
+    /*
+        The region between newRightCutoff and rightCutoff has
+	changed. Expose it.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	    N.B. lower cutoff means the base is further right.
+	*/
+	if (newRightCutoff<rightCutoff)
+	{   b0 = rightCutoff;
+	    bN = newRightCutoff;
+	}
+	else
+	{   b0 = newRightCutoff;
+	    bN = rightCutoff;
+	}
+
+
+	p0 = rightCutoffPoint(b0);
+	pN = rightCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		   pix0, 0,
+		   pixN-pix0, graphHeight,
+		   True);
+  
+    }
+
+
+    /*
+        Update static data.
+    */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+
+
+
+
diff --git a/src/newted/basesDisplay.h b/src/newted/basesDisplay.h
new file mode 100644
index 0000000..83eb02e
--- /dev/null
+++ b/src/newted/basesDisplay.h
@@ -0,0 +1,114 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Wed Jun 13 1990
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget createBasesDisplay(Widget parentWid, Widget fromVertWid,
+				 Dimension width);
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+
+
+extern void getBasesFontInfo(Dimension *offset, Dimension *chWidth);
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+
+
+extern void displayBases(Seq seq, int baseNum);
+/*
+    Display the bases of sequence `seq'.
+*/
+
+
+extern void unDisplayBases();
+/*
+    Cease displaying the bases of the current sequence (if any).
+*/
+
+
+extern void setBasesWidth(Dimension width);
+/*
+    Set the width of the current sequence bases display.
+*/
+
+
+extern int getCaret();
+/*
+    Get the current position of the caret.
+*/
+
+
+extern void moveCaretLeft();
+/*
+    Move the caret left one base.
+*/
+
+
+extern void moveCaretRight();
+/*
+    Move the caret right one base.
+*/
+
+
+extern void moveCaretTo(int baseNum);
+/*
+    Move the caret to after base `baseNum'.
+*/
+
+
+extern void baseInserted();
+/*
+    The editable sequence has changed by the insertion of a base
+    to the right of the base indicated by the caret. Update our display.
+*/
+
+
+extern void baseDeleted();
+/*
+    The editable sequence has changed by the deletion of a base
+    from the left of the caret. Update our display.
+*/
+
+
+extern int pixelToPoint(int pixel);
+/*
+    Given `pixel' on the bases display, return the corresponding point.
+    Fractional results are truncated - this may mean a point one too
+    low is indicated, but a point too high is never indicated.
+*/
+
+
+extern void basesCutoffChange();
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
diff --git a/src/newted/dialogues.c b/src/newted/dialogues.c
new file mode 100644
index 0000000..c583e85
--- /dev/null
+++ b/src/newted/dialogues.c
@@ -0,0 +1,1310 @@
+/* 
+    Title:       dialogues
+
+    File: 	 dialogues.c
+    Purpose:	 Dialogues
+    Last update: Thurs 31 October 1991
+
+    Change log:
+
+	27.11.90 SD  outputOKCallback() now passes seqName in writeSeqABI() call
+	28.11.90 SD  put undesirables under STLOUIS compilation flag
+	02.01.91 SD  remerged with St Louis version
+	22.07.91 LFW put in -enzyme as a command line option for STLOUIS
+	17.09.91 LFW changed MRC switch to !AUTO_CLIP
+		     changed STLOUIS switch to OUT_SEQ
+	31.10.91 SD  Popup dialogue if file exists on output
+	26.11.91 SD  Added SCF format
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include "values.h"     /* IMPORT: MAXLONG */
+
+#include "dialogues.h"
+#include "display.h"    /* IMPORT: displaySeq, unDisplaySeq */
+#include "info.h"       /* IMPORT: info */
+#include "seq.h"        /* IMPORT: Seq, NULLSeq */
+#include "seqRead.h"    /* IMPORT: readSeq */
+#include "seqIOEdit.h"  /* IMPORT: isDotNum, stripDotNum */
+#include "seqOutput.h"  /* IMPORT: writeSeq */
+#include "match.h"  /* IMPORT: writeSeq */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Dialog.h>
+
+
+/* ---- Globals ---- */
+
+char o_fn[200];
+char r_fn[200];
+
+/* ---- Statics ---- */
+
+static Seq currSeq = NULLSeq;
+static int indices[100]; /* array of starting indices of all of the
+			  matches a search for a string found */
+static int num_matches; /* total number of matches to a given
+			    string found in the currSeq */
+static int next_indices; /* the next indices to search for
+			     if the user wishes to search for
+			     the next occurence of string */
+
+/* ----- get resources -------*/
+/* allows you to get the default enzString to search for
+   from the Xted file */
+
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    NULL
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+
+void set_strand_label(Widget strandWid, Seq seq)
+{
+    Cardinal nargs;
+    Arg args[1];
+
+    nargs = 0;
+    if (seq->bottom) {
+	XtSetArg(args[nargs], XtNlabel, "Strand: Bottom"); nargs++;
+    } else {
+	XtSetArg(args[nargs], XtNlabel, "Strand: Top"); nargs++;
+    }
+    XtSetValues(strandWid, args, nargs);
+}    
+
+
+/* ---- Callbacks ---- */
+
+
+static void inputOKCallback(Widget w,
+			    XtPointer client_data, XtPointer call_data)
+{
+    Widget popupWid, nameValWid, radioGroupWid, radioGroupWid2;
+    Widget seqNameWid, NorigBasesWid,strandWid;
+    char *radioData, *radioData2, *fn;
+    Arg args[10];
+    int nargs;
+    Seq oldSeq = currSeq;
+    int bottom;
+    char enzString[100];
+    AppData app_data;
+    
+    
+    popupWid       = (Widget) client_data;
+    nameValWid     = XtNameToWidget(popupWid, "ioForm.nameVal");
+    radioGroupWid  = XtNameToWidget(popupWid, "ioForm.formatBox.plainFmt");
+    radioGroupWid2 = XtNameToWidget(popupWid, "ioForm.formatBox.bottomFmt");
+    seqNameWid     = XtNameToWidget(XtParent(XtParent(popupWid)), "seqName");
+    NorigBasesWid  = XtNameToWidget(XtParent(XtParent(popupWid)), "NorigBases");
+    strandWid      = XtNameToWidget(XtParent(XtParent(popupWid)), "strand");
+    
+    XtGetApplicationResources(popupWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+    
+#ifndef AUTO_CLIP
+    app_data.enz='\0';
+#endif
+    
+    /* LFW, 05.02.91, for command line option to work */
+    if (strcmp(enzString,"")==0) {
+	if (app_data.enz != NULL)
+	    strcpy(enzString,app_data.enz);
+	else *enzString = '\0';
+    }
+    /*
+      Get the name and type of the new one and read it in.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstring, &fn); nargs++;
+    XtGetValues(nameValWid, args, nargs);
+    
+    radioData = (char *) XawToggleGetCurrent(radioGroupWid);
+    radioData2 = (char *) XawToggleGetCurrent(radioGroupWid2);
+    bottom = (strcmp(radioData2, "bottomFmt") == 0);
+    currSeq = readSeq(fn,bottom,enzString,radioData);
+    
+#ifdef DEF_OUT  /* out_seq means default output file name
+		   of input_filename.seq */
+    /* make a default output filename */
+    sprintf(o_fn,"%s",fn);
+    if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+#endif
+    
+    if (currSeq != NULLSeq)
+	{   char NBasesString[10];
+	    
+	    XtDestroyWidget(popupWid);
+	    
+	    /*
+	      Undisplay and dispose of the old sequence.
+	      */
+	    unDisplaySeq();
+	    freeSeq(oldSeq);
+	    
+	    /*
+	      Set the `file name' and `number of bases' labels.
+	      */
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNlabel, fn); nargs++;
+	    XtSetValues(seqNameWid, args, nargs);
+	    nargs = 0;
+	    sprintf(NBasesString, "%d", getNBases(currSeq, OrigBases));
+	    XtSetArg(args[nargs], XtNlabel, NBasesString); nargs++;
+	    XtSetValues(NorigBasesWid, args, nargs);
+	    /* set strand label */
+	    set_strand_label(strandWid, currSeq);
+	    
+	    /* original line        displaySeq(currSeq, -1, 1.0);, 
+	       new line added by lfw: */
+	    displaySeq(currSeq, 1, 0.3);
+	}
+    else
+	{   XBell(XtDisplay(popupWid), 100);
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNstring, ""); nargs++;
+	    XtSetValues(nameValWid, args, nargs);        
+	}
+}
+
+
+
+
+/********************************Check for overwriting***********************/
+#include <sys/types.h>
+#include <sys/stat.h>
+int checkOKwriteSeq(char *fn)
+/* See if file exists */
+{
+    struct stat buf;
+    return ( stat(fn,&buf) );
+}
+
+static void warningYesCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A yes response to the
+** warning about data not saved
+*/
+{
+    int *response = (int *) client_data;
+
+    *response = 1;
+}
+
+static void warningNoCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A No response to the
+** warning about data not saved
+*/
+{
+    int *response = (int *) client_data;
+
+    *response = 0;
+}
+
+
+static int warning(Widget parentWid, String warn_message)
+/*
+** Warn that changes have been made but contig hasn't been saved.
+*/
+{
+    Widget dialog;
+    Arg	args[10];
+    int nargs;
+    int Response;
+    Widget warningWid, warnFormWid;
+    Position  x, y;
+
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    XtTranslateCoords(parentWid,
+		      (Position) 0, (Position) 0,
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    warningWid = XtCreatePopupShell("warning", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    warnFormWid = XtCreateManagedWidget("warnForm", formWidgetClass,
+				      warningWid, args, nargs);
+
+
+
+    /* Create the form */
+    nargs = 0;
+    XtSetArg(args[nargs],XtNlabel,warn_message); nargs++;
+    dialog = XtCreateManagedWidget("warnwid", dialogWidgetClass,
+				      warnFormWid, args, nargs);
+    XawDialogAddButton(dialog,"Yes",warningYesCallback,(XtPointer)&Response);
+    XawDialogAddButton(dialog,"No",warningNoCallback,(XtPointer)&Response);
+
+    /* This is a modal dialogue */
+    Response = -1;
+    XtPopup(warningWid, XtGrabExclusive);
+
+    while (Response==-1)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(parentWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    XtDestroyWidget(warningWid);
+
+    return Response;
+}
+
+
+
+int check_file_is_ok(Widget w, char *fn)
+{
+    if (!checkOKwriteSeq(fn)) {
+	return warning(w,"File already exists\nOK to overwrite?");
+    } else
+	return 1;
+}
+
+/********************************************************************/
+
+
+
+
+ static void inputCancelCallback(Widget w,
+				 XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void outputOKCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nameValWid, radioGroupWid, seqNameWid;
+     Widget includeHeaderWid;
+     char *fn, *seqName;
+     Boolean rc;
+     Boolean includeHeader;
+     Arg args[10];
+     int nargs;
+
+     popupWid      = (Widget) client_data;
+     nameValWid    = XtNameToWidget(popupWid, "ioForm.nameVal");
+     radioGroupWid = XtNameToWidget(popupWid, "ioForm.formatBox.plainFmt");
+     includeHeaderWid = XtNameToWidget(popupWid, "ioForm.includeHeader");
+     seqNameWid    = XtNameToWidget(XtParent(XtParent(popupWid)), "seqName");
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNlabel, &seqName); nargs++;
+     XtGetValues(seqNameWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstring, &fn); nargs++;
+     XtGetValues(nameValWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstate, &includeHeader); nargs++;
+     XtGetValues(includeHeaderWid, args, nargs);
+
+     if (check_file_is_ok(w,fn)) {
+	 rc=writeSeq(currSeq, fn, (strlen(r_fn)==0)?seqName:r_fn,includeHeader);
+	 
+	 if (rc)
+	     {   XtDestroyWidget(popupWid);
+		 setDirty(currSeq, False);
+	     }
+	 else
+	     {   XBell(XtDisplay(popupWid), 100);
+		 nargs = 0;
+		 XtSetArg(args[nargs], XtNstring, ""); nargs++;
+		 XtSetValues(nameValWid, args, nargs);
+	     }
+     } else
+	 XBell(XtDisplay(popupWid), 100);
+ }
+
+
+
+
+ static void outputCancelCallback(Widget w,
+				  XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void checkOKCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, parentWid;
+
+     popupWid = (Widget) client_data;
+     parentWid = XtParent(popupWid);
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does want to overwrite the existing sequence.
+	 Mark it as clean so `inputSeq' will dispose of it.
+     */
+     setDirty(currSeq, False);
+     inputSeq(parentWid);
+ }
+
+
+
+
+ static void checkCancelCallback(Widget w,
+				 XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void quitOKCallback(Widget w,
+			    XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does not want to save the existing sequence.
+	 Quit the application.
+     */
+     freeSeq(currSeq);
+     XtDestroyApplicationContext(XtWidgetToApplicationContext(w));
+     exit(0);
+ }
+
+
+
+
+ static void quitCancelCallback(Widget w,
+				XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does not want to quit the application.
+     */
+ }
+
+  static void searchCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nameValWid, NorigBasesWid, nextWid;
+     char *astring;
+     Arg args[10];
+     int nargs;
+     int num,i,num_bases,baseNum;
+     int mag;
+     char *theSeq;
+
+     popupWid      = (Widget) client_data;
+     nameValWid    = XtNameToWidget(popupWid, "ioForm.nameVal");
+     NorigBasesWid = XtNameToWidget(XtParent(XtParent(popupWid)), "NorigBases");
+     nextWid    = XtNameToWidget(popupWid, "ioForm.next");
+
+     next_indices = 0;
+     num_matches = 0;
+
+     /*
+	 Get the name and type of the new one and read it in.
+     */
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstring, &astring); nargs++;
+     XtGetValues(nameValWid, args, nargs);
+
+
+    if (currSeq != NULLSeq)
+    {    
+	 /* astring != NULL, find the string that corresponds
+	    to the app_resources.astring and center the display on that string,
+	    change by lfw, 10/24/90 from this line to free line */
+
+	 if (astring != NULL)  {
+
+	   num = 1; /* num = 1 if astring is a basenum and 0 if alpha */
+	   for (i = 0; i < strlen(astring); i++)
+	     if (!isdigit(astring[i])) {
+	       num = 0; 
+	       break;
+	     }
+	   if (num==0) {
+	     	   for (i = 0; i < strlen(astring); i++)
+		     if (islower(astring[i])) astring[i]=toupper(astring[i]);
+		 }
+	   /* if any of the entries were alphabetic, the program
+		searches for a string */
+
+	   if (num == 0) {
+
+	   num_bases = getNBases(currSeq,EdBases);
+	   theSeq = (char *)calloc(num_bases,sizeof(char));
+	   for (i = 0; i < num_bases; i++)
+	     theSeq[i] = getBase(currSeq, EdBases, i);
+
+	   num_matches = string_match(astring,strlen(astring),theSeq,num_bases,
+				      0, indices);
+
+	   if (num_matches == 0) baseNum = -5;
+	   else {
+	     baseNum = indices[0];
+	     next_indices++;
+	   }
+
+	   free(theSeq);
+
+	 }
+	   else {
+	     sscanf(astring,"%d",&baseNum) ;
+/*	     if (currSeq->bottom) baseNum = currSeq->NorigBases - 1 - baseNum;*/
+	   }
+
+	   mag = -5; /* by setting magnification to -5, the displaySeq
+			routine will go and back calculate the current
+			magnification */
+
+	 if (baseNum != -5) {
+	 displaySeq(currSeq,
+		    ((baseNum!=NULLBaseNum)                 &&
+		     (baseNum>=0)                           &&
+		     (baseNum<getNBases(currSeq,OrigBases))
+		     )
+		    ? baseNum
+		    : NULLBaseNum,
+		    mag
+		    );
+	 if (num_matches > 1) {
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNlabel, "Next?"); nargs++;
+	     XtSetValues(nextWid, args, nargs);
+	 }
+	 else
+	   XtDestroyWidget(popupWid);
+
+       }
+	   else {
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNstring, "String not found"); nargs++;
+	     XtSetValues(nameValWid, args, nargs);
+	   }
+	 }
+       }
+	 else
+	   XBell(XtDisplay(NorigBasesWid), 100);
+       }
+
+
+
+
+  static void searchNextCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nextWid;
+     Arg args[10];
+     int nargs;
+     int baseNum;
+     int mag;
+
+     popupWid      = (Widget) client_data;
+     nextWid    = XtNameToWidget(popupWid, "ioForm.next");
+
+    if (currSeq != NULLSeq)
+    {    
+	   mag = -5; /* by setting magnification to -5, the displaySeq
+			routine will go and back calculate the current
+			magnification */
+	   if (next_indices < num_matches) {
+	     baseNum = indices[next_indices];
+	     next_indices++;
+
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNlabel, "Next?"); nargs++;
+	     XtSetValues(nextWid, args, nargs);
+
+	     displaySeq(currSeq,
+		    ((baseNum!=NULLBaseNum)                 &&
+		     (baseNum>=0)                           &&
+		     (baseNum<getNBases(currSeq,OrigBases))
+		     )
+		    ? baseNum
+		    : NULLBaseNum,
+		    mag
+		    );
+	   }
+	   else 
+	     XtDestroyWidget(popupWid);
+	 }
+
+
+   }
+
+
+
+
+ /* ---- Action procedures ---- */
+
+
+ static void CRAction(Widget w, XEvent *event,
+		      String *params, Cardinal *num_params)
+ /*
+     Action procedure to be called when CR is hit.
+
+     This may be called from the `ioform' directly in the case of a
+     `check' dialogue, or from an `ioform.nameVal' widget in the case
+     of an input or ouput dialogue.
+     We must call the `default' button callback procedure. We can
+     detect which is the default by looking at its borderwidth, the
+     default one being of size 3.
+ */
+ {   Widget ioFormWid, okWid, cancelWid;
+     Arg args[10];
+     int nargs;
+     Dimension width;
+
+     ioFormWid = (strcmp("ioform", XtName(w))==0) ? w : XtParent(w);
+     okWid     = XtNameToWidget(ioFormWid, "*OK");
+     cancelWid = XtNameToWidget(ioFormWid, "*Cancel");
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNborderWidth, &width); nargs++;
+     XtGetValues(okWid, args, nargs);
+
+     XtCallCallbacks((width==3) ? okWid : cancelWid,
+		     XtNcallback, NULL);
+ }
+
+
+
+
+ /* ---- Internal functions ---- */
+
+
+ void checkInputSeq(Widget parentWid)
+ {   Widget checkWid, ioFormWid, ioPromptWid;
+     Widget okWid, cancelWid;
+     Arg	args[10];
+     int nargs;
+     Position  x, y;
+     Dimension width, height;
+
+
+     /* Add string to function bindings for our application actions */
+     XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+     XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		     actionTable, XtNumber(actionTable));
+
+
+     /*
+	 Position the upper left hand corner of the popup at the
+	 center of the parent widget.
+     */
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+     XtSetArg(args[nargs], XtNheight, &height); nargs++;
+     XtGetValues(parentWid, args, nargs);
+     XtTranslateCoords(parentWid,
+		       (Position) (width / 2), (Position) (height / 2),
+		       &x, &y);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNx, x); nargs++;
+     XtSetArg(args[nargs], XtNy, y); nargs++;
+     checkWid = XtCreatePopupShell("check", transientShellWidgetClass,
+				   parentWid,
+				   args, nargs);
+
+     nargs = 0;
+     ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				       checkWid, args, nargs);
+
+     nargs = 0;
+     ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					 ioFormWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+     okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				   ioFormWid, args, nargs);
+     XtAddCallback(okWid, XtNcallback,
+		   checkOKCallback, (XtPointer) checkWid);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+     XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+     cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				       ioFormWid, args, nargs);
+     XtAddCallback(cancelWid, XtNcallback,
+		   checkCancelCallback, (XtPointer) checkWid);
+
+
+     /* This is a modal dialogue */
+     XtPopup(checkWid, XtGrabExclusive);
+ }
+
+#ifdef nodef
+int string_match(seq1, n1, seq2, n2, nmiss, indices)
+
+/*
+  This function may be called once from initialDisplayedSeq if
+  the user has input a string they wish to search for in the
+  input sequence -- this subroutine returns *indices, the first
+  position at which the query sequence (app_resources.astring)
+  matched the input file sequence (currSeq). */
+
+/* a modified version of match.c 
+  finds alignments between a search sequence, seq1, and a target sequence, 
+  seq2, with no gaps (except possibly at ends) 
+   and at most nmiss mismatches (relative to seq1),
+   n1 is the length of seq1 where seq1 is assumed to start at indices 0,
+   n2 is the length of seq2 where seq2 is assumed to start at indices 0 */
+      char *seq1, *seq2;
+      int n1, n2, nmiss;
+      int *indices;
+{
+    int i, j, d, istart, iend, i_miss, n_match;
+    int mtable[100][5];  /* 100 would be, now the total number
+			    of matches the program may find between
+			    a given input string and the sequence */
+
+
+/* d = j - i is the "offset" between the two sequences */
+    if (n1 - n2 > nmiss) return (0);
+    n_match = 0;
+    for (d = -nmiss; d <= n2 + nmiss - n1; d++){
+	if (d < 0)  istart = i_miss = -d;
+	else istart = i_miss = 0;
+	if (d > n2 - n1) {
+	    iend = n2 - d;
+	    i_miss += n1 + d - n2;
+	}
+	else iend = n1;
+	for (i = istart, j = d + i; i < iend; i++, j++)
+	    if (seq1[i] != seq2[j] && ++i_miss > nmiss) goto nextd; 
+/* VOMIT! I want to PUKE!!!! who put this horrid goto here? */
+        mtable[n_match][0] = d + istart; /* indices (assuming they start at
+					    0) of starting nucleotide in the
+					    searched sequence */
+        mtable[n_match][1] = istart; /* nucleotide position in the query
+					sequence where match starts (assuming
+					query index starts with 0) */
+        mtable[n_match][2] = iend - istart; /* number of nucleotides in the
+					       match */
+	if (mtable[n_match][2]==n1) {
+	  indices[n_match] = mtable[n_match][0];
+	}
+	  
+        mtable[n_match][3] = i_miss; /* number of mismatches */
+	n_match++; /* number of matches */
+	if (n_match >= 100) return(n_match);
+
+    nextd:;
+    }
+
+    return (n_match);
+    /* return(0); *//*return 0 if it got to this point; that means
+		 it must have not found an exact match, so 
+		 you want the baseNum to be 0 in that case*/
+}
+#endif /*nodef*/
+
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+void  inputSeq(Widget parentWid)
+/*
+    Set up a dialogue which will read in and display a sequence.
+*/
+{   Widget inputWid, ioFormWid, ioPromptWid;
+    Widget namePromptWid, nameValWid;
+    Widget formatPromptWid, formatBoxWid, plainFmtWid, abiFmtWid, scfFmtWid;
+    Widget alfFmtWid;
+    Widget okWid, cancelWid;
+    Widget bottomFmtWid, topFmtWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+
+
+    /*
+          Check whether there is an existing sequence which
+          is dirty which needs saving.
+    */
+    if (currSeq!=NULLSeq && isDirty(currSeq))
+    {   checkInputSeq(parentWid);
+        return;
+    }
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    inputWid = XtCreatePopupShell("input", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      inputWid, args, nargs);
+
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+					  ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);  nargs++;
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);    nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    formatPromptWid = XtCreateManagedWidget("formatPrompt", labelWidgetClass,
+					    ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  namePromptWid);   nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, formatPromptWid); nargs++;
+    formatBoxWid = XtCreateManagedWidget("formatBox", boxWidgetClass,
+					 ioFormWid, args, nargs);
+
+    nargs = 0;
+    plainFmtWid = XtCreateManagedWidget("plainFmt", toggleWidgetClass,
+					formatBoxWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, plainFmtWid); nargs++;
+    abiFmtWid = XtCreateManagedWidget("abiFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, abiFmtWid); nargs++;
+    alfFmtWid = XtCreateManagedWidget("alfFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, abiFmtWid); nargs++;
+    scfFmtWid = XtCreateManagedWidget("scfFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    bottomFmtWid = XtCreateManagedWidget("bottomFmt", toggleWidgetClass,
+					formatBoxWid,
+					args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, bottomFmtWid); nargs++;
+    topFmtWid = XtCreateManagedWidget("topFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, formatPromptWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, inputOKCallback, (XtPointer) inputWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  formatPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  inputCancelCallback, (XtPointer) inputWid);
+
+
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+
+
+    /* This is a modal dialogue */
+    XtPopup(inputWid, XtGrabExclusive);
+}
+
+
+
+
+void outputSeq(Widget parentWid, String defaultFileName)
+/*
+    Save the current sequence using the deafult file name, if given.
+*/
+{   Widget outputWid, ioFormWid, ioPromptWid;
+    Widget namePromptWid, nameValWid;
+    Widget includeHeaderWid,plainFormatWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+
+
+    /* If there is no current sequence, then return */
+    if (currSeq==NULLSeq) return;
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    outputWid = XtCreatePopupShell("output", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      outputWid, args, nargs);
+
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+					  ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid);     nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);   nargs++;
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);     nargs++;
+    XtSetArg(args[nargs], XtNstring,    defaultFileName); nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+    /* Set the insertion point after the text - dodgy method used */
+    XawTextSetInsertionPoint(nameValWid, MAXLONG);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, True); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Include Header"); nargs++;
+    includeHeaderWid = XtCreateManagedWidget("includeHeader", toggleWidgetClass,
+					ioFormWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, False); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, includeHeaderWid); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Plain Format"); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, includeHeaderWid); nargs++;
+    plainFormatWid = XtCreateManagedWidget("plainFormat", toggleWidgetClass,
+					ioFormWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, includeHeaderWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, outputOKCallback, (XtPointer) outputWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, includeHeaderWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback, 
+		  outputCancelCallback, (XtPointer) outputWid);
+    
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(outputWid, XtGrabExclusive);
+}
+
+
+
+
+void quitApplication(Widget parentWid)
+/*
+  This function must be called to exit the application.
+  It ensures any current sequence is saved.
+  */
+{   Widget ioFormWid, ioPromptWid, quitWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+    
+    
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+    
+    
+    /*
+      Check whether there is an existing sequence which
+      is dirty which needs saving.
+      */
+    if (currSeq==NULLSeq || !isDirty(currSeq))
+	{   free(currSeq);
+	    XtDestroyApplicationContext(XtWidgetToApplicationContext(parentWid));
+	    exit(0);
+	}
+    
+    
+    /*
+      Position the upper left hand corner of the popup at the
+      center of the parent widget.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    quitWid = XtCreatePopupShell("quit", transientShellWidgetClass,
+				 parentWid,
+				 args, nargs);
+    
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      quitWid, args, nargs);
+    
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback,
+		  quitOKCallback, (XtPointer) quitWid);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  quitCancelCallback, (XtPointer) quitWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(quitWid, XtGrabExclusive);
+}
+
+
+
+
+void initialDisplayedSeq(Widget toplevelWid,
+			 char *format, char *fn,
+			 int baseNum, int mag, char *astring, 
+			 char *enzyme, int bottom)
+/*
+  This function may be called once, after the application
+  has been realised, to specify a sequence to be displayed
+  initially. If it is not called, no sequence is initially
+  displayed.
+  */
+{   Arg args[10];
+    int nargs;
+    int i;
+    char NBasesString[10];
+    char *theSeq;
+    AppData app_data;
+    char enzString[100];
+    
+    
+    int num_bases;
+    Widget seqNameWid    = XtNameToWidget(toplevelWid, "mainForm.seqName");
+    Widget NorigBasesWid = XtNameToWidget(toplevelWid, "mainForm.NorigBases");
+    num_matches = 0;
+    next_indices = 0;
+    
+    XtGetApplicationResources(seqNameWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+    
+#ifndef AUTO_CLIP
+    app_data.enz='\0';
+#endif
+    
+    /* enzyme will already equal either the app_resources.enz or
+       app_data.enz, lfw 05.02.91*/
+    if (enzyme!=NULL) 
+	strcpy(enzString,enzyme);
+    /*      if (app_data.enz != NULL)
+	    strcpy(enzString,app_data.enz);*/
+    else *enzString = '\0';
+    
+    
+    
+    /* read a sequence */
+    currSeq = readSeq(fn,bottom,enzString,format);
+    
+    if (currSeq != NULLSeq)
+	{    
+	    /*
+	      Set the `file name' and `number of bases' labels.
+	      */
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNlabel, fn); nargs++;
+	    XtSetValues(seqNameWid, args, nargs);
+	    nargs = 0;
+	    sprintf(NBasesString, "%d", getNBases(currSeq, OrigBases));
+	    XtSetArg(args[nargs], XtNlabel, NBasesString); nargs++;
+	    XtSetValues(NorigBasesWid, args, nargs);
+	    
+	    /* astring != NULL, find the string that corresponds
+	       to the app_resources.astring and center the display on that string,
+	       change by lfw, 10/24/90 from this line to free line */
+	    
+	    if (astring != NULL)  {
+		num_bases = getNBases(currSeq,EdBases);
+		theSeq = (char *)calloc(num_bases,sizeof(char));
+		for (i = 0; i < num_bases; i++)
+		    theSeq[i] = getBase(currSeq, EdBases, i);
+		
+		num_matches = string_match(astring,strlen(astring),theSeq,num_bases,
+					   0, indices);
+		baseNum = indices[0];
+		if (num_matches == 0) baseNum = 0;
+		
+		free(theSeq);
+	    }
+	    
+	    displaySeq(currSeq,
+		       ((baseNum!=NULLBaseNum)                 &&
+			(baseNum>=0)                           &&
+			(baseNum<getNBases(currSeq,OrigBases))
+			)
+		       ? baseNum
+		       : NULLBaseNum,
+		       mag
+		       );
+	}
+    else
+        XBell(XtDisplay(NorigBasesWid), 100);
+    
+}
+
+
+void  inputSearchString(Widget parentWid)
+/*
+  Set up a dialogue which will read in a search for a baseNum or string
+  and center the display on that baseNum, added by lfw 12/04/90
+  */
+{   Widget searchWid, ioFormWid, ioPromptWid;
+    Widget nameValWid, nextWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+    
+    
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+    
+    /*
+      Position the upper left hand corner of the popup at the
+      center of the parent widget.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    searchWid = XtCreatePopupShell("search", transientShellWidgetClass,
+				   parentWid,
+				   args, nargs);
+    
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      searchWid, args, nargs);
+    
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+    
+    /*    nargs = 0;
+	  XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+	  namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+	  ioFormWid, args, nargs);*/
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    /*    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);  nargs++;*/
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);    nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, nameValWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, searchCallback, (XtPointer) searchWid);
+    
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  nameValWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  inputCancelCallback, (XtPointer) searchWid);
+    
+    
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  nameValWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, cancelWid);           nargs++;
+    nextWid = XtCreateManagedWidget("next",  commandWidgetClass,
+				    ioFormWid, args, nargs);
+    XtAddCallback(nextWid, XtNcallback,
+		  searchNextCallback, (XtPointer) searchWid);
+    
+    
+    
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(searchWid, XtGrabExclusive);
+}
+
+
+
+
+
+void information (Widget w)
+/*
+ ** Display sequence and trace information gleaned from the sequence file
+ */
+{
+    
+    info(w, currSeq);
+    
+}
diff --git a/src/newted/dialogues.h b/src/newted/dialogues.h
new file mode 100644
index 0000000..a5649e3
--- /dev/null
+++ b/src/newted/dialogues.h
@@ -0,0 +1,71 @@
+#ifndef _dialogues_h
+#define _dialogues_h
+
+/* 
+    Title:       dialogues
+
+    File: 	 dialogues.h
+    Purpose:	 Dialogues
+    Last update: Thu Jan 03 1991
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+#include <stdio.h>
+
+/* ---- Global to ted.c and dialogues.c --- */
+extern char o_fn[200]; /* added by lfw, so a default output filename
+			  could be specified */
+extern char r_fn[200]; /* added by sd, so a default raw data filename
+			  could be specified */
+
+/* ---- Exports ---- */
+
+extern void inputSeq(Widget parentWid);
+/*
+    Set up a dialogue which will read in and display a new sequence.
+*/
+
+
+extern void outputSeq(Widget parentWid, String defaultFileName);
+/*
+    Save the current sequence using the default file name, if given.
+*/
+
+extern void inputSearchString(Widget parentWid);
+/*
+    Set up a dialogue which will read in and search for a string.
+*/
+
+extern void quitApplication(Widget parentWid);
+/*
+    This function must be called to exist the application.
+    It ensures any current sequence is saved.
+*/
+
+
+extern void initialDisplayedSeq(Widget toplevelWid,
+				char *format, char *fn,
+				int baseNum, int mag, char *astring, 
+				char *enzyme, int bottom);
+/*
+    This function may be called once, after the application
+    has been realised, to specify a sequence to be displayed
+    initially. If it is not called, no sequence is initially
+    displayed.
+*/
+
+extern void information(Widget w);
+/*
+    Display useful sequence and trace information
+*/
+
+extern int string_match(char *seq1, int n1, char *seq2, int n2,
+			int nmiss, int *indices);
+
+#endif  /*_dialogues_h*/
diff --git a/src/newted/display.c b/src/newted/display.c
new file mode 100644
index 0000000..a0c9537
--- /dev/null
+++ b/src/newted/display.c
@@ -0,0 +1,801 @@
+/* 
+    Title:       display
+
+    File: 	 display.c
+    Purpose:	 Sequence display and editing
+    Last update: Friday 15 May 1992
+*/
+
+
+/*
+    The display is created by a call to CreateDisplay. It principally
+    consists of two components: the first, created by a call to
+    createBasesDisplay displays the original and edited bases, the
+    second, created by a call to createTracesDisplay, displays the
+    traces.
+
+    Editing is done by stream: as the client presses keys and buttons
+    these must be interpreted as editing keystrokes.
+
+    The viewport widget does not itself hand out resize events, so we
+    create a dummy (graph) widget in the same patch, connected in the
+    same way to the walls of the form, to collect resize events and
+    allow us to adjust scalings for the graphs within the viewport.
+    This is slightly dodgy and should be redone some other way.
+
+  Changes to this program by lfw:
+  module displaySeq
+        Added the parameter mag to the parameters
+                passed to the subroutine
+        Changed to default magnification to 30%
+               by changing the call to setGraphWidth
+               to be 30 rather than -1.  And by
+               allowing the call to setGraphWidth
+               to be specified by the user which
+               comes from the main calling program
+               via the mag parameter
+
+  Changes made by sd:
+	Added scaleUp and scaleDown buttons + callbacks
+
+  SD 15-May-1992
+        editSeqEvents(): DEC Xlib function XlookupString didn't always work with
+	a buffer of length of 1, even when only one character returned. Made 
+	buffer larger - now eight characters (doesn't work with <=four)
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "display.h"
+#include "traceDisplay.h" /* IMPORT: createTraceDisplay, displayTrace,
+			             unDisplayTrace, setTraceWidth */
+#include "basesDisplay.h" /* IMPORT: createBasesDisplay, displayBases,
+			             unDisplayBases, setBasesWidth */
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+
+#include <ctype.h>        /* IMPORT: isgraph */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Viewport.h>
+
+
+
+
+/* ---- Constants ---- */
+
+
+#define MaxDimension 32767 /* By definition, 2^16-1 */
+#define MaxMagCharSpace 5 /* At maximum magnification, the distance in
+			     characters between each base */
+#define ViewportFudge 10  /* A guess of the space required within the
+			     viewport for the graph to fit */
+
+
+/* the following allows us to get the default magnification from the
+   app_defaults file, Xted */
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    NULL
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+
+
+
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static int NedBases;
+
+/* Viewport widget */
+static Widget viewportWid;
+static Widget vpFormWid;
+static Widget dummyWid;
+
+static Dimension charWidth;      /* As used by basesDisplay */
+static Dimension minGraphWidth;  /* All characters displayed
+				    (depends on width of parent viewport) */
+static Dimension maxGraphWidth;  /* Bases spaced out with `MaxMagCharSpace'
+				    blanks (depends on char size and
+				    NorigBases) */
+static Dimension graphWidth;     /* Current width of the graph */
+static Dimension plotEdgeOffset;
+
+
+/* Editing state toggles */
+static Widget togWid;            /* The group of toggles */
+
+
+/* Magnification widget */
+static Widget magWid;
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void setGraphWidth(int percent)
+/*
+      Set the widths of the graphs to be `percent' between
+      the minimum and maximum widths.
+      The current ``centre of interest'' is maintained.
+*/
+{   int nargs;
+    Arg args[10];
+    float centre, topOfThumb, shown;
+    Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+
+    /*
+        The ``centre of interest'' is the middle of the displayed
+	graph. Percentagewise, this is given by `topOfThumb+show/2'.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    centre = topOfThumb+shown/2;
+
+    /*
+        Set the displays to the required width
+    */
+    graphWidth = ((maxGraphWidth-minGraphWidth) * percent)/100 + minGraphWidth;
+    XawFormDoLayout(vpFormWid, False);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, graphWidth); nargs++;
+    setBasesWidth(graphWidth);
+    setTraceWidth(graphWidth);
+    XawFormDoLayout(vpFormWid, True);
+
+    /*
+        We have changed the size of the form widget within the viewport
+	widget. The viewport does not define where exactly the form
+	will now sit, so we manually reset the ``centre of interest'',
+	keeping `shown' at its (new) value.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    topOfThumb = centre-shown/2;
+    XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+    XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+}
+
+
+
+
+static void editSeqEvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a sequence
+    editing command.
+*/
+{
+    char buffer[8];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 8,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    moveCaretRight();
+		    break;
+
+                case XK_Left:
+		    moveCaretLeft();
+		    break;
+
+		case XK_Delete:
+		    /* Delete the base to the left of the caret */
+		    if (deleteBase(currSeq, getCaret()))
+		    {   baseDeleted();
+		        NedBases--;
+		    }
+		    else
+		        XBell(XtDisplay(viewportWid), 100);
+	            break;
+
+		default:
+		    if (charCount==1 && isgraph(buffer[0]))
+		    {   /* Insert the base to the right of the caret */
+			if (insertBase(currSeq, buffer[0], getCaret()))
+			{   baseInserted();
+			    NedBases++;
+			}
+			else
+		            XBell(XtDisplay(viewportWid), 100);
+		    }
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {	/* Move the caret to where the pointer is */
+		int point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         moveCaretTo(-1);
+		else if (point>NPoints-1) moveCaretTo(NedBases-1);
+		else
+	        {    int b = pointToBaseNum(currSeq, EdBases, point);
+		     moveCaretTo((b == NULLBaseNum) ? NedBases-1 : b-1);
+		}
+		break;
+	    }
+    }
+}
+
+
+
+
+static void adjustLEvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a command
+    to adjust the left cutoff.
+*/
+{   char buffer[1];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    int leftCutoff, rightCutoff;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 1,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    /* Move the cutoff right one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (leftCutoff<NedBases) leftCutoff++;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+
+                case XK_Left:
+		    /* Move the cutoff left one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (leftCutoff>0) leftCutoff--;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {   int point, baseNum;
+
+		/* Move the cutoff to where the pointer is */
+		point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         baseNum = 0;
+		else if (point>NPoints-1) baseNum = NedBases;
+		else
+		{ baseNum = pointToBaseNum(currSeq, EdBases, point);
+
+		  baseNum = (baseNum == NULLBaseNum)
+		              ? NedBases
+		              : baseNum;
+		}
+		getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		setCutoffs(currSeq, baseNum, rightCutoff);		      
+		basesCutoffChange();
+		traceCutoffChange();
+		break;
+	    }
+    }
+}
+
+
+
+
+static void adjustREvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a command
+    to adjust the right cutoff.
+*/
+{   char buffer[1];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    int leftCutoff, rightCutoff;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 1,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    /* Move the cutoff right one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (rightCutoff>0) rightCutoff--;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+
+                case XK_Left:
+		    /* Move the cutoff left one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (rightCutoff<NedBases) rightCutoff++;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {   int point, baseNum;
+
+		/* Move the cutoff to where the pointer is */
+		point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         baseNum = 0;
+		else if (point>NPoints-1) baseNum = NedBases;
+		else
+		{ baseNum = pointToBaseNum(currSeq, EdBases, point);
+
+		  baseNum = (baseNum == NULLBaseNum)
+		              ? NedBases
+		              : baseNum;
+		}
+		getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		setCutoffs(currSeq, leftCutoff, NedBases-baseNum);
+		basesCutoffChange();
+		traceCutoffChange();
+		break;
+	    }
+    }
+}
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Dimension width;
+    int mag;
+    Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtGetValues(viewportWid, args, nargs);
+    
+    /* Back calculate what the current magnification is */
+    mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+
+    /* Adjust the minimum magnification */
+    minGraphWidth = width-ViewportFudge;
+    graphWidth    = minGraphWidth;
+
+    /* Set us to the old magnification, but with the new dimensions */
+    setGraphWidth(mag);
+}
+
+
+
+
+static void magJumpCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{   float percent = *((float *) call_data);
+
+    setGraphWidth((int) (percent*100));
+}
+
+
+
+
+static void magScrollCallback(Widget wid,
+			      XtPointer client_data, XtPointer call_data)
+{   int position = (int) call_data;
+    float     topOfThumb;
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtGetValues(wid, args, nargs);
+
+    /* Move the thumb by 1% in the appropriate direction */
+    if (position>0)
+    {   topOfThumb += .01;
+	if (topOfThumb > 1.0) topOfThumb = 1.0;
+    }
+    else
+    {   topOfThumb -= .01;
+	if (topOfThumb < 0.0) topOfThumb = 0.0;
+    }
+
+    XawScrollbarSetThumb(wid, topOfThumb, -1.0);
+
+    setGraphWidth((int) (topOfThumb*100));
+}
+
+
+static void scaleUpCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{
+    incScaleFactor();
+}
+
+static void scaleDownCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{
+    decScaleFactor();
+}
+
+
+static void complementCallback(Widget wid,
+			       XtPointer client_data, XtPointer call_data)
+{
+    complement_seq(currSeq);
+    /* force redisplay of everything in viewport widget */
+    redisplayTraces();
+    redisplayBases();
+    /* shift caret */
+    moveCaretTo(currSeq->NedBases - getCaret() - 2);
+
+    { /* set viewport position */
+	Cardinal nargs;
+	float topOfThumb, shown;
+	Arg args[2];
+	Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+	nargs=0;
+	XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+	XtSetArg(args[nargs], XtNshown, &shown); nargs++;
+	XtGetValues(hScrollWid, args, nargs);
+	topOfThumb = 1.0 - shown - topOfThumb;
+	XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+	XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+    }
+
+    /* set strand widget */
+    {
+	Widget strandWid;
+	strandWid = XtNameToWidget(XtParent(wid), "strand");
+	set_strand_label(strandWid,currSeq);
+
+    }
+
+}
+
+
+
+/* ---- Exports ---- */
+
+
+void createDisplay(Widget parentWid, Widget fromVertWid)
+/*
+    Create the display within the Form widget `parentWid'
+    with the XtNfromVert constraint `fromVertWid'.
+    No sequence is initally displayed.
+*/
+{   Widget labelWid, basesWid;
+    Widget scaleUpWid, scaleDownWid;
+    Widget compWid;
+    Arg args[10];
+    int nargs;
+
+    /* The magnification slider */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    labelWid = XtCreateManagedWidget("maglab", labelWidgetClass,
+				     parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    magWid = XtCreateManagedWidget("magscr", scrollbarWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(magWid, XtNjumpProc,   magJumpCallback, NULL);
+    XtAddCallback(magWid, XtNscrollProc, magScrollCallback, NULL);
+
+
+
+    /* The editing mode toggles */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+   XtSetArg(args[nargs], XtNfromHoriz, magWid); nargs++;
+    labelWid = XtCreateManagedWidget("edmodelab", labelWidgetClass,
+				     parentWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    togWid = XtCreateManagedWidget("adjustL", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, togWid); nargs++;
+    togWid = XtCreateManagedWidget("edSeq", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, togWid); nargs++;
+    togWid = XtCreateManagedWidget("adjustR", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+
+    /* for vertical scaling */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    scaleDownWid = XtCreateManagedWidget("scaleDown", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(scaleDownWid, XtNcallback, scaleDownCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, scaleDownWid); nargs++;
+    scaleUpWid = XtCreateManagedWidget("scaleUp", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(scaleUpWid, XtNcallback, scaleUpCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, scaleUpWid); nargs++;
+    compWid = XtCreateManagedWidget("switch", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(compWid, XtNcallback, complementCallback, NULL);
+
+
+    /* The main display */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, labelWid); nargs++;
+    XtSetArg(args[nargs], XtNforceBars, True); nargs++;
+    viewportWid = XtCreateManagedWidget("viewport", viewportWidgetClass,
+					parentWid, args, nargs);
+
+        /*
+	    The minimum graph width is the viewport width minus a
+	    random amount to allow for spacing between the viewport
+	    and the graph, borders etc.
+	    The initial graph width is this minimum width.
+        */
+        {   Dimension vpWidth;
+
+            nargs = 0;
+            XtSetArg(args[nargs], XtNwidth, &vpWidth); nargs++;
+	    XtGetValues(viewportWid, args, nargs);
+	    minGraphWidth = vpWidth-ViewportFudge;
+	    graphWidth = minGraphWidth;
+	}
+
+    nargs = 0;
+    vpFormWid = XtCreateManagedWidget("vpForm", formWidgetClass,
+				      viewportWid, args, nargs);
+
+    basesWid = createBasesDisplay(vpFormWid, NULL, graphWidth);
+    getBasesFontInfo(&plotEdgeOffset, &charWidth);
+    createTraceDisplay(vpFormWid,basesWid,graphWidth, plotEdgeOffset);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, magWid); nargs++;
+    dummyWid = XtCreateManagedWidget("dummy", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(dummyWid, XtNresizeCallback, resizeCallback, NULL);
+}
+
+
+
+
+void displaySeq(Seq seq, int baseNum, int mag)
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to NULLBaseNum, display at 30% (or mag)
+    magnification centered on baseNum.
+*/
+{   AppData app_data;
+
+
+
+    XtGetApplicationResources(magWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+
+
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+    displayBases(seq,baseNum);
+    displayTrace(seq);
+
+
+    /*
+      The maximum graph width is such to allow for a `MaxMagCharSpace'
+      spacing between bases.
+
+      There is a limit on Dimension sizes, noting that the graph we
+      generate has to fit into a Viewport.
+    */
+    {  Dimension maxWidth;
+       int  NorigBases;
+
+       NorigBases = getNBases(currSeq, OrigBases);
+       maxWidth = ((NorigBases-1) * charWidth * MaxMagCharSpace) +
+	                (2 * plotEdgeOffset);
+       maxGraphWidth =
+	     (maxWidth>MaxDimension-ViewportFudge) ? MaxDimension-ViewportFudge
+	   : (maxWidth<minGraphWidth)              ? minGraphWidth
+	   :                                         maxWidth;
+    }
+
+
+    /*
+        Get static information about this sequence
+    */
+    NedBases = getNBases(seq, EdBases);
+    NPoints  = getNPoints(seq);
+
+    
+    if (baseNum==NULLBaseNum)
+    {   /* Starting at LH end, minimum magnification (everything visible) */
+
+        if (mag==-5) /* back calculate the magnification */ 
+	  mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+
+
+	if (mag <= 0 || mag > 100) {
+	  setGraphWidth(0);
+	  XawScrollbarSetThumb(magWid, 0.3, -1.0);
+	}
+	else {
+	  setGraphWidth(mag);
+	  XawScrollbarSetThumb(magWid, (float)mag/100.0, -1.0);
+	}
+    }
+    else
+    {   /* Centred on baseNum, half magnification */
+        Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+        float topOfThumb;
+
+	/* Set the `centre of interest' on baseNum. Zero `shown'. */
+/*
+	topOfThumb = (float)baseNum / (float)(NedBases-1);
+*/
+	topOfThumb = (float) baseNumToPoint(seq, OrigBases,baseNum) / (float) NPoints;
+	XawScrollbarSetThumb(hScrollWid, topOfThumb, 0.0);
+	/* Set the actual magnification */
+
+/* the original program automatically set the magnification to 30% if
+   the user specified a baseNum other than NULL, this section was added
+   by LaDeana Hillier, 9/27/90, to allow the user to set their own
+   magnification when calling up ted. */
+
+        if (mag==-5) /* back calculate it */ 
+	  mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+	if (app_data.magnif==0) app_data.magnif = 30;
+
+	if (mag <= 0 || mag > 100) {
+	  setGraphWidth(app_data.magnif);
+	/* Set the magnification scrollbar */
+          XawScrollbarSetThumb(magWid, (float)(app_data.magnif)/100.0, -1.0);
+	}
+	else {
+	  setGraphWidth(mag);
+          XawScrollbarSetThumb(magWid, (float)mag/100.0, -1.0);
+	}
+    }
+}
+
+
+
+
+void unDisplaySeq()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+
+
+    /*
+        Reset the magnification to minimum.
+	The thumb is left its default size.
+    */
+    XawScrollbarSetThumb(magWid, 0.0, -1.0);
+
+    unDisplayBases();
+    unDisplayTrace();
+    /*
+        The initial width is the minimum, so everything is visible.
+	This also clears the graphs.
+    */
+    setGraphWidth(0);
+}
+
+
+
+
+Seq displayedSeq()
+/*
+    Return the currently displayed sequence, or NULL
+    if none is being displayed.
+*/
+{   return(currSeq);
+}
+
+
+
+
+void userEvent(XEvent *eventP)
+{   String currTog;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Find which of the toggles is currently set
+    */
+    currTog = (String) XawToggleGetCurrent(togWid);
+
+    if (strcmp(currTog, "adjustL") == 0)
+    {   /* Adjust the left cutoff */
+        adjustLEvent(eventP);
+    }
+    else if (strcmp(currTog, "adjustR") == 0)
+    {  /* Adjust the right cutoff */
+        adjustREvent(eventP);
+    }
+    else
+    {   /* Edit the sequence itself */
+	editSeqEvent(eventP);
+    }
+}
+
+
diff --git a/src/newted/display.h b/src/newted/display.h
new file mode 100644
index 0000000..a93070f
--- /dev/null
+++ b/src/newted/display.h
@@ -0,0 +1,68 @@
+/* 
+    Title:       display
+
+    File: 	 display.h
+    Purpose:	 Sequence display and editing
+    Last update: Tue Jun 5 1990
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void createDisplay(Widget parentWid, Widget superiorWid);
+/*
+    Create the display within the Form widget `parent',
+    underneath `superiorWid'.
+    No sequence is initially displayed.
+*/
+
+
+extern void displaySeq(Seq seq, int baseNum, int mag);
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to -1, display at half magnification
+    centered on baseNum. (unless the user has also set mag)
+*/
+
+
+extern void unDisplaySeq();
+/*
+    Cease displaying the current sequence (if any).
+*/
+
+
+extern Seq displayedSeq();
+/*
+    Return the currently displayed sequence, or NULLSeq
+    if none is being displayed.
+*/
+
+
+extern void userEvent(XEvent *eventP);
+/*
+    Handle the (key or button) event generated in a subdisplay.
+*/
+
+
+
+
+
+
+
+
diff --git a/src/newted/fpoint.c b/src/newted/fpoint.c
new file mode 100644
index 0000000..cd930c1
--- /dev/null
+++ b/src/newted/fpoint.c
@@ -0,0 +1,54 @@
+/*#include <math.h>*/
+extern double log ( double x ) ;
+extern double exp ( double x ) ;
+#define IEEE
+
+float int_to_float(int in)
+/*
+** interpret the integer in as a
+** floating point number in IEEE format
+*/
+{
+   /*
+  Assume `in' is stored as a float according to the 
+  ANSI IEEE 754-1985 standard. See the tables below:
+
+  s = sign ( 1 bit)
+  e = biased exponent (8 bits)
+  f = fraction (23 bits)
+
+  floating point number =  (-1)^s 2^(e-127) 1.f
+
+     Bits  Name      Content
+      31   Sign      1 iff number is negative
+    23-30  Exponent  Eight-Bit exponent, biased by 127
+     0-22  Fraction  23-bit fraction component of normalised significant.
+		     The "one" bit is "hidden"
+
+  If IEEE floating point format is supported on your machine...
+  ensure there is a #define IEEE somewhere. 
+  */
+
+#ifdef IEEE
+  union {
+    int i;
+    float f;
+  } cvt;
+  cvt.i = in;
+  return cvt.f;
+#else
+  int fraction;
+  int exponent;
+  int sign;
+
+  fraction = in & ( (1<<23)-1 );
+  exponent = (in >> 23) & ( (1<<8)-1 );
+  sign = (in >> 31);
+
+  return
+    (float) (
+      (sign?-1.0:1.0) *
+      exp ( log ( (double) 2.0) * (double) (exponent - 127 - 23) ) *
+      (double) ((1<<23)+fraction)) ;
+#endif
+}
diff --git a/src/newted/fpoint.h b/src/newted/fpoint.h
new file mode 100644
index 0000000..065c5df
--- /dev/null
+++ b/src/newted/fpoint.h
@@ -0,0 +1,6 @@
+#ifndef _fpoint_h
+#define _fpoint_h
+
+extern float int_to_float(int in);
+
+#endif /* _fpoint_h */
diff --git a/src/newted/getABIPrimerPos.c b/src/newted/getABIPrimerPos.c
new file mode 100644
index 0000000..e4cd4f1
--- /dev/null
+++ b/src/newted/getABIPrimerPos.c
@@ -0,0 +1,203 @@
+/*
+    Title: 	 getABIPrimerPos
+
+    Purpose:	 Extract the Primer Position from and ABI trace file
+    Last update: Fri 17 Jul 1992
+
+    Change log:
+
+*/
+
+
+/*
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+
+
+
+
+/* ---- Constants ---- */
+
+
+/*
+    The ABI trace file structure is based around an `index'
+    which is located towards the end of the file. It's location
+    is given by a longword at a fixed place.
+*/
+#define IndexPO ((long)26)
+
+/*
+    The index consists of a number of entries.
+    Each of these is a label of 4 characters followed by 6 long words.
+*/
+#define LW1 4
+#define LW2 8
+#define LW3 12
+#define LW4 16
+#define LW5 20
+#define LW6 24
+#define IndexEntryLength 28
+
+/*
+    Here is the label we will be looking for (SMPL), packed
+    into a long word.
+*/
+#define PrimerPosEntryLabel  ((long) ((((('P'<<8)+'P')<<8)+'O')<<8)+'S')
+
+
+
+
+/* ---- Types ---- */
+
+typedef long int4;
+typedef short int2;
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static int4 readABIInt4_no_address(FILE *fp)
+{
+    unsigned char buf[sizeof(int4)];
+
+    if (fread(buf, sizeof(long), 1, fp) != 1) {
+	fprintf(stderr,"Fread fail\n");
+	exit(1);
+    }
+    return (int4) 
+	   (((unsigned long)buf[3]) +
+	    ((unsigned long)buf[2]<<8) +
+	    ((unsigned long)buf[1]<<16) +
+	    ((unsigned long)buf[0]<<24));
+
+}
+
+static int readABIInt2(FILE *fp, int2 *i2)
+{
+    unsigned char buf[sizeof(int2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 = (int2)
+        (((unsigned short)buf[1]) +
+         ((unsigned short)buf[0]<<8));
+    return (1);
+}
+
+
+static int readABIInt4(FILE *fp, int4 *i4)
+{
+    unsigned char buf[sizeof(int4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 = (int4)
+        (((unsigned long)buf[3]) +
+         ((unsigned long)buf[2]<<8) +
+         ((unsigned long)buf[1]<<16) +
+         ((unsigned long)buf[0]<<24));
+    return (1);
+}
+
+
+
+static int getIndexEntryW(FILE *fp, long indexO, long label, int lw, int2 *val)
+/*
+    From the ABI results file connected to `fp' whose index starts
+    at byte offset `indexO', return in `val' the `lw'th  word (int2)
+    from the entry labelled `label'.
+    The result indicates success.
+*/
+{   int entryNum=-1;
+    int i;
+    int4 entryLabel, entryLw1;
+    int4 jval;
+
+    do
+    {   entryNum++;
+        if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) return(-1);
+        if (!readABIInt4(fp, &entryLabel)) return(0);
+    } while (!(entryLabel == label));
+
+
+    for(i=2; i<lw; i++)
+        if (!readABIInt4(fp, &jval)) return(-1);
+    if (!readABIInt2(fp, val)) return(-1);
+
+    return(1);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{   FILE *fp = NULL;
+    int4 indexO, sampleNameO;
+    int sampleNameLen;
+    int4 val;
+    int entryNum;
+    int2 primerPos;
+
+    if (argc != 2)
+    {   fprintf(stderr, "%s: usage: getPrimerPos ABI_filename\n", argv[0]);
+	exit(1);
+    }
+
+
+    /* Open the gel file: read, binary */
+    if ((fp = fopen(argv[1], "rb")) == NULL)
+    {   fprintf(stderr, "%s: Unable to open file %s\n", argv[0], argv[1]);
+	exit(1);
+    }
+
+
+    /* Get the index offset */
+    if (fseek(fp, IndexPO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    indexO = readABIInt4_no_address(fp);
+
+
+    /* Get Primer Position...where the ABI file found the start of */
+    if (!getIndexEntryW(fp,indexO,PrimerPosEntryLabel,6,&primerPos))
+    {   fclose(fp);
+	printf("Could not find primer position...\n");
+	exit(1);
+    }
+    printf("%s: %d\n",argv[1],primerPos);
+
+
+    /*
+        Write the sample name out on the standard output
+	The name is in a BCPL like format -
+	the first byte holds the string length
+    */
+/*    if (fseek(fp, sampleNameO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    sampleNameLen = (int) fgetc(fp);
+    while (sampleNameLen > 0)
+    {   fputc(fgetc(fp), stdout);
+	sampleNameLen--;
+    }
+*/
+
+    fclose(fp);
+    exit(0);
+}
+
+
+
diff --git a/src/newted/getABISampleName.c b/src/newted/getABISampleName.c
new file mode 100644
index 0000000..7840e5f
--- /dev/null
+++ b/src/newted/getABISampleName.c
@@ -0,0 +1,161 @@
+/*
+    Title: 	 getABISampleName
+
+    Purpose:	 Extract the Sample name from and ABI trace file
+    Last update: Fri Oct 25 1991
+
+    Change log:
+
+	25-Oct-1991 SD Machine independant I/O
+*/
+
+
+/*
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include <stdlib.h>
+#include <sys/types.h>
+#include "mach-io.h"
+
+
+
+/* ---- Constants ---- */
+
+
+/*
+    The ABI trace file structure is based around an `index'
+    which is located towards the end of the file. It's location
+    is given by a longword at a fixed place.
+*/
+#define IndexPO ((off_t)26)
+
+/*
+    The index consists of a number of entries.
+    Each of these is a label of 4 characters followed by 6 long words.
+*/
+#define LW1 4
+#define LW2 8
+#define LW3 12
+#define LW4 16
+#define LW5 20
+#define LW6 24
+#define IndexEntryLength 28
+
+/*
+    Here is the label we will be looking for (SMPL), packed
+    into a long word.
+*/
+#define SampleNameEntryLabel  ((int_4) ((((('S'<<8)+'M')<<8)+'P')<<8)+'L')
+
+
+
+/* ---- Internal functions ---- */
+
+
+static int_4 readABIInt4(FILE *fp)
+{
+    unsigned char buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(int_4), 1, fp) != 1) {
+	fprintf(stderr,"Fread fail\n");
+	exit(1);
+    }
+    return (int_4) 
+	   (((int_4)buf[3]) +
+	    ((int_4)buf[2]<<8) +
+	    ((int_4)buf[1]<<16) +
+	    ((int_4)buf[0]<<24));
+
+}
+
+
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{   FILE *fp = NULL;
+    int_4 indexO, sampleNameO;
+    int sampleNameLen;
+    int_4 val;
+    int entryNum;
+
+    if (argc != 2)
+    {   fprintf(stderr, "%s: usage\n", argv[0]);
+	exit(1);
+    }
+
+
+    /* Open the gel file: read, binary */
+    if ((fp = fopen(argv[1], "rb")) == NULL)
+    {   fprintf(stderr, "%s: Unable to open file %s\n", argv[0], argv[1]);
+	exit(1);
+    }
+
+
+    /* Get the index offset */
+    if (fseek(fp, IndexPO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    indexO = readABIInt4(fp);
+
+
+    /* Find the sample name entry by searching for the correct label */
+    entryNum = 0;
+    (void) fseek(fp, indexO+(entryNum*IndexEntryLength), 0);
+    while ((val = readABIInt4(fp)) != SampleNameEntryLabel)
+    {   entryNum++;
+	(void) fseek(fp, indexO+(entryNum*IndexEntryLength), 0);
+    }
+
+    /* Get the offset of the actual sample name */
+    (void) fseek(fp, indexO+(entryNum*IndexEntryLength)+LW4, 0);
+    if (readABIInt4(fp) <= 4) {
+	sampleNameO = indexO+(entryNum*IndexEntryLength)+LW5;
+    } else {
+	/* Get the offset of the actual sample name */
+	(void) fseek(fp, indexO+(entryNum*IndexEntryLength)+LW5, 0);
+	sampleNameO = readABIInt4(fp);
+    }
+
+    /* If this is 0, there is The Sample With No Name */
+    if (sampleNameO == (int_4)0)
+    {   fclose(fp);
+	exit(0);
+    }
+
+
+    /*
+        Write the sample name out on the standard output
+	The name is in a BCPL like format -
+	the first byte holds the string length
+    */
+    if (fseek(fp, sampleNameO, 0) != 0)
+    {   fprintf(stderr,"%s: Seek fail\n", argv[0]);
+	exit(1);
+    }
+    sampleNameLen = (int) fgetc(fp);
+    while (sampleNameLen > 0)
+    {   fputc(fgetc(fp), stdout);
+	sampleNameLen--;
+    }
+
+
+    fclose(fp);
+    exit(0);
+}
+
+
+
diff --git a/src/newted/getPrimerPos.c b/src/newted/getPrimerPos.c
new file mode 100644
index 0000000..0476da4
--- /dev/null
+++ b/src/newted/getPrimerPos.c
@@ -0,0 +1,184 @@
+/*
+    Title: 	 autoted
+
+    File: 	 autoted.c
+    Purpose:	 Main module
+    Last update: Thursday 11 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "seqIOABI.h"
+#include "seqIOALF.h"
+#include "seqIOSCF.h"
+#include "seqIOPlain.h"
+#include "mystdlib.h"  /* IMPORT: exit */
+
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   char * plain;
+    char * ABI;
+    char * ALF;
+    char * SCF;
+    char * output;
+    char * enzyme;
+    int bottom;
+    char * raw;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF ) {tracefilename} [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       -output {outputfilename}\n");
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{
+    Arguments args;
+    char *fn;
+    char *o_fn;
+    Seq currSeq;
+    int rc;
+    int count;
+    char *enzString = "CAGGTC";
+
+    /* parse command line arguments here */
+
+    count=1;
+
+    fn = NULL;
+    args.ABI = NULL;
+    args.ALF = NULL;
+    args.SCF = NULL;
+    args.output = NULL;
+    args.plain = NULL;
+    args.bottom = 0;
+    args.raw = 0;
+    args.enzyme = NULL;
+
+    while (count < argc) {
+	char *a = argv[count];
+	if ( *a != '-') {
+            usage(argc, argv);
+fprintf(stderr,"not a - switch\n");
+	    exit(1);
+	}
+
+	if (strcmp(a,"-PLN") == 0) {
+	    args.plain = fn = argv[++count];
+	} else if (strcmp(a,"-ABI") == 0) {
+	    args.ABI = fn = argv[++count];
+	} else if (strcmp(a,"-ALF") == 0) {
+	    args.ALF = fn = argv[++count];
+	} else if (strcmp(a,"-SCF") == 0) {
+	    args.SCF = fn = argv[++count];
+	} else if (strcmp(a,"-output") == 0) {
+	    args.output = o_fn = argv[++count];
+	} else if (strcmp(a,"-bottom") == 0) {
+	    args.bottom = atoi(argv[++count]);
+	} else if (strcmp(a,"-raw") == 0) {
+	    args.raw = argv[++count];
+	} else if (strcmp(a,"-enzyme") == 0) {
+	    args.enzyme = enzString = argv[++count];
+	} else {
+            usage(argc, argv);
+fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+        count++;
+    }
+
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((args.plain==NULL &&
+	 args.ABI==NULL   &&
+	 args.ALF==NULL   &&
+	 args.SCF==NULL))
+    {   
+	usage(argc, argv);
+fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+/*    if ( args.output==NULL )
+    {   
+	usage(argc, argv);
+fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+*/
+
+    if (args.plain != NULL)
+	currSeq = readSeqPlain(fn,args.bottom,enzString);
+    else if (args.ABI != NULL)
+	currSeq = readSeqABI(fn,args.bottom,enzString);    
+    else if (args.ALF != NULL)
+	currSeq = readSeqALF(fn,args.bottom,enzString);
+    else if (args.SCF != NULL)
+	currSeq = readSeqSCF(fn,args.bottom,enzString);
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",fn);
+	exit(1);
+    }
+
+
+/*    rc=writeSeq(currSeq, o_fn, (args.raw == NULL)?fn:args.raw,1);*/
+
+      printf("%s: %d\n",fn,currSeq->info.primerPos);
+
+
+}
+
+
diff --git a/src/newted/help.c b/src/newted/help.c
new file mode 100644
index 0000000..c446e79
--- /dev/null
+++ b/src/newted/help.c
@@ -0,0 +1,128 @@
+/* 
+    Title:       help
+
+    File: 	 help.c
+    Purpose:	 Help
+    Last update: Fri Jun 15 1990
+*/
+
+
+/* ---- Includes ---- */
+#include <stdio.h>
+
+#include "help.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Statics ---- */
+
+static Widget helpShellWid;
+static Boolean helpShellMapped=False;
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void removeCallback(Widget w,
+			   XtPointer client_data, XtPointer call_data)
+{    XtPopdown(helpShellWid);
+     helpShellMapped=False;
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+Widget createHelpShell(Widget toplevelWid)
+/*
+    Create the help shell as a child of `toplevelWid'.
+    The shell is not initially mapped.
+*/
+{   Widget formWid, removeWid, textWid;
+    String fn;
+    Arg args[10];
+    int nargs;
+
+    char apology[] = "Sorry, I couldn't find the \"ted.help\" file.\n\
+This should either be in \"/usr/X11/app-defaults\" or somewhere\n\
+referred to by the \"XFILESEARCHPATH\" environment variable.";
+
+
+    /*
+        The toplevel shell widget holds a form widget
+    */
+    helpShellWid = XtCreatePopupShell("Help", topLevelShellWidgetClass,
+				      toplevelWid,
+				      NULL, (Cardinal) 0);
+
+    formWid = XtCreateManagedWidget("form", formWidgetClass, helpShellWid,
+				    NULL, (Cardinal) 0);
+
+
+    nargs = 0;
+    removeWid = XtCreateManagedWidget("rmButton", commandWidgetClass,
+				      formWid, args, nargs);
+    XtAddCallback(removeWid, XtNcallback, removeCallback, NULL);
+
+
+    /*
+        Find the help file.
+    */
+    fn = XtResolvePathname(XtDisplay(toplevelWid),
+			   "app-defaults",     /* type */
+			   "ted",              /* file name */
+			   ".help",            /* suffix */
+			   NULL,               /* path */
+			   NULL, (Cardinal)0,  /* substitutions */
+			   NULL                /* file predicate */
+			  );
+
+
+    /*
+        If we found the file display it, else an apology.
+    */
+    nargs = 0;
+    if (fn != NULL)
+    {   XtSetArg(args[nargs], XtNtype, XawAsciiFile); nargs++;
+	XtSetArg(args[nargs], XtNstring, fn); nargs++;
+    }
+    else
+    {   XtSetArg(args[nargs], XtNtype, XawAsciiString); nargs++;
+	XtSetArg(args[nargs], XtNstring, apology); nargs++;
+    }
+    XtSetArg(args[nargs], XtNfromVert, removeWid); nargs++;
+    XtSetArg(args[nargs], XtNscrollVertical, XawtextScrollAlways); nargs++;
+    textWid = XtCreateManagedWidget("text", asciiTextWidgetClass,
+				    formWid, args, nargs);
+
+
+    if (fn != NULL) XtFree(fn);
+
+
+    return(helpShellWid);
+}
+
+
+
+
+void help()
+/*
+    Display help modeless dialogue.
+*/
+{   if (!helpShellMapped)
+    {   XtPopup(helpShellWid, XtGrabNone);
+	helpShellMapped = True;
+    }
+}
diff --git a/src/newted/help.h b/src/newted/help.h
new file mode 100644
index 0000000..5778471
--- /dev/null
+++ b/src/newted/help.h
@@ -0,0 +1,38 @@
+#ifndef _help_h
+#define _help_h
+
+
+/* 
+    Title:       help
+
+    File: 	 help.h
+    Purpose:	 Help
+    Last update: Fri Jun 15 1990
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget createHelpShell(Widget toplevelWid);
+/*
+    Create the help shell as a child of `toplevelWid'.
+    The shell is not initially mapped.
+*/
+
+
+extern void help();
+/*
+    Display help modeless dialogue.
+*/
+
+
+#endif  /*_help_h*/
diff --git a/src/newted/info.c b/src/newted/info.c
new file mode 100644
index 0000000..f264a2e
--- /dev/null
+++ b/src/newted/info.c
@@ -0,0 +1,123 @@
+/* 
+    Title:       info
+
+    File: 	 info.c
+    Purpose:	 Info
+    Last update: Tue Nov 10 1992
+*/
+
+
+/* ---- Includes ---- */
+#include <stdio.h>
+
+#include "info.h"
+#include "seq.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Statics ---- */
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+static void get_info_string(Seq seq, char *seq_info)
+{
+    switch (seq->format) {
+	case ABIFormat:
+	case SCFFormat:
+	    if (seq->info != NULL) {
+		sprintf(seq_info,"%s\n",seq->info);
+		break;
+	    }
+	case ALFFormat:
+        default:
+            strcpy(seq_info,"No information for this sequence\n");
+            break;
+    }
+}
+
+/*ARGSUSED*/
+static void
+DestroyPopupPrompt(Widget widget, XtPointer client_data, XtPointer call_data)
+{
+
+    Widget popup = XtParent( (Widget) client_data);
+    XtDestroyWidget(popup);
+}
+
+
+/* ---- Exports ---- */
+
+
+/* ARGSUSED */
+void info(Widget button, Seq seq)
+{
+    Arg		args[5];
+    Widget	popup;
+    Position	x, y;
+    Dimension	width, height;
+    Cardinal	n;
+    Widget apane,apane1,apane2;
+    char seq_info[500];
+
+    if (seq == NULLSeq) return;
+    /*
+     * This will position the upper left hand corner of the popup at the
+     * center of the widget which invoked this callback, which will also
+     * become the parent of the popup. 
+     */
+
+    get_info_string(seq,seq_info);
+
+    n = 0;
+    XtSetArg(args[0], XtNwidth, &width); n++;
+    XtSetArg(args[1], XtNheight, &height); n++;
+    XtGetValues(button, args, n);
+    XtTranslateCoords(button, (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+
+    x -= 350;
+    n = 0;
+    XtSetArg(args[n], XtNx, x);				n++;
+    XtSetArg(args[n], XtNy, y);				n++;
+
+    popup = XtCreatePopupShell("prompt", transientShellWidgetClass, button,
+			       args, n);
+
+    apane = XtCreateManagedWidget("apane", formWidgetClass, popup ,NULL, 0);    
+    apane2 = XtVaCreateManagedWidget("apane2", asciiTextWidgetClass,apane,
+				         XtNeditType, XawtextRead,
+					 XtNwidth, 400,
+				         XtNheight, 80,
+				         XtNstring, seq_info,
+				         XtNscrollVertical, XawtextScrollAlways,
+				      NULL);
+
+    apane1 = XtVaCreateManagedWidget("apane3", commandWidgetClass,apane,
+				         XtNeditType, XawtextEdit,
+					 XtNwidth, 400,
+				         XtNheight, 20,
+				         XtNfromVert, apane2,
+				         XtNlabel, "Cancel",				     
+				      NULL);
+
+    XtAddCallback(apane1, XtNcallback, DestroyPopupPrompt, (XtPointer)apane);
+    XtPopup(popup, XtGrabNone);
+    
+}
+
+
diff --git a/src/newted/info.h b/src/newted/info.h
new file mode 100644
index 0000000..9d4161d
--- /dev/null
+++ b/src/newted/info.h
@@ -0,0 +1,30 @@
+#ifndef _info_h
+#define _info_h
+
+
+/* 
+    Title:       info
+
+    File: 	 info.h
+    Purpose:	 Info
+    Last update: Fri Jun 15 1990
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+extern void info();
+/*
+    Display info modeless dialogue.
+*/
+
+
+#endif  /*_info_h*/
diff --git a/src/newted/mach-io.c b/src/newted/mach-io.c
new file mode 100644
index 0000000..9b78205
--- /dev/null
+++ b/src/newted/mach-io.c
@@ -0,0 +1,238 @@
+/*
+** Machine independant io:
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value suggessfully read or written
+*/
+
+#include <stdio.h>
+#include "mach-io.h"
+
+
+
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+int be_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a big-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[0] = (uint_1) (*i2>>8)&255;
+    buf[1] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(*i4)];
+
+
+    buf[0] = (uint_1) (*i4>>24)&255;
+    buf[1] = (uint_1) (*i4>>16)&255;
+    buf[2] = (uint_1) (*i4>>8)&255;
+    buf[3] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a big-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[1]) +
+         ((uint_2)buf[0]<<8));
+    return (1);
+}
+
+
+
+
+
+int be_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[3]) +
+         ((uint_4)buf[2]<<8) +
+         ((uint_4)buf[1]<<16) +
+         ((uint_4)buf[0]<<24));
+    return (1);
+}
+
+
+
+
+
+
+
+
+
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+int le_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a little-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[1] = (uint_1) (*i2>>8)&255;
+    buf[0] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    buf[3] = (uint_1) (*i4>>24)&255;
+    buf[2] = (uint_1) (*i4>>16)&255;
+    buf[1] = (uint_1) (*i4>>8)&255;
+    buf[0] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a little-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[0]) +
+         ((uint_2)buf[1]<<8));
+    return (1);
+}
+
+
+
+
+
+int le_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[0]) +
+         ((uint_4)buf[1]<<8) +
+         ((uint_4)buf[2]<<16) +
+         ((uint_4)buf[3]<<24));
+    return (1);
+}
diff --git a/src/newted/mach-io.h b/src/newted/mach-io.h
new file mode 100644
index 0000000..5783df3
--- /dev/null
+++ b/src/newted/mach-io.h
@@ -0,0 +1,115 @@
+#ifndef _mach_io_h
+#define _mach_io_h
+/*
+** Machine independant io
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value successfully read or written
+*/
+
+#include <stdio.h>
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+extern int be_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a big-endian int_1
+*/
+
+extern
+int be_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a big-endian int_2
+*/
+
+extern
+int be_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a big-endian int_4
+*/
+
+extern
+int be_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a big-endian int_1
+*/
+
+extern
+int be_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a big-endian int_2
+*/
+
+extern
+int be_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a big-endian int_4
+*/
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+extern
+int le_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a little-endian int_1
+*/
+
+extern
+int le_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a little-endian int_2
+*/
+
+extern
+int le_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a little-endian int_4
+*/
+
+extern
+int le_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a little-endian int_1
+*/
+
+extern
+int le_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a little-endian int_2
+*/
+
+extern
+int le_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a little-endian int_4
+*/
+
+
+#endif /* _mach_io_h */
diff --git a/src/newted/makefile-alpha b/src/newted/makefile-alpha
new file mode 100644
index 0000000..e6707b2
--- /dev/null
+++ b/src/newted/makefile-alpha
@@ -0,0 +1,117 @@
+#
+# Makefile for ted (trace editor) - Alpha fork
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE -DQUAL_CHECK
+
+
+#
+# Programs made by this makefile
+#
+PROGS = ted autoted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= cc
+INCLUDES = -I$(SRC)
+CFLAGS= -g -std1 -ieee_with_inexact $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11 -ldnet_stub
+#
+LD= cc
+LDFLAGS= -g -non_shared
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o\
+	match.o
+
+AUTOTEDOBJS=\
+	autoted.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	match.o\
+	seqRead.o
+
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LIBS)
+autoted: $(AUTOTEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(AUTOTEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/newted/makefile-dec b/src/newted/makefile-dec
new file mode 100644
index 0000000..a1b52b1
--- /dev/null
+++ b/src/newted/makefile-dec
@@ -0,0 +1,118 @@
+#
+# Makefile for ted (trace editor) - DEC Ultrix
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE -DQUAL_CHECK
+
+
+#
+# Programs made by this makefile
+#
+PROGS = ted autoted
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+#
+# Compiler - debug on
+#
+CC= c89
+INCLUDES = -I$(SRC) -I/usr/include/mit
+CFLAGS= -g -common $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+# If you have gcc-2.x, should link with it
+LD= c89
+LDFLAGS= -g
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o\
+	match.o
+
+AUTOTEDOBJS=\
+	autoted.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	match.o
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+autoted: $(AUTOTEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(AUTOTEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/newted/makefile-sgi b/src/newted/makefile-sgi
new file mode 100644
index 0000000..d579521
--- /dev/null
+++ b/src/newted/makefile-sgi
@@ -0,0 +1,119 @@
+#
+# Makefile for ted (trace editor) - SGI fork
+#
+
+SHELL=/bin/sh
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE -DQUAL_CHECK
+
+
+#
+# Programs made by this makefile
+#
+PROGS = ted autoted
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+#
+# Compiler - debug on
+#
+CC= cc
+INCLUDES = -I$(SRC)
+CFLAGS= -g -ansi -prototypes $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+LD= cc
+LDFLAGS= -g
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o\
+	match.o
+
+AUTOTEDOBJS=\
+	autoted.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	match.o
+
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LIBS)
+autoted: $(AUTOTEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(AUTOTEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/newted/makefile-solaris b/src/newted/makefile-solaris
new file mode 100644
index 0000000..e947d87
--- /dev/null
+++ b/src/newted/makefile-solaris
@@ -0,0 +1,120 @@
+#
+# Makefile for ted (trace editor) - Solaris fork
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE -DQUAL_CHECK
+
+
+#
+# Programs made by this makefile
+#
+PROGS = ted autoted
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+INCLUDES = -I$(SRC) -I/usr/openwin/include
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11 -lm
+#
+# If you have gcc-2.x, should link with it
+LD= gcc
+LDFLAGS= -g # -static # causes major dl problems
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	traceType.o\
+	match.o
+
+AUTOTEDOBJS=\
+	autoted.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	match.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+autoted: $(AUTOTEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(AUTOTEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/newted/makefile-sun b/src/newted/makefile-sun
new file mode 100644
index 0000000..6018b03
--- /dev/null
+++ b/src/newted/makefile-sun
@@ -0,0 +1,123 @@
+#
+# Makefile for ted (trace editor) - SunOS fork
+#
+
+#
+# Turning on the AUTO_CLIP switch allows ted to automatically
+# select a left (using the -enzyme option or the .enzyme in the
+# Xted file) and right cutoff on your sequence (using 2 out of
+# 5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+# Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+# Turning on the QUAL_CODE switch enables LaDeana's trace quality
+# clipping code.
+#
+# Turning on the QUAL_CHECK switch (at the same time as QUAL_CODE)
+# enables the overall trace quality check.
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS  -DQUAL_CODE -DQUAL_CHECK
+SWITCHES = -DAUTO_CLIP -DQUAL_CODE -DQUAL_CHECK
+
+
+#
+# Programs made by this makefile
+#
+PROGS = ted autoted
+
+#
+# Where the wild things are
+#
+SRC = ..
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+INCLUDES = -I$(SRC)
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+# If you have gcc-2.x, should link with it
+LD= gcc
+LDFLAGS= -g
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= -static $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	match.o
+
+
+AUTOTEDOBJS=\
+	autoted.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	traceType.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o\
+	match.o
+
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+autoted: $(AUTOTEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(AUTOTEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/newted/match.c b/src/newted/match.c
new file mode 100644
index 0000000..9d6b231
--- /dev/null
+++ b/src/newted/match.c
@@ -0,0 +1,62 @@
+int string_match(seq1, n1, seq2, n2, nmiss, indices)
+
+/*
+  This function may be called once from initialDisplayedSeq if
+  the user has input a string they wish to search for in the
+  input sequence -- this subroutine returns *indices, the first
+  position at which the query sequence (app_resources.astring)
+  matched the input file sequence (currSeq). */
+
+/* a modified version of match.c 
+  finds alignments between a search sequence, seq1, and a target sequence, 
+  seq2, with no gaps (except possibly at ends) 
+   and at most nmiss mismatches (relative to seq1),
+   n1 is the length of seq1 where seq1 is assumed to start at indices 0,
+   n2 is the length of seq2 where seq2 is assumed to start at indices 0 */
+      char *seq1, *seq2;
+      int n1, n2, nmiss;
+      int *indices;
+{
+    int i, j, d, istart, iend, i_miss, n_match;
+    int mtable[100][5];  /* 100 would be, now the total number
+			    of matches the program may find between
+			    a given input string and the sequence */
+
+
+/* d = j - i is the "offset" between the two sequences */
+    if (n1 - n2 > nmiss) return (0);
+    n_match = 0;
+    for (d = -nmiss; d <= n2 + nmiss - n1; d++){
+	if (d < 0)  istart = i_miss = -d;
+	else istart = i_miss = 0;
+	if (d > n2 - n1) {
+	    iend = n2 - d;
+	    i_miss += n1 + d - n2;
+	}
+	else iend = n1;
+	for (i = istart, j = d + i; i < iend; i++, j++)
+	    if (seq1[i] != seq2[j] && ++i_miss > nmiss) goto nextd; 
+        mtable[n_match][0] = d + istart; /* indices (assuming they start at
+					    0) of starting nucleotide in the
+					    searched sequence */
+        mtable[n_match][1] = istart; /* nucleotide position in the query
+					sequence where match starts (assuming
+					query index starts with 0) */
+        mtable[n_match][2] = iend - istart; /* number of nucleotides in the
+					       match */
+	if (mtable[n_match][2]==n1) {
+	  indices[n_match] = mtable[n_match][0];
+	}
+	  
+        mtable[n_match][3] = i_miss; /* number of mismatches */
+	n_match++; /* number of matches */
+	if (n_match >= 100) return(n_match);
+
+    nextd:;
+    }
+
+    return (n_match);
+    /*return(0);*/ /*return 0 if it got to this point; that means
+		 it must have not found an exact match, so 
+		 you want the baseNum to be 0 in that case*/
+}
diff --git a/src/newted/match.h b/src/newted/match.h
new file mode 100644
index 0000000..9e63456
--- /dev/null
+++ b/src/newted/match.h
@@ -0,0 +1,6 @@
+#ifndef _match_h_
+#define _match_h_
+
+extern int string_match(char *seq1, int n1, char *seq2, int n2, int nmiss, int *indices);
+
+#endif /*_match_h_*/
diff --git a/src/newted/mystdlib.h b/src/newted/mystdlib.h
new file mode 100644
index 0000000..e92c64b
--- /dev/null
+++ b/src/newted/mystdlib.h
@@ -0,0 +1,33 @@
+/*
+    Title:       stdlib
+
+    File: 	 mystdlib.h
+    Purpose:	 ANSII C stdlib.h replacement
+    Last update: Thu May 24 1990
+*/
+
+
+/*
+    The SUN does not provide the ANSII C stdlib.h header file, so
+    so this file provides some of the definitions which should be
+    found there.
+
+    Note that we cannot override existing type definitions.
+*/
+
+
+
+
+/* String processing functions */
+extern double strtod(char *str, char **ptr);
+extern double atof(char *str);
+extern long   strtol(char *str, char **ptr, int base);
+
+
+/* Control functions */
+extern void exit(int status);
+
+
+/* Storage allocation */
+extern char *malloc(unsigned size);
+extern char *calloc(unsigned elt_count, unsigned elt_size);
diff --git a/src/newted/opp.c b/src/newted/opp.c
new file mode 100644
index 0000000..7fe9809
--- /dev/null
+++ b/src/newted/opp.c
@@ -0,0 +1,128 @@
+/*
+  Title:  opp.c
+
+  File:   opp.c
+  Purpose: code for complementing sequences
+
+  Last update: Tue Jan 15 1991
+
+  15.01.90 SD  Taken from seqIOEdit.c
+*/
+
+
+/* ---- Imports ---- */
+#include "opp.h"
+#include "seq.h"
+
+
+/* ---- Globals ---- */
+char opp[256]; /* complement of any given base */
+
+/* ---- Exports ---- */
+void oppInitialize()
+{ int i;
+
+  for (i = 0; i<256; i++) opp[i]='-';  
+
+	/* RMD 31/12/90 'N' -> '-' above.
+	   removed 'N' and 'n' entries below and added reciprocal
+	   'K' and 'N' entries as for full Staden table */
+
+  opp['A'] = 'T';
+  opp['G'] = 'C';
+  opp['T'] = 'A';
+  opp['C'] = 'G';
+  opp['a'] = 't';
+  opp['g'] = 'c';
+  opp['t'] = 'a';
+  opp['c'] = 'g';
+  opp['D'] = 'H';
+  opp['H'] = 'D';
+  opp['V'] = 'B';
+  opp['B'] = 'V';
+  opp['K'] = 'N';
+  opp['N'] = 'K';
+  opp['L'] = 'M';
+  opp['M'] = 'L';
+  opp['5'] = '6';
+  opp['6'] = '5';
+  opp['R'] = 'Y';
+  opp['Y'] = 'R';
+  opp['7'] = '7';
+  opp['8'] = '8';
+}
+
+
+void complement_seq(Seq seq)
+/*
+** Complement and reverse bases and traces
+*/
+{
+    int2 temp_int2;
+    TRACE *temp_TRACEptr;
+    char temp_char;
+    int temp_int;
+    int i;
+
+    /* swap */
+#define swap(A,B,I) ( (I)=(A), (A)=(B), (B)=(I) )
+
+    /* complement and reverse traces */
+    if (seq->mode == BasesAndTraces) {
+	/* swap traces A<->T and C<->G */
+	swap(seq->traceA,seq->traceT,temp_TRACEptr);
+	swap(seq->traceC,seq->traceG,temp_TRACEptr);
+	
+	/* reverse points in traces */
+	for (i=0;i<seq->NPoints/2;i++) {
+	    swap(seq->traceA[i],seq->traceA[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceC[i],seq->traceC[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceG[i],seq->traceG[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceT[i],seq->traceT[seq->NPoints-i-1],temp_int2);
+	}
+    }
+    /* complement the sequence */
+    /* handle edited base */
+    for (i=0;i<seq->NedBases;i++) {
+	if (seq->edits[i]<0) {
+	    int k = (-seq->edits[i]);
+	    seq->edBase[k] = opp[seq->edBase[k]];
+	    seq->edBasePos[k] = seq->NPoints - seq->edBasePos[k] - 1;
+	} else {
+	    int k = seq->edits[i];
+	    seq->edits[i] = seq->NorigBases-k-1;
+	    /*
+	     * skip complement and mod of base position until later
+	     * because not every orig base is necessarily represented in the
+	     * edits array
+	     */
+	}
+    }
+
+    /* handle original bases */
+    for (i=0;i<seq->NorigBases;i++) {
+	seq->base[i] = opp[seq->base[i]];
+	seq->basePos[i] = seq->NPoints - seq->basePos[i] - 1;
+    }
+
+    /* reverse sequence */
+    /* edits */
+    for (i=0;i<seq->NedBases/2;i++) {
+	swap(seq->edits[i],seq->edits[seq->NedBases-i-1],temp_int2);
+    }
+    /* orig bases */
+    for (i=0;i<seq->NorigBases/2;i++) {
+	swap(seq->base[i],seq->base[seq->NorigBases-i-1],temp_char);
+	swap(seq->basePos[i],seq->basePos[seq->NorigBases-i-1],temp_int2);
+    }
+
+
+    /* swap cutoffs */
+    swap(seq->leftCutoff,seq->rightCutoff,temp_int);
+
+    /* move caret ??? */
+
+    /* toggle strand */
+    seq->bottom = !seq->bottom;
+}
+
diff --git a/src/newted/opp.h b/src/newted/opp.h
new file mode 100644
index 0000000..0852a95
--- /dev/null
+++ b/src/newted/opp.h
@@ -0,0 +1,37 @@
+#ifndef _opp_h
+#define _opp_h
+
+/*
+  Title:       opp
+
+  File:        opp.h
+  Purpose:     Required for complimenting a sequence
+  Last update: Tue Jan 15 1991
+
+  15.01.90 SD  Taken from seqIOEdit.h
+
+*/
+
+#include "seq.h"
+
+/* ---- Exports ---- */
+
+extern char opp[256]; /* complement of any given base */
+
+extern void oppInitialize();
+
+/* initializes the array which stores the complement 
+   of any of the Staden nucleotides or ambiguity
+   codes */
+
+
+void complement_seq(Seq seq);
+
+/* complement a sequence */
+
+#endif  /*_opp_h*/
+
+
+
+
+
diff --git a/src/newted/scf.h b/src/newted/scf.h
new file mode 100644
index 0000000..adde21d
--- /dev/null
+++ b/src/newted/scf.h
@@ -0,0 +1,89 @@
+/*
+ * File: scf.h
+ * Version:
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *	   Hills Road
+ *	   Cambridge CB2 2QH
+ *	   United Kingdom
+ *
+ * Description: file structure definitions for SCF file
+ *
+ * Created: 19 November 1992
+ * Updated:
+ *
+ */
+
+#ifndef _SCF_H_
+#define _SCF_H_
+
+#include "mach-io.h"
+
+#define SCF_MAGIC (((((uint_4)'.'<<8)+(uint_4)'s'<<8)+(uint_4)'c'<<8)+(uint_4)'f')
+
+#define VERSION 2.00
+
+/*
+ * Type definition for the Header structure
+ */
+typedef struct {
+    uint_4 magic_number;       /* SCF_MAGIC */
+    uint_4 samples;            /* Number of elements in Samples matrix */
+    uint_4 samples_offset;     /* Byte offset from start of file */
+    uint_4 bases;              /* Number of bases in Bases matrix */
+    uint_4 bases_left_clip;    /* Number of bases in left clip (vector)*/
+    uint_4 bases_right_clip;   /* Number of bases in right clip (unreliable) */
+    uint_4 bases_offset;       /* Byte offset from start of file */
+    uint_4 comments_size;      /* Number of bytes in Comment section */
+    uint_4 comments_offset;    /* Byte offset from start of file */
+    char version[4];	     /* "version.revision" */
+    uint_4 sample_size;	     /* precision of samples (in bytes) */
+    uint_4 code_set;	     /* uncertainty codes used */
+    uint_4 spare[20];          /* Unused */
+} Header;
+
+#define CSET_DEFAULT 0  /* {A,C,G,T,-} */
+#define CSET_STADEN  1
+#define CSET_NC_IUB  2
+#define CSET_ALF     3  /* extended NC_IUB */
+#define CSET_ABI     4  /* {A,C,G,T,N} */
+
+/*
+ * Type definition for the Sample data
+ */
+typedef unsigned char byte;
+typedef struct {
+    byte sample_A;           /* Sample for A trace */
+    byte sample_C;           /* Sample for C trace */
+    byte sample_G;           /* Sample for G trace */
+    byte sample_T;           /* Sample for T trace */
+} Samples1;
+typedef struct {
+    unsigned short sample_A;           /* Sample for A trace */
+    unsigned short sample_C;           /* Sample for C trace */
+    unsigned short sample_G;           /* Sample for G trace */
+    unsigned short sample_T;           /* Sample for T trace */
+} Samples2;
+
+/*
+ * Type definition for the sequence data
+ */
+typedef struct {
+    uint_4 peak_index;        /* Index into Samples matrix for base position */
+    byte prob_A;            /* Probability of it being an A */
+    byte prob_C;            /* Probability of it being an C */
+    byte prob_G;            /* Probability of it being an G */
+    byte prob_T;            /* Probability of it being an T */
+    char base;		    /* Base called */
+    byte spare[3];          /* Spare */
+} Bases;
+
+
+/*
+ * Type definition for the comments
+ */
+typedef char Comments;            /* Zero terminated list of \n separated entries */
+
+
+#endif /*_SCF_H_*/
diff --git a/src/newted/seq.c b/src/newted/seq.c
new file mode 100644
index 0000000..53ddd6b
--- /dev/null
+++ b/src/newted/seq.c
@@ -0,0 +1,604 @@
+/* 
+    Title:       seq
+
+    File: 	 seq.c
+    Purpose:	 Sequence data type
+    Last update: Tue Nov 10 1992
+*/
+
+
+/*
+    The Seq data type is designed so that it can hold a varying degree
+    of information about sequences, yet have a single set of calls
+    to access the data.
+
+    The edited sequence has a limit of MaxEdits changes relative to
+    the original sequence. If entry M in the edited sequence is the
+    same as entry N in the original sequence then edits[M]=N. If
+    entry M is changed, then its value is negative and is a reference
+    into the newBase and newBasePos arrays (thus position 0 in these
+    arrays is unused). Initially, all the entries in newBasePos are
+    set to NULLPoint. The representation can be summarised thus
+    (where => is logical implication):
+
+        getNBases(seq, EdBases) = seq->NedBases
+
+        getBase(seq, EdBases, n) =
+	    (seq->edits[n] >= 0) => seq->base[seq->edits[n]]
+	   ~(seq->edits[n] >= 0) => seq->edBase[-(seq->edits[n])]
+
+	baseNumToPoint(seq, EdBases, n) =
+	    (seq->edits[n] >= 0) => seq->basePos[seq->edits[n]]
+	   ~(seq->edits[n] >= 0) => seq->edBasePos[-(seq->edits[n])]
+
+    The main problem with this representation is that an insertion
+    (or deletion) requires, on average, copying of half of the edits
+    array up (or down). Given that we have ~1000 bases this probably
+    OK.
+
+    A problem which has not been considered well enough is how we deal
+    with several bases located at the same point.
+
+    There are plenty of assumptions around that both the number of
+    bases and the number of points will fit into an int2, a short.
+
+    The only changes made by lfw to this module is the addition of
+    two subroutines: isBottom, setBottom.  
+
+    
+    21.05.92 LFW added initialization of quality index and type
+    17.07.92 LFW added primerPos to seq->info.primerPos for ABI...
+              indicates position in gel at which it found the
+	      primer peak
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"      
+
+#include <stdlib.h>
+/* #include "mystdlib.h" *//* IMPORT: malloc, calloc */
+
+
+
+
+/* ---- Exports ---- */
+    
+
+
+
+/* ---- General sequence manipulation ---- */
+
+
+
+
+Seq newSeq(Mode mode, int2 numPoints, int2 numBases)
+/*
+    Allocate a new sequence, with the given sizes.
+    The sequence is initially clean with no bases cutoff.
+    If `mode' is BasesOnly, `numPoints' is ignored.
+    Returns NULLSeq on failure.
+*/
+{   Seq seq = NULLSeq;
+    size_t int2Size = sizeof(int2);
+
+
+    /* Allocate the body of the sequence */
+    if ((seq = (Seq) malloc(sizeof(SeqData))) == NULL) return(NULLSeq);
+
+    /*   
+        Initialise the body, all pointers are set to NULL so we can
+        happily call `freeSeq'
+    */
+    seq->dirty = False;
+    seq->leftCutoff  = 0;
+    seq->rightCutoff = 0;
+    seq->bottom = False;
+
+    seq->traceC    = NULL;
+    seq->traceA    = NULL;
+    seq->traceG    = NULL;
+    seq->traceT    = NULL;
+
+    seq->base      = NULL;
+    seq->basePos   = NULL;
+
+    seq->edits     = NULL;
+    seq->edBase    = NULL;
+    seq->edBasePos = NULL;
+
+    seq->info = NULL;
+
+#ifdef QUAL_CODE
+    /* lfw 21.05.92 
+       initialize quality information */
+    seq->qualType = 0;
+    seq->qualIndex = NULL;
+#endif /*QUAL_CODE*/
+
+    /* Allocate space for the bases */
+    if (((seq->base     =        malloc(numBases))                   == NULL)||
+        ((seq->basePos  =(int2 *)calloc(numBases,int2Size))          == NULL)||
+	((seq->edits    =(int2 *)calloc(numBases+MaxEdits,int2Size)) == NULL)||
+	((seq->edBase   =        malloc(MaxEdits))                   == NULL)||
+        ((seq->edBasePos=(int2 *)calloc(MaxEdits,int2Size))          == NULL))
+    {
+	freeSeq(seq);
+	return(NULLSeq);
+    }
+
+    /* If needed, allocate space for the traces */
+    if (mode == BasesAndTraces)
+	if (((seq->traceC   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceA   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceG   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceT   =(TRACE *)calloc(numPoints,int2Size))         == NULL)
+#ifdef QUAL_CODE
+	    || ((seq->qualIndex  =(float *)calloc(numBases,sizeof(float)))          == NULL)
+#endif /*QUAL_CODE*/
+	    )
+	    {
+		freeSeq(seq);
+		return(NULLSeq);
+	    }
+    
+    return(seq);
+}
+
+
+
+
+void freeSeq(Seq seq)
+/*
+    Free a sequence created by one of the above routines.
+    
+    ANSI C is happy to free a NULL pointer.
+*/
+{   if (seq == NULLSeq) return;
+
+    if (seq->traceC != NULL) free(seq->traceC);
+    if (seq->traceA != NULL) free(seq->traceA);
+    if (seq->traceG != NULL) free(seq->traceG);
+    if (seq->traceT != NULL) free(seq->traceT);
+
+    if (seq->base    != NULL) free(seq->base);
+    if (seq->basePos != NULL) free(seq->basePos);
+
+#ifdef QUAL_CODE
+    /* lfw 21.05.92 */
+    if (seq->qualIndex    != NULL) free(seq->qualIndex);
+#endif /*QUAL_CODE*/
+
+    if (seq->edits     != NULL) free(seq->edits);
+    if (seq->edBase    != NULL) free(seq->edBase);
+    if (seq->edBasePos != NULL) free(seq->edBasePos);
+
+    if (seq->info != NULL) free(seq->info);
+
+    free(seq);
+}
+
+
+
+
+/* ---- General sequence information ---- */
+
+
+
+
+Boolean isDirty(Seq seq)
+/*
+    Indicate if the sequence has been edited.
+*/
+{   return(seq->dirty);
+}
+
+
+
+
+void setDirty(Seq seq, Boolean b)
+/*
+    Set the sequence to the state of cleanliness indicated by `b'.
+*/
+{   seq->dirty = b;
+}
+
+/* isBottom and setBottom were added by lfw when the new element
+was added to seq. seq->bottom */
+
+
+Boolean isBottom(Seq seq)
+/*
+    Indicate if the sequence we're working with is bottom strand
+*/
+{   return(seq->bottom);
+}
+
+
+
+
+void setBottom(Seq seq, Boolean b)
+/*
+    Set the sequence to the state bottom if b is true and
+    top if b is false 
+*/
+{   seq->bottom = b;
+}
+
+
+
+
+Boolean isBasesOnly(Seq seq)
+/*
+    Indicate if the sequence only contains bases, or if it
+    contains bases and traces.
+*/
+{   return((seq->mode)==BasesOnly);
+}
+
+
+
+
+/* ---- Trace information ---- */
+
+
+
+
+int getNPoints(Seq seq)
+/*
+    Return the number of points in this sequence.
+*/
+{   return(seq->NPoints);
+}
+
+
+
+
+void getTraces(Seq seq, char base, int p0, int pN, int *traces)
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+{   int p;
+    int i;
+    TRACE *t;
+
+    switch (base)
+    {   case 'C': t = seq->traceC; break;
+        case 'A': t = seq->traceA; break;
+        case 'G': t = seq->traceG; break;
+        case 'T': t = seq->traceT; break;
+    }
+
+    p = p0;
+    i = 0;
+    while (p<=pN)
+    {   traces[i] = t[p];
+	p++;
+	i++;
+    }
+}
+
+
+#ifdef QUAL_CODE
+void getQualIndex(Seq seq, int p0, int pN, float *index)
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+{   int p;
+    int i;
+    float *t;
+    int b,bN;
+    WhichBases whichBases=EdBases;
+
+    t=seq->qualIndex;
+
+    p = p0;
+    i = 0;
+
+    b = pointToBaseNum(seq, whichBases, p0);
+    bN = pointToBaseNum(seq, whichBases, pN);
+
+    for (i=0; i<=pN-p0; i++)
+      index[i]=0;
+
+    i=0;
+
+    while (b<=bN-1)
+    {   index[i] = t[b];
+        printf("index[i] is %f\n",index[i]);
+        b++;
+        /* i should increase by the point distance between base b and b+1 */
+        i+=baseNumToPoint(seq,whichBases,b)-baseNumToPoint(seq,whichBases,b-1);
+    }
+}
+#endif /*QUAL_CODE*/
+
+
+
+
+
+TRACE getMaxTraceVal(Seq seq)
+/*
+    Return the maximum value from anywhere in any of the traces
+    from the sequence `seq'.
+*/
+{   return(seq->maxTraceVal);
+}
+
+
+
+
+/* ---- Base information ---- */
+
+
+
+
+int getNBases(Seq seq, WhichBases which)
+/*
+    Return the number of bases in the `which' part of `seq'
+*/
+{   return((which == OrigBases) ? seq->NorigBases : seq->NedBases);
+}
+
+
+void getCutoffs(Seq seq, int *leftCutoff, int *rightCutoff)
+/*
+    Return the number of characters to be cut off at either end.
+*/
+{   *leftCutoff  = seq->leftCutoff;
+    *rightCutoff = seq->rightCutoff;
+}
+
+
+
+char getBase(Seq seq, WhichBases which, int baseNum)
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base found there.
+*/
+{   return((which == OrigBases)
+               ? (seq->base[baseNum])
+               : (seq->edits[baseNum] >= 0)
+	           ? seq->base[seq->edits[baseNum]]
+	           : seq->edBase[-(seq->edits[baseNum])]
+	  );
+}
+
+
+int baseNumToPoint(Seq seq, WhichBases which, int baseNum)
+/*
+   Given `baseNum' as the number of a base in `which' part of
+   sequence `seq', return its point location. NULLPoint is
+   returned if `baseNum' was invalid.
+*/
+{    return((which == OrigBases)
+
+                ? /* Original sequence */
+	          (baseNum<0 || baseNum>seq->NorigBases-1)
+                      ? NULLPoint
+                      : seq->basePos[baseNum]
+
+	        : /* Edited sequence */
+	          (baseNum<0 || baseNum>seq->NedBases-1)
+                      ? NULLPoint
+	              : (seq->edits[baseNum] >= 0)
+                            ? seq->basePos[seq->edits[baseNum]]
+	                    : seq->edBasePos[-(seq->edits[baseNum])]
+	   );
+	    
+}
+
+
+
+
+int pointToBaseNum(Seq seq, WhichBases which, int point)
+/*
+    Given `point' as a point in the `which' part of sequence `seq',
+    return the base number of the next base to the right (or on
+    the point), or NULLBaseNum if there isn't one.
+*/
+{   int l,r,N,m;
+
+    if (point<0 || point>seq->NPoints-1) return(NULLBaseNum);
+
+    /*
+         Let's use a groovy binary search.
+	 Roughly, we wish to find an m (0..NBases-1) such that:
+	     baseNumToPoint(m]>=point & baseNumToPoint(m-1]<point
+
+	 Let l and r indicate two sections of the array, LS and RS. We
+	 have the following predicates:
+	 NotInLS   == 0<=j<l: baseNumToPoint(j)<point
+	 MaybeInRS == r<=k<N: baseNumToPoint(k)>=point
+
+	 Maintaining these two, we adjust l and r (using a binary
+	 chop) such that the complete array is spanned by these
+	 sections.
+    */
+
+    /* Start off with the two sections empty */
+    N = (which == OrigBases) ? seq->NorigBases : seq->NedBases;
+    l=0; r=N; /* NotinLS & MaybeInRS */
+
+    while (l<r)
+    {   /* NotinLS & MaybeInRS & l<r */
+        m=(l+r)/2; /* l<=m<r */
+	if (baseNumToPoint(seq,which,m)<point)
+	    l=m+1; /* NotinLS & MaybeInRS & l<=r */
+	else
+	    r=m;   /* NotinLS & MaybeInRS & l<=r */
+    }
+
+    /* 
+        NotinLS & MaybeInRS & l=r
+        ==
+	0<=j<l=r<=k<N: baseNumToPoint(j)<point & baseNumToPoint(k)>=point
+    */
+    if (r==N)
+        return(NULLBaseNum);
+    else
+        return(r);
+}
+
+
+
+
+/* ---- Base editing ---- */
+
+
+
+
+Boolean deleteBase(Seq seq, int baseNum)
+/*
+    Delete the base `baseNum' from the sequence `seq'.
+*/
+{   int i;
+
+    /* Fail if base number bad */
+    if ((baseNum<0) || (baseNum>(seq->NedBases-1))) return(False);
+
+    /* Fail if in a cutoff region */
+    if ((baseNum <= seq->leftCutoff-1) ||
+        (baseNum > seq->NedBases-1-seq->rightCutoff)) return(False);
+
+    seq->dirty = True;
+
+    /* If this was a base we added, then free its slot */
+    if (seq->edits[baseNum]<0) 
+        seq->edBasePos[-(seq->edits[baseNum])] = NULLPoint;
+
+    /* Shuffle all the entries in `edits' down one. */
+    for (i=baseNum+1;i<seq->NedBases;i++)
+        seq->edits[i-1] = seq->edits[i];
+
+    seq->NedBases--;
+    return(True);        
+}
+
+
+
+
+Boolean insertBase(Seq seq, char base, int baseNum)
+/*
+    Insert the base `base' to the right of `baseNum'. Its position
+    is halfway between its neighbours, except when it would be
+    replacing a deleted base from the original sequence, in which
+    case it is placed there.
+
+    If `baseNum' equals -1 `base' is added to the head of the
+    sequence.
+*/
+{   int i;
+    int prevBasePos, nextBasePos;
+    int origBaseNum, origBasePos;
+
+
+    /* Fail if bad base number */
+    if ((baseNum<-1) || (baseNum>(seq->NedBases-1))) return(False);
+
+    /* Fail if in a cutoff region */
+    if ((baseNum < seq->leftCutoff-1) ||
+        (baseNum > seq->NedBases-1-seq->rightCutoff)) return(False);
+
+    seq->dirty = True;
+
+
+    /* 
+        Before we alter anything, find what will become the
+	surrounding positions
+    */
+    prevBasePos = (baseNum == -1)
+                  ? -1
+		  : baseNumToPoint(seq, EdBases, baseNum);
+    nextBasePos = (baseNum == (seq->NedBases-1))
+                  ? seq->NPoints-1
+		  : baseNumToPoint(seq, EdBases, baseNum+1);
+
+
+    /*
+        Shuffle all the entries in `edits' up one
+    */
+    for (i=seq->NedBases-1;i>baseNum;i--)
+        seq->edits[i+1] = seq->edits[i];
+    seq->NedBases++;
+
+
+    /*
+        Look in the original sequence to see if there is a base
+	which lies between prevBasePos and nextBasePos.
+    */
+    origBaseNum = pointToBaseNum(seq, OrigBases, prevBasePos+1);
+    origBasePos = baseNumToPoint(seq, OrigBases, origBaseNum);
+
+    if ((origBaseNum!=NULLBaseNum) &&
+	(prevBasePos<origBasePos)  &&
+	(origBasePos<nextBasePos)
+       )
+    {   char origBase = getBase(seq, OrigBases, origBaseNum);
+	
+	if (origBase == base)
+	{   /* We are reintroducing an existing base */
+	    seq->edits[baseNum+1] = origBaseNum;
+	}
+	else
+	{   int slot;
+
+	    /* We can consider this a replacement, at the same position */
+
+	    /* Find a free slot in the newBasePos (and thus newBase) table */
+	    for (slot=1;
+		 (slot<MaxEdits)&&(seq->edBasePos[slot]!=NULLPoint);
+		 slot++)
+	        if (slot==MaxEdits) /* All entries used */ return(False);
+
+	    /* Fill in the edBasePos and edBase entries */
+	    seq->edBasePos[slot] = origBasePos;
+	    seq->edBase[slot] = base;
+
+	    /* Put in the (negative) reference to this slot */
+	    seq->edits[baseNum+1] = -slot;
+	}
+    }
+    else
+    {   int slot;
+
+	/* Find a free slot in the newBasePos (and thus newBase) table */
+        for (slot=1;(slot<MaxEdits)&&(seq->edBasePos[slot]!=NULLPoint);slot++)
+	if (slot==MaxEdits) /* All entries used */ return(False);
+
+	/* Fill in the edBasePos and edBase entries */
+	seq->edBasePos[slot] = (prevBasePos + nextBasePos) / 2;
+	seq->edBase[slot] = base;
+
+	/* Put in the (negative) reference to this slot */
+	seq->edits[baseNum+1] = -slot;
+    }
+    
+
+    return(True);        
+}
+
+
+
+
+Boolean setCutoffs(Seq seq, int leftCutoff, int rightCutoff)
+/*
+    Set the cutoffs to be the indicated number of characters from
+    either end of the sequence. The result indicates success.
+*/
+{   if ((leftCutoff+rightCutoff) > seq->NedBases)
+        return(False);
+    else
+    {   seq->leftCutoff  = leftCutoff;
+	seq->rightCutoff = rightCutoff;
+	return(True);
+    }
+}
diff --git a/src/newted/seq.h b/src/newted/seq.h
new file mode 100644
index 0000000..db0afd5
--- /dev/null
+++ b/src/newted/seq.h
@@ -0,0 +1,302 @@
+#ifndef _seq_h
+#define _seq_h
+
+
+/* 
+    Title:       seq
+
+    File: 	 seq.h
+    Purpose:	 Sequence data type
+    Last update: Tue Nov 10 1992
+*/
+
+
+/*
+    This module encodes the `Seq' sequence data structure.
+
+    A `Seq' contains information about bases and traces which are layed
+    out along a single dimension of points. The number of points in a
+    paricular sequence is given by `getNPoints', and these are numbered
+    0..getNPoints-1. At each point there are four trace readings, one
+    for each base.
+
+    There are two sequences of bases: the `original', and the
+    `editable'. There have `getNorigBases' and `getNedBases' respectively,
+    which are numbered 0..N-1. Bases are represented by `char's. Every
+    base is located at a particular point.
+
+    When written out, the list of bases is trimmed by a left and a right
+    cutoff, which indicate the number of bases to remove from either
+    end. Editing cannot be performed in the cutoff regions.
+
+    The behaviour of these routines is undefined if given NULLSeq or
+    an undefined sequence.
+
+    SD. Added information field to seq data structure
+
+    21.05.92 LFW added trace quality stuff to the sequence structure
+    17.05.92 LFW added Seq_info->primerPos for primer position information
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+
+/* ---- Private ---- */
+
+#define MaxEdits 200
+
+
+typedef short int2; /* Two byte integers  */
+typedef int   int4; /* Four byte integers */
+typedef unsigned short TRACE; /* for trace heights */
+
+#define MaxInt2 (32767) /* An estimate of the lower bound */
+
+
+typedef enum
+{   BasesOnly,
+    BasesAndTraces
+} Mode;
+
+typedef enum
+{
+    ABIFormat,
+    ALFFormat,
+    PlainFormat,
+    SCFFormat
+} Format;
+
+typedef struct
+{   Mode        mode;
+    Boolean     dirty; /* Have any changes been made? */
+    Boolean     bottom; /* are we looking at the top or
+			   bottom strand? */
+    Format	format;
+
+    int         NPoints;    /* No. of points of data */
+    int         NorigBases; /* No. of bases in the original sequence */
+    int         NedBases;   /* No. of bases in the edited sequence */
+
+    /* Traces */
+    TRACE      *traceC;      /* Array of length `NPoints' */
+    TRACE      *traceA;      /* Array of length `NPoints' */
+    TRACE      *traceG;      /* Array of length `NPoints' */
+    TRACE      *traceT;      /* Array of length `NPoints' */
+    TRACE       maxTraceVal; /* The maximal value in any trace */
+
+#ifdef QUAL_CODE
+    /* Trace quality */
+    float       *qualIndex;   /* Array of length 'NorigBases' */
+    int         qualType;
+#endif /*QUAL_CODE*/
+
+
+    /* Original bases */
+    char       *base;    /* Array of length `NorigBases' */
+    int2       *basePos; /* Array of length `NorigBases' */
+
+    /* Edited bases */
+    int2       *edits;     /* Array of length `NorigBases+MaxEdits' */
+    char       *edBase;    /* Array of length `MaxEdits' */
+    int2       *edBasePos; /* Array of length `MaxEdits' */
+
+    /* Cutoffs */
+    int leftCutoff;  /* Number of unwanted bases */
+    int rightCutoff; /* Number of unwanted bases */
+
+    /* Miscellaneous Sequence Information */
+    char *info; /* misc seq info */
+
+} SeqData, *Seq;
+
+
+extern Seq newSeq(Mode mode, int2 numPoints, int2 numBases);
+/*
+    Allocate a new sequence, with the given sizes.
+    The sequence is initially clean with no bases cutoff.
+    If `mode' is BasesOnly, `numPoints' is ignored.
+    Returns NULLSeq on failure.
+*/
+
+
+
+
+/* ---- Exports ---- */
+
+
+#define NULLSeq     ((Seq)NULL)
+#define NULLBaseNum (-1)
+#define NULLPoint   (-1)
+
+
+typedef enum
+{   OrigBases,
+    EdBases
+} WhichBases;
+    
+
+
+
+/* ---- General sequence manipulation ---- */
+
+
+extern void freeSeq(Seq seq);
+/*
+    Free a sequence.
+    Does nothing if given NULLSeq.
+*/
+
+
+/* ---- General sequence information ---- */
+
+
+extern Boolean isDirty(Seq seq);
+/*
+    Indicate if the sequence has been edited since last set clean.
+*/
+
+
+extern void setDirty(Seq seq, Boolean b);
+/*
+    Set the sequence to the state of cleanliness indicated by `b'.
+*/
+
+extern Boolean isBottom(Seq seq);
+/*
+    Indicate if we're looking at bottom strand 
+*/
+
+
+extern void setBottom(Seq seq, Boolean b);
+/*
+  Set the sequence to b - true if bottom, false if top strand
+*/
+
+
+extern Boolean isBasesOnly(Seq seq);
+/*
+    Indicate if the sequence only contains bases, or if it
+    contains bases and traces.
+*/
+
+
+/* ---- Trace information ---- */
+
+
+extern int getNPoints(Seq seq);
+/*
+    Return the number of points in this sequence.
+*/
+
+
+extern void getTraces(Seq seq, char base, int p0, int pN, int *traces);
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+
+#ifdef QUAL_CODE
+extern void getQualIndex(Seq seq, int p0, int pN, float *index);
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+#endif /*QUAL_CODE*/
+
+
+
+extern TRACE getMaxTraceVal(Seq seq);
+/*
+    Return the maximum value from anywhere in any of the traces
+    from the sequence `seq'.
+*/
+
+
+/* ---- Base information ---- */
+
+
+extern int getNBases(Seq seq, WhichBases which);
+/*
+    Return the number of bases in the `which' part of `seq'
+*/
+
+
+extern void getCutoffs(Seq seq, int *leftCutoff, int *rightCutoff);
+/*
+    Return the number of characters to be cut off at either end.
+*/
+
+
+extern char getBase(Seq seq, WhichBases which, int baseNum);
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base found there.
+*/
+
+/*extern int getBasePos(Seq seq, WhichBases which, int baseNum);*/
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base position found there.
+*/
+
+
+
+extern int baseNumToPoint(Seq seq, WhichBases which, int baseNum);
+/*
+   Given `baseNum' as the number of a base in `which' part of
+   sequence `seq', return its point location. NULLPoint is
+   returned if `baseNum' was invalid.
+*/
+
+
+extern int pointToBaseNum(Seq seq, WhichBases which, int point);
+/*
+    Given `point' as a point in the `which' part of sequence `seq',
+    return the base number of the next base to the right (or on
+    the point), or NULLBaseNum if there isn't one.
+*/
+
+
+/* ---- Base editing ---- */
+
+
+extern Boolean deleteBase(Seq seq, int baseNum);
+/*
+    Delete the base `baseNum' from the sequence `seq'.
+    The result indicates success.
+*/
+
+
+extern Boolean insertBase(Seq seq, char base, int baseNum);
+/*
+    Insert the base `base' to the rigt of `baseNum'. Its position
+    is halfway between its neighbours, except when it would be
+    replacing a deleted base from the original sequence, in which
+    case it is placed there.
+
+    If `baseNum' equals -1 `base' is added to the head of the
+    sequence.
+
+    The result indicates success.
+*/
+
+
+extern Boolean setCutoffs(Seq seq, int leftCutoff, int rightCutoff);
+/*
+    Set the cutoffs to be the indicated number of characters from
+    either end of the sequence. The result indicates success.
+*/
+
+
+#endif  /*_seq_h*/
diff --git a/src/newted/seqIOABI.c b/src/newted/seqIOABI.c
new file mode 100644
index 0000000..5aeba3a
--- /dev/null
+++ b/src/newted/seqIOABI.c
@@ -0,0 +1,430 @@
+/* 
+  Title:       seqIOABI
+  
+  File: 	 seqIOABI.c
+  Purpose:	 IO of ABI sequences
+  Last update: Tue Nov 10 1992
+  
+  Change log:
+  27/11/90 SD     writeSeqABI() outputs header to sequence file:
+  format: ;{noOfBases}{leftCutOff}{basesWritten}{type}{tra
+  cefile}
+  eg:     ;   867    45    383ABI a09b7.s1RES
+  28.11.90 SD  put undesirables under STLOUIS compilation flag
+  11.12.90 SD  new static function tail to find file name in path name
+  02.01.91 SD  Merged with St.L version
+  15.01.91 SD  New include added (opp.h)
+  30.07.91 SD  Those ole FWO_ field blues
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  25.10.91 SD  Machine independant I/O...removed BIGENDIAN flag
+  21.07.92 LFW Added finding of primer position 
+  11.11.92 LFW added section to actually check that the trace it
+               is trying to open is an ALF file using traceType sub
+  10.11.92 SD  FWO_ and S/N% interpretation. Comments for information
+  window.
+  
+  */
+
+
+/*
+  This module should be regarded as part of `seq' since it is
+  privy to the internal structure of `Seq'.
+  
+  At the moment, the `maxTraceVal' of the sequence is hardwired
+  as 1200, but I suspect it is encoded somewhere in the ABI file
+  structure, or if not, we should calculate it as we read the
+  traces in.
+  
+  The ABI results file is controlled by an index found towards
+  the end --- this is pointed to by a longword found at `IndexPO'.
+  The index consists of a number of entries, each of which is
+  four character label followed by 6 long words. The first of these
+  long words holds a simple count (starting at 1) for those cases
+  where there are multiple entries with the same label. Entries should
+  be found by label (and count), rather than their index position,
+  because entries can be ommited or new ones added. This happens when
+  ABI changes the version of their software and also depending
+  on whether the data was analysed or unalaysed. We do, however,
+  make assumptions about the relative order of entries.
+  
+  Ideally we would have a separate module which provides a number
+  of functions to extract the data we are interested in, keeping
+  the ABI format well wrapped up and out of harms way.
+  
+  Note that we are relying on the endian-ness of the machine being
+  appropriate so we can just read long words in as integers. This
+  should be recoded to deal with running on different endians.
+  
+  All references to the seq->bottom, the bottom strand of the
+  sequence, were added by lfw
+  */
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+#include "seqIOABI.h"
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+#include "fpoint.h"    /* IMPORT: int_to_float */
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include <stdlib.h>
+
+#include "mach-io.h" /* IMPORT: machine independant io library */
+#include "dialogues.h"
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+
+/*
+  The index is located towards the end of the ABI trace file.
+  It's location is given by a longword at a fixed place.
+  */
+#define IndexPO ((off_t)26)
+
+#define IndexEntryLength 28
+
+
+/*
+  Here are some labels we will be looking for, four chars packed
+  into a long word.
+  */
+#define DataEntryLabel    ((int_4) ((((('D'<<8)+'A')<<8)+'T')<<8)+'A')
+#define BaseEntryLabel    ((int_4) ((((('P'<<8)+'B')<<8)+'A')<<8)+'S')
+#define BasePosEntryLabel ((int_4) ((((('P'<<8)+'L')<<8)+'O')<<8)+'C')
+#define SpacingEntryLabel ((int_4) ((((('S'<<8)+'P')<<8)+'A')<<8)+'C')
+#define SignalEntryLabel  ((int_4) ((((('S'<<8)+'/')<<8)+'N')<<8)+'%')
+#define FWO_Label         ((int_4) ((((('F'<<8)+'W')<<8)+'O')<<8)+'_')
+#define MCHNLabel         ((int_4) ((((('M'<<8)+'C')<<8)+'H')<<8)+'N')
+#define PDMFLabel         ((int_4) ((((('P'<<8)+'D')<<8)+'M')<<8)+'F')
+#define SMPLLabel         ((int_4) ((((('S'<<8)+'M')<<8)+'P')<<8)+'L')
+#define PPOSLabel         ((int_4) ((((('P'<<8)+'P')<<8)+'O')<<8)+'S')
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+
+static Boolean getIndexEntryLW(FILE *fp, off_t indexO,
+			       uint_4 label, uint_4 count, int lw,
+			       uint_4 *val)
+/*
+  From the ABI results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th long word
+  from the `count'th entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel, entryLw1;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) return(-1);
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	    if (!be_read_int_4(fp, &entryLw1))   return(False);
+	} while (!(entryLabel == label && entryLw1 == count));
+    
+    for(i=2; i<=lw; i++)
+	if (!be_read_int_4(fp, val)) return(-1);
+    
+    return(True);
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqABI(char *fn)
+/*
+  Read the ABI format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.  If bottom 
+  is True, we read the sequence in in reverse and complemented 
+  A NULL result indicates failure.
+  
+  In this module all references to the bottom strand were 
+  inserted by lfw.
+  */
+{   Seq seq = NULLSeq;
+    uint_4 fwo_; /* base -> lane mapping */
+    FILE *fp;
+    uint_4 numPoints, numBases;
+    uint_4 signalO;
+
+    uint_4 indexO;   /* File offset where the index is */
+    uint_4 baseO;    /* File offset where the bases are stored */
+    uint_4 basePosO; /* File offset where the base positions are stored */
+    uint_4 dataCO;   /* File offset where the C trace is stored */
+    uint_4 dataAO;   /* File offset where the A trace is stored */
+    uint_4 dataGO;   /* File offset where the G trace is stored */
+    uint_4 dataTO;   /* File offset where the T trace is stored */
+    uint_4 MCHN_O;   /* File offset where the machine name is stored */
+    uint_4 PDMF_O;   /* File offset where the dye primer guff is stored */
+    /*
+      uint_4 SMPL_O;   /. File offset where the sample name is stored ./
+    */
+    
+    int i;
+    
+    
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    
+    /* Get the index offset */
+    if ((fseek(fp, IndexPO, 0) != 0) || (!be_read_int_4(fp, &indexO)))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Get the number of points */
+    if (!getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,9,3,&numPoints))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Get the number of bases */
+    if (!getIndexEntryLW(fp,(off_t)indexO,BaseEntryLabel,1,3,&numBases))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, numPoints, numBases)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    seq->mode        = BasesAndTraces;
+    seq->format      = ABIFormat;
+    seq->dirty       = False;
+    seq->maxTraceVal = 1200;
+    seq->NPoints     = numPoints;
+    seq->NorigBases  = numBases;
+    seq->NedBases    = numBases;
+    seq->bottom      = False;
+    
+    
+    /*************************************************************
+     *
+     *************************************************************/
+    
+    /*
+     ** The order of the DATA fields is determined by the field FWO_
+     ** Juggle around with data pointers to get it right
+     */
+    {
+	uint_4 *dataxO[4];
+	
+	dataxO[0] = &dataCO;
+	dataxO[1] = &dataAO;
+	dataxO[2] = &dataGO;
+	dataxO[3] = &dataTO;
+	
+	/* Get the Freak World Out (FWO?) field ... */
+	if (!getIndexEntryLW(fp,(off_t)indexO,FWO_Label,1,5,&fwo_))
+	    {   fclose(fp);
+		return(NULLSeq);
+	    }
+#define baseIndex(B) ((B)=='C'?0:(B)=='A'?1:(B)=='G'?2:3)
+	/*Get the positions of the four traces */
+	if (! ( getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,9 ,5,dataxO[baseIndex((char)(fwo_>>24&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,10,5,dataxO[baseIndex((char)(fwo_>>16&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,11,5,dataxO[baseIndex((char)(fwo_>>8&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,12,5,dataxO[baseIndex((char)(fwo_&255))])
+	       )
+	    )    
+	    {   fclose(fp);
+		freeSeq(seq);
+		return(NULLSeq);
+	    }
+	
+    }
+    
+    
+    /* Read in the C trace */
+    if (fseek(fp, (off_t)dataCO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &(seq->traceC[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    /* Read in the A trace */
+    if (fseek(fp, (off_t)dataAO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceA[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    /* Read in the G trace */
+    if (fseek(fp, (off_t)dataGO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceG[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    
+    /* Read in the T trace */
+    if (fseek(fp, (off_t)dataTO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceT[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    
+    
+    /* Read in the bases */
+    if (! (getIndexEntryLW(fp,(off_t)indexO,BaseEntryLabel,1,5,&baseO) &&
+           (fseek(fp, (off_t)baseO, 0) == 0)
+	   )
+	)
+	{   fclose(fp);
+	    freeSeq(seq);
+	    return(NULLSeq);
+	}
+    for (i=0;i<(seq->NorigBases);i++)
+	{   int ch;
+	    
+	    if ((ch = fgetc(fp)) == EOF)
+		{   fclose(fp);
+		    freeSeq(seq);
+		    return(NULLSeq);
+		}
+	    seq->base[i] = (ch == 'N') ? '-' : ch;
+	}
+    
+    
+    
+    
+    /* Read in the base positions */
+    if (!(getIndexEntryLW(fp,(off_t)indexO,BasePosEntryLabel,1,5,&basePosO) &&
+          (fseek(fp, (off_t)basePosO, 0) == 0)
+	  )
+	)
+	{    fclose(fp);
+	     freeSeq(seq);
+	     return(NULLSeq);
+	 }
+    
+    for (i=0;i<(seq->NorigBases);i++)
+	{   if (!be_read_int_2(fp, (uint_2 *)&seq->basePos[i]))
+		{ fclose(fp); freeSeq(seq); return(NULLSeq); }
+	}
+    
+    
+    /*************************************************************
+     * Gather useful information
+     *************************************************************/
+    {
+	char comment[1024];
+	char line[128];
+	int4 spacing;
+	int4 ppos;
+	
+	*comment = '\0';
+	
+	
+	/* Get Signal Strength Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,SignalEntryLabel,1,5,&signalO)) {
+	    int2 C,A,G,T;
+	    int2 *base[4];
+	    base[0] = &C;
+	    base[1] = &A;
+	    base[2] = &G;
+	    base[3] = &T;
+	    if (fseek(fp, (off_t)signalO, 0) >= 0 &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>24&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>16&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>8&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_&255))])) {
+		sprintf(line,"avg_signal_strength = C:%d A:%d G:%d T:%d\n",C,A,G,T);
+		strcat(comment,line);
+	    }
+	}
+
+	/* Get the spacing.. it's a float but don't worry yet */
+	if (getIndexEntryLW(fp,(off_t)indexO,SpacingEntryLabel,1,5,(uint_4*)&spacing)) {
+	    sprintf(line,"avg_spacing = %6.2f\n",int_to_float(spacing));
+	    strcat(comment,line);
+	}
+
+	
+	/* Get primer position */
+	if (getIndexEntryLW(fp,(off_t)indexO,PPOSLabel,1,5,(uint_4 *)&ppos)) {
+	    sprintf(line,"primer_position = %d\n",(ppos>>16)); /* ppos stores in MBShort of pointer */
+	    strcat(comment,line);
+	}
+
+	/* Get Machine Name Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,MCHNLabel,1,5,&MCHN_O)) {
+	    if (fseek(fp, (off_t)MCHN_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/* first byte is a length */
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"machine_name = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+	
+	/* Get Dye Primer Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,PDMFLabel,1,5,&PDMF_O)) {
+	    if (fseek(fp, (off_t)PDMF_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/* first byte is a length */
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"dye_primer = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+
+	/* Get Sample Name Offset */
+	/*
+         * The code here needs to be a bit more clever
+	 * When sample name length is less than (or equal?) four, the
+	 * string is packed into the SMPL_O field
+	if (getIndexEntryLW(fp,(off_t)indexO,SMPLLabel,1,5,&SMPL_O)) {
+	    if (fseek(fp, (off_t)SMPL_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/. first byte is a length ./
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"sample_name = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+	*/
+
+	
+	/* dumplicate string and set info */
+	{
+	    char *s = (char *)malloc(strlen(comment)+1);
+	    strcpy(s,comment);
+	    seq->info = s;
+	}
+    }
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+}
diff --git a/src/newted/seqIOABI.h b/src/newted/seqIOABI.h
new file mode 100644
index 0000000..6e9aeca
--- /dev/null
+++ b/src/newted/seqIOABI.h
@@ -0,0 +1,39 @@
+#ifndef _seqIOABI_h
+#define _seqIOABI_h
+
+
+/* 
+    Title:       seqIOABI
+
+    File: 	 seqIOABI.h
+    Purpose:	 IO of ABI sequences
+    Last update: Mon May 28 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqABI(char *fn);
+/*
+    Read the ABI format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSII C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+#endif  /*_seqIOABI_h*/
diff --git a/src/newted/seqIOALF.c b/src/newted/seqIOALF.c
new file mode 100644
index 0000000..f41d41a
--- /dev/null
+++ b/src/newted/seqIOALF.c
@@ -0,0 +1,339 @@
+/* 
+  Title:       seqIOALF
+  
+  File: 	 seqIOALF.c
+  Purpose:	 IO of ALF sequences
+  Last update: Tue Nov 10 1992
+  */
+
+/*
+  Change Log :- 
+  14.01.91 SD
+  when complimenting the sequence with an odd number of bases,
+  the middle base position was not adjusted.
+  15.01.91 SD  Put StLouis stuff on compilation flag
+  15.01.91 SD  New include file (opp.h)
+  02.08.91 SD  Changes the mapping of uncertainty codes so that we
+  now only generate A C G T and -
+  Previously... bug in interpreting ALF integer fields.
+  We now treat them as unsigned.
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  25.10.91 SD  Machine independant I/O...removed BIGENDIAN flag
+  25.11.91 SD There was a hard limit (of 1024) for allocation of
+  space for number of bases, yet program would 
+  read in more if there were any, causing nasties to happen.
+
+    11.11.92 LFW added section to actually check that the trace it
+             is trying to open is an ALF file using traceType sub
+
+    10.11.92 SD  SCF comments now stored in seq data structure
+  
+  */
+
+/* RMD I made substantial changes to this file 12/28/90 so as to
+   read sequence data more freely (necessary when reading data from
+   multiple trace files).
+   The affected area is indicated by comments starting RMD, like
+   this one.
+   */
+
+/* This file was adapted by LFW from seqIOABI.c.
+   
+   At the moment, the `maxTraceVal' of the sequence is hardwired
+   as 1200.  This fudge worked for the ABI, but is not really good
+   for the ALF.  To keep compatibility we keep 1200 as a max and
+   scale to fit (search for scaleFactor).  This needs work.
+   
+   The ALF results file is a concatenation of many files with an
+   index structure at the beginning, consisting of a 512 byte
+   block that we ignore, followed by 128 byte blocks describing
+   each file.  All files, including the header region, are rounded 
+   up to a multiple of 512 bytes long.  
+   The getIndexEntry routines identify the 128 byte index component
+   of interest by matching 4 chars of its ASCII label, then extract
+   the field of choice from that entry.
+   
+   Note that the SUN and PC are of opposite endian-ness, so that
+   we have to provide special routines to read words and longwords
+   from the results file.  Luckily the floating point numbers are
+   written out in ASCII.
+   
+   All references to the seq->bottom, the bottom strand of the
+   sequence, were added by lfw.
+   */
+
+
+/* ---- Imports ---- */
+
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include "traceType.h"
+#include "mach-io.h"
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+#define IndexEntryLength ((off_t)128)
+
+
+/*
+  Here are some labels we will be looking for, four chars packed
+  into a long word.
+  */
+#define EntryLabel        ((uint_4) ((((('A'<<8)+'L')<<8)+'F')<<8)+' ')
+#define BaseEntryLabel    ((uint_4) ((((('S'<<8)+'e')<<8)+'q')<<8)+'u')
+#define DataEntryLabel    ((uint_4) ((((('P'<<8)+'r')<<8)+'o')<<8)+'c')
+
+
+/* ---- Internal functions ---- */
+
+static Boolean getIndexEntryLW(FILE *fp, off_t indexO,
+			       uint_4 label, int lw,
+			       uint_4 *val)
+/*
+  From the ALF results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th long word
+  from the entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) 
+		return(-1);
+	    
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	} while (!(entryLabel == label));
+    
+    for(i=2; i<lw; i++)
+	if (!be_read_int_4(fp, val))  return(-1);
+    
+    
+    /* when i = lw read in the 4 bytes backwards */
+    if (!le_read_int_4(fp,val)) return(-1);
+    
+    return(True);
+}
+
+static Boolean getIndexEntryW(FILE *fp, off_t indexO,
+			      uint_4 label, int lw,
+			      uint_2 *val)
+/*
+  From the ALF results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th  word (int2)
+  from the entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel;
+    uint_4 jval;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) return(-1);
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	} while (!(entryLabel == label));
+    
+    
+    for(i=2; i<lw; i++)
+	if (!be_read_int_4(fp, &jval)) return(-1);
+    if (!le_read_int_2(fp, val)) return(-1);
+    
+    return(True);
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqALF(char *fn)
+/*
+  Read the ABI format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.  If bottom 
+  is True, we read the sequence in in reverse and complemented 
+  A NULL result indicates failure.
+  
+  In this module all references to the bottom strand were 
+  inserted by lfw.
+  */
+{   Seq seq = NULLSeq;
+    int i;
+    FILE *fp;
+    uint_4 numPoints;
+    
+    uint_4 data_size;
+    uint_4 dataO;
+    uint_4 header_size=396; /* size of the header of the processed data
+			     section */
+    uint_2 actBaseDataSize; /* actual number of bytes of data of information
+			     containing the base and basePos information */
+    int num_points; /* keeps track of the actual number of points, rather
+		       than the early guess of numPoints */
+    
+    /* RMD 12/28/90 changed scaleFactor from 7 to 20 -
+       should read this from file, or calculate it */
+    
+    off_t indexO;   /* File offset where the index is */
+    uint_4 baseO;    /* File offset where the bases are stored */
+    
+    
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    
+    /* RMD lots of changes below here until end of data reading section
+       Some are cosmetic - I introduced the label abort: at the end of
+       the routine and sent all error stuff there.  
+       getIndexEntry calls in front of where they were needed, and made
+       There is a substantive change to the inner loop of the sequence
+       reading section.  This now uses fscanf - much less rigid than the
+       previous scheme.  Note that it reads bp as a float.  This is because
+       it is a float in multiple trace data files! (bizarre Pharmacia
+       programming!).
+       */
+    
+    
+    /* find offsets and length of sequence and data segments */
+    
+    indexO = 522;      /* indexO is the offset of the index */
+    /* or I could look for the first label, starting 'ALF'
+       if I used 512 then none of the entries are on long 
+       word boundaries */
+    
+    if (! ( getIndexEntryLW(fp,indexO,BaseEntryLabel,12,&baseO)  ))
+	/* offset in file of first base of sequence */
+	goto abort ;
+    
+    if (! ( getIndexEntryW(fp,indexO,BaseEntryLabel,10,&actBaseDataSize)  ))
+	/* actual size of region containing this data */
+	goto abort ;
+    
+    if (! ( getIndexEntryLW(fp,indexO,DataEntryLabel,12,&dataO)))
+	/* offset in file to start of processed data segment - there 
+	   is then a header of size header_size (currently 396) */
+	goto abort ;
+    
+    if (! ( getIndexEntryLW(fp,indexO,DataEntryLabel,10,&data_size)  ))
+	/* actual size of region containing this data */
+	goto abort ;
+    
+    numPoints = (int)((data_size - header_size)/ 8); 
+    /* because each trace value is stored in a 2 byte
+       integer, thus to store A C G T information
+       it takes 8 bytes.  So subtract off the header and
+       divide by 8 */
+    
+    /* RMD make enough space for 1024 bases - hard limit */
+#define BASELIMIT 1024
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, numPoints, BASELIMIT)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    seq->mode        = BasesAndTraces;
+    seq->format      = ALFFormat;
+    seq->dirty       = False;
+    seq->maxTraceVal = 0;
+    seq->NPoints     = numPoints;
+    seq->bottom	     = False;
+    
+    /* RMD set ->NorigBases and ->NedBases after reading them in */
+    
+    /* read in the sequence */
+    
+    if (!(fseek(fp, (off_t)baseO, 0) == 0))
+	goto abort ;
+    
+    {	/* new locals introduced by LFW and/or RMD for the ALF */
+	int numBases;	/* number of nucleotides read in */
+	float bp ;
+	char ch;
+	
+	
+	for (numBases = 0 ; ftell(fp) < baseO+(unsigned short)actBaseDataSize  && numBases<BASELIMIT; ) {
+	    char line[200];
+	    fgets(line,(int)sizeof(line),fp);
+	    sscanf (line,"%c %*d %f", &ch, &bp) ;
+	    
+	    /* we convert ch to Staden format here */
+	    switch (ch) {
+	    case 'A':
+	    case 'C':
+	    case 'G':
+	    case 'T':
+		break;
+	    default:
+		if (isupper(ch))
+		    ch = '-';
+		else
+		    ch = '\0';
+	    }
+	    
+	    if (ch) {
+		seq->base[numBases]    = ch;
+		seq->basePos[numBases] = bp;
+		++numBases ;
+		
+	    }
+	}
+	
+	seq->NorigBases  = numBases;
+	seq->NedBases    = numBases;
+    }
+    
+    /* read in the traces , stored in 2 byte integers in records in the
+       order A C G T A C G T A C G T ...*/
+    
+    if (fseek(fp, (off_t)(dataO+header_size), 0) != 0) 
+	goto abort ;
+    
+    num_points = 0;
+    
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!le_read_int_2(fp, &(seq->traceA[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceA[i]) seq->maxTraceVal = seq->traceA[i];
+	    if (!le_read_int_2(fp, &(seq->traceC[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceC[i]) seq->maxTraceVal = seq->traceC[i];
+	    if (!le_read_int_2(fp, &(seq->traceG[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceG[i]) seq->maxTraceVal = seq->traceG[i];
+	    if (!le_read_int_2(fp, &(seq->traceT[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceT[i]) seq->maxTraceVal = seq->traceT[i];
+	    
+	    if (seq->traceA[i]==0 && seq->traceT[i]==0 && seq->traceC[i]==0 && seq->traceG[i]==0 && i > (numPoints-64)) break;
+	    
+	    num_points++;
+	}
+    
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+    
+ abort:
+    fclose(fp);
+    freeSeq(seq);
+    return(NULLSeq);
+}
diff --git a/src/newted/seqIOALF.h b/src/newted/seqIOALF.h
new file mode 100644
index 0000000..28b7388
--- /dev/null
+++ b/src/newted/seqIOALF.h
@@ -0,0 +1,40 @@
+#ifndef _seqIOALF_h
+#define _seqIOALF_h
+
+
+/* 
+    Title:       seqIOALF
+
+    File: 	 seqIOALF.h
+    Purpose:	 IO of ABI sequences
+    Last update: Tues Dec 11 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqALF(char *fn);
+/*
+    Read the ABI format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSII C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+
+#endif  /*_seqIOALF_h*/
diff --git a/src/newted/seqIOEdit.c b/src/newted/seqIOEdit.c
new file mode 100644
index 0000000..632c3f9
--- /dev/null
+++ b/src/newted/seqIOEdit.c
@@ -0,0 +1,1313 @@
+/*
+  Title:  seqIOEdit
+
+  File:   seqIOEdit.c
+  Purpose: IO of the editted portion of plain or edited sequences 
+
+  Last update: Monday 24 February 1992
+
+  Change log :-
+  15.01.91 SD  New include file (opp.h)
+  04.12.91 lfw added sample lanes up to 40 instead of 24
+  04.12.91 changed the way the left cutoff is found...now I allow
+  look for first occurrence of the left cutting sequence;
+  looking first for an exact match, then a match with
+  one mismatch, then with two...if nothing is found at
+  that point I assume the left cutoff is not there
+  24.02.92 SD Fixed bug in findRightCutoff() when checking for overlap with
+  leftCutoff. There was some confusion over what the value of
+  rightCutoff actually means.
+  18.11.92 lfw changed the names of the temporary files to .abc* instead
+  of abc*, also used the remove() command rather than system() to get
+  rid of the files and added another remove call so they will be sure to
+  be removed
+  
+  
+  */
+
+
+/* ---- Imports ---- */
+#include <stdlib.h>
+#include "seqIOEdit.h"
+
+#include "seq.h"/* IMPORT: Seq, BasesAndTraces, NULLSeq,
+		   newSeq, freeSeq */
+#include "opp.h"       /* IMPORT: oppInitialise */
+#include "dialogues.h"
+#include "match.h"
+
+
+
+
+#ifdef QUAL_CODE
+/* global definitions quality cutoffs for this file */
+#define SIDEBAND_CUTOFF 0.8
+#define NONCALLED_OVER_CALLED_CUTOFF 0.25
+#define OVERALL_TRACE_QUAL_CUTOFF 0.27
+#define STEP_SIZE 8
+#define LAST_ALLOWED_BASE 400
+
+
+
+/* definition of internal functions found below */
+int findRightQualCutoff(Seq seq, int num_bases);
+void findLeftQualCutoff(Seq seq, int *start_point);
+void SeqQual_sideband(Seq seq1);
+void SeqQual_nonCalledOverCalled(Seq seq1);
+
+
+#endif /*QUAL_CODE*/
+
+
+
+
+
+/* ---- Internal Functions ---- */
+
+int findPercntAmbig(char *theSeq, int num_bases);
+
+static void text_to_output(char *vec,int stp,int endp,int dvice,char *outfile)
+
+/*
+ * text_to_output(vec,stp,endp,dvice,outfile)
+ * input: char **vec,*outfile; int stp, endp, dvice;
+ * this program output a specified portion of a genbank file
+ * (from vec[stp] to vec[endp]) to the screen (default), a file (dvice
+ * =1), or lpr (dvice =2).  Outfile is the input filename if you wish to
+ * output the information to a specified file. stp and endp are integers,
+ *  not pointers.
+ */
+
+{
+    int i;
+    FILE *fopen(),*fp,*where;
+    
+    
+    if (dvice == 1) {
+	if ((fp = fopen(outfile,"a"))==NULL) {
+	    printf ("\nERROR: can't open file %s\n",outfile);
+	    return;
+	}
+	else where = fp;
+    }
+    else if (dvice == 2) {
+	if ((fp = fopen("junkfile.","a"))==NULL) {
+	    printf ("\nERROR: can't open file junkfile. to output to the lpr\n");
+	    return;
+	}
+	else where = fp;
+    }
+    else {
+	/*  default : */
+	where = stdout;
+    }
+    
+    for (i = stp; i < endp; i++)
+	putc(vec[i],where);
+    
+    if ((dvice == 1) || (dvice == 2)) fclose(fp);
+    
+}
+
+int checkForExistingEdFile(char *fn)
+/*
+ * check to see if there is an existing .seq.n file;
+ * return the largest n  or 0 if no files existed,
+ * return a -1 if it was a problem with opening files
+ */
+
+{
+    char vec[500];
+    int last_ed_num;
+    FILE *fp;
+    int i;
+    
+    /* make sure there are no files with the names I'm about to use*/
+    remove (".abcxyztmpsh.");
+    remove (".abcxyztmpout.");
+    
+    
+    /* write a little shell to see the last n in your_filename.n
+       in the current directory */
+    if ((fp=fopen(".abcxyztmpsh.", "w")) == NULL) return(-1);
+    fclose(fp);
+    sprintf(vec,"for i in 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 \n");
+    i = strlen(vec);
+    sprintf(vec+i,"  do\n   oldname=%s.$i\n   if test -f $oldname\n   then  echo $i\n",fn);
+    i = strlen(vec);
+    sprintf(vec+i,"   fi\ndone\n");
+    
+    text_to_output(vec,0,strlen(vec),1,".abcxyztmpsh.");
+    
+    /* execute the shell and have it output the last number it found into
+       a file called .abcxyztmpout. */
+    system("sh .abcxyztmpsh. | tail -1 > .abcxyztmpout.");
+    
+    /* read that number from that file */
+    
+    if ((fp=fopen(".abcxyztmpout.", "r")) == NULL) return(-1);
+    vec[0]=getc(fp);
+    fclose(fp);
+    /* if that number was EOF return(0), nothing found */
+    if (vec[0]==EOF) {
+      remove (".abcxyztmpsh.");
+      remove (".abcxyztmpout.");
+
+      return(0);
+    }
+    
+    vec[1]='\0';
+    
+    sscanf(vec,"%d",&last_ed_num);
+    remove (".abcxyztmpsh.");
+    remove (".abcxyztmpout.");
+    
+    return(last_ed_num);
+}
+
+Boolean isDotSeq(char *fn)
+/* make sure there is a .seq on  the end of fn, puts one on
+   if there is not */
+{
+    int i;
+    
+    i = strlen(fn)-1;
+    if (fn[i]!='q' || fn[i-1]!='e' || fn[i-2]!='s' || fn[i-3]!='.') 
+	return(False);
+    else return(True);  
+}
+
+void stripDotSeq(char *fn)
+/* there is a .seq on  the end of fn, strip it off */
+
+{ int i;
+  
+  i = strlen(fn)-1;
+  
+  if (fn[i]=='q' && fn[i-1]=='e' && fn[i-2]=='s' && fn[i-3]=='.') 
+      fn[i-3]='\0';
+  
+  return;
+}
+
+void stripDotNum(char *fn)
+/* if there is a .num on  the end of fn, strip it off */
+
+{ int i;
+  
+  i = strlen(fn)-1;
+  
+  if (isdigit(fn[i]) && fn[i-1]=='.') 
+      fn[i-1]='\0';
+  else if (isdigit(fn[i]) && isdigit(fn[i-1]) && fn[i-2]=='.')   
+      fn[i-2]='\0';
+  
+  return;
+}
+
+int  isDotNum(char *fn)
+/*
+ * checks if there is a .1 or .2 or .m on the inputfilename.
+ * If there is, then returns that num.  If not returns -1.
+ */
+{
+    int i,j;
+    int dotnum;
+    char *atemp;
+    
+    atemp = (char *)calloc(20,sizeof(char));
+    
+    i = strlen(fn)-1;
+    j = 0;
+    
+    if (isdigit(fn[i]) && (fn[i-1]=='.')) {
+	atemp[0] = fn[i];
+	atemp[1] = fn[i+1]; 
+	sscanf(atemp,"%d",&dotnum);
+	free(atemp);
+	return(dotnum);
+    }
+    else if (isdigit(fn[i]) && isdigit(fn[i-1]) && fn[i-2]=='.')   {
+	atemp[0] = fn[i-1];
+	atemp[1] = fn[i];
+	atemp[2] = '\0';
+	sscanf(atemp,"%d",&dotnum);
+	free(atemp);
+	return(dotnum);
+    }
+    
+    free(atemp);
+    return(-1);
+}
+
+static void get_compl_seq(char *ac_seq,char *aseq,int stp,int endp, int seq_len, int rev)
+
+/*
+ * uses aseq to find the sequence that
+ * would appear on the opposite strand and places
+ * that sequence in ac_seq; stp is the starting point on
+ * aseq and endp is the ending point on aseq;
+ * seq_len is the length of the input sequence,
+ * if rev==1 reverses as well as complements, if rev==0
+ * only complements
+ */
+{
+    int i;
+    
+    oppInitialize();
+    
+    if (rev == 1) {
+	for (i = stp; i <= endp; i++)
+	    ac_seq[seq_len - i] = opp[aseq[i]];
+    }
+    else {
+	for (i = stp; i <= endp; i++)
+	    ac_seq[i] = opp[aseq[i]];
+	ac_seq[i-1] ='\0';
+    }
+    
+    
+    return;
+}
+
+
+/* ---- Externals ---- */
+
+Boolean writeEdSeq(Seq seq, char *fn)
+
+{
+    FILE *fp;
+    int i,j;
+    char ed_fn[200];
+    int last_ed_num; /* largest n of fn.seq.n in
+			current directory */
+    
+    oppInitialize();
+    
+    /* make sure the filename does not have a .seq on the end */
+    stripDotSeq(fn);
+    
+    /* get the n to put on inputfilename.n for the edited 
+       file to be kept */
+    last_ed_num = checkForExistingEdFile(fn);
+    
+    if (last_ed_num == -1) return(False);
+    else sprintf(ed_fn,"%s.%d",fn,last_ed_num+1);
+    /* that line takes care of 0 too, because
+       it names the file fn.seq.1 */
+    
+    /* Open for writing, text */
+    if ((fp=fopen(ed_fn, "w")) == NULL) return(False);
+    
+    
+    /* write information in the following format:
+       NedBases*edits array*edBase array*edBasePos array*
+       each division ends in an * and within
+       each division entries are separated by spaces */
+    
+    if (seq->bottom) {
+	fprintf(fp," %6d*%3d*%6d*%6d*",seq->NorigBases,seq->NedBases,seq->rightCutoff,seq->leftCutoff);
+    }
+    else {
+	fprintf(fp," %6d*%3d*%6d*%6d*",seq->NorigBases,seq->NedBases,seq->leftCutoff,seq->rightCutoff);
+    }
+    
+    /* print out edits array */
+    
+    if (seq->bottom) {
+	for (i=seq->NorigBases+MaxEdits-1;
+	     i > -1;
+	     i--)
+	    { 
+		j = 0;
+		if (i==seq->NorigBases+MaxEdits-1) 
+		    /* this part is a fudge to stick in 0 0 */
+		    fprintf(fp,"%6d %6d ",j,j);
+		
+		
+		if (((seq->edits[i]!=0)  && ((seq->NedBases -1- i)>=0)) || i==0){
+		    if (seq->edits[i] <0)
+			fprintf(fp,"%6d %6d ",seq->NedBases -1- i,seq->edits[i]);
+		    else
+			fprintf(fp,"%6d %6d ",seq->NedBases -1 -i,seq->NorigBases - 1-(seq->edits[i]));   
+		}
+	    }
+    }
+    else {
+	for (i=0;
+	     i<seq->NorigBases+MaxEdits;
+	     i++) {
+	    
+	    if ((seq->edits[i]!=0)  || (i==0))
+		fprintf(fp,"%6d %6d ",i,seq->edits[i]);
+	}
+    }
+    
+    fprintf(fp,"%6d %6d ",NULLPoint,NULLPoint);
+    
+    fprintf(fp,"*");
+    
+    /* print out non-NULL entries in edBase array */
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{
+	    if (seq->edBase[i] == NULL) {
+		fprintf(fp,"* ");
+		break;
+	    }
+	    else {
+		if (seq->bottom) 
+		    fprintf(fp,"%c ",opp[seq->edBase[i]]);
+		else
+		    fprintf(fp,"%c ",seq->edBase[i]);
+	    }
+	}
+    
+    fprintf(fp,"*");
+    
+    /* print out non-NULL entries in edBasePos array */
+    
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{
+	    
+	    if (seq->edBasePos[i] == NULLPoint)  {
+		fprintf(fp,"%6d ",NULLPoint); /* -1 is NULLPoint*/
+		break;
+	    }
+	    else {
+		int fudge;
+		/*
+		 * when you're plotting the strand in the reverse
+		 * order, you must move the starting position over by
+		 * the width of one character.  Because positions in the
+		 * other file, already take into account the character
+		 * width. Therefore, fudge = ~character width + basePos
+		 * of the first base
+		 */
+		
+		fudge = seq->basePos[0] + 6;
+		
+		if (seq->bottom) {
+		    /*	   fprintf(fp,"%6d ",seq->basePos[seq->NorigBases-1]-seq->edBasePos[i]+fudge);	  */
+		    fprintf(fp,"%6d ",seq->NPoints-seq->edBasePos[i]);
+		}
+		else
+		    fprintf(fp,"%6d ",seq->edBasePos[i]);
+	    }
+	}
+    
+    fprintf(fp,"*");
+    (void) fclose(fp);
+    return(True);
+}
+
+
+
+Boolean readEdSeq(Seq seq, char *fn, int dotnum)
+/*
+ * reads in the most recent fn.seq.n file. This
+ * file should be of the form:
+ * basePos;edBasePos;edBase
+ * returns True if it read in the sequence from
+ * an editted file, and False if there was no
+ * editted file or if there was a problem 
+ * 
+ * dotnum == -1 if the user did not specify a
+ * version number of the sequence to read in,
+ * if they want version m read in.
+ */
+
+{
+    int last_ed_num;
+    char ed_fn[200];
+    FILE *fp;
+    
+    oppInitialize();
+    
+    /* make sure the filename does not have a .seq on the end */
+    stripDotSeq(fn);
+    
+    if (dotnum == -1) {
+	/* get the n to put on fn.seq.n */
+	last_ed_num = checkForExistingEdFile(fn);
+	if (last_ed_num == -1) return(False);
+	else if (last_ed_num == 0) return(False);
+	else sprintf(ed_fn,"%s.%d",fn,last_ed_num);
+    }
+    else
+	sprintf(ed_fn,"%s.%d",fn,dotnum);
+    
+    
+    /* Open for reading, text */
+    if ((fp=fopen(ed_fn, "r")) == NULL) return(False);
+    fclose(fp);
+    
+    if (processEdSeqFile(seq,ed_fn))
+	return(True);
+    else return(False);
+}
+
+Boolean processEdSeqFile(Seq seq, char *fn)
+
+/*
+ * processes a char vector containing the Editted
+ * sequence in the following format:
+ *
+ * NedBases*NorigBases*leftCutoff*rightCutoff*negative
+ * components of edits array in the form (position,
+ * negative number)*non NULL components edBase array*
+ * non NULL components edBasePos array*
+ *
+ * each division ends in an H. The end of the
+ * edits array is signaled by a -1 -1 entry. Within
+ * each division entries are separated by spaces
+ * 
+ * sticks the information it finds into the seq array
+ * 
+ * returns false if there was a problem with the file format
+ */
+{
+    FILE *fp;
+    int nbases;
+    char achar;
+    int i,j,k;
+    char ed_fn[200];
+    
+    strcpy(ed_fn,fn);
+    
+    if ((fp=fopen(ed_fn, "r")) == NULL) return(False);
+    
+    fscanf(fp," %6d",&nbases);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the number of bases)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    fscanf(fp,"%3d",&i);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the number of edited bases)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    seq->NedBases = i;
+    if (nbases != seq->NorigBases) {
+	printf("ERROR: Input editted sequence was of wrong format\n (Number of editted bases has changed)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    fscanf(fp,"%6d",&j);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the left cutoff)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    fscanf(fp,"%6d",&k);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the right cutoff)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    if (seq->bottom) {
+	seq->leftCutoff = k;
+	seq->rightCutoff = j;
+    }
+    else {
+	seq->leftCutoff = j;
+	seq->rightCutoff = k;
+    }
+    
+    /* read in the seq->edits array */
+    
+    for (i=1;
+	 i<seq->NorigBases+MaxEdits;
+	 i++)
+	{ 
+	    fscanf(fp,"%6d %6d ",&j,&k);
+	    if (j==NULLPoint) break;
+	    else {
+		if (seq->bottom) {
+		    if (k<0) /* then it is a reference to the edBase array */
+			seq->edits[seq->NedBases-1-j]=k;
+		    else /*it is a base number which needs to be converted
+			   to the base number on the opposite strand */
+			seq->edits[seq->NedBases-1-j]=seq->NorigBases-1-k;
+		}
+		else
+		    seq->edits[j]=k;
+	    }
+	    
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(Error in the edits array)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    /* read in the seq->edBase array */  
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{  
+	    fscanf(fp,"%c ",&achar);
+	    if (achar == '*') break;
+	    else {
+		if (seq->bottom) 
+		    seq->edBase[i]=opp[achar];
+		else
+		    seq->edBase[i]=achar;
+	    }
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("Input editted sequence was of wrong format\n(Extraneous information after edBases and before the *\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    /* read in the seq->edBasePos array */  
+    
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{ 
+	    fscanf(fp,"%6d ",&k);
+	    if (k==NULLPoint) break;
+	    else {
+		int fudge;
+		
+		/*
+		 * when you're plotting the strand in the reverse
+		 * order, you must move the starting position over by
+		 * the width of one character.  Because positions in the
+		 * ABI File,  already take into account the character
+		 * width. Therefore, fudge = ~character width + offset of
+		 * the first peaks position
+		 */
+		
+		fudge = seq->basePos[0] + 6;
+		
+		if (seq->bottom)  {
+		    /*	  seq->edBasePos[i]=seq->basePos[seq->NorigBases-1]-k+fudge;*/
+		    seq->edBasePos[i]=seq->NPoints - k;
+		}
+		else
+		    seq->edBasePos[i]=k;
+	    }
+	    
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format\n(Error in editted Base Position array)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    /*
+     * Don't set the seq to Dirty, otherwise the user won't know
+     * if they have or have not edited their input edited sequence
+     */
+    fclose(fp);
+    return(True);
+}
+
+
+
+
+int findLeftCutoff(Seq seq, char *enzInString)
+/*
+ * looks for left cutoff, if it doesn't find a "enzInString", then
+ * it looks from enzInString less it's last character, etc
+ */
+
+{
+    int maxStartPos=100; /* if the enzyme site wasn't found before this
+			    baseNum, then that's probably not the cloning
+			    site */
+    int i,j,found;
+    /* int jj,kk; */
+    int indices[100];
+    int num_matches;
+    char *theSeq;
+    int num_bases;
+    char enzString[100];
+    /* char enztemp[100]; */
+#ifdef QUAL_CODE
+    int cut_point; 
+#endif /*QUAL_CODE*/
+    
+    found = 0;
+    
+    if (seq->bottom)
+	get_compl_seq(enzString,enzInString,0,strlen(enzInString),strlen(enzInString),0);
+    else
+	strcpy(enzString,enzInString);
+    
+    
+    num_bases = getNBases(seq,EdBases);
+    theSeq = (char *)calloc(num_bases,sizeof(char));
+    
+    j = 0;
+    if (seq->bottom) {
+	for (i = num_bases-1; i >= 0; i--){
+	    theSeq[i] = getBase(seq, EdBases, j);
+	    j++;
+	}
+    }
+    else {
+	for (i = 0; i < num_bases; i++)
+	    theSeq[i] = getBase(seq, EdBases, i);
+    }
+    
+    
+    
+    /* look for first occurrence of enzString; 
+       just look a match with at most i mismatches, starting
+       with 0 mismatches down to two*/
+    for (i=0; i<3; i++) {
+	num_matches=string_match(enzString,strlen(enzString),theSeq,num_bases,i,indices);
+	if (num_matches > 0) 
+	    if (indices[0] < maxStartPos) {
+		found = 1;
+		break;
+	    }
+    }
+    
+    free(theSeq);
+
+    if (found && indices[0])
+	return(indices[0] + strlen(enzString));
+#ifdef QUAL_CODE
+    else {
+	cut_point=0;
+	/*
+	 * make sure there are not a bunch of Ns from the ABI
+	 * primer problem at the start of this sequence...move the
+	 * left cutoff past all of those Ns
+	 */
+	for (i=0; i<seq->NorigBases; i++)
+	    if (seq->base[i]!='N' && seq->base[i]!='-') {cut_point=i; break;}
+    }
+
+    findLeftQualCutoff(seq,&cut_point);
+
+    return(cut_point);
+#else /*QUAL_CODE*/
+    else return(0);
+#endif /*QUAL_CODE*/
+    
+}
+
+
+
+
+
+
+#ifdef QUAL_CODE
+
+
+int findRightCutoff(Seq seq)
+{
+    int num_bases;
+    int rightCutoff;
+    
+    num_bases = getNBases(seq,EdBases);
+    
+    rightCutoff = findRightQualCutoff(seq,num_bases);
+    
+    /* added so that the left and right cutoffs do not overlap */
+    if (rightCutoff > num_bases - seq->leftCutoff) rightCutoff=num_bases - seq->leftCutoff;
+    
+    return(rightCutoff);
+}
+#else /*QUAL_CODE*/
+int findRightCutoff(Seq seq)
+{
+    /* give the %age cutoff a default but let it be user
+       specifiable on the command line ? */
+    /* 
+     * ways to look for ends of sequence 
+     * 1. runs of nucleotides or dinucleotides
+     * -- but rick says there are lots of runs of
+     * A's and T's in what they're sequencing
+     * 2. percentage of N's
+     * 3. automatically drop down to baseNum 600 to even start
+     * looking for a cutoff
+     */
+    
+    int num_bases;
+    char *theSeq;
+    int i,j;
+    int rightCutoff;
+    
+    num_bases = getNBases(seq,EdBases);
+    theSeq = (char *)calloc(num_bases,sizeof(char));
+    
+    j = 0;
+    if (seq->bottom) {
+	for (i = num_bases-1; i >= 0; i--){
+	    theSeq[i] = getBase(seq, EdBases, j);
+	    j++;
+	}
+    }
+    else {
+	for (i = 0; i < num_bases; i++)
+	    theSeq[i] = getBase(seq, EdBases, i);
+    }
+    
+    rightCutoff = findPercntAmbig(theSeq,num_bases);
+    
+    /* added so that the left and right cutoffs do not overlap */
+    if (rightCutoff > num_bases - seq->leftCutoff) rightCutoff=num_bases - seq->leftCutoff;
+    
+    free(theSeq);
+    return(rightCutoff);
+}
+#endif /*QUAL_CODE*/
+
+
+int findPercntAmbig(char *theSeq, int num_bases)
+
+{
+    int i,j;
+    int isN[256];
+    int totalN = 0;
+    int numN;
+    int nucWindow;
+    
+    numN = 2;
+    nucWindow = 5;
+    
+    for (i = 0; i <= 256; i++) isN[i]=0;
+    isN['n']=1;
+    isN['N']=1;
+    isN['-']=1;
+    
+    
+    
+    /* 
+     * start at base num 200 and look for numN Ns within
+     * a window of nucWindow nucleotides, once you find that
+     * second N send back the indices of that second
+     * N as the cutoff line
+     */
+    
+    for (i = 200; i < num_bases; i++) {
+	totalN = 0;
+	for (j = 0; j < nucWindow; j++) {
+	    if (isN[theSeq[i+j]]) totalN++; 
+	    if (totalN == numN) 	return(num_bases - (i+j));
+	}
+    }
+    return(0);
+    
+}
+
+
+#ifdef QUAL_CODE
+
+/*
+  Title:       seqQual
+  
+  File:        seqQual.c
+  Purpose:     Sequence Quality calculation module
+  Last update: May 1992
+  
+  Change log:
+  
+  */
+
+
+
+
+
+static int one_half_forwards(Seq seq, int base)
+/*
+ * Returns the position half way between base and the following base.
+ */
+{
+    int pos;
+
+
+    if ((base+1) < seq->NorigBases) {
+        pos = (seq->basePos[base]+seq->basePos[base+1]) / 2;
+    } else {
+        /*
+         * Last base is a special case. We should guestimate.
+         * 
+         * guess 1: pos = bp[N] + (bp[N] - bp[N-1])/2
+         *
+         * if pos > NPoints
+         *    guess 2: pos = NPoints-1
+         * 
+         */
+        pos = seq->basePos[base] +
+	    (seq->basePos[base] - seq->basePos[base-1])/2;
+        if (pos >= seq->NPoints) pos = seq->NPoints-1;
+    }
+
+    return pos;
+
+}
+
+
+static int one_half_backwards(Seq seq, int base)
+/*
+ * Returns the position half way between base and the precedingbase.
+ */
+{
+    int pos;
+
+
+    if (base > 0) {
+        pos = (seq->basePos[base]+seq->basePos[base-1]) / 2;
+    } else {
+        /*
+         * Last base is a special case. We should guestimate.
+         * 
+         * guess 1: pos = bp[N] - (bp[N+1] - bp[N])/2
+	 *
+	 * if pos < 0
+	 *    guess 2: pos = 0
+	 * 
+	 */
+        pos = seq->basePos[base] -
+            (seq->basePos[base+1] - seq->basePos[base])/2;
+        if (pos < 0) pos = 0;
+    }
+
+    return pos;
+
+}
+
+
+
+
+
+
+
+
+/*
+ * 1) when they ask to find
+ * the left cutoff go ahead and just
+ * calculate max_non_called over called measure to
+ * see whether or not to throw the trace away entirely
+ *
+ * 1.5)find the left cutoff first and use that for starting your
+ * hunt for the right cutoff.
+ *
+ * 2) go out and calculated the side band ratio and find the cutoffs
+ *
+ * 3) last calculate any quality measures base by base which you
+ * actually may want for writing out to the sequence file or whatever
+ */
+
+
+int overallTraceQual(Seq seq)
+/*
+ * returns a one if the overall trace quality was good enough to
+ * warrant keeping the trace....0 if the trace should be thrown away 
+ *
+ * Seq is the trace structure
+ * for step size...you are looking at values for the quality
+ * measure each STEP_SIZE-th base
+ */
+
+{ int j;
+  int num_good=0;		/* count of number of consecutive bases having
+				   a quality index value better than the cutoff */
+  int num_consecutive_good=20;  /* 24 being about 200 bases of
+				   good trace since values are
+				   read every 8 bases */
+  int num_problems;		/* count of number of bases in this run having
+				   a quality index above the cutoff */
+  int num_problems_allowed=4;	/* number of values above the 
+				   cutoff allowed in the span
+				   of num_consecutive_good */
+  float cutoff=OVERALL_TRACE_QUAL_CUTOFF;  /* cutoff for the value of max_non_called over called */
+  int last_problem;
+  
+  SeqQual_nonCalledOverCalled(seq);
+  
+  for (j=0; j< seq->NorigBases; j+=STEP_SIZE) {
+      /*     printf("overall seq->qual[%d] is %4.3f\n",j,seq->qualIndex[j]);*/
+      if (seq->qualIndex[j]<cutoff) num_good++;
+      else {
+	  num_problems++;
+	  if (num_problems==1) last_problem=j;
+	  if (num_problems>num_problems_allowed) {num_good=0; num_problems=0; j=last_problem+1;}
+      }
+      /*
+       * make sure you  hit num_consecutive_good in a row and that you are not
+       * out past LAST_ALLOWED_BASE when you do hit it
+       */
+      if (num_good==num_consecutive_good) return(1);
+  }
+  
+  return(0);
+}
+
+int findRightQualCutoff(Seq seq, int num_bases)
+{ int i,j;
+  /*int num_good=0;*/ /* count of number of consecutive bases having
+                         a quality index value better than the cutoff */
+  int num_problems;		/* count of number of bases in this run having
+				   a quality index above the cutoff */
+  int num_allowed_problems=4;	/* number of values above the 
+				   cutoff allowed before setting
+				   the right cutoff */
+  float cutoff=SIDEBAND_CUTOFF;  
+  /* cutoff for the value of this side-band-ratio */
+  int rightCutoff;
+  int first_problem_base;
+  
+  
+  /* go calculate the quality measure */
+  SeqQual_sideband(seq);
+  
+  
+  /* step through all of the bases in STEP_SIZE increments */
+  
+  for (j=seq->leftCutoff; j<seq->NorigBases; j+=STEP_SIZE) {
+      /*    printf("seq->qualIndex[%d] is %5.3f\n",j,seq->qualIndex[j]);*/
+      
+      /* if the quality index exceeds the cutoff ... count it as a problem area*/
+      if (seq->qualIndex[j]>cutoff)  {
+	  num_problems++;
+	  if (num_problems==1) first_problem_base=j;
+      }
+      else num_problems=0;
+      
+      
+      
+      /*    printf("num_problems is %d\n",num_problems);*/
+      
+      /*
+       * if we have reached the num_allowed_problems over
+       * consecutive bases...then go ahead and assign 
+       * the right cutoff to the point where the start
+       * of the problem bases was found
+       */
+      if (num_problems==num_allowed_problems )  {
+	  rightCutoff=j-num_problems*STEP_SIZE;
+      }
+  }
+  
+  if (num_problems<num_allowed_problems) {
+      rightCutoff=LAST_ALLOWED_BASE;
+      if (rightCutoff>num_bases) rightCutoff=num_bases;
+  }
+  
+  /*  printf("rightcutoff is %d\n",rightCutoff);*/
+  
+  /*
+   * now go check the other quality measure from this cutoff
+   * base backwards....checking that we are not exceeding
+   * that rule...or could just take the more conservative
+   * of the two estimates....except then we're sometimes
+   * actually finding the left cutoff
+   */
+  
+  SeqQual_nonCalledOverCalled(seq);
+  
+  for (i=rightCutoff; i>seq->leftCutoff; i-=8) {
+      /*	printf("other seq->qualIndex[%d] is %5.3f\n",i,seq->qualIndex[i]);*/
+      /* If two consecutive regions are good using the noncalled over called
+	 cutoff, then go ahead and set the right cutoff there */
+      if (seq->qualIndex[i]<NONCALLED_OVER_CALLED_CUTOFF && seq->qualIndex[i-STEP_SIZE]<NONCALLED_OVER_CALLED_CUTOFF) break;
+  }
+  rightCutoff=i;
+  
+  
+  
+  /* ABSOLUTE CUTOFF IS LAST_ALLOWED_BASE */
+  if (rightCutoff>LAST_ALLOWED_BASE)  rightCutoff=LAST_ALLOWED_BASE;
+  
+  
+  /*  printf("rightcutoff is %d\n",rightCutoff);  */
+  
+  /*
+   * remember that the right cutoff in the trace structure is not
+   * the base position of the right cutoff...rather it is num_bases
+   * minus that position
+   */
+  
+  rightCutoff=num_bases-rightCutoff;
+  
+  return(rightCutoff);
+}
+
+void findLeftQualCutoff(Seq seq,int *start_point)
+{ int j;
+  /*
+   * go calculate the quality measure ... this will take care
+   * of calculating it both for the the left and the right
+   */
+  
+  /* had already called this for the overall trace quality */
+  /*  SeqQual_nonCalledOverCalled(seq);*/
+  
+  /*
+   * start looking at start_point+STEP_SIZE because you really do not
+   * want to look at what comes before the cutoff.  You only care
+   * about what is after the cutoff...so in essence you are making
+   * your window centered on the left foot of the rectangle window rather
+   * than the center
+   */
+  
+  for (j=*start_point+STEP_SIZE; j< seq->NorigBases; j+=STEP_SIZE) {
+      /*    printf("seq->qualIndex[%d] is %4.3f\n",j,seq->qualIndex[j]);*/
+      if (seq->qualIndex[j]<NONCALLED_OVER_CALLED_CUTOFF) {
+	  *start_point=j-STEP_SIZE;
+	  return;
+      }
+  }
+  
+  *start_point=j;
+  return;
+  
+}  
+
+
+/*
+ * MODULE     SeqQual17  - SeqQual_sideband
+ *
+ *  for the called base, take the ratio of the value of that
+ *  trace 1/2 of the way between this base and the next base
+ *  over the value of the trace at its peak...
+ *  compare that to the ratio of the value of that trace 1/2 or
+ *  the way between this base and the previous base 
+ *  over the value of the trace at its peak...take the worst ratio
+ *  and average that over the 8 bases before and after this base
+ *
+ */
+
+void SeqQual_sideband(Seq seq1)
+{
+    int i;
+    int pos;
+    int one_half_for; 
+    int one_half_back;
+    int half_window_size=8; /* 8 */
+    int start_sum,end_sum;
+    int j;
+    /* int one_half_pos; */
+    float forward,backward;
+    
+    for (i=0; i<seq1->NorigBases; i++)
+	seq1->qualIndex[i] = 0.0;
+    seq1->qualType=17;
+    
+    
+    for (j=0; j< seq1->NorigBases; j++) {
+	
+	end_sum=j+half_window_size;
+	start_sum=j-half_window_size;
+	
+	
+	if (end_sum>=seq1->NorigBases) end_sum=(seq1->NorigBases)-2;
+	if (start_sum<0) start_sum=1;
+	
+	for (i=start_sum; i<=end_sum; i++) {
+
+            /*
+	    one_half_for=(int)((seq1->basePos)[i]+((seq1->basePos)[i+1] - (seq1->basePos[i]))/2);
+	    one_half_back=(int)((seq1->basePos)[i]-((seq1->basePos)[i] - (seq1->basePos[i-1]))/2);
+	    
+	    */
+	    one_half_for = one_half_forwards(seq1,i);
+	    one_half_back = one_half_backwards(seq1,i);
+
+	    pos = (seq1->basePos)[i];
+	    
+	    switch ((seq1->base)[i]) {
+	    case 'A':
+	    case 'a':
+		forward=(float)seq1->traceA[one_half_for]/(float)seq1->traceA[pos];
+		backward=(float)seq1->traceA[one_half_back]/(float)seq1->traceA[pos];
+		if (forward>backward)
+		    seq1->qualIndex[j] += forward;
+		else
+		    seq1->qualIndex[j] += backward;
+		break;
+	    case 'C':
+	    case 'c':
+		forward=(float)seq1->traceC[one_half_for]/(float)seq1->traceC[pos];
+		backward=(float)seq1->traceC[one_half_back]/(float)seq1->traceC[pos];
+		if (forward>backward)
+		    seq1->qualIndex[j] += forward;
+		else
+		    seq1->qualIndex[j] += backward;
+		
+		break;
+	    case 'G':
+	    case 'g':
+		forward=(float)seq1->traceG[one_half_for]/(float)seq1->traceG[pos];
+		backward=(float)seq1->traceG[one_half_back]/(float)seq1->traceG[pos];
+		if (forward>backward)
+		    seq1->qualIndex[j] += forward;
+		else
+		    seq1->qualIndex[j] += backward;
+		
+		break;
+	    case 'T':
+	    case 't':
+		forward=(float)seq1->traceT[one_half_for]/(float)seq1->traceT[pos];
+		backward=(float)seq1->traceT[one_half_back]/(float)seq1->traceT[pos];
+		if (forward>backward)
+		    seq1->qualIndex[j] += forward;
+		else
+		    seq1->qualIndex[j] += backward;
+		
+		break;
+	    default:
+		(seq1->qualIndex)[j] += 1.0;
+		break;
+	    }
+	}
+	/*    seq1->qualIndex[j] = seq1->qualIndex[j]/(half_window_size*2+1);*/
+	seq1->qualIndex[j] = seq1->qualIndex[j]/(end_sum-start_sum+1);
+    } 
+    
+    seq1->qualType = 17;	/* identify quality index as type 16 */
+}
+
+
+/*
+ * MODULE    SeqQual15    SeqQual_nonCalledOverCalled
+ *
+ *
+ * center the window at base N, look to either side of base N,
+ * by window_size bases/2
+ *
+ * area of the called divided by the area of the max non-called
+ * for range from base N-window_size/2 to N+window_size/2
+ * between this base and the next base
+ *
+ * So find the area for each base....divide area of the called
+ * by other max area
+ */
+
+float max_area (  TRACE *tx, TRACE *ty, TRACE *tz , int stp, int endp);
+float get_area( TRACE *trace, int startp, int endp);
+
+void  SeqQual_nonCalledOverCalled(Seq seq1)
+{
+    int i,j;
+    int pos,start_pos,end_pos;
+    int half_window_size=8;  /* look at three bases on either side */
+    float max_area();
+    float get_area();
+    int end_sum,start_sum;
+    
+    for (i=0; i<seq1->NorigBases; i++)
+	seq1->qualIndex[i] = 0.0;
+    seq1->qualType=15;
+    
+    for (j=0; j< seq1->NorigBases; j++) {
+	
+	end_sum=j+half_window_size;
+	start_sum=j-half_window_size;
+	
+	if (end_sum>=seq1->NorigBases) end_sum=(seq1->NorigBases)-1;
+	if (start_sum<0) start_sum=0;
+	
+	for (i=start_sum; i<=end_sum; i++) {
+	    pos = (seq1->basePos)[i];
+	    
+	    /*
+	    start_pos=(seq1->basePos)[i]-((seq1->basePos)[i] - (seq1->basePos[i-1]))/2;
+	    end_pos=(seq1->basePos)[i]+((seq1->basePos)[i+1] - (seq1->basePos[i]))/2;
+	    */
+	    start_pos = one_half_backwards(seq1,i);
+	    end_pos = one_half_forwards(seq1,i);
+	    
+	    
+	    switch ((seq1->base)[i]) {
+	    case 'A':
+	    case 'a':
+		seq1->qualIndex[j] +=
+		    (max_area (seq1->traceC, seq1->traceG, seq1->traceT,start_pos,end_pos)/
+		     get_area(seq1->traceA,start_pos,end_pos));
+		break;
+	    case 'C':
+	    case 'c':
+		seq1->qualIndex[j] +=
+		    (max_area(seq1->traceA, seq1->traceG, seq1->traceT,start_pos,end_pos) /
+		     get_area(seq1->traceC,start_pos,end_pos));
+		break;
+	    case 'G':
+	    case 'g':
+		seq1->qualIndex[j] +=
+		    (max_area(seq1->traceC, seq1->traceA,seq1->traceT,start_pos,end_pos) /
+		     get_area(seq1->traceG,start_pos,end_pos));
+		break;
+	    case 'T':
+	    case 't':
+		seq1->qualIndex[j] +=
+		    (max_area(seq1->traceC, seq1->traceG, seq1->traceA,start_pos,end_pos) /
+		     get_area(seq1->traceT,start_pos,end_pos));
+		break;
+	    default:
+		(seq1->qualIndex)[j] += 1.0;
+		break;
+	    }
+	}
+	/*            seq1->qualIndex[j] = seq1->qualIndex[j]/(half_window_size*2+1);*/
+	seq1->qualIndex[j] = seq1->qualIndex[j]/(end_sum-start_sum+1);
+    }
+    
+    seq1->qualType = 15;	/* identify quality index as type 15 */
+}
+
+float get_area(TRACE*trace, int startp, int endp)
+{ int i;
+  float sum=0;
+  
+  for (i=startp; i<endp; i++)
+      sum += trace[i];
+  
+  return(sum);
+}
+
+
+float max_area (  TRACE *tx, TRACE *ty, TRACE *tz , int stp, int endp)
+{
+    float x,y,z,max;
+    x = get_area(tx,stp,endp);
+    y = get_area(ty,stp,endp);
+    z = get_area(tz,stp,endp);
+    
+    if (x > y) {
+	if (z > x) {
+	    max = z;
+	} else {
+	    max = x;
+	}
+    } else {
+	if (z > y) {
+	    max = z;
+	} else {
+	    max = y;
+	}
+    }
+
+    return max;
+}
+
+#endif /*QUAL_CODE*/
diff --git a/src/newted/seqIOEdit.h b/src/newted/seqIOEdit.h
new file mode 100644
index 0000000..9442d60
--- /dev/null
+++ b/src/newted/seqIOEdit.h
@@ -0,0 +1,115 @@
+#ifndef _seqIOEdit_h
+#define _seqIOEdit_h
+
+/*
+ Title:       seqIOEdit
+
+ File:        seqIOEdit.h
+ Purpose:     IO of edited sequences
+    Last update: Tue Jan 15 1991
+
+ Change log :-
+ 15.01.91 SD 
+    Removed parts required for complementation and put them in opp.h
+*/
+
+/* ---- Imports ---- */
+
+#include "seq.h"   /* IMPORT: Seq */
+#include <stdio.h>      /* IMPORT: fopen, fclose, fseek, ftell, fgetc */
+#include <ctype.h>      /* IMPORT: isprint */
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+/* ---- Private ---- */
+
+/* ---- Exports ---- */
+
+extern Boolean writeEdSeq(Seq seq, char *fn);
+/*
+    Write the clipped, edited part of the sequence `seq'
+    into file `fn'. The result indicates success.
+
+    (*==fn in the following description)
+
+    addition by lfw, 10/24/90,
+
+  1. writes *.out as text file without printing bases
+       before left and right cutoff, and without
+       printing base numbers (in this program)
+
+    2. write *.1 as text if seq.1 does not exist
+       write *.2 as text if seq.1 does exist
+       write *.3 as text if seq.2 does exist
+       ...
+       write *.n as text if seq.(n-1) does exist
+       ( in writeEdSeq in seqIOEdit.c )
+
+
+*/
+
+extern Boolean readEdSeq(Seq seq, String fn, int dotnum);
+/*reads in the most recent fn.seq.n file. This
+  file should be of the form:
+    basePos;edBasePos;edBase
+  returns True if it read in the sequence from
+   an editted file, and False if there was no
+   editted file or if there was a problem 
+
+   dotnum == -1 if the user did not specify a
+   version number of the sequence to read in,
+   m if they want version m read in. */
+
+
+
+extern Boolean processEdSeqFile(Seq seq, String fn);
+
+/* processes a char vector containing the Editted
+   sequence in the following format:
+
+       NedBases*NorigBases*leftCutoff*rightCutoff*negative 
+       components of edits array in the form (position,
+       negative number)*non NULL componenets edBase array*
+       non NULL components edBasePos array*
+
+       each division ends in an H. The end of the
+       edits array is signaled by a -1 -1 entry. Within
+       each division entries are separated by spaces
+
+   sticks the information it finds into the seq array
+
+   returns false if there was a problem with the file format */
+
+extern Boolean isDotSeq(char *fn);
+
+/* checks to see if the filename already has the .seq
+   suffix or not */
+
+
+extern int isDotNum(char *fn);
+/* checks if there is a .1 or .2 or .m on the inputfilename.
+If there is, then returns that num.  If not returns -1.
+   */
+
+extern void stripDotNum(char *fn);
+/* if there is a .num (indicating that the user has input
+   an edited version of the sequence they wish to have
+   read in) on  the end of fn, strip it off */
+
+extern int findLeftCutoff(Seq seq, char *enzString);
+/* if the sequence has not previously been edited, this
+subroutine is used for auto-clipping */
+
+
+extern int findRightCutoff(Seq seq);
+/* if the sequence has not previously been edited, this
+subroutine is used for auto-clipping */
+
+
+#endif  /*_seqIOEdit_h*/
+
+
+
+
+
diff --git a/src/newted/seqIOPlain.c b/src/newted/seqIOPlain.c
new file mode 100644
index 0000000..e1b8c12
--- /dev/null
+++ b/src/newted/seqIOPlain.c
@@ -0,0 +1,120 @@
+/* 
+  Title:       seqIOPlain
+  
+  File: 	 seqIOPlain.c
+  Purpose:	 IO of plain sequences
+  Last update: Tuesday Jan 15 1991
+  
+  Change log:
+  
+  28.11.90 SD  put undesirables under STLOUIS compilation flag
+  15.01.91 SD  new include file (opp.h)
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  */
+
+
+/*
+  This module should be regarded as part of `seq' since it is
+  privy to the internal structure of `Seq'.
+  
+  Any references to the writing or reading of edited sequences,
+  or to the bottom strand were added by lfw
+  */
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seqIOPlain.h"
+#include "seq.h"        /* IMPORT: Seq, BasesOnly, NULLSeq,
+			   newSeq, freeSeq */
+#include "seqIOEdit.h" /* IMPORT: readEdSeq, writeEdSeq */
+
+#include <stdio.h>      /* IMPORT: fopen, fclose, fseek, ftell, fgetc */
+#include <ctype.h>      /* IMPORT: isprint */
+
+
+
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqPlain(char *fn)
+/*
+  Read the plain format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.
+  A NULLSeq result indicates failure.
+  */
+{   Seq seq = NULLSeq;
+    FILE *fp;
+    off_t fileLen;
+    int  ch;
+    int i;
+    int wait_for_eol=0; /* used for looking for end of lines in the header */
+
+    if ((fp = fopen(fn, "r")) == NULL) return(NULLSeq);
+    
+    /*
+      Find the length of the file.
+      Use this as an overestimate of the length of the sequence.
+      */
+    fseek(fp, (off_t) 0, 2);
+    if ((fileLen = ftell(fp)) > MaxInt2)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    fseek(fp, (off_t) 0, 0);
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesOnly, 0, fileLen)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    seq->mode   = BasesOnly;
+    seq->format = PlainFormat;
+    seq->dirty  = False;
+    seq->bottom = False;
+    
+    /* Read in the bases */
+    
+    seq->NorigBases = 0;
+    while ((ch = fgetc(fp)) != EOF) {
+        if (ch==';' || wait_for_eol) {
+	    /* if you find a semicolon assume that line is a comment
+	       and stop reading sequence until you have reached the
+	       end of line */
+	    wait_for_eol=1;
+	    if (ch=='\n') wait_for_eol=0;
+        }
+        else if (isprint(ch))
+	    {   seq->base[seq->NorigBases] = (ch=='N') ? '-' : ch;
+		seq->NorigBases++;
+	    }
+    }
+    
+    
+    seq->NedBases = seq->NorigBases;
+    (void) fclose(fp);
+    
+    
+    /*
+      Build a dummy basePos table such that the position of the
+      base is equal to its place in the sequence * 10.
+      */
+    seq->NPoints = (seq->NorigBases)*10;
+    for (i=0; i < seq->NorigBases; i++) seq->basePos[i] = i*10;
+
+    return(seq);
+}
diff --git a/src/newted/seqIOPlain.h b/src/newted/seqIOPlain.h
new file mode 100644
index 0000000..feb513e
--- /dev/null
+++ b/src/newted/seqIOPlain.h
@@ -0,0 +1,45 @@
+#ifndef _seqIOPlain_h
+#define _seqIOPlain_h
+
+
+/* 
+    Title:       seqIOPlain
+
+    File: 	 seqIOPlain.h
+    Purpose:	 IO of plain sequences
+    Last update: Mon May 28 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqPlain(char *fn);
+/*
+    Read the plain format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSI C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is used to automatically determine the left
+    cutoff for a given sequence -- only determined if this
+    is the first time this sequence has been read in.  If 
+    the sequence has already been edited, the program uses
+    the previous settings for left and right cutoffs
+
+*/
+
+
+#endif  /*_seqIOPlain_h*/
diff --git a/src/newted/seqIOSCF.c b/src/newted/seqIOSCF.c
new file mode 100644
index 0000000..cee7d13
--- /dev/null
+++ b/src/newted/seqIOSCF.c
@@ -0,0 +1,286 @@
+/* 
+  Title:       seqIOSCF
+  
+  File: 	 seqIOSCF.c
+  Purpose:	 IO of Standard Chromatogram Format sequences
+  Last update: Tue Nov 10 1992
+  
+  Change log:
+  4 Feb 1992,  Now draft proposal version 2
+  20 Feb 1992, Grab info from comment lines
+  19 Aug 1992, If SCF file has clip information, don't clip automatically
+  10 Nov 1992  SCF comments now stored in seq data structure
+  
+*/
+
+
+
+
+
+/* ---- Imports ---- */
+
+#include "scf.h"      /* SCF structures */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include <stdlib.h>
+#include "mach-io.h"
+
+
+/* ---- Internal functions ---- */
+
+
+static int read_scf_header(FILE *fp, Header *h)
+{
+    int i;
+
+    if (be_read_int_4(fp,&h->magic_number)==False) return (False);
+    if (be_read_int_4(fp,&h->samples)==False) return (False);
+    if (be_read_int_4(fp,&h->samples_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->bases)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_left_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_right_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_size)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_offset)==False) return (False);
+    if (fread(&h->version[0],sizeof(h->version),1,fp)!=1) return (False);
+    if (be_read_int_4(fp,&h->sample_size)==False) return (False);
+    if (be_read_int_4(fp,&h->code_set)==False) return (False);
+    for (i=0;i<20;i++)
+	if (be_read_int_4(fp,&h->spare[i])==False) return (False);
+    
+    return (True);
+}
+
+
+static int read_scf_sample1(FILE *fp, Samples1 *s)
+{
+    if (be_read_int_1(fp,&s->sample_A)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_C)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_G)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_T)==False) return (False);
+    
+    return (True);
+}
+
+
+static int read_scf_sample2(FILE *fp, Samples2 *s)
+{
+    if (be_read_int_2(fp,&s->sample_A)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_C)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_G)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_T)==False) return (False);
+    
+    return (True);
+}
+
+
+
+
+
+static int read_scf_base(FILE *fp, Bases *b)
+{
+    if (be_read_int_4(fp,&b->peak_index)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_A)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_C)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_G)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_T)==False) return (False);
+    if (be_read_int_1(fp,(byte *)&b->base)==False) return (False);
+    if (be_read_int_1(fp,&b->spare[0])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[1])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[2])==False) return (False);
+    
+    return (True);
+}
+
+
+
+static float parse_version(char version[])
+/*
+ *
+ */
+{
+    char v[5];
+    strncpy(v,version,4);v[4]='\0';
+    if (strspn(v,"0123456789. ")!=4) return 0.0;
+    return atof(v);
+
+}
+
+
+
+/* ---- Exports ---- */
+
+int is_SCF(char *fn)
+/*
+ * Check to see if file with name `fn' is in SCF format
+ * 
+ * Returns:
+ * 1 - is SCF format
+ * 0 - not SCF format
+ */
+{
+    FILE *fp;
+    uint_4 magic;
+    int ok;
+    
+    if ( (fp=fopen(fn,"rb")) == NULL) {
+	ok = 0;
+    } else {
+	if ( be_read_int_4(fp, &magic) != 1 ) {
+	    ok = 0;
+	} else {
+	    ok = (magic==SCF_MAGIC);
+	}
+	fclose(fp);
+    }
+    
+    return ok;
+}
+
+
+
+
+
+
+
+
+Seq readSeqSCF(char *fn)
+/*
+ * Read the SCF format sequence with name `fn' into `seq'.
+ * If bottom is True, we read the sequence in in reverse and
+ * complemented A NULL result indicates failure.
+ */
+{   Seq seq = NULLSeq;
+    FILE *fp;
+    Header header;
+    int i;
+    
+    /* Open for: reading, binary */
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    /* Read header */
+    if (fseek(fp, (off_t)0, 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    if (read_scf_header(fp,&header)==False) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, header.samples, header.bases)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    
+    seq->mode         = BasesAndTraces;
+    seq->format       = SCFFormat;
+    seq->dirty        = False;
+    seq->maxTraceVal  = 0;
+    seq->NPoints      = header.samples;
+    seq->NorigBases   = header.bases;
+    seq->NedBases     = header.bases;
+    seq->leftCutoff   = header.bases_left_clip;
+    seq->rightCutoff  = header.bases_right_clip;
+    seq->bottom       = False;
+    
+    /* set precision if necessary */
+    if (parse_version(header.version)<VERSION) {
+	/* old 8-bit format */
+	header.sample_size = 1;
+    }
+    
+    /* read sample points */
+    if (fseek(fp, (off_t)(header.samples_offset), 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    for(i=0;i<seq->NPoints;i++) {
+	switch ( header.sample_size ) {
+	case 2:
+	    {
+		Samples2 sample;
+		if (read_scf_sample2(fp,&sample)==False) {
+		    fclose(fp);
+		    return(NULLSeq);
+		}
+		seq->traceA[i] = sample.sample_A;
+		seq->traceC[i] = sample.sample_C;
+		seq->traceG[i] = sample.sample_G;
+		seq->traceT[i] = sample.sample_T;
+		break;
+	    }
+	default:
+	    {
+		Samples1 sample;
+		if (read_scf_sample1(fp,&sample)==False) {
+		    fclose(fp);
+		    return(NULLSeq);
+		}
+		seq->traceA[i] = sample.sample_A;
+		seq->traceC[i] = sample.sample_C;
+		seq->traceG[i] = sample.sample_G;
+		seq->traceT[i] = sample.sample_T;
+		break;
+	    }
+	}
+	if (seq->maxTraceVal < seq->traceA[i]) seq->maxTraceVal = seq->traceA[i];
+	if (seq->maxTraceVal < seq->traceC[i]) seq->maxTraceVal = seq->traceC[i];
+	if (seq->maxTraceVal < seq->traceG[i]) seq->maxTraceVal = seq->traceG[i];
+	if (seq->maxTraceVal < seq->traceT[i]) seq->maxTraceVal = seq->traceT[i];
+    }
+    
+    
+    /* Read bases */
+    if (fseek(fp, (off_t)(header.bases_offset), 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    for(i=0;i<seq->NorigBases;i++) {
+	Bases base;
+	if (read_scf_base(fp,&base)==False) {
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	seq->base[i] = base.base;
+	seq->basePos[i] = base.peak_index;
+    }
+    
+    
+    /* Read selected comment information  */
+    {
+	char *comments;
+	
+	comments = (char *) malloc(header.comments_size+1);
+	
+	if (fseek(fp,(off_t)(header.comments_offset), 0) != 0) {
+	    free(comments);
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	if (fread(comments,header.comments_size,1,fp)==0) {
+	    free(comments);
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	comments[header.comments_size] = '\0';
+	
+	seq->info = comments;
+	
+    }
+    
+    
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+}
diff --git a/src/newted/seqIOSCF.h b/src/newted/seqIOSCF.h
new file mode 100644
index 0000000..88c2e20
--- /dev/null
+++ b/src/newted/seqIOSCF.h
@@ -0,0 +1,50 @@
+#ifndef _seqIOSCF_h
+#define _seqIOSCF_h
+
+
+/* 
+    Title:       seqIOSCF
+
+    File: 	 seqIOSCF.h
+    Purpose:	 IO of ABI sequences
+    Last update: Tues Dec 11 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+extern Seq readSeqSCF(char *fn);
+/*
+    Read the SCF format sequence with name `fn' into `seq'.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+
+
+
+
+
+extern int is_SCF(char *fn);
+/*
+    Check to see if file with name `fn' is in SCF format
+
+    Returns:
+	1 - is SCF format
+	0 - not SCF format
+*/
+
+#endif  /*_seqIOSCF_h*/
diff --git a/src/newted/seqOutput.c b/src/newted/seqOutput.c
new file mode 100644
index 0000000..931c7b2
--- /dev/null
+++ b/src/newted/seqOutput.c
@@ -0,0 +1,230 @@
+/* 
+    Title:       seqOutput
+
+    File: 	 seqOutput.c
+    Purpose:	 Output of sequences
+    Last update: Monday April 8 1991
+    Change log:
+        27/11/90 SD     writeSeqABI() outputs header to sequence file:
+                        format: ;{noOfBases}{leftCutOff}{basesWritten}{type}{tra
+cefile}
+                        eg:     ;   867    45    383ABI a09b7.s1RES
+        28.11.90 SD  put undesirables under STLOUIS compilation flag
+        11.12.90 SD  new static function tail to find file name in path name
+	15.01.91 SD  new include file (opp.h)
+	08.04.91 SD  header information now written in write_header()
+	17.09.91 LFW changed STLOUIS flag to SAVE_EDITS to more accurately
+	             reflect the flag
+	07.10.91 SD  Removed width of file field in write_header()
+	26.11.91 SD  Added SCF format on output
+
+*/
+
+
+/*
+    This module should be regarded as part of `seq' since it is
+    privy to the internal structure of `Seq'.
+
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seqIOEdit.h"
+#include "opp.h"      /* IMPORT: oppInitialise */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			         newSeq, freeSeq */
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+		                 EOF */
+/*#include <string.h>*/
+
+
+
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+/* ---- Private Functions ---- */
+static char *tail (char *pathname)
+{
+    char *a;
+    if ((a = (char *) strrchr(pathname,'/'))==NULL)
+	a = pathname;
+    else
+	a++;
+    return a;
+}
+
+static void write_header(Seq seq, char *seqName, FILE *fp)
+/*
+Write out a header with the sequence.
+The header consists of a line describing the cut off sequence, plus the
+actual discarded sequences. The format is:
+
+    ;header description
+    ;<left cutoff sequence (possibly several lines)
+    ;>right cutoff sequence (possibly several lines)
+
+*/
+{
+    int baseNum,lineLen;
+
+
+    if (seq->bottom) {
+	/* this is a complemented sequence */
+	fprintf (fp,";%6d%6d%6d%-4s%s\n",
+		    seq->NedBases,
+		    seq->rightCutoff,
+		    seq->NedBases - seq->rightCutoff - seq->leftCutoff,
+		    (seq->format==ABIFormat)?"ABI":
+		    (seq->format==ALFFormat)?"ALF":
+		    (seq->format==PlainFormat)?"PLN":
+		    (seq->format==SCFFormat)?"SCF":
+		    "   ",
+		    tail(seqName));
+
+	opp['N']='-';
+
+	/* output left cut off */
+	lineLen = 0;
+	for (baseNum=seq->NedBases-1;
+	    baseNum>=seq->NedBases-seq->rightCutoff;
+	    baseNum--) {
+	    if (! lineLen) fprintf(fp, ";<");
+	    fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+	/* output right cut off */
+	lineLen = 0;
+	for (baseNum=seq->leftCutoff-1;
+	    baseNum>=0;
+	    baseNum--) {
+	    if (! lineLen) fprintf(fp, ";>");
+	    fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+
+    } else {
+	/* this is a sequence in its normal orientation */
+	fprintf (fp,";%6d%6d%6d%-4s%s\n",
+		    seq->NedBases,
+		    seq->leftCutoff,
+		    seq->NedBases - seq->rightCutoff - seq->leftCutoff,
+		    (seq->format==ABIFormat)?"ABI":
+		    (seq->format==ALFFormat)?"ALF":
+		    (seq->format==PlainFormat)?"PLN":
+		    (seq->format==SCFFormat)?"SCF":
+		    "   ",
+		    tail(seqName));
+
+	opp['N']='-';
+
+	/* output left cut off */
+	lineLen = 0;
+	for (baseNum=0;
+	    baseNum<seq->leftCutoff;
+	    baseNum++) {
+	    if (! lineLen) fprintf(fp, ";<");
+	    fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+	/* output right cut off */
+	lineLen = 0;
+	for (baseNum=seq->NedBases-seq->rightCutoff;
+	    baseNum<seq->NedBases;
+	    baseNum++) {
+	    if (! lineLen) fprintf(fp, ";>");
+	    fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+    }
+
+}
+
+
+
+
+Boolean writeSeq(Seq seq, char *fn, char *seqName, Boolean includeHeader)
+/*
+    Write the clipped, edited part of the ABI format sequence `seq'
+    into file `fn'. The result indicates success.
+    Currently, this just writes the bases out as text.
+*/
+
+{   FILE *fp;
+    int baseNum, lineLen;
+
+    /* initialize the complement array */
+    oppInitialize();
+
+    /* Open for writing, text */
+    if ((fp=fopen(fn, "w")) == NULL) return(False);
+
+    lineLen = 0;
+
+    /* write header */
+    if (includeHeader) write_header(seq,seqName,fp);
+
+    if (seq->bottom) { /* this if seq-> bottom section
+			  added by lfw */
+      opp['N']='-';
+
+      for (baseNum=(seq->NedBases-seq->rightCutoff)-1;
+	   baseNum>=seq->leftCutoff;
+	   baseNum--)
+	{   fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine)
+	      {   fputc('\n', fp);
+		  lineLen = 0;
+		}
+	  }
+    }
+    else {
+      for (baseNum=seq->leftCutoff;
+	   baseNum<seq->NedBases-seq->rightCutoff;
+	   baseNum++)
+	{   fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine)
+	      {   fputc('\n', fp);
+		  lineLen = 0;
+		}
+	  }
+    }
+
+    if (lineLen != 0) fputc('\n', fp);
+
+    (void) fclose(fp);
+
+#ifdef SAVE_EDITS
+    if (writeEdSeq(seq,seqName)) return(True);
+    else return(False);
+#else
+    return(True);
+#endif
+}
+
+
diff --git a/src/newted/seqOutput.h b/src/newted/seqOutput.h
new file mode 100644
index 0000000..79f3d18
--- /dev/null
+++ b/src/newted/seqOutput.h
@@ -0,0 +1,18 @@
+#ifndef _seqOutput_h
+#define _seqOutput_h
+/* 
+    Title:       seqOutput
+
+    File: 	 seqOutput.h
+    Purpose:	 Output of sequences
+
+*/
+
+extern Boolean writeSeq(Seq seq, char *fn, char *seqName,Boolean includeHeader);
+/*
+    Write the clipped, edited part of the ABI format sequence `seq'
+    into file `fn'. `seqName' is for sequence header information.
+    The result indicates success.
+    Currently, this just writes the bases out as text.
+*/
+#endif  /*_seqOutput_h*/
diff --git a/src/newted/seqRead.c b/src/newted/seqRead.c
new file mode 100644
index 0000000..7c984f2
--- /dev/null
+++ b/src/newted/seqRead.c
@@ -0,0 +1,161 @@
+#include <stdio.h>
+#include "seq.h"
+#include "opp.h"
+#include "seqIOABI.h"   /* IMPORT: readSeqABI   */
+#include "seqIOALF.h"   /* IMPORT: readSeqALF   */
+#include "seqIOSCF.h"   /* IMPORT: readSeqSCF   */
+#include "seqIOPlain.h"   /* IMPORT: readSeqPlain   */
+#include "traceDisplay.h" /* IMPORT: setScaleFactor */
+#include "traceType.h"
+
+
+
+Seq readSeq(char *fn, int bottom, char *enzString, char *format)
+{
+    Seq seq;
+#ifdef SAVE_EDITS
+    int dotnum=-1;
+#endif /*SAVE_EDITS*/
+    /* the number of the edited sequence the
+       user wishes to read in.  It's a -1 if the
+       user wishes to read the most recent sequence */
+    int found_edited_sequence=0; /* indicates if there was a correct .n file to
+				    read when the program went to read 
+				    the edited sequence file */
+    int trace_type;
+
+
+    trace_type = determine_trace_type(fn);
+    
+    if (trace_type <= TT_UNK) {
+
+	    
+#ifdef SAVE_EDITS
+	/* 920917 this chunk of code was moved by lfw
+	   so that ted first tries to open up the filename
+	   as given by the user...if that is not a trace
+	   file then it goes ahead and strips off the dotnum
+	   and tries to open that */
+	    
+	/* check to see if the file the user asked to read in is
+	   a .1 or .2 or .m; if it is they must have wanted to
+	   read in a specific edited sequence */
+	    
+	dotnum = isDotNum(fn);
+	if (dotnum != -1) stripDotNum(fn);
+	trace_type = determine_trace_type(fn);
+	if (trace_type <= TT_UNK) return(NULLSeq);
+#else
+	return(NULLSeq);
+#endif
+    }
+
+    /*
+     * Make a quick check
+     */
+    if (strcmp(format,trace_types[trace_type]))
+	fprintf(stderr,"** trace is not of format specified - looks like %s\n",trace_types[trace_type]);
+
+    /*
+     * read in the sequence
+     */
+    switch(trace_type) {
+    case TT_PLN:
+	seq = readSeqPlain(fn); break;
+    case TT_ABI:
+	seq = readSeqABI(fn); break;
+    case TT_ALF:
+	seq = readSeqALF(fn); break;
+    case TT_SCF:
+	seq = readSeqSCF(fn); break;
+    default: /* this should never happen */
+	return(NULLSeq);
+    }
+
+
+
+    /*
+     * further processing
+     */
+    if (seq != NULLSeq) {
+
+	int i;
+
+	/*  
+	  
+	  Go ahead and Initialise the edited sequence
+	  with orig bases, then go to read the Edited
+	  sequence to override those bases which have been
+	  changed
+	  
+	  */
+	
+	
+	/* All ed bases are the same are orig bases */
+	for (i=0;i<seq->NorigBases;i++) seq->edits[i] = i;
+	
+	/* The table of edits has all its entries free */
+	for (i=0;i<MaxEdits;i++) seq->edBasePos[i] = NULLPoint;
+	
+
+
+
+	found_edited_sequence=0;
+	
+#ifdef SAVE_EDITS
+	found_edited_sequence=readEdSeq(seq,fn,dotnum);
+#endif
+	
+#ifdef AUTO_CLIP
+	if (!found_edited_sequence &&
+	    seq->leftCutoff == 0 &&
+	    seq->rightCutoff == 0) {
+	    
+#ifdef QUAL_CODE
+#ifdef QUAL_CHECK
+	    /* check the overall trace quality...it will
+	       return a zero if the trace should be thrown away */
+	    if (overallTraceQual(seq)) {
+#endif /*QUAL_CHECK*/
+#endif /*QUAL_CODE*/
+	
+	    
+	    if (seq->bottom) {
+		seq->rightCutoff = findLeftCutoff(seq,enzString);
+		seq->leftCutoff = findRightCutoff(seq);
+	    }
+	    else {
+		seq->leftCutoff = findLeftCutoff(seq,enzString);
+		seq->rightCutoff = findRightCutoff(seq);
+	    }
+#ifdef QUAL_CODE
+#ifdef QUAL_CHECK
+	}
+	    else {
+		/* make the right cutoff equal to the left cutoff if
+		   you think the trace should be thrown away */
+		seq->rightCutoff=seq->NorigBases;
+		seq->leftCutoff=0;
+	    }
+#endif /*QUAL_CHECK*/
+#endif /*QUAL_CODE*/
+
+	    
+	}
+#endif /*AUTO_CLIP*/
+    }
+
+    /* initialize the complement array */
+    oppInitialize();
+
+    if (bottom) complement_seq(seq);
+
+    /* set default scale factor */
+    setScaleFactor(1.0);
+    return seq;
+
+}
+
+
+
+
diff --git a/src/newted/seqRead.h b/src/newted/seqRead.h
new file mode 100644
index 0000000..ff5d4a0
--- /dev/null
+++ b/src/newted/seqRead.h
@@ -0,0 +1,6 @@
+#ifndef _SEQREAD_H_
+#define _SEQREAD_H_
+
+extern Seq readSeq(char *fn, int bottom, char *enzString, char *format);
+
+#endif /*_SEQREAD_H_*/
diff --git a/src/newted/ted.c b/src/newted/ted.c
new file mode 100644
index 0000000..fcb3f9a
--- /dev/null
+++ b/src/newted/ted.c
@@ -0,0 +1,716 @@
+/*
+    Title: 	 ted
+
+    File: 	 ted.c
+    Purpose:	 Main module
+    Last update: 23 June 1992
+
+    Change log:
+
+        28.11.90 SD  put undesirables under STLOUIS compilation flag
+	22.07.91 LFW put in -enzyme as a command line option for STLOUIS
+	30.07.91 SD fixed bug in MRC switch
+	17.09.91 LFW changed MRC switch to !AUTO_CLIP and STLOUIS flag
+	         to DEF_OUT to more accurately reflect the use of the flags
+	26.11.91 SD  Added SCF format
+	23-Jun-92 SD Some ugly code in DEF_OUT option
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "dialogues.h" /* IMPORT: inputSeq, outputSeq, quitApplication */
+#include "display.h"   /* IMPORT: createDisplay */
+#include "help.h"      /* IMPORT: createHelpShell, help */
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+#include "seqIOEdit.h" /* IMPORT: isDotNum, stripDotNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "mystdlib.h"  /* IMPORT: exit */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+
+
+/* ---- Static variables ---- */
+
+
+static String fallback_resources[] =
+{   /* The application */
+    "Xted.minWidth:  550",
+    "Xted.minHeight: 250",
+
+    "Xted*magnif: 90",
+
+    /* Labels have no borders and are not resized */
+    "Xted*Label.borderWidth: 0",
+    "Xted*Label.top:         ChainTop",
+    "Xted*Label.bottom:      ChainTop",
+    "Xted*Label.left:        ChainLeft",
+    "Xted*Label.right:       ChainLeft",
+
+    /* Buttons are not resized */
+    "Xted*Command.top:         ChainTop",
+    "Xted*Command.bottom:      ChainTop",
+    "Xted*Command.left:        ChainLeft",
+    "Xted*Command.right:       ChainLeft",
+
+    /* All the toggles are one-of-many */
+    "Xted*Toggle.Translations: #override \\n\
+         <EnterWindow>:        highlight(Always) \\n\
+         <LeaveWindow>:        unhighlight()     \\n\
+         <Btn1Down>,<Btn1Up>: set() notify()",
+
+    "Xted.mainForm.title.label: Trace editor",
+    "Xted.mainForm.version.label: \
+     June 1991",
+    "Xted.mainForm.seqNamePrompt.label: Input:",
+    "Xted.mainForm.seqNamePrompt.height: 13",
+    "Xted.mainForm.seqName.label:       NONE",
+    "Xted.mainForm.seqName.resizable:   True",
+    "Xted.mainForm.seqName.height: 13",
+    "Xted.mainForm.NorigBasesPrompt.label: Bases:",
+    "Xted.mainForm.NorigBasesPrompt.height: 13",
+    "Xted.mainForm.NorigBases.label:       ",
+    "Xted.mainForm.NorigBases.resizable:   True",
+    "Xted.mainForm.strand.label: Strand: Top",
+    "Xted.mainForm.strand.resizable: True",
+
+
+    /* Display forms have no border */
+    "Xted.mainForm.Form.borderWidth:     0",
+
+     /* The edit mode toggles */
+    "Xted.mainForm.Toggle.top:    ChainTop",
+    "Xted.mainForm.Toggle.bottom: ChainTop",
+    "Xted.mainForm.Toggle.left:   ChainLeft",
+    "Xted.mainForm.Toggle.right:  ChainLeft",
+    "Xted.mainForm.edmodelab.label: Mode:",
+    "Xted.mainForm.adjustL.label:   Adj left cut",
+    "Xted.mainForm.edSeq.label:     Edit seq",
+    "Xted.mainForm.adjustR.label:   Adj right cut",
+    "Xted.mainForm.edSeq.state:     True", /* Default toggle */
+    "Xted.mainForm.adjustL.height:   13",
+    "Xted.mainForm.edSeq.height:     13",
+    "Xted.mainForm.adjustR.height:   13",
+
+    /* The magnification slider */
+    "Xted.mainForm.maglab.top:    ChainTop",
+    "Xted.mainForm.maglab.bottom: ChainTop",
+    "Xted.mainForm.maglab.left:   ChainLeft",
+    "Xted.mainForm.maglab.right:  ChainLeft",
+    "Xted.mainForm.maglab.label: Mag:",
+    "Xted.mainForm.maglab.height: 13",
+      
+    "Xted.mainForm.magscr.top:    ChainTop",
+    "Xted.mainForm.magscr.bottom: ChainTop",
+    "Xted.mainForm.magscr.left:   ChainLeft",
+    "Xted.mainForm.magscr.right:  ChainLeft",
+    "Xted.mainForm.magscr.orientation: horizontal",
+    "Xted.mainForm.magscr.length: 100",
+    "Xted.mainForm.magscr.height: 13",
+
+    "Xted.mainForm.scaleDown.label: Scale down",
+    "Xted.mainForm.scaleDown.height: 13",
+    "Xted.mainForm.scaleUp.label: Scale up",
+    "Xted.mainForm.scaleUp.height: 13",
+    "Xted.mainForm.switch.label: Switch",
+    "Xted.mainForm.switch.height: 13",
+
+    /* The main display: a viewport containing a form containing graphs */
+    "Xted.mainForm.viewport.top:    ChainTop",
+    "Xted.mainForm.viewport.bottom: ChainBottom",
+    "Xted.mainForm.viewport.left:   ChainLeft",
+    "Xted.mainForm.viewport.right:  ChainRight",
+    "Xted.mainForm.viewport.allowHoriz: True",
+    "Xted.mainForm.viewport.forcebars:  True",
+    "Xted.mainForm.viewport.width:      700",
+    "Xted.mainForm.viewport.vpForm.Graph.resizable: True",
+    "Xted.mainForm.viewport.vpForm.Graph.top:    ChainTop",  
+    "Xted.mainForm.viewport.vpForm.Graph.bottom: ChainTop",
+    "Xted.mainForm.viewport.vpForm.Graph.left:   ChainLeft",  
+    "Xted.mainForm.viewport.vpForm.Graph.right:  ChainRight",
+    "Xted.mainForm.viewport.vpForm.trace.top:    ChainTop",  
+    "Xted.mainForm.viewport.vpForm.trace.bottom: ChainBottom",
+    "Xted.mainForm.viewport.vpForm.Graph.font: -misc-*-bold-*15-*",
+    /* The text graph heights are dynamically set to accomodate the font */
+    "Xted.mainForm.viewport.vpForm.trace.height: 150",
+    /* The graphs are dynamically initially set to fill the viewport */
+    "Xted.mainForm.viewport.vpForm.Graph.dimBackground: grey",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour1:  green",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour2:  purple",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour3:  black",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour4:  red",
+
+    /* Dummy widget - we only care about its width */
+    "Xted.mainForm.dummy.top:    ChainTop",
+    "Xted.mainForm.dummy.bottom: ChainBottom",
+    "Xted.mainForm.dummy.left:   ChainLeft",
+    "Xted.mainForm.dummy.right:  ChainRight",
+    "Xted.mainForm.dummy.width:  700",
+
+
+    /* Dialogues */
+    "Xted*input.ioForm.ioPrompt.label:   Input a file",
+    "Xted*search.ioForm.ioPrompt.label:  Search for string or base number",
+    "Xted*output.ioForm.ioPrompt.label:  Save a file",
+    "Xted*check.ioForm.ioPrompt.label:   \
+The existing sequence has been altered.\\n\
+Do you really want to overwrite it?",
+    "Xted*checkwrite.ioForm.ioPrompt.label:   \
+This output file already exists.\\n\
+Do you really want to overwrite it?",
+    "Xted*quit.ioForm.ioPrompt.label:    \
+The existing sequence has been altered.\\n\
+Do you really want to quit?",
+    "Xted*check.ioForm..translations:   #override \\n\
+         <Key>Return: CRAction()",
+    "Xted*quit.ioForm..translations:    #override \\n\
+         <Key>Return: CRAction()",
+
+    "Xted*ioForm.namePrompt.label:       File name:",
+    "Xted*ioForm.nameVal.width:          200",
+    "Xted*ioForm.nameVal.translations:   #override \\n\
+         <Key>Return: CRAction()",
+    "Xted*ioForm.formatPrompt.label:     File format:",
+    "Xted*ioForm.formatBox.borderWidth:  0",
+    "Xted*ioForm.formatBox.orientation:  horizontal",
+    "Xted*ioForm.formatBox.plainFmt.label: Plain",
+    "Xted*ioForm.formatBox.abiFmt.label:   ABI",
+    "Xted*ioForm.formatBox.abiFmt.state:   True", /* Default toggle */
+    "Xted*ioForm.formatBox.alfFmt.label:   ALF",
+    "Xted*ioForm.formatBox.alfFmt.state:   False", 
+    "Xted*ioForm.formatBox.scfFmt.label:   SCF",
+    "Xted*ioForm.formatBox.scfFmt.state:   False", 
+    "Xted*ioForm.formatBox.bottomFmt.label: Bottom",
+    "Xted*ioForm.formatBox.topFmt.label:   Top",
+    "Xted*ioForm.formatBox.topFmt.state:   True", /* Default toggle */
+    /* The `default' button for the dialogue has a wider border */
+    "Xted*input.ioForm.OK.borderWidth:     3",
+    "Xted*output.ioForm.OK.borderWidth:    3",
+    "Xted*check.ioForm.Cancel.borderWidth: 3",
+    "Xted*quit.ioForm.Cancel.borderWidth:  3",
+
+    /* Help shell widget */
+    "Xted.Help.width:  525",
+    "Xted.Help.height: 200",
+    "Xted.Help.minWidth: 525",
+    "Xted.Help.maxWidth: 525",
+    "Xted.Help.minHeight: 150",
+    "Xted.Help.title: ted help",
+    "Xted.Help.form.rmButton.label:  Remove help",
+    "Xted.Help.form.rmButton.top:    ChainTop",
+    "Xted.Help.form.rmButton.bottom: ChainTop",
+    "Xted.Help.form.rmButton.left:   ChainLeft",
+    "Xted.Help.form.rmButton.right:  ChainLeft",
+    "Xted.Help.form.text.top:    ChainTop",
+    "Xted.Help.form.text.bottom: ChainBottom",
+    "Xted.Help.form.text.left:   ChainLeft",
+    "Xted.Help.form.text.right:  ChainRight",
+
+    NULL,
+};
+
+/*
+     The command line arguments and how to parse them into resources.
+*/
+static XrmOptionDescRec options[] =
+{
+    {"-PLN",     "PLN",     XrmoptionSepArg, NULL},
+    {"-ABI",     "ABI",     XrmoptionSepArg, NULL},
+    {"-ALF",     "ALF",     XrmoptionSepArg, NULL},
+    {"-SCF",     "SCF",     XrmoptionSepArg, NULL},
+    {"-output",  "output",  XrmoptionSepArg, NULL},
+    {"-baseNum", "baseNum", XrmoptionSepArg, NULL},
+    {"-mag", "mag", XrmoptionSepArg, NULL},
+    {"-astring", "astring", XrmoptionSepArg, NULL},
+    {"-bottom", "bottom", XrmoptionSepArg, NULL},
+    {"-raw", "raw", XrmoptionSepArg, NULL},
+    {"-enzyme","enzyme",XrmoptionSepArg,NULL},
+      
+};
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   String plain;
+    String ABI;
+    String ALF;
+    String SCF;
+    String output;
+    int baseNum;
+    int mag;
+    String astring;
+    int bottom;
+    String raw;
+    String enzyme;
+} AppResources;
+
+/*
+    Dig the command line arguments out of resources and into
+    the above structure.
+*/
+static XtResource tedResources[] =
+{
+    {"PLN", "Plain", XtRString, sizeof(String),
+     XtOffset(AppResources *, plain), XtRImmediate, NULL},
+    {"ABI", "ABI", XtRString, sizeof(String),
+     XtOffset(AppResources *, ABI), XtRImmediate, NULL},
+    {"ALF", "ALF", XtRString, sizeof(String),
+     XtOffset(AppResources *, ALF), XtRImmediate, NULL},
+    {"SCF", "SCF", XtRString, sizeof(String),
+     XtOffset(AppResources *, SCF), XtRImmediate, NULL},
+    {"output", "output", XtRString, sizeof(String),
+     XtOffset(AppResources *, output), XtRImmediate, NULL},
+    {"baseNum", "baseNum", XtRInt, sizeof(int),
+     XtOffset(AppResources *, baseNum), XtRImmediate, (XtPointer) NULLBaseNum},
+    {"mag", "mag", XtRInt, sizeof(int),
+     XtOffset(AppResources *, mag), XtRImmediate, NULL},
+    {"astring", "astring", XtRString, sizeof(String),
+     XtOffset(AppResources *, astring), XtRImmediate, NULL},
+    {"bottom", "bottom", XtRInt, sizeof(int),
+     XtOffset(AppResources *, bottom), XtRImmediate, NULL},
+    {"raw", "raw", XtRString, sizeof(String),
+     XtOffset(AppResources *, raw), XtRImmediate, NULL},
+    {"enzyme", "enzyme", XtRString, sizeof(String),
+     XtOffset(AppResources *, enzyme), XtRImmediate, NULL},
+};
+
+
+
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    (caddr_t)30  /* default is 30, although you can override that from
+		    the app-defaults file or on the command line */
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+/* ---- Callbacks ---- */
+
+
+static void inputCallback(Widget widget,
+			  XtPointer client_data, XtPointer call_data)
+{   inputSeq(widget);
+}
+
+
+static void searchCallback(Widget widget,
+			  XtPointer client_data, XtPointer call_data)
+{   inputSearchString(widget);
+}
+
+
+
+static void outputCallback(Widget widget,
+			   XtPointer client_data, XtPointer call_data)
+/*
+   client_data contains the default (NULL) or command line specified
+   output file name to be used.
+*/
+{
+
+/* this next statement was added by lfw 10/16/90, to assure that
+   a default output filename of inputfilename.seq will be assigned.*/
+
+    if (client_data!=NULL) 
+	sprintf(o_fn,"%s",client_data);
+
+#ifdef DEF_OUT
+    if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+
+    if (!isDotSeq(o_fn))
+	strcat(o_fn,".seq");
+    /*
+    ** This is an aweful way of accomplishing something very simple
+      { char junk[strlen(o_fn)+5];
+	strcpy(junk,o_fn);
+	sprintf(o_fn,"%s.seq",junk);
+      }
+    */
+#endif
+
+    outputSeq(widget, (String) o_fn);
+}
+
+
+static void helpCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{
+    help();
+}
+
+static void infoCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{
+    information(widget);
+}
+
+
+static void quitCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{   /* Quit the application */
+    quitApplication(widget);
+}
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF) {tracefilename} [-baseNum {number}] [-mag {number, 1 to 100}] [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       [-output {outputfilename}]\n");
+    fprintf(stderr,
+"       [-astring {sequence}]\n");
+    fprintf(stderr,
+"       [-enzyme {5' cutting sequence}]\n");
+    fprintf(stderr,
+"       [-raw {filename to be placed at head of xdap compatible .seq file}]\n");
+    fprintf(stderr, 
+"       [-geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]]\n");
+    fprintf(stderr,
+"       [-display [{host}]:[{vs}]]\n");
+    fprintf(stderr,
+"       [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]\n");
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{   XtAppContext app_con;
+    AppResources app_resources;
+    Widget toplevelWid, mainFormWid;
+    /* Widget titleWid, Widget versionWid; */
+    Widget seqNameWid, seqNamePromptWid;
+    Widget NorigBasesWid, NorigBasesPromptWid,strandWid;
+    Widget comWid;
+    Arg args[10];
+    int nargs;
+    AppData app_data;
+
+    /*
+        Setup X and the toplevel widget
+    */
+    toplevelWid = XtAppInitialize(&app_con, "Xted",
+				  options, XtNumber(options),
+				  (int *)&argc, argv,
+				  fallback_resources,
+				  NULL, (Cardinal) 0);
+
+
+    /*
+         Transfer the command line arguments into app_resources.
+    */
+    XtGetApplicationResources(toplevelWid, (XtPointer) &app_resources,
+			      tedResources, XtNumber(tedResources),
+			      NULL, 0);
+
+    XtGetApplicationResources(toplevelWid, (XtPointer) &app_data,
+			      resources, XtNumber(resources),
+			      NULL, 0);
+
+#ifndef AUTO_CLIP
+    app_data.enz = NULL;
+#endif
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((argc != 1)                          ||
+	(app_resources.plain==NULL &&
+	 app_resources.ABI==NULL   &&
+	 app_resources.ALF==NULL   &&
+	 app_resources.SCF==NULL   &&
+	 (app_resources.baseNum!=NULLBaseNum ||
+	 app_resources.astring!=NULL)) ||
+	(app_resources.plain!=NULL && app_resources.ABI!=NULL) ||
+	(app_resources.astring!=NULL && app_resources.baseNum!=NULLBaseNum))
+	
+    {   usage(argc, argv);
+        
+	XtDestroyApplicationContext(app_con);
+	exit(1);
+    }
+
+
+    /*
+        Create the widgets
+    */
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+					toplevelWid,
+					NULL, (Cardinal) 0);
+    
+    /* Title and version */
+/*    nargs = 0;
+    titleWid = XtCreateManagedWidget("title",  labelWidgetClass,
+				     mainFormWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, titleWid); nargs++;
+    versionWid = XtCreateManagedWidget("version",  labelWidgetClass,
+				       mainFormWid, args, nargs);
+*/    
+    /* Current sequence name */
+    nargs = 0;
+    seqNamePromptWid = XtCreateManagedWidget("seqNamePrompt",
+					     labelWidgetClass, mainFormWid,
+					     args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, seqNamePromptWid); nargs++;
+    seqNameWid = XtCreateManagedWidget("seqName",  labelWidgetClass,
+				       mainFormWid, args, nargs);
+    
+    /* Original number of bases */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, seqNameWid); nargs++;
+    NorigBasesPromptWid = XtCreateManagedWidget("NorigBasesPrompt",
+						labelWidgetClass, mainFormWid,
+						args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, NorigBasesPromptWid); nargs++;
+    NorigBasesWid = XtCreateManagedWidget("NorigBases", labelWidgetClass,
+					  mainFormWid, args, nargs);
+
+    /* Buttons */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, NorigBasesWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Input",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, inputCallback,  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Output",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, outputCallback,
+		  (XtPointer) app_resources.output);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("search",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, searchCallback,
+		  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Help",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, helpCallback,  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Information",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, infoCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Quit",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, quitCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    strandWid = XtCreateManagedWidget("strand",  labelWidgetClass,
+				   mainFormWid, args, nargs);
+
+
+    createDisplay(mainFormWid, seqNameWid);
+
+
+    /* Create the help shell, undisplayed. */
+    (void) createHelpShell(toplevelWid);
+    
+    /*
+        Realize everything.
+    */
+    XtRealizeWidget(toplevelWid);
+
+
+    /*
+        Setup initial sequence, if any.
+	Note that we have already filtered out multiple formats.
+    */
+
+    /*
+    ** set up raw file name for output
+    */
+    if (app_resources.raw!=NULL) {
+        strcpy (r_fn,app_resources.raw);
+    } else {
+        r_fn[0] = '\0';
+    }
+
+    if (app_resources.bottom!=NULL) {
+      if (app_resources.bottom==1) {
+	nargs = 0;
+	XtSetArg(args[nargs], XtNlabel, "Strand: Bottom"); nargs++;
+	XtSetValues(strandWid, args, nargs);
+      }
+    }
+
+
+    if (app_resources.plain!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.plain);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.enzyme == NULL) app_resources.enzyme = app_data.enz;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+
+      initialDisplayedSeq(toplevelWid, "plainFmt",
+			  app_resources.plain, app_resources.baseNum , app_resources.mag, 
+			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+    }
+
+
+
+    if (app_resources.ABI!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.ABI);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.enzyme == NULL) app_resources.enzyme = app_data.enz;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "abiFmt",
+			  app_resources.ABI, app_resources.baseNum , app_resources.mag,
+			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+
+    if (app_resources.ALF!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.ALF);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "alfFmt",
+			  app_resources.ALF, app_resources.baseNum, app_resources.mag,			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+    if (app_resources.SCF!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.SCF);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "scfFmt",
+			  app_resources.SCF, app_resources.baseNum, app_resources.mag,			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+    /*
+        Set everything running.
+    */
+    XtAppMainLoop(app_con);
+}
+
diff --git a/src/newted/ted.help b/src/newted/ted.help
new file mode 100644
index 0000000..bbea1bf
--- /dev/null
+++ b/src/newted/ted.help
@@ -0,0 +1,168 @@
+Trace Editor Help
+-----------------
+
+The ted trace editor is a prototype to allow the display and editing
+of traces from sequencing machines, and the simple editing of plain
+sequences. It runs under the X window system. It provides simultaneous
+display of traces and bases. The editing allows individual bases to be
+removed and new ones added, and also a range of bases at either end to
+be cutoff. Currently, only ABI result files and plain sequences are
+accepted.
+
+Only one trace can be edited at a time.
+
+
+Invocation
+----------
+
+ted can be run from the command line by simply typing:
+
+    ted
+
+It will come up with no sequence initially displayed. If provided with
+any arguments it does not understand, or invalid combinations of
+arguments, ted will exit with a message indicating its intended usage.
+ted accepts the standard X arguments allowing, for example, background
+colour or geometry to be specified. ted can accept an argument
+specifying an initial file to display. The key for this is the format
+of the file, for example:
+
+    ted -ABI {ABI format filename}
+    ted -plain {plain format filename}
+
+The file is then displayed at 50% magnification, with the caret 
+initially positioned at the first base.
+
+When an initial file is given, a base number of interest and/or a
+magnification can also be given, for example:
+
+    ted -ABI {ABI format file} -baseNum 280 -mag 30
+
+or the bottom strand may be specified:
+ 
+    ted -ABI {ABI format file} -baseNum 280 -mag 30 -bottom 1
+	or
+    ted -ABI {ABI format file} -bottom 1
+
+or a string of nucleotides on which the center the window:
+	
+    ted -ABI {ABI format file} -astring 1
+	or
+    ted -ABI {ABI format file} -astring 1 -mag 30 -bottom 1
+
+Options can be specified in any order.
+
+An output filename can be specified in a similar manner:
+
+    ted -ABI inputfilename -output outputfilename
+
+The default output filename is inputfilename.seq
+
+If you are running the program on a remote machine, you must
+specify a display parameter:
+
+    ted -display machine_name:0.2
+
+You can also specify the size of the opening window or
+other screen parameters by the following:
+
+    ted  -geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]
+	   [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]
+
+Displays
+--------
+
+When running, ted displays the name of the file it is currently
+operating on (if any) and the original number of bases.
+
+A so-called viewport presents four different synchronised views of
+part of the trace. The top one indicates the sequence indices - the
+first digit of the number if positioned over the base to which that 
+number corresponds.  Below this is a list of the bases as originally
+found in the file (this is the interpretation of the trace as made by
+the sequencing machine). Below this is the list of bases as edited by
+the user --- initially, if this file has not been edited in the past,
+this is identical to the list of original bases.  However, if in a 
+previous session the user has edited this sequence, the edited
+version of the sequence will appear in the edit window. 
+The final display is of the traces produced by the sequencing
+machine for the four respective bases.
+
+Two controls allow the view presented to be adjusted: both are
+horizontal sliders or scrollbars. The first affects the magnification
+at which the trace is viewed. The minimum magnification is such that
+the whole of the trace is visible within the viewport; when a trace is
+first input, this is the magnification used. The maximum magnification
+is such that bases are spaced out with several characters of space
+between them --- this should allow more than enough room for base
+insertions to be clearly visible. The second scrollbar is immediately
+above the viewport and allows the user to select which part of the
+trace is viewed. Both the sliders work in a similar way: the middle
+mouse button can be used to drag the thumb to any desired position,
+the left and right mouse buttons can be clicked within the scrollbar
+to indicate that paging up or down is desired. In the case of the
+viewport scrollbar, the amount of paging is determined by how far up
+the scrollbar the pointer is.
+
+The whole ted window can be expanded and contracted (to an extent) by
+dragging the "grow-region" provided by whatever window manager is
+running. The viewport takes up all of this change in size.
+
+Controls
+--------
+
+ted has four buttons. "Quit" exits the program after first checking
+whether there is a sequence which has been edited and not saved.
+
+"Help" pops up this window which has a scrollbar on the left allowing
+all the text to be viewed.
+
+"Input" presents a dialogue which asks for the format and name of a
+file to be processed. The bases and (if this is not a plain format
+file) traces are read in and displayed for editing. The only
+conversion performed on bases is from 'N' to '-'.
+
+"Output" presents a dialogue which asks for a filename into which the
+edited and clipped bases can be saved. The default value can be set
+on the command line using the "-output" keyword. No conversion of bases
+is performed on output.
+
+ted operates in one of three editing modes, one of which is selected
+from three "radio buttons". The currently selected mode is
+highlighted.
+
+Editing
+-------
+
+In "Edit sequence" mode, the (lower) list of editable bases can be
+edited in much the same way as a text editor operates. A "caret" which
+is visible in the display of edited bases can be moved left and right
+with the cursor keys (these are sometimes called arrow keys and often
+appear on numeric keypads). It can also be positioned by clicking any
+button while the pointer is pointing into either of the list of bases
+or the traces. The DELETE key deletes the base immediately to the left
+of the caret. Any printing character can be inserted to the right of
+the caret by simply typing it. Inserted characters are placed halfway
+between their neighbours, or if a space is left by the deletion of a
+base originally there, its position is used. A base can thus be
+changed by deleting it and entering the new base.
+
+Note that in the current version of ted the caret is not constrained
+to remain within the viewed part of the display and that editing can
+still continue while it is thus invisible. Such editing would probably
+only occur by accident.
+
+ted provides a facility to define a cutoff at either end of the trace.
+A number of the leftmost bases (corresponding to the vector) and the
+rightmost bases (corresponding to the point where the data become
+unreliable) can be defined by setting the editor into "Adjust left
+cutoff" or "Adjust right cutoff" mode. In either of these modes, the
+pointer and mouse buttons can be used to indicate the cutoff point,
+and the cursor keys can be used to adjust this leftwards or
+rightwards. Initially, the cutoff regions are both empty. The cutoff
+regions are clearly indicated on the list of edited bases display and
+on the traces display by being drawn with a dimmed background.
+
+When the sequence is written out, the list of edited bases, with both
+cutoff regions removed, is written. The output contains newlines
+for convenient formatting and always ends with one.
diff --git a/src/newted/ted.manpage b/src/newted/ted.manpage
new file mode 100644
index 0000000..21201a9
--- /dev/null
+++ b/src/newted/ted.manpage
@@ -0,0 +1,107 @@
+.TH ted 1L "July 1991" "MRC LMB" "LOCAL"
+.SH NAME
+ted \- trace editor
+.SH SYNOPSIS
+.B ted
+[(
+.B -ABI\||\|-ALF\||\|-plain
+)
+.I tracefilename
+[
+.B -baseNum
+.I number
+]
+.B [
+.B -mag
+.I number
+( 1 to 100 )
+] 
+.B [
+.B -bottom
+.I number
+(1(true) or 0(false))
+.B ]
+.B [
+.B -astring
+.I nucleotide-string
+]] 
+.B [
+.B -enzyme
+.I  5' cutting sequence
+] 
+.B [
+.B -raw
+.I  filename
+(to be placed at head of xdap compatible .seq file)
+.B ]
+[
+.B -output
+.I outputfilename
+]
+
+.SH DESCRIPTION
+.B ted
+is a simple prototype editor for traces produced from automatic
+sequencing machines. It allows the traces (from the ABI
+or ALF sequencing machines) produced to be
+displayed along with the machines interpretation of these into
+bases and an initially identical sequence which can be edited
+by the user. A cutoff region can be defined at both ends. The
+edited and clipped list of bases can then be written out.
+.LP
+When initially run,
+.B ted
+displays the trace file
+.I tracefilename
+(if given) of the specified format centered on the base number
+.I baseNum
+(if given). If no file is provided,
+.B ted
+initially displays nothing.
+.LP
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+being represented by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+The information button brings up signal strength and average spacing for
+ABI files.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.  The output filename is specified in a dialogue, 
+but a default value of inputfilename.seq is provided or the default value
+can be given with the
+.I outputfilename
+argument.
+.LP
+A simple help system is provided.
+.SH FILES
+.PD 0
+.TP 20
+.B ted.help
+Text provided in the help window.
+.TP
+.B /usr/lib/X11/app-defaults/Xted
+Default application resources.
+.SH ENVIRONMENT
+.TP 20
+.SB XFILESEARCHPATH
+Specifies the locations where
+.B ted.help
+is sought.
+If this is not defined,
+.B ted.help
+must be in the
+.B /usr/lib/X11/app-defaults
+directory.
+.SH AUTHORS
+Tim Gleeson, LaDeana Hillier, Simon Dear.
diff --git a/src/newted/ted.tex b/src/newted/ted.tex
new file mode 100644
index 0000000..0a0b291
--- /dev/null
+++ b/src/newted/ted.tex
@@ -0,0 +1,213 @@
+\documentstyle[12pt]{article}
+
+\title{A trace display and editing program for data from fluorescence based 
+sequencing machines}
+\author{Timothy Gleeson \and LaDeana Hillier}
+
+\begin{document}
+\maketitle
+\section*{}
+\subsection*{}
+\subsubsection*{ABSTRACT}
+
+``Ted'' ({\em T}race {\em ed}itor) 
+is a graphical editor for sequence and trace data from automated 
+fluorescence sequencing machines.  It provides facilities 
+for viewing sequence and trace data (in top or bottom strand 
+orientation), for editing the base sequence,  for 
+automated or manual trimming of the head (vector) and tail 
+(uncertain data) from the sequence, for vertical and horizontal trace 
+scaling, for keeping a history of sequence editing, and for output of 
+the edited sequence.  Ted has been used extensively in the C. 
+elegans genome sequencing project,
+both as a stand-alone program and integrated into 
+the Staden sequence assembly package, and  has 
+greatly aided in the efficiency 
+and accuracy of sequence editing.  It runs in the X 
+windows environment on Sun workstations and is available from the 
+authors.  Ted currently supports sequence and trace data from the ABI 
+373A and Pharmacia A.L.F. sequencers.
+
+\subsubsection*{INTRODUCTION}
+	Time involved in sequence editing is extensive, and anything easing 
+that burden will improve the efficiency of any major sequencing 
+project.  Having sequence and trace data available online in easily-
+manipulable form is invaluable. Ted (a Trace-EDitor) was developed to 
+fill this role in the C. elegans genome 
+sequencing project [1]. 
+
+\subsubsection*{METHODS}
+
+{\em Computing Design and Implementation.}
+When designing ted, we had a number of specific computing goals 
+in mind including portability and adaptability.  For portability, we 
+chose to write ted in ANSI C using the X windowing system and the 
+Xaw toolkit.  X provides basic capabilities for the creation and use 
+of windows, and the toolkit contains a number of pre-packaged 
+components, such as the ``sliders'' used for scrolling. X also allows 
+site, user and per-run defaults to be set.  Adaptability is also an 
+important goal since we are providing a new function to 
+research groups who are constantly adding new requirements.  
+
+	Stylistically, we have followed an ``Abstract Data Type''
+discipline.  In this discipline, a program is split into a number of 
+modules which provide separate, well-defined functions.  We 
+separate the interface of a module from its implementation.  For 
+example, a unified internal sequence format is used.  This can store 
+a varying amount of information.  However, there is a clear and 
+simple interface by which the rest of the program accesses this 
+module.  Such a style is not well supported by C, but its adoption has 
+been very successful.  The addition of new sequencing machines, and 
+thus new external data formats, may cause some changes in the 
+internal representation of the sequence but should not affect  
+the rest of the program.
+
+	Ted accepts a large number of optional command line arguments,
+many of which can also be specified as system defaults. This
+supports a mode of working whereby ted is invoked not directly by the
+user but instead by a script or another application which supplies
+arguments appropriate to the editing task.
+
+
+{\em Graphical Interface.}
+Ted currently accepts data from two fluorescence based sequencing
+machines, the Pharmacia A.L.F. and the ABI 373A.
+The sequencing machine data consists of 
+four traces of fluorescence levels together with the machine's 
+interpretation, which is a sequence of bases.  
+Ted displays 
+the traces and the machine-generated base list.  
+A second, initially identical, list of bases is provided for correction 
+by the user.
+
+	Ted has an X windows based 
+graphical interface. The trace file
+can either be input from the command line or by 
+clicking on the INPUT button after the program has been invoked.  
+Other parameters which the user may specify on the
+command line include: the output 
+file name; a base position or sequence string on which the trace is 
+to be centered;  a default trace magnification;  a 5' vector sequence 
+for automated elimination of the sequence head (vector); top or 
+bottom strand orientation; or any of the usual X-window parameters (e.g. 
+display, geometry...).
+
+	The graphics display (Figure 1) consists of the control 
+panel, the base position information, the original and edited sequence 
+data, and the graphical representation of the trace.  The user may 
+begin by using the control panel INPUT button to input a new trace 
+file at which time the user selects whether to view the sequence
+and trace in top or bottom strand orientation.
+The trace file is displayed and, if a 5' vector sequence has been 
+specified on the command line, the program attempts to select a 
+cutoff point corresponding to the vector sequence at the ``head'' of the 
+trace file.  The bases beyond the ``cutoff'' point are  
+displayed on a shaded background.  The user may modify the cutoff 
+position by clicking on the ``Adj left cut'' button and clicking on the 
+position of the desired cutoff.  Similarly, the user may adjust the 
+right cutoff of the sequence (chosen by starting at the 5' end of the 
+sequence and looking for the first occurrence when 2 out of 5 bases 
+are 'N') by scrolling along the sequence to that point, clicking on the 
+``Adj right cut'' button, and clicking on the appropriate base.  
+Automation of the ``cutoff'' process is optional; the user may compile 
+the program with that feature turned ``off.'' 
+
+	Clicking on the ``Edit seq'' button allows the user to enter the edit 
+mode.  The ``Search'' button can be used to skip from ``problem'' to 
+``problem'' (i.e., ambiguity to ambiguity) or to look for runs of 
+identical bases (e.g., TTTT) which are often mis-called by
+the machine software.
+
+  Bases can be inserted, deleted, or replaced as with
+any ordinary word-processor.  In difficult-to-read areas,  
+the trace may be vertically or horizontally scaled by dragging or 
+clicking on the magnification scroll bar or by clicking on the 
+vertical scaling buttons (``Scale down'', ``Scale up''), respectively.  
+Finally, the edited sequence is saved to an ascii file using the 
+``Output'' button.  A history of the editing session can also be saved
+along with the sequence. 
+The ``Quit'' button is used 
+to exit the program.  When reinvoking ted on an edited trace file the 
+edited base sequence, rather than the original sequence, is shown in 
+the edited base window.  The user may invoke ted by calling in any one 
+of the previous editing sessions.   
+
+
+\subsubsection*{APPLICATIONS AND CONCLUSIONS}
+
+	In the C. elegans genome sequencing project, data from the ABI or 
+A.L.F. sequencing machines' computers are transferred to Sun 
+workstations.  
+The user invokes a Unix shell script that calls ted systematically 
+on each of the new set of trace files creating a set of sequence files.
+The sequence files that are deemed to be of acceptable quality
+are then entered into the sequence 
+assembly program xdap [2] where the sequences are assembled into 
+contigs.  Portions of the ted trace-editor have been incorporated 
+into the xdap ``trace manager,''  which is used in 
+conjunction with the contig editor to view sets of aligned traces 
+at sites of discrepancies in the aligned sequences.  
+
+	Ted is also used at the stage of choosing oligo primers for the 
+``walking'' stage of the sequencing project.  It can be invoked directly 
+from the oligo selection program, osp [3], to allow examination
+of the trace data in the region of the primers so that  
+integrity of the sequence data can be verified.
+
+	Currently, no other programs are known to be available 
+which support editing of the ABI trace data. 
+Further, the modular design of the program should allow
+support for new types of sequencing machines, with new data 
+formats, to be implemented in a straightforward fashion.  
+
+
+\subsubsection*{AVAILABILITY}
+	Ted is freely available from the authors or from Rodger Staden and
+Simon Dear (MRC Laboratory of Molecular Biology, Hills Road, Cambridge,
+UK, CB2 2QH) for use on Sun workstations running X-windows (or OpenLook).
+
+
+\subsubsection*{ACKNOWLEDGMENTS}
+	The authors would like to thank all members of the C. elegans
+sequencing project with special thanks to the following people:
+John Sulston, Bob Waterston,  
+Phil Green, Rick Wilson, Richard Durbin, Simon Dear, and Rodger Staden 
+for their helpful suggestions for improvements in the ted interface 
+and for their parts in the development of ted.  This work was 
+supported by the Medical Research Council and NIH grant R01-HG00136.
+
+\subsubsection*{REFERENCES}
+
+1. Waterston, R., Sulston, J., et al. (1991), in preparation.
+
+2. Dear, S. and Staden, R. (1991) Nuc. Acids Res.,  in press.
+
+3. Hillier, L. and Green, P. (1991) submitted.
+
+
+{\bf Figure 1 legend.}
+
+Figure 1 shows a ``screen dump'' of the ted graphical interface.  
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+ being represented
+by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.
+
+\end{document}
+
+
+
diff --git a/src/newted/trace.abi b/src/newted/trace.abi
new file mode 100644
index 0000000..5b9b9f4
Binary files /dev/null and b/src/newted/trace.abi differ
diff --git a/src/newted/trace.alf b/src/newted/trace.alf
new file mode 100644
index 0000000..e1bf33c
Binary files /dev/null and b/src/newted/trace.alf differ
diff --git a/src/newted/trace.scf b/src/newted/trace.scf
new file mode 100644
index 0000000..ceff4dd
Binary files /dev/null and b/src/newted/trace.scf differ
diff --git a/src/newted/traceDisplay.c b/src/newted/traceDisplay.c
new file mode 100644
index 0000000..cf28145
--- /dev/null
+++ b/src/newted/traceDisplay.c
@@ -0,0 +1,618 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.c
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Friday 15 May 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    The cutoffs are actually drawn between the positions of the
+    bases on either side.
+
+    SD:
+	Added incScaleFactor, decScaleFactor
+	Changed scaleTrace to use scale factor
+    SD:
+	Support monochrome displays when marking cutoffs
+    SD: 15-May-1992
+        Calculation of next value of pSegN in drawTrace() gives a value one too many
+	
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "traceDisplay.h"
+#include "display.h"      /* IMPORT: userEvent */
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+/* ---- Statics ---- */
+
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static TRACE maxTraceVal;
+
+static Widget traceWid;
+
+static Dimension traceWidth;     /* Current width of the trace graph */
+static Dimension traceHeight;    /* Current height of the trace graph */
+static Dimension plotEdgeOffset;
+static Dimension plotWidth;      /* Current width of the plot
+				    = graphWidth - 2*plotEdgeOffset */
+
+static int leftCutoff, rightCutoff;
+
+static GC Agc, Cgc, Ggc, Tgc;
+static Pixel normBackground, dimBackground;
+
+/* monochrome stippling */
+static GC greygc;
+static int planes;
+
+static float scaleFactor;
+
+
+
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(int point)
+{   return(((point * plotWidth) / (NPoints-1)) + plotEdgeOffset);
+}
+
+
+static int pixelToPoint(int pixel)
+{   return(((pixel-plotEdgeOffset) * (NPoints-1)) / plotWidth);
+}
+
+
+static int scaleTrace(int y)
+{   return(traceHeight - 
+	   (int)(scaleFactor * (float)y * (float)traceHeight / (float)maxTraceVal)
+	   );
+}
+
+
+static int leftCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NedBases = getNBases(currSeq, EdBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum);
+    pR = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void drawTrace(Widget wid, char base, int p0, int pN)
+/*
+    Draw the `base' trace between points p0..pN (inclusive)
+*/
+{   GC gc;
+    int pSeg, pSeg0, pSegN;
+#define NSegs 500
+    int traces[NSegs];
+    XPoint xPoint[NSegs];
+
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+    }
+
+
+    /*
+        The range (p0..pN) is drawn as a series of overlapping
+	segments (pSeg0..pSegN).
+    */
+    pSegN = p0;
+    do
+    {   int i;
+
+	/*
+	    This segment carries on from and including the last point
+	    of the last segment.
+	*/
+	pSeg0 = pSegN;
+	pSegN = ((pSeg0+NSegs-1)<=pN) ? pSeg0+NSegs-1 : pN;      
+	/* Get the traces for this segment */
+	getTraces(currSeq, base, pSeg0, pSegN, traces);
+
+	/* Translate to pixel coords and place in an XPoint array */
+	pSeg = pSeg0;
+	i = 0;
+	while (pSeg<=pSegN)
+	{   xPoint[i].x = pointToPixel(pSeg);
+	    xPoint[i].y = scaleTrace(traces[i]);
+	    pSeg++;
+	    i++;
+	}
+
+	/* Draw the segments in the XPoint array */
+	XDrawLines(XtDisplay(wid),
+		   XtWindow(wid),
+		   gc,
+		   xPoint,
+		   pSegN-pSeg0+1,
+		   CoordModeOrigin);
+    } while (pSegN < pN);
+}
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN; /* Affected region: pixels */
+    int p0, pN; /* Affected region: points */
+    int leftCutoffP,  rightCutoffP;
+    
+
+    if ((currSeq == NULLSeq) || isBasesOnly(currSeq)) return;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+    /* Trim it so we only consider the plot proper */
+    if ((x0>plotWidth+plotEdgeOffset) || (xN<plotEdgeOffset)) return;
+    x0 = (x0<plotEdgeOffset)           ? plotEdgeOffset           : x0;
+    xN = (xN>plotWidth+plotEdgeOffset) ? plotWidth+plotEdgeOffset : xN;
+
+    /* Convert to affected points */
+    p0 = pixelToPoint(x0);
+    pN = pixelToPoint(xN);
+    if (pN<NPoints-1) pN++;
+
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+    leftCutoffP  = leftCutoffPoint(leftCutoff);
+    rightCutoffP = rightCutoffPoint(rightCutoff);
+      
+
+    if (p0<leftCutoffP)
+    /*
+        Some of the exposed region is in the left cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = p0;
+       int lastP  = (pN<leftCutoffP) ? pN : leftCutoffP;
+
+       int pix0 = pointToPixel(firstP);
+       int pixN = pointToPixel(lastP);
+
+	
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(traceWid), XtWindow(traceWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, traceHeight);
+       } else {
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		  pix0, 0,
+		  pixN-pix0, traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    normBackground);
+       }
+    }
+      
+
+    if (pN>rightCutoffP)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = (p0>rightCutoffP) ? p0 : rightCutoffP;
+       int lastP  = pN;
+
+       int pix0 = pointToPixel(firstP);
+       int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(traceWid), XtWindow(traceWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, traceHeight);
+       } else {
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		  pix0, 0,
+		  pixN-pix0, traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    normBackground);
+	}
+    }
+
+    /*
+        Now draw the traces.
+	The GC background is unused when drawing lines, so we do
+	not need to change it depending on whether we are in the
+	cutoff region or not.
+    */
+    drawTrace(wid, 'A', p0, pN);
+    drawTrace(wid, 'C', p0, pN);
+    drawTrace(wid, 'G', p0, pN);
+    drawTrace(wid, 'T', p0, pN);
+}
+
+
+
+
+static void userCallback(Widget wid,
+			 XtPointer client_data, XtPointer call_data)
+/*
+    Handles all XtNcallbacks, i.e., buttons and keys.
+*/
+{   XEvent *eventP = (XEvent *) call_data;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Callback to the `display' module.
+    */
+    userEvent(eventP);
+}
+
+
+
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Arg args[10];
+    int nargs;
+
+    /*
+	This routine can be called in two ways.
+	(a) as a callback when the trace is resized by the user
+	(b) indirectly when setGraphWidth alters the graph width
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &traceHeight); nargs++;
+    XtSetArg(args[nargs], XtNwidth,  &traceWidth);  nargs++;
+    XtGetValues(wid, args, nargs);
+
+    if (XtIsRealized(wid)) XClearWindow(XtDisplay(wid), XtWindow(wid));
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+void redisplayTraces()
+{
+    /*
+    **  Clear the area and generate an expose.
+    */
+    XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		0, 0,
+		0, 0,
+		True);
+}
+
+void incScaleFactor()
+{
+    scaleFactor *= 1.5;
+    redisplayTraces();
+}
+
+void decScaleFactor()
+{
+    scaleFactor /= 1.5;
+    redisplayTraces();
+}
+
+void setScaleFactor(float sf)
+{
+    scaleFactor = sf;
+    redisplayTraces();
+}
+
+
+
+void createTraceDisplay(Widget parentWid, Widget fromVertWid,
+			Dimension width, Dimension offset)
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `offset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    width);       nargs++;
+    traceWid = XtCreateManagedWidget("trace", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(traceWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(traceWid, XtNexposeCallback, exposeCallback, NULL);
+    XtAddCallback(traceWid, XtNresizeCallback, resizeCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &traceHeight); nargs++;
+    XtGetValues(traceWid, args, nargs);
+    traceWidth     = width;
+    plotWidth      = traceWidth-(2*plotEdgeOffset);
+    plotEdgeOffset = offset;
+    scaleFactor = 1.0;
+
+    currSeq = NULLSeq;
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(traceWid);
+    Cgc = GraphGC2(traceWid);
+    Ggc = GraphGC3(traceWid);
+    Tgc = GraphGC4(traceWid);
+
+    /*
+        Get the backgrounds
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNbackground,    &normBackground);    nargs++;
+    XtSetArg(args[nargs], XtNdimBackground, &dimBackground); nargs++;
+    XtGetValues(traceWid, args, nargs);
+
+    planes = DisplayPlanes(XtDisplay(traceWid),DefaultScreen(XtDisplay(traceWid)));
+    if (planes == 1) {
+	/* prepare for monochrome display */
+	XtGCMask valuemask = (GCStipple | GCFillStyle);
+	XGCValues values;
+	Pixmap grey_stipple;
+
+#define grey_width 4
+#define grey_height 4
+	static char grey_bits[] = {
+	   0x0e, 0x0b, 0x0e, 0x0b};
+
+	grey_stipple =
+	    XCreateBitmapFromData(XtDisplay(traceWid),
+				  RootWindowOfScreen(XtScreenOfObject(traceWid)),
+				  grey_bits,
+				  grey_width,
+				  grey_height);
+
+	values.stipple = grey_stipple;
+	values.fill_style = FillOpaqueStippled;
+	greygc = XCreateGC(XtDisplayOfObject(traceWid),
+			     RootWindowOfScreen(XtScreenOfObject(traceWid)),
+			     valuemask, &values);
+
+    }
+ 
+}
+
+
+
+
+void displayTrace(Seq seq)
+/*
+    Display the trace of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NPoints     = getNPoints(currSeq);
+    maxTraceVal = getMaxTraceVal(currSeq);
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+void unDisplayTrace()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+}
+
+
+
+
+void setTraceWidth(Dimension width)
+/*
+    Set the width of the current sequence trace display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(traceWid,   args, nargs);
+
+    traceWidth = width;
+    plotWidth  =  traceWidth-(2*plotEdgeOffset);
+
+    /*
+        Clear the graph.
+    */
+    if (XtIsRealized(traceWid))
+        XClearWindow(XtDisplay(traceWid), XtWindow(traceWid));
+}
+
+
+
+
+void traceCutoffChange()
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+{   int newLeftCutoff, newRightCutoff;
+
+    getCutoffs(currSeq, &newLeftCutoff, &newRightCutoff);
+
+
+    if (newLeftCutoff != leftCutoff)
+    /*
+        The region between newLeftCutoff and leftCutoff has changed.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	*/
+	if (newLeftCutoff<leftCutoff)
+	{   b0 = newLeftCutoff;
+	    bN = leftCutoff;
+	}
+	else
+	{   b0 = leftCutoff;
+	    bN = newLeftCutoff;
+	}
+
+	p0 = leftCutoffPoint(b0);
+	pN = leftCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		   pix0, 0,
+		   pixN-pix0, traceHeight,
+		   True);
+    }
+
+
+    if (newRightCutoff != rightCutoff)
+    /*
+        The region between newRightCutoff and rightCutoff has
+	changed. Expose it.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	    N.B. lower cutoff means the base is further right.
+	*/
+	if (newRightCutoff<rightCutoff)
+	{   b0 = rightCutoff;
+	    bN = newRightCutoff;
+	}
+	else
+	{   b0 = newRightCutoff;
+	    bN = rightCutoff;
+	}
+
+	p0 = rightCutoffPoint(b0);
+	pN = rightCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		   pix0, 0,
+		   pixN-pix0, traceHeight,
+		   True);
+  
+    }
+
+
+    /*
+        Update static data.
+    */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+
+
diff --git a/src/newted/traceDisplay.h b/src/newted/traceDisplay.h
new file mode 100644
index 0000000..a1a1956
--- /dev/null
+++ b/src/newted/traceDisplay.h
@@ -0,0 +1,80 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.h
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Wednesday 9 Jan 1991
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget, Dimension */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void createTraceDisplay(Widget parentWid, Widget fromVertWid,
+			       Dimension width, Dimension offset);
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `plotEdgeOffset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+
+
+extern void displayTrace(Seq seq);
+/*
+    Display the trace of sequence `seq'.
+*/
+
+
+extern void unDisplayTrace();
+/*
+    Cease displaying the trace of the current sequence (if any).
+*/
+
+
+extern void setTraceWidth(Dimension width);
+/*
+    Set the width of the current sequence trace display.
+*/
+
+
+extern void traceCutoffChange();
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+
+
+extern void incScaleFactor();
+/*
+    Increment vertical scaling of trace
+*/
+
+
+extern void decScaleFactor();
+/*
+    Decrement vertical scaling of trace
+*/
+
+
+
+extern void setScaleFactor(float sf);
+/*
+    Set vertical scaling of trace
+*/
diff --git a/src/newted/traceType.c b/src/newted/traceType.c
new file mode 100644
index 0000000..2e55d2c
--- /dev/null
+++ b/src/newted/traceType.c
@@ -0,0 +1,119 @@
+/*
+  Title:  traceType
+
+  File:   traceType.c
+  Purpose: determining trace format
+
+  Last update: Wed Nov 11 1992
+
+  Change log :-
+*/
+
+/* ---- Imports ---- */
+
+#include "traceType.h"
+
+
+/* ---- Privates ---- */
+struct {
+    int type;
+    int offset;
+    char *string;
+} magics[] = {
+	{ TT_SCF , 0,   ".scf" } ,
+	{ TT_ABI , 0,   "ABIF" } ,
+	{ TT_ALF , 518, "ALF " } ,
+	{ TT_SCF , 0,   "\234\330\300\000" } /* Amersham variant */
+};
+
+
+
+char *trace_types[5] = {
+    "unknownFmt",
+    "scfFmt",
+    "abiFmt",
+    "alfFmt",
+    "plainFmt"
+    };
+
+
+#define Number(A) ( sizeof(A) / sizeof((A)[0]) )
+
+
+
+int determine_trace_type(char *fn)
+{
+    FILE *fp;
+    int i;
+    int len;
+    char buf[512];
+    int ps;
+    int acgt;
+    int c;
+
+
+    if ( (fp = fopen(fn,"rb")) == NULL ) return TT_EEK;
+
+    /* determine if this is a text file */
+    len = 0; ps = 0; acgt = 0;
+    for (i = 0; i < 512; i++) {
+	if ( ( c = fgetc(fp) ) == EOF ) break;
+	switch(c) {
+	case 'a': case 'c': case 'g': case 't':
+	case 'A': case 'C': case 'G': case 'T':
+	/*YUK! need the next line?*/
+	case 'n': case 'N': case '-':
+	    acgt++;
+	default:
+	    len++;
+	    if ( isprint(c) || isspace(c) ) ps++;
+	}
+    }
+    /*YUK! 75% of characters printable means text*/
+    if ( 100 * ps > 75 * len ) {
+	fclose(fp);
+	/*YUK! 75% of printables ACGTN means plain*/
+	return ( 100 * acgt > 75 * ps ) ? TT_PLN : TT_UNK;
+    }
+
+    /* YUK! short files are not traces? */
+    if (len<512) {
+        fclose(fp);
+        return TT_UNK;
+    }
+
+    /* check magics */
+    for (i = 0 ; i < Number(magics) ; i++) {
+	if (fseek(fp,magics[i].offset,0) == 0) {
+	    len = strlen(magics[i].string);
+	    if (fread(buf,len,1,fp)==1) {
+		if (strncmp(buf,magics[i].string,len)==0) {
+		    fclose(fp);
+		    return magics[i].type;
+		}
+	    }
+	}
+    }
+
+    fclose(fp);
+    return TT_UNK;
+
+}
+
+char *traceType(char *traceName)
+{
+    char *t;
+    switch(determine_trace_type(traceName)) {
+    case TT_UNK: t = "UNK"; break;
+    case TT_SCF: t = "SCF"; break;
+    case TT_ABI: t = "ABI"; break;
+    case TT_ALF: t = "ALF"; break;
+    case TT_PLN: t = "PLN"; break;
+    case TT_EEK:
+    default:
+	t = "EEK"; break;
+    }
+    return t;
+	
+}
+
diff --git a/src/newted/traceType.h b/src/newted/traceType.h
new file mode 100644
index 0000000..6449c5b
--- /dev/null
+++ b/src/newted/traceType.h
@@ -0,0 +1,39 @@
+#ifndef _traceType_h
+#define _traceType_h
+
+/*
+ Title:       traceType.h
+
+ File:        traceType.h
+ Purpose:     determining traceType of traces
+    Last update: Tue Jan 15 1991
+
+ Change log :-
+*/
+
+/* ---- Imports ---- */
+
+#include <stdio.h>      /* IMPORT: fopen, fclose, fseek, ftell, fgetc */
+#include <ctype.h>      
+#include <string.h>     /* IMPORT: isprint*/
+
+/* ---- Exports ---- */
+#define TT_EEK -1
+#define TT_UNK 0
+#define TT_SCF 1
+#define TT_ABI 2
+#define TT_ALF 3
+#define TT_PLN 4
+
+extern char *trace_types[5];
+
+extern int determine_trace_type(char *fn);
+
+extern char *traceType(char *traceName);
+/* returns the traceType, e.g. ALF, PLN, ABI..., when sent
+a tracename */
+
+#endif /*_traceType_h*/
+
+
+
diff --git a/src/newted/values.h b/src/newted/values.h
new file mode 100644
index 0000000..792e966
--- /dev/null
+++ b/src/newted/values.h
@@ -0,0 +1,118 @@
+/*	@(#)values.h 1.7 89/10/02 SMI; from S5R2 1.16	*/
+
+#ifndef _values_h
+#define _values_h
+
+/* These values work with any binary representation of integers
+ * where the high-order bit contains the sign. */
+
+/* a number used normally for size of a shift */
+#if gcos
+#define BITSPERBYTE	9
+#else
+#define BITSPERBYTE	8
+#endif
+#define BITS(type)	(BITSPERBYTE * (int)sizeof(type))
+
+/* short, regular and long ints with only the high-order bit turned on */
+#define HIBITS	((short)(1 << BITS(short) - 1))
+#define HIBITI	(1 << BITS(int) - 1)
+#define HIBITL	(1L << BITS(long) - 1)
+
+/* largest short, regular and long int */
+#define MAXSHORT	((short)~HIBITS)
+#define MAXINT	(~HIBITI)
+#define MAXLONG	(~HIBITL)
+
+/* various values that describe the binary floating-point representation
+ * _EXPBASE	- the exponent base
+ * DMAXEXP 	- the maximum exponent of a double (as returned by frexp())
+ * FMAXEXP 	- the maximum exponent of a float  (as returned by frexp())
+ * DMINEXP 	- the minimum exponent of a double (as returned by frexp())
+ * FMINEXP 	- the minimum exponent of a float  (as returned by frexp())
+ * MAXDOUBLE	- the largest double
+			((_EXPBASE ** DMAXEXP) * (1 - (_EXPBASE ** -DSIGNIF)))
+ * MAXFLOAT	- the largest float
+			((_EXPBASE ** FMAXEXP) * (1 - (_EXPBASE ** -FSIGNIF)))
+ * MINDOUBLE	- the smallest double (_EXPBASE ** (DMINEXP - 1))
+ * MINFLOAT	- the smallest float (_EXPBASE ** (FMINEXP - 1))
+ * DSIGNIF	- the number of significant bits in a double
+ * FSIGNIF	- the number of significant bits in a float
+ * DMAXPOWTWO	- the largest power of two exactly representable as a double
+ * FMAXPOWTWO	- the largest power of two exactly representable as a float
+ * _IEEE	- 1 if IEEE standard representation is used
+ * _DEXPLEN	- the number of bits for the exponent of a double
+ * _FEXPLEN	- the number of bits for the exponent of a float
+ * _HIDDENBIT	- 1 if high-significance bit of mantissa is implicit
+ * LN_MAXDOUBLE	- the natural log of the largest double  -- log(MAXDOUBLE)
+ * LN_MINDOUBLE	- the natural log of the smallest double -- log(MINDOUBLE)
+ * LN_MAXFLOAT	- the natural log of the largest float  
+ * LN_MINFLOAT	- the natural log of the smallest float
+ */
+#if u3b || u3b5 || sun
+#define MAXDOUBLE	1.797693134862315708e+308
+#define MAXFLOAT	((float)3.40282346638528860e+38)
+#define MINDOUBLE	4.94065645841246544e-324
+#define MINFLOAT	((float)1.40129846432481707e-45)
+#define	_IEEE		1
+#define _DEXPLEN	11
+#define _HIDDENBIT	1
+#define DMINEXP	(-(DMAXEXP + DSIGNIF - _HIDDENBIT - 3))
+#define FMINEXP	(-(FMAXEXP + FSIGNIF - _HIDDENBIT - 3))
+#define LN_MAXFLOAT	8.8722839052068e+01
+#define LN_MINFLOAT	-1.03278929903432e+02
+#endif
+#if pdp11 || vax
+#define MAXDOUBLE	1.701411834604692293e+38
+#define MAXFLOAT	((float)1.701411733192644299e+38)
+/* The following is kludged because the PDP-11 compilers botch the simple form.
+   The kludge causes the constant to be computed at run-time on the PDP-11,
+   even though it is still "folded" at compile-time on the VAX. */
+#define MINDOUBLE	(0.01 * 2.938735877055718770e-37)
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	1
+#define DMINEXP	(-DMAXEXP)
+#define FMINEXP	(-FMAXEXP)
+#endif
+#if gcos
+#define MAXDOUBLE	1.7014118346046923171e+38
+#define MAXFLOAT	((float)1.7014118219281863150e+38)
+#define MINDOUBLE	2.9387358770557187699e-39
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	0
+#define DMINEXP	(-(DMAXEXP + 1))
+#define FMINEXP	(-(FMAXEXP + 1))
+#endif
+#if u370
+#define _LENBASE	4
+#else
+#define _LENBASE	1
+#endif
+#define _EXPBASE	(1 << _LENBASE)
+#define _FEXPLEN	8
+#define DSIGNIF	(BITS(double) - _DEXPLEN + _HIDDENBIT - 1)
+#define FSIGNIF	(BITS(float)  - _FEXPLEN + _HIDDENBIT - 1)
+#define DMAXPOWTWO	((double)(1L << BITS(long) - 2) * \
+				(1L << DSIGNIF - BITS(long) + 1))
+#define FMAXPOWTWO	((float)(1L << FSIGNIF - 1))
+#define DMAXEXP	((1 << _DEXPLEN - 1) - 1 + _IEEE)
+#define FMAXEXP	((1 << _FEXPLEN - 1) - 1 + _IEEE)
+#define LN_MAXDOUBLE	(M_LN2 * DMAXEXP)
+#define LN_MINDOUBLE	(M_LN2 * (DMINEXP - 1))
+
+#define H_PREC	(DSIGNIF % 2 ? (1L << DSIGNIF/2) * M_SQRT2 : 1L << DSIGNIF/2)
+#define X_EPS	(1.0/H_PREC)
+#define X_PLOSS	((double)(long)(M_PI * H_PREC))
+#define X_TLOSS	(M_PI * DMAXPOWTWO)
+#define M_LN2	0.69314718055994530942
+#define M_PI	3.14159265358979323846
+#define M_SQRT2	1.41421356237309504880
+#define MAXBEXP	DMAXEXP /* for backward compatibility */
+#define MINBEXP	DMINEXP /* for backward compatibility */
+#define MAXPOWTWO	DMAXPOWTWO /* for backward compatibility */
+
+#endif /*!_values_h*/
diff --git a/src/scf/README b/src/scf/README
new file mode 100644
index 0000000..92c369c
--- /dev/null
+++ b/src/scf/README
@@ -0,0 +1,29 @@
+Creating SCF files from ABI 373A and Pharmacia A.L.F. trace files
+-----------------------------------------------------------------
+
+SCF files can be created from existing trace files using the program
+makeSCF. Typically, SCF files take up a mere 30% of the space of ABI
+373A traces.
+
+Usage:
+
+    makeSCF {-ABI | -ALF} tracefile -output scffile
+
+Where:
+    tracefile is the file you wish to convert
+and
+    scffile is the name of the SCF file that will be created.
+
+
+Example:
+
+To convert an ABI 373A trace:
+    makeSCF -ABI trace.abi -output trace.scf
+
+To convert a Pharmacia A.L.F. trace:
+    makeSCF -ALF trace.alf -output trace.scf
+
+
+NOTE:
+The Pharmacia A.L.F. traces referred to here are those created by
+the program alfsplit.
diff --git a/src/scf/SCF-SPEC.V2.0 b/src/scf/SCF-SPEC.V2.0
new file mode 100644
index 0000000..64f2efb
--- /dev/null
+++ b/src/scf/SCF-SPEC.V2.0
@@ -0,0 +1,218 @@
+	  A Proposed Standard File Format for Chromatograms
+		       (Version 2, Revision 0)
+				   
+			      Simon Dear
+			(sd@mrc-lmb.cam.ac.uk)
+		       Medical Research Council
+		   Laboratory of Molecular Biology
+			   18 November 1992
+
+
+
+
+0. Introduction
+
+We seek to establish a standard chromatogram file format that will be
+supported by all sequencing machine manufacturers and software
+developers.  Our software supports data in this format. We will
+encourage other software developers to do the same, and equipment
+manufacturers to provide either files in the format, or conversion
+software to produce them.
+
+
+1. The Header Record
+
+The file begins with a 128 byte header record that describes the
+location and size of the chromatogram data in the file. Nothing is
+implied about the order in which the components (samples, sequence and
+comments) appear. The version field is a character string
+respresenting the version and revision of the SCF format. The current
+value of this field is "2.00"
+
+/*
+** Type definition for the Header structure
+*/
+#define SCF_MAGIC (((((long)'.'<<8)+(long)'s'<<8)+(long)'c'<<8)+(long)'f')
+typedef struct {
+	long magic_number;
+	long sample;             /* Number of elements in Samples matrix */
+	long samples_offset;     /* Byte offset from start of file */
+	long bases;              /* Number of bases in Bases matrix */
+	long bases_left_clip;    /* Number of bases in left clip (vector)*/
+	long bases_right_clip;   /* Number of bases in right clip (unreliable) */
+	long bases_offset;       /* Byte offset from start of file */
+	long comments_size;      /* Number of bytes in Comment section */
+	long comments_offset;    /* Byte offset from start of file */
+	char version[4];	 /* "version.revision" */
+	long sample_size;	 /* Size of samples in bytes 1=8bits, 2=16bits*/
+	long code_set;	 	 /* code set used */
+	long spare[20];          /* Unused */
+} Header;
+
+NOTE: Features new to version 2
+
+For versions of SCF files 2.0 or greater (Header.version >= "2.00"),
+the version number, precision of data, the uncertain code set are
+specified in the header.  Otherwise, the precision is assumed to be 1
+byte, and the code set to be the default code set.
+
+The following uncertainty code sets are recognised. People wishing to
+use others with the SCF format should register them with the author of
+this document.
+
+	0	{A,C,G,T,-} (default)
+	1	Staden
+	2	IUPAC (NC-IUB)
+	3	Pharmacia A.L.F. (NC-IUB)
+	4	{A,C,G,T,N} (ABI 373A)
+	5	IBI/Pustell
+	6	DNA*
+	7	DNASIS
+	8	IG/PC-Gene
+	9	MicroGenie
+
+
+2. The Sample Points.
+
+The trace information is stored at byte offset Header.samples_offset
+from the start of the file. For each sample point there are values for
+each of the four bases. Header.sample_size holds the precision of the
+sample values. The precision must be one of "1" (unsigned byte) and
+"2" (unsigned short). The sample points need not be normalised to any
+particular value, though it is assumed that they represent positive
+values. This is, they are of unsigned type.
+
+/*
+** Type definition for the Sample data
+*/
+
+typedef struct {
+	unsigned char sample_A;           /* Sample for A trace */
+	unsigned char sample_C;           /* Sample for C trace */
+	unsigned char sample_G;           /* Sample for G trace */
+	unsigned char sample_T;           /* Sample for T trace */
+} Samples1;
+typedef struct {
+	unsigned short sample_A;           /* Sample for A trace */
+	unsigned short sample_C;           /* Sample for C trace */
+	unsigned short sample_G;           /* Sample for G trace */
+	unsigned short sample_T;           /* Sample for T trace */
+} Samples2;
+
+NOTE: Features new to version 2
+The samples are no longer restricted to 8 bit values.
+
+
+3. Sequence Information.
+
+Information relating to the base interpretation of the trace is stored
+at byte offset Header.bases_offset from the start of the file. Stored
+for each base are: its character representation and a number (an index
+into the Samples data structure) indicating its position within the
+trace. The relative probabilities of each of the 4 bases occuring at
+the point where the base is called can be stored in prob_A, prob_C,
+prob_G and prob_T.
+
+The Amersham FilmReader uses Bases.spare[0] to store the confidence
+level of the base. The value ranges from 0 (low confidence) to 8
+(high), and 9 indicating the base has been manually edited.
+
+/*
+** Type definition for the sequence data
+*/
+typedef unsigned char byte;
+typedef struct {
+	long peak_index;        /* Index into Samples matrix for base position */
+	byte prob_A;            /* Probability of it being an A */
+	byte prob_C;            /* Probability of it being an C */
+	byte prob_G;            /* Probability of it being an G */
+	byte prob_T;            /* Probability of it being an T */
+	char base;              /* Called base character        */
+	byte spare[3];          /* Spare */
+} Base;
+
+
+
+4. Comments.
+
+Comments are stored at offset Header.comments_offset from the start of
+the file. Lines in this section are of the format:
+
+	<Field-ID>=<Value>
+
+<Field-ID> can be any string, though several have special meaning and
+their use is encouraged.
+
+ID	Field					Example
+----    ------------------------                ----------------
+MACH	Sequencing machine model                MACH=Pharmacia A.L.F.
+TPSW	Trace processing software version       TPSW=A.L.F. Analysis Program, Version=1.67
+BCSW	Base calling software version           BCSW=A.L.F. Analysis Program, Version=1.67
+DATF	Data source format                      DATF=AM_Version=2.0
+DATN	Data source name			DATN=a10c.alf
+CONV	Format conversion software              CONV=makeSCF v2.0
+
+Other fields might include:
+
+ID	Field					Example
+----    ------------------------                ----------------
+OPER	Operator				OPER=sd
+STRT	Time run started			STRT=Aug 05 1991  12:25:01
+STOP	Time run stopped			STOP=Aug 05 1991  16:26:25
+PROC	Time processed				PROC=Aug 05 1991  18:50:13
+EDIT    Time edited                             EDIT=Aug 05 1991  19:06:18
+NAME	Sample name				NAME=a21b1.s1
+SIGN	Average signal strength			SIGN=A=56,C=66,G=13,T=18
+SPAC	Average base spacing			SPAC=12.04
+SCAL	Factor used in scaling traces           SCAL=0.5
+
+
+/*
+** Type definition for the comments
+*/
+typedef char Comments;            /* Zero terminated list of \n separated entries */
+
+Appendix: Byte ordering and integer representation
+
+"Forward byte and reverse bit" ordering will be used for all integer
+values. This is the same as used in the MC680x0 and SPARC processors,
+but the reverse of the byte ordering used on the VAX and Intel 80x86
+processors.
+
+
+	 Off+0   Off+1  
+       +-------+-------+  
+Short  |  MSB  |  LSB  |  
+       +-------+-------+  
+
+	 Off+0   Off+1   Off+2   Off+3
+       +-------+-------+-------+-------+
+Long   |  MSB  |  ...  |  ...  |  LSB  | 
+       +-------+-------+-------+-------+
+
+
+
+To read integers on systems with any byte order use something like this:
+
+short read_short(FILE *fp)
+{
+    unsigned char buf[sizeof(short)];
+
+    fread(buf, sizeof(buf), 1, fp);
+    return (short)
+        (((unsigned short)buf[1]) +
+         ((unsigned short)buf[0]<<8));
+}
+
+long read_long(FILE *fp)
+{
+    unsigned char buf[sizeof(long)];
+
+    fread(buf, sizeof(buf), 1, fp);
+    return (long)
+        (((unsigned long)buf[3]) +
+         ((unsigned long)buf[2]<<8) +
+         ((unsigned long)buf[1]<<16) +
+         ((unsigned long)buf[0]<<24));
+}
+
diff --git a/src/scf/makeSCF.c b/src/scf/makeSCF.c
new file mode 100644
index 0000000..b9441f6
--- /dev/null
+++ b/src/scf/makeSCF.c
@@ -0,0 +1,157 @@
+/*
+    Title: 	 makeSCF
+
+    File: 	 makeSCF.c
+    Purpose:	 Main module
+    Last update: Thursday 11 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "seqIOABI.h"
+#include "seqIOALF.h"
+#include "seqIOSCF.h"
+#include "seqIOPlain.h"
+#include "mystdlib.h"  /* IMPORT: exit */
+#include "writeSCF.h"
+
+
+/* ---- Static variables ---- */
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+#define trace_unknown 0
+#define trace_SCF     1
+#define trace_ALF     2
+#define trace_ABI     3
+#define trace_Plain   4
+
+typedef struct
+{
+    char * fn;
+    int type;
+    char * output;
+} Arguments;
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage()
+{
+
+    fprintf(stderr,"usage: makeSCF [(-ABI | -ALF) {tracefilename} -output {outputfilename}\n");
+
+}
+
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{
+    Arguments args;
+    Seq currSeq;
+    int in, out;
+
+    /* parse command line arguments here */
+
+    args.fn = NULL;
+    args.type = trace_unknown;
+    args.output = NULL;
+    in = out = 0;
+
+    for(argc--,argv++; argc>0 ; argc--,argv++) {
+
+	if (strcmp(*argv,"-ABI") == 0) {
+	    args.type = trace_ABI;
+	    args.fn = *++argv; argc--;
+	    in++;
+	} else if (strcmp(*argv,"-ALF") == 0) {
+	    args.type = trace_ALF;
+	    args.fn = *++argv;argc--;
+	    in++;
+	} else if (strcmp(*argv,"-SCF") == 0) {
+	    args.type = trace_SCF;
+	    args.fn = *++argv;argc--;
+	    in++;
+	} else if (strcmp(*argv,"-output") == 0) {
+	    args.output = *++argv; argc--;
+	    out++;
+	} else {
+            usage();
+	    fprintf(stderr,"switch not recognised\n");
+	    exit(1);
+	}
+
+    }
+
+
+
+
+    /*
+        Die if bad options given.
+    */
+    if (in!=1 || out!=1 || args.fn==NULL ) {
+	usage();
+	fprintf(stderr,"illegal argument combination\n");
+	exit(1);
+    }
+
+    if ( args.output==NULL ) {
+	usage();
+	fprintf(stderr,"no output specified\n");
+	exit(1);
+    }
+
+
+    switch(args.type) {
+    case trace_ABI:
+	currSeq = readSeqABI(args.fn); break;
+    case trace_ALF:
+	currSeq = readSeqALF(args.fn); break;
+    case trace_SCF:
+	currSeq = readSeqSCF(args.fn); break;
+    default:
+	usage();
+	fprintf(stderr,"Unknown trace file format\n");
+	exit(1);
+    }
+
+    if (currSeq == NULLSeq) {
+	fprintf(stderr,"Error reading file %s\n",args.fn);
+	exit(1);
+    }
+
+    if ( ! writeSeqSCF(currSeq, args.output) ) {
+	fprintf(stderr,"Error writing file %s\n",args.output);
+	exit(1);
+    }
+
+    exit (0);
+
+}
+
diff --git a/src/scf/makefile-alpha b/src/scf/makefile-alpha
new file mode 100644
index 0000000..331c209
--- /dev/null
+++ b/src/scf/makefile-alpha
@@ -0,0 +1,67 @@
+#
+# getMCH - Alpha version
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = makeSCF
+
+#
+# SWITCHES - see ted makefile
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+
+#
+# alpha
+#
+CC= cc
+CFLAGS= -g -std1 -ieee_with_inexact $(SWITCHES) -I$(TEDSRC)
+LD= cc
+LDFLAGS= -g -non_shared
+LIBFLAGS =
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Object files
+#
+OBJS=\
+	makeSCF.o\
+	writeSCF.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+makeSCF: $(OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Special dependencies
+#
+all: $(PROGS) 
+
+clean:
+	/bin/rm -f $(OBJS) $(PROGS) *~
diff --git a/src/scf/makefile-dec b/src/scf/makefile-dec
new file mode 100644
index 0000000..86d61a4
--- /dev/null
+++ b/src/scf/makefile-dec
@@ -0,0 +1,64 @@
+#
+# getMCH - DEC Ultrix (mips) fork
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = makeSCF
+
+#
+# SWITCHES - see ted makefile
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+INCLUDES = -I/usr/include/mit -I$(TEDSRC)
+#
+# DEC
+#
+CC= c89
+CFLAGS= -g -std -w0 $(SWITCHES) $(INCLUDES)
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Object files
+#
+OBJS=\
+	makeSCF.o\
+	writeSCF.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+makeSCF: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS)
+
+#
+# Special dependencies
+#
+all: $(PROGS) 
+
+clean:
+	/bin/rm -f $(OBJS) $(PROGS) *~
diff --git a/src/scf/makefile-sgi b/src/scf/makefile-sgi
new file mode 100644
index 0000000..539c9ab
--- /dev/null
+++ b/src/scf/makefile-sgi
@@ -0,0 +1,67 @@
+#
+# getMCH - SGI version
+#
+
+SHELL=/bin/sh
+
+#
+# Programs made by this makefile
+#
+PROGS = makeSCF
+
+#
+# SWITCHES - see ted makefile
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+
+#
+# Sun
+#
+CC= cc
+CFLAGS= -g -ansi -prototypes $(SWITCHES) -I$(TEDSRC)
+LIBFLAGS =
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Object files
+#
+OBJS=\
+	makeSCF.o\
+	writeSCF.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+makeSCF: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Special dependencies
+#
+all: $(PROGS) 
+
+clean:
+	/bin/rm -f $(OBJS) $(PROGS) *~
diff --git a/src/scf/makefile-solaris b/src/scf/makefile-solaris
new file mode 100644
index 0000000..a276972
--- /dev/null
+++ b/src/scf/makefile-solaris
@@ -0,0 +1,68 @@
+#
+# getMCH - SunOS fork
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = makeSCF
+
+#
+# SWITCHES - see ted makefile
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+
+#
+# Sun
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) -I$(TEDSRC) -I/usr/openwin/include
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Object files
+#
+OBJS=\
+	makeSCF.o\
+	writeSCF.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+makeSCF: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Special dependencies
+#
+all: $(PROGS) 
+
+clean:
+	/bin/rm -f $(OBJS) $(PROGS) *~
diff --git a/src/scf/makefile-sun b/src/scf/makefile-sun
new file mode 100644
index 0000000..7d336de
--- /dev/null
+++ b/src/scf/makefile-sun
@@ -0,0 +1,68 @@
+#
+# getMCH - SunOS fork
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = makeSCF
+
+#
+# SWITCHES - see ted makefile
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+
+#
+# Where the wild things are
+#
+SRC = .
+TEDSRC = $(SRC)/../ted
+
+#
+# Sun
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) -I$(TEDSRC)
+# gcc2.1 -static, cc -Bstatic, else nothing
+LIBFLAGS = -static
+
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+%.o : $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Object files
+#
+OBJS=\
+	makeSCF.o\
+	writeSCF.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	fpoint.o\
+	match.o\
+	mach-io.o
+
+
+#
+# Linking
+#
+makeSCF: $(OBJS)
+	$(CC) $(CFLAGS) -o $@ $(OBJS) $(LIBS) $(LIBFLAGS)
+
+#
+# Special dependencies
+#
+all: $(PROGS) 
+
+clean:
+	/bin/rm -f $(OBJS) $(PROGS) *~
diff --git a/src/scf/match.c b/src/scf/match.c
new file mode 100644
index 0000000..6a2534b
--- /dev/null
+++ b/src/scf/match.c
@@ -0,0 +1,62 @@
+int string_match(seq1, n1, seq2, n2, nmiss, indices)
+
+/*
+  This function may be called once from initialDisplayedSeq if
+  the user has input a string they wish to search for in the
+  input sequence -- this subroutine returns *indices, the first
+  position at which the query sequence (app_resources.astring)
+  matched the input file sequence (currSeq). */
+
+/* a modified version of match.c 
+  finds alignments between a search sequence, seq1, and a target sequence, 
+  seq2, with no gaps (except possibly at ends) 
+   and at most nmiss mismatches (relative to seq1),
+   n1 is the length of seq1 where seq1 is assumed to start at indices 0,
+   n2 is the length of seq2 where seq2 is assumed to start at indices 0 */
+      char *seq1, *seq2;
+      int n1, n2, nmiss;
+      int *indices;
+{
+    int i, j, d, istart, iend, i_miss, n_match;
+    int mtable[100][5];  /* 100 would be, now the total number
+			    of matches the program may find between
+			    a given input string and the sequence */
+
+
+/* d = j - i is the "offset" between the two sequences */
+    if (n1 - n2 > nmiss) return (0);
+    n_match = 0;
+    for (d = -nmiss; d <= n2 + nmiss - n1; d++){
+	if (d < 0)  istart = i_miss = -d;
+	else istart = i_miss = 0;
+	if (d > n2 - n1) {
+	    iend = n2 - d;
+	    i_miss += n1 + d - n2;
+	}
+	else iend = n1;
+	for (i = istart, j = d + i; i < iend; i++, j++)
+	    if (seq1[i] != seq2[j] && ++i_miss > nmiss) goto nextd; 
+        mtable[n_match][0] = d + istart; /* indices (assuming they start at
+					    0) of starting nucleotide in the
+					    searched sequence */
+        mtable[n_match][1] = istart; /* nucleotide position in the query
+					sequence where match starts (assuming
+					query index starts with 0) */
+        mtable[n_match][2] = iend - istart; /* number of nucleotides in the
+					       match */
+	if (mtable[n_match][2]==n1) {
+	  indices[n_match] = mtable[n_match][0];
+	}
+	  
+        mtable[n_match][3] = i_miss; /* number of mismatches */
+	n_match++; /* number of matches */
+	if (n_match >= 100) return(n_match);
+
+    nextd:;
+    }
+
+    return (n_match);
+    return(0); /*return 0 if it got to this point; that means
+		 it must have not found an exact match, so 
+		 you want the baseNum to be 0 in that case*/
+}
diff --git a/src/scf/match.h b/src/scf/match.h
new file mode 100644
index 0000000..fc6b802
--- /dev/null
+++ b/src/scf/match.h
@@ -0,0 +1,19 @@
+#ifndef _match_h
+#define _match_h
+
+/* 
+    Title:       match
+
+    File: 	 match.h
+    Purpose:	 match
+    Last update: Thu Jan 03 1991
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+extern int string_match();
+
+#endif  _match_h
diff --git a/src/scf/writeSCF.c b/src/scf/writeSCF.c
new file mode 100644
index 0000000..81d807a
--- /dev/null
+++ b/src/scf/writeSCF.c
@@ -0,0 +1,255 @@
+/* 
+    Title:       writeSCF
+
+    File: 	 writeSCF.c
+    Purpose:	 Output of Standard Chromatogram Format sequences
+    Last update: 23 November 1992
+
+    Change log:
+    4 Feb 1992, Now draft proposal version 2
+    23 Nov 92,  SCF 2.0 + LaDeana's changes
+           
+*/
+
+
+/*
+** makeSCF version number
+*/
+#define VERSION_NO "2.00"
+
+/* ---- Imports ---- */
+
+
+#include "scf.h"      /* IMPORT: scf structures */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			         newSeq, freeSeq */
+
+
+#include "mach-io.h"  /* IMPORT: be_write_int_1, be_write_int_2, be_write_int_4 */
+
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+		                 EOF */
+
+
+
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+
+
+static int write_scf_header(FILE *fp, Header *h)
+{
+    int i;
+
+    if (be_write_int_4(fp,&h->magic_number)==False) return (False);
+    if (be_write_int_4(fp,&h->samples)==False) return (False);
+    if (be_write_int_4(fp,&h->samples_offset)==False) return (False);
+    if (be_write_int_4(fp,&h->bases)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_left_clip)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_right_clip)==False) return (False);
+    if (be_write_int_4(fp,&h->bases_offset)==False) return (False);
+    if (be_write_int_4(fp,&h->comments_size)==False) return (False);
+    if (be_write_int_4(fp,&h->comments_offset)==False) return (False);
+    if (fwrite(h->version,sizeof(h->version),1,fp)!=1) return (False);
+    if (be_write_int_4(fp,&h->sample_size)==False) return (False);
+    if (be_write_int_4(fp,&h->code_set)==False) return (False);
+    for (i=0;i<20;i++)
+	if (be_write_int_4(fp,&h->spare[i])==False) return (False);
+
+    return (True);
+}
+
+
+static int write_scf_sample1(FILE *fp, Samples1 *s)
+{
+    if (be_write_int_1(fp,&s->sample_A)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_C)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_G)==False) return (False);
+    if (be_write_int_1(fp,&s->sample_T)==False) return (False);
+
+    return (True);
+}
+
+
+static int write_scf_sample2(FILE *fp, Samples2 *s)
+{
+    if (be_write_int_2(fp,&s->sample_A)==False) return (False);
+    if (be_write_int_2(fp,&s->sample_C)==False) return (False);
+    if (be_write_int_2(fp,&s->sample_G)==False) return (False);
+    if (be_write_int_2(fp,&s->sample_T)==False) return (False);
+
+    return (True);
+}
+
+
+
+
+
+static int write_scf_base(FILE *fp, Bases *b)
+{
+    if (be_write_int_4(fp,&b->peak_index)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_A)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_C)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_G)==False) return (False);
+    if (be_write_int_1(fp,&b->prob_T)==False) return (False);
+    if (be_write_int_1(fp,(uint_1 *)&b->base)==False) return (False);
+    if (be_write_int_1(fp,&b->spare[0])==False) return (False);
+    if (be_write_int_1(fp,&b->spare[1])==False) return (False);
+    if (be_write_int_1(fp,&b->spare[2])==False) return (False);
+
+    return (True);
+}
+
+
+
+static int write_scf_comment(FILE *fp, Comments *c, size_t l)
+{
+    if (fwrite(c, l, 1, fp) !=1) return (False);
+    return (True);
+
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+
+
+
+
+Boolean writeSeqSCF(Seq seq, char *fn)
+/*
+** Write Seq out as a .scf file
+*/
+{
+
+    FILE *fp;
+    Header header;
+    Bases base;
+    Comments comments[1024];
+    Comments default_comments[] = "conversion_program = makeSCF " VERSION_NO "\n";
+    char *src;
+
+
+    if ((fp = fopen(fn,"wb"))==NULL) 
+	return (False);
+    else {
+	int i;
+	int prec; /* precision to use */
+
+	/* source dependant switches */
+	prec = 1;
+	switch (seq->format) {
+	case ABIFormat:
+	    src = "ABI 373A";
+	    break;
+	case ALFFormat:
+	    src = "Pharmacia A.L.F.";
+	    prec = 2;
+	    break;
+	case SCFFormat:
+	    src = "SCF";
+	    if (seq->maxTraceVal > 255) prec=2; /* retain precision */
+	    break;
+	default:
+	    src = "Unknown";
+	    break;
+	}
+
+	if (seq->info==NULL) {
+	    sprintf(comments,"%ssource = %s\n",
+		    default_comments,
+		    src);
+	} else {
+	    sprintf(comments,"%s\n%ssource = %s\n",
+		    seq->info,
+		    default_comments,
+		    src);
+	}
+	
+	header.magic_number = SCF_MAGIC;
+	header.samples = seq->NPoints;
+	header.samples_offset = (uint_4)sizeof(Header);
+	header.bases = seq->NorigBases;
+	header.bases_left_clip = seq->leftCutoff;
+	header.bases_right_clip = seq->rightCutoff;
+	header.bases_offset = (uint_4)(header.samples_offset + header.samples * ((prec==2)?sizeof(Samples2):sizeof(Samples1)));
+	header.comments_size = (uint_4)strlen(comments)+1;
+	header.comments_offset = (uint_4)(header.bases_offset + header.bases * sizeof(Bases));
+	strncpy(header.version, VERSION_NO,4);
+	header.sample_size = prec;
+	header.code_set = CSET_DEFAULT;
+
+	for(i=0;i<20;i++) header.spare[i]=0;
+	if (write_scf_header(fp, &header)==False) return (False);
+
+
+	switch(prec) {
+	case 2:
+	    {
+		Samples2 sample;
+		
+		for(i=0; i<header.samples; i++) {
+		    sample.sample_A = seq->traceA[i];
+		    sample.sample_C = seq->traceC[i];
+		    sample.sample_G = seq->traceG[i];
+		    sample.sample_T = seq->traceT[i];
+		    if (write_scf_sample2(fp, &sample)==False) return(False);
+		}
+		break;
+	    }
+	default:
+	    {
+		Samples1 sample;
+		
+		for(i=0; i<header.samples; i++) {
+#define scale(P,M) (  (byte) ((float)(P)*255.0/(float)(M))   )
+		    sample.sample_A = scale(seq->traceA[i],seq->maxTraceVal);
+		    sample.sample_C = scale(seq->traceC[i],seq->maxTraceVal);
+		    sample.sample_G = scale(seq->traceG[i],seq->maxTraceVal);
+		    sample.sample_T = scale(seq->traceT[i],seq->maxTraceVal);
+		    if (write_scf_sample1(fp, &sample)==False) return(False);
+		}
+		break;
+	    }
+	}
+
+
+	for(i=0; i<header.bases; i++) {
+	    base.peak_index = seq->basePos[i];
+	    base.base = seq->base[i];
+	    base.spare[0] = base.spare[1] = base.spare[2] = 0;
+	    base.prob_A = base.prob_C = base.prob_G = base.prob_T = 0;
+	    switch(base.base) {
+	    case 'A' : case 'a':
+		base.prob_A = 1; break;
+	    case 'C' : case 'c':
+		base.prob_C = 1; break;
+	    case 'G' : case 'g':
+		base.prob_G = 1; break;
+	    case 'T' : case 't':
+		base.prob_A = 1; break;
+	    default:
+		base.prob_A = base.prob_C = base.prob_G = base.prob_T = 1;
+	    }
+	    if (write_scf_base(fp,&base)==False) return (False);
+
+	}
+
+	if (write_scf_comment(fp,comments,(size_t)header.comments_size)==False) return (False);
+
+
+	fclose(fp);
+    }
+    
+    return (True);
+}
diff --git a/src/scf/writeSCF.h b/src/scf/writeSCF.h
new file mode 100644
index 0000000..5c09613
--- /dev/null
+++ b/src/scf/writeSCF.h
@@ -0,0 +1,10 @@
+#ifndef _writeSCF_h
+#define _writeSCF_h
+
+#include "scf.h"      /* IMPORT: scf structures */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			         newSeq, freeSeq */
+
+extern Boolean writeSeqSCF(Seq seq, char *fn);
+
+#endif /* _writeSCF_h */
diff --git a/src/scripts/FE-stealdata b/src/scripts/FE-stealdata
new file mode 100644
index 0000000..08397bc
--- /dev/null
+++ b/src/scripts/FE-stealdata
@@ -0,0 +1,51 @@
+#! /bin/csh -f
+#
+# stealdata (front end)
+#
+#    Borrow data and files from a neighbouring cosmid database
+#
+#
+# Change log:
+#
+#   v1.8 Front and back end version
+#
+
+onintr abort
+
+#
+# set defaults here
+#
+# NOTE: "other" refers to the project you want to pinch data off
+# NOTE: db_type and other_db_type must be either "dap" or "bap"
+#
+set project = F54G8
+set version = 0
+set db_type = dap
+set other_dir = ~kt/ZC84
+set other_project = ZC84
+set other_version = 0
+set other_db_type = bap
+
+#
+# available operations are:
+#     r = report useful readings (response mailed)
+#     s = "r" + copy useful sequences only
+#     a = "r" + copy useful sequences AND traces
+#     l = "r" + copy useful sequences and LINK traces
+# default = "r"
+#
+set operation = "s"
+
+
+
+#****************************************************
+# MAKE NO CHANGES BELOW THIS LINE
+#****************************************************
+FE-stealdata.csh $project $version $db_type $other_project $other_dir $other_version $other_db_type $operation
+if ( $status ) then
+abort:
+	echo stealdata abnormally terminated
+	exit 1
+endif
+
+exit 0
diff --git a/src/scripts/FE-stealdata.csh b/src/scripts/FE-stealdata.csh
new file mode 100644
index 0000000..5115d06
--- /dev/null
+++ b/src/scripts/FE-stealdata.csh
@@ -0,0 +1,380 @@
+#! /bin/csh -f
+#
+# stealdata.csh
+#
+#    Borrow data and files from a neighbouring cosmid database
+#
+#
+# Change log:
+#
+#   v1.2 Handle dap and bap databases
+#   v1.3 Provide linking option
+#   v1.4 Under no circumstances link sequence files
+#   v1.5 Bug in option "l". Now works when databases have same name
+#   v1.6 To work with new version of bap
+#   v1.7 Variable report now set correctly
+#   v1.8 Front and back end version
+#
+
+onintr abort
+
+if ( $#argv != 8 ) then
+	echo Usage: stealdata.csh ... (8 args)
+	exit 1
+endif
+
+set project       = $1
+set version       = $2
+set db_type       = $3
+set other_dir     = $4
+set other_project = $5
+set other_version = $6
+set other_db_type = $7
+set operation     = $8
+
+
+#
+# other - DON'T TOUCH
+#
+set trace_suffixes = ALF,RES,SCF
+
+#
+# check defaults
+#
+if (! -e ${project}.SQ${version}) then
+	echo Project ${project} version ${version} does not exist
+	exit 1
+endif
+
+if (! -d ${other_dir}) then
+	echo Directory ${other_dir} does not exist
+	exit 1
+endif
+
+if (! -e ${other_dir}/${other_project}.SQ${other_version}) then
+	echo Project ${other_dir}/${other_project} version ${other_version} does not exist
+	exit 1
+endif
+
+#
+# Report file
+#
+set report = $cwd/_steal.rep.${project}
+set count = 0
+while ( -e $report )
+	# echo Report file ${report} already exists...
+	@ count ++
+	set report = $cwd/_steal.rep.${project}.${count}
+end
+
+touch ${report}
+
+#
+# Output details of stealdata run
+#
+cat << EOF | tee -a ${report}
+Stealdata v1.6
+Borrow data from neighbouring cosmid database
+
+project = ${project}
+version = ${version}
+db_type = ${db_type}
+other_dir = ${other_dir}
+other_project = ${other_project}
+other_version = ${other_version}
+other_db_type = ${other_db_type}
+operation = ${operation}
+EOF
+
+switch ( ${operation} )
+case "r":
+	echo "    ( report only )" | tee -a ${report}
+	breaksw
+case "s":
+	echo "    ( copy sequences only, report )" | tee -a ${report}
+	breaksw
+case "a":
+	echo "    ( copy sequences and traces, report )" | tee -a ${report}
+	breaksw
+case "l":
+	echo "    ( copy sequences and link traces, report )" | tee -a ${report}
+	breaksw
+default:
+	echo "    ( unknown - report only )" | tee -a ${report}
+	set operation = "r"
+	breaksw
+endsw
+echo "" | tee -a ${report}
+
+#
+# create a temporary directory
+#
+echo Creating a temporary directory | tee -a ${report}
+set temp_dir = _steal.dir.${other_project}
+set count = 0
+while (-e ${temp_dir})
+	echo Temporary directory ${temp_dir} already exists | tee -a ${report}
+	@ count ++
+	set temp_dir = _steal.dir.${other_project}.${count}
+end
+
+set home_dir = `pwd`
+mkdir ${temp_dir}
+
+#
+#consensus for database
+#
+set consensus = _steal.seq.${project}
+set count = 0
+while ( -e $consensus )
+	echo Consensus file ${consensus} already exists | tee -a ${report}
+	@ count ++
+	set consensus = _steal.seq.${project}.${count}
+end
+
+#
+# determine consensus
+#
+echo Determining the consensus for project ${project} version ${version} | tee -a ${report}
+#
+# FUDGE!!!
+# The third question in option 8 (Calculate consensus)
+# has changed in the latest version of bap.
+# Was:
+#     ? Make another consensus (y/n) (y) 
+# Now is:
+#     ? Staden format (y/n) (y) =
+#
+# Set fudge to reflect the appropriate choice
+#
+if ( ${db_type} == "bap" ) then
+	set fudge = y
+else
+	set fudge = n
+endif
+${db_type} << EOF > /dev/null
+
+y
+${project}
+${version}
+
+8
+${consensus}
+y
+${fudge}
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to make consensus | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# move consensus to temp_dir
+#
+mv ${consensus} ${temp_dir}
+if ( $status ) then
+	echo stealdata: failed to move consensus to temp_dir | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# copy other database to this directory
+#
+echo Copying project ${other_project} version ${other_version} | tee -a ${report}
+cd ${temp_dir}
+cp -i \
+	${other_dir}/${other_project}.SQ${other_version} \
+	${other_dir}/${other_project}.AR${other_version} \
+	${other_dir}/${other_project}.RL${other_version} \
+	${other_dir}/${other_project}.TG${other_version} \
+	${other_dir}/${other_project}.CC${other_version} \
+	.
+if ( $status ) then
+	echo stealdata: failed to copy other database | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# file of file names in other database
+#
+set other_files = ${other_project}.all
+set other_useless = ${other_project}.useless
+set other_useful = ${other_project}.useful
+set other_new = ${other_project}.fn.$$
+
+#
+# extract gel readings
+#
+set option_extract = 32
+#
+# screen against vector
+#
+set option_screen  = 18
+#
+echo Extracting gel readings from ${other_project} and | tee -a ${report}
+echo "  screening against consensus from ${project}" | tee -a ${report}
+${other_db_type} << EOF > /dev/null
+
+y
+${other_project}
+${other_version}
+
+${option_extract}
+${other_files}
+n
+
+${option_screen}
+y
+${other_files}
+${other_useless}
+${consensus}
+30
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to extact or screen readings | tee -a ${report}
+	goto abort
+endif
+
+#
+# determine useful files in other project
+#
+echo Determining useful gel readings | tee -a ${report}
+sort ${other_files} | tr -d ' ' > ${other_files}.nice
+sort ${other_useless} | tr -d ' ' > ${other_useless}.nice
+comm -23 ${other_files}.nice ${other_useless}.nice > ${other_useful}
+
+#
+# create report header
+#
+echo "" | tee -a ${report}
+echo Useful gel readings: | tee -a ${report}
+
+#
+# determine new files 
+#
+touch ${other_new}
+set useful = 0
+foreach x ( `cat ${other_useful}` )
+	if ( -e ../$x ) then
+		echo $x | tee -a ${report}
+	else
+		echo $x "- NEW" | tee -a ${report}
+		echo $x >> ${other_new}
+		@ useful ++
+	endif
+end
+
+if ( $useful == 0 ) then
+	echo No new gel readings | tee -a ${report}
+	goto done	
+endif
+
+#
+# transfer sequences
+#
+if (${operation} == "s") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence files: " | tee -a ${report}
+	foreach x ( `cat ${other_new}` )
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and traces
+#
+if (${operation} == "a") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+	foreach x ( `cat ${other_new}` )
+		# copy sequence file
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+			if (-e $file) then
+				cp -i $file ..
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and link traces
+#
+if (${operation} == "l") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+	set files = `cat ${other_new}`
+	pushd .. >& /dev/null
+	foreach x ( $files )
+		# copy sequence file
+		cp -i ${other_dir}/${x} .
+		echo ${other_dir}/${x} | tee -a ${report}
+		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+			# link to traces
+			if (-e $file) then
+				ln -s $file
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	popd >& /dev/null
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+done:
+#
+# cleanup
+#
+echo Cleaning up | tee -a ${report}
+cd ..
+/bin/rm -r ${temp_dir}
+
+#
+# Finish up
+#
+echo "" | tee -a ${report}
+echo "Completed" | tee -a ${report}
+
+#
+# mail report
+#
+echo Mailing you a report
+mail `whoami` < ${report}
+
+
+exit 0
+
+
+#
+#
+#
+abort:
+	echo stealdata aborted `date` | tee -a ${report}
+	if ( $?home_dir ) then
+		cd $home_dir
+		echo Cleaning up | tee -a ${report}
+		/bin/rm -r ${temp_dir}
+	endif
+
+	exit 1
diff --git a/src/scripts/assemble b/src/scripts/assemble
new file mode 100644
index 0000000..770c286
--- /dev/null
+++ b/src/scripts/assemble
@@ -0,0 +1,168 @@
+#
+# IMPORTANT NOTICE:
+#     Assemble will only work with bap version 12.1 or greater
+#
+# Script to assemble data into a database using a file of file names.
+# The batch of readings is screened against two vectors. Any matching
+# the vectors have their names added to growing files of vector failures.
+# Those that pass are assembled. Those that fail assembly have their names
+# added to a growing list of failures. Output from 
+# the initial assembly is directed to a file that is deleted, hence
+# nothing appears on the screen. The failed files are then recompared
+# with the output directed to another file. This file is printed then
+# both output files are deleted. The procedure starts by making a copy
+# of the database to copy X (preceded by its deletion!!!!).
+#
+# use: assemble fileoffilenames
+#
+unset noclobber
+onintr bailout
+set PROJECT = F02A9
+set VECTOR_SCORE = 20
+set ASSEMBLY_SCORE = 20
+set MAX_READ_PADS = 25
+set MAX_CONTIG_PADS = 25
+set MAX_MISMATCH_1 = 8
+set MAX_MISMATCH_2 = 15
+if ($#argv != 1) then
+echo "Usage: assemble fileoffilenames"
+	exit 1
+endif
+if  (! -f $1) then
+echo "File of file names not found"
+	exit 2
+endif
+if  (! -f `head -1 $1`) then
+echo "First file in file of file names not found"
+	exit 3
+endif
+echo -n "Assembling data from file of filenames "; echo -n $1
+echo  -n " into data base "; echo $PROJECT
+echo -n "Deleting database backup: "; echo -n $PROJECT
+echo " version X"
+\rm -f $PROJECT.RLX
+\rm -f $PROJECT.SQX
+\rm -f $PROJECT.ARX
+\rm -f $PROJECT.CCX
+\rm -f $PROJECT.TGX
+bap <<endofinput
+3
+y
+$PROJECT
+0
+
+7
+1
+out$$
+
+24
+X
+1000
+
+18
+y
+$1
+passm13$$
+M13MP18_VECTOR
+$VECTOR_SCORE
+
+18
+y
+passm13$$
+passblue$$
+BLUE_VECTOR
+$VECTOR_SCORE
+
+20
+Y
+Y
+Y
+passblue$$
+fail$$
+1
+Y
+$ASSEMBLY_SCORE
+3
+$MAX_READ_PADS
+$MAX_CONTIG_PADS
+$MAX_MISMATCH_1
+
+7
+y
+
+7
+1
+show$$
+
+20
+y
+n
+y
+fail$$
+fail2$$
+1
+y
+$ASSEMBLY_SCORE
+3
+$MAX_READ_PADS
+$MAX_CONTIG_PADS
+$MAX_MISMATCH_2
+
+25
+!
+
+7
+y
+
+!
+endofinput
+cat fail2$$ >> $PROJECT.FAILS
+echo -n "Assembly for project ">>show1$$
+echo -n  $PROJECT >>show1$$
+echo -n " on " >>show1$$
+date >> show1$$
+echo -n "In directory " >> show1$$
+pwd >> show1$$
+echo -n "Input file of file names ">>show1$$
+echo $1 >> show1$$
+echo "Parameters used" >>show1$$
+echo -n "Minimum initial match  " >>show1$$
+echo $ASSEMBLY_SCORE >> show1$$
+echo -n "Maximum pads in each reading  " >>show1$$
+echo $MAX_READ_PADS >> show1$$
+echo -n "Maximum pads in contig  " >>show1$$
+echo $MAX_CONTIG_PADS >> show1$$
+echo -n "Maximum percent mismatch (for assembly)  " >>show1$$
+echo $MAX_MISMATCH_1 >> show1$$
+echo -n "Maximum percent mismatch (for report)  " >>show1$$
+echo $MAX_MISMATCH_2 >> show1$$
+echo "List of m13mp18 failures">>show1$$
+cat $1 | tr -d " " | sort >! sortin$$
+cat passm13$$ | tr -d " " | sort >! sortm13$$
+comm -23 sortin$$ sortm13$$ >> show1$$
+echo "List of bluescribe failures">>show1$$
+#assuming already have sortm13$$
+cat passblue$$ | tr -d " " | sort >! sortblue$$
+comm -23 sortm13$$ sortblue$$ >> show1$$
+comm -23 sortin$$ sortm13$$ >> $PROJECT.M13
+comm -23 sortm13$$ sortblue$$ >> $PROJECT.BLUE
+echo "List of entry failures ">>show1$$
+cat fail2$$ >> show1$$
+#cp fail$$ show1$$
+echo "Alignments for entry failures ">>show1$$
+cat show$$ >> show1$$
+lpr show1$$
+bailout:
+\rm fail$$
+\rm fail2$$
+\rm -f out$$
+\rm -f show$$
+\rm -f show1$$
+\rm -f passm13$$
+\rm -f passblue$$
+\rm -f sortin$$
+\rm -f sortm13$$
+\rm -f sortblue$$
+#cat $PROJECT.FAILS
+;
+
diff --git a/src/scripts/clip-seq.awk b/src/scripts/clip-seq.awk
new file mode 100644
index 0000000..c938a58
--- /dev/null
+++ b/src/scripts/clip-seq.awk
@@ -0,0 +1,173 @@
+#
+# File: clip-seq.awk
+# Version: 1.0
+# Created: 17 November 1992
+#
+# Author: Simon Dear
+#         MRC Laboratory of Molecular Biology
+#	  Hills Road
+#	  Cambridge CB2 2QH
+#	  United Kingdom
+#
+# Description:
+#
+# Clip poor quality sequence at the left (5') and right (3') ends
+# on basis of sequence content. Program acts as a filter for xdap
+# sequence files.
+#
+# Rule:
+#
+# At the 3' end: A window of (MM) bases is slid down the sequence,
+# starting from base position (SR) until there are (NN) Ns within the
+# window, or until the window reaches position (RR), whichever happens
+# first. The 3' extent of the good data is the set to be (KR) bases
+# upsteam of this position.
+# 
+# At the 5' end: This essentially the same as for the 3' end. The window
+# is slid back along the sequence starting at base position (SL), until
+# there are sufficient Ns or until the window reaches position (LL),
+# whichever happens first. There is no further adjustment of the 5'
+# extent of good data.
+#
+# St.Louis use:  NN=2, MM=5, SL=1, LL=1, SR=200, RR=sequence_length, KR=0
+# Cambridge use: NN=2, MM=5, SL=100, LL=1, SR=200, RR=450, KR=50
+#
+
+BEGIN {
+	# Up to where NN uncertainties in window of MM bases
+	NN = 2;
+	MM = 5;
+	SL = 100
+	SR = 200
+	LL = 1;
+	RR = 450;
+	KR = 50;
+	# don't change
+	QL = -1;
+	QR = -1;
+}
+
+
+/;/ && NR==1 { header = $0; next; } #first line is a header
+/;</ { lseq = lseq substr($0,3); next; }
+/;>/ { rseq = rseq substr($0,3); next; }
+/;/ { if (extras=="") extras = $0; else extras = extras "\n" $0; next; } # gather extras
+     { mseq = mseq $0; next; }
+
+END {
+	seq = lseq mseq rseq;
+	# fudge QL and QR
+	QL = length(lseq);
+	QR = length(lseq) + length(mseq) + 1;
+
+
+	seqlen = length(seq);
+	if (RR > seqlen) RR = seqlen;
+	if (LL < 1) LL = 1
+	lcut = LL - 1;
+	rcut = RR+1;
+	if ( NN>0 ) {
+		#
+		# clip to the right
+		#
+		score = 0;
+		# calculate default score
+		for(j=SR-1;j<SR+MM-1 && j<=seqlen ; j++)
+			if(j>0 && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SR; j<=RR; j++) {
+			if (j>1 && index("ACGTacgt",substr(seq,j-1,1)) == 0)
+				score--;
+			if (j+MM-1<=seqlen && index("ACGTacgt",substr(seq,j+MM-1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				rcut = j;
+				break;
+			}
+		}
+
+		#
+		# clip to the left
+		#
+		score = 0;
+		# calculate default score
+		for(j=SL+1;j>SL-MM+1 && j>0 ; j--)
+			if(j<=seqlen && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SL; j>=LL; j--) {
+			if (j<seqlen && index("ACGTacgt",substr(seq,j+1,1)) == 0)
+				score--;
+			if (j-MM+1>0 && index("ACGTacgt",substr(seq,j-MM+1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				lcut = j;
+				break;
+			}
+		}
+	}
+
+	# knock back the 3' end
+	rcut = rcut - KR;
+	if (rcut <= lcut) rcut = lcut + 1;
+
+	# Determine old values
+	if (QR<0) QR = seqlen+1;
+	if (QL<0) QL = 0;
+	# Constrain new values to old values
+	if (lcut >= QR) lcut = QR - 1;
+	if (rcut <= QL) rcut = QL + 1;
+	# Adjust if there are changes
+	if (lcut < QL) lcut = QL;
+	if (rcut > QR) rcut = QR;
+
+	#
+	# Print out the filtered file
+	#
+
+	# Write dap file header
+	if (length(header)==0) {
+		printf(";%6d%6d%6d%-4s%-18s\n",seqlen,lcut,rcut-lcut-1,"PLN","");
+	} else {
+		printf(";%6d%6d%6d%s\n",seqlen,lcut,rcut-lcut-1,substr(header,20));
+	}
+
+
+
+	# Write left cutoffs (if any)
+	if(lcut>0) {
+		str = substr(seq,1,lcut);
+		strlen = lcut;
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";<%s\n",substr(str,i,bits));
+		}
+	}
+
+	# Write right cutoffs (if any)
+	if(rcut<=seqlen) {
+		str = substr(seq,rcut);
+		strlen = length(str);
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";>%s\n",substr(str,i,bits));
+		}
+	}
+
+	# Write extras here
+	if (extras!="") print extras;
+
+
+	# Write remaining sequence (if any!)
+	if (lcut+1 != rcut) {
+		str = substr(seq,lcut+1,rcut-lcut-1);
+		strlen = length(str);
+		bits = 50;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf ("%s\n",substr(str,i,bits));
+		}
+	}
+
+
+}
diff --git a/src/scripts/clip-seqs b/src/scripts/clip-seqs
new file mode 100644
index 0000000..329e10d
--- /dev/null
+++ b/src/scripts/clip-seqs
@@ -0,0 +1,42 @@
+#!/bin/csh -f
+#
+# clip-seqs
+#
+# clip all sequences in a file of file names on quality
+#
+# Usage:
+#   clip-seqs fof
+#
+onintr abort
+
+if ( $#argv != 1 ) then
+	echo Usage: clip-seqs fof
+	exit 2
+endif
+
+if (! -f "$1") then
+	echo clip-seqs: file "$1" does not exist
+	exit 1
+endif
+
+
+foreach file ( `cat "$1"`)
+	/bin/rm -f tmp.$$
+	awk -f clip-seq.awk $file > tmp.$$
+	if ( $status ) then
+		echo clip-seqs: error while clipping $file
+		goto abort
+	endif
+	/bin/mv $file ${file}~
+	/bin/mv tmp.$$ $file
+end
+
+
+
+exit 0
+
+abort:
+
+echo aborted `date`	
+/bin/rm -f tmp.$$
+exit 1
\ No newline at end of file
diff --git a/src/scripts/clipping.doc b/src/scripts/clipping.doc
new file mode 100644
index 0000000..e289b90
--- /dev/null
+++ b/src/scripts/clipping.doc
@@ -0,0 +1,37 @@
+Marking regions of poor quality                  Simon Dear  Nov 17 1992
+
+Regions of poor quality can be excised using the "clip-seqs" script.
+This script takes a file of sequence file names as input, and filters
+each to the awk program clip-seq.awk. The sequence files are assumed
+to be in staden xdap format. For details on this format, see the
+README file in $STADENROOT. The files are modified to reflect the
+removal of poor data, while the original sequence is retained in a "~"
+suffixed file.
+
+Usage:
+	clip-seqs file-of-file-names
+
+
+Quality clipping is based on a simple analysis of the base content. By
+default, it works as follows. The original pre-clipped sequence is
+determined from the sequence file. The extents of good quality
+sequence is determined for both 5' and 3' ends of the sequence. The
+numbers in brackets are set in clip-seq.awk and can be set to suit
+local preferences.
+
+At the 3' end: A window of (5) bases is slid down the sequence,
+starting from base position (200) until there are (2) Ns within the
+window, or until the window reaches position (450), whichever happens
+first. The 3' extent of the good data is the set to be (50) bases
+upsteam of this position.
+
+At the 5' end: This essentially the same as for the 3' end. The window
+is slid back along the sequence starting at base position (100), until
+there are sufficient Ns or until the window reaches position (1),
+whichever happens first. There is no further adjustment of the 5'
+extent of good data.
+
+If there are existing extents in the sequence file, and they are more
+conservative than the ones calculated from the sequence, then they
+will be the extents used.
+	
\ No newline at end of file
diff --git a/src/scripts/dub b/src/scripts/dub
new file mode 100644
index 0000000..3b8cfab
--- /dev/null
+++ b/src/scripts/dub
@@ -0,0 +1,93 @@
+#!/bin/sh
+
+# looks for any block of 10 which has more than 6 bases which have
+# yet to be double stranded
+
+case $# in
+  1) ;;
+  *)echo dub reads an output file created by the Examine quality;
+echo option inside of xdap and outputs a list of which portions of;
+echo the sequence have yet to be double stranded and on which strand;
+echo 'USAGE: dub examine_quality_output_file ' 1>&2; exit 2;;
+esac
+
+
+awk 'BEGIN{start_file=0; start_num=0; prev1=0; prev2=0; prev3=0; prev4=0;
+		totnum12=0; totnum34=0; totnum1=0; totnum2=0; prevprev1=0; prevprev2=0; printf("REGIONS YET TO BE DOUBLE STRANDED FOR: %s\n\n","'$1'");}
+{
+
+# right now Bob does not care about information about type 3 and 4 
+
+
+# look for a number 10 to indicate the beginning of quality information
+	if (NF>0) {
+# 		if column 1 has 10 characters in it then this is
+#		most likely a line with quality information in it
+	  if (length($1)>8) {
+# go through each block of 10 counting the number of problems for
+# each row of quality counts
+	    for (j=1; j<=NF; j++) {
+		prevprev1=prev1; prevprev2=prev2;
+		prev1=num1; prev2=num2; prev3=num3; prev4=num4;
+# num12 and num34 are counts for each block of 10 so reset them
+# each time
+		num1=0;num2=0;num3=0;num4=0;num0=0;
+		for (i=0; i<=length($j); i++) {
+# for codes 1 and 2 you need the other strand
+  		  if (substr($j,i,1)=="1") num1++;
+		  if (substr($j,i,1)=="2") num2++;
+# for codes 3 and 4 you need both strands to resolve the disagreement
+#  		  if (substr($j,i,1)=="3" || substr($j,i,1)=="4") num34++;
+	        }
+#if any block of 10 has more than 6 total non-zeros then it is
+#a problem area
+		if (num1+num2>=6) {
+# only reset start_num if you have not already started a region of
+# problem areas
+   		   end_num=count+10*(j-1);
+	           if (start_num==0) {
+			start_num=count+10*(j-1)-9;
+# if that previous block before a problem area had more then 4 problems then go
+# ahead and move the start point to the beginning of that previous block
+	         	if (prev1+prev2>=4) start_num-=10;
+			if (prevprev1+prevprev2>=4) start_num-=10;
+		   }
+		   else {
+			totnum12+=num1+num2; 
+#			totnum34+=num34;
+			totnum1+=num1;
+			totnum2+=num2;
+		   }
+
+		}
+		else {
+# if the 3s and 4s make up more than half of the problems then tell the
+# user they need to pick up both strands
+
+	if (end_num!=0 && start_num!=0) {
+#use totnum1+totnum0 here becuase if it is mostly 0s and 1s then
+#you want to just see the plus strand  you do not need it to tell
+#you both strands
+#		   if (totnum1_totnum0>=(end_num-start_num)/2) printf("Needs plus strand from:  %10d to %10d\n",start_num,end_num);
+#		   else if (totnum2+totnum0>(end_num-start_num)/2) printf("Needs minus strand from: %10d to %10d\n",start_num,end_num);
+		   if (totnum1>totnum2) printf("Needs plus strand from:  %10d to %10d\n",start_num,end_num);
+		   else if (totnum2>totnum1) printf("Needs minus strand from: %10d to %10d\n",start_num,end_num);
+		else if (totnum12>6)
+		  printf("Needs one strand from:   %10d to %10d\n",start_num,end_num);
+        }
+		start_num=0;
+		end_num=0;
+		totnum0=0;
+		totnum1=0;
+		totnum2=0;
+		totnum12=0;
+		totnum34=0;
+		}
+	     }
+	  }
+	  else 
+	   count=$1;
+# 		count is the sequence indices indicator
+	  }
+}
+END{}' <$1 | sort +1 
diff --git a/src/scripts/stealdata b/src/scripts/stealdata
new file mode 100644
index 0000000..ce9897a
--- /dev/null
+++ b/src/scripts/stealdata
@@ -0,0 +1,415 @@
+#! /bin/csh -f
+#
+# stealdata
+#
+#    Borrow data and files from a neighbouring cosmid database
+#
+#
+# Change log:
+#
+#   v1.2 Handle dap and bap databases
+#   v1.3 Provide linking option
+#   v1.4 Under no circumstances link sequence files
+#   v1.5 Bug in option "l". Now works when databases have same name
+#   v1.6 To work with new version of bap
+#   v1.7 Variable report now set correctly
+#   v1.8 Check for BUSY file
+#   v1.9 There was a bug in the consensus output format fudge
+#   v1.10 There is a new style dialogue for choosing between
+#	 Staden and Fasta format of outout (26-Jan-92)
+#
+
+onintr abort
+
+#
+# set defaults here
+#
+# NOTE: "other" refers to the project you want to pinch data off
+# NOTE: db_type and other_db_type must be either "dap" or "bap"
+#
+set project = F54G8
+set version = 0
+set db_type = dap
+set other_dir = ~kt/ZC84
+set other_project = ZC84
+set other_version = 0
+set other_db_type = bap
+
+#
+# available operations are:
+#     r = report useful readings (response mailed)
+#     s = "r" + copy useful sequences only
+#     a = "r" + copy useful sequences AND traces
+#     l = "r" + copy useful sequences and LINK traces
+# default = "r"
+#
+set operation = "a"
+
+#
+# other - DON'T TOUCH
+#
+set trace_suffixes = ALF,RES,SCF
+
+#
+# check defaults
+#
+if (! -e ${project}.SQ${version}) then
+	echo Project ${project} version ${version} does not exist
+	exit 1
+endif
+
+if (! -d ${other_dir}) then
+	echo Directory ${other_dir} does not exist
+	exit 1
+endif
+
+if (! -e ${other_dir}/${other_project}.SQ${other_version}) then
+	echo Project ${other_dir}/${other_project} version ${other_version} does not exist
+	exit 1
+endif
+
+#
+# Check for BUSY files
+#
+if ( -e ${project}_BUSY${version} ) then
+	echo Project ${project} version ${version} is busy - try again later
+	exit 1
+endif
+
+#
+# Report file
+#
+set report = $cwd/_steal.rep.${project}
+set count = 0
+while ( -e $report )
+	# echo Report file ${report} already exists...
+	@ count ++
+	set report = $cwd/_steal.rep.${project}.${count}
+end
+
+touch ${report}
+
+#
+# Output details of stealdata run
+#
+cat << EOF | tee -a ${report}
+Stealdata v1.10
+Borrow data from neighbouring cosmid database
+
+project = ${project}
+version = ${version}
+db_type = ${db_type}
+other_dir = ${other_dir}
+other_project = ${other_project}
+other_version = ${other_version}
+other_db_type = ${other_db_type}
+operation = ${operation}
+EOF
+
+switch ( ${operation} )
+case "r":
+	echo "    ( report only )" | tee -a ${report}
+	breaksw
+case "s":
+	echo "    ( copy sequences only, report )" | tee -a ${report}
+	breaksw
+case "a":
+	echo "    ( copy sequences and traces, report )" | tee -a ${report}
+	breaksw
+case "l":
+	echo "    ( copy sequences and link traces, report )" | tee -a ${report}
+	breaksw
+default:
+	echo "    ( unknown - report only )" | tee -a ${report}
+	set operation = "r"
+	breaksw
+endsw
+echo "" | tee -a ${report}
+
+#
+# create a temporary directory
+#
+echo Creating a temporary directory | tee -a ${report}
+set temp_dir = _steal.dir.${other_project}
+set count = 0
+while (-e ${temp_dir})
+	echo Temporary directory ${temp_dir} already exists | tee -a ${report}
+	@ count ++
+	set temp_dir = _steal.dir.${other_project}.${count}
+end
+
+set home_dir = `pwd`
+mkdir ${temp_dir}
+
+#
+#consensus for database
+#
+set consensus = _steal.seq.${project}
+set count = 0
+while ( -e $consensus )
+	echo Consensus file ${consensus} already exists | tee -a ${report}
+	@ count ++
+	set consensus = _steal.seq.${project}.${count}
+end
+
+#
+# determine consensus
+#
+echo Determining the consensus for project ${project} version ${version} | tee -a ${report}
+#
+# FUDGE!!!
+# The third question in option 8 (Calculate consensus)
+# has changed in the latest version of bap.
+# Was:
+#     ? Make another consensus (y/n) (y) 
+# ## Now is:
+# ##     ? Staden format (y/n) (y) =
+# Now is:
+#     Select output format
+#     X  1 Staden
+#        2 Fasta
+#     ? Selection (1-2) (1) =
+#
+# Set fudge to reflect the appropriate choice
+#
+if ( ${project} == "bap" ) then
+	set fudge = ""
+	# set fudge = y
+else
+	set fudge = n
+endif
+
+#
+# One last minute check for BUSY files
+#
+if ( -e ${project}_BUSY${version} ) then
+	echo Project ${project} version ${version} is busy - try again later
+	goto abort
+endif
+
+${db_type} << EOF > /dev/null
+
+y
+${project}
+${version}
+
+8
+${consensus}
+y
+${fudge}
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to make consensus | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# move consensus to temp_dir
+#
+mv ${consensus} ${temp_dir}
+if ( $status ) then
+	echo stealdata: failed to move consensus to temp_dir | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# copy other database to this directory
+#
+echo Copying project ${other_project} version ${other_version} | tee -a ${report}
+cd ${temp_dir}
+cp -i \
+	${other_dir}/${other_project}.SQ${other_version} \
+	${other_dir}/${other_project}.AR${other_version} \
+	${other_dir}/${other_project}.RL${other_version} \
+	${other_dir}/${other_project}.TG${other_version} \
+	${other_dir}/${other_project}.CC${other_version} \
+	.
+if ( $status ) then
+	echo stealdata: failed to copy other database | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# file of file names in other database
+#
+set other_files = ${other_project}.all
+set other_useless = ${other_project}.useless
+set other_useful = ${other_project}.useful
+set other_new = ${other_project}.fn.$$
+
+#
+# extract gel readings
+#
+set option_extract = 32
+#
+# screen against vector
+#
+set option_screen  = 18
+#
+echo Extracting gel readings from ${other_project} and | tee -a ${report}
+echo "  screening against consensus from ${project}" | tee -a ${report}
+${other_db_type} << EOF > /dev/null
+
+y
+${other_project}
+${other_version}
+
+${option_extract}
+${other_files}
+n
+
+${option_screen}
+y
+${other_files}
+${other_useless}
+${consensus}
+30
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to extact or screen readings | tee -a ${report}
+	goto abort
+endif
+
+#
+# determine useful files in other project
+#
+echo Determining useful gel readings | tee -a ${report}
+sort ${other_files} | tr -d ' ' > ${other_files}.nice
+sort ${other_useless} | tr -d ' ' > ${other_useless}.nice
+comm -23 ${other_files}.nice ${other_useless}.nice > ${other_useful}
+
+#
+# create report header
+#
+echo "" | tee -a ${report}
+echo Useful gel readings: | tee -a ${report}
+
+#
+# determine new files 
+#
+touch ${other_new}
+set useful = 0
+foreach x ( `cat ${other_useful}` )
+	if ( -e ../$x ) then
+		echo $x | tee -a ${report}
+	else
+		echo $x "- NEW" | tee -a ${report}
+		echo $x >> ${other_new}
+		@ useful ++
+	endif
+end
+
+if ( $useful == 0 ) then
+	echo No new gel readings | tee -a ${report}
+	goto done	
+endif
+
+#
+# transfer sequences
+#
+if (${operation} == "s") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence files: " | tee -a ${report}
+	foreach x ( `cat ${other_new}` )
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and traces
+#
+if (${operation} == "a") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+	foreach x ( `cat ${other_new}` )
+		# copy sequence file
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+			if (-e $file) then
+				cp -i $file ..
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and link traces
+#
+if (${operation} == "l") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+	set files = `cat ${other_new}`
+	pushd .. >& /dev/null
+	foreach x ( $files )
+		# copy sequence file
+		cp -i ${other_dir}/${x} .
+		echo ${other_dir}/${x} | tee -a ${report}
+		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+			# link to traces
+			if (-e $file) then
+				ln -s $file
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	popd >& /dev/null
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+done:
+#
+# cleanup
+#
+echo Cleaning up | tee -a ${report}
+cd ..
+/bin/rm -r ${temp_dir}
+
+#
+# Finish up
+#
+echo "" | tee -a ${report}
+echo "Completed" | tee -a ${report}
+
+#
+# mail report
+#
+echo Mailing you a report
+mail `whoami` < ${report}
+
+
+exit 0
+
+
+#
+#
+#
+abort:
+	echo stealdata aborted `date` | tee -a ${report}
+	if ( $?home_dir ) then
+		cd $home_dir
+		echo Cleaning up | tee -a ${report}
+		/bin/rm -r ${temp_dir}
+	endif
+
+	exit 1
diff --git a/src/scripts/stealdata.stl b/src/scripts/stealdata.stl
new file mode 100644
index 0000000..399d1ae
--- /dev/null
+++ b/src/scripts/stealdata.stl
@@ -0,0 +1,441 @@
+#! /bin/csh -f
+#
+# stealdata (St.Louis Version)
+#
+#    Borrow data and files from a neighbouring cosmid database
+#
+# St.Louis version -
+# 1. prompts users for projects, versions, etc
+# 2. has to handle different reading/trace name conventions:
+#
+#    Site          Reading     Trace
+#    ------------  ----------  --------------------
+#    Cambridge,UK  <read>      <read>.{SCF,ABI,ALF>
+#    St.Louis      <read>.seq  <read>
+#
+#
+#
+# Change log:
+#
+#   v1.2 Handle dap and bap databases
+#   v1.3 Provide linking option
+#   v1.4 Under no circumstances link sequence files
+#   v1.5 Bug in option "l". Now works when databases have same name
+#   v1.6 To work with new version of bap
+#   v1.7 Variable report now set correctly
+#   v1.7.s St.Louis Version
+#   v1.8 bug in fudge
+#
+
+onintr abort
+
+#
+# set defaults here
+#
+# NOTE: "other" refers to the project you want to pinch data off
+# NOTE: db_type and other_db_type must be either "dap" or "bap"
+#
+
+echo -n "project name? (e.g. ZK1236)"
+set project = $<
+echo -n "project version (e.g. 0) "
+set version = $<
+echo -n "database type? (dap or bap) " 
+set db_type = $<
+echo -n "other project directory? (e.g. ~rick/ZK353 or ~rick/ZK112/assembly) "
+set other_dir = $<
+echo -n "other project name? (e.g. ZK353) "
+set other_project = $<
+echo -n "other project version? (e.g. 0) "
+set other_version = $<
+echo -n "other database type? (dap or bap) "
+set other_db_type = $<
+
+#set project = F54G8
+#set version = 0
+#set db_type = dap
+#set other_dir = ~kt/ZC84
+#set other_project = ZC84
+#set other_version = 0
+#set other_db_type = bap
+
+#
+# available operations are:
+#     r = report useful readings (response mailed)
+#     s = "r" + copy useful sequences only
+#     a = "r" + copy useful sequences AND traces
+#     l = "r" + copy useful sequences and LINK traces
+# default = "s"
+#
+# set operation = "s"
+
+echo "Choose desired operation: "
+echo " Available operations are:"
+echo "     r = report useful readings (response mailed)"
+echo "     s = r + copy useful sequences only"
+echo "     a = r + copy useful sequences AND traces"
+echo "     l = r + copy useful sequences and LINK traces"
+set operation = $<
+
+
+
+
+#
+# other - DON'T TOUCH ... only used for England
+#
+set trace_suffixes = ALF,RES,SCF
+
+#
+# check defaults
+#
+if (! -e ${project}.SQ${version}) then
+	echo Project ${project} version ${version} does not exist
+	exit 1
+endif
+
+if (! -d ${other_dir}) then
+	echo Directory ${other_dir} does not exist
+	exit 1
+endif
+
+if (! -e ${other_dir}/${other_project}.SQ${other_version}) then
+	echo Project ${other_dir}/${other_project} version ${other_version} does not exist
+	exit 1
+endif
+
+#
+# Report file
+#
+
+set report = $cwd/_steal.rep.${project}
+set count = 0
+while ( -e $report )
+	# echo Report file ${report} already exists...
+	@ count ++
+	set report = $cwd/_steal.rep.${project}.${count}
+end
+
+touch ${report}
+
+#
+# Output details of stealdata run
+#
+cat << EOF | tee -a ${report}
+Stealdata v1.8
+Borrow data from neighbouring cosmid database
+
+project = ${project}
+version = ${version}
+db_type = ${db_type}
+other_dir = ${other_dir}
+other_project = ${other_project}
+other_version = ${other_version}
+other_db_type = ${other_db_type}
+operation = ${operation}
+EOF
+
+switch ( ${operation} )
+case "r":
+	echo "    ( report only )" | tee -a ${report}
+	breaksw
+case "s":
+	echo "    ( copy sequences only, report )" | tee -a ${report}
+	breaksw
+case "a":
+	echo "    ( copy sequences and traces, report )" | tee -a ${report}
+	breaksw
+case "l":
+	echo "    ( copy sequences and link traces, report )" | tee -a ${report}
+	breaksw
+default:
+	echo "    ( unknown - report only )" | tee -a ${report}
+	set operation = "r"
+	breaksw
+endsw
+echo "" | tee -a ${report}
+
+#
+# create a temporary directory
+#
+echo Creating a temporary directory | tee -a ${report}
+set temp_dir = _steal.dir.${other_project}
+set count = 0
+while (-e ${temp_dir})
+	echo Temporary directory ${temp_dir} already exists | tee -a ${report}
+	@ count ++
+	set temp_dir = _steal.dir.${other_project}.${count}
+end
+
+set home_dir = `pwd`
+mkdir ${temp_dir}
+
+#
+#consensus for database
+#
+set consensus = _steal.seq.${project}
+set count = 0
+while ( -e $consensus )
+	echo Consensus file ${consensus} already exists | tee -a ${report}
+	@ count ++
+	set consensus = _steal.seq.${project}.${count}
+end
+
+#
+# determine consensus
+#
+echo Determining the consensus for project ${project} version ${version} | tee -a ${report}
+#
+# FUDGE!!!
+# The third question in option 8 (Calculate consensus)
+# has changed in the latest version of bap.
+# Was:
+#     ? Make another consensus (y/n) (y) 
+# Now is:
+#     ? Staden format (y/n) (y) =
+#
+# Set fudge to reflect the appropriate choice
+#
+if ( ${db_type} == "bap" ) then
+	set fudge = y
+else
+	set fudge = n
+endif
+${db_type} << EOF > /dev/null
+
+y
+${project}
+${version}
+
+8
+${consensus}
+y
+${fudge}
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to make consensus | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# move consensus to temp_dir
+#
+mv ${consensus} ${temp_dir}
+if ( $status ) then
+	echo stealdata: failed to move consensus to temp_dir | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# copy other database to this directory
+#
+echo Copying project ${other_project} version ${other_version} | tee -a ${report}
+cd ${temp_dir}
+cp -i \
+	${other_dir}/${other_project}.SQ${other_version} \
+	${other_dir}/${other_project}.AR${other_version} \
+	${other_dir}/${other_project}.RL${other_version} \
+	${other_dir}/${other_project}.TG${other_version} \
+	${other_dir}/${other_project}.CC${other_version} \
+	.
+if ( $status ) then
+	echo stealdata: failed to copy other database | tee -a ${report}
+	goto abort
+endif
+
+
+#
+# file of file names in other database
+#
+set other_files = ${other_project}.all
+set other_useless = ${other_project}.useless
+set other_useful = ${other_project}.useful
+set other_new = ${other_project}.fn.$$
+
+#
+# extract gel readings
+#
+set option_extract = 32
+#
+# screen against vector
+#
+set option_screen  = 18
+#
+echo Extracting gel readings from ${other_project} and | tee -a ${report}
+echo "  screening against consensus from ${project}" | tee -a ${report}
+${other_db_type} << EOF > /dev/null
+
+y
+${other_project}
+${other_version}
+
+${option_extract}
+${other_files}
+n
+
+${option_screen}
+y
+${other_files}
+${other_useless}
+${consensus}
+30
+
+!
+EOF
+if ( $status ) then
+	echo stealdata: failed to extact or screen readings | tee -a ${report}
+	goto abort
+endif
+
+#
+# determine useful files in other project
+#
+echo Determining useful gel readings | tee -a ${report}
+# lfw added awk intermediary
+# need to get a .seq on the end of all of the filenames...the output
+# from dap/bap would have truncated names
+sort ${other_files} | tr -d ' ' | awk  'BEGIN{FS=".";}{ print $1"."$2".seq";}' > ${other_files}.nice
+sort ${other_useless} | tr -d ' ' | awk  'BEGIN{FS=".";}{ print $1"."$2".seq";}' > ${other_useless}.nice
+comm -23 ${other_files}.nice ${other_useless}.nice > ${other_useful}
+
+#
+# create report header
+#
+echo "" | tee -a ${report}
+echo Useful gel readings: | tee -a ${report}
+
+#
+# determine new files 
+#
+touch ${other_new}
+set useful = 0
+foreach x ( `cat ${other_useful}` )
+	if ( -e ../$x ) then
+		echo $x | tee -a ${report}
+	else
+		echo $x "- NEW" | tee -a ${report}
+		echo $x >> ${other_new}
+		@ useful ++
+	endif
+end
+
+if ( $useful == 0 ) then
+	echo No new gel readings | tee -a ${report}
+	goto done	
+endif
+
+#
+# transfer sequences
+#
+if (${operation} == "s") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence files: " | tee -a ${report}
+	foreach x ( `cat ${other_new}` )
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and traces
+#
+if (${operation} == "a") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+
+#	foreach x ( `cat ${other_new} | sed 's/.seq//g' ` )
+# change by lfw for our sequence file naming convention
+	foreach x ( `cat ${other_new} | sed 's/.seq//g' ` )
+		# copy sequence file
+		cp -i ${other_dir}/${x} ..
+		echo ${other_dir}/${x} | tee -a ${report}
+#		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+# change by lfw for our sequence file naming convention
+		foreach file ( ${other_dir}/${x} )
+			if (-e $file) then
+				cp -i $file ..
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+#
+# transfer sequences and link traces
+#
+if (${operation} == "l") then
+	echo "" | tee -a ${report}
+	echo "Transfering sequence and trace files: " | tee -a ${report}
+#change by lfw so that it accomodates our sequence file names
+	set files = `cat ${other_new} | sed 's/.seq//g' `
+	pushd .. >& /dev/null
+	foreach x ( $files )
+		# copy sequence file
+		cp -i ${other_dir}/${x} .
+		echo ${other_dir}/${x} | tee -a ${report}
+#		foreach file ( ${other_dir}/${x}{${trace_suffixes}} )
+# change by lfw so that it accomodates our sequence and trace file names
+		foreach file ( ${other_dir}/${x} )
+			# link to traces
+			if (-e $file) then
+				ln -s $file
+				echo $file | tee -a ${report}
+			endif
+		end
+	end
+	popd >& /dev/null
+	# file of file names
+	mv ${other_new} ..
+	echo "" | tee -a ${report}
+	echo File of file names is ${other_new} | tee -a ${report}
+endif
+
+done:
+#
+# cleanup
+#
+echo Cleaning up | tee -a ${report}
+cd ..
+/bin/rm -r ${temp_dir}
+
+#
+# Finish up
+#
+echo "" | tee -a ${report}
+echo "Completed" | tee -a ${report}
+
+#
+# mail report
+#
+echo Mailing you a report
+mail `whoami` < ${report}
+
+
+exit 0
+
+
+#
+#
+#
+abort:
+	echo stealdata aborted `date` | tee -a ${report}
+	if ( $?home_dir ) then
+		cd $home_dir
+		echo Cleaning up | tee -a ${report}
+		/bin/rm -r ${temp_dir}
+	endif
+
+	exit 1
+
+
diff --git a/src/squirrel-1.4/DISCLAIMER b/src/squirrel-1.4/DISCLAIMER
new file mode 100644
index 0000000..5f4b1e8
--- /dev/null
+++ b/src/squirrel-1.4/DISCLAIMER
@@ -0,0 +1,17 @@
+DISCLAIMER
+
+Squirrel was written by Simon Dear (MRC Laboratory of Molecular
+Biology) as a tool to automate the processing of data on the C.elegans
+sequencing project. It has been used extensively since May 1992.
+
+The software is not generally applicable to other sequencing projects
+without extensive modification. We would like to encourage other
+groups to build similar systems based on our version of squirrel. You
+are free to use and modify the squirrel software as you see fit. We do
+not accept any responsibility for the software as it is distributed or
+after it has been modified by you.
+
+
+
+
+						sd@mrc-lmb.cam.ac.uk
diff --git a/src/squirrel-1.4/bin/README b/src/squirrel-1.4/bin/README
new file mode 100644
index 0000000..13dc411
--- /dev/null
+++ b/src/squirrel-1.4/bin/README
@@ -0,0 +1,29 @@
+dir: $SQUIRREL/bin/				
+date: Thu May 06 14:18:51 BST 1993
+
+
+The following programs must be installed in this directory:
+
+expGetSeq	
+	For extracting the sequence and trace clipping information from
+	trace files.
+	Sources in $STADENROOT/src/expGetSeq/
+
+
+vepe		
+	For excising vector from a sequence in experiment file format.
+	Sources in $STADENROOT/src/vepe/
+
+
+getABISampleName
+	For retrieving the ABI Sample Name from an ABI trace file.
+	Sources in $STADENROOT/src/abi/
+
+
+makeSCF	
+	For converting a trace file to SCF format.
+	Sources in $STADENROOT/src/scf/
+
+update_subclones
+	For updating the $SQUIRREL/tables/subclones database
+	Sources in $STADENROOT/src/update_subclones/
diff --git a/src/squirrel-1.4/doc/DISCLAIMER b/src/squirrel-1.4/doc/DISCLAIMER
new file mode 100644
index 0000000..5f4b1e8
--- /dev/null
+++ b/src/squirrel-1.4/doc/DISCLAIMER
@@ -0,0 +1,17 @@
+DISCLAIMER
+
+Squirrel was written by Simon Dear (MRC Laboratory of Molecular
+Biology) as a tool to automate the processing of data on the C.elegans
+sequencing project. It has been used extensively since May 1992.
+
+The software is not generally applicable to other sequencing projects
+without extensive modification. We would like to encourage other
+groups to build similar systems based on our version of squirrel. You
+are free to use and modify the squirrel software as you see fit. We do
+not accept any responsibility for the software as it is distributed or
+after it has been modified by you.
+
+
+
+
+						sd@mrc-lmb.cam.ac.uk
diff --git a/src/squirrel-1.4/doc/ExperimentFile.format b/src/squirrel-1.4/doc/ExperimentFile.format
new file mode 100644
index 0000000..183a7e7
--- /dev/null
+++ b/src/squirrel-1.4/doc/ExperimentFile.format
@@ -0,0 +1,303 @@
+Experiment file format                   Simon Dear, 15 February 1993
+---------------------------------------------------------------------
+
+The experiment file will contain every piece of information that will
+ever needed to be known for the initial sequence processing, assembly,
+oligo primer selection and automated walking.  This does not mean that
+all records in the file will be stored in the new style database, as
+many are clearly not relevant after the preprocessing stages.
+
+
+Records in file
+---------------
+
+The order of records in the file is not important. They are listed
+here in alphabetical order:
+
+CC   (comment line)
+CF   (Cloning vector sequence File)
+CN   (Clone Name)
+CS   (Cloning vector Sequence present in sequence)
+CV   (Cloning Vector type)
+DR   (Direction of Read)
+DT   (DaTe of experiment)
+EN   (Experiment Name)
+EX   (EXperimental notes)
+FM   (sequencing vector Fragmentation Method)
+ID   (IDentifier)
+LN   (Local format trace file Name)
+LT   (Local format trace file Type)
+MC   (MaChine on which experiment ran)
+MN   (Machine generated trace file Name)
+MT   (Machine generated trace file Type)
+OP   (OPerator)
+PN   (Primer Name)
+PS   (Processing Status)
+QL   (poor Quality sequence present at Left (5') end)
+QR   (poor Quality sequence present at Right (3') end)
+SC   (Sequencing vector Cloning site)
+SF   (Sequencing vector sequence File)
+SI   (Sequencing vector Insertion length)
+SL   (Sequencing vector sequence present at Left (5') end)
+SP   (Sequencing vector Primer site (relative to cloning site))
+SQ   (SeQuence)
+SR   (Sequencing vector sequence present at Right (3') end)
+SS   (Screening Sequencing)
+ST   (STrands)
+SV   (Sequencing Vector type)
+TN   (Template Name)
+
+
+Explanation of records
+----------------------
+
+Record: CC   (comment line)
+Format:	CC   string
+Explanation:
+A general comment line
+be used by vep.
+
+Record: CF   (Cloning vector sequence File)
+Format:	CF   string
+Explanation:
+The name of the file containing the sequence of the cloning vector, to
+be used by vep.
+
+Record: CN   (Clone Name)
+Format:	CN   string
+Explanation:
+The name of the physical map clone. The new database will allow
+assembly of sequences from overlapping clones. Knowing the clone name
+provides gap with mapping information about the sequence.
+
+Record: CS   (Cloning vector Sequence present in sequence)
+Format:	CS   range
+Explanation:
+Regions of sequence considered to be cloning vector. Used in assembly
+to exclude unwanted sequence.
+
+Record: CV   (Cloning Vector type)
+Format:	CV   string
+Explanation:
+The type of the cloning vector used. Required for vep
+
+Record: DR   (Direction of Read)
+Format:	DR   direction
+Explanation:
+Which of forward or reverse primers were used. Useful for oligo primer
+selection, and automated walking. Allows mapping of forward and
+reverse reads off same template.
+
+Record: DT   (DaTe of experiment)
+Format:	DT   dd-mon-yyyy
+Explanation:
+The date on which the experiment file was created. Not used anywhere.
+
+Record: EN   (Experiment Name)
+Format:	EN   string
+Explanation:
+The name given to the experiment. Allows disassociation of the name of
+the experiment file and the name that gets used by gap.
+
+Record: EX   (EXperimental notes)
+Format:	EX   string
+Explanation:
+Free format comment for additional information. Not used anywhere.
+
+Record: FM   (sequencing vector Fragmentation Method)
+Format:	FM   string
+Explanation:
+Fragmentation method used to create subclone (sequencing vector)
+library. Non used anywhere.
+
+Record: ID   (IDentifier)
+Format:	ID   string
+Explanation:
+Same as EN (Experiment Name). This field is to make the experiment
+file look like it is in EMBL format.
+
+Record: LN   (Local format trace file Name)
+Format:	LN   string
+Explanation:
+The name of the local format trace file. This information is passed
+onto gap, and allows for local formats to be used.
+
+Record: LT   (Local format trace file Type)
+Format:	LT   string
+Explanation:
+The type of the local trace file type.
+
+Record: MC   (MaChine on which experiment ran)
+Format:	MC   string
+Explanation:
+The lab's name for the sequencing machine used to create the data. Is
+this useful?
+
+Record: MN   (Machine generated trace file Name)
+Format:	MN   string
+Explanation:
+The name of the trace file generated by the sequencing machine MC.
+
+Record: MT   (Machine generated trace file Type)
+Format:	MT   string
+Explanation:
+The type of MN.
+
+Record: OP   (OPerator)
+Format:	OP   string
+Explanation:
+Someone's name, possibly the person who ran the script. Not used anywhere.
+
+Record: PN   (Primer Name)
+Format:	PN   string
+Explanation:
+Name of primer used, using local naming convention. Could be a
+universal primer. May be useful one day. Not used currently.
+
+Record: PS   (Processing Status)
+Format:	PS   explanation
+Explanation:
+Indication of processing status. The presence of this entry indicates
+failure of some kind, and the experiment file should not be further
+processed.
+
+Record: QL   (poor Quality sequence present at Left (5') end)
+Format:	QL   position
+Explanation:
+The sequence up to and including the base at the marked position are
+considered too poor quality to be considered. May overlap with other
+marked sequences - CS, SL or SR. Used in assembly to exclude unwanted
+sequence.
+
+Record: QR   (poor Quality sequence present at Right (3') end)
+Format:	QR   position
+Explanation:
+The sequence from and including the base at the marked position to the
+end are considered too poor quality to be considered. May overlap with
+other marked sequences - CS, SL or SR. Used in assembly to exclude
+unwanted sequence.
+
+Record: SC   (Sequencing vector Cloning site)
+Format:	SC   position
+Explanation:
+The cloning site of the sequence vector, based on local knowledge of
+the sequence vector. Used by vep.
+
+Record: SF   (Sequencing vector sequence File)
+Format:	SF   string
+Explanation:
+The name of the file containing the sequence of the sequencing vector,
+to be used by vep.
+
+Record: SI   (Sequencing vector Insertion length)
+Format:	SI   range
+Explanation:
+Suspected insertion length of subclone. Useful for selecting templates
+in oligo primer selection.
+
+Record: SL   (Sequencing vector sequence present at Left (5') end)
+Format:	SL   position
+Explanation:
+The sequence up to and including the base at the marked position are
+considered to be sequencing vector. Used in assembly to exclude
+unwanted sequence.
+
+Record: SP   (Sequencing vector Primer site (relative to cloning site))
+Format:	SP   position
+Explanation:
+Location of the primer using to sequence relative to cloning site.
+Used by vep.
+
+Record: SQ   (SeQuence)
+Format:	SQ   \nsequence...\n\\\n
+Explanation:
+Complete sequence, as determined by the sequencing machine.
+
+Record: SR   (Sequencing vector sequence present at Right (3') end)
+Format:	SR   position
+Explanation:
+The sequence from and including the base at the marked position to the
+end are considered to be sequencing vector. Used in assembly to
+exclude unwanted sequence.
+
+Record: SS   (Screening Sequencing)
+Format:	SS   string
+Explanation:
+Sequences to screen sequence against. To screen against more that one
+sequence, it is permissible to include more that one SS line. The
+field is used by the script screen-against-vector from within squirrel.
+Experiments partially matching a screen file will have a PS line added
+to the end.
+
+Record: ST   (STrands)
+Format:	ST   number
+Explanation:
+Indicating whether this is a single or double stranded template. This
+is useful in oligo primer selection, for deducing suitable templates.
+
+Record: SV   (Sequencing Vector type)
+Format:	SV   string
+Explanation:
+Name of sequencing vector used in experiment. Allows preference lists
+to be set up for choosing templates in selecting oligos.
+
+Record: TN   (Template Name)
+Format:	TN   string
+Explanation:
+The name of the template used in the experiment.
+
+
+
+Example File
+------------
+
+The information so far included here is easily derivable from the
+trace file, using various scripts and a few look-up tables. This
+example is based on one created automatically:
+
+
+ID   h4a01h6.s1
+EN   h4a01h6.s1
+TN   h4a01h6
+EX   lane 18, run time 10 hrs
+MN   Sample 18
+MC   A
+MT   ABI
+LN   h4a01h6.s1SCF
+LT   SCF
+DT   08-Jan-1993
+CC   Dir: /home/sd/Mac-files-sd/Results F 16_12_92.2
+OP   sd
+TN   h4a01h6
+SV   M13mp18
+SF   /home/sd/squirrel/seqs/m13mp18.vec
+SI   1000..2000
+SC   6249
+FM   shear
+PN   -21
+DR   +
+SP   41
+ST   1
+CN   3G9
+CV   sCos-1
+CF   /home/sd/squirrel/seqs/sCos-1.seq
+SS   /home/sd/squirrel/seqs/Alu.seqs
+SS   /home/sd/squirrel/seqs/schrrIII.seq
+SQ
+     GCTTGCATGC CTGCAGGTCG ACTCTAGAGG ATCCCCAACC AGTAAGGCAA CCCCGCCAGC
+     CTAGCCGGGT CCTCAACGAC AGGAGCACGA TCATGCGCAC CCGTCAGATC CAGACATGAT
+     AAGATACATT GATGAGTTTG GACAAACCAC AACTAGAATG CAGT-AAAAA AATGCTTTAT
+     TTGTGAAATT TGTGATGCTA TTGCTTTATT TGTAACCATT ATAAGCTGCA ATAAACAAGT
+     TAACAACAAC AATTGCATTC ATTTTATGTT TCAGGTTCAG GGGGAGGTGT GGGAGGTTTT
+     TTAAAGCAAG TAAAACCTCT ACAAATGTGG TATGGCTGAT TATGATCTCT AGTCAAGGCA
+     CTATACATCA AATATT-CCT TATTAACCCC CTTTACAAAT TTAAAAGGCT -AAAGGGTCC
+     ACAATTTTTG -GCCTAGGTA TTAATAGCCG GCACTTCTT- TGCCTGTTTT GG-GTAGGG-
+     AAAACCGGTA TGTTT-TGGT T-TTC
+//
+CC   Quality clipping by newted May-1992
+QL   0
+QR   281
+SL   36
+SR   506
+CS   37..280
+PS   Completely cloning vector
diff --git a/src/squirrel-1.4/doc/NOTES b/src/squirrel-1.4/doc/NOTES
new file mode 100644
index 0000000..e88b20c
--- /dev/null
+++ b/src/squirrel-1.4/doc/NOTES
@@ -0,0 +1,14 @@
+NOTES (C.elegans project)
+
+1. Sequence files, cloning sites and vepe
+
+When vepe excises cloning vector, it does not make allowances for the
+vector being circular. It also makes no assumptions about the locations
+of the splice sites. It is important when setting up the sequence files
+($SQUIRREL/seqs) to make allowances for these factors - DO NOT LET THE
+SPICE SITE BE TOO NEAR EITHER END OF THE FILE. You may need to edit the
+sequence file so that the site lies near the centre of the sequence.
+
+2. Cosmids and cloning sites
+Lorist2    BamHI/G'GATCC//
+
diff --git a/src/squirrel-1.4/doc/Porting.Overview b/src/squirrel-1.4/doc/Porting.Overview
new file mode 100644
index 0000000..cc73874
--- /dev/null
+++ b/src/squirrel-1.4/doc/Porting.Overview
@@ -0,0 +1,154 @@
+SQUIRREL Overview (Version 1.4)      10 July 1992     Simon Dear
+================================================================
+
+0. Introduction
+
+This is intended as a brief overview for those wanting to set up
+squirrel on their system. It gives an overview of what is supplied
+in the package, what changes must be made, and places where it is
+foreseen problems may occur.
+
+
+1. Overview of the squirrel directory structure
+
+./squirrel
+	The root of the squirrel tree. To use squirrel V1.4 ensure
+that the environment variable SQUIRREL is set to be the path name of
+the top of the tree. eg
+	setenv SQUIRREL /home/BioSW/squirrel
+
+./squirrel/bin
+	Contains all of the programs required by squirrel -except- the
+assembly program(s) dap(,bap,gap).
+
+./squirrel/doc
+	All the existing documentation on squirrel. Of interest are:
+		SQUIRREL - overview of processing
+		SysManagement - how to manage squirrel's tables
+	Read them!!!
+
+./squirrel/scripts
+	All the scripts used by squirrel are here. The important one
+is `squirrel'. A copy of this should be placed in the user's project
+directory and edited so that the environment variables PROJECT and
+EDITOR reflect the project name and the user's choice of editor.
+	
+./squirrel/seqs
+	All the vector sequence files that squirrel V1.4 knows about.
+
+./squirrel/tables
+	Information about the sequencing project. See the document
+SysManagement and SQUIRREL for details.
+
+
+
+
+
+2. Changes required
+
+The following changes should be made immediately. They are the obvious
+things. There are undoubtedly more subtle changes to be made in the
+scripts.
+
+./squirrel/tables/vector-seqfiles
+	You may need to add entries to this file. V1.4 assumes that
+the files exist in the directory $SQUIRREL/seqs
+
+./squirrel/tables/databases
+	Register your users now!
+
+./squirrel/tables/seqclone-sites
+	You may need to add entries to this file. WARNING - it is
+exceedingly difficult to get the values of the cloning site and
+primer positions correct. Ensure that the associated vector file
+is in the appropriate sense.
+	Support for multiple cutters: You will need to do provide
+entries for each vector-cutter pair. Eg M13mp18-SmaI. Default values
+(eg M13mp18) should be included AFTER more specific ones. Subclones
+should include the vector-cutter pair as the sequencing vector.
+	Multiple forward or reverse primers aren't yet supported.
+
+./squirrel/tables/subclones
+	The painful bit. This information will need to be updated on a
+regular basis.
+
+./squirrel/tables/clone_types.awk
+	Information mapping clones to their cloning vector.
+
+
+
+
+
+3. Assumptions made
+
+To foresee where squirrel may go wrong, it will pay be be aware of the
+assumptions made by the script.
+
+a) Machines supported
+	Currently squirrel assumes that you have an ABI machine. The
+script has been written so that all ABI specific parts are separated.
+If other machines are to be supported, then similarly functioning
+scripts to ABI-sess-files.csh (for determining samples and experiments to
+be processed) and ABI-exp-files.csh (creation of experiment files -
+machine specific bits) will need to be written.
+
+b) Location of Results folders
+	Results folders are assumed to be in the directory called:
+		$HOME/Mac-files-$USER/"Results $MACHINE $DATE"
+where $HOME is the user's home directory, $USER is the user's is,
+$MACHINE and $DATE are the command line arguments passed to squirrel.
+This is evaluated once, in the script squirrel.
+
+c) Samples to be processed
+	V1.4 will assume that all files in the results folder matched by the
+glob "Sample [0-9][0-9]{,.[0-9]}" are to be processed.
+
+d) Experiment naming conventions
+	squirrel makes a few assumptions about the naming convention
+of experiments (samples):
+	<name> ::= <mtd><well>.<class><anything>
+where:
+             <mtd> ::= [a-z]{1,2}[0-9][0-9]
+            <well> ::= <anything>
+	   <class> ::= [fF] {forward read, double stranded}
+		     | [rR] {reverse read, double stranded}
+	             | [^rR] {forward read, single stranded}
+	<anything> ::= {any else}
+Because of the way squirrel uses lookup tables to gather its
+information, it is essential that the prefix <mtd> determines
+a unique microtitre dish. (ie can't have mtds with names "a10" and
+"a1"). I think our naming convention guarentees this. Also, being able
+to determine the direction of the read is essential for vepe to work.
+The strandedness is determined but isn't currently used. All this is
+encoded in the script `embellish-exp-files.csh'.
+
+e) Only used for shotgun stage
+	squirrel has only been tested for the shotgun stage. V1.4 is
+not guarenteed to work for the walking phase.
+
+f) Owner of directory
+	squirrel doesn't check that the user is the ower of the
+results folder or the project directory.
+
+g) Unwanted files
+	Squirrel assumes that you don't want to keep any of the files
+(ABI traces or .Seq files) in the Results folder. Traces are converted
+to SCF format by ABI-exp-files.csh. 
+
+h) Software assumptions
+	You mush have awk and assembly program (dap,bap) to run
+squirrel. All other programs are provided. squirrel assumes that you
+have a version of the assembly program that writes out reason codes
+with the assembly failures, but it doesn't complain if you don't.
+
+i) That you want to run the entirity of squirrel and not just bits of it
+	Most of the scripts in squirrel could be run as stand-alone
+programs but they weren't written with that intention. With squirrel
+you should assume you get all or nothing.
+
+
+
+
+
+
+
diff --git a/src/squirrel-1.4/doc/README b/src/squirrel-1.4/doc/README
new file mode 100644
index 0000000..7a8c300
--- /dev/null
+++ b/src/squirrel-1.4/doc/README
@@ -0,0 +1,44 @@
+ Dir: $SQUIRREL/doc/
+Date: Fri May 07 11:16:27 BST 1993
+
+Here is the documentation on squirrel.
+
+
+DISCLAIMER -
+	Please read first!
+
+SQUIRREL - An overview of squirrel
+	A quick guide to the processing phases involved in squirrel.
+	Example session files, experiment files, reports and squirrel
+	databases (tables) are given.
+
+SysManagement - Maintaining the squirrel databases
+	An in depth guide into how to maintain the information squirrel
+	needs to process automatically.
+
+Porting.Overview - Getting squirrel working on your system
+	A brief overview for those wanting to set up squirrel on their
+	system. It gives an overview of what is supplied in the
+	package, what changes must be made, and places where it is
+	foreseen problems may occur. 
+
+ExperimentFile.format - A guide to experiment files
+	Experiment files are the mechanism information is transferred
+	between programs in squirrel. Future versions of the Staden
+	assembly program will read in experiment files directly. This
+	document is useful for those wanting to produce compatible
+	files but who do not wish to use squirrel.
+	
+NOTES - General comments
+	Not much here, though an important note for those adding new
+	cloning vector files to their squirrel databases.
+
+
+-----
+
+Further information should be sought from the author.
+
+
+
+Simon Dear
+sd@mrc-lmb.cam.ac.uk
diff --git a/src/squirrel-1.4/doc/SQUIRREL b/src/squirrel-1.4/doc/SQUIRREL
new file mode 100644
index 0000000..2e9f37c
--- /dev/null
+++ b/src/squirrel-1.4/doc/SQUIRREL
@@ -0,0 +1,281 @@
+SQUIRREL
+----------------------------------------------------------------------
+
+    squirrel n. small bushy-tailed tree-living rodent; (fig) one
+    who hoards objects of small value
+
+
+
+Squirrel performs all operations on samples in a results folder:
+quality clipping, vector removal and assembly.
+
+Usage:
+	squirrel machine_letter date
+
+
+
+Phase 0: Start up
+
+Determines the pass number. Squirrel allows several passes on the same
+Results folder.
+
+
+Phase 1: Determining samples to be processed (ABI)
+
+Creates a session file which maps each available sample to its
+experiment name (sample name). An example of the session file appears
+at the end.
+
+
+Phase 2: Edit the session file
+
+The user can edit the session file, to correct experiment names, and
+to selectively omit entries from further processing
+
+FROM HERE ON, the process is automatic and requires no user
+intervention.
+
+Phase 2a: Check the edited experiment file for consistency
+
+
+Phase 3: Create experiment files for each entry in the session file. (ABI)
+
+Only sequencing machine (ABI) specific fields are added to the new file.
+In addition, the trace file is converted to our local (SCF) format.
+
+
+Phase 4: Embellish the experiment files.
+
+Non machine specific attributions.  Starting from the experiment name,
+it is possible to determine all information about a specific
+experiment file.  A series of lookup tables is used (see end).
+
+	experiment-name -> (cosmid,insert-size,frag-method,seq-vector)
+	eg c72a11.s1 -> (C40H1,1000..2000,shear,M13mp18)
+
+	experiment-name -> direction-of-read
+	eg c72a11.s1 -> forwards
+
+	cosmid -> cloning-vector
+	eg C40H1 -> pJB8
+
+	seq-vector -> (clone-site,forward-primer-site,reverse-primer-site)
+	eg M13mp18 -> (6249,41,-24)
+
+	vector -> vector-file
+	eg pJB8 -> /nfs/westmorland/home/BioSW/staden/userdata/pjb8.vep
+	eg M13mp18 -> /nfs/westmorland/home/BioSW/staden/userdata/m13mp18.vec
+
+The sequence is extracted from the trace (SCF) file, and clipped.
+Clipping is now done by examining the traces, using LaDeana's quality
+measures. As a fallback mechanism, the old method based on base content
+is still used, but only when it gives more conservative values.
+
+[Old method is:
+
+To the right of base number 200 and to the left of base 100 everything
+is considered good quality upto where there are 2 uncertainties in a
+5 base window. The right hand end is ALWAYS knocked back 50 bases, and
+never is set greater than base number 450.
+]
+
+
+Phase 5: Quality checking.
+
+Each experiment file is checked for quality. Sequences that, after
+quality clipping, are shorter that 100 bases or have greater than 4%
+uncertainties are failed. Traces that fail LaDeana's overall quality
+check will fail here as the sequence will have zero length.
+
+
+Phase 6: Vector removal
+
+There are two passes. First to remove sequencing vector, and second to
+remove cloning vector. The program "vepe" uses information about vector
+and vector sequence files embedded in the experiment file.
+
+
+Phase 7: Transfer files to project directory
+
+Trace files are directly copied. Experiment files are converted to the
+current sequence file format used by the assembly program. If they
+have been failed for any reason (completely vector, poor quality) they
+are placed in the subdirectoy "Failures". Also, the experiment files are
+retained and stored in the subdirectory "Expfiles". Experiments that
+already exist are abandoned.
+
+
+Phase 8: Assemble the sequences
+
+THe current database is temporarily copied to a working version ('X'), into
+which the sequences are assembled. If the assembly completes without
+error, the current version is updated.
+
+
+Phase 10: Reporting
+
+A report is produced detailing the history of each experiment file.
+Copies the report are left in subdirectory "Logs" of the project
+directory.
+Each experiment is given a processing status, which is one of:
+
+       OK - the experiment ended up in the database
+ Rejected - tried to assembly the experiment but failed
+   Failed - the experiment was not considered suitable for assembly
+            (too short, too poor quality, completely vector, etc)
+Abandoned - not processed this time because something went wrong
+	    (missing entries in tables, experiment already exists, etc)
+Discarded - no information was obtainably for this experiment
+            (no sequence or trace data)
+
+
+
+Phase 9: Tidying up
+
+All samples that were not abandoned are removed from the transfer
+directory.
+
+
+
+Example session file
+--------------------
+
+# SQUIRREL Version 1.1
+#
+# Processing /home/sd/Mac-files-sd/Results A 4_10_92.2
+# Project TESTDB
+# Started at Wed Apr 15 13:04:58 BST 1992
+# Pass 1
+#
+# You may...
+#  + delete lines you don't wish to be processed
+#  + change the experiment name
+# But DON'T...
+#  - create additional lines
+#  - introduce white space
+#
+Sample 01:c68a1.s1
+Sample 02:c68a2.s1
+Sample 03:c68a3.s1
+Sample 04:c68a4.s1
+Sample 05:c68a5.s1
+Sample 06:c68a6.s1
+Sample 07:c68a7.s1
+Sample 08:c68a8.s1
+Sample 09:c68a9.s1
+Sample 10:c68a10.s1
+Sample 11:c68a11.s1
+Sample 12:c68a12.s1
+Sample 13:c68b1.s1
+Sample 14:c68b2.s1
+Sample 15:c68b3.s1
+Sample 16:c68b4.s1
+Sample 17:c68b5.s1
+Sample 18:c68b6.s1
+Sample 19:c68b7.s1
+Sample 20:c68b8.s1
+Sample 21:c68b9.s1
+Sample 22:c68b10.s1
+Sample 23:c68b11.s1
+Sample 24:c68b12.s1
+Sample 25:c68c1.s1
+Sample 26:c68c2.s1
+Sample 27:c68c3.s1
+Sample 28:c68c4.s1
+Sample 29:c68c5.s1
+Sample 30:c68c6.s1
+Sample 31:c68c7.s1
+Sample 32:c68c8.s1
+Sample 33:c68c9.s1
+Sample 34:c68c10.s1
+Sample 35:c68c11.s1
+Sample 36:c68c12.s1
+
+
+Example Lookup tables
+---------------------
+
+experiment-name -> (cosmid,insert-size,frag-method,seq-vector)
+
+c40 K01F9 1000..2000 shear M13mp18
+c41 K01F9 1000..2000 shear M13mp18
+c42 F31H8 1000..2000 shear M13mp18 (Cosmid+=ZC84)
+c43 C18H2 1000..2000 shear M13mp18
+c44 K01B6 1000..2000 shear M13mp18
+c45 ZK512 9000..14000 shear pUC118
+c46 K01B6 1000..2000 shear M13mp18
+c47 M01A8 1000..2000 shear M13mp18
+c48 ZK512 9000..14000 shear pUC118
+c49 ZK507 6000..9000 shear pUC118
+c50 M01A8 1000..2000 shear M13mp18
+
+
+seq-vector -> (clone-site,forward-primer-site,reverse-primer-site)
+
+M13mp18 6249 41 -24
+pBS     2305 46 -79
+
+
+vector -> vector-file
+
+Lambda /nfs/westmorland/home/BioSW/staden/userdata/lambda.seq
+Lorist2 /nfs/westmorland/home/BioSW/staden/userdata/lorist2.vep
+Lorist6 /nfs/westmorland/home/BioSW/staden/userdata/lorist6.vep
+M13mp18 /nfs/westmorland/home/BioSW/staden/userdata/m13mp18.vec
+M13mp7 /nfs/westmorland/home/BioSW/staden/userdata/m13mp7.seq
+pBS /nfs/westmorland/home/BioSW/staden/userdata/blue.vep
+pJB8 /nfs/westmorland/home/BioSW/staden/userdata/pjb8.vep
+pUC18 /nfs/westmorland/home/BioSW/staden/userdata/puc18.vec
+
+
+For simplicity of use, these are converted to awk programs. The
+mapping of (cosmid -> cloning-vector) only exists in its awk form, and
+is not included here.
+
+
+
+Example Report
+--------------
+
+# SQUIRREL Version 1.1
+# Processing /home/sd/Mac-files-sd/Results A 4_10_92.2
+# Project TESTDB
+# Started at Wed Apr 15 18:35:01 BST 1992
+# Pass 4
+#
+Sample 01:  c68a1.s1   Failed    Trace quality
+Sample 02:  c68a2.s1   OK        Assembled
+Sample 03:  c68a3.s1   Failed    Trace quality
+Sample 04:  c68a4.s1   Failed    Trace quality
+Sample 05:  c68a5.s1   Failed    Trace quality
+Sample 06:  c68a6.s1   Failed    Trace quality
+Sample 07:  c68a7.s1   Failed    Completely cloning vector
+Sample 08:  c68a8.s1   OK        Assembled
+Sample 09:  c68a9.s1   OK        Assembled
+Sample 10:  c68a10.s1  Failed    Trace quality
+Sample 11:  c68a11.s1  OK        Assembled
+Sample 12:  c68a12.s1  OK        Assembled
+Sample 13:  c68b1.s1   OK        Assembled
+Sample 14:  c68b2.s1   OK        Assembled
+Sample 15:  c68b3.s1   Failed    Completely cloning vector
+Sample 16:  c68b4.s1   OK        Assembled
+Sample 17:  c68b5.s1   Failed    Trace quality
+Sample 18:  c68b6.s1   Rejected  Did not assemble : failed to assemble and not entered
+Sample 19:  c68b7.s1   Failed    Completely sequencing vector
+Sample 20:  c68b8.s1   OK        Assembled
+Sample 21:  c68b9.s1   OK        Assembled
+Sample 22:  c68b10.s1  Failed    Trace quality
+Sample 23:  c68b11.s1  OK        Assembled
+Sample 24:  c68b12.s1  OK        Assembled
+Sample 25:  c68c1.s1   Failed    Trace quality
+Sample 26:  c68c2.s1   OK        Assembled
+Sample 27:  c68c3.s1   Failed    Completely cloning vector
+Sample 28:  c68c4.s1   Failed    Trace quality
+Sample 29:  c68c5.s1   OK        Assembled
+Sample 30:  c68c6.s1   Failed    Completely cloning vector
+Sample 31:  c68c7.s1   OK        Assembled
+Sample 32:  c68c8.s1   Failed    Completely cloning vector
+Sample 33:  c68c9.s1   OK        Assembled
+Sample 34:  c68c10.s1  Failed    Trace quality
+Sample 35:  c68c11.s1  Failed    Trace quality
+Sample 36:  c68c12.s1  Failed    Completely sequencing vector
diff --git a/src/squirrel-1.4/doc/SysManagement b/src/squirrel-1.4/doc/SysManagement
new file mode 100644
index 0000000..60ed20f
--- /dev/null
+++ b/src/squirrel-1.4/doc/SysManagement
@@ -0,0 +1,260 @@
+			       SQUIRREL
+			  System Management
+
+			      Simon Dear
+			     29 June 1992
+
+
+0. Introduction
+---------------
+
+Squirrel is a complicated script that gathers together information
+about an experiment and then assembles the processed sequences into a
+user's database. The philosophy behind squirrel is that there should
+be minimal user interaction. In actuality, all that need be entered is
+little more than the name of directory containing data to be
+processed.
+
+Ease of use for the user often means extra work for someone else. With
+squirrel this is no exception. Squirrel requires the maintenance of
+several small files. This document describes these files and the
+procedure for keeping them updated.
+
+Squirrel needs to gather additional information about:
+
+  a) subclone libraries
+  b) vectors used to clone each cosmid
+  c) sequencing vector clone sites
+  d) files containing vector sequence
+  e) user's databases
+
+For a sequencing project the data on (b)-(d) will be fairly static and
+quite small. Data on (a) will be frequently updated as new libraries
+are going to be continually created. Data on (e) will be updated
+occasionally; every time new databases are started.
+
+For C-Shell programming simplicity and uniformity the data is
+encapsulated in simple awk programs.
+
+We sometimes need to look up information which is a generalisation
+about a specific example. For example, we may be want data on a
+subclone e06b11 where data is held for all subclones starting e06. The
+pattern matching abilities of awk are ideal for this.
+
+We also sometime need to determine information about a specific
+example, where it is not possible to make general inferences using
+simple rules.  An example is data on (b), which reflects the
+changing strategies during genome mapping. Building this data into an
+awk program is a very simple task. See Appendix 1.
+
+However, is it not desirable to maintain awk programs. Small typing
+mistakes can result in a program with severe errors.  For (a), and
+(c)-(e) it is more convenient to enter the data into a text file, and
+have the awk program automatically created from it using the UNIX make
+facility. In the case of (b), there is no escaping having an awk
+program. It is therefore fortunate that this data will not
+substantially change during the lifetime of the sequencing project.
+
+The text files for (a)-(e) can be found in the directory
+
+	$SQUIRREL/tables
+	(/nfs/tode/home2/users/wormpub/BioSW/squirrel/tables)
+
+During the rest of this document it is assumed that the person
+performing the updates is logged in as user wormpub and that they have
+changed to this directory.
+
+
+
+
+1. Updating information on subclone libraries
+---------------------------------------------
+
+The information on subclone libraries can be updated in two ways.
+
+First, the file ``subclones'' can be modified using an editor. It is
+important that information is entered in exactly the right format. The
+fields of this file are:
+
+    <microtitre-dish> <cosmid> <insert-size> <fragmentation-method> ...
+    <sequencing-vector> <comments...>
+
+An example:
+    e50 F54C8 1000..2000 shear M13mp18
+
+Alternatively the program update_subclones can be used. It provides
+sensible default values for all the fields and checks that the
+data specified is valid. The program can be run by any user from any
+directory. It assumes that the environment variable SUBCLONES holds
+the full path name of the file ``subclones''.
+
+Here is an example of an update of the subclone file using
+update_subclones. Default values are given in square brackets.
+Responding with a question mark ('?') at any time provides information
+of what is expected.
+
+
+    Maintain subclone database
+
+    Database=/nfs/al/hgmp0/wormpub/BioSW/staden/tables/subclones
+
+    Microtitre dish [e50] ?
+    Cosmid [F54C8] ?
+    Method of fragmentation [shear] ?
+    Vector [M13mp18] ?
+    Size range [1000..2000] ?
+    Comment #1 [] ?
+
+    Here are the values you entered:
+
+    Microtitre dish number: e50
+    Cosmid:                 F54C8
+    Fragmentation method:   shear
+    Vector:                 M13mp18
+    Size range:             1000..2000
+
+    Are these values correct [Yes] ?
+
+
+Once the file has been updated it is important to remake the
+awk program required by squirrel. To do this, type:
+
+	make all
+
+
+
+
+2. Updating information on vectors used to clone each cosmid
+------------------------------------------------------------
+
+Information on which vectors are used for each cosmid can be found in
+the file ``clone_types.awk''. This information is be updated using an
+editor, though it is unlikely that it will change much through the
+course of the sequencing project. The data is only encapsulated in an
+awk program; there is no text file. It is important to update this file
+correctly and to ensure that no errors are introduced into the awk
+program. The program can be tested for errors by typing something
+like:
+
+	echo TEST | awk -f clone_types.awk
+
+
+
+
+3. Updating information on sequencing vector clone sites
+--------------------------------------------------------
+
+Information on sequencing vector clone sites is held in the file
+``seqclone-sites'', and is updated by manually editing the text file.
+The format of the data is:
+
+	<clone> <clone-site> <forward-primer-site> <reverse-primer-site>
+
+An example:
+	pBS     2305 46 -79
+
+The positions relate to the sequence held in the file specified in the
+``vector-seqfiles'' files. The clone-site is defined as the base
+position of the 3' end of the cloning site The primer sites are positions
+relative to this.
+
+To make the awk program required by squirrel, type:
+	make all
+
+It is important that this information is correct and that the sequence in
+the sequence file is in the correct orientation. Otherwise, the vector
+will not be successfully removed from sequence processed by squirrel.
+
+
+
+
+
+4. Updating information on files containing vector sequence
+-----------------------------------------------------------
+
+Information of the files containing vector sequence is help in the file
+``vector-seqfiles'', and is updated by manually editing the text file.
+The format is:
+
+	<vector> <file-name>
+
+An example:
+	pBS     /nfs/al/pubseq/pubseq/inhouse/sun-nematode/squirrel/seqs/blue.vep
+
+To make the awk program required by squirrel, type:
+	make all
+
+Note that the file name includes the full path name. All working versions
+of sequence files required by squirrel are kept in the $SQUIRREL/seqs
+directory.
+
+
+
+
+5. Updating information on user's databases
+-------------------------------------------
+
+User's databases must be registered with squirrel by including an
+entry for the project in the file ``databases''. The format is:
+
+	<cosmid> <directory> <name> <version> <database_type> <owner>
+
+An example:
+	C38C10 C38C10 C38C10 0 bap kt
+
+The lookup file is remade by typing:
+	make all
+
+The directory is relative to the owner's home directory. The database type
+is currently one of "dap" or "bap". So the database for kt's cosmid C38C10
+is ~kt/C38C10/C38C10.??0.
+
+
+A1. Vectors used to clone each cosmid
+-------------------------------------
+
+#clone_types.awk
+#
+# Determine Clone types from Clone names
+#
+# 3 February 1992, Simon Dear
+#
+# Assumes that there is one clone name per line.
+# Outputs to stdout clone vector.
+#
+# Data in this program has been taken from "Clone types in the genome map
+# of Caenorhabditis elegans" courtesy by John Sulston. This needs
+# revision.
+#
+# Possible usage:
+#	 set clone-type = `echo $clone | awk -f clone-types.awk`
+#
+/^A/		{ print "Lambda"; next; }
+/^ZL/		{ print "Lambda"; next; }
+/^YSL/		{ print "Lambda"; next; }
+/^[BCDERM]/	{ print "pJB8"; next; }
+/^ZC/		{ print "pJB8"; next; }
+/^K/		{ print "LoristB"; next; }
+/^T/		{ print "Lorist2"; next; }
+/^W/		{ print "Lorist4"; next; }
+/^F/		{ print "Lorist6"; next; }
+/^ZK/		{
+			num = substr($1,3) + 0.0;
+			if (num <   15) { print "unknown"; next; }
+			if (num <=  56) { print "Lorist6"; next; }
+			if (num <= 130) { print "Lorist2"; next; }
+			if (num <= 164) { print "Lorist6"; next; }
+			if (num <= 177) { print "Lorist2"; next; }
+			if (num <= 192) { print "Lorist6"; next; }
+			if (num <= 344) { print "Lorist2"; next; }
+			if (num <= 354) { print "Lorist6"; next; }
+			if (num <= 514) { print "Lorist2"; next; }
+			if (num <= 552) { print "Lorist6"; next; }
+			if (num <= 596) { print "Lorist2"; next; }
+			if (num <= 616) { print "Lorist?"; next; }
+			if (num <= 626) { print "Lorist2"; next; }
+			if (num <= 692) { print "Lorist6"; next; }
+			if (num <= 756) { print "Lorist2"; next; }
+			if (num >= 757) { print "Lorist6"; next; }
+		}
+/^Y/		{ print "pYAC4"; next; }
diff --git a/src/squirrel-1.4/report/README b/src/squirrel-1.4/report/README
new file mode 100644
index 0000000..66977c7
--- /dev/null
+++ b/src/squirrel-1.4/report/README
@@ -0,0 +1,25 @@
+				SQUIZZ
+		     A Squirrel Reporting Program
+
+			      Simon Dear
+			      7 May 1993
+
+
+
+0. Introduction
+
+Squizz analyses squirrel reports and produces a breakdown summary
+detailing how many readings were assembled and failed, and for what
+reasons. The average reading length of processed data is also
+calculated.
+
+
+1. Usage
+
+Run squizz in your project directory. Just type:
+
+	$SQUIRREL/report/squizz
+
+The breakdown summary will be produces quite quickly. The report on
+average reading lengths will take a little longer.
+
diff --git a/src/squirrel-1.4/report/a.awk b/src/squirrel-1.4/report/a.awk
new file mode 100644
index 0000000..139f0a5
--- /dev/null
+++ b/src/squirrel-1.4/report/a.awk
@@ -0,0 +1,16 @@
+BEGIN { dir = "unknown"; pass = "unknown" }
+/^# Processing/ {
+	dir = substr($0,14);
+}
+
+/^# Pass/ {
+	pass = $3;
+}
+
+/^#/ {next;}
+
+{ 
+	colon = index($0,":");
+        sample = substr($0,1,colon - 1);
+	rest = substr($0,colon+1);
+	print dir, sample, ":", pass, ":", rest; }
diff --git a/src/squirrel-1.4/report/b.awk b/src/squirrel-1.4/report/b.awk
new file mode 100644
index 0000000..a9ba749
--- /dev/null
+++ b/src/squirrel-1.4/report/b.awk
@@ -0,0 +1,7 @@
+BEGIN { FS = ":"; }
+{
+	if (lastkey != $1 && lastkey != "") print last;
+	lastkey = $1;
+	last = $0;
+}
+END { print last; }
diff --git a/src/squirrel-1.4/report/c.awk b/src/squirrel-1.4/report/c.awk
new file mode 100644
index 0000000..653ff4e
--- /dev/null
+++ b/src/squirrel-1.4/report/c.awk
@@ -0,0 +1,29 @@
+{
+	class = $2;
+	$1 = "" ; $2 = "" ;
+	reason = substr($0,3);
+	count++;
+	ccount[class]++;
+	rcount[class reason]++;
+	reazon[class reason] = reason;
+	klass[class reason] = class;
+}
+
+END {
+	for (c in ccount) {
+		if (c=="") {
+			print "unknown:"
+		} else {
+			print c ":"
+		}
+		for ( cr in rcount ) {
+			if (klass[cr] == c) {
+				if (reazon[cr] == "") reazon[cr]="unknown";
+				printf "   %-64s%4d\n",reazon[cr],rcount[cr];
+			}
+		}
+		printf "   %-64s%4d\n","TOTAL", ccount[c];
+		print "";
+	}
+	printf "%-64s   %4d\n", "TOTAL PROCESSED",count;
+}
diff --git a/src/squirrel-1.4/report/squizz b/src/squirrel-1.4/report/squizz
new file mode 100644
index 0000000..a7de940
--- /dev/null
+++ b/src/squirrel-1.4/report/squizz
@@ -0,0 +1,47 @@
+#! /bin/csh -f
+
+onintr abort
+
+set A = $SQUIRREL/report/a.awk
+set B = $SQUIRREL/report/b.awk
+set C = $SQUIRREL/report/c.awk
+
+if ( ! -d Logs ) then
+    echo "No nut hoards for squirrel report"
+    exit 1
+endif
+
+#
+# Temp files
+#
+set S_all = /tmp/S_all.$$
+set S_oks = /tmp/S_oks.$$
+set S_lens = /tmp/S_lens.$$
+
+#
+# Remove temporary files
+#
+/bin/rm -f $S_all $S_oks $S_lens
+
+# Distill information from several passes
+awk -f $A Logs/log* | sort | awk -f $B | sed 's/[^:]*:[^:]*: *//' > $S_all
+
+# Main report
+awk -f $C $S_all
+
+# Determine average lengths
+awk '$2=="OK" {print $1;}' $S_all > $S_oks
+touch $S_lens
+foreach files ( `cat $S_oks` )
+    if (-e $files) head -1 $files | awk '{print substr($0,14,6);}' >> $S_lens
+end
+echo ""
+echo  "AVERAGE READING LENGTH OF ASSEMBLED READS ---->" `awk '{s += $1;} END {print s/NR,"(" NR ")"; }' $S_lens`
+
+abort:
+#
+# Remove temporary files
+#
+/bin/rm -f $S_all $S_oks $S_lens
+
+exit 0
diff --git a/src/squirrel-1.4/scripts/ABI-exp-files.csh b/src/squirrel-1.4/scripts/ABI-exp-files.csh
new file mode 100644
index 0000000..a6f3726
--- /dev/null
+++ b/src/squirrel-1.4/scripts/ABI-exp-files.csh
@@ -0,0 +1,107 @@
+#! /bin/csh -f
+#
+# ABI-exp-files
+#   Creates an experiment file for each trace file in the current
+#   directory, and a file of filenames for them.
+#
+#   Usage:
+#      ABI-exp-files session-file fofn_in fofn_out
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+# Usage
+if ( $#argv != 3 ) then
+	echo "Usage: ABI-exp-files session-file fofn_in fofn_out" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+set sessfile = $1
+if (! -e $sessfile) then
+	echo "ABI-exp-files: Session file does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( ! -e $2 ) then
+	echo "ABI-exp-files: File of file names does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+set filenames = $3
+if ( -e $filenames) /bin/rm -f $filenames
+touch $filenames
+
+# Directories
+if ( $?SQUIRREL_SUB ) then
+	set TABLES_DIR   = $SQUIRREL/tables-$SQUIRREL_SUB
+else
+	set TABLES_DIR   = $SQUIRREL/tables
+endif
+
+# Programs and scripts
+set GET_ABI_DATE  = $SQUIRREL/scripts/ABI-gel-date.awk
+set MAKE_SCF_FILE = $SQUIRREL/bin/makeSCF
+
+# Process each file
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Creating experiment file for:" | tee -a "$SQUIRREL_LOG"
+foreach expname ( `cat $2` )
+
+	# specify file names
+	set file = `awk 'BEGIN {FS=":";} $2 == "'$expname'" { s = $1; } END { print s;}' ${sessfile}`
+	set expfile = $expname
+	set tracefile = ${expfile}SCF
+
+	# log
+	echo $expname | tee -a "$SQUIRREL_LOG"
+
+	# ensure the experiment doesn't already exist in this directory
+	if ( -e $expfile ) /bin/rm -f $expfile
+
+	# create experiment file
+	touch $expfile
+
+	# include obvious information
+	echo "ID   $expname" >> $expfile
+	echo "EN   $expname" >> $expfile
+	echo "MN   $file" >> $expfile
+	echo "MT   ABI" >> $expfile
+
+	# Make standard chromatogram file
+	$MAKE_SCF_FILE -ABI "$file" -output $tracefile
+	if ( $status ) then
+		echo Discarded: ${expfile}: Failed to create SCF trace file | tee -a "$SQUIRREL_LOG"
+		/bin/rm -f $tracefile
+		goto skip
+	endif
+	echo "LN   $tracefile" >> $expfile
+	echo "LT   SCF" >> $expfile
+	
+	# get date information
+	# YUK! next to useless
+	#set DT = `echo $date | awk -f $GET_ABI_DATE`
+	#if ( $DT == "" ) set DT = `date +%d-%h-19%y`
+	set DT = `date +%d-%h-19%y`
+	echo "DT   $DT" >> $expfile
+	# YUK! next to useless
+	# echo MC "ABI-$machine" >> $expfile
+	echo "CC   Dir: $cwd" >> $expfile
+
+	# update file of file names
+	echo $expfile >> $filenames
+
+# skip to here on error
+skip:
+
+end
+
+exit 0
+
+
+abort:
+#
+# interruption - not successful completion
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/ABI-gel-date.awk b/src/squirrel-1.4/scripts/ABI-gel-date.awk
new file mode 100644
index 0000000..d6756f6
--- /dev/null
+++ b/src/squirrel-1.4/scripts/ABI-gel-date.awk
@@ -0,0 +1,24 @@
+#
+# A little program to convert dates of the format MM_DD_YY
+# into the format DD-Mon-YEAR
+#
+# EG 7_9_92 -> 9-Jul-1992
+#
+BEGIN {FS="_"; month = "Unknown"; }
+$1==1  {month = "January";}
+$1==2  {month = "February";}
+$1==3  {month = "March";}
+$1==4  {month = "April";}
+$1==5  {month = "May";}
+$1==6  {month = "June";}
+$1==7  {month = "July";}
+$1==8  {month = "August";}
+$1==9  {month = "September";}
+$1==10 {month = "October";}
+$1==11 {month = "November";}
+$1==12 {month = "December";}
+{ 
+	if (month != "Unknown" && NF == 3) {
+		print $2 "-" substr(month,1,3) "-19" substr($3,1,2);
+	}
+}
diff --git a/src/squirrel-1.4/scripts/ABI-sess-file.csh b/src/squirrel-1.4/scripts/ABI-sess-file.csh
new file mode 100644
index 0000000..b23ac52
--- /dev/null
+++ b/src/squirrel-1.4/scripts/ABI-sess-file.csh
@@ -0,0 +1,86 @@
+#! /bin/csh -f
+#
+# ABI-sess-file
+#   Creates a session file for the files in the current working directory
+#
+#   Usage:
+#      ABI-sess-file session-file
+#
+# 22-Jan-93
+#     Also picks out files of format "Sample [0-9][0-9]{,.[0-9]}"
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+# Usage
+if ( $#argv != 1 ) then
+	echo "Usage: ABI-sess-file session-file" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+set filenames = $1
+if (-e $filenames) /bin/rm -f $filenames
+touch $filenames
+
+# Programs and scripts
+set GET_ABI_NAME  = $SQUIRREL/bin/getABISampleName
+
+#
+# Temporary files
+#
+set file_list = $1
+
+#
+# Create session file header
+#
+if ( -e ${file_list} ) /bin/rm -f ${file_list}
+touch ${file_list}
+head -7 "$SQUIRREL_LOG" | sed 's/^/# /' >> ${file_list}
+cat << EOF >> ${file_list}
+#
+# You may...
+#  + delete lines you don't wish to be processed
+#  + change the experiment name
+# But DON'T...
+#  - create additional lines
+#  - introduce white space
+#
+EOF
+
+#
+# Build a list of files in this directory and experiment names
+#
+echo "" | tee -a "$SQUIRREL_LOG"
+echo Determining samples to be processed | tee -a "$SQUIRREL_LOG"
+foreach file ( Sample*[0-9][0-9]{,.[0-9]} )
+	echo -n $file "- " | tee -a "$SQUIRREL_LOG"
+
+	# tidy up as we go - delete Seq files
+	/bin/rm -f "$file.Seq"
+
+	# get sample name
+	set abiname = `$GET_ABI_NAME "$file"`
+	if ( $status ) then
+		echo "" | tee -a "$SQUIRREL_LOG"
+		echo Discarded: : Cannot determine sample name for $file | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# beautify it
+	set expname = `echo $abiname | sed 's/ /_/g'`
+	echo $expname | tee -a "$SQUIRREL_LOG"
+
+	echo $file":"$expname >> ${file_list}
+
+skip:
+end
+
+exit 0
+
+
+abort:
+#
+# interruption - not successful completion
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/README b/src/squirrel-1.4/scripts/README
new file mode 100644
index 0000000..4c24499
--- /dev/null
+++ b/src/squirrel-1.4/scripts/README
@@ -0,0 +1,18 @@
+ Dir: $SQUIRREL/scripts/
+Date: Fri May 07 12:13:12 BST 1993
+
+This directory contains all the scripts required by squirrel, plus a
+few others. The following concern the users of squirrel directly:
+
+squirrel
+	This is the main script that drives squirrel. It should be
+	placed in the user's project directory and edited
+	appropriately.
+
+packhorse
+	This is for merging multiple ABI results folders into one, so
+	that squirrel can process them all in one pass.
+
+clean
+	This removes results folders, but only if there appears to be
+	no files left unprocessed.
diff --git a/src/squirrel-1.4/scripts/assemble-error-codes b/src/squirrel-1.4/scripts/assemble-error-codes
new file mode 100644
index 0000000..e053328
--- /dev/null
+++ b/src/squirrel-1.4/scripts/assemble-error-codes
@@ -0,0 +1,17 @@
+0  file not found
+	The file in the file of file names was not found.
+
+1  read too short
+	The read was shorter than the minimum match length.
+
+2  failed to align and not entered
+	The read had minimum match length hits with more than one
+	reading and didn't aligned with any.
+
+3  failed on entry
+	The read already exists in the database.
+
+4  failed to align but entered
+	The read was to make a join, but after being incorporated into
+	one contig, it could not be aligned successfully with the other
+	within the assembly constraints.
diff --git a/src/squirrel-1.4/scripts/assemble-files.csh b/src/squirrel-1.4/scripts/assemble-files.csh
new file mode 100644
index 0000000..9730d52
--- /dev/null
+++ b/src/squirrel-1.4/scripts/assemble-files.csh
@@ -0,0 +1,373 @@
+#! /bin/csh -f
+#
+# assemble-files
+#
+#   Script to assemble data into a database using a file of file names.
+#   Output from the initial assembly is directed to a file that is
+#   deleted, hence nothing appears on the screen. The failed files are
+#   then recompared with the output directed to another file. This
+#   file is printed then both output files are deleted.
+# 
+# Usage:
+#   assemble-files project file_of_filenames
+#
+# 27-May-1992 Set consensus cutoff percentage to 51 for assembly
+# 15-Oct-1992 PROJECT was not set to information in databases table!
+#	      Rather - it was set to $1 ("project" name)
+# 22-Oct-1992 Should delete busy file for working version
+# 05-Jan-1993 Check copying to working version is successful
+# 13-Jan-1993 FAILURES_PASS1 is the important one when reporting
+#             errors, not FAILURES_PASS2
+# 18-May-1993 **** this script will only work with bap version 12.1 or greater ****
+#	      Bap auto assembly now prompts for "Hide alignments?" (after "Permit entry?")
+#	      We should respond "Y" first pass, "N" the second
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+if ( $#argv != 2 ) then
+	echo "Usage: assemble-files project file_of_filenames" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+if  ( ! -f $2 ) then
+	echo "assemble-files: File of file names not found" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Commands used in this script
+#
+set RM = "/bin/rm -f"
+set CP = "/bin/cp"
+set MV = "/bin/mv"
+
+
+
+
+# Directories
+if ( $?SQUIRREL_SUB ) then
+	set TABLES_DIR   = $SQUIRREL/tables-$SQUIRREL_SUB
+else
+	set TABLES_DIR   = $SQUIRREL/tables
+endif
+
+
+
+
+#
+# Gather project infomation from lookup
+#
+set GET_LOOK_UP  = $TABLES_DIR/lookup.csh
+set db_info = `$GET_LOOK_UP "$1" $TABLES_DIR/databases.lookup`
+if ( $#db_info == 0 ) then
+	echo assemble-files: No database information for $1 | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+echo "" | tee -a "$SQUIRREL_LOG"
+echo Assembling data from file of filenames $2 into database $1 | tee -a "$SQUIRREL_LOG"
+echo "" | tee -a "$SQUIRREL_LOG"
+
+#
+# database versions
+#
+set PROJECT = $db_info[3]
+set V_CURRENT  = $db_info[4]
+set V_PREVIOUS = "1"
+set V_WORKING  = "X"
+
+#
+# Check for database's existence
+#
+if (! -e ${PROJECT}.SQ${V_CURRENT}) then
+	echo assemble-files: Cannot find database ${PROJECT} version ${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Check database isn't already busy
+#
+if (-e ${PROJECT}_BUSY${V_CURRENT}) then
+	echo "assemble-files: Version ${V_CURRENT} of the database ${PROJECT} is currently being used" | tee -a "$SQUIRREL_LOG"
+	echo "                Processing has been suspended - Please try again later" | tee -a "$SQUIRREL_LOG" 
+	echo "                (running squirrel again will allow you to resume from this point)" | tee -a "$SQUIRREL_LOG" 
+	exit 1
+endif
+
+
+#
+# Delete old working version
+#
+$RM ${PROJECT}.RL${V_WORKING}
+$RM ${PROJECT}.SQ${V_WORKING}
+$RM ${PROJECT}.AR${V_WORKING}
+$RM ${PROJECT}.CC${V_WORKING}
+$RM ${PROJECT}.TG${V_WORKING}
+$RM ${PROJECT}_BUSY${V_WORKING}
+
+#
+# Make a copy of the current version
+#
+# YUK! should check that each copy succeeds
+$CP ${PROJECT}.RL${V_CURRENT} ${PROJECT}.RL${V_WORKING}
+if ( $status ) then
+	echo assemble-files: Error copying database file ${PROJECT}.RL${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+$CP ${PROJECT}.SQ${V_CURRENT} ${PROJECT}.SQ${V_WORKING}
+if ( $status ) then
+	echo assemble-files: Error copying database file ${PROJECT}.SQ${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+$CP ${PROJECT}.AR${V_CURRENT} ${PROJECT}.AR${V_WORKING}
+if ( $status ) then
+	echo assemble-files: Error copying database file ${PROJECT}.AR${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+$CP ${PROJECT}.CC${V_CURRENT} ${PROJECT}.CC${V_WORKING}
+if ( $status ) then
+	echo assemble-files: Error copying database file ${PROJECT}.CC${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+$CP ${PROJECT}.TG${V_CURRENT} ${PROJECT}.TG${V_WORKING}
+if ( $status ) then
+	echo assemble-files: Error copying database file ${PROJECT}.TG${V_CURRENT} | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Temporary files for assembly
+#
+set FAILURES_PASS1 = _fail.$$
+set FAILURES_PASS2 = _fail2.$$
+set ASSEMBLE_LOG = _show.$$
+set REPORT = _show2.$$
+
+#
+# assembly parameters
+#
+set DEFAULT = ""
+set CUT_OFF = 51
+set VECTOR_SCORE = 20
+set ASSEMBLY_SCORE = 20
+set MAX_READ_PADS = 25
+set MAX_CONTIG_PADS = 25
+set MAX_MISMATCH = 8
+
+#
+# Determine assembly program
+# ?? Why do this? Why not assume $db_info[5]
+#
+switch ( $db_info[5] )
+case "bap":
+	#set ASSEMBLE = /nfs/al/pubseq/pubseq/inhouse/sun-nematode/bin/bap
+	set ASSEMBLE = bap
+	breaksw
+case "dap":
+	echo assemble-files: Squirrel does not supoport database type $db_info[5] | tee -a "$SQUIRREL_LOG"
+	exit 1
+	##set ASSEMBLE = /nfs/al/pubseq/pubseq/inhouse/sun-nematode/bin/dap
+	#set ASSEMBLE = dap
+	#breaksw
+deault:
+	echo assemble-files: Unknown database type $db_info[5] | tee -a "$SQUIRREL_LOG"
+	exit 1
+endsw
+
+#
+# ASSEMBLE
+#
+# ---------------------------------ASSEMBLE---------------------------
+$ASSEMBLE << EOF
+3
+y
+${PROJECT}
+${V_WORKING}
+
+27
+${DEFAULT}
+${CUT_OFF}
+${DEFAULT}
+${DEFAULT}
+${DEFAULT}
+
+20
+Y
+y
+Y
+$2
+${FAILURES_PASS1}
+1
+Y
+${ASSEMBLY_SCORE}
+${MAX_READ_PADS}
+${MAX_CONTIG_PADS}
+${MAX_MISMATCH}
+
+7
+1
+${ASSEMBLE_LOG}
+
+20
+y
+n
+y
+${FAILURES_PASS1}
+${FAILURES_PASS2}
+1
+y
+${ASSEMBLY_SCORE}
+${MAX_READ_PADS}
+${MAX_CONTIG_PADS}
+${MAX_MISMATCH}
+
+25
+!
+
+7
+y
+
+!
+EOF
+# =================================ASSEMBLE===========================
+
+#
+# Check for successful completion
+#
+if ( $status ) then
+	echo assemble-files: Error during assembly | tee -a "$SQUIRREL_LOG"
+	$RM ${PROJECT}.RL${V_WORKING}
+	$RM ${PROJECT}.SQ${V_WORKING}
+	$RM ${PROJECT}.AR${V_WORKING}
+	$RM ${PROJECT}.CC${V_WORKING}
+	$RM ${PROJECT}.TG${V_WORKING}
+	exit 1
+endif
+
+#
+# Switch databases 
+#
+# old copy to save copy
+$RM ${PROJECT}.RL${V_PREVIOUS}
+$RM ${PROJECT}.SQ${V_PREVIOUS}
+$RM ${PROJECT}.AR${V_PREVIOUS}
+$RM ${PROJECT}.CC${V_PREVIOUS}
+$RM ${PROJECT}.TG${V_PREVIOUS}
+# keep an old copy of the current version
+$MV ${PROJECT}.RL${V_CURRENT} ${PROJECT}.RL${V_PREVIOUS}
+$MV ${PROJECT}.SQ${V_CURRENT} ${PROJECT}.SQ${V_PREVIOUS}
+$MV ${PROJECT}.AR${V_CURRENT} ${PROJECT}.AR${V_PREVIOUS}
+$MV ${PROJECT}.CC${V_CURRENT} ${PROJECT}.CC${V_PREVIOUS}
+$MV ${PROJECT}.TG${V_CURRENT} ${PROJECT}.TG${V_PREVIOUS}
+# make the new version current
+$MV ${PROJECT}.RL${V_WORKING} ${PROJECT}.RL${V_CURRENT}
+$MV ${PROJECT}.SQ${V_WORKING} ${PROJECT}.SQ${V_CURRENT}
+$MV ${PROJECT}.AR${V_WORKING} ${PROJECT}.AR${V_CURRENT}
+$MV ${PROJECT}.CC${V_WORKING} ${PROJECT}.CC${V_CURRENT}
+$MV ${PROJECT}.TG${V_WORKING} ${PROJECT}.TG${V_CURRENT}
+
+#
+# Determine assembly failures and successes
+#
+set temp_in = _temp_in.$$
+set temp_out = _temp_out.$$
+set temp_out2 = _temp_out2.$$
+tr -d ' ' < $2 | sort > ${temp_in}
+# no stripping of space needes as done by awk
+sort ${FAILURES_PASS1} > ${temp_out}
+# Strip out error codes
+awk '{print $1;}' ${temp_out} > ${temp_out2}
+set expfiles = `cat ${temp_out2}`
+set errcodes = `awk '{if (NF==2)print $2;else print -1;}' ${temp_out}`
+
+while ( $#expfiles )
+
+	set expname = $expfiles[1]
+	set expfile = Expfiles/$expfiles[1]
+	set errcode = $errcodes[1]
+
+	switch(${errcode})
+	case 0:
+		set reason = " : file not found"
+		breaksw;
+	case 1:
+		set reason = " : read too short"
+		breaksw;
+	case 2:
+		set reason = " : failed to align and not entered"
+		breaksw;
+	case 3:
+		set reason = " : failed on entry"
+		breaksw;
+	case 4:
+		set reason = " : failed to align but entered"
+		breaksw;
+	default:
+		set reason = "" #Unknown
+		breaksw;
+	endsw
+
+	if ( ${errcode} == 4 ) then
+		# This actually assembled
+		echo OK: ${expname}: Assembled $reason | tee -a "$SQUIRREL_LOG"
+	else
+		echo Rejected: ${expname}: Did not assemble $reason | tee -a "$SQUIRREL_LOG"
+		if (-e ${expfile}) echo "PS   Did not assemble $reason" >> ${expfile}
+	endif
+
+	shift expfiles
+	shift errcodes
+end
+
+#
+# Determine passes
+#
+foreach expfile (`comm -23 ${temp_in} ${temp_out2}`)
+	echo OK: ${expfile}: Assembled | tee -a "$SQUIRREL_LOG"
+end
+
+#
+# Remove temporary files
+#
+$RM ${temp_in} ${temp_out} ${temp_out2}
+
+#
+# Produce a report
+#
+cat ${FAILURES_PASS1} >> ${PROJECT}.FAILS
+echo Assembly for project ${PROJECT} on `date` >> ${REPORT}
+echo In directory `pwd` >> ${REPORT}
+echo Input file of file names $2 >> ${REPORT}
+echo Parameters used >> ${REPORT}
+echo Minimum initial match = ${ASSEMBLY_SCORE} >> ${REPORT}
+echo Maximum pads in each reading = ${MAX_READ_PADS} >> ${REPORT}
+echo Maximum pads in contig = ${MAX_CONTIG_PADS} >> ${REPORT}
+echo Maximum percent mismatch = ${MAX_MISMATCH} >> ${REPORT}
+
+echo "List of entry failures ">> ${REPORT}
+cat ${FAILURES_PASS1} >> ${REPORT}
+
+echo "Alignments for entry failures " >> ${REPORT}
+cat ${ASSEMBLE_LOG} >> ${REPORT}
+
+lpr ${REPORT}
+
+
+#
+# Remove temporary files
+#
+$RM ${FAILURES_PASS1} ${FAILURES_PASS2} ${ASSEMBLE_LOG} ${REPORT}
+
+exit 0
+
+
+
+abort:
+
+#
+# interruption - not successful conclusion
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/check-exp-exists.csh b/src/squirrel-1.4/scripts/check-exp-exists.csh
new file mode 100644
index 0000000..582c179
--- /dev/null
+++ b/src/squirrel-1.4/scripts/check-exp-exists.csh
@@ -0,0 +1,85 @@
+#! /bin/csh -f
+#
+# check-exp-exists
+#   Checks that experiments don't already exist in project
+#   directory
+#
+#   Usage:
+#      check-exp-exists project-directory session_file out_fofn
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 3 ) then
+	echo "Usage: check-exp-exists project-directory session_file out_fofn" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+
+if ( ! -d $1 ) then
+	echo "check-exp-exists: Project directory $1 does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( ! -e $2 ) then
+	echo "check-exp-exists: Session file $2 does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( -e $3 ) then
+	echo "check-exp-exists: File of file names $3 already exists" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Save typing if need to change
+#
+set FAILURES = $1/Failures
+set EXPFILES = $1/Expfiles
+
+#
+# Determine experiment file list to process this time
+#
+set temp_fofn = _temp_fofn
+if ( -e ${temp_fofn} ) /bin/rm -f ${temp_fofn}
+awk 'BEGIN {FS = ":";} /^#/ {next;} {print $2;}' $2 > ${temp_fofn}
+
+
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Checking files don't alreay exist in $1" | tee -a "$SQUIRREL_LOG"
+touch $3
+foreach expname ( `cat ${temp_fofn}` )
+
+	set file = `awk 'BEGIN {FS=":";} $2 == "'$expname'" { s = $1; } END { print s;}' $2`
+
+	# details for log
+	echo "**" $expname from $file | tee -a "$SQUIRREL_LOG"
+
+	# Check expname doesn't already exist
+	if ( -e $1/$expname || -e ${EXPFILES}/$expname || -e ${FAILURES}/$expname ) then
+		echo Abandoned: ${expname}: Experiment already processed | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	echo $expname >> $3
+	
+skip:
+end
+
+#
+# Remove temporary files
+#
+/bin/rm -f ${temp_fofn}
+
+exit 0
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/check-quality.csh b/src/squirrel-1.4/scripts/check-quality.csh
new file mode 100644
index 0000000..6cb59e0
--- /dev/null
+++ b/src/squirrel-1.4/scripts/check-quality.csh
@@ -0,0 +1,63 @@
+#! /bin/csh -f
+#
+# check-quality
+#   Check a file of file name for quality
+#
+#   Usage:
+#      create-exp-files fofn_in fofn_out
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+# Usage
+if ( $#argv != 2 ) then
+	echo "Usage: check-quality fofn_in fofn_out" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+if ( ! -e $1 ) then
+	echo check-quality: file of file names $1 does not exist | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+if ( -e $2 ) then
+	echo check-quality: output file of file names $1 already exists | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+# Programs and scripts
+set CHECK_QUALITY = $SQUIRREL/scripts/quality-check.awk
+
+#
+touch $2
+
+# Process each file
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Checking for quality:" | tee -a "$SQUIRREL_LOG"
+foreach file ( `cat $1` )
+
+	# Do a quality check
+	set qc = `awk -f $CHECK_QUALITY $file`
+	if ($qc != "0") then
+		echo Failed: ${file}: Trace quality | tee -a "$SQUIRREL_LOG"
+		echo "PS   Trace quality" >> ${file}
+		goto skip
+	endif
+
+	# update file of file names
+	echo $file >> $2
+
+# skip to here on error
+skip:
+
+end
+
+
+exit 0
+
+
+abort:
+#
+# interruption - not successful completion
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/clean b/src/squirrel-1.4/scripts/clean
new file mode 100644
index 0000000..0a68759
--- /dev/null
+++ b/src/squirrel-1.4/scripts/clean
@@ -0,0 +1,48 @@
+#! /bin/csh -f
+#
+# File: clean
+# Version: 1.1
+#
+# Author: Simon Dear
+#         MRC Laboratory of Molecular Biology
+#	  Hills Road
+#	  Cambridge CB2 2QH
+#	  United Kingdom
+#
+# Created: 19 January 1993
+# Updated:
+#
+# Description:
+#       C shell script to remove empty results folders
+#
+# Usage:
+#	clean dir [dir...]
+#
+
+set diri = 1
+set dirn = $#argv
+while ( $diri <= $dirn )
+    set dir = "$argv[$diri]"
+
+    if (! -e "$dir") then
+	echo $dir does not exist
+	goto skip
+    else if (! -d "$dir") then
+	echo "$dir" is not a directory
+	goto skip
+    endif
+
+    # remove directory, but only if no files left
+    # >150 0.5k disk blocks is arbitrary, but should trap any valid ABI files
+    set big_files = `find "$dir" -size +150 -print`
+    if ( $#big_files == 0 ) then
+	echo Removing directory "$dir"
+	/bin/rm -r "$dir"
+    else
+	echo Not removing directory "$dir" - large files present
+    endif
+
+skip:
+    @ diri ++
+end
+
diff --git a/src/squirrel-1.4/scripts/clip-exp.awk b/src/squirrel-1.4/scripts/clip-exp.awk
new file mode 100644
index 0000000..7e4eb6e
--- /dev/null
+++ b/src/squirrel-1.4/scripts/clip-exp.awk
@@ -0,0 +1,101 @@
+#
+# Clip poor quality sequence at the left (5') and right (3') ends
+# on basis of sequence content - filter version
+#
+# Rule:
+#	To the right of base number SR and to the left of base SL
+#       everything is good quality before where there are NN
+#	uncertainties in MM window, in the base range LL..RR 
+#       At the right hand end only, ALWAYS knock back KR bases
+#
+# St.Louis use NN=2, MM=5, SL=1, LL=1, SR=200, RR=sequence_length, KR=0
+#
+# Outputs just the new clip line in experiment file format
+#
+
+BEGIN {
+	# Up to where NN uncertainties in window of MM bases
+	NN = 2;
+	MM = 5;
+	SL = 100
+	SR = 200
+	LL = 1;
+	RR = 450;
+	KR = 50;
+}
+
+/^\/\// {
+	slines = 0;
+}
+
+slines==1 {
+	# sequence may be broken up by white space
+	for (frag=1;frag<=NF;frag++) seq = seq $frag;
+}
+	
+
+slines==0 && $1 == "SQ" {
+	slines = 1;
+	seq = "";
+}
+
+{
+	print;
+}
+
+END {
+	lseq = length(seq);
+	if (RR > lseq) RR = lseq;
+	if (LL < 1) LL = 1
+	lcut = LL - 1;
+	rcut = RR+1;
+	if ( NN>0 ) {
+		#
+		# clip to the right
+		#
+		score = 0;
+		# calculate default score
+		for(j=SR-1;j<SR+MM-1 && j<=lseq ; j++)
+			if(j>0 && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SR; j<=RR; j++) {
+			if (j>1 && index("ACGTacgt",substr(seq,j-1,1)) == 0)
+				score--;
+			if (j+MM-1<=lseq && index("ACGTacgt",substr(seq,j+MM-1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				rcut = j;
+				break;
+			}
+		}
+
+		#
+		# clip to the left
+		#
+		score = 0;
+		# calculate default score
+		for(j=SL+1;j>SL-MM+1 && j>0 ; j--)
+			if(j<=lseq && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SL; j>=LL; j--) {
+			if (j<lseq && index("ACGTacgt",substr(seq,j+1,1)) == 0)
+				score--;
+			if (j-MM+1>0 && index("ACGTacgt",substr(seq,j-MM+1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				lcut = j;
+				break;
+			}
+		}
+	}
+
+	# knock back the 3' end
+	rcut = rcut - KR;
+	if (rcut <= lcut) rcut = lcut + 1;
+
+	# output
+	if (lcut >= 1)    print "QL   " lcut;
+	if (rcut <= lseq) print "QR   " rcut;
+}
diff --git a/src/squirrel-1.4/scripts/clip-seq-loose.awk b/src/squirrel-1.4/scripts/clip-seq-loose.awk
new file mode 100644
index 0000000..ab0782a
--- /dev/null
+++ b/src/squirrel-1.4/scripts/clip-seq-loose.awk
@@ -0,0 +1,120 @@
+#
+# Clip poor quality sequence at the left (5') and right (3') ends
+# on basis of sequence content
+#
+# Rule:
+#	To the right of base number SR and to the left of base SL
+#       everything is good quality before where there are NN
+#	uncertainties in MM window, in the base range LL..RR 
+#       At the right hand end only, ALWAYS knock back KR bases
+#
+# St.Louis use NN=2, MM=5, SL=1, LL=1, SR=200, RR=sequence_length, KR=0
+#
+# Outputs just the new clip line in experiment file format
+#
+# 2-Jun-1992
+#   If cutoffs already exist only set if they restrict quality further
+#
+
+BEGIN {
+	# Up to where NN uncertainties in window of MM bases
+	NN = 4;
+	MM = 10;
+	SL = 50;
+	SR = 150;
+	LL = 1;
+	RR = 450;
+	KR = 0;
+	# don't change
+	QL = -1;
+	QR = -1;
+}
+
+/^\/\// {
+	slines = 0;
+}
+
+slines==1 {
+	# sequence may be broken up by white space
+	for (frag=1;frag<=NF;frag++) seq = seq $frag;
+}
+	
+
+slines==0 && $1 == "SQ" {
+	slines = 1;
+	seq = "";
+}
+
+slines==0 && $1 == "QL" {
+	QL = $2;
+}
+
+slines==0 && $1 == "QR" {
+	QR = $2;
+}
+
+END {
+	lseq = length(seq);
+	if (RR > lseq) RR = lseq;
+	if (LL < 1) LL = 1
+	lcut = LL - 1;
+	rcut = RR+1;
+	if ( NN>0 ) {
+		#
+		# clip to the right
+		#
+		score = 0;
+		# calculate default score
+		for(j=SR-1;j<SR+MM-1 && j<=lseq ; j++)
+			if(j>0 && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SR; j<=RR; j++) {
+			if (j>1 && index("ACGTacgt",substr(seq,j-1,1)) == 0)
+				score--;
+			if (j+MM-1<=lseq && index("ACGTacgt",substr(seq,j+MM-1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				rcut = j;
+				break;
+			}
+		}
+
+		#
+		# clip to the left
+		#
+		score = 0;
+		# calculate default score
+		for(j=SL+1;j>SL-MM+1 && j>0 ; j--)
+			if(j<=lseq && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SL; j>=LL; j--) {
+			if (j<lseq && index("ACGTacgt",substr(seq,j+1,1)) == 0)
+				score--;
+			if (j-MM+1>0 && index("ACGTacgt",substr(seq,j-MM+1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				lcut = j;
+				break;
+			}
+		}
+	}
+
+	# knock back the 3' end
+	rcut = rcut - KR;
+	if (rcut <= lcut) rcut = lcut + 1;
+
+	# Determine old values
+	if (QR<0) QR = lseq+1;
+	if (QL<0) QL = 0;
+	# Constrain new values to old values
+	if (lcut >= QR) lcut = QR - 1;
+	if (rcut <= QL) rcut = QL + 1;
+	# Print out new values (if they differ)
+	if ( (lcut > QL) || (rcut < QR) ) {
+		print "CC   Quality clipping by clip-seq-loose.awk";
+		if (lcut > QL) print "QL   " lcut;
+		if (rcut < QR) print "QR   " rcut;
+	}
+}
diff --git a/src/squirrel-1.4/scripts/clip-seq.awk b/src/squirrel-1.4/scripts/clip-seq.awk
new file mode 100644
index 0000000..971684c
--- /dev/null
+++ b/src/squirrel-1.4/scripts/clip-seq.awk
@@ -0,0 +1,120 @@
+#
+# Clip poor quality sequence at the left (5') and right (3') ends
+# on basis of sequence content
+#
+# Rule:
+#	To the right of base number SR and to the left of base SL
+#       everything is good quality before where there are NN
+#	uncertainties in MM window, in the base range LL..RR 
+#       At the right hand end only, ALWAYS knock back KR bases
+#
+# St.Louis use NN=2, MM=5, SL=1, LL=1, SR=200, RR=sequence_length, KR=0
+#
+# Outputs just the new clip line in experiment file format
+#
+# 2-Jun-1992
+#   If cutoffs already exist only set if they restrict quality further
+#
+
+BEGIN {
+	# Up to where NN uncertainties in window of MM bases
+	NN = 2;
+	MM = 5;
+	SL = 100
+	SR = 200
+	LL = 1;
+	RR = 450;
+	KR = 50;
+	# don't change
+	QL = -1;
+	QR = -1;
+}
+
+/^\/\// {
+	slines = 0;
+}
+
+slines==1 {
+	# sequence may be broken up by white space
+	for (frag=1;frag<=NF;frag++) seq = seq $frag;
+}
+	
+
+slines==0 && $1 == "SQ" {
+	slines = 1;
+	seq = "";
+}
+
+slines==0 && $1 == "QL" {
+	QL = $2;
+}
+
+slines==0 && $1 == "QR" {
+	QR = $2;
+}
+
+END {
+	lseq = length(seq);
+	if (RR > lseq) RR = lseq;
+	if (LL < 1) LL = 1
+	lcut = LL - 1;
+	rcut = RR+1;
+	if ( NN>0 ) {
+		#
+		# clip to the right
+		#
+		score = 0;
+		# calculate default score
+		for(j=SR-1;j<SR+MM-1 && j<=lseq ; j++)
+			if(j>0 && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SR; j<=RR; j++) {
+			if (j>1 && index("ACGTacgt",substr(seq,j-1,1)) == 0)
+				score--;
+			if (j+MM-1<=lseq && index("ACGTacgt",substr(seq,j+MM-1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				rcut = j;
+				break;
+			}
+		}
+
+		#
+		# clip to the left
+		#
+		score = 0;
+		# calculate default score
+		for(j=SL+1;j>SL-MM+1 && j>0 ; j--)
+			if(j<=lseq && index("ACGTacgt",substr(seq,j,1)) == 0)
+				score++;
+		# loop through
+		for(j=SL; j>=LL; j--) {
+			if (j<lseq && index("ACGTacgt",substr(seq,j+1,1)) == 0)
+				score--;
+			if (j-MM+1>0 && index("ACGTacgt",substr(seq,j-MM+1,1)) == 0)
+				score++;
+			if (score >= NN) {
+				lcut = j;
+				break;
+			}
+		}
+	}
+
+	# knock back the 3' end
+	rcut = rcut - KR;
+	if (rcut <= lcut) rcut = lcut + 1;
+
+	# Determine old values
+	if (QR<0) QR = lseq+1;
+	if (QL<0) QL = 0;
+	# Constrain new values to old values
+	if (lcut >= QR) lcut = QR - 1;
+	if (rcut <= QL) rcut = QL + 1;
+	# Print out new values (if they differ)
+	if ( (lcut > QL) || (rcut < QR) ) {
+		print "CC   Quality clipping by squirrel 8-Jun-1992";
+		if (lcut > QL) print "QL   " lcut;
+		if (rcut < QR) print "QR   " rcut;
+	}
+}
diff --git a/src/squirrel-1.4/scripts/do-it-no-assembly.csh b/src/squirrel-1.4/scripts/do-it-no-assembly.csh
new file mode 100644
index 0000000..f59cfff
--- /dev/null
+++ b/src/squirrel-1.4/scripts/do-it-no-assembly.csh
@@ -0,0 +1,479 @@
+#! /bin/csh -f
+#
+# do_it
+#     Process a batch work of readings
+# 
+# Usage:
+#     do_it transfer_directory project
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+
+# Usage
+if ( $#argv != 2 ) then
+	echo "Usage: do-it transfer_directory project" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+
+set PROJECT = $2
+set transfer_directory = "$1"
+
+if ( ! -d "$transfer_directory" ) then
+	echo "do-it: Bad folder machine or date" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+
+
+# Directories
+if ( $?SQUIRREL_SUB ) then
+	set TABLES_DIR   = $SQUIRREL/tables-$SQUIRREL_SUB
+else
+	set TABLES_DIR   = $SQUIRREL/tables
+endif
+
+
+# Programs and scripts
+set ABI_SESS_FILE    = $SQUIRREL/scripts/ABI-sess-file.csh
+set EDIT_FILES       = $SQUIRREL/scripts/edit-session-file.csh
+set CHECK_FILES      = $SQUIRREL/scripts/check-exp-exists.csh
+set ABI_EXP_FILES    = $SQUIRREL/scripts/ABI-exp-files.csh
+set BUILD_EXP_FILES  = $SQUIRREL/scripts/embellish-exp-files.csh
+set QUALITY_CHECK    = $SQUIRREL/scripts/check-quality.csh
+set VECTOR_EXCISE    = $SQUIRREL/scripts/excise-vector.csh
+set TRANSFER_FILES   = $SQUIRREL/scripts/transfer-files.csh
+set ASSEMBLE_FILES   = $SQUIRREL/scripts/assemble-files.csh
+set SCREEN_FILES     = $SQUIRREL/scripts/screen-against-vector.csh
+set BACKUP_DIR       = $SQUIRREL/scripts/tidy-files.csh
+set LOG_TO_SUMMARY   = $SQUIRREL/scripts/log-to-summary.awk
+
+#
+# Gather information about this project
+#
+set GET_LOOK_UP  = $TABLES_DIR/lookup.csh
+set db_info = `$GET_LOOK_UP $PROJECT $TABLES_DIR/databases.lookup`
+if ( $#db_info == 0 ) then
+	echo do-it: no database information for $PROJECT
+	exit 2
+endif
+# Check things look ok
+if ( ! -e ~$db_info[6]/$db_info[2]/$db_info[3].SQ$db_info[4] ) then
+	echo do-it: Cannot find database ~$db_info[6]/$db_info[2]/$db_info[3] version $db_info[4]
+	exit 2
+endif
+# Check owner ok
+if ( $db_info[6] != $USER ) then
+	echo do-it: User $USER not registered to use database ~$db_info[6]/$db_info[2]/$db_info[3] version $db_info[4]
+	exit 2
+endif
+
+
+#
+# Go to where all the action is
+#
+cd "$transfer_directory"
+
+
+#
+# Determine which pass this is
+#
+set pass_file = .squirrel
+if (! -e ${pass_file}) then
+	set last_pass  = 0
+	set last_phase = completed
+	touch ${pass_file}
+else
+	set last_pass  = `tail -1 ${pass_file} | awk '/^Pass/ {pass = $2;} END {print pass;}'`
+	set last_phase = `tail -1 ${pass_file} | awk '/^Pass/ {phase = $4;} END {print phase;}'`
+endif
+@ pass_count = $last_pass + 1
+
+set resume = 0
+
+if ( "$last_phase" != "completed" ) then
+
+	switch ( "$last_phase" )
+	case started:
+		set phase = "log file creation"; breaksw
+	case session:
+		set phase = "session file creation"; breaksw
+	case edited:
+		set phase = "session file editing"; breaksw
+	case checked:
+		set phase = "checking experiment files don't already exist"; breaksw
+	case created:
+		set phase = "experiment files creation"; breaksw
+	case embellished:
+		set phase = "experiment files embellishment"; breaksw
+	case quality:
+		set phase = "quality checking"; breaksw
+	case excised:
+		set phase = "vector removal"; breaksw
+	case screened:
+		set phase = "screen against vector"; breaksw
+	case transferred:
+		set phase = "file transfer"; breaksw
+	case assembled:
+		set phase = "sequence assembly"; breaksw
+	case reported:
+		set phase = "reporting"; breaksw
+	case archived:
+		set phase = "backing up files"; breaksw
+	default:
+		set phase = "***problem***" ; breaksw
+	endsw
+
+	echo "***"
+	echo "*** WARNING\!\!\!"
+	echo "***"
+	echo "*** SQUIRREL did not successfully complete the last time it was"
+	echo "*** run on this directory"
+	echo "***"
+	echo ""
+	echo "Last phase successfully completed: $phase"
+repeat:
+	echo -n "Do you want to resume the previous pass? "
+	set reply = $<
+	switch ( $reply )
+	case [yY]*:
+		set pass_count = $last_pass
+		set resume = 1;
+		breaksw
+	case [nN]*:
+		set last_phase = "completed"
+		breaksw
+	default:
+		echo "please reply yes or no"
+		goto repeat
+	endsw
+	echo ""
+	echo ""
+
+endif
+
+
+#
+# START
+#
+setenv SQUIRREL_LOG "$transfer_directory"/log.Pass.${pass_count}
+
+
+#
+# Files and directories
+#
+# WARNING : the following line makes assumptions about directory names
+#
+# YUK! YUK! YUK! the following line is aweful
+set temp = ( $1 )
+set session =  $temp[2]$temp[3].${pass_count}
+set sessfile =  sess.$session
+set sessfile2 =  sess.$session.2
+set filenames = fn.$session
+set filenames2 = fn.$session.2
+set filenames3 = fn.$session.3
+set filenames4 = fn.$session.4
+
+#
+# PHASE 0: Start log file
+#
+if ( $last_phase == "completed" ) then
+	if ( -e "$SQUIRREL_LOG" ) /bin/rm -f "$SQUIRREL_LOG"
+	touch "$SQUIRREL_LOG"
+
+	# log this phase
+	set last_phase = started
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Add messages to log
+#
+if ( $resume == 0) then
+	echo SQUIRREL Version 1.4 | tee -a "$SQUIRREL_LOG"
+	echo "" | tee -a "$SQUIRREL_LOG"
+	echo Processing "$transfer_directory" | tee -a "$SQUIRREL_LOG"
+	echo Project $PROJECT | tee -a "$SQUIRREL_LOG"
+	echo Started at `date` | tee -a "$SQUIRREL_LOG"
+        echo Pass ${pass_count} | tee -a "$SQUIRREL_LOG"
+else
+	echo Resumed at `date` | tee -a "$SQUIRREL_LOG"
+        echo Pass ${pass_count} - Phase $phase | tee -a "$SQUIRREL_LOG"
+endif
+
+#
+# Create session file files
+#
+if ( $last_phase == "started" ) then
+	/bin/rm -f $sessfile
+	$ABI_SESS_FILE $sessfile
+	if ($status != 0) then
+		echo "do-it: Error while creating session file" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+
+	# log this phase
+	set last_phase = session
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Allow the user to edit the session file to suit themselves
+#
+if ( $last_phase == "session" ) then
+	/bin/rm -f $sessfile2
+	$EDIT_FILES $sessfile $sessfile2
+	if ( $status ) then
+		echo "do-it: Error while editing session file" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+	/bin/mv $sessfile2 $sessfile
+
+	# log this phase
+	set last_phase = edited
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Filter out experiments already processed
+#
+if ( $last_phase == "edited" ) then
+	/bin/rm -f $filenames
+	$CHECK_FILES ~$db_info[6]/$db_info[2] $sessfile $filenames
+	if ( $status ) then
+		echo "do-it: Error while checking for previous processing" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+
+	# log this phase
+	set last_phase = checked
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Create experiment files from session file
+#
+if ( $last_phase == "checked" ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$ABI_EXP_FILES $sessfile $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while creating experiment files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+		/bin/mv -f $filenames2 $filenames
+	else
+		echo do-it: Skipping creation of experiment files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = created
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Fill out the details on the experiment files
+#
+if ( $last_phase == "created" ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$BUILD_EXP_FILES $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while building experiment files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+		/bin/mv $filenames2 $filenames
+	else
+		echo do-it: Skipping building of experiment files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = embellished
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Remove poor quality reads
+#
+if ( $last_phase == "embellished") then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$QUALITY_CHECK $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while checking quality of reads" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping quality check of files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames $filenames2
+	endif
+
+	# log this phase
+	set last_phase = quality
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Remove vector sequence
+#
+if ( $last_phase == "quality" ) then
+	if ( `wc $filenames2 | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames3
+		$VECTOR_EXCISE $filenames2 $filenames3
+		if ($status != 0) then
+			echo "do-it: Error while excising vectors from files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping excising of vector from files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames2 $filenames3
+	endif
+
+	# log this phase
+	set last_phase = excised
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+
+
+
+#
+# Screen against vector
+#
+if ( $last_phase == "excised" ) then
+	if ( `wc $filenames3 | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames4
+		$SCREEN_FILES $filenames3 $filenames4
+		if ($status != 0) then
+			echo "do-it: Error while screening files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping screening of vector from files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames3 $filenames4
+	endif
+
+	# log this phase
+	set last_phase = screened
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+
+
+
+#
+# Transfer files - some reformating and converting may occur
+# NOTE: The second $filenames is in ~$db_info[6]/$db_info[2] directory
+#
+if ( $last_phase == screened ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f ~$db_info[6]/$db_info[2]/$filenames 
+		$TRANSFER_FILES ~$db_info[6]/$db_info[2] $filenames $filenames 
+		if ($status != 0) then
+			echo "do-it: Error while transferring files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping transfer of files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = transferred
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+#
+# Do the assembly
+#
+if ( $last_phase == transferred ) then
+	pushd ~$db_info[6]/$db_info[2]
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		echo do-it: Skipping assembly of files | tee -a "$SQUIRREL_LOG"
+		foreach expfile (`cat $filenames`)
+			echo OK: ${expfile}: Not yet assembled | tee -a "$SQUIRREL_LOG"
+		end
+	else
+		echo do-it: Skipping assembly of files | tee -a "$SQUIRREL_LOG"
+	endif
+	popd
+
+	# log this phase
+	set last_phase = assembled
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Generate log to tell us what has happened with all the files this pass
+#
+if ( $last_phase == assembled ) then
+	pushd ~$db_info[6]/$db_info[2]
+	if (! -e Logs) mkdir Logs
+	set session_log = log.$session
+	awk -f $LOG_TO_SUMMARY "$SQUIRREL_LOG" > Logs/$session_log
+	/bin/cp Logs/$session_log "$transfer_directory"
+	lpr Logs/$session_log
+	popd
+
+	# log this phase
+	set last_phase = reported
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Back up
+#
+if ( $last_phase == reported ) then
+	$BACKUP_DIR $session_log
+	if ($status != 0) then
+		echo "do-it: Error while tidying up files in transfer directory" | tee -a "$SQUIRREL_LOG"
+		exit 2
+	endif
+
+	# log this phase
+	set last_phase = archived
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Remove temporary files
+#
+/bin/rm -f $sessfile $sessfile2 $filenames $filenames2 $filenames3 $filenames4
+
+
+#
+# Completed successfully
+#
+
+# log this phase
+set last_phase = completed
+echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+
+echo Completed at `date` | tee -a "$SQUIRREL_LOG"
+
+exit 0
+
+
+abort:
+#
+# Interruption not normal termination
+#
+
+if ( $?SQUIRREL_LOG ) then
+    echo Aborted at `date` | tee -a "$SQUIRREL_LOG"
+else
+    echo Aborted at `date`
+endif
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/do-it.csh b/src/squirrel-1.4/scripts/do-it.csh
new file mode 100644
index 0000000..6adca75
--- /dev/null
+++ b/src/squirrel-1.4/scripts/do-it.csh
@@ -0,0 +1,480 @@
+#! /bin/csh -f
+#
+# do_it
+#     Process a batch work of readings
+# 
+# Usage:
+#     do_it transfer_directory project
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+
+# Usage
+if ( $#argv != 2 ) then
+	echo "Usage: do-it transfer_directory project" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+
+set PROJECT = $2
+set transfer_directory = "$1"
+
+if ( ! -d "$transfer_directory" ) then
+	echo "do-it: Bad folder machine or date" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+
+
+# Directories
+if ( $?SQUIRREL_SUB ) then
+	set TABLES_DIR   = $SQUIRREL/tables-$SQUIRREL_SUB
+else
+	set TABLES_DIR   = $SQUIRREL/tables
+endif
+
+
+# Programs and scripts
+set ABI_SESS_FILE    = $SQUIRREL/scripts/ABI-sess-file.csh
+set EDIT_FILES       = $SQUIRREL/scripts/edit-session-file.csh
+set CHECK_FILES      = $SQUIRREL/scripts/check-exp-exists.csh
+set ABI_EXP_FILES    = $SQUIRREL/scripts/ABI-exp-files.csh
+set BUILD_EXP_FILES  = $SQUIRREL/scripts/embellish-exp-files.csh
+set QUALITY_CHECK    = $SQUIRREL/scripts/check-quality.csh
+set VECTOR_EXCISE    = $SQUIRREL/scripts/excise-vector.csh
+set TRANSFER_FILES   = $SQUIRREL/scripts/transfer-files.csh
+set ASSEMBLE_FILES   = $SQUIRREL/scripts/assemble-files.csh
+set SCREEN_FILES     = $SQUIRREL/scripts/screen-against-vector.csh
+set BACKUP_DIR       = $SQUIRREL/scripts/tidy-files.csh
+set LOG_TO_SUMMARY   = $SQUIRREL/scripts/log-to-summary.awk
+
+#
+# Gather information about this project
+#
+set GET_LOOK_UP  = $TABLES_DIR/lookup.csh
+set db_info = `$GET_LOOK_UP $PROJECT $TABLES_DIR/databases.lookup`
+if ( $#db_info == 0 ) then
+	echo do-it: no database information for $PROJECT
+	exit 2
+endif
+# Check things look ok
+if ( ! -e ~$db_info[6]/$db_info[2]/$db_info[3].SQ$db_info[4] ) then
+	echo do-it: Cannot find database ~$db_info[6]/$db_info[2]/$db_info[3] version $db_info[4]
+	exit 2
+endif
+# Check owner ok
+if ( $db_info[6] != $USER ) then
+	echo do-it: User $USER not registered to use database ~$db_info[6]/$db_info[2]/$db_info[3] version $db_info[4]
+	exit 2
+endif
+
+
+#
+# Go to where all the action is
+#
+cd "$transfer_directory"
+
+
+#
+# Determine which pass this is
+#
+set pass_file = .squirrel
+if (! -e ${pass_file}) then
+	set last_pass  = 0
+	set last_phase = completed
+	touch ${pass_file}
+else
+	set last_pass  = `tail -1 ${pass_file} | awk '/^Pass/ {pass = $2;} END {print pass;}'`
+	set last_phase = `tail -1 ${pass_file} | awk '/^Pass/ {phase = $4;} END {print phase;}'`
+endif
+@ pass_count = $last_pass + 1
+
+set resume = 0
+
+if ( "$last_phase" != "completed" ) then
+
+	switch ( "$last_phase" )
+	case started:
+		set phase = "log file creation"; breaksw
+	case session:
+		set phase = "session file creation"; breaksw
+	case edited:
+		set phase = "session file editing"; breaksw
+	case checked:
+		set phase = "checking experiment files don't already exist"; breaksw
+	case created:
+		set phase = "experiment files creation"; breaksw
+	case embellished:
+		set phase = "experiment files embellishment"; breaksw
+	case quality:
+		set phase = "quality checking"; breaksw
+	case excised:
+		set phase = "vector removal"; breaksw
+	case screened:
+		set phase = "screen against vector"; breaksw
+	case transferred:
+		set phase = "file transfer"; breaksw
+	case assembled:
+		set phase = "sequence assembly"; breaksw
+	case reported:
+		set phase = "reporting"; breaksw
+	case archived:
+		set phase = "backing up files"; breaksw
+	default:
+		set phase = "***problem***" ; breaksw
+	endsw
+
+	echo "***"
+	echo "*** WARNING\!\!\!"
+	echo "***"
+	echo "*** SQUIRREL did not successfully complete the last time it was"
+	echo "*** run on this directory"
+	echo "***"
+	echo ""
+	echo "Last phase successfully completed: $phase"
+repeat:
+	echo -n "Do you want to resume the previous pass? "
+	set reply = $<
+	switch ( $reply )
+	case [yY]*:
+		set pass_count = $last_pass
+		set resume = 1;
+		breaksw
+	case [nN]*:
+		set last_phase = "completed"
+		breaksw
+	default:
+		echo "please reply yes or no"
+		goto repeat
+	endsw
+	echo ""
+	echo ""
+
+endif
+
+
+#
+# START
+#
+setenv SQUIRREL_LOG "$transfer_directory"/log.Pass.${pass_count}
+
+
+#
+# Files and directories
+#
+# WARNING : the following line makes assumptions about directory names
+#
+# YUK! YUK! YUK! the following line is aweful
+set temp = ( $1 )
+set session =  $temp[2]$temp[3].${pass_count}
+set sessfile =  sess.$session
+set sessfile2 =  sess.$session.2
+set filenames = fn.$session
+set filenames2 = fn.$session.2
+set filenames3 = fn.$session.3
+set filenames4 = fn.$session.4
+
+#
+# PHASE 0: Start log file
+#
+if ( $last_phase == "completed" ) then
+	if ( -e "$SQUIRREL_LOG" ) /bin/rm -f "$SQUIRREL_LOG"
+	touch "$SQUIRREL_LOG"
+
+	# log this phase
+	set last_phase = started
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Add messages to log
+#
+if ( $resume == 0) then
+	echo SQUIRREL Version 1.4 | tee -a "$SQUIRREL_LOG"
+	echo "" | tee -a "$SQUIRREL_LOG"
+	echo Processing "$transfer_directory" | tee -a "$SQUIRREL_LOG"
+	echo Project $PROJECT | tee -a "$SQUIRREL_LOG"
+	echo Started at `date` | tee -a "$SQUIRREL_LOG"
+        echo Pass ${pass_count} | tee -a "$SQUIRREL_LOG"
+else
+	echo Resumed at `date` | tee -a "$SQUIRREL_LOG"
+        echo Pass ${pass_count} - Phase $phase | tee -a "$SQUIRREL_LOG"
+endif
+
+#
+# Create session file files
+#
+if ( $last_phase == "started" ) then
+	/bin/rm -f $sessfile
+	$ABI_SESS_FILE $sessfile
+	if ($status != 0) then
+		echo "do-it: Error while creating session file" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+
+	# log this phase
+	set last_phase = session
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Allow the user to edit the session file to suit themselves
+#
+if ( $last_phase == "session" ) then
+	/bin/rm -f $sessfile2
+	$EDIT_FILES $sessfile $sessfile2
+	if ( $status ) then
+		echo "do-it: Error while editing session file" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+	/bin/mv $sessfile2 $sessfile
+
+	# log this phase
+	set last_phase = edited
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Filter out experiments already processed
+#
+if ( $last_phase == "edited" ) then
+	/bin/rm -f $filenames
+	$CHECK_FILES ~$db_info[6]/$db_info[2] $sessfile $filenames
+	if ( $status ) then
+		echo "do-it: Error while checking for previous processing" | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+
+	# log this phase
+	set last_phase = checked
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Create experiment files from session file
+#
+if ( $last_phase == "checked" ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$ABI_EXP_FILES $sessfile $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while creating experiment files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+		/bin/mv -f $filenames2 $filenames
+	else
+		echo do-it: Skipping creation of experiment files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = created
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Fill out the details on the experiment files
+#
+if ( $last_phase == "created" ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$BUILD_EXP_FILES $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while building experiment files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+		/bin/mv $filenames2 $filenames
+	else
+		echo do-it: Skipping building of experiment files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = embellished
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Remove poor quality reads
+#
+if ( $last_phase == "embellished") then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames2
+		$QUALITY_CHECK $filenames $filenames2
+		if ($status != 0) then
+			echo "do-it: Error while checking quality of reads" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping quality check of files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames $filenames2
+	endif
+
+	# log this phase
+	set last_phase = quality
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+#
+# Remove vector sequence
+#
+if ( $last_phase == "quality" ) then
+	if ( `wc $filenames2 | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames3
+		$VECTOR_EXCISE $filenames2 $filenames3
+		if ($status != 0) then
+			echo "do-it: Error while excising vectors from files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping excising of vector from files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames2 $filenames3
+	endif
+
+	# log this phase
+	set last_phase = excised
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+
+
+
+#
+# Screen against vector
+#
+if ( $last_phase == "excised" ) then
+	if ( `wc $filenames3 | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f $filenames4
+		$SCREEN_FILES $filenames3 $filenames4
+		if ($status != 0) then
+			echo "do-it: Error while screening files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping screening of vector from files | tee -a "$SQUIRREL_LOG"
+		/bin/cp $filenames3 $filenames4
+	endif
+
+	# log this phase
+	set last_phase = screened
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+
+
+
+#
+# Transfer files - some reformating and converting may occur
+# NOTE: The second $filenames is in ~$db_info[6]/$db_info[2] directory
+#
+if ( $last_phase == screened ) then
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		/bin/rm -f ~$db_info[6]/$db_info[2]/$filenames 
+		$TRANSFER_FILES ~$db_info[6]/$db_info[2] $filenames $filenames 
+		if ($status != 0) then
+			echo "do-it: Error while transferring files" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping transfer of files | tee -a "$SQUIRREL_LOG"
+	endif
+
+	# log this phase
+	set last_phase = transferred
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+
+#
+# Do the assembly
+#
+if ( $last_phase == transferred ) then
+	pushd ~$db_info[6]/$db_info[2]
+	if ( `wc $filenames | awk '{print $2;}'` > 0 ) then
+		$ASSEMBLE_FILES $PROJECT $filenames
+		if ($status != 0) then
+			echo "do-it: Error while assembling files into database" | tee -a "$SQUIRREL_LOG"
+			exit 2
+		endif
+	else
+		echo do-it: Skipping assembly of files | tee -a "$SQUIRREL_LOG"
+	endif
+	popd
+
+	# log this phase
+	set last_phase = assembled
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Generate log to tell us what has happened with all the files this pass
+#
+if ( $last_phase == assembled ) then
+	pushd ~$db_info[6]/$db_info[2]
+	if (! -e Logs) mkdir Logs
+	set session_log = log.$session
+	awk -f $LOG_TO_SUMMARY "$SQUIRREL_LOG" > Logs/$session_log
+	/bin/cp Logs/$session_log "$transfer_directory"
+	lpr Logs/$session_log
+	popd
+
+	# log this phase
+	set last_phase = reported
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Back up
+#
+if ( $last_phase == reported ) then
+	$BACKUP_DIR $session_log
+	if ($status != 0) then
+		echo "do-it: Error while tidying up files in transfer directory" | tee -a "$SQUIRREL_LOG"
+		exit 2
+	endif
+
+	# log this phase
+	set last_phase = archived
+	echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+endif
+
+
+#
+# Remove temporary files
+#
+/bin/rm -f $sessfile $sessfile2 $filenames $filenames2 $filenames3 $filenames4
+
+
+#
+# Completed successfully
+#
+
+# log this phase
+set last_phase = completed
+echo Pass ${pass_count} : ${last_phase} `date` >> ${pass_file}
+
+echo Completed at `date` | tee -a "$SQUIRREL_LOG"
+
+exit 0
+
+
+abort:
+#
+# Interruption not normal termination
+#
+
+if ( $?SQUIRREL_LOG ) then
+    echo Aborted at `date` | tee -a "$SQUIRREL_LOG"
+else
+    echo Aborted at `date`
+endif
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/edit-session-file.csh b/src/squirrel-1.4/scripts/edit-session-file.csh
new file mode 100644
index 0000000..97c7753
--- /dev/null
+++ b/src/squirrel-1.4/scripts/edit-session-file.csh
@@ -0,0 +1,92 @@
+#! /bin/csh -f
+#
+# edit-session-file
+#
+#    Allow a session file to be edited, then parse it for editing
+#    mistakes.
+#
+# Usage:
+#    edit-session-file fof_in fof_out
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+# Usage
+if ( $#argv != 2 ) then
+	echo Usage: edit-session-file fof_in fof_out
+	exit 2
+endif
+
+if ( ! -f $1 ) then
+	echo edit-session-file: file of file names $1 does not exist. | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+if ( -e $2 ) then
+	echo exit-session-file: file $2 exists. | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Where the wild things are
+#
+set PARSE_EDIT = $SQUIRREL/scripts/parse-session-file.awk
+
+
+#
+# Temporary files
+#
+set sess_edit = _session.$$
+set parse_out = _parse.$$
+set spacer = _spacer.$$
+
+#
+# Make a copy
+#
+/bin/cp $1 ${sess_edit}
+
+#
+# Allow the user to edit the file
+#
+$EDITOR ${sess_edit}
+
+#
+# Create a spacer - DO NOT CHANGE THIS!!!!
+#
+cat << EOF > ${spacer}
+##PASS-2##
+EOF
+
+#
+# Parse the edited file for errors
+#
+awk -f $PARSE_EDIT $1 ${spacer} ${sess_edit} > ${parse_out}
+
+#
+# Report errors
+#
+echo "" | tee -a "$SQUIRREL_LOG"
+echo Parsing edited file... | tee -a "$SQUIRREL_LOG"
+grep -v '^OUT>' ${parse_out} | tee -a "$SQUIRREL_LOG"
+
+#
+# Gather clean lines
+#
+awk '/^OUT>/ {print substr($0,length($1)+2);}' ${parse_out} > $2
+
+#
+# Tidy up
+#
+/bin/rm -f ${sess_edit} ${parse_out} ${spacer}
+
+
+
+exit 0
+
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/embellish-exp-files.csh b/src/squirrel-1.4/scripts/embellish-exp-files.csh
new file mode 100644
index 0000000..171a15a
--- /dev/null
+++ b/src/squirrel-1.4/scripts/embellish-exp-files.csh
@@ -0,0 +1,201 @@
+#! /bin/csh -f
+#
+# embellish-exp-files
+#   Fill in the details of an experiment file once the machine specific bits
+#   have been done.
+#
+#   Usage:
+#      embellish-exp-files fofn_in fofn_out
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+# Usage
+if ( $#argv != 2 ) then
+	echo Usage: embellish-exp-files fofn_in fofn_out | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+if ( ! -e $1 ) then
+	echo embellish-exp-files: File of file names $1 does not exist | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+if ( -e $2 ) then
+	echo embellish-exp-files: Output file of file names $2 already exists | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+# create output file of file names
+touch $2
+
+# Directories
+if ( $?SQUIRREL_SUB ) then
+	set TABLES_DIR   = $SQUIRREL/tables-$SQUIRREL_SUB
+else
+	set TABLES_DIR   = $SQUIRREL/tables
+endif
+
+# Programs and scripts
+set GET_MACH_SEQ  = $SQUIRREL/bin/expGetSeq
+set GET_CLIP_SEQ  = $SQUIRREL/scripts/clip-seq.awk
+set GET_LOOK_UP   = $TABLES_DIR/lookup.csh
+set MAKE_SCF_FILE = $SQUIRREL/bin/makeSCF
+
+
+
+
+
+# Process each file
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Embellishing experiment file for:" | tee -a "$SQUIRREL_LOG"
+foreach file ( `cat $1` )
+
+	echo $file | tee -a "$SQUIRREL_LOG"
+
+	if ( ! -e $file ) then
+		echo Abandoned: ${file}: Experiment file missing | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# check not embellished previously (look for OP)
+	set operator = `awk '/^OP/ {$1="";line = substr($0,2);} END {print line;}' $file`
+	if ( $operator != "" ) then
+		# This file has been embellished already - clean it
+		set tempfile = _temp.exp
+		/bin/mv -f $file $tempfile
+		sed '/^OP/,$d' $tempfile > $file
+	endif
+
+	# beautify it
+	set expname = `awk '/^EN/ {$1="";line = substr($0,2);} END {print line;}' $file`
+	if ( $expname == "" ) set expname = $file
+
+	# specify file names
+	set expfile = $expname
+	set tracefile = `awk '/^LN/ {$1="";line = substr($0,2);} END {print line;}' $file`
+	if ( $tracefile == "" ) set tracefile = ${expfile}SCF
+
+	# include obvious information
+	# OP must always be the first embellished record
+	echo "OP   `whoami`" >> $expfile
+
+	# gather information from lookup files
+
+	# information on subclones/templates
+	set template = `$GET_LOOK_UP $expname $TABLES_DIR/subclones.lookup`
+	if ( $#template == 0 ) then
+		echo Abandoned: ${expfile}: No template information for $expname | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# information on clones and cloning vectors
+	set clone = `$GET_LOOK_UP $template[2] $TABLES_DIR/clone-types.lookup`
+	if ( $#clone == 0 ) then
+		echo Abandoned: ${expfile}: No clone information for $template[2] | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# information on sequencing clone sites
+	set subclone = `$GET_LOOK_UP $template[5] $TABLES_DIR/seqclone-sites.lookup`
+	if ( $#subclone == 0 ) then
+		echo Abandoned: ${expfile}: No cloning site information for $template[5] | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# information on sequencing vector sequence files
+	set scfile = `$GET_LOOK_UP $template[5] $TABLES_DIR/vector-seqfiles.lookup`
+	if ( $#scfile == 0 ) then
+		echo Abandoned: ${expfile}: No sequence file for sequencing vector $template[5] | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+	if ( ! -e $SQUIRREL/seqs/$scfile[2] ) then
+		echo Abandoned: ${expfile}: Sequencing vector file does not exist | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# information on cloning vector sequence files
+	set cfile = `$GET_LOOK_UP $clone[2] $TABLES_DIR/vector-seqfiles.lookup`
+	if ( $#cfile == 0 ) then
+		echo Abandoned: ${expfile}: No sequence file for cloning vector $clone[2] | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+	if ( ! -e $SQUIRREL/seqs/$cfile[2] ) then
+		echo Abandoned: ${expfile}: Cloning vector file does not exist | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+
+	# include info derived from lookups
+	echo "TN   $expname:r" >> $expfile
+	echo "SV   $template[5]" >> $expfile
+	echo "SF   $SQUIRREL/seqs/$scfile[2]" >> $expfile
+	echo "SI   $template[3]" >> $expfile
+	echo "SC   $subclone[2]" >> $expfile
+	echo "FM   $template[4]" >> $expfile
+
+	# assume Universal primer U1
+	#echo PN U1 >> $expfile
+
+	# determine whether forwards or reverse read
+	if ( $expname =~ *.[rR]* ) then
+		# these experiments are reverse reads
+		echo "DR   -" >> $expfile
+		echo "SP   $subclone[4]" >> $expfile
+	else
+		# otherwise assume forward read
+		echo "DR   +" >> $expfile
+		echo "SP   $subclone[3]" >> $expfile
+	endif
+
+	# determine strands
+	if ( $expname =~ *.[rRfF]* ) then
+		# these experiments are double stranded
+		echo "ST   2" >> $expfile
+	else
+		# otherwise assume single stranded
+		echo "ST   1" >> $expfile
+	endif
+
+	# include cloning vector information
+	echo "CN   $template[2]" >> $expfile
+	echo "CV   $clone[2]" >> $expfile
+	echo "CF   $SQUIRREL/seqs/$cfile[2]" >> $expfile
+
+
+	# determine lists of screens
+	$GET_LOOK_UP $template[2] $TABLES_DIR/screens.lookup | awk '{$1="";print  "SS   " substr($0,2);}' >> $expfile
+
+
+	# retrieve the sequence
+	set temp_file = $expfile.temp.$$
+	$GET_MACH_SEQ -SCF "$tracefile" -output $temp_file
+	if ( $status ) then
+		echo Discarded: ${expfile}: Error reading sequence from trace | tee -a "$SQUIRREL_LOG"
+		/bin/rm -f $temp_file
+		goto skip
+	endif
+
+	# include sequence in experiment file
+	cat $temp_file >> $expfile
+	/bin/rm -f $temp_file
+
+	# Determine poor quality parts
+	awk -f $GET_CLIP_SEQ $expfile >> $expfile
+
+	# update file of file names
+	echo $expfile >> $2
+
+# skip to here on error
+skip:
+
+end
+
+
+exit 0
+
+
+abort:
+#
+# interruption - not successful completion
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/excise-vector.csh b/src/squirrel-1.4/scripts/excise-vector.csh
new file mode 100644
index 0000000..3f84d43
--- /dev/null
+++ b/src/squirrel-1.4/scripts/excise-vector.csh
@@ -0,0 +1,135 @@
+#! /bin/csh -f
+#
+# excise_vector
+#    Removes sequencing and cloning vector from sequences
+#
+# Usage:
+#	excise-vector in_fofn out_fofn
+#
+# 5-June-1992
+#    vepe now writes fails to a fail file
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 2 ) then
+	echo "Usage: excise-vector in_fofn out_fofn" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+if ( ! -f $1 ) then
+	echo excise-vector: file of file names $1 does not exist. | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+if ( -e $2 ) then
+	echo excise-vector: file $2 exists. | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Where the wild things are
+#
+set EXCISE_VECTOR = $SQUIRREL/bin/vepe
+set VEPE_FAILS    = $SQUIRREL/scripts/vepe-fails.csh
+#
+# Temporary files
+#
+set temp_fofn = _vep_temp_fofn.$$
+set fails1    = _vep_fails1.$$
+set fails2    = _vep_fails2.$$
+/bin/rm -f ${temp_fofn} ${fails1} ${fails2}
+
+#
+# Vep options, responses
+#
+set DEFAULT = ""
+
+#************************************************************
+# First pass - excise sequencing vector
+#************************************************************
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Excising sequencing vector" | tee -a "$SQUIRREL_LOG"
+
+${EXCISE_VECTOR} << EOF
+1
+$1
+${temp_fofn}
+${fails1}
+${DEFAULT}
+${DEFAULT}
+${DEFAULT}
+EOF
+if ( $status ) then
+	echo "excise_vector: Abnormal termination" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Report failures
+#
+$VEPE_FAILS $fails1 1
+if ( $status ) then
+	echo "excise_vector: Abnormal termination" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+
+
+
+
+
+
+
+
+
+
+
+
+#************************************************************
+# Second pass - excise cloning vector
+#************************************************************
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Excising cloning vector" | tee -a "$SQUIRREL_LOG"
+
+${EXCISE_VECTOR} << EOF
+2
+${temp_fofn}
+$2
+${fails2}
+${DEFAULT}
+${DEFAULT}
+${DEFAULT}
+EOF
+if ( $status ) then
+	echo "excise_vector: Abnormal termination" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Report failures
+#
+$VEPE_FAILS $fails2 2
+if ( $status ) then
+	echo "excise_vector: Abnormal termination" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+
+#
+# Tidy up
+#
+/bin/rm -f ${temp_fofn}
+/bin/rm -f ${fails1} ${fails2}
+
+exit 0
+
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/exp2dap.awk b/src/squirrel-1.4/scripts/exp2dap.awk
new file mode 100644
index 0000000..6f0a460
--- /dev/null
+++ b/src/squirrel-1.4/scripts/exp2dap.awk
@@ -0,0 +1,157 @@
+#
+# Convert experiment file format to a format compatible with dap
+#
+# It's pretty clutzy and nasty as information is lost in the conversion
+# process. By it'll do until a version of "gap" is written that accepts
+# files in the nice format.
+#
+# Written by Simon Dear, 10 February 1992
+#
+# Changes:
+# 3-Nov-92 Added code for creation of initial tags
+#
+
+BEGIN {
+	# set createTags to "YES" if you want them
+	createTags = "YES";
+}
+
+$1 == "SQ" {
+	# Gather sequence
+	seq = "";
+	getline;
+	while ( $0 != "//") {
+		# sequence may be broken up by white space
+		for (frag=1;frag<=NF;frag++) seq = seq $frag;
+		getline;
+	}
+	seqlen = length(seq);
+}
+
+{
+	# Record entry lines
+	type = $1;
+	$1 = "";
+	entry[type] = substr($0,2);
+}
+
+
+END {
+	# Determine left and right end cuts
+	# Take all left and right ends of excluded sequences
+	n = 0;
+	if ( entry["CS"] != "" ) {
+		# special variables because we will need them later
+		CS = entry["CS"];
+		CSl = substr(CS,1,index(CS,".")-1)+0;
+		CSr = substr(CS,index(CS,".")+2)+0;
+		n++; pos[n] = CSl; type[n] = +1;
+		n++; pos[n] = CSr; type[n] = -1;
+	}
+	if ( entry["SR"] != "" ) {
+		l = entry["SR"]+0;
+		r = seqlen+1;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["QL"] != "" ) {
+		l = 0;
+		r = entry["QL"]+0;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["QR"] != "" ) {
+		l = entry["QR"]+0;
+		r = seqlen+1;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["SL"] != "" ) {
+		l = 0;
+		r = entry["SL"]+0.0;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+
+	# Sort them
+	for (i = 2; i <= n; i++ ) {
+		for (j=i; j>1 && pos[j-1] > pos[j]; j--) {
+			#swap pos[j-1] and pos[j];
+			t=pos[j-1]; pos[j-1]=pos[j]; pos[j]=t;
+			t=type[j-1]; type[j-1]=type[j]; type[j]=t;
+		}
+	}
+
+	# Set defaults
+	pos[0] = 0; pos[n+1] = seqlen+1;
+
+	# Deduce region not marked
+	# Select longest unmarked region of sequence
+	inn = 0;
+	max_width = 0;
+	for (i = 1; i <= n+1; i++ ) {
+		if (inn == 0) {
+			if (max_width < pos[i]-pos[i-1] ) {
+				l_cut = pos[i-1];
+				r_cut = pos[i];
+				max_width = pos[i]-pos[i-1];
+			}
+
+		}
+		inn += type[i];
+	}
+
+	# Write dap file header
+	ulen = r_cut - l_cut - 1;
+	printf(";%6d%6d%6d%-4s%-18s\n",seqlen,l_cut,ulen,entry["LT"],entry["LN"]);
+
+	# Write left cutoffs (if any)
+	if(l_cut>0) {
+		str = substr(seq,1,l_cut);
+		strlen = l_cut;
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";<%s\n",substr(str,i,bits));
+		}
+	}
+
+	# Write right cutoffs (if any)
+	if(r_cut<=seqlen) {
+		str = substr(seq,r_cut);
+		strlen = length(str);
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";>%s\n",substr(str,i,bits));
+		}
+	}
+
+	if (createTags == "YES") {
+		# Write initial tags here
+		# IGNS
+		if (entry["SR"]+0 <= seqlen && ulen > 5) {
+			printf (";;IGNS %6d %6d %s\n",ulen-4,5,entry["SV"]);
+		}
+		# IGNC
+		if (entry["CS"] != "" && ulen > 5) {
+			# determine which end to put tag
+			if (CSr <= l_cut) {
+				# at the left
+				printf (";;IGNC %6d %6d %s\n",1,5,entry["CV"]);
+			} else {
+				# at the right
+				printf (";;IGNC %6d %6d %s\n",ulen-4,5,entry["CV"]);
+			}
+		}
+	}
+
+	# Write remaining sequence (if any!)
+	if (l_cut+1 != r_cut) {
+		str = substr(seq,l_cut+1,r_cut-l_cut-1);
+		strlen = length(str);
+		bits = 50;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf ("%s\n",substr(str,i,bits));
+		}
+	}
+}
+
diff --git a/src/squirrel-1.4/scripts/exp2fasta.awk b/src/squirrel-1.4/scripts/exp2fasta.awk
new file mode 100644
index 0000000..1a5009d
--- /dev/null
+++ b/src/squirrel-1.4/scripts/exp2fasta.awk
@@ -0,0 +1,129 @@
+#
+# Convert experiment file format to fasta format
+#
+# It's pretty clutzy and nasty as information is lost in the conversion
+# process. By it'll do until a version of "gap" is written that accepts
+# files in the nice format.
+#
+# Written by Simon Dear, 10 February 1992
+#
+# Changes:
+# 3-Nov-92 Added code for creation of initial tags
+#
+
+BEGIN {
+	# set createTags to "YES" if you want them
+	createTags = "YES";
+}
+
+$1 == "SQ" {
+	# Gather sequence
+	seq = "";
+	getline;
+	while ( $0 != "//") {
+		# sequence may be broken up by white space
+		for (frag=1;frag<=NF;frag++) seq = seq $frag;
+		getline;
+	}
+	seqlen = length(seq);
+}
+
+{
+	# Record entry lines
+	type = $1;
+	$1 = "";
+	entry[type] = substr($0,2);
+}
+
+
+END {
+	# Determine left and right end cuts
+	# Take all left and right ends of excluded sequences
+	n = 0;
+	if ( entry["CS"] != "" ) {
+		# special variables because we will need them later
+		CS = entry["CS"];
+		CSl = substr(CS,1,index(CS,".")-1)+0;
+		CSr = substr(CS,index(CS,".")+2)+0;
+		n++; pos[n] = CSl; type[n] = +1;
+		n++; pos[n] = CSr; type[n] = -1;
+	}
+	if ( entry["SR"] != "" ) {
+		l = entry["SR"]+0;
+		r = seqlen+1;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["QL"] != "" ) {
+		l = 0;
+		r = entry["QL"]+0;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["QR"] != "" ) {
+		l = entry["QR"]+0;
+		r = seqlen+1;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+	if ( entry["SL"] != "" ) {
+		l = 0;
+		r = entry["SL"]+0.0;
+		n++; pos[n] = l; type[n] = +1;
+		n++; pos[n] = r; type[n] = -1;
+	}
+
+	# Sort them
+	for (i = 2; i <= n; i++ ) {
+		for (j=i; j>1 && pos[j-1] > pos[j]; j--) {
+			#swap pos[j-1] and pos[j];
+			t=pos[j-1]; pos[j-1]=pos[j]; pos[j]=t;
+			t=type[j-1]; type[j-1]=type[j]; type[j]=t;
+		}
+	}
+
+	# Set defaults
+	pos[0] = 0; pos[n+1] = seqlen+1;
+
+	# Deduce region not marked
+	# Select longest unmarked region of sequence
+	inn = 0;
+	max_width = 0;
+	for (i = 1; i <= n+1; i++ ) {
+		if (inn == 0) {
+			if (max_width < pos[i]-pos[i-1] ) {
+				l_cut = pos[i-1];
+				r_cut = pos[i];
+				max_width = pos[i]-pos[i-1];
+			}
+
+		}
+		inn += type[i];
+	}
+
+	# Write dap file header
+	ulen = r_cut - l_cut - 1;
+	printf(">%s\n",entry["EN"]);
+
+
+	# Write remaining sequence (if any!)
+	if (l_cut+1 != r_cut) {
+		str = substr(seq,l_cut+1,r_cut-l_cut-1);
+		strlen = length(str);
+		# convert all '-'s to Ns
+		str2 = ""
+		for (i=1; i<=strlen; i++) {
+		    	c = substr(str,i,1);
+			if (c=="-")
+				str2 = str2 "N";
+			else
+				str2 = str2 c;
+		}
+		str = str2;
+		bits = 50;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf ("%s\n",substr(str,i,bits));
+		}
+	}
+}
+
diff --git a/src/squirrel-1.4/scripts/log-to-summary.awk b/src/squirrel-1.4/scripts/log-to-summary.awk
new file mode 100644
index 0000000..21054c4
--- /dev/null
+++ b/src/squirrel-1.4/scripts/log-to-summary.awk
@@ -0,0 +1,67 @@
+#
+# log-to-summary.awk
+#
+#   Generate a summary from a squirrel log file
+#
+# Possible usage:
+#   awk -f log-to-summary.awk logfile > summary
+#
+# Changes:
+#  22-Oct-92 There are now defaults for readings lost in processing
+#
+BEGIN {
+	count = 0;
+}
+
+/^\*\*/ {
+	# Format: ** expfile from trace
+	if (reverse[$2] == "") {
+	    count++;
+	    t = substr($0,length($2)+10);
+	    sample[count] = $2;
+	    trace[count] = t;
+	    reverse[$2] = count;
+            # defaults
+	    status[count] = "Abandoned";
+	    comment[count] = "lost in processing";
+
+	}
+}
+
+/^Discarded:/ || /^Failed:/ || /^Abandoned:/ || /^OK:/ || /^Rejected:/ {
+	# Format: Failed: expfile: reason
+	#   -or-  Succeeded: expfile: comment
+	if (length($2) > 1) {
+		expfile = substr($2,1,length($2)-1);
+		NN = reverse[expfile];
+		status[NN] = substr($1,1,length($1)-1);
+		comment[NN] = substr($0,length($1)+length($2)+3);
+	}
+}
+
+#Miscellaneous lines
+/^SQUIRREL/ || /^Processing/ || /^Project/ || /^Started at/ || /^Completed at/ || /^Aborted at/ || /^Pass/ || /^Resumed/ {
+	print "#", $0;
+}
+
+END {
+	print "#"; 
+	maxTrace = 0; maxSample = 0; maxSTATUS = 0;
+	for (i = 1 ; i <= count ; i++ ) {
+		if (sample[i] != "") {
+			if (maxTRACE < length(trace[i]))
+			    maxTRACE = length(trace[i]);
+			if (maxSAMPLE < length(sample[i]))
+			    maxSAMPLE = length(sample[i]);
+			if (maxSTATUS < length(status[i]))
+			    maxSTATUS = length(status[i]);
+		}
+	}
+	fmt = "%-" maxTRACE+1 "s  %-" maxSAMPLE "s  %-" maxSTATUS "s  %s\n";
+	for (i = 1 ; i <= count ; i++ ) {
+		if (sample[i] != "") {
+			printf(fmt,trace[i] ":",sample[i],status[i],comment[i]);
+		}
+	}
+}
+	
diff --git a/src/squirrel-1.4/scripts/packhorse b/src/squirrel-1.4/scripts/packhorse
new file mode 100644
index 0000000..e21329d
--- /dev/null
+++ b/src/squirrel-1.4/scripts/packhorse
@@ -0,0 +1,84 @@
+#! /bin/csh -f
+#
+# File: packhorse
+# Version: 1.1
+#
+# Author: Simon Dear
+#         MRC Laboratory of Molecular Biology
+#	  Hills Road
+#	  Cambridge CB2 2QH
+#	  United Kingdom
+#
+# Created: 13 October 1992
+# Updated:
+#
+# Description:
+#       C shell script to back ABI results files into one directory
+#
+# Usage:
+#	packhorse dir [dir...] directory
+#
+
+# default number
+set number = 0
+
+if ($#argv < 2) then
+    echo 'Usage: packhorse dir [dir...] directory'
+    exit 2
+endif
+
+set packdir = "$argv[$#argv]"
+
+set logfile = "$packdir"/.PACK-LOG
+
+if (! -e "$packdir") then
+    mkdir "$packdir"
+else if (! -d "$packdir") then
+    echo "$packdir" exists but is not a directory
+    exit 1
+else if ( -e "$packdir"/.rsrc ) then
+    echo "$packdir" is a Results folder...abandoned
+    exit 1
+else if ( ! -e "$logfile" ) then
+    echo "$packdir" is not a pack directory
+    exit 1
+endif
+
+touch "$logfile"
+
+set diri = 1
+set dirn = $#argv ; @ dirn --
+while ( $diri <= $dirn )
+    set dir = "$argv[$diri]"
+    echo Packing "$dir" into "$packdir" | tee -a "$logfile"
+    # check number
+    set a = `find "$packdir" -name "Sample ${number}[0-9][0-9]" -print`
+    while ( $#a != 0 ) 
+	@ number ++
+        set a = `find "$packdir" -name "Sample ${number}[0-9][0-9]" -print`
+    end
+    # set files = `find "$dir" -prune -name "Sample [0-9][0-9]" -print`
+    set files = "$dir"/"Sample "[0-9][0-9]{,.[0-9]}
+
+    set filei = 1
+    while ( $filei <= $#files )
+	set file = "$files[$filei]"
+	set newfile = `echo "$file" | sed 's/.*Sample /Sample '${number}'/'`
+	echo "$file -> $packdir/$newfile" | tee -a "$logfile"
+	mv "$file" "$packdir"/"$newfile"
+	@ filei ++
+    end
+
+    # remove directory, but only if no files left
+    # >100 disk blocks is arbitrary, but should trap any valid ABI files
+    set big_files = `find "$dir" -size +100 -print`
+    if ( $#big_files == 0 ) then
+	echo Removing directory "$dir" | tee -a "$logfile"
+	/bin/rm -r "$dir"
+    else
+	echo Not removing directory "$dir" - large files present | tee -a "$logfile"
+    endif
+
+    @ diri ++
+end
+
diff --git a/src/squirrel-1.4/scripts/parse-session-file.awk b/src/squirrel-1.4/scripts/parse-session-file.awk
new file mode 100644
index 0000000..e7ab09f
--- /dev/null
+++ b/src/squirrel-1.4/scripts/parse-session-file.awk
@@ -0,0 +1,93 @@
+#
+# Parse edited session file
+#
+# Input:
+#
+#      (original session file)
+#      ##PASS-2##
+#      (edited session file)
+#
+# Output:
+#      (annotated edited session file)
+#      OUT> passed line...
+#
+#
+
+BEGIN {
+	FS = ":";
+	pass = 1;
+	count = 0;
+	l = 0;
+}
+
+/^##PASS-2##/ {
+	pass = 2;
+	next;
+}
+
+pass == 2 { l++; printf("%3d %s\n",l,$0); }
+
+/^#/ { next; }
+
+pass == 1 {
+	pass1sample[$1] = 1;
+	next;
+}
+
+pass == 2 {
+
+	# must be two fields
+	if (NF != 2) {
+		print "ERROR: wrong number of fields >>", $0;
+		next;
+	}
+
+	# must be non null
+	if (length($1)==0) {
+		print "ERROR: sample missing >>",$0;
+		next;
+	}
+	if (length($2)==0) {
+		print "ERROR: experiment missing >>",$0;
+		next;
+	}
+
+	# $1 should be in original file
+	if (pass1sample[$1] != 1) {
+		print "ERROR: sample", $1, "was not in original file"
+		next;
+	}
+
+	# $1 should not be duplicated
+	sample[$1]++;
+	if (sample[$1] > 1) {
+		print "ERROR: sample", $1, "appears more than once in edited file"
+		line[sample2line[$1]] = "";
+		next;
+	}
+
+	# $2 should look ok
+	for(i=1;i<length($2);i++) {
+		if(index("abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ0123456789. ",substr($2,i,1))==0) {
+			print "ERROR: experiment", $2, "contains invalid characters"
+			next;
+		}
+	}
+
+	# $2 should not be duplicated
+	expname[$2]++;
+	if (expname[$2] > 1) {
+		print "ERROR: experiment", $2, "appears more than once in edited file"
+		line[experiment2line[$2]] = "";
+		next;
+	}
+
+	line[count] = $0;
+	sample2line[$1] = count;
+	experiment2line[$2] = count;
+	count ++;
+}
+
+END {
+	for (i=0; i<count; i++) if (line[i]!="") print "OUT>",line[i];
+}
diff --git a/src/squirrel-1.4/scripts/quality-check-loose.awk b/src/squirrel-1.4/scripts/quality-check-loose.awk
new file mode 100644
index 0000000..e425412
--- /dev/null
+++ b/src/squirrel-1.4/scripts/quality-check-loose.awk
@@ -0,0 +1,74 @@
+#
+# Determine if the experiment file should be rejected because of
+# poor quality. Ideally this would be done by looking at trace
+# quality, but this will have to do in the meantime
+#
+# Method:
+#    Determine the extents of the good data, by looking at the
+#    experiment file entries. Default is to use LL..RR
+#    Reject if the good data is less that MINLEN
+#    Reject if the good data has greater than CRAP (%) ambiguity content
+#
+# Outputs:
+#    0 - passes quality check
+#    1 - fails length constraint
+#    2 - fails ambiguity content constraint
+#
+# Written by Simon Dear, 27 March 1992
+#
+# loose version
+#
+
+BEGIN {
+	LL = 50;
+	RR = 350;
+	MINLEN = 50;
+	CRAP = 10.0;
+}
+
+$1 == "SQ" {
+	# Gather sequence
+	seq = "";
+	getline;
+	while ( $0 != "//") {
+		# sequence may be broken up by white space
+		for (frag=1;frag<=NF;frag++) seq = seq $frag;
+		getline;
+	}
+	seqlen = length(seq);
+}
+
+{
+	# Record entry lines
+	type = $1;
+	$1 = "";
+	entry[type] = substr($0,2);
+}
+
+
+END {
+	if ( entry["QL"] != "" ) LL = entry["QL"] + 0;
+	if ( entry["QR"] != "" ) RR = entry["QR"] + 0;
+	if ( LL > seqlen ) LL = seqlen;
+	if ( RR > seqlen ) RR = seqlen;
+	good_len = RR - LL + 1;
+
+	# Check length constraint
+	if ( good_len < MINLEN ) {
+		print 1;
+	} else {
+		# Check ambiguity content restraint
+		
+		count = 0;
+		for (i = LL; i <= RR; i++) {
+			
+			if ( index("ACGTacgt",substr(seq,i,1)) == 0 ) count++;
+
+		}
+
+		if ( count * 100 / good_len > CRAP )
+			print 2;
+		else
+			print 0;
+	}
+}
diff --git a/src/squirrel-1.4/scripts/quality-check.awk b/src/squirrel-1.4/scripts/quality-check.awk
new file mode 100644
index 0000000..750fe63
--- /dev/null
+++ b/src/squirrel-1.4/scripts/quality-check.awk
@@ -0,0 +1,72 @@
+#
+# Determine if the experiment file should be rejected because of
+# poor quality. Ideally this would be done by looking at trace
+# quality, but this will have to do in the meantime
+#
+# Method:
+#    Determine the extents of the good data, by looking at the
+#    experiment file entries. Default is to use LL..RR
+#    Reject if the good data is less that MINLEN
+#    Reject if the good data has greater than CRAP (%) ambiguity content
+#
+# Outputs:
+#    0 - passes quality check
+#    1 - fails length constraint
+#    2 - fails ambiguity content constraint
+#
+# Written by Simon Dear, 27 March 1992
+#
+
+BEGIN {
+	LL = 50;
+	RR = 350;
+	MINLEN = 100;
+	CRAP = 4.0;
+}
+
+$1 == "SQ" {
+	# Gather sequence
+	seq = "";
+	getline;
+	while ( $0 != "//") {
+		# sequence may be broken up by white space
+		for (frag=1;frag<=NF;frag++) seq = seq $frag;
+		getline;
+	}
+	seqlen = length(seq);
+}
+
+{
+	# Record entry lines
+	type = $1;
+	$1 = "";
+	entry[type] = substr($0,2);
+}
+
+
+END {
+	if ( entry["QL"] != "" ) LL = entry["QL"] + 0;
+	if ( entry["QR"] != "" ) RR = entry["QR"] + 0;
+	if ( LL > seqlen ) LL = seqlen;
+	if ( RR > seqlen ) RR = seqlen;
+	good_len = RR - LL + 1;
+
+	# Check length constraint
+	if ( good_len < MINLEN ) {
+		print 1;
+	} else {
+		# Check ambiguity content restraint
+		
+		count = 0;
+		for (i = LL; i <= RR; i++) {
+			
+			if ( index("ACGTacgt",substr(seq,i,1)) == 0 ) count++;
+
+		}
+
+		if ( count * 100 / good_len > CRAP )
+			print 2;
+		else
+			print 0;
+	}
+}
diff --git a/src/squirrel-1.4/scripts/screen-against-vector.csh b/src/squirrel-1.4/scripts/screen-against-vector.csh
new file mode 100644
index 0000000..5d502c1
--- /dev/null
+++ b/src/squirrel-1.4/scripts/screen-against-vector.csh
@@ -0,0 +1,155 @@
+#! /bin/csh -f
+#
+# screen-against-vector
+#
+#   Screen files against sequences
+#
+# Usage:
+#   screen-against-vector in_fofn out_fofn
+#
+# NOTE:
+#   This script:
+#   1) determines a list of files to screen against
+#   2) extracts the clipped sequences from the experiment files
+#   3) screens each sequence against the files in (1)
+#
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 2 ) then
+	echo Usage: screen-against-vector in_fofn out_fofn | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+if ( ! -f $1 ) then
+	echo screen-against-vector: file of file names $1 does not exist | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( -e $2 ) then
+	echo screen-against-vector: output file of file names $2 already exists | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Scripts and programs
+#
+set EXPAND = $SQUIRREL/scripts/staden-env.csh
+set CONVERT = $SQUIRREL/scripts/exp2dap.awk
+
+#
+# Temporary files
+#
+set in = _in_$$
+set out = _out_$$
+set hits = _hits_$$
+set misses = _misses_$$
+set tmpdir = _tmpdir_$$
+
+#
+# Determine all sequences to screen against
+#
+set expfiles = `cat $1`
+set f = `awk '/^SS/{$1="";print substr($0,2);}' $expfiles | sort -u`
+
+#
+# Set up for dap/screen against vector
+#
+sort $1 >! $in
+
+if ( $#f != 0 ) then
+	if ( -d $tmpdir ) /bin/rm -fr $tmpdir
+	mkdir $tmpdir
+	foreach file ( $expfiles )
+		awk -f $CONVERT $file > $tmpdir/$file
+	end
+endif
+
+set VECTOR_SCORE = 25
+foreach vector ( $f )
+
+	echo Screening against $vector | tee -a "$SQUIRREL_LOG"
+
+	if (`wc -l $in | awk '{print $1;}'` == 0) goto skip
+
+	#
+	set xvec = $SQUIRREL/seqs/$vector
+	if (! -f $xvec) then
+		echo vector file $xvec does not exist...skipping | tee -a "$SQUIRREL_LOG"
+		goto skip
+	endif
+	# this is a hack to make bap happy
+	set xvec = SQUIRREL/seqs/$vector
+
+	# determine files to screen against
+	grep -l $vector `cat $in` >! $hits
+	comm -23 $in $hits >! $misses
+
+	/bin/rm -f $out
+pushd $tmpdir > /dev/null
+bap << EOF
+3
+!
+
+18
+y
+../$hits
+../$out
+$xvec
+$VECTOR_SCORE
+
+!
+EOF
+
+
+	#
+	# Check exit status
+	#
+	if ( $status ) then
+		echo screen-against-vector: Error running bap/screen against vector | tee -a "$SQUIRREL_LOG"
+		exit 1
+	endif
+
+	popd > /dev/null
+	echo "" | tee -a "$SQUIRREL_LOG"
+
+	#
+	# Print out Failed this pass
+	#
+	tr -d ' ' < $out >! $in ; /bin/mv -f $in $out
+	foreach file ( `comm -23 $hits $out` )
+		echo Failed: ${file}: Matches screening sequence $vector | tee -a "$SQUIRREL_LOG"
+		echo "PS   Matches screening sequence $vector" >> ${file}
+	end
+
+	#
+	# Determine sequences passed to next round
+	#
+	sort $misses $out >! $in
+
+skip:
+end
+
+
+# transfer successful passes to output file
+/bin/cp $in $2
+
+#
+# Remove temporary files
+#
+/bin/rm -f $in $out $hits $misses
+/bin/rm -fr $tmpdir
+
+exit 0
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/screen-against-vector.csh.old b/src/squirrel-1.4/scripts/screen-against-vector.csh.old
new file mode 100644
index 0000000..ce17499
--- /dev/null
+++ b/src/squirrel-1.4/scripts/screen-against-vector.csh.old
@@ -0,0 +1,77 @@
+#! /bin/csh -f
+#
+# screen-against-vector
+#
+#   Screen files against vector sequence
+#
+# Usage:
+#   screen-against-vector vector_file in_fofn out_fofn
+#
+# NOTE:
+#   This script uses xdap, which as yet cannot handle the experiment
+#   file format. Eventually, vepe will be modified to do the same task.
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 3 ) then
+	echo Usage: screen-against-vector vector_file in_fofn out_fofn
+	exit 2
+endif
+
+if ( ! -f $1 ) then
+	echo screen-against-vector: vector file $1 does not exist
+	exit 1
+endif
+
+if ( ! -f $2 ) then
+	echo screen-against-vector: file of file names $2 does not exist
+	exit 1
+endif
+
+if ( -e $3 ) then
+	echo screen-against-vector: output file of file names $3 already exists
+	exit 1
+endif
+
+
+#
+# Set up for dap/screen against vector
+#
+set VECTOR_SCORE = 20
+setenv VECTOR_FILE $1
+
+dap << EOF
+3
+!
+
+18
+y
+$2
+$3
+VECTOR_FILE
+$VECTOR_SCORE
+
+!
+EOF
+
+#
+# Check exit status
+#
+if ( $status ) then
+	echo screen-against-vector: Error running dap/screen against vector
+	exit 1
+endif
+
+exit 0
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/squirrel b/src/squirrel-1.4/scripts/squirrel
new file mode 100644
index 0000000..959aba5
--- /dev/null
+++ b/src/squirrel-1.4/scripts/squirrel
@@ -0,0 +1,73 @@
+#! /bin/csh
+#
+# squirrel
+#     A small brown furry animal
+# 
+# Usage
+#     squirrel machine_letter date
+#
+
+#
+# Define the project
+#
+set PROJECT = TESTDB
+
+#
+# Define the database to use if different from default
+#
+#setenv SQUIRREL_SUB H4
+
+
+onintr abort
+
+# Usage
+if ( $#argv != 2 ) then
+	echo "Usage: squirrel machine_letter date"
+	exit 2
+endif
+
+set transfer_directory = "$HOME/Mac-files-$USER/Results $1 $2"
+if ( ! -d "$transfer_directory" ) then
+	echo "squirrel: Bad folder machine or date"
+	exit 1
+endif
+
+#
+# Where the brown and furry things are
+#
+if (! $?SQUIRREL) then
+	echo "Error - environment variable SQUIRREL not defined"
+	#setenv SQUIRREL /nfs/al/cb0/wormpub/BioSW/squirrel
+	exit 1
+endif
+setenv EDITOR emacs
+
+#
+# Programs and scripts
+#
+# to assemble automatically
+# 	set PROCESS = $SQUIRREL/scripts/do-it.csh
+# to process without assembly
+# 	set PROCESS = $SQUIRREL/scripts/do-it-no-assembly.csh
+set PROCESS = $SQUIRREL/scripts/do-it.csh
+
+#
+# The real work starts here
+#
+$PROCESS "$transfer_directory" $PROJECT
+if ( $status ) then
+	echo "squirrel: Error occurred during processing."
+	goto abort
+endif
+
+#
+# Successful completion
+#
+exit 0
+
+abort:
+#
+# Interruption not normal termination
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/tidy-files.csh b/src/squirrel-1.4/scripts/tidy-files.csh
new file mode 100644
index 0000000..df7f934
--- /dev/null
+++ b/src/squirrel-1.4/scripts/tidy-files.csh
@@ -0,0 +1,65 @@
+#! /bin/csh -f
+#
+# tidy-files
+#     Remove processed (not abandoned) files from transfer directory
+#
+# Usage:
+#     tidy-files session-file
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 1 ) then
+	echo "Usage: tidy-files session-file | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+if ( ! -e $1 ) then
+	echo tidy-files: Session log file does not exist | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+#
+# Programs and scripts and things
+#
+
+#
+# Temporary files
+#
+set includes = _incl.$$
+set commands = _comm.$$
+/bin/rm -f $includes $commands
+
+echo "" | tee -a "$SQUIRREL_LOG"
+echo Tidying up transfer directory | tee -a "$SQUIRREL_LOG"
+
+#
+# Remove unneeded files in directory
+#
+awk 'BEGIN {FS=":";} /^#/ {next;} /Abandon/ {next;} {print $1;}' $1 > $includes
+sed 's/^/\/bin\/rm -f "/;s/$/"/' $includes > $commands
+source $commands
+
+/bin/rm -f $1
+/bin/rm _*
+
+
+#
+# Remove temporary files
+#
+/bin/rm -f $includes $commands
+
+
+
+exit 0
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/transfer-files.csh b/src/squirrel-1.4/scripts/transfer-files.csh
new file mode 100644
index 0000000..d8a5930
--- /dev/null
+++ b/src/squirrel-1.4/scripts/transfer-files.csh
@@ -0,0 +1,141 @@
+#! /bin/csh -f
+#
+# transfer-files
+#   Copies files specified by a file of file names from the current
+#   directory to the project_directory, making file format changes
+#   where necessary.
+#
+#   When running with dap experiment files are converted to xdap files.
+#
+#   Usage:
+#      transfer-files project-directory in_fofn out_fofn
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 3 ) then
+	echo "Usage: transfer-files project-directory in_fofn out_fofn" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+
+
+if ( ! -d $1 ) then
+	echo "transfer-files: Project directory $1 does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( ! -e $2 ) then
+	echo "transfer-files: File of file names $2 does not exist" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+if ( -e $1/$3 ) then
+	echo "transfer-files: File of file names $1/$3 already exists" | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+set CONVERT_EXP_FILE = $SQUIRREL/scripts/exp2dap.awk
+
+#
+# Save typing if need to change
+#
+set FAILURES = $1/Failures
+set EXPFILES = $1/Expfiles
+
+# Repository for experiment files and failures
+if (! -e ${EXPFILES}) mkdir ${EXPFILES}
+if (! -e ${FAILURES}) mkdir ${FAILURES}
+
+echo "" | tee -a "$SQUIRREL_LOG"
+echo "Transferring files from $cwd to $1" | tee -a "$SQUIRREL_LOG"
+touch $1/$3
+foreach file ( `cat $2` )
+
+	# Check file exists in transfer directory
+	if (! -e $file ) then
+		# check it is processed
+		if ( ! -e ${EXPFILES}/$file ) then
+			echo Abandoned: ${file}: Problems transferring | tee -a "$SQUIRREL_LOG"
+			goto skip
+		endif
+
+		echo transfer-files: warning - $file already transferred
+		#		set PS = `awk '/^PS/ {$1="";line = substr($0,2);} END {print line;}' ${EXPFILES}/$file`
+		#		goto recover
+		goto skip
+	endif
+
+	# Ensure files don't already exist
+	/bin/rm -f $1/$file ${EXPFILES}/$file ${FAILURES}/$file
+
+	# Determine directory for sequence files and trace files
+	set PS = `awk '/^PS/ {$1="";line = substr($0,2);} END {print line;}' $file`
+
+	if ( "$PS" == "" ) then
+		set dir = "$1"		
+	else
+		if ( $PS[1] == "Matches" ) then
+			set dir = "$1"		
+		else
+			set dir = "${FAILURES}"
+		endif
+	endif
+
+
+	# Convert experiment file
+	awk -f $CONVERT_EXP_FILE $file > $dir/$file
+	if ( $status ) then
+		echo Abandoned: ${file}: Converting experiment file 
+		/bin/rm -f $dir/$file
+		goto skip
+	endif
+
+	# Transfer Local Trace file
+	set LN = `awk '/^LN/ {$1="";line = substr($0,2);} END {print line;}' $file`
+	if ( "$LN" != "" ) then
+		/bin/cp $LN $dir
+		if ( $status ) then
+			echo Abandoned: ${file}: Transferring trace file 
+			/bin/rm -f $dir/$LN
+			goto skip
+		endif
+	endif
+
+	# Copy experiment file
+	# This should be the last operation before removal of files
+	/bin/cp $file ${EXPFILES}
+	if ( $status ) then
+		echo "transfer-files.csh: failed to copy experiment file"
+	endif
+
+	# Get rid of files
+	/bin/rm -f $file $LN
+
+recover:
+	# If not marked for failure add to file of file names
+	if ( "$PS" == "" ) then
+		echo $file >> $1/$3
+	else
+		if ( $PS[1] == "Matches" ) then
+			# have a match with a vector... add to screen file
+			set failf = "$1"/Screen.$PS[4]
+			if ( ! -e "$failf" ) touch "$failf"
+			echo $file >> "$failf"
+		endif
+	endif
+
+skip:
+end
+
+exit 0
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/scripts/trim-seq.awk b/src/squirrel-1.4/scripts/trim-seq.awk
new file mode 100644
index 0000000..158d0f4
--- /dev/null
+++ b/src/squirrel-1.4/scripts/trim-seq.awk
@@ -0,0 +1,90 @@
+#
+# Clip sequence on basis of sequence content -
+#
+#	(xdap version)
+#
+# Rule: everything is good quality up to where NN uncertainties in
+#       MM window, in the base range LL..RR
+#
+# Outputs new experiment file with extra clip line to stdout
+#
+# St.Louis use NN=2, MM=5, LL=200, RR=sequence-length
+#
+
+BEGIN {
+	# Up to where NN uncertainties in MM window starting at base SS
+	NN = 2;
+	MM = 5;
+	LL = 200;
+	RR = 400;
+}
+
+
+/;</ { lseq = lseq substr($0,3); next; }
+/;>/ { rseq = rseq substr($0,3); next; }
+/;/  { header = $0; next; }
+     { mseq = mseq $0; next; }
+
+END {
+	seq = lseq mseq rseq;
+
+	seqlen = length(seq);
+	lcut = length(lseq);
+
+	if (RR > seqlen) RR = seqlen;
+	rcut = lcut + length(mseq) + 1;
+	#	rcut = RR+1;
+	if ( NN>0 ) {
+		score = 0;
+		for(i=LL-MM+1; i<=RR-MM+1; i++) {
+			if (i>=MM) {
+				if (index("ACGTacgt",substr(seq,i,1)) == 0) score--
+			}
+			if (i+MM-1<=RR) {
+				if (index("ACGTacgt",substr(seq,i+MM-1,1)) == 0) score++;
+			}
+			if (score >= NN) {
+				rcut = i;
+				break;
+			}
+		}
+	}
+
+	# Write dap file header
+	if (length(header)==0) {
+		printf(";%6d%6d%6d%-4s%-18s\n",seqlen,lcut,rcut-lcut-1,"PLN","");
+	} else {
+		printf(";%6d%6d%6d%s\n",seqlen,lcut,rcut-lcut-1,substr(header,20));
+	}
+
+	# Write left cutoffs (if any)
+	if(lcut>0) {
+		str = substr(seq,1,lcut);
+		strlen = lcut;
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";<%s\n",substr(str,i,bits));
+		}
+	}
+
+	# Write right cutoffs (if any)
+	if(rcut<=seqlen) {
+		str = substr(seq,rcut);
+		strlen = length(str);
+		bits = 48;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf (";>%s\n",substr(str,i,bits));
+		}
+	}
+
+	# Write remaining sequence (if any!)
+	if (lcut+1 != rcut) {
+		str = substr(seq,lcut+1,rcut-lcut-1);
+		strlen = length(str);
+		bits = 50;
+		for (i = 1; i <= strlen; i+=bits) {
+			printf ("%s\n",substr(str,i,bits));
+		}
+	}
+
+}
diff --git a/src/squirrel-1.4/scripts/vepe-fails.csh b/src/squirrel-1.4/scripts/vepe-fails.csh
new file mode 100644
index 0000000..90dfa95
--- /dev/null
+++ b/src/squirrel-1.4/scripts/vepe-fails.csh
@@ -0,0 +1,113 @@
+#! /bin/csh -f
+#
+# vepe-fails
+#    Parse a vepe failure file and report errors
+#
+# Usage:
+#	vepe-fails vepe_fail_file vepe_task
+#
+# where vepe_task is
+#    1 Mark sequencing vector
+#    2 Mark cloning vector
+#
+
+onintr abort
+if (! $?SQUIRREL_LOG) setenv SQUIRREL_LOG /dev/null
+
+#
+# Check usage
+#
+if ( $#argv != 2 ) then
+	echo "Usage: vepe-fails vepe_fail_file vepe_task" | tee -a "$SQUIRREL_LOG"
+	exit 2
+endif
+if ( ! -f $1 ) then
+	echo vepe-fails: failure file does not exist | tee -a "$SQUIRREL_LOG"
+	exit 1
+endif
+
+
+
+#
+# Strip out failures and reasons
+#
+set expfiles = `awk '{print $1;}' $1`
+set errcodes = `awk '{if (NF==2)print $2;else print -1;}' $1`
+
+while ( $#expfiles )
+
+	set expname = $expfiles[1]
+	set expfile = $expfiles[1]
+	set errcode = $errcodes[1]
+
+	switch(${errcode})
+	case 1:
+		set reason = "Couldn't open experiment file"
+		breaksw;
+	case 2:
+		set reason = "No reading"
+		breaksw;
+	case 3:
+		set reason = "Reading too short for vep"
+		breaksw;
+	case 4:
+		set reason = "No vector file name"
+		breaksw;
+	case 5:
+		set reason = "No cloning site"
+		breaksw;
+	case 6:
+		set reason = "No priming site"
+		breaksw;
+	case 7:
+		set reason = "Couldn't open vector file"
+		breaksw;
+	case 8:
+		set reason = "Error writing to experiment file"
+		breaksw;
+	case 9:
+		switch ( $2 )
+		case 1:
+			set reason = "Completely sequencing vector"
+			breaksw
+		case 2:
+			set reason = "Completely cloning vector"
+			breaksw
+		default:
+			set reason = "Completely vector"
+			breaksw
+		endsw
+		breaksw;
+	default:
+		set reason = "Reason unknown" #Unknown
+		breaksw;
+	endsw
+
+	if ( ${errcode} == 3  || ${errcode} == 9 ) then
+		# vep failures
+		echo Failed: ${expname}: $reason | tee -a "$SQUIRREL_LOG"
+		if (-e ${expfile}) echo "PS   $reason" >> ${expfile}
+	else
+		# vep abandons
+		echo Abandoned: ${expname}: Vep failed : $reason | tee -a "$SQUIRREL_LOG"
+	endif
+
+
+	# Cycle
+	shift expfiles
+	shift errcodes
+end
+
+
+
+
+
+exit 0
+
+
+abort:
+#
+# interruption not normal exit
+#
+
+exit 1
diff --git a/src/squirrel-1.4/seqs/Alu.seq b/src/squirrel-1.4/seqs/Alu.seq
new file mode 100644
index 0000000..718b19a
--- /dev/null
+++ b/src/squirrel-1.4/seqs/Alu.seq
@@ -0,0 +1,8 @@
+;Alu consensus ex PROTOTYPIC SEQENCES FOR HUMAN REPETITIVE DNA
+ aatggcagaatgacacttcncaaaagaagacatttatgggccgggcgctttggctcacgc
+ cctgtaatcccagcactttgggaggccgaggcgggcggatcacgaggtcaggagatcgag
+ accatcccggctaaaacggtgaaaccccgtctctactaaaactacaaaaaatagccgggc
+ gtagtgcgggcgcctgtagtcctagctacttgggaggctgaggcaggagaatggcgtgaa
+ cccgggaggcggagcttgcagtgagccgagatcccgccactgcactccagcctgggtgac
+ agagcgagactccatctcaaaaaaaaaaaaaaaaaaaaaaaaaaaaagaagacatttatg
+ cagccaaaaaacacatgaaaaatgctcatcatcagagaaggcaatcaaaccacatg
diff --git a/src/squirrel-1.4/seqs/README b/src/squirrel-1.4/seqs/README
new file mode 100644
index 0000000..458277c
--- /dev/null
+++ b/src/squirrel-1.4/seqs/README
@@ -0,0 +1,4 @@
+ Dir: $SQUIRREL/seqs/
+Date: Fri May 07 12:10:51 BST 1993
+
+In this directory should be placed any sequences squirrel requires.
diff --git a/src/squirrel-1.4/seqs/blue.vep b/src/squirrel-1.4/seqs/blue.vep
new file mode 100644
index 0000000..1a48a74
--- /dev/null
+++ b/src/squirrel-1.4/seqs/blue.vep
@@ -0,0 +1,55 @@
+GACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATA
+ATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGG
+AACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCA
+TGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGT
+ATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATT
+TTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATG
+CTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAAC
+AGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGAT
+GAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACG
+CCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTG
+GTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGT
+AAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCA
+ACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTG
+CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCT
+GAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAA
+TGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCT
+TCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACC
+ACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTG
+GAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT
+GGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAAC
+TATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTA
+AGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGAT
+TTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGA
+TAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGT
+CAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTG
+CGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGT
+TTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCT
+TCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTA
+GGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCT
+AATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCG
+GGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGA
+ACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGA
+ACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAG
+GGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAG
+CGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGT
+CGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAG
+GGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTC
+CTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCC
+TGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTC
+GCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAA
+GAGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTA
+ATGCAGCTGGCACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCA
+ACGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACAC
+TTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATT
+TCACACAGGAAACAGCTATGACCATGATTACGCCAAGCTCGGAATTAACC
+CTCACTAAAGGGAACAAAAGCTTGCATGCCTGCAGGTCGACTCTAGAGGA
+TCCCCGGGTACCGAGCTCGAATTCGCCCTATAGTGAGTCGTATTACAATT
+CACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACC
+CAACTTAATCGCCTTGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAG
+CGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATG
+GCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATT
+TCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATA
+GTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGG
+CTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGG
+AGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGA
diff --git a/src/squirrel-1.4/seqs/lambda.seq b/src/squirrel-1.4/seqs/lambda.seq
new file mode 100644
index 0000000..c49bf21
--- /dev/null
+++ b/src/squirrel-1.4/seqs/lambda.seq
@@ -0,0 +1,810 @@
+<---LAMBDA.001----->
+GGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCGTTTCCG
+TTCTTCTTCGTCATAACTTAATGTTTTTATTTAAAATACCCTCTGAAAAGAAAGGAAACG
+ACAGGTGCTGAAAGCGAGGCTTTTTGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGTCCGT
+GGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCG
+TACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAA
+TGAGGTGCTTTATGACTCTGCCGCCGTCATAAAATGGTATGCCGAAAGGGATGCTGAAAT
+TGAGAACGAAAAGCTGCGCCGGGAGGTTGAAGAACTGCGGCAGGCCAGCGAGGCAGATCT
+CCAGCCAGGAACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACA
+GGAACTGAAGAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGT
+GCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGCTCCCCCTGTCGGTGCA
+GCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAA
+AGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATAT
+CGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCA
+GGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAAT
+CCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGA
+TGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTCGGTTATTCCA
+AAATGCTGCTGGGTGTTTATGCCTACTTTATAGAGCATAAGCAGCGCAACACCCTTATCT
+GGTTGCCGACGGATGGTGATGCCGAGAACTTTATGAAAACCCACGTTGAGCCGACTATTC
+GTGATATTCCGTCGCTGCTGGCGCTGGCCCCGTGGTATGGCAAAAAGCACCGGGATAACA
+CGCTCACCATGAAGCGTTTCACTAATGGGCGTGGCTTCTGGTGCCTGGGCGGTAAAGCGG
+CAAAAAACTACCGTGAAAAGTCGGTGGATGTGGCGGGTTATGATGAACTTGCTGCTTTTG
+ATGATGATATTGAACAGGAAGGCTCTCCGACGTTCCTGGGTGACAAGCGTATTGAAGGCT
+CGGTCTGGCCAAAGTCCATCCGTGGCTCCACGCCAAAAGTGAGAGGCACCTGTCAGATTG
+AGCGTGCAGCCAGTGAATCCCCGCATTTTATGCGTTTTCATGTTGCCTGCCCGCATTGCG
+GGGAGGAGCAGTATCTTAAATTTGGCGACAAAGAGACGCCGTTTGGCCTCAAATGGACGC
+CGGATGACCCCTCCAGCGTGTTTTATCTCTGCGAGCATAATGCCTGCGTCATCCGCCAGC
+AGGAGCTGGACTTTACTGATGCCCGTTATATCTGCGAAAAGACCGGGATCTGGACCCGTG
+ATGGCATTCTCTGGTTTTCGTCATCCGGTGAAGAGATTGAGCCACCTGACAGTGTGACCT
+TTCACATCTGGACAGCGTACAGCCCGTTCACCACCTGGGTGCAGATTGTCAAAGACTGGA
+TGAAAACGAAAGGGGATACGGGAAAACGTAAAACCTTCGTAAACACCACGCTCGGTGAGA
+CGTGGGAGGCGAAAATTGGCGAACGTCCGGATGCTGAAGTGATGGCAGAGCGGAAAGAGC
+ATTATTCAGCGCCCGTTCCTGACCGTGTGGCTTACCTGACCGCCGGTATCGACTCCCAGC
+TGGACCGCTACGAAATGCGCGTATGGGGATGGGGGCCGGGTGAGGAAAGCTGGCTGATTG
+ACCGGCAGATTATTATGGGCCGCCACGACGATGAACAGACGCTGCTGCGTGTGGATGAGG
+CCATCAATAAAACCTATACCCGCCGGAATGGTGCAGAAATGTCGATATCCCGTATCTGCT
+GGGATACTGGCGGGATTGACCCGACCATTGTGTATGAACGCTCGAAAAAACATGGGCTGT
+TCCGGGTGATCCCCATTAAAGGGGCATCCGTCTACGGAAAGCCGGTGGCCAGCATGCCAC
+GTAAGCGAAACAAAAACGGGGTTTACCTTACCGAAATCGGTACGGATACCGCGAAAGAGC
+AGATTTATAACCGCTTCACACTGACGCCGGAAGGGGATGAACCGCTTCCCGGTGCCGTTC
+ACTTCCCGAATAACCCGGATATTTTTGATCTGACCGAAGCGCAGCAGCTGACTGCTGAAG
+AGCAGGTCGAAAAATGGGTGGATGGCAGGAAAAAAATACTGTGGGACAGCAAAAAGCGAC
+GCAATGAGGCACTCGACTGCTTCGTTTATGCGCTGGCGGCGCTGCGCATCAGTATTTCCC
+GCTGGCAGCTGGATCTCAGTGCGCTGCTGGCGAGCCTGCAGGAAGAGGATGGTGCAGCAA
+CCAACAAGAAAACACTGGCAGATTACGCCCGTGCCTTATCCGGAGAGGATGAATGACGCG
+ACAGGAAGAACTTGCCGCTGCCCGTGCGGCACTGCATGACCTGATGACAGGTAAACGGGT
+GGCAACAGTACAGAAAGACGGACGAAGGGTGGAGTTTACGGCCACTTCCGTGTCTGACCT
+GAAAAAATATATTGCAGAGCTGGAAGTGCAGACCGGCATGACACAGCGACGCAGGGGACC
+TGCAGGATTTTATGTATGAAAACGCCCACCATTCCCACCCTTCTGGGGCCGGACGGCATG
+ACATCGCTGCGCGAATATGCCGGTTATCACGGCGGTGGCAGCGGATTTGGAGGGCAGTTG
+CGGTCGTGGAACCCACCGAGTGAAAGTGTGGATGCAGCCCTGTTGCCCAACTTTACCCGT
+GGCAATGCCCGCGCAGACGATCTGGTACGCAATAACGGCTATGCCGCCAACGCCATCCAG
+CTGCATCAGGATCATATCGTCGGGTCTTTTTTCCGGCTCAGTCATCGCCCAAGCTGGCGC
+TATCTGGGCATCGGGGAGGAAGAAGCCCGTGCCTTTTCCCGCGAGGTTGAAGCGGCATGG
+AAAGAGTTTGCCGAGGATGACTGCTGCTGCATTGACGTTGAGCGAAAACGCACGTTTACC
+ATGATGATTCGGGAAGGTGTGGCCATGCACGCCTTTAACGGTGAACTGTTCGTTCAGGCC
+ACCTGGGATACCAGTTCGTCGCGGCTTTTCCGGACACAGTTCCGGATGGTCAGCCCGAAG
+CGCATCAGCAACCCGAACAATACCGGCGACAGCCGGAACTGCCGTGCCGGTGTGCAGATT
+AATGACAGCGGTGCGGCGCTGGGATATTACGTCAGCGAGGACGGGTATCCTGGCTGGATG
+CCGCAGAAATGGACATGGATACCCCGTGAGTTACCCGGCGGGCGCGCCTCGTTCATTCAC
+GTTTTTGAACCCGTGGAGGACGGGCAGACTCGCGGTGCAAATGTGTTTTACAGCGTGATG
+GAGCAGATGAAGATGCTCGACACGCTGCAGAACACGCAGCTGCAGAGCGCCATTGTGAAG
+GCGATGTATGCCGCCACCATTGAGAGTGAGCTGGATACGCAGTCAGCGATGGATTTTATT
+CTGGGCGCGAACAGTCAGGAGCAGCGGGAAAGGCTGACCGGCTGGATTGGTGAAATTGCC
+GCGTATTACGCCGCAGCGCCGGTCCGGCTGGGAGGCGCAAAAGTACCGCACCTGATGCCG
+GGTGACTCACTGAACCTGCAGACGGCTCAGGATACGGATAACGGCTACTCCGTGTTTGAG
+CAGTCACTGCTGCGGTATATCGCTGCCGGGCTGGGTGTCTCGTATGAGCAGCTTTCCCGG
+AATTACGCCCAGATGAGCTACTCCACGGCACGGGCCAGTGCGAACGAGTCGTGGGCGTAC
+TTTATGGGGCGGCGAAAATTCGTCGCATCCCGTCAGGCGAGCCAGATGTTTCTGTGCTGG
+CTGGAAGAGGCCATCGTTCGCCGCGTGGTGACGTTACCTTCAAAAGCGCGCTTCAGTTTT
+CAGGAAGCCCGCAGTGCCTGGGGGAACTGCGACTGGATAGGCTCCGGTCGTATGGCCATC
+GATGGTCTGAAAGAAGTTCAGGAAGCGGTGATGCTGATAGAAGCCGGACTGAGTACCTAC
+GAGAAAGAGTGCGCAAAACGCGGTGACGACTATCAGGAAATTTTTGCCCAGCAGGTCCGT
+GAAACGATGGAGCGCCGTGCAGCCGGTCTTAAACCGCCCGCCTGGGCGGCTGCAGCATTT
+GAATCCGGGCTGCGACAATCAACAGAGGAGGAGAAGAGTGACAGCAGAGCTGCGTAATCT
+CCCGCATATTGCCAGCATGGCCTTTAATGAGCCGCTGATGCTTGAACCCGCCTATGCGCG
+GGTTTTCTTTTGTGCGCTTGCAGGCCAGCTTGGGATCAGCAGCCTGACGGATGCGGTGTC
+CGGCGACAGCCTGACTGCCCAGGAGGCACTCGCGACGCTGGCATTATCCGGTGATGATGA
+CGGACCACGACAGGCCCGCAGTTATCAGGTCATGAACGGCATCGCCGTGCTGCCGGTGTC
+CGGCACGCTGGTCAGCCGGACGCGGGCGCTGCAGCCGTACTCGGGGATGACCGGTTACAA
+CGGCATTATCGCCCGTCTGCAACAGGCTGCCAGCGATCCGATGGTGGACGGCATTCTGCT
+CGATATGGACACGCCCGGCGGGATGGTGGCGGGGGCATTTGACTGCGCTGACATCATCGC
+CCGTGTGCGTGACATAAAACCGGTATGGGCGCTTGCCAACGACATGAACTGCAGTGCAGG
+TCAGTTGCTTGCCAGTGCCGCCTCCCGGCGTCTGGTCACGCAGACCGCCCGGACAGGCTC
+CATCGGCGTCATGATGGCTCACAGTAATTACGGTGCTGCGCTGGAGAAACAGGGTGTGGA
+AATCACGCTGATTTACAGCGGCAGCCATAAGGTGGATGGCAACCCCTACAGCCATCTTCC
+GGATGACGTCCGGGAGACACTGCAGTCCCGGATGGACGCAACCCGCCAGATGTTTGCGCA
+GAAGGTGTCGGCATATACCGGCCTGTCCGTGCAGGTTGTGCTGGATACCGAGGCTGCAGT
+GTACAGCGGTCAGGAGGCCATTGATGCCGGACTGGCTGATGAACTTGTTAACAGCACCGA
+TGCGATCACCGTCATGCGTGATGCACTGGATGCACGTAAATCCCGTCTCTCAGGAGGGCG
+AATGACCAAAGAGACTCAATCAACAACTGTTTCAGCCACTGCTTCGCAGGCTGACGTTAC
+TGACGTGGTGCCAGCGACGGAGGGCGAGAACGCCAGCGCGGCGCAGCCGGACGTGAACGC
+GCAGATCACCGCAGCGGTTGCGGCAGAAAACAGCCGCATTATGGGGATCCTCAACTGTGA
+GGAGGCTCACGGACGCGAAGAACAGGCACGCGTGCTGGCAGAAACCCCCGGTATGACCGT
+GAAAACGGCCCGCCGCATTCTGGCCGCAGCACCACAGAGTGCACAGGCGCGCAGTGACAC
+TGCGCTGGATCGTCTGATGCAGGGGGCACCGGCACCGCTGGCTGCAGGTAACCCGGCATC
+TGATGCCGTTAACGATTTGCTGAACACACCAGTGTAAGGGATGTTTATGACGAGCAAAGA
+AACCTTTACCCATTACCAGCCGCAGGGCAACAGTGACCCGGCTCATACCGCAACCGCGCC
+CGGCGGATTGAGTGCGAAAGCGCCTGCAATGACCCCGCTGATGCTGGACACCTCCAGCCG
+TAAGCTGGTTGCGTGGGATGGCACCACCGACGGTGCTGCCGTTGGCATTCTTGCGGTTGC
+TGCTGACCAGACCAGCACCACGCTGACGTTCTACAAGTCCGGCACGTTCCGTTATGAGGA
+TGTGCTCTGGCCGGAGGCTGCCAGCGACGAGACGAAAAAACGGACCGCGTTTGCCGGAAC
+GGCAATCAGCATCGTTTAACTTTACCCTTCATCACTAAAGGCCGCCTGTGCGGCTTTTTT
+TACGGGATTTTTTTATGTCGATGTACACAACCGCCCAACTGCTGGCGGCAAATGAGCAGA
+AATTTAAGTTTGATCCGCTGTTTCTGCGTCTCTTTTTCCGTGAGAGCTATCCCTTCACCA
+CGGAGAAAGTCTATCTCTCACAAATTCCGGGACTGGTAAACATGGCGCTGTACGTTTCGC
+CGATTGTTTCCGGTGAGGTTATCCGTTCCCGTGGCGGCTCCACCTCTGAATTTACGCCGG
+GATATGTCAAGCCGAAGCATGAAGTGAATCCGCAGATGACCCTGCGTCGCCTGCCGGATG
+AAGATCCGCAGAATCTGGCGGACCCGGCTTACCGCCGCCGTCGCATCATCATGCAGAACA
+TGCGTGACGAAGAGCTGGCCATTGCTCAGGTCGAAGAGATGCAGGCAGTTTCTGCCGTGC
+TTAAGGGCAAATACACCATGACCGGTGAAGCCTTCGATCCGGTTGAGGTGGATATGGGCC
+GCAGTGAGGAGAATAACATCACGCAGTCCGGCGGCACGGAGTGGAGCAAGCGTGACAAGT
+CCACGTATGACCCGACCGACGATATCGAAGCCTACGCGCTGAACGCCAGCGGTGTGGTGA
+ATATCATCGTGTTCGATCCGAAAGGCTGGGCGCTGTTCCGTTCCTTCAAAGCCGTCAAGG
+AGAAGCTGGATACCCGTCGTGGCTCTAATTCCGAGCTGGAGACAGCGGTGAAAGACCTGG
+GCAAAGCGGTGTCCTATAAGGGGATGTATGGCGATGTGGCCATCGTCGTGTATTCCGGAC
+AGTACGTGGAAAACGGCGTCAAAAAGAACTTCCTGCCGGACAACACGATGGTGCTGGGGA
+ACACTCAGGCACGCGGTCTGCGCACCTATGGCTGCATTCAGGATGCGGACGCACAGCGCG
+AAGGCATTAACGCCTCTGCCCGTTACCCGAAAAACTGGGTGACCACCGGCGATCCGGCGC
+GTGAGTTCACCATGATTCAGTCAGCACCGCTGATGCTGCTGGCTGACCCTGATGAGTTCG
+TGTCCGTACAACTGGCGTAATCATGGCCCTTCGGGGCCATTGTTTCTCTGTGGAGGAGTC
+CATGACGAAAGATGAACTGATTGCCCGTCTCCGCTCGCTGGGTGAACAACTGAACCGTGA
+TGTCAGCCTGACGGGGACGAAAGAAGAACTGGCGCTCCGTGTGGCAGAGCTGAAAGAGGA
+GCTTGATGACACGGATGAAACTGCCGGTCAGGACACCCCTCTCAGCCGGGAAAATGTGCT
+GACCGGACATGAAAATGAGGTGGGATCAGCGCAGCCGGATACCGTGATTCTGGATACGTC
+TGAACTGGTCACGGTCGTGGCACTGGTGAAGCTGCATACTGATGCACTTCACGCCACGCG
+GGATGAACCTGTGGCATTTGTGCTGCCGGGAACGGCGTTTCGTGTCTCTGCCGGTGTGGC
+AGCCGAAATGACAGAGCGCGGCCTGGCCAGAATGCAATAACGGGAGGCGCTGTGGCTGAT
+TTCGATAACCTGTTCGATGCTGCCATTGCCCGCGCCGATGAAACGATACGCGGGTACATG
+GGAACGTCAGCCACCATTACATCCGGTGAGCAGTCAGGTGCGGTGATACGTGGTGTTTTT
+GATGACCCTGAAAATATCAGCTATGCCGGACAGGGCGTGCGCGTTGAAGGCTCCAGCCCG
+TCCCTGTTTGTCCGGACTGATGAGGTGCGGCAGCTGCGGCGTGGAGACACGCTGACCATC
+GGTGAGGAAAATTTCTGGGTAGATCGGGTTTCGCCGGATGATGGCGGAAGTTGTCATCTC
+TGGCTTGGACGGGGCGTACCGCCTGCCGTTAACCGTCGCCGCTGAAAGGGGGATGTATGG
+CCATAAAAGGTCTTGAGCAGGCCGTTGAAAACCTCAGCCGTATCAGCAAAACGGCGGTGC
+CTGGTGCCGCCGCAATGGCCATTAACCGCGTTGCTTCATCCGCGATATCGCAGTCGGCGT
+CACAGGTTGCCCGTGAGACAAAGGTACGCCGGAAACTGGTAAAGGAAAGGGCCAGGCTGA
+AAAGGGCCACGGTCAAAAATCCGCAGGCCAGAATCAAAGTTAACCGGGGGGATTTGCCCG
+TAATCAAGCTGGGTAATGCGCGGGTTGTCCTTTCGCGCCGCAGGCGTCGTAAAAAGGGGC
+AGCGTTCATCCCTGAAAGGTGGCGGCAGCGTGCTTGTGGTGGGTAACCGTCGTATTCCCG
+GCGCGTTTATTCAGCAACTGAAAAATGGCCGGTGGCATGTCATGCAGCGTGTGGCTGGGA
+AAAACCGTTACCCCATTGATGTGGTGAAAATCCCGATGGCGGTGCCGCTGACCACGGCGT
+TTAAACAAAATATTGAGCGGATACGGCGTGAACGTCTTCCGAAAGAGCTGGGCTATGCGC
+TGCAGCATCAACTGAGGATGGTAATAAAGCGATGAAACATACTGAACTCCGTGCAGCCGT
+ACTGGATGCACTGGAGAAGCATGACACCGGGGCGACGTTTTTTGATGGTCGCCCCGCTGT
+TTTTGATGAGGCGGATTTTCCGGCAGTTGCCGTTTATCTCACCGGCGCTGAATACACGGG
+CGAAGAGCTGGACAGCGATACCTGGCAGGCGGAGCTGCATATCGAAGTTTTCCTGCCTGC
+TCAGGTGCCGGATTCAGAGCTGGATGCGTGGATGGAGTCCCGGATTTATCCGGTGATGAG
+CGATATCCCGGCACTGTCAGATTTGATCACCAGTATGGTGGCCAGCGGCTATGACTACCG
+GCGCGACGATGATGCGGGCTTGTGGAGTTCAGCCGATCTGACTTATGTCATTACCTATGA
+AATGTGAGGACGCTATGCCTGTACCAAATCCTACAATGCCGGTGAAAGGTGCCGGGACCA
+CCCTGTGGGTTTATAAGGGGAGCGGTGACCCTTACGCGAATCCGCTTTCAGACGTTGACT
+GGTCGCGTCTGGCAAAAGTTAAAGACCTGACGCCCGGCGAACTGACCGCTGAGTCCTATG
+ACGACAGCTATCTCGATGATGAAGATGCAGACTGGACTGCGACCGGGCAGGGGCAGAAAT
+CTGCCGGAGATACCAGCTTCACGCTGGCGTGGATGCCCGGAGAGCAGGGGCAGCAGGCGC
+TGCTGGCGTGGTTTAATGAAGGCGATACCCGTGCCTATAAAATCCGCTTCCCGAACGGCA
+CGGTCGATGTGTTCCGTGGCTGGGTCAGCAGTATCGGTAAGGCGGTGACGGCGAAGGAAG
+TGATCACCCGCACGGTGAAAGTCACCAATGTGGGACGTCCGTCGATGGCAGAAGATCGCA
+GCACGGTAACAGCGGCAACCGGCATGACCGTGACGCCTGCCAGCACCTCGGTGGTGAAAG
+GGCAGAGCACCACGCTGACCGTGGCCTTCCAGCCGGAGGGCGTAACCGACAAGAGCTTTC
+GTGCGGTGTCTGCGGATAAAACAAAAGCCACCGTGTCGGTCAGTGGTATGACCATCACCG
+TGAACGGCGTTGCTGCAGGCAAGGTCAACATTCCGGTTGTATCCGGTAATGGTGAGTTTG
+CTGCGGTTGCAGAAATTACCGTCACCGCCAGTTAATCCGGAGAGTCAGCGATGTTCCTGA
+AAACCGAATCATTTGAACATAACGGTGTGACCGTCACGCTTTCTGAACTGTCAGCCCTGC
+AGCGCATTGAGCATCTCGCCCTGATGAAACGGCAGGCAGAACAGGCGGAGTCAGACAGCA
+ACCGGAAGTTTACTGTGGAAGACGCCATCAGAACCGGCGCGTTTCTGGTGGCGATGTCCC
+TGTGGCATAACCATCCGCAGAAGACGCAGATGCCGTCCATGAATGAAGCCGTTAAACAGA
+TTGAGCAGGAAGTGCTTACCACCTGGCCCACGGAGGCAATTTCTCATGCTGAAAACGTGG
+TGTACCGGCTGTCTGGTATGTATGAGTTTGTGGTGAATAATGCCCCTGAACAGACAGAGG
+ACGCCGGGCCCGCAGAGCCTGTTTCTGCGGGAAAGTGTTCGACGGTGAGCTGAGTTTTGC
+CCTGAAACTGGCGCGTGAGATGGGGCGACCCGACTGGCGTGCCATGCTTGCCGGGATGTC
+ATCCACGGAGTATGCCGACTGGCACCGCTTTTACAGTACCCATTATTTTCATGATGTTCT
+GCTGGATATGCACTTTTCCGGGCTGACGTACACCGTGCTCAGCCTGTTTTTCAGCGATCC
+GGATATGCATCCGCTGGATTTCAGTCTGCTGAACCGGCGCGAGGCTGACGAAGAGCCTGA
+AGATGATGTGCTGATGCAGAAAGCGGCAGGGCTTGCCGGAGGTGTCCGCTTTGGCCCGGA
+CGGGAATGAAGTTATCCCCGCTTCCCCGGATGTGGCGGACATGACGGAGGATGACGTAAT
+GCTGATGACAGTATCAGAAGGGATCGCAGGAGGAGTCCGGTATGGCTGAACCGGTAGGCG
+ATCTGGTCGTTGATTTGAGTCTGGATGCGGCCAGATTTGACGAGCAGATGGCCAGAGTCA
+GGCGTCATTTTTCTGGTACGGAAAGTGATGCGAAAAAAACAGCGGCAGTCGTTGAACAGT
+CGCTGAGCCGACAGGCGCTGGCTGCACAGAAAGCGGGGATTTCCGTCGGGCAGTATAAAG
+CCGCCATGCGTATGCTGCCTGCACAGTTCACCGACGTGGCCACGCAGCTTGCAGGCGGGC
+AAAGTCCGTGGCTGATCCTGCTGCAACAGGGGGGGCAGGTGAAGGACTCCTTCGGCGGGA
+TGATCCCCATGTTCAGGGGGCTTGCCGGTGCGATCACCCTGCCGATGGTGGGGGCCACCT
+CGCTGGCGGTGGCGACCGGTGCGCTGGCGTATGCCTGGTATCAGGGCAACTCAACCCTGT
+CCGATTTCAACAAAACGCTGGTCCTTTCCGGCAATCAGGCGGGACTGACGGCAGATCGTA
+TGCTGGTCCTGTCCAGAGCCGGGCAGGCGGCAGGGCTGACGTTTAACCAGACCAGCGAGT
+CACTCAGCGCACTGGTTAAGGCGGGGGTAAGCGGTGAGGCTCAGATTGCGTCCATCAGCC
+AGAGTGTGGCGCGTTTCTCCTCTGCATCCGGCGTGGAGGTGGACAAGGTCGCTGAAGCCT
+TCGGGAAGCTGACCACAGACCCGACGTCGGGGCTGACGGCGATGGCTCGCCAGTTCCATA
+ACGTGTCGGCGGAGCAGATTGCGTATGTTGCTCAGTTGCAGCGTTCCGGCGATGAAGCCG
+GGGCATTGCAGGCGGCGAACGAGGCCGCAACGAAAGGGTTTGATGACCAGACCCGCCGCC
+TGAAAGAGAACATGGGCACGCTGGAGACCTGGGCAGACAGGACTGCGCGGGCATTCAAAT
+CCATGTGGGATGCGGTGCTGGATATTGGTCGTCCTGATACCGCGCAGGAGATGCTGATTA
+AGGCAGAGGCTGCGTATAAGAAAGCAGACGACATCTGGAATCTGCGCAAGGATGATTATT
+TTGTTAACGATGAAGCGCGGGCGCGTTACTGGGATGATCGTGAAAAGGCCCGTCTTGCGC
+TTGAAGCCGCCCGAAAGAAGGCTGAGCAGCAGACTCAACAGGACAAAAATGCGCAGCAGC
+AGAGCGATACCGAAGCGTCACGGCTGAAATATACCGAAGAGGCGCAGAAGGCTTACGAAC
+GGCTGCAGACGCCGCTGGAGAAATATACCGCCCGTCAGGAAGAACTGAACAAGGCACTGA
+AAGACGGGAAAATCCTGCAGGCGGATTACAACACGCTGATGGCGGCGGCGAAAAAGGATT
+ATGAAGCGACGCTGAAAAAGCCGAAACAGTCCAGCGTGAAGGTGTCTGCGGGCGATCGTC
+AGGAAGACAGTGCTCATGCTGCCCTGCTGACGCTTCAGGCAGAACTCCGGACGCTGGAGA
+AGCATGCCGGAGCAAATGAGAAAATCAGCCAGCAGCGCCGGGATTTGTGGAAGGCGGAGA
+GTCAGTTCGCGGTACTGGAGGAGGCGGCGCAACGTCGCCAGCTGTCTGCACAGGAGAAAT
+CCCTGCTGGCGCATAAAGATGAGACGCTGGAGTACAAACGCCAGCTGGCTGCACTTGGCG
+ACAAGGTTACGTATCAGGAGCGCCTGAACGCGCTGGCGCAGCAGGCGGATAAATTCGCAC
+AGCAGCAACGGGCAAAACGGGCCGCCATTGATGCGAAAAGCCGGGGGCTGACTGACCGGC
+AGGCAGAACGGGAAGCCACGGAACAGCGCCTGAAGGAACAGTATGGCGATAATCCGCTGG
+CGCTGAATAACGTCATGTCAGAGCAGAAAAAGACCTGGGCGGCTGAAGACCAGCTTCGCG
+GGAACTGGATGGCAGGCCTGAAGTCCGGCTGGAGTGAGTGGGAAGAGAGCGCCACGGACA
+GTATGTCGCAGGTAAAAAGTGCAGCCACGCAGACCTTTGATGGTATTGCACAGAATATGG
+CGGCGATGCTGACCGGCAGTGAGCAGAACTGGCGCAGCTTCACCCGTTCCGTGCTGTCCA
+TGATGACAGAAATTCTGCTTAAGCAGGCAATGGTGGGGATTGTCGGGAGTATCGGCAGCG
+CCATTGGCGGGGCTGTTGGTGGCGGCGCATCCGCGTCAGGCGGTACAGCCATTCAGGCCG
+CTGCGGCGAAATTCCATTTTGCAACCGGAGGATTTACGGGAACCGGCGGCAAATATGAGC
+CAGCGGGGATTGTTCACCGTGGTGAGTTTGTCTTCACGAAGGAGGCAACCAGCCGGATTG
+GCGTGGGGAATCTTTACCGGCTGATGCGCGGCTATGCCACCGGCGGTTATGTCGGTACAC
+CGGGCAGCATGGCAGACAGCCGGTCGCAGGCGTCCGGGACGTTTGAGCAGAATAACCATG
+TGGTGATTAACAACGACGGCACGAACGGGCAGATAGGTCCGGCTGCTCTGAAGGCGGTGT
+ATGACATGGCCCGCAAGGGTGCCCGTGATGAAATTCAGACACAGATGCGTGATGGTGGCC
+TGTTCTCCGGAGGTGGACGATGAAGACCTTCCGCTGGAAAGTGAAACCCGGTATGGATGT
+GGCTTCGGTCCCTTCTGTAAGAAAGGTGCGCTTTGGTGATGGCTATTCTCAGCGAGCGCC
+TGCCGGGCTGAATGCCAACCTGAAAACGTACAGCGTGACGCTTTCTGTCCCCCGTGAGGA
+GGCCACGGTACTGGAGTCGTTTCTGGAAGAGCACGGGGGCTGGAAATCCTTTCTGTGGAC
+GCCGCCTTATGAGTGGCGGCAGATAAAGGTGACCTGCGCAAAATGGTCGTCGCGGGTCAG
+TATGCTGCGTGTTGAGTTCAGCGCAGAGTTTGAACAGGTGGTGAACTGATGCAGGATATC
+CGGCAGGAAACACTGAATGAATGCACCCGTGCGGAGCAGTCGGCCAGCGTGGTGCTCTGG
+GAAATCGACCTGACAGAGGTCGGTGGAGAACGTTATTTTTTCTGTAATGAGCAGAACGAA
+AAAGGTGAGCCGGTCACCTGGCAGGGGCGACAGTATCAGCCGTATCCCATTCAGGGGAGC
+GGTTTTGAACTGAATGGCAAAGGCACCAGTACGCGCCCCACGCTGACGGTTTCTAACCTG
+TACGGTATGGTCACCGGGATGGCGGAAGATATGCAGAGTCTGGTCGGCGGAACGGTGGTC
+CGGCGTAAGGTTTACGCCCGTTTTCTGGATGCGGTGAACTTCGTCAACGGAAACAGTTAC
+GCCGATCCGGAGCAGGAGGTGATCAGCCGCTGGCGCATTGAGCAGTGCAGCGAACTGAGC
+GCGGTGAGTGCCTCCTTTGTACTGTCCACGCCGACGGAAACGGATGGCGCTGTTTTTCCG
+GGACGTATCATGCTGGCCAACACCTGCACCTGGACCTATCGCGGTGACGAGTGCGGTTAT
+AGCGGTCCGGCTGTCGCGGATGAATATGACCAGCCAACGTCCGATATCACGAAGGATAAA
+TGCAGCAAATGCCTGAGCGGTTGTAAGTTCCGCAATAACGTCGGCAACTTTGGCGGCTTC
+CTTTCCATTAACAAACTTTCGCAGTAAATCCCATGACACAGACAGAATCAGCGATTCTGG
+CGCACGCCCGGCGATGTGCGCCAGCGGAGTCGTGCGGCTTCGTGGTAAGCACGCCGGAGG
+GGGAAAGATATTTCCCCTGCGTGAATATCTCCGGTGAGCCGGAGGCTATTTCCGTATGTC
+GCCGGAAGACTGGCTGCAGGCAGAAATGCAGGGTGAGATTGTGGCGCTGGTCCACAGCCA
+CCCCGGTGGTCTGCCCTGGCTGAGTGAGGCCGACCGGCGGCTGCAGGTGCAGAGTGATTT
+GCCGTGGTGGCTGGTCTGCCGGGGGACGATTCATAAGTTCCGCTGTGTGCCGCATCTCAC
+CGGGCGGCGCTTTGAGCACGGTGTGACGGACTGTTACACACTGTTCCGGGATGCTTATCA
+TCTGGCGGGGATTGAGATGCCGGACTTTCATCGTGAGGATGACTGGTGGCGTAACGGCCA
+GAATCTCTATCTGGATAATCTGGAGGCGACGGGGCTGTATCAGGTGCCGTTGTCAGCGGC
+ACAGCCGGGCGATGTGCTGCTGTGCTGTTTTGGTTCATCAGTGCCGAATCACGCCGCAAT
+TTACTGCGGCGACGGCGAGCTGCTGCACCATATTCCTGAACAACTGAGCAAACGAGAGAG
+GTACACCGACAAATGGCAGCGACGCACACACTCCCTCTGGCGTCACCGGGCATGGCGCGC
+ATCTGCCTTTACGGGGATTTACAACGATTTGGTCGCCGCATCGACCTTCGTGTGAAAACG
+GGGGCTGAAGCCATCCGGGCACTGGCCACACAGCTCCCGGCGTTTCGTCAGAAACTGAGC
+GACGGCTGGTATCAGGTACGGATTGCCGGGCGGGACGTCAGCACGTCCGGGTTAACGGCG
+CAGTTACATGAGACTCTGCCTGATGGCGCTGTAATTCATATTGTTCCCAGAGTCGCCGGG
+GCCAAGTCAGGTGGCGTATTCCAGATTGTCCTGGGGGCTGCCGCCATTGCCGGATCATTC
+TTTACCGCCGGAGCCACCCTTGCAGCATGGGGGGCAGCCATTGGGGCCGGTGGTATGACC
+GGCATCCTGTTTTCTCTCGGTGCCAGTATGGTGCTCGGTGGTGTGGCGCAGATGCTGGCA
+CCGAAAGCCAGAACTCCCCGTATACAGACAACGGATAACGGTAAGCAGAACACCTATTTC
+TCCTCACTGGATAACATGGTTGCCCAGGGCAATGTTCTGCCTGTTCTGTACGGGGAAATG
+CGCGTGGGGTCACGCGTGGTTTCTCAGGAGATCAGCACGGCAGACGAAGGGGACGGTGGT
+CAGGTTGTGGTGATTGGTCGCTGATGCAAAATGTTTTATGTGAAACCGCCTGCGGGCGGT
+TTTGTCATTTATGGAGCGTGAGGAATGGGTAAAGGAAGCAGTAAGGGGCATACCCCGCGC
+GAAGCGAAGGACAACCTGAAGTCCACGCAGTTGCTGAGTGTGATCGATGCCATCAGCGAA
+GGGCCGATTGAAGGTCCGGTGGATGGCTTAAAAAGCGTGCTGCTGAACAGTACGCCGGTG
+CTGGACACTGAGGGGAATACCAACATATCCGGTGTCACGGTGGTGTTCCGGGCTGGTGAG
+CAGGAGCAGACTCCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACG
+GAAGTGAAATATGACACGCCGATCACCCGCACCATTACGTCTGCAAACATCGACCGTCTG
+CGCTTTACCTTCGGTGTACAGGCACTGGTGGAAACCACCTCAAAGGGTGACAGGAATCCG
+TCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGAC
+ATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGATGGGTAACCTG
+CCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAG
+CTGCAGAACAAAACGCTCTGGTCGTCATACACTGAAATCATCGATGTGAAACAGTGCTAC
+CCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTG
+AGCCGTAATTATCATCTGCGCGGGCGTATTCTGCAGGTGCCGTCGAACTATAACCCGCAG
+ACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAACCGGCATACAGCAACAACATG
+GCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGT
+GCGGCGGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGTACTGCGACCAGTCAGTG
+CCGGACGGCTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAG
+CGTAAGGCGTGGGATGTGCTCAGCGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGG
+AACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGACGTGGACCTATAAC
+CGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTG
+AAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAACAACGGCTGGGAGACG
+GCGACAGAGCTTGTTGAAGATACGCAGGCCATTGCCCGTTACGGTCGTAATGTTACGAAG
+ATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATT
+AAAACAGAACTGCTGGAAACGCAGACCGTGGATTTCAGCGTCGGCGCAGAAGGGCTTCGC
+CATGTACCGGGCGATGTTATTGAAATCTGCGATGATGACTATGCCGGTATCAGCACCGGT
+GGTCGTGTGCTGGCGGTGAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACG
+CTGCCATCCTCCGGTACCGCGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGC
+GTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGT
+GTTGCTGAATACAGCGTATGGGAGCTGAAGCTGCCGACGCTGCGCCAGCGACTGTTCCGC
+TGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTG
+CCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGAACAGAGTGGCACG
+GTGAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACTGCAGACAGC
+GGGGAATATCAGGTGCTGGCGCGATGGGACACACCGAAGGTGGTGAAGGGCGTGAGTTTC
+CTGCTCCGTCTGACCGTAACAGCGGACGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGG
+ACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTC
+CGGGCGGTAAATGCGTGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCC
+GCACCGGCAGCACCGTCGAGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACG
+CCGCATCTTGCCGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCAG
+ATTGCGGATATCAGACAGGTTGAAACCAGCACGCGTTATCTTGGTACGGCGCTGTACTGG
+ATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTG
+AACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAA
+GGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTG
+GAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAGACTGGAGGAGTTTTCGAAAGAGTGG
+AAGGATGCCAGTGATAAGTGGAATGCCATGTGGGCTGTCAAAATTGAGCAGACCAAAGAC
+GGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAGGAAGGCAAACTG
+AGCCAGTTTCTGGTTGCCGCCAATCGTATCGCATTTATTGACCCGGCAAACGGGAATGAA
+ACGCCGATGTTTGTGGCGCAGGGCAACCAGATATTCATGAACGACGTGTTCCTGAAGCGC
+CTGACGGCCCCCACCATTACCAGCGGCGGCAATCCTCCGGCCTTTTCCCTGACACCGGAC
+GGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCCGGGACG
+CTCAGTAATGTGACGATAGCTGAAAACTGTACGATAAACGGTACGCTGAGGGCGGAAAAA
+ATCGTCGGGGACATTGTAAAGGCGGCGAGCGCGGCTTTTCCGCGCCAGCGTGAAAGCAGT
+GTGGACTGGCCGTCAGGTACCCGTACTGTCACCGTGACCGATGACCATCCTTTTGATCGC
+CAGATAGTGGTGCTTCCGCTGACGTTTCGCGGAAGTAAGCGTACTGTCAGCGGCAGGACA
+ACGTATTCGATGTGTTATCTGAAAGTACTGATGAACGGTGCGGTGATTTATGATGGCGCG
+GCGAACGAGGCGGTACAGGTGTTCTCCCGTATTGTTGACATGCCAGCGGGTCGGGGAAAC
+GTGATCCTGACGTTCACGCTTACGTCCACACGGCATTCGGCAGATATTCCGCCGTATACG
+TTTGCCAGCGATGTGCAGGTTATGGTGATTAAGAAACAGGCGCTGGGCATCAGCGTGGTC
+TGAGTGTGTTACAGAGGTTCGTCCGGGAACGGGCGTTTTATTATAAAACAGTGAGAGGTG
+AACGATGCGTAATGTGTGTATTGCCGTTGCTGTCTTTGCCGCACTTGCGGTGACAGTCAC
+TCCGGCCCGTGCGGAAGGTGGACATGGTACGTTTACGGTGGGCTATTTTCAAGTGAAACC
+GGGTACATTGCCGTCGTTGTCGGGCGGGGATACCGGTGTGAGTCATCTGAAAGGGATTAA
+CGTGAAGTACCGTTATGAGCTGACGGACAGTGTGGGGGTGATGGCTTCCCTGGGGTTCGC
+CGCGTCGAAAAAGAGCAGCACAGTGATGACCGGGGAGGATACGTTTCACTATGAGAGCCT
+GCGTGGACGTTATGTGAGCGTGATGGCCGGACCGGTTTTACAAATCAGTAAGCAGGTCAG
+TGCGTACGCCATGGCCGGAGTGGCTCACAGTCGGTGGTCCGGCAGTACAATGGATTACCG
+TAAGACGGAAATCACTCCCGGGTATATGAAAGAGACGACCACTGCCAGGGACGAAAGTGC
+AATGCGGCATACCTCAGTGGCGTGGAGTGCAGGTATACAGATTAATCCGGCAGCGTCCGT
+CGTTGTTGATATTGCTTATGAAGGCTCCGGCAGTGGCGACTGGCGTACTGACGGATTCAT
+CGTTGGGGTCGGTTATAAATTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCG
+GAACCGGTGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGAGTCCTGAAAG
+ACGGCACAGGAAAACCGGTACAGAACTGCACCATTCAGCTGAAAGCCAGACGTAACAGCA
+CCACGGTGGTGGTGAACACGGTGGGCTCAGAGAATCCGGATGAAGCCGGGCGTTACAGCA
+TGGATGTGGAGTACGGTCAGTACAGTGTCATCCTGCAGGTTGACGGTTTTCCACCATCGC
+ACGCCGGGACCATCACCGTGTATGAAGATTCACAACCGGGGACGCTGAATGATTTTCTCT
+GTGCCATGACGGAGGATGATGCCCGGCCGGAGGTGCTGCGTCGTCTTGAACTGATGGTGG
+AAGAGGTGGCGCGTAACGCGTCCGTGGTGGCACAGAGTACGGCAGACGCGAAGAAATCAG
+CCGGCGATGCCAGTGCATCAGCTGCTCAGGTCGCGGCCCTTGTGACTGATGCAACTGACT
+CAGCACGCGCCGCCAGCACGTCCGCCGGACAGGCTGCATCGTCAGCTCAGGAAGCGTCCT
+CCGGCGCAGAAGCGGCATCAGCAAAGGCCACTGAAGCGGAAAAAAGTGCCGCAGCCGCAG
+AGTCCTCAAAAAACGCGGCGGCCACCAGTGCCGGTGCGGCGAAAACGTCAGAAACGAATG
+CTGCAGCGTCACAACAATCAGCCGCCACGTCTGCCTCCACCGCGGCCACGAAAGCGTCAG
+AGGCCGCCACTTCAGCACGAGATGCGGTGGCCTCAAAAGAGGCAGCAAAATCATCAGAAA
+CGAACGCATCATCAAGTGCCGGTCGTGCAGCTTCCTCGGCAACGGCGGCAGAAAATTCTG
+CCAGGGCGGCAAAAACGTCCGAGACGAATGCCAGGTCATCTGAAACAGCAGCGGAACGGA
+GCGCCTCTGCCGCGGCAGACGCAAAAACAGCGGCGGCGGGGAGTGCGTCAACGGCATCCA
+CGAAGGCGACAGAGGCTGCGGGAAGTGCGGTATCAGCATCGCAGAGCAAAAGTGCGGCAG
+AAGCGGCGGCAATACGTGCAAAAAATTCGGCAAAACGTGCAGAAGATATAGCTTCAGCTG
+TCGCGCTTGAGGATGCGGACACAACGAGAAAGGGGATAGTGCAGCTCAGCAGTGCAACCA
+ACAGCACGTCTGAAACGCTTGCTGCAACGCCAAAGGCGGTTAAGGTGGTAATGGATGAAA
+CGAACAGAAAAGCCCACTGGACAGTCCGGCACTGACCGGAACGCCAACAGCACCAACCGC
+GCTCAGGGGAACAAACAATACCCAGATTGCGAACACCGCTTTTGTACTGGCCGCGATTGC
+AGATGTTATCGACGCGTCACCTGACGCACTGAATACGCTGAATGAACTGGCCGCAGCGCT
+CGGGAATGATCCAGATTTTGCTACCACCATGACTAACGCGCTTGCGGGTAAACAACCGAA
+GAATGCGACACTGACGGCGCTGGCAGGGCTTTCCACGGCGAAAAATAAATTACCGTATTT
+TGCGGAAAATGATGCCGCCAGCCTGACTGAACTGACTCAGGTTGGCAGGGATATTCTGGC
+AAAAAATTCCGTTGCAGATGTTCTTGAATACCTTGGGGCCGGTGAGAATTCGGCCTTTCC
+GGCAGGTGCGCCGATCCCGTGGCCATCAGATATCGTTCCGTCTGGCTACGTCCTGATGCA
+GGGGCAGGCGTTTGACAAATCAGCCTACCCAAAACTTGCTGTCGCGTATCCATCGGGTGT
+GCTTCCTGATATGCGAGGCTGGACAATCAAGGGGAAACCCGCCAGCGGTCGTGCTGTATT
+GTCTCAGGAACAGGATGGAATTAAGTCGCACACCCACAGTGCCAGTGCATCCGGTACGGA
+TTTGGGGACGAAAACCACATCGTCGTTTGATTACGGGACGAAAACAACAGGCAGTTTCGA
+TTACGGCACCAAATCGACGAATAACACGGGGGCTCATGCTCACAGTCTGAGCGGTTCAAC
+AGGGGCCGCGGGTGCTCATGCCCACACAAGTGGTTTAAGGATGAACAGTTCTGGCTGGAG
+TCAGTATGGAACAGCAACCATTACAGGAAGTTTATCCACAGTTAAAGGAACCAGCACACA
+GGGTATTGCTTATTTATCGAAAACGGACAGTCAGGGCAGCCACAGTCACTCATTGTCCGG
+TACAGCCGTGAGTGCCGGTGCACATGCGCATACAGTTGGTATTGGTGCGCACCAGCATCC
+GGTTGTTATCGGTGCTCATGCCCATTCTTTCAGTATTGGTTCACACGGACACACCATCAC
+CGTTAACGCTGCGGGTAACGCGGAAAACACCGTCAAAAACATTGCATTTAACTATATTGT
+GAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAAT
+CTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCGCCTCATACC
+GGTCTGCCTGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTTGTGGCT
+GTTTTCAACAGTGATGAGGCATCGTGGCATCTCGTTGAAGACCATCGGGGTAAAACCGTC
+TATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAAT
+TTTACCTGGTTATCGCCGGGAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAG
+GATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAAAAGCCTG
+ATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATTGCA
+ACGAAGGAAGAAACCTCGTTGCTGGAAGCCTGGAAGAAGTATCGGGTGTTGCTGAACCGT
+GTTGATACATCAACTGCACCTGATATTGAGTGGCCTGCTGTCCCTGTTATGGAGTAATCG
+TTTTGTGATATGCCGCAGAAACGTTGTATGAAATAACGTTCTGCGGTTAGTTAGTATATT
+GTAAAGCTGAGTATTGGTTTATTTGGCGATTATTATCTTCAGGAGAATAATGGAAGTTCT
+ATGACTCAATTGTTCATAGTGTTTACATCACCGCCAATTGCTTTTAAGACTGAACGCATG
+AAATATGGTTTTTCGTCATGTTTTGAGTCTGCTGTTGATATTTCTAAAGTCGGTTTTTTT
+TCTTCGTTTTCTCTAACTATTTTCCATGAAATACATTTTTGATTATTATTTGAATCAATT
+CCAATTACCTGAAGTCTTTCATCTATAATTGGCATTGTATGTATTGGTTTATTGGAGTAG
+ATGCTTGCTTTTCTGAGCCATAGCTCTGATATCCAAATGAAGCCATAGGCATTTGTTATT
+TTGGCTCTGTCAGCTGCATAACGCCAAAAAATATATTTATCTGCTTGATCTTCAAATGTT
+GTATTGATTAAATCAATTGGATGGAATTGTTTATCATAAAAAATTAATGTTTGAATGTGA
+TAACCGTCCTTTAAAAAAGTCGTTTCTGCAAGCTTGGCTGTATAGTCAACTAACTCTTCT
+GTCGAAGTGATATTTTTAGGCTTATCTACCAGTTTTAGACGCTCTTTAATATCTTCAGGA
+ATTATTTTATTGTCATATTGTATCATGCTAAATGACAATTTGCTTATGGAGTAATCTTTT
+AATTTTAAATAAGTTATTCTCCTGGCTTCATCAAATAAAGAGTCGAATGATGTTGGCGAA
+ATCACATCGTCACCCATTGGATTGTTTATTTGTATGCCAAGAGAGTTACAGCAGTTATAC
+ATTCTGCCATAGATTATAGCTAAGGCATGTAATAATTCGTAATCTTTTAGCGTATTAGCG
+ACCCATCGTCTTTCTGATTTAATAATAGATGATTCAGTTAAATATGAAGGTAATTTCTTT
+TGTGCAAGTCTGACTAACTTTTTTATACCAATGTTTAACATACTTTCATTTGTAATAAAC
+TCAATGTCATTTTCTTCAATGTAAGATGAAATAAGAGTAGCCTTTGCCTCGCTATACATT
+TCTAAATCGCCTTGTTTTTCTATCGTATTGCGAGAATTTTTAGCCCAAGCCATTAATGGA
+TCATTTTTCCATTTTTCAATAACATTATTGTTATACCAAATGTCATATCCTATAATCTGG
+TTTTTGTTTTTTTGAATAATAAATGTTACTGTTCTTGCGGTTTGGAGGAATTGATTCAAA
+TTCAAGCGAAATAATTCAGGGTCAAAATATGTATCAATGCAGCATTTGAGCAAGTGCGAT
+AAATCTTTAAGTCTTCTTTCCCATGGTTTTTTAGTCATAAAACTCTCCATTTTGATAGGT
+TGCATGCTAGATGCTGATATATTTTAGAGGTGATAAAATTAACTGCTTAACTGTCAATGT
+AATACAAGTTGTTTGATCTTTGCAATGATTCTTATCAGAAACCATATAGTAAATTAGTTA
+CACAGGAAATTTTTAATATTATTATTATCATTCATTATGTATTAAAATTAGAGTTGTGGC
+TTGGCTCTGCTAACACGTTGCTCATAGGAGATATGGTAGAGCCGCAGACACGTCGTATGC
+AGGAACGTGCTGCGGCTGGCTGGTGAACTTCCGATAGTGCGGGTGTTGAATGATTTCCAG
+TTGCTACCGATTTTACATATTTTTTGCATGAGAGAATTTGTACCACCTCCCACCGACCAT
+CTATGACTGTACGCCACTGTCCCTAGGACTGCTATGTGCCGGAGCGGACATTACAAACGT
+CCTTCTCGGTGCATGCCACTGTTGCCAATGACCTGCCTAGGAATTGGTTAGCAAGTTACT
+ACCGGATTTTGTAAAAACAGCCCTCCTCATATAAAAAGTATTCGTTCACTTCCGATAAGC
+GTCGTAATTTTCTATCTTTCATCATATTCTAGATCCCTCTGAAAAAATCTTCCGAGTTTG
+CTAGGCACTGATACATAACTCTTTTCCAATAATTGGGGAAGTCATTCAAATCTATAATAG
+GTTTCAGATTTGCTTCAATAAATTCTGACTGTAGCTGCTGAAACGTTGCGGTTGAACTAT
+ATTTCCTTATAACTTTTACGAAAGAGTTTCTTTGAGTAATCACTTCACTCAAGTGCTTCC
+CTGCCTCCAAACGATACCTGTTAGCAATATTTAATAGCTTGAAATGATGAAGAGCTCTGT
+GTTTGTCTTCCTGCCTCCAGTTCGCCGGGCATTCAACATAAAAACTGATAGCACCCGGAG
+TTCCGGAAACGAAATTTGCATATACCCATTGCTCACGAAAAAAAATGTCCTTGTCGATAT
+AGGGATGAATCGCTTGGTGTACCTCATCTACTGCGAAAACTTGACCTTTCTCTCCCATAT
+TGCAGTCGCGGCACGATGGAACTAAATTAATAGGCATCACCGAAAATTCAGGATAATGTG
+CAATAGGAAGAAAATGATCTATATTTTTTGTCTGTCCTATATCACCACAAAATGGACATT
+TTTCACCTGATGAAACAAGCATGTCATCGTAATATGTTCTAGCGGGTTTGTTTTTATCTC
+GGAGATTATTTTCATAAAGCTTTTCTAATTTAACCTTTGTCAGGTTACCAACTACTAAGG
+TTGTAGGCTCAAGAGGGTGTGTCCTGTCGTAGGTAAATAACTGACCTGTCGAGCTTAATA
+TTCTATATTGTTGTTCTTTCTGCAAAAAAGTGGGGAAGTGAGTAATGAAATTATTTCTAA
+CATTTATCTGCATCATACCTTCCGAGCATTTATTAAGCATTTCGCTATAAGTTCTCGCTG
+GAAGAGGTAGTTTTTTCATTGTACTTTACCTTCATCTCTGTTCATTATCATCGCTTTTAA
+AACGGTTCGACCTTCTAATCCTATCTGACCATTATAATTTTTTAGAATGGTTTCATAAGA
+AAGCTCTGAATCAACGGACTGCGATAATAAGTGGTGGTATCCAGAATTTGTCACTTCAAG
+TAAAAACACCTCACGAGTTAAAACACCTAAGTTCTCACCGAATGTCTCAATATCCGGACG
+GATAATATTTATTGCTTCTCTTGACCGTAGGACTTTCCACATGCAGGATTTTGGAACCTC
+TTGCAGTACTACTGGGGAATGAGTTGCAATTATTGCTACACCATTGCGTGCATCGAGTAA
+GTCGCTTAATGTTCGTAAAAAAGCAGAGAGCAAAGGTGGATGCAGATGAACCTCTGGTTC
+ATCGAATAAAACTAATGACTTTTCGCCAACGACATCTACTAATCTTGTGATAGTAAATAA
+AACAATTGCATGTCCAGAGCTCATTCGAAGCAGATATTTCTGGATATTGTCATAAAACAA
+TTTAGTGAATTTATCATCGTCCACTTGAATCTGTGGTTCATTACGTCTTAACTCTTCATA
+TTTAGAAATGAGGCTGATGAGTTCCATATTTGAAAAGTTTTCATCACTACTTAGTTTTTT
+GATAGCTTCAAGCCAGAGTTGTCTTTTTCTATCTACTCTCATACAACCAATAAATGCTGA
+AATGAATTCTAAGCGGAGATCGCCTAGTGATTTTAAACTATTGCTGGCAGCATTCTTGAG
+TCCAATATAAAAGTATTGTGTACCTTTTGCTGGGTCAGGTTGTTCTTTAGGAGGAGTAAA
+AGGATCAAATGCACTAAACGAAACTGAAACAAGCGATCGAAAATATCCCTTTGGGATTCT
+TGACTCGATAAGTCTATTATTTTCAGAGAAAAAATATTCATTGTTTTCTGGGTTGGTGAT
+TGCACCAATCATTCCATTCAAAATTGTTGTTTTACCACACCCATTCCGCCCGATAAAAGC
+ATGAATGTTCGTGCTGGGCATAGAATTAACCGTCACCTCAAAAGGTATAGTTAAATCACT
+GAATCCGGGAGCACTTTTTCTATTAAATGAAAAGTGGAAATCTGACAATTCTGGCAAACC
+ATTTAACACACGTGCGAACTGTCCATGAATTTCTGAAAGAGTTACCCCTCTAAGTAATGA
+GGTGTTAAGGACGCTTTCATTTTCAATGTCGGCTAATCGATTTGGCCATACTACTAAATC
+CTGAATAGCTTTAAGAAGGTTATGTTTAAAACCATCGCTTAATTTGCTGAGATTAACATA
+GTAGTCAATGCTTTCACCTAAGGAAAAAAACATTTCAGGGAGTTGACTGAATTTTTTATC
+TATTAATGAATAAGTGCTTACTTCTTCTTTTTGACCTACAAAACCAATTTTAACATTTCC
+GATATCGCATTTTTCACCATGCTCATCAAAGACAGTAAGATAAAACATTGTAACAAAGGA
+ATAGTCATTCCAACCATCTGCTCGTAGGAATGCCTTATTTTTTTCTACTGCAGGAATATA
+CCCGCCTCTTTCAATAACACTAAACTCCAACATATAGTAACCCTTAATTTTATTAAAATA
+ACCGCAATTTATTTGGCGGCAACACAGGATCTCTCTTTTAAGTTACTCTCTATTACATAC
+GTTTTCCATCTAAAAATTAGTAGTATTGAACTTAACGGGGCATCGTATTGTAGTTTTCCA
+TATTTAGCTTTCTGCTTCCTTTTGGATAACCCACTGTTATTCATGTTGCATGGTGCACTG
+TTTATACCAACGATATAGTCTATTAATGCATATATAGTATCGCCGAACGATTAGCTCTTC
+AGGCTTCTGAAGAAGCGTTTCAAGTACTAATAAGCCGATAGATAGCCACGGACTTCGTAG
+CCATTTTTCATAAGTGTTAACTTCCGCTCCTCGCTCATAACAGACATTCACTACAGTTAT
+GGCGGAAAGGTATGCATGCTGGGTGTGGGGAAGTCGTGAAAGAAAAGAAGTCAGCTGCGT
+CGTTTGACATCACTGCTATCTTCTTACTGGTTATGCAGGTCGTAGTGGGTGGCACACAAA
+GCTTTGCACTGGATTGCGAGGCTTTGTGCTTCTCTGGAGTGCGACAGGTTTGATGACAAA
+AAATTAGCGCAAGAAGACAAAAATCACCTTGCGCTAATGCTCTGTTACAGGTCACTAATA
+CCATCTAAGTAGTTGATTCATAGTGACTGCATATGTTGTGTTTTACAGTATTATGTAGTC
+TGTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGT
+TCAGCTTTTTTATACTAAGTTGGCATTATAAAAAAGCATTGCTTATCAATTTGTTGCAAC
+GAACAGGTCACTATCAGTCAAAATAAAATCATTATTTGATTTCAATTTTGTCCCACTCCC
+TGCCTCTGTCATCACGATACTGTGATGCCATGGTGTCCGACTTATGCCCGAGAAGATGTT
+GAGCAAACTTATCGCTTATCTGCTTCTCATAGAGTCTTGCAGACAAACTGCGCAACTCGT
+GAAAGGTAGGCGGATCCCCTTCGAAGGAAAGACCTGATGCTTTTCGTGCGCGCATAAAAT
+ACCTTGATACTGTGCCGGATGAAAGCGGTTCGCGACGAGTAGATGCAATTATGGTTTCTC
+CGCCAAGAATCTCTTTGCATTTATCAAGTGTTTCCTTCATTGATATTCCGAGAGCATCAA
+TATGCAATGCTGTTGGGATGGCAATTTTTACGCCTGTTTTGCTTTGCTCGACATAAAGAT
+ATCCATCTACGATATCAGACCACTTCATTTCGCATAAATCACCAACTCGTTGCCCGGTAA
+CAACAGCCAGTTCCATTGCAAGTCTGAGCCAACATGGTGATGATTCTGCTGCTTGATAAA
+TTTTCAGGTATTCGTCAGCCGTAAGTCTTGATCTCCTTACCTCTGATTTTGCTGCGCGAG
+TGGCAGCGACATGGTTTGTTGTTATATGGCCTTCAGCTATTGCCTCTCGGAATGCATCGC
+TCAGTGTTGATCTGATTAACTTGGCTGACGCCGCCTTGCCCTCGTCTATGTATCCATTGA
+GCATTGCCGCAATTTCTTTTGTGGTGATGTCTTCAAGTGGAGCATCAGGCAGACCCCTCC
+TTATTGCTTTAATTTTGCTCATGTAATTTATGAGTGTCTTCTGCTTGATTCCTCTGCTGG
+CCAGGATTTTTTCGTAGCGATCAAGCCATGAATGTAACGTAACGGAATTATCACTGTTGA
+TTCTCGCTGTCAGAGGCTTGTGTTTGTGTCCTGAAAATAACTCAATGTTGGCCTGTATAG
+CTTCAGTGATTGCGATTCGCCTGTCTCTGCCTAATCCAAACTCTTTACCCGTCCTTGGGT
+CCCTGTAGCAGTAATATCCATTGTTTCTTATATAAAGGTTAGGGGGTAAATCCCGGCGCT
+CATGACTTCGCCTTCTTCCCATTTCTGATCCTCTTCAAAAGGCCACCTGTTACTGGTCGA
+TTTAAGTCAACCTTTACCGCTGATTCGTGGAACAGATACTCTCTTCCATCCTTAACCGGA
+GGTGGGAATATCCTGCATTCCCGAACCCATCGACGAACTGTTTCAAGGCTTCTTGGACGT
+CGCTGGCGTGCGTTCCACTCCTGAAGTGTCAAGTACATCGCAAAGTCTCCGCAATTACAC
+GCAAGAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTG
+GTTACGTCTGCATGTGCTATCTGCGCCCATATCATCCAGTGGTCGTAGCAGTCGTTGATG
+TTCTCCGCTTCGATAACTCTGTTGAATGGCTCTCCATTCCATTCTCCTGTGACTCGGAAG
+TGCATTTATCATCTCCATAAAACAAAACCCGCCGTAGCGAGTTCAGATAAAATAAATCCC
+CGCGAGTGCGAGGATTGTTATGTAATATTGGGTTTAATCATCTATATGTTTTGTACAGAG
+AGGGCAAGTATCGTTTCCACCGTACTCGTGATAATAATTTTGCACGGTATCAGTCATTTC
+TCGCACATTGCAGAATGGGGATTTGTCTTCATTAGACTTATAAACCTTCATGGAATATTT
+GTATGCCGACTCTATATCTATACCTTCATCTACATAAACACCTTCGTGATGTCTGCATGG
+AGACAAGACACCGGATCTGCACAACATTGATAACGCCCAATCTTTTTGCTCAGACTCTAA
+CTCATTGATACTCATTTATAAACTCCTTGCAATGTATGTCGTTTCAGCTAAACGGTATCA
+GCAATGTTTATGTAAAGAAACAGTAAGATAATACTCAACCCGATGTTTGAGTACGGTCAT
+CATCTGACACTACAGACTCTGGCATCGCTGTGAAGACGACGCGAAATTCAGCATTTTCAC
+AAGCGTTATCTTTTACAAAACCGATCTCACTCTCCTTTGATGCGAATGCCAGCGTCAGAC
+ATCATATGCAGATACTCACCTGCATCCTGAACCCATTGACCTCCAACCCCGTAATAGCGA
+TGCGTAATGATGTCGATAGTTACTAACGGGTCTTGTTCGATTAACTGCCGCAGAAACTCT
+TCCAGGTCACCAGTGCAGTGCTTGATAACAGGAGTCTTCCCAGGATGGCGAACAACAAGA
+AACTGGTTTCCGTCTTCACGGACTTCGTTGCTTTCCAGTTTAGCAATACGCTTACTCCCA
+TCCGAGATAACACCTTCGTAATACTCACGCTGCTCGTTGAGTTTTGATTTTGCTGTTTCA
+AGCTCAACACGCAGTTTCCCTACTGTTAGCGCAATATCCTCGTTCTCCTGGTCGCGGCGT
+TTGATGTATTGCTGGTTTCTTTCCCGTTCATCCAGCAGTTCCAGCACAATCGATGGTGTT
+ACCAATTCATGGAAAAGGTCTGCGTCAAATCCCCAGTCGTCATGCATTGCCTGCTCTGCC
+GCTTCACGCAGTGCCTGAGAGTTAATTTCGCTCACTTCGAACCTCTCTGTTTACTGATAA
+GTTCCAGATCCTCCTGGCAACTTGCACAAGTCCGACAACCCTGAACGACCAGGCGTCTTC
+GTTCATCTATCGGATCGCCACACTCACAACAATGAGTGGCAGATATAGCCTGGTGGTTCA
+GGCGGCGCATTTTTATTGCTGTGTTGCGCTGTAATTCTTCTATTTCTGATGCTGAATCAA
+TGATGTCTGCCATCTTTCATTAATCCCTGAACTGTTGGTTAATACGCTTGAGGGTGAATG
+CGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGCTTTTCATGTTCATCGTTC
+CTTAAAGACGCCGTTTAACATGCCGATTGCCAGGCTTAAATGAGTCGGTGTGAATCCCAT
+CAGCGTTACCGTTTCGCGGTGCTTCTTCAGTACGCTACGGCAAATGTCATCGACGTTTTT
+ATCCGGAAACTGCTGTCTGGCTTTTTTTGATTTCAGAATTAGCCTGACGGGCAATGCTGC
+GAAGGGCGTTTTCCTGCTGAGGTGTCATTGAACAAGTCCCATGTCGGCAAGCATAAGCAC
+ACAGAATATGAAGCCCGCTGCCAGAAAAATGCATTCCGTGGTTGTCATACCTGGTTTCTC
+TCATCTGCTTCTGCTTTCGCCACCATCATTTCCAGCTTTTGTGAAAGGGATGCGGCTAAC
+GTATGAAATTCTTCGTCTGTTTCTACTGGTATTGGCACAAACCTGATTCCAATTTGAGCA
+AGGCTATGTGCCATCTCGATACTCGTTCTTAACTCAACAGAAGATGCTTTGTGCATACAG
+CCCCTCGTTTATTATTTATCTCCTCAGCCAGCCGCTGTGCTTTCAGTGGATTTCGGATAA
+CAGAAAGGCCGGGAAATACCCAGCCTCGCTTTGTAACGGAGTAGACGAAAGTGATTGCGC
+CTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAAT
+TTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACGATCTCGTCAAAACTCGC
+CATGTACTTTTCATCCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATACGCGG
+GTCATAGTTGGCAAAGTACCAGGCATTTTTTCGCGTCACCCACATGCTGTACTGCACCTG
+GGCCATGTAAGCTGACTTTATGGCCTCGAAACCACCGAGCCGGAACTTCATGAAATCCCG
+GGAGGTAAACGGGCATTTCAGTTCAAGGCCGTTGCCGTCACTGCATAAACCATCGGGAGA
+GCAGGCGGTACGCATACTTTCGTCGCGATAGATGATCGGGGATTCAGTAACATTCACGCC
+GGAAGTGAATTCAAACAGGGTTCTGGCGTCGTTCTCGTACTGTTTTCCCCAGGCCAGTGC
+TTTAGCGTTAACTTCCGGAGCCACACCGGTGCAAACCTCAGCAAGCAGGGTGTGGAAGTA
+GGACATTTTCATGTCAGGCCACTTCTTTCCGGAGCGGGGTTTTGCTATCACGTTGTGAAC
+TTCTGAAGCGGTGATGACGCCGAGCCGTAATTTGTGCCACGCATCATCCCCCTGTTCGAC
+AGCTCTCACATCGATCCCGGTACGCTGCAGGATAATGTCCGGTGTCATGCTGCCACCTTC
+TGCTCTGCGGCTTTCTGTTTCAGGAATCCAAGAGCTTTTACTGCTTCGGCCTGTGTCAGT
+TCTGACGATGCACGAATGTCGCGGCGAAATATCTGGGAACAGAGCGGCAATAAGTCGTCA
+TCCCATGTTTTATCCAGGGCGATCAGCAGAGTGTTAATCTCCTGCATGGTTTCATCGTTA
+ACCGGAGTGATGTCGCGTTCCGGCTGACGTTCTGCAGTGTATGCAGTATTTTCGACAATG
+CGCTCGGCTTCATCCTTGTCATAGATACCAGCAAATCCGAAGGCCAGACGGGCACACTGA
+ATCATGGCTTTATGACGTAACATCCGTTTGGGATGCGACTGCCACGGCCCCGTGATTTCT
+CTGCCTTCGCGAGTTTTGAATGGTTCGCGGCGGCATTCATCCATCCATTCGGTAACGCAG
+ATCGGATGATTACGGTCCTTGCGGTAAATCCGGCATGTACAGGATTCATTGTCCTGCTCA
+AAGTCCATGCCATCAAACTGCTGGTTTTCATTGATGATGCGGGACCAGCCATCAACGCCC
+ACCACCGGAACGATGCCATTCTGCTTATCAGGAAAGGCGTAAATTTCTTTCGTCCACGGA
+TTAAGGCCGTACTGGTTGGCAACGATCAGTAATGCGATGAACTGCGCATCGCTGGCATCA
+CCTTTAAATGCCGTCTGGCGAAGAGTGGTGATCAGTTCCTGTGGGTCGACAGAATCCATG
+CCGACACGTTCAGCCAGCTTCCCAGCCAGCGTTGCGAGTGCAGTACTCATTCGTTTTATA
+CCTCTGAATCAATATCAACCTGGTGGTGAGCAATGGTTTCAACCATGTACCGGATGTGTT
+CTGCCATGCGCTCCTGAAACTCAACATCGTCATCAAACGCACGGGTAATGGATTTTTTGC
+TGGCCCCGTGGCGTTGCAAATGATCGATGCATAGCGATTCAAACAGGTGCTGGGGCAGGC
+CTTTTTCCATGTCGTCTGCCAGTTCTGCCTCTTTCTCTTCACGGGCGAGCTGCTGGTAGT
+GACGCGCCCAGCTCTGAGCCTCAAGACGATCCTGAATGTAATAAGCGTTCATGGCTGAAC
+TCCTGAAATAGCTGTGAAAATATCGCCCGCGAAATGCCGGGCTGATTAGGAAAACAGGAA
+AGGGGGTTAGTGAATGCTTTTGCTTGATCTCAGTTTCAGTATTAATATCCATTTTTTATA
+AGCGTCGACGGCTTCACGAAACATCTTTTCATCGCCAATAAAAGTGGCGATAGTGAATTT
+AGTCTGGATAGCCATAAGTGTTTGATCCATTCTTTGGGACTCCTGGCTGATTAAGTATGT
+CGATAAGGCGTTTCCATCCGTCACGTAATTTACGGGTGATTCGTTCAAGTAAAGATTCGG
+AAGGGCAGCCAGCAACAGGCCACCCTGCAATGGCATATTGCATGGTGTGCTCCTTATTTA
+TACATAACGAAAAACGCCTCGAGTGAAGCGTTATTGGTATGCGGTAAAACCGCACTCAGG
+CGGCCTTGATAGTCATATCATCTGAATCAAATATTCCTGATGTATCGATATCGGTAATTC
+TTATTCCTTCGCTACCATCCATTGGAGGCCATCCTTCCTGACCATTTCCATCATTCCAGT
+CGAACTCACACACAACACCATATGCATTTAAGTCGCTTGAAATTGCTATAAGCAGAGCAT
+GTTGCGCCAGCATGATTAATACAGCATTTAATACAGAGCCGTGTTTATTGAGTCGGTATT
+CAGAGTCTGACCAGAAATTATTAATCTGGTGAAGTTTTTCCTCTGTCATTACGTCATGGT
+CGATTTCAATTTCTATTGATGCTTTCCAGTCGTAATCAATGATGTATTTTTTGATGTTTG
+ACATCTGTTCATATCCTCACAGATAAAAAATCGCCCTCACACTGGAGGGCAAAGAAGATT
+TCCAATAATCAGAACAAGTCGGCTCCTGTTTAGTTACGAGCGACATTGCTCCGTGTATTC
+ACTCGTTGGAATGAATACACAGTGCAGTGTTTATTCTGTTATTTATGCCAAAAATAAAGG
+CCACTATCAGGCAGCTTTGTTGTTCTGTTTACCAAGTTCTCTGGCAATCATTGCCGTCGT
+TCGTATTGCCCATTTATCGACATATTTCCCATCTTCCATTACAGGAAACATTTCTTCAGG
+CTTAACCATGCATTCCGATTGCAGCTTGCATCCATTGCATCGCTTGAATTGTCCACACCA
+TTGATTTTTATCAATAGTCGTAGTCATACGGATAGTCCTGGTATTGTTCCATCACATCCT
+GAGGATGCTCTTCGAACTCTTCAAATTCTTCTTCCATATATCACCTTAAATAGTGGATTG
+CGGTAGTAAAGATTGTGCCTGTCTTTTAACCACATCAGGCTCGGTGGTTCTCGTGTACCC
+CTACAGCGAGAAATCGGATAAACTATTACAACCCCTACAGTTTGATGAGTATAGAAATGG
+ATCCACTCGTTATTCTCGGACGAGTGTTCAGTAATGAACCTCTGGAGAGAACCATGTATA
+TGATCGTTATCTGGGTTGGACTTCTGCTTTTAAGCCCAGATAACTGGCCTGAATATGTTA
+ATGAGAGAATCGGTATTCCTCATGTGTGGCATGTTTTCGTCTTTGCTCTTGCATTTTCGC
+TAGCAATTAATGTGCATCGATTATCAGCTATTGCCAGCGCCAGATATAAGCGATTTAAGC
+TAAGAAAACGCATTAAGATGCAAAACGATAAAGTGCGATCAGTAATTCAAAACCTTACAG
+AAGAGCAATCTATGGTTTTGTGCGCAGCCCTTAATGAAGGCAGGAAGTATGTGGTTACAT
+CAAAACAATTCCCATACATTAGTGAGTTGATTGAGCTTGGTGTGTTGAACAAAACTTTTT
+CCCGATGGAATGGAAAGCATATATTATTCCCTATTGAGGATATTTACTGGACTGAATTAG
+TTGCCAGCTATGATCCATATAATATTGAGATAAAGCCAAGGCCAATATCTAAGTAACTAG
+ATAAGAGGAATCGATTTTCCCTTAATTTTCTGGCGTCCACTGCATGTTATGCCGCGTTCG
+CCAGGCTTGCTGTACCATGTGCGCTGATTCTTGCGCTCAATACGTTGCAGGTTGCTTTCA
+ATCTGTTTGTGGTATTCAGCCAGCACTGTAAGGTCTATCGGATTTAGTGCGCTTTCTACT
+CGTGATTTCGGTTTGCGATTCAGCGAGAGAATAGGGCGGTTAACTGGTTTTGCGCTTACC
+CCAACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGG
+CGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAG
+TTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGCA
+CTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGC
+CCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTG
+ATGTGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCAC
+CGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTT
+GGTAGGTGAGAGATCTGAATTGCTATGTTTAGTGAGTTGTATCTATTTATTTTTCAATAA
+ATACAATTGGTTATGTGTTTTGGGGGCGATCGTGAGGCAAAGAAAACCCGGCGCTGAGGC
+CGGGTTATTCTTGTTCTCTGGTCAAATTATATAGTTGGAAAACAAGGATGCATATATGAA
+TGAACGATGCAGAGGCAATGCCGATGGCGATAGTGGGTATCATGTAGCCGCTTATGCTGG
+AAAGAAGCAATAACCCGCAGAAAAACAAAGCTCCAAGCTCAACAAAACTAAGGGCATAGA
+CAATAACTACCGATGTCATATACCCATACTCTCTAATCTTGGCCAGTCGGCGCGTTCTGC
+TTCCGATTAGAAACGTCAAGGCAGCAATCAGGATTGCAATCATGGTTCCTGCATATGATG
+ACAATGTCGCCCCAAGACCATCTCTATGAGCTGAAAAAGAAACACCAGGAATGTAGTGGC
+GGAAAAGGAGATAGCAAATGCTTACGATAACGTAAGGAATTATTACTATGTAAACACCAG
+GCATGATTCTGTTCCGCATAATTACTCCTGATAATTAATCCTTAACTTTGCCCACCTGCC
+TTTTAAAACATTCCAGTATATCACTTTTCATTCTTGCGTAGCAATATGCCATCTCTTCAG
+CTATCTCAGCATTGGTGACCTTGTTCAGAGGCGCTGAGAGATGGCCTTTTTCTGATAGAT
+AATGTTCTGTTAAAATATCTCCGGCCTCATCTTTTGCCCGCAGGCTAATGTCTGAAAATT
+GAGGTGACGGGTTAAAAATAATATCCTTGGCAACCTTTTTTATATCCCTTTTAAATTTTG
+GCTTAATGACTATATCCAATGAGTCAAAAAGCTCCCCTTCAATATCTGTTGCCCCTAAGA
+CCTTTAATATATCGCCAAATACAGGTAGCTTGGCTTCTACCTTCACCGTTGTTCGGCCGA
+TGAAATGCATATGCATAACATCGTCTTTGGTGGTTCCCCTCATCAGTGGCTCTATCTGAA
+CGCGCTCTCCACTGCTTAATGACATTCCTTTCCCGATTAAAAAATCTGTCAGATCGGATG
+TGGTCGGCCCGAAAACAGTTCTGGCAAAACCAATGGTGTCGCCTTCAACAAACAAAAAAG
+ATGGGAATCCCAATGATTCGTCATCTGCGAGGCTGTTCTTAATATCTTCAACTGAAGCTT
+TAGAGCGATTTATCTTCTGAACCAGACTCTTGTCATTTGTTTTGGTAAAGAGAAAAGTTT
+TTCCATCGATTTTATGAATATACAAATAATTGGAGCCAACCTGCAGGTGATGATTATCAG
+CCAGCAGAGAATTAAGGAAAACAGACAGGTTTATTGAGCGCTTATCTTTCCCTTTATTTT
+TGCTGCGGTAAGTCGCATAAAAACCATTCTTCATAATTCAATCCATTTACTATGTTATGT
+TCTGAGGGGAGTGAAAATTCCCCTAATTCGATGAAGATTCTTGCTCAATTGTTATCAGCT
+ATGCGCCGACCAGAACACCTTGCCGATCAGCCAAACGTCTCTTCAGGCCACTGACTAGCG
+ATAACTTTCCCCACAACGGAACAACTCTCATTGCATGGGATCATTGGGTACTGTGGGTTT
+AGTGGTTGTAAAAACACCTGACCGCTATCCCTGATCAGTTTCTTGAAGGTAAACTCATCA
+CCCCCAAGTCTGGCTATGCAGAAATCACCTGGCTCAACAGCCTGCTCAGGGTCAACGAGA
+ATTAACATTCCGTCAGGAAAGCTTGGCTTGGAGCCTGTTGGTGCGGTCATGGAATTACCT
+TCAACCTCAAGCCAGAATGCAGAATCACTGGCTTTTTTGGTTGTGCTTACCCATCTCTCC
+GCATCACCTTTGGTAAAGGTTCTAAGCTTAGGTGAGAACATCCCTGCCTGAACATGAGAA
+AAAACAGGGTACTCATACTCACTTCTAAGTGACGGCTGCATACTAACCGCTTCATACATC
+TCGTAGATTTCTCTGGCGATTGAAGGGCTAAATTCTTCAACGCTAACTTTGAGAATTTTT
+GTAAGCAATGCGGCGTTATAAGCATTTAATGCATTGATGCCATTAAATAAAGCACCAACG
+CCTGACTGCCCCATCCCCATCTTGTCTGCGACAGATTCCTGGGATAAGCCAAGTTCATTT
+TTCTTTTTTTCATAAATTGCTTTAAGGCGACGTGCGTCCTCAAGCTGCTCTTGTGTTAAT
+GGTTTCTTTTTTGTGCTCATACGTTAAATCTATCACCGCAAGGGATAAATATCTAACACC
+GTGCGTGTTGACTATTTTACCTCTGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGT
+ATGGAACAACGCATAACCCTGAAAGATTATGCAATGCGCTTTGGGCAAACCAAGACAGCT
+AAAGATCTCGGCGTATATCAAAGCGCGATCAACAAGGCCATTCATGCAGGCCGAAAGATT
+TTTTTAACTATAAACGCTGATGGAAGCGTTTATGCGGAAGAGGTAAAGCCCTTCCCGAGT
+AACAAAAAAACAACAGCATAAATAACCCCGCTCTTACACATTCCAGCCCTGAAAAAGGGC
+ATCAAATTAAACCACACCTATGGTGTATGCATTTATTTGCATACATTCAATCAATTGTTA
+TCTAAGGAAATACTTACATATGGTTCGTGCAAACAAACGCAACGAGGCTCTACGAATCGA
+GAGTGCGTTGCTTAACAAAATCGCAATGCTTGGAACTGAGAAGACAGCGGAAGCTGTGGG
+CGTTGATAAGTCGCAGATCAGCAGGTGGAAGAGGGACTGGATTCCAAAGTTCTCAATGCT
+GCTTGCTGTTCTTGAATGGGGGGTCGTTGACGACGACATGGCTCGATTGGCGCGACAAGT
+TGCTGCGATTCTCACCAATAAAAAACGCCCGGCGGCAACCGAGCGTTCTGAACAAATCCA
+GATGGAGTTCTGAGGTCATTACTGGATCTATCAACAGGAGTCATTATGACAAATACAGCA
+AAAATACTCAACTTCGGCAGAGGTAACTTTGCCGGACAGGAGCGTAATGTGGCAGATCTC
+GATGATGGTTACGCCAGACTATCAAATATGCTGCTTGAGGCTTATTCGGGCGCAGATCTG
+ACCAAGCGACAGTTTAAAGTGCTGCTTGCCATTCTGCGTAAAACCTATGGGTGGAATAAA
+CCAATGGACAGAATCACCGATTCTCAACTTAGCGAGATTACAAAGTTACCTGTCAAACGG
+TGCAATGAAGCCAAGTTAGAACTCGTCAGAATGAATATTATCAAGCAGCAAGGCGGCATG
+TTTGGACCAAATAAAAACATCTCAGAATGGTGCATCCCTCAAAACGAGGGAAAATCCCCT
+AAAACGAGGGATAAAACATCCCTCAAATTGGGGGATTGCTATCCCTCAAAACAGGGGGAC
+ACAAAAGACACTATTACAAAAGAAAAAAGAAAAGATTATTCGTCAGAGAATTCTGGCGAA
+TCCTCTGACCAGCCAGAAAACGACCTTTCTGTGGTGAAACCGGATGCTGCAATTCAGAGC
+GGCAGCAAGTGGGGGACAGCAGAAGACCTGACCGCCGCAGAGTGGATGTTTGACATGGTG
+AAGACTATCGCACCATCAGCCAGAAAACCGAATTTTGCTGGGTGGGCTAACGATATCCGC
+CTGATGCGTGAACGTGACGGACGTAACCACCGCGACATGTGTGTGCTGTTCCGCTGGGCA
+TGCCAGGACAACTTCTGGTCCGGTAACGTGCTGAGCCCGGCCAAACTCCGCGATAAGTGG
+ACCCAACTCGAAATCAACCGTAACAAGCAACAGGCAGGCGTGACAGCCAGCAAACCAAAA
+CTCGACCTGACAAACACAGACTGGATTTACGGGGTGGATCTATGAAAAACATCGCCGCAC
+AGATGGTTAACTTTGACCGTGAGCAGATGCGTCGGATCGCCAACAACATGCCGGAACAGT
+ACGACGAAAAGCCGCAGGTACAGCAGGTAGCGCAGATCATCAACGGTGTGTTCAGCCAGT
+TACTGGCAACTTTCCCGGCGAGCCTGGCTAACCGTGACCAGAACGAAGTGAACGAAATCC
+GTCGCCAGTGGGTTCTGGCTTTTCGGGAAAACGGGATCACCACGATGGAACAGGTTAACG
+CAGGAATGCGCGTAGCCCGTCGGCAGAATCGACCATTTCTGCCATCACCCGGGCAGTTTG
+TTGCATGGTGCCGGGAAGAAGCATCCGTTACCGCCGGACTGCCAAACGTCAGCGAGCTGG
+TTGATATGGTTTACGAGTATTGCCGGAAGCGAGGCCTGTATCCGGATGCGGAGTCTTATC
+CGTGGAAATCAAACGCGCACTACTGGCTGGTTACCAACCTGTATCAGAACATGCGGGCCA
+ATGCGCTTACTGATGCGGAATTACGCCGTAAGGCCGCAGATGAGCTTGTCCATATGACTG
+CGAGAATTAACCGTGGTGAGGCGATCCCTGAACCAGTAAAACAACTTCCTGTCATGGGCG
+GTAGACCTCTAAATCGTGCACAGGCTCTGGCGAAGATCGCAGAAATCAAAGCTAAGTTCG
+GACTGAAAGGAGCAAGTGTATGACGGGCAAAGAGGCAATTATTCATTACCTGGGGACGCA
+TAATAGCTTCTGTGCGCCGGACGTTGCCGCGCTAACAGGCGCAACAGTAACCAGCATAAA
+TCAGGCCGCGGCTAAAATGGCACGGGCAGGTCTTCTGGTTATCGAAGGTAAGGTCTGGCG
+AACGGTGTATTACCGGTTTGCTACCAGGGAAGAACGGGAAGGAAAGATGAGCACGAACCT
+GGTTTTTAAGGAGTGTCGCCAGAGTGCCGCGATGAAACGGGTATTGGCGGTATATGGAGT
+TAAAAGATGACCATCTACATTACTGAGCTAATAACAGGCCTGCTGGTAATCGCAGGCCTT
+TTTATTTGGGGGAGAGGGAAGTCATGAAAAAACTAACCTTTGAAATTCGATCTCCAGCAC
+ATCAGCAAAACGCTATTCACGCAGTACAGCAAATCCTTCCAGACCCAACCAAACCAATCG
+TAGTAACCATTCAGGAACGCAACCGCAGCTTAGACCAAAACAGGAAGCTATGGGCCTGCT
+TAGGTGACGTCTCTCGTCAGGTTGAATGGCATGGTCGCTGGCTGGATGCAGAAAGCTGGA
+AGTGTGTGTTTACCGCAGCATTAAAGCAGCAGGATGTTGTTCCTAACCTTGCCGGGAATG
+GCTTTGTGGTAATAGGCCAGTCAACCAGCAGGATGCGTGTAGGCGAATTTGCGGAGCTAT
+TAGAGCTTATACAGGCATTCGGTACAGAGCGTGGCGTTAAGTGGTCAGACGAAGCGAGAC
+TGGCTCTGGAGTGGAAAGCGAGATGGGGAGACAGGGCTGCATGATAAATGTCGTTAGTTT
+CTCCGGTGGCAGGACGTCAGCATATTTGCTCTGGCTAATGGAGCAAAAGCGACGGGCAGG
+TAAAGACGTGCATTACGTTTTCATGGATACAGGTTGTGAACATCCAATGACATATCGGTT
+TGTCAGGGAAGTTGTGAAGTTCTGGGATATACCGCTCACCGTATTGCAGGTTGATATCAA
+CCCGGAGCTTGGACAGCCAAATGGTTATACGGTATGGGAACCAAAGGATATTCAGACGCG
+AATGCCTGTTCTGAAGCCATTTATCGATATGGTAAAGAAATATGGCACTCCATACGTCGG
+CGGCGCGTTCTGCACTGACAGATTAAAACTCGTTCCCTTCACCAAATACTGTGATGACCA
+TTTCGGGCGAGGGAATTACACCACGTGGATTGGCATCAGAGCTGATGAACCGAAGCGGCT
+AAAGCCAAAGCCTGGAATCAGATATCTTGCTGAACTGTCAGACTTTGAGAAGGAAGATAT
+CCTCGCATGGTGGAAGCAACAACCATTCGATTTGCAAATACCGGAACATCTCGGTAACTG
+CATATTCTGCATTAAAAAATCAACGCAAAAAATCGGACTTGCCTGCAAAGATGAGGAGGG
+ATTGCAGCGTGTTTTTAATGAGGTCATCACGGGATCCCATGTGCGTGACGGACATCGGGA
+AACGCCAAAGGAGATTATGTACCGAGGAAGAATGTCGCTGGACGGTATCGCGAAAATGTA
+TTCAGAAAATGATTATCAAGCCCTGTATCAGGACATGGTACGAGCTAAAAGATTCGATAC
+CGGCTCTTGTTCTGAGTCATGCGAAATATTTGGAGGGCAGCTTGATTTCGACTTCGGGAG
+GGAAGCTGCATGATGCGATGTTATCGGTGCGGTGAATGCAAAGAAGATAACCGCTTCCGA
+CCAAATCAACCTTACTGGAATCGATGGTGTCTCCGGTGTGAAAGAACACCAACAGGGGTG
+TTACCACTACCGCAGGAAAAGGAGGACGTGTGGCGAGACAGCGACGAAGTATCACCGACA
+TAATCTGCGAAAACTGCAAATACCTTCCAACGAAACGCACCAGAAATAAACCCAAGCCAA
+TCCCAAAAGAATCTGACGTAAAAACCTTCAACTACACGGCTCACCTGTGGGATATCCGGT
+GGCTAAGACGTCGTGCGAGGAAAACAAGGTGATTGACCAAAATCGAAGTTACGAACAAGA
+AAGCGTCGAGCGAGCTTTAACGTGCGCTAACTGCGGTCAGAAGCTGCATGTGCTGGAAGT
+TCACGTGTGTGAGCACTGCTGCGCAGAACTGATGAGCGATCCGAATAGCTCGATGCACGA
+GGAAGAAGATGATGGCTAAACCAGCGCGAAGACGATGTAAAAACGATGAATGCCGGGAAT
+GGTTTCACCCTGCATTCGCTAATCAGTGGTGGTGCTCTCCAGAGTGTGGAACCAAGATAG
+CACTCGAACGACGAAGTAAAGAACGCGAAAAAGCGGAAAAAGCAGCAGAGAAGAAACGAC
+GACGAGAGGAGCAGAAACAGAAAGATAAACTTAAGATTCGAAAACTCGCCTTAAAGCCCC
+GCAGTTACTGGATTAAACAAGCCCAACAAGCCGTAAACGCCTTCATCAGAGAAAGAGACC
+GCGACTTACCATGTATCTCGTGCGGAACGCTCACGTCTGCTCAGTGGGATGCCGGACATT
+ACCGGACAACTGCTGCGGCACCTCAACTCCGATTTAATGAACGCAATATTCACAAGCAAT
+GCGTGGTGTGCAACCAGCACAAAAGCGGAAATCTCGTTCCGTATCGCGTCGAACTGATTA
+GCCGCATCGGGCAGGAAGCAGTAGACGAAATCGAATCAAACCATAACCGCCATCGCTGGA
+CTATCGAAGAGTGCAAGGCGATCAAGGCAGAGTACCAACAGAAACTCAAAGACCTGCGAA
+ATAGCAGAAGTGAGGCCGCATGACGTTCTCAGTAAAAACCATTCCAGACATGCTCGTTGA
+AACATACGGAAATCAGACAGAAGTAGCACGCAGACTGAAATGTAGTCGCGGTACGGTCAG
+AAAATACGTTGATGATAAAGACGGGAAAATGCACGCCATCGTCAACGACGTTCTCATGGT
+TCATCGCGGATGGAGTGAAAGAGATGCGCTATTACGAAAAAATTGATGGCAGCAAATACC
+GAAATATTTGGGTAGTTGGCGATCTGCACGGATGCTACACGAACCTGATGAACAAACTGG
+ATACGATTGGATTCGACAACAAAAAAGACCTGCTTATCTCGGTGGGCGATTTGGTTGATC
+GTGGTGCAGAGAACGTTGAATGCCTGGAATTAATCACATTCCCCTGGTTCAGAGCTGTAC
+GTGGAAACCATGAGCAAATGATGATTGATGGCTTATCAGAGCGTGGAAACGTTAATCACT
+GGCTGCTTAATGGCGGTGGCTGGTTCTTTAATCTCGATTACGACAAAGAAATTCTGGCTA
+AAGCTCTTGCCCATAAAGCAGATGAACTTCCGTTAATCATCGAACTGGTGAGCAAAGATA
+AAAAATATGTTATCTGCCACGCCGATTATCCCTTTGACGAATACGAGTTTGGAAAGCCAG
+TTGATCATCAGCAGGTAATCTGGAACCGCGAACGAATCAGCAACTCACAAAACGGGATCG
+TGAAAGAAATCAAAGGCGCGGACACGTTCATCTTTGGTCATACGCCAGCAGTGAAACCAC
+TCAAGTTTGCCAACCAAATGTATATCGATACCGGCGCAGTGTTCTGCGGAAACCTAACAT
+TGATTCAGGTACAGGGAGAAGGCGCATGAGACTCGAAAGCGTAGCTAAATTTCATTCGCC
+AAAAAGCCCGATGATGAGCGACTCACCACGGGCCACGGCTTCTGACTCTCTTTCCGGTAC
+TGATGTGATGGCTGCTATGGGGATGGCGCAATCACAAGCCGGATTCGGTATGGCTGCATT
+CTGCGGTAAGCACGAACTCAGCCAGAACGACAAACAAAAGGCTATCAACTATCTGATGCA
+ATTTGCACACAAGGTATCGGGGAAATACCGTGGTGTGGCAAAGCTTGAAGGAAATACTAA
+GGCAAAGGTACTGCAAGTGCTCGCAACATTCGCTTATGCGGATTATTGCCGTAGTGCCGC
+GACGCCGGGGGCAAGATGCAGAGATTGCCATGGTACAGGCCGTGCGGTTGATATTGCCAA
+AACAGAGCTGTGGGGGAGAGTTGTCGAGAAAGAGTGCGGAAGATGCAAAGGCGTCGGCTA
+TTCAAGGATGCCAGCAAGCGCAGCATATCGCGCCGTGACGATGCTAATCCCAAACCTTAC
+CCAACCCACCTGGTCACGCACTGTTAAGCCGCTGTATGACGCTCTGGTGGTGCAATGCCA
+CAAAGAAGAGTCAATCGCAGACAACATTTTGAATGCGGTCACACGTTAGCAGCATGATTG
+CCACGGATGGCAACATATTAACGGCATGATATTGACTTATTGAATAAAATTGGGTAAATT
+TGACTCAACGATGGGTTAATTCGCTCGTTGTGGTAGTGAGATGAAAAGAGGCGGCGCTTA
+CTACCGATTCCGCCTAGTTGGTCACTTCGACGTATCGTCTGGAACTCCAACCATCGCAGG
+CAGAGAGGTCTGCAAAATGCAATCCCGAAACAGTTCGCAGGTAATAGTTAGAGCCTGCAT
+AACGGTTTCGGGATTTTTTATATCTGCACAACAGGTAAGAGCATTGAGTCGATAATCGTG
+AAGAGTCGGCGAGCCTGGTTAGCCAGTGCTCTTTCCGTTGTGCTGAATTAAGCGAATACC
+GGAAGCAGAACCGGATCACCAAATGCGTACAGGCGTCATCGCCGCCCAGCAACAGCACAA
+CCCAAACTGAGCCGTAGCCACTGTCTGTCCTGAATTCATTAGTAATAGTTACGCTGCGGC
+CTTTTACACATGACCTTCGTGAAAGCGGGTGGCAGGAGGTCGCGCTAACAACCTCCTGCC
+GTTTTGCCCGTGCATATCGGTCACGAACAAATCTGATTACTAAACACAGTAGCCTGGATT
+TGTTCTATCAGTAATCGACCTTATTCCTAATTAAATAGAGCAAATCCCCTTATTGGGGGT
+AAGACATGAAGATGCCAGAAAAACATGACCTGTTGGCCGCCATTCTCGCGGCAAAGGAAC
+AAGGCATCGGGGCAATCCTTGCGTTTGCAATGGCGTACCTTCGCGGCAGATATAATGGCG
+GTGCGTTTACAAAAACAGTAATCGACGCAACGATGTGCGCCATTATCGCCTAGTTCATTC
+GTGACCTTCTCGACTTCGCCGGACTAAGTAGCAATCTCGCTTATATAACGAGCGTGTTTA
+TCGGCTACATCGGTACTGACTCGATTGGTTCGCTTATCAAACGCTTCGCTGCTAAAAAAG
+CCGGAGTAGAAGATGGTAGAAATCAATAATCAACGTAAGGCGTTCCTCGATATGCTGGCG
+TGGTCGGAGGGAACTGATAACGGACGTCAGAAAACCAGAAATCATGGTTATGACGTCATT
+GTAGGCGGAGAGCTATTTACTGATTACTCCGATCACCCTCGCAAACTTGTCACGCTAAAC
+CCAAAACTCAAATCAACAGGCGCCGGACGCTACCAGCTTCTTTCCCGTTGGTGGGATGCC
+TACCGCAAGCAGCTTGGCCTGAAAGACTTCTCTCCGAAAAGTCAGGACGCTGTGGCATTG
+CAGCAGATTAAGGAGCGTGGCGCTTTACCTATGATTGATCGTGGTGATATCCGTCAGGCA
+ATCGACCGTTGCAGCAATATCTGGGCTTCACTGCCGGGCGCTGGTTATGGTCAGTTCGAG
+CATAAGGCTGACAGCCTGATTGCAAAATTCAAAGAAGCGGGCGGAACGGTCAGAGAGATT
+GATGTATGAGCAGAGTCACCGCGATTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCC
+TGTCATGGGCTGTTAATCATTACCGTGATAACGCCATTACCTACAAAGCCCAGCGCGACA
+AAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGC
+GTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCTAAAGCTGAAA
+ATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCT
+GTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGAC
+TGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAA
+AACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGTTGCCCATATC
+GATGGGCAACTCATGCAATTATTGTGAGCAATACACACGCGCTTCCAGCGGAGTATAAAT
+GCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAAC
+AACATTTTCTGCGCCGCCACAAATTTTGGCTGCATCGACAGTTTTCTTCTGCCCAATTCC
+AGAAACGAAGAAATGATGGGTGATGGTTTCCTTTGGTGCTACTGCTGCCGGTTTGTTTTG
+AACAGTAAACGTCTGTTGAGCACATCCTGTAATAAGCAGGGCCAGCGCAGTAGCGAGTAG
+CATTTTTTTCATGGTGTTATTCCCGATGCTTTTTGAAGTTCGCAGAATCGTATGTGTAGA
+AAATTAAACAAACCCTAAACAATGAGTTGAAATTTCATATTGTTAATATTTATTAATGTA
+TGTCAGGTGCGATGAATCGTCATTGTATTCCCGGATTAACTATGTCCACAGCCCTGACGG
+GGAACTTCTCTGCGGGAGTGTCCGGGAATAATTAAAACGATGCACACAGGGTTTAGCGCG
+TACACGTATTGCATTATGCCAACGCCCCGGTGCTGACACGGAAGAAACCGGACGTTATGA
+TTTAGCGTGGAAAGATTTGTGTAGTGTTCTGAATGCTCTCAGTAAATAGTAATGAATTAT
+CAAAGGTATAGTAATATCTTTTATGTTCATGGATATTTGTAACCCATCGGAAAACTCCTG
+CTTTAGCAAGATTTTCCCTGTATTGCTGAAATGTGATTTCTCTTGATTTCAACCTATCAT
+AGGACGTTTCTATAAGATGCGTGTTTCTTGAGAATTTAACATTTACAACCTTTTTAAGTC
+CTTTTATTAACACGGTGTTATCGTTTTCTAACACGATGTGAATATTATCTGTGGCTAGAT
+AGTAAATATAATGTGAGACGTTGTGACGTTTTAGTTCAGAATAAAACAATTCACAGTCTA
+AATCTTTTCGCACTTGATCGAATATTTCTTTAAAAATGGCAACCTGAGCCATTGGTAAAA
+CCTTCCATGTGATACGAGGGCGCGTAGTTTGCATTATCGTTTTTATCGTTTCAATCTGGT
+CTGACCTCCTTGTGTTTTGTTGATGATTTATGTCAAATATTAGGAATGTTTTCACTTAAT
+AGTATTGGTTGCGTAACAAAGTGCGGTCCTGCTGGCATTCTGGAGGGAAATACAACCGAC
+AGATGTATGTAAGGCCAACGTGCTCAAATCTTCATACAGAAAGATTTGAAGTAATATTTT
+AACCGCTAGATGAAGAGCAAGCGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGA
+TGATCCCTCCGTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAG
+TTACCCTGATGTTGTAATTGCATGTATAGAACATAAGGTGTCTCTGGAAGCATTCAGAGC
+AATTGAGGCAGCGTTGGTGAAGCACGATAATAATATGAAGGATTATTCCCTGGTGGTTGA
+CTGATCACCATAACTGCTAATCATTCAAACTATTTAGTCTGTGACAGAGCCAACACGCAG
+TCTGTCACTGTCAGGAAAGTGGTAAAACTGCAACTCAATTACTGCAATGCCCTCGTAATT
+AAGTGAATTTACAATATCGTCCTGTTCGGAGGGAAGAACGCGGGATGTTCATTCTTCATC
+ACTTTTAATTGATGTATATGCTCTCTTTTCTGACGTTAGTCTCCGACGGCAGGCTTCAAT
+GACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAAT
+CATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGA
+CATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTT
+AAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTT
+GATGGCCTCCACGCACGTTGTGATATGTAGATGATAATCATTATCACTTTACGGGTCCTT
+TCCGGTGATCCGACAGGTTACG
diff --git a/src/squirrel-1.4/seqs/lorist2.vep b/src/squirrel-1.4/seqs/lorist2.vep
new file mode 100644
index 0000000..e19a34b
--- /dev/null
+++ b/src/squirrel-1.4/seqs/lorist2.vep
@@ -0,0 +1,97 @@
+
+CTATGTATTCTATAGTGTCACCTAAATCGTATGTGTATGATACATAAGGTTATGT
+ATTAATTGTAGCCGCGTTCTAACGACAATATGTACAAGCCTAATTGTGTAGCATCTGGCT
+TACTGAAGCAGACCCTATCATCTCTCTCGTAAACTGCCGTCAGAGTCGGTTTGGTTGGAC
+GAACCTTCTGAGTTTCTGGTAACGCCGTCCCGCACCCGGAAATGGTCAGCGAACCAATCA
+GCAGGGTCATCGCTAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAATTC
+TAGCCAGATCCCCGGGCGGGCCTGATTTATGCTGGTTACTGTTGCGCCTGTTA
+GCGCGGCAACGTCCGGCGCACAGAAGCTATTATGCGTCCCCAGGTAATGAATAATTGCCT
+CTTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCGATCTT
+CGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCCCATGACAGGAAGTTGTTTTACTGG
+TTCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGC
+CTTACGGCGTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGT
+AACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCC
+TCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGC
+GGTAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGAAATGG
+TCGATTCTGCCGACGGGCTACGCGCATTCCTGCGTTAACCTGTTCCATCGTGGTGATCCC
+GTTTTCCCGAAAAGCCAGAACCCACTGGCGACGGATTTCGTTCACTTCGTTCTGGTCACG
+GTTAGCCAGGCTCGCCGGGAAAGTTGCCAGTAACTGGCTGAACACACCGTTGATGATCTG
+CGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTGTTGGCGATCCG
+ACGCATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAGATCCACC
+CCGTAAATCCAGTCTGTGTTTGTCAGGTCGAGTTTTGGTTTGCTGGCTGTCACGCCTGCC
+TGTTGCTTGTTACGGTTGATTTCGAGTTGGGTCCACTTATCGCGGAGTTTGGCCGGGCTC
+AGCACGTTACCGGACCAGAAGTTGTCCTGGCATGCCCAGCGGAACAGCACACACATGTCG
+CGGTGGTTACGTCCGTCACGTTCACGCATCAGGCGGATATCGTTAGCCCACCCAGCAAAA
+TTCGGTTTTCTGGCTGATGGTGCGATAGTCTTCACCATGTCAAACATCCACTCTGCGGCG
+GTCAGGTCTTCTGCTGTCCCCCACTTGCTGCCGCTCTGAATTGCAGCATCCGGTTTCACC
+ACAGAAAGGTCGTTTTCTGGCTGGTCAGAGGATTCGCCAGAATTCTCTGACGAATAATCT
+TTTCTTTTTTCTTTTGTAATAGTGTCTTTTGTGTCCCCCTGTTTTGAGGGATAGCAATCC
+CCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGGGATTTTCCCTCGTTTTGAGGGATG
+CACCATTCTGAGATGTTTTTATTTGGTCCAAACATGCCGCCTTGCTGCTTGATAATATTC
+ATTCTGACGAGTTCTAACTTGGCTTCATTGCACCGTTTGACAGGTAACTTTGTAATCTCG
+CTAAGTTGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACCCATAGGTTTTACGCAGA
+ATGGCAAGCAGCACTTTAAACTGTCGCTTGGTCAGATCTGCGCCCGAATAAGCCTCAAGC
+AGCATATTTGATAGTCTGGCGTAACCATCATCGAGATCTGCCACATTACGCTCCTGTCCG
+GCAAAGTTACCTCTGCCGAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTT
+GATAGATCCAGTAATGACCTCAGAACTCCATCTGGATTTGTTCAGAACGCTCGGTTGCCG
+CCGGGCGTTTTTTATTGGTGAGAATCGCAGCAACTTGTCGCGCCAATCGAGCCATGTCGT
+CGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCATTGAGAACTTTGGAATCCAGTCCC
+TCTTCCACCTGCTGATCTGCGACTTATCAACGCCCACAGCTTCCGCTGTCTTCTCAGTTC
+CAAGCATTGCGATTTTGTTAAGCAACGCACTCTCGATTCGTAGAGCCTCGTTGCGTTTGT
+TTGCACGAACCATATGTAAGTATTTCCTTAGATAACAATTGATTGAATGTATGCAAATAA
+ATGCATACACCATAGGTGTGGTTTAATTTGATGCCCTTTTTCAGGGCTGGAATGTGTAAG
+AGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCA
+TAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATCTTTCGGCCTGCATGAATGGCCTTG
+TTGATCGCGCTTTGATATACGCCGAGATCTTTAGCTGTCTTGGTTTGCCCAAAGCGCATT
+GCATAATCTTTCAGGGTTATGCGTTGTTCCATACAACCTCCTTAGTACATGCAACCATTA
+TCACCGCCAGAGGTAAAATAGTCAACACGCACGGTGTTAGATATTTATCCCTTGCGGTGA
+TAGATTTAACGTATGAGCACAAAAAAGAAACCATTAACACAAGAGCAGCTTGAGGACGCA
+CGTCGCCTTAAAGCAATTTATGAAAAAAAGAAAAATGAACTTGGCTTATCCCAGGAATCT
+GTCGCAGACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCATCAAT
+GCATTAAATGCTTATAACGCCGCATTGCTTACAAAAATTCTCAAAGTTAGCGTTGAAGAA
+TTTAGCCCTTCAATCGCCAGAGAAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCG
+TCACTTAGAAGTGAGTATGAGTACCCTGTTTTTTCTCATGTTCAGGCAGGGATGTTCTCA
+CCTAAGCT
+AGCTTCACGCTGCCGCAAGCACTCAGGGCGCAAGGGCTGCTAAAGGAAGCGGAACACGT
+AGAAAGCCAGTCCGCAGAAACGGTGCTGACCCCGGATGAATGTCAGCTACTGGGCTATCT
+GGACAAGGGAAAACGCAAGCGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGC
+GATAGCTAGACTGGGCGGTTTTATGGACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGC
+CCTCTGGTAAGGTTGGGAAGCCCTGCAAAGTAAACTGGATGGCTTTCTTGCCGCCAAGGA
+TCTGATGGCGCAGGGGATCAAGATCTGATCAAGAGACAGGATGAGGATCGTTTCGCATGA
+TTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCT
+ATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGC
+AGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGAACTGCAGG
+ACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCG
+ACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATC
+TCCTGTCATCTCACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGC
+GGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCG
+AGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAGAGC
+ATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCG
+AGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCC
+GCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAG
+CGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCTCG
+TGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATCGCCTTCTTGACG
+AGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCGACCAAGCGACGCCCAACCTGCC
+ATCACGAGATTTCGATTCCACCGCCGCCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTT
+CCGGGACGCCGGCTGGATGATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCA
+CCCCGGGCTCGATCCCCTCGCGAGTTGGTTCAGCTGCTGCCTGAGGCTGGACGACCTCGC
+GGAGTTCTACCGGCAGTGCAAATCCGTCGGCATCCAGGAAACCAGCAGCGGCTATCCGCG
+CATCCATGCCCCCGAACTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCGGATCAAT
+TCGCGCGACGG
+ATCGATCCGGGCAACGTTGTTGCCATTGCTGCAGGGGGGGGGGGGGGGGGGGGGGGGGGT
+TGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAA
+AAGCCTCGCTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGAGGGTATTTTAAATAAAAAC
+ATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGGAAAATTTTCATAAATAGCG
+AAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGATCACCGGAAAGGACCCGTAAAGTG
+ATAATGATTATCATCTACATATCACAACGTGCGTGGAGGCCATCAAACCACGTCAAATAA
+TCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTT
+CAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAACCCCCCCCCCCCCCC
+CCCCCCCCCCCTGCAGGCGGAGAACTGGTAGGTATGGAAGATCTCTAGAGAATTCAAAAA
+AAGCCCGCTCATTAGGCGGGCTAATTCGAGCT
+CGCCCGGGGATCGATCCTCTAGAGTCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCG
+GTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAA
+GCCAGTATATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCG
+CCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAA
+GCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGC
+GCGAGGCCCAGCTGGCTTATCGAAATTAATACGACTCACTATAGGGAGACCGGAAGCTTA
+GGATC
diff --git a/src/squirrel-1.4/seqs/lorist6.vep b/src/squirrel-1.4/seqs/lorist6.vep
new file mode 100644
index 0000000..a94a432
--- /dev/null
+++ b/src/squirrel-1.4/seqs/lorist6.vep
@@ -0,0 +1,109 @@
+GATCCTAGTATTCTATAGTGTCACCTAAATCGTATGTGTATGATACATAA
+GGTTATGTATTAATTGTAGCCGCGTTCTAACGACAATATGTACAAGCCTA
+ATTGTGTAGCATCTGGCTTACTGAAGCAGACCCTATCATCTCTCTCGTAA
+ACTGCCGTCAGAGTCGGTTTGGTTGGACGAACCTTCTGAGTTTCTGGTAA
+CGCCGTCCCGCACCCGGAAATGGTCAGCGAACCAATCAGCAGGGTCATCG
+CTAGAAATCATCCTTAGCGAAAGCTAAGGATTTTTTTTATCTGAATTCTA
+GCCAGATCCCCGCTGATTTATGCTGGTTACTGTTGCGCCTGTTAGCGCGG
+CAACGTCCGGCGCACAGAAGCTATTATGCGTCCCCAGGTAATGAATAATT
+GCCTCTTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTG
+ATTTCTGCGATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGCC
+CATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAA
+TTCTCGCAGTCATATGGACAAGCTCATCTGCGGCCTTACGGCGTAATTCC
+GCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTAACCAG
+CCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACA
+GGCCTCGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACG
+TTTGGCAGTCCGGCGGTAACGGATGCTTCTTCCCGGCACCATGCAACAAA
+CTGCCCGGGTGATGGCAGAAATGGTCGATTCTGCCGACGGGCTACGCGCA
+TTCCTGCGTTAACCTGTTCCATCGTGGTGATCCCGTTTTCCCGAAAAGCC
+AGAACCCACTGGCGACGGATTTCGTTCACTTCGTTCTGGTCACGGTTTTC
+CCGAAAAGCCAGAACCCACTGGCGACGGATTTCGTTCACTTCGTTCTGGT
+CACGGTTAGCCAGGCTCGCCGGGAAAGTTGCCAGTAACTGGCTGAACACA
+CCGTTGATGATCTGCGCTACCTGCTGTACCTGCGGCTTTTCGTCGTACTG
+TTCCGGCATGTTGTTGGCGATCCGACGCATCTGCTCACGGTCAAAGTTAA
+CCATCTGTGCGGCGATGTTTTTCATAGATCCACCCCGTAAATCCAGTCTG
+TGTTTGTCAGGTCGAGTTTTGGTTTGCTGGCTGTCACGCCTGCCTGTTGC
+TTGTTACGGTTGATTTCGAGTTGGGTCCACTTATCGCGGAGTTTGGCCGG
+GCTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCATGCCCAGCGGAACA
+GCACACACATGTCGCGGTGGTTACGTCCGTCACGTTCACGCATCAGGCGG
+ATATCGTTAGCCCACCCAGCAAAATTCGGTTTTCTGGCTGATGGTGCGAT
+AGTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTG
+TCCCCCACTTGCTGCCGCTCTGAATTGCAGCATCCGGTTTCACCACAGAA
+AGGTCGTTTTCTGGCTGGTCAGAGGATTCGCCAGAATTCTCTGACGAATA
+ATCTTTTCTTTTTTCTTTTGTAATAGTGTCTTTTGTGTCCCCCTGTTTTG
+AGGGATAGCAATCCCCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGG
+GATTTTCCCTCGTTTTGAGGGATGCACCATTCTGAGATGTTTTTATTTGG
+TCCAAACATGCCGCCTTGCTGCTTGATAATATTCATTCTGACGAGTTCTA
+ACTTGGCTTCATTGCACCGTTTGACAGGTAACTTTGTAATCTCGCTAAGT
+TGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACCCATAGGTTTTACG
+CAGACTAAGTTGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACCCAT
+AGGTTTTACGCAGAATGGCAAGCAGCACTTTAAACTGTCGCTTGGTCAGA
+TCTGCGCCCGAATAAGCCTCAAGCAGCATATTTGATAGTCTGGCGTAACC
+ATCATCGAGATCTGCCACATTACGCTCCTGTCCGGCAAAGTTACCTCTGC
+CGAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTTGATAGA
+TCCAGTAATGACCTCAGAACTCCATCTGGATTTGTTCAGAACGCTCGGTT
+GCCGCCGGGCGTTTTTTATTGGTGAGAATCGCAGCAACTTGTCGCGCCAA
+TCGAGCCATGTCGTCGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCA
+TTGAGAACTTTGGAATCCAGTCCCTCTTCCACCTGCTGATCTGCGACTTA
+TCAACGCCCACAGCTTCCGCTGTCTTCTCAGTTCCAAGCATTGCGATTTT
+GTTAAGCAACGCACTCTCGATTCGTAGAGCCTCGTTGCGTTTGTTTGCAC
+GAACCATATGTAAGTATTTCCTTAGATAACAATTGATTGAATGTATGCAA
+ATAAATGCATACACCATAGGTGTGGTTTAATTTGATGCCCTTTTTCAGGG
+CTGGAATGTGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTC
+GGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGT
+TAAAAAAATCTTTCGGCCTGCATGAATGGCCTTGTTGATCGCGCTTTGAT
+ATACGCCGAGATCTTTAGCTGTCTTGGTTTGCCCAAAGCGCATTGCATAA
+TCTTTCAGGGTTATGCGTTGTTCCATACAACCTCCTTAGTACATGCAACC
+ATTATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTGTTAGATATTT
+ATCCCTTGCGGTGATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTG
+TTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAGCACAAAAAA
+GAAACCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCAA
+TTTATGAAAAAAAGAAAAATGAACTTGGCTTATCCCAGGAATCTGTCGCA
+GACAAGATGGGGATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCAT
+CAATGCATTAAATGCTTATAACGCCGCATTGCTTACAAAAATTCTCAAAG
+TTAGCGTTGAAGAATTTAGCCCTTCAATCGCCAGAGAAATCTACGAGATG
+TATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTACCC
+TGTTTTTTCTCATGTTCAGGCAGGGATGTTCTCACCTAAGCTAGCGAATT
+CTCATGTTTGACAGCTTATCATCGATATGCTTTAATGCGGTAGTGATCAA
+GAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGC
+AGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCAC
+AACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAG
+GGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGAATGA
+ACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTC
+CTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTG
+CTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTGCTCC
+TGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCTCCTGTCATCTC
+ACCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGG
+CTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACA
+TCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGG
+ATGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCC
+AGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGG
+CGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGAT
+TCATCGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCG
+TTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCG
+CTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCT
+TCTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGATG
+TCGACAGCCCGCCTAATGAGCGGGCTTTTTTTTCCTGAGGCTGGACGACC
+TCGCGGAGTTCTACCGGCAGTGCAAATCCGTCGGCATCCAGGAAACCAGC
+AGCGGCTATCCGCGCATCCATGCCCCCGAACTGCAGGAGTGGGGAGGCAC
+GATGGCCGCTTTGGTCCCGGATCAATTCGCGCGACCGGATCGATCCGGGC
+AACGTTGTTGCCATTGCTGCAGGGGGGGGGGGGGGGGGTTCCATTGTTCA
+TTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAATCCATTGT
+TCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCT
+CGCTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGAGGGTATTTTAAATAA
+AAACATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGGAAAAT
+TTTCATAAATAGCGAAAACCCGCGAGGTCGCCGCCCCGTAACCTGTCGGA
+TCACCGGAAAGGACCCGTAAAGTGATAATGATTATCATCTACATATCACA
+ACGTGCGTGGAGGCCATCAAACCACGTCAAATAATCAATTATGACGCAGG
+TATCGTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACA
+ATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCCCCCCCCCCC
+CCCCCTcCAGGCGcAGAACTGGTAGGTATGGAAGATCTCTAGAATTCAAA
+AAAAGCCCGCTCATTAGGCGGGCTAATTCGAGCTCGCCCGGGGATCGATC
+CTCTAGAGTCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTT
+CACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAG
+TTAAGCCAGTATATACACTCCtgatgCGCTACGTGACTGGGTCATGGCTG
+CGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTG
+CTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCAT
+GTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCCCAGCTGGC
+TTATCGAAATTAATACGACTCACTATAGGGAGACCCAAGCTTAGGATCGC
+GGCCGCAGTACTGGATC
\ No newline at end of file
diff --git a/src/squirrel-1.4/seqs/loristB.seq b/src/squirrel-1.4/seqs/loristB.seq
new file mode 100644
index 0000000..46a59d0
--- /dev/null
+++ b/src/squirrel-1.4/seqs/loristB.seq
@@ -0,0 +1,105 @@
+GATCCTA*GTATTCTATAGTGTCACCTAAATCGTATGTGTATGATACATAAGGTTATGTA
+TTAATTGTAGCCGCGTTCTAACGACAATATGTACAAGCCTAATTGTGTAGCATCTGGCTT
+ACTGAAGCAGACCCTATCATCTCTCTCGTAAACTGCCGTCAGAGTCGGTTTGGTTGGACG
+AACCTTCTGAGTTTCTG*GTAACGCCGTCCCGCACCCGGAAATGGTCAGCGAACCAATCA
+GCAGGGTCATCGCTAGCCAGATCCCCGCTGATTTATGCTGGTTACTGTTGCGCCTGTTAG
+CGCGGCAACGTCCGGCGCACAGAAGCTATTATGCGTCCCCAGGTAATGAATAATTGCCTC
+TTTGCCCGTCATACACTTGCTCCTTTCAGTCCGAACTTAGCTTTGATTTCTGCGATCTTC
+GCCAGAGCCTGTGCACGATTTAGAG-TCTACCGCCCATGACAGGAAGTTGTTTTACTGGT
+TCAGGGATCGCCTCACCACGGTTAATTCTCGCAGTCATATGGACAAGCTCATCTGCGGCC
+TTACGG-GTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTA
+TTACGG-GTAATTCCGCATCAGTAAGCGCATTGGCCCGCATGTTCTGATACAGGTTGGTA
+ACCAGCCAGTAGTGCGCGTTTGATTTCCACGGATAAGACTCCGCATCCGGATACAGGCCT
+CGCTTCCGGCAATACTCGTAAACCATATCAACCAGCTCGCTGACGTTTGGCAGTCCGGCG
+GTAACGGATGCTTCTTCCCGGCACCATGCAACAAACTGCCCGGGTGATGGCAGAAATGGT
+CGATTCTGCC*GACGGGCT*ACGC-CATTCCTGCGTT*AACCTGTTCCATCGTGGTGATC
+CCGTTTT*CCCGAAAAGCCAGAACCCACTGGCGACGGATTTCGTT-ACTTCGTTCTGGTC
+ACGGTTAGCCAGGCTCGCCGGGAAAGTTGCCAGTAACTGGCTGAACACACCGTTGATGAT
+CTGCGCTACCTGCTGTA-CTGCGGCTTTTCGTCGTACTGTTCCGGCATGTTGTTGGCGAT
+CCG*ACGCATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAGATC
+CCG*ACGCATCTGCTCACGGTCAAAGTTAACCATCTGTGCGGCGATGTTTTTCATAGATC
+CACCCCGTAAATCCAGTCTGTGTTTGTCAGGTCGAGTTTTGGTTTGCTGGCTGTCACGCC
+TGCCTGTTGCTTGTTACGGTTGATTTCGAGTTGGGTCCACTTATCGCGGAGTTTGGCCGG
+GCTCAGCACGTTACCGGACCAGAAGTTGTCCTGGCATGCCCAGCGGAACAGCACACACAT
+GTCGCGGTGGTTACGTCCGTCACGTTCACGCATCAGGCGGATATCGTTAGCCCACCCAGC
+AAAATTCGGTTTTCTGGCTGATGGTGCGATAGTCTTCACCATGTCAAACATCCACTCTGC
+GGCGGTCAGGTCTTCTGCTGT**CCCCCAC*TTGCTGCCGCTCTGAATTGCAGCAT-CCG
+GTTTCACCACAGAAAGGTCGTTTTCTGGCTGGTCAGAGGATTCGCCAGAATTCTCTGACG
+AATAATCTTTTCTTTTTTCTTTTGTAATAGTGTCTTTTGTGTCCCCCTGTTTTGAGGGAT
+AGCAATCCCCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGGGATTTTCCCTCGTTTT
+AGCAATCCCCCAATTTGAGGGATGTTTTATCCCTCGTTTTAGGGGATTTTCCCTCGTTTT
+GAGGGATGCA*CCATTCTGAGATGTTTTTATTTGGTCCAAACATGCC*GCCTTGCTGCTT
+*--ATAAT*ATTCATTCTGACGAGTTCTAACTT*GGCTTC*ATTGCACCG*TTTGACAGG
+TAACTTTGTAATCTCGCT*AAGTTGAGAATCGGTGATTCTGTCCATTGGTTTATTCCACC
+CATAGGTTTTACGCAGAATGGCAAGCAGCACTTT*AAACTGTCGCTTGGTCAGATCTGCG
+CCCGAATAAGCCTCAAGCAGCATATTTGATAGTCTGGCGTAACCATCATCGAGATCTGCC
+ACATTACGCTCCTGTCCGGCAAAGTTACCTCTGCCGAAGTTGAGTATTTTTGCTGTATTT
+GTCATAATGACTCCTGTTGATAGATCCAGTAATGACCTCAGAACTCCATCTGGATTTGTT
+CAGAACGCTCGGTTGCCGCCGGGCGTTTTTTATTGGTGAGAATCGCAGCAACTTGTCGCG
+CCAATCGAGCCA*TGTCGTCGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCATTGAG
+CCAATCGAGCCA*TGTCGTCGTCAACGACCCCCCATTCAAGAACAGCAAGCAGCATTGAG
+AACTTTGGAATCCAGTCCCTCTTCCACCTGCTGATCTGCGACTTATCAACG-CCC-A---
+-TTCCGCTGTCTTCTCAG*TTCCAAGCATTGCGATTTTGTTAAGCAACGCACTCTCGATT
+CGTAGAGCCTCGTTGCGTTTGTTTGCACGAACCATATGTAAGTATTTCCTTAGATAACAA
+TTGATTGAATGTATGCAAATAAATGCATA---CCATAGGTGTGGTTTAATTTGA-TGCCC
+TTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATGCTGTTGTTTTTTTGTTACTC
+GGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGCGTTTATAGTTAAAAAAATC
+TTTCGGCCTGCATGAATGGCCTTGTTGATCGCGCTTTGATATACGCCGAGATCTTTAGCT
+GTCTTGGTTTGCCCAAAGCGCATTGCATAATCTTTCAGGGTTATGCGTTGTTCCATACAA
+CCTCCTTAGTACATGCAACCATTATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTG
+CCTCCTTAGTACATGCAACCATTATCACCGCCAGAGGTAAAATAGTCAACACGCACGGTG
+TTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAGCACAAAAAAGAAACCATTA
+ACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCAATTTATGAAAAAAAGAAAAAT
+GAACTTGGCTTATCCCAGGAATCTGTCGCAGCAA--ATGGGGATGGGGCAGTCAGGCGTT
+GGTGCTTTATTTAATGGCATCAATGCATTAAATGCTTATAACGCCGCATT*GCTTGCAAA
+AATTCTCAAAGTTAGCGTTGAAGAATTTAGCCCTTCAATCGCCAGAGAAATCTACGAGAT
+GTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTATGAGTACCCTGTTTTTTC
+TCATGTTCAGGCAGGGAT*GTTCTCACCTGAGCTTAGAACCTTTACCAAAGGT*GATGCG
+GAGAGATGGGTAAGCACAACCAAAAAAGCCAGTGATTCTGCATTCTGGCTTGAGGTTGAA
+GGTAATTCCATGACCGCACCAACAGGCTCCAAGCCAAGCTAGCTTCACGCT-CCGCAAGC
+GGTAATTCCATGACCGCACCAACAGGCTCCAAGCCAAGCTAGCTTCACGCT-CCGCAAGC
+ACTCAGGGCGCAAGGGCTGCTAAAGGAAGCGGAACACGTAGAAAGCCAGTCCGCAGAAAC
+GGTGCTGACCCCGG*ATGAATGTCAGCTACTGGGCTATCTGGACAAGGGAAAACGCAAGC
+GCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATGGCGATA--TAGACTGGGCGGTT
+TTATGGACAGCAAGCGAACCGGAATTGCCAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAG
+CCCTGCAAAGTAAACTGGATGGCTTTCTTGCCG*CCAAGGATCTGATGGCGCAGGGGAT*
+*CAAGATCTGATCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTG
+CACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGGGCACAACAG
+ACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGC-C-G--TTC
+TTTTTGTCAAGACCGACCTGTCCGGT-CCCTGAATGAACTGCAGGACGAGGCAGCGCGGC
+TTTTTGTCAAGACCGACCTGTCCGGT-CCCTGAATGAACTGCAGGACGAGGCAGCGCGGC
+TATCGTGGCTGG--------GGGCGTTCCTTGCGCA-----TGCTCGACGTTGTCACTGA
+AGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCA
+CCTTGCTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCATACGCT
+TGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCATCGAGCGAGCACGTAC
+TCGGATGGAAGCCGGTCTTGTCGATCAGGATGATCTGGACGAAG-AGCAT*CAG--G-TC
+GCGCCAGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCG*AGGATCTCGT
+CGTGACCCAT-GCGATGCCTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGG
+ATTCATCGACTGTGGCCGGCTGGGTGTG-CGGACCGCTATCAGGACATAGCGTTGGCTAC
+CCGTGATATTG--GAAGAGCTT-GCGGC****GAATGGGCTGACCGCTTCCTCGTGCTTT
+ACGGTATCGCCGCTCCCGATTCGCAGCGCAT-GCCTTCTATCGCCTTCTTGACGAGTTCT
+TCTGAGCGGGACTCT-GGG-TTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCAC
+TCTGAGCGGGACTCT-GGG-TTCGAAATGACCGACCAAGCGACGCCCAACCTGCCATCAC
+GAGATTTCGATTCCACCGCCG*CCTTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGG
+GA-GCCG-C-TGGATGA-CCTCCAGCGCGGGGATCTCATGCTGGAGTTCTT-GCCCACCC
+CGGGCTCGA-CCCCTCGCGAGTTGGTTCAGCTGCTGCCT-AGGCTGG-ACGA-CTCGCGG
+AGTTCTACCGGCAGTGCAAATCCGTCGGCATCCAGGAAACCAGCAGCGGCTATCCGCGC*
+ATCCATGCCCCCGAACTGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTC--GGATCAA
+TTCGCGCGA-CGGATCGATCCGGGCAACGTTGTT--CATTGCTGCAGGGGGGGGGGGGGG
+GGGGGGGGGGGG--G--TTCCATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGA
+CAGAGGCCAAAAAGC-TC-CTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGAGGGTATTT
+TAAATAAAAACATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGGAAAATTTT
+TAAATAAAAACATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGGAAAATTTT
+CATAAATAGCGAAAACCCGCGAGGTCG-CGCCCCGTAACCTGTCGGAT-ACCGGAAAGGA
+CCCGTAAAGTGATAATGATTATCATCTACATA-TCACAACGTGCGTGGAGGCCATCAAAC
+CACGTCAAATAATCAATTATGACGCAGGTATCGTATTAATTGATCTGCATCAACTTAACG
+TAAAAACAACTTCAGACAATACAAATCAGCGACACTGAATACGGGGCAACCTCATGTCAA
+CCCCCCCCCCCCCCCCCCCCCCCCCCTGCAGGCGGAGAACTGGTAGGTATGGAAGATCTC
+TAGAGAATTCAAAAAAAGCCCGCTCATTAGGCGGGCTAATTCGAGCTCGCCCGGGGATCG
+ATCCTCTAGAGTCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCG
+CATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATATACA
+CTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTG
+ACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCT
+CCGGGAGCTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCCCAGCT
+GGCTTATCGAAATTAATACGACTCACTATAGGGAGACCGGAAGCTTAGGATCGCGGCCGC
+AGTACT
diff --git a/src/squirrel-1.4/seqs/m13mp18.vec b/src/squirrel-1.4/seqs/m13mp18.vec
new file mode 100644
index 0000000..25341c9
--- /dev/null
+++ b/src/squirrel-1.4/seqs/m13mp18.vec
@@ -0,0 +1,123 @@
+<---m13mp.001------>
+;M13MP18 cloning vector from vecbase
+ AATGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT
+ ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT
+ CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA
+ GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA
+ TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG
+ TTGGAGTTTGCTTCCGGTCTGGTTCGCTTTGAAGCTCGAATTAAAACGCGATATTTGAAG
+ TCTTTCGGGCTTCCTCTTAATCTTTTTGATGCAATCCGCTTTGCTTCTGACTATAATAGT
+ CAGGGTAAAGACCTGATTTTTGATTTATGGTCATTCTCGTTTTCTGAACTGTTTAAAGCA
+ TTTGAGGGGGATTCAATGAATATTTATGACGATTCCGCAGTATTGGACGCTATCCAGTCT
+ AAACATTTTACTATTACCCCCTCTGGCAAAACTTCTTTTGCAAAAGCCTCTCGCTATTTT
+ GGTTTTTATCGTCGTCTGGTAAACGAGGGTTATGATAGTGTTGCTCTTACTATGCCTCGT
+ AATTCCTTTTGGCGTTATGTATCTGCATTAGTTGAATGTGGTATTCCTAAATCTCAACTG
+ ATGAATCTTTCTACCTGTAATAATGTTGTTCCGTTAGTTCGTTTTATTAACGTAGATTTT
+ TCTTCCCAACGTCCTGACTGGTATAATGAGCCAGTTCTTAAAATCGCATAAGGTAATTCA
+ CAATGATTAAAGTTGAAATTAAACCATCTCAAGCCCAATTTACTACTCGTTCTGGTGTTT
+ CTCGTCAGGGCAAGCCTTATTCACTGAATGAGCAGCTTTGTTACGTTGATTTGGGTAATG
+ AATATCCGGTTCTTGTCAAGATTACTCTTGATGAAGGTCAGCCAGCCTATGCGCCTGGTC
+ TGTACACCGTTCATCTGTCCTCTTTCAAAGTTGGTCAGTTCGGTTCCCTTATGATTGACC
+ GTCTGCGCCTCGTTCCGGCTAAGTAACATGGAGCAGGTCGCGGATTTCGACACAATTTAT
+ CAGGCGATGATACAAATCTCCGTTGTACTTTGTTTCGCGCTTGGTATAATCGCTGGGGGT
+ CAAAGATGAGTGTTTTAGTGTATTCTTTCGCCTCTTTCGTTTTAGGTTGGTGCCTTCGTA
+ GTGGCATTACGTATTTTACCCGTTTAATGGAAACTTCCTCATGAAAAAGTCTTTAGTCCT
+ CAAAGCCTCTGTAGCCGTTGCTACCCTCGTTCCGATGCTGTCTTTCGCTGCTGAGGGTGA
+ CGATCCCGCAAAAGCGGCCTTTAACTCCCTGCAAGCCTCAGCGACCGAATATATCGGTTA
+ TGCGTGGGCGATGGTTGTTGTCATTGTCGGCGCAACTATCGGTATCAAGCTGTTTAAGAA
+ ATTCACCTCGAAAGCAAGCTGATAAACCGATACAATTAAAGGCTCCTTTTGGAGCCTTTT
+ TTTTTGGAGATTTTCAACGTGAAAAAATTATTATTCGCAATTCCTTTAGTTGTTCCTTTC
+ TATTCTCACTCCGCTGAAACTGTTGAAAGTTGTTTAGCAAAACCCCATACAGAAAATTCA
+ TTTACTAACGTCTGGAAAGACGACAAAACTTTAGATCGTTACGCTAACTATGAGGGTTGT
+ CTGTGGAATGCTACAGGCGTTGTAGTTTGTACTGGTGACGAAACTCAGTGTTACGGTACA
+ TGGGTTCCTATTGGGCTTGCTATCCCTGAAAATGAGGGTGGTGGCTCTGAGGGTGGCGGT
+ TCTGAGGGTGGCGGTTCTGAGGGTGGCGGTACTAAACCTCCTGAGTACGGTGATACACCT
+ ATTCCGGGCTATACTTATATCAACCCTCTCGACGGCACTTATCCGCCTGGTACTGAGCAA
+ AACCCCGCTAATCCTAATCCTTCTCTTGAGGAGTCTCAGCCTCTTAATACTTTCATGTTT
+ CAGAATAATAGGTTCCGAAATAGGCAGGGGGCATTAACTGTTTATACGGGCACTGTTACT
+ CAAGGCACTGACCCCGTTAAAACTTATTACCAGTACACTCCTGTATCATCAAAAGCCATG
+ TATGACGCTTACTGGAACGGTAAATTCAGAGACTGCGCTTTCCATTCTGGCTTTAATGAA
+ GATCCATTCGTTTGTGAATATCAAGGCCAATCGTCTGACCTGCCTCAACCTCCTGTCAAT
+ GCTGGCGGCGGCTCTGGTGGTGGTTCTGGTGGCGGCTCTGAGGGTGGTGGCTCTGAGGGT
+ GGCGGTTCTGAGGGTGGCGGCTCTGAGGGAGGCGGTTCCGGTGGTGGCTCTGGTTCCGGT
+ GATTTTGATTATGAAAAGATGGCAAACGCTAATAAGGGGGCTATGACCGAAAATGCCGAT
+ GAAAACGCGCTACAGTCTGACGCTAAAGGCAAACTTGATTCTGTCGCTACTGATTACGGT
+ GCTGCTATCGATGGTTTCATTGGTGACGTTTCCGGCCTTGCTAATGGTAATGGTGCTACT
+ GGTGATTTTGCTGGCTCTAATTCCCAAATGGCTCAAGTCGGTGACGGTGATAATTCACCT
+ TTAATGAATAATTTCCGTCAATATTTACCTTCCCTCCCTCAATCGGTTGAATGTCGCCCT
+ TTTGTCTTTAGCGCTGGTAAACCATATGAATTTTCTATTGATTGTGACAAAATAAACTTA
+ TTCCGTGGTGTCTTTGCGTTTCTTTTATATGTTGCCACCTTTATGTATGTATTTTCTACG
+ TTTGCTAACATACTGCGTAATAAGGAGTCTTAATCATGCCAGTTCTTTTGGGTATTCCGT
+ TATTATTGCGTTTCCTCGGTTTCCTTCTGGTAACTTTGTTCGGCTATCTGCTTACTTTTC
+ TTAAAAAGGGCTTCGGTAAGATAGCTATTGCTATTTCATTGTTTCTTGCTCTTATTATTG
+ GGCTTAACTCAATTCTTGTGGGTTATCTCTCTGATATTAGCGCTCAATTACCCTCTGACT
+ TTGTTCAGGGTGTTCAGTTAATTCTCCCGTCTAATGCGCTTCCCTGTTTTTATGTTATTC
+ TCTCTGTAAAGGCTGCTATTTTCATTTTTGACGTTAAACAAAAAATCGTTTCTTATTTGG
+ ATTGGGATAAATAATATGGCTGTTTATTTTGTAACTGGCAAATTAGGCTCTGGAAAGACG
+ CTCGTTAGCGTTGGTAAGATTCAGGATAAAATTGTAGCTGGGTGCAAAATAGCAACTAAT
+ CTTGATTTAAGGCTTCAAAACCTCCCGCAAGTCGGGAGGTTCGCTAAAACGCCTCGCGTT
+ CTTAGAATACCGGATAAGCCTTCTATATCTGATTTGCTTGCTATTGGGCGCGGTAATGAT
+ TCCTACGATGAAAATAAAAACGGCTTGCTTGTTCTCGATGAGTGCGGTACTTGGTTTAAT
+ ACCCGTTCTTGGAATGATAAGGAAAGACAGCCGATTATTGATTGGTTTCTACATGCTCGT
+ AAATTAGGATGGGATATTATTTTTCTTGTTCAGGACTTATCTATTGTTGATAAACAGGCG
+ CGTTCTGCATTAGCTGAACATGTTGTTTATTGTCGTCGTCTGGACAGAATTACTTTACCT
+ TTTGTCGGTACTTTATATTCTCTTATTACTGGCTCGAAAATGCCTCTGCCTAAATTACAT
+ GTTGGCGTTGTTAAATATGGCGATTCTCAATTAAGCCCTACTGTTGAGCGTTGGCTTTAT
+ ACTGGTAAGAATTTGTATAACGCATATGATACTAAACAGGCTTTTTCTAGTAATTATGAT
+ TCCGGTGTTTATTCTTATTTAACGCCTTATTTATCACACGGTCGGTATTTCAAACCATTA
+ AATTTAGGTCAGAAGATGAAATTAACTAAAATATATTTGAAAAAGTTTTCTCGCGTTCTT
+ TGTCTTGCGATTGGATTTGCATCAGCATTTACATATAGTTATATAACCCAACCTAAGCCG
+ GAGGTTAAAAAGGTAGTCTCTCAGACCTATGATTTTGATAAATTCACTATTGACTCTTCT
+ CAGCGTCTTAATCTAAGCTATCGCTATGTTTTCAAGGATTCTAAGGGAAAATTAATTAAT
+ AGCGACGATTTACAGAAGCAAGGTTATTCACTCACATATATTGATTTATGTACTGTTTCC
+ ATTAAAAAAGGTAATTCAAATGAAATTGTTAAATGTAATTAATTTTGTTTTCTTGATGTT
+ TGTTTCATCATCTTCTTTTGCTCAGGTAATTGAAATGAATAATTCGCCTCTGCGCGATTT
+ TGTAACTTGGTATTCAAAGCAATCAGGCGAATCCGTTATTGTTTCTCCCGATGTAAAAGG
+ TACTGTTACTGTATATTCATCTGACGTTAAACCTGAAAATCTACGCAATTTCTTTATTTC
+ TGTTTTACGTGCTAATAATTTTGATATGGTTGGTTCAATTCCTTCCATAATTCAGAAGTA
+ TAATCCAAACAATCAGGATTATATTGATGAATTGCCATCATCTGATAATCAGGAATATGA
+ TGATAATTCCGCTCCTTCTGGTGGTTTCTTTGTTCCGCAAAATGATAATGTTACTCAAAC
+ TTTTAAAATTAATAACGTTCGGGCAAAGGATTTAATACGAGTTGTCGAATTGTTTGTAAA
+ GTCTAATACTTCTAAATCCTCAAATGTATTATCTATTGACGGCTCTAATCTATTAGTTGT
+ TAGTGCACCTAAAGATATTTTAGATAACCTTCCTCAATTCCTTTCTACTGTTGATTTGCC
+ AACTGACCAGATATTGATTGAGGGTTTGATATTTGAGGTTCAGCAAGGTGATGCTTTAGA
+ TTTTTCATTTGCTGCTGGCTCTCAGCGTGGCACTGTTGCAGGCGGTGTTAATACTGACCG
+ CCTCACCTCTGTTTTATCTTCTGCTGGTGGTTCGTTCGGTATTTTTAATGGCGATGTTTT
+ AGGGCTATCAGTTCGCGCATTAAAGACTAATAGCCATTCAAAAATATTGTCTGTGCCACG
+ TATTCTTACGCTTTCAGGTCAGAAGGGTTCTATCTCTGTTGGCCAGAATGTCCCTTTTAT
+ TACTGGTCGTGTGACTGGTGAATCTGCCAATGTAAATAATCCATTTCAGACGATTGAGCG
+ TCAAAATGTAGGTATTTCCATGAGCGTTTTTCCTGTTGCAATGGCTGGCGGTAATATTGT
+ TCTGGATATTACCAGCAAGGCCGATAGTTTGAGTTCTTCTACTCAGGCAAGTGATGTTAT
+ TACTAATCAAAGAAGTATTGCTACAACGGTTAATTTGCGTGATGGACAGACTCTTTTACT
+ CGGTGGCCTCACTGATTATAAAAACACTTCTCAAGATTCTGGCGTACCGTTCCTGTCTAA
+ AATCCCTTTAATCGGCCTCCTGTTTAGCTCCCGCTCTGATTCCAACGAGGAAAGCACGTT
+ ATACGTGCTCGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGG
+ GTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTT
+ TCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATC
+ GGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTG
+ ATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGA
+ CGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACC
+ CTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGAACCACCATCAA
+ ACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGG
+ CCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCGCTGGTGAAAAGAAAAACCACCCT
+ GGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGC
+ ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGC
+ TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA
+ TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCGAGC
+ TCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTGGCACTGGCCGTC
+ GTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCA
+ CATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAA
+ CAGTTGCGCAGCCTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTG
+ CCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAGGCCGATACGGTCGTCGTCCCCTCAAAC
+ TGGCAGATGCACGGTTACGATGCGCCCATCTACACCAACGTAACCTATCCCATTACGGTC
+ AATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTT
+ GATGAAAGCTGGCTACAGGAAGGCCAGACGCGAATTATTTTTGATGGCGTTCCTATTGGT
+ TAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTA
+ CAATTTAAATATTTGCTTATACAATCTTCCTGTTTTTGGGGCTTTTCTGATTATCAACCG
+ GGGTACATATGATTGACATGCTAGTTTTACGATTACCGTTCATCGATTCTCTTGTTTGCT
+ CCAGACTCTCAGGCAATGACCTGATAGCCTTTGTAGATCTCTCAAAAATAGCTACCCTCT
+ CCGGCATTAATTTATCAGCTAGAACGGTTGAATATCATATTGATGGTGATTTGACTGTCT
+ CCGGCCTTTCTCACCCTTTTGAATCTTTACCTACACATTACTCAGGCATTGCATTTAAAA
+ TATATGAGGGTTCTAAAAATTTTTATCCTTGCGTTGAAATAAAGGCTTCTCCCGCAAAAG
+ TATTACAGGGTCATAATGTTTTTGGTACAACCGATTTAGCTTTATGCTCTGAGGCTTTAT
+ TGCTTAATTTTGCTAATTCTTTGCCTTGCCTGTATGATTTATTGGATGTT
diff --git a/src/squirrel-1.4/seqs/m13mp7.seq b/src/squirrel-1.4/seqs/m13mp7.seq
new file mode 100644
index 0000000..75b10d8
--- /dev/null
+++ b/src/squirrel-1.4/seqs/m13mp7.seq
@@ -0,0 +1,126 @@
+<---M13MP7.001----->
+AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT                    
+ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT                    
+; by starting a record with ; comments can be inserted anywhere
+; but the ; must appear in column 1.
+; comment lines will be displayed when the file is read
+CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA                    
+GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA                    
+;  here is another comment
+TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG                    
+TTGGAGTTTGCTTCCGGTCTGGTTCGCTTTGAAGCTCGAATTAAAACGCGATATTTGAAG                    
+TCTTTCGGGCTTCCTCTTAATCTTTTTGATGCAATCCGCTTTGCTTCTGACTATAATAGT                    
+CAGGGTAAAGACCTGATTTTTGATTTATGGTCATTCTCGTTTTCTGAACTGTTTAAAGCA                    
+TTTGAGGGGGATTCAATGAATATTTATGACGATTCCGCAGTATTGGACGCTATCCAGTCT                    
+AAACATTTTACTATTACCCCCTCTGGCAAAACTTCTTTTGCAAAAGCCTCTCGCTATTTT                    
+GGTTTTTATCGTCGTCTGGTAAACGAGGGTTATGATAGTGTTGCTCTTACTATGCCTCGT                    
+AATTCCTTTTGGCGTTATGTATCTGCATTAGTTGAATGTGGTATTCCTAAATCTCAACTG                    
+ATGAATCTTTCTACCTGTAATAATGTTGTTCCGTTAGTTCGTTTTATTAACGTAGATTTT                    
+TCTTCCCAACGTCCTGACTGGTATAATGAGCCAGTTCTTAAAATCGCATAAGGTAATTCA                    
+CAATGATTAAAGTTGAAATTAAACCATCTCAAGCCCAATTTACTACTCGTTCTGGTGGTT                    
+CTCGTCAGGGCAAGCCTTATTCACTGAATGAGCAGCTTTGTTACGTTGATTTGGGTAATG                    
+AATATCCGGTTCTTGTCAAGATTACTCTTGATGAAGGTCAGCCAGCCTATGCGCCTGGTC                    
+TGTACACCGTTCATCTGTCCTCTTTCAAAGTTGGTCAGTTCGGTTCCCTTATGATTGACC                    
+GTCTGCGCCTCGTTCCGGCTAAGTAACATGGAGCAGGTCGCGGATTTCGACACAATTTAT                    
+CAGGCGATGATACAAATCTCCGTTGTACTTTGTTTCGCGCTTGGTATAATCGCTGGGGGT                    
+CAAAGATGAGTGTTTTAGTGTATTCTTTCGCCTCTTTCGTTTTAGGTTGGTGCCTTCGTA                    
+GTGGCATTACGTATTTTACCCGTTTAATGGAAACTTCCTCATGAAAAAGTCTTTAGTCCT                    
+CAAAGCCTCTGTAGCCGTTGCTACCCTCGTTCCGATGCTGTCTTTCGCTGCTGAGGGTGA                    
+CGATCCCGCAAAAGCGGCCTTTAACTCCCTGCAAGCCTCAGCGACCGAATATATCGGTTA                    
+TGCGTGGGCGATGGTTGTTGTCATTGTCGGCGCAACTATCGGTATCAAGCTGTTTAAGAA                    
+ATTCACCTCGAAAGCAAGCTGATAAACCGATACAATTAAAGGCTCCTTTTGGAGCCTTTT                    
+TTTTTGGAGATTTTCAACGTGAAAAAATTATTATTCGCAATTCCTTTAGTTGTTCCTTTC                    
+TATTCTCACTCCGCTGAAACTGTTGAAAGTTGTTTAGCAAAACCCCATACAGAAAATTCA                    
+TTTACTAACGTCTGGAAAGACGACAAAACTTTAGATCGTTACGCTAACTATGAGGGTTGT                    
+CTGTGGAATGCTACAGGCGTTGTAGTTTGTACTGGTGACGAAACTCAGTGTTACGGTACA                    
+TGGGTTCCTATTGGGCTTGCTATCCCTGAAAATGAGGGTGGTGGCTCTGAGGGTGGCGGT                    
+TCTGAGGGTGGCGGTTCTGAGGGTGGCGGTACTAAACCTCCTGAGTACGGTGATACACCT                    
+ATTCCGGGCTATACTTATATCAACCCTCTCGACGGCACTTATCCGCCTGGTACTGAGCAA                    
+AACCCCGCTAATCCTAATCCTTCTCTTGAGGAGTCTCAGCCTCTTAATACTTTCATGTTT                    
+CAGAATAATAGGTTCCGAAATAGGCAGGGGGCATTAACTGTTTATACGGGCACTGTTACT                    
+CAAGGCACTGACCCCGTTAAAACTTATTACCAGTACACTCCTGTATCATCAAAAGCCATG                    
+TATGACGCTTACTGGAACGGTAAATTCAGAGACTGCGCTTTCCATTCTGGCTTTAATGAA                    
+GATCCATTCGTTTGTGAATATCAAGGCCAATCGTCTGACCTGCCTCAACCTCCTGTCAAT                    
+GCTGGCGGCGGCTCTGGTGGTGGTTCTGGTGGCGGCTCTGAGGGTGGTGGCTCTGAGGGT                    
+GGCGGTTCTGAGGGTGGCGGCTCTGAGGGAGGCGGTTCCGGTGGTGGCTCTGGTTCCGGT                    
+GATTTTGATTATGAAAAGATGGCAAACGCTAATAAGGGGGCTATGACCGAAAATGCCGAT                    
+GAAAACGCGCTACAGTCTGACGCTAAAGGCAAACTTGATTCTGTCGCTACTGATTACGGT                    
+GCTGCTATCGATGGTTTCATTGGTGACGTTTCCGGCCTTGCTAATGGTAATGGTGCTACT                    
+GGTGATTTTGCTGGCTCTAATTCCCAAATGGCTCAAGTCGGTGACGGTGATAATTCACCT                    
+TTAATGAATAATTTCCGTCAATATTTACCTTCCCTCCCTCAATCGGTTGAATGTCGCCCT                    
+TTTGTCTTTAGCGCTGGTAAACCATATGAATTTTCTATTGATTGTGACAAAATAAACTTA                    
+TTCCGTGGTGTCTTTGCGTTTCTTTTATATGTTGCCACCTTTATGTATGTATTTTCTACG                    
+TTTGCTAACATACTGCGTAATAAGGAGTCTTAATCATGCCAGTTCTTTTGGGTATTCCGT                    
+TATTATTGCGTTTCCTCGGTTTCCTTCTGGTAACTTTGTTCGGCTATCTGCTTACTTTTC                    
+TTAAAAAGGGCTTCGGTAAGATAGCTATTGCTATTTCATTGTTTCTTGCTCTTATTATTG                    
+GGCTTAACTCAATTCTTGTGGGTTATCTCTCTGATATTAGCGCTCAATTACCCTCTGACT                    
+TTGTTCAGGGTGTTCAGTTAATTCTCCCGTCTAATGCGCTTCCCTGTTTTTATGTTATTC                    
+TCTCTGTAAAGGCTGCTATTTTCATTTTTGACGTTAAACAAAAAATCGTTTCTTATTTGG                    
+ATTGGGATAAATAATATGGCTGTTTATTTTGTAACTGGCAAATTAGGCTCTGGAAAGACG                    
+CTCGTTAGCGTTGGTAAGATTCAGGATAAAATTGTAGCTGGGTGCAAAATAGCAACTAAT                    
+CTTGATTTAAGGCTTCAAAACCTCCCGCAAGTCGGGAGGTTCGCTAAAACGCCTCGCGTT                    
+CTTAGAATACCGGATAAGCCTTCTATATCTGATTTGCTTGCTATTGGGCGCGGTAATGAT                    
+TCCTACGATGAAAATAAAAACGGCTTGCTTGTTCTCGATGAGTGCGGTACTTGGTTTAAT                    
+ACCCGTTCTTGGAATGATAAGGAAAGACAGCCGATTATTGATTGGTTTCTACATGCTCGT                    
+AAATTAGGATGGGATATTATTTTTCTTGTTCAGGACTTATCTATTGTTGATAAACAGGCG                    
+CGTTCTGCATTAGCTGAACATGTTGTTTATTGTCGTCGTCTGGACAGAATTACTTTACCT                    
+TTTGTCGGTACTTTATATTCTCTTATTACTGGCTCGAAAATGCCTCTGCCTAAATTACAT                    
+GTTGGCGTTGTTAAATATGGCGATTCTCAATTAAGCCCTACTGTTGAGCGTTGGCTTTAT                    
+ACTGGTAAGAATTTGTATAACGCATATGATACTAAACAGGCTTTTTCTAGTAATTATGAT                    
+TCCGGTGTTTATTCTTATTTAACGCCTTATTTATCACACGGTCGGTATTTCAAACCATTA                    
+AATTTAGGTCAGAAGATGAAATTAACTAAAATATATTTGAAAAAGTTTTCTCGCGTTCTT                    
+TGTCTTGCGATTGGATTTGCATCAGCATTTACATATAGTTATATAACCCAACCTAAGCCG                    
+GAGGTTAAAAAGGTAGTCTCTCAGACCTATGATTTTGATAAATTCACTATTGACTCTTCT                    
+CAGCGTCTTAATCTAAGCTATCGCTATGTTTTCAAGGATTCTAAGGGAAAATTAATTAAT                    
+AGCGACGATTTACAGAAGCAAGGTTATTCACTCACATATATTGATTTATGTACTGTTTCC                    
+ATTAAAAAAGGTAATTCAAATGAAATTGTTAAATGTAATTAATTTTGTTTTCTTGATGTT                    
+TGTTTCATCATCTTCTTTTGCTCAGGTAATTGAAATGAATAATTCGCCTCTGCGCGATTT                    
+TGTAACTTGGTATTCAAAGCAATCAGGCGAATCCGTTATTGTTTCTCCCGATGTAAAAGG                    
+TACTGTTACTGTATATTCATCTGACGTTAAACCTGAAAATCTACGCAATTTCTTTATTTC                    
+TGTTTTACGTGCTAATAATTTTGATATGGTTGGTTCAATTCCTTCCATAATTCAGAAGTA                    
+TAATCCAAACAATCAGGATTATATTGATGAATTGCCATCATCTGATAATCAGGAATATGA                    
+TGATAATTCCGCTCCTTCTGGTGGTTTCTTTGTTCCGCAAAATGATAATGTTACTCAAAC                    
+TTTTAAAATTAATAACGTTCGGGCAAAGGATTTAATACGAGTTGTCGAATTGTTTGTAAA                    
+GTCTAATACTTCTAAATCCTCAAATGTATTATCTATTGACGGCTCTAATCTATTAGTTGT                    
+TAGTGCACCTAAAGATATTTTAGATAACCTTCCTCAATTCCTTTCTACTGTTGATTTGCC                    
+AACTGACCAGATATTGATTGAGGGTTTGATATTTGAGGTTCAGCAAGGTGATGCTTTAGA                    
+TTTTTCATTTGCTGCTGGCTCTCAGCGTGGCACTGTTGCAGGCGGTGTTAATACTGACCG                    
+CCTCACCTCTGTTTTATCTTCTGCTGGTGGTTCGTTCGGTATTTTTAATGGCGATGTTTT                    
+AGGGCTATCAGTTCGCGCATTAAAGACTAATAGCCATTCAAAAATATTGTCTGTGCCACG                    
+TATTCTTACGCTTTCAGGTCAGAAGGGTTCTATCTCTGTTGGCCAGAATGTCCCTTTTAT                    
+TACTGGTCGTGTGACTGGTGAATCTGCCAATGTAAATAATCCATTTCAGACGATTGAGCG                    
+TCAAAATGTAGGTATTTCCATGAGCGTTTTTCCTGTTGCAATGGCTGGCGGTAATATTGT                    
+TCTGGATATTACCAGCAAGGCCGATAGTTTGAGTTCTTCTACTCAGGCAAGTGATGTTAT                    
+TACTAATCAAAGAAGTATTGCTACAACGGTTAATTTGCGTGATGGACAGACTCTTTTACT                    
+CGGTGGCCTCACTGATTATAAAAACACTTCTCAAGATTCTGGCGTACCGTTCCTGTCTAA                    
+AATCCCTTTAATCGGCCTCCTGTTTAGCTCCCGCTCTGATTCCAACGAGGAAAGCACGTT                    
+ATACGTGCTCGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGG                    
+GTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTT                    
+TCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATC                    
+GGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTG                    
+ATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGA                    
+CGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTTGAACAACACTCAACC                    
+CTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGAACCACCATCAA                    
+ACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGG                    
+CCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCGCTGGTGAAAAGAAAAACCACCCT                    
+GGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATCCAGCTGGC                    
+ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACC                    
+TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA                    
+TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCCCCG                    
+GATCCGTCGACCTGCAGGTCGACGGATCCGGGGAATTCACTGGCCGTCGTTTTACAACGT                    
+CGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCCTTC                    
+GCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGTAGC                    
+CTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGG                    
+CTGGAGTGCGATCTTCCTGAGGCCGA-AC-GTCGTCGTCCCCTCAAACTGGCAGATGCAC                    
+GGTTACGATGCGCCCATCTACACCAACGTAACCTATCCCATTACGGTCAATCCGCCGTTT                    
+GTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGG                    
+CTACAGGAAGGCCAGACGCGAATTATTTTTGATGGCGTTCCTATTGGTTAAAAAATGAGC                    
+TGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTAAATAT                    
+TTGCTTATACAATCTTCCTGTTTTTGGGGCTTTTCTGATTATCAACCGGGGTACATATGA                    
+TTGACATGCTAGTTTTACGATTACCGTTCATCGATTCTCTTGTTTGCTCCAGACTCTCAG                    
+GCAATGACCTGATAGCCTTTGTAGATCTCTCAAAAATAGCTACCCTCTCCGGCATGAATT                    
+TATCAGCTAGAACGGTTGAATATCATATTGATGGTGATTTGACTGTCTCCGGCCTTTCTC                    
+ACCCTTTTGAATCTTTACCTACACATTACTCAGGCATTGCATTTAAAATATATGAGGGTT                    
+CTAAAAATTTTTATCCTTGCGTTGAAATAAAGGCTTCTCCCGCAAAAGTATTACAGGGTC                    
+ATAATGTTTTTGGTACAACCGATTTAGCTTTATGCTCTGAGGCTTTATTGCTTAATTTTG                    
+CTAATTCTTTGCCTTGCCTGTATGATTTATTGGATGTT                                          
diff --git a/src/squirrel-1.4/seqs/pYAC4.seq b/src/squirrel-1.4/seqs/pYAC4.seq
new file mode 100644
index 0000000..5cc7e6e
--- /dev/null
+++ b/src/squirrel-1.4/seqs/pYAC4.seq
@@ -0,0 +1,225 @@
+CTGCCTCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGC
+TCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAA
+GCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCAT
+GACCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGC
+ATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGC
+ACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCG
+CTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC
+TCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAG
+GAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG
+GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCA
+CAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAA
+GATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCG
+ACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT
+GGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCG
+TTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC
+TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGA
+CTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGT
+ATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTAC
+ACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTT
+CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTA
+GCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA
+TCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAA
+CGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCT
+TCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGT
+ATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC
+ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCC
+CCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGT
+GCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGC
+AATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTT
+TATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGT
+AGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCAT
+CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCC
+AACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTT
+AGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT
+ATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCAT
+CCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGA
+GAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGA
+TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAAC
+GTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGT
+TCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTT
+CACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAA
+AGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTT
+CAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACAT
+ATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTC
+CCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTA
+ACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTAAT
+TCGGTCGAAAAAAGAAAAGGAGAGGGCCAAGAGGGAGGGCATTGGTGACT
+ATTGAGCACGTGAGTATACGTGATTAAGCACACAAAGGCAGCTTGGAGTA
+TGTCTGTTATTAATTTCACAGGTAGTTCTGGTCCATTGGTGAAAGTTTGC
+GGCTTGCAGAGCACAGAGGCCGCAGAATGTGCTCTAGATTCCGATGCTGA
+CTTGCTGGGTATTATATGTGTGCCCAATAGAAAGAGAACAATTGACCCGG
+TTATTGCAAGGAAAATTTCAAGTCTTGTAAAAGCATATAAAAATAGTTCA
+GGCACTCCGAAATACTTGGTTGGCGTGTTTCGTAATCAACCTAAGGAGGA
+TGTTTTGGCTCTGGTCAATGATTACGGCATTGATATCGTCCAACTGCATG
+GAGATGAGTCGTGGCAAGAATACCAAGAGTTCCTCGGTTTGCCAGTTATT
+AAAAGACTCGTATTTCCAAAAGACTGCAACATACTACTCAGTGCAGCTTC
+ACAGAAACCTCATTCGTTTATTCCCTTGTTTGATTCAGAAGCAGGTGGGA
+CAGGTGAACTTTTGGATTGGAACTCGATTTCTGACTGGGTTGGAAGGCAA
+GAGAGCCCCGAAAGCTTACATTTTATGTTAGCTGGTGGACTGACGCCAGA
+AAATGTTGGTGATGCGCTTAGATTAAATGGCGTTATTGGTGTTGATGTAA
+GCGGAGGTGTGGAGACAAATGGTGTAAAAGACTCTAACAAAATAGCAAAT
+TTCGTCAAAAATGCTAAGAAATAGGTTATTACTGAGTAGTATTTATTTAA
+GTATTGTTTGTGCACTTGCCTGCAGGCCTTTTGAAAAGCAAGCATAAAAG
+ATCTAAACATAAAATCTGTAAAATAACAAGATGTAAAGATAATGCTAAAT
+CATTTGGCTTTTTGATTGATTGTACAGGAAAATATACATCGCAGGGGGTT
+GACTTTTACCATTTCACCGCAATGGAATCAAACTTGTTGAAGAGAATGTT
+CACAGGCGCATACGCTACAATGACCCGATTCTTGCTAGCCTTTTCTCGGT
+CTTGCAAACAACCGCCGGCAGCTTAGTATATAAATACACATGTACATACC
+TCTCTCCGTATCCTCGTAATCATTTTCTTGTATTTATCGTCTTTTCGCTG
+TAAAAACTTTATCACACTTATCTCAAATACACTTATTAACCGCTTTTACT
+ATTATCTTCTACGCTGACAGTAATATCAAACAGTGACACATATTAAACAC
+AGTGGTTTCTTTGCATAAACACCATCAGCCTCAAGTCGTCAAGTAAAGAT
+TTCGTGTTCATGCAGATAGATAACAATCTATATGTTGATAATTAGCGTTG
+CCTCATCAATGCGAGATCCGTTTAACCGGACCCTAGTGCACTTACCCCAC
+GTTCGGTCCACTGTGTGCCGAACATGCTCCTTCACTATTTTAACATGTGC
+TAAATCCTCTTTATATGATCTGCCGATAGATAGTTCTAAGTCATTGAGGT
+TCATCAACAATTGGATTTTCTGTTTACTCGACTTCAGGTAAATGAAATGA
+GATGATACTTGCTTATCTCATAGTTAACTCTAAGAGGTGATACTTATTTA
+CTGTAAAACTGTGACGATAAAACCGGAAGGAAGAATAAGAAAACTCGAAC
+TGATCTATAATGCCTATTTTCTGTAAAGAGTTTAAGCTATGAAAGCCTCG
+GCATTTTGGCCGATCCTAGGTAGTGCTTTTTTTCCAAGGACAAAACAGTT
+TCTTTTTCTTGAGCAGGTTTTATGTTTCGGTAATCATAAACAATAAATAA
+ATTATTTCATTTATGTTTAAAAATAAAAAATAAAAAAGTATTTTAAATTT
+TTAAAAAAGTTGATTATAAGCATGTGACCTTTTGCAAGCAATTAAATTTT
+GCAATTTGTGATTTTAGGCAAAAGTTACAATTTCTGGCTCGTGTAATATA
+TGTATGCTAAAGTGAACTTTTACAAAGTCGATATGGACTTAGTCAAAAGA
+AATTTTCTTAAAAATATATAGCACTAGCCAATTTAGCACTTCTTTATGAG
+ATATATTATAGACTTTATTAAGCCAGATTTGTGTATTATATGTATTTACC
+CGGCGAATCATGGACATACATTCTGAAATAGGTAATATTCTCTATGGTGA
+GACAGCATAGATAACCTAGGATACAAGTTAAAAGCTAGTACTGTTTTGCA
+GTAATTTTTTTCTTTTTTATAAGAATGTTACCACCTAAATAAGTTATAAA
+GTCAATAGTTAAGTTTGATATTTGATTGTAAAATACCGTAATATATTTGC
+ATGATCAAAAGGCTCAATGTTGACTAGCCAGCATGTCAACCACTATATTG
+ATCACCGATATATGGACTTCCACACCAACTAGTAATATGACAATAAATTC
+AAGATATTCTTCATGAGAATGGCCCAGCTGTTCCTTCAACATTTGAGAGA
+TCTGAAATGACCTCGAGTCTTTGATTTAAAAGTGAAACGCGCTGATTGAT
+CTCTAAGTAACCTCTTGTTGCTTGGTATATGGGTTCTAACTGCGGTTCTG
+ACCACATAATTTCTGGAGAGTCTAAAACTGATCCATGTAGATTGATATTT
+ATCCTCAAAATGAATAGCTCCCCTATACTTTTCATTATATCTTCTTTGCT
+CATAGAAACTTTACCGCTGTATGCAATTTCTTGTGGTATATCCTGAGTAT
+CTTCAATAGTATTATCCACAAGCTCTTCGAAAAGTGAAATTTTTACGCTT
+TGAGCAATTGCATGAGCTCATGTTTGACAGCTTATCATCGATAAGCTTTA
+ATGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTA
+TGAAATCTAACAATGCGCTCATCGTCATCCTCGGCACCGTCACCCTGGAT
+GCTGTAGGCATAGGCTTGGTTATGCCGGTACTGCCGGGCCTCTTGCGGGA
+TATCGTCCATTCCGACAGCATCGCCAGTCACTATGGCGTGCTGCTAGCGC
+TATATGCGTTGATGCAATTTCTATGCGCACCCGTTCTCGGAGCACTGTCC
+GACCGCTTTGGCCGCCGCCCAGTCCTGCTCGCTTCGCTACTTGGAGCCAC
+TATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCAATTCTT
+TAGTATAAATTTCACTCTGAACCATCTTGGAAGGACCGGATAATTATTTG
+AAATCTCTTTTTCAATTGTATATGTGTTATGTAGTATACTTTGTCTTCAA
+CAATTAAATACTCTCGGTAGCCAAGTTGGTTTAAGGCGCAAGACTGTATT
+TTATCACTACGGAATTCCGTAATCTTGAGATCGGGCGTTCGACTCGCCCC
+CGGGAGATTTTTTTGTTTTTTTATGTCTCCATTCACTTCCCAGACTTGCA
+AGTTGAAATATTTCTTTCAAGCTCTACGCCGGACGCATCGTGGCCGGCAT
+CACCGGCGCCACAGGTGCGGTTGCTGGCGCCTATATCGCCGACATCACCG
+ATGGGGAAGATCGGGCTCGCCACTTCGGGCTCATGAGCGCTTGTTTCGGC
+GTGGGTATGGTGGCAGGCCCCGTGGCCGGGGGACTGTTGGGCGCCATCTC
+CTTGCATGCACCATTCCTTGCGGCGGCGGTGCTCAACGGCCTCAACCTAC
+TACTGGGCTGCTTCCTAATGCAGGAGTCGCATAAGGGAGAGCGTCGACCG
+ATGCCCTTGAGAGCCTTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGG
+CATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGCAACTCG
+TAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGCTTT
+CGCTGGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTT
+GCACGCCCTCGCTCAAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCG
+GCGAGAAGCAGGCCATTATCGCCGGCATGGCGGCCGACGCGCTGGGCTAC
+GTCTTGCTGGCGTTCGCGACGCGAGGCTGGATGGCCTTCCCCATTATGAT
+TCTTCTCGCTTCCGGCGGCATCGGGATGCCCGCGTTGCAGGCCATGCTGT
+CCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGATCGCTCGCG
+GCTCTTACCAGCCTAACTTCGATCACTGGACCGCTGATCGTCACGGCGAT
+TTATGCCGCCTCGGCGAGCACATGGAACGGGTTGGCATGGATTGTAGGCG
+CCGCCCTATACCTTGTCTGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGC
+CGGGCCACCTCGACCTGAATGGAAGCCGGCGGCACCTCGCTAACGGATTC
+ACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGAGAACTGTGAATGC
+GCAAACCAACCCTTGGCAGAACATATCCATCGCGTCCGCCATCTCCAGCA
+GCCGCACGCGGCGCATCCCCCCCCCCCTTTCAATTCAATTCATCATTTTT
+TTTTTATTCTTTTTTTTGATTTCGGTTTCTTTGAAATTTTTTTGATTCGG
+TAATCTCCGAACAGAAGGAAGAACGAAGGAAGGAGCACAGACTTAGATTG
+GTATATATACGCATATGTAGTGTTGAAGAAACATGAAATTGCCCAGTATT
+CTTAACCCAACTGCACAGAACAAAAACCTGCAGGAAACGAAGATAAATCA
+TGTCGAAAGCTACATATAAGGAACGTGCTGCTACTCATCCTAGTCCTGTT
+GCTGCCAAGCTATTTAATATCATGCACGAAAAGCAAACAAACTTGTGTGC
+TTCATTGGATGTTCGTACCACCAAGGAATTACTGGAGTTAGTTGAAGCAT
+TAGGTCCCAAAATTTGTTTACTAAAAACACATGTGGATATCTTGACTGAT
+TTTTCCATGGAGGGCACAGTTAAGCCGCTAAAGGCATTATCCGCCAAGTA
+CAATTTTTTACTCTTCGAAGACAGAAAATTTGCTGACATTGGTAATACAG
+TCAAATTGCAGTACTCTGCGGGTGTATACAGAATAGCAGAATGGGCAGAC
+ATTACGAATGCACACGGTGTGGTGGGCCCAGGTATTGTTAGCGGTTTGAA
+GCAGGCGGCAGAAGAAGTAACAAAGGAACCTAGAGGCCTTTTGATGTTAG
+CAGAATTGTCATGCAAGGGCTCCCTATCTACTGGAGAATATACTAAGGGT
+ACTGTTGACATTGCGAAGAGCGACAAAGATTTTGTTATCGGCTTTATTGC
+TCAAAGAGACATGGGTGGAAGAGATGAAGGTTACGATTGGTTGATTATGA
+CACCCGGTGTGGGTTTAGATGACAAGGGAGACGCATTGGGTCAACAGTAT
+AGAACCGTGGATGATGTGGTCTCTACAGGATCTGACATTATTATTGTTGG
+AAGAGGACTATTTGCAAAGGGAAGGGATGCTAAGGTAGAGGGTGAACGTT
+ACAGAAAAGCAGGCTGGGAAGCATATTTGAGAAGATGCGGCCAGCAAAAC
+TAAAAAACTGTATTATAAGTAAATGCATGTATACTAAACTCACAAATTAG
+AGCTTCAATTTAATTATATCAGTTATTACGAATCTCGGTCGTAATGATTT
+TTATAATGACGAAAAAAAAAAAATTGGAAAGAAAAGGGGGGGGGGGCAGC
+GTTGGGTCCTGGCCACGGGTGCGCATGATCGTGCTCCTGTCGTTGAGGAC
+CCGGCTAGGCTGGCGGGGTTGCCTTACTGGTTAGCAGAATGAATCACCGA
+TACGCGAGCGAACGTGAAGCGACTGCTGCTGCAAAACGTCTGCGACCTGA
+GCAACAACATGAATGGTCTTCGGTTTCCGTGTTTCGTAAAGTCTGGAAAC
+GCGGAAGTCAGCGCCCTGCACCATTATGTTCCGGATCTGCATCGCAGGAT
+GCTGCTGGCTACCCTGTGGAACACCTACATCTGTATTAACGAAGCGCTGG
+CATTGACCCTGAGTGATTTTTCTCTGGTCCCGCCGCATCCATACCGCCAG
+TTGTTTACCCTCACAACGTTCCAGTAACCGGGCATGTTCATCATCAGTAA
+CCCGTATCGTGAGCATCCTCTCTCGTTTCATCGGTATCATTACCCCCATG
+AACAGAAATTCCCCCTTACACGGAGGCATCAAGTGACCAAACAGGAAAAA
+ACCGCCCTTAACATGGCCCGCTTTATCAGAAGCCAGACATTAACGCTTCT
+GGAGAAACTCAACGAGCTGGACGCGGATGAACAGGCAGACATCTGTGAAT
+CGCTTCACGACCACGCTGATGAGCTTTACCGCAGCCCTCGAGGGCAAGCT
+TGCATTTTTAGATAAAATTTATTAATCATCATTAATTTCTTGAAAAACAT
+TTTATTTATTGATCTTTTATAACAAAAAACCCTTCTAAAAGTTTATTTTT
+GAATGAAAAACTTATAAAAATTTATGAAAACTACAAAAAATAAAATTTTT
+AATTAAAATAATTTTGATAAGAACTTCAATCTTTGACTAGCTAGCTTAGT
+CATTTTTGAGATTTAATTAATATTTTATGTTTATTCATATATAAACTATT
+CAAAATATTATAGAATTTAAACATTTTAACATCTTAATCATTCATAAATA
+ACTAAAAATCAAAGTATTACATCAATAAATAACTTTTACTCAATGTCAAA
+GAAATTATTTGGGGTTGGGGTTGGGGTTGGGGTTGGGGTTGGGGGATCCG
+CTGCACGGDCCTGTTCCCTAGCATGTACGTGAGCGTATTTCCTTTTAAAC
+CACGACGCTTTGTCTTCATTCAACGTTTCCCATTGTTTTTTTCTACTATT
+GCTTTGCTGTGGGAAAAACTTATCGAAAGATGACGACTTTTTCTTAATTC
+TCGTTTTAAGAGCTTGGTGAGCGCTAGGAGTCACTGCCAGGTATCGTTTG
+AACACGGCATTAGTCAGGGAAGTCATAACACAGTCCTTTCCCGCAATTTT
+CTTTTTCTATTACTCTTGGCCTCCTCTAGTACACTCTATATTTTTTTATG
+CCTCGGTAATGATTTTCATTTTTTTTTTTCCACCTAGCGGATGACTCTTT
+TTTTTTCTTAGCGATTGGCATTATCACATAATGAATTATACATTATATAA
+AGTAATGTGATTTCTTCGAAGAATATACTAAAAAATGAGCAGGCAAGATA
+AACGAAGGCAAAGATGACAGAGCAGAAAGCCCTAGTAAAGCGTATTACAA
+ATGAAACCAAGATTCAGATTGCGATCTCTTTAAAGGGTGGTCCCCTAGCG
+ATAGAGCACTCGATCTTCCCAGAAAAAGAGGCAGAAGCAGTAGCAGAACA
+GGCCACACAATCGCAAGTGATTAACGTCCACACAGTGATAGGGTTTCTGG
+ACCATATGATACATGCTCTGGCCAAGCATTCCGGCTGGTCGCTAATCGTT
+GAGTGCATTGGTGACTTACACATAGACGACCATCACACCACTGAAGACTG
+CGGGATTGCTCTCGGTCAAGCTTTTAAAGAGGCCCTACTGGCGCGTGGAG
+TAAAAAGGTTTGGATCAGGATTTGCGCCTTTGGATGAGGCACTTTCCAGA
+GCGGTGGTAGATCTTTCGAACAGGCCGTACGCAGTTGTCGAACTTGTTTT
+GCAAAGGGAGAAAGTAGGAGATCTCTCTTGCGAGATGATCCCGCATTGTC
+TTGAAAGCTTTGCAGAGGCTAGCAGAATTACCCTCCACGTTGATTGTCTG
+CGAGGCAAGAATGATCATCACCGTAGTGAGAGTGCGTTCAAGGCTCTTGC
+GGTTGCCATAAGAGAAGCCACCTCGCCCAATGGTACCAACGATGTTCCCT
+CCACCAAAGGTGTTCTTATGTAGTGACACCGATTATTTAAAGCTGCAGCA
+TACGATATATATACATGTGTATATATGTATACCTATGAATGTCAGTAAGT
+ATGTATACGAACAGTATGATACTGAAGATGACAAGGTAATGCATCATTCT
+ATACGTGTCATTCTGAACGAGGCGCGCTTTCCTTTTTTCTTTTTGCTTTT
+TCTTTTTTTTTCTCTTGAACTCGAGAAAAAAAATATAAAAGAGATGGAGG
+AACGGGAAAAAGTTAGTTGTGGTGATAGGTGGCAAGTGGTATTCCGTAAG
+AACAACAAGAAAAGCATTTCATATTATGGCTGAACTGAGCGAACAAGTGC
+AAAATTTAAGCATCAACGACAACAACGAGAATGTGTATGTTCCTCCTCAC
+TTAAGAGGAAAACCAAGAAGTGCCAGAAATAACATGAGCAACTACAATAA
+CAACAACGGCGGCTACAACGGTGGCCGTGGCGGTGGCAGCTTCTTTAGCA
+ACAACCGTCGTGGTGGTTACGGCAACGGTGGTTTCTTCGGTGGAAACAAC
+GGTGGCAGCAGATCTAACGGCCGTTCTGGTGGTAGATGGATATGGCAAAC
+ATGTCCCAGTCCCAAGAAACGAAAAGGCCGAGATCGCCATATTTGGTGTC
+CCCGAGGATCCCCCCAACCCCAACCCCAACCCCAACCCCAACCCCAAATA
+ATTTCTTTGACATTGAGTAAAAGTTATTTATTGATGTAATACTTTGATTT
+TTAGTTATTTATGAATGATTAAGATGTTAAAATGTTTAAATTCTATAATA
+TTTTGAATAGTTTATATATGAATAAACATAAAATATTAATTAAATCTCAA
+AAATGACTAAGCTAGCTAGTCAAAGATTGAAGTTCTTATCAAAATTATTT
+TAATTAAAAATTTTATTTTTTGTAGTTTTCATAAATTTTTATAAGTTTTT
+CATTCAAAAATAAACTTTTAGAAGGGTTTTTTGTTATAAAAGATCAATAA
+ATAAAATGTTTTTCAAGAAATTAATGATGATTAATAAATTTTATCTAAAA
+ATGCAAGCTTGCCCTCGAGGG
+
diff --git a/src/squirrel-1.4/seqs/pjb8.vep b/src/squirrel-1.4/seqs/pjb8.vep
new file mode 100644
index 0000000..ddcad83
--- /dev/null
+++ b/src/squirrel-1.4/seqs/pjb8.vep
@@ -0,0 +1,93 @@
+; pjb8 sequence from karens r107 database 4-10-91
+ GAATTCTCATGTTTGACAGCTTATCATCGATAAGCTTTAA
+ TGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTATGAAATCTAAC
+ AATGCGCTCATCGTCATCCTCGGCACCGTCACCCT-GGATGCTGTAGGCATAGGCTTGGT
+ TATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGCATCGCCAGTCA
+ CTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGCACCCGTTCTCGG
+ AGCACTGTCCGACCGCTTTGGCCGCCGCC-AGTCCTGCTCGC-TTCGCTACTTGGAGCCA
+ CTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCTGCCTCGCTGGCCTGC
+ CGCAGTTCTTCAACCTCCCGGCGCA-GTTTTTCGTTTCTCAATTTCAGCATCC-TTTCGG
+ CATACCATTTTATGACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTC
+ GCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTAC*GGAT-ACTCGCACCGAA
+ AATGTCAGCC-AGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAG
+ AAAGGA-ACGACAGAGGCCAAAAAGCTC--TTTCAGCACC-TGTCGTTTCCTTTTC--TT
+ TTCAGAGGGTATTTTAAATAAAAACATTAAGT*TATGACGAAGAAGAACGGAAACGCCTT
+ AA-ACCGGAAAATTTTCATAAATAGCGAAA-CCCGCGAGTCGCCGCCCCGTAACAAGGCG
+ GATCC-*GAGAAGGACCCGCAAATGATAATAATTATCAATTGCATACTATCGACGGCACT
+ GCTGCCAGATAACACCACCGGGGAAACATTCCATCATGATGGCCGTGCGGACATAGGAAG
+ CCAGTTCATCCATCGCTTTCTTGTCT-CTGCCATTTGCTTTGTGACATCCAGCGCCGCAC
+ ATTCAGCAGCGTTTTTCAGCGCGTTTTCGATCAACGTTTCAA*TGTTGGTATCAACA-CC
+ AGG-TTTAACTTT--*-----TTATCGGCACTGACGGTTACCTTGTTCTGCG**CTGGCT
+ CATCACGCTGGATACCAAGGCTGATG-TT*GTA*GATATTGGTCAC*CGGCTGAGGTGT*
+ TTCGATTGCCGCTGCGTGGATAGCACCATTTGCGATAG----CGTCCTTGATGAATGACA
+ CTCCATTGCGAATAAGTTCGAAGGAGACGGTGTCACGAATGCGCTGGTCCAGCTCGTCGA
+ TTGCCTTTTGTGCAGCAGAGGTATCAATCT-CAAC-CCAAGC-TCATCGAAGCGCAATAT
+ TGCTGCTCACCAAAACGCGTATTGACCAGGTGTTCAACGGCAAATTTCTGCCCTTCTGAT
+ GTCAGAAAGGTAAAGTGATTTTCTTTCTGGTATTCAGTTGCTGT-GTGTCTGGTTTCAGC
+ AAA*--CCAAG-TCGCGCAATTCGGCTGTGCAGTATTTAGAAGGCAGATCACCAGACAGC
+ AACGGC-CACGGAAAAACAGCGCATACAGAACATCCGTCGCCGCGCCGGACAACGTGATA
+ ATTTTATGACCCATGATTTATTTCCTTTTAGACGTGAGCCTGTCGCACAGCAAAGCCGCC
+ GAAAGTTAACGGTTTGCCCAGGCTCA-CAACTGAAAGACTTTCT-ACGGT-GTGCGCGTG
+ CGATGCG--TAGAAGACTGATTTATCAACCTGTCTTTATATCAGGATTCATTACCTGACT
+ ATTTGTGGGTAAAGTTCGTAGTGCGCTGATCGTGCAAAATGATTTTAGTTGGGAACAGTT
+ CGCAACTCTGTCCCATAAAAATCAGCATATTCCCATCTATCCCATATCCAGCGCATTGAC
+ CATCG*GGATACTGAAGGGAGATTCCATCATCTCTTAGAAAGATCACCATCTCTTTTGTT
+ TCAATTTGCATATAGCTACCTGGAGGATTTATGAATACAAGGATTTTCATG-ACTATTAC
+ CATGAGATT-GATTTTCCATCTTTATTCGCGAGA-CAGT-GGAAAGCGATGACGATG-TG
+ GGTACTACATTGCGCATTCACCTACT-TGTGAGCGCATG*GTCGAAGCATGG-ATATG--
+ CATGCTGTGACTGCCAAGATCCTCTAC-CCGGACGCATCGTGGCCA-TTCACCGGGCCAC
+ AGGTGCGGTTGCTGGCGCCTATATCGCCGACATCACCGATGGGGAAGATCGGGCTCGCCA
+ CTTCGGGCTCATGAGCGCTTGTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCG-GGG-
+ ACTGTTGGGCGCCATCTCCTTGCATGCACCATTCCTTGCGGCGGCGGGTGC-TCAACGGC
+ CTCAACCTACTACTGG-TGCTTCCTAATGCAGGAGTCGCATAAGGGAGAGCGTCGACCGA
+ TGCCCTTGAGAGCCTTCAACCCAGTCAGCCTTCT-TCCGGTGGGGCGCGGGGG-CAATGA
+ CTATCGTCGCCGCACTTATGACTGTC--TCTTTATCATGCAACTCGTAGGACA-G---CC
+ GGCAGCGCTCTGGGTCATTTTCGGCGAG----*****CC--T-T-CTGGAGCGCGACGAT
+ GATC-GGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCTCAAGCCTTCGT-A
+ CTG-TCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATTA-TCGCGGCATGG*CGGCC
+ GACGCGCT-GGGCTACGTCTTGCTG-CGTTC-CGA--CGAGGC-TGGATGGCCTTCCCCA
+ TTATGATTCTTCT-CGCTTC*-----GGCATCGGGAT---CCGCGTTGCAGGCCATGCTG
+ TCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGA*TCGCTCGCGGCTCTTAC
+ CAGCCTAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTT*ATGCC-CC*TC-G--
+ -GCA-CATGGAACGGGTTGGCATGGATTGTAG-CGCCG-CCTATACCTTGTCTGCCT---
+ CCCG--TTGC-TCGCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGA---CGGC--C
+ ACCTCGCTAACGGATTCACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGAGAACTG
+ TGAATGCGCAAACCAACCCTTGGCAGAACATATCCATC-CGTCCGCCATCTC---AGCA-
+ -CGCACGCG-------*CTC*GGGCAGCGTTGG*GTCCTGGCCACGGGTGCGCATGATCG
+ TGCTCCT*GTCGTTGAGGACCCGGCTAGGCTGGCGGGGTTGCCTTACTGGTTAGCAGAAT
+ GAATCACCGA-TACGCGAGCGAACGTGAAGCGACT-CTGCTGCAA-AAC-CTGCGACCTG
+ AGCAACAACATGAATGGTCTTCGGTTTCCGTGTTTCGTAAAGTCTGGAAACGCGGAAGTC
+ AGCGCTCTTCCGCTTCCTCGCTCA-ATG-ACCTGCGCTCGGTCGTTCGGCTGCGGCGAGC
+ GGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGG
+ AAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGGCCGTTGCTG
+ GCGTTTTTCCACTAGCCGCCCCCCT-ACGAGCATCACAAAAATCGACGCTCAAGTCAGAG
+ GT-GCGAAACCCGACAGGACTATAAAGATAC-AGGCGT-TC-CCCTGAAGCTCCCTCGTG
+ CGCTCTCCTGTTCCGA-CTCCTGCCGCTTACCGGATACCTGTCCGCCTTTTTCCTTCGGG
+ AAGCGTGGCGCTTTTCTTCAA-TACGTGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTC
+ GCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCTTGCGCCTTATCCGG
+ TAACTATCGTCTTGAGTTCCAACCCGGTAAGACACGACTTCATGCGCCA*CCTGGCAG-C
+ AGCCACTGGTA-ACAGGATTAGCAGAGCGAGGTATGTAGGC-GGTGCTACAGAGTTCTTG
+ AAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCCTCTGCTGA
+ AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA-AC-A-AACCGCTG
+ GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAG
+ AAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAG
+ GGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAAT
+ GAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCT
+ TAATCAGTGAGGCACCTATCTCAGCG-TCTGTCTATTTCGTTCATCCATAGTTGCCTGAC
+ TCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAA
+ TGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCG
+ GAAGGCCTGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATT
+ GTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTT-CC-
+ ATTGCT-CAGGCATC-GTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGG
+ TTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTA-TCC
+ TTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATG
+ GCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGT
+ GAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCG
+ GCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGA
+ AAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGAT*CCAGTTCGAT
+ GTAACCCACTCGTGCACCCAACTG-ATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTG
+ GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT
+ GTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTC
+ TCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCA
+ CATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCT
+ ATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTCAA
diff --git a/src/squirrel-1.4/seqs/puc18.vec b/src/squirrel-1.4/seqs/puc18.vec
new file mode 100644
index 0000000..d7f9a6c
--- /dev/null
+++ b/src/squirrel-1.4/seqs/puc18.vec
@@ -0,0 +1,47 @@
+<--puc.001--------->
+;puc 18 vector sequence from vecbase library
+ GCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCA
+ CGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCT
+ CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAAT
+ TGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCGAGCT
+ CGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTGGCACTGGCCGTCG
+ TTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCAC
+ ATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAAC
+ AGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGT
+ GCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGT
+ TAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCC
+ CGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTT
+ CACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGG
+ TTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGC
+ GCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGAC
+ AATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATT
+ TCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAG
+ AAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCG
+ AACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAA
+ TGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGC
+ AAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAG
+ TCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAA
+ CCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGC
+ TAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGG
+ AGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAA
+ CAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAA
+ TAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTG
+ GCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAG
+ CACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGG
+ CAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATT
+ GGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTT
+ AATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAAC
+ GTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAG
+ ATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGG
+ TGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCA
+ GAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGA
+ ACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCA
+ GTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGC
+ AGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACA
+ CCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAA
+ AGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTC
+ CAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGC
+ GTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGG
+ CCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTAT
+ CCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCA
+ GCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGA
diff --git a/src/squirrel-1.4/seqs/pwe15.seq b/src/squirrel-1.4/seqs/pwe15.seq
new file mode 100644
index 0000000..0677190
--- /dev/null
+++ b/src/squirrel-1.4/seqs/pwe15.seq
@@ -0,0 +1,138 @@
+;cosmid vector pWE15 from embl entry pwe15a
+ ctatagtgagtcgtattacccgggcccgccggcgccacgcgtcgacggccggccgggccg
+ cggccgcgaattcttgaagacgaaagggcctcgtgatacgcctatttttataggttaatg
+ tcatgataataatggtttcttagacgtcaggtggcacttttcggggaaatgtgcgcggaa
+ cccctatttgtttatttttctaaatacattcaaatatgtatccgctcatgagacaataac
+ cctgataaatgcttcaataatattgaaaaaggaagagtatgagtattcaacatttccgtg
+ tcgcccttattcccttttttgcggcattttgcttcctgtttttgctcacccagaaacgct
+ ggtgaaagtaaaagatgctgaagatcagttgggtgcacgagtgggttacatcgaactgga
+ tctcaacagcggtaagatccttgagagttttcgccccgaagaacgttttccaatgatgag
+ cacttttaaagttctgctatgtggcgcggtattatcccgtgttgacgccgggcaagagca
+ actcggtcgccgcatacactattctcagaatgacttggttgagtactcaccagtcacaga
+ aaagcatcttacggatggcatgacagtaagagaattatgcagtgctgccataaccatgag
+ tgataacactgcggccaacttacttctgacaacgatcggaggaccgaaggagctaaccgc
+ ttttttgcacaacatgggggatcatgtaactcgccttgatcgttgggaaccggagctgaa
+ tgaagccataccaaacgacgagcgtgacaccacgatgcctgcagcaatggcaacaacgtt
+ gcgcaaactattaactggcgaactacttactctagcttcccggcaacaattaatagactg
+ gatggaggcggataaagttgcaggaccacttctgcgctcggcccttccggctggctggtt
+ tattgctgataaatctggagccggtgagcgtgggtctcgcggtatcattgcagcactggg
+ gccagatggtaagccctcccgtatcgtagttatctacacgacggggagtcaggcaactat
+ ggatgaacgaaatagacagatcgctgagataggtgcctcactgattaagcattggtaact
+ gtcagaccaagtttactcatatatactttagattgatttaaaacttcatttttaatttaa
+ aaggatctaggtgaagatcctttttgataatctcatgaccaaaatcccttaacgtgagtt
+ ttcgttccactgagcgtcagaccccgtagaaaagatcaaaggatcttcttgagatccttt
+ ttttctgcgcgtaatctgctgcttgcaaacaaaaaaaccaccgctaccagcggtggtttg
+ tttgccggatcaagagctaccaactctttttccgaaggtaactggcttcagcagagcgca
+ gataccaaatactgtccttctagtgtagccgtagttaggccaccacttcaagaactctgt
+ agcaccgcctacatacctcgctctgctaatcctgttaccagtggctgctgccagtggcga
+ taagtcgtgtcttaccgggttggactcaagacgatagttaccggataaggcgcagcggtc
+ gggctgaacggggggttcgtgcacacagcccagcttggagcgaacgacctacaccgaact
+ gagatacctacagcgtgagctatgagaaagcgccacgcttccgaagggagaaaggcggac
+ aggtatccggtaagcggcagggtcggaacaggagagcgcacgagggagcttccaggggga
+ aacgcctggtatctttatagtcctgtcggggtttcgccacctctgacttgagcgtcgatt
+ tttgtgatgctcgtcaggggggcggagcctatggaaaaacgccagcaacgcggccttttt
+ acggttcctggccttttgctggccttttgctcacatgttctttcctgcgttatcccctga
+ ttctgtggataaccgtattaccgcctttgagtgagctgataccgctcgccgcagccgaac
+ gaccgagcgcagcgagtcagtgagcgaggaagcggaagagcgctgacttccgcgtttcca
+ gactttacgaaacacggaaaccgaagaccattcatgttgttgctcaggtcgcagacgttt
+ tgcagcagcagtcgcttcacgttcgctcgcgtatcggtgattcattctgctaaccagtaa
+ ggcaaccccgccagcctagccgggtcctcaacgacaggagcacgatcatgcgcacccgtc
+ agatccagacatgataagatacattgatgagtttggacaaaccacaactagaatgcagtg
+ aaaaaaatgctttatttgtgaaatttgtgatgctattgctttatttgtaaccattataag
+ ctgcaataaacaagttaacaacaacaattgcattcattttatgtttcaggttcaggggga
+ ggtgtgggaggttttttaaagcaagtaaaacctctacaaatgtggtatggctgattatga
+ tctctagtcaaggcactatacatcaaatattccttattaacccctttacaaattaaaaag
+ ctaaaggtacacaatttttgagcatagttattaatagcagacactctatgcctgtgtgga
+ gtaagaaaaaacagtatgttatgattataactgttatgcctacttataaaggttacagaa
+ tatttttccataattttcttgtatagcagtgcagctttttcctttgtggtgtaaatagca
+ aagcaagcaagagttctattactaaacacagcatgactcaaaaaacttagcaattctgaa
+ ggaaagtccttggggtcttctacctttctcttcttttttggaggagtagaatgttgagag
+ tcagcagtagcctcatcatcactagatggcatttcttctgagcaaaacaggttttcctca
+ ttaaaggcattccaccactgctcccattcatcagttccataggttggaatctaaaataca
+ caaacaattagaatcagtagtttaacacattatacacttaaaaattttatatttacctta
+ gagctttaaatctctgtaggtagtttgtccaattatgtcacaccacagaagtaaggttcc
+ ttcacaaagatccggaccaaagcggccatcgtgcctccccactcctgcagttcgggggca
+ tggatgcgcggatagccgctgctggtttcctggatgccgacggatttgcactgccggtag
+ aactcgcgaggtcgtccagcctcaggcagcagctgaaccaactcgcgaggggatcgagcc
+ cggggtgggcgaagaactccagcatgagatccccgcgctggaggatcatccagccggcgt
+ cccggaaaacgattccgaagcccaacctttcatagaaggcggcggtggaatcgaaatctc
+ gtgatggcaggttgggcgtcgcttggtcggtcatttcgaaccccagagtcccgctcagaa
+ gaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcgataccgta
+ aagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagc
+ caacgctatgtcctgatagcggtccgccacacccagccggccacagtcgatgaatccaga
+ aaagcggccattttccaccatgatattcggcaagcaggcatcgccatgggtcacgacgag
+ atcctcgccgtcgggatgcgcgccttgagcctggcgaacagttcggctggcgcgagcccc
+ tgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgct
+ cgctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgc
+ agccgccgcattgcatcagccatgatggatactttctcggcaggagcaaggtgagatgac
+ aggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgaca
+ acgtcgagcacagctgcgcaaggaacgcccgtcgtggccagccacgatagccgcgctgcc
+ tcgtcctgcagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgc
+ ccctgcgctgacagccggaacacggcggcatcagagcagccgattgtctgttgtgcccag
+ tcatagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgt
+ tcaatcatgcgaaacgatcctcatcctgtctcttgatcagatcttgatcccctgcgccat
+ cagatccttggcggcaagaaagccatccagtttactttgcagggcttcccaaccttacca
+ gagggcgccccagctggcaattccggttcgcttgctgtccataaaaccgcccagtctagc
+ tatcgccatgtaagcccactgcaagctacctgctttctctttgcgcttgcgttttccctt
+ gtccagatagcccagtagctgacattcatccggggtcagcaccgtttctgcggactggct
+ ttctacgtgttccgcttcctttagcagcccttgcgccctgagtgcttgcggcagcgtgaa
+ agctttttgcaaaagcctaggcctccaaaaaagcctcctcactacttctggaatagctca
+ gaggccgaggcggcctaaataaaaaaaattagtcagccatggggcggagaatgggcggaa
+ ctgggcggagttaggggcgggatgggcggagttaggggcgggactatggttgctgactaa
+ ttgagatgcatgctttgcatacttctgcctgctggggagcctggggactttccacacctg
+ gttgctgactaattgagatgcatgctttgcatacttctgcctgctggggagcctggggac
+ tttccacaccctaactgacacacattccacagccggatctgcaggacccaacgctgcccg
+ agatgcgccgcgtgcggctgctggagatggcggacgcgatggatatgttctgccaagggt
+ tggtttgcgcattcacagttctccgcaagaattgattggctccaattcttggagtggtga
+ atccgttagcgaggtgccgccggcttccattcaggtcgaggtggcccggctccatgcacc
+ gcgacgcaacgcggggaggcagacaaggtatagggcggcgcctacaatccatgccaaccc
+ gttccatgtgctcgccgaggcgcataaatcgccgtgacgatcagcggtccaatgatcgaa
+ gttaggctggtaagagccgcgagcgatccttgaagctgtccctgatggtcgtcatctacc
+ tgcctggacagcatggcctgcaacgcggcatcccgatgccgccggaagcgagaagaatca
+ taatggggaaggccatccagcctcgcgtcgcgaacgccagcaagacgtagcccagcgcgt
+ cgggccgccatgccggcgataatggcctgcttctcgccgaaacgtttggtggcgggacca
+ gtgacgaaggcttgagcgagggcgtgcaagattccgaataccgcaagcgacaggccgatc
+ atcgtcgcgctccagcgaaagcggtcctcgccgaaaatgacccagagcgctgccggcacc
+ tgtcctacgagttgcatgataaagaagacagtcataagtgcggcgacgatagtcatgccc
+ cgcgcccaccggaaggagctgactgggttgaaggctctcaagggcatcggtcgacgctct
+ cccttatgcgactcctgcattaggaagcagcccagtagtaggttgaggccgttgagcacc
+ gccgccgcaaggaatggtgcatgcaaggagatggcgcccaacagtcccccggccacgggc
+ ctgccaccatacccacgccgaaacaagcgctcatgagcccgaagtggcgagcccgatctt
+ ccccatcggtgatgtcggcgatataggcgccagcaaccgcacctgtggcgccggtgatgc
+ cggccacgatgcgtccggcgtagaggatcttggcagtcacagcatgcgcatatccatgct
+ tcgaccatgcgctcacaaagtaggtgaatgcgcaatgtagtacccacatcgtcatcgctt
+ tccactgctctcgcgaataaagatggaaaatcaatctcatggtaatagtccatgaaaatc
+ cttgtattcataaatcctccaggtagctatatgcaaattgaaacaaaagagatggtgatc
+ tttctaagagatgatggaatctcccttcagtatcccgatggtcaatgcgctggatatggg
+ atagatgggaatatgctgatttttatgggacagagttgcgaactgttcccaactaaaatc
+ attttgcacgatcagcgcactacgaactttacccacaaatagtcaggtaatgaatcctga
+ tataaagacaggttgataaatcagtcttctacgcgcatcgcacgcgcacaccgtagaaag
+ tctttcagttgtgagcctgggcaaaccgttaactttcggcggctttgctgtgcgacaggc
+ tcacgtctaaaaggaaataaatcatgggtcataaaattatcacgttgtccggcgcggcga
+ cggatgttctgtatgcgctgtttttccgtggcgcgttgctgtctggtgatctgccttcta
+ aatctggcacagccgaattgcgcgagcttggttttgctgaaaccagacacacagcaactg
+ aataccagaaagaaaatcactttacctttctgacatcagaagggcagaaatttgccgttg
+ aacacctggtcaatacgcgttttggtgagcagcaatattgcgcttcgatgacgcttggcg
+ ttgagattgatacctctgctgcacaaaaggcaatcgacgagctggaccagcgcattcgtg
+ acaccgtctccttcgaacttattcgcaatggagtgtcattcatcaaggacgccgctatcg
+ caaatggtgctatccacgcagcggcaatcgaaacacctcagccggtgaccaatatctaca
+ acatcagccttggtatccagcgtgatgagccagcgcagaacaaggtaaccgtcagtgccg
+ ataagttcaaagttaaacctggtgttgataccaacattgaaacgttgatcgaaaacgcgc
+ tgaaaaacgctgctgaatgtgcggcgctggatgtcacaaagcaaatggcagcagacaaga
+ aagcgatggatgaactggcttcctatgtccgcacggccatcatgatggaatgtttccccg
+ gtggtgttatctggcagcagtgccgtcgatagtatgcaattgataattattatcatttgc
+ gggtcctttccggcgatccgccttgttacggggcggcgacctcgcgggttttcgctattt
+ atgaaaattttccggtttaaggcgtttccgttcttcttcgtcataacttaatgtttttat
+ ttaaaataccctctgaaaagaaaggaaacgacaggtgctgaaagcgagctttttggcctc
+ tgtcgtttcctttctctgtttttgtccgtggaatgaacaatggaagtcaacaaaaagcag
+ ctggctgacattttcggtgcgagtatccgtaccattcagaactggcaggaacagggaatg
+ cccgttctgcgaggcggtggcaagggtaatgaggtgctttatgactctgccgccgtcata
+ aaatggtatgccgaaagggatgctgaaattgagaacgaaaagctgcgccgggaggttgaa
+ gaactgcggcaggccagcgaggcagatccacaggacgggtgtggtcgccatgatcgcgta
+ gtcgatagtggctccaagtagcgaagcgagcaggactgggcggcggcaaagcggtcggac
+ agtgctccgagaacgggtgcgcatagaaattgcatcaacgcatatagcgctagcagcacg
+ ccatagtgactggcgatgctgtcggaatggacgatatcccgcaagaggcccggcagtacc
+ ggcataaccaagcctatgcctacagcatccagggtgacggtgccgaggatgacgatgagc
+ gcattgttagatttcatacacggtgcctgactgcgttagcaatttaactgtgataaacta
+ ccgcattaaagcttatcgatgataagcggtcaaacatgagaattcgcggccgcggcctcg
+ agggccgatcgcgagctcagtactccgcggatttaggtgacactataggatcc
diff --git a/src/squirrel-1.4/seqs/sCos-1.seq b/src/squirrel-1.4/seqs/sCos-1.seq
new file mode 100644
index 0000000..87755e0
--- /dev/null
+++ b/src/squirrel-1.4/seqs/sCos-1.seq
@@ -0,0 +1,159 @@
+GAATTCGCGGCCGCAATTAACCCTCACTAAAGGATCCCTATAGTGAGTCG
+TATTATGCGGCCGCGAATTCTCATGTTTGACCGCTTATCATCGATAAGCT
+CTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAGAAA
+GGAAACGACAGAGGCCAAAAAGCTCGCTTTCAGCACCTGTCGTTTCCTTT
+CTTTTCAGAGGGTATTTTAAATAAAAACATTAAGTTATGACGAAGAAGAA
+CGGAAACGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGCGAG
+GTCGCCGCCCCGTAACAAGGCGGATCGCCGGAAAGGACCCGCAAATGATA
+ATAATTATCAATTGCATACTATCGACGGCACTGCTGCCAGATAACACCAC
+CGGGGAAACATTCCATCATGATGGCCGTGCGGACATAGGAAGCCAGTTCA
+TCCATCGCTTTCTTGTCTGCTGCCATTTGCTTTGTGACATCCAGCGCCGC
+ACATTCAGCAGCGTTTTTCAGCGCGTTTTCGATCAACGTTTCAATGTTGG
+TATCAACACCAGGTTTAACTTTGAACTTATCGGCACTGACGGTTACCTTG
+TTCTGCGCTGGCTCATCACGCAGGATACCAAGGCTGATGTTGTAGATATT
+GGTCACCGGCTGAGGGTTTTCGATTGCCGCTGCGTGGATAGCACCATTTG
+CGATCAGGCGTCCTTGATGAATGACACTCCATTGCGAATAAGTTCGAAGG
+AGACGGTGTCACGAATGCGCTGGTCCAGCTCGGTCGATTGCCTTTTGTGC
+AGCAGAGGTATCAATCTCAACGCCAAGGCTCATCGAAGCGCAATATTGCT
+GCTCACCAAAACGCGTATTGACCAGGTGTTCAACGGCAAATTTCTGCCCT
+TCTGATGTCAGAAAGGCAAAGTGATTTTCTTTCTGGTATTCAGTTGCTGT
+GTGTCGGTTTCAGCAAAACCAAGCTCGCGCAATTCGGCTGTGCAGATTTA
+GAAGGCAGATCACCAGACAGCAACGGCCAACGGAAAACAGCGCATACAGA
+ACATCCGTCGCCGCGCCGACAACGTGATAATTTTTATGACCCATGATTTA
+TTTCCTTTTAGACGTGAGCCTGTCGCACAGCAAAGCCGCCGAAAGTTCCT
+CGAAGCTAGCTTCAGACGTGTCTAGATACGTCTGCTTTTTGTTGACTTCC
+ATTGTTCATTCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAA
+AAGCTCGCTTTCAGCACCTGTCGTTTCCTTTCTTTTCAGAGGGTATTTTA
+AATAAAAACATTAAGTTATGACGAAGAAGAACGGAAACGCCTTAAACCGG
+AAAATTTTCATAAATAGCGAAAACCCGCGAGGTCGCCGCCCCGTAACAAG
+GCGGATCGCCGGAAAGGACCCGCAAATGATAATAATTATCAATTGCATAC
+TATCGACGGCACTGCTGCCAGATAACACCACCGGGGAAACATTCCATCAT
+GATGGCCGTGCGGACATAGGAAGCCAGTTCATCCATCGCTTTCTTGTCTG
+CTGCCATTTGCTTTGTGACATCCAGCGCCGCACATTCAGCAGCGTTTTTC
+AGCGCGTTTTCGATCAACGTTTCAATGTTGGTATCAACACCAGGTTTAAC
+TTTGAACTTATCGGCACTGACGGTTACCTTGTTCTGCGCTGGCTCATCAC
+GCAGGATACCAAGGCTGATGTTGTAGATATTGGTCACCGGCTGAGGGTTT
+TCGATTGCCGCTGCGTGGATAGCACCATTTGCGATCAGGCGTCCTTGATG
+AATGACACTCCATTGCGAATAAGTTCGAAGGAGACGGTGTCACGAATGCG
+CTGGTCCAGCTCGGTCGATTGCCTTTTGTGCAGCAGAGGTATCAATCTCA
+ACGCCAAGGCTCATCGAAGCGCAATATTGCTGCTCACCAAAACGCGTATT
+GACCAGGTGTTCAACGGCAAATTTCTGCCCTTCTGATGTCAGAAAGGCAA
+AGTGATTTTCTTTCTGGTATTCAGTTGCTGTGTGTCGGTTTCAGCAAAAC
+CAAGCTCGCGCAATTCGGCTGTGCAGATTTAGAAGGCAGATCACCAGACA
+GCAACGGCCAACGGAAAACAGCGCATACAGAACATCCGTCGCCGCGCCGA
+CAACGTGATAATTTTTATGACCCATGATTTATTTCCTTTTAGACGTGAGC
+CTGTCGCACAGCAAAGCCGCCGAAAGTTCCTCGACCGATGCCCTTGAGAG
+CCTTCAACCCAGTCAGCTCCTTCCGGTGGGCGCGGGGCATGACTATCGTC
+GCCGCACTTATGACTGTCTTCTTTATCATGCAACTCGTAGGACAGGTGCC
+GGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGCTTTCGCTGGAGCGCGA
+CGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCT
+CAAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGC
+CATTATCGCCGGCATGGCGGCCGACGCGCTGGGCTACGTCTTGCTGGCGT
+TCGCGACGCGAGGCTGGATGGCCTTCCCCATTATGATTCTTCTCGCTTCC
+GGCGGCATCGGGATGCCCGCGTTGCAGGCCATGCTGTCCAGGCAGGTAGA
+TGACGACCATCAGGGACAGCTTCAAGGATCGCTCGCGGCTCTTACCAGCC
+TAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTTATGCCGCCTCG
+GCGAGCACATGGAACGGGTTGGCATGGATTGTAGGCGCCGCCCTATACCT
+TGTCTGCCTCCCCGCGTTGCGTCGCGGTGCATGGAGCCGGGCCACCTCGA
+CCTGAATGGAAGCCGGCGGCACCTCGCTAACGGATTCACCACTCCAAGAA
+TTGGAGCCAATCAATTCTTGCGGAGAACTGTGAATGCGCAAACCAACCCT
+TGGCAGAACATATCCATCGCGTCCGCCATCTCCAGCAGCCGCACGCGGCG
+CATCTCGGGCAGCGTTGGGTCCTGCAGATCCGGCTGTGGAATGTGTGTCA
+GTTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAA
+GCATGCATCTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCC
+CCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCAT
+AGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCG
+CCCATTCTCCGCCCCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCC
+GAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTTTT
+TGGAGGCCTAGGCTTTTGCAAAAAGCTTCACGCTGCCGCAAGCACTCAGG
+GCGCAAGGGCTGCTAAAGGAAGCGGAACACGTAGAAAGCCAGTCCGCAGA
+AACGGTGCTGACCCCGGATGAATGTCAGCTACTGGGCTATCTGGACAAGG
+GAAAACGCAAGCGCAAAGAGAAAGCAGGTAGCTTGCAGTGGGCTTACATG
+GCGATAGCTAGACTGGGCGGTTTTATGGACAGCAAGCGAACCGGAATTGC
+CAGCTGGGGCGCCCTCTGGTAAGGTTGGGAAGCCCTGCAAAGTAAACTGG
+ATGGCTTTCTTGCCGCCAAGGATCTGATGGCGCAGGGGATCAAGATCTGA
+TCAAGAGACAGGATGAGGATCGTTTCGCATGATTGAACAAGATGGATTGC
+ACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCTATTCGGCTATGACTGG
+GCACAACAGACAATCGGCTGCTCTGATGCCGCCGTGTTCCGGCTGTCAGC
+GCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGCCCTGA
+ATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGC
+GTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTG
+GCTGCTATTGGGCGAAGTGCCGGGGCAGGATCTCCTGTCATCTCACCTTG
+CTCCTGCCGAGAAAGTATCCATCATGGCTGATGCAATGCGGCGGCTGCAT
+ACGCTTGATCCGGCTACCTGCCCATTCGACCACCAAGCGAAACATCGCAT
+CGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGATGATC
+TGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTC
+AAGGCGCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGC
+CTGCTTGCCGAATATCATGGTGGAAAATGGCCGCTTTTCTGGATTCATCG
+ACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAGGACATAGCGTTGGCT
+ACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATGGGCTGACCGCTTCCT
+CGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTTCTATC
+GCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGAAATGACCG
+ACCAAGCGACGCCCAACCTGCCATCACGAGATTTCGATTCCACCGCCGCC
+TTCTATGAAAGGTTGGGCTTCGGAATCGTTTTCCGGGACGCCGGCTGGAT
+GATCCTCCAGCGCGGGGATCTCATGCTGGAGTTCTTCGCCCACCCCGGGC
+TCGATCCCCTCGCGAGTTGGTTCAGCTGCTGCCTGAGGCTGGACGACCTC
+GCGGAGTTCTACCGGCAGTGCAAATCCGTCGGCATCCAGGAAACCAGCAG
+CGGCTATCCGCGCATCCATGCCCCCGAACTGCAGGAGTGGGGAGGCACGA
+TGGCCGCTTTGGTCCGGATCTTTGTGAAGGAACCTTACTTCTGTGGTGTG
+ACATAATTGGACAAACTACCTACAGAGATTTAAAGCTCTAAGGTAAATAT
+AAAATTTTTAAGTGTATAATGTGTTAAACTACTGATTCTAATTGTTTGTG
+TATTTTAGATTCCAACCTATGGAACTGATGAATGGGAGCAGTGGTGGAAT
+GCCTTTAATGAGGAAAACCTGTTTTGCTCAGAAGAAATGCCATCTAGTGA
+TGATGAGGCTACTGCTGACTCTCAACATTCTACTCCTCCAAAAAAGAAGA
+GAAAGGTAGAAGACCCCAAGGACTTTCCTTCAGAATTGCTAAGTTTTTTG
+AGTCATGCTGTGTTTAGTAATAGAACTCTTGCTTGCTTTGCTATTTACAC
+CACAAAGGAAAAAGCTGCACTGCTATACAAGAAAATTATGGAAAAATATT
+CTGTAACCTTTATAAGTAGGCATAACAGTTATAATCATAACATACTGTTT
+TTTCTTACTCCACACAGGCATAGAGTGTCTGCTATTAATAACTATGCTCA
+AAAATTGTGTACCTTTAGCTTTTTAATTTGTAAAGGGGTTAATAAGGAAT
+ATTTGATGTATAGTGCCTTGACTAGAGATCATAATCAGCCATACCACATT
+TGTAGAGGTTTTACTTGCTTTAAAAAACCTCCCACACCTCCCCCTGAACC
+TGAAACATAAAATGAATGCAATTGTTGTTGTTAACTTGTTTATTGCAGCT
+TATAATGGTTACAAATAAAGCAATAGCATCACAAATTTCACAAATAAAGC
+ATTTTTTTCACTGCATTCTAGTTGTGGTTTGTCCAAACTCATCAATGTAT
+CTTATCATGTCTGGATCTGACGGGTGCGCATGATCGTGCTCCTGTCGTTG
+AGGACCCGGCTAGGCTGGCGGGGTTGCCTTACTGGTTAGCAGAATGAATC
+ACCGATACGCGAGCGAACGTGAAGCGACTGCTGCTGCAAAACGTCTGCGA
+CCTGAGCAACAACATGAATGGTCTTCGGTTTCCGTGTTTCGTAAAGTCTG
+GAAACGCGGAAGTCAGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGC
+GCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTA
+ATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGC
+AAAAGGCCAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGC
+GTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAA
+ATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATAC
+CAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCT
+GCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGC
+TTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGC
+TCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGC
+CTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTAT
+CGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTA
+GGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAG
+AAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAA
+AAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGT
+GGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCA
+AGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAA
+ACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACC
+TAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATA
+TGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTA
+TCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTC
+GTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGC
+AATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAA
+ACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCC
+GCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTC
+GCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGG
+TGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGA
+TCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTC
+CTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCAC
+TCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTA
+AGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATA
+GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATA
+CCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCT
+TCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGAT
+GTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCA
+GCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGA
+ATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATA
+TTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTG
+AATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGA
+AAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCTA
+TAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAA
diff --git a/src/squirrel-1.4/seqs/scchrIII.seq b/src/squirrel-1.4/seqs/scchrIII.seq
new file mode 100644
index 0000000..59f7018
--- /dev/null
+++ b/src/squirrel-1.4/seqs/scchrIII.seq
@@ -0,0 +1,5256 @@
+ cccacacaccacacccacaccacacccacacaccacacacaccacacccacacacccaca
+ ccacaccacacccacaccacacccacacacccacacccacacaccacacccacacacacc
+ acacccacacacacccacacccacacaccacacccacacacacaccacacccacacacac
+ cacaccacacccacaccacacccacacccacacaccacacccacacccacaccccacacc
+ cacacaccacacccacacacaccacacccacacacacccacaccacacccacacaccaca
+ cccacacacccacacccacacacaccacacccacaccacacccacacccacacacccaca
+ ccctaacactaccctaacactaccctattctaaccctgattttacctgtctcccaactta
+ ctctccattaccctacctctccactcgttaccctgtctgattcaaccgtaccactcccaa
+ ccaccatccatctctctacttactaccaccaacccaccgtccaccataaccgttaccctc
+ caactacccgtatccaactccactaccgcttaccctaccatcgaccatgtcctactcact
+ gtactgttgttcacccaccatattgaaacgtctacaaatgatcgtaaataatacacatat
+ acttatcctaccactctaatcccactaccacatgccatactcaccttcacttgtattctg
+ atcggtcatacgcacacggatgctacagtatataccatctcaaacttacctactttcata
+ ttcatcatcacccatctctcaccatcagtaccaatgcactcgcatcattatgcacggctt
+ gcctcagcggtctataccctgtgccatttacgcataacgcccatcattatccacatttta
+ atatctatatctcattcggcgacaccaaatattgtataactgcccttaatacatacgtta
+ taccacttttacaccatatactaaccactcaatttatacacacttatgtcaatataacca
+ caaaatcaccactaaaatcacctaaacataaaaatattctactcttcaacaataatacat
+ aaacacactcaattgcgtatctataccaccatgacgtcattaacgtaaaagttccttaat
+ attgcaaattgcttgaacggatgccatttcatgttgaatgtggtaacccaatagcatgat
+ atgagtaatgctttagtattgtttcagagttgtttcagtaatgttttagacaaggagaac
+ atatagtagcaaacctctaatccggtagtacttaagaaactacagtttctatgtacgaaa
+ gcagtaactatgtaattattacatttacatgacatataggaaggtccaataaacttacta
+ cattatgacctataagctagatcgtaattcattacgtcaacaggttatgagccctagagc
+ aatgcttctgagaacataatcaatctatctagccccaacaattataaacagtggctgtac
+ ggtatcgagaccgctgctgaatatgctaacgaatatatgaacgaattcgttcataccgga
+ gatatccaatcaatgaaaagggattacaatctcagcgcgaatgatgaaagctttgtcaaa
+ accgtatttaacagtttcctggtaaagctctacaagaaaactatcgtgggtgaagctgca
+ tgtgaaatgaactggatatgtgatgattcgcttggaagggtctctgcttatgatattttc
+ tcgcacttcgaagaaaactataatgaagtcactattggatccaggcttactcttatagag
+ gacctaccaaatatatcctccaagcctgtagatgaaattgcttcctttttgaaaactcta
+ ttcacgatgcttgaagacaatagcgaagaacaggacaaaaaaaaaagacgcgacaccaat
+ atcgcgttgttattaatgaccttcttacccgagttaaaagaatcattccacgagaaattc
+ ggtgactctaaggctcttcagctatcacaagtcattagattctgtaaattaaatgcgtca
+ tcgaattcatcatcttcggtctcagatgcattggttgcacaagacagaagaaactatcaa
+ aagaaaggaaataagggatgtatataatttacggagctgatcatcgcataagcaactgtt
+ ctctgcttaaacgaagaataccagaagcacgaatctttaaattatatcctaatgacaaaa
+ cgagtagatcttcatctgctagtgtcgcgattcctgactatgaaacgcaaggccaaacag
+ caggacagataacaccgaagtcctggctctgtatgttatcttcgactgttccagctacca
+ aatcctcagaatggatttttgacacaggatgtacttcccacatgtgccacgaccgttcca
+ ttttttcatcatttactagatcctctaggaaagactttgtcagaggagttggcggttcca
+ tacccatcatgggctccggaactgtaaacattggcactgttcaattacatgacgtatctt
+ acgttcctgatttaccagttaacctgatatccgtttgaaaactatgcactaaatcaaact
+ cttctgttacgttcacaaaagagggtgtcactgtgcaatcacctgatgatgtggtttcta
+ cagctgggtattcacaataaaagactctgggaagtacaaagctcgccttgtcgcccaagg
+ acatactcaaaaggctggtattgactatcaggaaacctttgcaccagtcattcgatatga
+ ctctgttagattatttctggcccttgctagctgcctcaaactaatagtatatcagatgga
+ cgttgacactgcgtttctaaactcaaccatggatgagccgatctacgtaaaacaaccacc
+ cgggtttgttaacgagaggaatcccgactatgtatgggaactatacggcggtatgtatgg
+ actcaaacaagccccattactatggaacgaacatatcaacaatactcttaaaaagattgg
+ tttctgtcgacatgaaggcgaacatggcttatattttcgttccacatctgatggtcccat
+ ctacattggcgtatacgtagacgacttacttgttgctgctccctctcctaaaatatatga
+ cagggttaagcaagaattaacgaaattatactcaatgaaggatctcggtaaagttgacaa
+ attcctcggtcttaacattcatcaatcgacaaacggagacatcactctctcccttcaaga
+ ctatattgctaaagctgcatctgaaagcgaaataaacacatttaaacttacacagactcc
+ gctctgcaactcaaagcctcttttcgaaacaacttccccgcatctaaaagacatcactcc
+ ttatcagagcatagttggtcagcttctcttttgtgcaaatactggtcgtccggacatatc
+ gtatccagtctcattactctccaggttccttcgagaacctcgcgcaatccatttggagtc
+ tgctcggcgggttctacggtacctatataccaccagaagtatgtgtctcaagtatcgctc
+ tggatctcaggtggcattaactgtatattgtgatgcatctcatggagctattcacgatct
+ cccacactctactggggggtacgtgactctacttgctggtgctcccgttacgtggtcatc
+ gaagaagctcaagggtgtgattcctgtaccatctactgaggcagaatacattactgcaag
+ tgaaactgtcatggagatataatggattcaaaacttgtttgaacacttaggccagccact
+ tatctcatcaacatcatatgtagataataaacctgctataaaactgtctaaacatcctgt
+ atttcacacgagagcaacacacattgctttgagataccacaagctaagaaatgcagtggc
+ agcaggcataattaccatagaacatgttatcacaaagaaacaagttgctgacatatttac
+ aaaaatccttccagctgaatcatttaaaacacatagggctgtcatgataagggaaccaga
+ aactacaaaataaccatactcatgcgtattcagttatggggggatgttaaatgtggtaac
+ ctaatagcatgatatgagtaatgctttagtattgtttcagagttgtttcagtaatgtttt
+ agacaaagaaaacatataatagtaaacctgtaatcaggtagtacttaagaaactatactt
+ tctgtgtacaaaacactaactatgtaattcttacatttacataacatgtagaaaggtcca
+ ataaacttactatattatgacatataagttagatcgtaattcactacgtcaacatatccc
+ acgataaaaaaaataccttgactcaagtgccgaaacaggaaataatgaaagatttcatac
+ aaggttgtttctaaggtgctatatgtccctacggcctcgtttagccccatctcgcatgca
+ ataccgtaacataggtatatacacacagagaagtagtaattttacgtgtcttatacttat
+ tttatatagaaatgtccgacacttgcgctacactaaggcatataagatagatcttaattc
+ cttgtattaacatataaacagacaacataacatagccagtttatcttacttatcttttcc
+ tggaagagatggcattatttgtgtgtatcaaagaggcctttttcaattttcattttaata
+ tttttttaatattcttatattgaacacatttttctcccctagatttatgtttcctattgc
+ tctccatatcacattatatctatgaatgaatatgcattgctcttctgatactatggtatg
+ agaggccttggtaaagggcatcgataaccgaaagaatcatccgcagagaagtagaatctt
+ agagcatatcgtctcgttgagaccgtatagattcgcaaaaataacagaaacttgcgttaa
+ cattgaatttcaaactcagttttccataattaactgttaaaaaaggctactataagggtg
+ tcaattactgttctgtttcgcaggattaattccgttatgtgctcttgtaattgagttagc
+ gcttcttatatctcttcagtaatgcagcagtaataaaacacattggcttcgctttcgctt
+ tttttccaaatcattatcgaagattctttgcttatgatatactggtgctcgaggatgatg
+ agaggacacaactttttgaggcttatcgttatcgtctgagtcacaacaacatacattaac
+ tgtccacgattcagggatgtgctatgagttcaattttcgctgtagtgctgccattggcgc
+ tgctactggtgtctggaaaaactccacagctttcaatggagatgtgactgcacgagtgag
+ acattatctgtattagcttcacaaccatactctgttagtaaggagccatagtcgacatct
+ gtccagctcaatcgggtatgatatttgatatccccaaaatacatggctgtaccatgtaaa
+ atgagcggtatatcgaagatcagtatatttcagcctatacaggatcctactatttcaaca
+ aaaatgatttcatcaaaattaaatcacgctgcaacattggtgaagaatctcaaaaggtta
+ gtaacatcaatatatcgcggagtagctatcattgtggcagttaacttaaaggatatacag
+ ttaattcaacaagattttcaagtccattttttttgggcacgctgaacacgctcaatgtag
+ tacttgttttggaaatcactcgcaagtttaatgggacgaattatatcagtaatgtgtaaa
+ ttccgacacaatgttcggttagtagatttactgctggattattgggttcacccgtagact
+ gttgccgactccaagaaaactatgatctcaatacaggcataaagaattgttgtatctgtg
+ cccatgtgatctgtaccttttttgcggtaccaaaatcgcatgttgcactcacactacgga
+ acacatcaacttgtaacgacaacaatacaaaataccaggagacatatcactttttttcgc
+ ctagaatgatttacaatggcatacacttagaaaagtgctatgacatgacgcccttactag
+ aagttacatgattctccttttttttctttctttctttctttttcggtacaacataattaa
+ tgcattcatacagagcgcatttccagcatcatttattggatagtaactcctcccgcactt
+ tcaatatcaatgacgctgactcttttactgacgctgagtctttaccgacgctgagtcttt
+ accgacgctgagtctttaccgacgctgagtctttaccgacgcttgtgaaaataaaaagtg
+ cgacacagctatatgcggcgcaagtacagatgtgggtctatcgaacatgtttatcctatc
+ ctgttatgtcatttttatcggaagtaataaaagtgtagaaatcacttgtttttgcctttt
+ ttgtgattttgctttcatttttttttgcccttttttcaccagcttggggatcaacatcct
+ aatttcttgttacttcgattatgtctaaaaagtgccttgatttgagagaaaatattttcc
+ tcatctgtcaagacacccacaccatcttcagtgaattcttcgtgtgtcgccttgtctatt
+ agttccgggtctctcatgaacattgtaaacgcaaagaacggtaccgtacatgccaagcca
+ accgtcattattattcgttgaacatatctgtaagattcaaccacagcatctctttcgggt
+ gagccccaaggataatcactaatgaatgtataaggtgatgcatatgctatttctgccaat
+ gtatcgtttccaaggttcttgtagagttggttaggcatggtttgtgtccatatggcacca
+ gaaacggaggcaccgatagcagcgctagctttgaaaaggtatactggatgccggttatta
+ cagccatcctactatgcgtagtcatggcttgcagtatgacgatcactgaattgctgcata
+ ggagaccactcaaacccatgataacagatgcagcgataacaccttcatgagacccagatc
+ ctcctctgtatttgtaaaaaaggcccatgcacaccatccatgcagcacaacctccgatta
+ cagaaagtttcagtttccttgtctttgccaccaacaaactgtagaatggagatgcagtag
+ cagcaacaaagtcaggaaggtttactatccgagcagccgaagtggacgattctttcatcg
+ ataccagcaaaacaggatacagatagtcacacgaaatgaaaaaggtgaaaaaattgaaaa
+ aagtaacaccaagggtgcccaaattccacgatcacttagtaatttgaacggtaggaagag
+ gagatttggcaaatttggctcccaatataaaaaaatgaaaaataaacagccacctgaaac
+ taaagtggcaattatttttgaattgtgccacttctgtgatgtctcattagccaacgtcaa
+ agggacaaggatacaccctagcgacacagttattaataatatgccaacaatatcgagttt
+ ccacaatagaaacaccaaattctcaaataatccgcccgttctttcctttctagcctgttc
+ tttgagagatctccactcagcagtctttgaagatttgtacttcatataaagaataagaaa
+ tataattggcaaggcagagagtgggtaaataaaagcccacattgcaatattcccagacca
+ gtttttctgaggatttgctgctgtgataatattacctgaaatccatggtattatgatata
+ tggccaatatgaggcgtactggtaaaacattctccacttcaaggaggagaaatcagaaag
+ tattaatgtcaggagcagatttgttccgacgtatccacagttatagaaaaccgatcctgc
+ tgcatacattgtgagacgggtcgcctgtgattgaatgatggttcccattatataaaaaat
+ agttgcaactaaaaaaagccttagtcttccgaagtggtcagagagtctggagtagacaac
+ ttgggatccgacacttacaacagcattgataacttggacagttgaaagtaaggagtgttc
+ tgaatatgagttcgtcgcatagcccgtataggtcgatctaagtgtgtagtctaaactaat
+ tccaaacccacatacaaacgcggtacttatcagtagaattttatatttcaaggaatcaaa
+ ctgtgcagacataatttcgttttctttgagcttaaaaaaggtcgatgtcatagagtacgt
+ gtcatcgtttgagtgacgttctcgctcaataatctcacaggactgcctaattccactttt
+ tttattacttgaggcaccaacaacactagaactcatccgataatttatttacttgattat
+ tccttttttttttttttctctttccagtccataaaactattttacttttcttatttatgt
+ aaatagtatttaagattctacggtcgctacatctgctagaaatggaattactttaataca
+ ttccatgttctcatatatacttttactgacacctttcgtcactattttttgtttaacttt
+ ttttttggaagtactaaaattatatgctcgtaattattgagtatcaaaaaaaaaaaaaaa
+ aaaaattgctgtgacaccccttcaatgtggtgtctatacacctggtgactttaactttaa
+ ctttgacctgtcgtcgtacaatcaacttagtttccgaaaatccacgacgaaccagttata
+ ccgaagacttttgtacgacaagaatactgtaccgtcgaaactatctcttttcaaactgat
+ gagtgaagcttgtgtttatgtctgttgaaacaaggaaaaaaatgcggtgcctgtcgacag
+ gatatgcgcacaagtgcaaaatcatcaaaaaattgagtacctagatccggtggtgtagga
+ cgagcaaagttccttcataaacaaaccgtaagggttactgatacacaatttcctttttgt
+ aaagagtatttgagcaaattagtcagcgattgaaaacaacacccttttttctatatatag
+ taggtgaaagttaacatgcgagtaaaagtagtgccacatttcctttgcagcaggtgcaaa
+ ttggtacatatttaacagatagcgccgcacctcatatgcagcgcgtacatgatacgagac
+ gacaagatatgcaaaagataatagtgtcatcacacctttatgagaagcgaattttttttt
+ ttttttggtttaaatatatatatatacatatatatagatatagatatacacatgtataga
+ tgtattctagttatgcttatacctagaactgatcagaccagatcatacctggtatagagc
+ ttaatcgatagaaattcaaaaatatggaggaaactaagtactcttcgcagcaggagatag
+ aagaagcatgtggttcagacgcttcattgaatgctagaggtagcaatgattctccaatgg
+ gactttccttgtacctctgcctggcttcgtgaactcttgtactattcataactgcactgg
+ atattttgatagtgggaactattattgacgtggtcgcagaacagttcggaaactactcca
+ aaacaggttggctcgttacaggctacagtttaccaaatgctattctgagtctcatttggg
+ gaagattcgcatctatcataggtttccagcatagtctcattttagcaatacttatttttg
+ aagccggatccctaattgctgcccttgcctcttcaatgaatatgctcattgtcggtagag
+ ttgttgctagtgttgggggaagcggacttcaaacgctttgctttgttattggttgtacga
+ tggttggtgaaaggtcacgtccattggtgatttccatcctaagttgtgcatttgctgtag
+ ctgctatcgttggtcctataatcggaggtgcctttacaacccatgttacctggaggtggt
+ gcttctatatcaatcttcctatcggtggtcttgccattattatgtttttactcacatata
+ aggccgagaataagggtatacttcaacaaattaaagatgctataggaacaatctcgagct
+ ttacttttagtaagttcagacaccaagttaattttaaaagacttatgaatggcataatct
+ tcaagtttgacttctttggttttgccctctgctctgcagggctggtccttttcctactgg
+ ggctaacctttggtggtaataaatatagttggaactctggccaagtcatcgcatatttgg
+ ttttgggtgtcttactttttattttttcattggtgtacgatttcttcttattcgataaat
+ tcaacccggaacctgataatatatcctacaggcctctccttctaagaagattggtagcaa
+ aaccagccataataataataaacatggtaacatttctattatgtaccggttacaatgggc
+ aaatgatatactctgtccagtttttccaacttatatttgcgtcgagtgcatggaaagccg
+ gtcttcacttgataccaatcgttattaccaacgttattgcggccattgcaagtggtgtga
+ ttaccaaaaagctcggtttagttaaaccactcttaatatttggaggcgttcttggggtaa
+ ttggagcagggcttatgacacttatgacaaatacgtccacgaagtcaactcaaattggtg
+ ttttgctattaccggggttttcccttggatttgctctacaagcatcgctcatgagtgcac
+ agcttcaaattaccaaagatcgtccagaagctgctatggactttattgaagtaacagctt
+ tcaatacattcatgaagtcattaggtacaactcttggtggtgtgctttcaaccactgttt
+ tttccgcctcctttcacaacaaagtatcacgagctcatctagagccttacgaaggaaaaa
+ cggttgatgacatgattttgtatcgtcttcaaaactacgacggttctcattcgactattg
+ gaaacattttaagcgactccattaagaacgtattttggatggatctagggttttatgcct
+ taggatttttgttttgtagtttttcatccaataagaaattaatcataccaaaaaaggacg
+ agacaccagaagataatttagaagacaagtagcgcagttattttcttattttcattttat
+ ttttttcgccttttatacagacttcaacacaatcagaatcaaataggtgtatcgcaatgg
+ aatgtaatttcttaagtattctatatgtacttaaaacctattaatatatggatcaacaca
+ gtatcttatgaatgggtttttgatttttttatgtttttttaaaacattaaagttttcggc
+ acggacttatttggaattcaaattattaatgaaagaacaattaactaattaatgtactta
+ gtatttggccattattatcgatttcgggggccaaatctaaccaaattcaacctacatttt
+ ttcaaattgattcaaacacctttcacaataagatttttatatctagcgcacatagaatga
+ aatgtaaacaaagatttcagaaaaatcgtcattcaaacttgtattagacgagggacggag
+ tgatttttgtgtttgtttttattaattgtgggataggatagtagcaactcttggaggaga
+ gcattgtcagttgtccagtctctgaagttaagtagtaagtttgcggagtcaaagggggat
+ ggcttttgccatttgtgagagttgtgcggcagcatcttattcaaatagagctgtattctg
+ aagacctcttgtagaacatcatccatactaaaaagtaaatcgtcctgtcccattacgagc
+ tgtagtagtgctgtgaccctctgtatatttacgttgccatgaagaaggtaatgggcgata
+ ttttgatacaattcctgagttgcatgttggattgagtttacgaagggtcgccagacggcc
+ agaaacctccaggcggagttaacaactagtaatacggcatccatgtttgcatcagcgccg
+ agcctataccagtcactgagtagacgttttcttgctctttttatgtcctgacttcttttg
+ acgagggggcattctctagagacacaggcagttgcttccagcaactgcggtacggccgtt
+ ctcatgctgtcgaggattttttttgggacgatattgtcattatagggcagtgtgtgactt
+ atgaattgttgtagaaggacgtctgtgatgttggagatatgtattttgttaactcttctt
+ gagacaatttggccctggatagcgaagcgtgcggttacaaataggtcgtcttgttcaaga
+ aggtaggcgaggacattatctatcagttacaaacatcttagtagtgtctgaggagagggt
+ tgattgtttatgtatttttgcgaaatatatatatatatattctacacagatatatacata
+ tttgtttttcgggctcattctttcttctttgccagaggctcaccgctcaagaggtccgct
+ aattctggagcgattgttattgttttttcttttcttcttctattcgaaacccagtttttg
+ atttgaatgcgagataaactggtattcttcattagattctctaggcccttggtatctaga
+ tatgggttctcgatgttctttgcaaaccaactttctagtattcggacattttcttttgta
+ aaccggtgtcctctgtaaggtttagtacttttgtttatcatatcttgagttaccacatta
+ aataccaacccatccgccgatttatttttctgtgtaagttgataattacttctatcgttt
+ tctatgctgcgcatttctttgagtaatacagtaatggtagtagtgagttgagatgttgtt
+ tgcaacaacttcttctcctcatcactaatcttacggtttttgttggccctagataagaat
+ cctaatatatcccttaattcaacttcttcttctgttgttacactctctggtaacttaggt
+ aaattacagcaaatagaaaagagctttttatttatgtctagtatgctggatttaaactca
+ tctgtgatttgtggatttaaaaggtctttaatgggtattttattcattttttcttgctta
+ tcttcctttttttcttgcccacttctaagctgatttcaatctctcctttatatatatttt
+ taagttccaacattttatgtttcaaaacattaatgatgtctgggttttgtttgggatgca
+ atttattgcttcccaatgtagaaaagtacatcatatgaaacaacttaaactcttaactac
+ ttcttttaaccttcactttttatgaaatgtatcaaccatatataataacttaatagacga
+ cattcacaatatgtttacttcgaagcctgctttcaaaattaagaacaaagcatccaaatc
+ atacagaaacacagcggtttcaaaaaagctgaaagaaaaacgtctagctgagcatgtgag
+ gccaagctgcttcaatattattcgaccactcaagaaagatatccagattcctgttccttc
+ ctctcgatttttaaataaaatccaaattcacaggatagcgtctggaagtcaaaatactca
+ gtttcgacagttcaataagacatctataaaatcttcaaagaaatatttaaactcatttat
+ ggcttttagagcatattactcacagtttggctccggtgtaaaacaaaatgtcttgtcttc
+ tctgctcgctgaagaatggcacgcggacaaaatgcagcacggaatatgggactacttcgc
+ gcaacagtataattttataaaccctggttttggttttgtagagtggttgacgaataatta
+ tgctgaagtacgtggtgacggatattgggaagatgtgtttgtacatttggccttatagag
+ tgtggtcgtgccggaggttgtttatctttcgagtactgaatgttgtcagtatagctatcc
+ tatttgaaactccccatcgtcttgctcttgttcccaatgtttgtttatacactcatatgg
+ ctatacccttatctacttgcctcttttgtttatgtctatgtatttgtataaaatatgata
+ ttactcagactcaagcaaacaatcaaagaaatctttcactgctcttttctgtgttccatt
+ tagtttttagtacgattgcattgtctatatactgtatttaccaaatcttaattttagtca
+ aacattaatgtttactttcgtatttaaggatgtgtatactaagccttgggaaaatgcgtg
+ ttccttgtgccatccgaaagctacttcataaaaaagatactgttcattttcaagcccgca
+ tagttggatcaaatattaactatatatcaacatggtccaaagatcgtaagtttggtctaa
+ tgttgttttagcaaaaaaaaatactatacctttcattaacgtttattgaatttcttgaaa
+ aaaaaaccctaaattattataaaattatagcaggatagttcttagtggaagatattaagc
+ aaactattttctgaattgctatgcgaaatttctctaatgccagctgagtaactaactctc
+ atggtacataaaaatacgcatcttaaaaaatttgatatttcatgatttatttatagaaat
+ aacgatgcttattgtgctttgttgggtgtttgacttctatgttaacttacttcaacatga
+ aagcccgacgtttgcaatgatatcctctttttcacccgattattttcatttattagtaag
+ tggggttctgggtgaaaaaatcaaaacaaaaatatgaatttgtaatgatttttatatttt
+ cgccggaggtgctggaaatggcaaacgaaaatactatgacataaaagctgggcacactat
+ gtacgttctttttaattttttatcaacatgagaaaaattcatgaacactgtaccttatat
+ tatctaataaatgaagtaagctttgcatctccgtaacagctgcatatatataattaagcg
+ ggagctttctttcgatctctttcattccatatgtttattatgtactgatgaaaagcattt
+ tactaagttgaggtctctgtaagtatattgtgaccattatgctaaaatactggggtcttc
+ atttgtgtcaaattctacgtaatagttcccctttttcttacatttttatactaaagagtt
+ acggtatttatacaaacgaggggctttactgtaatatatgattaagattcaagctgaaac
+ attctaagttagaggtataaaatgattattgatgaaagagctgcaaatgtcggcaaaata
+ taaataccttagatgttaccagctgggaagattattacggaacacattcttataaatcta
+ taggaatacataggattgtatttctattgacaaagtacaacctatattttatgttatgtt
+ atgttatgtttgctatgactatagttgcttttcttcacttagtaaggattaactaactct
+ cgagctagtaaatttgacgtgagaagcataattatattactaatagaacgatatctggtg
+ tgttttattacagactgcactgagtgttagaggtagaaattataagttaaaaagcagtag
+ tttatgctttatgctcgagtatcaagtgaatttgaacaggctagtgcttcattggtactt
+ ctttcatggataattttgagcaaatttctgcagcatgtccccctttatacaaattctgtg
+ cattgccggcctagaaatatgtcaacgttttggatatgttgatgcttacttcgagaaatc
+ ttacactaatacttctggaaaaaatcaatactagcaaaatagtgatatatgagtaaaatg
+ tatgtagtacatgtatgaaaattatcaagggcaaattgatgcttcaacgaaaaagttatt
+ ggattttcaagcactttttaaattcacaatatttttttctggaaatatgaaattgtcagc
+ gacttctattacaggagtgtctttttttctcatgctatccaacgcttcttccaagtcctt
+ tatagtattagaggagctcgcacaagcaataattatcacaatgtcatccgcagctaattt
+ tgagcccagtgcattttctaggatcttagtgttgtaacccaaatgcaatgcggcgccaca
+ tgccggttcaatcaccatattgaattgatgtgtatatttaagacagggttcaataacatc
+ tttgtcctctattacaacggatctggtgttgtatttgcgagcgtattcgaaagtttgatt
+ agagatcacggccgttcctagagaagtagcaatacttgttatcttcttgaattgaactgg
+ ttggcctattttcaaagaagtattgaaaacatgacatccattcgtttccacccccacaat
+ agggatcctatcagctaaaccatacctttccaaaccttgaataataccattgtataaacc
+ acctccaccaacgctgcatactatgcctttaaccttattcacggaaatatgttgcgattt
+ caaatcttgtactatttcatctatcatagatgaatgtccttcccaaatatccggattatc
+ gaagggatgaacataaatgggctcaatgacctgagagtctattttattcatgacatttgt
+ ttttaaaaaagtatctgcttctttccagtaggcaccactcacgataacctgggcaccggt
+ gttcctgattttatctaccattctcttctttgtcgctgtaggaaccacgactgtacatgg
+ tagagacagtctttgacatgctgttgcagcagcaaaaccggcattaccgccagaactagc
+ gaaaacctgaggagatcttttaccgtccttttgaattcgaatggcacttttcatgatgag
+ attaccgattcctctacttttgaaggagccacttggttgaaggcattcatatttcaagaa
+ aaattgtgcagaagcctttccggggaagaattgacgtaataatggtgttttattgtagac
+ tatcgacatctcgctggttaattttcctgtctcttgtctatccagcacttaaaaaagaga
+ aaactgtataaacattttccttttatttggcgatatttatatacctgtgatcctcctcat
+ attgtccctttaatcttatctcatcgcaaacaatttgataaagcttggcactattatgac
+ actcatggaattacatttccgctgggacttaacaggagcgcccatgcggaatcatgtctc
+ cgcgcagaggactcattagtttccgttcgcggtgattagtactcgcgaaaagagaaaaag
+ aaaaatagtaggaacatttcgcaatgatcaaaatgtaaacagtgcagtaaagaatgggac
+ cataagatagataagaaacagctcgcataaggaaacaaggacacatcgattaatggcaac
+ ccaagccctagaggatatcacggagaggcttttaataaggtcgcaagaggctatcttgca
+ attagatctgtggatacaacgtcagcagagatcatcaatatgccagacaacagatcaaga
+ gtcattggataagttatcccaacagtacaaccagtatatgtctcaactgaactctttgta
+ tgttagatcggaatctgttcgagacaagttgagcaaggaacaacaacgcagattgatcac
+ agaggataatgagcatcaacgcatagaagacttggttcgtgaattccaggatatcacttt
+ gaggttgaacgagctggccaccgtccaaatgaagcgcctaatgattctccacaatcgcaa
+ tccaccagaagtagtttagggtcatttcaacctcgaccattgaaaataattgagaggcaa
+ cgtctgtgtatggtaactccatcgaaaccaccaaaaaaatcggtaggctttaaccccatc
+ aatgaagtcgattgtccttcgaaaactaactctttaccgtgctcacctaaaaaacaacct
+ gcaagaaatcgactttacgtgcagccaaatcacatgatactggcttgaacaaaagtaaga
+ aaccgtcttcctcggatacgtatgagtctttcttcaaaaatagacaaagactttcgttga
+ ccttctttgatgaaatggatgatgaagattttgattctgatcaagatactatcattctac
+ caaacataagtacccctccacatgttggtgtgaccgcaaagggtgctgaattcgaacctt
+ taaggagatataactctcacgaaagtatactatctaacaaaccagcaccttctaagtcgc
+ tcaatctgggaagtttctccgcctcattcttcaggccatctaatccgacgtttggaactt
+ cgatatcaaatgtccaagtgaactgtcatccaactgttgcagcgacaatggctcctagtc
+ gtaatggtcctcgtatttcaagttctaaggcgttgttatcatcattcattgcacgatcag
+ atacacatacggtaaaagagaacaacacaaatcttaagcatgcatcttttatggataagt
+ ttaattcatcgttaagcacaatatcagaatcttttcaaagtaagagggggagaaagaata
+ agggcatgaatgaagaacgaatatcaaatcataatgtagcacaggaacaaaaaaataata
+ tggatataagcgtctctatagaagaattgcaagatgctttgaatacagaactgctgtttt
+ aattaatgcctcctttacctaacctactcttcttttgctccaaatgtttattcttttttt
+ ttttattcagaatatcttcaagtttttttgtatctttttgtattattattaaacttcttt
+ ataatatttgtcttataaaatcctatcataacatgactatggcttggcctagactcgggt
+ gccatcttttttaatgcgactacttcaagacagcttctggagttcaatcaacttcttcgg
+ aaaagataaaaaaccactaattatcaaagctatcttgtccgctttcaaaaagtttattcg
+ attttgttttcgttttcattttcatttttttattatgatggtcatggtggtagcggtggc
+ tcccttcagttttcctctttggtgctattaattttctcgttttccccatatatgtgatag
+ aagcttttgaactccccactgtcttgtaagactttagaatcttgactgtcttgttacctt
+ ctttaaacttatcatttatgtctgttctggacgcaaaggatttgattattgatggatgtt
+ tgaaaacgccaactagtgatgtatctccattttccacctcatcttcatcctctgggcgct
+ tttctaagtcaattattgtgtttactgtttttgaagtggtcgaatcagtttgagaatttg
+ tgaatgactttatcgaactattctgttttagggtatataaatcttcaattgcttcatcat
+ tgccatgttgtatccttgatagctccttatccgtttcaaaatcctcataattattacttt
+ ttaggaaggataagcttttctgcacaaagtcttcagatatgataacctttttgttctttt
+ gatctacaggtttctgttcattatttcccgtattatcacctacattaatgctattgtcct
+ gagatctaggtcagtagccgtacttgtaatgtctaggttatattcctcttcaggccccgg
+ aagggggtttttatattcgataatatcttccaccatactttcaaaaaatgcgctcgactt
+ tgggtttttaacaagctttgcatcatcgccaatttctaatctcctttttctcatcaattc
+ tctccttttgagacgatattgttgtagtacgtcgtcttcatcatcatcacttagttctag
+ ctctaagctattttttgctcttttattacgaaatcctccatttttaatgtcataaagaat
+ tttgtttatcattttgatatccatttccttattttctgcggcaagcatttctctgatttc
+ atgaggattgaaattgttcttggaataatcatcgatcattttttctaggtcagaatcata
+ gtcgtcagatccttctccatctgcaccaccgataccatgccattcatcctcggattcttc
+ ggcttccatttcaaagaagtttgtgacacctcttttcttcaactcttttatcttagcctc
+ gtgctccttttctttctgttttagttgcaatttctctttgtcaatcagctgacggcgaat
+ tgcttcatcagcttcttcattgacgtatacatcatctttgacttcattttcgattgtatt
+ tctttcggccatcacctcttcaatgttttgtgtatctagaacattggtttcttgaaattt
+ gtcggtatcctctccgatattatcaccgtaatggcccagattaatggcgatccttttcgg
+ tagactttcatcagctttctctttgggtttggcttcgacttcggtatcagagtctgattc
+ gttaatgatatgtttcacgtggtgggattttttttgctttagaattaatttcatcatctt
+ ccttttcactatcggattctctacgattgtcattttcactaccagatgattcataatcgg
+ caatttcattaccactcaattgcaaatccagaagactctgaccctgagtcagaaccggaa
+ tcatgggcgttcaattgaaagtcattttcttcaagtttttcccttctcttctctttttgt
+ ctgattcttttatttctaagaatttcttgttctagcaagttctctactatttccttttct
+ ttggccatatcctccagcttcagacctttcgtttcaatcacttctttctgatgatccaga
+ atttgtttcctgctagctttccttaaggtatttaatagtacattatgatcaacttttggg
+ tctttacttttattcggcctttgagacaacttttgattttgtttagagagtcttgccttc
+ aggtttaatatagtcgcctttgataactgagagattggcaaaacgctttcatctttggat
+ aatttaatgctatccatatcatcatctccatattcgtcactgtcgctgtcgaggtccagt
+ gttatgcatttcgaggagtcaatttcccgtttcaaattattagcataggagctcaatagt
+ ggaacccttttctcatttattcttgtctcatcagttagatttaattcaacacttttgttc
+ tcatgcagttcatcgtcattctgtactggttttggatgtgcattttccagctttatatca
+ tcgtcctcatttgaggacgaatcgtcaaaatcagcaagaaaagactctttagaaaaaaat
+ tttttggttggtattggcttttgcggttttttgtattcgacatttctagctttttccctt
+ ttagaatcttctgaagccaattcgctttgtatttcccgaatttttaatgtgtgtggtact
+ tcatcttgaagtgtttgtgcgactgtatctatttgggttttctgttcctgatgcgtagcc
+ tgtggaataccaattgcttgtgtcttatcctcatcatgcgcagtctgtggtggtacatcc
+ atttgggtgtcaggaccacgattttgaatggactgaaataaaggactagttccaataagt
+ tccgttggaattataggaattttcggcgacagagcattagaatgattgctggaagtcgta
+ attatttgggattgtggttcaattgattttataggctgagtttgtgaggttgcaccgctg
+ ctatctatacgctgtgtagccctggccgtcctgtcgtgagaggtgttttcagcgcttgag
+ aagtttcgttgacgagaccatcttctttgattaattgggtgggtttcccgtcattaacgg
+ aatggataggtacgttaaacaccttgtcttgttggatgctgacaggtattctttgcgtct
+ gcgtgaatgaagaagagacatttttctcactgtaactattatctttaacttcggatttct
+ ccagctcttcaccaccatcgtcaaagattagagattaattgagttggtaacgaattctca
+ ctcctatcttttccattattgtgattctgttcgggtgctttcttgccttctaatctgttc
+ ttaacacgattaatgtggcatttgcaaataaaaacccgttcccagtcaactctggcggat
+ tgtctatgtcattgggcccatttccattagtattatcgttctcatctagtatgggaacag
+ caacttttttgtaggtggtagttctcttctttgcagtcaacgaggacaaagcatgcaagg
+ catcatccatcactaaaatatttggtgataagttcaaaagcgaataacgaacttccttag
+ ttgtttgtctgttcttggtttttttccaacataatgtgtttgaaatgctattgaagcaaa
+ cgcgttttacgcattctatctcttcatgacgcgtcgaagaaggataaggaaaatttcaag
+ tattgatgtatatctctctgcatatgagagccagtagagttggtgtcctagtaagaattt
+ ctttcactttacaaacatatctaagtaaacgaaatgtgtgtgtgtttctatatatatata
+ gacatatatgaaggattccgtagcggtgtaaactaattttggtttggcttgtatgcttct
+ tcttccggagctatgaaatcttttgctctctcttcctgcctttcgatttctctttccatt
+ tgcttttccttttgctcatttaatttcttcatttgcttttctctcttgcttaagaaatac
+ tcaccactttcaatttccaaatcaacctttctaggcaattgggcaggaggaaatggagta
+ tagaccttcttttcgacgtttctgatcttcttgggtttctttctggccacattcctcttc
+ ttaaacatgggcaagaatcttgaccaatcttcattggctaactctggcctttttgccaat
+ tctctttttatcattaattccttgatatgatagatagggtgaatatttttcatacaatct
+ tctactactcgacggacctccttcaagcccttgaagggacccatggcacttactgtgtta
+ ccttgtactagaatgtaacatttagttagaagttccaaagcctttaaagtattaccgtta
+ gggcctacaagacgttgtcttctcttgacaaacctttctttgttagtaacgaaattacca
+ attttaataacgtcgcatgccatgtcatcttgtaggatcttaacggcttgcgggaaagga
+ acggatctcgccaataatttgatcaaatctctggctttcaaaatgatagcgggatcgtat
+ gtttttctagttgtttttactgtcatagaaccttcgactaaatctagaacacacgctatg
+ ttgtgtttgtctagagcccttgttacatcattccaaatcgtcttcaagtaactttctctg
+ tatttaggaaacaaagtcataaaactggactcttcagcaaaaggttgaccggatgcgtta
+ tcctcttccttaaactcctctatcttccatttatcaatatcatccgtatcccaaggttta
+ tctctgttatgtgtagacaccatcgtttgccaatttggatatttgtgtgacccttttgtt
+ tgctgtctactttacaatagttaacctcatcatctcttttttttgaaaattttcatatct
+ catcgctaaaagaattagaaatataaggaaaaaaaaattttcgttttcagatgtgcaagc
+ ctgctataataaggtacaataactcaagggcatttagcaaggaaaaaatgggcaagtgta
+ gcatgaaaaagaaaggtgtgggcaagaatgttggtgttggcaagaaagtacaaaaaaaga
+ ggtcgatcagcaccgctgaaaggaagagaacaaagttacaagtggaaaagttaaacaaaa
+ gtagtgaaacaatgataccgacgctgctgcgggaggcaagtacacaagagccagctaaac
+ tgaaagctgagactactttgaaagccgaggagctgatcaaggaccaggaaaaggactcca
+ aggtacgagagcaaattcggacagaaaaatcaaaaacaaacgacagcatgctgaagcaga
+ tcgaaatgatatccggcttttccttatagggaatagtggtgaaagttacgtaaatatata
+ catatagagcggaatggtactgcatatttttattatttttttttttgaatatatataaag
+ catcgtcgcttaagactaaaccttcgaggatctgaggccagcgctggatctaaagatggc
+ gtcgccctcggcgtagcctcttccaacaccaaatccaatgcctagccatacaggaaatgc
+ acggcgcttgaagaataatactgaggtgaacacacccacgccaaaacccatggcagtctt
+ gaccagcatgttggacaggacaatatcccacttagtgtcgaggatcgtggacacggagct
+ gccattcttgttagaatccttggagggagtgcttttcgctggttgttgtgtttgtgcttg
+ ttcggacatggtatattttgtctttttccgtgtttattccctctcgtagcaaagttaatc
+ tgccaattgacagtagtttaatatatggtattatcgttgtttgaagccctttaaaaggaa
+ aaaacgggttagggccacccggcgcgaagtaatagctgctgattggtcagaatagtaaaa
+ cggcttgaattgatagtatatcgtactgtatataccttatatttgggctaataatcatct
+ cccaaatgctccagtatacatgtcacctagctatgtatgaggttgactaccttatatgca
+ gctactacacaataactgtagtatcgcgctgcacgagcgtgacggcggttaaatctggat
+ tttttgccaccttttttatacgaaaataagaactaggtaaataggaaggaatggttggat
+ ttgagtttaaaaatagagggaatacttggagatgcgattgttgctgtgcaagaattggtt
+ tgcgtcacctgtaatctcaccactactgtatacccgctccttatattcaatggctaacac
+ tactagtttccctattgctccccaggccccgcctaattggtcgttcactcccagcgatat
+ tagtgggaaaaccaacgaaatcatcaacaacagcaacaatttctatgattctatgagtaa
+ ggtagagagcccttccgtgagtaattttgtggagcctttcatgaagtttgaaaatgaatt
+ gggcccaataattaaccaattaactttcttacagcatgtgtcgtctgataaagaaattag
+ ggacgcatctgtgaactcctcaatgaaactggatgagttgaacatcgatctatctctgcg
+ tcacgacatctttttgcaattcgcccgcgtctggcaggatgttcaatcgaaggcagattc
+ tgtggaaagagaaactttcaaatacgttgagaagtcttacaaggactacattcattctgg
+ tttggaacttgacgagggaaaccgattgaaaatcaaagagatcaaaaagaagatctccgt
+ taactctattaatttttcgaagaatctgggagaacaaaaggaatacatcactttcaccaa
+ agaacaattggaaggtgtgccggattctattttgacgcagttcgagacaataaaatctga
+ caaagatagcaatgaaaccttgtataaagtcaccttcaaatatccggacatttttcccgt
+ gatgaaattggcatcctcagctcagactagaaagcaggcctttttggccgaccaaaataa
+ ggtccctgaaaatgaagctatactgttggatacattgaagctgcgtgacgaattggcctc
+ gttattgggctatgacacgtatgcgaactacaacctgtatgataaaatggctgaagatag
+ cactacggtaatgaactttttgaatgatttgaaggacaagctaattccgctgggcagaaa
+ ggaactacaggtcttgcaagatatgaaagccgaagatgttaagaaacttaaccagggtgc
+ agatccaaactactacatttgggaccaccgttactacgataacaaatatttgttagaaaa
+ cttcaatgtggacctagaaaagatttctgaatattttccactagaggctacgattactgg
+ tatgctggaaatatacgaaacattgtttaatttgaagtttatcgagacgaaagattctca
+ aaacaaatctgtttggcatgacgacgtcaaacaaatcgccgtttggaatatggatgatcc
+ aaagtctccaaactttgttggttggatttatttcgatttacatcctcgtgatggtaaata
+ tggccacgctgccaattttggtttatcgtcatcattcatgattgatgacaccacaagatc
+ gtatccggttactgcgttggtttgcaatttctccaaatctacgaaggataaaccttctct
+ actgaagcataacgaaatagtgacctttttccatgaattgggccatggtatccatgacct
+ ggtgggacaaaacaaggaatcgaggtttaatggccccggatctgttccatgggattttgt
+ ggaggcaccttcccaaatgttagaattttggacttggaataagaatgaattaatcaacct
+ ctcatcacattacaaaacgggcgaaaaaattccagaatctttgatcaattcattgatcaa
+ aactaaacacgtaaatggtgctttattcactctaagacaattacattttgggttatttga
+ tatgaaagtacatacttgtaaagacttgcaaaacctgtcaatttgcgatacctggaacca
+ attgagacaggatatttctttgatttctaatggtggtacgttatccaagggttatgattc
+ atttggccatataatgtcagactcttactctgccggttattacggttatctatgggcgga
+ agtctttgcaactgatatgtatcacaccaaattcgctaaggatccgttaaatgccaagaa
+ tgggatacaataccgtgatattgtgttggctcgtggtggcctttatgatattaatgataa
+ tctgaaagaatttttgggtagggaaccttctaaggatgctttcttgaaggagctgggctt
+ acagaactaaattcgttcaagcataaaattatacaattattctacaaagtgttattatgt
+ tagataatcttttaaataacctcgtccctggatagattaaaaccattaaatagctcaaat
+ cttggagtaaagcagcagtaatatgtctgatatgttcgttatattttttcttgaagttta
+ tcaacttatcctttactagaccatttccctccccatccaatcgttgatcgattttaaatt
+ tcaccttgcataattttattaatttccatacacattttatcactgaaatgcacgagtaac
+ acagccaaactttccgtaatagcccacttacatataacttatcgttgaccacgcctaggt
+ tttccaatactgcgaagaaatcaagcagcgaacttacaccactccatatattctcccata
+ tatcttttgcactgatttcagccgtatttttcttcgaaaccattcaactttcttttcttc
+ ttcctttacttttattcatctcctctttacttcttacctcagagtcgaacattgaaatca
+ tttcccgcgaatttttcgattataattaccctttctacttaacaagaagcaataagagca
+ aaggggcgaaatgcttgtcccatactttggcttttggatgtgacattaggagccttatct
+ cctcttaatgccttaagtcaaaaaatagtttttgggatatatcttgtgatcaaatgcttt
+ gcaaaagggtaaaacagcgtttacaataggatagccaaaagtaaaaaaaaaaaaaaaaaa
+ aaaaagaaggaagttagtatcgagctcaaacaattttagttttgctggacatattttcag
+ tttcattactgtatatttgcagcttggtttttgtcgctctgtagtcttgtttcacacttc
+ gtcatttccgagtttcatcttgcgacataacttgtttcaacttttcaagtgcgacttttt
+ gttaaatattatgacatagaagaaagtagaacagccattataaaacaaaagagaagtgag
+ aatactattaagatggcattccaagatccaacttacgaccagaataaaagcagacacatc
+ aacaacagtcacttgcaagggccaaaccaggaaacaatagaaatgaaatctaaacacgta
+ tcattcaaaccctctagagacttccatacaaacgattactcgaataactacattcatggg
+ aagtcgctaccgcaacagcatgttactaatattgagaatagggttgatggctatccaaaa
+ cttcagaaattatttcaggcgaaagctaaacaaataaatcaatttgccactacgccattt
+ gggtgtaaaatcggaatagattccattgttccaacgttgaatcactggatacagaacgaa
+ aatttgactttcgacgtggtgatgattggctgcttaacagaaaatcagtttatttaccca
+ attttaacccaattgccattggatagattgatctccaaaccaggtttcctgttcatctgg
+ gccaattctcaaaaaatcaatgaacttactaaacttttgaataatgaaatatgggctaaa
+ aagtttagaagaagtgaagaattggtttttgttcctattgacaagaaatcaccgttttat
+ ccaggtttagatcaggacgatgaaacgttgatggaaaaaatgcaatggcactgttggatg
+ tgtatcacaggtacagtaaggaggtctacagatggacatcttattcattgtaacgtagac
+ actgacttgagtatcgaaacgaaggacaccactaatggtgctgtaccatcccatttgtat
+ cgtattgcagaaaacttctctaccgcgactagacgattacatattattcctgcaaggact
+ ggttacgagacacccgtcaaagtaagacctggctgggttatagtgagcccagatgttatg
+ ttggataacttctcacccaagagatataaagaagagatagctaatttaggttcgaatatc
+ ccattaaaaaatgagattgagctgttaagaccaagaagtccagtacaaaaagcacaataa
+ agcatcaacacataaatccttgcttagctcaattaaatatactagtaaataagtatataa
+ acaataattttgcattttattttactggaaactcttctcaaaaccaaattgcgcaaggat
+ tgattcagtacaattatgcaaactcgaaaagtaaataaacaaaaagatacagttgagtga
+ aaggtaaataaacattattctacttgattttcctctgttttttccatttttcttttttgt
+ cttttcgtgttctagcgtgttcactttttcagccatgtaattacttaaataggaagttta
+ cacgatgttctcgcataaaaagctggaagtgcagaacaaagaggaagaaaataggaacag
+ agaatttgtttgtttatttgtttgtttgcttgtttgtttgtttgttttgatttgaagtcg
+ aggtacttattcgaaataattattttgcccagggttttggatagttttttgttttccaat
+ tagatttttagaaatattttatcatagttttttgtttatttatttgtttatttatttgtt
+ gatttgtttatttatttgttggtttgtttatttaagtttgtaaatgtttaataaacaaaa
+ caatatttgcaagccattgaaaactttgtaaatttgtttgagaatttacgaaacaatgaa
+ aattaaggactctttgatggtgaacgaattactactttgcagcaggtaattctctaaaac
+ tcagaaacaaggcatgtatttttcactaggaaaaagaaggaaaatgggactacagtatta
+ gggaaacaggattaaaattcgtaggtaataataacccatactttcattgtagcagtactt
+ tcgttaataaaatggatattgttgtacttcggtagatgttatattctaaagagttaaatt
+ atccaattccaaattctagggacggtttccaaagatttgtattgtccagtgacataattt
+ tcactaaaacattgatctgtttaataggtttgctataaatgcacaataatttttcaccca
+ tgatacccaatttcaagagaagcaattgctacatataattatttaggctttactatctac
+ tactcattgactgtgcccttttacacaattataacaaatatgtcaaagcagatgccatga
+ actttgtatctgaatttttgatttccttttaattctaattgcagacgacgtaaatatagt
+ tctgaatttcaaagtcactgttaattaattgttctaattgtttggtttttttaatataaa
+ tcactagtgcttaagttctgttgacgcacacagtacctatctttgattccttcgtgcaaa
+ cagtattccggcacgttaattgataccaaaaagatttcaaaatcagatgcttcaagacta
+ attgtatactttatgtgacttctggagtttaatttgttgtattgatgagataatattgtc
+ tattatattgccaataacaaccagcgccaggtgtactaagtcgagatgacagaaaattta
+ taataagtgtaaatacaataaattttgtaaataaataactacttcaatagaaatctcaaa
+ gtcgaagaaacacagaacacagaagtaaactggactaaatgattttcaataaaggaaaag
+ cacgaaaaattttttgaacttttttgggacagcgtggatcgtgtaattactgtattggga
+ aatttacacgattctcgacaaataaagaaaaaggcgcgtttaaagacttacaagcacacc
+ tttgaattatttttgttctctatgattatcaaatgtgctagtgtttgcaagatggtgctt
+ ttttttctttagattgccacgtttctgctcctctttgtggttttctttttttttttcaat
+ cttacttataaaatgaatatttcaattgatgaatagctatatattggatacaaaaattag
+ catttaatcgaaaactgcagcatgtatttaatcgaaaactacagcatgtagctatgatac
+ gacagaagatttttgtttttatagttaagtcaagaagaaattctatttgtccagcaatcc
+ ggcgcaaagaagactactaaagggtattgaaaaatttctagaaccagaaataatcgaaag
+ cattactttttttggctctataaaggagttctggacagcttgaacttccgaagtctgtct
+ gtttagctcgttaagataaatctcaaaagagtagcaagcaggtgggagggataatttgga
+ ttaaagtgtatatattgcgtatatatagccacgtaaacctcacactattgagtttaaaca
+ tttttcaccttacgttattattcttttgtctaattttaatctcctccagtatactaaaac
+ cgcaaaagcgggttacccagcttgtttctttttgtctgaaaaattttttgcgatgcccat
+ tatgaaaaattggcaataatagtatagtagttagtttaagtttagattctttcaatactc
+ attctgcttcagtttgtagttagatttaactcaatagaggtgattggcaaaaatgggtaa
+ gacacaaaagaagaatagtaagggacgtttagatagatactattatctagcaaaggagaa
+ aggttatcgtgctcgttcatcctttaagattattcagattaatgaaaagtatggccactt
+ cttagaaaaatcgaaggttgttattgatctgtgtgctgctcctggttcatggtgtcaagt
+ tgcatccaaactctgtcctgtcaactccttaattattggtgttgatattgttccaatgaa
+ gccgatgcccaatgttataacttttcaaagtgatattacaacagaagattgtagatcaaa
+ attgaggggttatatgaagacttggaaagctgatacagtgttgcatgatggtgctcctaa
+ tgttggtttgggttgggttcaggatgctttcacccaatctcaattaaccttacaagcttt
+ gaagttggccgtcgaaaatttggtggtgaacggtacttttgttactaaaattttcagatc
+ caaggattataataaattgatttgggttttccaacaattgtttgagaaagttgaagccac
+ aaaaccacccgcatcaagaaatgtttctgcagaaatttttgtagtgtgtaaaggtttcaa
+ ggcaccaaagaggctggacccaagattactggatccaaaagaagtttttgaagaattgcc
+ agatgggcaacagaatatggagtccaagatttataatcccgaaaaaaaagttagaaaaag
+ acaaggttatgaggaaggtgataatttactgtatcatgaaacctcaattttggattttgt
+ caggacggaagacccaataagcatgcttggagaaatgaataagtttacaattgatgagaa
+ tgaccatgaatggaagatcttgaagaaattgaaacaaaccacagacgagttccgttcctg
+ tattgaagatttaaaagtcttgggtaaaaaggattttaaaatgatcctaagatggagaaa
+ gattgcaagggaaatcttgggcatcgaagttaaggacgacgctaaaacagaaattgaagt
+ agtaccgttaacagaagaggagcaaattgaaaaagatttgcaaggtttacaggagaagca
+ gcgtctaaatgtcaagcgcgaaagaagaaggaagaacgaaatgaagcaaaaggaactaca
+ gagaatgcaaatgaacatgataacccccactgatattggtattgaagccgcaagtttggg
+ taaagaatcgttgtttaatttgaaaactgcagaaaagactggtatcttgaacgacttggc
+ gaagggtaagaaaagaatgatttttacagacgatgaattagccaaagataatgatattta
+ cattgatgaaaacatcatgatcaaagataaggattctgccgctgatgcggacgatttaga
+ aagcgagttgaatgccatgtatagtgattataaaactagaaggtcggaaagggatgccaa
+ gtttagagctaagcaagcacgtggtggcgataatgaggaagaatggaccggttttaatga
+ gggaagtctagaaaagaaagaagaggaaggtaaagattatatcgaagacaatgacgatga
+ aggtgtcgaaggagactccgatgatgatgaagccatcaccaatttgattagcaaattgaa
+ gggacaagaaggtgatcacaaattgagtagtaaggcacgtatgattttcaatgatccgat
+ atttaataacgttgaacctgatttgccagtaaataccgtcaatgatggcataatgagttc
+ cgagtctgttggtgatatttctaaattaaataagaaaagaaaacatgaagaaatgcacca
+ gaaacaagatgaagcagattcttcggatgagagttcaagtgatgactccgatttcgaaat
+ tgtggccaacgataatgcatcagaagaattcgattctgattatgattcagaggaagaaaa
+ aaatcaaacaaagaaagaaaagcattccagagacattgatattgctactgttgaagccat
+ gactttggcacatcagttagcattgggtcagaaaaacaagcatgatcttgttgatgaagg
+ tttcaatagatacaccttccgtgacaccgaaaatttgccagattggtttttagaagatga
+ aaaggaacattcaaagataaataagccgattactaaggaggcagcgatggcaattaaaga
+ aaaataaaggcaatgaacgcccgtcctattaagaaggttgctgaagctaaggcaagaaag
+ aggatcgtgccgtagctcggttagaaagatcaagaagaaggcgggcttgattaacgatga
+ ttcagacaagacagagaaggacaaggctgaagaaatttctagattgatgcgtaaggttac
+ caaaaaaccaaagaccaagccaaaggttactttggttgttgcctcaggtaggaacaaagg
+ tttggcaggtagaccaaagggcgttaagggtaagtataagatggttgatggtgtcatgaa
+ gaatgaacaaagagccctaagacgtatcgcaaagaagcatcacaagaaaaagtagtaaca
+ aagagcattatttttccattcccttgtataatattgaaaataaaggaaaagaaagcaata
+ gatatattatgtatacatagaattaatcaaattatcatcaaaaaacttataggaaaccta
+ tcgttttgaattcagaaaatacgagtgcacagtagaacaatttttattcccgttttactg
+ atcttttcttcttgttgttaccaaataccttggagaaaatgtatatgatggagatgagta
+ agcatagtaacgtaccatttttgatcttggaataatcctttgtgtcaggccttggaattg
+ gtactagaagagttgtcgagttcaaatggcggaatgtagtatcgtctgtaaagagagatt
+ catatcctagaccttttttataaaatggattacgggaaactttattgtcacctgtgtcgc
+ aggctaatataacttctggatcaaatgaaacttctaatttatcccctttattattagaag
+ gctcaatatatctagtatgcaatgtcacttcattcattgtgccggctttcaattcaaaga
+ tactttcagaaccccatgccttatctcgaagagagtattctggtaattctaagtcgtctt
+ ctccaaaaagtagtaagggagaggattggaatttatcgataaataattctaacggcaatt
+ gcaggtgcattagatacatgcatttagggcgttcttcgaaatctgtgagatcaatcatga
+ tttttggatgcaaaccgataggttcattcagatatagtgatgtcgtagtcgtcgagtgat
+ tgtatgcgatatggccctgtttatacaataaggatgttttctgacatctttccattttac
+ catcgtccatcctccaattacaacgtattccaccaatatcgacatcactttcatcacttg
+ catccacaaagaatacacctacttcgagtttctcatctttcactcttgcaattgtaaatt
+ cttcaccctgtttcaacagacgatattcaacaatttggatttgtgtgggctccactgata
+ tatcatatgtaaaatcttctggattccatgacagattcaaatctacttcgggaggcaagt
+ actgctctatgtcaaacttctcactgtgaaatgaattgtagactgggttagtaataaacc
+ tttccggcacatccgaagaatttgagtaaactgaaaagccagcactcagttcattttcta
+ taacgcttaaacttctagccaagtccgctctgggcctccaagtgattctcgtaaaggatt
+ tatccaagcttccagtcctctctaatagccacctttgttgtaaaaccacaccagaacctc
+ cacgaacagtcaaatgagtgtcattttggcgcatatgattaccgatatcctcaggggcat
+ tgtagagtacagtcactctatgtcttgtcaccattactcagctagaaacgcacacttctt
+ gcgtttattctcctcccgatttgattatgattagcgtttccattatttgatcttttttat
+ cagggggatcttcaaagaaaggaactttttttaggaaaaaaaaataatcatcttttatta
+ catgtgtaatacttcaagcaatcataaacaagagttaagaaggcaagtctgctgtagtgt
+ gtcctccaatactcttgcacgacaaagttttagttatccccttctacgtgtattgttcct
+ ccgtggtcgattgtgttcaaagattgggagctcttttcgtgttttgcactaagagcatac
+ tgttaatccgcaaaggccactaaaacttacctgcatttgtctacttaaagatacattcca
+ actatttacattcatttttatttctaaaagagcattctgaaccctcggcaattatttcat
+ actccttctcagaataggaaaatgcccaatacgcatactcaacatgtgcaaatatcagag
+ ccaaatcctgtaaatactttgtctacaccatccaaaagaggtcaccgccatcgcagatcg
+ ctagcaatatcaggagattttgattttttgaaacagcctgcagcaattgtgaatttacca
+ cctccacaggcggctgaaaattgtccttcaactgccccaactgctgtatcaagtacatta
+ tcgccaatacgctacaatagatttccttgcaaaaccaatgaagacgctggaacgttagat
+ ttgcctgaaccaagattttatccgttatcaccaaagaacaatctgcaaacaccaagtcca
+ cgatttttcattagtgaagagccaagtttttcatcgccagttaaaggcgtcccagatgcc
+ attattaaccttgacgatgcgttgaagacaaggcctaggtcatttaaatcacatagaaga
+ tctgaatccgctcctcctgatttggaggttatggtagataagggcaattgtgcagccggt
+ tctaactctatgattaaagaagaagaggactccttaattgaaccagaatcgaaaaatgaa
+ tattatgagcaaaagcttccaacagcactattatccccactgcggccttccctttgtgta
+ tctgaacaggccattgatgtagatgattcagctctcaatgggtcaccgacccatcacaac
+ catgggatgcaaaacgccaatgcacggaattccaacacattcaattcgttgaagatcaaa
+ ggccaaaagcaaagatattatcattatacgaagcagctacctttgaccgtaggctgtgac
+ tcgcaatctccaaaagaacaaaggtcggctgcttcaatgacaatcaatcaggcaatgaca
+ ccttcttccctggcctataccccttctaaactagcatctactcccgcaacaccagtatcc
+ ttttatgacagcaatgcggacattaacttagaaagtgataattttccactaaaagataac
+ cctagatatgccaaggatggttatcctaaaaagtgcggcaattcacagcttaatcgtgtg
+ ctggatagcgataaaagacaggattttagtggagaatcgagaagaagaagatcgggcagt
+ cctatctcccacatgcaacaccgcaacctgattgataatatgaaaggtagacgaaacagt
+ aacacgataaactcaatcttcaactacaagagtcaacattatgaaatgccatatgatgat
+ atgatgaaaaatgaaaacattaatgcacagtccatgcccttttcagtcaacggtgtcaac
+ aatgaaaatagtatcggaggggttattacgagagcggacgatgcaccccttcaacactct
+ gtggtcaaatcctgtacgcctgatggcaaggaagaaatgaataggcttaaaagtaatgac
+ agtaatgaatattccaagtctgaagggcagatcagaaccaattcgcaactaagtaaggac
+ attctcatgggtgaaccaggtgatatggttgatctgtcctcttttgtcaacacgcagaga
+ aaagcctcaaatgaaactggtgacttagtctttagtttatcccaggatgatgacgcactg
+ aaaacgttccatgcgagcaatagcgcagcaacaagcaatgaaagctggtgtattagcgat
+ ggtgcgttaggaaagcaggcgcaggacagtgaagttaggaggaaagaaatcaaattagga
+ ctctttagacatattttcaaggaagtaatacaacaatattaataataagtaataaaaagg
+ tgaatcgtaaccttagcggcattgattgcatgtagttttaataataaagtgtagaataat
+ aaatataaaatttcaaagtcattagaaataaaaattgaatagacatatacatatagaaat
+ tacttctatcatttaaatacatatatatatacatacatgtatacgtgtatacttgtactt
+ gtgaaggactcaatagttgtattcgtttggtttttgaccagaagtgtttgggaaaccaca
+ ttccaataagatatcgttaatcaattcaggtttttcagtaatcttggagaaaacttcatc
+ atttttggttaagatcataccacaataacctgtggagttgaaacctatcttcatttcatc
+ agaaaaggccttcgaacgtggaacgacacagatccattccttggttaacataagattgta
+ ggatttctttagttctggattttcgttcaaccagtcctggaaaaaggtcaaagctctttg
+ caatatggagatgtaacacatagctaataggtcttcatcaacagtttcttcggactccgg
+ cattggcaagacaaaatgagcgaacgagactttagcatcttgcaaaggttcagtattgaa
+ agttggtaggaaatgttctttaccattacatagtctatcttggaaagtgacgaacttttc
+ aggcatttgcaaaatttgcaaatgtttgtggtccaatgaagaaccactggctggaccaga
+ attgtaaaagaccatgtgtctcttgtcggattcttcattgtccaaggcacacagcaactt
+ ataagcagtcaataaatcggttggggtcaaggcatcagtttgatgttggtattcgttagt
+ taccaataaagtgtgtccagggattacagggaatttgtttaatagcaatttatacgcttt
+ gttatcggcaccaccaaattctgggataaccgttaattcttcctcaggtttgcctagtgg
+ atcctctccttctggagtttggccacgctctggcttttcgatcagacttggcatgtggct
+ aatcaagtatggcatgctggtttttgggtcctttgttttcgttgtttcagtctggataaa
+ ttttaagttaccattatcgaaggcacttttgtacttgtcactaattaaagatgcaatgtc
+ agcggggatactcatttttattttaatgtttacttttctgtttgttctaaaatctatcta
+ aactggctttcaagatcaatctattgtcttttaaggtaaactttaaattggaaataatag
+ taatgttagttccttcattttaaccttgtattgtatttcctttgcgtgatgaaaaaaaaa
+ ctgaaaaagagaaaaataagaaaatcttctagaacgttccgaaacaggacacttagcaca
+ caaatacagaataggaaagtaaaaggcaatatatgaatgcagtgcttgtaactggtgctt
+ gtatccaagaatagcttcttgctgtaggttatgggattatcgtgtaagctggggtgactt
+ ttgagctattcgcgacgcccgacgccgtaataactactttcgacagaccacttatgacag
+ tatttcaggccgctcttataaaatgacatgttaacaaacagttctgattattcgccttct
+ gacaggacgataatgtaaatagttgtggtagtatcattcaggtatgtaactgtttacttg
+ gtatcgcttgaaaaaaataagcatttcagagccttctttggagctcaagtggattgaggc
+ cacagcaagaccggccagtttgaatgctcaactcttcaaaagaaattcctcaaatatgtc
+ cagtttcatgtactgtccggtgtgatttattattttttatttactttgtagttcttaaag
+ ctaaggtttttttctttcctaaattcttgtttaaatatcctaaaactaaagggaaaataa
+ acaatacataacaaaacatataaaaaccaacacaataaaagaaaggatcaaatactcatt
+ aaagtaacttacacggggggctaaaaacggagtttgatgaatattcacaagataaaaatc
+ atatgtatgtttctgatatatcgatatacaatcaaacactttcaagaatttgtttgtaga
+ ctttttgctagagacctcatcaaagtgctaccaactaagatcaacttatacttcttttag
+ agaaaatttttttcaatgtactccaaagagatttagatcctgtctcttcctcttcctctt
+ cctcgaaagtcaaagaaaaatcagagtctccctgcttattcaggcggagaggctctaggg
+ tagttgcgtttctctcattgggacactgaacctcattttccgacattttggtcatgtaag
+ aggcgacaggctcatcgcaggtaggtgcatcaacatggtagtacctggaccaagcgctac
+ attgagtccctcctggataaacaccgctacaatattgtctttgaacgtttgcccaaacca
+ tatattttgaataccaaagctggaccacattgtatggcttaatcattggtgctaccataa
+ tactggattgggaaacagtctggttaattttattcaaccaatttttcttatctagcaatg
+ atttaataaacctgaaatctaaattgtcttcgttagcgtctgtgtcataatctacaattg
+ agtactgtgacgtccaattatatggcaccgagatggggaatctgtccggtgtttcgtcgc
+ tgttatccttctcctccctccaaatgcagtcagaggcaggtgcccattcggttcgccagt
+ ctccgttatttactacttggtactgttcccaatcgtaatacgtttcctctgggttgaaga
+ tacttgctctgctcttgacattgcccatagccacaccacgagaaacatcgtggaagatta
+ cggagctgtttacgatagcaggagcaatggatttgacgaatgacacttgataaaagtctt
+ tggtcgaaaatacagtatcaacgccgtcgtgggactcgacccctggtagaagaagcagag
+ catctagatcaacggtagatgctgcagaggtcaaagccacaaagagggaagatgcagtta
+ cgaggtatttactaaacattggttcgaagtgattacagatttttctctcttgaaccctgg
+ aattgcaatagcttggttcctattgggaccggaccctcttgcgtaaagtatatatttaga
+ agcagctatttaaatctgaatgcaagacgaaaaaaagctcgtttataggagaaaaatacc
+ aaaaaaaggaaaaccatacacattcgggtatcccagctttgcttgcaaaatcaatcgatg
+ agcaagacaaacagggccagctgatgcatatgttttgtgttgctttcctacgatcagcta
+ atgcgagtgttataaatattcccctagttttcttctcctcctttatgtaacaataacttg
+ tggacttccccaacataagactaaaagcaaagggataacaagggatttatcttattccat
+ ccctttgcctagcgcactgttacgcagctgatcttatcaacaagtgtggttcttgcctct
+ aagatttacattgccttttcccaaagggaaggtgggacaaaactcattttccctaagggg
+ aaacgtaagataccctttagggaagataagggaagttagtagattgacgacgtaaatctg
+ tgatgagatgaaaaaaaaagattagcatctatataagaaagtacataacctccatttaag
+ ttgaaaaaaatttgtatgtaatgtcttttcttcctcttttctttatttcattctcaccgc
+ taaatcgataacggctaaaacaattaaagctcccttttcaaaccttatacttccgttaaa
+ taatgcaaatcaaaaacattgtcgctgttctagctacagtcactgctatcaacgcccaag
+ ttggcattgaacctaacgccactactccaaatgctacgcagccaaatgctacgcagccaa
+ atactactcttcctaccgcttcggtgactacgacagtttcgattggcgaagccgtagtta
+ ataccatggcggcgggtgcctttggagcagcaatcgctgccggtgttgccttcttattct
+ aggaacatttttatttcctttatttttttctttctttctaattccataaccgttgttcaa
+ aattggggagctttttaactttcgttccattaagctttttctttgttctttacgtttgat
+ tctatttcaaccataatttattttttcgttatttaataattactaatatttaatactaga
+ tgtatatacgttgtgtgtatgcgctactatagtatagcttaactgttttattataccctt
+ tttttctctctttactgtacagaagatgcgtcactaattcaaaaaaaaaaaaaaaaaaaa
+ aaatataaaaaagaaagattttggagaagctatatccttttgtgacatacaatagagaaa
+ cttgttctgttggtaagatatctcatctgaaatacaacaaggaaaaccaaaagccatttt
+ ttcattacttgatcgacttatgaaccgtattactaggaaaagttgtttattcgcgattat
+ atttgcatcattatttgtgacacatgcattgggtgccgctattgatccgccaaggcgacc
+ acataatgtgaagccttttcataacggtaatctcgaacttcaaagaagagcaaatgaacc
+ gttttttgaaatagatgtcaagagtctgaacacaaactcaccgatatcagagttgtgtaa
+ aaaagatttgcacgtcattgaatcgtctcatgatctttttcatttacaaaaccaatgtga
+ attcatcttggggtcattaaaagtcacaaactatgattctaacattttggatttgaacag
+ cttgagggccattggtggtgacctgattattcaggattcacctgaactgatcagaatcca
+ agccgggaacttgaataaaatcgaagggctcttccaattacagggactaacctctttggt
+ ttctgttgaaattccaactttgaaattttgtcagtcactggagtggaaagttgttcccat
+ cttgaactacgtctccatggattctcagaatattgagattataaaggatattgtcatatc
+ ggatacttcattagcaaacatcgagaatttcaacaaggttcaggaaattgatactttcaa
+ tatcaataataacagatttttagaaactattcattcgaacgttaaaaccattaggggaca
+ attcagtgtacatgcgaacgctaaggagctagaacttgaaatgccacacttgagagaagt
+ ggaaaacataacgattagggacacatcattggtctaccttccacaattaacaaaagtgaa
+ aagctctttagagttcatcgaaaattacttttacgaattgaacctgaacaatttgcagaa
+ gattggtggaacattaggaattatcaacaatgtaaatttaataaaagttaatttggagaa
+ cttaacagacattcaaggtggcttgatgatcgccgataacgaatccctcgaggatattac
+ tttcctgccaaacttgaagcagattggaggtgctattttctttgaaggttcgttcaaaga
+ tatcatgttcgatagcttgaaactggtgaaaggtagcgcttttattaagagttcatcaaa
+ cgtgttggattgcaataaatggacaaacccatcaaatggaagatcaatcatcaggggtgg
+ gaaattcacttgtatttctggtaagaaggaaaatacgctgaatgttaaacaggatggtac
+ aatcatagaaaaagggtacaaagatttaacgcaagaaggtgaagactccaagaaaagagt
+ gatttcaaaatacgcgaactcagcaaatccaagcatgcaattggacccccttctttttgg
+ tacatgccttgttgctatgttattgttttaagattactagtctagtataacagaaagttt
+ tttttgttcctaatattatctttatctatatacatatatctccatgaaaggcatttgaat
+ ttctatctttctctttcaatcaaatattgtcttattattgatcaaataatcgtatatgca
+ caatggcaactcgtctttcaaactctccacttgcccattcttcagtttgttcctaatctc
+ ggatgatgaaacatttcttaatgctggataatcattttgcataacatgaatcctagcgcc
+ ccattctcttggtattaccggttcgaattttccctcaaggatgtcactggcgtatcttag
+ ttgcgattcagcactgagatggcaatcgcctctcgccaagcaatataattgagatcccga
+ catgaagctctccattacatcactgatttttaaaggatgataatatttttcatcaaagat
+ cctagcaattgtatcaaaaccaactaagtagcctatatctgcactgcatcctttaacaaa
+ ttgctcatgtaatattttgtccttgtcgatgaatttcgagaaaacagtcaacccgacaga
+ tactacagattgggggagcttctcctgaaggtagtcagcgaataagcacatcatttccag
+ acgagttggaaaagatgccggcttaggcaacttatctgcattattcaccgctaacagtaa
+ taaaacatgcgaacgggtgtccttcaatttgaagtttttaatagtctgcgatagtagttg
+ aaaatgggccagatgtggtggattgaatgaagaatcaaggacaaataacttctgacattc
+ tagatgctggggtcctttgagaacctgaaatagtaaattgctttttcgaaactgctcgaa
+ cgtcttcttcatattagtctgctagtgcacgtactgctatcaaccgtcattgattaagtg
+ tccattttccaaagaagtaggagattatataaccacgagtatgacgtcttccagggaaaa
+ aaaaaaaaaaaaaaaagaaagccaaaataaggagccttgaaatgtgtatctgatatatat
+ aatgtgtatgtaaatatctatttaattagccattgggatttcaacttcttgtttgaaaca
+ gaaggacggataaaatacgtgatcaccaacaagacgaatatagtaataagcaatttaccc
+ ttttcgaaagtaggactcattaaatcaaattggcccgaaggtgtgatccttgtacaaaat
+ acatcaaggcctagatcacatataatgcttgtggattccagattggttgggatggagatc
+ aactgcgaatcggatcctggcaatagatttctgaaatgagtgataatgaaattatcatta
+ attgggataactggcgtgtatggggtagccataaactcttttttcttatccttggccatt
+ tcttctgctggtttacctcttgcattcaataaaagctttggtatgtaggtgatttgtcca
+ ttttctaattccataacgattgcctttgtggtaatatcatccgttgtcttggaaatggac
+ attgttttgataatctcgggaaaaatgaattgtttagtttgaaattgaggtttgttaatg
+ tgtccagtcaatggatcataagaaaaattgtcgcttgagttagacaaacgctcatctggg
+ gttagtgactcatataattccaccactactaacttttgttctggaacaggttcagaactg
+ aaataggaataaacgacccaatattcaccgaaaacaatatccattggaaacctaaaatcc
+ ggagaatccttgtgctcttgggtaatgaggatttctcctgtaatggtatcaattaagtta
+ aaagtgattgtatgatgttcttcattagcgatcagataagccgctaggttggggtacaaa
+ tatttataaagaaccgatttgtcacctagtgtaataccaagagtgtttaggtttgttgtt
+ tccctcttgctatatgccaccattttctctttcgaatttacggcttttttccaagtttgt
+ tttaccgtatcgttttcgattatgtagccataaatgccattcgtgtcatgctctgtgacg
+ aatatgtgggaactagaatggttgttggcaactattggtacatccgtattcttgccggga
+ ttcaatttgaacagtaaatgatccttatcctcaaacttaatcagatactggtgatcatgt
+ tcattcagcctaataacggaatcaacagttttcctttcagataggggggatcttgatttg
+ ataatcggttgatctttagtaacttcaattgtcaaataatgcccatcatgcgaaaaaaca
+ acaagttcgtcatggtttttatccgtccagaacattttgacgccctgatcaatttctaaa
+ tcgagtttccagataagttggcccttattgaccatatcaaggccgccgataaagccatca
+ tgcgtcaaaacaattaagattttggcgaatccgaacttcaaatctgataccgtggtatcc
+ ttagcgtctagagctaggagtttagtgaagacacgtcctggtgagaattggttttctttc
+ aataagttgataaggcgattccaattagttgtcaatctaagccagtaagcattccaaagc
+ gaatttgaatcaagttccgctttcatatctttgttcaactccacatccctagaatcgaag
+ acgtccaatacagcatagtctaccacgttagtcaaccattcatctctttgccaaacgggg
+ atcaattctgtgccattggcagagaatcccatatattgggtatcattaacgttcaacact
+ aacgccaggggatcttcacgatgaaaatattccactttattgaacccttgaggtaactcc
+ aatttccattctgccagttttgactgctcatcgagaatatgtagttcattgttcttaata
+ gatacttgatcgtttaaatatgaggattgaggtaaaatggtcgtagaaggggcattatct
+ acgccttcctctagtaaaaaccaatcgtgtaaatcatatttctgccaatggttgcttgaa
+ gagttcacacaaaccattgcgttatggtcattactatccagttgaatctcatcaatggta
+ aagggtaaaacgtttctgaaaagaatctgtccggaagaaacgttaaacgaagaaactaag
+ caggaagtttcggtagggttcgataagatgagaaccctgttgcggtctcgagaatcaggg
+ atgactttctcccaaggacctaagttagccagttgccaatcagtgataaatgcatcatct
+ gaaaaaacggcttggacacaactcgtgtttaggaagagtaaaatgaagacgtacaccaag
+ tctgtacacgttatcttcattgctatgggggaaggggaggatgaaagtgttgatatgaat
+ gtaggtattagttattaatggagtgtatatatatatatgttattatatatttgcatatat
+ aatatgaaatcccagccatattttctctggtagccgtctgaaaaatcacggtgtacgaag
+ aaggatttaatatacgcacggtacaactaagcaatccgcaaagacctaatgtgtcttttc
+ attagtgagagccttgggggtaggccctggtaacggctggtccgtgctagtgggtgtctt
+ ggtatgggagggcatggtagttggtatgaatttgatgcttagcgaagtttctaaggccaa
+ agtcatccatggatcataaagcgatcttattgtatcgagatcaatatcttgcgggccttt
+ cgtgtcatggggcaaaatgacacctcgggagagtgtggtgctggatctcttctttactgg
+ tgcggaaatggaatctcccgaagatgctaagaaatgcagtcttttttccacattttgtag
+ atggagttttttggactcggaaagaccttcaaggccagtagaagaagactgcggattgtg
+ ggatatggaattgatgagttcaatggaattttctctaatgtttcttttattttgaaaaaa
+ ctcgaacatgtccaagtgaaattccatgcgattgaggtactcgttgaaagacatgggctt
+ gtattccaagggtccgaaccatttaataatgggattctgcgcctcaacttgcgctctagc
+ gagagcgacgctttgtttgacctcagggatatttgtcgagagaggagtaccgttcatagc
+ atccttgacttttattttggtggggtctatgccctcgtgataatatttgacaatattgtt
+ gaaaaacttcttgtcacgtacggatttcatttcgtcgcaccaggggtcgtatttaagaaa
+ gttcaggcgagtaagctccaacagtgactgagaaagtagcaaagagccaaacaaccccca
+ taagagctttctgacgctgacgaatcgagattggcgaggtatgatgggaggaggggcgag
+ cacaatgtcaccatgggtggtggagaagcggttcaaagttggaaagtcagcaatatcctt
+ tttgattttagtcttgccattgacaatcatttcactgggtagttcgccaaaggataaatc
+ cgagtgagtgccagtggcaatgcggcgcggtggcggtagtttgtagagccttctcagtct
+ ccaaagggcaaagaaacccactgcagattggatagtgattagagtgtatagtcccggtag
+ gttatcaggagcgggtacgagcggaccccacagttttaagcctaaacttggccttatgta
+ gaatttcttgatatcattatcatctttgtcgtggtcatcttgtgtgtgagtatggtcggt
+ ggaattgctattacctgatggaggagtaaatacagccatcttttattacggtaattggaa
+ aaggagagggaatggaggaggatgagataagtagtttcctaagcagtaaagcaagtgtcc
+ acatttaagtaaaagacgatgccatgcacttattattactacgtggtcaattttctcttt
+ ccctaaggcgttcgtgcagtgtgacgaatatagcgaacaactattgtgtttgaattttaa
+ cgtttatctttttatgatttttttaaaaaaacttcctagaaaatttcttatatatctcta
+ tttaatgaaaaaccaaagtgatcagaattacaattcatcgtgaatggcatcttcttcgtc
+ agccaattcagcgtcagcatcggcttcctcagcagctttttcctgggcttcttcgtacaa
+ ggccttaccgtcgacgtcgaagtgaccgttttccttgatgaagtcgaataaagagtccaa
+ ggatcttgaaccttggtacacaacagattcggacttcttaccacctgggtataagacgat
+ tgttgggtaaccttcaattacgacgcctctgacatcgttttcagtgtggtctagtttagc
+ aatcaaaacgtcggatgtggcgttggcgtaggtatcagctagttcttggtaagttggggc
+ caatctcttacagtgaccacaccatggggcatagtacaaaacaagaacgtccttctttgg
+ gtcgttgacgatttcgtcatggttcttaccgaccaattggaagacagaggaatcttggtt
+ ctcgaagatctcttgggacttcacgattggggaggcatcacctttcaagaagtccttaac
+ caaagattcaatagccttagactccaacacgatcttgtcgctcaattcgtcaaacgcctc
+ ttcagagagttgaggcaaaccgtacttcaagtcttcagtcatgtcgtggatggcaaatag
+ agggaattgttccttcatgttcaagttgccggcgtgtctgccgaattttctggcatcgat
+ gctaacaaagttcattagacctctgttctttttggccaactcggtaaagagaggcttgta
+ ttcttccaattcttcctcgtcattgtagaataagtaacccaaaggcaaaccgctttcgac
+ gtattgggcgaaaacggaaccgtcgatttcaccaaagtagggcaaggcttccacttgcaa
+ ccatttttcaaaaacatcagcgtcagcgatatcggctttcttaccgttgtatactacagg
+ ctcgtccatggcggagggcaagtaaatagaaagcttgaaatcatcgtctgcgttttcagc
+ ggagacaaagtcgtagtcgttgaagtgtttgttggccatggagtaaaaggtggcgttgaa
+ gtcggcgtcaatcttaccggattggacgataactggagtgacaaaagtctcgttagcaag
+ gtaagctggtagatcagcaacaacggcgacagccggttggctttgcttgatcatgaattg
+ gacaatggcctcggcagttctaggtccctcgtaatcgatcgagttgttaacatcgctgtt
+ tttgaaaatcttcaagcttgggaaccctggaatgttgtgttccatacacagatcctggtt
+ ttcagtacagtcgatctgggccaaggtaatgtttttctcaactaaagtctcggcggcttt
+ aacgtattcaggagccatgttcttacagtggccacaccatggagcaaaaaactccgcaag
+ caccaagtcgtgcgactgaatgtactcattgaaggagtcggtggccaacttaacgacagc
+ ggagtcttcaggggccacagcctcttgttgggcgaaaacagaggaggcgagcagcaggga
+ ggaccatgacaggacggcaccagcagaaaacttcataacgggatagatgtatgtttttct
+ ttatgtagagcttggcagctttaaaattaccaattagacatgctgcttgcaacaagaaaa
+ tgcacgcgtaacaaaatatatatatatatatatatatatatgtatgtccatacgggtttt
+ tcgttttgttgtggttgtaacagcacaacaaaatgctacacggtggcaaattggaaaaag
+ agactagtgagagaaaagggaaaaagaggcgccgcccgacagggtaacatattatcacgt
+ gcagcccaggataattttcaggacacgtgtttcgaaaggtttgtcgctccgaaaaaatca
+ aaaaaaacaaaaaacgggaaataacaataacgacaaaaatggaaaaaaaaaaaattttag
+ acgcggcgcttgcaccccgcattataagtggtgtgccgacggacggtcaaccctttcagg
+ gggccccctatcttgcgtggtgtcacaccacgctcaaacggtgggcactaatgaaaaggg
+ gcccatataaatatccgctatcaacagaacccccaacccccccatcagtgcccaactcag
+ cttccgtaaaccacaacaccaccactaatacaactctatcatacacaagatgtcattcga
+ cgacttacacaaagccactgagagagcggtcatccaggccgtggaccagatctgcgacga
+ tttcgaggttacccccgagaagctggacgaattaactgcttacttcatcgaacaaatgga
+ aaaaggtctagctccaccaaaggaaggccacacattggcctcggacaaaggtcttcctat
+ gattccggcgttcgtcaccgggtcacccaacgggacggagcgcggtgttttactagccgc
+ cgacctgggtggtaccaatttccgtatatgttctgttaacttgcatggagatcatacttt
+ ctccatggagcaaatgaagtccaagattcccgatgatttgctagacgatgagaacgtcac
+ atctgacgacctgtttgggtttctagcacgtcgtacactggcctttatgaagaagtatca
+ cccggacgagttggccaagggtaaagacgccaagcccatgaaactggggttcactttctc
+ ataccctgtagaccagacctctctaaactccgggacattgatccgttggaccaagggttt
+ ccgcatcgcggacaccgtcggaaaggatgtcgtgcaattgtaccaggagcaattaagcgc
+ tcagggtatgcctatgatcaaggttgttgcattaaccaacgacaccgtcggaacgtacct
+ atcgcattgctacacgtccgataacacggactcaatgacgtccggagaaatctcggagcc
+ ggtcatcggatgtattttcggtaccggtaccaatgggtgctatatggaggagatcaacaa
+ gatcacgaagttgccacaggagttgcgtgacaagttgataaaggagggtaagacacacat
+ gatcatcaatgtcgaatgggggtccttcgataatgagctcaagcacttgcctactactaa
+ gtatgacgtcgtaattgaccagaaactgtcaacgaacccgggatttcacttgtttgaaaa
+ acgtgtctcagggatgttcttgggtgaggtgttgcgtaacattttagtggacttgcactc
+ gcaaggcttgcttttgcaacagtacaggtccaaggaacaacttcctcgccacttgactac
+ acctttccagttgtcatccgaagtgctgtcgcatattgaaattgacgactcgacaggtct
+ acgtgaaacagagttgtcattattacagagtctcagactgcccaccactccaacagagcg
+ tgttcaaattcaaaaattggtgcgcgcgatttctaggagatctgcgtatttagccgccgt
+ gccgcttgccgcgatattgatcaagacaaatgctttgaacaagagatatcatggtgaagt
+ cgagatcggttgtgatggttccgttgtggaatactaccccggtttcagatctatgctgag
+ acacgccttagccttgtcacccttgggtgccgagggtgagaggaaggtgcacttgaagat
+ tgccaaggatggttccggagtgggtgccgccttgtgtgcgcttgtagcatgatcttttta
+ catttttttggtttgtgtacgtatcccaccgtacttaccatcttctctcctttatatata
+ tatatatatatgtatattttcagtgtatatacatacattcttatacaataccgtataaga
+ acgtatgtatgtatgtatgattttctcacattaatgaacaatatcatcggaaacgggcct
+ ttggtttttaagggaaaatgaagaacaataaatgtcttgctaggtcttcctttatcattt
+ tccaactcaaatatgaaaagcagctccaaggggcgtactacttcaactaataaaatgtca
+ cacactaataagatcgcatacgtgttgaacaatgacacggaggaaacagcctcgccctct
+ tccgttggttgttttgacaagaaacagctcactaaattactgatacatactctaaaggag
+ ctgggctacgactccgccgctaaccagttactgctagagagcggcggatatcaaaatgag
+ tctaaccacatccagactttcttcaaactcatcaagaccggccaattccatcttattaat
+ tggcagattgtttgctcgctgccccttgcccatagctctccgctaaggtccgaatggctt
+ caaaggctgctcattcccacgccgacgcccgccacgacttcacttttcgaccacatgctc
+ ctgcaactgcaatacctgcaacagctgatgagctctgtaaattcatctacttgttcggac
+ gcagagatcgccacgctcagaaattacgtagagatcatgattctagttaatagacaaata
+ ttcctcgaattcttccatccagtcacaaattccgcctctcacaagggccctcacactgca
+ ctgcccgtcctgtatttgcgcaaaatcctcaaaaacttcatcgagatatgggattccctg
+ ctggtgtctaacgatcagttcctaaatgaagaaaacatcttcaatccggaaacgaccttg
+ agagaactgtcgacgtacctgaccaacccaaaactaaccgcgcagttgaacctcgagcga
+ gaccatctgattgacgccatctccaaatatatcgatccgaacgaactcgttcccaagggt
+ cgtctcttgcatctcttgaaacaggccatcaagtatcaacaatcacaagacatttttaat
+ atcatcgatccggatgatgacgcctctttctcctctccaccccaccggatcaaccttttg
+ caggataacttctcacacgatctgactgtgacctttcaagaatggaagactatccaagac
+ accacagacgaaatttggtttttgacattttcgcccaatggcaagtatttggcttctgcg
+ acttccgagtcttcaagaggctacttcattactgtttatgatgtggaacaagacttcaaa
+ atttataagacttgcgtgagcttgtcacaatccgtattgtacctcatgttttctcctgat
+ agccggtatctggttgcttgccctttcagcgaagatgttaccatttacgacatgaatgcc
+ acctccctgcccgatgcgtctgccacagactcgttcctactatacccttctaccaggctc
+ tcgcccatggactcgtttaaactggacaccaccacgtacccagatgataccgaatcatca
+ gcatcatcttcctcgaggccggcaaacgcgaattctaatcaatcaagagtatggtgctgc
+ gatgccttccacactgcggaacgtgcgggctggatggtggttggatcgcccgacagagaa
+ gctattgtacactcactcaccacgaaagagtcactttttagcttaaaaggcaggacttgt
+ atcgcgttgggccacgatgaaaacatctctgggagaaaatcaatcgatcctgcaaaagtc
+ ctttacaaaccaacaagtagcaatggtaactggcaatatgtggaggacgacgaaaccttt
+ ccacgtgtgcacgacgtgaaaatcagctatgacgataaatacgtgcttttaatgacccac
+ cagggcgttatagatgtctacgatttcagtgggttcccctccaaagaggagctatctaag
+ caaacagtggatcccaaaaactttctgattccaaggatcgcaagattggacgtgggcaaa
+ aacatgacttgtatctcattaccgctaaataccacgcatcaaggcttccataggcaacaa
+ atatccgagtctcaacatttggtattagttagccttcaagataacgaattacaaatgtgg
+ gactataaagaaaatattctcatacaaaagtactttggacagaagcaacagcactttatt
+ atcagatcttgcttcgcctacggtaataagcttgtcatgagcggatcagaagatggtaaa
+ atttacatttgggatagaattagaggtaacctagtatctgttttatccggtcattccacg
+ gtaatgagtaattccacaaaacccatggggaaaaattgtaacgtagttgcatcaaaccct
+ gcagataaagaaatgtttgcttctggcggtgatgacggtaagataaaaatctggaaaatt
+ tcaagaaattaatactccaccttatatatatacgtaagcaaagttttatgtaacaaaaaa
+ aaaaaaagaactaatccaaccttctttcactttcaggtaaaacttgactgagttcttcag
+ cctctcttctacctctcttgacatccaaacagtttagcacaggcaatgatagcatcaaaa
+ gcaaaaagaagaaataaaacgaatagcgaatattatgcgttttatcggtaagcagtccaa
+ caaggaagggtcccaggatggacgaccccttatctgtgatactgaacatactaaaaaacg
+ tggattcttttcctggaggtacaatcaaactgaaaactgatcttgaaaccgcggacaggc
+ cacctagtgataatccataccaaatggccaataagaacatttcaaacttatgctttaaac
+ cgaacgcattgaagaaaaatccaagaataccataaaatggtatgaaacttgcccaaatga
+ taatgtacatcaaagtttgactagaagtccaccgaaactttgtggcaagaaattgtggaa
+ tcataaaggcacccagcattgcatttacaacggtcaaaacacttatcatgattaaattga
+ gggtactcatgtgcagttctgccttggagaacaaaaccgctgtagaatttatagttgtaa
+ tggaatcactaataataaaccacgcaataagaaaaatcatcacatctttcaatagtctgg
+ catgtttaaacgattcgaaaagcgaaacccaaccatatgaaaggtaattcttccagttta
+ gttgacccaaggcgtcttgctcaccgggatacggactgcgggcggatgctaatgtagaat
+ cgtcaactcttatcggtattgtcacatcgtcaatcaaccagatcatgggcagttgccaca
+ caaaccaccaaatcccaacgaaaagaacagccacttgaacatcctgcttactgccctttt
+ tagatgcgactaagaacatagatacaatctgaacaatgagggcacttgaataacctaaag
+ atgcacctctaccgctaataacagtagttaacgagtctaccttatcaggttcataagctc
+ cttgactttgacatttcaaagaatccttgacaaaaatgggcagaagagaattcccaacaa
+ cattgataacgccaaaacaaccattggccactatataaagcccagccagagaataaatct
+ gggtatcattcaattttgaaatcgcaacagtcgacaatgcgcccacaataccaaaccaaa
+ ccagaattctgcctttgaatttaacgctcccccagagatctactatccctgaaacggaaa
+ tgactattatagtttggaataaaacgctaagggaaaagacatataatgcaaaacttgaag
+ tatctacgaagatccgattgttgaacaaaccaagaacacacttatctgaatcactacccg
+ tttctgacaggcagggtatggagtgatcgtgaacttttacaccatttatactcgcaaatt
+ gctgcagtagtaagggaatatacgttgaaaccgcagaaacgacaaatggttcgctagaaa
+ atgagtaagcgtaccatcccttgatattgttttgggctttttttattcgatagttcgtta
+ ccgattcattcatatcatttatagttccatagctcatatctaatgtagagaatatgttat
+ gagcaaactattgcttataagggcaaaaaccgtggtaatataggtacagtttgaaatctt
+ tccttttgtttacactacgcatctttcatagctgaggtattttcgtttaggaaaataccg
+ ttttaaggaattaattttagcaatttcctaaaattgttgagcggcgctatcaatgccgca
+ caatacaacaattaataatctaagacacgttgcattatacatgacagaagctaaagcatg
+ gtcttagatgacaataaaaaggaatttgacagcacaaaagagaaaaaaaaaactatacat
+ aaaattttttacatatgaattgcttagagtaaaataggcatttttcatatcttcttccca
+ attattaaattagcaaaaaattgaataaaaaaagcaggtggtttaagaattgcgcaggga
+ gaagaaagaatatcatatctcatgatggtagccacttagacagagagaaaacgaagtgag
+ aagaggctagtatttttatatcttaaaaaaaaaaaaaaaacaaaaaaaaaaaaacataaa
+ tatttgtatcttattgcttatgatgataatgtacaatgaattcaaaacatgaaaattgtt
+ caaagtttttatttttattttctctttcgtattaaactgatggaacaaatgaaaaaaatg
+ gataaaaaaacaaaaaaaaaaaaaaaacaaaaaaaaaacagctttgctggaggatgatta
+ acttttaatttttttaattattaggtttttcaattttttatggttttttgtgtaaaatgc
+ aatgaacgaatccattttttattttgcaagtgtgagaggcctttataaggaagactcgga
+ ttcaccggcagcaaccggcgcagagtcttctttggtttcttgttcttgagaagcgagttc
+ agtggatgattcctcttcttgttgaggaacaggtggtagggatatcatgaattggtccaa
+ tgcatcgccgaccagttctttttcaatattaatttcagtttcaaaagtttctggtaacca
+ gtttgaccagctcttggaacgaacgaagtacttatccaatggagatggttctttggcttc
+ accggtaacgttatcaccttccttggcggccaaagtaccttctgcaacattgacggtagc
+ ggcttcgttagcgacaatttctctcaatgcggctaaaatcaggttagtgtcacctccgaa
+ ggacatgttaacaactctgtaaaacttagagattaaagacaatggagcaaaaccatcctt
+ ggagagtttggaccttaagtaattgtcgacggtcaagttttcttcgctgaaatagtattc
+ aatttgtctagcaatgttgttgatggccattaacactggttggactggataaaactgttg
+ ctgtaatttgaccataggttgttgaggatgttgttggttgtgatggaagtggttgtggta
+ tttagagcggttgttgttgttattacgagcgttgcgtccttggtaaggtttaaactgtgg
+ tgggaagccttggttctgaggcataccggcgttattactgttgtggaactttctcctttg
+ aggaccattgctgttatggtgatgactgtggtgatgattacggtggtgaggtggttgttg
+ ctgttgtgtagattgaccattttcttgagaggtagcgtcttttgaatcttcctcacctac
+ tgccgattcaaagccatctttcttcatttgttgttgcttcttagcattatgttggtgatg
+ cttctttttcctgttagcattactcgacgatgtcgacgatgctgcagttttgttgttttt
+ gctcttgttgctgttgccattacttgcaccattcttggaaccggatcttttggtgccaga
+ gacggtaatggaggccttgatgggaacccacttggtagcagtcgatgatttcggagtggg
+ tgttctgttcttcttctttcttgtggcatccaagtcttctattgagatagtagaaacagg
+ gatctcagtaggagcaagtttccatggggaagatgtgggcaatggtgccggcgtcaagtt
+ gacttgcttgctcttggagctctcttgttcttgcacttctgggactggggcagtagcagt
+ gtttgcggcggcggtttcagcagacatcgtgggcagagaagtgaaaagaagatgaaacac
+ gttagcaggtccagagtaatcctgatgttctattaccgatctaggatcccaaaagatctt
+ cataaaaccgaaaaagatgttaagaaatgtttacacttgtacttttctttctttgttctt
+ gacctaccagaagtgaaaaaaatttttcattgcgtgggtacatccgtgcaccatttctga
+ aaaatttgcgatgagcatccctcaaaatccgtattatccgctagtgttttgaggttcaac
+ aagacaacatataagaagggaagaaggaaaggaggcaatgagaacttgggtcttgctcga
+ gagtccagatccacatgcaggttcaaaaaatggtgagagataacagtaataacggtagcg
+ ataaaagcgtccattgggagaggaggaataataacggcgcaggcccccgttatcgttcca
+ gaagcggtaataccggtgctttggcaacaaaactaagtaatgggacgctctctgtcagag
+ gattagtgaaggaccgaacaggaagcggcaagatcgcgggctgtgtggaggcgtttctgg
+ atgccaggacccaattgaatacgccctgggaccgtgctaagtgcaattggctggaccaga
+ tagattactatgtacagttgagaaagaccgcgttttctaaggaattggaccaactaagga
+ agcccatgatcgatgcatatgtggcggagatgaggcagaagtttgatgcctcctatggac
+ aatccagggcgcaattggaagccaaactggcgcaggtggacagtgaatggcatatggtac
+ atggtgatgtgcatgcaaaactggaaaaactcgtggaagaacgccggtttttgaaaagat
+ taagcgacacgatcgtaccacccaggtccaaaagatcacagcggctgtctccattgacca
+ aagaggaccgagccaactgtatctgtccgcagcccaaaggaatgagcgacaccgcttggt
+ tcgaagccattcagaagaaaatgttaggaatgaatggtaccatcaagctcctagagacag
+ aacagaaactactggctgacgagaaaaacagcgtgaggaagacgttctggcccatggtgg
+ aagcacattcacgctcgaatgaatttgcttatctggagaaatgcatcaggctgatggcct
+ ctcagagagcaatatgcttttgtcttgatatagaggctttcgaaacaaaccagaacgtaa
+ tcaccgaaattgggatttcaatttatgaccccagggaaaatatggtgccgtcaatggttc
+ caattacaaagaattaccacctaattatcgaggagtccctggaacttagaaaccaaaaat
+ gggtctgtgactacaaggattgctacttattgggagaaagctatgttttgagcttgaaag
+ agtgcgtgcatttcattcaatcactaataaactattacttggtcccggtgaccgaagaag
+ acaagacatggtcaagggcatttgttggtcatcacgtgagcggggatcttaagtggctgg
+ agactattggtgtcaaattccctggcagagggtatgaaggccatctggaccatacgctgc
+ ttttggctgaaactcccggtgatctagacgtgttcatcttggacactgagcagttttaca
+ ggaaatcgtatggcgaaaagggcagcagtctgggcaagattctgcggttgttcgagatac
+ cgcatgcgtttctacacaatgccggtaacgatgcctactataccctgcatttgttcatga
+ agttttgcgatgttaatttcaggaaaataagcggcatggacgatgttcttaaagtaatgg
+ gccaagtaaaagtttggggagaacgagacgtacgagagcctaaagtggtgcccatgtcgt
+ atgccatctccatcgaggaggcagtcaaaaatcggacgtaccgcaagggcgtcaagagca
+ gtaggaaggaaagagtctgccaaacggaattcggtgggttaacgtatttcggaactgcta
+ aagacgccttcacaagcactcttccgacacactaaatatattcatcgatttttttcctgt
+ aaatacaaacacatgcttgtatacatatacatatacatatatataagctcattataaacc
+ tgtgtgcatggaaaaaactttgtctgcccttaatttgcaagaataggttctaacaattcc
+ tccaattcaccagtctccctcaattcctgcaagtcgtcgttgcctccaatatgtttacca
+ ttaatatagatgtttggcacggttctttggccattaatctcatataacgcagcctgaatg
+ tctgcgccttccttcatgtcattcaattgcaaaaccagaactttggacctgggaaccttt
+ aacttttcaaaaagcgtgtttagggctgcatggcagtatggacagtacgttttggatgcg
+ acgaagatctcgttttctgcaataaggtccttgacgtgcttgatagtttcttgagatacc
+ attttttttccttctgaggttttgtctatttgtataattatgcgaatgctcgtataagta
+ ctgcacgaagttctttattttatgctaaagtccctttttatattatataaacttttcatt
+ tattttatatatttactcctttcaagacaaaattcccacatcacgtgaatcaggaggcgc
+ aaggggggtagtaaggcagtagtagataaagaaaacaagcatcgcaatctggaaacgccg
+ ttgcggctgctcgttatctgtagacagctcacttgtgcatgaagttcatgttttatagat
+ gatggagccatacttcaaggtgataactatgaaactgttttgaccttgacgtgtaaagtg
+ atcaaggtaatagttaaactttatatatgttgtttatcaatgtggtatcctttctaagga
+ tgaagacaacacgattgtgacagtgtgagagagtgataaagcattggaggcaggcaaaac
+ aagaacatggggtttctttcggatcatccgcatacagctatcaccgagacgatctttcga
+ attgtctcttcaagggactatacgctagaagtggagttagctcccttgattcagcttatc
+ aaagcagatcacaacgattataactataccgtgaaccaagaagaactgcgcgagcactta
+ gaaagaaaataaagtatgggaaccggctgcagcagtccaggactctggacctcttggatc
+ tgttcatctcacaaggcgtgaagtttacagtcatgtacaatgatgacaagctgctgcaaa
+ ggctgagagggatggctacgaactctgaaaacagcgggtccggtgaaaagtatgagccta
+ ggattattaagaagtgcgcggcgtatgctatctcgtggctaaactatattacgcagaaca
+ accttgagaatgcgagggcgtattcaggtctttatcaattgggtcaaacagtcaaacaga
+ ggtactcgaagagcagccgttcccgccgtagcgcgaggctctggcggacggtctaacttc
+ atggatgatagcgccgacgacactttgtaccagtccaactcgctgactagtgccgataga
+ ttgtataggattccacagatcaacatgaacaaagaggcacccaggattcggctcataatc
+ agcgatgctctcgcgtctgctgtgtcgttacagaactcgctgatcgggctgcccaagggc
+ aagttctccacagacgatgaagaggccacgtccaagttcatccaggcaagggcgatcaga
+ aggaaagttcttagatatctgcaattggtcacggagggtgagtttctgggtagcttaatt
+ catgccaatgatgaactggtggctgcgttgactgcatacgacgaccgaagcgctcaggac
+ gatagcagcgatgaaagtgatcacggctcgtacgatgacggtatttatgatgaaaacgag
+ caggataacagcaggtatattgatagcgagtcctcagaggaagagagcttgtcttcttat
+ cagccgtccacgatctcaaaccctttcggtgatcataacaaaatttgaagaagtatgtac
+ gtatgcatacacacatatatatatatatatatatagtacatttaatccttcttgaggttt
+ aaagacgcactgttcacacagtgtctggtgtccttgggcaagtttagcaactgtttccag
+ ccttcaccttcaaacacatgtcccaagtgtccaccacaccttgcacaacatatctccacc
+ ctcgcaggcattaaagaattgtcacgatgatatgtgatggctccaggggatacctcttcg
+ tagaatgcgggccatccacaacgagcgtcgaacttggccttgctcgaatacaacggtctg
+ tcgcagttggcacaatggtagacaccggactcgttggtgtgtaaatacgcaccggtgttg
+ ggcctttcagtggccttatctctcagcaccatcagctgcaatggtgtcagggcatcgttc
+ cacttcacgtcattcgattcgtcactcattttcttgctcttattccagtactgcgtcagt
+ acaatgtttcttctccctggaaaagtcctgcgtacagttataacgtacagcctgctccac
+ ttattcattttttgatattgcttcctttctccctttatttaaactatcatcggcgacata
+ tttcccaataggcaaagctgaaaaatactatcgctaattataacaaagaagctaggtcga
+ aggactcgaaaagacagaggtcgtactagcagagatagcaaatcagatggaggacggtaa
+ acaggccatcaatgagggatcaaacgatgcttcgccggatctggacgtgaatggcacaat
+ attgatgaataatgaagacttttcccagtggtcggttgatgatgtgataacttggtgtat
+ atccacgctggaggtggaagaaaccgatccattatgtcagagactgcgagaaaatgatat
+ tgtaggagatcttttgccggaattgtgcttgcaagattgccaggacttgtgtgacggtga
+ tttgaataaggccataaaattcaagatactgatcaataagatgagagacagcaagttgga
+ gtggaaggacgacaagactcaagaggacatgataacggtactgaaaaacttgtacactac
+ tacatctgcgaaattgcaagaatttcaatcgcagtacacaaggctgaggatggatgtctt
+ ggacgtaatgaagaccagctcaagctcttctccgattaacacacatggagtgtccactac
+ ggtaccttcttcaaacaacacaattatacccagtagtgacggtgtgtctctttcacaaac
+ agactatttcgacacagttcataaccgacaatcaccgtcaaggagagaatccccggtaac
+ ggtatttaggcaacccagtctttcccactcaaaatctttgcacaaggatagcaaaaacaa
+ agtaccccaaatatctacaaaccaatctcacccatctgccgtttcaacagcgaacacacc
+ ggggccatcacctaacgaggcgttaaaacagttgcgtgcatctaaagaagactcctgcga
+ acggatcttgaaaaacgcaatgaaaagacataacttagcagatcaggattggagacaata
+ tgtcttggtcatttgctatggggatcaagagaggctgttagaattgaacgaaaagcctgt
+ gatcatattcaagaacttaaagcaacagggtttgcaccccgccattatgttaagaagaag
+ aggtgatttcgaagaagtagcaatgatgaacggaagtgacaatgtcacccccggtggaag
+ actctaatgtgcagttgtcatgcacatcatcatactaaacttacacgaataggataacat
+ gtatgctagcagaatatatatgaaagaaaaaaattattgatgcctttaaacttatactat
+ tatactatattatgttatattatattattagttttatagatatattgagatatgttgaat
+ atgatgacgaggatggtggtttaagtgtatggattgaattttctcttagctttcattacc
+ ttgattctttcttgatcttccttaaatttacttaacagttgattgatctcctgtttttta
+ cgttctctgacctgaaatctatagaaatctttctttgcctttttatctatatttgaaatt
+ ggcttggccttattttcatgtttggataatggatttttgtttaatattttctttcttatg
+ gaattcaatgacttggtgttctttcctacaactaatgtgaatccatcctcatccactata
+ gaactttgaacatcctcttgtgcttgagcttcacgctgttcaaaaattgccatatgtgta
+ tgaatatcttctttcaaataatcaatatccagtggtttgtagaagttaacaaaagttgtg
+ aatgatggagtcgtatatgtccattcaaatagttcatttggatgtttggcatgcaaattc
+ gagtattttttcaaagcattccagcagttatttatacttgcagcatcaacaaattttaat
+ agcgccgtgtttcttggtgtgtatctcttctcgttgacgtcagtggaggacatcagatcg
+ gaggtcaatgccgataaatctacttcatgtaatccaaattcatcgttatatagtagttcc
+ tctacatgcgatactgtatcgtatttcccacagagctgcccgacaaatttcttcatgtgc
+ tctatgttagataataatggaaggttgaccaaaaacaaacagtcagactcgttggaattt
+ gaactctggtgtcttttagcaaacatgaaatgcaacgaagcttcctggcttttgggtagt
+ gccttgtgatccggtaatttgaacggcaccactataaacccgttcttcatggcgctaatg
+ tcttcaatacccatcccgtaacactctatctgaggttgttatctgttacctagttcgttc
+ gccctaaatgcctcttgctttaccatcctctcaatgttctcttctttttcctgcttattt
+ cttaggtgaccaaaaagtgaaaaattttccaatttaacttacgtcgttcgaagtgatgac
+ aataaggatattcatttattaatcgctatttgatacccactcttgctactacctctcttt
+ taatccaaaattacaatttttacgttaccattcattacatagtgtatctctattcattca
+ agactgtattttgtttgatatatatatgtatatatacatattattttcgttagtgttcgg
+ tttccaagttagaaataatctactggaaatcctttgggatcaacccaagcttactcattc
+ tgattttcacagcgtttctgtgaccgtctagaccctccttcttggcaacgcacataacag
+ ctctaccgatgttttctaaaccttcaggggtaatgttttgggcagtgataaacttttgga
+ aggttgcagtgttggcaccactgtactgcctagcgtaaccataggttggtaatgtatggt
+ tagtaccacttgaatagtcaccgcacgattctggagtgtaagcacccacaaatacggacc
+ ctgcattgtcaaccaatttaacataatcgttagcattggcgatttgtagaatcaaatgtt
+ ctggtgcatattggttggacatttcaagggcttcttcgtaaccgtcacaaagaacgatcg
+ tactgtgagcaatacatttacgaacaatatccacacgtggcagttgtaaagcttgattgt
+ ggacagcatcttgaatctcttgaattttcttttcgctcaagttaacaccaacaaggataa
+ cttgggagtcaataccgtgttcagcttgcgatagcaaatcacttgcaacaaaatccacat
+ cggcatcttcatcggcaataaccaaaacttcacttgggccagctggcatatcaatggaac
+ atagagcttgagtgtcattttgaacatacattttggcggcagtcacaaattgattacctg
+ gacccaagatcttatccactttaggaatagtttctgtcccgtaagccatagcagcaacgg
+ cttgggcaccaccagctagaacaatcttggaagcgccaactttttctgcgacataaacaa
+ cttcgggtgaaactttaccatcagattttcttggtggagatgcaaacacaatctccttac
+ attgggcaacttgtgctggaacaccaagcattaatgcagtacttggtaaaatggcagtgc
+ caccagggatatacaaaccaactttttcaataggacgagggaatctggaacacaagacac
+ caggttgggtttcaacttcaagagtctctgttggcaattgagcagcatggaatttgcgga
+ cgttttcaattgaaaggtccaaagcttccttcatttcctcggttaaaccttcaaagtatt
+ cttctgggaatggagcattaagaacaggattggataattttacaccatcaaacttttctg
+ tgtactccaaaagggcagagttacctttgtctctaacattttcgatgatcggattgacta
+ aatgcataatttcagaagtcttttggattggtctcctcaaagccttctgcacaccaactt
+ tgtcggaagccttcaccacgtccaagtgaattggaccggtcagtttttcttcttcagcct
+ ttggttgtccaacaaactttggcttagcatcacctttccgtcttgtaaccttcagatgct
+ tcatattcagattattctcgacgtccttcaatgaaacatcgttggccactaatttggcca
+ gtgcaaagtagaacaaatcggcagcctcccaagaaagctccttcttaccctttgcctcag
+ tcagttcttcagcttcttccttgatcttggcatctaacaatgcagagtcgttgaatagtc
+ ttctagtataagattcctctggagcgtcctgtagcctttgttttagtaaagattctagcc
+ ccaccaaaccatgcttgaattcaccaaagcaagacatggtctccaagtggcaaaatccaa
+ cgttttcttgttcaacgataaactttaaggcatccgaatcacagtcagtagagatttgta
+ aaagcttttggccattgccagaagtttcacccttgatccagatttcattcctagaacgag
+ aataataaacgccacgacccaaatcgatggcctttgctatagatttcttcgaagaataca
+ ccaaccctagacaacgctcatattggtcgacaactagggtggtatataaaccgtcaggac
+ ggtctgtacgtacttcaccaagcacttctttggtcaacatatccttgcttaatttcttta
+ tggacacaattttatcttgcgagaatttttgttttaccatgaattgattggagaaaacac
+ cgttctcttccacaacaacacgctcctttggtacattcaattgttcaaccaagtgttcgg
+ ctgttttagcatcttggctagcaatgaacagagaagaaactccgttgttcaagaaggcaa
+ tgatttcatcatcgctgaatttaccacttggcaaggacaaagccaccaatggaacttctt
+ cctctttggagaactggagaatctcttcattactcaggctcgagccatccaaaagtacct
+ gaccaacaagtgaaacgtattccttcttactattccatgaggccagatcatcaattaacg
+ gtagaatcggcaaaaccattattcagaaaaaaaattttgtaaactattgtattactatta
+ cacagcgcagttgtgctatgatatttttatgtatgtacaacacacatcggaggtgaatat
+ aacgttccatatctattatatacacagtatactactgttcatagtcatatccttttctta
+ ccttctatatcgaatgactgataaaaaaacgtgagtcactgtgcatgggtttagcaatta
+ attaactaatttaccggagtcactattagagtcagttcgactgcctagaagaactgctgg
+ ttgtcaggattgtgatgggggcattctgctgtattatgacccatcgtatcgcaatgctca
+ caccactgttgtcttcctgccgtggtatcgactggtgcaggggggtcgaaaattggcaac
+ gattccacggctgtttgtgcttgagcctgttccaactgtttgattttttcattagcctct
+ tcaagttttttcgttaaggatgccacctcttccgatgaggaatcttgtggttttgtcaaa
+ aatagttccttgctcaaattttggtattctttactgagcgaatcgttatgcattttcaat
+ tgttcgcgttctttagcccactttgtcttgtgtaactcaaattggtcttctatgttgcgt
+ aattgttccagctgttttttcaggagttcgacatcttcgttggcaccagtgggttgatta
+ tgagaaagatttctctcttcgttttctttgatctcttcgtgtagttggcttacgacagca
+ agtagctgttcattctcagcgtcaaaaaactgcttttgtttggcttgctgtctgcgttcg
+ agctccaattgttgcttgagatggtctatctctttctctctttcttgtattgtggcttca
+ tacctatcaaaagtcggttgcacttcttcgaggaccattctttggtcatcgagtagcctt
+ ttgtagtgtagttgtttcctttgtagcttttcgatggtcaattgggcttctcgtaattca
+ attgtaacttcgctgctattgaggtcattcatgtggccattgtccggtttccaatcgctg
+ gtggtgttgtgattagcctttctgtctgatgacaggatagagtcgacctccattctgtct
+ tctctgttatcgtaaccaaattcttgctgttgatggtgatccgatgcctcctggtccatc
+ gactgttgattaccgctgtgccgactggtgatccggaaacttctcatgggtgtgggggat
+ ttaggatcatccatgggagagaactggttagtgagcctcacaatagatctgtttttgggt
+ attgatagcggttccatcgtcgttcttctcgaggtttgcgatatcgatgccttctcgatc
+ aatgatgcgactttttgcaactggataaatagtccactttgaggatactctgtttgaaaa
+ tacttcttccccatgaatgatccatcgttcttaccaatgttggcaagtaagtctacacca
+ gcaaacattccagctttcgtgtccactggacccacgtatttcagttgtccgcgccccaaa
+ tttgggatttgtatgaaacatcctatctttctttgatatctatccatggtatttcaaacg
+ catacacagccagccacagctcaacgccttttaccttgtcctttgatgcctgtctcgtcc
+ aaacgtttttggtgtcttggccaattgcccttctgaaaaatctcactgtccgcaactcat
+ taaaagatacccaagcaagctacacgataaagaaaggagaaagttcattaaaagaacgta
+ catatagcgatacaaacgtatagcaaagatctgaaatggatacggataagttaatctcag
+ aggctgagtctcatttttctcaaggaaaccatgcagaagctgttgcgaagttgacatccg
+ cagctcagtcgaaccccaatgacgagcaaatgtcaactattgaatcattaattcaaaaaa
+ tcgcaggatacgtcatggacaaccgtagtggtggtagtgacgcctcgcaagatcgtgctg
+ ctggtggtggttcatcttttatgaacactttaatggcagactctaagggttcttcccaaa
+ cgcaactaggaaaactagctttgttagccacagtgatgacacactcatcaaataaaggtt
+ cttctaacagagggtttgacgtagggactgtcatgtcaatgctaagtggttctggcggcg
+ ggagccaaagtatgggtgcttccggcctggctgccttggcttctcaattctttaagtcag
+ gtaacaattcccaaggtcagggacaaggtcaaggtcaaggtcaaggtcaaggacaaggtc
+ aaggtcaaggttcttttactgctttggcgtctttggcttcatctttcatgaattccaaca
+ acaataatcagcaaggtcaaaatcaaagctccggtggttcctcctttggagcactagctt
+ ctatggcaagttcttttatgcattccaataataatcagaactccaacaatagtcaacagg
+ gttataaccaatcctatcaaaacggtaaccaaaatagtcaaggttacaataatcaacagt
+ accaaggtggcaacggtggttaccaacaacaacagggacaatctggtggtgctttttcct
+ cattggcctccatggctcaatcttacttaggtggtggacaaactcaatccaaccaacagc
+ aatacaatcaacaaggccaaaacaaccagcagcaataccagcaacaaggccaaaactatc
+ agcaccaacaacagggtcagcagcagcaacaaggccactccagttcattctcagctttgg
+ cttccatggcaagttcctacctgggcaataactccaattcaaattcgagttatgggggcc
+ agcaacaggctaatgagtatggtagaccacaacacaatggtcaacaacaatctaatgagt
+ acggaagaccgcaatacggcggaaaccagaactccaatggacagcacgaatcctttaatt
+ tttctggcaacttttctcaacagaacaataacggcaaccagaaccgctactgaacgatga
+ ttcagttcgccttctatccctttgtttacgtatttgtttatatatataactttatttttt
+ ttattaattgggctgcaagacaattttgttgtcagtgatgcctcaatccttcttttgctt
+ ccatatttaccatgtggaccctttcaaaacagagttgtatctctgcaggatgcccttttt
+ gacgtattgaatggcataattgcactgtcacttttcgcgctgtctcattttggtgcgatg
+ atgaaacaaacatgaaacgtcagtaatttgaaacaaataacgtaattctcgggattggtt
+ ttatttaaatgacaatgtaagagtggctttgtaaggtatgtgttgctcttaaaatatttg
+ gatacgacatcctttatctttttcctttaagagcaggatataagccatcaagtttctgaa
+ aatcaaaatggtagcaacaataatgcagacgacaacaactgtgctgacgacagtcgccgc
+ aatgtctactaccttagcatcaaattacatatcttcgcaagctagttcctcgacgagtgt
+ aacaacagtaacgacaatagcgacatcaatacgctctacaccgtctaatctactcttttc
+ taatgtggcggctcagccaaaatcatcttcagcaagcacaattgggctttcaatcggact
+ tcccatcggaatattctgtttcggattacttatccttttgtgttatttctaccttaaaag
+ gaattcggtgtccatttcaaatccacccatgtcagctacgattccaagggaagaggaata
+ ttgtcgccgcactaattggttctcacggttatttcggcagagtaagtgtgaggatcagaa
+ ttcatattctaatcgtgatattgagaagtataacgacacccagtggacctcgggtgataa
+ catgtcttcaaaaatacagtacaaaatttccaaacccataataccgcagcatatactgac
+ acctaagaaaacggtgaagaacccatatgcttggtctggtaaaaacatttcgttagaccc
+ caaagtgaacgaaatggaggaagagaaagttgtggatgcattcctgtatactaaaccacc
+ gaatattgtccatattgaatccagcatgccctcgtataatgatttaccttctcaaaaaac
+ ggtgtcctcaaagaaaactgcgttaaaaacgagtgagaaatggagttacgaatctccact
+ atctcgatggttcttgaggggttctacatactttaaggattatggcttatcaaagacctc
+ tttaaagaccccaactggggctccacaactgaagcaaatgaaaatgctctcccggataag
+ taagggttacttcaatgagtcagatataatgcctgacgaacgatcgcccatcttggagta
+ taataacacgcctctggatgcaaatgacagcgtgaataacttgggtaataccacgccaga
+ ttcacaaatcacatcttatcgcaacaataacatcgatctaatcacggcaagaccccattc
+ agtgatatacggtactactgcacaacaaactttggaaaccaacttcaatgatcatcatga
+ ctgcaataaaagcactgagaaacacgagttgataatacccaccccatcaaaaccactaaa
+ gaaaaggaaaaaaagaagacaaagtaaaatgtatcagcatttacaacatttgtcacgttc
+ taaaccattgccgcttactccaaactccaaatataatggagaggctagcgtccaattagg
+ gaagacatatacagttattcaggattacgagcctagattgacagacgaaataagaatctc
+ gctgggtgaaaaagttaaaattctggccactcataccgatggatggtgtctggtagagaa
+ gtgtaatacacgaaagggtactattcacgtcagtgttgacgataaaagatacctcaatga
+ agatagaggcattgtgcctggtgactgtctccaagaatacgactgatgaaaataatattg
+ acgttcgcatttaatctatacctataattctgtacttatatactgttccttaattgaaga
+ tttcaacatcgtttttgatgtaggtcttttcacctggaggtgcggctggggtaccgaaga
+ ctaattgagcttgtacggtccaagactcagggattttgcttggcaaagcagcttttatgt
+ aaccattgtagtgttgtaggtgaccacccaggcccattgcctccaaggcaacccacgagt
+ tgatttgagcggcaccagaggtatggtccgcgaaactagggaatgcagctgcgtacgctg
+ ggaagtcagcctttagcttttcagttaccttgtcgtcggtgaagaagattacagaaccaa
+ aggcctcatcccttgctgaagcaggcctcttttgaccggcaggctttctatagccttagt
+ cacttcgtcccaaacttttttgtgagtttcaccagtcaagataacagcgcgatttggctg
+ ggagttgaaagcggtgggtgtttctttaatgatggtttggacgacggattggatgtcgtt
+ gatagtaatttcaccaggtaactccggtttcaaagcgtaaatagtacgacgagcagttaa
+ agttttcaaataagttgcaaccagcagacatgatattggattgtttctttgtatatatgt
+ tgatataggatacttcagtctacgaaaaaagtacaaattatgtagtcagttccttcagca
+ tggtgtccttatatactgtagtttggacaaggtgcaaatgccaagaccctagcccgaaaa
+ gctcgaggcaccccaggatctttttctttacgtaattttcacgtaaaacgccacagtccg
+ atttttttcgaataatcattagtaaaagcggtatactggattattgtacgataacaaggt
+ agagctttattactaagctaagacgttcttacatcaatagtgctgttcgttattgatgtt
+ aggagaaggagagggtctgctgaatagtgtgagcagtgtttctgaactttttcttcgtct
+ aagtccttgtaatgtaaggtaagaatgcaagcatcttgtttgtaacgcgattgtacgttg
+ acgttagtaagtcacaaacccaagcttaacttcttcgtgaggaaggaaagtgttgtctcc
+ tactttttttaaattttcgaattgtatttatatttatttagtacttcttgagtttacata
+ tccttcgtaaaaatgcaacttctgtcgaaaaacacttccaaaaaaaaataataatgaatt
+ tatgaagcatactaacgagcgagcacatcgctgaggtatcattacttcatgagataaatt
+ aagatctcctcatatgcgaatttcctgttcagtgataaacgttgattacgttattgataa
+ aagtcttttcttctggcaaggcagttggaacaccaaagaccaattgagattgtacagtcc
+ acgcaataggaacatcttgaggcaaagcagatttgacgtagtcattatagtgttgcaaat
+ tagcccccaatcccaatagttcgagggcagtccaagactgaatttgcacagcaccggtcg
+ tatgagcggcgcatgttgggaaagcggtgccaaggctggaaaatctcttgcagttttcag
+ ttggtccttcatcagtgaagaaaatgactgaaccgtaagcctcatctctgcaagactctg
+ gtctcttcttggcttcagcagttggcattgcgctcgcaacagcatcccatatccttttgt
+ gtgtatcaccaacgataatgacagcgcgattcacttgtgagttaaaagctgttggcgtat
+ tcttgagaataacgtgtacagttctctttacatcatccaaaccgacaccttgtggtaatt
+ cgggcttcaaattgtagatggtacgacggtttgtaatagcgtttaagtagtttccagttg
+ gggacatttctttggcttggaggtctggtgttcttgattttgatggtgtatatagcttta
+ aaaaaccaaaaatgatcaacctttatatctctcttcgatcacaacatgggtggagacaaa
+ cacaaagtagtgatgacccactcaggtgttaaagtgtgtcatatctgcaaatgacactgt
+ atttttttttcctttgccatttattgtcgcagtaaggaaaagcgcagagcaaagctgaca
+ aattgaaaaattattcggaagagcattaacaaatattgattaataatcaagtaacactcg
+ cagacattgttatttcagaagtaaaaatatcgcagaacaagttggttaaactgaatgctt
+ cagtcattctaatgctaattacaccttaacaaaattaaaaattcttaacgtagattttta
+ aaaaaggcaagaagatcgttcctgatgcacgacgaaaacggttgcacccctactaaggca
+ aaattgcatatgcggatgaaatcaaacctcttctgttgcattaagaacaaaaggaaatca
+ tttttctctcacttcatattatttcacttatattttcctctccatttccatcacgcaatc
+ gttagtgctttttattttttttcttcgctcataaaggactagaattaaaatgaaaatcgt
+ ctatcattacacgtatgctattactacaaaacagaagtacaaaaatgaataaatataaaa
+ gaagtaaatgctttttttttaacaccagaaggcaacgacccttttccaataaggtccgtt
+ cctcaaacgttccctatattcttcgtcttcttgcttgattaattcttcatcaaagatttg
+ tctatgagaatctaggtcgatcttgtcggccctgatgaacagtttccaatccttgtgcca
+ gaccttgtagccgacatatagtgcaatcaagattggcatagccaagtagttttcgaaaaa
+ ggcttgtgcatccagcttaccttcaccaatgggggcgatagcgacccaaaattgggcaat
+ aagaattaaaatcatcataatgcaagcgtaggcagatccccaaacaccagtttgagattt
+ gaaacccaattctccaagagatctcccttggactttcatggctcttctaaatctaagatg
+ ggataaacaaatggcagtccatgtgaaaagctgagacaaaccagaaatggccaataacca
+ agtgaaaacttgttcttccttgggagatgcagcacagaaggcaataacggcaaacaatgc
+ agaaacacccatggcaatcaatggtctaccggctctgtcgatgtaggagaaaaccttagg
+ agcgtaaccttgctcggatagagttaaaaataaacgagcactggagtagaaggaggagtt
+ agccatggacagcacggaaagtagaataacggcgttaatgaagtgtgggacgacacggac
+ accgtgggatgcaacagcaatgacatatggcgaggctttagtaccaccaccggtagaacc
+ caataattgatcggaattgtatggcaccaagaaacccagtagaataatggtagccaagaa
+ taggaataagattctgtagatcatttgtttggccgcacctggaatggcctttcttggatt
+ agattgttctgcagtggtgatggcaataaactctgaaccaccaaaggcaaaagcagcagt
+ cactaatgttgcaacaacacctttaaatctgtcaatggcatgtttaccattgaaagcgcc
+ cggatcgtgccagtatttaccaccaataaaaccatcattaccagcgccaccaacatcgat
+ gataataccaagaatgaagaacccagtcatcatcaaaattttgcaacagttgaagaagaa
+ ctcagcttctgcataaccacgagcaccgaaaatattaatagtaatcaccaaaacatagaa
+ aataatgacgaacacatccgggttcacagatgtcgtccaatatttgatagtcatggatgc
+ ggtcaccaattccagaggacacacacacagccattgcaaacaataaacccaagcgactgc
+ aaacccaaaaaccatcatccacaaggaaactgggtatgcattgtagccaccagtcaagtt
+ actatacactagcgccatttcaccacatgcttgaataatacagtacaagatagaacccat
+ aatagcgtaaccaataagtagtccagctggacccgcatgaaccaacgcggtaccgttacc
+ gaccagtaacccagtaccgatacccgtacccaacgcaatcatcagaacatgtctaggctg
+ aatggttttcttcagcgagtccgacttatgacctgtattagcgcctatgttgtccgaact
+ ttcatttttttctagttcttgagcctgtctagaagaaggtgagatagccgacgtaagatc
+ gttcatttgtgttgcttctacttcatcctgtggaccctgatccgctcttttaaaggagtc
+ gaaaaacctacgcacggcagaagtgttatgctgttcgtaacctaaatgaggttgtgagga
+ tggacggtcattggagcctgtttcgaaatattcaatttcattatcctgccccgtggcatg
+ tatttctgtggagctatttttcaagtctttcagttcgtatagagacttcgacgacgacat
+ tgtgcgaagctatctttgtctatattagcgtgcttcttctataaagattacataataaac
+ ttgttctacattgttgtgggcagaccatcaaagacaaagcacttatatatgtatgtaaaa
+ caaatgagtattacgaggaaataacaaaagaaactaatcacgaaggatatgagtcttgtt
+ agcggacgaggggaagatagcgcaagacacacgtatagccgacctatcctatgcatctta
+ gccgtcttatcaggattgatttgcggttcagtgaaagagataaccactgccaagctggaa
+ gaaatggctagaaggcactaaaaaaagagcctcgcagcgcaacttaagaactgtggacta
+ ctaaggagcagcaatccgctggacgtgggggagttgaaagttggccggatactgcaacct
+ gcttatctatgactcgatgagaccgttatttagtgcgatggtgccactgggagtgccatc
+ aacggcaaagttgcagctttttgcagcagatcactggcttagacggcaccctcaagcggc
+ actgaagaacggcgcgcttttttttttcttctattatgccggtcaaggaaaaaaagaaac
+ caaaaaaaaatgccgtgggaggcacggatgcaagatttctccaaagtccctttttagccg
+ ccgaggaaatgaggaaataaggggttagccgccgagtgaagccgccgaggtataaaaggg
+ agaatgcacaccttcgtaagaagattctatttgaaagagtgctaggtaaagtcatcaaaa
+ ggcttattgatataataattgctttgcaaaatggaaaacaaatgactttgaaaatgggag
+ aaaacaaaaattatataggtaatgttgagttcttgattttttttttgtttctgtccttgc
+ cacagctcttttcatggaacattgggctctaggataatttttttatgttttctgttatgt
+ ttgccatggtaggttgaggtaaaggcgctctttcagcaatgacgatttatgctaatatgt
+ tttccttaaaaggaaaacatctcgttctttttttcgcgtgctgtgcaagaaaacgcgtaa
+ cgcagtcttagtaaaagggctctcttgaaaactacatgaaaactaaaaaagaagatgtaa
+ atttggatacaacagcgtagaatagcataatagagtgtgcaataaacgtgtataggcttg
+ ataatagtgctgacgtaaattaccagaactactgcagtatgactgtggcgaataccgaga
+ cccattctgctgctaaaccatcgagtactataggcccatggaagcttggtgaaacactgg
+ gctttggtagcaccggcaaggtccaacttgcgcaacatgagcgcacgggccatagaacag
+ ccgtcaaagtaatctcaaagtccattttcaacaacaacggaaaccacagtaacgacgatt
+ cagtgctgccatacaacatcgagcgtgagattgttattatgaaacttttgagccacccaa
+ acgttctgagtctttacgacgtttgggaaaccaacaataacctgtaccttatcctggaat
+ acgccgaaaagggagaactgttcaacttgcttgtggaccacggtcccttgcccgagcgtg
+ aggccatcaattgcttcagacagataattataggcatttcatactgccatgcgctaggaa
+ tagtacatcgggatttaaagccagaaaatcttttgctcgatagtttctataatatcaaaa
+ ttgcggattttggtatggcagctttacagacagacgccgacctgctagaaacttcttgtg
+ ggtctcctcattatgccgctcctgagatcgtatcaggtctgccctatgaaggatttgcca
+ gcgacgtttggtcttgcggtgtaatcctctttgcccttttaacgggtagactacccttcg
+ atgaagaaaacggtaatgttagggatttgttattgaaagtccaaaagggccagtttgaaa
+ tgcccaatgacaccgaaatttccagagatgcacaagatttaatcggcaagattctcgttg
+ tagatcccaggcaaagaataaaaatcagagacatccttagtcatccactgctaaaaaaat
+ atcaaaccattaaggattccaaaagtatcaaagatttacctcgtgaaaatacctatctat
+ atccactggctgactcgaacaaccatactagcgcctctatcgatgattcaatcctacaaa
+ acttggtggtactttggcatggtagacacgccgatgacatcgtatctaaactgaaagaga
+ atggaaccaacaaagaaaaaattctatatgcgctactataccgtttcaaactggactctg
+ taaggggatcgaataagaagaaccgcaataagataaagaaaaccaaaaaaaataaaagat
+ ctagcacactttcgtcatcgtcatcattattattgaacaaccgaagcatacaatctacgc
+ cacgcagacgtacatccaagaggcattccagggaattctcttctagcaggaagagatcgt
+ cctttttactgtcctcgaatcctaccgatagttctccaatacctttgagaagtagtaaaa
+ gaattacacatattaacgtagcgtctgcaaatacccaagcaacgccaagtggtgtcccga
+ atcctcacaagaggaactcaaaaaaacggtcaagcaaaagattatcttatatgcctaata
+ caaaaagaagctcactaacatcaaaatcattatcaaactttactaacctaatagatgatg
+ atgattgggagtacattgaaaaggatgcaaagagaacaagttccaacttcgctacactga
+ ttgatgaaatatttgagcctgagaaatttgaattggcgaaaagagaaaaggctgaacttc
+ aaagaaaagttcaggaagcaaaaaggcaatcagtgaatgcacagaagattaatgaggacg
+ agtttggatccgaagtttctgatggaatgaaagagctgaaaaaaataaatgacaaagtgt
+ cgtccccgttgataaattacgaattttcgcaacaagaactattgcaagatatagacacct
+ tactaacgaatcgttatcaactttcgtcatatactaggcctatttcgagactggatcccg
+ gattaacgcctgttactgagacacttcctaacaacctaaaagaaaaaacagctctgctgc
+ aggatactgaaaagaaaataatagaaacgatacgcagatccaaatttttaggatcgctac
+ taaatgttagagggggactatcgccagggaaaagtgaactggcacctatcgaagagtctc
+ ccatagtttcaaccacaccactaatatataatgatcgaatggaacctcgtaggatatccg
+ atgtggaagtcccacatttcacgaggaaatcaaaacactttactactgctaataatcggc
+ gctcagtcttatctttgtatgcgaaggattcaatcaaagacttaaacgaatttttaataa
+ aggaagatcctgatttgcctccacaaggaagcactgataacgaaagtaggagcgaagatc
+ ccgaaatagccgagagtatcactgattcaaggaatatacaatatgatgaggatgatagta
+ aggatggtgataatgtgaataatgataatatattgagcgactttccctcaaggcgttggc
+ atatcacaggaatacgacatgaaggataaaaatccaaaccaatctccaatatcaaaaagc
+ gcagagcccacactggtagtgaaacttccgtctttgagttctttccaaggaaaaaacgcc
+ agtgggttgggcctataccaaagagagccttctaaggtaaccttaccgagccttacaagt
+ aataacagcagcgtcggagaaaacactagaggatggggcggaaaaagggactgagagtga
+ gaaaatcgctgcttccctgtcagacgatgacttgaaggaagataatgacaagaaagataa
+ cgacacagtgaatgccctactacagtaaaaaagccacctaacagcgtactcttgaaaaaa
+ ttctcaaaaggtaaaatcttagaacttgaaatacatgccaaaatacctgaaaagagacta
+ tacgagggcctacataaactactagaaggctggaagcaatacgggttgaaaaacctggtg
+ tttaacatcactaatatgatcataacaggaaaattggtgaatgacagtattttattttta
+ cgatctacgctttttgaaataatggttttaccaaatggagacggtagaagtttaattaaa
+ ttcaataagaaaacaggatctactaaaacgctgaccaagcttgctacagaaatccaaatt
+ attttacaaaaagaaggtgttttggacaaataaaaaaaagggagggataatctatcccca
+ aatggacaatacgatgtaacaccgttatataattaataaatgtatatatgtaactatttt
+ ttgaaaaaaataaaatggtaatttattaaatatttggaccggtcaacaatctcagatata
+ gttgaaaaaatctttaattgtaatagcagaaaaaaaaaactccgatacggggagtcgaac
+ cccggtctccacggtgaaagcgtgatgtgatagccgttacactatatcggataacaaatt
+ attaaaattatttaataaaaagatgaactaatggtgtataaaaacatcataaaaactcaa
+ ctcaactgttccaataaaatctgcaatttaaattaattattctgaaaaaacacgtaaatt
+ gttggaatggaaccctaaaatatttcctctttagtactattcatgttacttgagaaatgg
+ gtgaattttgagatagttgttgggattccattgttgataaaggctaataatattaggtat
+ acagaatatgctagaggttctcctcgagcatataggaatcctcaaaatggaattagcatt
+ tctacataataacctcaattcccttttgtggattcacatattctcgatcggaactcctag
+ catagtgcagacaacactcacatagctctcccatttatcaagttggaattcactccaaga
+ aattttcttgcttctctctttcccacagaaatgaaaaggaacagtttggcggctattgtt
+ ttagcccttactttttggaccctttgaaaaaatgccaagaatgcgtgacggtttccctcc
+ atgctcgagaaccacaaaaaccctatgtggtattcgattatgcgactcacggcaaatatt
+ tgacttgtcaaagttattcgggatttccttgaatccctgagaagtgaaatggaaagaatt
+ taagggcataaatacctttcaagcacccagaagaacagtaaaataaagcaaggtacgtga
+ aattaatatttttaaatggttctaaccgatgccgaagaactgcgcgatccggttataacg
+ tctgacatgtccttttttgatttggaatccaaccactcagtgactctgtcattacttgcg
+ aaattattccacaaattccatcgaaagtgaatcgcaaaccaccttcagactggcaccgac
+ aaagcaaagattatacagacagagtactttatacgtaccgttaagtctcaagcaaagggt
+ tttcttatttactgaacgggtaaagagtatctgggccggcttgccaagatgcaaaccgaa
+ taagtatttcaaagttgcatttgccttagccgtcctgacaccattggctatttggatatt
+ ttatattgactttcgtgtacattgatcacatcgactgttctattggcaaatgaaccacgg
+ gcattgactatttttcaggttactactatatattattgttggaataaaaatcaactatca
+ tctactaactagtatttacgttactagtatattatcatatacggtgttagaagatgacgc
+ aaatgatgagaaatagtcatctaaattagtggaagctgaaacacaaggattgataatgta
+ ataggatcaatgaatattaacatataaaatgatgataataatatttatagaattgtgtag
+ aattgcagattcccttttatggtgttggaataaaaatcaactatcatctactaactagta
+ tttacgttacctagtatattatcatatacggtgttagaagatgacgcaaatgatgagaaa
+ tagtcatctaaattagtggaagctgaaacgcaaggattgataatgtaataggatcaatga
+ atattaacatataaaatgatgataataatatttatagaattgtgtagaattgcagattcc
+ cttttatggattcctaaatcctcgaggagaacttctagtatatctacatacctaatatta
+ ttgccttataaaaaatggaatcccaacaattacatcaaaatccacgttcttttcatatgg
+ attcctaaatcctcgaggagaatttataatatagtctgtatacataatattatagcttct
+ aacgacaatggaattctaacaattatcaaattgccgccagttcctcaaaaagatccatgt
+ ataatcttcattattacagccctcttgacctctaatcatgaatgttctcaaataaatctg
+ ttggaataaaaatcaactatcatctactaactagtatttacgttactagtatattatcat
+ atacggtgttagaagatgacgcaaatgatgagaaatagtcatctaaattagtggaagctg
+ aaacgcaaggattgataatgtaataggatcaatgaatattaacatataaaatgatgataa
+ taatatttatagaattgtgtagaattgcagattcccttttatggattcctaaatcctcga
+ ggagaacttctagtatatctacatacctaatattattgccttataaaaaatggaatccca
+ acaattacatcaaaatccacgttctcttcatggtagcgcctatgcttcggttacttctaa
+ ggaagtcccatcaaatcaagatccgttagccgtttcagcttccaatttaccggaatttga
+ tagagattccactaaggttaattctcaacaagagacaacacctgggacatcagctgttcc
+ agagaaccatcatcatgtctctcctcaacctgcttcagtaccacctccacagaatggaca
+ gtaccaacagcacggcatgatgaccccaaacaaagctatggcctctaactgggcacatta
+ ccaacaaccgtctatgatgacgtgttcacattatcaaacgtcacctgcgtattatcaacc
+ ggacccacactatccgttgccacagtatatcccaccactgagtacttcctcacctgatcc
+ aatcgattcacagaatcaacactctgaagtacctcaagctgagacaaaggtgagaaataa
+ cgtcttaccaccacacactttaacatcagatgaaaacttttctacatgggttaaatttta
+ catcagatttttgaagaactctaatctcggtgacatcattccaaatgaccagggtgaaat
+ caaaagacaaatgacttatgaagaacatgcgtatatatacaataccttccaagcatttgc
+ cccatttcatttattgccaacatgggtaaaacaaattttagaaattaattatgctgacat
+ ccttacagtcctttgtaaaagtgtgtccaaaatgcaaactaacaatcaagaattaaagga
+ ttggatagctcttgccaaccttgagtacgacggaagtacatctgctgatacatttgaaat
+ tacagtcagtacgatcattcagaggctaaaagaaaacaatatcaatgttagcgacagatt
+ ggcctgtcaactaatacttaaaggtctatccggtgacttcaaatacctacgtaatcaata
+ tcgtaccaaaacgaacatgaaactttcccaattattcgctgaaattcagttaatatatga
+ cgaaaataaaatcatgaatctaaataaaccgtcccaatacaaacaacacagcgaatacaa
+ aaatgtttctcgcacatctccaaacacgactaacacaaaggttacaactcgtaattatca
+ gagaacaaatagttcaaaaccaagagcagcaaaagctcacaatattgctacatctagtaa
+ attctcaagggtgaacaatgatcacattaatgaatcaaccgtttcatcacaatacttaag
+ cgatgacaacgaacttagtcttaggccagcaacagaaagaatctaagccaacacacacaa
+ tagactcgaatgacgaactacctgatcaccttcttattgattcaggacgttcgcaaacgc
+ ttgtcagatcagcccattatttacaccatgcaacacccaattctgaaataaacatagtcg
+ atgctcaaaaacaagacattcctataaatgccattggtaatcttcacttcaactttcaga
+ acggcaccaaaacatcaataaaagcactacacacaccaaacatagcctatgatctattaa
+ gtttgagtgagctggctaatcaaaatattacagcctgctttaccagaaacactttagaaa
+ gatcggatggtacagtcctagctcccatagtcaaacatggagacttttactggttatcta
+ aaaaatacctaattccttcgcacatttcaaagctaacaataaacaacgtcaacaaaagca
+ aaagcgtaaataaatatccatatccgttaatacatcgaatgcttggacatgctaacttcc
+ gaagtattcagaagtctcttaagaagaatgcagttacatatttgaaggaatcggatattg
+ aatggtctaacgcatgcacatatcaatgtcctgactgtctaatcggcaaaagcacgaaac
+ atagacatgtcaaaggatcacgactaaagtaccaagaatcatatgagccttttcagtact
+ tgcataccgatatatttggtcctgtacatcacttaccgaaaagtgcaccttcttacttta
+ tatcgtttacagatgagaaaaccagattccaatgggtgtacccattacacgaccgtcgtg
+ aagaatctatcctcaatgtttttacatcgatattagcatttattaagaaccaattcaatg
+ ctcgcgttctagttatccagatggatcgtggctccgagtacactaacaaaactcttcata
+ agttctttacgaacagaggtattactgcatgctatacaaccacggcagattctagagcac
+ acggtgtcgctgaacgattaaatcgtactttattaaacgattgtcgcacactgcttcatt
+ gcagtggtctaccaaatcatctatggttctcagcagtcgaattttctactataatcagaa
+ attcattagtctcaccaaaaaacgataaatccgccgcaagacaacatgcaggtttagctg
+ gactggacattactactatactacctttcggtcaaccggttatagttaacaaccataatc
+ ccgactcgaaaatacatcctcgtggcattccaggttacgccttacatccgtcacgaaact
+ cttatggctatattatctatcttccatcattaaaaaagacagtagatactaccaattacg
+ ttatattacaagacaagcaatccaaattggaccaattcaattatgatacactcacctttg
+ atgatgatctcaatcgtttaacagcccataaccaatcttttattgaacaaaatgaaacgg
+ agcagtcatatgatcaaaatacagaatctgatcatgactatcaatcggagattgaaataa
+ actctgatcctctagtgaacgacttctcgtcccaatcaataaaccctttacaattagaca
+ aggaaccagtccaaaaagtacgtgcaccaaaagaagttgatgccgacatatctgaataca
+ atattcttccatctcctgtacgatctcgtacaccccatatcattaataaagagagtaccg
+ aaatgggtggtaccgttgaatcagatactacttcacctagacactcgtctaccttcactg
+ cacgaaaccaaaagcgacctggtagtcccaatgatatgattgatttgacctcacaggata
+ gagttaattatggacttgaaaacatcaaaactacacgtttgggtggtacggaggaaccat
+ atattcaacgaaatagtgatacaaatatcaaatacaggactacaaatagtacgccctcaa
+ tagatgaccgttcgtccaacagtgaatccactactcccatcatctccatagaaacaaagg
+ ctgtatgtgataatacaccctccattgatacggatccgccagaatatcgatcttctgacc
+ atgcgactcctaatataatgcctgacaaatcctcaaaaaatgttacggctgattctattc
+ ttgacgacctcccacttcctgacttaacccatcaatctcctacggacacttctgatgttt
+ caaaagatattccacacatacactctcgtcagactaattccagtttgggtggtatggatg
+ attctaatgttctgactactaccaaaagtaagaaaagatcattagaagataatgaaactg
+ aaattgaggtatcccgagacacatggaataataagaatatgagaagtctggaaccaccaa
+ gatcgaagaaacgcataaatttaattgcagcaataaaaggagtgaaatcgatcaaaccag
+ ttcgaacgaccttaagatatgatgaagcaattacttataataaagacaacaaagaaaaag
+ acagatatgttgaagcttatcataaagaaattagccaactattgaaaatgaacacttggg
+ atacaaacaaatattatgatagaaatgacatagatcctaaaaaagtaataaactcaatgt
+ ttatatttaacaagaaacgtgatggtacacacaaagctagatttgttgcaagaggcgaca
+ ttcaacaccccgatacatatgattctgatatgcaatccaataccgtacatcactatgcac
+ tgatgacgtcactgtcaatcgcattagacaacgactattatatcacacagctggacatat
+ cctctgcttacttatatgctgatatcaaagaagaattatacataagacctccaccacatt
+ taggtttgaatgataaattactacgtttgagaaaatcactctatggtttgaaacaaagtg
+ gtgcaaactggtatgaaaccattaaatcatatttaataaattgttgcgacatgcaagaag
+ ttcgcggatggtcatgcgtatttaagaatagtcaagtaacaatttgcttattcgttgatg
+ atatgatattattcagcaaagacttaaatgcaaataagaaaatcataacaacactcaaga
+ aacaatacgatacaaagataataaatctgggtgaaagtgataacgaaattcagtacgaca
+ tacttggattagagatcaaatatcaaagaagcaagtacatgaaattaggtatggaaaaat
+ ccttgacagaaaaattacccaaactaaacgtacctttgaacccaaaaggaaagaaactta
+ gagctccaggtcaaccaggtcattatatagaccaggatgaactagaaatagatgaagatg
+ aatacaaagagaaagtacatgaaatgcaaaagttgattggtctagcttcatatgttggat
+ ataaatttagatttgacttactatactacatcaacacattgctcaaccatatactattcc
+ cctctaggcaagttttagacatgacatatgagttaatacaattcatgtgggacactagag
+ ataaacaattaatatggcacaaaaacaaacctaccaagccagataataaactagtcgcaa
+ taagcgatgcttcatatggtaaccaaccatattacaagtcacaaattggtaacattttcc
+ tactcaacggaaaagtgattggaggaaagtcgacaaaggcttcgttaacatgcacttcaa
+ ctacagaagcagaaatacacgcggtcagtgaagctattccgctattgaataacctcagtc
+ accttgtgcaagaacttaacaagaaaccaattattaaaggcttacttactgatagtagat
+ caacgatcagtataattaagtctacaaatgaagagaaatttagaaacagattttttggca
+ caaaggcaatgagacttagagatgaagtatcaggtaataatttatacgtatactacatcg
+ agaccaagaagaacattgctgatgtgatgacaaaacctcttccgataaaaacatttaaac
+ tattaactaacaaatggattcattagatctattacattatgggtggtatgttggaataaa
+ aatcaactatcatctactaactagtatttacgttactagtatattatcatatacggtgtt
+ agaagatgacgcaaatgatgagaaatagtcatctaaattagtggaagctgaaacgcaagg
+ attgataatgtaataggatcaatgaatattaacatataaaatgatgataataatatttat
+ agaattgtgtagaattgcagattcccttttatggattcctaaatcctcgaggagaacttc
+ tagtatatctacatacctaatattattgccttattaaaaatggaatcccaacaattacat
+ caaaatccacattctcttcaaaatcaattgtcctgtacttccttgttcatgtgtgttcaa
+ aaacgttatatttataggataattatactctatttctcaacaagtaattggttgtttggc
+ cgagcggtctaaggcgcctgattcaagaaatatcttgaccgcagttaactgtgggaatac
+ tcaggtatcgtaagatgcaagagttcgaatctcttagcaaccattatttttttcctcaac
+ ataacgagaacacacaggggcgctatcgcacagaatcaaattcgatgactggaaattttt
+ tgttaatttcagaggtcgcctgacgcatatacctttttcaactgaaaaattgggagaaaa
+ aggaaaggtgagagccgcggaaccggcttttcatatagaatagagaagcgttcatgacta
+ aatgcttgcatcacaatacttgaagttgacaatattatttaaggacctattgttttttcc
+ aataggtggttagcaatcgtcttactttctaacttttcttaccttttacatttcagcaat
+ atatatatatatatttcaaggatataccattctaatgtctgcccctaagaagatcgtcgt
+ tttgccaggtgaccacgttggtcaagaaatcacagccgaagccattaaggttcttaaagc
+ tatttctgatgttcgttccaatgtcaagttcgatttcgaaaatcatttaattggtggtgc
+ tgctatcgatgctacaggtgtcccacttccagatgaggcgctggaagcctccaagaaggt
+ tgatgccgttttgttaggtgctgtgggtggtcctaaatggggtaccggtagtgttagacc
+ tgaacaaggtttactaaaaatccgtaaagaacttcaattgtacgccaacttaagaccatg
+ taactttgcatccgactctcttttagacttatctccaatcaagccacaatttgctaaagg
+ tactgacttcgttgttgtcagagaattagtgggaggtatttactttggtaagagaaagga
+ agacgatggtgatggtgtcgcttgggatagtgaacaatacaccgttccagaagtgcaaag
+ aatcacaagaatggccgctttcatggccctacaacatgagccaccattgcctatttggtc
+ cttggataaagctaatgttttggcctcttcaagattatggagaaaaactgtggaggaaac
+ catcaagaacgaattccctacattgaaggttcaacatcaattgattgattctgccgccat
+ gatcctagttaagaacccaacccacctaaatggtattataatcaccagcaacatgtttgg
+ tgatatcatctccgatgaagcctccgttatcccaggttccttgggtttgttgccatctgc
+ gtccttggcctctttgccagacaagaacaccgcatttggtttgtacgaaccatgccacgg
+ ttctgctccagatttgccaaagaataaggtcaaccctatcgccactatcttgtctgctgc
+ aatgatgttgaaattgtcattgaacttgcctgaagaaggtaaggccattgaagatgcagt
+ taaaaaggttttggatgcaggtatcagaactggtgatttaggtggttccaacagtaccac
+ ggaagtcggtgatgctgtcgccgaagaagttaagaaaatccttgcttaaaaagattctct
+ ttttttatgatatttgtacataaactttataaatgaaattcataatagaaacgacacgaa
+ attacaaaatggaatatgttcatagggtagacgaaactatatacgcaatctacatacatt
+ tatcaagaaggagaaaaaggaggatgtaaaggaatacaggtaagcaaattgatactaatg
+ gctcaacgtgataaggaaaaagaattgcactttaacattaatattgacaaggaggagggc
+ accacacaaaaagttaggtgtaacagaaaatcatgaaactatgattcctaatttatatat
+ tggaggattttctctaaaaaaaaaaaaatacaacaaataaaaaacactcaatgacctgac
+ catttgatggagtttaagtcaataccttcttgaaccatttcccataatggtgaaagttcc
+ ctcaagaattttactctgtcagaaacggccttaacgacgtagtcgacctcctcttcagta
+ ctaaatctaccaataccaaatctgatggaagaatgggctaatgcatcatccttacccagc
+ gcatgtaaaacataagaaggttctagggaagcagatgtacaggctgaacccgaggataat
+ gcgatatcccttagtgccatcaataaagattctccttccacgtaggcgaaagaaacgtta
+ acacaccctggataacgatgatctggagatccgttcaacgtggtatgttcagcggataat
+ agacctttgactaatttatcggatagtcttttgatgtgagcttggtcgttgtcaaattct
+ ttcttcatcaatctcgcagcttcaccaaatcccgctaccaatgggggggccaaagtacca
+ gatctcaatcctctctcttggccaccaccggatagtaaaggttctaatctaactcttggt
+ ctccttcttacatagatggcacctattccctttggaccgtaaatcttgtgagaagaaatt
+ gatagtaaatcaatgttcatttcattgacatcaatgtgaatcttaccataggcttgtgcg
+ gcgtcagtatgaaagtagatcttattctttctacaaattgcaccaatttctttaataggt
+ tgaatgacaccgatttcattattgacagccatcacagagacgagacaggtatctggtcta
+ atggcatcttccaattccttcaaatcgataagaccttgatcgtccacatttaggaaagtg
+ acttcaaatccctccttcatcatggcccgtgcggcttccaagacacacttgtgttccgtt
+ ctagtggtgatgatgtgtttcttagtcttcttataaaatcttgggacacccttaagaacc
+ atattattagattcggtcgctcccgaagtgaatattatttccttggggtcggcattgatc
+ atctttgctacgtaagctctagcattttccacagcagtatttgtttcccaaccgtaagag
+ tgagtgttggaatgaggattaccataaagtcccgtataaaacttcaacatcgtatccaaa
+ accctagggtctgttggtgtagtggcttgcatgtcaagatatatgggacgagtaccaaaa
+ cctgtgttttcttgataagcatggctcattgcagtgctaccagaagctactacagcatct
+ ggggtggtaccggatgcactcgcacgggcactagcctgtgcctttgcagcagcctgaata
+ tcggtatgcgtttccagagagaagttgtcgtctaacttcacgcctgctgcaggaggggaa
+ tagaatctcctgcttaccaaacaagccctatatgtggccgctggaacgttgtaaacttga
+ gataatcttgttatcgatcttgtagcagttgatttcaacatcttccagttgtagcagttc
+ cttcgtgtatgcgatatattttaatgaaaatgatgtatctgaaatagaaaatataacaca
+ taatgaactctgtcggtctttactctgagtgagttacttagatagtcatcatatatccct
+ gggcgatggccttactatttgtcagcgaccgtatctaaaattggtaggtcactatcaact
+ attgcgggtaatagggtgccataatagtattgtaatatatataaatatataaagcatcat
+ atgatagatatttgttgctattcacgtcatccactattactttagtggttccgctatata
+ ttcgcattgcccatcgaaggcagttttgtgtgctacctgctcgtgaccacggtctttttg
+ cccagtcttttacggcgggcatacttcatgatgaaactgtctattttcatacctcttgaa
+ tttagttcttcaattagaggcttgatatcctccaagtcccactgtgactgtagcctaaac
+ aggactttaaaccgttctttggggtccattggtagtgtgcttttcgctatatactggaca
+ gtcttatcggtaggcttgaaatgatagcctcgcagcatgtcaatgtcaatgtcacatggg
+ aagaaaggtgggaaaagggacttccacttgatgaggaactcatcaattggcatgcttatt
+ ccagaaacatatttccttagcgcttgaatcccgtaccactgagctataaacggtattctc
+ aagcgccacgtattgttttcagcctcttgctcttgaacagcaaatttattcagcactgtt
+ tcaatgatttctcttgtgtatggattgaactcgtcctcaatgtccttccccacagcgtga
+ tgtgtatcctcaacattcagatgctgtagatcgagtgattctgccattgcgctcatcagt
+ agtacatgcagtgctttgaaaaggaagtcttgtgaaagaatacacaacacaccgtctttc
+ acagatccaccaatcttatgccattttgatataccttctagcgcagaacatggtgagttc
+ tcaagtagttcttccagggtcccgataacctttgtagatgacctcttcatgattttgtcg
+ gagaaatccagttctccgttgtatattggtactgaattcaagttcaattcgccatgtgtc
+ tctctggtctcaaattctgattcagtcttggcgaatcccacgacgtccatgtacggcttg
+ gacagtccaaacaagagcgtttcgtcgaaagtaataggttgttcaggaacaaattctctc
+ attagtagaactgtgtttgaatgtttgcgctgcttgcagcacccaagtcttgtcgtgcga
+ acacagtacaacttcagacttgtctttgtccaatgacttaaaccttaactggtgattttg
+ aaccggatcctgtattatatccagtaactctggtgtcaactggatcagcttataagatgg
+ atcatactcgggtgcggaatgtaggttgatggacatggtacaggattattagtgagcaaa
+ tctctggtggatttttaccgttcatctcatagtgcttttttagtttatttttcacttttt
+ tctggttatactatacgctgcctactgaagcaataagcaagaaacacaacagtataaagg
+ gacaaagataataaaccgcctactactaaataatatcgctgtggtatgctactcctagag
+ atttcgatcaccatcgtggtgctctttgtcatacgcatagaattgacaaaaaagaaccct
+ aagctataaattaggtcatgttgagcgattgccagttgccaagacctagggttttgttcg
+ aacaattctaaaaaggtaaataaaaacaatggtaaacaaaggctttgatcttcaatgagg
+ aagatttctctcactgcggatagtaaaaacaacactagtttgtctgtactcgcattctac
+ tgctgctacctgctactcattacgtgttcgggctgttatctggttgctaaaagagtatat
+ ttacacctcaccaatggagaaagacctgtcgtctctttactctgaaaagaaagacaaaga
+ gaacgatgaaaccttatttaacatcaaactatccaaatctgttgtcgagaccacaccgct
+ aaatggtcattcattgtttgatgatgataaatcactttcagactggacggataatgtgtt
+ cactcaatcagtattctatcacgggtcagatgacttgatatgggggaagttctttgtctg
+ cgtgtacaagtcccccaacagcaataagttgaacgctataatattcgacaaattaggaac
+ atcatgcttcgaatccgtcgatatatcttccaactcgcaatactatccggccattgagaa
+ tttgagtccaagtgatcaggaaagcaatgttaagaaatgcattgctgtcattctgttaca
+ gcgctatccattactttcaccatcagacttatcacaaatattgtccaataaatcggaaaa
+ ttgcgactatgaccccccttatgctggagatttggctagtagttgccagttgataacagc
+ agttcctccagaagatctggggaagcgcttctttacatcaggacttctgcaaaatagatt
+ tgtcagctctaccctgttagatgttatttatgaaaacaatgaatccaccatcgaactaaa
+ taataggttggtattccatctgggtgaacaacttgaacaactttttaacccagtcacaga
+ atactcaccggaacagacagaatatggttataaggcgccagaggacgaattacccacaga
+ atcggatgatgatcttgtcaaggccatttgcaacgagttattacaactacaaacaaattt
+ tactttcaatttggtagaatttttgccaaaattcctgatcgccttgagagtcagagtact
+ caatgaagaaattaatgggttatccacaaccaaattaaatcgactcttcccacctacaat
+ agatgaagtcacaagaatcaattgtatttttctagactcgctaaagacagcaatccctta
+ cggttccctcgaagtactgaaggcatgcagcattactattccttatttctacaaagcata
+ tacaagacacgaggcggccacaaagaacttcagcaaagatattaaattgtttattaggca
+ tttcagcaatgtaattccagaaagagaggtctacacggaaatgaaaatcgagagtataat
+ taagggacctcaggaaaaactactgaagctaaacttaattatagagagattgtggaagtc
+ gaaaaaatggagaccgaaaaatcaagaaatggcaaaaaaatgctacaacaatatcattga
+ tgtcattgattcgtttggaaaattagattccccacttcattcttatagtaccagagtatt
+ tactccatcgggaaaaatccttacagaattagccaaatgctggcccgtagaactgcaata
+ caaatggctgaagagaagggtagtcggtgtgtatgatgtagtggatttgaatgatgaaaa
+ taagagaaatttattagtcatattcagtgattatgtggttttcatcaatatactggaggc
+ agaaagttactacacttcagatggatcaaacaggcccttaatctcagatattttaatgaa
+ ctcattgatcaacgaagttccgttgccctccaagatccctaagttgaaagtggagcgtca
+ ttgctatatagatgaggttctagtttctatattagacaaaagcactctacgttttgatcg
+ attgaagggaaaagattctttctcaatggtatgtaaattatcctctgcatttatctcttc
+ ttcgtcagttgctgacttgattacgaaggctagaattttggaaaaagacactgcatttca
+ tttatttaaagctagtagaagccattttacattatattctactgctcacgagctttgcgc
+ ttatgattccgaaaaaataaaatcaaaatttgccttattcctgaacataccaccatccaa
+ ggagatattggaggtcaacaaccttcatttggctttttttgcaagattttgcagtaacga
+ tggtagagataacatcgtaatcttagacgtcttaaccaaacatgacgataaacatataga
+ agttacatccgataacattgttttcaccataattaatcaattggccattgaaataccgat
+ atgcttttcttccttaaactcatcgatggccaaagatttactctgtgtaaatgagaattt
+ gataaaaaacttagaacatcaattggaagaggtcaagcacccttcaacagacgaacatag
+ ggctgttaatagcaaactttccggtgcatccgatttcgatgctactcacgagaagaaaag
+ atcatacggtaccataacaacatttagaagctatacaagcgacttgaaggacagtccatc
+ aggcgataatagtaatgtcaccaaggaaactaaggaaattttaccagtgaaacctacgaa
+ aaagtcttcaaaaaaaccaagagaaattcaaaagaagaccaagacaaacgcctctaaagc
+ agagcacatagaaaagaagaagcctaacaaaggcaaagggttttttggcgtgttaaaaaa
+ tgtttttggaagtaaaagcaagagcaagccttcaccagttcaaagagtgcctaaaaaaat
+ atcgcagaggcatcctaagtctccagtgaagaagccaatgacctcagaaaagaaatcctc
+ ccctaaaagggcagtcgtttcatctcccaaaattaaaaagaaaagtacttctttttccac
+ aaaagaatcacaaactgctaaatcttctcttcgagcagttgaattcaaatctgatgactt
+ gatcggaaaaccacctgatgttggaaatggcgcacatcctcaagaaaataccagaatatc
+ ttcagtagtaagggatacaaaatatgtctcctacaatccctctcagcctgtgacagaaaa
+ taccagtaacgaaaaaaaatgtcgaaccaaaagcggatcaatccacaaagcaggataaca
+ tttccaattttgcagatgtagaggtatctgcgtcttcttatcctgaaaaacttgatgcag
+ aaacagatgatcaaataattgggaaggcgacgaattcgtcatcagttcatggaaataaag
+ agctgccagaccttgctgaggtgactacagcaaatagggtttctacaacatcggctgggg
+ accaacgtattgatacccaaagcgaatttttacgtgcagctgatgttgaaaacttaagtg
+ atgacgatgaacacagacagaatgaaagtagagtttttaacgatgacctctttggtgatt
+ ttattcctaagcattaccgtaataaacaggagaacattaacagctcgagtaatttgtttc
+ cagagggaaaggtgccccaagaaaagggcgtatcaaatgaaaacactaacatatctctca
+ aaactaatgaagatgcatctacattgacgcagaaactctctccacaagcgagtaaagtgc
+ tgacagaaaattctaatgaattaaaagataccaacaatgaagggaaggacgcaaaggaca
+ taaaattaggagatgattacagtgataaagaaacagcgaaagaaataactaaaccaaaaa
+ attttgttgaaggaataactgaacggaaagaaatattccccactattcctaggttagcgc
+ cgccagcttcaaaaattaactttcaaaggtcaccatcctatattgagctctttcaaggaa
+ tgagggtggttttagataagcatgatgcccattataactggaaacgcttggctagtcaag
+ tctccttaagtgagggactaaaagtcaatactgaggaagatgcggcaattataaataaaa
+ gtcaggatgatgccaaggcggaaagaatgactcaaatttctgaagtgattgagtatgaaa
+ tgcagcaacctatcccaacttatttgcctaaggcgcatctagatgactcgggtattgaaa
+ aaagtgatgacaaattcttcgaaattgaagaagaacttaaggaagaattgaagggcagca
+ aaacggtaatgaagatgtcggtaataataatccatccaattctattccaaaaatcgagaa
+ gcccccagcattcaaagttattagaacatcgcctgtgagaattatcggaaggcacttttg
+ aagacactagaaaatatgaaaatggctctccatctgatatttcgttcacttatgatactc
+ acaacaatgatgaacctgacaaaaggctgatggaattaaaatttccatcccaagatgaaa
+ ttccggatgacagattctatactccagcagaggaacccactgctgaatttccggtggaag
+ aacttccaaatactccgcgaagtattaacgttacaacttcaaataacaagagcacagacg
+ ataagttgagtagcggtaatattgatcaaaaacctaccgaactgttagatgatttagaat
+ tcagttcatttaatatagcatttggaaatacctccatgagtactgacaatatgaaaatat
+ catccgacttaagttcgaataaaaccgtgttaggaaatgctcagaaagttcaagagtctc
+ ctagtggaccattaatctatgttttgcctcaagtgagcacaaagcatgagaaagaggggt
+ tccttcgaaagaaacaaaaagacgagcccatttggtttcccctagcaaaattgactttgc
+ tgatctaagtaggagaactaaagcattgacgccagagcgtaatactgttcctttgaaaaa
+ caacgacagtagaaaatacaaatatactggagagggatctatcggtaatatgacaaatat
+ gctgttaactaaagatgcttcgtacgcgtacttaaaagattttgttgcgttgagtgacga
+ tgaagatgaagatgggaaacagaactgcgctgttggtggcccagagaaactgaaatttta
+ ttgagtgttttattgattttttttttctttttaatttaatgcaatgtatacatttacata
+ aaatatggacaaataagaaaaagggggaaaaacctcccacttaagtaaaactgaatatat
+ tagatacctttacctactatcaaaggtgaatgcttttggatcaataggggttccatcggg
+ ggcaataccattttggccgaactttctttccatatataatagtctagccttatcgaaatc
+ tacattatcgttttccataatttgtttaatctcatcttggagacatcatccaaaccacct
+ cttacgtcgttgctattctccgagataatatcgaaattcctggaatggagaccgttttct
+ aggtcatcagcaaaggaatcatttaaacgtatatgtctattaaaacgtaattgtattctt
+ gaacgcagtcacgaaatttgtcggctattacagctgttagtatttgatacggtttgcact
+ ttgcattttcaaatgcttaaatatactaccaactttggttttccggtgtgccataagaag
+ gcataaagacaatagcacaacccagaggagcaatttgagatagaacaaactcttcatact
+ cgtatttgcaatttttggaaattaattgtgtagcttttcttctcttaattggtggcagtt
+ tttatatattcagtttagaatcccttttttttttttttttttcttcgcttttcgcatctc
+ atcgctggaagtggtgcacttttacagggattaaataaggttattctttttggtcaaaat
+ gataaatataactattaaagttatttataacccgcccgcttccttattatttatacgtta
+ tcataaagtacacaggtcatggttcggttggtgcttaggaattaatcacgtctagcataa
+ gagatctgtaaactacaaccaccataattataattatttaatttttgaatacaaaaatct
+ gcatctactaaattttcatattctacaacagcaacaccagttggttgaccattttcctgt
+ ggtttcaattccgcgttattgattttgccgataggcccaaacaagtcgaataaatcgctt
+ cttgctgttgagaatggtaaattactacaataaatgaaacagtttctatcccctcctgga
+ ttgacaccttcagtaaattttgctgcagtttcatcaatgtggacagcggcatcctgcgca
+ agacctggttcagtacctctggtatcttcaaggtcctcacgcctttgattataacgatca
+ ttgttctttctcttgttgaaacgcccttctctaacttccaaaactctaccttctacttcc
+ atgccgttgaatgtatcgatagctctaatcatttcatcctcagtaggataaatgacagaa
+ ccgaatcctcttgaaaatccgttgaaatccaattctacatcggcacgcaagacatgacca
+ cattctttaaacatatcttttaaggattgccaattcatagagtacggtaaattgatgatg
+ aacacttcaaacccattatctatttcttccctagtagctttcttagaaaattccttggca
+ gcttcaggaggaggattatcctgtcttaccattagtttccggtccataaagagggcacca
+ tcaaacttcgatatggcatcttggacagattcgtttttggtaaattccacagtccccata
+ ccacggtgatggccctttgatgtgataatgtcagcctccacaacttcgcccactgtacca
+ aacaattccttaaggtcttcaggggtacaatcaaaagtcaagtttctcacgaagatacta
+ ttttcaaaatttctcttggtcgcgtcaaattgcctttctaagtctctttcaacaattgga
+ cctaaagtacgacccctaccgccaccgcgtccccctctgaatgatctgcttcctcctctg
+ ccacggcctcctcggcgagggggccagttcctatcatcgtggtagcggctgccaccataa
+ ctctgatcgtaccgatcattgaaacgggagcctcggtcgcgtctctgacgacgactgcca
+ ttaccattattactgctactatcgttataatcatcgtacctgtctctgtcgtcgctcaaa
+ cgacgacgtacaggtgatcttgatctactcctatcatttccatacatccctagctctctc
+ tccatcttggcgaaaaaatccagctgatgtttgtttccttttcgcccatagcttatttct
+ aatgggaaatcctcaatataattgatttcatccaagctttcatttttttttcaatttttt
+ tcaccttcaagtgaatacatcacttccgacgataccaaaataatgacaaaatgccatgag
+ aagatcttatgatatgtagtaaatgttaaccaccattgctatttccttgctaggtttgcc
+ aaagccgttggcgaaggcagcactagacggcgagtcacttccgtctctttgtccacttcc
+ tcttcaccatcaaatcgaagcctgcaagtgccgtccctcttggttccgatgacaattgca
+ ggataaaaagtggtcgtttcaggatatcttgccagaaccttagtacctggcgggtagttc
+ ttagtgggaaagcctggcgggatcaacagtagttctttccgattacatttatacaccttg
+ cctgagttgcccaattcatcaggttcaggatctctcacttcaaatcgtgtaccatctgct
+ acgactttgagtactcacactgtatcattcgccgtcgcactgccccttctgggcttatag
+ gcaacctctgagcccaccaaaataggtgcatttgggttgtattcgctggtccagtacgat
+ ctgccaactttccccacagctttcttgccctgcgacaatgccatgcgtgtcccttctcct
+ tcttctgcatcttcagtattttcctcactcttttcctctcgtgactgattcaacatcaac
+ gacaaattggaacgtacattgtcaaggattcggtgcactctgttgacgttttccatgtgt
+ tcttggaaagtgttgagatgcaattgcagctgatcctttgacatattaaggaaatttagt
+ ttttttgtttgcagctcatcatcaaagggaatgacctcgttggcattgtatatgtcttgc
+ aaggacgacactacaacatcccaatatccgtccataataagaaaggtgaccgtgtgtttt
+ gctgtgaaaaactccccccacttagagaagccaccaaggtattgtgtctttaaccttttt
+ cgtatctggcaaaatcgaagaggaaagcagaaaaccaaaagaaaaaaatagtttagtaac
+ gcgtcacctcgtacaaacgtacaatcatagcaacttatgagtaaataggctcatggttat
+ agatacgttatatagatgtatagaggagagtcccgagggcgatcgcaaggccgagagact
+ aacctaaccaggtggtagttgggaaatgtcgaccatttcgctgatcttttcgtcttcgtc
+ ggcagcttcctcggtgcttgtcttcaaaggagttcttggcaatgccatcataccgcttct
+ tgcacactctaggacaccgaatggctcgaccaacttcaagaaggcagagatacgtgtggg
+ ttttgcagacaattccacaatacagcttgtttcgctgatgtcgacgacacgacctccaaa
+ gttgttggtcaagttggtgatatcgttcaaatgctcgtgcttcaaccttaatacctcact
+ ggcgggcaagttggcagggtggaattgcttttctctgatttcggcgaccaattcttggga
+ gtcagcggcgccagcattggtggaagtgtggtggtgcaatagtaggtcttcgaagtattc
+ agtacccaatagagagattctggccatcactagctctcttttgatgatctcagaattggt
+ atagtctaggacggcgtagacggggaccaagtcttcgatttgtctgcgtgcttgttcgac
+ tacgccatcttgcccttgcaaaacaatggtcattctacttaggtctttgacctcggtgtt
+ gcacacgaccaacgaatcgatgttaaagcctctggcagctaacgtacccgagactctgga
+ caagacaccgggttcgttttgcaccaaacagttcaagacatgctgttttcttggttgacg
+ agaaggcgctggtgtttcgtaaatgatggatgaaacggcactgttggcattccaggaagg
+ agtgtctagtgtgggcaagggaggtcttgttgcgtgtctgtgcatctgcttgtacgccaa
+ cgcggaggtcgacgaggaactgcaacgcaccatggtagcacatgaagaagcaaccaccct
+ gcggtggccgctttgcaataacgatctcagcattagaaagactgggttagtgagctcaga
+ tgttctaaagattctgtatatacgaactatgtacttccatatataccattccccgttctt
+ gttaggtaaaggttaaaaaaatattttttatggcacttcggcgatgcgaaagaaagtgag
+ tcacgataacggtgaggtgattcgttggatgtgccatcgcacaaggaattgctggcgtgc
+ taattcacgaccttgcttgacaaaagtatcgaggggaatggtgccacggtgcagcatcct
+ agataagcactcaatggtgtccgtgagagcgtagtcctgggcaacgaggttgtatagttg
+ attgagtccatctgttttggccactgctatggagttcacgtcctcgtcatccacatgcga
+ agttgagctaaactgttggactttggccctatttgcggttcaggacgtctatctgtgcgt
+ ttagcgagtgcatggtctgctcaatagcctgctcaacggctcgtaaatggttcttgtcga
+ tggcgattatttcatggaatctggcaattgattcttgcatgacggtctgtctcgtcagta
+ tcttgtccgctacatagtccacatcctctctgtataactcattgactacggtttgtagat
+ tttgtagcatttcatgatggttggttggggagatatccgtgttgtccatgtccataaggt
+ cgagtgcgttagatgccggttgaggtggtggaggtggaggcaatggcggttttaggtgtg
+ gagacttgggctttgggggcagtgggggagtattctccctcctgctggagctgcgtattg
+ ggcttagggggtagactgggggcctgatcttgtggcggctcgtgtaagagactcattaat
+ tcttgtacgaccattataaggttcatcgcggcaggatcccagcagtggagtatgggcagc
+ gcaatccacccattggagtcgatgtattcttgaatgggcaaggaactactgatggtgttc
+ atgtcgaaattttccaagtttatacttataaagggaggttttaccgggtacatactgggt
+ acccacatgatcacaggaatactgtgaggagatgaaccatcctcaccagtactaatggta
+ ccatatatggacaacagtagttgtggagtgccgtcagagtgcgtaaatacccttgttctt
+ ggccttaaactgtgaaaattgtcagcagcgctaaagaatcgtggaaagttgttcttccgt
+ cgttataaatgggttgaatgaactttgaacagccaattgaccaccgcctgggaacagaga
+ tcttgccgtttgcagacatcgttgctcttcactctctctcttttcttggccgttaagata
+ ccaactgcccacaaaattttgccgtaactttttattctcgctctgtgaaaaaagactact
+ acattttttgaccttttatctatactttaagaaacaaaggtccaacgaataataaagaac
+ aaagagtaatgagtagtttgtatgtgtcataatctcttctctccgcttcccttttcttcc
+ ttcttcttctactaactctctgcgcgtccaatttagctatactgttgttggtgtattcat
+ agtggttagtgccatgtccgtgctgttctggataatggctccaaagaacaaccaagcgta
+ tgtgcaacattttttttacgatgtgtctccatattactagtttccctttcttttttacta
+ acaagagtttgtttgttttgttttgtcgtacagtgtttggagaagtacagtgattttgac
+ attggccatgatgtttttgatgtgggccattacgttcctttgccagctccatcccttggt
+ ggctccacgccgttcagatttgagacctgaatttgcagaataaagcgtcgagtatatcaa
+ gccaagtgacgtttcctgagtataagaaaaataagcaaaataagtatacataaaaataaa
+ tacgaatagaaatatagtggcatctaaacaacccagagattttttgaaaaacacaatgag
+ tatatactgatagacaaccacacggtctttgaaacgagacgccgtctacggttgtatcaa
+ aatgtgtaaacgggccatttcaaagcttgatcttctccataattagaacccatgtcccgt
+ ggaagcgttttaaaagacatgaggcagaaacaggagtgtgatatccttgcatgcgcaagt
+ tggtaatacactgcacaaaacacgcgtgtgcacatacacttatgtggttcaccgtgccgc
+ tgctgtgtttatctgttgctcgacatgtgctgctaatgattatgttggaatagacaaaca
+ atttaaagtaatttcttctttttttatagtgagaaatctcaaaagaaaaaaaaaaaataa
+ aaaaaactaaagagctgccgggtgagcaacgactaaaagaagaaaccatttagggcttca
+ tcagcaacttgtaggaggagaaagcagtatataactagccgcaatatgtttgtggtggat
+ tggagcgtgcagctatgcatgggagtgatttcacctctcttccgtgccctcgtgcaactg
+ cctttatcaattttcgtatggaatgggtttcagacttgtgcattgcctatcaacatccca
+ ttagttgttcctgggacgtcgctgagccgtttagttgcacaaacaagcacattagacttc
+ tatgtggttttgacgttgtttcaatattttgctgtgctttgtgcttttggcagcatcata
+ ggactcatctttggatttatattgggtgtgttccactcaatctgcggggtacccagtgta
+ tacataagtctagaatggaaacggtggtttgctccgatacgtacggtccttgaacgtgct
+ tccactagtattgtcaacattatgcgaggacaaactattgcgccaatacccatgcctaag
+ cccaatcccacgcatatatcaaagcctaacatgaaaaaattccatgatgagcctggagct
+ gatgatatgactataacgcatgatgtgaactgctacatcaccccttgccaaacgcctact
+ aacgaaaaaattcagcattataataatgattcattcaacacgaccaccacagacgatgaa
+ cccactgatatttgggataggtctgacacttaccaaaactcattcgtcaccaatgaaact
+ ttgatgtctctttctaatagagctaagcttcgaagaaatgccagtgatgcggacatcgtt
+ aatataaagattttacgaaaggaattctaggtgatattgcaattacttcttctcatgcac
+ taacaagtgaatgatagaaatatgttgagttgctaactgcctgattttaaataagtttca
+ tattataatcttttagcatatatatatatatattgatcctctctcttctttattttctgc
+ cagtaacccatgtgtgaagaagaaaacataaataaaaaagcagtagcacatggacacatt
+ cacgcccgaacacttctaaaaagcagcccacacaagaaagtagatataatgtaggacacc
+ cagcttgtccataattgctaatagcatactcaggataacatatattaatgacgactcgtt
+ tgctccaactcactcgtcctcattacagattattatccctacctctccagaaacccttca
+ atataaaaagggcagatgtccgctgcgaacccttctccatttggcaattatttgaacacc
+ atcactaagtccctacaacagaatttacaaacatgctttcatttccaagcaaaagaaatc
+ gatataatcgaatctccatctcagttttacgatctcttgaagacaaaaatacttaattca
+ caaaatagaatattcattgcgtctctgtatttaggcaaaagcgagactgagttggtggac
+ tgcatatcccaggcattgaccaagaaccccaagttgaaagtttcttttctacttgatggc
+ cttcgaggaacaagagaattgccttccgcctgttccgccactttattatcgtctttagta
+ gccaaatatgggtcagagagagtggattgccgattgtacaagacgcctgcttatcatggt
+ tggaaaaaagtcttggttcccaagagatttaatgaaggtttaggcttacaacattgaaaa
+ tatatgggtttgataacgaggtcattctttcgggagccaacctttcgaacgactatttca
+ ccaacagacaagatagatactatctctttaaatctcgaaacttctccaactattatttta
+ aattacatcaactcataagttccttcagttatcagattataaagccaatggtggatggta
+ gcatcaacatcatttggccagattcgaatcctactgttgaaccgacgaaaaataaaaggc
+ tgtttttaagggaagcatctcaattactagatggctttttaaagagttctaaacaaagcc
+ tcccgattactgccgtgggtcaattctccacattagtttacccaatttctcaattcactc
+ cactttttcccaaatataatgacaaatcgaccgaaaaaagaacaatattgtcattgcttt
+ ccactataacaagcaatgccatttcttggacgttcactgcaggatacttcaatattttgc
+ cagacatcaaagcaaaactgctggcaacgccggttgctgaggcaaatgtaataacagctt
+ ccccctttgcaaacggcttttaccaatcaaagggcgtctcatcaaatttacctggtgctt
+ acttgtacctgtcaaaaaaatttctacaagatgtatgtaggtacagacaagatcatgcta
+ ttaccattaagagaatggcaaagaggcgtagtaaataagccgaatggttggtcatatcac
+ gcaaaaggtactttggctttccgctcgtgataaaaatgatgctaacaattggaaaccctt
+ tatcacggttataggatcttcaaactatacgagaagggcgtattcattagatttgaatcg
+ aatgctctcattattacaagagatgaagagctaagaaaaaaaatgaaagcagagttagat
+ aatttattacaatatacaaaacctgtaactctagaagactttcaatcagacccagaaaga
+ catgttggcactggtgtaaagatagctacctccattttgggtaaaaaactttagattcta
+ tcatagtatgtcgcctgtaaatatggaaaaataaaagaaaaaatgataataataaatgca
+ ttatctaaaactccttaaatttgcaaacatcgcaaagccattaaggtagagaatgagctg
+ accttgttatagttttccttttggcaaccgtggtttccgtcctacgtaaacatactgaac
+ ttgacaaagtatgaccaataggtgacaaattgtatcaattttacccaaaagggcgttgaa
+ ttaaatggtatctcgtaaaacaatgcattggatcggtattaccagtctacctaataaatc
+ gatagattcagccaaaaatgatatcaatgcaaactttgatgataaacctactgttgagca
+ tcccatccagtttatactcatttgaggcacatacttgaaacatttaagcagatttgccat
+ tacccacaaataatttatgtgttccaaatagaaaacgccaaatttacctgaatttttcaa
+ cggtaaattagagatggaacagccatatgtaaatatgcctaatacaaggaacacacttat
+ aattattatggaagtaatggaaatgccctgataaatatgtttagttgacctataatagta
+ taactgccttaaaaccatcatgcagcaacaggaaacgcaaaaatcttttagaatgaggaa
+ actggatataggtatactacgggcatcattcaacggataaaaaagtgggaatctttgaga
+ cagttgttccctcaccagagatgaataacaataatgcaacgcgcaatacaagtacagacc
+ attaccaacaaaatctagcagaaataaatcatatgaaagcccgtaaatagatcgatgaag
+ tttattatagcgcttttgacatagaatggcgatgatctatgtaagaaatggaagttttct
+ cccttagttagtcaaacctttcatgtatactttctccttcaattatgcatactgataaaa
+ aacctgccaagccctgcagaacaataacaagcatgtgaatgattcaatataattgcttcg
+ cagggctcctcaacctctcctctggaaaaccttacctcaatgaggttaaaaattgtgggt
+ ttttcgttcgcagaggaaaaaaaaacttgtcaaaacacgcctctgcagtagaacacgaat
+ tatacataagaaaaataacgatacaagaaaggcgatttcgatttgggactaagtgttaca
+ cgaaaggcatagagtatcaaaaaaaggaagtcaagatggattacgatagctctgatacaa
+ tgaacggtggttcaagtaaccccttaatcactaagatgaatacaatgaaattattatatc
+ aacactatttggataaagtcactcctcacgctaaggagaggtgggctgtattgggtggtt
+ tgttatgtttgtttatggttcgtattacaatggccgaaggctggtatgtgatttgttatg
+ gtctaggtctatttttattgaatcaatttttagcctttttgaccccaaaattcgatatgt
+ ccttacagcaagatgaagaaaacaacgaattggaagctggagaaaaatcagaagaattcc
+ tttccattcatcagaagacttaccagagttcaaattctggtataacagcatttagagcca
+ ctgtcatttccctcttgttgtcggtattttcaatcttcagtattccagttatttttgccc
+ attctttatttgattgttattttcattattattgttttttttaactatgagaaggcagat
+ tcaagcatatgataaaatatagatatatacccttagatatcggtaagaagaaactattct
+ cattcttctaactgatataccctgactccacatacctattcaacgaacacattcgtatct
+ taattatatgtaaacaaaattatatgatagttacagaataattgtgattattttctcctt
+ ctactcatcttctcttatattggaagaatacatgaagttattgattaagcaaaattgaag
+ ggaagaggctcattttcttccatttataaatggagagcttgagatgcctccaattgataa
+ atttttgtaaagactttcaagaaacatgtctaaaaagaattgacagcttcgaacctttct
+ gcataattactaataccaaattagtgagaaatggggtatgtcgaggaataaaggatgcaa
+ aagagagagtaatgtcatgcaagttgtgagtttgaaatgttcacacgtttataatgtcat
+ actcgtggcgtgcatgtgttttagtgtattagtttcagttattaaagaacttagtaaatt
+ tatcatcgaacgcattttgattaagtggcacaatagacttcaatggaaaatgaaaatcat
+ atctgtagcaaaaacgacagcgaagataacggttacacaattatcacgttgattcgtcaa
+ cttaaagatgacctttttacaatttatcaataataatagacaagaaggacaaggttatat
+ ttcagaaaaattattcaaaactaagaagaatgagatgataagaaaaacagttaccaattt
+ agtagctgtaagattaaaaaacttatcccacgaatttgatgtaatagagaattatctgcg
+ ctatatagctagcacagtgaacatctatttactgctattaagcgccactttaacaaatgt
+ gccagaaaacttttgaaagaagcaatcgactccaaatcaaactcagaaactgctacggtg
+ gttctgcaagaaggattttctggcatttgcttattgaaagcgtcttcaattatattaaaa
+ ctaaaattgaagtttccaaaaaagaaagatagaactgatattagcaaattgtgtgacaag
+ aaagaacggatgacacagtggttagaaatttcaattttgatgaactgagaataataatat
+ tatgctcccctggattttatgcgaagacactgctgaaaaaatttctgatatatgcagaga
+ aggagcaaataagccttttaaggaaccgagataagctgttttcgcccattgttctagttc
+ tacaaaatatctacagggaatcgatgaggttgtaagaaatcctgaatactctatgatagt
+ gcataacactaaaaagttgaaagaatcccgtatcatggacgatttccttgaacatttgag
+ caaagacgataacaaagcatggtatggcgcggaagaaaccgagagagctgcaaaattaga
+ tgcaatagaaacactacttattacagatagtgtactaaaaaggaacgacgtgaaaaaacg
+ tgaaaaatacctagacctaatagagaatagtggaaacaacaatggcaaaatattcgtact
+ cagtacttcaaaaatcacagtgagcaacttgacaaaccaacagatataggctgtgtctta
+ aaatataccatccaataccttgatgaacttttcaaagatgactaaaataagtgaaatttc
+ aacattaacttcgaatttttttctttttatctaacacttgtcaaacagaatataaggatt
+ acttgtcttctttgctacatattgctaccacttctattacacaatagtttcaatagcttg
+ cagcgtagctaaactctaaaatttatctaaatcactcatataaaccgaacccttcccctt
+ ccgcttatagtacagtacctatacatttcataaacatggcatgcgatcagcgccaaacaa
+ tatggaaaatccacagaaagctattcattgaaaaaatagtacaaataagtcacatgatga
+ tatttgattttattatatttttaaaaaaagtaaaaaataaaaagtagtttatttttaaaa
+ aataaaatttaaaatattagtgtatttgatttccgaaagttaaaaaagaaatagtaagaa
+ atatatatttcattgaatggatatatgaaacgtttactggtggaagttttgctcatatat
+ tattattcaatagaagtaataaagaaaaagttggtaaagcaacttaacagtaaaaaggta
+ atgattgaaaaagtttttgaacatctaagctatatgttgatgggtttacaattttaccat
+ tagtactcatgcctatacttttctgttcgtccttaatgtccgcgatttagagcaatcatt
+ gaaagtactagatacattttagccagagaggactcgttgacgtagaattaaaattcaaat
+ gaatttccgccccattcatataccccaaataacaaacatattaaaacttcataattattc
+ aaaatgtggagtagtatagaagagcagtaccttcaaaattgatttcttcagtttcccacc
+ cgggatccacttgtcatgcggtgagaatcgtatattgcgtataatccgtgtttcatcacc
+ cacattatagtacaaacctactggtgtaaccattatcatattcatgacttgtagttaaaa
+ atctttcacgataaattgaggaaaaaatcacgttaaatagtttcatatcattctttagtt
+ aaagtctgtgcataaaaagaggatcgggtgttgtactcgttctatcaaaatattgttatt
+ gttgcattatcacacgaattacaatcaatttttttatctacaggccagagtacctgaaat
+ aatggaattagattcaaaaaggatagcttaatcggtctttatactaattattttacgccc
+ agaacgaaacataacgtgcattaaatctcactgtcacaacattgtatataatggtagcat
+ cgtactaactattggcattcattcaatattatacaattatcttattaatactttctcaag
+ tagtcttgaagaccagcatgtaggaaggtgatgatatgctccgtaaataattggaaatat
+ tgagatataataacgatgacaatactaacaaaagtaataggtgagctccgccaattgatt
+ gttttgttttgaatatatattgatgcttaacaatttagtcgtcaaatagcacctgcaaaa
+ cctacctcttacaggctttttttcaatttactccccatgtgggagttctacgtaaaccac
+ ttagaatgcagtttctcttctaaataacatgttatatcttctgaagaaattgaaacatat
+ tgattcttatggtgagagcaatttatcaatgttttatcttaagttctactgcccttggaa
+ attatgtcctttgcttcccattttatatggaagaaacaaatcagtacaaaatgcttggtc
+ atgtcataagcttcatcgatttattgacttcccaactccttcggctaatatgaatttttt
+ tttctttataagagtttgctatggaactgtctcgctgtttatttcaacgtttctaaagaa
+ aatatccttttttatttatgtgaacttcactgtatacttttagtacctttcggtggattt
+ cttctattttactcgatcacttttcttgactgtgcactgtcggtacaatccttattgtat
+ taacttgcgaagaatatttagtcgttcagtagggttcttttggaagaaatgctagcttaa
+ catgagtgtgatctcttctagaattaactcttttgtttatccatgtaatctgttaccaac
+ acataaaatgcattgatagaccaaatatcaaaagcgtgcaggcatctcttatctaacagt
+ aagcaaaattttgcctgcgctttacttcaattttccattctttttctgagtcttcgtcag
+ tgcatatttttttactgacagcaccattaatcaatcagctcacctgcgcatttccaataa
+ tcgtgctcatgaagatcataaagaaacgtccaaacatgaaagtgctcctttagaatttct
+ aaatccaaacgatatatgtacccctcgttgatttgtcctaaaatgagtatagcttgattt
+ tcgttatatattttttcatagtattcagaagaggagagaggatgtgatttatcaatatct
+ gcattaagtaatgaactttgccctctactgaggatattcccactagaaaactataaaaaa
+ tgggagaaagtcaatcgctgttcaggattcaacatagttgttgacaacgtatgataagaa
+ ggccagtccagtcctcctaagttaggttaagggaggtagggaaaaactgaagtgtaatga
+ taatcgaggatgaaagtgtgcggccagaaaaatcttttctttttatattaatctgtaata
+ aatacaagacctatatattaaagtaaaacaacattgaatcagatcgcatactggtgtaga
+ cgtgaagtagacatatgataatcgagcagggtgctcaatttggtgtgggacacgtttggg
+ aaggttaaaagaagaggtatatccggattacatttatttttggcgtattttaataattta
+ ctaaaagcgcgaatagcttcccattagcaacgctcagggactctttctaatgtttatagt
+ aatactttaataagatgagtaccaattgcttctctatgaggcacaatccctaaccaaact
+ tttatttagaattaccggtaaactccgttttcttcatagattaaatatttggctaagaca
+ acaaaatgttctttcattttaccaaacgagaaggtgatagctgtacgccagaaaagaagg
+ taaaaactctcgccaatccagcaatacctgtacctctagttttcgatcgcattatgtcat
+ tatgtaaaaccaaggagagaagcaagttacggtgttttcatatagaataatttgagaatt
+ cttaataacataagatatataatcaccaccattcttatgagattcaacgttgaatggcgt
+ tgctagacatatgagctgaggaacgactattcgcattttctttcaaggcaattaattgtc
+ ttgctctgaacccttcataatgaatgtaggaagttgtttcgattaagtcttggagatgag
+ ttcgaatcaaaaattcccttaaatatacaaaatcacactggttgatatcctcaacattaa
+ tagcgctccaacgagtttttcttcccctgaaggtttcaccgtttatttcaatctcattct
+ cagaaccaaccactgcaaacggaatgatagatctaacacttctgtttagttctaattcct
+ cgtcagttagttcttccgaatcataaggataaatcttgaaattgtatttttcgaattcat
+ tttgaataagctccctaaactccgttctttcatctaaagtcaatgtatccgacttgccaa
+ taactggtataacatttgctatttctgtcaatcttttcaaggcttcaacgtcaaggcggc
+ tcaactcctttccatttggttgcaggaaataaagaattgcatgaactcttgtatcagtaa
+ taaacctttcacgttgggctgtcaattctttacgtaagtattgagaatgttgttccttaa
+ tgtacttcacaataggctcccaagctttagaattgtcaataaagtcaccaaatccaggtg
+ tatctataacattaatattcaagcgaacgcggtcctccacaagagtatgagtagaaattt
+ tcatttcagttgtttttgtaacaggcagggcagaaatatcatcaccagtagcagaatcaa
+ tcaaatgtgaggcaaataacgtatttattagagtacttttacccaatccggattggccaa
+ caaccattatattaaattgaaaacctttcttcaacagacgatgttcgatctgattcgtga
+ tggtatcaaaaccaacataagaagcaggctgtactgagctgagaggatccattaatatat
+ ttatagagtgcttgtagtaaccgtggggcttggccttgtattttcttgtctttgcctgaa
+ aaaagttaagtattactgatacatgtttacatatattctattcagcgattctgatctatt
+ tgtggtctctcgaggaaatctcatcgtatcatacgattatttccgcaagcattcagaaag
+ caaatcttaatataaggcacaatgttcccaaataataccttaacttggagacaaaaacac
+ cagggaatatcagcaaacaaactatgaattctttgatttttggtaaacaattagcatttc
+ acaaaattgtgcctaccactgcaattgggtggttggtaccgctaggaaatccttcactgc
+ agattccaggccaaaaacaactgggatctatccaccgttggttgagagaaaagctacaac
+ aagatcataaggacactgaagataaagattttttctctaataatggtattctactagcag
+ ttcctaaaaaaaaagtatcacaccaaaaaaaaaggcaaaaactttacggtccaggtaaga
+ agcaattgaagatgattcaccatttgaataagtgcccatcatgcggccattataagagag
+ ccaatacactgtgtatgtattgtgttggacaaataagtcatatatggaaaacgcataccg
+ ctaaagaagaaattaagccgagacaagaggaggaactttccgaactagaccaaagagtcc
+ tatatcctggtagaagagataccaaatataccaaggatttgaaagataaagataactatt
+ tggaacgtcgcgttcggactttaaaaaaggactaggctcatctttcatctagtggtttcc
+ tttttaaataaacttaagcctgtatatagaaacttaatttcaaactgtatatttattgtt
+ taagacagcatcaatggtagaatacaaggagcggaaacagaaagaaaagtcgaaatagaa
+ tttttcgcttttggcttaacaaaatgaggtctttgaaatcaagagttttgcttgacaaaa
+ aacaactatatttcaaaatattaggaaatgatattattagtacaaggacttacagaaata
+ tacttttaatataaaaaggaaaaaataaaataaaaagagttaattctttattaattttcc
+ aacgtcctacattgatatatctgctgagatagaattaatgttagtaagaactaaatatta
+ gacgcgaactgttgtttcttacatgacagtacagcaggataataatcctttgtcctcctt
+ cttttctgcagttgtagctgctgcaggtttagtttgtctcttagccgcatcagaagtagt
+ ggtcttcttttcagtggagggcttggcttcttttgcagggaaaagttttttggcggtttc
+ gtagaatgttttaccttgaatttcagcaattctcaattcaagtggagacgcagttcttga
+ gccgtcaggtcctgcaagggtaccagcaccccatggagagccaccgtgtacctcttctat
+ actggctaactcagcaaatgaattcttataacccagtggtaaaaagataattccgtgatg
+ agctaaataagacaaacaggctttaacggtactttcttgaccacctccgtaactggaagt
+ actaacgaatatccccgcagctttgccgttcaaagagcccttggcccataatccaccggt
+ tttatcccaaaaggcggaccattgagccggcaaattaccaaacctagttggaacaccgaa
+ caaaaaggcgtcatattcgagcaacgttttctcagtggcaacaggaatatcttcaggttt
+ ctgaggagcgttcatcttggtgaggacttcatcaggtaaagtttcctcgaccctgtatat
+ atcagctttaccaccagctgcctccacacctttcttaacagcttgggctaaaacgtctat
+ gtgcccgtaggtagagtaagtaattatcgcaatctttaccattcttatagtgcttcaatt
+ gagtctgtattgttttgtttactctctttttttttattgatactaacctgacccctcttt
+ tcctttttcctcttcttgttttctttctgataaagtttacaaacttcaaataaaaatgaa
+ cgtaaaacaagtaaaaatgtaggatgttatagtatcaactagctgccggaatagtgcaaa
+ ttgtatgaatcgaaataagtacgctagccaaggcagtaaggcctaatgcccttaggcaaa
+ tttattatttatattaagttccaagaaggggaagcgccttctttccccgcagtacagtat
+ tttacccggaggttcttaataagttattactaaaagggtcataacggatttattaataat
+ gggattataaaaataatgcgtgtgtacctagttatcgcgtgatagcttccgcaattttcc
+ aaccattagtaaattttatgtaactaaaacgaggaaggaaatagtaacgttcattttctt
+ aattataaatatcattatcatagtgtatggttaaaagaatatattattgtattaaataag
+ tgctaaatactaaatggtcatgaggaaagaaaaatatgcagaggggtgtaaaagtaggat
+ gtaatccaactatagtttgctttcaatgtttttgaccaattccttgtatttctcagtaga
+ ataggactttggcctttcaatggaagcaccgatggccctatcagtgatcaattgagcaag
+ aataccaaatgcccttgaaacgccaaataaaacggtatagaaagaagattcttttagtcc
+ ataatattgtaataagacaccagagtgagcatctacatttggccatggatttttagtttt
+ accatgttcagtcaatacgccaggtgctacctcgtatattgatgaaactaacttgaataa
+ ttcataatctggaaaatggtccatggcaaacttacgctgagccatataacgaggatcagt
+ tttccttagcacagcatgaccataaccgggaatgactcttcctgagtttagagtatccca
+ taaatatttttcgatcgtatctttagagtagtcatcatttacctcttctttaagtgcaaa
+ taaccattctagtacttcttgattagcacgcccatgaagtgggccagccaacccgttcaa
+ acctgatgcaagggacagataaggtgatgatagtgctgagcccacaagatgggatgtatg
+ tgcagatacattaccaccttcgtgatccgaatgaatggttaaataaagtctcatcaagtc
+ cacgaaatcttcatccttagaaccaatcaagttgaccagatttttagcataatcggcatt
+ tgggtccacttcacccattttgccatctttgaatacattacgataaattttagctgcaat
+ aactggcaatttacccagcaagtctagtgaatcttcaaaagtataactccaataatcttg
+ cttggaaattccttgagcataagccttagcaaactttgactcgctttccaaggcagttac
+ agcaatagagaattgagccattgggtgtaagtcctttggtaaattatccaaaagttgaac
+ gacatgactaggtagttccgatcttgacattagatcagctgataagttttcaacttgcgc
+ ttgagttggaacctcgccagttagcaataaccaaaagagagcttctggtagtggttgtga
+ gcttccttttgccttgggcaggtccttttgaatgtcggcgatcgtacgacctctgaaacg
+ aataccgtcttctgggtccaaaacggaaccttcccatacgctccctggaatacctctcat
+ accaccatatacctgttctaatagaacatcgctaattttagttttgccatgctctttaac
+ gaattgccttacatcttgagcatggatggggtagatttcgctaaatctctcttttagagt
+ cttttcttggcttgaatttgattgtaaatatgatgcaacatttctgtttgaatttagata
+ aggaactgtcatttttcttgttactagtattattaaaacaaaaagttttgagaacctgtt
+ atgatatgtgttgatattgttccctgaaacttaataatgaggaacgaacaccatatctaa
+ agaaccaagttttaatagtcttatatacgctcaacgtcgagaataatgggcccttttgag
+ agcatgactatctttagattaacgtcaccaatttcatacgaatgtaacgctcaaaaggta
+ attgcgcttaccattgttattgtttttccgctgtgcctcccattgctgacgccctgatag
+ acgtctagctacggaaaaggtcacactttttttctgcaagaaaaaataggtcgtgacctc
+ ttcgcccctagtcgatgaccaccggataaatgcgtcgaatgtgcgcgctattagtttgcg
+ ctcttttgagataccgccaagccattgattgaatgacccaaggacggatttgttcaacac
+ tccaaagcggcttatgttgtggcaacaactctggggaaagtcagcgcaatgttttcccct
+ aaggattcgcgcgagccgtgtgtaacaaagttccgccggtatagcatatctaaaccacag
+ aaggacgcttatacggccgtcttccacagtaattagaaggacattagtcggagtgtaaat
+ aagacggttgcatacgcgtatgaacatgggccattcattttacggaaatggcaacacatt
+ ttcttgtttttaatgtaaatagtacggagttcctttttggtgcgctgaatgtatgccgta
+ gtttggtttcttgccaggcgtgagaagagaggtcacacatgctgttggggttcatgattt
+ tcagccttcgctctgctaagtaaaaggtccattgatccgaagcgcttcctccaatatcgc
+ agtgataaacggtcatgtaagcctacagtctttaaactgtagatgatttaatggattatc
+ tcatacattgaatatcacacttcagcagttcaagttttcattaacattggtcttctccag
+ cttactctgtaaagtccgctgcagttttctagaaatcataatctgtacctccattcagcc
+ atctgaggaacctccgaaataacgagtaaatgttcaaacatgccatgcgccttggtttat
+ tgtacgagttttagcatattgcaaaaaaaaactgctcctgagtacacttgttttacgtcg
+ gaaagataaatgccgtaccatttctgccagtagcgacaccacacattgtaaaaacaatgt
+ ggggcgagccgggactcgaacccgggacctcccgcaccctaagcgagaatcatacctcta
+ gaccacacgccctcctttgaaaaatcccggcagaacagcgcctgaagtctgggatacagc
+ cctattttccctgccatcgtcattgacctttttcttaacgaccagattaacagccagtta
+ ttgaaagtttgcgaacgtgaagtaaaactgcaaaatggtcgttttaagtcaaaagtaacc
+ ctgcgtcacacatgagaaacattatcgctaagttgttgttactacttcttgtaattaact
+ tactgtcgcattccaaatggactgcgaaacagacacgaaacaattacgaaagtgacgaca
+ gagtaaatattgtcattaaagagaatgatgacctccgggtaaactgtgagaataagtcta
+ gaccattcgttaaacttcatctttaattgagaagcatagtaaaaaataattaacaagtta
+ acacaactatcttgacccatgctaccaaggacataagtaatatcctccaccacttttact
+ caaagtatatctcattctgagaaatgggtgaatgttgagataattgttgggattccattg
+ ttgataaaggctataatattaggtatacagaatatactagaagttctcctcgagggtcta
+ ggaatccccataatggaatctatatttctatataccaatattacgattattcctcattcc
+ gttttatatgtttcattatcctattacattatcaatccttacacttcagcttcctctaac
+ ttcgatgacagcttctcataacttatgtcatcatcttaacaccgtatatgataatatatt
+ gataatataactattagttgatagacgatagtggatttttattccaacacattcgagcta
+ gtataaaattgtgaggaattatcgggcatcttgtttattaggtacgcaaattgtaataat
+ tgccaagttgtgctttttagttgagaagtattccgtcctcatgaaatggctaatataagt
+ atggatttcataaatattatccaaagtaattatacacacttgttttttgctatgtcttgg
+ gtttaagagaaaatgctacatagaccgtaagaacgtcttcaagaatgcaaggatgctgca
+ agcgaagttgttcccgttatgaagcagaaagggcacatcaacatggttcaaagatttaaa
+ tttcggcacttgttcttgttcactatatgttttaagattcttatcgtgcaagtattcaat
+ tatatcctctaagtgaccgaacaatcaacacatcaagatttttgtagttttgaagttcca
+ caacctacaggttagtgctatttggtctaatttatacatgggaactatgtgaaacagtaa
+ tgttagcacaatggacctctcaaagcctaagaaaattttgctggggacaaaaagtggttc
+ ttgatggtaaagaagtagtttagcttcataataatttctttgtaaaatttttctcgtagc
+ acactaagctttcagaaatcaaataaggcagaggaagtggttttgtcgtattcaatgact
+ aattgtaagactcgagtataacttatgcagtttccaagaagtaagcaaacagtttctctc
+ aacctttggtgacacactatgcgcatgcaagtcgtaatgcttctctatttatgtgtgggt
+ ttattaaaatagttagaattatctaggtttgtgggaatttggccatgaatattgaatagt
+ aacaagaacaaagcgtgaaaatgttgaatataatgaaagctgaagacatcaaatcatgtt
+ ctccaaatctttctgtctcgtatattttctttgatatttgttttatgcggaaattgcagt
+ ggaaacaatcgcgaaacattcgagaaagaaactaggtagctacaaaagtatggaaaaaag
+ ggtaataagcattcctttttttttagccgcaaagcctgcttaaccatacatgaagcactt
+ taacatatagatgctgctcttcactgttacattatatacagaaaaaagtgctcgcaataa
+ gaaacaaaccttcagaaaaagaagcttatcttgagcacaagtcattcacagcaaaatttc
+ ttttttcagttgctacatgttaattatggagtcctgagagtaagtaactagccattagaa
+ ttaatttccttcccttaaaaatgatagggaatgcacgacaatacgttaatgtgcacccaa
+ cttgcaccacagcatattctatcctttagtttgtacaatattattagctgtgagaaagcc
+ ccttcgtgtggcacaaatttattaatctctgtcaaaatacttctacttgtttatactgta
+ taaacgattaagttccattatgaattgtgaatttagaatcggtttggtacatcgctcagt
+ ataagtgctaaataaaattatcaagatttatatttcatggtatatcggcgtcaggatatt
+ gctttctccaatattcctgttgtgcttgtttctcaacttctactgctttaacaaagtatg
+ cttcgaatattgggtcagaactgaaactaaagcattttattacctcatttgaagcctcat
+ ttgctggtgcatctatctgcgtacttgtggcgtcctttaccgatgaatttacattcacac
+ ctgggggtttttctttcacaagaaatctgaagaatttgtagcatctttcactactataaa
+ agtaatagcgagtactccataaaccattatcaagtaaatattgattcatattgtctgtga
+ ttgctctccattcttttccctttaccgctggcctgtgggtgataacttccagcaaaagct
+ taattctgaaaccctcatccgagatatacacattaaacggaatcaacgtagacaaaaaca
+ aaatagaagtgacgaaaattgaaaaaccaaaaacgacatcacaagctttaacattatgaa
+ agatcacaactattaaaattgaaactgaaagtgccaagaaggaaaatatcatcggtttgt
+ aatgagccatctcgtacagcaagtaggtcatatatgatttaaaggtgtcttcaggtaaag
+ tgatattttctgcggatgataacacattttgttcccctaatgtatccaatttgacatcgt
+ cattattactgttgttgtctaaatgaggctgcatgcagatacggagttttgtttcttgtc
+ ttgatatggtatagacaaccgactggtgtatataaacagaacacagacttacattttcag
+ gttcgaagttttattgaagttcataaactctttttaaattattttgtaaacttgagctat
+ tcaagcagtattcttatttaacctaagcatagccccggatacaaggcgtataatacccaa
+ ctaacttaaaatgagagccacttactgaacaccttggaaggtcattgtttggttgaaagg
+ aatattattaactaagccttagaacaaggcgtcacctaaatgataatggcttttcttctg
+ ggcgcggtcggggcttcgactcatttgatactaggttaattacccttttacgcataccta
+ aatatttgtaactttagatctgtccactaaaagcgtttcgatgacttagttgtgctacta
+ atagtaaactctttgtgcaggcaaattatatcatcgctggctagactttttgtcaaaagt
+ tctgtttatacagtgggcagctagagtgcttaataacaaagatattgtcctttagactta
+ cacaagagcagaagcaaagataagcactgtttgttctgcttccatcgtttttgctgaaaa
+ ttttgcatgcatatccaaccaatatttaacacctggattatctcggttcaaagactacaa
+ taaacgccccaataattgtgatgcttttttcaatccagcccgcacaaatgaataccgctg
+ ctattggatgccatcttcagatgtacgtccaaactgtggttggtatagtttttagaacct
+ cgtggtgaggagtgttccattctttcaaaggcaatgagaaaaaaaaacagttgcgatatt
+ agtcaactgttttgcatggtttctttgacgcagttgtaaaatcatactcgaagggtagtt
+ ggaaaattttcaacatatattagtaatactttttctgtaaaagtgacacaagataaaatg
+ actccatggccaagttggttaaggcgtgcgactgttaatcgcaagatcgtgagttcaacc
+ ctcactggggtcgttaatttttcttttctttttttttttttatctccgacggagaaaaaa
+ ttttttctcaacccccggcatggcgaatggaaaaatcacctggcacgtcatttcatcaaa
+ tcaacatcggcctccccgtgctaatatgttaacaggccgagctcttgccatctcacgcat
+ tttccgctatacaaatagtatatataatattacagcttttaaagagggaagaggaaacgg
+ agaagaaaatatactactttatagagatattcggcaattttttttagcaaaagtttatta
+ aaccctaatatagacctttacttggagtttcgattttaagaattttttatcatttttttt
+ ttttgacgatttctgtcggtttcgttataacctgttgtgttgttgttgttgttgttattg
+ ctgggttgttgttattcactttaacattatcgaataaaatttggactttcaaagtcctgt
+ tcaataagttgtactaagcctttaagcgtttttaagttacaaataaaaactggaaagctt
+ aaggtagctttctaatattaggctctcaacctttttttactctccatttaccgctagata
+ ttaactagtagtaagtattgtgactactattccctgtaatcaaaaaaaaagtaatcagat
+ tttattttatttcgacattacccctcaaatatatgactggtatgaatgataataatgccg
+ ctattcctcagcaaactccaaggaaacatgcgctatcttctaaagttatgcaacttttta
+ gaagcggttcaagatcatctaggcagggaaaggcctcatcgaatatccagccaccttcta
+ atataaacacaaacgttccatcggcgtctaaatcagccaaatttggtttacataccccaa
+ ccactgctactcctagggtagtttctaatccttctaatactgcaggtgtgagtaaaccgg
+ gcatgtatatgcccgaatattaccagtcggcatcaccatcgcactctagttcatccgcat
+ cattaaacaaccatattgatattaacacctctaagtcatcatcagctgcttctttaactt
+ cgtcagtatcagctttatccttatcacccacatcagccataaatattagctccaaaagtt
+ tgagcccaaagttctctcatcatagtaacagcaatactgctattacacccgcgcctactc
+ ccactgcttcaaatattaataatgtaaataagataaccaatacaagtgcacctatttgtg
+ ggaggtttcttgtgcataaagatggtacccatgaacatcacttaaaaaatgctaagagac
+ aagaaaagctaagcacaatgattaaaaacatggttggtgcgagcaaattacgtggtgagg
+ caaaatctgctgtccctgatataataatggatccaaagacgactttaaaatccaacaaga
+ atcctcctactctttttgcaggcttcatgaagcaggtcgtggatatggatgataaatatc
+ cagaaggcgctcccacaagtggcgctttaaattgtcctgaaagggatatatacaggtcag
+ atcaaaaagattccaaaaataatacgcataatatcactactactaaaaaagataggcaat
+ gttttgccgaaaagtatggtcgctgtcaagaagtccttggtaaaggtgcttttggtgtag
+ taagaatatgtcaaaagaaaaatgtttcttctcaagatggtaataaaagtgaaaagcttt
+ atgcagtgaaagagttcaagcgtagaacatccgaatcagcagaaaagtattctaagaggt
+ tgacttctgaattttgcatttcttcttcattacaccatacaaatattgttactacactag
+ atcttttccaagatgccaaaggcgagtactgtgaagtaatggaatattgtgcaggtggcg
+ atctattcactttggtcgttgccgccggaaaattagaatatatggaagcagattgtttct
+ tcaagcagcttattagaggtgttgtttatatgcatgaaatgggtgtttgtcatagagatt
+ tgaagcctgagaacttactgcttacgcacgatggtgtgctaaaaattacagactttggta
+ acagcgaatgtttcaagatggcatgggaaaaaaatattcaccttagtggaggcgtttgcg
+ gttcatcgccgtacatcgccccagaggaatatatcaaagaagagtttgatccaagacccg
+ tagatatatgggcatgtggtgtcatttatatggcaatgagaactggtagacaattgtgga
+ gttctgctgaaaaagacgatccattttatatgaattatttaaaaggacgtaaggaaaagg
+ gaggctatgagccaatcgaaagtttaaaaagagccaggtgtaggaatgttatatattcga
+ tgttagatcccgttccgtacagaagaattaacgggaaacaaattttgaacagtgaatggg
+ gaagggagataaaatgctgccataatgggcgcgcattgaaataaacgagtacttcacttt
+ caaatatcacgatattttggaacaattcggtattttttacttaatctagtacactaagga
+ atgctttgttatccggcattcgtatcttattcctcgcttctattgttctacttttatatc
+ ccgtttggctgattacggatcacgttcaaattggtaaatcccattaattaaaaaagaatt
+ gtaaccttatttaaaaaaaaaaatagtacataacaataaaaaaaaaaaagataataattt
+ tgaatttattgctagacattcttacgtttattgtgttcatgttaacaaaaacacatgccc
+ gcggttattatcgctttttaattgctttcttaaaaagagtcggttggtcagatagagcag
+ atagtaaattatagtgctatttaggtctgttaatttaaatgaaagaaacataaatgaccg
+ taaaaaactaaaggcaaaagcattaatttattttatcccgagcgcacaaatatctaggct
+ tgtcatttgtcctaatagttcttcgattttagtgtctaataacccattggcatagtcgtc
+ tcttgattgttggtccaaatattgttgaatctgcgctaaacgagtgtaaccatcagtaca
+ gaaccttagctgaatcttgattaaagcttcaaaacttgggtcaaagtaaggtactcttaa
+ tgaaactaactgtggtagttcagttttcaattggttattaagattttcgaaaatatcttt
+ agctaagctcaattctttttcagcccttggcagtttagaggcatctttagcaggtttgtc
+ cactaatctacgaacttttgccttcgcagcatcgaagtcttgtttcttatggtctctctt
+ ttttatggcctcctcaatttctttgaaatacgtcgaaaactttgttattggatctagtac
+ ggtttctcttaagggcccgtctaattgcttaacagtttcgctatcaaaatcttgaacaca
+ ttgcaaataatagttaccaacgttgtaaccaccaccagcaacatattttgaatcgtcata
+ gaggttagagatgacctcggcaatggtagtctgtgatgctgtcacagctctcaatgagtc
+ caagaaacctttggcttccttttgtaatgcctcacctgctctttgaagaactttataacg
+ acgttcttccatgtcatactctttatcaatggtcttgtcgacattcttaattatcacact
+ gtgaccagctctgttgatagctttcttaaaaccttcccaactcatcttgtatcagctttc
+ ttctggcttcttataataggatttgtatagttttctatatcacagcgtttccttttcttc
+ tctcacgtctgcctgttattaaatttgttttccctttcttttaattcggattctcataat
+ tgttcaaaatccgcgcatttggatcgtccagcccctgcagcgcccagttcagccaacaaa
+ aattcatagggacgtaaaggaaacgtttcatgtctgttacaatgaaaaagacaagtcgaa
+ tcacattatgaatggcaaacgatagacctttcgttgcctttcaattacatgatattcgtg
+ gcaacatcttctccgctaacctaaaagttttatcagaaacatatatcttattttttctga
+ agtaaaatcaaagtgcagggaagacaagtaaaggagtcagcaaaaaagtgaagtaaactt
+ atatgttaactgtttataaccgtaccttgcttacatcatctatattttcgtaatcatttt
+ tgtacattgtttatatcttgttttgaagttttagctttgaaaaacatttaattgcgaagt
+ aacataatgttattaatagtgtacgactgaaacagccattctatttaaacatataaaagt
+ acgattagttggtgtggttatataagtcataaaataattatcatattaaaaaattgaaaa
+ ataaaaaactactcttttttatttcaatagttctcgttattagtaggtcgtgctcttaaa
+ agattaccctttcagtagatggtaatgggaatggacgagccagtacatatgaattctgct
+ ttgtagcaacacctgcatatgcgttgtaccaagcaatgaaagcaacaacaactcccagga
+ caccaccagctcttgtgacaccaagtctattagcaaagtgaccaatagacaacagtagga
+ aagttaatgctagtaagaagaacaacaaaaagaacataacagtggatttcatggtacaaa
+ cggttaaaccaaacgtaaagatggcccaccccaacaaataaaatcctaaagcattattca
+ aatcagattcattgtcttcgtaagcttccaagataccaaaccaaggaatgtaaattgcag
+ cgaaactcaaccaaaacccaccgtaagaacataatgcggtaccaccaaaagtattttcca
+ aagctatctcccaaataccagcaatcaattgcaccaaaccaccataaaacatagcacaac
+ cgacgacaacattaggaacagtgatcccttgcgctctcgcattgaacatggacagcacaa
+ atgtcgtcaacgcgaaggctgaaagacctaagggcgcaggattagcaaatttgtgcactg
+ gagcaggagctaaccctggattcaaggtaccaccaaaggcttggtataagtcgctcttca
+ aaaacttttgacgcccaatatagatatattcattgttatcacctccagtgtaaatcttgc
+ ccaacgaatcatgagatggacgttcatcttctgcaacgccattaacgtcgttatcatggg
+ aactatagtatcctgctggtgcattctccaaatctgtgtttccgctcgtttgttccttgt
+ cagacatattatttgtttgtatatgagttgttttgtggttgtttagttatatctgtgtta
+ gctgtcgactatttgcctttttcttataaacagatgctagtagtttctgttgtattgtta
+ ttgctattggacgatatataaaatgaacctatagcaagatctcagggaaatacgatccaa
+ ccaaggtgctttatatacagcttggtgtctctgacaggttggagtacagaggagaagcgc
+ catagccgcccaacccgcgataacgaaaaaaagctgtaagaaaatgcagtcttccccaac
+ gtttgttaccccaacaataccgaaatagccgcccaaccgcaaatctggcaaaaaatcttt
+ tcgtatctcgctgcagatgaccttccaacccgcccctttttgtagtcacgcctttaaggg
+ cccttgtttggcttgctcacatggggctacacgtaaaaaccgtagcttggagaaccactc
+ agccgctcgcctattccacggagctccggatcaacgggcagtggaccctctcgaaaataa
+ ggaaagttgcggggccaaaaattattatttcttatgtaaaagaagagactatatacgacg
+ cctatctacttttgttccacaactatccacttaagggctagataccccattattaaaaac
+ accacattaaacagagccaggatcttgatgtcaaatacaaggttttggacaacaaatcca
+ aatgtgctcaagatagtagcgcctggaacttcaatatttaagccgtactttctctctttc
+ agcatcaatgttttgacctcattgatcaataaagattcgtaggcgtagtaaaacacagag
+ aagtttttcaggtacttgaaggccacgtttgtaatattcttagtattgataaatagtccg
+ ctaaacagtagtgagcccaaaagcaccagcacgcttaatattatggagttattcaagtct
+ tcaaaaattatgccgatggttaggatttccaacgatatcccaaggttaaacagtataagg
+ attccaatacatttaaaaaaagcattgtctttcatgtttaaaccagtcattgggtaaaca
+ atcaatgacaagagtataggtggtacaacacgtagcgggaccacttcgctcattatctta
+ ctaatgtagtatgcaagtggcgagtaatagttattggatctttcttttatgaaaatgatc
+ ctttccagagcgaacgagctgagacctgtgaatgtaacaaaaccgaagtacgttagtata
+ aagaagaacagccccattctgttctgaaaaccgctgatatcattggagacgttgtaatat
+ agtgttcccaagaataaactcaataggatcgtcagtaaataattacccaataatagttta
+ gggtttctgtacatgtttttaaaacttcttgaattcaatatcgataactgttgcagaaaa
+ ccagcagattgttgacctgtgggtaaatctccattcaaaacgttactttcctcatcacct
+ tcgcttaaaacttcctcgatctcctgtgaaagctctgcataatagacgctatctttatat
+ ttatcgtgtagtagtttggtatttaagtcaatctcagtagcacctcgccttccatctgtt
+ ccctctacatcttcttcatctctgagtaaagatctgatctcatctctatgagctgcaaga
+ tgagcccactctctctgtgttgtaccatcactgctagtaaatgttgtttggtgtatcgta
+ ttatcaatatcgttagtatccgtaccagcttctaaatcggaaatgtttctgattcttctc
+ cttttcccctgaggaccggcttcaaaagtaatatcaatcaaataatcagcaatattatag
+ ttgtccggacagatatatccctcatttctcaaaaattctgacacttttttggcatttccg
+ gaatagaccatctcacctttacttaacaggaccaatttatcgaataaataaaatatattt
+ gatcttggctgatgaatagatagcaccaatgtcctgttatagtcgctggataaccttacc
+ aaacattcaataacattattggcattactagcatctaaaccagatgtaggttcatccaaa
+ aataaaaccaatggagatgtcactaattcacatgcaatggaaactcggcgtttttcacct
+ ccactaataccacgatcaaattcattaccaataatacgatctttgatatcaataattctt
+ agttcttccaacaccttataaactcttgccttcttggcctcgaatgacaatgcttttggc
+ aatcttaacagcgcactatttaatacggtttcaaaaacagtcaaagtgggcagcaaaaag
+ tcatcttgatcgacgaacccgattattttcgagaaagatttacggtccatactaataccg
+ ttaacttttatggaacccgaaacgtgacctgttttccgtttcattgctaggatatctaat
+ aaagtagttttacccgcaccagatccacccatgatagctaatatttggccgggcttcacg
+ ataccacttatttcattcagcacagtttcttcaacaccatctgaatttatcgaggggaca
+ ctataagtgatattttcaaaacttaatgtcgccagtgtgtcatcttcattttgtaagaaa
+ ttattcaccgcatcttcatctggcaaacgaatgggagactttgaggaacccaatccattt
+ ctgaataacggagatttagaaatgtaaaaggtagcaaatgtaaaaagtgccaggaccatc
+ acagcagtcaatgccaacaccaatttcccttgccatgacactgttggatcttttgaagga
+ gatttgtaacctggaatctcactataatgaacacattcaccggattcacacttcaaagta
+ atataagggtcaccaaacacggtcaatatcaaatcattcatagaaggctcactgaattta
+ cattgccttgtttctaaatcacagctgaaatctcctggcccttttattgtctctgtcagg
+ aaatccgagatatctatagaccccttagcaccacacaacacagtgtcgggaacgcatttg
+ cattgaacgtcattacacttataatgggaggtattctgttccaagtcgtattcaaaggca
+ caatcacttaagccacaatagaagctttctaactgatctatccaaaactgaaaattacat
+ tcttgattaggtttatcacaggcaaatgtaatttgtggtattttgccgttcaaaatctgt
+ agaattttctcattggtcacattacaacctgaaaatactttatctacaatcataccattc
+ ttataacatgtccccttaatactaggatcaggcatgaacgcatcacagacaaaatcttct
+ tgacaaacgtcacaattgatccctccccatccgttatcacaatgacaggtgtcattttgt
+ gctcttatgggacgatccttattaccgctttcatccggtgatagaccgccacagaggggc
+ agagagcaatcatcacctgcaaacccttctatacactcacatctaccagtgtacgaattg
+ cattcagaaaactgtttgcattcaaaaataggtagcatacaattaaaacatggcgggcac
+ gtatcattgcccttatcttgtgcagttagacgcgaatttttcgaagaagtaccttcaaag
+ aatggggtctcatcttgttttgcaagtaccactgagcaggataataatagaaatgataat
+ atactatagtagagataacgtcgatgacttcccatactgtaattgcttttagttgtgtat
+ ttttagtgtgcaagtttctgtaaatcgattaatttttttttctttcctctttttattaac
+ cttaatttttattttagattcctgacttcaactcaagacgcacagatattataacatctg
+ cacaataggcatttgcaagaattactcgtgagtaaggaaagagtgaggaactatcgcata
+ cctgcatttaaagatgccgatttgggcgcgaatcctttattttggcttcaccctcatact
+ attatcagggccagaaaaaggaagtgtttccctccttcttgaattgatgttaccctcata
+ aagcacgtggcctcttatcgagaaagaaattaccgtcgctcgtgatttgtttgcaaaaag
+ aacaaaactgaaaaaacccagacacgctcgacttcctgtcttcctattgattgcagcttc
+ caatttcgtcacacaacaaggtcctagcgacggctcacaggttttgtaacaagcaatcga
+ aggttctggaatggcgggaaagggtttagtaccacatgctatgatgcccactgtgatctc
+ cagagcaaagttcgttcgatcgtactgttactctctctctttcaaacagaattgtccgaa
+ tcgtgtgacaacaacagcctgttctcacacactcttttcttctaaccaagggggtggttt
+ agtttagtagaacctcgtgaaacttacatttacatatatataaacttgcataaattggtc
+ aatgcaagaaatacatatttggtcttttctaattcgtagtttttcaagttcttagatgct
+ ttctttttctcttttttacagatcatcaaggaagtaattatctactttttacaacaaata
+ taaaacaatgtctttatcttcaaagttgtctgtccaagatttggacttgaaggacaagcg
+ tgtcttcatcagagttgacttcaacgtcccattggacggtaagaagatcacttctaacca
+ aagaattgttgctgctttgccaaccatcaagtacgttttggaacaccacccaagatacgt
+ tgtcttggcttctcacttgggtagaccaaacggtgaaagaaacgaaaaatactctttggc
+ tccagttgctaaggaattgcaatcattgttgggtaaggatgtcaccttcttgaacgactg
+ tgtgcgtccagaagttgaagccgctgtcaaggcttctgccccaggttccgttattttgtt
+ ggaaaacttgcgttaccacatcgaagaagaaggttccagaaaggtcgatggtcaaaaggt
+ caaggcttccaaggaagatgttcaaaagttcagacacgaattgagctctttggctgatgt
+ ttacatcaacgatgccttcggtaccgctcacagagctcactcttctatggtcggtttcga
+ cttgccacaacgtgctgccggtttcttgttggaaaaggaattgaagtacttcggtaaggc
+ tttggagaacccaaccagaccattcttggccatcttaggtggtgccaaggttgctgacaa
+ gattcaattgattgacaacttgttggacaaggtcgactctatcatcattggtggtggtat
+ ggctttcaccttcaagaaggttttggaaaacactgaaatcggtgactccatcttcgacaa
+ ggctggtgctgaaatcgttccaaagttgatggaaaaggccaaggccaagggtgtcgaagt
+ cgtcttgccagtcgacttcatcattgctgatgctttctctgctgatgccaacaccaagac
+ tgtcactgacaaggaaggtattccagctggctggcaagggttggacaatggtccagaatc
+ tagaaagttgtttgctgctactgttgcaaaggctaagaccattgtctggaacggtccacc
+ aggtgttttcgaattcgaaaagttcgctgctggtactaaggctttgttagacgaagttgt
+ caagagctctgctgctggtaacaccgtcatcattggtggtggtgacactgccactgtcgc
+ taagaagtacggtgtcactgacaagatctcccatgtctctactggtggtggtgcttcttt
+ ggaattattggaaggtaaggaattgccaggtgttgctttcttatccgaaaagaaataaat
+ tgaattgaattgaaatcgatagatcaatttttttcttttctctttccccatcctttacgc
+ taaaataatagtttattttattttttgaatattttttatttatatacgtatatatagact
+ attatttatcttttaatgattattaagatttttattaaaaaaaaattcgctcctctttta
+ atgcctttatgcagtttttttttcccattcgatatttctatgttcgggttcagcgtattt
+ taagtttaataactcgaaaattctgcgttcgttaaagctttcgagaaggatattatttcg
+ aaataaaccgtgttgtgtaagcttgaagcctttttgcgctgccaatattcttatccatct
+ attgtactctttagatccagtatagtgtattcttcctgctccaagctcatcccacttgca
+ acaaaaaaagtctaatcttctgcaataatttccatccttggcattcagagacatatattg
+ gtcaatcgggtttaattttgtttcttcttcttcttcttcttcaattaacaatgtgccttc
+ atcatttttcatttctttcagttcttctaattctcttggagagaatttgactcccatgaa
+ aaatttcttgattatctcagaagaataccacctctttctttctccatacccaacaatctt
+ cgctgtacgaaatctctccactattctttttaagaatagcttttccaagtttggcgttag
+ ctgtaaaaaacaatggatatagccatccttgtacaacttaatacaaagcgtttccatgat
+ ttttgccaactcggtcgtgtcattacaaaacggcttgaaaaataaaagatcaatgtcacc
+ acacttggaatagcccctattataacttccctgtaattcgacttggcattcaggatcaat
+ gccacgcagcgctttttgaaccttctttaaatgtgtgaaacattcattccgagacatctt
+ gcataaccaatcgtcgtaatatgaccaaccaaataaaattgtccaatctgatacaaactc
+ ctctggatccttcttagctgcaacacaaaagctttcaaaatttagaagattccagcgttt
+ agcaatttccgacccaatgccgtaacaatttttgaagtattttaacttgtcttctaatcc
+ cactgaatcatttaaacctggtaaaactcccgtatctaatataacttgtatttttttggc
+ aatactaggcccgatattccttaatttagtatgtgcttcttcaccggaacgaacattgaa
+ atcgcaattttccatcgactgtttagccagtctataacttcttgcacgaaatttctcacc
+ ttcgatctcatatttttttgttaaccttttcaacgctccgataatcaattcattgttttt
+ gtacttcgttttgtcagtaacaagggaagctcttcctttatcctcctgtaaaagtcgctt
+ aagaggttttgaagcttgtatcatatccttattgttgctgtcatcattcctttcactttc
+ tacgtcagttgaaatctcactctctttatcagtagaactttgtccgtcatcagcgttatc
+ tgatataacgataatggatggttgaaatttaatcaaatcatctttttgaaacgcaaattt
+ atcattctggacccactttgtgatgcaactggttttcacacattgaatgcctgactgttc
+ aatcttaccaagaaattcatcgacatcatttaagcctgcttccctttggaaaatatcttt
+ ctgagtcaaatgcatatcagaatcaacaaaggagtcctcaatcaaaatgataacttcctt
+ acggctagattgaagacaattttgaatacttgacacaattgtggcgccctttttctccaa
+ tatactcttaaagaacttattggaagatgtgttaggattaggtaaaaaggcgaaaaattt
+ accctttagagacatgtgcgatttgtctaacaggtcgcatgtttatcctttacttattac
+ cactgtgacatataagttctttgcaagcccttaaaattattagccccatgtatttgaaca
+ tctcaaagtaattttagaccgagaccgattacggagatagaagaatagtgcaagcgaaat
+ actattttatctttacttaaaaaagaaaaaacaggaggaaaattagcggtattccattat
+ tattagcactgcaacgtcctggttatataaatttgaatatacaggatttcatccgtttct
+ acatttttgttacctgcatgatcaaatcttttaaagagtcccaggatttcacaaggtagg
+ cacctttaccgccttccttttcacaaaactgtaaccacgctggaccattatcagcttcaa
+ aactcgagattttatcctttgggataccaataatcttttcggtaatttttatgaatttgg
+ aaggattttcttgcgggtttattagcagtacaccattagtagatggatgtaatatggaca
+ acaagtcagtctcactatcccctatataccagtagctgcaagaatttccttctttattgc
+ aacctgagtcgatcttatctaatatttcacccagtatcttgaccttatcagagcctgtca
+ ataaccgacaatcaaattcaccattgtagctttgagaacacttgtcactaacttttttta
+ gatcgttacaaaaaatatgactatttttaagtcttctgtcaccaatgacttcatgaataa
+ actcttttgaccaatttatagataaaacgtaaaaatcactttcaaaatttttgacaactg
+ aagagcaaaacgttttgaaaccatctcttagtaaacaatcatcgtgattctgatctctgg
+ cgaacgttttcatttgatccaaagaaatcgacttaaaaatctgttgttttgtaatttcat
+ ttacactatttaactcaacaaccctgttatgattttgatattttaattcgtccgcaaaca
+ acttattgaagttagactgcgaaattatcgtaggtacgcctgaagataacaacggcagtg
+ atctcgttccattgtatttgtatttatggtatccgtccatatatgtcttggtaaaatgac
+ cccactcaggtttcaaccgcgggttcagtagatacggtaatttagcaatggtgcagattg
+ tgtcaactcttgtgattgtttcatcaaaatctgaaataataatggttttcatatcctcca
+ ctgttcgcttctggttactgcaattgttaaatgcattggctcgattgtcaacattgcttt
+ gtgcgaagaaaggtcgcttttaaagaaaacgataagaactaacctctaactacataataa
+ taagaaaaaagtatacagataaacaagctcagtagtaatacataaaataaaaagttttgc
+ aaatcgattgtccacacacttctatatcaattccaactaagaaaacgaaggaaccgactc
+ aaaccggtgtaaaagctaggctttcaatctctgctcctccaaacaaaactagtgcttgat
+ atcacatccaattaaattacctaaataattttagagtgacaccatcgtacaaagagggca
+ ttaagcttattttttttaattcttatgaaaaaaaaatgtggaagagagcagaaaattttc
+ ataatgaatctcttctattcttatgtaccgttccgccaaatttttcatgaaaaaatttgg
+ gaaaaagaaaaaaaaaaaaaaaactagtatataaagtaaaaaaagatgcgcaagcccgga
+ atcgaaccgggggcccaacgatggcaacgttggattttaccactaaaccacttgcgctag
+ taatattgatgcattattcaagaaaagcaaaaaacatattagccgataatttctatttta
+ taagtaaaaacattggtgagatagtggtaaagagttagataattattgggattccattgt
+ tgagaaaagcaataatattatgtatacggaatatactaggagttctcttcaaggatagag
+ gaatcctcaaaatggaatccatatttctacatactaatattacgattatcccttcttcta
+ ttttatatgtttcattatcctattacattatcaatccttgcacttcagcttcctctaact
+ tcgatgacagcttctcataacttatgtcatcatcttaacaccgtatatgataatatattg
+ ataatataactattagttgatagacgatagtggatttctattccaacaagaacatccttg
+ aaaggtcgactattaacataagaataagagagagtgtgtggccgatgctgttgctatagc
+ tcggtcgtagtttaacaccacggcattattcagctttaaaacaattgaattataaacctg
+ cttttcttactacgaaaagtgtttccctttaataaatctattgtacatactttacagttg
+ atagtaccttcaccataaaaatatgcaaacgcttttgttattgagaacaggaataccaac
+ cgttctcatggaaaatagcaaagtgtaaaagaatttattatcatcgagtgataaagaata
+ tcgccagatctttacggatatatatcgctaaaaatatttgtcgccaataaaagatatgtc
+ taacgctccctttgcaggtagttttttactggcatcgcctttttactgcaaattattgaa
+ aattttcgagcgagaaaaaggaacgaaaagagatgagctaaattcaaatatcaaagtact
+ gtggtaagtgctttctttatcgacaaattacaagcagaagaaacattcactttacaatta
+ aatgtccgaaaatcacgttcctgcctggaaaagaattgctttgaagagacagactataag
+ cagtggtgacgaaagtaaagaaaaaggccaatccaatctaatagatgatgatccactaaa
+ tatcactacgcatttatccactggtaatttgacaaagaaagagaaaaaaagaatcattaa
+ tggtgagagcaaatcctctaccaaaaaaggaaagcgtgtgtcgaaaccagggacgaagaa
+ gaaggaaaaattgtcaaaggatgaaaagaattcgaaaaaaaataaaattcttaaagatca
+ attacgttacctgatagaatttttcaggacaaagtctgagagcaagtttcccaccggaat
+ cctagaattggaaagtgtaaaagaaaattacggcgactccctgatcaaagacgagccatc
+ agagtctggtgttgttgaagtttggaaattttccaagcaaaagcaaaattggcttattaa
+ gcattttttcaatttggacgaaatcccctcagtatataatgatcttttgcttttatattt
+ cagggacttacaaggtaaatcaaaagaagaactaatatcaaaatgcaaaggaaaactgaa
+ gcaatggaacgactatgtggaagatcaagaaacaagaataaaagcgttaattgcagagga
+ taaggctagcgaaccaattaatggtgaagaaaaagaagaaggtgaaaaagacggtaatgc
+ cgaacaagggaagcaaaaagaggtacaagacgagcaagaggaagtacaaatgccgaataa
+ agagcttgttcaaagaagtttgaaattactagaaatttggaagaatgatgactcggagca
+ gatagaacttaagaatttcttcgttgatgtttaacatagagttcaacaacgttgttcagt
+ tgtagtatatattttgctgtgaaatacgctcaaaaagtagcgggataaatcgtttcagga
+ agtataatttcttacaaaactataacaagatcgtctaagtaacatacataaagaaatcac
+ ataaatttttaaaattcgaggtcaaaaaaatcaggtggatacataacttaccttaagtta
+ tcgagattttcagtacacacaatgtgattacactgatttataaaaccaccttacggcctc
+ ttataagtaataacgtggctcatcaaaaacatggtagaagtgacccctctcaccttcgcc
+ taaaaacttatctggaaatttcatgtcctcatattccggttcactcatatgaccgtattc
+ atacatagaatccgagtgctctactaacgcttgctcactcaaaacttggaacttaccaac
+ ttgtagctccgtatcggttatcgttcctcttgcaactctagcatatcctgttcttctcaa
+ gcccttatacaagatatactcacaccatctatcatcgtcagagggatcaatgtcgtgcat
+ tccggatgtttcactaacgtacgtattgtagttgccttcacctggatcaacaactaagta
+ acttaataaaatagctgggcgagtcgtattgcccatgagcttagccatgtagttactttg
+ cagtcttctatcctcttcatcctcttcttgtccactatggaatacaaagacgtcaacgag
+ agtgtcattgtacgtttgaagtgtggcatgaatggctggcgcaagttccccaactggaga
+ gggcaataaatgatgcgtagaatttacgatagggaatttagaaagaagaacacagcccca
+ ggtatgtttatttggacctggtccgaaatctgcatacatattcaaatcatgagctagttt
+ gctagttagatccctgttccccatggtaattctttgtgtatctgtttctagtagacctac
+ cacatctagttccatatctttaataaggttgatcattctgtcttcagatgcccacatatc
+ attatcgagaccaaagtggatagtccaaataccagctgtaatcaactgagaatctggatg
+ ataaggctgaggaattcctgtaggtctaatatcatacacgaacctagctgttagtgatag
+ tagggccacggcaaagaagaaaacataaatgaagaattttttgcccatagtcaccagttg
+ aacgttaaggtttttgcatgttaaagcaccaataatgataaatgtggaagaaaaggcaag
+ aaccgtctcaatcttctccctcagtacccagcccattggaacaaatgcgtatgcaacgac
+ ccaaacatgggcaaggacaaagataatataaacggagaatgataggacataaacccatat
+ gttttgaacttggcctaatgcagaaaaatacgaaggaactaaccaaagcatagcgatagc
+ gtacaataaaccaccaaaaatatatttaggccattgtgtaatgcttctagctgaaagtac
+ agcagtggatatgagaagcaacaaacatggaactaatggcttgcccataaacttcacaga
+ actcaaagaagcaaataacatgaccgtacaagttaaggcgccccaaggccatggcaaggg
+ accgtgtgattcattgtaaccttcccatgcccaataaatagtagtagaagaatccgtcaa
+ taactgatgaattccgaataacaacgagccaaaaccaacagccaaaaataatttgccgat
+ caacgaagcagattttgcttgagcattccttttcccttcgaaattaattgaattaacata
+ aggcgacagcatgccgaataaaacagtaagcacgagagctgtcttattatacccaccatt
+ acgttcatctaaaatgacccaagttgggttgttggtatagaaccccattttcaagatcac
+ agatgcaacaagacctagcagccaagttagagcaaaagagaaactagtctccgcatttgt
+ gatatatcttagattttgtacaaacgttgcaacggtaatggaagttcccacagctacaga
+ aatcaaccttaattctggcatctgaacaatataggcaccaatggcaataatacctcctaa
+ aagtacaccatattgcatgaaggcttcagaaacgaacggcagatataagaaaatgggtcc
+ caaataacccaatattgcagcttcataacctgagattcccatataccatagcgggaaatg
+ ccaaatactacataaaagggacgtaacgttcgaccagaaaagaaaaccgttaaaaatatt
+ tgttagtaagtaaaaaaaagagtcaaacctgaaatatgaaccagttgcgcttctagcaac
+ gggtttttccttttcaaccttcttttcaccacttttctgcaattctgaacttttttcttc
+ tccataatttatgggattagagtctctaatttggaaaaaactggtattaccgggtttctc
+ attaaaagcgagaacaatatctatctttttgaaatcagcgtaagcaaatgcatcaaatgc
+ aatatctaaaagaatcagagaccattcgaaataagcatatatagaatatgccccagctct
+ ctgttggacggagtgctgaatgtaccagtaaatcatcgggaacaatgttccaaaaaaaat
+ tgtagcagtcaaccctttgttcttcgaagttaaaggactagaatagcgggtaatcattat
+ atcccatggtaacgttaaaacaatgtatgtgatcataaatatatcatgaatatcgtggtc
+ atctgtacttgtaatgtatacccaaccaccgcaagagactgtccggagtacaccgacaag
+ gaagcataccttagattggttcaagtagtagtgacctagtagcagtaaaaatcttggaaa
+ agcagttagagctattaagatttggaaaatagaacgttccggatagcggtccccgatagt
+ ggctgatacactaggaaaccactcatctggatacgtataatgtgcattggttacaatttt
+ atgaaaatgtaatgaataaccagtgaccaaagctgcaaagaaggcggagaatgcgcaaat
+ agtatgagctatagggatgatcttcccattgatgatcagcataactggcggtggatacct
+ tctgtattcctggttacttattcctcgagaaatcggacattaaaagggtgcaaatactgt
+ ggagaaattataagttaccaagtcaaaaagcgaatgagaatctattgttgcgattttttt
+ ggttactacttcttacccgaactaactttacataataaaaaagcatgaagcgcatattca
+ gggactcaaaaatcccaaactggcacttctgtatataccgatgagcatattttgattgta
+ aaaaacaacattaaaaagaactgcaaacctgaaccaagaatacttaatttcaaaaccgag
+ aaatgaggtacaacttgtagaagttcacttaaacttattgagcatacaatctcaacaaaa
+ ctaactgcaaagtttgttgctatagtcacatagtaatactagcaggaagccattaatcaa
+ aaagaaactccatataaaacgtaataaatattatgaataatagtacaatataagaacaaa
+ gaagaagaaaatacattataactgaagtgccaatattagaaaaaaaaaaggtggaatatc
+ ctagtggaaaaatatcatctatgcggtgaaagaattaaatagatactatggattgtttcg
+ aatctatccttttatctgccgcttgcctttttttttcattctccttcttcagtctttttc
+ cataggcgagtccacagggactacaaagttccctgttttggtcgggtccactacgccatt
+ gaattgtccaagtgtctttacatttagagcattcctttatctcaccatttgggaggattg
+ tttgtctcttttctcttttaggacgactagtagtctttttgctgtatttcttcgtaatac
+ tggcagaccgactagttccttgtgcacaactatggtagccttcaagattgtttgcttctt
+ ggctagtcgtacgtctctcggaattgcatgcactaataacacgattctttgagggtgtaa
+ tttcgttcaaagagctggttttgttcttaacaatatttcttgtaagatctcttagttgtt
+ cggcattctcaatttttttttcaacatcaacagcccttatttgagaaaaggatacttcaa
+ taaatgagcaacttgcataaactcttccgcgttccttatatgccttctttttatcgtttg
+ acttccagtcatcaatttctctatttaataataacgttttctccatatctgactttttaa
+ ctactctagttaagaaggaacttccagagttgtcataattattatatcgcatactagcaa
+ tagtaaatgtctggttatttacctagcaacctcttaacaaagcgataacaatgaaacagc
+ aatataagtctatttaattgacaacatcattgataaaagtttgaacaagcgtctaaaagt
+ cgaagaagttcatttcaaagttttgtgtatcgagcttgatgatacttatttagggatccg
+ ctctagtttactgtgatacaactgaaaaacattacaaagctatcgcgcctgcattctcat
+ attagtggcgtatgtttgtgtcacattcgatacttctacataatacaatgactgaaccct
+ taccgctggcacaggaataccgcgccgcgaaagcagggtctaggagaaacaatagaattc
+ ttggcgatgagagaggggaaggctttaataaaaaaaaatagcgcttatccgattctgttg
+ gaataagtggctatcaatatttacatgttgatatggtagtaaggatataaatgcttatat
+ tgaatatttggaataagcatcaactatcacttattgaaaaggactcgaacaaaaaaatat
+ gaaatgtcgaaagccacatcactgagaagttgttggattgaagctctaaaatgtcatctt
+ ttagtagtattcgtgttactagtatattatcacatgcggtgtaaaaagatgacgtaaaga
+ ttgagaaacagtcatcaactgttatggaagctgaaatgcaaatattgataatgtaatagg
+ ataatgaaacgtataaaatgaaaaagaataataaaatattattatgtagaaataccgatt
+ ccattttgagaattcctatatccttgagaagaacttctagtatattctgtatacataata
+ ttactgcctttatcaacaatggaatgccaaaaattatcaaaatatcacccaatattcaat
+ tttattgttgtgaagacatcaagccctagatatatatcatagtaggataaaatagtagaa
+ taaaactattcatatatctacgtgaaagcatcagagattgtaatgttgagcgccgtggcg
+ cagtggaagcgcgcagggctcataaccctgatgtcctcggatcgaaaccgagcggcgcta
+ ttaaattttttgaactgtccattaataataatggaactgacgtcgtcaattaccggtagc
+ ttttttgcccacatttttaatcatcttttaagtatttgagtatttgttgttcactagagt
+ tttactaattaaggttctgagaaaataaagatatgcatttagaaaagtcggttctacacc
+ gcaaatttctttattacagaaacattttatacaaagaggatacatagttattggaaagaa
+ aatcaattatctttttgttgggtagtatgaatattacttgtacattatcatatacaatat
+ aaaaagatgtcatagagttgtgaaatatttattatatttaataaaagctgaagcgcaatg
+ attgaaaacgttttaggatgattaatgataaagtacaaaatgaggagaagaaataataat
+ aacataaagtggaaataccggttatcttttaaggatattataacctggaggaaaacttcc
+ tgggttctctgtatattcaacacttttccgttatcaaaaatagaatcccaacagtatccc
+ cattttcgtcaatatctccataacgacaaggtcttctagataccatataatgacaataac
+ caagcccaccatttaggtggattttttgtgtggttttgaaattaaaacatttttagagga
+ aaattgtgatcacagaatttcacattcccagttactttaataatattcatagttagagat
+ gaagaggtactattaatatgaaggttaagtatgaatcttctcatgcttctataagcgagg
+ gatataaaacatcgccaactgtttcatcttatttattatcaatattatgtgtgtttggag
+ cttactatacagttaatgtgataattaatataactcttgtataactaccatatcaatgtt
+ caattaatagatcttaaatcggtttcattactacctgaactgcatgtatgaaatttcctt
+ ccacattaggcacattcacatgtaatacgagatgtacatctatatgtgatattagcattt
+ acacagcaatcctatatcactattaatggccacttgttccaaaaagtatatattatgaag
+ cgtattattattttatactcagatagctggtgcgatacgttcgcagattttgagcgcaat
+ gcaatgagtatgacggaaaggttcactcataatttcaataactcaagttgtttgaattta
+ aaatacattccccgttagataagctataaacattgttacttgcgcttgcggcacatgtaa
+ tgtagtttctttctttttccaagaaaaaagaaagccctgtagggggctcgaacccctaac
+ cttatgattaagagtcatacgcgctaccgattgcgccaacaaggctctttaattatgaaa
+ aattacgtttaataagtgacttaaaaatataaatactgttactaaaatgtgaactaaaac
+ ctcaatattattttctgatcacttgatgaatagctggtaattgttggattccattgttgg
+ taaaggctataattttacgtatatagaattgttggaataaaaatccactatcgtctatca
+ actatagttatattatcaatatattatcatatacggtgttaagatgatgacataagttat
+ gagaagctgtcatcgaagttagaggaagctgaagtgcaaggattgataatgtaataggat
+ aatgaaacatataaaacggaatgaggaataatcgtaatattagtatgtagaaatatagat
+ tccattttgaggattcctatatcctcgaggagaacttctagtatattctgtatacataat
+ attactgcctttatcaataatggaatcccaacaactatctgaaaattcacgtgattctca
+ tgtgagtacatcagaaaacaaccacctaacaaccgtgtttgaatgttgatctgaaaacaa
+ aaattaatctctcgtacttattatggatattgtattttctcaaaattccaaatgattttc
+ ttgttagatggtattcacggtcatcggaatctcagttcaatatcaccaatttgcttataa
+ ggtttatgcaagataacatgtttggtcttgttcccaagtataacttaagaaaaaaaatta
+ caatccaagaaaggtaatacagaattcatagcaataatggggtccattttcaatagataa
+ tgtacctggaagaaaagggctgactaggaaacaaaaagaaaaacaaaactatgatactcc
+ atcaaacatagggaaatttactcgaatatgaaaagtacaaatctgtcaaaagcacaccgt
+ atattgtcttggttccgcttttattcagaagaattactgaaatgaagaggaatgacgtaa
+ taaacaaacaacgattctgatatcagaaagacatcttgaatattcctatgtagtaattaa
+ ctatcccagtacttacaaacataacttgacgatgcacgctacgatttttgtgtgtttaaa
+ cttttgttggttggaactcaagaagatcttctctgagataattgtaattaaatgtacttt
+ aggagtgttcatgtataatttgcctctgcaaaagcttaataatgtattttaaaacttgca
+ tgttgctctgtgaatctgaatgtatgataatgctacgattactctagggaacttcctttt
+ tacatttaaagtttccgagatcctcagctgacaaataaatgtcatatcaagacaataaac
+ ggtagtgctaaagtaattgcatattaaacaggcgcgactaaagctttcatcccttctatt
+ attgcataacctggctatacatgaaaacacttctcacatttgatttaagaagtttactgt
+ tctaagaaaataaagaacgtgataatgatgaatgaagaggattctacagaaacgaaaagc
+ ctagtcataactaatggcatgtttatcatagacgacatcgagcgtagtaaatataatatt
+ cactataagaatgtcccaggaggcggagggacttttgccattttgggtgcatgcataata
+ tcttccggcaatgtcacatccaaaggtttgaagtggatagtggacagaggctctgacttt
+ ccaaaggaagttataagggaaatagactcatggggtactgatgtgaggtttcgagatgac
+ tttagcagattaactaccaaagggttgaattattacgagggaagtgatgatttgagaaag
+ ttcaagtttttgacgccgaagaagcagattaacgtcgatgactggatttccacatttggg
+ cagaagataattgatgaaatgcatgcgtttcatttgctatgttctgggtctagatgctta
+ gacataataaacgatctgctacgggtgaaaagttcaaagggcacaaaaccaatcgtgatt
+ tgggagccattcccagatctttgcgactttgatcatcaaaatgacattaaaagtgtaatg
+ cagaggaacgatgttacggtaatattatctccaaatgccgaagaatcaagtcgcttattt
+ ggtttaagtagcaaggaaccgactagtttggaagaatgtctagcattagcgcatcgtttc
+ gatgatttcatggatgaaaacaatatgtgtattctacgatgcggtgccctcggaagcata
+ tcggtaagtgagaagtttaagaacggacgaacctatgaccatttccccgcctaccatttc
+ aaaactcagtctaaagtactagatcctactggcgggggaaactcgttccttggcggcttt
+ gcagtttcttatgccctaacgaaaagcttagatattgctagtatatgtgggaacatcgct
+ gcaggcgcaataattgaacaattcggaataccgaggtacgatccaattgctaaaacctgg
+ aacggaatcacattcttggatagactgaaattttacctttcacagtccggtcttcaatat
+ aatataaacgatctttacaaaagtctaacacgatgatcatatcagcttgttcattgctct
+ tatttttctgcaaaaatatacatatatatatataagtaagtgttatatactacttgagat
+ taggcgttgtaacaagattcaaaaaattcgaactcgaactgtgaaaccttcacaaatatg
+ ctttccacttcttcgacgggaaacttgtcaacttcagcctttagaaattcacaccaagtt
+ tgaaaatgctcaccatcatgtaaatcaatccatttttgatatttccaatgcaaccctggg
+ gctctaggagtatcacgagcccatctccaatagagttcctcagcaacccataagctggtt
+ actaaggatggccatgtaattgaggcgtcctttctcagctcatctaagaaattaatatac
+ gcatcaacgccggggatcgctttattgtcaaaattatctctttcttccttggtgagggat
+ ggtgccaataattctaagcagtcatgaaagtatgagttttcatcattagaaaaaaatcca
+ atctttttggctaaggttattaaagcgtgagtagttggtgctaaagaagtcgtcttacat
+ atcaaccttaagctagtttcaaaaaattgcagatcttgtgataaatagatgtacaaacta
+ cggtccttcaatgttccggcacatagttccttagcgaatttatgttctgtggttttccta
+ acaagggcgtcgtacttttgtattaacttatcagtggtacagctcattgttcttccttgc
+ agatattgttgatattgttattatttgcgctaaattttagttaatacagaaaaaaatatc
+ taagttcagtccagttcatattcgtttatatatctttcgaagccagaccttttttgccaa
+ gcaattgttactatggaaactattagaaccatatgcgctaaacaatattagccatcaatt
+ caactttattcgcagtaagctcctgcagaacaggcttatcaatcattatggtcttaacgg
+ aacgcctagggacgctgactaggcccatcattctactactggagcccatacgttcctcaa
+ catgatctaatagcaaattcatttgggcatctacagcaaccaagcttccaactaaaatac
+ gatcttctgtaagggaaactattaaagtatttccaataaaatctgacagtttcaagatgt
+ ccatgacgctttggtgccccttactaagtttgttgaagactatcttcaccctatcaatga
+ tatcatcgcatttaaaaaatagtcagaactcatgcacttcgcgagatattttttgaaaaa
+ aaaaaaattaatccgccacataagatgcacaaaatgagcctataacattgaattttatga
+ tagaacgaaaggattgtgttatttagaacagagtatcagtatatcgtatgtcacagaaca
+ acacaatcagctcaatgaatcctgaaagagcttacaacaatgtaacgctgaaaaatttaa
+ cagcatttcagttattatctcaaagagaaaacatatgcgagttattgaacttggtagaaa
+ gtacggaaagacacaatagtattatcaatcctgaaaggcaaaggatgagtttggaagaaa
+ tgaaaaaaatgctcgatgctttgaaaaatgaaaggaaaaagtaggaaaacgcaacattca
+ caattgtaagctacaaaatattgaatgtgaatcctgcttcattacaaatgaagaatttca
+ aagcgtatctagttcaccaaaagtcacaatatcttattattatttggaagcttttatatg
+ catctaatggtcataaatcccatctcactttataaaacaagttaccgcgaatgtatccag
+ tgatacagaatagagtgtaacaaatgagaataaaaataccatattatcgtttaattttgg
+ tagggtacgctaaaataaagtggtgtttgcgtctccatataactaaaaagaattgaaatg
+ gcctactatcgattagactattcaggcatggaagcagtaagaacatttcttgattttgga
+ aaaatgagattagattattaattattattgagtttgtgtatataatttattttgcatatt
+ tcataaagaaattaaaattagattattaaatattaagtttcattagtgacattaaaagaa
+ gaaaactgatgttttgaaatgtgttaagcaaagaatgattaagacaatctcaagctgctc
+ taagcagtatcttcgacagcttgctctgcttcaggttcgggttcgtggattgcagtctct
+ ccagatgctcttgggctctctggaacagcttcttttgtttcaggagccgcatcttccaca
+ tcgtccgtagcatggtgatgagagaatcctcctgtcaaagatagccttggcaatttgcca
+ tcacggctgactgcaattagcaagtaacatggataaatggcaaatacacataaatccaaa
+ acaccataaaagatagcctcaccgtctggttgaatacggttaccaccatcacttagaccc
+ caacagataaagtacaaccaaacgattaccatgcaggtacacagcataagtgcattgaac
+ cctctagttttcaaattgaagaattgacgttggcatatcacaccttgggtaaccaacata
+ gcgacagcaccaatggtgtaataaccccacttgtaggtggacttgatcaatgaaccaact
+ aatagcgaaacaacccagaataaggtacccacgatttgaattaacagcgaatgtaccata
+ tccaaggcggaaatgtcgtcattctcaccagtagtactagcggctaactcgattaaaaat
+ aaaaggcatggccaggacaagaaccaggcaatatatttcgagtaaaagatttgtctaata
+ ccgggaacttcacctgtgatagacttgctaaccttgacgtggttaaattcagcttgaaca
+ ccagtccaacctaaatcagaagcataagtgaagaaagcaaaaaattcaaagaaagtgatc
+ aaaaatgcaggagctaaggcatatctagtcaatctggagcccttgttctccgcaatgaag
+ aacatcacaacatagcatagcaatataaagccaaagactgcaaacactgcccataaccaa
+ tccgaacctctcttggtaatgtgcatatccaggccatgtggtggattaagccctaaagcc
+ tcgttacgatttaaaaagcttgatagcgtatcgttcatttgaaatttgttgtttttagta
+ atcaaaagatattaaagtctcaaacttgttgttttgtttttgtaattataaataataata
+ attgagttaatatttgttgttgtgttgtattgcagaatgtaaaaacaagaaacaagatta
+ gaaagtgaataggtgttacgctaaataattgaagactctcacagacttatatatgatttt
+ tttagtgtgatacaacttgtaatgaggtaagggattgatattatactttctaactgatga
+ tgaatatgaaaactttggccaattagggagaggaaagttttcgatgagatgctagaaaaa
+ aacttgtccttgccatagcagaaagggatgatgaaataacgaagatgagctaatttggta
+ ttcgaataaacatgcaggatgtggcggaagggatcggaataagagtcgggatcgggatcg
+ ggatatggctttgcttactttagccataccaaatattcttacaaagcacgctttcgatgc
+ gtttttgccttctgagagaatcctcctacggctcgacccttgaaaaattctagaggatct
+ ggaaaatggccaacaaaatcgcgcagggagacgcaaggaaagtaagtcagatggatccga
+ ctgtatgtaacaagggttgctgcagcggttttccatccctttcggcaccctgcacaagta
+ gttattcttacttatgcaaaacggatatgagctacatagcgttcattttttaggttttca
+ gaagctattacaacgcagtaaaaaggatacacaactttcttacggggaagaagaatacgg
+ ctcaatgcacccacttcaacccccggtatcacgcggcactaaagttgtttgattgtaata
+ gtaatactgctgttgaggttttcttcttcagggctgctcacaacgtgatatatttgaatt
+ gacaggacgtattcagcgcagttgaggaaaattaacatatgacatgcgcaggaaataagt
+ gctcaagttttctacttttgaagaaaggactttggtttacttgcctactatctccatgtc
+ aagtactcggttgtatgatgttaaggagcgcagggaacactaatcgataaaaaaccctta
+ ttggctattactaaatgtaacggccgtgctctagtgaaatttgtatagaaaaatgggttc
+ tctcgctcaaaatgggcgccattacttgtctggtcaaattgagttcctggatataattac
+ ataaacggattgaaatttacataaggagtataataattccggaaactatcttgagtgcaa
+ tgccatcacgcggccatgccttgacataacatgctctcatcaaaaaggtgcaaattttcg
+ ccagttatggttgatatctaaaatctactggagttcatttttttttttattagatatcct
+ ttttgcaagccttctgaaaaaaaatttctcatgtggtacagcgttggttacttaagctgc
+ ttaaactagatcgatgagccattcttaaatcagaagactgtgactgcgaattgtaggcat
+ cttcactaccctgtccatgaagctcattttcgttatcatcaattggtgctatcttataac
+ tttgataaagagctaccatacaaaaaagacttaacgaccaccaactgacccaggcgacgt
+ cattttgctgggaccaagacataatatagccccatactaagggacctatgaaacgagcag
+ aggcagaaatactaatggtggcgccattgatgactgatctacaactcaagggactcgaat
+ tatgaattaataacataatttgtggtgacgttaaagcgccacaaaatgtttttatccctg
+ tgattatgtacaagtagacgacagtataccagctaggaatccgttcattctgtagaaaaa
+ caacgtaaggaaccataacgtacataatagggtacagcttgactaaagttctgaaaattg
+ ttaaacaatcgaaatttcgatcaactatgggaaaaatgaaaataaccacaaaacaaccaa
+ agatacctgttgtcgacaaaagagtaccggtttgttctggttcataacctataccgccag
+ atattttccacggaaattttgaagccagcttctttggattttctggatctacggctaaat
+ cataagctaaaaaaacaggcaaaaattcgttgtatacaatcaaatgtaaagccataataa
+ aattcaccgatatagggtaaaatacttttgtatgaaaaacgtgatgaaatatactacttt
+ ctttacaaccgtctggagctgtatctatattggcatgcacagcgtctgttggttcatgca
+ gagaatatgtcctaatcaggcctacagactgtcttcttgttaaaataggatcaatcgatt
+ gtatgctttcagtatcatcgtcattggtagtattcactaggggactattttcatcctctt
+ cactatcttgtcctcgattgctgttcacattctcattacggtggtgaatgttttccgaat
+ catcctgaatgcgcttttgccagggtcttcttttcggctgtataccaaatatattcttct
+ taataaaatcaccgacctctaaaccgtaatctcttctatttttaaaagcaggatgtgttt
+ cttccaaaaacaatgttgcattagttaaaccaaacatcaaaaacatgcacactaccacgt
+ ttggcaaggcgtacggatatgacctaattattctttttgcaaaatgtggaaaccatagtg
+ gcacttcattcattgttccatctctaaatacaagaaacccaccgatcataggcccaacaa
+ cggcaccaaattgaaataataaaggcatagtactgaaagctaaagcctgatgttttcttt
+ cagttgctatttcaccaataatggttctaataacgccgacgttaccatttagcaatccca
+ ttaaacttcttgccaccaaagcctgatagaaattgtgtgaaaatcccagtattaacaatg
+ atacagatgttcctataagcccgcaagtcaatgttatttttctgccatgtttttcagaga
+ atctaccccagtggtacgcagatatgacttggcataacgcaaatgatgaagataaataac
+ ctgaatatttggacacttgagcatcattgggagcaatattaaaatctctaaccatgaaat
+ aaacataaggaaatagtgacgaaaacgcaattggttcgctgaacctaactaaggacacaa
+ caacaagttggacccaggggaaaccatccatttgttctttgaaagtaagcttttgacgcg
+ ccatatcttttatattcgtgtgttcatatgcttaccacttcctagttccttggctagtat
+ tcctttaggtgatctaaattcaagctctgtggataaggaccattatagtattttttacta
+ atgatagatactttttgttatattttgttttggcattttcgtatccgagatttccaatga
+ gcggaatgtgtcattctttgtcaattaacatagatcaggccgaatgccaaatggagttta
+ gtaccttcatattgaagaaaaatgctacattattatatttacaaggatggtcttaaataa
+ agcttacttttaaggggggaagaatggtgaacagtgagtaaatacatagaaggagataag
+ aaacaaatggaatatcaaaaatcgatggattctgcagatgttctataaaagggtatggca
+ tcctttatattatggttgccatataagtatgagataaatctctcaaaccctagaccaaag
+ cctccgtgtggtgcacttccttcttttctcagagaaacataccagtccaattctccagat
+ ctattcatcccgcgtgctttcatttctctacataacttgtcatagtcatcttcccttaaa
+ ctcccaccaattatttcacccattccaggaaccagtagatcaaagcatccaacagtatcg
+ tcaggagtggaattttgtttcatgtagaatggtttacaaagacgtggatagtcggtaacg
+ aaaactggggacttaaaatactctccggctaaaaatttttcatgttcagtttgcaaaggc
+ tgtccccatttaggttcatacttaaagtgtgaaacttcattgtggcgtttcttgagaatt
+ tctattgcattggtatacgttatattgtgccatttttcatttattaaatcttcccatcgt
+ gttttaatttgttgtgtctcttgatttattgatagctctgacgatgcattattttcttgt
+ gaagagataaattgcttcggcaagagttcttgttggttatctatacaagctttaattatg
+ tgttttattgtagtctcaacaaacgatgttagctcgttgacgctattaacaaagcacatt
+ tccacttcaagcatccaaaactccgaaaggtgtcttggagtatcactcttttcggctcta
+ aagcaaggagataacgtccaacaccttgacagtgataacgctaaaatttccaagtgcaat
+ tgagtggacacagtcaaataagtcggcttcccaaagtacgaggatgcagttggcgacgta
+ ttggtggagacttgaaacaactcgccggcaccttcacaatcgtttgaagttaatattggt
+ ggtgaaactttggtgaagtggtttttctggaaatatagcatgaactggaattctacaaat
+ gatcttaaccgtaaaattgcacttaagtaagcggttctgtattttagtgtaggtaaggac
+ cttaaataacgtaaggtttgatatttcttttgtaatggatagttttctgaaacgggaccc
+ acaagtttaattgacttgacaggatttttgatttgcaattcaaaaggttgttttctatta
+ ggggtgctttgccaggtagcattagatatagataaagtttgaccagttttcagaattttt
+ aggaactgtacttcatcagtatttgtgagtggaataactattcttaatgggttcacagaa
+ gtcccatcttgtaaatccaaaaacgctatacgttttaatagtcttatggatttgatccat
+ ccattaattgaaatgggatcatggctagtatggtgtacctgctcgtacaatgatttaact
+ gttagtgaagagtaaaatctaccacctttaaggaaggtgaaagcatgaaacattagctgt
+ ttgtattcgcttgttctctttctttctctcttagtaagcgataagtcgaactctacttaa
+ attgtggtacgttttctttttttcctataatataaaaaagataaaaaatcaggacgagaa
+ aaagtagatcataactataagttgtgcatactcatactccacactccacggcctaatctc
+ tgtaaggattgtattttattttttatttatttattatttagattttctaaaaataaaaaa
+ aaatcatacactattttgattacactactatgtaatttatctttgaaaacaaccatgaaa
+ aactcaattttacacacacatgagttatcatgaggagtcatcagccagacttaacaatca
+ cacacattgttaagagtggaaaaaaataatagtgataaaaggaaaaaagctaacaatttg
+ gattttacgggatctgtctaaatacgaaacacacactcgttttagttcaaatatcataac
+ cataaaaaaaaaagcacaattcaacttgagcacgcacactaagtagtaacaacgttcatt
+ tacagtaaagcgaacggacgaaacaaataaaagaaaggcatagtgagggataaaaggtag
+ aaagaattcaaaaaagggtatcgcacacacacacatagtagtctaattcatgttctagaa
+ tggatataaacaaagatgaaggggacaggaaataaaatggctcgctccgccctttttaga
+ atctttgcaatcctctttgtctagcttgccattttctgtagataatggtagcaataatgg
+ caatacaagccaaaccgaccaaaatgaaaactaaaataacaccacctggtaaagtcattt
+ atgatagattaaaagtgctgaaatgggaagaggagaaacaaacaaaggatatgtaggaac
+ aaaagtttgcttgagtgactagtttgaagttgagaaaaaaaaagagatgaaaaagttaac
+ ttatttgctcttccttcaaagtgtgtagcgtcttcctttaaatactattgaaagggaaaa
+ agaataaatggaaaaagggtagtaatgaaaaactctaaaggaagataagacgcaataaga
+ gaaaaaaagagaagttaagattacaagagaaacaataaattgctttcaaagctttatata
+ ctggcaagcacggaaggctaaatttgaaaaaaaaaaatgggaaattattatcggaaaaaa
+ ttgaagctatcttctattacgcagtgacaagtctatattgacatctttctgctttctgct
+ ttctgtttgttgttttttttctcatagttgttttgattttactctgttggaaatgcgaga
+ attgttttgattttacccgattgcgacctattcccggctttcccaaaaaaggaattcctg
+ tccaatgcaacgacagccatagaaacagccgcggttgggtgcgtgggtgcggtaatttaa
+ cgatcatatggtatgggtgcacgtgggatgctaggggaagtgcgcgcggataccgaatcg
+ gccatcgaaccggccacccggattgaggttggtatgttgcctctttgcgcaattttattg
+ aatgcaaagtttctccttctagtaaaagacaacagctgaaaaatgctcgatgtccaaatg
+ acgacataaacctcctcgtttaggaaagagttaacacgcctagtttcagcttgtttcttt
+ ttattgtgtaattagttttcttacttagtacatcttattgaccccactgtcatcggatgt
+ gagagagtggttcacttctgaattaagtttagagatatactaaatctaaataaaagtctc
+ tttctgcgtgcatatgttttcttgcgcgggactcacaaggatgcgtcttttatttcataa
+ cgcagttgtcccaccttaagcttaggtatcagaatcgactaaatcgtcgatgctgtcaat
+ tatgtcgtcaatgagatcttttgcgtcgttgattatgtcttgaagccaatcccccactgt
+ gctagcagtagcgctagcttgtatgctgctgctgctgcttcttcttgatgatggtgatgt
+ ttcgcccggtagagtctcagttgctgatggtattgaagttgtcatttctagttttatcgt
+ taatgacgaagatggcatagcggctatttcatcagtgtttccatcgtcgttgtcatcatt
+ atcattgtcaacattgtcttgttctgtctcttcggggtacccgccccaaattatattata
+ agtgctgaattcaccaaacttactgaccaaataatcaaagtcatcttcaatctctacttc
+ ttcgctgctctggggttcgcggagttggttcataagcatcccagtcccatcattggaatt
+ tctgtccttttctgccacaccgcaaatatcgcacaaaaggttgtaaatttcggtattttg
+ gaacggctcaatgtatccctgtggaaaatatggccccataccaataaatacagatctcat
+ atcgatcgcgttattgtcgtatccatgtgatccaatcgtgaacacattgtcttcgttttt
+ gtccttatgatcaccttttgccaccttcttcaattgttctttcttcatcactgcataccc
+ aggctcgggcacaatccagattgacgccatgtgatgattttttccatcattaaagttcca
+ ttctttggggaaatttccattaacgtaaaccgtatacttgtcttcatctatagaagtctt
+ taaattgtggtaaacctcattgatgtttccggaatctttcaacgatatagccatcatcgg
+ accctctagatatgcgtgcgatacataatccttcctcaatttttcgtccagtaagtcttc
+ ccatataataacatttgagggaactacgatatcgctcataccatgatcgctaacaatgac
+ caaattagtaaagctggttaaatttctttcttgcagcgattccactagttgcttcagaaa
+ tgtatctacttcccccagtgtttcagtgaagtcttcataatagtattccgactctgacgg
+ atatccatgcttatgtccaaaggcatctacgttcggtacataaccgagaattaactgtgg
+ tctttcgttcagtgtactcatatccacatattcaataatcttggataatttttgcgaaag
+ tggttctttagcattgaattcgtcgaagtaaaatggagttctctctctagcaataggctt
+ tttatgttcaggttgtagtttctcttcagtcttatacttggtataattcacatcgctacc
+ tggccacatatgggtagcagctttgaatggtatatcaccgtcaaatgcagactgaacagt
+ ttgccatattggttctgtatcattattgttccatattcgagggtccaatacacctggatg
+ gaattcttcattaagatcaggatcccaaaatacgttagagactataccgtggtgtattgg
+ gtattgtccagtaaccaacgtccaatggttgggaaaggtctccgtagggaagctgggtat
+ cataaaaggtgtggacgtgatattcatacctccatcatatttcaattcatataagtcatg
+ taaaaacggtgtgttcctcttagatatgagtgaaggatggaaaccgtctagtgaaattac
+ tatcgttaaggggaaaaattcatgcgtaccgttaaaatacttcacgtaagggtcaaattc
+ aggactgccaaagtgcgatagcactctcgtggagtgatggggttttgctatgttaatcat
+ cagaattaaaaaaatggcaaacaccagcgagccaagcaatgtgcgaaatattattttaaa
+ aggcactttggaacgaaatcccgttccaaaagaaagttcattctcatcaacaagttcagt
+ atcaatgggttgaccattcgcgtctaattcatatagcggtataccccttctaaagttaga
+ accatccctactatctttcgcgtcgccaatcactatgggattgtttgtccacttcagtct
+ attcttataaaaccaatatttcatcctggtccacgcagatcgccaccccgatcttacagc
+ gtcttcatccgttatgaaatcatcacgaaatggatcttcactaaaatcattcagctcgtt
+ atcgagcgactctaaatcattctgaagttccatctatagagagtgttatactgtacctct
+ tttcagatatatgtaattacctcgcctcacttattttgcttatttgtagtttgttaggca
+ gtttgcatatcgtaccaatgtttttttataacttcatgtcttttcttttttttcattttt
+ agcatcgtacgcagatatcttgtctcgttatgatattttgtgagataggtatccaaaatt
+ aataagagaaaatgtcgtattgtgttcacaattcgcctgattgtttgactagtccaaccc
+ ctggtataattgtcacttaatatattcttagcttccaggtggtgttttattccattattt
+ atgttatgtatatataatcgtaactcttagagcaaaaagtaaaaaagaaggtggtactta
+ ccgagggagaacatatacaaatttattcgtaaatgtaaataattaattgaagcagagcta
+ aaagaaaaatacaagaaaatttatttacactcgctgtttggttctgatcttttggtattc
+ acaattttagtgctgtccaaaaatattgcgatgtgctgataaagtacatagtgtgatccg
+ acttcgtcccagtcaaagaaaggatgaaatattagttagtacatttactggttgctactg
+ gatgtcctatctgaatcttccatcttaggtacaaaaagattaagaaagtcaaattgtcct
+ tagttttaaaagacagagttacttcatgtctttaacatcaacagcaaattactttgaaaa
+ aatgcactaaaagttaacacatgtaggagcattctaaacatgaaatttttaatggtactt
+ aggtttgctaacgcatcttgaaaactagcaaaaaccccataattaaaatattcttgcttg
+ ttaaacgaggcgtccaagccattgaaaggttttatcaatattcaacctcttactataaat
+ gcagcacaccgtaacaggtatgagaaagatgtggtattattatacaaattgtactgaccg
+ ctttcctactaataaaaccaatgatatcctagggacgctcgagttttcacattatagaac
+ atttattattattttcggcatcaagccccagagttccttcgacacgttccatttgtttca
+ aaaggagcataaaagtttcttcaacattataatctaactcggcactgcattctataaatg
+ ctgcctggttcctcttatcttgactgccaattgtcgaagctagtttctctccttcagctt
+ tcgtgacacacctttttacaccttttgtactccttcccaaatcagctttggtacccacaa
+ gtattaccgggaggttatccttacccagctgatctaccagcttgtcccagagaatgggaa
+ taagatcaaagctagcacgatttattatactatagcacagcattatgcctcgtacgcccg
+ tcaacgatttaatgtttaatagagaaacttcatcttggcctgcagtatctagaatttcca
+ gagtacagtcatgacttttataaggaattatcctggtaaattcattttcaatagtgggat
+ aataggattcaacgaaccgcgattctacgaagcgaaccgttaatgtggttttgccgacat
+ ttctagctcctataagagcaatctttctctgaaagttatgtgtggagttcgaagaagaca
+ tagtggcgtattccattatgtcacaggttatatcttttctcgccgtgccccaataaaact
+ gtttcggtaaagagacaattcgaaccagcccaaaagaatgaatatgaaaatatcagtatg
+ cgtcaatgaaaatggaggcaagttcaataatttcactggaaatataactagtaataattc
+ gtgtaactgctgtagaaattgttagatattaaacctggcagaagcgtcttgttaatactt
+ atagagaaaccaccagtagcattcttctgtatctgaagatagtgaaaaatcaaaaaaata
+ aatctgtattcaaaaaaaggtcctacccggattcgaaccggggttgtccggatcaaaacc
+ gaaagtgataaccactacactataggaccaggacttcttgtagtatcaacgttctagata
+ ccaacatatcaatataaaaatactgaatattgagtcatcttttagattacgttttggttg
+ ccgcaaaccaaaagtgcaatttaaatgggtgaataatttgaatggttggaaatctattct
+ cggtaaaggttataatatcagtgttggaataaaactctaaaataccatccattcagaagt
+ attcatgctactagcattttatcacatgaggtgaggtgtaagaagatgacataaagattg
+ agatacagtcctcaaacctaatggaagcgaaatgcagggattgataatgtaataagataa
+ tgaaacataagaaacagaaggtagtagtattattatgtagaaatacccattccattttaa
+ ggattcctatgtcctcgagtagaacgtctagtatatacgaaatattataccctttaaaaa
+ acactggacccaacgataatcacaacattcaccgatttctcacctgaacagatcacgatc
+ atccttcagaacatcttgctgtagccatattaaattcctttgtctctggactctttctcg
+ cgcgactgatctcccaatgaagaattgtgtcaggtcatctcgtcctcaatttctaaccaa
+ aattggatgattttagttctcactgacgctgcttgaacgttaacaccgaaagagcatgca
+ ataatcacaacacggtaaaccaacttgtgtcatatttaagagaaatattaaaacgctaag
+ aagatagctcccataatagaagtacaagttatatttatcgttgcccaactgcagaagtta
+ aataaaggtaaataaatcgagctaagagaaaaaaaagtataggcaaaattccgaccggag
+ ctagctttaccttgctcctccacctaagtcacctgctcaggtcactcaagttcacaatct
+ tctgagaggatagataaaggaaaaattggaacagatcttttcatggcctggcagttgtct
+ gtggattggtaaagctctgtcagtgaaaaaaaagttctcgctccttcttggtctcttcta
+ tttcctgagattctaatgctttccttacattgtttcataaaataaccaatttttcgtgtc
+ acaataatcttgaaatttgaagtttctaaaatgcatcagacatggtgtcgtagttagttg
+ atttactttaacttctaccaagaagtagaaaatagcagagtgcgtgaacgcatagcagag
+ aacaaaatacttactaaacgttttcgaataagttagcaagaagcttttaaaaacatagac
+ ttagaaattgtcctgcacttggcaaggaagaaaagaaagattagttaaacagaaatcgaa
+ aaatgcatgatgaaaagctaagtagacctctttagaagtgtcttagcggctattgttcat
+ gatacttttgaaaagagccctgagattttccattaccatcaacaggaccaatgaatagct
+ aaaagaatcgaatgtatttatcggaaaggatttgaaatttcggccatttcctcattatga
+ agttcaacttccatactctcgtcgtcgccatcattttcattatcctggttgtcgtcatcg
+ tcatcttccccgggcatgtcattagcatctatataaggaacctgtttcttagacaaattc
+ tcttttatctgtagtgagcgtatgacgaccgagacaattccgcttgatattgccgtagcc
+ aagagggcgtagcaacctctctcaaacttgggcaccatatctgaagcaaagaataatata
+ gaccaccatgcgttaacggcccctgaaaacatattcattgaagcaagaacgatagcacgt
+ tcttgaagatctgcatgacaaataatgtttgcccacgaaaaaaaaaccgcttgtccagcg
+ tatgctacgcctccaagatactgtgcagaaaagacgacttttggatttagtgggtctgca
+ cgtattagtaccgcaacaataaccattaccaatgatatgaaaacagaaacatgccaatgc
+ ctagctcttgggatcttactcatatatacagcagaacaaagcgtagaaactatacctacg
+ gcaaatatccccgaaggataattatttctttgcgccaacgtatatttttggttttgtaac
+ cataatgcaaatgtagaattagatgcgaaacccaaattctcacctcccagaacccaaaca
+ agagagaacatccaccagtgccacctttttaggactctaggaatagtcgaccagtctaac
+ cgggtgctttcgtccctagcggggagccttctcctagcataatgtagctcttgttcatta
+ aaaatgtatctcgtcatagaaaatttgctaacagcacttgtttgatcaggaaggccgggg
+ aagaaaataaacccataaattgcaatgggtaatgtgatacaaaaatcaataatgaataac
+ catctccaacccgccaagccattccgcccattcaaatgagtaaagatacttgtttgcata
+ aatccactgaacatagatcccaccaaaccgctacctgtaaaaatagcacttctaatgggc
+ aattcgtcttctttataccacgaacccaaaacaaaatgtgttcctgaaaatgtgcaactc
+ tcaaataaggcttgaaagaatctaattgcgcaaatatgtttgaacgatgtaactttgtac
+ attcccaaggtcaataaaccccaggcaaacgtacaaaaacttagccatatcctaggtgga
+ acacacaataacattaaattattggggaccatacctacaatgtaaccaatcatgaaaact
+ gtgttagacacggtcaaatcgtttccgaccatcttaagatcttccttcatacccgatata
+ tatgcattggtgaaaccgacacggtcgacataattaatccagtattgcaagcaaacaaat
+ gatagcacaaacaagtctattttaaacagaagtaatctctttttaatggcacgtttggaa
+ gaaacagattctctctcaacctcatgttgagtgatagatttcgattccttcatcatgact
+ taggcccaggctctctccaatcaaaaattctgaaaacagcttctcaatcttgaataaatc
+ ggaatacttttatatcagaattacaggtccgcctcatttctcttaagagccgatgtcttc
+ ctttacaatatacgcatgtagaagatttgatcgatgctgagttcttgtaattcgtcaaaa
+ atgcctttcaagatcgattactcaaaatctacaataatcggactgcatggatcaatggag
+ gtgtagccaattttgtaacaatgaaggatatgttaaattcgctaaagcttcctcctcttt
+ tgctaaatttcatcatcaaaaagcattttgcataacacacgttaaccaatatattatagt
+ atatatacattctatttgagaataagcatagtttttacataactagttcgttattccaaa
+ tgtatttttcctatttgccataaatattttaagtaaaagaaagcatgctctagataaaaa
+ atatcgaggaagagtcgaaataagcaagcgtaaatattacaattattctgaagaagcagc
+ attttcttggccctcagcattttcttggccctcagcattttcttggccctcagcatcttc
+ taatttcttcccattcttcaataaattaatgtccttttgaactaagctcaaagtagtacc
+ cttagaaccgtcgtctctctcgaagacatagttagcagcatctgcttcaacatataccaa
+ agcgctgaataaattatgtaaaggaaatgttagtaaattataacgcgtaattgtaaagta
+ agtaaaattgaagaataaataacatacccttttctaacatattctgtcaaaaaattgatt
+ tgaggttcattgaaaacggtgatattataccaattggtttggccatctcttcttggttgc
+ gaagcgatactatatttcaaataacgattgttattagcagaagtatgttcagtgaattca
+ gagccaatgcggccgacgatggacattttcgagaagtccatcttcttggtagtagcatgg
+ aagaaacgagcttgagtacgtaaaaacattacaaaggattagtggatttttgtttgtttt
+ gtatttgtcaaggacacacaattagcaaatgacttgttatcaaattcgtggtatttgtct
+ ttctcaactactgtgttagcttttttttttcgcctgggcgcaaacgccttttaagcactc
+ ctccacaaataataagtattagatgaaaagtcctaaaagtccgaaaaaaggaatagaaaa
+ gaacgaaaatacagaatctttacagaacattatagtacataagtgtagttctacgtaagt
+ cttaaacaaaggtttcaaaaggaaaatatgatttagcggcgccacctccataaattatgg
+ attagcaaacaaaagaacaattaatattgaaaaatagaagcatagcagaaatatcaagcg
+ agaccatggtagttaccagtagtcaaagttctttttgtgtttacatcgagccactcacca
+ ccaaatggattgttttcatccagctcctgacaagtcaaggtgattccactgccgctgccg
+ ctatggggtagagccgcgttatccgtctcactgatagtgaatttggtaatcacgccttta
+ gcggattcatgtgccaggccgtctgtaataaatctggcgattaatctttgaccttcacca
+ tttcttgtcaaaacaactggttctttaaatctccacgttattctcttcttttctttactg
+ aaggagccctgaggttttgataaagcgcttgttgcattagcaccatcgatattcacaaag
+ acaacgaggtcttctattacgatttgtgatatttcatcgggtaagcttggtgacatcttg
+ acagtcaatacaacacttgcctggtgagattcaaatctccagaccggatgaataacaata
+ ggcgcgatcggctcctttatggaatatttaatagctcctaaggttctagaatcgataaat
+ gaagggttaactttgaattcttctggagcaacacgctcaataaaggcttggttcagaatg
+ accttttcgaacttagcgccattattgattctcaaattaataccgatagtaaaggcgaat
+ tcatgacagaattgggtaaatagttgagggcaatttccccaatcagttgggagttctgca
+ acataccgtctttgaaagaagcattgagcacttctgcgatactagcattcaacccaaatt
+ gagatgtatctaaagaggagtgctgaaacaacgattgaccaactaaagatgttgaagagc
+ ccgtggcttgaggatttagttctcttagctcgccagtaatctgagaggagagtgtggaaa
+ ctggacgttcctggggaatagaggacaaggaagaagaagtaggggtagcgcctgagccgt
+ gattaggcaaagagtcatagagttgagaatccctgttatgaatgaaattttgtttccttg
+ agggtggtaaagcaggtgcgcggatatgcaagggccttgaatcaacttgacctgaaggtt
+ ggttatcaaccgtagcagtaggaagtgaaatactgggagaggcaggctcaactggtaatg
+ gttttgtttttgattgtggctgaagtggaggttgagaaattgatatcggtactgtcttgg
+ gtaagttacttgagtctggttctttcgcagtgggtggagaactttgttgtggcgttgaag
+ gaggcacttcagaaggagtaaaagtttgcgcaggtggtgtatcggcagtaacaggtgcat
+ ctacgtttttaggagtagcatggggaggaagaggaggagtgtcactagcttcacccgggg
+ tccaattgtttgattttgatgatgaataggtgggtcgacgttgcttctgataaatagaag
+ aagttgcagttgaagacactctggtggaagaattattcggagtctcagtgatggatgcct
+ gaatatgagagtttgatgatgattgctgttgtctcttgttcttggtcttatttctaccga
+ aaattgagcccactttagatcttagggtgtgggacgacttttttgatttcaaggaagaat
+ tattagtagagtctgaaaattcgttattcatgagatcattatgagtgagactggaagatg
+ cagaggcaagacgatgtccaatgtttccaaatgcactcttacgcttatccttttgcgggg
+ atttcttttctctttccgtattagcggcaccattggatacagagaccgggcgggcaccag
+ tggcagaagctggcgaagcattattaggaactacttctttggaagaagaggatagttgga
+ agttgtattgactggcatctttggcaaaacgatcaatttcactttgcggttcgaatgcca
+ ggaatttcgtcatcaccgtttcgcattctttggtagtatttagtaaataatcgctaaaac
+ tggtttgaaatctaagcatacagttttttaaagtgtccaaacggttatagtcaatagtct
+ cgaacagctcaaacaaataggggctttgctggtcccattgccttcttgcctcttctaagt
+ gactagagttttcgttattcttactatagtgctcaattgaagctgcatttttactaagct
+ tggaatgtaagtccttgctttctctccagctagtattgttctccacggattccttcaatt
+ cacgaacgacttgctggtccaaagtgttgtagtattctgtacttgacttcaaatctgatt
+ ttagttcttcaattacagtatcccatacatttcttagctctcccagcgagttgaatctga
+ atgcactcatttcctgcggagtcaaaacattactttcaatcatttgagcgtttaagatct
+ ttgttatatcttcattttcagctattatttttctcaattgctgagcatagttcctcttca
+ gatttgccaattccttaaataacaggtaaaagtctttattcaacaatttgacctgcgata
+ acctgattctaatagtttcggtagcctcgtaaggactcttagtagtcaatatgctatctg
+ catatttggttctttgttccgtcatgagaggtgattattcgcaattttcaaataagcttg
+ cttcaatacgactacagcaaaagtaattcctttattaaggcaataactacgagacctttt
+ ctttgtatttcgtgaaaacttgaatgcaatatagagtttttcaatctgcaagtacttaag
+ gggcccttttagctacacataatatagaaatgaaagatcacgggctacgaaaagccattc
+ ataatcatatacaaatatcacctgtaatatataagaatatatgagctattgattttcttc
+ taccgcctgatttagttgttttcactcttcaaaactaaattcagtattagcgacagtgga
+ ggtgctagtactctgtaaatgaaagtaaacgctggtattcatgtagagcaccaacctgac
+ tgtgaatgcgtaacgtgaaacatgatagccgcctgtaaaatattgttatcaataagtaga
+ ccggcttacacgaataggattcgctttattctttactccccactcaaccctaagcaatgt
+ tcaaaatagtgtgagattattttatgcgcgcctccttcaaaaaaaatttacatatataac
+ ataggtggaaaaacttccctccgatgtctaataagttaatcaatagaatgaaatggacac
+ gaaaaggcgtagacatggaagatgacaatgacgagaacgtcatttttttttccacgtatt
+ ttaaaaaggggtagcctgcttcaccagatgttcttgtcaacaaaattctaaaatgatgat
+ ttttttaaacacaatttatgctagatagtatctgaaagcattcaaactttatgattacag
+ tgttttcgacagttttaaatctctgaaccgattttgcaactcttggtaattgtgtcctta
+ catttccgtagtgacgttaatcaaatcgcaatcttgaatcttcttaactaaaaggttata
+ agggcttttttttattactaaattaacgtatgagctgatttgattatgggcgctatataa
+ ttatatgtacacattttttaactgtaaaaaataatacggcaatacaagtacaatacatgc
+ ataactcataatcttctacctcttctaccacccttctttctggtggagtcagatggaact
+ ggggtaacatcttcgatacggccaattctcaaaccagatctggccaaagctctcaaagca
+ gcttgaccacctggacctggagtcttggttctagtaccaccggtagctctgatcttaacg
+ tgaacggcagtgataccgacttccctacacttagcggcaacatcttgggcagctaacata
+ gcagcgtatggagaagattcatctctgtcagccttaaccttcataccaccagtaactctg
+ gcgatggtttccttaccagataaatcggtaacatgaacgaaagtatcgttgaaagaagcg
+ taaattctagcaacaccaaaaacttgggaattgtcacgagcttgaacaacgtctaaatgg
+ aaatagtaaatcgatatcggacgatatcaaacgtaatgttgttagtatccaaaagttttt
+ ttttttaattcttacatagtatctttgcttttccatcgaaaaaaaagttgacaaaatata
+ ctgccattgacacaataggctctactaatcgctatgaattactttggaggcaatgaattc
+ gttgaattgtctttatgtttctgactttcaatgtatcttgcacgtcgtcagaaacattca
+ acaacaatcaatggaaaatagagatgctttaaaaatgttcactatgtgattaaacatact
+ agacatggcgggttcttgtggtttcttactctccttgggtttaaaatagaatcttcatta
+ aaaaaagtgctttctgtaatatatagtgattaataaccaattcatttcaactgtctacct
+ gaggaagccctgcaccatatggaatgagccaggaagcgggatggtctgccgcaagcagac
+ aaatttgccaaaatgcagtagagaatggcaatgctcgtgctgaaggttaaaaaaagaacg
+ caaaaaaaaaaagagtagtaaaatgtacagatgtttattaagtccgtacaccgcacgtaa
+ gtactccaaagcagtctctattacattttaaatagggagaaagaagatacataacaagta
+ tgaacaagaacgataatcagtgagtaaaatgaatagagatgtcaggcccgtgtgaagtac
+ ttgaaagtaataagtgagtatatagaatataccagaacttgtaagctttcgaataacaaa
+ aagagccatttgtacttattctcgaactcggttgtagagaggacgttgccttgaaagtca
+ caaatcccctacaataataccggtccttggaatgtaaattattttgttaattttcttata
+ agttcacagttgaaaaaacaacggtcacagtgaatcaataaaagaaggatctcactttat
+ aaagaaaagaacaggaaaaaaaaatgtatatctgggtatattacccggcccttgataata
+ tttttcaataattttgccaggtaacaatgtgattcatgcatggcttaaagaaaatacatt
+ gctgaagcaaaagctcaccaaaaatttgcgttgcaaagctttgtttgcggtatgttaata
+ aagttaccatgaagcttttttcatgctctcagaccataaagacaggaagcaacttgtcat
+ atctatttcgacttcagcggttattttcttcagcccattgataaggcgatatatgcctag
+ aagaaaggaagagtgcagacttgataataaagcaagaccaagagcaactaaatagactaa
+ aatatagagccttccaaactgttagttattattacttaatactcgtacgggtgatgaagg
+ tagtgactttgtaatccgggtaatggggaaaataagaagccttcaaagctatagcatttg
+ tccctactgcatcggcccctttccagtacaatagaatctcgttgatgtgattactgattc
+ aactgacccaaataaatcatttgaagaactcataaaaggactatagcagcaagaactgca
+ agccaattggttctttctcctttctgccaaaaaaaaaacacaatacggatatgaattcaa
+ ttattaatgctgcttcgaaagtcttaagactccaagacgatgtgaagaaggctactataa
+ tattaggagatatactgatattacaaccaattaatcacgaagttgaaccagatgtagaaa
+ acttggtacagcatgaactaaccaagataatacaaggttatcccatacaggataatatga
+ ttattaatagcaaaaaaggcacagttgaagatgacttatgcgaactcaataactatacct
+ gttttgcactttcgaaaagctttgatttatgccatgatagcagaaatttcaacatagcgc
+ agccgaaacgatggatacaattattagagacattaactgactcagttagtttcgcagtta
+ ttgttcaaattattctcactttatctaacatttcgctaataaataaacaaaccttgggga
+ agttaaaaaaactgaggattcgaattttcgaaatactatcaaataaaaacgatagttgga
+ aatctacattactacagaaaaaccttatagaatggtacatttttatgctttccgtggatt
+ gcacacctttagaattgcaaaacttatatctccataaggagttgaaattctgtaacgata
+ tcttgaattcattaacactccaagtttctgatcctcgctcacaaaattacctgcaatttg
+ agaacacgtataagctttttcaaatacaaaagtcatctagaattaacaactcgttccttt
+ tttacatagaattcaattccgttacctcaaataggataatgaccatagaaaaacacattt
+ atttggaaattaaggaaggccagttttgtatttcaaatgataactacataatcggtttat
+ ttgaaaacttcgaattcgaagcgggcactttgtactttattggagttttaattgatcaca
+ ataatcgaataactctttatgttgatggaagtatgatcaatcagctcacgttatttgaaa
+ actctatatgccaattaagcacttgtgaactgggatccatgatttgttcaattaaagtat
+ atagattttatttgtgggatggattattaacagaatttgcgataaatatacttcaagcta
+ tcggcaccaattaccaatatacatttagcaagaaaaaagaagggcctgaagttttatcgc
+ tctgccaagactttttgatcgctaaggctcatttaatggccaggcctgcaacagaaatat
+ cttccacaaaatacatcgatgagattgaacttcttgaaatggaaaatatcattattgatg
+ ttaacccaaatgatattcttcaagatttcaccgaatcgtctaattttacggtaaaatttg
+ aggaaagcacaaactcgaaaaatattccggaagtgggtaagtgctatttctataggagtt
+ caaacttggtttcaaaatttgtgtccattgattctatacggcttgcgtttttaaacatga
+ cagaatccggtagtatagacgatctgtttcatcatgtatcacatctgatgaatcttttac
+ gaaatattgatattcttaattggtttaaaaaagactttggcttccctttatttgcttata
+ ctttaaaacaaaaaataacacaagatttatctcagcctctgaatatccaatttttcaatt
+ tattcttagaattttgcgggtgggatttcaacgatatttccaaatccataattctagata
+ ctgatgcctacgaaaacatagtccttaacttggatttatggtatatgaatgaggatcaaa
+ gttctctggcgtcaggcggattagaaattatcagatttcttttcttccaaatttcaagtt
+ tgatggaagcctctatttattctaagttcaattccaataaattcaatgatatgaatatcc
+ tagaaaaactatgtttaagctatcaggctgtcacaaaaagagaaaatcagaacagtaaat
+ ttaatgagctatcaaatgatttaatttctgtatttgttactttattgaaaagcaatactg
+ ataaacgacacctgcagtggtttttacatctctcatattactttattaagagaaaagatg
+ tacgttctacagaaattatacttcaagcggtagatcaacttttttcgttttacttagatc
+ aaggtagcgacgaaaatgcgaagatactttcagagattataccacttaagctaatgctga
+ tgattatggatcaaatagtggaaaataatgaatcaaaccctattacgtgcttgaatatct
+ tatttaaggtagttctgaccaataaaccgcttttcaaacaattttacaaaaatgatggtt
+ tgaaactcatattgactatgctttgtaaggtagggaaaagctatcgagaggagattattt
+ ctttgcttctcacatattctattggcaattataccacagctaacgaaatattttcaggtg
+ ctgaagacatgattggaggaatttcaaacgacaagataactgcaaaagaaattatttatt
+ tggctgtcaacttcattgagtggcatgtgattaattctaatgccagtgattcttcttctg
+ tattggacctgaacaaccatatattaagattcgtcgaagatctgaaatcgctgagcgctg
+ ttccgattaatgaatctgtatttgatcctaaaaaaagttatgtgatggtttcattattag
+ atctctcgatagctttgaatgaatcggaggacatctcaaagttcaagagctcttcaaaag
+ tgatttcagagctcattaaaggtaatataatgtgtgctcttacgaaatatgccgcttatg
+ atttcgaagtctatatgagcacatttttttgtcacagtacagaatacaaactggtttatc
+ caaaaactgtaatgaacaattccagttacttagagctatcatttatagtgacactcctac
+ ccgaaatacttaatgacctgatagatagcaataacaatttgaacctgatgatgttgaagc
+ atccatacacgatgtcaaatctcctttattttcttcgcaaatttcgacctgatacgtcac
+ agatagttatgcctaaagatttttatttctcaagttatacatgtctcttgcattgtgtta
+ ttcagattgataaatcatcattttaccatttcaaaaacgtttctaagtcgcaactgttac
+ aggaattcaaaatctgcataatgaacttaatatattccaatactctaaagcagataatct
+ gggagaaagaagaatacgagatgttttctgagtcactgatggcgcatcaggaagttttat
+ ttgcacatggagcatgtgataatgagaccgttggcttattgttaatattttttgccaaca
+ gattacgtgattgtggatacaacaaagcagtcttcaattgtatgaaagtgatcattaaga
+ acaaggaaaggaaactaaaggaggtggcgtgtttttttgacgcagcgaataaaagtgaag
+ tactcgaaggtttaagtaatatcctctcatgcaataactctgaaacaatgaacctcataa
+ ctgaacaatacccattttttttcaacaatacacaacaggtacggttcataaacattgtca
+ ccaatatcttgtttaagaacaacaatttttctccaataagcgttagacagatcaaaaacc
+ aagtttacgaatggaaaaatgcaagatcagaatacgtcacccaaaacaataaaaagtgcc
+ ttattttatttagaaaagacaacacatccttagattttaaaatcaaaaagtccatatcaa
+ gatacacttacaacctcaaaacggatagagaagaaaatgcagttttctatcgaaataatt
+ taaatcttttgatttttcatctgaaacatacactggagatacaatcaaatccaaattcgt
+ cctgcaagtggtcattggactttgcagaagattttgatgggatgaaacggaggcttttgc
+ ctgcttgggaaccaaaatatgaaccactcattaacgaggaagatgctaatcaagatacta
+ taacaggtggtaacagacaaaggagagaaagtggaagcattttatcctacgaatttatcg
+ aacatatggagactcttgagtcggagccagttggagatttgaatgagaatagaaaaattc
+ ttagacttttgaaggataacgattctattgcaactatttggaattgcagtttgattattg
+ gattagaaattaaggaggggattttaattcatggcagtaattacctttactttgtaagtg
+ attactattttagtttagaggataaaaagattctaaaattatcagaagtatcgcaagaat
+ cacgggatatgacggttagcttaattaacggccctgatgttaaaagggtatcaactttcc
+ taaagcacgaagtctttgtttggaaacttctcgatatcactttcgttaccaaacgaccct
+ ttctacttcgggatgtcgccatcgaattattgttcaaagagagagttagcgcttttttta
+ gtttttacaacaaaagagtgagagatgacgttttacgggtactgaataagatcccgaagc
+ accttccagcagatccaattttttcaagcgttttacaagaaataaacgaccgaggaaata
+ gtatagtggcaagaaatggaataggaaaggcaagcattgcttccaaattcactagcgtct
+ tctcagcgaacaacagcctaatagatggatttgagatcagcaaaaaatgggttaggggag
+ agatttctaatttttattacctgttgagtatcaacatcctagcgggaaggtcattcaacg
+ atttgacccaatatccagtgtttccgtgggttattgcagattacgaaagtaacgtactcg
+ atttagagaatcctaaaacttaccgggacctatcgaaacctatgggcgctcaaagtgaga
+ aaaggaaattacagtttatagagcgttatgaagctttggcttccctggaaaatgctgatt
+ ccgcaccatttcattatggcacgcattattcctcagctatgatagtatcttcatatctga
+ taaggctgaagccctttgtcgaatcctttttgttattgcaaggcggaagttttggccctg
+ cagatcgtttatttagttcgcttgaaagggcctggagctctgcttcttctgaaaatacaa
+ cggatgtcagggaattgacacctgaatttttttttctacctgaatttttgatcaacgtta
+ atagttatgactttggtacagaccaaagcggtaaaaaagttgacgacgtcgtacttccac
+ cctgggcaaatggtgacccaaaggttttcattcaaaagaatagagaagctttagaaagtc
+ cttatgtatcagcacatttacatgaatggattgatttgatatttggttacaaacaaaagg
+ gggaaattgctgtgaaatctgttaacgtattcaacagattgagttacccaggcgctgtaa
+ atctagataatattgacgatgaaaatgagcgcagagctatcacaggcattattcacaact
+ ttggtcaaacgcctttacaaatatttcaggaacctcatccggaaaaaatagcctgcaatg
+ ttcaacagctaacaacagaggtatggcgtaaggttccaatgaagccaatatttgagaaga
+ caatctttaatttgaatgaaaagaacaggtctgtcgattatgttatacacgatcctagtt
+ acttcgattcattatactggaggggcttcgctttcccaaacttgtttttcagaacggaag
+ aatcgttagtgtcattgagaattgtgcataaaaattggttaaaaattggactagatattt
+ ttaaaaagacgcatatggctcagattacatcgtttgcgtactggaagttgggcgaattca
+ taactggtgataaaaatgggctgataaaagtttggaaatatcgtaaagataagcattcgg
+ tttcaggtaaccttgagaacaaaaaaacaatgtttgggcacctatgcgagctaaaggaaa
+ tgcgctgttatcacgactacaatacgcttttaaccttagacatcagcggcttagtatatg
+ tctgggacatgattaatttcgaactagtgagacaaataacaaatgatgcgcaaaaggtcg
+ caatatctcaacatgcagggagcattatggtattgactaagaataacgccatttcgatct
+ tcaatctaaatggacaaatatatacatcaaagaaattcgaaccagctaaaattgtaagct
+ caattgatttttttgacttcactaagttagacgcaggttacagaaagcatatctattgga
+ aagagatggaaatactactagtgggctttgaagatggaactatagaaatttacgagctct
+ ttttgacttttcataatgaatgggcgataaagctactgaaacagctctgtaccgaaagag
+ ggaaagccataactagcattaagggacaggggaagacatacctgtcccagaaaagacgca
+ aggatacagcagagcctcatgagatagaagtgattgcgggaacattagatggcagattag
+ ctatttggtactaggcatgacatcgtaacgcctttctttaaatgattcaatttttgtagt
+ ttatatctttacttttgaaactgatttctcatcccacctagtattgtaattgcgtacgta
+ tccaatatcattaccaacgccgggtatttttttctagtatttcttctccatttcgcctat
+ ggaaaacagcaaaagggtaaaagaaaaaaacaaacgattaattcttcattgaattatgta
+ aaaatcaaaacgcaaccgcagatttaatagagaccagaaattcggattactattgacttt
+ gtgcaccaccttcaaatttactcattgtttaagacaggcagtgggaaagaagccgtcata
+ ttgctcgaatccttaacaagcaaaatatacaaccactaaattattccgaaagggcctgct
+ taataatttgcctactaacttgtgcatagaacagcaaacagaaacaaagcgtaagaaaca
+ tggggtatccgccacctacacgaaggcttggagataagaaaaggtaccattattccaata
+ atcctaaccgaaggcatccttccgctgtttattccaagaatagctttccaaaatcaagca
+ ataatggatttgtatcttctcctactgccgataattcaacaaatccgtctgtaactccca
+ gtactgcatctgtacctcttcctacagcggcacctggaagcacgtttggtatcgaagcac
+ ccaggccatctcgatatgatccgagctcagtcagtaggccttcgtcatcatcttattcgt
+ caacaagaaaaattggaagccgttataacccagatgtggaaagatcctcttcaaccacta
+ gttcaactccggaaagtatgaatacgagcaccataacacacaccaatacggatatcggaa
+ actcacgctattctcgaaaaaccatgagcagatataatcctcaatctactagttctacaa
+ acgttacccactttccctcggcattatcaaacgctccaccgttttatgttgccaacggga
+ gttctcggagacctcgatcaatggatgattatagtcctgatgtaacgaacaagctcgaaa
+ caaataatgtttcatctgttaataataacagccctcattcttattactctaggagcaaca
+ aatggagatccattggaacgccttccagaccaccatttgataatcatgtcggcaatatga
+ cgaccaccagcaatactaactcgatccatcaaagggaacctttttggaaagcaaatagta
+ ctactattttaaaatcaactcattcacagtcatcgccttcccttcatactaaaaaatttc
+ acgatgcgaataaattggacaaaccagaggcttcagttaaagttgaaacacccagtaaag
+ atgagacaaaaaccatatcgtaccatgataacaattttccaccaagaaaatcagtttcta
+ aacctaatgcacctttagaacccgataatatcaaggttggcgaagaagatgcattgggga
+ aaaaagaagtacataaaagtgggcgtgagatagcaaaggaacatcctactcctgtaaaaa
+ tgaaagagcatgatgaactagaagctcgcgctaaaaaagtaaataaaatcaatattgatg
+ gaaagcaggacgaaatttggacgacagcaaaaacagtggccagtgcagtcgaagtttcca
+ aagaaagtcataaggaactaacacgctctgttgaaaggaaggaaagtccagaaattagag
+ attatgaaagagcatacgatccgaaagccctgaaaacagacgcaacaaagttgacagtag
+ acgatgataataaaagttacgaagaacctcttgaaaaagtggaagggtgtattttcccat
+ taccaaaagcagaaacgagattatgggaattgaaaaaccagaaaagaaacaaaataataa
+ gtaaacaaaagtacttactgaaaaaggcaattaggaatttctcagagtatcctttttacg
+ cacagaacaaacttatacatcagcaggctaccggacttatcttgacgaaaattatatcaa
+ agataaaaaaggaggaacatttgaaaaaaataaatttaaaacatgattatttcgatctcc
+ agaagaagtatgaaaaagaatgcgaaattttgactaaactgagtgaaaatttaaggaagg
+ aagaaatcgaaaataaacgtaaagagcacgaattaatggagcagaaaagacgtgaagaag
+ gtatcgaaacagaaaaagaaaaaagcttacggcatccatcctcgtcttcctcatctcgtc
+ gcagaaatagggctgacttcgttgatgatgcggaaatggaaaatgtattgctacaaatcg
+ acccaaattataaacattatcaggctgctgcaacaattcctccgctaattttagatccaa
+ tccgcaaatactcttacaaattctgtgatgtaaataacttggttacagacaaaaagcttt
+ gggcgtctagaatattgaaagacgcctctgacaactttactgaccatgagcactctttat
+ ttttggagggttatttaattcatcctaaaaaattcggtaaaatttctcactacatgggcg
+ gcttaagaagtcctgaagagtgtgtcctacattattatagaacaaagaaaactgtgaatt
+ ataaacaacttcttatcgataagaacaagaaaagaaaaatgtcagccgctgcgaagcgcc
+ gcaagaggaaggaaagaagtaatgacgaggaagtcgaagttgatgagagtaaagaagagt
+ caacgaacacgatagataaggaagaaaaaagtgagaacaatgccgaggaaaatgttcagc
+ cggttctagttcaaggttctgaagtgaaaggtgatccattaggtacaccggaaaaagttg
+ aaaatatgattgaaaagagaggcgaagagtttgcaggtgaattggaaaatgctgagaggg
+ taaatgacttaaaaagggcgcatgatgaaattggagaagagagcaataagtccagtgtaa
+ tagaaaccaacaatgaggtacaaataatggctccaaaaggaggtgttcggaatggttatt
+ atccagaggagaccaaagaacttgacttcagtttagagaatgcgttacagagaaagaaac
+ acaaatctgcaccagagcataaaacaagttattggagtgttcgtgaatctcaactctttc
+ cagaattgttgaaggagtttggctctcaatggtctctcatatcagaaaaactgggtacca
+ aatctactacaatggtaaggaattactaccaaagaaatgcagctcgcaatggatggaaat
+ tactggttgatgaaaccgacttaaagcgagatgggactagttcagaatctgtacaacaat
+ ctcaaattttgatacaaccagaacgaccaaacatcaatgcctatagtaatattcctcctc
+ aacaaagaccggctttgggttattttgttggacaaccaactcatgggcataatacatcta
+ tttcatctatcgatggctctataagaccatttgggcctgattttcatcgtgatacctttt
+ ctaaaattagtgctcctttaaccactttaccaccaccaagactaccatctattcagtttc
+ ctcgttcagaaatggcagaacctacagtgacagatttgcgtaacaggcccttagaccata
+ ttgacacgttggctgatgcagcttcgtcagtaacaaataatcaaaacttcagtaatgaaa
+ ggaatgcaattgacattggccgtaaatcgacgacaatcagcaatctattgaataattcgg
+ atcgaagcatgaaatcttctttccaaagcgcttcaagacacgaagcacagctcgaagaca
+ ctcccagcatgaacaatattgtagtacaagaaataaaaccgaatattactacgccaagat
+ cgagttctatttctgcattactaaatcctgtaaatgggaatgggcaatcaaacccagatg
+ gaaggccgttgctgccatttcagcatgctatttctcaaggcactcctactttccctttac
+ cggcccctcgcactagtccaataagtcgtgcgcctccaaagttcaatttttcgaatgatc
+ cgttggcagctttggctgcggttgcctccgcgccagatgcaatgagcagttttttatcta
+ aaaaggaaaataataattgaacaaacggctgagacgggcaatacatatgctctacttctt
+ ttccatccaatggttggtgaaactctcgagcatacattaccttacgtgtgttagtgtact
+ atattatatatatatatatgtatatatataaagggaggagtttttaattataattgtaat
+ ttcgtattttttctgcattatacagttttttccgattttaaacgactttatttaagtgtc
+ gtgtaaatatgtcacattttatttttgtacgtattcacatgtcctggcgtgcggccattg
+ ctgaaaatcgcaaaacccacagagaaataaacatcgcgaaaaagtcaatgaaaaattgga
+ aaatatttttcatttcactattatccacaagcaattttgtacaaagtgaaaaggttgaac
+ taattatcttcgtctagaagccatgaattcactcgttactcaatatgctgctccgttgtt
+ cgagcgttatccccaacttcatgactatttaccaactttggagcgaccattttttaatat
+ ttcgttgtgggaacatttcgatgatgtcgtcactcgtgtaactaacggtagatttgttcc
+ aagcgaattccaattcattgcaggtgaattaccattaagcactttgccccctgtgctata
+ cgccatcactgcctattacgttattatttttggtggcaggtttttgttaagtaagtcgaa
+ accatttaaattaaatggccttttccaattgcataatttggttttaacttcactttcatt
+ gacgcttttattgcttatggttgaacaattagtgccaattattgttcagcacgggttata
+ cttcgctatctgtaatattggtgcttggactcaaccgctcgttacattatattacatgaa
+ ttacattgtcaagtttattgaatttatagacacctttttcttggtgctaaaacataaaaa
+ attgacatttttgcatacttatcaccatggcgctactgccttattatgttacacccaatt
+ gatgggcaccacatctatttcttgggtccctatttcattgaaccttggtgttcacgtggt
+ tatgtattggtactatttcttggctgccagaggcatcagggtctggtggaaggaatgggt
+ taccagatttcaaattatccaatttgttttggatatcggtttcatatattttgctgtcta
+ ccaaaaagcagttcacttgtatttcccaattttgccacattgtggtgactgtgtgggttc
+ aacaactgccacctttgcaggttgtgccattatttcttcatatttggtactatttatttc
+ attttacattaacgtttataaacgtaaaggcaccaaaaccagtagagtggtaaagcgtgc
+ ccacggcggtgttgccgcaaaggttaatgagtatgttaacgttgacttgaaaaacgttcc
+ tactccatctccatcaccaaaacctcaacacagaagaaaaaggtaagtgtaaaatctttg
+ aaagaatttaagtattcaactttcgtatattcgttttttcttagtggatctattgttact
+ attatcactattattatattgtaaaagaccggatggttttgttatatattacatacacat
+ gttatcgttgaaaaaagttttccgtttcctttcgacagtcatcagataattttatccgag
+ tcttttatatgttaaatcttgttgacaaatcgtccgctctaaccctagataacaacaatg
+ atctttttatcatttctggcgttattttagcgcctcctcccattaatgttaattgcttat
+ agttacctgaaggagcagcgaggatggagatcgtactgtgtatacttgtttcttccgctt
+ cagtatctaggtctgctattagcacagtgttcaatttatccatatcaatgtccacttcct
+ cttcttcagagttatcagagttttgtaattggcattcggggtctaactccactatcccat
+ aatttgaagcaaatgcaatattctttgcgtttatcattaacggtactgacttagtttggt
+ cgcaaataatttcatacgtttctcttatggtggcacttcttcctcttgttcttatagtca
+ ttcttaaatcggacgcacgctcatctataaatgctctaggtaactttacgctctgtaaag
+ cgtacatcaaagaattccaacacaaatcgaagactggaccagtgcgactcaagactacaa
+ ttttcgcatacaaaacgtatgaccattttcttttcattttcaggtttgtttcattcagcg
+ tatcatcttctaattcatcgggatacaagacggaaaacgtaccgtcttcattcgcactac
+ gaacaccagcttttaccttcaaggcttttttaggcaatatcctagagtgcaatatggaat
+ cgtgtagtttttgtgatatagtcatttcttcatcggtgcaagcgcccactctacctcttt
+ caacttccaccaccggatagacagaagcataatttgcaattatgtcctcctctttggtca
+ cttcgaacaattcttcctcaccgaaatcatccaaatctttaatcgaggcgctagtttctt
+ ctattattccacccgtaatagaagttatgactatggttttcccacttttcaaaacatttg
+ agcctaaaatgttgtttttagtgtctatattaccagcatccgcataacgcgataaagtat
+ tattttctatagcaacatctctaaattcttcatattttcttagacatggcctgattccta
+ gggataagtgtctttgtaaagacaattctggagatattctcgcaagaacttcaggtggaa
+ aagtaattgggtgtatctcaatagtttctaacgtggtactttcagccatttccaatatta
+ gatgcctgtgaaactactgtcaaacttatttgtgtagcaatactcatcgtctaatttgtc
+ cattattgttatatactatgtcataaaaaaagaaagaaaaatcttatactttcgtcgact
+ gtgatttgaaaatctctccagatcaaacgggaataaaagggtttatatccgaaggtggcc
+ agaacaatcatgaagcaaataggataaaaatagccaataagatgtttgttgtctttcatc
+ ccaaagttccaacagcatatttcaaagatgggtattacagtaataggttctctaaactat
+ gatttggacacatttacggatagattacctaacgctggagaaactttcagggctaaccac
+ ttcgaaacacatgctggtggtaagggattgaaccaagctgcggccattggtaaattaaaa
+ aaccccagcagcagatatagtgttcgaatgattggtaatgttggaaatgatacatttggt
+ aaacaattgaaggacactttatccgattgcggagtcgatatcactcacgtcggtacttac
+ gaaggcattaatacgggtaccgctaccatattaattgaagagaaagctggtggccaaaat
+ aggatattgattgtagaaggtgctaacagcaagactatttatgacccgaaacagttgtgt
+ gaaatttttccagagggcaaggaggaagaagagtatgttgtttttcaacacgaaattcct
+ gatcctctttccattattaaatggatacatgcgaacaggccgaattttcagatcgtatat
+ aacccctcacctttcaagaccatgcctaagaaagattgggagttggtagaccttttggtc
+ gttaatgaaattgagggtcttcaaatcgtggaaagtgtatttgataatgaacttgttgaa
+ gaaataagggagaagataaaggacgactttttaggagaatatcgtaaaatttgtgagctt
+ ttgtatgaaaaactcatgaatcgaaagaaaagaggaattgtggttatgactttgggttcg
+ agaggggtgcttttctgttcgcacgaaagccctgaagtacaattccttccggctattcaa
+ aatgtttcggttgttgatactacaggagctggagatactttcctgggcggtttggttact
+ caattgtatcaaggagagaccttgtctatggctataaagttctctacattagctagttca
+ ttgaccattcaaagaaaaggtgctgctgaaagcatgccactgtataaagatgttcagaaa
+ gatgcataaaatctactgcaattttttacttttagtttgttaaattttagttttcgtcat
+ atatataaagtttaaaaaggtacaaagaatgcagtaataagaaatgtaattacatgaggg
+ cagtttaagtgctaccttttaagacggaagaactaataccataaccgagagtaattacgc
+ agacaaacgctaacaacgaagcaggaacacctctggagataaaagcgcctacagttagcc
+ atctattaccctttttatcggtcatagaaatagcagtaacgtttggaaatccagatgaag
+ ccaaccccataccgcatgaggctaacaaggcgcaaccgaacacaagaattggagctgcct
+ ttggatcggataatttgtcaccaacttcttgcaccaagggaataatgatgattgctgaca
+ cagtatgtgagacaaaagtgcccacaactaacattaaaataccgaaaatacataagatag
+ caaaaacaccatcgttctgaattttcttttgtaatgctcttgcaatagttaccaacaagc
+ ctgaagatgaaactgccttaccaagggctataccacccatagcaagaacaacaattgacc
+ aagggaatgtgttgaaatcctttgttgatagtagacctgtaccaaaaaacaggactatcg
+ gtattactgcaatttcaccggacgatccaaaagcactttctatttgtgactctacacacc
+ atagaagaatagtagcaatagttacaatgatgataaaatattgctttatagtaaatctgg
+ tccttattggtttaaatttttccagtttagttttgcctattttaaaggtgagtatcatca
+ aggcccaggagcacagcatcgatagaatacctgtaggcagagcgacagcaaaaaattgcc
+ cccagccgattccataaggttttaagtactgcatggagatgatattctgtggcgaagaaa
+ taggtgaagccataccaccaatatctgccgaaagtgcaacacccatgactaatgccttag
+ caaacggtgaagtgtagtccagcggatctagtaagggtgttaatagagaatatgtcaata
+ ctggggaggcaacgttggaaatccacatcgaaagaaagaatacaacactcattgccatta
+ aaaggacatttcttggcttggtacctgcaagggccaataaccacgatgccaaaactttcg
+ caacgttatattgcgacaaggcttcacccaatgtgaaacctgctaataaaatcataattg
+ ttgacgaccacattgtacctaagatttctgtagaggcagctgctgctcccattaccttac
+ cgtcatcgtcttttagtaccctaaaaaggacagttagaaggggaaccaataaacctgtga
+ tgtgtaatggaatggcttcactagcccataagaaagcacagcattctaccaatgccatac
+ accggtgttcgacagggtcattgaaagttttaacacccaacaaaacacctgtgacaatga
+ taatgaagcctattttcattgctcttttcccaaaaaatagcttaggaactgccaaatttt
+ caattttagtaaacttcaaattaattggcttaggcaatggatacttgaaatattcaattt
+ gaagcttttcttgcataagtttctcagcttcatcttcaatagttattatatcgttatttt
+ gcgaagagaggcccaacatgtctttccatgtgttgcttctttcccaaacaatgtggtcgt
+ gcaggtacgactttaactcttgcttgcatgaatctacgttcccaggctgacccatgagaa
+ cagcataaaattcaataagttgtgcaattttactgtttaaactgcttaaagtttcatgct
+ ggaagatgtaggtgtctttgaaaaattcttcactttctattaattcttgcctagtgttca
+ tgtgcaatactttatcagacttcttagtaattttactgaaacccattctgttcaattcga
+ taaaagatttcaactggcaaaggtctatgtaaagatttattattgtctgttttaataatg
+ attttttctgagatttaatgttaaattgcgaataattcaagagggcggtattgtcggtag
+ tgtcagcaaacacgtcaacgtcatcatcttcatccatttcttcttcgctgtaatcaaacc
+ tatttctcaattcactgtcgatgtttgaagagtgtatttcaaatgggttattagatgaat
+ gtgtctctgatgtaccagaaataatttctcttaaaggaggcaatgcttgaatttcaggat
+ tgaaagtttcattcaatcttgttacaatgccttccttctccaaatcattaatcaagtggt
+ caaatctttcatagaacttagcttccattcttttgtagaaatcatctactttcaatttct
+ cgtcgctcaggttagttacgaatgtatcataagaactgaagagcgtctttccctcgctac
+ tcacagatgagctgcgtgattcaaagaatttcttacgtataaaattcttattgaaagagg
+ tggttttcccttgaggagataactcgtctaactcaattgtttcctcgttaatattgttcc
+ catctatggccttttcgtcggagtctccccttttgcttcctgaaggtgttttagaaccaa
+ agagcttacgcttgagtttatgtgtaaatcttcttttcgatggagatggttctcctgcag
+ ctatgttgctttctatatcagcgattggacctggagtctgagagtcagcgtcatcgttta
+ agtcaccggttggcgtttcttgtttcaattcatctgtctgtaatgtgtagatcaaatttt
+ tcaattcgttataatctaggtaatgattctgccattcagggacagcgttgtatttgagaa
+ agtgtgagaatctcatttcgaggcacaataatagccgcctttaccgtagttttgctgcac
+ ctttatctgagagctgactgcttttttggtgtgaaacactgttttctggtaataattttt
+ caatgcatcggattacctttcccacgtgcgaaatcatcaattaattagattgaaaaaagg
+ gtaagggaaaataagaaagaggcggtagcctaaagatacggtaattgaaacgtttcctat
+ gcacaatcttaaacctttttaggtaattgattaagttgactgtaatatctgtaaaagatt
+ acatctaatttacgtctggtttcttatcaaaaaatattatctactgtagttatgcaagta
+ ttcttaactttcagtactaaacatgaatattaattatttgtgagcgtacaagaagcaaaa
+ ggaagtcatctttctttgaacagttctgtttttaggtaagccttggaaccttagctatac
+ tgacttgatacgccctgtctttgttgtcgttatataaagtgtgcacctttatcatagccc
+ ctaataagcttcttgcagtagtatttgaacagtgtatatcttcatattgcttttgttgta
+ aatattgcgcaaaagcgactatctttgtgatttggttgaacttttgcatattcacgttat
+ taccatctttagtgaatgtatcgtttccgtctcttatgaaggtaatgtccctgataagca
+ gtgatgtaaaaggtacacaaggaagctgactgtggaaaatatgtttaattgttcttctat
+ aaacgttgtagttgttgtttggatgtacgacgacttttaggcgctgaaaaaggtggtctg
+ atttaacatctatcgggagagatagtctttctattgaatgattttgcaatgatgtaatga
+ ttgaagcaagtgagtttaagtttcgtaagtatagacaagcaagtgcaacttgtaaccagt
+ aggaaatggtgtgcgtttgctgcaaagttgtctcaagaacgtaggacgataactggttgg
+ aaagcgtaaacacggagtcaattgttgtatcgttgtgtttgaaatgtcttgtgaattcta
+ ttgtttcaatatccaagtaaagagaagattctaagagagtcaatgtctttgcaagtgacc
+ atggagatacgtttagagcgaaggcttcgacttctagaggttcaccctgtttccattgga
+ aaagtaaatcactgaggtcagttgcaccgcacaattcatcatttgcgttcgttttgtcta
+ tctcgtattggagaaataagttttcgtagtccaggggcggtttattttcttcgatttcct
+ tatcagttagaccaatgtaatgaatcttcaattgcctcgttctttttttgtttataaaca
+ gctgaactatgtctgcaaacagttcttggtattcataatattcagccaagtgactgtacc
+ aaaaattgaggatatcttttaatgttgtctcactatcttgccaataagactctacccaga
+ tttgtattagacgagggacggagtgatttttgtgtttgtttttattaattgtgggatagg
+ atagtagcaactcttggaggagagcattgtcagttgtccagtctctgaagttaagtagta
+ agtttgcggagtcaaagggggatggcttttgccatttgtgagagttgtgcggcagcatct
+ tattcaaatagagctgtattctgaagacctcttgtagaacatcatccatactaaaaagta
+ aatcgtcctgtcccattacgagctgtattagtgctgtgaccctctgtatatttacgttgc
+ catgaagaaggtaatgggcgatattttgatacaattcctgagttgcatgttggattgagt
+ ttacgaagggtcgccagacggccagaaacctccaggcggagttaacaactaataatacgg
+ catccatgtttgcatcagcgccgagcctataccagtcactgagtagacgttttcttgctc
+ tttttatgtcctgacttcttttgacgagggggcattctctagagacacaggcagttgctt
+ ccagcaactgcggtacggccgttctcatgctgtcgaggattttttttgggacgatattgt
+ cattatagggcagtgtgtgacttatgaattgttgtagaaggacgtctgtgatgttggaga
+ tatgtattttgttaactcttcttgagacgatttggccctggatagcgaagcgtgcggtta
+ caaataggtcgtcttgttcaagaaggtaggcgaggacattatctatcagttacaaacatc
+ ttagtagtgtctgaggagagggttgattgtttatgtatttttgcgaaatatatatatata
+ tattctacacagatatatacatatttgtttttcgggctcattctttcttctttgccagag
+ gctcaccgctcaagaggtccgctaattctggagcgattgttattgttttttcttttcttc
+ ttctattcgaaacccagtttttgatttgaatgcgagataaactggtattcttcattagat
+ tctctaggcccttggtatctagatatgggttctcgatgttctttgcaaaccaactttcta
+ gtattcggacattttcttttgtaaaccggtgtcctctgtaaggtttagtacttttgttta
+ tcatatcttgagttaccacattaaataccaacccatccgccgatttatttttctgtgtaa
+ gttgataattacttctatcgttttctatgctgcgcatttctttgagtaatacagtaatgg
+ tagtagtgagttgagatgttgtttgcaacaacttcttctcctcatcactaatcttacggt
+ ttttgttggccctagataagaatcctaatatatcccttaattcaacttcttcttctgttg
+ ttacactctctggtaacttaggtaaattacagcaaatagaaaagagctttttatttatgt
+ ctagtatgctggatttaaactcatctgtgatttgtggatttaaaaggtctttaatgggta
+ ttttattcattttttcttgcttatcttcctttttttcttgcccacttctaagctgatttc
+ aatctctcctttatatatatttttaagttccaacattttatgtttcaaaacattaatgat
+ gtctgggttttgtttgggatgcaatttattgcttcccaatgtagaaaagtacatcatatg
+ aaacaacttaaactcttaactacttcttttaaccttcactttttatgaaatgtatcaacc
+ atatataataacttaatagacgacattcacaatatgtttacttcgaagcctgctttcaaa
+ attaagaacaaagcatccaaatcatacagaaacacagcggtttcaaaaaagctgaaagaa
+ aaacgtctagctgagcatgtgaggccaagctgcttcaatattattcgaccactcaagaaa
+ gatatccagattcctgttccttcctctcgatttttaaataaaatccaaattcacaggata
+ gcgtctggaagtcaaaatactcagtttcgacagttcaataagacatctataaaatcttca
+ aagaaatatttaaactcatttatggcttttagagcatattactcacagtttggctccggt
+ gtaaaacaaaatgtcttgtcttctctgctcgctgaagaatggcacgcggacaaaatgcag
+ cacggaatatgggactacttcgcgcaacagtataattttataaaccctggttttggtttt
+ gtagagtggttgacgaataattatgctgaagtacgtggtgacggatattgggaagatgtg
+ tttgtacatttggccttatagagtgtggtcgtggcggaggttgtttatctttcgagtact
+ gaatgttgtcagtatagctatcctatttgaaactccccatcgtcttgctcttgttcccaa
+ tgtttgtttatacactcatatggctatacccttatctacttgcctcttttgtttatgtct
+ atgtatttgtataaaatatgatattactcagactcaagcaaacaatcaattcttagcatc
+ attctttgttcttatcttaaccataaacgatcttgatgtgacttttgtaatttgaacgaa
+ ttggctatacgggacggatgacaaatgcaccattactctaggttgttgttggatcttaac
+ aaaccgtaaaggtaaactgcccatgcggttcacatgacttttgactttcctttgtttgct
+ agttaccttcggcttcacaatttgtttttccacttttctaacaggtttatcacctttcaa
+ acttatctttatcttattcgccttcttgggtgcctccacagtagaggttacttccttttt
+ aatatgtacttttaggatactttcacgctttataacaatatcaagtttaccttcttcatt
+ actattcatcttcgccacaagtcttctctcccttggtgtttccaatctaactacaaaact
+ gttgattagggtgtacatcaccctaacaagatcatgtatttgcttcctctggtacaagct
+ aagaacaggtaaattcaaaacatcccagagtaatatcttcaaagggctataccctttaaa
+ catatctcggcatatttgtattaacccactaatattttgacggccaatcttttctatttt
+ tattttcatatcatcgacgtaatgaccacttaaaaacaactcaatatcatcctcactttt
+ tatatttttaacttcattgtgcacagattggacggaggacttaatatcgtcacttatggt
+ atccaaactaccatccataacaactagatttacacattcagaaagcacatcaacactttt
+ atccctaatataagaatcctcatggaagcaaagtatttcagtaaacaaaacaagggcatc
+ cttgttttttaaaccaccttctaaaagcatgattttcaaaaccattaaaaactcattctg
+ aaagctttccctggaaatatctccgttatttatacctaatgttaatgtatgcattatcat
+ tcctgtcaattccgtaaagtcataagcacccactttatgcaatttgtacttagcatacat
+ gatagtagtcttaatcaaagactcgtatgatgaaagccaattcaccaaattttcgtacct
+ caataactgagtcttaactttttccatatatgctttatctttatactcatttaaagcaac
+ tttaacaacattttcaatcaagctgcggacgtacgcatcatcactgtatttattttcacc
+ attctcattataaacaagaatatccagtaaaaattgcttcaccaattttggacatttacc
+ attttcatttttcactttagccaaatatttgggaatattacactgtaagaaatacaactt
+ aggattctcaaagttattactctttggaatattagagtcttctaggcaaaacaaaatctg
+ aaaaatctgaattaaatgcttaacaccgccagtaaaatccggatcatatacgtatttact
+ aagcgcttcgcacgccgcgagtcttatgccaaaaaagtaacgctcgtcgatcgccgttct
+ aaacaaaatacttgaataaacaagtgatttcacaccaccattaacaacgacgtcctcata
+ atatcgtatggcttctagttgggcctctatgtccccatcttgtctcaactgagaagaaaa
+ catgtaatccggctgattaatgtgcatttggcaaatccattccagatcagagtctatgcg
+ tatccactcaaatgcttcgttttgcttaagcaattcattactctcagacgtacgattaaa
+ ttccgtcaaactgaatcgggaacactcttcgggcgacatgtatacatttcctagacaatt
+ cacatccacaacaataggcttctcctcattattgttttcaacaccattttcatcatttgc
+ accaccacctcttttccttaatcttctgtactttgtattgtactgaatatctatttttgt
+ gaatgtatctttgatttccacaatatgctcatacggagtaccatcgtgttcatggatcct
+ tatagtcatcgagcccgtgaagcattcggttcggttcaaatctggatgttctaagtggtc
+ tagtgcacttttgaaaaatccttcctcccctaccactttttcgtggccaagttcttcatc
+ ttgaacttgccttatacccagttctataaccatcctcttcctattaaatctttgggtgac
+ acgtaatatgggtaccccagacccatatacccattcgttgaaaaaattctctaatttact
+ tttattaactctttcgcaaacatgttgaaaatgcgatgaagtcaacgaattattcggtaa
+ atcaccagacatagcttgaaggaaaattttaggtaatactcgagacataccgaaagatcg
+ ttctgttttagtcatccttctgtcaagtatgtgtagtatcatcggtgccttcaattgtat
+ aaaggacaaatctttagaagaccaagatattggcctagaactgccggtaaaagtgctccc
+ aataggcggtttctcgaagtcttgttccacaatcgcctcactattacgctttaattgata
+ tttatacgtgttattacccattaattttttggttacctgaaacaccatataaccagcaat
+ accaagacagcaccagtagtcgttcatatctaaaggagttatattcacacaagaccattg
+ gttagcaagtgcccatgctaactcattcgtagtactgaatgctttatcaataacttctag
+ agggtacaataatctggtattacaaatgcctaatgctgcaaaatccatatgcttagaagg
+ tgcggtaggtaaaaacaccatagagtaacaagtgaaaggataagacccaaattctttcga
+ gtagaaatctataattttttggcagacaactgtcgaatttataactgttaactcatctgt
+ ttcctgcgtcggaagtgtgaaaatttgaatgggtataatatccgaaccaatgtcgtcatc
+ catagtatcgttaacattgtccacaacattctctctcaacttgtcttcctctacttcgtc
+ ctcggcatcaacacttggaggtgatatcaaaggtaaagaccatgaattaaaggcgcctat
+ agcccaaccaatgtgatgtggagccacaggattaattatctgaaatatgcattttttttt
+ cgtcaaatcaatcgggtgcggaagttctttaatatttgaatattctgaacaacacacaat
+ cacatccctcaaactcggattgttgctttcctcaatattccgcctttcctcttccccttc
+ ctcttcctcttcgtcactttcgccttcctcttcttcttcatcattatcttgtgcatcttt
+ gcttttactatttttgccttcttcgtcatttttcaagttagaatccttatcttcgtcttt
+ gataactcttgccggcttcccctcttcctcttcatcgtgctcaggcgtatcctccttctc
+ tttttcactctcttctccattttgtccgattaatttcgaagtacctatatttttaaccaa
+ tctcggtacgctgaattctaactcccatgtagatttttcatcaagcaaatcgacacatgg
+ gacccaatatgatgcagaactgcaaatctcaccatttgaagtgtaaacgttccataacca
+ gggcttgtcagcatacacagtatcgaatttaatacccgactttgggtttctgatttcata
+ ttcaatttgtaatgtaattggagtgaaaacagattcttgaaacccaggggtagtcttaat
+ tgaaggagtaatcggagtgtaattacttaatgcattggcgtcctccaaagatattttgat
+ ggaagatggtatttttatagttaactgggatttagagtccgtttctgggtattcattcaa
+ gctagcaaacttagatctcaaaaaatgtgactgttcaatggaattgtcgctgtataatac
+ attttgtgaagtcaatccattcaaatttgtttgaagtgggtcgtcatgaataaattgatc
+ acatcgacgattttcgaccagaacatctttaatagtcatttccttgcaatcaaaagttac
+ atattctaggttttggatcaacggaatgattattatagtagcgctaccggtaatgcagtg
+ agtagctagatctatatccaacgagattttttcatgggcaactctaaaatttctaaactt
+ catctcatgcaaagtgctagattcactaacaatggctctaggagtggcgtttttggaaaa
+ ggacatcatgaagtccacctatttatcctgttgtcacctcttcactgtccaaaacttcat
+ caaaaaatacttttacatccgttccattttactatagttttcatcttacgcaatttacca
+ attgccccattcggcaaaagtatttaagtgttacccggttacttaaggtaataccaaaaa
+ tattcactatattgaaagttctatgtattttattgaattaaataagacttgattttgtag
+ cacgatatccgcaagaatgattcacaaccattttagatgtctgcagcttttttttttttg
+ atttttttactaattttagtgcgattttagaataactcttggattaaataaataaaaaaa
+ atacaattatcactcagattttgaagcaatctgataaaaattgcagcatattattataac
+ aattctagttcactacaaacagcacattatcaagaaacttatgtttgatggacagtttct
+ tcaaataattgaatgcagtaaaaaataaaaaaaataaccatggaatatgtttctaagaat
+ tcgcaatccttgtttatataactatagctttttttatgttataagaaaacatttattcct
+ ttaaccaaaaggaaaaaggtaacaaccccacttacccctctgtcaatagttaatcatttt
+ attattttcaactttaaacatccaaaaaaaataaaataaattaaattaaaatcaaataga
+ aaatgcattattactataataatatctctatgtagacatatacatattatttcattgctt
+ tttgtaataaacgaggaaaatggtgggataaaaaaatgcattgttattccttcgaaacca
+ gcagtaataagtcgtcctgagaacgatgtagatttagatctttacatgttggtactctaa
+ atcacctccttccatcttaaagtcgtccttcttcttgctccaatcaaacttgcttttctt
+ gctgctgggacaccaatttaaacttattaaccagatagtaaacaaaatcatgaaacaaca
+ gaaataacaccaaaaatgatcgtaaacgtgaccccttatgatatcacacactgaataatc
+ atacacttcatcgtctataaccagtccataattgaaataaccgagtccgtatttcaatac
+ aggtttatatccaccttcatcggtaaacgtacttggactcaaaactgtgcttagatcatt
+ agttccttggaatgcatgttctctcaataaggaagcatcaagtggagctgggatcatcct
+ ttgccaaagtacgttaagagcgctttcactgcctactgttgccccaaaaatttcttggcc
+ ttatcttggcacaaatattcttttgcatttagttctatacatacagaaagagcgaaaaaa
+ aagtgaaaaataaacagaatgtgataacggtaaaagaaaatcaaaaataaacaaaaaaaa
+ atacatatataactattattcatgatagtaaacaggtgaattaagatatttagtacaatt
+ gtctattaccccaataggcataatcctcgaggaaaaagtcgtacaatacccacgagggcc
+ atattgtgcacagatgccatagcgcatgggcatcaatctgccattcgtaactaaaaaagt
+ caaacagttccaatgacatggccattgtggttacaatcacaagaagaattggaatgacac
+ acagtttgaagatttgccgcttgaagctggagtacgcggttttcttgaattcgcccatta
+ gcttttgcttttgcagagcatggtagttttggcatgataacattatcaataaaatatact
+ gtaaaacaccgaaaaaaatgttgaatctcatgttgtatgtgtacgaccagtcaacataga
+ gtctcaagatgtgcagggcaaagattgccgcaactgacgcagtgaacgcttgtgctatct
+ tggggtacaggaacatggaagtcattcttgcaaatattgcatgaaaccctgttaggacag
+ ttaaacctgcgaaaaagtaatctagcttctccgttatgatcaaatcacgacagtgaaaga
+ ccgagcttgctgtccaagccaacattcccgccacagtaacgtaaaggtagttccaaatca
+ gtatacttctgctgttttttctcctcctatctccctcttcgcgtatgattctagaaaact
+ ttacaaatcccttataatgtggaataaagttacctatactgaatatcgtcgagaagaatt
+ cctgagtccccaatactctcaaaaatggccatttcccatggaattggtatatttcttcct
+ cttcgtcaattctccagcgcgtaatgatgtgctgacattggtaatcacaatctgagatac
+ aatcccagaacaataacttagagtacaaaggcggggtatcgaaaaactcaatatcatgaa
+ acatgttggtttcagggtcaatgtagtttatttgggaattcggacatcttctgttatact
+ cgcacgcatacgtacagtctataaactcatctaaattatctcctggagagcatgtcacta
+ ggaaacagtgaacaagtagggtcacaaccacagctaacctcatatttcttgccacacttc
+ tgtttcttttctcccgaagggtatggtttcacaccttaggttgctcttttattcgcttta
+ tgaaaaatgaaaaatgaaaagtacaagaatacgcaagagaagagcgaaaaaactaatgtt
+ gacatgacattatgcaataatttatttatgagagaaagagaaaaacactacaaatttttc
+ ataaaatcacgcctcttcaatcgaatctcaccaagaggtagtgtgacatcgttattccaa
+ ccccacagttttgaaccagattgaagattctcagacataggagtttgtaaagtggcattg
+ tactcctccagttccttagcaattgcctctatgttcaatcgagggaacgtttcgtcgtca
+ tatggatcgtccagtttatcgacgccgttgtaaagaatctggtttggcgtagaaggcttg
+ aaaaacaacgttcttctatggaaaacattacgcgtggacaaatactcaatctcctgtgct
+ atcatttcaggctcaattcccttcgagagtaggatcgctgcaactccggtgacaatgggc
+ gtcgacatagatgtacctgacaaaattaaagtgtcgttgtagttcagatgagatagcgac
+ tcaatttccacgcctggggcaaagatgtttacacagggcccccaattgctgaacttggca
+ atcgtgtcaatgtggtcatcaaaggcccctacggtgataacgttttctgcagacgcaggt
+ gaggcccagtaggcgtctaaattgaagttccccgccgcggcaacaaatacaataccttct
+ tcaatggccccctccactgccatgttgattatggggctcctgaaactccctagacttaga
+ ttggccacgcattttttaccctgtgggcgactgacctttgtgcaatgttccacgatgaac
+ tccagaccactaagaacgttactggcctcgccagacccgtctttgcccaagactttgacc
+ tccacaagattgaccctctttgccgctccatacgttttggaacctacgagtcccgccacg
+ tgcgttccgtggccattctggtcgccaaacccttctttggtcaagtcaatcccctggatg
+ actctgtcttcgaattccggatggtccgcgaagatacccgtatccatgatataggcgttg
+ acgtcttgaccttgatagtcgtgttcatagtagtaattgaaccagcttttgtagcggtcc
+ ttgtccccaacgtcgaatgggagttggtagtggcgggaaatccgggccaagtgacggggt
+ gcgtctggttgatacgttatattttgctcttcctcgacgtcttcgtacgagtatttggcg
+ gtagcgttaaacgtataactcgactcggcgctatttacactgtcgccttcaaaagcttcg
+ aacctgaagttgggcacaatatcagccaccagtgggcttttcttcagcttgttcactaag
+ ttcttggacaggttcatggtcacgccgcggaaggatccgatggagaatttcttctcaatt
+ ttgttgtcgatgaactgtctccatgacctgttggaattcgaagtgaattgttggaatgct
+ gtgggatttttgaacctcaccaaatactcctctgcgtatgcgaggttgaccagcaaagag
+ atgagtatgcactgtggcttcatgaccagtgtttttttgctgtttttcttggcggatgtt
+ agaggatgtctagcacttgagtatacctctatatgctgtgtaaaatagaaaataaaaaag
+ ccaatgagaacaatattttgtgtcgttttttttattttcgcttcaattctctatacacgt
+ acgtcatcactacgggaaaccaaaaaaaaaaaaaagaaaacagtacaaaaaggtttactt
+ tacccatagttagtatacgtagattttatacacgcactatttttcttatatacaggagat
+ gggtggccacagaacccgcgcctagcctatttctttcttcttagacctgcctcgaggtcg
+ ccgacatccaaccttgtacctctgatgtagtagtgtttgtttctttgtctgctcgagacg
+ tggggggttagcaagtcgattctctcgatcagcggcgaaaacaatggcactctaatctcg
+ acggccgtcttggctatctggttccgcaacagcaacgaggcgtcttgcaccagttgtttg
+ cggtctatggaaaggatgtagccaacaaaggtgtcgtacgagcacttggacgagtcgtac
+ acaatccggaccacgtcgcctgccttcaagcggtccttgttctttttactgatgagcttc
+ ctcttaacggccccgggatccagagactctatttctgagagggccacctgcttcataatc
+ tgcgacgaagctatgcgctgcacaggtgggtacacggggatggtttttctcttggtggcg
+ gggaccatgtaggaccttgtccatgatccaagcaatctgacgttcctgctccacatcgct
+ atggttctctctccctttcagcagctgctcagagggaagactcgattcttcttcttcttc
+ tttactgggagttccaaagcaccaaaattgaaaaccaagaaaaaaaggaaaatacatcac
+ tacataaaagtgcgcggagccattgaccactcacatacagatataagtgtatatatacaa
+ gtatacatatacaagtatacatgtagaaggagacagagacagagggaaatgtataagtgt
+ gtatttttatagtatacaaaagaatatgcgtgtgaggatgtgtgtttcttctagaacctg
+ tgtcttcttttcctcccggtgaacttggagtccttcgcaactttcctcccacgtcttttc
+ atgagctccttctttcttagaatgaaactcttggcagactccttgtctttgcctcccttc
+ aagcgctggcgcagttgtttcttcaagttgacgttctcctcgtccatggtcacaccgtcc
+ aaattcacctgctcctccccctgcggtggggccccactgctcaacacaaggtagtacttc
+ ttattctttttagactctgggtcgtccaccacaagcccgccactgaaccctgccaccttg
+ gcagactgcagtatgtcgtccacctggtcgtcgtttttcgggtagaactgggccacaaat
+ ttccctcccttcttcagtgcagcatacaatgtgttgaaaaacctcatcaaccgctgttta
+ ggatcgttgtatgaagtgtccgcattgcacagccattggatcgcactgatactaatagcc
+ gcgtcaaacgagcccgcccggaacggtatcccggtgcccatatcctgcaacatcaagtcg
+ ccctccagctctctactaagaccggtcgcaagcatgctgggcgatatatccaaaccacac
+ cacacatggtctccctcctgcgtcaaaatctccccagacagtccggacccgcacccgata
+ tccagaatgaaactgcacggctgcagattcaaaagctccaacgccctcagcgtcatcttc
+ gcctggatatgctgcactctggtcgaacccgtgtacttgtgtgcttcgctatcattatag
+ aaaatctccggtggtgccaactcctcaggacgtgacattatttcttctctgatatatttc
+ ctgtgtttccgtaccgcacctttttagcactacttttttactatgctcttcttcttctgc
+ ttcttctgcttttttcctctttatcacactatgtatgtgctgctcatctcttctttttat
+ cgataaaattgaaaaatgtgagatggtgtagagtgaaaaaaaaaaaaaaatctggcttgg
+ ccatcaaatacccggccgtggttggactcgtttagcgaacaatagcacccagcagaccct
+ ggcaacatgcggatgatataagaaggacgagcgtggtggaggaaaggggcgccattggca
+ cactcacgcaggtggttgttcagcacggcttgcagcaagagcgccaaaacagattgcaag
+ aatgacggagactaaggatttgttgcaagacgaagagtttcttaagatccgcagactcaa
+ ttccgcagaagccaacaaacggcattcggtcacgtacgataacgtgatcctgccacagga
+ gtccatggaggtttcgccacggtcgtctaccacgtcgctggtggagccagtggagtcgac
+ tgaaggagtggagtcgactgaggcggaacgtgtggcagggaagcaggagcaggaggagga
+ gtaccctgtggacgcccacatgcaaaagtacctttcacacctgaagagcaagtctcggtc
+ gaggttccaccgaaaggatgctagcaagtatgtgtcgttttttggggacgtgagttttga
+ tcctcgccccacgctcctggacagcgccatcaacgtgcccttccagacgactttcaaagg
+ tccggtgctggagaaacagctcaaaaatttacagttgacaaagaccaagaccaaggccac
+ ggtgaagactacggtgaagactacggagaaaacggacaaggcagatgcccccccaggaga
+ aaaactggagtcgaacttttcagggatctacgtgttcgcatggatgttcttgggctggat
+ agccatcaggtgctgcacagattactatgcgtcgtacggcagtgcatggaataagctgga
+ aatcgtgcagtacatgacaacggacttgttcacgatcgcaatgttggacttggcaatgtt
+ cctgtgcactttcttcgtggttttcgtgcactggctggtgaaaaagcggatcatcaactg
+ gaagtggactgggttcgttgcagtgagcatcttcgagttggctttcatccccgtgacgtt
+ ccccatttacgtctactactttgatttcaactgggtcacgagaatcttcctgttcctgca
+ ctccgtggtgtttgttatgaagagccactcgtttgccttttacaacgggtatctttggga
+ cataaagcaggaactcgagtactcttccaaacagttgcaaaaatacaaggaatctttgtc
+ cccagagacccgcgagattctgcaaaaaagttgcgacttttgccttttcgaattgaacta
+ ccagaccaaggataacgacttccccaacaacatcagttgcagcaatttcttcatgttctg
+ tttgttccccgtcctcgtgtaccagatcaactacccaagaacgtcgcgcatcagatggag
+ gtatgtgttggagaaggtgtgcgccatcattggcaccatcttcctcatgatggtcacggc
+ acagttcttcatgcacccggtggccatgcgctgtatccagttccacaacacgcccacctt
+ cggcggctggatccccgccacgcaagagtggttccacctgctcttcgacatgattccggg
+ cttcactgttctgtacatgctcacgttttacatgatatgggacgctttattgaattgcgt
+ ggcggagttgaccaggtttgcggacagatatttctacggcgactggtggaattgcgtttc
+ gtttgaagagtttagcagaatctggaacgtccccgttcacaaatttttactaagacacgt
+ gtaccacagctccatgggcgcattgcatttgagcaagagccaagctacattatttacttt
+ tttcttgagtgccgtgttccacgaaatggccatgttcgccattttcagaagggttagagg
+ atatctgttcatgttccaactgtcgcagtttgtgtggactgctttgagcaacaccaagtt
+ tctacgggcaagaccgcagttgtccaacgttgtcttttcgtttggtgtctgttcagggcc
+ cagtatcattatgacgttgtacctgaccttatgaactgccaccataccacgtgtgtccct
+ cgcaagcccttgatagatatacaatagggaatgggcgtccgtccaccgtggtcaaagaca
+ ggggcaaagagctcctaggtctatatatatatctatatacatatttatatatattattag
+ aactttacaatatagtatataccattcattgtttaagtttcgggtaatactttttttttc
+ cttgtcataaccccaaaaattttcgatgcctttgatataattgagaacaagaagagtttg
+ caggtgacaaaaatcgatgattataggtgttgtgacgacaaaatgaacgctaatatatgg
+ gtggctgcttcagatggtaatttggaccgagtggaacatatcctccgcgagagtaaaggc
+ gccatgaccccgcaatccaaggacattaacggctacactccaatgcatgctgccgccgca
+ tacggccacctggatttgctgaagaaaatgtgcaatgagtacaatggagacattaatgtg
+ ttggacaacgacggcgataccccgttgcaccatgtggaggatgtggccactgccaggttg
+ atcgtggaagagctgggtggagacttcactatcaggaatgtggagggccaaacgccatac
+ gactcgttcgtcgagaacggtgaagatggtgagctaatcgagtacatgaggattaagtcc
+ ggcgtggccgatgttcacggagtggacggcgtgcagggtgagggtgtcatcgacagcaaa
+ ttgctggaagagttcaaggacaacgtgagatacaccttggaaaatgaccctgaggaagga
+ gccgatgaggccactctgcaacgcaggaggcagttggaacagatcattacgggagacaac
+ gctgaggaggagttggaaaggtacatccgtgctatggtcagagagcagatgctgggccag
+ ggctccatggcgggttccggggacgaaccagattccaagagaagaaaataacgacccagc
+ acaaaggctcttacagcttgctaaaagaaattgaacgcgacgctacatgaactacttctt
+ tctcttacatagtcttttcccttatgtatcttttctgtacattaatagacgttcttacaa
+ ggtaaaatttcaccgcgtttttaaatagaatgaaaaaaacgttgtagagtgaaagaaaag
+ caacaaatatacagttcacaaggcagcttcgtatagtaatacagcacgaaaaacagctca
+ tagaaatggtaacacagaccaatccggtccctgttacatatccaacggatgcttatatcc
+ ccacgtatctgcccgatgataaggtctccaatctggcagatttgaaaaaattgatagaaa
+ tggattccagactagatttgtatctgacaagaaggaggctggatacgtccatcaatttac
+ ctacaaacaccaagaccaaggaccatccccccaataaagagatgctgaggatttacgtct
+ acaacactacggaaagcagccctcgcagcgattctggcaccccagcggactcaggcaaga
+ ctacatggacactgagaatagaaggtaagcttctgcacgagtccgcaaacggaaagcacc
+ catttagtgagtttttggaaggtgtcgcggtcgactttaaaagactgaaaccgctgggca
+ tgggcaagaagaggaaacgcgattcgtcattgagccttcctttgaatctgcaacaacccg
+ aatacaatgatcaagatagcaccatgggcgataacgacaacggcgaggatgaggacagtg
+ cagaggcagaatccagggaggaaattgtagacgcactggaatggaactacgatgaaaaca
+ acgttgtggagtttgatggtatcgacatcaagaggcaaggcaaggataatttgcgatgca
+ gtataaccatccagttgaggggtgtcgacggtggaaaagtacagtactcgcccaacttag
+ ctaccttgataggtatgcaaacgggctccgttaatgacgcggtttattcgatctacaagt
+ acattttgatcaacaatctgtttgttacggaacaaacagaggctcaagatggttccaacg
+ atgccgaagacagcagtaacgagaataacaataaaaacggtgctggtgacgatgatggcg
+ tcgagggaagtactccaaaggataagcccgaattgggtgaagtgaagctagattcactct
+ tacaaaaggtattggatacaaacgccgcgcacctccccttgatgaatgttgtgcaaaccg
+ tgaacaaactggtatcacccctaccgcccatcatcctagattatacaattgatctttcca
+ aagataccacctatggtgctaccaccttggatgtagatgtgtcgcacattctccaccagc
+ ctcaaccccagccaaatttacaaaaagaggaagaaacagatgctgaagacacagcaaaac
+ tacgtgaaatcacaaagcttgccttgcagttgaactctagtgctcaaaaataccagtttt
+ tccacgaactgtctttgcatccaagagaaacgctgactcactacttatggtcttccaagc
+ aaaacgagcttgtgctgcagggcgaccaatacttcaatgaagatgctgcaagaacgagtg
+ acatatacagtaacaacaacaatgacaggtcactaatgggcaatatctcactactgtact
+ cccaaggaagactataacatcactgtatgatctcattctttctgcaattattgcacgcct
+ cctcgtgagtagtgaccgtgcgaacaaaagagtcattacaacgaggaaatagaagaaaat
+ gaaaaattttcgacaaaatgtatagtcatttctatcaataacgtatacacgtttttttaa
+ tactatctatattgtgtatcttatatttttccatttggtagtatagcagcaaatatttgt
+ tcttagacggccaccatccctcacaccaatgcctaacgcttcccaagtttacagatctac
+ cagatccagctctccaaagacaatctcttttgaagaggctatcattcaaggtctggccac
+ tgacggtggtcttttcattccaccaactattccacaagtggaccaagccactcttttcaa
+ tgattggtcaaagctctccttccaagacttagcctttgctatcatgagactatacattgc
+ ccaagaagagattccagatgctgatctaaaggacttgatcaagagatcttattctacttt
+ ccgttctgatgaagtcacccccttggtgcaaaacgtcactggtgacaaggagaatttgca
+ cattttagaattattccacggtcctacctacgctttcaaagacgttgctttacaatttgt
+ cggtaatctttttgaatacttcttacaaagaaccaacgccaatttacctgaaggcgagaa
+ aaagcaaatcactgtggtcggtgctacttccggtgacactggttctgcagccatctacgg
+ tttaagaggcaaaaaggacgtttccgttttcatcttatatccaaccggtagaatttcccc
+ aattcaagaagaacaaatgaccaccgttccagatgaaaacgtccagactttgtctgttac
+ cggtactttcgacaactgtcaagatatcgtcaaagctattttcggtgacaaagaattcaa
+ ctctaaacacaacgtcggtgctgttaactccatcaactgggcaagaatcttggcccaaat
+ gacctattacttttattcattcttccaagccaccaacggtaaggactccaagaaggtcaa
+ gttcgttgtgccaagtgggaacttcggtgatatattggccggttattttgccaagaaaat
+ gggtttgcctattgaaaaactggccatcgctaccaatgaaaacgacattttggacagatt
+ tttgaaatctggtctatacgaaagatcagacaaggttgctgctactttatccccagcaat
+ ggatatcttaatctcttctaactttgaaagactactatggtacctagctcgtgaatacct
+ agctaatggtgatgatttgaaagccggtgaaatcgtcaacaattggttccaggaattgaa
+ gaccaacggtaagttccaagttgacaaatccatcattgaaggcgcatcaaaggactttac
+ atcagaaagagtttccaatgaagaaacatctgaaacaatcaagaagatctacgaatcatc
+ tgtaaatccaaaacattacatcttagatcctcacacagctgtcggtgtttgcgccacaga
+ aagattgattgcaaaagataatgacaagtccatccaatacatttctctatctaccgctca
+ cccagctaaatttgccgatgctgtaaacaatgcattgtctggattttccaattattcatt
+ tgaaaaggatgttttgcctgaggaattgaagaaactatccacattaaagaagaaattaaa
+ attcatcgaaagagctgacgttgaattggtcaaaaacgctattgaagaagaacttgctaa
+ aatgaaattataagatcttcctttttaatgaatatattttatcttatataacatatagaa
+ ttctataattaatagagagtatgaagtacgattatccttaatgttaatcaaagtgaatca
+ tgattgcgagagtaaggaaccataaaattaatcttcatctatgctaataatctcagaatt
+ ctcgttcccagggtcttcttctttagatactaatctaactttcccgcccttttcaactga
+ tatttcaaacccagctttgctcaatgcatcgtcttgaacaacgtcttgagcttccatttt
+ tttgacatattcctgattcttggcattattctttaacaaaaacttcaagcaaacaataga
+ tctttccttgataaacggatcattatcatcgatgacacaattggaaagaattaatgcaag
+ accacctaattctcttatttgatcttggatttcaggatgtgcatacactaacgatgccaa
+ tatttcgattattactaatttacaatggggaaaattatcagccctaataaaggattcgtt
+ ggtattaacacggttgaaaattaataaccggtcctcgattttgttaccttcatgatctgt
+ gattttaatttcattcaccgtatcaccgtttcctttcaaaaaattaatcctcaacaaatt
+ cttttggaaagtatccaagagagataccaatattttcacagagtcatacgaaattaagaa
+ ttttctaacatgattgaatttgcacaagtcagaaattatgtccaggcaagaaagtaaagt
+ ggagtgtaattggatggcaactgaaggagcataatcgttacaacttttattgaaaaaggc
+ aggaatatccttgcatgcaagacgttgcagaatggggtagcaccatgacataattgcagt
+ caactcataggtattccaatcttctttactcgttacaaccaattgcaatatctgcaacaa
+ agtggtgaattttcctggctgtctttcttctaacccattgaggtatttaccaaaagaagc
+ agttgtaagacttttgctaatacattctaagacagataggtcaatagaagttatttccaa
+ ttttgaatcatccagatagtttctgcccttctcaattaaaaaggaattctcttgatcctg
+ gatgaatttggctgtcactatatcgaaaaatattctatccctaacacaataatagtcttt
+ ctggggtgtatcattatccctgaaaaattcgttggaaaattcatcattgttgagatacgt
+ cttgaaaagaaataaatatggtagccagaaatgctcttcttcttcaccgtcttccaattt
+ gatcccataagggtattgaagaaattgtaaaagagctggaaaagtagtcatatccactgt
+ agagtagttctgctgtatattgaaaaggcacgttaaacatgcgatgtatagatgtttgat
+ tagcggagtgtcaacgacatactcactagttgcatgcagaaaaccacgaattacgaggtt
+ ttgtagtaaaagcttttggggtatttcctggttttcagcggcaaggtttctcatcattaa
+ gataaggcccctcactgttcttgtatacggtattgagctattctcatttaacgacgctgg
+ caacagcattgcttccaaggtgtctcttaatcgttcccaaaaaagctgggagttagccat
+ tgatttcctatgttcttcatcattaaacgtcctttgaaatatacggttcaggttttcgac
+ gattggctcataatcctcgacgcaacatgggtcgttcttcatcatggtaatgattgaatc
+ aaataaattgaattcatctagaaaattgttcataggcattgttatttagttatcttcttt
+ tttgatgaatcaataattttataaggcttagtgtaattgttctcccgatgatagtatgaa
+ acggtagtcgataaatttgtgtcaaattttttcctcatcccatcaattcttcattttttc
+ ccttcctatttttttttttactgaagcgatgatgccttatatataaggaaaatagcctcg
+ tcagtcaaaggacgactcccgtaatggtagcctcagtttatattaggcgctctcatcatg
+ atgtatatcgatattggcgatgatatttttttttgttgatattaaatacaaatgtaatgc
+ tttatatagttctatacttggggaatacatgatgtgaatgtgcataaatagaggacagtg
+ aatatttttcaaggaagctctttctcattttcttcttcatcactggaatcattttcatta
+ gacaagggtaatttgttatccttcccattgaaaccaatccatccttcctcgtcgtcagat
+ tccatgactacatcgttctcttcgttctcttcgttatcttcatctgcgtcatcttttaat
+ agaacctcgtcatcatcagccgcgccatcttccatggacccatcagttgataccaaaaat
+ ctataggtgtatttgttatcagtagtggtattgactacttccttagccactctaacaata
+ aatctttgtatcgacctcatagccgtcgagaagagatatttgtgttcatttatgtaacca
+ ccgctcgcagataatagagctttgatccaaattaggttaaactcaatgtgttgggattca
+ atggcaaaatcaccgatgaacttcagaatcctcggtaaatatattgcaggaatattactt
+ gcaaccaaggggatttccttaataggtatggcttcatagactttattgatcaaatattct
+ tcatttaacctgaacgccattactaatgcatttaaaaactgcttttctcgtagcgcctct
+ acagtagaatggggggtgacgtccacatccagatcaaagggatcaaataatatcgtgtca
+ ttggtggaatatatcaataaaccttccgttgaagcagcggcaaatgcattcgccgttggg
+ gagaattgcaccgaagtaactctaacctctggtctcatttttcttgtggacaggtcgcca
+ cctctttgagaccctggtaaagaattatcaatacgatcctccaaatctgaattttcgcct
+ gcatcgtcaatcaaatctaatgaacctgcttcagtcattttcttactgtttaaaaattcg
+ agagtaccattcaaagccatgtttctggacacaatgaatctttttaacaagacttcattt
+ ggaacatcatatagacaaatggagttattattaccaccagccacaatagccataccatca
+ aaactgtaatgtattgtagtaaaaaatttggatcgttcagaattttggcagtgaacctat
+ cttgattaaacctaccagatattatatcctttctacagtcaatgttgcccacctgcttgg
+ catcttctatgttgaaaatggatatttgaccctttaaggtagatactgcaacttctttac
+ catctggtctcattgataaggctaaaacatcggaataaacttctataggttctacttgtt
+ ggcttctaccaaatatggaccagattctaattgttttatcccatgatgcagaagctaaga
+ cactgttctcttgactaaatgaaagacacgaaacagggccttcatgtccggacaaagcat
+ ctaataattgaccagtttgcacggaccaaacatgaatgtcaaaattgtccagggacccgg
+ cacaaaccacttcacctgatggatccaccgctaaacaattgaattggattctttcagtac
+ cagtgaatgttctaaaattacgatacctgattaagtcccacgctctcaccgtaccatcta
+ acgatgatgagaacatgacctgaccccttttcgcaaactgtacagcagtaactgaagagg
+ tgtgttcttcaaaagtggccaaacaaaaccctgatgtaatgtcccaaactttgattttcc
+ catcttcggatgctgtcactacacgtgaaccatccggagagtatgcaagactatttgtgg
+ aatcgaaatggccctgctgcttcaagatatacgattccgattgccattcgtaaactagta
+ attggcccagtttgctggaaccaaacgccagccattcgccggtttggttgacgctaacgg
+ tgttgactgggttttgccccatagaaagctgttgaatcaaagtgaaatcaggcaaatcgt
+ aaagacggaattccccactagtaaatccgacagctaaaagccttgttgctggatggaagg
+ tgacacactttactttggcttggtttgcgtaaaaaaaatgtttctttgtgattctccagc
+ tgtatttcgaaatatctacttcttcttgcttgtcgtcgtcttcactttcattgtcgtcgt
+ catcggatggcctcttggtaaattcccagacaaagacagcaccgtctttgcttacagtgt
+ agattttttcctgatcatgactgaagaacgcacccataacgtagtctctgtgcccattaa
+ atgttgtcgccgcaagttcttttcctctgaatctacggaccatatttttgcgcttaagtc
+ tttggaagtcgtaaggataaatctggaatcttgtgaccacgtcaaagacgttatgtcttg
+ aaagtgtcccgcatgcaccctatggcggacgaagggagcaaactgtctgtctttattaac
+ atctggagtcttccaaatctgtaaaaacctgcctgaggctaatgcaaagagtctcccatc
+ agggctgaacttcacagcggagatttttctttgaagttgaaatggtgaagcacgttacgg
+ gctttgaaattgacaaggatggcgcgaccgtcctcgtcaatagaaatcagcaatgtgcct
+ tgtttgttcagatcaatggcagcaatatttttgcgatgctcgtattcaaacgtgaacgat
+ ttgttgttgattaagtcaaacacgctgaccctattccccaccggtgagagtagttgcttg
+ ccatcatcggaaaaggtgatgttaccttgcctgtagaccgtacctaaaaggttagagaac
+ ttgaaatcggatttcattgggatgcggttcaatctgcggtctgctctcctgtctacctgt
+ tgaaaacgatactattattactattattgatattatactagttattgtacatacctagaa
+ ctatactcgaaactttttttttttcatctttcagctcatcgcaaaatttttccaaaaaat
+ ttcaaaaagaactcccgtcgcatgtgatcaaggttacaggtgcttgacataatcataatg
+ attacgagtcgaagccggccctgacaacagcttctcttgccgtagaattgatgtgtttaa
+ accggtcgggacctatgtgggcaccgccgaaccaacgggccaccacaacgatgacgttcc
+ acacatccatgatggtgatgaggtgcagcattctggagccggcggccgtttcaccgtcat
+ catcggaatcttgatatgttgccgcagagccatcctgcttgattcgccatgcactcatga
+ catggtttgccttacgcatcttggagtcggtcttcagtaggtctagcatggcaaacgctt
+ gttcctcggaggtaacatgtgctgcaaaggccatgaaagtcgagcctctatcagtaatgg
+ ggtccgacgcggtccagccctcgaaggggtctgtgggaatgtcactctgctggaccggtt
+ ctgtctcctcctctggttcaacgtacaagacaccgtcgagttctgtgaggaagtcaaata
+ gacagacagatccgcggtggaaaacagagtccatcacttcctggaacaaatgctgaaggt
+ acttggtatcgtagagatcgcgcttagccaaagaagtgcagacaccaacttcgatgacat
+ taggagcctcctcggaggggtagtgtgtcgggaaggagatctgcagtgtcatgtattcat
+ gctgcggcactttcacaacgatgatgcttccgtcttcctgcttcttggagagaagatccg
+ gatagatggcctcgacggcctccagttcttcgaccaactgttcgtgatcgtcatccattg
+ agcttttctttcctctctcttttttttttcttgttacatattcctatatatatgtatata
+ tatctatatatatatatatcccagcgtatatgactgcacaagacgcaattttcttgaacg
+ gttttttcttttcagcactgcgaaagagagagcaacggtaaagagcatgagccaatttga
+ aaagcagaaggaacagggcaattctttgttcaaacagggcctgtatcgcgaggctgtgca
+ ctgttatgaccaactaattactgctcaaccgcagaacccggtcgggtacagcaacaaagc
+ catggcgctgatcaaactgggtgaatatacacaggctattcaaatgtgccagcaaggact
+ gcggtacacctcaacggcagagcatgtagctatcagatccaaattgcaatatcgtctaga
+ gctggcacagggagcggtaggttcagtacagatccctgttgtagaggttgatgaactacc
+ ggagggatacgaccggtcctgagaaaagagaaagggaacacttctaatttatctttaaaa
+ ttatatatatatatatatatatatgtgtgtttgtatactctgtgggtatttccgtgtata
+ tggttaataatagtagtatcttgtcagttttttttatgtttttcttcgcgcgtcaacttt
+ ctaccaagagaaaaacaatataaggtctccttactctataggagaataaaacaaacaaaa
+ ataaaaagcacatcgtagcgccaagaaaatactgcaaataccaaataccacaataatact
+ acaattatctatacacaagtgttttgccgcttaaaacttcgattttcatagtacgaaact
+ atacacccttggtttttctcttttctaaatacatatctaccttgtaaagaatttcccgca
+ catctttgcgggcatacagttcatgtattggcaactaacggaactaaggcaacatatctt
+ gcatattgcaatgttcactacatagatgaaaacttatatctaggtttcacgacacgagaa
+ taactaagaagcacgatccatgatatagaaaaatcagttacgacgaagcacggcaaatta
+ gccgccgaagaccgatatttttgccaccggtcacagttttcttttccacggagctcttcg
+ cggtttttttgttccggattttttttaccggctcttagcggaccgttttggttataaggg
+ gagtggcagcggcggtagacactgcgctctataagaatacttgcaagggtcttgtctatt
+ gtataattcgctagtatttgttttgcattgtactcttaataccccaaccaaaaacaaaat
+ agtgagagtaatggtgcgttttgtttcaattttaagtttattcggctgcgcggcgacgct
+ tgtcacggcccatgatgacatggacatggacatggatatggacatggatatggacatgaa
+ tatcgatacgacaacgtctcaatccatagatgtctcatccacggcttcaatcgtccccgt
+ gccacatgaaccaaaacatttgcatggccttcctatactgcaatcgccctcgcttacccc
+ tgcggagagattgtactgggaaaactacaacaccacaacctactttactacacaggctgg
+ gaataggtctgcccttcgctaccacattattacgctgctcttggttgcatttgtgctcta
+ ccctgtgtccctggcgctaagcgccgcccgttctaggtggtacttacccctgctgtttgt
+ taatctatgcatttgtatttcgtccgtaatggcattgtccgtgttcaaaaatactttccc
+ ggaagaagactggtatgcgcataatatctatggcaccacttctgtgctacttctcgtttt
+ tatgcttgttcacttcttcgctgcggtgctttctgtccccgtctcattagcatcgaaaaa
+ ggagtaccgtccggttgacaccatccctctgaatgatcttgaatctacgcccgtcatggt
+ gaatagtgcacgtggctctccaagtccttcttccaacagagacacgttgttctcgctctc
+ ttcagacaccacgaccgccacggccaccaataataataaacggagacgcgctgaaggcga
+ agacgagggtgataacacctccaaccacgacactttgcgcgacgaagactacgataatga
+ tgacgacgaaattgcttccattgaagcgccacctctgcttcctcaagacatacccgtttt
+ ccgaatcttgtttaccaacacgaagtaccagatgcttgccgcgcacctctcgtgcgtcgc
+ caacgtggtctttcacatgcttacctacccgctattcatgtacatctttgtagacctaat
+ catcggcttcgctgtaggtaacttgctcggcaagggcatccgcatctttaatctcttggc
+ ccactggattaagggcggcgtattttttactctgggcgttgtctctttagcaagatactg
+ cggtttcgcagctaagtacggctgggcatggaacaacatcagcttcacctctcaactcac
+ acaaacgcgttcctccaatcttcttttccggtttgctcctgcggggactttcaccatgga
+ attcgttgaatccttcctcattttcttttacgggtccaccaacatcttcttggagcacct
+ ggcaggaaacggcggcgcatggactgccaaggatttacagcatgtgtcgataaattctca
+ ccggccccaaggtgtgtgggctactcacggagtacaagctcaaccattggcgattcgagc
+ atgcccgcaaacggccacagaccgatgtagttgctgccacaccggggtactctccaaacc
+ cgttccccgctttcaccatattttggactgggattctgatgtcccagcacgcacagtcct
+ cgcaattttctactaccattcacacgcaatggggatacttgttgtcctatgggtccttct
+ tccgtctgctaacatttttgattctgtttttggtgcccaacaccaacagtgccgcatcca
+ agcctttcacggagttgatcacctcgttctgtctcctctgtggtggtctggtatttatgg
+ agtccacggatcagtccattgaagccatggaatatagggggtttacccccatgttcactt
+ tcaacctcagtgttggattcgtttctttgttgatggcttgggaaatgattttatttattt
+ ggaaagactggctcatcaaaaccaggaaaaccagtctttaaccgccataatggcttttct
+ tttttcttttttttttcacccccccctcatctttgagtgagaagcttatatgctttactt
+ ttcctcacctactttttaattcttaattattagtttatccagcattcagtcaattctgaa
+ tgcgttatcgcttcctcaaaatttataaaaatgaataagcaagatctcattggaattgag
+ ttcaatgacgattattgagggcccatacctgataaaagtgaatatattatgtataatgac
+ tctactggaggttctctctgagaatgtaaaataatccacaaaagacatgaaaattagtta
+ ctctacataatgtataataagttcttccttccttttttctttttttagtttttctactca
+ ccatcctacctacgccaagctgctctactgtcttggatctgtgaaaaaacgcctagaaac
+ atgtataaaacctaccgctttaaaccataatttcaaatcgaaatggcactatggccgagt
+ ggttaaggcgagagactcgaatggaataaaaagttcggctatctcttgggctctgcccgc
+ gctggttcaaatcctgctggtgtcgtttaattttttttaaataacatcgttgattaaaac
+ aatcctattcactgcagtcacagacttagcacatttacaaccagtacaagatcaatcaca
+ cgcgttgtggtaggaactttgacgatgatactgcgttggctcttgaacgcatagtacatt
+ tgtcgttcacaacggaaagtgaaatactgaaaaaaaaagaaaaattttcttagggactca
+ tcatcaatgtaaacacctctacagccgtcaagactagcaaagctcacctcacaaaacgac
+ atgccgcgcataaagaccagaagatccaagcctgcacctgacgggttcgaaaaaatcaag
+ ccaaccctcacagatttcgaaatccaactcagagatgcccaaaaggacaagtcgtctaag
+ ctcgcagcaaagtccaatgagcagctctgggagataatgcaactccaccaccagcgctct
+ agatacatatatactctgtactacaagagaaaggccatctccaaagacctttacgattgg
+ ttgataaaggaaaagtatgctgataaattgctaattgccaaatggcgcaaaaccgggtat
+ gaaaaactgtgctgtctgcgctgcattcaaaagaacgaaactaacaacggtagcacttgc
+ atctgcagggtgcctcgtgcacagttagaggaagaagcacgcaaaaagggcacacaggtg
+ tccttccatcagtgcgtccactgcggctgccgtggatgtgcaagcacagactaacgacta
+ tatgcacgttgcagtattgttctggccatgtaattatctctctcccaacgttttagaaga
+ ggttttcgtctccttcgtcaaagacgaccactttcccattttggtcattggaacgcgaaa
+ aaacaaaacagacgcgagaacgcgacgcgaaaaaatatatttttccgttttttttttttt
+ tcggaacaaagataacgcgttaaaacctgtggatgtaataaggatctgaaacactgacta
+ aaacgcgtcctcatttaaaatcaaaaaattacttcttctcctccttacggtgccctttga
+ tccttctcaaactttaaatatatgttgcatgattaattgaaaatcatatagagtatatct
+ taacagaggttccaacgctttaaatcgtattctattcttttccccccttttattatagtt
+ acatctactatttgagcttcttttattgaccgaaactacctcttatttcgaagcaaaaaa
+ aaaaaaaatcaacaaaaattccaacttttcaatgatgaatgaagacatatccatcattga
+ tggccataatagttttttaacggaaaaaagcaccgtgctattaacccaagccaagagaac
+ actagaagacgaaaaggaaatgattactcccccgagctcaactgtgagaaaaacaatgaa
+ ggaagtaaataagaggccgtcgcatcccctctcaccggatcactcgtccccaattgctcc
+ atctaaggccaagcgccaaagatcggacacatgcgctcggtccaatggtaacctaacctt
+ ggaagaaattcttcaatctttggaaagaagaagaataaatggtgaactcgccaagaaacc
+ tccatattcgtatgcaactttgatttgcttggccattttgcaatctcaggagggaaagct
+ aacgctatcccagatatatcattggatccacgttcacttcccttattacaagcagaaaga
+ tgctagttggcaaaattcaataagacataacttgtctttaaatgatgcgttcatcaagac
+ tgaaaagtcctgcgatggtaagggtcatttctgggaggtcagaccgggtgccgaaacaaa
+ atttttcaaaggtgaaaatcgtggttatgaatttgtaaaggactccttacaagacattgg
+ gaagtattttgaaatagattctacacttgatgaattagaacaagttgagagtggagaagg
+ caatgatgatcttcctgacgaggaagaaagagaggaagcagggaaattcccttccattga
+ aattcaattgaactcctcccctatactgagagtttcccagttacatcacataccgcaatt
+ gaaaacagacaacagtgtactgaaccctcacgaaaacctagaatcgatgcggaacatgat
+ agaaaacgatgtcaacaatatagattccttggaacctccttatgtcatgaagaaatatca
+ tacttctttaggcttaccgtcgctggtgaatgccaaagatcatttccaggcgggtgtgaa
+ aaacaataatatcacccaggcaaatagatttaatacactccctataactagcgcaaagtc
+ tcctcagaatttcagaaaatatttcacctcattcaattcaaattttgaagatttatctcc
+ acttcgaagtaatgtaggggctggttctctactcgacccacttccgtattccccattgaa
+ gctgtacgatcagaaaaatcttgcgctcatgtcgaaaccacaatctcagcaatcatattc
+ caattctcaacttccacctccaccttcctctcatggttcggacttacttaaaacacccaa
+ gatgaggcattccgatggcttagagaaaaccccatcgcggttgataagcacacctaagga
+ cggtaactcgattttgaggaaatggcagactccttcacacctttttgaagatttgtactg
+ ttctccgctatttagagctatagagactccaatcaggtatatcacgacgccggggggcaa
+ ctttggaaacccaaatttcaccaagaaagtcctctgcacccgatgtcctcacaagcgcaa
+ cgaattccaaatttgcttcaagcgggctgtttggcgtggatgtttattctgtttggaagc
+ gcgcaactgaaaagatttctgatggtaacaatactacggatagcaatcaaaaacatcatc
+ cttatcataatcacccttccaacgatagcggtaatgaaaagaattgactttttttttttt
+ ttttttttcattttcttttcctcaccgaaaactaattattcttaatctgatggattcatc
+ gcaaacggttggatgagtcctaaactgggctggagttatcatatcacatagcattttctt
+ tttaatcttattgtattgtcgggtcaactaacttcttttcccgctcttctatggtataat
+ caattgcagaagaggcaggaaaaaattgtaatcataacaaaaatgatacaaatttttttt
+ ctaagttttatattttacatatctttattttttgatccttacggatttttaatacagtat
+ ttagctgaagtttccattcacttcacttacctttttgatctcttttatttttacttgccc
+ cgttgccttgcatttttcaatagaaagtgaaaaactaagtaccacttgagcaatgccaca
+ ttagaagaaaaaccatccgcaagtgagcatcacagctactaagaaaaggccatttttact
+ actcatggaccaccaaataaccactgcaagcgacttcacgactacttcaataccgagcct
+ gtaccaattggatacacttttgagatgtcacatttgtaaagattttctaaaagtccccgt
+ cttaacaccttgtggccatacattttgttccctttgtattagaacacatttgaataacca
+ accaaattgtcctctctgccttttcgagttcagagagtccttgctgagaagtgagttcct
+ ggtcagtgaaataattcaaagttatacatccctacgatcttccttactagatgcactaag
+ gataccgaagcctacccctgtccctgagaatgaggaagtaccaggtcctgaaaattcttc
+ atggatagaactcatatcagagtctgaaagtgacagtgtaaatgccgctgatgatgactt
+ gcaaattgttgcaacaagtgaaagaaaacttgccaaaagatccatgactgatatattacc
+ actgagttccaaaccatccaaaaggaattttgcaatgttcagaagtgaacgtatcaagaa
+ aaaatcaaagccaaatgaacaaatggcccagtgccccatatgtcaacaattttatcctct
+ taaagcccttgaaaaaacacatttggatgaatgcctaactttacaatcactaggcaaaaa
+ accaaaaatttctaccactttccctacagagtcaaatccacataacaaaagttcatccag
+ attcaaggtacgaactccagaagtcgacaaaagctcatgtggtgagacctcacatgtgga
+ taagtatttaaactcaatgatgagtgcagaacaccaaagattgccgaagatcaattttac
+ gtctatgactcaatcccaaataaaacaaaaactgtcatcgttgggactgtcaactaatgg
+ tactaggcaaaacatgattaaaagatacaatcactacgaaatgctttggaattctaattt
+ ttgtgattctctagaacctgttgatgaagctgaactaaaaagacagttgttaagctggga
+ tgtttcacacaataaaaccccccaaaatagtagcaacaagggtggaatttctaaattaat
+ gataatgaagagtaatgggaaatcttcttcatataggaaattacttgaaaatttcaaaaa
+ cgataaatttaataggaaaggatggatggttatgtttcggaaggattttgctaggcttat
+ cagggaagcaaaaatgaaaataaaaacaggttcatcggacagttcaggttcagtgggaca
+ ttctaatgatggagatggtgttgaaaaagttcaaagtgaccagggaaccgaggatcagca
+ aatggagaaggatcaggacactgttatcaacgaagatagagttgctggtgaaagaaattt
+ gcctaacgaagattcaactgatgctgacttatcaagagaattaatggacttgaatgaata
+ tagtaaagacccacccggtaacaattaaggtgtgtatatgtaatcaggcctgtttgttag
+ cttgtgcacatttgttaattaataatttatttaaaaatgggcaaaatataaatgcataaa
+ agatgacatagagatatgtgcacaaagctatgtaatcatatttaacattaatatatatta
+ tttcgttattcacaattcgtcatgaaggccagcgtagtttacggttcctgctactgcatt
+ atcatcgttgacaatgtttgtattcgaattattattcaaaggtgctgcaattgtttgcgg
+ aggggccactggattaggcaaagaagtgatctccacgttaagtgctgaaataacttcact
+ agatagagagacgtcttcggtaacatctaccgtaccagatgcctgcatttccgtcgtaag
+ cattcgggaaacaggagtatttacattttcgctgattttgtggcgtaccccaaacgcttc
+ agatgaatcttttgatgactgtacttctatcattatttctttctgtacaaatcctgatgc
+ atcaattgaactagcagatattttgtgagggctacttacaaaagatggtgctgttgtggt
+ tacacctggcatcggctgagctactgtgcttgacgcgctctccacggataaagttccctt
+ gcctatatcagacagagcgccaattgaagcaccactactactaaagatcatagatgccat
+ agaagtcccttcaatatttgtggattctgatagggaagattgaccaacactcgtcttagg
+ tgattgtacagcatcagtgaaaacactagtgggttccgaactgtcagccaaaacactagt
+ gggttccgaattgtcagccaaaacactagtgggttccaaattgtcagccaacacactggt
+ gtattccgaattatcaatcaaggcactggagcgttctgaatcctcggtagagaaagcaat
+ tactctcgaagtcttcgtttctgtttttgagaatgagctagttgccgtctccacaatagc
+ tgatgaaacaggttcagagatggaatgaaatttagaaaaagtggaatctgagattgaagg
+ gattgattcagaagcaagaacagaagaagttacagtcgatacttctattgactggaagcc
+ agaggttatagatgaatcagaaaatgtactcggatgcgaggaaatagagtcctgggatgc
+ tgatgttgacaaaatttcttcatttgggatcaaattagacgcaactgagtcaaattcgga
+ tgctatagaaatagaatgcgtaggtgtttcaaccatacgctctaattctgatgctatggg
+ atcaatttcggagacgggcgaagaaaccatagtgtctgtgggcgtcaacaacattgaaga
+ agatgttgtaacatactgctcggcttctgcattggaatctgatgcaggtatattcctata
+ ctcaatttttgtaatgacttcagtgattatttttgtcttataaatctctctcgtggatgc
+ cggttttgagttgtctagaaaagatgtgttggctgcatcattggtggggtttgtttgtgt
+ ggtggtaaccgcagtcgcggtacttgtggataatgctgtagtagggctagttgttaacga
+ tgaggaaagagataatgaagatgaagatgaagacggcatgtgtgatagcgaaggtgattc
+ agtacttaagagttttgaaacgctttctctatctgttgttatttcgctttctgattgttc
+ tactattaacgaggacaatttctcctccaagtcaggagtcggctcaggatattctgtgat
+ agaatttgaagatgaggataacaaatctatcgactctgaattcagtggaagcgatggttt
+ ttcgacaatattcggttctgtaggagaggctagcggcttcgtaatttcagaggtaggtga
+ agacacaattggttcattagtgacactctcttcatagaaagaagaagaagaagaagaaga
+ agaagaagaagctgatgttgggatatctgcagaagtgaattgtgaatcagctgatgacga
+ tatccaaacaaatttttgctcatcttctgtggtggttctcatagtgctagtgtcaaatga
+ ggaaaattcagtctctatatcgctagcaacatctgtagcgctgatgatatctgcactttc
+ agtaatgcctctgtaatcgtcattaatagtgaaataagaagaagatccttggtctatggt
+ agtgttttccttgaatttcgacttatatatatcatgagcatgtttgtaaataaagccgtg
+ cacattatgtttgtagcctagttgaatagcgccagccaaaaccactaacaaaacaaaatt
+ caagaaaaatttccaaatatttcgtaaaattgatgacgatgtaccggaaagcttcagaac
+ gttgatcgtgttccccatagaagtgcttgccaaatatctagaatctggagaaaaagtaac
+ ttttgttatagtagattcatgtactcttggaaccaactgaacaagctttagcttttcaac
+ gctgacgatggcaaccaagttatcgttagatgataatgcaacgaacttcttatttgggga
+ gacatccatcgatgttattccattcaaatcgaacatgaaggatgctgtttttaaaatttt
+ aacacctttggaactcacacctagttttaacaaggaaactccatcagagttcgataatga
+ tgctgccactaatagcgtattgtcattgagaaacctgatattagaaagaacaagattttt
+ atcaaaatccgttttccttgctacaaaatttccgtccctaacagacgctacttccaaaga
+ attagaagtgatatatgacaaaagaataccgttaggggcgaagtgcaaatcattaacttc
+ cctacccgtttcaatttcgtaattttccgtcaaattacgcgggtcaatgattcttatcac
+ agtgggcacttcagaggacgcaattgcagctactgtggccctttgattaatgtaaacaaa
+ cttagtgaaaacggatgggtctgtagttctgttgaaatcaatactggccacgaattttag
+ atgttcttgttcaaacacaaacttcctcaagtggtggttaacgttctccatggaggaatc
+ attaaacagttcattgcaaccaactagaatcaggttattattacaatccaatgaagtggg
+ cgcatcgtcaatcgaatctaatgtgatctcacgaaaccttctaatgtgtttctttttctg
+ gaagttcacctttaaagctgtaattttgtttggaaacgaggaattgaactggcctccacc
+ accagccaccagtagagtgccttcatttataaatttagcaccatatatcggatatccgac
+ gtcatagtttgcagagttgccactcattattatgctagtttttctaagtttaaaaggaga
+ gaaaaaaaaggaagagaggacgtatcgtagggaattgtgatgcttgttgacaaacagcct
+ ccgtcgatcgaagcagctataagcgtaggttacttgaagcgttcacagttctcatttaca
+ ctgtacaattctcgagatataaatgaaaattactacaaccaagaaaaaaggacaggaggg
+ ctaaaaggataataaaatagggcagttacaaatgtggtcaaaaatactagggctcaaaat
+ tgctaagtaagggcttgacagatgtaacataatacgaaataaactggatatcgtttaata
+ attatcaaggtgatttaaatcgtaaatacataggctgggcatatacactaacatgtgtcg
+ tgaccaatgtgcagcagatagacttgctcattaaatatattccaggtaggattctctaag
+ ggttttttttttttctctatagctgtcacttttttttttttctcttatcaatcacagata
+ taataagcataacatcggaaagaaaataatgatagggcaaaagggaaagcatttccagtt
+ gaatacagtatctctactatattgttgttcgctgaagacagccgtttccggaaaaaagtg
+ attgtgtgatcgcagttacccgcacctttgttcaacagtacataataagaatctgtagat
+ atagatgaactttctctattacataagactagcctaaaagccttgtttctatcatttaaa
+ ggcttttgagacactcaaatcagtgtcaaccttttggataagtcaaagaaagtacgccgc
+ tgctcagaaaattgttatcgtgattgagaactgatctaggcattacaattaaaggaaaca
+ agggaaatattctattgaatgttgcataaaagcccttcaagaaagagatttgcttctcct
+ ttgcatctaggatgcattctaacgcttacagtgctctgccttattgcttactactttgct
+ ttaccggattatttatcggtagggaagagctcatccaggggtgccatggatcaaaaatct
+ gatggaacctttagacttaagtctatctacaggcatggcgtcggagcaaaccatcggcta
+ caccagaggctggaggtcactccagaagttatttctgccgctggaatgctgtatcaagaa
+ accacaactcaaggacaggatttcgaagaccaagaacccctatggactactaatgccgaa
+ tatgccaccactaacccattcgattttgagtttgagctacgaaggatgccgctattgatg
+ aagcggatgaaggaaagagatcctgaattcatagagtcgtacatttacggagaaacatac
+ atgacggaggaggaagaacacgccatgtggatagatgatgacattgtggcacccaatatc
+ acagatagaggtacagtagtgtcgttagcattaatgtcgtctaatgcttacgtgagaata
+ ccgcaaacgggagactggcgtaacgtcacggagccgtggaatgaaacagagccggaagat
+ tttggttgggatggtgatggtatacgtggtcacgtcttttacaatgaagtggagaatatc
+ gtggtactctcgataaagggaaccagcgctcagggtttgccagggtctggcgaagatgag
+ acaactggaaacgacaagatcaacgataatttgttattttcgtgttgttgtgcaagagtg
+ agctacctgtggactaccgtgtgcgattgctatgtgaagtcgtatatatgcgatgagtct
+ tgcctggaaaaggaactgagacgtaaggatagattttactctgcagtcgttgacatatac
+ aaaggcgtcctcaaagaatacccgatgcggccatatgggtcacaggccactcactgggag
+ gcgcattggccagtttactgggccgcacttttggattgcctgcggtcgcattcgagtccc
+ ctggagagctactaccttcaaaaagacttcacttaccattcccaccgggacttccctcat
+ acatggagggtatctggcacttcggtcacaacgcagacccaatcttcatgggtacatgca
+ acggagctagttcaagttgctcactggtgggctacgctatggaaaccgcgtgccacactg
+ gtagagtctgtgtctacgatgtggtcaacgacaagggctggagtgtaaacatgttcaacc
+ acagaatccacaaagtcattgacgaagttctccttggataccgagcaggctgccaagtgc
+ gttgaaccagagccctgcgtagattgctacaactggaagtttattccaagcagagactgg
+ gaatcctcatcgaggctcattaccaagactaaaagccacgctgcaccaaccacgacgact
+ cgcaccactgctactactacctcttcatctacctgtgtaggccgcaattggcttggcttc
+ tgcaccaaatacgagttgtaagacccatacatctattatgaccctagtgttgttttaggg
+ cctatgcgccctttttttcttccattgtctgataaataatgtatcattttgacaacttaa
+ aggttcagctaaacgacacgagaagatatattaatactgctttaaagtgaaaggcgtcag
+ tcaggtgcaattaataggtcacatattctttgcttcaattgctctctagcaaactcacac
+ actcgtagattggaagaccgaagaaaatagatagactttcgaataagttcagcaacaaca
+ tgtggttgaaatccttgctgctctgcctgtactccttagtactctgccaagtccacgctg
+ caccttcatcagggaagcagattacctccaaggatgttgatcttcagaaaaaatatgagc
+ ccagtccccccgccacacatcgtggaataatcactatcgaatactttgatcccgtttcga
+ agtcgatgaaagaggcggatctgacttttgagttgtacggtactgtcgtgcccaaaactg
+ tgaacaactttgctatgctggcccatggtgttaaggcagttatcgaagggaaagatccca
+ atgatatacatacttactcgtaccgtaagaccaaaatcaacaaggtttaccctaacaagt
+ atatccagggtggtgtggttgccccagatgtgggtcctttcaccgtctatgggcccaaat
+ ttgatgacgaaaacttttacttaaaacatgacaggcctgaaagactcgcaatggcctatt
+ ttggacctgattctaacacctcggaattctaacacctcggaattcatcatcaccactaaa
+ gccgatggaaatgaggaattggatggcaaaagtgtcgtgtttggtcaaataacttctggt
+ ctagatcaactaatggatgctattcaatacacagaaacagacgaatatggaaagcctcag
+ catgaattacggttcctgtatttcgttctagaaatcttaaaaattagtaacatcttagat
+ ttgcacgctgcgtacacagaaaaagtcgagaagtttagaaatggcgatgtgtctgttggc
+ tccactttggaaaacatcttccgtaacgataaagcctacacacctttaaccacctccact
+ ggaaccaccgcctatgatttaaaccacccaatttccagagccttgatgtgtttaactgtt
+ cttggcctttgtttcattgcctacaagggcatgcacgaaaagcctcatacggtttcatta
+ agacacaagtaagcaacttcttttcgatggcttgatgaccacatggctatgattttaatt
+ aatatatatacatgacaattatttttttatacattagcgatacctaaaatttcttggtta
+ atacccttttgacagcctctactgcattgcccttgataataatttttttcgattgcatca
+ ccacagccatgattttttgggtatgaaaggcagttgctcctgcaagtcatttcttagttg
+ aattacatctccttcaatctttctgatctccgtgataatcttgttacctccatttttata
+ agcggaatacaccggtagatttccggtcctgcttctctccacgtagtaggtctttttacc
+ aaaattgttgtttcctatgagctcatgcatcttcacgtcttccagctttggaaatattgg
+ cgcctcagcagctttatttgaattagtggatatacatcttatcgtttggtatatcgaggc
+ agtggcaggccctggaaggcatttacccctaccaaaacatctagtaacacacgaactgat
+ catagttttgctcacacgctaccttggagtaagttgctttaggtaaacgggcacttcttt
+ caaaaggttttccttcttgctggagctatactgcttatagtaccatattttaaattcata
+ ccaagccattgtacgcgatgaccattatagatcgaccttgcgacaccatcatatttaata
+ ttcaaaaagaaaaaaatgcagttctatatcacagttatataatatttatatgtgcatgta
+ tgttacttgtctttgcacataactcaatgcgtccacaatcttaccatcttgtctttccca
+ ccactacacactcttttaccgtcgacactccagtcgacgtataacttcgtctttatacca
+ gggaggtcaacagaaagttttctagttcttacatcccacactttcaacgtggtatctttg
+ gaacatgacaccagtagtcggcagtccgatgaccacgcaacctggtatacgctggctata
+ tgccctctaaatgtggagataaacttaccatctctaccgtcccaaagtttgatagagtta
+ tcaaaagacgctgagacaatatacctaccatcagggctgaacgccacatgattgactaat
+ ttttggtgaccggtcattcttgctataggcttggtagattttagtgggttccataaaaac
+ atggtataatcatcgcttgcagtaaccatcatttcttctgaatttccattctttttacag
+ attttttcataattttccaatgcctttttctgggcttcttctggtgtagaaggcttctta
+ cctgtatgatcgaatgcaccaatgcgcaatgcgtaatctgtagataaagataagtgatta
+ acccagtgcgcatgcgacttcaaaatgttgatacatctgccctgcgaattgatgtcccat
+ acacgtacggttctatcgtgagagccactatacaatagaccttggccgccccatttgaca
+ caagacactgaatttgtgtgaccactcatcgtatactggcaaactctgctcacagtgtcc
+ caaatcttaatagtaccatccttagaagatgaagctaatcttggtttggagcccggcttc
+ acaagatgtataggttcccaacttaaagaagtgatccacttggaatgacctctgagagca
+ tcacctagacactgaccgctttttgggtcccataatcttatggtattgtccatggatccc
+ gtagcaattacttctccatcgggggaccaggaaacgcagagaacccaattgtagtgaccc
+ tttagagtatgcattggcgtttgggtgtcacagtcccaaatccttgcagtattatcacct
+ gcaccggttaccatcctagaactcgtatgtggtgcgaaggcagaacacaaaattgtggaa
+ ccgtgacctgcaatggctgatgaacttctagttaccggcttgactttgaaaactgctctt
+ ggcgtatacagtagcgtgatctgatcttctgtactgttatagcctggttttattaatgaa
+ gaatataggttatctgttatatcaatcgtcttcacagggtcactggccttcttaccttga
+ attgtacagctgaaggtatatggcactggatcgtctgaagtaccgttcaattgatttaaa
+ agttcttctaactgtttctcggagatagcaccgggaactctcagggcgccacctacattg
+ tcaccagtatctaaagcttggaacttgattgaaacattgggtaaatctttcggaataata
+ gctacttctctgggaagttgagcctctttcttttgtttcttagaaggtggaggaatcagg
+ gtggacatctgtatattgtgttagtgtgtgtttatcgaggttacaaatataaaaatcttg
+ tgatatatgtcttttaaagatactctttttgaagctcatcgcattttttttttttttttt
+ cattttaaaatttttcaccggtttctcgggtaacagctctaaatgaaatcttgccttcat
+ caccattattgaattacattacattatatatcgtagtatatcatatttttagacgttgac
+ cactctatgagcttaggataggtacttgatctttctctgagttgccaaatgctatttcac
+ ggatttgtatcatccaagggtctatcagtagttccacagcagtagccctcatagtggggt
+ cttgaaccaaacacctttccaagaagggctcttcccgctgcagtcatttcgtctctattg
+ ggtagttgcggtattcgacctgcagcaacgtggtacatgatggcccattcattatccaag
+ ttagaccaaggtcgtctacctgtggccatttctagcacaacacatcctaatgcccataca
+ tcgtccgctccaagttttcccttaactgccgagcctgaaatagtctctggagccatatac
+ atcggtgtccccatcatttcattgagggactttgtttctactccaaaatcttgaacggct
+ gcgttccgcacagttctagtcctagatcctacaacggtacgtgccgtaccaaaatccaca
+ tattttatgattccattgaaatccagcaagatattctccggtttaatgtcgcgatgcacc
+ acgccagattggtgcaaatatgccaaaccttctaatagttcgaatgtgtacacttgtgtt
+ accatttcatcttcaattcttccatgatccaataacgaggctaaagaaccaccctcacag
+ tattccatgaagatgttaactttatcgcgatgtacttcgacaccatagtactggacaata
+ ttaggatggtttaacatttccaatacggtcatctcttctttaatcaggggaaaaatcttc
+ ttcattgttgtggtatcgtgtatctttatttccttaacagctaagatttcaccgttttcc
+ agattaattgcagagtatacctgtccaaatgttccaccgccaatgaagcttcttttttgc
+ cacctcattgatacattcgataatgaggatgctagcgatagtaggtatttgtttccctga
+ tctgtagcatccaataccttacccatttgcctgggattcctcttcatagtcttttccaac
+ gtctttatagcttccagtctcaacctgctgtttatttctagggtggcctcttcatcaata
+ ttctcttcagtatcaatattcaatcttgcctgttgcatgccattttcagcttcagtggct
+ cgtgcacccataacgtcgaaatgtgaaattaataatgacatacatactgataccctctcc
+ ttcagttcttgaaattgctttacattcagcgccaggatattataaccactagtcatctgc
+ atcgcaaaatccattgcaagaacgcaccatcggaaagttcgctgatcggtaggatcacac
+ tcttcaacaaggaatttcaaccagttcattcctaatttaaccatcagaaagatgataacg
+ tacttcctctttgagctaaaggcgttatatttaaggaaacttcttccaaaatcccttcca
+ aacatgaatatcgaatttaacaactcatttcccggacattgcttcataaagctacctaga
+ attcctttgtagttgttcaatactgtataagtaagtttgtaatatgccttattgattttt
+ tgtagggaggattcaattgaattgagtgaacaacgtttctctatgaatgaaacagaatca
+ cctaccgtttgattaaacttgtcgatttgataagtcagtgcataattagaaccttgattg
+ agtaaaatcaacgtatttgggttaccaattttccctaaatccatttcgttggatgcaaaa
+ aggttgttacctggaagcttatatactggtccatcccaaactactggctctccaggatat
+ agtacaactagtgcacccaatgagctcaaagcaacttccaattcaaaaacttcattgttc
+ ttttcatcaaggtgctgatcaaaagggtctgttgaaaaattcttctttttggactgatga
+ gcatgcctgtccaaatcacttgacgaatttgctactactcggtagtaaggaattcctttg
+ gaatcctcccctttcgatactagaatgttcaaatctgtttcttttgttgttatatcatac
+ acattcaaattatttccaatgtcaagctttgggaccaaatcacagccaatggatttattt
+ cgcaaaattcttaagatggtatcattgtcacaacctaataattcaggagcagcaaacata
+ tatacaccactagactcaaaagtgttaccggagtataccaaaaaatagtgagcatctttt
+ aactttttaacaagttgtgcattatgattaatctgataaacagcagaattttggaacgcc
+ ttgactagaatattgctgaacctgttcagctttcttttcatggcacccaaattttcgaaa
+ atggaacttagccatttttctgcttctgaagcgttagcaaatgttggtgggaagttttgt
+ tgtttcaaaagatagaattgcaatttatgtacgagtctcaaagttaacttgagaaattca
+ tttggaatcacggtttctgcaccgttaatatagtgaccggtgttttttaggtgatcccag
+ tatttcaagagtagatcgggtgctttgaaattttgtttactggaatcaatcaactttaaa
+ ttcaacagaaaaaataaataacgaatggcttctattacagtattttcgaacgtcgggtca
+ aaatccacatcgaacggcaaattggagcaatattttgtcagtgtgtacttcaattgcaca
+ gaaagtcgaataaaagcgttaaaatcatcgatcatttgatcgatcatcatcattgttgga
+ tttttaagctttcgtgcatatgacaatctagtcaaaataacttctttgattaaatagacg
+ gggaacgccagtagtgatcttaacctttcataaggatattttatgttcatctttgtcaaa
+ gtttctctgtagactataaaatgtagtttcagtttgaacatccaaggtgatagcggatag
+ aaaatttttttttggaatatagactcgatgttcttttccttcatcaattgttctgcaaga
+ atcttcattttgcagttccattggggcacaccatcaaaatcctcttggcaattatccgaa
+ ctacttatcggtgagaaacggtttatccagtcgtccaactcttgcctcctcaacgttaag
+ ttcgttttaaagttcagccaagaattcattacgtctattctagcggtaaattctgcagta
+ ttggtgattggtttatccttatacatttttttcaagttaggccaataagagacaacgctg
+ tgatatctgttcatgaggtcttgtaagatggcctccagtgcatccaggctcaatatcttg
+ ccctgtggaatttgaaacttcattatctcttcaaaaacggaatctgaactatctcttaaa
+ tatgtaagcgactgttccatctcttgcatggtcctcccattcagccatgccttcaattcg
+ agccatatctcatctgagagctccttatttaaccctggtttcttgacttggttagcaatc
+ ctcgtcttttcacctttaacaatatctcccttcagcactctggccagcattgactgccat
+ ttgaaccgttcggtcgcaatttcattccaagctagttcatcactgtctgacctcattttc
+ ctgcaataaggtgtagtggagaaaaacttgacacctgagtgcaggtcttcatcaattctg
+ tcgccattagacaaccgaataatttcgtcatcaccgtcgtcttcctcgtagcggttagtg
+ agttttatgcccctcgtgtagtagtcatctaaagcggtcttttttagctttttcaagtat
+ agcgattcattgaaaacgtatcggtccttaaaagtcagctgactggggccgtcgcttatt
+ gtttccgagccacccaatgctttcaaatcagatcttgatagtttatggctggaatgactt
+ gataatctcttcacgagcttacttgagatggtatgaggagccagaactctcccgccttca
+ gccgctttttgttgctgtgtattcagtatatccatcatcattttcacctacaaggaacta
+ ccttttatagccaccctaagtaaaacaacattagcttagcatcctcaattcttatcgtat
+ gttgttgctgctatttttatcctattgttcctggcatcgcttttttacataaggtaccaa
+ ggcaagagaaaagacccgcgaaattttcaattcgagacatagggttaatacgaaatatgt
+ taaggtctagtttccaaaaaatgaagaaaatgtgattagacatcctggggaaattaggtt
+ taaatagggcgggcgctacaggggttttcctaacaaatttcaatgataatagtggcatca
+ tcatcgtcatatccagtgtaggtatggactagaacagaaagcaatttgcagcaagacaat
+ atgactacgacggtacccaagatattcgcgtttcacgagttttcagacgtggcagaggcc
+ gtagctgaccatgtagtccacgcgcaagacggtgcattggctccaaagaacgagaggaaa
+ cactctgttcccaacatcagcatgaatgcactggatatgacgagagaggcctcttgcaaa
+ agcacagcatctgccgcggaagggaaaagtggtagcagtggtagtggcagtggtagcagt
+ aagcccaaaaaggagaaacggttcaagattgctctctccggtgggtcattgatcgaagtg
+ ctacacgaaggtctgctaaaacgagacgatgtacggtggggagactgggacatttacttt
+ gcagacgagagacttgtacccttcagctcgaatgaaagcaattatggatgcgccaaaagg
+ aagattttggacctgatagacacggcgaagtatggaactccgaaggtgtaccacattgac
+ gagtcattgattgacgacccgcaagaatgcgttgataactatgaaaaggtgctaatccgc
+ gggtttgccggtagagattccgtcaaacttccgatgttcgacttgttcctgcttggttgt
+ gcccccgatggtcatatcgcatcactcttccctaacttccaggacaatctacgtgagaaa
+ cttgcatgggtggtgcccgtggagaacgctcctagtgggccctcgaccagaatttcgctg
+ actatacctgtaatctgccattctcacagggttactttcgttgtcgaaggtgcaaccaag
+ gcgcccatcatcaagaccattatggaaaggcctgaaaagggcctacctagcagtattgtc
+ aacgaaggtgctgctggtcgtgtatcatggtttgttgacgacgatgctcttacggacgtc
+ ctcgtcaccaaaaaaaagtataaattccaccaaggtttgtctatttaaaccatgcatgtg
+ tgtacatacatacatacacacatacatacatacatacatatatttatgtatatttatgta
+ tatatatatatatatatatgcgtaattatgcagatttctccttgtgtgctacagcgggta
+ ctctgacgccagatcatggcctcgagatcgatatacaaaccactgcatgataaagatgaa
+ gttgaatattaaagttaccattgacagtcccacttttccaaaatttgtcacgaacgtatc
+ cagactgaaaccttgatcgttagacaactgccaaatcaattggagcagcgaggcgatacc
+ gccagtgacatctagaaacacaccctgaatggggaaacaatccatagattttcttgtcga
+ gttatgcgtcacttgtgggatgtacttgattagtgacatcgatattttgagtaaaaacaa
+ attgttgcaatacgccagagttcttgagttatgccaatcgtaccagtagttggagtacat
+ aaattgtacggttagcagtgaaaaaatggccagtgatgcgagtaaaatccttaggtacca
+ tggattcatcttgcggtgacctttacctggaaatcgccagattctcgctccagctaccac
+ ctgggtcaataagacaacattcatcaagcacccatgcaggcaataccagaaatcaaattg
+ cgtcaacttgggcctgcccaagtcagactcatcacccgtcattttccagcagtacaattg
+ caaaaatatggatatgaccaggtaagagtaacctgccgtatttaacatgacaaaatccat
+ cgatatcgcgctcgctgacttatggcgccaattggtgattatcggtggatacatcgatat
+ cgaccatgacgtaacatacacgatacctagtatatcgtctaacgacaccattttgcttcg
+ cgtttgcttttgtttgcttttagtttttcttcccttgtttacatatggaaactcacctta
+ aaatttcaagattatcaaaaaatataaaggattccgctattaaagggcaataccacaacc
+ tcaaaccaaatcacctaaaccaaatcaaagatggaagcggaaaaacaatctgatattaag
+ ggaaccattgctttcgatactcacggtaacgttatagagtctaccggcgtgggcagtcaa
+ aggatcgaagatatcggtgatttatcgaaagttacgctggatgcagaaggttttgctcaa
+ gttcaaggagattcgctactcgtacacctgtacaaacgcaatgatatcactttagctgta
+ tataccagcgctcaataaattctttccttccctttcgacatagctatgtctgtatactat
+ ctaaaaactaaatatcacaataaacaaagtactaaaataaacacccatttaaataaagcc
+ agatcctgaccctccaaatccgccggatcctgaacctccaaatccgccagatccaggacc
+ aaatggttcgtcgtacttggatccaggcatccatcctggcggcatatcttcccgtcttgg
+ ggcaggtcttctatttggatcgaaaatcataccgccttggctacttggattccccagagg
+ attgggaagccctccagaccaatccggatggcttgtgccaacaggatacagatcaccatc
+ cgcataccctcctgctgggttagcgaagctcggttcagttaccgtaagtccgggatatgg
+ gttcaacggtgctgttccctggttttgtcttgcttgtgcctggtactcatcgtcaaattt
+ tggcatgtccgcactacctgcagtttgtcttttaagttgtgtcattataggtttcaggtc
+ tggctccacaggtaagctttctctttctagtggccatttcaattctgtgggcaatttaaa
+ atcgctgtaacccaccacacatgccatggtgatcgacgtaccatgacgaataacaacagt
+ atacttgtcattgtcatccagagtctgaatcaccactagctcagagagaaacgatgttga
+ acccttctcactgggagaatctttaatgcgaaaactcacttggcatgatttggttccctc
+ tgtatggcgtgacaccacctcagttttaccgatggactccaacactagttctgctaccag
+ ttcaaccttattctcaatcattgtttaactcttgcgttactatggtggtattattgatgc
+ taatacgggttttttatgaaaaaaaaaggataacaaacttatataaacttatataaaaac
+ atatacaataaatgatctacaaagggtaggaaataaaaaataagggagaaaaaaaaatac
+ atgcgtaagtacattaaaattacaggaaaaatcttactttagttctgatatttcaccatc
+ gcgatacaccaaccccataacgtttaggaacaaattcaaatcttggtatagtttctcttt
+ attataaataatagattttaactctgtgtctaattcttgagacctctgaggtaatacact
+ caattctttctctttcttttgtaaggtctccgcctcgttaatagtggcaaagatttcatc
+ acgtacttcatcaataatgattctctggtggtttagctttccacttagtgctaaactagc
+ taggaactgccaaagtatgcttcatccataaacttgtcattttgtaaacgcatgatgtgg
+ acgttatattcccttggagggaaaatcagctgaatcttactttccaatgaagtgaataat
+ ttatcataaatctcattccatgtggagatttcaggagaggaaagaatattagatcttgat
+ gaatcttgcttgattaatgcggacgagaaatcaaaatggtgatcaaatttagaccaattt
+ tggaggtggtcaagaacgaaacgttgttgtttctgattaactgtagcaacagacccataa
+ tttcgataaaattggagttattagacaaaaacgagacaatgatctttaatatgatcattt
+ ggaacagatcgactttcttcaattgtgtcaaagttggttttggtgtagtcttgtaggaac
+ tcaatatgatgatttgcaagtgtgataattcattgaagattttttgcaggattttcaatt
+ tttgctgcttatccaagaaattaaaaattctaggcataatcttgatacctttatcaaatg
+ atagcatcgaaatgaaagggtttacgtcatatgaactgtcgtcgatgtgaagggcctccc
+ acattgcagtgctgtcagtttgctggccattcctcaagttagcttccaagtctaaaatct
+ cttcataaactgtttcaattaactcaagaatgaattttttgcccccagatttgttcaaat
+ ttgtggcaccattaccgttgttgaacgcgtatgaagatcttcttctttgcttagaagcaa
+ cagcagcggcagaagagcccgtagaggaggaccctgaagaggccaagtccacgttggtca
+ gagagtcagtgacgtcttttagcgcatcggggtcagaagaaggctgttgtctggggatct
+ gcagttgtcttcttggattcttgctgttcaaagtggaagagattttgcccagaaccgtgt
+ ttgcttgcttattatcctggctagagttaccagcagccgcttgttgatccttcaacttag
+ aaggtctttccttagccacagtgacagccttttctacttgactttgcattctctgtaggg
+ caatatcggttctcttatagcgaccaccgagtctgtgtccagaatgttccaaatacgccc
+ tagcaatcaaacctttgttggattcggacgtgataccgcctctttggataatcttgtaga
+ cctggaagtagaaatcctcattgtaagggtcctcagtgacaatttgagacaactgatatc
+ tggtgatgaagtccttatcacgaggagtcattaaacctgagtatttcaagattttctcca
+ ctttggcatgacgaatctgcaaacgtctttgctcttcaggcgacaaatctctcttcgatc
+ tgcgtccactttgcagaggcattttagttgcgccaaccacagggcctggagaaggagtgt
+ tggacataccgatgggtgaaggcgaagcttgaactggaacttgtacaggacccattggca
+ tttgattcatattcacattaacgccaggaggtggaggtggaggcgcaaattgtgagggac
+ ctggttgacccgacgcaggctgcattggaaattgctgctgaggcggaccctgtgggagac
+ cctgtggatgacccatggcgtggaaattcataaactgttgctgcatttgacgttgttgtc
+ tttccaagtcttgcatcgacaagattggctgtaggggggccatttgcgcggagcaggagc
+ cggagccattgtactttggggtgaaggcgccattgctggaggtggtgcagtagaccacaa
+ agattccattggcttcaggtctagaggtcctgctgctgccgttccatcggtcctagggcc
+ gctaattccttctgcagtgtgctcccaaggcttgaaatcagggtatacttgttcctgagg
+ tttcgagcttggcacataggggtgaagatcatctgggggagtcagcaagtatcgaccaga
+ accgttattcatcgcgttattggcttacctctatagacctttttttaatctttcgattag
+ catcaaaaatctcagcattttttcagaaaggtgtttaaaatgcgctataaagaatataaa
+ aagtaatatatatgtatacatactaataaaaggaagaaaaaaaatacaaaagatgattat
+ ctaggacttgtttcacccaggggggtgtttagtttaattttgtttcgttagattgtgcat
+ tcattagtttggtttctctttggggccagttctgtcaacattgggccagttgcccaattg
+ gaaagcctaaccaccacgcaagtggcgttgtctgagtgtttagtggctattgcctgaata
+ aatttgattgtctcctctgcaacaaattgaggagtcgcctttttcagtccccaggagttc
+ accgtagaggtgatgaggtcaacaacctcttggtctgcaagtttattggtgatgccgtcg
+ gtgacaagcgccaggaagcattcatcaccattgaagggcagcttcgacctttcagatcgc
+ ggcaaatgcaaggtatttccgacaagaaacgagaaaatatccggctcactagatatacca
+ tacggtttcccagcgacatcaccaaacgatctagtgttggcgaaattattcaaaaacctt
+ gtctctccaaaggcgtctgggtccaatcgagaagggtctatgctcaaacggtgtcgttcc
+ ctactggaattgatgtggtgagttgttgttaatgcgtgtgcaatcccgtcttgatcgcag
+ aggatgatcttagagtcgcccacctgcgtgacaatgagtttgagcaatccagagtgcgat
+ atgatccacgagtcatcgtctttcccctcatcaatgggtgcattcggatcatcatacgga
+ tatagaaatatggacgacgctgtggaccccgctgcggtgcccaggccgcagcattggtcg
+ agatcgaacctggcaaacccgtagaacaatcgcagacggtctagctcagtgagcagtgca
+ attttgtcaataatattccccattttatcaaaaatcatcctgccaccattttgtccaagg
+ atcgcactatcgccctccttcatcagcagcagttcttgtttcgagttgcagttctttata
+ aacttgttgtagaatttttcaaacgtggacagcttatgtttccagtaatttccctcggga
+ aaccttcttgcatattcctgtaccaacttcttcaagtcctgcttgtcccaggagggttca
+ gggaaatccaggctactgcacaatttatcggctaacaactgtgacagttgcgcacctttc
+ tccccatggccatcgaatacagagcagttcagggcacttccatacgcattaggccacgac
+ cgcatcattttaatggagtaattgtcctggttctctaaccggttgatcctactcgtggcg
+ tgacccacgtaccctggaagaccccgtaacaagtccagcttcagcccaacattttgcgac
+ ggcttgcaatacgcataagcattccccagccgcatcaaattcccctcttctgatgcttaa
+ ttgccccgattgcagcctgttattgatactattgttatcacttccctatttgttcttctc
+ gcattacacgaaattttttttttcctcttcctctttttaaagaaagaaacaaggtgaatg
+ aaaagaaacatgtacaccttgaaggaagcaaaggttttaaccggaagtaagagcagcaag
+ aagcactagcaatgtccttctttgggttagaaaatagcggtaatgcgcgggatggtcctc
+ tggactttgaagagagttacaagggctatggcgagcacgaacttgaggagaacgactatt
+ tgaacgacgaaacatttggtgataatgttcaggttggtaccgactttgattttggaaatc
+ ctcacagcagcggcagcagcggcaacgcaattggtggtaatggcgtcggtgccacggcta
+ gatcatatgttgcagctactgcaggcagagatcaaatcctagcaaactttgtggctaagg
+ gcttttaccatacaccaatggtaaatttcgagtccatatctgcgagatcatctgttcatg
+ aatcattagtcactcaatccaacattctttcccagcaattcgacaaaattatcaagatta
+ gagaagaccacattaataagatcccctcaaattccacgacgacattacacgggcctggtt
+ ttcagttgcctaatagaataacccttactgatcatagaaaggaaacgtggttgcatgaat
+ tgagattcgtctcacacttcgactggtcaaaattggcaagtttatacctcacggcttgaa
+ aagaaggcaagtcatcgagcagtgctatttaaaatttataccattgaaaagggcgatttg
+ gttgataaagtgctgctattttatcgaatggaaatcgaaccagaaaaagaagaggtcaaa
+ tgctgctggggcagatgatgccatttccatgcacctgctaaaggactggacggatacctt
+ tgtatacatcctggaaaagctcatctttgatatgacaaatcactataacgattctcaaca
+ actgcgtacgtggaagaggcagatttcttattttttaaaacttttggggaattgctactc
+ actaagattgatcaataaggaaatctttcatcattggcttgtagagtttataaataagat
+ ggaaaacttcgaatttttgccattatctttacatattttgatgattttttggaacgacat
+ ctgccaaattgatacaaatgctcctgttgcggctacaataacatcaagtcaaaaagagcc
+ cttctttctggtaacaaaaatcactgatatgctattgcacaaatattatattgtttccag
+ cagcaaatcaatgataaatgacgagaactacatcatcaatgatataaagaaaaacaacaa
+ gataaagttgaatattctcaaaatattatccagtttaattttgaaaatttttcaagaaca
+ atctttagaggtgtttatatttcccacatctaactgggaaatttacaagcccttactttt
+ tgaaatagtctcaaacgccgacactaatcaaaattctgatatgaagaaaaaattagagtt
+ aattagttacagaaacgagtcattgaagaataattcttctatacgaaacgtaataatgtc
+ tgccagcaacgcaaatgactttcaattaactatcgtcacctgtaaacaatttccaaaact
+ atcatgcattcaattaaattgtatagatactcagttcaccaagctactggacgataaccc
+ tacagaattcgattggcccacttacgttgaccaaaatccccttacaatgcataaaattat
+ tcaattaattctctggtccatacatccatcaaggcaatttgatcactatgaatctaatca
+ actggtagcgaaattattactattgcgaataaattcaacagatgaggatttgcacgaatt
+ ccagatagaagatgccatttggtcattggttttccaattagccaaaaatttttcggccca
+ aaagagggtggtatcatatatgatgccttctttgtatcgcctgcttaatatactaattac
+ ttatggcatcattaaggtccctacgtatatcagaaagctaatcagttccggcctacttta
+ tctccaagattccaatgataagtttgtgcatgtccagctgttaattaacttgaaaatttc
+ accgttgatgaaaagtcaatacaatatggtattgaggaacgttatggaatatgacgttaa
+ attttatgaaatttttaatttcgaccaactcgtggaaatcacagaacaaatcaaaatgcg
+ aatactctccaatgatataactaatttgcaactgtcgaaaactcctctgagcattaaaat
+ catggttgcagaatggtacttatcacatttatgttccggtattttatctagtgttaaccg
+ cacagtgttgctaaaaatattcaagattttttgtatcgatctggaggttttccaccactt
+ ttttaagtggatcgagtttattgtctaccatcaattgctaagtgatatagaatctctgga
+ ggcattgatggacatcttgctatgctaccaaaaattgttctcacaattcattaatgacca
+ tattctttttacgaagacgttcatattcatttacaagaaagttttgaaagaaaaagacgt
+ gcctgcttataatgtgacttcatttatgccattctggaaattttttatgaaaaacttccc
+ ttttgttttaaaggtggataacgatttaaggattgagttacaatctgtttacaatgatga
+ gaaattgaaaactgagaagctgaagaatgataaatcagaagtcttgaaggtgtattccat
+ gatcaataattcaaaccaagctgttggacagacttggaattttcccgaggtgtttcaagt
+ aaacatcaggtttctactacacaactccgagatcattgatacaaatacaagcaaacagtt
+ ccagaaagcacgaaacaatgtcatgcttttgattgccactaacttgaaggagtacaataa
+ atttatgtccattttcttgaaaaggaaagactttactaacaaaaatttaattcaattgat
+ ctctctaaaacttctaacttttgaagtgacgcagaatgtgttggggctcgagtatattat
+ tcgattattaccaataaacttggaaaataatgacggctcatatggtctgtttttgaagta
+ tcataaagaacaattcataaagtcaaattttgagaaaattttacttacatgttatgaatt
+ agaaaaaaaatatcatggcaacgaatgtgaaataaattattatgagatcctattgaaaat
+ tttaataacttatgggtcatctcccaaattacttgcaacatctacaaaaatcattatgtt
+ gttattgaatgatagcgtggaaaactcatctaatattttggaggatattttgtactactc
+ aacttgtccgtcggaaaccgatcttaacgatattccattgggtagtggacaaccagacaa
+ tgacactgttgtaaccaacgatgataaaagtgacgatgatgatcacacagtcgacgaaat
+ tgatcatgtagaatattacgttatgatggactttgccaatctttgggttttccaagcgtt
+ tacctgtttctgcatcaaaaaaatcatggagaataatgagccagcaatggcaatggaaga
+ cttgaagaacttcatattccaaattatcgaaataactaattctaatgatttatgttcaca
+ aatatttgaccaactgaaggatatgcagaccattgagatgataacccaaatagtggagaa
+ agatttctgcacttcttgtttgcaaaacaacaaccaaaagatagatgataattacatcgt
+ tgtggtgatcgagattataacgtcattatcgatgaggtttcaaagagaaacttctggtat
+ gatagttatttccatggagaactatcatttactaataaagatcataagacaattaagtga
+ actgaacgaaggaaatttatctaagagagaaatccaaatagatgccgtcttgaaaatttt
+ tagctttcatcaggattccattttccaacgcatcatcgctgatttatcagctgataaacc
+ cacaagtccattcattgatagcatatgcaagctgtttgataaaatatcatttaatttaag
+ attgaagctgttcttgtacgaaattttgtcttcattgaaatcattcgccatctattcatc
+ cacaattgatgccccagcattccacacaagcggtaaggtcgaactaccgaagaaattgct
+ gaacttaccaccattccaagtgtcctctttcgttaaggaaacaaaacttcatagtggcga
+ ctacggggaagaagaagatgcagaccaagaagaatcgtttagtttaaatttaggaatcgg
+ catagttgaaatagcgcacgaaaacgaacagaaatggctcatttatgacaagaaagatca
+ taaatatgtctgcacattttccatggagccgtaccacttcatctccaactataataccaa
+ gtacacagatgacatggctacaggcagtaatgatacgactgcgtttaacgattcctgtgt
+ aaacctgagtctttttgatgctcggtttgagaggaaaaatccacattgatctcagaatat
+ atccaaatggataaattataaatttaccaataacagtaattatgtgtcagttttaatacc
+ caaccaattgattcctctaaatgataccgtaccaaatgacatgtttgccaccgattataa
+ aactggggtttatgagcgatggttaaggaggattaagacaaagtgctcatccgcttacaa
+ gatcagtactagcgtgtaccagtctatttaaactggaatgatcaccccaaagggaacgca
+ cgatgctgtggccaagtttcaaaaaactgacctgcatcaagatctcgattacatcgtact
+ gcaacaacgaagaacacagctagagacgcttattaacgaaagagaatcttttgtcaagaa
+ tctgtgttctcttttccacaaaattcaaaataccaagaattaccaggaatttgttgatgt
+ attggcggagaatagggatttactgcgagaaattttcactgtagagaacggatttcaaaa
+ gcaaaaatggattagcaacgacgacattccccagatagactgggacaagtttgccctaga
+ tatcaacgcttatatagcagagaacgatcaattgttggctttgtatgaagatggcttatt
+ atgatcaaatagcggctatggacatttttaaacatatatttatatatatatacatatatg
+ cgtatattgcataaatcacaaagaagaacaacgccctagatatagtgacccaaaatatta
+ tgtttaagttactggttggggttcatgtacatttttcactatcttttaattccaaatgtg
+ gattgtgcttctgcagattttgttgcctagtattctgtgatggaaaaaattgccccgatg
+ agatataaaatgagctagccccctaaccaaagaaagcaggattggttagtacatagagaa
+ accaaagctgttatgttgttctataagcctgtgatgaggatggcggtgagaccgctaaaa
+ agcataagattccagtcctcatacaccagtattactaaattgacgaacctaacagaattt
+ aggaatttgatcaagcaaaatgataaactagtcatcgatttttatgctacttggtgtggc
+ ccctgtaagatgatgcaaccacacttaacgaaattaattcaggcttatccagatgtaaga
+ tttgtcaagtgcgacgtggacgaatcaccagatattgccaaagagtgtgaagtgacggct
+ atgcccacctttgttcttggcaaggatggccaactcatcggcaagatcattggagctaac
+ cctactgctttagagaagggaatcaaagatctataaattttatattcatgagcacatgta
+ gtttttttttttttagttttaaatataagaaaaaagatagtatatatatatatatatata
+ catatacatataaaaggaaagtaagcttatacaatacacaaataaattcatatcaaagaa
+ taatgaaccgcaatattcagaaacacaggaaaaggaggggaagggatgaatggtgaggaa
+ agtaactgtttgttttaaagcgtacctggatcataacataacaaataagtagataaaaaa
+ gaatagagagaaagatataatgacaatgaattgaagaatagtttagttagttacatttgt
+ aaagtgttccttttgtgttctgttcttaatttggcgctatttttttatacttccaaatcc
+ ttgctttacaatcaccgctaccagtagcaaaaacgttatattctggacccagcggagacc
+ cgtttgccacagccacagatataactgaattcctatgaccttgcaacatcaataacggat
+ tgccggatttcttatcccaaaacaggacaccacgatctttggaaccggacaagatgtact
+ catcattttgtgtggtggccacggacaatacaaagtctttatgcccgatatacgtaactt
+ cacaagtgccggaatttggagttttcgaatcgctcttgttgtttgcattctgcaaattcc
+ agagcttaacagatctatctaatgagccggatacaacgctttgtccatctctagtgaaga
+ caacgctataaacagagtccttgtggcctgtaccggattcgttttccgaatctagtcttt
+ ccaccaagaatccggtctcggaatcccaaactctcacagcacgatctagagaaccagcag
+ cgatgtatttaccatcacctggtgatacagcgacggtggtaacaccatcttcaatggata
+ aagtcaatgaacactggcctgtacgtaagtcccaaatacgaacggtacggtcaccagaac
+ cggagactaatttgtcacctgagggaaagtagtccaatgaataaatatcttgttcgtggc
+ cttgaagaatcataacaatctttctattttcaatatcccaaattctaatcagtctgtctt
+ cagcacctgttgccaaaaatttcccatctggagaaaaacacactgaacggatatacaagt
+ cggatgatggggaagacgaagtgttcaggttttccacatcttttgccaattctgctgccg
+ aagtcatcgctgtggtagtaattgtggtggtagtagtggttgtcattgtattgttatccg
+ tggacgtggtggtgttattttcagtgatcgaatttcgatggttattggcagcagaatcgt
+ cagatagacgggccaccagagaaccatctgaaacgcgatacacttgagtagttttgttgc
+ agcctgtggctaagtattcaccatcgttactgaacttcacgcaacaaacaactgaagtat
+ gatccaaagatttgtgtaactcaacgtcaatttctcttggtagtgccgggttgtataaaa
+ tataataatcatttgtttgcttcttcagagcatcgggaacagactgggaatctagatcca
+ aaaggaaaggtgggataggtttagagtggtttgctctttgattataagggactaagtagt
+ gatcctggtgcaaactagccggggtggcgtcttcctccttaggtttgatttcagtttcag
+ ttgcggtggtagtggtggccgtggtggcggatccggtgtcatttatcttcgaggtattat
+ tattttccggttcagtttctttcaacgtagattcaggtgccttgacagagggtaaagtag
+ tggtctctttagaagtaggagatccgttgatggctgtgttactcaatggtgccacggaaa
+ cctggggaggtggttgctgttgttgaagttgctgctgttgcagttgctgttgtggtagtt
+ gttgttgggcgtttgaggacaccacaggcaaagtggtggtaggcaactgtgagccaacca
+ gattaggacggctagcttgtactgggaaggccgatggcgaaccagtagttgtgtttgctg
+ ctggagtggcggtggccgaagtagtagccggtggttgttgcgcaactggaacagatgcag
+ atgcagcggctagctgctgctgttgctgttgtaaatgctgctggacctgttgctgctgtt
+ gctgttgccgctgttgctggacggtcaaagatgcaatttgatggtctctttgctccagcc
+ ctagtttcaagtgcttgatctcttcttcgtacgcgtccttcattttcctgtgagttagtt
+ ccagttcgtagacggtgtttcttatctgctgcatctcagccagctgctggttcattttga
+ aatcgtaatccttttggttttgaagacggtaggtatttgcctcttgtgagacttggagaa
+ actcctgtctgatggcatcgagaagctcattcagcttattctgcgtattcgaaacgctgg
+ cagtcatattggtttggatggaaagctgatttctttcttcccctgcttatcaaaaaagac
+ aaaaaaacaatagaaaaagagtagtggtgtgcgaaaggtaacttggcgtgaaagaaaatt
+ aaacaattgcagaaggacagatagaggggcacgtgttcagccagttgttcttgcttcgct
+ gatgtttatcaattgggggtagatgggggaatttctctctatttgtttacttatgtactt
+ gttattttgttttgttgtttgttgctttgacgggagagctttttaattttttcgcagttt
+ ttcagtgttttttgaaaagtacgaaaaattaaggtccgtcggtagcgataatgtaagagg
+ gttctgtctctaaaaatgttagggtgaaagttccaaaaaattattaactcctagattgct
+ cagccaaaaagaagggtaaattagggctttcctggtttatttatttgtttatgttgttct
+ tatatacatatgtttatgtctttttatgtacaaataatgatgtttacaatgaaacatctg
+ ttatgctatcaaaaacgtcagcgcattgttttatagctgaagaagttacaacggataacg
+ gtctgatttgcggcctggcaatgcttggtaaaactaaatatcaattttacgaattattta
+ ctgtatatagtattcagtccctcactcaactggcgtcaagagtgaagaagggcggcctca
+ ttatggctcgccttattctgttcaccctctgcgctcttcccgtattatttcattttattt
+ tgtttatgcttcaatatcttgtatttgtttacattgaaaaatgattggaaaaaggtcaca
+ ttgaaatcaaaacagtgaagaagcagacagacaagaaggtcattagatagataaacagga
+ acaataaaaaaaagaaagcgaaacaagacaatccaacgtgcgcaaaatggatccattgac
+ tgtatacaaaaactcagtgaaacaacagatcgattccgcagacttactagtggcaaattt
+ ggtaaatgagaatttcgtactgtcggagaagctggatacaaaagcaaccgagatcaaaca
+ gctgcagaagcagatagactcgctaaacgcacaagtcaaggagctgaagactcagacttc
+ ccaacaggcagaaaattcagaagtcataaaggacctttacgagtacctctgtaacgtacg
+ tgtacataaaagctacgaggatgattctgggttgtggttcgacatctcgcagggcaccca
+ ctcagggggatcttccgatgattattcgataatggactataaactcggatttgtcaaggg
+ ccaggcccaagtcacagaagtcatatatgcgcccgttcttaaacagcgatccaccgagga
+ actatactcgctacagtcgaaactaccggaatacctcttcgaaacgttgagtttccccct
+ ctcgtcgctaaaccaattctataacaaaatcgctaaaagcctgaataagaaaagagagaa
+ aaaagatgaaaccgagtaagctgctacataatgtctatatatctacacataaaattccga
+ ttattcctttgcatacctgatttgcccctcagaatccacaaccagacttttcaagaaggt
+ cttttttgcccctttatccttcatggttttcaaaattttgtacaacgacttgcccttgtg
+ aagctcgataccgttcgaggtcttttccactttctctgccttcttgatcacaggcttctc
+ cactttctttgcaggagttgccactggctggtgctggtgctgcttctgctgctgcttctg
+ ctgctgcttctgcttctgcttcttgttgcccttgtacaacgctttctggtacttctcgtc
+ ctcgctgatgcaagacgtgtgattcttgtaactcacgccatcttcaaacgtcttggagca
+ atctatgcatgtatagtacgcgttaggacatctataataatgcttttcggtattcttctt
+ gggcacagtatcattacacacctcacagttgaacgtaaccatcctgtaataacaaatatt
+ tcttgactgagaccgtttgctgttgtatacagaatactcttagagctcatcgcaagttaa
+ aaattttcaattttttttcactttttcccgtcaaggcaaaaagcaaccaaaagagaatga
+ acctttatttttgatttatttattatgagatgctgctagtccactcatctgcatcaatgt
+ agtagtcacaaagctaatatttagacgttactttgatatctctgtccatgagagcactta
+ ttttaggaagttaaatgagacagtcaatagttcacaatatcccgtcagcaatggagggaa
+ aaggcattccttttccataggattttaatcgttttcaagcatcatacgccctcgaggaac
+ tcttgttttcgctttactatgcaaccattgatgtatttctgtataaatgtgcgtcacgtg
+ gccttgtgtctcttatttccacttgttttttcacaatgcggaaaacctcgattaaagtag
+ aaaaaaaggatataataggagtataccatattggatagttcaatctataaacaaacaatc
+ gcataaccgcacgtatatacacgcacacacctatcaatcacaatggctttggaacctatt
+ gattatactactcactcgagagagatcgacgcagagtacctgaagattgtcagaggctcc
+ gatcctgacaccacctggttgattatttcacccaatgcgaaaaaagaatacgaacctgag
+ tctaccggttcctcctttcacgatttcttgcaattgtttgatgaaaccaaggtccagtac
+ ggactggcacgtgtgtccccaccagggtcagacgttgagaagattattatcattggttgg
+ tgtcctgattctgcgccattgaagacaagggcctctttcgccgccaattttgctgcagtt
+ gctaataatctgttcaagggttaccacgttcaagttaccgccagagacgaggacgatctt
+ gacgaaaatgaactgttgatgaaaatcagtaacgcggccggtgcccgttattctattcag
+ acttcctccaagcaacaggggaaggcttccactcctcccgtgaagaaatccttcacacct
+ tccaagagccctgctccagtttctaagaaggaaccagtcaagactccttccccagcacct
+ gctgctaagatttcttcccgtgttaacgacaacaatgacgacgacgattggaatgagcct
+ gaattaaaggaacgcgacttcgatcaggctcccctgaaaccaaatcaatcatcttacaaa
+ ccaattggcaaaatcgacttgcaaaaagtgattgctgaagaaaaggctaaggaggaccca
+ cgtcttgttcaaaagccaaccgctgctggttccaagattgatcctagttctgatatcgct
+ aatttaaagaacgaatcaaaattaaagagggactccgagtttaactcctttttgggcacc
+ actaaacccccctccatgacggaatcttcattaaagaatgatgatgataaagtcattaag
+ ggttttagaaacgagaaatcacctgctcaattatgggccgaaagaaaggcaaagcaaaac
+ agcggcaacgccgaaactaaggctgaggcaccaaaacctgaagttccagaagatgagcct
+ gaaggtgaacctgacgtcaaagatttgaaatcaaaatttgaaggattggccgcttcagaa
+ aaagaggaggaagaaatggaaaacaaatttgctcctcctccaaagaaatcagaaccaact
+ attatctcaccaaaacccttctccaagccacaagaacctgtgaaagctgaagaagccgag
+ cagcctaagactgattacaagaagatcggcaacccattacccggtatgcacattgaagcg
+ gataatgaggaagaaccagaagagaatgatgatgactgggatgatgatgaagacgaggct
+ gctcaacctcctttgccttcgaggaatgttgcgtcaggagcaccagtgcaaaaagaagag
+ cctgaacaagaagagatcgccccaagcttaccttctagaaactcgatcccagctccaaaa
+ caagaagaagcacctgaacaagcacctgaagaagaaattgaagaagaagctgaggaagcc
+ gctccacagctgccatcaagaagctctgcagctcctcctccgcctccaagacgagcaact
+ ccagagaaaaagccaaaggaaaatccttgggccacagcagaatatgattacgatgctgca
+ gaagataacgaactgacctttgtggaaaatgacaagattatcaatattgaatttgtcgac
+ gatgactggtggctaggggaactagagaaagacggctcaaaaggtctcttccccagcaat
+ tatgtgtctttgggcaactagaatcacacgtcagcgtcatgctattatattattcttacg
+ taaaaaaatacttacagagacgttaaagcttttgtatagatactttcacgagccttccct
+ ttcagtaaaatgaacaaattattttgccttgtccttctttgtttcaagcgtttatatgac
+ tcctctacagtattatttttcaacagacggtgtgcccgtacggcattgtcaaaacctcga
+ acataaaacaacactgaaacaaacttattgcactcctgcatcgcatacagatctgatact
+ tactacaaccataatgctgttttagaatcgcatcaggtgtttaccagaagtgtatggtac
+ atggtgctttccttatgctcactatttcggatttacttctagtctatgtatagatttgtt
+ agagaccagttttaagctacaatggaacgttaagattaagaaatgcatgaagagtctccc
+ agtagatatataaagctgaattaaattcagtacaatggaagtttccacgtctttctgtat
+ tttcatacctttaattataggaaaagcacactacaaaaatacgctatcatctaccgaata
+ taactgcaatgaactcatttgcgtcattaggtctgatatattcagtagtaaaccttttaa
+ ctagagtagaggctcaaattgtgttctaccagaatagtagtacttcactgcctgtcccta
+ ctttagtatccacctcaatagcagattttcacgagtcctcatcaactggcgaagtgcagt
+ attcatcctcttattcgtatgtgcagccctcaatagactccttcacttcatctagcttct
+ taacaagttttgaagctcctaccgaaacttcttccagctatgcagtttcttcctcattga
+ taacttctgatactttttcttcatactctgatatcttcgatgaagaaacaagttcattaa
+ tatcaacctcagctgcctcatcggagaaagcctcgtccaccctttcttcaactgcacaac
+ ctcataggacatctcactcttcctcttcattcgagctaccagtcactgctccatcatcct
+ ctagtttaccgtcctcaacttcattgacatttacgtcagttaatccatctcaaagttgga
+ cttcatttaactcagaaaaatctagcgctctttcctcaaccatagattttacttcttctg
+ agatttcaggttcaacatctccaaagagcctggaaagtttcgataccaccggtactataa
+ cttcatcttattctccttctccttcttcaaaaaattctaaccagacctcactactcagcc
+ cattggagcctctgtccagttcttcaggagatttaatattgagttcaactattcaagcta
+ ctaccaatgaccaaacttcaaaaactattccaactcttgttgacgccacatcgtcattac
+ caccaacattgaggtcatccagtatggcaccaacaagtggttctgattcaatctcacaca
+ actttacgagccccccctctaaaacaagtggtaactacgatgttttgacttcaaactcaa
+ tagatccttctctatttacaactacgagtgaatattcatctacacaattgtcgagtttaa
+ atcgggcctcaaaaagtgaaacagttaatttcactgcttctattgcttccacaccatttg
+ gtacagattcggctacttctctaatagaccccattagttcagtgggttctacagcatcta
+ gctttgtgggaatttcaaccgccaattttagtacacaagggaactcgaactatgttcctg
+ aatcaactgcaagtggaagttcacaataccaggactggtcaagctcttctcttccgctgt
+ cacaaaccacttgggttgtcatcaacacaactaatacacaagggtctgtaacgtcaacca
+ catccccggcttatgtttctacggccaccaaaacggttgacggggtgatcaccgaatatg
+ ttacatggtgtcctctaacacaaaccaaatcacaagcaattggggtcagttcgtccattt
+ ctagcgttccacaagcctcctcatttagcggtagttctattttgagctccaattccagca
+ ctcttgctgcctcgaacaacgttcctgaatcaactgcaagcggaagttcacaataccagg
+ actggtcaagctcttctcttccgctgtcacaaaccacttgggttgtcatcaacacaacta
+ atacacaagggtctgtaacgtcaaccacatccccggcttatgtttctacggccaccaaaa
+ cggttgacggggtgatcaccgaatatgttacatggtgtcctctaacacaaaccaaatcac
+ aagcaattgggatcagttcatccacgattagcgccacacaaacctctaaaccatcttcaa
+ tattaacattggggatatcgaccttacaattgtctgatgccacatttaaggggactgaaa
+ ctataaacacccatctcatgaccgaaagtacttcaatcacagagcccacctattttagtg
+ gcacatcggatagtttttatttgtgcaccagtgaagttaatcttgcgtcttccttatctt
+ cttatccaaatttttcatcttcagaaggctctacggcgaccattactaactctaccgtta
+ catttggatcgaccagcaagtatccatctactagtgtatctaacccaacagaagccagcc
+ aacatgtgagctctagtgtgaactcactcactgattttacttcaaattcaaccgaaacca
+ tcgcagttatatctaatattcacaaaacttcgtcaaataaagactattcattgacgacta
+ cgcaattaaagaccagcggaaagcaaacgcttgtgctttctactgtcacaacaacggtga
+ acggtgctgctacggaatacacaacgtggtgcccggcatcaagtattgcttatacgacat
+ ccatatcatataaaacattagttttgaccactgaagtctgctctcattctgagtgtactc
+ caacggttattaccagtgttactgcaacaagctctacaatcccccttttatcaacctcta
+ gctctacggtattatcttctacagtatccgaaggtgcaaaaaatcccgctgcttctgaag
+ taactattaatacccaagtttctgctacttccgaagctactagtactagcactcaagtgt
+ ctgctacttctgcgacggccactgctagcgagagttcaaccacatcccaggtttctactg
+ cttccgaaactattagcactctcggtactcaaaactttaccactactggaagcttacttt
+ tcccggctttgtctactgaaatgataaatactactgtggtttcccgaaaaaccctaatta
+ ttagtacagaggtatgttcccattccaaatgtgtcccaacagtcattaccgaggttgtta
+ cttcgaaaggcacgccttctaatggacattcttctcaaactctacaaacggaggcagtag
+ aggtgacattgtcatcccatcaaaccgtaactatgagtaccgaagtatgttctaattcga
+ tttgcacaccgactgttattacatctgtgcaaatgagaagtactccttttccatacttaa
+ cttcttcaacgtcaagttcctctttagcctccaccaaaaaaagttccttagaagcctcct
+ cagaaatgtccaccttttctgtcagtacgcaaagtttgcctttggcattcacttgttcag
+ aaaaacgctccaccacatctgtctctcaatggtcaaataccgttttaactaatacaataa
+ tgtcctcttcttctaatgtcatatcaacaaatgaaaagcccagtagtactacctctccat
+ acaacttctcttcggggtactctttaccttcttcttctacaccttcccaatattcactat
+ ctacagctactacaacaatcaacggaatcaaaactgtgtacacaacttggtgtccattgg
+ cagaaaaatctactgtagctgcttcttctcaatcttcccgcagtgttgacaggtttgttt
+ cgtcgtcaaaaccatcctcatctttatctcagacctctattcaatatacattatctactg
+ ctaccaccaccataagtggtttgaagactgtatacacgacttggtgtccattaacaagta
+ aatcgactttaggtgctactactcaaacttcctcgacagccaaagttagaattacttccg
+ cttcatctgcaacatctacttctatttctttgagcacttcaacagaatcagaatcttcat
+ ctggatatttgtcgaaaggagtatgctcaggtactgaatgtacgcaagatgtgccaacac
+ aatcatcctcacctgcttcaacgttagcatattccccctctgtttctacatcatcatcat
+ catcattctcaacaacaactgcatcaacactaacgtcaacacacacctctgtcccgttat
+ taccatcatctagctctatatcagcatcttcgccatcatcaacttcgttgttatccactt
+ ctttaccatctcccgcttttacgtcatcaacacttccaacagcaacagcagtatcttcct
+ ccactttcatagcgtcttctctaccattgtcctctaaatcatcattgtcgttatcgccag
+ tctcgtcgtctattttgatgtctcagttttcatcatcatcatcatcatcatcatcattgg
+ catcgttgccatctctttctatatcaccaactgttgacactgtttctgttctacaaccaa
+ ctacttccatcgcaacactaacttgcacagactcacaatgccaacaggaggtatccacta
+ tctgtaatggatccaactgtgacgatgtgacttcaactgccactactcctccatctacgg
+ ttactgatactatgacatgtactggatctgagtgccagaaaaccacatctagcagctgtg
+ atggttactcgtgtaaagtatccgaaacgtataaatcaagcgctacaatatctgcatgta
+ gtggagaaggatgccaagcttccgctacaagtgagctaaattctcaatacgtcacgatga
+ cgtctgtcattaccccaagtgccataacaacaacatcagtggaagtgcattcaactgaat
+ ccactatatcaattactacagtgaagccagttacatatacatccagtgatactaatggag
+ aactgataaccataacaagttccagccaaactgtaattccatcagtaacgacgataataa
+ cgagaacaaaagtggccataacttcagcaccaaagccaacaactacgacctatgtcgagc
+ aacgactttcctccagtgggattgctacttcttttgttgctgctgcatcctcaacttgga
+ ttactacacccattgtcagtacgtatgctggttcggcgtcaaaatttctctgtagtaagt
+ tctttatgataatggtaatggtgatcaacttcatttaatttcccgtatatattatatgta
+ cgtatatatgctataaacagacactctttttacttacctttaataatgcttgatgagaaa
+ tcagtcattgttacttcttccccaaggttgtcatcataatcataccattcattatcctct
+ agttggcatgacattactttccccgaacttagagaaacttcaaacgttatggtgtcggga
+ taaaatttgattaattcacaacctctgcaatcaaggcttaaaggaatgaaagaatcttct
+ ttgaccttcgataaaccatgcttcttccgcacatcttttatcttggcccatttgtcgtca
+ ctctgatcagttaaatattcatcctcaaaagcgcacaggttaactgacaattcttttgaa
+ caaaatttacacttcatcaagaaagatgcggtaccttttgatgccggcatagcatattct
+ tcaaatgtattgatgatgactttagaatcgtgtaattctctacaactggtacattgtaga
+ tcgaaggcaaattctgcccttgattcgtttgtattctcaattgaaaccttggtcacgttt
+ tctgataatgttgctttcagaaccaaaaataacggcattatcgtacacaacctggttttc
+ tttattggttaagttgtctttattcaaaatttttcattacttcttataaacagcttcttg
+ acgcttgtaatttcataagtttttgatacaagcaatggaaacaaaattaagaaaatgtag
+ gcaacgaagaactaaatttgaaaaaaaaaaaaaaaaaaatttactgccaagatcatctaa
+ attatcttatagtagcaaagcttgagcataaaaggatcaatcttttagttttttatggca
+ aagtacatttgcagtaagtaattagtgatgagataaacaaagaataataatccaattcga
+ aatcaaaaattcaaacaacaaaaaacataatttcacatatatatctaattcattcacgaa
+ aaacacgtcagttccttctgtctgttgtaaataggatgcatccgcagtgaatcatggaaa
+ tatttcaaaactttcagcccaatcattttatttggtcaatgaagccataaacgctccgta
+ attctaagattaaaaaaaattcataaaaataaggaccaatataaatatatatatttaatg
+ tgctagtgacaatcctagtgataaaattgcaaattatactacaattggtcatggtgaagc
+ ctaaacacaccattacgtcttcacagtgttattcattttattttcaaaggtatgctacta
+ ctaatgcaccagtgtataccatgatacggtgaaaagggcatgtaatattgcgttaatttg
+ tcgataacctttctttacgtttaaattttttttttataactggcgacactaaacttctcg
+ ataaaatacaacataaaaagcagtacctagctaagcgaacaacagataataaatagctaa
+ acatctaggttaatagccaattagcaatacatttaaggattgatagtgtggtatttacaa
+ ggtaatagcccgctaggctataaggaaactactctaaattcaccttccatatatgtgact
+ gccaagaaaatattactagttatcaagctttttttgtgtgcattcaacttcaacgaatcc
+ ttcgtcatcaatgaaacaaaagaacaagccagaactgaaaaaaaaaaatcaaaaccacta
+ agaatgtcacatccttccaaaactcctgctggctcttcaactcttcaaagaatatcaaga
+ taaatatcattatctccagtgtggccgacccatgcatgagtgcgaggagcattaagttgg
+ atcctgggaaaacatgtcaaggacattcaataaagtacttttcatactacaacagacgct
+ agaagaactagaatgtaaccgtgaactgcctctcaaacacataattatgatagcgccgtc
+ ttcggcatcctttttcggaaatatactttcctttttccctctttcagctatattgttcaa
+ ggagtaacataagacataataaccccagtttttttagagaaagttcttcttctttaatta
+ tagtatgtttgaaacgtacaagaattcattgagtgccctaaggattactgcaaacaaggt
+ tcaacagtgttggtacacatgactcagcaagaataccgttccccctcacaacgcttatcc
+ aaggggaggagcatgtcgctacccaaaatatttgctcgtaatttgagatctctgcaaaac
+ aatgcacctcctggcaaaaacatcaatgtcaattgtttgaacgtcaattcttgttcgttg
+ tccgcaagcccaagctcacaaattaatatggcttgtaatggaaacaagcaagatcttccc
+ ataccgtttcccctgcatgtagaatgcaacgatagctggtcaagctccaaacttaacaag
+ ttcaaatcaatgtttaatcataacagatcaaagagcagtggtactacagatgcgtcaact
+ tcagaaaaaggtacgcataagcgtgaaccccggtcgacgatacatacagagctgttacaa
+ agttccattatcggtgagccaaatgtccatagtactacaagtagcacacttatacccaat
+ gaggcgatatgctccacacctaatgagatctcaggtagctcttctccggacgcggagtta
+ tttacctttgacatgcccacagacccgtcatccttccacactcctagctccccaagttat
+ atagcaaaggacagtagaaacctgagtaatggatctttgaatgatattaacgaaaatgaa
+ gagctccaaaatttccatagaaaaatcagcgaaaatggcagtgcctcccccctggctaac
+ ttgtcattatccaattcaccaattgattccccaaggaaaaatagcgaaaccagaaaggat
+ caaatacctatgaacataacaccacgtttaaggagggccgcttccgaaccgttcaatacg
+ gcaaaggatgggttaatgcgggaagattacattgccttgaaacaacctccaagcttggga
+ gatattgtagaaccgaggagatctcgtcgtttaagaaccaagtcattcggtaacaagttc
+ caagacattactgtcgaacctcaatccttcgaaaaaattagactacttggccaaggtgac
+ gtaggtaaagtgtatttagtgagggaacgcgataccaaccagatattcgccctgaaagtt
+ ttgaataaacatgagatgatcaagaggaagaaaattaaacgagtactcactgaacaggaa
+ attctcgcgacaagtgatcatccatttattgtgacactgtatcattcctttcaaaccaaa
+ gactatttgtatctctgtatggaatactgcatgggaggggaattctttagagccttacaa
+ acaagaaaaagtaaatgcattgcagaagaagatgcgaagttttacgccagtgaagtagta
+ gcagctttggaatatttacacctactgggcttcatatacagagatttgaaacccgaaaac
+ atattactgcatcaatctggtcatgtcatgctttctgactttgatttatccatccaagca
+ acgggatcaaaaaaacccaccatgaaagactctacgtatttagatacaaaaatttgttca
+ gatggattcagaactaattcctttgttggtactgaagagtatttagctccagaagtaatc
+ agagggaatggccacactgcagcagtagactggtggactttaggaatattgatttacgag
+ atgctatttggctgtactccatttaaaggagataattcaaatgaaacattctctaacatt
+ ttaaccaaggacgtcaaatttccacatgataaggaagtttcgaagaattgtaaagacctg
+ ataaagaaactactaaacaaaaacgaggcaaaaaggcttggttccaaatcaggagctgca
+ gacataaagagacatcccttcttcaaaaaagttcagtggtcgttcttaagaaaccaagac
+ ccccctctaatacctgcattaaatgataacggctgcgaacttccttttatattgtcttgc
+ aataaacacccgaaaaggaactcagtgagtgaacaggaaaccaaaatgttctgtgagaaa
+ gttgcaaacgatgatgaaattgatgaggctgatccattccatgattttaattctatgagt
+ ttaacgaagaaagatcacaatatcttaacctactctgaaaattatactacggaaaaattc
+ tatacaaagcaacttgtacaaggccaaggcataacagctcacatagaagtttctttaaag
+ acatcatacctgaactataacatgtttacagaaagataaacccatggccaaatttctgta
+ ttgtttctctcaatttatcttatgataaaaatgtaaaaaattatttaaaagtaaacaaat
+ aaacaaaggaacaattcaaaaacgagaaagtgaatgaaaaatataatatcatttattgtc
+ tgataatgctgcatttagaacatacgtaccatccgcatcagtggatatccaatgatagta
+ atttcgcgagtttatccgttgctgttatattatcactttttaatattcttttcaaagagc
+ tgaataatttcaacgcgtcttcgttaatggattccttccgcaattcttctgaaatactga
+ atgcccgatttataatatctttgtccaggcgtgccaatttcgccacattcatcccataac
+ tattataagtcaatcccttttttaacttatatagaaaaattacactcatccagtcctcgc
+ cagttttttgttcttccacgtaatccatatgataattccttattaacggagatttgattt
+ ctcccagcatgggaaaatgggtagtaaataatatcaaggggcagtcacttaactcagaaa
+ aatactttattaaagcataagaaattgctataccatcgtgcgtgccagtacctcttccca
+ cttcgtctaataatagtaaagaccgtttattgcaatttttcaagatgtgtaggatatcaa
+ gcatttccactttaaaagtagaatcaccgtttataatatcatcgtgcgcaccgattcgag
+ tgagtacgttttcaaatatgcttaatctgatctcttctgcggggacaaatgagccgatct
+ gtgccattatagtaagcagtgccacttgtctaatataagatgatttcccacccatattcg
+ gccccgttataatattgattttaccgttttctggggacatcatgatatcatttggtacat
+ aatgaacatccagcgactcgataattggatttcttgcattttttgcgattatggcttgtt
+ gaccattcacaaaagttggtctaacataatttacgttgcatgatgtggctgctaacgaca
+ aaatacagtcatactgcgccaaattgagtgtaattttacggagctctgtatattctgccg
+ taattttgttcaagaattctttatactgtagttcagattcccgaattaataagtccttgt
+ aatattctagcttttgagtcagtttctgggttctgggagtggtaaatctactgaccatct
+ tcgtattgttaacttttatccaatcatctggcaagtccttaatttgcgagtttttcactt
+ cgattaagtaatcaacttcatctctaaaatttagatatggacgtttgagatattttcgta
+ tttctgccaattcttcctttaactgtgaccgtactgattcgctttccctttgaattttta
+ ttataccctctgaacaatcatagttatttaaattaaaaaaatccattacttgcttatctg
+ aatttttttccattaccgcagaaacgttgatcatggtcaaaaaatgaggcaactgagtgg
+ tagaaagtagttcattcaattcactaaatagtctaaaaagtaaaggagattgtttgccta
+ tccttccatctgatgacttgaaatgttctgacaggtaagattgatgcatcttgaagtgat
+ caacgaaagaagttatttgctttaaatagaaatagacttcttttctagaagttgtaccat
+ acattatgcgatttaaagttcttaataagtcaggggtatgattcaacatttgattcaacg
+ attcaaaaaatatactgttgttgatttcggatgtaatgcactcaatggcatcaagccgct
+ cttcaatttggtgtacatcaaccaaaggtttgagaatccattctctcaacattcttaatc
+ caaacgatgtccttgtatggtccaataaccaaaacaaagaacctttaccaccatcatggg
+ taaaaatgtccaaactttgcagggagttaggatcaagtaacatatgtatttttgatgcga
+ agggcgaatagatagaaggaatcaacatcacctgctcattattgtactcaaccatatgcg
+ aatataacttagaaactaagcgtatgagagacggcgagagttgaattttctcattcatta
+ cttttattgcctgaactacatgatcttccaaatcatactcctgcttgtgtattaaaggac
+ atgaaatatctttgaaaaatttcgctacatgtaatggaagatcatctgtatttaccagta
+ cttctatgggctgtaaatatttgattcgtatctgtagtttctcatcagcaagattaggct
+ cttcaaattcatcatacacgacttccccgttatttaaattgacagaaattaaggaatatt
+ tagccacctttccctgatgtacgtcacgggacaaagcccatatactgtttgtatcaccga
+ gaatacgtttccccctaaggacaaaggtggaattaacaccaaatgtagctttggtaaaga
+ catttgaaatctttctttcaaaaacgcttgatttgctggcacctggatcatgcttcttaa
+ tagcgcttgtttctgcttgctctaccacggcaacctttaaattatgatgcacaagtctct
+ ctaggtgaacgtttaatctgacatccgggaaagaacagtacgcaaactgcctatgattgc
+ aatcttgaggattagactcatcgatagtcaattttccaggcacaagtttgatgtgaagta
+ ttctgctaaccgttactgcatcctctgcaaaacatttgtacttgtagcctactctaataa
+ caagcactttatctctatgatgcattttcaggtccttcacctgtttgtccaagggagtaa
+ gtttcgctgtaggggactttctggcttttttttttacgaaatcttcctcaccctcttcct
+ catcatcttcagcctcaacattttcatcacttcgtcttttcataatcctatccaatttct
+ tagcaaagtcaatatcattaaatgttgtcgaagtaccactagtcttttctgaatttttcg
+ aattcttggaagaaacactgcctttgagtggaaagctatcattagttactgttgtgcttg
+ caacagaagataaattgtcctcttcatcagtgtcaaggcagatggattcgctaccagcgc
+ catttccaaccgcaacttcttgttcttgcttatgcgtcagctctgattttaccgccttct
+ tgaaaaacctgcttattgtgggttgtcccgccattattgttaatagattcaacaaaataa
+ atctatttgcactgcttttggctctcaaaagtaccagtttaggttcattacctatcacca
+ ttgttccagcttatttatcattgaaaattcacgtgatcgcaatattccgtcaggagtgat
+ aaatacacgaaaatttttcttttttttcggtggcaaccaaagtcaaaccatcgaattcga
+ ataagataaacaggggcaaaggtgagtcggggaaacactattggtaaaagataagcaatt
+ gagaaacgtcactcagggacaattgattacatgctatcggccacataccgtgatttgaac
+ acagcatctaatttagaaacatcaaaggaaaaacaggccgctcaaatcgtcattgcacaa
+ attagtttattattcacgactcttaacaacgacaattttgaatccgtggaaagagaaatt
+ agacatattttagacaggtcgtccgtagatatttacataaaagtttgggaacgattatta
+ accttaagttctcgggatattttacaagcgggaaaatttttacttcaagaaaatctacta
+ cacagactactattagaatttgcgaaggatttaccgaagaaaagcacagaccttattgag
+ cttttgaaagaacgaaccttcaataaccaggagtttcaaaaacaaacaggaattacatta
+ tcacttttcattgatctatttgataaatctgcaaacaaggacattatagagtcacttgac
+ cgctcctctcagattaacgatttcaagacaattaagatgaatcatacaaattatttaagg
+ aatttttttcttcaaaccacaccagaaacactagagtccaatctacgcgacttattgcat
+ tccttggaaggtgaaagtctaaatgacttattagctcttttactgtccgaaatactttca
+ cctgggtctcagaatttacaaaatgatcccacacggagttggttgacacctccgatggtt
+ ttagacgcaacgaaccgtgggaacgttatagcaagatctataagttctctgcaagccaac
+ cagataaattggaatcgtgtgtttaatttaatgtcaacaaagtatttcttgagcgcacca
+ ttgatgcctactacagcatctttgagttgcttatttgcagcattgcacgatggtccagtt
+ attgatgaatttttcagttgcgactggaaagttattttcaaactagatttggccattcaa
+ cttcataagtggtcggtacagaatggttgctttgacttattaaatgcagaaggtaccagg
+ aaagtttctgaaaccatcccaaacacaaagcaatctttactctacttattatccattgca
+ tcattgaatttagaattgttcctacaaagggaggaattgtctgatggtcctatgctagct
+ tattttcaagagtgcttctttgaagatttcaactacgcccctgaatatcttattttagca
+ ttagtcaaagaaatgaagcggttcgttttattgatagaaaacaggacagtcatagacgaa
+ atacttattaccttattgattcaagtgcataataaatcaccgtcatcgttcaaggacgtt
+ atttctacaataaccgatgattctaaaatcgtagatgcagcaaaaatcataatcaactcg
+ gatgacgcacctattgccaactttttaaaatcgttgttagatacgggaagattagatacg
+ gtcattaataaacttcctttcaatgaagcttttaaaattttgccatgcgcaagacaaatt
+ ggttgggaggggttcgatactttcttaaaaacaaaagtttctccatctaatgtcgatgta
+ gtgctggaatcactagaggttcaaacgaaaatgactgatacaaacactccatttaggtca
+ ttaaagacatttgacttattcgcttttcattcattaattgaagtactgaacaaatgccca
+ ctagatgttctccaattacaaaggtttgaatccttggaattttccttattaattgcattt
+ cctagattgatcaattttggttttggacacgatgaagctattttagccaatggtgacatc
+ gcagggattaataatgatattgaaaaggagatgcagaactatttacagaaaatgtatagt
+ ggtgagttagccattaaagatgtaatcgaacttctgagaaggttaagagatagcgacttg
+ ccaagggaccaggaagtcttcacatgtattacccatgccgttatagcagaatcgacattc
+ ttccaagattatccattggatgcattggctactacatctgttctttttggatccatgatt
+ ctctttcaactgttacgtggattcgtattagacgtcgcatttaggataatcatgaggttt
+ gccaaggagcctccagagtccaagatgtttaagtttgctgtacaagctatttatgcattt
+ aggatacgtttggccgaatatccacagtattgtaaggacctcttgagagatgttccggct
+ ttgaagtctcaggctcaagtttaccaatctatcgtcgaagctgctaccctagcaaatgct
+ ccaaaggaaaggtcaagacccgtccaggaaatgatcccattaaaattttttgctgtagat
+ gaagtttcatgtcagatcaatcaagaaggtgctcctaaagatgtcgtagaaaaagttctt
+ tttgttctcaacaacgttactctggctaacttgaataataaggttgatgaattgaaaaaa
+ agtttgacaccaaattatttttcttggttttccacatatttagttacgcaaagggctaaa
+ acagaacctaactatcatgatctttatagcaaggttatagttgctatggggtcagggttg
+ ctacatcagttcatggtcaacgttactttgagacaattatttgtcctactatctacaaaa
+ gacgagcaagccatcgataaaaagcacctaaagaatttggcttcatggttaggatgtatc
+ acattagctttgaataaaccaattaaacacaagaatatcgcattcagggaaatgttaatc
+ gaagcttataaggaaaatagacttgaaatagttgtgccttttgtaacaaagattttacaa
+ agggcttctgaatcaaaaattttcaagcctccaaatccctggactgttggcatattaaag
+ ctgttgattgagttgaacgaaaaagcaaactggaaattaagtttgactttcgaagttgag
+ gttttattaaaatcttttaatttgaccaccaaatctctcaagccctcgaatttcatcaat
+ actccggaagttatagaaactttatccggtgctttgggatcaatcactctggagcaacaa
+ caaacagagcaacaaaggcaaattatactaatgcaacaacaccagcaacagatgctaata
+ tatcaacagagacaacaacaacaacaacaaaggcaacaacaacaacaacatcatattagt
+ gcaaatacaatcgcagaccaacaagcggcatttggcggcgagggttcaatttcacacgac
+ aatccttttaacaacttacttggttctactatttttgtaacccaccctgacttgaagagg
+ gtatttcaaatggctttagccaagtcagttcgcgaaattttgttggaagtagtcgaaaag
+ tcatcaggaattgctgttgttacgacgacaaaaataatacttaaagactttgccactgaa
+ gttgatgagtctaagttgaagacggctgcaatcattatggtaaggcatttggcacaaagt
+ ttagctcgagctacttcaattgaaccattgaaagaaggcatacgttctactatgcaatca
+ ctagcaccgaatttaatgtctctttcttcttcacctgcagaggagcttgacacggcaata
+ aatgaaaatattggcattgctctagttttgattgagaaagcatctatggacaagtctact
+ caagatttagcagaccaattgatgcaagcgattgctattcgtcgttatcacaaggaaaga
+ agggcagaccaaccatttattacgcaaaataccaatccatattcactgtctttaccagaa
+ cctcttggtttgaaaaacactggtgttactcctcaacaattcagggtatacgaagaattt
+ ggtaagaatattccaaacttggatgttattccgtttgcaggattgcccgctcacgctcca
+ ccgatgactcaaaatgtgggttcaactcagcctcagcaacaacaagcgcaaatgcctacc
+ caaatcctaacctccgaacaaataagagctcaacaacaacagcagcaattacagaaaagc
+ cgtttgaatcagccatcccagtcggctcaacctccaggagtgaatgtcccaaatcctcaa
+ ggtgggattgctgcagttcaatcagatttggaacagaatcaacgtgttctcgttcacctc
+ atggacattttagtttctcaaattaaagaaaatgctacgaagaataacttagctgaatta
+ ggcgatcaaaaccaaattaaaaccatcatttttcaaattttgacattcattgcaaaaagc
+ gcacaaaaggatcaattagctttaaaggtatcccaagctgtcgttaatagcctttttgcc
+ actagtgagagtcctctctgcagagaagttttgtccctacttttggaaaagttatgttct
+ ttatccctcgttgctagaaaagacgttgtctggtggttagtttatgccttggacagtagg
+ aaattcaatgttcccgttatcagatcccttctagaagttaatttaattgatgctacagaa
+ ttagataacgttttagttactgcaatgaaaaataaaatggagaactcaactgaatttgct
+ atgaaattaattcagaatactgtcttgtctgatgatccaattttgatgagaatggacttc
+ attaaaaccttagaacacttggcctcttcggaagatgaaaatgtaaagaaattcatcaaa
+ gagttcgaagatactaagataatgccagtgaggaaaggtaccaaaaccacaagaacagaa
+ aagctttacttagtatttacggaatgggtaaaattacttcaaagagttgagaataacgac
+ gtaatcacaactgtttttatcaagcaattagtcgaaaagggtgttatcagcgatactgat
+ aatttacttacatttgtcaaaagttctcttgagctatcagtttcttcattcaaagaaagt
+ gacccgactgatgaggttttcatcgctattgatgctctaggatcgctaattataaaattg
+ ttgattttacagggtttcaaagatgatacaagaagagattacataaatgcaatattttct
+ gtgatcgttttagtgtttgctaaggatcatagccaagagggtaccacattcaatgaacga
+ ccatatttcagactattttctaacatcttatacgaatgggctaccatcaggacgcacaat
+ tttgttagaatatctgattccagcactaggcaggagctgatcgaatttgattctgtattt
+ tacaacactttctcaggatatttgcacgctctgcaaccatttgccttccctggattctca
+ tttgcatgggtgacactattatcacacagaatgttattaccaattatgctaagattaccc
+ aataaaataggttgggaaaagttaatgcttttgattatcgatttgtttaaatttttggac
+ caatacacaagtaaacatgcagtctctgacgctgtttcggttgtttataagggaacactg
+ cgtgttattttaggcatttcgaatgatatgccatcctttttgattgaaaatcactatgaa
+ ttaatgaacaatctacctccaacatatttccaactaaagaatgttattttatctgctatt
+ cctaagaatatgaccgttcccaacccatatgacgtggatcttaatatggaggatattcca
+ gcatgtaaagaactacctgaagtcttctttgatcctgtaattgatttacactcattgaaa
+ aagccagttgacaactacctacgtattccctcaaattcattattaagaacaatactaagc
+ gctatttacaaggatacctatgacataaaaaagggcgtaggctacgactttttatctgtt
+ gatagtaaattaattcgcgctattgtattacatgtgggcattgaagctggaatagagtat
+ aagagaacttcttcaaatgcggtatttaatacgaagtcttcttattatactttattgttc
+ aatctgattcaaaatggtagcatcgaaatgaaatatcaaattattctgtctattgtggaa
+ caattgcggtatccaaacatccacacctattggttcagctttgtgttaatgaatatgttc
+ aaaagtgacgaatggaatgatcaaaaacttgaagtccaagaaattattttaagaaacttt
+ ttaaaaagaattattgttaacaaaccacatacctggggtgtttcagttttctttactcag
+ ttgataaacaataacgatattaatcttttagacctgccctttgtacaaagtgttcccgaa
+ attaaactaattttacaacaattagtaaaatattccaaaaaatacacaaccagtgaacaa
+ gatgaccaatccgccaccatcaatagaaggcaaacccctctacaatccaacgcataaaaa
+ aactgcatttatttatatacgaaaatcatgatttcagaaaaaaaaatataacttgtataa
+ tactatatttcagagaatactattttgaataagaatagaagcaatttcgattaacttttc
+ ttactgtatatgggtcgctttcacttttgaactactgaaaattagtgttacccgtattat
+ tgttttggagaataataataataatgagcatcaatttatatcaaacaagagctgcaatta
+ ccttatgtccatgacgcactgcgaacaaagaataattgagtaatcgaacttgaagttcta
+ tagaacagcttgtatatggtttcattgttcaaaagaggtaaggctccaccgctcacgaaa
+ gaaggccccacttctaaaaagcctcctaacacagcgtttagacaacaaaggcttaaggca
+ tggcaaccaatactgtctcctcaaagtgtgcttccgttgttaatattcgttgcatgtata
+ tttactcctattggtattggactcattgtaagcgctactaaggtacaagatctaacaatt
+ gattatagtcattgtgatacaaaagcatctacaactgcttttgaagatataccaaagaag
+ tacattaaatatcactttaaaagtaaagttgaaaataaaccacaatggaggctaaccgaa
+ aatgaaaatggcgaacaatcatgcgaactgcagttcgaaatcccaaacgatatcaagaaa
+ tccatttttatatattataaaataaccaatttttatcaaaatcatcgcagatatgtccaa
+ tcgtttgacacaaagcaaatattaggggagcctatcaaaaaagatgatctggatacaagc
+ tgtagtccaataagaagtagggaagacaaaataatatatccctgtgggttgatcgctaat
+ tccatgtttaatgatacattttctcaggtgttgagtggtatagatgacacagaagactat
+ aatttaactaacaagcatatatcatggagtattgatcgtcacagatttaaaaccaccaag
+ tataatgctagcgatattgttccaccgccaaactggatgaagaagtatcccgatgggtat
+ acagatgaaaatcttcctgatatccatacttgggaagagttccaggtatggatgaggact
+ gcagcctttcccaagttttacaagttgacgttgaaaaatgaatctgcttctttaccgaag
+ ggtaaatatcaaatgaacattgagttgaattatccgatttcactctttggtggcacaaaa
+ tcatttgtactgactacaaatggagctattggtggtagaaatatgtcactaggcgtactg
+ tacctcatcgttgcagggctttgcgccttatttggcatcatttttttggttaaattaatc
+ ttccaaccaagagcgatgggtgatcacacttatttgaattttgatgatgaagaaaacgag
+ gattatgaggatgtacacgcagagaatacaacattgagggaaattttatagaggccaaaa
+ aacaaactttagtgcctgtaggtatttgtgtttataagtaaacaagttttgttgacttcc
+ tgaaacgtaaacaaaagaaaaattgaagtatgcctttttacgcttggaaaatagtgcgac
+ aaagaactcccagggtttgctattcccatttagatgatgaaatgatagaaaagtggaaaa
+ cgtttatcatgaaatcatctatacatatacgtaaaaatgtaaactatgtaaaaatatgtt
+ ccctataaaagtgatgctcttccggaacgttttctttttcccacaaatctcgttgaaacc
+ tgaaccaatctggtaattcagaatcacagggtatccttattttgacgttatcttttcccc
+ ttgctttataatagccataatctccattttcttttttattaaatgctaatctatacctat
+ ggccaaaaattcctaggtcagaagaggttttcttcaaatcgtctcctagaactccactta
+ cttgtaatactttattatttttcactttggtttcgatttgatttattaatgtcaaaagcc
+ gctgaggcacttctggctctctgcaaaggtcatcttcatttactctaataagtttgccag
+ aattggccttatactctatcgaatgcgtcctattattctcaaggggcaatttttttgctt
+ tgttgtccacaattgtcttattatccttctcctgttctatctcaatgttaattatttcta
+ gaaatgtttccgcaaaataatttctattcttaccagaaaataatctgtactcatttatga
+ ttgatgcaatattccatttctgaatttttcgaagtataccaatgaccaacgcggtcttat
+ ctaccaataaaacattataattatccacattcaataatgttttgaaagttctttttaaac
+ acgtgctcttaatcagcattaagtcatcattgtcagtcaagtggtaagccaattcctcct
+ gaattacaggatcccctgtagtagaattttgtgacccatttgcagtggattttttctttt
+ cttcctgtaatgatagagtactgttattattcgaatataaatttgcgttagatacagaag
+ agctttttaccggaactgccgcagccgagaaatcagacatccttaaaacaatccatttta
+ tggatgatcttgtgaaaaaatctttgaaaaacttggatggttcttgacctccaataaata
+ tagctgtctttaaattgagagtttccagaaaggataaatttaatgtctccactttagaac
+ atctatatattccttcttcagcaatcccaaagttagcaggtggaaccagcattatgaatt
+ caggttatggatcgcaccagtgctgtaagtgtctctctttttattctacgatctccatat
+ aacaaccacatctcatttctttaccttactgtccatttaatgctagaaaacttccgaata
+ ttgtattattattacccggatatgtcaaggcggtatattaagatgttcctttcaaaagtt
+ tcattataccgcatacggtaaggacagccataagtagttttactgataaaatatggggct
+ aaagaactgtatatttatggatagcactaaaagtaaagattttccccgtgtcctgacttg
+ gagaaacaaaatttcattgtcataaaaaacttttaaaatatgatattcaaagcttgagca
+ ttgggcttctggatgacagttctaatcaagttgggtctaaggatattgcatgtatacaaa
+ ggattctttcggaaagtaatcctgaaatatttttttttttcgtctgaacatacgaaagta
+ aataaaaaaagttcaatgcatgccttcctatgcaagatttataaaaggtgaacaggtcag
+ gagcagccacatctcaataaaggcagttaaagacggtgatgtaggtgcagaaagagaaag
+ tgaacaaaggatgatttgagatgtgtaatgattcgtttgagcagcaacaactgctaagat
+ attcatactaataggccacttcacacattatgccttggagaatatcctgcattgctcctt
+ ttatagtatagtctattcagcactaacaagcaaagcgaacataatggtaaaactaacttc
+ aactgttgctggtgtcgcctgcatcgctggccctcccctccactacggccattttagcac
+ aacttgaatgagtcaacttggttgagttgcatgttaacgtctctgatatcagagctcact
+ tgctcaataatacatgttacaaactgcccactaaattgaacactctaacattatattgac
+ ctgaattatttgaaacgaagtcaacagaatgatcactgattttccaatgtactccgctat
+ gttgaagctagccatcttaagcgatcaatgacggtgctcaatcattcttcgaaatgggct
+ aaaaatgacagttgaaatatattctttacgttcattttttttcatactgcaaacccattc
+ tagaaccaatggtaaaagtgacgggagttgctttcttttactatttcaatgatagctata
+ gcctgcatacacacaaaagaatgtaagtgattgcgtgaaagcttaaagtcttaaaactgt
+ atttttttcaactcctcccatatattaaactggtcctcacagttcgcaggataactaaca
+ attaaaaaaattgttcatatggttgtatttctttccgtctcacattattaaaattcaacg
+ aagaagtttctacaataacagacaagggcctacgattactatgtactggaaggtgagtct
+ tgaattcatcctcagagattgttctaaagaccaacagaaatccatatagattaccgtagt
+ tactctgatggaatttcagagtaagacccttgtcaacttcatcactttaagttgagatgt
+ agaaagacacttttgtaaggtctccaacaagtatatttgataaatagactgttgagctac
+ aaaaactcaatgtgaagtaaaagacctcataaatgatcgattatctgtttggcaggtact
+ aataaaactgcattggtgtcttttttcacttacgttaaaaaaaactttttgcaatgattt
+ aagcgtgcgtgaagataacactacaatccattttaaagcaacatccacattgagtgtata
+ caccacaaaggttttttcagggcgtttttctcgccactttatgttgaccaaaattattaa
+ tggaacttacaacgtttccaaaagttagttaaatacatacgtctatttactaagcaagaa
+ atatatcatgacaagcccaaatattatattgttatgtttacaaaaaaaaaatggctatat
+ acatcaagtctggaggctttttataacaagcaagtggggtaacttagacataagattgac
+ ttctttgaattcaacaaaaatacatacttttgatgatttcaatggtagaagcagtagtaa
+ ctatgccgatgaaataaaatcctgatatcatctatatagtagtactcacgttactagtat
+ atcatctataagatgtaggaaaatgtcataagaaatgagaaacagtcattaaatttaatg
+ gaaacgaaatgcaaggattggtgatgagataagataatgaaacatagaaaacgaaaggag
+ aaataacagtagtattattattattattattatgtagaaatattgactctctcttgtaga
+ ttcatatatcttcaaggggaacttcttgtacattttgtatacctatattatagcgtttaa
+ caacaatggaatcccaacaattaccaaaattttcacacattcctcatataaaatctaagc
+ aatgactagagaactatcgctaaattatatagagcaaggaaaccgaataattacatgtgt
+ cagataagaaaaaaataaagaaaacaacgcgtcatgaaaaagagttactgaaccttcaga
+ tcctacttattgtaatgcttcgcgacatccaatccatttaataatcaatttaaaactaga
+ gttggtagagttccttgttgaacgtgataacccaaaagcataatacgagtaatgtttcag
+ tattgctattatatgtttacacaaggaaaacatataataacaaacctctaatccggtagt
+ acttaagaaactatagtttctatgtacaaaaaggtaactatgtaattcttacatttacat
+ aacatatagaagggtccaataaacttactaaacttactaccttgttgtatataggctaga
+ tcgtaatccactacgtcaacataaaaaaaacttaaggagtttgaattttatgtacaaaca
+ gattgttaaaatataatataagattatggaaacgaacttgctctaaaaaaaatttaaagt
+ tttataaaatcctcgcactatcgctgttatacatgatgtccccaaagcgtgtacaaataa
+ ttttgtagtattgtatcggtaatatcatacaagaagaaaatagttgcaaatctctaaccc
+ actagtacttaaaaaaactgtagtttcagtgcaaaaaagttttaacattacgtatcttgt
+ accctttttattgcatatagaaaggtcaaataatccttcacatcatgaaatataagctaa
+ atcgcatttcttttcgtccacatttgcaaacaaaacttttcaataataattttataaata
+ gtatcaatatatatatatatatatatatttatttgtttactttttctatcagtgttttca
+ attttttattaaacaatgtttgattttttaaatcgcaatttaatacctaaatataaaaaa
+ tgttattatattgcaaaaacccatcaaccttgaaaaaaagtagaaacgttttatttaatt
+ ctatcaatacatcataaaatacgaacgatccccgtccaagttatgagcttaatcttccat
+ aaaaatatttgaaagcaatagatcatgtactaaactaaaatcagggaaattaagactcct
+ tttgaagtaatacctattacttactaatacgtttgagaataagcgcaggtactcctggtt
+ tttgttaaaactacaaatttatacttagcattacgaagattctcgattccgaaaaacaaa
+ aattttatcgtcatatacaaatctagaaattaccagagctatccatcttgttcaagaagg
+ taggcgaggacattatctatcagtacaaacatcttagtagtgtctgaggagagggttgat
+ tgtttatgtatttttgcgaaatatatatatatatatattctacacagatatatacatatt
+ tgtttttcgggctcattctttcttctttgccagaggctcaccgctcaagaggtccgctaa
+ ttctggagcgattgttattgttttttcttttcttcttctattcgaaacccagtttttgat
+ ttgaatgcgagataaactggtattcttcattagattctctaggcccttggtatctagata
+ tgggttctcgatgttctttgcaaaccaactttctagtattcggacattttcttttgtaaa
+ ccggtgtcctctgtaaggtttagtacttttgtttatcatatcttgagttaccacattaaa
+ taccaacccatccgccgatttatttttctgtgtaagttgataattacttctatcgttttc
+ tatgctgcgcatttctttgagtaatacagtaatggtagtagtgagttgagatgttgtttg
+ caacaacttcttctcctcatcactaatcttacggtttttgttggccctagataagaatcc
+ taatatatcccttaattcaacttcttcttctgttgttacactctctggtaacttaggtaa
+ attacagcaaatagaaaagagctttttattcttgatttttgttctttcggggaaactgta
+ taaaacttccaaaaaggaaaagtaaaacaatacatctccttatatcaaagaaaatcaaga
+ aggacaacatggatgatatttgtagtatggcggaaaacataaacagaactctgtttaaca
+ ttctaggtactgagattgatgaaatcaatctcaatactaataatctttataatgtatgtt
+ ttcatttcaaggatagcctttgaatcaatttactaacaatacttcagtttataatggaaa
+ gtaatttgactaaagtagagcaacatacattacacaaaaatatttctaacaataggttag
+ aaatataccaccacattaaaaaagagaagagcccaaagggaaaatcatcaatatcacccc
+ aagcacgggcatttttagaacaggtttttagaagaaagcaaagccttaattccaaggaaa
+ aagaagaagttgcaaagaaatgtggcattactccacttcaagtaagagtttgggtatgta
+ atatgagaatcaaacttaaatatatcctatactaacaatttgtagttcataaataaacgt
+ atgagatctaaataaattcgttttcaatgattaaaatagcatagtcgggtttttctttta
+ gtttcagctttccgcaacagtaaaattttataaaccctggttttggttttgtagagtggt
+ tgacgaataattatgctgaagtacgtggtgacggatattgggaagatgtgtttgtacatt
+ tggccttatagagtgtggtcgtggcggaggttgtttatctttcgagtactgaatgttgtc
+ agtatagctatcctatttgaaactccccatcgtcttgctcttgttctcaatgtttgttta
+ tatactcatatttctatgtgtttatacaattgctattgtttatataatgtagtgacattt
+ tctcttaatcttatactaatttctatgacatttatataagaagagacttatgatcaacat
+ aattttgcaaactttgagagaaatatgtctttctactgcgataaagttattatttagatt
+ acatgtcaccaacattttcgtatatggcgatataatttatcatgttttggtatgataatt
+ taatttttaaaaaaacaaatttaattgacctcattaattaatatttattaatacctttaa
+ atgttgaggtaaatagctattttctctcttcttttcctttagttggaatttgcacaagaa
+ aatgtttttccacacactttagcgttttttcctaaatgttggaataaaaaacaactatca
+ tctatcaactagtagtcacactaccaatgtgttatcattatactgtgttaaacaatgaca
+ taaggtatgaaaatttgtcaacgaagttagagaaagctggatgcaaggattgataatgtg
+ gtaggaaaatgaaacatataacggaatgaggaataatcgtaatatcagtatatagaaata
+ tagattcccttttgaggattcctatatcctcgaggagaacttctagtatattctatatac
+ ctaatattattacttttatctacaatgcaaccccacaataatataaaaattcaccaattc
+ cgcatctgcagattactttcctaaatttgcatatagaattgtcaagcgcaaatccgacgt
+ cgattccgcggcggatgggtcattctaggtcattctaccaattttatttgagaccaggtt
+ tattcaaccggtaacatagaaatattcatacaattaagcttctatggccaagttggtaag
+ gcgccacactagtaatgtggagatcatcggttcaaatccgattggaagcattttttatca
+ cgttattcggtgacacccaggttgccgccgcgttcgcgtccatcgtcatctgaaaaataa
+ tgaatattaatggaccttgtgccccataaaggttccatgttccataagtcttcaataata
+ cttttgtatattaaatttaccgttattcggagatctcttacggcttatgattttctttac
+ attccaggccgccttttgatgacattcagtgcgtcacgacgcgaatagggctttgaaagc
+ cttaatgtggagtaaataaatatatgaatgcaataagtaaacataagacgagttcttcta
+ tatccggtgtacctaataatatagattttgccaacaatggaatctcattacccatcccaa
+ gataaaatattgatcaaggcatgggccaccaacataggaatacgcaaaataggattgata
+ tttttacatattatattatctttcttcttctgtctcaaatatagctcttctcctatttaa
+ aaaattatcaattcttgaatctcaacttccattaaatttgatgattgtttcccaatatcg
+ atgtcatctttttacactgtaaattataatttactaggaacatgaatactgatcaatgga
+ tgatagttgattcttattgcaataaaaagtactttctaagacgctaggacttctaaacac
+ agatattatgaaaatgccaaaaaatacagaatattttccttcttttcactataacaagta
+ gtaagaaattaagcaatatggtaatttgagcttatcagataaaagatcctggtagtcttg
+ caaaggtgtgagtacttttgatgaaactaagtttcagcgcagctatttcaattttggaac
+ ctttaaagaaagtttaatcgaaacattgaacgatccaaagcacaactttgtagcgggaag
+ tcacaaacaaatataaaaagtgttaataaactgccatcctatgttagaattgtgaaactt
+ tttggactaataaaaggttttcaacatggtgttccaaagcacttatttctaggttagcta
+ tggtaacgagttttaacgtaatcaaaacaatatacaattcaagatctgtatcatttacac
+ taacacaccccattttaatagagctgagtacttcctacatttcttttgcatatgatatcg
+ ccactgcatcatttctgtagtcgtaaaatttccttctcattaagacaccgaaaatagcta
+ aggattataacaaaacaaacaaatgctcaaacaaaacttttagtaaatcttttgaaatgt
+ aaaagttacttagaataactaaaaaaaaaatattacgctataccatatatttccattccc
+ aattttatcgttttataataatattcagtttcaagaacaatgaatatatttcaaacatta
+ catgcactcttgggagtaagctgcatcataataaattaaaattaacgctttagaaaaaga
+ aattttgaggttaaaggatggaggaagaaatcgataccaatttcaactttgatcgacctg
+ tctgactgaaattatttctgcgtttttcttactacaattagtaccattctcttcaacttc
+ aatagttctaaccattgattcttcatctgtctcatcaaatcccatcttaccagtccagcc
+ attggataccaaatagttgtgaaattcaacgtcttgcttcattaaatcgctttcaagaga
+ atgtggaacaaagaaatatgtaataatgataccaataagaccacaaattgcagcaataat
+ aaaagtccatcttgcacccaaattatccctaatgggttggaaacattcgacgcctactac
+ agaaccgattttaccagtcacagcagataaaccatagaaaacacctctaacagcggttgc
+ tgacgcttcactactaataacaccaagcatatcacctggtccagcattacctaacatatt
+ catgaatgcgtagaagataataaacaagggggtgatttttttcaattggtcgtacgcaca
+ tccaatgattagaccaaagatgatgtacccagagaaaccaaacatcaacgtatatttacg
+ accaatacgatcggacagataagcaccaattggtacacccagtacagctaaaactcccaa
+ cagtaagttccactctgccacttttactaaatcattttggtccttgataacggaactgat
+ aattgttgaactgaaaataccatttgggaaggtaacaaaatcatacataaaccatgtacc
+ acatgtaccaagtaaccttttccaataaaatttcaatgctaggaaatatggtatatttct
+ cttgattctacctttttcatagacttctgtagtagcagtcttccatctaaaatagaacac
+ actcaatggccagaagcaccctattgcaaaaacagtcctccagatcgcctctaaatgttt
+ tgttcctgaacagattttgtagacgattaaaaagatgatcgtagcaaatggaccaccgaa
+ ggctagtggcaaatttgtcaccataaccaggataccacctcttttggtagtggtatattc
+ attagcagactcattagcacttaatgtactggtaggatattctgcaccaacaccaatacc
+ taccaaacctctcataactgttaacatccaaaacatgccaggtacagtagtaccgtgaga
+ ggcagcacacagagcactaccaataaccaagatagcagtggccacaaggatacaagattt
+ tctactataataatcagcagcgatacccataaagaattggccaaaaataataccaactag
+ ggctgcgttggaaactctagtcgacacttttgagctatagtttttcttaccgtactccat
+ aacaaaaaccttgtttagcatactcattgaaccatttacgtaaccatcacttatcagagc
+ aaaaccggacgcaatgattgtaactatgtttttccatttatctttctttgaggtttcagt
+ acgggttgcacgcctctcggcatcatattttattattctaggattagtgttctcgttcac
+ ttccttctcatttaccgatgtgatatctttgtcttccattttttatcctattctattttt
+ ttgattataagcaaaaagatcaactcacgtctgcggctggttgactagtcacaagaaaca
+ gataataaatactgtaacgaacatctgctacaactacattcatcggggtttatatattca
+ cgagtttcctacttcatccagacacattctggatgtcttccagcagatgtccctcatcaa
+ agaattaagatagccagattgaaaaactaggttaaccagagaactaagttaactaatgtt
+ ggaaaataatcctacgatacagtggacggaatgcccatatgcctatataatcaatccatg
+ ctaaggtgggggtccagtaaggaaattcatgaatatgaagataaatgtggcaccaaacgg
+ caacgcggaagaccgtaggatccaatcatataacccctattattctaacatcattgctgt
+ tgggtacaacattctacgccgctattactacgtgctgcttcttttgattctctgctttct
+ ttaccgtcatgcacgtgcgccgcgaaaagtattaatcatgcggacactctggcatttagg
+ ggatactgtcaaaaagactatgctcaatactgtcatattcatatgcacgttcttttttta
+ atttatcttggcaaaacgtggggctggaacttttgaaacgcactgcaacatctgtggctg
+ tacagtcaagaaaagtaagtactcatcgtcatgctctaatgtgtatatggtatttgtggt
+ gcagacaccattagaagtatagttcacgaaatgatgacacgatagcatgtgcttttcgac
+ ggctgataaccttgtattctcttatttttgtattcttttcgtaagcggcatttgtttgca
+ gaaccttatcgcagatatcgataaacatttgttgacaattctttttcgctccccttcttt
+ ttgtttatcttcgttatattcaatagtctacatacactaaccgttgaattgttgcaaaca
+ aatactaataaaaatctatatatagtaaattgaagatgtactgatcagtctatcattgtt
+ tagtgcacccagactcgaatcttaaataccactttacacacctactaaattttgtcctca
+ caaaatgaagacaggattcaaaaccgattaatagtagcagaaactaaaaaagtacgaata
+ ttagtaaaattcatgttcttgaatcgagctactatctttgtcgggagggtaaacgattat
+ aactcaaaatgactggaactggtgattattaatttttacgtttcctgtgccaataagcgg
+ aagataagaggatagaagaaaagaaaggcggcacttggcgaactacaatggcgattatat
+ tcatggcgattatattcatacaaaggtaatggaggcctcggataatggacaatattgaga
+ aaatccttatgcttacttctcttaataaaaaatagacacagccatttattatgcgtaaaa
+ aagattacccacttgtcttcgatgcgtgctgctgccaatcaaccttttgagcggaacttc
+ gagctcgcaatgcgtctggaatgttgctagagacagtcttggttatctgtgacatgtgtt
+ tcgttcaggcgtgtgagcatcttcttgttcgatttcaaaattaccgccttgactcgtgaa
+ actggataattcgttggcgttttcatataagtcgtctgatggcgaaaacttttcctttac
+ ttagcatacagcaaatatccccatttgacggatttttgaaaaatgagcccgctaacccag
+ aatgaactgcattaccaagcatttatgtaaacgttccgccaccatctttggtaaggtata
+ ctattatgttctggatttaaggttgattcacaatttttcatcaccaaaatctggtggcat
+ gcctagttgtctggtttcaggcaatttagccatcatagaaaagcatcctctgtcttgagt
+ tgagaaaatgttactcatagagccaaacaaataaaccctggaattaaaatatagtttaaa
+ cacaacttcatattacactgtaaatacatatcacaataataaattaacgaaatagtaaga
+ ccttatgtttctatatgacatcagtcccgaatagtgtcaattttttgtcgtcagtgtcga
+ acatttttactgtttattcatcatatcgtgaggtatgtaggcatcttctattgacttaag
+ catgataaaagattcacccaaaactgtatcgaaatattgataaaattaaatttcaaagtc
+ aaacttgttttcaatggttgctattttcttaccattcgagctctccctcaggattttttc
+ acatggaatgttcatggcctcaatttccatgggctttttgcatttatctcctttaattaa
+ ttgtaatggctctaatgatacagtttcgccattagctttgtcacatatatcggaaagggc
+ aatctgatcataatctggtatacagtcaccctttgcatctctaacaaattcagaagagca
+ ttcgtagtcggattcagtacaactgtcacaaggtatttcatgcaaaatcatatctttgaa
+ tgtcctttttaccaaacaacgggcgtcctgtttcctccttttatacatgtatttggcgcc
+ attaaggcatttcccatctgccaaattccagtcttcgaaatcttcctcttcacacgtttt
+ atagtcaaaatgcagcagaaaaatcgatcgcatacaggacacttcttgcaatatatgaga
+ tgttggtgttgtttccttcttgactagccgtagtaatgagatgtccgcttaaaataaatt
+ ttgagcctgatccatctggcgttgtattaatcaactctaatggatggatataaaataaag
+ cgtcaacaagctcatattgattccatgttctaccttggtccaaggaaaaataaagtttgg
+ attgtacatcaccatctttatatgaaggagacgggatatatacaataatatttcccagat
+ caccagtagcatataatccagaagaattatgggccaccctccatgttagaccaccgtctc
+ tagaaatgaaagtttttcggtccttccgatcgaattcattgtcatcgccaacagaacctg
+ tcattaatatgatcccagcagaaggatttgaaagttttagattgtaaaaataaccctgaa
+ gtgaacatctctcaggcctagtgatatcacagccgaacgaatcggcattttcttcatcaa
+ caactttcaaatttgaccatgtgaggccgttatcaacggatattttggtttctcctccgg
+ ctatattttcagtatagtttacttggtgattagaatagtcaatggaaggatgaaacgatc
+ caaatattgtccctttcaagaaatcaggaaaagtgagcctaaaatttccaaactgatggt
+ ttggtgtccattcaacaggtgagaatttcaacccttgcgaatctgatatgaaaatctctg
+ agagatctggatcagcattttcctcatttgtgatttctgtcgatataagaaggatgatcc
+ ttccgatagaatcttcacgaattttaatcactttgacgtgccgtaattgagtaggcagct
+ gtgccttttgaaaagtagatgcatcattggatatccagatgtccgtagatgacatctcgt
+ tatgcccgtcatcttgcgttaaaacgaccacatatgacttcaatatcttgtattgagtaa
+ caatcttatctttaaattcatcaaaatctttaaacgttttgcctccatcactactgaaag
+ ccaactgtgtgtaggtagcagttaagcgatattcttcatcgtattcacggttagagaaca
+ aacagagtatcgaagtatcattaccagtaaagtcagaatccttgctagacttaatgaaat
+ gacagctttaagttgttatatcgtcgtcttttttcttttccaaggaaggtttaattttaa
+ aaaaggattcccatcattgctcaagtaaatttcacttgggtcatataaaacgtctgcgtc
+ tatcatatagctatagcaattcgcaataagaagttctttgtttctaggatgagtagttat
+ aaaacagtcgccatctaaatattcttctgatgagggtatagttaaagccctccatgactc
+ cccacggtcgtcggtaatgtagaatttgggtgattcacatataaaagcgaaacctctttc
+ ctgtccatggaaaggatcaacagtaaatctgaaaatgcgctcttcaatttctttgaccgt
+ ttcccaattttctccagcgtcgaaacttatccatacaaccccccaaccatctaatctgat
+ agaagtgttggaatcatcaaaactcactagattaaatgatgtttctataggagcctccgt
+ tacttttgggacaaatttctctgcattgagtagcggaagtagtataattacccaaagagt
+ atatatggcatgaagtaatatcataattgaggatctcttaccttttcttagttcaaagtt
+ ttttttcaaagctcggaactttatgctacaattcaatgactaaaaaaaaggtagcggata
+ atcttttggttgcaaattggcggcatttaaatatattttgggttgcaagtattctaaaaa
+ tattctcataatattctaaaaatatccagtctaggtaattgcaattattttgagtccggt
+ gactttgagttgaccatgttcacttcctgtacagccatcaaggtaacgtgtttaccatct
+ aatggagaaaaccaagagataagactagaaacctagttaggatacttggatttgctaatg
+ tcgccagtagtacgataataccataaagatacgctgttgaacctgatttgtatatttaaa
+ gaatgggaatctttaagggatcatagtctagaatggtattctgcagaatagattttaaat
+ ggtgaaaactttccctcactatgtccaattgatgctttttaggatgacgttagaatttta
+ ttctggtgaaggcacttttaatatgaagattcgttgaagtatcaataaccttttattggc
+ atacccttacctaactggtctctctcacctcgtgatgttacgaacgccacgattattctc
+ aaactaggtaaagattttttcttcaaaattagtgggaaagactcgacagacccgcttacc
+ aggtattctttctaaacaatagggtttaccagtacccctgaaaatgataatgcccatgtg
+ cggctgatcgcgttaactacgtatgcagttgaatcaaagccataccacagaaaatggtac
+ aagagcccactggatagaatagtgctggctatcaaaccaatattttctttccagctcttg
+ ggggacctctactgcgttatatcgtagtttccattcagcctgcttatcgccttaggaaaa
+ gggaatgtaagaggcagaaaagtgcggcgcaatcgttcggacagcgagccgctattatag
+ gtaagaattcttaaagagcttcgccggatattattacagtgtattaaatttcgccaagtc
+ tttgccaatggaaataggtgcataattttatgtgcttggtacatatgagttcaatagata
+ tgtagtagttggatagtagatagtaaggatatataatcatggaagggtgacgcacaacaa
+ atcacataatgagttatgattttcactgttctcatctgtttcccctacaaggcgtgatca
+ ataacaaacaatttcattaaaaaaaaaaaaaccaagtcactctaacatacgtaataactg
+ tgtatacaaaagaaaaataatatctgttcgtcatctagtttaatacggcaactagttttt
+ caccagagttcttaccgattttaatgtcttccaggatacgaggaacatcgtaaagcccgt
+ tcttatagacccttgctggaatatggtgaatttgcccatcactaatctttggattgatga
+ acttgacgaattcggtggcagctctcctggcttctgggtcagcagggaaagtaatgccac
+ caaatggtacttcatggccgcctattgaatacagtcttgttctgtcaatagtgacatttt
+ gcctcctattctcctttttgacgttttcttctgtcaaattagttaattcgacaacggtag
+ catcctgtttatcggccgcacatttgtacacttgttgaagcgtattttgattcgcgacac
+ agtcgactaaatacgagatattgttgtacttgtgtttaatttgttctaccacgtcaatat
+ catggtaatcaaatagttgatcagcaccatattctttcaacagtttttcgtgtttccgag
+ aagccacaacaatgatcttggtgaagccattcaatttattggctaattggatgagcgact
+ gacctactgcagttgcaccgccccataataagatggggccgtttctttgtggggttgatg
+ gctcccacttcaggttcaagcccaagttataggtcaacaccaagccagctgtggtcagtg
+ acactgggatagtggctgccccttccaaagacctgacagggccggcaggtagaacatctt
+ cacccaaaaatttgagttcattgggtgatttgtaggcaaccacagttgaaatagcagaat
+ attcagcaaaagcaccattggaaggaaaccttacggaagatccgtgaatgaacccataaa
+ tataatcaccaatagaaaagtctttaggatcgacggctgggcccaatttgacaatttggc
+ cggcagcgtcacatcccagaatagatccttgaggcccgaccttgtagtcaatgtgtgccc
+ aatcagtcgggttaccagcaacagcgagtgtcttaatcaatacgaatccttcttccaatt
+ caggaatgggaacgccctctttgacaaccgctttaccgtcttcaatgacgacagccttca
+ tggtttctggaattgcaacttgcctgtttgtaggttatatgtaattgtcgttgataatct
+ gtattcagcttgccattttatagaatatggttagaaggatcttaagctagataaagaaaa
+ agaaaaaaatagaaaaaagcctgcccgctttataagaacttttctcaatatctgttgagg
+ aacacttagtaatattgtctatcatggttgctttattcttggtgatattactctctggga
+ ttcgatgtgttctcatcatctgaagggtttcctatgtgttgaaagcctatcccactagac
+ ggagccgcgcgccagccgggcggatccgagacatcttagtcagctgagacaagaaggcac
+ agctctcggacaggaactgctgtcatatgaaaacctaaaaatgtttcctggaaattccat
+ gactgcgcaaagtttgatacgtttattgttcttctctactaccgcactttctatctcgga
+ aacctttcaactgaatgaactattgttaattgtatggcgtgtggccttgctatttcagca
+ aatttagagtaagtaaactccgtcggaccagtatagtaacgaaacaagccattgattctg
+ cagcatattgcgaccaaaacctggaaattggtggctatgagttttaatttataattatat
+ gaagacaaagcttggaatgctgagtactttttcttcaatgctgctcactgcccagaagcg
+ ttcagaacaactctactggagccattctgcttgaaggaggataaaaatgccaagatcaag
+ tcttttaaagactctatcccttttcatcggaaaagcactacaagtttattctacagaagt
+ ggacaaactgtggaacttgatccatactgagtaagtatggaagactgttaacttggagga
+ tttttctctgctttctaaagaaggttcactgctttacactcacctaggttctcatgagta
+ ttgctagctagtgcgagtttgatctatacttcacagacttttaacaccccgttctttaag
+ ttttccctgattatcaagcggatttccaattcttattccccactatcgcacctttcttat
+ ctgaattgtatttttgagtggcctaactgtaatcttttattttgttttctaattattgaa
+ ggcggtttctcgaaacatttcacgatttccaaaattcaagaattgaaattatgaacatag
+ aagacaaaatgccttttttatcaactattatgaattctctagaaagtggttgcaaaggaa
+ ataaatatgtatttgctgagaggaaaagtttggaagaacaaaaagtatttctttgatgga
+ gctgattatcaagcattctttacggggagctttcgaaattttttatattcaaggacattt
+ atgttgtgtgggttattaaacgtgggactttggaaatacatccaagaagcacaattgagc
+ tctaacgatgaaccgttaccataaagatagagacatttcttttgacttagaagctacatg
+ gagtaccaactattaatacacctctcctagagaacatttttaaaatagaataaaataaag
+ tagcgctctcaactacccttacacttttttccttcatatctggctagaaaagagggatta
+ catctcggaatcaaagttttaatactgctactacgaactggttacaattctatttttatc
+ atacacgatactttgaacatcaacttctctcaataccatcgccttcagcggctacaatta
+ cgtaattgcaccaagcattcgccgagagtttacatatttgcagaccctaacagtatacga
+ ctcccaacacacgtttaatgttgtacgaattcaagagatggaaatatttcttccgcggaa
+ gccgtttaaagcagactgaatttaggcttagacaataagcttctgaacgagggtccaaat
+ ctaaaattcggatattctttttgaccgaaagcggaaacttcacaggcgcgcgtaaaaggg
+ ttttgcaaggcaccccggcacccctataattggcattcccacattctgcgcatacgaata
+ cacatgagtgccataaaacaatactccacatacgaaacctatgaaagatgtatgattcgt
+ tcagaaacaagagcatctccatagagataatgagattgtgtgaaagatgagatatggagg
+ atatgtgaggtgctaaatgagcgtctgttaacgaatcgtttaagtagtgatgacagatgg
+ aggaacgggtataaatagagctgcttcaactatatgcctttgaggtatgcgctttcattt
+ cactatttcttgttaagattacatcagcactaacaaacaaaacaaatacaatggtcaaat
+ taacttcaatcgctgctggtgttgccgccatcgctgccggtattgccgctgccccagcca
+ ctaccactctatctccatctgacgaaagggtcaacttggtcgaattgggtgtttacgtct
+ ccgatatcagagctcatttggctcaatactacttgtttcaagcagctcatccaactgaga
+ cctacccagttgagattgctgaagctgttttcaactatggtgacttcaccactatgttga
+ ctggtattccagctgaacaagtcaccagagtcatcactggtgtcccatggtactccacta
+ gattgagaccagccatctccagtgctctatctaaggacggtatctacactgctattccaa
+ aatagagacatgcctttactaatgaacttccatagacattgaaaaataacgaaagaataa
+ aaaaaaaatctttttttttttttgatataaatacagttgactatataaacatatatcaat
+ actataagcctatatttgatcaagacgacattatttaggtgcatcaacaacggaaaagaa
+ actttcttctaggctatccttctggggaagcagttgaaagtgctatccagtattattgca
+ tatattcttaatgagtagtgctgaagtttgatacgtttggctctgttgcttttatttctc
+ ccacgaaaataaattgatttctttatttttcagtgtgttggcccggagttgtttacacac
+ atgtctctttttggattaatgctgtagatcagggactatgcgagcgacaagtcagagtaa
+ ccatattgaatttttcgtctcctgttatacgaatatttgcatttctgtgcctacttatgt
+ gtgtatatgtttttattagtcagaaaatggagtaagtgttaaagtaaggatagtgttaag
+ gaggtaataactaaagatttgtaggataaggcggtgttatttgttgcaattttctcaggg
+ gtggagtggaaatggtttgattcaattatcagttaagcaaggaagagttgaagggcgaac
+ atttgcaataatcccaatgtcatttaatgcgggtcggctgagacgctcaatattcggccg
+ aaaagtatcattcggccgcataataaaatggaatagaagataaagcagatgtcactaatt
+ gagcaccaaaactctgagtagaaattatataaagaagcacgaacatgacctgctttagat
+ gttgcttgatattattgaaataaaaaaaaacatagaaccactgaaaaatacaaaaaaaaa
+ atgctttacccagaaaaatttcagggcatcggtatttccaacgcaaaggattggaagcat
+ cctaaattagtgagttttgacccaaaaccctttggcgatcatgacgttgatgttgaaatt
+ gaagcctgtggtatctgcggatctgattttcatatagccgttggtaattggggtccagtc
+ ccagaaaatcaaatccttggacatgaaataattggccgcgtggtgaaggttggatccaag
+ tgccacactggggtaaaaatcggtgaccgtgttggtgttggtgcccaagccttggcgtgt
+ tttgagtgtgaacgttgcaaaagtgacaacgagcaatactgtaccaatgaccacgttttg
+ actatgtggactccttacaaggacggctacatttcacaaggaggctttgcctcccacgtg
+ aggcttcatgaacactttgctattcaaataccagaaaatattccaagtccgctagccgct
+ ccattattgtgtggtggtattacagttttctctccactactaagaaatggctgtggtcca
+ ggtaagagggtaggtattgttggcatcggtggtattgggcatatggggattctgttggct
+ aaagctatgggagccgaggtttatgcgttttcgcgaggccactccaagcgggaggattct
+ atgaaactcggtgctgatcactatattgctatgttggaggataaaggctggacagaacaa
+ tactctaacgctttggaccttcttgtcgtttgctcatcatctttgtcgaaagttaatttt
+ gacagtatcgttaagattatgaagattggaggctccatcgtttcaattgctgctcctgaa
+ gttaatgaaaagcttgttttaaaaccgttgggcctaatgggagtatcaatctcaagcagt
+ gctatcggatctaggaaggaaatcgaacaactattgaaattagtttccgaaaagaatgtc
+ aaaatatgggtggaaaaacttccgatcagcgaagaaggcgtcagccatgcctttacaagg
+ atggaaagcggagacgtcaaatacagatttactttggtcgattatgataagaaattccat
+ aaatagtctatatacgtaatatttttcagaatctcttaaaatgcatgtcaaatctcggaa
+ tttatcatgacccattacataacaagagaaaagtggtttgcattgtatgtataccttaaa
+ ccaattcttcggtagtcacatgacctatatatagggacattcatttcttgaataggatat
+ ttcttaaacttaagggaaattttgatgaaaatttagaatctctcgacaacagtgcagagc
+ agtaagttttattcaatacgaaatgtgtgcagggttttacccattcatgtcgagtacgtc
+ tacatttgttctgcgatttaacttaatacttagcctactcatatgtatcgaagccaaact
+ attaaatctgaaaactgaatacccattaagtaggtacattaatgcaattttcgtccgaaa
+ ggtagagtgttaatcatgcttttgcttcgaaatctagagtccgagcatgtctgatgaggt
+ cctcttaaagcacttcaattttaaaagagaagagcgcaaatttttacaaagaatgcctat
+ attaatatactgtgctatctaagaggatggttcaatggtttgaaatgagttgaaaaataa
+ agatttacatagtagcgtgtattcttcacatttttgcattcggtagattttctactcaag
+ acccgtgcagatcactattacactcattattacaagtcattgagaacgaaattttgacat
+ cattactaataatgttactcattaatactttattaatgtaagcggtagataaacaaatga
+ acatgaaattggctgctgaaatgctctgatggattcgattacagtaaaaaaacctcggtt
+ aagattggtttgcctgcaatgcaaaaagatcaaacggaaatgtgataaactgcggcctgc
+ ttgctcgcgatgccaacaaaattcattacagtgtgaatatgaagagagaacagatttatc
+ tgccaatgttgcagcaaacgactctgatggattcaattcctctcataagctcaatttcga
+ acagcaacctgtacttgaaaggactgggcttagatattccttacaagtgcctgaaggtgt
+ cgttaatgctacgctgtcgatatggaacgccgaagatatgctagttatagtaggattagt
+ tacatttctggattatccttttgctgcgcatagtctggcgcaacatgaccagtatatcag
+ ggcactttgtgcttcgttgtacggcatggcgcttgttgactttagcaattatgctaatgg
+ tattccttgtgaagacacatcaagaagtatactaggaccattgtcattcatagaaaaggc
+ catttttagacggatagaacatagtaagcaatttcgagttcagtctgccgccttagggtt
+ attatacaatgcattttcaatggaagaagaaaacttctcgactcttctaccgtcactcat
+ cgctgaagtggaagacgtgttgatgcaaaaaaaagactgtgaaatacttttgaggtgttt
+ ctatcaaaatatttatcccttctatccttttatggacatttcactctttgagagcgatct
+ cactagtttgcttttacaagacgacaataatcgttggaaaattagtactgaagttaaaaa
+ tgtgcgcaaaaaaatagaaactttgtcattacttacaatagtaatggccatggccttgat
+ gcattcaaaattggatgcaaatcttctttcaatggtaaaagaaaatgcctccgaaagtgc
+ caggaaactttctcttttatgtcataaactattatgcctcctggatgtatttcgctatcc
+ aaatgagaacacttttacttgccttttatatttctacgtttcagagcatttagatcccga
+ gagtcccgattgtgtactgagccccactaacttgcttactctgcaccatcttttaaattt
+ gtccatgaccttaggtcttcaatatgagccttcgaagtacaaacgtttcaaagatccaga
+ agtgataaggcagagacggatattatggttaggagttcagtcattactttttcaaatttc
+ tcttgctgaaggtgatgctggtaaatcaaatagtgaatatatggaggcatatttaacaga
+ cttcgaagaatatattgaagcttcctcagagtatgaaaaaagttctgcgagtgaatcgaa
+ cgtgcaaatgaatgatattgtttggaataagtacaaatttcacgtcattttgagtaaact
+ aatgtctgattgcacttcagttatacaacatccgcagcttttccacattttaggaaatat
+ taaaagatctgaagattttatggctgagaactttcctacaagttcgatttaccaacccct
+ tcatgaaaaggaaccaaatgcgatcaaagttggcaaaagtacggttctcgatgtcatgga
+ tattcaaaaaactgaaatatttcttacaaatattgtgggaagtatgtgttttttaaacat
+ ttttgatgtcctatcgttacattttgaaaaaaaatgtgttatgcactgggaagaatatga
+ aaagaactatcatttccttactttgaaaagtttcaatgcatacttaaagctagcagggtt
+ gatatctgattatctcgagaataagtttcaagggaacattttagagagtcgcggttatat
+ catagataaacaaatatgttttatgcttgtaaggatctggatgttccaatgtcgtatttt
+ gttaaggttttcatacaagcaagaaagtcagaaaaaattggcctcttccagtatatccac
+ taacgataatgaaaaagaagatgaaatgattgtcattttagaaagacttattaaacacat
+ tcgtaaccaaatggcacatttagtggatctagcaaagggaaaacttcaagatagttactt
+ tggtgcttaccaaactgttcccatgtttagatacgttgtgtatttgatcgatgttggcgg
+ cttagtatctgtgacaaatgggttttgggataagatttccagtgatggtgaaataccgcc
+ aaaagtacaacaagccgtgagattgaaatggggattggactgcaataattcgagaagaat
+ caaacaaaagttaataagcagccagagtttgcagagtttcaatcaagttctgttgtgcca
+ gatggaggatgcagttctctccagttccttcgcaataaaagccaataccgctatgtccca
+ aaacacggctgaagaatttttcaatatcagcgaagaagaggctttaaatcaactattgga
+ aaacaacaattttgatgccttctgggatttattaggtgaaaatctgagcgatatgccttc
+ tttgtgagaatacggatatttggaaaggctaaaaaaaaaaaaacaagccaaaaacaagaa
+ acactgttatcctccacgttttttccactgttttaagactcgacaagacggaaagattcc
+ tcacatttcgcagtgtttcttttaacctgctcgtcaatttgaaaaaatgagcatggttag
+ gcaggcggaatgtaataggtgcggtgcgtataaataagtatctcggccgttcctctattg
+ cgctcggccgagatgtcctacatgcataaaatagattaaaaaaagggatcacgatactca
+ aatatgcatataataacaacaacacactttgcggtaaatagaaggataatatcgtaatct
+ tttgctggtttcgatgatgacttgaaaaaattcaattactctttcgatcatctagaagaa
+ gcatatccatcgtgcttaaaatgattgggtccgcgtccgactcatctagcaagttaggac
+ gcctccgatttctttctgaaactgccgctattaaagtatccccgttaatcctaggagaag
+ tctcatacgatggagctcgttcggattttctcaaatcaatgaacaagaatcgagcttttg
+ aattgcttgatactttttacgaggcaggtggaaatttcattgatgccgcaaacaactgcc
+ aaaacgagcaatcagaagaatggattggtgaatggatacagtccagaaggttacgtgatc
+ aaattgtcattgcaaccaagtttataaaaagcgataaaaagtataaagcaggtgaaagta
+ acactgccaactactgtggtaatcacaagcgtagtttacatgtgagtgtgagggattctc
+ tccgcaaattgcaaactgattggattgatatactttacgttcactggtgggattatatga
+ gttcaatcgaagaatttatggatagtttgcatattctggtccagcagggcaaggtcctct
+ atttgggtgtatctgatacacctgcttgggttgtttctgcggcaaactactacgctacat
+ cttatggtaaaactccctttagtatctaccaaggtaaatggaacgtgttgaacagagatt
+ ttgagcgtgatattattccaatggctaggcatttcggtatggccctcgccccatgggatg
+ tcatgggaggtggaagatttcagagtaaaaaagcaatggaggaacggaggaagaatggag
+ agggtattcgttctttcgttggcgcctccgaacaaacagatgcagaaatcaagattagtg
+ aagcattggccaagattgctgaggaacatggcactgagtctgttactgctattgctattg
+ cctatgttcgctctaaggcgaaaaatttttttccgtcggttgaaggaggaaaaattgagg
+ atctcaaagagaacattaaggctctcagtatcgatctaacgccagacaatataaaatact
+ tagaaagtatagttccttttgacatcggatttcctaataattttatcgtgttaaattcct
+ tgactcaaaaatatggtacgaataatgtttagataatttttcagtaatcaactacgcaag
+ taaagcagtaaatacgttactgctggtattaatgtcatgtattgaggcaatgatgctatg
+ ctcttaacgacatgtagcttacaaaacgttcctatggttttactgcatggtatttacaaa
+ ttagatagcagttccatccgcccttgacatattattattcagacaaggtgtatatgagca
+ taaatatgtatatatgcacatgaggtttgtattaaccttgcaactgtaccaaaatacaat
+ cttctttgctgctatttaacggcgtatgtgcagttcataaatgcgtgttctgtatagtac
+ atccgcgcacattcttcctagcggaagacatattaacgtagcccgtacgccctggttaag
+ acttggtatagttccaataattggaatattactgcatagtggtccaatagcaggatttag
+ cataaacatgataatatttagagatgctttcatccctctgacgaaggtggtagagaacaa
+ gaaaatgaataacattatatcatattcctatatatatatatatatatatatatatatcat
+ ataacggtggaaaatcccggcgatattacagataaacattacacccgcatgaatgtgagc
+ cactactatattataacttagtgaataaagagtgttacaatagtgaggtgcatattattc
+ taatgtaaggtctgtataagtacgaaatattctgaagtggcatccagtcaagcaaatggt
+ aatattaaggaacttttaagttaatgacgtcatggtagtgctcgtacttcaagtcaaagt
+ gtttatgtattatggttgaagaatagaatatttttatgtttaggtgattttagtggtgat
+ ttttctgtaatattgacataagtgtatataaattaagtggttagtatacggtgaaaaaga
+ ggtataacgtatgtattaagggaatttatacgatatttgggcccgccgaatgagatatag
+ atattaaaatgtggataatcgtgggctttatgggtaaatggcacagggtatagaccgctg
+ aggcaagtgccgtgcataatgatgtgggtgcatttggtactgatttagtgagaatgggcc
+ atggattggagtgtgagagtagggtaacttgagagtggtatatactgtagcatccgtgtg
+ cgtatgccccatcaatataagtgaaggtgagtatggcatgtggtggtggtatagagtggt
+ agggtaagtatgtatgtattatttacgatccggggaattcttacctataatagcggctcg
+ ctgtccgaacgattgcgccgcacttttctgcctcttacattcccttttcctaaggcgata
+ agcaggctgaatggaaactacgatataacgcagtagaggtcccccaagagctggaaagaa
+ aatattggtttgatagccagcactattctatccagtgggctcttgtaccattttctgtgg
+ tatggctttgattcaactgcatacgtagttaacgcgatcagccgcacatgggcattatca
+ ttttcaggggtactggtaaaccctattgtttagaaagaatacctggtaagcgggtctgtc
+ gagtctttcccaactaattttgaagaaaaaatctttacctagtttgagaataatcgt
diff --git a/src/squirrel-1.4/tables/README b/src/squirrel-1.4/tables/README
new file mode 100644
index 0000000..ff69b38
--- /dev/null
+++ b/src/squirrel-1.4/tables/README
@@ -0,0 +1,5 @@
+ Dir: $SQUIRREL/tables/
+Date: Fri May 07 12:42:56 BST 1993
+
+This directory contains squirrel databases. Please read
+SysManagement.
\ No newline at end of file
diff --git a/src/squirrel-1.4/tables/SysManagement b/src/squirrel-1.4/tables/SysManagement
new file mode 100644
index 0000000..70b7aec
--- /dev/null
+++ b/src/squirrel-1.4/tables/SysManagement
@@ -0,0 +1,259 @@
+			    SQUIRREL v1.3
+			  System Management
+
+			      Simon Dear
+			     29 June 1992
+
+
+0. Introduction
+---------------
+
+Squirrel is a complicated script that gathers together information
+about an experiment and then assembles the processed sequences into a
+user's database. The philosophy behind squirrel is that there should
+be minimal user interaction. In actuality, all that need be entered is
+little more than the name of directory containing data to be
+processed.
+
+Ease of use for the user often means extra work for someone else. With
+squirrel this is no exception. Squirrel requires the maintenance of
+several small files. This document describes these files and the
+procedure for keeping them updated.
+
+Squirrel needs to gather additional information about:
+
+  a) subclone libraries
+  b) vectors used to clone each cosmid
+  c) sequencing vector clone sites
+  d) files containing vector sequence
+  e) user's databases
+
+For a sequencing project the data on (b)-(d) will be fairly static and
+quite small. Data on (a) will be frequently updated as new libraries
+are going to be continually created. Data on (e) will be updated
+occasionally; every time new databases are started.
+
+For C-Shell programming simplicity and uniformity the data is
+encapsulated in simple awk programs.
+
+We sometimes need to look up information which is a generalisation
+about a specific example. For example, we may be want data on a
+subclone e06b11 where data is held for all subclones starting e06. The
+pattern matching abilities of awk are ideal for this.
+
+We also sometime need to determine information about a specific
+example, where it is not possible to make general inferences using
+simple rules.  An example is data on (b), which reflects the
+changing strategies during genome mapping. Building this data into an
+awk program is a very simple task. See Appendix 1.
+
+However, is it not desirable to maintain awk programs. Small typing
+mistakes can result in a program with severe errors.  For (a), and
+(c)-(e) it is more convenient to enter the data into a text file, and
+have the awk program automatically created from it using the UNIX make
+facility. In the case of (b), there is no escaping having an awk
+program. It is therefore fortunate that this data will not
+substantially change during the lifetime of the sequencing project.
+
+The text files for (a)-(e) can be found in the directory
+
+	~wormpub/BioSW/squirrel/tables
+
+During the rest of this document it is assumed that the person
+performing the updates is logged in as user wormpub and that they have
+changed to this directory.
+
+
+
+
+1. Updating information on subclone libraries
+---------------------------------------------
+
+The information on subclone libraries can be updated in two ways.
+
+First, the file ``subclones'' can be modified using an editor. It is
+important that information is entered in exactly the right format. The
+fields of this file are:
+
+    <microtitre-dish> <cosmid> <insert-size> <fragmentation-method> ...
+    <sequencing-vector> <comments...>
+
+An example:
+    e50 F54C8 1000..2000 shear M13mp18
+
+Alternatively the program update_subclones can be used. It provides
+sensible default values for all the fields and checks that the
+data specified is valid. The program can be run by any user from any
+directory. It assumes that the environment variable SUBCLONES holds
+the full path name of the file ``subclones''.
+
+Here is an example of an update of the subclone file using
+update_subclones. Default values are given in square brackets.
+Responding with a question mark ('?') at any time provides information
+of what is expected.
+
+
+    Maintain subclone database
+
+    Database=/nfs/al/hgmp0/wormpub/BioSW/staden/tables/subclones
+
+    Microtitre dish [e50] ?
+    Cosmid [F54C8] ?
+    Method of fragmentation [shear] ?
+    Vector [M13mp18] ?
+    Size range [1000..2000] ?
+    Comment #1 [] ?
+
+    Here are the values you entered:
+
+    Microtitre dish number: e50
+    Cosmid:                 F54C8
+    Fragmentation method:   shear
+    Vector:                 M13mp18
+    Size range:             1000..2000
+
+    Are these values correct [Yes] ?
+
+
+Once the file has been updated it is important to remake the
+awk program required by squirrel. To do this, type:
+
+	make all
+
+
+
+
+2. Updating information on vectors used to clone each cosmid
+------------------------------------------------------------
+
+Information on which vectors are used for each cosmid can be found in
+the file ``clone_types.awk''. This information is be updated using an
+editor, though it is unlikely that it will change much through the
+course of the sequencing project. The data is only encapsulated in an
+awk program; there is no text file. It is important to update this file
+correctly and to ensure that no errors are introduced into the awk
+program. The program can be tested for errors by typing something
+like:
+
+	echo TEST | awk -f clone_types.awk
+
+
+
+
+3. Updating information on sequencing vector clone sites
+--------------------------------------------------------
+
+Information on sequencing vector clone sites is held in the file
+``seqclone-sites'', and is updated by manually editing the text file.
+The format of the data is:
+
+	<clone> <clone-site> <forward-primer-site> <reverse-primer-site>
+
+An example:
+	pBS     2305 46 -79
+
+The positions relate to the sequence held in the file specified in the
+``vector-seqfiles'' files. The clone-site is defined as the base
+position of the 3' end of the cloning site The primer sites are positions
+relative to this.
+
+To make the awk program required by squirrel, type:
+	make all
+
+It is important that this information is correct and that the sequence in
+the sequence file is in the correct orientation. Otherwise, the vector
+will not be successfully removed from sequence processed by squirrel.
+
+
+
+
+
+4. Updating information on files containing vector sequence
+-----------------------------------------------------------
+
+Information of the files containing vector sequence is help in the file
+``vector-seqfiles'', and is updated by manually editing the text file.
+The format is:
+
+	<vector> <file-name>
+
+An example:
+	pBS     /nfs/al/pubseq/pubseq/inhouse/sun-nematode/squirrel/seqs/blue.vep
+
+To make the awk program required by squirrel, type:
+	make all
+
+Note that the file name includes the full path name. All working versions
+of sequence files required by squirrel are kept in the $SQUIRREL/seqs
+directory.
+
+
+
+
+5. Updating information on user's databases
+-------------------------------------------
+
+User's databases must be registered with squirrel by including an
+entry for the project in the file ``databases''. The format is:
+
+	<cosmid> <directory> <name> <version> <database_type> <owner>
+
+An example:
+	C38C10 C38C10 C38C10 0 bap kt
+
+The lookup file is remade by typing:
+	make all
+
+The directory is relative to the owner's home directory. The database type
+is currently one of "dap" or "bap". So the database for kt's cosmid C38C10
+is ~kt/C38C10/C38C10.??0.
+
+
+A1. Vectors used to clone each cosmid
+-------------------------------------
+
+#clone_types.awk
+#
+# Determine Clone types from Clone names
+#
+# 3 February 1992, Simon Dear
+#
+# Assumes that there is one clone name per line.
+# Outputs to stdout clone vector.
+#
+# Data in this program has been taken from "Clone types in the genome map
+# of Caenorhabditis elegans" courtesy by John Sulston. This needs
+# revision.
+#
+# Possible usage:
+#	 set clone-type = `echo $clone | awk -f clone-types.awk`
+#
+/^A/		{ print "Lambda"; next; }
+/^ZL/		{ print "Lambda"; next; }
+/^YSL/		{ print "Lambda"; next; }
+/^[BCDERM]/	{ print "pJB8"; next; }
+/^ZC/		{ print "pJB8"; next; }
+/^K/		{ print "LoristB"; next; }
+/^T/		{ print "Lorist2"; next; }
+/^W/		{ print "Lorist4"; next; }
+/^F/		{ print "Lorist6"; next; }
+/^ZK/		{
+			num = substr($1,3) + 0.0;
+			if (num <   15) { print "unknown"; next; }
+			if (num <=  56) { print "Lorist6"; next; }
+			if (num <= 130) { print "Lorist2"; next; }
+			if (num <= 164) { print "Lorist6"; next; }
+			if (num <= 177) { print "Lorist2"; next; }
+			if (num <= 192) { print "Lorist6"; next; }
+			if (num <= 344) { print "Lorist2"; next; }
+			if (num <= 354) { print "Lorist6"; next; }
+			if (num <= 514) { print "Lorist2"; next; }
+			if (num <= 552) { print "Lorist6"; next; }
+			if (num <= 596) { print "Lorist2"; next; }
+			if (num <= 616) { print "Lorist?"; next; }
+			if (num <= 626) { print "Lorist2"; next; }
+			if (num <= 692) { print "Lorist6"; next; }
+			if (num <= 756) { print "Lorist2"; next; }
+			if (num >= 757) { print "Lorist6"; next; }
+		}
+/^Y/		{ print "pYAC4"; next; }
diff --git a/src/squirrel-1.4/tables/all_pages b/src/squirrel-1.4/tables/all_pages
new file mode 100644
index 0000000..744133c
--- /dev/null
+++ b/src/squirrel-1.4/tables/all_pages
@@ -0,0 +1,22 @@
+#! /bin/csh -f
+set FORMAT = ~wormpub/BioSW/staden/tables/format.awk
+set SUBCLONEDB = subclones
+if (-e subclones) then
+	set SUBCLONEDB = subclones
+else
+	if ($?SUBCLONES) then
+		set SUBCLONEDB = $SUBCLONES
+		if (! -e $SUBCLONEDB) then 
+			echo "Cannot open file $SUBCLONEDB"
+			exit 1
+		endif
+	else
+		echo "Cannot open file"
+		exit 1
+	endif
+endif
+
+echo "Printing $SUBCLONEDB on lp"
+
+awk -f $FORMAT $SUBCLONEDB | tbl | ptroff -Plp
+exit 0
diff --git a/src/squirrel-1.4/tables/clone-types.lookup b/src/squirrel-1.4/tables/clone-types.lookup
new file mode 100644
index 0000000..1b2770f
--- /dev/null
+++ b/src/squirrel-1.4/tables/clone-types.lookup
@@ -0,0 +1,46 @@
+#
+# Database to determine clone types from clone names
+#
+# Example:
+# 
+# This is an example from the C.elegans sequencing project.
+#
+# Assumes that there is one clone name per line.
+# Outputs to stdout clone vector.
+#
+# Data in this program has been taken from "Clone types in the genome map
+# of Caenorhabditis elegans" courtesy by John Sulston. This needs
+# revision.
+#
+# Possible usage:
+#	 set clone-type = `echo $clone | awk -f clone-types.awk`
+#
+/^A/		{ print $1,"Lambda"; next; }
+/^ZL/		{ print $1,"Lambda"; next; }
+/^YSL/		{ print $1,"Lambda"; next; }
+/^[BCDERM]/	{ print $1,"pJB8"; next; }
+/^ZC/		{ print $1,"pJB8"; next; }
+/^K/		{ print $1,"LoristB"; next; }
+/^T/		{ print $1,"Lorist2"; next; }
+/^W/		{ print $1,"Lorist4"; next; }
+/^F/		{ print $1,"Lorist6"; next; }
+/^ZK/		{
+			num = substr($1,3) + 0.0;
+			if (num <   15) { print $1,"unknown"; next; }
+			if (num <=  56) { print $1,"Lorist6"; next; }
+			if (num <= 130) { print $1,"Lorist2"; next; }
+			if (num <= 164) { print $1,"Lorist6"; next; }
+			if (num <= 177) { print $1,"Lorist2"; next; }
+			if (num <= 192) { print $1,"Lorist6"; next; }
+			if (num <= 344) { print $1,"Lorist2"; next; }
+			if (num <= 354) { print $1,"Lorist6"; next; }
+			if (num <= 514) { print $1,"Lorist2"; next; }
+			if (num <= 552) { print $1,"Lorist6"; next; }
+			if (num <= 596) { print $1,"Lorist2"; next; }
+			if (num <= 616) { print $1,"Lorist?"; next; }
+			if (num <= 626) { print $1,"Lorist2"; next; }
+			if (num <= 692) { print $1,"Lorist6"; next; }
+			if (num <= 756) { print $1,"Lorist2"; next; }
+			if (num >= 757) { print $1,"Lorist6"; next; }
+		}
+/^Y/		{ print $1,"pYAC4"; next; }
diff --git a/src/squirrel-1.4/tables/create.lookup b/src/squirrel-1.4/tables/create.lookup
new file mode 100644
index 0000000..c584625
--- /dev/null
+++ b/src/squirrel-1.4/tables/create.lookup
@@ -0,0 +1,3 @@
+#! /bin/csh -f
+#
+sed 's/^\([^ 	]*\)/\/^&\/ {print "&/;s/$/";}/' $1 >! $2
diff --git a/src/squirrel-1.4/tables/cron-job b/src/squirrel-1.4/tables/cron-job
new file mode 100644
index 0000000..021c32f
--- /dev/null
+++ b/src/squirrel-1.4/tables/cron-job
@@ -0,0 +1,6 @@
+#! /bin/sh
+
+if test -d /nfs/al/pubseq/pubseq/inhouse/sun-nematode/squirrel/tables ; then
+    cd /nfs/al/pubseq/pubseq/inhouse/sun-nematode/squirrel/tables
+    make all
+fi
diff --git a/src/squirrel-1.4/tables/databases b/src/squirrel-1.4/tables/databases
new file mode 100644
index 0000000..dac0099
--- /dev/null
+++ b/src/squirrel-1.4/tables/databases
@@ -0,0 +1,18 @@
+#
+# A database of Staden databases currently using squirrel:
+#
+# Format:
+#
+#    <clone> <directory> <project> <version> <database_type> <owner>
+#
+# Example:
+#
+# We are sequencing clone CLONE. We have created a bap database 
+# called T in the subdirectory TEST of user sd's home directory.
+# (Database files would be called ~sd/TEST/T.{SQ,AR,RL,TG,CC}0)
+# CLONE would be the name of the project in the edited version of
+# the script 'squirrel' that has been copied to ~sd/TEST/.
+#
+# CLONE TEST T 0 bap sd
+#
+TESTDB TESTDB TESTDB 0 bap sd
diff --git a/src/squirrel-1.4/tables/databases.lookup b/src/squirrel-1.4/tables/databases.lookup
new file mode 100644
index 0000000..5b93807
--- /dev/null
+++ b/src/squirrel-1.4/tables/databases.lookup
@@ -0,0 +1,18 @@
+/^#/ {print "#";}
+/^#/ {print "# A database of Staden databases currently using squirrel:";}
+/^#/ {print "#";}
+/^#/ {print "# Format:";}
+/^#/ {print "#";}
+/^#/ {print "#    <clone> <directory> <project> <version> <database_type> <owner>";}
+/^#/ {print "#";}
+/^#/ {print "# Example:";}
+/^#/ {print "#";}
+/^#/ {print "# We are sequencing clone CLONE. We have created a bap database ";}
+/^#/ {print "# called T in the subdirectory TEST of user sd's home directory.";}
+/^#/ {print "# (Database files would be called ~sd/TEST/T.{SQ,AR,RL,TG,CC}0)";}
+/^#/ {print "# CLONE would be the name of the project in the edited version of";}
+/^#/ {print "# the script 'squirrel' that has been copied to ~sd/TEST/.";}
+/^#/ {print "#";}
+/^#/ {print "# CLONE TEST T 0 bap sd";}
+/^#/ {print "#";}
+/^TESTDB/ {print "TESTDB TESTDB TESTDB 0 bap sd";}
diff --git a/src/squirrel-1.4/tables/format.awk b/src/squirrel-1.4/tables/format.awk
new file mode 100644
index 0000000..7527695
--- /dev/null
+++ b/src/squirrel-1.4/tables/format.awk
@@ -0,0 +1,58 @@
+BEGIN {
+    entries_per_page = 50;
+    PP = 0;
+}
+{
+    if ( (NR % entries_per_page) == 1 ) {
+	if (PP)	print ".bp";
+	PP++;
+	print "Page - " PP;
+	#
+	print ".sp 3";
+	print ".TS";
+	print "box, tab (%);";
+	print "c s   s   s   s";
+	print " c | c | c | c | c | c";
+	print " c | c | c | c | c | c";
+	print " l | l | l | l | l | l.";
+	print "Cosmid Subclone Libraries";
+	print "=";
+	print "Microtitre%Cosmid%Size Range%Fragmentation%Vector%Comments" ;
+	print "Dish Number%%(bases)%Method%";
+	print "=";
+    }
+
+    count = 0;
+    if (NF > 5) {
+	line = $0;
+	s = index(line,"(");
+	t = index(line,")");
+	while (s!=0 && t!=0) {
+	    comment = substr(line,s+1,t-s-1);
+	    line = substr(line,t+1);
+	    if (count==0) {
+		print $1 "%" $2 "%" $3 "%" $4 "%" $5 "%" comment;
+	    } else {
+		print "%%%%%" comment;
+	    }
+	    count++;
+	    s = index(line,"(");
+	    t = index(line,")");
+	}
+	if (count>1) print "%%%%%_";
+    }
+    if (count == 0)
+	print $1 "%" $2 "%" $3 "%" $4 "%" $5 "%";
+
+    if ( (NR % entries_per_page) == 0 ) {
+	print ".TE";
+    }
+}
+
+END {
+    if ( (NR % entries_per_page) != 0 ) {
+	# fill up page
+	for(i=NR; i % entries_per_page; i++) print "%%%%%";
+	print ".TE";
+    }
+}
diff --git a/src/squirrel-1.4/tables/last_page b/src/squirrel-1.4/tables/last_page
new file mode 100644
index 0000000..7b6b71f
--- /dev/null
+++ b/src/squirrel-1.4/tables/last_page
@@ -0,0 +1,27 @@
+#! /bin/csh -f
+set FORMAT = ~wormpub/BioSW/staden/tables/format.awk
+set SUBCLONEDB = subclones
+if (-e subclones) then
+	set SUBCLONEDB = subclones
+else
+	if ($?SUBCLONES) then
+		set SUBCLONEDB = $SUBCLONES
+		if (! -e $SUBCLONEDB) then 
+			echo "Cannot open file $SUBCLONEDB"
+			exit 1
+		endif
+	else
+		echo "Cannot open file"
+		exit 1
+	endif
+endif
+
+echo "Printing last page of $SUBCLONEDB on lp"
+
+set a = `wc -l $SUBCLONEDB`
+# entries_per_page is also explicitly defined in $FORMAT
+set entries_per_page = 50
+@ b = ( ( ( $a[1] - 1 ) / $entries_per_page ) * $entries_per_page ) + 1
+tail +$b $SUBCLONEDB | awk -f $FORMAT | grep -v "^Page -" | tbl | ptroff -Plp
+
+exit 0
diff --git a/src/squirrel-1.4/tables/lookup.csh b/src/squirrel-1.4/tables/lookup.csh
new file mode 100644
index 0000000..277de3e
--- /dev/null
+++ b/src/squirrel-1.4/tables/lookup.csh
@@ -0,0 +1,6 @@
+#! /bin/csh -f
+#
+# Usage:
+#      lookup key file
+#
+echo $1 | awk -f $2
diff --git a/src/squirrel-1.4/tables/makefile b/src/squirrel-1.4/tables/makefile
new file mode 100644
index 0000000..dcbff53
--- /dev/null
+++ b/src/squirrel-1.4/tables/makefile
@@ -0,0 +1,22 @@
+#
+# Make look up tables for files
+#
+TABLES= \
+	databases.lookup \
+	seqclone-sites.lookup \
+	subclones.lookup \
+	vector-seqfiles.lookup \
+	screens.lookup
+
+
+%.lookup : %
+	@echo ""
+	@echo Building $<.lookup
+	@rm -f $<.lookup
+	@./create.lookup $< $<.lookup
+
+all: $(TABLES)
+
+clean:
+	rm -f $(TABLES)
+
diff --git a/src/squirrel-1.4/tables/screens b/src/squirrel-1.4/tables/screens
new file mode 100644
index 0000000..4a58f8d
--- /dev/null
+++ b/src/squirrel-1.4/tables/screens
@@ -0,0 +1,13 @@
+#
+# Database of sequences to screen against clones
+#
+# Format:
+#
+#     <clone> <sequence-file>
+#
+# Example:
+#
+# We wish to screen our cosmid CLONE against yeast chromosome 3.
+#
+# CLONE scchrIII.seq
+#
diff --git a/src/squirrel-1.4/tables/screens.lookup b/src/squirrel-1.4/tables/screens.lookup
new file mode 100644
index 0000000..3f35816
--- /dev/null
+++ b/src/squirrel-1.4/tables/screens.lookup
@@ -0,0 +1,13 @@
+/^#/ {print "#";}
+/^#/ {print "# Database of sequences to screen against clones";}
+/^#/ {print "#";}
+/^#/ {print "# Format:";}
+/^#/ {print "#";}
+/^#/ {print "#     <clone> <sequence-file>";}
+/^#/ {print "#";}
+/^#/ {print "# Example:";}
+/^#/ {print "#";}
+/^#/ {print "# We wish to screen our cosmid CLONE against yeast chromosome 3.";}
+/^#/ {print "#";}
+/^#/ {print "# CLONE scchrIII.seq";}
+/^#/ {print "#";}
diff --git a/src/squirrel-1.4/tables/seqclone-sites b/src/squirrel-1.4/tables/seqclone-sites
new file mode 100644
index 0000000..8b15bc7
--- /dev/null
+++ b/src/squirrel-1.4/tables/seqclone-sites
@@ -0,0 +1,20 @@
+#
+# Database of sequencing clone cloning and priming sites
+#
+#    <clone> <clone-site> <forward-primer-site> <reverse-primer-site>
+#
+# NOTES:
+#    clone-site is defined as the base position of the 3' end
+#    of the cloning site
+#
+# Example:
+#
+# Our sequencing vector is M13mp18, our cloning site is SmaI and
+# we are using the -20 forward sequencing primer and the -21
+# reverse sequencing primer. With respect to the seqeunce file
+# $SQUIRREL/seqs/M13mp18.seq we would have:
+#
+# M13mp18 6249 41 -24
+#
+M13mp18 6249 41 -24
+pBS     2305 46 -79
diff --git a/src/squirrel-1.4/tables/seqclone-sites.lookup b/src/squirrel-1.4/tables/seqclone-sites.lookup
new file mode 100644
index 0000000..92761bf
--- /dev/null
+++ b/src/squirrel-1.4/tables/seqclone-sites.lookup
@@ -0,0 +1,20 @@
+/^#/ {print "#";}
+/^#/ {print "# Database of sequencing clone cloning and priming sites";}
+/^#/ {print "#";}
+/^#/ {print "#    <clone> <clone-site> <forward-primer-site> <reverse-primer-site>";}
+/^#/ {print "#";}
+/^#/ {print "# NOTES:";}
+/^#/ {print "#    clone-site is defined as the base position of the 3' end";}
+/^#/ {print "#    of the cloning site";}
+/^#/ {print "#";}
+/^#/ {print "# Example:";}
+/^#/ {print "#";}
+/^#/ {print "# Our sequencing vector is M13mp18, our cloning site is SmaI and";}
+/^#/ {print "# we are using the -20 forward sequencing primer and the -21";}
+/^#/ {print "# reverse sequencing primer. With respect to the seqeunce file";}
+/^#/ {print "# $SQUIRREL/seqs/M13mp18.seq we would have:";}
+/^#/ {print "#";}
+/^#/ {print "# M13mp18 6249 41 -24";}
+/^#/ {print "#";}
+/^M13mp18/ {print "M13mp18 6249 41 -24";}
+/^pBS/ {print "pBS     2305 46 -79";}
diff --git a/src/squirrel-1.4/tables/subclones b/src/squirrel-1.4/tables/subclones
new file mode 100644
index 0000000..eb07145
--- /dev/null
+++ b/src/squirrel-1.4/tables/subclones
@@ -0,0 +1,15 @@
+#
+# Database of subclone libraries:
+#
+# Format:
+#
+#   <microtitre-dish> <clone> <size-range> <frag method vector>
+#
+# Example:
+#
+# Microtitre dish a01 contains a subclone library for clone CLONE.
+# It was created by shearing then cloning into M13mp18. Fragment
+# size if selected on range 1000 to 2000 bases.
+#
+# a01 CLONE 1000..2000 shear M13mp18
+#
diff --git a/src/squirrel-1.4/tables/subclones.lookup b/src/squirrel-1.4/tables/subclones.lookup
new file mode 100644
index 0000000..2195528
--- /dev/null
+++ b/src/squirrel-1.4/tables/subclones.lookup
@@ -0,0 +1,15 @@
+/^#/ {print "#";}
+/^#/ {print "# Database of subclone libraries:";}
+/^#/ {print "#";}
+/^#/ {print "# Format:";}
+/^#/ {print "#";}
+/^#/ {print "#   <microtitre-dish> <clone> <size-range> <frag method vector>";}
+/^#/ {print "#";}
+/^#/ {print "# Example:";}
+/^#/ {print "#";}
+/^#/ {print "# Microtitre dish a01 contains a subclone library for clone CLONE.";}
+/^#/ {print "# It was created by shearing then cloning into M13mp18. Fragment";}
+/^#/ {print "# size if selected on range 1000 to 2000 bases.";}
+/^#/ {print "#";}
+/^#/ {print "# a01 CLONE 1000..2000 shear M13mp18";}
+/^#/ {print "#";}
diff --git a/src/squirrel-1.4/tables/vector-seqfiles b/src/squirrel-1.4/tables/vector-seqfiles
new file mode 100644
index 0000000..a643a83
--- /dev/null
+++ b/src/squirrel-1.4/tables/vector-seqfiles
@@ -0,0 +1,14 @@
+# database for mapping of vector to sequence file
+#
+# Format:
+#        <vector> <file-name>
+#
+Lambda  lambda.seq
+Lorist2 lorist2.vep
+Lorist6 lorist6.vep
+LoristB loristB.seq
+M13mp18 m13mp18.vec
+M13mp7  m13mp7.seq
+pBS     blue.vep
+pJB8    pjb8.vep
+pUC18   puc18.vec
diff --git a/src/squirrel-1.4/tables/vector-seqfiles.lookup b/src/squirrel-1.4/tables/vector-seqfiles.lookup
new file mode 100644
index 0000000..c30cfc1
--- /dev/null
+++ b/src/squirrel-1.4/tables/vector-seqfiles.lookup
@@ -0,0 +1,14 @@
+/^#/ {print "# database for mapping of vector to sequence file";}
+/^#/ {print "#";}
+/^#/ {print "# Format:";}
+/^#/ {print "#        <vector> <file-name>";}
+/^#/ {print "#";}
+/^Lambda/ {print "Lambda  lambda.seq";}
+/^Lorist2/ {print "Lorist2 lorist2.vep";}
+/^Lorist6/ {print "Lorist6 lorist6.vep";}
+/^LoristB/ {print "LoristB loristB.seq";}
+/^M13mp18/ {print "M13mp18 m13mp18.vec";}
+/^M13mp7/ {print "M13mp7  m13mp7.seq";}
+/^pBS/ {print "pBS     blue.vep";}
+/^pJB8/ {print "pJB8    pjb8.vep";}
+/^pUC18/ {print "pUC18   puc18.vec";}
diff --git a/src/staden/FtoC.c b/src/staden/FtoC.c
new file mode 100644
index 0000000..83d0312
--- /dev/null
+++ b/src/staden/FtoC.c
@@ -0,0 +1,76 @@
+#include "fort.h"
+
+void Cstr2Fstr(char *Cstr,
+	       char *Fstr, int_fl Flen)
+/*
+    Copy a '\0' terminated C string to a Fortran string, blank padding
+    if needed and ignoring excess C characters if needed.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+{   int_fl i;
+
+    for (i=0; (i<Flen) && (Cstr[i] != '\0'); i++)
+    {   Fstr[i] = Cstr[i];
+    }
+    for (; i<Flen; i++)
+    {   Fstr[i] = ' ';
+    }
+}
+
+
+
+
+void Fstr2Cstr(char *Fstr, int_fl Flen,
+	       char *Cstr, int_fl Clen)
+/*
+    Copy the significant characters of a blank padded Fortran string
+    to a '\0' terminated C string, ignoring excess characters.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+{   int_fl FsigLen, i;
+
+    /* Find the significant length of Fstr */
+    FsigLen=Flen;
+    while ((FsigLen > 0) && (Fstr[FsigLen-1] == ' '))
+    {   FsigLen--;
+    }
+
+    /* Copy up to (Clen-1) significant characters */
+    i=0;
+    while ((i < FsigLen) && (i < (Clen-1)))
+    {   Cstr[i] = Fstr[i];
+        i++;
+    }
+
+    Cstr[i] = '\0';
+}
+
+
+
+int_f swapbo_(int_f *i4)
+/*
+ * Returns the big-endian form of a four byte integer
+ */
+{
+    int i=1;
+
+    if (*(char*)&i) {
+
+	int_f swapped;
+
+#define swap_int4(src, dst) \
+    ((char *)&(dst))[0] = ((char *) &(src))[3];\
+    ((char *)&(dst))[1] = ((char *) &(src))[2];\
+    ((char *)&(dst))[2] = ((char *) &(src))[1];\
+    ((char *)&(dst))[3] = ((char *) &(src))[0];
+
+	swap_int4(*i4,swapped);
+	return swapped;
+    } else
+	return *i4;
+
+}
diff --git a/src/staden/FtoC.h b/src/staden/FtoC.h
new file mode 100644
index 0000000..7b301bb
--- /dev/null
+++ b/src/staden/FtoC.h
@@ -0,0 +1,41 @@
+/*
+    Title: 	 FtoC
+
+    File: 	 FtoC.h
+    Purpose:	 FORTRAN-C string conversion routines
+    Last update: Mon Jun 18 1990
+*/
+
+#ifndef _FTOC_H_
+#define _FTOC_H_
+
+#include "fort.h"
+
+
+extern void Cstr2Fstr(char *Cstr,
+		      char *Fstr, int_fl Flen);
+/*
+    Copy a '\0' terminated C string to a Fortran string, blank padding
+    if needed and ignoring excess C characters if needed.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+
+
+extern void Fstr2Cstr(char *Fstr, int_fl Flen,
+		      char *Cstr, int_fl Clen);
+/*
+    Copy the significant characters of a blank padded Fortran string
+    to a '\0' terminated C string, ignoring excess characters.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+
+extern int_f swapbo_(int_f *i4);
+/*
+ * Returns the big-endian form of a four byte integer
+ */
+
+#endif /*_FTOC_H_*/
diff --git a/src/staden/README b/src/staden/README
new file mode 100644
index 0000000..2a4ce75
--- /dev/null
+++ b/src/staden/README
@@ -0,0 +1,37 @@
+The staden suite of sequence analysis programs for the SUN/X11
+--------------------------------------------------------------
+
+The contents of this directory should be (at least):
+bin		executable sequence analysis programs
+help		helpfiles used by these programs
+staden.l        manual page
+staden.login	csh initialisations
+staden.profile	sh initialisations
+tables		tables used by the programs
+
+Installers should first print out a copy of the manual page.
+This describes the current state of the software and how to
+invoke the initialisations. The manual page should be placed
+somewhere where `man' can find it.
+
+The directory structure should be position independent. It's
+location, however, must be encoded in the environment
+variable $STADENROOT. This is described in the man page.
+
+
+xdap - Sequence assembly program
+--------------------------------
+
+This is the source directory for the dap and xdap sequence assembly
+programmes, and convert_project.  As well as the sources you should
+find a makefile and a default resources file "Xdap".
+
+The "xdap" application can be constructed by simply typing "make xdap",
+and installed by typing "make installx" As xdap requires objects from
+the "staden" and "ted" directories, it should be made after the staden
+package proper and the trace editor have been made.
+
+The default X resource file "Xdap" files is found in
+$STADENROOT/tables, and the environment variable XFILESEARCHPATH is set
+in staden.login to refer to it. This file could also be placed in the
+"/usr/lib/X11/app-defaults" directory.
diff --git a/src/staden/README0 b/src/staden/README0
new file mode 100644
index 0000000..4af2c4e
--- /dev/null
+++ b/src/staden/README0
@@ -0,0 +1,25 @@
+The staden suite of sequence analysis programs for the SUN/X11
+--------------------------------------------------------------
+
+This is the source directory for the staden suite of programs.
+This file describes what its contents are, how to make it and
+how to install it. The companion file `README' is to be read
+by those who have just installed the suite.
+
+The contents of this directory should be (at least):
+                The executable sequence analysis programs
+*.f, *.c *.h    The program sources
+help		helpfiles used by these programs
+makefile
+staden.l        manual page
+staden.login	csh initialisations
+staden.profile	sh initialisations
+tables		tables used by the programs
+
+makefile
+--------
+The makefile needs the macro `SRCDIR' setting before it runs.
+There are commands to make individual programs (e.g. `make xpip'),
+all the `normal' xterm programs (`make nprogs'), all the X programs
+(`make xprogs'), all the programs (`make all') and to install
+everything (`make install') and to install bits.
diff --git a/src/staden/Sheet.c b/src/staden/Sheet.c
new file mode 100644
index 0000000..fa07b27
--- /dev/null
+++ b/src/staden/Sheet.c
@@ -0,0 +1,1222 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Sheet.c,v 1.2 88/10/25 17:40:25 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#include <X11/IntrinsicP.h>
+#include <X11/StringDefs.h>
+#include "SheetP.h"
+
+
+#include <stdio.h>
+#include <sys/types.h>
+
+/* ---- Forward declarations ---- */
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args);
+static void Destroy(Widget w);
+static void Redisplay(Widget w, XEvent *event, Region region);
+static void Resize(Widget w);
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params);
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args);
+
+
+static void destroy_array(sheet_array a);
+static sheet_array create_array (int r, int c, size_t s);
+static void move_array (sheet_array a, sheet_array b);
+static void extend_array (sheet_array *a, int r, int c);
+static char *get_array_element(sheet_array a, int r, int c);
+
+
+/* ---- Private data ---- */
+
+
+static XtResource resources[] = {
+#define  offset(field) XtOffset(SheetWidget, field)
+    /* {name, class, type, size, offset, default_type, default_addr}, */
+    { XtNcallback, XtCCallback, XtRCallback, (Cardinal)sizeof(XtCallbackList),
+	  offset(sheet.input_callback), XtRCallback, NULL },
+    { XtNexposeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(sheet.expose_callback), XtRCallback, NULL },
+    { XtNfont, XtCFont, XtRFontStruct, (Cardinal)sizeof(XFontStruct*),
+	  offset(sheet.font), XtRString, XtDefaultFont },
+    { XtNheight, XtCHeight, XtRDimension, (Cardinal)sizeof(Dimension),
+	  offset(core.height), XtRImmediate, (caddr_t)1},
+    { XtNwidth, XtCWidth, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(core.width), XtRImmediate, (caddr_t)1},
+    { XtNresizeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(sheet.resize_callback), XtRCallback, NULL },
+    { XtNrows, XtCHeight, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.rows), XtRImmediate, (caddr_t)10},
+    { XtNcolumns, XtCWidth, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.columns), XtRImmediate, (caddr_t)80},
+    { XtNleftMargin, XtCMargin, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.margin.left), XtRImmediate, (caddr_t)4},
+    { XtNrightMargin, XtCMargin, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.margin.right), XtRImmediate, (caddr_t)4},
+    { XtNtopMargin, XtCMargin, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.margin.top), XtRImmediate, (caddr_t)4},
+    { XtNbottomMargin, XtCMargin, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  offset(sheet.margin.bottom), XtRImmediate, (caddr_t)4},
+    {XtNforeground, XtCForeground, XtRPixel, (Cardinal)sizeof (Pixel),
+	  offset(sheet.foreground), XtRString, XtDefaultForeground},
+    {XtNbackground, XtCBackground, XtRPixel, (Cardinal)sizeof (Pixel),
+	  offset(sheet.background), XtRString, XtDefaultBackground},
+    {XtNcursor, XtCCursor, XtRCursor, (Cardinal)sizeof(Cursor),
+	  offset(simple.cursor), XtRString, "xterm"},
+    {XtNdisplayCursor, XtCOutput, XtRBoolean, (Cardinal)sizeof(Boolean),
+	  offset(sheet.display_cursor), XtRImmediate, (caddr_t)False},
+    {XtNcursorRow, XtCWidth, XtRDimension, (Cardinal)sizeof(Dimension),
+	  offset(sheet.cursor_row), XtRImmediate, (caddr_t)0},
+    {XtNcursorColumn, XtCHeight, XtRDimension, (Cardinal)sizeof(Dimension),
+	  offset(sheet.cursor_column), XtRImmediate, (caddr_t)0},
+#undef  offset
+};
+
+
+static XtActionsRec actions[] =
+{
+  /* {name, procedure}, */
+    {"input",	InputAction},
+};
+
+
+static char translations[] =
+"<Key>:		input()	\n\
+ <BtnDown>:     input() \
+";
+
+
+SheetClassRec sheetClassRec = {
+  { /* core fields initial values */
+    /* superclass               */      (WidgetClass) &simpleClassRec,
+    /* class_name		*/	"Sheet",
+    /* widget_size		*/	(Cardinal)sizeof(SheetRec),
+    /* class_initialize		*/	NULL,
+    /* class_part_initialize	*/	NULL,
+    /* class_inited		*/	FALSE,
+    /* initialize		*/	Initialize,
+    /* initialize_hook		*/	NULL,
+    /* realize			*/	XtInheritRealize,
+    /* actions			*/	actions,
+    /* num_actions		*/	XtNumber(actions),
+    /* resources		*/	resources,
+    /* num_resources		*/	XtNumber(resources),
+    /* xrm_class		*/	NULLQUARK,
+    /* compress_motion		*/	TRUE,
+    /* compress_exposure	*/	XtExposeNoCompress,
+    /* compress_exposure		XtExposeCompressMaximal, */
+    /* compress_enterleave	*/	TRUE,
+    /* visible_interest		*/	FALSE,
+    /* destroy			*/	Destroy,
+    /* resize			*/	Resize,
+    /* expose			*/	Redisplay,
+    /* set_values		*/	SetValues,
+    /* set_values_hook		*/	NULL,
+    /* set_values_almost	*/	XtInheritSetValuesAlmost,
+    /* get_values_hook		*/	NULL,
+    /* accept_focus		*/	NULL,
+    /* version			*/	XtVersion,
+    /* callback_private		*/	NULL,
+    /* tm_table			*/	translations,
+    /* query_geometry		*/	XtInheritQueryGeometry,
+    /* display_accelerator	*/	XtInheritDisplayAccelerator,
+    /* extension		*/	NULL
+  },
+  { /* Simple class initial values */
+    /* change_sensitive         */      XtInheritChangeSensitive
+  },
+  { /* Sheet class initial values */
+    /* empty			*/	0
+  }
+};
+
+WidgetClass sheetWidgetClass = (WidgetClass)&sheetClassRec;
+
+
+
+
+/* ---- Private procedures --- */
+int binary_op(int src, int dst, int op)
+{
+    switch (op & HOP_MASK) {
+	case 0: return 0;
+	case 1: return ~ (src | dst);
+	case 2: return ~src & dst;
+	case 3: return ~src;
+	case 4: return src & ~dst;
+	case 5: return ~dst;
+	case 6: return src ^ dst;
+	case 7: return ~(src & dst);
+	case 8: return src & dst;
+	case 9: return ~src ^ ~dst;
+	case 10: return dst;
+	case 11: return ~src|dst;
+	case 12: return src;
+	case 13: return src|~dst;
+	case 14: return src|dst;
+	case 15: return 1;
+    }
+}
+
+#define GET_ARRAY_CELL(A,R,C)\
+    ( &A->base[(R * A->cols + C)*A->size] )
+
+static void destroy_array(sheet_array a)
+{
+    XtFree (a->base);
+    XtFree ((char *)a);
+}
+
+static sheet_array create_array (int r, int c, size_t s)
+{
+    sheet_array new = (sheet_array) XtCalloc (1,sizeof(sheet_array_struct));
+    if (new != NULL) {
+	new->base = (char *) XtCalloc (r*c,s);
+	if (new->base == NULL) {
+	    XtFree ((char *)new);
+	    new = NULL;
+	} else {
+	    new->rows = r;
+	    new->cols = c;
+	    new->size = s;
+	}
+    }
+    return new;
+}
+
+#define min(A,B) ((A < B) ? A : B)
+static void move_array (sheet_array a, sheet_array b)
+{
+    size_t r,c;
+    int i;
+
+    c = min (a->cols*a->size, b->cols*b->size);
+    r = min (a->rows, b->rows);
+    for (i=0; i<r; i++)
+	memcpy(
+	    (char *) GET_ARRAY_CELL(b,i,0),
+	    (char *) GET_ARRAY_CELL(a,i,0),
+	    c
+	);
+
+}
+#undef min
+
+static void extend_array (sheet_array *a, int r, int c)
+/*
+** Extending stategy: For rows.
+** Do rows really need extending?
+** Yes:
+**    Will twice old rows do?
+**    Yes:
+**       Use twice old rows.
+**    No:
+**	 Use new rows plus EXTEND_ROWS_GUESS (A wild quess that's enough)
+** No:
+**    Don't extend
+**
+** Extending stategy: For columns.
+** As for rows.
+*/
+{
+    int newr;
+    int newc;
+    sheet_array b;
+
+#define EXTEND_ROWS_GUESS 5
+#define EXTEND_COLS_GUESS 5
+    newr = (r<=(*a)->rows)
+	?(*a)->rows
+	:(r<=(*a)->rows*2)
+	    ?(*a)->rows*2
+	    :r+EXTEND_ROWS_GUESS;
+    if (newr > SHEET_MAX_ROWS) newr= SHEET_MAX_ROWS;
+
+    newc = (c<=(*a)->cols)
+	?(*a)->cols
+	:(c<=(*a)->cols*2)
+	    ?(*a)->cols*2
+	    :r+EXTEND_COLS_GUESS;
+    if (newc > SHEET_MAX_COLS) newc= SHEET_MAX_COLS;
+
+    if (newr!=(*a)->rows || newc!=(*a)->cols) {
+	b = create_array(newr,newc,(*a)->size);
+	move_array(*a,b);
+	destroy_array(*a);
+	*a = b;
+    }
+}
+
+/*
+static char * get_array_element(sheet_array a, int r, int c)
+{
+    if (r < 0 || c < 0)
+	return NULL;
+    if (a->rows > r || a->cols > c)
+	return NULL;
+
+    return GET_ARRAY_CELL(a,r,c);
+}
+*/
+
+
+/* ---- Private Routines ---- */
+#define fontWidth(F) ((F)->max_bounds.width)
+#define fontHeight(F) ((F)->max_bounds.ascent + (F)->max_bounds.descent)
+#define COL_TO_PIXEL(W,C) \
+    ((W)->sheet.font->max_bounds.width * (C) + (W)->sheet.margin.left)
+#define ROW_TO_BASELINE_PIXEL(W,R) \
+    (fontHeight((W)->sheet.font) * (R) + (W)->sheet.margin.top + (W)->sheet.font->max_bounds.ascent)
+#define ROW_TO_PIXEL(W,R) \
+    (fontHeight((W)->sheet.font) * ((R)+1) + (W)->sheet.margin.top)
+#define PIXEL_TO_COL(W,P) \
+    (((long)(P) - (long)(W)->sheet.margin.left) / (long)(W)->sheet.font->max_bounds.width)
+#define PIXEL_WIDTH_TO_COLS(W,P) \
+    (((long)(P) - (long)(W)->sheet.margin.left - (long)(W)->sheet.margin.right) / (long)(W)->sheet.font->max_bounds.width)
+#define COLS_TO_PIXEL_WIDTH(W,C) \
+    (((C) * (W)->sheet.font->max_bounds.width) + (W)->sheet.margin.left + (W)->sheet.margin.right)
+#define BASELINE_PIXEL_TO_ROW(W,P) \
+    (((long)(P) - (long)(W)->sheet.margin.top - (long)(W)->sheet.font->max_bounds.ascent) / (long)fontHeight((W)->sheet.font) )
+#define PIXEL_TO_ROW(W,P) \
+    (((long)(P) - (long)(W)->sheet.margin.top) / (long)fontHeight((W)->sheet.font) )
+#define PIXEL_HEIGHT_TO_ROWS(W,P) \
+    (((long)(P) - (long)(W)->sheet.margin.top - (long)(W)->sheet.margin.bottom) / (long)fontHeight((W)->sheet.font) )
+#define ROWS_TO_PIXEL_HEIGHT(W,R) \
+    (((R) * fontHeight((W)->sheet.font)) + (W)->sheet.margin.top + (W)->sheet.margin.bottom)
+#define FONT_WIDTH(W) (fontWidth((W)->sheet.font))
+#define FONT_HEIGHT(W) (fontHeight((W)->sheet.font))
+
+static void setGC(SheetWidget sw, GC gc, sheet_ink ink_base)
+{
+    if (ink_base->sh & sh_inverse) {
+	if (ink_base->sh&sh_bg)
+	    XSetForeground(XtDisplay(sw),gc,ink_base->bg);
+	else
+	    XSetForeground(XtDisplay(sw),gc,sw->sheet.background);
+	if (ink_base->sh&sh_fg)
+	    XSetBackground(XtDisplay(sw),gc,ink_base->fg);
+	else
+	    XSetBackground(XtDisplay(sw),gc,sw->sheet.foreground);
+    } else {
+	if (ink_base->sh&sh_fg)
+	    XSetForeground(XtDisplay(sw),gc,ink_base->fg);
+	else
+	    XSetForeground(XtDisplay(sw),gc,sw->sheet.foreground);
+	if (ink_base->sh&sh_bg)
+	    XSetBackground(XtDisplay(sw),gc,ink_base->bg);
+	else
+	    XSetBackground(XtDisplay(sw),gc,sw->sheet.background);
+    }
+
+}
+
+static void redrawCursor(SheetWidget sw, Boolean draw);
+
+static void old_repaint(SheetWidget sw, int c, int r, int l, sheet_ink ink, char *s)
+{
+    /*
+    int cursorZapped;
+
+    cursorZapped = (sw->sheet.display_cursor &&
+	sw->sheet.cursor_row >= r &&
+	sw->sheet.cursor_row < (r+l) &&
+	sw->sheet.cursor_column == c);
+    */
+    
+    if (ink->sh==sh_default) {
+	XDrawImageString(
+	    XtDisplay(sw),
+	    XtWindow(sw),
+	    sw->sheet.normgc,
+	    (int) COL_TO_PIXEL(sw,c),
+	    (int) ROW_TO_BASELINE_PIXEL(sw,r),
+	    s,
+	    l);
+    } else {
+	if (ink->sh & sh_light) {
+	    if (DisplayPlanes(XtDisplay(sw),DefaultScreen(XtDisplay(sw)))==1){
+		XFillRectangle(
+			       XtDisplay(sw),
+			       XtWindow(sw),
+			       sw->sheet.whitegc,
+			       (int) COL_TO_PIXEL(sw,c),
+			       (int) ROW_TO_PIXEL(sw,r-1),
+			       FONT_WIDTH(sw) * l,
+			       FONT_HEIGHT(sw)
+			       );
+		setGC(sw, sw->sheet.greygc, ink);
+		XDrawString(
+			    XtDisplay(sw),
+			    XtWindow(sw),
+			    sw->sheet.greygc,
+			    (int) COL_TO_PIXEL(sw,c),
+			    (int) ROW_TO_BASELINE_PIXEL(sw,r),
+			    s,
+			    l);
+	    } else {
+		sheet_ink_struct my_ink;
+		my_ink.sh = sh_fg;
+		my_ink.fg = sw->sheet.light;
+		setGC(sw, sw->sheet.sparegc, &my_ink);
+		XDrawImageString(
+				 XtDisplay(sw),
+				 XtWindow(sw),
+				 sw->sheet.sparegc,
+				 (int) COL_TO_PIXEL(sw,c),
+				 (int) ROW_TO_BASELINE_PIXEL(sw,r),
+				 s,
+				 l);
+	    }
+	} else {
+	    if (DisplayPlanes(XtDisplay(sw),DefaultScreen(XtDisplay(sw)))==1){
+		XFillRectangle(
+			       XtDisplay(sw),
+			       XtWindow(sw),
+			       sw->sheet.greygc,
+			       (int) COL_TO_PIXEL(sw,c),
+			       (int) ROW_TO_PIXEL(sw,r-1),
+			       FONT_WIDTH(sw) * l,
+			       FONT_HEIGHT(sw)
+			       );
+		XDrawString(
+			    XtDisplay(sw),
+			    XtWindow(sw),
+			    sw->sheet.normgc,
+			    (int) COL_TO_PIXEL(sw,c),
+			    (int) ROW_TO_BASELINE_PIXEL(sw,r),
+			    s,
+			    l);
+	    } else {
+		setGC(sw, sw->sheet.sparegc, ink);
+
+
+		XDrawImageString(
+				 XtDisplay(sw),
+				 XtWindow(sw),
+				 sw->sheet.sparegc,
+				 (int) COL_TO_PIXEL(sw,c),
+				 (int) ROW_TO_BASELINE_PIXEL(sw,r),
+				 s,
+				 l);
+	    }
+	}
+	if (ink->sh & sh_select || ink->sh & sh_underline) {
+	    setGC(sw, sw->sheet.sparegc, ink);
+	    XDrawLine(
+		XtDisplay(sw),
+		XtWindow(sw),
+		sw->sheet.sparegc,
+		(int) COL_TO_PIXEL(sw,c),
+		(int) ROW_TO_BASELINE_PIXEL(sw,r),
+		(int) COL_TO_PIXEL(sw,c+l)-1,
+		(int) ROW_TO_BASELINE_PIXEL(sw,r)
+		);
+	}
+    }
+
+    /*
+    if (cursorZapped)
+	redrawCursor(sw,True);
+    */
+}
+
+
+
+
+static void _repaint_colour(SheetWidget sw, int c, int r, int l, sheet_ink ink, char *s)
+{
+    sheet_ink_struct my_ink;
+    my_ink = *ink;
+
+    if (ink->sh & sh_light) {
+	my_ink.sh = (my_ink.sh | sh_fg) & ~sh_bg;
+	my_ink.fg = sw->sheet.light;
+    }	
+
+    setGC(sw, sw->sheet.sparegc, &my_ink);
+    XDrawImageString(
+		     XtDisplay(sw),
+		     XtWindow(sw),
+		     sw->sheet.sparegc,
+		     (int) COL_TO_PIXEL(sw,c),
+		     (int) ROW_TO_BASELINE_PIXEL(sw,r),
+		     s,
+		     l);
+
+    if (ink->sh & sh_select || ink->sh & sh_underline) {
+	XDrawLine(
+		  XtDisplay(sw),
+		  XtWindow(sw),
+		  sw->sheet.sparegc,
+		  (int) COL_TO_PIXEL(sw,c),
+		  (int) ROW_TO_BASELINE_PIXEL(sw,r),
+		  (int) COL_TO_PIXEL(sw,c+l)-1,
+		  (int) ROW_TO_BASELINE_PIXEL(sw,r)
+		  );
+    }
+
+}
+
+
+static void _repaint_monochrome(SheetWidget sw, int c, int r, int l, sheet_ink ink, char *s)
+{
+    
+    GC fg_gc;
+    GC bg_gc;
+
+#define L  ( ink->sh & sh_light )
+#define I  ( ink->sh & sh_inverse )
+#define BG ( ink->sh & (sh_bg | sh_fg) )
+    /*
+    ** bg_determination
+    */
+    bg_gc = ( I && !L ) ? sw->sheet.normgc :
+	(I || (!L && BG)) ? sw->sheet.greygc :
+	    sw->sheet.whitegc;
+    /*
+    ** fg_determination
+    */
+    fg_gc = ( !I && !L ) ? sw->sheet.normgc :
+	(!I || (!L && BG)) ? sw->sheet.greygc :
+	    sw->sheet.whitegc;
+#undef L
+#undef I
+#undef BG
+
+    XFillRectangle(
+		   XtDisplay(sw),
+		   XtWindow(sw),
+		   bg_gc,
+		   (int) COL_TO_PIXEL(sw,c),
+		   (int) ROW_TO_PIXEL(sw,r-1),
+		   FONT_WIDTH(sw) * l,
+		   FONT_HEIGHT(sw)
+		   );
+    XDrawString(
+		XtDisplay(sw),
+		XtWindow(sw),
+		fg_gc,
+		(int) COL_TO_PIXEL(sw,c),
+		(int) ROW_TO_BASELINE_PIXEL(sw,r),
+		s,
+		l);
+
+    if (ink->sh & sh_select || ink->sh & sh_underline) {
+	setGC(sw, sw->sheet.sparegc, ink);
+	XDrawLine(
+		  XtDisplay(sw),
+		  XtWindow(sw),
+		  fg_gc,
+		  (int) COL_TO_PIXEL(sw,c),
+		  (int) ROW_TO_BASELINE_PIXEL(sw,r),
+		  (int) COL_TO_PIXEL(sw,c+l)-1,
+		  (int) ROW_TO_BASELINE_PIXEL(sw,r)
+		  );
+    }
+}
+
+
+
+static void _repaint(SheetWidget sw, int c, int r, int l, sheet_ink ink, char *s)
+{
+    if (ink->sh==sh_default) {
+	XDrawImageString(
+	    XtDisplay(sw),
+	    XtWindow(sw),
+	    sw->sheet.normgc,
+	    (int) COL_TO_PIXEL(sw,c),
+	    (int) ROW_TO_BASELINE_PIXEL(sw,r),
+	    s,
+	    l);
+    } else {
+	if (DisplayPlanes(XtDisplay(sw),DefaultScreen(XtDisplay(sw)))==1)
+	    _repaint_monochrome(sw,c,r,l,ink,s);
+	else
+	    _repaint_colour(sw,c,r,l,ink,s);
+    }
+
+}
+
+
+static void redrawCursor(SheetWidget sw, Boolean draw)
+{
+    XawSheetRow    r = sw->sheet.cursor_row;
+    XawSheetColumn c = sw->sheet.cursor_column;
+    sheet_ink ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c);
+    sheet_paper paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+    sheet_ink_struct ink;
+    int cursor_was;
+
+    /* check cursor is on screen */
+    if (r < 0 || r > sw->sheet.rows-1 ||
+	c < 0 || c > sw->sheet.columns-1 ) return;
+
+    ink.fg = ink_base->fg;
+    ink.bg = ink_base->bg;
+    if (draw)
+	ink.sh = ink_base->sh | sh_inverse;
+    else
+	ink.sh = ink_base->sh;
+
+    /*
+    cursor_was = sw->sheet.display_cursor;
+    sw->sheet.display_cursor = 0;
+    */
+    _repaint(sw, c, r, 1, &ink, paper_base);
+    /*
+    sw->sheet.display_cursor = cursor_was;
+    */
+
+}
+
+static void repaintText(SheetWidget sw, int c, int r, int l)
+{
+    sheet_ink ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c);
+    sheet_ink ink_peek;
+    sheet_paper paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+    sheet_paper paper_peek;
+    XawSheetColumn c_peek;
+    int i;
+
+    while (l > 0) {
+	/* find stretch where all hilight the same */
+	ink_peek = ink_base;
+	ink_peek++;
+	paper_peek = paper_base;
+	paper_peek++;
+	c_peek = c;
+	c_peek++;
+	i = 1;
+	l--;
+#define implies(A,B) ((B)|!(A))
+	while ( (l > 0) &&
+	    (ink_peek->sh == ink_base->sh) &&
+	    implies(ink_base->sh&sh_fg,ink_peek->fg==ink_base->fg) &&
+	    implies(ink_base->sh&sh_bg,ink_peek->bg==ink_base->bg) ) {
+	    ink_peek++;
+	    paper_peek++;
+	    c_peek++;
+	    i++;
+	    l--;
+	}
+
+	_repaint(sw, c, r, i, ink_base, paper_base);
+
+	paper_base = paper_peek;
+	ink_base = ink_peek;
+	c = c_peek;
+    }
+
+}
+
+static void redisplayRegion(Widget w, XRectangle *expose)
+{
+    SheetWidget sw = (SheetWidget) w;
+    sheet_ink cell;
+
+    int tlc,brc,c;
+    int tlr,brr,r;
+
+    tlc = PIXEL_TO_COL(sw,expose->x);
+    tlr = PIXEL_TO_ROW(sw,expose->y);
+    brc = PIXEL_TO_COL(sw,expose->x+expose->width-1)+1;
+    brr = PIXEL_TO_ROW(sw,expose->y+expose->height-1)+1;
+    if (tlc < 0) tlc = 0;
+    if (tlr < 0) tlr = 0;
+    if (brc < 0) brc = 0;
+    if (brr < 0) brr = 0;
+    if (tlc >= sw->sheet.columns) tlc = sw->sheet.columns-1;
+    if (tlr >= sw->sheet.rows)    tlr = sw->sheet.rows-1;
+    if (brc >= sw->sheet.columns) brc = sw->sheet.columns-1;
+    if (brr >= sw->sheet.rows)    brr = sw->sheet.rows-1;
+
+    for (r=tlr;r<=brr;r++) {
+	repaintText(sw, tlc, r, brc-tlc+1);
+    }
+
+    if (sw->sheet.display_cursor &&
+	sw->sheet.cursor_row >= tlr &&
+	sw->sheet.cursor_row <= brr &&
+	sw->sheet.cursor_column >= tlc &&
+	sw->sheet.cursor_column <= brc)
+    {
+	/* better redraw cursor */
+	redrawCursor(sw,True);
+    }
+}
+
+/* ---- Exported procedures ---- */
+
+
+
+void XawSheetPutText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s)
+/*
+** Put plain text
+*/
+{
+    SheetWidget sw = (SheetWidget) w;
+    int i;
+    sheet_ink ink_base;
+    sheet_paper paper_base;
+    String sp;
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	if (c<0) { l += c; s -= c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0, sp = s,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, paper_base++, sp++) {
+	    ink_base->sh = sh_default;
+	    *paper_base = *sp;
+	}
+	if (XtIsRealized(w)) {
+	    _repaint(sw, c, r, l, (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c), s);
+
+	    if (sw->sheet.display_cursor &&
+		sw->sheet.cursor_row == r &&
+		sw->sheet.cursor_column >= c &&
+		sw->sheet.cursor_column < c+l)
+	    {
+		/* better redraw cursor */
+		redrawCursor(sw,True);
+	    }
+	}
+    }
+}
+
+
+void XawSheetPutJazzyText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s, XawSheetInk *ink_list)
+/*
+** Put multi-coloured text
+*/
+{
+    SheetWidget sw = (SheetWidget) w;
+    int i;
+    sheet_ink ink_base;
+    sheet_paper paper_base;
+    String sp;
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	if (c<0) { l += c; s -= c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0, sp = s,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, ink_list++, paper_base++, sp++) {
+	    ink_base->fg = ink_list->fg;
+	    ink_base->bg = ink_list->bg;
+	    ink_base->sh = ink_list->sh;
+	    *paper_base = *sp;
+	}
+	if (XtIsRealized(w)) {
+	    repaintText(sw, c, r, l);
+
+	    if (sw->sheet.display_cursor &&
+		sw->sheet.cursor_row == r &&
+		sw->sheet.cursor_column >= c &&
+		sw->sheet.cursor_column < c+l)
+	    {
+		/* better redraw cursor */
+		redrawCursor(sw,True);
+	    }
+	}
+    }
+}
+
+
+void XawSheetPutHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s)
+/*
+** Put text using default hilights
+*/
+{
+    SheetWidget sw = (SheetWidget) w;
+    int i;
+    sheet_ink ink_base;
+    sheet_paper paper_base;
+    String sp;
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	if (c<0) { l += c; s -= c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0, sp = s,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, paper_base++, sp++) {
+	    ink_base->sh = sw->sheet.default_sh;
+	    ink_base->fg = sw->sheet.default_fg;
+	    ink_base->bg = sw->sheet.default_bg;
+	    *paper_base = *sp;
+	}
+	if (XtIsRealized(w)) {
+	    _repaint(sw, c, r, l, (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c), s);
+	    if (sw->sheet.display_cursor &&
+		sw->sheet.cursor_row == r &&
+		sw->sheet.cursor_column >= c &&
+		sw->sheet.cursor_column < c+l)
+	    {
+		/* better redraw cursor */
+		redrawCursor(sw,True);
+	    }
+	}
+    }
+}
+
+void XawSheetHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, Pixel fg, Pixel bg, XawSheetHilight h)
+/*
+** Hilight already draw text
+*/
+{
+SheetWidget sw = (SheetWidget) w;
+sheet_ink ink_base;
+sheet_paper paper_base;
+
+/*
+Hilights currently supported:
+
+    sh_default		yes
+    sh_fg		yes
+    sh_bg		yes
+    sh_underline	yes
+    sh_inverse		yes
+    sh_light		no
+    sh_tick		no
+    sh_bold		no
+    sh_italic		no
+*/
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	int i;
+
+	if (c<0) { l += c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, paper_base++)
+	{
+	    if (h==sh_default) {
+	        ink_base->sh = sh_default;
+	    } else {
+		if (h & sh_fg) ink_base->fg  = fg;
+		if (h & sh_bg) ink_base->bg  = bg;
+		ink_base->sh |= h;
+	    }
+	}
+	repaintText(sw, (int)c, (int)r, (int)l);
+    }
+}
+
+void XawSheetUnhilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, Pixel fg, Pixel bg, XawSheetHilight h)
+/*
+** Remove hilighting from text
+*/
+{
+SheetWidget sw = (SheetWidget) w;
+sheet_ink ink_base;
+sheet_paper paper_base;
+
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	int i;
+
+	if (c<0) { l += c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, paper_base++)
+	{
+	    if (h==sh_default) {
+	    } else {
+		if (h & sh_fg) ink_base->fg  = fg;
+		if (h & sh_bg) ink_base->bg  = bg;
+		ink_base->sh &= !h&sh_mask;
+	    }
+	}
+	repaintText(sw, (int)c, (int)r, (int)l);
+    }
+}
+
+void XawSheetOpHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, XawSheetHilight h, int op)
+/*
+** Perform boolean operations on text
+*/
+{
+SheetWidget sw = (SheetWidget) w;
+sheet_ink ink_base;
+sheet_paper paper_base;
+
+
+    if (r>=0 && r<sw->sheet.rows &&
+	c+l>0 && c<sw->sheet.columns &&
+	l > 0) {
+	int i;
+
+	if (c<0) { l += c; c = 0; }
+	if (c+l>sw->sheet.columns) l = sw->sheet.columns - c;
+	for (
+	    i = 0,
+	    ink_base = (sheet_ink) GET_ARRAY_CELL(sw->sheet.ink,r,c),
+	    paper_base = (sheet_paper) GET_ARRAY_CELL(sw->sheet.paper,r,c);
+	    i < l;
+	    i++, ink_base++, paper_base++)
+	{
+	    ink_base->sh = binary_op(h,ink_base->sh,op)&sh_mask;
+	}
+	repaintText(sw, (int)c, (int)r, (int)l);
+	if (sw->sheet.display_cursor &&
+	    sw->sheet.cursor_row == r &&
+	    sw->sheet.cursor_column >= c &&
+	    sw->sheet.cursor_column < c+l)
+	{
+	    /* better redraw cursor */
+	    redrawCursor(sw,True);
+	}
+
+    }
+}
+
+void XawSheetPositionCursor(Widget w, XawSheetColumn c, XawSheetRow r)
+{
+    SheetWidget sw = (SheetWidget) w;
+    if (XtIsRealized(w) && sw->sheet.display_cursor)
+	redrawCursor(sw,False);
+    sw->sheet.cursor_column = c;
+    sw->sheet.cursor_row = r;
+    if (XtIsRealized(w) && sw->sheet.display_cursor)
+	redrawCursor(sw,True);
+}
+
+void XawSheetDisplayCursor(Widget w, Boolean b)
+{
+    SheetWidget sw = (SheetWidget) w;
+    if (sw->sheet.display_cursor^b) {/*state change*/
+	sw->sheet.display_cursor = b;
+	if (XtIsRealized(w)) redrawCursor(sw, b);
+    }
+}
+
+Pixel XawSheetColourNameToPixel(Widget w, String c)
+{
+    XColor rgb_db_def, hardware_def;
+    Colormap cmap;
+    Status s;
+
+    cmap = DefaultColormap(XtDisplay(w),DefaultScreen(XtDisplay(w)));
+    s = XAllocNamedColor(XtDisplay(w), cmap, c, &rgb_db_def, &hardware_def);
+
+    return hardware_def.pixel;
+}
+
+
+
+void XawSheetSetHilight(Widget w, Pixel fg, Pixel bg, XawSheetHilight h)
+{
+    SheetWidget sw = (SheetWidget) w;
+    
+    if (h & sh_fg) sw->sheet.default_fg  = fg;
+    if (h & sh_bg) sw->sheet.default_bg  = bg;
+    sw->sheet.default_sh = h;
+
+}
+
+void XawSheetSetRows(Widget w, XawSheetRow r)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetSetColumns(Widget w, XawSheetColumn c)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetDeleteRow(Widget w, XawSheetRow r)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetDeleteColumn(Widget w, XawSheetColumn c)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetInsertRow(Widget w, XawSheetRow r)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetInsertColumn(Widget w, XawSheetColumn c)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetInsert(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetDelete(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+void XawSheetClearSheet(Widget w)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+
+
+
+/* ---- Private procedures -- Class Methods ---- */
+static Pixel ColourNameToPixel(Widget w, String c)
+{
+    XColor rgb_db_def, hardware_def;
+    Colormap cmap;
+    Status s;
+
+    cmap = DefaultColormap(XtDisplay(w),DefaultScreen(XtDisplay(w)));
+    s = XAllocNamedColor(XtDisplay(w), cmap, c, &rgb_db_def, &hardware_def);
+
+    return hardware_def.pixel;
+}
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args)
+{   SheetWidget    sw = (SheetWidget)new;
+    Display       *display;
+    int            screen;
+
+    display  = XtDisplay(request);
+    screen   = DefaultScreen(display);
+
+    if (DisplayPlanes(display,screen)==1)
+    {   /* We are on a one-plane monochrome display.
+           Use dashes to make lines distinguishable.
+        */
+
+    }
+    else
+    {   /* Some sort of multi-plane display.
+        */
+
+    }
+    sw->sheet.paper = create_array(sw->sheet.rows,sw->sheet.columns, sizeof(sheet_paper_struct));
+    sw->sheet.ink = create_array(sw->sheet.rows,sw->sheet.columns, sizeof(sheet_ink_struct));
+    sw->sheet.cursor.row = 0;
+    sw->sheet.cursor.column = 0;
+    sw->sheet.cursor.visible = False;
+    sw->sheet.default_sh = sh_default;
+
+    /* get font details */
+
+    sw->sheet.width_in_pixels  = COLS_TO_PIXEL_WIDTH(sw,sw->sheet.columns);
+    sw->sheet.height_in_pixels = ROWS_TO_PIXEL_HEIGHT(sw,sw->sheet.rows);
+
+    sw->core.width = sw->sheet.width_in_pixels;
+    sw->core.height = sw->sheet.height_in_pixels;
+
+    /* GCs and things */
+    {
+	XtGCMask valuemask = (GCFont |
+			      GCGraphicsExposures | GCForeground | GCBackground );
+	XGCValues values;
+	
+	values.font = sw->sheet.font->fid;
+	values.graphics_exposures = (Bool) FALSE;
+	
+	values.foreground = sw->sheet.foreground;
+	values.background = sw->sheet.background;
+	sw->sheet.normgc = XtGetGC((Widget)sw, valuemask, &values);
+
+	values.foreground = sw->sheet.background;
+	values.background = sw->sheet.background;
+	sw->sheet.sparegc = XtGetGC((Widget)sw, valuemask, &values);
+	
+	values.foreground = sw->sheet.foreground;
+	values.background = sw->sheet.background;
+	sw->sheet.greygc = XCreateGC(XtDisplayOfObject((Widget)sw),
+				      RootWindowOfScreen(XtScreenOfObject(
+					                 (Widget)sw)),
+				      valuemask, &values);
+	
+	values.foreground = sw->sheet.background;
+	values.background = sw->sheet.foreground;
+	sw->sheet.whitegc = XtGetGC((Widget)sw, valuemask, &values);
+	
+	if (DisplayPlanes(XtDisplay(sw),DefaultScreen(XtDisplay(sw)))==1){
+#define grey_width 2
+#define grey_height 2
+	    static char grey_bits[] = { 0x01 , 0x02 };
+	    
+	    sw->sheet.grey_stipple =
+		XCreateBitmapFromData(XtDisplay(sw),
+				      RootWindowOfScreen(XtScreenOfObject(
+								 (Widget)sw)),
+				      grey_bits,
+				      grey_width,
+				      grey_height);
+	    XSetFillStyle(XtDisplay(sw),sw->sheet.greygc,FillOpaqueStippled);
+	    XSetStipple(XtDisplay(sw),sw->sheet.greygc,sw->sheet.grey_stipple);
+	} else {
+	    sw->sheet.light = ColourNameToPixel((Widget)sw,"lightGray");
+	}
+
+    }
+
+
+}
+
+
+
+static void Destroy (Widget w)
+{
+    SheetWidget sw = (SheetWidget) w;
+
+    destroy_array(sw->sheet.paper);
+    destroy_array(sw->sheet.ink);
+}
+
+/* ARGSUSED */
+static void Redisplay(Widget w, XEvent *event, Region region)
+{
+    SheetWidget sw = (SheetWidget) w;
+    XRectangle expose, cursor;
+
+    if (!XtIsRealized(w))
+	return;
+
+    if (event->type == Expose) {
+        expose.x = event->xexpose.x;
+        expose.y = event->xexpose.y;
+        expose.width = event->xexpose.width;
+        expose.height = event->xexpose.height;
+    }
+    else {  /* Graphics Expose. */
+        expose.x = event->xgraphicsexpose.x;
+        expose.y = event->xgraphicsexpose.y;
+        expose.width = event->xgraphicsexpose.width;
+        expose.height = event->xgraphicsexpose.height;
+    }
+
+    redisplayRegion((Widget)sw, &expose);
+}
+
+
+
+
+static void Resize(Widget w)
+{
+    SheetWidget sw = (SheetWidget) w;
+    int newRows, newCols;
+
+    newRows = PIXEL_HEIGHT_TO_ROWS(sw,sw->core.height);
+    newCols = PIXEL_WIDTH_TO_COLS(sw,sw->core.width);
+    if (newRows > SHEET_MAX_ROWS) {
+	newRows = SHEET_MAX_ROWS;
+	sw->sheet.height_in_pixels = ROWS_TO_PIXEL_HEIGHT(sw,newRows);
+	/*
+	sw->core.height = sw->sheet.height_in_pixels;
+	*/
+    }
+    sw->sheet.rows = newRows;
+
+    if (newCols > SHEET_MAX_COLS) {
+	newCols = SHEET_MAX_COLS;
+	sw->sheet.width_in_pixels = COLS_TO_PIXEL_WIDTH(sw,newCols);
+	/*
+	sw->core.width = sw->sheet.width_in_pixels;
+	*/
+    }
+    sw->sheet.columns = newCols;
+
+    extend_array (
+	&sw->sheet.paper,
+	newRows,
+	newCols
+	);
+    extend_array (
+	&sw->sheet.ink,
+	newRows,
+	newCols
+	);
+}
+
+
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params)
+{
+    SheetWidget sw = (SheetWidget) w;
+}
+
+
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args)
+{
+    SheetWidget oldsw = (SheetWidget) current;
+    SheetWidget newsw = (SheetWidget) new;
+    Boolean redisplay;
+    Boolean resize;
+
+    redisplay = FALSE;
+    resize = FALSE;
+    if (oldsw->sheet.rows != newsw->sheet.rows) {
+	if (newsw->sheet.rows < 1 ||
+	    newsw->sheet.rows >= SHEET_MAX_ROWS)
+	    newsw->sheet.rows = oldsw->sheet.rows;
+	if (oldsw->sheet.rows != newsw->sheet.rows) {
+	    redisplay = TRUE;
+	    resize = TRUE;
+	    newsw->sheet.height_in_pixels = ROWS_TO_PIXEL_HEIGHT(newsw,newsw->sheet.rows);
+	    newsw->core.height = newsw->sheet.height_in_pixels;
+	}
+    }
+    if (oldsw->sheet.columns != newsw->sheet.columns) {
+	if (newsw->sheet.columns < 1 ||
+	    newsw->sheet.columns >= SHEET_MAX_COLS)
+	    newsw->sheet.columns = oldsw->sheet.columns;
+	if (oldsw->sheet.columns != newsw->sheet.columns) {
+	    redisplay = TRUE;
+	    resize = TRUE;
+	    newsw->sheet.width_in_pixels  = COLS_TO_PIXEL_WIDTH(newsw,newsw->sheet.columns);
+	    newsw->core.width = newsw->sheet.width_in_pixels;
+	}
+    }
+
+    if (resize) {
+	extend_array (
+	    &newsw->sheet.paper,
+	    newsw->sheet.rows,
+	    newsw->sheet.columns
+	    );
+	extend_array (
+	    &newsw->sheet.ink,
+	    newsw->sheet.rows,
+	    newsw->sheet.columns
+	    );
+    }
+
+    return (redisplay && XtIsRealized((Widget)oldsw));
+
+}
+
diff --git a/src/staden/Sheet.h b/src/staden/Sheet.h
new file mode 100644
index 0000000..a938e2a
--- /dev/null
+++ b/src/staden/Sheet.h
@@ -0,0 +1,136 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Sheet.h,v 1.2 88/10/25 17:22:09 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _Sheet_h
+#define _Sheet_h
+
+/****************************************************************
+ *
+ * Sheet widget
+ *
+ ****************************************************************/
+
+#include <X11/Xaw/Simple.h>
+
+/* Resources:
+
+ Name		     Class		RepType		Default Value
+ ----		     -----		-------		-------------
+ background	     Background		Pixel		XtDefaultBackground
+ border		     BorderColor	Pixel		XtDefaultForeground
+ borderWidth	     BorderWidth	Dimension	1
+ bottomMargin        Margin             Dimension       4
+ callback            Callback           Callback        NULL
+ columns             Width              Dimension       80
+ cursor              Cursor             Cursor          None
+ cursorColumn        Width              Dimension       0
+ cursorRow           Height             Dimension       0
+ destroyCallback     Callback		Pointer		NULL
+ displayCursor       Output             Boolean         False
+ exposeCallback      Callback           Callback        NULL
+ font                Font               XFontStruct*    XtDefaultFont
+ foreground          Foreground         Pixel           XtDefaultForeground
+ height		     Height		Dimension	1
+ insensitiveBorder   Insensitive	Pixmap		Gray
+ leftMargin          Margin             Dimension       4
+ mappedWhenManaged   MappedWhenManaged	Boolean		True
+ resizeCallback      Callback           Callback        NULL
+ rightMargin         Margin             Dimension       4
+ rows                Height             Dimension       10
+ sensitive	     Sensitive		Boolean		True
+ topMargin           Margin             Dimension       4
+ width		     Width		Dimension	1
+
+*/
+
+/* define any special resource names here that are not in <X11/StringDefs.h> */
+
+#define XtNsheetResource		"sheetResource"
+#define XtCSheetResource		"SheetResource"
+
+#define XtNdimBackground                "dimBackground"
+#define XtNexposeCallback               "exposeCallback"
+#define XtNresizeCallback               "resizeCallback"
+#define XtNrows                         "rows"
+#define XtNcolumns                      "columns"
+#define XtNleftMargin                   "leftMargin"
+#define XtNrightMargin                  "rightMargin"
+#define XtNtopMargin                    "topMargin"
+#define XtNbottomMargin                 "bottomMargin"
+#define XtNdisplayCursor                "displayCursor"
+#define XtNcursorRow                    "cursorRow"
+#define XtNcursorColumn                 "cursorColumn"
+
+#define XtCOutput			"Output"
+
+
+/* declare specific SheetWidget class and instance datatypes */
+
+typedef struct _SheetClassRec  *SheetWidgetClass;
+typedef struct _SheetRec       *SheetWidget;
+typedef Dimension XawSheetRow;
+typedef Dimension XawSheetColumn;
+typedef long XawSheetHilight;
+typedef struct {
+    Pixel fg;
+    Pixel bg;
+    XawSheetHilight sh;
+} XawSheetInk;
+
+/* hilights available */
+#define sh_default	(0L)
+#define sh_fg		(1L<<0)
+#define sh_bg		(1L<<1)
+#define sh_underline	(1L<<2)
+#define sh_inverse	(1L<<3)
+#define sh_light	(1L<<4)
+#define sh_tick		(1L<<5)
+#define sh_bold		(1L<<6)
+#define sh_italic	(1L<<7)
+#define sh_select       (1L<<8)
+#define sh_mask         ((1L<<9) - 1)
+
+/* hilight operations */
+#define HOP_MASK 0xF
+#define HOP_SRC 0xC
+#define HOP_DST 0xA
+#define HOP_SET 0xD
+#define HOP_CLR 0x2
+#define HOP_TOG 0x6
+#define HOP_AND(S,D) ((S & D) & HOP_MASK)
+#define HOP_OR(S,D) ((S | D) & HOP_MASK)
+#define HOP_NOT(S)  ((!S) & HOP_MASK)
+
+#define SHEET_MAX_COLS  512
+#define SHEET_MAX_ROWS  1024
+
+/* declare the class constant */
+
+extern WidgetClass sheetWidgetClass;
+
+
+/* ---- Exported procedures ---- */
+
+extern void XawSheetPositionCursor(Widget w, XawSheetColumn c, XawSheetRow r);
+extern void XawSheetDisplayCursor(Widget w, Boolean B);
+extern void XawSheetPutText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s);
+extern void XawSheetSetRows(Widget w, XawSheetRow r);
+extern void XawSheetSetColumns(Widget w, XawSheetColumn c);
+extern void XawSheetDeleteRow(Widget w, XawSheetRow r);
+extern void XawSheetDeleteColumn(Widget w, XawSheetColumn c);
+extern void XawSheetInsertRow(Widget w, XawSheetRow r);
+extern void XawSheetInsertColumn(Widget w, XawSheetColumn c);
+extern void XawSheetInsert(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l);
+extern void XawSheetDelete(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l);
+extern Pixel XawSheetColourNameToPixel(Widget w, String c);
+extern void XawSheetSetHilight(Widget w, Pixel fg, Pixel bg, XawSheetHilight h);
+extern void XawSheetPutHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s);
+extern void XawSheetHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, Pixel fg, Pixel bg, XawSheetHilight h);
+extern void XawSheetUnhilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, Pixel fg, Pixel bg, XawSheetHilight h);
+extern void XawSheetOpHilightText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, XawSheetHilight h, int op);
+extern void XawSheetClearSheet(Widget w);
+extern void XawSheetPutJazzyText(Widget w, XawSheetColumn c, XawSheetRow r, Dimension l, String s, XawSheetInk *ink_list);
+
+#endif /* _Sheet_h */
diff --git a/src/staden/SheetP.h b/src/staden/SheetP.h
new file mode 100644
index 0000000..4c51e75
--- /dev/null
+++ b/src/staden/SheetP.h
@@ -0,0 +1,103 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: SheetP.h,v 1.2 88/10/25 17:37:59 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _SheetP_h
+#define _SheetP_h
+
+#include "Sheet.h"
+/* include superclass private header file */
+#include <X11/Xaw/SimpleP.h>
+
+
+/* define unique representation types not found in <X11/StringDefs.h> */
+
+#define XtRSheetResource		"SheetResource"
+
+typedef struct {
+    int empty;
+} SheetClassPart;
+
+typedef struct _SheetClassRec {
+    CoreClassPart	core_class;
+    SimpleClassPart     simple_class;
+    SheetClassPart	sheet_class;
+} SheetClassRec;
+
+extern SheetClassRec sheetClassRec;
+
+/*
+typedef struct {
+    Pixel fg;
+    Pixel bg;
+    XawSheetHilight sh;
+    char c;
+    char pad[3];
+} *sheet_cell, sheet_cell_struct;
+*/
+
+typedef struct {
+    XawSheetRow rows;
+    XawSheetColumn cols;
+    char *base;
+    size_t size;
+} *sheet_array, sheet_array_struct;
+
+typedef struct {
+    Pixel fg;
+    Pixel bg;
+    XawSheetHilight sh;
+} *sheet_ink, sheet_ink_struct;
+
+typedef char *sheet_paper, sheet_paper_struct;
+
+typedef struct {
+    XawSheetRow row;
+    XawSheetColumn column;
+    Boolean visible;
+} sheet_cursor;
+
+typedef struct {
+    Dimension left,right,top,bottom;
+} XawSheetMargin;
+
+
+typedef struct {
+    /* resources */
+    Pixel          foreground;
+    Pixel          background;
+    XFontStruct*   font;
+    XtCallbackList expose_callback;
+    XtCallbackList resize_callback;
+    XtCallbackList input_callback;
+    XawSheetMargin margin;
+    XawSheetRow    rows;
+    XawSheetColumn columns;
+    Boolean        display_cursor;
+    XawSheetRow    cursor_row;
+    XawSheetColumn cursor_column;
+    /* private state */
+    sheet_array    paper;
+    sheet_array    ink;
+    sheet_cursor   cursor;
+    int            width_in_pixels;
+    int            height_in_pixels;
+    GC             normgc;
+    GC             greygc;
+    GC             whitegc;
+    GC             sparegc;
+    Pixel	   default_fg;
+    Pixel	   default_bg;
+    Pixel          light;
+    Pixmap         grey_stipple;
+    XawSheetHilight default_sh;
+} SheetPart;
+
+typedef struct _SheetRec {
+    CorePart	core;
+    SimplePart  simple;
+    SheetPart	sheet;
+} SheetRec;
+
+#endif /* _SheetP_h */
diff --git a/src/staden/SmeLine.c b/src/staden/SmeLine.c
new file mode 100644
index 0000000..70eea17
--- /dev/null
+++ b/src/staden/SmeLine.c
@@ -0,0 +1,301 @@
+#if ( !defined(lint) && !defined(SABER) )
+static char Xrcsid[] = "$XConsortium: SmeLine.c,v 1.7 89/12/11 15:20:17 kit Exp $";
+#endif 
+
+/*
+ * Copyright 1989 Massachusetts Institute of Technology
+ *
+ * Permission to use, copy, modify, distribute, and sell this software and its
+ * documentation for any purpose is hereby granted without fee, provided that
+ * the above copyright notice appear in all copies and that both that
+ * copyright notice and this permission notice appear in supporting
+ * documentation, and that the name of M.I.T. not be used in advertising or
+ * publicity pertaining to distribution of the software without specific,
+ * written prior permission.  M.I.T. makes no representations about the
+ * suitability of this software for any purpose.  It is provided "as is"
+ * without express or implied warranty.
+ *
+ * M.I.T. DISCLAIMS ALL WARRANTIES WITH REGARD TO THIS SOFTWARE, INCLUDING ALL
+ * IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS, IN NO EVENT SHALL M.I.T.
+ * BE LIABLE FOR ANY SPECIAL, INDIRECT OR CONSEQUENTIAL DAMAGES OR ANY DAMAGES
+ * WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS, WHETHER IN AN ACTION
+ * OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF OR IN 
+ * CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
+ *
+ * Author:  Chris D. Peterson, MIT X Consortium
+ */
+
+/*
+ * Sme.c - Source code for the generic menu entry
+ *
+ * Date:    September 26, 1989
+ *
+ * By:      Chris D. Peterson
+ *          MIT X Consortium 
+ *          kit@expo.lcs.mit.edu
+ */
+
+#include <stdio.h>
+#include <X11/IntrinsicP.h>
+#include <X11/StringDefs.h>
+
+#include <X11/Xaw/XawInit.h>
+#include <X11/Xaw/SmeLineP.h>
+#include <X11/Xaw/Cardinals.h>
+
+#define offset(field) XtOffset(SmeLineObject, sme_line.field)
+static XtResource resources[] = {
+  {XtNlineWidth, XtCLineWidth, XtRDimension, (Cardinal)sizeof(Dimension),
+     offset(line_width), XtRImmediate, (XtPointer) 1},
+  {XtNstipple, XtCStipple, XtRBitmap, (Cardinal)sizeof(Pixmap),
+     offset(stipple), XtRImmediate, (XtPointer) XtUnspecifiedPixmap},
+  {XtNforeground, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+     offset(foreground), XtRString, "XtDefaultForeground"},
+};   
+#undef offset
+
+/*
+ * Function definitions. 
+ */
+
+static void Redisplay(), ClassInitialize(), Initialize();
+static void DestroyGC(), CreateGC();
+static Boolean SetValues();
+static XtGeometryResult QueryGeometry();
+
+
+#define SUPERCLASS (&smeClassRec)
+
+SmeLineClassRec smeLineClassRec = {
+  {
+    /* superclass         */    (WidgetClass) SUPERCLASS,
+    /* class_name         */    "SmeLine",
+    /* size               */    (Cardinal)sizeof(SmeLineRec),
+    /* class_initialize   */	ClassInitialize,
+    /* class_part_initialize*/	NULL,
+    /* Class init'ed      */	FALSE,
+    /* initialize         */    Initialize,
+    /* initialize_hook    */	NULL,
+    /* realize            */    NULL,
+    /* actions            */    NULL,
+    /* num_actions        */    ZERO,
+    /* resources          */    resources,
+    /* resource_count     */	XtNumber(resources),
+    /* xrm_class          */    NULLQUARK,
+    /* compress_motion    */    FALSE, 
+    /* compress_exposure  */    FALSE,
+    /* compress_enterleave*/ 	FALSE,
+    /* visible_interest   */    FALSE,
+    /* destroy            */    DestroyGC,
+    /* resize             */    NULL,
+    /* expose             */    Redisplay,
+    /* set_values         */    SetValues,
+    /* set_values_hook    */	NULL,
+    /* set_values_almost  */	XtInheritSetValuesAlmost,  
+    /* get_values_hook    */	NULL,			
+    /* accept_focus       */    NULL,
+    /* intrinsics version */	XtVersion,
+    /* callback offsets   */    NULL,
+    /* tm_table		  */    NULL,
+    /* query_geometry	  */    QueryGeometry,
+    /* display_accelerator*/    NULL,
+    /* extension	  */    NULL
+  },{
+    /* Menu Entry Fields */
+      
+    /* highlight */             XtInheritHighlight,
+    /* unhighlight */           XtInheritUnhighlight,
+    /* notify */		XtInheritNotify,		
+    /* extension */             NULL				
+  },{
+    /* Line Menu Entry Fields */
+    /* extension */             NULL				
+  }
+};
+
+WidgetClass smeLineObjectClass = (WidgetClass) &smeLineClassRec;
+
+/************************************************************
+ *
+ * Semi-Public Functions.
+ *
+ ************************************************************/
+
+/*	Function Name: ClassInitialize
+ *	Description: The Line Menu Entry Object's class initialization proc.
+ *	Arguments: none.
+ *	Returns: none.
+ */
+
+static void 
+ClassInitialize()
+{
+    static XtConvertArgRec screenConvertArg[] = {
+        {XtWidgetBaseOffset, (caddr_t) XtOffset(Widget, core.screen),
+	     (Cardinal)sizeof(Screen *)}
+    };
+
+    XawInitializeWidgetSet();
+    XtAddConverter(XtRString, XtRBitmap, XmuCvtStringToBitmap,
+		   screenConvertArg, XtNumber(screenConvertArg));
+}
+
+/*      Function Name: Initialize
+ *      Description: Initializes the simple menu widget
+ *      Arguments: request - the widget requested by the argument list.
+ *                 new     - the new widget with both resource and non
+ *                           resource values.
+ *      Returns: none.
+ */
+
+/* ARGSUSED */
+static void
+Initialize(request, new)
+Widget request, new;
+{
+    SmeLineObject entry = (SmeLineObject) new;
+
+    if (entry->rectangle.height == 0)
+	entry->rectangle.height = entry->sme_line.line_width;
+
+    CreateGC(new);
+}
+
+/*	Function Name: CreateGC
+ *	Description: Creates the GC for the line entry widget.
+ *	Arguments: w - the Line entry widget.
+ *	Returns: none
+ *
+ *      We can only share the GC if there is no stipple, because
+ *      we need to change the stipple origin when drawing.
+ */
+
+static void
+CreateGC(w)
+Widget w;
+{
+    SmeLineObject entry = (SmeLineObject) w;
+    XGCValues values;
+    XtGCMask mask = GCForeground | GCGraphicsExposures | GCLineWidth ;
+    
+    values.foreground = entry->sme_line.foreground;
+    values.graphics_exposures = FALSE;
+    values.line_width = entry->sme_line.line_width;
+    
+    if (entry->sme_line.stipple != XtUnspecifiedPixmap) {
+	values.stipple = entry->sme_line.stipple;
+	values.fill_style = FillStippled; 
+	mask |= GCStipple | GCFillStyle;
+	
+	entry->sme_line.gc = XCreateGC(XtDisplayOfObject(w), 
+				      RootWindowOfScreen(XtScreenOfObject(w)),
+				      mask, &values);
+    }
+    else
+	entry->sme_line.gc = XtGetGC(w, mask, &values);
+}
+
+/*	Function Name: DestroyGC
+ *	Description: Destroys the GC when we are done with it.
+ *	Arguments: w - the Line entry widget.
+ *	Returns: none
+ */
+
+static void
+DestroyGC(w)
+Widget w;
+{
+    SmeLineObject entry = (SmeLineObject) w;
+
+    if (entry->sme_line.stipple != XtUnspecifiedPixmap) 
+	XFreeGC(XtDisplayOfObject(w), entry->sme_line.gc);
+    else
+	XtReleaseGC(w, entry->sme_line.gc);
+}
+
+/*	Function Name: Redisplay
+ *	Description: Paints the Line
+ *	Arguments: w - the menu entry.
+ *                 event, region - NOT USED.
+ *	Returns: none
+ */
+
+static void
+Redisplay(w, event, region)
+Widget w;
+XEvent * event;
+Region region;
+{
+    SmeLineObject entry = (SmeLineObject) w;
+    int y = entry->rectangle.y + 
+	    (entry->rectangle.height - entry->sme_line.line_width) / 2;
+
+    if (entry->sme_line.stipple != XtUnspecifiedPixmap) 
+	XSetTSOrigin(XtDisplayOfObject(w), entry->sme_line.gc, 0, y);
+
+    XFillRectangle(XtDisplayOfObject(w), XtWindowOfObject(w),
+		   entry->sme_line.gc, 
+		   0, y, (unsigned int) entry->rectangle.width, 
+		   (unsigned int) entry->sme_line.line_width );
+}
+
+/*      Function Name: SetValues
+ *      Description: Relayout the menu when one of the resources is changed.
+ *      Arguments: current - current state of the widget.
+ *                 request - what was requested.
+ *                 new - what the widget will become.
+ *      Returns: none
+ */
+
+/* ARGSUSED */
+static Boolean
+SetValues(current, request, new)
+Widget current, request, new;
+{
+    SmeLineObject entry = (SmeLineObject) new;
+    SmeLineObject old_entry = (SmeLineObject) current;
+  
+    if ( (entry->sme_line.line_width != old_entry->sme_line.line_width) &&
+	 (entry->sme_line.stipple != old_entry->sme_line.stipple) ) {
+	DestroyGC(current);
+	CreateGC(new);
+	return(TRUE);
+    }
+    return(FALSE);
+}
+
+/*	Function Name: QueryGeometry.
+ *	Description: Returns the preferred geometry for this widget.
+ *	Arguments: w - the menu entry object.
+ *                 itended, return - the intended and return geometry info.
+ *	Returns: A Geometry Result.
+ *
+ * See the Intrinsics manual for details on what this function is for.
+ * 
+ * I just return the height and a width of 1.
+ */
+
+static XtGeometryResult
+QueryGeometry(w, intended, return_val) 
+Widget w;
+XtWidgetGeometry *intended, *return_val;
+{
+    SmeObject entry = (SmeObject) w;
+    Dimension width;
+    XtGeometryResult ret_val = XtGeometryYes;
+    XtGeometryMask mode = intended->request_mode;
+
+    width = 1;			/* we can be really small. */
+
+    if ( ((mode & CWWidth) && (intended->width != width)) ||
+	 !(mode & CWWidth) ) {
+	return_val->request_mode |= CWWidth;
+	return_val->width = width;
+	mode = return_val->request_mode;
+	
+	if ( (mode & CWWidth) && (width == entry->rectangle.width) )
+	    return(XtGeometryNo);
+	return(XtGeometryAlmost);
+    }
+    return(ret_val);
+}
diff --git a/src/staden/aix-bits.c b/src/staden/aix-bits.c
new file mode 100644
index 0000000..fab2e8f
--- /dev/null
+++ b/src/staden/aix-bits.c
@@ -0,0 +1,15 @@
+#include <stdio.h>
+ 
+/*
+ * KH 2/06/92
+ * simulate a CALL FLUSH(devsepc) from SUN FORTRAN
+ * for STADEN programs
+ */
+ 
+void * flush (int *fd)
+{
+  FILE *a = fdopen(*fd,"a");
+  if (ferror(a) ) perror("problems with flushing: fdopen(int, char *)");
+  fflush(a);
+  if (ferror(a) ) perror("problems with flushing: fflush(FILE *)");
+}
diff --git a/src/staden/analps89.f b/src/staden/analps89.f
new file mode 100644
index 0000000..3c4caba
--- /dev/null
+++ b/src/staden/analps89.f
@@ -0,0 +1,2329 @@
+C   SUBROUTINES FOR ANALYSEP
+C   AUTHOR: RODGER STADEN
+C   8-2-90 back translation: zeroed righthand end of workr
+C   9-7-90 removed menu routine
+C   24-7-90 edited helixw to use getint for compatibility with x versions
+C   5-11-90 signlp call to rdwmt added zero as last parameter
+C   4-12-90 Changed name of compc to compcp for the stupid sun linker
+C  21-12-90 Changed signlp so that text output gives position as 
+C           I+MIDDLE (was I)
+      SUBROUTINE SETPAR(IOK)
+      PARAMETER (MINOPT = 17,
+     +           MAXOPT = 28,
+     +           NUMBER = 9)
+      INTEGER VALUES(NUMBER,MINOPT:MAXOPT)
+C      INTEGER TEMP(NUMBER)
+      SAVE /PAMDEF/
+      COMMON /PAMDEF/VALUES
+      IOK = 0
+C  SEARCH
+C      WRITE(KBOUT,*)' Search for short sequences'
+C      CALL GETPAR(17,7,IOK,IFILE,OPTION,OPTOUT,NOPOUT,
+C     +MININC,MAXINC,INC)
+      VALUES(1,17) = 1
+      VALUES(2,17) = 0
+      VALUES(3,17) = 0
+      VALUES(4,17) = 0
+      VALUES(5,17) = 1
+      VALUES(6,17) = 12
+      VALUES(7,17) = 1
+C  PERCENT MATCH
+      VALUES(1,18) = 0
+      VALUES(2,18) = 70
+C  SCORE MATRIX MATCH
+      VALUES(1,19) = 0
+C  WEIGHT MATRIX
+C      WRITE(KBOUT,*)' MOTIF SEARCH USING WEIGHT MATRIX'
+C      CALL GETPAR(20,2,IOK,IOPT,IGON)
+      VALUES(1,20) = 0
+      VALUES(2,20) = 0
+C  HYDROPHOBICITY
+      VALUES(1,22) = 1
+      VALUES(2,22) = 101
+      VALUES(3,22) = 11
+      VALUES(4,22) = 1
+      VALUES(5,22) = 101
+      VALUES(6,22) = 3
+C  CHARGE
+      VALUES(1,23) = 1
+      VALUES(2,23) = 101
+      VALUES(3,23) = 11
+      VALUES(4,23) = 1
+      VALUES(5,23) = 101
+      VALUES(6,23) = 3
+C  ROBSON
+C      CALL GETPAR(24,4,IOK,MINIW,MAXIW,IWRIT,IGON)
+      VALUES(1,24) = 1
+      VALUES(2,24) = 101
+      VALUES(3,24) = 1
+      VALUES(4,24) = 0
+C  HYDROPHOBIC MOMENT
+      VALUES(1,25) = 1
+      VALUES(2,25) = 130
+      VALUES(3,25) = 100
+      VALUES(4,25) = 1
+      VALUES(5,25) = 60
+      VALUES(6,25) = 18
+      VALUES(7,25) = 1
+      VALUES(8,25) = 101
+      VALUES(9,25) = 3
+C  HELIX WHEEL
+      VALUES(1,26) = 1
+      VALUES(2,26) = 130
+      VALUES(3,26) = 100
+      VALUES(4,26) = 1
+      VALUES(5,26) = 60
+      VALUES(6,26) = 18
+C  BACK TRANSLATE
+      VALUES(1,27) = 0
+      VALUES(2,27) = 0
+      VALUES(3,27) = 1
+      VALUES(4,27) = 11
+      VALUES(5,27) = 11
+      VALUES(6,27) = 1
+      VALUES(7,27) = 11
+      VALUES(8,27) = 3
+C  PATTERNS TO DO
+C      OPEN(UNIT=99,FILE='ANALYSEP.PAR',STATUS ='OLD')
+C10    CONTINUE
+C      READ(99,*,ERR=100,END=200)TEMP
+C      IF((TEMP(1).GE.MINOPT).AND.(TEMP(1).LE.MAXOPT)) THEN
+C        DO 20 I = 1,NUMBER-1
+C          VALUES(I,TEMP(1)) = TEMP(I+1)
+C          WRITE(*,*)TEMP(1),TEMP(I+1)
+C20      CONTINUE
+C      END IF
+C      GO TO 10
+C100   CONTINUE
+C      WRITE(*,*)'ERROR IN PARMS FILE'
+C      GO TO 10
+C200   CONTINUE
+C      CLOSE(UNIT = 99)
+      IOK = 0
+      END
+      SUBROUTINE GETPAR(IOPT,NPAM,IOK,I1,I2,I3,I4,I5,I6,I7,I8,I9)
+      PARAMETER (MINOPT = 17,
+     +           MAXOPT = 28,
+     +           NUMBER = 9)
+      INTEGER VALUES(NUMBER,MINOPT:MAXOPT)
+      COMMON /PAMDEF/VALUES
+      IOK = 0
+      IF((IOPT.GE.MINOPT).AND.(IOPT.LE.MAXOPT)) THEN
+        I = 1
+        I1 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I2 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I3 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I4 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I5 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I6 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I7 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I8 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I9 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+      END IF
+      IOK = 1
+      END
+C    COMPH
+      SUBROUTINE COMPH(SEQ,IDIM1P,MXSPAN,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +SCORES,IDSCOR,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER DIALOG
+      INTEGER SPAN
+      REAL SCORES(IDSCOR)
+      CALL SHOWFU(KBOUT,'Plot hydrophobicity')
+      CALL GETPAR(22,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL PHYDRO(SEQ,IDIM1P,MXSPAN,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +SCORES,IDSCOR,SPAN,IWRIT)
+      END
+      SUBROUTINE PHYDRO(SEQ,IDIM1P,MXSPAN,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +SCORES,IDSCOR,SPAN,IWRIT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,CTONUM
+      REAL SCORES(IDSCOR)
+      EXTERNAL CTONUM
+C   max score
+      SCRMAX=-9999.
+      SCRMIN=99999.
+      DO 1 I=1,IDSCOR
+        IF(SCORES(I).GT.SCRMAX)SCRMAX=SCORES(I)
+        IF(SCORES(I).LT.SCRMIN)SCRMIN=SCORES(I)
+1     CONTINUE
+      I1INM1=1+MXSPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+C   set ymax
+      RANGE=SPAN*(SCRMAX-SCRMIN)
+      RANGEP=RANGE/100.
+      YMIN=SPAN*SCRMIN
+      YMAX=SPAN*SCRMAX
+      XBIT=XMIN+(XMAX-XMIN)*0.01
+      YMID=YMIN+RANGE*0.5
+      YMID45=YMID-RANGEP*5.0
+C   do edge
+      CALL STARTR(SEQ(J1P-LB),IDIM1P,SCORES,IDSCOR,SPAN,SUM)
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 40 K=-40,40,10
+        YPLOT=YMID+RANGEP*K
+        CALL LINE(XMIN,XBIT,YPLOT,YPLOT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+40    CONTINUE
+C   PLOT MID LINE
+      CALL LINE(XMIN,XMAX,YMID45,YMID45,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      IDONE=0
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P
+        IDONE=IDONE+1
+        SUM=SUM+SCORES(CTONUM(SEQ(I+LF)))
+     +  -SCORES(CTONUM(SEQ(I-LB)))
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+C    COMPCP
+      SUBROUTINE COMPCP(SEQ,IDIM1P,MXSPAN,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +SCORES,IDSCOR,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER DIALOG
+      INTEGER SPAN
+      REAL SCORES(IDSCOR)
+      CALL SHOWFU(KBOUT,'Plot charge')
+      CALL GETPAR(23,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL PCOMPC(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +SCORES,IDSCOR,SPAN,IWRIT)
+      END
+      SUBROUTINE PCOMPC(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +SCORES,IDSCOR,SPAN,IWRIT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,CTONUM
+      REAL SCORES(IDSCOR)
+      EXTERNAL CTONUM
+      SCRMAX=-9999.
+      SCRMIN=9999.
+      DO 1 I=1,IDSCOR
+        IF(SCORES(I).GT.SCRMAX)SCRMAX=SCORES(I)
+        IF(SCORES(I).LT.SCRMIN)SCRMIN=SCORES(I)
+1     CONTINUE
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      I1INM1=1+MXSPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   set ymax
+      RANGE=SPAN*(SCRMAX-SCRMIN)
+      RANGEP=RANGE/100.
+      YMIN=SPAN*SCRMIN
+      YMAX=SPAN*SCRMAX
+      XBIT=XMIN+(XMAX-XMIN)*0.01
+      YMID=YMIN+RANGE*0.5
+C   do edge
+      CALL STARTR(SEQ(J1P-LB),IDIM1P,SCORES,IDSCOR,SPAN,SUM)
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 40 K=-40,40,10
+        YPLOT=YMID+RANGEP*K
+        CALL LINE(XMIN,XBIT,YPLOT,YPLOT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+40    CONTINUE
+C   PLOT MID LINE
+      CALL LINE(XMIN,XMAX,YMID,YMID,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      IDONE=0
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P
+        IDONE=IDONE+1
+        SUM=SUM+SCORES(CTONUM(SEQ(I+LF)))
+     +  -SCORES(CTONUM(SEQ(I-LB)))
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE STARTR(SEQ,IDIM,SCORES,IDIMS,LENW,SUM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL  SCORES(IDIMS)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SUM=0.
+      DO 10 I=1,LENW
+        SUM = SUM + SCORES(CTONUM(SEQ(I)))
+10    CONTINUE
+      END
+C   this routine uses the following arrays and variables:
+C   a concatenated array of recognition sequences recseq maxrec idrseq prec
+C   a concatenated array of enzyme names names maxnam idnaml pnam
+C   a list of pointers to enzyme names in names namep maxnam itoten
+C   a list of enzyme name lengths namlen maxen itoten
+C   a list of numbers of recognition sequences per enzyme numrec maxen
+C   a list of lenghts of recognition sequences per enzyme (all expected
+C    to be the same) lenen maxen itoten
+C   a list of pointers to the first character of the first recognition
+C   sequence per enzyme recstr maxen itoten
+C   it also needs temporary storage of matches in psave and nsave of
+C   size maxmat and temporary storage of lengths of fragments in length
+C   output of results is on unit idev
+      SUBROUTINE SERCHP(SEQ,IDSEQ,J1,J2,KSTART,
+     +RENZYM,MAXREN,RECSEQ,MAXREC,
+     +NAMES,MAXNAM,LENGTH,PSAVE,NSAVE,MAXMAT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,NAMLST,MAXENZ,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,
+     +IDEVEN,FILNAM,FILEA,FILEAA,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILEA*(*),FILEAA*(*),HELPF*(*)
+      CHARACTER SEQ(IDSEQ),RENZYM(MAXREN),RECSEQ(MAXREC),NAMES(MAXNAM)
+      INTEGER OPTION,OPTOUT
+      INTEGER LENGTH(MAXMAT),PSAVE(MAXMAT),NSAVE(MAXMAT)
+      INTEGER NAMEP(MAXENZ),NAMLEN(MAXENZ),NUMREC(MAXENZ)
+      INTEGER LENEN(MAXENZ)
+      INTEGER RECSTR(MAXENZ),NAMLST(MAXENZ),DIALOG
+      PARAMETER (IBH=256)
+      CALL SHOWFU(KBOUT,'Search for short sequences')
+      CALL GETPAR(17,7,IOK,IFILE,OPTION,OPTOUT,NOPOUT,
+     +MININC,MAXINC,INC,
+     +IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+10    CONTINUE
+      IDREN  = MAXREN
+      IDRSEQ = MAXREC
+      IDNAML = MAXNAM
+      IDNLST = MAXENZ
+      MAXEN  = MAXENZ
+      IDMAT  = MAXMAT
+      LEVEL = 0
+      CALL SERCHD(RENZYM,IDREN,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,MARGT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,
+     +IDEVEN,FILNAM,FILEA,FILEAA,KBIN,KBOUT,
+     +OPTION,OPTOUT,NOPOUT,IFILE,MININC,MAXINC,INC,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL BUSY(KBOUT)
+      IF(OPTOUT.EQ.2) THEN
+C        CALL FINDL1(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+C     +  NAMEP,NAMLEN,
+C     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+C     +  PSAVE,NSAVE,LENGTH,IDMAT,NOPOUT,IBH,LEVEL,
+C     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+C     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE IF((OPTOUT.EQ.0).AND.(NOPOUT.EQ.1)) THEN
+        CALL FINDL2(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PSAVE,NSAVE,LENGTH,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE IF((OPTOUT.EQ.1).AND.(NOPOUT.EQ.1)) THEN
+        CALL FINDL3(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PSAVE,NSAVE,LENGTH,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE IF((OPTOUT.EQ.0).AND.(NOPOUT.EQ.0)) THEN
+        CALL FINDL4(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PSAVE,NSAVE,LENGTH,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE IF((OPTOUT.EQ.1).AND.(NOPOUT.EQ.0)) THEN
+        CALL FINDL5(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PSAVE,NSAVE,LENGTH,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      END IF
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      DIALOG = 1
+      GO TO 10
+      END
+      SUBROUTINE GETRNP(RENZYM,IDREN,IDEV,FILNAM,FILEA,FILEAA,
+     + IOK,KBIN,KBOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILEA*(*),FILEAA*(*),RENZYM(IDREN)
+      INTEGER DIALOG
+      PARAMETER (MAXPRM = 16)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      IOK = 1
+      IF(DIALOG.EQ.0) NOPT = IFILE
+      IOP = 1
+1     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        PROMPT(1) = 'Search'
+        PROMPT(2) = 'List enzyme file'
+        PROMPT(3) = 'Clear text'
+        PROMPT(4) = 'Clear graphics'
+        IOP = 1
+        CALL RADION('Select option',PROMPT,4,IOP,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+        IF(IOP.EQ.3) THEN
+          CALL CLEARV
+          CALL BPAUSE(KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+          GO TO 1
+        END IF
+        IF(IOP.EQ.4) THEN
+          CALL CLEARG
+          CALL BPAUSE(KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+          GO TO 1
+        END IF
+        IF((IOP.EQ.1).OR.(IOP.EQ.2)) THEN
+          NOPT = IFILE
+          PROMPT(1) = 'All acids'
+          PROMPT(2) = 'Named groups'
+          PROMPT(3) = 'Personal file'
+          PROMPT(4) = 'Keyboard'
+          CALL RADION('Select string definition mode',PROMPT,4,NOPT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(NOPT.LT.1) THEN
+              IOK = 1
+            RETURN
+          END IF
+        END IF
+      END IF
+      IF(NOPT.EQ.1)THEN
+        CALL OPENRS(IDEV,FILEAA,IOK,LRECL,2)
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.2)THEN
+        CALL OPENRS(IDEV,FILEA,IOK,LRECL,2)
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.3)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Named sequences file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.2) GO TO 1
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.4)THEN
+        CALL GETRKB(RENZYM,IDREN,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +  IOK)
+        RETURN
+      END IF
+      GO TO 1
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error opening file')
+      IOK=1
+      GO TO 1
+      END
+      SUBROUTINE DECREN(RENZYM,IDREN,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,KBOUT)
+C   routine to decode a restriction enzyme file array renzym idren
+C   into the following:
+C   a concatenated array of recognition sequences recseq idrseq prec
+C   a concatenated array of enzyme names names idnaml,pnam
+C   a list of pointers to enzyme names in names namep maxen itoten
+C   a list of enzyme name lengths namlen maxen itoten
+C   a list of numbers of recognition sequences per enzyme numrec maxen
+C   a list of lenghts of recognition sequences per enzyme (all expected
+C    to be the same) lenen maxen itoten
+C   a list of pointers to the first character of the first recognition
+C   sequence per enzyme recstr maxen itoten
+C   AUTHOR: RODGER STADEN
+      CHARACTER RENZYM(IDREN),RECSEQ(IDRSEQ),NAMES(IDNAML)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),TOTEN,PREC,PREN,PNAM
+      TOTEN=0
+      PREC=1
+      PREN=0
+      PNAM=1
+10    CONTINUE
+      PREN=PREN+1
+      IF(PREN.LT.IDREN)THEN
+      CALL CHARCT(RENZYM,IDREN,PREN,NCHAR)
+      IF(NCHAR.GT.0)THEN
+      TOTEN=TOTEN+1
+      IF(TOTEN.LT.MAXEN)THEN
+      IF((PNAM+NCHAR-1).GT.IDNAML)THEN
+      WRITE(KBOUT,1006)
+1006  FORMAT(' Names array full')
+      GO TO 101
+      END IF
+      NAMEP(TOTEN)=PNAM
+      NAMLEN(TOTEN)=NCHAR
+      CALL SQCOPY(RENZYM(PREN),NAMES(PNAM),NCHAR)
+      PNAM=PNAM+NCHAR
+      NREC=0
+      LMAX=0
+20    CONTINUE
+      PREN=PREN+NCHAR+1
+      NCHAR=0
+      IF(PREN.LT.IDREN)CALL CHARCT(RENZYM,IDREN,PREN,NCHAR)
+      IF(NCHAR.EQ.0)THEN
+      IF(NREC.EQ.0)THEN
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Name with no strings!')
+      TOTEN=TOTEN-1
+      GO TO 100
+      END IF
+      NUMREC(TOTEN)=NREC
+      LENEN(TOTEN)=LMAX
+      GO TO 10
+      END IF
+      NREC=NREC+1
+      IF(NCHAR.GT.LMAX)LMAX=NCHAR
+      CALL SQCOPY(RENZYM(PREN),RECSEQ(PREC),NCHAR)
+      IF(NREC.EQ.1)RECSTR(TOTEN)=PREC
+      PREC=PREC+NCHAR
+      GO TO 20
+      END IF
+      CALL ERROM(KBOUT,'Too many names')
+      GO TO 100
+      END IF
+      CALL ERROM(KBOUT,'Error in names and strings file')
+      END IF
+100   CONTINUE
+101   CONTINUE
+C      WRITE(KBOUT,1001)TOTEN
+C1001  FORMAT(' number of names=',I6)
+      IDNAML=PNAM-1
+      MAXEN=TOTEN
+      IDRSEQ=PREC
+      END
+      SUBROUTINE FIND4(SEQ,IDIM1,STRING,IDIM2,INC,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2),DASH
+      INTEGER PSEQ,PSTR
+      SAVE DASH
+      DATA DASH/'-'/
+      PSEQ=1-1*INC
+      PSTR=1
+      IMATCH=0
+100   CONTINUE
+      PSEQ=PSEQ+1-PSTR
+      PSEQ=PSEQ+INC-1
+400   CONTINUE
+      PSTR=0
+500   CONTINUE
+      PSEQ=PSEQ+1
+      IF(PSEQ.GT.IDIM1)GO TO 300
+      PSTR=PSTR+1
+      IF(STRING(PSTR).EQ.DASH)GO TO 450
+      IF(SEQ(PSEQ).NE.STRING(PSTR))GO TO 100
+450   CONTINUE
+      IF(PSTR.LT.IDIM2)GO TO 500
+      IMATCH=PSEQ-IDIM2+1
+300   CONTINUE
+      END
+      SUBROUTINE S1(SEQ,IDSEQ,NREC,LREC,JEN,IREC,J1,J2,INC,KSTART,
+     +RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),RECSEQ(IDRSEQ)
+      INTEGER RECSTR(MAXEN),PSAVE(MAXMAT),NSAVE(MAXMAT)
+      IOK = 1
+      DO 50 IREC=1,NREC
+        ISTR=RECSTR(JEN)+(IREC-1)*(LREC)
+C         NOTE WE NEED TO POINT TO THE CORRECT ARRAY ELEMENT BUT
+C         TO REMEMBER THE SEQUENCE ELEMENT FOR OUTPUT
+C         IAT IS THE ARRAY ELEMENT
+        IAT=J1-INC-KSTART+1
+10      CONTINUE
+          IAT=IAT+INC
+          ILEFT=J2-IAT+1-KSTART+1
+          IF(ILEFT.GT.0)THEN
+            CALL FIND8(SEQ(IAT),ILEFT,
+     +      RECSEQ(ISTR),LREC,INC,IMATCH)
+            IF(IMATCH.GT.0)THEN
+              IFOUND=IFOUND+1
+              IF(IFOUND.GT.MAXMAT)THEN
+                  CALL ERROM(KBOUT,'Too many matches')
+                  RETURN
+              END IF
+              IAT=IAT+IMATCH-1
+C             SAVE POSITION RELATIVE TO ARRAY
+              PSAVE(IFOUND)=IAT
+              NSAVE(IFOUND)=JEN
+              GO TO 10
+           END IF
+         END IF
+50    CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +NSAVE,PSAVE,LENGTH,MAXMAT,NAMES,IDNAML,
+     +LENN,IDEV,J2,KSTART,JOB)
+      PARAMETER (IDT = 20)
+      CHARACTER SEQ(IDSEQ),TEMP1(IDT),TEMP2(IDT),NAMES(IDNAML)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),LENEN(MAXEN)
+      INTEGER NSAVE(MAXMAT),PSAVE(MAXMAT),LENGTH(MAXMAT)
+      DO 60 KOUT=1,IFOUND
+        KR1=NAMEP(NSAVE(KOUT))
+        KS1=PSAVE(KOUT)
+        CALL FILLC(TEMP1,IDT,' ')
+        CALL FILLC(TEMP2,IDT,' ')
+        KR2 = NAMLEN(NSAVE(KOUT))
+        CALL SQCOPY(NAMES(KR1),TEMP1,KR2)
+        KR2 = LENEN(NSAVE(KOUT))
+        CALL SQCOPY(SEQ(KS1),TEMP2,KR2)
+        LENN=PSAVE(KOUT)-LENN
+        IF(JOB.EQ.0) THEN
+          WRITE(IDEV,1008)
+     +    TEMP1,TEMP2,PSAVE(KOUT)+KSTART-1,LENN,LENGTH(KOUT)
+        ELSE IF(JOB.EQ.1) THEN
+          WRITE(IDEV,1008)
+     +    TEMP1,TEMP2,PSAVE(KOUT)+KSTART-1,LENGTH(KOUT)
+        END IF
+        LENN=PSAVE(KOUT)
+1008    FORMAT(' ',20A1,2X,20A1,2X,I6,2X,I6,2X,I6)
+60    CONTINUE
+      IF(JOB.EQ.0) THEN
+        LEND=J2-PSAVE(IFOUND)+1-KSTART+1
+        WRITE(IDEV,1020)LEND,LENGTH(IFOUND+1)
+1020    FORMAT(' ',52X,I6,2X,I6)
+      ELSE IF (JOB.EQ.1) THEN
+        LEND=J2-PSAVE(IFOUND)+1-KSTART+1
+        WRITE(IDEV,1020)LEND
+      END IF
+      END
+      SUBROUTINE FINDL2(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     1NAMES,IDNAML,NAMEP,NAMLEN,
+     1NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     1PSAVE,NSAVE,LENGTH,MAXMAT,NOPOUT,IBH,LEVEL,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER LENGTH(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      CHARACTER HELPF*(*)
+      CHARACTER*20 ATOS
+      EXTERNAL ATOS
+C   OPTOUT=0 ENZYME BY ENZYME
+C   NOPOUT=1 GRAPHICAL OUTPUT
+C   FOR GRAPHICS NEED TO CHECK FOR END OF SCREEN
+      CALL CLEARV
+      CALL VECTOM
+      LEVEL1=LEVEL
+      LEVELM=MARGT-IBH
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      DO 100 IEN=1,LAST
+        IFOUND=0
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        LREC=LENEN(JEN)
+        CALL S1(SEQ,IDSEQ,NREC,LREC,JEN,IREC,J1,J2,INC,KSTART,
+     +  RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,KBOUT,
+     +  IOK)
+        IF(IOK.NE.0) THEN
+          CALL VT100M
+          RETURN
+        END IF
+        IF(IFOUND.GT.0) THEN
+          IF(LEVEL.GE.LEVELM)THEN
+            CALL PSRCHX(LEVEL1,LEVEL,KBIN,KBOUT,IQUIT,
+     +      IHELPS,IHELPE,HELPF,IDEVH)
+            IF(IQUIT.EQ.1) THEN
+              CALL VT100M
+              RETURN
+            END IF
+          END IF
+          CALL PLSRCH(J1-KSTART+1,J2-KSTART+1,
+     +    PSAVE,IFOUND,IBH,LEVEL,
+     +    ATOS(NAMES(NAMEP(JEN)),NAMLEN(JEN)),
+     +    NAMLEN(JEN),
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE FINDL3(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +PSAVE,NSAVE,LENGTH,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER LENGTH(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      CHARACTER HELPF*(*)
+C   OPTOUT=1 ALL ENZYMES TOGETHER
+C   NOPOUT=1 GRAPHICAL OUTPUT
+C   FOR GRAPHICS NEED TO CHECK FOR END OF SCREEN
+      CALL CLEARV
+      CALL VECTOM
+      LEVEL1=LEVEL
+      LEVELM=MARGT-IBH
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      IFOUND=0
+      DO 100 IEN=1,LAST
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        LREC=LENEN(JEN)
+        CALL S1(SEQ,IDSEQ,NREC,LREC,JEN,IREC,J1,J2,INC,KSTART,
+     +  RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,KBOUT,
+     +  IOK)
+        IF(IOK.NE.0) THEN
+          CALL VT100M
+          RETURN
+        END IF
+100   CONTINUE
+      IF(IFOUND.GT.0)THEN
+        CALL PLSRCH(J1-KSTART+1,J2-KSTART+1,
+     +  PSAVE,IFOUND,IBH,LEVEL,'ALL',3,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      CALL VT100M
+      IF(IFOUND.EQ.0)WRITE(IDEV,3000)
+3000  FORMAT(' no matches')
+      END
+      SUBROUTINE FINDL4(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +PSAVE,NSAVE,LENGTH,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER LENGTH(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      CHARACTER HELPF*(*)
+C   OPTOUT=0 ENZYME BY ENZYME
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      DO 100 IEN=1,LAST
+        IFOUND=0
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        LREC=LENEN(JEN)
+        CALL S1(SEQ,IDSEQ,NREC,LREC,JEN,IREC,J1,J2,INC,KSTART,
+     +  RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,KBOUT,
+     +  IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IFOUND.GT.0) THEN
+          WRITE(IDEV,1010)IFOUND
+1010      FORMAT('  matches=',I6)
+          CALL BUB2AS(PSAVE,NSAVE,IFOUND)
+C         calc lengths
+          LENGTH(1)=PSAVE(1)-(J1-KSTART+1)
+          DO 45 KOUT=2,IFOUND
+            LENGTH(KOUT)=PSAVE(KOUT)-PSAVE(KOUT-1)
+45        CONTINUE
+C         do length to end
+          LENGTH(IFOUND+1)=J2-KSTART+1-PSAVE(IFOUND)+1
+          IFP1=IFOUND+1
+          CALL BUBBLE(LENGTH,IFP1)
+          LENN=J1-KSTART
+          WRITE(IDEV,1011)
+1011     FORMAT(' NAME                  SEQUENCE            POSITION',
+     +    '  FRAGMENT LENGTHS')
+          CALL S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +    NSAVE,PSAVE,LENGTH,MAXMAT,NAMES,IDNAML,
+     +    LENN,IDEV,J2,KSTART,0)
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE FINDL5(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +PSAVE,NSAVE,LENGTH,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER LENGTH(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      CHARACTER HELPF*(*)
+C   OPTOUT=1 ALL ENZYMES TOGETHER
+C   NOPOUT = 0 (LISTED)
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      IFOUND=0
+      DO 100 IEN=1,LAST
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        LREC=LENEN(JEN)
+        CALL S1(SEQ,IDSEQ,NREC,LREC,JEN,IREC,J1,J2,INC,KSTART,
+     +  RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,KBOUT,
+     +  IOK)
+        IF(IOK.NE.0) RETURN
+100   CONTINUE
+      IF(IFOUND.GT.0) THEN
+        CALL BUB2AS(PSAVE,NSAVE,IFOUND)
+        LENN=J1-KSTART
+        WRITE(IDEV,1011)
+1011    FORMAT(' NAME                  SEQUENCE            POSITION',
+     +  '  FRAGMENT LENGTHS')
+        LENGTH(1)=PSAVE(1)-(J1-KSTART+1)
+        DO 59 KOUT=2,IFOUND
+          LENGTH(KOUT)=PSAVE(KOUT)-PSAVE(KOUT-1)
+59      CONTINUE
+        CALL S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +  NSAVE,PSAVE,LENGTH,MAXMAT,NAMES,IDNAML,
+     +  LENN,IDEV,J2,KSTART,1)
+      ELSE
+        WRITE(IDEV,3000)
+3000    FORMAT(' no matches')
+      END IF
+      END
+      SUBROUTINE SERCHD(RENZYM,IDREN,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,MARGT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,
+     +IDEVEN,FILNAM,FILEA,FILEAA,KBIN,KBOUT,
+     +OPTION,OPTOUT,NOPOUT,IFILE,MININC,MAXINC,INC,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILEA*(*),FILEAA*(*),HELPF*(*)
+      CHARACTER RENZYM(IDREN),RECSEQ(IDRSEQ),NAMES(IDNAML)
+      INTEGER OPTION,OPTOUT
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN)
+      INTEGER LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),DIALOG
+      IOK = 0
+      CALL GETRNP(RENZYM,IDREN,IDEVEN,FILNAM,FILEA,FILEAA,
+     +IOK,KBIN,KBOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(IDREN.LT.5) THEN
+        IOK = 1
+        RETURN
+      END IF
+      CALL DECREN(RENZYM,IDREN,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,KBOUT)
+      IF(MAXEN.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IF(DIALOG.EQ.0) RETURN
+      IOP = OPTION
+      CALL YESONO(IOP,'All names','Selected names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      OPTION = IOP
+      IF(OPTION.EQ.1)THEN
+        CALL GETNAM(NAMLST,IDNLST,NAMES,IDNAML,NAMEP,NAMLEN,
+     +  MAXEN,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IDNLST.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+      END IF
+      IOP = OPTOUT
+      CALL YESONO(IOP,'Show results name by name',
+     +'Show results all names together',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      OPTOUT = IOP
+      IOP = NOPOUT
+      CALL YESONO(IOP,'List matches','Plot matches',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      NOPOUT = IOP
+C      IF(NOPOUT.EQ.1)THEN
+C        LEVEL = LEVELI
+C        CALL SD2(IBLIPH,IBH,LEVEL,MARGT,
+C     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C        LEVELI = LEVEL
+C        IF(IOK.NE.0) RETURN
+C      END IF
+C      CALL GETINT(MININC,MAXINC,INC,'Step',
+C     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+C      INC = IVAL
+      INC = 1
+      END
+      SUBROUTINE SIGNLP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,
+     +SUM,WT,TOT,IDM,MAXLEN,LINE,CHRSET,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),HELPF*(*),CHRSET(IDM)
+      CHARACTER LINE(MAXLEN)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN)
+      INTEGER DIALOG
+      REAL WT(IDM,MAXLEN)
+      CALL SHOWFU(KBOUT,'Motif search using weight matrix')
+      CALL GETPAR(20,2,IOK,IOPT,IGON,
+     +IPAR3,IPAR4,
+     +IPAR5,IPAR6,IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      CALL SIGNLD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMIN1,YMAX,
+     +IGON,IOPT,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL PSIGNL(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMIN1,YMAX,IGON,MIDDLE)
+      END
+      SUBROUTINE SIGNLD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMIN1,YMAX,
+     +IGON,IOPT,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),CHRSET(IDM),LINE(MAXLEN)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN),DIALOG
+      REAL WT(IDM,MAXLEN)
+      INTEGER VALUE
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      IOK = 1
+      IDO = 1
+      PROMPT(1) = 'Use weight matrix'
+      PROMPT(2) = 'Make weight matrix'
+      PROMPT(3) = 'Rescale weight matrix'
+      CALL RADION('Select operation',PROMPT,3,IDO,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IDO.LT.1) RETURN
+      IF(IDO.GT.1) THEN
+        CALL MKWT(WT,SUM,TOT,CHRSET,IDM,MAXLEN,
+     +  IDEV,IDEVOT,KBIN,KBOUT,LINE,
+     +  FILNAM,IDO,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IOK = 1
+        RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Motif weight matrix file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      LENGTH=MAXLEN
+      CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,YMIN,YMAX,IDEV,
+     +  IOK,IDM,KBOUT,0)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      CALL YESONO(IOPT,'Use frequencies as weights',
+     +'Membership of set',IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IF(IOPT.EQ.1)THEN
+        CALL GETW1(SUM,WT,LENGTH,IDM,MAXLEN)
+        MININ = 1
+        MAXIN = LENGTH
+        CALL GETINT(MININ,MAXIN,LENGTH,'Cutoff score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        YMIN = 0.
+        YMAX = LENGTH
+        YMIN1 = VALUE
+      ELSE
+        IF(DIALOG.EQ.1) THEN
+          AMN = -9999.
+          AMX = 9999.
+          CALL GETRL(AMN,AMX,YMIN,'Cutoff score',VAL,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+          YMIN = VAL
+        END IF
+        IF(YMIN.LT.0.0)CALL GETW(TOT,SUM,WT,LENGTH,IDM,MAXLEN)
+        IF(YMIN.GE.0.0)CALL GETW2(SUM,WT,LENGTH,IDM,MAXLEN)
+        YMIN1 = YMIN
+      END IF  
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0) RETURN
+      IOK = 0
+      END
+      SUBROUTINE PSIGNL(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMIN1,YMAX,IGON,MIDDLE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      XMAX=J2
+      XMIN=J1
+      IF(IGON.EQ.0)THEN
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      DO 10 I=J1,J2-LENGTH+1
+        SUM1=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          SUM1=SUM1+WT(CTONUM(SEQ(J)),K)
+5       CONTINUE
+        IF(SUM1.GE.YMIN1)THEN
+          IF(IGON.EQ.0)THEN
+            X=I+MIDDLE
+            CALL LINE(X,X,YMIN,SUM1,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          IF(IGON.EQ.1)THEN
+            CALL VT100M
+            WRITE(IDEVOT,1010)I+MIDDLE,SUM1,(SEQ(K),K=I,I+LENGTH-1)
+1010        FORMAT(' ',I7,F10.2,' ',120A1)
+          END IF
+        END IF
+10    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE LSTSEP(SEQ,IDIM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER DIALOG
+      CALL SHOWFU(KBOUT,'List the sequence')
+      I1 = 1
+      I2 = IDIM
+      LENGTH = 60
+      IF(DIALOG.EQ.1) THEN
+        CALL LSTSPD(IDIM,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,I1,I2,LENGTH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL LSTSP(SEQ,IDIM,I1,I2,LENGTH,IDEV)
+      END
+      SUBROUTINE LSTSPD(IDIM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,I1,I2,LENGTH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER VALUE
+      IOK = 1
+1     CONTINUE
+      VALUE = I1
+      CALL GETINT(I1,I2,VALUE,
+     +'List from ',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I1 = VALUE
+      VALUE = I2
+      CALL GETINT(I1,I2,VALUE,
+     +'List to ',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I2 = VALUE
+      MININ = 10
+      MAXIN = 120
+      LENGTH = 60
+      CALL GETINT(MININ,MAXIN,LENGTH,
+     +'Line length ',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      IF(MOD(VALUE,10).NE.0)VALUE = LENGTH
+      LENGTH = VALUE
+      END
+      SUBROUTINE LSTSP(SEQ,IDIM,I1,I2,LENGTH,IDEV)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CALL FMTSEP(SEQ,IDIM,I1,I2,LENGTH,IDEV)
+      END
+      SUBROUTINE FMTSEP(SEQ1,IDIM,ISW,ISE,LINLEN,IDEV)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM)
+      INTEGER KL(12)
+      ISWW=ISW-1
+      IE=ISW-1
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.ISE)IE=ISE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,MIN(IE-IS+1,LINLEN)/10)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.ISE)RETURN
+      GO TO 1
+      END
+C   ROUTINE TO PERFORM ROBSON SECONDARY STRUCTURE PREDICTION FOR PROTEINS
+      SUBROUTINE ROBSON(SEQ,IDIM1,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +IDEV,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,DIALOG)
+C   AUTHOR: RODGER STADEN
+      INTEGER DIALOG
+      CHARACTER HELPF*(*),FILNAM*(*)
+      CHARACTER SEQ(IDIM1)
+      INTEGER HELIX(-8:8,26),EXTEND(-8:8,26),TURN(-8:8,26),COIL(-8:8,26)
+      CALL SHOWFU(KBOUT,'Plot Robson secondary structure predictions')
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error opening Robson weights file')
+        RETURN
+      END IF
+      CALL REDROB(HELIX,IDEV,IFAIL)
+      CALL REDROB(EXTEND,IDEV,IFAIL)
+      CALL REDROB(TURN,IDEV,IFAIL)
+      CALL REDROB(COIL,IDEV,IFAIL)
+      CLOSE(UNIT=IDEV)
+      IF(IFAIL.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in Robson score matrix')
+        RETURN
+      END IF
+      CALL GETPAR(24,4,IOK,MINIW,MAXIW,IWRIT,IGON,
+     +IPAR5,IPAR6,IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL ROBSND(MINIW,MAXIW,IWRIT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IGON,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL ROBSNP(SEQ,IDIM1,
+     +HELIX,EXTEND,TURN,COIL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +IDEVOT,IWRIT,IGON)
+      END
+      SUBROUTINE ROBSNP(SEQ,IDIM1,
+     +HELIX,EXTEND,TURN,COIL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,
+     +IDEVOT,IWRIT,IGON)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      INTEGER CTONUM
+      INTEGER HELIX(-8:8,26),EXTEND(-8:8,26),TURN(-8:8,26),COIL(-8:8,26)
+      EXTERNAL CTONUM
+      YMINH=-400.
+      YMAXH=400.
+      YMINE=-400.
+      YMAXE=400.
+      YMINT=-400.
+      YMAXT=400.
+      YMINC=-400.
+      YMAXC=400.
+      YMEAN = 0.
+      YMIND=0.0
+      YMAXD=100.0
+      YHD=20.
+      YED=40.
+      YTD=60.
+      YCD=80.
+      XMIN=J1
+      XMAX=J2
+      YMID=YMINH+(YMAXH-YMINH)*0.75
+      XMARG=REAL(MARGT)/5.0
+      IMARG=NINT(XMARG)
+      MARGBH=MARGB+XMARG
+      MARGTH=IMARG
+      MARGBE=MARGB+2*XMARG
+      MARGTE=IMARG
+      MARGBT=MARGB+3*XMARG
+      MARGTT=IMARG
+      MARGBC=MARGB+4*XMARG
+      MARGTC=IMARG
+      MARGBD=MARGB
+      MARGTD=IMARG
+      NH = 0
+      NE = 0
+      NT = 0
+      NC = 0
+      IF(IGON.EQ.0)THEN
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGBH,MARGTH,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGBE,MARGTE,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGBT,MARGTT,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGBC,MARGTC,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID,' H',2,0,XMAX,XMIN,YMAXH,YMINH,
+     1MARGL,MARGR,MARGBH,MARGTH,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID,' E',2,0,XMAX,XMIN,YMAXE,YMINE,
+     1MARGL,MARGR,MARGBE,MARGTE,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID,' T',2,0,XMAX,XMIN,YMAXT,YMINT,
+     1MARGL,MARGR,MARGBT,MARGTT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID,' C',2,0,XMAX,XMIN,YMAXC,YMINC,
+     1MARGL,MARGR,MARGBC,MARGTC,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YCD,' D',2,0,XMAX,XMIN,YMAXD,YMIND,
+     1MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+      END IF
+      IDONE=0
+      XP=MAX(J1,9)
+      X=XP
+      YPH=0.
+      YPE=0.
+      YPC=0.
+      YPT=0.
+      IOPT = 1
+      DO 10 I=MAX(J1,9),MIN(J2,IDIM1-8)
+        IDONE=IDONE+1
+        SUMH=0.
+        SUME=0.
+        SUMC=0.
+        SUMT=0.
+        K=I-9
+        DO 5 J=8,-8,-1
+          K=K+1
+          ISEQ=CTONUM(SEQ(K))
+          SUMH=SUMH+HELIX(J,ISEQ)
+          SUME=SUME+EXTEND(J,ISEQ)
+          SUMC=SUMC+COIL(J,ISEQ)
+          SUMT=SUMT+TURN(J,ISEQ)
+5       CONTINUE
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          YMAX=MAX(SUMH,SUME,SUMC,SUMT)
+          X=I
+          IF(IGON.EQ.0)THEN
+            CALL LINE(XP,X,YPH,SUMH,XMAX,XMIN,YMAXH,YMINH,
+     +      MARGL,MARGR,MARGBH,MARGTH,ISXMAX,ISYMAX)
+            IF(YMAX.EQ.SUMH)THEN
+              CALL POINT(X,YHD,XMAX,XMIN,YMAXD,YMIND,
+     +        MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+              CALL POINT(X,YMEAN,XMAX,XMIN,YMAXH,YMINH,
+     +        MARGL,MARGR,MARGBH,MARGTH,ISXMAX,ISYMAX)
+              NH = NH + 1
+            END IF
+            YPH=SUMH
+            CALL LINE(XP,X,YPE,SUME,XMAX,XMIN,YMAXE,YMINE,
+     +      MARGL,MARGR,MARGBE,MARGTE,ISXMAX,ISYMAX)
+            IF(YMAX.EQ.SUME)THEN
+              CALL POINT(X,YED,XMAX,XMIN,YMAXD,YMIND,
+     +        MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+              CALL POINT(X,YMEAN,XMAX,XMIN,YMAXE,YMINE,
+     +        MARGL,MARGR,MARGBE,MARGTE,ISXMAX,ISYMAX)
+              NE = NE + 1
+            END IF
+            YPE=SUME
+            CALL LINE(XP,X,YPT,SUMT,XMAX,XMIN,YMAXT,YMINT,
+     +      MARGL,MARGR,MARGBT,MARGTT,ISXMAX,ISYMAX)
+            IF(YMAX.EQ.SUMT)THEN
+              CALL POINT(X,YTD,XMAX,XMIN,YMAXD,YMIND,
+     +        MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+              CALL POINT(X,YMEAN,XMAX,XMIN,YMAXT,YMINT,
+     +        MARGL,MARGR,MARGBT,MARGTT,ISXMAX,ISYMAX)
+              NT = NT + 1
+            END IF
+            YPT=SUMT
+            CALL LINE(XP,X,YPC,SUMC,XMAX,XMIN,YMAXC,YMINC,
+     +      MARGL,MARGR,MARGBC,MARGTC,ISXMAX,ISYMAX)
+            IF(YMAX.EQ.SUMC)THEN
+              CALL POINT(X,YCD,XMAX,XMIN,YMAXD,YMIND,
+     +        MARGL,MARGR,MARGBD,MARGTD,ISXMAX,ISYMAX)
+              CALL POINT(X,YMEAN,XMAX,XMIN,YMAXC,YMINC,
+     +        MARGL,MARGR,MARGBC,MARGTC,ISXMAX,ISYMAX)
+              NC = NC + 1
+            END IF
+            YPC=SUMC
+            XP=I
+          END IF
+          IF(IGON.EQ.1)THEN
+            CALL VT100M
+            WRITE(IDEVOT,1005)
+     +      I,SEQ(I),INT(SUMH),INT(SUME),INT(SUMT),INT(SUMC)
+1005        FORMAT(' ',I6,' ',A1,' ',4I5)
+            IF(IGON.EQ.0)CALL VECTOM
+          END IF
+        END IF
+10    CONTINUE
+      CALL VT100M
+      WRITE(IDEVOT,1006)NH,NE,NT,NC
+1006  FORMAT(
+     +'     Helix =',I6,', Extended =',I6,', Turn =',I6,', Coil =',I6)
+      END
+      SUBROUTINE ROBSND(MINIW,MAXIW,IWRIT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IGON,IOK)
+      CHARACTER HELPF*(*)
+      IOK = 1
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0) RETURN
+      IF(IGON.EQ.0) THEN
+        CALL GETINT(MINIW,MAXIW,IWRIT,'Plot interval',IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IWRIT = IVAL
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE REDROB(TABLE,IDEV,IFAIL)
+C   AUTHOR: RODGER STADEN
+      INTEGER TABLE(-8:8,26)
+C   SET FAIL FLAG
+      IFAIL=1
+1000  FORMAT( )
+1001  FORMAT(3X,17I4)
+C   READ TITLE
+      READ(IDEV,1000,END=100,ERR=100)
+C   READ POSITIONS
+      READ(IDEV,1000,END=100,ERR=100)
+C   READ VALUES
+      DO 10 I=1,26
+        READ(IDEV,1001,ERR=100,END=100)(TABLE(K,I),K=-8,8)
+10    CONTINUE
+C   SET FAIL FLAG TO GOOD
+      IFAIL=0
+      RETURN
+100   CONTINUE
+      RETURN
+      END
+C   SUBROUTINE TO DRAW HELIX WHEELS
+      SUBROUTINE HELIXW(SEQ,IDSEQ,KSTART,J1,J2,IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,KBIN,KBOUT,HYDRO,IDSCOR,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C  AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),HELPF*(*)
+      INTEGER DIALOG
+      INTEGER WINDOW
+      REAL HYDRO(IDSCOR)
+      PARAMETER (DEGTOR = 0.017453)
+      CALL SHOWFU(KBOUT,'Draw helix wheel')
+      CALL GETPAR(26,6,IOK,MINANG,MAXANG,IANGLE,MINIW,MAXIW,WINDOW,
+     +IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL HELIXD(MINANG,MAXANG,IANGLE,MINIW,MAXIW,WINDOW,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      ANGLE = REAL(IANGLE)*DEGTOR
+      CALL HELIXP(SEQ,IDSEQ,KSTART,J1,J2,IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,KBIN,KBOUT,HYDRO,IDSCOR,
+     +IHELPS,IHELPE,HELPF,IDEVH,WINDOW,ANGLE)
+      END
+      SUBROUTINE HELIXP(SEQ,IDSEQ,KSTART,J1,J2,IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,KBIN,KBOUT,HYDRO,IDSCOR,
+     +IHELPS,IHELPE,HELPF,IDEVH,WINDOW,ANGLE)
+C  AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),HELPF*(*)
+      CHARACTER SYMBOL*26,ATOS*60,SYMT*2,POSN*6,POSNHP*8
+      CHARACTER*2 NUMBER(1:60)
+      INTEGER CTONUM,WINDOW
+      REAL HYDRO(IDSCOR)
+      EXTERNAL CTONUM,ATOS
+      PARAMETER (
+     +           RAD1   = 0.78,
+     +           RAD2   = 1.25,
+     +           RAD3   = 1.50,
+     +           RAD4   = 1.0,
+     +           XMIN   = -1.75,
+     +           XMAX   = 1.75,
+     +           YMIN   = -1.75,
+     +           YMAX   = 1.75,
+     +           CENTRX = 0.0,
+     +           CENTRY = 0.0,
+     +           TXMIN  = XMIN + (XMAX-XMIN)*0.05,
+     +           TYMIN  = YMAX - (YMAX-YMIN)*0.04,
+     +           PXMIN  = XMIN + (XMAX-XMIN)*0.05,
+     +           PYMIN  = YMAX - (YMAX-YMIN)*0.16,
+     +           HMXMIN  = XMIN + (XMAX-XMIN)*0.05,
+     +           HMYMIN  = YMAX - (YMAX-YMIN)*0.08,
+     +           HXMIN  = XMIN + (XMAX-XMIN)*0.05,
+     +           HYMIN  = YMAX - (YMAX-YMIN)*0.12)
+C     +           TXMIN  = XMIN + (XMAX-XMIN)*0.05,
+C     +           TYMIN  = YMAX - (YMAX-YMIN)*0.03,
+C     +           PXMIN  = XMIN + (XMAX-XMIN)*0.05,
+C     +           PYMIN  = YMAX - (YMAX-YMIN)*0.14,
+C     +           HMXMIN  = XMIN + (XMAX-XMIN)*0.05,
+C     +           HMYMIN  = YMAX - (YMAX-YMIN)*0.06,
+C     +           HXMIN  = XMIN + (XMAX-XMIN)*0.05,
+C     +           HYMIN  = YMAX - (YMAX-YMIN)*0.1)
+      PARAMETER (DELTA = 0.001)
+C
+      SYMBOL ='       --   +++ ......    '
+      DO 1 I = 1,60
+        WRITE(NUMBER(I),1005)I
+1     CONTINUE
+1005  FORMAT(I2)
+      POSNHP=' '
+C
+C  SEQUENCE CHARACTER IN ELEMENT 1 OF SEQ ARRAY IS KSTART
+C  FIRST SEQUENCE CHARACTER IN ACTIVE REGION IS J1, LAST J2
+C  DIMENSION OF SEQ ARRAY IS IDSEQ
+C  ELEMENT NUMBER FOR J1 IS J1-KSTART+1
+C  ELEMENT NUMBER FOR J2 IS J2-KSTART+1
+C  NUMBER OF ELEMENTS TO PROCESS IS J2-J1+1
+C  WORK IN ELEMENT NUMBERS AND SET TO SEQUENCE NUMBERS FOR OUTPUT
+C  SET FIRST ELEMENT NUMBER
+       I1 = J1-KSTART+1
+C
+C
+10     CONTINUE
+C
+C
+       CALL CLEARV
+       CALL VECTOM
+       CALL CLEARG
+       CALL VECTOM
+       CALL FRAME(IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+       XF = 0.0
+       YF = RAD1
+       ICLOCK = 0
+       I2 = MIN(I1+WINDOW-1,J2-KSTART+1)
+       IWIN = I2-I1+1
+       DO 20 I = I1,I2
+         ICLOCK = ICLOCK + 1
+         TX = COS(1.5708-((I-I1)*ANGLE))
+         TY = SIN(1.5708-((I-I1)*ANGLE))
+         X = RAD2*TX
+         Y = RAD2*TY
+         SYMT = SEQ(I)
+         CALL TEXT(X,Y,SYMT,1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+         X = RAD4*TX
+         Y = RAD4*TY
+         SYMT = NUMBER(ICLOCK)
+         CALL TEXT(X,Y,SYMT,2,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+         X = RAD3*TX
+         Y = RAD3*TY
+         K = CTONUM(SEQ(I))
+         SYMT = SYMBOL(K:K)
+         CALL TEXT(X,Y,SYMT,1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+         X = RAD1*TX
+         Y = RAD1*TY
+         CALL LINE(XF,X,YF,Y,XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+         XF = X
+         YF = Y
+         I3 = I
+20     CONTINUE
+       CALL EISEN(SEQ(I1),IWIN,HYDRO,IDSCOR,ANGLE,HM,H)
+       CALL TEXT(TXMIN,TYMIN,ATOS(SEQ(I1),IWIN),IWIN,ISIZE,
+     +   XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+       WRITE(POSN,1000)I1+KSTART-1
+1000   FORMAT(I6)
+       CALL TEXT(PXMIN,PYMIN,POSN,6,ISIZE,
+     +   XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+       WRITE(POSN,1001)HM*WINDOW
+1001   FORMAT(F6.2)
+       POSNHP(2:) = POSN
+       POSNHP(1:1) = 'M'
+       CALL TEXT(HMXMIN,HMYMIN,POSNHP,8,ISIZE,
+     +   XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+       WRITE(POSN,1001)H*WINDOW
+       POSNHP(2:) = POSN
+       POSNHP(1:1) = 'H'
+       CALL TEXT(HXMIN,HYMIN,POSNHP,8,ISIZE,
+     +   XMAX,XMIN,YMAX,YMIN,
+     +   IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+         ICLOCK = ICLOCK + 1
+         TX = COS(1.5708-((I3+1-I1)*ANGLE))
+         TY = SIN(1.5708-((I3+1-I1)*ANGLE))
+         X = RAD1*TX
+         Y = RAD1*TY
+        IF((ABS(X).LT.DELTA).AND.(ABS(Y-RAD1).LT.DELTA))
+     +  CALL LINE(XF,0.0,YF,RAD1,XMAX,XMIN,YMAX,YMIN,
+     +  IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+        CALL FLUSHG
+30     CONTINUE
+       CALL VT100M
+       MINSTP = -1000
+       MAXSTP = 1000
+       NEXT = 1
+       CALL GETINT(MINSTP,MAXSTP,NEXT,'Step',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 40
+       NEXT = IVAL
+       I1 = I1 + NEXT
+       IF(I1.LT.0)GO TO 40
+       IF(I1.GT.J2-KSTART+1)GO TO 40
+       IF(I1.LT.J1-KSTART+1)GO TO 40
+       GO TO 10
+40     CONTINUE
+       CALL VT100M
+       END
+      SUBROUTINE HELIXD(MINANG,MAXANG,IANGLE,MINIW,MAXIW,WINDOW,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C  AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER WINDOW
+      IOK = 1
+      CALL GETINT(MINANG,MAXANG,IANGLE,'Angle',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IANGLE = IVAL
+      CALL GETINT(MINIW,MAXIW,WINDOW,'Window',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+      END
+C   SUBROUTINE TO PLOT HYDROPHOBIC MOMENTS
+      SUBROUTINE HYDMOM(SEQ,IDSEQ,HYDRO,IDSCOR,
+     +IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C  AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),HELPF*(*)
+      INTEGER DIALOG
+      INTEGER WINDOW
+      REAL HYDRO(IDSCOR)
+      PARAMETER (DEGTOR = 0.017453)
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'Plot hydrophobic moment')
+      CALL GETPAR(25,9,IOK,MINANG,MAXANG,IANGLE,MINWIN,MAXWIN,WINDOW,
+     +MINIW,MAXIW,IWRIT)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL HYDMD(MINANG,MAXANG,IANGLE,MINWIN,MAXWIN,WINDOW,
+     +  MINIW,MAXIW,IWRIT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      ANGLE = REAL(IANGLE)*DEGTOR
+      CALL HYDMP(SEQ,IDSEQ,HYDRO,IDSCOR,
+     +IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,WINDOW,ANGLE,IWRIT)
+      END
+      SUBROUTINE HYDMP(SEQ,IDSEQ,HYDRO,IDSCOR,
+     +IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,WINDOW,ANGLE,IWRIT)
+C  AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ)
+      INTEGER CTONUM,WINDOW
+      REAL HYDRO(IDSCOR)
+      EXTERNAL CTONUM
+      XMIN = 1
+      XMAX = IDSEQ
+      CALL CLEARV
+      CALL VECTOM
+       Y = REAL(IYLEN)/2.
+       IYLEN1 = NINT(Y)
+       IY1 = IY0 + IYLEN1
+       CALL FRAME(IX0,IXLEN,IY0,IYLEN1,ISXMAX,ISYMAX)
+       CALL FRAME(IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+       IBH = 150
+       CALL SCALES(XMAX,XMIN,1.5,-1.0,
+     1          IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX,
+     1          0.5,IBH,-1.0,2)
+       CALL SCALES(XMAX,XMIN,1.5,0.0,
+     1          IX0,IXLEN,IY0,IYLEN1,ISXMAX,ISYMAX,
+     1          0.5,IBH,0.0,2)
+       IBH = 50
+       CALL SCALES(XMAX,XMIN,1.5,-1.0,
+     1          IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX,
+     1          0.1,IBH,-1.0,2)
+       CALL SCALES(XMAX,XMIN,1.5,0.0,
+     1          IX0,IXLEN,IY0,IYLEN1,ISXMAX,ISYMAX,
+     1          0.1,IBH,0.0,2)
+C       CALL TEXT(2.,1.4,'HYDROPHOBIC MOMENT',18,ISIZE,
+C     +   100.0,1.0,1.5,0.0,
+C     +   IX0,IXLEN,IY0,IYLEN1,ISXMAX,ISYMAX)
+C       CALL TEXT(2.0,1.3,'HYDROPHOBICITY',14,ISIZE,
+C     +   100.,1.0,1.5,-1.0,
+C     +   IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+C         CALL LINE(1.,XMAX,0.55,0.55,XMAX,XMIN,1.5,-1.0,
+C     +   IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+C         CALL LINE(1.,XMAX,0.68,0.68,XMAX,XMIN,1.5,-1.0,
+C     +   IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+C         CALL LINE(1.,XMAX,0.42,0.42,XMAX,XMIN,1.5,-1.0,
+C     +   IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+      IDONE = 0
+      XF = 0.
+      HMF = 0.
+      HF = 0.
+      DO 40 J = 1,IDSEQ-WINDOW+1
+       I1 = J
+       CALL EISEN(SEQ(I1),WINDOW,HYDRO,IDSCOR,ANGLE,HM,H)
+       IDONE = IDONE + 1
+       IF(MOD(IDONE,IWRIT).EQ.0)THEN
+         X = I1
+         CALL LINE(XF,X,HMF,HM,XMAX,XMIN,1.5,0.0,
+     +   IX0,IXLEN,IY0,IYLEN1,ISXMAX,ISYMAX)
+         CALL LINE(XF,X,HF,H,XMAX,XMIN,1.5,-1.0,
+     +   IX0,IXLEN,IY1,IYLEN1,ISXMAX,ISYMAX)
+         XF = X
+         HMF = HM
+         HF = H
+       END IF
+C
+40    CONTINUE
+       CALL VT100M
+       END
+      SUBROUTINE HYDMD(MINANG,MAXANG,IANGLE,MINWIN,MAXWIN,WINDOW,
+     +MINIW,MAXIW,IWRIT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C  AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER WINDOW
+      IOK = 1
+      CALL GETINT(MINANG,MAXANG,IANGLE,'Angle',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IANGLE = IVAL
+      CALL GETINT(MINWIN,MAXWIN,WINDOW,'Window',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+      CALL GETINT(MINIW,MAXIW,IWRIT,'Plot interval',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IWRIT = IVAL
+      IOK = 0
+      END
+       SUBROUTINE EISEN(SEQ,WINDOW,HYDRO,IDHYD,DELTA,HM,H)
+       INTEGER WINDOW
+       CHARACTER SEQ(WINDOW)
+       REAL HYDRO(IDHYD)
+       INTEGER CTONUM
+       EXTERNAL CTONUM
+C  AUTHOR RODGER STADEN
+C  CALCULATES THE MEAN HYDROPHOBICITY AND MEAN HYDROPHOBIC MOMENT
+C  AFTER EISENBERG, FOR A WINDOW OF SIZE WINDOW, AND ANGLE DELTA
+       T1 = 0.
+       T2 = 0.
+       H = 0.
+       DO 20 I = 1, WINDOW
+C
+         IT =CTONUM(SEQ(I))
+         T = HYDRO(IT)
+         T3 = DELTA * I - 1
+         T1 = T1 + T * SIN(T3)
+         T2 = T2 + T * COS(T3)
+         H = H + T
+C
+20    CONTINUE
+      T1 = T1 * T1
+      T2 = T2 * T2
+      HM = T1 + T2
+      IF(HM.GT.0.0)HM = SQRT(HM)
+      H = H / WINDOW
+      HM = HM / WINDOW
+      END
+C   BACK
+      SUBROUTINE BACK(SEQ,IDSEQ,CODIN,KBIN,KBOUT,FILNAM,IDEV,
+     +DNA,MAXSEQ,PCOD,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),DNA(MAXSEQ),FILNAM*(*),PAA(5,5,5)
+      CHARACTER*3 CODON(26),CODIN(26)
+      REAL CODS(4,4,4),PROB(26),PCOD(IDIM1P)
+      CHARACTER HELPF*(*)
+      INTEGER DIALOG
+      INTEGER SPAN
+      SAVE PAA
+      DATA PAA/'F','F','L','L','-','S','S','S','S','S',
+     1'Y','Y','*','*','-','C','C','*','W','-',
+     1'-','-','-','-','-','L','L','L','L','L',
+     1'P','P','P','P','P','H','H','Q','Q','-',
+     1'R','R','R','R','R','-','-','-','-','-','I','I','I','M','-',
+     1'T','T','T','T','T',
+     1'N','N','K','K','-','S','S','R','R','-','-','-','-','-','-',
+     1'V','V','V','V','V','A','A','A','A','A','D','D','E','E','-',
+     1'G','G','G','G','G',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-'/
+      CALL GETPAR(27,8,IOK,IOP,IPLOT,MINWIN,MAXWIN,SPAN,
+     +MINIW,MAXIW,IWRIT,IPAR9)
+      IF(IOK.NE.0) RETURN
+      CALL SHOWFU(KBOUT,'Back translate')
+      DO 10 I = 1,26
+        CODON(I) = CODIN(I)
+10    CONTINUE
+      CALL FILLR(PROB,26,0.)
+      CALL FILLR(CODS,64,0.)
+C   GET CODON TABLE FROM FILE IF REQUIRED
+      IF(DIALOG.EQ.1) THEN
+        CALL BACKD1(IOP,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,
+     +  FILNAM,IDEV,CODS,PAA,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+C   BACK TRANSLATE
+      CALL BACKA(CODS,PAA,CODON,PROB,PCOD,IDIM1P,MXSPAN,
+     +SEQ,IDSEQ,DNA,MAXSEQ)
+C   SET UP PLOTTING
+      IF(DIALOG.EQ.1) THEN
+        CALL BACKD2(IPLOT,MINWIN,MAXWIN,SPAN,MINIW,MAXIW,IWRIT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IPLOT.LT.0) RETURN
+      END IF
+      IF(IPLOT.EQ.0)THEN
+C   PLOT REDUNDANCY
+        CALL PLOTRD(PCOD,IDIM1P,MXSPAN,1,IDSEQ,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,SPAN,IWRIT)
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IDIM = 3*IDSEQ
+C   SAVE TO DISK
+      CALL BACKO(DNA,IDIM,FILNAM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      END
+      SUBROUTINE BACKD2(IPLOT,MINWIN,MAXWIN,SPAN,MINIW,MAXIW,IWRIT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER SPAN
+      CALL YESNO(IPLOT,'Plot redundancy',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IPLOT.LT.0)RETURN
+      IF(IPLOT.EQ.0) THEN
+        CALL GSPIN(MINWIN,MAXWIN,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      END
+      SUBROUTINE BACKO(DNA,IDIM,FILNAM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER DNA(IDIM)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CALL YESNO(IN,'Save DNA to disk',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.0)RETURN
+      IF(IN.EQ.0)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File name for DNA sequence',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.0)THEN
+          CALL FMTDK(IDEV,DNA,IDIM)
+          CLOSE(UNIT=IDEV)
+        END IF
+      END IF
+      END
+      SUBROUTINE BACKD1(IOP,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,
+     +FILNAM,IDEV,CODS,PAA,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),PAA(5,5,5)
+      REAL CODS(4,4,4)
+      IOK = 1
+      CALL YESONO(IOP,'No codon preference',
+     +'Use file of codon preferences',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      IF(IOP.EQ.0) THEN
+        IOK = 0
+        RETURN
+      END IF
+      IF(IOP.EQ.1)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Codon table file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.0)THEN
+          CALL REDCOD(CODS,IDEV)
+          CLOSE(UNIT=IDEV)
+          CALL WRTCOD(CODS,KBOUT,PAA)
+        END IF
+      END IF
+      END
+      SUBROUTINE BACKA(CODS,PAA,CODON,PROB,PCOD,IDIM1P,MXSPAN,
+     +SEQ,IDSEQ,DNA,MAXSEQ)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),DNA(MAXSEQ),PAA(5,5,5)
+      CHARACTER*3 CODON(26)
+      REAL CODS(4,4,4),PROB(26),PCOD(IDIM1P)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C   USE TOP CODONS
+      CALL TOPCOD(CODS,PAA,CODON,PROB)
+C   BACK TRANSLATE
+      I1 = 1+MXSPAN/2
+      CALL FILLR(PCOD,I1,0.)
+      DO 100 I = 1,IDSEQ
+        K = CTONUM(SEQ(I))
+        DNA(1+3*(I-1)) = CODON(K)(1:1)
+        DNA(2+3*(I-1)) = CODON(K)(2:2)
+        DNA(3+3*(I-1)) = CODON(K)(3:3)
+        PCOD(I+I1) = PROB(K)
+100   CONTINUE
+      CALL FILLR(PCOD(IDSEQ+I1+1),I1,0.)
+      END
+C   CHECK EACH AMINO ACID IN TURN FOR EVEN CODON USE
+      SUBROUTINE TOPCOD(CODUSE,PAA,CODON,PROB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER PAA(5,5,5),AA(22),BASE(4)
+      CHARACTER*3 CODON(26)
+      REAL CODUSE(4,4,4),PROB(26)
+      SAVE AA,BASE
+      DATA AA/'C','S','T','P','A','G','N','D','E','Q','B',
+     +'Z','H','R','K','M','I','L','V','F','Y','W'/
+      DATA BASE/'T','C','A','G'/
+C   IF CODONS FOR AN ACID ARE EVENLY USED EMPLOY ALL CODONS
+C   OTHERWISE EMPLOY THE TOP SCORER. CHECK THE USAGE
+C   LOOP FOR EACH ACID
+      DO 300 L=1,22
+        IMAX = 0
+        JMAX = 0
+        KMAX = 0
+        AMAX=0.0
+        NCOD = 0
+        TCOD = 0.0
+        IEVEN = 1
+C   LOOP FOR EACH CODON
+        DO 200 I=1,4
+          DO 200 J=1,4
+            DO 200 K=1,4
+C   EXAMINE CODONS FOR ACID L
+              IF(PAA(K,J,I).EQ.AA(L))THEN
+                NCOD = NCOD + 1
+                TCOD = TCOD + CODUSE(I,J,K)
+                IF(CODUSE(I,J,K).GT.AMAX)THEN
+                  AMAX = CODUSE(I,J,K)
+                  IMAX = I
+                  JMAX = J
+                  KMAX = K
+                  IEVEN = 0
+                ELSE IF(CODUSE(I,J,K).EQ.AMAX)THEN
+                  IEVEN = 1
+                END IF
+              END IF
+200     CONTINUE
+C  IF THERE IS A PREFERENCE, USE IT
+        IF((AMAX.GT.0.0).AND.(IEVEN.EQ.0))THEN
+          CODON(L) = BASE(IMAX)//BASE(JMAX)//BASE(KMAX)
+          PROB(L) = AMAX/TCOD
+        ELSE
+          PROB(L) = 1.0/(MAX(1,NCOD))
+        END IF
+300   CONTINUE
+      END
+      SUBROUTINE PLOTRD(SEQ,IDIM1P,MXSPAN,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,SPAN,IWRIT)
+C   AUTHOR: RODGER STADEN
+      REAL SEQ(IDIM1P)
+      INTEGER SPAN
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+C   max score
+      SCRMAX=1.
+      SCRMIN=0.16
+      I1INM1=1+MXSPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   set ymax
+      YMIN=SPAN*SCRMIN
+      YMAX=SPAN*SCRMAX
+C   do edge
+      SUM=0.
+      DO 90 I=J1P-LB,J1P-LB+SPAN
+        SUM=SUM+SEQ(I)
+90    CONTINUE
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      IDONE=0
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P
+        IDONE=IDONE+1
+        SUM=SUM+SEQ(I+LF)-SEQ(I-LB)
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE FIND8(SEQ,IDIM1,STRING,IDIM2,INC,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER PSEQ,PSTR
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      PSEQ=1-1*INC
+      PSTR=1
+      IMATCH=0
+100   CONTINUE
+        PSEQ=PSEQ+1-PSTR
+        PSEQ=PSEQ+INC-1
+        PSTR=0
+500     CONTINUE
+          PSEQ=PSEQ+1
+          IF(PSEQ.GT.IDIM1)RETURN
+          PSTR=PSTR+1
+          KSTRNG = CTONUM(STRING(PSTR))
+          KSEQ = CTONUM(SEQ(PSEQ))
+          IF((KSTRNG.NE.KSEQ).AND.(KSTRNG.NE.23)) GO TO 100
+      IF(PSTR.LT.IDIM2)GO TO 500
+      IMATCH=PSEQ-IDIM2+1
+      END
+      SUBROUTINE SQPFIT(SEQNCE,IDIM1,SEQ2,IDIM3I,ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +IDEV,IDEVIN,FILE2,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      INTEGER DIALOG
+      PARAMETER (MAXSTR = 50)
+      CHARACTER HELPF*(*)
+      CHARACTER SEQNCE(IDIM1),STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3I),FILE2*(*)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER ANSTY
+      CALL SHOWFU(KBOUT,'Find percentage matches')
+      IDIM2 = 0
+      IDIM2I = MAXSTR
+      IDIM3 = IDIM3I
+      IS = J1
+      IE = J2
+      ISS = 1
+      ISE = 10
+      CALL GETPAR(18,2,IOK,ANSTY,IPR,
+     +IPAR3,IPAR4,
+     +IPAR5,IPAR6,IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+      PR = REAL(IPR)
+5     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        CALL SQPFD1(SEQ2,IDIM3I,IDIM3,ANSTY,IDEVIN,FILE2,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+10    CONTINUE
+      CALL SQPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,PR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SQPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOK)
+      IF(IOK.EQ.2)THEN
+        WRITE(KBOUT,1017)ITOTID
+1017    FORMAT(/,' more than',I6,' matches.',
+     +  ' try changing percentage or region'/)
+        GO TO 10
+      END IF
+      IF(IOK.NE.0) RETURN
+      CALL SQPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,ITOT,ITOTEL,ITOTID,
+     +ITOTP,KSTART,J1,J2,ISS,PR,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      GO TO 10
+      END
+      SUBROUTINE SQPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,PR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ANSC,ANSTY
+      IOK = 1
+      CALL YESNO(ANSC,'Keep picture',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSC.LT.0)RETURN
+      IF(ANSC.EQ.1)CALL CLEARG
+      IF(ANSTY.EQ.1) THEN
+        CALL SQPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,ISS,ISE,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(ANSTY.EQ.0) THEN
+        CALL SQPF3(STRING,MATCH,IDIM2I,IDIM2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+C      CALL SQPF4(KBIN,KBOUT,J1,J2,IS,IE,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      CALL SQPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PR,MINP,IOK)
+C      IF(IOK.NE.0) RETURN
+      RETURN
+      END
+      SUBROUTINE SQPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IOK = 1
+      CALL FILLI(ITOT,ITOTID,0)
+      CALL FILLI(ITOTEL,ITOTID,0)
+      CALL SQFIT7(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART)
+      IF(ITOTP.GT.ITOTID)THEN
+        IOK = 2
+        RETURN
+      END IF
+      IF(ITOTP.GT.0)THEN
+        XMIN=J1
+        XMAX=J2
+        YMIN=0.
+        YMAX=IDIM2
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        DO 203 I=1,ITOTP
+          X=ITOTEL(I)
+          Y=ITOT(I)
+          CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+203     CONTINUE
+      END IF
+      CALL VT100M
+      IOK = 0
+      END
+      SUBROUTINE SEPFIT(SEQNCE,IDIM1,SEQ2,IDIM3I,ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +IDEV,IDEVIN,FILE2,KBIN,KBOUT,MATRIX,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      INTEGER DIALOG
+      PARAMETER (MAXSTR = 50)
+      CHARACTER HELPF*(*)
+      CHARACTER SEQNCE(IDIM1),STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3I),FILE2*(*)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID),MATRIX(IDM,IDM)
+      INTEGER ANSTY,SMIN,SMAX
+      CALL SHOWFU(KBOUT,'Find matches using a score matrix')
+      MINP = 0
+      IDIM2 = 0
+      IDIM2I = MAXSTR
+      IDIM3 = IDIM3I
+      IS = J1
+      IE = J2
+      ISS = 1
+      ISE = 10
+      CALL GETPAR(19,1,IOK,ANSTY,
+     +IPAR2,IPAR3,IPAR4,
+     +IPAR5,IPAR6,IPAR7,IPAR8,IPAR9)
+      IF(IOK.NE.0) RETURN
+5     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        CALL SQPFD1(SEQ2,IDIM3I,IDIM3,ANSTY,IDEVIN,FILE2,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+10    CONTINUE
+      CALL SEPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SEPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      IF(IOK.EQ.2)THEN
+        WRITE(KBOUT,1017)ITOTID
+1017    FORMAT(/,' more than',I6,' matches.',
+     +  ' try changing score or region'/)
+        GO TO 10
+      END IF
+      IF(IOK.NE.0) RETURN
+      CALL SEPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,ITOT,ITOTEL,ITOTID,
+     +ITOTP,MINP,KSTART,J1,J2,ISS,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      GO TO 10
+      END
+      SUBROUTINE SEPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ANSC,ANSTY,MATRIX(IDM,IDM),SMIN,SMAX
+      IOK = 1
+      CALL YESNO(ANSC,'Keep picture',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSC.LT.0)RETURN
+      IF(ANSC.EQ.1)CALL CLEARG
+      IF(ANSTY.EQ.1) THEN
+        CALL SEPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,ISS,ISE,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(ANSTY.EQ.0) THEN
+        CALL SQPF3(STRING,MATCH,IDIM2I,IDIM2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+C      CALL SEPF4(KBIN,KBOUT,J1,J2,IS,IE,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      CALL SEPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,MINP,MATRIX,IDM,
+     +SMIN,SMAX,STRING,IOK)
+C      IF(IOK.NE.0) RETURN
+      RETURN
+      END
+      SUBROUTINE SEPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,I1,I2,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ2(IDIM3),STRING(IDIM2I)
+      IOK = 1
+      MININ = 1
+      MAXIN = IDIM3
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Define string ends')
+      CALL GETINT(MININ,MAXIN,I1,'Start',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.NE.I1) THEN
+        I2 = IVAL + 10
+      END IF
+      I1 = IVAL
+      MININ = I1 + 1
+      MAXIN = I1 + IDIM2I - 1
+      CALL GETINT(MININ,MAXIN,I2,'End',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      I2 = IVAL
+      IDIM2 =I2 - I1 + 1
+      CALL SQCOPY(SEQ2(I1),STRING,IDIM2)
+      WRITE(KBOUT,1001)(STRING(K),K=1,IDIM2)
+1001  FORMAT(' string=',50A1)
+      IOK = 0
+      END
+      SUBROUTINE SEPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,MINP,MATRIX,IDM,
+     +SMIN,SMAX,STRING,IOK)
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(IDIM2)
+      INTEGER MATRIX(IDM,IDM),SMIN,SMAX,VALUE
+      IOK = 1
+      CALL GTSCR(STRING,IDIM2,MATRIX,IDM,SMIN,SMAX)
+      WRITE(KBOUT,1000)SMIN,SMAX
+1000  FORMAT(' Minimum score=',I6,' Maximum score=',I6)
+      MININ = SMIN
+      MAXIN = SMAX
+      IF(MINP.EQ.0) MINP = SMAX
+      CALL GETINT(MININ,MAXIN,MINP,'Score',VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINP = VALUE
+      IOK = 0
+      END
+      SUBROUTINE SEPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID),MATRIX(IDM,IDM)
+      INTEGER SMIN,SMAX
+      IOK = 1
+      CALL FILLI(ITOT,ITOTID,0)
+      CALL FILLI(ITOTEL,ITOTID,0)
+      CALL SPFIT(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,MATRIX,IDM,KSTART)
+      IF(ITOTP.GT.ITOTID)THEN
+        IOK = 2
+        RETURN
+      END IF
+      IF(ITOTP.GT.0)THEN
+        XMIN = J1
+        XMAX = J2
+        YMIN = SMIN
+        YMAX = SMAX
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        DO 203 I=1,ITOTP
+          X=ITOTEL(I)
+          Y=ITOT(I)
+          CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+203     CONTINUE
+      END IF
+      CALL VT100M
+      IOK = 0
+      END
+C      SQFITP
+      SUBROUTINE SQFITP(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(SEQ(IP).EQ.STRING(J))THEN
+            NTOT=NTOT+1
+          ELSE IF (STRING(J).EQ.'-')THEN
+            NTOT=NTOT+1
+          END IF
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINP)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+C      SQFIT7
+      SUBROUTINE SQFIT7(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID
+     1,IS,IE,MINS,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+C   COMPARE POINTERS, NOT ACTUAL CHARACTERS
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT = 0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(STRING(J).EQ.'-')THEN
+            NTOT = NTOT + 1
+          ELSE
+            K = CTONUM(STRING(J))
+            IF(K.EQ.CTONUM(SEQ(IP)))NTOT = NTOT + 1
+          END IF
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINS)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
diff --git a/src/staden/anals89.f b/src/staden/anals89.f
new file mode 100644
index 0000000..d8805fb
--- /dev/null
+++ b/src/staden/anals89.f
@@ -0,0 +1,10995 @@
+C   ANALS89 SUBROUTINES FOR NIP
+C   AUTHOR: RODGER STADEN
+C  11-8-92 Fixed bug in x version: use of gtstr and opratr='all' when
+C          using feature tables: xversion returns default, fortran returns
+C          lin=0: now check for both.
+C  29-7-92 Added routines for finding repeated words
+C   6-8-92 trna: previously did not allow for case in conserved base check
+C   29-5-92 set max open reading frame to idim/3 in opentr
+C   27-5-92 trnad made max intron length default = min intron length (was 0)
+C   21-5-92 fixed bug in restriction enzyme search: if 4 cutter file was 
+C   selected it always got listed!
+C   16-4-92 fixed bug in find inverted repeats: the second position was out!
+C   8-1-90
+C   Changed all kstran to jstran
+C   Changed all mstran to jstran
+C   Replaced calls codid1 by trand1 and deleted codid1
+C   Added clos(unit=ideve) to tran3,codimp
+C   Changed format statement in trpiro to produce embl ft output
+C   26-3-90 TRNA changed it to allow intron lengths from 0 to N (previously
+C   by mistake, had to be M to N where M is non zero).
+C   29-3-90 Minor change to CODTDK call to CODTBO was erroneously sent ANST
+C   not ANSTO. Now sent ANSTO.
+C   2-4-90 Major change to codtdk: normalised data not written to disk!
+C   3-5-90 Minor change to inverted repeat search for case when a pallindrome
+C          is found: stop the subtraction of 2 from position for listed
+C          results by making intx1=intx2
+C   9-7-90 Removed menu routine
+C   21-8-90 Changed getpar for option 39 to use completely new routines
+C           Changed TRPIRP and TRPIR and their callers
+C   5-11-90 Changed calls to rdwmt to include flag for show/not show title
+C   7-11-90 Huge number of changes to replace radio by radion, including
+C           writing new routine gstrnd. This has increased the returned 
+C           value of the choice flag by 1 and so is a potential source
+C           of minor cockup
+C  21-12-90 Changed weight matrix searches so that text output gives position
+C           as middle + i (was i)
+C  11-1-91  Greatly modified positional base preferences method adding 
+C           facility to read codon table from disk.
+C  19-2-91  Modified s2 to accomodate longer restriction enzyme recognition 
+C           seqs, and moved a write statement from findl4, findl5 to s2.
+C           For output from s2 names are now limited to 15 chars 
+C           recognition seqs to 35 chars.
+C  22-2-91  trpir had forgotten to reverse translations from the complementary
+C           strand. Added call to sqrev.
+C  11-4-91 modified trand7 to make 6 phase translation more obvious
+C  18-4-91  Replaced splice search and changed setpar
+C  23-4-91  Minor change to trand8 to allow both strands
+C           New routines: trnpir, trndop, trndom, openfr, trndp, trndm,
+C           wpair, pstop, mstop, minil. For translating open frames to disk
+C           and finding open frames
+C 11-6-91 Following routines replaced or added.Changes relating to new fts
+C 24-2-92 Bracketed the factor of -1 in MBPRIM
+C 2-3-92    set   FILNAM = ' ' for all openf1 calls
+C      SUBROUTINE OPENTR(SEQ,IDIM,KSTART,IDEV,KBIN,KBOUT,PAA,
+C      SUBROUTINE TRANEM(SEQ,IDIM,J1,J2,FRAMEC,
+C      SUBROUTINE TRAND8(IDIM,J1,J2,KSTRAN,IP1,IP2,PROMPT,
+C      SUBROUTINE TRAND7(KBIN,KBOUT,I1,I2,
+C      SUBROUTINE EMBLF(IDEVE,FILNAM,
+C      SUBROUTINE TRAND9(KBIN,KBOUT,IDEVE,FILNAM,OPRATR,
+C      SUBROUTINE TRANEN(SEQ,IDIM,J1,J2,FRAMEC,JSTRAN,IDEVE,
+C      SUBROUTINE TRAND1(JSTRAN,ANSE,IDEVE,FILNAM,
+C      SUBROUTINE TRANDK(SEQ1,IDIM1,OUTP1,IDIMP,ISTART,ISTOP,IDIMT,
+C      SUBROUTINE TRNEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,IDEV,
+C      SUBROUTINE CODTDK(SEQ,IDIM,J1,J2,IDEVE,FILNAM,KBIN,KBOUT,
+C      SUBROUTINE CODND1(JSTRAN,NORM,ANSE,IDEVE,FILNAM,OPRATR,
+C      SUBROUTINE CTDEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,
+C      SUBROUTINE CODEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,IDEV,
+C      SUBROUTINE CODONS(SEQ,IDIM,J1,J2,IDEV,IDEVE,FILNAM,KBIN,KBOUT,
+C      SUBROUTINE CODIMP(SEQ,IDIM,FTABLE,IDFTAB,IDEVE,FILNAM,
+C      SUBROUTINE CDIEMB(SEQ,IDIM,J1,J2,SEQW,JSTRAN,IDEVE,IDEV,
+C      SUBROUTINE PLTEMB(JSTRAN,IDEVE,
+C      SUBROUTINE PLTMAQ(IDEV,FILNAM,POSNS,MAXPOS,
+C      SUBROUTINE PLTBAR(POSNL,POSNR,YF,YF,BLIPB,BLIPT,
+C      SUBROUTINE MBPRIM(FRAMEC,IDIMS,J1,J2,PRIME)
+C 17-7-91 trndom replced line to write out title! which had disapeared
+      SUBROUTINE GETPAR(IOPT,NPAM,IOK,
+     +I1,I2,I3,I4,I5,I6,I7,I8,I9,I10,
+     +I11,I12,I13,I14,I15,I16,I17,I18,I19,I20,
+     +I21,I22,I23,I24,I25,I26,I27,I28,I29,I30,
+     +I31,I32,I33,I34,I35)
+      PARAMETER (MINOPT = 17,
+     +           MAXOPT = 68,
+     +           NUMBER = 35)
+      INTEGER VALUES(NUMBER,MINOPT:MAXOPT)
+      COMMON /PAMDEF/VALUES
+      IOK = 0
+      IF((IOPT.GE.MINOPT).AND.(IOPT.LE.MAXOPT)) THEN
+        I = 1
+        I1 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I2 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I3 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I4 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I5 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I6 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I7 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I8 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I9 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I10 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I11 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I12 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I13 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I14 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I15 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I16 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I17 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I18 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I19 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I20 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I21 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I22 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I23 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I24 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I25 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I26 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I27 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I28 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I29 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I30 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I31 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I32 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I33 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I34 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+        I = I + 1
+        I35 = VALUES(I,IOPT)
+        IF(I.EQ.NPAM) RETURN
+      END IF
+      IOK = 1
+      END
+      SUBROUTINE SETPAR(IOK)
+      PARAMETER (MINOPT = 17,
+     +           MAXOPT = 68,
+     +           NUMBER = 35)
+      INTEGER VALUES(NUMBER,MINOPT:MAXOPT)
+C      INTEGER TEMP(NUMBER)
+      COMMON /PAMDEF/VALUES
+      SAVE /PAMDEF/
+      IOK = 0
+C  SEARCH
+C      CALL GETPAR(17,14,IOK,
+C     +MINLEN,MAXLEN,LINLEN,MNCUT,MAXCUT,MINCUT,
+C     +OPTION,OPTOUT,IFILE,JTRAN,ITRAN,IROUND,IDEF)
+      VALUES(1,17) = 10
+      VALUES(2,17) = 100
+      VALUES(3,17) = 60
+      VALUES(4,17) = 0
+      VALUES(5,17) = 100
+      VALUES(6,17) = 0
+      VALUES(7,17) = 0
+      VALUES(8,17) = 0
+      VALUES(9,17) = 2
+      VALUES(10,17) = 0
+      VALUES(11,17) = 0
+      VALUES(12,17) = 0
+      VALUES(13,17) = 0
+C  PERCENT MATCH
+      VALUES(1,18) = 0
+      VALUES(2,18) = 70
+C  SCORE MATRIX MATCH
+      VALUES(1,19) = 0
+C  WEIGHT MATRIX
+      VALUES(1,20) = 0
+C      WRITE(KBOUT,*)' CALCULATE CODON USAGE, BASE COMPOSITION'
+C      WRITE(KBOUT,*)' AND AMINO ACID COMPOSITION'
+C      CALL GETPAR(23,3,IOK,JSTRAN,NORM,ANSE)
+      VALUES(1,23) = 0
+      VALUES(2,23) = 0
+      VALUES(3,23) = 0
+C
+C  PLOT BASE COMPOSITION
+C
+C      CALL GETPAR(24,10,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+C     +SCORES(1),SCORES(2),SCORES(3),SCORES(4))
+C
+      VALUES(1,24) = 1
+      VALUES(2,24) = 201
+      VALUES(3,24) = 31
+      VALUES(4,24) = 1
+      VALUES(5,24) = 11
+      VALUES(6,24) = 5
+      VALUES(7,24) = 1
+      VALUES(8,24) = 0
+      VALUES(9,24) = 1
+      VALUES(10,24) = 0
+C      WRITE(KBOUT,*)' PLOT BASE COMPOSITION DIFFERENCES AS CHI SQUARED'
+C      CALL GETPAR(25,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,25) = 11
+      VALUES(2,25) = 301
+      VALUES(3,25) = 33
+      VALUES(4,25) = 1
+      VALUES(5,25) = 21
+      VALUES(6,25) = 5
+C      WRITE(KBOUT,*)' PLOT DI COMPOSITION DIFFERENCES AS CHI SQUARED'
+C      CALL GETPAR(26,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,26) = 11
+      VALUES(2,26) = 301
+      VALUES(3,26) = 33
+      VALUES(4,26) = 1
+      VALUES(5,26) = 21
+      VALUES(6,26) = 5
+C      WRITE(KBOUT,*)' PLOT TRI COMPOSITION DIFFERENCES AS CHI SQUARED'
+C      CALL GETPAR(27,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,27) = 11
+      VALUES(2,27) = 301
+      VALUES(3,27) = 33
+      VALUES(4,27) = 1
+      VALUES(5,27) = 21
+      VALUES(6,27) = 5
+C      WRITE(KBOUT,*)' CALCULATE CODON IMPROBABILITY'
+C      CALL GETPAR(28,2,IOK,JSTRAN,ANSE)
+      VALUES(1,28) = 0
+      VALUES(2,28) = 0
+C      WRITE(KBOUT,*)' SEARCH FOR HAIRPIN LOOPS'
+C      CALL GETPAR(30,10,IOK,MINLPI,MAXLPI,MINLP,
+C     +MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,IGON)
+      VALUES(1,30) = 1
+      VALUES(2,30) = 30
+      VALUES(3,30) = 1
+      VALUES(4,30) = 3
+      VALUES(5,30) = 120
+      VALUES(6,30) = 3
+      VALUES(7,30) = 2
+      VALUES(8,30) = 20
+      VALUES(9,30) = 6
+      VALUES(10,30) = 0
+C      WRITE(KBOUT,*)' PLOT INVERTED REPEATS'
+C      CALL GETPAR(31,4,IOK,MINRP,MAXRP,MINREP,IGON)
+      VALUES(1,31) = 6
+      VALUES(2,31) = 30
+      VALUES(3,31) = 12
+      VALUES(4,31) = 0
+C      WRITE(KBOUT,*)' PLOT REPEATS'
+C      CALL GETPAR(32,4,IOK,MINRP,MAXRP,MINREP,IGON)
+      VALUES(1,32) = 6
+      VALUES(2,32) = 30
+      VALUES(3,32) = 12
+      VALUES(4,32) = 0
+C      WRITE(KBOUT,*)' PLOT Z DNA POTENTIAL '
+C      CALL GETPAR(33,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,33) = 3
+      VALUES(2,33) = 33
+      VALUES(3,33) = 15
+      VALUES(4,33) = 1
+      VALUES(5,33) = 21
+      VALUES(6,33) = 3
+C      WRITE(KBOUT,*)' PLOT Z DNA POTENTIAL '
+C      CALL GETPAR(34,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,34) = 3
+      VALUES(2,34) = 33
+      VALUES(3,34) = 21
+      VALUES(4,34) = 1
+      VALUES(5,34) = 11
+      VALUES(6,34) = 1
+C      WRITE(KBOUT,*)' PLOT Z DNA POTENTIAL '
+C      CALL GETPAR(35,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,35) = 5
+      VALUES(2,35) = 33
+      VALUES(3,35) = 15
+      VALUES(4,35) = 1
+      VALUES(5,35) = 21
+      VALUES(6,35) = 1
+C      WRITE(KBOUT,*)' SEARCH FOR LOCAL SIMILARITY OR COMPLEMENTARITY'
+C      CALL GETPAR(36,15,IOK,MINSP,MAXSP,SPAN,MINPR,MAXPR,IPR,
+C     +MNIR1,MXIR1,IRAN1,MNIR2,MXIR2,IRAN2,
+C     +ANSRV,ANSC,IGON)
+      VALUES(1,36) = 5
+      VALUES(2,36) = 200
+      VALUES(3,36) = 15
+      VALUES(4,36) = 5
+      VALUES(5,36) = 100
+      VALUES(6,36) = 70
+      VALUES(7,36) = 1
+      VALUES(8,36) = 50
+      VALUES(9,36) = 1
+      VALUES(10,36) = 1
+      VALUES(11,36) = 50
+      VALUES(12,36) = 1
+      VALUES(13,36) = 0
+      VALUES(14,36) = 0
+      VALUES(15,36) = 0
+C      WRITE(KBOUT,*)' LIST TRANSLATION (UP TO 3 PHASES)'
+C      CALL GETPAR(38,6,IOK,MINLEN,MAXLEN,LINLEN,JSTRAN,ANSE,ITRAN)
+      VALUES(1,38) = 30
+      VALUES(2,38) = 120
+      VALUES(3,38) = 60
+      VALUES(4,38) = 0
+      VALUES(5,38) = 0
+      VALUES(6,38) = 0
+C      WRITE(KBOUT,*)' TRANSLATE IN SIX PHASES'
+C      CALL GETPAR(39,11,IOK,MINLEN,MAXLEN,LINLEN,ITRAN,
+C     CALL GETPAR(39,10,IOK,MINO,MINOPN,JSTRAN,
+C     MINLEN,MAXLEN,LINLEN,ITRAN,JTRAN,INUM,ANSE,
+      VALUES(1,39) = 0
+      VALUES(2,39) = 30
+      VALUES(3,39) = 0
+      VALUES(4,39) = 30
+      VALUES(5,39) = 120
+      VALUES(6,39) = 60
+      VALUES(7,39) = 0
+      VALUES(8,39) = 0
+      VALUES(9,39) = 0
+      VALUES(10,39) = 3
+C      WRITE(KBOUT,*)' TRANSLATION AND WRITE PROTEIN SEQUENCE TO DISK'
+C      CALL GETPAR(40,7,IOK,JSTRAN,ANSE,IOPEN,MINO,MAXO,MINOPN,JSTRAN)
+      VALUES(1,40) = 0
+      VALUES(2,40) = 0
+      VALUES(3,40) = 0
+      VALUES(4,40) = 5
+      VALUES(5,40) = 1000
+      VALUES(6,40) = 30
+      VALUES(7,40) = 0
+C      WRITE(KBOUT,*)' CALCULATE AND WRITE CODON TABLE TO DISK'
+C      CALL GETPAR(41,5,IOK,JSTRAN,NORM,ANSE,ANST,ANSTO)
+      VALUES(1,41) = 0
+      VALUES(2,41) = 0
+      VALUES(3,41) = 0
+      VALUES(4,41) = 0
+      VALUES(5,41) = 0
+C      WRITE(KBOUT,*)'STADEN AND MCLACHLAN CODON USAGE METHOD'
+C      CALL GETPAR(42,7,IOK,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,ANSTY)
+      VALUES(1,42) = 11
+      VALUES(2,42) = 101
+      VALUES(3,42) = 25
+      VALUES(4,42) = 1
+      VALUES(5,42) = 11
+      VALUES(6,42) = 5
+      VALUES(7,42) = 0
+C      WRITE(KBOUT,*)' POSITIONAL BASE PREFERENCES METHOD TO FIND',
+C     +' PROTEIN GENES'
+C      CALL GETPAR(43,9,IOK,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,
+C     +ANSABS,ANSTY,ANSS)
+      VALUES(1,43) = 31
+      VALUES(2,43) = 101
+      VALUES(3,43) = 67
+      VALUES(4,43) = 1
+      VALUES(5,43) = 11
+      VALUES(6,43) = 5
+      VALUES(7,43) = 0
+      VALUES(8,43) = 0
+      VALUES(9,43) = 0
+C      WRITE(KBOUT,*)' UNEVEN POSITIONAL BASE FREQUENCIES METHOD'
+C      CALL GETPAR(44,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,44) = 11
+      VALUES(2,44) = 101
+      VALUES(3,44) = 67
+      VALUES(4,44) = 1
+      VALUES(5,44) = 21
+      VALUES(6,44) = 5
+C      WRITE(KBOUT,*)' SEARCH FOR PROTEIN GENES USING MCLACHLAN, STADEN'
+C      WRITE(KBOUT,*)' AND BOSWELL IMPROBABILITY METHOD'
+C      WRITE(KBOUT,*)' EXPECTING CODON COMPOSITION TO DEPEND ON'
+C      WRITE(KBOUT,*)' BASE COMPOSITION'
+C      CALL GETPAR(45,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,45) = 67
+      VALUES(2,45) = 99
+      VALUES(3,45) = 99
+      VALUES(4,45) = 1
+      VALUES(5,45) = 21
+      VALUES(6,45) = 5
+C      WRITE(KBOUT,*)' SEARCH FOR PROTEIN GENES USING MCLACHLAN, STADEN'
+C      WRITE(KBOUT,*)' AND BOSWELL IMPROBABILITY METHOD'
+C      WRITE(KBOUT,*)' EXPECTING CODON COMPOSITION TO DEPEND ON'
+C      WRITE(KBOUT,*)' AMINO ACID COMPOSITION'
+C      CALL GETPAR(46,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,46) = 67
+      VALUES(2,46) = 99
+      VALUES(3,46) = 99
+      VALUES(4,46) = 1
+      VALUES(5,46) = 21
+      VALUES(6,46) = 5
+C      WRITE(KBOUT,*)' SHEPHERDS METHOD'
+C      CALL GETPAR(47,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,47) = 11
+      VALUES(2,47) = 101
+      VALUES(3,47) = 67
+      VALUES(4,47) = 1
+      VALUES(5,47) = 21
+      VALUES(6,47) = 5
+C      WRITE(KBOUT,*)' FICKETTS METHOD'
+C      CALL GETPAR(48,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,48) = 11
+      VALUES(2,48) = 101
+      VALUES(3,48) = 67
+      VALUES(4,48) = 1
+      VALUES(5,48) = 21
+      VALUES(6,48) = 5
+C      WRITE(KBOUT,*)' TRNA SEARCH '
+C      CALL GETPAR(49,35,IOK,
+C     +MNLEN,MXLEN,MAXLNT,
+C     +MNAC,MXAC,MINACL,
+C     +MNTU,MXTU,MINTU,
+C     +MNAN,MXAN,MINANT,
+C     +MND,MXD,MIND,
+C     +MNI1,MXI1,INT1,
+C     +MNI2,MXI2,INT2,
+C     +MNTUL1,MXTUL1,LTUMIN,
+C     +MNTUL2,MXTUL2,LTUMAX,
+C     +MNB,MXB,MINBAS,
+C     +MNC,MXC,MINCON,IGON,ANSCON)
+      VALUES(1,49) = 70
+      VALUES(2,49) = 130
+      VALUES(3,49) = 92
+      VALUES(4,49) = 0
+      VALUES(5,49) = 14
+      VALUES(6,49) = 11
+      VALUES(7,49) = 0
+      VALUES(8,49) = 10
+      VALUES(9,49) = 8
+      VALUES(10,49) = 0
+      VALUES(11,49) = 10
+      VALUES(12,49) = 8
+      VALUES(13,49) = 0
+      VALUES(14,49) = 8
+      VALUES(15,49) = 3
+      VALUES(16,49) = 0
+      VALUES(17,49) = 30
+      VALUES(18,49) = 0
+      VALUES(19,49) = 30
+      VALUES(20,49) = 30
+      VALUES(21,49) = 0
+      VALUES(22,49) = 4
+      VALUES(23,49) = 12
+      VALUES(24,49) = 6
+      VALUES(25,49) = 6
+      VALUES(26,49) = 12
+      VALUES(27,49) = 9
+      VALUES(28,49) = 0
+      VALUES(29,49) = 44
+      VALUES(30,49) = 0
+      VALUES(31,49) = 0
+      VALUES(32,49) = 0
+      VALUES(33,49) = 0
+      VALUES(34,49) = 0
+      VALUES(35,49) = 1
+C      WRITE(KBOUT,*)' FIND OPEN READING FRAMES'
+C      CALL GETPAR(54,4,IOK,MINO,MAXO,MINOPN,JSTRAN)
+      VALUES(1,54) = 5
+      VALUES(2,54) = 1000
+      VALUES(3,54) = 30
+      VALUES(4,54) = 0
+C      WRITE(KBOUT,*)' PLOT NEGENTROPY'
+C      CALL GETPAR(29,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,59) = 11
+      VALUES(2,59) = 301
+      VALUES(3,59) = 33
+      VALUES(4,59) = 1
+      VALUES(5,59) = 21
+      VALUES(6,59) = 5
+C  SPLICE SEARCH
+      VALUES(1,62) = 0
+C      WRITE(KBOUT,*)'PLOT COMPOSITION DIFFERENCES (OBS-EXP))'
+C      CALL GETPAR(64,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT)
+      VALUES(1,64) = 3
+      VALUES(2,64) = 401
+      VALUES(3,64) = 101
+      VALUES(4,64) = 1
+      VALUES(5,64) = 20
+      VALUES(6,64) = 5
+C  PATTERNS TO DO
+C      OPEN(UNIT=99,FILE='ANALYSEP.PAR',STATUS ='OLD')
+C10    CONTINUE
+C      READ(99,*,ERR=100,END=200)TEMP
+C      IF((TEMP(1).GE.MINOPT).AND.(TEMP(1).LE.MAXOPT)) THEN
+C        DO 20 I = 1,NUMBER-1
+C          VALUES(I,TEMP(1)) = TEMP(I+1)
+C          WRITE(*,*)TEMP(1),TEMP(I+1)
+C20      CONTINUE
+C      END IF
+C      GO TO 10
+C100   CONTINUE
+C      WRITE(*,*)'ERROR IN PARMS FILE'
+C      GO TO 10
+C200   CONTINUE
+C      CLOSE(UNIT = 99)
+      IOK = 0
+      END
+      SUBROUTINE SEARCH(SEQ,IDSEQ,J1,J2,KSTART,
+     +RENZYM,MAXREN,RECSEQ,MAXREC,
+     +NAMES,MAXNAM,CUTADD,PSAVE,NSAVE,SORTA,MAXMAT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,NAMLST,
+     +CUTSIT,PCUT,MAXENZ,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,
+     +IDEVEN,FILNAM,FILE4,FILE6,FILEA,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,POSN,WORDP,MAXDIC,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILEA*(*),FILE4*(*),FILE6*(*),HELPF*(*)
+      CHARACTER SEQ(IDSEQ),RENZYM(MAXREN),RECSEQ(MAXREC),NAMES(MAXNAM)
+      CHARACTER PAA(125)
+      INTEGER OPTION,OPTOUT
+      INTEGER CUTSIT(MAXENZ),PCUT(MAXENZ)
+      INTEGER CUTADD(MAXMAT),PSAVE(MAXMAT),NSAVE(MAXMAT),SORTA(MAXMAT)
+      INTEGER NAMEP(MAXENZ),NAMLEN(MAXENZ),NUMREC(MAXENZ)
+      INTEGER LENEN(MAXENZ)
+      INTEGER RECSTR(MAXENZ),NAMLST(MAXENZ),DIALOG
+      INTEGER POSN(IDSEQ),WORDP(MAXDIC),CONST(0:24)
+      PARAMETER (IBH=256)
+      CALL SHOWFU(KBOUT,'Search for restriction enzyme sites')
+      CALL GETPAR(17,13,IOK,
+     +MINLEN,MAXLEN,LINLEN,MNCUT,MAXCUT,MINCUT,
+     +OPTION,OPTOUT,IFILE,JTRAN,KTRAN,IROUND,IDEF,
+     +IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      INC = 1
+      IDNDEX = 1024
+      LENW = 5
+      LENC = 4*LENW
+      CALL SETCN(CONST,LENW,5,LENC)
+      ITRAN = KTRAN
+10    CONTINUE
+      IDREN  = MAXREN
+      IDRSEQ = MAXREC
+      IDNAML = MAXNAM
+      IDNLST = MAXENZ
+      MAXEN  = MAXENZ
+      IDMAT  = MAXMAT
+      LEVEL = 0
+      CALL SERCHD(RENZYM,IDREN,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,MARGT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,
+     +IDEVEN,FILNAM,FILE4,FILE6,FILEA,KBIN,KBOUT,
+     +OPTION,OPTOUT,NOPOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK,CUTSIT,PCUT,MAXS,LINLEN,ITRAN,
+     +JTRAN,MINLEN,MAXLEN,IROUND,IDEF,MINCUT,MNCUT,MAXCUT)
+      IF(IOK.NE.0) RETURN
+      CALL BUSY(KBOUT)
+      CALL CONNUM(SEQ(J1),POSN(J1),J2-J1+1)
+      CALL ENCOND(POSN(J1),J2-J1+1,WORDP,IDNDEX,5,CONST,LENW,LENC)
+      IF(OPTOUT.EQ.2) THEN
+        CALL FINDL1(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,MINCUT,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      ELSE IF((OPTOUT.EQ.0).AND.(NOPOUT.EQ.1)) THEN
+        CALL FINDL2(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      ELSE IF((OPTOUT.EQ.1).AND.(NOPOUT.EQ.1)) THEN
+        CALL FINDL3(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      ELSE IF((OPTOUT.EQ.0).AND.(NOPOUT.EQ.0)) THEN
+        CALL FINDL4(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      ELSE IF((OPTOUT.EQ.1).AND.(NOPOUT.EQ.0)) THEN
+        CALL FINDL5(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      ELSE IF((OPTOUT.EQ.3).AND.(NOPOUT.EQ.0)) THEN
+        CALL FINDL6(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +  NAMEP,NAMLEN,
+     +  NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +  PSAVE,NSAVE,CUTADD,SORTA,IDMAT,NOPOUT,IBH,LEVEL,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH,ITRAN,LINLEN,PAA,IDEF,
+     +  POSN(J1),WORDP,IDNDEX,CONST,LENW,LENC,IOK)
+      END IF
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      DIALOG = 1
+      GO TO 10
+      END
+      SUBROUTINE SERCHD(RENZYM,IDREN,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,MARGT,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,
+     +IDEVEN,FILNAM,FILE4,FILE6,FILEA,KBIN,KBOUT,
+     +OPTION,OPTOUT,NOPOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK,CUTSIT,PCUT,MAXS,LINLEN,ITRAN,
+     +JTRAN,MINLEN,MAXLEN,IROUND,IDEF,MINCUT,MNCUT,MAXCUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILEA*(*),FILE4*(*),FILE6*(*),HELPF*(*)
+      CHARACTER RENZYM(IDREN),RECSEQ(IDRSEQ),NAMES(IDNAML)
+      INTEGER OPTION,OPTOUT
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN)
+      INTEGER LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),DIALOG
+      INTEGER CUTSIT(IDNLST),PCUT(IDNLST)
+      PARAMETER (MAXPRM = 30)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      IOK = 0
+      NOPOUT = 0
+      CALL GETREN(RENZYM,IDREN,IDEVEN,FILNAM,FILE4,FILE6,FILEA,
+     +IOK,KBIN,KBOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(IDREN.LT.5) THEN
+        IOK = 1
+        RETURN
+      END IF
+      CALL DECREN(RENZYM,IDREN,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,KBOUT,CUTSIT,PCUT,MAXS)
+      IF(MAXEN.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IF(DIALOG.EQ.0) RETURN
+      IOP = OPTION
+      CALL YESONO(IOP,'Search for all names','select names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      OPTION = IOP
+      IF(OPTION.EQ.1)THEN
+        CALL GETNAM(NAMLST,IDNLST,NAMES,IDNAML,NAMEP,NAMLEN,
+     +  MAXEN,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IDNLST.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+      END IF
+      IOP = OPTOUT + 1
+      PROMPT(1) = 'Order results enzyme by enzyme'
+      PROMPT(2) = 'Order results by position'
+      PROMPT(3) = 'Show only infrequent cutters'
+      PROMPT(4) = 'Show names above the sequence'
+      CALL RADION('Select results display mode',PROMPT,4,IOP,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      OPTOUT = IOP - 1
+      IF(OPTOUT.EQ.2) THEN
+        CALL GETINT(MNCUT,MAXCUT,MINCUT,'Maximum number of cuts',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        MINCUT = IVAL
+      END IF
+      IF(OPTOUT.EQ.3) THEN
+        IOP = JTRAN
+        CALL YESONO(IOP,'Hide translation','Show translation',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.0) THEN
+          IOK = 1
+          RETURN
+        END IF
+        JTRAN = IOP
+        IF(JTRAN.EQ.1) THEN
+          IOP = ITRAN
+          IF(IOP.EQ.3) IOP = 1
+          CALL YESONO(IOP,'Use 1 letter codes',
+     +    'Use 3 letter codes',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IOP.LT.0) THEN
+            IOK = 1
+            RETURN
+          END IF
+          IF(IOP.EQ.1) ITRAN = 3
+          IF(IOP.EQ.0) ITRAN = 0
+        ELSE IF(JTRAN.EQ.0) THEN
+          ITRAN = 1
+        END IF
+        IF(ITRAN.NE.1) THEN
+          MAXLEN = 90
+          MINLEN = 30
+        END IF
+        LENDEF = LINLEN
+        CALL GETINT(MINLEN,MAXLEN,LENDEF,'Line length',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        LINLEN = IVAL
+        IF((ITRAN.NE.1).AND.(MOD(LINLEN,3).NE.0)) LINLEN = 60
+      END IF
+      IOP = NOPOUT
+      IF((OPTOUT.EQ.0).OR.(OPTOUT.EQ.1)) THEN
+        CALL YESONO(IOP,'List matches','Plot matches',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.0) THEN
+          IOK = 1
+          RETURN
+        END IF
+        NOPOUT = IOP
+      END IF
+C      IF(NOPOUT.EQ.1)THEN
+C        LEVEL = LEVELI
+C        CALL SD2(IBLIPH,IBH,LEVEL,MARGT,
+C     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C        LEVELI = LEVEL
+C        IF(IOK.NE.0) RETURN
+C      END IF
+C      CALL GETINT(MININC,MAXINC,INC,'Step',
+C     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+C      INC = IVAL
+      IOP = IROUND
+      CALL YESONO(IOP,'The sequence is linear',
+     +'The sequence is circular',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IROUND = IOP
+      IOP = IDEF
+      CALL YESONO(IOP,'Search for definite matches',
+     +'Search for possible matches',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IDEF = IOP
+      END
+      SUBROUTINE FINDL1(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,MINCUT,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+C   OPTOUT=0 ENZYME BY ENZYME
+C   LOW FREQUENCY CUTTERS
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      DO 100 IEN=1,LAST
+        IFOUND=0
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) RETURN
+        IF(IFOUND.LE.MINCUT) THEN
+          K1=NAMEP(JEN)
+          K2=K1+NAMLEN(JEN)-1
+          WRITE(IDEV,1009)IFOUND,(NAMES(K),K=K1,K2)
+1009      FORMAT(' ',I6,' ',20A1)
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE FINDL2(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*)
+      CHARACTER*20 ATOS
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+      EXTERNAL ATOS
+C   OPTOUT=0 ENZYME BY ENZYME
+C   NOPOUT=1 GRAPHICAL OUTPUT
+C   FOR GRAPHICS NEED TO CHECK FOR END OF SCREEN
+      CALL CLEARV
+      CALL VECTOM
+      LEVEL1=LEVEL
+      LEVELM=MARGT-IBH
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      DO 100 IEN=1,LAST
+        IFOUND=0
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) THEN
+          CALL VT100M
+          RETURN
+        END IF
+        IF(IFOUND.GT.0) THEN
+          IF(LEVEL.GE.LEVELM)THEN
+            CALL PSRCHX(LEVEL1,LEVEL,KBIN,KBOUT,IQUIT,
+     +      IHELPS,IHELPE,HELPF,IDEVH)
+            IF(IQUIT.EQ.1) THEN
+              CALL VT100M
+              RETURN
+            END IF
+          END IF
+          CALL PLSRCH(J1-KSTART+1,J2-KSTART+1,
+     +    PSAVE,IFOUND,IBH,LEVEL,
+     +    ATOS(NAMES(NAMEP(JEN)),NAMLEN(JEN)),
+     +    NAMLEN(JEN),
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE FINDL3(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+C   OPTOUT=1 ALL ENZYMES TOGETHER
+C   NOPOUT=1 GRAPHICAL OUTPUT
+C   FOR GRAPHICS NEED TO CHECK FOR END OF SCREEN
+      CALL CLEARV
+      CALL VECTOM
+      LEVEL1=LEVEL
+      LEVELM=MARGT-IBH
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      IFOUND=0
+      DO 100 IEN=1,LAST
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) THEN
+          CALL VT100M
+          RETURN
+        END IF
+100   CONTINUE
+      IF(IFOUND.GT.0)THEN
+        CALL PLSRCH(J1-KSTART+1,J2-KSTART+1,
+     +  PSAVE,IFOUND,IBH,LEVEL,'ALL',3,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      CALL VT100M
+      IF(IFOUND.EQ.0)WRITE(IDEV,3000)
+3000  FORMAT(' No matches found')
+      END
+      SUBROUTINE FINDL4(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+C   OPTOUT=0 ENZYME BY ENZYME
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      DO 100 IEN=1,LAST
+        IFOUND=0
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) RETURN
+        IF(IFOUND.GT.0) THEN
+          WRITE(IDEV,1010)IFOUND
+1010      FORMAT('  Matches found=',I6)
+          CALL BUB3AS(PSAVE,NSAVE,CUTADD,IFOUND)
+C         calc lengths
+          IF(IROUND.EQ.0) THEN
+            LENN = J1 - KSTART + 1
+            SORTA(1) = PSAVE(1) - LENN
+          ELSE IF(IROUND.EQ.1) THEN
+            LENN = J2 - PSAVE(IFOUND) + 1
+            SORTA(1) = LENN + PSAVE(1) - (J1 - KSTART + 1)
+            LENN = 1 - LENN
+          END IF
+          DO 45 KOUT=2,IFOUND
+            SORTA(KOUT)=PSAVE(KOUT)-PSAVE(KOUT-1)
+45        CONTINUE
+C         do length to end
+          IFP1 = IFOUND
+          IF(IROUND.EQ.0) THEN
+            SORTA(IFOUND+1)=J2-KSTART+1-PSAVE(IFOUND)+1
+            IFP1=IFOUND+1
+          END IF
+          CALL BUBBLE(SORTA,IFP1)
+          CALL S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +    NSAVE,PSAVE,SORTA,MAXMAT,NAMES,IDNAML,
+     +    CUTSIT,MAXS,CUTADD,IROUND,
+     +    LENN,IDEV,J2,KSTART,0)
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE FINDL5(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+C   OPTOUT=1 ALL ENZYMES TOGETHER 
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      IFOUND=0
+      DO 100 IEN=1,LAST
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) RETURN
+100   CONTINUE
+      IF(IFOUND.GT.0) THEN
+        CALL BUB3AS(PSAVE,NSAVE,CUTADD,IFOUND)
+        IF(IROUND.EQ.0) THEN
+          LENN = J1 - KSTART + 1
+          SORTA(1) = PSAVE(1) - LENN
+        ELSE IF(IROUND.EQ.1) THEN
+          LENN = J2 - PSAVE(IFOUND) + 1
+          SORTA(1) = LENN + PSAVE(1) - (J1 - KSTART + 1)
+          LENN = 1 - LENN
+        END IF
+        DO 45 KOUT=2,IFOUND
+          SORTA(KOUT)=PSAVE(KOUT)-PSAVE(KOUT-1)
+45      CONTINUE
+        CALL S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +  NSAVE,PSAVE,SORTA,MAXMAT,NAMES,IDNAML,
+     +  CUTSIT,MAXS,CUTADD,IROUND,
+     +  LENN,IDEV,J2,KSTART,1)
+      ELSE
+        WRITE(IDEV,1010)
+1010    FORMAT(' No matches found')
+      END IF
+      END
+      SUBROUTINE FINDL6(SEQ,IDSEQ,J1,J2,KSTART,RECSEQ,IDRSEQ,
+     +NAMES,IDNAML,NAMEP,NAMLEN,
+     +NUMREC,LENEN,RECSTR,MAXEN,NAMLST,IDNLST,OPTION,OPTOUT,
+     +  PCUT,CUTSIT,MAXS,IROUND,
+     +PSAVE,NSAVE,CUTADD,SORTA,MAXMAT,NOPOUT,IBH,LEVEL,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,INC,KBIN,KBOUT,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,ITRAN,LINLEN,PAA,IDEF,
+     +POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),RECSEQ(IDRSEQ)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXS)
+      INTEGER RECSTR(MAXEN),NAMLST(IDNLST),PSAVE(MAXMAT)
+      INTEGER NSAVE(MAXMAT),OPTION,OPTOUT
+      INTEGER CUTSIT(MAXS),PCUT(MAXEN),CUTADD(MAXMAT),SORTA(MAXMAT)
+      CHARACTER HELPF*(*),PAA(125)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC)
+C   OPTOUT=3 NAMES ABOVE SEQUENCE
+C   names max length 20 chars, strings 20 chars
+      LAST=MAXEN
+      IF(OPTION.EQ.1)LAST=IDNLST
+      IFOUND=0
+      DO 100 IEN=1,LAST
+        JEN=IEN
+        IF(OPTION.EQ.1)JEN=NAMLST(IEN)
+        NREC=NUMREC(JEN)
+        IF(IDEF.EQ.0) THEN
+          CALL S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,
+     +    POSN,WORDP,IDNDEX,
+     +    CONST,LENW,LENC,KBOUT,IOK)
+        ELSE
+          CALL S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +    RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +    PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+        END IF
+        IF(IOK.NE.0) RETURN
+100   CONTINUE
+      IF(IFOUND.GT.0) THEN
+        CALL BUB2AS(PSAVE,NSAVE,IFOUND)
+        CALL PETER(SEQ,IDSEQ,J1,J2,KSTART,
+     +  NAMES,IDNAML,NAMEP,NAMLEN,MAXEN,
+     +  PSAVE,NSAVE,IDEV,IFOUND,LINLEN,PAA,ITRAN)
+      ELSE
+        WRITE(IDEV,1010)
+1010    FORMAT(' No matches')
+      END IF
+      END
+      SUBROUTINE S1D0(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,POSN,WORDP,IDNDEX,
+     +CONST,LENW,LENC,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),RECSEQ(IDRSEQ),TEMP3(40)
+      INTEGER RECSTR(MAXEN),PSAVE(MAXMAT),NSAVE(MAXMAT)
+      INTEGER PCUT(MAXEN),CUTSIT(MAXS),CUTADD(MAXMAT),SPOINT
+      INTEGER LENEN(MAXS)
+      INTEGER POSN(IDSEQ),WORDP(IDNDEX),CONST(0:LENC),GENRCI
+      EXTERNAL MC,GENRCI
+      IOK = 1
+      LREC = 0
+      DO 50 IREC=1,NREC
+C       POINT TO START OF STRING
+        ISTR=RECSTR(JEN)+(IREC-1)*(LREC)
+        SPOINT = PCUT(JEN) + IREC - 1
+        LREC = LENEN (SPOINT)
+C       FIND FIRST AND LAST NON N CHARS IN STRING
+        CALL GETNON(RECSEQ(ISTR),LREC,INON,LNON)
+C       CHECK FOR ALL N'S IN STRING!!
+        IF(LNON.GT.0)THEN
+          ISTR = ISTR + INON - 1
+          IENTRY = 0
+5         CONTINUE
+          NINDEX = GENRCI(RECSEQ(ISTR),LNON,LENW,CONST,LENC,IENTRY)
+          IF(NINDEX.NE.0) THEN
+            K1 = WORDP(NINDEX)
+            IF(K1.NE.0) THEN
+              IMATCH = 0
+              K2 = K1 + J1 - 1
+              IF(LNON.GT.LENW) THEN
+                IF((K2+LNON-1).LE.IDSEQ) THEN
+                  IMATCH = MC(SEQ(K2+LENW),RECSEQ(ISTR+LENW),
+     +            LNON-LENW)
+                ELSE
+                  IMATCH = 1
+                END IF
+              END IF
+              IF(IMATCH.EQ.0) THEN
+                IFOUND = IFOUND + 1
+                IF(IFOUND.GT.MAXMAT)THEN
+                  CALL ERROM(KBOUT,'Too many matches')
+                  RETURN
+                END IF
+                NSAVE(IFOUND) = JEN
+C  NEED THE CUTPOINT FOR THIS STRING; THE FIRST ONE THIS NAME
+C  IS AT PCUT(JEN) IN ARRAY CUTSIT, SO THIS ONE IS AT
+C  PCUT(JEN)+IREC-1 = SPOINT IN CUTSIT
+                CUTADD(IFOUND) = SPOINT
+C               SAVE POSITION RELATIVE TO ARRAY
+                KT = K2 + CUTSIT(SPOINT) - INON + 1
+                IF(KT.GT.IDSEQ)KT = KT - IDSEQ
+                IF(KT.LT.0)KT = ABS(KT) + 1
+                PSAVE(IFOUND) = KT
+              END IF
+10            CONTINUE
+              K3 = K1
+              K1 = POSN(K3)
+              IF(K1.NE.0) THEN
+                IMATCH = 0
+                K2 = K1 + J1 - 1
+                IF(LNON.GT.LENW) THEN
+                  IF((K2+LNON-1).LE.IDSEQ) THEN
+                    IMATCH = MC(SEQ(K2+LENW),RECSEQ(ISTR+LENW),
+     +              LNON-LENW)
+                  ELSE
+                    IMATCH = 1
+                  END IF
+                END IF
+                IF(IMATCH.EQ.0) THEN
+                  IFOUND = IFOUND + 1
+                  IF(IFOUND.GT.MAXMAT)THEN
+                    CALL ERROM(KBOUT,'Too many matches')
+                    RETURN
+                  END IF
+                  NSAVE(IFOUND) = JEN
+                  CUTADD(IFOUND) = SPOINT
+                  KT = K2 + CUTSIT(SPOINT) - INON + 1
+                  IF(KT.GT.IDSEQ)KT = KT - IDSEQ
+                  IF(KT.LT.0)KT = ABS(KT) + 1
+                  PSAVE(IFOUND) = KT
+                END IF
+                GO TO 10  
+              END IF
+            END IF
+            GO TO 5
+          END IF
+          IF(LNON.GE.IDSEQ)GO TO 50
+          IF(IROUND.NE.1)GO TO 50
+          IAT = 0
+          LNONM1 = LNON - 1
+          IF(LNONM1.GT.0)THEN
+            CALL SQCOPY(SEQ(IDSEQ-LNON+2),TEMP3,LNONM1)
+            CALL SQCOPY(SEQ,TEMP3(LNON),LNONM1)
+          END IF
+51        CONTINUE
+          IAT = IAT + 1
+          ILEFT = 2 * LNONM1 - IAT + 1
+          IF(ILEFT.GT.0)THEN
+            CALL FIND6(TEMP3(IAT),ILEFT,RECSEQ(ISTR),LNON,INC,IMATCH)
+            IF(IMATCH.GT.0)THEN
+              IFOUND = IFOUND + 1
+              IF(IFOUND.GT.MAXMAT)THEN
+                CALL ERROM(KBOUT,'Too many matches')
+                RETURN
+              END IF
+              NSAVE(IFOUND)=JEN
+C  NEED THE CUTPOINT FOR THIS STRING; THE FIRST ONE THIS NAME
+C  IS AT PCUT(JEN) IN ARRAY CUTSIT, SO THIS ONE IS AT
+C  PCUT(JEN)+IREC-1 = SPOINT IN CUTSIT
+              CUTADD(IFOUND) = SPOINT
+              IAT = IAT + IMATCH - 1
+              KT=IDSEQ - LNON + 1 + IAT + CUTSIT(SPOINT) - INON + 1
+              IF(KT.GT.IDSEQ) KT = KT - IDSEQ
+              IF(KT.LT.1) KT = ABS(KT) + 1
+              PSAVE(IFOUND) = KT
+            END IF
+            GO TO 51
+          END IF
+        END IF
+50    CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE S1D1(SEQ,IDSEQ,NREC,JEN,IREC,J1,J2,INC,KSTART,
+     +RECSTR,MAXEN,PSAVE,NSAVE,MAXMAT,RECSEQ,IDRSEQ,IFOUND,
+     +PCUT,CUTSIT,LENEN,MAXS,CUTADD,IROUND,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),RECSEQ(IDRSEQ),TEMP3(40)
+      INTEGER RECSTR(MAXEN),PSAVE(MAXMAT),NSAVE(MAXMAT)
+      INTEGER PCUT(MAXEN),CUTSIT(MAXS),CUTADD(MAXMAT),SPOINT
+      INTEGER LENEN(MAXS)
+      IOK = 1
+      LREC = 0
+      DO 50 IREC=1,NREC
+C       POINT TO START OF STRING
+        ISTR=RECSTR(JEN)+(IREC-1)*(LREC)
+C       NOTE WE NEED TO POINT TO THE CORRECT ARRAY ELEMENT BUT
+C       TO REMEMBER THE SEQUENCE ELEMENT FOR OUTPUT
+C       IAT IS THE ARRAY ELEMENT
+        IAT = J1-INC-KSTART+1
+        SPOINT = PCUT(JEN) + IREC - 1
+        LREC = LENEN (SPOINT)
+C       FIND FIRST AND LAST NON N CHARS IN STRING
+        CALL GETNON(RECSEQ(ISTR),LREC,INON,LNON)
+C       CHECK FOR ALL N'S IN STRING!!
+        IF(LNON.GT.0)THEN
+          ISTR = ISTR + INON - 1
+10        CONTINUE
+          IAT = IAT+INC
+          ILEFT = J2-IAT+1-KSTART+1
+          IF(ILEFT.GT.0)THEN
+            CALL FIND7(SEQ(IAT),ILEFT,
+     +      RECSEQ(ISTR),LNON,INC,IMATCH)
+            IF(IMATCH.GT.0)THEN
+              IFOUND = IFOUND + 1
+              IF(IFOUND.GT.MAXMAT)THEN
+                CALL ERROM(KBOUT,'Too many matches')
+                RETURN
+              END IF
+              IAT = IAT + IMATCH - 1
+              NSAVE(IFOUND) = JEN
+C  NEED THE CUTPOINT FOR THIS STRING; THE FIRST ONE THIS NAME
+C  IS AT PCUT(JEN) IN ARRAY CUTSIT, SO THIS ONE IS AT
+C  PCUT(JEN)+IREC-1 = SPOINT IN CUTSIT
+              CUTADD(IFOUND) = SPOINT
+C             SAVE POSITION RELATIVE TO ARRAY
+              KT = IAT + CUTSIT(SPOINT) - INON + 1
+              IF(KT.GT.IDSEQ)KT = KT - IDSEQ
+              IF(KT.LT.0)KT = ABS(KT) + 1
+              PSAVE(IFOUND) = KT
+              GO TO 10
+            END IF
+          END IF
+          IF(LNON.GE.IDSEQ)GO TO 50
+          IF(IROUND.NE.1)GO TO 50
+          IAT = 0
+          LNONM1 = LNON - 1
+          IF(LNONM1.GT.0)THEN
+            CALL SQCOPY(SEQ(IDSEQ-LNON+2),TEMP3,LNONM1)
+            CALL SQCOPY(SEQ,TEMP3(LNON),LNONM1)
+          END IF
+51        CONTINUE
+          IAT = IAT + 1
+          ILEFT = 2 * LNONM1 - IAT + 1
+          IF(ILEFT.GT.0)THEN
+            CALL FIND7(TEMP3(IAT),ILEFT,RECSEQ(ISTR),LNON,INC,IMATCH)
+            IF(IMATCH.GT.0)THEN
+              IFOUND = IFOUND + 1
+              IF(IFOUND.GT.MAXMAT)THEN
+                CALL ERROM(KBOUT,'Too many matches')
+                RETURN
+              END IF
+              NSAVE(IFOUND)=JEN
+C  NEED THE CUTPOINT FOR THIS STRING; THE FIRST ONE THIS NAME
+C  IS AT PCUT(JEN) IN ARRAY CUTSIT, SO THIS ONE IS AT
+C  PCUT(JEN)+IREC-1 = SPOINT IN CUTSIT
+              CUTADD(IFOUND) = SPOINT
+              IAT = IAT + IMATCH - 1
+              KT=IDSEQ - LNON + 1 + IAT + CUTSIT(SPOINT) - INON + 1
+              IF(KT.GT.IDSEQ) KT = KT - IDSEQ
+              IF(KT.LT.1) KT = ABS(KT) + 1
+              PSAVE(IFOUND) = KT
+            END IF
+            GO TO 51
+          END IF
+        END IF
+50    CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE DECREN(RENZYM,IDREN,RECSEQ,IDRSEQ,NAMES,IDNAML,
+     +NAMEP,NAMLEN,NUMREC,LENEN,RECSTR,MAXEN,KBOUT,CUTSIT,PCUT,MAXS)
+C   ROUTINE TO DECODE A RESTRICTION ENZYME FILE ARRAY RENZYM IDREN
+C   INTO THE FOLLOWING:
+C   A CONCATENATED ARRAY OF RECOGNITION SEQUENCES RECSEQ IDRSEQ PREC
+C   A CONCATENATED ARRAY OF ENZYME NAMES NAMES IDNAML,PNAM
+C   A LIST OF POINTERS TO ENZYME NAMES IN NAMES NAMEP MAXEN ITOTEN
+C   A LIST OF ENZYME NAME LENGTHS NAMLEN MAXEN ITOTEN
+C   A LIST OF NUMBERS OF RECOGNITION SEQUENCES PER ENZYME NUMREC MAXEN
+C   A LIST OF LENGHTS OF RECOGNITION SEQUENCES PER RECOGNITION SEQUENCE
+C   LENEN MAXEN ITOTEN
+C   A LIST OF POINTERS TO THE FIRST CHARACTER OF THE FIRST RECOGNITION
+C   SEQUENCE PER ENZYME RECSTR MAXEN ITOTEN
+C   A LIST OF CUTSITES PER SEQUENCE CUTSIT
+C   A LIST OF START POSITIONS FOR CUTSITES PER NAME PCUT (ALSO USED
+C   FOR LENGTHS OF RECOGNITION SEQUENCES)
+C   AUTHOR: RODGER STADEN
+      CHARACTER RENZYM(IDREN),RECSEQ(IDRSEQ),NAMES(IDNAML)
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),NUMREC(MAXEN),LENEN(MAXEN)
+      INTEGER RECSTR(MAXEN),TOTEN,PREC,PREN,PNAM
+      INTEGER CUTSIT(MAXEN),PCUT(MAXEN)
+      CHARACTER CUT
+      EXTERNAL INDEXA
+      SAVE CUT
+      DATA CUT/''''/
+C NB CUTSIT SHOULD REALLY BE DIMENSIONED TO BE LARGER THAN THE 
+C MAX NUMBER OF ENZYMES AS IT WILL BE LONGER
+C
+C   set enzyme count to zero
+      TOTEN=0
+C   point to string store
+      PREC=1
+C   point to renzym
+      PREN=0
+C   point to names array
+      PNAM=1
+C   SET NUMBER OF CUTPOINTS TO ZERO. IT IS EQUAL TO THE NUMBER OF 
+C   OF RECOGNITION SEQUENCES. NOTE THEY CAN ALMOST ALL BE CODED AS SINGLE
+C   SEQUENCES USING THE NC-IUB CODES, BUT SOMETIMES, IF THE CUTSITE IS
+C   NON-SYMMETRICAL TWO ARE NEEDED
+      ICUT = 0
+C
+10    CONTINUE
+      PREN=PREN+1
+      IF(PREN.LT.IDREN)THEN
+C   some data left
+C   should be pointing at start of name, count its chars
+        CALL CHARCT(RENZYM,IDREN,PREN,NCHAR)
+        IF(NCHAR.GT.0)THEN
+C   found a name
+C   increment number of enzymes
+          TOTEN=TOTEN+1
+          IF(TOTEN.LT.MAXEN)THEN
+C   space left for enzymes
+            IF((PNAM+NCHAR-1).GT.IDNAML)THEN
+              CALL ERROM(KBOUT,'Names array full')
+              GO TO 101
+            END IF
+C   space for name
+C   store name start position, length of name
+            NAMEP(TOTEN)=PNAM
+            NAMLEN(TOTEN)=NCHAR
+            CALL SQCOPY(RENZYM(PREN),NAMES(PNAM),NCHAR)
+            PNAM=PNAM+NCHAR
+C   count number of strings this enzyme
+            NREC=0
+20          CONTINUE
+C   point to first char of string
+            PREN=PREN+NCHAR+1
+C   count chars in string
+            NCHAR=0
+            IF(PREN.LT.IDREN)CALL CHARCT(RENZYM,IDREN,PREN,NCHAR)
+            IF(NCHAR.EQ.0)THEN
+C   no chars in string. If first string then error, else no more strings
+              IF(NREC.EQ.0)THEN
+                CALL ERROM(KBOUT,'Name with no strings')
+C   decrease number of enzymes
+                TOTEN=TOTEN-1
+                GO TO 100
+              END IF
+C   some strings
+C   save number of strings this enzyme
+              NUMREC(TOTEN)=NREC
+C
+              GO TO 10
+            END IF
+C   nonzero string length
+            NREC=NREC+1
+******************************************
+C   SAVE ITS START POSITION IN RECSEQ IF FIRST THIS NAME
+            IF(NREC.EQ.1)RECSTR(TOTEN)=PREC
+C FIND CUTSITE
+            JCUT=INDEXA(RENZYM(PREN),NCHAR,CUT)
+C IF JCUT IS ZERO THEN NO CUTSITE MARKED
+            IF(ICUT.GE.MAXEN)GO TO 200
+            ICUT = ICUT + 1
+            CUTSIT(ICUT) = 0
+C IF FIRST CUTSITE THIS NAME SAVE ITS POINTER
+            IF(NREC.EQ.1)PCUT(TOTEN) = ICUT
+            IF(JCUT.GT.0)THEN
+              NL = JCUT - 1
+              NR = NCHAR - JCUT
+              CUTSIT(ICUT) = NL
+C  COPY UP TO CUT
+              IF(NL.GT.0)THEN
+                CALL SQCOPY(RENZYM(PREN),RECSEQ(PREC),NL)
+                PREC = PREC + NL
+              END IF
+C  COPY AFTER CUT
+              IF(NR.GT.0)THEN
+                IPREN = PREN + NL + 1
+                CALL SQCOPY(RENZYM(IPREN),RECSEQ(PREC),NR)
+                PREC = PREC + NR
+              END IF
+              LENEN(ICUT) = NCHAR - 1
+C  (REDUCED NCHAR BY 1 TO ALLOW FOR CUT)
+            ELSE
+C
+C  NO CUT SITE MARKED, USE OLD CODE
+******************************************
+C   copy string to recseq
+              CALL SQCOPY(RENZYM(PREN),RECSEQ(PREC),NCHAR)
+C
+              PREC=PREC+NCHAR
+              LENEN(ICUT) = NCHAR
+            END IF
+C   now look for next string
+            GO TO 20
+          END IF
+C   too many enzymes
+          CALL ERROM(KBOUT,'Too many names')
+          GO TO 100
+        END IF
+C   deal with name of zero length
+        CALL ERROM(KBOUT,'Error in names and strings file')
+      END IF
+C   now at end of file
+100   CONTINUE
+101   CONTINUE
+C   return length of names array
+      IDNAML=PNAM-1
+C   return number of enzymes
+      MAXEN=TOTEN
+C   return length of recseq
+      IDRSEQ=PREC
+C  RETURN NUMBER OF RECOGNITION SEQUENCES
+      MAXS = ICUT
+      RETURN
+200   CONTINUE
+C  COME HERE WITH OVERFLOW OF CUTSITES
+C
+C  REDUCE NUMBER OF NAMES BY 1
+      MAXEN = TOTEN - 1
+      IDNAML = PNAM - 1
+      IDRSEQ = PREC
+C  RETURN NUMBER OF RECOGNITION SEQUENCES
+      MAXS = ICUT
+      END
+      SUBROUTINE GETREN(RENZYM,IDREN,IDEV,FILNAM,FILE4,FILE6,FILEA,
+     + IOK,KBIN,KBOUT,IFILE,DIALOG,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FILE4*(*),FILE6*(*),FILEA*(*),RENZYM(IDREN)
+      INTEGER DIALOG
+      PARAMETER (MAXPRM = 16)
+      CHARACTER PROMPT(5)*(MAXPRM)
+      IOK = 1
+      IF(DIALOG.EQ.0) NOPT = IFILE
+      IOP = 1
+1     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        IOP = 1
+        PROMPT(1) = 'Search'
+        PROMPT(2) = 'List enzyme file'
+        PROMPT(3) = 'Clear text'
+        PROMPT(4) = 'Clear graphics'
+        CALL RADION('Select operation',PROMPT,4,IOP,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+        IF(IOP.EQ.3) THEN
+          CALL CLEARV
+          CALL BPAUSE(KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+          GO TO 1
+        END IF
+        IF(IOP.EQ.4) THEN
+          CALL CLEARG
+          CALL BPAUSE(KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+          GO TO 1
+        END IF
+        IF((IOP.EQ.1).OR.(IOP.EQ.2)) THEN
+          NOPT = IFILE
+          PROMPT(1) = 'All enzymes file'
+          PROMPT(2) = 'Six cutter file'
+          PROMPT(3) = 'Four cutter file'
+          PROMPT(4) = 'Personal file'
+          PROMPT(5) = 'Keyboard'
+          CALL RADION('Select input source',PROMPT,5,NOPT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(NOPT.LT.1) THEN
+              IOK = 1
+            RETURN
+          END IF
+        END IF
+      END IF
+      IF(NOPT.EQ.3)THEN
+        CALL OPENRS(IDEV,FILE4,IOK,LRECL,2)
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.2)THEN
+        CALL OPENRS(IDEV,FILE6,IOK,LRECL,2)
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.1)THEN
+        CALL OPENRS(IDEV,FILEA,IOK,LRECL,2)
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.4)THEN
+10      CONTINUE
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Restriction enzyme file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.2) GO TO 1
+        IF(IOK.NE.0)GO TO 100
+        IF(IOP.EQ.2)THEN
+          CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+          GO TO 1
+        END IF
+        CALL ARRFIL(IDEV,RENZYM,IDREN,KBOUT)
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      IF(NOPT.EQ.5)THEN
+        CALL GETRKB(RENZYM,IDREN,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +  IOK)
+        RETURN
+      END IF
+      GO TO 1
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error opening file')
+      IOK=1
+      GO TO 1
+      END
+      SUBROUTINE FIND6(SEQ,IDIM1,STRING,IDIM2,INC,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER PSEQ,PSTR
+      EXTERNAL IUBM1
+      PSEQ=1-1*INC
+      PSTR=1
+      IMATCH=0
+100   CONTINUE
+        PSEQ=PSEQ+1-PSTR
+        PSEQ=PSEQ+INC-1
+        PSTR=0
+500     CONTINUE
+          PSEQ=PSEQ+1
+          IF(PSEQ.GT.IDIM1)RETURN
+          PSTR=PSTR+1
+          IUBMAT = IUBM1(STRING(PSTR),SEQ(PSEQ))
+        IF(IUBMAT.EQ.0)GO TO 100
+      IF(PSTR.LT.IDIM2)GO TO 500
+      IMATCH=PSEQ-IDIM2+1
+      END
+      SUBROUTINE FIND7(SEQ,IDIM1,STRING,IDIM2,INC,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER PSEQ,PSTR
+      EXTERNAL IUBM2
+      PSEQ=1-1*INC
+      PSTR=1
+      IMATCH=0
+100   CONTINUE
+        PSEQ=PSEQ+1-PSTR
+        PSEQ=PSEQ+INC-1
+        PSTR=0
+500     CONTINUE
+          PSEQ=PSEQ+1
+          IF(PSEQ.GT.IDIM1)RETURN
+          PSTR=PSTR+1
+          IUBMAT = IUBM2(STRING(PSTR),SEQ(PSEQ))
+        IF(IUBMAT.EQ.0)GO TO 100
+      IF(PSTR.LT.IDIM2)GO TO 500
+      IMATCH=PSEQ-IDIM2+1
+      END
+      SUBROUTINE GETNON(STRING,IDS,LNOTN,RNOTN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STRING(IDS),CHAR
+      INTEGER RNOTN
+      EXTERNAL NNDEXL,NNDEXR
+      PARAMETER (CHAR='N')
+C  FIND ENDS OF CHARACTER STRING THAT ARE NOT N
+C  RNOTN IS RETURNED AS THE LENGTH OF THE STRING WITHOUT N'S
+C  IT WILL BE < 1 IF THE WHOLE STRING IS N'S
+      LNOTN = NNDEXL(STRING,IDS,CHAR)
+      RNOTN = NNDEXR(STRING,IDS,CHAR)
+      RNOTN = RNOTN - LNOTN + 1
+      END
+      INTEGER FUNCTION NNDEXL(STRING,ID,CHAR)
+      CHARACTER STRING(ID),CHAR
+C  FUNCTION TO FIND FIRST NON-OCCURRENCE OF CHAR IN STRING
+      DO 10 I = 1,ID
+        IF(STRING(I).NE.CHAR)THEN
+          NNDEXL = I
+          RETURN
+        END IF
+10    CONTINUE
+      NNDEXL = ID + 1
+      END
+      INTEGER FUNCTION NNDEXR(STRING,ID,CHAR)
+      CHARACTER STRING(ID),CHAR
+C  FUNCTION TO FIND FIRST NON-OCCURRENCE OF CHAR IN STRING
+      DO 10 I = ID,1,-1
+        IF(STRING(I).NE.CHAR)THEN
+          NNDEXR = I
+          RETURN
+        END IF
+10    CONTINUE
+      NNDEXR = 0
+      END
+      SUBROUTINE S2(SEQ,IDSEQ,IFOUND,NAMEP,NAMLEN,LENEN,MAXEN,
+     +NSAVE,PSAVE,LENGTH,MAXMAT,NAMES,IDNAML,
+     +CUTSIT,MAXS,CUTADD,IROUND,
+     +LENN,IDEV,J2,KSTART,JOB)
+C 19-2-91 Changed idt and idtrs from 20 each to 15 and 35
+C         so names have max length 15 chars and recognition seqs 35
+C         for this part of the code.
+      PARAMETER (IDT = 15, IDTRS = 35)
+      CHARACTER SEQ(IDSEQ),TEMP1(IDT),TEMP2(IDTRS),NAMES(IDNAML),CUTSYM
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),LENEN(MAXS)
+      INTEGER NSAVE(MAXMAT),PSAVE(MAXMAT),LENGTH(MAXMAT)
+      INTEGER CUTSIT(MAXS),CUTADD(MAXMAT)
+      EXTERNAL LWRAP3
+      SAVE CUTSYM
+      DATA CUTSYM/''''/
+      WRITE(IDEV,1011)
+1011  FORMAT(
+     +'      Name            Sequence',
+     +'                          Position',
+     +' Fragment lengths')
+      DO 60 KOUT=1,IFOUND
+        KR1=NAMEP(NSAVE(KOUT))
+        KS1=PSAVE(KOUT) - CUTSIT(CUTADD(KOUT))
+        CALL FILLC(TEMP1,IDT,' ')
+        CALL FILLC(TEMP2,IDTRS,' ')
+        KR2 = NAMLEN(NSAVE(KOUT))
+        CALL SQCOPY(NAMES(KR1),TEMP1,KR2)
+        ITEMP1 = 0
+        DO 58 ITEMP = 1,CUTSIT(CUTADD(KOUT))
+          ITEMP1 = ITEMP
+          KS2 = LWRAP3(J2,KS1)
+          TEMP2(ITEMP) = SEQ(KS2)
+          KS1 = KS1 + 1
+58      CONTINUE
+        ITEMP1 = ITEMP1 + 1
+        TEMP2(ITEMP1) = CUTSYM
+        ITEMP1 = ITEMP1 + 1
+        DO 59 ITEMP = ITEMP1,LENEN(CUTADD(KOUT))+1
+          KS2 = LWRAP3(J2,KS1)
+          TEMP2(ITEMP) = SEQ(KS2)
+          KS1 = KS1 + 1
+59      CONTINUE
+        LENN=PSAVE(KOUT)-LENN
+        IF(JOB.EQ.0) THEN
+          WRITE(IDEV,1008)KOUT,
+     +    TEMP1,TEMP2,PSAVE(KOUT)+KSTART-1,LENN,LENGTH(KOUT)
+        ELSE IF(JOB.EQ.1) THEN
+          WRITE(IDEV,1008)KOUT,
+     +    TEMP1,TEMP2,PSAVE(KOUT)+KSTART-1,LENGTH(KOUT)
+        END IF
+        LENN=PSAVE(KOUT)
+1008    FORMAT(' ',I4,' ',15A1,1X,35A1,1X,I6,1X,I6,1X,I6)
+60    CONTINUE
+      IF(IROUND.EQ.0)THEN
+C       need length to end
+        LEND = J2-PSAVE(IFOUND)+1-KSTART+1
+        IF(JOB.EQ.0)WRITE(IDEV,1020)LEND,LENGTH(IFOUND+1)
+        IF(JOB.EQ.1)WRITE(IDEV,1020)LEND
+1020    FORMAT(' ',64X,I6,1X,I6)
+      END IF
+      END
+      SUBROUTINE PETER(SEQ,IDSEQ,J1,J2,KSTART,
+     +NAMES,IDNAML,NAMEP,NAMLEN,MAXEN,
+     +PSAVE,NSAVE,IDEV,IFOUND,LINLEN,PAA,ITRAN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NAMES(IDNAML),TEMP1*130
+      INTEGER NAMEP(MAXEN),NAMLEN(MAXEN),PSAVE(IFOUND),NSAVE(IFOUND)
+      CHARACTER TRANF3*3,PAA(125)
+      EXTERNAL TRANF3
+      EXTERNAL NOTIRL
+C  KSTART = SEQ ELEMENT IN SEQ(1)
+C  J1 = START OF ACTIVE REGION RELATIVE TO SEQ(1)
+C  PSAVE ARE RELATIVE TO SEQ(1)
+C  WORK RELATIVE TO ARRAY SEQ
+      NLINES = 1 + (J2 - J1)/LINLEN
+      I1 = J1
+      I2 = MIN(J2,I1+LINLEN-1)
+      NEXTE = 1
+      ITMAX = J2 - 2
+      DO 100 KK = 1,NLINES
+        TEMP1 = ' '
+        LP1 = 0
+        DO 50 I = NEXTE,IFOUND
+          KS1=PSAVE(I)
+          IF(KS1.GT.I2)THEN
+            NEXTES = I
+            GO TO 51
+          END IF
+            KR1=NAMEP(NSAVE(I))
+            LP = KS1 - I1 + 1
+C TRAP FOR CUTSITE OUT OF RANGE!!!!!!
+            IF(LP.LT.0)THEN
+              WRITE(IDEV,1004)(NAMES(K),K=KR1,KR1+NAMLEN(NSAVE(I))-1)
+1004          FORMAT(' WARNING:',
+     +        ' Recognition site with cut off end for enzyme ',20A1)
+              WRITE(IDEV,1005)
+1005          FORMAT(//)
+            END IF
+            IF(LP.GT.0)THEN
+            IF(LP.LE.LP1)THEN
+              WRITE(IDEV,1000)TEMP1(1:MAX(1,NOTIRL(TEMP1,130,' ')))
+1000          FORMAT(' ',A)
+              CALL DOTTY(TEMP1)
+            END IF
+            DO 66 ITEMP=1,NAMLEN(NSAVE(I))
+              TEMP1(LP:LP)=NAMES(KR1)
+              KR1=KR1+1
+              LP = LP + 1
+66          CONTINUE
+            LP1 = LP
+            END IF
+            NEXTES = I + 1
+50       CONTINUE
+51       CONTINUE
+         WRITE(IDEV,1000)TEMP1(1:MAX(1,NOTIRL(TEMP1,130,' ')))
+         NEXTE = NEXTES
+         WRITE(IDEV,1001)(SEQ(K),K=I1,I2)
+         WRITE(IDEV,1002)(K,K=I1+9+KSTART-1,I2+KSTART-1,10)
+         IF(ITRAN.NE.1)THEN
+           WRITE(IDEV,1010)(TRANF3(SEQ(K),PAA,ITRAN),
+     +     K=I1,MIN(ITMAX,I2-1),3)
+           WRITE(IDEV,1011)(TRANF3(SEQ(K),PAA,ITRAN),
+     +     K=I1+1,MIN(ITMAX,I2),3)
+           WRITE(IDEV,1012)(TRANF3(SEQ(K),PAA,ITRAN),
+     +     K=I1+2,MIN(ITMAX,I2+1),3)
+         END IF
+         WRITE(IDEV,1003)
+1003     FORMAT(' ')
+1002     FORMAT(' ',10I10)
+1001     FORMAT(' ',100A1)
+1010  FORMAT(1X,40(A))
+1011  FORMAT(2X,40(A))
+1012  FORMAT(3X,40(A))
+         I1 = I2 + 1
+         I2 = MIN((I1+LINLEN-1),J2)
+100    CONTINUE
+       END
+       SUBROUTINE DOTTY(TEMP)
+       CHARACTER TEMP*(*),DOT
+       PARAMETER (DOT='.')
+       DO 10 I = LEN(TEMP),2,-1
+         IF(TEMP(I:I).NE.' ')THEN
+           IF(TEMP(I:I).NE.DOT)THEN
+             IF((TEMP(I-1:I-1).EQ.' ').OR.(TEMP(I-1:I-1).EQ.DOT))THEN
+               TEMP(I:I) = DOT
+             ELSE
+               TEMP(I:I) = ' '
+             END IF
+           END IF
+         END IF
+10     CONTINUE
+       IF(TEMP(1:1).NE.' ')TEMP(1:1) = DOT
+       END
+      SUBROUTINE SEQFIT(SEQNCE,IDIM1,SEQ2,IDIM3I,ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +IDEV,IDEVIN,FILE2,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      INTEGER DIALOG
+      PARAMETER (MAXSTR = 50)
+      CHARACTER HELPF*(*)
+      CHARACTER SEQNCE(IDIM1),STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3I),FILE2*(*)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER ANSTY
+      CALL SHOWFU(KBOUT,'Find percentage matches')
+      IDIM2 = 0
+      IDIM2I = MAXSTR
+      IDIM3 = IDIM3I
+      IS = J1
+      IE = J2
+      ISS = 1
+      ISE = 10
+      CALL GETPAR(18,2,IOK,ANSTY,IPR,
+     +IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      PR = REAL(IPR)
+5     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        CALL SQPFD1(SEQ2,IDIM3I,IDIM3,ANSTY,IDEVIN,FILE2,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+10    CONTINUE
+      CALL SQPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,PR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SQPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOK)
+      IF(IOK.EQ.2)THEN
+        WRITE(KBOUT,1017)ITOTID
+1017    FORMAT(/,' More than',I6,' matches.',
+     +  ' Try changing percentage or region'/)
+        GO TO 10
+      END IF
+      IF(IOK.NE.0) RETURN
+      CALL SQPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,ITOT,ITOTEL,ITOTID,
+     +ITOTP,KSTART,J1,J2,ISS,PR,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      GO TO 10
+      END
+      SUBROUTINE SQPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,PR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ANSC,ANSTY
+      IOK = 1
+      CALL YESNO(ANSC,'Keep picture',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSC.LT.0)RETURN
+      IF(ANSC.EQ.1)CALL CLEARG
+      IF(ANSTY.EQ.1) THEN
+        CALL SQPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,ISS,ISE,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(ANSTY.EQ.0) THEN
+        CALL SQPF3(STRING,MATCH,IDIM2I,IDIM2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL SQPF8(STRING,IDIM2,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+C      CALL SQPF4(KBIN,KBOUT,J1,J2,IS,IE,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      CALL SQPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PR,MINP,IOK)
+C      IF(IOK.NE.0) RETURN
+      RETURN
+      END
+      SUBROUTINE SQPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IOK = 1
+      CALL FILLI(ITOT,ITOTID,0)
+      CALL FILLI(ITOTEL,ITOTID,0)
+      CALL SQFIT6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART)
+      IF(ITOTP.GT.ITOTID)THEN
+        IOK = 2
+        RETURN
+      END IF
+      IF(ITOTP.GT.0)THEN
+        XMIN=J1
+        XMAX=J2
+        YMIN=0.
+        YMAX=IDIM2
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        DO 203 I=1,ITOTP
+          X=ITOTEL(I)
+          Y=ITOT(I)
+          CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+203     CONTINUE
+      END IF
+      CALL VT100M
+      IOK = 0
+      END
+      SUBROUTINE SQPF8(STRING,LENGTH,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STRING(LENGTH)
+      IOK = 1
+      IANSC = 0
+      WRITE(KBOUT,1000)(STRING(K),K=1,LENGTH)
+1000  FORMAT(' STRING=',50A1)
+      CALL YESNO(IANSC,'This sense',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IANSC.LT.0)RETURN
+      IF(IANSC.EQ.1)THEN
+        CALL SQREV(STRING,LENGTH)
+        CALL SQCOM2(STRING,LENGTH)
+        WRITE(KBOUT,1000)(STRING(K),K=1,LENGTH)
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE SEPFIT(SEQNCE,IDIM1,SEQ2,IDIM3I,ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +IDEV,IDEVIN,FILE2,KBIN,KBOUT,MATRIX,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      INTEGER DIALOG
+      PARAMETER (MAXSTR = 50)
+      CHARACTER HELPF*(*)
+      CHARACTER SEQNCE(IDIM1),STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3I),FILE2*(*)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID),MATRIX(IDM,IDM)
+      INTEGER ANSTY,SMIN,SMAX
+      CALL SHOWFU(KBOUT,'Find matches using a score matrix')
+      MINP = 0
+      IDIM2 = 0
+      IDIM2I = MAXSTR
+      IDIM3 = IDIM3I
+      IS = J1
+      IE = J2
+      ISS = 1
+      ISE = 10
+      CALL GETPAR(19,1,IOK,ANSTY,
+     +IPAR2,IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+5     CONTINUE
+      IF(DIALOG.EQ.1) THEN
+        CALL SQPFD1(SEQ2,IDIM3I,IDIM3,ANSTY,IDEVIN,FILE2,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+10    CONTINUE
+      CALL SEPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SEPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      IF(IOK.EQ.2)THEN
+        WRITE(KBOUT,1017)ITOTID
+1017    FORMAT(/,' More than',I6,' matches.',
+     +  ' Try changing score or region'/)
+        GO TO 10
+      END IF
+      IF(IOK.NE.0) RETURN
+      CALL SEPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,ITOT,ITOTEL,ITOTID,
+     +ITOTP,MINP,KSTART,J1,J2,ISS,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      GO TO 10
+      END
+      SUBROUTINE SEPFD2(SEQ2,IDIM3,MATCH,STRING,
+     +J1,J2,KSTART,ISS,ISE,IS,IE,IDIM2I,IDIM2,MAXSTR,ANSTY,MINP,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(MAXSTR)
+      CHARACTER SEQ2(IDIM3)
+      CHARACTER MATCH(MAXSTR)
+      INTEGER ANSC,ANSTY,MATRIX(IDM,IDM),SMIN,SMAX
+      IOK = 1
+      CALL YESNO(ANSC,'Keep picture',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSC.LT.0)RETURN
+      IF(ANSC.EQ.1)CALL CLEARG
+      IF(ANSTY.EQ.1) THEN
+        CALL SQPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,ISS,ISE,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(ANSTY.EQ.0) THEN
+        CALL SQPF3(STRING,MATCH,IDIM2I,IDIM2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL SQPF8(STRING,IDIM2,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+C      CALL SEPF4(KBIN,KBOUT,J1,J2,IS,IE,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      CALL SEPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,MINP,MATRIX,IDM,
+     +SMIN,SMAX,STRING,IOK)
+C      IF(IOK.NE.0) RETURN
+      RETURN
+      END
+      SUBROUTINE SEPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,MINP,MATRIX,IDM,
+     +SMIN,SMAX,STRING,IOK)
+      CHARACTER HELPF*(*)
+      CHARACTER STRING(IDIM2)
+      INTEGER MATRIX(IDM,IDM),SMIN,SMAX,VALUE
+      IOK = 1
+      CALL GTSCR(STRING,IDIM2,MATRIX,IDM,SMIN,SMAX)
+      WRITE(KBOUT,1000)SMIN,SMAX
+1000  FORMAT(' Minimum score=',I6,' Maximum score=',I6)
+      MININ = SMIN
+      MAXIN = SMAX
+      IF(MINP.EQ.0) MINP = SMAX
+      CALL GETINT(MININ,MAXIN,MINP,'Score',VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINP = VALUE
+      IOK = 0
+      END
+      SUBROUTINE SEPF6(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MATRIX,IDM,
+     +SMIN,SMAX,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID),MATRIX(IDM,IDM)
+      INTEGER SMIN,SMAX
+      IOK = 1
+      CALL FILLI(ITOT,ITOTID,0)
+      CALL FILLI(ITOTEL,ITOTID,0)
+      CALL SPFIT(SEQNCE,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,MATRIX,IDM,KSTART)
+      IF(ITOTP.GT.ITOTID)THEN
+        IOK = 2
+        RETURN
+      END IF
+      IF(ITOTP.GT.0)THEN
+        XMIN = J1
+        XMAX = J2
+        YMIN = SMIN
+        YMAX = SMAX
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        DO 203 I=1,ITOTP
+          X=ITOTEL(I)
+          Y=ITOT(I)
+          CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+203     CONTINUE
+      END IF
+      CALL VT100M
+      IOK = 0
+      END
+C      SQFITP
+      SUBROUTINE SQFITP(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(SEQ(IP).EQ.STRING(J))THEN
+            NTOT=NTOT+1
+          ELSE IF (STRING(J).EQ.'-')THEN
+            NTOT=NTOT+1
+          END IF
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINP)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+      SUBROUTINE SIGNAL(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,
+     +SUM,WT,TOT,IDM,MAXLEN,LINE,CHRSET,DIALOG,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),HELPF*(*)
+      CHARACTER LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN),DIALOG
+      REAL WT(IDM,MAXLEN)
+      CALL SHOWFU(KBOUT,'Motif search using weight matrix')
+C      IF(JOB.EQ.2)CALL SHOWFU(KBOUT,'on complementary strand')
+      CALL GETPAR(20,1,IOK,IGON,
+     +IPAR2,IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL SIGNLD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMAX,
+     +IGON,JOB,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SIGNLP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMAX,IGON,MIDDLE)
+      END
+      SUBROUTINE SIGNLD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMAX,
+     +IGON,JOB,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN),DIALOG
+      REAL WT(IDM,MAXLEN)
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      IOK = 1
+      IDO = 1
+      PROMPT(1) = 'Use weight matrix'
+      PROMPT(2) = 'Make weight matrix'
+      PROMPT(3) = 'Rescale weight matrix'
+      CALL RADION('Select operation',PROMPT,3,IDO,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IDO.LT.1) RETURN
+      IF(IDO.GT.1) THEN
+        CALL MKWT(WT,SUM,TOT,CHRSET,IDM,MAXLEN,
+     +  IDEV,IDEVOT,KBIN,KBOUT,LINE,
+     +  FILNAM,IDO,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IOK = 1
+        RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Motif weight matrix file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      LENGTH=MAXLEN
+      CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,YMIN,YMAX,IDEV,
+     +  IOK,IDM,KBOUT,0)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      IF(DIALOG.EQ.1) THEN
+        AMN = -9999.
+        AMX = 9999.
+        CALL GETRL(AMN,AMX,YMIN,'Cutoff score',VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        YMIN = VAL
+      END IF
+      IF(YMIN.LT.0.0)CALL GETW(TOT,SUM,WT,LENGTH,IDM,MAXLEN)
+      IF(YMIN.GE.0.0)CALL GETW2(SUM,WT,LENGTH,IDM,MAXLEN)
+      IF(JOB.EQ.2)CALL COMPWM(WT,LENGTH,MAXLEN,IDM)
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      IOK = 0
+      END
+      SUBROUTINE SIGNLP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMAX,IGON,MIDDLE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      XMAX=J2
+      XMIN=J1
+      IF(IGON.EQ.0)THEN
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      DO 10 I=J1,J2-LENGTH+1
+        SUM1=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          SUM1=SUM1+WT(CTONUM(SEQ(J)),K)
+5       CONTINUE
+        IF(SUM1.GE.YMIN)THEN
+          IF(IGON.EQ.0)THEN
+            X=I+MIDDLE
+            CALL LINE(X,X,YMIN,SUM1,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          IF(IGON.EQ.1)THEN
+            CALL VT100M
+            WRITE(IDEVOT,1010)I+MIDDLE,SUM1,(SEQ(K),K=I,I+LENGTH-1)
+1010        FORMAT(' ',I7,F10.2,' ',120A1)
+          END IF
+        END IF
+10    CONTINUE
+      CALL VT100M
+      RETURN
+      END
+      SUBROUTINE COMPWM(WTR,LENGTH,MAXL,IDM)
+C   AUTHOR: RODGER STADEN
+      REAL WTR(IDM,MAXL)
+C   COMPLEMENT THE WEIGHT MATRIX
+      DO 60 I=1,LENGTH
+       DO 60 J=1,2
+         T = WTR(J,I)
+         WTR(J,I)=WTR(J+2,I)
+         WTR(J+2,I)=T
+60    CONTINUE
+      L = LENGTH/2
+      DO 70 I=1,L
+       L1 = 1 + LENGTH-I
+       DO 70 J=1,IDM
+         T = WTR(J,L1)
+         WTR(J,L1)=WTR(J,I)
+         WTR(J,I)=T
+70    CONTINUE
+      END
+C     AACOMP
+      SUBROUTINE AACOMP(SUM,IDEV,PAA)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4),AAWTS(20),EXPERC(20),APERC(20),ASUM(20)
+      REAL HYDRO(20)
+      CHARACTER PAA(5,5,5),AA(20)
+      SAVE AA,AAWTS,EXPERC,HYDRO
+      DATA AA/'A','C','D','E','F',
+     +'G','H','I','K','L',
+     +'M','N','P','Q','R',
+     +'S','T','V','W','Y'/
+      DATA AAWTS/71.09,103.15,115.10,129.13,147.19,
+     +57.07,137.16,113.17,128.19,113.17,
+     +131.21,114.12,97.13,128.15,156.20,
+     +87.09,101.12,99.15,186.23,163.19/
+C  ARGOS VALUES
+      DATA EXPERC/8.3,1.7,5.3,6.2,3.9,7.2,2.2,5.2,5.7,
+     +9.0,2.4,4.4,5.1,4.0,5.7,6.9,5.8,6.6,1.3,3.2/
+C DAYHOFF VALUES
+C      DATA EXPERC/8.6,2.9,5.5,6.0,3.6,8.4,2.0,4.5,6.6,
+C     17.4,1.7,4.3,5.2,3.9,4.9,7.0,6.1,6.6,1.3,3.4/
+      DATA HYDRO/-0.4,0.17,-1.31,-1.22,1.92,
+     +-0.67,-0.64,1.25,-0.67,1.22,
+     +1.02,-0.92,-0.49,-0.91,-0.59,
+     +-0.55,-0.28,0.91,0.50,1.67/
+C   hydrophobicity values from r. m. sweet and d. eisenberg, j mol. biol
+C   (1983) 171, 479-488
+C
+C   calc amino acid composition
+      CALL FILLR(ASUM,20,0.0)
+      TOTAL=0.
+      ATOT=0.
+      DO 200 I=1,4
+      DO 200 J=1,4
+      DO 200 K=1,4
+      DO 150 L=1,20
+        IF(PAA(K,J,I).EQ.AA(L)) THEN
+          ASUM(L)=ASUM(L)+SUM(I,J,K)
+          ATOT=ATOT+SUM(I,J,K)
+          GO TO 151
+        END IF
+150   CONTINUE
+151   CONTINUE
+200   CONTINUE
+C  set hydrophobicity to zero
+      HYDROP=0.
+      DO 300 I=1,20
+        HYDROP=HYDROP+ASUM(I)*HYDRO(I)
+        IF(ATOT.NE.0.)APERC(I)=100.*((100.*ASUM(I)/ATOT)-EXPERC(I))
+     +  /EXPERC(I)
+C   botch-up to allow use of dayhoff aawts-18.0 (H2O really=18.015)
+        TOTAL=TOTAL+ASUM(I)*(AAWTS(I)-0.015)
+300   CONTINUE
+C   add on one water molecule
+      TOTAL=TOTAL+18.015
+      WRITE(IDEV,1002)(AA(K),K=1,10)
+      WRITE(IDEV,1003)(ASUM(K),K=1,10)
+      WRITE(IDEV,1005)(APERC(K),K=1,10)
+      WRITE(IDEV,1002)(AA(K),K=11,20)
+      WRITE(IDEV,1003)(ASUM(K),K=11,20)
+      WRITE(IDEV,1005)(APERC(K),K=11,20)
+1005  FORMAT(' O-E % ',10F5.0)
+1006  FORMAT(' EXP % ',10F4.0)
+1002  FORMAT(/'       ',10(3X,A1,1X))
+1003  FORMAT('       ',10F5.0)
+      WRITE(IDEV,1004)ATOT,TOTAL,HYDROP
+1004  FORMAT(' Total acids=',F6.0,' Molecular weight=',F10.0,
+     +' Hydrophobicity=',F6.1)
+      END
+C     BCOMPC
+      SUBROUTINE BCOMPC(SUM,IDEV)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4),COMP(4,3),COMPO(4,3),BCOMP(4),BCOMPX(4)
+      CHARACTER BASE(4)
+      SAVE BASE
+      DATA BASE/'T','C','A','G'/
+C   calc base composition
+      TCOMP=0.
+      DO 50 I=1,4
+        BCOMP(I)=0.
+        DO 50 J=1,3
+          COMPO(I,J)=0.
+          COMP(I,J)=0.
+50    CONTINUE
+      DO 55 J=1,4
+      DO 55 K=1,4
+      DO 55 L=1,4
+        TCOMP=TCOMP+SUM(J,K,L)
+        BCOMP(J)=BCOMP(J)+SUM(J,K,L)+SUM(K,J,L)+SUM(K,L,J)
+        COMP(J,1)=COMP(J,1)+SUM(J,K,L)
+        COMP(J,2)=COMP(J,2)+SUM(K,J,L)
+        COMP(J,3)=COMP(J,3)+SUM(K,L,J)
+55    CONTINUE
+      WRITE(IDEV,1013)TCOMP
+1013  FORMAT('  Total codons=',F10.0)
+C   calc % composition
+      DO 60 I=1,4
+      DO 60 J=1,3
+        IF(TCOMP.NE.0.)COMPO(I,J)=COMP(I,J)*100./TCOMP
+        IF(BCOMP(I).NE.0.)COMP(I,J)=COMP(I,J)*100./BCOMP(I)
+60    CONTINUE
+      WRITE(IDEV,1010)BASE
+1010  FORMAT(4(10X,A1))
+      WRITE(IDEV,1011)(J,(COMP(I,J),I=1,4),J=1,3)
+      WRITE(IDEV,1000)
+1000  FORMAT('  ',4('      -----'))
+      WRITE(IDEV,1002)
+1002  FORMAT('  =',4('     100%  '))
+      WRITE(IDEV,1001)(J,(COMPO(I,J),I=1,4),J=1,3)
+1011  FORMAT(3(/2X,I1,(4(F10.2,1X))))
+1001  FORMAT(3(/2X,I1,(4(F10.2,1X)),' = 100%'))
+      DO 65 I=1,4
+        IF(TCOMP.NE.0.)BCOMP(I)=BCOMP(I)*33.3333/TCOMP
+65    CONTINUE
+      WRITE(IDEV,1012)BCOMP
+1012  FORMAT(2X,'%',4(F10.2,1X),' Observed, overall totals')
+1014  FORMAT(2X,'%',4(F10.2,1X),' Expected, even codons per acid')
+      CALL BCOMPE(SUM,BCOMPX)
+      WRITE(IDEV,1014)BCOMPX
+      END
+      SUBROUTINE BCOMPE(FABC,BCOMP)
+C   AUTHOR: RODGER STADEN
+      INTEGER NCODON(21),IACID(4,4,4)
+      REAL FABC(4,4,4),FA(21),BCOMP(4)
+C    array with number of codons per acid
+      SAVE IACID,NCODON
+      DATA IACID/
+     +  1,2,3,5,6,7,8,9,
+     +  10,12,14,16,18,20,6,21,
+     +  1,2,3,5,6,7,8,9,
+     +  10,12,14,16,18,20,6,21,
+     +  2,2,3,5,6,7,8,9,
+     +  11,13,15,17,11,20,20,21,
+     +  2,2,4,5,6,7,8,9,
+     +  11,13,15,17,19,20,20,21/
+      DATA NCODON/
+     +  2,6,3,1,4,6,4,4,4,2,3,2,2,2,2,2,2,2,1,6,4/
+      TOTAL=0.
+      CALL FILLR(BCOMP,4,0.)
+      CALL FILLR(FA,21,0.)
+      DO 20 I=1,4
+      DO 20 J=1,4
+      DO 20 K=1,4
+C   point to this acid
+        JACID=IACID(I,J,K)
+C   sum its codons
+        FA(JACID)=FA(JACID)+FABC(I,J,K)
+        TOTAL=TOTAL+FABC(I,J,K)
+20    CONTINUE
+C   calculate even usage of each acids codons
+C   need to look up the acids number, its composition is divided by
+C   its number of codons
+      DO 21 I=1,4
+      DO 21 J=1,4
+      DO 21 K=1,4
+C   get acid number this codon
+        JACID=IACID(I,J,K)
+        BCOMP(I)=BCOMP(I)+FA(JACID)/(REAL(NCODON(JACID)))
+        BCOMP(J)=BCOMP(J)+FA(JACID)/(REAL(NCODON(JACID)))
+        BCOMP(K)=BCOMP(K)+FA(JACID)/(REAL(NCODON(JACID)))
+21    CONTINUE
+      DO 22 I=1,4
+        IF(TOTAL.NE.0.)BCOMP(I)=BCOMP(I)*33.3333/TOTAL
+22    CONTINUE
+      END
+      SUBROUTINE CALCOD(SUM,SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+C   calculates codon totals for defined region
+      REAL SUM(4,4,4)
+      CHARACTER SEQ(IDIM)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      MIN=IDIM/3
+      J=1
+      DO 100 I=1,MIN
+C   test for bad chars
+        DO 50 L=1,3
+          IP=J-1+L
+          IF(CTONUM(SEQ(IP)).EQ.5)GO TO 59
+50      CONTINUE
+        JP0=J
+        JP1=J+1
+        JP2=J+2
+        JP0=CTONUM(SEQ(JP0))
+        JP1=CTONUM(SEQ(JP1))
+        JP2=CTONUM(SEQ(JP2))
+        SUM(JP0,JP1,JP2)=SUM(JP0,JP1,JP2)+1.
+59      CONTINUE
+        J=J+3
+100   CONTINUE
+      END
+C   codcom
+C   complements a codon table
+      SUBROUTINE CODCOM(SUM)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4),TEMP(4,4,4)
+      INTEGER IP(4)
+      SAVE IP
+      DATA IP/3,4,1,2/
+      DO 10 I=1,4
+      DO 10 J=1,4
+      DO 10 K=1,4
+        IC=IP(I)
+        JC=IP(J)
+        KC=IP(K)
+        TEMP(I,J,K)=SUM(KC,JC,IC)
+10    CONTINUE
+      DO 20 I=1,4
+      DO 20 J=1,4
+      DO 20 K=1,4
+        SUM(I,J,K)=TEMP(I,J,K)
+20    CONTINUE
+      END
+      SUBROUTINE CODTBL(IDEV,FILNAM,S2,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,ANSE,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER ANSE
+      REAL S2(64)
+      IOK = 1
+      IOP = ANSE
+      CALL YESONO(IOP,'Start with empty table',
+     +'Start with an existing table',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      ANSE = IOP
+      IF(ANSE.EQ.1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Name of codon table file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL REDCOD(S2,IDEV)
+        CLOSE(UNIT=IDEV)
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE LSTSEQ(SEQ,IDIM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,J1,J2,IDIMT,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER DIALOG
+C   J1 IS THE FIRST BASE IN THE RAM BUFFER
+C   J2 IS THE LAST BASE IN THE RAM BUFFER
+C   IDIM=J2-J1+1
+C   IDIMT IS THE ACTUAL SEQUENCE LENGTH
+C   WRAP AROUND IS ONLY POSSIBLE IF J1=1, AND J2=IDIMT
+      CALL SHOWFU(KBOUT,'List the sequence')
+      I1 = J1
+      I2 = J2
+      LENGTH = 60
+      NSTRND = 0
+      IF(DIALOG.EQ.1) THEN
+        CALL LSTSQD(IDIM,KBIN,KBOUT,J1,J2,IDIMT,NSTRND,
+     +  IHELPS,IHELPE,HELPF,IDEVH,I1,I2,LENGTH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL FMTSEQ(SEQ,IDIM,I1,I2,NSTRND,LENGTH,IDEV,J1,IDIMT)
+      END
+      SUBROUTINE FMTSEQ(SEQNCE,IDIM,IS1,IE1,NSTRND,LENGTH,IDEV,KSTART,
+     +IDIMT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM),SCOMP
+      INTEGER K(12)
+      EXTERNAL LWRAP,SCOMP
+      IS=IS1
+      IE=IE1
+      LD10=LENGTH/10
+      ITOT=IE-IS+1
+      IF(IE.LT.IS)ITOT=IDIMT+ITOT
+      IDONE=0
+      N=IS-1
+      WRITE(IDEV,1005)
+50    CONTINUE
+      IF(IDONE.GE.ITOT)RETURN
+      DO 20 I=1,LD10
+        N=N+10
+        K(I)=N
+        IF(K(I).GT.IDIMT)K(I)=K(I)-IDIMT
+20    CONTINUE
+C   need to find end of this line
+      IADD=LENGTH
+      IF((IDONE+IADD).GT.ITOT)IADD=ITOT-IDONE
+      IE=IS+IADD-1
+      LD10M=MIN(LD10,(IE-IS+1)/10)
+      IDONE=IDONE+IADD
+1001  FORMAT(' ',(12(5X,I6)))
+      WRITE(IDEV,1002)
+     +(SEQNCE(LWRAP(IDIMT,M)),M=IS-KSTART+1,IE-KSTART+1)
+      IF(NSTRND.EQ.1)WRITE(IDEV,1002)(SCOMP(
+     1(SEQNCE(LWRAP(IDIMT,M)))),M=IS-KSTART+1,IE-KSTART+1)
+      WRITE(IDEV,1001)(K(I),I=1,LD10M)
+1002  FORMAT( '  ',12(10A1,1X))
+      IS=IE+1
+      WRITE(IDEV,1005)
+1005  FORMAT(' ')
+      GO TO 50
+      END
+      SUBROUTINE LSTSQD(IDIM,KBIN,KBOUT,J1,J2,IDIMT,NSTRND,
+     +IHELPS,IHELPE,HELPF,IDEVH,I1,I2,LENGTH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER VALUE
+      IOK = 1
+1     CONTINUE
+      VALUE = I1
+      CALL GETINT(I1,I2,VALUE,
+     +'List from ',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I1 = VALUE
+      VALUE = I2
+      CALL GETINT(1,I2,VALUE,
+     +'List to ',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I2 = VALUE
+C  IS WRAP AROUND ALLOWED?
+      IF(I2.LT.I1)THEN
+        IF((J1.NE.1).OR.(J2.NE.IDIMT))THEN
+          WRITE(KBOUT,1004)
+1004      FORMAT(' A sequence of this length cannot be treated as a',
+     +           ' circle')
+          GO TO 1
+        END IF
+      END IF
+      IOP = NSTRND
+      CALL YESONO(IOP,'Single stranded','Double stranded',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      NSTRND = IOP
+      MININ = 10
+      MAXIN = 120
+      LENGTH = 60
+      CALL GETINT(MININ,MAXIN,LENGTH,
+     +'Line length',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      IF(MOD(VALUE,10).NE.0)VALUE = LENGTH
+      LENGTH = VALUE
+      END
+      SUBROUTINE COMPN(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,SCORES(5),DIALOG
+      CALL SHOWFU(KBOUT,'Plot base composition')
+      SCORES(5) = 0
+      CALL GETPAR(24,10,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +SCORES(1),SCORES(2),SCORES(3),SCORES(4),
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      MAXSP = MIN(MXSPAN,MAXSP,J2-J1+1)
+      IF(DIALOG.EQ.1) THEN
+        CALL COMPD(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +  SCORES,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN      
+      END IF
+      CALL COMPP(SEQ,IDIM1P,MXSPAN,SPAN,IWRIT,SCORES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2)
+      CALL VT100M
+      END
+      SUBROUTINE COMPD(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +SCORES,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+      INTEGER SCORES(5)
+      CALL CHECK4('T','C','A','G',
+     +SCORES(1),SCORES(2),SCORES(3),SCORES(4),
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      END
+      SUBROUTINE COMPP(SEQ,IDIM1P,MXSPAN,SPAN,IWRIT,SCORES,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,SCORES(5),CTONUM
+      EXTERNAL CTONUM
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      SCRMAX=1.
+      I1INM1=1+MXSPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   set ymax
+      YMAX=SPAN*SCRMAX
+      YMIN=0.
+C   do edge
+      CALL START(SEQ(J1P-LB),IDIM1P,SCORES,5,SPAN,SUM)
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      IDONE=0
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P
+        IDONE=IDONE+1
+        SUM=SUM+SCORES(CTONUM(SEQ(I+LF)))
+     +  -SCORES(CTONUM(SEQ(I-LB)))
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE START(SEQ,IDIM,SCORES,IDIMS,LENW,SUM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER SCORES(IDIMS),CTONUM
+      EXTERNAL CTONUM
+      SUM=0.
+      DO 10 I=1,LENW
+        SUM=SUM+SCORES(CTONUM(SEQ(I)))
+10    CONTINUE
+      END
+      SUBROUTINE CCHI(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,WORK,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      PARAMETER (ISIZE = 5)
+      INTEGER SPAN,DIALOG
+      REAL EXPEC(ISIZE),AVER(ISIZE),OBS(ISIZE),WORK(IDIM1P)
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C   TO THE END OF SEQ. PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+C   CHANGED TO ALLOW USER TO DEFINE REGION FOR STANDARD
+C   AND TO DO SCALING ONLY OVER REAL SEQUENCE (NOT EDGE) 23-08-84
+      CALL SHOWFU(KBOUT,
+     +'Plot base composition differences as chi squared')
+      CALL GETPAR(25,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      K1 = KSTART
+      K2 = IENDB
+      IF(DIALOG.EQ.1) THEN
+        CALL CHID(KSTART,IENDB,MINSP,MAXSP,SPAN,
+     +  MINIW,MAXIW,IWRIT,K1,K2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL CCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      END
+      SUBROUTINE CCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,CTONUM
+      REAL EXPEC(ISIZE),AVER(ISIZE),OBS(ISIZE),WORK(IDIM1P)
+      EXTERNAL CTONUM,CIE
+      DIF = K2 - K1 + 1
+      IF(DIF.LT.1.) RETURN
+      CALL FILLR(AVER,ISIZE,0.)
+      I1INM1= 1 + (MXSPAN/2)-KSTART+1
+      CALL CHI1(SEQ(I1INM1+1),IDIM1P,K1,K2,AVER)
+      DO 3 I=1,ISIZE
+        AVER(I)=AVER(I) / DIF
+3     CONTINUE
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+C   calc expected scores
+      DO 29 I=1,ISIZE
+        EXPEC(I)=AVER(I)*SPAN
+29    CONTINUE
+      CALL BUSY(KBOUT)
+C   do edge
+      CALL FILLR(OBS,ISIZE,0.)
+      CALL CHI1(SEQ,IDIM1P,J1P,J1P+SPAN-1,OBS)
+C   do rest
+      SUMMAX=-99999.
+      SUMMIN=999999.
+      J = 1
+      WORK(J) = CIE(OBS,EXPEC)
+      DO 200 I=J1P+LB,J2P-LF
+        ISF=CTONUM(SEQ(I+LF))
+        ISB=CTONUM(SEQ(I-LB))
+        OBS(ISF)=OBS(ISF)+1.
+        OBS(ISB)=OBS(ISB)-1.
+        SUM = CIE(OBS,EXPEC)
+        SUMMAX = MAX(SUMMAX,SUM)
+        SUMMIN = MIN(SUMMIN,SUM)
+        J = J + 1
+        WORK(J) = SUM
+200   CONTINUE
+      CALL CLEARV
+      CALL VECTOM
+      IDIMW = J2 - J1 + 1 - SPAN
+      XMIN = -LB
+      XMAX = IDIMW + LB
+      YMAX = SUMMAX
+      YMIN = SUMMIN
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL PLOTR(WORK,IDIMW,XMAX,XMIN,YMAX,YMIN,IWRIT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      WRITE(KBOUT,4444)SUMMAX,SUMMIN
+4444  FORMAT(' Observed range=',2F12.6)
+      END
+      REAL FUNCTION CIE(OBS,EXPEC)
+      REAL OBS(4),EXPEC(4)
+      SUM = 0.
+      DO 10 K = 1,4
+        TE = EXPEC(K)
+        IF(TE.EQ.0.0) GO TO 10
+        TO = OBS(K)
+        D = TO - TE
+        SUM = SUM + D*D/TE
+10    CONTINUE
+      CIE = SUM
+      END
+      SUBROUTINE PLOTR(POINTS,IDIMP,XMAX,XMIN,YMAX,YMIN,IWRIT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      REAL POINTS(IDIMP)
+      XF = XMIN
+      YF = POINTS(1)
+      DO 100 I=1,IDIMP,IWRIT
+        XT = I
+        YT = POINTS(I)
+        CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        XF = XT
+        YF = YT
+100   CONTINUE
+      END
+      SUBROUTINE CHI1(SEQ,IDIM1P,J1P,J2P,OBS)
+      REAL OBS(5)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=J1P,J2P
+        IS = CTONUM(SEQ(I))
+        OBS(IS) = OBS(IS) + 1.
+10    CONTINUE
+      END
+      SUBROUTINE DICHI(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,WORK,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      PARAMETER (ISIZE = 25)
+      INTEGER SPAN,DIALOG
+      REAL EXPEC1(ISIZE),AVER1(ISIZE),OBS1(ISIZE)
+      REAL EXPEC(5,5),AVER(5,5),OBS(5,5),WORK(IDIM1P)
+      EQUIVALENCE (EXPEC1,EXPEC),(AVER1,AVER),(OBS1,OBS)
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C   TO THE END OF SEQ. PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+      CALL SHOWFU(KBOUT,
+     +'Plot dinucleotide composition differences as chi squared')
+C   CHANGED TO ALLOW USER TO DEFINE REGION FOR STANDARD
+C   AND TO DO SCALING ONLY OVER REAL SEQUENCE (NOT EDGE) 23-08-84
+      CALL GETPAR(26,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      K1 = KSTART
+      K2 = IENDB
+      IF(DIALOG.EQ.1) THEN
+        CALL CHID(KSTART,IENDB,MINSP,MAXSP,SPAN,
+     +  MINIW,MAXIW,IWRIT,K1,K2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL DCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +EXPEC1,OBS1,AVER1,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      END
+      SUBROUTINE DCHI1(SEQ,IDIM1P,J1P,J2P,OBS)
+      REAL OBS(5,5)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=J1P,J2P
+        IS = CTONUM(SEQ(I))
+        IS1 = CTONUM(SEQ(I+1))
+        OBS(IS,IS1) = OBS(IS,IS1) + 1.
+10    CONTINUE
+      END
+      SUBROUTINE DCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +EXPEC1,OBS1,AVER1,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,CTONUM
+      REAL EXPEC1(ISIZE),AVER1(ISIZE),OBS1(ISIZE)
+      REAL EXPEC(5,5),AVER(5,5),OBS(5,5),WORK(IDIM1P)
+      EXTERNAL CTONUM,CED
+      DIF = K2 - K1
+      IF(DIF.LT.2.) RETURN
+      CALL BUSY(KBOUT)
+      CALL FILLR(AVER1,ISIZE,0.)
+      I1INM1=1+(MXSPAN/2)-KSTART+1
+      CALL DCHI1(SEQ(I1INM1+1),IDIM1P,K1,K2-1,AVER)
+      DO 3 I=1,5
+      DO 3 J=1,5
+        AVER(I,J)=AVER(I,J) / DIF
+3     CONTINUE
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+C   calc expected scores
+      DO 29 I=1,5
+      DO 29 J=1,5
+        EXPEC(I,J)=AVER(I,J)*SPAN
+29    CONTINUE
+C   set ymax
+C   do edge
+      CALL FILLR(OBS1,ISIZE,0.)
+      CALL DCHI1(SEQ,IDIM1P,J1P,J1P+SPAN-1,OBS)
+      J = 1
+      WORK(J) = CED(OBS,EXPEC)
+C   do rest
+      SUMMAX=-999999.
+      SUMMIN=9999999.
+      DO 200 I=J1P+LB,J2P-1-LF
+        ISF=CTONUM(SEQ(I+LF))
+        ISF1=CTONUM(SEQ(I+LF+1))
+        ISB=CTONUM(SEQ(I-LB))
+        ISB1=CTONUM(SEQ(I-LB+1))
+        OBS(ISF,ISF1)=OBS(ISF,ISF1)+1.
+        OBS(ISB,ISB1)=OBS(ISB,ISB1)-1.
+        SUM = CED(OBS,EXPEC)
+        J = J + 1
+        WORK(J) = SUM
+        SUMMAX=MAX(SUMMAX,SUM)
+        SUMMIN=MIN(SUMMIN,SUM)
+200   CONTINUE
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      IDIMW = J2 - J1 - SPAN
+      XMIN = -LB
+      XMAX = IDIMW + LB
+      YMAX = SUMMAX
+      YMIN = SUMMIN
+      CALL PLOTR(WORK,IDIMW,XMAX,XMIN,YMAX,YMIN,IWRIT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      WRITE(KBOUT,4444)SUMMAX,SUMMIN
+4444  FORMAT(' Observed range=',2F12.6)
+      RETURN
+      END
+      REAL FUNCTION CED(OBS,EXPEC)
+      REAL OBS(5,5),EXPEC(5,5)
+      SUM = 0.
+      DO 10 K=1,4
+      DO 10 J=1,4
+        TE = EXPEC(K,J)
+        IF(TE.EQ.0.) GO TO 10
+        TO = OBS(K,J)
+        D = TO - TE
+        SUM = SUM + D*D/TE
+10    CONTINUE
+      CED = SUM
+      END
+      SUBROUTINE TRICHI(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,WORK,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      PARAMETER (ISIZE = 125)
+      INTEGER SPAN,DIALOG
+      REAL EXPEC1(ISIZE),AVER1(ISIZE),OBS1(ISIZE)
+      REAL EXPEC(5,5,5),AVER(5,5,5),OBS(5,5,5),WORK(IDIM1P)
+      EQUIVALENCE (EXPEC1,EXPEC),(AVER1,AVER),(OBS1,OBS)
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C   TO THE END OF SEQ. PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+      CALL SHOWFU(KBOUT,
+     +'Plot trinucleotide composition differences as chi squared')
+C   CHANGED TO ALLOW USER TO DEFINE REGION FOR STANDARD
+C   AND TO DO SCALING ONLY OVER REAL SEQUENCE (NOT EDGE) 23-08-84
+      CALL GETPAR(27,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      K1 = KSTART
+      K2 = IENDB
+      IF(DIALOG.EQ.1) THEN
+        CALL CHID(KSTART,IENDB,MINSP,MAXSP,SPAN,
+     +  MINIW,MAXIW,IWRIT,K1,K2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL TCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +EXPEC1,OBS1,AVER1,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      END
+      SUBROUTINE TCHIP1(SEQ,IDIM1P,MXSPAN,EXPEC,OBS,AVER,ISIZE,
+     +EXPEC1,OBS1,AVER1,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,CTONUM
+      REAL EXPEC1(ISIZE),AVER1(ISIZE),OBS1(ISIZE)
+      REAL EXPEC(5,5,5),AVER(5,5,5),OBS(5,5,5),WORK(IDIM1P)
+      EXTERNAL CTONUM,CET
+      DIF = K2 - K1 - 2
+      IF(DIF.LT.3.) RETURN
+      CALL BUSY(KBOUT)
+      CALL FILLR(AVER1,ISIZE,0.)
+      I1INM1 = 1 + (MXSPAN/2) - KSTART + 1
+      CALL TCHI1(SEQ(I1INM1+1),IDIM1P,K1,K2-2,AVER)
+      DO 3 I=1,5
+      DO 3 J=1,5
+      DO 3 L=1,5
+        AVER(I,J,L)=AVER(I,J,L) / DIF
+3     CONTINUE
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+C   calc expected scores
+      DO 29 I=1,5
+      DO 29 J=1,5
+      DO 29 L=1,5
+        EXPEC(I,J,L)=AVER(I,J,L)*SPAN
+29    CONTINUE
+C   do edge
+      CALL FILLR(OBS1,ISIZE,0.)
+      CALL TCHI1(SEQ,IDIM1P,J1P,J1P+SPAN-1,OBS)
+      J = 1
+      WORK(J) = CET(OBS,EXPEC)
+C   do rest
+      SUMMAX=-99999.
+      SUMMIN=999999.
+      DO 200 I=J1P+LB,J2P-2-LF
+        IS=CTONUM(SEQ(I+LF))
+        IS1=CTONUM(SEQ(I+LF+1))
+        IS2=CTONUM(SEQ(I+LF+2))
+        IB=CTONUM(SEQ(I-LB))
+        IB1=CTONUM(SEQ(I-LB+1))
+        IB2=CTONUM(SEQ(I-LB+2))
+        OBS(IS,IS1,IS2)=OBS(IS,IS1,IS2)+1.
+        OBS(IB,IB1,IB2)=OBS(IB,IB1,IB2)-1.
+        SUM = CET(OBS,EXPEC)
+        J = J + 1
+        WORK(J) = SUM
+        SUMMAX=MAX(SUMMAX,SUM)
+        SUMMIN=MIN(SUMMIN,SUM)
+200   CONTINUE
+      CALL FILLR(OBS1,ISIZE,0.)
+      CALL TCHI1(SEQ,IDIM1P,J1P,J1P+SPAN-1,OBS)
+      J = 1
+      WORK(J) = CET(OBS,EXPEC)
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      IDIMW = J2 - J1 - 1 - SPAN
+      XMIN = -LB
+      XMAX = IDIMW + LB
+      YMAX = SUMMAX
+      YMIN = SUMMIN
+      CALL PLOTR(WORK,IDIMW,XMAX,XMIN,YMAX,YMIN,IWRIT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      WRITE(KBOUT,4444)SUMMAX,SUMMIN
+4444  FORMAT(' Observed range=',2F12.6)
+      RETURN
+      END
+      REAL FUNCTION CET(OBS,EXPEC)
+      REAL OBS(5,5,5),EXPEC(5,5,5)
+      SUM=0.
+      DO 10 K=1,4
+      DO 10 J=1,4
+      DO 10 L=1,4
+        TEXP=EXPEC(K,J,L)
+        IF(TEXP.EQ.0.)GO TO 10
+        TOBS=OBS(K,J,L)
+        OME=TOBS-TEXP
+        SUM=SUM+(OME*OME)/TEXP
+10    CONTINUE
+      CET = SUM
+      END
+      SUBROUTINE TCHI1(SEQ,IDIM1P,J1P,J2P,OBS)
+      CHARACTER SEQ(IDIM1P)
+      REAL OBS(5,5,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=J1P,J2P
+        IS=CTONUM(SEQ(I))
+        IS1=CTONUM(SEQ(I+1))
+        IS2=CTONUM(SEQ(I+2))
+        OBS(IS,IS1,IS2)=OBS(IS,IS1,IS2)+1.
+10    CONTINUE
+      END
+      SUBROUTINE NEGP1(SEQ,IDIM1P,MXSPAN,OBS,AVER,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,OBS(5),CTONUM
+      REAL AVER(5),WORK(IDIM1P)
+      REAL NEI
+      EXTERNAL CTONUM,NEI
+      I1INM1=1+(MXSPAN/2)-KSTART+1
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      DIF = K2 - K1 + 1
+      IF(DIF.LT.1.) RETURN
+      CALL FILLR(AVER,5,0.)
+      CALL NEG1(SEQ(I1INM1+1),IDIM1P,K1,K2,AVER)
+      DIF = DIF * SPAN
+      DO 3 I=1,5
+        AVER(I)=AVER(I) / DIF
+3     CONTINUE
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+      CALL BUSY(KBOUT)
+C   calc expected scores
+C   do edge
+      CALL FILLI(OBS,5,0)
+      DO 166 I=J1P,J1P+SPAN-1
+        ISF=CTONUM(SEQ(I+LF))
+        ISB=CTONUM(SEQ(I-LB))
+        OBS(ISF)=OBS(ISF)+1
+        OBS(ISB)=OBS(ISB)-1
+166   CONTINUE
+      SUMMAX=-99999.
+      SUMMIN=999999.
+      WIND=SPAN
+      J = 1
+      WORK(J) = NEI(OBS,AVER,WIND)
+      DO 200 I=J1P+LB,J2P-LF
+        ISF=CTONUM(SEQ(I+LF))
+        ISB=CTONUM(SEQ(I-LB))
+        OBS(ISF)=OBS(ISF)+1
+        OBS(ISB)=OBS(ISB)-1
+        SUM = NEI(OBS,AVER,WIND)
+        J = J + 1
+        WORK(J) = SUM
+        SUMMAX=MAX(SUMMAX,SUM)
+        SUMMIN=MIN(SUMMIN,SUM)
+200   CONTINUE
+C   set ymax
+      CALL CLEARV
+      CALL VECTOM
+      IDIMW = J2 - J1 + 1 - SPAN
+      XMIN = -LB
+      XMAX = IDIMW + LB
+      YMAX = SUMMAX
+      YMIN = SUMMIN
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL PLOTR(WORK,IDIMW,XMAX,XMIN,YMAX,YMIN,IWRIT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      WRITE(KBOUT,4444)SUMMAX,SUMMIN
+4444  FORMAT(' Observed range=',2F12.6)
+      END
+      REAL FUNCTION NEI(OBS,AVER,WIND)
+      REAL AVER(5)
+      INTEGER OBS(5)
+      SUM=0.
+      DO 10 K=1,4
+        TEMP=REAL(OBS(K))*AVER(K)
+        IF(TEMP.GT.0.) TEMP1 = TEMP*LOG(TEMP)
+        SUM=SUM-TEMP1
+10    CONTINUE
+      SUM=SUM/WIND
+      NEI = SUM
+      END
+      SUBROUTINE NEG1(SEQ,IDIM1P,K1,K2,AVER)
+      CHARACTER SEQ(IDIM1P)
+      REAL AVER(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 2 I=K1,K2
+        IS=CTONUM(SEQ(I))
+        AVER(IS)=AVER(IS)+1.
+2     CONTINUE
+      END
+      SUBROUTINE NEGENT(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,WORK,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      PARAMETER (ISIZE = 5)
+      INTEGER SPAN,DIALOG
+      REAL AVER(ISIZE),WORK(IDIM1P)
+      INTEGER OBS(ISIZE)
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C   TO THE END OF SEQ. PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+C   CHANGED TO ALLOW USER TO DEFINE REGION FOR STANDARD
+C   AND TO DO SCALING ONLY OVER REAL SEQUENCE (NOT EDGE) 23-08-84
+      CALL SHOWFU(KBOUT,'Plot negentropy')
+      CALL GETPAR(59,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      K1 = KSTART
+      K2 = IENDB
+      IF(DIALOG.EQ.1) THEN
+        CALL CHID(KSTART,IENDB,MINSP,MAXSP,SPAN,
+     +  MINIW,MAXIW,IWRIT,K1,K2,
+     +  KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL NEGP1(SEQ,IDIM1P,MXSPAN,OBS,AVER,
+     +SPAN,IWRIT,K1,K2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBOUT,WORK)
+      END
+      SUBROUTINE CHID(KSTART,IENDB,MINSP,MAXSP,SPAN,
+     +MINIW,MAXIW,IWRIT,K1,K2,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER SPAN
+      IOK = 1
+      CALL GTREG(KBIN,KBOUT,KSTART,IENDB,K1,K2,
+     +'Define region for standard',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      RETURN
+      END
+      SUBROUTINE ZDNA(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,SCORE(5,5),CTONUM,DIALOG
+      REAL OBS(5,5)
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'Plot z dna potential')
+      CALL GETPAR(33,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      I1INM1=1+MXSPAN/2
+      DO 1 I=1,5
+      DO 1 J=1,5
+        SCORE(I,J)=0
+1     CONTINUE
+C  at
+      SCORE(3,1)=1
+      SCORE(1,3)=1
+C  cg
+      SCORE(2,4)=1
+      SCORE(4,2)=1
+C  gt
+      SCORE(4,1)=1
+      SCORE(1,4)=1
+C  ac
+      SCORE(3,2)=1
+      SCORE(2,3)=1
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+C   calc expected scores
+      YMAX=SPAN
+      YMIN=0.
+C   do edge
+      DO 65 I=1,5
+      DO 65 J=1,5
+        OBS(I,J)=0.
+65    CONTINUE
+      DO 66 I=J1P-LB,J1P-LB+SPAN-1
+        IS=CTONUM(SEQ(I))
+        IS1=CTONUM(SEQ(I+1))
+        OBS(IS,IS1)=OBS(IS,IS1)+1.
+66    CONTINUE
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P-1
+        ILF=CTONUM(SEQ(I+LF))
+        ILF1=CTONUM(SEQ(I+LF+1))
+        ILB=CTONUM(SEQ(I-LB))
+        ILB1=CTONUM(SEQ(I-LB+1))
+        OBS(ILF,ILF1)=OBS(ILF,ILF1)+1.
+        OBS(ILB,ILB1)=OBS(ILB,ILB1)-1.
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          SUM=0.
+          DO 99 K=1,4
+          DO 99 J=1,4
+            SUM=SUM+SCORE(K,J)*OBS(K,J)
+99        CONTINUE
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE ZDNARD(SEQ,IDIM1P,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,RVAL(5),YVAL(5),CTONUM,DIALOG
+      EXTERNAL CTONUM
+      SAVE RVAL,YVAL
+      DATA RVAL/0,0,1,1,0/,YVAL/1,1,0,0,0/
+      CALL SHOWFU(KBOUT,'Plot z dna potential')
+      CALL GETPAR(35,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      I1INM1=1+MXSPAN/2
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+C   length forward and back
+      LF=SPAN/2
+      LB=1+SPAN/2
+C   calc expected scores
+      YMAX=SPAN+1
+      YMIN=LF+1
+C   do edge
+      RSUM1=0.
+      YSUM1=0.
+      RSUM2=0.
+      YSUM2=0.
+      DO 66 I=J1P-LB,J1P-LB+SPAN-1,2
+        IS=CTONUM(SEQ(I))
+        IS1=CTONUM(SEQ(I+1))
+        RSUM1=RSUM1+RVAL(IS)
+        YSUM1=YSUM1+YVAL(IS1)
+        RSUM2=RSUM2+RVAL(IS1)
+        YSUM2=YSUM2+YVAL(IS)
+66    CONTINUE
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      XF=J1
+      YF=SUM
+      DO 100 I=J1P,J2P-1,2
+        ILB=CTONUM(SEQ(I-LB))
+        ILB1=CTONUM(SEQ(I-LB+1))
+        ILF1=CTONUM(SEQ(I+LF+1))
+        ILF2=CTONUM(SEQ(I+LF+2))
+        RSUM1=RSUM1-RVAL(ILB)+RVAL(ILF1)
+        YSUM1=YSUM1-YVAL(ILB1)+YVAL(ILF2)
+        RSUM2=RSUM2-RVAL(ILB1)+RVAL(ILF2)
+        YSUM2=YSUM2-YVAL(ILB)+YVAL(ILF1)
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          SUM=MAX((RSUM1+YSUM1),(RSUM2+YSUM2))
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE ZDNARN(SEQ,IDIM1,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      INTEGER SPAN,CTONUM,DIALOG
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'Plot z dna potential')
+      CALL GETPAR(34,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      XMIN=J1
+      XMAX=J2
+      YMAX=SPAN
+      YMIN=0.
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   do rest
+      XF=J1
+      YF=0.
+      I=J1-1
+100   CONTINUE
+C
+      I=I+1
+      IF(I.GT.J2)GO TO 500
+      IS=CTONUM(SEQ(I))
+      IF(IS.EQ.5)GO TO 100
+      SUM=0.
+      IF(IS.LT.3)GO TO 300
+C   must be a or g
+200   CONTINUE
+C   want c or t
+      I=I+1
+      IF(I.GT.J2)GO TO 500
+      IS=CTONUM(SEQ(I))
+      IF(IS.EQ.5)GO TO 100
+      IF(IS.GT.2)GO TO 100
+C  have c or t
+      SUM=SUM+1.
+      IF(MOD(I,IWRIT).EQ.0)THEN
+        XT=I
+        CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        XF=XT
+        YF=SUM
+      END IF
+C   now want a or g
+300   CONTINUE
+      I=I+1
+      IF(I.GT.J2)GO TO 500
+      IS=CTONUM(SEQ(I))
+      IF(IS.EQ.5)GO TO 100
+      IF(IS.LT.3)GO TO 100
+C  have a or g
+      SUM=SUM+1.
+      IF(MOD(I,IWRIT).EQ.0)THEN
+        XT=I
+        CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        XF=XT
+        YF=SUM
+      END IF
+C   want c or t
+      GO TO 200
+500   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE GETRC1(KBIN,KBOUT,J1,J2,IDIMT,I1,I2,IDIM1,
+     +P,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),P*(*)
+      INTEGER VALUE
+C  VERSION FOR LIST REGION
+      IOK = 1
+      WRITE(KBOUT,1000)P
+1000  FORMAT(' ',A)
+1     CONTINUE
+      VALUE = J1
+      I1 = J1
+      I2 = J2
+      CALL GETINT(I1,I2,VALUE,
+     +'From',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I1 = VALUE
+      VALUE = I2
+      K1 = MIN(1,J1)
+      CALL GETINT(K1,J2,VALUE,
+     +'To',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I2 = VALUE
+C  IS WRAP AROUND ALLOWED?
+      IF(I2.LT.I1)THEN
+        IF((J1.NE.1).OR.(J2.NE.IDIMT))THEN
+          WRITE(KBOUT,1001)
+1001      FORMAT(' a sequence of this length cannot be treated as a',
+     +    ' circle')
+          GO TO 1
+        END IF
+        I2 = I2 + IDIM1
+      END IF
+      END
+      SUBROUTINE GETRC2(KBIN,KBOUT,J1,J2,IDIMT,I1,I2,
+     +P,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),P*(*)
+      INTEGER VALUE
+C  VERSION FOR TRANSLATE REGION
+      IOK = 1
+      WRITE(KBOUT,1000)P
+1000  FORMAT(' ',A)
+1     CONTINUE
+      VALUE = 0
+      I1 = 0
+      I2 = J2
+      CALL GETINT(I1,I2,VALUE,
+     +'From',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I1 = VALUE
+      IF(I1.EQ.0) THEN
+        IOK = 0
+        RETURN
+      END IF
+      VALUE = J2
+      I2 = J2
+      CALL GETINT(1,I2,VALUE,
+     +'To',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      I2 = VALUE
+C  IS WRAP AROUND ALLOWED?
+      IF(I2.LT.I1)THEN
+        IF((J1.NE.1).OR.(J2.NE.IDIMT))THEN
+          WRITE(KBOUT,1001)
+1001      FORMAT(' A sequence of this length cannot be treated as a',
+     +    ' circle')
+          GO TO 1
+        END IF
+      END IF
+      END
+      SUBROUTINE FMTRAN(SEQ,IDIM1,IS,IE,OUTP,IDMP,LENGTH,IDEV,
+     + KSTART,IDIMT,CODE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),OUTP(IDMP,3),BLANK
+      INTEGER IO1(12),CODE
+      CHARACTER IT,OUTLIN(132),THREEL*3,THREE*3
+      EXTERNAL LWRAP,THREEL
+      SAVE BLANK
+      DATA BLANK/' '/
+      LD10 = LENGTH/10
+      ITOT = IE - IS + 1
+      IF(IE.LE.IS)ITOT = IDIMT - ITOT
+      IDONE = 0
+C   SET POINTER TO ELEMENT NUMBERS (TREATING ARRAY AS LINEAR) TO
+C   START POINTER
+      IBEG = IS - KSTART + 1
+      I1 = IS - 1
+      WRITE(IDEV,1000)
+50    CONTINUE
+      IF(IDONE.GE.ITOT)RETURN
+C   NEED TO FIND END OF THIS LINE AND  ADD TO COUNT
+      IADD = LENGTH
+      IF((IDONE + IADD).GT.ITOT)IADD = ITOT - IDONE
+      IEND = IBEG + IADD - 1
+      IDONE = IDONE + IADD
+      LD10M = MIN(LD10,(IEND - IBEG + 1)/10)
+      DO 200 J = 1,3
+        IBEGP = IBEG + J - 1
+        IENDP = IEND + J - 1
+C   NUMBER OF CODONS THIS LINE?
+        NP = (IENDP - IBEGP + 1)/3
+C   TEST FOR NO CODONS
+        IF(NP.GT.0)THEN
+C   THIS LINE STARTS AT IBEG AND ENDS AT IEND (TREATING ARRAY AS
+C   LINEAR) NEED TO KNOW IF THERE ARE ANY AA'S TO OUTPUT.
+C   TRANSLATE POINTERS AND TEST THEM
+C   FILL OUTP WITH BLANKS
+          CALL FILLC(OUTLIN,132,' ')
+          IEMPT = 0
+          ITMPCS = IBEGP
+          L = J - 2
+          DO 100 I = 1,NP
+            IOUT = LWRAP(IDIM1,ITMPCS)/3 + 1
+            ITMPCS = ITMPCS + 3
+            IT = OUTP(IOUT,J)
+            IF(IT.NE.BLANK)IEMPT = 1
+            L = L + 3
+            IF(CODE.EQ.3)THEN
+              THREE = THREEL(IT)
+              OUTLIN(L - 1) = THREE(1:1)
+              OUTLIN(L) = THREE(2:2)
+              OUTLIN(L + 1) = THREE(3:3)
+            ELSE
+              OUTLIN(L) = IT
+            END IF
+100       CONTINUE
+          IF(IEMPT.GT.0)
+     +     WRITE(IDEV,1002)(OUTLIN(K),K = 1,LENGTH + 6)
+        END IF
+200   CONTINUE
+      WRITE(IDEV,1002)(SEQ(LWRAP(IDIM1,K)),K = IBEG,IEND)
+1002  FORMAT(' ',132A1)
+      DO 300 K = 1,LD10
+        I1 = I1 + 10
+        IO1(K) = I1
+        IF(IO1(K).GT.IDIMT)IO1(K) = IO1(K) - IDIMT
+300   CONTINUE
+      WRITE(IDEV,1001)(IO1(K),K = 1,LD10M)
+1001  FORMAT( ' ',12(4X,I6))
+      IBEG = IEND + 1
+      WRITE(IDEV,1000)
+1000  FORMAT(/)
+      GO TO 50
+      END
+      SUBROUTINE TRAND4(MINO,MAXO,MINOPN,JSTRAN,FILNAM,IDEV,IOK,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*),FILNAM*(*)
+      IOK = 1
+      CALL GETINT(MINO,MAXO,MINOPN,
+     +'Minimum open frame in amino acids',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINOPN = IVAL
+      IVAL = JSTRAN + 1
+      CALL GSTRND(IVAL,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IVAL.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      JSTRAN = IVAL - 1
+30    CONTINUE
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File name for translation',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      END
+      SUBROUTINE TRAND5(MINO,MAXO,MINOPN,JSTRAN,IOK,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      IOK = 1
+      CALL GETINT(MINO,MAXO,MINOPN,
+     +'Minimum open frame in amino acids',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINOPN = IVAL
+      IVAL = JSTRAN + 1
+      CALL GSTRND(IVAL,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IVAL.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      JSTRAN = IVAL - 1
+      END
+      SUBROUTINE TRNPIR(SEQ,IDIM,I1,I2,KSTART,KBIN,KBOUT,IDEV,PAA,
+     +OUTP,IDIMP,FILNAM,MINO,MAXO,MINOPN,JSTRAN,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIMP),PAA(125),OUTP(IDIMP)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CALL TRAND4(MINO,MAXO,MINOPN,JSTRAN,FILNAM,IDEV,IOK,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      MINOPD = 3 * MINOPN
+      IF(JSTRAN.EQ.0) THEN
+        CALL TRNDOP(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      ELSE IF (JSTRAN.EQ.1) THEN
+        CALL TRNDOM(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      ELSE IF (JSTRAN.EQ.2) THEN
+        CALL TRNDOP(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+        CALL TRNDOM(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      END IF
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE TRNDOP(SEQ,IDDNA,J1,J2,
+     +ACIDS,IDACID,PAA,MINOPN,IDEV)
+      CHARACTER SEQ(IDDNA),ACIDS(IDACID),PAA(5,5,5)
+      CHARACTER LINE*60
+      INTEGER STEP(3),FRAME,PSTOP,WPAIR
+      EXTERNAL PSTOP,MINIL,WPAIR,ITOSL
+C Find open reading frames in their order of occurrence
+C For each of 3 frames remember next position to try
+      STEP(3) = J1 + 2
+      STEP (1) = J1
+      STEP(2) = J1 + 1
+ 10   CONTINUE
+      FRAME = MINIL(STEP,3)
+      I = STEP(FRAME)
+      IF(I.LT.J2) THEN
+        J = PSTOP(SEQ,IDDNA,I,J2,ACIDS,IDACID,PAA,IACID)
+        STEP(FRAME) = J + 4
+        IF((J-I+1).GE.MINOPN) THEN
+          LINE(1:) = '>'
+          IF (ITOSL(LINE(2:),I).EQ.0) WRITE(*,*)'Scream: ITOSL'
+          IF(WPAIR(LINE(22:),I,J).NE.0) WRITE(*,*)'Scream: WPAIR'
+            WRITE(IDEV,1001)LINE
+            IACID = IACID + 1
+            ACIDS(IACID) = '*'
+            CALL FMTDKN(IDEV,ACIDS,IACID)
+        END IF
+        GO TO 10
+      END IF
+ 1001 FORMAT(A)
+      END
+      SUBROUTINE TRNDOM(SEQ,IDDNA,J1,J2,ACIDS,IDACID,PAA,MINOPN,IDEV)
+      CHARACTER SEQ(IDDNA),ACIDS(IDACID),PAA(5,5,5),LINE*60
+      INTEGER STEP(3),FRAME,MSTOP,WPAIR
+      EXTERNAL MSTOP,MINIL,WPAIR,ITOSL
+C 17-7-91 replced line to write out title! which had disapeared
+C Find open reading frames in their order of occurrence
+C For each of 3 frames remember next position to try
+      STEP(3) = J1 + 2
+      STEP (1) = J1
+      STEP(2) = J1 + 1
+ 10   CONTINUE
+      FRAME = MINIL(STEP,3)
+      I = STEP(FRAME)
+      IF(I.LT.J2) THEN
+        J = MSTOP(SEQ,IDDNA,I,J2,ACIDS,IDACID,PAA,IACID)
+        STEP(FRAME) = J + 4
+        IF((J-I+1).GE.MINOPN) THEN
+          LINE(1:) = '>                    complement('
+          IF (ITOSL(LINE(2:),I).EQ.0) WRITE(*,*)'Scream: ITOSL'
+          IF(WPAIR(LINE(33:),I,J).NE.0) WRITE(*,*)'Scream: WPAIR'
+          LINE(32+INDEX(LINE(33:),' '):) = ')'
+          WRITE(IDEV,1001)LINE
+          CALL SQREV(ACIDS,IACID)
+          IACID = IACID + 1
+          ACIDS(IACID) = '*'
+          CALL FMTDKN(IDEV,ACIDS,IACID)
+        END IF
+        GO TO 10
+      END IF
+ 1001 FORMAT(A)
+      END
+      SUBROUTINE OPENFR(SEQ,IDIM,I1,I2,KSTART,IDEV,KBIN,KBOUT,PAA,
+     +OUTP,IDIMP,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIMP),PAA(125),OUTP(IDIMP)
+      CHARACTER HELPF*(*)
+      CALL SHOWFU(KBOUT,'Find open reading frames')
+      CALL GETPAR(54,4,IOK,MINO,MAXO,MINOPN,JSTRAN,
+     +IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL TRAND5(MINO,MAXO,MINOPN,JSTRAN,IOK,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      MINOPD = 3* MINOPN
+      IF(JSTRAN.EQ.0) THEN
+        CALL TRNDP(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      ELSE IF (JSTRAN.EQ.1) THEN
+        CALL TRNDM(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      ELSE IF (JSTRAN.EQ.2) THEN
+        CALL TRNDP(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+        CALL TRNDM(SEQ,IDIM,I1,I2,OUTP,IDIMP,PAA,MINOPD,IDEV)
+      END IF
+      END
+      SUBROUTINE TRNDP(SEQ,IDDNA,J1,J2,
+     +ACIDS,IDACID,PAA,MINOPN,IDEV)
+      CHARACTER SEQ(IDDNA),ACIDS(IDACID),PAA(5,5,5)
+      CHARACTER LINE*60
+      INTEGER STEP(3),FRAME,PSTOP,WPAIR
+      EXTERNAL PSTOP,MINIL,WPAIR
+C Find open reading frames in their order of occurrence
+C For each of 3 frames remember next position to try
+      STEP(3) = J1 + 2
+      STEP (1) = J1
+      STEP(2) = J1 + 1
+ 10   CONTINUE
+      FRAME = MINIL(STEP,3)
+      I = STEP(FRAME)
+      IF(I.LT.J2) THEN
+        J = PSTOP(SEQ,IDDNA,I,J2,ACIDS,IDACID,PAA,IACID)
+        STEP(FRAME) = J + 4
+        IF((J-I+1).GE.MINOPN) THEN
+          LINE(1:) = 'FT   CDS'
+          IF(WPAIR(LINE(22:),I,J).NE.0) WRITE(*,*)'Scream: WPAIR'
+            WRITE(IDEV,1001)LINE
+        END IF
+        GO TO 10
+      END IF
+ 1001 FORMAT(A)
+      END
+      SUBROUTINE TRNDM(SEQ,IDDNA,J1,J2,ACIDS,IDACID,PAA,MINOPN,IDEV)
+      CHARACTER SEQ(IDDNA),ACIDS(IDACID),PAA(5,5,5),LINE*60
+      INTEGER STEP(3),FRAME,MSTOP,WPAIR
+      EXTERNAL MSTOP,MINIL,WPAIR
+C Find open reading frames in their order of occurrence
+C For each of 3 frames remember next position to try
+      STEP(3) = J1 + 2
+      STEP (1) = J1
+      STEP(2) = J1 + 1
+ 10   CONTINUE
+      FRAME = MINIL(STEP,3)
+      I = STEP(FRAME)
+      IF(I.LT.J2) THEN
+        J = MSTOP(SEQ,IDDNA,I,J2,ACIDS,IDACID,PAA,IACID)
+        STEP(FRAME) = J + 4
+        IF((J-I+1).GE.MINOPN) THEN
+          LINE(1:) = 'FT   CDS             complement('
+          IF(WPAIR(LINE(33:),I,J).NE.0) WRITE(*,*)'Scream: WPAIR'
+            LINE(32+INDEX(LINE(33:),' '):) = ')'
+            WRITE(IDEV,1001)LINE
+        END IF
+        GO TO 10
+      END IF
+ 1001 FORMAT(A)
+      END
+      INTEGER FUNCTION PSTOP(SEQ,IDDNA,IDNA1,IDNA2,
+     +ACIDS,IDACID,PAA,IACID)
+      CHARACTER PAA(5,5,5),SEQ(IDDNA),ACIDS(IDACID),TRANF,ACID
+      EXTERNAL TRANF
+C Returns position of next stop codon or IDNA2 if end reached
+C Also the number of acids excluding any stop codon
+C Also the acids
+      IACID = 0
+      DO 10 IDNA=IDNA1,IDNA2-2,3
+        ACID = TRANF(SEQ(IDNA),PAA)
+        IF(ACID.EQ.'*') THEN
+          PSTOP = IDNA - 1
+          RETURN
+        ELSE
+          IACID = IACID + 1
+          ACIDS(IACID) = ACID
+        END IF
+ 10     CONTINUE
+      PSTOP = IDNA2
+      END
+      INTEGER FUNCTION MSTOP(SEQ,IDDNA,IDNA1,IDNA2,
+     +ACIDS,IDACID,PAA,IACID)
+      CHARACTER PAA(5,5,5),SEQ(IDDNA),ACIDS(IDACID),TRANB,ACID
+      EXTERNAL TRANB
+C Returns position of next stop codon or IDNA2 if end reached
+C Also the number of acids excluding any stop codon
+C Also the acids
+      IACID = 0
+      DO 10 IDNA=IDNA1,IDNA2-2,3
+        ACID = TRANB(SEQ(IDNA),PAA)
+        IF(ACID.EQ.'*') THEN
+          MSTOP = IDNA - 1
+          RETURN
+        ELSE
+          IACID = IACID + 1
+          ACIDS(IACID) = ACID
+        END IF
+ 10     CONTINUE
+      MSTOP = IDNA2
+      END
+      INTEGER FUNCTION MINIL(LIST,NLIST)
+      INTEGER LIST(NLIST)
+C sent list, return element number of first element whose
+C value equals the minimum value in the list.
+      MX = LIST(1)
+      DO 10 I=2,NLIST
+        MX = MIN(MX,LIST(I))
+ 10     CONTINUE
+      DO 20 I=1,NLIST
+        IF(MX.EQ.LIST(I)) THEN
+          MINIL = I
+          RETURN
+        END IF
+ 20     CONTINUE
+C Should never get here!
+      WRITE(*,*)'Scream: MINIL'
+      MINIL = 1
+      END
+      INTEGER FUNCTION WPAIR(LINE,I,J)
+      CHARACTER LINE*(*)
+      EXTERNAL ITOSL
+C puts a pair of integers into string line, as for a feature table
+      WPAIR = 1
+      IF(ITOSL(LINE,I).EQ.0) RETURN
+      K = INDEX(LINE,' ')
+      LINE(K:) = '..'
+      IF(ITOSL(LINE(K+2:),J).EQ.0) RETURN
+      WPAIR = 0
+      END
+      SUBROUTINE TRPIRP(SEQ,IDIM,I1,I2,KSTART,MINOP,IDEV,PAA,
+     +JSTRAN,OUTP,FRAMEC,IDIMP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIMP),PAA(125),OUTP(IDIMP)
+      INTEGER B,FRAME,FRAMEC(IDIM)
+      EXTERNAL IPDNA
+      MINOPN = MINOP + 1
+      DO 200 FRAME = 1,3
+        B = I1 - KSTART + FRAME
+        LAA = IDIMP
+        CALL TRANSD(SEQ,IDIM,
+     +  B,I2-KSTART+1,OUTP,LAA,PAA,JSTRAN)
+        LAA = (I2 - B + 1) / 3
+      B = 1
+      I = 0
+20    CONTINUE
+      I = I + 1      
+      IF(I.LE.LAA) THEN
+        IF(OUTP(I).EQ.'*') THEN
+          LENGTH = I - B + 1
+          IF(LENGTH.GE.MINOPN) THEN
+            IF(JSTRAN.EQ.0) THEN
+               IP1 = IPDNA(B,FRAME,JSTRAN) + I1 - KSTART
+               IP2 = IPDNA(I,FRAME,JSTRAN) + I1 - KSTART
+               IP2 = MIN(IP2,IDIM)
+C              WRITE(IDEV,1002)IP1,IP2,SENSE(1),FRAME,IP2-IP1+1
+              CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,2)
+            ELSE IF(JSTRAN.EQ.1) THEN
+               IP1 = IPDNA(B,FRAME,JSTRAN) + I1 - KSTART
+               IF(B.NE.1) IP1 = IP1 - 3
+               IP2 = IPDNA(I,FRAME,JSTRAN) + I1 - KSTART - 3
+               IP2 = MIN(IP2,IDIM)
+C              WRITE(IDEV,1002)IP2,IP1,SENSE(2),FRAME,IP2-IP1+1
+              CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,3)
+            END IF
+          END IF
+          B = I + 1
+        END IF
+        GO TO 20
+      END IF
+      LENGTH = I - B
+      IF(LENGTH.GE.MINOPN) THEN
+        IF(JSTRAN.EQ.0) THEN
+           IP1 = IPDNA(B,FRAME,JSTRAN) + I1 - KSTART
+           IP2 = IPDNA(I,FRAME,JSTRAN) + I1 - KSTART
+           IP2 = MIN(IP2,IDIM)
+C          WRITE(IDEV,1002)IP1,IP2,SENSE(1),FRAME,IP2-IP1+1
+          CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,2)
+          LENGTH = LENGTH + 1
+        ELSE IF(JSTRAN.EQ.1) THEN
+           IP1 = IPDNA(B,FRAME,JSTRAN) + I1 - KSTART
+           IF(B.NE.1) IP1 = IP1 - 3
+           IP2 = IPDNA(I,FRAME,JSTRAN) + I1 - KSTART
+           IP2 = MIN(IP2,IDIM)
+C          WRITE(IDEV,1002)IP2,IP1,SENSE(2),FRAME,IP2-IP1+1
+          CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,3)
+        END IF
+      END IF
+200   CONTINUE
+C 1002 FORMAT(' FT ',I7,I7,'  ',A,I4,I7)
+      END
+      SUBROUTINE TRAN6X(SEQ,IDIM,JDEV,I1,I2,PAA,ITRAN,JSTRAN,
+     +INUM,LINLEN,FC)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),TRANF4*3,TRANB4*3,SCOMP,PAA(125)
+      CHARACTER LN*133,TICKS*120
+      INTEGER FC(IDIM)
+      EXTERNAL TRANF4,TRANB4,SCOMP,NOTRL
+      SAVE TICKS
+      DATA TICKS(1:60)/
+     +'    .    :    .    :    .    :    .    :    .    :    .    :'/
+      DATA TICKS(61:120)/
+     +'    .    :    .    :    .    :    .    :    .    :    .    :'/
+      K1=I1
+      K2=I1+LINLEN-1
+      K2 = MIN(K2,I2)
+      LN = ' '
+20    CONTINUE
+      WRITE(JDEV,1004)
+      WRITE(LN,1000)(TRANF4(SEQ(K),PAA,ITRAN,FC(K)),K=K1,K2-1,3)
+      IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)LN(1:NOTRL(LN,133,' '))
+      WRITE(LN,1001)(TRANF4(SEQ(K),PAA,ITRAN,FC(K)),K=K1+1,K2,3)
+      IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)LN(1:NOTRL(LN,133,' '))
+      IF(K1.NE.1) THEN
+        WRITE(LN,1002)(TRANF4(SEQ(K),PAA,ITRAN,FC(K)),
+     +  K=K1-1,K2-2,3)
+        IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)
+     +  LN(1:NOTRL(LN,133,' '))
+      ELSE
+        WRITE(LN,1008)(TRANF4(SEQ(K),PAA,ITRAN,FC(K)),K=3,K2-2,3)
+        IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)
+     +  LN(1:NOTRL(LN,133,' '))
+      END IF
+      IF(INUM.NE.0) THEN
+        WRITE(JDEV,1003)(SEQ(K),K=K1,K2)
+        WRITE(JDEV,1005)(N,N=K1+9,K2,10)
+      ELSE
+        WRITE(LN,1003)(SEQ(K),K=K1,K2)
+        WRITE(LN(NOTRL(LN,133,' ')+2:),1007)K2
+        WRITE(JDEV,1006)LN(1:NOTRL(LN,133,' '))
+        IF(JSTRAN.NE.0) WRITE(JDEV,1009)TICKS(1:K2-K1+1)
+      END IF
+      IF(JSTRAN.NE.0)WRITE(JDEV,1003)(SCOMP(SEQ(K)),K=K1,K2)
+      WRITE(LN,1000)(TRANB4(SEQ(K),PAA,ITRAN,FC(K)),K=K1,K2-1,3)
+      IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)LN(1:NOTRL(LN,133,' '))
+      WRITE(LN,1001)(TRANB4(SEQ(K),PAA,ITRAN,FC(K)),K=K1+1,K2,3)
+      IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)LN(1:NOTRL(LN,133,' '))
+      IF(K1.NE.1) THEN
+        WRITE(LN,1002)(TRANB4(SEQ(K),PAA,ITRAN,FC(K)),K=K1-1,K2-2,3)
+        IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)
+     +  LN(1:NOTRL(LN,133,' '))
+      ELSE
+        WRITE(LN,1008)(TRANB4(SEQ(K),PAA,ITRAN,FC(K)),K=3,K2-2,3)
+        IF(NOTRL(LN,133,' ').NE.0)WRITE(JDEV,1006)
+     +  LN(1:NOTRL(LN,133,' '))
+      END IF
+      IF(K2.EQ.I2) RETURN
+      K1=K2+1
+      K2=K2+LINLEN
+      K2 = MIN(K2,I2)
+      GO TO 20
+1000  FORMAT(4X,40(A))
+1001  FORMAT(5X,40(A))
+1002  FORMAT(3X,40(A))
+1003  FORMAT(4X,120A1)
+1004  FORMAT( )
+1005  FORMAT(4X,12I10)
+ 1006 FORMAT(A)
+ 1007 FORMAT(I7)
+ 1008 FORMAT(6X,40(A))
+ 1009 FORMAT(4X,A)
+      END
+      INTEGER FUNCTION IPDNA(IACID,FRAME,ISTRAN)
+      INTEGER FRAME
+C     returns a dna position for an amino acid in a translation
+C     frames on opposing strands: 123123123
+C                                 123123123
+      IPDNA = FRAME + (IACID-1)*3
+      END
+      CHARACTER*3 FUNCTION TRANF4(CODON,PAA,CODE,FRAMEC)
+C   AUTHOR RODGER STADEN
+      CHARACTER CODON(3),PAA(5,5,5),THREEL*3,TRANF,THREE*3
+      INTEGER CODE,FRAMEC
+      EXTERNAL THREEL,TRANF
+      THREE(1:3)='   '
+      IF(MOD(FRAMEC,2).EQ.0) THEN
+        IF(CODE.EQ.3)THEN
+          THREE(1:3)=THREEL(TRANF(CODON,PAA))
+        ELSE
+          THREE(2:2)=TRANF(CODON,PAA)
+        END IF
+      END IF
+      TRANF4 = THREE
+      END
+C   TRANB3
+      CHARACTER*3 FUNCTION TRANB4(CODON,PAA,CODE,FRAMEC)
+C   AUTHOR RODGER STADEN
+      CHARACTER CODON(3),PAA(5,5,5),THREEL*3,TRANB,THREE*3
+      INTEGER CODE,FRAMEC
+      EXTERNAL THREEL,TRANB
+      THREE='   '
+      IF(MOD(FRAMEC,3).EQ.0) THEN
+        IF(CODE.EQ.3)THEN
+          THREE(1:3)=THREEL(TRANB(CODON,PAA))
+        ELSE
+          THREE(2:2)=TRANB(CODON,PAA)
+        END IF
+      END IF
+      TRANB4 = THREE(1:3)
+      END
+      SUBROUTINE HAIRPN(SEQ,IDIM,ITOT,ITOTEL,ITOTAL,IDTOT,
+     +MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER ITOT(IDTOT),ITOTEL(IDTOT),ITOTAL(IDTOT)
+      CALL SHOWFU(KBOUT,'Search for hairpin loops')
+      CALL GETPAR(30,10,IOK,MINLPI,MAXLPI,MINLP,
+     +MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,IGON,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      CALL HAIRPD(J1,J2,KBIN,KBOUT,
+     +MINLPI,MAXLPI,MINLP,MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,
+     +IGON,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL HAIRPP(SEQ,IDIM,ITOT,ITOTEL,ITOTAL,IDTOT,
+     +MINLP,MAXLP,MININ,IGON,
+     +MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,IDEV,KBOUT)
+      END
+      SUBROUTINE HAIRPP(SEQ,IDIM,ITOT,ITOTEL,ITOTAL,IDTOT,
+     +MINLP,MAXLP,MININ,IGON,
+     +MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,IS,IN,KSTART,IDEV,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER SCORES(30)
+      CHARACTER SEQ(IDIM)
+      INTEGER ITOT(IDTOT),ITOTEL(IDTOT),ITOTAL(IDTOT)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,LWRAP2
+      CALL BUSY(KBOUT)
+C   set scores as at=1,gc=1,gt=1
+C   could make gt selectable
+      DO 1 I=1,30
+1       SCORES(I)=0
+      SCORES(16)=1
+      SCORES(21)=1
+      SCORES(22)=1
+      SCORES(8)=1
+      SCORES(9)=1
+      SCORES(14)=1
+      IDO=IN-IS+1
+      I=IS-1
+      IP=0
+      DO 301 K=1,IDO
+        I=I+1
+        DO 300 J=MINLP,MAXLP
+          N=0
+100     CONTINUE
+        IMN=I-N
+        IPNPJ=I+N+J
+        IT=CTONUM(SEQ(LWRAP2(IN,IMN)))+
+     +  5*CTONUM(SEQ(LWRAP2(IN,IPNPJ)))
+        IT=SCORES(IT)
+        IF(IT.NE.0)THEN
+          N=N+IT
+          GO TO 100
+        END IF
+C   IS THIS LOOP WORTH KEEPING?
+        IF(N.GE.MININ)THEN
+          IP=IP+1
+          IF(IP.GT.IDTOT)THEN
+            WRITE(KBOUT,1234)IP,I
+1234        FORMAT(' Maximum of',I5,
+     +      ' loops found up to',I6,' No more searching')
+            GO TO 302
+          END IF
+          ITOT(IP)=N
+          ITOTEL(IP)=LWRAP2(IN,I)
+          ITOTAL(IP)=J-1
+        END IF
+300     CONTINUE
+301   CONTINUE
+302   CONTINUE
+      IF(IP.GT.0)THEN
+        NOUT=0
+        DO 700 J=1,IP
+          JP1=J+1
+          ID1 = ITOTEL(J) + ITOTAL(J)/2
+          DO 690 I=JP1,IP
+            ID2 = ITOTEL(I) + ITOTAL(I)/2
+            IF(ID1.NE.ID2)GO TO 690
+            LP2=ITOTAL(I)+2*ITOT(I)
+            IF(ITOTAL(J).GT.LP2)GO TO 690
+            ITOT(J)=0
+            ITOTAL(J)=0
+            ITOTEL(J)=0
+            NOUT=NOUT+1
+            GO TO 700
+690       CONTINUE
+700     CONTINUE
+        CALL BUB3AS(ITOTEL,ITOT,ITOTAL,IP)
+        IF(IGON.EQ.0)THEN
+          XMAX=IN
+          XMIN=IS
+C   PLOT ON SCALE 0 TO 4*MININ IN Y
+          YMIN=0.
+          YMAX=MININ*4
+          YF=0.
+          CALL CLEARV
+          CALL VECTOM
+          CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          DO 800 I=NOUT+1,IP
+            XF=ITOTEL(I)+ITOTAL(I)/2
+            XT=XF
+            YT=ITOT(I)
+            CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+800       CONTINUE
+          CALL VT100M
+        ELSE IF(IGON.EQ.1) THEN
+          DO 900 I=NOUT+1,IP
+            CALL DRAWPN(
+     +      SEQ,IDIM,ITOT(I),ITOTAL(I),ITOTEL(I),
+     +      IDEV,IN,KSTART)
+900       CONTINUE
+        END IF
+      END IF
+      WRITE(KBOUT,1005)IP-NOUT
+1005  FORMAT(' Total loops found=',I6)
+      END
+      SUBROUTINE HAIRPD(IS,IN,KBIN,KBOUT,
+     +MINLPI,MAXLPI,MINLP,MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,
+     +IGON,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Define the range of loop sizes')
+      CALL GETINT(MINLPI,MAXLPI,MINLP,'Minimum loop size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINLP = IVAL
+      MINLPX = MAX(MINLPX,MINLP)
+      MAXLP = MINLPX
+      CALL GETINT(MINLPX,MAXLPX,MAXLP,'Maximum loop size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXLP = IVAL
+      MINLP = MINLP + 1
+      MAXLP = MAXLP + 1
+      CALL GETINT(MINBP,MAXBP,MININ,'Minimum number of basepairs',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MININ = IVAL
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+C      IF(IGON.LT.0)RETURN
+      END
+C
+C   subroutine to draw hairpin loops on device idev
+C   hairpins are defined by:
+C   1) nstem the number of basepairs in the stem
+C   2) nloop the number of bases in the loop
+C   3) ip the position of the rightmost base in the left side of the stem
+C
+C   draw the stems very simply, just standing up with a few bases
+C   before and after
+C   maximum size is 20 up and 14 across
+      SUBROUTINE DRAWPN(SEQ,IDIM,NSTEM,NLOOP,IP,IDEV,IN,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),ARRAY(14,20),SCORE(5,5),SPACE
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE SPACE
+      DATA SPACE/' '/
+C   CHECK FOR WRAP AROUND
+      IF((IP-NSTEM).LT.0)RETURN
+      IF((IP+NLOOP+NSTEM+1).GT.IN)RETURN
+      DO 1 I=1,5
+      DO 1 J=1,5
+        SCORE(I,J)=' '
+1     CONTINUE
+      SCORE(1,3)='-'
+      SCORE(3,1)='-'
+      SCORE(2,4)='-'
+      SCORE(4,2)='-'
+      SCORE(1,4)='.'
+      SCORE(4,1)='.'
+      DO 2 I=1,14
+      DO 2 J=1,20
+        ARRAY(I,J)=SPACE
+2     CONTINUE
+C   point to left stem start
+      IPL=IP-NSTEM
+C   point to right stem start
+      IPR=IP+NSTEM+NLOOP+1
+C   point to temporary array
+      IAL=6
+      IAR=8
+      KA=1
+C   fill in stem and loop
+      DO 10 I=1,MIN(NSTEM+NLOOP/2,19)
+        IPL=IPL+1
+        IPR=IPR-1
+        KA=KA+1
+        ARRAY(IAL,KA)=SEQ(IPL)
+        ARRAY(IAR,KA)=SEQ(IPR)
+C   basepaired?
+        ARRAY(IAL+1,KA)=
+     +  SCORE(CTONUM(SEQ(IPL)),CTONUM(SEQ(IPR)))
+10    CONTINUE
+C   odd number of bases in loop?
+      IF(MOD(NLOOP,2).NE.0)THEN
+C   put in extra base
+        IF((KA+1).LE.20)ARRAY((IAL+1),KA+1)=SEQ(IPL+1)
+      END IF
+C20    CONTINUE
+C   now do 6 bases before the stem and 7 after
+      IPL=IP-NSTEM-6
+      DO 30 I=1,6
+        IPL=IPL+1
+        IF(IPL.GT.0)ARRAY(I,1)=SEQ(IPL)
+30    CONTINUE
+      IPR=IP+NSTEM+NLOOP
+      DO 40 I=8,14
+        IPR=IPR+1
+        IF(IPR.LT.IN)ARRAY(I,1)=SEQ(IPR)
+40    CONTINUE
+      DO 50 I=1,20
+        K=21-I
+        WRITE(IDEV,1000)(ARRAY(L,K),L=1,14)
+50    CONTINUE
+1000  FORMAT(' ',5X,14A1)
+1001  FORMAT(' ',5X,I6)
+      WRITE(IDEV,1001)IP-NSTEM+KSTART-1
+      RETURN
+      END
+      SUBROUTINE WORDFP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+C   ROUTINE TO COMPARE OBSERVED WITH EXPECTED 'WORD' FREQUENCIES
+C   WORDS MAY BE UP TO 10 CHARACTERS IN LENGTH ALTHOUGH, AS THE
+C   EXPECTED FREQUENCIES ARE CALCULATED SIMPLY BY MULTIPLYING
+C   THE BASE FREQUENCIES TOGETHER, THE LONGER THE WORD THE MORE
+C   INAPPROPRIATE THE MODEL. USERS ARE SIMPLY PROMPTED FOR THE WORD
+C   TO SEARCH FOR - E.G CG - AND A WINDOW LENGTH TO COUNT OVER. 
+C   THE PROGRAM PLOTS (OBSERVED - EXPECTED)
+C   IF IT IS TOO SLOW, FASTER ALGORITHMS ARE POSSIBLE.
+      CHARACTER SEQ(IDIM1P),WORD(10),HELPF*(*)
+      INTEGER SPAN
+      CALL SHOWFU(KBOUT,'Plot composition differences (obs-exp))')
+      WORD(1) = 'C'
+      WORD(2) = 'G'
+      LENGTH = 2
+      CALL GETPAR(64,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL WORDFD(WORD,LENGTH,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +YMIN,YMAX,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL WRDFP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +YMIN,YMAX,SPAN,IWRIT,WORD,LENGTH)
+      END
+      SUBROUTINE WORDFD(WORD,LENGTH,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +YMIN,YMAX,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER WORD(10),WORDA(10),HELPF*(*)
+      INTEGER SPAN
+      CALL SQPF3(WORD,WORDA,10,LENGTH,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      YMAX1 = REAL(SPAN)/(4**LENGTH)
+      YMIN1 = -1.0*YMAX1
+      YMIN2 = 4.*YMIN1
+      YMAX2 = 4.*YMAX1
+      CALL GETRL(YMIN1,YMAX2,YMAX1,
+     +'Maximum plot value',VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      YMAX = VALUE
+      CALL GETRL(YMIN2,YMAX,YMIN1,
+     +'Minimum plot value',VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      YMIN = VALUE
+      IOK = 0
+      END
+      SUBROUTINE WRDFP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +YMIN,YMAX,SPAN,IWRIT,WORD,LENGTH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P),WORD(10)
+      INTEGER SPAN,CTONUM,IP(10),OBS(5)
+      EXTERNAL CTONUM
+      CALL CLEARV
+      CALL FILLI(OBS,5,0)
+      DO 15 I = 1,LENGTH
+        IP(I) = CTONUM(WORD(I))
+15    CONTINUE
+      I1INM1=1+(MXSPAN/2)-KSTART+1
+      INC = 1
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+      LF=SPAN/2
+      LB=1+SPAN/2
+      DO 66 I=J1P,J1P+SPAN-1
+        IS=CTONUM(SEQ(I))
+        OBS(IS)=OBS(IS)+1
+66    CONTINUE
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      XF = J1
+      XT = J2
+      YF = YMIN + (YMAX-YMIN)/2.
+      YT = YF
+      CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      MARG = NINT(0.05*MARGT)
+      MARGB1 = MARGB+MARG
+      MARGT1 = MARGT - 2*MARG
+      XF=J1
+      YF=0.
+      DO 100 I=J1P+LB,J2P-(LENGTH-1)
+        ISF=CTONUM(SEQ(I+LF))
+        ISB=CTONUM(SEQ(I-LB))
+        OBS(ISF)=OBS(ISF)+1
+        OBS(ISB)=OBS(ISB)-1
+        IF(MOD(I,IWRIT).EQ.0)THEN
+          EXPEC = 1.0
+          DO 80 J=1,LENGTH
+            EXPEC = EXPEC * REAL(OBS(IP(J)))/SPAN
+80        CONTINUE
+          EXPEC = EXPEC * SPAN
+          NFOUND = 0
+          IAT=I-LB
+          IEND = I+LF
+90        CONTINUE
+            IAT=IAT+INC
+            IDIM = IEND - IAT + 1
+            IF(IDIM.GT.0)THEN
+            CALL FIND6(SEQ(IAT),IDIM,WORD,LENGTH,INC,JMATCH)
+            IF(JMATCH.NE.0)THEN
+              IAT=IAT+JMATCH-1
+              NFOUND = NFOUND + 1
+              GO TO 90
+            END IF
+          END IF
+          SUM = NFOUND - EXPEC
+          XT=I-I1INM1
+          CALL LINE(XF,XT,YF,SUM,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          XF=XT
+          YF=SUM
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE BPHASE(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,DIALOG
+      CALL SHOWFU(KBOUT,'Uneven positional base frequencies method')
+      CALL GETPAR(44,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      MAXSP = MIN(MAXSP,IDIM1/3)
+      SPAN = MIN(MAXSP,SPAN)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL BPHASP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,SPAN,IWRIT)
+      END
+      SUBROUTINE BPHASP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,LENW,IWRIT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      REAL POS(5,3),COMP(5),POS1(15)
+      EQUIVALENCE (POS,POS1)
+      I1IN=2+MXSPAN/2
+      I1INM1=I1IN-1
+      YMIN=0.4
+      YMAX=1.34
+      IDIMJ=J2-J1+1
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      IDONE=0
+      XMIN=J1
+      XMAX=J2
+      CALL CLEARV
+      CALL VECTOM
+      XP=J1
+      YP=0.
+      X=J2
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+C   SET UP MARGINS
+      MARGB=MARGB1+NINT(0.006*ISYMAX)
+      MARGT=MARGT1-NINT(0.012*ISYMAX)
+      CALL LINE(XP,X,.78,.78,XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,0.78,'76%',3,0,XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL BPHAS1(SEQ(J1P-LB),IDIM1,POS,COMP,LENW3,POS1)
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        TESTC=ROTPHS(SEQ,IDIM1P,IP,LB,LF,POS,COMP)
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          X=IP-I1INM1
+          CALL LINE(XP,X,YP,TESTC,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX)
+          YP=TESTC
+          XP=X
+        END IF
+300   CONTINUE
+      CALL VT100M
+      END
+      REAL FUNCTION ROTPHS(SEQ,IDIM,IAT,LB,LF,POS,BCOMP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL POS(5,3),BCOMP(5),DIFF(5,3),DIFF1(15)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      EQUIVALENCE (DIFF,DIFF1)
+C   zero variables
+      CALL FILLR(DIFF1,15,0.)
+C   LOOK BACK TO SUBTRACT A CODON
+      IPB=IAT-LB
+C   sum for each base for each codon position
+      K=0
+      DO 10 J=IPB,IPB+2
+        K=K+1
+        JJ=CTONUM(SEQ(J))
+        BCOMP(JJ)=BCOMP(JJ)-1.
+        POS(JJ,K)=POS(JJ,K)-1.
+10    CONTINUE
+C   look forwards and add a value
+      IPF=IAT+LF
+      K=0
+      DO 20 J=IPF,IPF+2
+        K=K+1
+        JJ=CTONUM(SEQ(J))
+        BCOMP(JJ)=BCOMP(JJ)+1.
+        POS(JJ,K)=POS(JJ,K)+1.
+20    CONTINUE
+      DO 30 I=1,4
+        TEMP=BCOMP(I)/3.
+        DO 30 J=1,3
+          DIFF(I,J)=ABS(POS(I,J)-TEMP)
+30    CONTINUE
+      R=0.
+      DO 60 I=1,4
+      DO 60 J=1,3
+        IF(BCOMP(I).NE.0.)R=R+DIFF(I,J)/BCOMP(I)
+60    CONTINUE
+      ROTPHS=R
+      END
+      SUBROUTINE BPHAS1(SEQ,IDIM,POS,COMP,LWIND3,POS1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL POS(5,3),COMP(5),POS1(15)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      CALL FILLR(POS1,15,0.)
+      CALL FILLR(COMP,5,0.)
+      DO 10 I=1,LWIND3-1,3
+        K=0
+        I1=I
+        I2=I1+2
+        DO 9 J=I1,I2
+          K=K+1
+          JJ=CTONUM(SEQ(J))
+          COMP(JJ)=COMP(JJ)+1.
+          POS(JJ,K)=POS(JJ,K)+1.
+9       CONTINUE
+10    CONTINUE
+      END
+      SUBROUTINE FICKET(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,DIALOG
+      CALL SHOWFU(KBOUT,'Ficketts method to find protein genes')
+      CALL GETPAR(48,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL FICKTP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,SPAN,IWRIT)
+      END
+      SUBROUTINE FICKTP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,MARGL1,MARGR1,MARGB1,MARGT1,J1,J2,LENW,IWRIT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P)
+      REAL POS(5,3),COMP(5),POS1(15)
+      EQUIVALENCE (POS,POS1)
+      EXTERNAL ROTFCK
+      I1IN=2+MXSPAN/2
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      YMIN=0.
+      YMAX=1.
+      XMIN=J1
+      XMAX=J2
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+C   calc fraction of window length for composition
+      FRAC=1./LENW3
+      IDONE=0
+      CALL CLEARV
+      CALL VECTOM
+C   set initial values
+      XP=J1
+      YP=0.
+      X=J2
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL LINE(XP,X,.4,.4,XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL LINE(XP,X,.77,.77,XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FCKSTR(SEQ(J1P-LB),IDIM1,POS,COMP,LENW3,POS1)
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        TESTC=ROTFCK(SEQ,IDIM1P,IP,LB,LF,POS,COMP,FRAC)
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          X=IP-I1INM1
+          CALL LINE(XP,X,YP,TESTC,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          YP=TESTC
+          XP=X
+        END IF
+300   CONTINUE
+      CALL VT100M
+      END
+      REAL FUNCTION ROTFCK(SEQ,IDIM,IAT,LB,LF,POS,BCOMP,FRAC)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL POS(5,3),BASMAX(4),BASMIN(4),BPOS(4),BCOMP(5),BCOMPT(5)
+      REAL WCOMP(4),WPOS(4),PCODP(10,4),PCODC(10,4),PCODCR(10)
+      REAL PCODPR(10)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE WCOMP,WPOS,PCODCR,PCODPR,PCODP,PCODC
+      DATA WCOMP/.14,.12,.11,.15/
+      DATA WPOS/.33,.18,.26,.31/
+      DATA PCODCR/.17,.19,.21,.23,.25,.27,.29,.31,.33,.99/
+      DATA PCODPR/1.1,1.2,1.3,1.4,1.5,1.6,1.7,1.8,1.9,99./
+      DATA PCODC/.58,.51,.69,.56,.75,.55,.40,.39,.24,.28,
+     1.31,.39,.44,.43,.59,.59,.64,.51,.64,.82,.21,.81,.65,.67,
+     2.49,.62,.55,.44,.49,.28,.29,.33,.41,.41,.73,.64,.64,.47,
+     3.54,.40/
+      DATA PCODP/.09,.09,.20,.54,.44,.69,.68,.91,.97,.97,
+     1.23,.30,.33,.51,.48,.66,.81,.70,.70,.80,
+     2.22,.20,.34,.45,.68,.58,.93,.84,.68,.94,
+     3.08,.08,.16,.27,.48,.53,.64,.74,.88,.90/
+C      DATA TESTCP/.00,.04,.07,.29,.40,.77,.92,.98,1.0,1.0/
+C      DATA TESTCR/.43,.53,.64,.74,.84,.95,1.05,1.16,1.26,99./
+      DO 5 I=1,4
+        BASMAX(I)=0.
+        BASMIN(I)=IDIM
+        BPOS(I)=0.
+5     CONTINUE
+C   look back to subtract a codon
+      IPB=IAT-LB
+C   sum for each base for each codon position
+      K=0
+      DO 10 J=IPB,IPB+2
+        K=K+1
+        JS=CTONUM(SEQ(J))
+        BCOMP(JS)=BCOMP(JS)-FRAC
+        POS(JS,K)=POS(JS,K)-1.
+10    CONTINUE
+C   look forwards and add a value
+      IPF=IAT+LF
+      K=0
+      DO 20 J=IPF,IPF+2
+        K=K+1
+        JS=CTONUM(SEQ(J))
+        BCOMP(JS)=BCOMP(JS)+FRAC
+        POS(JS,K)=POS(JS,K)+1.
+20    CONTINUE
+C   get max and min values for each base in each codon position
+      DO 30 J=1,3
+      DO 30 I=1,4
+        IF(POS(I,J).GT.BASMAX(I))BASMAX(I)=POS(I,J)
+        IF(POS(I,J).LT.BASMIN(I))BASMIN(I)=POS(I,J)
+30    CONTINUE
+      DO 40 I=1,4
+        BPOS(I)=BASMAX(I)/(BASMIN(I)+1.)
+40    CONTINUE
+C   look up prob of coding
+      DO 46 I=1,4
+      DO 45 J=1,10
+        IF(BCOMP(I).GT.PCODCR(J))GO TO 45
+C   lt current so set prob
+        BCOMPT(I)=PCODC(J,I)
+        GO TO 46
+45    CONTINUE
+46    CONTINUE
+      DO 48 I=1,4
+      DO 47 J=1,10
+        IF(BPOS(I).GT.PCODPR(J))GO TO 47
+C   lt current so set prob
+        BPOS(I)=PCODP(J,I)
+        GO TO 48
+47    CONTINUE
+48    CONTINUE
+C   apply weights
+      DO 50 I=1,4
+        BCOMPT(I)=BCOMPT(I)*WCOMP(I)
+        BPOS(I)=BPOS(I)*WPOS(I)
+50    CONTINUE
+C   calc testcode
+      TESTC=0.
+      DO 60 I=1,4
+        TESTC=TESTC+BPOS(I)+BCOMPT(I)
+60    CONTINUE
+      ROTFCK=TESTC
+      END
+      SUBROUTINE FCKSTR(SEQ,IDIM,POS,COMP,LWIND3,POS1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL POS(5,3),COMP(5),POS1(15)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C   calc fraction to add ie window length recip
+      FRAC=1./LWIND3
+C   zero variables
+      CALL FILLR(COMP,5,0.)
+      CALL FILLR(POS1,15,0.)
+C   sum for each base for each codon position
+      DO 10 I=1,LWIND3-1,3
+        K=0
+        I1=I
+        I2=I1+2
+        DO 9 J=I1,I2
+          K=K+1
+          JS=CTONUM(SEQ(J))
+          COMP(JS)=COMP(JS)+FRAC
+          POS(JS,K)=POS(JS,K)+1.
+9       CONTINUE
+10    CONTINUE
+      END
+      SUBROUTINE SHEPED(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1P),PAA(5,5,5)
+      INTEGER SPAN,DIALOG
+      CALL SHOWFU(KBOUT,'Shepherds method to find protein genes')
+      CALL GETPAR(47,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL SHEPP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,SPAN,IWRIT)
+      END
+      SUBROUTINE SHEPP(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,LENW,IWRIT)
+C   AUTHOR: RODGER STADEN
+      REAL SUMW(3),PROB(3)
+      CHARACTER SEQ(IDIM1P),PAA(5,5,5)
+      CHARACTER SPACE,RLINE(3),STAR
+      SAVE SPACE,STAR
+      DATA SPACE,STAR/' ','*'/
+      IMH=NINT(0.005*ISYMAX)
+      I1IN=2+MXSPAN/2
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      YMIN=0.2
+      YMAX=0.4
+      YMID=YMIN+(YMAX-YMIN)/2.
+      XMIN=J1
+      XMAX=J2
+C   SORT OUT MARGIN IE FIND MAX AND DIVIDE BY THREE
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1=MARGB
+      MARGB2=MARGB+IMARG
+      MARGB3=MARGB+2*IMARG
+      MARGT3=IMARG
+      MARGT2=IMARG
+      MARGT1=IMARG
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      CALL FILLR(SUMW,3,0.)
+      IDONE=0
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL SHEPST(SEQ(J1P-LB),IDIM1,LENW3,SUMW(1))
+      CALL SHEPST(SEQ(J1P+1-LB),IDIM1,LENW3,SUMW(2))
+      CALL SHEPST(SEQ(J1P+2-LB),IDIM1,LENW3,SUMW(3))
+C   set initial values
+      XP=J1
+      YP1=0.5
+      YP2=0.5
+      YP3=0.5
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        IP0=IP
+        IP1=IP+1
+        IP2=IP+2
+        CALL RTSHEP(SEQ,IDIM1P,IP0,LB,LF,SUMW(1))
+        CALL RTSHEP(SEQ,IDIM1P,IP1,LB,LF,SUMW(2))
+        CALL RTSHEP(SEQ,IDIM1P,IP2,LB,LF,SUMW(3))
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          T=SUMW(1)+SUMW(2)+SUMW(3)
+          PROB(1)=SUMW(1)/T
+          PROB(2)=SUMW(2)/T
+          PROB(3)=SUMW(3)/T
+C   get largest
+          DO 302 II=1,3
+            RLINE(II)=SPACE
+302       CONTINUE
+          TP=MAX(SUMW(1),SUMW(2),SUMW(3))
+          DO 303 II=1,3
+            IF(SUMW(II).EQ.TP)RLINE(II)=STAR
+303       CONTINUE
+C   set actual ip
+          IPA=IP-I1INM1
+          X=IPA
+          Y1=PROB(1)
+          Y2=PROB(2)
+          Y3=PROB(3)
+          CALL LINE(XP,X,YP1,Y1,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1+IMH,MARGT1-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP2,Y2,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2+IMH,MARGT2-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP3,Y3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3+IMH,MARGT3-2*IMH,ISXMAX,ISYMAX)
+          IF(RLINE(1).EQ.STAR)
+     +    CALL POINT(X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(RLINE(2).EQ.STAR)
+     +    CALL POINT(X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+          IF(RLINE(3).EQ.STAR)
+     +    CALL POINT(X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+          YP1=Y1
+          YP2=Y2
+          YP3=Y3
+          XP=X
+        END IF
+300   CONTINUE
+      CALL STARTS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL STOPS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL VT100M
+      END
+      SUBROUTINE SHEPST(SEQ,IDIM,LENW,SUM)
+C   AUTHOR: RODGER STADEN
+      REAL SUM
+      CHARACTER SEQ(IDIM)
+      INTEGER SCORER(5),SCOREY(5),CTONUM
+      EXTERNAL CTONUM
+      SAVE SCORER,SCOREY
+      DATA SCORER,SCOREY/0,0,1,1,0,1,1,0,0,0/
+      SUM=0.
+      DO 100 I=1,LENW,3
+        SUM=SUM+SCORER(CTONUM(SEQ(I)))+SCOREY(CTONUM(SEQ(I+2)))
+100   CONTINUE
+      END
+      SUBROUTINE RTSHEP(SEQ,IDIM1,I,LB,LF,H)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      INTEGER SCORER(4),SCOREY(4),CTONUM
+      EXTERNAL CTONUM
+      SAVE SCORER,SCOREY
+      DATA SCORER,SCOREY/0,0,1,1,1,1,0,0/
+C   set up dummy values
+      XS=0.
+      XA=0.
+C   look back
+      IPB=I-LB-1
+      DO 10 J=1,3
+        IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 30
+10    CONTINUE
+C   set value to subtract
+      IPB=IPB+1
+      IT1=CTONUM(SEQ(IPB))
+      IT2=CTONUM(SEQ(IPB+2))
+      XS=SCORER(IT1)+SCOREY(IT2)
+C   now look forward
+30    CONTINUE
+      IPB=I+LF-1
+C   look for bad char
+      DO 45 J=1,3
+        IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 50
+45    CONTINUE
+C   ok so use
+      IPB=IPB+1
+      IT1=CTONUM(SEQ(IPB))
+      IT2=CTONUM(SEQ(IPB+2))
+      XA=SCORER(IT1)+SCOREY(IT2)
+50    CONTINUE
+      H=H-XS+XA
+      END
+      SUBROUTINE IMPBC(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,DIALOG)
+      CHARACTER HELPF*(*),PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      INTEGER SPAN,DIALOG
+      REAL FTABLE(IDFTAB)
+      CHARACTER SEQ(IDIM1P)
+      CALL SHOWFU(KBOUT,'McLachlan,Staden,Boswell codon improbability')
+C      WRITE(KBOUT,*)' Search for protein genes using McLachlan, Staden'
+C      WRITE(KBOUT,*)' and Boswell improbability method'
+C      WRITE(KBOUT,*)' expecting codon composition to depend on'
+C      WRITE(KBOUT,*)' base composition'
+      CALL GETPAR(45,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL IMPBCP(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,SPAN,IWRIT)
+      END
+      SUBROUTINE IMPBCP(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,LENW,IWRIT)
+      CHARACTER PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL RNBASE(4)
+      REAL W(3),RNFAC(3),COMP(3),FTABLE(IDFTAB),FACN
+      INTEGER NBASE1(4),NBASE2(4),NBASE3(4)
+      INTEGER NCOD1(4,4,4),NCOD2(4,4,4),NCOD3(4,4,4)
+      CHARACTER SEQ(IDIM1P),RLINE(3),STAR
+      SAVE STAR
+      DATA STAR/'*'/
+C   SORT OUT MARGIN IE FIND MAX AND DIVIDE BY THREE
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1=MARGB
+      MARGB2=MARGB+IMARG
+      MARGB3=MARGB+2*IMARG
+      MARGT3=IMARG
+      MARGT2=IMARG
+      MARGT1=IMARG
+      IMH=NINT(0.005*ISYMAX)
+      SD=4.1
+      EXPM=0.
+      YMIN=EXPM
+      YMAX=EXPM+12.*SD
+      IBH=NINT(0.005*ISXMAX)
+C  ONLY CALC FACTORS UPTO 99!
+      CALL FACTAB(FTABLE,99)
+      I1IN=2+MXSPAN/2
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      RLENW3=LENW3
+      FACN=FACTOR(LENW,FTABLE,IDFTAB)
+      XMIN=J1
+      XMAX=J2
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB1,MARGT1,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB2,MARGT2,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB3,MARGT3,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL ROT1(SEQ(J1P-LB),IDIM1,NCOD1,RNFAC(1),FTABLE,
+     +IDFTAB,NBASE1,LENW3)
+      CALL ROT1(SEQ(J1P+1-LB),IDIM1,NCOD2,RNFAC(2),FTABLE,
+     +IDFTAB,NBASE2,LENW3)
+      CALL ROT1(SEQ(J1P+2-LB),IDIM1,NCOD3,RNFAC(3),FTABLE,
+     +IDFTAB,NBASE3,LENW3)
+C   set initial values
+      XP=J1
+      YP1=0.
+      YP2=0.
+      YP3=0.
+      IDONE=0
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        IP0=IP
+        IP1=IP+1
+        IP2=IP+2
+        CALL ROTFAC(SEQ,IDIM1P,IP0,LB,LF,NCOD1,RNFAC(1),
+     +  FTABLE,IDFTAB,NBASE1,COMP(1),LENW3)
+        CALL ROTFAC(SEQ,IDIM1P,IP1,LB,LF,NCOD2,RNFAC(2),
+     +  FTABLE,IDFTAB,NBASE2,COMP(2),LENW3)
+        CALL ROTFAC(SEQ,IDIM1P,IP2,LB,LF,NCOD3,RNFAC(3),
+     +  FTABLE,IDFTAB,NBASE3,COMP(3),LENW3)
+        DO 200 K=1,3
+          W(K)=RNFAC(K)-COMP(K)-FACN
+200     CONTINUE
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          DO 302 II=1,3
+            RLINE(II)=' '
+302       CONTINUE
+          T=MAX(W(1),W(2),W(3))
+          DO 303 II=1,3
+            IF(W(II).EQ.T)RLINE(II)=STAR
+303       CONTINUE
+          IPA=IP-I1INM1
+          DO 787 JJJ=1,4
+            RNBASE(JJJ)=FLOAT(NBASE1(JJJ))/RLENW3
+787       CONTINUE
+          CALL POISON(LENW,RNBASE,RNBASE,EW,WD,1)
+          YMAX=EXPM+12.*WD
+          WD3=WD*6.
+          X=IPA
+          Y1=W(1)-EW
+          Y2=W(2)-EW
+          Y3=W(3)-EW
+          CALL LINE(XP,X,YP1,Y1,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1+IMH,MARGT1-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP2,Y2,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2+IMH,MARGT2-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP3,Y3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3+IMH,MARGT3-2*IMH,ISXMAX,ISYMAX)
+          IF(RLINE(1).EQ.STAR)CALL POINT(X,WD3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(RLINE(2).EQ.STAR)CALL POINT(X,WD3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+          IF(RLINE(3).EQ.STAR)CALL POINT(X,WD3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+          YP1=Y1
+          YP2=Y2
+          YP3=Y3
+          XP=X
+        END IF
+300   CONTINUE
+      CALL STARTS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL STOPS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL VT100M
+      END
+      SUBROUTINE IMPAC(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,DIALOG)
+      CHARACTER HELPF*(*),PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL FTABLE(IDFTAB)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER SPAN,DIALOG
+      CALL SHOWFU(KBOUT,'McLachlan,Staden,Boswell codon improbability')
+C      WRITE(KBOUT,*)' Search for protein genes using McLachlan, Staden'
+C      WRITE(KBOUT,*)' and Boswell improbability method'
+C      WRITE(KBOUT,*)' expecting codon composition to depend on'
+C      WRITE(KBOUT,*)' amino acid composition'
+      CALL GETPAR(46,6,IOK,MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,
+     +IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(DIALOG.EQ.1) THEN
+        CALL GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL IMPACP(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,SPAN,IWRIT)
+      END
+      SUBROUTINE IMPACP(SEQ,IDIM1P,IDIM1,MXSPAN,FTABLE,IDFTAB,
+     +ISXMAX,ISYMAX,J1,J2,MARGL,MARGR,MARGB,MARGT,PAA,LENW,IWRIT)
+      CHARACTER PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL W(3),RNFAC(3),COMP(3),FTABLE(IDFTAB),FACN
+      INTEGER NBASE1(4),NBASE2(4),NBASE3(4)
+      INTEGER NCOD1(4,4,4),NCOD2(4,4,4),NCOD3(4,4,4)
+      INTEGER NCODON(21),IACID(4,4,4)
+      REAL FABC1(4,4,4),FA1(21)
+      REAL FABC2(4,4,4),FA2(21)
+      REAL FABC3(4,4,4),FA3(21)
+      CHARACTER SEQ(IDIM1P),RLINE(3),STAR
+      SAVE IACID,NCODON,STAR
+      DATA STAR/'*'/
+C    array with number of codons per acid
+      DATA IACID/
+     +1,2,3,5,6,7,8,9,
+     +10,12,14,16,18,20,6,21,
+     +1,2,3,5,6,7,8,9,
+     +10,12,14,16,18,20,6,21,
+     +2,2,3,5,6,7,8,9,
+     +11,13,15,17,11,20,20,21,
+     +2,2,4,5,6,7,8,9,
+     +11,13,15,17,19,20,20,21/
+      DATA NCODON/
+     +2,6,3,1,4,6,4,4,4,2,3,2,2,2,2,2,2,2,1,6,4/
+C   SORT OUT MARGIN IE FIND MAX AND DIVIDE BY THREE
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1=MARGB
+      MARGB2=MARGB+IMARG
+      MARGB3=MARGB+2*IMARG
+      MARGT3=IMARG
+      MARGT2=IMARG
+      MARGT1=IMARG
+      SD=4.1
+      EXPM=0.
+      YMIN=EXPM
+      YMAX=EXPM+15.*SD
+      IBH=NINT(0.005*ISXMAX)
+      IMH=NINT(0.005*ISYMAX)
+C  ONLY CALC FACTORS UPTO 99!
+      CALL FACTAB(FTABLE,99)
+      I1IN=2+MXSPAN/2
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      XLENW=LENW
+      FACN=FACTOR(LENW,FTABLE,IDFTAB)
+      XMIN=J1
+      XMAX=J2
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB1,MARGT1,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB2,MARGT2,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB3,MARGT3,ISXMAX,ISYMAX,SD,IBH,YMIN,2)
+      CALL ROT2(SEQ(J1P-LB),IDIM1,NCOD1,RNFAC(1),FTABLE,
+     +IDFTAB,NBASE1,LENW3,FA1,FABC1,NCODON,IACID)
+      CALL ROT2(SEQ(J1P+1-LB),IDIM1,NCOD2,RNFAC(2),FTABLE,
+     +IDFTAB,NBASE2,LENW3,FA2,FABC2,NCODON,IACID)
+      CALL ROT2(SEQ(J1P+2-LB),IDIM1,NCOD3,RNFAC(3),FTABLE,
+     +IDFTAB,NBASE3,LENW3,FA3,FABC3,NCODON,IACID)
+C   set initial values
+      XP=J1
+      YP1=0.
+      YP2=0.
+      YP3=0.
+      IDONE=0
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        IP0=IP
+        IP1=IP+1
+        IP2=IP+2
+        CALL ROTFAK(SEQ,IDIM1P,IP0,LB,LF,NCOD1,RNFAC(1),
+     +FTABLE,IDFTAB,NBASE1,COMP(1),LENW3,XLENW,FA1,FABC1,NCODON,IACID)
+        CALL ROTFAK(SEQ,IDIM1P,IP1,LB,LF,NCOD2,RNFAC(2),
+     +FTABLE,IDFTAB,NBASE2,COMP(2),LENW3,XLENW,FA2,FABC2,NCODON,IACID)
+        CALL ROTFAK(SEQ,IDIM1P,IP2,LB,LF,NCOD3,RNFAC(3),
+     +FTABLE,IDFTAB,NBASE3,COMP(3),LENW3,XLENW,FA3,FABC3,NCODON,IACID)
+        DO 200 K=1,3
+          W(K)=RNFAC(K)-COMP(K)-FACN
+200     CONTINUE
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          DO 302 II=1,3
+            RLINE(II)=' '
+302       CONTINUE
+          T=MAX(W(1),W(2),W(3))
+          DO 303 II=1,3
+            IF(W(II).EQ.T)RLINE(II)=STAR
+303       CONTINUE
+          IPA=IP-I1INM1
+          CALL POISON(LENW,FABC1,FABC1,EW1,WD1,2)
+          CALL POISON(LENW,FABC2,FABC2,EW2,WD2,2)
+          CALL POISON(LENW,FABC3,FABC3,EW3,WD3,2)
+          WDT3=(WD1+WD2+WD3)/3.
+          YMAX=EXPM+15.*WDT3
+          WDT3=(YMAX-YMIN)/2.
+          X=IPA
+          Y1=W(1)-EW1
+          Y2=W(2)-EW2
+          Y3=W(3)-EW3
+          CALL LINE(XP,X,YP1,Y1,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1+IMH,MARGT1-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP2,Y2,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2+IMH,MARGT2-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP3,Y3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3+IMH,MARGT3-2*IMH,ISXMAX,ISYMAX)
+          IF(RLINE(1).EQ.STAR)CALL POINT(X,WDT3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(RLINE(2).EQ.STAR)CALL POINT(X,WDT3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+          IF(RLINE(3).EQ.STAR)CALL POINT(X,WDT3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+          YP1=Y1
+          YP2=Y2
+          YP3=Y3
+          XP=X
+        END IF
+300   CONTINUE
+      CALL STARTS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL STOPS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL VT100M
+      END
+      SUBROUTINE ROT1(SEQ,IDIM,NCOD,RNFAC,FTABLE,IDFTAB,
+     +NBASE,SPAN3)
+C   AUTHOR: RODGER STADEN
+      REAL RNFAC,FTABLE(IDFTAB),FACTOR
+      INTEGER NCOD(4,4,4),NBASE(4),SPAN3,CTONUM
+      CHARACTER SEQ(IDIM)
+      EXTERNAL FACTOR,CTONUM
+C   zero values
+      DO 1 I=1,4
+      NBASE(I)=0
+      DO 1 J=1,4
+      DO 1 K=1,4
+1     NCOD(I,J,K)=0
+C   loop for span length
+      DO 10 I=1,SPAN3-1,3
+      IP=I-1
+C   check for bad chars
+      DO 5 J=1,3
+5     IF(CTONUM(SEQ(IP+J)).EQ.5)GO TO 10
+C   ok
+      IS=CTONUM(SEQ(I))
+      IS1=CTONUM(SEQ(I+1))
+      IS2=CTONUM(SEQ(I+2))
+      NCOD(IS,IS1,IS2)=NCOD(IS,IS1,IS2)+1
+      NBASE(IS)=NBASE(IS)+1
+      NBASE(IS1)=NBASE(IS1)+1
+      NBASE(IS2)=NBASE(IS2)+1
+10    CONTINUE
+C   set up rnfac
+      RNFAC=0.
+      DO 20 I=1,4
+      DO 20 J=1,4
+      DO 20 K=1,4
+20    RNFAC=RNFAC+FACTOR(NCOD(I,J,K),FTABLE,IDFTAB)
+      END
+C   rotate factor
+      SUBROUTINE ROTFAC(SEQ,IDIM,I,LB,LF,NCOD,RNFAC,FTABLE,
+     +IDFTAB,NBASE,COMP,SPAN3)
+C   AUTHOR: RODGER STADEN
+      REAL RNFAC,FTABLE(IDFTAB),FACTOR
+      INTEGER NCOD(4,4,4),NBASE(4),SPAN3,CTONUM
+      CHARACTER SEQ(IDIM)
+      EXTERNAL FACTOR,CTONUM
+C   look back
+      IPB=I-LB-1
+C   look for bad char (it will not have been used)
+      DO 10 J=1,3
+10    IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 30
+C   ok
+      IPB=IPB+1
+      IS1=CTONUM(SEQ(IPB))
+      IS2=CTONUM(SEQ(IPB+1))
+      IS3=CTONUM(SEQ(IPB+2))
+C   subtract the current factorial value from rotated score
+      RNFAC=RNFAC-FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   subtract from codon table
+      NCOD(IS1,IS2,IS3)=NCOD(IS1,IS2,IS3)-1
+C   add this new value to the rotated score
+      RNFAC=RNFAC+FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   now do the composition part
+      NBASE(IS1)=NBASE(IS1)-1
+      NBASE(IS2)=NBASE(IS2)-1
+      NBASE(IS3)=NBASE(IS3)-1
+30    CONTINUE
+C   now look forwards
+      IPB=I+LF-1
+C   look for bad char
+      DO 40 J=1,3
+40    IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 50
+C   ok
+      IPB=IPB+1
+      IS1=CTONUM(SEQ(IPB))
+      IS2=CTONUM(SEQ(IPB+1))
+      IS3=CTONUM(SEQ(IPB+2))
+C   subtract the current factorial value from rotated score
+      RNFAC=RNFAC-FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+      NCOD(IS1,IS2,IS3)=NCOD(IS1,IS2,IS3)+1
+C   add this new value to the rotated score
+      RNFAC=RNFAC+FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   now do the composition part
+      NBASE(IS1)=NBASE(IS1)+1
+      NBASE(IS2)=NBASE(IS2)+1
+      NBASE(IS3)=NBASE(IS3)+1
+50    CONTINUE
+      COMP=0.
+      DO 60 J=1,4
+      FREQ=FLOAT(NBASE(J))/FLOAT(SPAN3)
+      IF(FREQ.GT.0.)THEN
+              COMP=COMP+FLOAT(NBASE(J))*LOG(FREQ)
+      GO TO 60
+      END IF
+60    CONTINUE
+      END
+      REAL FUNCTION FACTOR(N,FTABLE,IDFTAB)
+C   AUTHOR: RODGER STADEN
+      REAL FTABLE(IDFTAB)
+      FACTOR=0.0
+C   in range?
+      IF(N.LT.1)GO TO 10
+      IF(N.GT.IDFTAB)GO TO 20
+      FACTOR=FTABLE(N)
+      RETURN
+10    CONTINUE
+      FACTOR=1.
+      RETURN
+20    CONTINUE
+      WRITE(*,1000)N
+1000  FORMAT(' ERROR IN FACTOR, N=',I6)
+      END
+C   factorial table set up
+      SUBROUTINE FACTAB(FTABLE,IDFTAB)
+C   AUTHOR: RODGER STADEN
+      REAL FTABLE(IDFTAB),X1,LOG2PI
+      LOG2PI=0.5*LOG(6.2832)
+      X1=1.
+      DO 10 I=1,20
+      X1=X1*FLOAT(I)
+10    FTABLE(I)=LOG(X1)
+C   stirlings formula:
+C   lni!=(i+.5)*lni-i+0.5ln2pi
+C
+      DO 20 I=21,IDFTAB
+      X1=I
+20    FTABLE(I)=(X1+0.5)*LOG(X1)-X1+LOG2PI
+      END
+      SUBROUTINE ROT2(SEQ,IDIM,NCOD,RNFAC,FTABLE,IDFTAB,
+     +NBASE,SPAN3,FA,FABC,NCODON,IACID)
+C   AUTHOR: RODGER STADEN
+      REAL RNFAC,FTABLE(IDFTAB),FACTOR,FA(21),FABC(4,4,4)
+      INTEGER NCOD(4,4,4),NBASE(4),NCODON(21),IACID(4,4,4),SPAN3
+      INTEGER CTONUM
+      CHARACTER SEQ(IDIM)
+      EXTERNAL FACTOR,CTONUM
+      SPAN=SPAN3/3.
+C   zero values
+      DO 1 I=1,4
+      NBASE(I)=0
+      DO 1 J=1,4
+      DO 1 K=1,4
+      FABC(I,J,K)=0.
+1     NCOD(I,J,K)=0
+      DO 2 I=1,21
+      FA(I)=0.0
+2     CONTINUE
+C   loop for span length
+      DO 10 I=1,SPAN3-1,3
+      IP=I-1
+C   check for bad chars
+      DO 5 J=1,3
+5     IF(CTONUM(SEQ(IP+J)).EQ.5)GO TO 10
+C   ok
+      IS=CTONUM(SEQ(I))
+      IS1=CTONUM(SEQ(I+1))
+      IS2=CTONUM(SEQ(I+2))
+      NCOD(IS,IS1,IS2)=NCOD(IS,IS1,IS2)+1
+      NBASE(IS)=NBASE(IS)+1
+      NBASE(IS1)=NBASE(IS1)+1
+      NBASE(IS2)=NBASE(IS2)+1
+10    CONTINUE
+C   set up rnfac
+      RNFAC=0.
+      DO 20 I=1,4
+      DO 20 J=1,4
+      DO 20 K=1,4
+      JACID=IACID(I,J,K)
+      FA(JACID)=FA(JACID)+NCOD(I,J,K)
+20    RNFAC=RNFAC+FACTOR(NCOD(I,J,K),FTABLE,IDFTAB)
+C   CALC EVEN USE OF EACH ACIDS CODONS
+      DO 21 I=1,4
+      DO 21 J=1,4
+      DO 21 K=1,4
+      JACID=IACID(I,J,K)
+      FABC(I,J,K)=FA(JACID)/(SPAN*NCODON(JACID))
+21    CONTINUE
+      END
+C   rotate factor
+      SUBROUTINE ROTFAK(SEQ,IDIM,I,LB,LF,NCOD,RNFAC,FTABLE,
+     +IDFTAB,NBASE,COMP,SPAN3,SPAN,FA,FABC,NCODON,IACID)
+C   AUTHOR: RODGER STADEN
+      REAL RNFAC,FTABLE(IDFTAB),FACTOR,FABC(4,4,4),FA(21)
+      INTEGER NCOD(4,4,4),NBASE(4),SPAN3,NCODON(21),IACID(4,4,4)
+      INTEGER CTONUM
+      CHARACTER SEQ(IDIM)
+      EXTERNAL FACTOR,CTONUM
+C   look back
+      IPB=I-LB-1
+C   look for bad char (it will not have been used)
+      DO 10 J=1,3
+10    IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 30
+C   ok
+      IPB=IPB+1
+      IS1=CTONUM(SEQ(IPB))
+      IS2=CTONUM(SEQ(IPB+1))
+      IS3=CTONUM(SEQ(IPB+2))
+C   subtract the current factorial value from rotated score
+      RNFAC=RNFAC-FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   subtract from codon table
+      NCOD(IS1,IS2,IS3)=NCOD(IS1,IS2,IS3)-1
+C   add this new value to the rotated score
+      RNFAC=RNFAC+FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   now do the composition part
+      NBASE(IS1)=NBASE(IS1)-1
+      NBASE(IS2)=NBASE(IS2)-1
+      NBASE(IS3)=NBASE(IS3)-1
+      JACID=IACID(IS1,IS2,IS3)
+      FA(JACID)=FA(JACID)-1.
+      TOTA=FA(JACID)/(SPAN*NCODON(JACID))
+      CALL NORMA2(FABC,JACID,TOTA)
+30    CONTINUE
+C   now look forwards
+      IPB=I+LF-1
+C   look for bad char
+      DO 40 J=1,3
+40    IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 50
+C   ok
+      IPB=IPB+1
+      IS1=CTONUM(SEQ(IPB))
+      IS2=CTONUM(SEQ(IPB+1))
+      IS3=CTONUM(SEQ(IPB+2))
+C   subtract the current factorial value from rotated score
+      RNFAC=RNFAC-FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+      NCOD(IS1,IS2,IS3)=NCOD(IS1,IS2,IS3)+1
+C   add this new value to the rotated score
+      RNFAC=RNFAC+FACTOR(NCOD(IS1,IS2,IS3),FTABLE,IDFTAB)
+C   now do the composition part
+      NBASE(IS1)=NBASE(IS1)+1
+      NBASE(IS2)=NBASE(IS2)+1
+      NBASE(IS3)=NBASE(IS3)+1
+      JACID=IACID(IS1,IS2,IS3)
+      FA(JACID)=FA(JACID)+1.
+      TOTA=FA(JACID)/(SPAN*NCODON(JACID))
+      CALL NORMA2(FABC,JACID,TOTA)
+50    CONTINUE
+      COMP=0.
+      DO 60 J=1,4
+      FREQ=FLOAT(NBASE(J))/FLOAT(SPAN3)
+      IF(FREQ.GT.0.)THEN
+        COMP=COMP+FLOAT(NBASE(J))*LOG(FREQ)
+      GO TO 60
+      END IF
+60    CONTINUE
+      END
+      SUBROUTINE POISON(LCOD,F,FABC,OUTW,OUTWD,JOB)
+C   AUTHOR: D. R BOSWELL AND RODGER STADEN
+      REAL F(4),FABC(4,4,4)
+C
+C  modified by rs 2-2-83 to include correction of -.5*ln(2piN) where
+C  N is no of codons (lcod). This is required because I calc using
+C  multinomial distribution and poisson uses poisson distributon.
+C  maybe later I should also use poisson.
+C  Modified 25 Jan 83 to include corrections for lack of restriction
+C   of total codon number inherent in the Poisson model
+C   (see Andrew's notes of 7 Jan 83)
+C
+C  The following arrays contain values of <W> and <(delta W)**2>
+C   computed for Poisson distributions for nu from 0.0 to 10.0 in
+C   steps of 0.1
+C  The functions are sufficiently smooth to allow linear interpolation
+C   between these values
+C
+      REAL W(0:100),DW2(0:100),W10(10),DW210(10)
+      SAVE W10,DW2,W,DW210
+      DATA   W/ 0.0,0.333677,  0.535378,  0.691144, 0.819071,0.927637,
+     1  1.021753,  1.104597,  1.178383,  1.244725,  1.304842,
+     1  1.359685,  1.410006,  1.456414,  1.499409,  1.539405,
+     1  1.576749,  1.611735,  1.644613,  1.675600,  1.704883,
+     1  1.732621,  1.758958,  1.784016,  1.807907,  1.830726,
+     1  1.852561,  1.873487,  1.893574,  1.912883,  1.931471,
+     1  1.949385,  1.966674,  1.983377,  1.999532,  2.015173,
+     1  2.030330,  2.045035,  2.059310,  2.073183,  2.086673,
+     1  2.099801,  2.112587,  2.125048,  2.137200,  2.149058,
+     1  2.160635,  2.171947,  2.183003,  2.193815,  2.204396,
+     1  2.214752,  2.224895,  2.234834,  2.244576,  2.254128,
+     1  2.263500,  2.272696,  2.281723,  2.290591,  2.299299,
+     1  2.307859,  2.316271,  2.324544,  2.332680,  2.340684,
+     1  2.348562,  2.356317,  2.363950,  2.371470,  2.378875,
+     1  2.386173,  2.393366,  2.400454,  2.407444,  2.414335,
+     1  2.421134,  2.427840,  2.434457,  2.440988,  2.447433,
+     1  2.453796,  2.460078,  2.466282,  2.472411,  2.478464,
+     1  2.484445,  2.490354,  2.496194,  2.501966,  2.507673,
+     1  2.513316,  2.518894,  2.524412,  2.529868,  2.535266,
+     1  2.540605,  2.545888,  2.551117,  2.556289,  2.561411/
+      DATA DW2/ 0.0,0.564147,  0.615536,  0.604980,0.578119,0.548425,
+     1  0.520633,  0.496362,  0.475974,  0.459312,  0.446008,
+     1  0.435632,  0.427759,  0.421991,  0.417978,  0.415411,
+     1  0.414025,  0.413597,  0.413938,  0.414884,  0.416304,
+     1  0.418091,  0.420146,  0.422401,  0.424790,  0.427265,
+     1  0.429780,  0.432308,  0.434822,  0.437301,  0.439725,
+     1  0.442090,  0.444380,  0.446592,  0.448719,  0.450763,
+     1  0.452725,  0.454593,  0.456378,  0.458077,  0.459696,
+     1  0.461236,  0.462698,  0.464085,  0.465401,  0.466650,
+     1  0.467835,  0.468958,  0.470021,  0.471033,  0.471987,
+     1  0.472895,  0.473758,  0.474571,  0.475347,  0.476081,
+     1  0.476778,  0.477444,  0.478074,  0.478669,  0.479243,
+     1  0.479780,  0.480300,  0.480791,  0.481261,  0.481711,
+     1  0.482136,  0.482542,  0.482935,  0.483305,  0.483668,
+     1  0.484009,  0.484333,  0.484652,  0.484950,  0.485244,
+     1  0.485518,  0.485790,  0.486046,  0.486291,  0.486531,
+     1  0.486763,  0.486985,  0.487203,  0.487408,  0.487606,
+     1  0.487800,  0.487990,  0.488172,  0.488348,  0.488517,
+     1  0.488678,  0.488844,  0.488993,  0.489148,  0.489290,
+     1  0.489438,  0.489575,  0.489704,  0.489840,  0.489961/
+C
+C  The next two arrays are for nu from 10.0 to 100.0 in steps of 10.0
+C
+      DATA   W10/
+     1  2.561411,  2.912526,  3.116712,  3.261270,  3.373267,
+     1  3.464711,  3.541987,  3.608907,  3.667911,  3.720688/
+      DATA DW210/
+     1  0.489961,  0.495480,  0.497065,  0.497826,  0.498281,
+     1  0.498570,  0.498778,  0.498917,  0.499063,  0.499146/
+C
+C  rs correction factor
+      CORREC=0.5*LOG(6.2832*LCOD)
+C
+C
+C  Initialise accumulators to zero
+C   (Note that SDW will accumulate the sum of squares of DW, and
+C   we will need to take its square root at the end
+C
+      SW=0.0
+      SDW=0.0
+C  Step through all 64 codons
+      DO 200 I=1,4
+      DO 200 J=1,4
+      DO 200 K=1,4
+C  V (=nu) is the expected codon frequency in this slot of the table
+      IF(JOB.EQ.1)THEN
+                V=REAL(LCOD)*F(I)*F(J)*F(K)
+      ELSE IF(JOB.EQ.2)THEN
+                V=REAL(LCOD)*FABC(I,J,K)
+      END IF
+C  V<0.0 is a fatal error (implies bad parameters)
+      IF(V.LT.0.0)GO TO 900
+C  V>=10.0 requires separate handling
+      IF(V.GE.10.0)GOTO 100
+C  Now we can interpolate in the tables and accumulate the appropriate values
+      IV=IFIX(V*10.0)
+      DV=V*10.0-IV
+      SW=SW+W(IV)+DV*(W(IV+1)-W(IV))
+      SDW=SDW+DW2(IV)+DV*(DW2(IV+1)-DW2(IV))
+      GO TO 200
+C  Come here if V>=10.0
+100   IF(V.GT.100.0)GOTO 110
+      IV=IFIX(V/10.0)
+      DV=V/10.0-IV
+      SW=SW+W10(IV)+DV*(W10(IV+1)-W10(IV))
+      SDW=SDW+DW210(IV)+DV*(DW210(IV+1)-DW210(IV))
+      GO TO 200
+C  If V>100.0 we're quite justified in using the "large V" approximations
+110   SW=SW+0.5*(LOG(2.0*3.1415926536)+LOG(V)+1.0)
+      SDW=SDW+0.5
+200   CONTINUE
+C  Include correction factors
+      OUTW=MAX(0.0,SW-64.0/(4.0*REAL(LCOD)))
+      OUTWD=SQRT(MAX(0.0,SDW-64.0*64.0/(4.0*REAL(LCOD))))
+C  rs correction
+      OUTW=OUTW-CORREC
+      RETURN
+C  Come here if there's an error in the parameters
+900   WRITE(*,1)LCOD,F
+1     FORMAT(' POISSON ERROR -- INVALID PARAMETERS'/,
+     1' SEQUENCE LENGTH: ',I5,'   BASE FREQUENCIES: ',4F8.3)
+      OUTW=0.0
+      OUTWD=0.0
+      END
+      SUBROUTINE NORMA2(SUM,JACID,TOTA)
+C   AUTHOR: RODGER STADEN
+C     routine to set all the values for any acid to one
+C   particular value=tota for a table of codon freqs
+C   ncods is number of cods per acid numbering from f to g
+C   codnos are pointers to the values of codon usage in sum
+C   (which is 4,4,4) for each set of codons for each acid in
+C   turn from f to g (* is acid 11)
+C   codnos are pointers to the first element in codp for each
+C   acids set of codons
+C   if jacid gt 0 process only acid jacid, else process all
+C   tota is the total value each acid should sum to
+      REAL SUM(64)
+      INTEGER NCODS(21),CODNOS(21),CODP(64)
+      SAVE NCODS,CODNOS,CODP
+      DATA NCODS/2,6,3,1,4,6,4,4,4,2,3,2,2,2,2,2,2,2,1,6,4/
+      DATA CODNOS/1,3,9,12,13,17,23,27,31,35,37,40,42,44,46,
+     +48,50,52,54,55,61/
+      DATA CODP/1,17,33,49,2,18,34,50,3,19,35,51,
+     +4,20,36,52,5,21,37,53,15,31,6,22,38,54,
+     +7,23,39,55,8,24,40,56,9,25,41,57,45,
+     +10,26,42,58,11,27,43,59,12,28,44,60,
+     +13,29,61,14,30,46,62,47,63,16,32,48,64/
+      IACID=1
+      IF(JACID.NE.0)IACID=JACID
+1     CONTINUE
+C   loop for acid iacid from codnos to codnos+ncods-1
+      DO 20 J=CODNOS(IACID),CODNOS(IACID)+NCODS(IACID)-1
+      SUM(CODP(J))=TOTA
+20    CONTINUE
+      IF(JACID.NE.0)RETURN
+      IACID=IACID+1
+      IF(IACID.LT.22)GO TO 1
+      END
+      SUBROUTINE PCODUS(SEQ,IDIM1P,IEND,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,MARGL,MARGR,MARGB,MARGT,IDEV,FILNAM,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA)
+      CHARACTER HELPF*(*),PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL FREQ(64),SUM(64),LOGFRQ(64)
+      CHARACTER SEQ(IDIM1P),FILNAM*(*)
+      INTEGER ANSTY
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IEND IS THE SEQUENCE NUMBER OF THELAST 
+C   ELEMENT IN THE RAM BUFFER
+C   PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO IEND
+      CALL SHOWFU(KBOUT,'Staden and McLachlan codon usage')
+C      WRITE(KBOUT,*)'Staden and McLachlan codon usage method'
+C      WRITE(KBOUT,*)'Codon tables for standards may be read from disk'
+C      WRITE(KBOUT,*)'or calculated from parts of the current sequence'
+      CALL GETPAR(42,7,IOK,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,ANSTY,
+     +IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL PCODD(SEQ,IDIM1P,IEND,MXSPAN,
+     +KSTART,IDEV,FILNAM,FREQ,SUM,LOGFRQ,ANSTY,
+     +MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,P1,P2,P3,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL PCODPP(SEQ,IDIM1P,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,MARGL,MARGR,MARGB,MARGT,PAA,
+     +LOGFRQ,LENW,IWRIT,P1,P2,P3)
+      END
+      SUBROUTINE PCODD(SEQ,IDIM1P,IEND,MXSPAN,
+     +KSTART,IDEV,FILNAM,FREQ,SUM,LOGFRQ,ANSTY,
+     +MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,P1,P2,P3,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,IOK)
+      CHARACTER HELPF*(*),PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL FREQ(64),SUM(64),LOGFRQ(64)
+      CHARACTER SEQ(IDIM1P),FILNAM*(*)
+      INTEGER ANSTY
+      CALL FILLR(FREQ,64,0.)
+      CALL PCODD1(ANSTY,PAA,FREQ,IDEV,FILNAM,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOK.NE.0) RETURN
+      I1INM1 = 1+(MXSPAN/2)-KSTART+1
+      IF(ANSTY.EQ.0) THEN
+6       CONTINUE
+        I1 = 0
+        I2 = 0
+        CALL GTREG(KBIN,KBOUT,0,IEND,I1,I2,
+     +  'Define standard',
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(I1.NE.0) THEN
+          IDIM=I2-I1+1
+          IF(IDIM.GT.3) THEN
+            CALL FILLR(SUM,64,0.)
+            CALL CALCOD(SUM,SEQ(I1+I1INM1),IDIM)
+            CALL ADDR(SUM,FREQ,64)
+            CALL WRTCOD(SUM,KBOUT,PAA)
+            GO TO 6
+          END IF
+        END IF
+      END IF
+      SUMCOD=0.
+      DO 50 I=1,64
+        SUMCOD=SUMCOD+FREQ(I)
+50    CONTINUE
+      WRITE(KBOUT,1006)SUMCOD
+1006  FORMAT(' Total codons in standard= ',F8.0)
+C     DONT CONTINUE IF LESS THAN 64. CODONS SPECIFIED
+      IF(SUMCOD.LT.64.)THEN
+        CALL ERROM(KBOUT,'At least 64 codons required in standard')
+        IOK = 1
+        RETURN
+      END IF
+      CALL AACODM(KBIN,KBOUT,FREQ,PAA,SUMCOD,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.LT.0)RETURN
+      CALL WRTCOD(FREQ,KBOUT,PAA)
+      SUMCOD=0.
+      DO 51 I=1,64
+        SUMCOD=SUMCOD+FREQ(I)
+51    CONTINUE
+C     NORMALIZE, DO STOP CODONS AND ZERO CODONS
+      CALL PCOD2(FREQ,LOGFRQ,SUMCOD,LOGFRQ,PAA)
+C      CALL WRTCOD(LOGFRQ,KBOUT,PAA)
+C   CALC EXPECTED FREQS FRAME S AND T CODONS AND EXPECTED P'S FOR ALL FRAMES
+      LENWT = 11
+      DO 60 I = 1,3
+        LENWT = LENWT + 10
+        CALL PCOD3(FREQ,LOGFRQ,LENWT,P1,P2,P3)
+        WRITE(KBOUT,2000)LENWT,P1,P2,P3
+60    CONTINUE
+2000  FORMAT(' Span length ',I3,' expected mean values:',3(F6.1))
+      CALL GSPIN(MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL PCOD3(FREQ,LOGFRQ,LENW,P1,P2,P3)
+      END
+      SUBROUTINE PCODPP(SEQ,IDIM1P,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,MARGL,MARGR,MARGB,MARGT,PAA,
+     +LOGFRQ,LENW,IWRIT,P1,P2,P3)
+      CHARACTER PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL LOGFRQ(64),SUMW(3)
+      CHARACTER SEQ(IDIM1P),RFRAME(3)
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      EPS=0.0000001
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1=MARGB
+      MARGB2=MARGB+IMARG
+      MARGB3=MARGB+2*IMARG
+      MARGT3=IMARG
+      MARGT2=IMARG
+      MARGT1=IMARG
+      I1IN=2+(MXSPAN/2)-KSTART+1
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      XMIN=J1
+      XMAX=J2
+      IBH=NINT(0.005*ISXMAX)
+      IMH=NINT(0.005*ISYMAX)
+      SD=1.
+      CALL FILLR(SUMW,3,0.)
+      IDONE=0
+      YMIN=MIN(P2,P3)
+      YMIN=YMIN
+      YMAX=P1
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      YFIRST=INT(YMIN)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB1,MARGT1,ISXMAX,ISYMAX,SD,IBH,YFIRST,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB2,MARGT2,ISXMAX,ISYMAX,SD,IBH,YFIRST,2)
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,MARGL1,MARGR1,
+     +MARGB3,MARGT3,ISXMAX,ISYMAX,SD,IBH,YFIRST,2)
+      CALL CODSTR(SEQ(J1P-LB),IDIM1P,LOGFRQ,LENW3,SUMW(1))
+      CALL CODSTR(SEQ(J1P+1-LB),IDIM1P,LOGFRQ,LENW3,SUMW(2))
+      CALL CODSTR(SEQ(J1P+2-LB),IDIM1P,LOGFRQ,LENW3,SUMW(3))
+      XP=J1
+      YP1=0.
+      YP2=0.
+      YP3=0.
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        IP0=IP
+        IP1=IP+1
+        IP2=IP+2
+        CALL ROTCOD(SEQ,IDIM1P,IP0,LB,LF,LOGFRQ,SUMW(1))
+        CALL ROTCOD(SEQ,IDIM1P,IP1,LB,LF,LOGFRQ,SUMW(2))
+        CALL ROTCOD(SEQ,IDIM1P,IP2,LB,LF,LOGFRQ,SUMW(3))
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          CALL PCOD1(SUMW,Y1,Y2,Y3,RFRAME)
+C         CHECK FOR OVERFLOW SHOWN BY VALUES =0.
+          IF(ABS(Y1).LT.EPS)Y1=YP1
+          IF(ABS(Y2).LT.EPS)Y2=YP2
+          IF(ABS(Y3).LT.EPS)Y3=YP3
+          IPA=IP-I1INM1
+          X=IPA
+          CALL LINE(XP,X,YP1,Y1,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1+IMH,MARGT1-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP2,Y2,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2+IMH,MARGT2-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP3,Y3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3+IMH,MARGT3-2*IMH,ISXMAX,ISYMAX)
+          IF(RFRAME(1).EQ.'*')
+     +    CALL POINT(X,0.,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(RFRAME(2).EQ.'*')
+     +    CALL POINT(X,0.,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+          IF(RFRAME(3).EQ.'*')
+     +    CALL POINT(X,0.,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+C         SAVE CURRENT VALUES
+          YP1=Y1
+          YP2=Y2
+          YP3=Y3
+          XP=X
+        END IF
+300   CONTINUE
+C   STARTS
+      CALL STARTS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+C   STOPS
+      CALL STOPS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL VT100M
+      END
+      SUBROUTINE PCODD1(ANSTY,PAA,FREQ,IDEV,FILNAM,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      REAL FREQ(64)
+      CHARACTER PAA(125),HELPF*(*),FILNAM*(*)
+      INTEGER CHOICE,ANSTY
+      IOK = 1
+      CHOICE = ANSTY
+      CALL YESONO(CHOICE,'Define internal standard',
+     +'Read standard from file',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.0) RETURN
+      ANSTY = CHOICE
+      IF(ANSTY.EQ.1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File name of standard',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL REDCOD(FREQ,IDEV)
+        CLOSE(UNIT=IDEV)
+        CALL WRTCOD(FREQ,KBOUT,PAA)
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE AACODM(KBIN,KBOUT,FREQ,PAA,SUMCOD,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      REAL FREQ(64)
+      CHARACTER PAA(5,5,5)
+      CHARACTER HELPF*(*)
+      INTEGER CHOICE
+      PARAMETER (MAXPRM = 43)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      CHOICE = 1
+      PROMPT(1) = 'Use observed frequencies'
+      PROMPT(2) = 'Normalize to average amino acid composition'
+      PROMPT(3) = 'Normalize to no amino acid bias'
+      CALL RADION('Select normalization',PROMPT,3,CHOICE,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.1) THEN
+        IOK = 1
+        RETURN
+      END IF
+      SUMC = SUMCOD/64.
+      IF(CHOICE.EQ.2) CALL APLAAC(FREQ,PAA)
+      IF(CHOICE.EQ.3) CALL NORMAA(FREQ,SUMC,PAA)
+      IOK = 0
+      END
+      SUBROUTINE APLAAC(SUM,PAA)
+      CHARACTER PAA(5,5,5),AA(21)
+      REAL SUM(4,4,4),EXPERC(21)
+C  ARGOS VALUES
+      SAVE EXPERC,AA
+      DATA EXPERC/8.3,1.7,5.3,6.2,3.9,7.2,2.2,5.2,5.7,
+     +9.0,2.4,4.4,5.1,4.0,5.7,6.9,5.8,6.6,1.3,3.2,0.0/
+      DATA AA/'A','C','D','E','F',
+     1'G','H','I','K','L',
+     2'M','N','P','Q','R',
+     3'S','T','V','W','Y','*'/
+C   LOOP FOR EACH ACID
+      DO 150 L=1,21
+        ASUM=0.
+C   LOOP FOR EACH CODON
+        DO 200 I=1,4
+          DO 200 J=1,4
+            DO 200 K=1,4
+C             COUNT CODONS
+              IF(PAA(K,J,I).EQ.AA(L))ASUM=ASUM+SUM(I,J,K)
+200     CONTINUE
+        IF(ASUM.GT.0.0)ASUM = 6.4 / ASUM
+C   LOOP FOR EACH CODON
+        DO 100 I=1,4
+          DO 100 J=1,4
+            DO 100 K=1,4
+C             NORMALIZE
+        IF(PAA(K,J,I).EQ.AA(L))SUM(I,J,K)=SUM(I,J,K)*ASUM*EXPERC(L)
+100     CONTINUE
+150   CONTINUE
+      END
+      SUBROUTINE PCOD3(FREQ,LOGFRQ,LENW,P11,P22,P33)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (SMALL = 1.0E-30)
+      DOUBLE PRECISION U,V,W,SL,UL,VL,WL,P1,P2,P3
+      REAL FREQ(64),LOGFRQ(64),S2(64),S3(64)
+      CALL F2FF1(FREQ,S2)
+      CALL F3FF1(FREQ,S3)
+C   CALC EXPECTED MEAN SCORES FOR THIS WINDOW LENGTH AND STANDARD
+      U=0.
+      V=0.
+      W=0.
+      DO 62 I=1,64
+        U=U+FREQ(I)*LOGFRQ(I)
+        V=V+S2(I)*LOGFRQ(I)
+        W=W+S3(I)*LOGFRQ(I)
+62    CONTINUE
+      SL=U+V+W
+      SL=SL/3.
+      U=U-SL
+      V=V-SL
+      W=W-SL
+      UL=EXP(U*LENW)
+      VL=EXP(V*LENW)
+      WL=EXP(W*LENW)
+      SL=UL+VL+WL
+      P1=UL/SL
+      P2=VL/SL
+      P3=WL/SL
+      IF(ABS(P1-1.0).GT.SMALL)P11=LOG10(P1/(1.-P1))
+      IF(ABS(P2-1.0).GT.SMALL)P22=LOG10(P2/(1.-P2))
+      IF(ABS(P3-1.0).GT.SMALL)P33=LOG10(P3/(1.-P3))
+      END
+      SUBROUTINE PCOD2(FREQ,LOGFRQ,SUMCOD,LOGFR3,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER PAA(5,5,5)
+      REAL FREQ(64),LOGFRQ(64),LOGFR3(4,4,4)
+C   NORMALIZE
+      DO 51 I=1,64
+        FREQ(I)=FREQ(I)/SUMCOD
+51    CONTINUE
+C   TAKE LOGS
+      DO 52 I=1,64
+        LOGFRQ(I)=0.0
+        IF(FREQ(I).NE.0.)LOGFRQ(I)=LOG(FREQ(I))
+52    CONTINUE
+C   NEED TO DEAL WITH STOP CODONS AND ZERO FREQS
+      ODX=1./SUMCOD
+      ODXL=LOG(ODX)
+      STOPT=0.
+      NSTOP=0
+C   LOOK FOR STOPS IN PAA
+      DO 10 I=1,4
+      DO 10 J=1,4
+      DO 10 K=1,4
+        IF(PAA(K,J,I).EQ.'*')THEN
+          LOGFR3(I,J,K)=99.
+          STOPT=STOPT+99.
+          NSTOP=NSTOP+1
+        END IF
+10    CONTINUE
+C   SET ZEROES TO LOG(1/TOTAL)
+      DO 53 I=1,64
+        IF(LOGFRQ(I).EQ.0.)LOGFRQ(I)=ODXL
+53    CONTINUE
+C   SET STOPS TO MEAN
+      X1=0.
+      DO 54 I=1,64
+        X1=X1+LOGFRQ(I)
+54    CONTINUE
+      IF(NSTOP.NE.64)X1=(X1-STOPT)/(64-NSTOP)
+C   LOOK FOR STOPS IN PAA
+      DO 20 I=1,4
+      DO 20 J=1,4
+      DO 20 K=1,4
+        IF(PAA(K,J,I).EQ.'*')LOGFR3(I,J,K)=X1
+20    CONTINUE
+      END
+      SUBROUTINE PCOD1(SUMW,Y1,Y2,Y3,LINE)
+C   AUTHOR: RODGER STADEN
+      DOUBLE PRECISION SUMWT(3),T,T1
+      REAL SUMW(3)
+      CHARACTER LINE(3)
+C   calc mean of sums to add to each
+      T=SUMW(1)+SUMW(2)+SUMW(3)
+      T=T/3.
+      T=-1.*T
+      DO 80 II=1,3
+C   add mean to each
+        SUMWT(II)=SUMW(II)+T
+C   calc exponential of each
+        SUMWT(II)=EXP(SUMWT(II))
+80    CONTINUE
+C   calc sum of exponentials
+      T=SUMWT(1)+SUMWT(2)+SUMWT(3)
+      DO 298 II=1,3
+C   divide each by sum of exponetials
+        IF(T.NE.0.)SUMWT(II)=SUMWT(II)/T
+        T1=1.-SUMWT(II)
+        IF(T1.NE.0.)SUMWT(II)=SUMWT(II)/T1
+        IF(SUMWT(II).GT.0.)SUMWT(II)=LOG10(SUMWT(II))
+298   CONTINUE
+C   get largest
+      CALL FILLC(LINE,3,' ')
+      T=MAX(SUMWT(1),SUMWT(2),SUMWT(3))
+      DO 10 I=1,3
+        IF(T.EQ.SUMWT(I))LINE(I)='*'
+10    CONTINUE
+      Y1=SUMWT(1)
+      Y2=SUMWT(2)
+      Y3=SUMWT(3)
+      END
+      SUBROUTINE CODSTR(SEQ,IDIM,LOGFRQ,LWIND3,SUM)
+C   AUTHOR: RODGER STADEN
+      REAL LOGFRQ(4,4,4)
+      CHARACTER SEQ(IDIM)
+      INTEGER CTONUM,IVAL(3)
+      EXTERNAL CTONUM
+      SUM=0.
+C   LOOP FOR WINDOW LENGTH
+      DO 100 I=1,LWIND3-1,3
+        IP=I-1
+C   CHECK FOR NULL CHARS
+        DO 10 J=1,3
+          IVAL(J)=CTONUM(SEQ(IP+J))
+          IF(IVAL(J).EQ.5)GO TO 100
+10      CONTINUE
+        SUM=SUM+LOGFRQ(IVAL(1),IVAL(2),IVAL(3))
+100   CONTINUE
+      END
+      SUBROUTINE F2FF1(SUM1,SUM2)
+C   AUTHOR: RODGER STADEN
+      REAL SUM1(4,4,4),SUM2(4,4,4),SXAB(4,4),SCXX(4)
+      DO 1 I=1,4
+      DO 1 J=1,4
+      DO 1 K=1,4
+        SUM2(K,J,I)=0.
+1     CONTINUE
+C     CALC FROM XABCXX ABC IE XAB * CXX
+C   XAB
+      DO 10 J=1,4
+      DO 10 K=1,4
+        SXAB(J,K)=0.
+        DO 10 I=1,4
+          SXAB(J,K)=SXAB(J,K)+SUM1(I,J,K)
+10    CONTINUE
+C   CXX
+      DO 20 I=1,4
+        SCXX(I)=0.
+        DO 20 J=1,4
+        DO 20 K=1,4
+          SCXX(I)=SCXX(I)+SUM1(I,J,K)
+20    CONTINUE
+C   SUM2= XAB * CXX
+      DO 30 I=1,4
+      DO 30 J=1,4
+      DO 30 K=1,4
+        SUM2(I,J,K)=SXAB(I,J)*SCXX(K)
+30    CONTINUE
+      END
+C    F3FF1
+      SUBROUTINE F3FF1(SUM1,SUM3)
+C   AUTHOR: RODGER STADEN
+      REAL SUM1(4,4,4),SUM3(4,4,4),SXXA(4),SBCX(4,4)
+      DO 1 I=1,4
+      DO 1 J=1,4
+      DO 1 K=1,4
+        SUM3(K,J,I)=0.
+1     CONTINUE
+C   CALC ABC FROM XXABCXX  IE XXA * BCX
+C   XXA
+      DO 10 K=1,4
+        SXXA(K)=0.
+        DO 10 I=1,4
+        DO 10 J=1,4
+          SXXA(K)=SXXA(K)+SUM1(I,J,K)
+10    CONTINUE
+C   BCX
+      DO 20 I=1,4
+      DO 20 J=1,4
+        SBCX(I,J)=0.
+        DO 20 K=1,4
+          SBCX(I,J)=SBCX(I,J)+SUM1(I,J,K)
+20    CONTINUE
+C  XXA * BCX
+      DO 30 I=1,4
+      DO 30 J=1,4
+      DO 30 K=1,4
+        SUM3(I,J,K)=SXXA(I)*SBCX(J,K)
+30    CONTINUE
+      END
+C   THIS ROUTINE DOES NOT CHECK FOR GOING OUT OF ARRAY
+      SUBROUTINE ROTCOD(SEQ,IDIM1,I,LB,LF,LOGFRQ,H)
+C   AUTHOR: RODGER STADEN
+      REAL LOGFRQ(4,4,4)
+      CHARACTER SEQ(IDIM1)
+      INTEGER CTONUM,IVAL(3)
+      EXTERNAL CTONUM
+      XS=0.
+      XA=0.
+C   LOOK BACK
+      IPB=I-LB-1
+      DO 10 J=1,3
+        IVAL(J)=CTONUM(SEQ(IPB+J))
+        IF(IVAL(J).EQ.5)GO TO 30
+10    CONTINUE
+C   SET VALUE TO SUBTRACT
+      XS=LOGFRQ(IVAL(1),IVAL(2),IVAL(3))
+C   NOW LOOK FORWARD
+30    CONTINUE
+      IPB=I+LF-1
+      DO 45 J=1,3
+        IVAL(J)=CTONUM(SEQ(IPB+J))
+        IF(IVAL(J).EQ.5)GO TO 50
+45    CONTINUE
+C   OK SO USE
+      XA=LOGFRQ(IVAL(1),IVAL(2),IVAL(3))
+50    H=H-XS+XA
+      RETURN
+      END
+C    PSRCH
+      SUBROUTINE PSRCH(SEQ,IDIM1,J1,J2,STRING,IDIMS,INC,
+     +XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,Y,BH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIMS)
+      YP=Y+BH
+      IAT=J1-INC
+10    CONTINUE
+      IAT=IAT+INC
+      IDIM=J2-IAT
+      IF(IDIM.GT.0)THEN
+        CALL FIND6(SEQ(IAT),IDIM,STRING,IDIMS,INC,JMATCH)
+        IF(JMATCH.NE.0)THEN
+          IAT=IAT+JMATCH-1
+          X=IAT
+          CALL LINE(X,X,YP,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE PBASEP(SEQ,IDIM1P,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,IENDB,
+     +MARGL1,MARGR1,MARGB,MARGT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,IDEV,FILNAM,DIALOG)
+      CHARACTER HELPF*(*),PAA(5,5,5),FILNAM*(*)
+C   AUTHOR: RODGER STADEN
+C 11-1-91 Added facility to read standard in the form of codon table
+C         and greatly reorganised code! and added new routines.
+      REAL EXPEC(5,3),SCORES(5,3),WT(3),S(3)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER ANSTY,ANSABS,ANSS,DIALOG
+C  ARGOS VALUES
+      SAVE EXPEC
+       DATA EXPEC/0.1768,0.2108,0.2767,0.3357,0.,
+     + 0.2707,0.2378,0.3097,0.1818,0.,
+     + 0.2506,0.2506,0.2396,0.2592,0./
+C  ORIGINAL DAYHOFF VALUES
+C      DATA EXPEC/0.1835,0.1932,0.2719,0.3514,0.,
+C     10.2382,0.2459,0.3173,0.1985,0.,
+C     20.2548,0.2548,0.2377,0.2527,0./
+C   NOTE J1 J2 IS THE ACTIVE REGION, KSTART IS THE NUMBER OF THE 
+C   BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C   TO THE END OF SEQ. PLOTS ARE FROM J1 TO J2 BUT STANDARDS CAN
+C   BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+      CALL SHOWFU(KBOUT,
+     +'Positional base preferences method to find protein genes')
+      CALL GETPAR(43,9,IOK,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,
+     +ANSABS,ANSTY,ANSS,
+     +IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL PBASPD(SEQ,IDIM1P,MXSPAN,
+     +KSTART,IENDB,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,
+     +ANSTY,ANSABS,ANSS,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,
+     +EXPEC,SCORES,WT,S,SMIN,SMAX,DIALOG,PAA,IDEV,FILNAM,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL PBASPP(SEQ,IDIM1P,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,ANSABS,LENW,IWRIT,
+     +EXPEC,SCORES,WT,S,SMIN,SMAX,
+     +MARGL1,MARGR1,MARGB,MARGT,PAA)
+      END
+      SUBROUTINE PBASPD(SEQ,IDIM1P,MXSPAN,
+     +KSTART,IENDB,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,
+     +ANSTY,ANSABS,ANSS,MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,
+     +EXPEC,SCORES,WT,S,SMIN,SMAX,DIALOG,PAA,IDEV,FILNAM,IOK)
+C   AUTHOR: RODGER STADEN
+      REAL EXPEC(5,3),SCORES(5,3),WT(3),S(3)
+      CHARACTER SEQ(IDIM1P)
+      INTEGER ANSTY,ANSABS,ANSS,CHOICE,DIALOG
+      REAL FREQ(64)
+      CHARACTER PAA(125),HELPF*(*),FILNAM*(*)
+      PARAMETER (MAXPRM = 28)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      IOK = 1
+      DO 30 J=1,3
+      WT(J)=1.0
+      DO 30 I=1,5
+        SCORES(I,J)=EXPEC(I,J)
+30    CONTINUE
+      IF(DIALOG.EQ.0) THEN
+        CALL PBEXP(SCORES,WT,S)
+        CALL PBSCAL(SMIN,SMAX,S,ANSABS,LENW,SMIN5,SMAX5,DIF)
+        IOK = 0
+        RETURN
+      END IF
+C
+C  Select source of standard
+C
+      PROMPT(1) = 'Use global standard'
+      PROMPT(2) = 'Use internal standard'
+      PROMPT(3) = 'Use codon usage table'
+      IOK = 1
+      CHOICE = 1
+      CALL RADION('Select standard source',PROMPT,3,CHOICE,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.1) RETURN
+      ANSTY = CHOICE
+      IF(ANSTY.EQ.3) THEN
+C
+C  Get standard from a codon table
+C
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File name of standard',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL REDCOD(FREQ,IDEV)
+        CLOSE(UNIT=IDEV)
+        CALL WRTCOD(FREQ,KBOUT,PAA)
+        CALL GETPBS(FREQ,SCORES)
+      ELSE IF(ANSTY.EQ.2) THEN
+C
+C  Get standard from a section of the sequence
+C
+        CALL PBD1(SEQ,IDIM1P,MXSPAN,KSTART,IENDB,KBIN,KBOUT,
+     +  SCORES,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(ANSTY.NE.1) THEN
+C
+C  Allow combination with global standard
+C
+        PROMPT(1) = 'Use observed frequencies'
+        PROMPT(2) = 'Combine with global standard'
+        IOK = 1
+        CHOICE = 1
+        CALL RADION('Select normalisation',PROMPT,2,CHOICE,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(CHOICE.LT.1) RETURN
+        IF(CHOICE.EQ.2)THEN
+          DO 36 I=1,2
+          DO 36 K=1,4
+            SCORES(K,I)=EXPEC(K,I)
+36        CONTINUE
+        END IF
+      END IF
+      WRITE(KBOUT,1004)
+1004  FORMAT('           T      C      A      G      Range')
+      DO 4 I=1,3
+        BIG=0.0
+        SMALL=99999.0
+        DO 5 K=1,4
+          BIG = MAX(SCORES(K,I),BIG)
+          SMALL = MIN(SCORES(K,I),SMALL)
+5       CONTINUE
+        RANGE=BIG-SMALL
+        WRITE(KBOUT,1005)I,(SCORES(K,I),K=1,4),RANGE
+1005    FORMAT('     ',I3,4(1X,F6.3),1X,F6.3)
+4     CONTINUE
+C
+C  Allow different weights for each position
+C
+      IOK = 1
+      CHOICE = 0
+      CALL YESONO(CHOICE,
+     +'Use 1.0 for positional weights',
+     +'Change positional weights',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.0) RETURN
+      IOK = 0
+      IF(CHOICE.EQ.1)  CALL PBPD2(WT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL PBEXP(SCORES,WT,S)
+        WRITE(KBOUT,1001)S
+1001    FORMAT(
+     +  ' Expected scores per codon in each frame',/,'   ',3F10.3)
+        CALL GSPIN(MINSP,MAXSP,LENW,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+C
+C  Select relative or absolute values
+C
+        IOK = 1
+        CHOICE = ANSABS
+        CALL YESONO(CHOICE,
+     +  'Plot relative scores',
+     +  'Plot absolute scores',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(CHOICE.LT.0) RETURN
+        ANSABS = CHOICE
+      CALL PBSCAL(SMIN,SMAX,S,ANSABS,LENW,SMIN5,SMAX5,DIF)
+      WRITE(KBOUT,1002)SMIN,SMAX,DIF
+1002  FORMAT(' Scaling values:',/,
+     +'    Minimum  maximum    range',/,2X,3F9.4)
+C
+C  Allow plot scaling
+C
+      IOK = 1
+      CHOICE = ANSS
+      CALL YESONO(CHOICE,
+     +'Leave scaling values unchanged',
+     +'Change scaling values',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.0) RETURN
+      ANSS = CHOICE
+      IF(ANSS.EQ.1) THEN
+        CALL GETRL(SMIN5,SMAX5,SMIN,'Minimum',VALUE,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        SMIN = VALUE
+        CALL GETRL(SMIN,SMAX5,SMAX,'Maximum',VALUE,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        SMAX = VALUE
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE PBSCAL(SMIN,SMAX,S,ANSABS,LENW,SMIN5,SMAX5,DIF)
+C   AUTHOR: RODGER STADEN
+      REAL S(3)
+      INTEGER ANSABS
+      SM=0.
+      IF(ANSABS.EQ.0) THEN
+        DO 25 I=1,3
+          SM=SM+S(I)
+25      CONTINUE
+        DO 26 I=1,3
+          S(I)=S(I)/SM
+26      CONTINUE
+      ELSE
+        DO 27 I=1,3
+          S(I)=S(I)*LENW
+27      CONTINUE
+      END IF
+      SMAX=MAX(S(1),S(2),S(3))
+      SMIN=MIN(S(1),S(2),S(3))
+      DIF=SMAX-SMIN
+      D20=DIF*0.2
+      SMAX=SMAX+D20
+      SMIN=SMIN-D20
+      D50 = DIF*0.5
+      SMAX5 = SMAX + D50
+      SMIN5 = SMIN - D50
+      END
+      SUBROUTINE PBPD2(WT,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      REAL WT(3)
+      CHARACTER HELPF*(*)
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Give weights between 0.0 and 1.0',/,
+     +  ' to each of the 3 codon positions')
+        WMIN = 0.
+        WMAX = 1.
+        DEF = 1.
+9       CONTINUE
+        CALL GETRL(WMIN,WMAX,DEF,'Position 1',VALUE,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        WT(1) = VALUE
+        CALL GETRL(WMIN,WMAX,DEF,'Position 2',VALUE,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        WT(2) = VALUE
+        CALL GETRL(WMIN,WMAX,DEF,'Position 3',VALUE,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        WT(3) = VALUE
+C   CHECK FOR ZERO
+        X=WT(1)+WT(2)+WT(3)
+        IF(X.LT.0.0000001)THEN
+         WRITE(KBOUT,*)
+     +   'All weights zero, at least one must be non-zero'
+         GO TO 9
+        END IF
+      IOK = 0
+      END
+      SUBROUTINE PBD1(SEQ,IDIM1P,MXSPAN,KSTART,IENDB,KBIN,KBOUT,
+     +SCORES,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER SEQ(IDIM1P)
+      REAL SCORES(5,3)
+      CHARACTER HELPF*(*)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C       NOTE KSTART IS THE NUMBER OF THE 
+C       BASE IN ELEMENT 1 OF SEQ, AND IDIM1 IS THE LENGTH FROM KSTART
+C       STANDARDS CAN
+C       BE TAKEN FROM KSTART TO KSTART+IDIM1-1 = IENDB
+        I1INM1=1+(MXSPAN/2)-KSTART+1
+        IPASS = 0
+11      CONTINUE
+        IPASS = IPASS + 1
+        K1 = 0
+        K2 = 0
+        KS1 = KSTART-1
+        CALL GTREG(KBIN,KBOUT,KS1,IENDB,K1,K2,
+     +  'Define region for standard',
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF((K2-K1).LT.191) THEN
+          CALL ERROM(KBOUT,'At least 64 codons required in standard')
+          IF(IPASS.LT.2) GO TO 11
+          IOK = 1
+          RETURN
+        END IF
+        DO 1 I=1,5
+          DO 1 J=1,3
+            SCORES(I,J)=0.
+1         CONTINUE
+          NELTS=0
+          DO 2 I=K1,K2-2,3
+            NELTS=NELTS+1
+            IS=CTONUM(SEQ(I+I1INM1))  
+            IS1=CTONUM(SEQ(I+I1INM1+1))
+            IS2=CTONUM(SEQ(I+I1INM1+2))
+            SCORES(IS,1)=SCORES(IS,1)+1.
+            SCORES(IS1,2)=SCORES(IS1,2)+1.
+            SCORES(IS2,3)=SCORES(IS2,3)+1.
+2         CONTINUE
+          DO 3 I=1,5
+          DO 3 J=1,3
+            SCORES(I,J)=SCORES(I,J)/NELTS
+3         CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE GETPBS(SUM,COMP)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4),COMP(5,3)
+      TCOMP=0.
+      DO 50 I=1,5
+      DO 50 J=1,3
+        COMP(I,J)=0.
+50    CONTINUE
+      DO 55 J=1,4
+      DO 55 K=1,4
+      DO 55 L=1,4
+        TCOMP=TCOMP+SUM(J,K,L)
+        COMP(J,1)=COMP(J,1)+SUM(J,K,L)
+        COMP(J,2)=COMP(J,2)+SUM(K,J,L)
+        COMP(J,3)=COMP(J,3)+SUM(K,L,J)
+55    CONTINUE
+      IF(TCOMP.NE.0) THEN
+        DO 60 I=1,4
+        DO 60 J=1,3
+          COMP(I,J)=COMP(I,J)/TCOMP
+60      CONTINUE
+      END IF
+      END
+      SUBROUTINE PBASPP(SEQ,IDIM1P,MXSPAN,
+     +ISXMAX,ISYMAX,J1,J2,KSTART,ANSABS,LENW,IWRIT,
+     +EXPEC,SCORES,WT,S,SMIN,SMAX,
+     +MARGL1,MARGR1,MARGB,MARGT,PAA)
+      CHARACTER PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      REAL SUMW(3),PROB(3),EXPEC(5,3),SCORES(5,3),WT(3),S(3)
+      CHARACTER SEQ(IDIM1P)
+      CHARACTER TLINE(3),STAR
+      INTEGER ANSABS
+      SAVE STAR
+      DATA STAR/'*'/
+      CALL FILLR(SUMW,3,0.)
+      IDONE=0
+      IMARG=NINT(MARGT/3.)
+      MARGB1=MARGB
+      MARGB2=MARGB+IMARG
+      MARGB3=MARGB+2*IMARG
+      MARGT3=IMARG
+      MARGT2=IMARG
+      MARGT1=IMARG
+      I1IN=2+(MXSPAN/2)-KSTART+1
+      I1INM1=I1IN-1
+      IDIMJ=J2-J1+1.
+      J1P=J1+I1INM1
+      J2P=J2+I1INM1
+      IMH=NINT(0.005*ISYMAX)
+      LB=1+LENW/2
+      LB=3*LB
+      LF=LENW/2
+      LF=3*LF
+      LENW3=3*LENW
+      YMIN=SMIN
+      YMAX=SMAX
+      XMIN=J1
+      XMAX=J2
+C   set initial values
+      XP=J1
+      YMID=YMIN+(YMAX-YMIN)/2.
+      YP1=YMID
+      YP2=YMID
+      YP3=YMID
+      CALL CLEARV
+      CALL VECTOM
+      XMAX1=XMIN+(XMAX-XMIN)/80.
+      CALL LINE(XMIN,XMAX1,S(1),S(1),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(2),S(2),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(3),S(3),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(1),S(1),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(2),S(2),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(3),S(3),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(1),S(1),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(2),S(2),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX1,S(3),S(3),XMAX,XMIN,YMAX,YMIN,
+     +MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+      CALL PBASES(SEQ(J1P-LB),IDIM1P,LENW3,SUMW(1),SCORES,WT)
+      CALL PBASES(SEQ(J1P+1-LB),IDIM1P,LENW3,SUMW(2),SCORES,WT)
+      CALL PBASES(SEQ(J1P+2-LB),IDIM1P,LENW3,SUMW(3),SCORES,WT)
+      DO 300 IP=J1P,J2P,3
+        IDONE=IDONE+1
+        IP0=IP
+        IP1=IP+1
+        IP2=IP+2
+        CALL ROTPB1(SEQ,IDIM1P,IP0,LB,LF,SUMW(1),SCORES,WT)
+        CALL ROTPB1(SEQ,IDIM1P,IP1,LB,LF,SUMW(2),SCORES,WT)
+        CALL ROTPB1(SEQ,IDIM1P,IP2,LB,LF,SUMW(3),SCORES,WT)
+        IF(MOD(IDONE,IWRIT).EQ.0)THEN
+          PROB(1)=SUMW(1)
+          PROB(2)=SUMW(2)
+          PROB(3)=SUMW(3)
+          IF(ANSABS.EQ.0) THEN
+            T=SUMW(1)+SUMW(2)+SUMW(3)
+            PROB(1)=PROB(1)/T
+            PROB(2)=PROB(2)/T
+            PROB(3)=PROB(3)/T
+          END IF
+          CALL FILLC(TLINE,3,' ')
+          T=MAX(PROB(1),PROB(2),PROB(3))
+          DO 303 II=1,3
+            IF(PROB(II).EQ.T)TLINE(II)=STAR
+303       CONTINUE
+          IPA=IP-I1INM1
+          X=IPA
+          CALL LINE(XP,X,YP1,PROB(1),XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1+IMH,MARGT1-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP2,PROB(2),XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2+IMH,MARGT2-2*IMH,ISXMAX,ISYMAX)
+          CALL LINE(XP,X,YP3,PROB(3),XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3+IMH,MARGT3-2*IMH,ISXMAX,ISYMAX)
+          YP1=PROB(1)
+          YP2=PROB(2)
+          YP3=PROB(3)
+          IF(TLINE(1).EQ.STAR)CALL POINT(
+     +    X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(TLINE(2).EQ.STAR)CALL POINT(
+     +    X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB2,MARGT2,ISXMAX,ISYMAX)
+          IF(TLINE(3).EQ.STAR)CALL POINT(
+     +    X,YMID,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB3,MARGT3,ISXMAX,ISYMAX)
+          XP=X
+        END IF
+300   CONTINUE
+      CALL STARTS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL STOPS(SEQ,IDIM1P,J1P,J2P,
+     +MARGL1,MARGR1,MARGB,MARGT,ISXMAX,ISYMAX,PAA)
+      CALL VT100M
+      END
+      SUBROUTINE PBASES(SEQ,IDIM,LENW,SUM,SCORES,WT)
+C   AUTHOR: RODGER STADEN
+      REAL SUM
+      CHARACTER SEQ(IDIM)
+      REAL SCORES(5,3),WT(3)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SUM=0.
+      DO 100 I=1,LENW,3
+        SUM=SUM+SCORES(CTONUM(SEQ(I)),1)*WT(1)+
+     +  SCORES(CTONUM(SEQ(I+1)),2)*WT(2)
+     +  +SCORES(CTONUM(SEQ(I+2)),3)*WT(3)
+100   CONTINUE
+      RETURN
+      END
+      SUBROUTINE ROTPB1(SEQ,IDIM1,I,LB,LF,H,SCORES,WT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      INTEGER CTONUM
+      REAL SCORES(5,3),WT(3)
+      EXTERNAL CTONUM
+C   set up dummy values
+      XS=0.
+      XA=0.
+C   look back
+      IPB=I-LB-1
+      DO 10 J=1,3
+        IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 30
+10    CONTINUE
+C   set value to subtract
+      IPB=IPB+1
+      XS=SCORES(CTONUM(SEQ(IPB)),1)*WT(1)+
+     +SCORES(CTONUM(SEQ(IPB+1)),2)*WT(2)
+     ++SCORES(CTONUM(SEQ(IPB+2)),3)*WT(3)
+C   now look forward
+30    CONTINUE
+      IPB=I+LF-1
+C   look for bad char
+      DO 45 J=1,3
+        IF(CTONUM(SEQ(IPB+J)).EQ.5)GO TO 50
+45    CONTINUE
+C   ok so use
+      IPB=IPB+1
+      XA=SCORES(CTONUM(SEQ(IPB)),1)*WT(1)+
+     +SCORES(CTONUM(SEQ(IPB+1)),2)*WT(2)
+     ++SCORES(CTONUM(SEQ(IPB+2)),3)*WT(3)
+50    H=H-XS+XA
+      END
+      SUBROUTINE PBEXP(SCORES,WT,S)
+C   AUTHOR: RODGER STADEN
+      REAL SCORES(5,3),S(3),WT(3)
+      DO 50 I=1,3
+        S(I)=0.
+50    CONTINUE
+      DO 60 I=1,4
+        S(1)=S(1)+SCORES(I,1)*SCORES(I,1)*WT(1)
+     +  +SCORES(I,2)*SCORES(I,2)*WT(2)
+     +  +SCORES(I,3)*SCORES(I,3)*WT(3)
+        S(2)=S(2)+SCORES(I,2)*SCORES(I,1)*WT(1)
+     +  +SCORES(I,3)*SCORES(I,2)*WT(2)
+     +  +SCORES(I,1)*SCORES(I,3)*WT(3)
+        S(3)=S(3)+SCORES(I,3)*SCORES(I,1)*WT(1)
+     +  +SCORES(I,1)*SCORES(I,2)*WT(2)
+     +  +SCORES(I,2)*SCORES(I,3)*WT(3)
+60    CONTINUE
+      END
+      SUBROUTINE TRNA(SEQ,IDIM,ISS,ISE,KSTART,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER CONSC(18)
+      INTEGER ANSCON,DIALOG
+      CALL SHOWFU(KBOUT,'tRNA search ')
+      CALL GETPAR(49,35,IOK,
+     +MNLEN,MXLEN,MAXLNT,
+     +MNAC,MXAC,MINACL,
+     +MNTU,MXTU,MINTU,
+     +MNAN,MXAN,MINANT,
+     +MND,MXD,MIND,
+     +MNI1,MXI1,INT1,
+     +MNI2,MXI2,JNT2,
+     +MNTUL1,MXTUL1,LTUMIN,
+     +MNTUL2,MXTUL2,LTUMAX,
+     +MNB,MXB,MINBAS,
+     +MNC,MXC,MINCON,IGON,ANSCON)
+      IF(IOK.NE.0) RETURN
+      IF(DIALOG.EQ.1) CALL TRNAD(
+     +MNLEN,MXLEN,MAXLNT,
+     +MNAC,MXAC,MINACL,
+     +MNTU,MXTU,MINTU,
+     +MNAN,MXAN,MINANT,
+     +MND,MXD,MIND,
+     +MNI1,MXI1,INT1,
+     +MNI2,MXI2,JNT2,
+     +MNTUL1,MXTUL1,LTUMIN,
+     +MNTUL2,MXTUL2,LTUMAX,
+     +MNB,MXB,MINBAS,
+     +MINCON,IGON,ANSCON,
+     +CONSC,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL TRNAP(SEQ,IDIM,ISS,ISE,KSTART,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,KBOUT,
+     +MAXLNT,MINACL,MINTU,MINANT,MIND,INT1,JNT2,
+     +LTUMIN,LTUMAX,MINBAS,MINCON,IGON,ANSCON,CONSC)
+      CALL VT100M
+      END
+      SUBROUTINE TRNAP(SEQ,IDIM,ISS,ISE,KSTART,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,KBOUT,
+     +MAXLNT,MINACL,MINTU,MINANT,MIND,INT1,JNT2,
+     +LTUMIN,LTUMAX,MINBAS,MINCON,IGON,ANSCON,CONSC)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER RAM,RAN,RTU
+      INTEGER SCORES(5,5)
+      INTEGER CONSC(18),TOTCON,CTONUM
+      INTEGER ANSCON
+      EXTERNAL CTONUM
+      SAVE SCORES
+      DATA SCORES/25*0/
+      SCORES(1,3)=2
+      SCORES(3,1)=2
+      SCORES(1,4)=1
+      SCORES(4,1)=1
+      SCORES(2,4)=2
+      SCORES(4,2)=2
+      CALL CLEARV
+      CALL BUSY(KBOUT)
+      IF(IGON.EQ.0)THEN
+        XMIN=ISS
+        XMAX=ISE
+C       ymin= half stems base-paired, ymax= all stems base-paired
+        YMIN=18.
+        YMAX=42.
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+C     loop for all stem start points
+      DO 300 N=ISS,ISE
+        IS=N+60+INT1
+        IE=N+MAXLNT+JNT2
+        IF(IE.GT.ISE)IE=ISE
+        IF(IE.LT.IS)RETURN
+C       STEM LEFT END IS N,RIGHT END RANGES FROM IS TO IE
+C       LOOP FOR ALL POSITIONS THIS STEM START POINT
+        DO 200 I=IS,IE
+C         SET UP POINTER TO LEFT STEM
+          LAM=N-1
+C         SET IANALL TO 0 AS FLAG FOR RE-ENTRY TO ANLOOP
+          IANALL=0
+          ISUM=0
+C         LOOP FOR STEM LENGTH 7
+          DO 100 J=1,7
+C           POINTER TO STEM LEFT END
+            LAM=LAM+1
+C           RIGHT HAND POINTER (MOVES BACKWARDS)
+            RAM=I-J
+            ISUM=ISUM+SCORES(
+     +      CTONUM(SEQ(LAM)),CTONUM(SEQ(RAM)))
+100       CONTINUE
+          IF(ISUM.LT.MINACL)GO TO 200
+C         SET TULOOP START POINT
+          MTU=0
+140       CONTINUE
+          IANALL=0
+          CALL TULOOP(SEQ,IDIM,RAM,LTUMIN,
+     +    LTUMAX,MINTU,ISUMTU,SCORES,RTU,LTU,MTU)
+          IF(ISUMTU.LT.MINTU)GO TO 200
+150       CONTINUE
+          CALL ANLOOP(SEQ,IDIM,MINANT,ISUMAN,N,
+     +    RTU,SCORES,JNT2,RAN,LAN,IANALL,INT1)
+          IF(ISUMAN.LT.MINANT)GO TO 140
+          CALL DLOOP(SEQ,IDIM,ISUMD,LAM,LAN,SCORES)
+          IF(ISUMD.GE.MIND)THEN
+C         HIGH ENOUGH OVERALL SCORE?
+            ITOTBS=ISUM+ISUMTU+ISUMAN+ISUMD
+            IF(ITOTBS.GE.MINBAS)THEN
+C             FILTER ON CONSERVED BASES
+              IF(ANSCON.EQ.0)THEN
+                CALL CONSCR(SEQ,IDIM,N,
+     +          LAN,RTU,CONSC,TOTCON)
+                IF(TOTCON.LT.MINCON)
+     +          GO TO 150
+              END IF
+              ISUMD=ISUMD-MIND
+              ISUMAN=ISUMAN-MINANT
+              IF(IGON.EQ.0)THEN
+                X=N
+                Y=ITOTBS
+                CALL LINE(X,X,0.,Y,XMAX,XMIN,
+     +          YMAX,YMIN,MARGL,MARGR,MARGB,
+     +          MARGT,ISXMAX,ISYMAX)
+              END IF
+              IF(IGON.EQ.1) CALL TRNADR(SEQ,IDIM,I,N,LAN,
+     +        RAN,LTU,RTU,IDEV,KSTART)
+C             HAVE ALL POSSIBLE ANTICODONS BEEN TRIED?
+            END IF
+          END IF
+          GO TO 150
+200     CONTINUE
+300   CONTINUE
+      END
+      SUBROUTINE TRNAD(
+     +MNLEN,MXLEN,MAXLNT,
+     +MNAC,MXAC,MINACL,
+     +MNTU,MXTU,MINTU,
+     +MNAN,MXAN,MINANT,
+     +MND,MXD,MIND,
+     +MNI1,MXI1,INT1,
+     +MNI2,MXI2,JNT2,
+     +MNTUL1,MXTUL1,LTUMIN,
+     +MNTUL2,MXTUL2,LTUMAX,
+     +MNB,MXB,MINBAS,
+     +MINCON,IGON,ANSCON,
+     +CONSC,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER CONSC(18),ANSCON
+      CHARACTER HELPF*(*)
+      IOK = 1
+      CALL GETINT(MNLEN,MXLEN,MAXLNT,
+     +'Maximum trna length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXLNT = IVAL
+      CALL GETINT(MNAC,MXAC,MINACL,
+     +'Aminoacyl stem score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINACL = IVAL
+      CALL GETINT(MNTU,MXTU,MINTU,
+     +'Tu stem score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINTU = IVAL
+      CALL GETINT(MNAN,MXAN,MINANT,
+     +'Anticodon stem score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINANT = IVAL
+      CALL GETINT(MND,MXD,MIND,
+     +'D stem score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MIND = IVAL
+      MNB = MIND + MINTU + MINACL + MINANT
+C      MXB = MXD + MXTU + MXAC + MNAN
+      MINBAS = MNB
+      CALL GETINT(MNB,MXB,MINBAS,
+     +'Minimum base pairing total',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINBAS = IVAL
+      CALL GETINT(MNI1,MXI1,INT1,
+     +'Minimum intron length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      INT1 = IVAL
+      MNI2 = INT1
+      JNT2 = INT1
+C      IF(INT1.GT.0) THEN
+        CALL GETINT(MNI2,MXI2,JNT2,
+     +  'Maximum intron length',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        JNT2 = IVAL
+C      END IF
+      CALL GETINT(MNTUL1,MXTUL1,LTUMIN,
+     +'Minimum length for TU loop',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LTUMIN = IVAL
+      MNTUL2 = LTUMIN
+      CALL GETINT(MNTUL2,MXTUL2,LTUMAX,
+     +'Maximum length for TU loop',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LTUMAX = IVAL
+C   conserved bases
+      CALL YESONO(ANSCON,
+     +'Search for conserved bases',
+     +'Skip search for conserved bases',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSCON.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      IOK = 1
+      IF(ANSCON.EQ.0)CALL CONSET(CONSC,MINCON,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(MINCON.LT.0)RETURN
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      IOK = 0
+      END
+C
+C     conset
+      SUBROUTINE CONSET(CONSC,MINCON,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER CONSC(18),NUM(18),CONSER(18)
+      CHARACTER CONBAS(18)
+      CHARACTER STRING*35
+      SAVE NUM,CONSER,CONBAS
+      DATA NUM/8,10,11,14,15,21,32,33,37,48,53,54,55,56,57,
+     +58,60,61/
+      DATA CONSER/100,95,96,100,100,97,100,98,91,100,100,
+     +95,97,100,100,100,92,100/
+      DATA CONBAS/'T','G','Y','A','R','A','Y','T','A','Y',
+     +'G','T','T','C','R','A','Y','C'/
+C   SET FLAG FOR RETURN
+      MINCON=-9
+      ITOT = 0
+      MN = 0
+      MX = 100
+      MNT = 0
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Give a score for each base, then a minimum',
+     +' total at the end')
+      DO 10 J=1,18
+        STRING = ' '
+        WRITE(STRING,1000)NUM(J),CONBAS(J),CONSER(J)
+        CALL GETINT(MN,MX,MN,
+     +  STRING,
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CONSC(J) = IVAL
+1000    FORMAT( 'Base ',I2,', ',A1,' is ',I3,'% conserved.',
+     +  ' Score')
+        ITOT = ITOT + CONSC(J)
+10    CONTINUE
+      CALL GETINT(MNT,ITOT,MNT,
+     +'Minimum total conserved base score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINCON = IVAL
+      END
+      SUBROUTINE CONSCR(SEQ,IDIM,N,LAN,RTU,CONSC,TOTCON)
+C   AUTHOR: RODGER STADEN
+C bug fix 6-8-92: previously did not allow for case!
+      CHARACTER SEQ(IDIM)
+      INTEGER CONSC(18),NUMS(18),TOTCON,RTU,B,CHAR1(18),CHAR2(18)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE CHAR1,CHAR2,NUMS
+      DATA CHAR1/1,4,2,3,3,3,2,1,3,2,4,1,1,2,3,3,2,2/
+      DATA CHAR2/1,4,1,3,4,3,1,1,3,1,4,1,1,2,4,3,1,2/
+      DATA NUMS/7,9,10,13,14,-10,1,2,6,-5,0,1,2,3,4,5,7,8/
+C      DATA CHAR1/'T','G','C','A','A','A','C','T','A','C','G',
+C     +           'T','T','C','A','A','C','C'/
+C      DATA CHAR2/'T','G','T','A','G','A','T','T','A','T','G',
+C     +           'T','T','C','G','A','T','C'/
+C
+C   test those from left end
+      TOTCON=0
+      DO 10 J=1,5
+        K=N+NUMS(J)
+        B = CTONUM(SEQ(K))
+        IF((B.EQ.CHAR1(J)).OR.(B.EQ.CHAR2(J)))
+     +  TOTCON=TOTCON+CONSC(J)
+10    CONTINUE
+C   those from anticodon
+      DO 20 J=6,9
+        K=LAN+NUMS(J)
+        B = CTONUM(SEQ(K))
+        IF((B.EQ.CHAR1(J)).OR.(B.EQ.CHAR2(J)))
+     +  TOTCON=TOTCON+CONSC(J)
+20    CONTINUE
+C   THOSE FROM TU LOOP
+      DO 30 J=10,18
+        K=RTU+NUMS(J)
+        B = CTONUM(SEQ(K))
+        IF((B.EQ.CHAR1(J)).OR.(B.EQ.CHAR2(J)))
+     +  TOTCON=TOTCON+CONSC(J)
+30    CONTINUE
+      END
+      SUBROUTINE ANLOOP(SEQ,IDIM,MINANT,ISUMAN,N,RTU,SCORES,JNT2,
+     +RAN,LAN,IANALL,INT1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER SCORES(5,5),RTU,RAN,CTONUM
+      EXTERNAL CTONUM
+C   NEED TO TEST FLAG FOR MULTIPLE ENTRY TO THIS SUBROUTINE
+C   WHEN SEARCHING FOR BEST ANTICODON.  IANALL =0 MEANS NO
+C   POSITIONS TRIED FOR THIS START, IANALL=1 MEANS SOME TRIED
+C   MIN DIST FROM N TO FIRST BASE IS 6+2+3+4+3+1=19
+C   MAX IS TUSTEM-3=RTU-8
+C   ALLOWING 3 IN VARIABLE LOOP
+      IAS=N+19
+C     allow 2 in variable loop to be sure
+      IEND=RTU-7
+C     restrict max d loop size to 17
+      IAE=N+36
+      IF(IAE.GT.IEND)IAE=IEND
+C   LOOP FOR ALL LEFT STEM STARTS
+C   NN IS EFFECTIVELY THE LEFT STEM START
+C   RIGHT STEM RANGE FROM NEAREST TO NEAREST + INTRON
+C   IE NN+16 TO NN+16+INTRON
+C   IAS TO IAE DEFINES THE REGION TO THRU  TRYING ALL START
+C   POSITIONS AND ALL INTRON LENGTHS
+C   TEST FLAG FOR MULTIPLE ENTRY. IF SET START SEARCHING 1 ON FROM
+C   LAST ANTICODON
+      IF(IANALL.EQ.1)IAS=LAN-3
+C   reset flag
+      IANALL=1
+      ISUMAN=0
+      IF(IAE.LT.IAS)RETURN
+      DO 750 NN=IAS,IAE
+        IIS=NN+16+INT1
+        IIE=NN+16+JNT2
+        IF(IIE.GT.IEND)IIE=IEND
+        IF(IIE.LT.IIS)RETURN
+C
+C   try all positions for the right stem from this left stem start
+C
+        DO 740 I=IIS,IIE
+C
+C   will only try one position if intron=0
+C   or intron max=intron min
+          LAN=NN-1
+          ISUMAN=0
+          RAN=I+1
+C
+C   loop for stem
+C
+          DO 730 J=1,5
+            LAN=LAN+1
+            RAN=RAN-1
+            ISUMAN=ISUMAN+SCORES
+     +      (CTONUM(SEQ(RAN)),CTONUM(SEQ(LAN)))
+730       CONTINUE
+C   score high enough?
+          IF(ISUMAN.GE.MINANT)RETURN
+740     CONTINUE
+750   CONTINUE
+      END
+C
+C    dloop
+      SUBROUTINE DLOOP(SEQ,IDIM,ISUMD,LAM,LAN,SCORES)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER SCORES(5,5),RD,CTONUM
+      EXTERNAL CTONUM
+C   first base left end is lam+3
+C   first base right end is lan-6
+C
+      LD=LAM+2
+      RD=LAN-5
+      ISUMD=0
+C
+      DO 10 I=1,4
+        LD=LD+1
+        RD=RD-1
+        ISUMD=ISUMD+SCORES(CTONUM(SEQ(LD)),CTONUM(SEQ(RD)))
+10    CONTINUE
+      END
+C    trnadr
+      SUBROUTINE TRNADR(SEQ,IDIM,I,N,LAN,RAN,LTU,RTU,IDEV,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CHARACTER BLANK
+      CHARACTER ARRAY(30,30)
+      INTEGER RAN,RTU
+      CHARACTER SCORES(5,5),SCOREZ(5,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE SCORES,SCOREZ,BLANK
+      DATA BLANK/' '/
+      DATA SCORES/25*' '/,SCOREZ/25*' '/
+      SCORES(1,3)='-'
+      SCORES(3,1)='-'
+      SCORES(2,4)='-'
+      SCORES(4,2)='-'
+      SCORES(1,4)='+'
+      SCORES(4,1)='+'
+      SCOREZ(1,3)='!'
+      SCOREZ(3,1)='!'
+      SCOREZ(2,4)='!'
+      SCOREZ(4,2)='!'
+      SCOREZ(1,4)='+'
+      SCOREZ(4,1)='+'
+C
+C   blank arrays
+      DO 10 J=1,30
+      DO 10 K=1,30
+        ARRAY(K,J)=BLANK
+10    CONTINUE
+C   aminoacyl
+      ARRAY(16,5)=(SEQ(I))
+      J=I-1
+      K=N
+      DO 20 L=6,12
+        ARRAY(14,L)=(SEQ(K))
+        ARRAY(16,L)=(SEQ(J))
+        ARRAY(15,L)=SCORES(CTONUM(SEQ(K)),CTONUM(SEQ(J)))
+        K=K+1
+        J=J-1
+20    CONTINUE
+C
+C   gap between aminoacyl d stem
+      ARRAY(13,13)=(SEQ(N+7))
+      ARRAY(12,14)=(SEQ(N+8))
+C
+C
+C   anticodon
+      K=LAN-4
+      J=RAN+4
+      DO 30 L=19,23
+        ARRAY(13,L)=(SEQ(K))
+        ARRAY(15,L)=(SEQ(J))
+        ARRAY(14,L)=SCORES(CTONUM(SEQ(K)),CTONUM(SEQ(J)))
+        K=K+1
+        J=J-1
+30    CONTINUE
+C
+C   anticodon loop
+      ARRAY(12,24)=(SEQ(LAN+1))
+      ARRAY(12,25)=(SEQ(LAN+2))
+      ARRAY(16,25)=(SEQ(LAN+6))
+      ARRAY(16,24)=(SEQ(RAN-1))
+      K=LAN+3
+      DO 35 L=13,15
+        ARRAY(L,26)=(SEQ(K))
+        K=K+1
+35    CONTINUE
+C
+C   gap between dstem and anticodon
+      ARRAY(12,18)=(SEQ(LAN-5))
+C
+C   tustem
+      K=LTU+4
+      L=RTU-4
+      DO 40 J=17,21
+        ARRAY(J,13)=(SEQ(K))
+        ARRAY(J,15)=(SEQ(L))
+        ARRAY(J,14)=SCOREZ(CTONUM(SEQ(K)),CTONUM(SEQ(L)))
+        K=K-1
+        L=L+1
+40    CONTINUE
+C   tu loop
+C   length of loop
+      LNGTH=K-L+1
+      IF(LNGTH.GT.18)RETURN
+C   return if loop length lt 3
+      IF(LNGTH.LT.3)RETURN
+C   number in outer
+      LU=(LNGTH-1)/2
+      M=22
+      DO 45 J=1,LU
+        ARRAY(M,12)=(SEQ(K))
+        ARRAY(M,16)=(SEQ(L))
+        K=K-1
+        L=L+1
+        M=M+1
+45    CONTINUE
+C
+      ARRAY(M,15)=(SEQ(L))
+      L=L+1
+      IF(MOD(LNGTH,2).EQ.0)ARRAY(M,14)=(SEQ(L))
+C
+C
+46    CONTINUE
+C   dstem
+      K=N+9
+      L=LAN-6
+      LNGTH=L-K+1
+      J2=4
+      M=11
+      IF(LNGTH.GT.10)GO TO 47
+C   need stem of 3 not 4
+      M=10
+      J2=3
+47    CONTINUE
+      DO 50 J=1,J2
+        ARRAY(M,15)=(SEQ(K))
+        ARRAY(M,17)=(SEQ(L))
+        ARRAY(M,16)=SCOREZ(CTONUM(SEQ(K)),CTONUM(SEQ(L)))
+        K=K+1
+        L=L-1
+        M=M-1
+50    CONTINUE
+C
+C   dloop
+C   length of loop
+      LNGTH=L-K+1
+      IF(LNGTH.GT.14)RETURN
+C   if loop length lt 3 return
+      IF(LNGTH.LT.3)RETURN
+C   number in outer
+      LU=(LNGTH-1)/2
+      DO 80 J=1,LU
+        ARRAY(M,14)=(SEQ(K))
+        ARRAY(M,18)=(SEQ(L))
+        K=K+1
+        L=L-1
+        M=M-1
+80    CONTINUE
+      ARRAY(M,17)=(SEQ(L))
+      L=L-1
+      IF(MOD(LNGTH,2).EQ.0)ARRAY(M,16)=(SEQ(L))
+C
+C
+85    CONTINUE
+C   variable loop
+      LNGTH=(RTU-5)-(RAN+5)+1
+      IF(LNGTH.GT.26)RETURN
+      IF(LNGTH.LT.3)RETURN
+C   number of elements in lower diagonal
+      LL=(LNGTH/2)-1
+C   correction made in june 81 for lngthg=3
+      IF(LL.EQ.0)LL=1
+C   lsi must go thru loops even with 0!
+C   number in upper
+      LU=(LNGTH+1)/2
+C   do lower
+      K=16
+      L=19
+      M=RAN+5
+      DO 60 J=1,LL
+        ARRAY(K,L)=(SEQ(M))
+        K=K+1
+        L=L+1
+        M=M+1
+60    CONTINUE
+C   check for silly loop size
+      IF(LNGTH.LT.3)RETURN
+C   put in extra base
+      K=K+1
+      L=L-1
+      IF(LNGTH.GT.3)ARRAY(K,L)=(SEQ(M))
+C do upper
+      K=17
+      L=16
+      M=RTU-5
+      DO 70 J=1,LU
+        ARRAY(K,L)=(SEQ(M))
+        K=K+1
+        L=L+1
+        M=M-1
+70    CONTINUE
+C
+C   output
+      WRITE(IDEV,1002)N+KSTART-1
+1002  FORMAT(/5X,I6)
+      DO 200 M=1,30
+        DO 100 J=1,30
+          IF(ARRAY(J,M).EQ.BLANK)GO TO 100
+          WRITE(IDEV,1000)(ARRAY(L,M),L=1,30)
+          GO TO 200
+100     CONTINUE
+200   CONTINUE
+1000  FORMAT(5X,30A)
+      END
+      SUBROUTINE TULOOP(SEQ,IDIM,RAM,LTUMIN,LTUMAX,MINTU,ISUMTU,
+     +SCORES,RTU,LTU,MTU)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER SCORES(5,5)
+      INTEGER RAM,RTU,RAMM2,CTONUM
+      EXTERNAL CTONUM
+      ISUMTU=0
+C   loop must start at ram-1
+C   vary loop from ltumin to ltumax
+      RAMM2=RAM-2
+      M=LTUMIN+9
+      N=LTUMAX+9
+C   is this first pass thru this aminoacyl?
+      IF(MTU.GT.0)M=MTU+1
+C   all done?
+      IF(M.GT.N)RETURN
+C
+      DO 650 I=M,N
+        ISUMTU=0
+        RTU=RAMM2-I
+        DO 640 J=1,5
+          LTU=RAM-J
+          RTU=RTU+1
+          ISUMTU=ISUMTU+SCORES(
+     +    CTONUM(SEQ(RTU)),CTONUM(SEQ(LTU)))
+640     CONTINUE
+        MTU=I
+C   score high enough?
+        IF(ISUMTU.GE.MINTU)RETURN
+650   CONTINUE
+      END
+      SUBROUTINE CODMPP(SEQ,IDIM,FTABLE,IDFTAB,
+     +KBOUT,J1,J2,I1,I2,IDIM1,IDEV)
+C   AUTHOR: RODGER STADEN
+      REAL RNFAC,FTABLE(IDFTAB),FACTOR,FACN
+      INTEGER NBASE(4),NCOD(4,4,4)
+      EXTERNAL FACTOR
+      CHARACTER SEQ(IDIM)
+      INTEGER NCODON(21),IACID(4,4,4)
+      REAL FABC(4,4,4),FA(21)
+C    array with number of codons per acid
+      SAVE IACID,NCODON
+      DATA IACID/
+     +1,2,3,5,6,7,8,9,
+     +10,12,14,16,18,20,6,21,
+     +1,2,3,5,6,7,8,9,
+     +10,12,14,16,18,20,6,21,
+     +2,2,3,5,6,7,8,9,
+     +11,13,15,17,11,20,20,21,
+     +2,2,4,5,6,7,8,9,
+     +11,13,15,17,19,20,20,21/
+      DATA NCODON/
+     +2,6,3,1,4,6,4,4,4,2,3,2,2,2,2,2,2,2,1,6,4/
+      LB=150
+      LF=147
+      IF(IDIM1.GT.310)THEN
+        CALL BUSY(KBOUT)
+        FACN=FACTOR(99,FTABLE,IDFTAB)
+        SUMY=0.
+C  do calc
+        CALL ROT2(SEQ(I1-J1+1),IDIM1,NCOD,RNFAC,FTABLE,IDFTAB,
+     +  NBASE,297,FA,FABC,NCODON,IACID)
+        COMP=0.
+        DO 90 J=I1+LB-J1+1,I2-LF-2-J1+1,3
+          CALL ROTFAK(SEQ,IDIM,J,LB,LF,
+     +    NCOD,RNFAC,FTABLE,IDFTAB,
+     +    NBASE,COMP,297,99.,
+     +    FA,FABC,NCODON,IACID)
+          W=RNFAC-COMP-FACN
+          CALL POISON(99,FABC,FABC,EW,EWD,2)
+          Y=(W-EW)/EWD
+          SUMY=SUMY+Y
+90      CONTINUE
+        SUMY=SUMY/((I2-LF-2-(I1+LB))/3.)
+        WRITE(IDEV,5555)SUMY
+5555    FORMAT('  Mean (W-EW)/EWD, window 99 ',F10.1)
+      END IF
+      END
+      SUBROUTINE PREPT1(SEQ,POSNS,IDIM,J1IN,J2IN,KSTART,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,DIALOG,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      INTEGER WORDP(MAXDIC),WORDN(MAXDIC)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),DIALOG
+      CHARACTER SEQ(IDIM)
+      INTEGER R
+      CALL SHOWFU(KBOUT,'Plot repeats')
+      CALL GETPAR(32,4,IOK,MINRP,MAXRP,MINREP,IGON,
+     +IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      L = J1IN
+      R = J2IN
+      IF(DIALOG.EQ.1) THEN
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      CALL GTREG(KBIN,KBOUT,J1IN,J2IN,L,R,
+     +'Define restricted region',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GETINT(MINRP,MAXRP,MINREP,
+     +'Minimum repeat',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINREP = IVAL
+      END IF
+      IDIM1=R-L+1
+      IDIM2=J2IN-J1IN+1
+      L1=L-J1IN+1
+      CALL BUSY(KBOUT)
+      CALL REPEAT(SEQ(L-KSTART+1),
+     +POSNS,IDIM1,MINREP,L1,J1IN,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      CALL VT100M
+      END
+C   REPEAT
+      SUBROUTINE REPEAT(SEQ,POSNS,IDIM,MINR,L,KSTART,IDIM1,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      INTEGER WORDP(MAXDIC),WORDN(MAXDIC)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),CONST(6)
+      CHARACTER SEQ(IDIM)
+      LENGTH=MAXWRD
+      LE4=MAXDIC
+      CALL ENCO(SEQ,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LE4,LENGTH)
+      CALL REPT1(SEQ,IDIM,IDIM1,L,KSTART,
+     +POSNS,WORDN,WORDP,LENGTH,LE4,MINR,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON)
+      END
+      SUBROUTINE REPT1(SEQ,IDIM,IDIM1,LEFT,KSTART,
+     +POSNS,WORDN,WORDP,LENGTH,LE4,MINR,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),WORDP(LE4)
+      INTEGER WORDN(LE4)
+      CHARACTER SEQ(IDIM)
+C   POSNS CONTAIN THE WORD NUMBERS ENCODED BY 4**LENGTH
+C   WORDN CONTAINS THE NUMBER OF OCCURENCES OF EACH POSSIBLE WORD
+C   WORDP CONTAINS THE POINTERS TO THE FIRST OCCURRENCE OF THE WORDS
+C   ENCODED--- EG POINTER TO FIRST OCCURRENCE OF THE WORD ENCODED AS 101
+C   WILL BE STORED IN ELEMENT 101
+      IF(IGON.EQ.0) CALL VECTOM
+      XMIN=1.
+      XMAX=IDIM1
+      YMIN=0.
+      YMAX=1.
+      X1=LEFT
+      Y1=0.
+      YINC=1./IDIM1
+C   LOOK FOR MATCHES BY THOSE ELEMENTS OF WORDN>1
+      DO 100 I=1,LE4
+        IF(WORDN(I).GT.1)THEN
+C   POINT TO FIRST POSITION AS ITS IN WORDP
+          IP=WORDP(I)
+C   A MATCH, TRY ALL PAIRS
+          DO 50 J=1,WORDN(I)
+C   COMPARE THIS POSITION WITH ALL OTHERS
+            IPS=IP
+            DO 40 K=J+1,WORDN(I)
+C   FIND LENGTH OF MATCH
+              L=LENGTH
+              IP1=IP+LENGTH
+              IPS=POSNS(IPS)
+              IP2=IPS+LENGTH
+20            CONTINUE
+              IF((IP1.LE.IDIM).AND.(IP2.LE.IDIM))THEN
+                IF(SEQ(IP1).EQ.SEQ(IP2))THEN
+                  IP1=IP1+1
+                  IP2=IP2+1
+                  L=L+1
+                  GO TO 20
+                END IF
+              END IF
+C   END OF MATCH, IS IT LONG ENOUGH?
+              IF(L.GE.MINR)THEN
+                X1=LEFT+IP-1
+                X2=LEFT+IPS-1
+                Y2=Y1+YINC*ABS(X2-X1)
+                IF(IGON.EQ.0)THEN
+                  CALL LINE(X1,X1,Y1,Y2,
+     +            XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +            MARGB,MARGT,ISXMAX,ISYMAX)
+                  CALL LINE(X1,X2,Y2,Y2,
+     +            XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +            MARGB,MARGT,ISXMAX,ISYMAX)
+                  CALL LINE(X2,X2,Y2,Y1,
+     +            XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +            MARGB,MARGT,ISXMAX,ISYMAX)
+                END IF
+                IF(IGON.EQ.1)THEN
+                  INTX1=X1+KSTART-1
+                  INTX2=X2+KSTART-1
+                  WRITE(IDEV,1000)INTX1,INTX2,L,(SEQ(KK),KK=IP,IP+L-1)
+                END IF
+              END IF
+1000          FORMAT(' ',I6,2X,I6,2X,I6,('  ',50A1))
+40          CONTINUE
+            IP=POSNS(IP)
+50        CONTINUE
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE PREPT2(SEQ,POSNS,IDIM,J1IN,J2IN,KSTART,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,DIALOG,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      INTEGER WORDP(MAXDIC),WORDN(MAXDIC)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),DIALOG
+      CHARACTER SEQ(IDIM)
+      INTEGER R
+      CALL SHOWFU(KBOUT,'Plot long-range inverted repeats')
+      CALL GETPAR(31,4,IOK,MINRP,MAXRP,MINREP,IGON,
+     +IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      L = J1IN
+      R = J2IN
+      IF(DIALOG.EQ.1) THEN
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      CALL GTREG(KBIN,KBOUT,J1IN,J2IN,L,R,
+     +'Define restricted region',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GETINT(MINRP,MAXRP,MINREP,
+     +'Minimum inverted repeat',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINREP = IVAL
+      END IF
+      IDIM1=R-L+1
+      IDIM2=J2IN-J1IN+1
+      L1=L-J1IN+1
+      CALL BUSY(KBOUT)
+      CALL REPT2(SEQ(L-KSTART+1),POSNS,IDIM1,MINREP,L1,J1IN,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      CALL VT100M
+      END
+C   REPT2
+      SUBROUTINE REPT2(SEQ,POSNS,IDIM,MINR,L,KSTART,IDIM1,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON,
+     +WORDP,WORDN,MAXWRD,MAXDIC)
+      INTEGER WORDP(MAXDIC),WORDN(MAXDIC)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),CONST(6)
+      CHARACTER SEQ(IDIM)
+      LENGTH=MAXWRD
+      LE4=MAXDIC
+      CALL ENCO(SEQ,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LE4,LENGTH)
+      CALL REPT3(SEQ,IDIM,IDIM1,L,KSTART,
+     +POSNS,WORDN,WORDP,LENGTH,LE4,MINR,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON)
+      END
+      SUBROUTINE REPT3(SEQ,IDIM,IDIM1,LEFT,KSTART,
+     +POSNS,WORDN,WORDP,LENGTH,LE4,MINR,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,IGON)
+C   AUTHOR: RODGER STADEN
+      INTEGER POSNS(IDIM),WORDP(LE4)
+      INTEGER WORDN(LE4),CONST(6)
+      CHARACTER SEQ(IDIM),SCOMP
+      EXTERNAL SCOMP
+C   POSNS CONTAIN THE WORD NUMBERS ENCODED BY 4**LENGTH
+C   WORDN CONTAINS THE NUMBER OF OCCURENCES OF EACH POSSIBLE WORD
+C   WORDP CONTAINS THE POINTERS TO THE FIRST OCCURRENCE OF THE WORDS
+C   ENCODED--- EG POINTER TO FIRST OCCURRENCE OF THE WORD ENCODED AS 101
+C   WILL BE STORED IN ELEMENT 101
+      IF(IGON.EQ.0) CALL VECTOM
+      XMIN=1.
+      XMAX=IDIM1
+      YMIN=0.
+      YMAX=1.
+      X1=LEFT
+      Y1=0.
+      YINC=1./IDIM1
+C   SET UP CONST FOR ROUTINE THAT LOOKS FOR COMPLEMENTARY WORD POINTERS
+      ISTART=1
+      DO 1 I=1,LENGTH
+        CONST(I)=4**(I-1)
+        ISTART=ISTART-CONST(I)
+1     CONTINUE
+C
+C   LOOK FOR MATCHES BY THOSE ELEMENTS OF WORDN>1
+      DO 100 I=1,LE4
+        IF(WORDN(I).GE.1)THEN
+C   POINT TO FIRST POSITION AS ITS IN WORDP
+          IP=WORDP(I)
+C   DOES ITS COMPLEMENT EXIST?
+          JCOMPN=ICOMPN(SEQ,IDIM,IP,CONST,LENGTH,ISTART)
+          NCOMP=WORDN(JCOMPN)
+          IF(NCOMP.NE.0)THEN
+C       YES, NEED TO COMPARE ALL PAIRS IE COMPARE WORDN(I) POSITIONS
+C       WITH NCOMP POSITIONS. POINT TO FIRST OCCURRENCE
+            IPN=WORDP(JCOMPN)
+C   A MATCH, TRY ALL PAIRS
+            DO 50 J=1,WORDN(I)
+C   COMPARE THIS POSITION WITH ALL OTHERS
+              IPN1=IPN
+              DO 40 K=1,NCOMP
+C   FIND LENGTH OF MATCH
+                L=LENGTH
+                IP1=IP+LENGTH
+                IP2=IPN1-1
+20              CONTINUE
+                IF((IP1.LE.IDIM).AND.(IP2.GT.1))THEN
+                  IF(SEQ(IP1).EQ.SCOMP(SEQ(IP2)))THEN
+                    IP1=IP1+1
+                    IP2=IP2-1
+                    L=L+1
+                    GO TO 20
+                  END IF
+                END IF
+                IF(L.GE.MINR)THEN
+                  X1=LEFT+IP-1
+                  X2 = LEFT + IP2
+C                  X2=LEFT+IPN1-3
+                  Y2=Y1+YINC*ABS(X2-X1)
+                  IF(IGON.EQ.0)THEN
+                    CALL LINE(X1,X1,Y1,Y2,
+     +              XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +              MARGB,MARGT,ISXMAX,ISYMAX)
+                    CALL LINE(X1,X2,Y2,Y2,
+     +              XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +              MARGB,MARGT,ISXMAX,ISYMAX)
+                    CALL LINE(X2,X2,Y2,Y1,
+     +              XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +              MARGB,MARGT,ISXMAX,ISYMAX)
+                  END IF
+                  IF(IGON.EQ.1)THEN
+                    INTX1=X1+KSTART-1
+                    INTX2=X2+KSTART-1
+                    IF(IP.EQ.IPN1) INTX2 = INTX1
+                    WRITE(IDEV,1000)INTX1,INTX2,L,(SEQ(KK),KK=IP,IP+L-1)
+                  END IF
+                END IF
+                IPN1=POSNS(IPN1)
+1000            FORMAT(' ',I6,2X,I6,2X,I6,('  ',50A1))
+40            CONTINUE
+C   POINT TO NEXT WORD
+              IP=POSNS(IP)
+50          CONTINUE
+            WORDN(JCOMPN)=0
+          END IF
+C   SET THIS WORD TO ZERO SO WE DONT USE IT AGAIN
+          WORDN(I)=0
+        END IF
+100   CONTINUE
+      END
+      INTEGER FUNCTION ICOMPN(SEQ,IDIM,IP,CONST,LENGTH,ISTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER CONST(LENGTH),CTONUM
+      EXTERNAL CTONUM,ICOMP
+      N=ISTART
+      L=IP+LENGTH
+      K=0
+      DO 1 J=1,LENGTH
+        K=K+1
+        L=L-1
+        N=N+CONST(K)*ICOMP(CTONUM(SEQ(L)))
+1     CONTINUE
+      ICOMPN=N
+      END
+      SUBROUTINE DISIG(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,
+     +SUM,WT,TOT,IDM,MAXLEN,LINE,CHRSET,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),HELPF*(*)
+      CHARACTER LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,IDM,MAXLEN),TOT(MAXLEN),DIALOG
+      REAL WT(IDM,IDM,MAXLEN)
+      CALL SHOWFU(KBOUT,
+     +'Motif search using dinucleotide weight matrix')
+      CALL GETPAR(60,1,IOK,IGON,
+     +IPAR2,IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL DISIGD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMAX,
+     +IGON,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL DISIGP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMAX,IGON,MIDDLE)
+      END
+      SUBROUTINE DISIGD(IDEV,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WT,LENGTH,MIDDLE,YMIN,YMAX,
+     +IGON,IDEVOT,LINE,CHRSET,DIALOG,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,IDM,MAXLEN),TOT(MAXLEN),DIALOG
+      REAL WT(IDM,IDM,MAXLEN)
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      IOK = 1
+      IDO = 1
+      PROMPT(1) = 'Use weight matrix'
+      PROMPT(2) = 'Make weight matrix'
+      PROMPT(3) = 'Rescale weight matrix'
+      CALL RADION('Select operation',PROMPT,3,IDO,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IDO.LT.1) RETURN
+      IF(IDO.GT.1) THEN
+        CALL MKWTD(WT,SUM,TOT,CHRSET,IDM,MAXLEN,
+     +  IDEV,IDEVOT,KBIN,KBOUT,LINE,
+     +  FILNAM,IDO,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IOK = 1
+        RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Motif weight matrix file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      LENGTH=MAXLEN
+      CALL RDWMTD(TOT,SUM,MIDDLE,LENGTH,MAXLEN,YMIN,YMAX,IDEV,
+     +  IOK,IDM,KBOUT)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      IF(DIALOG.EQ.1) THEN
+        AMN = -9999.
+        AMX = 9999.
+        CALL GETRL(AMN,AMX,YMIN,'Cutoff score',VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        YMIN = VAL
+      END IF
+      IF(YMIN.LT.0.0)CALL GETWD(TOT,SUM,WT,LENGTH,IDM,MAXLEN)
+      IF(YMIN.GE.0.0)CALL GETW2D(SUM,WT,LENGTH,IDM,MAXLEN)
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      IOK = 0
+      END
+      SUBROUTINE GETCD(TOT,SUM,LINE,IDM,MAXLEN,IDEV,KBOUT,LENGTH,
+     +IOK)
+      INTEGER TOT(MAXLEN),SUM(IDM,IDM,MAXLEN)
+      CHARACTER LINE(MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IOK = 1
+      DO 2 I=1,MAXLEN
+        TOT(I)=0
+        DO 1 J=1,IDM
+        DO 1 K=1,IDM
+          SUM(K,J,I)=0
+1       CONTINUE
+2     CONTINUE
+      N=0
+10    CONTINUE
+1003  FORMAT(1X,120A1)
+1004  FORMAT(' ',I6,' ',120A1)
+      READ(IDEV,1003,END=100)LINE
+      N=N+1
+      WRITE(KBOUT,1004)N,LINE
+      DO 20 I=1,MAXLEN
+        IF(LINE(I).EQ.' ')GO TO 10
+        J = CTONUM(LINE(I+1))
+        K = CTONUM(LINE(I))
+        SUM(K,J,I) = SUM(K,J,I) + 1
+20    CONTINUE
+      GO TO 10
+100   CONTINUE
+      IF(N.EQ.0)THEN
+        CALL ERROM(KBOUT,'Empty file of aligned sequences')
+        RETURN
+      END IF
+C   NOW FIND LENGTH OF MOTIF
+      DO 40 I=1,MAXLEN
+        K=0
+        L=I
+        DO 30 J=1,IDM
+        DO 30 M=1,IDM
+          K=K+SUM(M,J,I)
+30      CONTINUE
+        IF(K.EQ.0)GO TO 50
+        TOT(I)=TOT(I)+K
+40    CONTINUE
+50    CONTINUE
+      LENGTH=L-2
+      IOK = 0
+      END
+      SUBROUTINE GETWD(TOT,SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER TOT(LENGTH),SUM(MAXCHR,MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+        DO 60 K=1,MAXCHR
+          FREQ(K,J,I)=LOG((REAL(SUM(K,J,I)+1)/REAL(TOT(I)+MAXCHR)))
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE GETW2D(SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER SUM(MAXCHR,MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+        DO 60 K=1,MAXCHR
+          FREQ(K,J,I)=REAL(SUM(K,J,I))
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE MKWTD(FREQ,SUM,TOT,CHRSET,IDM,MAXLEN,
+     +IDEV2,IDEV3,KBIN,KBOUT,LINE,
+     +FILNAM,IOPT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER LINE(MAXLEN),TITLE*60,CHRSET(IDM)
+      INTEGER SUM(IDM,IDM,MAXLEN),TOT(MAXLEN)
+      REAL FREQ(IDM,IDM,MAXLEN)
+      IOK = 1
+      IF(IOPT.EQ.3)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Name of existing weight matrix file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL RDWMTD(TOT,SUM,MIDDLE,LENGTH,MAXLEN,
+     +  CUTMIN,CUTMAX,IDEV2,
+     +  IOK,IDM,KBOUT)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Name of aligned sequences file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(IOPT.EQ.2) THEN
+        CALL GETCD(TOT,SUM,LINE,IDM,MAXLEN,IDEV2,
+     +  KBOUT,LENGTH,IOK)
+        IF(IOK.NE.0)RETURN
+      END IF
+      WRITE(KBOUT,1006)LENGTH
+1006  FORMAT(' Length of motif',I6)
+      IOK = 1
+      CALL YESNO(IOPT,'Sum logs of weights',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.0) RETURN
+      CUTMIN = -10.0
+      IF(IOPT.EQ.1)CUTMIN = 10.
+      CALL MASKWD(SUM,LENGTH,IDM,MAXLEN,KBIN,KBOUT,TITLE,
+     +IOPT,IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOPT.LT.0) RETURN
+C     NOW CALC WEIGHTS
+      IF(CUTMIN.LT.0.0)CALL GETWD(TOT,SUM,FREQ,LENGTH,IDM,MAXLEN)
+      IF(CUTMIN.GE.0.0)CALL GETW2D(SUM,FREQ,LENGTH,IDM,MAXLEN)
+C     NOW APPLY THE WEIGHTS
+      REWIND IDEV2
+      CALL APPLWD(FREQ,IDM,LENGTH,IDEV2,IDEV3,KBIN,KBOUT,LINE,MAXLEN,
+     +BOT,TOP,TITLE,MIDDLE,IOK)
+      CLOSE(UNIT=IDEV2)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for new weight matrix file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL WRTSCD(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +TOT,SUM,CHRSET,IDEV2,MAXLEN)
+      CLOSE(UNIT=IDEV2)
+      RETURN
+      END
+      SUBROUTINE MASKWD(SUM,LENGTH,IDM,MAXLEN,KBIN,KBOUT,MASK,
+     +IOPT,IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER SUM(IDM,IDM,MAXLEN)
+      CHARACTER MASK*(*),HELPF*(*)
+      CALL YESNO(IOPT,'Use all motif positions',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.0) RETURN
+5     CONTINUE
+      IF(IOPT.EQ.1)THEN
+        WRITE(KBOUT,1002)
+1002    FORMAT(' x means use, - means ignore',/,
+     +         ' e.g. xx-x---x-x means use positions 1,2,4,8,10')
+        LIN = 0
+        CALL GTSTR('Mask',' ',MASK,LIN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 5
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(LIN.EQ.0) RETURN
+        DO 70 I=1,LENGTH
+          IF(MASK(I:I).EQ.'-')THEN
+            DO 60 J=1,IDM
+            DO 60 K=1,IDM
+              SUM(K,J,I) = 0
+60          CONTINUE
+          END IF
+70      CONTINUE
+      END IF
+      END
+      SUBROUTINE APPLWD(FREQ,IDM,LENGTH,IDEV,IDEV3,KBIN,KBOUT,
+     +LINE,MAXLEN,BOT,TOP,TITLE,MIDDLE,IOK)
+      REAL FREQ(IDM,IDM,MAXLEN)
+      CHARACTER LINE(MAXLEN),TITLE*(*)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IOK = 1
+      N=0
+      TOP=-99999.
+      BOT=9999999.
+      WRITE(KBOUT,*)' Applying weights to input sequences'
+      SMEAN = 0.
+      SUMSQ = 0.
+1003  FORMAT(1X,120A1)
+1004  FORMAT(' ',I4,' ',F12.3,' ',120A1)
+200   CONTINUE
+      READ(IDEV,1003,END=300)LINE
+      N=N+1
+      SCORE=0.
+      DO 210 I=1,LENGTH
+        J = CTONUM(LINE(I+1))
+        K = CTONUM(LINE(I))
+        SCORE=SCORE+FREQ(K,J,I)
+210   CONTINUE
+      WRITE(IDEV3,1004)N,SCORE,(LINE(K),K=1,LENGTH)
+      IF(SCORE.GT.TOP)TOP=SCORE
+      IF(SCORE.LT.BOT)BOT=SCORE
+      SMEAN=SMEAN+SCORE
+      SUMSQ=SUMSQ+SCORE*SCORE
+      GO TO 200
+300   CONTINUE
+      IF(N.LT.1)THEN
+        CALL ERROM(KBOUT,'Error: empty sequence file')
+        RETURN
+      END IF
+      SMEAN=SMEAN/N
+      SM=SMEAN
+      SMEAN=SMEAN*SMEAN
+      SUMSQ=SUMSQ/N
+      SD = 0.
+      T = SUMSQ - SMEAN
+      IF(T.GT.0.)SD = SQRT(T)
+      SMM3=SM-3*SD
+      SMP3=SM+3*SD
+      WRITE(KBOUT,1000)TOP,BOT
+1000  FORMAT(' Top score',F12.3,'  Bottom score',F12.3)
+      WRITE(KBOUT,1001)SM,SD
+1001  FORMAT(' Mean',F12.3,'  Standard deviation',F12.3)
+      WRITE(KBOUT,1002)SMM3,SMP3
+1002  FORMAT(' Mean minus 3.sd',F12.3,'  Mean plus 3.sd',F12.3)
+      BOT=SMM3
+      TOP=SMP3
+      XMN = -999.
+      XMX = 9999.
+      CALL GETRL(XMN,XMX,BOT,'Cutoff score',VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      BOT = VAL
+      XMN = BOT
+      XMX = 999.
+      CALL GETRL(XMN,XMX,TOP,'Top score for scaling plots',
+     +VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      TOP = VAL
+      MN = 0
+      MX = LENGTH
+      MIDDLE = 1
+      CALL GETINT(MN,MX,MIDDLE,'Position to identify',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+       MIDDLE = IVAL
+305   CONTINUE 
+      LIN = 0
+      CALL GTSTR('Title',' ',TITLE,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 305
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IOK = 0
+      END
+      SUBROUTINE GETW1D(SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER SUM(MAXCHR,MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+        DO 60 K=1,MAXCHR
+          FREQ(K,J,I) = 0.
+          IF(SUM(K,J,I).GT.0)FREQ(K,J,I) = 1.
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE WRTSCD(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +TOT,SUM,CHRSET,IDEV,MAXLEN)
+      INTEGER TOT(LENGTH),SUM(IDM,IDM,MAXLEN)
+      CHARACTER CHRSET(IDM),TITLE*(*)
+C   PROTEIN MATRICES DONT WRITE ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      MINUS = 1
+      IF(IDM.EQ.26)MINUS = 4
+      WRITE(IDEV,1018)TITLE
+1018  FORMAT(' ',A)
+1019  FORMAT('  P',20I4)
+1020  FORMAT('  N',20I4)
+1021  FORMAT(' ',A,A,20I4)
+1022  FORMAT(' ',2I6,2F10.3)
+      WRITE(IDEV,1022)LENGTH,MIDDLE,BOT,TOP
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 400 J=1,NLINES
+        K2=MIN((K1+19),LENGTH)
+        WRITE(IDEV,1019)(K,K=K1-MIDDLE,K2-MIDDLE)
+        WRITE(IDEV,1020)(TOT(K),K=K1,K2)
+        DO 390 I=1,IDM-MINUS
+        DO 390 K=1,IDM-MINUS
+          WRITE(IDEV,1021)CHRSET(K),CHRSET(I),(SUM(K,I,KK),KK=K1,K2)
+390     CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+400   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE RDWMTD(TOT,WT,MIDDLE,LENGTH,MAXLEN,YMIN,YMAX,IDEV,
+     +IFAIL,IDM,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER WT(5,5,MAXLEN),TOT(MAXLEN)
+      CHARACTER LINE*79
+C   SET FAIL FLAG
+      IFAIL=1
+1000  FORMAT( )
+1001  FORMAT(3X,20I4)
+1002  FORMAT(1X,2I6,2F10.3)
+1003  FORMAT(A)
+1004  FORMAT(' ',A)
+      DO 3 I = 1,MAXLEN
+        TOT(I) = 0
+        DO 2 J = 1,5
+          DO 1 K = 1,5
+            WT(K,J,I) = 0
+1         CONTINUE
+2       CONTINUE
+3     CONTINUE
+C   READ TITLE
+      READ(IDEV,1003,ERR=100,END=100)LINE
+      WRITE(KBOUT,1004)LINE
+C   READ PLOT VALUES ETC
+      READ(IDEV,1002,ERR=100,END=100)
+     +LENGTH,MIDDLE,YMIN,YMAX
+C   HOW MANY LINES TO READ?
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 10 JJ=1,NLINES
+        READ(IDEV,1000,ERR=100,END=100)
+        K2=MIN((K1+19),LENGTH)
+C        READ(IDEV,1001,ERR=100,END=100)(TOT(K),K=K1,K2)
+        READ(IDEV,1000,ERR=100,END=100)
+        DO 5 I=1,4
+        DO 5 J=1,4
+          READ(IDEV,1001,ERR=100,END=100)(WT(J,I,K),K=K1,K2)
+          DO 6 K=K1,K2
+            TOT(K) = TOT(K) + WT(J,I,K)
+6         CONTINUE
+5       CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+10    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   SET FAIL FLAG TO GOOD
+      IFAIL=0
+      RETURN
+100   CONTINUE
+      END
+      SUBROUTINE DISIGP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMAX,IGON,MIDDLE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,IDM,MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      XMAX=J2
+      XMIN=J1
+      IF(IGON.EQ.0)THEN
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      DO 10 I=J1,J2-LENGTH
+        SUM1=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          KT1=CTONUM(SEQ(J))
+          KT2=CTONUM(SEQ(J+1))
+          SUM1 = SUM1 + WT(KT1,KT2,K)
+5       CONTINUE
+        IF(SUM1.GE.YMIN)THEN
+          IF(IGON.EQ.0)THEN
+            X=I+MIDDLE
+            CALL LINE(X,X,YMIN,SUM1,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          IF(IGON.EQ.1)THEN
+            CALL VT100M
+            WRITE(IDEVOT,1010)I+MIDDLE,SUM1,(SEQ(K),K=I,I+LENGTH-1)
+1010        FORMAT(' ',I7,F10.2,' ',120A1)
+          END IF
+        END IF
+10    CONTINUE
+      CALL VT100M
+      RETURN
+      END
+C  LOCALF
+      SUBROUTINE LOCALF(SEQNCE,IDIM1,STRING,MATCH,IDIM3I,
+     +ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,J1,J2,KSTART,
+     +IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM3I),MATCH(IDIM3I)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER ANSRV,ANSC
+      INTEGER SPAN,DIALOG
+      CALL SHOWFU(KBOUT,
+     +'Search for local similarity or complementarity')
+      CALL GETPAR(36,15,IOK,MINSP,MAXSP,SPAN,MINPR,MAXPR,IPR,
+     +MNIR1,MXIR1,IRAN1,MNIR2,MXIR2,IRAN2,
+     +ANSRV,ANSC,IGON,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      PR = REAL(IPR)
+10    CONTINUE
+      CALL LOCALD(
+     +ANSRV,ANSC,IOK,MINSP,MAXSP,SPAN,MINPR,MAXPR,IPR,PR,
+     +MNIR1,MXIR1,IRAN1,MNIR2,MXIR2,IRAN2,IGON,
+     +J1,J2,IS,IE,KBIN,KBOUT,MINS,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL LOCALP(SEQNCE,IDIM1,STRING,MATCH,
+     +ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KSTART,
+     +ANSRV,SPAN,IRAN1,IRAN2,IGON,MINS,
+     +J1,J2,IS,IE,KBIN,KBOUT,IDEV)
+      DIALOG = 1
+      GO TO 10
+      END
+      SUBROUTINE LOCALD(
+     +ANSRV,ANSC,IOK,MINSP,MAXSP,SPAN,MINPR,MAXPR,IPR,PR,
+     +MNIR1,MXIR1,IRAN1,MNIR2,MXIR2,IRAN2,IGON,
+     +J1,J2,IS,IE,KBIN,KBOUT,MINS,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER ANSRV,ANSC
+      INTEGER CHOICE,SPAN
+      IOK = 1
+      CHOICE = ANSRV
+      CALL YESONO(CHOICE,'Find direct repeats',
+     +'Find inverted repeats',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.0) RETURN
+      ANSRV = CHOICE
+      CALL YESNO(ANSC,'Keep picture',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSC.LT.0) RETURN
+      IF(ANSC.EQ.1)CALL CLEARG
+      CALL GETINT(MINSP,MAXSP,SPAN,'Span',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      SPAN = IVAL
+      IS = J1
+      IE = J2
+      CALL GTREG(KBIN,KBOUT,0,J2,IS,IE,'Define restricted region',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL SQPF5(SPAN,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PR,MINS,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GETINT(MNIR1,MXIR1,IRAN1,'Range start',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IRAN1 = IVAL
+      MNIR2 = MAX(MNIR2,IRAN1)
+      MXIR2 = MAX(MNIR2,MXIR2)
+      IRAN2 = MAX(IRAN2,MNIR2)
+      CALL GETINT(MNIR2,MXIR2,IRAN2,'Range end',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IRAN2 = IVAL
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+      IOK = 0
+      END
+      SUBROUTINE LOCALP(SEQNCE,IDIM1,STRING,MATCH,
+     +ITOT,ITOTEL,ITOTID,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KSTART,
+     +ANSRV,SPAN,IRAN1,IRAN2,IGON,MINS,
+     +J1,J2,IS,IE,KBIN,KBOUT,IDEV)
+C   AUTHOR: RODGER STADEN
+      INTEGER SPAN
+      CHARACTER SEQNCE(IDIM1),STRING(SPAN),MATCH(SPAN)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER ANSRV
+      XMIN=J1
+      XMAX=J2
+      YMIN=0.
+      YMAX=SPAN
+      CALL BUSY(KBOUT)
+      IF(IGON.EQ.0)THEN
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+C LAST START POSITION
+      ILAST=IE-SPAN+1
+C   HOW MANY LOOPS?
+      IT1=IRAN1+SPAN-1
+      IT2=IRAN2+SPAN+SPAN-2
+      DO 50 J=IS,IE
+        IS1=J+IT1
+        IE1=J+IT2
+        IF(IE1.GT.IE)IE1=IE
+        IF(IS1.GT.ILAST)GO TO 51
+        CALL SQCOPY(SEQNCE(J-KSTART+1),STRING,SPAN)
+        IF(ANSRV.EQ.1)THEN
+          CALL SQREV(STRING,SPAN)
+          CALL SQCOM(STRING,SPAN)
+        END IF
+        CALL SQFIT(SEQNCE,IDIM1,STRING,SPAN,ITOT,ITOTEL,ITOTID,
+     +  IS1,IE1,MINS,ITOTP,KSTART)
+        IF(ITOTP.GT.ITOTID) THEN
+          IF(IGON.EQ.0)CALL VT100M
+          WRITE(KBOUT,1017)ITOTID
+1017      FORMAT(/10X,'More than',I4,' matches.',
+     +    ' Try changing percentage or region'/)
+          RETURN
+        END IF
+        IF(ITOTP.GT.0)THEN
+          IF(IGON.EQ.0)THEN
+            X1=J
+            DO 41 I=1,ITOTP
+              X=ITOTEL(I)
+              Y=ITOT(I)
+              CALL LINE(X1,X,Y,YMIN,
+     +        XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,
+     +        MARGT,ISXMAX,ISYMAX)
+41          CONTINUE
+          END IF
+          IF(IGON.EQ.1)THEN
+            DO 49 I=1,ITOTP
+              K=ITOTEL(I)-KSTART+1
+              WRITE(IDEV,1008)
+1008          FORMAT(/)
+              CALL SQMTCH(SEQNCE(K),STRING,MATCH,SPAN)
+              CALL FMT4LN(SEQNCE(K),STRING,MATCH,
+     +        SPAN,ITOTEL(I),J,IDEV)
+49          CONTINUE
+          END IF
+        END IF
+50    CONTINUE
+51    CONTINUE
+      IF(IGON.EQ.0)CALL VT100M
+      END
+C  SETGEN
+      SUBROUTINE SETGEN(PAAS,PAA,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER PAAS(5,5,5),PAA(5,5,5)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),UGA(3),AUA(3),CUA(3),AGA(3),AGG(3)
+      INTEGER CTONUM
+      PARAMETER (MAXPRM = 23)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      EXTERNAL CTONUM
+      SAVE UGA,AUA,CUA,AGA,AGG
+      DATA UGA,AUA,CUA,AGA,AGG/'T','G','A','A','T','A','C','T','A',
+     +'A','G','A','A','G','G'/
+      NUM = 1
+C   SET STANDARD CODE
+      DO 10 I=1,5
+      DO 10 J=1,5
+      DO 10 K=1,5
+        PAA(I,J,K)=PAAS(I,J,K)
+10    CONTINUE
+      PROMPT(1) = 'Standard'
+      PROMPT(2) = 'Mammalian mitochondrial'
+      PROMPT(3) = 'Yeast mitochondrial'
+      PROMPT(4) = 'Personal'
+      CALL RADION('Select genetic code',PROMPT,4,NUM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(NUM.LT.2) RETURN
+      IF(NUM.EQ.2)THEN
+         PAA(CTONUM(AUA(3)),CTONUM(AUA(2)),CTONUM(AUA(1)))='M'
+         PAA(CTONUM(UGA(3)),CTONUM(UGA(2)),CTONUM(UGA(1)))='W'
+         PAA(CTONUM(AGA(3)),CTONUM(AGA(2)),CTONUM(AGA(1)))='*'
+         PAA(CTONUM(AGG(3)),CTONUM(AGG(2)),CTONUM(AGG(1)))='*'
+      ELSE IF(NUM.EQ.3)THEN
+         PAA(CTONUM(CUA(3)),CTONUM(CUA(2)),CTONUM(CUA(1)))='T'
+         PAA(CTONUM(AUA(3)),CTONUM(AUA(2)),CTONUM(AUA(1)))='M'
+         PAA(CTONUM(UGA(3)),CTONUM(UGA(2)),CTONUM(UGA(1)))='W'
+      ELSE IF(NUM.EQ.4)THEN
+         CALL GETCOD(PAA,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      END IF
+      END
+      SUBROUTINE GETCOD(PAA,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER PAA(5,5,5),AA,CODON*3,AIN,AAA
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Define genetic code by typing a codon',
+     +/,' followed by a 1 letter amino acid symbol')
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Codon',' ',CODON,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+         CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+         GO TO 1
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      AA = 
+     +PAA(CTONUM(CODON(3:3)),CTONUM(CODON(2:2)),CTONUM(CODON(1:1)))
+      IF(AA.NE.'-')THEN
+2       CONTINUE
+         L = 1
+         AAA = AA
+         CALL GTSTR('Amino acid symbol',AAA,AIN,L,KBOUT,KBIN,INFLAG)
+         IF(INFLAG.EQ.1) THEN
+           CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+           GO TO 2
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(L.GT.0) AAA = AIN
+        PAA(CTONUM(CODON(3:3)),CTONUM(CODON(2:2)),
+     +  CTONUM(CODON(1:1))) = AAA
+        GO TO 1
+      END IF
+      GO TO 1
+      END
+      SUBROUTINE DINUCF(SEQ,IDIM,J1,J2,IDEV,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),BASE(4)
+      REAL OBS(5,5),EXPEC(5,5),COMP(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE BASE
+      DATA BASE/'T','C','A','G'/
+      CALL SHOWFU(KBOUT,'Calculate dinucleotide frequencies')
+      DO 10 I=1,5
+        COMP(I)=0.
+        DO 10 J=1,5
+          EXPEC(I,J)=0.
+          OBS(I,J)=0.
+10    CONTINUE
+      DO 20 I=J1,J2-1
+        K0=CTONUM(SEQ(I))
+        K1=CTONUM(SEQ(I+1))
+        COMP(K0)=COMP(K0)+1.
+        OBS(K0,K1)=OBS(K0,K1)+1.
+20    CONTINUE
+      TOT=J2-J1
+      DO 25 I=1,5
+        COMP(I)=COMP(I)/TOT
+25    CONTINUE
+      TOT=TOT/100.
+      DO 26 I=1,5
+      DO 26 J=1,5
+        EXPEC(I,J)=COMP(I)*COMP(J)
+26    CONTINUE
+      DO 30 I=1,5
+      DO 30 J=1,5
+        EXPEC(I,J)=EXPEC(I,J)*100.
+        OBS(I,J)=OBS(I,J)/TOT
+30    CONTINUE
+      WRITE(IDEV,1002)BASE
+      WRITE(IDEV,1001)
+1002  FORMAT(/10X,4(4X,A1,9X))
+1001  FORMAT(' ',7X,4(' Obs  Expected'))
+      DO 40 K=1,4
+        WRITE(IDEV,1000)BASE(K),(OBS(K,J),EXPEC(K,J),J=1,4)
+1000    FORMAT('     ',A1,8F7.2)
+40    CONTINUE
+      END
+      INTEGER FUNCTION ICOMP(IN)
+C   AUTHOR: RODGER STADEN
+      INTEGER COMPS(5)
+      SAVE COMPS
+      DATA COMPS/3,4,1,2,5/
+      ICOMP=COMPS(IN)
+      END
+C   lwrap2
+      INTEGER FUNCTION LWRAP2(IDIM,I)
+C   AUTHOR: RODGER STADEN
+      LWRAP2=I
+      IF(LWRAP2.LT.1)LWRAP2=LWRAP2+IDIM+1
+      IF(LWRAP2.GT.IDIM)LWRAP2=LWRAP2-IDIM
+      END
+C   LWRAP3
+      INTEGER FUNCTION LWRAP3(IDIM,I)
+C   AUTHOR: RODGER STADEN
+      LWRAP3=I
+      IF(LWRAP3.LT.1)LWRAP3=LWRAP3+IDIM
+      IF(LWRAP3.GT.IDIM)LWRAP3=LWRAP3-IDIM
+      END
+      SUBROUTINE NORMAA(SUM,TOTA,PAA)
+      CHARACTER PAA(5,5,5),AA(21)
+      REAL SUM(4,4,4)
+      SAVE AA
+      DATA AA/'A','C','D','E','F',
+     +'G','H','I','K','L',
+     +'M','N','P','Q','R',
+     +'S','T','V','W','Y','*'/
+C   LOOP FOR EACH ACID
+      DO 150 L=1,21
+        ASUM=0.
+C   LOOP FOR EACH CODON
+        DO 200 I=1,4
+        DO 200 J=1,4
+        DO 200 K=1,4
+C   COUNT CODONS
+          IF(PAA(K,J,I).EQ.AA(L))ASUM=ASUM+SUM(I,J,K)
+200     CONTINUE
+        IF(ASUM.NE.0.0)ASUM=TOTA/ASUM
+C   LOOP FOR EACH CODON
+        DO 100 I=1,4
+        DO 100 J=1,4
+        DO 100 K=1,4
+C   NORMALIZE
+          IF(PAA(K,J,I).EQ.AA(L))SUM(I,J,K)=SUM(I,J,K)*ASUM
+100     CONTINUE
+150   CONTINUE
+      END
+      SUBROUTINE PPROM1(SEQ,IDIM,ISXMAX,ISYMAX,I1,I2,MARGL,MARGR,
+     + MARGB,MARGT,IDEV,FILEIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILEIN*(*)
+      INTEGER CTONUM
+      REAL WTM35(25,5),WTM10(19,5),WTP1(12,5),PENALT(7)
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'E coli promoter search')
+      CALL RPPROM(WTM35,WTM10,WTP1,IDEV,FILEIN,IFAIL)
+      IF(IFAIL.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      IMARG=NINT(MARGT/2.0)
+      MARGB2=MARGB+IMARG
+      MARGT2=IMARG
+      MARGB1=MARGB
+      MARGT1=IMARG
+      PENALT(1)=0.02
+      PENALT(2)=0.2
+      PENALT(3)=1.0
+      PENALT(4)=0.2
+      PENALT(5)=0.05
+      PENALT(6)=0.02
+      PENALT(7)=0.01
+      DO 1 I=1,7
+        PENALT(I)=LOG(PENALT(I))
+1     CONTINUE
+      CUTM35=-36.85
+      CUTM10=-28.13
+      CUTP1=-21.8
+      XMAX=I2
+      XMIN=I1
+      YMIN=-77.3
+      YMAX=-60.4
+      YMIN1=-77.3*0.01
+      YMAX1=-60.4*0.01
+      YMIN1=-77.3
+      YMAX1=-60.4
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      DO 100 I=I1,I2-65
+C   DO -35 REGION
+        SUMM35=0.
+        K=0
+        DO 5 J=I,I+24
+          K=K+1
+          SUMM35=SUMM35+WTM35(K,CTONUM(SEQ(J)))
+5       CONTINUE
+C  HIGH ENOUGH?
+        IF(SUMM35.GE.CUTM35)THEN
+C  YES, TRY -10 REGION
+          BESTM1=-99999.
+          DO 20 J=I+25,I+25+6
+            K=0
+            SUMM10=0.
+            DO 10 L=J,J+18
+              K=K+1
+              SUMM10=SUMM10 + WTM10(K,CTONUM(SEQ(L)))
+10          CONTINUE
+C   IS THIS -10 SCORE HIGH ENOUGH?
+            IF((SUMM10).GE.CUTM10)THEN
+C   YES, IS IT THE HIGHEST?
+              IF(SUMM10.GT.BESTM1)THEN
+                BESTM1=SUMM10
+                IBEST=J
+                IBESTA=IBEST-I-24
+              END IF
+            END IF
+20        CONTINUE
+C   -10 DONE, HIGH ENOUGH SCORE?
+          IF(BESTM1.GE.CUTM10)THEN
+C   YES, DO +1 REGION
+            BESTP1=-99999.
+            DO 40 J=IBEST+19,IBEST+19+4
+              SUMP1=0.
+              K=0
+              DO 30 L=J,J+11
+                K=K+1
+                SUMP1=SUMP1 + WTP1(K,CTONUM(SEQ(L)))
+30            CONTINUE
+C   HIGH ENOUGH?
+              IF(SUMP1.GE.CUTP1)THEN
+C   YES, HIGHEST?
+                IF(SUMP1.GT.BESTP1)THEN
+                  BESTP1=SUMP1
+                  JBEST=J
+                END IF
+              END IF
+40          CONTINUE
+C   +1 REGION DONE, HIGH ENOUGH SCORE?
+            IF(BESTP1.GE.CUTP1)THEN
+C   YES, SO ALL HIGH ENOUGH
+              X=JBEST+2
+              Y=SUMM35+BESTM1+BESTP1
+              CALL LINE(X,X,YMIN,Y,XMAX,XMIN,
+     +        YMAX,YMIN,
+     +        MARGL,MARGR,MARGB2,MARGT2,
+     +        ISXMAX,ISYMAX)
+              Y=Y+PENALT(IBESTA)
+              CALL LINE(X,X,YMIN1,Y,XMAX,XMIN,
+     +        YMAX1,YMIN1,
+     +        MARGL,MARGR,MARGB1,MARGT1,
+     +        ISXMAX,ISYMAX)
+            END IF
+          END IF
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE RPPROM(WTM35,WTM10,WTP1,IDEV,FILEIN,IFAIL)
+C   AUTHOR: RODGER STADEN
+      REAL WTM35(25,5),WTM10(19,5),WTP1(12,5)
+      INTEGER WT(25),TOT(25)
+      CHARACTER FILEIN*(*)
+      IFAIL=0
+1001  FORMAT(2X,25I3)
+      CALL OPENRS(IDEV,FILEIN,IOK,LRECL,2)
+      IF(IOK.NE.0)GO TO 100
+C   READ SPACE,P
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1000,ERR=100)
+1000  FORMAT()
+      READ(IDEV,1001,ERR=100)TOT
+      DO 5 I=1,4
+        READ(IDEV,1001,ERR=100)(WT(K),K=1,25)
+        DO 5 J=1,25
+          IF(WT(J).NE.0)THEN
+            WTM35(J,I) = LOG(FLOAT(WT(J))/FLOAT(TOT(J)))
+          ELSE
+            WTM35(J,I)=LOG(0.5/FLOAT(TOT(J)))
+          END IF
+5     CONTINUE
+C   DO 5'S
+      DO 20 I=1,25
+        WTM35(I,5)=LOG(0.25/TOT(I))
+20    CONTINUE
+C   do -10 region
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1001)(TOT(K),K=1,19)
+      DO 30 I=1,4
+        READ(IDEV,1001,ERR=100)(WT(K),K=1,19)
+        DO 30 J=1,19
+          IF(WT(J).NE.0)THEN
+            WTM10(J,I) = LOG(FLOAT(WT(J))/FLOAT(TOT(J)))
+          ELSE
+            WTM10(J,I)=LOG(0.5/FLOAT(TOT(J)))
+          END IF
+30    CONTINUE
+C   do 5's
+      DO 35 I=1,19
+        WTM10(I,5)=LOG(0.25/TOT(I))
+35    CONTINUE
+C   DO +1 REGION
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1000,ERR=100)
+      READ(IDEV,1001,ERR=100)(TOT(K),K=1,12)
+      DO 40 I=1,4
+        READ(IDEV,1001,ERR=100)(WT(K),K=1,12)
+        DO 40 J=1,12
+          IF(WT(J).NE.0)THEN
+            WTP1(J,I) = LOG(FLOAT(WT(J))/FLOAT(TOT(J)))
+          ELSE
+            WTP1(J,I)=LOG(0.5/FLOAT(TOT(J)))
+          END IF
+40    CONTINUE
+C   DO 5'S
+      DO 45 I=1,12
+        WTP1(I,5)=LOG(0.25/TOT(I))
+45    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+      IFAIL=1
+      END
+      SUBROUTINE PPROM2(SEQ,IDIM,ISXMAX,ISYMAX,I1,I2,MARGL,MARGR,
+     +MARGB,MARGT,IDEV,FILEIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILEIN*(*)
+      REAL WTM35(25,5),WTM10(19,5),WTP1(12,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'E coli promoter -35 and -10 regions search')
+      CALL RPPROM(WTM35,WTM10,WTP1,IDEV,FILEIN,IFAIL)
+      IF(IFAIL.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      IMARG=NINT(MARGT/2.0)
+      MARGB2=MARGB+IMARG
+      MARGT2=IMARG
+      MARGB1=MARGB
+      MARGT1=IMARG
+      XMAX=I2
+      XMIN=I1
+      YMIN35=-38.0
+      YMAX35=-23.5
+      YMIN1=-29.0
+      YMAX1=-18.4
+      YMID35=YMIN35+(YMAX35-YMIN35)/2.
+      YMID1=YMIN1+(YMAX1-YMIN1)/2.
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL FRAME(MARGL,MARGR,MARGB2,MARGT2,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID35,'-35',3,0,XMAX,XMIN,YMAX35,YMIN35,
+     +MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,YMID1,'-10',3,0,XMAX,XMIN,YMAX1,YMIN1,
+     +MARGL,MARGR,MARGB2,MARGT2,ISXMAX,ISYMAX)
+C
+      DO 100 I=I1,I2-25
+C   DO -35 REGION
+        SUMM35=0.
+        K=0
+        DO 5 J=I,I+24
+          K=K+1
+          SUMM35=SUMM35+WTM35(K,CTONUM(SEQ(J)))
+5       CONTINUE
+        X=I+48
+        Y=SUMM35
+        CALL LINE(X,X,YMIN35,Y,XMAX,XMIN,YMAX35,YMIN35,
+     +  MARGL,MARGR,MARGB2,MARGT2,ISXMAX,ISYMAX)
+100   CONTINUE
+      DO 200 I=I1,I2-19
+        SUMM10=0.
+        K=0
+        DO 20 J=I,I+18
+          K=K+1
+          SUMM10=SUMM10+WTM10(K,CTONUM(SEQ(J)))
+20      CONTINUE
+        X=I+22
+        Y=SUMM10
+        CALL LINE(X,X,YMIN1,Y,XMAX,XMIN,YMAX1,YMIN1,
+     +  MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+200   CONTINUE
+      CALL VT100M
+      END
+C  PROMOTERS ON THE COMPLEMENTARY STRAND
+      SUBROUTINE PPROM3(SEQ,IDIM,ISXMAX,ISYMAX,I1,I2,MARGL1,MARGR1,
+     +MARGB1,MARGT1,IDEV,FILEIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILEIN*(*)
+      REAL WTM35(25,5),WTM10(19,5),WTP1(12,5)
+      REAL WTM35C(25,5),WTM10C(19,5),WTP1C(12,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'E coli promters on the complementary strand')
+      CALL RPPROM(WTM35,WTM10,WTP1,IDEV,FILEIN,IFAIL)
+      IF(IFAIL.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+C   REVERSE AND COMPLEMENT MATRICES
+      DO 5 I=1,25
+      DO 5 J=1,2
+        WTM35C(I,J+2)=WTM35(26-I,J)
+        WTM35C(I,J)=WTM35(26-I,J+2)
+5     CONTINUE
+      DO 6 I=1,19
+      DO 6 J=1,2
+        WTM10C(I,J+2)=WTM10(20-I,J)
+        WTM10C(I,J)=WTM10(20-I,J+2)
+6     CONTINUE
+      DO 7 I=1,12
+      DO 7 J=1,2
+        WTP1C(I,J+2)=WTP1(13-I,J)
+        WTP1C(I,J)=WTP1(13-I,J+2)
+7     CONTINUE
+      DO 1 I = 1,25
+        WTM35C(I,5)=WTM35(26-I,5)
+1     CONTINUE
+      DO 2 I = 1,19
+        WTM10C(I,5)=WTM10(20-I,5)
+2     CONTINUE
+      DO 3 I = 1,12
+        WTP1C(I,5)=WTP1(13-I,5)
+3     CONTINUE
+      CUTM35=-36.85
+      CUTM10=-28.13
+      CUTP1=-21.8
+      XMAX=I2
+      XMIN=I1
+      YMIN=-77.3
+      YMAX=-60.4
+      CALL CLEARV
+      CALL VECTOM
+      CALL FRAME(MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX)
+      DO 100 I=I1,I2-65
+C   DO +1 REGION
+        SUMP1=0.
+        K=0
+        DO 8 J=I,I+11
+          K=K+1
+          SUMP1=SUMP1+WTP1C(K,CTONUM(SEQ(J)))
+8       CONTINUE
+C  HIGH ENOUGH?
+        IF(SUMP1.GE.CUTP1)THEN
+C  YES, TRY -10 REGION
+          BESTM1=-99999.
+          DO 20 J=I+12,I+12+4
+            K=0
+            SUMM10=0.
+            DO 10 L=J,J+18
+              K=K+1
+              SUMM10=SUMM10 + WTM10C(K,CTONUM(SEQ(L)))
+10          CONTINUE
+C   IS THIS -10 SCORE HIGH ENOUGH?
+            IF(SUMM10.GE.CUTM10)THEN
+C   YES, IS IT THE HIGHEST?
+              IF(SUMM10.GT.BESTM1)THEN
+                BESTM1=SUMM10
+                IBEST=J
+              END IF
+            END IF
+20        CONTINUE
+C   -10 DONE, HIGH ENOUGH SCORE?
+          IF(BESTM1.GE.CUTM10)THEN
+C   YES, DO -35 REGION
+            BESTM3=-99999.
+            DO 40 J=IBEST+19,IBEST+19+6
+              SUMM35=0.
+              K=0
+              DO 30 L=J,J+24
+                K=K+1
+                SUMM35=SUMM35+WTM35C(K,CTONUM(SEQ(L)))
+30            CONTINUE
+C   HIGH ENOUGH?
+              IF(SUMM35.GE.CUTM35)THEN
+C   YES, HIGHEST?
+                IF(SUMM35.GT.BESTM3)THEN
+                  BESTM3=SUMM35
+                END IF
+              END IF
+40          CONTINUE
+C   -35 REGION DONE, HIGH ENOUGH SCORE?
+            IF(BESTM3.GE.CUTM35)THEN
+C   YES, SO ALL HIGH ENOUGH
+              X=I+10
+              Y=SUMP1+BESTM3+BESTM1
+              CALL LINE(X,X,YMIN,Y,
+     +        XMAX,XMIN,YMAX,YMIN,
+     +        MARGL1,MARGR1,MARGB1,
+     +        MARGT1,ISXMAX,ISYMAX)
+            END IF
+          END IF
+        END IF
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE PRIBS(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,I1,I2,IDEV,FILE1,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILE1*(*)
+      INTEGER WEIGHT(505),MARGB1(3),MARGT1(3),CTONUM
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'Search for E coli ribosome binding sites')
+C   SORT OUT MARGIN IE FIND MAX AND DIVIDE BY THREE
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1(1)=MARGB
+      MARGB1(2)=MARGB+IMARG
+      MARGB1(3)=MARGB+2*IMARG
+      MARGT1(1)=IMARG
+      MARGT1(2)=IMARG
+      MARGT1(3)=IMARG
+      YMAX=100.
+      YMIN=0.
+      CALL RPRIBS(IDEV,FILE1,WEIGHT,IOK,KBOUT)
+      IF(IOK.NE.0)RETURN
+      MINSMN = -50
+      MINSMX = 100
+      MINSI = 2
+      CALL GETINT(MINSMN,MINSMX,MINSI,'Minimum score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINS = IVAL
+      XMAX=I2
+      XMIN=I1
+      Y=0.
+      YP=20.0
+      CALL CLEARV
+      CALL BUSY(KBOUT)
+      CALL VECTOM
+      DO 10 I=I1,I2-100
+        ISUM=0
+        K=0
+        II=I
+        III=I+100
+        DO 5 J=II,III
+          K=K+1
+          KK=CTONUM(SEQ(J))-1
+          KK=KK*101
+          IP=K+KK
+          ISUM=ISUM+WEIGHT(IP)
+5       CONTINUE
+        IF(ISUM.GT.MINS)THEN
+          X=I+60
+          IMARG=1+MOD((II-I1),3)
+          CALL LINE(X,X,Y,YP,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB1(IMARG),MARGT1(IMARG),
+     +    ISXMAX,ISYMAX)
+        END IF
+10    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE RPRIBS(IDEV,FILE1,WT,IOK,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER WT(505)
+      CHARACTER FILE1*(*)
+1001  FORMAT(2X,25I3)
+      CALL OPENRS(IDEV,FILE1,IOK,LRECL,2)
+      IF(IOK.NE.0)GO TO 100
+      DO 20 J=1,4
+C   read space,p
+        READ(IDEV,1000,END=200,ERR=200)
+        READ(IDEV,1000,END=200,ERR=200)
+1000    FORMAT()
+        JJ=(J-1)*25
+        DO 5 I=1,4
+          II=(I-1)*101
+          K1=JJ+II+1
+          READ(IDEV,1001,END=200,ERR=200)(WT(K),K=K1,K1+24)
+5       CONTINUE
+20    CONTINUE
+C   do last bit
+      READ(IDEV,1000,END=200,ERR=200)
+      READ(IDEV,1000,END=200,ERR=200)
+      K1=101
+1002  FORMAT(2X,I3)
+      DO 30 I=1,4
+        READ(IDEV,1002,ERR=200,END=200)WT(K1)
+        K1=K1+101
+30    CONTINUE
+      CLOSE(UNIT=IDEV)
+      DO 40 I=405,505
+        WT(I)=0
+40    CONTINUE
+      RETURN
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error opening file')
+      RETURN
+200   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      IOK = 1
+      END
+C   SCANS FOR EUKARYOTIC RIBOSOME BINDING SITES
+      SUBROUTINE RIBEUK(SEQ,IDIM,ISXMAX,ISYMAX,J1,J2,
+     +MARGL,MARGR,MARGB,MARGT,IDEV,FILEIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILEIN*(*)
+      REAL WT(10,5)
+      INTEGER CTONUM,MARGB1(3),MARGT1(3)
+      EXTERNAL CTONUM
+      CALL SHOWFU(KBOUT,'Search for eukaryotic ribosome binding sites')
+      CALL RRIBEU(WT,IDEV,FILEIN,IFAIL)
+      IF(IFAIL.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+C   SORT OUT MARGIN IE FIND MAX AND DIVIDE BY THREE
+      IMARG=NINT(MARGT/3.)
+      MARGL1=MARGL
+      MARGR1=MARGR
+      MARGB1(1)=MARGB
+      MARGB1(2)=MARGB+IMARG
+      MARGB1(3)=MARGB+2*IMARG
+      MARGT1(1)=IMARG
+      MARGT1(2)=IMARG
+      MARGT1(3)=IMARG
+      YMIN=-9.0
+      YMAX=0.0
+      XMAX=J2
+      XMIN=J1
+      CALL CLEARV
+      CALL VECTOM
+C   ADD 2 TO MAKE PHASE CORRECT
+      I1=J1+1
+      DO 20 II=1,3
+        DO 10 I=I1+II,J2-10,3
+          IF((SEQ(I+7).EQ.'A').AND.(SEQ(I+8).EQ.'T').
+     +    AND.(SEQ(I+9).EQ.'G'))THEN
+            SUM=0.
+            K=0
+            DO 5 J=I,I+9
+              K=K+1
+              SUM=SUM+WT(K,CTONUM(SEQ(J)))
+5           CONTINUE
+            X=I+7
+            CALL LINE(X,X,YMIN,SUM,XMAX,
+     +      XMIN,YMAX,YMIN,
+     +      MARGL1,MARGR1,MARGB1(II),
+     +      MARGT1(II),ISXMAX,ISYMAX)
+          END IF
+10      CONTINUE
+20    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE RRIBEU(WTR,IDEV,FILEIN,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILEIN*(*)
+      REAL WTR(10,5)
+      INTEGER WT(10),TOT(10)
+      IFAIL=0
+1001  FORMAT(2X,10I3)
+      CALL OPENRS(IDEV,FILEIN,IOK,LRECL,2)
+      IF(IOK.NE.0)GO TO 50
+C   read space,p
+      READ(IDEV,1000,ERR=50)
+      READ(IDEV,1000,ERR=50)
+1000  FORMAT()
+      READ(IDEV,1001,ERR=50)TOT
+      DO 6 I=1,4
+        READ(IDEV,1001,ERR=50)(WT(K),K=1,10)
+        DO 5 J=1,10
+          IF(WT(J).NE.0)THEN
+            WTR(J,I)=LOG(FLOAT(WT(J))/FLOAT(TOT(J)))
+          ELSE
+            WTR(J,I)=LOG(1./FLOAT(TOT(J)))
+          END IF
+5       CONTINUE
+6     CONTINUE
+      CLOSE(UNIT=IDEV)
+      Z=LOG(1./102.)
+      DO 40 I=1,10
+        WTR(I,5)=Z
+40    CONTINUE
+      RETURN
+50    CONTINUE
+      IFAIL=1
+      END
+      SUBROUTINE FSPLIC(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEV,DEFFIL,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,
+     +SUM,WTD,WTA,CEXACD,CEXACA,PEXACD,PEXACA,
+     +TOT,IDM,MAXLEN,LINE,CHRSET,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),HELPF*(*),DEFFIL*(*)
+      CHARACTER LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN),DIALOG,ANSFIL
+      REAL WTA(IDM,MAXLEN),WTD(IDM,MAXLEN)
+      INTEGER CEXACD(MAXLEN),PEXACD(MAXLEN)
+      INTEGER CEXACA(MAXLEN),PEXACA(MAXLEN)
+      CALL SHOWFU(KBOUT,'Splice search using weight matrix')
+      CALL GETPAR(62,1,IOK,IGON,
+     +IPAR2,IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL SPLICD(IDEV,DEFFIL,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WTD,LENGTD,MIDDLD,YMIND,YMAXD,
+     +WTA,LENGTA,MIDDLA,YMINA,YMAXA,
+     +IGON,IDEVOT,LINE,CHRSET,DIALOG,ANSFIL,IOK,
+     +CEXACD,PEXACD,IEXACD,CEXACA,PEXACA,IEXACA)
+      IF(IOK.NE.0) RETURN
+      MARGTD = MARGT/2
+      CALL SPLICP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGTD,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WTD,IDM,MAXLEN,LENGTD,
+     +YMIND,YMAXD,IGON,MIDDLD,0,
+     +CEXACD,PEXACD,IEXACD)
+      MARGBA = MARGB + MARGTD
+      CALL SPLICP(SEQ,IDIM,MARGL,MARGR,MARGBA,MARGTD,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WTA,IDM,MAXLEN,LENGTA,
+     +YMINA,YMAXA,IGON,MIDDLA,1,
+     +CEXACA,PEXACA,IEXACA)
+      END
+      SUBROUTINE SPLICD(IDEV,DEFFIL,FILNAM,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +SUM,IDM,MAXLEN,TOT,WTD,LENGTD,MIDDLD,YMIND,YMAXD,
+     +WTA,LENGTA,MIDDLA,YMINA,YMAXA,
+     +IGON,IDEVOT,LINE,CHRSET,DIALOG,ANSFIL,IOK,
+     +CEXACD,PEXACD,IEXACD,CEXACA,PEXACA,IEXACA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),DEFFIL*(*)
+      CHARACTER LINE(MAXLEN),CHRSET(IDM)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN),DIALOG,ANSFIL
+      INTEGER CEXACD(MAXLEN),PEXACD(MAXLEN)
+      INTEGER CEXACA(MAXLEN),PEXACA(MAXLEN)
+      REAL WTA(IDM,MAXLEN),WTD(IDM,MAXLEN)
+      IOK = 1
+      ANSFIL = 0
+      IF(DIALOG.EQ.1) THEN
+        CALL YESNO(ANSFIL,'Use default weights file',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(ANSFIL.LT.0) RETURN
+      END IF
+      IF(ANSFIL.EQ.1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Splice site weight matrix file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+      ELSE
+        CALL OPENRS(IDEV,DEFFIL,IOK,LRECL,2)
+        IF(IOK.NE.0) RETURN
+      END IF
+      LENGTD=MAXLEN
+      CALL RDWMTN(TOT,SUM,MIDDLD,LENGTD,MAXLEN,YMIND,YMAXD,IDEV,
+     +  IOK,IDM,KBOUT,0)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      CALL GETWC(TOT,SUM,LENGTD,IDM,MAXLEN,CEXACD,PEXACD,IEXACD)
+      IF(YMIND.LT.0.0)CALL GETW(TOT,SUM,WTD,LENGTD,IDM,MAXLEN)
+      IF(YMIND.GE.0.0)CALL GETW2(SUM,WTD,LENGTD,IDM,MAXLEN)
+      LENGTA=MAXLEN
+      CALL RDWMTN(TOT,SUM,MIDDLA,LENGTA,MAXLEN,YMINA,YMAXA,IDEV,
+     +  IOK,IDM,KBOUT,0)
+        CLOSE(UNIT=IDEV)
+      IF(IOK.NE.0)THEN
+        CALL ERROM(KBOUT,'Error in weight matrix')
+        RETURN
+      END IF
+      CALL GETWC(TOT,SUM,LENGTA,IDM,MAXLEN,CEXACA,PEXACA,IEXACA)
+      IF(YMINA.LT.0.0)CALL GETW(TOT,SUM,WTA,LENGTA,IDM,MAXLEN)
+      IF(YMINA.GE.0.0)CALL GETW2(SUM,WTA,LENGTA,IDM,MAXLEN)
+C assume donors in first file: and that the first and last bases in
+C exons are marked as middle. Subtract 1 from middla to make frames
+C equivalent, but send midc to splicp as a correction so that positions
+C of matches are positions on first and last bases in exons. midc=0 for
+C donors and 1 for acceptors
+      MIDDLA = MIDDLA - 1
+      IF(DIALOG.EQ.1) THEN
+        AMN = -9999.
+        AMX = 9999.
+        CALL GETRL(AMN,AMX,YMIND,'Donor cutoff score',VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        YMIND = VAL
+        CALL GETRL(AMN,AMX,YMINA,'Acceptor cutoff score',VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        YMINA = VAL
+        IOK = 1
+        CALL YESONO(IGON,'Plot results','List results',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IGON.LT.0)RETURN
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE SPLICP(SEQ,IDIM,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,J1,J2,IDEVOT,WT,IDM,MAXLEN,LENGTH,
+     +YMIN,YMAX,IGON,MIDDLE,MIDC,CEXACT,PEXACT,IEXACT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,MAXLEN)
+      INTEGER CEXACT(IEXACT),PEXACT(IEXACT)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,MATWTC
+      XMAX=J2
+      XMIN=J1
+      IF(IGON.EQ.0)THEN
+        CALL CLEARV
+        CALL VECTOM
+        IMARG = MARGT/3
+        CALL FRAME(MARGL,MARGR,MARGB,IMARG,ISXMAX,ISYMAX)
+        CALL FRAME(MARGL,MARGR,MARGB+IMARG,IMARG,ISXMAX,ISYMAX)
+        CALL FRAME(MARGL,MARGR,MARGB+(2*IMARG),IMARG,ISXMAX,ISYMAX)
+      END IF
+      I = J1 - 1
+ 1    CONTINUE
+      I = I + 1
+      IF(IEXACT.GT.0) I = MATWTC(SEQ,IDIM,J1,J2,I,CEXACT,PEXACT,IEXACT)
+      IF((I+LENGTH-2).LT.J2) THEN
+        SUM1=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          SUM1=SUM1+WT(CTONUM(SEQ(J)),K)
+5       CONTINUE
+        IF(SUM1.GE.YMIN)THEN
+          IF(IGON.EQ.0)THEN
+            M = I + MIDDLE
+            X = M + MIDC
+            M = MARGB + (MOD(M,3)*IMARG)
+            CALL LINE(X,X,YMIN,SUM1,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,M,IMARG,ISXMAX,ISYMAX)
+          END IF
+          IF(IGON.EQ.1)THEN
+            CALL VT100M
+            M = MOD((I+MIDDLE),3)
+            WRITE(IDEVOT,1010)I+MIDDLE+MIDC,M,SUM1,
+     +      (SEQ(K),K=I,I+LENGTH-1)
+1010        FORMAT(' ',I7,I2,F10.2,' ',120A1)
+          END IF
+        END IF
+        GO TO 1
+      END IF
+      CALL VT100M
+      END
+      SUBROUTINE SQCOM2(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),COMP*17
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      SAVE COMP
+      DATA COMP/'AGTC-YRWSKMDVBHN-'/
+      DO 100 I=1,IDIM
+        J = DTONUM(SEQ(I))
+        SEQ(I) = COMP(J:J)
+100   CONTINUE
+      END
+C      SQFIT
+      SUBROUTINE SQFIT(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MIN,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+C   ALLOWS R,Y AND - SPECIAL CHARACTERS
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+C   TRY ALL POSSIBLE POSITIONS FOR MATCHING AND SCORE FOR EACH
+C   POINT TO ARRAY ELEMENT CORRESPONDING TO FIRST BASE
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(SEQ(IP).EQ.STRING(J))THEN
+            NTOT=NTOT+1
+          ELSE IF (STRING(J).EQ.'R')THEN
+            IF((SEQ(IP).EQ.'A').OR.(SEQ(IP).EQ.'G')) NTOT=NTOT+1
+          ELSE IF (STRING(J).EQ.'Y')THEN
+            IF((SEQ(IP).EQ.'C').OR.(SEQ(IP).EQ.'T')) NTOT=NTOT+1
+          ELSE IF (STRING(J).EQ.'-')THEN
+            NTOT=NTOT+1
+          END IF
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MIN)THEN
+          ITOTP=ITOTP+1
+C   TEST FOR OVERFLOW
+          IF(ITOTP.GT.ITOTID)RETURN
+C   SAVE TOTAL AND POSITION
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+C      SQFIT6
+      SUBROUTINE SQFIT6(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINS,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER IUBM1
+      EXTERNAL IUBM1
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT = 0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + IUBM1(STRING(J),SEQ(IP))
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINS)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+      SUBROUTINE SRCHP(SEQ,IDIM1,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,J1,J2,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(6)
+      SAVE STRING
+      DATA STRING/'A','A','T','A','A','A'/
+      CALL SHOWFU(KBOUT,'PolyA search. i.e. AATAAA search')
+      XMIN=J1
+      XMAX=J2
+      YMIN=0.
+      YMAX=MARGT
+      Y0=0.
+      IBH=MARGT
+      CALL VECTOM
+      CALL SRCHP1(SEQ,IDIM1,J1,J2,STRING,6,1,
+     +XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,Y0,IBH)
+      CALL VT100M
+      END
+      SUBROUTINE SRCHP1(SEQ,IDIM1,J1,J2,STRING,IDIMS,INC,
+     +XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,Y,IBH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(6)
+      YP=Y+IBH
+      IAT=J1-INC
+10    CONTINUE
+      IAT=IAT+INC
+      IDIM=J2-IAT+1
+      IF(IDIM.LE.0)RETURN
+      CALL FIND6(SEQ(IAT),IDIM,STRING,IDIMS,INC,JMATCH)
+      IF(JMATCH.EQ.0)RETURN
+      IAT=IAT+JMATCH-1
+      X=IAT
+      CALL LINE(X,X,YP,Y,XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX)
+      GO TO 10
+      END
+      SUBROUTINE STOPSB(SEQ,IDIM1,J1,J2,MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),PAA(5,5,5)
+      IMARG=NINT(MARGT/2.)
+      MARGB3=MARGB
+      MARGB5=MARGB+IMARG
+      MARGT3=IMARG
+      MARGT5=IMARG
+      CALL STOPS(SEQ,IDIM1,J1,J2,
+     +MARGL,MARGR,MARGB5,MARGT5,
+     +ISXMAX,ISYMAX,PAA)
+      CALL STOPSC(SEQ,IDIM1,J1,J2,MARGL,MARGR,
+     +MARGB3,MARGT3,
+     +ISXMAX,ISYMAX,PAA)
+      END
+C
+C   STOPS
+      SUBROUTINE STOPS(SEQ,IDIM1,J1,J2,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STOP(3),SEQ(IDIM1),PAA(5,5,5),BASE(5)
+      SAVE BASE
+      DATA BASE/'T','C','A','G','-'/
+      CALL VECTOM
+      XMIN=J1
+      XMAX=J2
+      YMIN=0.
+C   DIVIDE SCREEN INTO 3 STRIPS AND PUT BLIPS AT 1/6,3/6,5/6
+      YMAX=ISYMAX-(MARGB1+MARGT1)
+      YMAX3=YMAX/3.
+      BLIPH=YMAX*0.02
+      DO 10 ICOD=1,5
+      DO 10 JCOD=1,5
+      DO 10 KCOD=1,5
+        IF(PAA(KCOD,JCOD,ICOD).EQ.'*')THEN
+C         NEED TO KNOW THE CODON
+          STOP(1)=BASE(ICOD)
+          STOP(2)=BASE(JCOD)
+          STOP(3)=BASE(KCOD)
+          BLIPF=YMAX3/2.0
+          CALL PSRCH(SEQ,IDIM1,J1,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+1
+          CALL PSRCH(SEQ,IDIM1,J11,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+2
+          CALL PSRCH(SEQ,IDIM1,J11,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+        END IF
+10    CONTINUE
+      CALL VT100M
+      END
+C   STOPSC
+      SUBROUTINE STOPSC(SEQ,IDIM1,J1,J2,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STOP(3),SEQ(IDIM1),PAA(5,5,5),BASE(5)
+      SAVE BASE
+      DATA BASE/'A','G','T','C','-'/
+      CALL VECTOM
+      XMIN=J1
+      XMAX=J2
+      YMIN=0.
+C   DIVIDE SCREEN INTO 3 STRIPS AND PUT BLIPS AT 1/12,5/12,9/12
+      YMAX=ISYMAX-(MARGB1+MARGT1)
+      YMAX3=YMAX/3.
+      BLIPH=YMAX*0.02
+      DO 10 ICOD=1,5
+      DO 10 JCOD=1,5
+      DO 10 KCOD=1,5
+        IF(PAA(KCOD,JCOD,ICOD).EQ.'*')THEN
+C         NEED TO KNOW THE CODON
+          STOP(1)=BASE(KCOD)
+          STOP(2)=BASE(JCOD)
+          STOP(3)=BASE(ICOD)
+          BLIPF=YMAX3/4.0
+          CALL PSRCH(SEQ,IDIM1,J1,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+1
+          CALL PSRCH(SEQ,IDIM1,J11,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+2
+          CALL PSRCH(SEQ,IDIM1,J11,J2,STOP,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+        END IF
+10    CONTINUE
+      CALL VT100M
+      END
+C
+C   STARTS
+      SUBROUTINE STARTS(SEQ,IDIM1,J1,J2,
+     +MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER CODON(3),SEQ(IDIM1),PAA(5,5,5),BASE(5)
+      SAVE BASE
+      DATA BASE/'T','C','A','G','-'/
+      CALL VECTOM
+      XMIN=J1
+      XMAX=J2
+C   DIVIDE SCREEN INTO 3 STRIPS AND PUT BLIPS AT 0.0,1/3,2/3
+      YMAX=ISYMAX-(MARGB1+MARGT1)
+      YMAX3=YMAX/3.
+      BLIPH=YMAX*0.013
+      YMIN=0.
+      DO 10 ICOD=1,5
+      DO 10 JCOD=1,5
+      DO 10 KCOD=1,5
+        IF(PAA(KCOD,JCOD,ICOD).EQ.'M')THEN
+C         NEED TO KNOW THE CODON
+          CODON(1)=BASE(ICOD)
+          CODON(2)=BASE(JCOD)
+          CODON(3)=BASE(KCOD)
+          BLIPF=0.0
+          CALL PSRCH(SEQ,IDIM1,J1,J2,CODON,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+1
+          CALL PSRCH(SEQ,IDIM1,J11,J2,CODON,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+          BLIPF=BLIPF+YMAX3
+          J11=J1+2
+          CALL PSRCH(SEQ,IDIM1,J11,J2,CODON,3,3,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL1,MARGR1,MARGB1,MARGT1,ISXMAX,ISYMAX,BLIPF,BLIPH)
+        END IF
+10    CONTINUE
+      CALL VT100M
+      END
+      CHARACTER*1 FUNCTION SCOMP(CHAR1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER CHAR1,CHARSU(5),CHARSL(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,ICOMP
+      SAVE CHARSU,CHARSL
+      DATA CHARSU/'T','C','A','G','-'/
+      DATA CHARSL/'t','c','a','g','-'/
+      JCHAR=CTONUM(CHAR1)
+      IF(CHAR1.EQ.CHARSU(JCHAR)) THEN
+        SCOMP=CHARSU(ICOMP(JCHAR))
+      ELSE
+        SCOMP=CHARSL(ICOMP(JCHAR))
+      END IF
+      END
+C    TRANS
+C   SUBROUTINE TO TRANSLATE A SEQUENCE IN 1 PHASE GIVEN START & STOP
+C   POSITIONS AND A SEQUENCE
+      SUBROUTINE TRANS(SEQNCE,IDIM1,IS,IE,OUTP,PAA,IDIM2,KSTART,
+     +JSTRAN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM1),PAA(125),OUTP(IDIM2),CODON(3),TRANF,TRANB
+      INTEGER POUT
+      EXTERNAL LWRAP,TRANF,TRANB
+      IX2=IE
+C   ADD ON TO END IF OVER JOIN. REALLY A COUNTER OF
+C   NUMBER OF ELEMENTS TO TRANSLATE. LWRAP TAKES CARE OF OVERFLOW.
+C   WRAP AROUND IF OVER JOIN
+      IF(IX2.LE.IS)IX2=IX2+IDIM1
+      DO 500 I=IS-KSTART+1,IX2-KSTART+1,3
+         DO 400 J=1,3
+           IP1=I+J-1
+           IP1=LWRAP(IDIM1,IP1)
+           CODON(J)=SEQNCE(IP1)
+400      CONTINUE
+         IP=I
+         IP=LWRAP(IDIM1,IP)
+         POUT=IP/3+1
+         IF(JSTRAN.EQ.0)OUTP(POUT)=TRANF(CODON,PAA)
+         IF(JSTRAN.EQ.1)OUTP(POUT)=TRANB(CODON,PAA)
+500   CONTINUE
+      END
+      SUBROUTINE TRANSD(SEQNCE,IDIM1,IS,IE,OUTP,IDIM2,PAA,JSTRAN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM1),PAA(125),OUTP(IDIM2),CODON(3),TRANF,TRANB
+      INTEGER POUT
+      EXTERNAL JCODNO,TRANF,TRANB,LWRAP
+      POUT=0
+C   SET UP LOOP VALUE
+      IX2=IE
+C   ADD ON TO END IF OVER JOIN. REALLY A COUNTER OF
+C   NUMBER OF ELEMENTS TO TRANSLATE. LWRAP TAKES CARE OF OVERFLOW.
+C   WRAP AROUND IF OVER JOIN
+      IF(IX2.LE.IS)IX2=IX2+IDIM1
+      DO 500 I=IS,IX2,3
+         DO 400 J=1,3
+           IP1=I+J-1
+           IP1=LWRAP(IDIM1,IP1)
+           CODON(J)=SEQNCE(IP1)
+400      CONTINUE
+         POUT=POUT+1
+         IF(JSTRAN.EQ.0)OUTP(POUT)=TRANF(CODON,PAA)
+         IF(JSTRAN.EQ.1)OUTP(POUT)=TRANB(CODON,PAA)
+500   CONTINUE
+      IDIM2=POUT
+      END
+      INTEGER FUNCTION JCODNO(SEQ,IDIM,I)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,LWRAP
+C   DECODE NEXT 3 CHARS TO GIVE POINTERS TO AMINO ACIDS
+      J0=I
+      J0=LWRAP(IDIM,J0)
+      J1=LWRAP(IDIM,J0+1)
+      J2=LWRAP(IDIM,J1+1)
+      K0=CTONUM(SEQ(J0))*25
+      K1=CTONUM(SEQ(J1))*5
+      K2=CTONUM(SEQ(J2))
+      JCODNO=K0+K1+K2-30
+      END
+      SUBROUTINE OPENTR(SEQ,IDIM,KSTART,IDEV,KBIN,KBOUT,PAA,
+     +OUTP,IDIMP,FRAMEC,POSNS,MAXPS,IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,DIALOG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIMP),PAA(125),OUTP(IDIMP)
+      CHARACTER HELPF*(*),FILNAM*(*),OPRATR*20
+      INTEGER FRAMEC(IDIMP),POSNS(MAXPS),ANSE,DIALOG
+      CALL SHOWFU(KBOUT,
+     +'Find open reading frames, translate and list')
+      CALL GETPAR(39,10,IOK,MINO,MINOPN,JSTRAN,
+     +MINLEN,MAXLEN,LINLEN,ITRAN,JTRAN,INUM,ANSE,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      MAXPOS = MAXPS
+      MAXO = IDIM/3
+      I1 = 1
+      I2 = IDIM
+      IF(DIALOG.EQ.1)CALL TRAND7(KBIN,KBOUT,I1,I2,ITRAN,
+     +MINLEN,MAXLEN,LINLEN,
+     +MINO,MAXO,MINOPN,JSTRAN,INUM,IDEVE,ANSE,JTRAN,FILNAM,
+     +IOK,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL FILLI(FRAMEC,IDIM,1)
+      SEQ(IDIM+1) = SEQ(1)
+      SEQ(IDIM+2) = SEQ(2)
+      IF(JTRAN.EQ.0) THEN
+        IF(ANSE.EQ.1) THEN
+          CALL TRANEM(SEQ,IDIM,I1,I2,FRAMEC,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+        ELSE IF(ANSE.EQ.2) THEN
+          CALL TRANEN(SEQ,IDIM,I1,I2,FRAMEC,JSTRAN,IDEVE,
+     +    POSNS,MAXPOS,KBOUT,OPRATR,IOK)
+        ELSE
+          IF((JSTRAN.EQ.0).OR.(JSTRAN.EQ.1))THEN
+            CALL TRPIRP(SEQ,IDIM,I1,I2,KSTART,MINOPN,IDEV,PAA,
+     +      JSTRAN,OUTP,FRAMEC,IDIMP)
+          ELSE IF (JSTRAN.EQ.2) THEN
+            CALL TRPIRP(SEQ,IDIM,I1,I2,KSTART,MINOPN,IDEV,PAA,
+     +      0,OUTP,FRAMEC,IDIMP)
+            CALL TRPIRP(SEQ,IDIM,I1,I2,KSTART,MINOPN,IDEV,PAA,
+     +      1,OUTP,FRAMEC,IDIMP)
+          END IF
+        END IF
+      END IF
+      CALL TRAN6X(SEQ,IDIMP,IDEV,I1,I2,PAA,ITRAN,JSTRAN,INUM,LINLEN,
+     +FRAMEC)
+      END
+      SUBROUTINE TRANEM(SEQ,IDIM,J1,J2,FRAMEC,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C get start and end points for translation, then set up
+C the integer array for tran6x
+      CHARACTER SEQ(IDIM)
+      INTEGER FRAMEC(IDIM),FRAME
+      KSTRAN = 0
+ 50   CONTINUE
+      KSTRAN = 0
+      CALL TRAND8(IDIM,J1,J2,KSTRAN,IP1,IP2,'Translate',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IP1.EQ.0) RETURN
+      FRAME = MOD(IP1,3)
+      IF(FRAME.EQ.0) FRAME = 3
+      IF(KSTRAN.EQ.0) THEN
+        CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,2)
+      ELSE
+        CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,3)
+      END IF
+      GO TO 50
+      END
+      SUBROUTINE TRAND8(IDIM,J1,J2,KSTRAN,IP1,IP2,PROMPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C get start and end points and strand for translation
+      CHARACTER PROMPT*(*),HELPF*(*)
+        CALL GETRC2(KBIN,KBOUT,J1,J2,IDIM,IP1,IP2,PROMPT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IP1.EQ.0) RETURN
+        IOK = 1
+        IOP = KSTRAN
+        CALL YESONO(IOP,'+ strand','- strand',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.0) RETURN
+        KSTRAN = IOP
+      IOK = 0
+      END
+      SUBROUTINE TRAND7(KBIN,KBOUT,I1,I2,
+     +ITRAN,MINLEN,MAXLEN,LINLEN,MINO,MAXO,MINOPN,JSTRAN,INUM,
+     +IDEVE,ANSE,JTRAN,FILNAM,IOK,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+C organise user interaction for translation to screen
+      CHARACTER HELPF*(*),FILNAM*(*)
+      PARAMETER (MAXPRM = 25)
+      CHARACTER PROMPT(4)*(MAXPRM),OPRATR*(*)
+      INTEGER ANSE
+C  Modified 11-4-91 to make 6 phase translation more obvious
+C   Translate if JTRAN = 0
+C   Keyboard input if ANSE = 1
+C   feature table input if ANSE = 2
+C   open reading frames is ANSE = 3
+C   + strand if JSTRAN = 0
+C   - strand if JSTRAN = 1
+C   both strands JSTRAN = 2
+C   one letter codes if ITRAN = 0
+C   three letter codes if ITRAN = 3
+C   number ends of lines if INUM = 0
+C   Display I1 to I2
+      IOK = 1
+      J1 = I1
+      J2 = I2
+      IOP =  0
+      CALL YESONO(IOP,'Show translation','Hide translation',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.0) THEN
+          IOK = 1
+          RETURN
+        END IF
+      JTRAN = IOP
+      IF(JTRAN.EQ.0) THEN
+        PROMPT(1) = 'Typed on the keyboard'
+        PROMPT(2) = 'Read from a feature table'
+        PROMPT(3) = 'Open reading frames'
+        PROMPT(4) = 'All six frames'
+        CALL RADION('The segments to translate can be',
+     +  PROMPT,4,ANSE,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(ANSE.LT.1) RETURN
+        IF(ANSE.EQ.2) THEN
+          CALL TRAND9(KBIN,KBOUT,IDEVE,FILNAM,OPRATR,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+        END IF
+        IF(ANSE.EQ.3) THEN
+          CALL GETINT(MINO,MAXO,MINOPN,
+     +    'Minimum open frame in amino acids',
+     +    IVAL,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+          MINOPN = IVAL
+        END IF
+        IF(ANSE.EQ.4) THEN
+          MINOPN = 0
+          JSTRAN = 2
+        END IF
+        IOP = ITRAN
+        IF(IOP.EQ.3) IOP = 1
+        CALL YESONO(IOP,'Use 1 letter codes',
+     +  'Use 3 letter codes',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.0) THEN
+          IOK = 1
+          RETURN
+        END IF
+        IF(IOP.EQ.1) ITRAN = 3
+        IF(IOP.EQ.0) ITRAN = 0
+      END IF
+      CALL GTREG(KBIN,KBOUT,J1,J2,I1,I2,
+     +'Define section of DNA to display',
+     + IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL GETINT(MINLEN,MAXLEN,LINLEN,'Line length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LINLEN = IVAL
+      IF(JTRAN.EQ.0) THEN
+        IF(MOD(LINLEN,3).NE.0) LINLEN = 60
+      END IF
+      IF(ANSE.NE.4) THEN
+        IVAL = JSTRAN + 1
+        CALL GSTRND(IVAL,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IVAL.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+        JSTRAN = IVAL - 1
+      END IF
+      IOP = INUM
+      CALL YESONO(IOP,'Number ends of lines',
+     +'Number every tenth base',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) THEN
+        IOK = 1
+        RETURN
+      END IF
+      INUM = IOP
+      IOK = 0
+      END
+      SUBROUTINE EMBLF(IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,ANSE,OPRATR,IOK)
+C   AUTHOR: RODGER STADEN
+C ask about keyboard or ft input. If ft open file and anse = 1
+      CHARACTER FILNAM*(*),HELPF*(*),OPRATR*(*)
+      INTEGER ANSE
+      IOK = 1
+      IOP = ANSE
+      CALL YESONO(IOP,'Define segments using keyboard',
+     +'Define segments using embl feature table',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      ANSE = IOP
+      IF(ANSE.EQ.1) THEN
+          CALL TRAND9(KBIN,KBOUT,IDEVE,FILNAM,OPRATR,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE TRAND9(KBIN,KBOUT,IDEVE,FILNAM,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C open feature table file and get ft operator
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER OPRATR*(*)
+      FILNAM = ' '
+      CALL OPENF1(IDEVE,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Feature table file name',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+ 20   CONTINUE
+      IOK = 1
+      LIN = 3
+      CALL GTSTR('Operator','all',OPRATR,LIN,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      CALL CCASE(OPRATR,1)
+      IF((LIN.EQ.0).OR.(OPRATR(1:3).EQ.'ALL')) OPRATR(1:1) = ' '
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 20
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE TRANEN(SEQ,IDIM,J1,J2,FRAMEC,JSTRAN,IDEVE,
+     +POSNS,MAXPOS,KBOUT,OPRATR,IOK)
+C use ft to set up array for tran6x
+      CHARACTER SEQ(IDIM),OPRATR*(*)
+      INTEGER POSNS(MAXPOS),FRAMEC(IDIM)
+      INTEGER EMBLFT
+      EXTERNAL EMBLFT
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,'CDS',OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        IADD = 0
+        ITOT = 0
+        DO 50 J=K-NSEG,K-1,2
+          IP1 = POSNS(J)
+          IP2 = POSNS(J+1)
+          ITOT = ITOT + IP2 - IP1 + 1
+          IP1 = IP1 - IADD
+          CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,2)
+          IADD = MOD(ITOT,3)
+ 50       CONTINUE
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        IADD = 0
+        ITOT = 0
+        DO 60 J=K-1,K-NSEG,-2
+          IP1 = POSNS(J-1)
+          IP2 = POSNS(J)
+          ITOT = ITOT + IP2 - IP1 + 1
+          IP2 = IP2 + IADD
+C mbprim expect complementary strand codon positions to start
+C from the same position as their counterparts on the other strand
+C ie the first position should correspond to the third
+C + strand 123
+C - strand 123
+          CALL MBPRIM(FRAMEC,IDIM,IP1,IP2,-3)
+          IADD = MOD(ITOT,3)
+ 60       CONTINUE
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      CLOSE(UNIT=IDEVE)
+      END
+      SUBROUTINE TRAND1(JSTRAN,ANSE,IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,OPRATR,IOK)
+C   AUTHOR: RODGER STADEN
+C ask for ft or kb input, get operator and strand
+      INTEGER ANSE
+      CHARACTER FILNAM*(*),HELPF*(*),OPRATR*(*)
+      IOK = 1
+      CALL EMBLF(IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,ANSE,OPRATR,IOK)
+      IF(IOK.NE.0)RETURN
+      IF(ANSE.EQ.1)THEN
+        IOP = JSTRAN + 1
+        CALL GSTRND(IOP,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOP.LT.1) THEN
+          IOK = 1
+          RETURN
+        END IF
+        JSTRAN = IOP - 1
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE TRANDK(SEQ1,IDIM1,OUTP1,IDIMP,ISTART,ISTOP,IDIMT,
+     +IDEVE,IDEV,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,J1,J2,POSNS,MAXPOS)
+      CHARACTER HELPF*(*),OPRATR*20
+      INTEGER ANSE,POSNS(MAXPOS)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1),OUTP1(IDIMP),PAA(125),FILNAM*(*)
+C   J1 IS THE FIRST BASE IN THE RAM BUFFER
+C   J2 IS THE LAST BASE IN THE RAM BUFFER
+C   IDIMT IS THE ACTUAL SEQUENCE LENGTH
+C   WRAP AROUND IS ONLY POSSIBLE IF J1=1, AND J2=IDIMT
+      CALL SHOWFU(KBOUT,'Translate and write protein sequence to disk')
+      CALL GETPAR(40,7,IOK,JSTRAN,ANSE,IOPEN,MINO,MAXO,MINOPN,JSTRAN,
+     +IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IOP = IOPEN
+      CALL YESONO(IOP,'Translate selected regions',
+     +'Translate open reading frames',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      IF(IOP.EQ.1) THEN
+        CALL TRNPIR(SEQ1,IDIM1,J1,J2,ISTART,KBIN,KBOUT,IDEVE,PAA,
+     +  OUTP1,IDIMP,FILNAM,MINO,MAXO,MINOPN,JSTRAN,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        RETURN
+      END IF
+      CALL TRAND1(JSTRAN,ANSE,IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,OPRATR,IOK)
+      IF(IOK.NE.0) THEN
+        IF(ANSE.EQ.1) CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ANSE.EQ.1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Output file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL TRNEMB(SEQ1,IDIM1,J1,J2,OUTP1,PAA,JSTRAN,IDEVE,IDEV,
+     +  POSNS,MAXPOS,KBOUT,OPRATR,IOK)
+        RETURN
+      END IF
+      IOUT = -4
+      CALL FILLC(OUTP1,IDIMP,'-')
+C
+50    CONTINUE
+C
+      CALL TRAND8(IDIMT,J1,J2,JSTRAN,I1,N1,'Translate',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(I1.NE.0) THEN
+        IB = N1 - I1 + 1
+        IF(N1.LT.I1) IB = IB + IDIM1
+        IAA = IB/3 + 1
+        IF(IAA.GE.2) THEN
+          IOUT = IOUT + 5
+          CALL TRANSD(SEQ1,IDIM1,
+     +    I1-ISTART+1,N1-ISTART+1,OUTP1(IOUT),IAA,PAA,JSTRAN)
+          IF(JSTRAN.EQ.1) CALL SQREV(OUTP1(IOUT),IAA)
+          IOUT = IOUT + IAA - 1
+        END IF
+        GO TO 50
+      END IF
+      IF(IOUT.GT.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVE,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Output file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL FMTDKN(IDEVE,OUTP1,IOUT)
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      END
+      SUBROUTINE TRNEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,IDEV,
+     +POSNS,MAXPOS,KBOUT,OPRATR,IOK)
+C translate to disk using ft
+      CHARACTER SEQ(IDIM),OPRATR*(*),SEQW(IDIM),PAA(125),LINE*60
+      INTEGER POSNS(MAXPOS)
+      INTEGER EMBLFT,WPAIR
+      EXTERNAL EMBLFT,ITOSL,WPAIR
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,'CDS',OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+        DO 50 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+ 1001     FORMAT(A)
+ 50       CONTINUE
+          IB = IB - 1
+          IACIDS = IB/3 + 1
+          CALL TRANSD(SEQW,IDIM,1,IB,SEQW,IACIDS,PAA,ISTRAN)
+          LINE(1:) = '>'
+          IF(ITOSL(LINE(2:),POSNS(K-NSEG)).EQ.0)
+     +     WRITE(*,*)'Scream: ITOSL'
+          IF(WPAIR(LINE(22:),POSNS(K-NSEG),POSNS(K-1)).NE.0) 
+     +    WRITE(*,*)'Scream: WPAIR'
+          WRITE(IDEV,1001)LINE
+          IF (SEQW(IACIDS).NE.'*') THEN
+            IACIDS = IACIDS + 1
+            SEQW(IACIDS) = '*'
+          END IF
+          CALL FMTDKN(IDEV,SEQW,IACIDS)
+        END IF
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+        DO 60 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+ 60       CONTINUE
+          IB = IB - 1
+          CALL SQREV(SEQW,IB)
+          CALL SQCOM2(SEQW,IB)
+          IACIDS = IB/3 + 1
+          CALL TRANSD(SEQW,IDIM,1,IB,SEQW,IACIDS,PAA,0)
+          IF (SEQW(IACIDS).NE.'*') THEN
+            IACIDS = IACIDS + 1
+            SEQW(IACIDS) = '*'
+          END IF
+          LINE(1:) = '>                    complement('
+          IF (ITOSL(LINE(2:),POSNS(K-1)).EQ.0) WRITE(*,*)'Scream: ITOSL'
+          IF(WPAIR(LINE(33:),POSNS(K-NSEG),POSNS(K-1)).NE.0) 
+     +    WRITE(*,*)'Scream: WPAIR'
+          LINE(32+INDEX(LINE(33:),' '):) = ')'
+          WRITE(IDEV,1001)LINE
+          CALL FMTDKN(IDEV,SEQW,IACIDS)
+        END IF
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      CLOSE(UNIT=IDEVE)
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE CODTDK(SEQ,IDIM,J1,J2,IDEVE,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,POSNS,MAXPOS,SEQW)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),PAA(5,5,5),OPRATR*20
+      REAL S1(64),S2(64)
+      CHARACTER SEQW(IDIM)
+      INTEGER POSNS(MAXPOS)
+      INTEGER NORM,ANSE,ANST,ANSTO
+      CALL SHOWFU(KBOUT,'Calculate codon table and write it to disk')
+      CALL GETPAR(41,5,IOK,JSTRAN,NORM,ANSE,ANST,ANSTO,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      JSTRAN = 0
+      NORM = 0
+      ANSE = 0
+      ANST = 0
+      ANSTO = 0
+C ANSTO REDUNDANT
+      CALL FILLR(S2,64,0.0)
+      CALL CODTBL(IDEVE,FILNAM,S2,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,ANST,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL CODND1(JSTRAN,NORM,ANSE,IDEVE,FILNAM,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(ANSE.EQ.1) THEN
+        CALL CTDEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,
+     +  POSNS,MAXPOS,KBIN,KBOUT,OPRATR,S1,S2,NORM,FILNAM,IOK)
+      RETURN
+      END IF
+100   CONTINUE
+      CALL FILLR(S1,64,0.0)
+      CALL TRAND8(IDIM,J1,J2,JSTRAN,I1,I2,'Count over',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(I1.EQ.0) GO TO 200
+      IDIM1 = I2-I1+1
+      CALL CALCOD(S1,SEQ(I1),IDIM1)
+      IF(JSTRAN.EQ.1)CALL CODCOM(S1)
+      CALL ADDR(S1,S2,64)
+      IF(NORM.EQ.1)CALL NORMAA(S1,100.,PAA)
+      CALL WRTCOD(S1,KBOUT,PAA)
+      GO TO 100
+200   CONTINUE
+      WRITE(KBOUT,9991)
+9991  FORMAT(5X,'Codon totals over all genes')
+      IF(NORM.EQ.1) CALL NORMAA(S2,100.,PAA)
+      CALL WRTCOD(S2,KBOUT,PAA)
+      FILNAM = ' '
+      CALL OPENF1(IDEVE,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for codon table file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL WRTCOD(S2,IDEVE,PAA)
+      CLOSE(UNIT=IDEVE)
+      END
+      SUBROUTINE CODND1(JSTRAN,NORM,ANSE,IDEVE,FILNAM,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+C get observed or normalised, kb or ft, operator, strand
+      INTEGER ANSE
+      CHARACTER FILNAM*(*),HELPF*(*),OPRATR*(*)
+      IOK = 1
+      IOP = NORM
+      CALL YESONO(IOP,'Show observed counts',
+     +'Normalize counts',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOP.LT.0) RETURN
+      NORM = IOP
+      CALL TRAND1(JSTRAN,ANSE,IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,OPRATR,IOK)
+      IF(IOK.NE.0) THEN
+        IF(ANSE.EQ.1) CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE CTDEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,
+     +POSNS,MAXPOS,KBIN,KBOUT,OPRATR,S1,S2,NORM,FILNAM,IOK)
+C codons to disk using ft
+      CHARACTER SEQ(IDIM),OPRATR*(*),SEQW(IDIM),PAA(125)
+      INTEGER POSNS(MAXPOS)
+      REAL S1(64),S2(64)
+      CHARACTER FILNAM*(*)
+      INTEGER EMBLFT
+      EXTERNAL EMBLFT
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,'CDS',OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-NSEG)
+        DO 50 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+C note not checking for within j1,j2
+ 50       CONTINUE
+          IB = IB - 1
+          IB = 3*(IB/3)
+          CALL FILLR(S1,64,0.0)
+          CALL CALCOD(S1,SEQW,IB)
+          CALL ADDR(S1,S2,64)
+          IF(NORM.EQ.1)CALL NORMAA(S1,100.,PAA)
+          CALL WRTCOD(S1,KBOUT,PAA)
+        END IF
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-1)
+        DO 60 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+ 60       CONTINUE
+          IB = IB - 1
+          CALL SQREV(SEQW,IB)
+          CALL SQCOM2(SEQW,IB)
+          IB = 3*(IB/3)
+          CALL FILLR(S1,64,0.0)
+          CALL CALCOD(S1,SEQW,IB)
+          CALL ADDR(S1,S2,64)
+          IF(NORM.EQ.1)CALL NORMAA(S1,100.,PAA)
+          CALL WRTCOD(S1,KBOUT,PAA)
+        END IF
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      CLOSE(UNIT=IDEVE)
+      WRITE(KBOUT,9991)
+9991  FORMAT(5X,'Codon totals over all genes')
+      IF(NORM.EQ.1) CALL NORMAA(S2,100.,PAA)
+      CALL WRTCOD(S2,KBOUT,PAA)
+      FILNAM = ' '
+      CALL OPENF1(IDEVE,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Name for codon table file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL WRTCOD(S2,IDEVE,PAA)
+      CLOSE(UNIT=IDEVE)
+      END
+      SUBROUTINE CODEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,IDEV,
+     +POSNS,MAXPOS,KBIN,KBOUT,OPRATR,S1,S2,S3,NORM,FILNAM,IOK)
+C codons, acids, bases to screen using ft
+      CHARACTER SEQ(IDIM),OPRATR*(*),SEQW(IDIM),PAA(125)
+      INTEGER POSNS(MAXPOS)
+      REAL S1(64),S2(64),S3(64)
+      CHARACTER FILNAM*(*)
+      INTEGER EMBLFT
+      EXTERNAL EMBLFT
+      CALL FILLR(S2,64,0.0)
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,'CDS',OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-NSEG)
+        DO 50 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+C note not checking for within j1,j2
+ 50       CONTINUE
+          IB = IB - 1
+          IB = 3*(IB/3)
+          CALL FILLR(S1,64,0.0)
+          CALL CALCOD(S1,SEQW,IB)
+          CALL ADDR(S1,S2,64)
+          IF(NORM.EQ.1)THEN
+            CALL COPYR(S1,S3,64)
+            CALL NORMAA(S3,100.,PAA)
+            CALL WRTCOD(S3,IDEV,PAA)
+          ELSE
+            CALL WRTCOD(S1,IDEV,PAA)
+          END IF
+          IF(IDEV.EQ.KBOUT)THEN
+            CALL BPAUSE(KBIN,KBOUT,IOK)
+            IF(IOK.NE.0) RETURN
+          END IF
+          CALL BCOMPC(S1,IDEV)
+          CALL AACOMP(S1,IDEV,PAA)
+          IF(IDEV.EQ.KBOUT)THEN
+            CALL BPAUSE(KBIN,KBOUT,IOK)
+            IF(IOK.NE.0) RETURN
+          END IF
+          END IF
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-1)
+        DO 60 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+ 60       CONTINUE
+          IB = IB - 1
+          CALL SQREV(SEQW,IB)
+          CALL SQCOM2(SEQW,IB)
+          IB = 3*(IB/3)
+          CALL FILLR(S1,64,0.0)
+          CALL CALCOD(S1,SEQW,IB)
+          CALL ADDR(S1,S2,64)
+          IF(NORM.EQ.1)THEN
+            CALL COPYR(S1,S3,64)
+            CALL NORMAA(S3,100.,PAA)
+            CALL WRTCOD(S3,IDEV,PAA)
+          ELSE
+            CALL WRTCOD(S1,IDEV,PAA)
+          END IF
+          IF(IDEV.EQ.KBOUT)THEN
+            CALL BPAUSE(KBIN,KBOUT,IOK)
+            IF(IOK.NE.0) RETURN
+          END IF
+          CALL BCOMPC(S1,IDEV)
+          CALL AACOMP(S1,IDEV,PAA)
+          IF(IDEV.EQ.KBOUT)THEN
+            CALL BPAUSE(KBIN,KBOUT,IOK)
+            IF(IOK.NE.0) RETURN
+          END IF
+          END IF
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      WRITE(IDEV,9991)
+9991  FORMAT(5X,'Codon totals over all genes')
+      IF(NORM.EQ.1)THEN
+        CALL COPYR(S2,S3,64)
+        CALL NORMAA(S3,100.,PAA)
+        CALL WRTCOD(S3,IDEV,PAA)
+      ELSE
+        CALL WRTCOD(S2,IDEV,PAA)
+      END IF
+      IF(IDEV.EQ.KBOUT)THEN
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL BCOMPC(S2,IDEV)
+      CALL AACOMP(S2,IDEV,PAA)
+      END
+      SUBROUTINE CODONS(SEQ,IDIM,J1,J2,IDEV,IDEVE,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PAA,POSNS,MAXPOS,SEQW)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),FILNAM*(*),PAA(5,5,5),SEQW(IDIM),OPRATR*20
+      REAL S1(64),S2(64),S3(64)
+      INTEGER NORM,ANSE,POSNS(MAXPOS)
+      CALL SHOWFU(KBOUT,
+     +'Calculate base, codon and amino acid compositions')
+      CALL GETPAR(23,3,IOK,JSTRAN,NORM,ANSE,
+     +IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      JSTRAN = 0
+      NORM = 0
+      ANSE = 0
+      CALL CODND1(JSTRAN,NORM,ANSE,IDEVE,FILNAM,OPRATR,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(ANSE.EQ.1) THEN
+        CALL CODEMB(SEQ,IDIM,J1,J2,SEQW,PAA,JSTRAN,IDEVE,IDEV,
+     +  POSNS,MAXPOS,KBIN,KBOUT,OPRATR,S1,S2,S3,NORM,FILNAM,IOK)
+        CLOSE(UNIT=IDEVE)
+      RETURN
+      END IF
+      CALL FILLR(S2,64,0.0)
+100   CONTINUE
+      CALL FILLR(S1,64,0.0)
+      CALL TRAND8(IDIM,J1,J2,JSTRAN,I1,I2,'Count over',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(I1.EQ.0) GO TO 200
+      IDIM1 = I2-I1+1
+      CALL CALCOD(S1,SEQ(I1),IDIM1)
+      IF(JSTRAN.EQ.1)CALL CODCOM(S1)
+      CALL ADDR(S1,S2,64)
+      IF(NORM.EQ.1)THEN
+        CALL COPYR(S1,S3,64)
+        CALL NORMAA(S3,100.,PAA)
+        CALL WRTCOD(S3,IDEV,PAA)
+      ELSE
+        CALL WRTCOD(S1,IDEV,PAA)
+      END IF
+      IF(IDEV.EQ.KBOUT)THEN
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL BCOMPC(S1,IDEV)
+      CALL AACOMP(S1,IDEV,PAA)
+      IF(IDEV.EQ.KBOUT)THEN
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      GO TO 100
+200   CONTINUE
+      WRITE(IDEV,9991)
+9991  FORMAT(5X,'Codon totals over all genes')
+      IF(NORM.EQ.1)THEN
+        CALL COPYR(S2,S3,64)
+        CALL NORMAA(S3,100.,PAA)
+        CALL WRTCOD(S3,IDEV,PAA)
+      ELSE
+        CALL WRTCOD(S2,IDEV,PAA)
+      END IF
+      IF(IDEV.EQ.KBOUT)THEN
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL BCOMPC(S2,IDEV)
+      CALL AACOMP(S2,IDEV,PAA)
+      END
+      SUBROUTINE CODIMP(SEQ,IDIM,FTABLE,IDFTAB,IDEVE,FILNAM,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,J1,J2,IDEV,POSNS,MAXPOS,SEQW)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      REAL FTABLE(IDFTAB)
+      CHARACTER SEQ(IDIM),FILNAM*(*),OPRATR*20,SEQW(IDIM)
+      INTEGER ANSE,POSNS(MAXPOS)
+      CALL FACTAB(FTABLE,99)
+      CALL SHOWFU(KBOUT,'Calculate codon improbability')
+      CALL GETPAR(28,2,IOK,JSTRAN,ANSE,
+     +IPAR3,IPAR4,IPAR5,
+     +IPAR6,IPAR7,IPAR8,IPAR9,IPAR10,
+     +IPAR11,IPAR12,IPAR13,IPAR14,IPAR15,
+     +IPAR16,IPAR17,IPAR18,IPAR19,IPAR20,
+     +IPAR21,IPAR22,IPAR23,IPAR24,IPAR25,
+     +IPAR26,IPAR27,IPAR28,IPAR29,IPAR30,
+     +IPAR31,IPAR32,IPAR33,IPAR34,IPAR35)
+      IF(IOK.NE.0) RETURN
+      CALL TRAND1(JSTRAN,ANSE,IDEVE,FILNAM,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,OPRATR,IOK)
+      IF(IOK.NE.0) THEN
+        IF(ANSE.EQ.1)CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ANSE.EQ.1) THEN
+        CALL CDIEMB(SEQ,IDIM,J1,J2,SEQW,JSTRAN,IDEVE,IDEV,
+     +  POSNS,MAXPOS,KBOUT,FTABLE,IDFTAB,OPRATR,IOK)
+        CLOSE(UNIT=IDEVE)
+      RETURN
+      END IF
+100   CONTINUE
+      CALL TRAND8(IDIM,J1,J2,JSTRAN,I1,I2,'Count over',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(I1.EQ.0) RETURN
+      IDIM1 = I2-I1+1
+      IF(JSTRAN.EQ.1) THEN
+        CALL SQCOPY(SEQ(I1),SEQW,IDIM1)
+        CALL SQREV(SEQW,IDIM1)
+        CALL SQCOM2(SEQW,IDIM1)
+        CALL CODMPP(SEQW,IDIM,FTABLE,IDFTAB,KBOUT,1,IDIM1,1,IDIM1,
+     +  IDIM1,IDEV)
+      ELSE
+        CALL CODMPP(SEQ,IDIM,FTABLE,IDFTAB,
+     +  KBOUT,1,IDIM,I1,I2,IDIM1,IDEV)
+      END IF
+      GO TO 100
+      END
+      SUBROUTINE CDIEMB(SEQ,IDIM,J1,J2,SEQW,JSTRAN,IDEVE,IDEV,
+     +POSNS,MAXPOS,KBOUT,FTABLE,IDFTAB,OPRATR,IOK)
+C codon improbability using ft
+      CHARACTER SEQ(IDIM),OPRATR*(*),SEQW(IDIM)
+      INTEGER POSNS(MAXPOS)
+      REAL FTABLE(IDFTAB)
+      INTEGER EMBLFT
+      EXTERNAL EMBLFT
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,'CDS',OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-NSEG)
+        DO 50 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+C note not checking for within j1,j2
+ 50       CONTINUE
+          IB = IB - 1
+          IB = 3*(IB/3)
+          CALL CODMPP(SEQW,IDIM,FTABLE,IDFTAB,KBOUT,1,IB,1,IB,IB,IDEV)
+        END IF
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        IB = 1
+        IF((POSNS(K-NSEG).GE.J1).AND.(POSNS(K-1).LE.J2)) THEN
+          WRITE(KBOUT,*)'Data from end point ',POSNS(K-1)
+        DO 60 J=K-NSEG,K-1,2
+          L = POSNS(J+1) - POSNS(J) + 1
+          CALL SQCOPY(SEQ(POSNS(J)),SEQW(IB),L)
+          IB = IB + L
+ 60       CONTINUE
+          IB = IB - 1
+          CALL SQREV(SEQW,IB)
+          CALL SQCOM2(SEQW,IB)
+          IB = 3*(IB/3)
+          CALL CODMPP(SEQW,IDIM,FTABLE,IDFTAB,KBOUT,1,IB,1,IB,IB,IDEV)
+        END IF
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      END
+      SUBROUTINE PLTEMB(JSTRAN,IDEVE,
+     +POSNS,MAXPOS,KBOUT,KEYWRD,OPRATR,YF,BLIPB,BLIPT,
+     +    XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C plot features
+      CHARACTER OPRATR*(*),KEYWRD*(*)
+      INTEGER POSNS(MAXPOS)
+      INTEGER EMBLFT
+      EXTERNAL EMBLFT
+      ISTRAN = 0
+      IF(JSTRAN.EQ.1) ISTRAN = 1
+10    CONTINUE
+      NOBJ = 0
+      IOK = EMBLFT(IDEVE,KBOUT,KEYWRD,OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEVE)
+        RETURN
+      END IF
+      IF(ISTRAN.EQ.0) THEN
+      K = NPOS
+      DO 100 I=1,NOBJ
+        NSEG = POSNS(K)
+        DO 50 J=K-NSEG,K-1,2
+          CALL PLTBAR(REAL(POSNS(J)),REAL(POSNS(J+1)),
+     +    YF,BLIPB,BLIPT,
+     +    XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C note not checking for within j1,j2
+ 50       CONTINUE
+          K = K - NSEG - 1
+ 100      CONTINUE
+      ELSE
+      K = NPOS
+      DO 200 I=1,NOBJ
+        NSEG = POSNS(K)
+        DO 60 J=K-NSEG,K-1,2
+          CALL PLTBAR(REAL(POSNS(J)),REAL(POSNS(J+1)),
+     +    YF,BLIPB,BLIPT,
+     +    XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+ 60       CONTINUE
+          K = K - NSEG - 1
+ 200      CONTINUE
+      END IF
+      IF((JSTRAN.EQ.2).AND.(ISTRAN.EQ.0)) THEN
+        ISTRAN = 1
+        REWIND(IDEVE)
+        GO TO 10
+      END IF
+      END
+C    PLOTMAP
+      SUBROUTINE PLTMAQ(IDEV,FILNAM,POSNS,MAXPOS,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IS,IE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      INTEGER POSNS(MAXPOS)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),OPRATR*20,KEYWRD*15
+      PARAMETER (IBLIPH=128)
+      CALL SHOWFU(KBOUT,
+     +'Display a map using a feature table file')
+      XMAX=IE
+      XMIN=IS
+      YMIN=0.
+      YMAX=ISYMAX
+      IY = IBLIPH
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Feature table file name',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+10    CONTINUE
+      REWIND IDEV
+      LIN = 3
+      CALL GTSTR('Feature code','CDS',KEYWRD,LIN,
+     +KBOUT,KBIN,INFLAG)
+      IF(LIN.EQ.0) KEYWRD(1:3) = 'CDS'
+      CALL CCASE(KEYWRD,1)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) GO TO 900
+      IF(KEYWRD.EQ.' ') GO TO 900
+ 20   CONTINUE
+      LIN = 3
+      CALL GTSTR('Operator','all',OPRATR,LIN,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) GO TO 900
+      CALL CCASE(OPRATR,1)
+      IF((LIN.EQ.0).OR.(OPRATR(1:3).EQ.'ALL')) OPRATR(1:1) = ' '
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 20
+      END IF
+      JSTRAN = 1
+      CALL GSTRND(JSTRAN,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      JSTRAN = JSTRAN - 1
+      IF(JSTRAN.LT.0) GO TO 900
+      MININ = 0
+      MAXIN = MARGT
+      CALL GETINT(MININ,MAXIN,IY,
+     +'level',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 900
+      IY = IVAL
+C      YBASE=IBASE
+      YF=IY
+      CALL CLEARV
+      CALL VECTOM
+C   NEED BARS AT ENDS OF FEATURES FROM BLIPB TO BLIPT
+      BLIPB=YF-IBLIPH/2
+      BLIPT=YF+IBLIPH/2
+      CALL PLTEMB(JSTRAN,IDEV,
+     +POSNS,MAXPOS,KBOUT,KEYWRD,OPRATR,YF,BLIPB,BLIPT,
+     +    XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      GO TO 10
+900   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE PLTBAR(POSNL,POSNR,YF,BLIPB,BLIPT,
+     +XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C plot a bar for a feature
+      CALL LINE(POSNL,POSNR,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(POSNL,POSNL,BLIPB,BLIPT,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(POSNR,POSNR,BLIPB,BLIPT,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE MBPRIM(FRAMEC,IDIMS,J1,J2,PRIME)
+      INTEGER FRAMEC(IDIMS),PRIME,FROTAT(0:2,0:2)
+      SAVE FROTAT
+      DATA FROTAT/3,2,1,1,3,2,2,1,3/
+C  AUTHOR RODGER STADEN
+C     multiplies array framec(i1-i2) by 2 or 3 
+C     frame rotation depends on frame and sequence length
+C     on - strand only works for complete codons
+      IF(PRIME.LT.0) THEN
+C special case for ft input
+        DO 1 I = J2-2,J1,-3
+          FRAMEC(I) = FRAMEC(I) * PRIME * (-1)
+1       CONTINUE
+        RETURN
+      END IF
+      I1 = J1
+      I2 = J2
+      IF(J1.GT.J2) THEN
+        I2 = IDIMS
+        DO 5 I = I1,I2,3
+          FRAMEC(I) = FRAMEC(I) * PRIME
+ 5      CONTINUE
+        LFRAME = MOD(IDIMS,3)
+        IFRAME = MOD(J1,3)
+        I1 = FROTAT(LFRAME,IFRAME)
+        I2 = J2
+        DO 10 I = I1,I2,3
+          FRAMEC(I) = FRAMEC(I) * PRIME
+ 10     CONTINUE
+      ELSE
+        DO 20 I = I1,I2,3
+          FRAMEC(I) = FRAMEC(I) * PRIME
+ 20     CONTINUE
+      END IF
+      END
+      SUBROUTINE SQTREE(SEQ,SEQN,IDIM,WORDP,IDE,TREE,MAXTRE,
+     +LEVELS,MAXLEV,IDM,KSTART,
+     +IDEV,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER SEQ(IDIM),HELPF*(*)
+      INTEGER SEQN(IDIM),WORDP(IDE),TREE(MAXTRE)
+      INTEGER LEVELS(MAXLEV)
+      PARAMETER (MAXLEN = 6,
+     +           MAXCON = 4*MAXLEN)
+      INTEGER IALPHA(5)
+      INTEGER CONSTS(0:MAXCON)
+C      INTEGER CONSTS(MAXLEN)
+C    PROGRAM TO REPRESENT A SEQUENCE AS A TREE
+C    THE FIRST LEVEL OF THE TREE IS THE SEQUENCE COMPOSITION
+C    THE SECOND LEVEL CONTAINS ALL THE TWO LETTER WORDS PRESENT
+C    AND SO ON. AS THE WORD LENGTH INCREASES THE NUMBER OF
+C    DIFFERENT WORDS PRESENT DECREASES, AS DOES THE NUMBER
+C    OF OCCURRENCES OF EACH. THE ROUTINE FOLLOWS WORDS UNTIL
+C    THEY ONLY OCCUR ONCE.
+C    IN ITS SIMPLEST FORM THE ALGORITHM STARTS WITH THE COMPOSITION
+C    AND FOLLOWS WORDS UNTIL THEY ONLY OCCUR ONCE. THE METHOD
+C    ENCODED BELOW STARTS WITH SOME MINIMUM WORD LENGTH
+C    USING THE HASHING METHOD OF THE SHOTGUN PROGRAMS. IT THEN
+C    CONVERTS TO THE DATA STRUCTURE FOR THE TREE METHOD. THE CURRENT
+C    DATA STRUCTURE IS AS FOLLOWS: FOR ANY WORD LENGTH THERE ARE
+C    NWORD DIFFERENT REPEATED WORDS, FOR EACH OF THOSE WORDS
+C    THERE ARE IWORD OCCURRENCES. THE TREE IS STORED IN AN ARRAY
+C    NAMED TREE. THE FIRST ELEMENT FOR EACH LENGTH OF WORD CONTAINS
+C    THE NUMBER OF DIFFERENT WORDS OF THAT LENGTH THAT ARE REPEATED,
+C    THE NEXT ELEMENT THE NUMBER OF OCCURRENCES OF THE FIRST WORD, THE
+C    NEXT ELEMENTS POINT TO THE STARTS OF THE INDIVIDUAL WORDS, UNTIL
+C    THE COUNT FOR THE NEXT REPEATED WORD. THE START POSITION IN THE
+C    TREE FOR EACH WORD LENGTH IS STORED IN ARRAY LEVEL.
+C
+C tree structure:
+C
+C    word length                                        = wordl
+C    number of different repeated words of length wordl = nword
+C    number of occurrences of each repeated word        = iword
+C    positions of words in the sequence                 = pword
+C
+C for each length of word (level of the tree) store: 
+C
+C                 nword(wordl(level)),iword(1),pword(iword(1),1),pword(iword(1),2)
+C                               ,,, [iword(1) elements],iword(2),pword(iword(2),1)
+C                               ,,,
+C
+C we also store in array levels a pointer into tree: levels(n) points to the
+C start of info for words of length n
+C
+      CALL CONNUM(SEQ,SEQN,IDIM)
+C   COUNT OCCURENCES OF ALPHABET
+      CALL FILLI(IALPHA,IDM,0)
+      DO 10 I=1,IDIM
+        IALPHA(SEQN(I))=IALPHA(SEQN(I))+1
+10    CONTINUE
+      NALPHA=0
+      DO 11 I=1,IDM
+        IF(IALPHA(I).NE.0)NALPHA=NALPHA+1
+11    CONTINUE
+C   CALC EXPECTED LONGEST DIRECT REPEAT
+C   lN = 2LOGN/LOG(1/V) - [1+LOG(1-V)/LOGV) + 0.5772/LOGV] + LOG2/LOGV
+C   WHERE V=SUM PI, I=1,NALPHA AND N=IDIM
+      V=0.
+      DO 12 I=1,NALPHA
+        V1=REAL(IALPHA(I))/REAL(IDIM)
+        V=V+V1*V1
+12    CONTINUE
+      VLOG=LOG(V)
+      VLOGR=LOG(1.0/V)
+      VLOG1M=LOG(1.0-V)
+      REALLN=(2.*LOG(REAL(IDIM))/VLOGR)-(1+(VLOG1M/VLOG)+
+     +        0.5772/VLOG)+LOG(2.)/VLOG
+      LENEXP = NINT(REALLN)
+      WRITE(KBOUT,1010)LENEXP
+ 1010 FORMAT(' Expected length of longest repeat',I6)
+      LENGTH = MIN(MAXLEN,LENEXP)
+      MN = 1
+      MX = LENGTH
+      JLEVEL = MX
+      CALL GETINT(MN,MX,JLEVEL,'Minumim word length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF (IOK.NE.0) RETURN
+      LENGTH = IVAL
+      CALL BUSY(KBOUT)
+      IF (LENGTH.EQ.1) THEN
+        CALL SQTRE1(SEQN,IDIM,TREE,MAXTRE,IALPHA,IDM,IENDT)
+      ELSE
+        CALL SETCN(CONSTS,LENGTH,IDM,MAXCON)
+        CALL ENCOND(SEQN,IDIM,WORDP,IDE,IDM,CONSTS,LENGTH,MAXCON)
+        CALL CONV(SEQN,IDIM,WORDP,IDE,TREE,MAXTRE,IENDT,LENGTH)
+        CALL CONNUM(SEQ,SEQN,IDIM)
+      END IF
+C
+      LEVEL = LENGTH
+      LEVELS(LEVEL) = 1
+      IENDT = IENDT + 1
+C
+C
+C
+300   CONTINUE
+C
+C
+      LEVEL=LEVEL+1
+      IF(LEVEL.GT.MAXLEV)GO TO 650
+      LEVELS(LEVEL)=IENDT
+      IENDT=IENDT+1
+      NODES=TREE(LEVELS(LEVEL-1))
+      NBRAN=0
+      ITREE=LEVELS(LEVEL-1)+1
+      DO 400 I=1,NODES
+        NBRAN1=TREE(ITREE)
+        ITREE=ITREE+1
+        CALL NEWBRA(TREE,MAXTRE,IALPHA,NALPHA,NBRAN1,ITREE,
+     +  NBRAN2,SEQN,IDIM,IENDT,IOK)
+        IF(IOK.NE.0)GO TO 600
+        NBRAN=NBRAN+NBRAN2
+        ITREE=ITREE+NBRAN1
+400   CONTINUE
+      TREE(LEVELS(LEVEL))=NBRAN
+      IF(NBRAN.LT.2) THEN
+        IF(NBRAN.EQ.0) LEVEL = LEVEL - 1
+        GO TO 700
+      END IF
+      GO TO 300
+C
+C Come here with problems
+C
+600   CONTINUE
+      CALL ERROM(KBOUT,'Not enough memory')
+      LEVEL = LEVEL - 1
+      GO TO 700
+650   CONTINUE
+      CALL ERROM(KBOUT,'Maximum repeat length too long')
+      LEVEL = LEVEL - 1
+C
+C Come here to show results
+C
+700   CONTINUE
+      WRITE(KBOUT,1006)4*(IENDT-1),LEVEL
+ 1006 FORMAT(' Memory used in bytes',I8,'. Length of longest repeat',I6)
+      MN = LENGTH
+      MX = LEVEL
+      JLEVEL = MX
+      CALL GETINT(MN,MX,JLEVEL,
+     +'Show repeat frequencies for words of at least length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF (IOK.NE.0) RETURN
+      JLEVEL = IVAL
+      ITREE=LEVELS(JLEVEL)
+      DO 800 ILEVEL=JLEVEL,LEVEL
+        ITREE=LEVELS(ILEVEL)
+        WRITE(IDEV,1008)ILEVEL,TREE(ITREE)
+ 1008   FORMAT(' For length',I6,
+     +  ' the number of different repeated words is',I6)
+C   find numbers of each word
+        INODES=TREE(ITREE)
+        ITREE=ITREE+1
+        IWORD=TREE(ITREE)
+        DO 750 NODE=1,INODES
+          ITREE=ITREE+IWORD+1
+          IWORD=TREE(ITREE)
+750     CONTINUE
+800   CONTINUE
+C
+C
+801   CONTINUE
+C
+C
+      MN = LENGTH
+      MX = LEVEL
+      JLEVEL = MX
+      CALL GETINT(MN,MX,JLEVEL,'Show repeats for words of length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF (IOK.NE.0) RETURN
+      JLEVEL = IVAL
+      MN = 2
+      MX = 9999
+      NOCCUR = MN
+      CALL GETINT(MN,MX,NOCCUR,
+     +'Show repeats for words occuring with frequency',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF (IOK.NE.0) RETURN
+      NOCCUR = IVAL
+Ccccccccccccccccccccccccccccccccccccccccccccc
+      ITREE=LEVELS(JLEVEL)
+C   find numbers of different words
+      INODES=TREE(ITREE)
+      ITREE=ITREE+1
+C   find number of occurences of first word
+      IWORD=TREE(ITREE)
+C   loop for each of the different words
+      DO 850 NODE=1,INODES
+        IF(IWORD.LT.NOCCUR)GO TO 876
+C  note this last line used to go to 850 not 876
+C   enough occurences so output
+        WRITE(IDEV,1014)(SEQ(K),K=TREE(ITREE+1)-(JLEVEL-1),
+     +TREE(ITREE+1))
+1014  FORMAT(' ',50A1)
+        JJ=ITREE
+        DO 875 J=1,IWORD
+          JJ=JJ+1
+          WRITE(IDEV,1015)(TREE(JJ)-JLEVEL+1)+KSTART-1
+1015      FORMAT(' occurs at ',I6)
+875     CONTINUE
+876     CONTINUE
+C   move tree pointer on to next word
+        ITREE=ITREE+IWORD+1
+C   point to next node
+        IWORD=TREE(ITREE)
+850   CONTINUE
+900   CONTINUE
+      GO TO 801
+      END
+      SUBROUTINE NEWBRA(TREE,IDTREE,IALPHA,NALPHA,NBRAN1,ITREE,
+     +NBRAN2,SEQ,IDSEQ,IENDT,IOK)
+      INTEGER IALPHA(NALPHA),TREE(IDTREE)
+      INTEGER SEQ(IDSEQ)
+      DO 10 I=1,NALPHA
+        IALPHA(I)=0
+10    CONTINUE
+C   loop for all elements this node
+      DO 20 I=ITREE,ITREE+NBRAN1-1
+C        WRITE(*,*)'I',I
+C   point to seq
+        K=TREE(I)+1
+C        WRITE(*,*)'K',K
+        IF(K.LE.IDSEQ)IALPHA(SEQ(K))=IALPHA(SEQ(K))+1
+20    CONTINUE
+C   zero number of branches next level
+      NBRAN2=0
+C   how many new branches?
+      DO 30 I=1,NALPHA
+        IF(IALPHA(I).GT.1)NBRAN2=NBRAN2+1
+30    CONTINUE
+      IF(NBRAN2.LT.1)RETURN
+C
+C   now store the values
+      DO 50 ICHAR=1,NALPHA
+        IF(IALPHA(ICHAR).LT.2)GO TO 50
+C   save thickness
+        IF(IENDT.GT.IDTREE)GO TO 100
+        TREE(IENDT)=IALPHA(ICHAR)
+        IENDT=IENDT+1
+C   now look thru sequence and save relevent pointers
+        DO 40 I=ITREE,ITREE+NBRAN1-1
+          K=TREE(I)+1
+          IF(K.GT.IDSEQ)GO TO 40
+          IF(SEQ(K).NE.ICHAR)GO TO 40
+C   save pointer
+          IF(IENDT.GT.IDTREE)GO TO 100
+          TREE(IENDT)=K
+          IENDT=IENDT+1
+40      CONTINUE
+50    CONTINUE
+      IOK = 0
+      RETURN
+100   CONTINUE
+      IOK = 1
+      END
+      SUBROUTINE CONV(POSN,IDIM,WORDP,IDE,TREE,MAXTRE,IENDT,LENGTH)
+      INTEGER POSN(IDIM),WORDP(IDE),TREE(MAXTRE)
+      IENDT = 1
+      NWORD = 0
+      DO 20 I = 1,IDE
+        IF(WORDP(I).NE.0)THEN
+          IP = WORDP(I)
+          IF(POSN(IP).NE.0)THEN
+            IWORD = 1
+            IENDT = IENDT + 1
+            IT = IENDT
+            NWORD = NWORD + 1
+            K = IP
+            IENDT = IENDT + 1
+            TREE(IENDT) = K + LENGTH - 1
+10          CONTINUE
+            IF(POSN(K).NE.0)THEN
+              K = POSN(K)
+              IENDT =IENDT + 1
+              TREE(IENDT) = K + LENGTH - 1
+              IWORD = IWORD + 1
+              GO TO 10
+            END IF
+            TREE(IT) = IWORD
+          END IF
+        END IF
+20    CONTINUE
+      TREE(1) = NWORD
+      END
+      SUBROUTINE SQTRE1(SEQN,IDIM,TREE,MAXTRE,IALPHA,IDM,IENDT)
+      INTEGER SEQN(IDIM),TREE(MAXTRE),IALPHA(IDM)
+      IENDT = 2
+      TREE(1) = IDM - 1
+      DO 200 I = 1,IDM - 1
+        TREE(IENDT) = IALPHA(I)
+        IENDT = IENDT + 1
+        DO 100 J = 1,IDIM
+          IF(SEQN(J).EQ.I) THEN
+            TREE(IENDT) = J
+            IENDT = IENDT + 1
+          END IF
+ 100    CONTINUE
+ 200  CONTINUE
+      IENDT = IENDT - 1
+      END
diff --git a/src/staden/asubs89.f b/src/staden/asubs89.f
new file mode 100644
index 0000000..d9cc01c
--- /dev/null
+++ b/src/staden/asubs89.f
@@ -0,0 +1,1055 @@
+C   ASUBS89
+C   SUBROUTINES FOR ANALYSIS PROGRAMS ANALYSEQ AND ANALYSEP
+C   AUTHOR RODGER STADEN
+C   11-1-90 GETNAM changed use of inflag=2 to set idnlst to 0
+C           Added sqpf7, sepf7 sqpf5 and removed them from anals89, 
+C           analps89
+C   6-7-90  Added showfu
+C   5-11-90 Changed rdwmt to show/not show title, changed mkwt accordingly
+C   7-11-90 Changed iopt in mkwt to fit with changes in analps89 and 
+C           anals89 (means adding 1 to old values)
+C           added new routine gstrnd
+C  18-4-91  added new weight matrix routines (initially for splice search)
+C   BUBDEL
+C   GETNAM
+C   PSRCHX
+C   BUBBLE
+C   PLTMAP
+C   IEMBL
+C   SEPFIT
+C   SPFIT
+C   GETSCR
+C   EDITSQ ETC
+      SUBROUTINE SQPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,
+     +ITOT,ITOTEL,ITOTID,
+     +ITOTP,KSTART,J1,J2,ISS,PR,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2),MATCH(IDIM2),HELPF*(*)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IOK = 1
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      WRITE(KBOUT,1013)PR,ITOTP
+1013  FORMAT(/,' Total scoring positions above',F7.3,' percent =',I4)
+      IF(ITOTP.GT.0) THEN
+        IF(ITOTP.GT.1)CALL BUBDEL(ITOT,ITOTEL,ITOTP)
+        WRITE(KBOUT,1002)(ITOT(K),K=1,MIN(10,ITOTP))
+        WRITE(KBOUT,1006)(ITOTEL(K),K=1,MIN(10,ITOTP))
+1002    FORMAT( ' Scores   ',10I7)
+1006    FORMAT( ' Positions',10I7)
+        NSEE = 0
+        CALL GETINT(0,ITOTP,NSEE,'Display',IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        NSEE = IVAL
+        IF(NSEE.GT.0)THEN
+          DO 300 I=1,NSEE
+            K=ITOTEL(I)-KSTART+1
+            WRITE(IDEV,1008)
+1008        FORMAT( )
+            CALL SQMTCH( SEQNCE(K),STRING,MATCH,IDIM2)
+            CALL FMT4LN(SEQNCE(K),STRING,MATCH,IDIM2,K,ISS,IDEV)
+300       CONTINUE
+        END IF
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE SEPF7(SEQNCE,IDIM1,STRING,IDIM2,MATCH,
+     +ITOT,ITOTEL,ITOTID,
+     +ITOTP,MINP,KSTART,J1,J2,ISS,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      CHARACTER SEQNCE(IDIM1),STRING(IDIM2),MATCH(IDIM2),HELPF*(*)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      IOK = 1
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+      WRITE(KBOUT,1013)MINP,ITOTP
+1013  FORMAT(/,' For score',I6,' the number of matches=',I6)
+      IF(ITOTP.GT.0) THEN
+        IF(ITOTP.GT.1)CALL BUBDEL(ITOT,ITOTEL,ITOTP)
+        WRITE(KBOUT,1002)(ITOT(K),K=1,MIN(10,ITOTP))
+        WRITE(KBOUT,1006)(ITOTEL(K),K=1,MIN(10,ITOTP))
+1002    FORMAT( ' Scores   ',10I7)
+1006    FORMAT( ' Positions',10I7/)
+        NSEE = 0
+        CALL GETINT(0,ITOTP,NSEE,'Display',IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        NSEE = IVAL
+        IF(NSEE.GT.0)THEN
+          DO 300 I=1,NSEE
+            K=ITOTEL(I)-KSTART+1
+            WRITE(IDEV,1008)
+1008        FORMAT( )
+            CALL SQMTCH( SEQNCE(K),STRING,MATCH,IDIM2)
+            CALL FMT4LN(SEQNCE(K),STRING,MATCH,IDIM2,K,ISS,IDEV)
+300       CONTINUE
+        END IF
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE SQPF5(IDIM2,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,PR,MINP,IOK)
+      CHARACTER HELPF*(*)
+      REAL MININ,MAXIN
+      IOK = 1
+      MININ = 1.
+      MAXIN = 100.
+      CALL GETRL(MININ,MAXIN,PR,'Percent match',VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PR = VALUE
+      PRR = PR/100.
+      XIDIM2 = REAL(IDIM2)
+      XIDIM2 = XIDIM2 * PRR
+      MINP = NINT(XIDIM2)
+      IOK = 0
+      END
+      SUBROUTINE GETRKB(SEQ,ID,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,
+     +IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER SEQ(ID),A
+      IOK = 1
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Define search strings by typing a string name',
+     +/,' followed by the string(s)')
+      JD = 1
+1     CONTINUE
+      L = 0
+      CALL GETSTR('Name',A,SEQ(JD),20,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+         CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+         GO TO 1
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) GO TO 50
+      JD = JD + L
+      SEQ(JD) = '/'
+      JD = JD + 1
+2       CONTINUE
+         L = 0
+         CALL GETSTR('String(s)',A,SEQ(JD),75,L,KBOUT,KBIN,INFLAG)
+         IF(INFLAG.EQ.1) THEN
+           CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+           GO TO 2
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.3) GO TO 50
+C        IF(L.EQ.0) GO TO 2
+        DO 40 I = JD, JD + L - 1
+          IF(SEQ(I).EQ.' ') SEQ(I) = '/'
+40      CONTINUE
+        JD = JD + L
+        SEQ(JD) = '/'
+        SEQ(JD+1) = '/'
+        JD = JD + 2
+      GO TO 1
+50    CONTINUE
+      ID = JD - 1
+      IOK = 0
+      END
+      SUBROUTINE BUBDEL(LIST,LISTEL,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LISTEL(IDIM)
+      I=0
+      J=0
+10    CONTINUE
+      IF(J.GT.I)I=J
+      I=I+1
+      IF(I.EQ.IDIM)RETURN
+20    CONTINUE
+      IF(LIST(I).GE.LIST(I+1))GO TO 10
+      IF(J.LT.I)J=I
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+      ITEMP=LISTEL(I)
+      LISTEL(I)=LISTEL(I+1)
+      LISTEL(I+1)=ITEMP
+      IF(I.GT.1)I=I-1
+      GO TO 20
+      END
+      SUBROUTINE GETNAM(NAMLST,IDNLST,NAMES,IDNAML,NAMEP,NAMLEN,
+     +MAXEN,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+C 2-9-91 Added tupper to deal with lower case enzyme names
+C   routine to return namlst with numbers of wanted enzymes
+      CHARACTER NAMES(IDNAML),NAME*20,NEWNAM*20,HELPF*(*),TUPPER
+      INTEGER NAMLST(IDNLST),NAMEP(MAXEN),NAMLEN(MAXEN)
+      EXTERNAL TUPPER
+      NENZ=0
+10    CONTINUE
+      LENNAM = 0
+      CALL GTSTR('Name',NAME,NEWNAM,LENNAM,KBOUT,KBIN,INFLAG)
+      IF(LENNAM.EQ.0) GO TO 50
+      CALL CCASE(NEWNAM,1)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) THEN
+        IDNLST = 0
+        RETURN
+      END IF
+      NAME = NEWNAM
+C   search for it in names
+      DO 20 I=1,MAXEN
+        IF(NAMLEN(I).NE.LENNAM)GO TO 20
+C  name of correct length, do its chars match?
+        DO 19 J=1,LENNAM
+         IF(TUPPER(NAMES(NAMEP(I)+J-1)).NE.NAME(J:J))GO TO 20
+19      CONTINUE
+C   must match
+        NENZ=NENZ+1
+        IF(NENZ.GT.IDNLST)THEN
+          WRITE(KBOUT,1003)
+1003      FORMAT(' Too many names selected')
+          RETURN
+        END IF
+        NAMLST(NENZ)=I
+        GO TO 10
+20    CONTINUE
+C   no name match
+      WRITE(KBOUT,1004)NAME(1:LENNAM)
+1004  FORMAT(' ',A,' not found in file')
+      GO TO 10
+50    CONTINUE
+      IDNLST=NENZ
+      END
+      SUBROUTINE PLSRCH(J1,J2,
+     +PSAVE,IFOUND,IBH,LEVEL,NAME,NAMLEN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C   AUTHOR: RODGER STADEN
+      CHARACTER NAME*(*)
+      INTEGER PSAVE(IFOUND)
+      CALL VECTOM
+      XMAX=J2
+      XMIN=J1
+      YMAX=MARGT
+      YMIN=0.
+      Y=LEVEL
+      YP=Y+IBH
+      CALL TEXT(XMIN,Y,NAME,NAMLEN,0
+     +,XMAX,XMIN,YMAX,YMIN,8,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX,Y,Y,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,
+     +ISXMAX,ISYMAX)
+      DO 10 I=1,IFOUND
+        X=PSAVE(I)
+        CALL LINE(X,X,Y,YP,XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX)
+10    CONTINUE
+C   now increase level ready for next entry
+      LEVEL=LEVEL+IBH
+      END
+C    PSRCHX
+      SUBROUTINE PSRCHX(LEVEL1,LEVEL,KBIN,KBOUT,IQUIT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CALL VT100M
+      CALL BPAUSE(KBIN,KBOUT,IQUIT)
+      IF(IQUIT.NE.0) RETURN
+      CALL YESNO(IQUIT,'Restart plotting from bottom of frame',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CALL VECTOM
+      CALL CLEARV
+      IF(IQUIT.EQ.1)RETURN
+      CALL BPAUSE(KBIN,KBOUT,IQUIT)
+      IF(IQUIT.NE.0) RETURN
+      CALL CLEARG
+      LEVEL=LEVEL1
+      END
+      SUBROUTINE BUBBLE(LIST,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM)
+      I=0
+      J=0
+10    CONTINUE
+      IF(J.GT.I)I=J
+      I=I+1
+      IF(I.EQ.IDIM)RETURN
+20    CONTINUE
+      IF(LIST(I).LE.LIST(I+1))GO TO 10
+      IF(J.LT.I)J=I
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+      IF(I.GT.1)I=I-1
+      GO TO 20
+      END
+C    PLOTMAP
+      SUBROUTINE PLTMAP(IDEV,FILNAM,IDIM,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IS,IE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),FEATUR*8,STRAND,DESCRP(38)
+      CHARACTER ATOS*38
+      EXTERNAL IEMBL
+      EXTERNAL ATOS
+      PARAMETER (IBLIPH=256)
+      CALL SHOWFU(KBOUT,
+     +'Display a map using an EMBL feature table file')
+      XMAX=IE
+      XMIN=IS
+      YMIN=0.
+      YMAX=ISYMAX
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Map file name',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+10    CONTINUE
+      REWIND IDEV
+      LIN = 3
+      CALL GTSTR('Feature code','CDS',FEATUR,LIN,
+     +KBOUT,KBIN,INFLAG)
+      IF(LIN.EQ.0) FEATUR(1:3) = 'CDS'
+      CALL CCASE(FEATUR,1)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) GO TO 900
+1006  FORMAT(A)
+      IF(FEATUR.EQ.' ') GO TO 900
+      ISTRND = 1
+      CALL GSTRND(ISTRND,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      ISTRND = ISTRND - 1
+      IF(ISTRND.LT.0) GO TO 900
+      STRAND=' '
+      IF(ISTRND.EQ.1)STRAND='C'
+      IF(ISTRND.EQ.2)STRAND='B'
+C7     WRITE(KBOUT,1001)
+C1001  FORMAT(' ? SEQUENCE LEVEL IN DRAWING BOARD UNITS(DEF=0)=',$)
+C      READ(KBIN,1002,ERR=7)IBASE
+C      IF(IBASE.EQ.-99)THEN
+C        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+C        GO TO 7
+C      END IF
+C      IF(IBASE.LT.0)RETURN
+C      IF(IBASE.GT.MARGT)IBASE=0
+      IY = IBLIPH
+      MININ = 0
+      MAXIN = MARGT
+      CALL GETINT(MININ,MAXIN,IY,
+     +'level',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 900
+      IY = IVAL
+C      YBASE=IBASE
+      YF=IY
+      CALL CLEARV
+      CALL VECTOM
+C   NEED BARS AT ENDS OF FEATURES FROM BLIPB TO BLIPT
+      BLIPB=YF-IBLIPH/2
+      BLIPT=YF+IBLIPH/2
+C   FEATURE NAME PUT AT YT
+      YT=YF+IBLIPH/2
+C      CALL LINE(XMIN,XMAX,YBASE,YBASE,XMAX,XMIN,YMAX,YMIN,
+C     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+20    CONTINUE
+      LEMBL=IEMBL(IDEV,'FT',FEATUR,STRAND,IS,IE,IPOSNL,IPOSNR,0,
+     +DESCRP,KBOUT,JSTRAN)
+      IF(LEMBL.EQ.0)THEN
+        POSNL=IPOSNL
+        POSNR=IPOSNR
+        XMID=POSNL+(POSNR-POSNL)/2.
+        CALL LINE(POSNL,POSNR,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(POSNL,POSNL,BLIPB,BLIPT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(POSNR,POSNR,BLIPB,BLIPT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL TEXT(XMID,YT,ATOS(DESCRP,1),1,
+     +  0,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        GO TO 20
+      END IF
+      CALL VT100M
+      GO TO 10
+900   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+C    IEMBL
+      INTEGER FUNCTION IEMBL(IDEV,SCODE,FEATUR,STRAND,IS,IE,
+     +POSNL,POSNR,IWRITE,DESCR,KBOUT,JSTRAN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER LINE(80),FEATUR*(*),STRAND,DESCR(38)
+      CHARACTER KEYNAM(8),LFTEND(6),RITEND(6),DIR(3),DESCRP(38)
+      CHARACTER*2 CODE,SCODE
+      INTEGER POSNL,POSNR,IWRITE
+      EQUIVALENCE (LINE(1),CODE)
+      EQUIVALENCE (LINE(6),KEYNAM)
+      EQUIVALENCE (LINE(15),LFTEND)
+      EQUIVALENCE (LINE(22),RITEND)
+      EQUIVALENCE (LINE(29),DIR)
+      EQUIVALENCE (LINE(35),DESCRP)
+C
+C   READS UNTIL FINDS WANTED LINE TYPE (SCODE)
+C  OR END OF TEXT THIS SEQUENCE, OR END OF FILE
+C  RETURN CODES ARE 0,1,2 RESPECTIVELY
+C  WRITES ALL LINES, WANTED LINE, OR NONE CODE IWRITE=2,1,0 RESPECTIVELY
+C  JSTRAN IS A NUMBER DENOTING THE STRAND FOUND (0,1, (3=ERROR IN LIB))
+C   SET FLAG FOR MORE DATA
+      IEMBL=0
+10    CONTINUE
+      READ(IDEV,1000,END=600)LINE
+1000  FORMAT(80A1)
+      IF(IWRITE.EQ.2)WRITE(KBOUT,2000)LINE
+2000  FORMAT(' ',80A1)
+      IF(CODE.EQ.'SQ')GO TO 550
+      IF(CODE.NE.SCODE)GO TO 10
+      IF(IWRITE.EQ.1)WRITE(KBOUT,2000)LINE
+      IF(SCODE.NE.'FT')RETURN
+50    CONTINUE
+      DO 51 J=1,8
+        IF(KEYNAM(J).NE.FEATUR(J:J))GO TO 10
+51    CONTINUE
+      IF(STRAND.NE.'B')THEN
+        IF(DIR(2).NE.STRAND)GO TO 10
+      END IF
+C  SET FOUND STRAND
+      JSTRAN=3
+      IF(DIR(2).EQ.'C')JSTRAN=1
+      IF(DIR(2).EQ.' ')JSTRAN=0
+      DO 52 J=1,38
+        DESCR(J)=DESCRP(J)
+52    CONTINUE
+C   GET POSITIONS
+C   <1 ?
+      IF(LFTEND(5).EQ.'<')THEN
+        POSNL=1
+      ELSE
+        POSNL=IFROMC(LFTEND,6,KBOUT)
+      END IF
+C   MAY CONTAIN > SO NEED TO CHECK EACH POSITION!
+      DO 56 K=1,6
+        KK=7-K
+        IF(RITEND(KK).EQ.'>')GO TO 57
+56    CONTINUE
+      KK=0
+57    CONTINUE
+C   MAY CONTAIN < !
+      IF(RITEND(5).EQ.'<')THEN
+        POSNR=1
+      ELSE
+C   THE > IS AT POSITION KK
+        POSNR=IFROMC(RITEND(KK+1),6-KK,KBOUT)
+      END IF
+C   COMPLEMENTARY STRAND?
+      IF(DIR(2).EQ.'C')THEN
+        I=POSNL
+        POSNL=POSNR
+        POSNR=I
+      END IF
+C   in range?
+      IF(POSNR.LT.IS)GO TO 10
+      IF(POSNL.GT.IE)GO TO 10
+      RETURN
+550   CONTINUE
+C   SET FLAG FOR END OF THIS SEQUENCE'S HEADING
+      IEMBL=1
+      RETURN
+600   CONTINUE
+C   SET FLAG FOR END OF THIS LIBRARY
+      IEMBL=2
+      END
+C      SQFIT
+      SUBROUTINE SPFIT(SEQ,IDIM1,STRING,IDIM2,ITOT,ITOTEL,ITOTID,
+     +IS,IE,MINP,ITOTP,SCORES,IDSCOR,KSTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOT(ITOTID),ITOTEL(ITOTID)
+      INTEGER SCORES(IDSCOR,IDSCOR)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+C
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT=NTOT+SCORES(DTONUM(SEQ(IP)),DTONUM(STRING(J)))
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINP)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+      SUBROUTINE GETSCR(STRING,IDIM2,SCORES,IDSCOR,SMAX,SMIN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STRING(IDIM2)
+      INTEGER SCORES(IDSCOR,IDSCOR),SMAX,SMIN,DTONUM
+      EXTERNAL DTONUM
+      SMAX=0
+      SMIN=IDIM2*SCORES(IDSCOR,IDSCOR)
+      DO 2 I=1,IDIM2
+         SMAX=SMAX+SCORES(DTONUM(STRING(I)),DTONUM(STRING(I)))
+2     CONTINUE
+      END
+      SUBROUTINE GETC(TOT,SUM,LINE,IDM,MAXLEN,IDEV,KBOUT,LENGTH,
+     +IOK)
+      INTEGER TOT(MAXLEN),SUM(IDM,MAXLEN)
+      CHARACTER LINE(MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IOK = 1
+      DO 2 I=1,MAXLEN
+        TOT(I)=0
+        DO 1 J=1,IDM
+          SUM(J,I)=0
+1       CONTINUE
+2     CONTINUE
+      N=0
+10    CONTINUE
+1003  FORMAT(1X,120A1)
+1004  FORMAT(' ',I6,' ',120A1)
+      READ(IDEV,1003,END=100)LINE
+      N=N+1
+      WRITE(KBOUT,1004)N,LINE
+      DO 20 I=1,MAXLEN
+        IF(LINE(I).EQ.' ')GO TO 10
+        SUM(CTONUM(LINE(I)),I)=SUM(CTONUM(LINE(I)),I)+1
+20    CONTINUE
+      GO TO 10
+100   CONTINUE
+      IF(N.EQ.0)THEN
+        WRITE(KBOUT,*)' Empty file of aligned sequences'
+        RETURN
+      END IF
+C   NOW FIND LENGTH OF MOTIF
+      DO 40 I=1,MAXLEN
+        K=0
+        L=I
+        DO 30 J=1,IDM
+          K=K+SUM(J,I)
+30      CONTINUE
+        IF(K.EQ.0)GO TO 50
+        TOT(I)=TOT(I)+K
+40    CONTINUE
+50    CONTINUE
+      LENGTH=L-1
+      IOK = 0
+      END
+      SUBROUTINE GETW(TOT,SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER TOT(LENGTH),SUM(MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+          FREQ(J,I)=LOG((REAL(SUM(J,I)+1)/REAL(TOT(I)+MAXCHR)))
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE GETW2(SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER SUM(MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+          FREQ(J,I)=REAL(SUM(J,I))
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE WRTSCM(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +TOT,SUM,CHRSET,IDEV,MAXLEN)
+      INTEGER TOT(LENGTH),SUM(IDM,MAXLEN)
+      CHARACTER CHRSET(IDM),TITLE*(*)
+C   PROTEIN MATRICES DONT WRITE ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      MINUS = 1
+      IF(IDM.EQ.26)MINUS = 4
+      WRITE(IDEV,1018)TITLE
+1018  FORMAT(' ',A)
+1019  FORMAT(' P',20I4)
+1020  FORMAT(' N',20I4)
+1021  FORMAT(' ',A,20I4)
+1022  FORMAT(' ',2I6,2F10.3)
+      WRITE(IDEV,1022)LENGTH,MIDDLE,BOT,TOP
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 400 J=1,NLINES
+        K2=MIN((K1+19),LENGTH)
+        WRITE(IDEV,1019)(K,K=K1-MIDDLE,K2-MIDDLE)
+        WRITE(IDEV,1020)(TOT(K),K=K1,K2)
+        DO 390 I=1,IDM-MINUS
+          WRITE(IDEV,1021)CHRSET(I),(SUM(I,K),K=K1,K2)
+390     CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+400   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE RDWMTN(TOT,WT,MIDDLE,
+     +LENGTH,MAXLEN,YMIN,YMAX,IDEV,IFAIL,IDM,KBOUT,IPROB)
+C   AUTHOR: RODGER STADEN
+C  Same as rdwmt except for file closing
+      INTEGER WT(IDM,MAXLEN),TOT(MAXLEN)
+      CHARACTER LINE*79
+C   PROTEIN MATRICES DONT READ ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      MINUS = 1
+      IF(IDM.EQ.26)MINUS = 4
+C   SET FAIL FLAG
+      IFAIL=1
+1000  FORMAT( )
+1001  FORMAT(2X,20I4)
+1003  FORMAT(A)
+1002  FORMAT(2X,2I6,2F10.3)
+1004  FORMAT(' ',A)
+      DO 10 I = 1,MAXLEN
+        TOT(I) = 0
+        DO 5 J = 1,IDM
+          WT(J,I) = 0
+5       CONTINUE
+10    CONTINUE
+C   READ TITLE
+      READ(IDEV,1003,END=100,ERR=100)LINE
+      IF(IPROB.EQ.0) WRITE(KBOUT,1004)LINE
+C   READ PLOT VALUES ETC
+      READ(IDEV,1002,ERR=100,END=100)
+     +LENGTH,MIDDLE,YMIN,YMAX
+C   HOW MANY LINES TO READ?
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 50 J=1,NLINES
+C   READ POSITION
+        READ(IDEV,1000,END=100,ERR=100)
+        K2=MIN((K1+19),LENGTH)
+C   READ TOTALS
+C        READ(IDEV,1001,ERR=100,END=100)(TOT(K),K=K1,K2)
+        READ(IDEV,1001,ERR=100,END=100)
+C   READ COUNTS
+        DO 25 I=1,IDM-MINUS
+          READ(IDEV,1001,ERR=100,END=100)(WT(I,K),K=K1,K2)
+C  ALLOW TOTALS IN FILE TO BE WRONG!
+          DO 24 K = K1,K2
+            TOT(K) = TOT(K) + WT(I,K)
+24        CONTINUE
+25      CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+50    CONTINUE
+C   SET FAIL FLAG TO GOOD
+      IFAIL=0
+      RETURN
+100   CONTINUE
+      END
+      SUBROUTINE GETWC(TOT,SUM,LENGTH,MAXCHR,MAXLEN,
+     +CEXACT,PEXACT,IEXACT)
+      INTEGER TOT(LENGTH),SUM(MAXCHR,MAXLEN)
+      INTEGER PEXACT(MAXLEN),CEXACT(MAXLEN)
+C routine to find 100% conserved residues in wt matrices
+      IEXACT = 0
+      DO 70 I=1,LENGTH
+        IF(TOT(I).NE.0) THEN
+          DO 60 J=1,MAXCHR
+            IF(TOT(I).EQ.SUM(J,I)) THEN
+              IEXACT = IEXACT + 1
+              CEXACT(IEXACT) = J
+              PEXACT(IEXACT) = I
+              GO TO 61
+            END IF
+60        CONTINUE
+61        CONTINUE
+        END IF
+70    CONTINUE
+      END
+      INTEGER FUNCTION MATWTC(SEQ,IDSEQ,J1,J2,K1,
+     +CEXACT,PEXACT,IEXACT)
+C find first position in seq where the chars (in index form)
+C contained in cexact, and with relative positions pexact, is found
+C else return end +1
+      CHARACTER SEQ(IDSEQ)
+      INTEGER PEXACT(IEXACT),CEXACT(IEXACT)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      MATWTC = J2 + 1
+      DO 100 I=K1,J2-PEXACT(IEXACT)+1
+        DO 50 J=1,IEXACT
+          IF(CTONUM(SEQ(I+PEXACT(J)-1)).NE.CEXACT(J)) GO TO 51
+50      CONTINUE
+        MATWTC = I
+        RETURN
+51      CONTINUE
+100   CONTINUE
+      END
+      SUBROUTINE RDWMT(TOT,WT,MIDDLE,
+     +LENGTH,MAXLEN,YMIN,YMAX,IDEV,IFAIL,IDM,KBOUT,IPROB)
+C   AUTHOR: RODGER STADEN
+      INTEGER WT(IDM,MAXLEN),TOT(MAXLEN)
+      CHARACTER LINE*79
+C   PROTEIN MATRICES DONT READ ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      MINUS = 1
+      IF(IDM.EQ.26)MINUS = 4
+C   SET FAIL FLAG
+      IFAIL=1
+1000  FORMAT( )
+1001  FORMAT(2X,20I4)
+1003  FORMAT(A)
+1002  FORMAT(2X,2I6,2F10.3)
+1004  FORMAT(' ',A)
+      DO 10 I = 1,MAXLEN
+        TOT(I) = 0
+        DO 5 J = 1,IDM
+          WT(J,I) = 0
+5       CONTINUE
+10    CONTINUE
+C   READ TITLE
+      READ(IDEV,1003,END=100,ERR=100)LINE
+      IF(IPROB.EQ.0) WRITE(KBOUT,1004)LINE
+C   READ PLOT VALUES ETC
+      READ(IDEV,1002,ERR=100,END=100)
+     +LENGTH,MIDDLE,YMIN,YMAX
+C   HOW MANY LINES TO READ?
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 50 J=1,NLINES
+C   READ POSITION
+        READ(IDEV,1000,END=100,ERR=100)
+        K2=MIN((K1+19),LENGTH)
+C   READ TOTALS
+C        READ(IDEV,1001,ERR=100,END=100)(TOT(K),K=K1,K2)
+        READ(IDEV,1001,ERR=100,END=100)
+C   READ COUNTS
+        DO 25 I=1,IDM-MINUS
+          READ(IDEV,1001,ERR=100,END=100)(WT(I,K),K=K1,K2)
+C  ALLOW TOTALS IN FILE TO BE WRONG!
+          DO 24 K = K1,K2
+            TOT(K) = TOT(K) + WT(I,K)
+24        CONTINUE
+25      CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+50    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   SET FAIL FLAG TO GOOD
+      IFAIL=0
+      RETURN
+100   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE MKWT(FREQ,SUM,TOT,CHRSET,IDM,MAXLEN,
+     +IDEV2,IDEV3,KBIN,KBOUT,LINE,
+     +FILNAM,IOPT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER LINE(MAXLEN),TITLE*60,CHRSET(IDM)
+      INTEGER SUM(IDM,MAXLEN),TOT(MAXLEN)
+      REAL FREQ(IDM,MAXLEN)
+      IOK = 1
+      IF(IOPT.EQ.3)THEN
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Name of existing weight matrix file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,
+     +  CUTMIN,CUTMAX,IDEV2,
+     +  IOK,IDM,KBOUT,0)
+        IF(IOK.NE.0) RETURN
+      END IF
+      CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Name of aligned sequences file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(IOPT.EQ.2) THEN
+        CALL GETC(TOT,SUM,LINE,IDM,MAXLEN,IDEV2,
+     +  KBOUT,LENGTH,IOK)
+        IF(IOK.NE.0)RETURN
+      END IF
+      WRITE(KBOUT,1006)LENGTH
+1006  FORMAT(' Length of motif',I6)
+      IOK = 1
+      CALL YESNO(IOPT,'Sum logs of weights',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.0) RETURN
+      CUTMIN = -10.0
+      IF(IOPT.EQ.1)CUTMIN = 10.
+      CALL MASKW(SUM,LENGTH,IDM,MAXLEN,KBIN,KBOUT,TITLE,
+     +IOPT,IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOPT.LT.0) RETURN
+C     NOW CALC WEIGHTS
+      IF(CUTMIN.LT.0.0)CALL GETW(TOT,SUM,FREQ,LENGTH,IDM,MAXLEN)
+      IF(CUTMIN.GE.0.0)CALL GETW2(SUM,FREQ,LENGTH,IDM,MAXLEN)
+C     NOW APPLY THE WEIGHTS
+      REWIND IDEV2
+      CALL APPLWT(FREQ,IDM,LENGTH,IDEV2,IDEV3,KBIN,KBOUT,LINE,MAXLEN,
+     +BOT,TOP,TITLE,MIDDLE,IOK)
+      CLOSE(UNIT=IDEV2)
+      IF(IOK.NE.0) RETURN
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for new weight matrix file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL WRTSCM(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +TOT,SUM,CHRSET,IDEV2,MAXLEN)
+      CLOSE(UNIT=IDEV2)
+      RETURN
+      END
+      SUBROUTINE MASKW(SUM,LENGTH,IDM,MAXLEN,KBIN,KBOUT,MASK,
+     +IOPT,IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER SUM(IDM,MAXLEN)
+      CHARACTER MASK*(*),HELPF*(*)
+      CALL YESNO(IOPT,'Use all motif positions',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.0) RETURN
+5     CONTINUE
+      IF(IOPT.EQ.1)THEN
+        WRITE(KBOUT,1002)
+1002    FORMAT(' x means use, - means ignore',/,
+     +         ' e.g. xx-x---x-x means use positions 1,2,4,8,10')
+        LIN = 0
+        CALL GTSTR('Mask',' ',MASK,LIN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 5
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(LIN.EQ.0) RETURN
+        DO 70 I=1,LENGTH
+          IF(MASK(I:I).EQ.'-')THEN
+            DO 60 J=1,IDM
+              SUM(J,I) = 0
+60          CONTINUE
+          END IF
+70      CONTINUE
+      END IF
+      END
+      SUBROUTINE APPLWT(FREQ,IDM,LENGTH,IDEV,IDEV3,KBIN,KBOUT,
+     +LINE,MAXLEN,BOT,TOP,TITLE,MIDDLE,IOK)
+      REAL FREQ(IDM,MAXLEN)
+      CHARACTER LINE(MAXLEN),TITLE*(*)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IOK = 1
+      N=0
+      TOP=-99999.
+      BOT=9999999.
+      WRITE(KBOUT,*)' Applying weights to input sequences'
+      SMEAN = 0.
+      SUMSQ = 0.
+1003  FORMAT(1X,120A1)
+1004  FORMAT(' ',I4,' ',F12.3,' ',120A1)
+200   CONTINUE
+      READ(IDEV,1003,END=300)LINE
+      N=N+1
+      SCORE=0.
+      DO 210 I=1,LENGTH
+        SCORE=SCORE+FREQ(CTONUM(LINE(I)),I)
+210   CONTINUE
+      WRITE(IDEV3,1004)N,SCORE,(LINE(K),K=1,LENGTH)
+      IF(SCORE.GT.TOP)TOP=SCORE
+      IF(SCORE.LT.BOT)BOT=SCORE
+      SMEAN=SMEAN+SCORE
+      SUMSQ=SUMSQ+SCORE*SCORE
+      GO TO 200
+300   CONTINUE
+      IF(N.LT.1)THEN
+        WRITE(KBOUT,*)' Error: empty sequence file'
+        RETURN
+      END IF
+      SMEAN=SMEAN/N
+      SM=SMEAN
+      SMEAN=SMEAN*SMEAN
+      SUMSQ=SUMSQ/N
+      SD = 0.
+      T = SUMSQ - SMEAN
+      IF(T.GT.0.)SD = SQRT(T)
+      SMM3=SM-3*SD
+      SMP3=SM+3*SD
+      WRITE(KBOUT,1000)TOP,BOT
+1000  FORMAT(' Top score',F12.3,'  Bottom score',F12.3)
+      WRITE(KBOUT,1001)SM,SD
+1001  FORMAT(' Mean',F12.3,'  Standard deviation',F12.3)
+      WRITE(KBOUT,1002)SMM3,SMP3
+1002  FORMAT(' Mean minus 3.sd',F12.3,'  Mean plus 3.sd',F12.3)
+      BOT=SMM3
+      TOP=SMP3
+      XMN = -999.
+      XMX = 9999.
+      CALL GETRL(XMN,XMX,BOT,'Cutoff score',VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      BOT = VAL
+      XMN = BOT
+      XMX = 999.
+      CALL GETRL(XMN,XMX,TOP,'Top score for scaling plots',
+     +VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      TOP = VAL
+      MN = 0
+      MX = LENGTH
+      MIDDLE = 1
+      CALL GETINT(MN,MX,MIDDLE,'Position to identify',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+       MIDDLE = IVAL
+305   CONTINUE 
+      LIN = 0
+      CALL GTSTR('Title',' ',TITLE,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 305
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IOK = 0
+      END
+      SUBROUTINE GETWM(TOT,SUM,FREQ,LENGTH,MATRIX,MAXCHR,MAXLEN)
+      INTEGER TOT(LENGTH),SUM(MAXCHR,MAXLEN),MATRIX(MAXCHR,MAXCHR)
+      REAL FREQ(MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        IPROD = 0
+        DO 60 J=1,MAXCHR
+          DO 50 K = 1,MAXCHR
+            IPROD = IPROD + MATRIX(J,K) * (SUM(J,I)+1)
+50        CONTINUE
+60      CONTINUE
+        TOT(I) = IPROD
+70    CONTINUE
+      DO 170 I=1,LENGTH
+        DO 160 J=1,MAXCHR
+          IPROD = 0
+          DO 150 K = 1,MAXCHR
+            IPROD = IPROD + MATRIX(J,K) * (SUM(J,I)+1)
+150       CONTINUE
+          FREQ(J,I)=LOG(REAL(IPROD)/REAL(TOT(I)))
+160     CONTINUE
+170   CONTINUE
+      END
+      SUBROUTINE GETW1(SUM,FREQ,LENGTH,MAXCHR,MAXLEN)
+      INTEGER SUM(MAXCHR,MAXLEN)
+      REAL FREQ(MAXCHR,MAXLEN)
+      DO 70 I=1,LENGTH
+        DO 60 J=1,MAXCHR
+          FREQ(J,I) = 0.
+          IF(SUM(J,I).GT.0)FREQ(J,I) = 1.
+60      CONTINUE
+70    CONTINUE
+      END
+      SUBROUTINE MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +HELPS,HELPE,MAXOPT,HELPF,IDEVH,KBIN,KBOUT)
+C   AUTHOR RODGER STADEN
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT),HELPS,HELPE
+      CHARACTER HELPF*(*)
+      CALL SHOWFU(KBOUT,'Reset plot positions')
+      NOPT = 0
+      CALL GETINT(0,MAXOPT,NOPT,'Number of option to reposition',
+     +IVAL,KBIN,KBOUT,HELPS,HELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.EQ.0) RETURN
+      NOPT = IVAL
+      CALL MARGC1(ISXMAX,ISYMAX,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +HELPS,HELPE,HELPF,IDEVH,KBIN,KBOUT)
+      RETURN
+      END
+      SUBROUTINE SQPF3(STRING,NEW,MAXSTR,LENGTH,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STRING(MAXSTR),NEW(MAXSTR)
+      IOK = 1
+10    CONTINUE
+      LIN = LENGTH
+      CALL GETSTR('String',STRING,NEW,MAXSTR,LIN,KBOUT,KBIN,INFLAG)
+      IF((LIN.LT.1).AND.(LENGTH.LT.1)) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.GT.0)THEN
+        CALL SQCOPY(NEW,STRING,LIN)
+        LENGTH = LIN
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE SQPF2(SEQ2,IDIM3,STRING,IDIM2I,IDIM2,I1,I2,
+     +KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ2(IDIM3),STRING(IDIM2I)
+      IOK = 1
+      MININ = 1
+      MAXIN = IDIM3
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Define string ends')
+      CALL GETINT(MININ,MAXIN,I1,'Start',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.NE.I1) THEN
+        I2 = IVAL + 10
+      END IF
+      I1 = IVAL
+      MININ = I1 + 1
+      MAXIN = I1 + IDIM2I - 1
+      CALL GETINT(MININ,MAXIN,I2,'End',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      I2 = IVAL
+      IDIM2 =I2 - I1 + 1
+      CALL SQCOPY(SEQ2(I1),STRING,IDIM2)
+      WRITE(KBOUT,1001)(STRING(K),K=1,IDIM2)
+1001  FORMAT(' string=',50A1)
+      IOK = 0
+      END
+      SUBROUTINE SQPF1(SEQ2,IDIM3I,IDIM3,
+     +IDEV,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ2(IDIM3I),FILNAM*(*)
+      IDIM3 = IDIM3I
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'String file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      IOK = 1
+      CALL ARRFIL(IDEV,SEQ2,IDIM3,KBOUT)
+      CLOSE(UNIT=IDEV)
+      IF(IDIM3.LT.1)RETURN
+      IOK = 0
+      END
+      SUBROUTINE SQPFD1(SEQ2,IDIM3I,IDIM3,ANSTY,IDEVIN,FILE2,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),FILE2*(*)
+      CHARACTER SEQ2(IDIM3I)
+      INTEGER ANSTY,CHOICE
+      CHOICE = ANSTY
+      IOK = 1
+      CALL YESONO(CHOICE,'Type in string','Extract string from file',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(CHOICE.LT.0) RETURN
+      ANSTY = CHOICE
+      IF(ANSTY.EQ.1)THEN
+        CALL SQPF1(SEQ2,IDIM3I,IDIM3,
+     +  IDEVIN,FILE2,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      END IF  
+      IOK = 0
+      END
+      SUBROUTINE GSTRND(IVAL,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXPRM = 13)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      PROMPT(1) = '+ strand only'
+      PROMPT(2) = '- strand only'
+      PROMPT(3) = 'Both strands'
+      CALL RADION('Select strands',PROMPT,3,IVAL,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      END
diff --git a/src/staden/bsplit.f b/src/staden/bsplit.f
new file mode 100644
index 0000000..2f327bf
--- /dev/null
+++ b/src/staden/bsplit.f
@@ -0,0 +1,62 @@
+      SUBROUTINE FMAIN()
+      CHARACTER SEQ1(10000),SEQ2(10000)
+      CHARACTER*40 FILNAM,HELPF*40
+      INTEGER DEVNOS(2)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,2)
+      WRITE(KBOUT,*)'  bsplit v2.2'
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT,
+     +'Name of gelinb file',
+     + IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)STOP
+      IDIM=10000
+      CALL ARRFIL(DEVNOS(1),SEQ1,IDIM,KBOUT)
+      CLOSE(UNIT=1)
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(2),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for new file of filenames',
+     + IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)STOP
+C   REMOVE UNWANTED CHARS note this is very dangerous and is only for
+C                         bbc data that has spurious characters. When 
+C                         we move to mac only we should skip this
+      I2=0
+      DO 100 I=1,IDIM
+        IF((LGE(SEQ1(I),'-')).AND.(LLE(SEQ1(I),'Z')) .OR.
+     +     (LGE(SEQ1(I),'a')).AND.(LLE(SEQ1(I),'z'))) THEN
+          I2=I2+1
+          SEQ2(I2)=SEQ1(I)
+        END IF
+100   CONTINUE
+      IDIM2=I2
+      IP1=1
+      WRITE(KBOUT,1005)IDIM-IDIM2
+1005  FORMAT(' number of illegal characters removed',I6)
+200   CONTINUE
+      CALL CHARCT(SEQ2,IDIM2,IP1,NCHAR)
+      IF(NCHAR.GT.0)THEN
+        FILNAM(1:40)=' '
+        DO 210 I=1,NCHAR
+          FILNAM(I:I)=SEQ2(IP1+I-1)
+210     CONTINUE
+        WRITE(KBOUT,1001)FILNAM
+1001    FORMAT(' creating file ',A)
+        CALL OPENRS(DEVNOS(1),FILNAM,LRECL,IOK,1)
+      IF(IOK.NE.0)STOP
+      WRITE(DEVNOS(2),1004)FILNAM
+1004  FORMAT(A)
+      END IF
+      IP1=IP1+NCHAR+1
+      IF(IP1.LT.IDIM2)THEN
+        CALL CHARCT(SEQ2,IDIM2,IP1,NCHAR)
+        IF(NCHAR.GT.0)THEN
+          CALL FMTDK(DEVNOS(1),SEQ2(IP1),NCHAR)
+        END IF
+        WRITE(KBOUT,1002)NCHAR
+1002    FORMAT(' length',I6)
+        IP1=IP1+NCHAR+1
+        IF(IP1.LT.IDIM2)GO TO 200
+      END IF
+      CLOSE(DEVNOS(2))
+      CLOSE(DEVNOS(1))
+      END
diff --git a/src/staden/conEdit.f b/src/staden/conEdit.f
new file mode 100644
index 0000000..7105aa7
--- /dev/null
+++ b/src/staden/conEdit.f
@@ -0,0 +1,34 @@
+      SUBROUTINE CONEDT(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,
+     +IDEVT,IDEVC,LINLEN,FILNAM,LLINO,IOK,IERR,TEMP1)
+C   AUTHOR: SIMON DEAR
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2)
+C   DUMMY ROUTINE FOR NON X VERSIONS
+      END
+      SUBROUTINE JOINED(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,
+     +IDEVT,IDEVC,LINLEN,FILNAM,LLINO,IOK,IERR,TEMP1)
+C   AUTHOR: SIMON DEAR
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2)
+C   DUMMY ROUTINE FOR NON X VERSIONS
+      END
+      SUBROUTINE JXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +IDBSIZ,LR,NR,LL,NL,
+     +I,IE,J,IE2,P,NG,NC,IDM,IEXT,IOK)
+C   DUMMY ROUTINE FOR NON X VERSIONS
+      END
diff --git a/src/staden/contigEditor.c b/src/staden/contigEditor.c
new file mode 100644
index 0000000..bf0c642
--- /dev/null
+++ b/src/staden/contigEditor.c
@@ -0,0 +1,1244 @@
+/*
+    Title: 	 contigEditor
+
+    File: 	 contigEditor.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: 5 April 1993
+
+*/
+
+
+/*
+*/
+
+#define CHKPNT(W,S) \
+    { XSync(XtDisplay(W),False); fprintf(stderr,"CHKPNT: %s\n",S); }
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>    /* IMPORT: freopen, stdout, stderr, fprintf */
+/*#include <string.h>*/
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+#include <X11/Xaw/SmeLine.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/Toggle.h>
+
+#include "Sheet.h"
+#include "contigEditor.h"
+#include "locks.h"
+#include "main.h"
+#include "select.h"
+#include "tagUtils.h"
+#include "edUtils.h"
+#include "extend.h"
+#include "undo.h"
+#include "fort.h"
+
+
+
+/* ---- Static variables ---- */
+static Widget editorShellWid;	/* main editor shell */
+static Widget mainFormWid;	/* main editor form */
+
+static Widget buttonsWid;
+static Widget exitWid;		/* Leave Editor button */
+static Widget undoWid;		/* Undo last edit command button */
+static Widget supermanWid;	/* switch for Superuser class of edit commands */
+static Widget revealWid;	/* switch for revealing hidden cutoff information */
+#ifdef nana
+static Widget saveWid;	        /* save command button */
+#endif
+static Widget lockWid;	        /* lock button for when joining */
+#ifdef nana
+static Widget joinWid;	        /* lock button for when joining */
+#endif
+static Widget labelWid;	/* label displaying contig name */
+static Widget insertWid;	/* edit mode - insertion */
+static Widget replaceWid;	/* edit mode - replacement */
+static Widget nextWid;          /* search for next problem in consensus */
+
+static Widget disagreeFormWid;	/* for displaying disagreements when joining */
+static Widget dummyWid;
+Widget disagreeWid;
+
+float pcCut;              /* cut off for consensus calculation */
+int editorMode;
+enum States {StateDown=0,StateUp} editorState = StateDown;
+static int lockOffset;
+
+/* save/lock state */
+int_f save_state;
+
+SaveStruct saveState;
+
+DefColours defColours;
+
+
+
+/* ---- My translation tables ---- */
+static char translationTable[] =
+    "\
+	 Ctrl<Btn1Down>:   XawPositionSimpleMenu(edMenu) XtMenuPopup(edMenu)\n\
+	 <Btn1Down>:    StartHighlight() buttonDown()\n\
+	 <Btn1Motion>:  ExtendHighlight()\n\
+	 <Btn1Up>:      MakeSelection()\n\
+	 <Btn2Down>(2): buttonDown() invokeTrace()\n\
+	 <Btn2Down>:    buttonDown()\n\
+	 <Btn3Down>:    ExtendHighlight()\n\
+	 <Btn3Motion>:  ExtendHighlight()\n\
+	 <Btn3Up>:      ExtendHighlight()\n\
+	 Meta<Key>Left:        MetaLeft()\n\
+	 Meta<Key>Right:       MetaRight()\n\
+	 <Key>Right:        caretRight()\n\
+         <Key>Left:         caretLeft()\n\
+         <Key>Down:         caretDown()\n\
+         <Key>Up:           caretUp()\n\
+         <Key>Delete:       deleteKey()\n\
+         <Key>:             keyPress()";
+/*
+  <Btn2Down>:    SelectTag()\n\
+  Meta<Key>Up:          MetaUp()\n\
+  */
+static XtActionsRec actionTable[] = {
+    {"caretRight",      caretRight},
+    {"caretLeft",       caretLeft},
+    {"caretDown",       caretDown},
+    {"caretUp",         caretUp},
+    {"deleteKey",       deleteKey},
+    {"keyPress",        keyPress},
+    {"buttonDown",      buttonDown},
+    {"invokeTrace",     invokeTrace},
+    {"StartHighlight",  start_highlight},
+    {"ExtendHighlight", extend_highlight},
+    {"MakeSelection",   make_selection},
+    {"SelectTag",       select_tag},
+    {"MetaLeft",        meta_left},
+    {"MetaRight",       meta_right},
+    {"MetaUp",          meta_up},
+};
+XtTranslations parsedTTable;
+
+
+static char translationTable2[] =
+    "<Btn1Down>:    selectRead()";
+
+static XtActionsRec actionTable2[] = {
+    {"selectRead",      selectRead},
+};
+XtTranslations parsedTTable2;
+
+
+
+
+
+
+
+
+
+/* ---- Private Functions ---- */
+static int LeaveAllowed;
+
+static void save()
+/*
+** Save Changes 
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    saveDB(
+	xx,
+	saveState.idevr,
+	saveState.idevw,
+	saveState.idevn,
+	saveState.relpg,
+	saveState.lngthg,
+	saveState.lnbr,
+	saveState.rnbr,
+	saveState.maxgel
+    );
+}
+
+static void join()
+/*
+** Join request made
+*/
+{
+    joinDB(
+        saveState.idevr,
+        saveState.idevw,
+        saveState.idevn,
+        saveState.relpg,
+        saveState.lngthg,
+        saveState.lnbr,
+        saveState.rnbr,
+        saveState.maxgel,
+        saveState.llinol,
+        saveState.lnconl,
+        saveState.llinor,
+        saveState.lnconr,
+        saveState.ngels,
+        saveState.nconts,
+        saveState.idbsiz
+    );
+
+}
+
+static void warningYesCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A yes response to the
+** warning about data not saved
+*/
+{
+    Widget popupWid = (Widget) client_data;
+    
+
+    if (editorMode == EDITMODE)
+	save();
+    else
+	join();
+
+    LeaveAllowed = 1;
+    XtDestroyWidget(popupWid);
+}
+
+static void warningNoCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A No response to the
+** warning about data not saved
+*/
+{   Widget popupWid = (Widget) client_data;
+
+    LeaveAllowed = 1;
+    XtDestroyWidget(popupWid);
+}
+
+ static void warningCancelCallback(Widget w,
+				  XtPointer client_data, XtPointer call_data)
+/*
+** A Cancel response to the
+** warning about data not saved
+*/
+{   Widget popupWid = (Widget) client_data;
+
+     LeaveAllowed = 0;
+     XtDestroyWidget(popupWid);
+}
+
+static void createLabelsForBox(Widget parentWid, char *warning)
+{
+    Arg	args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, warning); nargs++;
+    (void) XtCreateManagedWidget("warnPrompt", labelWidgetClass,
+				 parentWid, args, nargs);
+    
+}
+
+static int warnNotSaved(Widget parentWid, String warning)
+/*
+** Warn that changes have been made but contig hasn't been saved.
+*/
+{   Widget warningWid, warnFormWid, warnPromptBox;
+    Widget yesWid, noWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) 0, (Position) height,
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    warningWid = XtCreatePopupShell("warning", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    warnFormWid = XtCreateManagedWidget("warnForm", formWidgetClass,
+				      warningWid, args, nargs);
+
+    nargs = 0;
+    warnPromptBox = XtCreateManagedWidget("warnBox", boxWidgetClass,
+					warnFormWid, args, nargs);
+
+    createLabelsForBox(warnPromptBox,warning);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    yesWid = XtCreateManagedWidget("Yes",  commandWidgetClass,
+				  warnFormWid, args, nargs);
+    XtAddCallback(yesWid, XtNcallback, warningYesCallback, (XtPointer) warningWid);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, yesWid); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    noWid = XtCreateManagedWidget("No",  commandWidgetClass,
+				  warnFormWid, args, nargs);
+    XtAddCallback(noWid, XtNcallback, warningNoCallback, (XtPointer) warningWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, noWid);           nargs++;
+    XtSetArg(args[nargs], XtNfromVert, warnPromptBox); nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      warnFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback, 
+		  warningCancelCallback, (XtPointer) warningWid);
+
+    /* This is a modal dialogue */
+    LeaveAllowed = -1;
+    XtPopup(warningWid, XtGrabNonexclusive);
+
+    while (LeaveAllowed==-1)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(parentWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+
+    return LeaveAllowed;
+}
+
+
+
+static void editNextCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Next Problem button has been pushed
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    if (editorState == StateDown) return;
+
+    findNextProblem(xx);
+}
+
+
+static void editRevealCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Next Problem button has been pushed
+*/
+{
+    EdStruct *xx;
+    Arg args[10];
+    int nargs;
+    Boolean state;
+    int i;
+
+    if (editorState == StateDown) return;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, &state); nargs++;
+    XtGetValues(w, args, nargs);
+
+    for (i=0;i<editorMode;i++) {
+	xx = intToEdStruct(i);
+	xx->reveal_cutoffs=state;
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+    }
+}
+
+static void editLockCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Editor has been locked
+*/
+{
+    EdStruct *xx0 = intToEdStruct(0);
+    EdStruct *xx1 = intToEdStruct(1);
+
+    /*
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+    XtGetValues(lockWid, args, nargs);
+
+    if (currMode)
+    */
+	lockOffset = xx1->displayPos - xx0->displayPos;
+
+    redisplaySequences (xx0,xx0->namesWid, xx0->sequencesWid, xx0->displayPos, xx0->displayWidth);
+    redisplaySequences (xx1,xx1->namesWid, xx1->sequencesWid, xx1->displayPos, xx1->displayWidth);
+}
+
+static void editUndoCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Undo last command request made
+*/
+{
+    undoLastCommand();
+}
+
+static void editExitCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Request made to leave the editor
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    if (editorState == StateDown) return;
+
+    switch (editorMode) {
+	case EDITMODE:
+            if (editsMade(xx)) {
+                if (warnNotSaved(w,"Save changes?"))
+                {
+                    XtPopdown(editorShellWid);
+                    editorState = StateDown;
+                    semaphoreRelease(activeLock);
+                }
+	    } else {
+                XtPopdown(editorShellWid);
+                editorState = StateDown;
+                semaphoreRelease(activeLock);
+	    }
+	    break;
+        case JOINMODE: {
+	    int overlapLength, wingeCount;
+	    char warning[200];
+
+	    countDisagreements(&overlapLength, &wingeCount);
+	    if (overlapLength<=0) {
+		sprintf(warning,
+		    "Contigs do not overlap\nPerform Join?\n");
+	    } else {
+		sprintf(warning,
+		    "Percentage Mismatch:\n   %5.2f%%\nPerform Join?\n",
+		    (float)(100*wingeCount)/(float)overlapLength);
+	    }
+
+            if (warnNotSaved(w,warning))
+            {
+                XtPopdown(editorShellWid);
+                editorState = StateDown;
+                semaphoreRelease(activeLock);
+            }
+            break;
+	}
+        default:
+            XtPopdown(editorShellWid);
+            editorState = StateDown;
+            semaphoreRelease(activeLock);
+    }
+    /*
+    ** We must shut down the search window as well
+    */
+    if (editorState == StateDown) {
+	destroySearchWindow();
+    }
+}
+
+static void sliderScrollCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** left or right mouse button has caused the
+** slider to move
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(wid));
+    int position = (int) call_data;
+
+    float     topOfThumb;
+    int nargs;
+    Arg args[10];
+
+    if (editorState == StateDown) return;
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtGetValues(wid, args, nargs);
+
+    /* Move the thumb by one screenful in the appropriate direction */
+    if (position>0)
+	incDisplayPos(xx,D_screen);
+    else
+	decDisplayPos(xx,D_screen);
+}
+
+static void leftleftCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump left a long way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    decDisplayPos (xx,D_halfScreen);
+}
+
+static void leftCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump left a short way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    decDisplayPos (xx,D_character);
+}
+
+static void rightCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump right a short way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    incDisplayPos (xx,D_character);
+}
+
+static void rightrightCallback(Widget wid,
+                              XtPointer client_data, XtPointer call_data)
+/*
+** Jump right a long way
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(wid)));
+    if (editorState == StateDown) return;
+
+    incDisplayPos (xx,D_halfScreen);
+}
+
+
+static void sliderJumpCallback(Widget wid,
+                            XtPointer client_data, XtPointer call_data)
+/*
+** Middle mouse button used to reposition scroll bar
+*/
+{
+    float percent = *((float *) call_data);
+    EdStruct *xx = widgetToEdStruct(XtParent(wid));
+
+    if (editorState == StateDown) return;
+
+    setDisplayPosPercent(xx,percent);
+}
+
+static void getFontDetails(int *width, int *height)
+/*
+** Get the font width and height
+** for the contig editor window
+** ASSUME a single font is used!!
+*/
+{
+    Arg args[10];
+    int nargs;
+    XFontStruct *font;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfont, &font); nargs++;
+    XtGetValues(labelWid, args, nargs);
+
+    *width = font->max_bounds.width;
+    *height = font->max_bounds.ascent + font->max_bounds.descent;
+}
+
+static void setInitialStates(int reveal_state)
+/*
+** Set the initial states of togglewidgets
+*/
+{
+    Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, False); nargs++;
+    XtSetValues(lockWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, reveal_state); nargs++;
+    XtSetValues(revealWid, args, nargs);
+
+}
+
+static void tweakGeometry1()
+/*
+** Some geometry tweaking must be done before widgets are realised
+*/
+{
+    EdStruct *xx;
+
+    xx = intToEdStruct(1);
+    if (editorMode==EDITMODE) {
+	XtManageChild(nextWid);
+	XtUnmanageChild(lockWid);
+#ifdef nana
+	XtUnmanageChild(joinWid);
+#endif
+	XtUnmanageChild(disagreeFormWid);
+	XtUnmanageChild(xx->edWid);
+	XawFormDoLayout(mainFormWid,True);
+    } else {
+	XtUnmanageChild(nextWid);
+	XtManageChild(lockWid);
+#ifdef nana
+	XtManageChild(joinWid);
+#endif
+	XtManageChild(disagreeFormWid);
+	XtManageChild(xx->edWid);
+	XawFormDoLayout(mainFormWid,True);
+    }
+    
+}
+
+static void tweakGeometry2()
+/*
+** Some geometry tweaking must be done after widgets are realised
+*/
+{
+    int i;
+    Arg args[10];
+    int nargs;
+
+    EdStruct *xx;
+
+    /*
+    ** scroll button wids
+    */
+    for (i=0;i<editorMode;i++) {
+	xx = intToEdStruct(i);
+	nargs = 0;
+	XtSetArg(args[nargs], XtNwidth, xx->fontWidth*NAMELEN+10); nargs++;
+	XtSetValues(xx->scrollButtonsWid, args, nargs);
+
+	/*
+	Dimension width;
+	xx = intToEdStruct(i);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, &width); nargs++;
+	XtGetValues(xx->namesWid, args, nargs);
+	fprintf(stderr,"width.a = %d\n",(int)width);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, width); nargs++;
+        XtSetValues(xx->scrollButtonsWid, args, nargs);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, &width); nargs++;
+	XtGetValues(xx->sequencesWid, args, nargs);
+	fprintf(stderr,"width.b = %d\n",(int)width);
+        nargs = 0;
+        XtSetArg(args[nargs], XtNwidth, width); nargs++;
+        XtSetValues(xx->sliderWid, args, nargs);
+	*/
+    }
+
+
+}
+
+
+/* ---- Exported functions ---- */
+
+
+void jxedit_(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *idevt,   /* unit number for tag information */
+	int_f *idevc,   /* unit number for comment file */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *lnconl,	/* left contig to join */
+	int_f *llinol,	/* left-most gel in contig of left contig */
+	int_f *lnconr,	/* right contig for join */
+	int_f *llinor,	/* left-most gel in contig of right contig */
+	int_f *igell,   /* left gel specified on entry */
+	int_f *igellpos,/* position in left-most gel */
+	int_f *igelr,   /* right gel specified on entry */
+	int_f *igelrpos,/* position in left-most gel */
+	int_f *perced,	/* cutoff for consensus calculation */
+	int_f *ngels,	/* number of gels in database */
+	int_f *nconts,	/* number of contigs in database */
+	int_f *idm,     /* database type */
+	int_f *rcstate, /* reveal cutoff state */
+	int_f *iok      /* returns 0-saved (1) | 1-saved (2) | joined (4) */
+	)
+/*
+** Interface to FORTRAN for Join editor
+*/
+{
+    EdStruct *xx[2];
+    int reveal_state;
+    int i;
+
+    semaphoreGrab(activeLock);
+    editorState = StateUp;
+    editorMode = JOINMODE;
+    save_state = (int_f)0;
+
+    reveal_state = (*rcstate != 0);
+    setInitialStates(reveal_state);
+
+    /*
+    ** Save arguments for later use
+    */
+    saveState.idevr  = idevr;
+    saveState.idevw  = idevw;
+    saveState.idevn  = idevn;
+    saveState.idevt  = idevt;
+    saveState.idevc  = idevc;
+    saveState.relpg  = relpg;
+    saveState.lngthg = lngthg;
+    saveState.lnbr   = lnbr;
+    saveState.rnbr   = rnbr;
+    saveState.maxgel = maxgel;
+    saveState.idbsiz = idbsiz;
+    saveState.llinol = llinol;
+    saveState.lnconl = lnconl;
+    saveState.llinor = llinor;
+    saveState.lnconr = lnconr;
+    saveState.perced = perced;
+    saveState.ngels  = ngels;
+    saveState.nconts = nconts;
+    saveState.idm    = idm;
+    pcCut = * (float *) perced;
+
+    for (i=0; i<2; i++) {
+	int_f *llino = (i==0)?llinol:llinor;
+
+        /*
+        ** Set up data structures
+        */
+        xx[i] = intToEdStruct(i);
+	xx[i]->reveal_cutoffs = reveal_state;
+        if (initialiseDB(xx[i],idevr,idevw,idevn,relpg,lngthg,lnbr,rnbr,maxgel,idbsiz,llino)) {
+	    if (i==1) freeDB(xx[0]);
+	    return;
+        }
+
+        /*
+        ** Set up display
+        */
+        if (createEdDisplay(xx[i],
+			    xx[i]->namesWid,
+			    xx[i]->sequencesWid,
+			    (i)?*igelr:*igell,
+			    (i)?*igelrpos:*igellpos)) {
+	    freeDB(xx[i]);
+	    if (i==1) freeDB(xx[0]);
+	    return;
+        }
+
+    }
+
+    tweakGeometry1();
+    XtPopup(editorShellWid,   XtGrabNone);
+    tweakGeometry2();
+
+    while (editorState != StateDown)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(xx[0]->edWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    for (i=0;i<2;i++) {
+	disown_selection(xx[i]);
+        freeDB(xx[i]);
+        cleanUpAllStacks();
+    }
+
+    *iok = save_state;
+}
+
+void cxedit_(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *idevt,   /* unit number for tag information */
+	int_f *idevc,   /* unit number for comment file */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *lincon,	/* current contig to edit */
+	int_f *llino,	/* left-most gel in contig */
+	int_f *igelno,  /* gel number specified on entry */
+	int_f *igelpos, /* position in gel */
+	int_f *perced,	/* cutoff for consensus calculation */
+	int_f *idm,     /* database type */
+	int_f *rcstate, /* reveal cutoff state */
+	int_f *iok      /* returns 0-saved (1) | 1-saved (2) | joined (4) */
+	)
+/*
+** Interface to FORTRAN for Contig Editor
+*/
+{
+
+    int reveal_state;
+
+    EdStruct *xx = intToEdStruct(0);
+    semaphoreGrab(activeLock);
+    editorState = StateUp;
+    editorMode = EDITMODE;
+    save_state = (int_f)0;
+
+    reveal_state = (*rcstate != 0);
+    setInitialStates(reveal_state);
+
+    /*
+    ** Save arguments for later use
+    */
+    saveState.idevr  = idevr;
+    saveState.idevw  = idevw;
+    saveState.idevn  = idevn;
+    saveState.idevt  = idevt;
+    saveState.idevc  = idevc;
+    saveState.relpg  = relpg;
+    saveState.lngthg = lngthg;
+    saveState.lnbr   = lnbr;
+    saveState.rnbr   = rnbr;
+    saveState.maxgel = maxgel;
+    saveState.idbsiz = idbsiz;
+    saveState.lincon = lincon;
+    saveState.llino  = llino;
+    saveState.perced = perced;
+    saveState.idm    = idm;
+
+    /*
+    ** Set up data structures
+    */
+    xx->reveal_cutoffs = reveal_state;
+    if (initialiseDB(xx,idevr,idevw,idevn,relpg,lngthg,lnbr,rnbr,maxgel,idbsiz,llino))
+	return;
+
+    /*
+    ** Set up display
+    */
+    pcCut = * (float *) perced;
+    if (createEdDisplay(xx,xx->namesWid,xx->sequencesWid,*igelno,*igelpos)) {
+	freeDB(xx);
+	return;
+    }
+
+    tweakGeometry1();
+    XtPopup(editorShellWid,   XtGrabNone);
+    tweakGeometry2();
+
+    while (editorState != StateDown)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(xx->namesWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    disown_selection(xx);
+    freeDB(xx);
+    cleanUpAllStacks();
+
+    *iok = save_state;
+}
+
+Widget createEditWid(Widget parentWid, Widget fromVertWid, int closeToVert)
+/*
+** Create an form with all the trappings of an editor window
+*/
+{
+    EdStruct *xx;
+    Widget llWid, lWid, rWid, rrWid;
+    Arg args[10];
+    int nargs;
+
+    xx = getFreeEdStruct();
+    if (xx == NULL) return NULL;
+
+    getFontDetails(&xx->fontWidth,&xx->fontHeight);
+
+    /*
+    ** create main form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  fromVertWid); nargs++;
+    if (closeToVert) {
+        XtSetArg(args[nargs], XtNvertDistance,  0); nargs++;
+    }
+    xx->edWid = XtCreateManagedWidget("edForm",  formWidgetClass,
+                                  parentWid, args, nargs);
+
+    /*
+    ** Scrollers
+    */
+    nargs = 0;
+    xx->scrollButtonsWid = XtCreateManagedWidget("scrollButtons", formWidgetClass,
+                                   xx->edWid, args, nargs);
+    nargs = 0;
+    llWid = XtCreateManagedWidget("leftleft",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(llWid, XtNcallback, leftleftCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  llWid); nargs++;
+    lWid = XtCreateManagedWidget("left",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(lWid, XtNcallback, leftCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  lWid); nargs++;
+    rWid = XtCreateManagedWidget("right",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(rWid, XtNcallback, rightCallback, NULL );
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  rWid); nargs++;
+    rrWid = XtCreateManagedWidget("rightright",  commandWidgetClass,
+                                  xx->scrollButtonsWid, args, nargs);
+    XtAddCallback(rrWid, XtNcallback, rightrightCallback, NULL );
+
+
+
+    /*
+    ** A Slider
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz,  xx->scrollButtonsWid); nargs++;
+    XtSetArg(args[nargs], XtNhorizDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation,  XtorientHorizontal); nargs++;
+    XtSetArg(args[nargs], XtNheight, xx->fontHeight+4); nargs++;
+    XtSetArg(args[nargs], XtNwidth, xx->fontWidth*(xx->displayWidth)+8); nargs++;
+
+    xx->sliderWid = XtCreateManagedWidget("slider", scrollbarWidgetClass,
+                                   xx->edWid, args, nargs);
+    XtAddCallback(xx->sliderWid, XtNjumpProc,   sliderJumpCallback, NULL);
+    XtAddCallback(xx->sliderWid, XtNscrollProc, sliderScrollCallback, NULL);
+
+
+
+
+    /*
+    ** ... and a text widget or two or three
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, xx->sliderWid); nargs++;
+    XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, NAMELEN); nargs++;
+    xx->namesWid = XtCreateManagedWidget("names", sheetWidgetClass,
+                             xx->edWid, args, nargs);
+    XtUninstallTranslations(xx->namesWid);
+    /* Add string to function bindings for our application actions */
+    XtAppAddActions(XtWidgetToApplicationContext(xx->edWid),
+                    actionTable2, XtNumber(actionTable2));
+    parsedTTable2 = XtParseTranslationTable(translationTable2);
+    XtUninstallTranslations(xx->namesWid);
+    XtAugmentTranslations(xx->namesWid,parsedTTable2);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, xx->sliderWid); nargs++;
+    XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, xx->namesWid); nargs++;
+    XtSetArg(args[nargs], XtNhorizDistance, 0); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, DEFAULT_DISPLAY_WIDTH); nargs++;
+    xx->sequencesWid = XtCreateManagedWidget("sequences", sheetWidgetClass,
+                             xx->edWid, args, nargs);
+
+    /* Add string to function bindings for our application actions */
+    XtAppAddActions(XtWidgetToApplicationContext(xx->edWid),
+                    actionTable, XtNumber(actionTable));
+    parsedTTable = XtParseTranslationTable(translationTable);
+    XtUninstallTranslations(xx->sequencesWid);
+
+    XtAugmentTranslations(xx->sequencesWid,parsedTTable);
+
+    /*
+    ** Pop-up menu
+    */
+    createEdMenu(xx->sequencesWid);
+
+    return xx->edWid;
+}
+
+Widget createDisagreeWid(Widget parentWid, Widget fromVertWid, int closeToVert)
+/*
+** Create a form widget with all the trappings of a
+** disagreement window
+*/
+{
+    Arg args[10];
+    int nargs;
+    int fontWidth, fontHeight;
+
+    getFontDetails(&fontWidth, &fontHeight);
+
+    /*
+    ** create main form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  fromVertWid); nargs++;
+    if (closeToVert) {
+        XtSetArg(args[nargs], XtNvertDistance, 0); nargs++;
+    }
+    disagreeFormWid = XtCreateManagedWidget("disagreeForm",  formWidgetClass,
+                                  parentWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, 1); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, NAMELEN); nargs++;
+    dummyWid = XtCreateManagedWidget("dummy", sheetWidgetClass,
+                             disagreeFormWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, 1); nargs++;
+    XtSetArg(args[nargs], XtNcolumns, DEFAULT_DISPLAY_WIDTH); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, dummyWid); nargs++;
+    disagreeWid = XtCreateManagedWidget("disagreements", sheetWidgetClass,
+                             disagreeFormWid, args, nargs);
+
+    XtUninstallTranslations(dummyWid);
+    XtUninstallTranslations(disagreeWid);
+
+    return disagreeFormWid;
+}
+
+
+Widget CreateEditorShell(Widget parentWid)
+/*
+** Create all components of the
+**	Contig Editor
+** and
+**	Join Editor
+*/
+{
+    Arg args[10];
+    int nargs;
+    Widget ed1Wid;
+    Widget ed2Wid;
+    Widget dw;
+
+    /*
+    ** Create the contig editor shell
+    */
+    editorShellWid = XtCreatePopupShell ("Editor",
+				      topLevelShellWidgetClass,
+				      parentWid,
+				      NULL, (Cardinal) 0);
+
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+					editorShellWid,
+                                        NULL, (Cardinal) 0);
+
+
+    /*
+    **  The form widget holds a label
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, exitWid); nargs++;
+    labelWid = XtCreateManagedWidget("title", labelWidgetClass,
+                                     mainFormWid, args, nargs);
+
+
+    /*
+    ** Button box for contig/join editor
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    buttonsWid = XtCreateManagedWidget("buttons", boxWidgetClass,
+                                   mainFormWid, args, nargs);
+
+    /*
+    ** Toggle widgets for insert/replace edit modes
+    */
+    nargs = 0;
+    insertWid = XtCreateManagedWidget("insert", toggleWidgetClass,
+                                   buttonsWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, insertWid); nargs++;
+    replaceWid = XtCreateManagedWidget("replace", toggleWidgetClass,
+                                   buttonsWid, args, nargs);
+
+    /*
+    ** Superman toggle widget (for super-duper editing)
+    */
+    nargs = 0;
+    supermanWid = XtCreateManagedWidget("superman",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+
+    /*
+    ** Reveal cutoffs
+    */
+    nargs = 0;
+    revealWid = XtCreateManagedWidget("reveal",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(revealWid, XtNcallback, editRevealCallback, NULL );
+
+    /*
+    ** Undo command button
+    */
+    nargs = 0;
+    undoWid = XtCreateManagedWidget("undo",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(undoWid, XtNcallback, editUndoCallback, NULL );
+
+    /*
+    ** Next (problem) command button
+    */
+    nargs = 0;
+    nextWid = XtCreateManagedWidget("next",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(nextWid, XtNcallback, editNextCallback, NULL );
+
+#ifdef nana
+    /*
+    ** Save command button
+    */
+    nargs = 0;
+    saveWid = XtCreateManagedWidget("save",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(saveWid, XtNcallback, editSaveCallback, NULL );
+#endif
+
+    /*
+    ** Lock command button
+    */
+    nargs = 0;
+    lockWid = XtCreateManagedWidget("lock",  toggleWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(lockWid, XtNcallback, editLockCallback, NULL );
+
+#ifdef nana
+    /*
+    ** Join command button
+    */
+    nargs = 0;
+    joinWid = XtCreateManagedWidget("join",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(joinWid, XtNcallback, editJoinCallback, NULL );
+#endif
+
+    /*
+    ** and an exit button
+    */
+    nargs = 0;
+    exitWid = XtCreateManagedWidget("exit",  commandWidgetClass,
+                                  buttonsWid, args, nargs);
+    XtAddCallback(exitWid, XtNcallback, editExitCallback, NULL );
+
+    /*
+    ** create editor
+    */
+    ed1Wid = createEditWid(mainFormWid,buttonsWid,0);
+    dw = createDisagreeWid(mainFormWid,ed1Wid,1);
+    ed2Wid = createEditWid(mainFormWid,dw,1);
+
+    /*
+    ** Create tag editor
+    */
+    (void) createTagEditor(parentWid);
+
+
+    /*
+    ** Create search diologue
+    */
+    (void) createSearchWidget(parentWid);
+
+    return editorShellWid;
+
+}
+
+
+int editModeIsInsert()
+/*
+** Find out if editor is in insert mode rather than replace mode
+*/
+{
+    String currMode;
+
+    currMode = (String) XawToggleGetCurrent(insertWid);
+    if (currMode == NULL)
+	return 0;
+    else 
+        return (strcmp (currMode,"insert") == 0);
+}
+
+int editModeIsSuperman()
+/*
+** Find out if we are entitled to use privilaged edit commands
+*/
+{
+    Boolean currMode;
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+    XtGetValues(supermanWid, args, nargs);
+
+    return currMode;
+}
+
+int inJoinMode()
+/*
+** Find out if the editor is in join mode
+*/
+{
+    return editorMode==JOINMODE;
+}
+
+int editorLocked()
+/*
+** Find out if the editor is locked (and in join mode of course)
+*/
+{
+    Boolean currMode;
+    int nargs;
+    Arg args[10];
+
+    if (inJoinMode()) {
+        nargs=0;
+        XtSetArg(args[nargs], XtNstate, &currMode); nargs++;
+        XtGetValues(lockWid, args, nargs);
+
+        return (currMode);
+    } else
+	return 0;
+}
+
+int editorLockedPos(int force)
+/*
+** Find out the locked position
+*/
+{
+    if (force) {
+        EdStruct *xx0 = intToEdStruct(0);
+        EdStruct *xx1 = intToEdStruct(1);
+
+        return xx1->displayPos - xx0->displayPos;
+    } else
+        return (lockOffset);
+}
diff --git a/src/staden/contigEditor.h b/src/staden/contigEditor.h
new file mode 100644
index 0000000..77b5668
--- /dev/null
+++ b/src/staden/contigEditor.h
@@ -0,0 +1,47 @@
+#ifndef _contigEditor_h
+#define _contigEditor_h
+#include <X11/Intrinsic.h>
+#include "fort.h"
+extern Widget disagreeWid;
+extern float  pcCut;           /* cut off used for consensus */
+
+typedef struct _def_colours {
+    Pixel lightGrey;
+} DefColours;
+extern DefColours defColours;
+
+#define EDITMODE 1
+#define JOINMODE 2
+typedef struct {
+        int_f *idevr;    /* unit number for relationships */
+        int_f *idevw;    /* unit number for working versions of sequences */
+        int_f *idevn;    /* unit number for sequence names */
+	int_f *idevt;    /* unit number for tag information */
+	int_f *idevc;    /* unit number for comment file */
+        int_f *relpg;    /* relative positions of gels in sequences */
+        int_f *lngthg;   /* lengths of sequences */
+        int_f *lnbr;     /* left neighbours */
+        int_f *rnbr;     /* right neighbours */
+        int_f *maxgel;   /* maximum length of gel */
+        int_f *idbsiz;   /* size of database */
+        int_f *lincon;   /* current contig to edit */
+        int_f *llino;    /* left-most gel in contig */
+        int_f *lnconl;   /* left contig for join */
+        int_f *llinol;   /* left-most gel in left contig for join */
+        int_f *lnconr;   /* right contig fro join */
+        int_f *llinor;   /* left-most gel in right contig for join */
+        int_f *perced;   /* cutoff for consensus calculation */
+        int_f *ngels;    /* number of gels in database */
+        int_f *nconts;   /* number of contigs in database */
+	int_f *idm;      /* database type */
+    } SaveStruct;
+extern SaveStruct saveState;
+
+extern int editorMode;
+
+extern int editModeIsInsert();
+extern int editModeIsSuperman();
+extern int inJoinMode();
+extern int editorLocked();
+extern int editorLockedPos(int force);
+#endif  /* _contigEditor_h */
diff --git a/src/staden/copyright.h b/src/staden/copyright.h
new file mode 100644
index 0000000..e69de29
diff --git a/src/staden/cvt.c b/src/staden/cvt.c
new file mode 100644
index 0000000..c80b7f3
--- /dev/null
+++ b/src/staden/cvt.c
@@ -0,0 +1,292 @@
+#include <ctype.h>
+#include <string.h>
+#include <stdio.h>
+#include <sys/types.h>
+#include <sys/stat.h>
+/*#include <sys/unistd.h>*/
+#include "fort.h"
+
+#define l_len 100
+#define f_len 200
+
+#define MAXDB 1000
+
+struct _ar_rec {
+    int_f idbsiz;
+    int_f maxgel;
+    int_f idm;
+} AR_rec;
+
+struct _rl_rec {
+    int_f ngels;
+    int_f nconts;
+    int_f dum1;
+    int_f dum2;
+} RL_rec;
+
+struct _tg_rec {
+    int_f pos; /* and count */
+    int_f len;
+    int_f com;
+    int_f type;
+    int_f next;
+} TG_rec;
+
+#define COMMENT_LENGTH 40
+struct _cc_rec {
+    int_f next;
+    char comment[COMMENT_LENGTH];
+} CC_rec;
+
+struct _cch_rec{
+    int_f next;
+    int_f count;
+    char comment[COMMENT_LENGTH-sizeof(int_f)];
+} CCH_rec;
+
+struct _rd_rec{
+    int_f len;
+    int_f lcut;
+    int_f wlen;
+    char type[4];
+    char name[12];
+} RD_rec;
+
+int main()
+{
+  char projectName[l_len];
+  char versionNumber[l_len];
+
+  fprintf(stdout,"Database conversion program\n");
+  fprintf(stdout,"Converts *.RD? file to *.TG? and *.CC? files\n\n");
+
+  fprintf(stdout,"Project name ? ");
+  gets(projectName);
+
+  fprintf(stdout,"Version ? ");
+  gets(versionNumber);
+
+  if (process(projectName,versionNumber))
+    fprintf(stdout,"Error: conversion aborted.\n");
+  else
+    fprintf(stdout,"Conversion completed.\n");
+
+  return 0;
+}
+
+int read_ar(char *AR,int_f *idbsiz)
+{
+
+    FILE *AR_fp;
+
+    /*
+    ** Check AR file exists
+    */
+    if ((AR_fp=fopen(AR,"rb"))==NULL) {
+	fprintf(stderr,"Cannot open file %s\n",AR);
+	return 1;
+    }
+
+    /*
+    ** Read details from AR file
+    */
+    fseek(AR_fp,(off_t)((MAXDB-1)*sizeof(AR_rec)),/*SEEK_SET*/0);
+    fread(&AR_rec,sizeof(AR_rec),1,AR_fp);
+
+    if (ferror(AR_fp)) {
+	fprintf(stderr,"Cannot read file %s\n",AR);
+	return 1;
+    }
+
+    fclose(AR_fp);
+
+    *idbsiz = AR_rec.idbsiz;
+    return 0;
+}
+
+int read_rl(char *RL,int_f idbsiz,     int_f * ngels)
+{
+
+    FILE *RL_fp;
+
+    /*
+    ** Check RL file exists
+    */
+    if ((RL_fp=fopen(RL,"rb"))==NULL) {
+	fprintf(stderr,"Cannot open file %s\n",RL);
+	return 1;
+    }
+
+    /*
+    ** Read details from RL file
+    */
+    fseek(RL_fp,(off_t)((AR_rec.idbsiz-1)*sizeof(RL_rec)),/*SEEK_SET*/0);
+    fread(&RL_rec,sizeof(RL_rec),1,RL_fp);
+
+    if (ferror(RL_fp)) {
+	fprintf(stderr,"Cannot read file %s\n",RL);
+	return 1;
+    }
+
+    fclose(RL_fp);
+
+    *ngels = RL_rec.ngels;
+    return 0;
+}
+
+void write_tg(FILE *fp,int_f rec, int_f pos, int_f len, int_f com, int_f type, int_f next)
+{
+    TG_rec.pos = pos;
+    TG_rec.len = len;
+    TG_rec.com = com;
+    TG_rec.type = type;
+    TG_rec.next = next;
+
+    fseek(fp,(off_t)((rec-1)*sizeof(TG_rec)),/*SEEK_SET*/0);
+    fwrite(&TG_rec,sizeof(TG_rec),1,fp);
+}
+
+void write_cc(FILE *fp,int_f rec, int_f next, char *comment)
+{
+    CC_rec.next = next;
+    strncpy(CC_rec.comment,comment,COMMENT_LENGTH);
+
+    fseek(fp,(off_t)((rec-1)*sizeof(CC_rec)),/*SEEK_SET*/0);
+    fwrite(&CC_rec,sizeof(CC_rec),1,fp);
+}
+
+void write_cc_head(FILE *fp,int_f next, int_f count )
+{
+    CCH_rec.next = next;
+    CCH_rec.count = count;
+
+    fseek(fp,(off_t)0,/*SEEK_SET*/0);
+    fwrite(&CCH_rec,sizeof(CC_rec),1,fp);
+}
+
+void read_rd(FILE *fp, int_f rec, int_f *len, int_f *lcut, int_f *wlen, char *type,
+char *name)
+{
+    fseek(fp,(off_t)((rec-1)*sizeof(RD_rec)),/*SEEK_SET*/0);
+    fread(&RD_rec,sizeof(RD_rec),1,fp);
+
+    *len = RD_rec.len;
+    *lcut = RD_rec.lcut;
+    *wlen = RD_rec.wlen;
+    strncpy(type,RD_rec.type,4);
+    strncpy(name,RD_rec.name,12);
+
+}
+
+int process(char *name,char *vers)
+{
+    char AR[f_len];
+    char RD[f_len];
+    char CC[f_len];
+    char TG[f_len];
+    char RL[f_len];
+
+    FILE *RD_fp;
+    FILE *CC_fp;
+    FILE *TG_fp;
+
+    struct stat statBuff;
+
+    int_f IDBSIZ;
+    int_f NGELS;
+
+    if (!*vers) strcpy(vers,"0");
+
+    /* convert bits to upper case */
+    {
+        char *s;
+        for (s=name; *s = islower(*s)?toupper(*s):*s ;s++);
+	for (s=vers; *s = islower(*s)?toupper(*s):*s ;s++);
+    }
+
+    /*
+    ** create file names
+    */
+    strcpy(RD,name); strcat(RD,".RD"); strncat(RD,vers,1);
+    strcpy(TG,name); strcat(TG,".TG"); strncat(TG,vers,1);
+    strcpy(CC,name); strcat(CC,".CC"); strncat(CC,vers,1);
+
+    /*
+    ** Get AR details
+    */
+    strcpy(AR,name); strcat(AR,".AR"); strncat(AR,vers,1);
+    if (read_ar(AR,&IDBSIZ)) return 1;
+
+    /*
+    ** Get RL details
+    */
+    strcpy(RL,name); strcat(RL,".RL"); strncat(RL,vers,1);
+    if (read_rl(RL,IDBSIZ,   &NGELS)) return 1;
+
+    /*
+    ** open files TG and CC
+    */
+    if ( stat(TG,&statBuff) >= 0 ) {
+	fprintf(stderr,"%s already exists\n",TG);
+	return 1;
+    }
+
+    if ( stat(CC,&statBuff) >= 0 ) {
+	fprintf(stderr,"%s already exists\n",CC);
+	return 1;
+    }
+
+    if ((TG_fp=fopen(TG,"wb"))==NULL) {
+	 fprintf(stderr,"cannot open %s for writing\n",TG);
+	 return 1;
+    }
+
+    if ((CC_fp=fopen(CC,"wb"))==NULL) {
+	 fprintf(stderr,"cannot open %s for writing\n",CC);
+	 fclose(TG_fp);
+	 return 1;
+    }
+
+    /*
+    ** Check RD file exists
+    */
+    if ((RD_fp=fopen(RD,"rb"))==NULL) {
+	/*
+	** None:
+	** Create anyway
+	*/
+	write_tg(TG_fp,IDBSIZ/*rec*/,IDBSIZ,0,0,0,0);
+	write_cc_head(CC_fp,0/*next*/,1/*count*/);
+
+    } else {
+	/*
+	** Do the hard graft
+	*/
+	int i;
+
+	write_tg(TG_fp,IDBSIZ/*rec*/,IDBSIZ,0,0,0,0);
+	write_cc_head(CC_fp,0/*next*/,NGELS+1/*count*/);
+
+	for (i=1;i<=NGELS;i++) {
+	    int_f len;
+	    int_f lcut;
+	    int_f wlen;
+	    char type[5];
+	    char name[13];
+	    char comment[COMMENT_LENGTH];
+
+	    read_rd(RD_fp,i,&len,&lcut,&wlen,type,name);
+	    type[4] = '\0';
+	    name[12] = '\0';
+
+	    sprintf(comment,"%6d%6d%6d%-4s%-18s",len,lcut,wlen,type,name);
+
+	    write_tg(TG_fp,i,0,0,i+1,0,0);
+	    write_cc(CC_fp,i+1,0,comment);
+	}
+    }
+
+    fclose(TG_fp);
+    fclose(CC_fp);
+    return 0;
+}
diff --git a/src/staden/dap.f b/src/staden/dap.f
new file mode 100644
index 0000000..dc028c6
--- /dev/null
+++ b/src/staden/dap.f
@@ -0,0 +1,656 @@
+C     SAP (Sequence assembly program)
+C     AUTHOR RODGER STADEN
+C     9-1-90  sent idev instead of kbout to join and enter
+C     16-3-90 removed escape after bpause
+C     2-5-90 Many changes related to introducing graphics
+C     4-5-90 Allowed escape from getreg
+C     9-5-90 Introduced a default gel reading
+C    23-8-90 Added calls to SHOWFU
+C    28-11-90 See dbsys89
+C    3-12-90 Add invocation of contig editor
+C    25-02-91 tag and comment files
+C    28-2-91 changed maxsav from 1000 to maxglm
+C    30-4-91 changes help references to DAP*
+C    28-7-91 added extra parameter for quality calc: maxgood, only the parts
+C    of reads upto this length are used
+C    21-8-91 Added an option to find internal joins: AUTOJ
+C    27-8-91 Added buffer flushing for database files
+C    1-11-91 Upped MAXSEQ to 150000
+      SUBROUTINE FMAIN()
+      INTEGER CHRSIZ,ECHRSZ
+      PARAMETER (
+     +           NAMLEN = 40,
+     +           MAXDEV = 12,
+     +           MAXSEQ=150000,
+     +           MAXGLM=4096,
+     +           MAXGL2=MAXGLM*2,
+     +           MINMEN=-3,
+     +           CHRSIZ=4,
+     +           ECHRSZ = 6,
+     +           IDT1=ECHRSZ*MAXGL2,
+     +           IDT2=2*CHRSIZ*MAXGL2,
+     +           IDT3=2*MAXGL2,
+     +           IDT4=2*ECHRSZ*MAXGL2,
+     +           MAXDB=1000)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      INTEGER RELPG(MAXDB),LREG,RREG,ANS
+      INTEGER TEMP1(IDT1),TEMP2(IDT2),TEMP3(IDT4)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILNAM,NAMPRO,FILE,NAMARC,FILMAR
+      CHARACTER*(NAMLEN) SHONAM
+      CHARACTER GEL(MAXGLM)
+      CHARACTER GEL2(MAXGLM),GEL3(MAXGLM),GEL4(MAXGLM)
+      PARAMETER (BOTOPT=0,TOPOPT=35)
+      PARAMETER (HELPF='DAPHELP')
+      PARAMETER (POINTF='DAPHPNT')
+      PARAMETER (FILMAR='DAPMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARGB(TOPOPT),MARGT(TOPOPT)
+      CHARACTER SEQ1(MAXSEQ)
+C     DBAUTO TEMP3A = TEMP1, SEQ3A = GEL3, SEQ2A = GEL2
+C     MATCHA = GEL, SEQ4A = GEL4
+      PARAMETER (LENGTH = 6,
+     +           MAXCON = 500,
+     +           MAXSAV = MAXGLM,
+     +           LPOWRC = CHRSIZ**LENGTH)
+      INTEGER CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ5(MAXGLM),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      INTEGER WINDOW
+C     DBAUTO
+      EQUIVALENCE (TEMP2,TEMP1)
+      EQUIVALENCE (TEMP1,TEMP3)
+      COMMON /DEVILS/ IDEVT,IDEVC,IDBSIZ,RELPG
+      IDM = 5
+      CALL INITS
+      CALL INITLU(IDM)
+      CALL INTHLP('dap', TOPOPT)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(10))
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' DAP (Development assembly program) V5.1 Aug 1992',/,
+     +' Author: Rodger Staden')
+      IGORT = 0
+      IDEV = KBOUT
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +TOPOPT,DEVNOS(5),FILMAR)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(5),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+      IOPEN=1
+      LINLEN=50
+      PERCD=0.75
+      WINDOW = 25
+      MXGOOD = MAXGLM
+      IGWIND = 1000
+      FILE = ' '
+      MAXGEL = MAXGLM
+      IDEVT  = DEVNOS(11)
+      IDEVC  = DEVNOS(12)
+      IDBSIZ=MAXDB
+      IDBSIS=IDBSIZ
+      SHONAM = ' '
+      CALL SHOWFI(KBOUT,SHONAM)
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        IF(ANS.EQ.1)THEN
+C          CALL SHOWFU(KBOUT,'Start new database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +    IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          LLINO = 0
+          IF(IERR.EQ.0) THEN
+            IOPEN=0
+            SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+            CALL SHOWFI(KBOUT,SHONAM)
+          END IF
+          CALL INITLU(IDM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C        CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    IDEVT,IDEVC,MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      MENUN = 0
+10    CONTINUE
+C
+C Flush the database buffers, just in case!
+C
+      IF (IOPEN.EQ.0) THEN
+        CALL FLUSHL(DEVNOS(1))
+        CALL FLUSHL(DEVNOS(2))
+        CALL FLUSHL(DEVNOS(3))
+        CALL FLUSHL(IDEVT)
+        CALL FLUSHL(IDEVC)
+      END IF
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      CALL SHOWFU(KBOUT,' ')
+C      IF(IOK.NE.0) GO TO 9999
+C      CALL DBMENT(MENU,NOPT,TOPOPT,HELPS(0),HELPE(0),HELPF,DEVNOS(5),
+C     +KBIN,KBOUT)
+      CALL MENU(NOPT, KOPT, MENUN, TOPOPT, MINMEN, KBIN, KBOUT,
+     +     HELPS(0), HELPE(0), HELPF, DEVNOS(5))
+      IF(NOPT.EQ.3)THEN
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          CLOSE(UNIT=IDEVC)
+          CLOSE(UNIT=IDEVT)
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+C          CALL SHOWFU(KBOUT,'Start a database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +    IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          IF(IERR.EQ.0)IOPEN=0
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C          CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    IDEVT,IDEVC,MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      END IF
+C  STOP
+      IF(NOPT.EQ.2)THEN
+        IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBOUT)
+        GO TO 9999
+      END IF
+
+
+C   clear all
+C
+      IF(NOPT.EQ.10)THEN
+C        CALL SHOWFU(KBOUT,'Clear graphics')
+        CALL CLEARG
+        GO TO 10
+      END IF
+C   clear VT100
+C
+      IF(NOPT.EQ.11)THEN
+C        CALL SHOWFU(KBOUT,'Clear text')
+        CALL CLEARV
+        GO TO 10
+      END IF
+C
+C   xhairs
+C
+      IF(NOPT.EQ.13) THEN
+C          CALL SHOWFU(KBOUT,'Use xhair')
+          CALL XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,NCONTS,LLINO,LINCON,LREG,RREG,
+     +    WINDOW,IGWIND,LENCON,IDEPTH,JDEPTH,
+     +    MARGL,MARGR,MARGB,MARGT,TOPOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +    KBOUT,GEL,GEL2,DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM,
+     +    SEQ1,MAXSEQ,NGELS,TEMP3,ECHRSZ,MAXGL2,GEL3,GEL4,33,34,29,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MXGOOD)
+        GO TO 10
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(NOPT.EQ.14)THEN
+C        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),TOPOPT,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  ruler
+C
+      IF(NOPT.EQ.12) THEN
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+C          CALL SHOWFU(KBOUT,'Show ruler')
+          CALL RULER(LREG,RREG,MARGL,MARGR,
+     +    MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          GO TO 10
+        END IF
+      END IF
+C  labler
+      IF(NOPT.EQ.15)THEN
+C        CALL SHOWFU(KBOUT,'Label diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.16) THEN
+C        CALL SHOWFU(KBOUT,'Plot map')
+C  map
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL PLTMAP(DEVNOS(4),FILNAM,LENCON,MARGL,MARGR,MARGB(NOPT),
+     +    MARGT(NOPT),ISXMAX,ISYMAX,LREG,RREG,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        END IF
+      GO TO 10
+      END IF
+C      IF((NOPT.EQ.21).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Enter new gel reading')
+CC       ENTER
+C        CALL ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +  GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+C     +  DEVNOS(3),DEVNOS(4),LINLEN,PERCD,FILNAM,
+C     +  HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+C     +  MAXGEL,IDM,LLINO)
+C        GO TO 10
+C      END IF
+      IF((NOPT.EQ.25).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Show relationships')
+C       PRINT DB
+        CALL DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IDEV,
+     +  KBIN,KBOUT,DEVNOS(3),LLINO,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.5).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Display contig')
+C       DISPLAY
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+          IF(IERR.EQ.0)THEN
+          CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +    GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +    DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+200   CONTINUE
+C      IF((NOPT.EQ.-22).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Join contigs')
+CC       JOIN
+C        WRITE(KBOUT,1006)
+C1006    FORMAT(' Which contigs do you want to join ?')
+C        WRITE(KBOUT,1017)
+C1017    FORMAT( ' Left contig')
+C        LLINOL = 0
+C        LLINOR = 0
+C        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONL,LLINOL,
+C     +  IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+C        IF(IERR.EQ.0)THEN
+C          WRITE(KBOUT,1018)
+C1018      FORMAT( ' Right contig')
+C          CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +    LNCONR,LLINOR,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+C     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+C          IF(IERR.EQ.0)THEN
+CC           CHECK TO SEE IF MAKING A CIRCLE!
+C            IF(LLINOL.EQ.LLINOR)THEN
+C              WRITE(KBOUT,*)' Making circles is not allowed!'
+C              GO TO 200
+C            END IF
+C            CALL JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +      LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+C     +      IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+C     +      DEVNOS(3),LINLEN,PERCD,
+C     +      HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+C     +      MAXGEL,IDM)
+C          END IF
+C        END IF
+C        GO TO 10
+C      END IF
+      IF((NOPT.EQ.23).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Complement contig')
+C       COMPLEMENT
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,LLINO,GEL,IDBSIZ,KBOUT,DEVNOS(1),DEVNOS(2),
+     +    MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+CC  EDIT
+C      IF((NOPT.EQ.-4).AND.(IOPEN.EQ.0))THEN
+CC       EDIT DB
+C        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+C     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+C        IF(IERR.EQ.0)THEN
+C          CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +    GEL,LINCON,0,0,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL)
+C        END IF
+C        GO TO 10
+C      END IF
+C  PLOT SINGLE CONTIG
+      IF((NOPT.EQ.33).AND.(IOPEN.EQ.0))THEN
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0)THEN
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.26).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Alter relationships')
+C       FIX
+        CALL DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),
+     +  HELPS(4),HELPE(4),HELPF,DEVNOS(5),MAXGEL,IDEVT,IDEVC)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.24).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Copy database')
+C       COPY
+        CALL DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +  NAMPRO,DEVNOS(4),IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),
+     +  DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,MAXDB,IDM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.19).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Check database for consistency')
+C       CHECK
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP1,IERR,KBOUT)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.29).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Examine quality')
+C       SCAN
+        IDIM1=MAXSEQ
+        CALL DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),IDEV,LINLEN,PERCD,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,GEL3,GEL4,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEVNOS(3),
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.8).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Make consensus')
+C       CONSENSUS
+        IDIM1=MAXSEQ
+        CALL CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),DEVNOS(4),FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,IDM,PERCD,
+     +  DEVNOS(3),LLINO)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+C        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(10),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),KOPT)
+        GO TO 10
+      END IF
+C  HELP
+      IF(NOPT.EQ.1)THEN
+C        CALL SHOWFU(KBOUT,'Help')
+C       HELP
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,
+     +  HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.27)THEN
+C        CALL SHOWFU(KBOUT,'Set parameters')
+C  SET DISPLAY PARAMETERS
+        MN = 10
+        MX = 100
+        CALL GETINT(MN,MX,LINLEN,'Display line length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) LINLEN = IVAL
+        IPCMIN = 51
+        IPCMAX = 100
+        IPCD = INT(PERCD*100.)
+        CALL GETINT(IPCMIN,IPCMAX,IPCD,
+     +  'Percentage score for consensus',
+     +  IPVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) THEN
+          PERCD = REAL(IPVAL)/100.
+        END IF
+        MN = 1
+        MX = MAXGLM
+        CALL GETINT(MN,MX,MXGOOD,
+     +  'Maximum read length for quality analysis',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) MXGOOD = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,WINDOW,'Text window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) WINDOW = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,IGWIND,'Graphics window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) IGWIND = IVAL
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+C        CALL SHOWFU(KBOUT,'Display text file')
+       CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     + HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.28) THEN
+C        CALL SHOWFU(KBOUT,'Highlight disagreements')
+        CALL HIGHLT(SEQ1,SEQ1(6001),SEQ1(10001),KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(7),
+     +  DEVNOS(8),FILE,IOK)
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.34).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Plot contigs')
+C  PLOT ALL CONTIGS
+        CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,0,0,
+     +  NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+CC  AUTO EDIT
+C      IF((NOPT.EQ.30).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'AutoEdit a contig')
+C        WRITE(KBOUT,*)'Make a copy first!'
+C        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+C     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+C        IF(IERR.EQ.0)THEN
+C          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+C        IF(IERR.EQ.0) THEN
+C        IDIM1=MAXSEQ
+C        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,IDIM1,
+C     +  GEL,LREG,RREG,LLINO,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+C     +  DEVNOS(2),MAXGEL,IDM,PERCD)
+C        CALL AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,LINCON,
+C     +  GEL,MAXGEL,SEQ1,IDIM1,DEVNOS(2),DEVNOS(1),LREG,RREG,
+C     +  KBOUT)
+C      END IF
+C      END IF
+C      GO TO 10
+C      END IF
+C      IF((NOPT.EQ.9).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Screen Edit a contig')
+C        CALL CEDT(KBIN,KBOUT,
+C     +  POSNS(1),POSNS(1001),POSNS(2001),
+C     +  MAXDB,POSNS(3001),POSNS(3101),50,
+C     +  RELPG,LNGTHG,LNBR,RNBR,
+C     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+C     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(4),
+C     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),LINLEN,FILNAM,LLINO,IOK)
+C        GO TO 10
+C      END IF
+      IF((NOPT.EQ.4).AND.(IOPEN.EQ.0))THEN
+C       CALL UP CONTIG EDITOR
+        CALL CONEDT(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(9),HELPE(9),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  LINLEN,FILNAM,LLINO,IOK,
+     +  IERR,TEMP1)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.22).AND.(IOPEN.EQ.0))THEN
+C       CALL UP JOIN EDITOR
+        CALL JOINED(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(9),HELPE(9),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),IDEVT,IDEVC,
+     +  LINLEN,FILNAM,LLINO,IOK,
+     +  IERR,TEMP1)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17) THEN
+C        CALL SHOWFU(KBOUT,'Screen for restriction sites')
+        CALL SCRENR(GEL2,MAXGEL,GEL3,NAMARC,FILE,
+     +  DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.18) THEN
+C        CALL SHOWFU(KBOUT,'Screen against vector')
+        CALL SCRENV(MAXGEL,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +  SEQ1,MAXSEQ,GEL2,GEL3,GEL4,LENGTH,
+     +  SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +  KBIN,KBOUT,DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,FILE,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.20).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+C        CALL SHOWFU(KBOUT,'Auto assemble sequences')
+      CALL DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,IDEVC,IDEVT,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.35).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+C        CALL SHOWFU(KBOUT,'Find internal joins')
+      CALL AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.32).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Extract gel readings')
+        CALL GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +  NCONTS,GEL,MAXGEL,DEVNOS(2),DEVNOS(3),DEVNOS(4),DEVNOS(7),
+     +  KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),FILE)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+C      IF(NOPT.EQ.31) THEN
+C        CALL SHOWFU(KBOUT,'Type in gel readings')
+C        CALL BATIN(SEQ1,MAXSEQ,KBIN,KBOUT,
+C     +  DEVNOS(4),DEVNOS(7),IDEV,
+C     +  FILNAM,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+C        GO TO 10
+C      END IF
+      IF((NOPT.GT.3).AND.(IOPEN.NE.0))THEN
+        WRITE(KBOUT,1012)
+1012    FORMAT(' You have not opened a database!')
+        GO TO 10
+      END IF
+      GO TO 10
+9999  CONTINUE
+      CALL SHUTD
+      END
diff --git a/src/staden/dbchek.f b/src/staden/dbchek.f
new file mode 100644
index 0000000..ad3c838
--- /dev/null
+++ b/src/staden/dbchek.f
@@ -0,0 +1,133 @@
+C     DBCHEK
+C   TO CHECK THE  SELF CONSISTENCY OF A DB
+C   ARE ALL NEIGHBOURS HOLDING HANDS OR DO SOME HAVE
+C   GELS WHICH DO NOT BELONG TO ANY CONTIG? OR WORSE BELONG TO MORE
+C   THAN ONE CONTIG?
+C   ARE THERE ANY LOOPS IN CONTIGS (THE WORST PROBLEM)
+C   ARE ALL RELATIVE POSITIONS IN SAME ORDER AS NEIGHBOURS?
+C   ARE THERE ANY CONTIGS OF LENGTH <1 ?
+C   ARE THERE ANY GELS OF ZERO LENGTH ?
+C   RETURN ERROR CODE OF ZERO FOR ALL OK, 1 FOR GELS NOT USED
+C   AND 2 FOR ALL OTHER ERRORS
+      SUBROUTINE DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP,IERR,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP(IDBSIZ)
+      IERR=0
+C   HAND HOLDING OK?
+      DO 100 I=1,NGELS
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 50
+        IF(RNBR(K).EQ.I)GO TO 50
+        WRITE(KBOUT,1000)I
+1000    FORMAT(' Hand holding problem for gel reading',I6)
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+1001    FORMAT(' ',5I6)
+        IERR=2
+50      CONTINUE
+        K=RNBR(I)
+        IF(K.EQ.0)GO TO 100
+        IF(LNBR(K).EQ.I)GO TO 100
+        WRITE(KBOUT,1000)I
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+        IERR=2
+100   CONTINUE
+C
+C   ALL GELS IN CONTIGS OR IN MORE THAN ONE CONTIG?
+C  TEMP IS USED TO COUNT THE NUMBER OF TIMES EACH GEL IS USED
+      CALL FILLI(TEMP,IDBSIZ,0)
+      N=IDBSIZ-NCONTS
+C   COUNT NUMBER OF PASSES THRU LOOP 320 TO 300. IF THIS EXCEEDS
+C   THE DATABASE SIZE A LOOP HAS BEEN FOUND!
+      ICOUNT=0
+      DO 300 I=N,IDBSIZ-1
+        II=I
+        K=LNBR(I)
+        J=RNBR(I)
+        IF((K.NE.0).AND.(J.NE.0))GO TO 310
+C   THIS CONTIG POINTS TO ZERO GEL NUMBER AS LEFT OR RIGHT END!
+        WRITE(KBOUT,1002)I
+1002    FORMAT(' Contig',I4,' has zero gel numbers')
+        IERR=2
+        GO TO 290
+310     CONTINUE
+        IF((LNBR(K).EQ.0).AND.(RNBR(J).EQ.0))GO TO 290
+C   THIS LEFT END IS NOT A LEFT END OR THIS RIGHT END IS NOT A RIGHT END
+        WRITE(KBOUT,1004)I
+1004  FORMAT(' The end gels of contig',I4,' have outward neighbours')
+        IERR=2
+290     CONTINUE
+C       DOES THE CONTIG HAVE SOME +VE LENGTH?
+        IF(RELPG(I).GT.0)GO TO 320
+        WRITE(KBOUT,1010)I
+1010    FORMAT(' The contig on line number',I4,' has zero length')
+        IERR=2
+320     CONTINUE
+        TEMP(K)=TEMP(K)+1
+        ICOUNT=ICOUNT+1
+        IF(ICOUNT.GT.IDBSIZ)GO TO 601
+        K=RNBR(K)
+        IF(K.NE.0)GO TO 320
+300   CONTINUE
+      DO 400 I=1,NGELS
+        IF(TEMP(I).EQ.1)GO TO 390
+        IF(TEMP(I).EQ.0)GO TO 410
+        WRITE(KBOUT,1005)I,TEMP(I)
+1005    FORMAT(' Gel number ',I6,' is used ',I6,' times')
+        IERR=2
+        GO TO 400
+390     CONTINUE
+C       DOES THE GEL HAVE NONZERO LENGTH? (ONLY CHECK THOSE USED ONCE)
+        IF(LNGTHG(I).NE.0)GO TO 400
+        WRITE(KBOUT,1011)I
+1011    FORMAT(' Gel number',I6,' has zero length')
+        IERR=2
+        GO TO 400
+410     CONTINUE
+        WRITE(KBOUT,1006)I
+1006    FORMAT(' Gel number ',I6,' is not used')
+C       NEED TO INCREMENT THE ERROR COUNT NOT RESET IT TO A LOWER VALUE!
+        IF(IERR.LT.2)IERR=1
+400   CONTINUE
+C   ALL RELATIVE POSNS OK?
+      N=IDBSIZ-NCONTS
+      DO 500 I=N,IDBSIZ-1
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 500
+510     CONTINUE
+        J=RNBR(K)
+        IF(J.EQ.0)GO TO 500
+        IF(RELPG(K).GT.RELPG(J))GO TO 520
+        K=J
+        GO TO 510
+520     CONTINUE
+        WRITE(KBOUT,1007)K,RELPG(K),J,RELPG(J)
+1007    FORMAT(' Gel number',I6,' with position',I6,
+     +  ' is the left neighbour of',
+     +  /,' gel number',I6,' with position',I6)
+        K=J
+        IERR=2
+        GO TO 510
+500   CONTINUE
+      IF(IERR.EQ.0)WRITE(KBOUT,1013)
+1013  FORMAT(' Database is logically consistent')
+      RETURN
+601   CONTINUE
+      IERR=2
+      WRITE(KBOUT,1008)II
+1008  FORMAT(' Loop in contig',I6,/,
+     +' No further checking done but gel numbers follow')
+      CALL FILLI(TEMP,IDBSIZ,0)
+      K=LNBR(II)
+710   CONTINUE
+      TEMP(K)=TEMP(K)+1
+      WRITE(KBOUT,1009)K
+1009  FORMAT(' ',I6)
+      IF(TEMP(K).GT.1)RETURN
+      K=RNBR(K)
+      GO TO 710
+      END
diff --git a/src/staden/dbsyscommon.f b/src/staden/dbsyscommon.f
new file mode 100644
index 0000000..d7a9b04
--- /dev/null
+++ b/src/staden/dbsyscommon.f
@@ -0,0 +1,5835 @@
+C  DBSYS ROUTINES COMMON TO PRE AND POST .RD PROGRAMS
+C 8-4-92 altered gtconc, summer,summar,dsplay
+C 6-4-92 bug fix in autocn: i did not copy the correct elements
+C        for some cases where a better alignment was found: i was
+C        copying from jlefts instead of ilefts, jlc and jsense.
+C  Further sap routines are in dbsysold.f (pre .rd) and dbsysnew.f (post)
+C  the split was made by rs 23-1-91
+C  4-5-90 Change to getreg to allow escape
+C  4-5-90 addition of graphics routines and changes to menus
+C  9-5-90 added default gel reading: many changes
+C  17-5-90 Fixed 3 bugs in screen editing: 1) rightjustified names
+C          caused problems; 2) beginnings of sequences starting at far
+C          right of lines where not seen; 3) lines with no numbers at
+C          the end of a contig (i.e. with <10 chars) were flagged
+C          as errors. Changes to ltype for 1, linlen for 2, dsplay for 3
+C  9-7-90  removed menu routines
+C  20-8-90 changed gelid to add / to reading name because xsap did
+C          not return the INFLAG = 3 for the default
+C  23-8-90 Changes to dbauto and autocn to deal with failures better
+C          Plus addition of calls to BUSY
+C  9-11-90 Replaced call to radio with call to radion
+C  19-11-90 Changed max match length in dbauto to maxglm+1  (was 50)
+C  25-11-90 Very important bug fix in tpchek. Old versions could
+C           duplicate bits of working versions.
+C  28-11-90 Modified slider to receive maxpg and maxpc and to allow exactly
+C           the requested number of matches at each end of the two 
+C           sequences. 
+C           Added two new options to dbauto: all gels to new contigs, all 
+C           gels to contig 1; plus resurrected forbidding joins to allow
+C           sequences to be entered only into the contig the overlap best.
+C           Changed autocn to sort overlaps into order based on % mismatch
+C           (previously it saved the best two in any order)
+C           Minor change to dbstar
+C 3-1-91   Discovered bug in dbopen: incorrect call to getint when the
+C          database is very old and needs values for the current format
+C 21-1-91  GELID allowed illegal gel numbers to be returned! Fixed it.
+C 22-1-91  Modified autocn, adism4,adism3 to give more info about
+C          overlaps, and to allow 10 overlaps. Modified dbopen to
+C          return version number, ditto dbstar
+C 23-1-91  Split into dbsyscommon, dbsysold, dbsysnew
+C 26-2-91  Improved overflow check in padcop
+C 28-7-91 added extra parameter to quality calc: mxgood is the maximum
+C         reading length in which we have confidence, so only add this
+C         many chars from the start of each reading. Also changed the
+C         quality calc to make it the same as the consensus one. Made
+C         all characters have nonzero score and made lowercase = 100
+C 21-8-91 Changed arrfil to arrfim which does not display comments
+C 22-8-91 Added routine to find contig line number given left gel (CLINNO)
+C 8-11-91 Added fmt4lp which is used by "find internal joins" and would
+C         be useful elsewhere as a replacement for fmt4ln
+C 18-11-91 New routine GETLN2 with returns gel number specified
+C 27-2-92  Added use of ctonum to cfgel to deal with gel and vector
+C          being different case.
+C 2-3-92   set       FILNAM = ' ' for some calls to openf1
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+C      ABEDIN
+C
+C   ROUTINE TO EDIT THE DB USING A PADDED SEQ
+C   HAVE AN ARRAY SEQC2 LENGTH IDC OF PADDED SECTION OF CONTIG LINCON
+C  THE LEFT END OF THE PADDED CONTIG STARTS AT X
+C   THERE ARE ITOTPC PADS TO MAKE
+C
+      SUBROUTINE ABEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,POSN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQC2(IDC),GEL(MAXGEL),P
+      SAVE P
+      DATA P/','/
+C
+C   POINT TO CONTIG
+      POSN=X-1
+C   POINT TO SEQC2
+      IAT=0
+C   COUNT PADS DONE
+      IDONE=0
+C   LOOP FOR ALL SEQC2
+      DO 100 J=1,IDC
+      POSN=POSN+1
+      IAT=IAT+1
+      IPAD=0
+C   IS THIS A PADDING CHAR?
+      IF(SEQC2(IAT).NE.P)GO TO 100
+50    CONTINUE
+C   COUNT PADS
+      IPAD=IPAD+1
+      IAT=IAT+1
+      IF(SEQC2(IAT).EQ.P)GO TO 50
+C   END OF THIS STRETCH OF PADS,DO INSERT
+C   HAVE IPAD INSERTS TO MAKE AT POSN
+      CALL PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,IPAD,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   MOVE POINTER TO CONTIG
+      POSN=POSN+IPAD
+C   COUNT PADS DONE
+      IDONE=IDONE+IPAD
+C   ANY MORE TO DO?
+      IF(IDONE.EQ.ITOTPC)GO TO 101
+100   CONTINUE
+C   ERROR SHOULD HAVE DONE ALL PADS
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Problem: some pads were not done!')
+101   CONTINUE
+      END
+C      ACONS
+      SUBROUTINE ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,ANS,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' CALCULATING A CONSENSUS')
+C
+C
+C
+      CALL BUSY(KBOUT)
+      IFAIL=0
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      DO 110 I=N,IDBSIZ-1
+      J=LNBR(I)
+      X=1
+      Y=RELPG(I)
+      ISTART=ISTART+1
+      IF((ISTART+19+Y).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+1009     FORMAT(
+     +' Database maximum consensus length(',I6,') exceeded',/,
+     +' calculation aborted')
+         IFAIL=1
+         RETURN
+      END IF
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +IDM,PERCD)
+      ISTART=ISTART+Y-1
+110   CONTINUE
+      RETURN
+C   SELECTED CONTIGS ONLY
+150   CONTINUE
+      ISTART=ISTART+1
+      IDIM11=RREG-LREG+1
+      IF((ISTART+19+IDIM11).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         IFAIL=1
+         RETURN
+      END IF
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+C
+C
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,
+     2IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM11-1
+C
+      RETURN
+      END
+      SUBROUTINE ADDTIT(SEQ1,NAMPRO,NGELS,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(20),NAMPRO*(*)
+      CHARACTER NUMS(10)
+      SAVE NUMS
+      DATA NUMS/'0','1','2','3','4','5','6','7','8','9'/
+      CALL FILLC(SEQ1(2),18,'-')
+      SEQ1(1) = '<'
+      SEQ1(20) = '>'
+      IEND = INDEX(NAMPRO,'.')
+      N=NGELS
+      K=IEND+3
+      DO 10 J=1,3
+        N=MOD(N,10)+1
+        NAMPRO(K:K)=NUMS(N)
+        N=NGELS/(10**J)
+        K=K-1
+10    CONTINUE
+      K = 18-IEND
+      K=K/2
+      DO 20 I=1,IEND+3
+        SEQ1(K)=NAMPRO(I:I)
+        K=K+1
+20    CONTINUE
+      IDIM1=IDIM1+20
+      RETURN
+      END
+      SUBROUTINE ADISM1(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,
+     +ISTRAN,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C  NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCCC
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C
+C   EDITED 07-02-83 TO ALLOW FOR CASE WHERE A GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL. USE PARM THAT CONTAINS
+C   THE POSITION OF THE LEFT END OF THE NEXT CONTIG. SET TO VERY HIGH
+C   VALUE TO START
+      NEXTC=IDIM+1
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPERATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C  COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     1SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C   COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     1SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+C
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       ADISM2
+C   ROUTINE TO DISPLAY MATCHES
+      SUBROUTINE ADISM2(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C   NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCC
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC. SEE ADISM1.
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with contig number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   UPDATE END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.GT.2)RETURN
+      ILEFTS(IMATC)=CENDS(JJ)+20
+      ILC(IMATC)=LCL+LCR+1
+      IPOSC(IMATC)=LCL+1
+      IPOSG(IMATC)=SAVPG
+      LLINO(IMATC)=NENDS(JJ)
+      ISENSE(IMATC)=1
+      IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      RETURN
+      END
+      SUBROUTINE ADISM3(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+        WRITE(KBOUT,1000)LLINO(IMATC),IPOSC(IMATC),ISTRAN,
+     +  IPOSG(IMATC)
+ 1000   FORMAT
+     +  (' Contig',I5,' position',I6,' matches strand',I2,
+     +  ' at position',I5)
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+      SUBROUTINE ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      RETURN
+      END
+      SUBROUTINE AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +GEL,MAXGEL,CON,IDC,IDEVW,IDEVR,LREG,RREG,KBOUT)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),CON(IDC)
+      INTEGER RREG,PC,PCA,PG
+      NG = LGEL
+      PG = RELPG(NG)
+      PC = LREG
+      NTT = 0
+      NCT = 0
+      NDT = 0
+10    CONTINUE
+C      WRITE(*,*)'GEL',NG
+      CALL READW(IDEVW,NG,GEL,MAXGEL)
+      LG = ABS(LNGTHG(NG))
+      IF(PC.LT.LREG) PC = LREG
+      PCA = PC - LREG + 1
+      IG = PC - PG + 1
+      LC = MIN(LG,RREG-PC+1)
+C      WRITE(*,*)'PC,PG,IG,LG,PCA,LC',PC,PG,IG,LG,PCA,LC
+      CALL ET(GEL(IG),LG,CON(PCA),LC,NE)
+      NTT = NTT + NE
+      CALL EC(GEL(IG),LG,CON(PCA),LC,NE)
+      NCT = NCT + NE
+      CALL ED(GEL(IG),LG,CON(PCA),LC,ND)
+      NDT = NDT + ND
+      CALL WRITEW(IDEVW,NG,GEL,MAXGEL)
+      IF(ND.GT.0) THEN
+        K = LNGTHG(NG)
+        LNGTHG(NG) = ABS(LNGTHG(NG)) - ND
+        LNGTHG(NG) = SIGN(LNGTHG(NG),K)
+        CALL WRITER(IDEVR,NG,RELPG(NG),LNGTHG(NG),LNBR(NG),RNBR(NG))
+      END IF
+      IF(RNBR(NG).NE.0) THEN
+        NG = RNBR(NG)
+        PG = RELPG(NG)
+        PC = PG
+        IF(PG.LE.RREG) GO TO 10
+      END IF
+      CALL EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      WRITE(KBOUT,1000)NTT
+1000  FORMAT(' Number of transpositions=',I6)
+      WRITE(KBOUT,1001)NCT
+1001  FORMAT(' Number of changes       =',I6)
+      WRITE(KBOUT,1002)NDT
+1002  FORMAT(' Number of deletions     =',I6)
+      END
+C    AJOIN2
+C   COMPLETES JOIN AND RETURNS LENGTH OF NEW CONTIG IN LLINOR
+      SUBROUTINE AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +RELX,LLINOL,LLINOR,LNCONL,LNCONR,IDEVR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNBR(IDBSIZ),RNBR(IDBSIZ),LNGTHG(IDBSIZ)
+      INTEGER RELX
+C   RELX IS THE POSITION OF THE JOINT
+C   LLINOL IS THE LEFT GEL NUMBER OF THE LEFT CONTIG
+C   LLINOR IS THE LEFT GEL OF THE RIGHT CONTIG
+C   LNCONL IS THE LEFT CONTIG LINE NUMBER
+C   LNCONR IS THE RIGHT CONTIG LINE NUMBER
+C
+C   ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+      N=LLINOR
+      RELPG(N)=RELX
+50    CONTINUE
+      IF(RNBR(N).EQ.0)GO TO 60
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+RELX-1
+      GO TO 50
+60    CONTINUE
+C
+C   FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+      LNBR(LLINOR)=RNBR(LNCONL)
+C   FIX UP RIGHT GEL OF LEFT CONTIG
+      N=RNBR(LNCONL)
+      RNBR(N)=LLINOR
+C   MERGE WILL SORT OUT THE CORRECT NEIGHBOURS
+C
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C   MERGE DOES NOT WRITE TO DISK
+      N=LNBR(LNCONL)
+65    CONTINUE
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N))
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 65
+C   CONTIG LINES
+      X=RELPG(LNCONR)+RELX-1
+C   LENGTH MAY NOT HAVE INCREASED!
+      IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+C   SAVE LENGTH OF NEW CONTIG
+      RELX=RELPG(LNCONL)
+C      WRITE(IDEVR,REC=LNCONL)RELPG(LNCONL),LNGTHG(LNCONL),LNBR(LNCONL),
+C     1RNBR(LNCONL)
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C
+C   NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+      N=IDBSIZ-NCONTS
+      M=LNCONR-N
+      IF(M.EQ.0)GO TO 80
+      K=LNCONR
+      J=LNCONR-1
+      DO 70 I=1,M
+      RELPG(K)=RELPG(J)
+      LNGTHG(K)=LNGTHG(J)
+      LNBR(K)=LNBR(J)
+      RNBR(K)=RNBR(J)
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K))
+      K=K-1
+      J=J-1
+70    CONTINUE
+80    CONTINUE
+      NCONTS=NCONTS-1
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+90    CONTINUE
+      RETURN
+      END
+C     SUBROUTINE AJOIN3
+      SUBROUTINE AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,IDIM22,
+     +KLASS,IOVER,KBOUT,PL,PR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LINCON(2),IDIM22(2)
+      INTEGER ITYPE(2),ISENSE(2),JOINT(2),PL(2),PR(2)
+C
+C   CALC POSITIONS OF CONTIGS RELATIVE TO FIXED GEL
+      DO 20 I=1,2
+C   R+
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.1))GO TO 11
+      PL(I)=-1*JOINT(I)+2
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   L+
+11    CONTINUE
+      IF((ITYPE(I).NE.1).OR.(ISENSE(I).NE.1))GO TO 12
+      PL(I)=JOINT(I)
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   R-
+12    CONTINUE
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.-1))GO TO 13
+      PR(I)=JOINT(I)+IDIM22(I)-1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+      GO TO 20
+C   L-
+13    CONTINUE
+      PR(I)=IDIM22(I)-JOINT(I)+1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+20    CONTINUE
+C  LENGTH OF OVERLAP
+      IOVER=MIN(PR(1),PR(2))-MAX(PL(1),PL(2))+1
+      WRITE(KBOUT,1002)IOVER
+1002  FORMAT(' Length of overlap between the contigs=',I6)
+C
+C  CLASS NUMBER 1-16
+      KLASS=1
+      IF(ITYPE(1).EQ.1)KLASS=KLASS+8
+      IF(ISENSE(1).EQ.-1)KLASS=KLASS+4
+      IF(ITYPE(2).EQ.1)KLASS=KLASS+2
+      IF(ISENSE(2).EQ.-1)KLASS=KLASS+1
+C      WRITE(KBOUT,1001)KLASS
+C1001  FORMAT(' CLASS OF JOIN=',I6)
+      RETURN
+      END
+C      ALINE
+C
+C    ROUTINE TO LINE UP 2 SEQS.
+C   IT SLIDES,REMOVES OVERLAPPING MATCHES,
+C   SORTS MATCHES INTO ASCENDING ORDER, THEN DOES DOES A TOPOLOGICAL
+C   CHECK, AND THEN PRODUCES 2 LINED UP SEQS WITH PADDING CHARS
+C   VARIABLES
+C       SEQ1 CONSENSUS
+C       SEQ2 GEL ORIGINAL IN CORRECT ORIENTATION
+C       SEQG2 ALIGNED GEL
+C       SEQC2 ALIGNED CONSENSUS
+C       SEQ3 SAVED GEL RAW DATA
+C       ISAV1,2,3 STORE MATCHES AND POSITIONS
+C       IDSAV NUMBER ISAV'S
+C       IDC LENGTH OF INPUT SEQ1
+C       IDIM2 LENGTH OF INPUT SEQ2
+C       IDOUT LENGTH OF OUTPUT ALIGNED SEQ1
+C       IDIM2 LENGTH OF SEQ2 ON OUTPUT AFTER ALIGNMENT
+C       MINSLI MIN MATCH FOR SLIDING
+C       IFAIL FLAG TO SHOW IF ALIGNMENT FAILED DUE TO TOO
+C   MANY MISMATCHES OR TOPOLIGICAL CHECK OR TOO MANY OR TOO MANY
+C   PADDING CHARS. 1=FAIL,0=PASS
+C
+      SUBROUTINE ALINE(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+      WRITE(KBOUT,1052)ITOTPC,ITOTPG
+1052  FORMAT(' Padding in contig= ',I4,' and in gel= ',I4)
+      CALL DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL)
+C   ARE ALL CHECKS OK?
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(PERCM.GT.PERMAX)IFAIL=1
+      END
+      SUBROUTINE ARCSER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*10,NAME2*10
+10    CONTINUE
+      L = 0
+      CALL GTSTR('Archive name',' ',NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(L.EQ.0) RETURN
+      CALL CCASE(NAME1,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      IF(NAME1(1:1).EQ.' ') RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      DO 100 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        IF(NAME1.EQ.NAME2) THEN
+          WRITE(KBOUT,1003)NAME2,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1003      FORMAT(' ',A,2X,I4,2X,I6,2X,I6,2X,I6,2X,I6/)
+          GO TO 10
+        END IF
+100   CONTINUE
+      WRITE(KBOUT,1004)NAME1
+1004  FORMAT(' ',A,' Not in database')
+      GO TO 10
+      END
+      SUBROUTINE AUTOCN(SEQ1,IDIM,GEL,IDIMG,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL)
+C   AUTHOR: RODGER STADEN
+C   changed 29-11-90 to make first in list of alignments the best
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIM),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIM),GEL(MAXGLM)
+C
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 10)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC)
+      REAL PERMIS(2)
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+C  23-8-90 Need to deal with failures in a better way. Problem is
+C          case where overlaps are found but fail to align. In future
+C          signal them with new variable KFAIL which will be nonzero
+C          if any alignment fails.
+C  29-11-90 Changed sorting of overlaps so that the best is first in the 
+C           list returned to caller.
+C   SAVE GEL
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+C  COUNT NUMBER OF CONTIGS THAT MATCH
+      IMATC=0
+      IDCEND=MAXCON
+      CALL BUSY(KBOUT)
+      CALL FNDCON(SEQ1,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      CALL ENCO(SEQ1,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+1     CONTINUE
+      ISTRAN=1
+2     CONTINUE
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0)RETURN
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.EQ.0) THEN
+        IFAIL(1) = 0
+        RETURN
+      END IF
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINE(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS)
+        IF(JFAIL.EQ.0) THEN
+          JMATC = JMATC + 1
+          IF(JMATC.EQ.1) THEN
+C    Save in elements 1
+             CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            IFAIL(1) = 0
+          ELSE IF(JMATC.EQ.2) THEN
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+              IFAIL(2) = 0
+            END IF
+          ELSE
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE IF(PERMS.LT.PERMIS(2)) THEN
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+            END IF
+          END IF
+        ELSE
+          KFAIL = 1
+        END IF
+100   CONTINUE
+      IMATC = MIN(2,JMATC)
+      END
+      SUBROUTINE BATIN(SEQ,MAXSEQ,KBIN,KBOUT,IDEV1,IDEV2,IDEV3,
+     +FILNAM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR RODGER STADEN
+      CHARACTER SEQ(MAXSEQ)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER BASE(4),BKEY(4)
+      SAVE BASE,BKEY
+      DATA BASE/'A','C','G','T'/
+      DATA BKEY/'N','M',',','.'/
+      CALL CBASE(BASE,BKEY,IOK,ICBASE,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV1,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      NGELS=0
+10    CONTINUE
+      IDIM=MAXSEQ
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Type data on lines of < 80 characters, finish with @'/)
+      CALL ARRFIM(KBIN,SEQ,IDIM,KBOUT)
+      IF(IDIM.GT.0)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File name for gel reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.0) THEN
+          IF(ICBASE.EQ.0) THEN
+            CALL CCASEA(SEQ,IDIM,2)
+            DO 5 J = 1,4
+              CALL SUBS(SEQ,IDIM,BKEY(J),BASE(J))
+5           CONTINUE
+          END IF
+          CALL CCASEA(SEQ,IDIM,1)
+          CALL FMTDKN(IDEV2,SEQ,IDIM)
+          CLOSE(UNIT=IDEV2)
+          NGELS=NGELS+1
+          WRITE(IDEV1,1002)FILNAM
+1002      FORMAT( A)
+        END IF
+      END IF
+      CALL YESNO(IANS,'Type in another gel reading',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IANS.EQ.0) GO TO 10
+      CALL YESNO(IANS,'List gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IANS.EQ.0) THEN
+        REWIND IDEV1
+        DO 50 I=1,NGELS
+          IDIM=MAXSEQ
+          READ(IDEV1,1002)FILNAM
+          CALL OPENRS(IDEV2,FILNAM,IOK,LRECL,2)
+          IF(IOK.NE.0)THEN
+            WRITE(KBOUT,*)' Error opening file'
+            GO TO 50
+          END IF
+          WRITE(IDEV3,1007)FILNAM
+1007      FORMAT(' ',A)
+          CALL ARRFIM(IDEV2,SEQ,IDIM,KBOUT)
+          IF(IDIM.GT.0)CALL FMTDB(SEQ,IDIM,1,IDIM,60,IDEV3)
+          CLOSE(UNIT=IDEV2)
+50      CONTINUE
+      END IF
+      CLOSE(UNIT=IDEV1)
+      WRITE(KBOUT,*)'Number of gel readings entered',NGELS
+      END
+C      BEDIN
+C
+C   SUBROUTINE TO BED A NEW GEL INTO THE DATABASE. IT DISPLAYS
+C   THE JOIN AND ALLOWS CHANGES TO IT, THEN IT ALLOWS EDITING OF
+C   THE NEW GEL USING EDITG OR THE GELS IN THE DATABASE USING
+C   DBEDIT. ANY CHANGES CAN BE DISPLAYED USING DSPLAY. THIS SUBROUTINE
+C   RETURNS NOPT TO TELL OF THE OUTCOME OF ITS USE:3=OK,1=REJOIN,4=GIVE UP
+C   ON INPUT NOPT=2 MEANS LEFT END OVERLAP
+      SUBROUTINE BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,LLINO,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPE1,HELPS2,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(5)*(MAXPRM)
+      INTEGER LREG,RIGHTM
+      INTEGER X,RREG
+C   SAVE NOPT FOR LEFT END OR RIGHT JOIN
+      NTYPE=NOPT
+      LREG=X
+      IF(NTYPE .EQ.2)LREG=1
+      RREG=LREG+49
+      IF(RREG.GT.RELPG(LINCON))RREG=RELPG(LINCON)
+      I1=1
+      IF(NTYPE.EQ.2)I1=X
+      I2=I1+49
+      IF(I2.GT.ABS(LNGTHG(NGELS)))I2=ABS(LNGTHG(NGELS))
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +LLINO,LINCON,LREG,RREG,GEL,I1,I2,2,LLINOR,IDBSIZ,IDEV,
+     +KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+      CALL YESNO(ANS,'Joint correct',
+     +HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.NE.0) THEN
+        NOPT = 1
+        RETURN
+      END IF
+20    CONTINUE
+      CALL BELL(1,KBOUT)
+C     DBMENU now defunct for sap - so we use RADION instead
+C      CALL DBMENU(2,JOPT,HELPS1,HELPE1,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PROMPT(1) = 'Complete entry'
+      PROMPT(2) = 'Edit contig'
+      PROMPT(3) = 'Display'
+      PROMPT(4) = 'Edit new gel reading'
+      JOPT = 1
+      CALL RADION('Enter options', PROMPT, 4, JOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+C  GIVE UP
+      IF(JOPT.EQ.-1)THEN
+        CALL YESNO(ANS,'Quit',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.EQ.0) THEN
+          NOPT = 4
+          RETURN
+        END IF
+      END IF
+C     COMPLETE JOIN
+      IF(JOPT.EQ.1)THEN
+        CALL YESNO(ANS,'Complete entry',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.NE.0) GO TO 20
+        NOPT=3
+        RETURN
+      END IF
+100   CONTINUE
+      IF(JOPT.EQ.4)THEN
+C       EDIT NEW GEL
+        IDIM=ABS(LNGTHG(NGELS))
+        CALL EDITGL(GEL,GEL2,IDIM,KBIN,KBOUT,MAXGEL,
+     +  HELPS2,HELPE2,FILEH,IDEVH,1)
+        LNGTHG(NGELS)=SIGN(IDIM,LNGTHG(NGELS))
+      END IF
+200   CONTINUE
+      IF(JOPT.EQ.3)THEN
+C       DISPLAY
+        LLINO=LNBR(LINCON)
+        LEFTMN=X
+        RIGHTM=X+ABS(LNGTHG(NGELS))-1
+        RIGHTM=MIN(RIGHTM,RELPG(LINCON))
+        IF(NTYPE.EQ.2)THEN
+          LEFTMN=1
+          RIGHTM=ABS(LNGTHG(NGELS))-X+1
+          RIGHTM=MIN(RIGHTM,RELPG(LINCON))
+        END IF
+        CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LEFTMN,
+     +  RIGHTM,LREG,
+     +  RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS1,HELPE1,FILEH,IDEVH,IERR)
+        IF(IERR.NE.0) GO TO 20
+        IF(NTYPE.EQ.2)GO TO 210
+        I1=LREG-X+1
+        IF(I1.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        I2=RREG-X+1
+        IF(I2.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        GO TO 230
+210     CONTINUE
+        I1=X+LREG-1
+        I2=X+RREG-1
+        IF(I1.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        IF(I2.GT.ABS(LNGTHG(NGELS)))GO TO 200
+230     CONTINUE
+        CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +  LLINO,LINCON,LREG,RREG,GEL,I1,I2,2,LLINOR,IDBSIZ,IDEV,
+     +  KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+        LLINO=LNBR(LINCON)
+      END IF
+300   CONTINUE
+      IF(JOPT.EQ.2)THEN
+C       EDIT DATABASE
+        IF(NTYPE.EQ.2)THEN
+          IMAXL=1
+          IMAXR=ABS(LNGTHG(NGELS))-X+1
+        ELSE IF(NTYPE.EQ.1)THEN
+          IMAXL=X
+          IMAXR=X+ABS(LNGTHG(NGELS))-1
+        END IF
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LINCON,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+      IF(JOPT.EQ.1)THEN
+        CALL HELP2(HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      END IF
+      GO TO 20
+      END
+      SUBROUTINE BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN,CHAINL,GCLIN
+      EXTERNAL CLEN,CHAINL,GCLIN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTR = IDBSIZ - NCONTS - 1
+      IF(NCONTR.LE.NGELS) THEN
+        WRITE(KBOUT,*)'Insufficient space for new contig line.'
+        WRITE(KBOUT,*)'Increase database size with copy'
+        RETURN
+      END IF
+10    CONTINUE
+        MN = 0
+        MX = NGELS
+        IR  = 0
+        CALL GETINT(MN,MX,IR,
+     +  'Number of gel reading that will become a left end',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.LT.1) RETURN
+        IR = IVAL
+      IL = LNBR(IR)
+      IF(IL.EQ.0)THEN
+        WRITE(KBOUT,*)'Gel number',IR,' is already a left end'
+        GO TO 10
+      END IF
+      ILO = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IR)
+      IF(ILO.EQ.0)THEN
+        WRITE(KBOUT,*)
+     +'Problem with this contig. Check logical consistency'
+        WRITE(KBOUT,*)'of database. Break not made'
+        RETURN
+      END IF
+      NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(NCONTO.EQ.0)THEN
+        WRITE(KBOUT,*)'No contig line for this contig. Check logical'
+        WRITE(KBOUT,*)'consistency of database. Break not made'
+        RETURN
+      END IF
+      LCONTO = RELPG(NCONTO)
+      IF(LCONTO.LT.1)THEN
+        WRITE(KBOUT,*)'Contig has zero length. Break not made'
+        RETURN
+      END IF
+      CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+      END
+      SUBROUTINE CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN
+      EXTERNAL CLEN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTS = NCONTS + 1
+C  WRITE LAST LINE OF DB
+      WRITE(KBOUT,*)'Increasing number of contigs by 1'
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C  MAKE NEW CONTIG A COPY OF OLD
+      RELPG(NCONTR) = RELPG(NCONTO)
+      LNGTHG(NCONTR) = LNGTHG(NCONTO)
+      LNBR(NCONTR) = IR
+      RNBR(NCONTR) = RNBR(NCONTO)
+      WRITE(KBOUT,*)'Writing new right contig line'
+      CALL WRITER(IDEVR,NCONTR,RELPG(NCONTR),LNGTHG(NCONTR),
+     +LNBR(NCONTR),RNBR(NCONTR))
+C  NEED LENGTH FOR OLD LEFT CONTIG
+      RNBR(IL) = 0
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(L.LT.1)THEN
+        WRITE(KBOUT,*)
+     +  'New left contig has zero length. Break not made'
+        RETURN
+      END IF
+      RELPG(NCONTO) = L
+      RNBR(NCONTO) = IL
+C  DO CONTIG LINE FOR NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new left contig line'
+      WRITE(KBOUT,*)'New length=',RELPG(NCONTO)
+      WRITE(KBOUT,*)'New right gel=',RNBR(NCONTO)
+      CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +LNBR(NCONTO),RNBR(NCONTO))
+C  DO GEL LINE FOR RIGHT GEL OF NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new right gel of left contig'
+      WRITE(KBOUT,*)'Gel number=',IL
+      CALL WRITER(IDEVR,IL,RELPG(IL),LNGTHG(IL),
+     +LNBR(IL),RNBR(IL))
+C  DO GEL LINE FOR NEW RIGHT CONTIG
+      LNBR(IR) = 0
+      WRITE(KBOUT,*)'Writing new left gel of right contig'
+      WRITE(KBOUT,*)'Gel number=',IR
+      CALL WRITER(IDEVR,IR,RELPG(IR),LNGTHG(IR),
+     +LNBR(IR),RNBR(IR))
+C  NOW SHIFT
+      I = 1 - RELPG(IR)
+      WRITE(KBOUT,*)'Shifting gels in right contig by distance=',I
+      CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IR,NCONTR,I)
+      WRITE(KBOUT,*)'Right contig length=',RELPG(NCONTR)
+      WRITE(KBOUT,*)'Break completed'
+      IOK = 0
+      END
+C     BUBBL3
+C   SUBROUTINE TO SORT INTEGER ARRAY (LIST) INTO ASCENDING  ORDER
+C
+      SUBROUTINE BUBBL3(LIST,LISTEL,LISTAL,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LISTEL(IDIM),LISTAL(IDIM)
+C
+C   SET POINTERS TO ZERO
+      I=0
+      J=0
+C
+10    CONTINUE
+C
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+C
+C   INCREMENT POINTER TO NEXT ELEMENT
+      I=I+1
+C   TEST FOR END OF ARRAY
+      IF(I.EQ.IDIM)RETURN
+C
+20    CONTINUE
+C
+C   COMPARE ADJACENT ELEMENTS
+      IF(LIST(I).GE.LIST(I+1))GO TO 10
+C
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+C
+C   EXCHANGE ADJACENT ELEMENTS
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+C
+      ITEMP=LISTEL(I)
+      LISTEL(I)=LISTEL(I+1)
+      LISTEL(I+1)=ITEMP
+      ITEMP=LISTAL(I)
+      LISTAL(I)=LISTAL(I+1)
+      LISTAL(I+1)=ITEMP
+C
+C
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+C
+      GO TO 20
+      END
+      SUBROUTINE CBASE(BASE,BKEY,IOK,ICBASE,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR RODGER STADEN
+      CHARACTER BASE(4),BKEY(4),HELPF*(*),NEW(4)
+      CHARACTER PROMPT*14
+      SAVE PROMPT
+      DATA PROMPT/'Key for base  '/
+      IOK = 1
+      CALL YESNO(ICBASE,'Use special keys for ACTG',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ICBASE.EQ.1) THEN
+        IOK = 0
+        RETURN
+      ELSE IF(ICBASE.NE.0) THEN
+       RETURN
+      END IF
+      I = 1
+1     CONTINUE
+      IF(I.LT.5) THEN
+        PROMPT(14:14) = BASE(I)
+        L = 1
+        CALL GETSTR(PROMPT,BKEY(I),NEW,4,L,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+           CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+           GO TO 1
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(NEW(1).EQ.';') THEN
+          WRITE(KBOUT,*)' ; is not allowed'
+          GO TO 1
+        END IF
+        IF(L.NE.0) BKEY(I) = NEW(1)
+        I = I + 1
+        GO TO 1
+      END IF
+      CALL CCASEA(BKEY,4,2)
+      IOK = 0
+      END
+      SUBROUTINE CCTA(SEQ,ID)
+      CHARACTER SEQ(ID),COM,AS
+      SAVE COM,AS
+      DATA COM/','/,AS/'*'/
+      DO 10 I = 1,ID
+        IF(SEQ(I).EQ.COM) SEQ(I) = AS
+10    CONTINUE
+      END
+      SUBROUTINE CEDT(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,LINLEN,
+     +FILNAM,LLINO,IOK)
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER LINEIN*80,GEL(MAXGEL),GEL2(MAXGEL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2),RREG
+      WRITE(KBOUT,*)'Identify contig to edit'
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for temporary editing file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      LREG = 1
+      RREG = RELPG(LINCON)
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+      CLOSE(UNIT=IDEV)
+      CALL CEDIT(FILNAM)
+      CALL YESNO(IWANT,'Put edited contig into database',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IWANT.NE.0) RETURN
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+      IF(IOK.NE.0) RETURN
+      KGEL = LLINO
+      CALL BUSY(KBOUT)
+      CALL POSTC(GELNOS,GELSTR,GELEND,LINEIN,
+     +IOK,KBOUT,IDEV,MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,
+     +LNBR,RNBR,IDBSIZ,KGEL)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      REWIND IDEV
+      CALL NEWDB(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,KBOUT,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXDB,GEL,MAXGEL,LINCON,KGEL,IOK)
+      CLOSE(UNIT=IDEV)
+      END
+C
+C     CFGEL
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C bug fix 27-2-92: if sequence had different case matches would
+C not be extended correctly because we look for identical characters
+C switched to using ctonum
+      IDSAVM=IDSAV
+      IDSAV=0
+C   LENGTH OF PAIR OF WORDS
+      LX2=2*LENGTH
+C   NUMBER OF PAIRS OF WORDS OF LENGTH LENGTH IN GEL
+      NW=IDIMG-(LX2-1)
+C   LOOP FOR EACH START POINT
+      DO 200 I=1,NW
+C   IS THIS WORD A ZERO?
+      W1=GELN(I)
+      IF(W1.EQ.0)GO TO 200
+C   POINT TO NEXT WORD OF PAIR
+      W2=GELN(I+LENGTH)
+      IF(W2.EQ.0)GO TO 200
+C   DOES W1 EXIST IN SEQ?
+      N1S1=WORDN(W1)
+      IF(N1S1.EQ.0)GO TO 200
+      N2S1=WORDN(W2)
+      IF(N2S1.EQ.0)GO TO 200
+C   BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+      PS1W1=WORDP(W1)+LENGTH
+C   LOOP FOR ALL PAIRS
+      DO 50 J=1,N1S1
+C   POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+      PS1W2=WORDP(W2)
+C   LOOP FOR THESE
+      DO 40 K=1,N2S1
+C   ARE THIS PAIR LENGTH APART?
+      N=PS1W1-PS1W2
+      IF(N.NE.0)GO TO 20
+C   THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+      LMAT=LX2
+      IF(MINMAT.EQ.LX2)GO TO 15
+      IPC=PS1W2+LENGTH
+      IPG=I+LX2
+16    CONTINUE
+      IF(IPG.GT.IDIMG)GO TO 15
+      IF(IPC.GT.IDIM)GO TO 15
+C
+      IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 15
+      LMAT=LMAT+1
+      IPC=IPC+1
+      IPG=IPG+1
+      GO TO 16
+15    CONTINUE
+C  IS MATCH LONG ENOUGH?
+      IF(LMAT.LT.MINMAT)GO TO 20
+      IDSAV=IDSAV+1
+      IF(IDSAV.LE.IDSAVM)GO TO 18
+      WRITE(KBOUT,1000)IDSAVM
+1000  FORMAT(' More than ',I6,' matches. Search aborted')
+      IFAIL=1
+      RETURN
+18    CONTINUE
+      SAVL(IDSAV)=LMAT
+      SAVPG(IDSAV)=I
+      SAVPS(IDSAV)=PS1W1-LENGTH
+20    CONTINUE
+C   POINT TO NEXT W2
+      PS1W2=POSNS(PS1W2)
+40    CONTINUE
+C   ALL TRIED THIS PS1W1, TRY NEXT
+      PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50    CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+      INTEGER FUNCTION CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINL = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = LNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINL = J
+      END
+C
+C      CHANGE
+C
+C   ROUTINE TO EXCHANGE ALL THE CHARS IN A CHARACTER ARRAY USING
+C   A PAIR OF LOOKUP ARRAYS SENT BY CALLING PROG
+C
+C
+      SUBROUTINE CHANGE(SEQ,IDIM1,CHAR1,CHAR2,IDIM2,ELSE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      CHARACTER CHAR1(IDIM2),CHAR2(IDIM2)
+      CHARACTER ELSE
+      DO 100 I=1,IDIM1
+C
+      DO 50 J=1,IDIM2
+C
+      IF(SEQ(I).NE.CHAR1(J))GO TO 50
+C   MATCH SO EXCHANGE CHARS
+      SEQ(I)=CHAR2(J)
+      GO TO 100
+50    CONTINUE
+      SEQ(I)=ELSE
+100   CONTINUE
+C
+      RETURN
+      END
+      CHARACTER*1 FUNCTION CHARSL(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'ctag*-'/
+      CHARSL = C(I:I)
+      END
+      CHARACTER*1 FUNCTION CHARSU(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'CTAG*-'/
+      CHARSU = C(I:I)
+      END
+      INTEGER FUNCTION CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = LGEL
+      CHNRP = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +LGEL,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = LGEL
+      CHNRP1 = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I)+ABS(LNGTHG(I))-1.LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP1 = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      CLEN= 0
+      LEN = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        LEN = MAX(LEN,(RELPG(I) + ABS(LNGTHG(I)) - 1))
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CLEN = LEN
+      END
+      SUBROUTINE CLIST(GELNO1,LINNO1,IGEL1,GELNO2,LINNO2,
+     +IGEL2,GELNOS,GELSTR,GELEND,IUNIQ1,IUNIQ,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER GELNO1(IGEL1),GELNO2(IGEL2),GELNOS(IUNIQ1)
+      INTEGER GELSTR(IUNIQ1),GELEND(IUNIQ1)
+      INTEGER LINNO1(IGEL1),LINNO2(IGEL2)
+      EXTERNAL INLIST
+C   GELNOS === GEL NUMBERS (GELNOS)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   GELNO  === GEL NUMBERS PER STRIP
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C
+C     WHICH GELS IN GELNO2 DO NOT APPEAR IN GELNO1
+C     IE HAVE STARTED IN GELNO2
+        DO 20 I=1,IGEL2
+        MATCH=INLIST(GELNO1,IGEL1,GELNO2(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH SO NEW
+C         PUT IN GELSTR
+          IUNIQ=IUNIQ+1
+          GELNOS(IUNIQ)=GELNO2(I)
+          GELSTR(IUNIQ)=LINNO2(I)
+        END IF
+20    CONTINUE
+C     WHICH GELS IN GELNO1 DO NOT APPEAR IN GELNO2
+C     IE WHICH HAVE ENDED IN GELNO1
+      DO 10 I=1,IGEL1
+        MATCH=INLIST(GELNO2,IGEL2,GELNO1(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH  SO MUST HAVE ENDED
+C         WHERE IS IT STORED IN GELNOS?
+          MATCH=INLIST(GELNOS,IUNIQ,GELNO1(I))
+          IF(MATCH.NE.0)THEN
+            GELEND(MATCH)=LINNO1(I)
+            GO TO 10
+          END IF
+C         ERROR
+          WRITE(KBOUT,1000)GELNO1(I)
+1000      FORMAT( ' Error: gel number ',I5,
+     +    ' expected but not found in list')
+          IOK = 1
+          RETURN
+        END IF
+10    CONTINUE
+      IOK = 0
+      RETURN
+      END
+C
+C      CMPLMT
+C
+C   SUBROUTINE TO REVERSE AND COMPLEMENT GELS AND DATA BASE
+C   THE POSITIONS OF THE RIGHT ENDS OF GELS ARE FIRST STORED
+C   IN RELPG THEN WE DO A BUBBLE SORT ON THESE POSITIONS
+C   UPDATING RELATIONSHIPS AS WE GO
+C   ALSO SEQUENCES ARE COMPLEMENTED, SIGNS OF LENGTH ARE
+C   MULTIPLIED BY -1 AND THE CONTIG LINE IS ALTERED
+      SUBROUTINE CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,GEL,IDBSIZ,KBOUT,IDEVR,IDEVW,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER X
+C
+      WRITE(KBOUT,1000)LLINO
+1000  FORMAT( ' Complementing contig',I6)
+C   CHAIN THRU AND PUT RIGHT ENDS IN RELPG
+      N=LLINO
+10    CONTINUE
+      RELPG(N)=RELPG(N)+(ABS(LNGTHG(N)))-1
+      IF(RNBR(N).EQ.0)GO TO 20
+      N=RNBR(N)
+      GO TO 10
+20    CONTINUE
+C
+C   NOW EFFECTIVELY BUBBLE SORT ON RELPG
+      N=RNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+      N=NL
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+      NL=LNBR(N)
+      IF(NL.EQ.0)GO TO 30
+      I1=0
+23    CONTINUE
+      IF(RELPG(N).GE.RELPG(NL))GO TO 21
+C   NOT IN CORRECT ORDER SO CHAIN ALONG UNTIL CORRECT,THEN COME
+C   BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE THIS LINE SET POINTER TO CURRENT POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C
+C   EXCHANGE NEIGHBOURS. CURRENTLY LOOKING AT N AND ITS LEFT
+C   NBR, AND THE LEFT NBR IS FURTHER RIGHT THAN N
+C   FIX UP POINTERS TO LEFT AND RIGHT OF THESE TWO
+      M=LNBR(NL)
+      IF(M.NE.0)RNBR(M)=N
+      M=RNBR(N)
+      IF(M.NE.0)LNBR(M)=NL
+      LNBR(N)=LNBR(NL)
+      LNBR(NL)=N
+      RNBR(NL)=RNBR(N)
+      RNBR(N)=NL
+C   CHAIN BACK THRU LIST WITH THIS LINE
+      N=RNBR(NL)
+      IF(N.EQ.0)GO TO 21
+C   IE END MET
+      GO TO 23
+30    CONTINUE
+C   FINISH WITH LEFT END IN N
+40    CONTINUE
+C   NOW REVERSE NBRS SO CHAIN BACK RIGHT
+      NL=RNBR(N)
+      IF(NL.EQ.0)GO TO 50
+      RNBR(N)=LNBR(N)
+      LNBR(N)=NL
+      N=NL
+      GO TO 40
+50    CONTINUE
+C   NEED TO FIX UP NEW LEFT END
+      RNBR(N)=LNBR(N)
+      LNBR(N)=0
+C   ALL POINTERS FIXED NOW DO RELATIVE POSITION
+C   FINISH WITH LEFT END IN N
+C   SO CHAIN BACK RIGHT
+C   SAVE RIGHT LINE NUMBER
+      NL=N
+      X=RELPG(N)
+60    CONTINUE
+      RELPG(N)=1+(-1*(RELPG(N)-X))
+      IF(RNBR(N).EQ.0)GO TO 70
+      N=RNBR(N)
+      GO TO 60
+70    CONTINUE
+C   NOW FIX CONTIG LINE
+      LNBR(LINCON)=NL
+      RNBR(LINCON)=N
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+C   NOW REVERSE AND COMPLEMENT GELS
+      N=NL
+80    CONTINUE
+C      READ(IDEVW,REC=N)GEL
+      CALL READW(IDEVW,N,GEL,MAXGEL)
+      M=ABS(LNGTHG(N))
+      CALL SQREV(GEL,M)
+      CALL SQCOM(GEL,M)
+      CALL WRITEW(IDEVW,N,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=N)GEL
+C   CHANGE SIGNS
+      LNGTHG(N)=-1*LNGTHG(N)
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   ANY MORE?
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 80
+C   NO MORE
+      RETURN
+      END
+C      CONSEN
+C   CALCULATES A CONSENSUS USING THE RULES OUTLINED IN THE DOCUMENTATION
+C   AND SUBROUTINE SUMMER
+C   UNIT IDEV IS USED FOR OUTPUT
+      SUBROUTINE CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +KBIN,KBOUT,IDEVW,IDEV,NAMCON,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,IDM,PERCD,IDEVN,LLINO)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CHARACTER NAMCON*(*)
+100   CONTINUE
+      ISTART=1
+      NAMCON = ' '
+      CALL OPENF1(IDEV,NAMCON,1,IOK,KBIN,KBOUT,
+     +'Name for consensus file',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(ANS,'Make consensus for whole database',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      CALL BUSY(KBOUT)
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded,',/,
+     +    ' calculation aborted')
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+Y
+110   CONTINUE
+      ISTART=ISTART-1
+      CALL FMTDK(IDEV,SEQ1,ISTART)
+      GO TO 400
+150   CONTINUE
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)GO TO 400
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)GO TO 400
+      IDIM2=RREG-LREG+1
+      IF((ISTART+19+IDIM2).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         RETURN
+      END IF
+      CALL BUSY(KBOUT)
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1(ISTART),IDIM2,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM2
+300   CONTINUE
+      CALL YESNO(ANS,'Select another contig',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0) GO TO 150
+      ISTART=ISTART-1
+      CALL FMTDK(IDEV,SEQ1,ISTART)
+400   CONTINUE
+      CALL YESNO(ANS,'Make another consensus',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0)GO TO 100
+      RETURN
+      END
+      SUBROUTINE COPYM(JLEFTS,ILEFTS,JLC,ILC,
+     +JPOSC,IPOSC,JSENSE,ISENSE,JLLINO,LLINO,
+     +JJOINT,JOINT,JTOTPC,ITOTPC,JTOTPG,ITOTPG,
+     +JTYPE,ITYPE,JDOUT,IDOUT,JDIM22,IDIM22,
+     +SEQG3,SEQG2,SEQC3,SEQC2,PERMS,PERMIS)
+      CHARACTER SEQG3(JDIM22),SEQG2(JDIM22),SEQC3(JDOUT),SEQC2(JDOUT)
+      ILEFTS = JLEFTS
+      ILC = JLC
+      IPOSC = JPOSC
+      IPOSG = JPOSG
+      ISENSE = JSENSE
+      LLINO = JLLINO
+      JOINT = JJOINT
+      ITOTPC = JTOTPC
+      ITOTPG = JTOTPG
+      ITYPE = JTYPE
+      IDOUT = JDOUT
+      IDIM22 = JDIM22
+      CALL SQCOPY(SEQG3,SEQG2,JDIM22)
+      CALL SQCOPY(SEQC3,SEQC2,JDOUT)
+      PERMIS = PERMS
+      END
+      SUBROUTINE CRUNS(GEL,IFIRST,ILAST,RUN)
+      CHARACTER GEL*50,RUN*50,PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C  MARK RUNS IN THIS CONSENSUS SO THAT
+C  ATGGGCC BECOMES
+C     ** *
+      RUN(1:ILAST) = ' '
+      DO 10 I = IFIRST+1,ILAST
+C  MUST ALLOW FOR THE EXTEMELY UNLIKELY CASE OF A RUN OF DASHES
+C  IN THE CONSENSUS
+        IF(GEL(I:I).NE.'-')THEN
+          IF(GEL(I:I).EQ.GEL(I-1:I-1)) RUN(I:I) = PAD
+        END IF
+10    CONTINUE
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(
+     +' Matching region too long for routine dalign. Alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      WRITE(KBOUT,1000)X
+1000  FORMAT(' Percentage mismatch after alignment = ',F4.1)
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Best alignment found')
+      CALL SQMTCH(SEQC2(1),SEQG2(IENDG),SEQ3,LO)
+      CALL FMT4LN(SEQC2(1),SEQG2(IENDG),SEQ3,LO,IENDC,IENDG,KBOUT)
+      IFAIL=0
+      END
+C     DBCHEK
+C   TO CHECK THE  SELF CONSISTENCY OF A DB
+C   ARE ALL NEIGHBOURS HOLDING HANDS OR DO SOME HAVE
+C   GELS WHICH DO NOT BELONG TO ANY CONTIG? OR WORSE BELONG TO MORE
+C   THAN ONE CONTIG?
+C   ARE THERE ANY LOOPS IN CONTIGS (THE WORST PROBLEM)
+C   ARE ALL RELATIVE POSITIONS IN SAME ORDER AS NEIGHBOURS?
+C   ARE THERE ANY CONTIGS OF LENGTH <1 ?
+C   ARE THERE ANY GELS OF ZERO LENGTH ?
+C   RETURN ERROR CODE OF ZERO FOR ALL OK, 1 FOR GELS NOT USED
+C   AND 2 FOR ALL OTHER ERRORS
+      SUBROUTINE DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP,IERR,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP(IDBSIZ)
+      IERR=0
+C   HAND HOLDING OK?
+      DO 100 I=1,NGELS
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 50
+        IF(RNBR(K).EQ.I)GO TO 50
+        WRITE(KBOUT,1000)I
+1000    FORMAT(' Hand holding problem for gel reading',I6)
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+1001    FORMAT(' ',5I6)
+        IERR=2
+50      CONTINUE
+        K=RNBR(I)
+        IF(K.EQ.0)GO TO 100
+        IF(LNBR(K).EQ.I)GO TO 100
+        WRITE(KBOUT,1000)I
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+        IERR=2
+100   CONTINUE
+C
+C   ALL GELS IN CONTIGS OR IN MORE THAN ONE CONTIG?
+C  TEMP IS USED TO COUNT THE NUMBER OF TIMES EACH GEL IS USED
+      CALL FILLI(TEMP,IDBSIZ,0)
+      N=IDBSIZ-NCONTS
+C   COUNT NUMBER OF PASSES THRU LOOP 320 TO 300. IF THIS EXCEEDS
+C   THE DATABASE SIZE A LOOP HAS BEEN FOUND!
+      ICOUNT=0
+      DO 300 I=N,IDBSIZ-1
+        II=I
+        K=LNBR(I)
+        J=RNBR(I)
+        IF((K.NE.0).AND.(J.NE.0))GO TO 310
+C   THIS CONTIG POINTS TO ZERO GEL NUMBER AS LEFT OR RIGHT END!
+        WRITE(KBOUT,1002)I
+1002    FORMAT(' Contig',I4,' has zero gel numbers')
+        IERR=2
+        GO TO 290
+310     CONTINUE
+        IF((LNBR(K).EQ.0).AND.(RNBR(J).EQ.0))GO TO 290
+C   THIS LEFT END IS NOT A LEFT END OR THIS RIGHT END IS NOT A RIGHT END
+        WRITE(KBOUT,1004)I
+1004  FORMAT(' The end gels of contig',I4,' have outward neighbours')
+        IERR=2
+290     CONTINUE
+C       DOES THE CONTIG HAVE SOME +VE LENGTH?
+        IF(RELPG(I).GT.0)GO TO 320
+        WRITE(KBOUT,1010)I
+1010    FORMAT(' The contig on line number',I4,' has zero length')
+        IERR=2
+320     CONTINUE
+        TEMP(K)=TEMP(K)+1
+        ICOUNT=ICOUNT+1
+        IF(ICOUNT.GT.IDBSIZ)GO TO 601
+        K=RNBR(K)
+        IF(K.NE.0)GO TO 320
+300   CONTINUE
+      DO 400 I=1,NGELS
+        IF(TEMP(I).EQ.1)GO TO 390
+        IF(TEMP(I).EQ.0)GO TO 410
+        WRITE(KBOUT,1005)I,TEMP(I)
+1005    FORMAT(' Gel number ',I6,' is used ',I6,' times')
+        IERR=2
+        GO TO 400
+390     CONTINUE
+C       DOES THE GEL HAVE NONZERO LENGTH? (ONLY CHECK THOSE USED ONCE)
+        IF(LNGTHG(I).NE.0)GO TO 400
+        WRITE(KBOUT,1011)I
+1011    FORMAT(' Gel number',I6,' has zero length')
+        IERR=2
+        GO TO 400
+410     CONTINUE
+        WRITE(KBOUT,1006)I
+1006    FORMAT(' Gel number ',I6,' is not used')
+C       NEED TO INCREMENT THE ERROR COUNT NOT RESET IT TO A LOWER VALUE!
+        IF(IERR.LT.2)IERR=1
+400   CONTINUE
+C   ALL RELATIVE POSNS OK?
+      N=IDBSIZ-NCONTS
+      DO 500 I=N,IDBSIZ-1
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 500
+510     CONTINUE
+        J=RNBR(K)
+        IF(J.EQ.0)GO TO 500
+        IF(RELPG(K).GT.RELPG(J))GO TO 520
+        K=J
+        GO TO 510
+520     CONTINUE
+        WRITE(KBOUT,1007)K,RELPG(K),J,RELPG(J)
+1007    FORMAT(' Gel number',I6,' with position',I6,
+     +  ' is the left neighbour of',
+     +  /,' gel number',I6,' with position',I6)
+        K=J
+        IERR=2
+        GO TO 510
+500   CONTINUE
+      IF(IERR.EQ.0)WRITE(KBOUT,1013)
+1013  FORMAT(' Database is logically consistent')
+      RETURN
+601   CONTINUE
+      IERR=2
+      WRITE(KBOUT,1008)II
+1008  FORMAT(' Loop in contig',I6,/,
+     +' No further checking done but gel numbers follow')
+      CALL FILLI(TEMP,IDBSIZ,0)
+      K=LNBR(II)
+710   CONTINUE
+      TEMP(K)=TEMP(K)+1
+      WRITE(KBOUT,1009)K
+1009  FORMAT(' ',I6)
+      IF(TEMP(K).GT.1)RETURN
+      K=RNBR(K)
+      GO TO 710
+      END
+      SUBROUTINE DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SUBROUTINE TO EDIT GEL READINGS IN DATA BASE
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER POSN,Y,X
+      CHARACTER GEL(MAXGEL)
+      CHARACTER CHARS(80),PROMPT*32
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(3)*(MAXPRM)
+      SAVE PROMPT
+      DATA PROMPT/'Characters for gel reading      '/
+1     CONTINUE
+C      CALL BELL(1,KBOUT)
+C      CALL DBMENU(5,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Insert'
+      PRMPT(2) = 'Delete'
+      PRMPT(3) = 'Change'
+      NOPT = 1
+      CALL RADION('Edit options', PRMPT, 3, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.EQ.-1)RETURN
+      IF(NOPT.EQ.3)GO TO 200
+10    CONTINUE
+      MN = 1
+      MX = RELPG(LINCON)
+      POSN = 0
+      IF(IMAXL.GT.0) THEN
+        MN = IMAXL
+        MX = IMAXR      
+      END IF
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+20    CONTINUE
+      MN = 1
+      MX = 50
+      NC = 1
+      CALL GETINT(MN,MX,NC,
+     +'Number of characters',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      NC = IVAL
+C   NEED TO CHECK FOR DELETION TO END OF CONTIG
+      Y=POSN+NC
+      K=RELPG(LINCON)-POSN+1
+      IF((NOPT.EQ.2).AND.(Y.GT.RELPG(LINCON)))NC=K
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      K=POSN-RELPG(LLINO)+1
+      IF(NOPT.EQ.2)GO TO 100
+50    CONTINUE
+      WRITE(PROMPT(28:),1005)LLINO
+1005  FORMAT(I5)
+      LC = NC
+      CALL FILLC(CHARS,NC,'*')
+      CALL GETSTR(PROMPT,CHARS,CHARS,NC,LC,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,2000)LLINO
+2000    FORMAT(
+     +' Data pushed off end of gel',I5,' During insertion')
+        NM512=N-MAXGEL
+        N=MAXGEL
+C       SET M SO THAT THE DATA AT THE END IS LOST
+        M=M-NM512
+        LNGTHG(LLINO)=SIGN(N,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      IC=0
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        IC=IC+1
+C       DONT ALLOW SPACES - SET THEM TO *'S
+        IF(CHARS(IC).EQ.' ')CHARS(IC)='*'
+        GEL(I)=CHARS(IC)
+60    CONTINUE
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+65    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 70
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+(NC)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 76
+90    CONTINUE
+      RELPG(LINCON)=RELPG(LINCON)+(NC)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      GO TO 1
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   DELETE
+100   CONTINUE
+C
+C   MOVE DATA TO LEFT
+      M=K+NC
+C   POSN TO MOVE TO
+      N=MAX(1,K)
+C   NUMBER TO MOVE
+      L=ABS(LNGTHG(LLINO))-M+1
+C   NONE TO MOVE?
+      IF(L.GT.0) THEN
+        DO 160 I=1,L
+          GEL(N)=GEL(M)
+          N=N+1
+          M=M+1
+160     CONTINUE
+      END IF
+C   NEDE TO FIX LENGTHS AND REL POSNS OF GELS EDITED
+C   3 CLASSES GELS LOSING LEFT ENDS,RIGHT ENDS,CENTRES
+C   FOR LEFT ENDS K<1,RIGHT ENDS L<1
+C   SO SET RELPG FOR THOSE LOSING LEFT ENDS
+      IF(K.LT.1)RELPG(LLINO)=POSN
+C   FIX LENGTHS
+      N=N-1
+      IF(L.GT.0)LNGTHG(LLINO)=SIGN(N,LNGTHG(LLINO))
+C   THOSE LOSING RIGHT ENDS
+      M=(POSN-RELPG(LLINO))
+      IF(L.LT.1)LNGTHG(LLINO)=SIGN(M,LNGTHG(LLINO))
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+165   CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 170
+C   DOES IT HAVE DATA IN REGION?
+      X=POSN+(NC)
+      IF(RELPG(LLINO).GE.X)GO TO 170
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 165
+C   WITHIN
+      GO TO 40
+170   CONTINUE
+C   DELETIONS DONE SO DECREMENT ALL GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+      X=POSN+(NC)
+175   CONTINUE
+      IF(RELPG(LLINO).GE.X)GO TO 180
+176   CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 190
+      GO TO 175
+180   CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)-(NC)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 176
+190   CONTINUE
+      RELPG(LINCON)=RELPG(LINCON)-(NC)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      GO TO 1
+200   CONTINUE
+C
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   CHANGE
+      MN = 1
+      MX = NGELS
+      LLINO = 0
+      CALL GETINT(MN,MX,LLINO,'Gel number',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 200
+      LLINO = IVAL
+      WRITE(KBOUT,1022)RELPG(LLINO),LNGTHG(LLINO)
+1022  FORMAT( ' This gels relative position =',I6,' and length =',I6)
+220   CONTINUE
+      MN = RELPG(LLINO)
+      MX = RELPG(LLINO) + ABS(LNGTHG(LLINO)) - 1
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 220
+      POSN = IVAL
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+225   CONTINUE
+      LC = 50
+      NC = 0
+      CALL GETSTR('New chararacters',' ',CHARS,LC,NC,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(NC.EQ.0) GO TO 1
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 225
+      END IF
+      K=POSN-RELPG(LLINO)+1
+C   COPY CHARS
+      DO 230 I=1,NC
+        IF(CHARS(I).EQ.' ')CHARS(I)='*'
+        GEL(K)=CHARS(I)
+        K=K+1
+        IF(K.GT.MAXGEL) GO TO 231
+230   CONTINUE
+231   CONTINUE
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+      GO TO 1
+      END
+C   DBPRNT
+C   PRINTS A DATABASE. IE ITS RELATIONSHIPS
+      SUBROUTINE DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +IDEV,KBIN,KBOUT,IDEVN,LLINO,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LREG,RREG,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMARC*10
+      IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +NCONTS,IDBSIZ,IDEV)
+      WRITE(IDEV,10011)NGELS,NCONTS
+10011 FORMAT(' Number of gel readings',I5,' Number of contigs',I5)
+20    CONTINUE
+      CALL YESNO(ANS,'Select contigs',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0) GO TO 45
+      N=IDBSIZ-NCONTS
+25    CONTINUE
+      CALL YESNO(ANS,'Show gel readings in positional order',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)GO TO 41
+      WRITE(IDEV,1009)
+1009  FORMAT(' CONTIG LINES')
+      WRITE(IDEV,1000)
+1000  FORMAT(' CONTIG      LINE  LENGTH               ENDS'/
+     +'                                     LEFT   RIGHT')
+      DO 30 I=N,IDBSIZ-1
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+30    CONTINUE
+1007  FORMAT( ' ',12X,I4,2X,I6,10X,I6,2X,I6)
+      WRITE(IDEV,1008)
+1008  FORMAT(' GEL LINES')
+      WRITE(IDEV,1001)
+1001  FORMAT(' NAME      NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                     LEFT   RIGHT')
+      DO 40 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        WRITE(IDEV,1006)NAMARC,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+40    CONTINUE
+      RETURN
+C
+41    CONTINUE
+C
+C   SORTED DATA
+      DO 43 I=N,IDBSIZ-1
+        WRITE(IDEV,1021)
+1021    FORMAT( )
+        WRITE(IDEV,1000)
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+        J=LNBR(I)
+        WRITE(IDEV,1001)
+42      CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV,1006)NAMARC,J,RELPG(J),LNGTHG(J),LNBR(J),RNBR(J)
+        J=RNBR(J)
+        IF(J.NE.0)GO TO 42
+43    CONTINUE
+      RETURN
+45    CONTINUE
+C   SELECTED CONTIGS ONLY
+C
+C   GET GEL NUMBER AND CONTIG NUMBER
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1009)
+      WRITE(IDEV,1000)
+      WRITE(IDEV,1007)LINCON,RELPG(LINCON),LNBR(LINCON),RNBR(LINCON)
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1008)
+      N=LLINO
+      WRITE(IDEV,1001)
+46    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).EQ.0)GO TO 48
+      N=RNBR(N)
+      IF(RELPG(N).GT.RREG)GO TO 48
+      GO TO 46
+48    CONTINUE
+      GO TO 45
+      END
+      SUBROUTINE DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,KBIN,KBOUT,IDEVW,
+     +IDEV,LINLEN,PERCD,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,LINOU1,LINOU2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEVN,
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CHARACTER FILEH*(*)
+      PARAMETER (MAXPRM = 10)
+      CHARACTER PROMPT(2)*(MAXPRM)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0) RETURN
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0) RETURN
+      IDIM2=RREG-LREG+1
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL DBSCSM(SEQ1(LREG),IDIM2,KBOUT)
+160   CONTINUE
+      ANS = 1
+      PROMPT(1) = 'List codes'
+      PROMPT(2) = 'Plot codes'
+      CALL RADION('Select results display mode',PROMPT,2,ANS,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.1) RETURN
+      IF(ANS.EQ.1) THEN
+        CALL FMTDB(SEQ1,IDIM1,LREG,RREG,LINLEN,IDEV)
+        RETURN
+      ELSE
+        CALL PLTQ(SEQ1(LREG),IDIM2,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      END
+      SUBROUTINE DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,LLINO,
+     +PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      IDIM2=RREG-LREG+1
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL PLTQ(SEQ1(LREG),IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE DBSCSM(SEQ1,IDIM1,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER CODES(5)
+      REAL X(5)
+      SAVE CODES
+      DATA CODES/'0','1','2','3','4'/
+      DO 50 J=1,5
+        X(J)=0.
+50    CONTINUE
+      DO 100 I=1,IDIM1
+        DO 60 J=1,5
+          IF(SEQ1(I).NE.CODES(J))GO TO 60
+          X(J)=X(J)+1.
+          GO TO 61
+60      CONTINUE
+61      CONTINUE
+100   CONTINUE
+      SUM=0.
+      DO 130 J=1,5
+        SUM=SUM+X(J)
+130   CONTINUE
+      DO 140 J=1,5
+        IF(SUM.NE.0)X(J)=X(J)*100./SUM
+140   CONTINUE
+      WRITE(KBOUT,1001)X(1)
+1001  FORMAT(' ',F6.2,'% OK on both strands and they agree(0)')
+      WRITE(KBOUT,1002)X(2)
+1002  FORMAT(' ',F6.2,'% OK on plus strand only(1)')
+      WRITE(KBOUT,1003)X(3)
+1003  FORMAT(' ',F6.2,'% OK on minus strand only(2)')
+      WRITE(KBOUT,1004)X(4)
+1004  FORMAT(' ',F6.2,'% Bad on both strands(3)')
+      WRITE(KBOUT,1005)X(5)
+1005  FORMAT(' ',F6.2,'% OK on both strands but they disagree(4)')
+      RETURN
+      END
+C     DBSTAT
+      SUBROUTINE DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      N=IDBSIZ-NCONTS
+      SUM=0.
+      DO 20 I=N,IDBSIZ-1
+        SUM=SUM+RELPG(I)
+20    CONTINUE
+      AV=SUM/NCONTS
+      WRITE(KBOUT,1020)SUM,AV
+1020  FORMAT( ' Total contig length ',F10.0,'  Average',
+     +' length ',F10.1)
+      SUMG=0.
+      DO 30 I=1,NGELS
+        SUMG=SUMG+FLOAT(ABS(LNGTHG(I)))
+30    CONTINUE
+      AV=SUMG/SUM
+      WRITE(KBOUT,1021)SUMG
+1021  FORMAT( ' Total characters in gel readings ',F10.0)
+      WRITE(KBOUT,1022)AV
+1022  FORMAT
+     +( ' Average gel characters per consensus character ',F10.2)
+99    CONTINUE
+      RETURN
+      END
+C     DELCON
+C
+C   DELETES CONTIG FROM CONSENSUS SEQUENCE
+      SUBROUTINE DELCON(SEQ1,ILEFT,ILC,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+C   FIRST CHAR TO REPLACE
+      I1=ILEFT-20
+C   FIRST CHAR TO MOVE
+      I2=ILEFT+ILC
+C   IS THIS RIGHTMOST CONTIG ANYWAY?
+      IF(I2.GT.IDIM1)GO TO 10
+C   NUMBER TO MOVE
+      ID=IDIM1-I2+1
+C   MOVE
+      CALL SQCOPY(SEQ1(I2),SEQ1(I1),ID)
+C   RESET LENGTH
+      IDIM1=I1+ID-1
+      RETURN
+10    CONTINUE
+C   RIGHTMOST CONTIG SO DONT MOVE
+      IDIM1=I1-1
+C
+      RETURN
+      END
+      SUBROUTINE DISMAT(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C   EDIT 07-02-83 TO CHECK FOR CASE WHEN GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL!  DONE BY HAVING A
+C   PARAMETER THAT STORES THE POSITION OF THE LEFT END OF THE
+C   NEXT CONTIG (IE THE ONE AFTER THE ONE THE CURRENT GEL OVERLAPS)
+C   SET IT TO A VERY LARGE VALUE INITIALLY
+      NEXTC=99999
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPARATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C      WRITE(KBOUT,1000)SAVPG(1),SAVPS(1)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     +SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C      WRITE(KBOUT,1000)SAVPG(I),SAVPS(I)
+C1000  FORMAT(' ',2I6)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     +SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       DISMAU
+C   ROUTINE TO DISPLAY MATCHES
+C   EDITED 17-12-81 TO NOT SUBTRACT 1 FROM LCL AND LGR
+      SUBROUTINE DISMAU(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC (SEE DISMAT)
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with vector number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   SAVE POSN OF END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      RETURN
+      END
+      SUBROUTINE DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINOO,LINCON,LREG,RREG,GEL2,I1,IDIM,NOPT,
+     +LLINOR,IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,
+     +MAXGEL,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER CHRSIZ
+      PARAMETER (CHRSIZ = 6)
+      PARAMETER (IDC1 = CHRSIZ*100)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,X,XLS2,XLS1,XRS2,XRS1,RREG2
+      CHARACTER MATCH(100)
+      INTEGER CHARS(CHRSIZ,100),CHARS1(IDC1)
+      CHARACTER NAMARC*10
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      INTEGER RELPOS(10),RELPO2(10)
+      INTEGER GELC
+      INTEGER RP
+      INTEGER LSEQNO,RSEQNO
+      CHARACTER LINOUT(100)
+      CHARACTER MUNOTP
+      CHARACTER GTCONC
+      EXTERNAL GTCONC
+      EQUIVALENCE (CHARS1,CHARS)
+      CALL FILLI(CHARS1,IDC1,0)
+C   SET CONTIG NUMBER
+      ICON=1
+      LLINO=LLINOO
+      NLEN=LINLEN/10
+      LSEQNO=LREG
+      X=LINLEN+LSEQNO-1
+      RSEQNO=MIN(RREG,X)
+C   SET LEFT GEL NUMBER FOR RIGHT CONTIG
+      LN2=LLINOR
+C   FIRST GEL NO IS LLINOO
+C   SET RREG FOR RIGHT CONTIG
+      RREG2=IDIM
+C   SET UP LSEQNO,RSEQNO FOR FOR NOPT=3
+      XLS2=I1
+      XRS2=RSEQNO-LSEQNO+XLS2
+9     CONTINUE
+C   IF RIGHT CONTIG SKIP NUMBER PRINTING
+      IF(ICON.EQ.2)GO TO 8
+C   NEED TO KEEP LONGEST LINE LENGTH FOR OUTPUT OF CONSENSUS
+      IE=0
+C   SETUP AND WRITE NUMBERS
+      RELPOS(1)=LSEQNO+9
+      DO 5 I=2,NLEN
+        RELPOS(I)=RELPOS(I-1)+10
+5     CONTINUE
+      WRITE(IDEV,1023)
+     +(RELPOS(K),K=1,MIN(NLEN,MAX(1,(RSEQNO-LSEQNO+1)/10)))
+1023  FORMAT( ' ',19X,10(I9,1X))
+C   SET CURRENT LINE NUMBER
+8     CONTINUE
+      GELC=LLINO
+10    CONTINUE
+C   IS LEFT END OF CURRENT GEL >RREG
+      IF(RELPG(GELC).GT.RSEQNO)GO TO 200
+C   ALSO NEED TO KNOW IF RIGHT END  ON THIS LINE (IF .LT. NO DATA
+C   TO DISPLAY)
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      IF(X.LT.LSEQNO)GO TO 190
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+      CALL FILLC(LINOUT,LINLEN,' ')
+      CALL READN(IDEVN,GELC,NAMARC)
+C
+C   NEED TO KNOW HOW MANY CHARS TO COPY OVER TO OUTPUT LINE
+C   AND WHERE IN LINE TO PUT THEM
+C   CURRENT LINE LEFT END IS LSEQNO,RIGHT END RSEQNO
+C   SO LEFT START CHAR IS
+      X=MAX(LSEQNO,RELPG(GELC))
+C   POSITION IN ARRAY LINE
+      LP=X-LSEQNO+1
+C   RIGHT END CHAR IS
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      X=MIN(RSEQNO,X)
+C   POSITION IN ARRAY LINE
+      RP=X-LSEQNO+1
+C   LOOK FOR LONGEST LINE
+      IF(RP.GT.IE)IE=RP
+C   NEED LEFT START IN GEL
+      K=LSEQNO-RELPG(GELC)+1
+      IF(K.LT.1)K=1
+      NCOP=RP-LP+1
+      IF(NCOP.GT.0)CALL SQCOPY(GEL(K),LINOUT(LP),NCOP)
+      N=LP+NCOP-1
+      II=K-1
+      IF(IDM.EQ.26)THEN
+        DO 50 I = LP,N
+          II = II + 1
+          CALL PCON1(GEL(II),CHARS(1,I))
+50      CONTINUE
+      ELSE
+        DO 70 I=LP,N
+          II=II+1
+          JJ = INDEXS(GEL(II),JSCORE)
+          CHARS(JJ,I) = CHARS(JJ,I) + JSCORE
+70      CONTINUE
+      END IF
+      I=SIGN(GELC,LNGTHG(GELC))
+      WRITE(IDEV,1020)I,NAMARC,(LINOUT(K),K=1,RP)
+1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+C
+190   CONTINUE
+C   NOW GET NEXT GEL TO RIGHT
+      GELC=RNBR(GELC)
+      IF(GELC.NE.0)GO TO 10
+200   CONTINUE
+C   CALC CONSENSUS AND WRITE IT
+      IF(IDM.EQ.26)THEN
+        DO 49 I = 1,LINLEN
+          LINOUT(I) = MUNOTP(CHARS(1,I))
+          CHARS(1,I) = 0
+49      CONTINUE
+      ELSE
+        DO 230 I=1,LINLEN
+          LINOUT(I) = GTCONC(CHARS(1,I),CHRSIZ,PERCD)
+          CALL FILLI(CHARS(1,I),CHRSIZ,0)
+230     CONTINUE
+      END IF
+      WRITE(IDEV,1019)(LINOUT(K),K=1,IE)
+C   IF REQUIRED WRITE COMPARISON GEL
+C   WHICH OPTION IN OPERATION?
+      IF(NOPT.EQ.2)GO TO 52
+      IF(NOPT.NE.3)GO TO 250
+53    CONTINUE
+C   ALREADY DONE THIS LINE CONTIG2?
+      IF(ICON.EQ.2)GO TO 54
+      ICON=2
+C   NEED TO SAVE CONSENSUS FROM LEFT CONTIG
+      CALL SQCOPY(LINOUT,GEL2,IE)
+C   SAVE VALUES FROM LEFT CONTIG
+      XLS1=LSEQNO
+      XRS1=RSEQNO
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN1=LLINO
+C   SET UP VALUES FOR RIGHT CONTIG
+      LSEQNO=XLS2
+      RSEQNO=XRS2
+C   SET LEFT GEL NUMBER
+      LLINO=LN2
+C   GET NEXT GEL
+      GO TO 150
+54    CONTINUE
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN2=LLINO
+C   SET VALUES FOR RIGHT CONTIG NEXT PASS
+      XLS2=XRS2+1
+      XRS2=XLS2+LINLEN-1
+      IF(XRS2.GT.RREG2)XRS2=RREG2
+C   SET UP VALUES FOR LEFT CONTIG
+      LLINO=LN1
+      ICON=1
+      LSEQNO=XLS1
+      RSEQNO=XRS1
+C   SET DECREMENT FOR POINTER TO GEL2
+      MMM=I1-1
+52    CONTINUE
+1017  FORMAT('       NEWGEL      ',100A1)
+1018  FORMAT('       MISMATCH    ',100A1)
+1019  FORMAT('       CONSENSUS   ',100A1)
+1022  FORMAT( ' ',18X,100A1)
+      I2=I1+LINLEN-1
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(NOPT.EQ.2)WRITE(IDEV,1017)(GEL2(K),K=I1,I2)
+C   SET DECREMENT
+      IF(NOPT.EQ.2)MMM=0
+55    CONTINUE
+      CALL FILLC(MATCH,LINLEN,'*')
+      K=0
+      DO 667 J=I1,I2
+        K=K+1
+        IF(GEL2(J-MMM).EQ.LINOUT(K))MATCH(K) = ' '
+667   CONTINUE
+      WRITE(IDEV,1018)(MATCH(K),K=1,IE)
+      RELPO2(1)=(I1)+9
+      DO 240 I=2,NLEN
+        RELPO2(I)=RELPO2(I-1)+10
+240   CONTINUE
+      WRITE(IDEV,1023)(RELPO2(K),K=1,NLEN)
+      I1=I2+1
+      I2=I2+LINLEN
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(I1.GT.I2)RETURN
+250   CONTINUE
+C
+      WRITE(IDEV,1021)
+1021  FORMAT( )
+C   NEXT LINE LENGTH
+C   NEXT LENGTH IS OLD RIGHT +1
+      LSEQNO=RSEQNO+1
+C   NEW RIGHT IS LEFT +LENGTH
+      RSEQNO=LSEQNO+(LINLEN)-1
+C   ARE WE OVER END OF REGION
+      IF(RSEQNO.GT.RREG)RSEQNO=RREG
+C   HAVE WE FINISHED REGION COMPLETELY
+      IF(RSEQNO.LT.LSEQNO) RETURN
+C   NOT FINISHED SO NEED TO FIND CURRENT LEFT GEL NO
+C   CURRENT LEFT GEL IS LLINO
+C
+150   CONTINUE
+C   NEED TO KNOW IF CURRENT LEFT GELS RIGHT END IS INSIDE REGION
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.LSEQNO)GO TO 9
+C   LOOK AT NEXT GEL TO RIGHT
+      LLINO=RNBR(LLINO)
+C   MAY HAVE GONE OVER END OF CONTIG?????
+      IF(LLINO.GT.0)GO TO 150
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE EC(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      PARAMETER (IDASH = 6)
+      K = 0
+      DO 10 I = 1,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.IDASH) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            GEL(I) = CHARSL(JC)
+            K = K + 1
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS CORRECTED=',K
+      END
+      SUBROUTINE ED(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = MIN(IDC,IDG),1,-1
+        JC = INDEXS(CON(I),J)
+        IF(JC.EQ.5) THEN
+          IF(I.LT.IDG) CALL SQCOPY(GEL(I+1),GEL(I),IDG-I)
+          K = K + 1
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS DELETED=',K
+      END
+      SUBROUTINE EDITGL(GEL,CHARS,IDIMG,KBIN,KBOUT,MAXGEL,
+     +IHELPS,IHELPE,FILEH,IDEVH,ISTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILEH*(*)
+      INTEGER POSN,Y
+      CHARACTER GEL(MAXGEL),CHARS(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(3)*(MAXPRM)
+1     CONTINUE
+C      CALL BELL(1,KBOUT)
+C      CALL DBMENU(5,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Insert'
+      PRMPT(2) = 'Delete'
+      PRMPT(3) = 'Change'
+      NOPT = 1
+      CALL RADION('Edit options', PRMPT, 3, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.-1)RETURN
+      IF(NOPT.EQ.3)GO TO 200
+C  INSERT, DELETE
+      MN = ISTART
+      MX = ISTART + IDIMG
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+      POSN=POSN-ISTART+1
+      MN = 1
+      MX = 50
+      NC = 1
+      CALL GETINT(MN,MX,NC,
+     +'Number of characters',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      NC = IVAL
+      Y=POSN+NC
+      K=IDIMG-POSN+1
+      IF((NOPT.EQ.2).AND.(Y.GT.IDIMG))NC=K
+C   CALC POSN IN THIS GEL TO EDIT
+      K=POSN
+C   IS THIS  DELETE?
+      IF(NOPT.EQ.4)GO TO 100
+50    CONTINUE
+      LC = NC
+      CALL FILLC(CHARS,NC,'*')
+      CALL GETSTR('characters to insert',CHARS,CHARS,NC,LC,KBOUT,KBIN,
+     +INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(IFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+C   MOVE THE DATA RIGHT
+      M=IDIMG
+      N=IDIMG+NC
+C   CHECK FOR OVERFLOW OF GEL
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,2000)
+2000    FORMAT(' Data pushed off end of gel during insertion')
+C       HOW MUCH OVER?
+        NM512=N-MAXGEL
+        N=MAXGEL
+C       SET M SO THAT THE DATA AT THE END IS LOST
+        M=M-NM512
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      IC=0
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        IC=IC+1
+C       DONT ALLOW SPACES - SET THEM TO *'S
+        IF(CHARS(IC).EQ.' ')CHARS(IC)='*'
+        GEL(I)=CHARS(IC)
+60    CONTINUE
+      IDIMG=MIN((IDIMG+NC),MAXGEL)
+      GO TO 1
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   DELETE
+100   CONTINUE
+C
+C   MOVE DATA TO LEFT
+      M=K+NC
+C   POSN TO MOVE TO
+      N=MAX(1,K)
+C   NUMBER TO MOVE
+      L=IDIMG-M+1
+C   NONE TO MOVE?
+      IF(L.GT.0) THEN
+C       MOVE THE DATA
+        DO 160 I=1,L
+          GEL(N)=GEL(M)
+          N=N+1
+          M=M+1
+160     CONTINUE
+      END IF
+C  RESET LENGTH
+      IDIMG=IDIMG-NC
+      GO TO 1
+200   CONTINUE
+C
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   CHANGE
+      MN = ISTART
+      MX = ISTART + IDIMG
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+C  ADD IN START POSITION HERE
+      POSN=POSN-ISTART+1
+225   CONTINUE
+      LC = 50
+      NC = 0
+      CALL GETSTR('New characters',' ',CHARS,LC,NC,KBOUT,KBIN,INFLAG)
+      IF(NC.EQ.0) GO TO 1
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(IFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 225
+      END IF
+C   CALC POSITION IN THIS GEL TO EDIT
+      K=POSN
+C   COPY CHARS
+      DO 230 I=1,NC
+        GEL(K)=CHARS(I)
+        K=K+1
+230   CONTINUE
+      GO TO 1
+      END
+      SUBROUTINE EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER CON(IDC)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C  CHANGE RELATIVE POSITIONS FOR AE
+      ND = 0
+      DO 10 I = IDC,1,-1
+        IF(CON(I).EQ.'*') THEN
+          ND = ND + 1
+          K = I + LREG - 1
+          J = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,K)
+          IF(J.NE.0) THEN
+            CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,IDUM,JDUM,IDEVR,
+     +      IDBSIZ,J,NCONT,-1)
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF DELETIONS=',ND
+      END
+      SUBROUTINE ET(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = 2,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.6) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            JNG = INDEXS(GEL(I-1),J)
+            JNC = INDEXS(CON(I-1),J)
+            IF(JNC.NE.JNG) THEN
+              IF((JNG.EQ.JC).AND.(JNC.EQ.JG)) THEN
+                GEL(I) = CHARSL(JNG)
+                GEL(I-1) = CHARSL(JG)
+                K = K + 1
+              END IF
+            END IF
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF CHARS TRANSPOSED=',K
+      END
+      SUBROUTINE FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND
+      STRAND = 1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHP)
+      IF(DEPTHP.LT.0) RETURN
+      STRAND = -1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHM)
+      IF(DEPTHM.LT.0) RETURN
+      CALL PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      END
+      SUBROUTINE FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTH,STRAND
+      EXTERNAL NCDEP
+C  LREG = left contig position
+C  RREG = right ''      ''
+C  LENCON = RREG-LREG+1
+      I = LGEL
+      DEPTH = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      ELSE
+        DEPTH = -1
+        RETURN
+      END IF
+C      WRITE(*,*)'LGEL',LGEL
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            K = RELPG(I) + ABS(LNGTHG(I)) -1
+            DEPTH = MAX(NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,
+     +      STRAND,K),DEPTH)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+C      WRITE(*,*)'DEPTH',DEPTH
+      END
+C      FIND
+C
+C   SUBROUTINE TO FIND THE FIRST OCCURENCE OF A GIVEN STRING
+C   IN A GIVEN ARRAY
+C
+      SUBROUTINE FIND(SEQ,IDIM1,STRING,IDIM2,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2),DASH
+      INTEGER PSEQ,PSTR
+      SAVE DASH
+      DATA DASH/'-'/
+      PSEQ=0
+      PSTR=1
+      IMATCH=0
+C
+100   CONTINUE
+C
+C   PUT PSEQ TO WHERE THIS FAILED MATCH STARTED
+      PSEQ=PSEQ+1-PSTR
+C
+400   CONTINUE
+C
+      PSTR=0
+C
+500   CONTINUE
+C
+C   POINT TO NEXT SEQ CHAR
+      PSEQ=PSEQ+1
+C   TEST FOR END
+      IF(PSEQ.GT.IDIM1)GO TO 300
+C   POINT TO NEXT STRING CHAR
+      PSTR=PSTR+1
+C   TEST FOR DASH IN STRING
+      IF(STRING(PSTR).EQ.DASH)GO TO 450
+C   TEST FOR DASH IN SEQ
+      IF(SEQ(PSEQ).EQ.DASH)GO TO 400
+C   TEST FOR MATCH
+      IF(SEQ(PSEQ).NE.STRING(PSTR))GO TO 100
+C
+450   CONTINUE
+C
+C   TEST FOR END OF STRING IE. WHOLE STRING MATCH
+      IF(PSTR.LT.IDIM2)GO TO 500
+C   HAVE MATCH. GET POINTER TO WHERE IT STARTED
+      IMATCH=PSEQ-IDIM2+1
+C
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE FMT4LP(SEQ1,SEQ2,IDIM,ISW,ISX,IDEV,NAME1,NAME2)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(60),NAME1*(*),NAME2*(*)
+      INTEGER KL(6)
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+10    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+      N=IE-IS+1
+      N=1+(N-1)/10
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,N
+        KL(J)=ISWW
+        ISWW=ISWW+10
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+      WRITE(IDEV,1002)NAME1,(SEQ1(K),K=IS,IE)
+      IL = IE - IS + 1
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1003)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)NAME2,(SEQ2(K),K=IS,IE)
+1002  FORMAT(2X,A,2X,6(10A1,1X))
+ 1003 FORMAT(10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,N
+        KL(J)=ISXX
+        ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+1001  FORMAT( 5X,6(I6,5X))
+      IF(IE.LT.IDIM) GO TO 10
+      END
+      SUBROUTINE FMTDB(SEQ1,IDIM,ISW,ISE,LINLEN,IDEV)
+C   NOTE SAME AS FMTSEP!
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM)
+      INTEGER KL(12)
+      ISWW=ISW-1
+      IE=ISW-1
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.ISE)IE=ISE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,MIN(IE-IS+1,LINLEN)/10)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.ISE)RETURN
+      GO TO 1
+      END
+      SUBROUTINE FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   AUTHOR: RODGER STADEN
+C   STORES THEIR POSITIONS IN CENDS AND THEIR LEFT LINE NUMBERS IN NENDS
+      CHARACTER SEQ(IDIM),DC(3)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      EXTERNAL IFROMC,INDEXA
+      IDCEND=0
+      DO 10 I=1,IDIM
+        IF(SEQ(I).NE.'<')GO TO 10
+        IDCEND=IDCEND+1
+C       PUT POSITION OF LEFT END OF CONTIG IN CENDS
+        CENDS(IDCEND)=I
+        K = INDEXA(SEQ(I),20,'.')
+        IF(K.EQ.0) THEN
+           WRITE(KBOUT,*)'Error in contig title: no dot!'
+           IDCEND = 0
+           RETURN
+         END IF
+        K = K + I
+C        K=I+11
+        DO 5 J=1,3
+          DC(J)=SEQ(K)
+          K=K+1
+5       CONTINUE
+        NENDS(IDCEND)=IFROMC(DC,3,KBOUT)
+10    CONTINUE
+C     STORE POSITION OF LAST CHAR +1 TO SIMPLIFY DISPLAY ROUTINES
+      CENDS(IDCEND+1)=IDIM+1
+      RETURN
+      END
+      INTEGER FUNCTION GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      GCLIN = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          GCLIN = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      INTEGER FUNCTION GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*11,NAME2*10,NAME3*11,NFLAG
+      PARAMETER (NFLAG='/')
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELID = 0      
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 11
+      CALL GTSTR('Contig identfier',NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELID = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        CALL CCASE(NAME1,1)
+        DO 20 I=1,NGELS
+          CALL READN(IDEVN,I,NAME2)
+          CALL CCASE(NAME2,1)
+          IF(NAME1(2:11).EQ.NAME2) THEN
+            GELID = I
+            RETURN
+          END IF
+20     CONTINUE
+        WRITE(KBOUT,1004)NAME1(2:)
+1004    FORMAT(' ',A,' is not in the database!')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELID
+1001    FORMAT(I11)
+        IF((GELID.LT.1).OR.(GELID.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +NCONTS,GEL,MAXGEL,IDEV3,IDEV4,IDEV5,IDEV1,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER  GEL(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER NAMARC*10
+      FILNAM = ' '
+      CALL OPENF1(IDEV5,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for names of extracted gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(I,'Extract ends of contigs only',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.0) RETURN
+      IF(I.EQ.0) GO TO 15
+      DO 10 I=1,NGELS
+        L=ABS(LNGTHG(I))
+        IF(L.GT.0)THEN
+          CALL READN(IDEV4,I,NAMARC)
+          WRITE(KBOUT,1002)NAMARC
+1002      FORMAT(' ',A)
+          WRITE(IDEV5,1003)NAMARC
+1003      FORMAT(A)
+          FILNAM = NAMARC
+          CALL OPENRS(IDEV1,FILNAM,IOK,LRECL,1)
+          IF(IOK.NE.0) GO TO 100
+          CALL READW(IDEV3,I,GEL,MAXGEL)
+          IF(LNGTHG(I).LT.0)THEN
+            CALL SQREV(GEL,L)
+            CALL SQCOM(GEL,L)
+          END IF
+          CALL FMTDKN(IDEV1,GEL,L)
+          CLOSE(UNIT=IDEV1)
+      END IF
+10    CONTINUE
+      RETURN
+15    CONTINUE
+C   NUMBER OF LINES TO PROCESS
+      N=IDBSIZ-NCONTS
+      DO 20 I=N,IDBSIZ-1
+        JL=LNBR(I)
+        JR=RNBR(I)
+        CALL READN(IDEV4,JL,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        FILNAM = NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JL,GEL,MAXGEL)
+        L=ABS(LNGTHG(JL))
+        IF(LNGTHG(JL).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+        IF(JR.EQ.JL)GO TO 20
+        CALL READN(IDEV4,JR,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JR,GEL,MAXGEL)
+        L=ABS(LNGTHG(JR))
+        IF(LNGTHG(JR).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+20    CONTINUE
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error opening file for extracted gel reading'
+      RETURN
+      END
+      SUBROUTINE GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELID
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*)
+      EXTERNAL GELID
+      IERR = 1
+      NCONTC = GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' No contig line for this gel! Fix the database')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      SUBROUTINE GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      END
+      SUBROUTINE GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LEFTMN,RIGHTM,LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,RIGHTM
+      CHARACTER FILEH*(*)
+40    CONTINUE
+      MN = LEFTMN
+      MX = RIGHTM
+      LREG = MN
+      CALL GETINT(MN,MX,LREG,
+     +'Start position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LREG = IVAL
+      MN = LREG
+      MX = RIGHTM
+      RREG = MX
+      CALL GETINT(MN,MX,RREG,
+     +'End position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      RREG = IVAL
+C   NOW FIND FIRST GEL THAT OVER LAPS
+50    CONTINUE
+      X=RELPG(LLINO)+(ABS(LNGTHG(LLINO)))-1
+      IF(X.GE.LREG)GO TO 60
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 50
+60    CONTINUE
+      RETURN
+      END
+      SUBROUTINE GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+      CHARACTER T
+        IF(T.EQ.'0') THEN
+          YF = Y0
+          YT = Y0
+        ELSE IF(T.EQ.'1') THEN
+          YF = Y0
+          YT = YM1
+        ELSE IF(T.EQ.'2') THEN
+          YF = Y0
+          YT = YP1
+        ELSE IF(T.EQ.'3') THEN
+          YF = YP1
+          YT = YM1
+        ELSE IF(T.EQ.'4') THEN
+          YF = YP2
+          YT = YM2
+        END IF
+      END
+      CHARACTER*1 FUNCTION GTCONC(COUNTS,IDM,CUT)
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU
+C 8-4-92 made this routine sum counts
+      GTCONC = '-'
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5    CONTINUE
+      IF(ISUM.EQ.0.) RETURN
+      Y = ISUM
+      DO 10 I = 1,IDM - 1
+        X = REAL(COUNTS(I))/Y
+        IF(X.GE.CUT) THEN
+          GTCONC = CHARSU(I)
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE HIGHLT(GELSAV,NAMSAV,NUMSAV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV1,IDEV2,
+     +FILNAM,IOK)
+      CHARACTER LINEIN*119,CONSEN*100
+      CHARACTER GELNO*4,GEL*100,GELSAV*100,GELNAM*10
+      CHARACTER NAMSAV*10,NUMSAV*4
+      CHARACTER FILNAM*(*),HELPF*(*)
+      DIMENSION GELSAV(50),NAMSAV(50),NUMSAV(50)
+      CHARACTER PLUS*4,MINUS*4
+      EQUIVALENCE (LINEIN(2:2),GELNO),(LINEIN(8:8),GELNAM)
+      EQUIVALENCE (LINEIN(20:20),GEL)
+      EXTERNAL NOTIRL
+      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File containing contig display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for problem display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+C   FORMAT:
+C
+C12345678901234567890       10        20         30    ETC
+C   12  GELNAM0000  CAGACGCGCGCGCGCGCGGATATAGTCTCTCCGCTCT
+C  100  GELNAM0000       TGATACGCTCGCTCTCTCTCTCTCTCTCTTTC
+C                   AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C                           70        80       ETC
+C   12  GELNAM0000     AAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C
+      LIN = 1
+      CALL GTSTR('plus strand symbol',':',PLUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) PLUS = ':'
+      LIN = 1
+      CALL GTSTR('minus strand symbol','.',MINUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) MINUS = '.'
+C   COUNT LINE NUMBERS
+      LINNO=0
+10    CONTINUE
+C   READ LINE OF NOS
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+1003  FORMAT(A)
+C   WRITE IT OUT AGAIN
+      WRITE(IDEV2,1003)LINEIN
+C   ZERO GEL COUNT FOR THIS STRIP
+      IGEL=0
+20    CONTINUE
+C
+C   READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+C       LINEIN=' '
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+C   WHAT SORT OF LINE?  ONLY A GEL WILL HAVE NON BLANK CHARS AT THE LEFT END
+      IF(LINEIN(2:5).NE.' ')THEN
+C   GEL LINE SO SAVE
+      IGEL=IGEL+1
+      GELSAV(IGEL)=GEL
+      NAMSAV(IGEL)=GELNAM
+      NUMSAV(IGEL)=GELNO
+      GO TO 20
+      END IF
+C   MUST BE CONSENSUS
+      CONSEN=GEL
+C   PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+      DO 50 I=1,IGEL
+C       WHERE DOES DATA START AND END?
+        IFIRST=1
+40      CONTINUE
+        IF(GELSAV(I)(IFIRST:IFIRST).NE.' ')GO TO 45
+        IFIRST=IFIRST+1
+        IF(IFIRST.LE.100)GO TO 40
+C       ERROR --- NO DATA FOUND
+        WRITE(KBOUT,1004)LINNO
+1004    FORMAT(' Error on line',I6,' of file')
+        RETURN
+45      CONTINUE
+C       NOW WHERE DOES IT END
+        ILAST=NOTIRL(GELSAV(I),100,' ')
+C       COMPARE WITH CONSENSUS
+        READ(NUMSAV(I),'(I4)',ERR=900)INTEG
+        IF(INTEG.GE.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,PLUS)
+        IF(INTEG.LT.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,MINUS)
+        WRITE(IDEV2,1008)NUMSAV(I),NAMSAV(I),GELSAV(I)(1:ILAST)
+1008    FORMAT(' ',A,2X,A,2X,A)
+50    CONTINUE
+      WRITE(IDEV2,1009)CONSEN
+1009  FORMAT(' ',18X,A)
+1006  FORMAT( )
+C     READ A BLANK LINE
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+      WRITE(IDEV2,1003)LINEIN
+C     NO GO BACK FOR THE NEXT LINE OF NUMBERS
+      GO TO 10
+100   CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished')
+      RETURN
+900   WRITE(KBOUT,*)'Error reading gel number'
+      END
+      SUBROUTINE IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +NCONTS,IX,IY,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +DBTDUX,DBTDUY,NCONT,IGEL,IS)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER CHNRP1
+      EXTERNAL CWORLD,CHNRP1
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      XX = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+      YX = CWORLD(IY,MARGB,MARGT,YMIN,YMAX)
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        IF((XX.GT.XF).AND.(XX.LT.XT)) THEN
+          IS = NINT(((XX-XF)/(XT-XF)) * RELPG(I))
+          JGEL = LNBR(I)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,JGEL,IS)
+          NCONT = I
+          RETURN
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      IGEL = 0
+      NCONT = 0
+      END
+      SUBROUTINE IDTOD(TOPLIN,GEL,IFIRST,ILAST,SYMBOL)
+      CHARACTER TOPLIN*100,GEL*100,SYMBOL*4
+      DO 10 I=IFIRST,ILAST
+        IF(GEL(I:I).EQ.TOPLIN(I:I))GEL(I:I)=SYMBOL(1:1)
+10    CONTINUE
+      END
+      INTEGER FUNCTION INDEXS(C,S)
+      PARAMETER (IDM = 29)
+      CHARACTER C
+      INTEGER POINTS(0:255),SCORES(IDM),IND(IDM),S
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      SAVE SCORES,IND
+      DATA 
+     +IND/1,2,3,4,1,2,3,4,1,2,3,4,1,2,3,4,6,6,6,6,6,6,1,2,3,4,5,5,6/
+C      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  changed 28-7-91 to give 10 to old zeroes and 100 to lowercase
+      DATA SCORES/
+     +100,100,100,100,
+     +75,75,75,75,
+     +100,100,100,100,
+     +100,100,100,100,
+     +10,10,10,10,10,10,
+     +100,100,100,100,100,100,10/
+      I = ICHAR(C)
+      I = POINTS(I)
+      S = SCORES(I)
+      INDEXS = IND(I)
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP FOR SHOTGUN SEQUENCING
+      SUBROUTINE INITS
+C  AUTHOR RODGER STADEN
+      INTEGER POINTS(0:255)
+      PARAMETER (IDM = 29)
+      CHARACTER DUP*29
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-4 FOR ACGT
+C                 acgt
+C                 1234
+C                 BDHV
+C                 KLMN
+C      5 FOR      *
+C      6 FOR      5678- AND ELSE
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 6
+C
+        DO 30 I = 0,255
+          POINTS(I) = IDM
+30      CONTINUE
+        DO 35 I = 1,IDM
+          J = ICHAR(DUP(I:I))
+          POINTS(J) = I
+35      CONTINUE
+      END
+      FUNCTION INLIST(LIST,IDLIST,ITEM)
+C   AUTHOR: RODGER STADEN
+C   SENT LIST LIST, AND ITEM ITEM. IF IN LIST RETURNS ELEMENT NUMBER, ELSE 0
+      INTEGER LIST(IDLIST)
+      INLIST=0
+      DO 1 I=1,IDLIST
+        IF(LIST(I).NE.ITEM)GO TO 1
+        INLIST=I
+      RETURN
+1     CONTINUE
+      RETURN
+      END
+      SUBROUTINE IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH,X,Y,KBOUT,
+     +IGEL,IOK)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      IOK = 1
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      YINCO2 = STRAND*YINC/2.
+      I = LGEL
+      IGEL = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            IF((X.GE.XF).AND.(X.LE.XT)) THEN
+              IGEL = I
+              IF((Y.GE.YF-YINCO2).AND.(Y.LE.YF+YINCO2)) THEN
+                IOK = 0
+                RETURN
+              END IF
+            END IF
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+     +IDBSIZ,IDEV,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPE1,HELPS2,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS
+      INTEGER LREG,RREG,X,RELX,RIGHTM
+      INTEGER LNGTHG(IDBSIZ),RNBR(IDBSIZ),LNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(5)*(MAXPRM)
+15    CONTINUE
+      WRITE(KBOUT,1001)
+1001  FORMAT(
+     +' The joint is the position in the left contig that',/,
+     +' the first character of the right contig overlaps.')
+      MN = 1
+      MX = RELPG(LNCONL)
+      RELX = 0
+      CALL GETINT(MN,MX,RELX,
+     +'Position of join',
+     +IVAL,KBIN,KBOUT,HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.EQ.0) RETURN
+      RELX = IVAL
+      I1=1
+      I2=LINLEN
+      IF((RELPG(LNCONR)).LT.I2)I2=(RELPG(LNCONR))
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +LLINOL,LNCONL,RELX,RELPG(LNCONL),GEL,I1,I2,3,LLINOR,
+     +IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+30    CONTINUE
+      CALL BELL(1,KBOUT)
+C      CALL DBMENU(3,NOPT,HELPS1,HELPE1,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Complete join'
+      PRMPT(2) = 'Edit left contig'
+      PRMPT(3) = 'Display join'
+      PRMPT(4) = 'Edit right contig'
+      PRMPT(5) = 'Move join'
+      NOPT = 1
+      CALL RADION('Join options', PRMPT, 4, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.LT.1)RETURN
+100   CONTINUE
+      IF(NOPT.EQ.5)GO TO 15
+      IF(NOPT.EQ.2)THEN
+C       EDIT LEFT CONTIG
+        IMAXL=RELX
+        IMAXR=MIN(RELPG(LNCONL),(RELX+RELPG(LNCONR)-1))
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LNCONL,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+200   CONTINUE
+      IF(NOPT.EQ.4)THEN
+C       EDIT RIGHT CONTIG
+        IMAXL=1
+        IMAXR=MIN((RELPG(LNCONL)-RELX+1),RELPG(LNCONR))
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LNCONR,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+300   CONTINUE
+      IF(NOPT.EQ.3)THEN
+C       DISPLAY
+        LLINO=LLINOL
+        LEFTMN=RELX
+        RIGHTM=RELPG(LNCONL)
+        IF((RIGHTM-LEFTMN+1).GT.RELPG(LNCONR))RIGHTM=
+     +  LEFTMN+RELPG(LNCONR)-1
+        CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LEFTMN,RIGHTM,LREG,RREG,
+     +  LNCONL,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS1,HELPE1,FILEH,IDEVH,IERR)
+        IF(IERR.NE.0) GO TO 30
+        I1=(LREG-RELX)+1
+        I2=(RREG-RELX)+1
+        CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,LLINO,
+     +  LNCONL,LREG,RREG,GEL,I1,I2,3,LLINOR,IDBSIZ,IDEV,KBOUT,
+     +  IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+      END IF
+40    CONTINUE
+      IF(NOPT.EQ.1)THEN
+C       COMPLETE JOIN
+        CALL YESNO(ANS,'Sure',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.LT.0) RETURN
+        IF(ANS.NE.0) GO TO 30
+C       ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+        N=LLINOR
+        RELPG(N)=RELX
+50      CONTINUE
+        IF(RNBR(N).EQ.0)GO TO 60
+        N=RNBR(N)
+        RELPG(N)=RELPG(N)+RELX-1
+        GO TO 50
+60      CONTINUE
+C
+C       FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+        LNBR(LLINOR)=RNBR(LNCONL)
+C       FIX UP RIGHT GEL OF LEFT CONTIG
+        N=RNBR(LNCONL)
+        RNBR(N)=LLINOR
+        CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C       MERGE DOES NOT WRITE TO DISK
+        N=LNBR(LNCONL)
+65      CONTINUE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+        N=RNBR(N)
+        IF(N.NE.0)GO TO 65
+C       CONTIG LINES
+        X=RELPG(LNCONR)+RELX-1
+        IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C       NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+        N=IDBSIZ-NCONTS
+        M=LNCONR-N
+        IF(M.EQ.0)GO TO 80
+        K=LNCONR
+        J=LNCONR-1
+        DO 70 I=1,M
+          RELPG(K)=RELPG(J)
+          LNGTHG(K)=LNGTHG(J)
+          LNBR(K)=LNBR(J)
+          RNBR(K)=RNBR(J)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+          K=K-1
+          J=J-1
+70      CONTINUE
+80      CONTINUE
+        NCONTS=NCONTS-1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        RETURN
+      END IF
+      GO TO 30
+      END
+C     LINEUP
+C
+C   TAKES 2 SEQS SET OF MATCHES AND PRODUCES LINED UP SEQS
+C   FINDS IF WE HAVE A LEFT OVERLAP
+C   RETURNS POSITION OF JOINT. THIS IS RELATIVE TO THE CONTIG
+C   FOR MOST MATCHES BUT I RELATIVE TO THE GEL FOR A LEFT OVERLAP
+      SUBROUTINE LINEUP(SEQG,SEQC,SEQG2,SEQC2,IDC,IDG,IDOUT,
+     1MATG,MATC,MATL,IP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQG(IDG),SEQC(IDC),SEQG2(IDOUT),SEQC2(IDOUT),PAD
+      INTEGER MATG(IP),MATC(IP),MATL(IP)
+      SAVE PAD
+      DATA PAD/','/
+      IFAIL=0
+C   ZERO PADDING CHARS IN CONTIG (GEL DONE AT END BY DIFFERENCE
+C   IN INPUT AND OUTPUT LENGTHS)
+      ITOTPC=0
+C   FILL OUTPUT WITH PADDING
+      DO 10 I=1,IDOUT
+        SEQG2(I)=PAD
+        SEQC2(I)=PAD
+10    CONTINUE
+      NMTCH=0
+C   SET INITIAL POINTERS TO OUTPUT
+C   CONSENSUS
+      IS1=1
+C   GEL
+      IS2=1
+C   FIND DISTANCE FROM LEFT MATCH IN GEL TO LEFT OF GEL
+      IG2=MATG(1)-1
+      IF(IG2.EQ.0)THEN
+C       THE LEFT END OF THE GEL MATCHES SO THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET JOINT
+        JOINT=MATC(1)
+C       SKIP NEXT SECTION
+        GO TO 50
+      END IF
+C   FIND DISTANCE FROM LEFT MATCH IN CONTIG TO LEFT OF CONTIG
+      IC2=MATC(1)-1
+C   GET DISTANCE FROM FIRST MATCH IN CONTIG TO FIRST MATCH IN GEL.
+C   IF THIS DISTANCE <0 THEN WE HAVE A LEFT OVERLAP
+      IC1=IC2-IG2+1
+      IF(IC1.GT.0)THEN
+C       THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET LEFT END
+        JOINT=IC1
+C       COPY THE GEL UPTO THE FIRST MATCH, INTO THE OUTPUT ARRAY
+C       CHECK FOR OVERFLOW
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+C       COPY THE CONTIG FOR THE SAME REGION
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(IC1),SEQC2(1),IG2)
+        IS1=IS1+IG2
+        IS2=IS2+IG2
+        GO TO 50
+      END IF
+C   MUST BE LEFT END OVERLAP
+C   SET TYPE
+      ITYPE=1
+C   SET POSITION OF JOINT RELATIVE TO GEL
+      JOINT=ABS(IC1)+2
+C   COPY OVER THE GEL UPTO THE JOINT
+C   CHECK FOR OVERFLOW
+      IF(IG2.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+      IS2=IS2+IG2
+C   WE MAY ALSO HAVE MISMATCHING
+C   DATA AT THE JOIN SO DEAL WITH THAT NOW
+C   IF IC2 >0 THE LEFT END OF THE CONTIG MATCHES THE GEL BUT OTHERWISE
+C   WE HAVE SOME MISMATCHED DATA TO DEAL WITH - WE NEED TO TRANSFER
+C   THE MISMATCHED REGION OF THE CONTIG TO THE OUTPUT ARRAY
+      IF(IC2.GT.0)THEN
+        IF(IC2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(1),SEQC2(1),IC2)
+        IS1=IS1+IC2
+      END IF
+C   WHEN WE GET HERE WE HAVE SORTED OUT THE LEFT ENDS FOR LEFT OVERLAP
+C   AND MISMATCHED LEFT ENDS, WE NOW DEAL WITH THE REST OF THE SEQUENCE
+C   STARTING WITH THE FIRST BLOCK OF IDENTITY
+C
+C IG1 POSITION IN INPUT GEL
+C IS2 POSITION IN OUTPUT GEL
+C IC1 POSITION IN INPUT CONTIG
+C IS1 POSITION IN OUTPUT CONTIG
+C LG1 POSITION OF END OF CURRENT MATCH IN OUTPUT GEL
+C LC1 POSITION OF END OF CURRENT MATCH IN OUTPUT CONTIG
+C LG2 DISTANCE FROM CURRENT MATCH IN INPUT GEL TO NEXT MATCH
+C LC2 DISTANCE FROM CURRENT MATCH IN INPUT CONTIG TO NEXT MATCH
+C
+50    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   COPY NEXT MATCH
+      IG1=MATG(NMTCH)
+      IC1=MATC(NMTCH)
+      L=MATL(NMTCH)
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(IG1),SEQG2(IS2),L)
+C   CHECK FOR OVERFLOW
+      IF(IS1+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQC(IC1),SEQC2(IS1),L)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+L
+      IS2=IS2+L
+C   END OF CURRENT MATCH
+      LG1=IG1+L
+      LC1=IC1+L
+C   ANY MORE MATCHES
+      IF(NMTCH.EQ.IP)GO TO 500
+      K=NMTCH+1
+      LG2=MATG(K)-LG1
+      LC2=MATC(K)-LC1
+C   ANY DIFFERENCE IN LENGTH? IF SO WE HAVE TO PAD SO THEY BECOME THE SAME
+      L5=ABS(LG2-LC2)
+C   COUNT PADDING CHARS IN CONTIG
+      IF(LG2.GT.LC2)ITOTPC=ITOTPC+L5
+C   IF DIFFERENCE INCREMENT SHORTER
+      IF(LG2.GT.LC2)IS1=IS1+L5
+C   IF GEL NEEDS PADDING TRY TO PUT PADS NEXT TO DOUBLE CODES
+      IF(LC2.GT.LG2)CALL PADCOP(SEQG,SEQG2,
+     +LG1,MATG(K),L5,IS2,LG2,MAXGEL,IFAIL,KBOUT,SEQC,LC1)
+C   CHECK FOR OVERFLOW
+      IF(IFAIL.EQ.1)GO TO 700
+C   NOW COPY MISSMATCHED REGION
+C   CHECK FOR OVERFLOW
+      IF(IS2+LG2-1.GT.MAXGEL)GO TO 700
+      IF(LG2.GT.0)CALL SQCOPY(SEQG(LG1),SEQG2(IS2),LG2)
+C   CHECK FOR OVERFLOW
+      IF(IS1+LC2-1.GT.MAXGEL)GO TO 700
+      IF(LC2.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),LC2)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+LC2
+      IS2=IS2+LG2
+C   GET NEXT MATCH
+      GO TO 50
+500   CONTINUE
+C
+C   FINISH RIGHT ENDS
+C   ONLY COPY TO END OF GEL IN GEL AND TO THE SAME RELATIVE POSITION
+C   IN THE CONTIG FOR DISPLAY PURPOSES AND FOR COUNTING MISMATCH
+C   CURRENT ENDS AT LG1,LC1
+C   HOW FAR TO END OF GEL?
+C   SET M
+      M=0
+      L=IDG-LG1+1
+      IF(L.LT.1)GO TO 600
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(LG1),SEQG2(IS2),L)
+C   NEED TO COPY TO END OF GEL IN CONTIG FOR DISPLAY
+C   POINT TO POSN IN CONTIG LEVEL WITH END OF GEL
+      M=LC1+L-1
+C   IS THIS OVER END OF CONTIG?
+      IF(M.GT.IDC)M=IDC
+C   NUMBER TO COPY
+      M=M-LC1+1
+C   CHECK FOR OVERFLOW
+      IF(IS1+M-1.GT.MAXGEL)GO TO 700
+      IF(M.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),M)
+600   CONTINUE
+C   COUNT PADDING IN GEL
+      ITOTPG=IS2+L-1-IDG
+C   SET NEW LENGTHS FOR RETURN TO CALLING ROUTINE
+      IDOUT=IS1+M-1
+      IDG=IS2+L-1
+      IFAIL=0
+      RETURN
+700   CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Matching region too long for routine lineup,',
+     +' alignment aborted')
+      IFAIL=1
+      RETURN
+      END
+      SUBROUTINE LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,
+     +RREG,IDEV,IDEVN,NAMARC)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER RREG
+      CHARACTER NAMARC*(*)
+      N = LLINO
+      WRITE(IDEV,1001)
+10    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).NE.0) THEN
+        N = RNBR(N)
+        IF(RELPG(N).LE.RREG) GO TO 10
+      END IF
+1001  FORMAT(' NAME      NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                     LEFT   RIGHT')
+1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+      END
+C12345678901234567890
+C                          710       720       730       740       750
+C   -1  HINW.004    CGTCAGACGCACGCTGGAAAA
+      INTEGER FUNCTION LTYPE(LINE,LL,J1,J2,N,MAXDB,KBOUT)
+      CHARACTER LINE*(*),NUM*5,SPACE
+      EXTERNAL NOTRL,NOTLR
+      PARAMETER (SPACE= ' ')
+      J1 = NOTLR(LINE,LL,SPACE)
+      IF(J1.EQ.0) THEN
+C       BLANK LINE
+        LTYPE = 1
+        RETURN
+      END IF
+      IF(J1.GT.20) THEN
+C       LINE OF NUMBERS
+        LTYPE = 2
+        RETURN
+      END IF
+      IF(J1.GT.5) THEN
+C       CONSENSUS LINE
+        LTYPE = 3
+        RETURN
+      END IF
+C       SHOULD BE A SEQUENCE LINE
+      J = INDEX(LINE(J1:),SPACE)
+      NUM = SPACE
+      NUM = LINE(J1:J1+J-2)
+      CALL RJST(NUM)
+C      N = IFROMS(NUM,5,KBOUT)
+      READ(NUM,'(I5)',ERR=10) N
+      IF(N.GT.MAXDB-2) GO TO 10
+C     NUMBER ENDS AT J1+J-2
+      J1 = J1 + J - 1
+C     LOOK FOR BEGINNING OF NAME
+      J = NOTLR(LINE(J1:),LL-J1+1,SPACE)
+      N1 = J1 + J - 1
+C     LOOK FOR END OF NAME
+      J = INDEX(LINE(N1:),SPACE)
+      N2 = N1 + J - 2
+C     LOOK FOR BEGINNING OF SEQ
+      J = NOTLR(LINE(N2+1:),LL-N2,SPACE)
+      J1 = N2 + J
+      LTYPE = 4
+C     LOOK FOR END OF SEQ
+      J2 = NOTRL(LINE,LL,SPACE)
+      IF(J2.GT.N2) RETURN
+10    CONTINUE
+      LTYPE = 0
+      END
+      INTEGER FUNCTION LWRAPS(I,J)
+      K = MOD(I,J)
+      IF(K.EQ.0) K = J
+      LWRAPS = K
+      END
+C     MERGE
+C
+C   ROUTINE SENT CONTIG WHOSE GELS MAY BE OUT OF ORDER
+C   REORDERS GELS ON POSITION OF LEFT ENDS AND SETS LEFT
+C   GEL NUMBER FOR THE REORDERED CONTIG
+C
+      SUBROUTINE MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C   START AT LEFT END
+      N=LNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT IN LIST
+      N=NR
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT
+      NR=RNBR(N)
+      IF(NR.EQ.0)GO TO 30
+C   HAVENT REACHED END YET
+      I1=0
+23    CONTINUE
+C   ARE THESE 2 IN CORRECT ORDER IE N<=NR ?
+      IF(RELPG(N).LE.RELPG(NR))GO TO 21
+C   NOT IN ORDER SO CHAIN LEFT UNTIL CORRECTLY POSITIONED
+C   THEN COME BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE SAVE POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C   EXCHANGE NEIGHBOURS
+      M=RNBR(NR)
+      IF(M.NE.0)LNBR(M)=N
+      M=LNBR(N)
+      IF(M.NE.0)RNBR(M)=NR
+      RNBR(N)=RNBR(NR)
+      RNBR(NR)=N
+      LNBR(NR)=LNBR(N)
+      LNBR(N)=NR
+C   CHAIN BACK THRU LIST
+      N=LNBR(NR)
+      IF(N.EQ.0)GO TO 21
+C   END NOT REACHED
+      GO TO 23
+30    CONTINUE
+C  ALL DONE POINTER AT RIGHT GEL
+      RNBR(LINCON)=N
+      RETURN
+      END
+      SUBROUTINE MINCOM(SEQ1,IDIM1,SEQ2,IDIM2,SAV1,SAV2,SAV3,
+     +IP,MINM,KBOUT)
+C   AUTHOR: RODGER STADEN
+C
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER SAV1(IP),SAV2(IP),SAV3(IP)
+C
+      IP1=IP
+      IP=0
+C
+C   SITUATION 1
+      NT1=IDIM2-MINM
+      IES1=MINM-1
+      ISS2=NT1+1
+C
+      DO 100 I=1,NT1
+C
+C   POINT TO FIRST CHAR-1 OF SEQ2
+      ISS2=ISS2-1
+C   POINT TO LAST CHAR SEQ1
+      IES1=IES1+1
+C
+      N=0
+C
+      DO 200 J=1,IES1
+C   STORE POINTER
+      JJ=J
+C
+C   POINT TO SEQ2
+      K=ISS2+J
+C   TEST FOR EQUALITY
+      IF(SEQ1(J).NE.SEQ2(K))GO TO 220
+C   INCREMENT N
+      N=N+1
+      GO TO 200
+220   CONTINUE
+C   TEST FOR SUFFICENTLY LARGE N
+      IF(N.GE.MINM)CALL SAVIT(N,J,K,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C   RESET N TO ZERO
+      N=0
+200   CONTINUE
+C
+C   GOOD SCORE AT END?
+C   NEED TO INCREMENT POINTERS AS SAVIT EXPECTS TO BE POINTING AT NEXT
+C   MISMATCH
+      JJ=JJ+1
+      KK=K+1
+      IF(N.GE.MINM)CALL SAVIT(N,JJ,KK,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C
+100   CONTINUE
+C
+C
+C   SITUATION 2
+      NT2=IDIM1-IDIM2+1
+C
+      DO 300 I=1,NT2
+      N=0
+C
+      DO 400 J=1,IDIM2
+C   SAVE POINTER
+      JJ=J
+C
+C   SET POINTER TO SEQ1
+      L=I+J-1
+      IF(SEQ1(L).NE.SEQ2(J))GO TO 420
+      N=N+1
+      GO TO 400
+420   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+400   CONTINUE
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+300   CONTINUE
+C
+C
+C   SITUATION 3
+      ISS1=IDIM1-IDIM2
+C
+      DO 500 I=1,NT1
+C
+C   POINT TO FIRST CHAR SEQ1
+      K=ISS1+I
+      IES2=IDIM2-I
+      N=0
+C
+      DO 600 J=1,IES2
+C   SAVE POINTER
+      JJ=J
+C
+C   POINT TO SEQ1
+      L=K+J
+      IF(SEQ1(L).NE.SEQ2(J))GO TO  620
+      N=N+1
+      GO TO 600
+620   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C    TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+600   CONTINUE
+C
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+500   CONTINUE
+C
+      RETURN
+5000  CONTINUE
+C   OVERFLOW
+C
+      WRITE(KBOUT,1000)IP1
+1000  FORMAT(/' TOO MANY MATCHES. LIMIT = ',I6)
+      RETURN
+      END
+      SUBROUTINE ML(PC,PG,L,N,J)
+      INTEGER PC(N),PG(N),L(N)
+      DO 10 I = J,N-1
+        PC(I) = PC(I+1)
+        PG(I) = PG(I+1)
+        L(I) = L(I+1)
+10    CONTINUE
+      END
+      SUBROUTINE MSTLKL(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU,INDEXS
+      DO 100 I=1,IDIM
+        J = INDEXS(SEQ(I),K)
+        SEQ(I) = CHARSU(J)
+100   CONTINUE
+      END
+      CHARACTER FUNCTION MUNOTP(IP)
+C     AUTHOR RODGER STADEN
+      CHARACTER PUP*26
+      SAVE PUP
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      MUNOTP = '-'
+      IF((IP.GT.0).AND.(IP.LT.23))MUNOTP = PUP(IP:IP)
+      END
+      INTEGER FUNCTION NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +STRAND,RREG)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+      NCDEP = 0
+      N = 0
+      I = IGEL
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) N = N + 1
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      NCDEP = N
+      END
+      SUBROUTINE NEWDB(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,KBOUT,
+     +IDEVF,IDEVR,IDEVW,IDEVN,MAXDB,WGEL,MAXGEL,LINCON,KGEL,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER LINEIN*80,GEL*62,WGEL(MAXGEL),GELNAM*10
+      EQUIVALENCE (LINEIN(18:18),GEL)
+      EQUIVALENCE (LINEIN(8:8),GELNAM)
+      EXTERNAL LTYPE
+      LINLEN = 80
+      LINENO=0
+C     SET LENGTH OF CONTIG
+      LENCON=0
+C     SET POSITION OF RIGHTMOST CHAR OF LAST BLOCK
+      ILASTR=0
+C     SET NUMBER OF LAST GEL TO START SO WE CAN IDENTIFY RIGHT AND LEFT
+C     NEIGHBOURS
+      LASTS=0
+C     NEED TO WHERE THE LEFT OF EACH BLOCK IS. ASSUME THE FIRST GEL IN A BLOCK
+C     IS LEVEL WITH THE LEFT EDGE FOR THAT BLOCK (IT SHOULD BE). SO NEED A FLAG
+C     (IGF)=0 IF FIRST =1 IF NOT, AND A POSITION (ILEFT) TO MARK THE LEFT EDGE
+C     NEED TO SET LEFT EDGE TO 1 SO LENGTH CALCULATION WORKS FOR FIRST BLOCK
+      ILEFT=1
+10    CONTINUE
+C     NEED TO ADD LENGTH OF LONGEST LINE IN LAST BLOCK TO CONSENSUS LENGTH
+      LENCON=LENCON+ILASTR-ILEFT+1
+C     RESET POSITION OF RIGHTMOST CHAR OF LAST BLOCK
+      ILASTR=0
+C     READ LINE OF NOS
+      READ(IDEVF,1003,END=100)LINEIN
+1003  FORMAT(A)
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.NE.2) GO TO 400
+C     SET FLAG FOR NEW BLOCK
+      IGF=0
+20    CONTINUE
+C
+C     READ A LINE, COULD BE A GEL OR A CONSENSUS
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.EQ.0) GO TO 400
+C     WHAT SORT OF LINE?
+      IF(LT.EQ.1) GO TO 400
+      IF(LT.EQ.4) THEN
+        IFIRST = J1 - 17
+        ILAST = J2 - 17
+        NPOS=ABS(N)
+C       GEL LINE SO SAVE
+C       NEED TO FIND THE RIGHTMOST POSITION FOR EACH BLOCK
+        IF(ILAST.GT.ILASTR)ILASTR=ILAST
+C       IF FIRST GEL THIS BLOCK SAVE POSITION OF LEFT EDGE
+        IF(IGF.EQ.0)ILEFT=IFIRST
+C       SET FLAG FOR NOT FIRST GEL THIS BLOCK
+        IGF=1
+C       START OF GEL?
+        IF(LNGTHG(NPOS).EQ.0)THEN
+C         YES SO SET RELPG
+          RELPG(NPOS)=LENCON+IFIRST-ILEFT+1
+C          LNBR(NPOS)=LASTS
+C         SET RIGHT NEIGHBOUR OF LAST GEL TO START
+C          IF(LASTS.NE.0)RNBR(LASTS)=NPOS
+          LASTS=NPOS
+          CALL WRITEN(IDEVN,NPOS,GELNAM)
+        END IF
+C       COPY NEW SEQUENCE TO DISK
+        CALL READW(IDEVW,NPOS,WGEL,MAXGEL)
+        K1=ABS(LNGTHG(NPOS))+1
+        K2=K1+ILAST-IFIRST
+        K11=IFIRST-1
+C        WRITE(*,*)K1,K2,K11
+        DO 25 I=K1,K2
+          K11=K11+1
+          WGEL(I)=GEL(K11:K11)
+25      CONTINUE
+        CALL WRITEW(IDEVW,NPOS,WGEL,MAXGEL)
+C       UPDATE LENGTH
+        LNGTHG(NPOS)=SIGN(K2,N)
+C       GO BACK FOR NEXT LINE (CONSENSUS OR NEW GEL)
+        GO TO 20
+      END IF
+C     SHOULD BE CONSENSUS, CHECK
+      IF(LT.NE.3) GO TO 400
+C     READ WHAT SHOULD BE A BLANK LINE
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,IFIRST,ILAST,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.NE.1) GO TO 400
+      GO TO 10
+100   CONTINUE
+C     CHECK FOR MISSING BLANK AT END OF FILE
+      IF(LT.NE.1) LENCON=LENCON+ILASTR-ILEFT+1
+      LNBR(LINCON) = KGEL
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C     ON INPUT LNBR(LINCON) MAY BE WRONG, BUT ON EXIT RNBR IS CORRECT
+      I = RNBR(LINCON)
+300   CONTINUE
+C      WRITE(*,*)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+      CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+      J = I
+      I = LNBR(I)
+      IF(I.NE.0) GO TO 300
+C     WRITE CONTIG LINE
+      I = LINCON
+      RELPG(I) = LENCON
+      LNGTHG(I) = 0
+      LNBR(I) = J
+C      WRITE(*,*)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+      CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished, no errors found')
+C
+      IOK = 0
+      RETURN
+400   CONTINUE
+C   ERROR DETECTED, REPORT LINE NUMBER AND STOP
+      WRITE(KBOUT,5002)LINENO
+5002  FORMAT(' Error in line ',I6,' of file. Processing stopped',
+     +/,' but your database may be corrupted. Revert to a copy')
+      IOK = 1
+      RETURN
+      END
+      SUBROUTINE PADCOP(SEQG,SEQG2,LG1,MG,L5,IS2,LG2,MAXGEL,IFAIL,
+     +KBOUT,SEQC,IC1)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (NDUBL = 4)
+      CHARACTER SEQG(MAXGEL),SEQG2(MAXGEL),DUBBL(NDUBL),SEQC(MAXGEL)
+      SAVE DUBBL
+      DATA DUBBL/'D','B','V','H'/
+      JC1 = IC1
+C Make seqg2 from seqg placing L5 padding chars before position MG
+C which is the start of the next block of identity. Try to put the
+C padding either in line with consensus pads, or next to double
+C codes. The positions in seqg are LG1 to MG-1. seqg2 needs to be long
+C enough to be extended from IS2 to IS2 + L5 -1 + MGM1-LG1 +1
+C ie we add L5 pads, plus the chars between and including  LG1 and MGM1
+      IDONE=0
+C   POINT TO END OF MISMATCH
+      MGM1=MG-1
+C   MAY BE NO CHARS TO COPY
+      IF(MGM1.LT.LG1)GO TO 111
+C  Next check added 26-2-91
+      MAXREQ = IS2 + L5 - 1 + MGM1 - LG1 + 1
+      IF((MGM1.GT.MAXGEL).OR.(MAXREQ.GT.MAXGEL)) THEN
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Matching region too large for routine padcop,',
+     +  ' alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 110 J=LG1,MGM1
+        IF(IDONE.LT.L5) THEN
+          IF((JC1.GT.0).AND.(JC1.LT.MAXGEL)) THEN
+          IF(SEQC(JC1).EQ.'*') THEN
+            IS2 = IS2 + 1
+            JC1 = JC1 + 1
+            IDONE = IDONE + 1
+            GO TO 109
+          END IF
+          END IF
+          DO 108 M=1,NDUBL
+            IF(SEQG(J).EQ.DUBBL(M)) THEN
+              IS2 = IS2 + 1
+              JC1 = JC1 + 1
+              IDONE = IDONE + 1
+              GO TO 109
+            END IF
+108       CONTINUE
+109       CONTINUE
+        END IF
+        SEQG2(IS2) = SEQG(J)
+        IS2 = IS2 + 1
+        JC1 = JC1 + 1
+110   CONTINUE
+111   CONTINUE
+C   ALL CHARS COPIED. ENOUGH PADDING?
+      IF(IDONE.LT.L5)IS2=IS2+L5-IDONE
+C   IS2 SHOULD NOW BE POINTING AT NEXT CHAR
+C   ZERO LG2 TO SHOW CALLING ROUTINE COPYING DONE
+      LG2=0
+      IFAIL=0
+      END
+      SUBROUTINE PADRUN(GEL,IFIRST,ILAST,RUN)
+      CHARACTER PAD,GEL*50,RUN*50
+      SAVE PAD
+      DATA PAD/'*'/
+C  FIND PADS IN THIS GEL
+      DO 20 I = IFIRST,ILAST
+        IF(GEL(I:I).EQ.PAD) THEN
+C  IS IT AT LEAST THE SECOND IN A RUN
+          IF(RUN(I:I).EQ.PAD) THEN
+C  FIND THE LEFT END OF THE RUN
+            JJ = I
+1           CONTINUE
+            IF(JJ.GT.1)THEN
+              JJ = JJ - 1
+              IF(RUN(JJ:JJ).EQ.PAD)GO TO 1
+            END IF
+C  FOUND LEFT END OF RUN AT JJ, SO MOVE PAD IF NOT ALREADY A PAD
+5           CONTINUE
+            IF(GEL(JJ:JJ).NE.PAD)THEN
+C  EXCHANGE ELEMENTS
+              GEL(I:I) = GEL(JJ:JJ)
+              GEL(JJ:JJ) = PAD
+            ELSE
+C  ALREADY A PAD, SO TRY NEXT TO RIGHT
+              JJ = JJ + 1
+              IF(JJ.LT.I) GO TO 5
+            END IF
+          END IF
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE PCON1(CHAR,CHRSUM)
+C  AUTHOR RODGER STADEN
+C  PART OF PROTEIN 'CONSENSUS' CALCULATION
+      CHARACTER CHAR
+      INTEGER CHRSUM
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      K = CTONUM(CHAR)
+      IF(K.NE.26)THEN
+        IF(CHRSUM.EQ.0)THEN
+          CHRSUM = K
+        ELSE
+          IF(K.NE.CHRSUM)CHRSUM = -1
+        END IF
+      END IF
+      END
+      SUBROUTINE PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +NCONTS,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      YMIN = 0.
+      YMAX = ISYMAX
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        CALL LINE(XF,XT,Y,Y,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        IF((IGEL.NE.0).AND.(I.EQ.LINCON)) THEN
+          XZ = XF + RELPG(IGEL) + ABS(LNGTHG(IGEL))/2
+          CALL LINE(XZ,XZ,YMAX,YMIN,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      I = LGEL
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            CALL LINE(XF,XT,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      INTEGER DEPTHP,DEPTHM
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+C  have window size margt starting at margb
+C  depths depthp, depthm
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = LREG
+      XMAX = RREG
+      RINC = YMAX / (DEPTHP + DEPTHM + 2)
+      RMID =(DEPTHM+1) * RINC
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX,RMID,RMID,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMAX,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      STRAND = 1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHP)
+      STRAND = -1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHM)
+      CALL VT100M
+      END
+      SUBROUTINE PLTQ(SEQ,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CHARACTER SEQ(IDIM2),T
+      PARAMETER (Y0  = 0.,
+     +           YP1 = 1.,
+     +           YP2 = 2.,
+     +           YM1 = -1.,
+     +           YM2 = -2.)
+      XMIN = 0.
+      XMAX = IDIM2
+      YMIN = YM2
+      YMAX = YP2
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XIN,XMAX,Y0,Y0,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      I = 1
+10    CONTINUE
+      XF = I
+      T = SEQ(I)
+20    CONTINUE
+      IF(SEQ(I).NE.T) THEN
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I - 1
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+      I = I + 1
+      IF(I.LT.IDIM2) GO TO 20
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      END
+      SUBROUTINE POSTC(GELNOS,GELSTR,GELEND,LINEIN,IERR,
+     +KBOUT,IDEVF,MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,KGEL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER LINEIN*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2),IGEL(2)
+      INTEGER LNBR(MAXDB),RNBR(MAXDB),RELPG(MAXDB),LNGTHG(MAXDB)
+      EXTERNAL LTYPE
+C   GELNOS === GEL NUMBERS (UNIQUE)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C   IUNIQ IS NUMBER OF GELS FOUND
+C
+C  PROBLEM IS TO FIND IF THERE ARE LINES OF WORKING VERSIONS MISSING
+C  WE PROCESS THE DATA IN PAIRS OF BLOCKS OR STRIPS (A BLOCK OR STRIP IS ONE 50
+C  CHARACTER WIDTH OF THE CONTIG). IF WE FIND ALL THE GELS THAT APPEAR
+C  IN STRIP B THEN IF THEY WEREN'T PRESENT IN STRIP A THEY MUST START IN
+C  STRIP B, IF THEY DONT APPEAR IN STRIP C THEY END IN STRIP B.
+C  STRATEGY IS TO READ THRU FILE AND REMEMBER WHICH GELS APPEAR IN EACH
+C  STRIP. BY COMPARING THEM WITH THE LAST STRIP WE CAN KNOW WHICH HAVE ENDED
+C  AND WHICH HAVE STARTED. HENCE WE STORE A LIST OF GELS AND THEIR START AND
+C  END LINE NUMBERS. USING THIS WE CAN SEE IF THE LIST IS UNIQUE
+C  SUB CLIST IS SENT THE CURRENT TWO LISTS OF GEL NUMBERS AND FINDS WHICH HAVE
+C  STARTED AND ENDED AND BUILDS UP A LIST OF UNIQUE GELS AND THEIR START
+C  AND END LINE NUMBERS
+C
+      LINLEN = 80
+      IERR=0
+      JSTRIP=1
+      LINENO=0
+      ISTRIP=1
+      IUNIQ=0
+10    CONTINUE
+C     READ WHAT SHOULD BE A LINE OF NOS
+      READ(IDEVF,1003,END=100)LINEIN
+1003  FORMAT(A)
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C     IS IT A LINE OF NOS?
+      IF(LT.NE.2) GO TO 400
+C     ZERO GEL COUNT FOR THIS STRIP
+      IGEL(ISTRIP)=0
+20    CONTINUE
+C     READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+      IF(LT.EQ.0) GO TO 400
+      IF(LT.EQ.1) GO TO 400
+      IF(LT.EQ.4) THEN
+C       GEL LINE SO SAVE
+C       INCREMENT COUNT TO NUMBER OF GEL
+        IGEL(ISTRIP)=IGEL(ISTRIP)+1
+        GELNO(IGEL(ISTRIP),ISTRIP)=N
+        LINNO(IGEL(ISTRIP),ISTRIP)=LINENO
+C       BO BACK FOR NEXT LINE (CONSENSUS OR NEW GEL)
+        GO TO 20
+      END IF
+C     SHOULD THEN BE A CONSENSUS
+      IF(LT.NE.3) GO TO 400
+C     MUST BE CONSENSUS SO PROCESS THIS STRIP
+C     PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+C     WHICH GELS ARE NEW? AND WHICH HAVE ENDED
+C     IF FIRST EVER STRIP DONT DO THIS
+      IF(JSTRIP.EQ.1)THEN
+        DO 6 I=1,IGEL(1)
+          GELNOS(I)=GELNO(I,1)
+          GELSTR(I)=LINNO(I,1)
+6       CONTINUE
+        JSTRIP=JSTRIP+1
+        ISTRIP=2
+        IUNIQ=IGEL(1)
+C       READ WHAT SHOULD BE A BLANK LINE
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+C     ORER OF ARGUMENTS DEPENDS ON ISTRIP
+      IF(ISTRIP.EQ.2)THEN
+        CALL CLIST(GELNO(1,1),LINNO(1,1),IGEL(1),
+     +  GELNO(1,2),LINNO(1,2),IGEL(2),
+     +  GELNOS,GELSTR,GELEND,MAXDB,IUNIQ,KBOUT,IERR)
+        IF(IERR.NE.0) GO TO 400
+        ISTRIP=1
+C       READ WHAT SHOULD BE A BLANK LINE
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1	
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+      IF(ISTRIP.EQ.1)THEN
+        CALL CLIST(GELNO(1,2),LINNO(1,2),IGEL(2),
+     +  GELNO(1,1),LINNO(1,1),IGEL(1),
+     +  GELNOS,GELSTR,GELEND,MAXDB,IUNIQ,KBOUT,IERR)
+        IF(IERR.NE.0) GO TO 400
+        ISTRIP=2
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1	
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+100   CONTINUE
+C     MAY BE SOME LEFT IN CURRENT STRIP THAT HAVE NOT BEEN ENDED
+C     BUT ISTRIP WILL JUST HAVE BEEN CHANGED SO CHANGE IT BACK
+      IF(ISTRIP.EQ.1)THEN
+        ISTRIP=2
+        GO TO 101
+      END IF
+      ISTRIP=1
+101   CONTINUE
+      DO 110 I=1,IGEL(ISTRIP)
+        MATCH=INLIST(GELNOS,IUNIQ,GELNO(I,ISTRIP))
+        IF(MATCH.NE.0)THEN
+          GELEND(MATCH)=LINNO(I,ISTRIP)
+          GO TO 110
+        END IF
+C       ERROR
+        WRITE(KBOUT,1300)GELNO(I,ISTRIP)
+1300    FORMAT( ' Error: gel number ',I5,
+     +  ' expected but not found in list')
+        IERR = 1
+        GO TO 400
+110   CONTINUE
+C     AS A CHECK: ARE THE UNIQUE GELS UNIQUE?
+      DO 120 I=2,IUNIQ
+        KD=IUNIQ-I+1
+        IN=INLIST(GELNOS(I),KD,GELNOS(I-1))
+        IF(IN.EQ.0)GO TO 120
+        IERR=1
+        WRITE(KBOUT,1006)GELNOS(I-1),GELSTR(I-1),GELSTR(IN)
+1006    FORMAT(' Error: gel',I6,' has a line of data missing!'/
+     +  ' and appears to start on lines',I6,' and',I6)
+        GO TO 400
+120   CONTINUE
+      IF(IERR.EQ.0)WRITE(KBOUT,1005)
+1005  FORMAT(' Successfully checked line order and missing lines')
+C      WRITE(*,*)(GELNOS(K),K=1,IUNIQ)
+      KGEL1 = KGEL
+      KGEL = ABS(GELNOS(1))
+C     CHECK THAT THE LEFT GEL NUMBER IS UNCHANGED
+      IF(KGEL1.NE.KGEL) THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(' Left gel number of edited contig does not agree',/,
+     +  ' with original, processing aborted')
+        IERR = 1
+        RETURN
+      END IF
+      LNBR(KGEL) = 0
+      RELPG(KGEL) = 0
+      LNGTHG(KGEL) = 0
+      DO 200 I = 2,IUNIQ
+        J1 = ABS(GELNOS(I-1))
+        J2 = ABS(GELNOS(I))
+        RNBR(J1) = J2
+        LNBR(J2) = J1
+        RELPG(J2) = 0
+        LNGTHG(J2) = 0
+200   CONTINUE
+      RNBR(J2) = 0
+      IERR = 0
+      RETURN
+400   CONTINUE
+C   ERROR
+      WRITE(KBOUT,1007)LINENO
+1007  FORMAT(' Error in line',I6,', of file: lines out of order'/,
+     +' processing stopped')
+      IERR=1
+      RETURN
+      END
+      SUBROUTINE READN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      READ(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE READR(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      READ(IDEVR,REC=N)RELPG,LNGTHG,LNBR,RNBR
+      RETURN
+      END
+      SUBROUTINE READW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      READ(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE REMOVL(MATC,MATG,MATL,IP)
+C   AUTHOR: RODGER STADEN
+      INTEGER MATC(IP),MATG(IP),MATL(IP)
+C
+C   SET POINTER TO FIRST MATCH
+      NMTCH=0
+10    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   SORT MATCHES ON LENGTH
+      IPP=IP-NMTCH+1
+      CALL BUBBL3(MATL(NMTCH),MATG(NMTCH),MATC(NMTCH),IPP)
+C   LOOK FOR END OF POSITIVES
+      DO 20 I=NMTCH,IP
+      J=I
+20    IF(MATL(I).LT.1)GO TO 30
+      J=J+1
+30    CONTINUE
+      IP=J-1
+C   END OF POSITIVES AT IP
+      IF(NMTCH.GE.IP)RETURN
+      K1=MATC(NMTCH)
+      K2=K1+MATL(NMTCH)-1
+      K3=MATG(NMTCH)
+      K4=K3+MATL(NMTCH)-1
+C   POINT TO FIRST MATCH TO TEST
+      K6=NMTCH+1
+      DO 200 I=K6,IP
+C   DO CONSENSUS FIRST
+C   OVERLAP?
+      IF(MATC(I).GT.K2)GO TO 100
+      K5=MATC(I)+MATL(I)-1
+      IF(K5.LT.K1)GO TO 100
+C   DOES OVERLAP
+C   WHICH END
+      IF(K5.LE.K2)GO TO 80
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K2-MATC(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C  MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 100
+80    CONTINUE
+C   LENGTH
+      MATL(I)=K1-MATC(I)
+100   CONTINUE
+C   NOW LOOK FOR OVERLAPS WITH GEL
+C   OVERLAP?
+      IF(MATG(I).GT.K4)GO TO 200
+      K5=MATG(I)+MATL(I)-1
+      IF(K5.LT.K3)GO TO 200
+C   DOES OVERLAP
+C   WHICH END?
+      IF(K5.LE.K4)GO TO 180
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K4-MATG(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C   MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 200
+180   CONTINUE
+C   LENGTH
+      MATL(I)=K3-MATG(I)
+200   CONTINUE
+      GO TO 10
+      END
+C     SAVIT
+C
+      SUBROUTINE SAVIT(N,J,K,IP,S1,S2,S3,IP1)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1(IP1),S2(IP1),S3(IP1)
+C
+      IP=IP+1
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)RETURN
+      S1(IP)=N
+      S2(IP)=J-N
+      S3(IP)=K-N
+C
+      RETURN
+      END
+      SUBROUTINE SCRENR(GEL,MAXGEL,STRING,NAME,FILNAM,
+     +IDEV1,IDEV2,IDEV3,IDEV4,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER NAME*(*),FILNAM*(*),HELPF*(*)
+      CHARACTER GEL(MAXGEL),STRING(60)
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of sequences that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of recognition sequences',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+      JGEL = 0
+      IGEL = 0
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+31      CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 31
+        END IF
+      ELSE
+        READ(IDEV1,1002,END=100)NAME
+      END IF
+1002  FORMAT(A)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing', JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV4,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(KBOUT,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV4)
+2     CONTINUE
+      IF(IDIMG.LT.1)THEN
+        WRITE(KBOUT,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL MSTLKL(GEL,IDIMG)
+3     CONTINUE
+      READ(IDEV3,1005,END=6)STRING
+1005  FORMAT(60A1)
+C   FIND LENGTH OF STRING ASSUMING NO SPACES
+      DO 4 I=1,60
+        II=I
+        IF(STRING(I).EQ.' ')GO TO 5
+4     CONTINUE
+5     CONTINUE
+      II=II-1
+      IF(II.GT.0)CALL FIND(GEL,IDIMG,STRING,II,JMATCH)
+      IF(JMATCH.EQ.0)GO TO 3
+C   A MATCH
+      WRITE(IDEV,1007)JMATCH,(STRING(K),K=1,II)
+1007  FORMAT(' Match at',I6,' with ',60A1)
+      REWIND IDEV3
+      GO TO 1
+C   NO MATCH SO SAVE
+6     CONTINUE
+      WRITE(IDEV2,1002)NAME
+      IGEL = IGEL + 1
+      REWIND IDEV3
+      GO TO 1
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGEL,' passed'
+      RETURN
+      END
+      SUBROUTINE SCRENV(MAXGEL,
+     +WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ,MAXSEQ,GEL,GELCOP,MATCH,
+     +LENGTH,
+     +SAVPS,SAVPG,SAVL,MAXMAT,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM,NAME,IOK)
+      INTEGER POSNS(MAXSEQ),GELN(MAXGEL),WORDP(LPOWRC),SAVPS(MAXMAT)
+      INTEGER SAVPG(MAXMAT),SAVL(MAXMAT)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER FILNAM*(*),NAME*(*),HELPF*(*)
+      CHARACTER GELCOP(MAXGEL)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ(MAXSEQ),GEL(MAXGEL),MATCH(MAXGEL)
+      JGEL = 0
+      IGELS = 0
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of gel readings that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV4,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of vector sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      IDIM = MAXSEQ
+      CALL ARRFIM(IDEV4,SEQ,IDIM,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      MN = LENGTH*2
+      MX = 50
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   IS THE VECTOR SEQUENCE IN THE CORRECT FORMAT WITH A TITLE AT THE FRONT?
+      IF(IDCEND.EQ.0)THEN
+        CENDS(1) = -19
+        NENDS(1) = 1
+        CENDS(2) = IDIM + 1
+        IDCEND = 1
+      END IF
+C      WRITE(KBOUT,9999)
+C9999  FORMAT(' VECTOR SEQUENCE REQUIRES A TITLE EG ',
+C     1' <---M13MP7.001----->')
+C      RETURN
+C      END IF
+      CALL BUSY(KBOUT)
+      CALL ENCO(SEQ,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+C
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        READ(IDEV1,1002,END=100)NAME
+      END IF
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1002  FORMAT(A)
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV3,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Gel reading file not found'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV3,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV3)
+C  LONG ENOUGH ?
+      IF(IDIMG.LT.MINMAT)THEN
+        WRITE(IDEV,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+      IMATCH=0
+2     CONTINUE
+      CALL BUSY(KBOUT)
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      WRITE(IDEV,1009)ISTRAN
+1009  FORMAT(' Searching strand',I6)
+      IDSAV=MAXMAT
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ,GELCOP,MINMAT,IFAIL,KBOUT)
+      IF(IDSAV.GT.0) THEN
+        IMATCH=1
+        CALL DISMAT(SEQ,IDIM,GELCOP,IDIMG,SAVPS,SAVPG,IDSAV,
+     +  CENDS,NENDS,IDCEND,MAXCON,IDEV,MATCH)
+      END IF
+      IF(ISTRAN.EQ.1) THEN
+        CALL SQREV(GELCOP,IDIMG)
+        CALL SQCOM(GELCOP,IDIMG)
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        ISTRAN = 2
+        GO TO 2
+      END IF
+      IF(IMATCH.EQ.0) THEN
+        WRITE(IDEV2,1010)NAME
+        IGELS = IGELS + 1
+      END IF
+      GO TO 1
+1010  FORMAT(A)
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGELS,' passed'
+      RETURN
+      END
+      SUBROUTINE SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IGN,NCONT,DIST)
+C  AUTHOR: RODGER STADEN
+C  SHIFTS PART OF A CONTIG FORM GEL IGN TO RIGHT END
+C  CONTIG LINE NUMBER IF NCONT
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER DIST
+      I = IGN
+10    CONTINUE
+      IF(I.NE.0)THEN
+        RELPG(I) = RELPG(I) + DIST
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        I = RNBR(I)
+        GO TO 10
+      END IF
+C  UPDATE CONTIG LENGTH
+      RELPG(NCONT) = RELPG(NCONT) + DIST
+      CALL WRITER(IDEVR,NCONT,RELPG(NCONT),LNGTHG(NCONT),
+     +LNBR(NCONT),RNBR(NCONT))
+      END
+      SUBROUTINE SLIDER(SEQ1,IDC,SEQ2,IDIM2,MS1,MS2,MAXPG,MAXPC,MINSLI,
+     +MATL,MATC,MATG,IP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2)
+      INTEGER MATL(IP),MATC(IP),MATG(IP),P1S,P1,P2
+      IP1 = IP
+      IP = 0
+C   LEFT END S2 RELATIVE S1 - MAX PADS -2 READY FOR LOOP
+      P1S = MS1 - MS2 - MAXPC - 1
+C   TRY NSLIDE START POSNS FOR SEQ2
+      DO 100 I=1,MAXPG+MAXPC+1
+C       POINT TO SEQ1 START
+        P1S = P1S + 1
+C       POINT TO CURRENT SEQ1 POSN
+        P1 = P1S
+        N = 0
+C       COMPARE WHOLE LENGTH OF SEQ2 (IF P1 WITHIN RANGE)
+        DO 50 J=1,IDIM2
+          P2 = J
+          P1 = P1 + 1
+          IF(P1.LT.1)GO TO 50
+C         OFF RIGHT END? IF SO MAY HAVE BEEN A MATCH
+          IF(P1.GT.IDC)GO TO 40
+          IF(SEQ1(P1).EQ.SEQ2(P2))GO TO 45
+40        CONTINUE
+          IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+          N = 0
+          GO TO 50
+45        CONTINUE
+          N = N + 1
+50      CONTINUE
+C       GOOD SCORE AT END? NEED TO INCREMENT POINTERS FOR SAVIT
+        P1 = P1 + 1
+        P2 = P2 + 1
+        IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+100   CONTINUE
+      END
+      SUBROUTINE SUBS(SEQ,IDIMS,FROM,TO)
+      CHARACTER SEQ(IDIMS),FROM,TO
+C   AUTHOR RODGER STADEN
+      DO 10 I = 1,IDIMS
+        IF(SEQ(I).EQ.FROM) SEQ(I) = TO
+10    CONTINUE
+      END
+      SUBROUTINE SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C 28-7-91 added extra parameter: mxgood is the maximum reading
+C length for which we have confidence, so only the parts of
+C reads 1 to mxgood will be included in the calculation
+C   SET INITIAL VALUES
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        IREND = RELPG(GELC) - LNGTHG(GELC) + 1
+        IF (MXGOOD.LT.MAXGEL) THEN
+          LSEQNO = IREND - MXGOOD + 1
+        ELSE
+          LSEQNO = RELPG(GELC)
+        END IF
+        LSEQNO = MAX(LSEQNO,LREG)
+        IS = LSEQNO - RELPG(GELC) + 1
+        N  = ABS(LNGTHG(GELC))
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   COMPARE STRANDS
+C
+      DO 500 I=1,LINLEN
+C        WRITE(*,*)I,LINOU1(I),LINOU2(I)
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          IF(LINOU1(I).EQ.'-') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF(LINOU1(I).EQ.'*') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          SEQ1(POSN1) = '0'
+        ELSE
+          IF((LINOU1(I).EQ.'*').AND.(LINOU2(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU2(I).EQ.'*').AND.(LINOU1(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*')) THEN
+            SEQ1(POSN1) = '1'
+            IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) 
+     +      SEQ1(POSN1) = '4'
+              GO TO 500
+          END IF
+          IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) THEN
+            SEQ1(POSN1) = '2'
+            IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*'))
+     +      SEQ1(POSN1) = '4'
+            GO TO 500
+          END IF
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+C      SUMMER
+C
+C   SUBROUTINE TO PRODUCE A CONSENSUS FROM LINED UP GEL READINGS
+      SUBROUTINE SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1,IDIM1,GEL,LREG,RREG,IGELC,IDBSIZ,CHARS,CHRSIZ,MAXGL2,
+     +IDEVW,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,LSEQNO,POSN,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER GELC
+      CHARACTER SEQ1(IDIM1)
+      INTEGER CHARS(CHRSIZ,MAXGL2)
+      CHARACTER MUNOTP,GTCONC
+      EXTERNAL MUNOTP,INDEXS,GTCONC,LWRAPS
+C
+C   SET INITIAL VALUES
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      IPSEQ1=0
+C
+C   ZERO ARRAY
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,CHRSIZ
+      CHARS(J,I)=0
+40    CONTINUE
+50    CONTINUE
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C   LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=(LSEQNO-RELPG(GELC))+1
+****************************
+      IF(IDM.EQ.26)THEN
+        DO 51 I = IS,N
+          JJJ=(MOD(LSEQNO,MAXGL2))
+          IF(JJJ.EQ.0)JJJ=MAXGL2
+          CALL PCON1(GEL(I),CHARS(1,JJJ))
+          LSEQNO = LSEQNO + 1
+51      CONTINUE
+      ELSE
+****************************
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,JJJ) = CHARS(JJ,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      END IF
+C
+C   LOOK AT NEXT GEL TO RIGHT
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+C   RESET LSEQNO
+      LSEQNO=RELPG(GELC)
+C   IS THIS OVER END?
+      IF(LSEQNO.GT.RREG)GO TO 200
+C   ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C   SET FLAG TO SHOW END REACHED
+      IEND=1
+C   NEED TO SUM AND OUTPUT
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SUM NEXT SECTION OF CHARS
+      IF(IDM.EQ.26)THEN
+        DO 211 I = 1,LINLEN
+          IPSEQ1 = IPSEQ1 + 1
+          SEQ1(IPSEQ1) = '-'
+          JJJ = MOD(POSN,MAXGL2)
+          IF(JJJ.EQ.0)JJJ = MAXGL2
+          SEQ1(IPSEQ1) = MUNOTP(CHARS(1,JJJ))
+          CHARS(1,JJJ) = 0
+          POSN = POSN + 1
+211     CONTINUE
+      ELSE
+      DO 230 I=1,LINLEN
+        IPSEQ1=IPSEQ1+1
+        ISUM=0
+        JJJ = LWRAPS(POSN,MAXGL2)
+        SEQ1(IPSEQ1) = GTCONC(CHARS(1,JJJ),CHRSIZ,PERCD)
+        CALL FILLI(CHARS(1,JJJ),CHRSIZ,0)
+        POSN = POSN + 1
+230   CONTINUE
+      END IF
+C
+C
+C   ANY MORE TO OUTPUT?
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGLEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE TPCHEK(PC,PG,L,N)
+      INTEGER PC(N),PG(N),L(N)
+C     AUTHOR RODGER STADEN
+C     IF OVERLAPPING BLOCKS ARE FOUND REMOVE THE SHORTER ONE
+C     THEN REMOVE LARGE GAPS AT ENDS (THOSE AS LARGE AS THE END BLOCK)
+      K1 = 2
+1     CONTINUE
+      DO 10 I = K1,N
+        J1 = I
+        IF(PC(I).LE.PC(I-1)) GO TO 20
+        IF(PG(I).LE.PG(I-1)) GO TO 20
+10    CONTINUE
+C     REMOVE LARGE GAPS FROM ENDS
+C     THIS RULE OF THUMB COULD BE CHANGED TO USE A DIFFERENCE
+C     BETWEEN THE NUMBERS OF MISMATCHING CHARACTERS
+      IF(N.GT.1) THEN
+        K1 = PC(2) - PC(1) - L(1) 
+        J1 = PG(2) - PG(1) - L(1)
+        IF(MAX(K1,J1).GT.L(1)) THEN
+          CALL ML(PC,PG,L,N,1)
+          N = N - 1
+        END IF
+        IF(N.GT.1) THEN
+          K1 = PC(N) - PC(N-1) - L(N-1)
+          J1 = PG(N) - PG(N-1) - L(N-1)
+          IF(MAX(K1,J1).GT.L(N)) THEN
+            CALL ML(PC,PG,L,N,N)
+            N = N - 1
+          END IF
+        END IF
+      END IF
+      RETURN
+20    CONTINUE
+      IF(L(J1-1).GT.L(J1)) THEN
+        CALL ML(PC,PG,L,N,J1)
+      ELSE
+        CALL ML(PC,PG,L,N,J1-1)
+      END IF
+C  Until 25-11-90 next line was k1=j1 but this does not deal with all 
+C  cases: when a line is deleted we must compare it with the previous
+C  one before dealing with the rest, because it could be left of that
+C   one as well!
+      K1 = MAX(2,J1-1)
+      N = N - 1
+      GO TO 1
+      END
+      SUBROUTINE WRITEN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      WRITE(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE WRITER(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      WRITE(IDEVR,REC=N)RELPG,LNGTHG,LNBR,RNBR
+      RETURN
+      END
+      SUBROUTINE WRITEW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      WRITE(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NCONTS,LLINOI,LINCNI,LREG,RREG,
+     +WINDOW,GWIND,LENCON,DEPTHP,DEPTHM,
+     +MARGL,MARGR,MARGB,MARGT,MAXOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +KBOUT,GEL,GEL2,IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM,
+     +SEQ1,IDIM1,NGELS,TEMP3,CHRSIZ,MAXGL2,LINOU1,LINOU2,
+     +NOPT1,NOPT2,NOPT3,
+     +IHELPS,IHELPE,HELPF,IDEVH,MXGOOD)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),WINDOW,CHRSIZ,GWIND
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND,CHNRP1,HQN
+      INTEGER TEMP3(2,CHRSIZ,MAXGL2)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER TERM,TUPPER,NAMARC*10,HELPF*(*)
+      CHARACTER SEQ1(IDIM1),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      EXTERNAL NOPWIN,CWORLD,TUPPER,CHNRP1,HQN
+C  nopt1 = single contig
+C  nopt2 = all contigs
+C  nopt3 = scan
+10    CONTINUE
+      LLINO = LLINOI
+      LINCON = LINCNI
+      LOCLR = 0
+      LOCRR = 0
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL CLEARV
+      CALL XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CALL VT100M
+      INFLAG = HQN(TERM)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      NOPT = NOPWIN(IY,MARGB,MARGT,MAXOPT)
+      TERM = TUPPER(TERM)
+      IF(NOPT.EQ.0) RETURN
+      IF(NOPT.EQ.NOPT3) THEN
+        IF(TERM.EQ.'S') THEN
+          XMIN = LREG
+          XMAX = RREG
+          X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF((TERM.EQ.'N').OR.(TERM.EQ.'Z').OR.(TERM.EQ.'I')) GO TO 10
+      END IF
+      IF(NOPT.EQ.NOPT1) THEN
+        STRAND = 1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHP)
+        IF(DEPTHP.LT.0) RETURN
+        STRAND = -1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHM)
+        IF(DEPTHM.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        RINC = ISYMAX / (DEPTHP + DEPTHM + 2)
+        RMID =(DEPTHM+1) * RINC
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+        IF(TERM.EQ.'I') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(ICLOSE.EQ.1) GO TO 10
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+1006      FORMAT
+     +    ( ' Name ',A,' Number ',I4,' Rel. Posn. ',I7,' Length ',I5)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Z') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(IGEL.EQ.0) GO TO 10
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT2),MARGT(NOPT2),ISXMAX,ISYMAX)
+          LREG = MAX(1,RELPG(IGEL)-GWIND)
+          RREG = MIN(RELPG(LINCON),RELPG(IGEL)+GWIND)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +    LLINO,LREG)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT2) THEN
+        CALL IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  NCONTS,IX,IY,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +  ISXMAX,ISYMAX,DBTDUX,DBTDUY,
+     +  LINCON,IGEL,IS)
+        IF(IGEL.EQ.0) RETURN
+        IF(TERM.EQ.'Z') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'I') THEN
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION CLINNO(LNBR,IDBSIZ,NCONTS,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER LNBR(IDBSIZ)
+      CLINNO = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          CLINNO = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
diff --git a/src/staden/dbsyscommon.f~ b/src/staden/dbsyscommon.f~
new file mode 100644
index 0000000..42a162b
--- /dev/null
+++ b/src/staden/dbsyscommon.f~
@@ -0,0 +1,5835 @@
+C  DBSYS ROUTINES COMMON TO PRE AND POST .RD PROGRAMS
+C 8-4-92 altered gtconc, summer,summar,dsplay
+C 6-4-92 bug fix in autocn: i did not copy the correct elements
+C        for some cases where a better alignment was found: i was
+C        copying from jlefts instead of ilefts, jlc and jsense.
+C  Further sap routines are in dbsysold.f (pre .rd) and dbsysnew.f (post)
+C  the split was made by rs 23-1-91
+C  4-5-90 Change to getreg to allow escape
+C  4-5-90 addition of graphics routines and changes to menus
+C  9-5-90 added default gel reading: many changes
+C  17-5-90 Fixed 3 bugs in screen editing: 1) rightjustified names
+C          caused problems; 2) beginnings of sequences starting at far
+C          right of lines where not seen; 3) lines with no numbers at
+C          the end of a contig (i.e. with <10 chars) were flagged
+C          as errors. Changes to ltype for 1, linlen for 2, dsplay for 3
+C  9-7-90  removed menu routines
+C  20-8-90 changed gelid to add / to reading name because xsap did
+C          not return the INFLAG = 3 for the default
+C  23-8-90 Changes to dbauto and autocn to deal with failures better
+C          Plus addition of calls to BUSY
+C  9-11-90 Replaced call to radio with call to radion
+C  19-11-90 Changed max match length in dbauto to maxglm+1  (was 50)
+C  25-11-90 Very important bug fix in tpchek. Old versions could
+C           duplicate bits of working versions.
+C  28-11-90 Modified slider to receive maxpg and maxpc and to allow exactly
+C           the requested number of matches at each end of the two 
+C           sequences. 
+C           Added two new options to dbauto: all gels to new contigs, all 
+C           gels to contig 1; plus resurrected forbidding joins to allow
+C           sequences to be entered only into the contig the overlap best.
+C           Changed autocn to sort overlaps into order based on % mismatch
+C           (previously it saved the best two in any order)
+C           Minor change to dbstar
+C 3-1-91   Discovered bug in dbopen: incorrect call to getint when the
+C          database is very old and needs values for the current format
+C 21-1-91  GELID allowed illegal gel numbers to be returned! Fixed it.
+C 22-1-91  Modified autocn, adism4,adism3 to give more info about
+C          overlaps, and to allow 10 overlaps. Modified dbopen to
+C          return version number, ditto dbstar
+C 23-1-91  Split into dbsyscommon, dbsysold, dbsysnew
+C 26-2-91  Improved overflow check in padcop
+C 28-7-91 added extra parameter to quality calc: mxgood is the maximum
+C         reading length in which we have confidence, so only add this
+C         many chars from the start of each reading. Also changed the
+C         quality calc to make it the same as the consensus one. Made
+C         all characters have nonzero score and made lowercase = 100
+C 21-8-91 Changed arrfil to arrfim which does not display comments
+C 22-8-91 Added routine to find contig line number given left gel (CLINNO)
+C 8-11-91 Added fmt4lp which is used by "find internal joins" and would
+C         be useful elsewhere as a replacement for fmt4ln
+C 18-11-91 New routine GETLN2 with returns gel number specified
+C 27-2-92  Added use of ctonum to cfgel to deal with gel and vector
+C          being different case.
+C 2-3-92   set       FILNAM = ' ' for some calls to openf1
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+C      ABEDIN
+C
+C   ROUTINE TO EDIT THE DB USING A PADDED SEQ
+C   HAVE AN ARRAY SEQC2 LENGTH IDC OF PADDED SECTION OF CONTIG LINCON
+C  THE LEFT END OF THE PADDED CONTIG STARTS AT X
+C   THERE ARE ITOTPC PADS TO MAKE
+C
+      SUBROUTINE ABEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,POSN
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQC2(IDC),GEL(MAXGEL),P
+      SAVE P
+      DATA P/','/
+C
+C   POINT TO CONTIG
+      POSN=X-1
+C   POINT TO SEQC2
+      IAT=0
+C   COUNT PADS DONE
+      IDONE=0
+C   LOOP FOR ALL SEQC2
+      DO 100 J=1,IDC
+      POSN=POSN+1
+      IAT=IAT+1
+      IPAD=0
+C   IS THIS A PADDING CHAR?
+      IF(SEQC2(IAT).NE.P)GO TO 100
+50    CONTINUE
+C   COUNT PADS
+      IPAD=IPAD+1
+      IAT=IAT+1
+      IF(SEQC2(IAT).EQ.P)GO TO 50
+C   END OF THIS STRETCH OF PADS,DO INSERT
+C   HAVE IPAD INSERTS TO MAKE AT POSN
+      CALL PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,IPAD,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   MOVE POINTER TO CONTIG
+      POSN=POSN+IPAD
+C   COUNT PADS DONE
+      IDONE=IDONE+IPAD
+C   ANY MORE TO DO?
+      IF(IDONE.EQ.ITOTPC)GO TO 101
+100   CONTINUE
+C   ERROR SHOULD HAVE DONE ALL PADS
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Problem: some pads were not done!')
+101   CONTINUE
+      END
+C      ACONS
+      SUBROUTINE ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,ANS,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' CALCULATING A CONSENSUS')
+C
+C
+C
+      CALL BUSY(KBOUT)
+      IFAIL=0
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      DO 110 I=N,IDBSIZ-1
+      J=LNBR(I)
+      X=1
+      Y=RELPG(I)
+      ISTART=ISTART+1
+      IF((ISTART+19+Y).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+1009     FORMAT(
+     +' Database maximum consensus length(',I6,') exceeded',/,
+     +' calculation aborted')
+         IFAIL=1
+         RETURN
+      END IF
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +IDM,PERCD)
+      ISTART=ISTART+Y-1
+110   CONTINUE
+      RETURN
+C   SELECTED CONTIGS ONLY
+150   CONTINUE
+      ISTART=ISTART+1
+      IDIM11=RREG-LREG+1
+      IF((ISTART+19+IDIM11).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         IFAIL=1
+         RETURN
+      END IF
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+C
+C
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1(ISTART),IDIM11,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,
+     2IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM11-1
+C
+      RETURN
+      END
+      SUBROUTINE ADDTIT(SEQ1,NAMPRO,NGELS,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(20),NAMPRO*(*)
+      CHARACTER NUMS(10)
+      SAVE NUMS
+      DATA NUMS/'0','1','2','3','4','5','6','7','8','9'/
+      CALL FILLC(SEQ1(2),18,'-')
+      SEQ1(1) = '<'
+      SEQ1(20) = '>'
+      IEND = INDEX(NAMPRO,'.')
+      N=NGELS
+      K=IEND+3
+      DO 10 J=1,3
+        N=MOD(N,10)+1
+        NAMPRO(K:K)=NUMS(N)
+        N=NGELS/(10**J)
+        K=K-1
+10    CONTINUE
+      K = 18-IEND
+      K=K/2
+      DO 20 I=1,IEND+3
+        SEQ1(K)=NAMPRO(I:I)
+        K=K+1
+20    CONTINUE
+      IDIM1=IDIM1+20
+      RETURN
+      END
+      SUBROUTINE ADISM1(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,
+     +ISTRAN,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C  NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCCC
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C
+C   EDITED 07-02-83 TO ALLOW FOR CASE WHERE A GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL. USE PARM THAT CONTAINS
+C   THE POSITION OF THE LEFT END OF THE NEXT CONTIG. SET TO VERY HIGH
+C   VALUE TO START
+      NEXTC=IDIM+1
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPERATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C  COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     1SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C   COUNT NUMBER OF MATCHING CONTIGS
+      IMATC=IMATC+1
+      CALL ADISM2(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     1SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     1ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,KBOUT,
+     2MATCH)
+C
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       ADISM2
+C   ROUTINE TO DISPLAY MATCHES
+      SUBROUTINE ADISM2(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+C   NEW PARMS
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+CCCCCCCCCCC
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC. SEE ADISM1.
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with contig number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   UPDATE END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.GT.2)RETURN
+      ILEFTS(IMATC)=CENDS(JJ)+20
+      ILC(IMATC)=LCL+LCR+1
+      IPOSC(IMATC)=LCL+1
+      IPOSG(IMATC)=SAVPG
+      LLINO(IMATC)=NENDS(JJ)
+      ISENSE(IMATC)=1
+      IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      RETURN
+      END
+      SUBROUTINE ADISM3(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+        WRITE(KBOUT,1000)LLINO(IMATC),IPOSC(IMATC),ISTRAN,
+     +  IPOSG(IMATC)
+ 1000   FORMAT
+     +  (' Contig',I5,' position',I6,' matches strand',I2,
+     +  ' at position',I5)
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+      SUBROUTINE ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM3(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      RETURN
+      END
+      SUBROUTINE AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +GEL,MAXGEL,CON,IDC,IDEVW,IDEVR,LREG,RREG,KBOUT)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),CON(IDC)
+      INTEGER RREG,PC,PCA,PG
+      NG = LGEL
+      PG = RELPG(NG)
+      PC = LREG
+      NTT = 0
+      NCT = 0
+      NDT = 0
+10    CONTINUE
+C      WRITE(*,*)'GEL',NG
+      CALL READW(IDEVW,NG,GEL,MAXGEL)
+      LG = ABS(LNGTHG(NG))
+      IF(PC.LT.LREG) PC = LREG
+      PCA = PC - LREG + 1
+      IG = PC - PG + 1
+      LC = MIN(LG,RREG-PC+1)
+C      WRITE(*,*)'PC,PG,IG,LG,PCA,LC',PC,PG,IG,LG,PCA,LC
+      CALL ET(GEL(IG),LG,CON(PCA),LC,NE)
+      NTT = NTT + NE
+      CALL EC(GEL(IG),LG,CON(PCA),LC,NE)
+      NCT = NCT + NE
+      CALL ED(GEL(IG),LG,CON(PCA),LC,ND)
+      NDT = NDT + ND
+      CALL WRITEW(IDEVW,NG,GEL,MAXGEL)
+      IF(ND.GT.0) THEN
+        K = LNGTHG(NG)
+        LNGTHG(NG) = ABS(LNGTHG(NG)) - ND
+        LNGTHG(NG) = SIGN(LNGTHG(NG),K)
+        CALL WRITER(IDEVR,NG,RELPG(NG),LNGTHG(NG),LNBR(NG),RNBR(NG))
+      END IF
+      IF(RNBR(NG).NE.0) THEN
+        NG = RNBR(NG)
+        PG = RELPG(NG)
+        PC = PG
+        IF(PG.LE.RREG) GO TO 10
+      END IF
+      CALL EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      WRITE(KBOUT,1000)NTT
+1000  FORMAT(' Number of transpositions=',I6)
+      WRITE(KBOUT,1001)NCT
+1001  FORMAT(' Number of changes       =',I6)
+      WRITE(KBOUT,1002)NDT
+1002  FORMAT(' Number of deletions     =',I6)
+      END
+C    AJOIN2
+C   COMPLETES JOIN AND RETURNS LENGTH OF NEW CONTIG IN LLINOR
+      SUBROUTINE AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +RELX,LLINOL,LLINOR,LNCONL,LNCONR,IDEVR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNBR(IDBSIZ),RNBR(IDBSIZ),LNGTHG(IDBSIZ)
+      INTEGER RELX
+C   RELX IS THE POSITION OF THE JOINT
+C   LLINOL IS THE LEFT GEL NUMBER OF THE LEFT CONTIG
+C   LLINOR IS THE LEFT GEL OF THE RIGHT CONTIG
+C   LNCONL IS THE LEFT CONTIG LINE NUMBER
+C   LNCONR IS THE RIGHT CONTIG LINE NUMBER
+C
+C   ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+      N=LLINOR
+      RELPG(N)=RELX
+50    CONTINUE
+      IF(RNBR(N).EQ.0)GO TO 60
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+RELX-1
+      GO TO 50
+60    CONTINUE
+C
+C   FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+      LNBR(LLINOR)=RNBR(LNCONL)
+C   FIX UP RIGHT GEL OF LEFT CONTIG
+      N=RNBR(LNCONL)
+      RNBR(N)=LLINOR
+C   MERGE WILL SORT OUT THE CORRECT NEIGHBOURS
+C
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C   MERGE DOES NOT WRITE TO DISK
+      N=LNBR(LNCONL)
+65    CONTINUE
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N))
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 65
+C   CONTIG LINES
+      X=RELPG(LNCONR)+RELX-1
+C   LENGTH MAY NOT HAVE INCREASED!
+      IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+C   SAVE LENGTH OF NEW CONTIG
+      RELX=RELPG(LNCONL)
+C      WRITE(IDEVR,REC=LNCONL)RELPG(LNCONL),LNGTHG(LNCONL),LNBR(LNCONL),
+C     1RNBR(LNCONL)
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C
+C   NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+      N=IDBSIZ-NCONTS
+      M=LNCONR-N
+      IF(M.EQ.0)GO TO 80
+      K=LNCONR
+      J=LNCONR-1
+      DO 70 I=1,M
+      RELPG(K)=RELPG(J)
+      LNGTHG(K)=LNGTHG(J)
+      LNBR(K)=LNBR(J)
+      RNBR(K)=RNBR(J)
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K))
+      K=K-1
+      J=J-1
+70    CONTINUE
+80    CONTINUE
+      NCONTS=NCONTS-1
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+90    CONTINUE
+      RETURN
+      END
+C     SUBROUTINE AJOIN3
+      SUBROUTINE AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,IDIM22,
+     +KLASS,IOVER,KBOUT,PL,PR)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LINCON(2),IDIM22(2)
+      INTEGER ITYPE(2),ISENSE(2),JOINT(2),PL(2),PR(2)
+C
+C   CALC POSITIONS OF CONTIGS RELATIVE TO FIXED GEL
+      DO 20 I=1,2
+C   R+
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.1))GO TO 11
+      PL(I)=-1*JOINT(I)+2
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   L+
+11    CONTINUE
+      IF((ITYPE(I).NE.1).OR.(ISENSE(I).NE.1))GO TO 12
+      PL(I)=JOINT(I)
+      PR(I)=PL(I)+RELPG(LINCON(I))-1
+      GO TO 20
+C   R-
+12    CONTINUE
+      IF((ITYPE(I).NE.-1).OR.(ISENSE(I).NE.-1))GO TO 13
+      PR(I)=JOINT(I)+IDIM22(I)-1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+      GO TO 20
+C   L-
+13    CONTINUE
+      PR(I)=IDIM22(I)-JOINT(I)+1
+      PL(I)=PR(I)-RELPG(LINCON(I))+1
+20    CONTINUE
+C  LENGTH OF OVERLAP
+      IOVER=MIN(PR(1),PR(2))-MAX(PL(1),PL(2))+1
+      WRITE(KBOUT,1002)IOVER
+1002  FORMAT(' Length of overlap between the contigs=',I6)
+C
+C  CLASS NUMBER 1-16
+      KLASS=1
+      IF(ITYPE(1).EQ.1)KLASS=KLASS+8
+      IF(ISENSE(1).EQ.-1)KLASS=KLASS+4
+      IF(ITYPE(2).EQ.1)KLASS=KLASS+2
+      IF(ISENSE(2).EQ.-1)KLASS=KLASS+1
+C      WRITE(KBOUT,1001)KLASS
+C1001  FORMAT(' CLASS OF JOIN=',I6)
+      RETURN
+      END
+C      ALINE
+C
+C    ROUTINE TO LINE UP 2 SEQS.
+C   IT SLIDES,REMOVES OVERLAPPING MATCHES,
+C   SORTS MATCHES INTO ASCENDING ORDER, THEN DOES DOES A TOPOLOGICAL
+C   CHECK, AND THEN PRODUCES 2 LINED UP SEQS WITH PADDING CHARS
+C   VARIABLES
+C       SEQ1 CONSENSUS
+C       SEQ2 GEL ORIGINAL IN CORRECT ORIENTATION
+C       SEQG2 ALIGNED GEL
+C       SEQC2 ALIGNED CONSENSUS
+C       SEQ3 SAVED GEL RAW DATA
+C       ISAV1,2,3 STORE MATCHES AND POSITIONS
+C       IDSAV NUMBER ISAV'S
+C       IDC LENGTH OF INPUT SEQ1
+C       IDIM2 LENGTH OF INPUT SEQ2
+C       IDOUT LENGTH OF OUTPUT ALIGNED SEQ1
+C       IDIM2 LENGTH OF SEQ2 ON OUTPUT AFTER ALIGNMENT
+C       MINSLI MIN MATCH FOR SLIDING
+C       IFAIL FLAG TO SHOW IF ALIGNMENT FAILED DUE TO TOO
+C   MANY MISMATCHES OR TOPOLIGICAL CHECK OR TOO MANY OR TOO MANY
+C   PADDING CHARS. 1=FAIL,0=PASS
+C
+      SUBROUTINE ALINE(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+      WRITE(KBOUT,1052)ITOTPC,ITOTPG
+1052  FORMAT(' Padding in contig= ',I4,' and in gel= ',I4)
+      CALL DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL)
+C   ARE ALL CHECKS OK?
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(PERCM.GT.PERMAX)IFAIL=1
+      END
+      SUBROUTINE ARCSER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*10,NAME2*10
+10    CONTINUE
+      L = 0
+      CALL GTSTR('Archive name',' ',NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(L.EQ.0) RETURN
+      CALL CCASE(NAME1,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      IF(NAME1(1:1).EQ.' ') RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      DO 100 I=1,NGELS
+        CALL READN(IDEVN,I,NAME2)
+        IF(NAME1.EQ.NAME2) THEN
+          WRITE(KBOUT,1003)NAME2,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1003      FORMAT(' ',A,2X,I4,2X,I6,2X,I6,2X,I6,2X,I6/)
+          GO TO 10
+        END IF
+100   CONTINUE
+      WRITE(KBOUT,1004)NAME1
+1004  FORMAT(' ',A,' Not in database')
+      GO TO 10
+      END
+      SUBROUTINE AUTOCN(SEQ1,IDIM,GEL,IDIMG,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL)
+C   AUTHOR: RODGER STADEN
+C   changed 29-11-90 to make first in list of alignments the best
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIM),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIM),GEL(MAXGLM)
+C
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 10)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC)
+      REAL PERMIS(2)
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+C  23-8-90 Need to deal with failures in a better way. Problem is
+C          case where overlaps are found but fail to align. In future
+C          signal them with new variable KFAIL which will be nonzero
+C          if any alignment fails.
+C  29-11-90 Changed sorting of overlaps so that the best is first in the 
+C           list returned to caller.
+C   SAVE GEL
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+C  COUNT NUMBER OF CONTIGS THAT MATCH
+      IMATC=0
+      IDCEND=MAXCON
+      CALL BUSY(KBOUT)
+      CALL FNDCON(SEQ1,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+      CALL ENCO(SEQ1,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+1     CONTINUE
+      ISTRAN=1
+2     CONTINUE
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0)RETURN
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM4(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.EQ.0) THEN
+        IFAIL(1) = 0
+        RETURN
+      END IF
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINE(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS)
+        IF(JFAIL.EQ.0) THEN
+          JMATC = JMATC + 1
+          IF(JMATC.EQ.1) THEN
+C    Save in elements 1
+             CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            IFAIL(1) = 0
+          ELSE IF(JMATC.EQ.2) THEN
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+              IFAIL(2) = 0
+            END IF
+          ELSE
+            IF(PERMS.LT.PERMIS(1)) THEN
+C    Better match so save in elements 1, so copy 1 to 2 first
+              CALL COPYM(ILEFTS(1),ILEFTS(2),ILC(1),ILC(2),
+     +          IPOSC(1),IPOSC(2),ISENSE(1),ISENSE(2),
+     +          LLINO(1),LLINO(2),JOINT(1),JOINT(2),ITOTPC(1),
+     +          ITOTPC(2),ITOTPG(1),ITOTPG(2),ITYPE(1),ITYPE(2),
+     +          IDOUT(1),IDOUT(2),IDIM22(1),IDIM22(2),
+     +          SEQG2(1,1),SEQG2(1,2),SEQC2(1,1),SEQC2(1,2),
+     +          PERMIS(1),PERMIS(2))
+                IFAIL(2) = 0
+C    Now save in 1
+                CALL COPYM(JLEFTS(I),ILEFTS(1),JLC(I),ILC(1),
+     +          JPOSC(I),IPOSC(1),JSENSE(I),ISENSE(1),
+     +          JLLINO(I),LLINO(1),JJOINT,JOINT(1),JTOTPC,
+     +          ITOTPC(1),JTOTPG,ITOTPG(1),JTYPE,ITYPE(1),
+     +          JDOUT,IDOUT(1),JDIM22,IDIM22(1),
+     +          SEQG3,SEQG2(1,1),SEQC3,SEQC2(1,1),
+     +          PERMS,PERMIS(1))
+            ELSE IF(PERMS.LT.PERMIS(2)) THEN
+C    Save in element 2
+                CALL COPYM(JLEFTS(I),ILEFTS(2),JLC(I),ILC(2),
+     +          JPOSC(I),IPOSC(2),JSENSE(I),ISENSE(2),
+     +          JLLINO(I),LLINO(2),JJOINT,JOINT(2),JTOTPC,
+     +          ITOTPC(2),JTOTPG,ITOTPG(2),JTYPE,ITYPE(2),
+     +          JDOUT,IDOUT(2),JDIM22,IDIM22(2),
+     +          SEQG3,SEQG2(1,2),SEQC3,SEQC2(1,2),
+     +          PERMS,PERMIS(2))
+            END IF
+          END IF
+        ELSE
+          KFAIL = 1
+        END IF
+100   CONTINUE
+      IMATC = MIN(2,JMATC)
+      END
+      SUBROUTINE BATIN(SEQ,MAXSEQ,KBIN,KBOUT,IDEV1,IDEV2,IDEV3,
+     +FILNAM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR RODGER STADEN
+      CHARACTER SEQ(MAXSEQ)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER BASE(4),BKEY(4)
+      SAVE BASE,BKEY
+      DATA BASE/'A','C','G','T'/
+      DATA BKEY/'N','M',',','.'/
+      CALL CBASE(BASE,BKEY,IOK,ICBASE,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV1,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      NGELS=0
+10    CONTINUE
+      IDIM=MAXSEQ
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Type data on lines of < 80 characters, finish with @'/)
+      CALL ARRFIM(KBIN,SEQ,IDIM,KBOUT)
+      IF(IDIM.GT.0)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File name for gel reading',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.0) THEN
+          IF(ICBASE.EQ.0) THEN
+            CALL CCASEA(SEQ,IDIM,2)
+            DO 5 J = 1,4
+              CALL SUBS(SEQ,IDIM,BKEY(J),BASE(J))
+5           CONTINUE
+          END IF
+          CALL CCASEA(SEQ,IDIM,1)
+          CALL FMTDKN(IDEV2,SEQ,IDIM)
+          CLOSE(UNIT=IDEV2)
+          NGELS=NGELS+1
+          WRITE(IDEV1,1002)FILNAM
+1002      FORMAT( A)
+        END IF
+      END IF
+      CALL YESNO(IANS,'Type in another gel reading',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IANS.EQ.0) GO TO 10
+      CALL YESNO(IANS,'List gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IANS.EQ.0) THEN
+        REWIND IDEV1
+        DO 50 I=1,NGELS
+          IDIM=MAXSEQ
+          READ(IDEV1,1002)FILNAM
+          CALL OPENRS(IDEV2,FILNAM,IOK,LRECL,2)
+          IF(IOK.NE.0)THEN
+            WRITE(KBOUT,*)' Error opening file'
+            GO TO 50
+          END IF
+          WRITE(IDEV3,1007)FILNAM
+1007      FORMAT(' ',A)
+          CALL ARRFIM(IDEV2,SEQ,IDIM,KBOUT)
+          IF(IDIM.GT.0)CALL FMTDB(SEQ,IDIM,1,IDIM,60,IDEV3)
+          CLOSE(UNIT=IDEV2)
+50      CONTINUE
+      END IF
+      CLOSE(UNIT=IDEV1)
+      WRITE(KBOUT,*)'Number of gel readings entered',NGELS
+      END
+C      BEDIN
+C
+C   SUBROUTINE TO BED A NEW GEL INTO THE DATABASE. IT DISPLAYS
+C   THE JOIN AND ALLOWS CHANGES TO IT, THEN IT ALLOWS EDITING OF
+C   THE NEW GEL USING EDITG OR THE GELS IN THE DATABASE USING
+C   DBEDIT. ANY CHANGES CAN BE DISPLAYED USING DSPLAY. THIS SUBROUTINE
+C   RETURNS NOPT TO TELL OF THE OUTCOME OF ITS USE:3=OK,1=REJOIN,4=GIVE UP
+C   ON INPUT NOPT=2 MEANS LEFT END OVERLAP
+      SUBROUTINE BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,LLINO,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPE1,HELPS2,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(5)*(MAXPRM)
+      INTEGER LREG,RIGHTM
+      INTEGER X,RREG
+C   SAVE NOPT FOR LEFT END OR RIGHT JOIN
+      NTYPE=NOPT
+      LREG=X
+      IF(NTYPE .EQ.2)LREG=1
+      RREG=LREG+49
+      IF(RREG.GT.RELPG(LINCON))RREG=RELPG(LINCON)
+      I1=1
+      IF(NTYPE.EQ.2)I1=X
+      I2=I1+49
+      IF(I2.GT.ABS(LNGTHG(NGELS)))I2=ABS(LNGTHG(NGELS))
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +LLINO,LINCON,LREG,RREG,GEL,I1,I2,2,LLINOR,IDBSIZ,IDEV,
+     +KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+      CALL YESNO(ANS,'Joint correct',
+     +HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.NE.0) THEN
+        NOPT = 1
+        RETURN
+      END IF
+20    CONTINUE
+      CALL BELL(1,KBOUT)
+C     DBMENU now defunct for sap - so we use RADION instead
+C      CALL DBMENU(2,JOPT,HELPS1,HELPE1,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PROMPT(1) = 'Complete entry'
+      PROMPT(2) = 'Edit contig'
+      PROMPT(3) = 'Display'
+      PROMPT(4) = 'Edit new gel reading'
+      JOPT = 1
+      CALL RADION('Enter options', PROMPT, 4, JOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+C  GIVE UP
+      IF(JOPT.EQ.-1)THEN
+        CALL YESNO(ANS,'Quit',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.EQ.0) THEN
+          NOPT = 4
+          RETURN
+        END IF
+      END IF
+C     COMPLETE JOIN
+      IF(JOPT.EQ.1)THEN
+        CALL YESNO(ANS,'Complete entry',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.NE.0) GO TO 20
+        NOPT=3
+        RETURN
+      END IF
+100   CONTINUE
+      IF(JOPT.EQ.4)THEN
+C       EDIT NEW GEL
+        IDIM=ABS(LNGTHG(NGELS))
+        CALL EDITGL(GEL,GEL2,IDIM,KBIN,KBOUT,MAXGEL,
+     +  HELPS2,HELPE2,FILEH,IDEVH,1)
+        LNGTHG(NGELS)=SIGN(IDIM,LNGTHG(NGELS))
+      END IF
+200   CONTINUE
+      IF(JOPT.EQ.3)THEN
+C       DISPLAY
+        LLINO=LNBR(LINCON)
+        LEFTMN=X
+        RIGHTM=X+ABS(LNGTHG(NGELS))-1
+        RIGHTM=MIN(RIGHTM,RELPG(LINCON))
+        IF(NTYPE.EQ.2)THEN
+          LEFTMN=1
+          RIGHTM=ABS(LNGTHG(NGELS))-X+1
+          RIGHTM=MIN(RIGHTM,RELPG(LINCON))
+        END IF
+        CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LEFTMN,
+     +  RIGHTM,LREG,
+     +  RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS1,HELPE1,FILEH,IDEVH,IERR)
+        IF(IERR.NE.0) GO TO 20
+        IF(NTYPE.EQ.2)GO TO 210
+        I1=LREG-X+1
+        IF(I1.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        I2=RREG-X+1
+        IF(I2.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        GO TO 230
+210     CONTINUE
+        I1=X+LREG-1
+        I2=X+RREG-1
+        IF(I1.GT.ABS(LNGTHG(NGELS)))GO TO 200
+        IF(I2.GT.ABS(LNGTHG(NGELS)))GO TO 200
+230     CONTINUE
+        CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +  LLINO,LINCON,LREG,RREG,GEL,I1,I2,2,LLINOR,IDBSIZ,IDEV,
+     +  KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+        LLINO=LNBR(LINCON)
+      END IF
+300   CONTINUE
+      IF(JOPT.EQ.2)THEN
+C       EDIT DATABASE
+        IF(NTYPE.EQ.2)THEN
+          IMAXL=1
+          IMAXR=ABS(LNGTHG(NGELS))-X+1
+        ELSE IF(NTYPE.EQ.1)THEN
+          IMAXL=X
+          IMAXR=X+ABS(LNGTHG(NGELS))-1
+        END IF
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LINCON,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+      IF(JOPT.EQ.1)THEN
+        CALL HELP2(HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      END IF
+      GO TO 20
+      END
+      SUBROUTINE BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN,CHAINL,GCLIN
+      EXTERNAL CLEN,CHAINL,GCLIN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTR = IDBSIZ - NCONTS - 1
+      IF(NCONTR.LE.NGELS) THEN
+        WRITE(KBOUT,*)'Insufficient space for new contig line.'
+        WRITE(KBOUT,*)'Increase database size with copy'
+        RETURN
+      END IF
+10    CONTINUE
+        MN = 0
+        MX = NGELS
+        IR  = 0
+        CALL GETINT(MN,MX,IR,
+     +  'Number of gel reading that will become a left end',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.LT.1) RETURN
+        IR = IVAL
+      IL = LNBR(IR)
+      IF(IL.EQ.0)THEN
+        WRITE(KBOUT,*)'Gel number',IR,' is already a left end'
+        GO TO 10
+      END IF
+      ILO = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IR)
+      IF(ILO.EQ.0)THEN
+        WRITE(KBOUT,*)
+     +'Problem with this contig. Check logical consistency'
+        WRITE(KBOUT,*)'of database. Break not made'
+        RETURN
+      END IF
+      NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,ILO)
+      IF(NCONTO.EQ.0)THEN
+        WRITE(KBOUT,*)'No contig line for this contig. Check logical'
+        WRITE(KBOUT,*)'consistency of database. Break not made'
+        RETURN
+      END IF
+      LCONTO = RELPG(NCONTO)
+      IF(LCONTO.LT.1)THEN
+        WRITE(KBOUT,*)'Contig has zero length. Break not made'
+        RETURN
+      END IF
+      CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+      END
+      SUBROUTINE CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER CLEN
+      EXTERNAL CLEN
+C ROUTINE TO BREAK A CONTIG INTO 2
+C LEFT GEL OF NEW RIGHT CONTIG IS IR
+C RIGHT GEL OF NEW LEFT CONTIG IS IL
+C LEFT GEL OF OLD LEFT CONTIG IS ILO
+C CONTIG LINE OF OLD CONTIG IS NCONTO
+C CONTIG LINE OF NEW RIGHT CONTIG IS NCONTR
+C CONTIG LINE OF NEW LEFT CONTIG IS NCONTO
+C LENGTH OF OLD CONTIG IS LCONTO
+      IOK = 1
+      NCONTS = NCONTS + 1
+C  WRITE LAST LINE OF DB
+      WRITE(KBOUT,*)'Increasing number of contigs by 1'
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C  MAKE NEW CONTIG A COPY OF OLD
+      RELPG(NCONTR) = RELPG(NCONTO)
+      LNGTHG(NCONTR) = LNGTHG(NCONTO)
+      LNBR(NCONTR) = IR
+      RNBR(NCONTR) = RNBR(NCONTO)
+      WRITE(KBOUT,*)'Writing new right contig line'
+      CALL WRITER(IDEVR,NCONTR,RELPG(NCONTR),LNGTHG(NCONTR),
+     +LNBR(NCONTR),RNBR(NCONTR))
+C  NEED LENGTH FOR OLD LEFT CONTIG
+      RNBR(IL) = 0
+      L = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IL)
+      IF(L.LT.1)THEN
+        WRITE(KBOUT,*)
+     +  'New left contig has zero length. Break not made'
+        RETURN
+      END IF
+      RELPG(NCONTO) = L
+      RNBR(NCONTO) = IL
+C  DO CONTIG LINE FOR NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new left contig line'
+      WRITE(KBOUT,*)'New length=',RELPG(NCONTO)
+      WRITE(KBOUT,*)'New right gel=',RNBR(NCONTO)
+      CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +LNBR(NCONTO),RNBR(NCONTO))
+C  DO GEL LINE FOR RIGHT GEL OF NEW LEFT CONTIG
+      WRITE(KBOUT,*)'Writing new right gel of left contig'
+      WRITE(KBOUT,*)'Gel number=',IL
+      CALL WRITER(IDEVR,IL,RELPG(IL),LNGTHG(IL),
+     +LNBR(IL),RNBR(IL))
+C  DO GEL LINE FOR NEW RIGHT CONTIG
+      LNBR(IR) = 0
+      WRITE(KBOUT,*)'Writing new left gel of right contig'
+      WRITE(KBOUT,*)'Gel number=',IR
+      CALL WRITER(IDEVR,IR,RELPG(IR),LNGTHG(IR),
+     +LNBR(IR),RNBR(IR))
+C  NOW SHIFT
+      I = 1 - RELPG(IR)
+      WRITE(KBOUT,*)'Shifting gels in right contig by distance=',I
+      CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IR,NCONTR,I)
+      WRITE(KBOUT,*)'Right contig length=',RELPG(NCONTR)
+      WRITE(KBOUT,*)'Break completed'
+      IOK = 0
+      END
+C     BUBBL3
+C   SUBROUTINE TO SORT INTEGER ARRAY (LIST) INTO ASCENDING  ORDER
+C
+      SUBROUTINE BUBBL3(LIST,LISTEL,LISTAL,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LISTEL(IDIM),LISTAL(IDIM)
+C
+C   SET POINTERS TO ZERO
+      I=0
+      J=0
+C
+10    CONTINUE
+C
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+C
+C   INCREMENT POINTER TO NEXT ELEMENT
+      I=I+1
+C   TEST FOR END OF ARRAY
+      IF(I.EQ.IDIM)RETURN
+C
+20    CONTINUE
+C
+C   COMPARE ADJACENT ELEMENTS
+      IF(LIST(I).GE.LIST(I+1))GO TO 10
+C
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+C
+C   EXCHANGE ADJACENT ELEMENTS
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+C
+      ITEMP=LISTEL(I)
+      LISTEL(I)=LISTEL(I+1)
+      LISTEL(I+1)=ITEMP
+      ITEMP=LISTAL(I)
+      LISTAL(I)=LISTAL(I+1)
+      LISTAL(I+1)=ITEMP
+C
+C
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+C
+      GO TO 20
+      END
+      SUBROUTINE CBASE(BASE,BKEY,IOK,ICBASE,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR RODGER STADEN
+      CHARACTER BASE(4),BKEY(4),HELPF*(*),NEW(4)
+      CHARACTER PROMPT*14
+      SAVE PROMPT
+      DATA PROMPT/'Key for base  '/
+      IOK = 1
+      CALL YESNO(ICBASE,'Use special keys for ACTG',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ICBASE.EQ.1) THEN
+        IOK = 0
+        RETURN
+      ELSE IF(ICBASE.NE.0) THEN
+       RETURN
+      END IF
+      I = 1
+1     CONTINUE
+      IF(I.LT.5) THEN
+        PROMPT(14:14) = BASE(I)
+        L = 1
+        CALL GETSTR(PROMPT,BKEY(I),NEW,4,L,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+           CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+           GO TO 1
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(NEW(1).EQ.';') THEN
+          WRITE(KBOUT,*)' ; is not allowed'
+          GO TO 1
+        END IF
+        IF(L.NE.0) BKEY(I) = NEW(1)
+        I = I + 1
+        GO TO 1
+      END IF
+      CALL CCASEA(BKEY,4,2)
+      IOK = 0
+      END
+      SUBROUTINE CCTA(SEQ,ID)
+      CHARACTER SEQ(ID),COM,AS
+      SAVE COM,AS
+      DATA COM/','/,AS/'*'/
+      DO 10 I = 1,ID
+        IF(SEQ(I).EQ.COM) SEQ(I) = AS
+10    CONTINUE
+      END
+      SUBROUTINE CEDT(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,LINLEN,
+     +FILNAM,LLINO,IOK)
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER LINEIN*80,GEL(MAXGEL),GEL2(MAXGEL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2),RREG
+      WRITE(KBOUT,*)'Identify contig to edit'
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for temporary editing file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      LREG = 1
+      RREG = RELPG(LINCON)
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+      CLOSE(UNIT=IDEV)
+      CALL CEDIT(FILNAM)
+      CALL YESNO(IWANT,'Put edited contig into database',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IWANT.NE.0) RETURN
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+      IF(IOK.NE.0) RETURN
+      KGEL = LLINO
+      CALL BUSY(KBOUT)
+      CALL POSTC(GELNOS,GELSTR,GELEND,LINEIN,
+     +IOK,KBOUT,IDEV,MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,
+     +LNBR,RNBR,IDBSIZ,KGEL)
+      IF(IOK.NE.0) THEN
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      REWIND IDEV
+      CALL NEWDB(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,KBOUT,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXDB,GEL,MAXGEL,LINCON,KGEL,IOK)
+      CLOSE(UNIT=IDEV)
+      END
+C
+C     CFGEL
+C
+C   ROUTINE TO COMPARE A STRING OF WORD NUMBERS FOR A GEL WITH A SERIES
+C   OF ARRAYS REPRESENTING A CONSENSUS SEQUENCE. WE LOOK FOR OCCURENCES
+C   OF PAIRS OF WORDS (EACH WORD IS LENGTH CHARS LONG AND SO TOTAL MATCH IS
+C   2*LENGTH CHARS LONG). THE ARRAYS SENT ARE OF SIZE 4**LENGTH (LE4)
+      SUBROUTINE CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LE4,
+     +SAVPG,
+     +SAVPS,SAVL,IDSAV,SEQ,GEL,MINMAT,IFAIL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),GEL(IDIMG)
+      INTEGER POSNS(IDIM),WORDP(LE4),SAVPS(IDSAV),SAVPG(IDSAV)
+      INTEGER GELN(IDIMG),SAVL(IDSAV)
+      INTEGER WORDN(LE4)
+      INTEGER W1,W2,PS1W1,PS1W2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C bug fix 27-2-92: if sequence had different case matches would
+C not be extended correctly because we look for identical characters
+C switched to using ctonum
+      IDSAVM=IDSAV
+      IDSAV=0
+C   LENGTH OF PAIR OF WORDS
+      LX2=2*LENGTH
+C   NUMBER OF PAIRS OF WORDS OF LENGTH LENGTH IN GEL
+      NW=IDIMG-(LX2-1)
+C   LOOP FOR EACH START POINT
+      DO 200 I=1,NW
+C   IS THIS WORD A ZERO?
+      W1=GELN(I)
+      IF(W1.EQ.0)GO TO 200
+C   POINT TO NEXT WORD OF PAIR
+      W2=GELN(I+LENGTH)
+      IF(W2.EQ.0)GO TO 200
+C   DOES W1 EXIST IN SEQ?
+      N1S1=WORDN(W1)
+      IF(N1S1.EQ.0)GO TO 200
+      N2S1=WORDN(W2)
+      IF(N2S1.EQ.0)GO TO 200
+C   BOTH EXIST, SO POINT TO THE FIRST + LENGTH
+      PS1W1=WORDP(W1)+LENGTH
+C   LOOP FOR ALL PAIRS
+      DO 50 J=1,N1S1
+C   POINT TO FIRST W2 BECAUSE IT IS IN WORDP NOT POSNS
+      PS1W2=WORDP(W2)
+C   LOOP FOR THESE
+      DO 40 K=1,N2S1
+C   ARE THIS PAIR LENGTH APART?
+      N=PS1W1-PS1W2
+      IF(N.NE.0)GO TO 20
+C   THEY ARE SO, IF REQUIRED LOOK FOR REST OF MATCH
+      LMAT=LX2
+      IF(MINMAT.EQ.LX2)GO TO 15
+      IPC=PS1W2+LENGTH
+      IPG=I+LX2
+16    CONTINUE
+      IF(IPG.GT.IDIMG)GO TO 15
+      IF(IPC.GT.IDIM)GO TO 15
+C
+      IF(CTONUM(SEQ(IPC)).NE.CTONUM(GEL(IPG)))GO TO 15
+      LMAT=LMAT+1
+      IPC=IPC+1
+      IPG=IPG+1
+      GO TO 16
+15    CONTINUE
+C  IS MATCH LONG ENOUGH?
+      IF(LMAT.LT.MINMAT)GO TO 20
+      IDSAV=IDSAV+1
+      IF(IDSAV.LE.IDSAVM)GO TO 18
+      WRITE(KBOUT,1000)IDSAVM
+1000  FORMAT(' More than ',I6,' matches. Search aborted')
+      IFAIL=1
+      RETURN
+18    CONTINUE
+      SAVL(IDSAV)=LMAT
+      SAVPG(IDSAV)=I
+      SAVPS(IDSAV)=PS1W1-LENGTH
+20    CONTINUE
+C   POINT TO NEXT W2
+      PS1W2=POSNS(PS1W2)
+40    CONTINUE
+C   ALL TRIED THIS PS1W1, TRY NEXT
+      PS1W1=POSNS(PS1W1-LENGTH)+LENGTH
+50    CONTINUE
+200   CONTINUE
+      IFAIL=0
+      RETURN
+      END
+      INTEGER FUNCTION CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      J = I
+      CHAINL = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        J = I
+        I = LNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CHAINL = J
+      END
+C
+C      CHANGE
+C
+C   ROUTINE TO EXCHANGE ALL THE CHARS IN A CHARACTER ARRAY USING
+C   A PAIR OF LOOKUP ARRAYS SENT BY CALLING PROG
+C
+C
+      SUBROUTINE CHANGE(SEQ,IDIM1,CHAR1,CHAR2,IDIM2,ELSE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1)
+      CHARACTER CHAR1(IDIM2),CHAR2(IDIM2)
+      CHARACTER ELSE
+      DO 100 I=1,IDIM1
+C
+      DO 50 J=1,IDIM2
+C
+      IF(SEQ(I).NE.CHAR1(J))GO TO 50
+C   MATCH SO EXCHANGE CHARS
+      SEQ(I)=CHAR2(J)
+      GO TO 100
+50    CONTINUE
+      SEQ(I)=ELSE
+100   CONTINUE
+C
+      RETURN
+      END
+      CHARACTER*1 FUNCTION CHARSL(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'ctag*-'/
+      CHARSL = C(I:I)
+      END
+      CHARACTER*1 FUNCTION CHARSU(I)
+      CHARACTER C*6
+      SAVE C
+      DATA C/'CTAG*-'/
+      CHARSU = C(I:I)
+      END
+      INTEGER FUNCTION CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = LGEL
+      CHNRP = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +LGEL,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = LGEL
+      CHNRP1 = 0
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I)+ABS(LNGTHG(I))-1.LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 10
+        END IF
+        CHNRP1 = I
+        RETURN
+      END IF
+      END
+      INTEGER FUNCTION CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LEFT GEL NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      I = IIN
+      CLEN= 0
+      LEN = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        LEN = MAX(LEN,(RELPG(I) + ABS(LNGTHG(I)) - 1))
+        I = RNBR(I)
+        IF(I.EQ.IIN)RETURN
+        GO TO 10
+      END IF
+      CLEN = LEN
+      END
+      SUBROUTINE CLIST(GELNO1,LINNO1,IGEL1,GELNO2,LINNO2,
+     +IGEL2,GELNOS,GELSTR,GELEND,IUNIQ1,IUNIQ,KBOUT,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER GELNO1(IGEL1),GELNO2(IGEL2),GELNOS(IUNIQ1)
+      INTEGER GELSTR(IUNIQ1),GELEND(IUNIQ1)
+      INTEGER LINNO1(IGEL1),LINNO2(IGEL2)
+      EXTERNAL INLIST
+C   GELNOS === GEL NUMBERS (GELNOS)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   GELNO  === GEL NUMBERS PER STRIP
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C
+C     WHICH GELS IN GELNO2 DO NOT APPEAR IN GELNO1
+C     IE HAVE STARTED IN GELNO2
+        DO 20 I=1,IGEL2
+        MATCH=INLIST(GELNO1,IGEL1,GELNO2(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH SO NEW
+C         PUT IN GELSTR
+          IUNIQ=IUNIQ+1
+          GELNOS(IUNIQ)=GELNO2(I)
+          GELSTR(IUNIQ)=LINNO2(I)
+        END IF
+20    CONTINUE
+C     WHICH GELS IN GELNO1 DO NOT APPEAR IN GELNO2
+C     IE WHICH HAVE ENDED IN GELNO1
+      DO 10 I=1,IGEL1
+        MATCH=INLIST(GELNO2,IGEL2,GELNO1(I))
+        IF(MATCH.EQ.0)THEN
+C         NO MATCH  SO MUST HAVE ENDED
+C         WHERE IS IT STORED IN GELNOS?
+          MATCH=INLIST(GELNOS,IUNIQ,GELNO1(I))
+          IF(MATCH.NE.0)THEN
+            GELEND(MATCH)=LINNO1(I)
+            GO TO 10
+          END IF
+C         ERROR
+          WRITE(KBOUT,1000)GELNO1(I)
+1000      FORMAT( ' Error: gel number ',I5,
+     +    ' expected but not found in list')
+          IOK = 1
+          RETURN
+        END IF
+10    CONTINUE
+      IOK = 0
+      RETURN
+      END
+C
+C      CMPLMT
+C
+C   SUBROUTINE TO REVERSE AND COMPLEMENT GELS AND DATA BASE
+C   THE POSITIONS OF THE RIGHT ENDS OF GELS ARE FIRST STORED
+C   IN RELPG THEN WE DO A BUBBLE SORT ON THESE POSITIONS
+C   UPDATING RELATIONSHIPS AS WE GO
+C   ALSO SEQUENCES ARE COMPLEMENTED, SIGNS OF LENGTH ARE
+C   MULTIPLIED BY -1 AND THE CONTIG LINE IS ALTERED
+      SUBROUTINE CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,GEL,IDBSIZ,KBOUT,IDEVR,IDEVW,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER X
+C
+      WRITE(KBOUT,1000)LLINO
+1000  FORMAT( ' Complementing contig',I6)
+C   CHAIN THRU AND PUT RIGHT ENDS IN RELPG
+      N=LLINO
+10    CONTINUE
+      RELPG(N)=RELPG(N)+(ABS(LNGTHG(N)))-1
+      IF(RNBR(N).EQ.0)GO TO 20
+      N=RNBR(N)
+      GO TO 10
+20    CONTINUE
+C
+C   NOW EFFECTIVELY BUBBLE SORT ON RELPG
+      N=RNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+      N=NL
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+      NL=LNBR(N)
+      IF(NL.EQ.0)GO TO 30
+      I1=0
+23    CONTINUE
+      IF(RELPG(N).GE.RELPG(NL))GO TO 21
+C   NOT IN CORRECT ORDER SO CHAIN ALONG UNTIL CORRECT,THEN COME
+C   BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE THIS LINE SET POINTER TO CURRENT POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C
+C   EXCHANGE NEIGHBOURS. CURRENTLY LOOKING AT N AND ITS LEFT
+C   NBR, AND THE LEFT NBR IS FURTHER RIGHT THAN N
+C   FIX UP POINTERS TO LEFT AND RIGHT OF THESE TWO
+      M=LNBR(NL)
+      IF(M.NE.0)RNBR(M)=N
+      M=RNBR(N)
+      IF(M.NE.0)LNBR(M)=NL
+      LNBR(N)=LNBR(NL)
+      LNBR(NL)=N
+      RNBR(NL)=RNBR(N)
+      RNBR(N)=NL
+C   CHAIN BACK THRU LIST WITH THIS LINE
+      N=RNBR(NL)
+      IF(N.EQ.0)GO TO 21
+C   IE END MET
+      GO TO 23
+30    CONTINUE
+C   FINISH WITH LEFT END IN N
+40    CONTINUE
+C   NOW REVERSE NBRS SO CHAIN BACK RIGHT
+      NL=RNBR(N)
+      IF(NL.EQ.0)GO TO 50
+      RNBR(N)=LNBR(N)
+      LNBR(N)=NL
+      N=NL
+      GO TO 40
+50    CONTINUE
+C   NEED TO FIX UP NEW LEFT END
+      RNBR(N)=LNBR(N)
+      LNBR(N)=0
+C   ALL POINTERS FIXED NOW DO RELATIVE POSITION
+C   FINISH WITH LEFT END IN N
+C   SO CHAIN BACK RIGHT
+C   SAVE RIGHT LINE NUMBER
+      NL=N
+      X=RELPG(N)
+60    CONTINUE
+      RELPG(N)=1+(-1*(RELPG(N)-X))
+      IF(RNBR(N).EQ.0)GO TO 70
+      N=RNBR(N)
+      GO TO 60
+70    CONTINUE
+C   NOW FIX CONTIG LINE
+      LNBR(LINCON)=NL
+      RNBR(LINCON)=N
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+C   NOW REVERSE AND COMPLEMENT GELS
+      N=NL
+80    CONTINUE
+C      READ(IDEVW,REC=N)GEL
+      CALL READW(IDEVW,N,GEL,MAXGEL)
+      M=ABS(LNGTHG(N))
+      CALL SQREV(GEL,M)
+      CALL SQCOM(GEL,M)
+      CALL WRITEW(IDEVW,N,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=N)GEL
+C   CHANGE SIGNS
+      LNGTHG(N)=-1*LNGTHG(N)
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   ANY MORE?
+      N=RNBR(N)
+      IF(N.NE.0)GO TO 80
+C   NO MORE
+      RETURN
+      END
+C      CONSEN
+C   CALCULATES A CONSENSUS USING THE RULES OUTLINED IN THE DOCUMENTATION
+C   AND SUBROUTINE SUMMER
+C   UNIT IDEV IS USED FOR OUTPUT
+      SUBROUTINE CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +KBIN,KBOUT,IDEVW,IDEV,NAMCON,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,IDM,PERCD,IDEVN,LLINO)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CHARACTER NAMCON*(*)
+100   CONTINUE
+      ISTART=1
+      NAMCON = ' '
+      CALL OPENF1(IDEV,NAMCON,1,IOK,KBIN,KBOUT,
+     +'Name for consensus file',
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(ANS,'Make consensus for whole database',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.1)GO TO 150
+      N=IDBSIZ-NCONTS
+      CALL BUSY(KBOUT)
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        IF((ISTART+19+Y).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded,',/,
+     +    ' calculation aborted')
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,
+     +  IDEVW,MAXGEL,IDM,PERCD)
+        ISTART=ISTART+Y
+110   CONTINUE
+      ISTART=ISTART-1
+      CALL FMTDK(IDEV,SEQ1,ISTART)
+      GO TO 400
+150   CONTINUE
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)GO TO 400
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)GO TO 400
+      IDIM2=RREG-LREG+1
+      IF((ISTART+19+IDIM2).GT.IDIM1)THEN
+         WRITE(KBOUT,1009)IDIM1
+         RETURN
+      END IF
+      CALL BUSY(KBOUT)
+      CALL ADDTIT(SEQ1(ISTART),NAMPRO,LLINO,ISTART)
+      CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1(ISTART),IDIM2,GEL,LREG,RREG,LLINO,IDBSIZ,TEMP,
+     +CHRSIZ,MAXGL2,IDEVW,MAXGEL,IDM,PERCD)
+      ISTART=ISTART+IDIM2
+300   CONTINUE
+      CALL YESNO(ANS,'Select another contig',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0) GO TO 150
+      ISTART=ISTART-1
+      CALL FMTDK(IDEV,SEQ1,ISTART)
+400   CONTINUE
+      CALL YESNO(ANS,'Make another consensus',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.EQ.0)GO TO 100
+      RETURN
+      END
+      SUBROUTINE COPYM(JLEFTS,ILEFTS,JLC,ILC,
+     +JPOSC,IPOSC,JSENSE,ISENSE,JLLINO,LLINO,
+     +JJOINT,JOINT,JTOTPC,ITOTPC,JTOTPG,ITOTPG,
+     +JTYPE,ITYPE,JDOUT,IDOUT,JDIM22,IDIM22,
+     +SEQG3,SEQG2,SEQC3,SEQC2,PERMS,PERMIS)
+      CHARACTER SEQG3(JDIM22),SEQG2(JDIM22),SEQC3(JDOUT),SEQC2(JDOUT)
+      ILEFTS = JLEFTS
+      ILC = JLC
+      IPOSC = JPOSC
+      IPOSG = JPOSG
+      ISENSE = JSENSE
+      LLINO = JLLINO
+      JOINT = JJOINT
+      ITOTPC = JTOTPC
+      ITOTPG = JTOTPG
+      ITYPE = JTYPE
+      IDOUT = JDOUT
+      IDIM22 = JDIM22
+      CALL SQCOPY(SEQG3,SEQG2,JDIM22)
+      CALL SQCOPY(SEQC3,SEQC2,JDOUT)
+      PERMIS = PERMS
+      END
+      SUBROUTINE CRUNS(GEL,IFIRST,ILAST,RUN)
+      CHARACTER GEL*50,RUN*50,PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C  MARK RUNS IN THIS CONSENSUS SO THAT
+C  ATGGGCC BECOMES
+C     ** *
+      RUN(1:ILAST) = ' '
+      DO 10 I = IFIRST+1,ILAST
+C  MUST ALLOW FOR THE EXTEMELY UNLIKELY CASE OF A RUN OF DASHES
+C  IN THE CONSENSUS
+        IF(GEL(I:I).NE.'-')THEN
+          IF(GEL(I:I).EQ.GEL(I-1:I-1)) RUN(I:I) = PAD
+        END IF
+10    CONTINUE
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE DALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(
+     +' Matching region too long for routine dalign. Alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      WRITE(KBOUT,1000)X
+1000  FORMAT(' Percentage mismatch after alignment = ',F4.1)
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Best alignment found')
+      CALL SQMTCH(SEQC2(1),SEQG2(IENDG),SEQ3,LO)
+      CALL FMT4LN(SEQC2(1),SEQG2(IENDG),SEQ3,LO,IENDC,IENDG,KBOUT)
+      IFAIL=0
+      END
+C     DBCHEK
+C   TO CHECK THE  SELF CONSISTENCY OF A DB
+C   ARE ALL NEIGHBOURS HOLDING HANDS OR DO SOME HAVE
+C   GELS WHICH DO NOT BELONG TO ANY CONTIG? OR WORSE BELONG TO MORE
+C   THAN ONE CONTIG?
+C   ARE THERE ANY LOOPS IN CONTIGS (THE WORST PROBLEM)
+C   ARE ALL RELATIVE POSITIONS IN SAME ORDER AS NEIGHBOURS?
+C   ARE THERE ANY CONTIGS OF LENGTH <1 ?
+C   ARE THERE ANY GELS OF ZERO LENGTH ?
+C   RETURN ERROR CODE OF ZERO FOR ALL OK, 1 FOR GELS NOT USED
+C   AND 2 FOR ALL OTHER ERRORS
+      SUBROUTINE DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP,IERR,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER TEMP(IDBSIZ)
+      IERR=0
+C   HAND HOLDING OK?
+      DO 100 I=1,NGELS
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 50
+        IF(RNBR(K).EQ.I)GO TO 50
+        WRITE(KBOUT,1000)I
+1000    FORMAT(' Hand holding problem for gel reading',I6)
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+1001    FORMAT(' ',5I6)
+        IERR=2
+50      CONTINUE
+        K=RNBR(I)
+        IF(K.EQ.0)GO TO 100
+        IF(LNBR(K).EQ.I)GO TO 100
+        WRITE(KBOUT,1000)I
+        WRITE(KBOUT,1001)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        WRITE(KBOUT,1001)K,RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+        IERR=2
+100   CONTINUE
+C
+C   ALL GELS IN CONTIGS OR IN MORE THAN ONE CONTIG?
+C  TEMP IS USED TO COUNT THE NUMBER OF TIMES EACH GEL IS USED
+      CALL FILLI(TEMP,IDBSIZ,0)
+      N=IDBSIZ-NCONTS
+C   COUNT NUMBER OF PASSES THRU LOOP 320 TO 300. IF THIS EXCEEDS
+C   THE DATABASE SIZE A LOOP HAS BEEN FOUND!
+      ICOUNT=0
+      DO 300 I=N,IDBSIZ-1
+        II=I
+        K=LNBR(I)
+        J=RNBR(I)
+        IF((K.NE.0).AND.(J.NE.0))GO TO 310
+C   THIS CONTIG POINTS TO ZERO GEL NUMBER AS LEFT OR RIGHT END!
+        WRITE(KBOUT,1002)I
+1002    FORMAT(' Contig',I4,' has zero gel numbers')
+        IERR=2
+        GO TO 290
+310     CONTINUE
+        IF((LNBR(K).EQ.0).AND.(RNBR(J).EQ.0))GO TO 290
+C   THIS LEFT END IS NOT A LEFT END OR THIS RIGHT END IS NOT A RIGHT END
+        WRITE(KBOUT,1004)I
+1004  FORMAT(' The end gels of contig',I4,' have outward neighbours')
+        IERR=2
+290     CONTINUE
+C       DOES THE CONTIG HAVE SOME +VE LENGTH?
+        IF(RELPG(I).GT.0)GO TO 320
+        WRITE(KBOUT,1010)I
+1010    FORMAT(' The contig on line number',I4,' has zero length')
+        IERR=2
+320     CONTINUE
+        TEMP(K)=TEMP(K)+1
+        ICOUNT=ICOUNT+1
+        IF(ICOUNT.GT.IDBSIZ)GO TO 601
+        K=RNBR(K)
+        IF(K.NE.0)GO TO 320
+300   CONTINUE
+      DO 400 I=1,NGELS
+        IF(TEMP(I).EQ.1)GO TO 390
+        IF(TEMP(I).EQ.0)GO TO 410
+        WRITE(KBOUT,1005)I,TEMP(I)
+1005    FORMAT(' Gel number ',I6,' is used ',I6,' times')
+        IERR=2
+        GO TO 400
+390     CONTINUE
+C       DOES THE GEL HAVE NONZERO LENGTH? (ONLY CHECK THOSE USED ONCE)
+        IF(LNGTHG(I).NE.0)GO TO 400
+        WRITE(KBOUT,1011)I
+1011    FORMAT(' Gel number',I6,' has zero length')
+        IERR=2
+        GO TO 400
+410     CONTINUE
+        WRITE(KBOUT,1006)I
+1006    FORMAT(' Gel number ',I6,' is not used')
+C       NEED TO INCREMENT THE ERROR COUNT NOT RESET IT TO A LOWER VALUE!
+        IF(IERR.LT.2)IERR=1
+400   CONTINUE
+C   ALL RELATIVE POSNS OK?
+      N=IDBSIZ-NCONTS
+      DO 500 I=N,IDBSIZ-1
+        K=LNBR(I)
+        IF(K.EQ.0)GO TO 500
+510     CONTINUE
+        J=RNBR(K)
+        IF(J.EQ.0)GO TO 500
+        IF(RELPG(K).GT.RELPG(J))GO TO 520
+        K=J
+        GO TO 510
+520     CONTINUE
+        WRITE(KBOUT,1007)K,RELPG(K),J,RELPG(J)
+1007    FORMAT(' Gel number',I6,' with position',I6,
+     +  ' is the left neighbour of',
+     +  /,' gel number',I6,' with position',I6)
+        K=J
+        IERR=2
+        GO TO 510
+500   CONTINUE
+      IF(IERR.EQ.0)WRITE(KBOUT,1013)
+1013  FORMAT(' Database is logically consistent')
+      RETURN
+601   CONTINUE
+      IERR=2
+      WRITE(KBOUT,1008)II
+1008  FORMAT(' Loop in contig',I6,/,
+     +' No further checking done but gel numbers follow')
+      CALL FILLI(TEMP,IDBSIZ,0)
+      K=LNBR(II)
+710   CONTINUE
+      TEMP(K)=TEMP(K)+1
+      WRITE(KBOUT,1009)K
+1009  FORMAT(' ',I6)
+      IF(TEMP(K).GT.1)RETURN
+      K=RNBR(K)
+      GO TO 710
+      END
+      SUBROUTINE DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SUBROUTINE TO EDIT GEL READINGS IN DATA BASE
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER POSN,Y,X
+      CHARACTER GEL(MAXGEL)
+      CHARACTER CHARS(80),PROMPT*32
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(3)*(MAXPRM)
+      SAVE PROMPT
+      DATA PROMPT/'Characters for gel reading      '/
+1     CONTINUE
+C      CALL BELL(1,KBOUT)
+C      CALL DBMENU(5,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Insert'
+      PRMPT(2) = 'Delete'
+      PRMPT(3) = 'Change'
+      NOPT = 1
+      CALL RADION('Edit options', PRMPT, 3, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.EQ.-1)RETURN
+      IF(NOPT.EQ.3)GO TO 200
+10    CONTINUE
+      MN = 1
+      MX = RELPG(LINCON)
+      POSN = 0
+      IF(IMAXL.GT.0) THEN
+        MN = IMAXL
+        MX = IMAXR      
+      END IF
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+20    CONTINUE
+      MN = 1
+      MX = 50
+      NC = 1
+      CALL GETINT(MN,MX,NC,
+     +'Number of characters',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      NC = IVAL
+C   NEED TO CHECK FOR DELETION TO END OF CONTIG
+      Y=POSN+NC
+      K=RELPG(LINCON)-POSN+1
+      IF((NOPT.EQ.2).AND.(Y.GT.RELPG(LINCON)))NC=K
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      K=POSN-RELPG(LLINO)+1
+      IF(NOPT.EQ.2)GO TO 100
+50    CONTINUE
+      WRITE(PROMPT(28:),1005)LLINO
+1005  FORMAT(I5)
+      LC = NC
+      CALL FILLC(CHARS,NC,'*')
+      CALL GETSTR(PROMPT,CHARS,CHARS,NC,LC,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,2000)LLINO
+2000    FORMAT(
+     +' Data pushed off end of gel',I5,' During insertion')
+        NM512=N-MAXGEL
+        N=MAXGEL
+C       SET M SO THAT THE DATA AT THE END IS LOST
+        M=M-NM512
+        LNGTHG(LLINO)=SIGN(N,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      IC=0
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        IC=IC+1
+C       DONT ALLOW SPACES - SET THEM TO *'S
+        IF(CHARS(IC).EQ.' ')CHARS(IC)='*'
+        GEL(I)=CHARS(IC)
+60    CONTINUE
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+65    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 70
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+(NC)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 76
+90    CONTINUE
+      RELPG(LINCON)=RELPG(LINCON)+(NC)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      GO TO 1
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   DELETE
+100   CONTINUE
+C
+C   MOVE DATA TO LEFT
+      M=K+NC
+C   POSN TO MOVE TO
+      N=MAX(1,K)
+C   NUMBER TO MOVE
+      L=ABS(LNGTHG(LLINO))-M+1
+C   NONE TO MOVE?
+      IF(L.GT.0) THEN
+        DO 160 I=1,L
+          GEL(N)=GEL(M)
+          N=N+1
+          M=M+1
+160     CONTINUE
+      END IF
+C   NEDE TO FIX LENGTHS AND REL POSNS OF GELS EDITED
+C   3 CLASSES GELS LOSING LEFT ENDS,RIGHT ENDS,CENTRES
+C   FOR LEFT ENDS K<1,RIGHT ENDS L<1
+C   SO SET RELPG FOR THOSE LOSING LEFT ENDS
+      IF(K.LT.1)RELPG(LLINO)=POSN
+C   FIX LENGTHS
+      N=N-1
+      IF(L.GT.0)LNGTHG(LLINO)=SIGN(N,LNGTHG(LLINO))
+C   THOSE LOSING RIGHT ENDS
+      M=(POSN-RELPG(LLINO))
+      IF(L.LT.1)LNGTHG(LLINO)=SIGN(M,LNGTHG(LLINO))
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+165   CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 170
+C   DOES IT HAVE DATA IN REGION?
+      X=POSN+(NC)
+      IF(RELPG(LLINO).GE.X)GO TO 170
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 165
+C   WITHIN
+      GO TO 40
+170   CONTINUE
+C   DELETIONS DONE SO DECREMENT ALL GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+      X=POSN+(NC)
+175   CONTINUE
+      IF(RELPG(LLINO).GE.X)GO TO 180
+176   CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 190
+      GO TO 175
+180   CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)-(NC)
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+      GO TO 176
+190   CONTINUE
+      RELPG(LINCON)=RELPG(LINCON)-(NC)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      GO TO 1
+200   CONTINUE
+C
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   CHANGE
+      MN = 1
+      MX = NGELS
+      LLINO = 0
+      CALL GETINT(MN,MX,LLINO,'Gel number',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 200
+      LLINO = IVAL
+      WRITE(KBOUT,1022)RELPG(LLINO),LNGTHG(LLINO)
+1022  FORMAT( ' This gels relative position =',I6,' and length =',I6)
+220   CONTINUE
+      MN = RELPG(LLINO)
+      MX = RELPG(LLINO) + ABS(LNGTHG(LLINO)) - 1
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 220
+      POSN = IVAL
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+225   CONTINUE
+      LC = 50
+      NC = 0
+      CALL GETSTR('New chararacters',' ',CHARS,LC,NC,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(NC.EQ.0) GO TO 1
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 225
+      END IF
+      K=POSN-RELPG(LLINO)+1
+C   COPY CHARS
+      DO 230 I=1,NC
+        IF(CHARS(I).EQ.' ')CHARS(I)='*'
+        GEL(K)=CHARS(I)
+        K=K+1
+        IF(K.GT.MAXGEL) GO TO 231
+230   CONTINUE
+231   CONTINUE
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+      GO TO 1
+      END
+C   DBPRNT
+C   PRINTS A DATABASE. IE ITS RELATIONSHIPS
+      SUBROUTINE DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +IDEV,KBIN,KBOUT,IDEVN,LLINO,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),LREG,RREG,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMARC*10
+      IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +NCONTS,IDBSIZ,IDEV)
+      WRITE(IDEV,10011)NGELS,NCONTS
+10011 FORMAT(' Number of gel readings',I5,' Number of contigs',I5)
+20    CONTINUE
+      CALL YESNO(ANS,'Select contigs',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0) GO TO 45
+      N=IDBSIZ-NCONTS
+25    CONTINUE
+      CALL YESNO(ANS,'Show gel readings in positional order',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)GO TO 41
+      WRITE(IDEV,1009)
+1009  FORMAT(' CONTIG LINES')
+      WRITE(IDEV,1000)
+1000  FORMAT(' CONTIG      LINE  LENGTH               ENDS'/
+     +'                                     LEFT   RIGHT')
+      DO 30 I=N,IDBSIZ-1
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+30    CONTINUE
+1007  FORMAT( ' ',12X,I4,2X,I6,10X,I6,2X,I6)
+      WRITE(IDEV,1008)
+1008  FORMAT(' GEL LINES')
+      WRITE(IDEV,1001)
+1001  FORMAT(' NAME      NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                     LEFT   RIGHT')
+      DO 40 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        WRITE(IDEV,1006)NAMARC,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+40    CONTINUE
+      RETURN
+C
+41    CONTINUE
+C
+C   SORTED DATA
+      DO 43 I=N,IDBSIZ-1
+        WRITE(IDEV,1021)
+1021    FORMAT( )
+        WRITE(IDEV,1000)
+        WRITE(IDEV,1007)I,RELPG(I),LNBR(I),RNBR(I)
+        J=LNBR(I)
+        WRITE(IDEV,1001)
+42      CONTINUE
+        CALL READN(IDEVN,J,NAMARC)
+        WRITE(IDEV,1006)NAMARC,J,RELPG(J),LNGTHG(J),LNBR(J),RNBR(J)
+        J=RNBR(J)
+        IF(J.NE.0)GO TO 42
+43    CONTINUE
+      RETURN
+45    CONTINUE
+C   SELECTED CONTIGS ONLY
+C
+C   GET GEL NUMBER AND CONTIG NUMBER
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1009)
+      WRITE(IDEV,1000)
+      WRITE(IDEV,1007)LINCON,RELPG(LINCON),LNBR(LINCON),RNBR(LINCON)
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0)RETURN
+      WRITE(IDEV,1008)
+      N=LLINO
+      WRITE(IDEV,1001)
+46    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).EQ.0)GO TO 48
+      N=RNBR(N)
+      IF(RELPG(N).GT.RREG)GO TO 48
+      GO TO 46
+48    CONTINUE
+      GO TO 45
+      END
+      SUBROUTINE DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,KBIN,KBOUT,IDEVW,
+     +IDEV,LINLEN,PERCD,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,LINOU1,LINOU2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEVN,
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CHARACTER FILEH*(*)
+      PARAMETER (MAXPRM = 10)
+      CHARACTER PROMPT(2)*(MAXPRM)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2),ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      IF(IERR.NE.0) RETURN
+      CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IERR)
+      IF(IERR.NE.0) RETURN
+      IDIM2=RREG-LREG+1
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL DBSCSM(SEQ1(LREG),IDIM2,KBOUT)
+160   CONTINUE
+      ANS = 1
+      PROMPT(1) = 'List codes'
+      PROMPT(2) = 'Plot codes'
+      CALL RADION('Select results display mode',PROMPT,2,ANS,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.1) RETURN
+      IF(ANS.EQ.1) THEN
+        CALL FMTDB(SEQ1,IDIM1,LREG,RREG,LINLEN,IDEV)
+        RETURN
+      ELSE
+        CALL PLTQ(SEQ1(LREG),IDIM2,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      END
+      SUBROUTINE DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +IDIM1,GEL,IDBSIZ,TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,LLINO,
+     +PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RREG, RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,TEMP3(ID1,CHRSIZ,MAXGL2)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      CHARACTER SEQ1(IDIM1)
+      IDIM2=RREG-LREG+1
+C 28-7-91 added extra parameter mxgood: the max length of read
+C we have confidence in
+      CALL SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,LLINO,PERCD,IDBSIZ,
+     +TEMP3,ID1,CHRSIZ,MAXGL2,IDEVW,
+     +MAXGEL,LINOU1,LINOU2,MXGOOD)
+      CALL PLTQ(SEQ1(LREG),IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END
+      SUBROUTINE DBSCSM(SEQ1,IDIM1,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER CODES(5)
+      REAL X(5)
+      SAVE CODES
+      DATA CODES/'0','1','2','3','4'/
+      DO 50 J=1,5
+        X(J)=0.
+50    CONTINUE
+      DO 100 I=1,IDIM1
+        DO 60 J=1,5
+          IF(SEQ1(I).NE.CODES(J))GO TO 60
+          X(J)=X(J)+1.
+          GO TO 61
+60      CONTINUE
+61      CONTINUE
+100   CONTINUE
+      SUM=0.
+      DO 130 J=1,5
+        SUM=SUM+X(J)
+130   CONTINUE
+      DO 140 J=1,5
+        IF(SUM.NE.0)X(J)=X(J)*100./SUM
+140   CONTINUE
+      WRITE(KBOUT,1001)X(1)
+1001  FORMAT(' ',F6.2,'% OK on both strands and they agree(0)')
+      WRITE(KBOUT,1002)X(2)
+1002  FORMAT(' ',F6.2,'% OK on plus strand only(1)')
+      WRITE(KBOUT,1003)X(3)
+1003  FORMAT(' ',F6.2,'% OK on minus strand only(2)')
+      WRITE(KBOUT,1004)X(4)
+1004  FORMAT(' ',F6.2,'% Bad on both strands(3)')
+      WRITE(KBOUT,1005)X(5)
+1005  FORMAT(' ',F6.2,'% OK on both strands but they disagree(4)')
+      RETURN
+      END
+C     DBSTAT
+      SUBROUTINE DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      N=IDBSIZ-NCONTS
+      SUM=0.
+      DO 20 I=N,IDBSIZ-1
+        SUM=SUM+RELPG(I)
+20    CONTINUE
+      AV=SUM/NCONTS
+      WRITE(KBOUT,1020)SUM,AV
+1020  FORMAT( ' Total contig length ',F10.0,'  Average',
+     +' length ',F10.1)
+      SUMG=0.
+      DO 30 I=1,NGELS
+        SUMG=SUMG+FLOAT(ABS(LNGTHG(I)))
+30    CONTINUE
+      AV=SUMG/SUM
+      WRITE(KBOUT,1021)SUMG
+1021  FORMAT( ' Total characters in gel readings ',F10.0)
+      WRITE(KBOUT,1022)AV
+1022  FORMAT
+     +( ' Average gel characters per consensus character ',F10.2)
+99    CONTINUE
+      RETURN
+      END
+C     DELCON
+C
+C   DELETES CONTIG FROM CONSENSUS SEQUENCE
+      SUBROUTINE DELCON(SEQ1,ILEFT,ILC,IDIM1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1)
+C   FIRST CHAR TO REPLACE
+      I1=ILEFT-20
+C   FIRST CHAR TO MOVE
+      I2=ILEFT+ILC
+C   IS THIS RIGHTMOST CONTIG ANYWAY?
+      IF(I2.GT.IDIM1)GO TO 10
+C   NUMBER TO MOVE
+      ID=IDIM1-I2+1
+C   MOVE
+      CALL SQCOPY(SEQ1(I2),SEQ1(I1),ID)
+C   RESET LENGTH
+      IDIM1=I1+ID-1
+      RETURN
+10    CONTINUE
+C   RIGHTMOST CONTIG SO DONT MOVE
+      IDIM1=I1-1
+C
+      RETURN
+      END
+      SUBROUTINE DISMAT(SEQ,IDIM,GEL,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      CHARACTER SEQ(IDIM),GEL(IDIMG),MATCH(IDIMG)
+C   EDIT 07-02-83 TO CHECK FOR CASE WHEN GEL OVERLAPS ADJACENT
+C   CONTIGS WITHIN THE LENGTH OF THE GEL!  DONE BY HAVING A
+C   PARAMETER THAT STORES THE POSITION OF THE LEFT END OF THE
+C   NEXT CONTIG (IE THE ONE AFTER THE ONE THE CURRENT GEL OVERLAPS)
+C   SET IT TO A VERY LARGE VALUE INITIALLY
+      NEXTC=99999
+C   SORT THE MATCHING WORDS INTO ASCENDING ORDER ON POSITION IN SEQ
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+C   LOOK FOR SEPARATE MATCHES
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+C      WRITE(KBOUT,1000)SAVPG(1),SAVPS(1)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(1),
+     +SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+      DO 10 I=2,IDSAV
+      IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+C   NEW MATCH, DISPLAY IT
+C      WRITE(KBOUT,1000)SAVPG(I),SAVPS(I)
+C1000  FORMAT(' ',2I6)
+      CALL DISMAU(SEQ,IDIM,GEL,IDIMG,SAVPS(I),
+     +SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +NEXTC,KBOUT,MATCH)
+C   RESET LEND
+      LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      RETURN
+      END
+C
+C       DISMAU
+C   ROUTINE TO DISPLAY MATCHES
+C   EDITED 17-12-81 TO NOT SUBTRACT 1 FROM LCL AND LGR
+      SUBROUTINE DISMAU(SEQ,IDIM1,GEL,IDIMG,ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,NEXTC,KBOUT,MATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),GEL(IDIMG),MATCH(IDIMG)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+C   EDITED 07-02-83 FOR NEXTC (SEE DISMAT)
+C   DELETE 20 FROM END OF CONSENSUS MATCH
+      SAVPS=ISAVPS-19
+C   FIND CONTIG CONSENSUS ENDS
+      JJ=1
+      DO 5 J=2,IDCEND
+      IF(SAVPS.GT.CENDS(J))GO TO 5
+C   GONE PAST SO LAST IS THE ONE
+      JJ=J-1
+      GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C   SUBTRACT 1 FROM END
+      SAVPS=SAVPS-1
+C   LENGTH FROM MATCH TO LEFT OF CONTIG
+      LCL=SAVPS-CENDS(JJ)
+C   RIGHT
+      LCR=CENDS(JJ+1)-ISAVPS-1
+C   LEFT GEL
+      LGL=SAVPG-1
+      LGR=IDIMG-SAVPG
+C   NEED MIN OF EACH PAIR
+      LL=MIN(LCL,LGL)
+      LR=MIN(LCR,LGR)
+C   LENGTH OF OVERLAP
+      LM=LR+LL+1
+C  DISPLAY STARTS
+      ICL=ISAVPS-LL
+      IGL=SAVPG-LL
+      WRITE(KBOUT,1000)NENDS(JJ)
+1000  FORMAT(' Match found with vector number =',I6)
+      CALL SQMTCH(SEQ(ICL),GEL(IGL),MATCH,LM)
+      L=ICL-CENDS(JJ)-19
+      CALL FMT4LN(SEQ(ICL),GEL(IGL),MATCH,LM,L,IGL,KBOUT)
+C   SAVE POSN OF END OF NEXT CONTIG
+      NEXTC=CENDS(JJ+1)+20
+      RETURN
+      END
+      SUBROUTINE DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +GEL,LLINOO,LINCON,LREG,RREG,GEL2,I1,IDIM,NOPT,
+     +LLINOR,IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,
+     +MAXGEL,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER CHRSIZ
+      PARAMETER (CHRSIZ = 6)
+      PARAMETER (IDC1 = CHRSIZ*100)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,X,XLS2,XLS1,XRS2,XRS1,RREG2
+      CHARACTER MATCH(100)
+      INTEGER CHARS(CHRSIZ,100),CHARS1(IDC1)
+      CHARACTER NAMARC*10
+      CHARACTER GEL(MAXGEL)
+      CHARACTER GEL2(MAXGEL)
+      INTEGER RELPOS(10),RELPO2(10)
+      INTEGER GELC
+      INTEGER RP
+      INTEGER LSEQNO,RSEQNO
+      CHARACTER LINOUT(100)
+      CHARACTER MUNOTP
+      CHARACTER GTCONC
+      EXTERNAL GTCONC
+      EQUIVALENCE (CHARS1,CHARS)
+      CALL FILLI(CHARS1,IDC1,0)
+C   SET CONTIG NUMBER
+      ICON=1
+      LLINO=LLINOO
+      NLEN=LINLEN/10
+      LSEQNO=LREG
+      X=LINLEN+LSEQNO-1
+      RSEQNO=MIN(RREG,X)
+C   SET LEFT GEL NUMBER FOR RIGHT CONTIG
+      LN2=LLINOR
+C   FIRST GEL NO IS LLINOO
+C   SET RREG FOR RIGHT CONTIG
+      RREG2=IDIM
+C   SET UP LSEQNO,RSEQNO FOR FOR NOPT=3
+      XLS2=I1
+      XRS2=RSEQNO-LSEQNO+XLS2
+9     CONTINUE
+C   IF RIGHT CONTIG SKIP NUMBER PRINTING
+      IF(ICON.EQ.2)GO TO 8
+C   NEED TO KEEP LONGEST LINE LENGTH FOR OUTPUT OF CONSENSUS
+      IE=0
+C   SETUP AND WRITE NUMBERS
+      RELPOS(1)=LSEQNO+9
+      DO 5 I=2,NLEN
+        RELPOS(I)=RELPOS(I-1)+10
+5     CONTINUE
+      WRITE(IDEV,1023)
+     +(RELPOS(K),K=1,MIN(NLEN,MAX(1,(RSEQNO-LSEQNO+1)/10)))
+1023  FORMAT( ' ',19X,10(I9,1X))
+C   SET CURRENT LINE NUMBER
+8     CONTINUE
+      GELC=LLINO
+10    CONTINUE
+C   IS LEFT END OF CURRENT GEL >RREG
+      IF(RELPG(GELC).GT.RSEQNO)GO TO 200
+C   ALSO NEED TO KNOW IF RIGHT END  ON THIS LINE (IF .LT. NO DATA
+C   TO DISPLAY)
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      IF(X.LT.LSEQNO)GO TO 190
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+      CALL FILLC(LINOUT,LINLEN,' ')
+      CALL READN(IDEVN,GELC,NAMARC)
+C
+C   NEED TO KNOW HOW MANY CHARS TO COPY OVER TO OUTPUT LINE
+C   AND WHERE IN LINE TO PUT THEM
+C   CURRENT LINE LEFT END IS LSEQNO,RIGHT END RSEQNO
+C   SO LEFT START CHAR IS
+      X=MAX(LSEQNO,RELPG(GELC))
+C   POSITION IN ARRAY LINE
+      LP=X-LSEQNO+1
+C   RIGHT END CHAR IS
+      X=RELPG(GELC)+ABS(LNGTHG(GELC))-1
+      X=MIN(RSEQNO,X)
+C   POSITION IN ARRAY LINE
+      RP=X-LSEQNO+1
+C   LOOK FOR LONGEST LINE
+      IF(RP.GT.IE)IE=RP
+C   NEED LEFT START IN GEL
+      K=LSEQNO-RELPG(GELC)+1
+      IF(K.LT.1)K=1
+      NCOP=RP-LP+1
+      IF(NCOP.GT.0)CALL SQCOPY(GEL(K),LINOUT(LP),NCOP)
+      N=LP+NCOP-1
+      II=K-1
+      IF(IDM.EQ.26)THEN
+        DO 50 I = LP,N
+          II = II + 1
+          CALL PCON1(GEL(II),CHARS(1,I))
+50      CONTINUE
+      ELSE
+        DO 70 I=LP,N
+          II=II+1
+          JJ = INDEXS(GEL(II),JSCORE)
+          CHARS(JJ,I) = CHARS(JJ,I) + JSCORE
+70      CONTINUE
+      END IF
+      I=SIGN(GELC,LNGTHG(GELC))
+      WRITE(IDEV,1020)I,NAMARC,(LINOUT(K),K=1,RP)
+1020  FORMAT( ' ',I4,2X,A,2X,100A1)
+C
+190   CONTINUE
+C   NOW GET NEXT GEL TO RIGHT
+      GELC=RNBR(GELC)
+      IF(GELC.NE.0)GO TO 10
+200   CONTINUE
+C   CALC CONSENSUS AND WRITE IT
+      IF(IDM.EQ.26)THEN
+        DO 49 I = 1,LINLEN
+          LINOUT(I) = MUNOTP(CHARS(1,I))
+          CHARS(1,I) = 0
+49      CONTINUE
+      ELSE
+        DO 230 I=1,LINLEN
+          LINOUT(I) = GTCONC(CHARS(1,I),CHRSIZ,PERCD)
+          CALL FILLI(CHARS(1,I),CHRSIZ,0)
+230     CONTINUE
+      END IF
+      WRITE(IDEV,1019)(LINOUT(K),K=1,IE)
+C   IF REQUIRED WRITE COMPARISON GEL
+C   WHICH OPTION IN OPERATION?
+      IF(NOPT.EQ.2)GO TO 52
+      IF(NOPT.NE.3)GO TO 250
+53    CONTINUE
+C   ALREADY DONE THIS LINE CONTIG2?
+      IF(ICON.EQ.2)GO TO 54
+      ICON=2
+C   NEED TO SAVE CONSENSUS FROM LEFT CONTIG
+      CALL SQCOPY(LINOUT,GEL2,IE)
+C   SAVE VALUES FROM LEFT CONTIG
+      XLS1=LSEQNO
+      XRS1=RSEQNO
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN1=LLINO
+C   SET UP VALUES FOR RIGHT CONTIG
+      LSEQNO=XLS2
+      RSEQNO=XRS2
+C   SET LEFT GEL NUMBER
+      LLINO=LN2
+C   GET NEXT GEL
+      GO TO 150
+54    CONTINUE
+C   SAVE CURRENT LEFT GEL NUMBER
+      LN2=LLINO
+C   SET VALUES FOR RIGHT CONTIG NEXT PASS
+      XLS2=XRS2+1
+      XRS2=XLS2+LINLEN-1
+      IF(XRS2.GT.RREG2)XRS2=RREG2
+C   SET UP VALUES FOR LEFT CONTIG
+      LLINO=LN1
+      ICON=1
+      LSEQNO=XLS1
+      RSEQNO=XRS1
+C   SET DECREMENT FOR POINTER TO GEL2
+      MMM=I1-1
+52    CONTINUE
+1017  FORMAT('       NEWGEL      ',100A1)
+1018  FORMAT('       MISMATCH    ',100A1)
+1019  FORMAT('       CONSENSUS   ',100A1)
+1022  FORMAT( ' ',18X,100A1)
+      I2=I1+LINLEN-1
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(NOPT.EQ.2)WRITE(IDEV,1017)(GEL2(K),K=I1,I2)
+C   SET DECREMENT
+      IF(NOPT.EQ.2)MMM=0
+55    CONTINUE
+      CALL FILLC(MATCH,LINLEN,'*')
+      K=0
+      DO 667 J=I1,I2
+        K=K+1
+        IF(GEL2(J-MMM).EQ.LINOUT(K))MATCH(K) = ' '
+667   CONTINUE
+      WRITE(IDEV,1018)(MATCH(K),K=1,IE)
+      RELPO2(1)=(I1)+9
+      DO 240 I=2,NLEN
+        RELPO2(I)=RELPO2(I-1)+10
+240   CONTINUE
+      WRITE(IDEV,1023)(RELPO2(K),K=1,NLEN)
+      I1=I2+1
+      I2=I2+LINLEN
+      IF(I2.GT.IDIM)I2=IDIM
+      IF(I1.GT.I2)RETURN
+250   CONTINUE
+C
+      WRITE(IDEV,1021)
+1021  FORMAT( )
+C   NEXT LINE LENGTH
+C   NEXT LENGTH IS OLD RIGHT +1
+      LSEQNO=RSEQNO+1
+C   NEW RIGHT IS LEFT +LENGTH
+      RSEQNO=LSEQNO+(LINLEN)-1
+C   ARE WE OVER END OF REGION
+      IF(RSEQNO.GT.RREG)RSEQNO=RREG
+C   HAVE WE FINISHED REGION COMPLETELY
+      IF(RSEQNO.LT.LSEQNO) RETURN
+C   NOT FINISHED SO NEED TO FIND CURRENT LEFT GEL NO
+C   CURRENT LEFT GEL IS LLINO
+C
+150   CONTINUE
+C   NEED TO KNOW IF CURRENT LEFT GELS RIGHT END IS INSIDE REGION
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.LSEQNO)GO TO 9
+C   LOOK AT NEXT GEL TO RIGHT
+      LLINO=RNBR(LLINO)
+C   MAY HAVE GONE OVER END OF CONTIG?????
+      IF(LLINO.GT.0)GO TO 150
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE EC(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      PARAMETER (IDASH = 6)
+      K = 0
+      DO 10 I = 1,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.IDASH) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            GEL(I) = CHARSL(JC)
+            K = K + 1
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS CORRECTED=',K
+      END
+      SUBROUTINE ED(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = MIN(IDC,IDG),1,-1
+        JC = INDEXS(CON(I),J)
+        IF(JC.EQ.5) THEN
+          IF(I.LT.IDG) CALL SQCOPY(GEL(I+1),GEL(I),IDG-I)
+          K = K + 1
+        END IF
+10    CONTINUE
+C      WRITE(*,*)'NUMBER OF CHARS DELETED=',K
+      END
+      SUBROUTINE EDITGL(GEL,CHARS,IDIMG,KBIN,KBOUT,MAXGEL,
+     +IHELPS,IHELPE,FILEH,IDEVH,ISTART)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILEH*(*)
+      INTEGER POSN,Y
+      CHARACTER GEL(MAXGEL),CHARS(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(3)*(MAXPRM)
+1     CONTINUE
+C      CALL BELL(1,KBOUT)
+C      CALL DBMENU(5,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Insert'
+      PRMPT(2) = 'Delete'
+      PRMPT(3) = 'Change'
+      NOPT = 1
+      CALL RADION('Edit options', PRMPT, 3, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.-1)RETURN
+      IF(NOPT.EQ.3)GO TO 200
+C  INSERT, DELETE
+      MN = ISTART
+      MX = ISTART + IDIMG
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+      POSN=POSN-ISTART+1
+      MN = 1
+      MX = 50
+      NC = 1
+      CALL GETINT(MN,MX,NC,
+     +'Number of characters',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      NC = IVAL
+      Y=POSN+NC
+      K=IDIMG-POSN+1
+      IF((NOPT.EQ.2).AND.(Y.GT.IDIMG))NC=K
+C   CALC POSN IN THIS GEL TO EDIT
+      K=POSN
+C   IS THIS  DELETE?
+      IF(NOPT.EQ.4)GO TO 100
+50    CONTINUE
+      LC = NC
+      CALL FILLC(CHARS,NC,'*')
+      CALL GETSTR('characters to insert',CHARS,CHARS,NC,LC,KBOUT,KBIN,
+     +INFLAG)
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(IFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+C   MOVE THE DATA RIGHT
+      M=IDIMG
+      N=IDIMG+NC
+C   CHECK FOR OVERFLOW OF GEL
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,2000)
+2000    FORMAT(' Data pushed off end of gel during insertion')
+C       HOW MUCH OVER?
+        NM512=N-MAXGEL
+        N=MAXGEL
+C       SET M SO THAT THE DATA AT THE END IS LOST
+        M=M-NM512
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      IC=0
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        IC=IC+1
+C       DONT ALLOW SPACES - SET THEM TO *'S
+        IF(CHARS(IC).EQ.' ')CHARS(IC)='*'
+        GEL(I)=CHARS(IC)
+60    CONTINUE
+      IDIMG=MIN((IDIMG+NC),MAXGEL)
+      GO TO 1
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   DELETE
+100   CONTINUE
+C
+C   MOVE DATA TO LEFT
+      M=K+NC
+C   POSN TO MOVE TO
+      N=MAX(1,K)
+C   NUMBER TO MOVE
+      L=IDIMG-M+1
+C   NONE TO MOVE?
+      IF(L.GT.0) THEN
+C       MOVE THE DATA
+        DO 160 I=1,L
+          GEL(N)=GEL(M)
+          N=N+1
+          M=M+1
+160     CONTINUE
+      END IF
+C  RESET LENGTH
+      IDIMG=IDIMG-NC
+      GO TO 1
+200   CONTINUE
+C
+CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
+C   CHANGE
+      MN = ISTART
+      MX = ISTART + IDIMG
+      POSN = 0
+      CALL GETINT(MN,MX,POSN,
+     +'Position to edit',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) GO TO 1
+      IF(IVAL.EQ.0) GO TO 1
+      POSN = IVAL
+C  ADD IN START POSITION HERE
+      POSN=POSN-ISTART+1
+225   CONTINUE
+      LC = 50
+      NC = 0
+      CALL GETSTR('New characters',' ',CHARS,LC,NC,KBOUT,KBIN,INFLAG)
+      IF(NC.EQ.0) GO TO 1
+      IF(INFLAG.EQ.2) GO TO 1
+      IF(IFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 225
+      END IF
+C   CALC POSITION IN THIS GEL TO EDIT
+      K=POSN
+C   COPY CHARS
+      DO 230 I=1,NC
+        GEL(K)=CHARS(I)
+        K=K+1
+230   CONTINUE
+      GO TO 1
+      END
+      SUBROUTINE EDR(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,
+     +CON,IDC,IDEVW,IDEVR,LREG)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER CON(IDC)
+      INTEGER CHNRP
+      EXTERNAL CHNRP
+C  CHANGE RELATIVE POSITIONS FOR AE
+      ND = 0
+      DO 10 I = IDC,1,-1
+        IF(CON(I).EQ.'*') THEN
+          ND = ND + 1
+          K = I + LREG - 1
+          J = CHNRP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LGEL,NCONT,K)
+          IF(J.NE.0) THEN
+            CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,IDUM,JDUM,IDEVR,
+     +      IDBSIZ,J,NCONT,-1)
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF DELETIONS=',ND
+      END
+      SUBROUTINE ET(GEL,IDG,CON,IDC,K)
+      CHARACTER GEL(IDG),CON(IDC),CHARSL
+      EXTERNAL CHARSL,INDEXS
+      K = 0
+      DO 10 I = 2,MIN(IDC,IDG)
+        JC = INDEXS(CON(I),J)
+        IF(JC.NE.6) THEN
+          JG = INDEXS(GEL(I),J)
+          IF(JG.NE.JC) THEN
+            JNG = INDEXS(GEL(I-1),J)
+            JNC = INDEXS(CON(I-1),J)
+            IF(JNC.NE.JNG) THEN
+              IF((JNG.EQ.JC).AND.(JNC.EQ.JG)) THEN
+                GEL(I) = CHARSL(JNG)
+                GEL(I-1) = CHARSL(JG)
+                K = K + 1
+              END IF
+            END IF
+          END IF
+        END IF
+10    CONTINUE
+C      WRITE(*,*)' NUMBER OF CHARS TRANSPOSED=',K
+      END
+      SUBROUTINE FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND
+      STRAND = 1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHP)
+      IF(DEPTHP.LT.0) RETURN
+      STRAND = -1
+      CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTHM)
+      IF(DEPTHM.LT.0) RETURN
+      CALL PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      END
+      SUBROUTINE FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,LENCON,STRAND,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,DEPTH,STRAND
+      EXTERNAL NCDEP
+C  LREG = left contig position
+C  RREG = right ''      ''
+C  LENCON = RREG-LREG+1
+      I = LGEL
+      DEPTH = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      ELSE
+        DEPTH = -1
+        RETURN
+      END IF
+C      WRITE(*,*)'LGEL',LGEL
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            K = RELPG(I) + ABS(LNGTHG(I)) -1
+            DEPTH = MAX(NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,
+     +      STRAND,K),DEPTH)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+C      WRITE(*,*)'DEPTH',DEPTH
+      END
+C      FIND
+C
+C   SUBROUTINE TO FIND THE FIRST OCCURENCE OF A GIVEN STRING
+C   IN A GIVEN ARRAY
+C
+      SUBROUTINE FIND(SEQ,IDIM1,STRING,IDIM2,IMATCH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2),DASH
+      INTEGER PSEQ,PSTR
+      SAVE DASH
+      DATA DASH/'-'/
+      PSEQ=0
+      PSTR=1
+      IMATCH=0
+C
+100   CONTINUE
+C
+C   PUT PSEQ TO WHERE THIS FAILED MATCH STARTED
+      PSEQ=PSEQ+1-PSTR
+C
+400   CONTINUE
+C
+      PSTR=0
+C
+500   CONTINUE
+C
+C   POINT TO NEXT SEQ CHAR
+      PSEQ=PSEQ+1
+C   TEST FOR END
+      IF(PSEQ.GT.IDIM1)GO TO 300
+C   POINT TO NEXT STRING CHAR
+      PSTR=PSTR+1
+C   TEST FOR DASH IN STRING
+      IF(STRING(PSTR).EQ.DASH)GO TO 450
+C   TEST FOR DASH IN SEQ
+      IF(SEQ(PSEQ).EQ.DASH)GO TO 400
+C   TEST FOR MATCH
+      IF(SEQ(PSEQ).NE.STRING(PSTR))GO TO 100
+C
+450   CONTINUE
+C
+C   TEST FOR END OF STRING IE. WHOLE STRING MATCH
+      IF(PSTR.LT.IDIM2)GO TO 500
+C   HAVE MATCH. GET POINTER TO WHERE IT STARTED
+      IMATCH=PSEQ-IDIM2+1
+C
+300   CONTINUE
+      RETURN
+      END
+      SUBROUTINE FMT4LP(SEQ1,SEQ2,IDIM,ISW,ISX,IDEV,NAME1,NAME2)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(60),NAME1*(*),NAME2*(*)
+      INTEGER KL(6)
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+10    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+      N=IE-IS+1
+      N=1+(N-1)/10
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,N
+        KL(J)=ISWW
+        ISWW=ISWW+10
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+      WRITE(IDEV,1002)NAME1,(SEQ1(K),K=IS,IE)
+      IL = IE - IS + 1
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1003)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)NAME2,(SEQ2(K),K=IS,IE)
+1002  FORMAT(2X,A,2X,6(10A1,1X))
+ 1003 FORMAT(10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,N
+        KL(J)=ISXX
+        ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+1001  FORMAT( 5X,6(I6,5X))
+      IF(IE.LT.IDIM) GO TO 10
+      END
+      SUBROUTINE FMTDB(SEQ1,IDIM,ISW,ISE,LINLEN,IDEV)
+C   NOTE SAME AS FMTSEP!
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM)
+      INTEGER KL(12)
+      ISWW=ISW-1
+      IE=ISW-1
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.ISE)IE=ISE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,MIN(IE-IS+1,LINLEN)/10)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.ISE)RETURN
+      GO TO 1
+      END
+      SUBROUTINE FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   AUTHOR: RODGER STADEN
+C   STORES THEIR POSITIONS IN CENDS AND THEIR LEFT LINE NUMBERS IN NENDS
+      CHARACTER SEQ(IDIM),DC(3)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      EXTERNAL IFROMC,INDEXA
+      IDCEND=0
+      DO 10 I=1,IDIM
+        IF(SEQ(I).NE.'<')GO TO 10
+        IDCEND=IDCEND+1
+C       PUT POSITION OF LEFT END OF CONTIG IN CENDS
+        CENDS(IDCEND)=I
+        K = INDEXA(SEQ(I),20,'.')
+        IF(K.EQ.0) THEN
+           WRITE(KBOUT,*)'Error in contig title: no dot!'
+           IDCEND = 0
+           RETURN
+         END IF
+        K = K + I
+C        K=I+11
+        DO 5 J=1,3
+          DC(J)=SEQ(K)
+          K=K+1
+5       CONTINUE
+        NENDS(IDCEND)=IFROMC(DC,3,KBOUT)
+10    CONTINUE
+C     STORE POSITION OF LAST CHAR +1 TO SIMPLIFY DISPLAY ROUTINES
+      CENDS(IDCEND+1)=IDIM+1
+      RETURN
+      END
+      INTEGER FUNCTION GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +IDBSIZ,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      GCLIN = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          GCLIN = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      INTEGER FUNCTION GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LLINO,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C   SEARCHES FOR ARCHIVE NAMES
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAME1*11,NAME2*10,NAME3*11,NFLAG
+      PARAMETER (NFLAG='/')
+      NAME3 = ' '
+      IF(LLINO.NE.0) THEN
+        NAME3(1:1) = NFLAG
+        CALL READN(IDEVN,LLINO,NAME3(2:))
+      END IF
+      GELID = 0      
+10    CONTINUE
+      L = 0
+      IF(LLINO.NE.0) L = 11
+      CALL GTSTR('Contig identfier',NAME3,
+     +NAME1,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(INFLAG.EQ.3) THEN
+        GELID = LLINO
+        RETURN
+      END IF
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NAME1(1:1).EQ.NFLAG) THEN
+        CALL CCASE(NAME1,1)
+        DO 20 I=1,NGELS
+          CALL READN(IDEVN,I,NAME2)
+          CALL CCASE(NAME2,1)
+          IF(NAME1(2:11).EQ.NAME2) THEN
+            GELID = I
+            RETURN
+          END IF
+20     CONTINUE
+        WRITE(KBOUT,1004)NAME1(2:)
+1004    FORMAT(' ',A,' is not in the database!')
+      ELSE
+        CALL RJST(NAME1)
+        READ(NAME1,1001,ERR=10,END=10)GELID
+1001    FORMAT(I11)
+        IF((GELID.LT.1).OR.(GELID.GT.NGELS)) THEN
+          CALL ERROM(KBOUT,'Illegal gel reading number')
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +NCONTS,GEL,MAXGEL,IDEV3,IDEV4,IDEV5,IDEV1,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER  GEL(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CHARACTER NAMARC*10
+      FILNAM = ' '
+      CALL OPENF1(IDEV5,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for names of extracted gel readings',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(I,'Extract ends of contigs only',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.0) RETURN
+      IF(I.EQ.0) GO TO 15
+      DO 10 I=1,NGELS
+        L=ABS(LNGTHG(I))
+        IF(L.GT.0)THEN
+          CALL READN(IDEV4,I,NAMARC)
+          WRITE(KBOUT,1002)NAMARC
+1002      FORMAT(' ',A)
+          WRITE(IDEV5,1003)NAMARC
+1003      FORMAT(A)
+          FILNAM = NAMARC
+          CALL OPENRS(IDEV1,FILNAM,IOK,LRECL,1)
+          IF(IOK.NE.0) GO TO 100
+          CALL READW(IDEV3,I,GEL,MAXGEL)
+          IF(LNGTHG(I).LT.0)THEN
+            CALL SQREV(GEL,L)
+            CALL SQCOM(GEL,L)
+          END IF
+          CALL FMTDKN(IDEV1,GEL,L)
+          CLOSE(UNIT=IDEV1)
+      END IF
+10    CONTINUE
+      RETURN
+15    CONTINUE
+C   NUMBER OF LINES TO PROCESS
+      N=IDBSIZ-NCONTS
+      DO 20 I=N,IDBSIZ-1
+        JL=LNBR(I)
+        JR=RNBR(I)
+        CALL READN(IDEV4,JL,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        FILNAM = NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JL,GEL,MAXGEL)
+        L=ABS(LNGTHG(JL))
+        IF(LNGTHG(JL).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+        IF(JR.EQ.JL)GO TO 20
+        CALL READN(IDEV4,JR,NAMARC)
+        WRITE(KBOUT,1002)NAMARC
+        WRITE(IDEV5,1003)NAMARC
+        CALL OPENRS(IDEV1,NAMARC,IOK,LRECL,1)
+        IF(IOK.NE.0) GO TO 100
+        CALL READW(IDEV3,JR,GEL,MAXGEL)
+        L=ABS(LNGTHG(JR))
+        IF(LNGTHG(JR).LT.0)THEN
+          CALL SQREV(GEL,L)
+          CALL SQCOM(GEL,L)
+        END IF
+        CALL FMTDKN(IDEV1,GEL,L)
+        CLOSE(UNIT=IDEV1)
+20    CONTINUE
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error opening file for extracted gel reading'
+      RETURN
+      END
+      SUBROUTINE GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),GELID
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER FILEH*(*)
+      EXTERNAL GELID
+      IERR = 1
+      NCONTC = GELID(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LLINO,
+     +IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.4) RETURN
+      IF(NCONTC.EQ.0) RETURN
+      IGELNO = NCONTC
+      IF(LNBR(NCONTC).NE.0) THEN
+        WRITE(KBOUT,1013)RELPG(NCONTC)
+1013    FORMAT(' Position of this reading=',I6)
+25      CONTINUE
+        NCONTC = LNBR(NCONTC)
+        IF(LNBR(NCONTC).NE.0) GO TO 25
+        WRITE(KBOUT,1014)NCONTC
+1014    FORMAT( ' Number of leftmost reading this contig=',I6)
+      END IF
+30    CONTINUE
+      N = IDBSIZ - NCONTS
+      DO 20 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.NCONTC) THEN
+          LINCON=J
+          GO TO 21
+        END IF
+20    CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' No contig line for this gel! Fix the database')
+      RETURN
+21    CONTINUE
+      LLINO = NCONTC
+      IERR = 0
+      END
+      SUBROUTINE GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LINCON,LLINO,IGELNO,IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+      END
+      SUBROUTINE GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LEFTMN,RIGHTM,LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER LREG,RREG,RIGHTM
+      CHARACTER FILEH*(*)
+40    CONTINUE
+      MN = LEFTMN
+      MX = RIGHTM
+      LREG = MN
+      CALL GETINT(MN,MX,LREG,
+     +'Start position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LREG = IVAL
+      MN = LREG
+      MX = RIGHTM
+      RREG = MX
+      CALL GETINT(MN,MX,RREG,
+     +'End position in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      RREG = IVAL
+C   NOW FIND FIRST GEL THAT OVER LAPS
+50    CONTINUE
+      X=RELPG(LLINO)+(ABS(LNGTHG(LLINO)))-1
+      IF(X.GE.LREG)GO TO 60
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 50
+60    CONTINUE
+      RETURN
+      END
+      SUBROUTINE GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+      CHARACTER T
+        IF(T.EQ.'0') THEN
+          YF = Y0
+          YT = Y0
+        ELSE IF(T.EQ.'1') THEN
+          YF = Y0
+          YT = YM1
+        ELSE IF(T.EQ.'2') THEN
+          YF = Y0
+          YT = YP1
+        ELSE IF(T.EQ.'3') THEN
+          YF = YP1
+          YT = YM1
+        ELSE IF(T.EQ.'4') THEN
+          YF = YP2
+          YT = YM2
+        END IF
+      END
+      CHARACTER*1 FUNCTION GTCONC(COUNTS,IDM,CUT)
+      INTEGER IDM
+      INTEGER COUNTS(IDM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU
+C 8-4-92 made this routine sum counts
+      GTCONC = '-'
+      ISUM = 0
+      DO 5 I=1,IDM
+        ISUM = ISUM + COUNTS(I)
+ 5    CONTINUE
+      IF(ISUM.EQ.0.) RETURN
+      Y = ISUM
+      DO 10 I = 1,IDM - 1
+        X = REAL(COUNTS(I))/Y
+        IF(X.GE.CUT) THEN
+          GTCONC = CHARSU(I)
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE HIGHLT(GELSAV,NAMSAV,NUMSAV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV1,IDEV2,
+     +FILNAM,IOK)
+      CHARACTER LINEIN*119,CONSEN*100
+      CHARACTER GELNO*4,GEL*100,GELSAV*100,GELNAM*10
+      CHARACTER NAMSAV*10,NUMSAV*4
+      CHARACTER FILNAM*(*),HELPF*(*)
+      DIMENSION GELSAV(50),NAMSAV(50),NUMSAV(50)
+      CHARACTER PLUS*4,MINUS*4
+      EQUIVALENCE (LINEIN(2:2),GELNO),(LINEIN(8:8),GELNAM)
+      EQUIVALENCE (LINEIN(20:20),GEL)
+      EXTERNAL NOTIRL
+      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File containing contig display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      FILNAM = ' '
+      CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File for problem display',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+C   FORMAT:
+C
+C12345678901234567890       10        20         30    ETC
+C   12  GELNAM0000  CAGACGCGCGCGCGCGCGGATATAGTCTCTCCGCTCT
+C  100  GELNAM0000       TGATACGCTCGCTCTCTCTCTCTCTCTCTTTC
+C                   AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C                           70        80       ETC
+C   12  GELNAM0000     AAAAAAAAAAAAAAAAAAAAAAAAAAAA
+C
+C
+      LIN = 1
+      CALL GTSTR('plus strand symbol',':',PLUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) PLUS = ':'
+      LIN = 1
+      CALL GTSTR('minus strand symbol','.',MINUS,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) MINUS = '.'
+C   COUNT LINE NUMBERS
+      LINNO=0
+10    CONTINUE
+C   READ LINE OF NOS
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+1003  FORMAT(A)
+C   WRITE IT OUT AGAIN
+      WRITE(IDEV2,1003)LINEIN
+C   ZERO GEL COUNT FOR THIS STRIP
+      IGEL=0
+20    CONTINUE
+C
+C   READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+C       LINEIN=' '
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+C   WHAT SORT OF LINE?  ONLY A GEL WILL HAVE NON BLANK CHARS AT THE LEFT END
+      IF(LINEIN(2:5).NE.' ')THEN
+C   GEL LINE SO SAVE
+      IGEL=IGEL+1
+      GELSAV(IGEL)=GEL
+      NAMSAV(IGEL)=GELNAM
+      NUMSAV(IGEL)=GELNO
+      GO TO 20
+      END IF
+C   MUST BE CONSENSUS
+      CONSEN=GEL
+C   PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+      DO 50 I=1,IGEL
+C       WHERE DOES DATA START AND END?
+        IFIRST=1
+40      CONTINUE
+        IF(GELSAV(I)(IFIRST:IFIRST).NE.' ')GO TO 45
+        IFIRST=IFIRST+1
+        IF(IFIRST.LE.100)GO TO 40
+C       ERROR --- NO DATA FOUND
+        WRITE(KBOUT,1004)LINNO
+1004    FORMAT(' Error on line',I6,' of file')
+        RETURN
+45      CONTINUE
+C       NOW WHERE DOES IT END
+        ILAST=NOTIRL(GELSAV(I),100,' ')
+C       COMPARE WITH CONSENSUS
+        READ(NUMSAV(I),'(I4)',ERR=900)INTEG
+        IF(INTEG.GE.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,PLUS)
+        IF(INTEG.LT.0)CALL IDTOD(CONSEN,GELSAV(I),IFIRST,ILAST,MINUS)
+        WRITE(IDEV2,1008)NUMSAV(I),NAMSAV(I),GELSAV(I)(1:ILAST)
+1008    FORMAT(' ',A,2X,A,2X,A)
+50    CONTINUE
+      WRITE(IDEV2,1009)CONSEN
+1009  FORMAT(' ',18X,A)
+1006  FORMAT( )
+C     READ A BLANK LINE
+      READ(IDEV1,1003,END=100)LINEIN
+      LINNO=LINNO+1
+      WRITE(IDEV2,1003)LINEIN
+C     NO GO BACK FOR THE NEXT LINE OF NUMBERS
+      GO TO 10
+100   CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished')
+      RETURN
+900   WRITE(KBOUT,*)'Error reading gel number'
+      END
+      SUBROUTINE IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +NCONTS,IX,IY,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +DBTDUX,DBTDUY,NCONT,IGEL,IS)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER CHNRP1
+      EXTERNAL CWORLD,CHNRP1
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      XX = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+      YX = CWORLD(IY,MARGB,MARGT,YMIN,YMAX)
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        IF((XX.GT.XF).AND.(XX.LT.XT)) THEN
+          IS = NINT(((XX-XF)/(XT-XF)) * RELPG(I))
+          JGEL = LNBR(I)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,JGEL,IS)
+          NCONT = I
+          RETURN
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      IGEL = 0
+      NCONT = 0
+      END
+      SUBROUTINE IDTOD(TOPLIN,GEL,IFIRST,ILAST,SYMBOL)
+      CHARACTER TOPLIN*100,GEL*100,SYMBOL*4
+      DO 10 I=IFIRST,ILAST
+        IF(GEL(I:I).EQ.TOPLIN(I:I))GEL(I:I)=SYMBOL(1:1)
+10    CONTINUE
+      END
+      INTEGER FUNCTION INDEXS(C,S)
+      PARAMETER (IDM = 29)
+      CHARACTER C
+      INTEGER POINTS(0:255),SCORES(IDM),IND(IDM),S
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      SAVE SCORES,IND
+      DATA 
+     +IND/1,2,3,4,1,2,3,4,1,2,3,4,1,2,3,4,6,6,6,6,6,6,1,2,3,4,5,5,6/
+C      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  changed 28-7-91 to give 10 to old zeroes and 100 to lowercase
+      DATA SCORES/
+     +100,100,100,100,
+     +75,75,75,75,
+     +100,100,100,100,
+     +100,100,100,100,
+     +10,10,10,10,10,10,
+     +100,100,100,100,100,100,10/
+      I = ICHAR(C)
+      I = POINTS(I)
+      S = SCORES(I)
+      INDEXS = IND(I)
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP FOR SHOTGUN SEQUENCING
+      SUBROUTINE INITS
+C  AUTHOR RODGER STADEN
+      INTEGER POINTS(0:255)
+      PARAMETER (IDM = 29)
+      CHARACTER DUP*29
+      COMMON /SHOTC/POINTS
+      SAVE /SHOTC/
+      DATA DUP/'CTAG1234DVBHKLMNRY5678ctag*,-'/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-4 FOR ACGT
+C                 acgt
+C                 1234
+C                 BDHV
+C                 KLMN
+C      5 FOR      *
+C      6 FOR      5678- AND ELSE
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 6
+C
+        DO 30 I = 0,255
+          POINTS(I) = IDM
+30      CONTINUE
+        DO 35 I = 1,IDM
+          J = ICHAR(DUP(I:I))
+          POINTS(J) = I
+35      CONTINUE
+      END
+      FUNCTION INLIST(LIST,IDLIST,ITEM)
+C   AUTHOR: RODGER STADEN
+C   SENT LIST LIST, AND ITEM ITEM. IF IN LIST RETURNS ELEMENT NUMBER, ELSE 0
+      INTEGER LIST(IDLIST)
+      INLIST=0
+      DO 1 I=1,IDLIST
+        IF(LIST(I).NE.ITEM)GO TO 1
+        INLIST=I
+      RETURN
+1     CONTINUE
+      RETURN
+      END
+      SUBROUTINE IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH,X,Y,KBOUT,
+     +IGEL,IOK)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      IOK = 1
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      YINCO2 = STRAND*YINC/2.
+      I = LGEL
+      IGEL = 0
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            IF((X.GE.XF).AND.(X.LE.XT)) THEN
+              IGEL = I
+              IF((Y.GE.YF-YINCO2).AND.(Y.LE.YF+YINCO2)) THEN
+                IOK = 0
+                RETURN
+              END IF
+            END IF
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+     +IDBSIZ,IDEV,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPE1,HELPS2,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS
+      INTEGER LREG,RREG,X,RELX,RIGHTM
+      INTEGER LNGTHG(IDBSIZ),RNBR(IDBSIZ),LNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PRMPT(5)*(MAXPRM)
+15    CONTINUE
+      WRITE(KBOUT,1001)
+1001  FORMAT(
+     +' The joint is the position in the left contig that',/,
+     +' the first character of the right contig overlaps.')
+      MN = 1
+      MX = RELPG(LNCONL)
+      RELX = 0
+      CALL GETINT(MN,MX,RELX,
+     +'Position of join',
+     +IVAL,KBIN,KBOUT,HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.EQ.0) RETURN
+      RELX = IVAL
+      I1=1
+      I2=LINLEN
+      IF((RELPG(LNCONR)).LT.I2)I2=(RELPG(LNCONR))
+      CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,
+     +LLINOL,LNCONL,RELX,RELPG(LNCONL),GEL,I1,I2,3,LLINOR,
+     +IDBSIZ,IDEV,KBOUT,IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+30    CONTINUE
+      CALL BELL(1,KBOUT)
+C      CALL DBMENU(3,NOPT,HELPS1,HELPE1,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PRMPT(1) = 'Complete join'
+      PRMPT(2) = 'Edit left contig'
+      PRMPT(3) = 'Display join'
+      PRMPT(4) = 'Edit right contig'
+      PRMPT(5) = 'Move join'
+      NOPT = 1
+      CALL RADION('Join options', PRMPT, 4, NOPT, IHELPS,
+     +     IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.LT.1)RETURN
+100   CONTINUE
+      IF(NOPT.EQ.5)GO TO 15
+      IF(NOPT.EQ.2)THEN
+C       EDIT LEFT CONTIG
+        IMAXL=RELX
+        IMAXR=MIN(RELPG(LNCONL),(RELX+RELPG(LNCONR)-1))
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LNCONL,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+200   CONTINUE
+      IF(NOPT.EQ.4)THEN
+C       EDIT RIGHT CONTIG
+        IMAXL=1
+        IMAXR=MIN((RELPG(LNCONL)-RELX+1),RELPG(LNCONR))
+        CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL2,
+     +  LNCONR,IMAXL,IMAXR,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,
+     +  HELPS2,HELPE2,FILEH,IDEVH,MAXGEL)
+      END IF
+300   CONTINUE
+      IF(NOPT.EQ.3)THEN
+C       DISPLAY
+        LLINO=LLINOL
+        LEFTMN=RELX
+        RIGHTM=RELPG(LNCONL)
+        IF((RIGHTM-LEFTMN+1).GT.RELPG(LNCONR))RIGHTM=
+     +  LEFTMN+RELPG(LNCONR)-1
+        CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LEFTMN,RIGHTM,LREG,RREG,
+     +  LNCONL,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS1,HELPE1,FILEH,IDEVH,IERR)
+        IF(IERR.NE.0) GO TO 30
+        I1=(LREG-RELX)+1
+        I2=(RREG-RELX)+1
+        CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,GEL2,LLINO,
+     +  LNCONL,LREG,RREG,GEL,I1,I2,3,LLINOR,IDBSIZ,IDEV,KBOUT,
+     +  IDEVW,IDEVN,LINLEN,PERCD,MAXGEL,IDM)
+      END IF
+40    CONTINUE
+      IF(NOPT.EQ.1)THEN
+C       COMPLETE JOIN
+        CALL YESNO(ANS,'Sure',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.LT.0) RETURN
+        IF(ANS.NE.0) GO TO 30
+C       ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+        N=LLINOR
+        RELPG(N)=RELX
+50      CONTINUE
+        IF(RNBR(N).EQ.0)GO TO 60
+        N=RNBR(N)
+        RELPG(N)=RELPG(N)+RELX-1
+        GO TO 50
+60      CONTINUE
+C
+C       FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+        LNBR(LLINOR)=RNBR(LNCONL)
+C       FIX UP RIGHT GEL OF LEFT CONTIG
+        N=RNBR(LNCONL)
+        RNBR(N)=LLINOR
+        CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C       MERGE DOES NOT WRITE TO DISK
+        N=LNBR(LNCONL)
+65      CONTINUE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+        N=RNBR(N)
+        IF(N.NE.0)GO TO 65
+C       CONTIG LINES
+        X=RELPG(LNCONR)+RELX-1
+        IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C       NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+        N=IDBSIZ-NCONTS
+        M=LNCONR-N
+        IF(M.EQ.0)GO TO 80
+        K=LNCONR
+        J=LNCONR-1
+        DO 70 I=1,M
+          RELPG(K)=RELPG(J)
+          LNGTHG(K)=LNGTHG(J)
+          LNBR(K)=LNBR(J)
+          RNBR(K)=RNBR(J)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+          K=K-1
+          J=J-1
+70      CONTINUE
+80      CONTINUE
+        NCONTS=NCONTS-1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        RETURN
+      END IF
+      GO TO 30
+      END
+C     LINEUP
+C
+C   TAKES 2 SEQS SET OF MATCHES AND PRODUCES LINED UP SEQS
+C   FINDS IF WE HAVE A LEFT OVERLAP
+C   RETURNS POSITION OF JOINT. THIS IS RELATIVE TO THE CONTIG
+C   FOR MOST MATCHES BUT I RELATIVE TO THE GEL FOR A LEFT OVERLAP
+      SUBROUTINE LINEUP(SEQG,SEQC,SEQG2,SEQC2,IDC,IDG,IDOUT,
+     1MATG,MATC,MATL,IP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQG(IDG),SEQC(IDC),SEQG2(IDOUT),SEQC2(IDOUT),PAD
+      INTEGER MATG(IP),MATC(IP),MATL(IP)
+      SAVE PAD
+      DATA PAD/','/
+      IFAIL=0
+C   ZERO PADDING CHARS IN CONTIG (GEL DONE AT END BY DIFFERENCE
+C   IN INPUT AND OUTPUT LENGTHS)
+      ITOTPC=0
+C   FILL OUTPUT WITH PADDING
+      DO 10 I=1,IDOUT
+        SEQG2(I)=PAD
+        SEQC2(I)=PAD
+10    CONTINUE
+      NMTCH=0
+C   SET INITIAL POINTERS TO OUTPUT
+C   CONSENSUS
+      IS1=1
+C   GEL
+      IS2=1
+C   FIND DISTANCE FROM LEFT MATCH IN GEL TO LEFT OF GEL
+      IG2=MATG(1)-1
+      IF(IG2.EQ.0)THEN
+C       THE LEFT END OF THE GEL MATCHES SO THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET JOINT
+        JOINT=MATC(1)
+C       SKIP NEXT SECTION
+        GO TO 50
+      END IF
+C   FIND DISTANCE FROM LEFT MATCH IN CONTIG TO LEFT OF CONTIG
+      IC2=MATC(1)-1
+C   GET DISTANCE FROM FIRST MATCH IN CONTIG TO FIRST MATCH IN GEL.
+C   IF THIS DISTANCE <0 THEN WE HAVE A LEFT OVERLAP
+      IC1=IC2-IG2+1
+      IF(IC1.GT.0)THEN
+C       THIS IS NOT A LEFT OVERLAP
+C       SET TYPE
+        ITYPE=-1
+C       SET LEFT END
+        JOINT=IC1
+C       COPY THE GEL UPTO THE FIRST MATCH, INTO THE OUTPUT ARRAY
+C       CHECK FOR OVERFLOW
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+C       COPY THE CONTIG FOR THE SAME REGION
+        IF(IG2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(IC1),SEQC2(1),IG2)
+        IS1=IS1+IG2
+        IS2=IS2+IG2
+        GO TO 50
+      END IF
+C   MUST BE LEFT END OVERLAP
+C   SET TYPE
+      ITYPE=1
+C   SET POSITION OF JOINT RELATIVE TO GEL
+      JOINT=ABS(IC1)+2
+C   COPY OVER THE GEL UPTO THE JOINT
+C   CHECK FOR OVERFLOW
+      IF(IG2.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(1),SEQG2(1),IG2)
+      IS2=IS2+IG2
+C   WE MAY ALSO HAVE MISMATCHING
+C   DATA AT THE JOIN SO DEAL WITH THAT NOW
+C   IF IC2 >0 THE LEFT END OF THE CONTIG MATCHES THE GEL BUT OTHERWISE
+C   WE HAVE SOME MISMATCHED DATA TO DEAL WITH - WE NEED TO TRANSFER
+C   THE MISMATCHED REGION OF THE CONTIG TO THE OUTPUT ARRAY
+      IF(IC2.GT.0)THEN
+        IF(IC2.GT.MAXGEL)GO TO 700
+        CALL SQCOPY(SEQC(1),SEQC2(1),IC2)
+        IS1=IS1+IC2
+      END IF
+C   WHEN WE GET HERE WE HAVE SORTED OUT THE LEFT ENDS FOR LEFT OVERLAP
+C   AND MISMATCHED LEFT ENDS, WE NOW DEAL WITH THE REST OF THE SEQUENCE
+C   STARTING WITH THE FIRST BLOCK OF IDENTITY
+C
+C IG1 POSITION IN INPUT GEL
+C IS2 POSITION IN OUTPUT GEL
+C IC1 POSITION IN INPUT CONTIG
+C IS1 POSITION IN OUTPUT CONTIG
+C LG1 POSITION OF END OF CURRENT MATCH IN OUTPUT GEL
+C LC1 POSITION OF END OF CURRENT MATCH IN OUTPUT CONTIG
+C LG2 DISTANCE FROM CURRENT MATCH IN INPUT GEL TO NEXT MATCH
+C LC2 DISTANCE FROM CURRENT MATCH IN INPUT CONTIG TO NEXT MATCH
+C
+50    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   COPY NEXT MATCH
+      IG1=MATG(NMTCH)
+      IC1=MATC(NMTCH)
+      L=MATL(NMTCH)
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(IG1),SEQG2(IS2),L)
+C   CHECK FOR OVERFLOW
+      IF(IS1+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQC(IC1),SEQC2(IS1),L)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+L
+      IS2=IS2+L
+C   END OF CURRENT MATCH
+      LG1=IG1+L
+      LC1=IC1+L
+C   ANY MORE MATCHES
+      IF(NMTCH.EQ.IP)GO TO 500
+      K=NMTCH+1
+      LG2=MATG(K)-LG1
+      LC2=MATC(K)-LC1
+C   ANY DIFFERENCE IN LENGTH? IF SO WE HAVE TO PAD SO THEY BECOME THE SAME
+      L5=ABS(LG2-LC2)
+C   COUNT PADDING CHARS IN CONTIG
+      IF(LG2.GT.LC2)ITOTPC=ITOTPC+L5
+C   IF DIFFERENCE INCREMENT SHORTER
+      IF(LG2.GT.LC2)IS1=IS1+L5
+C   IF GEL NEEDS PADDING TRY TO PUT PADS NEXT TO DOUBLE CODES
+      IF(LC2.GT.LG2)CALL PADCOP(SEQG,SEQG2,
+     +LG1,MATG(K),L5,IS2,LG2,MAXGEL,IFAIL,KBOUT,SEQC,LC1)
+C   CHECK FOR OVERFLOW
+      IF(IFAIL.EQ.1)GO TO 700
+C   NOW COPY MISSMATCHED REGION
+C   CHECK FOR OVERFLOW
+      IF(IS2+LG2-1.GT.MAXGEL)GO TO 700
+      IF(LG2.GT.0)CALL SQCOPY(SEQG(LG1),SEQG2(IS2),LG2)
+C   CHECK FOR OVERFLOW
+      IF(IS1+LC2-1.GT.MAXGEL)GO TO 700
+      IF(LC2.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),LC2)
+C   POINT TO NEXT OUTPUT POSITIONS
+      IS1=IS1+LC2
+      IS2=IS2+LG2
+C   GET NEXT MATCH
+      GO TO 50
+500   CONTINUE
+C
+C   FINISH RIGHT ENDS
+C   ONLY COPY TO END OF GEL IN GEL AND TO THE SAME RELATIVE POSITION
+C   IN THE CONTIG FOR DISPLAY PURPOSES AND FOR COUNTING MISMATCH
+C   CURRENT ENDS AT LG1,LC1
+C   HOW FAR TO END OF GEL?
+C   SET M
+      M=0
+      L=IDG-LG1+1
+      IF(L.LT.1)GO TO 600
+C   CHECK FOR OVERFLOW
+      IF(IS2+L-1.GT.MAXGEL)GO TO 700
+      CALL SQCOPY(SEQG(LG1),SEQG2(IS2),L)
+C   NEED TO COPY TO END OF GEL IN CONTIG FOR DISPLAY
+C   POINT TO POSN IN CONTIG LEVEL WITH END OF GEL
+      M=LC1+L-1
+C   IS THIS OVER END OF CONTIG?
+      IF(M.GT.IDC)M=IDC
+C   NUMBER TO COPY
+      M=M-LC1+1
+C   CHECK FOR OVERFLOW
+      IF(IS1+M-1.GT.MAXGEL)GO TO 700
+      IF(M.GT.0)CALL SQCOPY(SEQC(LC1),SEQC2(IS1),M)
+600   CONTINUE
+C   COUNT PADDING IN GEL
+      ITOTPG=IS2+L-1-IDG
+C   SET NEW LENGTHS FOR RETURN TO CALLING ROUTINE
+      IDOUT=IS1+M-1
+      IDG=IS2+L-1
+      IFAIL=0
+      RETURN
+700   CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Matching region too long for routine lineup,',
+     +' alignment aborted')
+      IFAIL=1
+      RETURN
+      END
+      SUBROUTINE LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,
+     +RREG,IDEV,IDEVN,NAMARC)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)      
+      INTEGER RREG
+      CHARACTER NAMARC*(*)
+      N = LLINO
+      WRITE(IDEV,1001)
+10    CONTINUE
+      CALL READN(IDEVN,N,NAMARC)
+      WRITE(IDEV,1006)NAMARC,N,RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      IF(RNBR(N).NE.0) THEN
+        N = RNBR(N)
+        IF(RELPG(N).LE.RREG) GO TO 10
+      END IF
+1001  FORMAT(' NAME      NUMBER POSITION LENGTH     NEIGHBOURS'/
+     +'                                     LEFT   RIGHT')
+1006    FORMAT( ' ',A,2X,I4,2X,I7,2X,I5,2X,I6,2X,I6)
+      END
+C12345678901234567890
+C                          710       720       730       740       750
+C   -1  HINW.004    CGTCAGACGCACGCTGGAAAA
+      INTEGER FUNCTION LTYPE(LINE,LL,J1,J2,N,MAXDB,KBOUT)
+      CHARACTER LINE*(*),NUM*5,SPACE
+      EXTERNAL NOTRL,NOTLR
+      PARAMETER (SPACE= ' ')
+      J1 = NOTLR(LINE,LL,SPACE)
+      IF(J1.EQ.0) THEN
+C       BLANK LINE
+        LTYPE = 1
+        RETURN
+      END IF
+      IF(J1.GT.20) THEN
+C       LINE OF NUMBERS
+        LTYPE = 2
+        RETURN
+      END IF
+      IF(J1.GT.5) THEN
+C       CONSENSUS LINE
+        LTYPE = 3
+        RETURN
+      END IF
+C       SHOULD BE A SEQUENCE LINE
+      J = INDEX(LINE(J1:),SPACE)
+      NUM = SPACE
+      NUM = LINE(J1:J1+J-2)
+      CALL RJST(NUM)
+C      N = IFROMS(NUM,5,KBOUT)
+      READ(NUM,'(I5)',ERR=10) N
+      IF(N.GT.MAXDB-2) GO TO 10
+C     NUMBER ENDS AT J1+J-2
+      J1 = J1 + J - 1
+C     LOOK FOR BEGINNING OF NAME
+      J = NOTLR(LINE(J1:),LL-J1+1,SPACE)
+      N1 = J1 + J - 1
+C     LOOK FOR END OF NAME
+      J = INDEX(LINE(N1:),SPACE)
+      N2 = N1 + J - 2
+C     LOOK FOR BEGINNING OF SEQ
+      J = NOTLR(LINE(N2+1:),LL-N2,SPACE)
+      J1 = N2 + J
+      LTYPE = 4
+C     LOOK FOR END OF SEQ
+      J2 = NOTRL(LINE,LL,SPACE)
+      IF(J2.GT.N2) RETURN
+10    CONTINUE
+      LTYPE = 0
+      END
+      INTEGER FUNCTION LWRAPS(I,J)
+      K = MOD(I,J)
+      IF(K.EQ.0) K = J
+      LWRAPS = K
+      END
+C     MERGE
+C
+C   ROUTINE SENT CONTIG WHOSE GELS MAY BE OUT OF ORDER
+C   REORDERS GELS ON POSITION OF LEFT ENDS AND SETS LEFT
+C   GEL NUMBER FOR THE REORDERED CONTIG
+C
+      SUBROUTINE MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+C
+C   START AT LEFT END
+      N=LNBR(LINCON)
+      GO TO 22
+21    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT IN LIST
+      N=NR
+      IF(I1.GT.0)N=I2
+22    CONTINUE
+C   SET POINTER TO NEXT GEL TO RIGHT
+      NR=RNBR(N)
+      IF(NR.EQ.0)GO TO 30
+C   HAVENT REACHED END YET
+      I1=0
+23    CONTINUE
+C   ARE THESE 2 IN CORRECT ORDER IE N<=NR ?
+      IF(RELPG(N).LE.RELPG(NR))GO TO 21
+C   NOT IN ORDER SO CHAIN LEFT UNTIL CORRECTLY POSITIONED
+C   THEN COME BACK TO THIS POINT AND CONTINUE
+C   IF FIRST MOVE SAVE POSITION
+      IF(I1.EQ.0)I2=N
+      I1=1
+C   EXCHANGE NEIGHBOURS
+      M=RNBR(NR)
+      IF(M.NE.0)LNBR(M)=N
+      M=LNBR(N)
+      IF(M.NE.0)RNBR(M)=NR
+      RNBR(N)=RNBR(NR)
+      RNBR(NR)=N
+      LNBR(NR)=LNBR(N)
+      LNBR(N)=NR
+C   CHAIN BACK THRU LIST
+      N=LNBR(NR)
+      IF(N.EQ.0)GO TO 21
+C   END NOT REACHED
+      GO TO 23
+30    CONTINUE
+C  ALL DONE POINTER AT RIGHT GEL
+      RNBR(LINCON)=N
+      RETURN
+      END
+      SUBROUTINE MINCOM(SEQ1,IDIM1,SEQ2,IDIM2,SAV1,SAV2,SAV3,
+     +IP,MINM,KBOUT)
+C   AUTHOR: RODGER STADEN
+C
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER SAV1(IP),SAV2(IP),SAV3(IP)
+C
+      IP1=IP
+      IP=0
+C
+C   SITUATION 1
+      NT1=IDIM2-MINM
+      IES1=MINM-1
+      ISS2=NT1+1
+C
+      DO 100 I=1,NT1
+C
+C   POINT TO FIRST CHAR-1 OF SEQ2
+      ISS2=ISS2-1
+C   POINT TO LAST CHAR SEQ1
+      IES1=IES1+1
+C
+      N=0
+C
+      DO 200 J=1,IES1
+C   STORE POINTER
+      JJ=J
+C
+C   POINT TO SEQ2
+      K=ISS2+J
+C   TEST FOR EQUALITY
+      IF(SEQ1(J).NE.SEQ2(K))GO TO 220
+C   INCREMENT N
+      N=N+1
+      GO TO 200
+220   CONTINUE
+C   TEST FOR SUFFICENTLY LARGE N
+      IF(N.GE.MINM)CALL SAVIT(N,J,K,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C   RESET N TO ZERO
+      N=0
+200   CONTINUE
+C
+C   GOOD SCORE AT END?
+C   NEED TO INCREMENT POINTERS AS SAVIT EXPECTS TO BE POINTING AT NEXT
+C   MISMATCH
+      JJ=JJ+1
+      KK=K+1
+      IF(N.GE.MINM)CALL SAVIT(N,JJ,KK,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+C
+100   CONTINUE
+C
+C
+C   SITUATION 2
+      NT2=IDIM1-IDIM2+1
+C
+      DO 300 I=1,NT2
+      N=0
+C
+      DO 400 J=1,IDIM2
+C   SAVE POINTER
+      JJ=J
+C
+C   SET POINTER TO SEQ1
+      L=I+J-1
+      IF(SEQ1(L).NE.SEQ2(J))GO TO 420
+      N=N+1
+      GO TO 400
+420   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+400   CONTINUE
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+300   CONTINUE
+C
+C
+C   SITUATION 3
+      ISS1=IDIM1-IDIM2
+C
+      DO 500 I=1,NT1
+C
+C   POINT TO FIRST CHAR SEQ1
+      K=ISS1+I
+      IES2=IDIM2-I
+      N=0
+C
+      DO 600 J=1,IES2
+C   SAVE POINTER
+      JJ=J
+C
+C   POINT TO SEQ1
+      L=K+J
+      IF(SEQ1(L).NE.SEQ2(J))GO TO  620
+      N=N+1
+      GO TO 600
+620   CONTINUE
+      IF(N.GE.MINM)CALL SAVIT(N,L,J,IP,SAV1,SAV2,SAV3,IP1)
+C    TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+      N=0
+600   CONTINUE
+C
+      LL=L+1
+      JJ=JJ+1
+      IF(N.GE.MINM)CALL SAVIT(N,LL,JJ,IP,SAV1,SAV2,SAV3,IP1)
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)GO TO 5000
+500   CONTINUE
+C
+      RETURN
+5000  CONTINUE
+C   OVERFLOW
+C
+      WRITE(KBOUT,1000)IP1
+1000  FORMAT(/' TOO MANY MATCHES. LIMIT = ',I6)
+      RETURN
+      END
+      SUBROUTINE ML(PC,PG,L,N,J)
+      INTEGER PC(N),PG(N),L(N)
+      DO 10 I = J,N-1
+        PC(I) = PC(I+1)
+        PG(I) = PG(I+1)
+        L(I) = L(I+1)
+10    CONTINUE
+      END
+      SUBROUTINE MSTLKL(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      CHARACTER CHARSU
+      EXTERNAL CHARSU,INDEXS
+      DO 100 I=1,IDIM
+        J = INDEXS(SEQ(I),K)
+        SEQ(I) = CHARSU(J)
+100   CONTINUE
+      END
+      CHARACTER FUNCTION MUNOTP(IP)
+C     AUTHOR RODGER STADEN
+      CHARACTER PUP*26
+      SAVE PUP
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      MUNOTP = '-'
+      IF((IP.GT.0).AND.(IP.LT.23))MUNOTP = PUP(IP:IP)
+      END
+      INTEGER FUNCTION NCDEP(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +STRAND,RREG)
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+      NCDEP = 0
+      N = 0
+      I = IGEL
+10    CONTINUE
+      IF(I.NE.0) THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) N = N + 1
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      NCDEP = N
+      END
+      SUBROUTINE NEWDB(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,KBOUT,
+     +IDEVF,IDEVR,IDEVW,IDEVN,MAXDB,WGEL,MAXGEL,LINCON,KGEL,IOK)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER LINEIN*80,GEL*62,WGEL(MAXGEL),GELNAM*10
+      EQUIVALENCE (LINEIN(18:18),GEL)
+      EQUIVALENCE (LINEIN(8:8),GELNAM)
+      EXTERNAL LTYPE
+      LINLEN = 80
+      LINENO=0
+C     SET LENGTH OF CONTIG
+      LENCON=0
+C     SET POSITION OF RIGHTMOST CHAR OF LAST BLOCK
+      ILASTR=0
+C     SET NUMBER OF LAST GEL TO START SO WE CAN IDENTIFY RIGHT AND LEFT
+C     NEIGHBOURS
+      LASTS=0
+C     NEED TO WHERE THE LEFT OF EACH BLOCK IS. ASSUME THE FIRST GEL IN A BLOCK
+C     IS LEVEL WITH THE LEFT EDGE FOR THAT BLOCK (IT SHOULD BE). SO NEED A FLAG
+C     (IGF)=0 IF FIRST =1 IF NOT, AND A POSITION (ILEFT) TO MARK THE LEFT EDGE
+C     NEED TO SET LEFT EDGE TO 1 SO LENGTH CALCULATION WORKS FOR FIRST BLOCK
+      ILEFT=1
+10    CONTINUE
+C     NEED TO ADD LENGTH OF LONGEST LINE IN LAST BLOCK TO CONSENSUS LENGTH
+      LENCON=LENCON+ILASTR-ILEFT+1
+C     RESET POSITION OF RIGHTMOST CHAR OF LAST BLOCK
+      ILASTR=0
+C     READ LINE OF NOS
+      READ(IDEVF,1003,END=100)LINEIN
+1003  FORMAT(A)
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.NE.2) GO TO 400
+C     SET FLAG FOR NEW BLOCK
+      IGF=0
+20    CONTINUE
+C
+C     READ A LINE, COULD BE A GEL OR A CONSENSUS
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.EQ.0) GO TO 400
+C     WHAT SORT OF LINE?
+      IF(LT.EQ.1) GO TO 400
+      IF(LT.EQ.4) THEN
+        IFIRST = J1 - 17
+        ILAST = J2 - 17
+        NPOS=ABS(N)
+C       GEL LINE SO SAVE
+C       NEED TO FIND THE RIGHTMOST POSITION FOR EACH BLOCK
+        IF(ILAST.GT.ILASTR)ILASTR=ILAST
+C       IF FIRST GEL THIS BLOCK SAVE POSITION OF LEFT EDGE
+        IF(IGF.EQ.0)ILEFT=IFIRST
+C       SET FLAG FOR NOT FIRST GEL THIS BLOCK
+        IGF=1
+C       START OF GEL?
+        IF(LNGTHG(NPOS).EQ.0)THEN
+C         YES SO SET RELPG
+          RELPG(NPOS)=LENCON+IFIRST-ILEFT+1
+C          LNBR(NPOS)=LASTS
+C         SET RIGHT NEIGHBOUR OF LAST GEL TO START
+C          IF(LASTS.NE.0)RNBR(LASTS)=NPOS
+          LASTS=NPOS
+          CALL WRITEN(IDEVN,NPOS,GELNAM)
+        END IF
+C       COPY NEW SEQUENCE TO DISK
+        CALL READW(IDEVW,NPOS,WGEL,MAXGEL)
+        K1=ABS(LNGTHG(NPOS))+1
+        K2=K1+ILAST-IFIRST
+        K11=IFIRST-1
+C        WRITE(*,*)K1,K2,K11
+        DO 25 I=K1,K2
+          K11=K11+1
+          WGEL(I)=GEL(K11:K11)
+25      CONTINUE
+        CALL WRITEW(IDEVW,NPOS,WGEL,MAXGEL)
+C       UPDATE LENGTH
+        LNGTHG(NPOS)=SIGN(K2,N)
+C       GO BACK FOR NEXT LINE (CONSENSUS OR NEW GEL)
+        GO TO 20
+      END IF
+C     SHOULD BE CONSENSUS, CHECK
+      IF(LT.NE.3) GO TO 400
+C     READ WHAT SHOULD BE A BLANK LINE
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,IFIRST,ILAST,N,MAXDB,KBOUT)
+C      WRITE(*,*)LINENO,LT
+      IF(LT.NE.1) GO TO 400
+      GO TO 10
+100   CONTINUE
+C     CHECK FOR MISSING BLANK AT END OF FILE
+      IF(LT.NE.1) LENCON=LENCON+ILASTR-ILEFT+1
+      LNBR(LINCON) = KGEL
+      CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LINCON,IDBSIZ)
+C     ON INPUT LNBR(LINCON) MAY BE WRONG, BUT ON EXIT RNBR IS CORRECT
+      I = RNBR(LINCON)
+300   CONTINUE
+C      WRITE(*,*)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+      CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+      J = I
+      I = LNBR(I)
+      IF(I.NE.0) GO TO 300
+C     WRITE CONTIG LINE
+      I = LINCON
+      RELPG(I) = LENCON
+      LNGTHG(I) = 0
+      LNBR(I) = J
+C      WRITE(*,*)I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+      CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Finished, no errors found')
+C
+      IOK = 0
+      RETURN
+400   CONTINUE
+C   ERROR DETECTED, REPORT LINE NUMBER AND STOP
+      WRITE(KBOUT,5002)LINENO
+5002  FORMAT(' Error in line ',I6,' of file. Processing stopped',
+     +/,' but your database may be corrupted. Revert to a copy')
+      IOK = 1
+      RETURN
+      END
+      SUBROUTINE PADCOP(SEQG,SEQG2,LG1,MG,L5,IS2,LG2,MAXGEL,IFAIL,
+     +KBOUT,SEQC,IC1)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (NDUBL = 4)
+      CHARACTER SEQG(MAXGEL),SEQG2(MAXGEL),DUBBL(NDUBL),SEQC(MAXGEL)
+      SAVE DUBBL
+      DATA DUBBL/'D','B','V','H'/
+      JC1 = IC1
+C Make seqg2 from seqg placing L5 padding chars before position MG
+C which is the start of the next block of identity. Try to put the
+C padding either in line with consensus pads, or next to double
+C codes. The positions in seqg are LG1 to MG-1. seqg2 needs to be long
+C enough to be extended from IS2 to IS2 + L5 -1 + MGM1-LG1 +1
+C ie we add L5 pads, plus the chars between and including  LG1 and MGM1
+      IDONE=0
+C   POINT TO END OF MISMATCH
+      MGM1=MG-1
+C   MAY BE NO CHARS TO COPY
+      IF(MGM1.LT.LG1)GO TO 111
+C  Next check added 26-2-91
+      MAXREQ = IS2 + L5 - 1 + MGM1 - LG1 + 1
+      IF((MGM1.GT.MAXGEL).OR.(MAXREQ.GT.MAXGEL)) THEN
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Matching region too large for routine padcop,',
+     +  ' alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 110 J=LG1,MGM1
+        IF(IDONE.LT.L5) THEN
+          IF((JC1.GT.0).AND.(JC1.LT.MAXGEL)) THEN
+          IF(SEQC(JC1).EQ.'*') THEN
+            IS2 = IS2 + 1
+            JC1 = JC1 + 1
+            IDONE = IDONE + 1
+            GO TO 109
+          END IF
+          END IF
+          DO 108 M=1,NDUBL
+            IF(SEQG(J).EQ.DUBBL(M)) THEN
+              IS2 = IS2 + 1
+              JC1 = JC1 + 1
+              IDONE = IDONE + 1
+              GO TO 109
+            END IF
+108       CONTINUE
+109       CONTINUE
+        END IF
+        SEQG2(IS2) = SEQG(J)
+        IS2 = IS2 + 1
+        JC1 = JC1 + 1
+110   CONTINUE
+111   CONTINUE
+C   ALL CHARS COPIED. ENOUGH PADDING?
+      IF(IDONE.LT.L5)IS2=IS2+L5-IDONE
+C   IS2 SHOULD NOW BE POINTING AT NEXT CHAR
+C   ZERO LG2 TO SHOW CALLING ROUTINE COPYING DONE
+      LG2=0
+      IFAIL=0
+      END
+      SUBROUTINE PADRUN(GEL,IFIRST,ILAST,RUN)
+      CHARACTER PAD,GEL*50,RUN*50
+      SAVE PAD
+      DATA PAD/'*'/
+C  FIND PADS IN THIS GEL
+      DO 20 I = IFIRST,ILAST
+        IF(GEL(I:I).EQ.PAD) THEN
+C  IS IT AT LEAST THE SECOND IN A RUN
+          IF(RUN(I:I).EQ.PAD) THEN
+C  FIND THE LEFT END OF THE RUN
+            JJ = I
+1           CONTINUE
+            IF(JJ.GT.1)THEN
+              JJ = JJ - 1
+              IF(RUN(JJ:JJ).EQ.PAD)GO TO 1
+            END IF
+C  FOUND LEFT END OF RUN AT JJ, SO MOVE PAD IF NOT ALREADY A PAD
+5           CONTINUE
+            IF(GEL(JJ:JJ).NE.PAD)THEN
+C  EXCHANGE ELEMENTS
+              GEL(I:I) = GEL(JJ:JJ)
+              GEL(JJ:JJ) = PAD
+            ELSE
+C  ALREADY A PAD, SO TRY NEXT TO RIGHT
+              JJ = JJ + 1
+              IF(JJ.LT.I) GO TO 5
+            END IF
+          END IF
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE PCON1(CHAR,CHRSUM)
+C  AUTHOR RODGER STADEN
+C  PART OF PROTEIN 'CONSENSUS' CALCULATION
+      CHARACTER CHAR
+      INTEGER CHRSUM
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      K = CTONUM(CHAR)
+      IF(K.NE.26)THEN
+        IF(CHRSUM.EQ.0)THEN
+          CHRSUM = K
+        ELSE
+          IF(K.NE.CHRSUM)CHRSUM = -1
+        END IF
+      END IF
+      END
+      SUBROUTINE PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +NCONTS,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      YMIN = 0.
+      YMAX = ISYMAX
+      XMIN = 0.
+      LENCON = 0
+      DO 10 I = IDBSIZ-NCONTS,IDBSIZ-1
+        LENCON = LENCON + RELPG(I)
+10    CONTINUE
+      XMAX = LENCON
+      YINC = (YMAX-YMIN)/3.
+      Y = 0.
+      XF = XMIN
+      N = 0
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      DO 20 I = IDBSIZ-NCONTS,IDBSIZ-1
+        N = N + 1
+        XT = XF + RELPG(I)
+        Y = Y + YINC
+        CALL LINE(XF,XT,Y,Y,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        IF((IGEL.NE.0).AND.(I.EQ.LINCON)) THEN
+          XZ = XF + RELPG(IGEL) + ABS(LNGTHG(IGEL))/2
+          CALL LINE(XZ,XZ,YMAX,YMIN,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+        XF = XT
+        IF(N.EQ.2) THEN
+          N = 0
+          Y = 0.
+        END IF
+20    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,YMID,YINC,DEPTH)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND,DEPTH
+      XMIN = LREG
+      XMAX = RREG
+      YMAX = ISYMAX
+      YMIN = 0.
+      I = LGEL
+5     CONTINUE
+      IF(I.NE.0) THEN
+        IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.LREG) THEN
+          I = RNBR(I)
+          GO TO 5
+        END IF
+      END IF
+      N = 0
+10    CONTINUE
+      IF(I.NE.0)THEN
+        IF(RELPG(I).LE.RREG) THEN
+          IF(SIGN(1,LNGTHG(I)).EQ.STRAND) THEN
+            XF = MAX(RELPG(I),LREG)
+            XT = MIN(ABS(LNGTHG(I))+RELPG(I)-1,RREG)
+            N = N + 1
+            IF(N.GT.DEPTH) N = 1
+            YF = YMID + N * YINC
+            CALL LINE(XF,XT,YF,YF,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          END IF
+          I = RNBR(I)
+          GO TO 10
+        END IF
+      END IF
+      END
+      SUBROUTINE PLTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +MARGL,MARGR,MARGB,
+     +MARGT,ISXMAX,ISYMAX,LGEL,LREG,RREG,DEPTHP,DEPTHM)
+      INTEGER DEPTHP,DEPTHM
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER RREG,STRAND
+C  have window size margt starting at margb
+C  depths depthp, depthm
+      YMAX = ISYMAX
+      YMIN = 0.
+      XMIN = LREG
+      XMAX = RREG
+      RINC = YMAX / (DEPTHP + DEPTHM + 2)
+      RMID =(DEPTHM+1) * RINC
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XMIN,XMAX,RMID,RMID,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMIN,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL TEXT(XMAX,RMID,'*',1,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      STRAND = 1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHP)
+      STRAND = -1
+      YINC = RINC * STRAND
+      CALL PLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,LGEL,LREG,RREG,STRAND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,RMID,YINC,DEPTHM)
+      CALL VT100M
+      END
+      SUBROUTINE PLTQ(SEQ,IDIM2,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CHARACTER SEQ(IDIM2),T
+      PARAMETER (Y0  = 0.,
+     +           YP1 = 1.,
+     +           YP2 = 2.,
+     +           YM1 = -1.,
+     +           YM2 = -2.)
+      XMIN = 0.
+      XMAX = IDIM2
+      YMIN = YM2
+      YMAX = YP2
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL LINE(XIN,XMAX,Y0,Y0,XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      I = 1
+10    CONTINUE
+      XF = I
+      T = SEQ(I)
+20    CONTINUE
+      IF(SEQ(I).NE.T) THEN
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I - 1
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+      I = I + 1
+      IF(I.LT.IDIM2) GO TO 20
+        CALL GLEVEL(T,YF,YT,Y0,YP1,YP2,YM1,YM2)
+        XT = I
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XF,XT,YT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL LINE(XT,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      END
+      SUBROUTINE POSTC(GELNOS,GELSTR,GELEND,LINEIN,IERR,
+     +KBOUT,IDEVF,MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,KGEL)
+C   AUTHOR: RODGER STADEN
+      CHARACTER LINEIN*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2),IGEL(2)
+      INTEGER LNBR(MAXDB),RNBR(MAXDB),RELPG(MAXDB),LNGTHG(MAXDB)
+      EXTERNAL LTYPE
+C   GELNOS === GEL NUMBERS (UNIQUE)
+C   GELSTR === GEL START LINES
+C   GELEND === GEL END LINES
+C   LINNO  === GEL LINE NUMBERS PER STRIP
+C   IGEL   === NUMBER OF GELS PER STRIP
+C   LINENO === CURRENT LINE NUMBER
+C   IUNIQ IS NUMBER OF GELS FOUND
+C
+C  PROBLEM IS TO FIND IF THERE ARE LINES OF WORKING VERSIONS MISSING
+C  WE PROCESS THE DATA IN PAIRS OF BLOCKS OR STRIPS (A BLOCK OR STRIP IS ONE 50
+C  CHARACTER WIDTH OF THE CONTIG). IF WE FIND ALL THE GELS THAT APPEAR
+C  IN STRIP B THEN IF THEY WEREN'T PRESENT IN STRIP A THEY MUST START IN
+C  STRIP B, IF THEY DONT APPEAR IN STRIP C THEY END IN STRIP B.
+C  STRATEGY IS TO READ THRU FILE AND REMEMBER WHICH GELS APPEAR IN EACH
+C  STRIP. BY COMPARING THEM WITH THE LAST STRIP WE CAN KNOW WHICH HAVE ENDED
+C  AND WHICH HAVE STARTED. HENCE WE STORE A LIST OF GELS AND THEIR START AND
+C  END LINE NUMBERS. USING THIS WE CAN SEE IF THE LIST IS UNIQUE
+C  SUB CLIST IS SENT THE CURRENT TWO LISTS OF GEL NUMBERS AND FINDS WHICH HAVE
+C  STARTED AND ENDED AND BUILDS UP A LIST OF UNIQUE GELS AND THEIR START
+C  AND END LINE NUMBERS
+C
+      LINLEN = 80
+      IERR=0
+      JSTRIP=1
+      LINENO=0
+      ISTRIP=1
+      IUNIQ=0
+10    CONTINUE
+C     READ WHAT SHOULD BE A LINE OF NOS
+      READ(IDEVF,1003,END=100)LINEIN
+1003  FORMAT(A)
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+C     IS IT A LINE OF NOS?
+      IF(LT.NE.2) GO TO 400
+C     ZERO GEL COUNT FOR THIS STRIP
+      IGEL(ISTRIP)=0
+20    CONTINUE
+C     READ A LINE, COULD BE 1 GEL, 2 CONSENSUS OR BLANK
+      READ(IDEVF,1003,END=100)LINEIN
+      LINENO=LINENO+1
+      LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+      IF(LT.EQ.0) GO TO 400
+      IF(LT.EQ.1) GO TO 400
+      IF(LT.EQ.4) THEN
+C       GEL LINE SO SAVE
+C       INCREMENT COUNT TO NUMBER OF GEL
+        IGEL(ISTRIP)=IGEL(ISTRIP)+1
+        GELNO(IGEL(ISTRIP),ISTRIP)=N
+        LINNO(IGEL(ISTRIP),ISTRIP)=LINENO
+C       BO BACK FOR NEXT LINE (CONSENSUS OR NEW GEL)
+        GO TO 20
+      END IF
+C     SHOULD THEN BE A CONSENSUS
+      IF(LT.NE.3) GO TO 400
+C     MUST BE CONSENSUS SO PROCESS THIS STRIP
+C     PROCESS THIS STRIP OF GELS (IGEL OF THEM)
+C     WHICH GELS ARE NEW? AND WHICH HAVE ENDED
+C     IF FIRST EVER STRIP DONT DO THIS
+      IF(JSTRIP.EQ.1)THEN
+        DO 6 I=1,IGEL(1)
+          GELNOS(I)=GELNO(I,1)
+          GELSTR(I)=LINNO(I,1)
+6       CONTINUE
+        JSTRIP=JSTRIP+1
+        ISTRIP=2
+        IUNIQ=IGEL(1)
+C       READ WHAT SHOULD BE A BLANK LINE
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+C     ORER OF ARGUMENTS DEPENDS ON ISTRIP
+      IF(ISTRIP.EQ.2)THEN
+        CALL CLIST(GELNO(1,1),LINNO(1,1),IGEL(1),
+     +  GELNO(1,2),LINNO(1,2),IGEL(2),
+     +  GELNOS,GELSTR,GELEND,MAXDB,IUNIQ,KBOUT,IERR)
+        IF(IERR.NE.0) GO TO 400
+        ISTRIP=1
+C       READ WHAT SHOULD BE A BLANK LINE
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1	
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+      IF(ISTRIP.EQ.1)THEN
+        CALL CLIST(GELNO(1,2),LINNO(1,2),IGEL(2),
+     +  GELNO(1,1),LINNO(1,1),IGEL(1),
+     +  GELNOS,GELSTR,GELEND,MAXDB,IUNIQ,KBOUT,IERR)
+        IF(IERR.NE.0) GO TO 400
+        ISTRIP=2
+        READ(IDEVF,1003,END=100)LINEIN
+        LINENO=LINENO+1	
+        LT = LTYPE(LINEIN,LINLEN,J1,J2,N,MAXDB,KBOUT)
+        IF(LT.EQ.0) GO TO 400
+        IF(LT.NE.1)GO TO 400
+        GO TO 10
+      END IF
+100   CONTINUE
+C     MAY BE SOME LEFT IN CURRENT STRIP THAT HAVE NOT BEEN ENDED
+C     BUT ISTRIP WILL JUST HAVE BEEN CHANGED SO CHANGE IT BACK
+      IF(ISTRIP.EQ.1)THEN
+        ISTRIP=2
+        GO TO 101
+      END IF
+      ISTRIP=1
+101   CONTINUE
+      DO 110 I=1,IGEL(ISTRIP)
+        MATCH=INLIST(GELNOS,IUNIQ,GELNO(I,ISTRIP))
+        IF(MATCH.NE.0)THEN
+          GELEND(MATCH)=LINNO(I,ISTRIP)
+          GO TO 110
+        END IF
+C       ERROR
+        WRITE(KBOUT,1300)GELNO(I,ISTRIP)
+1300    FORMAT( ' Error: gel number ',I5,
+     +  ' expected but not found in list')
+        IERR = 1
+        GO TO 400
+110   CONTINUE
+C     AS A CHECK: ARE THE UNIQUE GELS UNIQUE?
+      DO 120 I=2,IUNIQ
+        KD=IUNIQ-I+1
+        IN=INLIST(GELNOS(I),KD,GELNOS(I-1))
+        IF(IN.EQ.0)GO TO 120
+        IERR=1
+        WRITE(KBOUT,1006)GELNOS(I-1),GELSTR(I-1),GELSTR(IN)
+1006    FORMAT(' Error: gel',I6,' has a line of data missing!'/
+     +  ' and appears to start on lines',I6,' and',I6)
+        GO TO 400
+120   CONTINUE
+      IF(IERR.EQ.0)WRITE(KBOUT,1005)
+1005  FORMAT(' Successfully checked line order and missing lines')
+C      WRITE(*,*)(GELNOS(K),K=1,IUNIQ)
+      KGEL1 = KGEL
+      KGEL = ABS(GELNOS(1))
+C     CHECK THAT THE LEFT GEL NUMBER IS UNCHANGED
+      IF(KGEL1.NE.KGEL) THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(' Left gel number of edited contig does not agree',/,
+     +  ' with original, processing aborted')
+        IERR = 1
+        RETURN
+      END IF
+      LNBR(KGEL) = 0
+      RELPG(KGEL) = 0
+      LNGTHG(KGEL) = 0
+      DO 200 I = 2,IUNIQ
+        J1 = ABS(GELNOS(I-1))
+        J2 = ABS(GELNOS(I))
+        RNBR(J1) = J2
+        LNBR(J2) = J1
+        RELPG(J2) = 0
+        LNGTHG(J2) = 0
+200   CONTINUE
+      RNBR(J2) = 0
+      IERR = 0
+      RETURN
+400   CONTINUE
+C   ERROR
+      WRITE(KBOUT,1007)LINENO
+1007  FORMAT(' Error in line',I6,', of file: lines out of order'/,
+     +' processing stopped')
+      IERR=1
+      RETURN
+      END
+      SUBROUTINE READN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      READ(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE READR(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      READ(IDEVR,REC=N)RELPG,LNGTHG,LNBR,RNBR
+      RETURN
+      END
+      SUBROUTINE READW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      READ(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE REMOVL(MATC,MATG,MATL,IP)
+C   AUTHOR: RODGER STADEN
+      INTEGER MATC(IP),MATG(IP),MATL(IP)
+C
+C   SET POINTER TO FIRST MATCH
+      NMTCH=0
+10    CONTINUE
+C   POINT TO NEXT MATCH
+      NMTCH=NMTCH+1
+C   SORT MATCHES ON LENGTH
+      IPP=IP-NMTCH+1
+      CALL BUBBL3(MATL(NMTCH),MATG(NMTCH),MATC(NMTCH),IPP)
+C   LOOK FOR END OF POSITIVES
+      DO 20 I=NMTCH,IP
+      J=I
+20    IF(MATL(I).LT.1)GO TO 30
+      J=J+1
+30    CONTINUE
+      IP=J-1
+C   END OF POSITIVES AT IP
+      IF(NMTCH.GE.IP)RETURN
+      K1=MATC(NMTCH)
+      K2=K1+MATL(NMTCH)-1
+      K3=MATG(NMTCH)
+      K4=K3+MATL(NMTCH)-1
+C   POINT TO FIRST MATCH TO TEST
+      K6=NMTCH+1
+      DO 200 I=K6,IP
+C   DO CONSENSUS FIRST
+C   OVERLAP?
+      IF(MATC(I).GT.K2)GO TO 100
+      K5=MATC(I)+MATL(I)-1
+      IF(K5.LT.K1)GO TO 100
+C   DOES OVERLAP
+C   WHICH END
+      IF(K5.LE.K2)GO TO 80
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K2-MATC(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C  MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 100
+80    CONTINUE
+C   LENGTH
+      MATL(I)=K1-MATC(I)
+100   CONTINUE
+C   NOW LOOK FOR OVERLAPS WITH GEL
+C   OVERLAP?
+      IF(MATG(I).GT.K4)GO TO 200
+      K5=MATG(I)+MATL(I)-1
+      IF(K5.LT.K3)GO TO 200
+C   DOES OVERLAP
+C   WHICH END?
+      IF(K5.LE.K4)GO TO 180
+C   LENGTH TO REDUCE MATCH BY IS IDELT
+      IDELT=K4-MATG(I)+1
+C   NEW LENGTH
+      MATL(I)=MATL(I)-IDELT
+C   MOVE LEFT ENDS
+      MATC(I)=MATC(I)+IDELT
+      MATG(I)=MATG(I)+IDELT
+      GO TO 200
+180   CONTINUE
+C   LENGTH
+      MATL(I)=K3-MATG(I)
+200   CONTINUE
+      GO TO 10
+      END
+C     SAVIT
+C
+      SUBROUTINE SAVIT(N,J,K,IP,S1,S2,S3,IP1)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1(IP1),S2(IP1),S3(IP1)
+C
+      IP=IP+1
+C   TEST FOR OVERFLOW
+      IF(IP.GT.IP1)RETURN
+      S1(IP)=N
+      S2(IP)=J-N
+      S3(IP)=K-N
+C
+      RETURN
+      END
+      SUBROUTINE SCRENR(GEL,MAXGEL,STRING,NAME,FILNAM,
+     +IDEV1,IDEV2,IDEV3,IDEV4,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER NAME*(*),FILNAM*(*),HELPF*(*)
+      CHARACTER GEL(MAXGEL),STRING(60)
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of sequences that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of recognition sequences',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+C
+      JGEL = 0
+      IGEL = 0
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+31      CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 31
+        END IF
+      ELSE
+        READ(IDEV1,1002,END=100)NAME
+      END IF
+1002  FORMAT(A)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing', JGEL,' in batch'
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV4,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(KBOUT,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV4)
+2     CONTINUE
+      IF(IDIMG.LT.1)THEN
+        WRITE(KBOUT,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL MSTLKL(GEL,IDIMG)
+3     CONTINUE
+      READ(IDEV3,1005,END=6)STRING
+1005  FORMAT(60A1)
+C   FIND LENGTH OF STRING ASSUMING NO SPACES
+      DO 4 I=1,60
+        II=I
+        IF(STRING(I).EQ.' ')GO TO 5
+4     CONTINUE
+5     CONTINUE
+      II=II-1
+      IF(II.GT.0)CALL FIND(GEL,IDIMG,STRING,II,JMATCH)
+      IF(JMATCH.EQ.0)GO TO 3
+C   A MATCH
+      WRITE(IDEV,1007)JMATCH,(STRING(K),K=1,II)
+1007  FORMAT(' Match at',I6,' with ',60A1)
+      REWIND IDEV3
+      GO TO 1
+C   NO MATCH SO SAVE
+6     CONTINUE
+      WRITE(IDEV2,1002)NAME
+      IGEL = IGEL + 1
+      REWIND IDEV3
+      GO TO 1
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGEL,' passed'
+      RETURN
+      END
+      SUBROUTINE SCRENV(MAXGEL,
+     +WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ,MAXSEQ,GEL,GELCOP,MATCH,
+     +LENGTH,
+     +SAVPS,SAVPG,SAVL,MAXMAT,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,FILNAM,NAME,IOK)
+      INTEGER POSNS(MAXSEQ),GELN(MAXGEL),WORDP(LPOWRC),SAVPS(MAXMAT)
+      INTEGER SAVPG(MAXMAT),SAVL(MAXMAT)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER FILNAM*(*),NAME*(*),HELPF*(*)
+      CHARACTER GELCOP(MAXGEL)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ(MAXSEQ),GEL(MAXGEL),MATCH(MAXGEL)
+      JGEL = 0
+      IGELS = 0
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'File for names of gel readings that pass',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      FILNAM = ' '
+      CALL OPENF1(IDEV4,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of vector sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      IDIM = MAXSEQ
+      CALL ARRFIM(IDEV4,SEQ,IDIM,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      MN = LENGTH*2
+      MX = 50
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ,IDIM,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C   IS THE VECTOR SEQUENCE IN THE CORRECT FORMAT WITH A TITLE AT THE FRONT?
+      IF(IDCEND.EQ.0)THEN
+        CENDS(1) = -19
+        NENDS(1) = 1
+        CENDS(2) = IDIM + 1
+        IDCEND = 1
+      END IF
+C      WRITE(KBOUT,9999)
+C9999  FORMAT(' VECTOR SEQUENCE REQUIRES A TITLE EG ',
+C     1' <---M13MP7.001----->')
+C      RETURN
+C      END IF
+      CALL BUSY(KBOUT)
+      CALL ENCO(SEQ,IDIM,POSNS,CONST,LENGTH)
+      CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+C
+1     CONTINUE
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAME,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        READ(IDEV1,1002,END=100)NAME
+      END IF
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1002  FORMAT(A)
+      WRITE(IDEV,1003)NAME
+1003  FORMAT(' Gel reading name ',A)
+      IDIMG=MAXGEL
+      CALL OPENRS(IDEV3,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Gel reading file not found'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV3,GEL,IDIMG,KBOUT)
+      CLOSE(UNIT=IDEV3)
+C  LONG ENOUGH ?
+      IF(IDIMG.LT.MINMAT)THEN
+        WRITE(IDEV,*)' Gel reading too short to compare'
+        GO TO 1
+      END IF
+      CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+      IMATCH=0
+2     CONTINUE
+      CALL BUSY(KBOUT)
+      CALL MSTLKL(GEL,IDIMG)
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      WRITE(IDEV,1009)ISTRAN
+1009  FORMAT(' Searching strand',I6)
+      IDSAV=MAXMAT
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ,GELCOP,MINMAT,IFAIL,KBOUT)
+      IF(IDSAV.GT.0) THEN
+        IMATCH=1
+        CALL DISMAT(SEQ,IDIM,GELCOP,IDIMG,SAVPS,SAVPG,IDSAV,
+     +  CENDS,NENDS,IDCEND,MAXCON,IDEV,MATCH)
+      END IF
+      IF(ISTRAN.EQ.1) THEN
+        CALL SQREV(GELCOP,IDIMG)
+        CALL SQCOM(GELCOP,IDIMG)
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        ISTRAN = 2
+        GO TO 2
+      END IF
+      IF(IMATCH.EQ.0) THEN
+        WRITE(IDEV2,1010)NAME
+        IGELS = IGELS + 1
+      END IF
+      GO TO 1
+1010  FORMAT(A)
+100   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' compared and ',IGELS,' passed'
+      RETURN
+      END
+      SUBROUTINE SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +IDBSIZ,IGN,NCONT,DIST)
+C  AUTHOR: RODGER STADEN
+C  SHIFTS PART OF A CONTIG FORM GEL IGN TO RIGHT END
+C  CONTIG LINE NUMBER IF NCONT
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER DIST
+      I = IGN
+10    CONTINUE
+      IF(I.NE.0)THEN
+        RELPG(I) = RELPG(I) + DIST
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        I = RNBR(I)
+        GO TO 10
+      END IF
+C  UPDATE CONTIG LENGTH
+      RELPG(NCONT) = RELPG(NCONT) + DIST
+      CALL WRITER(IDEVR,NCONT,RELPG(NCONT),LNGTHG(NCONT),
+     +LNBR(NCONT),RNBR(NCONT))
+      END
+      SUBROUTINE SLIDER(SEQ1,IDC,SEQ2,IDIM2,MS1,MS2,MAXPG,MAXPC,MINSLI,
+     +MATL,MATC,MATG,IP)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2)
+      INTEGER MATL(IP),MATC(IP),MATG(IP),P1S,P1,P2
+      IP1 = IP
+      IP = 0
+C   LEFT END S2 RELATIVE S1 - MAX PADS -2 READY FOR LOOP
+      P1S = MS1 - MS2 - MAXPC - 1
+C   TRY NSLIDE START POSNS FOR SEQ2
+      DO 100 I=1,MAXPG+MAXPC+1
+C       POINT TO SEQ1 START
+        P1S = P1S + 1
+C       POINT TO CURRENT SEQ1 POSN
+        P1 = P1S
+        N = 0
+C       COMPARE WHOLE LENGTH OF SEQ2 (IF P1 WITHIN RANGE)
+        DO 50 J=1,IDIM2
+          P2 = J
+          P1 = P1 + 1
+          IF(P1.LT.1)GO TO 50
+C         OFF RIGHT END? IF SO MAY HAVE BEEN A MATCH
+          IF(P1.GT.IDC)GO TO 40
+          IF(SEQ1(P1).EQ.SEQ2(P2))GO TO 45
+40        CONTINUE
+          IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+          N = 0
+          GO TO 50
+45        CONTINUE
+          N = N + 1
+50      CONTINUE
+C       GOOD SCORE AT END? NEED TO INCREMENT POINTERS FOR SAVIT
+        P1 = P1 + 1
+        P2 = P2 + 1
+        IF(N.GE.MINSLI)CALL SAVIT(N,P1,P2,IP,MATL,MATC,MATG,IP1)
+100   CONTINUE
+      END
+      SUBROUTINE SUBS(SEQ,IDIMS,FROM,TO)
+      CHARACTER SEQ(IDIMS),FROM,TO
+C   AUTHOR RODGER STADEN
+      DO 10 I = 1,IDIMS
+        IF(SEQ(I).EQ.FROM) SEQ(I) = TO
+10    CONTINUE
+      END
+      SUBROUTINE SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C 28-7-91 added extra parameter: mxgood is the maximum reading
+C length for which we have confidence, so only the parts of
+C reads 1 to mxgood will be included in the calculation
+C   SET INITIAL VALUES
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        IREND = RELPG(GELC) - LNGTHG(GELC) + 1
+        IF (MXGOOD.LT.MAXGEL) THEN
+          LSEQNO = IREND - MXGOOD + 1
+        ELSE
+          LSEQNO = RELPG(GELC)
+        END IF
+        LSEQNO = MAX(LSEQNO,LREG)
+        IS = LSEQNO - RELPG(GELC) + 1
+        N  = ABS(LNGTHG(GELC))
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   COMPARE STRANDS
+C
+      DO 500 I=1,LINLEN
+C        WRITE(*,*)I,LINOU1(I),LINOU2(I)
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          IF(LINOU1(I).EQ.'-') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF(LINOU1(I).EQ.'*') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          SEQ1(POSN1) = '0'
+        ELSE
+          IF((LINOU1(I).EQ.'*').AND.(LINOU2(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU2(I).EQ.'*').AND.(LINOU1(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*')) THEN
+            SEQ1(POSN1) = '1'
+            IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) 
+     +      SEQ1(POSN1) = '4'
+              GO TO 500
+          END IF
+          IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) THEN
+            SEQ1(POSN1) = '2'
+            IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*'))
+     +      SEQ1(POSN1) = '4'
+            GO TO 500
+          END IF
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+C      SUMMER
+C
+C   SUBROUTINE TO PRODUCE A CONSENSUS FROM LINED UP GEL READINGS
+      SUBROUTINE SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     1SEQ1,IDIM1,GEL,LREG,RREG,IGELC,IDBSIZ,CHARS,CHRSIZ,MAXGL2,
+     +IDEVW,MAXGEL,IDM,PERCD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LREG,RREG,LSEQNO,POSN,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      INTEGER GELC
+      CHARACTER SEQ1(IDIM1)
+      INTEGER CHARS(CHRSIZ,MAXGL2)
+      CHARACTER MUNOTP,GTCONC
+      EXTERNAL MUNOTP,INDEXS,GTCONC,LWRAPS
+C
+C   SET INITIAL VALUES
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      IPSEQ1=0
+C
+C   ZERO ARRAY
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,CHRSIZ
+      CHARS(J,I)=0
+40    CONTINUE
+50    CONTINUE
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C   LOOP FOR RELEVANT ELEMENTS THIS GEL
+      N=ABS(LNGTHG(GELC))
+      IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      IS=(LSEQNO-RELPG(GELC))+1
+****************************
+      IF(IDM.EQ.26)THEN
+        DO 51 I = IS,N
+          JJJ=(MOD(LSEQNO,MAXGL2))
+          IF(JJJ.EQ.0)JJJ=MAXGL2
+          CALL PCON1(GEL(I),CHARS(1,JJJ))
+          LSEQNO = LSEQNO + 1
+51      CONTINUE
+      ELSE
+****************************
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,JJJ) = CHARS(JJ,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      END IF
+C
+C   LOOK AT NEXT GEL TO RIGHT
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+C   RESET LSEQNO
+      LSEQNO=RELPG(GELC)
+C   IS THIS OVER END?
+      IF(LSEQNO.GT.RREG)GO TO 200
+C   ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C   SET FLAG TO SHOW END REACHED
+      IEND=1
+C   NEED TO SUM AND OUTPUT
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SUM NEXT SECTION OF CHARS
+      IF(IDM.EQ.26)THEN
+        DO 211 I = 1,LINLEN
+          IPSEQ1 = IPSEQ1 + 1
+          SEQ1(IPSEQ1) = '-'
+          JJJ = MOD(POSN,MAXGL2)
+          IF(JJJ.EQ.0)JJJ = MAXGL2
+          SEQ1(IPSEQ1) = MUNOTP(CHARS(1,JJJ))
+          CHARS(1,JJJ) = 0
+          POSN = POSN + 1
+211     CONTINUE
+      ELSE
+      DO 230 I=1,LINLEN
+        IPSEQ1=IPSEQ1+1
+        ISUM=0
+        JJJ = LWRAPS(POSN,MAXGL2)
+        SEQ1(IPSEQ1) = GTCONC(CHARS(1,JJJ),CHRSIZ,PERCD)
+        CALL FILLI(CHARS(1,JJJ),CHRSIZ,0)
+        POSN = POSN + 1
+230   CONTINUE
+      END IF
+C
+C
+C   ANY MORE TO OUTPUT?
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGLEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
+      SUBROUTINE TPCHEK(PC,PG,L,N)
+      INTEGER PC(N),PG(N),L(N)
+C     AUTHOR RODGER STADEN
+C     IF OVERLAPPING BLOCKS ARE FOUND REMOVE THE SHORTER ONE
+C     THEN REMOVE LARGE GAPS AT ENDS (THOSE AS LARGE AS THE END BLOCK)
+      K1 = 2
+1     CONTINUE
+      DO 10 I = K1,N
+        J1 = I
+        IF(PC(I).LE.PC(I-1)) GO TO 20
+        IF(PG(I).LE.PG(I-1)) GO TO 20
+10    CONTINUE
+C     REMOVE LARGE GAPS FROM ENDS
+C     THIS RULE OF THUMB COULD BE CHANGED TO USE A DIFFERENCE
+C     BETWEEN THE NUMBERS OF MISMATCHING CHARACTERS
+      IF(N.GT.1) THEN
+        K1 = PC(2) - PC(1) - L(1) 
+        J1 = PG(2) - PG(1) - L(1)
+        IF(MAX(K1,J1).GT.L(1)) THEN
+          CALL ML(PC,PG,L,N,1)
+          N = N - 1
+        END IF
+        IF(N.GT.1) THEN
+          K1 = PC(N) - PC(N-1) - L(N-1)
+          J1 = PG(N) - PG(N-1) - L(N-1)
+          IF(MAX(K1,J1).GT.L(N)) THEN
+            CALL ML(PC,PG,L,N,N)
+            N = N - 1
+          END IF
+        END IF
+      END IF
+      RETURN
+20    CONTINUE
+      IF(L(J1-1).GT.L(J1)) THEN
+        CALL ML(PC,PG,L,N,J1)
+      ELSE
+        CALL ML(PC,PG,L,N,J1-1)
+      END IF
+C  Until 25-11-90 next line was k1=j1 but this does not deal with all 
+C  cases: when a line is deleted we must compare it with the previous
+C  one before dealing with the rest, because it could be left of that
+C   one as well!
+      K1 = MAX(2,J1-1)
+      N = N - 1
+      GO TO 1
+      END
+      SUBROUTINE WRITEN(IDEVN,N,NAME)
+      CHARACTER NAME*(*)
+      WRITE(IDEVN,REC=N)NAME
+      RETURN
+      END
+      SUBROUTINE WRITER(IDEVR,N,RELPG,LNGTHG,LNBR,RNBR)
+      INTEGER RELPG,RNBR
+      WRITE(IDEVR,REC=N)RELPG,LNGTHG,LNBR,RNBR
+      RETURN
+      END
+      SUBROUTINE WRITEW(IDEVW,N,GEL,MAXGEL)
+      CHARACTER GEL(MAXGEL)
+      WRITE(IDEVW,REC=N)GEL
+      RETURN
+      END
+      SUBROUTINE XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NCONTS,LLINOI,LINCNI,LREG,RREG,
+     +WINDOW,GWIND,LENCON,DEPTHP,DEPTHM,
+     +MARGL,MARGR,MARGB,MARGT,MAXOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +KBOUT,GEL,GEL2,IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM,
+     +SEQ1,IDIM1,NGELS,TEMP3,CHRSIZ,MAXGL2,LINOU1,LINOU2,
+     +NOPT1,NOPT2,NOPT3,
+     +IHELPS,IHELPE,HELPF,IDEVH,MXGOOD)
+C  AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),WINDOW,CHRSIZ,GWIND
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER RREG,DEPTHP,DEPTHM,STRAND,CHNRP1,HQN
+      INTEGER TEMP3(2,CHRSIZ,MAXGL2)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER TERM,TUPPER,NAMARC*10,HELPF*(*)
+      CHARACTER SEQ1(IDIM1),LINOU1(MAXGEL),LINOU2(MAXGEL)
+      EXTERNAL NOPWIN,CWORLD,TUPPER,CHNRP1,HQN
+C  nopt1 = single contig
+C  nopt2 = all contigs
+C  nopt3 = scan
+10    CONTINUE
+      LLINO = LLINOI
+      LINCON = LINCNI
+      LOCLR = 0
+      LOCRR = 0
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL CLEARV
+      CALL XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CALL VT100M
+      INFLAG = HQN(TERM)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.3) RETURN
+      NOPT = NOPWIN(IY,MARGB,MARGT,MAXOPT)
+      TERM = TUPPER(TERM)
+      IF(NOPT.EQ.0) RETURN
+      IF(NOPT.EQ.NOPT3) THEN
+        IF(TERM.EQ.'S') THEN
+          XMIN = LREG
+          XMAX = RREG
+          X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF((TERM.EQ.'N').OR.(TERM.EQ.'Z').OR.(TERM.EQ.'I')) GO TO 10
+      END IF
+      IF(NOPT.EQ.NOPT1) THEN
+        STRAND = 1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHP)
+        IF(DEPTHP.LT.0) RETURN
+        STRAND = -1
+        CALL FDPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,LLINO,LREG,RREG,LENCON,STRAND,DEPTHM)
+        IF(DEPTHM.LT.0) RETURN
+        YMAX = ISYMAX
+        YMIN = 0.
+        XMIN = LREG
+        XMAX = RREG
+        RINC = ISYMAX / (DEPTHP + DEPTHM + 2)
+        RMID =(DEPTHM+1) * RINC
+        X = CWORLD(IX,MARGL,MARGR,XMIN,XMAX)
+        Y = CWORLD(IY,MARGB(NOPT),MARGT(NOPT),YMIN,YMAX)
+        IF(TERM.EQ.'I') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(ICLOSE.EQ.1) GO TO 10
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+1006      FORMAT
+     +    ( ' Name ',A,' Number ',I4,' Rel. Posn. ',I7,' Length ',I5)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Z') THEN
+          STRAND = 1
+          YINC = RINC * STRAND
+          CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +    ISXMAX,ISYMAX,RMID,YINC,DEPTHP,X,Y,
+     +    KBOUT,IGEL,ICLOSE)
+          IF(ICLOSE.EQ.1) THEN
+            STRAND = -1
+            YINC = RINC * STRAND
+            CALL IPLTC(RELPG,LNGTHG,LNBR,RNBR,
+     +      IDBSIZ,LLINO,LREG,RREG,STRAND,
+     +      MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +      ISXMAX,ISYMAX,RMID,YINC,DEPTHM,X,Y,
+     +      KBOUT,IGEL,ICLOSE)
+          END IF
+          IF(IGEL.EQ.0) GO TO 10
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT2),MARGT(NOPT2),ISXMAX,ISYMAX)
+          LREG = MAX(1,RELPG(IGEL)-GWIND)
+          RREG = MIN(RELPG(LINCON),RELPG(IGEL)+GWIND)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+            GO TO 10
+          END IF
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(LREG,NINT(X)-WINDOW)
+          LOCRR = MIN(RREG,NINT(X)+WINDOW-1)
+          IGEL = CHNRP1(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +    LLINO,LREG)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+      END IF
+      IF(NOPT.EQ.NOPT2) THEN
+        CALL IDPLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,
+     +  NCONTS,IX,IY,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +  ISXMAX,ISYMAX,DBTDUX,DBTDUY,
+     +  LINCON,IGEL,IS)
+        IF(IGEL.EQ.0) RETURN
+        IF(TERM.EQ.'Z') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'Q') THEN
+          CALL CLEARG
+          CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LINCON,IGEL,
+     +    NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+          LREG = 1
+          RREG = RELPG(LINCON)
+          LLINO = LNBR(LINCON)
+          LLINOI = LLINO
+          LINCNI = LINCON
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT1),MARGT(NOPT1),ISXMAX,ISYMAX)
+          CALL DBSCNP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +    IDIM1,GEL,IDBSIZ,TEMP3,2,CHRSIZ,MAXGL2,IDEV2,LLINO,
+     +    PERCD,MAXGEL,LINOU1,LINOU2,LREG,RREG,
+     +    MARGL,MARGR,MARGB(NOPT3),MARGT(NOPT3),ISXMAX,ISYMAX,
+     +    MXGOOD)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'I') THEN
+          CALL READN(IDEV3,IGEL,NAMARC)
+          WRITE(IDEV,1006)NAMARC,IGEL,RELPG(IGEL),LNGTHG(IGEL)
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'S') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +      GEL,LLINO,LINCON,LOCLR,LOCRR,GEL2,I1,I2,0,I,
+     +      IDBSIZ,IDEV,KBOUT,
+     +      IDEV2,IDEV3,LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+          GO TO 10
+        END IF
+        IF(TERM.EQ.'N') THEN
+          LOCLR = MAX(1,IS-WINDOW)
+          LOCRR = MIN(RELPG(LINCON),IS+WINDOW-1)
+          LLINO = LNBR(LINCON)
+          IF(LOCLR.NE.0) THEN
+            CALL LSTCON(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,IGEL,
+     +      LOCRR,IDEV,IDEV3,NAMARC)
+          END IF
+          GO TO 10
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION CLINNO(LNBR,IDBSIZ,NCONTS,IIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS CONTIG LINE NUMBER OR ZERO FOR ERROR
+      INTEGER LNBR(IDBSIZ)
+      CLINNO = 0
+      N=IDBSIZ-NCONTS
+      DO 10 J=N,IDBSIZ-1
+        IF(LNBR(J).EQ.IIN) THEN
+          CLINNO = J
+          RETURN
+        END IF
+10    CONTINUE
+      END
diff --git a/src/staden/dbsysnew.f b/src/staden/dbsysnew.f
new file mode 100644
index 0000000..0a08d13
--- /dev/null
+++ b/src/staden/dbsysnew.f
@@ -0,0 +1,2963 @@
+C 8-4-92 removed minimum block length changing option in dbauto
+C   29.05.91  IMPLEMENTED  REMOVE GEL READING
+C   21-8-91 Added routines to find internal overlaps
+C    2-9-91 Fixed bug in copytg
+C   8-11-91 fixed bugs in "find internal joins"
+C   24-02-92 fixed some bugs courtesy of Kozlowski's AIX port
+C   23-Jun-92 COPYCC - params to READCC and WRITCC in wrong order
+C
+C   SUBROUTINE TO ENTER NEW GEL SEQUENCES INTO DATA BASE.
+C   IT READS IN AN ARCHIVE VERSION AND WRITES OUT A WORKING VERSION.
+C   IT ALSO SETS UP ANY RELATIONSHIPS WITH OTHER DATA IN THE DATABASE
+C   BOTH BY POSITION IN A CONTIG AND POINTERS TO LEFT AND RIGHT
+C   NEIGHBOURS.
+      SUBROUTINE AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,NAMARC,X,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM,IDC,NCONTC,LINCON,IFAIL,IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IDEVT,IDEVC,IDEVG,MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER SEQC2(IDC)
+      CHARACTER NAMARK*10
+C      WRITE(*,*)'X,ITYPE,ISENSE,IDIM,IDC'
+C      WRITE(*,*)X,ITYPE,ISENSE,IDIM,IDC
+C   SET FAIL FLAG
+      IFAIL=0
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' TRYING TO ENTER NEW GEL READING INTO DATABASE')
+C   IS THERE SPACE?
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+C   FULL
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Database full, current size=',I6,' Extend with copy')
+      IFAIL=7
+      RETURN
+5     CONTINUE
+C   NEED TO CHECK TO SEE IF GEL ALREADY IN DB
+C   LOOK THRU ARC FILE
+      DO 10 J=1,NGELS
+C      READ(IDEVN,REC=J)NAMARK
+      CALL READN(IDEVN,J,NAMARK)
+      IF(NAMARK.NE.NAMARC(1:10))GO TO 10
+C   FOUND
+      WRITE(KBOUT,1013)J
+1013  FORMAT(' New gel already in database with number',I6,
+     +' Entry aborted')
+      IFAIL=6
+      RETURN
+10    CONTINUE
+C   INCREMENT NUMBER OF GELS
+      NGELS=NGELS+1
+C   SET LENGTH THIS GEL
+      LNGTHG(NGELS)=IDIM*ISENSE
+C   WRITE NAME OF ARCHIVE TO LIST OF ARCHIVES
+C   NAMPRO,ARC
+C      WRITE(IDEVN,REC=NGELS)NAMARC(1:10)
+      NAMARK=NAMARC(1:10)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT(' This gel reading has been given the number ',I6)
+C   WRITE GEL TO WORKING VERSION
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      IF(IDEVT.GT.0) CALL ENTRD(IDEVG,IDEVT,IDEVC,NAMARC,NGELS,IOK)
+C   CREATE TAGS FOR THIS NASTY
+      CALL TAGGEL(NGELS,LNGTHG(NGELS),GEL)
+C   SET UP RELATIONSHIPS
+C   DOES THIS GEL OVERLAP?
+      IF(ITYPE.NE.0)GO TO 100
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+C   SET LEFT AND RIGHT POINTERS TO ZERO,RELPG TO 1
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C
+C   SET CONTIG POINTERS AND GENERAL VALUES
+C   INCREMENT NUMBER OF CONTIGS
+      NCONTS=NCONTS+1
+C   POINTER TO THIS CONTIG
+      N=IDBSIZ-NCONTS
+C   POINTER TO LEFT GEL THIS CONTIG
+      LNBR(N)=NGELS
+C   POINTER TO RIGHT GEL THIS CONTIG
+      RNBR(N)=NGELS
+C   LENGTH OF CONTIG
+      RELPG(N)=IDIM
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   WRITE DB DESCRIPTOR
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      RETURN
+C
+100   CONTINUE
+C
+C
+C   DOES OVERLAP
+150   CONTINUE
+C
+C   LEFT END OR RIGHT OVERLAP?
+      IF(ITYPE.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+160   CONTINUE
+C   NEED TO SEARCH THRU THIS CONTIG TO FIND LEFT AND RIGHT
+C   NEIGHBOURS FOR THIS NEW GEL
+C   LINE NUMBER OF LEFT END OF CONTIG
+      N=NCONTC
+C   LOOK THRU UNTIL CURRENT IS >= THEN IT MUST BE THE PREVIOUS ONE
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+C   IS THIS THE LAST GEL IN CONTIG?
+      IF(RNBR(N).EQ.0)GO TO 350
+C   NO SO LOOK AT NEXT
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS IN NEW GEL
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C   SET POINTERS  IN LEFT AND RIGHT NEIGHBOURS
+      K=LNBR(N)
+      RNBR(K)=NGELS
+C      RNBR(LNBR(N))=NGELS
+C   WRITE LEFT AND RIGHT NEIGHBOURS
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   WRITE NGELS NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   ITS LINE NUMBER IS LINCON
+C   NEED TO UPDATE IDIM IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS FOR NEW GEL
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)
+C     1RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),RNBR(NGELS)
+C   OLD RIGHT END
+      RNBR(N)=NGELS
+C   WRITE NEW RIGHT LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C
+C   RESET RIGHT NAME IN CONTIG
+C   ITS LINE NUMBER IS LINCON
+      RNBR(LINCON)=NGELS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   NEED TO UPDATE LENGTH OF GEL IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+C   WRITE HERE
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON)
+C     1,RNBR(LINCON)
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+410   CONTINUE
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,1,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+420   CONTINUE
+C   SET POINTERS IN NEW GEL
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C   SET POINTERS IN OLD LEFT END
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+C   WRITE NEW LEFT END
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+C      WRITE(IDEVR,REC=NCONTC)RELPG(NCONTC),LNGTHG(NCONTC),LNBR(NCONTC)
+C     1,RNBR(NCONTC)
+C   NEW LENGTH OF CONTIG
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+C   MAY HAVE JUST ADDED A GEL LONGER THAN CONTIG
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+C   NEW NAME OF LEFT END OF CONTIG
+      LNBR(LINCON)=NGELS
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON)
+C     1,RNBR(LINCON)
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+C   NOW GO THRU AND CHANGE ALL RELATIVE POSITIONS
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      GO TO 440
+      END
+C      SUBROUTINE COPYRD(IDEVRD,IDEV,NGELS,NEWSIZ,IOK)
+C      CHARACTER MTYPE*4,NAMFIL*12
+C      DO 10 I = 1,NGELS
+C        CALL READRD(IDEVRD,I,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C        CALL WRITRD(IDEV,I,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C 10   CONTINUE
+C      CALL WRITRD(IDEV,NEWSIZ,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C      IOK = 0
+C      END
+      SUBROUTINE COPYTG(IDEVT,IDEV,IOK,IDBSIZ,NEWSIZ,NGELS)
+C Read tag details
+      IDIFF = NEWSIZ - IDBSIZ
+      CALL READTG(IDEVT,IDBSIZ,ICNT,LLEN,LCOM,LTYPE,NEXT)
+      IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+      CALL WRITTG(IDEV,NEWSIZ,ICNT+IDIFF,LLEN,LCOM,LTYPE,NEXT)
+C Copy headers for each gels
+      DO 10 I = 1,NGELS
+        CALL READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+        CALL WRITTG(IDEV,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+ 10   CONTINUE
+C Copy rest of tags
+      DO 20 I = IDBSIZ+1, ICNT
+        CALL READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF (NEXT.NE.0) NEXT = NEXT + IDIFF
+        CALL WRITTG(IDEV,I+IDIFF,LPOS,LLEN,LCOM,LTYPE,NEXT)
+ 20   CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE COPYCC(IDEVC,IDEV,IOK)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      CALL READCC(IDEVC,1,ICNT,NEXT,NOTE)
+      CALL WRITCC(IDEV,1,ICNT,NEXT,NOTE)
+      DO 10 I = 2,ICNT
+        CALL READCC(IDEVC,I,ICNT,NEXT,NOTE)
+        CALL WRITCC(IDEV,I,ICNT,NEXT,NOTE)
+ 10   CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,IDEVT,IDEVC,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB),PL(2),PR(2),RMOST
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LINCON(2),LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER LREG,RREG,X,ANS,ANSJOK
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER GET,SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      SAVE GET
+      DATA GET/'>'/
+      WRITE(KBOUT,*)' Automatic sequence assembler'
+      IFAIL(1) = 0
+      IEMPTY=0
+      IF(NGELS.LT.1)IEMPTY=1
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      CALL YESNO(IOKENT,'Permit entry',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOKENT.LT.0) RETURN
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV7,FILE,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(IOKENT.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +  'File for names of failures',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      PROMPT(1) = 'Perform normal shotgun assembly'
+      PROMPT(2) = 'Put all sequences in one contig'
+      PROMPT(3) = 'Put all sequences in new contigs'
+      IOPT = 1
+      CALL RADION('Select entry mode',PROMPT,3,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.1) THEN
+C  parameters for normal assembly
+      ANSJOK = 0
+      CALL YESNO(ANSJOK,'Permit joins',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSJOK.LT.0) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MINSLI = 3
+      MN = 0
+      MX = 25
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per gel',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MN = 0
+      MX = 25
+      MAXPC = 8
+      CALL GETINT(MN,MX,MAXPC,
+     +'Maximum pads per gel in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPC = IVAL
+      IF(IOKENT.EQ.0) THEN
+        RMN = 0.
+        RMX = 15.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      END IF
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      IF(IEMPTY.EQ.0)
+     +CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      END IF
+      JGEL = 0
+      JNGEL = 0
+      JNJOIN = 0
+      IMATC = 0
+      IF(IFAIL(1).NE.0)GO TO 900
+C
+C
+1     CONTINUE
+C
+C
+      IDIM2=MAXGEL
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAMARC,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        READ(IDEV7,1002,END=900)NAMARC
+      END IF
+1002  FORMAT( A)
+      DO 77 MM=1,80
+        MATCH(MM)=GET
+77    CONTINUE
+      WRITE(IDEV,1077)(MATCH(KK),KK=1,79)
+1077  FORMAT(' ',79A1)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1007  FORMAT(' Gel reading name=',A)
+      WRITE(IDEV,1007)NAMARC
+      CALL OPENRS(IDEV4,NAMARC,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,SEQ2,IDIM2,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      WRITE(IDEV,1800)IDIM2
+1800  FORMAT(' Gel reading length=',I6)
+      IF(IOPT.NE.1) THEN
+        CALL DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +  IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +  IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL,IMATC,IEMPTY,IOPT)
+        IF(IFAIL(1).NE.0) GO TO 800
+        JNGEL = JNGEL + 1
+        GO TO 1
+      END IF
+      IF(IDIM2.LT.MINMAT)THEN
+        WRITE(IDEV,*)
+     +' Reading shorter than minimum match, so not processed'
+        GO TO 1
+      END IF
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      IFCOMP=0
+      IMATC=0
+      IF(IEMPTY.EQ.0)
+     +CALL AUTOCN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL)
+      IF(IOKENT.NE.0) GO TO 1
+C     THIS RETURNS THE FOLLOWING:
+C     ILEFTS  POSITION IN CONSENSUS OF LEFT END OF MATCHING CONTIGS
+C     ILC     LENGTHS OF MATCHING CONTIGS
+C     IPOSC   POSITION OF MATCH RELATIVE TO CONTIG
+C     IPOSG   POSITION OF MATCH RELATIVE TO NEW GEL
+C     ISENSE  SENSE OF NEW GEL
+C     LLINO   LEFT GEL NUMBER IN MATCHING CONTIGS
+C     IMATC   THE NUMBER OF MATCHING CONTIGS (>2 IS ERROR!)
+C     IFCOMP  ERROR FLAG FOR COMPARISON (COMPARISON ARRAYS OVERFLOWED)
+      IF(IFCOMP.NE.0)GO TO 800
+      CALL SQCOPY(SEQ3,SEQ2,IDIM2)
+      IF(IMATC.GT.0)GO TO 200
+C
+C                         NO OVERLAP NEW CONTIG
+C
+C     ITYPE 0 = NO OVERLAP
+C     ISENSE 1 = SAME SENSE AS ARCHIVE
+      IF(IFAIL(1).NE.0) GO TO 800
+      ITYPE(1)=0
+      ISENSE(1)=1
+      IDOUT(1)=MAXGEL
+      WRITE(IDEV,1015)
+1015  FORMAT(' New gel reading does not overlap: start a new contig')
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,X,ITYPE,ISENSE,SEQC2(1,1),ITOTPC(1),
+     +IDIM2,IDOUT(1),LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      IEMPTY=0
+      IDIM1=IDIM1+1
+      IF((IDIM1+19+IDIM2).GT.MAXSEQ)THEN
+        WRITE(IDEV,1021)MAXSEQ
+1021    FORMAT(' Database maximum consensus length (',I6,') exceeded')
+        GO TO 900
+      END IF
+      CALL ADDTIT(SEQ1(IDIM1),NAMPRO,NGELS,IDIM1)
+      CALL MSTLKL(SEQ2,IDIM2)
+      CALL SQCOPY(SEQ2,SEQ1(IDIM1),IDIM2)
+      IDIM1=IDIM1+IDIM2-1
+      JNGEL = JNGEL + 1
+      GO TO 1
+C
+C
+C
+200   CONTINUE
+C
+C
+C   OVERLAP SO TRY TO ALIGN THE SEQUENCES
+C
+C
+      DO 100 I=1,IMATC
+        N=IDBSIZ-NCONTS
+        DO 99 J=N,IDBSIZ-1
+          IF(LNBR(J).NE.LLINO(I))GO TO 99
+          LINCON(I)=J
+          GO TO 100
+99      CONTINUE
+        WRITE(IDEV,10077)LLINO(I)
+10077   FORMAT(' Contig line for contig',I6,' not found!')
+        GO TO 800
+100   CONTINUE
+C
+      IF((IMATC.EQ.2).AND.(ANSJOK.EQ.0))GO TO 400
+C
+C
+C                           SINGLE OVERLAP
+C
+C
+C
+      WRITE(IDEV,1014)LLINO(1)
+1014  FORMAT(' New gel reading overlaps contig',I6)
+      IF(ITOTPG(1).GT.0) CALL CCTA(SEQG2(1,1),IDIM22(1))
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,1),NAMARC,JOINT(1),ITYPE(1),ISENSE(1),
+     +SEQC2(1,1),
+     +ITOTPC(1),IDIM22(1),IDOUT(1),LLINO(1),LINCON(1),
+     +IFAIL(1),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      CALL DELCON(SEQ1,ILEFTS(1),ILC(1),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(1))
+      IGELC=LNBR(LINCON(1))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      JNGEL = JNGEL + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C                        DOUBLE OVERLAP
+C
+C
+400   CONTINUE
+      WRITE(IDEV,1013)LLINO
+1013  FORMAT(' Overlap between contigs',I6,' and',I6)
+      IF(ANSJOK.NE.0)GO TO 800
+      IF(LLINO(1).EQ.LLINO(2))THEN
+        WRITE(IDEV,*)' Trying to form loop in contig',LLINO(1)
+        WRITE(IDEV,*)' Gel not entered'
+        GO TO 800
+      END IF
+      CALL AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,
+     +IDIM22,KLASS,IOVER,IDEV,PL,PR)
+      IF(IOVER.GT.MAXOVR)THEN
+        WRITE(IDEV,*)' Overlap too large: entry only'
+        IFAIL(2)=1
+        GO TO 600
+       END IF
+C   WHICH CONTIG IS LEFTMOST?
+      LMOST=1
+      RMOST=2
+      IF(PL(1).GT.PL(2))THEN
+        LMOST=2
+        RMOST=1
+      END IF
+C   SAVE LENGTH OF RMOST CONTIG FOR DELETION STEP LATER
+      ILCR=ILC(RMOST)
+      IF(ITOTPG(LMOST).GT.0) CALL CCTA(SEQG2(1,LMOST),IDIM22(LMOST))
+      WRITE(IDEV,1012)LLINO(LMOST)
+1012  FORMAT(' Entering the new gel reading into contig',I6)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,LMOST),NAMARC,JOINT(LMOST),ITYPE(LMOST),
+     +ISENSE(LMOST),SEQC2(1,LMOST),ITOTPC(LMOST),
+     +IDIM22(LMOST),IDOUT(LMOST),LLINO(LMOST),LINCON(LMOST),
+     +IFAIL(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,
+     +MAXGEL)
+      IF(IFAIL(LMOST).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      IF(ITYPE(LMOST).EQ.1)LLINO(LMOST)=NGELS
+      IF(ILEFTS(LMOST).LT.ILEFTS(RMOST))THEN
+        ILEFTS(RMOST)=ILEFTS(RMOST)-ILC(LMOST)-20
+      END IF
+      CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(LMOST))
+      IGELC=LNBR(LINCON(LMOST))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      ILC(LMOST)=RREG
+      ILEFTS(LMOST)=IDIM1-RREG+1
+      DO 500 I=1,2
+        IF(ISENSE(I).EQ.-1)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON(I),
+     +    LLINO(I),SEQ2,IDBSIZ,IDEV,IDEV1,IDEV2,MAXGEL)
+          CALL SQREV(SEQ1(ILEFTS(I)),ILC(I))
+          CALL SQCOM(SEQ1(ILEFTS(I)),ILC(I))
+          KT=IDIM1
+          CALL ADDTIT(SEQ1((ILEFTS(I)-20)),NAMPRO,LNBR(LINCON(I)),KT)
+        END IF
+500   CONTINUE
+C   NEED TO KNOW POSITION OF OVERLAP RELATIVE TO CONTIG, TO CONSENSUS
+C   WHICH BITS TO SEND TO ALIGNMENT ROUTINES
+C   SET UP FOR ALINE (NOTE RMOST IS EQUIVALENT TO THE GEL READING AND
+C   SO IS SLID ALONG THE LMOST CONTIG. THE SECTION SENT TO ALINE MUST 
+C   BE OF LENGTH < MAXGEL-2*MAX(MAXPC,MAXPG)
+C   IT MUST START AT POSITION 1 IN THE RMOST CONTIG AND EXTEND
+      IPOSC(LMOST)=PL(RMOST)+RELPG(NGELS)-1
+      ILCT = RELPG(LINCON(LMOST)) - RELPG(NGELS) - PL(RMOST) + 2
+      ILC(RMOST)=MIN(ILCT,ILC(RMOST))
+      IPOSC(RMOST)=1
+      IDOUT(LMOST)=MAXGEL
+      IDOUT(RMOST)=MAXGEL
+      IDSAV=MAXSAV
+C  ON INPUT TO ALINE ILC(RMOST) CONTAINS THE OVERLAP LENGTH
+C  ON OUTPUT IT CONTAINS THE LENGTH OF THE ALIGNED SECTION (IE INCLUDING 
+C  PADS)
+      WRITE(IDEV,1009)
+1009  FORMAT(' Trying to align the two contigs')
+      CALL ALINE(SEQ1(ILEFTS(LMOST)),SEQ1(ILEFTS(RMOST)),
+     +SEQC2(1,RMOST),SEQC2(1,LMOST),SAV1,SAV2,SAV3,IDSAV,
+     +ILC(LMOST),ILC(RMOST),IDOUT(LMOST),IPOSC(LMOST),IPOSC(RMOST),
+     +MINSLI,JOINT(LMOST),ITOTPC(LMOST),ITOTPC(RMOST),IFAIL(1),
+     +ITYPE(1),MAXPC,MAXPC,PERMAX,IDEV,SEQ4,MAXGEL,Z)
+C SEQC2(1,LMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE LMOST CONTIG
+C SEQC2(1,RMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE RMOST CONTIG
+C ILC(RMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE RMOST CONTIG
+C IDOUT(LMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE LMOST CONTIG
+C JOINT(LMOST)  IS THE POSITION OF THE JOIN RLETIVE TO THE LMOST CONTIG
+C ITYPE IS TYPE OF OVERLAP (-1 = RIGHT END OR INTERNAL, 1 = LEFT END)
+C  NB SHOULD ALWAYS BE -1
+C  IF THIS HAS BEEN DONE OK WE CAN EDIT THE TWO CONTIGS THEN JOIN
+      IF(IFAIL(1).NE.0)THEN
+        WRITE(IDEV,*)' Failed to align the two overlapping contigs'
+        GO TO 800
+      END IF
+      IF(ITOTPC(LMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(LMOST)
+1017    FORMAT(' Editing contig',I6)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(LMOST),JOINT(LMOST),SEQC2(1,LMOST),
+     +  ITOTPC(LMOST),IDOUT(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      JOINT(RMOST)=1
+      IDOUT(RMOST)=ILC(RMOST)
+      IF(ITOTPC(RMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(RMOST)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(RMOST),JOINT(RMOST),SEQC2(1,RMOST),
+     +  ITOTPC(RMOST),IDOUT(RMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      ILC(RMOST)=ILCR
+      LTL=LNBR(LINCON(LMOST))
+      LTR=LNBR(LINCON(RMOST))
+      WRITE(IDEV,1018)LNBR(LINCON(LMOST)),LNBR(LINCON(RMOST))
+1018  FORMAT(' Completing the join between contigs',I6,' and',I6)
+      CALL AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +JOINT(LMOST),LTL,LTR,LINCON(LMOST),LINCON(RMOST),IDEV1)
+      LLINO(1)=LTL
+      IF(ILEFTS(LMOST).GT.ILEFTS(RMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+      END IF
+      IF(ILEFTS(RMOST).GE.ILEFTS(LMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      END IF
+      LREG=1
+      RREG=JOINT(LMOST)
+      IGELC=LLINO(1)
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+C      CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+      JNJOIN = JNJOIN + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C             JOINS THAT FAIL INITIAL ALIGNMENT CRITERIA
+C
+C
+600   CONTINUE
+C   COME HERE FOR JOINS THAT FAIL: WE MIGHT BE ABLE TO ENTER THE GEL 
+C   INTO A SINGLE CONTIG
+      IGOOD=0
+      IF(IFAIL(1).EQ.0)IGOOD=1
+      IF(IFAIL(2).EQ.0)IGOOD=2
+      IF(IGOOD.EQ.0)GO TO 800
+      IF(ITOTPG(IGOOD).GT.0) CALL CCTA(SEQG2(1,IGOOD),IDIM22(IGOOD))
+      WRITE(IDEV,1012)LLINO(IGOOD)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,IGOOD),NAMARC,JOINT(IGOOD),ITYPE(IGOOD),
+     +ISENSE(IGOOD),SEQC2(1,IGOOD),ITOTPC(IGOOD),
+     +IDIM22(IGOOD),IDOUT(IGOOD),LLINO(IGOOD),LINCON(IGOOD),
+     +IFAIL(IGOOD),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,
+     +MAXGEL)
+      IF(IFAIL(IGOOD).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      CALL DELCON(SEQ1,ILEFTS(IGOOD),ILC(IGOOD),IDIM1)
+      ANS=1
+      IGELC=LNBR(LINCON(IGOOD))
+      LREG=1
+      RREG=RELPG(LINCON(IGOOD))
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      WRITE(IDEV,1020)LLINO,LLINO(IGOOD)
+1020  FORMAT(' Could not join contigs',I4,' and',I4,' but the gel',
+     +' has been entered into contig',I4,/,
+     +' If required do the join manually.')
+C
+C
+C                      ALL FAILURES
+C
+C
+C
+800   CONTINUE
+      WRITE(IDEV,10888)
+10888 FORMAT(' Failed gel name written to error file')
+      WRITE(IDEV8,1002)NAMARC
+      GO TO 1
+900   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' sequences processed'
+      WRITE(KBOUT,*)JNGEL,' sequences entered into database'
+      WRITE(KBOUT,*)JNJOIN,' joins made'
+      RETURN
+      END
+      SUBROUTINE DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL,IMATC,IEMPTY,IOPT)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER SEQ2(MAXGEL),SEQC2(MAXGEL)
+      CHARACTER NAMARC*(*)
+C  deals with entering all readings into contig 1 (IOPT=2)
+C  or all readings into new contigs (IOPT=3)
+      IF(IOPT.EQ.2) THEN
+        IF(IMATC.EQ.0) THEN
+          ITYPE=0
+          ISENSE=1
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+          IEMPTY=0
+          IMATC = 1
+        ELSE
+          ITYPE= - 1
+          ISENSE=1
+          JOINT = 1
+          LLINO = 1
+          LINCON = IDBSIZ - 1
+          ITOTPC = 0
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVT,IDEV4,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+        END IF
+      ELSE IF(IOPT.EQ.3) THEN
+        ITYPE=0
+        ISENSE=1
+        IDOUT=MAXGEL
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,IDEV4,MAXGEL)
+        IF(IFAIL.NE.0) RETURN
+      END IF
+      END
+      SUBROUTINE DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +NAMPRO,IDEV,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IDEVT,IDEVC,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,MAXDB,IDM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMPRO*(*),NAMARC*10,GEL(MAXGEL)
+      CHARACTER V2,V1
+      INTEGER IWORD,DELDB
+      PARAMETER (IWORD=4)
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PERR(2)*(MAXPRM)
+      EXTERNAL DELDB
+      IERR=1
+1     CONTINUE
+      L = 1
+      V1='1'
+      CALL GTSTR('Make version',V1,V2,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(V2,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) V2 = V1
+      MN = NGELS + NCONTS + 1
+      MX = MAXDB
+      NEWSIZ = IDBSIZ
+      CALL GETINT(MN,MX,NEWSIZ,
+     +'New database size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL BUSY(KBOUT)
+      NEWSIZ = IVAL
+C     WHERE SHOULD CHARS BE CHANGED ?
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0) THEN
+C problem opening file
+        IF(IOK.EQ.2) THEN
+          CALL ERROM(KBOUT,'File already exists')
+          PERR(1) = 'Retype version number'
+          PERR(2) = 'Replace database'
+          IDO = 1
+          CALL RADION('Select action',PERR,2,IDO,
+     +    IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          IF(IDO.LT.1) RETURN
+          IF(IDO.EQ.1) GO TO 1
+          IF(IDO.EQ.2) THEN
+            IOK =  DELDB(NAMPRO,V2,IDEV,MAXGEL)
+            IF(IOK.EQ.0) THEN
+              LLL = INDEX(NAMPRO,'.') + 1
+              NAMPRO(LLL:)='RL'//V2
+              CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+              IF(IOK.EQ.0) GO TO 2
+            ELSE
+              CALL ERROM(KBOUT,'File delete failed')
+            END IF
+          END IF
+        END IF
+        GO TO 100
+      END IF
+ 2    CONTINUE
+      CALL WRITER(IDEV,NEWSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      DO 10 I=1,NGELS
+        CALL WRITER(IDEV,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      M=NEWSIZ-NCONTS
+      N=IDBSIZ-NCONTS
+      DO 15 I=N,IDBSIZ-1
+        CALL WRITER(IDEV,M,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        M=M+1
+15    CONTINUE
+      CLOSE(UNIT=IDEV)
+C  DO SEQUENCES
+      NAMPRO(LLL:)='SQ'//V2
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEV,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+C
+C  MAKE SURE FILE IS NEWSIZ LINES LONG!
+      CALL WRITEW(IDEV,NEWSIZ,GEL,MAXGEL)
+      DO 20 I=1,NGELS
+        CALL READW(IDEVW,I,GEL,MAXGEL)
+        CALL WRITEW(IDEV,I,GEL,MAXGEL)
+20    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   DO ARCHIVE NAMES
+      NAMPRO(LLL:)='AR'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,3,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 30 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        CALL WRITEN(IDEV,I,NAMARC)
+30    CONTINUE
+C   MAKE SURE FILE IS 1000 LINES LONG
+C  ********  PROBLEMS HERE WRITING INTEGER TO NAMES FILE
+      WRITE(IDEV,REC=MAXDB)NEWSIZ,MAXGEL,IDM
+      CLOSE(UNIT=IDEV)
+C      IF(IDEVRD.GT.0) THEN
+C        NAMPRO(LLL:) = 'RD'//V2
+C        CALL OPENRS(IDEV,NAMPRO,IOK,7,3)
+C        IF(IOK.NE.0) GO TO 100
+C        CALL COPYRD(IDEVRD,IDEV,NGELS,NEWSIZ,IOK)
+C        IF(IOK.NE.0) GO TO 100
+C      END IF
+C   DO TAGS + COMMENTS
+      IF(IDEVT.GT.0.AND.IDEVC.GT.0) THEN
+        NAMPRO(LLL:)='TG'//V2
+        CALL OPENRS(IDEV,NAMPRO,IOK,5,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL COPYTG(IDEVT,IDEV,IOK,IDBSIZ,NEWSIZ,NGELS)
+        NAMPRO(LLL:)='CC'//V2
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+        CALL OPENRS(IDEV,NAMPRO,IOK,11,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL COPYCC(IDEVC,IDEV,IOK)
+      ENDIF
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error opening new database, copy aborted')
+      RETURN
+      END
+      INTEGER FUNCTION DELDB(NAMPRO,VERSN,IDEV,MAXGEL)
+      CHARACTER NAMPRO*(*),VERSN
+      INTEGER DELF
+      EXTERNAL DELF
+C
+C delete an xdap database
+C
+C assume relationships are 4 words, names are 3 and seqs are maxgel
+C all recls in BYTES
+C
+      DELDB = 1
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//VERSN
+      IF(DELF(NAMPRO,IDEV,16,4).NE.0) RETURN
+      NAMPRO(LLL:)='AR'//VERSN
+      IF(DELF(NAMPRO,IDEV,12,4).NE.0) RETURN
+      NAMPRO(LLL:)='SQ'//VERSN
+      IF(DELF(NAMPRO,IDEV,MAXGEL,4).NE.0) RETURN
+      NAMPRO(LLL:)='CC'//VERSN
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+      IF(DELF(NAMPRO,IDEV,44,4).NE.0) RETURN
+      NAMPRO(LLL:)='TG'//VERSN
+      IF(DELF(NAMPRO,IDEV,20,4).NE.0) RETURN
+      DELDB = 0
+      END
+      SUBROUTINE DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,MAXGEL,IDEVT,IDEVC)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+C  12-12-90 Added function to change raw data parameter file
+C  and changed menu routines accordingly
+      INTEGER RELPG(IDBSIZ),X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAME*10,NEWNAM*10
+      INTEGER L,M,N
+      CHARACTER GEL2(MAXGEL)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(9)*(MAXPRM)
+      INTEGER GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' Warning:',
+     +' make a copy first, and check logical consistency after use')
+10    CONTINUE
+C
+C   SELECT OPTION
+C      CALL BELL(1,KBOUT)
+C     DBMENU now defunct for bap - so we use RADION instead
+C      CALL DBMENU(4,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PROMPT(1) = 'Line change'
+      PROMPT(2) = 'Edit gel reading'
+      PROMPT(3) = 'Delete contig line'
+      PROMPT(4) = 'Shift'
+      PROMPT(5) = 'Move gel reading'
+      PROMPT(6) = 'Rename gel reading'
+      PROMPT(7) = 'Break a contig'
+      PROMPT(8) = 'Remove a gel reading'
+      PROMPT(9) = 'Alter raw data parameters'
+      NOPT = 1
+      CALL RADION('Alter relationships', PROMPT, 9, NOPT, IHELPS,
+     +  IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.LT.1)RETURN
+      IF(NOPT.EQ.1)THEN
+C       LINE CHANGE
+        MN = 0
+        MX = IDBSIZ
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.EQ.0) GO TO 10
+        LNO = IVAL
+        IF(LNO.EQ.IDBSIZ)GO TO 19
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)RELPG(LNO),LNGTHG(LNO),LNBR(LNO),RNBR(LNO)
+1001    FORMAT(' ',4I6)
+        MN = 0
+        MX = 99999
+        X  = RELPG(LNO)
+        CALL GETINT(MN,MX,X,
+     +  'Relative position',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = -MAXGEL
+        MX = 99999
+        L  = LNGTHG(LNO)
+        CALL GETINT(MN,MX,L,
+     +  'Length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = IDBSIZ
+        M  = LNBR(LNO)
+        CALL GETINT(MN,MX,M,
+     +  'Left neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        MN = 0
+        MX = IDBSIZ
+        N  = RNBR(LNO)
+        CALL GETINT(MN,MX,N,
+     +  'Right neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        N = IVAL
+        CALL WRITER(IDEVR,LNO,X,L,M,N)
+        RELPG(LNO)=X
+        LNGTHG(LNO)=L
+        LNBR(LNO)=M
+        RNBR(LNO)=N
+        GO TO 10
+19      CONTINUE
+C       NCONTS NGELS LINES
+        MN = 0
+        MX = IDBSIZ
+        LL = NGELS
+        CALL GETINT(MN,MX,LL,
+     +  'Number of gel readings',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LL = IVAL
+        MN = 0
+        MX = IDBSIZ
+        MM  = NCONTS
+        CALL GETINT(MN,MX,MM,
+     +  'Number of contigs',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        MM = IVAL
+        CALL WRITER(IDEVR,IDBSIZ,LL,MM,LL,MM)
+        NGELS=LL
+        NCONTS=MM
+        GO TO 10
+      END IF
+C
+C
+      IF(NOPT.EQ.4)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of first gel reading to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNO)
+        IF(I.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'Problem with this gel reading. Check logical consistency'
+          WRITE(KBOUT,*)'of database. Shift not done'
+          GO TO 10
+        END IF
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'No contig line for this contig. Check logical'
+          WRITE(KBOUT,*)'consistency of database. Shift not done'
+          GO TO 10
+        END IF
+        MN = 1 - RELPG(LNO)
+        MX  = RELPG(NCONTO) - RELPG(LNO)
+        X = MN
+        CALL GETINT(MN,MX,X,
+     +  'Distance to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        X = IVAL
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,LNO,NCONTO,X)
+        WRITE(KBOUT,*)'Shift complete'
+        GO TO 10
+      END IF
+C
+      IF(NOPT.EQ.2)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to edit',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        IDIM=ABS(LNGTHG(LNO))
+C       ALLOW EDITING TO BE RELATIVE TO CONTIG POSITION
+        ISTART=RELPG(LNO)
+        CALL READW(IDEVW,LNO,GEL,MAXGEL)
+C @jkb@
+C        CALL EDITGL(GEL,GEL2,IDIM,KBIN,KBOUT,MAXGEL,
+C     +  IHELP1,IHELP2,FILEH,IDEVH,ISTART)
+        CALL WRITEW(IDEVW,LNO,GEL,MAXGEL)
+        LNGTHG(LNO)=SIGN(IDIM,LNGTHG(LNO))
+        CALL WRITER(IDEVR,LNO,RELPG(LNO),LNGTHG(LNO),
+     +  LNBR(LNO),RNBR(LNO))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.3)THEN
+        MN=IDBSIZ-NCONTS
+        MX = IDBSIZ - 1
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of contig line to delete',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LNO = IVAL
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  LNO,IDEVR)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to rename',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        CALL READN(IDEVN,LNO,NAME)
+        L = 10
+        CALL GTSTR('name for gel reading',
+     +  NAME,NEWNAM,L,KBOUT,KBIN,INFLAG)
+        IF(L.GT.0)CALL WRITEN(IDEVN,LNO,NEWNAM)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.5)THEN
+        MN = 0
+        MX = NGELS
+        IFROM  = 0
+        CALL GETINT(MN,MX,IFROM,
+     +  'Number of gel reading to move',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        IFROM = IVAL
+        MN = 0
+        MX = NGELS
+        ITO  = 0
+        CALL GETINT(MN,MX,ITO,
+     +  'New number for gel reading',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        ITO = IVAL
+        CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  GEL,IFROM,ITO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        CALL BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+        GO TO 10        
+      END IF
+      IF(NOPT.EQ.8) THEN
+        CALL REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.9) THEN
+        CALL FIXRD(IDEVT,IDEVC,IDBSIZ,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH)
+        GO TO 10
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL)
+      INTEGER REMME,GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+C assumes db is logical consistent
+      WRITE(KBOUT,*)'Remove reading from database'
+      REMME = NGELS
+      CALL GETINT(1,NGELS,REMME,
+     +'Number of reading to remove',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      REMME = IVAL
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        WRITE(KBOUT,*)'No contig line for this reading'
+        IOK = 1
+        RETURN
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      END
+      SUBROUTINE REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+C Routine to remove a reading from a database
+C Cases: 1 left end
+C        2 right end
+C        3 internal and dispensible
+C        4 internal and indispensible
+C if 1 change contig lnbr, contig length, lnbr of rnbr of remme, relpgs
+C if 2 change contig rnbr, contig length, rnbr of lnbr of remme
+C if 3 change contig length, lnbr of rnbr of remme rnbr of lnbr of remme
+C if 4 need to break contig, then as for 1
+C if 1 and 2 then also remove contig line
+C for all cases move gel ngels to remme (if remme/=ngels)
+C and update line idbsiz
+C
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME,CLEN
+      LOGICAL LEFTE,RIGHTE,DISPEN
+      CHARACTER GEL(MAXGEL)
+      EXTERNAL CLEN
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+      DISPEN = .FALSE.
+C
+C Left end ?
+C
+      IF(LNBR(REMME).EQ.0) LEFTE = .TRUE.
+C
+C Right end ?
+C
+      IF(RNBR(REMME).EQ.0) RIGHTE = .TRUE.
+C
+C If both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading and contig'
+        IFROM = NGELS
+        NGELS = NGELS - 1
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,IDEVR)
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+      ELSE IF(LEFTE) THEN
+        WRITE(KBOUT,*)'Removing reading from left end of contig'
+        LNBR(ICONT) = RNBR(REMME)
+        I = 1 - RELPG(RNBR(REMME))
+        WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,RNBR(REMME),ICONT,I)
+        I = LNBR(ICONT)
+        LNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE IF(RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading from right end of contig'
+        RNBR(ICONT) = LNBR(REMME)
+        I = RNBR(ICONT)
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNBR(ICONT))
+        CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +  LNBR(ICONT),RNBR(ICONT))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE
+C Is remme indispensible ?
+        NSTART = RELPG(RNBR(REMME))
+        I = REMME
+10      CONTINUE
+        I = LNBR(I)
+        IF(I.NE.0) THEN
+          IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.NSTART) GO TO 10
+          DISPEN = .TRUE.
+        END IF
+        IF(DISPEN) THEN
+          WRITE(KBOUT,*)
+     +    'Removing dispensible reading from middle of contig'
+          I = LNBR(REMME)
+          RNBR(I) = RNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          I = RNBR(REMME)
+          LNBR(I) = LNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNBR(ICONT))
+          CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +    LNBR(ICONT),RNBR(ICONT))
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        ELSE
+          WRITE(KBOUT,*)
+     +    'Removing indispensible reading from middle of contig'
+          WRITE(KBOUT,*)'So breaking contig first'
+          IR = REMME
+          IL = LNBR(REMME)
+          ILO = LNBR(ICONT)
+          NCONTO = ICONT
+          NCONTR = IDBSIZ - NCONTS - 1
+          CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+          IF(IOK.NE.0) RETURN
+          WRITE(KBOUT,*)'Removing reading from left end of contig'
+          ICONT = IDBSIZ - NCONTS
+          LNBR(ICONT) = RNBR(REMME)
+          I = 1 - RELPG(RNBR(REMME))
+          WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+          CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +    IDBSIZ,RNBR(REMME),ICONT,I)
+          I = LNBR(ICONT)
+          LNBR(I) = 0
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        END IF
+      END IF
+      END
+      SUBROUTINE REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,IDEVR)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME
+C Routine to remove a contig line from a db
+C Loop deals with case of remove top contig
+C Move down all lines from above
+      DO 10 I = REMME,IDBSIZ-NCONTS+1,-1
+        RELPG(I) = RELPG(I-1)
+        LNGTHG(I) = LNGTHG(I-1)
+        LNBR(I) = LNBR(I-1)
+        RNBR(I) = RNBR(I-1)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      NCONTS = NCONTS - 1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      END
+      SUBROUTINE MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +GEL,FROM,TO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+C   Subroutine to move a gel from line from to line to
+C   Extended 22-5-91
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),FROM,TO
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMGEL*10,GEL(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      LOGICAL LEFTE,RIGHTE
+      EXTERNAL GCLIN,CHAINL
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+C
+C left end ?
+C
+      IF(LNBR(FROM).EQ.0) LEFTE = .TRUE.
+C
+C right end ?
+C
+      IF(RNBR(FROM).EQ.0) RIGHTE = .TRUE.
+C
+C if both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          RNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(LEFTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(RIGHTE) THEN
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no right neighbour and does not'
+          WRITE(KBOUT,*)'appear in a contig!'
+        ELSE
+          IF(RNBR(NCONTO).NE.FROM)THEN
+            WRITE(KBOUT,*)
+     +        'This gel has no right neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+          ELSE
+            RNBR(NCONTO) = TO
+            CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +        LNBR(NCONTO),RNBR(NCONTO))
+          END IF
+        END IF
+      END IF
+      RELPG(TO)=RELPG(FROM)
+      LNGTHG(TO)=LNGTHG(FROM)
+      LNBR(TO)=LNBR(FROM)
+      RNBR(TO)=RNBR(FROM)
+      CALL READW(IDEVW,FROM,GEL,MAXGEL)
+      CALL WRITEW(IDEVW,TO,GEL,MAXGEL)
+      CALL READN(IDEVN,FROM,NAMGEL)
+      CALL WRITEN(IDEVN,TO,NAMGEL)
+      CALL WRITER(IDEVR,TO,RELPG(TO),LNGTHG(TO),
+     +LNBR(TO),RNBR(TO))
+C   Do neighbours
+      IF(LNBR(FROM).NE.0) THEN
+        I=LNBR(FROM)
+        RNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      IF(RNBR(FROM).NE.0) THEN
+        I=RNBR(FROM)
+        LNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      CALL MOVTAG(FROM,TO)
+      END
+      SUBROUTINE DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +IDBSIS,IDBSIZ,IERR,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,IDEVT,IDEVC,
+     +MAXGEL,MAXGLM,LLINO,
+     +IDM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER GEL(MAXGLM)
+      INTEGER RELPG(IDBSIS)
+      INTEGER LNGTHG(IDBSIS),LNBR(IDBSIS),RNBR(IDBSIS)
+      CHARACTER NAMPRO*(*),COPYNO*4,HELPF*(*)
+      INTEGER IWORD,ANS
+      PARAMETER (IWORD=4)
+C  NOTE THIS IS THE MACHINES WORD LENGTH IE HOW MANY CHARS PER WORD
+      CALL FILLI(RELPG,IDBSIS,0)
+      CALL FILLI(LNGTHG,IDBSIS,0)
+      CALL FILLI(LNBR,IDBSIS,0)
+      CALL FILLI(RNBR,IDBSIS,0)
+      NAMPRO(1:)=' '
+      IERR=1
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Project name',' ',NAMPRO,L,KBOUT,KBIN,INFLAG)
+      IF(L.LT.1) RETURN
+      LL = L
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      L = 1
+      CALL GTSTR('Version','0',COPYNO,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(COPYNO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) COPYNO(1:1) = '0'
+C   GET .ARN FIRST AS IT CONTAINS LENGTH OF OTHER FILES
+      NAMPRO(LL+1:LL+3)='.AR'
+      NAMPRO(LL+4:LL+4)=COPYNO(1:1)
+       CALL OPENRS(IDEVN,NAMPRO,IOK,3,4)
+      IF(IOK.NE.0)GO TO 100
+C  ******* PROBLEM HERE READING RECORD 1000 AND INTEGER FROM IDEVN!!!
+      READ(IDEVN,REC=IDBSIS,ERR=50)IDBSIZ,MAXGEL,IDM
+      IF(MAXGEL.LT.512)GO TO 50
+      IF(MAXGEL.GT.MAXGLM)GO TO 50
+      IF(MOD(MAXGEL,512).NE.0)GO TO 50
+20    CONTINUE
+      IF((IDM.NE.5).AND.(IDM.NE.26))GO TO 51
+21    CONTINUE
+      NAMPRO(LL+2:LL+3)='RL'
+       CALL OPENRS(IDEVR,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      NAMPRO(LL+2:LL+3)='SQ'
+C  DEFINE RECORD LENGTH IN TERMS OF NUMBER OF CHARS PER WORD (4 ON VAX)
+C  AND MAXGEL SIZE 
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEVW,NAMPRO,IOK,IREC,4)
+      IF(IOK.NE.0)GO TO 100
+C      NAMPRO(LL+2:LL+3) = 'RD'
+C      CALL OPENRS(IDEVRD,NAMPRO,IOK,7,4)
+C      IF(IOK.NE.0) IDEVRD = -9
+      NAMPRO(LL+2:LL+3) = 'TG'
+      CALL OPENRS(IDEVT,NAMPRO,IOK,5,4)
+      IF(IOK.NE.0) IDEVT = -1
+      NAMPRO(LL+2:LL+3) = 'CC'
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+      CALL OPENRS(IDEVC,NAMPRO,IOK,11,4)
+      IF(IOK.NE.0) IDEVC = -1
+C   READ A LINE FOR LUCK
+      CALL READW(IDEVW,1,GEL,MAXGEL)
+      CALL READR(IDEVR,IDBSIZ,NGELS,NCONTS,IDUM1,IDUM2)
+      WRITE(KBOUT,10011)NGELS,NCONTS,IDBSIZ,MAXGEL
+10011 FORMAT(' Number of gel readings=',I3,' Number of contigs=',I3,/,
+     +' Database size=',I4,' Maximum gel reading length=',I4)
+      LLINO = 0
+      IF(NGELS.LT.1)GO TO 5
+      DO 3 I=1,NGELS
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+3     CONTINUE
+      N=IDBSIZ-NCONTS
+      MXT = 0
+      DO 4 I=N,IDBSIZ-1
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        IF(RELPG(I).GT.MXT) THEN
+          MXT = RELPG(I)
+          LLINO = LNBR(I)
+        END IF
+4     CONTINUE
+5     CONTINUE
+      IERR=0
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error encountered opening database files')
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      RETURN
+50    CONTINUE
+      WRITE(KBOUT,2000)MAXGLM
+2000  FORMAT(' Error reading maximum gel reading length',/,
+     +' The maximum gel reading length must be a multiple',
+     +' of 512.',/,' E.G. 512, 1024, 1536, 2048, etc up to',I6)
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MN
+      CALL GETINT(MN,MX,MAXIN1,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(MAXGEL,MAXIN1).NE.0)GO TO 50
+C  PROBLEM WRITING TO RECORD IDBSIZ WITH NO SUBROUTINE CALL!!!!!!!!!!
+      WRITE(IDEVN,REC=IDBSIS)IDBSIZ,MAXGEL
+      GO TO 20
+51    CONTINUE
+      WRITE(KBOUT,2003)
+2003  FORMAT(' Error reading protein or dna label')
+      CALL YESNO(ANS,'DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)IDM = 5
+      IF(ANS.EQ.1)IDM = 26
+C  PROBLEM WRITING TO RECORD IDBSIZ WITH NO SUBROUTINE CALL!!!!!!!!!!
+      WRITE(IDEVN,REC=IDBSIS)IDBSIZ,MAXGEL,IDM
+      GO TO 21
+      END
+      SUBROUTINE DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,
+     +IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +IERR,IHELPS,IHELPE,IDEVH,FILEH,
+     +MAXGEL,MAXGLM,IDM)
+      CHARACTER GEL(MAXGLM),FILEH*(*)
+      CHARACTER NAMPRO*(*)
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+      IERR=1
+3     CONTINUE
+      MN = 0
+      CALL GTSTR('New project name',' ',NAMPRO,MN,KBOUT,KBIN,INFLAG)
+      IF(MN.LT.1) RETURN
+      LL = MIN(12,MN)
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 3
+      END IF
+      MN = 10
+      MX = IDBSIS
+      IDBSIZ = 50
+      CALL GETINT(MN,MX,IDBSIZ,
+     +'Database size',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IDBSIZ = IVAL
+5     CONTINUE
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MIN(512,MAXGEL)
+      CALL GETINT(MN,MX,MAXIN1,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(IVAL,MAXIN1).NE.0) THEN
+        MAXGEL = 512 + (IVAL/512)*512
+        WRITE(KBOUT,*)'Maximum set to',MAXGEL
+      END IF
+      CALL YESNO(IDM,'Database is for DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(IDM.LT.0) RETURN
+      IF(IDM.EQ.0)IDM = 5
+      IF(IDM.EQ.1)IDM = 26
+      NAMPRO(LL+1:LL+4)='.RL0'
+      CALL OPENRS(IDEV1,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+C      IRAW = 0
+C      IDEVRT = IDEVRD
+C      IDEVRD = -9
+C      CALL YESNO(IRAW,'Create raw data pointer file',
+C     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+C      IF(IRAW.LT.0) RETURN
+C      IF(IRAW.EQ.0) THEN
+C        NAMPRO(LL+1:LL+4)='.RD0'
+C        CALL OPENRS(IDEVRT,NAMPRO,IOK,7,3)
+C        IF(IOK.EQ.0) IDEVRD = IDEVRT
+C      END IF
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Writing new database files')
+      J = 0
+      K = 0
+      N = 0
+      M = 0
+      DO 10 I=1,IDBSIZ
+        CALL WRITER(IDEV1,I,J,K,M,N)
+10    CONTINUE      
+      NAMPRO(LL+2:LL+3)='SQ'
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV2,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL FILLC(GEL,MAXGEL,' ')
+      CALL WRITEW(IDEV2,IDBSIZ,GEL,MAXGEL)
+      NAMPRO(LL+2:LL+3)='AR'
+      CALL OPENRS(IDEV3,NAMPRO,IOK,3,3)
+      IF(IOK.NE.0)GO TO 100
+C  NOTE HERE IS A WRITE TO NAMES FILE NOT USING A SUBROUTINE CALL
+      WRITE(IDEV3,REC=IDBSIS)IDBSIZ,MAXGEL,IDM
+C      IF(IDEVRD.GT.0) THEN
+C        NAMFIL = ' '
+C        MTYPE = ' '
+C        I = 0
+C        J = 0
+C        K = 0
+C        CALL WRITRD(IDEVRD,IDBSIZ,I,J,K,MTYPE,NAMFIL)
+C      ENDIF
+C  CREATE TAG FILES (TAGS AND COMMENTS)
+      IF (.TRUE.) THEN
+        NAMPRO(LL+2:LL+3)='TG'
+        CALL OPENRS(IDEVT,NAMPRO,IOK,5,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL WRITTG(IDEVT,IDBSIZ,IDBSIZ,0,0,0,0)
+        NAMPRO(LL+2:LL+3)='CC'
+C  COMMENT_LENGTH: 11 = (40 + long)/long
+        CALL OPENRS(IDEVC,NAMPRO,IOK,11,3)
+        IF(IOK.NE.0)GO TO 100
+        CALL WRITCC(IDEVC,1,1,0,' ')
+      ENDIF
+      WRITE(KBOUT,1003)NAMPRO(1:LL),IDBSIZ
+1003  FORMAT(' Database ',A,' version 0, size',I5,
+     +' successfully started')
+C      WRITE(KBOUT,1006)NAMPRO(1:LL)
+1006  FORMAT(
+     +' Note the 3 database files  are named ',A,'.AR0, .SQ0, .RL0')
+      IERR=0
+      NAMPRO(LL+2:LL+2) = '0'
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error when trying to open database,',
+     +' database not started')
+      RETURN
+      END
+      SUBROUTINE ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,IDEVG,
+     +IDEVT,IDEVC,LINLEN,PERCD,NAMARC,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM,NCONTC)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPS2,HELPE1,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER X,Y,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER NAMARK*10
+      CHARACTER GEL2(MAXGEL)
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(
+C     +' Enter a new gel reading into the database.',//,
+C     +' Note that before a gel reading is entered into the',/,
+C     +' database it must be compared  with the database to ',/,
+C     +' search for overlaps using a function such as auto assemble',/,
+C     +' The replies to all the questions that follow are given',/,
+C     +' by these searches',//)
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Current database size=',I5,' extend with copy')
+      RETURN
+5     CONTINUE
+C      WRITE(KBOUT,1018)
+C1018  FORMAT(
+C     +' We need the name of the file containing the gel reading')
+      CALL OPENF1(IDEVG,NAMARC,0,IOK,KBIN,KBOUT,
+     +'File name of gel reading to enter',
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(NGELS.GT.0) THEN
+        DO 10 J=1,NGELS
+          CALL READN(IDEVN,J,NAMARK)
+          IF(NAMARC(1:10).NE.NAMARK)GO TO 10
+          WRITE(KBOUT,1013)J
+1013      FORMAT( ' Gel reading already in database (number,',I6,
+     +    '), entry aborted')
+          RETURN
+10      CONTINUE
+      END IF
+      IDIM=MAXGEL
+      CALL ARRFIM(IDEVG,GEL,IDIM,KBOUT)
+      CLOSE(UNIT=IDEVG)
+      NGELS=NGELS+1
+      LNGTHG(NGELS)=IDIM
+      NAMARK=NAMARC(1:10)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT( ' This gel reading has been given the number ',I6)
+      IF(IDEVT.GT.0) CALL ENTRD(IDEVG,IDEVRD,NAMARC,NGELS,IOK)
+      ANS=0
+      IF(NGELS.GT.1)THEN
+C        WRITE(KBOUT,1017)
+C1017    FORMAT(
+C     +' We need to know if the gel reading overlaps data already in',
+C     +' the database')
+        CALL YESNO(ANS,'New gel overlaps a contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.LT.0) THEN
+          NGELS = NGELS - 1
+          RETURN
+        END IF
+        IF(ANS.EQ.0)GO TO 100
+      END IF
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      NCONTS=NCONTS+1
+      N=IDBSIZ-NCONTS
+      LNBR(N)=NGELS
+      RNBR(N)=NGELS
+      RELPG(N)=(IDIM)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RETURN
+C
+100   CONTINUE
+C      WRITE(KBOUT,1019)
+C1019  FORMAT(
+C     +' We need to know if the new gel reading overlaps in the',/,
+C     +' normal or complementary sense')
+C   COMPLEMENT GEL IF REQUIRED
+      CALL YESNO(ANS,'Gel overlaps in normal sense',
+     +HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      IF(ANS.EQ.1) THEN
+        CALL SQREV(GEL,IDIM)
+        CALL SQCOM(GEL,IDIM)
+        LNGTHG(NGELS)=-1*LNGTHG(NGELS)
+      END IF
+C      WRITE(KBOUT,1020)
+C1020  FORMAT(
+C     +' We need to know which contig the new gel reading overlaps')
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,NCONTC,
+     +IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IERR.NE.0) THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+C      WRITE(KBOUT,1014)
+C1014  FORMAT(
+C     +' We need information about the position of the overlap.',/,
+C     +' First there are two types: those that extend the contig',/,
+C     +' leftwards and those that start internally.')
+        CALL YESNO(ANS,'Left end of gel reading is inside contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0)THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+      IF(ANS.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+C      WRITE(KBOUT,1015)
+C1015  FORMAT(' Now we need to know the exact position of the overlap')
+160   CONTINUE
+      MN = 1
+      MX = RELPG(LINCON)
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in contig of left end of gel reading',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 160
+      N=NCONTC
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+      IF(RNBR(N).EQ.0)GO TO 350
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   DISPLAY JOINT
+      NOPT = 1
+C @jkb@
+C      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+C     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+C     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+C   CHECK ON RESULT
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 260
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+260   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(LNBR(N))=NGELS
+      K=LNBR(N)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   DSPLAY JOINT
+      NOPT = 1
+C @jkb@
+C      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+C     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+C     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 360
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+360   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      RNBR(LINCON)=NGELS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+C      WRITE(KBOUT,1015)
+410   CONTINUE
+      MN = 1
+      MX = IDIM
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in new gel reading of left end of contig',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 410
+C   SET FLAG TO SHOW LEFT END OVERLAP
+      NOPT=2
+C @jkb@
+C      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+C     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+C     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+C     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 400
+      IF(NOPT.EQ.3)GO TO 420
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+420   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+      LNBR(LINCON)=NGELS
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      GO TO 440
+      END
+C      SUBROUTINE XENTRD(IDEVG,IDEVRD,NAMARC,NGEL,IOK)
+C      CHARACTER NAMARC*(*),NAMFIL*12,MTYPE*4
+CC Enters raw data info into database
+C      IOK = 0
+C      IF(IDEVRD.GT.0) THEN
+C        CALL OPENRS(IDEVG,NAMARC,IOK,LRECL,2)
+CC        IF(IOK.EQ.0) THEN
+C          CALL RRD(IDEVG,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C          CALL WRITRD(IDEVRD,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+CC        END IF
+C      END IF
+C      END
+      SUBROUTINE FIXRD(IDEVT,IDEVC,IDBSIZ,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH)
+C FILE_NAME_LENGTH
+      CHARACTER NAMFIL*18,NEWNAM*18,MTYPE*4,NEWMT*4,FILEH*(*)
+      IF(IDEVRD.LT.0) THEN
+        WRITE(KBOUT,*)'No raw data file!'
+        RETURN
+      END IF
+ 10   CONTINUE
+C  Change raw data record
+        MN = 0
+        MX = IDBSIZ-1
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IF(IVAL.EQ.0) RETURN
+        LNO = IVAL
+        CALL READRD(IDEVT,IDEVC,LNO,LENR,LCUT,LENW,MTYPE,NAMFIL)
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)LENR,LCUT,LENW,MTYPE,NAMFIL
+1001    FORMAT(' ',3I6,' ',A,' ',A)
+        MN = 1
+        MX = 9999
+        LX  = LENR
+        CALL GETINT(MN,MX,LX,
+     +  'Length raw sequence',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        LX = IVAL
+        MN = 1
+        MX = LX
+        L  = LCUT
+        CALL GETINT(MN,MX,L,
+     +  'Left cutoff',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 1
+        MX = LX
+        M  = LENW
+        CALL GETINT(MN,MX,M,
+     +  'Length of original working sequence',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        M = IVAL
+ 20     CONTINUE
+        LNAM = 4
+        CALL GTSTR('Machine type',
+     +  MTYPE,NEWMT,LNAM,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 20
+        END IF
+        IF(INFLAG.EQ.3) NEWMT = MTYPE
+ 30     CONTINUE
+C FILE_NAME_LENGTH
+        LNAM = 18
+        CALL GTSTR('Name for raw data file',
+     +  NAMFIL,NEWNAM,LNAM,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 30
+        END IF
+        IF(INFLAG.EQ.3) NEWNAM = NAMFIL
+        WRITE(KBOUT,1001)LX,L,M,NEWMT,NEWNAM
+        CALL WRITRD(IDEVT,IDEVC,LNO,LX,L,M,NEWMT,NEWNAM)
+        WRITE(KBOUT,*)'New record written to disk'
+        GO TO 10
+        END
+C      SUBROUTINE READRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C      CHARACTER MTYPE*(*),NAMFIL*(*)
+C      IF(IDEVT.GT.0)READ(IDEVRD,REC=NGEL)LENR,LCUT,LENW,MTYPE,NAMFIL
+C      END
+C      SUBROUTINE RRD(IDEVG,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C      CHARACTER LINE*80,NAMFIL*(*),MTYPE*(*)
+C Reads raw data info from reading file assuming it starts with ;
+C 1    CONTINUE
+C      READ(IDEVG,1000,END=100,ERR=100)LINE
+C 1000 FORMAT(A)
+C      IF(LINE(1:1).NE.';') GO TO 1
+C      READ(LINE,1001,ERR=100)LENR,LCUT,LENW,MTYPE,NAMFIL
+C 1001 FORMAT(1X,3I6,A,A)
+C      RETURN
+C 100  CONTINUE
+C      LENR = 0
+C      LCUT = 0
+C      LENW = 0
+C      MTYPE = ' '
+C      NAMFIL = ' '
+C      END
+C      SUBROUTINE WRITRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C      CHARACTER MTYPE*(*),NAMFIL*(*)
+C      IF(IDEVT.GT.0)WRITE(IDEVRD,REC=NGEL)LENR,LCUT,LENW,MTYPE,NAMFIL
+C      END
+      SUBROUTINE READRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+      CHARACTER MTYPE*(*),NAMFIL*(*)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      IF(IDEVT.GT.0)THEN
+        CALL READTG(IDEVT,NGEL,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        CALL READCC(IDEVC,LCOM,ICNT,NEXT,NOTE)
+        READ(NOTE,1001,ERR=100)LENR,LCUT,LENW,MTYPE,NAMFIL
+ 1001   FORMAT(3I6,A,A)
+      ENDIF
+      RETURN
+ 100  CONTINUE
+      LENR = 0
+      LCUT = 0
+      LENW = 0
+      MTYPE = ' '
+      NAMFIL = ' '
+      END
+      SUBROUTINE WRITRD(IDEVT,IDEVC,NGEL,LENR,LCUT,LENW,MTYPE,NAMFIL)
+      CHARACTER MTYPE*(*),NAMFIL*(*)
+      INTEGER FREECC
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+      IF(IDEVT.GT.0)THEN
+        CALL READTG(IDEVT,NGEL,LPOS,LLEN,LCOM,LTYPE,NEXT)
+        IF(LCOM.EQ.0)THEN
+          LCOM = FREECC(IDEVC)
+        ENDIF
+        WRITE(NOTE,1001,ERR=100)LENR,LCUT,LENW,MTYPE,NAMFIL
+ 1001   FORMAT(3I6,A,A)
+        NEXT = 0
+        CALL WRITCC(IDEVC,LCOM,ICNT,NEXT,NOTE)
+      ENDIF
+      RETURN
+ 100  CONTINUE
+      END
+      SUBROUTINE READTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+      IF(IDEVT.GT.0) THEN
+        READ(IDEVT,REC=I)LPOS,LLEN,LCOM,LTYPE,NEXT
+      ENDIF
+      END
+      SUBROUTINE WRITTG(IDEVT,I,LPOS,LLEN,LCOM,LTYPE,NEXT)
+      IF (IDEVT.GT.0) THEN
+        WRITE(IDEVT,REC=I)LPOS,LLEN,LCOM,LTYPE,NEXT
+      ENDIF
+      END
+      SUBROUTINE READCC(IDEVC,I,ICNT,NEXT,NOTE)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+C  COMMENT_LENGTH - 4
+      CHARACTER DUMM*36
+      IF(IDEVC.GT.0)THEN
+        READ(IDEVC,REC=1)NEXT,ICNT,DUMM
+        IF(I.EQ.0.OR.I.GT.ICNT)THEN
+          NEXT = 0
+          NOTE = ' '
+        ELSE
+          READ(IDEVC,REC=I)NEXT,NOTE
+        ENDIF
+      ENDIF
+      END
+      SUBROUTINE WRITCC(IDEVC,I,ICNT,NEXT,NOTE)
+C  COMMENT_LENGTH
+      CHARACTER NOTE*40
+C  COMMENT_LENGTH - 4
+      CHARACTER DUMM*36
+      IF(IDEVC.GT.0)THEN
+        IF(I.EQ.1) THEN
+          WRITE(IDEVC,REC=1)NEXT,ICNT,DUMM
+        ELSE
+          READ(IDEVC,REC=1)IDUM,ICNT,DUMM
+          IF(I.LE.ICNT) WRITE(IDEVC,REC=I)NEXT,NOTE
+        ENDIF
+      ENDIF
+      END
+      SUBROUTINE PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,NC,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),POSN,X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+C   NOW GET THIS GEL FROM DISK
+C      READ(IDEVW,REC=LLINO)GEL
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      X=POSN-RELPG(LLINO)+1
+      K=X
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+C   CHECK FOR OVER END OF ARRAY
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,1000)LLINO
+1000    FORMAT(
+     +' Data pushed off end of gel',I4,' during padding')
+        M=M-(N-MAXGEL)
+        N=MAXGEL
+        LNGTHG(LLINO)=SIGN(MAXGEL,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        GEL(I)=PAD
+60    CONTINUE
+C   WRITE BACK TO DISK
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=LLINO)GEL
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+C      WRITE(IDEVR,REC=LLINO)
+C     1RELPG(LLINO),LNGTHG(LLINO),LNBR(LLINO),RNBR(LLINO)
+C
+C   NOW UPDATE TAG FILES ACCORDINGLY
+      CALL PADTAG(LLINO,K,NC,LNGTHG(LLINO))
+C
+65    CONTINUE
+C   NOW GET NEXT GEL
+      LLINO=RNBR(LLINO)
+C   LAST GEL?
+      IF(LLINO.EQ.0)GO TO 70
+C   DOES IT HAVE DATA IN REGION?
+C   IE DO RELPG  AND RELPG+LNGTHG-1 LIE EITHER SIDE OF POSN?
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+C  WITHIN
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+NC
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+C      WRITE(IDEVR,REC=LLINO)
+C     1RELPG(LLINO),LNGTHG(LLINO),LNBR(LLINO),RNBR(LLINO)
+      GO TO 76
+90    CONTINUE
+C   NEED TO INCREMENT CONTIG LINE
+      RELPG(LINCON)=RELPG(LINCON)+NC
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+      RETURN
+      END
+      SUBROUTINE AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER ANS,WINDOW
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      IFAIL(1) = 0
+      IF(NGELS.LT.1) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MN = 2
+      MX = 5
+      MINSLI = MAX(3,MN)
+      CALL GETINT(MN,MX,MINSLI,
+     +'Minimum alignment block',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINSLI = IVAL
+      MN = 0
+      MX = 50
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per sequence',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MAXPC = IVAL
+        RMN = 0.
+        RMX = 100.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      MN = MINMAT
+      MX = MAXGEL
+      WINDOW = 100
+      CALL GETINT(MN,MX,WINDOW,
+     +'Probe length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+       IOK = 1
+       I = 0
+       CALL YESNO(I,'Use clipped data',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+       IF(I.LT.0) RETURN
+       IWING = 0
+       IF(I.EQ.0) THEN
+         MN = 1
+         MX = MAXGEL
+         IWING = 100
+         CALL GETINT(MN,MX,IWING,
+     +   'Window size for good data scan',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) RETURN
+         IWING = IVAL
+         MN = 1
+         MX = MIN(100,IWING)
+C Note nbad must be compatible with nok2
+         NBAD = MIN(IWING,5)
+         CALL GETINT(MN,MX,NBAD,
+     +   'Maximum number of dashes in scan window',
+     +   IVAL,KBIN,KBOUT,
+     +   IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) RETURN
+         NBAD = IVAL
+       END IF
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD,SEQ5,
+     +CENDS,NENDS,MAXCON,IWING,NBAD)
+      IDIM2=MAXGEL
+C
+C Note I am doing something horrible here to save space:
+C sending cends and nends to jcons to get the extension lengths
+C then swapping them with temp3 by sending temp to ajoin and
+C receiving them as cends and nends, and sending cends nends
+C and receiving them as iladd,iradd
+      CALL AUTOJN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,TEMP3,TEMP3(MAXCON+1,1),MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,CENDS,NENDS,RELPG,LNBR,IDBSIZ,NCONTS)
+      END
+      SUBROUTINE AUTOJN(SEQ1,IDIMIN,GEL,IDIMGI,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,ILADD,IRADD,RELPG,LNBR,IDBSIZ,NCONTS)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIMIN),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON),ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIMIN),GEL(MAXGLM)
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 10)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC),MCON(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),WINDOW
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ1,IDIMIN,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C find possible missed joins
+C we have consensus in seq1 in order first contig,second contig etc
+C compare the ends in reverse order, simultaneously shortening the consensus
+      IDIM = IDIMIN
+      DO 200 JCON = IDCEND,2,-1
+        IDIM = CENDS(JCON) - 1
+        CALL ENCO(SEQ1,IDIM,POSNS,CONST,LENGTH)
+        CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+        JS = CENDS(JCON) + 20
+        JE = CENDS(JCON+1) - 1
+        IEND = 1
+        IDIMG = MIN(WINDOW,JE-JS+1)
+C check for case where contig is shorter than probe (window)
+C in which case only compare the left hand end
+         IF(JE-JS+1.LE.WINDOW) IEND = 2
+C        WRITE(*,*)JCON,JS,JE,IDIMG
+1       CONTINUE
+        IMATC = 0
+        IF(IEND.EQ.1) THEN
+          CALL SQCOPY(SEQ1(JE-IDIMG+1),GEL,IDIMG)
+        ELSE
+          CALL SQCOPY(SEQ1(JS),GEL,IDIMG)
+        END IF
+        CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+2     CONTINUE
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0) THEN
+         CALL ERROM(KBOUT,'Error in CFGEL')
+         RETURN
+      END IF
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT,MCON)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+C      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.NE.0) THEN
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+C        IF(IEND.EQ.1) WRITE(KBOUT,*)'Left end of Contig',NENDS(JCON)
+C        IF(IEND.EQ.2) WRITE(KBOUT,*)'Right end of Contig',NENDS(JCON)
+C        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINEJ(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS,
+     +  NENDS(JCON),IEND,JLLINO(I),JSENSE(I),ILADD,IRADD,MAXCON,MCON(I),
+     +  JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+100   CONTINUE
+      END IF
+      IEND = IEND + 1
+      IF(IEND.EQ.2) GO TO 1
+C      IDIM = CENDS(JCON)-1
+ 200  CONTINUE
+      END
+      SUBROUTINE ALINEJ(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM,JCONN,IEND,NCON,JSENSE,ILADD,IRADD,MAXCON,MCON,JCON,
+     +RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV)
+      INTEGER ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ),WINDOW
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+C      WRITE(KBOUT,1052)ITOTPC,ITOTPG
+C1052  FORMAT(' Padding in contig= ',I4,' and in gel= ',I4)
+      CALL JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   ARE ALL CHECKS OK?
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(PERCM.GT.PERMAX)IFAIL=1
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH,STRAND,NAME1*6,NAME2*6
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      INTEGER CLINNO,WINDOW
+      EXTERNAL CLINNO
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+C
+C  where are the overlaps?
+C
+C
+C
+C  if ITYPE is 1 the overlap starts within the reading at JOINT
+C  else it starts at the left end of the reading at JOINT in the contig
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(
+     +' Matching region too long for routine dalign. Alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF(X.LT.PERMAX) THEN
+        IC = CLINNO(LNBR,IDBSIZ,NCONTS,JCONN)
+        IF(IC.EQ.0) THEN
+          IFAIL = 99
+          RETURN
+        END IF
+        IF(JSENSE.EQ.-1) THEN
+          STRAND = '-'
+          IF(IEND.EQ.1) THEN
+C
+C  probe is complement of right hand end of contig. Give posns assuming
+C  this contig is complemented.
+C
+            IENDGT = IENDG - IRADD(JCON)
+C
+C Next line gives posns relative to original orientation
+C            IENDGT = RELPG(IC) + IRADD(JCON) - IENDG + 1
+          ELSE
+C
+C  probe is complement of left end of contig. Give posns assuming this
+C  contig is going to be complemented.
+C
+            IENDGT = RELPG(IC) - WINDOW + ILADD(JCON) + IENDG
+          END IF
+        ELSE
+          STRAND = '+'
+          IF(IEND.EQ.2) THEN
+C
+C  probe is left hand end of contig in original sense
+C
+            IENDGT = IENDG - ILADD(JCON)
+          ELSE
+C
+C  probe is right hand end of contig in original sense
+C
+            IENDGT = RELPG(IC) + IRADD(JCON) - WINDOW + IENDG
+          END IF
+        END IF
+        IENDCT = IENDC - ILADD(MCON)
+          WRITE(KBOUT,*)
+     +    ' Possible join between contig ',JCONN,' in the ',
+     +    STRAND,' sense and contig ',NCON
+        WRITE(KBOUT,1000)X
+1000    FORMAT(' Percentage mismatch after alignment = ',F4.1)
+        WRITE(NAME1,1002)JCONN
+        WRITE(NAME2,1002)NCON
+ 1002   FORMAT(I6)
+        CALL FMT4LP(SEQC2(1),SEQG2(IENDG),LO,IENDCT,IENDGT,KBOUT,
+     +  NAME2,NAME1)
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      END
+      SUBROUTINE ADISM6(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C new bit need to save contig number for alinej
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        MCON(IMATC) = JJ
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+C      JCONS
+      SUBROUTINE JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,ANS,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD,TGEL,ILADD,IRADD,MAXCON,
+     +IWIN,NBAD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ,ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),TGEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CALL BUSY(KBOUT)
+C      IF(IWIN.GT.0) PERC = REAL(NBAD)/REAL(IWIN)
+      IEND = 1
+      IFAIL=0
+      N=IDBSIZ-NCONTS
+      NCONS = 0
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        ISTART=ISTART+1
+        IF((ISTART+19+Y+2*MAXGEL).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded',/,
+     +    ' calculation aborted')
+          IFAIL=1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        NCONS = NCONS + 1
+        IDIN = 0
+        IF(IWIN.GT.0) THEN
+          IDIN = MAXGEL
+          CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,1,
+     +    GEL,TGEL,IDIN,IWIN,NBAD)
+          IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART),IDIN)
+        END IF
+        ILADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +  IDM,PERCD)
+        ISTART=ISTART+Y-1
+        IDIN = 0
+        IF(IWIN.GT.0) THEN
+          IDIN = MAXGEL
+          CALL GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,I,2,
+     +    GEL,TGEL,IDIN,IWIN,NBAD)
+          IF(IDIN.GT.0)CALL SQCOPY(GEL,SEQ1(ISTART+1),IDIN)
+        END IF
+        IRADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+110   CONTINUE
+      END
+      SUBROUTINE GETEX(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,ICONT,IEND,
+     +GEL,GELT,ID,IWIN,NBAD)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ),RELPG(IDBSIZ)
+      CHARACTER GEL(ID),GELT(ID)
+      EXTERNAL NOK2
+      MAXGEL = ID
+      IDT = 0
+C routine to find a possible extension to a contig by looking in a tag file
+C contig ICONT end IEND = 1 =left 2=right
+C return data in GEL, of length ID
+C The worst aspect of this is that if we dont choose the very end reading
+C we dont know where it lies relative to the consensus. For now just assume
+C there are no length difference between the extension and the consensus
+C and just add it on the end
+      IF(IEND.EQ.1) THEN
+        LMOST = 1
+        IGEL = LNBR(ICONT)
+ 10     CONTINUE
+        IF(IGEL.EQ.0) GO TO 100
+        IF(RELPG(IGEL).GT.MAXGEL) GO TO 100
+        IF(LNGTHG(IGEL).LT.0) THEN
+          ID = MAXGEL
+          CALL GETEXT(IGEL,GELT,ID,IOK)
+          IF(IOK.EQ.0) THEN
+C  look for number of n's here and change id accordingly
+            K =  NOK2(GELT,ID,IWIN,NBAD)
+            LT = MIN(LMOST,RELPG(IGEL)-K)
+            IF(LT.LT.LMOST) THEN
+              IS = RELPG(IGEL)
+              N = K - IS + 1
+              CALL SQCOPY(GELT(IS),GEL,N)
+              CALL SQREV(GEL,N)
+              CALL SQCOM(GEL,N)
+              IDT = N
+              LMOST = LT
+            END IF
+          ELSE
+C            WRITE(*,*)'COCKUP IN GETEXT, gel',IGEL
+          END IF
+        END IF
+        IGEL = RNBR(IGEL)
+        GO TO 10
+      ELSE
+        IGEL = RNBR(ICONT)
+        LMOST = RELPG(ICONT)
+        IDC = RELPG(ICONT)
+ 20     CONTINUE
+        IF(IGEL.EQ.0) GO TO 100
+        IF(LMOST-RELPG(IGEL).GT.MAXGEL) GO TO 100
+        IF(LNGTHG(IGEL).GT.0) THEN
+          ID = MAXGEL
+          CALL GETEXT(IGEL,GELT,ID,IOK)
+          IF(IOK.EQ.0) THEN
+            K =  NOK2(GELT,ID,IWIN,NBAD)
+            LT = MAX(LMOST,RELPG(IGEL)+LNGTHG(IGEL)+K-1)
+            IF(LT.GT.LMOST) THEN
+              IS = RELPG(ICONT) - (RELPG(IGEL) + LNGTHG(IGEL)) + 2
+              N = K - IS + 1
+              CALL SQCOPY(GELT(IS),GEL,N)
+              IDT = N
+              LMOST = LT
+            END IF
+          ELSE
+C            WRITE(*,*)'COCKUP IN GETEXT, GEL',IGEL
+          END IF
+        END IF
+        IGEL = LNBR(IGEL)
+        GO TO 20
+      END IF
+ 100  CONTINUE
+      ID = IDT
+      END
+      INTEGER FUNCTION NOK2(GEL,ID,IWIN,NBADIN)
+      CHARACTER GEL(ID)
+      PARAMETER (MAXPOS = 101)
+      INTEGER POSNS(MAXPOS),R
+      EXTERNAL KWRAP
+C count N's over a window of iwin, return position
+C when over NBAD
+C INIT
+      NBAD = NBADIN + 1
+      I = 0
+      N = 0
+      NOK2 = ID
+      L = 1
+      R = 0
+      IF(NBAD.GT.MAXPOS)THEN
+        WRITE(*,*)'Scream: nok2 not happy'
+        RETURN
+      END IF
+ 10   CONTINUE
+      I = I + 1
+      IF(I.GT.ID) RETURN
+      IF(GEL(I).EQ.'-') THEN
+        N = N + 1
+        R = KWRAP(R,NBAD)
+        POSNS(R) = I
+        IF(N.GE.NBAD) THEN
+          IF(POSNS(R)-POSNS(L)+1.LT.IWIN) THEN
+            NOK2 = POSNS(L)
+            RETURN
+          END IF
+          L = KWRAP(L,NBAD)
+        END IF
+      END IF
+      GO TO 10
+      END
+      INTEGER FUNCTION KWRAP(I,J)
+      IT = I + 1
+      IF(IT.GT.J) IT = 1
+      KWRAP = IT
+      END
diff --git a/src/staden/dbsysold.f b/src/staden/dbsysold.f
new file mode 100644
index 0000000..6bfa0e8
--- /dev/null
+++ b/src/staden/dbsysold.f
@@ -0,0 +1,2509 @@
+C   dbsys routines that were changed when starting to use .rd files
+C         these versions are the pre .rd ones and afre used by sap and xsap
+C         post .rd equivalents are in dbsysnew.f
+C 8-4-92 removed prompt for minsli in dbauto
+C 22-5-91 added new remove gel reading function. Changed movgel, breakc
+C   21-8-91 Added routines to find internal overlaps
+C   8-11-91 Fixed bugs in routines to find internal joins
+C  2-3-92  Set filnam = ' ' for some call to openf1
+C
+C   SUBROUTINE TO ENTER NEW GEL SEQUENCES INTO DATA BASE.
+C   IT READS IN AN ARCHIVE VERSION AND WRITES OUT A WORKING VERSION.
+C   IT ALSO SETS UP ANY RELATIONSHIPS WITH OTHER DATA IN THE DATABASE
+C   BOTH BY POSITION IN A CONTIG AND POINTERS TO LEFT AND RIGHT
+C   NEIGHBOURS.
+      SUBROUTINE AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,NAMARC,X,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM,IDC,NCONTC,LINCON,IFAIL,IDBSIZ,KBOUT,IDEVR,IDEVW,IDEVN,
+     +MAXGEL)
+C   AUTHOR: RODGER STADEN
+      INTEGER  RELPG(IDBSIZ),X,Y
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER SEQC2(IDC)
+      CHARACTER NAMARK*10
+C      WRITE(*,*)'X,ITYPE,ISENSE,IDIM,IDC'
+C      WRITE(*,*)X,ITYPE,ISENSE,IDIM,IDC
+C   SET FAIL FLAG
+      IFAIL=0
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' TRYING TO ENTER NEW GEL READING INTO DATABASE')
+C   IS THERE SPACE?
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+C   FULL
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Database full, current size=',I6,' Extend with copy')
+      IFAIL=7
+      RETURN
+5     CONTINUE
+C   NEED TO CHECK TO SEE IF GEL ALREADY IN DB
+C   LOOK THRU ARC FILE
+      DO 10 J=1,NGELS
+C      READ(IDEVN,REC=J)NAMARK
+      CALL READN(IDEVN,J,NAMARK)
+      IF(NAMARK.NE.NAMARC(1:10))GO TO 10
+C   FOUND
+      WRITE(KBOUT,1013)J
+1013  FORMAT(' New gel already in database with number',I6,
+     +' Entry aborted')
+      IFAIL=6
+      RETURN
+10    CONTINUE
+C   INCREMENT NUMBER OF GELS
+      NGELS=NGELS+1
+C   SET LENGTH THIS GEL
+      LNGTHG(NGELS)=IDIM*ISENSE
+C   WRITE NAME OF ARCHIVE TO LIST OF ARCHIVES
+C   NAMPRO,ARC
+C      WRITE(IDEVN,REC=NGELS)NAMARC(1:10)
+      NAMARK=NAMARC(1:10)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT(' This gel reading has been given the number ',I6)
+C   WRITE GEL TO WORKING VERSION
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=NGELS)GEL
+C
+C
+C   SET UP RELATIONSHIPS
+C   DOES THIS GEL OVERLAP?
+      IF(ITYPE.NE.0)GO TO 100
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+C   SET LEFT AND RIGHT POINTERS TO ZERO,RELPG TO 1
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C
+C   SET CONTIG POINTERS AND GENERAL VALUES
+C   INCREMENT NUMBER OF CONTIGS
+      NCONTS=NCONTS+1
+C   POINTER TO THIS CONTIG
+      N=IDBSIZ-NCONTS
+C   POINTER TO LEFT GEL THIS CONTIG
+      LNBR(N)=NGELS
+C   POINTER TO RIGHT GEL THIS CONTIG
+      RNBR(N)=NGELS
+C   LENGTH OF CONTIG
+      RELPG(N)=IDIM
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   WRITE DB DESCRIPTOR
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      RETURN
+C
+100   CONTINUE
+C
+C
+C   DOES OVERLAP
+150   CONTINUE
+C
+C   LEFT END OR RIGHT OVERLAP?
+      IF(ITYPE.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+160   CONTINUE
+C   NEED TO SEARCH THRU THIS CONTIG TO FIND LEFT AND RIGHT
+C   NEIGHBOURS FOR THIS NEW GEL
+C   LINE NUMBER OF LEFT END OF CONTIG
+      N=NCONTC
+C   LOOK THRU UNTIL CURRENT IS >= THEN IT MUST BE THE PREVIOUS ONE
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+C   IS THIS THE LAST GEL IN CONTIG?
+      IF(RNBR(N).EQ.0)GO TO 350
+C   NO SO LOOK AT NEXT
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS IN NEW GEL
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C   SET POINTERS  IN LEFT AND RIGHT NEIGHBOURS
+      K=LNBR(N)
+      RNBR(K)=NGELS
+C      RNBR(LNBR(N))=NGELS
+C   WRITE LEFT AND RIGHT NEIGHBOURS
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+C      WRITE(IDEVR,REC=K)RELPG(K),LNGTHG(K),LNBR(K),RNBR(K)
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C   WRITE NGELS NCONTS
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   ITS LINE NUMBER IS LINCON
+C   NEED TO UPDATE IDIM IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+C   WRITE NEW CONTIG LINE
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,X,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+C
+C   SET POINTERS FOR NEW GEL
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)
+C     1RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),RNBR(NGELS)
+C   OLD RIGHT END
+      RNBR(N)=NGELS
+C   WRITE NEW RIGHT LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+C
+C   RESET RIGHT NAME IN CONTIG
+C   ITS LINE NUMBER IS LINCON
+      RNBR(LINCON)=NGELS
+C   HAVE WE INCREASED LENGTH OF CONTIG?
+C   NEED TO UPDATE LENGTH OF GEL IN CASE OF EDITS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+IDIM-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+C   WRITE HERE
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON)
+C     1,RNBR(LINCON)
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+410   CONTINUE
+C   NEED TO EDIT DB HERE
+      IF(ITOTPC.GT.0)CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     1NGELS,NCONTS,
+     2GEL,LINCON,1,SEQC2,ITOTPC,IDC,IDBSIZ,KBOUT,IDEVR,IDEVW,
+     +MAXGEL)
+C
+420   CONTINUE
+C   SET POINTERS IN NEW GEL
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+C   WRITE NEW GEL LINE
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+C      WRITE(IDEVR,REC=NGELS)RELPG(NGELS),LNGTHG(NGELS),LNBR(NGELS),
+C     1RNBR(NGELS)
+C   SET POINTERS IN OLD LEFT END
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+C   WRITE NEW LEFT END
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+C      WRITE(IDEVR,REC=NCONTC)RELPG(NCONTC),LNGTHG(NCONTC),LNBR(NCONTC)
+C     1,RNBR(NCONTC)
+C   NEW LENGTH OF CONTIG
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+C   MAY HAVE JUST ADDED A GEL LONGER THAN CONTIG
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+C   NEW NAME OF LEFT END OF CONTIG
+      LNBR(LINCON)=NGELS
+C   WRITE CONTIG DESCRIPTOR
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON)
+C     1,RNBR(LINCON)
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+C      WRITE(IDEVR,REC=IDBSIZ)NGELS,NCONTS
+C   NOW GO THRU AND CHANGE ALL RELATIVE POSITIONS
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+C      WRITE(IDEVR,REC=N)RELPG(N),LNGTHG(N),LNBR(N),RNBR(N)
+      GO TO 440
+      END
+      SUBROUTINE DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB),PL(2),PR(2),RMOST
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LINCON(2),LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER LREG,RREG,X,ANS,ANSJOK
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER GET,SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      SAVE GET
+      DATA GET/'>'/
+      WRITE(KBOUT,*)' Automatic sequence assembler'
+      IFAIL(1) = 0
+      IEMPTY=0
+      IF(NGELS.LT.1)IEMPTY=1
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      CALL YESNO(IOKENT,'Permit entry',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOKENT.LT.0) RETURN
+      CALL YESNO(INF,'Use file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(INF.LT.0) RETURN
+      IF(INF.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV7,FILE,0,IOK,KBIN,KBOUT,
+     +  'File of gel reading names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      IF(IOKENT.EQ.0) THEN
+        FILE = ' '
+        CALL OPENF1(IDEV8,FILE,1,IOK,KBIN,KBOUT,
+     +  'File for names of failures',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+      END IF
+      PROMPT(1) = 'Perform normal shotgun assembly'
+      PROMPT(2) = 'Put all sequences in one contig'
+      PROMPT(3) = 'Put all sequences in new contigs'
+      IOPT = 1
+      CALL RADION('Select entry mode',PROMPT,3,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+      IF(IOPT.EQ.1) THEN
+C  parameters for normal assembly
+      ANSJOK = 0
+      CALL YESNO(ANSJOK,'Permit joins',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ANSJOK.LT.0) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MINSLI = 3
+      MN = 0
+      MX = 25
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per gel',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MN = 0
+      MX = 25
+      MAXPC = 8
+      CALL GETINT(MN,MX,MAXPC,
+     +'Maximum pads per gel in contig',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPC = IVAL
+      IF(IOKENT.EQ.0) THEN
+        RMN = 0.
+        RMX = 15.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      END IF
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      IF(IEMPTY.EQ.0)
+     +CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      END IF
+      JGEL = 0
+      JNGEL = 0
+      JNJOIN = 0
+      IMATC = 0
+      IF(IFAIL(1).NE.0)GO TO 900
+C
+C
+1     CONTINUE
+C
+C
+      IDIM2=MAXGEL
+      IF(INF.EQ.1) THEN
+3       CONTINUE
+        MN = 0
+        CALL GTSTR('Gel reading name',' ',NAMARC,MN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.3) RETURN
+        IF(INFLAG.EQ.2) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+      ELSE
+        READ(IDEV7,1002,END=900)NAMARC
+      END IF
+1002  FORMAT( A)
+      DO 77 MM=1,80
+        MATCH(MM)=GET
+77    CONTINUE
+      WRITE(IDEV,1077)(MATCH(KK),KK=1,79)
+1077  FORMAT(' ',79A1)
+      JGEL = JGEL + 1
+      WRITE(IDEV,*)'Processing',JGEL,' in batch'
+1007  FORMAT(' Gel reading name=',A)
+      WRITE(IDEV,1007)NAMARC
+      CALL OPENRS(IDEV4,NAMARC,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        IF(INF.EQ.1) RETURN
+        WRITE(IDEV,*)' Error opening gel reading file'
+        GO TO 1
+      END IF
+      CALL ARRFIM(IDEV4,SEQ2,IDIM2,KBOUT)
+      CLOSE(UNIT=IDEV4)
+      WRITE(IDEV,1800)IDIM2
+1800  FORMAT(' Gel reading length=',I6)
+      IF(IOPT.NE.1) THEN
+        CALL DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +  IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +  IDEV1,IDEV2,IDEV3,MAXGEL,IMATC,IEMPTY,IOPT)
+        IF(IFAIL(1).NE.0) GO TO 800
+        JNGEL = JNGEL + 1
+        GO TO 1
+      END IF
+      IF(IDIM2.LT.MINMAT)THEN
+        WRITE(IDEV,*)
+     +' Reading shorter than minimum match, so not processed'
+        GO TO 1
+      END IF
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      IFCOMP=0
+      IMATC=0
+      IF(IEMPTY.EQ.0)
+     +CALL AUTOCN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL)
+      IF(IOKENT.NE.0) GO TO 1
+C     THIS RETURNS THE FOLLOWING:
+C     ILEFTS  POSITION IN CONSENSUS OF LEFT END OF MATCHING CONTIGS
+C     ILC     LENGTHS OF MATCHING CONTIGS
+C     IPOSC   POSITION OF MATCH RELATIVE TO CONTIG
+C     IPOSG   POSITION OF MATCH RELATIVE TO NEW GEL
+C     ISENSE  SENSE OF NEW GEL
+C     LLINO   LEFT GEL NUMBER IN MATCHING CONTIGS
+C     IMATC   THE NUMBER OF MATCHING CONTIGS (>2 IS ERROR!)
+C     IFCOMP  ERROR FLAG FOR COMPARISON (COMPARISON ARRAYS OVERFLOWED)
+      IF(IFCOMP.NE.0)GO TO 800
+      CALL SQCOPY(SEQ3,SEQ2,IDIM2)
+      IF(IMATC.GT.0)GO TO 200
+C
+C                         NO OVERLAP NEW CONTIG
+C
+C     ITYPE 0 = NO OVERLAP
+C     ISENSE 1 = SAME SENSE AS ARCHIVE
+      IF(IFAIL(1).NE.0) GO TO 800
+      ITYPE(1)=0
+      ISENSE(1)=1
+      IDOUT(1)=MAXGEL
+      WRITE(IDEV,1015)
+1015  FORMAT(' New gel reading does not overlap: start a new contig')
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,X,ITYPE,ISENSE,SEQC2(1,1),ITOTPC(1),
+     +IDIM2,IDOUT(1),LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      IEMPTY=0
+      IDIM1=IDIM1+1
+      IF((IDIM1+19+IDIM2).GT.MAXSEQ)THEN
+        WRITE(IDEV,1021)MAXSEQ
+1021    FORMAT(' Database maximum consensus length (',I6,') exceeded')
+        GO TO 900
+      END IF
+      CALL ADDTIT(SEQ1(IDIM1),NAMPRO,NGELS,IDIM1)
+      CALL MSTLKL(SEQ2,IDIM2)
+      CALL SQCOPY(SEQ2,SEQ1(IDIM1),IDIM2)
+      IDIM1=IDIM1+IDIM2-1
+      JNGEL = JNGEL + 1
+      GO TO 1
+C
+C
+C
+200   CONTINUE
+C
+C
+C   OVERLAP SO TRY TO ALIGN THE SEQUENCES
+C
+C
+      DO 100 I=1,IMATC
+        N=IDBSIZ-NCONTS
+        DO 99 J=N,IDBSIZ-1
+          IF(LNBR(J).NE.LLINO(I))GO TO 99
+          LINCON(I)=J
+          GO TO 100
+99      CONTINUE
+        WRITE(IDEV,10077)LLINO(I)
+10077   FORMAT(' Contig line for contig',I6,' not found!')
+        GO TO 800
+100   CONTINUE
+C
+      IF((IMATC.EQ.2).AND.(ANSJOK.EQ.0))GO TO 400
+C
+C
+C                           SINGLE OVERLAP
+C
+C
+C
+      WRITE(IDEV,1014)LLINO(1)
+1014  FORMAT(' New gel reading overlaps contig',I6)
+      IF(ITOTPG(1).GT.0) CALL CCTA(SEQG2(1,1),IDIM22(1))
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,1),NAMARC,JOINT(1),ITYPE(1),ISENSE(1),
+     +SEQC2(1,1),
+     +ITOTPC(1),IDIM22(1),IDOUT(1),LLINO(1),LINCON(1),
+     +IFAIL(1),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,MAXGEL)
+      IF(IFAIL(1).NE.0)GO TO 800
+      CALL DELCON(SEQ1,ILEFTS(1),ILC(1),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(1))
+      IGELC=LNBR(LINCON(1))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      JNGEL = JNGEL + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C                        DOUBLE OVERLAP
+C
+C
+400   CONTINUE
+      WRITE(IDEV,1013)LLINO
+1013  FORMAT(' Overlap between contigs',I6,' and',I6)
+      IF(ANSJOK.NE.0)GO TO 800
+      IF(LLINO(1).EQ.LLINO(2))THEN
+        WRITE(IDEV,*)' Trying to form loop in contig',LLINO(1)
+        WRITE(IDEV,*)' Gel not entered'
+        GO TO 800
+      END IF
+      CALL AJOIN3(RELPG,IDBSIZ,LINCON,ITYPE,ISENSE,JOINT,
+     +IDIM22,KLASS,IOVER,IDEV,PL,PR)
+      IF(IOVER.GT.MAXOVR)THEN
+        WRITE(IDEV,*)' Overlap too large: entry only'
+        IFAIL(2)=1
+        GO TO 600
+       END IF
+C   WHICH CONTIG IS LEFTMOST?
+      LMOST=1
+      RMOST=2
+      IF(PL(1).GT.PL(2))THEN
+        LMOST=2
+        RMOST=1
+      END IF
+C   SAVE LENGTH OF RMOST CONTIG FOR DELETION STEP LATER
+      ILCR=ILC(RMOST)
+      IF(ITOTPG(LMOST).GT.0) CALL CCTA(SEQG2(1,LMOST),IDIM22(LMOST))
+      WRITE(IDEV,1012)LLINO(LMOST)
+1012  FORMAT(' Entering the new gel reading into contig',I6)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,LMOST),NAMARC,JOINT(LMOST),ITYPE(LMOST),
+     +ISENSE(LMOST),SEQC2(1,LMOST),ITOTPC(LMOST),
+     +IDIM22(LMOST),IDOUT(LMOST),LLINO(LMOST),LINCON(LMOST),
+     +IFAIL(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,
+     +MAXGEL)
+      IF(IFAIL(LMOST).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      IF(ITYPE(LMOST).EQ.1)LLINO(LMOST)=NGELS
+      IF(ILEFTS(LMOST).LT.ILEFTS(RMOST))THEN
+        ILEFTS(RMOST)=ILEFTS(RMOST)-ILC(LMOST)-20
+      END IF
+      CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      LREG=1
+      RREG=RELPG(LINCON(LMOST))
+      IGELC=LNBR(LINCON(LMOST))
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      ILC(LMOST)=RREG
+      ILEFTS(LMOST)=IDIM1-RREG+1
+      DO 500 I=1,2
+        IF(ISENSE(I).EQ.-1)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON(I),
+     +    LLINO(I),SEQ2,IDBSIZ,IDEV,IDEV1,IDEV2,MAXGEL)
+          CALL SQREV(SEQ1(ILEFTS(I)),ILC(I))
+          CALL SQCOM(SEQ1(ILEFTS(I)),ILC(I))
+          KT=IDIM1
+          CALL ADDTIT(SEQ1((ILEFTS(I)-20)),NAMPRO,LNBR(LINCON(I)),KT)
+        END IF
+500   CONTINUE
+C   NEED TO KNOW POSITION OF OVERLAP RELATIVE TO CONTIG, TO CONSENSUS
+C   WHICH BITS TO SEND TO ALIGNMENT ROUTINES
+C   SET UP FOR ALINE (NOTE RMOST IS EQUIVALENT TO THE GEL READING AND
+C   SO IS SLID ALONG THE LMOST CONTIG. THE SECTION SENT TO ALINE MUST 
+C   BE OF LENGTH < MAXGEL-2*MAX(MAXPC,MAXPG)
+C   IT MUST START AT POSITION 1 IN THE RMOST CONTIG AND EXTEND
+      IPOSC(LMOST)=PL(RMOST)+RELPG(NGELS)-1
+      ILCT = RELPG(LINCON(LMOST)) - RELPG(NGELS) - PL(RMOST) + 2
+      ILC(RMOST)=MIN(ILCT,ILC(RMOST))
+      IPOSC(RMOST)=1
+      IDOUT(LMOST)=MAXGEL
+      IDOUT(RMOST)=MAXGEL
+      IDSAV=MAXSAV
+C  ON INPUT TO ALINE ILC(RMOST) CONTAINS THE OVERLAP LENGTH
+C  ON OUTPUT IT CONTAINS THE LENGTH OF THE ALIGNED SECTION (IE INCLUDING 
+C  PADS)
+      WRITE(IDEV,1009)
+1009  FORMAT(' Trying to align the two contigs')
+      CALL ALINE(SEQ1(ILEFTS(LMOST)),SEQ1(ILEFTS(RMOST)),
+     +SEQC2(1,RMOST),SEQC2(1,LMOST),SAV1,SAV2,SAV3,IDSAV,
+     +ILC(LMOST),ILC(RMOST),IDOUT(LMOST),IPOSC(LMOST),IPOSC(RMOST),
+     +MINSLI,JOINT(LMOST),ITOTPC(LMOST),ITOTPC(RMOST),IFAIL(1),
+     +ITYPE(1),MAXPC,MAXPC,PERMAX,IDEV,SEQ4,MAXGEL,Z)
+C SEQC2(1,LMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE LMOST CONTIG
+C SEQC2(1,RMOST)  NOW CONTAINS THE ALIGNED SECTION OF THE RMOST CONTIG
+C ILC(RMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE RMOST CONTIG
+C IDOUT(LMOST)  IS NOW THE LENGTH OF ALIGNED SECTION OF THE LMOST CONTIG
+C JOINT(LMOST)  IS THE POSITION OF THE JOIN RLETIVE TO THE LMOST CONTIG
+C ITYPE IS TYPE OF OVERLAP (-1 = RIGHT END OR INTERNAL, 1 = LEFT END)
+C  NB SHOULD ALWAYS BE -1
+C  IF THIS HAS BEEN DONE OK WE CAN EDIT THE TWO CONTIGS THEN JOIN
+      IF(IFAIL(1).NE.0)THEN
+        WRITE(IDEV,*)' Failed to align the two overlapping contigs'
+        GO TO 800
+      END IF
+      IF(ITOTPC(LMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(LMOST)
+1017    FORMAT(' Editing contig',I6)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(LMOST),JOINT(LMOST),SEQC2(1,LMOST),
+     +  ITOTPC(LMOST),IDOUT(LMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      JOINT(RMOST)=1
+      IDOUT(RMOST)=ILC(RMOST)
+      IF(ITOTPC(RMOST).GT.0)THEN
+        WRITE(IDEV,1017)LLINO(RMOST)
+        CALL ABEDIN(RELPG,LNGTHG,LNBR,RNBR,
+     +  NGELS,NCONTS,SEQ3,LINCON(RMOST),JOINT(RMOST),SEQC2(1,RMOST),
+     +  ITOTPC(RMOST),IDOUT(RMOST),IDBSIZ,IDEV,IDEV1,IDEV2,
+     +  MAXGEL)
+      END IF
+      ILC(RMOST)=ILCR
+      LTL=LNBR(LINCON(LMOST))
+      LTR=LNBR(LINCON(RMOST))
+      WRITE(IDEV,1018)LNBR(LINCON(LMOST)),LNBR(LINCON(RMOST))
+1018  FORMAT(' Completing the join between contigs',I6,' and',I6)
+      CALL AJOIN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +JOINT(LMOST),LTL,LTR,LINCON(LMOST),LINCON(RMOST),IDEV1)
+      LLINO(1)=LTL
+      IF(ILEFTS(LMOST).GT.ILEFTS(RMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+      END IF
+      IF(ILEFTS(RMOST).GE.ILEFTS(LMOST))THEN
+        CALL DELCON(SEQ1,ILEFTS(RMOST),ILC(RMOST),IDIM1)
+        CALL DELCON(SEQ1,ILEFTS(LMOST),ILC(LMOST),IDIM1)
+      END IF
+      LREG=1
+      RREG=JOINT(LMOST)
+      IGELC=LLINO(1)
+      ANS=1
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+C      CALL FMTDB(SEQ1,IDIM1,1,IDIM1,60,IDEV)
+      JNJOIN = JNJOIN + 1
+      IF(KFAIL.NE.0) GO TO 800
+      GO TO 1
+C
+C
+C             JOINS THAT FAIL INITIAL ALIGNMENT CRITERIA
+C
+C
+600   CONTINUE
+C   COME HERE FOR JOINS THAT FAIL: WE MIGHT BE ABLE TO ENTER THE GEL 
+C   INTO A SINGLE CONTIG
+      IGOOD=0
+      IF(IFAIL(1).EQ.0)IGOOD=1
+      IF(IFAIL(2).EQ.0)IGOOD=2
+      IF(IGOOD.EQ.0)GO TO 800
+      IF(ITOTPG(IGOOD).GT.0) CALL CCTA(SEQG2(1,IGOOD),IDIM22(IGOOD))
+      WRITE(IDEV,1012)LLINO(IGOOD)
+      CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQG2(1,IGOOD),NAMARC,JOINT(IGOOD),ITYPE(IGOOD),
+     +ISENSE(IGOOD),SEQC2(1,IGOOD),ITOTPC(IGOOD),
+     +IDIM22(IGOOD),IDOUT(IGOOD),LLINO(IGOOD),LINCON(IGOOD),
+     +IFAIL(IGOOD),IDBSIZ,IDEV,IDEV1,IDEV2,IDEV3,
+     +MAXGEL)
+      IF(IFAIL(IGOOD).NE.0)GO TO 800
+      JNGEL = JNGEL + 1
+      CALL DELCON(SEQ1,ILEFTS(IGOOD),ILC(IGOOD),IDIM1)
+      ANS=1
+      IGELC=LNBR(LINCON(IGOOD))
+      LREG=1
+      RREG=RELPG(LINCON(IGOOD))
+      CALL ACONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,IGELC,LREG,RREG,TEMP3,
+     +ECHRSZ,MAXGL2,IDEV,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD)
+      IF(IFAIL(1).NE.0)GO TO 900
+      WRITE(IDEV,1020)LLINO,LLINO(IGOOD)
+1020  FORMAT(' Could not join contigs',I4,' and',I4,' but the gel',
+     +' has been entered into contig',I4,/,
+     +' If required do the join manually.')
+C
+C
+C                      ALL FAILURES
+C
+C
+C
+800   CONTINUE
+      WRITE(IDEV,10888)
+10888 FORMAT(' Failed gel name written to error file')
+      WRITE(IDEV8,1002)NAMARC
+      GO TO 1
+900   CONTINUE
+      WRITE(KBOUT,*)'Batch finished'
+      WRITE(KBOUT,*)JGEL,' sequences processed'
+      WRITE(KBOUT,*)JNGEL,' sequences entered into database'
+      WRITE(KBOUT,*)JNJOIN,' joins made'
+      RETURN
+      END
+      SUBROUTINE DBAUTP(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,MAXDB,IDEV,
+     +IDEV1,IDEV2,IDEV3,MAXGEL,IMATC,IEMPTY,IOPT)
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER SEQ2(MAXGEL),SEQC2(MAXGEL)
+      CHARACTER NAMARC*(*)
+C  deals with entering all readings into contig 1 (IOPT=2)
+C  or all readings into new contigs (IOPT=3)
+      IF(IOPT.EQ.2) THEN
+        IF(IMATC.EQ.0) THEN
+          ITYPE=0
+          ISENSE=1
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+          IEMPTY=0
+          IMATC = 1
+        ELSE
+          ITYPE= - 1
+          ISENSE=1
+          JOINT = 1
+          LLINO = 1
+          LINCON = IDBSIZ - 1
+          ITOTPC = 0
+          IDOUT=MAXGEL
+          CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+          IF(IFAIL.NE.0) RETURN
+        END IF
+      ELSE IF(IOPT.EQ.3) THEN
+        ITYPE=0
+        ISENSE=1
+        IDOUT=MAXGEL
+        CALL AENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    SEQ2,NAMARC,JOINT,ITYPE,ISENSE,SEQC2,ITOTPC,
+     +    IDIM2,IDOUT,LLINO,LINCON,IFAIL,IDBSIZ,IDEV,
+     +    IDEV1,IDEV2,IDEV3,MAXGEL)
+        IF(IFAIL.NE.0) RETURN
+      END IF
+      END
+      SUBROUTINE DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +NAMPRO,IDEV,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,MAXDB,IDM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMPRO*(*),NAMARC*10,GEL(MAXGEL)
+      CHARACTER V2,V1
+      INTEGER IWORD,DELDB
+      PARAMETER (IWORD=4)
+      PARAMETER (MAXPRM = 21)
+      CHARACTER PERR(2)*(MAXPRM)
+      EXTERNAL DELDB
+      IERR=1
+1     CONTINUE
+      L = 1
+      V1='1'
+      CALL GTSTR('Make version',V1,V2,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(V2,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) V2 = V1
+      MN = NGELS + NCONTS + 1
+      MX = MAXDB
+      NEWSIZ = IDBSIZ
+      CALL GETINT(MN,MX,NEWSIZ,
+     +'New database size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL BUSY(KBOUT)
+      NEWSIZ = IVAL
+C     WHERE SHOULD CHARS BE CHANGED ?
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//V2
+      CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0) THEN
+C problem opening file
+        IF(IOK.EQ.2) THEN
+          CALL ERROM(KBOUT,'File already exists')
+          PERR(1) = 'Retype version number'
+          PERR(2) = 'Replace database'
+          IDO = 1
+          CALL RADION('Select action',PERR,2,IDO,
+     +    IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+          IF(IDO.LT.1) RETURN
+          IF(IDO.EQ.1) GO TO 1
+          IF(IDO.EQ.2) THEN
+            IOK =  DELDB(NAMPRO,V2,IDEV,MAXGEL)
+            IF(IOK.EQ.0) THEN
+              LLL = INDEX(NAMPRO,'.') + 1
+              NAMPRO(LLL:)='RL'//V2
+              CALL OPENRS(IDEV,NAMPRO,IOK,4,3)
+              IF(IOK.EQ.0) GO TO 2
+            ELSE
+              CALL ERROM(KBOUT,'File delete failed')
+            END IF
+          END IF
+        END IF
+        GO TO 100
+      END IF
+ 2    CONTINUE
+      CALL WRITER(IDEV,NEWSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      DO 10 I=1,NGELS
+        CALL WRITER(IDEV,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      M=NEWSIZ-NCONTS
+      N=IDBSIZ-NCONTS
+      DO 15 I=N,IDBSIZ-1
+        CALL WRITER(IDEV,M,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        M=M+1
+15    CONTINUE
+      CLOSE(UNIT=IDEV)
+C  DO SEQUENCES
+      NAMPRO(LLL:)='SQ'//V2
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEV,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+C
+C  MAKE SURE FILE IS NEWSIZ LINES LONG!
+      CALL WRITEW(IDEV,NEWSIZ,GEL,MAXGEL)
+      DO 20 I=1,NGELS
+        CALL READW(IDEVW,I,GEL,MAXGEL)
+        CALL WRITEW(IDEV,I,GEL,MAXGEL)
+20    CONTINUE
+      CLOSE(UNIT=IDEV)
+C   DO ARCHIVE NAMES
+      NAMPRO(LLL:)='AR'//V2
+       CALL OPENRS(IDEV,NAMPRO,IOK,3,3)
+      IF(IOK.NE.0)GO TO 100
+      DO 30 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        CALL WRITEN(IDEV,I,NAMARC)
+30    CONTINUE
+C   MAKE SURE FILE IS 1000 LINES LONG
+C  ********  PROBLEMS HERE WRITING INTEGER TO NAMES FILE
+      WRITE(IDEV,REC=MAXDB)NEWSIZ,MAXGEL,IDM
+      CLOSE(UNIT=IDEV)
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error opening new database, copy aborted')
+      RETURN
+      END
+      INTEGER FUNCTION DELDB(NAMPRO,VERSN,IDEV,MAXGEL)
+      CHARACTER NAMPRO*(*),VERSN
+      INTEGER DELF
+      EXTERNAL DELF
+C
+C delete an xdap database
+C
+C assume relationships are 4 words, names are 3 and seqs are maxgel
+C all recls in BYTES
+C
+      DELDB = 1
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='RL'//VERSN
+      IF(DELF(NAMPRO,IDEV,16,4).NE.0) RETURN
+      NAMPRO(LLL:)='AR'//VERSN
+      IF(DELF(NAMPRO,IDEV,12,4).NE.0) RETURN
+      NAMPRO(LLL:)='SQ'//VERSN
+      IF(DELF(NAMPRO,IDEV,MAXGEL,4).NE.0) RETURN
+      DELDB = 0
+      END
+      SUBROUTINE DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,MAXGEL)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER NAME*10,NEWNAM*10
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(8)*(MAXPRM)
+      INTEGER L,M,N
+      CHARACTER GEL2(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' Warning:',
+     +' make a copy first, and check logical consistency after use')
+10    CONTINUE
+C
+C   SELECT OPTION
+C      CALL BELL(1,KBOUT)
+C      CALL DBMENU(4,NOPT,IHELPS,IHELPE,FILEH,IDEVH,
+C     +KBIN,KBOUT)
+      PROMPT(1) = 'Line change'
+      PROMPT(2) = 'Edit gel reading'
+      PROMPT(3) = 'Delete contig line'
+      PROMPT(4) = 'Shift'
+      PROMPT(5) = 'Move gel reading'
+      PROMPT(6) = 'Rename gel reading'
+      PROMPT(7) = 'Break a contig'
+      PROMPT(8) = 'Remove a gel reading'
+      NOPT = 1
+      CALL RADION('Alter relationships', PROMPT, 8, NOPT, IHELPS,
+     +  IHELPE, FILEH, IDEVH, KBIN, KBOUT)
+      IF(NOPT.LT.1)RETURN
+      IF(NOPT.EQ.-1)RETURN
+      IF(NOPT.EQ.1)THEN
+C       LINE CHANGE
+        MN = 0
+        MX = IDBSIZ
+        LNO = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of line to change',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.EQ.0) GO TO 10
+        LNO = IVAL
+        IF(LNO.EQ.IDBSIZ)GO TO 19
+        WRITE(KBOUT,*)'Current line'
+        WRITE(KBOUT,1001)RELPG(LNO),LNGTHG(LNO),LNBR(LNO),RNBR(LNO)
+1001    FORMAT(' ',4I6)
+        MN = 0
+        MX = 99999
+        X  = RELPG(LNO)
+        CALL GETINT(MN,MX,X,
+     +  'Relative position',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        X = IVAL
+        MN = -MAXGEL
+        MX = 99999
+        L  = LNGTHG(LNO)
+        CALL GETINT(MN,MX,L,
+     +  'Length',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        L = IVAL
+        MN = 0
+        MX = IDBSIZ
+        M  = LNBR(LNO)
+        CALL GETINT(MN,MX,M,
+     +  'Left neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        M = IVAL
+        MN = 0
+        MX = IDBSIZ
+        N  = RNBR(LNO)
+        CALL GETINT(MN,MX,N,
+     +  'Right neighbour',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        N = IVAL
+        CALL WRITER(IDEVR,LNO,X,L,M,N)
+        RELPG(LNO)=X
+        LNGTHG(LNO)=L
+        LNBR(LNO)=M
+        RNBR(LNO)=N
+        GO TO 10
+19      CONTINUE
+C       NCONTS NGELS LINES
+        MN = 0
+        MX = IDBSIZ
+        LL = NGELS
+        CALL GETINT(MN,MX,LL,
+     +  'Number of gel readings',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LL = IVAL
+        MN = 0
+        MX = IDBSIZ
+        MM  = NCONTS
+        CALL GETINT(MN,MX,MM,
+     +  'Number of contigs',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        MM = IVAL
+        CALL WRITER(IDEVR,IDBSIZ,LL,MM,LL,MM)
+        NGELS=LL
+        NCONTS=MM
+        GO TO 10
+      END IF
+C
+C
+      IF(NOPT.EQ.4)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of first gel reading to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNO)
+        IF(I.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'Problem with this gel reading. Check logical consistency'
+          WRITE(KBOUT,*)'of database. Shift not done'
+          GO TO 10
+        END IF
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +    'No contig line for this contig. Check logical'
+          WRITE(KBOUT,*)'consistency of database. Shift not done'
+          GO TO 10
+        END IF
+        MN = 1 - RELPG(LNO)
+        MX  = RELPG(NCONTO) - RELPG(LNO)
+        X = MN
+        CALL GETINT(MN,MX,X,
+     +  'Distance to shift',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        X = IVAL
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,LNO,NCONTO,X)
+        WRITE(KBOUT,*)'Shift complete'
+        GO TO 10
+      END IF
+C
+      IF(NOPT.EQ.2)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to edit',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        IDIM=ABS(LNGTHG(LNO))
+C       ALLOW EDITING TO BE RELATIVE TO CONTIG POSITION
+        ISTART=RELPG(LNO)
+        CALL READW(IDEVW,LNO,GEL,MAXGEL)
+        CALL EDITGL(GEL,GEL2,IDIM,KBIN,KBOUT,MAXGEL,
+     +  IHELP1,IHELP2,FILEH,IDEVH,ISTART)
+        CALL WRITEW(IDEVW,LNO,GEL,MAXGEL)
+        LNGTHG(LNO)=SIGN(IDIM,LNGTHG(LNO))
+        CALL WRITER(IDEVR,LNO,RELPG(LNO),LNGTHG(LNO),
+     +  LNBR(LNO),RNBR(LNO))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.3)THEN
+        MN=IDBSIZ-NCONTS
+        MX = IDBSIZ - 1
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of contig line to delete',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        LNO = IVAL
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  LNO,IDEVR)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+        MN = 0
+        MX = NGELS
+        LNO  = 0
+        CALL GETINT(MN,MX,LNO,
+     +  'Number of gel reading to rename',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        LNO = IVAL
+        CALL READN(IDEVN,LNO,NAME)
+        L = 10
+        CALL GTSTR('name for gel reading',
+     +  NAME,NEWNAM,L,KBOUT,KBIN,INFLAG)
+        IF(L.GT.0)CALL WRITEN(IDEVN,LNO,NEWNAM)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.5)THEN
+        MN = 0
+        MX = NGELS
+        IFROM  = 0
+        CALL GETINT(MN,MX,IFROM,
+     +  'Number of gel reading to move',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        IFROM = IVAL
+        MN = 0
+        MX = NGELS
+        ITO  = 0
+        CALL GETINT(MN,MX,ITO,
+     +  'New number for gel reading',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        IF(IVAL.LT.1) GO TO 10
+        ITO = IVAL
+        CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  GEL,IFROM,ITO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        CALL BREAKC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,IHELP1,IHELP2,FILEH,IDEVH,IOK)
+        GO TO 10        
+      END IF
+      IF(NOPT.EQ.8) THEN
+        CALL REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +  IHELPS,IHELPE,FILEH,IDEVH,IOK)
+        GO TO 10
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE REMGD(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +KBIN,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER HELPF*(*),GEL(MAXGEL)
+      INTEGER REMME,GCLIN,CHAINL
+      EXTERNAL GCLIN,CHAINL
+C assumes db is logical consistent
+      WRITE(KBOUT,*)'Remove reading from database'
+      REMME = NGELS
+      CALL GETINT(1,NGELS,REMME,
+     +'Number of reading to remove',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      REMME = IVAL
+      I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,REMME)
+      ICONT = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,I)
+      IF(ICONT.EQ.0) THEN
+        WRITE(KBOUT,*)'No contig line for this reading'
+        IOK = 1
+        RETURN
+      END IF
+      CALL REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+      END
+      SUBROUTINE REMGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,ICONT,KBOUT,GEL,MAXGEL,IDEVR,IDEVW,IDEVN)
+C Routine to remove a reading from a database
+C Cases: 1 left end
+C        2 right end
+C        3 internal and dispensible
+C        4 internal and indispensible
+C if 1 change contig lnbr, contig length, lnbr of rnbr of remme, relpgs
+C if 2 change contig rnbr, contig length, rnbr of lnbr of remme
+C if 3 change contig length, lnbr of rnbr of remme rnbr of lnbr of remme
+C if 4 need to break contig, then as for 1
+C if 1 and 2 then also remove contig line
+C for all cases move gel ngels to remme (if remme/=ngels)
+C and update line idbsiz
+C
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME,CLEN
+      LOGICAL LEFTE,RIGHTE,DISPEN
+      CHARACTER GEL(MAXGEL)
+      EXTERNAL CLEN
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+      DISPEN = .FALSE.
+C
+C Left end ?
+C
+      IF(LNBR(REMME).EQ.0) LEFTE = .TRUE.
+C
+C Right end ?
+C
+      IF(RNBR(REMME).EQ.0) RIGHTE = .TRUE.
+C
+C If both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading and contig'
+        IFROM = NGELS
+        NGELS = NGELS - 1
+        CALL REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  ICONT,IDEVR)
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+      ELSE IF(LEFTE) THEN
+        WRITE(KBOUT,*)'Removing reading from left end of contig'
+        LNBR(ICONT) = RNBR(REMME)
+        I = 1 - RELPG(RNBR(REMME))
+        WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+        CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +  IDBSIZ,RNBR(REMME),ICONT,I)
+        I = LNBR(ICONT)
+        LNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE IF(RIGHTE) THEN
+        WRITE(KBOUT,*)'Removing reading from right end of contig'
+        RNBR(ICONT) = LNBR(REMME)
+        I = RNBR(ICONT)
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+        RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,LNBR(ICONT))
+        CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +  LNBR(ICONT),RNBR(ICONT))
+        IFROM = NGELS
+        IF(REMME.NE.IFROM) THEN
+          WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+          CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+        END IF
+        NGELS = NGELS - 1
+        CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      ELSE
+C Is remme indispensible ?
+        NSTART = RELPG(RNBR(REMME))
+        I = REMME
+10      CONTINUE
+        I = LNBR(I)
+        IF(I.NE.0) THEN
+          IF((RELPG(I)+ABS(LNGTHG(I))-1).LT.NSTART) GO TO 10
+          DISPEN = .TRUE.
+        END IF
+        IF(DISPEN) THEN
+          WRITE(KBOUT,*)
+     +    'Removing dispensible reading from middle of contig'
+          I = LNBR(REMME)
+          RNBR(I) = RNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          I = RNBR(REMME)
+          LNBR(I) = LNBR(REMME)
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          RELPG(ICONT) = CLEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,LNBR(ICONT))
+          CALL WRITER(IDEVR,ICONT,RELPG(ICONT),LNGTHG(ICONT),
+     +    LNBR(ICONT),RNBR(ICONT))
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        ELSE
+          WRITE(KBOUT,*)
+     +    'Removing indispensible reading from middle of contig'
+          WRITE(KBOUT,*)'So breaking contig first'
+          IR = REMME
+          IL = LNBR(REMME)
+          ILO = LNBR(ICONT)
+          NCONTO = ICONT
+          NCONTR = IDBSIZ - NCONTS - 1
+          CALL CBREAK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    KBOUT,IDEVR,IDEVW,IDEVN,IR,IL,ILO,NCONTO,NCONTR,IOK)
+          IF(IOK.NE.0) RETURN
+          WRITE(KBOUT,*)'Removing reading from left end of contig'
+          ICONT = IDBSIZ - NCONTS
+          LNBR(ICONT) = RNBR(REMME)
+          I = 1 - RELPG(RNBR(REMME))
+          WRITE(KBOUT,*)'Shifting readings in contig by distance=',I
+          CALL SHIFTC(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDEVR,
+     +    IDBSIZ,RNBR(REMME),ICONT,I)
+          I = LNBR(ICONT)
+          LNBR(I) = 0
+          CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +    LNBR(I),RNBR(I))
+          IFROM = NGELS
+          IF(REMME.NE.IFROM) THEN
+            WRITE(KBOUT,*)'Renumbering reading',IFROM,' to',REMME
+            CALL MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +      NCONTS,IDBSIZ,GEL,IFROM,REMME,IDEVR,IDEVW,IDEVN,
+     +      MAXGEL,KBOUT)
+          END IF
+          NGELS = NGELS - 1
+          CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        END IF
+      END IF
+      END
+      SUBROUTINE REMCNL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +REMME,IDEVR)
+      INTEGER RELPG(IDBSIZ),LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      INTEGER REMME
+C Routine to remove a contig line from a db
+C Loop deals with case of remove top contig
+C Move down all lines from above
+      DO 10 I = REMME,IDBSIZ-NCONTS+1,-1
+        RELPG(I) = RELPG(I-1)
+        LNGTHG(I) = LNGTHG(I-1)
+        LNBR(I) = LNBR(I-1)
+        RNBR(I) = RNBR(I-1)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+10    CONTINUE
+      NCONTS = NCONTS - 1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      END
+      SUBROUTINE MOVGEL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +GEL,FROM,TO,IDEVR,IDEVW,IDEVN,MAXGEL,KBOUT)
+C   Subroutine to move a gel from line from to line to
+C   Extended 22-5-91
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),FROM,TO
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMGEL*10,GEL(MAXGEL)
+      INTEGER GCLIN,CHAINL
+      LOGICAL LEFTE,RIGHTE
+      EXTERNAL GCLIN,CHAINL
+      LEFTE = .FALSE.
+      RIGHTE = .FALSE.
+C
+C left end ?
+C
+      IF(LNBR(FROM).EQ.0) LEFTE = .TRUE.
+C
+C right end ?
+C
+      IF(RNBR(FROM).EQ.0) RIGHTE = .TRUE.
+C
+C if both true remove the contig line, then overwrite the gel
+C     
+      IF(LEFTE.AND.RIGHTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          RNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(LEFTE) THEN
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no left neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+        ELSE
+          LNBR(NCONTO) = TO
+          CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +      LNBR(NCONTO),RNBR(NCONTO))
+        END IF
+      ELSE IF(RIGHTE) THEN
+        I = CHAINL(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,FROM)
+        NCONTO = GCLIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    IDBSIZ,I)
+        IF(NCONTO.EQ.0)THEN
+          WRITE(KBOUT,*)
+     +      'This gel has no right neighbour and does not'
+          WRITE(KBOUT,*)'appear in a contig!'
+        ELSE
+          IF(RNBR(NCONTO).NE.FROM)THEN
+            WRITE(KBOUT,*)
+     +        'This gel has no right neighbour but does not'
+            WRITE(KBOUT,*)'appear in a contig line!'
+          ELSE
+            RNBR(NCONTO) = TO
+            CALL WRITER(IDEVR,NCONTO,RELPG(NCONTO),LNGTHG(NCONTO),
+     +        LNBR(NCONTO),RNBR(NCONTO))
+          END IF
+        END IF
+      END IF
+      RELPG(TO)=RELPG(FROM)
+      LNGTHG(TO)=LNGTHG(FROM)
+      LNBR(TO)=LNBR(FROM)
+      RNBR(TO)=RNBR(FROM)
+      CALL READW(IDEVW,FROM,GEL,MAXGEL)
+      CALL WRITEW(IDEVW,TO,GEL,MAXGEL)
+      CALL READN(IDEVN,FROM,NAMGEL)
+      CALL WRITEN(IDEVN,TO,NAMGEL)
+      CALL WRITER(IDEVR,TO,RELPG(TO),LNGTHG(TO),
+     +LNBR(TO),RNBR(TO))
+C   Do neighbours
+      IF(LNBR(FROM).NE.0) THEN
+        I=LNBR(FROM)
+        RNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      IF(RNBR(FROM).NE.0) THEN
+        I=RNBR(FROM)
+        LNBR(I)=TO
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),
+     +  LNBR(I),RNBR(I))
+      END IF
+      END
+      SUBROUTINE DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +IDBSIS,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,MAXGEL,MAXGLM,
+     +LLINO,
+     +IDM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER GEL(MAXGLM)
+      INTEGER RELPG(IDBSIS)
+      INTEGER LNGTHG(IDBSIS),LNBR(IDBSIS),RNBR(IDBSIS)
+      CHARACTER NAMPRO*(*),COPYNO*4,HELPF*(*)
+      INTEGER IWORD,ANS
+      PARAMETER (IWORD=4)
+C  NOTE THIS IS THE MACHINES WORD LENGTH IE HOW MANY CHARS PER WORD
+      CALL FILLI(RELPG,IDBSIS,0)
+      CALL FILLI(LNGTHG,IDBSIS,0)
+      CALL FILLI(LNBR,IDBSIS,0)
+      CALL FILLI(RNBR,IDBSIS,0)
+      NAMPRO(1:)=' '
+      IERR=1
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Project name',' ',NAMPRO,L,KBOUT,KBIN,INFLAG)
+      IF(L.LT.1) RETURN
+      LL = L
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      L = 1
+      CALL GTSTR('Version','0',COPYNO,L,KBOUT,KBIN,INFLAG)
+      CALL CCASE(COPYNO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(L.LT.1) COPYNO(1:1) = '0'
+C   GET .ARN FIRST AS IT CONTAINS LENGTH OF OTHER FILES
+      NAMPRO(LL+1:LL+1)='.'
+      NAMPRO(LL+2:LL+2)='A'
+      NAMPRO(LL+3:LL+3)='R'
+      NAMPRO(LL+4:LL+4)=COPYNO(1:1)
+       CALL OPENRS(IDEVN,NAMPRO,IOK,3,4)
+      IF(IOK.NE.0)GO TO 100
+C  ******* PROBLEM HERE READING RECORD 1000 AND INTEGER FROM IDEVN!!!
+      READ(IDEVN,REC=IDBSIS,ERR=50)IDBSIZ,MAXGEL,IDM
+      IF(MAXGEL.LT.512)GO TO 50
+      IF(MAXGEL.GT.MAXGLM)GO TO 50
+      IF(MOD(MAXGEL,512).NE.0)GO TO 50
+20    CONTINUE
+      IF((IDM.NE.5).AND.(IDM.NE.26))GO TO 51
+21    CONTINUE
+      NAMPRO(LL+2:LL+2)='R'
+      NAMPRO(LL+3:LL+3)='L'
+       CALL OPENRS(IDEVR,NAMPRO,IOK,4,4)
+      IF(IOK.NE.0)GO TO 100
+      NAMPRO(LL+2:LL+2)='S'
+      NAMPRO(LL+3:LL+3)='Q'
+C  DEFINE RECORD LENGTH IN TERMS OF NUMBER OF CHARS PER WORD (4 ON VAX)
+C  AND MAXGEL SIZE 
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+       CALL OPENRS(IDEVW,NAMPRO,IOK,IREC,4)
+      IF(IOK.NE.0)GO TO 100
+C   READ A LINE FOR LUCK
+      CALL READW(IDEVW,1,GEL,MAXGEL)
+      CALL READR(IDEVR,IDBSIZ,NGELS,NCONTS,IDUM1,IDUM2)
+      WRITE(KBOUT,10011)NGELS,NCONTS,IDBSIZ,MAXGEL
+10011 FORMAT(' Number of gel readings=',I3,' Number of contigs=',I3,/,
+     +' Database size=',I4,' Maximum gel reading length=',I4)
+      LLINO = 0
+      IF(NGELS.LT.1)GO TO 5
+      DO 3 I=1,NGELS
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+3     CONTINUE
+      N=IDBSIZ-NCONTS
+      MXT = 0
+      DO 4 I=N,IDBSIZ-1
+        CALL READR(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        IF(RELPG(I).GT.MXT) THEN
+          MXT = RELPG(I)
+          LLINO = LNBR(I)
+        END IF
+4     CONTINUE
+5     CONTINUE
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error encountered opening database files')
+      NAMPRO(LL+2:LL+2) = COPYNO(1:1)
+      RETURN
+50    CONTINUE
+      WRITE(KBOUT,2000)MAXGLM
+2000  FORMAT(' Error reading maximum gel reading length',/,
+     +' The maximum gel reading length must be a multiple',
+     +' of 512.',/,' E.G. 512, 1024, 1536, 2048, etc up to',I6)
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MN
+      CALL GETINT(MN,MX,MAXIN1,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(MAXGEL,MAXIN1).NE.0)GO TO 50
+C  PROBLEM WRITING TO RECORD IDBSIZ WITH NO SUBROUTINE CALL!!!!!!!!!!
+      WRITE(IDEVN,REC=IDBSIS)IDBSIZ,MAXGEL
+      GO TO 20
+51    CONTINUE
+      WRITE(KBOUT,2003)
+2003  FORMAT(' Error reading protein or dna label')
+      CALL YESNO(ANS,'DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) RETURN
+      IF(ANS.EQ.0)IDM = 5
+      IF(ANS.EQ.1)IDM = 26
+C  PROBLEM WRITING TO RECORD IDBSIZ WITH NO SUBROUTINE CALL!!!!!!!!!!
+      WRITE(IDEVN,REC=IDBSIS)IDBSIZ,MAXGEL,IDM
+      GO TO 21
+      END
+      SUBROUTINE DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,
+     +IDEV1,IDEV2,IDEV3,IERR,IHELPS,IHELPE,IDEVH,FILEH,MAXGEL,
+     +MAXGLM,IDM)
+      CHARACTER GEL(MAXGLM),FILEH*(*)
+      CHARACTER NAMPRO*(*)
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+C      WRITE(KBOUT,1005)
+1005  FORMAT(
+     +'  Define:',/,
+     +' 1) the maximum database size ',
+     +'(= number of gel readings + number of contigs)',/,
+     +' 2) the maximum gel reading length',/,
+     +' 3) whether the database is for DNA or protein sequences')
+      IERR=1
+3     CONTINUE
+      MN = 0
+      CALL GTSTR('New project name',' ',NAMPRO,MN,KBOUT,KBIN,INFLAG)
+      IF(MN.LT.1) RETURN
+      LL = MIN(12,MN)
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+        GO TO 3
+      END IF
+      MN = 10
+      MX = IDBSIS
+      IDBSIZ = 50
+      CALL GETINT(MN,MX,IDBSIZ,
+     +'Database size',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IDBSIZ = IVAL
+5     CONTINUE
+      MN = 512
+      MX = MAXGLM
+      MAXIN1 = MIN(512,MAXGEL)
+      CALL GETINT(MN,MX,MAXIN1,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(IVAL,MAXIN1).NE.0) THEN
+        MAXGEL = 512 + (IVAL/512)*512
+        WRITE(KBOUT,*)'Maximum set to',MAXGEL
+      END IF
+      CALL YESNO(IDM,'Database is for DNA',
+     +IHELPS,IHELPE,FILEH,IDEVH,KBIN,KBOUT)
+      IF(IDM.LT.0) RETURN
+      IF(IDM.EQ.0)IDM = 5
+      IF(IDM.EQ.1)IDM = 26
+      NAMPRO(LL+1:LL+1)='.'
+      NAMPRO(LL+2:LL+2)='R'
+      NAMPRO(LL+3:LL+3)='L'
+      NAMPRO(LL+4:LL+4)='0'
+      CALL OPENRS(IDEV1,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Writing new database files')
+      J = 0
+      K = 0
+      N = 0
+      M = 0
+      DO 10 I=1,IDBSIZ
+        CALL WRITER(IDEV1,I,J,K,M,N)
+10    CONTINUE      
+      NAMPRO(LL+2:LL+2)='S'
+      NAMPRO(LL+3:LL+3)='Q'
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      CALL OPENRS(IDEV2,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL FILLC(GEL,MAXGEL,' ')
+      CALL WRITEW(IDEV2,IDBSIZ,GEL,MAXGEL)
+      NAMPRO(LL+2:LL+2)='A'
+      NAMPRO(LL+3:LL+3)='R'
+      CALL OPENRS(IDEV3,NAMPRO,IOK,3,3)
+      IF(IOK.NE.0)GO TO 100
+C  NOTE HERE IS A WRITE TO NAMES FILE NOT USING A SUBROUTINE CALL
+      WRITE(IDEV3,REC=IDBSIS)IDBSIZ,MAXGEL,IDM
+      WRITE(KBOUT,1003)NAMPRO(1:LL),IDBSIZ
+1003  FORMAT(' Database ',A,' version 0, size',I5,
+     +' successfully started')
+      WRITE(KBOUT,1006)NAMPRO(1:LL)
+1006  FORMAT(
+     +' Note the 3 database files  are named ',A,'.AR0, .SQ0, .RL0')
+      IERR=0
+      NAMPRO(LL+2:LL+2) = '0'
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error when trying to open database,',
+     +' database not started')
+      RETURN
+      END
+      SUBROUTINE ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,IDEVG,
+     +LINLEN,PERCD,NAMARC,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM,NCONTC)
+      CHARACTER FILEH*(*)
+      INTEGER HELPS1,HELPS2,HELPE1,HELPE2
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER X,Y,ANS
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),NAMARC*(*)
+      CHARACTER NAMARK*10
+      CHARACTER GEL2(MAXGEL)
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(
+C     +' Enter a new gel reading into the database.',//,
+C     +' Note that before a gel reading is entered into the',/,
+C     +' database it must be compared  with the database to ',/,
+C     +' search for overlaps using a function such as auto assemble',/,
+C     +' The replies to all the questions that follow are given',/,
+C     +' by these searches',//)
+      IF((IDBSIZ-(NGELS+NCONTS)).GT.2)GO TO 5
+      WRITE(KBOUT,1999)IDBSIZ
+1999  FORMAT(' Current database size=',I5,' extend with copy')
+      RETURN
+5     CONTINUE
+C      WRITE(KBOUT,1018)
+C1018  FORMAT(
+C     +' We need the name of the file containing the gel reading')
+      NAMARC = ' '
+      CALL OPENF1(IDEVG,NAMARC,0,IOK,KBIN,KBOUT,
+     +'File name of gel reading to enter',
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IF(NGELS.GT.0) THEN
+        DO 10 J=1,NGELS
+          CALL READN(IDEVN,J,NAMARK)
+          IF(NAMARC(1:10).NE.NAMARK)GO TO 10
+          WRITE(KBOUT,1013)J
+1013      FORMAT( ' Gel reading already in database (number,',I6,
+     +    '), entry aborted')
+          RETURN
+10      CONTINUE
+      END IF
+      IDIM=MAXGEL
+      CALL ARRFIM(IDEVG,GEL,IDIM,KBOUT)
+      CLOSE(UNIT=IDEVG)
+      NGELS=NGELS+1
+      LNGTHG(NGELS)=IDIM
+      NAMARK=NAMARC(1:10)
+      CALL WRITEN(IDEVN,NGELS,NAMARK)
+      WRITE(KBOUT,1003)NGELS
+1003  FORMAT( ' This gel reading has been given the number ',I6)
+      ANS=0
+      IF(NGELS.GT.1)THEN
+C        WRITE(KBOUT,1017)
+C1017    FORMAT(
+C     +' We need to know if the gel reading overlaps data already in',
+C     +' the database')
+        CALL YESNO(ANS,'New gel overlaps a contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+        IF(ANS.LT.0) THEN
+          NGELS = NGELS - 1
+          RETURN
+        END IF
+        IF(ANS.EQ.0)GO TO 100
+      END IF
+C
+C   DOES NOT OVERLAP SO IT STARTS A CONTIG OF ITS OWN
+      LNBR(NGELS)=0
+      RNBR(NGELS)=0
+      RELPG(NGELS)=1
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      NCONTS=NCONTS+1
+      N=IDBSIZ-NCONTS
+      LNBR(N)=NGELS
+      RNBR(N)=NGELS
+      RELPG(N)=(IDIM)
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RETURN
+C
+100   CONTINUE
+C      WRITE(KBOUT,1019)
+C1019  FORMAT(
+C     +' We need to know if the new gel reading overlaps in the',/,
+C     +' normal or complementary sense')
+C   COMPLEMENT GEL IF REQUIRED
+      CALL YESNO(ANS,'Gel overlaps in normal sense',
+     +HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      IF(ANS.EQ.1) THEN
+        CALL SQREV(GEL,IDIM)
+        CALL SQCOM(GEL,IDIM)
+        LNGTHG(NGELS)=-1*LNGTHG(NGELS)
+      END IF
+C      WRITE(KBOUT,1020)
+C1020  FORMAT(
+C     +' We need to know which contig the new gel reading overlaps')
+      CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,NCONTC,
+     +IERR,IDBSIZ,KBIN,KBOUT,IDEVN,
+     +HELPS1,HELPE1,FILEH,IDEVH)
+      IF(IERR.NE.0) THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+C      WRITE(KBOUT,1014)
+C1014  FORMAT(
+C     +' We need information about the position of the overlap.',/,
+C     +' First there are two types: those that extend the contig',/,
+C     +' leftwards and those that start internally.')
+        CALL YESNO(ANS,'Left end of gel reading is inside contig',
+     +  HELPS1,HELPE1,FILEH,IDEVH,KBIN,KBOUT)
+      IF(ANS.LT.0)THEN
+        NGELS=NGELS-1
+        RETURN
+      END IF
+      IF(ANS.EQ.1)GO TO 400
+C   RIGHT END OR INTERNAL OVERLAP
+C
+C      WRITE(KBOUT,1015)
+C1015  FORMAT(' Now we need to know the exact position of the overlap')
+160   CONTINUE
+      MN = 1
+      MX = RELPG(LINCON)
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in contig of left end of gel reading',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 160
+      N=NCONTC
+200   CONTINUE
+      IF(RELPG(N).GT.X)GO TO 250
+      IF(RNBR(N).EQ.0)GO TO 350
+      N=RNBR(N)
+      GO TO 200
+250   CONTINUE
+C   GEL LIES BETWEEN N AND LNBR(N)
+C   DISPLAY JOINT
+      NOPT = 1
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+C   CHECK ON RESULT
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 260
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+260   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=LNBR(N)
+      RNBR(NGELS)=N
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(LNBR(N))=NGELS
+      K=LNBR(N)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+      LNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      IF(Y.LE.RELPG(LINCON))RETURN
+      RELPG(LINCON)=Y
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      RETURN
+350   CONTINUE
+C   MUST BE A RIGHT END OVERLAP
+C   DSPLAY JOINT
+      NOPT = 1
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 160
+      IF(NOPT.EQ.3)GO TO 360
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+360   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      LNBR(NGELS)=N
+      RNBR(NGELS)=0
+      RELPG(NGELS)=X
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      RNBR(N)=NGELS
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      RNBR(LINCON)=NGELS
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=X+(IDIM)-1
+      RELPG(LINCON)=MAX(RELPG(LINCON),Y)
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      RETURN
+C
+400   CONTINUE
+C
+C   ADDING TO LEFT END
+C      WRITE(KBOUT,1015)
+410   CONTINUE
+      MN = 1
+      MX = IDIM
+      X = 0
+      CALL GETINT(MN,MX,X,
+     +'Position in new gel reading of left end of contig',
+     +IVAL,KBIN,KBOUT,
+     +HELPS1,HELPE1,FILEH,IDEVH,IOK)
+      IF(IOK.NE.0) THEN
+        NGELS = NGELS - 1
+        RETURN
+      END IF
+      X = IVAL
+      IF(X.EQ.0)GO TO 410
+C   SET FLAG TO SHOW LEFT END OVERLAP
+      NOPT=2
+      CALL BEDIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,GEL2,LINCON,NCONTC,NOPT,X,IDBSIZ,IDEV,KBIN,KBOUT,
+     +IDEVR,IDEVW,IDEVN,LINLEN,PERCD,
+     +HELPS1,HELPE1,HELPS2,HELPE2,FILEH,IDEVH,MAXGEL,IDM)
+      IF(NOPT.EQ.1)GO TO 400
+      IF(NOPT.EQ.3)GO TO 420
+C   COCKUP
+      NGELS=NGELS-1
+      RETURN
+420   CONTINUE
+      CALL WRITEW(IDEVW,NGELS,GEL,MAXGEL)
+      RELPG(NGELS)=1
+      RNBR(NGELS)=NCONTC
+      LNBR(NGELS)=0
+      CALL WRITER(IDEVR,NGELS,RELPG(NGELS),LNGTHG(NGELS),
+     +LNBR(NGELS),RNBR(NGELS))
+      LNBR(NCONTC)=NGELS
+      RELPG(NCONTC)=X
+      CALL WRITER(IDEVR,NCONTC,RELPG(NCONTC),LNGTHG(NCONTC),
+     +LNBR(NCONTC),RNBR(NCONTC))
+      RELPG(LINCON)=RELPG(LINCON)+X-1
+      IDIM=ABS(LNGTHG(NGELS))
+      Y=IDIM
+      IF(Y.GT.RELPG(LINCON))RELPG(LINCON)=Y
+      LNBR(LINCON)=NGELS
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+      N=NCONTC
+440   CONTINUE
+      IF(RNBR(N).EQ.0)RETURN
+      N=RNBR(N)
+      RELPG(N)=RELPG(N)+X-1
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+      GO TO 440
+      END
+      SUBROUTINE PADCON(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +GEL,LINCON,POSN,NC,IDBSIZ,IDEVR,IDEVW,MAXGEL,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),POSN,X
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL)
+      CHARACTER PAD
+      SAVE PAD
+      DATA PAD/'*'/
+C   NOW FIND FIRST CHAR THAT OVERLAPS REGION
+      LLINO=LNBR(LINCON)
+30    CONTINUE
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.GE.POSN)GO TO 40
+C   NOT IN REGION
+      LLINO=RNBR(LLINO)
+      GO TO 30
+40    CONTINUE
+C   NOW GET THIS GEL FROM DISK
+C      READ(IDEVW,REC=LLINO)GEL
+      CALL READW(IDEVW,LLINO,GEL,MAXGEL)
+C   CALC POSN IN THIS GEL TO EDIT
+      X=POSN-RELPG(LLINO)+1
+      K=X
+C   MOVE THE DATA RIGHT
+      M=ABS(LNGTHG(LLINO))
+      LNGTHG(LLINO)=LNGTHG(LLINO)+SIGN(NC,LNGTHG(LLINO))
+C   CHECK FOR OVER END OF ARRAY
+      N=ABS(LNGTHG(LLINO))
+      IF(N.GT.MAXGEL)THEN
+        WRITE(KBOUT,1000)LLINO
+1000    FORMAT(
+     +' Data pushed off end of gel',I4,' during padding')
+        M=M-(N-MAXGEL)
+        N=MAXGEL
+        LNGTHG(LLINO)=SIGN(MAXGEL,LNGTHG(LLINO))
+      END IF
+      J=M-K+1
+      DO 55 I=1,J
+        GEL(N)=GEL(M)
+        N=N-1
+        M=M-1
+55    CONTINUE
+C   PERFORM THE INSERTION
+      DO 60 I=K,MIN(MAXGEL,K+NC-1)
+        GEL(I)=PAD
+60    CONTINUE
+C   WRITE BACK TO DISK
+      CALL WRITEW(IDEVW,LLINO,GEL,MAXGEL)
+C      WRITE(IDEVW,REC=LLINO)GEL
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+C      WRITE(IDEVR,REC=LLINO)
+C     1RELPG(LLINO),LNGTHG(LLINO),LNBR(LLINO),RNBR(LLINO)
+65    CONTINUE
+C   NOW GET NEXT GEL
+      LLINO=RNBR(LLINO)
+C   LAST GEL?
+      IF(LLINO.EQ.0)GO TO 70
+C   DOES IT HAVE DATA IN REGION?
+C   IE DO RELPG  AND RELPG+LNGTHG-1 LIE EITHER SIDE OF POSN?
+      IF(RELPG(LLINO).GT.POSN)GO TO 70
+      X=RELPG(LLINO)+ABS(LNGTHG(LLINO))-1
+      IF(X.LT.POSN)GO TO 65
+C  WITHIN
+      GO TO 40
+70    CONTINUE
+C   INSERTS FINISHED SO NEED TO INCREMENT ALL THOSE GELS TO RIGHT
+      LLINO=LNBR(LINCON)
+75    CONTINUE
+      IF(RELPG(LLINO).GT.POSN)GO TO 80
+76    CONTINUE
+      LLINO=RNBR(LLINO)
+      IF(LLINO.EQ.0)GO TO 90
+      GO TO 75
+80    CONTINUE
+      RELPG(LLINO)=RELPG(LLINO)+NC
+C   WRITE NEW LINE
+      CALL WRITER(IDEVR,LLINO,RELPG(LLINO),LNGTHG(LLINO),
+     +LNBR(LLINO),RNBR(LLINO))
+C      WRITE(IDEVR,REC=LLINO)
+C     1RELPG(LLINO),LNGTHG(LLINO),LNBR(LLINO),RNBR(LLINO)
+      GO TO 76
+90    CONTINUE
+C   NEED TO INCREMENT CONTIG LINE
+      RELPG(LINCON)=RELPG(LINCON)+NC
+      CALL WRITER(IDEVR,LINCON,RELPG(LINCON),LNGTHG(LINCON),
+     +LNBR(LINCON),RNBR(LINCON))
+C      WRITE(IDEVR,REC=LINCON)RELPG(LINCON),LNGTHG(LINCON),LNBR(LINCON),
+C     1RNBR(LINCON)
+      RETURN
+      END
+      SUBROUTINE AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP3,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,SEQ2,SEQ3,SEQ4,SEQ5,SEQC2,SEQG2,MATCH,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,IDEV1,IDEV2,IDEV3,IDEV4,IDEV7,IDEV8,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH,NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+      INTEGER CHRSIZ,ECHRSZ
+      INTEGER RELPG(MAXDB)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      INTEGER JOINT(2),ITOTPC(2),ITOTPG(2),IDIM22(2),IDOUT(2)
+      INTEGER LLINO(2),ITYPE(2),IFAIL(2)
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2)
+      INTEGER ANS,WINDOW
+      INTEGER TEMP3(ECHRSZ,MAXGL2),CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQ3(MAXGLM),SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXGLM),MATCH(MAXGLM),SEQ4(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      CHARACTER NAMARC*(*),NAMPRO*(*),FILE*(*)
+      CHARACTER SEQ5(MAXGLM),HELPF*(*),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP3,IERR,KBOUT)
+      IF(IERR.GT.1) RETURN
+      IFAIL(1) = 0
+      IF(NGELS.LT.1) RETURN
+      MN = LENGTH*2
+      MX = MAXGLM + 1
+      MINMAT = MAX(15,MN)
+      CALL GETINT(MN,MX,MINMAT,
+     +'Minimum initial match',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      MN = 2
+      MX = 5
+      MINSLI = MAX(3,MN)
+      CALL GETINT(MN,MX,MINSLI,
+     +'Minimum alignment block',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINSLI = IVAL
+      MN = 0
+      MX = 25
+      MAXPG = 8
+      CALL GETINT(MN,MX,MAXPG,
+     +'Maximum pads per sequence',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXPG = IVAL
+      MAXPC = IVAL
+        RMN = 0.
+        RMX = 20.
+        PERMAX = 8.
+        CALL GETRL(RMN,RMX,PERMAX,
+     +  'Maximum percent mismatch after alignment',
+     +  VAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PERMAX = VAL
+      MN = MINMAT
+      MX = MAXGEL
+      WINDOW = 100
+      CALL GETINT(MN,MX,WINDOW,
+     +'Probe length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      WINDOW = IVAL
+       IWING = 0
+      IDIM1=0
+      MAXOVR=MAXGEL-3*MAX(MAXPC,MAXPG)
+      ANS=0
+      CALL JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,MAXSEQ,SEQ2,IDBSIZ,IDIM1,ANS,KDUMM,KDUMM,KDUMM,TEMP3,
+     +ECHRSZ,MAXGL2,KBOUT,IDEV2,IFAIL(1),MAXGEL,IDM,PERCD,SEQ5,
+     +CENDS,NENDS,MAXCON,IWING,NBAD)
+      IDIM2=MAXGEL
+C
+C Note I am doing something horrible here to save space:
+C sending cends and nends to jcons to get the extension lengths
+C then swapping them with temp3 by sending temp to ajoin and
+C receiving them as cends and nends, and sending cends nends
+C and receiving them as iladd,iradd
+      CALL AUTOJN(SEQ1,IDIM1,SEQ2,IDIM2,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,IDEV,MATCH,MAXGEL,MAXGLM,SEQ5,GELN,
+     +SAV1,SAV2,SAV3,MAXSAV,TEMP3,TEMP3(MAXCON+1,1),MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,CENDS,NENDS,RELPG,LNBR,IDBSIZ,NCONTS)
+      END
+      SUBROUTINE AUTOJN(SEQ1,IDIMIN,GEL,IDIMGI,ILEFTS,ILC,IPOSC,
+     +IPOSG,ISENSE,LLINO,IMATC,IFCOMP,MINMAT,POSNS,WORDP,WORDN,
+     +CONST,LENGTH,LPOWRC,KBOUT,MATCH,MAXGEL,MAXGLM,GELCOP,GELN,
+     +SAVPS,SAVPG,SAVL,MAXSAV,CENDS,NENDS,MAXCON,
+     +SEQG2,SEQC2,SEQ4,IDOUT,IDIM22,ITOTPG,ITOTPC,JOINT,IFAIL,
+     +ITYPE,MAXPC,MAXPG,PERMAX,MINSLI,SEQG3,SEQC3,KFAIL,
+     +WINDOW,ILADD,IRADD,RELPG,LNBR,IDBSIZ,NCONTS)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(2),ILC(2),IPOSC(2),IPOSG(2),ISENSE(2),LLINO(2)
+      INTEGER POSNS(IDIMIN),GELN(MAXGLM),WORDP(LPOWRC),SAVPS(MAXSAV)
+      INTEGER SAVPG(MAXSAV),SAVL(MAXSAV)
+      INTEGER WORDN(LPOWRC)
+      CHARACTER GELCOP(MAXGLM),MATCH(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON),ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER CONST(LENGTH)
+      CHARACTER SEQ1(IDIMIN),GEL(MAXGLM)
+      CHARACTER SEQG2(MAXGLM,2),SEQC2(MAXGLM,2),SEQ4(MAXGLM)
+      INTEGER IDOUT(2),IDIM22(2),ITOTPG(2),ITOTPC(2),JOINT(2)
+      INTEGER IFAIL(2),ITYPE(2)
+      PARAMETER (MAXC = 10)
+      CHARACTER SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER JLEFTS(MAXC),JLC(MAXC),JPOSC(MAXC),JPOSG(MAXC),MCON(MAXC)
+      INTEGER JSENSE(MAXC),JLLINO(MAXC),WINDOW
+      INTEGER RELPG(IDBSIZ),LNBR(IDBSIZ)
+      IFAIL(1) = 1
+      IFAIL(2) = 1
+      KFAIL = 0
+      IDCEND=MAXCON
+      CALL FNDCON(SEQ1,IDIMIN,CENDS,NENDS,IDCEND,MAXCON,KBOUT)
+C find possible missed joins
+C we have consensus in seq1 in order first contig,second contig etc
+C compare the ends in reverse order, simultaneously shortening the consensus
+      IDIM = IDIMIN
+      DO 200 JCON = IDCEND,2,-1
+        IDIM = CENDS(JCON) - 1
+        CALL ENCO(SEQ1,IDIM,POSNS,CONST,LENGTH)
+        CALL ENCONA(POSNS,IDIM,WORDP,WORDN,LPOWRC,LENGTH)
+        JS = CENDS(JCON) + 20
+        JE = CENDS(JCON+1) - 1
+        IEND = 1
+        IDIMG = MIN(WINDOW,JE-JS+1)
+C check for case where contig is shorter than probe (window)
+C in which case only compare the left hand end
+         IF(JE-JS+1.LE.WINDOW) IEND = 2
+C        WRITE(*,*)JCON,JS,JE,IDIMG
+1       CONTINUE
+        IMATC = 0
+        IF(IEND.EQ.1) THEN
+          CALL SQCOPY(SEQ1(JE-IDIMG+1),GEL,IDIMG)
+        ELSE
+          CALL SQCOPY(SEQ1(JS),GEL,IDIMG)
+        END IF
+        CALL SQCOPY(GEL,GELCOP,IDIMG)
+      ISTRAN=1
+2     CONTINUE
+      CALL ENCO(GEL,IDIMG,GELN,CONST,LENGTH)
+      IDSAV=MAXSAV
+      CALL CFGEL(GELN,IDIMG,POSNS,IDIM,WORDP,WORDN,LENGTH,LPOWRC,
+     +SAVPG,SAVPS,SAVL,
+     +IDSAV,SEQ1,GEL,MINMAT,IFCOMP,KBOUT)
+      IF(IFCOMP.NE.0) THEN
+         CALL ERROM(KBOUT,'Error in CFGEL')
+         RETURN
+      END IF
+      IF(IDSAV.NE.0)THEN
+        CALL ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,CENDS,NENDS,
+     +  IDCEND,MAXCON,JLEFTS,JLC,JPOSC,JPOSG,JSENSE,JLLINO,
+     +  IMATC,ISTRAN,MAXC,KBOUT,MCON)
+      END IF
+      ISTRAN=ISTRAN+1
+      IF(ISTRAN.EQ.2) THEN
+        CALL SQCOPY(GELCOP,GEL,IDIMG)
+        CALL SQREV(GEL,IDIMG)
+        CALL SQCOM(GEL,IDIMG)
+        GO TO 2
+      END IF
+      CALL SQCOPY(GELCOP,GEL,IDIMG)
+      KSENSE = 0
+C      WRITE(KBOUT,*)'Total matches found',IMATC
+      IF(IMATC.NE.0) THEN
+      JMATC = 0
+      DO 100 I = 1,IMATC
+        IF(JSENSE(I).EQ.-1) THEN
+          IF(KSENSE.EQ.0) THEN 
+            CALL SQREV(GEL,IDIMG)
+            CALL SQCOM(GEL,IDIMG)
+            KSENSE = 1
+          END IF
+        END IF
+        JDIM22 = IDIMG
+        JDOUT = MAXGEL
+        IDSAV = MAXSAV
+C        IF(IEND.EQ.1) WRITE(KBOUT,*)'Left end of Contig',NENDS(JCON)
+C        IF(IEND.EQ.2) WRITE(KBOUT,*)'Right end of Contig',NENDS(JCON)
+C        WRITE(KBOUT,*)'Trying to align with contig',JLLINO(I)
+        CALL ALINEJ(SEQ1(JLEFTS(I)),GEL,SEQG3,SEQC3,
+     +  SAVPS,SAVPG,SAVL,IDSAV,JLC(I),JDIM22,JDOUT,
+     +  JPOSC(I),JPOSG(I),MINSLI,JJOINT,JTOTPC,JTOTPG,
+     +  JFAIL,JTYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ4,MAXGEL,PERMS,
+     +  NENDS(JCON),IEND,JLLINO(I),JSENSE(I),ILADD,IRADD,MAXCON,MCON(I),
+     +  JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+100   CONTINUE
+      END IF
+      IEND = IEND + 1
+      IF(IEND.EQ.2) GO TO 1
+C      IDIM = CENDS(JCON)-1
+ 200  CONTINUE
+      END
+      SUBROUTINE ALINEJ(SEQ1,SEQ2,SEQG2,SEQC2,ISAV1,ISAV2,ISAV3,
+     +IDSAV,IDC,IDIM2,IDOUT,IC1,IG1,MINSLI,JOINT,
+     +ITOTPC,ITOTPG,IFAIL,ITYPE,MAXPC,MAXPG,PERMAX,KBOUT,SEQ3,MAXGEL,
+     +PERCM,JCONN,IEND,NCON,JSENSE,ILADD,IRADD,MAXCON,MCON,JCON,
+     +RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDC),SEQ2(IDIM2),SEQG2(IDOUT),SEQC2(IDOUT)
+      CHARACTER SEQ3(MAXGEL)
+      INTEGER ISAV1(IDSAV),ISAV2(IDSAV),ISAV3(IDSAV),WINDOW
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      MINSLT=MINSLI
+C   SAVE SEQ2
+      CALL SQCOPY(SEQ2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      IFAIL=1
+C   FIND MATCHES
+      IPP=IDSAV
+      CALL SLIDER(SEQ1,IDC,SEQ3,IDIM2,IC1,IG1,MAXPG,MAXPC,MINSLT,
+     +ISAV1,ISAV2,ISAV3,IPP)
+      IF(IPP.GT.IDSAV)RETURN
+      IF(IPP.LT.1)RETURN
+      CALL REMOVL(ISAV2,ISAV3,ISAV1,IPP)
+      CALL BUB3AS(ISAV2,ISAV3,ISAV1,IPP)
+C   DO TOPOLOGICAL CHECK
+      CALL TPCHEK(ISAV2,ISAV3,ISAV1,IPP)
+      IFAIL=0
+      CALL LINEUP(SEQ2,SEQ1,SEQG2,SEQC2,IDC,IDIM2,IDOUT,ISAV3,ISAV2,
+     +ISAV1,IPP,ITOTPC,ITOTPG,JOINT,ITYPE,KBOUT,MAXGEL,IFAIL)
+      IF(IFAIL.NE.0)RETURN
+C   IDIM2 IS NOW LENGTH OF ALIGNED GEL
+C      WRITE(KBOUT,1052)ITOTPC,ITOTPG
+C1052  FORMAT(' Padding in contig= ',I4,' and in gel= ',I4)
+      CALL JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,JOINT,
+     +ITYPE,PERCM,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   ARE ALL CHECKS OK?
+      IF(ITOTPC.GT.MAXPC)IFAIL=1
+      IF(ITOTPG.GT.MAXPG)IFAIL=1
+      IF(PERCM.GT.PERMAX)IFAIL=1
+      END
+C     SUBROUTINE DALIGN
+C
+C   COUNTS MISMATCHES AND DISPLAYS OVERLAP.
+      SUBROUTINE JALIGN(SEQC2,SEQG2,SEQ3,MAXGEL,IDOUT,IDIM2,
+     +JOINT,ITYPE,X,KBOUT,IFAIL,PERMAX,JCONN,IEND,NCON,JSENSE,
+     +ILADD,IRADD,MAXCON,MCON,JCON,RELPG,LNBR,IDBSIZ,NCONTS,WINDOW)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQC2(MAXGEL),SEQG2(MAXGEL),SEQ3(MAXGEL)
+      CHARACTER PAD,DASH,STRAND,NAME1*6,NAME2*6
+      INTEGER ILADD(MAXCON),IRADD(MAXCON),RELPG(IDBSIZ),LNBR(IDBSIZ)
+      INTEGER CLINNO,WINDOW
+      EXTERNAL CLINNO
+      SAVE PAD,DASH
+      DATA PAD,DASH/',','-'/
+      IENDG=1
+      IENDC=JOINT
+C   ONLY LOOK AT OVERLAP WHICH IS FROM JOINT FOR LEFT TYPE JOIN
+      IF(ITYPE.EQ.1)THEN
+        IENDG=JOINT
+        IENDC=1
+      END IF
+100   CONTINUE
+C   LENGTH OF OVERLAP?
+      LG=IDIM2-IENDG+1
+      LO=MIN(IDOUT,LG)
+C   SAVE RAW DATA
+      CALL SQCOPY(SEQG2,SEQ3,IDIM2)
+      CALL MSTLKL(SEQ3,IDIM2)
+      X=FLOAT(LO)
+      Y=X
+      K=IENDG+LO-1
+C   POINT TO CONSENSUS
+      J=0
+C   CHECK FOR OVERFLOW
+      IF(K.GT.MAXGEL)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(
+     +' Matching region too long for routine dalign. Alignment aborted')
+        IFAIL=1
+        RETURN
+      END IF
+      DO 200 I=IENDG,K
+        J=J+1
+        IF(SEQC2(J).EQ.SEQ3(I))GO TO 200
+C        IF(SEQ3(I).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.DASH)GO TO 200
+C        IF(SEQC2(J).EQ.PAD)GO TO 200
+        X=X-1.
+200   CONTINUE
+      X=(Y-X)*100./Y
+      IF(X.LT.PERMAX) THEN
+        IC = CLINNO(LNBR,IDBSIZ,NCONTS,JCONN)
+        IF(IC.EQ.0) THEN
+          IFAIL = 99
+          RETURN
+        END IF
+        IF(JSENSE.EQ.-1) THEN
+          STRAND = '-'
+          IF(IEND.EQ.1) THEN
+C
+C  probe is complement of right hand end of contig. Give posns assuming
+C  this contig is complemented.
+C
+            IENDGT = IENDG - IRADD(JCON)
+C
+C Next line gives posns relative to original orientation
+C            IENDGT = RELPG(IC) + IRADD(JCON) - IENDG + 1
+          ELSE
+C
+C  probe is complement of left end of contig. Give posns assuming this
+C  contig is going to be complemented.
+C
+            IENDGT = RELPG(IC) - WINDOW + ILADD(JCON) + IENDG
+          END IF
+        ELSE
+          STRAND = '+'
+          IF(IEND.EQ.2) THEN
+C
+C  probe is left hand end of contig in original sense
+C
+            IENDGT = IENDG - ILADD(JCON)
+          ELSE
+C
+C  probe is right hand end of contig in original sense
+C
+            IENDGT = RELPG(IC) + IRADD(JCON) - WINDOW + IENDG
+          END IF
+        END IF
+        IENDCT = IENDC - ILADD(MCON)
+          WRITE(KBOUT,*)
+     +    ' Possible join between contig ',JCONN,' in the ',
+     +    STRAND,' sense and contig ',NCON
+        WRITE(KBOUT,1000)X
+1000    FORMAT(' Percentage mismatch after alignment = ',F4.1)
+        WRITE(NAME1,1002)JCONN
+        WRITE(NAME2,1002)NCON
+ 1002   FORMAT(I6)
+        CALL FMT4LP(SEQC2(1),SEQG2(IENDG),LO,IENDCT,IENDGT,KBOUT,
+     +  NAME2,NAME1)
+      END IF
+      IFAIL=0
+      END
+      SUBROUTINE ADISM5(IDIM,IDIMG,SAVPS,SAVPG,IDSAV,
+     +CENDS,NENDS,IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,
+     +LLINO,IMATC,ISTRAN,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      INTEGER SAVPS(IDSAV),SAVPG(IDSAV)
+      NEXTC=IDIM+1
+      CALL BUB2AS(SAVPS,SAVPG,IDSAV)
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(1),SAVPG(1),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+      LEND=IDIMG-SAVPG(1)+SAVPS(1)
+      DO 10 I=2,IDSAV
+        IF((SAVPS(I).LT.LEND).AND.(SAVPS(I).LT.NEXTC))GO TO 10
+        IMATC=IMATC+1
+        CALL ADISM6(SAVPS(I),SAVPG(I),CENDS,NENDS,IDCEND,MAXCON,
+     +  ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,NEXTC,MAXC,
+     +  KBOUT,MCON)
+        LEND=IDIMG-SAVPG(I)+SAVPS(I)
+10    CONTINUE
+      IMATC = MIN(IMATC,MAXC)
+      END
+      SUBROUTINE ADISM6(ISAVPS,SAVPG,CENDS,NENDS,
+     +IDCEND,MAXCON,ILEFTS,ILC,IPOSC,IPOSG,ISENSE,LLINO,IMATC,ISTRAN,
+     +NEXTC,MAXC,KBOUT,MCON)
+C   AUTHOR: RODGER STADEN
+      INTEGER ILEFTS(MAXC),ILC(MAXC),IPOSC(MAXC),IPOSG(MAXC)
+      INTEGER ISENSE(MAXC),LLINO(MAXC),MCON(MAXC)
+      INTEGER SAVPS,SAVPG,CENDS(MAXCON)
+      INTEGER NENDS(MAXCON)
+      SAVPS=ISAVPS-19
+      JJ=1
+      DO 5 J=2,IDCEND
+        IF(SAVPS.GT.CENDS(J))GO TO 5
+        JJ=J-1
+        GO TO 6
+5     CONTINUE
+      JJ=IDCEND
+6     CONTINUE
+C new bit need to save contig number for alinej
+      SAVPS=SAVPS-1
+      LCL=SAVPS-CENDS(JJ)
+      LCR=CENDS(JJ+1)-ISAVPS-1
+      NEXTC=CENDS(JJ+1)+20
+      IF(IMATC.LE.MAXC) THEN
+        MCON(IMATC) = JJ
+        ILEFTS(IMATC)=CENDS(JJ)+20
+        ILC(IMATC)=LCL+LCR+1
+        IPOSC(IMATC)=LCL+1
+        IPOSG(IMATC)=SAVPG
+        LLINO(IMATC)=NENDS(JJ)
+        ISENSE(IMATC)=1
+        IF(ISTRAN.EQ.2)ISENSE(IMATC)=-1
+      ELSE
+        CALL ERROM(KBOUT,'Warning: too many overlaps')
+      END IF
+      END
+C      JCONS
+      SUBROUTINE JCONS(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,IDBSIZ,ISTART,ANS,LLINO,LREG,RREG,TEMP,
+     +CHRSIZ,MAXGL2,KBOUT,
+     +IDEVW,IFAIL,MAXGEL,IDM,PERCD,TGEL,ILADD,IRADD,MAXCON,
+     +IWIN,NBAD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),ANS,CHRSIZ,ILADD(MAXCON),IRADD(MAXCON)
+      INTEGER LREG,RREG,X,Y,TEMP(CHRSIZ,MAXGL2)
+      CHARACTER SEQ1(IDIM1)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER GEL(MAXGEL),TGEL(MAXGEL)
+      CHARACTER NAMPRO*(*)
+      CALL BUSY(KBOUT)
+      IF(IWIN.GT.0) PERC = REAL(NBAD)/REAL(IWIN)
+      IEND = 1
+      IFAIL=0
+      N=IDBSIZ-NCONTS
+      NCONS = 0
+      DO 110 I=N,IDBSIZ-1
+        J=LNBR(I)
+        X=1
+        Y=RELPG(I)
+        ISTART=ISTART+1
+        IF((ISTART+19+Y+2*MAXGEL).GT.IDIM1)THEN
+          WRITE(KBOUT,1009)IDIM1
+1009      FORMAT(
+     +    ' Maximum consensus length(',I6,') exceeded',/,
+     +    ' calculation aborted')
+          IFAIL=1
+          RETURN
+        END IF
+        CALL ADDTIT(SEQ1(ISTART),NAMPRO,J,ISTART)
+        NCONS = NCONS + 1
+        IDIN = 0
+        ILADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  SEQ1(ISTART),Y,GEL,X,Y,J,IDBSIZ,TEMP,CHRSIZ,MAXGL2,IDEVW,MAXGEL,
+     +  IDM,PERCD)
+        ISTART=ISTART+Y-1
+        IDIN = 0
+        IRADD(NCONS) = IDIN
+        ISTART = ISTART + IDIN
+110   CONTINUE
+      END
diff --git a/src/staden/dialogues.c b/src/staden/dialogues.c
new file mode 100644
index 0000000..7ce5bed
--- /dev/null
+++ b/src/staden/dialogues.c
@@ -0,0 +1,985 @@
+/*
+    Title:       dialogues
+
+    File:        dialogues.c
+    Purpose:	 Routines to provide dialogue interaction
+    Last update: Wed Feb 04 1991
+
+    Change log:
+        23/01/91 SD  CRAction now operative for yesnoWid
+        04/02/91 SD  getint_x getrl_x getrls_x fixed bug in main while loop
+
+*/
+
+
+/*
+    This module provides a series of routines to enable dialogue
+    interaction with a user. They replace some routines removed from
+    subs89.
+
+    This module is initialised by a call to CreateDialogueShellWidget.
+    This creates the top half of the dialogue shell and a number of
+    ``template'' widgets which perform particular visual dialogue
+    patterns. At run time, when the Fortran calls come in, the
+    appropriate template is filled in and appended to the bottom half
+    of the dialogue shell.
+
+    After each client-called dialogue has been completed, we
+    call dispatchCurrentEvents() to make sure everything X-wise
+    is up to date before we return to the FORTRAN.
+
+    The dialogue policy, coded in `doDialogue' is to leave the current
+    dialogue up until a new one throws it out. The `busy_x' function
+    can be called just like any other to insert the busy dialogue.
+
+
+----
+
+    Error messages shouldn't go to stderr, but to the text output
+    Ensure each switch branch sets the result code.
+    The coding of openf1 is fetid.
+    The guessing of a width for the box in radio_x is horrid. An
+    application resource should be set up and retrieved.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "fort.h"	/* IMPORT: int_f definition */
+#include "dialogues.h"
+#include "userface.h"
+#include "help.h"       /* IMPORT: help */
+#include "main.h"       /* IMPORT: dispatchEventsUntil, finalx */
+#include "FtoC.h"       /* IMPORT: Cstr2Fstr, Fstr2Cstr */
+#include "progSpec.h"   /* IMPORT: CreateProgMenus */
+#include "textOutput.h" /* IMPORT: UpdateTextOutput */
+#include "mystdlib.h" /* IMPORT: strtod, strtol */
+#include "locks.h"
+#include "helpnmenu.h"
+
+#include <sys/types.h>
+#include <stdio.h>    /* IMPORT: fopen, fgets, fseek, fprintf, sprintf */
+#include <stdlib.h>
+/*#include <string.h>*/   /* IMPORT: strlen */
+#include "values.h"   /* IMPORT: MAXLONG */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Dialog.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Toggle.h>
+
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef enum
+{ DialogueOK,
+  DialogueCancel,
+  DialogueYES,
+  DialogueNO,
+  DialogueDefault
+} DialogueResult;
+
+
+/* ---- Global variables ---- */
+
+Dimension dialogue_width;
+Dimension dialogue_height;
+
+/* ---- Static variables ---- */
+
+/* The overall dialogue form */
+static Widget dialogueFormWid;
+
+/* "Current function: ..." label stuff */
+static long currFunc = 0;
+static Widget currFuncWid;
+
+/* "Current file: ..." label stuff */
+static Widget currFileWid;
+
+
+
+
+/* ---- Some internal routines ---- */
+
+
+/* Dialogue management */
+
+static Boolean        dialogueDone = False;
+static DialogueResult dialogueResult;
+static Boolean DialogueDone()
+{   return(dialogueDone);
+}
+
+static Widget currentDialogueWid = NULL;
+
+static DialogueResult doDialogue(Widget dialogueWid)
+{   Widget prevDialogueWid = currentDialogueWid;
+
+    /* Swap out the previous widget, if any, put in the new one */
+    currentDialogueWid = dialogueWid;
+    XawFormDoLayout(dialogueFormWid, False);
+    if (prevDialogueWid != (Widget)NULL) XtUnmanageChild(prevDialogueWid);
+    XtManageChild(currentDialogueWid);
+    XawFormDoLayout(dialogueFormWid, True);
+
+    /* Do the dialogue */
+    dialogueDone = False;
+    dispatchEventsUntil(DialogueDone);
+
+    return(dialogueResult);
+}
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void OKCallback(Widget w,
+		       XtPointer client_data, XtPointer call_data)
+{   dialogueResult = DialogueOK;
+    dialogueDone = True;
+}
+
+
+static void CancelCallback(Widget w,
+			   XtPointer client_data, XtPointer call_data)
+{   dialogueResult = DialogueCancel;
+    dialogueDone = True;
+}
+
+
+static void QuitAppCallback(Widget w,
+			    XtPointer client_data, XtPointer call_data)
+{
+    if (semaphoreFree(activeLock))
+	finalx(0);
+}
+
+
+static void HelpCallback(Widget w,
+			 XtPointer client_data, XtPointer call_data)
+{   help(currFunc);
+}
+
+static void YESCallback(Widget w,
+			XtPointer client_data, XtPointer call_data)
+{   dialogueResult = DialogueYES;
+    dialogueDone = True;
+}
+
+
+static void NOCallback(Widget w,
+		       XtPointer client_data, XtPointer call_data)
+{   dialogueResult = DialogueNO;
+    dialogueDone = True;
+}
+
+
+/* Null dialogue */
+
+static Widget nullDialogueWid;
+
+static void CreateNullDialogue(Widget parentWid, Widget fromVertWid)
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    nullDialogueWid = XtCreateManagedWidget("null", labelWidgetClass,
+					    parentWid,
+					    args, nargs);
+}
+
+
+/* NChoice dialogue */
+
+static Widget nChoiceWid;
+static Widget nChoiceButtonsWid;
+
+static void CreateNChoiceDialogue(Widget parentWid, Widget fromVertWid)
+{   Widget okw, cw;
+    Arg args[10];
+    int nargs;
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    nChoiceWid = XtCreateWidget("nChoice", formWidgetClass, parentWid,
+				args, nargs);
+
+    nargs = 0;
+    nChoiceButtonsWid = XtCreateManagedWidget("buttons", boxWidgetClass,
+					      nChoiceWid, args, nargs);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, nChoiceButtonsWid); nargs++;
+    okw = XtCreateManagedWidget("OK", commandWidgetClass,
+				nChoiceWid, args, nargs);
+    XtAddCallback(okw, XtNcallback, OKCallback,  NULL);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  nChoiceButtonsWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okw);               nargs++;
+    cw = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+			       nChoiceWid, args, nargs);
+    XtAddCallback(cw, XtNcallback, CancelCallback,  NULL);
+}
+
+
+/* getopt dialogue */
+
+static Widget  getoptWid;
+static long    getoptDialogueRes;
+static Boolean getoptExec; /* True=>execute. False=>execute with dialogue */
+
+static void MenuItemCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+/*
+    the number of the function selected comes as `call_data'.
+*/
+{   dialogueDone = True;
+    getoptDialogueRes = (long) call_data;
+}
+
+static void getoptExecCallback(Widget w,
+			       XtPointer client_data, XtPointer call_data)
+{   Arg args[1];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel,
+	     getoptExec ? "Execute with dialogue" : "Execute"); nargs++;
+    XtSetValues(w, args, nargs);
+    getoptExec = !getoptExec;   
+}
+
+static void CreateGetoptDialogue(Widget parentWid, Widget fromVertWid)
+{   Widget lw, ew, mbw;
+    Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNresizable, True); nargs++;
+    getoptWid = XtCreateWidget("getopt", formWidgetClass, parentWid,
+			       args, nargs);
+ 
+    nargs = 0;
+    lw = XtCreateManagedWidget("execlabel", labelWidgetClass, getoptWid,
+			       args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, lw); nargs++;
+    ew = XtCreateManagedWidget("execcommand", commandWidgetClass,
+			       getoptWid, args, nargs);
+    getoptExec = False;
+    XtAddCallback(ew, XtNcallback, getoptExecCallback, NULL);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, lw); nargs++;
+    mbw =  XtCreateManagedWidget("menubar", boxWidgetClass, getoptWid,
+				 args, nargs);
+    CreateProgMenus(mbw, MenuItemCallback, NULL);
+}
+
+
+/* getstring dialogue */
+
+static Widget getstringWid;
+
+static void CreateGetstringDialogue(Widget parentWid, Widget fromVertWid)
+{   Widget valueWid;
+    Arg args[10];
+    int nargs;
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    getstringWid = XtCreateWidget("getstring", dialogWidgetClass, parentWid,
+			       args, nargs);
+    XawDialogAddButton(getstringWid, "OK",
+		       OKCallback,    (XtPointer) getstringWid);
+    XawDialogAddButton(getstringWid, "Cancel",
+		       CancelCallback,(XtPointer) getstringWid);
+
+    /*
+        Prevent the value widget from resizing itself.
+	We cannot do this in the Xstaden default resources because
+	it is overriden when the dialog creates the value widget.
+    */    
+    valueWid = XtNameToWidget(getstringWid, "value");
+    nargs = 0;
+    XtSetArg(args[nargs], XtNresizable, False); nargs++;
+    XtSetValues(valueWid, args, nargs);
+
+}
+
+
+/* yesno dialogue */
+
+static Widget yesnoWid;
+static Widget yesnoLabelWid;
+
+static void CreateYesnoDialogue(Widget parentWid, Widget fromVertWid)
+{   Widget yw, nw, cw;
+    Arg args[10];
+    int nargs;
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    yesnoWid = XtCreateWidget("yesno", formWidgetClass, parentWid,
+			       args, nargs);
+
+
+    nargs = 0;
+    yesnoLabelWid = XtCreateManagedWidget("lab", labelWidgetClass,
+					  yesnoWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, yesnoLabelWid); nargs++;
+    yw = XtCreateManagedWidget("YES", commandWidgetClass,
+			       yesnoWid, args, nargs);
+    XtAddCallback(yw, XtNcallback, YESCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  yesnoLabelWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, yw);            nargs++;
+    nw = XtCreateManagedWidget("NO",  commandWidgetClass, 
+			       yesnoWid, args, nargs);
+    XtAddCallback(nw, XtNcallback, NOCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  yesnoLabelWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, nw);            nargs++;
+    cw = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+			       yesnoWid, args, nargs);
+    XtAddCallback(cw, XtNcallback, CancelCallback,  NULL);
+}
+
+
+
+
+void CRAction(Widget wid, XEvent *event,
+	      String *params, Cardinal *num_params)
+/*
+    Action procedure to be called when CR is hit in a dialogue.
+    For dialogues which contain an "OK" widget, simulate a
+    pressing and callback of it.
+*/
+{
+    if (currentDialogueWid == nChoiceWid ||
+	currentDialogueWid == getstringWid)
+    {   dialogueResult = DialogueOK;
+	dialogueDone   = True;
+    }
+    else if (currentDialogueWid == yesnoWid)
+    {   dialogueResult = DialogueYES;
+        dialogueDone   = True;
+    }
+}
+
+
+
+/* ---- Exported routines ---- */
+
+
+Widget CreateDialogueShell(Widget toplevelWid)
+/*
+    Create the dialogue shell as a child of `toplevelWid'.
+*/
+{   Widget dialogueShellWid, qw, hw;
+    Arg args[10];
+    int nargs;
+    XFontStruct *font;
+    signed int w, h;
+
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+
+
+    /* Add string to function bindings for our application actions */
+    XtAppAddActions(XtWidgetToApplicationContext(toplevelWid),
+		    actionTable, XtNumber(actionTable));
+
+
+    nargs = 0;
+    dialogueShellWid = XtCreatePopupShell("Dialogue", topLevelShellWidgetClass,
+					  toplevelWid,
+					  args, nargs);
+
+    dialogueFormWid = XtCreateManagedWidget("dialogueForm", formWidgetClass,
+					    dialogueShellWid,
+					    NULL, (Cardinal) 0);
+
+    nargs = 0;
+    currFileWid = XtCreateManagedWidget("currFile", labelWidgetClass,
+					dialogueFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, currFileWid); nargs++;
+    currFuncWid = XtCreateManagedWidget("currFunc", labelWidgetClass,
+					dialogueFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, currFuncWid); nargs++;
+    hw = XtCreateManagedWidget("Help",  commandWidgetClass,
+			       dialogueFormWid, args, nargs);
+    XtAddCallback(hw, XtNcallback, HelpCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  currFuncWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, hw);          nargs++;
+    qw = XtCreateManagedWidget("quit",  commandWidgetClass,
+			       dialogueFormWid, args, nargs);
+    XtAddCallback(qw, XtNcallback, QuitAppCallback,  NULL);
+
+
+    /*
+        There are many dialogues which can fit in here.
+        Create them all, but do not put them in yet.
+    */
+    CreateNullDialogue(dialogueFormWid, hw);
+    CreateNChoiceDialogue(dialogueFormWid, hw);
+    CreateGetoptDialogue(dialogueFormWid, hw);
+    CreateGetstringDialogue(dialogueFormWid,  hw);
+    CreateYesnoDialogue(dialogueFormWid,  hw);
+
+
+    /* Put the null dialogue in by hand */
+    XtManageChild(nullDialogueWid);
+    currentDialogueWid = nullDialogueWid;
+
+    /*
+     * Set the size to be 80x14 text characters
+     */
+    XtVaGetValues(currentDialogueWid, XtNfont, &font, NULL);
+    w = 136 - (19 * (font->max_bounds.width));
+    w = 80 * font->max_bounds.width + (w>0?w:0);
+    h = 86 - 7 *(font->max_bounds.ascent + font->max_bounds.descent);
+    h = 15 *(font->max_bounds.ascent + font->max_bounds.descent) + h;
+
+    XtVaSetValues(dialogueShellWid,
+		  XtNwidth, w,
+		  XtNheight, h,
+		  NULL);
+
+    dialogue_width = w;
+    dialogue_height = h;
+
+    return(dialogueShellWid);
+}
+
+/* --- C interface routines used by C, or FORTRAN (from userfacecom.c) --- */
+
+/*
+ * Reads in a string from stdin.
+ * Args:
+ *   prompt: obvious
+ *   defval: default string (if user types in nothing)
+ *   out   : where to store the actual string read.
+ *   outlen: sizeof(out)
+ * Returns:
+ *   -1 : cancel
+ *    0 : ok
+ *    1 : ok, but used default (blank string if no default)
+ */
+int gtstr(char *prompt, char *defval, char *out, size_t outlen) {
+    char label[255];
+    char *dialogueString;
+    int ret;
+
+    /*
+     * Stick the label and default values in the getstring dialogue
+     */
+    strcpy(label,prompt);
+    strcat(label," ?");
+
+    if (defval)
+	XtVaSetValues(getstringWid, XtNlabel, label, XtNvalue, defval, NULL);
+    else 
+	XtVaSetValues(getstringWid, XtNlabel, label, XtNvalue, "", NULL);
+
+    /*
+        Set the insertion point after the text
+    */
+    XawTextSetInsertionPoint(XtNameToWidget(getstringWid, "value"),
+			     (XawTextPosition)MAXLONG);
+    
+    switch (doDialogue(getstringWid)) {
+    case DialogueOK:
+	dialogueString = XawDialogGetValueString(getstringWid);
+	if (strlen(dialogueString) == 0) {
+	    if (defval)
+		strncpy(out, defval, outlen);
+	    else
+		*out = '\0';
+	    ret = 1;
+	} else {
+	    strcpy(out, dialogueString);
+	    ret = 0;
+	}
+	break;
+    case DialogueCancel:
+	*out = '\0';
+	ret = -1;
+	break;
+    default:;
+    }
+    
+    dispatchCurrentEvents();
+    return ret;
+}
+
+/*
+ * Prompts the user with a yes/no question (defaults to yes).
+ * Args:
+ *   prompt: what to ask.
+ * Returns:
+ *    0 = yes
+ *    1 = no
+ *   -1 = cancel
+ */
+int yesno(char *prompt) {
+    char label[255];
+    int ret;
+
+    /*
+        Stick the prompt into the yesno widget
+    */
+    strcpy(label, prompt);
+    strcat(label, " ?");
+    XtVaSetValues(yesnoLabelWid, XtNlabel, label, NULL);
+
+    switch (doDialogue(yesnoWid))
+    {   case DialogueYES:
+            ret = 0;
+	    break;
+	case DialogueNO:
+	    ret = 1;
+	    break;
+	case DialogueCancel:
+	    ret = -1;
+	    break;
+	default:;
+    }
+
+    dispatchCurrentEvents();
+    return ret;
+}
+
+/*
+ * Reads an integer in a given range from stdin.
+ * Args:
+ *   minval: lower end of range (inclusive)
+ *   maxval: upper end of range (inclusive)
+ *   defval: default integer
+ *   prompt: question to hassle user with
+ *   status: returned status on validity of returned value
+ *      0 = ok
+ *     -2 = quit
+ *     -3 = error
+ * Returns:
+ *   minval <= int <= maxval
+ */
+int getint(int minval, int maxval, int defval, char *prompt, int *status) {
+    char label[256], defVal[32];
+    int ret;
+    DialogueResult dr;
+    char *resultString;
+
+    /*
+        Generate the prompt and default value
+    */
+    sprintf(label,  "%s (%d -- %d) ?", prompt, minval, maxval);
+    sprintf(defVal, "%d", defval);
+
+    /*
+        Stick them in the getstring widget
+    */
+    XtVaSetValues(getstringWid, XtNlabel, label, XtNvalue, defVal, NULL);
+
+    /*
+        Set the insertion point after the text
+    */
+    XawTextSetInsertionPoint(XtNameToWidget(getstringWid, "value"),
+			     (XawTextPosition)MAXLONG);
+
+    do {
+	dr = doDialogue(getstringWid);
+        switch (dr)
+	{   case DialogueOK:
+		resultString = XawDialogGetValueString(getstringWid);
+		ret = strtol(resultString,NULL,10);
+		*status = 0;
+		break;
+
+	    case DialogueCancel:
+		*status = -2;
+		break;
+	    default:;
+	 }
+
+    } while(dr == DialogueOK && (ret > maxval || ret < minval));
+
+    dispatchCurrentEvents();
+    return ret;
+}
+
+/*
+ * Reads a float in a given range from stdin.
+ * Args:
+ *   minval: lower end of range (inclusive)
+ *   maxval: upper end of range (inclusive)
+ *   defval: default float
+ *   prompt: question to hassle user with
+ *   status: returned status on validity of returned value
+ *      0 = ok
+ *     -2 = quit
+ *     -3 = error
+ *   precision: how accurate to display the range and default values.
+ * Returns:
+ *   minval <= float <= maxval
+ */
+float getfloat(float minval, float maxval, float defval, char *prompt,
+	      int *status, int precision) {
+    char label[256], defVal[32];
+    float ret;
+    DialogueResult dr;
+    char  *resultString;
+
+    /*
+        Generate the prompt and default value
+    */
+    sprintf(label,  "%s (%.*f -- %.*f) ?", prompt, precision, minval,
+	    precision, maxval);
+    sprintf(defVal, "%.*f", precision, defval);
+
+    /*
+        Stick them in the getstring widget
+    */
+    XtVaSetValues(getstringWid, XtNlabel, label, XtNvalue, defVal, NULL);
+
+    /*
+        Set the insertion point after the text
+    */
+    XawTextSetInsertionPoint(XtNameToWidget(getstringWid, "value"), 
+			     (XawTextPosition)MAXLONG);
+
+    do {
+	dr = doDialogue(getstringWid);
+        switch (dr)
+	{   case DialogueOK:
+		resultString = XawDialogGetValueString(getstringWid);
+		ret = (float)atof(resultString);
+		*status = 0;
+		break;
+
+	    case DialogueCancel:
+		*status = -2;
+		break;
+
+	    default:;
+	 }
+
+    } while(dr == DialogueOK && (ret > maxval || ret < minval));
+
+    dispatchCurrentEvents();
+    return ret;
+}
+
+/*
+ * Displays a list of 'n' toggle-able items. An 'X' is displayed next to any
+ * currently selected items. User types in a number to toggle each item, or
+ * 0 to quit.
+ * Args:
+ *   num    : how many items to toggle
+ *   prompts: list of names for each item
+ *   bools  : location of list of initial boolean states for items (set or
+ *            unset). Also when returning, the final selected boolean states.
+ * Returns:
+ *   -1 = cancel
+ *    0 = ok
+ */
+int checkn(int num, char **prompts, int **bools) {
+    int i, status;
+    Widget *toggles;
+    Boolean *states;
+
+    toggles = (Widget *)malloc(num * sizeof(Widget));
+    states = (Boolean *)malloc(num * sizeof(Boolean));
+
+    XtVaSetValues(nChoiceButtonsWid, XtNorientation, XtEhorizontal, NULL);
+    for (i=0; i<num; i++) {
+	states[i] = (*bools)[i] ? True : False;
+	/*
+	 * Insert the toggles into nChoiceButtonsWid (a box)
+	 */
+	toggles[i] = XtVaCreateManagedWidget("tog", toggleWidgetClass,
+					     nChoiceButtonsWid,
+					     XtNlabel, prompts[i],
+					     XtNstate, states[i], NULL);
+    }
+    XtVaSetValues(nChoiceButtonsWid, XtNorientation, XtEvertical, NULL);
+
+    switch (doDialogue(nChoiceWid))
+    {   case DialogueOK:
+	    for (i=0; i<num; i++) {
+		XtVaGetValues(toggles[i], XtNstate, &states[i], NULL);
+		(*bools)[i] = states[i] ? 1 : 0;
+	    }
+	    status = 0;
+	    break;
+
+	case DialogueCancel:
+	    status = -1;
+	    break;
+
+	default:;
+    }
+
+    /*
+        Destroy the toggles
+    */
+    for (i=0; i<num; i++)
+	XtDestroyWidget(toggles[i]);
+
+    free(toggles);
+    free(states);
+
+    return status;
+}
+
+/*
+ * Displays a box of radio buttons of options and asks for a selection. Default
+ * option is highlighted.
+ * Args:
+ *   title  : short description of menu
+ *   options: list of options to display
+ *   numopts: how many options to display
+ *   def    : default option to chose.
+ * Returns:
+ *   -1 = cancel
+ *   otherwise the option number selected.
+ */
+int radion(char *title, char **options, int numopts, int def) {
+    int i, ret;
+    Dimension dialogueWidth;
+    Widget *toggles;
+    Boolean *states;
+
+    if (def < 1 || def > numopts)
+    {   fprintf(stderr, "Error in call to radion\n");
+	return 1;
+    }
+
+    toggles = (Widget *)malloc(numopts * sizeof(Widget));
+    states = (Boolean *)malloc(numopts * sizeof(Boolean));
+
+    /*
+        Get the width of the surrounding dialogue.
+	We will set the width of the button box to be this, minus
+	a random amount to account for borders, spacing etc.
+    */
+    XtVaGetValues(dialogueFormWid, XtNwidth, &dialogueWidth, NULL);
+
+    /*
+        Put each toggle into nChoiceButtonsWid (a box)
+    */
+    XawFormDoLayout(nChoiceWid, False);
+    for (i=0; i<numopts; i++)
+    {
+
+	/* Before we stick anything in the box, set the box width */
+	XtVaSetValues(nChoiceButtonsWid, XtNwidth, dialogueWidth-20, NULL);
+
+	/* radioData holds the button number (1..NB) */
+	toggles[i] = XtVaCreateManagedWidget("tog", toggleWidgetClass,
+					     nChoiceButtonsWid,
+					     XtNlabel, options[i],
+					     XtNradioData, i+1,
+					     XtNradioGroup, (i==0) ? NULL
+					     : toggles[i-1], NULL);
+    }
+    XawToggleSetCurrent(toggles[0], (XtPointer)def);
+    XawFormDoLayout(nChoiceWid, True);
+
+    switch (doDialogue(nChoiceWid))
+    {   case DialogueOK:
+	    ret = ((long) XawToggleGetCurrent(toggles[0]));
+	    break;
+
+	case DialogueCancel:
+	    ret = -1;
+	    break;
+	
+	default:;
+    }
+
+    for (i=0; i<numopts; i++)
+	{   XtDestroyWidget(toggles[i]);
+    }
+
+    free(toggles);
+    free(states);
+
+    dispatchCurrentEvents();
+    return ret;
+}
+
+/*
+ * Reads in an 'option number'.
+ * Takes into account requesting dialogue on an option (d), menu listing (m),
+ * and help (?) on general or specific items.
+ * Args:
+ *   status: contains information about the 'int' value returned.
+ *     -3 = error
+ *     -2 = quit (!) (always returns 2)
+ *     -1 = general help requested.
+ *      0 = normal
+ *      1 = dialogue requested
+ *      2 = help on specific subject.
+ *      3 = menu option
+ * Returns:
+ *   negative value if a menu asked for (menu 'x' returns '-x')
+ *   postive value for option selected.
+ *   0 for no selection.
+ */
+int getcopt(int *status) {
+    int ret;
+
+    (void) doDialogue(getoptWid);
+
+    if (getoptExec)
+	*status = 0;
+    else
+	*status = 1;
+
+    currFunc = getoptDialogueRes;    
+    ret  = getoptDialogueRes;
+
+    dispatchCurrentEvents();
+    return ret;
+}
+
+void showfu(char *fname) {
+    char label[256];
+
+    (void) strcpy(label, "Current function: ");
+    (void) strcat(label, fname);
+
+    XawFormDoLayout(dialogueFormWid,False);
+    XtVaSetValues(currFuncWid, XtNlabel, label, NULL);
+    XawFormDoLayout(dialogueFormWid,True);
+
+    dispatchCurrentEvents();
+}
+
+void showfi(char *fname) {
+    char label[256];
+
+    (void) strcpy(label, "Current file: ");
+    (void) strcat(label, fname);
+
+    XawFormDoLayout(dialogueFormWid,False);
+    XtVaSetValues(currFileWid, XtNlabel, label, NULL);
+    XawFormDoLayout(dialogueFormWid,True);
+
+    dispatchCurrentEvents();
+}
+
+/*
+ * Print an error message.
+ * Args:
+ *   errmsg: error with no trailing newline.
+ */
+void errom(char *errmsg) {
+    /*
+        Make all current output visible.
+    */
+    UpdateTextOutput();
+
+    /*
+        Add this message to stdout.
+    */
+    fprintf(stdout, "*** %s\n", errmsg);
+
+    /*
+        Make it visible.
+    */
+    UpdateTextOutput();
+
+    dispatchCurrentEvents();
+}
+
+/*
+ * Inform the user that there will be a long pause before the
+ * next dialogue function is called.
+ */
+void busy() {
+    Widget prevDialogueWid = currentDialogueWid;
+
+    /* Swap out the previous widget, put in the null one */
+    currentDialogueWid = nullDialogueWid;
+    XawFormDoLayout(dialogueFormWid, False);
+    XtUnmanageChild(prevDialogueWid);
+    XtManageChild(currentDialogueWid);
+    XawFormDoLayout(dialogueFormWid, True);
+
+    dispatchCurrentEvents();
+}
+
+void menu_x(int_f *OPT_p,
+	    int_f *KOPT_p,
+	    int_f *MOPT_p,
+	    int_f *MAXOPT_p,
+	    int_f *MINMEN_p,
+	    int_f *KBIN_p,
+	    int_f *KBOUT_p,
+	    int_f *IHELPS_p,
+	    int_f *IHELPE_p,
+	    char  *HELPF_p,
+	    int_f *IDEVH_p,
+	    int_fl HELPF_l)
+/*
+    Display the menu dialogue and get the number of a function.
+    OPT       number of function selected
+    KOPT=0 => OK
+    KOPT=1 => dialogue wanted
+
+    At the moment, the menu to display is hardwired via the generic
+    `progSpec.h' and associated `.c' file for each program. This
+    function should be rewritten so that it reads the menu data in
+    at run time, though it would almost certainly be best to do
+    this at initialisation time and then simply allow this function
+    to display the pre-built dialogue with the menu in it.
+*/
+{
+    showfu("menu selection");
+    getopt_x(KBIN_p, KOPT_p, OPT_p);
+    if (helpindex[query_opt()].name)
+	showfu(helpindex[(int)*OPT_p].name);
+}
+
+int bpause() {
+    XBell (GetDisplay(),100);
+
+    /* flush and always succeed */
+    updout_();
+    return 0;
+}
diff --git a/src/staden/dialogues.h b/src/staden/dialogues.h
new file mode 100644
index 0000000..66788d9
--- /dev/null
+++ b/src/staden/dialogues.h
@@ -0,0 +1,39 @@
+/*
+    Title:       dialogues
+
+    File: 	 dialogues.h
+    Purpose:	 Routines to provide dialogue interaction
+    Last update: Wed Jun 27 1990
+*/
+
+
+/*
+    This module provides a series of routines to enable dialogue
+    interaction with a user.
+
+    (They replace some routines removed from subs89).
+*/
+
+
+/*
+    C (internal) - SUN FORTRAN (external) name conversions
+    This table is for the compiler specifically mentioned above, so
+    as well as being hostage to its horrors we can make use of its
+    advantages, such as long identifiers.
+*/
+
+/* ---- Includes ---- */
+
+#include "fort.h"		/* IMPORT: int_f definition */
+#include <X11/Intrinsic.h>	/* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget CreateDialogueShell(Widget toplevelWid);
+/*
+    Create the dialogue shell as a child of `toplevelWid'.
+*/
diff --git a/src/staden/dias89.f b/src/staden/dias89.f
new file mode 100644
index 0000000..db0e303
--- /dev/null
+++ b/src/staden/dias89.f
@@ -0,0 +1,2199 @@
+C   DIASUB SUBROUTINES FOR DIAGON
+C   AUTHOR: RODGER STADEN
+C   16-7-92 Now passing isame to qicks so that main diagonal can be ignored
+C   16-7-92 Now passing isame to cfsq so that main diagonal can be ignored
+C   7-2-90 mhist changed to using reals (rm and rmsq) for sd calc
+C   8-2-90 switched idim1,idim2 in 2 places in mhist, once in qicks
+C  14-2-90 removed call getreg from actout for compatibility with other
+C          programs
+C  12-6-90 Changed all occurrences of lh to lf
+C   9-7-90 removed menu routine
+C  13-11-90 replaced all radio by radion
+C  12-1-91 ALIGNM changed call to alignd to use nmax instead of maxseq
+C          Changed alignd to check for pout exceeding array bounds
+C  25-4-91 Fixed bug in cfsq that allowed cfseq to extend off end of arrays
+C  6-6-91  Added a check for pout over end of array to alignd
+C          added ctonum to pcon to allow for difference case letters
+C  2-3-92 added filnam = ' ' for som ecalls to openf1
+C         ROUTINES IN THIS LIBRARY:
+C         DMENU
+C         DIAPRW
+C         SHOBOX
+C         WRITAL
+C         ACTOUT
+C         DIAPER
+C         DIAPRO
+C         DIABOX
+C         DIAEXP
+C         DIAOBS
+C         DIALIN
+C         ALIGN
+C         FMT2
+C         FRAME
+C         MOVEI
+C         HELPD
+C         FILEDG
+C         QVORH
+C         CFSQ
+C         ENCONB
+C         CFSEQ
+C         NCODE
+C         SETCON
+C         EXTNDM
+C         DSTAT
+C         REDEFD
+C         PRINTD
+C         MXPTHD
+C         SCORED
+C         APBIAS
+C         ACALCD
+C         GAPRM
+C         ALIGND
+      SUBROUTINE SWTCHD(SAME,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER SAME,OPT
+      CHARACTER HELPF*(*)
+      CALL YESNO(OPT,'Show main diagonal',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(OPT.LT.0) RETURN
+      SAME = 0
+      IF(OPT.EQ.1) SAME = 1
+      END
+      SUBROUTINE SWTCHI(MARKI,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER OPT
+      CHARACTER HELPF*(*)
+      CALL YESNO(OPT,'Plot identities in matching spans',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(OPT.LT.0) RETURN
+      MARKI = 1
+      IF(OPT.EQ.1) MARKI = 0
+      END
+C  DIAPRW
+      SUBROUTINE DIAPRW(S1M,S1P,IDIM1P,S2M,S2P,IDIM2P,
+     +M,IDM,LINEB,
+     +LINEC,LINEE,LENGTH,MINS,SEQVC,IDIM1,SEQHC,IDIM2,
+     +KBOUT,IDEV1,IV1,IH1,ISTARH,ISTARV,SAME)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1M(IDIM1P),S1P(IDIM1P)
+      INTEGER S2M(IDIM2P),S2P(IDIM2P)
+      INTEGER LINEB(IDIM2P),LINEC(IDIM2P),M(IDM,IDM),LINEE(IDIM1P)
+      INTEGER TEMPB,TEMPP,SAME
+      CHARACTER SEQVC(IDIM1),SEQHC(IDIM2)
+1     CONTINUE
+C      WRITE(KBOUT,1000)
+C1000  FORMAT(' List matching spans')
+      CALL BUSY(KBOUT)
+      LB=LENGTH/2
+      IDIM1T=IDIM1P-LENGTH
+      IDIM2T=IDIM2P-LENGTH
+C   NEED TO SET LINEB TO INITAL VALUE FOR FIRST LENGTH POSITIONS
+C   AND LINEE TO LEFT EDGE VALUES
+C   FIX LEFT EDGE BY SUMMING ALL THE VALUES FOR A LENGTH LENGTH/2
+C   EITHER SIDE OF THE REAL SEQUENCE EDGE
+      DO 10 I=1,IDIM1T
+        LINEE(I)=0
+        IM1 = I - 1
+      DO 10 J=1,LENGTH
+        K = IM1 + J
+        LINEE(I)=LINEE(I)+M(S1M(K),S2M(J))
+10    CONTINUE
+C   NOW DO TOP EDGE
+      DO 20 I=1,IDIM2T
+C   SET UP POINTERS
+        LINEC(I)=0
+        IM1 = I - 1
+        DO 15 J=1,LENGTH
+          K = IM1 + J
+          LINEC(I)=LINEC(I)+M(S1M(J),S2M(K))
+15      CONTINUE
+20    CONTINUE
+      DO 21 I=1,IDIM2T
+        LINEC(I)=LINEC(I+1)
+21    CONTINUE
+      LINEC(IDIM2T)=0
+C   MAIN LOOPS NOW
+C   LOOP FOR EACH ROW
+C
+      ITV1P=LB+IV1-1-ISTARV+1
+      ITV1M=IV1-LB-1-ISTARV+1
+      ITH1P=LB+IH1-1-ISTARH+1
+      ITH1M=IH1-LB-1-ISTARH+1
+      ITV1MM=ITV1M+ISTARV-1
+      ITH1MM=ITH1M+ISTARH-1
+      DO 200 I=1,IDIM1T
+C  SET LINEB TO LINEC, THEN ZERO LINEC
+        DO 110 J=1,IDIM2T
+          LINEB(J+1)=LINEC(J)
+          LINEC(J)=0
+110     CONTINUE
+C   SET LINEB(1) TO EDGE VALUE AS ITS OFF PAGE
+        LINEB(1)=LINEE(I)
+C
+C   NOW COMPARE THIS CHAR OF SEQ1 WITH WHOLE OF SEQ2
+        TEMPP=S1P(I)
+        TEMPB=S1M(I)
+        DO 150 J=1,IDIM2T
+         LINEC(J)=LINEB(J)+M(TEMPP,S2P(J))-M(TEMPB,S2M(J))
+         IF(LINEC(J).LT.MINS)GO TO 150
+C   ABOVE CUTOFF SO WRITE
+          IF(I.EQ.J)THEN
+            IF(SAME.NE.1)THEN
+       WRITE(IDEV1,1006)I+ITV1MM
+       WRITE(IDEV1,1005)(SEQVC(K),K=MAX(1,I+ITV1M),MIN(IDIM1,I+ITV1P))
+       WRITE(IDEV1,1005)(SEQHC(K),K=MAX(1,J+ITH1M),MIN(IDIM2,J+ITH1P))
+       WRITE(IDEV1,1006)J+ITH1MM
+            END IF
+          ELSE
+       WRITE(IDEV1,1006)I+ITV1MM
+       WRITE(IDEV1,1005)(SEQVC(K),K=MAX(1,I+ITV1M),MIN(IDIM1,I+ITV1P))
+       WRITE(IDEV1,1005)(SEQHC(K),K=MAX(1,J+ITH1M),MIN(IDIM2,J+ITH1P))
+       WRITE(IDEV1,1006)J+ITH1MM
+          END IF
+1006     FORMAT(' ',I7)
+1005     FORMAT(' ',200A1)
+150     CONTINUE
+200   CONTINUE
+      END
+      SUBROUTINE SHOBOX(SEQVC,IDIMV,SEQHC,IDIMH,IWX1,IWX2,IWX3,IWX4,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,KSTARH,KSTARV,IENDH,IENDV)
+C  AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      CHARACTER SEQVC(IDIMV),SEQHC(IDIMH)
+      IF(0.EQ.0) GO TO 671
+1000  FORMAT(' Zoom-in to matrix',/)
+      WRITE(KBOUT,1000)
+1002  FORMAT(' Horizontal sequence')
+601   CONTINUE
+      WRITE(KBOUT,1002)
+      WRITE(KBOUT,10091)
+10091 FORMAT(' MAX SIZE=36,DEFAULT=XHAIR POSITION - SPAN/2 TO',
+     +' XHAIR POSITION + SPAN/2')
+C      CALL FSTLST(IWB3,IWB4,KBIN,KBOUT)
+      IF(IWB3.LT.0)RETURN
+      IF(IWB4.LT.0)RETURN
+      IF((IWB3.EQ.0).AND.(IWB4.EQ.0))GO TO 651
+      IF((IWB4-IWB3).GT.35)GO TO 601
+      IF(IWB3.LT.KSTARH)GO TO 601
+      IF(IWB3.GT.IENDH)GO TO 601
+      IF(IWB4.GT.IENDH)GO TO 601
+      IF(IWB3.GE.IWB4)GO TO 601
+C  NE 0 SO USE
+      IWX3=IWB3
+      IWX4=IWB4
+651   CONTINUE
+      IF((IWX4-IWX3).GT.35)GO TO 601
+      IF(IWX3.LT.1)GO TO 601
+      IF(IWX4.LT.IWX3)GO TO 601
+661   CONTINUE
+      WRITE(KBOUT,1001)
+1001  FORMAT(' Vertical sequence')
+      WRITE(KBOUT,10091)
+C      CALL FSTLST(IWB1,IWB2,KBIN,KBOUT)
+      IF((IWB1.EQ.-99).OR.(IWB2.EQ.-99))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 661
+      END IF
+      IF(IWB1.LT.0)RETURN
+      IF(IWB2.LT.0)RETURN
+      IF((IWB1.EQ.0).AND.(IWB2.EQ.0))GO TO 671
+      IF((IWB2-IWB1).GT.35)GO TO 661
+      IF(IWB1.LT.KSTARV)GO TO 661
+      IF(IWB1.GT.IENDV)GO TO 661
+      IF(IWB2.GT.IENDV)GO TO 661
+      IF(IWB1.GE.IWB2)GO TO 661
+C   NE 0 SO USE
+      IWX1=IWB1
+      IWX2=IWB2
+671   CONTINUE
+C      IF((IWX2-IWX1).GT.35)GO TO 661
+C      IF(IWX1.LT.1)GOTO 661
+C      IF(IWX2.LT.IWX1)GO TO 661
+      IF(IWX1.LT.1) RETURN
+      IF(IWX2.LT.1) RETURN
+      IF(IWX3.LT.1) RETURN
+      IF(IWX4.LT.1) RETURN
+      IF(IWX1.GT.IDIMV) RETURN
+      IF(IWX2.GT.IDIMV) RETURN
+      IF(IWX3.GT.IDIMH) RETURN
+      IF(IWX4.GT.IDIMH) RETURN
+      IF(IWX2-IWX1.GT.35) RETURN
+      IF(IWX4-IWX3.GT.35) RETURN
+      CALL DIABOX(SEQVC(KSTARV),IDIMV,SEQHC(KSTARH),
+     +IDIMH,IWX1,IWX2,IWX3,IWX4,KBOUT)
+      RETURN
+      END
+      SUBROUTINE WRITAL(SEQHC,IDIMH,ISTARH,IENDH,
+     +SEQVC,IDIMV,ISTARV,IENDV,IDEVOT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C  AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),SEQHC(IDIMH),SEQVC(IDIMV)
+C      WRITE(KBOUT,*)' Write out aligned sequences'
+      IW3 = ISTARH
+      IW4 = IENDH
+      CALL GTREG(KBIN,KBOUT,ISTARH,IENDH,IW3,IW4,
+     +'Region of horizontal sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IW1 = ISTARV
+      IW2 = IENDV
+      CALL GTREG(KBIN,KBOUT,ISTARV,IENDV,IW1,IW2,
+     +'Region of vertical sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IDIML=MIN(IW2-IW1,IW4-IW3) + 1
+      CALL FMT2(IDEVOT,SEQVC(IW1-ISTARV+1),SEQHC(IW3-ISTARH+1),
+     +IDIML,IW1,IW3)
+      RETURN
+      END
+      SUBROUTINE ACTOUT(SEQ,IDIM,ISTART,IEND,IDEVOT,FILOUT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER FILOUT*(*),HELPF*(*),SEQ(IDIM)
+C  AUTHOR: RODGER STADEN
+C  FILE SEQUENCE TO DISK
+      FILOUT = ' '
+      CALL OPENF1(IDEVOT,FILOUT,1,IOK,KBIN,KBOUT,
+     +'File name',IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IW1 = ISTART
+      IW2 = IEND
+C      CALL GTREG(KBIN,KBOUT,ISTART,IEND,IW1,IW2,
+C     +'Region of sequence',
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+      IDIML=IW2-IW1+1
+      CALL TITOUT(IDEVOT,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+      CALL FMTDK(IDEVOT,SEQ(IW1-ISTART+1),IDIML)
+      CLOSE(UNIT=IDEVOT)
+      RETURN
+      END
+C
+C    DIAPER
+      SUBROUTINE DIAPER(SEQ1,IDIM1,SEQ2,IDIM2,LINEB,LINEC,IDL,MINS,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER LINEB(IDL),LINEC(IDL)
+      INTEGER TEMPB
+      CALL BUSY(KBOUT)
+C      WRITE(KBOUT,*)'Working'
+      XMAX=IDIM2
+      XMIN=1.
+      YMAX=IDIM1
+      YMIN=1.
+      DO 10 I=1,IDIM2+1
+      LINEB(I)=0
+      LINEC(I)=0
+10    CONTINUE
+C   LOOP FOR EACH ROW
+      DO 200 I=1,IDIM1
+C   ZERO LINE C AND SET LINEB TO LINEC
+      DO 110 J=1,IDIM2
+      LINEB(J+1)=LINEC(J)
+110   LINEC(J)=0
+C   SET LINEB(1) TO ZERO AS ITS OFF PAGE
+      LINEB(1)=0
+C
+C   NOW COMPARE THIS CHAR OF SEQ1 WITH WHOLE OF SEQ2
+      TEMPB=SEQ1(I)
+      DO 150 J=1,IDIM2
+      IF(TEMPB.NE.SEQ2(J))GO TO 150
+C   MATCH SO COUNT LENGTH BY ADDING CURRENT LENGTH OF THIS DIAGONAL
+      LINEC(J)=LINEB(J)+1
+C   IS THIS HIGH ENOUGH?
+      IF(LINEC(J).LT.MINS)GO TO 150
+      X=J
+      Y=I
+      CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     1MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+150   CONTINUE
+200   CONTINUE
+      RETURN
+      END
+C
+C   DIAPRO
+      SUBROUTINE DIAPRO(S1M,S1P,IDIM1P,S2M,S2P,IDIM2P,
+     +M,IDM,LINEB,
+     +LINEC,LINEE,LENGTH,MINS,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,KBOUT,SAME)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1M(IDIM1P),S1P(IDIM1P)
+      INTEGER S2M(IDIM2P),S2P(IDIM2P)
+      INTEGER LINEB(IDIM2P),LINEC(IDIM2P),M(IDM,IDM),LINEE(IDIM1P)
+      INTEGER TEMPB,TEMPP,SAME
+      CALL BUSY(KBOUT)
+C      WRITE(KBOUT,*)'Working'
+      IDIM1=IDIM1P-LENGTH
+      IDIM2=IDIM2P-LENGTH
+      XMAX=IDIM2
+      XMIN=1.
+      YMAX=IDIM1
+      YMIN=1.
+C   NEED TO SET LINEB TO INITAL VALUE FOR FIRST LENGTH POSITIONS
+C   AND LINEE TO LEFT EDGE VALUES
+C   FIX LEFT EDGE BY SUMMING ALL THE VALUES FOR A LENGTH LENGTH/2
+C   EITHER SIDE OF THE REAL SEQUENCE EDGE
+      DO 10 I=1,IDIM1
+        LINEE(I)=0
+        IM1 = I - 1
+        DO 9 J=1,LENGTH
+          K = IM1 + J
+          LINEE(I)=LINEE(I)+M(S1M(K),S2M(J))
+9       CONTINUE
+10    CONTINUE
+C   NOW DO TOP EDGE
+      DO 20 I=1,IDIM2
+C   SET UP POINTERS
+        LINEC(I)=0
+        IM1 = I - 1
+        DO 15 J=1,LENGTH
+          K = IM1 + J
+          LINEC(I)=LINEC(I)+M(S1M(J),S2M(K))
+15      CONTINUE
+20    CONTINUE
+      DO 21 I=1,IDIM2
+        LINEC(I)=LINEC(I+1)
+21    CONTINUE
+      LINEC(IDIM2)=0
+C   MAIN LOOPS NOW
+C   LOOP FOR EACH ROW
+C
+      DO 200 I=1,IDIM1
+C  SET LINEB TO LINEC, THEN ZERO LINEC
+        DO 110 J=1,IDIM2
+          LINEB(J+1)=LINEC(J)
+          LINEC(J)=0
+110     CONTINUE
+C   SET LINEB(1) TO EDGE VALUE AS ITS OFF PAGE
+        LINEB(1)=LINEE(I)
+C
+C   NOW COMPARE THIS CHAR OF SEQ1 WITH WHOLE OF SEQ2
+        TEMPP=S1P(I)
+        TEMPB=S1M(I)
+        DO 150 J=1,IDIM2
+          LINEC(J)=LINEB(J)+M(TEMPP,S2P(J))-M(TEMPB,S2M(J))
+          IF(LINEC(J).LT.MINS)GO TO 150
+C   ABOVE CUTOFF SO PLOT
+          IF(I.EQ.J)THEN
+            IF(SAME.NE.1)THEN
+              X=J
+              Y=I
+              CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     1        MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+            END IF
+          ELSE
+            X=J
+            Y=I
+            CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     1      MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+          END IF
+150     CONTINUE
+200   CONTINUE
+      END
+C   DIAPRI
+      SUBROUTINE DIAPRI(S1M,S1P,IDIM1P,S2M,S2P,IDIM2P,
+     +M,IDM,LINEB,
+     +LINEC,LINEE,LENGTH,MINS,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,KBOUT,SAME)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1M(IDIM1P),S1P(IDIM1P)
+      INTEGER S2M(IDIM2P),S2P(IDIM2P)
+      INTEGER LINEB(IDIM2P),LINEC(IDIM2P),M(IDM,IDM),LINEE(IDIM1P)
+      INTEGER TEMPB,TEMPP,SAME
+C  VERSION TO MARK ALL IDENTITIES
+      CALL BUSY(KBOUT)
+      IDIM1=IDIM1P-LENGTH
+      IDIM2=IDIM2P-LENGTH
+      XMAX=IDIM2
+      XMIN=1.
+      YMAX=IDIM1
+      YMIN=1.
+C   NEED TO SET LINEB TO INITAL VALUE FOR FIRST LENGTH POSITIONS
+C   AND LINEE TO LEFT EDGE VALUES
+C   FIX LEFT EDGE BY SUMMING ALL THE VALUES FOR A LENGTH LENGTH/2
+C   EITHER SIDE OF THE REAL SEQUENCE EDGE
+      DO 10 I=1,IDIM1
+        LINEE(I)=0
+        IM1 = I - 1
+        DO 9 J=1,LENGTH
+          K = IM1 + J
+          LINEE(I)=LINEE(I)+M(S1M(K),S2M(J))
+9       CONTINUE
+10    CONTINUE
+C   NOW DO TOP EDGE
+      DO 20 I=1,IDIM2
+C   SET UP POINTERS
+        LINEC(I)=0
+        IM1 = I - 1
+        DO 15 J=1,LENGTH
+          K = IM1 + J
+          LINEC(I)=LINEC(I)+M(S1M(J),S2M(K))
+15      CONTINUE
+20    CONTINUE
+      DO 21 I=1,IDIM2
+        LINEC(I)=LINEC(I+1)
+21    CONTINUE
+      LINEC(IDIM2)=0
+C   MAIN LOOPS NOW
+C   LOOP FOR EACH ROW
+C
+      DO 200 I=1,IDIM1
+C  SET LINEB TO LINEC, THEN ZERO LINEC
+        DO 110 J=1,IDIM2
+          LINEB(J+1)=LINEC(J)
+          LINEC(J)=0
+110     CONTINUE
+C   SET LINEB(1) TO EDGE VALUE AS ITS OFF PAGE
+        LINEB(1)=LINEE(I)
+C
+C   NOW COMPARE THIS CHAR OF SEQ1 WITH WHOLE OF SEQ2
+        TEMPP=S1P(I)
+        TEMPB=S1M(I)
+        DO 150 J=1,IDIM2
+          LINEC(J)=LINEB(J)+M(TEMPP,S2P(J))-M(TEMPB,S2M(J))
+          IF(LINEC(J).LT.MINS)GO TO 150
+C   ABOVE CUTOFF SO PLOT
+          IF(I.EQ.J)THEN
+            IF(SAME.NE.1)THEN
+              K1 = I - 1
+              K2 = J - 1
+              DO 130 K = 1,LENGTH
+                K1 = K1 + 1
+                K2 = K2 + 1
+                IF(S1M(K1).EQ.S2M(K2))THEN
+                  X = K2
+                  Y = K1
+                  CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     1            MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+                END IF
+130           CONTINUE
+            END IF
+          ELSE
+            K1 = I - 1
+            K2 = J - 1
+            DO 140 K = 1,LENGTH
+              K1 = K1 + 1
+              K2 = K2 + 1
+              IF(S1M(K1).EQ.S2M(K2))THEN
+                X = K2
+                Y = K1
+                CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     1          MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+              END IF
+140         CONTINUE
+          END IF
+150     CONTINUE
+200   CONTINUE
+      END
+C     ALIGN
+C   ROUTINE TO INSERT PADDING AS DASHES INTO DIAGON SEQS
+      SUBROUTINE ALIGN(SEQVV,IDIMV,SEQHH,IDIMH,
+     +MAXSEQ,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,ISTARV,IENDV,ISTARH,IENDH,
+     +FILEH,FILEV)
+      CHARACTER HELPF*(*),FILEH*(*),FILEV*(*)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQVV(MAXSEQ),SEQHH(MAXSEQ)
+      CHARACTER VORH,DASH,IORD
+      PARAMETER (MAXPRM = 6)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      SAVE DASH
+      DATA DASH/'-'/
+C  ONLY ALLOW EDITING TO SEQUENCES THAT CAN BE TOTALLY CONTAINED IN THE 
+C  RAM BUFFER, AND WHICH START AT 1 (WE DONT PLAN TO WRITE OUT TO THE 
+C  DISK BUFFER)
+C  IE ISTARH=1, IDIMH<MAXSEQ
+C  IE ISTARV=1, IDIMV<MAXSEQ
+C
+1     CONTINUE
+      CALL QVORH(VORH,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,
+     +FILEH,FILEV)
+1004  FORMAT(A)
+      IF(VORH.EQ.' ')RETURN
+C  CHECK IF OK
+      IF(VORH.EQ.'H')THEN
+        IF(ISTARH.NE.1)GO TO 100
+        IF(IDIMH.GT.MAXSEQ)GO TO 100
+      END IF
+      IF(VORH.EQ.'V')THEN
+        IF(ISTARV.NE.1)GO TO 100
+        IF(IDIMV.GT.MAXSEQ)GO TO 100
+      END IF
+      IF(VORH.EQ.'H') THEN
+        MN = ISTARH
+        MX = IENDH
+      ELSE IF(VORH.EQ.'V') THEN
+        MN = ISTARV
+        MX = IENDV
+      END IF
+      MPOS = 1
+      CALL GETINT(MN,MX,MPOS,'Position to edit',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IPOS = IVAL
+      MN = 0
+      MX = MIN(IENDH,IENDV)
+      MPOS = 0 
+      CALL GETINT(MN,MX,MPOS,'Number of characters',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.LT.1) RETURN
+      NCHAR = IVAL
+      I = 1
+      PROMPT(1) = 'Insert'
+      PROMPT(2) = 'Delete'
+      CALL RADION('Select edit command',PROMPT,2,I,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.1) RETURN
+      IF(I.EQ.2) IORD = 'D'
+      IF(I.EQ.1) IORD = 'I'
+      IF(IORD.EQ.'D')NCHAR=SIGN(NCHAR,-1)
+C
+C  V
+      IF(VORH.EQ.'V')THEN
+        IF((IDIMV+NCHAR).GT.MAXSEQ)THEN
+          WRITE(KBOUT,1008)MAXSEQ
+1008      FORMAT(' This would make sequence longer than ',I6,
+     +    ' so not done.')
+          GO TO 1
+        END IF
+        CALL MOVEC(SEQVV,MAXSEQ,IDIMV,IPOS,NCHAR)
+        IF(IORD(1:1).EQ.'I')CALL FILLC(SEQVV(IPOS),NCHAR,DASH)
+        IDIMV=IDIMV+NCHAR
+        IENDV = IDIMV
+        WRITE(KBOUT,1010)
+        GO TO 1
+      END IF
+C  H
+      IF(VORH.EQ.'H')THEN
+        IF((IDIMH+NCHAR).GT.MAXSEQ)THEN
+          WRITE(KBOUT,1008)MAXSEQ
+          GO TO 1
+        END IF
+        CALL MOVEC(SEQHH,MAXSEQ,IDIMH,IPOS,NCHAR)
+        IF(IORD(1:1).EQ.'I')CALL FILLC(SEQHH(IPOS),NCHAR,DASH)
+        IDIMH=IDIMH+NCHAR
+        IENDH = IDIMH
+        WRITE(KBOUT,1010)
+      END IF
+1010  FORMAT(' Edit done')
+      GO TO 1
+100   CONTINUE
+C  COME HERE IF DISK BUFFERING SEQUENCE AND ITS TOO LONG
+      WRITE(KBOUT,1011)
+1011  FORMAT(' Sorry but sequences of this length cannot be edited',
+     +' on this machine')
+      GO TO 1
+      END
+C   DIABOX
+C   WRITES BOX OF PERFECT MATCHES
+      SUBROUTINE DIABOX(SEQ1,IDIM1,SEQ2,IDIM2,I11,I12,I21,I22,
+     +KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),LINE(36),SPACE
+      DATA SPACE/' '/
+1000  FORMAT(4X,36A1)
+      WRITE(KBOUT,*)
+      DO 100 I=I12,I11,-1
+      DO 10 J=1,36
+10    LINE(J)=SPACE
+      L=0
+      DO 50 J=I21,I22
+      L=L+1
+50    IF(SEQ1(I).EQ.SEQ2(J))LINE(L)=SEQ1(I)
+      WRITE(KBOUT,1001)SEQ1(I),(LINE(K),K=1,L)
+1001  FORMAT(3X,A1,36A1)
+100   CONTINUE
+      WRITE(KBOUT,1000)(SEQ2(K),K=I21,I22)
+      RETURN
+      END
+C   DIAEXP COPIED ORIGINALLY FROM ANDREW MC LACHLAN
+C   NA=IDIM1
+C   NB=IDIM2
+C   MATS=MATRIX THE SCORE MATRIX
+C   NACID=IDM THE DIMENSION OF THE SCORE MATRIX
+C   IS=LENGTH THE SPAN LENGTH
+C   LIMIT=HIGHEST INDIVIDUAL SCORE IN SCORE MATRIX MATS
+      SUBROUTINE DIAEXP(SEQ1,NA,SEQ2,NB,MATS,NACID,IS1,
+     +LIMIT,POLYA,POLYB,IDPOLY,KBIN,KBOUT,IDEV,CHRSET,NORO,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      REAL POLYG(50),POLYA(IDPOLY),POLYB(IDPOLY)
+      INTEGER JCOMPA(26),JCOMPB(26)
+      REAL FCOMPA(26),FCOMPB(26),PR(10)
+      INTEGER MATS(NACID,NACID)
+      INTEGER NUM(10),OPT
+      CHARACTER SEQ1(NA),SEQ2(NB),CHRSET(NACID),NORO,CHART
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      PARAMETER (SMALL = 1.0E-30)
+      PARAMETER (MAXPRM = 29)
+      CHARACTER PROMPT(3)*(MAXPRM)
+C      DATA SMALL/0.00000000000000000000000000001/
+C   IF HAVE ALREADY DONE CALC FOR THIS SPAN SKIP
+      IF(NORO.EQ.'O')GO TO 533
+C   SET NORO TO OLD IE BEEN DONE BEFORE
+      NORO='O'
+      CALL BUSY(KBOUT)
+C   SET SPAN
+      IS=IS1
+C   SEQ'S ENTER ALREADY TRANSLATED TO AMINOACID POINTERS 1-26
+C   CALC COMPOSITION
+      DO 300 IC=1,NACID
+      JCOMPA(IC)=0
+      JCOMPB(IC)=0
+300   CONTINUE
+      DO 301 JA=1,NA
+        CHART = SEQ1(JA)
+        ITEMP = CTONUM(CHART)
+        JCOMPA(ITEMP) = JCOMPA(ITEMP) + 1
+301   CONTINUE
+      DO 302 JB=1,NB
+        CHART = SEQ2(JB)
+        ITEMP = CTONUM(CHART)
+        JCOMPB(ITEMP) = JCOMPB(ITEMP) + 1
+302   CONTINUE
+      ANA=NA
+      ANB=NB
+      DO 303 IC=1,NACID
+      FCOMPA(IC)=JCOMPA(IC)
+      FCOMPA(IC)=FCOMPA(IC)/(ANA+0.0000001)
+      FCOMPB(IC)=JCOMPB(IC)
+      FCOMPB(IC)=FCOMPB(IC)/(ANB+0.0000001)
+303   CONTINUE
+C      WRITE(KBOUT,1004)JCOMPA
+C      WRITE(KBOUT,1004)JCOMPB
+1004  FORMAT(2((1X,13I4)/))
+C   CALC EXPECTED AV SCORE AND MEAN SQUARE
+      S1=0.0
+      S2=0.0
+      DO 310 L=1,NACID
+      DO 310 M=1,NACID
+      AM=MATS(L,M)
+      S1=S1+FCOMPA(L)*AM*FCOMPB(M)
+      S2=S2+FCOMPA(L)*(AM**2)*FCOMPB(M)
+310   CONTINUE
+C   RMS SCORE PER POSITION
+      IF(S3.GT.0.0)S3=SQRT(S3)
+      S3=SQRT(S2-(S1**2))
+      S4=IS
+      S3=S3*SQRT(S4)
+      S1=S1*IS
+C
+      WRITE(KBOUT,1000)S1
+1000  FORMAT(' Average score=',F12.5)
+      WRITE(KBOUT,1001)S3
+1001  FORMAT(' RMS deviation=',F12.5)
+C
+C   SET UP GENERATING POLYNOMIAL WHICH IS THE PROBABILITY OF SELECTION
+C   FOR EACH OF THE LEVELS OF SCORE IN THE SCORE MATRIX MATS AND SO
+C   DEPENDS ONLY ON THE COMPOSITION OF THE TWO SEQUENCES AND THE
+C   DISTRIBUTION OF SCORES IN THE SCORE MATRIX
+      DO 320 J=1,50
+      POLYG(J)=0.
+320   CONTINUE
+C   SUM THE PROBABILITY OF SELECTION FOR EACH LEVEL OF SCORE
+      DO 322 L=1,NACID
+      DO 321 M=1,NACID
+C   SCORE?
+      J=MATS(L,M)+1
+      POLYG(J)=POLYG(J)+FCOMPA(L)*FCOMPB(M)
+321   CONTINUE
+322   CONTINUE
+C   SET THOSE LESS THAN SMALL TO ZERO
+      DO 325 J=1,50
+      IF(ABS(POLYG(J)).LT.SMALL)POLYG(J)=0.0
+325   CONTINUE
+C
+C  COMPUTE PROBABILITIES
+C   ZERO POLYA,POLYB AND SET POLYA(1) TO 1.
+      DO 400 I=1,IDPOLY
+      POLYA(I)=0.
+      POLYB(I)=0.
+400   CONTINUE
+      POLYA(1)=1.0
+C   SET UP
+      LORDER=LIMIT
+      MORDER=1
+C   LOOP SPAN TIMES TO RAISE THE POLYNOMIAL TO THE POWER SPAN
+      DO 500 JS=1,IS
+      LR=LORDER+1
+      MR=MORDER+1
+      DO 450 JG=1,LR
+      DO 451 JA=1,MR
+C  CAN SUM THE JB TERMS BECAUSE THEY GIVE THE SAME SCORE WHEN COMBINED
+      JB=JA+JG-1
+      POLYB(JB)=POLYB(JB)+POLYA(JA)*POLYG(JG)
+451   CONTINUE
+450   CONTINUE
+      MORDER=MORDER+LORDER
+      MR=MORDER+1
+C   SET VERY SMALL VALUES TO ZERO AND PREPARE FOR NEXT MULTIPLICATION STEP
+      DO 453 JA=1,MR
+      P=POLYB(JA)
+      IF(ABS(P).LT.SMALL)P=0.0
+      POLYA(JA)=P
+      POLYB(JA)=0.
+453   CONTINUE
+500   CONTINUE
+C   CALC CUMULATIVE PROBABILITY
+      MR=MORDER+1
+C   THIS LOOP LEAVES THE CUMULATIVE PROBABILITY FOR SCORE I IN POLYA(I)
+C   IE ALL THE PROBABILITY OF ACHEIVING AT LEAST SCORE I IS STORED IN POLYA(I)
+      DO 510 JB=1,MR
+      JA=MR+1-JB
+      POLYA(JA)=POLYA(JA)+POLYA(JA+1)
+510   CONTINUE
+      MORDER=LIMIT*IS
+      XMN = 0.0000000001
+      MXS = MORDER
+      DO 520 J=1,MORDER
+        IF(POLYA(J).GT.XMN) GO TO 520
+          MXS = J
+          GO TO 530
+520   CONTINUE
+530   CONTINUE
+C
+533   CONTINUE
+      OPT = 1
+590   CONTINUE
+      PROMPT(1) = 'Show probability for a score'
+      PROMPT(2) = 'Show score for a probability'
+      PROMPT(3) = 'List scores and probabilities'
+      CALL RADION('Select probability display mode',PROMPT,3,OPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(OPT.LT.1) RETURN
+      IF(OPT.EQ.2)GO TO 700
+      IF(OPT.EQ.3)GO TO 800
+      IF(OPT.NE.1)GO TO 590
+      MN = 1
+      ISCORE = S1 + (MXS - S1) / 2
+      CALL GETINT(MN,MXS,ISCORE,'Show probability for score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      ISCORE = IVAL
+      WRITE(KBOUT,1008)ISCORE,POLYA(ISCORE+1)
+1008  FORMAT(' Probability of score ',I6,' is ',F12.10)
+      GO TO 590
+700   CONTINUE
+      XMN = 0.0000000001
+      XMX = 1.
+      PROB = 0.00001
+      CALL GETRLS(XMN,XMX,PROB,'Show score for probability',
+     +VAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PROB = VAL
+C   LOOK FOR FIRST SCORE GIVING THIS PROBABILITY
+      DO 710 J=1,MORDER
+      IF(POLYA(J).GT.PROB)GO TO 710
+      WRITE(KBOUT,1012)PROB,J
+1012  FORMAT(' Score for probability ',F12.10,' is ',I5)
+      GO TO 590
+710   CONTINUE
+      GO TO 590
+800   CONTINUE
+      MN = 1
+      MX = 10
+      JSTEP = 5
+      CALL GETINT(MN,MX,JSTEP,'Number of steps between scores',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      JSTEP = IVAL
+      MMAX=MORDER+1
+      LROW=MMAX/(3*JSTEP)
+      DO 541 JROW=1,LROW
+      DO 540 J=1,3
+        NUM(J)=(JROW-1+(J-1)*LROW)*JSTEP
+        PR(J)=POLYA(NUM(J)+1)
+540     CONTINUE
+        WRITE(IDEV,1002)(NUM(J),PR(J),J=1,3)
+541   CONTINUE
+1002  FORMAT(3(2X,I5,1X,E12.5))
+      GO TO 590
+      END
+C
+C  DIAOBS
+      SUBROUTINE DIAOBS(S1M,S1P,IDIM1P,S2M,S2P,
+     +IDIM2P,M,IDM,LINEB,
+     +LINEC,LINEE,LENGTH,NORO,PERC,SCORE,IDSCOR,KBIN,KBOUT,
+     +IDEV,IHELPS,IHELPE,HELPF,IDEVH,SAME)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER S1M(IDIM1P),S1P(IDIM1P)
+      INTEGER S2M(IDIM2P),S2P(IDIM2P)
+      INTEGER LINEB(IDIM2P),LINEC(IDIM2P),M(IDM,IDM),LINEE(IDIM1P)
+      INTEGER SCORE(IDSCOR)
+      REAL PERC(IDSCOR)
+      INTEGER TEMPB,TEMPP,OPT,SAME
+      PARAMETER (MAXPRM = 32)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      CHARACTER NORO
+C   ALREADY DONE CALC FOR THIS LENGTH AND REGION?
+      IF(NORO.EQ.'O')GO TO 533
+C   SET NORO TO OLD IE BEEN DONE BEFORE
+      NORO='O'
+      CALL BUSY(KBOUT)
+      DO 5 I=1,IDSCOR
+        SCORE(I)=0
+5     CONTINUE
+      IDIM1=IDIM1P-LENGTH
+      IDIM2=IDIM2P-LENGTH
+C   NEED TO SET LINEB TO INITAL VALUE FOR FIRST LENGTH POSITIONS
+C   AND LINEE TO LEFT EDGE VALUES
+C   FIX LEFT EDGE BY SUMMING ALL THE VALUES FOR A LENGTH LENGTH/2
+C   EITHER SIDE OF THE REAL SEQUENCE EDGE
+      DO 10 I=1,IDIM1
+        LINEE(I)=0
+        IM1 = I - 1
+        DO 9 J=1,LENGTH
+          K = IM1 + J
+          LINEE(I)=LINEE(I)+M(S1M(K),S2M(J))
+9       CONTINUE
+10    CONTINUE
+C   NOW DO TOP EDGE
+      DO 20 I=1,IDIM2
+C   SET UP POINTERS
+        LINEC(I)=0
+        IM1 = I - 1
+        DO 15 J=1,LENGTH
+          K = IM1 + J
+          LINEC(I)=LINEC(I)+M(S1M(J),S2M(K))
+15      CONTINUE
+20    CONTINUE
+      DO 21 I=1,IDIM2
+        LINEC(I)=LINEC(I+1)
+21    CONTINUE
+      LINEC(IDIM2)=0
+C   MAIN LOOPS NOW
+C   LOOP FOR EACH ROW
+C
+      DO 200 I=1,IDIM1
+C  SET LINEB TO LINEC, THEN ZERO LINEC
+        DO 110 J=1,IDIM2
+          LINEB(J+1)=LINEC(J)
+          LINEC(J)=0
+110     CONTINUE
+C   SET LINEB(1) TO EDGE VALUE AS ITS OFF PAGE
+        LINEB(1)=LINEE(I)
+C
+C   NOW COMPARE THIS CHAR OF SEQ1 WITH WHOLE OF SEQ2
+        TEMPP=S1P(I)
+        TEMPB=S1M(I)
+        DO 150 J=1,IDIM2
+          LINEC(J)=LINEB(J)+M(TEMPP,S2P(J))-M(TEMPB,S2M(J))
+          IF(LINEC(J).NE.0)THEN
+            IF(I.EQ.J)THEN
+              IF(SAME.NE.1)THEN
+                SCORE(LINEC(J)) = SCORE(LINEC(J)) + 1
+              END IF
+            ELSE
+              SCORE(LINEC(J)) = SCORE(LINEC(J)) + 1
+            END IF
+          END IF
+150     CONTINUE
+200   CONTINUE
+C   FIND MAX SCORE
+      IDSCOS=IDSCOR+1
+      DO 300 I=1,IDSCOR
+        J=IDSCOS-I
+        IF(SCORE(J).NE.0)THEN
+C   FOUND A NON ZERO SCORE. IT MUST BE LARGEST
+          MAX=J
+          GO TO 301
+        END IF
+300   CONTINUE
+301   CONTINUE
+      WRITE(KBOUT,1000)MAX
+1000  FORMAT(' Maximum observed score is ',I6)
+C   CALC CUMULATIVE SCORE
+      DO 400 I=1,MAX-1
+        J=MAX-I
+        SCORE(J)=SCORE(J)+SCORE(J+1)
+400   CONTINUE
+      ISQ=IDIM1*IDIM2
+      TOTPER=100./ISQ
+C   CALC PERCENT SCORES
+      DO 410 I=1,MAX
+        PERC(I)=SCORE(I)*TOTPER
+410   CONTINUE
+401   CONTINUE
+533   CONTINUE
+      OPT = 1
+590   CONTINUE
+      PROMPT(1) = 'Show percentage reaching a score'
+      PROMPT(2) = 'Show score for a percentage'
+      PROMPT(3) = 'List scores and percentages'
+      CALL RADION('Select score display mode',PROMPT,3,OPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(OPT.LT.1) RETURN
+      IF(OPT.EQ.2)GO TO 700
+      IF(OPT.EQ.3)GO TO 800
+      IF(OPT.NE.1)GO TO 590
+      MN = 1
+      ISCORE = 0.75 * MAX
+      CALL GETINT(MN,MAX,ISCORE,'Show percentage for score',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      ISCORE = IVAL
+      WRITE(KBOUT,1008)ISCORE,PERC(ISCORE)
+1008  FORMAT(' Percentage of points with score ',I6,' is ',F13.9)
+      GO TO 590
+700   CONTINUE
+      XMN = 0.00001
+      XMX = 1.
+      PROB = 0.001
+      CALL GETRLS(XMN,XMX,PROB,'Show score for percentage',
+     +VAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PROB = VAL
+C   LOOK FOR FIRST SCORE GIVING THIS PROBABILITY
+      DO 710 J=1,MAX+1
+      IF(PERC(J).GT.PROB)GO TO 710
+      WRITE(KBOUT,1012)PROB,J
+1012  FORMAT(' Score for percentage ',F13.9,' is ',I5)
+      GO TO 590
+710   CONTINUE
+      GO TO 590
+800   CONTINUE
+      MN = 1
+      MX = 10
+      JSTEP = 5
+      CALL GETINT(MN,MX,JSTEP,'Number of steps between scores',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      JSTEP = IVAL
+C   LIST
+      DO 420 I=1,MAX,JSTEP
+      IF(PERC(I).NE.100.)WRITE(IDEV,1005)I,SCORE(I),PERC(I)
+1005  FORMAT(1X,I5,1X,I8,1X,E12.5)
+420   CONTINUE
+      GO TO 590
+      END
+      SUBROUTINE DIALIN(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT)
+C   AUTHOR: RODGER STADEN
+      XF=MARGL
+      XT=XF+MARGR
+      YF=MARGB
+      YT=YF+MARGT
+      XMIN=XF
+      XMAX=XT
+      YMIN=YF
+      YMAX=YT
+      CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     1MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      CALL VT100M
+      RETURN
+      END
+C   SUBROUTINE TO WRITE 3 LINES OF SEQUENCES
+C
+      SUBROUTINE FMT2(IDEV,SEQ1,SEQ2,IDIM,ISW,ISX)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(60)
+      INTEGER KL(6)
+C
+C
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+1     CONTINUE
+C       DO 100 I=1,100
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,6
+      KL(J)=ISWW
+      ISWW=ISWW+10
+50    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+C   COMPARE THE TWO SECTIONS OF SEQUENCE
+      IL=IE-IS+1
+      WRITE(IDEV,1001)'V',(KL(K),K=1,1+(IL-1)/10)
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+      WRITE(IDEV,1002)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)(SEQ2(K),K=IS,IE)
+1002  FORMAT( 10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,6
+      KL(J)=ISXX
+      ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)'H',(KL(K),K=1,1+(IL-1)/10)
+1001  FORMAT( '    ',A,' ',6(I5,6X))
+C1001  FORMAT( 6X,6(I5,6X))
+      IF(IE.EQ.IDIM)RETURN
+C100    CONTINUE
+      GO TO 1
+      END
+C   MOVEI
+C   MOVES INTEGER ARRAY CONTENTS LEFT OR RIGHT
+      SUBROUTINE MOVEI(SEQ,IDIMX,IDIM,IPOS,NCHAR1)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIMX)
+      INTEGER TO,FROM
+      NCHAR=ABS(NCHAR1)
+C   LEFT OR RIGHT?
+      IF(NCHAR1.LT.0)GO TO 20
+C   RIGHT
+      FROM=IDIM
+      TO=IDIM+NCHAR
+C   NUMBER TO MOVE?
+      NUM=IDIM-IPOS+1
+      DO 10 I=1,NUM
+      SEQ(TO)=SEQ(FROM)
+      TO=TO-1
+      FROM=FROM-1
+10    CONTINUE
+      RETURN
+20    CONTINUE
+C   LEFT
+      FROM=IPOS+NCHAR
+      TO=IPOS
+C   NUMBER TO MOVE?
+      NUM=IDIM-FROM+1
+      DO 30 I=1,NUM
+      SEQ(TO)=SEQ(FROM)
+      TO=TO+1
+      FROM=FROM+1
+30    CONTINUE
+      RETURN
+      END
+      SUBROUTINE FILEDG(SEQC,IDIMC,I1,I2,
+     +SEQN,IDIMN,SPAN,CHRSET,IDCHR,MSPO2,ISTART,IEND)
+C   AUTHOR: RODGER STADEN
+C   ROUTINE TO SORT OUT EDGES OF ACTIVE SEQUENCE:
+C   WE NOW COPY THE ACTIVE SEQUENCE INTO A TEMPORARY ARRAY
+C   FOR PROCESSING (ACTUALLY ALSO CONVERTING CHARS TO NUMBERS)
+C   AND WE NEED TO EITHER FILL UP THE EDGES WITH NULL CHARS
+C   OR, IF THERE IS REAL DATA AVAILABLE, WITH THAT
+      CHARACTER SEQC(IDIMC),CHRSET(IDCHR)
+      INTEGER SEQN(IDIMN),SPAN,SPANO2
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C   DO LEFT EDGE
+C   FILL WITH VALUES IDCHR FOR SPAN/2
+C   CALC ACTIVE SEQUENCE LENGTH
+      IDIMCA=I2-I1+1
+      SPANO2=SPAN/2
+      DO 1 I=MSPO2-SPANO2,MSPO2
+1     SEQN(I)=IDCHR
+C   NOW PUT ANY IN REAL SEQ CHARACTERS AVAILABLE (IF I1>ISTART)
+      IF(I1.GT.ISTART)THEN
+C   NEED TO INSERT AT MOST SPAN/2 ELEMENTS
+      N=MIN(SPANO2+1,I1-ISTART)
+C   FIRST TO COPY?
+      J1=I1-1-ISTART+1
+      K1=MSPO2
+      DO 2 I=1,N
+      SEQN(K1)=CTONUM(SEQC(J1))
+      J1=J1-1
+      K1=K1-1
+2     CONTINUE
+      END IF
+C   DO RIGHT EDGE
+      DO 3 I=MSPO2+IDIMCA+1,MSPO2+IDIMCA+SPANO2
+3     SEQN(I)=IDCHR
+      IF(I2.LT.IEND)THEN
+      N=MIN(SPANO2,(IEND-I2))
+      J1=I2+1-ISTART+1
+      K1=MSPO2+1+IDIMCA
+      DO 4 I=1,N
+      SEQN(K1)=CTONUM(SEQC(J1))
+      J1=J1+1
+      K1=K1+1
+4     CONTINUE
+      END IF
+      RETURN
+      END
+      SUBROUTINE QVORH(VHOUT,IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,
+     +FILEH,FILEV)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*),VHOUT,FILEH*(*),FILEV*(*)
+      PARAMETER (MAXPRM = 19)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      PROMPT(1) = 'Horizontal sequence'
+      PROMPT(2) = 'Vertical sequence'
+      WRITE(KBOUT,1001)FILEH
+1001  FORMAT(' Horizontal sequence is ',A)
+      WRITE(KBOUT,1002)FILEV
+1002  FORMAT(' Vertical sequence is ',A)
+      VHOUT = ' '
+      I = 1
+      CALL RADION('Select sequence',PROMPT,2,I,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.1) RETURN
+      IF(I.EQ.2) VHOUT = 'V'
+      IF(I.EQ.1) VHOUT = 'H'
+      END
+      SUBROUTINE CFSQ(SEQ1,IDIM1,SEQ2,IDIM2,POSN,WORDP,IDE,IDCHAR,
+     +CONSTS,LCONST,LENGTH,MINMAT,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,KBOUT,ISAME)
+C  25-4-91 Fixed bug that allowed cfseq to extend off end of arrays
+C          by changing call to enconc from idim1 to idim1-minmat+1
+C
+C  NOTE !!!!!!!  this "fix" does NOT work well for short sequences:
+C  enconc only process 1 to idim-length+1 so if idim = idim1-minmat+1
+C  <= length-1 nothing gets hashed !!!!!!!!!! FIX IT sometime.
+C  should it be idim1-length+1 that gets sent to enconc ??
+C
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      CALL BUSY(KBOUT)
+      CALL SETCN(CONSTS,LENGTH,IDCHAR,LCONST)
+      CALL ENCONC(SEQ1,IDIM1-MINMAT+1,POSN,WORDP,IDE,IDCHAR,
+     +CONSTS,LENGTH,LCONST)
+      CALL CFSEQ(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,LCONST,
+     +LENGTH,IDCHAR,MINMAT,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,ISAME)
+      END
+      SUBROUTINE CFSEQ(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,
+     +LCONST,
+     +LENGTH,IDCHAR,MINMAT,
+     +MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX,ISAME)
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER NCODEA,EXTNDM
+      EXTERNAL NCODEA,EXTNDM
+      LEX = MINMAT - LENGTH
+      XMAX=IDIM2
+      XMIN=1.
+      YMAX=IDIM1
+      YMIN=1.
+      DO 20 I = 1,IDIM2-MINMAT+1
+        J = NCODEA(SEQ2(I),LENGTH,CONSTS,IDCHAR,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            IF (((ISAME.EQ.1).AND.(I.NE.J1)).OR.(ISAME.EQ.0)) THEN
+              IMATCH = 0
+              IF(MINMAT.GT.LENGTH) IMATCH = EXTNDM(SEQ1(J1+LENGTH),
+     +        SEQ2(I+LENGTH),LEX)
+              IF(IMATCH.EQ.0)THEN
+                X=I
+                Y=J1
+                CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     +          MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+              END IF
+            END IF
+10        CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              IF (((ISAME.EQ.1).AND.(I.NE.J1)).OR.(ISAME.EQ.0)) THEN
+                IMATCH = 0
+                IF(MINMAT.GT.LENGTH) IMATCH = EXTNDM(SEQ1(J1+LENGTH),
+     +          SEQ2(I+LENGTH),LEX)
+                IF(IMATCH.EQ.0)THEN
+                  X=I
+                  Y=J1
+                  CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     +            MARGXL,MARGXR,MARGYL,MARGYR,ISXMAX,ISYMAX)
+                END IF
+              END IF
+              GO TO 10
+            END IF
+          END IF        
+        END IF
+20    CONTINUE
+      END
+      INTEGER FUNCTION EXTNDM(SEQ1,SEQ2,LENGTH)
+      INTEGER SEQ1(LENGTH),SEQ2(LENGTH)
+      EXTNDM = 1
+      DO 10 I = 1,LENGTH
+        IF(SEQ1(I).NE.SEQ2(I))RETURN
+10    CONTINUE
+      EXTNDM = 0
+      END
+      SUBROUTINE DSTAT(FILEH,ISH1,ISH2,FILEV,ISV1,ISV2,
+     +LENGTH,MINPRO,MINPER,KBOUT,ISAME,IMARK)
+      CHARACTER FILEH*(*),FILEV*(*)
+      WRITE(KBOUT,*)'Horizontal sequence'
+      WRITE(KBOUT,1023)FILEH
+      WRITE(KBOUT,10010)ISH1,ISH2
+      WRITE(KBOUT,*)'Vertical sequence'
+      WRITE(KBOUT,1023)FILEV
+1023  FORMAT(' ',A)
+      WRITE(KBOUT,10010)ISV1,ISV2
+10010 FORMAT(' Positions',/,' ',I6,' TO ',I6)
+      WRITE(KBOUT,10011)LENGTH
+10011 FORMAT(' Span length=',I6)
+      WRITE(KBOUT,10014)
+10014 FORMAT(' Scores')
+      WRITE(KBOUT,10012)MINPRO
+10012 FORMAT(' Proportional=',I6)
+      WRITE(KBOUT,10013)MINPER
+10013 FORMAT(' Identities=',I6)
+      IF(IMARK.EQ.1)THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(' Identities on')
+      ELSE
+        WRITE(KBOUT,1002)
+1002    FORMAT(' Identites off')
+      END IF
+      IF(ISAME.EQ.1)THEN
+        WRITE(KBOUT,1004)
+1004    FORMAT(' Main diagonal blank')
+      ELSE
+        WRITE(KBOUT,1005)
+1005    FORMAT(' Main diagonal shown')
+      END IF
+      END
+C   REDEFD
+      SUBROUTINE REDEFD(IDIMT,J1,J2,MAXSEQ,IDIMA,ISTART,IEND,IDIMB)
+C  AUTHOR RODGER STADEN
+C  DIAGON IS COMPLICATED BY HAVING SEVERAL ARRARY LIMITS
+C  THE SMALL MACHINE VERSION USES A DISK BUFFER
+C  ALL VERSIONS HAVE A CHUNK OF SEQUENCE IN RAM (ON THE LARGE MACHINE 
+C  VERSION THIS WILL BE THE WHOLE SEQUENCE IF < MAXSEQ)
+C  ALL VERSIONS HAVE AN ACTIVE REGION FOR COMPARISON
+C  FOR ALL PROGRAMS OTHER THAN DIAGON THE ACTIVE REGION IS THE CHUNK IN THE 
+C  RAM BUFFER, BUT FOR DIAGON IT MAY BE ONLY A PART OF THIS UP TO MAXCOM
+C  THIS CHUNK IS ALSO KEPT AS INTEGERS.
+C  THIS ROUTINE IS TO INITIALIZE THE ARRAY POINTERS AND SIZES
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMT  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C          IN THE DISK BUFFER)
+C  ISTART = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQ(1)
+C  J1     = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  J2     = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIMA  = J2-J1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IEND   = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMB  = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+      K1 = 0
+      K2 = 0
+      IF(K1.EQ.0)K1=J1
+      K2MAX=MIN((K1+MAXSEQ-1),IDIMT)
+      IF(K2.GT.K2MAX)K2=K2MAX
+      IF(K2.EQ.0)K2=MIN(J2,K2MAX)
+      J1=K1
+      J2=K2
+      IDIMA=J2-J1+1
+      IDIMB=IEND-ISTART+1
+      END
+      SUBROUTINE MATTIN(MATRIX,IDM,FILNAM,FILEP,CHRSET,
+     +KBIN,KBOUT,IDEV,MATMAX,IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER MATRIX(IDM,IDM)
+      CHARACTER FILEP*(*),FILNAM*(*),CHRSET(IDM),HELPF*(*)
+      PARAMETER (MAXPRM = 20)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      IN = 1
+      PROMPT(1) = 'Identity matrix'
+      PROMPT(2) = 'MDM78 matrix'
+      PROMPT(3) = 'Personal matrix file'
+10    CONTINUE
+      CALL RADION('Select score matrix',PROMPT,3,IN,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.1) RETURN
+      IF(IN.EQ.1)THEN
+C  SET IDENTITIES
+        DO 15 I = 1,IDM
+          DO 15 J = 1,IDM
+            IF(I.EQ.J)THEN
+              MATRIX(I,J) = 1
+            ELSE 
+              MATRIX(I,J) = 0
+            END IF
+15      CONTINUE
+      ELSE IF(IN.EQ.2)THEN
+C  READ IN MDM78
+        CALL GETMAT(IDEV,FILEP,MATRIX,IDM,CHRSET,KBOUT,IOK)
+        IF(IOK.NE.0)GO TO 100
+      ELSE IF(IN.EQ.3)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Matrix file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 10
+        CALL GETMAT(IDEV,FILNAM,MATRIX,IDM,CHRSET,KBOUT,IOK)
+        IF(IOK.NE.0)GO TO 100
+      ELSE 
+        GO TO 10  
+      END IF
+C   NEED TO KNOW LARGEST SCORE IN MATRIX FOR STATS
+      MATMAX=0
+      DO 20 I=1,IDM
+        DO 20 J=1,IDM
+          IF(MATMAX.LT.MATRIX(I,J))MATMAX=MATRIX(I,J)
+20    CONTINUE
+      MATMAX=MATMAX+1
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Remember to reset the score for',/,
+     +       ' the proportional algorithm')
+      RETURN
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error in score matrix file')
+      GO TO 10
+      END
+      SUBROUTINE ALIGNM(SEQ1N,SEQ2N,IDIM1,IDIM2,CC,DD,RR,SS,SOP,
+     +NMAX,IV1,IH1,IDIMV,IDIMH,MAXSEQ,WTS,IDM,ISCORE,IG,IH,KBIN,
+     +KBOUT,IDEV,SEQ1,SEQ2,SEQ1A,SEQ2A,
+     +IHELPS,IHELPE,HELPF,IDEVH,KEEP,STACK,MAXSTK,STKREC,IOK)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1),RR(0:NMAX+1),SS(0:NMAX+1)
+      INTEGER WTS(0:IDM,0:IDM),SOP(0:2*NMAX),R1,R2
+      INTEGER SEQ1N(NMAX),SEQ2N(NMAX),STKREC,STACK(0:MAXSTK)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXSEQ),SEQ1A(NMAX),SEQ2A(NMAX)
+      CHARACTER HELPF*(*),PAD
+      SAVE PAD
+      DATA PAD/','/
+C
+C  12-1-91 modified call to alignd to send nmax instead of maxseq
+C
+C      MN = 1
+C      MX = NMAX
+C      WINDOW = NMAX
+C      CALL GETINT(MN,MX,WINDOW,'Window size',
+C     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) RETURN
+C      WINDOW = IVAL
+C10    CONTINUE
+      CALL BUSY(KBOUT)
+      LW1 = MIN(IDIM1,NMAX)
+      LW2 = MIN(IDIM2,NMAX)
+      N = MAX(LW1,LW2)
+      CALL DIFF(SEQ1N,SEQ2N,LW1,LW2,
+     +CC,DD,RR,SS,SOP,N,
+     +WTS,IDM,ISCORE,IG,IH,KBOUT,STACK,MAXSTK,STKREC,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL ALIGND(SEQ1(IV1),SEQ2(IH1),SOP,
+     +SEQ1A,SEQ2A,N,NMAX,R1,R2,PAD,NP1,NP2,IDIM1,IDIM2)
+      KPOUT = MAX(R1,R2)
+      PC = PCON(SEQ1A,SEQ2A,KPOUT,PAD)
+      CALL FMT2(IDEV,SEQ1A,SEQ2A,KPOUT,IV1,IH1)
+      WRITE(IDEV,1001)PC
+1001  FORMAT(' Conservation ',F5.1,'%')
+      WRITE(IDEV,1002)NP1,NP2
+1002  FORMAT(' Number of padding characters inserted',I6,' and',I6)
+      CALL YESNO(KEEP,'Keep alignment',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(KEEP.LT.0) RETURN
+      IF(KEEP.EQ.1) RETURN
+      NMOV = KPOUT - IDIM1
+      IV2 = IV1 + IDIM1
+      CALL MOVEC(SEQ1,MAXSEQ,IDIMV,IV2,NMOV)
+      IDIMV = IDIMV + NMOV
+      CALL SQCOPY(SEQ1A,SEQ1(IV1),KPOUT)
+      NMOV = KPOUT - IDIM2
+      IV2 = IH1 + IDIM2
+      CALL MOVEC(SEQ2,MAXSEQ,IDIMH,IV2,NMOV)
+      IDIMH = IDIMH + NMOV
+      CALL SQCOPY(SEQ2A,SEQ2(IH1),KPOUT)
+      END
+      SUBROUTINE ALIGND(SEQ1,SEQ2,S,SEQ1A,SEQ2A,NMAX,MAXSEQ,
+     +R1,R2,PAD,NP1,NP2,IDIM1,IDIM2)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXSEQ),SEQ1A(MAXSEQ),SEQ2A(MAXSEQ)
+      CHARACTER PAD
+      INTEGER S(0:2*NMAX),P1,P2,POUT,R1,R2
+C  12-1-91 Added checks for pout going off end of array
+C  6-6-91  Added another check!
+      P1 = 1
+      P2 = 1
+      NP1 = 0
+      NP2 = 0
+      POUT = 1
+      I = -1
+10    CONTINUE
+      IF((P1.LE.IDIM1).AND.(P2.LE.IDIM2).AND.(POUT.LE.MAXSEQ)) THEN
+        I = I + 1
+        IF(S(I).EQ.0) THEN
+C          WRITE(*,*)P1,'=',P2,SEQ1(P1),SEQ2(P2)
+          SEQ1A(POUT) = SEQ1(P1)
+          SEQ2A(POUT) = SEQ2(P2)
+          P1 = P1 + 1
+          P2 = P2 + 1
+          POUT = POUT + 1
+        ELSE
+          IF(S(I).LT.0) THEN
+            K = ABS(S(I))
+C            WRITE(*,*)'INSERT ',K,' AT',P2
+            CALL SQCOPY(SEQ1(P1),SEQ1A(POUT),K)
+            CALL FILLC(SEQ2A(POUT),K,PAD)
+            NP2 = NP2 + K
+            P1 = P1 + K
+            POUT = POUT + K
+          ELSE
+            K = S(I)
+C            WRITE(*,*)'INSERT ',K,' AT',P1
+            CALL SQCOPY(SEQ2(P2),SEQ2A(POUT),K)
+            CALL FILLC(SEQ1A(POUT),K,PAD)
+            NP1 = NP1 + K
+            P2 = P2 + K
+            POUT = POUT + K
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      R1 = POUT - 1
+      R2 = POUT - 1
+      J = 0
+      K = 0
+      IF((P1.LE.IDIM1).AND.(POUT.LE.MAXSEQ)) THEN
+        J = IDIM1 - P1 + 1
+        J = MIN(J,MAXSEQ-POUT+1)
+        CALL SQCOPY(SEQ1(P1),SEQ1A(POUT),J)
+        R1 = R1 + J
+      END IF
+      IF((P2.LE.IDIM2).AND.(POUT.LE.MAXSEQ)) THEN
+        K = IDIM2 - P2 + 1
+        K = MIN(K,MAXSEQ-POUT+1)
+        CALL SQCOPY(SEQ2(P2),SEQ2A(POUT),K)
+        R2 = R2 + K
+      END IF
+      I = R1 - R2
+      IF(I.GT.0) THEN
+        CALL FILLC(SEQ2A(POUT+K),I,PAD)
+      ELSE IF(I.LT.0) THEN
+        I = ABS(I)
+        CALL FILLC(SEQ1A(POUT+J),I,PAD)
+      END IF
+      END
+      REAL FUNCTION PCON(SEQ1,SEQ2,L,PAD)
+      CHARACTER SEQ1(L),SEQ2(L),PAD
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      N = 0
+      DO 10 I = 1,L
+        IF(SEQ1(I).EQ.PAD) SEQ1(I) = '-'
+        IF(SEQ2(I).EQ.PAD) SEQ2(I) = '-'
+        IF(CTONUM(SEQ1(I)).EQ.CTONUM(SEQ2(I))) N = N + 1
+10    CONTINUE
+      PCON = 100. * REAL(N)/REAL(L)
+      END
+      SUBROUTINE DIFF(SA,SB,IDIM1,IDIM2,CC,DD,RR,SS,SOP,NMAX,
+     +W,IDM,SCORE,G,H,KBOUT,STACK,MAXSTK,STKREC,IOK)
+      IMPLICIT INTEGER(A-Z)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1),RR(0:NMAX+1),SS(0:NMAX+1)
+      INTEGER W(0:IDM,0:IDM),SOP(0:2*NMAX),STACK(0:MAXSTK)
+      INTEGER SA(NMAX),SB(NMAX)
+      EXTERNAL GAP
+      IOK = 0
+      CALL FILLI(SOP,2*NMAX+1,0)
+      M = IDIM1
+      N = IDIM2
+      LM = G + H
+      LAST = 0
+      SAPP = 0
+      A = 1
+      B = 1
+      TB = G
+      TE = H
+      MIDI = 0
+      MIDJ = 0
+      MIDC = 0
+      TYPE = 0
+C  INITIALISE THE STACK WITH A PUSH
+      CALL STACKH(1,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,990,
+     +STACK,MAXSTK,STKREC,IOK)
+      IF(IOK.NE.0) GO TO 999
+90    CONTINUE
+       IF(N.LE.0)THEN
+         IF(M.GT.0) CALL DEL(M,LAST,SOP,NMAX,SAPP)
+         ANS =  GAP(M,G,H)
+         GO TO 980
+       END IF
+       IF(M.LE.1) THEN
+         IF(M.LE.0) THEN
+           CALL INS(N,LAST,SOP,NMAX,SAPP)
+           ANS = GAP(N,G,H)
+           GO TO 980
+         END IF
+         IF(TB.GT.TE) TB = TE
+         MIDC = TB + H + GAP(N,G,H)
+         MIDJ = 0
+         ITP = B - 1
+         DO 100 J = 1,N
+           C = GAP(J-1,G,H) + W(SA(A),SB(J+ITP)) + GAP(N-J,G,H)
+           IF (C.LT.MIDC) THEN
+             MIDC = C
+             MIDJ = J
+           END IF
+100      CONTINUE
+         IF(MIDJ.EQ.0)THEN
+           CALL INS(N,LAST,SOP,NMAX,SAPP)
+           CALL DEL(1,LAST,SOP,NMAX,SAPP)
+         ELSE
+           IF(MIDJ.GT.1) CALL INS(MIDJ-1,LAST,SOP,NMAX,SAPP)
+           CALL REP(LAST,SOP,NMAX,SAPP)
+           IF(MIDJ.LT.N) CALL INS(N-MIDJ,LAST,SOP,NMAX,SAPP)
+         END IF
+         ANS = MIDC
+         GO TO 980
+       END IF
+      MIDI = M/2
+C   FORWARD PHASE
+      CC(0) = 0
+      T = G
+      DO 200 J=1,N
+        T = T + H
+        CC(J) = T
+        DD(J) = T + G
+200   CONTINUE
+      T = TB
+      JTP = B - 1
+      DO 400 I = 1,MIDI
+        S = CC(0)
+        T = T + H
+        C = T
+        CC(0) = C
+        E = T + G
+        ITP = I + A - 1
+        DO 300 J = 1,N
+          C = C + LM
+          E = E + H
+          IF(C.LT.E) E = C
+          C = CC(J) + LM
+          D = DD(J) + H
+          IF(C.LT.D) D = C
+          C = S + W(SA(ITP),SB(J+JTP))
+          IF(E.LT.C) C = E
+          IF(D.LT.C) C = D
+          S = CC(J)
+          CC(J) = C
+          DD(J) = D
+300     CONTINUE
+400   CONTINUE
+      DD(0) = CC(0)
+C  REVERSE PHASE
+      RR(N) = 0
+      T = G
+      DO 500 J =N-1,0,-1
+        T = T + H
+        RR(J) = T
+        SS(J) = T + G
+500   CONTINUE
+      T = TE
+      DO 700 I=M-1,MIDI,-1
+        S = RR(N)
+        T = T + H
+        C = T
+        RR(N) = C
+        E = T + G
+        ITP = I + A
+        DO 600 J = N-1,0,-1
+          C = C + LM
+          E = E + H
+          IF(C.LT.E) E = C
+          C = RR(J) + LM
+          D = SS(J) + H
+          IF(C.LT.D) D = C
+          C = S + W(SA(ITP),SB(J+B))
+          IF(E.LT.C) C = E
+          IF(D.LT.C) C = D
+          S = RR(J)
+          RR(J) = C
+          SS(J) = D
+600     CONTINUE
+700   CONTINUE
+      SS(N) = RR(N)
+C   FIND OPTIMAL MIDPOINT
+      MIDC = CC(0) + RR(0)
+      MIDJ = 0
+      TYPE = 1
+      DO 800 J = 0,N
+        C = CC(J) + RR(J)
+        IF(C.LE.MIDC) THEN
+          IF((C.LT.MIDC).OR.(CC(J).NE.DD(J)).AND.(RR(J).EQ.SS(J)))THEN
+            MIDC = C
+            MIDJ = J
+          END IF
+        END IF
+800   CONTINUE
+      DO 900 J = N,0,-1
+        C = DD(J) + SS(J) - G
+        IF(C.LT.MIDC) THEN
+          MIDC = C
+          MIDJ = J
+          TYPE = 2
+        END IF
+900   CONTINUE
+C  CONQUER RECURSIVELY AROUND MIDPOINT
+       IF(TYPE.NE.1) GO TO 960
+       IF(TYPE.EQ.1) THEN
+         CALL STACKH(1,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,950,
+     +STACK,MAXSTK,STKREC,IOK)
+         IF(IOK.NE.0) GO TO 999
+         M = MIDI
+         N = MIDJ
+         TE = G
+         GO TO 90
+       END IF
+950    CONTINUE
+       CALL STACKH(1,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,980,
+     +STACK,MAXSTK,STKREC,IOK)
+       IF(IOK.NE.0) GO TO 999
+       A = A + MIDI
+       B = B + MIDJ
+       M = M - MIDI
+       N = N - MIDJ
+       TB = G
+       GO TO 90
+960    CONTINUE
+       CALL STACKH(1,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,970,
+     +STACK,MAXSTK,STKREC,IOK)
+       IF(IOK.NE.0) GO TO 999
+       M = MIDI - 1
+       N = MIDJ
+       TE = 0
+       GO TO 90
+970    CONTINUE
+       CALL DEL(2,LAST,SOP,NMAX,SAPP)
+       CALL STACKH(1,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,980,
+     +STACK,MAXSTK,STKREC,IOK)
+       IF(IOK.NE.0) GO TO 999
+       A = A + MIDI + 1
+       B = B + MIDJ
+       M = M - MIDI - 1
+       N = N - MIDJ
+       TB = 0
+       GO TO 90
+980    CONTINUE
+       SCORE = MIDC
+       CALL STACKH(2,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,ADDR,
+     +STACK,MAXSTK,STKREC,IOK)
+       IF(IOK.NE.0) GO TO 999
+       IF(ADDR.EQ.950) GO TO 950
+       IF(ADDR.EQ.970) GO TO 970
+       IF(ADDR.EQ.980) GO TO 980
+       IF(ADDR.EQ.990) GO TO 990
+       WRITE(KBOUT,*)'Unexpected address in align'
+       IOK = 4
+       RETURN
+999    CONTINUE
+       IF(IOK.EQ.1) THEN
+         WRITE(KBOUT,*)'Stack overflow'
+       ELSE IF(IOK.EQ.2) THEN
+         WRITE(KBOUT,*)'Stack underflow'
+       ELSE IF (IOK.EQ.3) THEN
+         WRITE(KBOUT,*)'Unexpected stack task'
+       END IF
+990    CONTINUE
+       END
+      SUBROUTINE GETGAP(KBIN,KBOUT,IG,IH,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDM,IOK)
+      CHARACTER HELPF*(*)
+C FOR GAP OF LENGTH K, COST IS G + H*K
+C WHERE G IS COST OF STARTING GAP, AND H IS ADDED FOR EACH ELEMENT IN GAP
+C      WRITE(KBOUT,*)'THE COST OF A GAP OF LENGTH K = G + H*K'
+      MN = 1
+      MX = 100
+      IG = 10
+      IF(IDM.EQ.5) IG = 20
+      CALL GETINT(MN,MX,IG,'Penalty for starting a gap',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IG = IVAL
+      MN = 1
+      MX = 100
+      IH = 10
+      IF(IDM.EQ.5) IH = 5
+      CALL GETINT(MN,MX,IH,'Penalty for each residue in gap',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IH = IVAL
+      END
+      INTEGER FUNCTION GAP(K,G,H)
+      INTEGER K,G,H
+C  NOTE FOR DES: HE HAS EQ HERE
+      IF(K.LE.0) THEN
+        GAP = 0
+      ELSE
+        GAP = G + H * K
+      END IF
+      END
+      SUBROUTINE DEL(K,LAST,S,NMAX,SAPP)
+      INTEGER S(0:NMAX*2),SAPP
+      IF(LAST.LT.0) THEN
+        S(SAPP-1) = S(SAPP-1) - K
+        LAST = -K
+      ELSE
+        S(SAPP) = -K
+        SAPP = SAPP + 1
+        LAST = -K
+      END IF
+      END
+       SUBROUTINE INS(K,LAST,S,NMAX,SAPP)
+       INTEGER S(0:NMAX*2),SAPP
+       IF (LAST.LT.0) THEN
+         S(SAPP-1) = K
+         S(SAPP) = LAST
+C  DES HAS NEXT LINE. I REMOVED IT 16-5-89
+C         LAST = K
+         SAPP = SAPP + 1
+       ELSE
+         S(SAPP) = K
+         SAPP = SAPP + 1
+         LAST = K
+       END IF
+       END
+       SUBROUTINE REP(LAST,S,NMAX,SAPP)
+       INTEGER S(0:NMAX*2),SAPP
+       S(SAPP) = 0
+       SAPP = SAPP + 1
+       LAST = 0
+       END
+      SUBROUTINE STACKH(JOB,A,B,M,N,TB,TE,MIDI,MIDJ,TYPE,MIDC,ADDR,
+     +STACK,MAXSTK,STKREC,IOK)
+       IMPLICIT INTEGER (A-Z)
+       INTEGER STACK(0:MAXSTK)
+       SAVE
+       DATA SP/-1/
+C  HANDLE STACK: 1 = PUSH, 2 = POP
+       IF(JOB.EQ.1) THEN
+         SP = SP + STKREC
+         IF(SP.GT.MAXSTK) THEN
+C           WRITE(*,*)'HELP, STACK OVERFLOW'
+           IOK = 1
+           RETURN
+         END IF
+         STACK(SP-10) = A
+         STACK(SP-9) = B
+         STACK(SP-8) = M
+         STACK(SP-7) = N
+         STACK(SP-6) = TB
+         STACK(SP-5) = TE
+         STACK(SP-4) = MIDI
+         STACK(SP-3) = MIDJ
+         STACK(SP-2) = TYPE
+         STACK(SP-1) = MIDC
+         STACK(SP) = ADDR
+       ELSE IF(JOB.EQ.2) THEN
+         SP = SP - STKREC
+         IF(SP.LT.-1) THEN
+C           WRITE(*,*)'HELP, STACK UNDERFLOW'
+           IOK = 2
+           RETURN
+         END IF
+         A = STACK(SP+1)
+         B = STACK(SP+2)
+         M = STACK(SP+3)
+         N = STACK(SP+4)
+         TB = STACK(SP+5)
+         TE = STACK(SP+6)
+         MIDI = STACK(SP+7)
+         MIDJ = STACK(SP+8)
+         TYPE = STACK(SP+9)
+         MIDC = STACK(SP+10)
+         ADDR = STACK(SP+11)
+       ELSE
+C        WRITE(*,*)'HELP, STACK COCKUP'
+         IOK = 3
+C         STOP
+       END IF
+       END
+      SUBROUTINE PAMDIS(MATRIX,WTS,IDM)
+      INTEGER MATRIX(IDM,IDM),WTS(0:IDM,0:IDM)
+      IF(IDM.EQ.26) THEN
+      N = 8
+      M = 0
+      DO 10 I = 1,IDM
+        DO 5 J = 1,IDM
+          K = MATRIX(I,J)
+          M = MAX(M,K)
+5       CONTINUE
+10    CONTINUE
+      DO 20 I = 1,IDM
+        DO 15 J = 1,IDM
+          K = M - MATRIX(I,J)
+          WTS(I,J) = K
+15      CONTINUE
+20    CONTINUE
+      DO 30 I = 0,IDM
+        WTS(I,0) = N
+        WTS(0,I) = N
+30    CONTINUE
+      RETURN
+      END IF
+      DO 40 I = 1,IDM
+      DO 40 J = 1,IDM
+         IF(I.EQ.J) THEN
+           WTS(I,J) = 0
+         ELSE
+           WTS(I,J) = 20
+         END IF
+40    CONTINUE
+      DO 50 I = 0,IDM
+        WTS(I,0) = 10
+        WTS(0,I) = 10
+50    CONTINUE
+      END
+      SUBROUTINE ENCONC(SEQ,IDIM,POSN,WORDP,IDE,IDCHAR,CONSTS,LENGTH,
+     +LCONST)
+C  AUTHOR RODGER STADEN
+      INTEGER SEQ(IDIM)
+      INTEGER POSN(IDIM),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER NCODEA
+      EXTERNAL NCODEA
+C  ENCODES A SEQUENCE OF LENGTH IDIM AND CHARACTERSET SIZE IDCHAR
+C  INTO TWO ARRAYS: WORDP(I) CONTAINS THE POSITION OF THE FIRST OCCURRENCE
+C  OF WORD(I), POSN(I) CONTAINS A LINKED LIST OF SECOND, THIRD,... OCCURENCES
+C  OF WORD
+      CALL FILLI(POSN,IDIM,0)
+      CALL FILLI(WORDP,IDE,0)
+      DO 20 I = 1, IDIM-LENGTH+1
+        J = NCODEA(SEQ(I),LENGTH,CONSTS,IDCHAR,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.EQ.0)THEN
+            WORDP(J) = I
+          ELSE
+10          CONTINUE
+              J2 = J1
+              J1 = POSN(J2)
+              IF(J1.NE.0) GO TO 10
+              POSN(J2) = I
+          END IF
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE QICKS(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,MATRIX,SEQS,MAIND,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,SPAN,MINPRO,KBOUT,RNSD,
+     +ISAME)
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2),SEQS(IDIM1),SPAN
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER MAIND(IDM)
+      INTEGER HIST(-MAXSEQ:MAXSEQ)
+      PARAMETER (MAXDIA = 20)
+      INTEGER TOPD(MAXDIA),TOPI(MAXDIA),TOPJ(MAXDIA),MATRIX(IDM,IDM)
+      EXTERNAL NCODEA
+      NDIAG = MAXDIA
+      CALL BUSY(KBOUT)
+      CALL FILLI(HIST(-IDIM1),IDIM2+IDIM1+1,0)
+      CALL SETCN(CONSTS,LENGTH,IDM,LCONST)
+C      WRITE(*,*)'CONSTS'
+C      WRITE(*,*)CONSTS
+      CALL ENCONC(SEQ1,IDIM1,POSN,WORDP,IDE,IDM,CONSTS,LENGTH,
+     +LCONST)
+      DO 4 I = 1,IDM
+        MAIND(I) = MATRIX(I,I)
+4     CONTINUE
+C      WRITE(*,*)'MAIND'
+C      WRITE(*,*)MAIND
+      CALL WDSCR(SEQ1,SEQS,IDIM1,LENGTH,MAIND,IDM)
+C      WRITE(*,*)'SEQS'
+C      WRITE(*,*)(SEQS(K),K=1,30)
+C      WRITE(*,*)'SEQ1'
+C      WRITE(*,*)(SEQ1(K),K=1,30)
+C      WRITE(*,*)IDIM1,IDIM2,LENGTH
+      DO 20 I = 1,IDIM2-LENGTH+1
+        J = NCODEA(SEQ2(I),LENGTH,CONSTS,IDM,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            K = I - J1
+C  FOR IDENTITIES ADD 1 ON NEXT LINE (NOT SEQS)
+            HIST(K) = HIST(K) + SEQS(J1)
+10          CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              K = I - J1
+C  FOR IDENTITIES ADD 1 ON NEXT LINE (NOT SEQS)
+              HIST(K) = HIST(K) + SEQS(J1)
+              GO TO 10
+            END IF
+          END IF        
+        END IF
+20    CONTINUE
+      IF (ISAME.EQ.1) HIST(0) = 0
+      CALL MHIST(HIST,IDIM1,IDIM2,TOPD,TOPI,TOPJ,
+     +NDIAG,MAXSEQ,RNSD)
+      IF(NDIAG.EQ.0) THEN
+        WRITE(KBOUT,*)' No diagonals found scoring',RNSD,
+     +  ' sd above mean'
+        RETURN
+      END IF
+      ISPO2 = SPAN/2
+      XMIN = 1.
+      XMAX = IDIM2
+      YMIN = 1.
+      YMAX = IDIM1
+      CALL VECTOM
+      DO 40 I = 1,NDIAG
+        IF(TOPD(I).NE.0) THEN
+          L = MIN(IDIM1-TOPJ(I),IDIM2-TOPI(I)) + 1
+          CALL DSCORP(SEQ1(TOPJ(I)),SEQ2(TOPI(I)),L,MATRIX,IDM,
+     +    SPAN,MINPRO,ISPO2,TOPJ(I),TOPI(I),
+     +XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+40    CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE DSCORP(SEQ1,SEQ2,L,MATRIX,IDM,SPAN,MINSCR,SPO2,JS,IS,
+     +XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER SEQ1(L),SEQ2(L),MATRIX(IDM,IDM),SPAN,FRONT,BACK,SPO2
+C 8-6-91 Fixed bug that allowed span>L
+      M = 0
+      FRONT = SPAN
+      BACK = 0
+      DO 10 I = 1,MIN(SPAN,L)
+        M = M + MATRIX(SEQ1(I),SEQ2(I))
+10    CONTINUE
+      IF(M.GE.MINSCR) THEN
+        Y = JS + SPO2
+        X = IS + SPO2
+C          WRITE(*,*)IX,IY
+        CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      DO 20 I = 2,L-SPAN+1
+        FRONT = FRONT + 1
+        BACK = BACK + 1
+        MM = MATRIX(SEQ1(BACK),SEQ2(BACK))
+        MP =  MATRIX(SEQ1(FRONT),SEQ2(FRONT))
+        M = M - MM + MP
+        IF(M.GE.MINSCR) THEN
+          Y = JS + BACK + SPO2
+          X = IS + BACK + SPO2
+C          WRITE(*,*)IX,IY
+          CALL POINT(X,Y,XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE MHIST(HIST,IDIM1,IDIM2,TOPD,TOPI,TOPJ,NDIAG,
+     + MAXSEQ,RNSD)
+      INTEGER HIST(-MAXSEQ:MAXSEQ),TOPD(NDIAG),TOPI(NDIAG),TOPJ(NDIAG)
+C   ANALYSE HISTOGRAM TO FIND BEST NDIAG DIAGONALS
+C   LOOK AT THE TOP TEN SCORES, TOP DOWN
+C   IF ELEMENT I > TOP(J) THEN MOVE J+1 TO NDIAG-1 DOWN
+C   PUT I AT TOP(J)
+C   DIAGO FINDS THE INTERSECTION OF DIAGONAL I ON THE 2 AXES
+C   FIRST FIND THE TOP SCORES AND THERE HIST ELEMENT NO
+C   THEN GET THEIR AXES INTERSECTIONS
+      IDIAG = 0
+      CALL FILLI(TOPD,NDIAG,0)
+      CALL FILLI(TOPI,NDIAG,0)
+      CALL FILLI(TOPJ,NDIAG,0)
+      RMSQ = 0.
+      RM = 0.
+      DO 20 I = -IDIM1,IDIM2
+        RJ = HIST(I)
+        RM = RM + RJ
+        RMSQ = RMSQ + RJ * RJ
+20    CONTINUE
+      N = IDIM1 + IDIM2
+      RM = RM / N
+      RMSQ = RMSQ / N
+      RM2 = RM * RM
+      SD = 0.
+      T = RMSQ - RM2
+      IF(T.GT.0.) SD = SQRT(T)
+      MINS = NINT(RM + RNSD * SD)
+C   TRY TO FIND HIGHEST SCORES FIRST
+      DO 100 I = 0,IDIM2
+        M = HIST(I)
+        IF(M.GT.MINS) THEN
+        IDIAG = IDIAG + 1
+        IF(M.GT.TOPD(NDIAG)) THEN
+          DO 50 J = 1,NDIAG
+            IF(M.GT.TOPD(J)) THEN
+              DO 40 K = NDIAG-1,J,-1
+                TOPD(K+1) = TOPD(K)
+                TOPI(K+1) = TOPI(K)
+40            CONTINUE
+              TOPD(J) = M
+              TOPI(J) = I
+              GO TO 60
+            END IF
+50        CONTINUE
+        END IF
+        END IF
+60      CONTINUE
+100   CONTINUE
+      DO 200 I = -1,-IDIM1,-1
+        M = HIST(I)
+        IF(M.GT.MINS) THEN
+        IDIAG = IDIAG + 1
+        IF(M.GT.TOPD(NDIAG)) THEN
+          DO 150 J = 1,NDIAG
+            IF(M.GT.TOPD(J)) THEN
+              DO 140 K = NDIAG-1,J,-1
+                TOPD(K+1) = TOPD(K)
+                TOPI(K+1) = TOPI(K)
+140           CONTINUE
+              TOPD(J) = M
+              TOPI(J) = I
+              GO TO 160
+            END IF
+150       CONTINUE
+        END IF
+        END IF
+160     CONTINUE
+200   CONTINUE
+      NDIAG = MIN(IDIAG,NDIAG)
+      DO 300 I = 1,NDIAG
+        J = TOPI(I)
+        CALL DIAGO(J,TOPI(I),TOPJ(I))
+300   CONTINUE
+      END
+      SUBROUTINE DIAGO(I,II,JJ)
+      IF(I.GE.0) THEN
+        JJ = 1 
+        II = I + 1
+      ELSE
+        II = 1 
+        JJ = ABS(I) + 1
+      END IF
+      END
+      SUBROUTINE WDSCR(SEQ1N,SEQS,IDIM1,KTUP,MAIND,IDM)
+      INTEGER SEQ1N(IDIM1),SEQS(IDIM1),MAIND(IDM)
+      DO 10 I =1,IDIM1-KTUP+1
+        K = 0
+        DO 5 J = I,I+KTUP-1
+          K = K + MAIND(SEQ1N(J))
+5       CONTINUE
+        SEQS(I) = K
+10    CONTINUE
+      END
+      SUBROUTINE DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+      MN = 1
+      MX = MAX(1,MATMAX-1) * LENGTH
+      MINPRO = INT(AVSCOR*LENGTH)
+      CALL GETINT(MN,MX,MINPRO,'Proportional score',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.EQ.0) MINPRO = IVAL
+      END
+      SUBROUTINE DP22(IDM,MINPER,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+      MN = 1
+      IF(IDM.EQ.5) THEN
+        MX = 100
+      ELSE
+        MX = 20
+      END IF
+      CALL GETINT(MN,MX,MINPER,'Identity score',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.EQ.0) MINPER = IVAL
+      END
+      SUBROUTINE DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +NOROO,NOROE,MAXSEQ,
+     +SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +ISTARH,IENDH,LB,LF,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*),NOROO,NOROE,CHRSET(IDM)
+      CHARACTER SEQHC(MAXSEQ),SEQVC(MAXSEQ)
+      INTEGER SEQH(MXCOMP),SEQV(MXCOMP)
+      LTEMP=LENGTH
+5     CONTINUE
+      MN = 1
+      MX = MIN(MXSPAN,MIN(IDIMVA,IDIMHA))
+      CALL GETINT(MN,MX,LENGTH,'Odd span length',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(MOD(IVAL,2).NE.1)GO TO 5
+      LENGTH = IVAL
+      LB=(LENGTH+1)/2
+      LF=LENGTH/2
+      IDIMVP=IDIMVA+LENGTH
+      IDIMHP=IDIMHA+LENGTH
+      NOROO='N'
+      NOROE='N'
+C   IF LENGTH LONGER THAN BEFORE EXTEND INTEGER BUFFERS
+      IF(LTEMP.LT.LENGTH)THEN
+        CALL FILEDG(SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+     +    CHRSET,IDM,MSPO2,ISTARV,IENDV)
+        CALL FILEDG(SEQHC,IDIMBH,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+     +    CHRSET,IDM,MSPO2,ISTARH,IENDH)
+      END IF
+      END
+      SUBROUTINE DP33(RNSD,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+      RMN = 0.
+      RMX = 10.
+      CALL GETRL(RMN,RMX,RNSD,'Number of sd above mean',
+     +  RVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.EQ.0) RNSD = RVAL
+      END
+      SUBROUTINE CFSQL(SEQ1,IDIM1,SEQ2,IDIM2,POSN,WORDP,IDE,IDCHAR,
+     +CONSTS,LCONST,LENGTH,MINMAT,IDEV,KBOUT,ISAME,SEQ,HIST,MAXSEQ)
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2),HIST(-MAXSEQ:MAXSEQ)
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      CHARACTER SEQ(IDIM1)
+      CALL BUSY(KBOUT)
+      CALL SETCN(CONSTS,LENGTH,IDCHAR,LCONST)
+      CALL ENCONC(SEQ1,IDIM1-MINMAT+1,POSN,WORDP,IDE,IDCHAR,
+     +CONSTS,LENGTH,LCONST)
+      CALL CFSEQL(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,LCONST,
+     +LENGTH,IDCHAR,MINMAT,IDEV,ISAME,SEQ,HIST,MAXSEQ)
+      END
+      SUBROUTINE CFSEQL(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,
+     +LCONST,
+     +LENGTH,IDCHAR,MINMAT,IDEV,ISAME,SEQ,HIST,MAXSEQ)
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER NCODEA,EXTNDN
+      INTEGER HIST(-MAXSEQ:MAXSEQ)
+      CHARACTER SEQ(IDIM1)
+      EXTERNAL NCODEA,EXTNDN
+      LEX = MINMAT - LENGTH - 1
+C
+C if we are looking for internal repeats we switch off half the matrix
+C by making the saved score high
+C
+      IF (ISAME.EQ.1) THEN
+        CALL FILLI(HIST(-IDIM1),IDIM1,IDIM1+IDIM2)
+        CALL FILLI(HIST(0),IDIM2,0)
+      ELSE
+        CALL FILLI(HIST(-IDIM1),IDIM1+IDIM2+1,0)
+      END IF
+      DO 20 I = 1,IDIM2-MINMAT+1
+        J = NCODEA(SEQ2(I),LENGTH,CONSTS,IDCHAR,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            IF (((ISAME.EQ.1).AND.(I.NE.J1)).OR.(ISAME.EQ.0)) THEN
+              LT = 1 + MIN(IDIM1-(J1+LENGTH),IDIM2-(I+LENGTH))
+              IMATCH = EXTNDN(SEQ1(J1+LENGTH),SEQ2(I+LENGTH),LT)
+              IF(IMATCH.GT.LEX)THEN
+                L = I + IMATCH + LENGTH
+                K = I - J1
+C
+C if this match ends furthest away we display it and save it
+C
+                IF (HIST(K).LT.L) THEN
+                  HIST(K) = L
+                  WRITE(IDEV,1000)I,J1,IMATCH+LENGTH
+ 1000             FORMAT(' Positions',I7,'h',I7,'v and length',I7)
+                  WRITE(IDEV,1001)(SEQ(K),K=J1,J1+IMATCH+LENGTH-1)
+ 1001             FORMAT(' ',50A1)
+                END IF
+              END IF
+            END IF
+10        CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              IF (((ISAME.EQ.1).AND.(I.NE.J1)).OR.(ISAME.EQ.0)) THEN
+                LT = 1 + MIN(IDIM1-(J1+LENGTH),IDIM2-(I+LENGTH))
+                IMATCH = EXTNDN(SEQ1(J1+LENGTH),SEQ2(I+LENGTH),LT)
+                IF(IMATCH.GT.LEX)THEN
+                  L = I + IMATCH + LENGTH
+                  K = I - J1
+                  IF (HIST(K).LT.L) THEN
+                    HIST(K) = L
+                    WRITE(IDEV,1000)I,J1,IMATCH+LENGTH
+                    WRITE(IDEV,1001)(SEQ(K),K=J1,J1+IMATCH+LENGTH-1)
+                  END IF
+                END IF
+              END IF
+              GO TO 10
+            END IF        
+          END IF
+        END IF
+20    CONTINUE
+      END
+      INTEGER FUNCTION EXTNDN(SEQ1,SEQ2,LENGTH)
+      INTEGER SEQ1(LENGTH),SEQ2(LENGTH)
+      DO 10 I = 1,LENGTH
+        IF(SEQ1(I).NE.SEQ2(I)) THEN
+          EXTNDN = I - 1
+          RETURN
+        END IF
+10    CONTINUE
+      EXTNDN = LENGTH
+      END
diff --git a/src/staden/edMenu.c b/src/staden/edMenu.c
new file mode 100644
index 0000000..aef3084
--- /dev/null
+++ b/src/staden/edMenu.c
@@ -0,0 +1,140 @@
+/*
+** Menu driver for xdap contig editor
+**
+**
+** To add new menu entries:
+**    1. Add a new option name to menuOptions.
+**    2. Create a new ed_menu_option line, providing the string to appear on
+**       menu, and menu option.
+**    3. Add a call to your interface routine to the switch statement in
+**       MenuSelectCallBack.
+**    4. That's all
+*/
+
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+#include <X11/Xaw/SmeLine.h>
+
+#include "edMenu.h"
+#include "edUtils.h"
+#include "contigEditor.h"
+
+/* ---- Types ---- */
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+enum menuOptions {
+  OptLine,
+  OptSearchGeneric,
+  OptSaveContig,
+  OptCreateTag,
+  OptEditTag,
+  OptDeleteTag,
+  OptShowDifferences,
+  OptDumpContig
+  };
+
+static MenuData ed_menu_options[] =
+{
+    {"Search",                                 OptSearchGeneric},
+    {"Highlight Disagreements",                OptShowDifferences},
+    {"",                                       OptLine},
+    {"Save Contig",                            OptSaveContig},
+    {"Dump Contig to File",                    OptDumpContig},
+    {"",                                       OptLine},
+    {"Create Tag",                             OptCreateTag},
+    {"Edit Tag",                               OptEditTag},
+    {"Delete Tag",                             OptDeleteTag},
+};
+
+
+
+
+
+
+
+
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(XtParent(XtParent(w))));
+
+    switch ( (int) i ) {
+        case OptSaveContig: 
+	    saveDB(
+		xx,
+		saveState.idevr,
+		saveState.idevw,
+		saveState.idevn,
+		saveState.relpg,
+		saveState.lngthg,
+		saveState.lnbr,
+		saveState.rnbr,
+		saveState.maxgel
+	    );
+	    break;
+        case OptCreateTag:
+            createTag(xx);
+	    break;
+        case OptEditTag:
+            editTag(xx);
+	    break;
+        case OptDeleteTag:
+            deleteTag(xx);
+	    break;
+        case OptSearchGeneric:
+	    invokeSearchGeneric(xx);
+	    break;
+	case OptShowDifferences:
+	    xx->showDifferences ^= 1;
+	    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	    break;
+        case OptDumpContig:
+            dumpContig(xx);
+	    break;
+	default:
+	    break;
+    }
+}
+
+
+Widget createEdMenu(Widget parentWid)
+{
+    Widget menuWid;
+    int i;
+
+    Cardinal num_md = XtNumber(ed_menu_options);
+    /*
+      Create the menu parent widget
+    */
+    menuWid = XtCreatePopupShell("edMenu", simpleMenuWidgetClass, parentWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {
+        if (ed_menu_options[i].number == OptLine) {
+            (void) XtCreateManagedWidget("line",  smeLineObjectClass, menuWid, NULL, (Cardinal)0);
+	} else {
+            Widget entryWid = XtCreateManagedWidget(ed_menu_options[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	    XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) ed_menu_options[i].number);
+	}
+    }
+
+    return menuWid;
+
+}
+
diff --git a/src/staden/edMenu.h b/src/staden/edMenu.h
new file mode 100644
index 0000000..4aceab1
--- /dev/null
+++ b/src/staden/edMenu.h
@@ -0,0 +1,10 @@
+#ifndef _edMenu_h
+#define _edMenu_h
+
+#include <X11/Intrinsic.h>
+
+/* ---- Exported Functions ---- */
+
+extern Widget createEdMenu(Widget parentWid);
+
+#endif /* _edMenu_h */
diff --git a/src/staden/edUtils.c b/src/staden/edUtils.c
new file mode 100644
index 0000000..61690c1
--- /dev/null
+++ b/src/staden/edUtils.c
@@ -0,0 +1,2285 @@
+/*
+    Title: 	 edUtils
+
+    File: 	 edUtils.c
+    Purpose:	 C language utilities for contig editor
+    Last update: 5 April 1993
+*/
+
+
+/*
+  9/7/91  SD Added Trace Title field to call manageTrace
+  26/7/91 SD Tightened up editing options:
+             1. Allow only insertion and deletion of asterisks from consensus
+	        This means no replacement. This CANNOT be overridden
+	     2. Do not allow edits when cursor is not visible on the screen
+  28/7/91 SD Use FILE_NAME_LENGTH for trace file name length in showTrace
+  1/10/91 SD Removed static from calculateConsensusLength
+             Removed LLINOL and LLINOR from call to dojoin_
+
+  28/4/92 SD General Speedup
+  20/8/92 SD save state variable for returning what happened during edit/join
+  01/03/93 SD New dump contig command
+  05/04/93 JKB Check for editorState (disable edits after 'Leave Editor').
+
+*/
+
+
+#define CHKPNT(W,S) \
+    { XSync(XtDisplay(W),False); fprintf(stderr,"CHKPNT: %s\n",S); }
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>
+#include <ctype.h>
+#include <stdlib.h>
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/AsciiSrc.h>
+#include <X11/Xlib.h>   /* IMPORT: XBell */
+#include "Sheet.h"
+
+#include "edUtils.h"
+#include "contigEditor.h"
+#include "tman_main.h"
+#include "main.h"
+#include "select.h"
+#include "tagUtils.h"
+#include "undo.h"
+#include "extend.h"
+#include "fortran.h"
+#include "fort.h"
+#include "misc.h"
+
+
+/* ------ External variables ------- */
+extern int_f save_state;
+extern enum States {StateDown=0,StateUp} editorState;
+
+/*
+** Static variables
+** defining the state of the contig editor
+*/
+#define MAXEDSTATES 2
+static int EdStructCount = 0;
+static EdStruct edstate[MAXEDSTATES] = {
+    {
+	/* oldMax             */ 0,
+	/* DB_flags           */ DB_ACCESS_UPDATE |
+				 DB_DELAYED_READ |
+				 DB_DATA_TYPE_DNA,
+	/* DB_gelCount        */ 0,
+	/* DB_contigNum       */ 0,
+	/* DBlist             */ NULL,
+	/* DBorder            */ NULL,
+	/* DB                 */ NULL,
+	/* displayPos         */ 1,
+	/* displayWidth       */ DEFAULT_DISPLAY_WIDTH,
+	/* displayHeight      */ 0,
+	/* cursorPos          */ 1,
+	/* cursorSeq          */ 0,
+	/* rulerDisplayed     */ 1,
+	/* consensusDisplayed */ 1,
+	/* fontWidth          */ 0,
+	/* fontHeight         */ 0,
+        /* edWid              */ NULL,
+	/* namesWid           */ NULL,
+	/* sequencesWid       */ NULL,
+	/* scrollButtonsWid   */ NULL,
+	/* sliderWid          */ NULL,
+	/* displayedConsensus */ "",
+	/* select_made        */ 0,
+	/* select_seq         */ 0,
+	/* select_start_pos   */ 0,
+	/* select_end_pos     */ 0,
+	/* select_tag         */ NULL,
+	/* reveal_cutoffs     */ 0,
+	/* showDifferences    */ 0
+    },
+    {
+	/* oldMax             */ 0,
+	/* DB_flags           */ DB_ACCESS_UPDATE |
+				 DB_DELAYED_READ |
+				 DB_DATA_TYPE_DNA,
+	/* DB_gelCount        */ 0,
+	/* DB_contigNum       */ 0,
+	/* DBlist             */ NULL,
+	/* DBorder            */ NULL,
+	/* DB                 */ NULL,
+	/* displayPos         */ 1,
+	/* displayWidth       */ DEFAULT_DISPLAY_WIDTH,
+	/* displayHeight      */ 0,
+	/* cursorPos          */ 1,
+	/* cursorSeq          */ 0,
+	/* rulerDisplayed     */ 1,
+	/* consensusDisplayed */ 1,
+	/* fontWidth          */ 0,
+	/* fontHeight         */ 0,
+        /* edWid              */ NULL,
+	/* namesWid           */ NULL,
+	/* sequencesWid       */ NULL,
+	/* scrollButtonsWid   */ NULL,
+	/* sliderWid          */ NULL,
+	/* displayedConsensus */ "",
+	/* select_made        */ 0,
+	/* select_seq         */ 0,
+	/* select_start_pos   */ 0,
+	/* select_end_pos     */ 0,
+	/* select_tag         */ NULL,
+	/* reveal_cutoffs     */ 0,
+	/* showDifferences    */ 0
+    }
+};
+
+static void bell()
+/*
+** Make a silly beep
+*/
+{
+    XBell (GetDisplay(),100);
+}
+
+
+
+void calculateConsensusLength(EdStruct *xx)
+/*
+** Calculate dynamic consensus length
+*/
+{
+    int sequenceEnd,i;
+
+    DBsetLength(xx,0,0);
+    for (i=1; i <= xx->DB_gelCount; i++) {
+        sequenceEnd = DBgetRelPos(xx,i) + DBgetLength(xx,i) -1;
+	if (DBgetLength(xx,0) < sequenceEnd) DBsetLength(xx,0,sequenceEnd);
+    }
+}
+
+EdStructPtr getFreeEdStruct()
+/*
+** Get the next free EdStruct
+*/
+{
+    if (EdStructCount == MAXEDSTATES)
+	return NULL;
+    else
+	return &edstate[EdStructCount++];
+}
+
+EdStructPtr intToEdStruct(int i)
+/*
+** Given an integer,
+** return the corresponding edit structure
+*/
+{
+    return &edstate[i];
+}
+
+EdStructPtr widgetToEdStruct(Widget w)
+/*
+** Given a widget,
+** return the corresponding edit structure
+*/
+{
+    int i;
+    if (w==NULL)
+	return NULL;
+    else
+	for (i=0;i<EdStructCount;i++)
+	    if (w==edstate[i].edWid)
+		return &edstate[i];
+    return NULL;
+}
+
+/* ---- Private Functions ---- */
+
+/************************************************/
+
+
+
+
+char *DBgetSeq(EdStruct *xx, int seq)
+/*
+** Force reading in the sequence for seq
+*/
+{
+    int i;
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_SEQ_IN_MEMORY)
+	return DB_Seq(xx,seq);
+
+    /* allocate memory */
+    if ((DBsetSeq(xx,seq,(char *) malloc((int)*saveState.maxgel)))==NULL)
+	return NULL;
+
+    /* force reading */
+    i = DBgetNumber(xx,seq);
+    readw_(saveState.idevw,&i,DB_Seq(xx,seq),saveState.maxgel,*saveState.maxgel);
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_IN_MEMORY);
+
+    return DB_Seq(xx,seq);
+}
+
+
+
+
+
+tagStruct *DBgetTags (EdStruct *xx, int seq)
+/*
+** Force tags into memory
+*/
+{
+    int i;
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_TAG_IN_MEMORY)
+	return (tagStruct *) DB_Tags(xx,seq);
+
+    /* read in tag list */
+    i = DBgetNumber(xx,seq);
+    DBsetTags(xx,seq,readTagList(i));
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_IN_MEMORY);
+
+    return (tagStruct *) DB_Tags(xx,seq);
+}
+
+
+
+char *DBgetName(EdStruct *xx, int seq)
+/*
+** Force reading in the sequence for seq
+*/
+{
+    int i;
+    char buf[NAMELEN+1];
+
+    /* already in memory? */
+    if (!seq || DBgetFlags(xx,seq) & DB_FLAG_NAME_IN_MEMORY)
+	return DB_Name(xx,seq);
+
+    /* allocate memory */
+    if ((DBsetName(xx,seq,(char *) malloc(sizeof(char)*(NAMELEN+1))))==NULL)
+	return NULL;
+
+    /* force reading */
+    i = DBgetNumber(xx,seq);
+    readn_(saveState.idevn,&i,buf,DB_NAMELEN); buf[10]='\0';
+    sprintf(DB_Name(xx,seq),"%*d %-*s",
+	    DB_GELNOLEN,(DBgetComp(xx,seq)==COMPLEMENTED)?-DBgetNumber(xx,seq):DBgetNumber(xx,seq),
+	    DB_NAMELEN, buf);
+
+    /* mark as read */
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_NAME_IN_MEMORY);
+
+    return DB_Name(xx,seq);
+}
+
+
+
+
+int initialiseDB(
+	EdStruct *xx,
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *llino	/* left-most gel in contig */
+	)
+/*
+** Create an internal database and
+** read all relevant data into it
+*/
+{
+    int i,c;
+
+    /*
+    ** Reset things
+    */
+    xx->showDifferences = 0;
+
+    /*
+    ** determine gel number
+    */
+    {
+	int dummy,numContigs;
+	int firstGel;
+
+	readr_(idevr,idbsiz,&dummy,&numContigs,&dummy,&dummy);
+	xx->DB_contigNum = (int)*idbsiz-numContigs;
+	readr_(idevr,&xx->DB_contigNum,&dummy,&dummy,&firstGel,&dummy);
+	while (xx->DB_contigNum < (int)*idbsiz && firstGel!=(int)*llino) {
+	    xx->DB_contigNum++;
+	    readr_(idevr,&xx->DB_contigNum,&dummy,&dummy,&firstGel,&dummy);
+	}
+
+    }
+
+    /*
+    ** count number of gels in contig
+    */
+    for (xx->DB_gelCount=1,i=(int) *llino;
+         xx->DB_gelCount<(int)*idbsiz && (int)rnbr[i-1];
+         xx->DB_gelCount++,i=(int)rnbr[i-1]);
+
+    if ((xx->DB = (DBStruct *) calloc (xx->DB_gelCount+1, sizeof(DBStruct)))==NULL)
+	goto disaster;
+    if ((xx->DBlist = (int *)  calloc (xx->DB_gelCount+1, sizeof(int) ))==NULL)
+	goto disaster;
+    if ((xx->DBorder = (int *) calloc (xx->DB_gelCount+1, sizeof(int) ))==NULL)
+	goto disaster;
+
+    /*
+    ** read information into local database
+    */
+    for (c=1,i=(int) *llino;
+         c<(int)*idbsiz && i;
+         c++,i=(int)rnbr[i-1]) {
+
+        DBsetRelPos(xx,c,relpg[i-1]);
+	DBsetLength(xx,c,abs(lngthg[i-1]));
+	DBsetComp(xx,c,(lngthg[i-1]<0)?-1:1);
+	DBsetNumber(xx,c,i);
+        DBsetFlags(xx,c,DB_FLAG_NONE);
+
+	xx->DBorder[c] = c;
+
+	if (xx->DB_flags & DB_STORAGE_INTERNAL) {
+	    if (DBgetSeq(xx,c)==NULL) goto disaster;
+	    if (DBgetName(xx,c)==NULL) goto disaster;
+	    (void)DBgetTags(xx,c);
+	}
+
+    }
+    /*
+    ** Set up consensus
+    */
+    {
+        DBsetRelPos(xx,0,1);
+	DBsetComp(xx,0,UNCOMPLEMENTED);
+	calculateConsensusLength(xx);
+        if ((DBsetSeq(xx,0,(char *) malloc(MAX_DISPLAY_WIDTH)))==NULL)
+	    goto disaster;
+	if ((DBsetName(xx,0,(char *) malloc(sizeof(char)*(NAMELEN+1))))==NULL)
+	    goto disaster;
+	sprintf(DB_Name(xx,0),"%*s %-*s",
+                  DB_GELNOLEN," ",
+                  DB_NAMELEN, "CONSENSUS");
+        xx->DBorder[0] = 0;
+    }
+
+    return 0;
+
+disaster:
+
+    freeDB(xx);
+    return 1;
+}
+
+void freeDB(EdStruct *xx)
+/*
+** Free an internal database
+*/
+{
+    int i;
+
+    if (xx->DB != NULL)
+        for (i=0; i <= xx->DB_gelCount; i++) {
+	    free(DB_Name(xx,i));
+            free(DB_Seq(xx,i));
+	    destroyTagList(DB_Tags(xx,i));
+        }
+
+    free(xx->DB);
+    free(xx->DBlist);
+    free(xx->DBorder);
+
+    xx->DB = NULL;
+    xx->DBlist = NULL;
+    xx->DBorder = NULL;
+}
+
+
+void saveDB(
+	EdStruct *xx,
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel	/* maximum length of gel */
+    )
+/*
+** Save an internal database
+*/
+{
+    int i;
+    int N,leftN,rightN;
+    int flag;
+
+    for (i=1; i<=xx->DB_gelCount; i++) {
+	/*
+	** update relationships
+	*/
+	N = DBgetNumber(xx,xx->DBorder[i]);
+	relpg[N-1]  = DBgetRelPos(xx,xx->DBorder[i]);
+	lngthg[N-1] = (DBgetComp(xx,xx->DBorder[i])==COMPLEMENTED)?-DBgetLength(xx,xx->DBorder[i]):DBgetLength(xx,xx->DBorder[i]);
+	if (i==1)
+	    leftN = 0;
+	else
+	    leftN = DBgetNumber(xx,xx->DBorder[i-1]);
+	if (i==xx->DB_gelCount)
+	    rightN = 0;
+	else
+	    rightN = DBgetNumber(xx,xx->DBorder[i+1]);
+	lnbr[N-1] = leftN;
+	rnbr[N-1] = rightN;
+	writer_(idevr,&N,&relpg[N-1],&lngthg[N-1],&lnbr[N-1],&rnbr[N-1]);
+
+	flag = DBgetFlags(xx,xx->DBorder[i]);
+	/*
+	** update working versions
+	*/
+	if (flag&(DB_FLAG_SEQ_IN_MEMORY|DB_FLAG_SEQ_MODIFIED))
+	    writew_(idevw,&N,DB_Seq(xx,xx->DBorder[i]),maxgel,*maxgel);
+	/*
+	** update tag list
+	*/
+	if (flag&(DB_FLAG_TAG_IN_MEMORY|DB_FLAG_TAG_MODIFIED)) {
+            writeTagList(xx, xx->DBorder[i]);
+            destroyTagList(DB_Tags(xx,xx->DBorder[i]));
+            DBsetTags(xx,xx->DBorder[i],readTagList(DBgetNumber(xx,xx->DBorder[i])));
+        }
+
+        /*
+        ** Preserve only ``sequence in memory flag''
+        */
+	DBsetFlags(xx,xx->DBorder[i],
+		   flag&(DB_FLAG_SEQ_IN_MEMORY ||
+			 DB_FLAG_NAME_IN_MEMORY ||
+			 DB_FLAG_TAG_IN_MEMORY));
+
+    }
+
+    /*
+    ** update contig relationships
+    */
+    calculateConsensusLength(xx);
+    relpg[xx->DB_contigNum-1]  = DBgetLength(xx,0);
+    lngthg[xx->DB_contigNum-1] = 0;
+    lnbr[xx->DB_contigNum-1] = DBgetNumber(xx,xx->DBorder[1]);
+    rnbr[xx->DB_contigNum-1] = DBgetNumber(xx,xx->DBorder[xx->DB_gelCount]);
+    writer_(idevr,
+	    &xx->DB_contigNum,
+	    &relpg[xx->DB_contigNum-1],
+	    &lngthg[xx->DB_contigNum-1],
+	    &lnbr[xx->DB_contigNum-1],
+	    &rnbr[xx->DB_contigNum-1]
+	    );
+    flushl_(idevr);
+    flushl_(idevw);
+    flushl_(idevn); /* not needed really */
+    flushl_(&devils_.idevt);
+    flushl_(&devils_.idevc);
+    
+    cleanUpStack(&bucketStack);
+
+    /* update save state */
+    if(xx == intToEdStruct(0))
+	save_state |= (int_f)1;
+    else
+	save_state |= (int_f)2;
+
+
+}
+
+void joinDB(
+	int_f *idevr,	/* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *llinol,
+	int_f *lnconl,
+	int_f *llinor,
+	int_f *lnconr,
+	int_f *ngels,
+	int_f *nconts,
+	int_f *idbsiz
+    )
+{
+    int_f relx;
+    int i;
+
+    /*
+    ** Save an internal databases
+    */
+    for (i=0;i<2;i++)
+        saveDB(
+	    intToEdStruct(i),
+	    idevr,
+	    idevw,
+	    idevn,
+	    relpg,
+	    lngthg,
+	    lnbr,
+	    rnbr,
+	    maxgel
+        );
+    relx = editorLockedPos(1/*force*/);
+    if (relx<0) {
+	relx = 1-relx;
+        dojoin_(relpg,lngthg,lnbr,rnbr,ngels,nconts,lnconl,lnconr,
+                idbsiz,idevr,idevw,&relx);
+    } else {
+	relx = 1+relx;
+        dojoin_(relpg,lngthg,lnbr,rnbr,ngels,nconts,lnconr,lnconl,
+                idbsiz,idevr,idevw,&relx);
+    }
+
+    save_state |= (int_f)4;
+
+}
+
+
+static int linesInRegion(EdStruct *xx, int pos, int width)
+/*
+** Return number of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ;
+	 i<=xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i])<(pos+width) ;
+	 i++) {
+        if (DBgetRelPos(xx,xx->DBorder[i])+DBgetLength(xx,xx->DBorder[i])>pos &&
+	    DBgetLength(xx,xx->DBorder[i])) count++;
+    }
+    count += xx->consensusDisplayed;
+
+    return count;
+
+}
+
+
+static int linesOnScreen (EdStruct *xx, int pos, int width)
+/*
+** Return number of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount ; i++) {
+	int relPos, length;
+	int len_lcut, len_rcut;
+
+	if (xx->reveal_cutoffs) {
+	    len_lcut = lenLCut(xx,xx->DBorder[i]);
+	    len_rcut = lenRCut(xx,xx->DBorder[i]);
+	} else
+	    len_lcut = len_rcut = 0;
+	relPos = DBgetRelPos(xx,xx->DBorder[i]) - len_lcut;
+	length = DBgetLength(xx,xx->DBorder[i]) + len_lcut + len_rcut;
+        if (relPos < pos+width && relPos+length>pos)
+	    count++;
+    }
+    count += xx->consensusDisplayed;
+
+    return count;
+}
+
+
+
+#ifdef notdef
+static int maxLinesOnScreen_x (EdStruct *xx, int width)
+/*
+** Given width, return maximum number of lines on screen
+*/
+{
+    int i,max,linesAtX,X;
+
+    max = linesOnScreen(xx,1,width);
+    for (i=1 ; i <= xx->DB_gelCount ; i++) {
+        X=DBgetRelPos(xx,i)-width+1;
+	if (X<=1) continue;
+	linesAtX=linesOnScreen(xx,X,width);
+	if (linesAtX>max) max = linesAtX;
+    }
+    max += xx->consensusDisplayed;
+
+    return max;
+}
+#endif
+
+int *sequencesInRegion(EdStruct *xx,int pos, int width)
+/*
+** Return a pointer to list of sequences in region of contig
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i])<(pos+width) ; i++) {
+        if (DBgetRelPos(xx,xx->DBorder[i])+DBgetLength(xx,xx->DBorder[i])>pos && DBgetLength(xx,xx->DBorder[i]))
+	    xx->DBlist[count++]=xx->DBorder[i];
+    }
+    if (xx->consensusDisplayed) xx->DBlist[count++] = 0;
+
+    return xx->DBlist;
+
+}
+
+
+int *sequencesOnScreen(EdStruct *xx,int pos, int width)
+/*
+** Return a pointer to list of sequences on screen
+*/
+{
+    int i, count;
+
+    for (i=1,count=0 ; i<=xx->DB_gelCount ; i++) {
+	int relPos, length;
+	int len_lcut, len_rcut;
+
+	if (xx->reveal_cutoffs) {
+	    len_lcut = lenLCut(xx,xx->DBorder[i]);
+	    len_rcut = lenRCut(xx,xx->DBorder[i]);
+	} else
+	    len_lcut = len_rcut = 0;
+	relPos = DBgetRelPos(xx,xx->DBorder[i]) - len_lcut;
+	length = DBgetLength(xx,xx->DBorder[i]) + len_lcut + len_rcut;
+        if (relPos < pos+width && relPos+length>pos)
+	    xx->DBlist[count++]=xx->DBorder[i];
+    }
+    if (xx->consensusDisplayed) xx->DBlist[count++] = 0;
+
+    return xx->DBlist;
+
+}
+
+void DBgetSequence(EdStruct *xx, int seq, int pos, int width, char *str)
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+{
+    char *src;
+    int length = DBgetLength(xx,seq);
+    int i;
+
+    src = DBgetSeq(xx,seq);
+
+    /* Lefthand cut off */
+    if (pos<0) {
+	i = (width<-pos)?width:-pos;
+	getLCut(xx,seq, -pos, i, str);
+    } else
+	i=0;
+    
+    /*copy sequence*/
+    for (;
+	 i<width && (pos+i)<length;
+	 i++) {
+	str[i]=src[pos+i]; 
+    }
+    
+    /* Righthand cut off */
+    if (i<width) {
+	getRCut(xx,seq, pos+i-length, width-i, &str[i]);
+    }
+    
+    str[width]='\0';
+
+}
+
+
+
+void DBcalcConsensus (EdStruct *xx,int pos, int width, char *str, int mode)
+/*
+** calculate the consensus for position `pos' in contig,
+** for `width' characters
+*/
+{
+    /*
+    ** When calculating the consensus,
+    **   handle DNA differently from protein
+    ** switch (xx->DB_flags & DB_DATA_TYPE) {
+    **     case DB_DATA_TYPE_DNA:     do dna stuff
+    **     case DB_DATA_TYPE_PROTEIN: do protein stuff
+    **     default : do dna stuff???
+    ** }
+    ** BASES = 6 for dna, 26 for protein (specified by fortran variable IDM?)
+    */
+    int_f scores[MAX_DISPLAY_WIDTH][BASES];
+    int_f index,score;
+    int_f i,j,k;
+    int_f seven=BASES;
+    int jlow,klow,jhigh /* ,khigh */;
+    int chunk; /* do in chucks of MAX_DISPLAY_WIDTH */
+    int cwidth;
+
+    for (chunk=0; chunk<width; chunk+=MAX_DISPLAY_WIDTH) {
+	cwidth = width - chunk;
+	if (cwidth>MAX_DISPLAY_WIDTH) cwidth=MAX_DISPLAY_WIDTH;
+	/*now do for pos+chunk to pos+chunk+cwidth*/
+
+	for (i=0;i<MAX_DISPLAY_WIDTH;i++)
+	    for (j=0;j<BASES;j++) scores[i][j] = 0;
+
+	for (i=1 ;
+	    i<=xx->DB_gelCount &&
+	    DBgetRelPos(xx,xx->DBorder[i])<(pos+chunk+cwidth) ;
+	    i++)
+	{
+	    if (DBgetRelPos(xx,xx->DBorder[i]) +
+		DBgetLength(xx,xx->DBorder[i]) > (pos+chunk) &&
+		DBgetLength(xx,xx->DBorder[i]) &&
+		(mode == BOTH_STRANDS ||
+		 mode == DBgetComp(xx,xx->DBorder[i])))
+	    {
+		if (DBgetRelPos(xx,xx->DBorder[i]) < (pos+chunk))
+		{
+		    klow = (pos+chunk) - DBgetRelPos(xx,xx->DBorder[i]);
+		    jlow = 0;
+		}
+		else
+		{
+		    klow = 0;
+		    jlow = DBgetRelPos(xx,xx->DBorder[i]) - (pos+chunk);
+		}
+
+		if (DBgetRelPos(xx,xx->DBorder[i]) +
+		    DBgetLength(xx,xx->DBorder[i]) > pos+chunk+cwidth)
+		{
+		    /*khigh = klow + cwidth;*/
+		    jhigh = cwidth;
+		}
+		else
+		{
+		    /*khigh = DBgetLength(xx,xx->DBorder[i]);*/
+		    jhigh = jlow + DBgetLength(xx,xx->DBorder[i]) - klow;
+		}
+
+		{
+		    char s[MAX_DISPLAY_WIDTH+1];
+		    DBgetSequence(xx,xx->DBorder[i],klow,jhigh-jlow,s);
+		    for (j=jlow,k=0; j<jhigh; j++,k++)
+		    {
+			if (*saveState.idm == 26) {
+			    pcon1_(&s[k],&scores[j][0]);
+			} else {
+			    index = indexs_(&s[k],&score);
+			    scores[j][index-1] += score;
+			    /* No longer needed
+			    scores[j][BASES-1] += score;
+			    */
+			}
+		    }
+		}
+
+	    }
+	}
+
+	for (i=0;i<cwidth;i++){
+	    if (*saveState.idm == 26) 
+		munotp_(&str[chunk+i],(int_fl)1,&scores[i][0]);
+	    else
+		gtconc_(&str[chunk+i],(int_fl)1,&scores[i][0],&seven,&pcCut);
+	}
+    }
+    str[width]='\0';
+    
+}
+
+
+
+int positionInContig(EdStruct *xx, int seq, int pos)
+/*
+** returns relative position in a sequence as an 
+** absolute position in the contig
+*/
+{
+    return DBgetRelPos(xx,seq) + pos - 1;
+}
+
+static int onScreen (EdStruct *xx, int seq, int pos)
+/*
+** returns true if base in `seq' at position `pos' is currently
+** being displayed on screen 
+*/
+{
+    int posInContig;
+
+    posInContig = positionInContig(xx,seq,pos);
+    return (posInContig >= xx->displayPos &&
+            posInContig < xx->displayPos + xx->displayWidth );
+}
+
+
+#ifdef notdef
+static void getJoinedData(EdStruct *xx, int *leftPos, int* rightPos)
+/*
+** get information about relative positions of two joined contigs
+*/
+{
+    int offset = editorLockedPos(0/*don't force recalculation*/);
+    EdStruct *otherxx;
+
+    otherxx = intToEdStruct(0);
+    if (otherxx==xx) {
+	otherxx = intToEdStruct(1);
+        *leftPos = min(1,1-offset);
+	*rightPos = max(DBgetLength(xx,0),DBgetLength(otherxx,0)-offset);
+    } else {
+        *leftPos = min(1,1+offset);
+	*rightPos = max(DBgetLength(xx,0),DBgetLength(otherxx,0)+offset);
+    }
+}
+#endif /*notdef*/
+
+#ifdef notdef
+static int getJoinedLength(EdStruct *xx)
+/*
+** Get the combined length of the two joined contigs
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return rightPos - leftPos;
+}
+
+static int getJoinedLeftPos(EdStruct *xx)
+/*
+** Get the position (relative to xx) of the leftmost base in
+** the joined contig
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return leftPos;
+}
+
+static int getJoinedRightPos(EdStruct *xx)
+/*
+** Get the position (relative to xx) of the rightmost base in
+** the joined contig
+*/
+{
+    int leftPos, rightPos;
+    getJoinedData(xx,&leftPos,&rightPos);
+    return rightPos;
+}
+#endif
+
+static void extents(EdStruct *xx, int *left, int *right)
+/*
+** Get maximum extents of sequence, taking into account cutoffs.
+*/
+{
+
+    if (xx->reveal_cutoffs) {
+	int eleft, eright;
+	int i;
+
+	eleft = eright = 0;
+	
+	for (i=1; i<=xx->DB_gelCount ; i++) {
+	    int thisleft,thisright;
+
+	    thisleft = DBgetRelPos(xx,xx->DBorder[i]) - lenLCut(xx,xx->DBorder[i]);
+	    thisright =  DBgetRelPos(xx,xx->DBorder[i]) + DBgetLength(xx,xx->DBorder[i]) + lenRCut(xx,xx->DBorder[i]) - 1;
+
+	    if (eleft > thisleft)
+		eleft = thisleft;
+	    if (eright < thisright)
+		eright = thisright;
+	}
+
+	*left = eleft;
+	*right = eright;
+
+    } else {
+	*left = 1;
+	*right = DBgetLength(xx,0);
+    }
+
+}
+
+
+
+
+static void joinedExtents(EdStruct *xx, int *leftPos, int* rightPos)
+/*
+** get information about relative positions of two joined contigs
+*/
+{
+    int offset = editorLockedPos(0/*don't force recalculation*/);
+    EdStruct *otherxx;
+    int left,right;
+    int otherleft,otherright;
+
+    otherxx = intToEdStruct(0);
+    extents(xx,&left,&right);
+    if (otherxx==xx) {
+	otherxx = intToEdStruct(1);
+	extents(otherxx,&otherleft,&otherright);
+        *leftPos = min(left,otherleft-offset);
+	*rightPos = max(right,otherright-offset);
+    } else {
+	extents(otherxx,&otherleft,&otherright);
+        *leftPos = min(left,otherleft+offset);
+	*rightPos = max(right,otherright+offset);
+    }
+}
+
+
+
+
+
+
+static void getExtents(EdStruct *xx, int *left, int *right)
+{
+    if (inJoinMode() && editorLocked())
+	joinedExtents(xx,left,right);
+    else {
+	extents(xx,left,right);
+	if (inJoinMode()  && !editorLocked()) {
+	    *right += xx->displayWidth - 2;
+	    *left  -= xx->displayWidth - 1;
+	}
+    }
+
+
+}
+
+
+
+/***************************************************/
+/**/
+/**/
+
+
+
+
+
+static void setSliderPos (EdStruct *xx, int pos)
+/*
+** set slider as a position in contig
+*/
+{
+    float percentSet;
+    float percentShown;
+    int left, right;
+    getExtents(xx,&left,&right);
+    percentSet = (float)(pos-left)/(float)(right-left+1+1-xx->displayWidth);
+    percentShown = (float)xx->displayWidth/(float)(right-left+1+1);
+
+    XawScrollbarSetThumb(xx->sliderWid,percentSet,percentShown);
+}
+
+
+
+
+
+
+static void incDisplayPosP (EdStruct *xx, int distance)
+/*
+** Increase the leftmost base position on the screen by a symbolic amount
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    switch (distance) {
+        case D_screen     : xx->displayPos += xx->displayWidth; break;
+        case D_halfScreen : xx->displayPos += xx->displayWidth/2; break;
+        case D_character  : xx->displayPos += 1; break;
+    }
+    getExtents(xx,&left,&right);
+    if (xx->displayPos > right + 2 - xx->displayWidth)
+	xx->displayPos = right + 2 - xx->displayWidth;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+static void decDisplayPosP (EdStruct *xx, int distance)
+/*
+** Decrease the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    switch (distance) {
+        case D_screen     : xx->displayPos -= xx->displayWidth; break;
+        case D_halfScreen : xx->displayPos -= xx->displayWidth/2; break;
+        case D_character  : xx->displayPos -= 1; break;
+    }
+    getExtents(xx,&left,&right);
+    if (xx->displayPos < left)
+	xx->displayPos = left;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+
+static void setDisplayPosPercentP (EdStruct *xx, float percent)
+/*
+** Set the leftmost base position on the screen given a percentage
+*/
+{
+    int oldPos = xx->displayPos;
+    int left,right;
+    getExtents(xx,&left,&right);
+    xx->displayPos  = (int)((float) (right-left+1+1-xx->displayWidth) * percent) + left;
+    if (oldPos!=xx->displayPos)
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+}
+
+
+
+
+
+
+
+
+static void positionCursor(EdStruct *xx, Widget w, int seq, int pos)
+/*
+** position the cursor int sequence seq at position pos
+** and indicate cursor if on screen
+*/
+{
+    if (onScreen(xx,seq,pos)) {
+        int screenRow,screenColumn;
+        int *seqList;
+
+	screenColumn = positionInContig(xx,seq,pos) - xx->displayPos;
+        seqList = sequencesOnScreen(xx,xx->displayPos, xx->displayWidth);
+	for(screenRow=0;
+	    screenRow<xx->displayHeight && seqList[screenRow] != seq;
+	    screenRow++);
+	XawSheetDisplayCursor(w,True);
+	XawSheetPositionCursor(w,screenColumn,screenRow+xx->rulerDisplayed);
+    } else
+	XawSheetDisplayCursor(w,False);
+}
+
+static void redisplayDisagreement()
+/*
+** Recalculate and redisplay all disagreements
+*/
+{
+    char spare[MAX_DISPLAY_WIDTH];
+    EdStruct *xx[2];
+    int i;
+
+    if (inJoinMode()) {
+        xx[0] = intToEdStruct(0);
+        xx[1] = intToEdStruct(1);
+        for (i=0;i<xx[0]->displayWidth;i++)
+	    spare[i]=(xx[0]->displayedConsensus[i] ==
+	              xx[1]->displayedConsensus[i])?' ':'!';
+	XawSheetPutText(disagreeWid,0,0,xx[0]->displayWidth,spare);
+    }
+}
+
+
+void redisplaySequences (EdStruct *xx, Widget namesWid, Widget sequencesWid,
+                      int pos, int width)
+/*
+** Redisplay the whole sequence display
+*/
+{
+    Arg args[10];
+    int nargs;
+    int *seqList;
+
+    setSliderPos(xx,pos);
+
+    /*
+    ** Set Up Text Window sizes
+    */
+    xx->displayHeight = linesOnScreen(xx,pos,width);
+    seqList = sequencesOnScreen(xx,pos, width);
+
+    /*
+    ** Update names list
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, xx->displayHeight+xx->rulerDisplayed); nargs++;
+    XtSetValues(namesWid, args, nargs);
+    {
+	int i;
+	XawSheetInk splodge[NAMELEN];
+	for (i=0 ; i < NAMELEN ; i++) splodge[i].sh = sh_inverse;
+	for (i=0 ; i < xx->displayHeight ; i++ ) {
+	    if (DBgetFlags(xx,seqList[i]) & DB_FLAG_SELECTED)
+		XawSheetPutJazzyText(namesWid,0,(i+xx->rulerDisplayed),NAMELEN,DBgetName(xx,seqList[i]),splodge);
+	    else
+		XawSheetPutText(namesWid,0,(i+xx->rulerDisplayed),NAMELEN,DBgetName(xx,seqList[i]));
+	}
+    }
+
+    /*
+    ** Update sequence list
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNrows, xx->displayHeight+xx->rulerDisplayed); nargs++;
+    XtSetValues(sequencesWid, args, nargs);
+    {
+	int i;
+	char spare[MAX_DISPLAY_WIDTH];
+	if (xx->rulerDisplayed) {
+	    char *k;
+	    int j,lower,times;
+	    lower = (pos - pos%10);
+	    times = width/10 + 2;
+	    for (j=0,k=spare;j<times;j++,k+=10,lower+=10) sprintf(k,"%10d",lower);
+	    XawSheetPutText(sequencesWid,0,0,width,&spare[9+pos%10]);
+	}
+	
+	DBcalcConsensus(xx,pos,width,xx->displayedConsensus,BOTH_STRANDS);
+	for (i=0 ; i < xx->displayHeight ; i++ ) {
+	    char * ptr;
+	    XawSheetInk splodge[MAX_DISPLAY_WIDTH];
+	    
+	    if (seqList[i]==0){
+		ptr      = xx->displayedConsensus;
+	    }else{
+		DBgetSequence(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,spare);
+		ptr      = spare;
+	    }
+	    if (xx->showDifferences) {
+		int j;
+		for (j=0;j<width;j++) if (spare[j]==xx->displayedConsensus[j])
+		    spare[j]='.';
+	    }
+	    getTagSplodge(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,splodge);
+	    XawSheetPutJazzyText(sequencesWid,0,(i+xx->rulerDisplayed),width,ptr,splodge);
+	}
+	
+    }
+
+    positionCursor(xx,sequencesWid,xx->cursorSeq,xx->cursorPos);
+    redisplaySelection(xx);
+
+}
+
+
+
+
+static void dumpSequence(EdStruct *xx, int seq, int pos, int width, char *str)
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+{
+    char *src;
+    int length = DBgetLength(xx,seq);
+    int i,j;
+
+    src = DBgetSeq(xx,seq);
+
+    /* Lefthand cut off */
+    if (pos<0) {
+	i = (width<-pos)?width:-pos;
+	getLCut(xx,seq, -pos, i, str);
+	for(j=0;j<i;j++) if (isupper(str[j])) str[j] = tolower(str[j]);
+    } else
+	i=0;
+    
+    /*copy sequence*/
+    for (;
+	 i<width && (pos+i)<length;
+	 i++) {
+	str[i]=src[pos+i]; 
+    }
+    
+    /* Righthand cut off */
+    if (i<width) {
+	getRCut(xx,seq, pos+i-length, width-i, &str[i]);
+	for(j=i;j<width;j++) if (isupper(str[j])) str[j] = tolower(str[j]);
+    }
+    
+    str[width]='\0';
+
+}
+
+void dumpLine(EdStruct *xx, FILE *fp, int pos, int width)
+/*
+** Print out a section
+*/
+{
+    int *seqList;
+    int i;
+    char spare[MAX_DISPLAY_WIDTH];
+    char consensus[MAX_DISPLAY_WIDTH];
+    int displayHeight;
+
+    displayHeight = linesOnScreen(xx,pos,width);
+    seqList = sequencesOnScreen(xx,pos, width);
+
+
+    if (xx->rulerDisplayed) {
+	char *k;
+	int j,lower,times;
+	lower = (pos - pos%10);
+	times = width/10 + 2;
+	for (j=0,k=spare;j<times;j++,k+=10,lower+=10) sprintf(k,"%10d",lower);
+	fprintf(fp,"%*.*s   %*.*s\n",
+		NAMELEN,NAMELEN," ",
+		width,width,&spare[9+pos%10]);
+    }
+    DBcalcConsensus(xx,pos,width,consensus,BOTH_STRANDS);
+    
+    for (i=0 ; i < displayHeight ; i++ ) {
+	char * ptr;
+	
+	if (DBgetFlags(xx,seqList[i]) & DB_FLAG_SELECTED)
+	    fprintf(fp,"%*.*s * ",NAMELEN,NAMELEN, DBgetName(xx,seqList[i]));
+	else
+	    fprintf(fp,"%*.*s   ",NAMELEN,NAMELEN, DBgetName(xx,seqList[i]));
+	
+	if (seqList[i]==0){
+	    ptr = consensus;
+	}else{
+	    dumpSequence(xx,seqList[i],pos-DBgetRelPos(xx,seqList[i]),width,spare);
+	    ptr = spare;
+	}
+	if (xx->showDifferences) {
+	    int j;
+	    for (j=0;j<width;j++) if (spare[j]==consensus[j])
+		spare[j]='.';
+	}
+	fprintf(fp,"%*.*s\n",width,width,ptr);
+    }
+    
+    fprintf(fp,"\n");
+
+}
+
+
+void dumpRegion(EdStruct *xx, FILE *fp, int start, int end, int width)
+{
+    for(;start<=end;start+=width)
+	dumpLine(xx, fp, start, (end-start+1<width)?end-start+1:width);
+}
+
+
+
+
+
+
+void dumpContig(EdStruct *xx)
+{
+    int left,right;
+
+    static int i = 0;
+    char fn[1024];
+    FILE *fp;
+
+    i++;
+    sprintf(fn,"dump.%d.%d",getpid(),i);
+
+    if ( (fp = fopen(fn,"w")) != NULL ) {
+	extents(xx, &left, &right);
+	bell();
+	dumpRegion(xx,fp,left,right,60);
+	bell();
+	fclose(fp);
+    }
+}
+
+
+
+int createEdDisplay(EdStruct *xx, Widget namesWid, Widget sequencesWid, int seq, int pos)
+/*
+** Create the initial sequence display
+*/
+{
+    int i;
+
+    /*
+    ** Initial position on screen
+    */
+    xx->cursorSeq = 0;
+    xx->cursorPos = pos;
+
+    for (i=1; i<=xx->DB_gelCount; i++) {
+	if (DBgetNumber(xx,i) == seq) {
+	    xx->cursorSeq = i;
+	    xx->cursorPos = pos;
+	    break;
+	}
+    }
+
+    /*
+    ** Display.
+    ** set xx->displayPos to force repositioning of cursor
+    */
+    xx->displayPos = positionInContig(xx,xx->cursorSeq,xx->cursorPos) +
+	2*xx->displayWidth;
+    redisplayWithCursor(xx);
+    return 0;
+}
+
+
+void incDisplayPos (EdStruct *xx, int distance)
+/*
+** Increase the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    if (editorLocked()) {
+	incDisplayPosP(intToEdStruct(0), distance);
+	incDisplayPosP(intToEdStruct(1), distance);
+    } else
+	incDisplayPosP(xx, distance);
+
+    redisplayDisagreement();
+}
+
+
+void decDisplayPos (EdStruct *xx, int distance)
+/*
+** Decrease the leftmost base position on the screen by a symbolic ammount
+*/
+{
+    if (editorLocked()) {
+	decDisplayPosP(intToEdStruct(0), distance);
+	decDisplayPosP(intToEdStruct(1), distance);
+    } else
+	decDisplayPosP(xx, distance);
+
+    redisplayDisagreement();
+}
+
+
+
+
+
+void setDisplayPosPercent (EdStruct *xx, float percent)
+/*
+** Set the leftmost base position on the screen given a percentage
+*/
+{
+    if (editorLocked()) {
+	setDisplayPosPercentP(intToEdStruct(0),percent);
+	setDisplayPosPercentP(intToEdStruct(1),percent);
+    } else
+	setDisplayPosPercentP(xx,percent);
+
+    redisplayDisagreement();
+}
+
+void setDisplayPos(EdStruct *xx, int pos)
+/*
+** centralise pos on screen
+*/
+{
+    if (pos==1) pos = 2; /*if pos==1 no redraw is done. So force*/
+
+    if (editorLocked()) {
+        int offset = editorLockedPos(0/*don't force recalculation*/);
+	EdStruct *otherxx;
+	otherxx = intToEdStruct(0);
+	if (otherxx == xx) {
+	    otherxx = intToEdStruct(1);
+	    otherxx->displayPos = pos + offset;
+	} else {
+	    otherxx->displayPos = pos - offset;
+	}
+    }
+
+    xx->displayPos = pos;
+    decDisplayPos(xx,D_halfScreen);
+
+}
+
+static void showCursor(EdStruct *xx, int seq, int pos)
+/*
+** ensure that the cursor is visible on the screen
+*/
+{
+    if (onScreen(xx,seq,pos))
+        positionCursor(xx,xx->sequencesWid,seq,pos);
+    else {
+	setDisplayPos(xx,positionInContig(xx,seq,pos));
+/*
+	if (xx->displayPos > positionInContig(xx,seq,pos))
+	    decDisplayPos(xx,D_halfScreen);
+	else
+	    incDisplayPos(xx,D_halfScreen);
+	xx->displayPos = positionInContig(xx,seq,pos) - (xx->displayWidth/2);
+	if (xx->displayPos < 1) xx->displayPos = 1;
+	if (xx->displayPos > (DBgetLength(xx,0)+2 - xx->displayWidth))
+	    xx->displayPos = (DBgetLength(xx,0)+2 - xx->displayWidth);
+	redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+*/
+    }
+}
+
+void redisplayWithCursor(EdStruct *xx)
+/*
+** Redisplay screen, ensuring cursor display
+*/
+{
+    if (onScreen(xx,xx->cursorSeq,xx->cursorPos)) {
+	redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	redisplayDisagreement();
+    } else
+	showCursor(xx,xx->cursorSeq,xx->cursorPos);
+}
+
+void caretRight (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor right
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    if (xx->cursorPos<=DBgetLength(xx,xx->cursorSeq)) {
+	xx->cursorPos++;
+        showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    } else
+	bell();
+
+}
+
+
+void caretLeft (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor left
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+
+    if (xx->cursorPos>1) {
+	xx->cursorPos--;
+        showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    } else
+	bell();
+
+}
+
+void caretDown (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor down,
+** cycle if necessary
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int *seqList,seqCount;
+    int posInContig;
+    int i;
+
+    posInContig = positionInContig(xx,xx->cursorSeq,xx->cursorPos);
+    seqList = sequencesInRegion(xx,posInContig-1,2);
+    seqCount = linesInRegion(xx,posInContig-1,2);
+    for(i=0;
+	i<seqCount && seqList[i]!=xx->cursorSeq;
+	i++);
+
+    i++;
+    if (i == seqCount) i = 0;
+
+    xx->cursorSeq = seqList[i];
+    xx->cursorPos = posInContig - DBgetRelPos(xx,xx->cursorSeq) + 1;
+    if (xx->cursorPos<1)
+	xx->cursorPos = 1;
+    else
+	if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+	    xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    
+}
+
+void caretUp (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** Move cursor up,
+** cycle if necessary
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int *seqList,seqCount;
+    int posInContig;
+    int i;
+
+    posInContig = positionInContig(xx,xx->cursorSeq,xx->cursorPos);
+    seqList = sequencesInRegion(xx,posInContig-1,2);
+    seqCount = linesInRegion(xx,posInContig-1,2);
+    for(i=0;
+	i<seqCount && seqList[i]!=xx->cursorSeq;
+	i++);
+
+    if (i==0) i = seqCount;
+    i--;
+
+    xx->cursorSeq = seqList[i];
+    xx->cursorPos = posInContig - DBgetRelPos(xx,xx->cursorSeq) + 1;
+    if (xx->cursorPos<1)
+	xx->cursorPos = 1;
+    else
+	if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+	    xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+    
+}
+
+
+static void deleteBase (EdStruct *xx, int seq, int pos)
+/*
+** delete a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq && pos) {
+	int j;
+	char *s;
+	char deletedBase;
+	s=DBgetSeq(xx,seq);
+	deletedBase = s[pos-1];
+        for (j = pos; j < DBgetLength(xx,seq); j++)
+            s[j-1] = s[j];
+	DBsetLength(xx,seq,DBgetLength(xx,seq)-1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	tagDeleteBase(xx,seq,pos,&deletedBase);
+        selectDeleteBase(xx, seq, pos);
+    }
+    else if (seq) {
+	/*
+	** deletion at position 0 - shift left sequence
+	*/
+	int i,j;	
+
+        if (DBgetRelPos(xx,seq)==1) {
+	    for (i=1; i <= xx->DB_gelCount; i++) {
+		if (seq == xx->DBorder[i]) {
+		    for (j=i;j>1;j--)
+			xx->DBorder[j] = xx->DBorder[j-1];
+		    xx->DBorder[1] = seq;
+		} else
+		    DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])+1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)-1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; DBgetRelPos(xx,xx->DBorder[j-1]) > DBgetRelPos(xx,seq) ;j--)
+		xx->DBorder[j] = xx->DBorder[j-1];
+	    xx->DBorder[j] = seq;
+	}
+
+    }
+
+}
+
+void deleteKey (Widget widget, XEvent *event, String *params,
+		Cardinal *num_params)
+/*
+** delete a character
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    int i;
+    char ch;
+
+    if (!(xx->DB_flags & DB_ACCESS_UPDATE)) return;
+    if (! onScreen(xx, xx->cursorSeq, xx->cursorPos)) {
+	/*
+	** When cursor is currently displayed on the screen,
+	** give a warning tone, and recentre screen on cursor
+	*/
+	showCursor(xx,xx->cursorSeq, xx->cursorPos);
+	bell();
+    } else if (positionInContig(xx,xx->cursorSeq,xx->cursorPos)==1 && !xx->cursorSeq) {
+	bell();
+    } else if (xx->cursorSeq) {
+	if (editModeIsSuperman()) {
+	    if (xx->cursorPos==1)
+		ch = ' ';
+	    else {
+	        char *s;
+	        s = DBgetSeq(xx,xx->cursorSeq);
+	        ch = s[xx->cursorPos-2];
+	    }
+
+	    recordEdit(&bucketStack,xx,undoDelete,xx->cursorSeq,xx->cursorPos-1,ch,0);
+
+	    deleteBase (xx,xx->cursorSeq, xx->cursorPos-1);
+
+	    calculateConsensusLength(xx);
+
+	    if (xx->cursorPos>1) xx->cursorPos--;
+	    redisplayWithCursor(xx);
+
+	} else bell();
+    } else {
+        char consensus[2];
+	DBcalcConsensus(xx,xx->cursorPos-1,1,consensus,BOTH_STRANDS);
+	if (*consensus != '*') {
+	    /*
+	    ** Only allow deletion from consensus when it's an asterisk
+	    */
+	    bell();
+	} else {
+	    recordEdit(&bucketStack,xx,undoMark,0,xx->cursorPos,'\0',0);
+            for (i=1; i<=xx->DB_gelCount; i++) {
+                if (DBgetRelPos(xx,i) + DBgetLength(xx,i) <= xx->cursorPos) ;
+		else if (DBgetRelPos(xx,i) >= xx->cursorPos) {
+		    recordEdit(&bucketStack,xx,undoDelete,i,0,' ',0);
+		    deleteBase (xx,i, 0);
+		}
+		else {
+	            char *s;
+		    s=DBgetSeq(xx,i);
+	            ch = s[xx->cursorPos-DBgetRelPos(xx,i)-1];
+		    recordEdit(&bucketStack,xx,undoDelete,i,xx->cursorPos-DBgetRelPos(xx,i),ch,0);
+		    deleteBase (xx,i, xx->cursorPos-DBgetRelPos(xx,i));
+		}
+
+            }
+	    recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+
+	    DBsetLength(xx,0,DBgetLength(xx,0)-1);
+	    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	    redisplayDisagreement();
+	    xx->cursorPos--;
+	    redisplayWithCursor(xx);
+	}
+    }
+    
+}
+
+static void insertBase (EdStruct *xx, int seq, int pos, char base)
+/*
+** insert a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq && base!=' ') {
+	int j;
+	char *s;
+	int len = DBgetLength(xx,seq);
+	/* don't let user overrun maximum gel length */
+	if (len<(int)*saveState.maxgel) {
+	    s=DBgetSeq(xx,seq);
+	    for (j = len+1; j > pos; j--)
+		s[j-1] = s[j-2];
+	    s[pos-1] = base;
+	    DBsetLength(xx,seq,len+1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	    tagInsertBase(xx,seq,pos,&base);
+	    selectInsertBase(xx, seq, pos);
+	}
+    }
+    else if (seq) {
+	/*
+	** space inserted at start: shift sequence right
+	*/
+	int i,j;	
+
+	if (DBgetRelPos(xx,seq)==1)
+	    for (i=1; i<xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i+1])==1;i++);
+	else
+	    i=0;
+        if (i==1) {
+	    for (i=2; i <= xx->DB_gelCount; i++) {
+		DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])-1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)+1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; j < xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[j+1]) < DBgetRelPos(xx,seq) ; j++)
+		xx->DBorder[j] = xx->DBorder[j+1];
+	    xx->DBorder[j] = seq;
+	}
+
+    }
+
+}
+
+static void insertChar (EdStruct *xx,int seq, int pos, char key)
+/*
+** insert a character
+*/
+{
+    int i;
+
+    /* Insert the base to the right of the caret */
+    /* perform a check to see that sequences aren't too long */
+    if (seq) {
+	/* don't let user overrun maximum gel length */
+	if (editModeIsSuperman() &&
+	    DBgetLength(xx,seq)<(int)*saveState.maxgel) {
+
+	    recordEdit(&bucketStack,xx,undoInsert,seq,pos,key,0);
+
+	    insertBase(xx, seq, pos, key);
+
+	    calculateConsensusLength(xx);
+
+	    if (key!=' ') xx->cursorPos = ++pos;
+	    redisplayWithCursor(xx);
+	} else bell();
+    } else if (key == '*') {
+	/*
+	** Only allow insertion of asterisks from consensus
+	*/
+	recordEdit(&bucketStack,xx,undoMark,0,pos,'\0',0);
+	for (i=1; i<=xx->DB_gelCount; i++) {
+	    int len = DBgetLength(xx,i);
+	    int relPos = DBgetRelPos(xx,i);
+	    if (relPos + len < pos ||
+		len>=(int)*saveState.maxgel) ;
+	    else if (relPos > pos) {
+		recordEdit(&bucketStack,xx,undoInsert,i,0,key,0);
+		insertBase(xx, i,0,' ');
+	    }
+	    else if (len) {
+		recordEdit(&bucketStack,xx,undoInsert,i, pos-relPos+1,key,0);
+	        insertBase(xx, i, pos-relPos+1, key);
+	    }
+	}
+	recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+
+	DBsetLength(xx,0,DBgetLength(xx,0)+1);
+ 
+	xx->cursorPos = ++pos;
+	redisplayWithCursor(xx);
+    } else {
+	bell();
+    }
+}
+
+static void replaceBase (EdStruct *xx, int seq, int pos, char base)
+/*
+** replace a single base in a sequence
+** and update the screen if necessary
+*/
+{
+    if (seq) {
+	char *s;
+	char deletedBase;
+	s=DBgetSeq(xx,seq);
+	deletedBase=s[pos-1];
+        s[pos-1] = base;
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED);
+	tagReplaceBase(xx,seq,pos,&deletedBase, &base);
+    }
+
+}
+
+static void replaceChar (EdStruct *xx, int seq, int pos, char key)
+/*
+** replace a character
+*/
+{
+    /* int i,j; */
+    char ch;
+
+    /*
+    ** Don't allow replacing past end of sequence
+    */
+    if (DBgetLength(xx,seq) < pos) {
+	bell();
+	return;
+    }
+    if (seq) {
+	char *s;
+	s=DBgetSeq(xx,seq);
+
+	ch = s[pos-1];
+	recordEdit(&bucketStack,xx,undoReplace,seq,pos,ch,0);
+
+	replaceBase(xx,seq, pos, key);
+
+	xx->cursorPos = ++pos;
+	redisplayWithCursor(xx);
+
+    } else {
+	bell();
+	/*
+	** Don't allow replacements on consensus at ANY stage
+	**
+	** recordEdit(&bucketStack,xx,undoMark,0,pos,'\0',0);
+	** for (i=1; i<=xx->DB_gelCount; i++) {
+	**     if (DBgetRelPos(xx,i) + DBgetLength(xx,i) <= pos) ;
+	**     else if (DBgetRelPos(xx,i) > pos) ;
+	**     else if (DBgetLength(xx,i)) {
+	** 	char *s;
+	** 	s=DBgetSeq(xx,i);
+	** 	ch = s[pos-DBgetRelPos(xx,i)];
+	** 	recordEdit(&bucketStack,xx,undoReplace,i,pos-DBgetRelPos(xx,i)+1,ch,0);
+	**         replaceBase(xx,i, pos-DBgetRelPos(xx,i)+1, key);
+	**     }
+	** }
+	** recordEdit(&bucketStack,xx,undoMark,0,0,'\0',0);
+	**
+	**
+	** xx->cursorPos = ++pos;
+        ** redisplayWithCursor(xx);
+	*/
+    }
+}
+
+static int validKey(char key)
+{
+    static char validKeys[] = "CcTtAaGg1234DVBHKLMNRY5678-*";
+    if (*saveState.idm == 26)
+	return (key!=' ');
+    else
+	return (strchr(validKeys,key) != NULL);
+}
+
+void keyPress (Widget widget, XEvent *event, String *params,
+	       Cardinal *num_params)
+/*
+** Handle a key press
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    char keys[8];
+    char key;
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    /*
+    ** validkeys is different when database is for DNA and when
+    ** database is for protein
+    **
+    ** switch (xx->DB_flags & DB_DATA_TYPE) {
+    **     case DB_DATA_TYPE_DNA:     do dna stuff
+    **     case DB_DATA_TYPE_PROTEIN: do protein stuff
+    **     default : do dna stuff???
+    ** }
+    ** BASES = 6 for dna, 26 for protein (specified by fortran variable IDM?)
+    static char validKeys[] = "CcTtAaGg1234DVBHKLMNRY5678-*";
+    */
+
+    if (!(xx->DB_flags & DB_ACCESS_UPDATE)) return;
+
+    charCount = XLookupString((XKeyEvent *)event, keys,
+			      (int)sizeof(keys), &keySym,&compose);
+    key = keys[0];
+    if (charCount==1) {
+	if (! onScreen(xx, xx->cursorSeq, xx->cursorPos)) {
+	    /*
+	    ** When cursor is currently displayed on the screen,
+	    ** give a warning tone, and recentre screen on cursor
+	    */
+	    showCursor(xx,xx->cursorSeq, xx->cursorPos);
+	    bell();
+	} else if (validKey(key)) {
+	    if (editModeIsInsert())
+	        insertChar (xx,xx->cursorSeq, xx->cursorPos, key);
+	    else
+	        replaceChar (xx,xx->cursorSeq, xx->cursorPos, key);
+        }
+	else {
+	    if (xx->cursorSeq && xx->cursorPos==1 && key == ' ' && editModeIsInsert())
+	        insertChar (xx,xx->cursorSeq, xx->cursorPos, key);
+	    else bell();
+	}
+    }
+}
+
+void buttonDown (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** An attempt has been made to reposition the cursor
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+#define left_margin 4
+#define top_margin 4
+
+    int x,y;
+    int *seqList;
+
+    if (editorState == StateDown) return;
+
+    if (widget != xx->sequencesWid) return;
+
+    x = (event->xbutton.x-left_margin) / xx->fontWidth;
+    y = (event->xbutton.y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+
+
+    if (y>=0 && y < xx->displayHeight && x>=0 && x < xx->displayWidth) {
+        seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+        xx->cursorSeq = seqList[y];
+        xx->cursorPos = xx->displayPos - DBgetRelPos(xx,xx->cursorSeq) + x + 1;
+	if (xx->cursorPos<1)
+	    xx->cursorPos = 1;
+	else
+	    if (xx->cursorPos > DBgetLength(xx,xx->cursorSeq)+1)
+		xx->cursorPos = DBgetLength(xx,xx->cursorSeq)+1;
+        positionCursor(xx,widget,xx->cursorSeq,xx->cursorPos);
+    }
+
+}
+
+static void undoLastEdit (EdStruct *xx, int command, int seq, int pos, char ch, int t)
+/*
+** Undo a command
+*/
+{
+    switch (command) {
+	case undoDelete:
+	    insertBase(xx, seq,pos,ch);
+	    break;
+	case undoInsert:
+	    deleteBase(xx,seq,(ch==' ')?0:pos);
+	    break;
+	case undoReplace:
+	    replaceBase(xx,seq,pos,ch);
+	    break;
+	case undoExtend:
+	    unextend(xx,seq,ch);
+	    break;
+	case undoUnextend:
+	    undo_unextend(xx,seq,ch,t);
+	    break;
+	default:
+	    break;
+	}
+}
+
+void undoLastCommand()
+/*
+** Undo last keypress that modified contig
+*/
+{
+    EdStruct *xx;
+    int command;
+    int seq;
+    int pos;
+    char ch;
+    int t;
+
+    retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+    if (command == undoNullCommand) {
+	bell();
+	return;
+    }
+    else if (command == undoMark) {
+
+        retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+	while (command != undoMark) {
+	    undoLastEdit(xx,command,seq,pos,ch,t);
+            retrieveEdit(&bucketStack,&xx,&command,&seq,&pos,&ch,&t);
+	}
+
+        xx->cursorSeq = 0;
+        xx->cursorPos = pos;
+
+    } else {
+
+	undoLastEdit(xx,command,seq,pos,ch,t);
+
+        xx->cursorSeq = seq;
+        xx->cursorPos = pos+(command==undoDelete);
+
+    }
+
+    calculateConsensusLength(xx);
+    redisplayWithCursor(xx);
+}
+
+static int showTrace(EdStruct *xx, int seq, int pos, int baseSpacing)
+{
+
+    char fileName[200];
+    tagStruct *t;
+    /*
+    int traceNo;
+    */
+    int t_len;        /* number of bases in trace */
+    int t_lcut;       /* left cut off of usable data */
+    int t_ulen;       /* length of usable data */
+    char t_type[5];   /* type of trace */
+    char t_fname[FILE_NAME_LENGTH+1]; /* file name of trace */
+    char *rawData;
+    int baseNum;
+
+    t = (tagStruct *) DBgetTags(xx,seq);
+
+    force_comment(t);
+
+    if (! t->newcommentlen) return 1;
+    sscanf(t->newcomment,"%6d%6d%6d%*s",&t_len,&t_lcut,&t_ulen);
+    strncpy(t_type,&t->newcomment[18],4);
+    strncpy(t_fname,&t->newcomment[22],FILE_NAME_LENGTH);
+
+    t_type[4] = '\0';
+    {   /* convert fortran string to c string */
+        int i;
+        for (i=FILE_NAME_LENGTH-1;i>=0 && (!t_fname[i] || isspace(t_fname[i]));i--) ;
+        t_fname[++i] = '\0';
+    }
+
+    /* skip if no raw data file for trace */
+    if (t_fname[0] == '\0') return 1;
+
+    /* check trace file exists */
+    if ( ! file_exists(t_fname) ) {
+	char *s;
+        /* try in rawData */
+	/*
+	** get environment details
+	*/
+	rawData = (char *)getenv ("RAWDATA");
+        if (rawData == NULL) return 1;
+	/*
+        sprintf (fileName,"%s/%s",rawData,t_fname);
+	*/
+	if ((s=findfile(t_fname,rawData))==NULL) return 1;
+	strcpy (fileName,s);
+    } else
+        /* ok */
+        strcpy (fileName,t_fname);
+
+    if (DBgetComp(xx,seq) == UNCOMPLEMENTED) {
+        baseNum = origpos(xx, seq, pos) + t_lcut;
+	manageTrace(t_type, fileName, baseNum, t_lcut, t_ulen, /*not complemented*/0, baseSpacing, DBgetName(xx,seq));
+    } else {
+        baseNum = t_len - t_lcut - origpos(xx, seq, normalisePos(xx,seq,pos,1));
+	manageTrace(t_type, fileName, baseNum, t_lcut, t_ulen, /*complemented*/1, baseSpacing, DBgetName(xx,seq));
+    }
+
+
+    return 0;
+}
+
+void invokeTrace (Widget widget, XEvent *event, String *params,
+		  Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+    int baseSpacing = xx->fontWidth * 2;
+
+    if (xx->cursorSeq) {
+	showTrace(
+	    xx,
+	    xx->cursorSeq,
+	    xx->cursorPos,
+	    baseSpacing);
+    } else {
+	int *seqList;
+	int i;
+        seqList = sequencesOnScreen(xx,xx->cursorPos,1);
+	for (i=0; i<xx->displayHeight && seqList[i]; i++)
+	    showTrace(
+		xx,
+		seqList[i],
+		xx->cursorPos-DBgetRelPos(xx,seqList[i])+1,
+		baseSpacing);
+
+    }
+}
+
+void countDisagreements(int *overlapLength, int *wingeCount)
+{
+    int left0,right0;
+    int left1/*,right1*/;
+    int length0,length1;
+    int offset = editorLockedPos(1/*force recalculation*/);
+    int i;
+    EdStruct *xx[2];
+    char *ol0,*ol1;
+
+    xx[0] = intToEdStruct(0);
+    xx[1] = intToEdStruct(1);
+
+    if (offset < 0) {
+	left0 = 1-offset;
+	left1 = 1;
+    } else {
+	left0 = 1;
+	left1 = 1+offset;
+    }
+    length0 = DBgetLength(xx[0],0);
+    length1 = DBgetLength(xx[1],0);
+    if (offset+length0 < length1) {
+	right0 = length0;
+	/*
+	right1 = length0-offset;
+	*/
+    } else {
+	right0 = length1-offset;
+	/*
+	right1 = length1;
+	*/
+    }
+    *overlapLength = right0 - left0+1;
+    *wingeCount  = 0;
+
+    if (*overlapLength > 0) {
+	ol0 = (char *) malloc(*overlapLength+1);
+	ol1 = (char *) malloc(*overlapLength+1);
+	DBcalcConsensus(xx[0],left0,*overlapLength,ol0,BOTH_STRANDS);
+	DBcalcConsensus(xx[1],left1,*overlapLength,ol1,BOTH_STRANDS);
+	for (i=0;i<*overlapLength;i++) if(ol0[i]!=ol1[i])(*wingeCount)++;
+	free(ol0);
+	free(ol1);
+    }
+}
+
+
+void selectRead (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+/*
+** A gel reading name has been selected - toggle highlighting
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+#define left_margin 4
+#define top_margin 4
+
+    int y;
+    int *seqList;
+
+    if (editorState == StateDown) return;
+
+    if (widget != xx->namesWid) return;
+
+    y = (event->xbutton.y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+
+    if (y>=0 && y < xx->displayHeight) {
+	int flag;
+        seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+	flag = DBgetFlags(xx,seqList[y]);
+	flag ^= DB_FLAG_SELECTED;
+	DBsetFlags(xx,seqList[y],flag);
+        redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+    }
+
+}
+
+
+
+
+/*************************************************************
+**
+** Fast accessing routines... utilising binary search
+**
+*************************************************************/
+
+int posToIndex(EdStruct *xx, int pos)
+/*
+** Find the first sequence that starts at or to the right of a
+** given position
+*/
+{
+    int Min, Max, Mid;
+
+    /* binary search */
+    /* Min, Max, Mid refer to pairs of numbers: ie MAX --> [MAX-1],[MAX] */
+    Min = 1;
+    Max = xx->DB_gelCount + 1;
+    
+    do {
+	int r1,r2;
+
+	Mid = (Max+Min)/2;
+
+	/* compare */
+	r1 = (Mid==1)?(pos-1):DBgetRelPos(xx,xx->DBorder[Mid-1]);
+	r2 = (Mid==xx->DB_gelCount+1)?(pos+1):DBgetRelPos(xx,xx->DBorder[Mid]);
+
+	if (r1 < pos && r2 >= pos) 
+	    return (Mid==xx->DB_gelCount+1)?(0):Mid;
+
+	if (r1 < pos)
+	    Min = Mid+1;
+	else
+	    Max = Mid-1;
+
+    } while (Max>=Min);
+
+    return 0;
+
+}
+
+
+int posToSeq(EdStruct *xx, int pos)
+/*
+** Find the first sequence that starts at or to the right of a
+** given position
+*/
+{
+    int ind;
+
+    ind = posToIndex(xx,pos);
+    if (ind)
+	return xx->DBorder[ind];
+    else
+	return 0;
+
+}
+
+
+int seqToIndex(EdStruct *xx, int seq)
+/*
+**
+*/
+{
+    int i;
+    int ind;
+
+    ind = posToIndex(xx,DBgetRelPos(xx,seq));
+
+    if (ind) {
+	for (i=ind; i<=xx->DB_gelCount && xx->DBorder[i]!=seq;i++) ;
+
+	if (i<=xx->DB_gelCount) return i;
+    }
+
+    return 0;
+	
+}
+
+
+
+
diff --git a/src/staden/edUtils.h b/src/staden/edUtils.h
new file mode 100644
index 0000000..cc3b1a4
--- /dev/null
+++ b/src/staden/edUtils.h
@@ -0,0 +1,257 @@
+#include <X11/Intrinsic.h>
+#include "tagUtils.h"
+#include "fort.h"
+
+#ifndef _edUtils_h
+#define _edUtils_h
+
+/*
+** constant definitions
+*/
+#define MAX_DISPLAY_WIDTH 200
+#define DEFAULT_DISPLAY_WIDTH 80
+#define DB_NAMELEN 12  /* size of records in AR file */
+#define DB_GELNOLEN 5
+#define NAMELEN (DB_NAMELEN + DB_GELNOLEN + 1)
+#define BASES 6
+
+/*
+** Useful distances
+** (treat as symbolic rather than actual distances)
+*/
+#define D_screen     80
+#define D_halfScreen 40
+#define D_character   1
+
+/*
+** Flags for the internal database
+*/
+/* for whole database */
+#define DB_ACCESS            (1L<<0)
+#define DB_DATA_TYPE         (1L<<1)
+#define DB_STORAGE           (1L<<2)
+#define DB_ACCESS_READONLY   (0L)
+#define DB_ACCESS_UPDATE     (DB_ACCESS)
+#define DB_DATA_TYPE_DNA     (0L)
+#define DB_DATA_TYPE_PROTEIN (DB_DATA_TYPE)
+#define DB_STORAGE_DISK      (0L)
+#define DB_STORAGE_INTERNAL  (DB_STORAGE)
+#define DB_DELAYED_READ      (0L)
+/* for each sequence */
+#define DB_FLAG_NONE             (0L)
+#define DB_FLAG_IN_MEMORY        (1L<<0)
+#define DB_FLAG_SEQ_MODIFIED     (1L<<1)
+#define DB_FLAG_REL_MODIFIED     (1L<<2)
+#define DB_FLAG_TAG_MODIFIED     (1L<<3)
+#define DB_FLAG_SELECTED         (1L<<4)
+#define DB_FLAG_TAG_IN_MEMORY    (1L<<5)
+#define DB_FLAG_SEQ_IN_MEMORY    (1L<<0)
+#define DB_FLAG_NAME_IN_MEMORY   (1L<<6)
+/*
+** type definitions
+*/
+typedef struct {
+        int relPos;
+        int length;
+        int number;
+        int complemented;
+        char *name;
+        char *sequence;
+	long flags;
+	tagStruct *tagList;
+        } DBStruct, *DBptr;
+
+
+/* REMEMBER TO: update structure initialisation in edUtils.c */
+typedef struct {
+        int oldMax;
+	long DB_flags;
+        int DB_gelCount;
+        int DB_contigNum;
+        int *DBlist;
+        int *DBorder;
+        DBStruct *DB;
+        int displayPos ;
+        int displayWidth;
+        int displayHeight;
+        int cursorPos;
+        int cursorSeq;
+        int rulerDisplayed;
+        int consensusDisplayed;
+        int fontWidth;
+        int fontHeight;
+	Widget edWid;
+        Widget namesWid;
+	Widget sequencesWid;
+	Widget scrollButtonsWid;
+	Widget sliderWid;
+        char displayedConsensus[ MAX_DISPLAY_WIDTH ];
+	int select_made;
+	int select_seq;
+	int select_start_pos;
+	int select_end_pos;
+	tagStruct *select_tag;
+	int reveal_cutoffs;
+	int showDifferences;
+        } EdStruct, *EdStructPtr;
+/* REMEMBER TO: update structure initialisation in edUtils.c */
+
+
+/*
+** Useful macros concerning internal database
+*/
+#define DBgetRelPos(X,A)   ((X)->DB[(A)].relPos)
+#define DBgetLength(X,A)   ((X)->DB[(A)].length)
+#define DBgetNumber(X,A)   ((X)->DB[(A)].number)
+#define DBgetComp(X,A)     ((X)->DB[(A)].complemented)
+#define DBgetFlags(X,A)    ((X)->DB[(A)].flags)
+extern char *DBgetSeq();
+extern tagStruct *DBgetTags();
+extern char *DBgetName();
+
+/****************/
+#define DB_RelPos(X,A)   ((X)->DB[(A)].relPos)
+#define DB_Length(X,A)   ((X)->DB[(A)].length)
+#define DB_Number(X,A)   ((X)->DB[(A)].number)
+#define DB_Comp(X,A)     ((X)->DB[(A)].complemented)
+#define DB_Name(X,A)     ((X)->DB[(A)].name)
+#define DB_Seq(X,A)      ((X)->DB[(A)].sequence)
+#define DB_Flags(X,A)    ((X)->DB[(A)].flags)
+#define DB_Tags(X,A)     ((X)->DB[(A)].tagList)
+/****************/
+#define COMPLEMENTED -1
+#define BOTH_STRANDS 0
+#define UNCOMPLEMENTED 1
+#define DBgetGelName(xx,i) ( &( DBgetName(xx,i) )[DB_GELNOLEN+1] )
+
+#define DBsetRelPos(X,A,B) (X)->DB[(A)].relPos = (B)
+#define DBsetLength(X,A,B) (X)->DB[(A)].length = (B)
+#define DBsetNumber(X,A,B) (X)->DB[(A)].number = (B)
+#define DBsetComp(X,A,B)   (X)->DB[(A)].complemented = (B)
+#define DBsetName(X,A,B)   (X)->DB[(A)].name = (B)
+#define DBsetSeq(X,A,B)    (X)->DB[(A)].sequence = (B)
+#define DBsetFlags(X,A,B)  (X)->DB[(A)].flags = (B)
+#define DBsetTags(X,A,B)   (X)->DB[(A)].tagList = (B)
+
+/*
+** Useful macros
+*/
+#define normalisePos(X,S,P,L) \
+    ( (DBgetComp((X),(S))==UNCOMPLEMENTED) ? (P) : (DBgetLength((X),(S)) - (P) - (L) + 2) )
+
+
+/*
+** External definitions
+*/
+extern int initialiseDB(
+	EdStruct *xx,
+        int_f *idevr,    /* unit number for relationships */
+	int_f *idevw,	/* unit number for working versions of sequences */
+	int_f *idevn,	/* unit number for sequence names */
+	int_f *relpg,	/* relative positions of gels in sequences */
+	int_f *lngthg,	/* lengths of sequences */
+	int_f *lnbr,	/* left neighbours */
+	int_f *rnbr,	/* right neighbours */
+	int_f *maxgel,	/* maximum length of gel */
+	int_f *idbsiz,	/* size of database */
+	int_f *llino	/* left-most gel in contig */
+	);
+
+extern void saveDB(
+	EdStruct *xx,
+        int_f *idevr,    /* unit number for relationships */
+        int_f *idevw,    /* unit number for working versions of sequences */
+        int_f *idevn,    /* unit number for sequence names */
+        int_f *relpg,    /* relative positions of gels in sequences */
+        int_f *lngthg,   /* lengths of sequences */
+        int_f *lnbr,     /* left neighbours */
+        int_f *rnbr,     /* right neighbours */
+        int_f *maxgel    /* maximum length of gel */
+    );
+
+extern void joinDB(
+        int_f *idevr,    /* unit number for relationships */
+        int_f *idevw,    /* unit number for working versions of sequences */
+        int_f *idevn,    /* unit number for sequence names */
+        int_f *relpg,    /* relative positions of gels in sequences */
+        int_f *lngthg,   /* lengths of sequences */
+        int_f *lnbr,     /* left neighbours */
+        int_f *rnbr,     /* right neighbours */
+        int_f *maxgel,   /* maximum length of gel */
+        int_f *llinol,
+        int_f *lnconl,
+        int_f *llinor,
+        int_f *lnconr,
+        int_f *ngels,
+        int_f *nconts,
+        int_f *idbsiz
+    );
+
+
+extern void freeDB(EdStruct *xx);
+extern void countDisagreements(int *overlapLength, int *wingeCount);
+
+extern int  createEdDisplay(EdStruct *xx,Widget namesWid, Widget sequencesWid,int seq,int pos);
+
+extern void incDisplayPos(EdStruct *xx, int distance);
+extern void decDisplayPos(EdStruct *xx, int distance);
+extern void setDisplayPos(EdStruct *xx, int pos);
+extern void setDisplayPosPercent(EdStruct *xx, float percent);
+
+extern int posToIndex(EdStruct *xx, int pos);
+
+extern void undoLastCommand();
+extern void saveDatabase(EdStruct *xx);
+
+extern void caretRight(Widget, XEvent *, String *, Cardinal *);
+extern void caretLeft(Widget, XEvent *, String *, Cardinal *);
+extern void caretDown(Widget, XEvent *, String *, Cardinal *);
+extern void caretUp(Widget, XEvent *, String *, Cardinal *);
+extern void deleteKey(Widget, XEvent *, String *, Cardinal *);
+extern void keyPress(Widget, XEvent *, String *, Cardinal *);
+extern void buttonDown(Widget, XEvent *, String *, Cardinal *);
+extern void invokeTrace(Widget, XEvent *, String *, Cardinal *);
+extern void selectRead(Widget, XEvent *, String *, Cardinal *);
+
+extern EdStructPtr intToEdStruct(int i);
+extern EdStructPtr widgetToEdStruct(Widget w);
+extern EdStructPtr getFreeEdStruct();
+
+extern void DBgetSequence(EdStruct *xx, int seq, int pos, int width, char *str);
+/*
+** get part of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+extern int *sequencesInRegion(EdStruct *xx,int pos, int width);
+/*
+** Return a pointer to list of sequences in region of contig
+*/
+extern int *sequencesOnScreen(EdStruct *xx,int pos, int width);
+/*
+** Return a pointer to list of sequences on screen
+*/
+extern int positionInContig(EdStruct *xx, int seq, int pos);
+/*
+** returns relative position in a sequence as an 
+** absolute position in the contig
+*/
+extern void DBcalcConsensus (EdStruct *xx,int pos, int width, char *str, int strand);
+/*
+** calculate the consensus for position `pos' in contig,
+** for `width' characters. Take into accound readings on `strand' only:
+**
+*/
+extern void redisplaySequences (EdStruct *xx, Widget namesWid, Widget sequencesWid, int pos, int width);
+/*
+** Redisplay the whole sequence display
+*/
+extern void redisplayWithCursor(EdStruct *xx);
+/*
+** Redisplay screen, ensuring cursor display
+*/
+void calculateConsensusLength(EdStruct *xx);
+/*
+** Calculate dynamic consensus length
+*/
+
+#endif /* _edUtils_h */
diff --git a/src/staden/extend.c b/src/staden/extend.c
new file mode 100644
index 0000000..5890e81
--- /dev/null
+++ b/src/staden/extend.c
@@ -0,0 +1,722 @@
+/*
+** Change log:
+**
+**   1/10/91 SD  Added calculateConsensusLength to extend, unextend, undo_unextend
+**   29/4/92 SD  Changes related to general speed up in edUtils.c
+**   18/5/92 SD  Construct (*nc=*++nc) not liked by dec
+*/
+
+#include "edUtils.h"
+#include "fortran.h"
+#include "tagUtils.h"
+#include "contigEditor.h"
+#include "undo.h"
+#include "select.h"
+#include <stdio.h>
+#include <stdlib.h>
+
+#define LEFT 1
+#define RIGHT 2
+
+
+tagStruct *force_get_tag(EdStruct *xx, int seq, char *type)
+/*
+** find a control tag of a particular type
+** Create one if one doesn't exist
+*/
+{
+    tagStruct *t;
+
+    t = (tagStruct *)findTagByType(xx,seq,type);
+
+    if (t == NULL) {
+	/*
+	** create one
+	*/
+	t = newTag();
+	strncpy(t->tagrec.type.c,type,4);
+	t->flags = TAG_INSERTED;
+	insertTag(xx,seq,t);
+    }
+
+    /*
+    ** Force comment to be in memory
+    */
+    force_comment(t);
+
+    return t;
+
+}
+
+
+
+int extend(EdStruct *xx, int seq, int dir, int time)
+{
+    int c = (DBgetComp(xx,seq)==COMPLEMENTED);
+    char t_type[4];
+    tagStruct *t;
+    int use_tag;
+    char extend_char;
+    int l;
+
+    if (DBgetLength(xx,seq) >= (int)*saveState.maxgel) return 1;
+
+    use_tag = (c && dir==LEFT || !c && dir==RIGHT )?RIGHT: LEFT;
+    strncpy(t_type,
+	(use_tag==RIGHT )?TAG_TYPE_RCUT:TAG_TYPE_LCUT,4);
+
+    /*
+    ** Find appropriate tag,
+    ** create one if one doesn't exist
+    */
+    t = force_get_tag(xx,seq,t_type);
+
+    /*
+    ** No cutoff - ignore request
+    */
+    l = t->newcommentlen;
+    if (! l)
+	return 1;
+
+    /*
+    ** Determine character to extend
+    */
+    if (use_tag==LEFT) {
+	/* Get character off end of cutoff */
+	extend_char = t->newcomment[l-1];
+	t->newcomment[l-1]='\0';
+	t->newcommentlen--;
+    } else {
+	/* Get character off start of cutoff */
+	int i;
+	extend_char = t->newcomment[0];
+	for (i=0;t->newcomment[i];i++)
+	    t->newcomment[i] = t->newcomment[i+1];
+	t->newcommentlen--;
+    }
+    t->flags |= TAG_COMMENT_CHANGED;
+
+    /*
+    ** complement if necessary
+    */
+    if (use_tag != dir) {
+	int_f i=1;
+	sqcom_(&extend_char,&i,(int_f)1);
+    }
+
+
+    if (dir==LEFT){
+	int i,j;	
+	char *s;
+
+	/*
+	** insert base at start of sequence
+	*/
+	s=DBgetSeq(xx,seq);
+	for (j = DBgetLength(xx,seq)+1; j > 1; j--)
+            s[j-1] = s[j-2];
+	s[0] = extend_char;
+	DBsetLength(xx,seq,DBgetLength(xx,seq)+1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectInsertBase(xx,seq,1);
+
+	/*
+	** shift sequence left
+	*/
+        if (DBgetRelPos(xx,seq)==1) {
+	    for (i=1; i <= xx->DB_gelCount; i++) {
+		if (seq == xx->DBorder[i]) {
+		    for (j=i;j>1;j--)
+			xx->DBorder[j] = xx->DBorder[j-1];
+		    xx->DBorder[1] = seq;
+		} else
+		    DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])+1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)-1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; DBgetRelPos(xx,xx->DBorder[j-1]) > DBgetRelPos(xx,seq) ;j--)
+		xx->DBorder[j] = xx->DBorder[j-1];
+	    xx->DBorder[j] = seq;
+	}
+
+    } else {
+	/*
+	** insert base at end of sequence
+	*/
+	char *s;
+	int l=DBgetLength(xx,seq);
+	s=DBgetSeq(xx,seq);
+	s[l] = extend_char;
+	DBsetLength(xx,seq,l+1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectInsertBase(xx,seq,l+1);
+    }
+
+    if (use_tag == LEFT) {
+	/*
+	** shift position of all tags right 1
+	*/
+	tagStruct *t;
+	t = (tagStruct *) DBgetTags(xx,seq);
+	while (t != NULL) {
+	    if (!(t->flags & TAG_DELETED) &&
+	       t->tagrec.position) {
+		t->tagrec.position++;
+		t->flags |= TAG_POSITION_CHANGED;
+		DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	    }
+	    t = t->next;
+	}
+    }
+
+    /*
+    ** Modify raw data stuff
+    */
+    {
+	tagStruct *t;
+	char type[5];
+	char name[FILE_NAME_LENGTH+1];
+	int llen,lcut,wlen;
+	t = (tagStruct *) DBgetTags(xx,seq);
+	force_comment(t);
+        sscanf(t->newcomment,"%6d%6d%6d%*s",&llen,&lcut,&wlen);
+	strncpy(type,&t->newcomment[18],4);
+	strncpy(name,&t->newcomment[22],FILE_NAME_LENGTH);
+	wlen++;
+	if (use_tag==LEFT) lcut--;
+        sprintf(t->newcomment,"%6d%6d%6d",llen,lcut,wlen);
+	strncat(t->newcomment,type,4);
+	strncat(t->newcomment,name,FILE_NAME_LENGTH);
+	t->newcommentlen = (int)strlen(t->newcomment);
+	t->flags |= TAG_COMMENT_CHANGED;
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+
+    }
+
+    if (dir == RIGHT) xx->cursorPos++;
+
+    /*
+    ** Lengths are a changing
+    */
+    calculateConsensusLength(xx);
+
+    return 0;
+
+}
+
+
+
+
+
+int unextend(EdStruct *xx, int seq, int dir, int time)
+{
+
+    int c = (DBgetComp(xx,seq)==COMPLEMENTED);
+    char t_type[4];
+    tagStruct *t;
+    int use_tag;
+    char extend_char[2];
+    char external_buff[100];
+    char internal_buff[100];
+    char *ec = extend_char;
+    char *eb = external_buff;
+    char *ib = internal_buff;
+    int l;
+    int end_char_inserted = 0;
+
+    use_tag = (c && dir==LEFT || !c && dir==RIGHT )?RIGHT: LEFT;
+    strncpy(t_type,
+	(use_tag==RIGHT )?TAG_TYPE_RCUT:TAG_TYPE_LCUT,4);
+
+    /*
+    ** Find appropriate tag,
+    ** create one if one doesn't exist
+    */
+    t = force_get_tag(xx,seq,t_type);
+
+    /*
+    ** Process tag information
+    ** collating *DE? flags and *INS flags
+    ** and adjusting other flags accordingly
+    */
+    {
+	tagStruct *t;
+	int ipos;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	ipos = normalisePos(xx,seq,(dir==LEFT)?1:DBgetLength(xx,seq),1);
+
+	while (t != NULL) {
+	    if (!(t->flags & TAG_DELETED) &&
+	       t->tagrec.position == ipos &&
+	       strncmp(t->tagrec.type.c,TAG_TYPE_INSERT,3)==0) {
+		/*
+		** Handle unextending past inserts
+		*/
+		end_char_inserted++;
+		t->flags |= TAG_DELETED;
+		t->time = time;
+		DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	    }else if (!(t->flags & TAG_DELETED) &&
+	       strncmp(t->tagrec.type.c, TAG_TYPE_DELETE,3)==0) {
+		/*
+		** Handle unextending past deletes
+		*/
+		if (use_tag==LEFT && t->tagrec.position == ipos ||
+		   use_tag==RIGHT && t->tagrec.position == (ipos+1)) {
+		    *eb++ = t->tagrec.type.c[3];
+		    t->time = time;
+		    t->flags |= TAG_DELETED;
+		    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+		} else if (use_tag==RIGHT && t->tagrec.position == ipos) {
+		    *ib++ = t->tagrec.type.c[3];
+		    t->time = time;
+		    t->flags |= TAG_DELETED;
+		    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+		} else if (use_tag == LEFT) {
+		    t->tagrec.position--;
+		    t->flags |= TAG_POSITION_CHANGED;
+		    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+		}
+	    } else if (!(t->flags & TAG_DELETED) &&
+	       t->tagrec.position <= ipos &&
+	       t->tagrec.position + t->tagrec.length > ipos){
+		/*
+		** Handle unextending past other tags
+		*/
+		t->tagrec.length--;
+		t->flags |= TAG_LENGTH_CHANGED;
+		DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	    } else if (!(t->flags & TAG_DELETED) &&
+	        use_tag==LEFT && t->tagrec.position) {
+		/*
+		** Handle all other positional adjustments
+		*/
+		t->tagrec.position--;
+		t->flags |= TAG_POSITION_CHANGED;
+		DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	    }
+	    t = t->next;
+	}
+    }
+
+    if (dir==LEFT){
+	int i,j;	
+	char *s;
+	int l=DBgetLength(xx,seq);
+
+	/*
+	** get base at start of sequence
+	*/
+	s=DBgetSeq(xx,seq);
+	*ec++ = s[0];
+	for (j = 2; j <= l; j++)
+            s[j-2] = s[j-1];
+	DBsetLength(xx,seq,l-1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectDeleteBase(xx,seq,1);
+
+	/*
+	** shift sequence right
+	*/
+	if (DBgetRelPos(xx,seq)==1)
+	    for (i=1; i<xx->DB_gelCount && DBgetRelPos(xx,xx->DBorder[i+1])==1;i++);
+	else
+	    i=0;
+        if (i==1) {
+	    for (i=2; i <= xx->DB_gelCount; i++) {
+		DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])-1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)+1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; j < xx->DB_gelCount &&
+               DBgetRelPos(xx,xx->DBorder[j+1]) < DBgetRelPos(xx,seq) ; j++)
+		xx->DBorder[j] = xx->DBorder[j+1];
+	    xx->DBorder[j] = seq;
+	}
+
+
+    } else {
+	/*
+	** get base at end of sequence
+	*/
+	char *s;
+	int l=DBgetLength(xx,seq);
+	s=DBgetSeq(xx,seq);
+	*ec++ = s[l-1];
+	DBsetLength(xx,seq,l-1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectDeleteBase(xx,seq,l);
+    }
+
+    *eb='\0';
+    *ib='\0';
+    *ec='\0';
+    
+    if (end_char_inserted) {
+	/*
+	** Throw it away
+	*/
+	*extend_char = '\0';
+    } else {
+	/*
+	** complement if necessary
+	*/
+	if (use_tag != dir) {
+	    int_f i=1;
+	    sqcom_(extend_char,&i,(int_fl)1);
+	}
+    }
+
+    /*
+    ** Insert unextend character on end of appropriate tag
+    */
+    l = (int)(strlen(extend_char) + strlen(external_buff) +
+	      strlen(internal_buff));
+    if (l) {
+	char *newercomment;
+	newercomment = (char *) TAG_MALLOC( t->newcommentlen + l + 1);
+	if (use_tag==LEFT) {
+	    /* add buffers to end of cutoff */
+	    strcpy(newercomment,t->newcomment);
+	    strcat(newercomment,external_buff);
+	    strcat(newercomment,extend_char);
+	    strcat(newercomment,internal_buff);
+	} else {
+	    /* add buffer to start of cutoff */
+	    strcpy(newercomment,internal_buff);
+	    strcat(newercomment,extend_char);
+	    strcat(newercomment,external_buff);
+	    strcat(newercomment,t->newcomment);
+	}
+	TAG_FREE(t->newcomment);
+	t->newcomment = newercomment;
+	t->newcommentlen = (int)strlen(t->newcomment);
+	t->flags |= TAG_COMMENT_CHANGED;
+    }
+
+    /*
+    ** Modify raw data stuff
+    */
+    if (l) {
+	tagStruct *t;
+	char type[5];
+	char name[FILE_NAME_LENGTH];
+	int llen,lcut,wlen;
+	t = (tagStruct *) DBgetTags(xx,seq);
+	force_comment(t);
+        sscanf(t->newcomment,"%6d%6d%6d%*s",&llen,&lcut,&wlen);
+	strncpy(type,&t->newcomment[18],4);
+	strncpy(name,&t->newcomment[22],FILE_NAME_LENGTH);
+	wlen -= l;
+	if (use_tag==LEFT) lcut+=l;
+        sprintf(t->newcomment,"%6d%6d%6d",llen,lcut,wlen);
+	strncat(t->newcomment,type,4);
+	strncat(t->newcomment,name,FILE_NAME_LENGTH);
+	t->newcommentlen = (int)strlen(t->newcomment);
+	t->flags |= TAG_COMMENT_CHANGED;
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+    }
+
+    if (dir == RIGHT) xx->cursorPos--;
+
+    /*
+    ** Lengths are a changing
+    */
+    calculateConsensusLength(xx);
+
+    return 0;
+
+}
+
+int undo_unextend(EdStruct *xx, int seq, int dir, int time)
+{
+
+    int c = (DBgetComp(xx,seq)==COMPLEMENTED);
+    char t_type[4];
+    tagStruct *t;
+    int use_tag;
+    char extend_char;
+    int l;
+    int lextend;
+    int end_char_inserted = 0;
+    int added_internal = 0;
+    int added_external = 0;
+
+    use_tag = (c && dir==LEFT || !c && dir==RIGHT )?RIGHT: LEFT;
+    strncpy(t_type,
+	(use_tag==RIGHT )?TAG_TYPE_RCUT:TAG_TYPE_LCUT,4);
+
+    /*
+    ** Process tag information
+    ** collating *DE? flags and *INS flags
+    ** and adjusting other flags accordingly
+    */
+    {
+	tagStruct *t;
+	int ipos;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	ipos = normalisePos(xx,seq,(dir==LEFT)?1:DBgetLength(xx,seq),1);
+	if (use_tag==RIGHT) ipos++;
+
+	while (t != NULL) {
+	    if ((t->time == time) &&
+	       strncmp(t->tagrec.type.c,TAG_TYPE_INSERT,3)==0) {
+		/*
+		** Handle unextending past inserts
+		*/
+		end_char_inserted++;
+		extend_char = normaliseBase(xx,seq,t->tagrec.type.c[3]);
+		/*
+		extend_char = t->tagrec.type.c[3];
+		*/
+		t->flags &= ~TAG_DELETED;
+		t->time = 0;
+	    }else if ((t->time == time) &&
+	       strncmp(t->tagrec.type.c, TAG_TYPE_DELETE,3)==0) {
+		/*
+		** Handle unextending past deletes
+		*/
+		if (use_tag==LEFT && t->tagrec.position == ipos ||
+		   use_tag==RIGHT && t->tagrec.position == (ipos+1)) {
+		    added_external++;
+		} else {
+		    added_internal++;
+		}
+		t->flags &= ~TAG_DELETED;
+		t->time = 0;
+	    } else if (!(t->flags & TAG_DELETED) &&
+	       use_tag==LEFT && t->tagrec.position) {
+		/*
+		** Handle all other positional adjustments
+		*/
+		t->tagrec.position++;
+	    }
+	    t = t->next;
+	}
+    }
+
+    /*
+    ** Find appropriate tag,
+    ** create one if one doesn't exist
+    */
+    t = force_get_tag(xx,seq,t_type);
+
+    /*
+    ** No cutoff - ignore request
+    */
+    l = t->newcommentlen;
+    if (! l)
+	return 1;
+
+    /*
+    ** Determine character to extend
+    */
+    lextend = added_internal+added_external+1-end_char_inserted;
+    if (use_tag==LEFT) {
+	/* Get character off end of cutoff */
+        if (!end_char_inserted) {
+	    extend_char = t->newcomment[l-1-added_internal];
+	    if (use_tag != dir) {
+		int_f i=1;
+		sqcom_(&extend_char,&i,(int_fl)1);
+	    }
+	}
+	t->newcomment[l-lextend]='\0';
+	t->newcommentlen = l-lextend;
+    } else {
+	/* Get character off start of cutoff */
+	char *nc;
+        if (!end_char_inserted) {
+	    extend_char = t->newcomment[added_internal];
+	    if (use_tag != dir) {
+		int_f i=1;
+		sqcom_(&extend_char,&i,(int_fl)1);
+	    }
+	}
+	for (nc=t->newcomment;*nc=nc[lextend];nc++);
+	t->newcommentlen -= lextend;
+    }
+
+
+
+    if (dir==LEFT){
+	int i,j;	
+	char *s;
+
+	/*
+	** insert base at start of sequence
+	*/
+	s=DBgetSeq(xx,seq);
+	for (j = DBgetLength(xx,seq)+1; j > 1; j--)
+            s[j-1] = s[j-2];
+	s[0] = extend_char;
+	DBsetLength(xx,seq,DBgetLength(xx,seq)+1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectInsertBase(xx,seq,1);
+
+	/*
+	** shift sequence left
+	*/
+        if (DBgetRelPos(xx,seq)==1) {
+	    for (i=1; i <= xx->DB_gelCount; i++) {
+		if (seq == xx->DBorder[i]) {
+		    for (j=i;j>1;j--)
+			xx->DBorder[j] = xx->DBorder[j-1];
+		    xx->DBorder[1] = seq;
+		} else
+		    DBsetRelPos(xx,xx->DBorder[i],DBgetRelPos(xx,xx->DBorder[i])+1);
+	    }
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	} else {
+	    DBsetRelPos(xx,seq,DBgetRelPos(xx,seq)-1);
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_REL_MODIFIED);
+
+	    for (i=1; seq != xx->DBorder[i] && i <= xx->DB_gelCount; i++);
+	    for (j=i; DBgetRelPos(xx,xx->DBorder[j-1]) > DBgetRelPos(xx,seq) ;j--)
+		xx->DBorder[j] = xx->DBorder[j-1];
+	    xx->DBorder[j] = seq;
+	}
+
+    } else {
+	/*
+	** insert base at end of sequence
+	*/
+	char *s;
+	int l=DBgetLength(xx,seq);
+	s=DBgetSeq(xx,seq);
+	s[l] = extend_char;
+	DBsetLength(xx,seq,l+1);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED);
+	selectInsertBase(xx,seq,l+1);
+    }
+
+    /*
+    ** Modify raw data stuff
+    */
+    {
+	tagStruct *t;
+	char type[5];
+	char name[FILE_NAME_LENGTH+1];
+	int llen,lcut,wlen;
+	t = (tagStruct *) DBgetTags(xx,seq);
+	force_comment(t);
+        sscanf(t->newcomment,"%6d%6d%6d%*s",&llen,&lcut,&wlen);
+	strncpy(type,&t->newcomment[18],4);
+	strncpy(name,&t->newcomment[22],FILE_NAME_LENGTH);
+	wlen+=lextend;
+	if (use_tag==LEFT) lcut-=lextend;
+        sprintf(t->newcomment,"%6d%6d%6d",llen,lcut,wlen);
+	strncat(t->newcomment,type,4);
+	strncat(t->newcomment,name,FILE_NAME_LENGTH);
+	t->newcommentlen = (int)strlen(t->newcomment);
+	t->flags |= TAG_COMMENT_CHANGED;
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+
+    }
+
+    if (dir == RIGHT) xx->cursorPos++;
+
+    /*
+    ** Lengths are a changing
+    */
+    calculateConsensusLength(xx);
+
+    return 0;
+
+}
+
+
+void meta_arrow (Widget widget, XEvent *event, int key)
+/*
+** Handle cut-off adjust
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+    int e_time = get_uniq_id(NULL);
+    int seq,pos;
+    int seq_length;
+
+    seq = xx->cursorSeq;
+    pos = xx->cursorPos;
+    seq_length = DBgetLength(xx,seq);
+
+    /*
+    ** determine which operation is to take place
+    */
+    if (key==LEFT) {
+	if (seq) {
+	    if (pos==1) {
+		if (! extend(xx,seq,LEFT,e_time)) {
+		    recordEdit (&bucketStack, xx, undoExtend, seq, pos,LEFT,e_time);
+		    redisplayWithCursor(xx);
+		}
+	    } else if (pos>seq_length) {
+	    	if (! unextend(xx,seq,RIGHT,e_time)) {
+		    recordEdit (&bucketStack, xx, undoUnextend, seq, pos, RIGHT, e_time);
+		    redisplayWithCursor(xx);
+		}
+	    } else
+		caretLeft(widget,event,NULL,0);
+	} else
+	    caretLeft(widget,event,NULL,0);
+    } else {
+	if (seq) {
+	    if (pos==1) {
+		if (! unextend(xx,seq,LEFT,e_time)) {
+		    redisplayWithCursor(xx);
+		    recordEdit (&bucketStack, xx, undoUnextend, seq, pos, LEFT,e_time);
+		}
+	    } else if (pos>seq_length) {
+		if (! extend(xx,seq,RIGHT,e_time)) {
+		    redisplayWithCursor(xx);
+		    recordEdit (&bucketStack, xx, undoExtend, seq, pos, RIGHT, e_time);
+		}
+	    } else
+		caretRight(widget,event,NULL,0);
+	} else
+	    caretRight(widget,event,NULL,0);
+    }
+
+}
+
+
+
+
+
+
+void meta_left (Widget widget, XEvent *event, String *params,
+		Cardinal *num_params)
+{
+    meta_arrow(widget,event,LEFT);
+}
+
+void meta_right (Widget widget, XEvent *event, String *params,
+		 Cardinal *num_params)
+{
+    meta_arrow(widget,event,RIGHT);
+}
+
+void meta_up (Widget widget, XEvent *event, String *params,
+	      Cardinal *num_params)
+/*
+** Handle cut-off adjust
+*/
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(widget));
+
+    dump_tags(xx,xx->cursorSeq);
+
+}
diff --git a/src/staden/extend.h b/src/staden/extend.h
new file mode 100644
index 0000000..828a70f
--- /dev/null
+++ b/src/staden/extend.h
@@ -0,0 +1,13 @@
+#ifndef _extend_h
+#define _extend_h
+
+#include "edUtils.h"
+
+extern void meta_left (Widget widget, XEvent *event, String *, Cardinal *);
+extern void meta_right (Widget widget, XEvent *event, String *, Cardinal *);
+extern void meta_up (Widget widget, XEvent *event, String *, Cardinal *);
+extern int extend(EdStruct *xx, int seq, int dir);
+extern int unextend(EdStruct *xx, int seq, int dir);
+extern int undo_unextend(EdStruct *xx, int seq, int dir, int time);
+
+#endif /* _extend_h */
diff --git a/src/staden/fmain.f b/src/staden/fmain.f
new file mode 100644
index 0000000..561491a
--- /dev/null
+++ b/src/staden/fmain.f
@@ -0,0 +1,3 @@
+      EXTERNAL FMAIN
+      CALL FMAIN()
+      END
diff --git a/src/staden/fort.h b/src/staden/fort.h
new file mode 100644
index 0000000..67b4952
--- /dev/null
+++ b/src/staden/fort.h
@@ -0,0 +1,86 @@
+#ifndef _FORT_H
+#define _FORT_H
+
+/*
+ * When passing INTEGERs from Fortran to C we need to know the type they'll be
+ * at the C end. We could define int_f as 'int *' (or 'long *') but this may
+ * cause obscureness and unreadability of C.
+ *
+ * Also we define the extra argument given to C by fortran for the length of
+ * an array passed. This may not necessarily be the same as int_f (either
+ * now or the future). So we define it as a separate type (int_fl).
+ */
+
+typedef int int_f;
+typedef int int_fl;
+
+/*
+ * Prototypes for Fortran functions. This helps to establish when we pass the
+ * wrong arguments to a function.
+ *
+ * Terminology:
+ * name_a : name is an array
+ * name_s : name is a string
+ * name_l : name is the length of items in array name_a or length of string
+ *          name_s
+ * name   : name is simple type
+ */
+
+/* dbsyscommon.f */
+void  readw_ (int_f *idevw,   int_f *n,       char *gel_a,    int_f *maxgel,
+	      int_fl gel_l); /* gel_l == *maxgel */
+void  writew_(int_f *idevw,   int_f *n,       char *gel_a,    int_f *maxgel,
+              int_fl gel_l);
+
+void  readn_ (int_f *idevn,   int_f *n,       char *name_s,   int_fl name_l);
+             /* name_l == DB_NAMELEN */
+void  writen_(int_f *idevn,   int_f *n,       char *name_s,   int_fl name_l);
+
+void  readr_ (int_f *idev_r,  int_f *n,       int_f *relpg,   int_f *lngthg,
+ 	      int_f *lnbr,    int_f *rnbr);
+void  writer_(int_f *idev_r,  int_f *n,       int_f *relpg,   int_f *lngthg,
+ 	      int_f *lnbr,    int_f *rnbr);
+
+void  pcon1_ (char  *chr,     int_f *chrsum);
+
+int_f indexs_(char  *c,       int_f *s);
+
+void  munotp_(char  *ret_s,   int_fl ret_l,   int_f *ip);
+
+void  gtconc_(char  *ret_s,   int_fl rel_l,   int_f *counts_a,int_f *idm,
+	      float *cut/*,   int_fl counts_l */);
+
+/* xsapConEdit.f */
+void  dojoin_(int_f *relpg_a, int_f *lngthg_a,int_f *lnbr_a,  int_f *rnbr_a,
+	      int_f *ngels,   int_f *nconts,  int_f *lnconl,  int_f *lnconr,
+	      int_f *idbsiz,  int_f *idevr,   int_f *idevw,   int_f *relx/*,
+	      int_fl relpg_l, int_fl lngthg_l,int_fl lnbr_l,  int_fl rnbr_l*/);
+
+/* subs89.f */
+void  sqcom_ (char  *seq_a,   int_f *idim,    int_fl seq_l);
+
+/* dbsysnew.f */
+void  readtg_(int_f *idevt,   int_f *i,       int_f *lpos,    int_f *llen,
+	      int_f *lcom,    int_f *ltype,   int_f *next);
+
+void  writtg_(int_f *idevt,   int_f *i,       int_f *lpos,    int_f *llen,
+	      int_f *lcom,    int_f *ltype,   int_f *next);
+
+void  readcc_(int_f *idevc,   int_f *i,       int_f *icnt,    int_f *next,
+	      char  *note_s,  int_fl note_l);
+
+void  writcc_(int_f *idevc,   int_f *i,       int_f *icnt,    int_f *next,
+	      char  *note_s,  int_fl note_l);
+
+void  sqrev_ (char  *seqnce_a,int_f *idim,    int_fl seqnce_l);
+
+/* seeme.f */
+void  flusho_();
+
+/* fmain.f */
+void  fmain_ ();
+
+/* xspec.f / nxspec.f */
+int_f xversn_();
+
+#endif
diff --git a/src/staden/fortran.h b/src/staden/fortran.h
new file mode 100644
index 0000000..90d44ef
--- /dev/null
+++ b/src/staden/fortran.h
@@ -0,0 +1,26 @@
+#ifndef _fortran_h
+#define _fortran_h
+
+#include "fort.h"
+
+/* This defines the interfaces between fortran and c */
+
+/*
+** COMMON /DEVILS/ IDEVT,IDEVC,IDBSIZ,RELPG
+*/
+
+/*
+** Map onto fortran common block holding RELPG array and IDEVRD
+*/
+/*
+** Snatch Fortran Common Block
+** Don't EVER EVER let Rodger see this!!
+*/
+extern struct {
+	 int_f idevt;
+	 int_f idevc;
+	 int_f idbsiz;
+	 int_f relpg[1000];
+	} devils_;
+
+#endif /* _fortran_h */
diff --git a/src/staden/fxmain.f b/src/staden/fxmain.f
new file mode 100644
index 0000000..46f7f93
--- /dev/null
+++ b/src/staden/fxmain.f
@@ -0,0 +1,4 @@
+      CALL XCMAIN()
+C      call handle_unaligned_traps
+C      call print_unaligned_summary
+      END
diff --git a/src/staden/gip.f b/src/staden/gip.f
new file mode 100644
index 0000000..fa2c599
--- /dev/null
+++ b/src/staden/gip.f
@@ -0,0 +1,672 @@
+C     GIP
+C   SCREEN OF 24 LINES, 80 CHARACTERS ASSUMED
+C
+C
+C   14 May 1991
+C      Modified to work on Sparcstations - connect digitiser to RS232-C port B
+C   24 Sept 1992
+C      digitizer is now specified by environment variable DIGITIZER
+C
+      SUBROUTINE FMAIN()
+      PARAMETER (IDMENU=30)
+      INTEGER XLANE(4),DY,DXMIN,XPEN,YPEN,YMEAN
+      INTEGER XLMENU(IDMENU),XRMENU(IDMENU)
+      INTEGER YBMENU(IDMENU),YTMENU(IDMENU)
+      CHARACTER CHARS(IDMENU),GEL(512),BASE(4),MENUE,GELNAM*40,ANS
+      INTEGER MENUP(IDMENU),SEQROW,SEQCOL,SEQLEN
+      INTEGER DEVNOS(6)
+      INTEGER ESC
+      CHARACTER*40 FILDIG,FILVT
+      CHARACTER SPACE,CESC,CBELL
+C   ALL SCREEN OUTPUT CONTROLLED USING CURSOR
+C   SCREEN SIZE MAXROW, MAXCOL
+C   DIRECTION FROM HERE IS INC (1 OR -1)
+      INTEGER TITLER,TITLEC,CURER,CUREC,CURFR,CURFC
+      COMMON /ESCCHR/ CESC
+      COMMON /BELCHR/ CBELL
+      PARAMETER (ESC=27,IBELL=7)
+      PARAMETER (INC=1,MAXR=24)
+      PARAMETER (TITLER=1,TITLEC=20,LORDR=3,LORDC=20,
+     +IERRR=MAXR,IERRC=10,
+     +INSTR1=MAXR-INC,INSTC1=10,INSTR2=MAXR-2*INC,INSTC2=10,
+     +CURER=MAXR-4*INC,CUREC=1,CURFR=2,CURFC=20)
+      PARAMETER (SEQROW=6,SEQCOL=10,SEQLEN=60)
+C      PARAMETER (FILDIG='/dev/ttyb')
+      PARAMETER (FILDIG='DIGITIZER')
+      DATA XLMENU/0,
+     +0,200,400,600,0,200,400,600,
+     +0,200,400,600,0,200,400,600,
+     +0,200,400,600,0,200,400,600,
+     +0,400,0,0,0/
+      DATA XRMENU/800,
+     +200,400,600,800,200,400,600,800,
+     +200,400,600,800,200,400,600,800,
+     +200,400,600,800,200,400,600,800,
+     +400,800,800,800,800/
+      DATA YBMENU/000,
+     +1800,1800,1800,1800,
+     +1600,1600,1600,1600,
+     +1400,1400,1400,1400,
+     +1200,1200,1200,1200,
+     +1000,1000,1000,1000,
+     +800,800,800,800,
+     +600,600,400,200,0/
+      DATA YTMENU/2000,2000,2000,2000,2000,
+     +1800,1800,1800,1800,
+     +1600,1600,1600,1600,
+     +1400,1400,1400,1400,
+     +1200,1200,1200,1200,
+     +1000,1000,1000,1000,
+     +800,800,600,400,200/
+      DATA CHARS/'W',
+     +'T','C','G','A',
+     +'2','1','4','3',
+     +'V','D','H','B',
+     +'L','K','N','M',
+     +'R','Y','X','-',
+     +'5','6','7','8',
+     +'D','R','S','S','C'/
+      DATA MENUP/0,1,2,3,4,5,6,7,8,9,10,
+     +           11,12,13,14,15,16,17,18,19,20,
+     +           21,22,23,24,50,51,53,52,56/
+      DATA SPACE/' '/
+      DATA BASE/'T','C','G','A'/
+C   PROBLEM WITH DEFINING ESCAPE AND BELL SO DO IT HERE AND PUT IT COMMON
+C      WRITE(CESC,2000)ESC
+      CESC=CHAR(ESC)
+2000  FORMAT(A1)
+      CBELL=CHAR(IBELL)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,6)
+C   OPEN LOGICAL UNIT FOR SOME TERMINAL OUTPUT
+      CALL OPENT(DEVNOS(6),FILVT)
+C   CLEAR SCREEN
+      CALL CLEAR
+C   WRITE TITLE
+      CALL CURSOR(TITLER,TITLEC)
+      CALL WRITES('GIP v1.0 Author: Rodger Staden')
+C   define allowed Y distance from current pen position
+      DY=80
+      CALL OPENRS(DEVNOS(1),FILDIG,IOK,LRECL,8)
+      IF(IOK.NE.0)THEN
+        CALL CURSOR(IERRR,1)
+        CALL BLINE
+        CALL CURSOR(IERRR,IERRC)
+        CALL WRITES('******UNABLE TO OPEN DIGITIZER******')
+          STOP
+      END IF
+C   FLUSH DIGITIZER
+C      CALL FLUSH(DEVNOS(1))
+      CALL CURSOR(INSTR1,1)
+      CALL BLINE
+      CALL CURSOR(INSTR1,INSTC1)
+      CALL WRITES('The film must be firmly fixed to the light box')
+5     CONTINUE
+      CALL CURSOR(CURFR,1)
+      CALL BLINE
+      CALL CURSOR(CURFR,CURFC)
+      CALL WRITES('? File of file names=')
+      READ(KBIN,1000,ERR=5)GELNAM
+1000  FORMAT(A)
+      CALL OPENRS(DEVNOS(3),GELNAM,IOK,LRECL,1)
+      IF(IOK.NE.0)THEN
+        CALL CURSOR(IERRR,1)
+        CALL BLINE
+        CALL CURSOR(IERRR,IERRC)
+        CALL WRITES('******ERROR OPENING FILE******')
+        GO TO 5
+      END IF
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+      CALL WRITES(
+     +'Hit device menu origin, program origin, then hit start')
+90    CONTINUE
+      CALL READPN(XPEN,YPEN,1,DEVNOS(1),KBOUT)
+C   START?
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+      IF(MENU.NE.52)GO TO 90
+100   CONTINUE
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+C   GET LANE ORDER
+120   CONTINUE
+C   GET LANE ORDER
+      CALL LORDER(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,
+     +CHARS,MENUE,IDMENU,BASE,DEVNOS(1),KBOUT,INSTR2,INSTC2,
+     +LORDR,LORDC)
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+C   GET LANE COORDS
+      CALL GETLAN(XLANE,YMEAN,DXMIN,
+     +XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,MENUE,IDMENU,
+     +DEVNOS(1),KBOUT,INSTR2,INSTC2)
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+C   POINT TO GEL
+      IPGEL=0
+******************************************************
+200   CONTINUE
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+      CALL WRITES('Hit start when ready to begin reading')
+      CALL READPN(XPEN,YPEN,1,DEVNOS(1),KBOUT)
+C   START?
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+      IF(MENU.EQ.52)THEN
+        CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+        CALL WRITES
+     +  ('Hit bands, uncertainty codes, DELETE, RESET or STOP')
+        CALL READG(
+     +  XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +  IDMENU,XLANE,YMEAN,DY,DXMIN,BASE,GEL,IPGEL,
+     +  SEQROW,SEQCOL,SEQLEN,DEVNOS(1),KBOUT,INSTR2,INSTC2)
+        CALL CURSOR(INSTR2,INSTC2)
+        IF(IPGEL.GT.0)THEN
+50        CONTINUE
+          CALL CURSOR(INSTR2,1)
+          CALL BLINE
+          CALL CURSOR(INSTR2,INSTC2)
+          CALL WRITES('File name for this gel reading=')
+          READ(KBIN,1000,ERR=50)GELNAM
+          CALL OPENRS(DEVNOS(5),GELNAM,IOK,LRECL,1)
+          IF(IOK.EQ.0)THEN
+            CALL FMTDK(DEVNOS(5),GEL,IPGEL)
+            CLOSE(UNIT=DEVNOS(5))
+            WRITE(DEVNOS(3),1008)GELNAM
+1008        FORMAT(A)
+            CALL CURSOR(INSTR2,INSTC2)
+          ELSE
+            CALL CURSOR(IERRR,1)
+            CALL BLINE
+            CALL CURSOR(IERRR,IERRC)
+            CALL WRITES('******ERROR OPENING FILE******')
+            GO TO 50
+          END IF
+C   CLEAR GEL FROM SCREEN
+          DO 441 I=1,IPGEL
+            CALL WRITEG(' ',SEQCOL,SEQROW,SEQLEN,I)
+441       CONTINUE
+        END IF
+        CALL CURSOR(INSTR2,1)
+        CALL BLINE
+        CALL CURSOR(INSTR2,INSTC2)
+        CALL WRITES('? (y/n) (y) Read another sequence ')
+        READ(KBIN,1010)ANS
+1010    FORMAT(A1)
+        IF((ANS.EQ.' ').OR.(ANS.EQ.'Y').OR.(ANS.EQ.'y'))GO TO 100
+        CALL CLEAR
+        STOP
+      END IF
+      GO TO 200
+      END
+C   DIGIT
+C   AUTHOR RODGER STADEN
+C   ROUTINES USED BY GELIN
+C   NONSTANDARD: CARRIAGE CONTROL $ USED BY ROUTINE BELL
+C   ROUTINES IN THIS LIB:
+C   READG
+C   INOUT
+C   LANE
+C   GETLAN
+C   GETMEN
+C   LORDER
+C   WRITEG
+C   INTERP
+C   READPN
+      SUBROUTINE READG(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XLANE,YMEAN,DY,DXMIN,BASE,GEL,IPGEL,SEQROW,SEQCOL,
+     +SEQLEN,IDEVD,KBOUT,INSTR2,INSTC2)
+C   AUTHOR RODGER STADEN
+C   READS A SEQUENCE USING THE DIGITIZER UNTIL A STOP COMMAND IS GIVEN
+C   LANE COORDINATES CAN BE RESET.
+      INTEGER XLANE(4),DY,DXMIN,XPEN,YPEN,YMEAN
+      INTEGER XLMENU(IDMENU),XRMENU(IDMENU)
+      INTEGER YBMENU(IDMENU),YTMENU(IDMENU)
+      CHARACTER CHARS(IDMENU),GEL(512),BASE(4),MENUE,LASTCH
+      INTEGER MENUP(100),SEQCOL,SEQROW,SEQLEN,INSTR2,INSTC2
+      EXTERNAL LANE
+      LASTCH=' '
+300   CONTINUE
+      CALL READPN(XPEN,YPEN,0,IDEVD,KBOUT)
+C   IN LANES?
+C*****************************************************
+      L=LANE(XLANE,YMEAN,DY,DXMIN,XPEN,YPEN)
+      IF(L.NE.0)THEN
+        CALL GBELL(1,KBOUT)
+C   IN LANES
+C   EXPECTING A BAND SO DEAL WITH IT
+C   INCREMENT POINTER TO GEL CHARACTERS
+        IPGEL=IPGEL+1
+        GEL(IPGEL)=BASE(L)
+C   PUT ON SCREEN ETC
+        CALL WRITEG(GEL(IPGEL),SEQCOL,SEQROW,SEQLEN,IPGEL)
+        LASTCH='C'
+      ELSE
+C   IN MENU?
+        CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +          IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+C   UNCERTAINTY CODE?
+        IF((MENU.GT.0).AND.(MENU.LT.25))THEN
+C   UNCERTAINTY CODE
+          CALL GBELL(1,KBOUT)
+          IPGEL=IPGEL+1
+          GEL(IPGEL)=MENUE
+          LASTCH='C'
+          CALL WRITEG(GEL(IPGEL),SEQCOL,SEQROW,SEQLEN,IPGEL)
+C   DELETE?
+        ELSE IF(MENU.EQ.50)THEN
+C   DELETE
+C   IF LAST CHAR A BASE THEN NEED TO REMOVE IT FROM GEL
+          IF((LASTCH.EQ.'C').AND.(IPGEL.GT.0))THEN
+            CALL GBELL(1,KBOUT)
+            CALL WRITEG(' ',SEQCOL,SEQROW,SEQLEN,IPGEL)
+            IPGEL=IPGEL-1
+          END IF
+C   RESET LANE CENTRES?
+        ELSE IF(MENU.EQ.51)THEN
+          CALL GBELL(1,KBOUT)
+          CALL GETLAN(XLANE,YMEAN,DXMIN,
+     +    XLMENU,XRMENU,YBMENU,YTMENU,
+     +    MENUP,CHARS,MENUE,IDMENU,
+     +    IDEVD,KBOUT,INSTR2,INSTC2)
+          CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+          CALL WRITES
+     +    ('Hit bands, uncertainty codes, DELETE, RESET or STOP')
+C   STOP?
+        ELSE IF(MENU.EQ.53)THEN
+          CALL GBELL(1,KBOUT)
+          RETURN
+C   FOR NOW NOT FOUND SO IGNORE
+       END IF
+      END IF
+C   NOT IN MENU
+C   NOT IN MENU OR LANES SO IGNORE
+      GO TO 300
+      END
+C   INOUT
+C
+C   FUNCTION TO RETURN VALUE 1 IF XPEN,YPEN IN BOX, 0 ELSE
+C   AUTHOR RODGER STADEN
+      INTEGER FUNCTION INOUT(X1,X2,Y1,Y2,XPEN,YPEN)
+      IMPLICIT INTEGER (A-Z)
+      INOUT=0
+      IF((XPEN.LT.X1).OR.(XPEN.GT.X2))RETURN
+      IF((YPEN.LT.Y1).OR.(YPEN.GT.Y2))RETURN
+      INOUT=1
+      RETURN
+      END
+C   LANEORDER
+C   GETS LANE ORDER FROM TABLET
+      SUBROUTINE LORDER(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,
+     +CHARS,MENUE,IDMENU,BASE,IDEVD,KBOUT,INSTR2,INSTC2,
+     +LORDR,LORDC)
+C   AUTHOR RODGER STADEN
+      INTEGER  XLMENU(IDMENU),XRMENU(IDMENU)
+      INTEGER YBMENU(IDMENU),YTMENU(IDMENU)
+      INTEGER MENUP(IDMENU),INSTR2,INSTC2
+      CHARACTER BASE(4),BASES(4),CHARS(IDMENU),MENUE,SPACE
+      CHARACTER BASET*4
+      PARAMETER (IERRR=24,IERRC=10)
+      SAVE BASES,SPACE
+      DATA SPACE/' '/
+      DATA BASES/'T','C','G','A'/
+C
+10    CONTINUE
+      DO 5 I=1,4
+        BASET(I:I)=BASE(I)
+5     CONTINUE
+C
+      CALL CURSOR(LORDR,1)
+      CALL BLINE
+      CALL CURSOR(LORDR,LORDC)
+      CALL WRITES('Lane order is ')
+      CALL WRITES(BASET)
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+      CALL WRITES('If lane order correct hit CONFIRM, else hit RESET')
+      CALL READPN(XPEN,YPEN,1,IDEVD,KBOUT)
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+      IF(MENU.EQ.56)THEN
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+        CALL CURSOR(IERRR,1)
+        CALL BLINE
+        RETURN
+      END IF
+      IF(MENU.EQ.51)THEN
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+        CALL CURSOR(INSTR2,INSTC2)
+        CALL WRITES('Define lane order, left to right, using menu')
+C   COUNT POINTS
+        IP=1
+20      CONTINUE
+        IF(IP.LE.4)THEN
+           CALL READPN(XPEN,YPEN,1,IDEVD,KBOUT)
+           CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,
+     +     CHARS,IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+           DO 30 I=1,4
+              IF(MENUE.EQ.BASES(I))THEN
+                 BASE(IP)=MENUE
+                 GO TO 31
+              END IF
+30         CONTINUE
+31         CONTINUE
+           IP=IP+1
+           GO TO 20
+        END IF
+C     ALL DIFFERENT?
+        DO 40 I=1,4
+          DO 41 J=I+1,4
+          IF(BASE(I).EQ.BASE(J))THEN
+C           SAME!
+            CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(IERRR,IERRC)
+            CALL WRITES('*****DUPLICATION IN LANE ORDER*****')
+            DO 35 K=1,4
+               BASE(K)=BASET(K:K)
+35          CONTINUE
+            GO TO 10
+          END IF
+41       CONTINUE
+40      CONTINUE
+      END IF
+      GO TO 10
+      END
+C   INTERPXY
+C
+C   INTERPRETS X,Y COORDS FROM DIGITIZER
+      SUBROUTINE INTERP(XL,XR,YB,YT,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+C   AUTHOR RODGER STADEN
+      INTEGER XL(IDMENU),XR(IDMENU),YB(IDMENU),YT(IDMENU)
+      INTEGER MENUP(IDMENU),XPEN,YPEN
+      CHARACTER CHARS(IDMENU),MENUE
+      EXTERNAL INOUT
+      PARAMETER (IERRR=24,IERRC=10)
+      MENU=0
+      MENUE=CHARS(1)
+C   IN MENU AT ALL?
+      IF(INOUT(XL(1),XR(1),YB(1),YT(1),XPEN,YPEN).EQ.0)RETURN
+      DO 100 I=2,IDMENU
+        IF(INOUT(XL(I),XR(I),YB(I),YT(I),XPEN,YPEN).NE.1)GO TO 100
+C       MATCH
+        MENU=MENUP(I)
+        MENUE=CHARS(I)
+        RETURN
+100   CONTINUE
+C   ERROR IN MENU
+      MENU=0
+      MENUE=CHARS(1)
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+      CALL CURSOR(IERRR,IERRC)
+      CALL WRITES('******ERROR IN MENU******')
+      RETURN
+      END
+C
+C   READPEN GETS COORDS FROM DIGITIZER
+      SUBROUTINE READPN(XPEN,YPEN,IBELL,IDEVD,KBOUT)
+C   AUTHOR RODGER STADEN
+      CHARACTER INPUT*11,INPUTX*5,INPUTY*5
+      INTEGER XPEN,YPEN
+C      EQUIVALENCE (INPUT(2:6),INPUTX),(INPUT(7:11),INPUTY)
+C  NOTE ON IBM NEXT LINE EQUIVALENCE USED
+      EQUIVALENCE (INPUT(1:5),INPUTX),(INPUT(6:10),INPUTY)
+      PARAMETER (IERRR=24,IERRC=10)
+1000  FORMAT(A)
+1     CONTINUE
+      READ(IDEVD,1000,ERR=1)INPUT
+1002  FORMAT(I5)
+      READ(INPUTX,1002,ERR=100)XPEN
+      READ(INPUTY,1002,ERR=100)YPEN
+      IF(IBELL.EQ.1)CALL GBELL(1,KBOUT)
+C READ 2nd NEWLINE ON SUN
+      READ(IDEVD,1000,ERR=1)INPUT
+      RETURN
+100   CONTINUE
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+      CALL CURSOR(IERRR,IERRC)
+      CALL WRITES('*********ERROR READING PEN********')
+      RETURN
+      END
+C   WRITEGEL
+      SUBROUTINE WRITEG(CHAR,STARTX,STARTY,LENGTH,IPGEL)
+C   AUTHOR RODGER STADEN
+C   WRITE OUT A CHARACTER
+      CHARACTER CHAR
+      INTEGER STARTX,STARTY,ROW,COL
+      ROW=(IPGEL-1)/LENGTH
+      COL=MOD(IPGEL,LENGTH)
+      IF(COL.EQ.0)COL=LENGTH
+      ROW=STARTY+ROW
+      COL=STARTX+COL
+      CALL CURSOR(ROW,COL)
+      CALL WRITES(CHAR)
+      RETURN
+      END
+C
+C   GETLANE
+      SUBROUTINE GETLAN(XLANE,YMEAN,DXMIN,
+     +XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,MENUE,IDMENU,
+     +IDEVD,KBOUT,INSTR2,INSTC2)
+C   AUTHOR RODGER STADEN
+C   GETS LANE ORDER
+      INTEGER XLANE(4)
+      INTEGER YTEMP(4)
+      INTEGER XLMENU(IDMENU),XRMENU(IDMENU)
+      INTEGER YBMENU(IDMENU),YTMENU(IDMENU)
+      INTEGER MENUP(IDMENU),INSTR2,INSTC2
+      INTEGER XPEN,YPEN,YMEAN,DXMIN
+      CHARACTER CHARS(IDMENU),MENUE,SEPAR*10
+      PARAMETER (IERRR=24,IERRC=10)
+      INSTR3=4
+      INSTC3=20
+100   CONTINUE
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+      CALL WRITES
+     +('Hit START, then, left to right,')
+      CALL WRITES(' the start centres of this clones lanes')
+      CALL READPN(XPEN,YPEN,1,IDEVD,KBOUT)
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+C   START?
+      IF(MENU.NE.52)GO TO 100
+C   GOT START, EXPECT LANES NOW
+200   CONTINUE
+      DO 300 I=1,4
+      CALL READPN(XPEN,YPEN,1,IDEVD,KBOUT)
+C   IN MENU?
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+      IF(MENU.NE.0)GO TO 100
+C   NOT IN MENU SO RECORD POSITION
+      XLANE(I)=XPEN
+      YTEMP(I)=YPEN
+300   CONTINUE
+      YMEAN=(YTEMP(1)+YTEMP(2)+YTEMP(3)+YTEMP(4))/4
+C   FIND LANE SEPARATION
+      DXMIN=(XLANE(4)-XLANE(1))/3
+      WRITE(SEPAR,1011)DXMIN
+      CALL CURSOR(INSTR3,1)
+      CALL BLINE
+      CALL CURSOR(INSTR3,INSTC3)
+      CALL WRITES('Mean lane separation = ')
+      CALL WRITES(SEPAR)
+      CALL CURSOR(INSTR2,1)
+      CALL BLINE
+      CALL CURSOR(INSTR2,INSTC2)
+      CALL WRITES(
+     +'If separation ok hit CONFIRM, else hit RESET')
+1011  FORMAT(I6)
+      CALL READPN(XPEN,YPEN,1,IDEVD,KBOUT)
+C   IN MENU?
+      CALL INTERP(XLMENU,XRMENU,YBMENU,YTMENU,MENUP,CHARS,
+     +IDMENU,XPEN,YPEN,MENU,MENUE,KBOUT)
+C
+      IF(MENU.NE.56)GO TO 100
+C   SET MINIMUM LANE CENTRE CLOSENESS TO 50% OF SEPARATION
+      DXMIN=DXMIN/2
+C   CLEAR ERROR LINE IN CASE ITS BEEN USED
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+      RETURN
+      END
+C   LANE
+C   RETURNS THE LANE NUMBER FOR GELIN
+      INTEGER FUNCTION LANE(XLANE,YMEAN,DY,DXMIN,XPEN,YPEN)
+C   AUTHOR RODGER STADEN
+      INTEGER XLANE(4),YMEAN,D(4)
+      INTEGER XPEN,YPEN,DXMIN,DY,X1,X2,Y1,Y2,DMIN
+      EXTERNAL INOUT
+      LANE=0
+C   IS THE POINT IN THE LANES?
+      X1=XLANE(1)-DXMIN
+      X2=XLANE(4)+DXMIN
+      Y1=YMEAN-DY
+      Y2=YMEAN+DY
+      IF(INOUT(X1,X2,Y1,Y2,XPEN,YPEN).EQ.0)RETURN
+C
+C   IN LANES
+C  WHICH IS NEAREST XLANE VALUE
+      DO 10 I=1,4
+        D(I)=ABS(XLANE(I)-XPEN)
+10    CONTINUE
+C   WHICH IS LEAST?
+      DMIN=D(1)
+      ICLOSE=1
+      DO 20 I=2,4
+      IF(D(I).GT.DMIN)GO TO 20
+C  CLOSER
+      ICLOSE=I
+      DMIN=D(I)
+20    CONTINUE
+C   CLOSEST TO ICLOSE SO SET THIS CHAR AND SHIFT THIS LANE CENTRE
+      XLANE(ICLOSE)=XPEN
+      YMEAN=YPEN
+      LANE=ICLOSE
+C   MAKE SURE LANES ARE NOT TOO CLOSE
+      DO 30 I=2,4
+        IF(ABS(XLANE(I-1)-XLANE(I)).LT.DXMIN)LANE=0
+30    CONTINUE
+C  RESET LANE SEPARATION
+      DXMIN=(XLANE(4)-XLANE(1))/6
+      RETURN
+      END
+C   BELL
+C       SUBROUTINE TO RING BELL N TIMES
+      SUBROUTINE GBELL(N,KBOUT)
+C   AUTHOR RODGER STADEN
+      CHARACTER CBELL
+      COMMON /BELCHR/ CBELL
+      CALL WRITES(CBELL)
+      RETURN
+      END
+        SUBROUTINE OPENT(IDEV,FILNAM)
+C   AUTHOR: RODGER STADEN
+        CHARACTER FILNAM*(*)
+        COMMON /TERM/IDEVT
+      SAVE /TERM/
+      PARAMETER (IERRR=24,IERRC=10)
+      IDEVT=IDEV
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,10)
+      IF(IOK.EQ.0)RETURN
+      CALL CURSOR(IERRR,1)
+      CALL BLINE
+      CALL CURSOR(IERRR,IERRC)
+      CALL WRITES('****ERROR OPENING SCREEN FOR OUTPUT****')
+      RETURN
+      END
+      SUBROUTINE CLEAR
+C   AUTHOR: RODGER STADEN
+       CHARACTER STRING*4,CESC
+      COMMON /ESCCHR/ CESC
+      DATA STRING(2:4)/'[2J'/
+      STRING(1:1)=CESC
+C   CLEAR SCREEN
+      CALL WRITES(STRING)
+      RETURN
+      END
+      SUBROUTINE WRITES(CHARS)
+C   AUTHOR: RODGER STADEN
+      CHARACTER CHARS*(*)
+      INTEGER NUL
+        COMMON /TERM/IDEVT
+      SAVE /TERM/
+      DATA NUL/0/
+C        WRITE(IDEVT,1000)NUL,CHARS
+C1000  FORMAT(A1,A)
+        WRITE(IDEVT,1000)CHARS
+1000  FORMAT(A,$)
+      CALL FLUSH(IDEVT)
+      RETURN
+      END
+        SUBROUTINE CURSOR(LINE,COLUMN)
+C   AUTHOR: RODGER STADEN
+C   SETS CURSOR TO LINE AND COLUMN
+      INTEGER COLUMN
+       CHARACTER BLIN*2,BCOL*2,SPACE,ZERO,STRING*8,CESC
+      COMMON /ESCCHR/ CESC
+      EQUIVALENCE (STRING(3:4),BLIN),(STRING(6:7),BCOL)
+      SAVE STRING,ZERO,SPACE
+      DATA STRING(2:2)/'['/,STRING(8:8)/'H'/
+      DATA STRING(5:5)/';'/
+      DATA SPACE/' '/,ZERO/'0'/
+      STRING(1:1)=CESC
+      WRITE(BCOL,1002,ERR=100)COLUMN
+      WRITE(BLIN,1002,ERR=100)LINE
+1002  FORMAT(I2.2)
+C  NEED TO SET SPACES TO ZEROS
+C      IF(BCOL(1:1).EQ.SPACE)BCOL(1:1)=ZERO
+C      IF(BLIN(1:1).EQ.SPACE)BLIN(1:1)=ZERO
+      CALL WRITES(STRING)
+      RETURN
+100   CONTINUE
+      WRITE(*,*)'ERROR IN CURSOR SUBROUTINE'
+      RETURN
+      END
+      SUBROUTINE BLINE
+      CHARACTER BLANK*79
+      SAVE BLANK
+      DATA BLANK/' '/
+      CALL WRITES(BLANK)
+      RETURN
+      END
+      SUBROUTINE FMTDK(IDEV,SEQNCE,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM)
+C   SET POINTERS TO FIRST AND LAST ELEMENTS ONE WRITE
+      JS=1
+      JE=60
+10    CONTINUE
+C   SET JE TO LAST ELEMENT IF NECESSARY
+      IF(JE.GT.IDIM)JE=IDIM
+      WRITE(IDEV,1002)(SEQNCE(I),I=JS,JE)
+1002  FORMAT(' ',60A1)
+C   TEST FOR END
+      IF(JE.EQ.IDIM)GO TO 20
+C   INCREMENT FIRST AND LAST POINTERS
+      JS=JE+1
+      JE=JE+60
+      GO TO 10
+20    CONTINUE
+      RETURN
+      END
diff --git a/src/staden/help.c b/src/staden/help.c
new file mode 100644
index 0000000..2041c67
--- /dev/null
+++ b/src/staden/help.c
@@ -0,0 +1,305 @@
+/* 
+    Title:       help
+    
+    File:        help.c
+    Purpose:	 Routines to provide a help service
+    Last update: Wed Jun 20 1990
+*/
+
+
+/*
+    This module provides a help service, including an independent
+    help widget.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "help.h"
+#include "FtoC.h"      /* IMPORT: Fstr2Cstr */
+#include "progSpec.h"  /* IMPORT: botHelpOpt, topHelpOpt,
+			          helpTextFN, helpPtrsFN */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf, fscanf, fseek, 
+			          fclose, EOF */
+#include "mystdlib.h"  /* IMPORT: calloc */
+/*#include <string.h>*/    /* IMPORT: strcat, strcpy */
+#include "values.h"    /* IMPORT: MAXLONG */
+#include "fort.h"
+#include "helpnmenu.h" /* IMPORT: createmenu, optTransTab, helptopics */
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/List.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Viewport.h>
+#include <X11/Xaw/Paned.h>
+
+/* ---- Constants ---- */
+
+#define FORTRANRecLen 80
+
+/* ---- Global variables ---- */
+
+extern Dimension dialogue_width;
+extern Dimension dialogue_height;
+
+/* ---- Static variables ---- */
+
+static Widget helpShellWid; 
+static Widget labelWid;     /* Which says `Currently displaying: ...' */
+static Widget formWid;      /* Which holds either ... */
+static Widget panedWid;     /* Which holds either ... */
+static Widget listWid;      /* ... the list of topics or ... */
+static Widget listWid1;     /* ... the list of topics or ... */
+static Widget textWid;      /* ... which holds help on the current topic */
+static Widget topicsButton; /* 'Display/Remove topics' */
+static Boolean helpShellMapped = False;
+
+#define labelLen 256
+static char label[labelLen];
+
+static void DisplayTopic(int index)
+/*
+    Fill the text widget with help on the selected option.
+    Ensure the text widget is displayed.
+*/
+{
+    int i;
+    XawTextBlock textBlock;
+    char *cptr;
+
+    /* Clear the text widget */
+    textBlock.firstPos=0;
+    textBlock.length=0;
+    textBlock.ptr=0;
+    textBlock.format=FMT8BIT;
+    XawTextReplace(textWid, (long) 0, MAXLONG, &textBlock);
+
+    /* insert the lines of text into the text widget */
+    while (cptr = gethelp2(index)) {
+	textBlock.ptr=cptr;
+        textBlock.length=strlen(cptr);	
+        XawTextReplace(textWid, MAXLONG, MAXLONG, &textBlock);
+    }
+
+    strcpy(label, "Currently displaying: ");
+    strcat(label, helpindex[index].name);
+
+    XtVaSetValues(labelWid, XtNlabel, label, NULL);
+
+    /* Ensure correct topic is highlighted */
+    for (i = 0; i<MAXOPTS; i++)
+	if (optTransTab[i] == index) {
+	    XawListHighlight(listWid, i);
+	    break;
+	}
+}
+
+/* ---- Callback routines ---- */
+
+
+static void RemoveCallback(Widget w,
+			   XtPointer client_data, XtPointer call_data)
+{    XtPopdown(helpShellWid);
+     helpShellMapped=False;
+}
+
+
+static void ListCallback(Widget w,
+			 XtPointer client_data,
+			 XtPointer call_data)
+{
+    DisplayTopic(optTransTab[((XawListReturnStruct *)call_data)->list_index]);
+}
+
+static void topicsCallback(Widget w,XtPointer client_data,XtPointer call_data)
+{
+    static int state = 0;
+    
+    if (state ^= 1) {
+	XtUnmanageChild(listWid1);
+	XtVaSetValues(topicsButton, XtNlabel, "Display topics", NULL);
+    } else {
+	/* Make sure we have things in the correct order. */
+	XtUnmanageChild(textWid);
+	XtVaSetValues(listWid1, XtNpreferredPaneSize, 100, NULL);
+	XtManageChild(listWid1);
+	XtManageChild(textWid);
+	XtVaSetValues(topicsButton, XtNlabel, "Remove  topics", NULL);
+    }
+}
+
+/* ---- Exported routines ---- */
+
+Widget CreateHelpShell(Widget toplevelWid)
+/*
+    Create the help shell as a child of `toplevelWid', initially with
+    the list of topics available.
+    The shell is not initially mapped.
+*/
+{   Widget removeButton;
+
+    /*
+        The toplevel shell widget holds a form widget
+    */
+    helpShellWid = XtVaCreatePopupShell("Help", topLevelShellWidgetClass,
+				      toplevelWid, NULL);
+
+    XtVaSetValues(helpShellWid,
+		  XtNwidth, dialogue_width,
+		  XtNminWidth, dialogue_width,
+		  XtNheight, dialogue_height * 4/3,
+		  XtNminHeight, dialogue_height * 4/3, NULL);
+
+    formWid = XtVaCreateManagedWidget("helpForm", formWidgetClass,
+				      helpShellWid,
+				      XtNresizable, True,
+				      NULL);
+
+
+    /*
+        The form widget holds a label and two button ...
+    */
+    strcpy(label, "Currently displaying: ");
+    if (helpindex[0].name)
+	strcat(label, helpindex[0].name);
+
+    labelWid = XtVaCreateManagedWidget("currDisp", labelWidgetClass, formWid,
+				       XtNlabel, label,
+				       XtNfromVert, labelWid,
+				       NULL);
+
+    removeButton = XtVaCreateManagedWidget("rmHelp", commandWidgetClass,
+					   formWid,
+					   XtNfromVert, labelWid,
+					   NULL);
+    XtAddCallback(removeButton, XtNcallback, RemoveCallback, NULL);
+
+    topicsButton = XtVaCreateManagedWidget("topipButton", commandWidgetClass,
+					   formWid,
+					   XtNlabel, "Remove  topics",
+					   XtNfromVert, labelWid,
+					   XtNfromHoriz, removeButton,
+					   XtNresize, True,
+					   NULL);
+    XtAddCallback(topicsButton, XtNcallback, topicsCallback, NULL);
+
+    /*
+     * Create a form widget specifically to hold only the text widget
+     * or the list widget & viewport.
+     */
+    panedWid = XtVaCreateManagedWidget("helpPane", panedWidgetClass,
+				       formWid, 
+				       XtNfromVert, removeButton,
+				       XtNtop, XtChainTop,
+				       XtNbottom, XtChainBottom,
+				       XtNleft, XtChainLeft,
+				       XtNright, XtChainRight,
+				       NULL);
+    
+    /*
+        The pane widget also holds (togglable)
+        (a) a list widget and a viewport....
+    */
+    listWid1 = XtVaCreateManagedWidget("topicsViewport", viewportWidgetClass,
+				       panedWid,
+				       XtNfromVert, removeButton,
+				       XtNallowVert, True,
+				       XtNforceBars, True,
+				       XtNpreferredPaneSize, dialogue_height/2,
+				       NULL);
+
+    listWid = XtVaCreateManagedWidget("topicsList", listWidgetClass,
+				      listWid1,
+				      XtNlist, helptopics,
+				      NULL);
+    XtAddCallback(listWid, XtNcallback, ListCallback, NULL);
+
+    /*
+        ... and a ...
+	(b) a text widget
+    */
+    textWid = XtVaCreateManagedWidget("text", asciiTextWidgetClass, panedWid,
+				      XtNeditType, XawtextEdit,
+				      XtNscrollVertical, XawtextScrollAlways,
+				      NULL);
+
+    return(helpShellWid);
+}
+
+
+
+
+void help(int topic)
+/*
+    Ensure the help shell is displayed, with ``topic'' selected.
+*/
+{   if (!helpShellMapped)
+    {   XtPopup(helpShellWid, XtGrabNone);
+	helpShellMapped = True;
+    }
+
+    DisplayTopic(topic);
+}
+
+
+
+
+void help_x(int_f *HELPS_p,
+	    int_f *HELPE_p,
+	    int_f *BOTOPT_p,
+	    int_f *TOPOPT_p,
+	    char  *HELPF_p,
+	    int_f *IDEV_p,
+	    int_f *KBIN_p,
+	    int_f *KBOUT_p,
+	    int_fl HELPF_l)
+/*
+    This function does nothing
+*/
+{
+}
+
+
+
+
+void help2_x(int_f *HELPS_p, int_f *HELPE_p, char *HELPF_p,
+	     int_f *IDEV_p,
+	     int_f *KBIN_p, int_f *KBOUT_p,
+	     int_fl HELPF_l)
+/*
+    This function does nothing
+*/
+{
+}
+
+
+
+
+void sethelp_x(int_f  HELPS[],  /* Array of start record numbers */
+	       int_f  HELPE[],  /* Array of end   record numbers */
+	       int_f *BOTOPT_p, /* Lower bound */
+	       int_f *TOPOPT_p, /* Upper bound */
+	       char  *POINTF_p, /* Name of help record pointer file */
+	       int_f *IDEV_p,
+	       int_f *KBOUT,
+	       int_fl POINTF_l)
+/*
+    This function does nothing
+*/
+{
+}
+
+
+
+
+
+
+
diff --git a/src/staden/help.h b/src/staden/help.h
new file mode 100644
index 0000000..f5762d3
--- /dev/null
+++ b/src/staden/help.h
@@ -0,0 +1,93 @@
+/* 
+    Title:       help
+
+    File: 	 help.h
+    Purpose:	 Routines to provide a help service
+    Last update: Mon May 12 1990
+*/
+
+
+/*
+    This module provides a help service, including an independent
+    help widget.
+*/
+
+
+/*
+    C (internal) - SUN FORTRAN (external) name conversions
+    This table is for the compiler specifically mentioned above, so
+    as well as being hostage to its horrors we can make use of its
+    advantages, such as long identifiers.
+*/
+#define help_x    help_
+#define help2_x   help2_
+#define sethelp_x sethlp_
+
+
+
+/* ---- Includes ---- */
+
+#include "fort.h"
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget CreateHelpShell(Widget toplevelWid);
+/*
+    Create the help shell as a child of `toplevelWid', initially with
+    the list of topics available.
+    The shell is not initially mapped.
+*/
+
+
+
+
+extern void help(int topic);
+/*
+    Ensure the help shell is displayed, with ``topic'' selected.
+*/
+
+
+
+
+extern void help_x(int_f *HELPS_p, int_f *HELPE_p,
+		   int_f *BOTOPT,  int_f *TOPOPT,
+		   char  *HELPF_p,
+		   int_f *IDEV_p,
+		   int_f *KBIN_p,  int_f *KBOUT_p,
+		   int_fl HELPF_l);
+/*
+    This function does nothing
+*/
+
+
+
+
+extern void help2_x(int_f *HELPS_p, int_f *HELPE_p, char *HELPF_p,
+		    int_f *IDEV_p,
+		    int_f *KBIN_p,  int_f *KBOUT_p,
+		    int_fl HELPF_l);
+/*
+    This function does nothing
+*/
+
+
+
+
+
+extern void sethelp_x(int_f  HELPS[],  /* Array of start record numbers */
+		      int_f  HELPE[],  /* Array of end   record numbers */
+		      int_f *BOTOPT_p, /* Lower bound */
+		      int_f *TOPOPT_p, /* Upper bound */
+		      char  *POINTF_p, /* Name of help record pointer file */
+		      int_f *IDEV_p,
+		      int_f *KBOUT,
+		      int_fl POINTF_l);
+/*
+    This function does nothing
+*/
+
diff --git a/src/staden/helpnmenu.c b/src/staden/helpnmenu.c
new file mode 100644
index 0000000..5ddf434
--- /dev/null
+++ b/src/staden/helpnmenu.c
@@ -0,0 +1,285 @@
+/*--------------------------------------------------------------------------*\
+|*									    *|
+|* Help and menu routines. 						    *|
+|*									    *|
+\*--------------------------------------------------------------------------*/
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/param.h>
+#include <sys/types.h>
+#include <unistd.h>
+#include <string.h> /* IMPORT: strdup (hopefully!) */
+#include "mcspec.h" /* IMPORT: getenv */
+#include "FtoC.h"   /* IMPORT: Fstr2Cstr */
+#include "helpnmenu.h"
+#include "userface.h"
+#include "fort.h"
+#include "misc.h"
+
+#define HELPENV "STADHELP"
+
+/* --- statically declared global variables --- */
+static helpindex_t helpbase[MAXOPTS];	/* table of help pointers for opts */
+static int helperr;			/* are we allowed to give help?    */
+static FILE *helpfp;			/* FILE pointer for help	   */
+static int linesleft;			/* no. of lines left in help	   */
+static int currentopt;			/* which option we are now using   */
+
+/* --- globaly declared variables --- */
+
+helpindex_t *helpindex = &helpbase[MAXMENUS];
+char *helptopics[MAXOPTS];
+int optTransTab[MAXOPTS];
+int maxopts;
+
+/* ---------------------- help functions --------------------------------- */
+
+/*
+ * Supplies a line of help on the current option.
+ * Returns:
+ *   The next line of help for this option if one exists.
+ *   Otherwise NULL.
+ */
+char *gethelp() {
+    static char helpbuf[256];
+
+    if (helperr)
+	return NULL;
+
+    if (linesleft) {
+	/* already got the help open */
+	if (!--linesleft) {
+	    /* need to press a key at the end of the help */
+	    (void)bpause();
+	    return NULL;
+	}
+	else
+	    return fgets(helpbuf, sizeof(helpbuf), helpfp);
+    } else {
+	/* new help topic */
+	if (currentopt < -MAXMENUS || currentopt > MAXOPTS)
+	    linesleft = 0;
+	else
+	    linesleft = helpindex[currentopt].lines;
+	if (linesleft == 0) {
+	    linesleft = 1;
+	    sprintf(helpbuf, "  No help available on option %d\n",
+		    currentopt);
+	    return helpbuf;
+	}
+	fseek(helpfp, (size_t)helpindex[currentopt].offset, SEEK_SET);
+	sprintf(helpbuf, "  Help on '%s' (option %d)\n",
+		helpindex[currentopt].name, currentopt);
+	return helpbuf;
+    }
+}
+
+/*
+ * Supplies a line of help on a specific function.
+ * Args:
+ *   optnum: which function to find help for.
+ * Returns:
+ *   a line of help. (NULL if none)
+ */
+char *gethelp2(int optnum) {
+    int co = currentopt;
+    char *h;
+
+    if (optnum >= MAXOPTS-MAXMENUS)
+	return NULL;
+    /* bit of a hack - but it works fine. */
+    currentopt = optnum;
+    h = gethelp();
+    currentopt = co;
+    return h;
+}
+
+/*
+ * Initialises the help routines.
+ * Args:
+ *   progname: the name of this program (or at least which programs help
+ *             to use)
+ *   numopts : maximum number of options used
+ * Returns:
+ *   -1 for error
+ *    0 for success
+ */
+int inithelp(char *progname, int numopts) {
+    FILE *hp;
+    int optnum, menunum, offset, lines, hind = 0;
+    char prog, name[256], help_file[MAXPATHLEN], menu_file[MAXPATHLEN];
+    char *stadenenv, mode;
+
+    maxopts = numopts;
+
+    /* what type of program are we? */
+    mode = "TX"[xversn_()];
+
+    /* generate the pathnames of the files to use */
+    if ((stadenenv = getenv(HELPENV)) == NULL) {
+	printf("Error - couldn't find environment variable '%s'\n",
+	       HELPENV);
+	exit(1);
+    }
+    sprintf(help_file, "%s/%s_help", stadenenv, progname);
+    sprintf(menu_file, "%s/%s_menu", stadenenv, progname);
+
+    helperr = 0;
+    currentopt = 0;
+    linesleft = 0;
+
+    /* open up the help files */
+    if ((hp = fopen(menu_file, "r")) == NULL) {
+	perror(menu_file);
+	exit(1);
+    }
+    if ((helpfp = fopen(help_file, "r")) == NULL) {
+	perror(help_file);
+	helperr = 1;
+	return -1;
+    }
+
+    for (optnum = 0; optnum < MAXOPTS; optnum++) {
+	helpbase[optnum].menus = 0;
+	helpbase[optnum].name = 0;
+	for (menunum = 0; menunum < MAXDUP; menunum++)
+	    helpbase[optnum].menunum[menunum] = -1;
+    }
+
+    /* read in the menu index for use in the help/menu functions */
+    while(fscanf(hp, "%d %d %d %d %c %[^\n]", &optnum, &menunum, &offset,
+		 &lines, &prog, name) > 0) {
+	if (prog == mode) {
+	    if (optnum > MAXOPTS || optnum < -MAXMENUS) {
+		puts("Too many options for menu table");
+		fclose(hp);
+		return -1;
+	    }
+	    helpindex[optnum].menunum[helpindex[optnum].menus] = menunum;
+	    if (helpindex[optnum].menus == 0) {
+		helpindex[optnum].offset  = offset;
+		helpindex[optnum].lines   = lines;
+		helpindex[optnum].name    = strdup(name);
+	    }
+	    helpindex[optnum].menus++;
+	}
+    }
+
+    /* help table (in option number order) - no help on menus! */
+    helptopics[hind] = helpindex[0].name;
+    optTransTab[hind++] = 0;
+
+    for (optnum = xversn_()?3:1; optnum < MAXOPTS-MAXMENUS; optnum++)
+	if (helpindex[optnum].name) {
+	    helptopics[hind] = helpindex[optnum].name;
+	    optTransTab[hind++] = optnum;
+	}
+
+    /* make sure helptopics ends in a NULL (to be of XtNlist type */
+    helptopics[hind] = NULL;
+
+    fclose(hp);
+    return 0;
+}
+
+void inthlp_x(char *PROG_p, int_f *NUMOPTS_p, int_fl PROG_l) {
+    char prog[256];
+    
+    Fstr2Cstr(PROG_p, PROG_l, prog, (int_fl)sizeof(prog));
+    inithelp(prog, (int)*NUMOPTS_p);
+}
+
+/* ---------------------- menu functions --------------------------------- */
+
+/*
+ * Create a list of menu items.
+ * Args:
+ *   menunum: The menu number.
+ *   menu   : pointer to menu list (as an array). Each element of the array
+ *            is a structure containing the name and option number.
+ * Returns
+ *   How many items are in this menu.
+ */
+int create_menu(int menunum, menuarr *menu, int menusize) {
+    int i, ii, m;
+
+    if (menunum < 0)
+	menunum = 0;
+#if 0
+    /*
+     * It's not really our job to do the deallocation - but for tidyness
+     * sake we try not to waste any memory
+     */
+    for (m=0; m<menusize; m++)
+	if (menu[m].name)
+	    free(menu[m].name);
+#endif
+
+    /* list of menus always appears on each menu */
+    if (menunum == 0) {
+	menu[0].name = strdup("This menu");
+	strcpy(menu[0].optname, "m0");
+    } else {
+	menu[0].name = strdup("List of menus");
+	strcpy(menu[0].optname, " 0");
+    }
+    menu[0].number = 0;
+
+    m = 1;
+    for (i=0; i>-MAXMENUS; i--)
+	if (helpindex[i].menunum[0] == menunum && helpindex[i].name) {
+	    /* menu[m].name = strdup(helpindex[i].name); */
+	    menu[m].name = helpindex[i].name;
+	    sprintf(menu[m].optname, "m%d", -i);
+	    menu[m].number = i;
+	    m++;
+	}
+    for (i=0; i<MAXOPTS-MAXMENUS; i++)
+	for (ii=0; ii<MAXDUP; ii++) {
+	    if (helpindex[i].menunum[ii] == -1)
+		break;
+	    if (helpindex[i].menunum[ii] == menunum && helpindex[i].name) {
+		/* strdup()!? why not just use a pointer to const data? */
+		/* menu[m].name = strdup(helpindex[i].name); */
+		menu[m].name = helpindex[i].name;
+		if (i == 1)
+		    strcpy(menu[m].optname, " ?");
+		else if (i == 2)
+		    strcpy(menu[m].optname, " !");
+		else
+		    sprintf(menu[m].optname, "%2d", i);
+		menu[m].number = i;
+		m++;
+	    }
+	}
+
+#if 0
+    /* all menus have the 'help' and 'quit' options listed */
+    menu[m].name = strdup("Help");
+    menu[m].number = 1;
+    strcpy(menu[m++].optname, " ?");
+    menu[m].name = strdup("Quit");
+    menu[m].number = 2;
+    strcpy(menu[m++].optname, " !");
+#endif
+
+    return m;
+}
+
+/* ---------------------- general functions ------------------------------- */
+
+/*
+ * Ask which option we are currently in.
+ */
+int query_opt() {
+    return currentopt;
+}
+
+/*
+ * Set which option we are currently in.
+ */
+void set_opt(int op) {
+    currentopt = op;
+}
diff --git a/src/staden/helpnmenu.h b/src/staden/helpnmenu.h
new file mode 100644
index 0000000..4967319
--- /dev/null
+++ b/src/staden/helpnmenu.h
@@ -0,0 +1,37 @@
+#define help_x		help_
+#define help2_x		help2_
+#define sethlp_x	sethlp_
+#define inthlp_x	inthlp_
+
+#define MAXOPTS 	100 /* Should be big enough for a fair while */
+#define MAXMENUS 	10  /* Leaves MAXOPTS-MAXMENUS for actual options */
+#define MAXDUP		3   /* maximum number of menus for a single option */
+
+/* structures and types */
+
+typedef struct menuarr {
+    char *name;
+    char optname[4];
+    int number;
+} menuarr;
+
+typedef struct helpindex_t {
+    int menus;		/* how many menus it appears in */
+    int menunum[MAXDUP];/* which menus it appears in */
+    int offset;		/* byte offset into help file */
+    int lines;		/* number of lines of help for this option */
+    char *name;		/* Name of option */
+} helpindex_t;
+
+extern helpindex_t *helpindex;
+extern int optTransTab[];
+extern char *helptopics[];
+extern int maxopts;
+
+/* function prototypes */
+char *gethelp();
+char *gethelp2();
+int inithelp(char *progname, int numopts);
+int create_menu(int menunum, menuarr *menu, int menusize);
+int query_opt();
+void set_opt(int op);
diff --git a/src/staden/lip.f b/src/staden/lip.f
new file mode 100644
index 0000000..7c3770d
--- /dev/null
+++ b/src/staden/lip.f
@@ -0,0 +1,86 @@
+      SUBROUTINE FMAIN()
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) HELPF,FILNAM
+      CHARACTER*(NAMLEN) LIBLF
+      PARAMETER (
+     +           MAXSEQ=330000,
+     +           MAXWIR=330000,
+     +           MAXDEV=9)
+
+      PARAMETER (
+     +             LIBLF='SEQUENCELIBRARIES')
+      INTEGER DEVNOS(MAXDEV)
+      INTEGER WORKI(MAXWIR)
+      CHARACTER SEQ(MAXSEQ)
+      PARAMETER (MAXPRM = 16, MAXMEN = 3)
+      CHARACTER PROMPT(MAXMEN)*(MAXPRM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' LIP (Library interface program) V1.0 April 1993',/,
+     +' Author: Rodger Staden'/)
+C   READ IN THE POINTERS TO THE HELP FILE
+C      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(4),KBOUT)
+C      CALL INITLU(IDM)
+      IDEVLL = DEVNOS(5)
+      IDEVEN = DEVNOS(6)
+      IDEVAN = DEVNOS(7)
+      IDEVDL = DEVNOS(8)
+      IDEVLF = DEVNOS(9)
+      IDEV1  = DEVNOS(1)
+      IDEV2  = DEVNOS(2)
+      IDEVH  = DEVNOS(4)
+      IOPEN = 0
+      IDEVOT = KBOUT
+      LIBIN = 1
+ 1    CONTINUE
+      IDIMT = MAXSEQ
+      PROMPT(1) = 'Search a library'
+      PROMPT(2) = 'Extract entries'
+      PROMPT(3) = 'Redirect output'
+      JOPT = 1
+      CALL RADION('Select a task',PROMPT,3,JOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(JOPT.LT.1) THEN
+        IF (IOPEN.EQ.1) THEN
+          CLOSE(UNIT=IDEVOT)
+          IOPEN = 0
+        END IF
+        STOP
+      END IF
+      IF (JOPT.EQ.1) THEN
+        CALL RDLIB(SEQ,IDIMT,FILNAM,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +  IDEVLF,LIBIN,LIBLF,WORKI,MAXWIR,IOK)
+        IF (IOPEN.EQ.1) THEN
+          CLOSE(UNIT=IDEVOT)
+          IOPEN = 0
+          IDEVOT = KBOUT
+        END IF
+      ELSE IF (JOPT.EQ.2) THEN
+        IF (IOPEN.EQ.1) THEN
+          CLOSE(UNIT=IDEVOT)
+          IOPEN = 0
+          IDEVOT = KBOUT
+        END IF
+        CALL RDLIBB(SEQ,IDIMT,FILNAM,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEV2,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +  IDEVLF,LIBIN,LIBLF,WORKI,MAXWIR,IOK)
+      ELSE IF (JOPT.EQ.3) THEN
+        IF (IOPEN.EQ.1) THEN
+          CLOSE(UNIT=IDEVOT)
+          IOPEN = 0
+        END IF
+        IDEVOT = IDEV1
+        FILNAM = ' '
+        CALL OPENF1(IDEVOT,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Results file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) THEN
+          IDEVOT = KBOUT
+          GO TO 1
+        END IF
+        IOPEN = 1
+      END IF
+      GO TO 1
+      END
diff --git a/src/staden/locks.c b/src/staden/locks.c
new file mode 100644
index 0000000..1ed41c9
--- /dev/null
+++ b/src/staden/locks.c
@@ -0,0 +1,84 @@
+/*
+    Title: 	 locks
+
+    File: 	 locks.c
+    Purpose:	 General routines for locking resources
+    Last update: 
+*/
+
+#include "locks.h"
+#include <stdio.h>
+#include <stdlib.h>
+
+Semaphore activeLock;
+
+Semaphore semaphoreCreate(int max)
+{
+    Semaphore sem;
+
+    if ((sem = (Semaphore) malloc (sizeof(SemaphoreStruct)))!=NULL) {
+	sem->count = 0;
+	sem->max = max;
+    }
+    return sem;
+}
+
+int semaphoreGrab(Semaphore sem)
+{
+    return (sem->count==sem->max)?0:sem->count++,1;
+}
+
+int semaphoreRelease(Semaphore sem)
+{
+    return (sem->count==0)?0:sem->count--,1;
+}
+
+int semaphoreGrabN(Semaphore sem, int n)
+{
+    return (sem->count+n>sem->max)?0:(sem->count+=n),1;
+}
+
+int semaphoreReleaseN(Semaphore sem, int n)
+{
+    return (sem->count<n)?0:(sem->count-=n),1;
+}
+
+int semaphoreGrabExclusive(Semaphore sem)
+{
+    return semaphoreGrabN(sem, sem->max);
+}
+
+int semaphoreUsed(Semaphore sem)
+{
+    return sem->count;
+}
+
+int semaphoreFree(Semaphore sem)
+{
+    return (sem->count == 0);
+}
+
+Flag flagCreate()
+{
+    return (Flag) semaphoreCreate(1);
+}
+
+int flagSet(Flag flag)
+{
+    return semaphoreGrab((Semaphore) flag);
+}
+
+int flagUnset(Flag flag)
+{
+    return semaphoreRelease((Semaphore) flag);
+}
+
+int flagUsed(Flag flag)
+{
+    return (flag->count);
+}
+
+int flagFree(Flag flag)
+{
+    return (flag->count == 0);
+}
diff --git a/src/staden/locks.h b/src/staden/locks.h
new file mode 100644
index 0000000..bf0cb11
--- /dev/null
+++ b/src/staden/locks.h
@@ -0,0 +1,52 @@
+#ifndef _locks_h
+#define _locks_h
+/*
+    Title: 	 locks
+
+    File: 	 locks.h
+    Purpose:	 General routines for locking resources
+    Last update: 
+*/
+
+
+typedef struct {
+	int count;
+	int max;
+	} SemaphoreStruct, *Semaphore, *Flag;
+
+
+extern Semaphore activeLock;
+
+#ifdef noddy
+
+#define semaphoreCreate(S,N) (S=(Semaphore)malloc(sizeof(Semaphore)))!=NULL)?S->count=0,S->max=N,S:S
+#define semaphoreGrab(S) (S->count==S->max)?0:S->count++,1
+#define semaphoreRelease(S) (S->count==0)?0:S->count--,1
+#define semaphoreGrabN(S,N) (S->count+N>S->max)?0:(S-count+=N),1
+#define semaphoreRealeaseN(S,N) (S->count<N)?0:(S->count-=N),1
+#define semaphoreGrabExclusive(S) semaphoreGrabN(S,S->max)
+#define semaphoreUsed(S) S->count
+#define semaphoreFree(S) (S->count==0)
+#define flagCreate (Flag) semaphoreCreate(1)
+#define flagSet(F) semaphoreGrab((Semaphore) F)
+#define flagUnset(F) semaphoreRelease((Semaphore) F)
+#define flagUsed(F) F->count
+#define flagFree(F) (F->count==0)
+
+#else
+extern Semaphore semaphoreCreate(int max);
+extern int semaphoreGrab(Semaphore sem);
+extern int semaphoreRelease(Semaphore sem);
+extern int semaphoreGrabN(Semaphore sem, int n);
+extern int semaphoreReleaseN(Semaphore sem, int n);
+extern int semaphoreGrabExclusive(Semaphore sem);
+extern int semaphoreUsed(Semaphore sem);
+extern int semaphoreFree(Semaphore sem);
+extern Flag flagCreate();
+extern int flagSet(Flag flag);
+extern int flagUnset(Flag flag);
+extern int flagUsed(Flag flag);
+extern int flagFree(Flag flag);
+#endif
+
+#endif /* _locks_h */
diff --git a/src/staden/main.c b/src/staden/main.c
new file mode 100644
index 0000000..bb88eba
--- /dev/null
+++ b/src/staden/main.c
@@ -0,0 +1,400 @@
+/*
+    Title: 	 main
+
+    File: 	 main.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: Wed Jan 23 1991
+
+    Change log:
+        23/01/90 SD  Resorce *yesno.YES.borderWidth now 3
+
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system. It provides the main event loop,
+    as far as X is concerned.
+
+    Three top-level shells are created: control, dialogue and help.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>    /* IMPORT: freopen, stdout, stderr, fprintf */
+#include <stdlib.h>
+#include "mystdlib.h" /* IMPORT: exit */
+
+#include "main.h"
+#include "dialogues.h"     /* IMPORT: CreateDialogueShell */
+#include "help.h"          /* IMPORT: CreateHelpShell */
+#include "plX.h"           /* IMPORT: CreateGraphicsOutput */
+#include "textOutput.h"    /* IMPORT: CreateTextOutput
+                                      UpdateTextOutput */
+#include "postscript.h"    /* IMPORT: closep_x() */
+
+#include "locks.h"
+#include "mcspec.h"
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Types ---- */
+typedef enum {Alive, Dying, Dead} LifeForce;
+
+
+
+
+/* ---- Static variables ---- */
+
+static int global_argc;
+static char **global_argv;
+static XtAppContext app_con;
+static Display *display;
+static Widget toplevelWid;
+static Widget dialogueShellWid, helpShellWid, outputShellWid, graphicsShellWid;
+static LifeForce lifeForce = Alive;
+static int exitcode;
+/*
+static XrmOptionDescRec options[] =
+{
+};
+*/
+/* The following are used if Xstaden cannot be found */
+static String fallback_resources[] = {
+    /* Dialogue shell */
+    /* -------------- */
+    "Xstaden.Dialgue.allowShellResize: False",
+    "Xstaden.Dialogue.width:	 550",
+    "Xstaden.Dialogue.minWidth:  550",
+    "Xstaden.Dialogue.maxWidth:  550",
+    "Xstaden.Dialogue.height:    180",
+    "Xstaden.Dialogue.minHeight: 180",
+    "Xstaden.Dialogue.maxHeight: 180",
+    "Xstaden.Dialogue.dialogueForm*translations: #override \\n\
+         <Key>Return: CRAction()",
+    /* The default chaining for all children is Top,Top,Left,Left */
+    "Xstaden.Dialogue.dialogueForm*top:    chainTop",
+    "Xstaden.Dialogue.dialogueForm*bottom: chainTop",
+    "Xstaden.Dialogue.dialogueForm*left:   chainLeft",
+    "Xstaden.Dialogue.dialogueForm*right:  chainLeft",
+    /* currFile */
+    "Xstaden.Dialogue.dialogueForm.currFile.label: Current file:",
+    "Xstaden.Dialogue.dialogueForm.currFile.borderWidth: 0",
+    "Xstaden.Dialogue.dialogueForm.currFile.resizable: True",
+    /* currFunc */
+    "Xstaden.Dialogue.dialogueForm.currFunc.label: Current function:",
+    "Xstaden.Dialogue.dialogueForm.currFunc.borderWidth: 0",
+    "Xstaden.Dialogue.dialogueForm.currFunc.resizable: True",
+    /* quit */
+    "Xstaden.Dialogue.dialogueForm.quit.label: Quit application",
+    "Xstaden.Dialogue.dialogueForm.quit.foreground:  red",
+    "Xstaden.Dialogue.dialogueForm.quit.borderColor: red",
+
+    /* null */
+    "Xstaden.Dialogue.dialogueForm.null.borderWidth: 0",
+    "Xstaden.Dialogue.dialogueForm.null.label: Working ... ",
+
+    /* nChoice */
+    "Xstaden.Dialogue.dialogueForm.nChoice.resizable: True",
+    /* nChoice buttons */
+    "Xstaden.Dialogue.dialogueForm.nChoice.buttons.resizable: True",
+    /* nChoice OK */
+    "Xstaden.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3",
+
+    /* getopt */
+    "Xstaden.Dialogue.dialogueForm.getopt.width: 540",
+    /* getopt execlabel */
+    "Xstaden.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:",
+    "Xstaden.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0",
+    /* getopt execcommand */
+    "Xstaden.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue",
+    "Xstaden.Dialogue.dialogueForm.getopt.execcommand.resizable: True",
+    /* getopt menubar */
+    "Xstaden.Dialogue.dialogueForm.getopt.menubar.width:	530",
+    "Xstaden.Dialogue.dialogueForm.getopt.menubar.height:	25",
+
+    /* getstring */
+    "Xstaden.Dialogue.dialogueForm.getstring.label:",
+    "Xstaden.Dialogue.dialogueForm.getstring.value:",
+    /* getstring label */
+    "Xstaden.Dialogue.dialogueForm.getstring.label.resizable: True",
+    /* getstring value */
+    "Xstaden.Dialogue.dialogueForm.getstring.value.width: 350",
+    /* getstring OK */
+    "Xstaden.Dialogue.dialogueForm.getstring.OK.borderWidth: 3",
+
+    /* yesno */
+    "Xstaden.Dialogue.dialogueForm.yesno.resizable: True",
+    /* yesno label */
+    "Xstaden.Dialogue.dialogueForm.yesno.lab.label:",
+    "Xstaden.Dialogue.dialogueForm.yesno.lab.borderWidth: 0",
+    "Xstaden.Dialogue.dialogueForm.yesno.lab.resizable: True",
+    "Xstaden.Dialogue.dialogueForm.yesno.YES.borderWidth: 3",
+
+    /* Help shell */
+    /* ---------- */
+    "Xstaden.Help.width:     550",
+    "Xstaden.Help.minWidth:  550",
+    "Xstaden.Help.height:    400",
+    "Xstaden.Help.minHeight: 150",
+    "Xstaden.Help*top:    chainTop",
+    "Xstaden.Help*bottom: chainTop",
+    "Xstaden.Help*left:   chainLeft",
+    "Xstaden.Help*right:  chainLeft",
+    /* currDisp */
+    "Xstaden.Help.helpForm.currDisp.resizable: True",
+    "Xstaden.Help.helpForm.currDisp.borderWidth: 0",
+    /* rmHelp */
+    "Xstaden.Help.helpForm.rmHelp.label: Remove help",
+    /* topicsViewport */
+    "Xstaden.Help.helpForm.topicsViewport.allowVert: True",
+    "Xstaden.Help.helpForm.topicsViewport.bottom: chainBottom",
+    "Xstaden.Help.helpForm.topicsViewport.right:  chainRight",
+    "Xstaden.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1",
+    /* text */
+    "Xstaden.Help.helpForm.text.editType: edit",
+    "Xstaden.Help.helpForm.text.scrollVertical: always",
+    "Xstaden.Help.helpForm.text.bottom: chainBottom",
+    "Xstaden.Help.helpForm.text.right:  chainRight",
+
+    /* Output shell */
+    /* ------------ */
+    "Xstaden.Graphics.graph.width:  550",
+    "Xstaden.Graphics.graph.height: 400",
+    "Xstaden.Output.text.width:   550",
+    "Xstaden.Output.text.height:  100",
+
+    NULL,
+};
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void destroyCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   lifeForce = Dead;
+}
+
+
+
+
+/* ---- Other functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    fprintf(stderr, "%s: unknown options:", argv[0]);
+    for (i=1; i <argc; i++)
+	fprintf(stderr, " %s", argv[i]);
+    fprintf(stderr, "\n\n");
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr, 
+"       [-geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]]\n");
+    fprintf(stderr,
+"       [-display [{host}]:[{vs}]]\n");
+    fprintf(stderr,
+"       [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]\n");
+}
+
+
+
+
+static void finalxPhase2(void)
+/*
+    Recover all X resources and exit with `status'.
+    This function does not return.
+*/
+{
+    /* close any opened postscript file. */
+    closep_x();
+
+    XtDestroyApplicationContext(app_con);
+
+    exit(exitcode);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+#ifdef sgi
+#    define VMS_FLAGS	4
+#    define VMS_CC	0
+#    define OLD_RL	1
+#    define VMS_IN	2
+#    define VMS_EF	3
+extern unsigned short f77vms_flag_[VMS_FLAGS];
+#endif /* sgi */
+
+void main(unsigned int argc, char **argv)
+{
+    activeLock = semaphoreCreate(65535/*a big number for max*/);
+
+#ifdef sgi
+    /* Fix on Iris Indigo to allow fortran to use a C main() */
+    f77vms_flag_[OLD_RL] = 1;
+#endif /* sgi */
+
+    global_argc = argc;
+    global_argv = argv;
+
+    fmain_();
+
+    finalx(0);
+    finalxPhase2();
+}
+
+
+
+
+void initx_(void)
+/*
+    Initialise the X system, creating all necessary fixtures and
+    fittings for the application.
+    `stdout' may be redirected.
+*/
+{
+
+
+    toplevelWid = XtAppInitialize(&app_con, "Xstaden",
+				  NULL, (Cardinal) 0,
+				  &global_argc,
+				  (String *)global_argv,
+				  fallback_resources,
+				  NULL, (Cardinal) 0);
+
+    XtAddCallback(toplevelWid, XtNdestroyCallback, destroyCallback, NULL);
+    display = XtDisplay(toplevelWid);
+
+    if (global_argc != 1)
+    {   usage(global_argc, global_argv);
+        finalx(1);
+    }
+
+    /*
+        Create the dialogue shell.
+    */
+    dialogueShellWid = CreateDialogueShell(toplevelWid);
+
+    /*
+        Create the graphics shell.
+    */
+    graphicsShellWid = XtCreatePopupShell("Graphics", topLevelShellWidgetClass,
+                                         toplevelWid,
+                                         NULL, (Cardinal) 0);
+
+    (void) CreateGraphicsOutput(graphicsShellWid);
+
+    /*
+        Create the output shell.
+    */
+    outputShellWid = XtCreatePopupShell("Output", topLevelShellWidgetClass,
+                                         toplevelWid,
+                                         NULL, (Cardinal) 0);
+
+    (void) CreateTextOutput(outputShellWid);
+
+
+    /*
+        Create the help shell.
+    */
+    helpShellWid = CreateHelpShell(toplevelWid);
+
+    /*
+        The output and dialogue shells are initially displayed.
+    */
+    XtPopup(dialogueShellWid, XtGrabNone);
+    XtPopup(graphicsShellWid, XtGrabNone);
+    XtPopup(outputShellWid,   XtGrabNone);
+
+}
+
+
+
+
+void finalx(int status)
+/*
+    Recover all X resources and exit with `status'.
+
+    Death under X is a two phase process. XtDestroyWidget is called on
+    the toplevel, but this may only take effect when we return to the
+    main loop. When it does take effect, the toplevel destroyCallback
+    will be called and set ``lifeForce'' to dead, which indicates that
+    the second phase can be called. This then calls exit.
+*/
+{   lifeForce = Dying;
+    exitcode = status;
+
+    XtDestroyWidget(toplevelWid);
+
+    if (lifeForce == Dead) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchEventsUntil(Boolean (*stopPred)(void))
+/*
+    X events are cycled until the stopping predicate returns true or
+    we are in the process of dying. If this is the case, then the
+    second phase of finalisation is carried out and this function
+    does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && !stopPred())
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchCurrentEvents()
+/*
+    All X events in the queue are cycled. If we are in the process of
+    dying, then the second phase of finalisation is carried out and
+    this function does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && (XtAppPending(app_con) != 0))
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+
+Display *GetDisplay(void)
+/*
+    Return the X display running.
+    This function can only be called after initx_().
+*/
+{   return(display);
+}
diff --git a/src/staden/main.h b/src/staden/main.h
new file mode 100644
index 0000000..85e0395
--- /dev/null
+++ b/src/staden/main.h
@@ -0,0 +1,65 @@
+/*
+    Title: 	 main
+
+    File: 	 main.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: Thur May 17 1990
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system. It provides the main event loop,
+    as far as X is concerned.
+
+    Three top-level shells are created: control, dialogue and help.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void initx_(void);
+/*
+    Initialise the X system, creating all necessary fixtures and
+    fittings for the application.
+    `stdout' may be redirected.
+*/
+
+
+extern void finalx(int status);
+/*
+    Recover all X resources and exit with `status'.
+*/
+
+
+extern void dispatchEventsUntil(Boolean (*stopPred)(void));
+/*
+    X events are cycled until the stopping predicate returns true or
+    we are in the process of dying. If this is the case then this
+    function does not return.
+*/
+
+
+extern void dispatchCurrentEvents();
+/*
+    All X events in the queue are cycled. If we are in the process of
+    dying, then the second phase of finalisation is carried out and
+    this function does not return.
+*/
+
+
+extern Display *GetDisplay(void);
+/*
+    Return the X display running.
+    This function can only be called after initx_().
+*/
diff --git a/src/staden/makefile-aix b/src/staden/makefile-aix
new file mode 100644
index 0000000..d369cfa
--- /dev/null
+++ b/src/staden/makefile-aix
@@ -0,0 +1,589 @@
+# Makefile for staden X programs (IBM AIX RS/6000)
+# ------------------------------------------------
+# Many thanks to Klaus Hartmuth for making this version possible
+#
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project sapf nipf vep
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+#.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+XINCLUDE= /usr/local/X11R5/include
+INCLUDES= -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(XINCLUDE)
+
+ 
+#
+# X11R5 & UTILITIES lib
+#
+ 
+X11R5LIBS= -L/usr/local/X11R5/lib -lXaw -lXt -lXmu -lXext -lX11
+UTILITIES= -L$(HOME)/lib
+ 
+LIBS= $(UTILITIES) $(X11R5LIBS)
+
+
+#
+# Compilers and flags
+#
+CC= cc
+CFLAGS= $(INCLUDES) -DAIX -qlanglvl=extended
+CLFLAGS=$(LIBS)
+ 
+F77= xlf
+FFLAGS= -qcharlen=1024 -NN500 -NP500
+FLFLAGS=
+ 
+LFLAGS= -lxlf -lc -lxlfutil $(ADDLIBS)
+XLFLAGS= $(LIBS)
+ 
+#
+# type of linking requested
+#
+LD=$(F77)
+LDFLAGS=$(FLFLAGS) -bnodelcsect
+
+
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	mcspec.o\
+	dialogues.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	fmain.o\
+	seeme-aix.o\
+	aix-bits.o\
+	nxspec.o\
+	userface.a\
+	subs89.a\
+	userface.a\
+	subs89.a
+
+XCOMMONOBJS=\
+	seeme-aix.o\
+	aix-bits.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme-aix.o\
+	aix-bits.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(COMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot89.a
+
+MEPOBJS=\
+	$(MEP)\
+	mepmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	xmepSpec.o\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot89.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	nipmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	xnipSpec.o\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(COMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot89.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot89.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	pipmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	xpipSpec.o\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(COMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot89.a
+
+DAPOBJS=\
+	$(DAP)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	xdapSpec.o\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(CC) $(CFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot89.a
+
+SAPOBJS=\
+	$(SAP)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	xsapSpec.o\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot89.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot89.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	sipmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	xsipSpec.o\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(COMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+	$(COMMONOBJS)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(COMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(COMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(COMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+
+
+
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+#
+# Special dependencies
+#
+#
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+.f.o:
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+.c.o:
+	$(CC) $(CFLAGS) -c $<
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+#.f.a:
+#	BINDIR=`pwd`; \
+#	cd $(SRC); \
+#	SRCDIR=`pwd`; \
+#	TMPDIR=/tmp/staden$$$$; \
+#	if test ! -d $$TMPDIR; \
+#	then mkdir $$TMPDIR; \
+#	fi; \
+#	cd $$TMPDIR; \
+#	rm -f *.f *.o; \
+#	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+#	$(F77) $(FFLAGS) -c *.f; \
+#	rm *.f; \
+#	rm -f $$BINDIR/$@; \
+#	ar rcv $$BINDIR/$@ *.o; \
+#	ranlib $$BINDIR/$@; \
+#	rm *.o; \
+#	cd $$BINDIR; \
+#	rm -r $$TMPDIR
+#
+
+
+
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
diff --git a/src/staden/makefile-alpha b/src/staden/makefile-alpha
new file mode 100644
index 0000000..f9b9043
--- /dev/null
+++ b/src/staden/makefile-alpha
@@ -0,0 +1,592 @@
+# Makefile for staden X programs (Alpha OSF/1.2 version)
+# ------------------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project rep lip sapf nipf vep
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)/alpha-binaries
+INCLUDES= -I$(SRC) -I$(TEDSRC) -I$(MISCSRC)
+
+#
+# Compilers and flags
+#
+CC= cc
+CFLAGS= -g -std1 -ieee_with_inexact $(INCLUDES)
+F77= f77
+FFLAGS= -g -C
+
+FLIBS= -lfor -lutil -lUfor -lots -lFutil -lm -lfor -lutil -lUfor -lots -lFutil
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11 -ldnet_stub
+LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS) 
+
+LD = f77
+CLD = cc
+LDFLAGS = -g -non_shared # Comment out for faster linking when debugging
+
+
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	postscript.o\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	dialogues.o\
+	userfacecom.o\
+	helpnmenu.o\
+	xmenu.o\
+	FtoC.o\
+	mcspec.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	fmain.o\
+	seeme-dec.o\
+	nxspec.o\
+	userface.o\
+	userfacecom.o\
+	nxhelpmenu.o\
+	helpnmenu.o\
+	FtoC.o\
+	subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	seeme-dec.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme-dec.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+LIP=\
+	lip.o
+ 
+LIPOBJS=\
+	$(LIP)\
+	$(SEQLIB)\
+	$(COMMONOBJS)
+ 
+lip: $(LIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(LIPOBJS) $(LFLAGS)
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot92.a \
+	$(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot92.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot92.a
+
+DAPOBJS=\
+	$(DAP)\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)
+
+sethelp: $(SETHELPOBJS)
+	$(CLD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+REP=\
+	rep.o\
+	dias89.a\
+	subs89.a
+
+REPOBJS=\
+	$(REP)\
+	$(COMMONOBJS)
+ 
+rep:	$(REPOBJS)
+	$(F77) $(LDFLAGS) -o $@ $(REPOBJS) $(LFLAGS)
+
+
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(SCOMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+
+#
+# Special dependencies
+#
+#
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a:	$(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+#.f.a:
+#	$(F77) $(FFLAGS) -c $<
+#	ar rcv $@ $*.o
+#	ranlib $@
+#	rm $*.o
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
+lint:
+	lint -Q -Nn5000 -DNOSTRDUP $(INCLUDES) *.c
+
+
diff --git a/src/staden/makefile-dec b/src/staden/makefile-dec
new file mode 100644
index 0000000..2cfd305
--- /dev/null
+++ b/src/staden/makefile-dec
@@ -0,0 +1,595 @@
+# Makefile for staden X programs (Dec version)
+# --------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project rep lip sapf nipf vep
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+INCLUDES= -I/usr/include/mit -I$(SRC) -I$(TEDSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+CC= c89
+CFLAGS= -g -common -DNOSTRDUP $(INCLUDES)
+F77= f77
+FFLAGS= -g -c
+
+#FLIBS= -lfor -lutil -lUfor -li -lots -lm
+FLIBS= $(MISCBIN)/misc.a -lfor -lutil -lUfor -li -lots -lm -lfor
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS) 
+#LFLAGS= 
+#XLFLAGS= $(XLIBS) 
+LD = c89
+LDFLAGS = -g
+
+
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	postscript.o\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	dialogues.o\
+	userfacecom.o\
+	helpnmenu.o\
+	xmenu.o\
+	FtoC.o\
+	mcspec.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	fmain.o\
+	seeme-dec.o\
+	nxspec.o\
+	userface.o\
+	userfacecom.o\
+	nxhelpmenu.o\
+	helpnmenu.o\
+	FtoC.o\
+	subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	seeme-dec.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme-dec.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+LIP=\
+	lip.o
+ 
+LIPOBJS=\
+	$(LIP)\
+	$(SEQLIB)\
+	$(COMMONOBJS)
+ 
+lip: $(LIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(LIPOBJS) $(LFLAGS)
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot92.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot92.a
+
+DAPOBJS=\
+	$(DAP)\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(CC) $(CFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+REP=\
+	rep.o\
+	dias89.a\
+	subs89.a
+
+REPOBJS=\
+	$(REP)\
+	$(COMMONOBJS)
+ 
+rep:	$(REPOBJS)
+	$(F77) $(LDFLAGS) -o $@ $(REPOBJS) $(LFLAGS)
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(SCOMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+#
+# Special dependencies
+#
+#
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
diff --git a/src/staden/makefile-sgi b/src/staden/makefile-sgi
new file mode 100644
index 0000000..b7bb6b4
--- /dev/null
+++ b/src/staden/makefile-sgi
@@ -0,0 +1,604 @@
+# Makefile for staden X programs (Silicon Graphics version)
+# ---------------------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project sapf nipf vep rep lip
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+INCLUDES= -I$(SRC) -I$(TEDSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+CC= cc
+CFLAGS= -g -ansi -prototypes $(INCLUDES)
+F77= f77
+FFLAGS=  -g -c -old_rl
+
+FLIBS= -lF77 -lU77 -lm -lI77 -lisam
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(XLIBS) $(FLIBS)
+
+LD = cc
+LDFLAGS = -g 
+
+#
+# Shell to perform make commands
+#
+SHELL=/bin/sh
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+#	/usr/lib/fixade.o
+
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	postscript.o\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	dialogues.o\
+	userfacecom.o\
+	helpnmenu.o\
+	xmenu.o\
+	FtoC.o\
+	mcspec.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	fmain.o\
+	seeme-sg.o\
+	nxspec.o\
+	userface.o\
+	userfacecom.o\
+	nxhelpmenu.o\
+	helpnmenu.o\
+	FtoC.o\
+	subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	seeme-sg.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme-sg.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+LIP=\
+	lip.o
+ 
+LIPOBJS=\
+	$(LIP)\
+	$(SEQLIB)\
+	$(COMMONOBJS)
+ 
+lip: $(LIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(LIPOBJS) $(LFLAGS)
+
+
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot92.a \
+	$(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot92.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot92.a
+
+DAPOBJS=\
+	$(DAP)\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)\
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+REP=\
+	rep.o\
+	dias89.a\
+	subs89.a
+
+REPOBJS=\
+	$(REP)\
+	$(COMMONOBJS)
+ 
+rep:	$(REPOBJS)
+	$(F77) $(LDFLAGS) -o $@ $(REPOBJS) $(LFLAGS)
+
+
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(SCOMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+
+#
+# Special dependencies
+#
+#
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a:	$(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	echo ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
+
+
diff --git a/src/staden/makefile-solaris b/src/staden/makefile-solaris
new file mode 100644
index 0000000..3e91b59
--- /dev/null
+++ b/src/staden/makefile-solaris
@@ -0,0 +1,617 @@
+# Makefile for staden X programs (Solaris 2.1 version)
+# ----------------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project rep lip sapf nipf vep
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+OPENWININC=/usr/openwin/include
+INCLUDES= -I$(SRC) -I$(TEDSRC) -I$(MISCSRC) -I$(OPENWININC)
+
+
+#
+# Compilers and flags
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(INCLUDES)
+F77= f77
+FFLAGS= -ansi -g -C -Nx500
+
+#
+# Linking flags
+#
+#
+#LD=	ld
+#LDFLAGS=-t $(LIBPATH) -Bstatic \
+#	$(SPRO)/crti.o $(SPRO)/crt1.o $(SPRO)/cg89/__fstd.o \
+#	$(SPRO)/values-xi.o -Y P,$(SPRO)/../lib:$(SPRO)/:/usr/ccs/lib:/usr/lib
+#
+#SPRO=	/opt/SUNWspro/SC2.0.1
+#LIBPATH=-L$(SPRO) -L/usr/ccs/lib -L/usr/ucblib
+#XLIBS=	-lXaw -lXt -lXmu -lXext -lX11
+#FLIBS=	-lV77 -lF77 -lM77 -lm
+#LDOBJ=	$(SPRO)/crtn.o
+#LIBS=	-lm -lc -lsocket -lnsl -lintl -Bdynamic -ldl -Bstatic -lucb -lelf \
+#	$(LDOBJ)
+#LFLAGS=	$(FLIBS) $(LIBS)
+#XLFLAGS=$(FLIBS) $(XLIBS) $(LIBS)
+
+LD	= f77
+SPRO	= /opt/SUNWspro/SC2.0.1
+LDFLAGS	= -Bstatic -g -L$(SPRO) -L/usr/ccs/lib -L/usr/ucblib -L/usr/openwin/lib
+LIBS	= -lm -lucb -lelf -Bdynamic -ldl -Bstatic -lsocket -lnsl -lintl
+FLIBS	= -lV77 -lF77 -lM77
+XLIBS	= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS	= $(FLIBS) $(LIBS)
+XLFLAGS	= $(FLIBS) $(XLIBS) $(LIBS)
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	postscript.o\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	dialogues.o\
+	userfacecom.o\
+	helpnmenu.o\
+	xmenu.o\
+	FtoC.o\
+	mcspec.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	fmain.o\
+	seeme.o\
+	nxspec.o\
+	userface.o\
+	userfacecom.o\
+	nxhelpmenu.o\
+	helpnmenu.o\
+	FtoC.o\
+	subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	seeme.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+LIP=\
+	lip.o
+ 
+LIPOBJS=\
+	$(LIP)\
+	$(SEQLIB)\
+	$(COMMONOBJS)
+ 
+lip: $(LIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(LIPOBJS) $(LFLAGS)
+ 
+ 
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot92.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot92.a
+
+DAPOBJS=\
+	$(DAP)\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+
+
+REP=\
+	rep.o\
+	dias89.a\
+	subs89.a
+
+REPOBJS=\
+	$(REP)\
+	$(COMMONOBJS)
+ 
+rep:	$(REPOBJS)
+	$(F77) $(LDFLAGS) -o $@ $(REPOBJS) $(LFLAGS)
+
+
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(SCOMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+
+#
+# Special dependencies
+#
+#
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
diff --git a/src/staden/makefile-sun b/src/staden/makefile-sun
new file mode 100644
index 0000000..82262fd
--- /dev/null
+++ b/src/staden/makefile-sun
@@ -0,0 +1,602 @@
+# Makefile for staden X programs (Sun version)
+# --------------------------------------------
+
+# This makefile generates the following staden programs which
+# run under xterm (normal, `N') and those which run directly under X.
+NPROGS=      mep  nip   pip   dap  sap  sip   splitp1 splitp2 splitp3 gip sethelp convert_project rep lip sapf nipf vep
+LPROGS=           nipl  pipl            sipl
+XPROGS=     xmep xnip  xpip  xdap xsap xsip
+PROGS= $(NPROGS) $(LPROGS) $(XPROGS)
+
+
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Where the wild things are
+#
+SRC=.
+TEDSRC=$(SRC)/../ted
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+INCLUDES= -I$(SRC) -I$(TEDSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(INCLUDES)
+# KFLAGS - for compiling non ANSI C modules
+KFLAGS= -g $(INCLUDES)
+F77= f77
+FFLAGS= -ansi -g -C -Nx500
+
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+# cc -Bstatic, gcc-2.x -static, else nothing
+LFLAGS= -static -lF77 -lI77 -lU77 -lm
+XLFLAGS= $(LFLAGS) $(XLIBS)
+
+#LD=cc
+# If using gcc-2.x, better link with it
+LD=gcc
+LDFLAGS= -g
+# Alternative for those with problems with libraries
+#XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+#LFLAGS= -Bstatic
+#XLFLAGS= $(LFLAGS) $(XLIBS)
+#LD=f77
+#LDFLAGS= -g
+
+#
+# Sequence library handling routines
+#
+SEQLIB=\
+	seqlibsubs.o
+
+#
+# The C objects, needed by every X program
+#
+CCORE=\
+	postscript.o\
+	Graph.o\
+	plotLog.o\
+	help.o\
+	dialogues.o\
+	userfacecom.o\
+	helpnmenu.o\
+	xmenu.o\
+	FtoC.o\
+	mcspec.o\
+	locks.o
+
+COBJS=\
+	main.o\
+	$(CCORE)
+
+XDAPCOBJS=\
+	xdapmain.o\
+	$(CCORE)
+
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJSB=\
+	fmain.o\
+	seeme.o\
+	nxspec.o\
+	userface.o\
+	userfacecom.o\
+	nxhelpmenu.o\
+	helpnmenu.o\
+	FtoC.o\
+	subs89.a
+
+COMMONOBJS=\
+	$(COMMONOBJSB)\
+	postscript.o
+
+SCOMMONOBJS=\
+	$(COMMONOBJSB)
+
+XCOMMONOBJS=\
+	seeme.o\
+	xspec.o\
+	subs89.a\
+	$(COBJS)
+
+XDAPCOMMONOBJS=\
+	seeme.o\
+	xspec.o\
+	subs89.a\
+	$(XDAPCOBJS)
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+GIP=\
+	gip.o
+
+GIPOBJS=\
+	$(GIP)\
+	$(SCOMMONOBJS)
+
+gip: $(GIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(GIPOBJS) $(LFLAGS)
+
+
+LIP=\
+	lip.o
+ 
+LIPOBJS=\
+	$(LIP)\
+	$(SEQLIB)\
+	$(COMMONOBJS)
+ 
+lip: $(LIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(LIPOBJS) $(LFLAGS)
+
+
+MEP=\
+	mep.o\
+	mepsub.o\
+	asubs89.a\
+	plot92.a
+
+MEPOBJS=\
+	$(MEP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XMEPOBJS=\
+	$(MEP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+mep: $(MEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(MEPOBJS)  $(LFLAGS)
+
+xmep: $(XMEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XMEPOBJS) $(XLFLAGS)
+
+
+
+
+NIP=\
+	nip.o\
+	patternn.a\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	plot92.a $(SEQLIB)
+
+NIPOBJS=\
+	$(NIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XNIPOBJS=\
+	$(NIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+nip: $(NIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPOBJS) $(LFLAGS)
+
+xnip: $(XNIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XNIPOBJS) $(XLFLAGS)
+
+
+
+
+NIPL=\
+	nipl.o\
+	patternnc.a\
+	anals89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+NIPLOBJS=\
+	$(NIPL)\
+	$(SCOMMONOBJS)
+
+nipl: $(NIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPLOBJS) $(LFLAGS)
+
+
+
+NIPF=\
+	nipf.o\
+	asubs89.a\
+	plot92.a
+
+NIPFOBJS=\
+	$(NIPF)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+
+nipf: $(NIPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(NIPFOBJS) $(LFLAGS)
+
+
+
+
+
+PIP=\
+	pip.o\
+	analps89.a\
+	patternp.a\
+	patternpc.a\
+	asubs89.a\
+	plot92.a\
+	$(SEQLIB)
+
+PIPOBJS=\
+	$(PIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XPIPOBJS=\
+	$(PIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+pip: $(PIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPOBJS)  $(LFLAGS)
+
+xpip:$(XPIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XPIPOBJS) $(XLFLAGS)
+
+
+
+
+PIPL=\
+	pipl.o\
+	patternpc.a\
+	analps89.a\
+	asubs89.a\
+	$(SEQLIB)
+
+PIPLOBJS=\
+	$(PIPL)\
+	$(SCOMMONOBJS)
+
+pipl: $(PIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(PIPLOBJS)  $(LFLAGS)
+
+
+
+#
+# Trace manager objects
+#
+STDTEDFILES=\
+	seq.o\
+	seqIOABI.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqIOPlain.o\
+	opp.o\
+	fpoint.o\
+	mach-io.o
+
+TMANOBJS=\
+	tman_main.o\
+	tman_display.o\
+	tman_traceDisplay.o\
+	tman_basesDisplay.o\
+	tman_context.o\
+	tman_gadgets.o\
+	$(STDTEDFILES)
+
+# Some versions of X11R4 may have a bug in SmeLine.c
+XHACK=\
+	SmeLine.o
+
+EDITOR=\
+	$(XHACK)\
+	xsapConEdit.o\
+	contigEditor.o\
+	edUtils.o\
+	undo.o\
+	Sheet.o\
+	select.o\
+	extend.o\
+	searchUtils.o\
+	edMenu.o\
+	trans.o\
+	$(MISCBIN)/misc.a
+
+TAGEDITOR=\
+	tagEditor.o\
+	tagdbparse.o\
+	tagU2.o\
+	tagU1.o
+
+DAP=\
+	dap.o\
+	dbsysnew.o\
+	dbsyscommon.o\
+	asubs89.a\
+	plot92.a
+
+DAPOBJS=\
+	$(DAP)\
+	$(COMMONOBJS)\
+	pl4010.o\
+	conEdit.o\
+	tagU2.o
+
+XDAPOBJS=\
+	$(DAP)\
+	$(XDAPCOMMONOBJS)\
+	plX.o\
+	textOutput.o\
+	$(EDITOR)\
+	$(TMANOBJS)\
+	$(TAGEDITOR)
+
+dap: $(DAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(DAPOBJS) $(LFLAGS)
+
+xdap: $(XDAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XDAPOBJS) $(XLFLAGS)
+
+convert_project: cvt.o
+	$(CC) $(CFLAGS) -o $@ cvt.o
+
+SAP=\
+	sap.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPOBJS=\
+	$(SAP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSAPOBJS=\
+	$(SAP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sap: $(SAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPOBJS) $(LFLAGS)
+
+xsap: $(XSAPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSAPOBJS) $(XLFLAGS)
+
+
+
+
+SAPF=\
+	sapf.o\
+	dbsyscommon.o\
+	dbsysold.o\
+	asubs89.a\
+	plot92.a
+
+SAPFOBJS=\
+	$(SAPF)\
+	sapmen.o\
+	$(COMMONOBJS)\
+	pl4010.o
+
+sapf: $(SAPFOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SAPFOBJS) $(LFLAGS)
+
+
+
+
+
+
+SIP=\
+	sip.o\
+	dias89.a\
+	plot92.a\
+	$(SEQLIB)
+
+SIPOBJS=\
+	$(SIP)\
+	$(COMMONOBJS)\
+	pl4010.o
+
+XSIPOBJS=\
+	$(SIP)\
+	$(XCOMMONOBJS)\
+	plX.o\
+	textOutput.o 
+
+sip: $(SIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPOBJS) $(LFLAGS)
+
+xsip: $(XSIPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(XSIPOBJS) $(XLFLAGS)
+
+
+
+
+SIPL=\
+	sipl.o\
+	dias89.a\
+	$(SEQLIB)
+
+SIPLOBJS=\
+	$(SIPL)\
+	$(SCOMMONOBJS)
+
+sipl: $(SIPLOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SIPLOBJS) $(LFLAGS)
+
+
+
+
+SETHELP=\
+	sethelp.o
+
+SETHELPOBJS=\
+	$(SETHELP)
+
+sethelp: $(SETHELPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SETHELPOBJS) $(LFLAGS)
+
+
+
+
+SPLITP1=\
+	splitp1.o
+SPLITP1OBJS=\
+	$(SPLITP1)\
+	$(SCOMMONOBJS)
+
+splitp1: $(SPLITP1OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP1OBJS) $(LFLAGS)
+
+
+
+
+SPLITP2=\
+	splitp2.o
+SPLITP2OBJS=\
+	$(SPLITP2)\
+	$(SCOMMONOBJS)
+
+splitp2: $(SPLITP2OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP2OBJS) $(LFLAGS)
+
+
+
+
+
+SPLITP3=\
+	splitp3.o
+SPLITP3OBJS=\
+	$(SPLITP3)\
+	$(SCOMMONOBJS)
+
+splitp3: $(SPLITP3OBJS)
+	$(LD) $(LDFLAGS) -o $@ $(SPLITP3OBJS) $(LFLAGS)
+
+REP=\
+	rep.o\
+	dias89.a\
+	subs89.a
+
+REPOBJS=\
+	$(REP)\
+	$(COMMONOBJS)
+ 
+rep:	$(REPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(REPOBJS) $(LFLAGS)
+
+
+
+VEP=\
+	vep.o\
+	dias89.a\
+	subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(SCOMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+
+#
+# Special dependencies
+#
+#
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(TEDSRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+
+#
+# The UNIX library (archive) mechanism is fairly weak so the following
+# procedure is used to generate libraries.
+#   * split the source file up into files which each contain
+#     a single function or subroutine
+#   * compile each of those files
+#   * insert the object files individually into the library
+# Each library is built by using a temporary directory.
+# The fortran compilations must be done sequentially in order to
+# avoid filling up the (small on this machine) /tmp directory
+# with compiler debugging information.
+#
+%.a: $(SRC)/%.f
+	BINDIR=`pwd`; \
+	cd $(SRC); \
+	SRCDIR=`pwd`; \
+	TMPDIR=/tmp/staden$$$$; \
+	if test ! -d $$TMPDIR; \
+	then mkdir $$TMPDIR; \
+	fi; \
+	cd $$TMPDIR; \
+	rm -f *.f *.o; \
+	fsplit $$SRCDIR/`expr $< : '.*/\(.*\)' \| $<`; \
+	$(F77) $(FFLAGS) -c *.f; \
+	rm *.f; \
+	rm -f $$BINDIR/$@; \
+	ar rcv $$BINDIR/$@ *.o; \
+	ranlib $$BINDIR/$@; \
+	rm *.o; \
+	cd $$BINDIR; \
+	rm -r $$TMPDIR
+
+
+
+
+
+
+all: $(PROGS)
+
+nprogs: $(NPROGS)
+
+xprogs: $(XPROGS)
+
+lprogs: $(LPROGS)
+
+clean:
+	rm -f $(PROGS) *.a *.o *~
+
diff --git a/src/staden/mcspec.c b/src/staden/mcspec.c
new file mode 100644
index 0000000..edee3ea
--- /dev/null
+++ b/src/staden/mcspec.c
@@ -0,0 +1,42 @@
+/* Title: 	mcspec
+
+   File: 	mcspec.c
+   Purpose:	Machine, OS and language specific features
+   Last update: Mon Mar 19th 1990
+*/
+
+#include "fort.h"
+#include <stdio.h>
+#include <unistd.h>
+
+
+/* ---- Xlib includes ---- */
+#include <X11/Xlib.h> /* IMPORT: XBell */
+
+
+/* ---- Application includes ---- */
+#include "mcspec.h"
+#include "main.h" /* IMPORT: GetDisplay */
+
+
+
+
+void bell_(int_f *n_ptr, int_f *kbout_ptr)
+/* Ring the bell `n' times
+*/   
+{   int_f i;
+
+    for (i=1; i<=*n_ptr; i++) {
+        XBell(GetDisplay(), 100);
+    }
+}
+
+#ifdef NOREMOVE
+int remove(const char *filename)
+/* This is an ANSI standard function.
+   It is implemented here because few UNIX systems support it.
+*/
+{   
+    return(unlink(filename));
+}
+#endif
diff --git a/src/staden/mcspec.h b/src/staden/mcspec.h
new file mode 100644
index 0000000..8471119
--- /dev/null
+++ b/src/staden/mcspec.h
@@ -0,0 +1,37 @@
+/*
+    Title: 	 mcspec
+
+    File: 	 mcspec.h
+    Purpose:	 Machine, OS and language specific features
+    Last update: Mon Jun 10 1993
+*/
+
+#ifndef _MCSPEC_H_
+#define _MCSPEC_H_
+
+
+#include "fort.h"
+
+extern void bell_(int_f *n_ptr, int_f *kbout_ptr);
+/*
+    Ring the bell `n' times
+*/   
+
+
+extern int remove(const char *filename);
+/*
+    This is an ANSI standard function.
+    It is implemented here because few UNIX systems support it.
+*/
+
+
+extern char *getenv(const char *name);
+/*
+    This is an ANSI standard function.
+    It's type is declared here becuase there is no stdlib.h
+    on this machine.
+*/
+
+
+
+#endif /*_MCSPEC_H_*/
diff --git a/src/staden/mep.f b/src/staden/mep.f
new file mode 100644
index 0000000..5c1dd95
--- /dev/null
+++ b/src/staden/mep.f
@@ -0,0 +1,394 @@
+C     author Rodger Staden
+C     22-oct-1992 introduced mask
+C     8-jul-1992 allowed words as short as 2 letters
+C     16-mar-1990 removed escape after bpause
+C      5-4 90 changed getdev to redir (see redir)
+C     22-6-90 Added work for use by makeds (it had its own array
+C     but it was too small.)
+C     14-11-90 replaced radio by radion
+C     13-04-91 removed initial CLEARV
+      SUBROUTINE FMAIN()
+      PARAMETER ( MAXSEQ = 1000,
+     +            MAXWRD = 8,
+     +            IDM    = 5,
+     +            MAXDEV = 7,
+     +            MAXDIC = (IDM-1)**MAXWRD,
+     +            MAXLIS = 100,
+     +            MAXCHR = 20000,
+     +            MINMEN = -3,
+     +            MAXMEN = 33,
+     +            MAXMAS = 25)
+      CHARACTER SEQ(MAXCHR)
+      CHARACTER*60 FILNAM,HELPF,POINTF,FILMAR
+      PARAMETER ( HELPF = 'MEPHELP',
+     +            POINTF = 'MEPHPNT',
+     +            FILMAR = 'MEPMARG')
+      INTEGER SEQN(MAXCHR),DW(MAXDIC),DEVNOS(MAXDEV),DS(MAXDIC)
+      INTEGER LENSEQ(MAXSEQ),DM(MAXDIC),DT(MAXDIC),DH(MAXDIC)
+      INTEGER TEMPC(MAXLIS),TEMPP(MAXLIS),TEMPI(MAXLIS),TEMPN(MAXLIS)
+      INTEGER HELPS(0:MAXMEN),HELPE(0:MAXMEN),WT(4,MAXWRD)
+      INTEGER MARGB(MAXMEN),MARGT(MAXMEN),WORK(MAXCHR)
+      REAL COMP(5)
+      PARAMETER (MAXPRM = 27)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      LOGICAL MASK(MAXMAS)
+      CALL INTHLP('mep', MAXMEN)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(7))
+      CALL INITLU(IDM)
+      WRITE(KBOUT,*)' MEP (Motif exploration program) V2.0 Oct 1992'
+      WRITE(KBOUT,*)' Author Rodger Staden'
+      IGORT = 0
+      IDEV = KBOUT
+      IDEVH = DEVNOS(3)
+      IDEVO = DEVNOS(5)
+      CALL SETHLP(HELPS,HELPE,0,MAXMEN,POINTF,IDEVH,KBOUT)
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +MAXMEN,DEVNOS(2),FILMAR)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+1     CONTINUE
+      CALL RSEQ(DEVNOS(1),DEVNOS(2),FILNAM,SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +NFILE,LMAX,KBIN,KBOUT,HELPS(3),HELPE(3),HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      CALL CONNUM(SEQ,SEQN,LENSEQ(NFILE))
+      I1 = 1
+      I2 = LMAX
+      XMIN = I1
+      XMAX = I2
+      YMIN = 1
+      YMAX = NFILE - 1
+      CALL GETCOM(SEQN,LENSEQ(NFILE),COMP)
+      ICOMP = 1
+      KCOMP = 0
+      LENGTH = 6
+      LMASK = LENGTH
+      MAXPOS = (IDM-1)**LENGTH
+      NFUZ = 1
+      IDICW = 1
+      IDICM = 0
+      IDICH = 0
+      IDICS = 1
+      DO 2 I=1,LMASK
+        MASK(I) = .TRUE.
+ 2    CONTINUE
+10    CONTINUE
+      CALL VT100M
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      IF(IOK.NE.0) GO TO 9999
+      CALL MENU(NOPT,KOPT,MOPT,MAXMEN,MINMEN,KBIN,KBOUT,
+     +HELPS,HELPE,HELPF,IDEVH)
+      IF(NOPT.EQ.2) GO TO 9999
+      IF(NOPT.EQ.3)GO TO 1
+C
+C   help
+C
+      IF(NOPT.EQ.1) THEN
+        CALL HELP(HELPS,HELPE,0,MAXMEN,HELPF,IDEVH,
+     +  KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.4)THEN
+        MN = 1
+        MX = LMAX
+        CALL GETINT(MN,MX,I1,'Start position',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        I1 = IVAL
+        MN = I1 + LENGTH - 1
+        MX = LMAX
+        CALL GETINT(MN,MX,I2,'End position',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        I2 = IVAL
+        IDICW = 1
+        IDICM = 0
+        IDICH = 0
+        IDICS = 1
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.5) THEN
+        LINLEN = 50
+        CALL LISTN(SEQ,MAXCHR,LENSEQ,NFILE,LMAX,LINLEN,I1,I2,
+     +  IDEV,KBOUT)
+      END IF
+      IF(NOPT.EQ.6)THEN
+        CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+C   GET DISK OUTPUT FILE ON UNIT DEVNOS(2) IF REQUIRED
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(7),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.33)THEN
+        CALL HAIRPN(SEQ,MAXCHR,LENSEQ,NFILE,LMAX,
+     +  DW,DM,DT,LMAX,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,
+     +  KBOUT,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        IDICW = 1
+        IDICH = 0
+        IDICM = 0
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17)THEN
+C  PLOT
+        IB = 1
+        PROMPT(1) = 'Plot match positions'
+        PROMPT(2) = 'Plot histogram of matches'
+        CALL RADION('Select plot mode',PROMPT,2,IB,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF(IB.LT.1) GO TO 10
+        IF(IB.EQ.1) THEN
+          CALL PLOTP(SEQ,MAXCHR,LENSEQ,NFILE,DT,LMAX,KBIN,KBOUT,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,
+     +    I1,I2,IDEV,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        ELSE
+          CALL PLOTH(SEQ,MAXCHR,LENSEQ,NFILE,DT,LMAX,KBIN,KBOUT,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,I1,I2,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        END IF
+        GO TO 10
+      END IF
+C   XHAIRS
+      IF(NOPT.EQ.13)THEN
+      CALL CLEARV
+      CALL XHAIRS(XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,
+     +MARGB(NOPT),MARGT(NOPT),
+     +ISXMAX,ISYMAX,IHX,IHY,N,KBOUT)
+      GO TO 10
+      END IF
+C   RULER
+      IF(NOPT.EQ.12)THEN
+           WRITE(KBOUT,*)' Horizontal scale'
+           ISH1=XMIN
+           ISH2=XMAX
+           CALL RULER(ISH1,ISH2,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +               ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +          HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(3))
+           WRITE(KBOUT,*)' Vertical scale'
+           ISV1=YMIN
+           ISV2=YMAX
+           CALL RULER(ISV1,ISV2,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),
+     +               ISXMAX,ISYMAX,KBIN,KBOUT,2,
+     +          HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(3))
+           GO TO 10
+      END IF
+C   CLEAR GRAPHICS
+      IF(NOPT.EQ.10)THEN
+      CALL CLEARG
+      GO TO 10
+      END IF
+C   KLEAR TEXT
+      IF(NOPT.EQ.11)THEN
+      CALL CLEARV
+      GO TO 10
+      END IF
+C  CHANGE MARGINS
+      IF(NOPT.EQ.14)THEN
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),MAXMEN,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  LABEL
+      IF(NOPT.EQ.15)THEN
+      CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(3))
+      GO TO 10
+      END IF
+      IF(NOPT.EQ.20)THEN
+C        MN = 2
+C        MX = MAXWRD
+C        CALL GETINT(MN,MX,LENGTH,'Word length',
+C     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,IOK)
+C        IF(IOK.NE.0) GO TO 10
+C        LENGTH = IVAL
+C
+C  DEFINE A MASK
+C
+        WRITE(KBOUT,*)'Define word as a mask'
+        LMASK = MAXMAS
+        LENGTH = MAXWRD
+        CALL GMASK(KBIN,KBOUT,MASK,LMASK,LENGTH,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        MAXPOS = (IDM-1)**LENGTH
+        IDICS = 1
+        IDICW = 1
+        IDICM = 0
+        IDICH = 0
+      ELSE IF (NOPT.EQ.22) THEN
+        CALL SHOWST(NFUZ,I1,I2,KCOMP,ICOMP,IDICW,IDICS,
+     +  IDICM,IDICH,MASK,LMASK,KBOUT)
+      ELSE IF (NOPT.EQ.21)THEN
+        MN = 0
+        MX = 2
+        CALL GETINT(MN,MX,NFUZ,'Number of mismatches',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 10
+        NFUZ = IVAL
+        IDICW = 1
+        IDICM = 0
+        IDICH = 0
+      ELSE IF (NOPT.EQ.18)THEN
+        IB = KCOMP + 1
+        PROMPT(1) = 'Search input strand'
+        PROMPT(2) = 'Search complementary strand'
+        PROMPT(3) = 'Search both strands'
+        CALL RADION('Select search mode',PROMPT,3,IB,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF(IB.LT.1) GO TO 10
+        KCOMP = IB - 1
+        CALL TCOMP(COMP,KCOMP)
+      ELSE IF (NOPT.EQ.19) THEN
+        CALL GETCM(COMP,KBIN,KBOUT,SEQN,LENSEQ(NFILE),ICOMP,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+      ELSE IF (NOPT.EQ.23)THEN
+C
+C  MAKE DW
+C
+C       GENERATE THE DICTIONARY FOR THIS WORD LENGTH
+        CALL FILLI(DW,MAXPOS,0)
+        WRITE(KBOUT,*)'Making Dw'
+        CALL MAKEDW(SEQN,MAXCHR,DW,DT,MAXPOS,LENSEQ,
+     +    MAXSEQ,NFILE,LENGTH,KCOMP,I1,I2,MASK,LMASK)
+        IDICW = 0
+      ELSE IF (NOPT.EQ.24) THEN
+C
+C  MAKE DS
+C
+        WRITE(KBOUT,*)'Making Ds'
+        CALL FILLI(DS,MAXPOS,0)
+        CALL MAKEDS(SEQN,MAXCHR,DS,DT,MAXPOS,LENSEQ,
+     +    MAXSEQ,NFILE,LENGTH,KCOMP,I1,I2,WORK,MASK,LMASK)
+        IDICS = 0
+      ELSE IF (NOPT.EQ.25) THEN
+C
+C  MAKE DM FROM DW
+C
+        IF(IDICW.NE.0) THEN
+          WRITE(KBOUT,*)'Please make Dw'
+        ELSE
+          WRITE(KBOUT,*)'Making Dm from Dw with ',NFUZ,' mismatch' 
+          CALL MAKEDM(DW,DM,LENGTH,NFUZ,MAXPOS)
+          IDICM = 1
+          IDICH = 0
+        END IF
+      ELSE IF (NOPT.EQ.26) THEN
+C
+C  MAKE DM FROM DS
+C
+        IF(IDICS.NE.0) THEN
+          WRITE(KBOUT,*)'Please make Ds'
+        ELSE
+          WRITE(KBOUT,*)'Making Dm from Ds with ',NFUZ,' mismatch' 
+          CALL MAKEDM(DS,DM,LENGTH,NFUZ,MAXPOS)
+          IDICM = 2
+          IDICH = 0
+        END IF
+      ELSE IF (NOPT.EQ.27)THEN
+C
+C  MAKE DH
+C
+        IF(IDICM.EQ.0)THEN
+          WRITE(KBOUT,*)'Please make dictionary'
+        ELSE
+          IF(IDICM.NE.0)THEN
+            WRITE(KBOUT,*)'Making Dh'
+            CALL MAKEDH(DH,DM,LENGTH,NFUZ,MAXPOS)
+            IDICH = IDICM
+          END IF
+        END IF
+      ELSE IF (NOPT.EQ.28)THEN
+        IF(IDICM.EQ.0) THEN
+          WRITE(KBOUT,*)'Please make dictionary'
+        ELSE IF (IDICM.EQ.1) THEN
+           CALL EXAMDM(DM,DW,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        ELSE IF (IDICM.EQ.2) THEN
+           CALL EXAMDM(DM,DS,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        END IF
+      ELSE IF (NOPT.EQ.29)THEN
+        IF(IDICH.EQ.0) THEN
+          WRITE(KBOUT,*)'Please make dictionary'
+        ELSE IF (IDICH.EQ.1) THEN
+           CALL EXAMDM(DH,DW,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        ELSE IF (IDICH.EQ.2) THEN
+           CALL EXAMDM(DH,DS,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        END IF
+      ELSE IF (NOPT.EQ.30)THEN
+        IF(IDICM.EQ.0) THEN
+          WRITE(KBOUT,*)'Please make dictionary'
+        ELSE IF (IDICM.EQ.1) THEN
+           CALL EXAMDW(DM,DW,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        ELSE IF (IDICM.EQ.2) THEN
+           CALL EXAMDW(DM,DS,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        END IF
+      ELSE IF (NOPT.EQ.31)THEN
+        IF(IDICH.EQ.0) THEN
+          WRITE(KBOUT,*)'Please make dictionary'
+        ELSE IF (IDICH.EQ.1) THEN
+           CALL EXAMDW(DH,DW,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        ELSE IF (IDICH.EQ.2) THEN
+           CALL EXAMDW(DH,DS,
+     +     MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +     TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+        END IF
+      ELSE IF (NOPT.EQ.32) THEN
+        CALL DICIO(KBIN,KBOUT,IOPT,IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOPT.EQ.1) THEN
+          CALL WDIC(IDEVO,FILNAM,DW,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        ELSE IF(IOPT.EQ.2) THEN
+          CALL WDIC(IDEVO,FILNAM,DS,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        ELSE IF(IOPT.EQ.3) THEN
+          CALL WDIC(IDEVO,FILNAM,DM,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        ELSE IF(IOPT.EQ.4) THEN
+          CALL WDIC(IDEVO,FILNAM,DH,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+        ELSE IF(IOPT.EQ.5) THEN
+          CALL RDIC(IDEVO,FILNAM,DW,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+          IF(IOK.EQ.0)IDICW = 0
+C          IDICM = 1
+C          IDICH = 1
+        ELSE IF(IOPT.EQ.6) THEN
+          CALL RDIC(IDEVO,FILNAM,DS,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+          IF(IOK.EQ.0)IDICS = 0
+C          IDICH = 1
+        ELSE IF(IOPT.EQ.7) THEN
+          WRITE(KBOUT,*)'Assuming from Dw'
+          CALL RDIC(IDEVO,FILNAM,DM,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+          IF(IOK.EQ.0)IDICM = 1
+        ELSE IF(IOPT.EQ.8) THEN
+          WRITE(KBOUT,*)'Assuming from Dw'
+          CALL RDIC(IDEVO,FILNAM,DH,MAXPOS,KBIN,KBOUT,IOK,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,IDEVH)
+          IF(IOK.EQ.0)IDICH = 1
+        END IF
+      END IF
+      GO TO 10
+9999  CONTINUE
+      CALL VT100M
+      CALL SHUTD
+      END
diff --git a/src/staden/mepmen.f b/src/staden/mepmen.f
new file mode 100644
index 0000000..5db4d9a
--- /dev/null
+++ b/src/staden/mepmen.f
@@ -0,0 +1,81 @@
+      SUBROUTINE MENU(OPT,KOPT,MOPT,MAXOPT,MINMEN,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+      INTEGER OPT
+      CHARACTER HELPF*(*)
+1     CONTINUE
+      CALL CLEARV
+      IF(MOPT.EQ.0) THEN
+      WRITE(KBOUT,1000)
+ 1000 FORMAT(' Menus and their numbers are',/,
+     +' m0 = This menu',/,
+     +' m1 = General menu',/,
+     +' m2 = Screen control menu',/,
+     +' m3 = Dictionary analysis menu',/,
+     +'  ? = Help',/,
+     +'  ! = Quit')
+      ELSE IF (MOPT.EQ.-1) THEN
+        WRITE(KBOUT,1001)
+1001    FORMAT(' General menu',/,
+     +  '   0 = List of menus',/,
+     +  '   ? = Help',/,
+     +  '   ! = Quit'/,
+     +  '   3 = Read new sequences',/,
+     +  '   4 = Redefine active region',/,
+     +  '   5 = List the sequences',/,
+     +  '   6 = List text file',/,
+     +  '   7 = Direct output to disk',/,
+     +  '  17 = Search for strings',/,
+     +  '  33 = Find inverted repeats')
+       ELSE IF (MOPT.EQ.-2) THEN
+        WRITE(KBOUT,1002)
+1002    FORMAT(' Screen control menu',/,
+     +  '   0 = List of menus',/,
+     +  '   ? = Help',/,
+     +  '   ! = Quit'/,
+     +  '  10 = Clear graphics',/,
+     +  '  11 = Clear text',/,
+     +  '  12 = Draw ruler',/,
+     +  '  13 = Use cross hair',/,
+     +  '  14 = Reset margins',/,
+     +  '  15 = Label diagram',/,
+     +  '  16 = Draw map')
+       ELSE IF (MOPT.EQ.-3) THEN
+        WRITE(KBOUT,1003)
+1003    FORMAT(' Dictionary analysis menu',/,
+     +  '  18 = Set strand',/,
+     +  '  19 = Set composition',/,
+     +  '  20 = Set word mask',/,
+     +  '  21 = Set number of mismatches',/,
+     +  '  22 = Show settings',/,
+     +  '  23 = Make dictionary Dw',/,
+     +  '  24 = Make dictionary Ds',/,
+     +  '  25 = Make fuzzy dictionary Dm from Dw',/,
+     +  '  26 = Make fuzzy dictionary Dm from Ds',/,
+     +  '  27 = Make fuzzy dictionary Dh from Dm',/,
+     +  '  28 = Examine fuzzy dictionary Dm',/,
+     +  '  29 = Examine fuzzy dictionary Dh',/,
+     +  '  30 = Examine words in Dm',/,
+     +  '  31 = Examine words in Dh',/,
+     +  '  32 = Save or restore a dictionary')
+      END IF
+      WRITE(KBOUT,1004)
+1004  FORMAT(' ? Menu or option number=',$)
+      CALL GETOPT(KBIN,KOPT,OPT)
+      IF((OPT.LT.MINMEN).OR.(OPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(OPT.GE.0)) THEN
+        CALL HELP2(IHELPS(OPT),IHELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(OPT.EQ.0) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF(OPT.LT.0) THEN
+        MOPT = OPT
+        GO TO 1
+       END IF
+       END
diff --git a/src/staden/mepsub.f b/src/staden/mepsub.f
new file mode 100644
index 0000000..d76cbff
--- /dev/null
+++ b/src/staden/mepsub.f
@@ -0,0 +1,1614 @@
+C     Subroutines for program MEP
+C     author Rodger Staden
+C     22-jun-1990 changed makeds to send work
+C     9-7-90 removed menu routine
+C     14-11-90 Replaced all radio by radion
+C     2-3-92 set filnam = ' ' for some calls to openf1
+C
+C   CCODE
+C   DCODE
+C   DCODEI
+C   DICIO
+C   DRAWPN
+C   EXAMDM
+C   EXAMDW
+C   GETCM
+C   GETCOM
+C   GETV
+C   HAIRPD
+C   HAIRPN
+C   HARPN
+C   KTONUM
+C   LISTN
+C   LWRAP2
+C   MAKEDH
+C   MAKEDM
+C   MAKEDS
+C   MAKEDW
+C   MENU
+C   NCODE
+C   NCODES
+C   OUTP
+C   PLOTH
+C   PLOTP
+C   RDDIC
+C   RDIC
+C   RINF
+C   RSEQ
+C   RSEQ1
+C   RSEQ2
+C   SHLSRT
+C   SHOWST
+C   SQFIT7
+C   SQFIT8
+C   STRNIN
+C   TCOMP
+C   WDCODE
+C   WDIC
+C   WFUZ2
+C   WRTDIC
+C   WRTWMT
+C   WTMCON
+C
+      SUBROUTINE RSEQ(IDEV1,IDEV2,FILNAM,SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +NFILE,LMAX,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      INTEGER LENSEQ(MAXSEQ)
+      CHARACTER SEQ(MAXCHR),FILNAM*(*),HELPF*(*)
+      PARAMETER (MAXPRM = 30)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      IOK = 1
+      IB = 1
+      PROMPT(1) = 'Read file of aligned sequences'
+      PROMPT(2) = 'Use file of file names'
+      CALL RADION('Select input mode',PROMPT,2,IB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IB.LT.1) RETURN
+      IF(IB.EQ.2)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of sequence names',
+     +  HELPS,HELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL RSEQ1(SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +  IDEV2,IDEV1,KBIN,KBOUT,FILNAM,NFILE,LMAX,IOK)
+        RETURN
+      END IF
+      IF(IB.EQ.1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV2,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of aligned sequences',
+     +  IHELPS,HELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) RETURN
+        CALL RSEQ2(SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +  IDEV2,KBIN,KBOUT,FILNAM,NFILE,LMAX,IOK)
+      END IF
+      END
+      SUBROUTINE RSEQ1(SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +IDEVN,IDEVSQ,KBIN,KBOUT,FILNAM,NFILE,LMAX,IOK)
+      INTEGER LENSEQ(MAXSEQ)
+      CHARACTER SEQ(MAXCHR),FILNAM*(*)
+C  CONCATENATE SEQUENCES IN SEQ, PUT THE START OF SEQ(J) IN LENSEQ(J)
+C  SO THE LENGTH OF SEQ(J) IS LENSEQ(J+1) - LENSEQ(J)
+      LMAX = 0
+      NFILE = 1
+      LENSEQ(1) = 1
+      ISEQ = 1
+      REWIND IDEVN
+10    CONTINUE
+      IF(NFILE.LT.MAXSEQ)THEN
+        READ(IDEVN,1000,END=20,ERR=30)FILNAM
+1000    FORMAT(A)
+        CALL OPENRS(IDEVSQ,FILNAM,IOK,LRECL,2)
+        IF(IOK.NE.0)THEN
+          WRITE(KBOUT,1002)FILNAM
+1002      FORMAT(' Error opening file ',A)
+          GO TO 10
+        END IF
+        WRITE(KBOUT,1001)FILNAM
+1001    FORMAT(' Reading file ',A)
+        IDIM = MAXCHR - ISEQ + 1
+        CALL ARRFIL(IDEVSQ,SEQ(ISEQ),IDIM,KBOUT)
+        CLOSE(UNIT=IDEVSQ)
+        NFILE = NFILE + 1
+        ISEQ = ISEQ + IDIM
+        LENSEQ(NFILE) = ISEQ
+        LMAX = MAX(LMAX,IDIM)
+        GO TO 10
+      END IF
+20    CONTINUE
+      CLOSE(UNIT=IDEVN)
+      IOK = 0
+      WRITE(KBOUT,*)'Number of files ',NFILE - 1
+      RETURN
+30    CONTINUE
+      WRITE(KBOUT,*)' Error reading file of file names'
+      IOK = 1
+      END
+      SUBROUTINE RSEQ2(SEQ,MAXCHR,MAXSEQ,LENSEQ,
+     +IDEVSQ,KBIN,KBOUT,FILNAM,NFILE,LMAX,IOK)
+      INTEGER LENSEQ(MAXSEQ)
+      CHARACTER SEQ(MAXCHR),LINE*80
+      INTEGER NOTIRL
+      EXTERNAL NOTIRL
+      LMAX = 0
+      NFILE = 0
+      ISEQ = 1
+10    CONTINUE
+      IF(NFILE.LT.MAXSEQ)THEN
+        NFILE = NFILE + 1
+        LENSEQ(NFILE) = ISEQ
+        READ(IDEVSQ,1000,ERR=40,END=30)LINE
+1000    FORMAT(1X,A)
+        IDIM = NOTIRL(LINE,80,' ')
+        LMAX = MAX(LMAX,IDIM)
+        K = MAXCHR - ISEQ + 1
+        IDIM = MIN(IDIM,K)
+        K = ISEQ
+        DO 20 I = 1,IDIM
+          SEQ(K) = LINE(I:I)
+          K = K + 1
+20      CONTINUE
+        ISEQ = K
+        GO TO 10
+      END IF
+30    CONTINUE
+      CLOSE(UNIT=IDEVSQ)
+      IOK = 0
+      WRITE(KBOUT,*)'Number of files ',NFILE - 1
+      RETURN
+40    CONTINUE
+      WRITE(KBOUT,*)' Error reading file'
+      IOK = 1
+      CLOSE(UNIT=IDEVSQ)
+      END
+      SUBROUTINE PLOTH(SEQ,MAXCHR,LENSEQ,NFILE,HIS,LMAX,KBIN,KBOUT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,I1,I2,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      PARAMETER (MAXSTR = 11)
+      CHARACTER SEQ(MAXCHR),STRING(MAXSTR),HELPF*(*)
+      INTEGER LENSEQ(NFILE),HIS(LMAX)
+30    CONTINUE
+      IDIM2 = 0
+      CALL GETSTR('Word to search for',SEQ,STRING,MAXSTR,IDIM2,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      IF(INFLAG.NE.0) RETURN
+      MN = 1
+      MX = IDIM2
+      MINMAT = IDIM2
+      CALL GETINT(MN,MX,MINMAT,'Minimum match',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      CALL YESONO(MOPT,'Join dots','Separate dots',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(MOPT.LT.0)RETURN
+      XMAX=LMAX
+      XMIN=1
+      YMIN=0
+      CALL FILLI(HIS,LMAX,0)
+      DO 40 I = 1,NFILE-1
+        K = LENSEQ(I)
+        I4 = LENSEQ(I+1) - K
+        I3 = MIN(I2,I4)
+        CALL SQFIT7(SEQ(K),I4,STRING,IDIM2,HIS,LMAX,
+     +  I1,I3,MINMAT,1)
+40    CONTINUE
+      MAXHIS = 0
+      DO 50 I = 1,LMAX
+        IF(HIS(I).GT.MAXHIS)MAXHIS = HIS(I)
+50    CONTINUE
+      YMAX = MAXHIS
+      WRITE(KBOUT,*)' Maximum in histogram=',MAXHIS
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      XF=XMIN
+      YF=YMIN
+      DO 100 I = 1,LMAX
+        XT = I
+        YT = HIS(I)
+        IF(MOPT.EQ.0)CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        IF(MOPT.EQ.1)CALL POINT(XT,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        XF=XT
+        YF=YT
+100   CONTINUE
+      CALL VT100M
+      END
+C      SQFIT7
+      SUBROUTINE SQFIT7(SEQ,IDIM1,STRING,IDIM2,ITOTEL,ITOTID,
+     +IS,IE,MINS,KSTART)
+C   AUTHOR: RODGER STADEN
+C   COMPARE POINTERS, NOT ACTUAL CHARACTERS
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOTEL(ITOTID)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      DO 200 I=1,IDIF
+        NTOT = 0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(STRING(J).EQ.'-')THEN
+            NTOT = NTOT + 1
+          ELSE
+            K = CTONUM(STRING(J))
+            IF(K.EQ.CTONUM(SEQ(IP)))NTOT = NTOT + 1
+          END IF
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINS)THEN
+          K = IP - IDIM2+KSTART-1
+          ITOTEL(K) = ITOTEL(K) + 1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+      SUBROUTINE PLOTP(SEQ,MAXCHR,LENSEQ,NFILE,HIS,LMAX,KBIN,KBOUT,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,I1,I2,IDEV,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      PARAMETER (MAXSTR=40)
+      CHARACTER SEQ(MAXCHR),STRING(MAXSTR),HELPF*(*)
+      INTEGER LENSEQ(NFILE),HIS(LMAX)
+30    CONTINUE
+      IDIM2 = 0
+      CALL GETSTR('Word to search for',SEQ,STRING,MAXSTR,IDIM2,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      IF(INFLAG.NE.0) RETURN
+      MN = 1
+      MX = IDIM2
+      MINMAT = IDIM2
+      CALL GETINT(MN,MX,MINMAT,'Minimum match',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      MINMAT = IVAL
+      IGON = 0
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IGON.LT.0)RETURN
+C  CHECK RANGE
+      XMAX=LMAX
+      XMIN=1
+      YMIN=0
+      YMAX = 1
+      IF(IGON.EQ.0)THEN
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        XMARG = REAL(MARGT)/(NFILE-1)
+        MARGT1 = NINT(XMARG)
+        MARGB1 = MARGB
+      END IF
+      NMATT = 0
+      DO 200 I = 1,NFILE-1
+        CALL FILLI(HIS,LMAX,0)
+        K = LENSEQ(I)
+        I4 = LENSEQ(I+1) - K
+        I3 = MIN(I2,I4)
+        CALL SQFIT8(SEQ(K),I4,STRING,IDIM2,HIS,LMAX,
+     +  I1,I3,MINMAT,NMAT,1)
+        NMATT = NMATT + NMAT
+        YF=YMIN
+        YT = YMAX
+        DO 100 J = 1,NMAT
+          XT = HIS(J)
+          XF = XT
+          IF(IGON.EQ.0)THEN
+            CALL LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +        MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          END IF
+          IF(IGON.EQ.1)THEN
+            K1 = LENSEQ(I) + HIS(J) - 1
+            K2 = K1 + IDIM2 - 1
+            WRITE(IDEV,2000)I,HIS(J),(SEQ(K),K=K1,K2)
+2000          FORMAT(' ',I6,I6,' ',20A1)
+           END IF
+100     CONTINUE
+        MARGB1 = MARGB1 + MARGT1
+200   CONTINUE
+      CALL VT100M
+      END
+C      SQFIT8
+      SUBROUTINE SQFIT8(SEQ,IDIM1,STRING,IDIM2,ITOTEL,ITOTID,
+     +IS,IE,MINS,ITOTP,KSTART)
+C   AUTHOR: RODGER STADEN
+C   COMPARE POINTERS, NOT ACTUAL CHARACTERS
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER ITOTEL(ITOTID)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS-KSTART+1
+      ITOTP=0
+      DO 200 I=1,IDIF
+        NTOT = 0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(STRING(J).EQ.'-')THEN
+            NTOT = NTOT + 1
+          ELSE
+            K = CTONUM(STRING(J))
+            IF(K.EQ.CTONUM(SEQ(IP)))NTOT = NTOT + 1
+          END IF
+          IP=IP+1
+100     CONTINUE
+        IF(NTOT.GE.MINS)THEN
+          ITOTP=ITOTP+1
+          IF(ITOTP.GT.ITOTID)RETURN
+C          ITOT(ITOTP)=NTOT
+          ITOTEL(ITOTP)=IP-IDIM2+KSTART-1
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      END
+      SUBROUTINE HAIRPN(SEQ,MAXCHR,LENSEQ,NFILE,LMAX,
+     +  ITOT,ITOTEL,ITOTAL,IDTOT,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IDEV,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER SCORES(30)
+      CHARACTER SEQ(MAXCHR)
+      INTEGER LENSEQ(NFILE)
+      INTEGER ITOT(IDTOT),ITOTEL(IDTOT),ITOTAL(IDTOT)
+      DO 1 I=1,30
+        SCORES(I)=0
+1     CONTINUE
+      SCORES(16)=1
+      SCORES(21)=1
+      SCORES(22)=1
+      SCORES(8)=1
+      SCORES(9)=1
+      SCORES(14)=1
+      LEVEL=2
+      IS = 1
+      IN = LMAX
+      MINLPI = 0
+      MAXLPI = 10
+      MINLP = 3
+      MINLPX = 1
+      MAXLPX = 20
+      MAXLP = 6
+      MINBP = 1
+      MAXBP = 20
+      MININ = 6
+      CALL HAIRPD(IS,IN,KBIN,KBOUT,
+     +MINLPI,MAXLPI,MINLP,MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,
+     +IGON,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      XMARG = REAL(MARGT)/(NFILE-1)
+      MARGT1 = NINT(XMARG)
+      MARGB1 = MARGB
+      WRITE(KBOUT,*)' Searching'
+      IF(IGON.EQ.0) THEN
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      END IF
+      DO 900 II = 1,NFILE-1
+        CALL FILLI(ITOT,IDTOT,0)
+        CALL FILLI(ITOTEL,IDTOT,0)
+        CALL FILLI(ITOTAL,IDTOT,0)
+        K = LENSEQ(II)
+        IDIM = LENSEQ(II+1) - LENSEQ(II)
+        CALL HARPN(SEQ(K),IDIM,ITOT,ITOTEL,ITOTAL,IDTOT,
+     +  1,IDIM,KBOUT,IP,NOUT,MINLP,MAXLP,MININ,SCORES)
+        IF(IP.GT.0)THEN
+          XMAX=LMAX
+          XMIN=1
+C         PLOT ON SCALE 0 TO 4*MININ IN Y
+          YMIN=0.
+          YMAX=MININ*4
+          YF=0.
+        DO 800 I=NOUT+1,IP
+          IF(IGON.EQ.0)CALL VECTOM
+          XF=ITOTEL(I)+ITOTAL(I)/2
+          XT=XF
+          YT=ITOT(I)
+          IF(IGON.EQ.0)CALL LINE(XF,XT,YF,YT,
+     +    XMAX,XMIN,YMAX,YMIN,
+     +    MARGL,MARGR,MARGB1,MARGT1,ISXMAX,ISYMAX)
+          IF(IGON.EQ.1)THEN
+            WRITE(IDEV,1000)II,ITOTEL(I)-ITOT(I)+1
+1000        FORMAT(/,' Sequence',I6,I6)
+            CALL DRAWPN(
+     +      SEQ(K),IDIM,ITOT(I),ITOTAL(I),ITOTEL(I),
+     +      IDEV,IDIM)
+          END IF
+800     CONTINUE
+      END IF
+      MARGB1 = MARGB1 + MARGT1
+900   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE HARPN(SEQ,IDIM,ITOT,ITOTEL,ITOTAL,IDTOT,
+     +IS,IN,KBOUT,IP,NOUT,MIN,MAX,MININ,SCORES)
+C   AUTHOR: RODGER STADEN
+      INTEGER SCORES(30)
+      CHARACTER SEQ(IDIM)
+      INTEGER ITOT(IDTOT),ITOTEL(IDTOT),ITOTAL(IDTOT)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,LWRAP2
+      IDO=IN-IS+1
+      IF(IN.LE.IS)IDO=IDIM+IDO
+      I=IS-1
+      IP=0
+      DO 301 K=1,IDO
+        I=I+1
+        DO 300 J=MIN,MAX
+         N=0
+100      CONTINUE
+           IMN=I-N
+           IPNPJ=I+N+J
+           IT=CTONUM(SEQ(LWRAP2(IN,IMN)))+
+     +     5*CTONUM(SEQ(LWRAP2(IN,IPNPJ)))
+           IT=SCORES(IT)
+           IF(IT.NE.0)THEN
+             N=N+IT
+             GO TO 100
+           END IF
+           IF(N.GE.MININ)THEN
+             IP=IP+1
+             IF(IP.GT.IDTOT)THEN
+               WRITE(KBOUT,1234)IP,I
+1234           FORMAT(' Maximum of',I5,
+     +         ' loops found up to',I6,', no more searching')
+               GO TO 302
+             END IF
+             ITOT(IP)=N
+             ITOTEL(IP)=LWRAP2(IN,I)
+             ITOTAL(IP)=J-1
+           END IF
+300      CONTINUE
+301      CONTINUE
+302   CONTINUE
+      IF(IP.GT.0)THEN
+        NOUT=0
+        DO 700 J=1,IP
+          JP1=J+1
+           D1=FLOAT(ITOTEL(J))+FLOAT(ITOTAL(J))/2.
+           DO 690 I=JP1,IP
+             D2=FLOAT(ITOTEL(I))+FLOAT(ITOTAL(I))/2.
+             IF(D1.NE.D2)GO TO 690
+             LP2=ITOTAL(I)+2*ITOT(I)
+             IF(ITOTAL(J).GT.LP2)GO TO 690
+             ITOT(J)=0
+             ITOTAL(J)=0
+             ITOTEL(J)=0
+             NOUT=NOUT+1
+             GO TO 700
+690        CONTINUE
+700     CONTINUE
+        CALL BUB3AS(ITOTEL,ITOT,ITOTAL,IP)
+      END IF
+      END
+C
+C   subroutine to draw hairpin loops on device idev
+C   hairpins are defined by:
+C   1) nstem the number of basepairs in the stem
+C   2) nloop the number of bases in the loop
+C   3) ip the position of the rightmost base in the left side of the stem
+C
+C   draw the stems very simply, just standing up with a few bases
+C   before and after
+C   maximum size is 20 up and 14 across
+      SUBROUTINE DRAWPN(SEQ,IDIM,NSTEM,NLOOP,IP,IDEV,IN)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),ARRAY(14,20),SCORE(5,5),SPACE
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DATA SPACE/' '/
+C   CHECK FOR WRAP AROUND
+      IF((IP-NSTEM).LT.0)RETURN
+      IF((IP+NLOOP+NSTEM+1).GT.IN)RETURN
+      DO 1 I=1,5
+      DO 1 J=1,5
+        SCORE(I,J)=' '
+1     CONTINUE
+      SCORE(1,3)='-'
+      SCORE(3,1)='-'
+      SCORE(2,4)='-'
+      SCORE(4,2)='-'
+      SCORE(1,4)='.'
+      SCORE(4,1)='.'
+      DO 2 I=1,14
+      DO 2 J=1,20
+        ARRAY(I,J)=SPACE
+2     CONTINUE
+C   point to left stem start
+      IPL=IP-NSTEM
+C   point to right stem start
+      IPR=IP+NSTEM+NLOOP+1
+C   point to temporary array
+      IAL=6
+      IAR=8
+      KA=1
+C   fill in stem and loop
+      DO 10 I=1,MIN(NSTEM+NLOOP/2,19)
+        IPL=IPL+1
+        IPR=IPR-1
+        KA=KA+1
+        ARRAY(IAL,KA)=SEQ(IPL)
+        ARRAY(IAR,KA)=SEQ(IPR)
+C       basepaired?
+        ARRAY(IAL+1,KA)=
+     +  SCORE(CTONUM(SEQ(IPL)),CTONUM(SEQ(IPR)))
+10    CONTINUE
+C     odd number of bases in loop?
+      IF(MOD(NLOOP,2).NE.0)THEN
+C       put in extra base
+        IF((KA+1).LE.20)ARRAY((IAL+1),KA+1)=SEQ(IPL+1)
+      END IF
+C     now do 6 bases before the stem and 7 after
+      IPL=IP-NSTEM-6
+      DO 30 I=1,6
+        IPL=IPL+1
+30    IF(IPL.GT.0)ARRAY(I,1)=SEQ(IPL)
+      IPR=IP+NSTEM+NLOOP
+      DO 40 I=8,14
+        IPR=IPR+1
+        IF(IPR.LT.IN)ARRAY(I,1)=SEQ(IPR)
+40    CONTINUE
+      DO 60 I=1,20
+        K=21-I
+        DO 50 JEMPT = 1,14
+          IF(ARRAY(JEMPT,K).NE.' ') THEN
+            WRITE(IDEV,1000)(ARRAY(L,K),L=1,14)
+            GO TO 51
+          END IF
+50      CONTINUE
+51      CONTINUE
+60    CONTINUE
+1000  FORMAT(' ',5X,14A1)
+1001  FORMAT(' ',5X,I6)
+      WRITE(IDEV,1001)IP-NSTEM
+      END
+      SUBROUTINE SHOWST(NFUZ,I1,I2,KCOMP,ICOMP,IDICW,IDICS,
+     +IDICM,IDICH,MASK,LMASK,KBOUT)
+      LOGICAL MASK(LMASK)
+      CHARACTER STRING*25
+      STRING = ' '
+      DO 10 I=1,LMASK
+        IF (MASK(I)) THEN
+          STRING(I:I) = 'x'
+        ELSE
+          STRING(I:I) = '-'
+        END IF
+ 10   CONTINUE
+      WRITE(KBOUT,1001)STRING(1:LMASK),NFUZ,I1,I2
+1001  FORMAT(' Current mask         = ',A,/,
+     +       ' Number of mismatches =',I4,/,
+     +       ' Start position       =',I6,/,
+     +       ' End position         =',I6)
+      IF(KCOMP.EQ.0)WRITE(KBOUT,*)' Input strand only'
+      IF(KCOMP.EQ.1)WRITE(KBOUT,*)' Complementary strand only'
+      IF(KCOMP.EQ.2)WRITE(KBOUT,*)' Both strands'
+      IF(ICOMP.EQ.0)WRITE(KBOUT,*)' Even composition'
+      IF(ICOMP.EQ.1)WRITE(KBOUT,*)' Observed composition'
+      IF(ICOMP.EQ.2)WRITE(KBOUT,*)' Entered composition'
+      IF(IDICW.EQ.0)WRITE(KBOUT,*)' Dictionary Dw made'
+      IF(IDICW.NE.0)WRITE(KBOUT,*)' Dictionary Dw unmade'
+      IF(IDICS.NE.0)WRITE(KBOUT,*)' Dictionary Ds unmade'
+      IF(IDICS.EQ.0)WRITE(KBOUT,*)' Dictionary Ds made'
+      IF(IDICM.EQ.0)WRITE(KBOUT,*)' Dictionary Dm unmade'
+      IF(IDICM.EQ.1)WRITE(KBOUT,*)' Dictionary Dm made from Dw'
+      IF(IDICM.EQ.2)WRITE(KBOUT,*)' Dictionary Dm made from Ds'
+      IF(IDICH.EQ.0)WRITE(KBOUT,*)' Dictionary Dh unmade'
+      IF(IDICH.EQ.1)WRITE(KBOUT,*)' Dictionary Dh made from Dw'
+      IF(IDICH.EQ.2)WRITE(KBOUT,*)' Dictionary Dh made from Ds'
+      END
+      SUBROUTINE GETCM(COMP,KBIN,KBOUT,SEQN,LENSEQ,ICOMP,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER SEQN(LENSEQ)
+      REAL COMP(5),COMPT(4)
+      CHARACTER BASE(4),HELPF*(*),STRING*22
+      PARAMETER (MAXPRM = 20)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      SAVE BASE
+      DATA BASE/'T','C','A','G'/
+      NOPT = ICOMP + 1
+      PROMPT(1) = 'Even composition'
+      PROMPT(2) = 'Observed composition'
+      PROMPT(3) = 'Type in composition'
+        CALL RADION('Select composition',PROMPT,3,NOPT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(NOPT.LT.1) RETURN
+        ICOMP = NOPT - 1
+      IF(NOPT.EQ.1) THEN
+        CALL FILLR(COMP,5,0.25)
+        RETURN
+      ELSE IF (NOPT.EQ.2) THEN
+        CALL GETCOM(SEQN,LENSEQ,COMP)
+        RETURN
+      ELSE IF (NOPT.EQ.3) THEN
+        RMN = 0.
+        TOT = 0.
+        DO 30 I = 1,4
+          RMX = 1.0 - TOT
+          RMM = MIN(0.25,RMX)
+          WRITE(STRING,1002)BASE(I)
+1002      FORMAT('Composition for base ',A1)
+          CALL GETRL(RMN,RMX,RMM,STRING,VAL,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+          TOT = TOT + VAL
+          COMPT(I) = VAL
+30      CONTINUE
+        CALL COPYR(COMPT,COMP,4)
+        RETURN
+      END IF
+      END
+      SUBROUTINE EXAMDW(DICTOT,WINDEX,
+     +MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+      INTEGER DICTOT(MAXPOS),WINDEX(MAXPOS),WT(4,LENGTH)
+      CHARACTER DCODE*8,STRING*8,STRNIN*8,UNMASK*25,SOUT*25,ST*8
+      CHARACTER MMASK*25
+      REAL COMP(5)
+      INTEGER TEMPC(MAXLIS),TEMPP(MAXLIS),TEMPI(MAXLIS),TEMPN(MAXLIS)
+      LOGICAL MASK(LMASK)
+      EXTERNAL DCODE,NCODES,STRNIN,RINF,UNMASK,MMASK
+      IOK = 1
+      IFILL = 4 * LENGTH
+30    CONTINUE
+      L = 0
+      CALL GTSTR('Word to examine',' ',ST,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      IF(INFLAG.NE.0) RETURN
+      STRING = MMASK(ST,MASK,LMASK,L)      
+      IF(L.NE.LENGTH) RETURN
+      CALL FILLI(WT,IFILL,1)
+      CALL WFUZ2(STRING,LENGTH,NFUZ,WINDEX,MAXPOS,WT)
+      I = NCODES(STRING,LENGTH)
+      R = RINF(WT,LENGTH,COMP)
+      ST = STRNIN(STRING,LENGTH)
+      SOUT = UNMASK(ST,MASK,LMASK)
+      WRITE(IDEV,*)SOUT(1:LMASK),' ',DICTOT(I),R
+      CALL WRTWMT(WT,WT,4,LENGTH,IDEV,1)
+      GO TO 30
+      END
+      SUBROUTINE EXAMDM(DICTOT,WINDEX,
+     +MAXPOS,LENGTH,KBIN,KBOUT,WT,NFUZ,IOK,COMP,NFILE,IDEV,
+     +TEMPC,TEMPI,TEMPP,TEMPN,MAXLIS,MASK,LMASK)
+      INTEGER DICTOT(MAXPOS),WINDEX(MAXPOS),WT(4,LENGTH)
+      CHARACTER DCODE*8,STRING*8,STRNIN*8
+      REAL COMP(5)
+      LOGICAL MASK(LMASK)
+      INTEGER TEMPC(MAXLIS),TEMPP(MAXLIS),TEMPI(MAXLIS),TEMPN(MAXLIS)
+      PARAMETER (MAXPRM = 19)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      EXTERNAL DCODE,NCODES,STRNIN,RINF
+      MAXTOT = 0
+      IOK = 1
+      WRITE(KBOUT,*)'Looking for highest scoring words'
+      IFILL = 4 * LENGTH
+      DO 10 I = 1,MAXPOS
+        IF(DICTOT(I).GT.MAXTOT)MAXTOT = DICTOT(I)
+10    CONTINUE
+15    CONTINUE
+      WRITE(KBOUT,*)
+     +'The highest word score = ',MAXTOT
+      MN = 0
+      MX = MAXTOT
+      I = MIN(NFILE/2,MAXTOT)
+      CALL GETINT(MN,MX,I,'Minimum word score',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      MINTOT = IVAL
+      RI = 0.
+      RMN = 0.
+      RMX = 1.0
+      CALL GETRL(RMN,RMX,RI,'Minimum information',
+     +  VAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      XINF = VAL
+      I = 1
+      PROMPT(1) = 'Sort on information'
+      PROMPT(2) = 'Sort word scores'
+      CALL RADION('Select sort mode',PROMPT,2,I,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(I.LT.1) RETURN
+      ISORT = I - 1
+      MN = 0
+      MX = MAXLIS
+      I = MAXLIS
+      CALL GETINT(MN,MX,I,'Maximum number to list',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+      MAXL = IVAL
+      IF((MINTOT.GT.0).OR.(XINF.GT.0.0))THEN
+        IF(MAXTOT.GE.MINTOT)THEN
+          WRITE(KBOUT,*)'The words are'
+          N = 0
+          XINFM = 0.
+          DO 20 I = 1,MAXPOS
+            IF(DICTOT(I).GE.MINTOT)THEN
+              STRING = DCODE(I,LENGTH)
+              CALL FILLI(WT,IFILL,1)
+              CALL WFUZ2(STRING,LENGTH,NFUZ,WINDEX,MAXPOS,WT)
+              R = RINF(WT,LENGTH,COMP)
+              IF(R.GE.XINF)THEN
+                XINFM = MAX(XINFM,R)
+C                WRITE(IDEV,*)STRNIN(STRING,LENGTH),DICTOT(I),R
+                N = N + 1
+                IF(N.LE.MAXL) THEN
+                  TEMPI(N) = R * 10000.
+                  TEMPC(N) = DICTOT(I)
+                  TEMPP(N) = I
+                END IF
+              END IF
+            END IF
+20        CONTINUE
+          WRITE(KBOUT,*)' Total words=',N,' Maximum information=',XINFM
+          MLIS = MIN(MAXL,N)
+          INCDEC = -1
+          IF(ISORT.EQ.0) THEN
+            CALL SHLSRT(TEMPI,MLIS,TEMPN,INCDEC)
+            CALL OUTP(TEMPI,TEMPC,TEMPP,TEMPN,MLIS,LENGTH,IDEV,MASK,
+     +      LMASK)
+          ELSE IF(ISORT.EQ.1) THEN
+            CALL SHLSRT(TEMPC,MLIS,TEMPN,INCDEC)
+            CALL OUTP(TEMPI,TEMPC,TEMPP,TEMPN,MLIS,LENGTH,IDEV,MASK,
+     +      LMASK)
+          END IF
+          GO TO 15
+        END IF
+      END IF
+      END
+      CHARACTER*(*) FUNCTION STRNIN(STRING,LENGTH)
+      CHARACTER STRING*(*),TUPPER
+      EXTERNAL TUPPER,KTONUM
+      CALL CCASE(STRING,2)
+      STRNIN(1:LENGTH) = STRING(1:LENGTH)
+      LO2 = LENGTH / 2
+      J = LENGTH
+      DO 10 I = 1,LO2
+        K = KTONUM(STRING(I:I)) + KTONUM(STRING(J:J))
+        IF (K.EQ.5) THEN
+          STRNIN(I:I) = TUPPER(STRING(I:I))
+          STRNIN(J:J) = TUPPER(STRING(J:J))
+        END IF
+        J = J - 1
+10    CONTINUE
+      END
+      INTEGER FUNCTION KTONUM(CHAR)
+      CHARACTER CHAR
+      INTEGER CTONUM,MAP(5)
+      EXTERNAL CTONUM
+      SAVE MAP
+      DATA MAP/1,2,4,3,5/
+      KTONUM = MAP(CTONUM(CHAR))
+      END
+      SUBROUTINE MAKEDW(SEQ,MAXCHR,WINDEX,TMPDIC,MAXPOS,
+     +LENSEQ,MAXSEQ,NFILE,LENGTH,KCOMP,I1,I2,MASK,LMASK)
+      INTEGER SEQ(MAXCHR),WINDEX(MAXPOS),TMPDIC(MAXPOS)
+      INTEGER LENSEQ(MAXSEQ),CCODEM
+      LOGICAL MASK(LMASK)
+      EXTERNAL NCODEM,CCODEM
+      CALL FILLI(WINDEX,MAXPOS,0)
+      IF((KCOMP.EQ.0).OR.(KCOMP.EQ.2))THEN
+        DO 10 I = 1,NFILE-1
+          I11 = LENSEQ(I) + I1 - 1
+          I22 = MIN(LENSEQ(I) + I2 - 1, LENSEQ(I+1)) - LMASK
+          DO 5 J = I11,I22
+            N = NCODEM(SEQ(J),LENGTH,MASK,LMASK)
+            IF(N.NE.0) WINDEX(N) = WINDEX(N) + 1
+5         CONTINUE
+10      CONTINUE
+      END IF
+      IF((KCOMP.EQ.1).OR.(KCOMP.EQ.2)) THEN
+        DO 20 I = 1,NFILE-1
+          I11 = LENSEQ(I) + I1 - 1
+          I22 = MIN(LENSEQ(I) + I2 - 1, LENSEQ(I+1)) - LMASK
+          DO 15 J = I11,I22
+            N = CCODEM(SEQ(J),LENGTH,MASK,LMASK)
+            IF(N.NE.0) WINDEX(N) = WINDEX(N) + 1
+15        CONTINUE
+20      CONTINUE
+      END IF
+      END
+      INTEGER FUNCTION CCODE(STRING,LENGTH)
+      IMPLICIT INTEGER (A-Z)
+      DIMENSION CONSTS(8),STARTS(8),STRING(LENGTH),COMP(5)
+      SAVE CONSTS,STARTS,COMP
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      DATA STARTS/0,-4,-20,-84,-340,-1364,-5460,-21844/
+      DATA COMP/3,4,1,2,5/
+      CCODE = 0
+      N = STARTS(LENGTH)
+      L = LENGTH
+      DO 10 I = LENGTH,1,-1
+        J = STRING(I)
+        IF(J.NE.5) THEN
+          N = N + COMP(J) * CONSTS(L)
+          L = L - 1
+        ELSE
+          RETURN
+        END IF
+10    CONTINUE
+      CCODE = N
+      END
+      INTEGER FUNCTION CCODEM(STRING,LENGTH,MASK,LMASK)
+      IMPLICIT INTEGER (A-Z)
+      DIMENSION CONSTS(8),STARTS(8),STRING(LENGTH),COMP(5)
+      LOGICAL MASK(LMASK)
+      SAVE CONSTS,STARTS,COMP
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      DATA STARTS/0,-4,-20,-84,-340,-1364,-5460,-21844/
+      DATA COMP/3,4,1,2,5/
+      CCODEM = 0
+      N = STARTS(LENGTH)
+      L = LENGTH
+      DO 10 I = LENGTH,1,-1
+        IF (MASK(I)) THEN
+          J = STRING(I)
+          IF(J.NE.5) THEN
+            N = N + COMP(J) * CONSTS(L)
+            L = L - 1
+          ELSE
+            RETURN
+          END IF
+        END IF
+10    CONTINUE
+      CCODEM = N
+      END
+      INTEGER FUNCTION NCODEM(STRING,LENGTH,MASK,LMASK)
+      IMPLICIT INTEGER (A-Z)
+      DIMENSION CONSTS(8),STARTS(8),STRING(LENGTH)
+      LOGICAL MASK(LMASK)
+      SAVE CONSTS,STARTS
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      DATA STARTS/0,-4,-20,-84,-340,-1364,-5460,-21844/
+      NCODEM = 0
+      N = STARTS(LENGTH)
+      L = LENGTH
+      DO 10 I = 1,LMASK
+        IF (MASK(I)) THEN
+          J = STRING(I)
+          IF(J.NE.5) THEN
+            N = N + J * CONSTS(L)
+            L = L - 1
+          ELSE
+            RETURN
+          END IF
+        END IF
+10    CONTINUE
+      NCODEM = N
+      END
+      INTEGER FUNCTION NCODE(STRING,LENGTH)
+      IMPLICIT INTEGER (A-Z)
+      DIMENSION CONSTS(8),STARTS(8),STRING(LENGTH)
+      SAVE CONSTS,STARTS
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      DATA STARTS/0,-4,-20,-84,-340,-1364,-5460,-21844/
+      NCODE = 0
+      N = STARTS(LENGTH)
+      L = LENGTH
+      DO 10 I = 1,LENGTH
+        J = STRING(I)
+        IF(J.NE.5) THEN
+          N = N + J * CONSTS(L)
+          L = L - 1
+        ELSE
+          RETURN
+        END IF
+10    CONTINUE
+      NCODE = N
+      END
+      INTEGER FUNCTION NCODES(STRING,LENGTH)
+      INTEGER CONSTS(8),STARTS(8)
+      CHARACTER STRING*(*)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE CONSTS,STARTS
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      DATA STARTS/0,-4,-20,-84,-340,-1364,-5460,-21844/
+      NCODES = STARTS(LENGTH)
+      DO 10 I = 1,LENGTH
+        NCODES = NCODES + CTONUM(STRING(I:I)) * CONSTS(LENGTH-I+1)
+10    CONTINUE
+      END
+      SUBROUTINE WFUZ2(WORD,LENGTH,NFUZ,WINDEX,MAXPOS,WT)
+      INTEGER STRING(8),WINDEX(MAXPOS)
+      INTEGER DIF(3,4),WT(4,LENGTH)
+      INTEGER CTONUM
+      CHARACTER WORD*(*),CHAR
+      EXTERNAL CTONUM,NCODE
+      SAVE DIF
+      DATA DIF/2,3,4,
+     +         1,3,4,
+     +         1,2,4,
+     +         1,2,3/
+      DO 1 I = 1,LENGTH
+        CHAR = WORD(I:I)
+        STRING(I) = CTONUM(CHAR)
+1     CONTINUE      
+C  KEEP ADDING COUNTS UNTIL LEVEL OF FUZ EXCEEDED
+      NALL = NCODE(STRING(1),LENGTH)
+        IF(WINDEX(NALL).NE.0) 
+     +    CALL WDCODE(WT,STRING,LENGTH,WINDEX(NALL))
+          IF(NFUZ.EQ.0)RETURN
+C  ONLY ONE POSITION MUST CHANGE, CHANGE THEM ALL IN TURN
+          N = 1
+          DO 10 I1 = 1,LENGTH
+            DO 5 I2 = 1,3
+            N = N + 1
+            IT = STRING(I1)
+            NC1 = DIF(I2,STRING(I1))
+            STRING(I1) = NC1
+            NALL = NCODE(STRING(1),LENGTH)
+            IF(WINDEX(NALL).NE.0)
+     +      CALL WDCODE(WT,STRING,LENGTH,WINDEX(NALL))
+            STRING(I1) = IT
+5         CONTINUE
+10      CONTINUE
+        IF(NFUZ.EQ.1)RETURN
+C  TWO POSITIONS MUST CHANGE, CHANGE THEM ALL IN TURN
+        DO 40 I1 = 1,LENGTH
+          DO 30 I2 = I1+1,LENGTH
+            DO 25 I3 = 1,3
+              IT1 = STRING(I1)
+              NC1 = DIF(I3,STRING(I1))
+              STRING(I1) = NC1
+              DO 24 I4 = 1,3
+                IT2 = STRING(I2)
+                NC2 = DIF(I4,STRING(I2))
+                STRING(I2) = NC2
+                N = N + 1
+                NALL = NCODE(STRING(1),LENGTH)
+                IF(WINDEX(NALL).NE.0)
+     +           CALL WDCODE(WT,STRING,LENGTH,WINDEX(NALL))
+                STRING(I2) = IT2
+24            CONTINUE
+              STRING(I1) = IT1
+25          CONTINUE
+30        CONTINUE
+40      CONTINUE
+      END
+      SUBROUTINE MAKEDM(WINDEX,DM,LENGTH,NFUZ,MAXPOS)
+      INTEGER STRING(8),WINDEX(MAXPOS),DM(MAXPOS),DIF(3,4)
+      SAVE DIF
+      EXTERNAL NCODE
+      DATA DIF/2,3,4,
+     +         1,3,4,
+     +         1,2,4,
+     +         1,2,3/
+C      IF(NFUZ.EQ.0)THEN
+      DO 100 IWORD = 1,MAXPOS
+C CHANGED NEXT LINE 16-7-89
+C        DICTOT(IWORD) = DICTOT(IWORD) + WINDEX(IWORD)
+        DM(IWORD) = WINDEX(IWORD)
+100   CONTINUE
+      IF(NFUZ.EQ.0) RETURN
+      DO 200 IWORD = 1,MAXPOS
+C  ONLY ONE POSITION MUST CHANGE, CHANGE THEM ALL IN TURN
+C  IF THIS WORD EXISTS IN SEQ
+        KT = WINDEX(IWORD)
+        IF(KT.NE.0)THEN
+          CALL DCODEI(IWORD,LENGTH,STRING)
+          DO 10 I1 = 1,LENGTH
+            DO 5 I2 = 1,3
+              IT = STRING(I1)
+              NC1 = DIF(I2,STRING(I1))
+              STRING(I1) = NC1
+              JWORD = NCODE(STRING(1),LENGTH)
+              DM(JWORD) = DM(JWORD) + KT
+              STRING(I1) = IT
+5           CONTINUE
+10        CONTINUE
+        END IF
+200   CONTINUE
+      IF(NFUZ.EQ.1)RETURN
+        DO 300 IWORD = 1,MAXPOS
+C  TWO POSITIONS MUST CHANGE, CHANGE THEM ALL IN TURN
+          KT = WINDEX(IWORD)
+          IF(KT.NE.0)THEN
+            CALL DCODEI(IWORD,LENGTH,STRING)
+            DO 40 I1 = 1,LENGTH
+              DO 30 I2 = I1+1,LENGTH
+                DO 25 I3 = 1,3
+                  IT1 = STRING(I1)
+                  NC1 = DIF(I3,STRING(I1))
+                  STRING(I1) = NC1
+                  DO 24 I4 = 1,3
+                    IT2 = STRING(I2)
+                    NC2 = DIF(I4,STRING(I2))
+                    STRING(I2) = NC2
+                    JWORD = NCODE(STRING(1),LENGTH)
+                    DM(JWORD) = DM(JWORD) + KT
+                    STRING(I2) = IT2
+24                CONTINUE
+                  STRING(I1) = IT1
+25              CONTINUE
+30            CONTINUE
+40          CONTINUE
+          END IF
+300     CONTINUE
+        RETURN
+C      END IF
+      END
+      SUBROUTINE MAKEDH(TMPDIC,DICTOT,LENGTH,NFUZ,MAXPOS)
+      INTEGER STRING(8),TMPDIC(MAXPOS),DICTOT(MAXPOS)
+      INTEGER GETV
+      EXTERNAL GETV
+      DO 100 IWORD = 1,MAXPOS
+        TMPDIC(IWORD) = DICTOT(IWORD)
+100   CONTINUE
+      IF(NFUZ.EQ.0) RETURN
+      DO 200 IWORD = 1,MAXPOS
+C  ONLY ONE POSITION MUST CHANGE, CHANGE THEM ALL IN TURN
+C  IF THIS WORD EXISTS IN SEQ
+        IF(DICTOT(IWORD).NE.0)THEN
+          CALL DCODEI(IWORD,LENGTH,STRING)
+          IV = GETV(IWORD,LENGTH,STRING,DICTOT,MAXPOS,NFUZ)
+          IF(IV.GT.TMPDIC(IWORD))TMPDIC(IWORD) = 0
+        END IF
+200   CONTINUE
+      IF(NFUZ.EQ.1)RETURN
+      DO 300 IWORD = 1,MAXPOS
+C  TWO POSITIONS MUST CHANGE, CHANGE THEM ALL IN TURN
+        IF(DICTOT(IWORD).NE.0)THEN
+          CALL DCODEI(IWORD,LENGTH,STRING)
+          IV = GETV(IWORD,LENGTH,STRING,DICTOT,MAXPOS,NFUZ)
+          IF(IV.GT.TMPDIC(IWORD))TMPDIC(IWORD) = 0
+        END IF
+300   CONTINUE
+      RETURN
+      END
+      INTEGER FUNCTION GETV(IWORD,LENGTH,STRING,DICTOT,MAXPOS,NFUZ)
+      INTEGER DICTOT(MAXPOS),STRING(8)
+      INTEGER DIF(3,4)
+      EXTERNAL NCODE
+      SAVE DIF
+      DATA DIF/2,3,4,
+     +         1,3,4,
+     +         1,2,4,
+     +         1,2,3/
+      IV = DICTOT(IWORD)
+      DO 10 I1 = 1,LENGTH
+        DO 5 I2 = 1,3
+          IT = STRING(I1)
+          NC1 = DIF(I2,STRING(I1))
+          STRING(I1) = NC1
+          JWORD = NCODE(STRING(1),LENGTH)
+          IV = MAX(IV,DICTOT(JWORD))
+          STRING(I1) = IT
+5       CONTINUE
+10    CONTINUE
+      IF(NFUZ.EQ.1)THEN
+        GETV = IV
+        RETURN
+      END IF
+      DO 40 I1 = 1,LENGTH
+        DO 30 I2 = I1+1,LENGTH
+          DO 25 I3 = 1,3
+            IT1 = STRING(I1)
+            NC1 = DIF(I3,STRING(I1))
+            STRING(I1) = NC1
+            DO 24 I4 = 1,3
+              IT2 = STRING(I2)
+              NC2 = DIF(I4,STRING(I2))
+              STRING(I2) = NC2
+              JWORD = NCODE(STRING(1),LENGTH)
+              IV = MAX(IV,DICTOT(JWORD))
+              STRING(I2) = IT2
+24          CONTINUE
+            STRING(I1) = IT1
+25        CONTINUE
+30      CONTINUE
+40    CONTINUE
+      GETV = IV
+      END
+      SUBROUTINE WDCODE(WT,STRING,LENGTH,IADD)
+      INTEGER WT(4,LENGTH),STRING(LENGTH)
+      DO 10 I = 1,LENGTH
+        WT(STRING(I),I) = WT(STRING(I),I) + IADD
+10    CONTINUE
+      END
+      REAL FUNCTION RINF(WT,LENGTH,COMP)
+      INTEGER WT(4,LENGTH)
+      REAL COMP(5)
+      HSS = 0.
+      T = 0.
+      DO 1 I = 1,4
+        T = T + WT(I,1)
+1     CONTINUE
+      DO 10 I = 1,LENGTH
+        DO 5 J = 1,4
+          HS = 0.
+          Y = COMP(J)
+          X = REAL(WT(J,I))/T
+          HS = X * LOG(X/Y)
+          HSS = HSS + HS
+5       CONTINUE
+10    CONTINUE
+      RINF = HSS/REAL(LENGTH)
+      END
+      SUBROUTINE DCODEI(A,LENGTH,STRING)
+      INTEGER A,CONSTS(8),STRING(LENGTH)
+      SAVE CONSTS
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      K = 0
+      J = A
+      DO 10 I = LENGTH,1,-1
+        J1 = (J-1) / CONSTS(I)
+        K = K + 1
+        STRING(K) = J1 + 1
+        J = J - J1*CONSTS(I)
+10    CONTINUE
+      END
+      SUBROUTINE WRTWMT(WEIGHT,IWT,MAXCHR,LENGTH,IDEV,IFLAG)
+      REAL WEIGHT(MAXCHR,LENGTH)
+      INTEGER IWT(MAXCHR,LENGTH)
+      CHARACTER STRING*8
+      CHARACTER WTMCON
+      EXTERNAL WTMCON
+      DO 10 I = 1,MAXCHR
+        IF(IFLAG.EQ.0)
+     +  WRITE(IDEV,1000,ERR=100)(WEIGHT(I,J),J=1,LENGTH)
+        IF(IFLAG.EQ.1)
+     +  WRITE(IDEV,1001,ERR=100)(IWT(I,J),J=1,LENGTH)
+10    CONTINUE
+1000  FORMAT(' ',20F5.2)
+1001  FORMAT(' ',20I6)
+100   CONTINUE
+      STRING = ' '
+      DO 20 I = 1,LENGTH
+        STRING(I:I) = WTMCON(IWT,LENGTH,MAXCHR,I)
+20    CONTINUE
+      WRITE(IDEV,1002)STRING
+1002  FORMAT(' ',A)
+      END
+      CHARACTER*1 FUNCTION WTMCON(IWT,LENGTH,MAXCHR,IP)
+      INTEGER IWT(MAXCHR,LENGTH),C(5)
+      CHARACTER STRING(5,5)
+      SAVE STRING
+      DATA STRING/
+     +'T','Y','W','K','T',
+     +'Y','C','M','S','C',
+     +'W','M','A','R','A',
+     +'K','S','R','G','G',
+     +'T','C','A','G','-'/            
+      N = 0
+      DO 10 J = 1,MAXCHR
+        I = IWT(J,IP)
+        IF (I.GT.N) N = I
+10    CONTINUE
+      NC = 0
+      DO 20 J = 1,MAXCHR
+        C(J) = 0
+        I = IWT(J,IP)
+        IF (I.EQ.N) THEN
+          M = J
+          NC = NC + 1
+          C(NC) = J
+        END IF
+20    CONTINUE
+      IF(NC.EQ.1)THEN
+        J = C(1)
+        WTMCON = STRING(J,J)
+        RETURN
+      END IF
+      IF(NC.EQ.2)THEN
+        J = C(1)
+        I = C(2)
+        WTMCON = STRING(I,J)
+        RETURN
+      END IF
+      IF(NC.EQ.3)THEN
+        J = 0
+        DO 30 I = 1,3
+          J = J + C(I)
+30      CONTINUE
+        IF(J.EQ.6) WTMCON = 'H'
+        IF(J.EQ.7) WTMCON = 'B'
+        IF(J.EQ.8) WTMCON = 'D'
+        IF(J.EQ.9) WTMCON = 'V'
+        RETURN
+      END IF
+      WTMCON = 'N'
+      END
+      CHARACTER*(*) FUNCTION DCODE(A,LENGTH)
+      IMPLICIT INTEGER (A-Z)
+      DIMENSION CONSTS(8)
+      CHARACTER BASE(5)
+      SAVE CONSTS,BASE
+      DATA BASE/'T','C','A','G','-'/
+      DATA CONSTS/1,4,16,64,256,1024,4096,16384/
+      K = 0
+      J = A
+      DCODE = ' '
+      DO 10 I = LENGTH,1,-1
+        J1 = (J-1) / CONSTS(I)
+        K = K + 1
+        DCODE(K:K) = BASE(J1+1)
+        J = J - J1*CONSTS(I)
+10    CONTINUE
+      END
+      SUBROUTINE GETCOM(SEQN,IDIM,COMP)
+      INTEGER SEQN(IDIM)
+      REAL COMP(5)
+C NB ADD 1 TO EACH BASE FOR LATER
+      CALL FILLR(COMP,5,1.)
+      DO 10 I = 1,IDIM
+        J = SEQN(I)
+        COMP(J) = COMP(J) + 1.
+10    CONTINUE
+      X = REAL(IDIM) + 4.
+      DO 20 I = 1,4
+        COMP(I) = COMP(I)/X
+20    CONTINUE
+      END
+      SUBROUTINE TCOMP(COMP,KCOMP)
+      REAL COMP(5)
+      IF(KCOMP.EQ.1)THEN
+        T1 = COMP(1)
+        COMP(1) = COMP(3)
+        COMP(3) = T1
+        T1 = COMP(2)
+        COMP(2) = COMP(4)
+        COMP(4) = T1
+      ELSE IF (KCOMP.EQ.2)THEN
+        T1 = COMP(1)
+        T2 = COMP(3)
+        T1 = (T1 + T2) / 2.
+        COMP(1) = T1
+        COMP(3) = T1
+        T1 = COMP(2)
+        T2 = COMP(4)
+        T1 = (T1 + T2) / 2.
+        COMP(2) = T1
+        COMP(4) = T1
+      END IF
+      END
+      SUBROUTINE WRTDIC(IDEV,FILNAM,DIC,MAXPOS,IOK)
+      INTEGER DIC(MAXPOS)
+      CHARACTER FILNAM*(*)
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,6)
+      IF(IOK.NE.0)RETURN
+      WRITE(IDEV)DIC
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE RDDIC(IDEV,FILNAM,DIC,MAXPOS,IOK)
+      INTEGER DIC(MAXPOS)
+      CHARACTER FILNAM*(*)
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,7)
+      IF(IOK.NE.0)RETURN
+      READ(IDEV)DIC
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE RDIC(IDEV,FILNAM,DIC,MAXPOS,KBIN,KBOUT,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER DIC(MAXPOS)
+      CHARACTER FILNAM*(*),HELPF*(*)
+30    CONTINUE
+      L = 0
+      CALL GTSTR(
+     +'Name of dictionary file',' ',FILNAM,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      IF(INFLAG.NE.0) RETURN
+      CALL RDDIC(IDEV,FILNAM,DIC,MAXPOS,IOK)
+      END
+      SUBROUTINE WDIC(IDEV,FILNAM,DIC,MAXPOS,KBIN,KBOUT,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      INTEGER DIC(MAXPOS)
+      CHARACTER FILNAM*(*),HELPF*(*)
+30    CONTINUE
+      L = 0
+      CALL GTSTR(
+     +'Name for dictionary file',' ',FILNAM,L,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      IF(INFLAG.NE.0) RETURN
+      CALL WRTDIC(IDEV,FILNAM,DIC,MAXPOS,IOK)
+      END
+      SUBROUTINE DICIO(KBIN,KBOUT,NOPT,IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXPRM = 10)
+      CHARACTER PROMPT(8)*(MAXPRM)
+      WRITE(KBOUT,*)'Save or restore a dictionary'
+      NOPT = 0
+      IB = 1
+      PROMPT(1) = 'Save Dw'
+      PROMPT(2) = 'Save Ds'
+      PROMPT(3) = 'Save Dm'
+      PROMPT(4) = 'Save Dh'
+      PROMPT(5) = 'Restore Dw'
+      PROMPT(6) = 'Restore Ds'
+      PROMPT(7) = 'Restore Dm'
+      PROMPT(8) = 'Restore Dh'
+      CALL RADION('Select save or restore mode',PROMPT,8,IB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IB.LT.1) RETURN
+      NOPT = IB
+      END
+      SUBROUTINE SHLSRT(KEY,N,IPOINT,INCDEC)
+C     SHELL SORT
+C     REFERENCES:  D.L. SHELL, CACM 2, 32 (JULY 1959)
+C                  D.E. KNUTH, TAOCP III, SECT. 5.2.1
+C     CALLING SEQUENCE:
+C
+C     KEY    IS AN ARRAY OF KEYS ON WHICH TO SORT
+C     N      IS THE NUMBER OF ITEMS
+C     IPOINT IS THE ARRAY OF POINTERS
+C            (ONLY THE POINTERS WILL MOVE)
+C     INCDEC .GE. 0 FOR SORTING INTO INCREASING ORDER;
+C            .LT. 0 FOR SORTING INTO DECREASING ORDER
+C
+      INTEGER KEY(N),IPOINT(N),K
+      INTEGER H,S,T
+      IPOINT(1) = 1
+      IF(N.LT.2)RETURN
+C     INITIALIZE POINTER ARRAY
+      DO 2 I = 2,N
+        IPOINT(I) = I
+    2 CONTINUE
+C
+C     CHOICE OF SEQUENCE OF INCREMENTS SUGGESTED
+C     BY KNUTH III, EQ. 8, P. 95.   HIS FORMULA
+C     IS EQUIVALENT TO:
+C
+C            H(S) = (3**S - 1)/2
+C            INITIAL VALUE OF S IS MINIMAL INTEGER
+C              SUCH THAT H(S+2) .GE. N
+C
+C
+C     SMAX = (ALOG(2N + 1)/ALOG(3)) - 2 + 1
+      S = INT( (ALOG(FLOAT(2*N+1))/1.09861229) - 0.95 )
+      S = MAX(S,1)
+      H = (3**S - 1)/2
+      DO 7 T = 1,S
+        JMIN = H + 1
+        DO 6 J = JMIN,N
+          I = J - H
+          JJ = IPOINT(J)
+          K = KEY(JJ)
+          IPT = IPOINT(J)
+   3      CONTINUE
+          II = IPOINT(I)
+          IF((K-KEY(II)).LE.0)THEN
+            IPLUSH = I + H
+            IPOINT(IPLUSH) = IPOINT(I)
+            I = I - H
+            IF(I.GT.0) GO TO 3
+          END IF
+          IPLUSH = I + H
+          IPOINT(IPLUSH) = IPT
+    6   CONTINUE
+C     CHANGE INCREMENT
+        IF(H.LT.2) GO TO 8
+        H = (H-1)/3
+7     CONTINUE
+C      CHECK INCDEC: IF NEGATIVE, SWITCH POINTER ARRAY
+8     CONTINUE
+      IF(INCDEC.LT.0)THEN
+        M = N/2
+        NP1MI = N
+        DO 10 I = 1,M
+          NTEMP = IPOINT(I)
+          IPOINT(I) = IPOINT(NP1MI)
+          IPOINT(NP1MI) = NTEMP
+          NP1MI = NP1MI - 1
+  10    CONTINUE
+      END IF
+      END
+      SUBROUTINE MAKEDS(SEQ,MAXCHR,DS,DT,MAXPOS,
+     +LENSEQ,MAXSEQ,NFILE,LENGTH,KCOMP,I1,I2,TEMP,MASK,LMASK)
+      INTEGER SEQ(MAXCHR),DS(MAXPOS),DT(MAXPOS)
+      INTEGER LENSEQ(MAXSEQ),CCODEM,TEMP(MAXCHR)
+      LOGICAL MASK(LMASK)
+      EXTERNAL NCODEM,CCODEM
+      CALL FILLI(DS,MAXPOS,0)
+      CALL FILLI(DT,MAXPOS,0)
+      ITEMP = 0
+      IF((KCOMP.EQ.0).OR.(KCOMP.EQ.2))THEN
+        DO 10 I = 1,NFILE-1
+          I11 = LENSEQ(I) + I1 - 1
+          I22 = MIN(LENSEQ(I) + I2 - 1, LENSEQ(I+1)) - LMASK
+          IF(ITEMP.GT.0) THEN
+            DO 4 J = 1,ITEMP
+              DT(TEMP(J)) = 0
+4           CONTINUE
+            ITEMP = 0
+          END IF
+          DO 5 J = I11,I22
+            N = NCODEM(SEQ(J),LENGTH,MASK,LMASK)
+            IF(N.NE.0)THEN
+              IF(DT(N).EQ.0) THEN
+                DT(N) = 1
+                ITEMP = ITEMP + 1
+                TEMP(ITEMP) = N
+                DS(N) = DS(N) + 1
+              END IF
+            END IF
+5         CONTINUE
+10      CONTINUE
+      END IF
+      IF((KCOMP.EQ.1).OR.(KCOMP.EQ.2)) THEN
+        DO 20 I = 1,NFILE-1
+          I11 = LENSEQ(I) + I1 - 1
+          I22 = MIN(LENSEQ(I) + I2 - 1, LENSEQ(I+1)) - LMASK
+          IF(ITEMP.GT.0) THEN
+            DO 14 J = 1,ITEMP
+              DT(TEMP(J)) = 0
+14           CONTINUE
+            ITEMP = 0
+          END IF
+          DO 15 J = I11,I22
+            N = CCODEM(SEQ(J),LENGTH,MASK,LMASK)
+            IF(N.NE.0)THEN
+              IF(DT(N).EQ.0) THEN
+                DT(N) = 1
+                ITEMP = ITEMP + 1
+                TEMP(ITEMP) = N
+                DS(N) = DS(N) + 1
+              END IF
+            END IF
+15        CONTINUE
+20      CONTINUE
+      END IF
+      END
+      SUBROUTINE OUTP(TEMPI,TEMPC,TEMPP,TEMPN,MLIS,LENGTH,IDEV,
+     +MASK,LMASK)
+      INTEGER TEMPI(MLIS),TEMPC(MLIS),TEMPP(MLIS),TEMPN(MLIS)
+      CHARACTER*8 STRING,DCODE
+      CHARACTER UNMASK*25,SOUT*25
+      LOGICAL MASK(LMASK)
+      EXTERNAL DCODE,UMASK
+      DO 100 I = 1,MLIS
+        N = TEMPN(I)
+        NW = TEMPP(N)
+        STRING = DCODE(NW,LENGTH)
+        SOUT = UNMASK(STRING,MASK,LMASK)
+        WRITE(IDEV,1000)SOUT(1:LMASK),TEMPC(N),REAL(TEMPI(N))/10000.
+1000    FORMAT(' ',A,' ',I6,F10.5)
+100   CONTINUE
+      END
+      SUBROUTINE HAIRPD(IS,IN,KBIN,KBOUT,
+     +MINLPI,MAXLPI,MINLP,MINLPX,MAXLPX,MAXLP,MINBP,MAXBP,MININ,
+     +IGON,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Define the range of loop sizes')
+      CALL GETINT(MINLPI,MAXLPI,MINLP,'Minimum loop size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINLP = IVAL
+      MINLPX = MAX(MINLPX,MINLP)
+      MAXLP = MINLPX
+      CALL GETINT(MINLPX,MAXLPX,MAXLP,'Maximum loop size',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXLP = IVAL
+      MINLP = MINLP + 1
+      MAXLP = MAXLP + 1
+      CALL GETINT(MINBP,MAXBP,MININ,'Minimum number of basepairs',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MININ = IVAL
+      CALL YESONO(IGON,'Plot results','List results',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+C      IF(IGON.LT.0)RETURN
+      END
+C   lwrap2
+      INTEGER FUNCTION LWRAP2(IDIM,I)
+C   AUTHOR: RODGER STADEN
+C   test for end of array,ifso wrap around
+      LWRAP2=I
+      IF(LWRAP2.LT.1)LWRAP2=LWRAP2+IDIM+1
+      IF(LWRAP2.GT.IDIM)LWRAP2=LWRAP2-IDIM
+      END
+      SUBROUTINE LISTN(SEQ,MAXCHR,LENSEQ,NFILE,LMAX,LINLEN,I1,I2,
+     +IDEV,KBOUT)
+      CHARACTER SEQ(MAXCHR)
+      INTEGER LENSEQ(NFILE)
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2-I1+1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+(I2-I1+1)/LINLEN
+      IF(MOD(I2-I1+1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+9,ISTART+LINLEN-1,10)
+        DO 40 J=1,NFILE-1
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+LINLEN,LENSEQ(J+1)) - 1
+          WRITE(IDEV,1003,ERR=60)J,(SEQ(K),K=KF,KT)
+1003      FORMAT(' ',I4,'  ',100A1)
+1006      FORMAT('       ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT(/)
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+      SUBROUTINE GMASK(KBIN,KBOUT,MASK,LMASK1,LENGTH,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER CMASK*25,HELPF*(*),TUPPER
+      LOGICAL MASK(LMASK1)
+      EXTERNAL TUPPER
+ 1    CONTINUE
+      LMASK = LMASK1
+      DO 2 I=1,LMASK
+        MASK(I) = .TRUE.
+ 2    CONTINUE
+      LMASK = LENGTH
+5     CONTINUE
+        WRITE(KBOUT,1002)
+1002    FORMAT(' x means use, - means ignore',/,
+     +         ' e.g. xx-x---x-x means use positions 1,2,4,8,10')
+        LIN = 0
+        CALL GTSTR('Mask',' ',CMASK,LIN,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 5
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        IF(LIN.EQ.0) RETURN
+        LMASK = LIN
+        LENGTH = 0
+        DO 70 I=1,LIN
+          IF(TUPPER(CMASK(I:I)).EQ.'X') THEN
+            LENGTH = LENGTH + 1
+            MASK(I) = .TRUE.
+          ELSE
+            MASK(I) = .FALSE.
+          END IF
+70      CONTINUE
+      IF (LENGTH.GT.8) THEN
+        WRITE(KBOUT,*)'More than 8 active bases'
+        GO TO 1
+      END IF
+      LMASK1 = LMASK
+      END
+      CHARACTER*(*) FUNCTION UNMASK(STRING,MASK,LMASK)
+      CHARACTER STRING*(*)
+      LOGICAL MASK(LMASK)
+      UNMASK = ' '
+      DO 5 I=1,LMASK
+        UNMASK(I:I) = '-'
+ 5    CONTINUE
+      J = 0
+      DO 10 I=1,LMASK
+        IF (MASK(I)) THEN
+          J  = J + 1
+          UNMASK(I:I) = STRING(J:J)
+        END IF
+ 10   CONTINUE
+      END
+      CHARACTER*(*) FUNCTION MMASK(STRING,MASK,LMASK,J)
+      CHARACTER STRING*(*)
+      LOGICAL MASK(LMASK)
+      J = 0
+      DO 10 I=1,LMASK
+        IF (MASK(I)) THEN
+          J  = J + 1
+          MMASK(J:J) = STRING(I:I)
+        END IF
+ 10   CONTINUE
+      END
diff --git a/src/staden/mystdlib.h b/src/staden/mystdlib.h
new file mode 100644
index 0000000..7a51472
--- /dev/null
+++ b/src/staden/mystdlib.h
@@ -0,0 +1,38 @@
+/*
+    Title:       stdlib
+
+    File: 	 mystdlib.h
+    Purpose:	 ANSII C stdlib.h replacement
+    Last update: Thu May 24 1990
+*/
+
+
+/*
+    The SUN does not provide the ANSII C stdlib.h header file, so
+    so this file provides some of the definitions which should be
+    found there.
+
+    Note that we cannot override existing type definitions.
+
+    (3/3/93) For this reason this file is now no longer needed. If you are on
+    a system needing this then define NOSTDLIB in the makefile.
+*/
+
+#ifdef NOSTDLIB
+
+
+/* String processing functions */
+extern double strtod(char *str, char **ptr);
+extern double atof(char *str);
+extern long   strtol(char *str, char **ptr, int base);
+
+
+/* Control functions */
+extern void exit(int status);
+
+
+/* Storage allocation */
+extern char *malloc(unsigned size);
+extern char *calloc(unsigned elt_count, unsigned elt_size);
+
+#endif
diff --git a/src/staden/nip.f b/src/staden/nip.f
new file mode 100644
index 0000000..0d49c49
--- /dev/null
+++ b/src/staden/nip.f
@@ -0,0 +1,1023 @@
+C   NIP Nucleotide interpretation program
+C
+C   author: Rodger Staden, Medical Research Council Centre,
+C           Laboratory of Molecular Biology, Hills Road,
+C           Cambridge, England
+C     12-1-90 Closed idev if.ne.kbout after rdseq
+C     16-3-90 Removed escape after bpause
+C     5-4-90  Changed getdev to redir and removed all close(unit=idev)
+C             etc (see redir)
+C     6-7-90  Added showfi
+C     9-7-90  Renamed amenu to menu
+C    14-8-90  Modified 6 phase translation
+C    22-8-90  Modified 6 phase translation so it will do all listings
+C    24-9-90  Fixed bug in graphics output for pattern search
+C    5-11-90  Very many changes for addition of file of file names
+C             search for patterns - affects patternn,patternnc,nipl,
+C             nip,anals89,asubs89 (rdwmt)
+C   11-12-90  Chabges relating to new way of handling pir libs. New filnll 
+C             here and new parameter
+C   11-1-91  Modified positional base preferences to allow standard to be
+C            read form a file: call from main has extra arguments
+C   22-2-91  For find open frames and trandk send maxseq instead of idim3
+C            in odrer to give sufficient space for 6 phase translation
+C  17-4-91  New splice junction search
+C  7-5-91  Many changes to do with use of feature tables
+C          including the argument lists to: codons, codtdk, trandk, opentr, pltmap,
+C          codimp. Also forced out use of option 38 by caling 39 instead
+C 7-6-91 New sequence library routines for cdrom format. PIR stuff junked
+C 18-7-91 trndom did not write title!
+C 18-7-91 Added titles to pattern files
+C 25-2-92 Changed call to rdseq !!!!!!!!
+C 8-5-92 upped maxseq to 330000
+C 17-6-92 Declared SIGNAL as an external as there's an intrinsic with
+C         the same name under AIX
+      SUBROUTINE FMAIN()
+      INTEGER BOTOPT,TOPOPT
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) FILE1,FILE2,FILE3,FILE4,FILE5,FILE6,FILE7
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILMAR,FILNAM,FILEIN
+      CHARACTER*(NAMLEN) FOFNAM
+      CHARACTER*(NAMLEN) LIBLF
+      PARAMETER (BOTOPT=-10,TOPOPT=70,
+     +           MAXSEQ=330000,
+     +           MXSPAN=603,
+     +           MAXWIN=MAXSEQ+MXSPAN,
+     +           MAXWIR=330000,
+     +           MAXD48=MAXWIR/48,
+     +           MAXD2=MAXWIR/2,
+     +           MAXD3=MAXWIR/3,
+     +           MAXSD2=MAXSEQ/2,
+     +           MAXSD3=MAXSEQ/3,
+     +           MAXMEN=-8,
+     +           MAXOPT=67,
+     +           MAXDEV=9,
+     +           MXWTLN = 120,
+     +           IDM = 5,
+     +           IDME = 17)
+      PARAMETER (FILE1='EUKRIBS',
+     +             FILE2='INTRONS',
+     +             FILE3='ECPROMS',
+     +             FILE4='ECRIBS',
+     +             FILE5='RENZYM4',
+     +             FILE6='RENZYM6',
+     +             FILE7='RENZYMAL',
+     +             HELPF='NIPHELP',
+     +             POINTF='NIPHPNT',
+     +             FILMAR='NIPMARG',
+     +             LIBLF = 'SEQUENCELIBRARIES')
+      PARAMETER (MAXMOT = 50, MAXWTS = 4000)
+      CHARACTER*(NAMLEN) NAMSAV(MAXMOT)
+      CHARACTER*8 KEYNS(MAXMOT)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT),DEVNOS(MAXDEV)
+      REAL WORKR(MAXWIR)
+      INTEGER WORKI(MAXWIR),OPT,MARGB(MAXOPT),MARGT(MAXOPT)
+      PARAMETER (MAXWRD = 6)
+      PARAMETER (MAXDIC = 4**MAXWRD)
+      INTEGER WORDP(MAXDIC),WORDN(MAXDIC)
+      PARAMETER (MAXNZ1 = MAXSEQ/10,
+     +           MAXEN  = 1000,
+     +           MAXEN8 = MAXEN*8)
+      INTEGER ENZYM1(MAXNZ1),ENZYM2(MAXNZ1),ENZYM3(MAXNZ1)
+      INTEGER ENZYME(MAXEN8),ENZYM4(MAXNZ1)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      CHARACTER SEQ(MAXWIN),SEQW(MAXSEQ),PAA(5,5,5),PAAS(5,5,5)
+      CHARACTER CHRSET(IDM)
+      EQUIVALENCE (WORKR,WORKI)
+      EQUIVALENCE (ENZYME,WORDN)
+C   EXTERNALS
+      EXTERNAL SIGNAL
+      DATA CHRSET/'T','C','A','G','-'/
+      DATA MAT1/
+     +            1,0,0,0,0,
+     +            0,1,0,0,0,
+     +            0,0,1,0,0,
+     +            0,0,0,1,0,
+     +            0,0,0,0,0/
+      DATA MAT2/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+      DATA MAT3/
+     +     36, 0, 0, 0, 9, 0,18,18, 0, 0,18,12,12, 0,12, 9, 0,
+     +      0,36, 0, 0, 9, 0,18, 0,18,18, 0,12,12,12, 0, 9, 0,
+     +      0, 0,36, 0, 9,18, 0,18, 0,18, 0,12, 0,12,12, 9, 0,
+     +      0, 0, 0,36, 9,18, 0, 0,18, 0,18, 0,12,12,12, 9, 0,
+     +      9, 9, 9, 9,36,18,18,18,18,18,18,27,27,27,27,36, 0,
+     +      0, 0,18,18,18,36, 0, 9, 9, 9, 9, 6, 6,12,12,18, 0,
+     +     18,18, 0, 0,18, 0,36, 9, 9, 9, 9,12,12, 6, 6,18, 0,
+     +     18, 0,18, 0,18, 9, 9,36, 0, 9, 9,12, 6, 6,12,18, 0,
+     +      0,18, 0,18,18, 9, 9, 0,36, 9, 9, 6,12,12, 6,18, 0,
+     +      0,18,18, 0,18, 9, 9, 9, 9,36, 0,12, 6,12, 6,18, 0,
+     +     18, 0, 0,18,18, 9, 9, 9, 9, 0,36, 6,12, 6,12,18, 0,
+     +     12,12,12, 0,27, 6,12,12, 6,12, 6,36, 8, 8, 8,27, 0,
+     +     12,12, 0,12,27, 6,12, 6,12, 6,12, 8,36, 8, 8,27, 0,
+     +      0,12,12,12,27,12, 6, 6,12,12, 6, 8, 8,36, 8,27, 0,
+     +     12, 0,12,12,27,12, 6,12, 6, 6,12, 8, 8, 8,36,27, 0,
+     +      9, 9, 9, 9,36,18,18,18,18,18,18,27,27,27,27,36, 0,
+     +      0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0/
+      DATA MAT4/
+     +            0,0,2,1,0,
+     +            0,0,0,2,0,
+     +            2,0,0,0,0,
+     +            1,2,0,0,0,
+     +            0,0,0,0,0/
+      DATA PAAS/'F','F','L','L','-','S','S','S','S','S',
+     1'Y','Y','*','*','-','C','C','*','W','-',
+     1'-','-','-','-','-','L','L','L','L','L',
+     1'P','P','P','P','P','H','H','Q','Q','-',
+     1'R','R','R','R','R','-','-','-','-','-','I','I','I','M','-',
+     1'T','T','T','T','T',
+     1'N','N','K','K','-','S','S','R','R','-','-','-','-','-','-',
+     1'V','V','V','V','V','A','A','A','A','A','D','D','E','E','-',
+     1'G','G','G','G','G',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-'/
+      DATA PAA/'F','F','L','L','-','S','S','S','S','S',
+     1'Y','Y','*','*','-','C','C','*','W','-',
+     1'-','-','-','-','-','L','L','L','L','L',
+     1'P','P','P','P','P','H','H','Q','Q','-',
+     1'R','R','R','R','R','-','-','-','-','-','I','I','I','M','-',
+     1'T','T','T','T','T',
+     1'N','N','K','K','-','S','S','R','R','-','-','-','-','-','-',
+     1'V','V','V','V','V','A','A','A','A','A','D','D','E','E','-',
+     1'G','G','G','G','G',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-'/
+C   Initialise help
+      CALL INTHLP('nip', TOPOPT)
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(3))
+      IFORNO = 0
+      LIBNO = 1
+      IGORT = 0
+C      CALL CLEARV
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' NIP (Nucleotide interpretation program) V7.0 July 1992',/,
+     +' Author: Rodger Staden'/)
+C   READ IN THE POINTERS TO THE HELP FILE
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(4),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+      IOK=0
+      CALL INITLU(IDM)
+      CALL SETPAR(IOK)
+      IF(IOK.NE.0) GO TO 9999
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +MAXOPT,DEVNOS(1),FILMAR)
+      IDEV=KBOUT
+      MOPT=0
+      IDIMT = 0
+2     CONTINUE
+      IDEVLL = DEVNOS(5)
+      IDEVEN = DEVNOS(6)
+      IDEVAN = DEVNOS(7)
+      IDEVDL = DEVNOS(8)
+      IDEVLF = DEVNOS(9)
+      CALL RDSEQ(
+     +SEQ(2+MXSPAN/2),MAXSEQ,IDIMT,J1,J2,ISTART,IEND,IDIM1,IDIMB,
+     +DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +HELPS(3),HELPE(3),HELPF,DEVNOS(4),IDEV,IFORNO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBNO,LIBLF,WORKI,MAXWIR,IOK)
+C  SAVE FILE NAME
+      FILEIN=FILNAM
+      IF(IOK.NE.0)GO TO 1
+      CALL SHOWFI(KBOUT,FILEIN)
+C   GIVE COMPOSITION AS A CHECK
+      IF(IDIMB.GT.0)CALL BCOMP(SEQ(2+MXSPAN/2),IDIMB,J1,J2,ISTART,
+     +KBOUT)
+C   set pointers to sequence: all this stuff relates to buffering sequences
+C   in disk files, which is no longer done. As it was such a pain to put in
+C   and may become useful again it is being left, although newer functions
+C   cannot be sure of conforming.
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMT  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTART = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQ(1)
+C  J1     = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  J2     = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIM1  = J2-J1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IEND   = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMB  = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+********
+      IDIM1P=IDIMB+MXSPAN
+C   need array size divisible by 3 for translation
+      IDIM3=3*((IDIMB+3)/3)
+1     CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      IF(IOK.NE.0)GO TO 9999
+C   give menu, get option
+C
+      CALL MENU(OPT,KOPT,MOPT,MAXOPT,MAXMEN,KBIN,KBOUT,
+     +HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+C
+C   change region
+C
+      IF((OPT.EQ.4).AND.(IDIMT.GT.0))THEN
+C        CALL REDEFA
+C     +  (SEQ(2+MXSPAN/2),IDIMT,J1,J2,MAXSEQ,IDIM1,ISTART,IEND,IDIMB,
+C     +  DEVNOS(5),KBIN,KBOUT,
+C     +  IOK,SEQW,LREC,NREC,
+C     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),MACTYP,MACSHT,MACLNG)
+        CALL GTREG(KBIN,KBOUT,ISTART,IEND,J1,J2,
+     +  'Define active region',
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IDIM1 = J2 - J1 + 1
+        GO TO 1
+      END IF
+C
+C   STOP
+C
+      IF(OPT.EQ.2)GO TO 9999
+C
+C   LIST
+C
+      IF((OPT.EQ.5).AND.(IDIMB.GT.0))THEN
+        CALL LSTSEQ(SEQ(2+MXSPAN/2),IDIMB,IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),ISTART,IEND,IDIMT,KOPT)
+        GO TO 1
+      END IF
+C
+C   codon usage method to find genes
+C
+      IF((OPT.EQ.42).AND.(IDIM1.GT.0))THEN
+        CALL PCODUS(
+     +  SEQ,IDIM1P,IEND,MXSPAN,
+     +  ISXMAX,ISYMAX,J1,J2,ISTART,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA)
+        GO TO 1
+      END IF
+C
+C   fickett
+C
+      IF((OPT.EQ.48).AND.(IDIM1.GT.0)) THEN
+        CALL FICKET(SEQ,IDIM1P,IDIM1,MXSPAN,
+     +  ISXMAX,ISYMAX,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   dinucfreq
+C
+      IF((OPT.EQ.22).AND.(IDIM1.GT.0))THEN
+        CALL DINUCF(SEQ(2+MXSPAN/2),IDIMB,
+     +  J1-ISTART+1,J2-ISTART+1,IDEV,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   BASE COMPOSITION
+C
+      IF((OPT.EQ.21).AND.(IDIM1.GT.0))THEN
+        CALL BCOMP(SEQ(2+MXSPAN/2),IDIMB,
+     +  J1,J2,ISTART,IDEV)
+        GO TO 1
+      END IF
+C
+C   WRITE OUT ACTIVE SEQUENCE
+C
+      IF((OPT.EQ.8).AND.(IDIM1.GT.0))THEN
+        CALL WRTACT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  SEQ(J1+1-ISTART+1+MXSPAN/2),IDIM1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C  ruler
+C
+      IF((OPT.EQ.12).AND.(IDIM1.GT.0))THEN
+        CALL RULER(J1,J2,MARGL,MARGR,
+     +  MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   composition
+C
+      IF((OPT.EQ.24).AND.(IDIM1.GT.0))THEN
+        CALL COMPN(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   composition chi
+C
+      IF((OPT.EQ.25).AND.(IDIM1.GT.0))THEN
+        CALL CCHI(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IEND,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),WORKR,KOPT)
+        GO TO 1
+      END IF
+C
+C   dichi
+C
+      IF((OPT.EQ.26).AND.(IDIM1.GT.0))THEN
+        CALL DICHI(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IEND,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),WORKR,KOPT)
+        GO TO 1
+      END IF
+C
+C   trichi
+C
+      IF((OPT.EQ.27).AND.(IDIM1.GT.0))THEN
+        CALL TRICHI(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IEND,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),WORKR,KOPT)
+        GO TO 1
+      END IF
+C
+C   negentropy
+C
+      IF((OPT.EQ.59).AND.(IDIM1.GT.0))THEN
+        CALL NEGENT(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IEND,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),WORKR,KOPT)
+        GO TO 1
+      END IF
+C
+C   ribosomes (prokaryotic)
+C
+      IF((OPT.EQ.58).AND.(IDIM1.GT.0))THEN
+        CALL PRIBS(SEQ(2+MXSPAN/2),IDIMB,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILE4,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   clear all
+C
+      IF(OPT.EQ.10) THEN
+        CALL CLEARG
+        GO TO 1
+      END IF
+C
+C   xhairs
+C
+      IF((OPT.EQ.13).AND.(IDIM1.GT.0))THEN
+        XMAX=J2
+        XMIN=J1
+        YMAX=ISYMAX
+        YMIN=0.
+        IIIIX=0
+        IIIIY=0    
+        CALL CLEARV
+        CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,IIIIX,IIIIY,N,KBOUT,
+     +  SEQ(2+MXSPAN/2),ISTART,IDIMB,
+     +  SEQ(2+MXSPAN/2),ISTART,IDIMB,1)
+        GO TO 1
+      END IF
+C
+C   clear vt100
+C
+      IF(OPT.EQ.11)THEN
+        CALL CLEARV
+        GO TO 1
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(OPT.EQ.14)THEN
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(OPT),HELPE(OPT),MAXOPT,HELPF,DEVNOS(4),KBIN,KBOUT)
+        GO TO 1
+      END IF
+C
+C   reverse
+C
+      IF((OPT.EQ.29).AND.(IDIMB.GT.0))THEN
+        WRITE(KBOUT,4001)J1,J2
+4001    FORMAT(' Reverse and complement sequence from',I7,' to',I7)
+        CALL SQREV(SEQ(2+(MXSPAN/2)-ISTART+J1),IDIM1)
+        CALL SQCOM(SEQ(2+(MXSPAN/2)-ISTART+J1),IDIM1)
+        GO TO 1
+      END IF
+C
+C   translation list
+C
+C      IF((OPT.EQ.38).AND.(IDIMB.GT.0))THEN
+C        CALL TRAN3(SEQ(2+MXSPAN/2),IDIMB,SEQW,IDIM3,
+C     +  IDEV,DEVNOS(1),FILNAM,KBIN,KBOUT,
+C     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),ISTART,IEND,IDIMT,
+C     +  PAA)
+C        GO TO 1
+C      END IF
+C
+C   Analyse frequencies of repeats
+C
+      IF((OPT.EQ.38).AND.(IDIM1.GT.0))THEN
+        CALL SQTREE(SEQ(J1-ISTART+2+MXSPAN/2),WORKI,IDIM1,
+     +  WORDP,MAXDIC,WORKI(IDIM1+1),MAXWIR-IDIM1+1,
+     +  ENZYME,MAXEN8,IDM,ISTART,
+     +  IDEV,KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   translation to disk
+C
+      IF((OPT.EQ.40).AND.(IDIM1.GT.0))THEN
+        CALL TRANDK(SEQ(2+MXSPAN/2),IDIMB,SEQW,MAXSEQ,
+     +  ISTART,IEND,IDIMT,
+     +  DEVNOS(1),DEVNOS(8),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,J1,J2,
+     +  WORKI,MAXSEQ)
+        GO TO 1
+      END IF
+C
+C   longest open frame
+C
+      IF((OPT.EQ.54).AND.(IDIM1.GT.0))THEN
+        CALL OPENFR(SEQ(2+MXSPAN/2),IDIMB,J1,J2,ISTART,
+     +  IDEV,KBIN,KBOUT,PAA,
+     +  SEQW,MAXSEQ,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   codons and base composition and molecular weights
+C
+      IF((OPT.EQ.23).AND.(IDIMB.GT.0))THEN
+        CALL CODONS(SEQ(2+MXSPAN/2),IDIMB,J1,J2,IDEV,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,WORKI,MAXSEQ,SEQW)
+        GO TO 1
+      END IF
+C
+C   shepherd
+C
+      IF((OPT.EQ.47).AND.(IDIM1.GT.0)) THEN
+        CALL SHEPED(
+     +  SEQ,IDIM1P,IDIM1,MXSPAN,
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,MARGL,MARGR,
+     +  MARGB(OPT),MARGT(OPT),KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,KOPT)
+        GO TO 1
+      END IF
+C
+C   positional base preferences for average amino acid composition
+C
+      IF((OPT.EQ.43).AND.(IDIM1.GT.0)) THEN
+        CALL PBASEP(
+     +  SEQ,IDIM1P,MXSPAN,
+     +  ISXMAX,ISYMAX,J1,J2,ISTART,IEND,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,DEVNOS(1),
+     +  FILNAM,KOPT)
+        GO TO 1
+      END IF
+C
+C   hairpins
+C
+      IF((OPT.EQ.30).AND.(IDIM1.GT.0))THEN
+        CALL HAIRPN(SEQ(2+MXSPAN/2),IDIMB,WORKI(1),
+     +  WORKI(1+MAXD3),
+     +  WORKI(1+2*MAXD3),MAXD3,MARGL,MARGR,MARGB(OPT),
+     +  MARGT(OPT),ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  ISTART,IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   plot map
+C
+      IF((OPT.EQ.16).AND.(IDIM1.GT.0))THEN
+        CALL PLTMAQ(DEVNOS(1),FILNAM,WORKI,MAXSEQ,MARGL,MARGR,
+     +  MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   reading frames using codon improbability expecting codon frequencies
+C   to depend on base composition
+C
+      IF((OPT.EQ.45).AND.(IDIM1.GT.0)) THEN
+        CALL IMPBC(
+     +  SEQ,IDIM1P,IDIM1,MXSPAN,WORKR,
+     +  IDIM1P,ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),
+     +  MARGT(OPT),KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,KOPT)
+        GO TO 1
+      END IF
+C
+C   new file
+C
+      IF(OPT.EQ.3)GO TO 2
+C
+C   SPLICE JUNCTIONS
+C
+      IF((OPT.EQ.62).AND.(IDIM1.GT.0)) THEN
+C        CALL SPLIC1(
+C     +  SEQ(2+MXSPAN/2),IDIMB,
+C     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+C     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+C     +  DEVNOS(1),
+C     +  FILE2,KBOUT)
+C        GO TO 1
+        CALL FSPLIC(
+     +  SEQ(2+MXSPAN/2),IDIMB,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILE2,FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,
+     +  WORKI,WORKR(1+MXWTLN*IDM),WORKR(1+2*MXWTLN*IDM),
+     +  WORKI(1+3*MXWTLN*IDM),WORKI(1+3*MXWTLN*IDM+MXWTLN),
+     +  WORKI(1+3*MXWTLN*IDM+2*MXWTLN),
+     +  WORKI(1+3*MXWTLN*IDM+3*MXWTLN),
+     +  WORKI(1+3*MXWTLN*IDM+4*MXWTLN),
+     +  IDM,MXWTLN,SEQW,CHRSET,KOPT)
+        GO TO 1
+      END IF
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C
+C   SPLICE JUNCTIONS AG,GT NOT OBLIGATORY
+C
+C      IF((OPT.EQ.63).AND.(IDIM1.GT.0))CALL SPLIC2(
+C     +SEQ(2+MXSPAN/2),IDIMB,
+C     1ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+C     +MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+C     +DEVNOS(1),FILE2,KBOUT)
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C   SPLICE JUNCTIONS AG,GT REMOVED
+C
+C      IF((OPT.EQ.64).AND.(IDIM1.GT.0))CALL SPLIC3(
+C     +SEQ(2+MXSPAN/2),IDIMB,
+C     1ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+C     +MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+C     +DEVNOS(1),FILE2,KBOUT)
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C
+C   signal searches from named plot files
+C
+      IF((OPT.EQ.63).AND.(IDIM1.GT.0))THEN
+        CALL SIGNAL(
+     +  SEQ(2+MXSPAN/2),IDIMB,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,
+     +  WORKI,WORKR(1+MXWTLN*IDM),WORKI(1+2*MXWTLN*IDM),
+     +  IDM,MXWTLN,SEQW,CHRSET,KOPT,2)
+        GO TO 1
+      END IF
+C
+C  PLOT WORD FREQ/ EXPECTED WORD FREQ
+C
+      IF((OPT.EQ.64).AND.(IDIM1.GT.0))THEN
+        CALL WORDFP(SEQ,IDIM1P,IDIMB,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IEND,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   PROKARYOTIC PROMOTERS
+C
+      IF((OPT.EQ.55).AND.(IDIM1.GT.0)) THEN
+        CALL PPROM1(
+     +  SEQ(2+MXSPAN/2),IDIMB,
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  DEVNOS(1),FILE3,KBOUT)
+        GO TO 1
+      END IF
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C
+C   PROKARYOTIC PROMOTERS ON COMPLEMENTARY STRAND
+C
+      IF((OPT.EQ.56).AND.(IDIM1.GT.0)) THEN
+        CALL PPROM3(
+     +  SEQ(2+MXSPAN/2),IDIMB,
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  DEVNOS(1),FILE3,KBOUT)
+        GO TO 1
+      END IF
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C
+C   PROKARYOTIC PROMOTERS -35 AND -10 SEPARATELY
+C
+      IF((OPT.EQ.57).AND.(IDIM1.GT.0)) THEN
+        CALL PPROM2(
+     +  SEQ(2+MXSPAN/2),IDIMB,
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  DEVNOS(1),FILE3,KBOUT)
+        GO TO 1
+      END IF
+C     +HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C
+C   ribosomes (eukaryotic)
+C
+      IF((OPT.EQ.61).AND.(IDIM1.GT.0)) THEN
+        CALL RIBEUK(
+     +  SEQ(2+MXSPAN/2),IDIMB,
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  DEVNOS(1),FILE1,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   polyA
+C
+      IF((OPT.EQ.65).AND.(IDIM1.GT.0)) THEN
+        CALL SRCHP(
+     +  SEQ(2+MXSPAN/2),
+     +  IDIMB,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,KBOUT)
+        GO TO 1
+      END IF
+C
+C  type text
+C
+      IF(OPT.EQ.6) THEN
+        CALL TTEXT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   help
+C
+      IF(OPT.EQ.1) THEN
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,HELPF,DEVNOS(4),
+     +  KBIN,KBOUT)
+        GO TO 1
+      END IF
+C
+C   start codons
+C
+      IF((OPT.EQ.50).AND.(IDIM1.GT.0))THEN
+        CALL CLEARV
+        CALL STARTS(SEQ(2+MXSPAN/2),IDIMB,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,PAA)
+        GO TO 1
+      END IF
+C
+C   stop codons
+C
+      IF((OPT.EQ.51).AND.(IDIM1.GT.0))THEN
+        CALL CLEARV
+        CALL STOPS(SEQ(2+MXSPAN/2),IDIMB,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,PAA)
+        GO TO 1
+      END IF
+C
+C   stop codons complementary
+C
+      IF((OPT.EQ.52).AND.(IDIM1.GT.0))THEN
+        CALL CLEARV
+        CALL STOPSC(SEQ(2+MXSPAN/2),IDIMB,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,PAA)
+        GO TO 1
+      END IF
+C
+C   stop codons complementary and normal
+C
+      IF((OPT.EQ.53).AND.(IDIM1.GT.0))THEN
+        CALL CLEARV
+        CALL STOPSB(SEQ(2+MXSPAN/2),IDIMB,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,PAA)
+        GO TO 1
+      END IF
+C
+C   codon improbability
+C
+      IF((OPT.EQ.28).AND.(IDIMB.GT.0))THEN
+        CALL CODIMP(SEQ(2+MXSPAN/2),IDIMB,WORKR,IDIMB,DEVNOS(1),
+     +  FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),J1,J2,IDEV,
+     =  WORKI(101),MAXSEQ-101,SEQW)
+        GO TO 1
+      END IF
+C
+C   REPEATS
+C
+      IF((OPT.EQ.32).AND.(IDIM1.GT.0))THEN
+        IF(IDIM1.LE.MAXWIR)THEN
+          CALL PREPT1(
+     +    SEQ(2+MXSPAN/2),WORKI,IDIMB,J1,J2,ISTART,
+     +    MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,
+     +    HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,KOPT,
+     +    WORDP,WORDN,MAXWRD,MAXDIC)
+        ELSE
+          WRITE(KBOUT,1001)MAXWIR
+1001      FORMAT(' Maximum length of active region for this option=',
+     +    I7)
+        END IF
+        GO TO 1
+      END IF
+C
+C   codons to disk
+C
+      IF((OPT.EQ.41).AND.(IDIMB.GT.0)) THEN
+        CALL CODTDK(
+     +  SEQ(2+MXSPAN/2),IDIMB,J1,J2,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,WORKI,MAXSEQ,SEQW)
+        GO TO 1
+      END IF
+C
+C   zdna
+C
+      IF((OPT.EQ.33).AND.(IDIM1.GT.0))THEN
+        CALL ZDNA(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   zdna runs of RY or YR
+C
+      IF((OPT.EQ.34).AND.(IDIM1.GT.0)) THEN
+        CALL ZDNARN(
+     +  SEQ(2+MXSPAN/2),IDIMB,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   zdnaROD
+C
+      IF((OPT.EQ.35).AND.(IDIM1.GT.0))THEN
+        CALL ZDNARD(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   base phasing due to coding
+C
+      IF((OPT.EQ.44).AND.(IDIM1.GT.0)) THEN
+        CALL BPHASE(
+     +  SEQ,IDIM1P,IDIM1,MXSPAN,
+     +  ISXMAX,ISYMAX,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   trna
+C
+      IF((OPT.EQ.49).AND.(IDIM1.GT.0)) THEN
+        CALL TRNA(SEQ(2+MXSPAN/2),IDIMB,J1-ISTART+1,J2-ISTART+1,
+     +  ISTART,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  REPEATS
+C
+      IF((OPT.EQ.31).AND.(IDIM1.GT.0))THEN
+        IF(IDIM1.LE.MAXWIR)THEN
+          CALL PREPT2(
+     +    SEQ(2+MXSPAN/2),WORKI,IDIMB,J1,J2,ISTART,
+     +    MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,
+     +    HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,KOPT,
+     +    WORDP,WORDN,MAXWRD,MAXDIC)
+        ELSE
+          WRITE(KBOUT,1001)MAXWIR
+        END IF
+        GO TO 1
+      END IF
+C
+C   reading frame search using codon improbability expecting even codons per
+C   acid
+C
+      IF((OPT.EQ.46).AND.(IDIM1.GT.0)) THEN
+        CALL IMPAC(
+     +  SEQ,IDIM1P,IDIM1,MXSPAN,WORKR,
+     +  IDIM1P,ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,KOPT)
+        GO TO 1
+      END IF
+C
+C   new search
+C
+      IF((OPT.EQ.17).AND.(IDIM1.GT.0)) THEN
+        KOPT = 1
+        CALL SEARCH(SEQ(2+MXSPAN/2),IDIMB,J1,J2,ISTART,SEQW,MAXSD3,
+     +  SEQW(1+MAXSD3),MAXSD3,
+     +  SEQW(1+2*MAXSD3),MAXSD3,
+     +  ENZYM1,ENZYM2,ENZYM3,ENZYM4,MAXNZ1,
+     +  ENZYME,ENZYME(1+MAXEN),ENZYME(1+2*MAXEN),ENZYME(1+3*MAXEN),
+     +  ENZYME(1+4*MAXEN),ENZYME(1+5*MAXEN),ENZYME(1+6*MAXEN),
+     +  ENZYME(1+7*MAXEN),MAXEN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,IDEV,
+     +  DEVNOS(1),FILNAM,FILE5,FILE6,FILE7,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),PAA,WORKI,WORDP,MAXDIC,
+     +  KOPT)
+        GO TO 1
+      END IF
+C
+C    Translate and list 0-6 phases
+C
+      IF((OPT.EQ.39).AND.(IDIM1.GT.0))THEN
+        CALL OPENTR(SEQ(2+MXSPAN/2),IDIMB,ISTART,
+     +  IDEV,KBIN,KBOUT,PAA,
+     +  SEQW,MAXSEQ,WORKI,WORKI(IDIMB+1),MAXSEQ-IDIMB,
+     +  DEVNOS(1),FILNAM,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   GET DISK OUTPUT FILE ON UNIT DEVNOS(2) IF REQUIRED
+C
+      IF(OPT.EQ.7)THEN
+        CALL REDIR(IDEV,DEVNOS(2),DEVNOS(3),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   signal searches from named plot files
+C
+      IF((OPT.EQ.20).AND.(IDIM1.GT.0))THEN
+        CALL SIGNAL(
+     +  SEQ(2+MXSPAN/2),IDIMB,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,
+     +  WORKI,WORKR(1+MXWTLN*IDM),WORKI(1+2*MXWTLN*IDM),
+     +  IDM,MXWTLN,SEQW,CHRSET,KOPT,1)
+        GO TO 1
+      END IF
+C
+C   SIGNAL SEARCH ON DINUCLEOTIDES
+C
+      IF((OPT.EQ.60).AND.(IDIM1.GT.0)) THEN
+        CALL DISIG(
+     +  SEQ(2+MXSPAN/2),IDIMB,MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,
+     +  WORKI,WORKR(1+MXWTLN*IDM*IDM),WORKI(1+2*MXWTLN*IDM*IDM),
+     +  IDM,MXWTLN,SEQW,CHRSET,KOPT)
+        GO TO 1
+      END IF
+C
+C  SEQFIT
+C
+      IF((OPT.EQ.18).AND.(IDIM1.GT.0))THEN
+        CALL SEQFIT(SEQ(2+MXSPAN/2),IDIMB,SEQW,MAXSEQ,
+     +  WORKI,WORKI(1+MAXD2),MAXD2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IDEV,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  SEPFIT
+C
+      IF((OPT.EQ.19).AND.(IDIM1.GT.0))THEN
+        CALL SEPFIT(SEQ(2+MXSPAN/2),IDIMB,SEQW,MAXSEQ,
+     +  WORKI,WORKI(1+MAXD2),MAXD2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1,J2,
+     +  ISTART,IDEV,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,MAT3,IDME,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  LOCAL SIMILARITY OR COMPLEMENTARITY
+C
+      IF((OPT.EQ.36).AND.(IDIM1.GT.0))THEN
+        CALL LOCALF(SEQ(2+MXSPAN/2),IDIMB,SEQW,SEQW(MAXSD2+1),MAXSD2,
+     +  WORKI,WORKI(1+MAXD2),MAXD2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1,J2,ISTART,
+     +  IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),DIALOG)
+        GO TO 1
+      END IF
+C
+C  SET GENETIC CODE
+C
+      IF(OPT.EQ.37) THEN
+        CALL SETGEN(PAAS,PAA,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C  WRITE LABELS
+C
+      IF(OPT.EQ.15) THEN
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C  CHANGE DNA TO RNA OR RNA TO DNA
+C
+      IF((OPT.EQ.66).AND.(IDIM1.GT.0))THEN
+        WRITE(KBOUT,1002)ISTART,IEND
+1002    FORMAT(' Exchange T and U for the region',I7,' TO',I7)
+        CALL DNARNA(SEQ(2+MXSPAN/2),IDIMB)
+        GO TO 1
+      END IF
+C
+C  SHOW SETTINGS
+C
+      IF(OPT.EQ.9)THEN
+        WRITE(KBOUT,4000)FILEIN,J1,J2,IDIM1
+4000    FORMAT(' Sequence=',A,/,' Start=',I7,' End=',I7,
+     +' LENGTH=',I7)
+C       THIS NEXT LINE IS BECAUSE SOMETIMES THE FILE IS NOT READY TO EDIT!
+        CLOSE(UNIT=DEVNOS(2))
+        CALL SEQEDT(SEQ(2+MXSPAN/2),MAXSEQ,IDIMB,KBIN,
+     +  KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),DEVNOS(1),
+     +  FILNAM,FILEIN,IDM,IOK)
+        IF(IOK.NE.0) GO TO 1
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMT  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTART = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQ(1)
+C  J1     = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  J2     = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIM1  = J2-J1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IEND   = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMB  = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+C  IDIM3  = A BUFFER SIZE USED BY TRANSLATION ROUTINE
+C  THIS IS A FUDGE AND ONLY WORKS FOR UNBUFFERED SEQUENCES!!!!!!!!!!!
+        J1 = 1
+        J2 = IDIMB
+        IDIMT = IDIMB
+        IEND = IDIMB
+        IDIM1 = IDIMB
+        IDIM3=3*((IDIMB+3)/3)
+        WRITE(KBOUT,4000)FILEIN,J1,J2,IDIM1
+        GO TO 1
+      END IF
+C
+C  PATTERN SEARCH
+C
+      IF((OPT.EQ.67).AND.(IDIM1.GT.0))THEN
+        CALL PATTEN(SEQ(J1-ISTART+2+MXSPAN/2),IDIM1,SEQW,MAXSEQ,
+     +  WORKI(1),WORKI(MAXMOT+1),WORKI(2*MAXMOT+1),WORKI(3*MAXMOT+1),
+     +  WORKI(4*MAXMOT+1),WORKI(5*MAXMOT+1),WORKI(6*MAXMOT+1),
+     +  WORKI(7*MAXMOT+1),WORKI(8*MAXMOT+1),WORKI(9*MAXMOT+1),
+     +  WORKI(10*MAXMOT+1),WORKI(11*MAXMOT+1),WORKI(12*MAXMOT+1),
+     +  WORKI(13*MAXMOT+1),WORKI(14*MAXMOT+1),WORKI(15*MAXMOT+1),
+     +  WORKI(16*MAXMOT+1),WORKI(17*MAXMOT+1),WORKI(18*MAXMOT+1),
+     +  WORKI(19*MAXMOT+1),WORKI(20*MAXMOT+1),
+     +  WORKR(22*MAXMOT+1),WORKR(23*MAXMOT+1),
+     +  FILNAM,MAXMOT,MAXWTS,IDEV,DEVNOS(6),DEVNOS(7),
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1,
+     +  KBIN,KBOUT,DEVNOS(8),IDM,SEQ(1),IDME,
+     +  MAT1,MAT2,MAT3,MAT4,NAMSAV,KEYNS,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),FOFNAM,DEVNOS(1))
+C  NB IVE SENT THE START OF SEQ TO BE USED IN PATTEN. THIS IS OK
+C  AS LONG AS MAXMOT<MXSPAN/2, AND NO MOTIF REQUIRES TO START TO
+C  THE LEFT OF THE FIRST REAL SEQUENCE ELEMENT
+        GO TO 1
+      END IF
+      GO TO 1
+C
+9999  CONTINUE
+      CALL SHUTD
+      END
diff --git a/src/staden/nipf.f b/src/staden/nipf.f
new file mode 100644
index 0000000..505fe4f
--- /dev/null
+++ b/src/staden/nipf.f
@@ -0,0 +1,2110 @@
+C NIPF
+      SUBROUTINE FMAIN()
+      PARAMETER (MAXSEQ=400000,MAXFIL=200,MAXCOM=20000)
+      CHARACTER SEQ(MAXSEQ),PAA(5,5,5)
+      INTEGER BOTOPT,TOPOPT
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILMAR,FILNAM,NAMES(MAXFIL)
+      INTEGER STARTS(MAXFIL),ST(MAXFIL),LE(MAXFIL)
+      INTEGER SUMCOD(4096),MTT(64,3)
+      REAL MT(4,4),ET(4,4)
+      INTEGER COUNTS(MAXCOM),POINTS(MAXCOM),TOTCOU,SUMCS,DIFFER(MAXSEQ)
+      REAL WORKR(MAXSEQ)
+      PARAMETER (BOTOPT=-10,TOPOPT=70,
+     +           MAXMEN=-3,
+     +           MAXOPT=39,
+     +           MAXDEV=9,
+     +           IDM = 5)
+      PARAMETER (
+     +             HELPF='NIPFHELP',
+     +             POINTF='NIPFHPNT',
+     +             FILMAR='NIPFMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT),DEVNOS(MAXDEV)
+      INTEGER OPT,MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER BATCH,AVALL,FVREST,ON,OFF,SORT,GRAPH,BCHANG(5,5)
+      PARAMETER (AVALL = 1, FVREST = 2, OFF = 1, ON = 2)
+      CHARACTER CHRSET(IDM),ACID,CANACD,TRANF
+      PARAMETER (MAXPRM = 28)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      INTEGER AMINOP,FACID(22),FBASE(5),CTONUM,BCHAMG(5,5,3)
+      EXTERNAL IMAXA,IMINA,TRANF,VARIM,AMINOP,FINF,CTONUM,RMAXA,RMINA
+      EQUIVALENCE (WORKR,DIFFER)
+      DATA CHRSET/'T','C','A','G','-'/
+      DATA PAA/'F','F','L','L','-','S','S','S','S','S',
+     1'Y','Y','*','*','-','C','C','*','W','-',
+     1'-','-','-','-','-','L','L','L','L','L',
+     1'P','P','P','P','P','H','H','Q','Q','-',
+     1'R','R','R','R','R','-','-','-','-','-','I','I','I','M','-',
+     1'T','T','T','T','T',
+     1'N','N','K','K','-','S','S','R','R','-','-','-','-','-','-',
+     1'V','V','V','V','V','A','A','A','A','A','D','D','E','E','-',
+     1'G','G','G','G','G',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-',
+     1'-','-','-','-','-','-','-','-','-','-'/
+C   Initialise help
+      CALL INTHLP('nipf', TOPOPT)
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(3))
+      IGORT = 0
+      MTCALL = 0
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' NIPF  v1.0 Copyright Rodger Staden'/)
+C   READ IN THE POINTERS TO THE HELP FILE
+      IDEVH = DEVNOS(4)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,IDEVH,KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+      IOK=0
+      CALL INITLU(IDM)
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +MAXOPT,DEVNOS(1),FILMAR)
+      IDEV=KBOUT
+      MOPT=0
+      IDIMT = 0
+      CALL EMT(MTT)
+C
+C
+1     CONTINUE
+C
+C
+C      IF(IOK.NE.0)GO TO 9999
+C   give menu, get option
+C
+      IDEV=KBOUT
+      LINLEN=60
+      PROMPT(1) = 'Use file of file names'
+      PROMPT(2) = 'Type in a pair of file names'
+      IB = 1
+      CALL RADION('Select input mode',PROMPT,2,IB,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IB.LT.1) GO TO 999
+      IF(IB.EQ.1) THEN
+        CALL GETSQS(DEVNOS(1),DEVNOS(2),SEQ,MAXSEQ,
+     +  STARTS,NAMES,
+     +  FILNAM,NFILE,MAXFIL,MAXNAM,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        SORT = ON
+        BATCH = FVREST
+      ELSE
+C
+C Read in the two sequences
+C   
+        CALL TGETF(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  SEQ,MAXSEQ,STARTS,NAMES,MAXFIL,MAXNAM,NFILE,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        SORT = OFF
+        BATCH = FVREST
+      END IF
+      WRITE(KBOUT,*)'Working with ',NFILE,' files'
+C
+C   SET DEFAULTS FOR ACTIVE REGION
+C
+C ST(I) contains the active region start for sequence I relative to its start
+C LE(I) contains the active region end of sequence I
+C STARTS(I) contains the start for sequence I in seq
+C
+      DO 2 I=1,NFILE
+        ST(I) = 1
+        LE(I) = STARTS(I+1) - STARTS(I)
+ 2      CONTINUE
+        LONGS = IMAXA(ST,NFILE)
+        LONGE = IMINA(LE,NFILE)
+      I1=1
+      N1=IDIM1
+      I2=1
+      N2=IDIM2
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+      IDIML=MIN(IDIM3,IDIM4)
+      BATCH = FVREST
+ 100  CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL MENU(OPT,KOPT,MOPT,MAXOPT,MAXMEN,KBIN,KBOUT,
+     +HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+C      IF(0.EQ.0) GO TO 99
+C
+C stop
+C
+      IF(OPT.EQ.2) GO TO 999
+C
+C  type text
+C
+      IF(OPT.EQ.6) THEN
+        CALL TTEXT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 100
+      END IF
+C
+C   help
+C
+      IF(OPT.EQ.1) THEN
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,HELPF,DEVNOS(4),
+     +  KBIN,KBOUT)
+        GO TO 100
+      END IF
+C
+C   clear all
+C
+      IF(OPT.EQ.10) THEN
+        CALL CLEARG
+        GO TO 100
+      END IF
+C
+C  ruler
+C
+      IF(OPT.EQ.12)THEN
+        CALL RULER(LONGS,LONGE,MARGL,MARGR,
+     +  MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 100
+      END IF
+C
+C   xhairs
+C
+      IF(OPT.EQ.13) THEN
+        XMAX = LONGE
+        XMIN = LONGS
+        YMAX=ISYMAX
+        YMIN=0.
+        IIIIX=0
+        IIIIY=0    
+        CALL CLEARV
+        CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,IIIIX,IIIIY,N,KBOUT,
+     +  SEQ(LONGS),LONGS,LONGE-LONGS+1,
+     +  SEQ(LONGS),LONGS,LONGE-LONGS+1,1)
+        GO TO 100
+      END IF
+C
+C   clear vt100
+C
+      IF(OPT.EQ.11)THEN
+        CALL CLEARV
+        GO TO 100
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(OPT.EQ.14)THEN
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(OPT),HELPE(OPT),MAXOPT,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 100
+      END IF
+C
+C  WRITE LABELS
+C
+      IF(OPT.EQ.15) THEN
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 100
+      END IF
+C
+C   GET DISK OUTPUT FILE ON UNIT DEVNOS(2) IF REQUIRED
+C
+      IF(OPT.EQ.7)THEN
+        CALL REDIR(IDEV,DEVNOS(2),DEVNOS(3),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,IDEVH,KOPT)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.5)THEN
+        LINLEN=60
+        CALL LISTN(SEQ,MAXSEQ,STARTS,NFILE,LINLEN,LONGS,LONGE,
+     +  IDEV,KBOUT,NAMES,MAXFIL)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.32)THEN
+        LINLEN=60
+        CALL LISTND(SEQ,MAXSEQ,STARTS,NFILE,LINLEN,LONGS,LONGE,
+     +  IDEV,KBOUT,NAMES,MAXFIL)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.9)THEN
+        LINLEN=60
+        CALL LISTNT(SEQ,MAXSEQ,STARTS,NFILE,LINLEN,LONGS,LONGE,
+     +  IDEV,KBOUT,NAMES,MAXFIL,PAA)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.33)THEN
+        LINLEN=60
+        CALL LISTNV(SEQ,MAXSEQ,STARTS,NFILE,LINLEN,LONGS,LONGE,
+     +  IDEV,KBOUT,NAMES,MAXFIL,PAA)
+        GO TO 100
+      END IF
+      IF ((OPT.EQ.37).AND.(NFILE.EQ.2)) THEN
+        LINLEN=60
+        CALL TRAN(SEQ(STARTS(1)),MAXSEQ,ST(1),LE(1),
+     +  SEQ(STARTS(2)),MAXSEQ,ST(2),LE(2),
+     +  KBIN,KBOUT,IDEV,LINLEN,PAA)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.4)THEN
+        ISTART = 1
+        IEND = STARTS(2) - 1
+        DO 7 I=1,NFILE
+          CALL GTREG(KBIN,KBOUT,ISTART,IEND,ST(I),LE(I),
+     +    'Define active region',
+     +    HELPS(OPT),HELPE(OPT),HELPF,IDEVH,IOK)
+          ST(I+1) = ST(I)
+          LE(I+1) = LE(I)
+          IF(IOK.NE.0) GO TO 100
+ 7        CONTINUE
+C
+C Get max values
+C
+        LONGS = IMAXA(ST,NFILE)
+        LONGE = IMINA(LE,NFILE)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.17) THEN
+        PROMPT(1) = 'All versus all'
+        PROMPT(2) = 'First versus rest'
+        IB = BATCH
+        CALL RADION('Select comparison mode',PROMPT,2,IB,
+     +  HELPS(OPT),HELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF(IB.LT.1) GO TO 100
+        BATCH = IB
+        GO TO 100
+      END IF
+      IF(OPT.EQ.18) THEN
+        PROMPT(ON) = 'Sort on'
+        PROMPT(OFF) = 'Sort off'
+        IB = ON
+        CALL RADION('Select sort mode',PROMPT,2,IB,
+     +  HELPS(OPT),HELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF(IB.LT.1) GO TO 100
+        SORT = IB
+        GO TO 100
+      END IF
+C
+C start of batch processing
+C
+      ICOUNT = 0
+      TOTCOU = 0
+      SUMCS = 0
+C
+C need to allow different batch comparison modes
+C
+      IF(BATCH.EQ.AVALL) THEN
+        KFILES = 1
+        KFILEE = NFILE-1
+        NEEDS = (NFILES-1)*NFILES/2
+        IF (NEEDS.GT.MAXCOM) THEN
+          WRITE(*,*)'Sorry unsufficient space for this comparison'
+          GO TO 100
+        END IF
+      ELSE IF(BATCH.EQ.FVREST) THEN
+        KFILES = 1
+        KFILEE = KFILES
+      ELSE
+        WRITE(*,*)'Batch mode not set!!'
+        GO TO 100
+      END IF
+      IF((OPT.EQ.24).OR.(OPT.EQ.35).OR.(OPT.EQ.36).OR.(OPT.EQ.39)) THEN
+        DO 111 I=1,5
+          DO 111 J=1,5
+            BCHANG(I,J) = 0
+            DO 111 K = 1,3
+             BCHAMG(I,J,K) = 0
+ 111        CONTINUE
+      END IF
+      IF(OPT.EQ.38) THEN
+        CALL INITMT(MT,SUMCOD,MTCALL)
+        MTCALL = MTCALL + 1
+      END IF
+      GRAPH = -1
+      IF((OPT.EQ.25).OR.(OPT.EQ.26).OR.(OPT.EQ.27).OR.(OPT.EQ.31)
+     +.OR.(OPT.EQ.34)) THEN
+        PROMPT(1) = 'List results'
+        PROMPT(2) = 'Plot results'
+        GRAPH = ON
+        CALL RADION('Select output mode',PROMPT,2,GRAPH,
+     +  HELPS(OPT),HELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        IF(GRAPH.LT.1) GO TO 100
+        CALL FILLI(DIFFER,LONGE-LONGS+1,0)
+      END IF
+      IF(OPT.EQ.31) THEN
+        IPOSM = (LONGE-LONGS+1)/3
+        CALL FILLR(WORKR,IPOSM,0.0)
+        IW = 0
+        DO 600 IPOS = LONGS,LONGS-1+IPOSM*3,3
+          CALL FILLI(FACID,22,0)
+          DO 500 IFILE = KFILES,KFILEE
+            CANACD = TRANF(SEQ(STARTS(IFILE)+IPOS-1),PAA)
+            DO 450 JFILE = IFILE+1,NFILE
+              ACID = TRANF(SEQ(STARTS(JFILE)+IPOS-1),PAA)
+              IF(CANACD.NE.ACID) 
+     +        FACID(AMINOP(ACID)) = FACID(AMINOP(ACID)) + 1
+ 450          CONTINUE
+ 500          CONTINUE
+            IW = IW + 1
+            WORKR(IW) = VARIM(FACID)
+ 600      CONTINUE
+          IF(GRAPH.EQ.ON) THEN
+            XMIN = 1.
+            XMAX = REAL(LONGE - LONGS + 1)/3.
+            YMIN = 0.
+            YMAX = RMAXA(WORKR,IPOSM)
+            CALL GETRL(0.,9999.,YMAX,'Maximum Y value for plotting',
+     +      VAL,KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,IDEVH,IOK)
+            IF(IOK.NE.0) GO TO 100
+            YMAX = VAL
+            CALL PLTR(WORKR,IPOSM,
+     +      XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX)
+            GO TO 100
+          END IF
+          IF(GRAPH.EQ.OFF) THEN
+            WRITE(IDEV,6667)
+     +      (LONGS+3*(KKKK-1),WORKR(KKKK),KKKK=1,IPOSM)
+ 6667       FORMAT(I6,F10.3)
+            GO TO 100
+          END IF
+        END IF
+      IF(OPT.EQ.34) THEN
+        IPOSM = LONGE-LONGS+1
+        CALL FILLR(WORKR,IPOSM,0.0)
+        IW = 0
+        DO 700 IPOS = LONGS,LONGE
+          CALL FILLI(FBASE,5,0)
+          DO 650 JFILE = 1,NFILE
+C            WRITE(*,*)JFILE
+            I = CTONUM(SEQ(STARTS(JFILE)+IPOS-1))
+            FBASE(I) = FBASE(I) + 1
+ 650      CONTINUE
+            IW = IW + 1
+            WORKR(IW) = FINF(FBASE)
+ 700      CONTINUE
+          IF(GRAPH.EQ.ON) THEN
+            XMIN = 1.
+            XMAX = REAL(LONGE - LONGS + 1)
+            YMIN = RMINA(WORKR,IPOSM)
+            YMAX = RMAXA(WORKR,IPOSM)
+            CALL GETRL(-9999.,9999.,YMAX,'Maximum Y value for plotting',
+     +      VAL,KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,IDEVH,IOK)
+            IF(IOK.NE.0) GO TO 100
+            YMAX = VAL
+            CALL PLTR(WORKR,IPOSM,
+     +      XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX)
+            GO TO 100
+          END IF
+          IF(GRAPH.EQ.OFF) THEN
+            WRITE(IDEV,6667)
+     +      (LONGS+KKKK-1,WORKR(KKKK),KKKK=1,IPOSM)
+            GO TO 100
+          END IF
+        END IF
+      DO 300 IFILE = KFILES,KFILEE
+        DO 200 JFILE = IFILE+1,NFILE
+          IF(OPT.EQ.21)THEN
+C
+C           MISMAT NUMBER OF BASE DIFFERENCES
+C
+            CALL BSCHN1(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV)
+         END IF
+          IF(OPT.EQ.22)THEN
+C
+C           CODON CHANGES
+C
+            CALL COCHAN(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      ICONSA,ISAME,ICONSC,MISMAT,IDEV,SUMCOD)
+C           CONSERVED ACIDS             ICONSA=KSUM
+C           CONSERVATIVE CHANGES        ICONSC=SUMC
+C           UNCHANGED CODONS            ISAME=ISUM
+            SUMCS = SUMCS + ICONSC
+          END IF
+          IF(OPT.EQ.23)THEN
+C           NUMBER OF GENETIC EVENTS
+            CALL BSCHN2(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV)
+          END IF
+          IF(OPT.EQ.24)THEN
+C   MISMAT NUMBER OF BASE DIFFERENCES
+            CALL BSCHN3(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),BCHANG)
+          END IF
+          IF(OPT.EQ.35)THEN
+C   MISMAT NUMBER OF BASE DIFFERENCES
+            CALL BSCHM3(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),BCHAMG)
+          END IF
+          IF(OPT.EQ.3000)THEN
+C   MISMAT NUMBER OF BASE DIFFERENCES
+            CALL BSCHN5(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV,PAA)
+          END IF
+          IF(OPT.EQ.28)THEN
+C   MISMAT NUMBER OF expressed BASE DIFFERENCES
+            CALL BSCHN7(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV,PAA)
+          END IF
+          IF(OPT.EQ.29)THEN
+C   MISMAT NUMBER OF silent BASE DIFFERENCES
+            CALL BSCHN8(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV,PAA)
+        END IF
+          IF(OPT.EQ.30)THEN
+C   MISMAT NUMBER OF Amino acid DIFFERENCES
+            CALL BSCHN9(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      MISMAT,IDEV,PAA)
+        END IF
+C
+C   GET DIFFERENCES IN DIFFER
+C
+          IF(OPT.EQ.25)THEN
+            CALL BSCHN4(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      DIFFER)
+          END IF
+C
+C   GET DIFFERENCES IN DIFFER (accumulated counts of bases that change the
+C   amino acid)
+C
+          IF(OPT.EQ.26)THEN
+            CALL COCHN1(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      DIFFER,PAA)
+          END IF
+C
+C   estimate mutation rate from observed silent changes
+C
+          IF(OPT.EQ.38)THEN
+            CALL COCHN4(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      SUMCOD)
+          END IF
+C
+C   GET DIFFERENCES IN DIFFER (accumulated counts of bases that dont change the
+C   amino acid)
+C
+          IF(OPT.EQ.27)THEN
+            CALL COCHN2(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      DIFFER,PAA)
+          END IF
+          IF(OPT.EQ.36)THEN
+            CALL COCHN3(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      BCHANG,PAA)
+          END IF
+          IF(OPT.EQ.39)THEN
+            CALL COCHN5(
+     +      SEQ(STARTS(IFILE)),STARTS(IFILE+1)-STARTS(IFILE),
+     +      ST(IFILE),LE(IFILE),
+     +      SEQ(STARTS(JFILE)),STARTS(JFILE+1)-STARTS(JFILE),
+     +      ST(JFILE),LE(JFILE),
+     +      BCHANG,PAA)
+          END IF
+          ICOUNT = ICOUNT + 1
+          COUNTS(ICOUNT) = MISMAT
+          POINTS(ICOUNT) = ICOUNT
+          TOTCOU = TOTCOU + MISMAT
+ 200    CONTINUE
+ 300  CONTINUE
+      IF(OPT.EQ.38) THEN
+C        CALL UPDET(SUMCOD,PAA,MTT)
+        CALL UPDE(MT,ET,MTT,SUMCOD,PAA)
+        WRITE(IDEV,1010)(CHRSET(K),K=1,4)
+        DO 323 I=1,4
+          WRITE(IDEV,1012)CHRSET(I),(MT(I,K),K=1,4)
+ 323   CONTINUE
+ 1012  FORMAT(' ',A1,'   ',4(F5.4,1X))
+        GO TO 100
+      END IF
+      IF(GRAPH.EQ.ON) THEN
+        XMIN = 1.
+        XMAX = LONGE - LONGS + 1
+        YMIN = 0.
+        YMAX = IMAXA(DIFFER,LONGE-LONGS+1)
+        IYMAX = YMAX
+        CALL GETINT(0,9999,IYMAX,'Maximum Y value for plotting',
+     +  IVAL,KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 100
+        YMAX = IVAL
+        CALL PLTI(DIFFER,LONGE-LONGS+1,
+     +  XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX)
+        GO TO 100
+      END IF
+      IF(GRAPH.EQ.OFF) THEN
+        WRITE(IDEV,6666)
+     +  (KKKK+LONGS-1,DIFFER(KKKK),KKKK=1,LONGE-LONGS+1)
+6666  FORMAT(2I6)
+C        CLOSE(UNIT=DEVNOS(1))
+        GO TO 100
+      END IF
+      IF(GRAPH.EQ.ON) THEN
+        XMIN = 1.
+        XMAX = LONGE - LONGS + 1
+        YMIN = 0.
+        YMAX = IMAXA(DIFFER,LONGE-LONGS+1)
+        IYMAX = YMAX
+        CALL GETINT(0,9999,IYMAX,'Maximum Y value for plotting',
+     +  IVAL,KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 100
+        YMAX = IVAL
+        CALL PLTI(DIFFER,LONGE-LONGS+1,
+     +  XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX)
+        GO TO 100
+      END IF
+      IF(GRAPH.EQ.OFF) THEN
+        WRITE(IDEV,6666)
+     +  (KKKK+LONGS-1,DIFFER(KKKK),KKKK=1,LONGE-LONGS+1)
+        GO TO 100
+      END IF
+      IF(OPT.EQ.24) THEN
+        WRITE(IDEV,1010)CHRSET
+ 1010   FORMAT('     ',5(2X,A1,3X))
+        DO 320 I=1,5
+          WRITE(IDEV,1011)CHRSET(I),(BCHANG(I,K),K=1,5)
+ 1011     FORMAT(' ',A1,5I6)
+ 320      CONTINUE
+          CALL KIMURA(BCHANG,DIST)
+          WRITE(*,*)'Kimura evolutionary distance=',DIST
+        GO TO 100
+      END IF
+      IF(OPT.EQ.35) THEN
+        DO 333 IP=1,3
+        WRITE(IDEV,1010)CHRSET
+        DO 321 I=1,5
+          WRITE(IDEV,1011)CHRSET(I),(BCHAMG(I,K,IP),K=1,5)
+ 321   CONTINUE
+ 333      CONTINUE
+        GO TO 100
+      END IF
+      IF(OPT.EQ.36) THEN
+        WRITE(IDEV,1010)CHRSET
+        DO 322 I=1,5
+          WRITE(IDEV,1011)CHRSET(I),(BCHANG(I,K),K=1,5)
+ 322   CONTINUE
+       GO TO 100
+      END IF
+      IF(OPT.EQ.39) THEN
+        WRITE(IDEV,1010)CHRSET
+        DO 324 I=1,5
+          WRITE(IDEV,1011)CHRSET(I),(BCHANG(I,K),K=1,5)
+ 324   CONTINUE
+       GO TO 100
+      END IF
+      IF(SORT.EQ.OFF) GO TO 100
+      CALL BUB2AS(COUNTS,POINTS,ICOUNT)
+      WRITE(IDEV,*)'                Sorted list'
+      DO 400 II=1,ICOUNT
+        N = II
+        CALL IANDJ(NFILE,POINTS(N),I,J)
+        WRITE(IDEV,1015)NAMES(I)(1:20),NAMES(J)(1:20),COUNTS(II)
+1015    FORMAT(' ',A,' ',A,' distance',I6)
+400   CONTINUE
+      WRITE(IDEV,1020)TOTCOU
+1020  FORMAT(' Total distance ',I6)
+      IF(OPT.EQ.22)WRITE(IDEV,1021)SUMCS 
+1021    FORMAT(' Total conservative changes',I6)
+      GO TO 100
+ 999  CONTINUE
+      CALL SHUTD
+      END
+      REAL FUNCTION VARIM(FACID)
+      INTEGER FACID(22),IMAXA
+      EXTERNAL IMAXA
+C Variability defined (as kabat):
+C
+C  Number of different acids / Freq of most common acid
+C
+      NACIDS = 0
+      TACIDS = 0.
+      DO 10 K = 1,22
+        TACIDS = TACIDS + FACID(K)
+        IF(FACID(K).NE.0) NACIDS = NACIDS + 1
+10    CONTINUE
+      VARIM = 0.
+      IF(TACIDS.GT.0.) THEN
+        FREQC = REAL(IMAXA(FACID,22)) / TACIDS
+        VARIM = REAL(NACIDS) / FREQC
+      END IF
+      END
+      INTEGER FUNCTION AMINOP(ACID)
+      CHARACTER ALLONE*22,ACID
+      SAVE ALLONE
+      DATA ALLONE/'ACDEFGHIKLMNPQRSTVWY*-'/
+      AMINOP = 22
+      DO 10 I=1,21
+        IF(ACID.EQ.ALLONE(I:I)) THEN
+          AMINOP = I
+          RETURN
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE TGETF(IDEV,FILNAM,KBIN,KBOUT,
+     +SEQ1,MAXSEQ,STARTS,NAMES,MAXFIL,MAXNAM,NFILE,IOK,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER SEQ1(MAXSEQ),FILNAM*(*),HELPF*(*)
+      CHARACTER*(*) NAMES(MAXFIL)
+      INTEGER STARTS(MAXFIL)
+C
+C Routine to read in two sequences
+C
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of first sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      IDIM1 = MAXSEQ
+      CALL ARRFIL(IDEV,SEQ1,IDIM1,KBOUT)
+      CLOSE(UNIT=IDEV)
+      NAMES(1) = FILNAM
+      STARTS(1) = 1
+      FILNAM = ' '
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of second sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      STARTS(2) = IDIM1 + 1
+      IDIM1 = MAXSEQ - IDIM1
+      CALL ARRFIL(IDEV,SEQ1(STARTS(2)),IDIM1,KBOUT)
+      CLOSE(UNIT=IDEV)
+      NAMES(2) = FILNAM
+      STARTS(3) = STARTS(2) + IDIM1
+      NFILE = 2
+      END
+      SUBROUTINE GETSQS(IDEV1,IDEV2,SEQ,MAXSEQ,STARTS,NAMES,
+     +FILNAM,NFILE,MAXFIL,MAXNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER SEQ(MAXSEQ),HELPF*(*)
+      CHARACTER*(*) FILNAM,NAMES(MAXFIL)
+      INTEGER STARTS(MAXFIL)
+      NFILE = 0
+      IEND = 0
+      FILNAM = ' '
+      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+     +'File of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+2     CONTINUE
+      IF (NFILE.LT.MAXFIL) THEN
+        READ(IDEV1,1000,END=3)FILNAM
+ 1000   FORMAT(A)
+        NFILE = NFILE + 1
+        STARTS(NFILE) = IEND + 1
+        NAMES(NFILE) = FILNAM
+        IDIM2 = MAXSEQ - IEND
+        IF (IDIM2.GT.0) THEN
+          CALL OPENRS(IDEV2,FILNAM,IOK,LRECL,2)
+          CALL ARRFIL(IDEV2,SEQ(STARTS(NFILE)),IDIM2,KBOUT)
+          CLOSE(UNIT=IDEV2)
+          IEND = IEND + IDIM2
+          GO TO 2
+        END IF
+C
+C if we get here weve run out of array space
+C
+        NFILE = NFILE - 1
+        WRITE(KBOUT,*)'Only ',NFILE,' files read: no more memory'
+        CLOSE(UNIT=IDEV1)
+        RETURN
+      ELSE
+C
+C if we get here weve got too many files
+C
+        NFILE = NFILE - 1
+        CLOSE(UNIT=IDEV1)
+        WRITE(KBOUT,*)'Maximum files ',NFILE,' read'
+        RETURN
+      END IF
+ 3    CONTINUE
+      STARTS(NFILE+1) = IEND + 1
+      CLOSE(UNIT=IDEV1)
+      END
+      SUBROUTINE LISTN(SEQ,MAXCHR,LENSEQ,NFILE,LINLEN,I1,I2,
+     +IDEV,KBOUT,NAMES,MAXFIL)
+      CHARACTER*(*) NAMES(MAXFIL)
+      PARAMETER (MAXLIN = 120)
+      CHARACTER SEQ(MAXCHR),LINE*(MAXLIN)
+      INTEGER LENSEQ(MAXFIL)
+      EXTERNAL NOTIRL
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2-I1+1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+(I2-I1+1)/LINLEN
+      IF(MOD(I2-I1+1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+9,ISTART+LINLEN-1,10)
+        DO 40 J=1,NFILE
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+LINLEN,LENSEQ(J+1)) - 1
+          LINE(1:) = ' '
+          WRITE(LINE,1003,ERR=60)(SEQ(K),K=KF,KT)
+1003      FORMAT(' ',60A1)
+          LINE(KT-KF+4:) = NAMES(J)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+ 1004     FORMAT(A)
+1006      FORMAT(' ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT( )
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+      SUBROUTINE LISTND(SEQ,MAXCHR,LENSEQ,NFILE,LINLEN,I1,I2,
+     +IDEV,KBOUT,NAMES,MAXFIL)
+      CHARACTER*(*) NAMES(MAXFIL)
+      PARAMETER (MAXLIN = 120)
+      CHARACTER SEQ(MAXCHR),LINE*(MAXLIN)
+      INTEGER LENSEQ(MAXFIL),CTONUM
+      EXTERNAL NOTIRL,CTONUM
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2-I1+1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+(I2-I1+1)/LINLEN
+      IF(MOD(I2-I1+1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+9,ISTART+LINLEN-1,10)
+          KF = ISTART + LENSEQ(1) - 1
+          KT = MIN(KF+LINLEN,LENSEQ(2)) - 1
+          WRITE(LINE,1003,ERR=60)(SEQ(K),K=KF,KT)
+1003      FORMAT(' ',60A1)
+          LINE(KT-KF+4:) = NAMES(1)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+        DO 40 J=2,NFILE
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+LINLEN,LENSEQ(J+1)) - 1
+          KSF = ISTART + LENSEQ(1) - 2
+          LINE(1:) = ' '
+          KKL = 1
+          DO 35 KK = KF,KT
+            KKL = KKL + 1
+            KSF = KSF + 1
+            IF(CTONUM(SEQ(KK)).EQ.CTONUM(SEQ(KSF))) THEN
+              LINE(KKL:KKL) = '-'
+            ELSE
+              LINE(KKL:KKL) = SEQ(KK)
+            END IF
+ 35         CONTINUE
+          LINE(KT-KF+4:) = NAMES(J)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+ 1004     FORMAT(A)
+1006      FORMAT(' ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT( )
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+      SUBROUTINE LISTNT(SEQ,MAXCHR,LENSEQ,NFILE,LINLIN,I1,I2,
+     +IDEV,KBOUT,NAMES,MAXFIL,PAA)
+      CHARACTER*(*) NAMES(MAXFIL)
+      PARAMETER (MAXLIN = 120)
+      CHARACTER SEQ(MAXCHR),LINE*(MAXLIN),PAA*(*),TRANF
+      INTEGER LENSEQ(MAXFIL)
+      EXTERNAL NOTIRL,TRANF
+C
+C line length is linlen acids which needs 3*linlen bases
+C
+      LINLEN = LINLIN
+      I2MI1 = I2 - I1 + 1
+C number of whole codons
+      I2MI1 = I2MI1/3
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2MI1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+I2MI1/LINLEN
+      IF(MOD(I2MI1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-3*LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+3*LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+29,ISTART+3*LINLEN-1,30)
+        DO 40 J=1,NFILE
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+3*LINLEN,LENSEQ(J+1)) - 1
+          LINE(1:) = ' '
+          LL = KF - 3
+          DO 30 L = 1,LINLEN
+            LL = LL + 3
+            LINE(L:L) = TRANF(SEQ(LL),PAA)
+ 30         CONTINUE
+          LINE(1+(KT-KF+4)/3:) = NAMES(J)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+ 1004     FORMAT(' ',A)
+1006      FORMAT(' ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT( )
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+      SUBROUTINE LISTNV(SEQ,MAXCHR,LENSEQ,NFILE,LINLIN,I1,I2,
+     +IDEV,KBOUT,NAMES,MAXFIL,PAA)
+      CHARACTER*(*) NAMES(MAXFIL)
+      PARAMETER (MAXLIN = 120)
+      CHARACTER SEQ(MAXCHR),LINE*(MAXLIN),PAA*(*),TRANF
+      INTEGER LENSEQ(MAXFIL)
+      EXTERNAL NOTIRL,TRANF
+C
+C line length is linlen acids which needs 3*linlen bases
+C
+      LINLEN = LINLIN
+      I2MI1 = I2 - I1 + 1
+C number of whole codons
+      I2MI1 = I2MI1/3
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2MI1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+I2MI1/LINLEN
+      IF(MOD(I2MI1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-3*LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+3*LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+29,ISTART+3*LINLEN-1,30)
+          KF = ISTART + LENSEQ(1) - 1
+          KT = MIN(KF+3*LINLEN,LENSEQ(2)) - 1
+          LINE(1:) = ' '
+          LL = KF - 3
+          DO 20 L = 1,LINLEN
+            LL = LL + 3
+            LINE(L:L) = TRANF(SEQ(LL),PAA)
+ 20         CONTINUE
+          LINE(1+(KT-KF+4)/3:) = NAMES(1)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+        DO 40 J=2,NFILE
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+3*LINLEN,LENSEQ(J+1)) - 1
+          KFS = ISTART + LENSEQ(1) - 1
+          LINE(1:) = ' '
+          LL = KF - 3
+          LLS = KFS - 3
+          DO 30 L = 1,LINLEN
+            LL = LL + 3
+            LLS = LLS + 3
+            IF(TRANF(SEQ(LLS),PAA).EQ.TRANF(SEQ(LL),PAA)) THEN
+              LINE(L:L) = '-'
+            ELSE
+              LINE(L:L) = TRANF(SEQ(LL),PAA)
+            END IF
+ 30         CONTINUE
+          LINE(1+(KT-KF+4)/3:) = NAMES(J)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+ 1004     FORMAT(' ',A)
+1006      FORMAT(' ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT( )
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+      SUBROUTINE IANDJ(N,L,I,J)
+C   ROUTINE, GIVEN N FILES AND ELEMENT L FROM LIST
+C   CALC ROW I AND COL J NOS ASSUMING L LIES IN TOP
+C   RIGHT CORNER OF MATRIX
+      I=1
+      J1=0
+      K=0
+10    CONTINUE
+      J1=J1+1
+      J=J1
+11    CONTINUE
+C   REACHED ELEMENT L?
+      IF(K.EQ.L)RETURN
+      J=J+1
+      K=K+1
+C   REACHED END OF ROW?
+      IF(J.LE.N)GO TO 11
+C  YES, POINT TO NEX ROW
+      I=I+1
+      K=K-1
+      GO TO 10
+      END
+      INTEGER FUNCTION IMAXA(I,N)
+      INTEGER I(N)
+      IMAXA = I(1)
+      DO 10 J=2,N
+        IMAXA = MAX(IMAXA,I(J))
+ 10     CONTINUE
+      END
+      INTEGER FUNCTION IMINA(I,N)
+      INTEGER I(N)
+      IMINA = I(1)
+      DO 10 J=2,N
+        IMINA = MIN(IMINA,I(J))
+ 10     CONTINUE
+      END
+      SUBROUTINE PLTI(IY,NY,
+     +XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      INTEGER IY(NY)
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      YF = YMIN
+      DO 100 I=1,NY
+        XF = I
+        YT = IY(I)
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+100   CONTINUE
+      CALL VT100M
+      END
+      REAL FUNCTION RMAXA(I,N)
+      REAL I(N)
+      RMAXA = I(1)
+      DO 10 J=2,N
+        RMAXA = MAX(RMAXA,I(J))
+ 10     CONTINUE
+      END
+      REAL FUNCTION RMINA(I,N)
+      REAL I(N)
+      RMINA = I(1)
+      DO 10 J=2,N
+        RMINA = MIN(RMINA,I(J))
+ 10     CONTINUE
+      END
+      SUBROUTINE PLTR(IY,NY,
+     +XMAX,XMIN,YMAX,YMIN,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      REAL IY(NY)
+      CALL VECTOM
+      CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+      YF = YMIN
+      DO 100 I=1,NY
+        XF = I
+        YT = IY(I)
+        CALL LINE(XF,XF,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+100   CONTINUE
+      CALL VT100M
+      END
+      SUBROUTINE COCHAN(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,
+     +ICONSA,ISAME,ICONSC,MISMAT,IDEV,SUM)
+      INTEGER SJ(6)
+      INTEGER SUM(4096),SAME(77),SUMC
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM1)
+      CHARACTER AA(18)
+      EXTERNAL JCODNO
+      DATA AA/'F','L','I','V','S','P','T','A','Y','H','Q','N',
+     +'K','D','E','C','R','G'/
+      DATA SAME/2,1,2,6,3,4,17,18,19,20,3,33,34,35,4,49,50,51,
+     +52,6,5,6,7,8,45,46,4,21,22,23,24,4,37,38,39,40,4,53,54,
+     +55,56,2,9,10,2,25,26,2,27,28,2,41,42,2,43,44,2,57,58,2,
+     +59,60,2,13,14,6,29,30,31,32,47,48,4,61,62,63,64/
+C   CONSERVED ACIDS             ICONSA=KSUM
+C   CONSERVATIVE CHANGES        ICONSC=SUMC
+C   UNCHANGED CODONS            ISAME=ISUM
+      DO 55 I=1,4096
+        SUM(I)=0
+55    CONTINUE
+C
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF CODONS
+      MINL=MINL/3
+      MAXL=MAX(IDIM3,IDIM4)
+      MAXL=MAXL/3
+      J=I1
+      K=I2
+C
+C   MAIN LOOP
+C
+      DO 100 I=1,MINL
+        IP1=JCODNO(SEQ1,IDIM1,J)
+        IF(IP1.EQ.0)GO TO 199
+        IP2=JCODNO(SEQ2,IDIM2,K)
+        IF(IP2.EQ.0)GO TO 199
+        IP=IP1*64+IP2-64
+        SUM(IP)=SUM(IP)+1
+199     CONTINUE
+        J=J+3
+        K=K+3
+100   CONTINUE
+C
+C   SUM THOSE CONSERVED
+C
+      ISUM=0
+      SUMC=0
+      IK1=2
+C
+C   LOOP FOR EACH AMINO ACID WITH MORE THAN 1 CODON
+C
+      DO 120 K=1,18
+        NUM=SAME(IK1-1)
+C
+C       FIRST ELEMENT IN SAME IS THE NUMBER OF CODONS FOR THIS AMINO ACID
+C       SUCCEEDING ELEMENTS ARE THE CODON NUMBERS
+C       LOOP FOR EACH CODON
+C
+        DO 110 I=1,NUM
+          IP=SAME(IK1+I-1)
+          NP=(IP-1)*64
+C
+C         LOOP FOR EACH CODON CALCULATING POINTERS TO EACH CHANGE NOT
+C         AFFECTING AMINO ACID
+C
+          DO 105 J=1,NUM
+            MP=SAME(IK1+J-1)
+            JP=NP+MP
+            SJ(J)=SUM(JP)
+105       CONTINUE
+          DO 104 N=1,NUM
+            IF(N.EQ.I)ISUM=ISUM+SJ(N)
+            SUMC=SUMC+SJ(N)
+            SJ(N)=0
+104       CONTINUE
+110     CONTINUE
+        IK1=IK1+NUM+1
+120   CONTINUE
+C   ADD MET AND TRP CODONS TO THOSE CONSERVING ACID
+      KSUM=SUMC+SUM(2276)+SUM(976)
+      FKSUM=100.*FLOAT(KSUM)/FLOAT(MAXL)
+      WRITE(IDEV,1005)KSUM,FKSUM
+ 1005 FORMAT(' Number of conserved amino acids ',I6,' = ',F6.2,'%')
+C1005  FORMAT(' NUMBER OF CONSERVED AMINO ACIDS=',I6,', AS PERCENT='
+C     1,F6.2)
+C
+C   SUBTRACT CONSERVED CODONS FROM THOSE CONSERVING AMINO ACID
+      SUMC=SUMC-ISUM
+      FSUMC=100.*FLOAT(SUMC)/FLOAT(MAXL)
+      WRITE(IDEV,1001)SUMC,FSUMC
+ 1001 FORMAT(' Number of conservative codon changes ',I6,' = ',F6.2,'%')
+C1001  FORMAT(' NUMBER OF CONSERVATIVE CODON CHANGES=',I6,', AS',
+C     1' PERCENT=',
+C     1F6.2)
+C   ADD MET AND TRP TO UNCHANGED CODONS
+        ISUM=ISUM+SUM(2276)+SUM(976)
+      FISUM=100.*FLOAT(ISUM)/FLOAT(MAXL)
+      WRITE(IDEV,1004)ISUM,FISUM
+ 1004 FORMAT(' Number of unchanged codons ',I6,' = ',F6.2,'%')
+C1004  FORMAT(' NUMBER OF UNCHANGED CODONS=',I6,', AS PERCENT=',F6.2)
+C
+C   CONSERVED ACIDS
+      ICONSA=KSUM
+C   CONSERVATIVE CHANGES
+      ICONSC=SUMC
+C   UNCHANGED CODONS
+      ISAME=ISUM
+C   CODONS CHANGING AMINO ACID
+      MISMAT=MAXL-ICONSA
+      RETURN
+      END
+      SUBROUTINE BSCHN2(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     1IDEV)
+C   THIS ROUTINE COUNTS GENETIC EVENTS AND 1 BASE INSERTION
+C   EQUALS N BASE INSERTION
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF BASES
+      MAXL=MAX(IDIM3,IDIM4)
+      MISMAT=0
+      J=I2-1
+      IDASH1=0
+      IDASH2=0
+      DO 5 I=I1,I1+MINL-1
+        J=J+1
+        IF(CTONUM(SEQ1(I)).EQ.5)IDASH1=IDASH1+1
+        IF(CTONUM(SEQ1(I)).NE.5)IDASH1=0
+        IF(CTONUM(SEQ2(J)).EQ.5)IDASH2=IDASH2+1
+        IF(CTONUM(SEQ2(J)).NE.5)IDASH2=0
+        IF(SEQ1(I).EQ.SEQ2(J))GO TO 5
+        MISMAT=MISMAT+1
+        IF((IDASH1.GT.1).OR.(IDASH2.GT.1))MISMAT=MISMAT-1
+5     CONTINUE
+C cantor and jukes
+      P = REAL(MISMAT)/REAL(MAXL)
+      D = -0.75*LOG(1.0 - 4.0*P/3.0)
+      V = P*(1.0-P)/(REAL(MAXL)*(1.0-4.0*P/3.0)**2)
+      CONS=MINL-MISMAT
+      CONS=CONS*100./MAXL
+      WRITE(IDEV,1001)MISMAT,CONS
+ 1001 FORMAT(' Number of events ',I6,' = ',F6.2,'%')
+C1001  FORMAT(' NUMBER OF EVENTS=',I6,' PERCENTAGE CONSERVATION=',
+C     1F6.2)
+      WRITE(IDEV,1002)D,V
+ 1002 FORMAT(' Expected number of substitutions per site and variance '
+     +,F10.5,F10.7)
+C1002  FORMAT(
+C     +' JC SUBSTITUTIONS PER SITE=',F10.5,' AND VARIANCE=',F10.7) 
+      END
+C
+      SUBROUTINE BSCHN1(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     +IDEV)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+C     COUNT DIFFERENCES (BASE CHANGES)
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF BASES
+      MAXL=MAX(IDIM3,IDIM4)
+      MISMAT=0
+      J=I2-1
+      DO 10 I=I1,I1+MINL-1
+        J=J+1
+        IF(CTONUM(SEQ1(I)).NE.CTONUM(SEQ2(J))) MISMAT=MISMAT+1
+10    CONTINUE
+      CONS=MINL-MISMAT
+      CONS=CONS*100./MAXL
+      WRITE(IDEV,1001)MISMAT,CONS
+ 1001 FORMAT(' Number of differences ',I6,' = ',F6.2,'%')
+      END
+      SUBROUTINE BSCHN6(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     1IDEV)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+C     COUNT DIFFERENCES (BASE CHANGES)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF BASES
+      MAXL=MAX(IDIM3,IDIM4)
+      MISMAT=0
+      J=I2-1
+      DO 10 I=I1,I1+MINL-1
+        J=J+1
+        K1 = CTONUM(SEQ1(I))
+        K2 = CTONUM(SEQ2(J))
+        IF(K1.EQ.K2) GO TO 10
+        IF(K1.EQ.5) GO TO 10
+        IF(K2.EQ.5) GO TO 10
+        MISMAT=MISMAT+1
+10    CONTINUE
+      CONS=MINL-MISMAT
+      CONS=CONS*100./MAXL
+      WRITE(IDEV,1001)MISMAT,CONS
+1001  FORMAT(' NUMBER OF DIFFERENCES=',I6,' PERCENTAGE CONSERVATION',
+     1' = ',F6.2)
+      RETURN
+      END
+C
+      SUBROUTINE BSCHN4(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,DIFFER)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER DIFFER(IDIM1)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1 = I1 - 1
+      J2 = I2 - 1
+      DO 10 I=1,MINL
+        J1 = J1 + 1
+        J2 = J2 + 1
+        IF(CTONUM(SEQ1(J1)).NE.CTONUM(SEQ2(J2))) 
+     +  DIFFER(I)=DIFFER(I)+1
+10    CONTINUE
+      END
+C
+      SUBROUTINE BSCHN3(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,CHANGE)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER CHANGE(5,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF BASES
+      MAXL=MAX(IDIM3,IDIM4)
+      MISMAT=0
+      J=I2-1
+      DO 10 I=I1,I1+MINL-1
+        J=J+1
+        II=CTONUM(SEQ1(I))
+        JJ=CTONUM(SEQ2(J))
+        CHANGE(II,JJ)=CHANGE(II,JJ)+1
+10    CONTINUE
+      END
+      SUBROUTINE BSCHM3(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,CHANGE)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER CHANGE(5,5,3)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+C   SET MIN TO NUMBER OF BASES
+      MAXL=MAX(IDIM3,IDIM4)
+      MISMAT=0
+      J=I2-3
+      DO 10 I=I1,I1+MINL-2,3
+        J=J+3
+        DO 5 K=0,2
+          II=CTONUM(SEQ1(I+K))
+          JJ=CTONUM(SEQ2(J+K))
+          CHANGE(II,JJ,K+1)=CHANGE(II,JJ,K+1)+1
+ 5      CONTINUE
+10    CONTINUE
+      END
+      SUBROUTINE KIMURA(CHANGE,DIST)
+      INTEGER CHANGE(5,5)
+      EXTERNAL EVDIST
+C
+C Calc P and Q, then calc the evolutionary distance
+C
+      P = CHANGE(1,2) + CHANGE(2,1) + CHANGE(3,4) + CHANGE(4,3)
+      Q = CHANGE(1,3) + CHANGE(3,1) + CHANGE(1,4) + CHANGE(4,1)
+      Q = Q + CHANGE(3,2) + CHANGE(2,3) + CHANGE(2,4) + CHANGE(4,2)
+      NSITES = 0
+C
+C dont count changes to dash
+C
+      DO 10 I=1,4
+        DO 5 J=1,4
+          NSITES = NSITES + CHANGE(I,J)
+ 5        CONTINUE
+ 10       CONTINUE
+      WRITE(*,*)'NP=',P
+      WRITE(*,*)'NQ=',Q
+      WRITE(*,*)'NSITES=',NSITES
+      P = 58
+      Q = 63
+      NSITES = 438
+      WRITE(*,*)'NP=',P
+      WRITE(*,*)'NQ=',Q
+      WRITE(*,*)'NSITES=',NSITES
+      P = P / REAL(NSITES)
+      Q = Q / REAL(NSITES)
+      DIST = EVDIST(P,Q)
+      END
+      REAL FUNCTION EVDIST(P,Q)
+C
+C Calc evolutionary distance per site according to kimura
+C
+C P and Q are the fractions if sites with respectively type I and II
+C substitutions (ie transitions and transversions)
+C
+      EVDIST = -0.5 * LOG((1.0 - 2.0*P - Q) * SQRT(1.0 - 2.0 * Q))
+C
+C standard error
+C
+      A = 1.0 / (1.0 - 2.0 * P - Q)
+      B = 0.5 * (1.0 / ( A - 1.0 / (1.0 - 2.0 * Q)))
+C
+C !!!!!!!!!!!!!! NOT SURE I CAN READ THIS IN THE PAPER
+C AND CANT READ THE NEXT BIT 
+      END
+C
+C   SUBROUTINE TO WRITE 3 LINES OF SEQUENCES
+C
+      SUBROUTINE FMT2(IDEV,SEQ1,SEQ2,IDIM,ISW,ISX,LINLEN)
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(120)
+      INTEGER KL(12)
+C
+C
+      ISXX=ISX-1
+      ISWW=ISW-1
+      IE=0
+1     CONTINUE
+      WRITE(IDEV,1003)
+1003  FORMAT( )
+      DO 50 J=1,LINLEN/10
+        ISWW=ISWW+10
+        KL(J)=ISWW
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,LINLEN/10)
+      IS=IE+1
+      IE=IE+LINLEN
+      IF(IE.GT.IDIM)IE=IDIM
+C   COMPARE THE TWO SECTIONS OF SEQUENCE
+      IL=IE-IS+1
+      CALL SQMTCH(SEQ1(IS),SEQ2(IS),MATCH,IL)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+      WRITE(IDEV,1002)(MATCH(K),K=1,IL)
+      WRITE(IDEV,1002)(SEQ2(K),K=IS,IE)
+1002  FORMAT( '  ',12(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,LINLEN/10
+        ISXX=ISXX+10
+        KL(J)=ISXX
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(KKK),KKK=1,LINLEN/10)
+1001  FORMAT( ' ',12(5X,I6))
+      IF(IE.EQ.IDIM)RETURN
+      GO TO 1
+      END
+C
+C
+C
+C  RETURNS POINTER TO CODON NUMBER FROM 1 - 64
+      INTEGER FUNCTION JCODNO(SEQ,IDIM,I)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER CTONUM,JP(3)
+      EXTERNAL CTONUM
+C   CHECK FOR BAD CHAR IN SEQ
+      JCODNO=0
+      IP=I-1
+      DO 1 J=1,3
+      JP(J)=CTONUM(SEQ(IP+J))
+1     IF(JP(J).EQ.5)RETURN
+      JCODNO=JP(1)*16+4*JP(2)+JP(3)-20
+      RETURN
+      END
+C
+      SUBROUTINE BSCHN5(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     1IDEV,PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CFCOD1
+      EXTERNAL CFCOD1
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      ISUM = 0
+      JSUM = 0
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        ISUM = ISUM + CFCOD1(SEQ1(J1),SEQ2(J2),PAA)
+        JSUM = JSUM + CFCOD1(SEQ2(J2),SEQ1(J1),PAA)
+10    CONTINUE
+      WRITE(IDEV,1000)ISUM
+      WRITE(IDEV,1001)JSUM
+1000  FORMAT('  1 TO 2 =',I6)
+1001  FORMAT('  2 TO 1 =',I6)
+      MISMAT = ISUM + JSUM
+      END
+      SUBROUTINE BSCHN7(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     +IDEV,PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CFCOD1
+      EXTERNAL CFCOD1
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      MISMAT = 0
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        MISMAT = MISMAT + CFCOD1(SEQ1(J1),SEQ2(J2),PAA)
+10    CONTINUE
+      WRITE(IDEV,1000)MISMAT
+1000  FORMAT(' Expressed base changes ',I6)
+      END
+      SUBROUTINE BSCHN8(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     +IDEV,PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CFCOD2
+      EXTERNAL CFCOD2
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      MISMAT = 0
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        MISMAT = MISMAT + CFCOD2(SEQ1(J1),SEQ2(J2),PAA)
+10    CONTINUE
+      WRITE(IDEV,1000)MISMAT
+1000  FORMAT(' Silent base changes ',I6)
+      END
+      SUBROUTINE BSCHN9(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,MISMAT,
+     +IDEV,PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CFCOD3
+      EXTERNAL CFCOD3
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      MISMAT = 0
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        MISMAT = MISMAT + CFCOD3(SEQ1(J1),SEQ2(J2),PAA)
+10    CONTINUE
+      WRITE(IDEV,1000)MISMAT
+1000  FORMAT(' Number of changed acids ',I6)
+      END
+      SUBROUTINE COCHN1(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,DIFFER,
+     +PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER DIFFER(IDIM1)
+C count number of expressed base changes
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        CALL SCFCD1(SEQ1(J1),SEQ2(J2),PAA,DIFFER(I))
+C        DIFFER(I) = DIFFER(I) + CFCOD1(SEQ1(J1),SEQ2(J2),PAA)
+10    CONTINUE
+      END
+      SUBROUTINE SCFCD1(COD1,COD2,PAA,DIFF)
+      CHARACTER COD1(3),COD2(3),COD(3),ACID,TRANF,PAA(5,5,5)
+      INTEGER DIFF(3)
+      EXTERNAL TRANF
+C
+C returns 1 for each base in a codon cod2 that changes the acid from that
+C encoded by cod1
+C
+      ACID = TRANF(COD1,PAA)
+      DO 10 I = 1,3
+       COD(I) = COD1(I)
+10    CONTINUE
+      DO 20 I = 1,3
+        COD(I) = COD2(I)
+        IF(TRANF(COD,PAA).NE.ACID)DIFF(I) = DIFF(I) + 1
+        COD(I) = COD1(I)
+20    CONTINUE
+      END
+      INTEGER FUNCTION CFCOD1(COD1,COD2,PAA)
+      CHARACTER COD1(3),COD2(3),COD(3),ACID,TRANF,PAA(5,5,5)
+      EXTERNAL TRANF
+C
+C returns 1 for each base in a codon cod2 that changes the acid from that
+C encoded by cod1
+C
+      CFCOD1 = 0
+      ACID = TRANF(COD1,PAA)
+      DO 10 I = 1,3
+       COD(I) = COD1(I)
+10    CONTINUE
+      DO 20 I = 1,3
+        COD(I) = COD2(I)
+        IF(TRANF(COD,PAA).NE.ACID)CFCOD1 = CFCOD1 + 1
+        COD(I) = COD1(I)
+20    CONTINUE
+      END
+      SUBROUTINE COCHN2(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,DIFFER,
+     +PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER DIFFER(IDIM1)
+C count number of silent base changes
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        CALL SCFCD2(SEQ1(J1),SEQ2(J2),PAA,DIFFER(I))
+C        DIFFER(I) = DIFFER(I) + CFCOD2(SEQ1(J1),SEQ2(J2),PAA)
+10    CONTINUE
+      END
+      SUBROUTINE SCFCD2(COD1,COD2,PAA,DIFF)
+      CHARACTER COD1(3),COD2(3),COD(3),ACID,TRANF,PAA(5,5,5)
+      INTEGER DIFF(3)
+      EXTERNAL TRANF
+C
+C compare each base, if different does this base alone change the acid
+C if not then add 1 to score
+      ACID = TRANF(COD1,PAA)
+      DO 10 I = 1,3
+       COD(I) = COD1(I)
+10    CONTINUE
+      DO 20 I = 1,3
+        IF(COD1(I).NE.COD2(I)) THEN
+          COD(I) = COD2(I)
+          IF(TRANF(COD,PAA).EQ.ACID)DIFF(I) = DIFF(I) + 1
+          COD(I) = COD1(I)
+        END IF
+20    CONTINUE
+      END
+      INTEGER FUNCTION CFCOD2(COD1,COD2,PAA)
+      CHARACTER COD1(3),COD2(3),COD(3),ACID,TRANF,PAA(5,5,5)
+      EXTERNAL TRANF
+C
+C compare each base, if different does this base alone change the acid
+C if not then add 1 to score
+      CFCOD2 = 0
+      ACID = TRANF(COD1,PAA)
+      DO 10 I = 1,3
+       COD(I) = COD1(I)
+10    CONTINUE
+      DO 20 I = 1,3
+        IF(COD1(I).NE.COD2(I)) THEN
+          COD(I) = COD2(I)
+          IF(TRANF(COD,PAA).EQ.ACID)CFCOD2 = CFCOD2 + 1
+          COD(I) = COD1(I)
+        END IF
+20    CONTINUE
+      END
+      INTEGER FUNCTION CFCOD3(COD1,COD2,PAA)
+      CHARACTER COD1(3),COD2(3),TRANF,PAA(5,5,5)
+      EXTERNAL TRANF
+C
+C compare each acid, count if different
+C
+      CFCOD3 = 0
+      IF(TRANF(COD1,PAA).NE.TRANF(COD2,PAA)) CFCOD3 = 1
+      END
+      REAL FUNCTION FINF(COMP)
+      INTEGER COMP(5)
+      F = 0.
+      S = 5.
+      DO 10 I=1,5
+        S = S + COMP(I)
+ 10     CONTINUE
+      DO 20 I=1,5
+        X = (1.0+REAL(COMP(I)))/S
+        F = F - X * LOG(X)
+ 20     CONTINUE
+      FINF = F
+      END
+      SUBROUTINE COCHN3(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,CHANGE,
+     +PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CHANGE(5,5)
+C count number of expressed base changes
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        CALL SCFCD3(SEQ1(J1),SEQ2(J2),PAA,CHANGE)
+10    CONTINUE
+      END
+      SUBROUTINE SCFCD3(COD1,COD2,PAA,CHANGE)
+      CHARACTER COD1(3),COD2(3),COD(3),ACID,TRANF,PAA(5,5,5)
+      INTEGER CHANGE(5,5),CTONUM
+      EXTERNAL TRANF,CTONUM
+C
+C compare each base, if different does this base alone change the acid
+C if so then add 1 to score
+      ACID = TRANF(COD1,PAA)
+      DO 10 I = 1,3
+       COD(I) = COD1(I)
+10    CONTINUE
+      DO 20 I = 1,3
+        IF(COD1(I).NE.COD2(I)) THEN
+          COD(I) = COD2(I)
+          IF(TRANF(COD,PAA).NE.ACID) THEN
+            K1 = CTONUM(COD1(I))
+            K2 = CTONUM(COD2(I))
+            CHANGE(K1,K2) = CHANGE(K1,K2) + 1
+          END IF
+          COD(I) = COD1(I)
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE COCHN5(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,CHANGE,
+     +PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),PAA(5,5,5)
+      INTEGER CHANGE(5,5)
+C count number of expressed base changes
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        CALL SCFCD4(SEQ1(J1),SEQ2(J2),PAA,CHANGE)
+10    CONTINUE
+      END
+      SUBROUTINE SCFCD4(COD1,COD2,PAA,CHANGE)
+      CHARACTER COD1(3),COD2(3),TRANF,PAA(5,5,5)
+      INTEGER CHANGE(5,5),CTONUM
+      EXTERNAL TRANF,CTONUM
+C
+C if the acid is the same count the mutations
+C
+      IF (TRANF(COD1,PAA).EQ.TRANF(COD2,PAA)) THEN
+        DO 20 I = 1,3
+          K1 = CTONUM(COD1(I))
+          K2 = CTONUM(COD2(I))
+          CHANGE(K1,K2) = CHANGE(K1,K2) + 1
+20      CONTINUE
+      END IF
+      END
+      SUBROUTINE TRAN(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,
+     +KBIN,KBOUT,IDEV,LINLEN,PAA)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),SYM1,SYM2,MATCH(120),TRANF
+      CHARACTER PAA(5,5,5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM,TRANF
+      SYM2 = ':'
+      SYM1 = ' '
+      IF(0.EQ.1) THEN
+      WRITE(KBOUT,2000)
+2000  FORMAT(' ? IDENTITY SYMBOL=',$)
+      READ(KBIN,2001)SYM2
+2001  FORMAT(A1)
+      WRITE(KBOUT,2002)
+2002  FORMAT(' ? MISMATCH SYMBOL=',$)
+      READ(KBIN,2001)SYM1
+      END IF
+      J1=I1
+      J2=I1+LINLEN-1
+      IF(J2.GT.N1)J2=N1
+      K1=I2
+      K2=I2+LINLEN-1
+      IF(K2.GT.N2)K2=N2
+20    CONTINUE
+      WRITE(IDEV,1004)
+      WRITE(IDEV,1000)(TRANF(SEQ1(K),PAA),K=J1,J2-1,3)
+      WRITE(IDEV,1003)(SEQ1(K),K=J1,J2)
+      DO 30 JJ=1,LINLEN
+        MATCH(JJ)=SYM1
+30    CONTINUE
+      DO 40 JJ=1,MIN((J2-J1+1),(K2-K1+1),(N1-J1+1),(N2-K1+1))
+        KK=CTONUM(SEQ1(J1-1+JJ))
+        LL=CTONUM(SEQ2(K1-1+JJ))
+        IF(KK.EQ.LL)MATCH(JJ)=SYM2
+40    CONTINUE
+      WRITE(IDEV,1003)(MATCH(K),K=1,MIN((J2-J1+1),(K2-K1+1),
+     +(N1-J1+1),(N2-K1+1)))
+      WRITE(IDEV,1003)(SEQ2(K),K=K1,K2)
+      WRITE(IDEV,1000)(TRANF(SEQ2(K),PAA),K=K1,K2-1,3)
+C
+      J1=J2+1
+      J2=J2+LINLEN
+      IF(J1.GT.N1)RETURN
+      IF(J2.GT.N1)J2=N1
+      K1=K2+1
+      K2=K2+LINLEN
+      IF(K1.GT.N2)RETURN
+      IF(K2.GT.N2)K2=N2
+      GO TO 20
+1000  FORMAT(3X,40(2X,1A1))
+1001  FORMAT(4X,40(2X,1A1))
+1002  FORMAT(5X,40(2X,1A1))
+1003  FORMAT(4X,120A1)
+1004  FORMAT( )
+1005  FORMAT(4X,12I10)
+      END
+      SUBROUTINE INITMT(M,S,JOB)
+      INTEGER S(4096)
+      REAL M(16)
+      CALL FILLI(S,4096,0)
+C only initialise mutation rates if first pass
+      IF(JOB.EQ.0) CALL FILLR(M,16,0.25)
+      END
+      SUBROUTINE MAKED(M,D)
+      REAL M(4,4),D(17)
+      INTEGER T,C,A,G
+      SAVE T,C,A,G
+      DATA T,C,A,G/1,2,3,4/
+C
+C To use these values: 1. add m(x,y) to d where x,y is the selected change
+C                      2. correct each of the appropriate m's using the
+C                         d values and add m(x,y)/d to m(x,y)
+C
+      D(1) = M(T,C) + M(T,A) + M(T,G)
+      D(2) = M(T,A) + M(T,G)
+      D(3) = M(T,G)
+      D(4) = 0.
+      D(5) = M(C,T) + M(C,A) + M(C,G)
+      D(6) = M(C,A) + M(C,G)
+      D(7) = 0.
+      D(8) = M(A,T) + M(A,C) + M(A,G)
+      D(9) = M(A,T) + M(A,C)
+      D(10) = M(A,G)
+      D(11) = 0.
+      D(12) = M(G,T) + M(G,C) + M(G,A)
+      D(13) = M(G,T) + M(G,C)
+      D(14) = 0.
+      D(15) = M(C,T) + M(C,G)
+      D(16) = M(C,G)
+      D(17) = M(A,T) + M(A,G)
+C      DO 10 I=1,17
+C        WRITE(*,*)I,D(I)
+C 10     CONTINUE
+      END
+      SUBROUTINE UPDM(M,D,E,MC,B1,B2,IADD,ND)
+      REAL M(4,4),D(17),E(4,4)
+      INTEGER T,C,A,G,B1,B2,ND(17)
+      SAVE T,C,A,G
+      PARAMETER (DELTA = 1E-10)
+      DATA T,C,A,G/1,2,3,4/
+      IF (IADD.EQ.0) RETURN
+C      ND(MC) = ND(MC) +  IADD
+      ADD = IADD
+C      E(B1,B2) = E(B1,B2) + ADD
+C      IF(0.EQ.0) RETURN
+C
+C add the rate for the observed mutation to the denominator
+C
+      X = D(MC) + M(B1,B2) + DELTA
+C      WRITE(*,*)'B1,B2,MC,ADD,D,X',B1,B2,MC,ADD,D(MC),X
+      IF (MC.EQ.1) THEN
+C T>T
+        IF(0.EQ.0) RETURN
+C        WRITE(*,*)'D(1),M(B1,B2)',D(1),M(B1,B2)
+C
+C add in the observed mutation to the appropriate mutation rate
+C which is equivalent to adding (1 - the corrections)
+C
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+C
+C now add in the corrections for the barred bases
+C
+        E(B1,C) = E(B1,C) + ADD*M(B1,C)/X
+        E(B1,A) = E(B1,A) + ADD*M(B1,A)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.2) THEN
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,A)  = E(B1,A) + ADD*M(B1,A)/X
+        E(B1,G)  = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.3) THEN
+C        WRITE(*,*)'D(3),M(B1,B2)',D(3),M(B1,B2)
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,G)  = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.4) THEN
+C T>TCAG
+C        WRITE(*,*)'D(),M(B1,B2)',D(4),M(B1,B2)
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+      ELSE IF (MC.EQ.5) THEN
+C C>C
+        IF(0.EQ.0) RETURN
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,A) = E(B1,A) + ADD*M(B1,A)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.6) THEN
+C C>CT
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,A)  = E(B1,A) + ADD*M(B1,A)/X
+        E(B1,G)  = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.7) THEN
+C C>TCAG
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+      ELSE IF (MC.EQ.8) THEN
+C A>A
+        IF(0.EQ.0) RETURN
+C        WRITE(*,*)'D(8),M(B1,B2)',D(8),M(B1,B2)
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,C) = E(B1,C) + ADD*M(B1,C)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.9) THEN
+C A>AG
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,C) = E(B1,C) + ADD*M(B1,C)/X
+      ELSE IF (MC.EQ.10) THEN
+C A>TCA
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.11) THEN
+C A>TCAG
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+      ELSE IF (MC.EQ.12) THEN
+C G>G
+        IF(0.EQ.0) RETURN
+C        WRITE(*,*)'D(12),M(B1,B2)',D(12),M(B1,B2)
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,C) = E(B1,C) + ADD*M(B1,C)/X
+        E(B1,A) = E(B1,A) + ADD*M(B1,A)/X
+      ELSE IF (MC.EQ.13) THEN
+C G>AG
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,C) = E(B1,C) + ADD*M(B1,C)/X
+      ELSE IF (MC.EQ.14) THEN
+C G>TCAG
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+      ELSE IF (MC.EQ.15) THEN
+C C>CA
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.16) THEN
+C C>TCA
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE IF (MC.EQ.17) THEN
+C A>CA
+        E(B1,B2) = E(B1,B2) + ADD*M(B1,B2)/X
+        E(B1,T) = E(B1,T) + ADD*M(B1,T)/X
+        E(B1,G) = E(B1,G) + ADD*M(B1,G)/X
+      ELSE
+        WRITE(*,*)'SCREAM UPDM',MC
+      END IF
+      END
+      SUBROUTINE UPDE(M,E,MC,SUMS,PAA)
+      REAL M(4,4),D(17),E(4,4),T(4),PREV(4,4),DELTA
+      INTEGER SUMS(4,4,4,4,4,4),MC(4,4,4,3),CYCLE,ND(17)
+      CHARACTER PAA(5,5,5)
+      PARAMETER (SMALL = 1E-10)
+      SAVE DELTA
+      DATA DELTA/0.01/
+      DO 1 I=1,4
+        DO 1 J=1,4
+          PREV(I,J) = M(I,J)
+ 1        CONTINUE
+      DO 3 I=1,17
+        ND(I) = 0
+ 3      CONTINUE
+      IR = 0
+ 5    CONTINUE
+      IR = IR + 1
+      CALL MAKED(M,D)
+      DO 10 I=1,4
+        DO 10 J=1,4
+          E(I,J) = 0.
+ 10       CONTINUE
+      DO 20 I=1,4
+        DO 20 J=1,4
+          DO 20 K=1,4
+            DO 15 II=1,4
+              DO 15 JJ=1,4
+                DO 15 KK=1,4
+                  IF(PAA(K,J,I).EQ.PAA(KK,JJ,II)) THEN
+       IF((I.NE.II).OR.(J.NE.JJ).OR.(K.NE.KK)) THEN
+           CALL UPDM(M,D,E,MC(K,J,I,1),I,II,SUMS(I,J,K,II,JJ,KK),ND)
+           CALL UPDM(M,D,E,MC(K,J,I,2),J,JJ,SUMS(I,J,K,II,JJ,KK),ND)
+           CALL UPDM(M,D,E,MC(K,J,I,3),K,KK,SUMS(I,J,K,II,JJ,KK),ND)
+                  END IF
+                  END IF
+ 15               CONTINUE
+ 20               CONTINUE
+C       WRITE(*,*)'NEW E '
+C       DO 55 I=1,4
+C         WRITE(*,*)(E(I,J),J=1,4)
+C  55   CONTINUE
+      CALL FILLR(T,4,SMALL)
+      DO 30 I=1,4
+        DO 30 J=1,4
+          T(I) = T(I) + E(I,J)
+ 30       CONTINUE
+C      WRITE(*,*)'ROW TOTS',T
+      DO 40 I=1,4
+        DO 40 J=1,4
+          M(I,J) = E(I,J)/T(I)
+ 40       CONTINUE
+C      WRITE(*,*)' NEW M'
+C      DO 50 I=1,4
+C        WRITE(*,1000)(M(I,J),J=1,4)
+C        WRITE(*,1000)(PREV(I,J),J=1,4)
+C 50     CONTINUE
+C 1000   FORMAT(4F8.5)
+      CYCLE = 0
+      DO 60 I=1,4
+        DO 60 J=1,4
+          IF (ABS(M(I,J)-PREV(I,J)).GT.DELTA) CYCLE = 1
+C          PREV(I,J) = M(I,J)
+ 60       CONTINUE
+C      WRITE(*,*)IR
+      DO 70 I=1,4
+        DO 70 J=1,4
+          PREV(I,J) = M(I,J)
+ 70       CONTINUE
+      IF(CYCLE.EQ.1) GO TO 5
+      WRITE(*,*)'Number of iterations',IR
+C      WRITE(*,*)(K,ND(K),K=1,17)
+      END
+      SUBROUTINE UPDET(SUMS,PAA,MC)
+      INTEGER SUMS(4,4,4,4,4,4),MC(4,4,4,3)
+      CHARACTER PAA(5,5,5),BASES(4)
+      SAVE BASES
+      DATA BASES/'T','C','A','G'/
+      DO 20 I=1,4
+        DO 20 J=1,4
+          DO 20 K=1,4
+            DO 15 II=1,4
+              DO 15 JJ=1,4
+                DO 15 KK=1,4
+                  IF(PAA(K,J,I).EQ.PAA(KK,JJ,II)) THEN
+C                  IF(PAA(I,J,K).EQ.PAA(II,JJ,KK)) THEN
+C                    WRITE(*,*)I,J,K,' to',II,JJ,KK,SUMS(I,J,K,II,JJ,KK)
+             WRITE(*,*)BASES(I),BASES(J),BASES(K),
+     +BASES(II),BASES(JJ),BASES(KK),MC(K,J,I,1),MC(K,J,I,2),
+     +MC(K,J,I,3),SUMS(I,J,K,II,JJ,KK)
+                  END IF
+ 15               CONTINUE
+ 20               CONTINUE
+      END
+      SUBROUTINE COCHN4(SEQ1,IDIM1,I1,N1,SEQ2,IDIM2,I2,N2,CHANGE)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2)
+      INTEGER CHANGE(4,4,4,4,4,4),CTONUM,K1(3),K2(3)
+      EXTERNAL CTONUM
+      IDIM3=N1-I1+1
+      IDIM4=N2-I2+1
+C   GET MINIMUM LENGTH TO SCAN
+      MINL=MIN(IDIM3,IDIM4)
+      J1=I1 - 3
+      J2 = I2 - 3
+      DO 10 I=1,MINL-2,3
+        J1 = J1 + 3
+        J2 = J2 + 3
+        DO 5 JJ=1,3
+         KK = CTONUM(SEQ1(J1+JJ-1))
+         IF(KK.EQ.5) GO TO 10
+         K1(JJ) = KK
+         KK = CTONUM(SEQ2(J2+JJ-1))
+         IF(KK.EQ.5) GO TO 10
+         K2(JJ) = KK
+ 5       CONTINUE
+C         WRITE(*,*)I,K1,K2
+         CHANGE(K1(1),K1(2),K1(3),K2(1),K2(2),K2(3)) =
+     +   CHANGE(K1(1),K1(2),K1(3),K2(1),K2(2),K2(3)) + 1
+10    CONTINUE
+      END
+      SUBROUTINE EMT(MTT)
+      INTEGER MTT(64,3)
+      CHARACTER MTTC(64)*3
+      SAVE MTTC
+      DATA MTTC/
+     +'112','116','219','21D',
+     +'154','157','15B','15E',
+     +'182','186','199','19D',
+     +'1C2','1C6','1D8','1CC',
+     +'514','517','61B','61E',
+     +'554','557','55B','55E',
+     +'582','586','589','58D',
+     +'5C4','5C7','FCB','FCE',
+     +'813','81G','81A','81C',
+     +'854','857','85B','85E',
+     +'882','886','889','88D',
+     +'8C2','8C6','HC9','HCD',
+     +'C14','C17','C1B','C1E',
+     +'C54','C57','C5B','C5E',
+     +'C82','C86','C89','C8D',
+     +'CC4','CC7','CCB','CCE'/
+      CALL CTI(MTTC,MTT)
+C      CALL WRTCOI(SUM,6,PAA,MTTC)
+C      CALL EM(PAA,KBOUT)
+      END
+      SUBROUTINE CTI(MTTC,MTT)
+      CHARACTER MTTC(64)*3
+      INTEGER MTT(64,3)
+      CHARACTER C*17
+      DATA C/'123456789ABCDEFGH'/
+      DO 10 J=1,64
+        DO 8 M=1,3
+          DO 5 K=1,17
+            IF(MTTC(J)(M:M).EQ.C(K:K)) THEN
+              MTT(J,M) = K
+              GO TO 8
+            END IF
+ 5          CONTINUE
+ 8          CONTINUE
+ 10         CONTINUE
+      END
+      SUBROUTINE WRTCOI(SUM,IDEV,PAA,MTT)
+C   AUTHOR: RODGER STADEN
+      INTEGER SUM(4,4,4)
+      CHARACTER BASE(4),PAA(5,5,5),MTT(4,4,4)*3
+      SAVE BASE
+      DATA BASE/'T','C','A','G'/
+C
+      WRITE(IDEV,1001)
+1001  FORMAT(6X,'===========================================')
+      DO 10 I=1,4
+      DO 20 K=1,4
+      WRITE(IDEV,1000)(PAA(K,J,I),
+     +BASE(I),BASE(J),BASE(K),SUM(I,J,K),J=1,4)
+      WRITE(IDEV,1002)(MTT(K,J,I),J=1,4)
+ 1002 FORMAT(5X,4(1X,1X,A,5X))
+20    CONTINUE
+10    WRITE(IDEV,1001)
+1000  FORMAT(5X,4(1X,A1,1X,3A1,I5))
+      END
diff --git a/src/staden/nipl.f b/src/staden/nipl.f
new file mode 100644
index 0000000..f1afda5
--- /dev/null
+++ b/src/staden/nipl.f
@@ -0,0 +1,1207 @@
+C  NIPL (Nucleotide interpretation program (library))
+C  2-12-92 upped maxseq, maxwir to 360,000
+C  9-7-92 added fasta format capability
+C  29-3-1990 DSPLAZ thought repeat was class 5 corrected to 8
+C  11-4-1990 Changed title to be string rather than array
+C  18-4-1990 Changed to allow use of library index when appropriate
+C   8-11-90 Changed call to getmf for compatibility with nip
+C           and replaced all calls to radio by radion
+C  11-12-90 Changed library handling so added filnll and a parameter
+C  3-7-91 set namlen = 60
+C  4-7-91 inlined weight matrix search (copied from pipl). Should do rest!
+C  18-7-91 Added titles to pattern files
+C  16-12-91 Added access to pir library in codata form to pipl so make this the same
+C  2-3-92 set filnam = ' ' for some calls to openf1
+      SUBROUTINE FMAIN()
+      INTEGER BOTOPT,TOPOPT
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) FILNAM,HELPF,LIBLF
+      PARAMETER (
+     +           MAXSEQ=360000,
+     +           MXSPAN=603,
+     +           MAXWIN=MAXSEQ+MXSPAN,
+     +           MAXWIR=360000,
+     +           MAXD36=MAXWIR/36,
+     +           MAXD2=MAXWIR/2,
+     +           MAXD3=MAXWIR/3,
+     +           MAXSD2=MAXSEQ/2,
+     +           MAXSD3=MAXSEQ/3,
+     +           MAXDEV=10,
+     +           LIBLF = 'SEQUENCELIBRARIES',
+     +           LENNAM = 10)
+      PARAMETER (MAXMOT =   50,
+     +           MAXWTS = 5000,
+     +           IDM    = 5,
+     +           IDME   = 17)
+      INTEGER BESTP(MAXMOT),BESTQ(MAXMOT)
+      REAL BESTS(MAXMOT)
+      CHARACTER*(LENNAM) ENAMEL,ENAME,NAMSAV(MAXMOT)
+      PARAMETER (MAXDIV = 15)
+      INTEGER DIVDEV(MAXDIV),RSIZEN
+C  COMPATIBILITY WITH ANALYSEQ HELP STUFF
+      PARAMETER (BOTOPT=0,TOPOPT=1)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER DEVNOS(MAXDEV)
+      REAL WORKR(MAXWIR)
+      INTEGER WORKI(MAXWIR)
+      CHARACTER SEQ(MAXWIN),SEQW(MAXSEQ)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      CHARACTER*8 KEYNS(MAXMOT)
+      CHARACTER LTYPE
+      EQUIVALENCE (WORKR,WORKI)
+      DATA MAT1/
+     +            1,0,0,0,0,
+     +            0,1,0,0,0,
+     +            0,0,1,0,0,
+     +            0,0,0,1,0,
+     +            0,0,0,0,0/
+      DATA MAT2/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+      DATA MAT3/
+     +     36, 0, 0, 0, 9, 0,18,18, 0, 0,18,12,12, 0,12, 9, 0,
+     +      0,36, 0, 0, 9, 0,18, 0,18,18, 0,12,12,12, 0, 9, 0,
+     +      0, 0,36, 0, 9,18, 0,18, 0,18, 0,12, 0,12,12, 9, 0,
+     +      0, 0, 0,36, 9,18, 0, 0,18, 0,18, 0,12,12,12, 9, 0,
+     +      9, 9, 9, 9,36,18,18,18,18,18,18,27,27,27,27,36, 0,
+     +      0, 0,18,18,18,36, 0, 9, 9, 9, 9, 6, 6,12,12,18, 0,
+     +     18,18, 0, 0,18, 0,36, 9, 9, 9, 9,12,12, 6, 6,18, 0,
+     +     18, 0,18, 0,18, 9, 9,36, 0, 9, 9,12, 6, 6,12,18, 0,
+     +      0,18, 0,18,18, 9, 9, 0,36, 9, 9, 6,12,12, 6,18, 0,
+     +      0,18,18, 0,18, 9, 9, 9, 9,36, 0,12, 6,12, 6,18, 0,
+     +     18, 0, 0,18,18, 9, 9, 9, 9, 0,36, 6,12, 6,12,18, 0,
+     +     12,12,12, 0,27, 6,12,12, 6,12, 6,36, 8, 8, 8,27, 0,
+     +     12,12, 0,12,27, 6,12, 6,12, 6,12, 8,36, 8, 8,27, 0,
+     +      0,12,12,12,27,12, 6, 6,12,12, 6, 8, 8,36, 8,27, 0,
+     +     12, 0,12,12,27,12, 6,12, 6, 6,12, 8, 8, 8,36,27, 0,
+     +      9, 9, 9, 9,36,18,18,18,18,18,18,27,27,27,27,36, 0,
+     +      0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0/
+      DATA MAT4/
+     +            0,0,2,1,0,
+     +            0,0,0,2,0,
+     +            2,0,0,0,0,
+     +            1,2,0,0,0,
+     +            0,0,0,0,0/
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      WRITE(KBOUT,1000)
+1000  FORMAT(/,
+     +' NIPL (Nucleotide interpretation program (Library))',
+     +' V4.1 Jul 1991',/,
+     +' Author: Rodger Staden',/,
+     +' Searches nucleotide libraries for patterns of motifs',/)
+      CALL INITLU(IDM)
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for results file',
+     +HELPS(1),HELPE(1),HELPF,DEVNOS(2))
+      IF(IOK.NE.0)STOP
+C  OPEN LIBRARY
+      LIBIN = 1
+      IDEVNL = DEVNOS(7)
+      IDEVLL = DEVNOS(8)
+      IDEVEN = DEVNOS(9)
+      IDEVD = DEVNOS(MAXDEV)
+      CALL RDLIBL(FILNAM,KBIN,KBOUT,
+     +HELPS(1),HELPE(1),HELPF,DEVNOS(2),IDEVLL,IDEVEN,IDEVNL,
+     +LIBLF,LIBIN,DIVDEV,MAXDIV,IDEVD,
+     +LIST,ENAMEL,LIBTYP,LTYPE,NDIV,RSIZEN,NRECEN,IOK)
+      IF(IOK.NE.0)STOP
+      IDEVOT=DEVNOS(1)
+      J1 = 1
+        CALL PATTEO(SEQ(MXSPAN+1),MAXSEQ,SEQW,MAXSEQ,
+     +  WORKI(1),WORKI(MAXMOT+1),WORKI(2*MAXMOT+1),WORKI(3*MAXMOT+1),
+     +  WORKI(4*MAXMOT+1),WORKI(5*MAXMOT+1),WORKI(6*MAXMOT+1),
+     +  WORKI(7*MAXMOT+1),WORKI(8*MAXMOT+1),WORKI(9*MAXMOT+1),
+     +  WORKI(10*MAXMOT+1),WORKI(11*MAXMOT+1),WORKI(12*MAXMOT+1),
+     +  WORKI(13*MAXMOT+1),WORKI(14*MAXMOT+1),WORKI(15*MAXMOT+1),
+     +  WORKI(16*MAXMOT+1),WORKI(17*MAXMOT+1),WORKI(18*MAXMOT+1),
+     +  WORKI(19*MAXMOT+1),WORKI(20*MAXMOT+1),
+     +  WORKR(22*MAXMOT+1),WORKR(23*MAXMOT+1),
+     +  FILNAM,MAXMOT,MAXWTS,IDEVOT,DEVNOS(3),DEVNOS(4),J1,
+     +  KBIN,KBOUT,DEVNOS(5),IDM,SEQ(1),IDME,
+     +  MAT1,MAT2,MAT3,MAT4,NAMSAV,KEYNS,
+     +  WORKI(23*MAXMOT+1+MAXWTS),
+     +  WORKI(23*MAXMOT+MAXWTS+3001),LIST,
+     +  HELPS,HELPE,HELPF,DEVNOS(2),DEVNOS(6),ENAMEL,ENAME,LIBTYP,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,BESTP,BESTQ,BESTS,
+     +  LTYPE)
+C NB HAVE SET MAXIMUM COMBINED STRING LENGTH TO 3000 ON PREVIOUS LINE
+C AND MAX INTEGER VERSION OF SEQUENCE IS WHATEVER IS LEFT AND THIS IS NOT
+C SENT OR CHECKED BY THE CODE !!!!!!!!!!!!!!!!!!!!!!!!!!
+C
+900   CONTINUE
+      END
+      SUBROUTINE PATTEO(SEQ,MAXSEQ,STRING,MAXSTR,
+     +LENGTH,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,IENTRY,
+     +START2,IEND2,WTSTR,START,IEND,MATCHQ,RELEND,MATCHP,
+     +STRNGS,LAST5,LAST3S,LAST3E,MATCHS,CUTOFF,WEIGHT,FILNAM,
+     +MAXMOT,MAXWTS,
+     +IDEV1,IDEV2,IDEV3,
+     +KSTART,KBIN,KBOUT,
+     +IDEV4,IDM,COMBIN,IDME,
+     +  MAT1,MAT2,MAT3,MAT4,NAMSAV,KEYNS,STRNGI,SEQI,LIST,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEVN,ENAMEL,NAMIN,LIBTYP,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,BESTP,BESTQ,BESTS,
+     +  LTYPE)
+      INTEGER DIVDEV(NDIV),RSIZEN
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT)
+      INTEGER RANGET(MAXMOT),RANGEM(MAXMOT),IENTRY(MAXMOT)
+      INTEGER START2(MAXMOT),IEND2(MAXMOT)
+      CHARACTER SEQ(MAXSEQ),STRING(MAXSTR)
+      INTEGER WTSTR(MAXMOT),START(MAXMOT),IEND(MAXMOT)
+      INTEGER MATCHQ(MAXMOT),RELEND(MAXMOT)
+      INTEGER MATCHP(MAXMOT),STRNGS(MAXMOT)
+      INTEGER LAST5(MAXMOT),LAST3S(MAXMOT),LAST3E(MAXMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT),MATCHS(MAXMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER FILNAM*(*),NAMIN*(*),ENAMEL*(*),HELPF*(*)
+      CHARACTER*(*) KEYNS(MAXMOT),NAMSAV(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),LTYPE
+C NB PROBLEM ABOUT USING MAXSEQ AS DIMENSION!!!!!!!!!!!!
+      INTEGER STRNGI(MAXSTR),SEQI(MAXSEQ)
+      INTEGER BESTP(MAXMOT),BESTQ(MAXMOT),ENTRYN
+      REAL BESTS(MAXMOT)
+      CHARACTER TITLE*60,TITLEP*80
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      REAL EXPECC(4)
+      PARAMETER (MAXCLS = 8)
+      PARAMETER (MAXPRM = 25)
+      CHARACTER PROMPT(5)*(MAXPRM)
+      SAVE EXPECC
+      DATA EXPECC/.25,.25,.25,.25/
+      IDSEQ = 1000
+C  ZERO ARRAYS
+C
+        CALL FILLI(LENGTH,MAXMOT,0)
+        CALL FILLI(CLASS,MAXMOT,0) 
+        CALL FILLI(RELMOT,MAXMOT,0)
+        CALL FILLI(RANGES,MAXMOT,0)
+        CALL FILLI(RANGEL,MAXMOT,0)
+        CALL FILLI(RANGET,MAXMOT,0)
+        CALL FILLI(RANGEM,MAXMOT,0)
+        CALL FILLI(IENTRY,MAXMOT,0)
+        CALL FILLI(START2,MAXMOT,0)
+        CALL FILLI(IEND2,MAXMOT,0) 
+        CALL FILLI(WTSTR,MAXMOT,0) 
+        CALL FILLI(START,MAXMOT,0) 
+        CALL FILLI(IEND,MAXMOT,0)  
+        CALL FILLI(MATCHQ,MAXMOT,0)
+        CALL FILLI(RELEND,MAXMOT,0)
+        CALL FILLI(MATCHP,MAXMOT,0)
+        CALL FILLI(STRNGS,MAXMOT,0)
+        CALL FILLI(LAST5,MAXMOT,0) 
+        CALL FILLI(LAST3S,MAXMOT,0)
+        CALL FILLI(LAST3E,MAXMOT,0)
+        CALL FILLR(CUTOFF,MAXMOT,0.0)
+        CALL FILLR(MATCHS,MAXMOT,0.0)
+        CALL FILLR(WEIGHT,MAXWTS,0.0)
+        CALL FILLC(COMBIN,MAXMOT,'A')
+        CALL FILLI(BESTP,MAXMOT,0)
+        CALL FILLC(BESTQ,MAXMOT,0)
+        CALL FILLC(BESTS,MAXMOT,0.)
+        ITOTAL = 0
+        CALL SETCMP(EXPECC,4)
+        PROMPT(1) = 'Motif by motif'
+        PROMPT(2) = 'Inclusive'
+        PROMPT(3) = 'Scores only'
+        PROMPT(4) = 'Complete padded sequences'
+        PROMPT(5) = 'Padded sections'
+        IOPT = 1
+        CALL RADION('Select results display mode',PROMPT,5,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOPT.LT.1) RETURN
+        IF(IOPT.EQ.4)THEN
+          MININ = 1
+          MAXIN = 9999
+          JSTART = 1
+          WRITE(KBOUT,1020)
+1020      FORMAT(
+     +' For output option 4, we need to position the first motifs',/,
+     +' in a pattern so that they are aligned with one another')
+         CALL GETINT(MININ,MAXIN,JSTART,'Position of first motif',
+     +   IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+         IF(IOK.NE.0) RETURN
+         JSTART = IVAL
+       END IF
+       JOPT = 0
+       CALL YESNO(JOPT,'Report all matches',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+       IF(JOPT.LT.0) RETURN
+        FILNAM = ' '
+        CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+C
+C  GET MOTIF DEFINITIONS
+C
+C  RETURN STRING LENGTH FOR COMPATIBILITY WITH LIB SEARCH
+      NSTRNG = MAXSTR
+        CALL GETMF(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,
+     +  MAT1,IDM,MAT2,IDME,MAT3,IDME,MAT4,IDM,
+     +  PMINT,PMAXT,PROBT,EXPTT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,0,TITLEP)
+      IF(IOK.NE.0)RETURN
+      IF(NMOT.LT.1)RETURN
+C
+C
+C  DISPLAY THE SIGNAL DESCRIPTION
+C
+      RANGES(1) = 1
+      CALL DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLEP)
+      WRITE(KBOUT,2003)PROBT
+2003  FORMAT(' Probability of finding pattern = ',E10.4)
+      WRITE(KBOUT,2004)EXPTT
+2004  FORMAT(' Expected number of matches per 1000 residues = ',E10.4)
+      CALL GETRL(0.,1.,1.,'Maximum pattern probability',
+     +XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PMINC = XP
+      CALL GETRL(-9999.,9999.,-9999.,'Minimum pattern score',
+     +XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CUTSCR = XP
+      MINSCR = 9999999.
+      MAXSCR = -9999999.
+      IF(NSTRNG.GT.0)CALL CONNUM(STRING,STRNGI,NSTRNG)
+C
+C
+      ICREC = 0
+      IFINEX = 0
+      NENTRY = 0
+      ENTRYN = 0
+20    CONTINUE
+      IDSEQ = MAXSEQ
+      IF(LIBTYP.EQ.1) THEN
+        CALL CDROML(LIST,NAMIN,ENAMEL,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,SEQ,IDSEQ,
+     +  DIVDEV,NDIV,ICREC,IFINEX,TITLE,KBOUT,LTYPE,IOK)
+        IF(IOK.NE.0) GO TO 900
+        FILNAM = NAMIN
+      ELSE IF(LIBTYP.EQ.2) THEN
+        CALL RDPIRA(SEQ,IDSEQ,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL)
+      ELSE IF(LIBTYP.EQ.3) THEN
+        CALL RDFASA(SEQ,IDSEQ,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL,ENTRYN)
+      ELSE
+        RETURN
+      END IF
+      IF(IDSEQ.LT.1)GO TO 900
+      NENTRY = NENTRY + 1
+C  CONVERT TO INTEGER
+      IF(IDSEQ.GT.0)CALL CONNUM(SEQ,SEQI,IDSEQ)
+C
+      RANGES(1) = 1
+      RANGEL(1) = IDSEQ
+C
+C  DO THE SEARCH
+C
+      CALL SRCSIH(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATCHQ,RELEND,
+     +IDEV1,LAST5,LAST3S,LAST3E,IOPT,
+     +ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,IDME,
+     +SEQI,STRNGI,TITLE,FILNAM,
+     +PMINT,PMAXT,PROBT,MAT1,MAT2,MAT3,
+     +MAT4,PMINC,IDEVN,KEYNS,BESTP,BESTQ,BESTS,JOPT)
+C
+C
+      GO TO 20
+C
+C
+900   CONTINUE
+      WRITE(KBOUT,1002)ITOTAL
+1002  FORMAT(' Total matches found',I7)
+      WRITE(KBOUT,1006)MINSCR,MAXSCR
+1006  FORMAT(' Minimum and maximum observed scores=',2F12.2)
+      WRITE(KBOUT,1009)NENTRY
+1009  FORMAT(' ',I7,' Entries processed')
+      END
+      SUBROUTINE SRCSIH(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATCHQ,RELEND,
+     +IDEVOT,LAST5,LAST3S,LAST3E,
+     +IOPT,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,IDME,SEQI,STRNGI,TITLE,FILNAM,
+     +PMINT,PMAXT,PROBT,MAT1,MAT2,MAT3,
+     +MAT4,PMINC,IDEVN,KEYNS,BESTP,BESTQ,BESTS,JOPT)
+C  ROUTINE TO SEARCH FOR SIGNALS COMPOSED OF MOTIFS
+C  WEIGHT  = WEIGHTS FOR MATRICES
+C  CUTOFF  = CUTOFF SCORES
+C  LENGTH  = MOTIF LENGTHS
+C  CLASS   = MOTIF CLASS
+C  COMBIN  = LOGICAL COMBINATION A, O, N
+C  WTSTR   = POINTER TO WEIGHT STARTS
+C  RANGES  = RANGES START
+C  RANGEL  = RANGE LENGTH (A DISTANCE MEASURED FROM RANGES)
+C  RELMOT  = MOTIF NUMBER THAT A RANGE IS RELATIVE TO IE THE
+C            FIRST MOTIF'S RANGE IS RELATIVE TO MOTIF 0, BUT
+C            ANY OTHER MOTIF MAY HAVE TO BE DEFINED RELATIVE
+C            TO ANY OTHER. THE MOST COMMON WOULD BE THE FIRST
+C            MOTIF OR THE LAST ONE SEARCHED FOR.
+C  RELEND  = IS A SPECIAL CASE FLAG FOR STEMS. IT ALLOWS OTHER
+C            MOTIFS TO HAVE THEIR POSITIONS RELATIVE TO THE 3' SIDE
+C            OF A STEM. IT IS 5 FOR THE 5 PRIME SIDE, 3 FOR 3' SIDE
+C  START   = RANGE START DURING SEARCH (SOME POSITIONS MAY HAVE
+C            BEEN TRIED)
+C  IEND    = RANGE END POSITION FOR CURRENT INITIAL START (WHEN IT
+C            IS RESET DEPENDS ON WHETHER IT IS DEFINED RELATIVE TO 
+C            THE FIRST OR THE PREVIOUS MOTIF. IF IT IS DEFINED RELATIVE
+C            TO THE FIRST MOTIF IT IS RESET WHEN WE FIND A MATCH FOR THE
+C            FIRST MOTIF. IF IT IS DEFINED RELATIVE TO THE PREVIOUS MOTIF
+C            WE MUST RESET WHEN WE MOVE FORWARD ONE MOTIF. I THINK THIS
+C            CAN BE TAKEN CARE OF BY UPDATING ALL THOSE MOTIFS THAT ARE
+C            DEFINED TO THE CURRENT MOTIF EVERY TIME WE MOVE FORWARD 
+C            ONE MOTIF (IE IT INCLUDES THE FIRST MOTIF SO IT IS NOT A
+C            SPECIAL CASE). OTHERWISE WE UPDATE POSITIONS WHEN WE FIND
+C            A MATCH FOR THEM (WE SET TO THE MATCH POSITION PLUS 1)
+C  MATCHP  = LIST OF CURRENT MATCH POSITIONS FOR EACH MOTIF
+C  MATCHS  = LIST OF CURRENT MATCH SCORES FOR EACH MOTIF
+C  IFOUND  = A FLAG TO INDICATE SUCCESS OR FAILURE OF A SEARCH ROUTINE
+C            1 = SUCCESS, 0 = FAIL
+C  STRNGS  = POINTER TO STRING STARTS IN CHARACTER ARRAY STRING
+C  TEMPORARY VALUES ARE:
+C  MOTIF   = ACTUAL MOTIF NUMBER
+C  ICLASS  = CLASS
+C  ILEN    = LENGTH OF MOTIF
+C  CUT     = CUTOFF
+C  WT      = START OF WEIGHTS FOR THIS MOTIF
+C  ISTRST  = START OF STRING
+C  RANGET  = START OF 3' RANGE FOR STEM SEARCHES
+C  RANGEM  = END OF 3' RANGE FOR STEM SEARCHES
+C  IENTRY  = FLAG TO SIGNIFY MORE 3' STEM POSITIONS FOR LAST 5' START
+C            0 = NONE, ON RETURN FROM MOTIF6 IT CONTAINS THE 3' MATCH 
+C            POSITION
+C  MATCHQ  = MATCH POSITION FOR STEM SEARCH
+C  COMB    = LOGICAL COMBINATION A, O, N
+      INTEGER WTSTR(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER RELMOT(NMOT),MATCHP(NMOT),STRNGS(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),IENTRY(NMOT),RELEND(NMOT)
+      INTEGER START2(NMOT),IEND2(NMOT),MATCHQ(NMOT)
+      INTEGER LAST5(NMOT),LAST3S(NMOT),LAST3E(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT),MATCHS(NMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      CHARACTER COMBIN(NMOT),COMB
+      INTEGER STRNGI(MAXSTR),SEQI(IDSEQ)
+      CHARACTER TITLE*(*),FILNAM*(*)
+      CHARACTER*(*) KEYNS(NMOT)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      INTEGER BESTP(NMOT),BESTQ(NMOT)
+      REAL BESTS(NMOT),MAXSS,MINMSS
+      PARAMETER (MINMSS = -999999.)
+C
+C
+C  INITIALIZE
+      JMOT = 0
+      IRET = 0
+5     CONTINUE
+      JMOT = JMOT + 1
+      IF(JMOT.LE.NMOT)THEN
+        IF(RELMOT(JMOT).EQ.0)THEN
+          START(JMOT) = RANGES(1)
+          IEND(JMOT) = RANGES(1) + RANGEL(1) -1
+          GO TO 5
+        END IF
+      END IF
+      MOTIF  = 1
+      ICLASS = CLASS(1)
+      ILEN   = LENGTH(1)
+      CUT    = CUTOFF(1)
+      IWT    = WTSTR(1)
+      ISTRST = STRNGS(1)
+      IENTRY(1) = 0
+      COMB   = COMBIN(1)
+      DO 10 I = 1,NMOT
+        MATCHP(I) = 0
+10    CONTINUE
+      MAXSS = MINMSS
+C
+C
+C
+C
+C
+100   CONTINUE
+C
+C
+C  THIS A CLASS CLASS MOTIF, PERFORM THE APPROPRIATE SEARCH IF THE START 
+C  POSITION IS >0. (IF IT IS NOT THE CURRENT MOTIF IS A NOT THAT HAS
+C  ALREADY BEEN SEARCHED FOR
+C
+      IFOUND = 0
+      IF(START(MOTIF).GT.0)THEN
+C
+C
+      IF(ICLASS.EQ.1)THEN
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),0)
+      ELSE IF(ICLASS.EQ.2)THEN
+        CALL MOTIF2(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND)
+      ELSE IF(ICLASS.EQ.3)THEN
+        CALL MOTFI3(SEQI,IDSEQ,STRNGI(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,MAT3,
+     +  IDME)
+      ELSE IF(ICLASS.EQ.4)THEN
+        CALL MOTFI4(SEQI,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.5)THEN
+        CALL MOTFI4(SEQI,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.6)THEN
+        CALL MOTIF6(SEQI,IDSEQ,MAT4,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDM)
+      ELSE IF(ICLASS.EQ.7)THEN
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),1)
+      ELSE IF(ICLASS.EQ.8)THEN
+      CALL MOTIF8(SEQI,IDSEQ,MAT1,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDM)
+      ELSE
+        WRITE(KBOUT,*)'UNKNOWN CLASS!!'
+      END IF
+C
+C
+      END IF
+C
+C
+C   MATCH FOUND WHEN MATCH WANTED ?
+C
+C
+C
+      IF(((IFOUND.EQ.0).AND.(COMB.NE.'N')).OR.
+     +   ((IFOUND.GT.0).AND.(COMB.EQ.'N')))THEN
+C
+C   NO SO GO BACK OR SIDEWAYS ONE MOTIF
+C
+C
+        CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +  RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +  COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+        IF(IRET.NE.0) THEN
+          IF(MAXSS.GT.MINMSS) THEN
+            CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +      CLASS,MATCHQ,IOPT,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +      TITLE,FILNAM,
+     +      PMINT,PMAXT,PROBT,IDM,MAT1,IDM,MAT2,IDME,MAT3,IDM,
+     +      MAT4,IDM,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,
+     +      IDEVN,KEYNS,BESTP,BESTQ,BESTS,2,MAXSS)
+            ITOTAL = ITOTAL + 1
+          END IF
+          RETURN
+        END IF
+C
+C
+      ELSE
+C
+C
+C   MATCH FOUND. 
+C
+C
+C   PREPARE FOR NEXT SEARCH THIS MOTIF BY INCREMENTING POINTER
+C   TO SEARCH RANGE (NOT FOR CLASS 6 WHICH IS HANDLED BY MOTIF6)
+C
+C
+       IF(COMB.EQ.'N')THEN
+         START(MOTIF) = -9
+       ELSE
+         IF((ICLASS.NE.6).AND.(ICLASS.NE.8))
+     +     START(MOTIF) = MATCHP(MOTIF) + 1
+         IF(ICLASS.EQ.7)START(MOTIF) = MATCHP(MOTIF) + CUTOFF(MOTIF)
+       END IF
+C
+C   TRY GOING FORWARD ONE MOTIF
+C
+C
+      CALL FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +  RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +  RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +  COMBIN,COMB)
+C
+C
+C
+C   IS THIS THE LAST MOTIF? IF SO DISPLAY THE MATCH
+C
+C
+        IF(IDSPLY.EQ.1)THEN
+          IF(JOPT.NE.1) ITOTAL = ITOTAL + 1
+          CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +    CLASS,MATCHQ,IOPT,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +    TITLE,FILNAM,
+     +    PMINT,PMAXT,PROBT,IDM,MAT1,IDM,MAT2,IDME,MAT3,IDM,
+     +    MAT4,IDM,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,
+     +    IDEVN,KEYNS,BESTP,BESTQ,BESTS,JOPT,MAXSS)
+C
+C
+C  HORRIBLE SPECIAL CASE - IF LAST MOTIF IS NOTTED WE MUST MOVE BACK AFTER
+C  DISPLAY
+          IF(COMB.EQ.'N')THEN
+            CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +      MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +      RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +      COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+            IF(IRET.NE.0) THEN
+              IF(MAXSS.GT.MINMSS) THEN
+                CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +          CLASS,MATCHQ,IOPT,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +          TITLE,FILNAM,
+     +          PMINT,PMAXT,PROBT,IDM,MAT1,IDM,MAT2,IDME,MAT3,IDM,
+     +          MAT4,IDM,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,
+     +          IDEVN,KEYNS,BESTP,BESTQ,BESTS,2,MAXSS)
+                ITOTAL = ITOTAL + 1
+              END IF
+              RETURN
+            END IF
+          END IF
+        END IF
+C
+C
+      END IF
+C
+C
+C   GO BACK FOR NEXT SEARCH
+C
+C
+      GO TO 100
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF8(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),REPEET
+      REAL MATCHS
+      EXTERNAL REPEET
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL REPEAT I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A REPEAT LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.ICUT)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ-2*LENGTH+1,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I + I3STAR -1
+        IF((LOOPJ1+LENGTH-1).GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ-LENGTH+1,I+I3END-1)
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN REPEAT NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES FORWARDS
+C
+         ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+      END
+C*********************************************************************
+      INTEGER FUNCTION REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM)
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES FORWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P-1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 + 1
+C      WRITE(*,*)'I5,I3',I5,I3
+       L5 = SEQ(I5)
+       L3 = SEQ(I3)
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      REPEET = L
+      END
+      SUBROUTINE PADOUT(IDEV,CHAR,LINLEN,NCHAR)
+      CHARACTER CHAR
+C  HOW MANY LINES?
+      NLINE = 1 + (NCHAR-1)/LINLEN
+      K1 = 1
+      K2 = MIN(NCHAR,LINLEN)
+      DO 10 I = 1,NLINE
+        WRITE(IDEV,1000,ERR=20)(CHAR,K=K1,K2)
+        K1 = K2 + 1
+        K2 = K1 + LINLEN - 1
+        K2 = MIN(K2,NCHAR)
+10    CONTINUE
+20    CONTINUE
+1000  FORMAT(' ',60A1)
+      END
+C*********************************************************************
+      SUBROUTINE MOTFI3(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND,MATRIX,IDM)
+      INTEGER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER MATRIX(IDM,IDM)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFTI5(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND,MATRIX,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE SQFTI5(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND,MATRIX,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER MATRIX(IDM,IDM)
+      MINSCR = MINSC
+C
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+C   TRY ALL POSSIBLE POSITIONS FOR MATCHING AND SCORE FOR EACH
+C   POINT TO ARRAY ELEMENT CORRESPONDING TO FIRST BASE
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + MATRIX(SEQ(IP),STRING(J))
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTFI4(SEQ,IDIM1,LENGTH,ISTART,IEND,
+     +WEIGHT,CUTOFF,MATCHP,MATCHS,IFOUND,IDM)
+      REAL WEIGHT(IDM,LENGTH)
+      REAL MATCHS
+      INTEGER SEQ(IDIM1)
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.LENGTH)RETURN
+      DO 10 I=ISTART,ISTART+L1-LENGTH
+        SUM = 0.
+        K = 0
+        DO 5 J=I,I+LENGTH-1
+          K = K + 1
+          SUM = SUM + WEIGHT(SEQ(J),K)
+ 5        CONTINUE
+          IF(SUM.GE.CUTOFF) THEN
+            MATCHP = I
+            MATCHS = SUM
+            IFOUND = I
+            RETURN
+          END IF
+ 10       CONTINUE
+      IFOUND = 0
+      END
+C*********************************************************************
+      SUBROUTINE FMOTI4(SEQ,IDIM,WT,LENGTH,CUTOFF,SUM,IFOUND,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIM)
+      REAL WT(IDM,LENGTH)
+      DO 10 I=1,IDIM-LENGTH+1
+        SUM=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          SUM=SUM+WT(SEQ(J),K)
+5       CONTINUE
+        IF(SUM.GE.CUTOFF)THEN
+        IFOUND = I
+        RETURN
+        END IF
+10    CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF6(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM)
+      REAL MATCHS
+      EXTERNAL LOOP
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL STEM I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A STEM LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR-1).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.CUTOFF)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR-1).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I+I3STAR-1
+        IF(LOOPJ1.GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ,I+I3END-1)
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN LOOP NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES BACKWARDS
+C
+         ISUM = LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+      END
+C*********************************************************************
+      INTEGER FUNCTION LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM)
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES BACKWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P+1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 - 1
+       L5 = SEQ(I5)
+       L3 = SEQ(I3)
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      LOOP = L
+      END
+      SUBROUTINE SETCMP(COMPIN,IDM)
+      REAL COMPIN(IDM)
+      PARAMETER (MAXCHR = 17)
+      COMMON /COMPC/COMP(MAXCHR)
+      SAVE /COMPC/
+      DO 10 I = 1,MAXCHR
+        COMP(I) = 0.0
+10    CONTINUE
+      T = 0.
+      DO 20 I = 1,IDM
+        COMP(I) = COMPIN(I)
+        T = T + COMPIN(I)
+20    CONTINUE
+      DO 30 I = 1,IDM
+        COMP(I) = COMP(I) / T
+30    CONTINUE
+      END
+C*****************************************************************
+      SUBROUTINE DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEV,
+     +CLASS,MATCHQ,IOPT,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +TITLE,FILNAM,
+     +    PMINT,PMAXT,PROBT,IDM,MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,
+     +    MAT4,IDMAT4,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,
+     +    RANGES,RANGEL,IDEV1,KEYNS,BESTP,BESTQ,BESTS,JOPT,MAXSS)
+      INTEGER RANGES(NMOT),RANGEL(NMOT)
+      INTEGER MATCHP(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER MATCHQ(NMOT)
+      CHARACTER SEQ(IDSEQ),TITLE*(*),FILNAM*(*)
+      CHARACTER*(*) KEYNS(NMOT)
+      REAL MATCHS(NMOT),MINSCR,MAXSCR
+      INTEGER WTSTR(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      CHARACTER DASH
+C stuff for best
+      INTEGER BESTP(NMOT),BESTQ(NMOT)
+      REAL BESTS(NMOT),MAXSS
+      EXTERNAL PSCORE
+      SAVE DASH
+      DATA DASH/'-'/
+C jopt 1 get best match for any individual sequence, then
+C display it. So check each match for being best (>maxss), if it is 
+C save its coords in bestp, bestq. Keep a note that a score
+C has been recorded for this sequence (actually noted by bestp(1) ne.0)
+C when we finish a sequence (denoted by jopt = 2) put all the
+C saved values into matchp, matchq and process as normal.
+      IF(JOPT.NE.0) THEN
+        IF(JOPT.EQ.2) THEN
+          DO 5 I=1,NMOT
+            MATCHP(I) = BESTP(I)
+            MATCHQ(I) = BESTQ(I)
+            MATCHS(I) = BESTS(I)
+ 5          CONTINUE
+        ELSE IF(JOPT.EQ.1) THEN
+C  Add scores
+          T = 0.
+          DO 6 I = 1,NMOT
+            IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+ 6        CONTINUE
+          IF(T.GT.MAXSS) THEN
+            DO 7 I=1,NMOT
+              BESTP(I) = MATCHP(I)
+              BESTQ(I) = MATCHQ(I)
+              BESTS(I) = MATCHS(I)
+ 7            CONTINUE
+            MAXSS = T
+          END IF
+          RETURN
+        END IF
+      END IF
+C  Add scores
+      T = 0.
+      DO 10 I = 1,NMOT
+        IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+10    CONTINUE
+C
+C If a probability cut off set calc its value
+C
+      POBS = 1.0
+      IF(PMINC.LT.1.0)THEN
+        DO 20 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            CALL GETP(CLASS(I),SEQ(MATCHP(I)+KSTART-1),LENGTH(I),
+     +      IDM,MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +      WEIGHT(MAX(1,WTSTR(I))))
+            PROB = PSCORE(MATCHS(I))
+            POBS = POBS * PROB
+          END IF
+20      CONTINUE
+        IF((PMINC.LT.1.0).AND.(POBS.GT.PMINC))RETURN
+      END IF
+C
+      IF(T.GT.MAXSCR) MAXSCR = T
+      IF(T.LT.MINSCR) MINSCR = T
+      IF(T.LT.CUTSCR) RETURN
+C
+C Write out mofif by motif
+C
+      IF(IOPT.EQ.1)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+1001    FORMAT(' >',A,' ',F10.3,' ',A)
+        DO 100 I=1,NMOT
+          J = I
+C  Check for no match (needed for ored motifs)
+          IF(MATCHP(J).NE.0)THEN
+            WRITE(IDEV,1000)MATCHP(J)+KSTART-1,MATCHS(J),KEYNS(I)
+            WRITE(IDEV,1002)(SEQ(K),K=MATCHP(J),MATCHP(J)+LENGTH(J)-1)
+C  Repeat ?
+            IF(CLASS(J).EQ.8)THEN
+              WRITE(IDEV,1002)(SEQ(K),K=MATCHQ(J),MATCHQ(J)+LENGTH(J)-1)
+              WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+            END IF
+C  Stem ?
+            IF(CLASS(J).EQ.6)THEN
+              WRITE(IDEV,1002)
+     +        (SEQ(K),K=MATCHQ(J),MATCHQ(J)-LENGTH(J)+1,-1)
+              WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+            END IF
+          END IF
+100     CONTINUE
+1000    FORMAT(' ',I7,' ',F10.3,' ',A8)
+1002    FORMAT('       ',60A1)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+1004    FORMAT(' Probability =',E10.4)
+        RETURN
+      END IF
+C
+C write only name, score, title
+C
+      IF(IOPT.EQ.3)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+      END IF
+C
+C write name, score, title and the inclusive match
+C
+      IF(IOPT.EQ.2)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        MINP = 999999
+        MAXP = -999999
+        DO 300 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C  Inverted repeat ?
+            IF(CLASS(I).EQ.6) K = MATCHQ(I)
+C  Repeat ?
+            IF(CLASS(I).EQ.8) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+300     CONTINUE
+        WRITE(IDEV,1000)MINP+KSTART-1
+        WRITE(IDEV,1002)
+     +  ((SEQ(K1),K1=K2,MIN(K2+59,MAXP)),K2=MINP,MAXP,60)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+        RETURN
+      END IF
+C
+C write out whole seq to a new file
+C
+      IF(IOPT.EQ.4)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        CALL OPENRS(IDEV1,FILNAM,IOK,LLL,1)
+        IF(IOK.NE.0)THEN
+          WRITE(IDEV,*)' Error opening sequence file'
+          RETURN
+        END IF
+C  Write up to and including first motif
+        J1 = MATCHP(1)+LENGTH(1)-1
+        WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=1,J1)
+        DO 400 I=2,NMOT
+          J = I
+C  Put dashes for next gap in now. what is max gap?
+          IMAXG = RANGES(J) - LENGTH(J-1) + RANGEL(J) - LENGTH(J)
+C  What is actual gap?
+          IG = MATCHP(J) - MATCHP(J-1) - LENGTH(J-1)
+C  Want to put in difference number of dashes
+          IDASH = IMAXG - IG
+          IF(IDASH.GT.0)CALL PADOUT(IDEV1,DASH,60,IDASH)
+C  Write up to and including next motif
+          J2 = MATCHP(J) + LENGTH(J) - 1
+          WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=J1+1,J2)
+          J1 = J2
+400     CONTINUE
+C  Write to end of sequence
+        WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=J1+1,IDSEQ)
+401     CONTINUE
+        CLOSE(UNIT=IDEV1)
+1005    FORMAT(' ',60A1)
+      END IF
+C
+C write inclusive to new file
+C
+      IF(IOPT.EQ.5)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        CALL OPENRS(IDEV1,FILNAM,IOK,LLL,1)
+        IF(IOK.NE.0)THEN
+          WRITE(IDEV,*)' Error opening sequence file'
+          RETURN
+        END IF
+C  Write first motif
+        J1 = MATCHP(1)+LENGTH(1)-1
+        WRITE(IDEV1,1005,ERR=501)(SEQ(K),K=MATCHP(1),J1)
+        DO 500 I=2,NMOT
+          J = I
+C  Put dashes for next gap in now. what is max gap?
+          IMAXG = RANGES(J) - LENGTH(J-1) + RANGEL(J) - LENGTH(J)
+C  What is actual gap?
+          IG = MATCHP(J) - MATCHP(J-1) - LENGTH(J-1)
+C  Want to put in difference number of dashes
+          IDASH = IMAXG - IG
+          IF(IDASH.GT.0)CALL PADOUT(IDEV1,DASH,60,IDASH)
+C  Write up to and including next motif
+          J2 = MATCHP(J) + LENGTH(J) - 1
+          WRITE(IDEV1,1005,ERR=501)(SEQ(K),K=J1+1,J2)
+          J1 = J2
+500     CONTINUE
+501     CONTINUE
+        CLOSE(UNIT=IDEV1)
+      END IF
+      END
diff --git a/src/staden/nipmen.f b/src/staden/nipmen.f
new file mode 100644
index 0000000..24e98d8
--- /dev/null
+++ b/src/staden/nipmen.f
@@ -0,0 +1,151 @@
+C  nipmen
+C  9-7-90 Reorganised lists
+      SUBROUTINE MENU(OPT,KOPT,MOPT,MAXOPT,MINMEN,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+      INTEGER OPT
+      CHARACTER HELPF*(*)
+1     CONTINUE
+      CALL CLEARV
+      IF(MOPT.EQ.0)THEN
+      WRITE(KBOUT,5000)
+5000  FORMAT(' Menus and their numbers are '/
+     +' m0 = This menu'/
+     +' m1 = General'/
+     +' m2 = Screen control'/
+     +' m3 = Statistical analysis of content'/
+     +' m4 = Structures and repeats'/
+     +' m5 = Translation and codons'/
+     +' m6 = Gene search by content'/
+     +' m7 = General signals'/
+     +' m8 = Specific signals'/
+     +'  ? = Help'/
+     +'  ! = Quit')
+1001  FORMAT(
+     +' ? Menu or option number=',$)
+      ELSE IF(MOPT.EQ.-1)THEN
+              WRITE(KBOUT,5001)
+5001            FORMAT(' General menu'/
+     +          '  0 = List of menus'/
+     +          '  ? = Help'/
+     +          '  ! = Quit'/
+     1          '  3 = Read a new sequence'/
+     1          '  4 = Define active region'/
+     +          '  5 = List the sequence'/
+     +          '  6 = List a text file'/
+     +          '  7 = Direct output to disk'/
+     +          '  8 = Write active sequence to disk'/
+     +          '  9 = Edit the sequence'/
+     +          ' 17 = Search for restriction enzymes'/
+     +          ' 29 = Reverse and complement the active sequence'/
+     +          ' 66 = Exchange t and u')
+      ELSE IF(MOPT.EQ.-2)THEN
+                WRITE(KBOUT,5002)
+5002            FORMAT(' Screen control menu'/
+     +          '  0 = List of menus'/
+     +          ' 10 = Clear graphics'/
+     +          ' 11 = Clear text'/
+     +          ' 12 = Draw a ruler'/
+     +          ' 13 = Use crosshair'/
+     +          ' 14 = Reposition plots'/
+     +          ' 15 = Label diagram'/
+     +          ' 16 = Display a map')
+      ELSE IF(MOPT.EQ.-3)THEN
+                WRITE(KBOUT,5005)
+5005            FORMAT(' Statistical analysis of content'/
+     +          '  0 = List of menus'/
+     +          ' 21 = Count bases'/
+     +          ' 22 = Count dinucleotides'/
+     +          ' 24 = Plot base composition'/
+     2          ' 25 = Plot local deviations in base composition'/
+     3          ' 26 = Plot local deviations in dinucleotide',
+     +          ' composition'/
+     4          ' 27 = Plot local deviations in trinucleotide',
+     +          ' composition'/
+     3          ' 38 = Examine repeats'/
+     5          ' 59 = Plot negentropy'/
+     6          ' 64 = Plot observed-expected word frequencies')
+      ELSE IF(MOPT.EQ.-4)THEN
+                WRITE(KBOUT,5007)
+5007            FORMAT(' Structures and repeats menu'/
+     +          '  0 = List of menus'/
+     5          ' 30 = Search for hairpin loops'/
+     1          ' 31 = Search for long range inverted repeats'/
+     1          ' 32 = Search for repeats'/
+     1          ' 33 = Search for z dna (total ry,yr)'/
+     1          ' 34 = Search for z dna (runs of ry or yr)'/
+     1          ' 35 = Search for z dna (best phased value)'/
+     1          ' 36 = Search for local similarity or complementarity'/
+     3          ' 38 = Examine repeats')
+      ELSE IF(MOPT.EQ.-5)THEN
+                WRITE(KBOUT,5006)
+5006            FORMAT(' Translation and codon menu'/
+     +          '  0 = List of menus'/
+     +          ' 23 = Count codons, amino acids and bases',/,
+     +          ' 28 = Calculate codon constraint',/,
+     +          ' 37 = Set genetic code'/
+     1          ' 39 = Translate and list in up to six phases'/
+     5          ' 40 = Translate and write protein sequence to disk'/
+     5          ' 41 = Write codon table to disk',/,
+     +          ' 54 = Search for open reading frames')
+      ELSE IF(MOPT.EQ.-6)THEN
+                WRITE(KBOUT,5003)
+5003            FORMAT(' Gene search by content'/
+     +          '  0 = List of menus'/
+     1          ' 42 = Codon usage method'/
+     5          ' 43 = Positional base preferences method'/
+     5          ' 44 = Uneven positional base frequencies method'/
+     4          ' 45 = Codon improbability on base composition'/
+     4          ' 46 = Codon improbability on amino acid composition'/
+     2          ' 47 = Shepherd RNY preference method'/
+     3          ' 48 = Ficketts method'/
+     5          ' 49 = tRNA gene search')
+      ELSE IF(MOPT.EQ.-7)THEN
+                WRITE(KBOUT,5004)
+5004            FORMAT(' General signals menu'/
+     +          '  0 = List of menus'/
+     +          ' 18 = Compare a short sequence'/
+     +          ' 19 = Compare a short sequence using a score matrix'/
+     +          ' 20 = Search using a weight matrix'/
+     1          ' 50 = Plot start codons'/
+     2          ' 51 = Plot stop codons'/
+     3          ' 52 = Plot stop codons on the complementary strand'/
+     3          ' 53 = Plot stop codons on both strands'/
+     +          ' 60 = Search using a dinucleotide weight matrix'/
+     +          ' 63 = Search the complementary strand using a ',
+     +                'weight matrix'/
+     +          ' 67 = Search for patterns of motifs')
+      ELSE IF(MOPT.EQ.-8)THEN
+                WRITE(KBOUT,5008)
+5008            FORMAT(' Eukaryotic signal search'/
+     +          '  0 = List of menus'/
+     3          ' 55 = Search for E. coli promoters'/
+     3          ' 56 = Search for E. coli promoters',
+     +                ' on the complementary strand'/
+     3          ' 57 = Search for E. coli promoters (-35 and -10)'/
+     4          ' 58 = Search for prokaryotic ribosome binding sites'/
+     5          ' 61 = Search for eukaryotic ribosome binding sites'/
+     6          ' 62 = Search for splice junctions'/
+     6          ' 65 = Search for  polya sites')
+      END IF
+3     CONTINUE
+      WRITE(KBOUT,1001)
+      CALL GETOPT(KBIN,KOPT,OPT)
+      IF((OPT.LT.MINMEN).OR.(OPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(OPT.GE.0))THEN
+        CALL HELP2(IHELPS(OPT),IHELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(OPT.EQ.0) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF(OPT.LT.0) THEN
+        MOPT = OPT
+        GO TO 1
+      END IF
+      END
diff --git a/src/staden/nxhelpmenu.c b/src/staden/nxhelpmenu.c
new file mode 100644
index 0000000..90a16b2
--- /dev/null
+++ b/src/staden/nxhelpmenu.c
@@ -0,0 +1,157 @@
+#include <stdio.h>
+#include <string.h>
+#include <stdlib.h>
+#include <sys/types.h>
+#include "helpnmenu.h"
+#include "nxhelpmenu.h"
+#include "userface.h"
+#include "fort.h"
+
+static int linecount = 0;
+static int currentmenu = 0;
+
+/*
+ * Returns 0 for ok,
+ * -1 for cancel.
+ */
+static int scroll(char *text) {
+    printf("%s", text);
+    linecount++;
+
+    if (linecount > 20) {
+	linecount = 0;
+	return bpause();
+    }
+    return 0;
+}
+
+void menu_x(int_f *OPT_p,
+	    int_f *KOPT_p,
+	    int_f *MOPT_p,
+	    int_f *MAXOPT_p,
+	    int_f *MINMEN_p,
+	    int_f *KBIN_p,
+	    int_f *KBOUT_p,
+	    int_f *IHELPS_p,
+	    int_f *IHELPE_p,
+	    char  *HELPF_p,
+	    int_f *IDEVH_p,
+	    int_fl HELPF_l)
+/*
+ * Display the menu dialogue and get the number of a function.
+ * OPT       number of function selected
+ * KOPT=0 => OK
+ * KOPT=1 => dialogue wanted
+ */
+{
+    int m, i, mnum;
+    menuarr menu[MAXOPTS];
+
+    memset((void *)menu, 0, MAXOPTS*sizeof(menuarr));
+    /* should we ignore this arg totally!? */
+    /* mnum = *MOPT_p; */
+    mnum = currentmenu;
+    do {
+	m = create_menu(mnum, menu, MAXOPTS);
+	do {
+	    puts("");
+	    if (mnum == 0)
+		puts(" Menus and their numbers are");
+	    else
+		printf(" %s menu\n", (helpindex - mnum)->name);
+	    for (i=0; i<m; i++) {
+		printf(" %s = %s\n", menu[i].optname, menu[i].name);
+	    }
+	    printf(" ? Menu or option number=");
+	    set_opt(-1);
+	    *OPT_p = getcopt(KOPT_p);
+	    /* reprint menu if help requested */
+	} while (*KOPT_p == -1 || *KOPT_p == 2);
+	if (*OPT_p > *MAXOPT_p || *OPT_p < *MINMEN_p)
+	    mnum = 0;
+/*
+ * This next bit was here to *always* stay in the same menu until we chose
+ * another. Now pressing return will take us back to root menu.
+	else if (*KOPT_p == 3) {
+	    mnum = -*OPT_p;
+	    currentmenu = mnum;
+	} else if (*OPT_p == 0)
+	    mnum = currentmenu;
+*/
+	else if (*OPT_p <= 0) {
+	    mnum = (int)-*OPT_p;
+	    currentmenu = mnum;
+	}
+    } while (*OPT_p <= 0 || *OPT_p > *MAXOPT_p);
+    set_opt((int)*OPT_p);
+    showfunc();
+}
+
+void help() {
+    if (query_opt() == -1)
+	ihelp();
+    else
+	help2(query_opt());
+}
+
+void ihelp() {
+    int i, status;
+    static int level = 0;
+    int oldopt = query_opt();
+
+    /* Avoid interactive help recursion */
+    if (level == 1)
+	return;
+
+    set_opt(-2);
+    level++;
+    do {
+	puts("\n For information on any option type its option number.");
+	puts(" In addition,");
+	puts("             0 = Introduction to the program,");
+	puts("             ? = List of options");
+	i = getint(0, maxopts, 1, "Option number", &status);
+	if (status == 0) {
+	    help2(i);
+	    if (i == 1) {
+		char hbuf[256];
+		int hcont = 0, j = 0;
+
+		/* now display list of options */
+		while (helptopics[j] != NULL && hcont == 0) {
+		    sprintf(hbuf, "\t%2d\t%s\n",
+			    optTransTab[j], helptopics[j]);
+		    j++;
+		    hcont = scroll(hbuf);
+		}
+	    }
+	}
+    } while (status == 0);
+
+    level--;
+    set_opt(oldopt);
+}
+
+void help_x() {
+    ihelp();
+}
+
+void help2(int optnum) {
+    char *cptr;
+
+    linecount = 0;
+    while (cptr = gethelp2(optnum))
+	if (scroll(cptr) == -1) {
+	    /* waste away unread help */
+	    while (cptr = gethelp2(optnum));
+	    break;
+	}
+}
+
+void help2_x() {
+    
+}
+
+void sethlp_x() {
+}
+
diff --git a/src/staden/nxhelpmenu.h b/src/staden/nxhelpmenu.h
new file mode 100644
index 0000000..0cea706
--- /dev/null
+++ b/src/staden/nxhelpmenu.h
@@ -0,0 +1,8 @@
+#define help_x help_
+#define help2_x help2_
+#define menu_x menu_
+#define sethlp_x sethlp_
+
+void help();
+void help2(int topic);
+void ihelp();
diff --git a/src/staden/nxspec.f b/src/staden/nxspec.f
new file mode 100644
index 0000000..d0b0067
--- /dev/null
+++ b/src/staden/nxspec.f
@@ -0,0 +1,31 @@
+C  NonX specific bits
+C   AUTHOR: RODGER STADEN
+      SUBROUTINE INITRS()
+      END
+C
+C   BELL
+C       SUBROUTINE TO RING BELL N TIMES
+      SUBROUTINE BELL(N,KBOUT)
+C   AUTHOR: RODGER STADEN
+      SAVE NUL
+      DATA NUL/0/
+      DO 1 I=1,N
+1     WRITE(KBOUT,1000)NUL,CHAR(7)
+1000  FORMAT(A1,A)
+      END
+      INTEGER FUNCTION XVERSN()
+C
+C set xversn to false for xterm version
+C
+      XVERSN = 0
+      END
+      SUBROUTINE UPDOUT()
+      END
+      SUBROUTINE CXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +IDBSIZ,LINCON,LNBRL,IRNO,LNGTHI,
+     +PERCD,IDM,J,IOK)
+C
+C contig editor (note most of these parameters are not used by it anyway!!)
+C
+      END
diff --git a/src/staden/patternn.f b/src/staden/patternn.f
new file mode 100644
index 0000000..3a984e6
--- /dev/null
+++ b/src/staden/patternn.f
@@ -0,0 +1,1151 @@
+C 24-9-90 Changed definition of xmax from xmax=idseq to xmax=idseq+kstart-1
+C         because a match with j1=53000, j2=59000 did not get plotted. Note
+C         that kstart is called j1 in main
+C    5-11-90  Very many changes for addition of file of file names, plus
+C             replacement of calls to radio by radion
+C 18-7-91 added titles to pattern files
+C 2-3-92 set filnam = ' ' for calls to openf1
+      SUBROUTINE PATTEN(SEQ,IDSEQ,STRING,MAXSTR,
+     +LENGTH,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,IENTRY,
+     +START2,IEND2,WTSTR,START,IEND,MATCHQ,RELEND,MATCHP,
+     +STRNGS,LAST5,LAST3S,LAST3E,MATCHS,CUTOFF,WEIGHT,FILNAM,
+     +MAXMOT,MAXWTS,
+     +IDEV1,IDEV2,IDEV3,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KSTART,KBIN,KBOUT,
+     +IDEV4,IDM,COMBIN,IDME,MAT1,MAT2,MAT3,MAT4,NAMSAV,KEYNS,
+     +IHELPS,IHELPE,HELPF,IDEVH,FOFNAM,IDEVFF)
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT)
+      INTEGER RANGET(MAXMOT),RANGEM(MAXMOT),IENTRY(MAXMOT)
+      INTEGER START2(MAXMOT),IEND2(MAXMOT)
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      INTEGER WTSTR(MAXMOT),START(MAXMOT),IEND(MAXMOT)
+      INTEGER MATCHQ(MAXMOT),RELEND(MAXMOT)
+      INTEGER MATCHP(MAXMOT),STRNGS(MAXMOT)
+      INTEGER LAST5(MAXMOT),LAST3S(MAXMOT),LAST3E(MAXMOT)
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT),MATCHS(MAXMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER FILNAM*(*),HELPF*(*),FOFNAM*(*),TITLE*80
+      CHARACTER COMBIN(MAXMOT)
+      CHARACTER*(*) NAMSAV(MAXMOT),KEYNS(MAXMOT)
+      PARAMETER (MAXCLS = 8)
+      PARAMETER (SMALL=1.0E-5)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+C
+      PARAMETER (MAXPRM = 30)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      WRITE(KBOUT,*)'  Pattern searcher'
+      JDEV = KBIN
+      PROMPT(1) = 'Use keyboard'
+      PROMPT(2) = 'Use pattern file'
+      PROMPT(3) = 'Use file of pattern file names'
+      IN = 1
+      CALL RADION('Select pattern definition mode',PROMPT,3,IN,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.1) RETURN
+      IF(IN.EQ.2)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+      ELSE IF(IN.EQ.3) THEN
+        FOFNAM = ' '
+        CALL OPENF1(IDEVFF,FOFNAM,0,IOK,KBIN,KBOUT,
+     +  'File of pattern file names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+      END IF
+      IOPT = 1
+      PROMPT(1) = 'Motif by motif'
+      PROMPT(2) = 'Inclusive'
+      PROMPT(3) = 'Graphical'
+      PROMPT(4) = 'EMBL feature table'
+      CALL RADION('Select results display mode',PROMPT,4,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) GO TO 201
+      IF((IN.EQ.3).AND.(IOPT.EQ.3)) IOPT = 2
+      PMINC = 1.
+      CUTSCR = -99999.
+      NPAT = 0
+      IPROB = 0
+      NTOTAL = 0
+      JTOTAL = 0
+      CALL SETCMP(SEQ,IDSEQ,IDME)
+      IF(IN.EQ.3) THEN
+        IPROB = 1
+        CALL BUSY(KBOUT)
+      END IF
+10    CONTINUE
+      IF(IN.EQ.3) THEN
+        READ(IDEVFF,1000,ERR=100,END=200)FOFNAM
+        CALL OPENRS(IDEV3,FOFNAM,IOK,LRECL,2)
+        IF(IOK.NE.0) GO TO 200
+        NPAT = NPAT + 1
+      END IF
+1000  FORMAT(A)
+C  ZERO ARRAYS
+C
+        CALL FILLI(LENGTH,MAXMOT,0)
+        CALL FILLI(CLASS,MAXMOT,0) 
+        CALL FILLI(RELMOT,MAXMOT,0)
+        CALL FILLI(RANGES,MAXMOT,0)
+        CALL FILLI(RANGEL,MAXMOT,0)
+        CALL FILLI(RANGET,MAXMOT,0)
+        CALL FILLI(RANGEM,MAXMOT,0)
+        CALL FILLI(IENTRY,MAXMOT,0)
+        CALL FILLI(START2,MAXMOT,0)
+        CALL FILLI(IEND2,MAXMOT,0) 
+        CALL FILLI(WTSTR,MAXMOT,0) 
+        CALL FILLI(START,MAXMOT,0) 
+        CALL FILLI(IEND,MAXMOT,0)  
+        CALL FILLI(MATCHQ,MAXMOT,0)
+        CALL FILLI(RELEND,MAXMOT,0)
+        CALL FILLI(MATCHP,MAXMOT,0)
+        CALL FILLI(STRNGS,MAXMOT,0)
+        CALL FILLI(LAST5,MAXMOT,0) 
+        CALL FILLI(LAST3S,MAXMOT,0)
+        CALL FILLI(LAST3E,MAXMOT,0)
+        CALL FILLR(CUTOFF,MAXMOT,0.0)
+        CALL FILLR(MATCHS,MAXMOT,0.0)
+        CALL FILLR(WEIGHT,MAXWTS,0.0)
+        CALL FILLC(COMBIN,MAXMOT,'A')
+        ITOTAL = 0
+C
+C
+C
+C  GET MOTIF DEFINITIONS
+C
+C  RETURN STRING LENGTH FOR COMPATIBILITY WITH LIB SEARCH
+      NSTRNG = MAXSTR
+      IF(IN.EQ.1) THEN
+        CALL GETMK(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,
+     +  MAT1,IDM,MAT2,IDME,MAT3,IDME,MAT4,IDM,
+     +  PMINT,PMAXT,PROBT,EXPTT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,TITLE)
+      ELSE 
+        CALL GETMF(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,
+     +  MAT1,IDM,MAT2,IDME,MAT3,IDME,MAT4,IDM,
+     +  PMINT,PMAXT,PROBT,EXPTT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,IPROB,TITLE)
+      END IF
+      IF(IOK.NE.0) GO TO 201
+      IF(NMOT.LT.1) GO TO 201
+C
+C
+C  DISPLAY THE SIGNAL DESCRIPTION
+C
+      RANGES(1) = 1
+      IF(IN.NE.3) THEN
+        CALL DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLE)
+C
+C
+        WRITE(KBOUT,2003)PROBT
+2003    FORMAT(' Probability of finding pattern = ',E10.4)
+        WRITE(KBOUT,2004)EXPTT
+2004    FORMAT(' Expected number of matches  = ',E10.4)
+        CALL GETRL(0.,1.,1.0,'Maximum pattern probability',
+     +  XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        PMINC = XP
+        CALL GETRL(-9999.,9999.,-9999.,'Minimum pattern score',
+     +  XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTSCR = XP
+        MINSCR = 9999999.
+        MAXSCR = -9999999.
+        YMIN = LOG(1.-PMINT)
+        YMAX = LOG(1.-PMAXT)
+        YDEL = ABS(YMIN - YMAX)
+        IF(YDEL.LT.SMALL)THEN
+          YMAX = YMAX + SMALL
+          YMIN = YMIN - SMALL
+        END IF
+        YDEL = ABS(YMIN - YMAX) * 0.1
+        YMIN = YMIN - YDEL
+        YMAX = YMAX + YDEL
+        CALL BUSY(KBOUT)
+      END IF
+      IF(IOPT.EQ.3)THEN
+        XMAX = IDSEQ + KSTART - 1
+        XMIN = KSTART
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL CLEARV
+      END IF
+C
+C  DO THE SEARCH
+C
+      CALL SRCSIG(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATCHQ,RELEND,
+     +IDEV1,LAST5,LAST3S,LAST3E,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,IOPT,XMAX,XMIN,YMAX,YMIN,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,IDME,PMINT,PMAXT,PROBT,MAT1,MAT2,MAT3,
+     +MAT4,PMINC,KEYNS)
+      CALL VT100M
+      IF(IN.EQ.3) THEN
+        IF(ITOTAL.NE.0) THEN
+          WRITE(IDEV1,1003)TITLE
+ 1003     FORMAT(' ',A)
+          WRITE(IDEV1,1001)FOFNAM,ITOTAL
+1001      FORMAT(' ',A,' ',I6,/)
+          NTOTAL = NTOTAL + ITOTAL
+          JTOTAL = JTOTAL + 1
+        END IF
+        GO TO 10
+      END IF
+      WRITE(KBOUT,1002)ITOTAL
+1002  FORMAT(' Total matches found',I7)
+      IF((ITOTAL.GT.0).AND.(IN.NE.3))WRITE(KBOUT,1006)MINSCR,MAXSCR
+1006  FORMAT(' Minimum and maximum observed scores',2F12.2)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error reading file of file names'
+200   CONTINUE
+      WRITE(KBOUT,*)'Of the ',NPAT,' patterns processed'
+      WRITE(KBOUT,*)JTOTAL,' gave a total of',NTOTAL,' matches'
+201   CONTINUE
+      CLOSE(UNIT=IDEVFF)
+      END
+      SUBROUTINE GETMK(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +PMINT,PMAXT,PROBT,EXPTT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,TITLE)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),TCLASS
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      CHARACTER*(*) KEYNS(MAXMOT)
+      CHARACTER*(*) NAMSAV(MAXMOT),TITLE
+      PARAMETER (MAXPRM = 31)
+      CHARACTER PROMPT(9)*(MAXPRM)
+      EXTERNAL PSCORE
+C   GETS PATTERN DEFINITIONS IN TERMS OF MOTIFS
+      PMINT = 1.0
+      PMAXT = 1.0
+      PROBT = 1.0
+      EXPTS = 0.0
+      PROBS = 0.0
+      PMINS = 0.0
+      PMAXS = 0.0
+      EXPTT = 1.0
+      IOK = 0
+      IPROB = 0
+      DO 10 I=1,MAXMOT
+        RELEND(I) = 5
+        COMBIN(I) = 'A'
+10    CONTINUE
+      RANGES(1) = 0
+      RANGEL(1) = IDSEQ
+      IREL = 0
+C   COUNT MOTIFS
+      NMOT = 0
+      MOTIF = 0
+C   SET POINTER TO SEARCH STRINGS ARRAY
+      ISTRNG = 1
+C   SET POINTER TO WEIGHT ARRAY
+      IWT = 1
+      TCLASS = 'A'
+C   SET CLASS TO EXACT MATCH
+      ICLASS = 1
+100   CONTINUE
+      IOK = 1
+      PROMPT(1) = 'Exact match'
+      PROMPT(2) =  'Percentage match'
+      PROMPT(3) = 'Cut-off score and score matrix'
+      PROMPT(4) = 'Cut-off score and weight matrix'
+      PROMPT(5) = 'Complement of weight matrix'
+      PROMPT(6) = 'Inverted repeat or stem-loop'
+      PROMPT(7) = 'Exact match, defined step'
+      PROMPT(8) = 'Direct repeat'
+      PROMPT(9) = 'Pattern complete'
+      CALL RADION('Select motif definition mode',PROMPT,9,ICLASS,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ICLASS.LT.1) RETURN
+      IF(ICLASS.EQ.9)GO TO 900
+101   CONTINUE
+      LKEY = 0
+      CALL GTSTR('Motif name',' ',KEYNS(MOTIF+1),LKEY,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 101
+      END IF
+      IF(MOTIF.GT.0) THEN
+        IF(COMBIN(MOTIF).NE.'N') THEN
+          PROMPT(1) = 'And'
+          PROMPT(2) = 'Or'
+          PROMPT(3) = 'Not'
+          IC = 1
+          CALL RADION('Select logical operator',PROMPT,3,IC,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IC.LT.1) RETURN
+          IF(IC.EQ.1) TCLASS = 'A'
+          IF(IC.EQ.2) TCLASS = 'O'
+          IF(IC.EQ.3) TCLASS = 'N'
+        ELSE
+          PROMPT(1) = 'And'
+          PROMPT(2) = 'Not'
+          IC = 1
+          CALL RADION('Select logical operator',PROMPT,2,IC,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IC.LT.1) RETURN
+          IF(IC.EQ.1) TCLASS = 'A'
+          IF(IC.EQ.2) TCLASS = 'N'
+        END IF
+      END IF
+      NMOT = NMOT + 1
+      MOTIF = MOTIF + 1
+      IF(NMOT.GT.MAXMOT)THEN
+        WRITE(KBOUT,*)'Maximum number of motifs exceeded'
+        GO TO 901
+      END IF
+      CLASS(MOTIF) = ICLASS
+      COMBIN(MOTIF) = TCLASS
+C  IF NOT THE FIRST MOTIF, AND AN ANDED MOTIF OR THE FIRST IN A LIST OF ORS
+C  GET ITS RANGE ETC
+      IOK = 0
+      IF((MOTIF.GT.1).AND.(COMBIN(MOTIF).NE.'O'))THEN
+C        WRITE(KBOUT,1002)
+C1002    FORMAT(' RANGES ARE DEFINED RELATIVE TO OTHER MOTIFS',/,
+C     +' AND BY STARTS AND DISTANCES.')
+        IOK = 1
+        MININ = 1
+        MAXIN = MOTIF
+211     CONTINUE
+        MAXIN = MAXIN - 1
+        IF(COMBIN(MAXIN).NE.'A') GO TO 211
+        IREL = MAXIN
+        CALL GETINT(MININ,MAXIN,IREL,'Number of reference motif',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IREL = IVAL
+        RELMOT(MOTIF) = IREL
+        IOK = 1
+        IF(CLASS(IREL).EQ.6)THEN
+          CALL YESNO(MEND,'Relative to 5 prime end',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(MEND.LT.0)RETURN
+          RELEND(MOTIF) = 5
+          IF(MEND.EQ.1)RELEND(MOTIF) = 3
+        END IF
+        MININ = -1000
+        MAXIN = 1000
+        ID = LENGTH(IREL) + 1
+        IF((CLASS(IREL).EQ.6).OR.(CLASS(IREL).EQ.8)) ID = 1
+        CALL GETINT(MININ,MAXIN,ID,'Relative start position',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        ID = IVAL
+        RANGES(MOTIF) = ID - 1
+        MININ = 0
+        MAXIN = 1000
+        IR = 0
+        CALL GETINT(MININ,MAXIN,IR,'Number of extra positions',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IR = IVAL
+        RANGEL(MOTIF) = IR
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+C  NEED TO SET RANGES TO THOSE OF THE FIRST IN A SET OF ORED MOTIFS
+C  SET TO THOSE OF THE PREVIOUS MOTIF BECAUSE IT MUST BE THE SAME
+        RANGES(MOTIF) = RANGES(MOTIF-1)
+        RANGEL(MOTIF) = RANGEL(MOTIF-1)
+        RELEND(MOTIF) = RELEND(MOTIF-1)
+        RELMOT(MOTIF) = IREL
+      END IF
+      CALL GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      IF(IOK.EQ.0) GO TO 100
+901   CONTINUE
+      WRITE(KBOUT,*)' Error in pattern definition'
+      IOK = 1
+      RETURN
+900   CONTINUE
+      EXPTT = EXPTT * EXPTS
+      PROBT = PROBT * PROBS
+      PMINT = PMINT * PMINS
+      PMAXT = PMAXT * PMAXS
+      CALL YESNO(IN,'Save pattern in a file',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.0)RETURN
+      IF(IN.EQ.0)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVSV,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        CALL SAVSIG(
+     +  IDEVSV,STRING,MAXSTR,
+     +  LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +  WTSTR,RELEND,NAMSAV,COMBIN,KEYNS,TITLE,KBIN,KBOUT)
+        CLOSE(UNIT=IDEVSV)
+      END IF
+      DO 899 I = 2,NMOT
+        IF((CLASS(I).EQ.6).OR.(CLASS(I).EQ.8)) THEN
+          RANGEL(I) = RANGEL(I) + 1
+        ELSE
+          RANGEL(I) = RANGEL(I) + LENGTH(I)
+        END IF
+899   CONTINUE
+C  RETURN STRING LENGTH
+      MAXSTR = ISTRNG - 1
+      IOK = 0
+      END
+      SUBROUTINE SRCSIG(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATCHQ,RELEND,
+     +IDEVOT,LAST5,LAST3S,LAST3E,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,IOPT,XMAX,XMIN,YMAX,YMIN,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,IDME,PMINT,PMAXT,PROBT,MAT1,MAT2,MAT3,MAT4,
+     +PMINC,KEYNS)
+C  ROUTINE TO SEARCH FOR SIGNALS COMPOSED OF MOTIFS
+C  WEIGHT  = WEIGHTS FOR MATRICES
+C  CUTOFF  = CUTOFF SCORES
+C  LENGTH  = MOTIF LENGTHS
+C  CLASS   = MOTIF CLASS
+C  COMBIN  = LOGICAL COMBINATION A, O, N
+C  WTSTR   = POINTER TO WEIGHT STARTS
+C  RANGES  = RANGES START
+C  RANGEL  = RANGE LENGTH (A DISTANCE MEASURED FROM RANGES)
+C  RELMOT  = MOTIF NUMBER THAT A RANGE IS RELATIVE TO IE THE
+C            FIRST MOTIF'S RANGE IS RELATIVE TO MOTIF 0, BUT
+C            ANY OTHER MOTIF MAY HAVE TO BE DEFINED RELATIVE
+C            TO ANY OTHER. THE MOST COMMON WOULD BE THE FIRST
+C            MOTIF OR THE LAST ONE SEARCHED FOR.
+C  RELEND  = IS A SPECIAL CASE FLAG FOR STEMS. IT ALLOWS OTHER
+C            MOTIFS TO HAVE THEIR POSITIONS RELATIVE TO THE 3' SIDE
+C            OF A STEM. IT IS 5 FOR THE 5 PRIME SIDE, 3 FOR 3' SIDE
+C  START   = RANGE START DURING SEARCH (SOME POSITIONS MAY HAVE
+C            BEEN TRIED)
+C  IEND    = RANGE END POSITION FOR CURRENT INITIAL START (WHEN IT
+C            IS RESET DEPENDS ON WHETHER IT IS DEFINED RELATIVE TO 
+C            THE FIRST OR THE PREVIOUS MOTIF. IF IT IS DEFINED RELATIVE
+C            TO THE FIRST MOTIF IT IS RESET WHEN WE FIND A MATCH FOR THE
+C            FIRST MOTIF. IF IT IS DEFINED RELATIVE TO THE PREVIOUS MOTIF
+C            WE MUST RESET WHEN WE MOVE FORWARD ONE MOTIF. I THINK THIS
+C            CAN BE TAKEN CARE OF BY UPDATING ALL THOSE MOTIFS THAT ARE
+C            DEFINED TO THE CURRENT MOTIF EVERY TIME WE MOVE FORWARD 
+C            ONE MOTIF (IE IT INCLUDES THE FIRST MOTIF SO IT IS NOT A
+C            SPECIAL CASE). OTHERWISE WE UPDATE POSITIONS WHEN WE FIND
+C            A MATCH FOR THEM (WE SET TO THE MATCH POSITION PLUS 1)
+C  MATCHP  = LIST OF CURRENT MATCH POSITIONS FOR EACH MOTIF
+C  MATCHS  = LIST OF CURRENT MATCH SCORES FOR EACH MOTIF
+C  IFOUND  = A FLAG TO INDICATE SUCCESS OR FAILURE OF A SEARCH ROUTINE
+C            1 = SUCCESS, 0 = FAIL
+C  STRNGS  = POINTER TO STRING STARTS IN CHARACTER ARRAY STRING
+C  TEMPORARY VALUES ARE:
+C  MOTIF   = ACTUAL MOTIF NUMBER
+C  ICLASS  = CLASS
+C  ILEN    = LENGTH OF MOTIF
+C  CUT     = CUTOFF
+C  WT      = START OF WEIGHTS FOR THIS MOTIF
+C  ISTRST  = START OF STRING
+C  RANGET  = START OF 3' RANGE FOR STEM SEARCHES
+C  RANGEM  = END OF 3' RANGE FOR STEM SEARCHES
+C  IENTRY  = FLAG TO SIGNIFY MORE 3' STEM POSITIONS FOR LAST 5' START
+C            0 = NONE, ON RETURN FROM MOTIF6 IT CONTAINS THE 3' MATCH 
+C            POSITION
+C  MATCHQ  = MATCH POSITION FOR STEM SEARCH
+C  COMB    = LOGICAL COMBINATION A, O, N
+      INTEGER WTSTR(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER RELMOT(NMOT),MATCHP(NMOT),STRNGS(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),IENTRY(NMOT),RELEND(NMOT)
+      INTEGER START2(NMOT),IEND2(NMOT),MATCHQ(NMOT)
+      INTEGER LAST5(NMOT),LAST3S(NMOT),LAST3E(NMOT)
+      INTEGER MAT1(IDM,IDM),MAT2(IDME,IDME)
+      INTEGER MAT3(IDME,IDME),MAT4(IDM,IDM)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT),MATCHS(NMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      CHARACTER COMBIN(NMOT),COMB
+      CHARACTER*(*) KEYNS(NMOT)
+C
+C
+C  INITIALIZE
+      JMOT = 0
+      IRET = 0
+5     CONTINUE
+      JMOT = JMOT + 1
+      IF(JMOT.LE.NMOT)THEN
+        IF(RELMOT(JMOT).EQ.0)THEN
+          START(JMOT) = RANGES(1)
+          IEND(JMOT) = RANGES(1) + RANGEL(1) -1
+          GO TO 5
+        END IF
+      END IF
+      MOTIF  = 1
+      ICLASS = CLASS(1)
+      ILEN   = LENGTH(1)
+      CUT    = CUTOFF(1)
+      IWT    = WTSTR(1)
+      ISTRST = STRNGS(1)
+      IENTRY(1) = 0
+      COMB   = COMBIN(1)
+      DO 10 I = 1,NMOT
+        MATCHP(I) = 0
+10    CONTINUE
+C
+C
+C
+100   CONTINUE
+C
+C
+C  THIS A CLASS CLASS MOTIF, PERFORM THE APPROPRIATE SEARCH IF THE START 
+C  POSITION IS >0. (IF IT IS NOT THE CURRENT MOTIF IS A NOT THAT HAS
+C  ALREADY BEEN SEARCHED FOR
+C
+      IFOUND = 0
+      IF(START(MOTIF).GT.0)THEN
+C
+C
+      IF(ICLASS.EQ.1)THEN
+C        WRITE(*,*)'SRCSIG,MOTIF,ICLASS,START,END'
+C        WRITE(*,*)MOTIF,ICLASS,START(MOTIF),IEND(MOTIF)
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),0)
+      ELSE IF(ICLASS.EQ.2)THEN
+        CALL MOTIF2(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND)
+      ELSE IF(ICLASS.EQ.3)THEN
+        CALL MOTIF3(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,MAT3,
+     +  IDME)
+      ELSE IF(ICLASS.EQ.4)THEN
+        CALL MOTIF4(SEQ,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.5)THEN
+        CALL MOTIF4(SEQ,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.6)THEN
+        CALL MOTIF6(SEQ,IDSEQ,MAT4,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDM)
+      ELSE IF(ICLASS.EQ.7)THEN
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),1)
+      ELSE IF(ICLASS.EQ.8)THEN
+      CALL MOTIF8(SEQ,IDSEQ,MAT2,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDME)
+      ELSE
+        WRITE(KBOUT,*)'UNKNOWN CLASS!!'
+      END IF
+C
+C
+      END IF
+C
+C
+C   MATCH FOUND WHEN MATCH WANTED ?
+C
+C
+C
+      IF(((IFOUND.EQ.0).AND.(COMB.NE.'N')).OR.
+     +   ((IFOUND.GT.0).AND.(COMB.EQ.'N')))THEN
+C
+C   NO SO GO BACK OR SIDEWAYS ONE MOTIF
+C
+C
+        CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +  RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +  COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+        IF(IRET.NE.0)RETURN
+C
+C
+      ELSE
+C
+C
+C   MATCH FOUND. 
+C
+C
+C   PREPARE FOR NEXT SEARCH THIS MOTIF BY INCREMENTING POINTER
+C   TO SEARCH RANGE (NOT FOR CLASS 6 WHICH IS HANDLED BY MOTIF6)
+C
+C
+       IF(COMB.EQ.'N')THEN
+         START(MOTIF) = -9
+       ELSE
+C         IF((ICLASS.NE.6).AND.(ICLASS.NE.8))
+C         REPLACE LAST LINE 7-7-89
+         IF(ICLASS.NE.6)START(MOTIF) = MATCHP(MOTIF) + 1
+C  ERROR?????????
+         IF(ICLASS.EQ.7)START(MOTIF) = MATCHP(MOTIF) + CUTOFF(MOTIF)
+       END IF
+C
+C   TRY GOING FORWARD ONE MOTIF
+C
+C
+      CALL FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +  RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +  RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +  COMBIN,COMB)
+C
+C
+C
+C   IS THIS THE LAST MOTIF? IF SO DISPLAY THE MATCH
+C
+C
+        IF(IDSPLY.EQ.1)THEN
+          ITOTAL = ITOTAL + 1
+          CALL DSPLAY(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +    CLASS,MATCHQ,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOPT,
+     +    XMAX,XMIN,YMAX,YMIN,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +    PMINT,PMAXT,PROBT,IDM,MAT1,IDM,MAT2,IDME,MAT3,IDM,
+     +    MAT4,IDM,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,KEYNS)
+C
+C
+C  HORRIBLE SPECIAL CASE - IF LAST MOTIF IS NOTTED WE MUST MOVE BACK AFTER
+C  DISPLAY
+          IF(COMB.EQ.'N')THEN
+            CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +      MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +      RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +      COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+            IF(IRET.NE.0)RETURN
+          END IF
+        END IF
+C
+C
+      END IF
+C
+C
+C   GO BACK FOR NEXT SEARCH
+C
+C
+      GO TO 100
+      END
+C*********************************************************************
+      SUBROUTINE DSPLAY(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEV,
+     +CLASS,MATCHQ,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOPT,
+     +XMAX,XMIN,YMAX,YMIN,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +PMINT,PMAXT,PROBT,IDM,MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,
+     +MAT4,IDMAT4,WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,KEYNS)
+      INTEGER MATCHP(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER MATCHQ(NMOT)
+      CHARACTER SEQ(IDSEQ)
+      REAL MATCHS(NMOT),MINSCR,MAXSCR
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      INTEGER WTSTR(NMOT)
+      CHARACTER*(*) KEYNS(NMOT)
+      EXTERNAL PSCORE
+C  ADD SCORES
+      T = 0.
+      DO 10 I = 1,NMOT
+        IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+10    CONTINUE
+      POBS = 1.0
+      IF((PMINC.LT.1.0).OR.(IOPT.EQ.3))THEN
+C  CALC PROB
+        DO 20 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            CALL GETP(CLASS(I),SEQ(MATCHP(I)+KSTART-1),LENGTH(I),
+     +      IDM,MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +      WEIGHT(MAX(1,WTSTR(I))))
+            PROB = PSCORE(MATCHS(I))
+            POBS = POBS * PROB
+          END IF
+20      CONTINUE
+        IF((PMINC.LT.1.0).AND.(POBS.GT.PMINC))RETURN
+      END IF
+C
+      IF(T.GT.MAXSCR) MAXSCR = T
+      IF(T.LT.MINSCR) MINSCR = T
+      IF(T.LT.CUTSCR) RETURN
+C
+      IF(IOPT.EQ.1)THEN
+      WRITE(IDEV,1001)
+1001  FORMAT(' Match')
+      DO 100 I=1,NMOT
+        J = I
+C  CHECK FOR NO MATCH (NEEDED FOR ORED MOTIFS)
+        IF(MATCHP(J).NE.0)THEN
+        WRITE(IDEV,1000)MATCHP(J)+KSTART-1,KEYNS(I)
+        WRITE(IDEV,1002)(SEQ(K),K=MATCHP(J),MATCHP(J)+LENGTH(J)-1)
+C  STEM ?
+        IF(CLASS(J).EQ.6)THEN
+          WRITE(IDEV,1002)(SEQ(K),K=MATCHQ(J),MATCHQ(J)-LENGTH(J)+1,-1)
+          WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+        END IF
+C  REPEAT ?
+        IF(CLASS(J).EQ.8)THEN
+          WRITE(IDEV,1002)(SEQ(K),K=MATCHQ(J),MATCHQ(J)+LENGTH(J)-1)
+          WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+        END IF
+        END IF
+100   CONTINUE
+1000  FORMAT(' ',I7,' ',A8)
+1002  FORMAT('       ',60A1)
+      IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+1004  FORMAT(' Probability =',E10.4)
+      RETURN
+      END IF
+      IF(IOPT.EQ.3)THEN
+C       MARK THE POSITION OF THE FIRST NON-ZERO MATCH ONLY
+        DO 200 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            X = MATCHP(I) + KSTART - 1
+            Y = LOG(1.-POBS)
+            CALL VECTOM
+            CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            CALL VT100M
+            GO TO 201
+          END IF
+200     CONTINUE
+201     CONTINUE
+      END IF
+      IF(IOPT.EQ.2)THEN
+        MINP = 999999
+        MAXP = -999999
+        DO 300 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C  INVERTED REPEAT ?
+            IF(CLASS(I).EQ.6) K = MATCHQ(I)
+C  REPEAT ?
+            IF(CLASS(I).EQ.8) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+300     CONTINUE
+        WRITE(IDEV,1000)MINP+KSTART-1,KEYNS(1)
+        WRITE(IDEV,1002)
+     +  ((SEQ(K1),K1=K2,MIN(K2+59,MAXP)),K2=MINP,MAXP,60)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+        RETURN
+      END IF
+      IF(IOPT.EQ.4)THEN
+        MINP = 999999
+        MAXP = -999999
+        K1 = 0
+        DO 400 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K1.EQ.0) INAM = I
+            K1 = 1
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C           INVERTED REPEAT ?
+            IF(CLASS(I).EQ.6) K = MATCHQ(I)
+C           REPEAT ?
+            IF(CLASS(I).EQ.8) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+400     CONTINUE
+        CALL EFTOUT(KEYNS(INAM),MINP+KSTART-1,MAXP+KSTART-1,IDEV)
+      END IF
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF3(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND,MATRIC,IDM)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER MATRIC(IDM,IDM)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFIT5(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND,MATRIC,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF4(SEQ,IDIM1,LENGTH,ISTART,IEND,
+     +WEIGHT,CUTOFF,MATCHP,MATCHS,IFOUND,IDM)
+      REAL MATCHS
+      CHARACTER SEQ(IDIM1)
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.LENGTH)RETURN
+      CALL FMOT4(SEQ(ISTART),L1,WEIGHT,LENGTH,CUTOFF,SCORE,IFOUND,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = ISTART+IFOUND-1
+      MATCHS = SCORE
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE FMOT4(SEQ,IDIM,WT,LENGTH,CUTOFF,SUM,IFOUND,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,LENGTH)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=1,IDIM-LENGTH+1
+        SUM=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          IP = CTONUM(SEQ(J))
+          SUM=SUM+WT(IP,K)
+5       CONTINUE
+        IF(SUM.GE.CUTOFF)THEN
+        IFOUND = I
+        RETURN
+        END IF
+10    CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF6(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM)
+      REAL MATCHS
+      EXTERNAL LOOP
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL STEM I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A STEM LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR-1).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I5STAR,I5END',I5STAR,I5END
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.CUTOFF)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+CDEBUG
+C            WRITE(*,*)'MATCHP',MATCHP
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR-1).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I+I3STAR-1
+        IF(LOOPJ1.GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ,I+I3END-1)
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN LOOP NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES BACKWARDS
+C
+         ISUM = LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+CDEBUG
+C            WRITE(*,*)'MATCHP',MATCHP
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+CDEBUG
+C            WRITE(*,*)'MATCHPEND',MATCHP
+      END
+C*********************************************************************
+      INTEGER FUNCTION LOOP(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),CTONUM
+      EXTERNAL CTONUM
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES BACKWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P+1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 - 1
+C      WRITE(*,*)'I5,I3',I5,I3
+       L5 = CTONUM(SEQ(I5))
+       L3 = CTONUM(SEQ(I3))
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      LOOP = L
+      END
+      SUBROUTINE MOTIF8(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),REPEET
+      REAL MATCHS
+      EXTERNAL REPEET
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL REPEAT I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A REPEAT LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.ICUT)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ-2*LENGTH+1,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I + I3STAR -1
+        IF((LOOPJ1+LENGTH-1).GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ-LENGTH+1,I+I3END-1)
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN REPEAT NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES FORWARDS
+C
+         ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+      END
+C*********************************************************************
+      INTEGER FUNCTION REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),DTONUM
+      EXTERNAL DTONUM
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES FORWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P-1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 + 1
+C      WRITE(*,*)'I5,I3',I5,I3
+       L5 = DTONUM(SEQ(I5))
+       L3 = DTONUM(SEQ(I3))
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      REPEET = L
+      END
+      SUBROUTINE SQFIT5(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND,MATRIC,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER MATRIC(IDM,IDM),DTONUM
+      EXTERNAL DTONUM
+      MINSCR = MINSC
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + MATRIC(DTONUM(SEQ(IP)),DTONUM(STRING(J)))
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+      SUBROUTINE SETCMP(SEQ,IDIM,IDM)
+      CHARACTER SEQ(IDIM)
+      PARAMETER (MAXCHR = 17)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      COMMON /COMPC/COMP(MAXCHR)
+      SAVE /COMPC/
+      DO 10 I = 1,IDM
+        COMP(I) = 0.0
+10    CONTINUE
+      DO 20 I = 1,IDIM
+        J = DTONUM(SEQ(I))
+        COMP(J) = COMP(J) + 1.
+20    CONTINUE
+      T = MAX(1,IDIM)
+      DO 30 I = 1,IDM
+        COMP(I) = COMP(I) / T
+30    CONTINUE
+      END
diff --git a/src/staden/patternnc.f b/src/staden/patternnc.f
new file mode 100644
index 0000000..d4afb63
--- /dev/null
+++ b/src/staden/patternnc.f
@@ -0,0 +1,1668 @@
+C   9-11-90 very many changes concerning use of file of file names
+C           and switching radio to radion
+C  3-7-91 Removed annotation "filename" from pattern files
+C  4-7-91 replaced embout by eftout
+C  18-7-91 added titles to pattern files
+C  2-3-92 set filnam = ' ' for some calls to openf1
+      SUBROUTINE GETMF(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +PMINT,PMAXT,PROBT,EXPTT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,IPROB,TITLE)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      CHARACTER*(*) KEYNS(MAXMOT),TITLE
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),TCLASS
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      CHARACTER*(*) NAMSAV(MAXMOT)
+      EXTERNAL PSCORE
+C   GETS PATTERN DEFINITIONS IN TERMS OF MOTIFS
+      PMINT = 1.0
+      PMAXT = 1.0
+      PROBT = 1.0
+      EXPTS = 0.0
+      PROBS = 0.0
+      PMINS = 0.0
+      PMAXS = 0.0
+      EXPTT = 1.0
+      IOK = 1
+      DO 10 I=1,MAXMOT
+        RELEND(I) = 5
+        COMBIN(I) = 'A'
+10    CONTINUE
+      RANGES(1) = 0
+      RANGEL(1) = IDSEQ
+      IREL = 0
+C   COUNT MOTIFS
+      NMOT = 0
+      MOTIF = 0
+C   SET POINTER TO SEARCH STRINGS ARRAY
+      ISTRNG = 1
+C   SET POINTER TO WEIGHT ARRAY
+      IWT = 1
+C get title
+      READ(JDEV,1000,ERR=901,END=901)TITLE
+ 1000 FORMAT(A)
+100   CONTINUE
+      CALL GETCLS(JDEV,KBOUT,ICLASS,TCLASS,KEYNS(MOTIF+1))
+1001  FORMAT(I7)
+      IF(ICLASS.LT.0) GO TO 901
+      IF(ICLASS.GT.MAXCLS)GO TO 901
+      IF(ICLASS.EQ.0)GO TO 900
+      IF((TCLASS.EQ.'O').AND.(NMOT.LT.1))THEN
+        WRITE(KBOUT,*)'CANNOT OR FIRST MOTIF'
+        GO TO 901
+      END IF
+      IF((TCLASS.EQ.'N').AND.(NMOT.LT.1))THEN
+        WRITE(KBOUT,*)'CANNOT NOT FIRST MOTIF'
+        GO TO 901
+      END IF
+      IF((TCLASS.EQ.'O').AND.(COMBIN(MOTIF).EQ.'N'))THEN
+        WRITE(KBOUT,*)'CANNOT OR WITH NOTTED MOTIF'
+        GO TO 901
+      END IF
+      NMOT = NMOT + 1
+      MOTIF = MOTIF + 1
+      IF(NMOT.GT.MAXMOT)THEN
+        WRITE(KBOUT,*)'MAXIMUM NUMBER OF MOTIFS EXCEEDED'
+        GO TO 901
+      END IF
+      CLASS(MOTIF) = ICLASS
+      COMBIN(MOTIF) = TCLASS
+C  IF NOT THE FIRST MOTIF, AND AN ANDED MOTIF OR THE FIRST IN A LIST OF ORS
+C  GET ITS RANGE ETC
+      IF((MOTIF.GT.1).AND.(COMBIN(MOTIF).NE.'O'))THEN
+        READ(JDEV,1001,ERR=901)IREL
+        IF(IREL.LT.0)GO TO 901
+        IF(IREL.LT.1)GO TO 901
+        IF(IREL.GT.MOTIF-1)THEN
+          WRITE(KBOUT,*)'CAN ONLY REFER BACK'
+          GO TO 901
+        END IF
+        IF(COMBIN(IREL).NE.'A')THEN
+          WRITE(KBOUT,*)'CAN ONLY REFER TO AN ANDED MOTIF'
+          GO TO 901
+        END IF
+        RELMOT(MOTIF) = IREL
+        IF(CLASS(IREL).EQ.6)THEN
+          READ(JDEV,1001,ERR=901)MEND
+         IF(MEND.LT.0)GO TO 901
+          RELEND(MOTIF) = 5
+          IF(MEND.EQ.3)RELEND(MOTIF) = 3
+          READ(JDEV,1001,ERR=901)ID
+          IF(ID.LT.0)GO TO 901
+        ELSE
+          READ(JDEV,1001,ERR=901)ID
+        END IF
+        READ(JDEV,1001,ERR=901)IR
+        IF(IR.LT.0)GO TO 901
+        RANGES(MOTIF) = ID - 1
+        RANGEL(MOTIF) = IR
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+C  NEED TO SET RANGES TO THOSE OF THE FIRST IN A SET OF ORED MOTIFS
+C  SET TO THOSE OF THE PREVIOUS MOTIF BECAUSE IT MUST BE THE SAME
+        RANGES(MOTIF) = RANGES(MOTIF-1)
+        RANGEL(MOTIF) = RANGEL(MOTIF-1)
+        RELEND(MOTIF) = RELEND(MOTIF-1)
+        RELMOT(MOTIF) = IREL
+      END IF
+C      KEYNS(MOTIF) = KEYNAM
+      XRAN = 1.0
+C  NOW GET DETAILS SPECIFIC TO EACH CLASS OF MOTIF
+      CALL GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      IF(IOK.EQ.0) GO TO 100
+901   CONTINUE
+      WRITE(KBOUT,*)' Error in pattern definition'
+      IOK = 1
+      CLOSE(UNIT=JDEV)
+      RETURN
+900   CONTINUE
+      EXPTT = EXPTT * EXPTS
+      PROBT = PROBT * PROBS
+      PMINT = PMINT * PMINS
+      PMAXT = PMAXT * PMAXS
+      CLOSE(UNIT=JDEV)
+      DO 899 I = 2,NMOT
+        IF((CLASS(I).EQ.6).OR.(CLASS(I).EQ.8)) THEN
+          RANGEL(I) = RANGEL(I) + 1
+        ELSE
+          RANGEL(I) = RANGEL(I) + LENGTH(I)
+        END IF
+899   CONTINUE
+C  RETURN STRING LENGTH
+      MAXSTR = ISTRNG - 1
+      IOK = 0
+      END
+      SUBROUTINE GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT)
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      CHARACTER*(*) NAMSAV(MAXMOT)
+      EXTERNAL PSCORE
+      XRAN = 1.0
+      PMIN = 1.
+      PMAX = 1.
+      PROB = 1.
+C  NOW GET DETAILS SPECIFIC TO EACH CLASS OF MOTIF
+      IF(CLASS(MOTIF).EQ.1)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT1(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),KBIN,KBOUT,
+     +  IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN  = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMIN = PROB
+          PMAX = PROB
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+        CUTOFF(MOTIF) = LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.2)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT2(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMIN = PROB
+          PMAX = PSCORE(SCMAX)
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.3)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT3(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),MAT3,IDMAT3,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.4)THEN
+        MXWT = MAXWTS - IWT + 1
+        CALL GETMT4(WEIGHT(IWT),MXWT,LENGTH(MOTIF),CUTOFF(MOTIF),
+     +  SCMAX,IDEV,KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        WTSTR(MOTIF) = IWT
+        IWT = IWT + LENGTH(MOTIF)*IDM
+        NAMSAV(MOTIF) = FILNAM
+      ELSE IF(CLASS(MOTIF).EQ.5)THEN
+        MXWT = MAXWTS - IWT + 1
+        CALL GETMT5(WEIGHT(IWT),MXWT,LENGTH(MOTIF),CUTOFF(MOTIF),
+     +  SCMAX,IDEV,KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+        IF(IOK.NE.0)GO TO 901
+        SCMIN = CUTOFF(MOTIF)
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        WTSTR(MOTIF) = IWT
+        IWT = IWT + LENGTH(MOTIF)*IDM
+        NAMSAV(MOTIF) = FILNAM
+      ELSE IF(CLASS(MOTIF).EQ.6)THEN
+        CALL GETMT6(RANGET(MOTIF),RANGEM(MOTIF),LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF)*2
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        XRAN = ABS(RANGEM(MOTIF))-ABS(RANGET(MOTIF))+1
+      ELSE IF(CLASS(MOTIF).EQ.7)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT7(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),KBIN,KBOUT,
+     +  IOK,JDEV,CUTOFF(MOTIF),
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          STRNGS(MOTIF) = ISTRNG
+          ISTRNG = ISTRNG + LENGTH(MOTIF)
+          SCMIN  = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMIN = PROB
+          PMAX = PROB
+        END IF
+        XRAN = 1.0 / CUTOFF(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.8)THEN
+        CALL GETMT8(RANGET(MOTIF),RANGEM(MOTIF),LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,
+     +    MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,
+     +    WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        XRAN = ABS(RANGEM(MOTIF))-ABS(RANGET(MOTIF))+1
+      END IF
+C  GET RANGE OF POSITIONS FOR PROBABILITY CALC
+      LRANGE = RANGEL(MOTIF) + 1
+      IF(COMBIN(MOTIF).EQ.'A')THEN
+        IF(EXPTS.GT.0.0)THEN
+          EXPTT = EXPTT * EXPTS
+          PROBT = PROBT * PROBS
+          PMINT = PMINT * PMINS
+          PMAXT = PMAXT * PMAXS
+        END IF
+        EXPTS = LRANGE * XRAN * PROB
+        PROBS = PROB
+        PMINS = PMIN
+        PMAXS = PMAX
+      ELSE IF(COMBIN(MOTIF).EQ.'N')THEN
+        IF(EXPTS.GT.0.0)THEN
+          EXPTT = EXPTT * EXPTS
+          PROBT = PROBT * PROBS
+          PMINT = PMINT * PMINS
+          PMAXT = PMAXT * PMAXS
+        END IF
+        PMIN = 1.0 - PMIN
+        PMAX = 1.0 - PMAX
+        PROB = 1.0 - PROB
+        EXPTS = PROB ** (LRANGE * XRAN)
+        PROBS = PROB
+        PMINS = PMIN
+        PMAXS = PMAX
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+        EXPTS = EXPTS + LRANGE * XRAN * PROB
+        PROBS = PROBS + PROB
+        PMINS = PMINS + PMIN
+        PMAXS = PMAXS + PMAX
+      END IF
+      IF(IPROB.EQ.0) CALL WRTSCR(SCMIN,PROB,KBOUT)
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+C*********************************************************************
+      SUBROUTINE GETMT1(STRING,MAXSTR,LENGTH,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+C  GETS DETAILS FOR MOTIF CLASS 1
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+      END IF
+      IOK = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE GETMT2(STRING,MAXSTR,LENGTH,CUTOFF,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+      REAL MININ,MAXIN
+C  GETS DETAILS FOR MOTIF CLASS 2
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('string',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+        MININ = 1.
+        MAXIN = LENGTH
+        CUTOFF = LENGTH
+        CALL GETRL(MININ,MAXIN,CUTOFF,'Minimum matches',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+        READ(JDEV,1002,ERR=901)CUTOFF
+1002    FORMAT(F10.0)
+      END IF
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+C*********************************************************************
+      SUBROUTINE GETMT3(STRING,MAXSTR,LENGTH,CUTOFF,MAT3,IDMAT3,
+     +KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+      INTEGER MAT3(IDMAT3,IDMAT3)
+      REAL MININ,MAXIN
+C  GETS DETAILS FOR MOTIF CLASS 3
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+        CALL GTSCR(STRING,LENGTH,MAT3,IDMAT3,ISMIN,ISMAX)
+        MININ = ISMIN
+        MAXIN = ISMAX
+        CUTOFF = MAXIN
+        CALL GETRL(MININ,MAXIN,CUTOFF,'Minimum score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+        READ(JDEV,1002,ERR=901)CUTOFF
+1002    FORMAT(F10.0)
+      END IF
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+      SUBROUTINE GETMT4(WEIGHT,MAXWTS,LENGTH,CUTOFF,YMAX,IDEV,
+     +KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXLEN = 120, MAXCHR = 5)
+      REAL WEIGHT(MAXWTS)
+      CHARACTER FILNAM*(*)
+      INTEGER TOT(MAXLEN),SUM(MAXCHR,MAXLEN)
+      IF(JDEV.EQ.KBIN) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,JDEV,KBOUT,
+     +  'Weight matrix file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE
+        CALL OPENF(IDEV,FILNAM,0,IOK,JDEV,KBOUT)
+      END IF
+      IF(IOK.NE.0)RETURN
+      LENGTH=MAXLEN
+      CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,CUTOFF,YMAX,IDEV,
+     +  IOK,IDM,KBOUT,IPROB)
+      IF(IOK.NE.0)THEN
+        WRITE(KBOUT,*)' Error in weight matrix, option left'
+        RETURN
+      END IF
+      IF(CUTOFF.LT.0.0)CALL GETW(TOT,SUM,WEIGHT,LENGTH,IDM,MAXLEN)
+      IF(CUTOFF.GE.0.0)CALL GETW2(SUM,WEIGHT,LENGTH,IDM,MAXLEN)
+      END
+C*********************************************************************
+      SUBROUTINE GETMT5(WEIGHT,MAXWTS,LENGTH,CUTOFF,SCMAX,IDEV,
+     +KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+      CHARACTER HELPF*(*)
+      REAL WEIGHT(MAXWTS)
+      CHARACTER FILNAM*(*)
+      IF(JDEV.EQ.KBIN) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,JDEV,KBOUT,
+     +  'Weight matrix file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE
+        CALL OPENF(IDEV,FILNAM,0,IOK,JDEV,KBOUT)
+      END IF
+      IF(IOK.NE.0)RETURN
+      CALL RDMT5(WEIGHT,MAXWTS,LENGTH,CUTOFF,SCMAX,IDEV,IOK,IDM,KBOUT,
+     +IPROB)
+      END
+C*********************************************************************
+      SUBROUTINE RDMT5(WEIGHT,MAXWTS,LENGTH,CUTOFF,YMAX,IDEV,IOK,IDM,
+     +KBOUT,IPROB)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (MAXLEN = 120, MAXCHR = 5)
+      REAL WEIGHT(MAXWTS)
+      INTEGER TOT(MAXLEN),SUM(MAXCHR,MAXLEN)
+      LENGTH=MAXLEN
+      CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,CUTOFF,YMAX,IDEV,
+     +  IOK,IDM,KBOUT,IPROB)
+      IF(IOK.NE.0)THEN
+        WRITE(KBOUT,*)' Error in weight matrix, option left'
+        RETURN
+      END IF
+      CALL GETW(TOT,SUM,WEIGHT,LENGTH,IDM,MAXLEN)
+C   NOW COMPLEMENT THE WEIGHT MATRIX
+      CALL COMPWM(WEIGHT,LENGTH,MAXLEN,IDM)
+      END
+C*********************************************************************
+      SUBROUTINE GETMT6(RANGES,RANGEL,LENGTH,CUTOFF,
+     +KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      INTEGER RANGES,RANGEL
+C  GETS DETAILS FOR MOTIF CLASS 6
+      IF(JDEV.EQ.KBIN)THEN
+      IOK = 1
+      MININ = 1
+      MAXIN = 60
+      LENGTH = 6
+      CALL GETINT(MININ,MAXIN,LENGTH,'Stem length',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LENGTH = IVAL
+      MININ = -LENGTH
+      MAXIN = 60
+      ID = 0
+      CALL GETINT(MININ,MAXIN,ID,'Minimum loop length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      ID = IVAL
+      MININ = ID
+      MAXIN = 60
+      IR = ID
+      CALL GETINT(MININ,MAXIN,IR,'Maximum loop length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IR = IVAL
+        SMININ = 1
+        SMAXIN = 2 * LENGTH
+        CUTOFF = SMAXIN
+        CALL GETRL(SMININ,SMAXIN,CUTOFF,'Minimum score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+      READ(JDEV,1001,ERR=901)LENGTH
+      IF(LENGTH.LT.1)RETURN
+      READ(JDEV,1001,ERR=901)ID
+1001  FORMAT(I7)
+C      IF(ID.LT.0)GO TO 220
+      READ(JDEV,1001,ERR=901)IR
+C  COMMENT NEXT LINE TO ALLOW PALLINDROMES
+C      IF(IR.LT.0)GO TO 230
+      READ(JDEV,1006,ERR=901)CUTOFF
+1006  FORMAT(F10.0)
+      END IF
+      RANGES = ID + 2*LENGTH
+      RANGEL = IR + 2*LENGTH
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+C*********************************************************************
+      SUBROUTINE GETMT7(STRING,MAXSTR,LENGTH,KBIN,KBOUT,IOK,JDEV,STEP,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+      REAL STEP
+C  GETS DETAILS FOR MOTIF CLASS 1
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+        MININ = 1
+        MAXIN = 20
+        ISTEP = 3
+        CALL GETINT(MININ,MAXIN,ISTEP,'Step',IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        ISTEP = IVAL
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+        READ(JDEV,1002,ERR=901)ISTEP
+1002    FORMAT(I6)
+      END IF
+      STEP = ISTEP
+      IF(ISTEP.LT.1)STEP = 3.0
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1     
+      END
+      SUBROUTINE GETMT8(RANGES,RANGEL,LENGTH,CUTOFF,
+     +KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      INTEGER RANGES,RANGEL
+C  GETS DETAILS FOR MOTIF CLASS 8
+      IF(JDEV.EQ.KBIN)THEN
+      IOK = 1
+      MININ = 1
+      MAXIN = 60
+      LENGTH = 6
+      CALL GETINT(MININ,MAXIN,LENGTH,'Repeat length',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LENGTH = IVAL
+      MININ = 0
+      MAXIN = 60
+      ID = MININ
+      CALL GETINT(MININ,MAXIN,ID,'Minimum gap',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      ID = IVAL
+      MININ = ID
+      MAXIN = MININ + 60
+      IR = ID
+      CALL GETINT(MININ,MAXIN,IR,'Maximum gap',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IR = IVAL
+        SMININ = 1
+        SMAXIN = LENGTH
+        CUTOFF = SMAXIN
+        CALL GETRL(SMININ,SMAXIN,CUTOFF,'Minimum score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+      READ(JDEV,1001,ERR=901)LENGTH
+      IF(LENGTH.LT.1)RETURN
+      READ(JDEV,1001,ERR=901)ID
+1001  FORMAT(I7)
+C      IF(ID.LT.0)GO TO 220
+      READ(JDEV,1001,ERR=901)IR
+      READ(JDEV,1006,ERR=901)CUTOFF
+1006  FORMAT(F10.0)
+      END IF
+      RANGES = ID + LENGTH + 1
+      RANGEL = IR + LENGTH + 1
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+      SUBROUTINE DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLE)
+      CHARACTER STRING(MAXSTR)
+      INTEGER LENGTH(NMOT),CLASS(NMOT),RELMOT(NMOT),RELEND(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),RANGET(NMOT),RANGEM(NMOT)
+      INTEGER WTSTR(NMOT),STRNGS(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT)
+      CHARACTER*(*) KEYNS(NMOT),TITLE
+C
+C
+C  DESCRIBE THE SIGNAL
+C
+      WRITE(KBOUT,1000)
+1000  FORMAT(/' Pattern description',/)
+      WRITE(KBOUT,1007)TITLE
+ 1007 FORMAT(' ',A)
+      DO 100 I = 1,NMOT
+        WRITE(KBOUT,1001)I,KEYNS(I),CLASS(I)
+1001    FORMAT(' Motif ',I2,' named ',A8,' is of class ',I4)
+C
+        IF(CLASS(I).EQ.1)THEN
+          WRITE(KBOUT,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1002      FORMAT(' Which is an exact match to the string',
+     +    /,(' ',50A1))
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+1008      FORMAT(' and the 5 prime base can take positions',
+     +            I7,' to ',I7,/,
+     +           ' relative to the ',I1,' prime end of motif',I4)
+        ELSE IF(CLASS(I).EQ.2)THEN
+          WRITE(KBOUT,1003)CUTOFF(I),
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1003      FORMAT(' which is a match of score ',F6.0,
+     +            ' to the string',/,(' ',50A1))
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.3)THEN
+          WRITE(KBOUT,1003)CUTOFF(I),
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.4)THEN
+          WRITE(KBOUT,1004)CUTOFF(I)
+1004      FORMAT(' Which is a match to a weight matrix with score',
+     +             F8.3)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.5)THEN
+          WRITE(KBOUT,1011)CUTOFF(I)
+1011      FORMAT(' Which is a match to the complement of a',
+     +           ' weight matrix with score',
+     +             F8.3)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.6)THEN
+          WRITE(KBOUT,1005)LENGTH(I),CUTOFF(I)
+1005      FORMAT(' Which is a stem-loop structure with stem length',I5,
+     +           ' and score ',F6.0)
+          WRITE(KBOUT,1006)RANGET(I)-2*LENGTH(I),RANGEM(I)-2*LENGTH(I)
+1006      FORMAT(' The loop can have sizes ',I6,' to ',I6)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I),RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.7)THEN
+          WRITE(KBOUT,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(KBOUT,1013)INT(CUTOFF(I))
+1013      FORMAT(' with a step size of',I6)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELEND(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.8)THEN
+          WRITE(KBOUT,1014)LENGTH(I),CUTOFF(I)
+1014      FORMAT(' Which is a repeat with repeat length',I5,
+     +           ' and score ',F6.0)
+          WRITE(KBOUT,1015)RANGET(I)-LENGTH(I)-1,RANGEM(I)-LENGTH(I)-1
+1015      FORMAT(' The loop-out can have sizes ',I6,' to ',I6)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I),RELEND(I),RELMOT(I)
+        END IF
+        IF(COMBIN(I).EQ.'O')WRITE(KBOUT,1010)
+1010    FORMAT(' It is orred with the previous motif.')
+        IF(COMBIN(I).EQ.'N')WRITE(KBOUT,1012)
+1012    FORMAT(' It is notted with the previous motif.')
+        IF((COMBIN(I).EQ.'A').AND.(I.GT.1))WRITE(KBOUT,1009)
+1009    FORMAT(' It is anded with the previous motif.')
+100   CONTINUE
+      END
+C*********************************************************************
+      SUBROUTINE SAVSIG(
+     +            IDEV,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,NAMSAV,COMBIN,KEYNS,TITLE,KBIN,KBOUT)
+      CHARACTER STRING(MAXSTR)
+      INTEGER LENGTH(NMOT),CLASS(NMOT),RELMOT(NMOT),RELEND(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),RANGET(NMOT),RANGEM(NMOT)
+      INTEGER WTSTR(NMOT),STRNGS(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),SCLASS*10
+      CHARACTER*(*) KEYNS(NMOT)
+      CHARACTER*(*) NAMSAV(NMOT),TITLE
+C
+C
+C  SAVE THE SIGNAL
+C
+ 1    CONTINUE
+      LIN = 0
+      CALL GTSTR('Pattern title',' ',TITLE,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      WRITE(IDEV,1000)TITLE
+ 1000 FORMAT(' ',A)
+      DO 100 I = 1,NMOT
+        WRITE(SCLASS,1014)CLASS(I)
+1014    FORMAT(I10)
+        DO 10 K = 1,10
+          IF(SCLASS(K:K).NE.' ')THEN
+            J = 0
+            DO 5 K1 = K,10
+              J = J+1
+              SCLASS(J:J) = SCLASS(K:K)
+              SCLASS(K:K) = ' '
+5           CONTINUE
+          END IF
+10      CONTINUE
+        WRITE(IDEV,1001)COMBIN(I),SCLASS,KEYNS(I)
+1001        FORMAT(' ',A,A,' ',A8,' Class ')
+          IF((I.NE.1).AND.(COMBIN(I).NE.'O'))THEN
+            WRITE(IDEV,1008)RELMOT(I)
+1008        FORMAT(I7,'      Relative motif')
+            IF(CLASS(RELMOT(I)).EQ.6)WRITE(IDEV,1007)RELEND(I)
+1007        FORMAT(I7,'      Relative end')
+1003        FORMAT(I7,'      Relative start position')
+1004        FORMAT(I7,'      Number of extra positions')
+            WRITE(IDEV,1003)RANGES(I)+1
+            WRITE(IDEV,1004)RANGEL(I)
+          END IF
+        IF(CLASS(I).EQ.1)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1002      FORMAT((' ',50A1))
+          WRITE(IDEV,1006)
+1006      FORMAT(' @ End of string')
+        ELSE IF(CLASS(I).EQ.2)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(IDEV,1006)
+          WRITE(IDEV,1009)CUTOFF(I)
+        ELSE IF(CLASS(I).EQ.3)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(IDEV,1006)
+          WRITE(IDEV,1009)CUTOFF(I)
+        ELSE IF(CLASS(I).EQ.4)THEN
+          WRITE(IDEV,1005)NAMSAV(I)
+1005      FORMAT(A)
+        ELSE IF(CLASS(I).EQ.5)THEN
+          WRITE(IDEV,1005)NAMSAV(I)
+        ELSE IF(CLASS(I).EQ.6)THEN
+          WRITE(IDEV,1010)LENGTH(I)
+1010      FORMAT(I7,'      Length')
+          WRITE(IDEV,1012)RANGET(I)-2*LENGTH(I)
+          WRITE(IDEV,1013)RANGEM(I)-2*LENGTH(I)
+1012      FORMAT(I7,'      Minimum loop')
+1013      FORMAT(I7,'      Maximum loop')
+          WRITE(IDEV,1009)CUTOFF(I)
+1009      FORMAT(F10.5,'   Cutoff')
+        ELSE IF(CLASS(I).EQ.7)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(IDEV,1006)
+          WRITE(IDEV,1015)INT(CUTOFF(I))
+1015      FORMAT(I7,'      Step size')
+        ELSE IF(CLASS(I).EQ.8)THEN
+          WRITE(IDEV,1010)LENGTH(I)
+          WRITE(IDEV,1012)RANGET(I)-LENGTH(I)-1
+          WRITE(IDEV,1013)RANGEM(I)-LENGTH(I)-1
+          WRITE(IDEV,1009)CUTOFF(I)
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +COMBIN,COMB)
+      INTEGER CLASS(NMOT),LENGTH(NMOT),WTSTR(NMOT)
+      INTEGER STRNGS(NMOT),START(NMOT),IEND(NMOT),IENTRY(NMOT)
+      INTEGER RELMOT(NMOT),RANGES(NMOT),RANGEL(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),START2(NMOT),IEND2(NMOT)
+      INTEGER MATCHQ(NMOT),RELEND(NMOT),MATCHP(NMOT)
+      REAL CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),COMB
+CDEBUG
+C      WRITE(*,*)'IN FORWAD FOR MOTIF',MOTIF
+C  ROUTINE TO MOVE FORWARDS ONE MOTIF
+C  FIRST IT UPDATES THE RANGES FOR ALL THE MOTIFS THAT DEPEND
+C  ON THE CURRENT MOTIF
+C  IT POINTS TO THE NEXT MOTIF AND GETS ALL THE REQUIRED VARIABLES
+C  WE MUST FIRST LOOK BACK THRU THE LIST UNTIL WE FIND THE FIRST
+C  NON ORED MOTIF. THEN WE MUST SET THE RELATIVE POSITIONS FOR ALL
+C  THOSE MOTIFS THAT DEPEND ON THIS SET OF ORED MOTIFS. THE RELATIVE
+C  POSITIONS MUST BE SET RELATIVE TO THE POSITION OF THE MATCH FOUND
+C  FOR THE CURRENT MOTIF EVEN IF IT IS NOT THE FIRST IN THE LIST OF ORS
+C  THIS IS BECAUSE RELATIVE POSITIONS CAN ONLY BE DEFINED RELATIVE TO
+C  THE FIRST OF A SET OF ORS, BUT IF A MATCH IS FOUND FOR ANY OF THE OTHERS
+C  IN THE SET THEN ITS POSITION DEFINES THE RANGE.
+C
+C
+C
+C  IF THIS IS THE LAST MOTIF RETURN TO DISPLAY THE MATCH
+      IF(MOTIF.EQ.NMOT)THEN
+        IDSPLY = 1
+        RETURN
+      END IF
+C
+C  NOT THE LAST
+C
+      IDSPLY = 0
+C
+C  IF THIS IS A NOTTED MOTIF THEN IT WILL NOT BE ORED OR HAVE ANY
+C  OTHER MOTIFS RELATIVE TO IT, SO DEAL WITH IT HERE
+C
+      IF(COMB.EQ.'N')THEN
+        MOTIF  = MOTIF + 1
+        ICLASS = CLASS(MOTIF)
+        ILEN   = LENGTH(MOTIF)
+        CUT    = CUTOFF(MOTIF)
+        IWT    = WTSTR(MOTIF)
+        ISTRST = STRNGS(MOTIF)
+        COMB   = COMBIN(MOTIF)
+        RETURN
+      END IF
+C
+C
+C   NOW LOOK FOR THE FIRST NON ORRED CLASS FROM HERE BACK
+C
+      NOTIF = MOTIF
+10    CONTINUE
+C
+      IF(COMBIN(NOTIF).EQ.'O')THEN
+        NOTIF = NOTIF - 1
+        GO TO 10
+      END IF
+C
+C  NOTIF IS A NON-ORED MOTIF, OR THE FIRST OF A LIST OF ORS 
+C  SO SET RELATIVE POSITIONS FOR ALL THOSE THAT DEPEND ON IT
+C
+        DO 20 I = MOTIF+1,NMOT
+          JMOT = I
+            IF(RELMOT(JMOT).EQ.NOTIF)THEN
+              IF(CLASS(NOTIF).EQ.6)THEN
+                IF(RELEND(JMOT).EQ.3) START(JMOT) = MATCHQ(MOTIF) + 
+     +                                          RANGES(JMOT)
+                IF(RELEND(JMOT).EQ.5) START(JMOT) = MATCHP(MOTIF) + 
+     +                                          RANGES(JMOT)
+              ELSE
+                START(JMOT) = START(MOTIF) + RANGES(JMOT) - 1
+              END IF
+C             NEXT LINE FOR RAH BUG
+              START(JMOT) = MAX(START(JMOT),1)
+              IEND(JMOT) = START(JMOT) + RANGEL(JMOT) - 1
+              IF(IEND(JMOT).GT.IDSEQ)IEND(JMOT)=IDSEQ
+C   SET FLAG TO SAY THAT FOR STEMS WE DO NOT HAVE TO CONTINUE A PREVIOUS
+C   5' STEM START POSITION (IE TRY ALL ITS REMAINING LOOPS)
+              IENTRY(JMOT) = 0
+            END IF
+C
+20      CONTINUE
+C
+C
+C   OK WEVE FOUND THE FIRST NON-ORRED CLASS AND RESET RANGES
+C   RANGES ACCORDINGLY. NOW WE HAVE TO GO FORARDS THRU THE LIST UNTIL
+C   WE FIND THE NEXT NON-ORRED CLASS. IF WE SET THIS TO BE THE 
+C   CURRENT MOTIF WE WILL HAVE MOVED FORWARDS. WE MUST BE CAREFUL THAT
+C   WE DONT FALL OFF THE END OF THE LIST!
+C
+          JMOT = MOTIF
+30        CONTINUE
+          JMOT   = JMOT + 1
+          IF(JMOT.GT.NMOT)THEN
+            IDSPLY = 1
+            RETURN
+          END IF
+          IF(COMBIN(JMOT).EQ.'O')GO TO 30
+          MOTIF  = JMOT
+          ICLASS = CLASS(MOTIF)
+          ILEN   = LENGTH(MOTIF)
+          CUT    = CUTOFF(MOTIF)
+          IWT    = WTSTR(MOTIF)
+          ISTRST = STRNGS(MOTIF)
+          COMB   = COMBIN(MOTIF)
+CDEBUG
+C          DO 1000 I = 1,NMOT
+C            WRITE(*,*)'START',START(I)
+C1000      CONTINUE
+        END
+C*********************************************************************
+      SUBROUTINE BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +COMBIN,COMB)
+      INTEGER CLASS(NMOT),LENGTH(NMOT),WTSTR(NMOT)
+      INTEGER STRNGS(NMOT),RELMOT(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER MATCHQ(NMOT),RANGES(NMOT),RANGEL(NMOT),RELEND(NMOT)
+      INTEGER MATCHP(NMOT)
+      REAL CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),COMB
+CDEBUG
+C      WRITE(*,*)'IN BAKSID FOR MOTIF',MOTIF
+C   ROUTINE TO MOVE BAKWARDS OR SIDEWAYS
+C   IE WE COME HERE AFTER FAILING TO FIND A MATCH FOR THE PREVIOUS MOTIF
+C   WE CHECK IF WE CAN MOVE SIDEWAYS BY LOOKING TO SEE IF THE NEXT MOTIF
+C   IN THE LIST IS ORRED.
+C   IF IT IS WE LEAVE THE CURRENT MOTIFS POSITION UNCHANGED SO THAT IT
+C   IS READY TO SIGNIFIY TO A SUBSEQUENT MOVE BACKWARDS THAT ALL ITS
+C   POSITIONS HAVE BEEN TRIED FOR THE MATCHES FOUND FURTHER UP THE LIST.
+C
+C   IF THE NEXT MOTIF IN THE LIST IS NOT ORED THEN WE MUST MOVE BACKWARDS.
+C   BEFORE DOING SO WE MUST RESET THE RANGES FOR ALL THE MOTIFS IN THE 
+C   CURRENT SET OF ORS. THEY MUST BE RESET TO THEIR ORIGINAL FULL RANGE
+C   BECAUSE WE ARE TRYING TO FIND ALL POSSIBLE COMBINATIONS OF MATCH AND
+C   WHEN WE GO BACK WE MIGHT NOT GO BACK AS FAR AS THE MOTIF THE CURRENT 
+C   SET DEPEND ON, AND SO THEY WILL NOT BE RESET COMING FORWARDS. SO IF WE 
+C   FIND ANOTHER MATCH TO A MOTIF IN A PREVIOUS SET WE MUST ALLOW THE SAME
+C   MATCHES TO BE REFOUND IN THE CURRENT SET BECAUSE IT GIVES A DIFFERENT 
+C   COMBINATION TO THOSE ALREADY FOUND. TO FIND THE ONES TO RESET WE MUST
+C   MOVE BACK THRU THE LIST UNTIL A NON-ORRED CLASS IS FOUND: THIS 
+C   IS THE LAST ONE TO RESET.
+C   HAVING DONE THIS, TO MOVE BACK WE MUST LOOK BACK FROM THE LAST ONE 
+C   RESET, UNTIL WE FIND THE FIRST NON-ORRED CLASS. THIS WILL
+C   BE AN ANDED OR NOTTED MOTIF OR THE FIRST IN A LIST OF ORS. 
+C   WE MUST ALWAYS GO BACK
+C   TO THE BEGINNING OF A LIST OF ORS. IF THEY HAVE NO FURTHER POSITIONS TO
+C   TRY WE CAN THEN GO SIDEWAYS UNTIL EITHER ONE WITH SOME PLACES TO TRY
+C   IS FOUND, OR WE HAVE TO GIVE UP AND GO BACKWARDS.
+C   MOTIF 1 AND ANY MOTIFS ORED WITH IT ARE SPECIAL CASES FOR THE ROUTINES
+C   BECAUSE THEIR RANGES MUST NOT BE RESET. THEIR SPECIALNESS IS SIGNIFIED
+C   BY THEIR HAVING A ZERO RELATIVE MOTIF NUMBER. ALSO IF WE TRY TO GO BACK
+C   FOR MOTIF 1, WE MUST HAVE FINISHED TRYING ALL POSSIBLE COMBINATIONS,
+C   SO WE ESCAPE.
+C   
+C
+C
+C   RESET ITS MATCH POSITION TO ZERO AS A FLAG TO THE DISPLAY ROUTINES
+C
+C
+      MATCHP(MOTIF) = 0
+C
+C
+      IF(MOTIF.LT.NMOT)THEN
+        IF(COMBIN(MOTIF+1).EQ.'O')THEN
+C
+C  MOVE SIDEWAYS
+C
+          MOTIF = MOTIF + 1
+          ICLASS = CLASS(MOTIF)
+          ILEN   = LENGTH(MOTIF)
+          CUT    = CUTOFF(MOTIF)
+          IWT    = WTSTR(MOTIF)
+          ISTRST = STRNGS(MOTIF)
+          MATCHP(MOTIF) = 0
+          COMB   = COMBIN(MOTIF)
+          RETURN
+        END IF
+      END IF
+C
+C
+C
+C   WE MUST MOVE BACKWARDS.   RESET RANGE FOR CURRENT MOTIF IF IT IS NOT 
+C   NOTTED
+C
+C
+      IF(COMB.NE.'N')THEN
+C
+C
+      IREL = RELMOT(MOTIF)
+      IF(IREL.NE.0)THEN
+C  CHANGE ON NEXT LINE MADE 30-07-7
+C        IF((CLASS(MOTIF).EQ.6).AND.(RELEND(MOTIF).EQ.3)) 
+C     +    START(MOTIF) = MATCHQ(IREL) + RANGES(MOTIF)
+        IF(CLASS(IREL).EQ.6)THEN
+          IF(RELEND(MOTIF).EQ.3)
+     +      START(MOTIF) = MATCHQ(IREL) + RANGES(MOTIF)
+          IF(RELEND(MOTIF).EQ.5)
+     +      START(MOTIF) = MATCHP(IREL) + RANGES(MOTIF)
+        ELSE
+          START(MOTIF) = START(IREL) + RANGES(MOTIF)
+        END IF
+C       NEXT LINE FOR RAH BUG
+        START(MOTIF) = MAX(START(MOTIF),1)
+        IEND(MOTIF)  = START(MOTIF) + RANGEL(MOTIF) - 1 
+      END IF
+C
+C
+C
+10    CONTINUE
+C
+C
+      IF(MOTIF.EQ.1)THEN
+        IRET = 1
+        RETURN
+      END IF
+      IF(COMBIN(MOTIF).EQ.'O')THEN
+        MOTIF = MOTIF - 1
+        MATCHP(MOTIF) = 0
+        IREL = RELMOT(MOTIF)
+        IF(IREL.NE.0)THEN
+C  CHANGE ON NEXT LINE MADE 30-07-87
+C          IF((CLASS(MOTIF).EQ.6).AND.(RELEND(MOTIF).EQ.3)) 
+C     +      START(MOTIF) = MATCHQ(IREL) + RANGES(MOTIF)
+          IF(CLASS(IREL).EQ.6)THEN
+            IF(RELEND(MOTIF).EQ.3)
+     +        START(MOTIF) = MATCHQ(IREL) + RANGES(MOTIF)
+            IF(RELEND(MOTIF).EQ.5)
+     +        START(MOTIF) = MATCHP(IREL) + RANGES(MOTIF)
+          ELSE
+            START(MOTIF) = START(IREL) + RANGES(MOTIF)
+          END IF
+C         NEXT LINE FOR RAH BUG
+          START(MOTIF) = MAX(START(MOTIF),1)
+          IEND(MOTIF)  = START(MOTIF) + RANGEL(MOTIF) - 1 
+        END IF
+        GO TO 10
+      END IF
+C
+      END IF
+C
+C   IF WE GET HERE WE HAVE FOUND A NON-ORRED CLASS AND IF THE MOTIF WE CAME 
+C   IN WITH WAS NOT NOTTED
+C   WE HAVE RESET ITS RANGE AND
+C   ALL THOSE UP THE LIST THAT ARE ORRED, AND THE CURRENT ONE
+C   NOW WE MUST FIND THE TOP OF THE NEXT LIST SIGNIFIED BY AN ANDED
+C   CLASS
+C
+20    CONTINUE
+      MOTIF = MOTIF - 1
+      IF(COMBIN(MOTIF).NE.'A')GO TO 20
+C      IF(COMBIN(MOTIF).EQ.'O')GO TO 20
+      ICLASS = CLASS(MOTIF)
+      ILEN   = LENGTH(MOTIF)
+      CUT    = CUTOFF(MOTIF)
+      IWT    = WTSTR(MOTIF)
+      ISTRST = STRNGS(MOTIF)
+      COMB   = COMBIN(MOTIF)
+CDEBUG
+C          DO 1000 I = 1,NMOT
+C            WRITE(*,*)'START',START(I)
+C1000      CONTINUE
+      END
+      SUBROUTINE MOTIF1(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,
+     +MATCHP,MATCHS,IFOUND,CUT,JOB)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS
+      IFOUND = 0
+      ISTEP = 1
+      IF(JOB.EQ.1)ISTEP = CUT
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.IDIM2)RETURN
+      CALL FIND6(SEQ(ISTART),L1,STRING,IDIM2,ISTEP,IFOUND)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = ISTART+IFOUND-1
+      MATCHS = IDIM2
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF2(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFIT4(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      RETURN
+      END
+      SUBROUTINE GETCLS(JDEV,KBOUT,ICLASS,TCLASS,KEYNAM)
+      CHARACTER TCLASS,STRING*21,NUMBER*10,STRNG2*10,KEYNAM*(*)
+      SAVE NUMBER
+      DATA NUMBER/'0987654321'/
+C      WRITE(*,*)'JDEV,KBOUT,ICLASS,TCLASS',JDEV,KBOUT,ICLASS,TCLASS
+      ICLASS = 0
+10    CONTINUE
+      STRING = ' '
+      READ(JDEV,1000,ERR=10,END=70)STRING
+1000  FORMAT(A)
+      KEYNAM = STRING(14:21)
+      DO 20 I = 10,1,-1
+        K = I
+        IF(STRING(I:I).NE.' ')GO TO 21
+20    CONTINUE
+      RETURN
+21    CONTINUE
+      L = 1
+      DO 30 I = 1,K
+        L = I
+        IF(STRING(I:I).NE.' ')GO TO 31
+30    CONTINUE
+31    CONTINUE
+      TCLASS = STRING(L:L)
+      DO 40 I = 1,10
+        IF(TCLASS.EQ.NUMBER(I:I))THEN
+          TCLASS = 'A'
+          GO TO 41
+        END IF
+40    CONTINUE
+      IF((TCLASS.NE.'O').AND.(TCLASS.NE.'N').AND.(TCLASS.NE.'A')
+     +.AND.(TCLASS.NE.'-'))THEN
+        ICLASS = 99999
+        RETURN
+      END IF
+      L = L + 1
+41    CONTINUE
+      J = 11
+      STRNG2 = ' '
+      DO 50 I = K,L,-1
+        J = J-1
+        STRNG2(J:J) = STRING(I:I)
+50    CONTINUE
+        READ(STRNG2,1001,ERR=60)ICLASS
+1001    FORMAT(I10)
+        IF(TCLASS.EQ.'-')ICLASS=-1*ICLASS
+      RETURN
+60    CONTINUE
+      ICLASS = 99999
+      RETURN
+70    CONTINUE
+      END
+C*********************************************************************
+C*********************************************************************
+      SUBROUTINE SQFIT4(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER IUBM1
+      EXTERNAL IUBM1
+      MINSCR = MINSC
+      IDIF=(IE-IS+2)-IDIM2
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + IUBM1(STRING(J),SEQ(IP))
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+      SUBROUTINE GETP(ICLASS,STRING,LENGTH,IDM,
+     +MAT1,IDMAT1,MAT2,IDMAT2,MAT3,IDMAT3,MAT4,IDMAT4,WEIGHT)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      PARAMETER (
+     +           MAXCHR=17,
+     +           MAXSIG=120)
+      INTEGER MAT1(IDMAT1,IDMAT1),MAT2(IDMAT2,IDMAT2)
+      INTEGER MAT3(IDMAT3,IDMAT3),MAT4(IDMAT4,IDMAT4)
+      INTEGER IWT(MAXCHR,MAXSIG)
+      REAL WEIGHT(IDM,LENGTH),FB(MAXCHR),F(MAXCHR)
+      CHARACTER STRING(LENGTH)
+      COMMON /COMPC/COMP(MAXCHR)
+      DO 10 I = 1,MAXCHR
+        F(I) = COMP(I)
+10    CONTINUE
+      IF(ICLASS.EQ.1)THEN
+        CALL GETP1(STRING,LENGTH,IDMAT3,MAT2,IDMAT2,IWT,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.2)THEN
+        CALL GETP1(STRING,LENGTH,IDMAT3,MAT2,IDMAT2,IWT,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.3)THEN
+        CALL GETP3(STRING,LENGTH,IDMAT3,MAT3,IWT,IDM,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.4)THEN
+        CALL GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.5)THEN
+        CALL GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.6)THEN
+        CALL GETP8(MAT4,IDMAT4,IWT,IDM,LENGTH,F,FB)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.7)THEN
+        CALL GETP1(STRING,LENGTH,IDMAT3,MAT2,IDMAT2,IWT,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.8)THEN
+        CALL GETP8(MAT1,IDMAT1,IWT,IDM,LENGTH,F,FB)
+        RETURN
+      END IF
+      END
+      SUBROUTINE WTGEN(IWT,MAXCHR,LENGTH,F,MAXS,SMIN,SCALE,JOB)
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,LENGTH)
+      REAL F(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+10    CONTINUE
+      DO 400 I = 1,MAXCHR
+        K = IWT(I,1)
+        POLYA(K) = POLYA(K) + F(I)
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        DO 490 I = 0, MAXPOL
+          POLYB(I) = 0.0
+490     CONTINUE
+        DO 500 I = 1,MAXCHR
+          K = IWT(I,J)
+          POLYB(K) = POLYB(K) + F(I)
+500     CONTINUE
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE SCGEN(IWT,MAXCHR,LENGTH,FA,FB,MAXS,
+     +SCALE,SMIN,JOB)
+C  AUTHOR RODGER STADEN
+C  PROBABILITIES FOR DIRECT AND INVERTED REPEATS
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,MAXCHR)
+      REAL FA(MAXCHR),FB(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+        POLYB(I) = 0.0
+10    CONTINUE
+      DO 400 I = 1,MAXCHR
+        DO 300 J = 1,MAXCHR
+          K = IWT(I,J)
+          POLYA(K) = POLYA(K) + FA(I) * FB(J)
+          POLYB(K) = POLYA(K)
+300     CONTINUE
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE SCGEN1(IWT,MAXCHR,LENGTH,FA,MAXS,
+     +SCALE,SMIN,JOB,STRING)
+C  AUTHOR RODGER STADEN
+C  PROBABILITY FOR STRINGS
+      CHARACTER STRING(LENGTH)
+      INTEGER DTONUM
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,MAXCHR)
+      REAL FA(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      EXTERNAL DTONUM
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+10    CONTINUE
+      K1 = DTONUM(STRING(1))
+      DO 400 I = 1,MAXCHR
+        K = IWT(I,K1)
+        POLYA(K) = POLYA(K) + FA(I)
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        DO 490 I = 0,MAXPOL
+           POLYB(I) = 0.
+490     CONTINUE
+        K1 = DTONUM(STRING(J))
+        DO 500 I = 1,MAXCHR
+          K = IWT(I,K1)
+          POLYB(K) = POLYB(K) + FA(I)
+500     CONTINUE
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE POLMUL(NA,NB,NC)
+      PARAMETER (MAXPOL = 4000)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      PARAMETER (ZERO=0.0,SMALL=1E-10)
+      SAVE /POLY/
+C  POLYA, POLYB ARE INPUT POLYNOMIAL COEEFICIENTS
+C  POLYC  IS OUTPUT POLYNOMIAL COEEFICIENTS
+C
+      NC = NA + NB
+      DO 210 I = 0,MAXPOL
+        POLYC(I) = ZERO
+210   CONTINUE
+      IF(NC.GT.MAXPOL)RETURN
+      DO 230 I = 0,NA
+        DO 220 J = 0,NB
+          POLYC(I+J) = POLYC(I+J) + POLYA(I) * POLYB(J)
+220     CONTINUE
+230   CONTINUE
+      DO 240 I = 0,NC
+        T = POLYC(I)
+        IF(T.LT.SMALL) T = ZERO
+        POLYA(I) = T
+240   CONTINUE
+      END
+      SUBROUTINE WRTWMT(WEIGHT,IWT,MAXCHR,LENGTH,IDEV,IFLAG)
+      REAL WEIGHT(MAXCHR,LENGTH)
+      INTEGER IWT(MAXCHR,LENGTH)
+      DO 10 I = 1,MAXCHR
+        IF(IFLAG.EQ.0)
+     +  WRITE(IDEV,1000,ERR=100)(WEIGHT(I,J),J=1,LENGTH)
+        IF(IFLAG.EQ.1)
+     +  WRITE(IDEV,1001,ERR=100)(IWT(I,J),J=1,LENGTH)
+10    CONTINUE
+1000  FORMAT(' ',20F5.2)
+1001  FORMAT(' ',20I4)
+100   CONTINUE
+      END
+      SUBROUTINE WTSC(WEIGHT,IWT,MAXCHR,LENGTH,MAXS,SCALE,SMIN)
+      INTEGER IWT(MAXCHR,LENGTH)
+      REAL WEIGHT(MAXCHR,LENGTH)
+      PARAMETER (SMALL=1E-10)
+      PARAMETER (MAXPOL = 4000)
+      SMIN = 9999999.9
+      SMAX = -99999999.9
+      DO 100 J = 1,LENGTH
+        DO 90 I = 1,MAXCHR
+          T = WEIGHT(I,J)
+          SMIN = MIN(SMIN,T)
+          SMAX = MAX(SMAX,T)
+90      CONTINUE
+100   CONTINUE
+      SMAX = SMAX - SMIN
+      IF(SMAX.EQ.0.0)SMAX = SMALL
+C  USE LENGTH+1 TO DIVIDE MAXPOL. THIS IS AN ATTEMPT TO AVOID
+C  THE USE OF NINT GIVING A SET OF SCORES THAT SUM TO TOO
+C  HIGH A VALUE (I.E GT MAXPOL)
+      MAXS = MAXPOL / (LENGTH+1)
+      MAXS = MIN(MAXS,40)
+      SCALE = REAL(MAXS)/SMAX
+      DO 200 J = 1,LENGTH
+        DO 190 I = 1,MAXCHR
+          K = NINT((WEIGHT(I,J)-SMIN)*SCALE)
+          IWT(I,J) = K
+190     CONTINUE
+200   CONTINUE
+      END
+      SUBROUTINE GETCMP(SEQ,IDIM,COMP,IDM)
+      CHARACTER SEQ(IDIM)
+      REAL COMP(IDM)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      DO 10 I = 1,IDM
+        COMP(I) = 0.0
+10    CONTINUE
+      DO 20 I = 1,IDIM
+        J = DTONUM(SEQ(I))
+        COMP(J) = COMP(J) + 1.
+20    CONTINUE
+      T = MAX(1,IDIM)
+      DO 30 I = 1,IDM
+        COMP(I) = COMP(I) / T
+30    CONTINUE
+      END
+      SUBROUTINE MATSC(MAT,IDMAT,IWT,MAXCHR,LENGTH,MAXS,
+     +SMIN,SCALE)
+      INTEGER IWT(IDMAT,IDMAT)
+      INTEGER MAT(IDMAT,IDMAT)
+      PARAMETER (SMALL=1E-10)
+      PARAMETER (MAXPOL = 4000)
+      ISMIN = 99999999
+      ISMAX = -99999999
+      DO 100 J = 1,IDMAT
+        DO 90 I = 1,IDMAT
+          K = MAT(I,J)
+          ISMIN = MIN(ISMIN,K)
+          ISMAX = MAX(ISMAX,K)
+90      CONTINUE
+100   CONTINUE
+C      SMAX = ISMAX - ISMIN
+C      IF(SMAX.EQ.0.0)SMAX = SMALL
+C  DO NOTHING HERE - ASSUME VALUES ARE IN RANGE AND JUST COPY
+C  USE LENGTH+1 TO DIVIDE MAXPOL. THIS IS AN ATTEMPT TO AVOID
+C  THE USE OF NINT GIVING A SET OF SCORES THAT SUM TO TOO
+C  HIGH A VALUE ( IE GT MAXPOL)
+C    MAXS = MAXPOL(LENGTH+1)
+C      SCALE = REAL(MAXS)/SMAX
+      DO 200 J = 1,IDMAT
+        DO 190 I = 1,IDMAT
+C          K = NINT((REAL(WEIGHT(I,J))-ISMIN)*SCALE)
+          IWT(I,J) = MAT(I,J) - ISMIN
+190     CONTINUE
+200   CONTINUE
+      SCALE = 1.0
+      SMIN = ISMIN
+      MAXS = ISMAX
+      END
+      REAL FUNCTION PSCORE(SCORE)
+      PARAMETER (MAXPOL = 4000)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +SMIN,SCALE,LENGTH
+      SAVE /POLY/
+C WANT PROBABILITY OF SCORING AT LEAST SCORE
+C CUMMULATIVE PROBABILITIES IN CUMP
+C SCALE FACTORS SMIN AND SCALE AND MOTIF LENGTH IS LENGTH
+      I = NINT((SCORE - SMIN * LENGTH) * SCALE)
+      PSCORE = -1.
+      IF((I.GE.0).AND.(I.LE.MAXPOL)) PSCORE = POLYA(I)
+      END
+      SUBROUTINE GETP1(STRING,LENGTH,IDMAT3,MAT2,IDMAT2,IWT,F)
+      INTEGER MAT2(IDMAT2,IDMAT2)
+      INTEGER IWT(IDMAT2,IDMAT2)
+      REAL F(IDMAT3)
+      CHARACTER STRING(LENGTH)
+      JOB = 2
+        CALL MATSC(MAT2,IDMAT2,IWT,IDMAT3,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN1(IWT,IDMAT2,LENGTH,F,MAXS,SCALE,SMIN,JOB,STRING)
+      END
+      SUBROUTINE GETP3(STRING,LENGTH,IDMAT3,MAT3,IWT,IDM,F)
+      INTEGER MAT3(IDMAT3,IDMAT3)
+      INTEGER IWT(IDMAT3,IDMAT3)
+      REAL F(IDMAT3)
+      CHARACTER STRING(LENGTH)
+      JOB = 2
+        CALL MATSC(MAT3,IDMAT3,IWT,IDM,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN1(IWT,IDMAT3,LENGTH,F,MAXS,SCALE,SMIN,JOB,
+     +  STRING)
+      END
+      SUBROUTINE GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+      REAL F(IDM)
+      REAL WEIGHT(IDM,LENGTH)
+      INTEGER IWT(IDM,LENGTH)
+      JOB = 2
+        CALL WTSC(WEIGHT,IWT,IDM,LENGTH,MAXS,SCALE,SMIN)
+        CALL WTGEN(IWT,IDM,LENGTH,F,MAXS,SMIN,SCALE,JOB)
+      END
+      SUBROUTINE GETP8(MAT1,IDMAT1,IWT,IDM,LENGTH,F,FB)
+      REAL F(IDM),FB(IDM)
+      INTEGER MAT1(IDMAT1,IDMAT1)
+      INTEGER IWT(IDM,LENGTH)
+        DO 30 I = 1,IDM
+          FB(I) = F(I)
+30      CONTINUE
+      JOB = 2
+        CALL MATSC(MAT1,IDMAT1,IWT,IDM,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN(IWT,IDMAT1,LENGTH,F,FB,MAXS,SCALE,SMIN,JOB)
+      END
+      SUBROUTINE WRTSCR(SCMIN,PROB,KBOUT)
+      WRITE(KBOUT,1000)SCMIN,PROB
+1000  FORMAT(' Probability of score ',F10.4,' =',E10.3)
+      END
+      SUBROUTINE EFTOUT(FTNAME,I1,I2,IDEV)
+      CHARACTER LINOUT*80,FTNAME*(*)
+      EXTERNAL ITOSL
+C  produces ft lines from a pair of positions
+      LINOUT = 'FT '
+      LINOUT(6:80) = FTNAME
+      L = 22
+      J = ITOSL(LINOUT(L:),I1)
+      IF(J.EQ.0) GO TO 100
+      L = L + J
+      LINOUT(L:L+1) = '..'
+      L = L + 2
+      J = ITOSL(LINOUT(L:),I2)
+      IF(J.EQ.0) GO TO 100
+      WRITE(IDEV,1000,ERR=100)LINOUT
+ 1000 FORMAT(A)
+      RETURN
+100   CONTINUE
+      WRITE(*,*)'Error in EFTOUT'
+      END
diff --git a/src/staden/patternp.f b/src/staden/patternp.f
new file mode 100644
index 0000000..411c571
--- /dev/null
+++ b/src/staden/patternp.f
@@ -0,0 +1,1027 @@
+C 22-7-91 fixed bug in dsplay: graphics did not work
+C 2-3-92 set filnam = ' ' for some calls to openf1
+      SUBROUTINE PATTEN(SEQ,IDSEQ,STRING,MAXSTR,
+     +LENGTH,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,IENTRY,
+     +START2,IEND2,WTSTR,START,IEND,MATCHQ,RELEND,MATCHP,
+     +STRNGS,LAST5,LAST3S,LAST3E,MATCHS,CUTOFF,WEIGHT,FILNAM,
+     +MAXMOT,MAXWTS,MATRIX,
+     +IDEV1,IDEV2,IDEV3,
+     +MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KSTART,KBIN,KBOUT,
+     +IDEV4,IDM,COMBIN,MAT1,NAMSAV,KEYNS,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,FOFNAM,IDEVFF)
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT)
+      INTEGER RANGET(MAXMOT),RANGEM(MAXMOT),IENTRY(MAXMOT)
+      INTEGER START2(MAXMOT),IEND2(MAXMOT)
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      INTEGER WTSTR(MAXMOT),START(MAXMOT),IEND(MAXMOT)
+      INTEGER MATCHQ(MAXMOT),RELEND(MAXMOT)
+      INTEGER MATCHP(MAXMOT),STRNGS(MAXMOT)
+      INTEGER LAST5(MAXMOT),LAST3S(MAXMOT),LAST3E(MAXMOT)
+      INTEGER MAT1(IDM,IDM),MATRIX(IDM,IDM)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT),MATCHS(MAXMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER FILNAM*(*),HELPF*(*),FOFNAM*(*),TITLE*80
+      CHARACTER COMBIN(MAXMOT),CHRSET(IDM)
+      CHARACTER*(*) NAMSAV(MAXMOT),KEYNS(MAXMOT)
+      PARAMETER (MAXCLS = 6)
+      PARAMETER (SMALL=1.0E-5)
+      PARAMETER (MAXPRM = 30)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      WRITE(KBOUT,*)'  Pattern searcher'
+      JDEV = KBIN
+      PROMPT(1) = 'Use keyboard'
+      PROMPT(2) = 'Use pattern file'
+      PROMPT(3) = 'Use file of pattern file names'
+      IN = 1
+      CALL RADION('Select pattern definition mode',PROMPT,3,IN,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.1) RETURN
+      IF(IN.EQ.2)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+      ELSE IF(IN.EQ.3) THEN
+        FOFNAM = ' '
+        CALL OPENF1(IDEVFF,FOFNAM,0,IOK,KBIN,KBOUT,
+     +  'File of pattern file names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+      END IF
+      IOPT = 1
+      PROMPT(1) = 'Motif by motif'
+      PROMPT(2) = 'Inclusive'
+      PROMPT(3) = 'Graphical'
+      PROMPT(4) = 'EMBL feature table'
+      CALL RADION('Select results display mode',PROMPT,4,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) GO TO 201
+      IF((IN.EQ.3).AND.(IOPT.EQ.3)) IOPT = 2
+      PMINC = 1.
+      CUTSCR = -99999.
+      NPAT = 0
+      IPROB = 0
+      NTOTAL = 0
+      JTOTAL = 0
+      CALL SETCMP(SEQ,IDSEQ,IDM)
+      IF(IN.EQ.3) THEN
+        IPROB = 1
+        CALL BUSY(KBOUT)
+      END IF
+10    CONTINUE
+      IF(IN.EQ.3) THEN
+        READ(IDEVFF,1000,ERR=100,END=200)FOFNAM
+        CALL OPENRS(IDEV3,FOFNAM,IOK,LRECL,2)
+        IF(IOK.NE.0) GO TO 200
+        NPAT = NPAT + 1
+      END IF
+1000  FORMAT(A)
+        CALL FILLI(LENGTH,MAXMOT,0)
+        CALL FILLI(CLASS,MAXMOT,0) 
+        CALL FILLI(RELMOT,MAXMOT,0)
+        CALL FILLI(RANGES,MAXMOT,0)
+        CALL FILLI(RANGEL,MAXMOT,0)
+        CALL FILLI(RANGET,MAXMOT,0)
+        CALL FILLI(RANGEM,MAXMOT,0)
+        CALL FILLI(IENTRY,MAXMOT,0)
+        CALL FILLI(START2,MAXMOT,0)
+        CALL FILLI(IEND2,MAXMOT,0) 
+        CALL FILLI(WTSTR,MAXMOT,0) 
+        CALL FILLI(START,MAXMOT,0) 
+        CALL FILLI(IEND,MAXMOT,0)  
+        CALL FILLI(MATCHQ,MAXMOT,0)
+        CALL FILLI(RELEND,MAXMOT,0)
+        CALL FILLI(MATCHP,MAXMOT,0)
+        CALL FILLI(STRNGS,MAXMOT,0)
+        CALL FILLI(LAST5,MAXMOT,0) 
+        CALL FILLI(LAST3S,MAXMOT,0)
+        CALL FILLI(LAST3E,MAXMOT,0)
+        CALL FILLR(CUTOFF,MAXMOT,0.0)
+        CALL FILLR(MATCHS,MAXMOT,0.0)
+        CALL FILLR(WEIGHT,MAXWTS,0.0)
+        CALL FILLC(COMBIN,MAXMOT,'A')
+        DO 1 I = 1,MAXMOT
+          NAMSAV(I) = 'FILENOTUSED'
+C          WRITE(*,*)NAMSAV(I)
+1       CONTINUE        
+        ITOTAL = 0
+C
+C
+C
+C  GET MOTIF DEFINITIONS
+C
+C  RETURN STRING LENGTH FOR COMPATIBILITY WITH LIB SEARCH
+      NSTRNG = MAXSTR
+      IF(IN.EQ.1) THEN
+        CALL GETMK(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,MATRIX,MAT1,
+     +  PMINT,PMAXT,PROBT,EXPTT,CHRSET,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,TITLE)
+      ELSE
+        CALL GETMF(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,MATRIX,MAT1,
+     +  PMINT,PMAXT,PROBT,EXPTT,CHRSET,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,IPROB,TITLE)
+      END IF
+      IF(IOK.NE.0) GO TO 201
+      IF(NMOT.LT.1) GO TO 201
+C
+C
+C  DISPLAY THE SIGNAL DESCRIPTION
+C
+      RANGES(1) = 1
+      IF(IN.NE.3) THEN
+        CALL DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLE)
+C
+C
+        WRITE(KBOUT,2003)PROBT
+2003    FORMAT(' Probability of finding pattern = ',E10.4)
+        WRITE(KBOUT,2004)EXPTT
+2004    FORMAT(' Expected number of matches  = ',E10.4)
+        CALL GETRL(0.,1.,1.0,'Maximum pattern probability',
+     +  XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 201
+        PMINC = XP
+        CALL GETRL(-9999.,9999.,-9999.,'Minimum pattern score',
+     +  XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) GO TO 201
+        CUTSCR = XP
+        MINSCR = 9999999.
+        MAXSCR = -9999999.
+        YMIN = LOG(1.-PMINT)
+        YMAX = LOG(1.-PMAXT)
+        YDEL = ABS(YMIN - YMAX)
+        IF(YDEL.LT.SMALL)THEN
+          YMAX = YMAX + SMALL
+          YMIN = YMIN - SMALL
+        END IF
+        YDEL = ABS(YMIN - YMAX) * 0.1
+        YMIN = YMIN - YDEL
+        YMAX = YMAX + YDEL
+        CALL BUSY(KBOUT)
+      END IF
+      IF(IOPT.EQ.3)THEN
+        XMAX = IDSEQ
+        XMIN = KSTART
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        CALL CLEARV
+      END IF
+C
+C  DO THE SEARCH
+C
+      CALL SRCSIG(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATRIX,MATCHQ,RELEND,
+     +IDEV1,LAST5,LAST3S,LAST3E,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,IOPT,XMAX,XMIN,YMAX,YMIN,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,MAT1,PMINT,PMAXT,PROBT,PMINC,KEYNS)
+      CALL VT100M
+      IF(IN.EQ.3) THEN
+        IF(ITOTAL.NE.0) THEN
+          WRITE(IDEV1,1003)TITLE
+ 1003      FORMAT(' ',A)
+          WRITE(IDEV1,1001)FOFNAM,ITOTAL
+1001      FORMAT(' ',A,' ',I6,/)
+          NTOTAL = NTOTAL + ITOTAL
+          JTOTAL = JTOTAL + 1
+        END IF
+        GO TO 10
+      END IF
+      WRITE(KBOUT,1002)ITOTAL
+1002  FORMAT(' Total matches found',I7)
+      IF((ITOTAL.GT.0).AND.(IN.NE.3))WRITE(KBOUT,1006)MINSCR,MAXSCR
+1006  FORMAT(' Minimum and maximum observed scores',2F12.2)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error reading file of file names'
+200   CONTINUE
+      WRITE(KBOUT,*)'Of the ',NPAT,' patterns processed'
+      WRITE(KBOUT,*)JTOTAL,' gave a total of',NTOTAL,' matches'
+201   CONTINUE
+      CLOSE(UNIT=IDEVFF)
+      END
+      SUBROUTINE SRCSIG(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATRIX,MATCHQ,RELEND,
+     +IDEVOT,LAST5,LAST3S,LAST3E,MARGL,MARGR,MARGB,MARGT,ISXMAX,
+     +ISYMAX,IOPT,XMAX,XMIN,YMAX,YMIN,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,MAT1,PMINT,PMAXT,PROBT,PMINC,KEYNS)
+C  ROUTINE TO SEARCH FOR SIGNALS COMPOSED OF MOTIFS
+C  WEIGHT  = WEIGHTS FOR MATRICES
+C  CUTOFF  = CUTOFF SCORES
+C  LENGTH  = MOTIF LENGTHS
+C  CLASS   = MOTIF CLASS
+C  COMBIN  = LOGICAL COMBINATION A, O, N
+C  WTSTR   = POINTER TO WEIGHT STARTS
+C  RANGES  = RANGES START
+C  RANGEL  = RANGE LENGTH (A DISTANCE MEASURED FROM RANGES)
+C  RELMOT  = MOTIF NUMBER THAT A RANGE IS RELATIVE TO IE THE
+C            FIRST MOTIF'S RANGE IS RELATIVE TO MOTIF 0, BUT
+C            ANY OTHER MOTIF MAY HAVE TO BE DEFINED RELATIVE
+C            TO ANY OTHER. THE MOST COMMON WOULD BE THE FIRST
+C            MOTIF OR THE LAST ONE SEARCHED FOR.
+C  RELEND  = IS A SPECIAL CASE FLAG FOR STEMS. IT ALLOWS OTHER
+C            MOTIFS TO HAVE THEIR POSITIONS RELATIVE TO THE 3' SIDE
+C            OF A STEM. IT IS 5 FOR THE 5 PRIME SIDE, 3 FOR 3' SIDE
+C  START   = RANGE START DURING SEARCH (SOME POSITIONS MAY HAVE
+C            BEEN TRIED)
+C  IEND    = RANGE END POSITION FOR CURRENT INITIAL START (WHEN IT
+C            IS RESET DEPENDS ON WHETHER IT IS DEFINED RELATIVE TO 
+C            THE FIRST OR THE PREVIOUS MOTIF. IF IT IS DEFINED RELATIVE
+C            TO THE FIRST MOTIF IT IS RESET WHEN WE FIND A MATCH FOR THE
+C            FIRST MOTIF. IF IT IS DEFINED RELATIVE TO THE PREVIOUS MOTIF
+C            WE MUST RESET WHEN WE MOVE FORWARD ONE MOTIF. I THINK THIS
+C            CAN BE TAKEN CARE OF BY UPDATING ALL THOSE MOTIFS THAT ARE
+C            DEFINED TO THE CURRENT MOTIF EVERY TIME WE MOVE FORWARD 
+C            ONE MOTIF (IE IT INCLUDES THE FIRST MOTIF SO IT IS NOT A
+C            SPECIAL CASE). OTHERWISE WE UPDATE POSITIONS WHEN WE FIND
+C            A MATCH FOR THEM (WE SET TO THE MATCH POSITION PLUS 1)
+C  MATCHP  = LIST OF CURRENT MATCH POSITIONS FOR EACH MOTIF
+C  MATCHS  = LIST OF CURRENT MATCH SCORES FOR EACH MOTIF
+C  IFOUND  = A FLAG TO INDICATE SUCCESS OR FAILURE OF A SEARCH ROUTINE
+C            1 = SUCCESS, 0 = FAIL
+C  STRNGS  = POINTER TO STRING STARTS IN CHARACTER ARRAY STRING
+C  TEMPORARY VALUES ARE:
+C  MOTIF   = ACTUAL MOTIF NUMBER
+C  ICLASS  = CLASS
+C  ILEN    = LENGTH OF MOTIF
+C  CUT     = CUTOFF
+C  WT      = START OF WEIGHTS FOR THIS MOTIF
+C  ISTRST  = START OF STRING
+C  RANGET  = START OF 3' RANGE FOR STEM SEARCHES
+C  RANGEM  = END OF 3' RANGE FOR STEM SEARCHES
+C  IENTRY  = FLAG TO SIGNIFY MORE 3' STEM POSITIONS FOR LAST 5' START
+C            0 = NONE, ON RETURN FROM MOTIF6 IT CONTAINS THE 3' MATCH 
+C            POSITION
+C  MATCHQ  = MATCH POSITION FOR STEM SEARCH
+C  COMB    = LOGICAL COMBINATION A, O, N
+      INTEGER WTSTR(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER RELMOT(NMOT),MATCHP(NMOT),STRNGS(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),IENTRY(NMOT),RELEND(NMOT)
+      INTEGER START2(NMOT),IEND2(NMOT),MATRIX(IDM,IDM),MATCHQ(NMOT)
+      INTEGER LAST5(NMOT),LAST3S(NMOT),LAST3E(NMOT),MAT1(IDM,IDM)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT),MATCHS(NMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      CHARACTER COMBIN(NMOT),COMB
+      CHARACTER*(*) KEYNS(NMOT)
+C
+C
+C  INITIALIZE
+      JMOT = 0
+      IRET = 0
+5     CONTINUE
+      JMOT = JMOT + 1
+      IF(JMOT.LE.NMOT)THEN
+        IF(RELMOT(JMOT).EQ.0)THEN
+          START(JMOT) = RANGES(1)
+          IEND(JMOT) = RANGES(1) + RANGEL(1) -1
+          GO TO 5
+        END IF
+      END IF
+      MOTIF  = 1
+      ICLASS = CLASS(1)
+      ILEN   = LENGTH(1)
+      CUT    = CUTOFF(1)
+      IWT    = WTSTR(1)
+      ISTRST = STRNGS(1)
+      IENTRY(1) = 0
+      COMB   = COMBIN(1)
+      DO 10 I = 1,NMOT
+        MATCHP(I) = 0
+10    CONTINUE
+C
+C
+C
+C
+C
+100   CONTINUE
+C
+C
+C  THIS A CLASS CLASS MOTIF, PERFORM THE APPROPRIATE SEARCH IF THE START 
+C  POSITION IS >0. (IF IT IS NOT THE CURRENT MOTIF IS A NOT THAT HAS
+C  ALREADY BEEN SEARCHED FOR
+C
+      IFOUND = 0
+      IF(START(MOTIF).GT.0)THEN
+C
+C
+      IF(ICLASS.EQ.1)THEN
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),0)
+      ELSE IF(ICLASS.EQ.2)THEN
+        CALL MOTIF2(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND)
+      ELSE IF(ICLASS.EQ.3)THEN
+        CALL MOTIF3(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,MATRIX,IDM)
+      ELSE IF(ICLASS.EQ.4)THEN
+        CALL MOTIF4(SEQ,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.5)THEN
+      CALL MOTIF8(SEQ,IDSEQ,MATRIX,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDM)
+      ELSE IF(ICLASS.EQ.6)THEN
+        CALL MOTIF4(SEQ,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+C        CALL MOTIF6(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+C     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+C     +  CUTOFF(MOTIF),0)
+      ELSE
+        WRITE(KBOUT,*)'UNKNOWN CLASS!!'
+      END IF
+C
+C
+      END IF
+C
+C
+C   MATCH FOUND WHEN MATCH WANTED ?
+C
+C
+C
+      IF(((IFOUND.EQ.0).AND.(COMB.NE.'N')).OR.
+     +   ((IFOUND.GT.0).AND.(COMB.EQ.'N')))THEN
+C
+C   NO SO GO BACK OR SIDEWAYS ONE MOTIF
+C
+C
+        CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +  RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +  COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+        IF(IRET.NE.0)RETURN
+C
+C
+      ELSE
+C
+C
+C   MATCH FOUND. 
+C
+C
+C   PREPARE FOR NEXT SEARCH THIS MOTIF BY INCREMENTING POINTER
+C   TO SEARCH RANGE (NOT FOR CLASS 6 WHICH IS HANDLED BY MOTIF6)
+C
+C
+       IF(COMB.EQ.'N')THEN
+         START(MOTIF) = -9
+       ELSE
+         IF(ICLASS.NE.5) START(MOTIF) = MATCHP(MOTIF) + 1
+       END IF
+C
+C   TRY GOING FORWARD ONE MOTIF
+C
+C
+      CALL FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +  RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +  RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +  COMBIN,COMB)
+C
+C
+C
+C   IS THIS THE LAST MOTIF? IF SO DISPLAY THE MATCH
+C
+C
+        IF(IDSPLY.EQ.1)THEN
+          ITOTAL = ITOTAL + 1
+          CALL DSPLAY(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +    CLASS,MATCHQ,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOPT,
+     +    XMAX,XMIN,YMAX,YMIN,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +    MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +    WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,KEYNS)
+C
+C
+C  HORRIBLE SPECIAL CASE - IF LAST MOTIF IS NOTTED WE MUST MOVE BACK AFTER
+C  DISPLAY
+          IF(COMB.EQ.'N')THEN
+            CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +      MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +      RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +      COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+            IF(IRET.NE.0)RETURN
+          END IF
+        END IF
+C
+C
+      END IF
+C
+C
+C   GO BACK FOR NEXT SEARCH
+C
+C
+      GO TO 100
+      END
+      SUBROUTINE GETMK(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,MATRIX,MAT1,
+     +PMINT,PMAXT,PROBT,EXPTT,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,TITLE)
+C  MAT1 SIMPLE IDENTITY
+C  MAT2 IUB SCORES 0-1
+C  MAT3 IUB SCORES 0-36
+C  MAT4 INVERTED REPEAT
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT),MATRIX(IDM,IDM),MAT1(IDM,IDM)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),TCLASS,CHRSET(IDM)
+      CHARACTER*(*) KEYNS(MAXMOT)
+      CHARACTER*(*) NAMSAV(MAXMOT),TITLE
+      PARAMETER (MAXPRM = 31)
+      CHARACTER PROMPT(7)*(MAXPRM)
+      EXTERNAL PSCORE
+C   GETS PATTERN DEFINITIONS IN TERMS OF MOTIFS
+      PMINT = 1.0
+      PMAXT = 1.0
+      PROBT = 1.0
+      EXPTS = 0.0
+      PROBS = 0.0
+      PMINS = 0.0
+      PMAXS = 0.0
+      EXPTT = 1.0
+      IOK = 0
+      IPROB = 0
+      DO 10 I=1,MAXMOT
+        RELEND(I) = 5
+        COMBIN(I) = 'A'
+10    CONTINUE
+      RANGES(1) = 0
+      RANGEL(1) = IDSEQ
+      IREL = 0
+C   COUNT MOTIFS
+      NMOT = 0
+      MOTIF = 0
+C   SET POINTER TO SEARCH STRINGS ARRAY
+      ISTRNG = 1
+C   SET POINTER TO WEIGHT ARRAY
+      IWT = 1
+      TCLASS = 'A'
+C   SET CLASS TO EXACT MATCH
+      ICLASS = 1
+100   CONTINUE
+      IOK = 1
+      PROMPT(1) = 'Exact match'
+      PROMPT(2) = 'Percentage match'
+      PROMPT(3) = 'Cut-off score and score matrix'
+      PROMPT(4) = 'Cut-off score and weight matrix'
+      PROMPT(5) = 'Direct repeat'
+      PROMPT(6) = 'Membership of set'
+      PROMPT(7) = 'Pattern complete'
+      CALL RADION('Select motif definition mode',PROMPT,7,ICLASS,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(ICLASS.LT.1) RETURN
+      IF(ICLASS.EQ.7)GO TO 900
+101   CONTINUE
+      LKEY = 0
+      CALL GTSTR('Motif name',' ',KEYNS(MOTIF+1),LKEY,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 101
+      END IF
+      IF(MOTIF.GT.0) THEN
+        IF(COMBIN(MOTIF).NE.'N') THEN
+          IC = 1
+          PROMPT(1) = 'And'
+          PROMPT(2) = 'Or'
+          PROMPT(3) = 'Not'
+          CALL RADION('Select logical operator',PROMPT,3,IC,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IC.LT.1) RETURN
+          IF(IC.EQ.1) TCLASS = 'A'
+          IF(IC.EQ.2) TCLASS = 'O'
+          IF(IC.EQ.3) TCLASS = 'N'
+        ELSE
+          IC = 1
+          PROMPT(1) = 'And'
+          PROMPT(2) = 'Not'
+          CALL RADION('Select logical operator',PROMPT,2,IC,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IC.LT.1) RETURN
+          IF(IC.EQ.1) TCLASS = 'A'
+          IF(IC.EQ.2) TCLASS = 'N'
+        END IF
+      END IF
+      NMOT = NMOT + 1
+      MOTIF = MOTIF + 1
+      IF(NMOT.GT.MAXMOT)THEN
+        WRITE(KBOUT,*)'Maximum number of motifs exceeded'
+        GO TO 901
+      END IF
+      CLASS(MOTIF) = ICLASS
+      COMBIN(MOTIF) = TCLASS
+C  IF NOT THE FIRST MOTIF, AND AN ANDED MOTIF OR THE FIRST IN A LIST OF ORS
+C  GET ITS RANGE ETC
+      IOK = 0
+      IF((MOTIF.GT.1).AND.(COMBIN(MOTIF).NE.'O'))THEN
+C        WRITE(KBOUT,1002)
+C1002    FORMAT(' RANGES ARE DEFINED RELATIVE TO OTHER MOTIFS',/,
+C     +' AND BY STARTS AND DISTANCES.')
+        IOK = 1
+        MININ = 1
+        MAXIN = MOTIF
+211     CONTINUE
+        MAXIN = MAXIN - 1
+        IF(COMBIN(MAXIN).NE.'A') GO TO 211
+        IREL = MAXIN
+        CALL GETINT(MININ,MAXIN,IREL,'Number of reference motif',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IREL = IVAL
+        RELMOT(MOTIF) = IREL
+        IOK = 1
+        RELEND(MOTIF) = 5
+        MININ = -1000
+        MAXIN = 1000
+        ID = LENGTH(IREL) + 1
+        IF(CLASS(IREL).EQ.5) ID = 1
+        CALL GETINT(MININ,MAXIN,ID,'Relative start position',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        ID = IVAL
+        RANGES(MOTIF) = ID - 1
+        MININ = 0
+        MAXIN = 1000
+        IR = 0
+        CALL GETINT(MININ,MAXIN,IR,'Number of extra positions',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        IR = IVAL
+        RANGEL(MOTIF) = IR
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+C  NEED TO SET RANGES TO THOSE OF THE FIRST IN A SET OF ORED MOTIFS
+C  SET TO THOSE OF THE PREVIOUS MOTIF BECAUSE IT MUST BE THE SAME
+        RANGES(MOTIF) = RANGES(MOTIF-1)
+        RANGEL(MOTIF) = RANGEL(MOTIF-1)
+        RELEND(MOTIF) = RELEND(MOTIF-1)
+        RELMOT(MOTIF) = IREL
+      END IF
+      CALL GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,MATRIX,MAT1,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      IF(IOK.EQ.0) GO TO 100
+901   CONTINUE
+      WRITE(KBOUT,*)' Error in pattern definition'
+      IOK = 1
+      RETURN
+900   CONTINUE
+      EXPTT = EXPTT * EXPTS
+      PROBT = PROBT * PROBS
+      PMINT = PMINT * PMINS
+      PMAXT = PMAXT * PMAXS
+      CALL YESNO(IN,'Save pattern in a file',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.LT.0)RETURN
+      IF(IN.EQ.0)THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVSV,FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        CALL SAVSIG(
+     +  IDEVSV,STRING,MAXSTR,
+     +  LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +  WTSTR,RELEND,NAMSAV,COMBIN,KEYNS,IDEV,FILNAM,
+     +  KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,CHRSET,IDM,TITLE)
+        CLOSE(UNIT=IDEVSV)
+      END IF
+      DO 899 I = 2,NMOT
+        IF(CLASS(I).EQ.5) THEN
+          RANGEL(I) = RANGEL(I) + 1
+        ELSE
+          RANGEL(I) = RANGEL(I) + LENGTH(I)
+        END IF
+899   CONTINUE
+C  RETURN STRING LENGTH
+      MAXSTR = ISTRNG - 1
+      IOK = 0
+      END
+      SUBROUTINE DSPLAY(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEV,
+     +CLASS,MATCHQ,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,IOPT,
+     +XMAX,XMIN,YMAX,YMIN,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,KEYNS)
+      INTEGER MATCHP(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER MATCHQ(NMOT)
+      CHARACTER SEQ(IDSEQ)
+      REAL MATCHS(NMOT),MINSCR,MAXSCR
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      INTEGER MAT1(IDM,IDM),MATRIX(IDM,IDM)
+      INTEGER WTSTR(NMOT)
+      CHARACTER*(*) KEYNS(NMOT)
+      EXTERNAL PSCORE
+C  ADD SCORES
+      T = 0.
+      DO 10 I = 1,NMOT
+        IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+10    CONTINUE
+      POBS = 1.0
+      IF((PMINC.LT.1.0).OR.(IOPT.EQ.3))THEN
+C  CALC PROB
+        DO 20 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            CALL GETP(CLASS(I),SEQ(MATCHP(I)+KSTART-1),LENGTH(I),
+     +      IDM,MATRIX,MAT1,WEIGHT(MAX(1,WTSTR(I))))
+            PROB = PSCORE(MATCHS(I))
+            POBS = POBS * PROB
+          END IF
+20      CONTINUE
+        IF((PMINC.LT.1.0).AND.(POBS.GT.PMINC))RETURN
+      END IF
+C
+      IF(T.GT.MAXSCR) MAXSCR = T
+      IF(T.LT.MINSCR) MINSCR = T
+      IF(T.LT.CUTSCR) RETURN
+C
+      IF(IOPT.EQ.1)THEN
+      WRITE(IDEV,1001)
+1001  FORMAT(' Match')
+      DO 100 I=1,NMOT
+        J = I
+C  CHECK FOR NO MATCH (NEEDED FOR ORED MOTIFS)
+        IF(MATCHP(J).NE.0)THEN
+        WRITE(IDEV,1000)MATCHP(J)+KSTART-1,KEYNS(I)
+        WRITE(IDEV,1002)(SEQ(K),K=MATCHP(J),MATCHP(J)+LENGTH(J)-1)
+C  REPEAT ?
+        IF(CLASS(J).EQ.5)THEN
+          WRITE(IDEV,1002)(SEQ(K),K=MATCHQ(J),MATCHQ(J)+LENGTH(J)-1)
+          WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+        END IF
+        END IF
+100   CONTINUE
+1000  FORMAT(' ',I7,' ',A8)
+1002  FORMAT('       ',60A1)
+      IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+1004  FORMAT(' Probability =',E10.4)
+      RETURN
+      END IF
+      IF(IOPT.EQ.3)THEN
+C       MARK THE POSITION OF THE FIRST NON-ZERO MATCH ONLY
+        DO 200 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            X = MATCHP(I) + KSTART - 1
+            Y = LOG(1.-POBS)
+            CALL VECTOM
+            CALL LINE(X,X,YMIN,Y,XMAX,XMIN,YMAX,YMIN,
+     +      MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+            CALL VT100M
+            GO TO 201
+          END IF
+200     CONTINUE
+201     CONTINUE
+      END IF
+      IF(IOPT.EQ.2)THEN
+        MINP = 999999
+        MAXP = -999999
+        DO 300 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C  REPEAT ?
+            IF(CLASS(I).EQ.5) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+300     CONTINUE
+        WRITE(IDEV,1000)MINP+KSTART-1
+        WRITE(IDEV,1002)
+     +  ((SEQ(K1),K1=K2,MIN(K2+59,MAXP)),K2=MINP,MAXP,60)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+        RETURN
+      END IF
+      IF(IOPT.EQ.4)THEN
+        MINP = 999999
+        MAXP = -999999
+        K1 = 0
+        DO 400 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K1.EQ.0) INAM = I
+            K1 = 1
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C           REPEAT ?
+            IF(CLASS(I).EQ.5) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+400     CONTINUE
+        CALL EMBOUT(KEYNS(INAM),MINP+KSTART-1,MAXP+KSTART-1,' ',
+     +  'Program',IDEV)
+      END IF
+      END
+      SUBROUTINE MOTIF3(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND,MATRIX,IDM)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER MATRIX(IDM,IDM)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFIT5(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND,MATRIX,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE SQFIT5(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND,MATRIX,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER MATRIX(IDM,IDM),DTONUM
+      EXTERNAL DTONUM
+      MINSCR = MINSC
+C
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+C   TRY ALL POSSIBLE POSITIONS FOR MATCHING AND SCORE FOR EACH
+C   POINT TO ARRAY ELEMENT CORRESPONDING TO FIRST BASE
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + MATRIX(DTONUM(SEQ(IP)),DTONUM(STRING(J)))
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF4(SEQ,IDIM1,LENGTH,ISTART,IEND,
+     +WEIGHT,CUTOFF,MATCHP,MATCHS,IFOUND,IDM)
+      REAL WEIGHT(IDM,LENGTH)
+      REAL MATCHS
+      CHARACTER SEQ(IDIM1)
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.LENGTH)RETURN
+      CALL FMOT4(SEQ(ISTART),L1,WEIGHT,LENGTH,CUTOFF,SCORE,IFOUND,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = ISTART+IFOUND-1
+      MATCHS = SCORE
+      END
+C*********************************************************************
+      SUBROUTINE FMOT4(SEQ,IDIM,WT,LENGTH,CUTOFF,SUM,IFOUND,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL WT(IDM,LENGTH)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=1,IDIM-LENGTH+1
+        SUM=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          IP = CTONUM(SEQ(J))
+          SUM=SUM+WT(IP,K)
+5       CONTINUE
+        IF(SUM.GE.CUTOFF)THEN
+        IFOUND = I
+        RETURN
+        END IF
+10    CONTINUE
+      IFOUND = 0
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF8(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),REPEAT
+      REAL MATCHS
+      EXTERNAL REPEAT
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL REPEAT I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A REPEAT LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = REPEAT(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.ICUT)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ-2*LENGTH+1,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I + I3STAR -1
+        IF((LOOPJ1+LENGTH-1).GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ-LENGTH+1,I+I3END-1)
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN REPEAT NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES FORWARDS
+C
+         ISUM = REPEAT(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+      END
+C*********************************************************************
+      INTEGER FUNCTION REPEAT(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      CHARACTER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),CTONUM
+      EXTERNAL CTONUM
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES FORWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P-1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 + 1
+C      WRITE(*,*)'I5,I3',I5,I3
+       L5 = CTONUM(SEQ(I5))
+       L3 = CTONUM(SEQ(I3))
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      REPEAT = L
+      END
+      SUBROUTINE MOTIF6(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,
+     +MATCHP,MATCHS,IFOUND,CUT,JOB)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS
+      INTEGER CUT
+      INTEGER INSET1
+      EXTERNAL INSET1
+      IFOUND = 0
+      ISTEP = 1
+C      IF(JOB.EQ.1)ISTEP = CUT
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.1)RETURN
+      IFOUND = INSET1(SEQ(ISTART),L1,STRING,IDIM2,ISTEP)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = ISTART+IFOUND-1
+      MATCHS = 1.
+      END
+      INTEGER FUNCTION INSET1(SEQ,IDIM,STRING,IDS,INC)
+      CHARACTER SEQ(IDIM),STRING(IDS)
+      INTEGER INSET
+      EXTERNAL INSET
+      INSET1 = 0
+      ISEQ = 1 - INC
+10    CONTINUE
+        ISEQ = ISEQ + INC
+        IF(ISEQ.GT.IDIM)RETURN
+        IMATCH = INSET(SEQ(ISEQ),STRING,IDS)
+        IF(IMATCH.EQ.1)THEN
+          INSET1 = ISEQ
+          RETURN
+        END IF
+        GO TO 10
+      END
+      INTEGER FUNCTION INSET(SEQ,STRING,IDS)
+      CHARACTER SEQ,STRING(IDS)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      INSET = 1
+      KSEQ = CTONUM(SEQ)
+      DO 20 I = 1,IDS
+        IF(KSEQ.EQ.CTONUM(STRING(I)))RETURN
+20    CONTINUE
+      INSET = 0
+      END
+      SUBROUTINE SETCMP(SEQ,IDIM,IDM)
+      PARAMETER (MAXCHR = 26)
+      CHARACTER SEQ(IDIM)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      COMMON /COMPC/COMP(MAXCHR)
+      SAVE /COMPC/
+      DO 10 I = 1,IDM
+        COMP(I) = 0.0
+10    CONTINUE
+      DO 20 I = 1,IDIM
+        J = DTONUM(SEQ(I))
+        COMP(J) = COMP(J) + 1.
+20    CONTINUE
+      T = MAX(1,IDIM)
+      DO 30 I = 1,IDM
+        COMP(I) = COMP(I) / T
+30    CONTINUE
+      END
diff --git a/src/staden/patternpc.f b/src/staden/patternpc.f
new file mode 100644
index 0000000..56e11d4
--- /dev/null
+++ b/src/staden/patternpc.f
@@ -0,0 +1,1593 @@
+C   9-11-90 Large number of changes relating to use of file of
+C           file names and removal of radio
+C  3-7-91 Removed annotation "filename" from pattern files
+C 18-7-91 Added titles to pattern files
+C 2-3-92 set filnam = ' ' for calls to  openf1
+      SUBROUTINE GETMF(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,MATRIX,MAT1,
+     +PMINT,PMAXT,PROBT,EXPTT,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,IPROB,TITLE)
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT),MATRIX(IDM,IDM),MAT1(IDM,IDM)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),TCLASS,CHRSET(IDM)
+      CHARACTER*(*) NAMSAV(MAXMOT),KEYNS(MAXMOT),TITLE
+      EXTERNAL PSCORE
+C   GETS PATTERN DEFINITIONS IN TERMS OF MOTIFS
+      PMINT = 1.0
+      PMAXT = 1.0
+      PROBT = 1.0
+      EXPTS = 0.0
+      PROBS = 0.0
+      PMINS = 0.0
+      PMAXS = 0.0
+      EXPTT = 1.0
+      IOK = 1
+      DO 10 I=1,MAXMOT
+        RELEND(I) = 5
+        COMBIN(I) = 'A'
+10    CONTINUE
+      RANGES(1) = 0
+      RANGEL(1) = IDSEQ
+      IREL = 0
+C   COUNT MOTIFS
+      NMOT = 0
+      MOTIF = 0
+C   SET POINTER TO SEARCH STRINGS ARRAY
+      ISTRNG = 1
+C   SET POINTER TO WEIGHT ARRAY
+      IWT = 1
+C get title
+      READ(JDEV,1000,ERR=901,END=901)TITLE
+ 1000 FORMAT(A)
+100   CONTINUE
+      CALL GETCLS(JDEV,KBOUT,ICLASS,TCLASS,KEYNS(MOTIF+1))
+1001  FORMAT(I7)
+      IF(ICLASS.LT.0) GO TO 901
+      IF(ICLASS.GT.MAXCLS)GO TO 901
+      IF(ICLASS.EQ.0)GO TO 900
+      IF((TCLASS.EQ.'O').AND.(NMOT.LT.1))THEN
+        WRITE(KBOUT,*)'CANNOT OR FIRST MOTIF'
+        GO TO 901
+      END IF
+      IF((TCLASS.EQ.'N').AND.(NMOT.LT.1))THEN
+        WRITE(KBOUT,*)'CANNOT NOT FIRST MOTIF'
+        GO TO 901
+      END IF
+      IF((TCLASS.EQ.'O').AND.(COMBIN(MOTIF).EQ.'N'))THEN
+        WRITE(KBOUT,*)'CANNOT OR WITH NOTTED MOTIF'
+        GO TO 901
+      END IF
+      NMOT = NMOT + 1
+      MOTIF = MOTIF + 1
+      IF(NMOT.GT.MAXMOT)THEN
+        WRITE(KBOUT,*)'MAXIMUM NUMBER OF MOTIFS EXCEEDED'
+        GO TO 901
+      END IF
+      CLASS(MOTIF) = ICLASS
+      COMBIN(MOTIF) = TCLASS
+C  IF NOT THE FIRST MOTIF, AND AN ANDED MOTIF OR THE FIRST IN A LIST OF ORS
+C  GET ITS RANGE ETC
+      IF((MOTIF.GT.1).AND.(COMBIN(MOTIF).NE.'O'))THEN
+        READ(JDEV,1001,ERR=901)IREL
+        IF(IREL.LT.0)GO TO 901
+        IF(IREL.LT.1)GO TO 901
+        IF(IREL.GT.MOTIF-1)THEN
+          WRITE(KBOUT,*)'CAN ONLY REFER BACK'
+          GO TO 901
+        END IF
+        IF(COMBIN(IREL).NE.'A')THEN
+          WRITE(KBOUT,*)'CAN ONLY REFER TO AN ANDED MOTIF'
+          GO TO 901
+        END IF
+        RELMOT(MOTIF) = IREL
+        READ(JDEV,1001,ERR=901)ID
+        READ(JDEV,1001,ERR=901)IR
+        IF(IR.LT.0)GO TO 901
+        RANGES(MOTIF) = ID - 1
+        RANGEL(MOTIF) = IR
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+C  NEED TO SET RANGES TO THOSE OF THE FIRST IN A SET OF ORED MOTIFS
+C  SET TO THOSE OF THE PREVIOUS MOTIF BECAUSE IT MUST BE THE SAME
+        RANGES(MOTIF) = RANGES(MOTIF-1)
+        RANGEL(MOTIF) = RANGEL(MOTIF-1)
+        RELEND(MOTIF) = RELEND(MOTIF-1)
+        RELMOT(MOTIF) = IREL
+      END IF
+      XRAN = 1.0
+C  NOW GET DETAILS SPECIFIC TO EACH CLASS OF MOTIF
+      CALL GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,MATRIX,MAT1,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      IF(IOK.EQ.0) GO TO 100
+901   CONTINUE
+      WRITE(KBOUT,*)' Error in pattern definition'
+      IOK = 1
+      CLOSE(UNIT=JDEV)
+      RETURN
+900   CONTINUE
+      EXPTT = EXPTT * EXPTS
+      PROBT = PROBT * PROBS
+      PMINT = PMINT * PMINS
+      PMAXT = PMAXT * PMAXS
+      CLOSE(UNIT=JDEV)
+      DO 899 I = 2,NMOT
+        IF(CLASS(I).EQ.5) THEN
+          RANGEL(I) = RANGEL(I) + 1
+        ELSE
+          RANGEL(I) = RANGEL(I) + LENGTH(I)
+        END IF
+899   CONTINUE
+C  RETURN STRING LENGTH
+      MAXSTR = ISTRNG - 1
+      IOK = 0
+      END
+      SUBROUTINE GETMC(KBIN,KBOUT,STRING,MAXSTR,ISTRNG,
+     +LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,
+     +STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV,WTSTR,JDEV,IOK,
+     +RELEND,IDSEQ,IDEVSV,IDM,COMBIN,MAXCLS,MATRIX,MAT1,
+     +PMINT,PMAXT,PROBT,EXPTT,EXPTS,PROBS,PMINS,PMAXS,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,MOTIF,IWT,NAMSAV,FILNAM,IPROB)
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT),STRNGS(MAXMOT)
+      INTEGER WTSTR(MAXMOT),RANGET(MAXMOT),RANGEM(MAXMOT)
+      INTEGER RELEND(MAXMOT),MATRIX(IDM,IDM),MAT1(IDM,IDM)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),HELPF*(*)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT)
+      CHARACTER COMBIN(MAXMOT),CHRSET(IDM)
+      CHARACTER*(*) NAMSAV(MAXMOT)
+      EXTERNAL PSCORE
+      XRAN = 1.0
+      PMIN = 1.
+      PMAX = 1.
+      PROB = 1.
+C  NOW GET DETAILS SPECIFIC TO EACH CLASS OF MOTIF
+      IF(CLASS(MOTIF).EQ.1)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT1(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),KBIN,KBOUT,
+     +  IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN  = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMIN = PROB
+          PMAX = PROB
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+        CUTOFF(MOTIF) = LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.2)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT2(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMIN = PROB
+          PMAX = PSCORE(SCMAX)
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.3)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        CALL GETMT3(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),MATRIX,IDM,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF) * 15.7
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        STRNGS(MOTIF) = ISTRNG
+        ISTRNG = ISTRNG + LENGTH(MOTIF)
+      ELSE IF(CLASS(MOTIF).EQ.4)THEN
+        MXWT = MAXWTS - IWT + 1
+        CALL GETMT4(WEIGHT(IWT),MXWT,LENGTH(MOTIF),CUTOFF(MOTIF),
+     +  SCMAX,IDEV,KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        WTSTR(MOTIF) = IWT
+        IWT = IWT + LENGTH(MOTIF)*IDM
+        NAMSAV(MOTIF) = FILNAM
+      ELSE IF(CLASS(MOTIF).EQ.5)THEN
+        CALL GETMT8(RANGET(MOTIF),RANGEM(MOTIF),LENGTH(MOTIF),
+     +  CUTOFF(MOTIF),KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = LENGTH(MOTIF) * 15.7
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        XRAN = ABS(RANGEM(MOTIF))-ABS(RANGET(MOTIF))+1
+      ELSE IF(CLASS(MOTIF).EQ.6)THEN
+        MXSTR = MAXSTR - ISTRNG + 1
+        MXWT = MAXWTS - IWT + 1
+        FILNAM = ' '
+        CALL GETMT6(STRING(ISTRNG),MXSTR,LENGTH(MOTIF),WEIGHT(IWT),
+     +  MXWT,CUTOFF(MOTIF),KBIN,KBOUT,JDEV,IDEV,IDM,
+     +  FILNAM,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+        IF(IOK.NE.0)GO TO 901
+        IF(IPROB.EQ.0) THEN
+          CALL GETP(CLASS(MOTIF),STRING(ISTRNG),
+     +    LENGTH(MOTIF),IDM,MATRIX,MAT1,WEIGHT(IWT))
+          SCMIN = CUTOFF(MOTIF)
+          SCMAX = SCMIN
+          PROB = PSCORE(SCMIN)
+          PMAX = PSCORE(SCMAX)
+          PMIN = PROB
+        END IF
+        WTSTR(MOTIF) = IWT
+        IWT = IWT + LENGTH(MOTIF)*IDM
+        IF(FILNAM.NE.' ') NAMSAV(MOTIF) = FILNAM
+      END IF
+C  GET RANGE OF POSITIONS FOR PROBABILITY CALC
+      LRANGE = RANGEL(MOTIF) + 1
+      IF(COMBIN(MOTIF).EQ.'A')THEN
+        IF(EXPTS.GT.0.0)THEN
+          EXPTT = EXPTT * EXPTS
+          PROBT = PROBT * PROBS
+          PMINT = PMINT * PMINS
+          PMAXT = PMAXT * PMAXS
+        END IF
+        EXPTS = LRANGE * XRAN * PROB
+        PROBS = PROB
+        PMINS = PMIN
+        PMAXS = PMAX
+      ELSE IF(COMBIN(MOTIF).EQ.'N')THEN
+        IF(EXPTS.GT.0.0)THEN
+          EXPTT = EXPTT * EXPTS
+          PROBT = PROBT * PROBS
+          PMINT = PMINT * PMINS
+          PMAXT = PMAXT * PMAXS
+        END IF
+        PMIN = 1.0 - PMIN
+        PMAX = 1.0 - PMAX
+        PROB = 1.0 - PROB
+        EXPTS = PROB ** (LRANGE * XRAN)
+        PROBS = PROB
+        PMINS = PMIN
+        PMAXS = PMAX
+      ELSE IF(COMBIN(MOTIF).EQ.'O')THEN
+        EXPTS = EXPTS + LRANGE * XRAN * PROB
+        PROBS = PROBS + PROB
+        PMINS = PMINS + PMIN
+        PMAXS = PMAXS + PMAX
+      END IF
+      IF(IPROB.EQ.0) CALL WRTSCR(SCMIN,PROB,KBOUT)
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+C*********************************************************************
+      SUBROUTINE DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLE)
+      CHARACTER STRING(MAXSTR)
+      INTEGER LENGTH(NMOT),CLASS(NMOT),RELMOT(NMOT),RELEND(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),RANGET(NMOT),RANGEM(NMOT)
+      INTEGER WTSTR(NMOT),STRNGS(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT)
+      CHARACTER*(*) KEYNS(NMOT),TITLE
+C
+C
+C  DESCRIBE THE SIGNAL
+C
+      WRITE(KBOUT,1000)
+1000  FORMAT(/' Pattern description',/)
+      WRITE(KBOUT,1006)TITLE
+ 1006 FORMAT(' ',A)
+      DO 100 I = 1,NMOT
+        WRITE(KBOUT,1001)I,KEYNS(I),CLASS(I)
+1001    FORMAT(' Motif ',I2,' named ',A8,' is of class ',I4)
+C
+        IF(CLASS(I).EQ.1)THEN
+          WRITE(KBOUT,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1002      FORMAT(' Which is an exact match to the string',
+     +    /,(' ',50A1))
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELMOT(I)
+1008      FORMAT(' and the N-terminal residue can take positions',
+     +            I7,' to ',I7,/,
+     +           ' relative to the N-terminal end of motif',I4)
+        ELSE IF(CLASS(I).EQ.2)THEN
+          WRITE(KBOUT,1003)CUTOFF(I),
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1003      FORMAT(' which is a match of score ',F6.0,
+     +            ' to the string',/,(' ',50A1))
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELMOT(I)
+        ELSE IF(CLASS(I).EQ.3)THEN
+          WRITE(KBOUT,1003)CUTOFF(I),
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELMOT(I)
+        ELSE IF(CLASS(I).EQ.4)THEN
+          WRITE(KBOUT,1004)CUTOFF(I)
+1004      FORMAT(' Which is a match to a weight matrix with score',
+     +             F8.3)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I)-LENGTH(I)+1,RELMOT(I)
+        ELSE IF(CLASS(I).EQ.5)THEN
+          WRITE(KBOUT,1014)LENGTH(I),CUTOFF(I)
+1014      FORMAT(' Which is a repeat with repeat length',I5,
+     +           ' and score ',F6.0)
+          WRITE(KBOUT,1015)RANGET(I)-LENGTH(I)-1,RANGEM(I)-LENGTH(I)-1
+1015      FORMAT(' The loop-out can have sizes ',I6,' to ',I6)
+          IF(I.NE.1)
+     +    WRITE(KBOUT,1008)RANGES(I)+1,
+     +    RANGES(I)+RANGEL(I),RELMOT(I)
+        ELSE IF(CLASS(I).EQ.6)THEN
+          WRITE(KBOUT,1005)CUTOFF(I)
+1005      FORMAT(' Which is membership of a set with score',F8.3)
+        END IF
+        IF(COMBIN(I).EQ.'O')WRITE(KBOUT,1010)
+1010    FORMAT(' It is orred with the previous motif.')
+        IF(COMBIN(I).EQ.'N')WRITE(KBOUT,1012)
+1012    FORMAT(' It is notted with the previous motif.')
+        IF((COMBIN(I).EQ.'A').AND.(I.GT.1))WRITE(KBOUT,1009)
+1009    FORMAT(' It is anded with the previous motif.')
+100   CONTINUE
+      END
+C*********************************************************************
+      SUBROUTINE SAVSIG(
+     +            IDEV,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,NAMSAV,COMBIN,KEYNS,IDEVW,FILET,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,CHRSET,IDM,TITLEP)
+C 18-7-91 Added pattern title
+      CHARACTER STRING(MAXSTR),FILET*(*)
+      INTEGER LENGTH(NMOT),CLASS(NMOT),RELMOT(NMOT),RELEND(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),RANGET(NMOT),RANGEM(NMOT)
+      INTEGER WTSTR(NMOT),STRNGS(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),SCLASS*10,TITLE(60),NEWT(60)
+      CHARACTER HELPF*(*),CHRSET(IDM)
+      CHARACTER*(*) KEYNS(NMOT)
+      CHARACTER*(*) NAMSAV(NMOT),TITLEP
+      EXTERNAL NOTIRL
+C
+C
+C  SAVE THE SIGNAL
+C
+ 1    CONTINUE
+      LIN = 0
+      CALL GTSTR('Pattern title',' ',TITLEP,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      WRITE(IDEV,1000)TITLEP
+ 1000 FORMAT(' ',A)
+      DO 100 I = 1,NMOT
+        WRITE(SCLASS,1014)CLASS(I)
+1014    FORMAT(I10)
+        DO 10 K = 1,10
+          IF(SCLASS(K:K).NE.' ')THEN
+            J = 0
+            DO 5 K1 = K,10
+              J = J+1
+              SCLASS(J:J) = SCLASS(K:K)
+              SCLASS(K:K) = ' '
+5           CONTINUE
+          END IF
+10      CONTINUE
+        WRITE(IDEV,1001)COMBIN(I),SCLASS,KEYNS(I)
+1001        FORMAT(' ',A,A,' ',A8,' Class ')
+          IF((I.NE.1).AND.(COMBIN(I).NE.'O'))THEN
+            WRITE(IDEV,1008)RELMOT(I)
+1008        FORMAT(I7,'      Relative motif')
+1003        FORMAT(I7,'      Relative start position')
+1004        FORMAT(I7,'      Number of extra positions')
+            WRITE(IDEV,1003)RANGES(I)+1
+            WRITE(IDEV,1004)RANGEL(I)
+          END IF
+        IF(CLASS(I).EQ.1)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+1002      FORMAT((' ',50A1))
+          WRITE(IDEV,1006)
+1006      FORMAT(' @ End of string')
+        ELSE IF(CLASS(I).EQ.2)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(IDEV,1006)
+          WRITE(IDEV,1009)CUTOFF(I)
+        ELSE IF(CLASS(I).EQ.3)THEN
+          WRITE(IDEV,1002)
+     +    (STRING(K),K=STRNGS(I),STRNGS(I)+LENGTH(I)-1)
+          WRITE(IDEV,1006)
+          WRITE(IDEV,1009)CUTOFF(I)
+        ELSE IF(CLASS(I).EQ.4)THEN
+          WRITE(IDEV,1005)NAMSAV(I)
+1005      FORMAT(A)
+1010      FORMAT(I7,'      Length')
+1012      FORMAT(I7,'      Minimum loop')
+1013      FORMAT(I7,'      Maximum loop')
+1009      FORMAT(F10.5,'   Cutoff')
+        ELSE IF(CLASS(I).EQ.5)THEN
+          WRITE(IDEV,1010)LENGTH(I)
+          WRITE(IDEV,1012)RANGET(I)-LENGTH(I)-1
+          WRITE(IDEV,1013)RANGEM(I)-LENGTH(I)-1
+          WRITE(IDEV,1009)CUTOFF(I)
+        ELSE IF(CLASS(I).EQ.6)THEN
+C          WRITE(*,*)NAMSAV(I)
+          IF(NAMSAV(I)(1:11).EQ.'FILENOTUSED') THEN
+            WRITE(KBOUT,1023)I
+1023        FORMAT(' Motif',I3,
+     +      ' needs a file name to store set as a weight matrix')
+            FILET = ' '
+            CALL OPENF1(IDEVW,FILET,1,IOK,KBIN,KBOUT,
+     +      'Weight matrix file name',
+     +      IHELPS,IHELPE,HELPF,IDEVH)
+            IF(IOK.NE.0)RETURN
+            NAMSAV(I) = FILET
+            CALL FILLC(TITLE,60,' ')
+20          CONTINUE
+            WRITE(KBOUT,1026)
+1026        FORMAT(' Weight matrix needs a title')
+            L = 0
+            CALL GETSTR('Title',TITLE,NEWT,60,L,KBOUT,KBIN,INFLAG)
+            IF(L.LT.0) RETURN
+            IF(INFLAG.EQ.1) THEN
+              CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+            GO TO 20
+            END IF
+            IF(INFLAG.EQ.2) RETURN
+            IF(L.GT.0)CALL SQCOPY(NEWT,TITLE,L)
+            MIDDLE = 0
+            TOP = LENGTH(I)
+            CALL WRTSCN(TITLE,LENGTH(I),MIDDLE,CUTOFF(I),TOP,
+     +      IDM,WEIGHT(WTSTR(I)),CHRSET,
+     +      IDEVW)
+          END IF
+          WRITE(IDEV,1005)NAMSAV(I)
+        END IF
+100   CONTINUE
+      END
+      SUBROUTINE GETMT1(STRING,MAXSTR,LENGTH,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+C  GETS DETAILS FOR MOTIF CLASS 1
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+      END IF
+      IOK = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE GETMT2(STRING,MAXSTR,LENGTH,CUTOFF,KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+      REAL MININ,MAXIN
+C  GETS DETAILS FOR MOTIF CLASS 2
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+        MININ = 1.
+        MAXIN = LENGTH
+        CUTOFF = LENGTH
+        CALL GETRL(MININ,MAXIN,CUTOFF,'Minimum matches',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+        READ(JDEV,1002,ERR=901)CUTOFF
+1002    FORMAT(F10.0)
+      END IF
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+C*********************************************************************
+      SUBROUTINE GETMT3(STRING,MAXSTR,LENGTH,CUTOFF,MAT3,IDMAT3,
+     +KBIN,KBOUT,IOK,JDEV,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      CHARACTER NEW(50),STRING(MAXSTR)
+      INTEGER MAT3(IDMAT3,IDMAT3)
+      REAL MININ,MAXIN
+C  GETS DETAILS FOR MOTIF CLASS 3
+      IOK = 1
+      IF(KBIN.EQ.JDEV)THEN
+10      CONTINUE
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,50,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+        CALL GTSCR(STRING,LENGTH,MAT3,IDMAT3,ISMIN,ISMAX)
+        MININ = ISMIN
+        MAXIN = ISMAX
+        CUTOFF = MAXIN
+        CALL GETRL(MININ,MAXIN,CUTOFF,'Minimum score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+        LENGTH = MAXSTR
+        CALL ARRFIL(JDEV,STRING,LENGTH,KBOUT)
+        IF(LENGTH.LT.1)THEN
+          WRITE(KBOUT,*)'ZERO LENGTH STRING'
+          RETURN
+        END IF
+        READ(JDEV,1002,ERR=901)CUTOFF
+1002    FORMAT(F10.0)
+      END IF
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+      SUBROUTINE GETMT6(STRING,MAXSTR,LENGTH,
+     +WEIGHT,MAXWTS,CUTOFF,KBIN,KBOUT,JDEV,IDEV,IDM,
+     +FILNAM,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXS = 75)
+      CHARACTER STRING(MAXSTR),FILNAM*(*),NEW(MAXS)
+      REAL WEIGHT(MAXWTS),MININ,MAXIN
+      PARAMETER (MAXCHR = 26,MAXLEN = 120)
+      INTEGER SUM(MAXCHR,MAXLEN),TOT(MAXLEN)
+      PARAMETER (MAXPRM = 8)
+      CHARACTER PROMPT(2)*(MAXPRM)
+C  GETS DETAILS FOR MOTIF CLASS 6
+      IOK = 1
+      IOPT = 0
+      DO 10 I = 1,MAXLEN
+        TOT(I) = 0
+10    CONTINUE
+100   CONTINUE
+      IF(KBIN.EQ.JDEV)THEN
+        IOPT = 1
+        PROMPT(1) = 'Keyboard'
+        PROMPT(2) = 'File'
+        CALL RADION('Select input mode',PROMPT,2,IOPT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IOPT.LT.1) RETURN
+        IF(IOPT.EQ.1)THEN
+20      CONTINUE
+        WRITE(KBOUT,1000)
+1000    FORMAT(' Separate sets with commas')
+        LENGTH = 0
+        CALL GETSTR('String',STRING,NEW,MAXS,LENGTH,KBOUT,KBIN,INFLAG)
+        IF(LENGTH.LT.1) RETURN
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 20
+        END IF
+        IF(INFLAG.EQ.2) RETURN
+        CALL SQCOPY(NEW,STRING,LENGTH)
+          L = LENGTH
+          CALL INTRP6(STRING,L,LENGTH,SUM,MAXCHR,MAXLEN,CUTOFF,IOK)
+          IF(IOK.NE.0)RETURN
+          MININ = 1.
+          MAXIN = LENGTH
+C          CUTOFF = LENGTH
+          CALL GETRL(MININ,MAXIN,CUTOFF,'Minimum matches',
+     +    VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+          IF(IOK.NE.0) RETURN
+          CUTOFF = VALUE
+        ELSE
+          FILNAM = ' '
+          CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +    'Weight matrix file name',
+     +    IHELPS,IHELPE,HELPF,IDEVH)
+          IF(IOK.NE.0)RETURN
+          LENGTH=MAXLEN
+          CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,CUTOFF,YMAX,IDEV,
+     +    IOK,IDM,KBOUT,IPROB)
+          IF(IOK.NE.0)THEN
+            WRITE(KBOUT,1006)
+1006        FORMAT(' Error in weight matrix')
+            RETURN
+          END IF
+        END IF
+      ELSE
+        CALL OPENF(IDEV,FILNAM,0,IOK,JDEV,KBOUT)
+        IF(IOK.NE.0)RETURN
+        LENGTH=MAXLEN
+        CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,CUTOFF,YMAX,IDEV,
+     +    IOK,IDM,KBOUT,IPROB)
+        IF(IOK.NE.0)THEN
+          WRITE(KBOUT,1006)
+          RETURN
+        END IF
+      END IF
+      CALL GETW1(SUM,WEIGHT,LENGTH,MAXCHR,MAXLEN)
+      END
+      SUBROUTINE GETMT4(WEIGHT,MAXWTS,LENGTH,CUTOFF,YMAX,IDEV,
+     +KBIN,KBOUT,IOK,JDEV,FILNAM,IDM,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IPROB)
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXLEN = 120, MAXCHR = 26)
+      REAL WEIGHT(MAXWTS)
+      CHARACTER FILNAM*(*)
+      INTEGER TOT(MAXLEN),SUM(MAXCHR,MAXLEN)
+      IF(JDEV.EQ.KBIN) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,JDEV,KBOUT,
+     +  'Weight matrix file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      ELSE
+        CALL OPENF(IDEV,FILNAM,0,IOK,JDEV,KBOUT)
+      END IF
+      IF(IOK.NE.0)RETURN
+      LENGTH=MAXLEN
+      CALL RDWMT(TOT,SUM,MIDDLE,LENGTH,MAXLEN,CUTOFF,YMAX,IDEV,
+     +  IOK,IDM,KBOUT,IPROB)
+      IF(IOK.NE.0)THEN
+        WRITE(KBOUT,*)' Error in weight matrix, option left'
+        RETURN
+      END IF
+      IF(CUTOFF.LT.0.0)CALL GETW(TOT,SUM,WEIGHT,LENGTH,IDM,MAXLEN)
+      IF(CUTOFF.GE.0.0)CALL GETW2(SUM,WEIGHT,LENGTH,IDM,MAXLEN)
+      END
+C*********************************************************************
+      SUBROUTINE INTRP6(STRING,ISEND,LENGTH,WT,MAXCHR,MAXLEN,
+     +CUTOFF,IOK)
+      CHARACTER STRING(ISEND),TERM
+      INTEGER WT(MAXCHR,MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      PARAMETER (TERM = ',')
+      IOK = 0
+C  POINT TO STRING
+      IS = 1
+      ICOL = 1
+C  COUNT FILLED COLUMNS
+      CUTOFF = 0.
+10    CONTINUE
+      CALL FILLI(WT(1,ICOL),MAXCHR,0)
+      CUTOFF = CUTOFF + 1.
+20    CONTINUE
+      IF(IS.LE.ISEND)THEN
+        IF(STRING(IS).NE.TERM)THEN
+          IROW = CTONUM(STRING(IS))
+          WT(IROW,ICOL) = 1
+          IS = IS + 1
+          GO TO 20
+        END IF
+        NC = 1
+30      CONTINUE
+        IF(IS.LE.ISEND)THEN
+          IF(STRING(IS).EQ.TERM)THEN
+            ICOL = ICOL + 1
+            CALL FILLI(WT(1,ICOL),MAXCHR,0)
+            NC = NC + 1
+            IS = IS + 1
+            GO TO 30
+          END IF
+C
+C  END OF TERMINATORS
+C
+          IF(ICOL.GT.1) GO TO 10
+C  ERROR TERMINATOR BEFORE ANY GOOD COLUMNS
+          IOK = 1
+          RETURN
+        END IF
+C  STRING ENDED WITH TERMINATOR
+        LENGTH = ICOL - NC + 1
+        RETURN
+      END IF
+C
+C  END REACHED WITH NO TERMINATOR (NORMAL)
+      IF(ICOL.GT.0)THEN
+        LENGTH = ICOL
+        IOK = 0
+        RETURN
+      END IF
+      IOK = 1
+      END
+      SUBROUTINE GETMT8(RANGES,RANGEL,LENGTH,CUTOFF,
+     +KBIN,KBOUT,JDEV,IOK,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER HELPF*(*)
+      INTEGER RANGES,RANGEL
+C  GETS DETAILS FOR MOTIF CLASS 8
+      IF(JDEV.EQ.KBIN)THEN
+      IOK = 1
+      MININ = 1
+      MAXIN = 60
+      LENGTH = 6
+      CALL GETINT(MININ,MAXIN,LENGTH,'Repeat length',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      LENGTH = IVAL
+      MININ = 0
+      MAXIN = 60
+      ID = MININ
+      CALL GETINT(MININ,MAXIN,ID,'Minimum gap',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      ID = IVAL
+      MININ = ID
+      MAXIN = MININ + 60
+      IR = ID
+      CALL GETINT(MININ,MAXIN,IR,'Maximum gap',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IR = IVAL
+C       FUDGE HERE: SHOULD REALLY LOOK AT SEQUENCE COMPOSITION
+C       AND SCORE MATRIX VALUES. ASSUMING MDM78
+        SMININ = 11.0
+        SMAXIN = LENGTH*20.
+        CUTOFF = LENGTH * 12.
+        CALL GETRL(SMININ,SMAXIN,CUTOFF,'Minimum score',
+     +  VALUE,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) RETURN
+        CUTOFF = VALUE
+      ELSE
+      READ(JDEV,1001,ERR=901)LENGTH
+      IF(LENGTH.LT.1)RETURN
+      READ(JDEV,1001,ERR=901)ID
+1001  FORMAT(I7)
+C      IF(ID.LT.0)GO TO 220
+      READ(JDEV,1001,ERR=901)IR
+      READ(JDEV,1006,ERR=901)CUTOFF
+1006  FORMAT(F10.0)
+      END IF
+      RANGES = ID + LENGTH + 1
+      RANGEL = IR + LENGTH + 1
+      IOK = 0
+      RETURN
+901   CONTINUE
+      IOK = 1
+      END
+      SUBROUTINE WRTSCN(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +SUM,CHRSET,IDEV)
+      INTEGER TOT(120)
+      REAL SUM(IDM,LENGTH)
+      CHARACTER CHRSET(IDM),TITLE(60)
+C   PROTEIN MATRICES DONT WRITE ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      CALL FILLI(TOT,120,0)
+      WRITE(IDEV,1018)TITLE
+1018  FORMAT(' ',60A1)
+1019  FORMAT(' P',20I4)
+1020  FORMAT(' N',20I4)
+1021  FORMAT(' ',A,20I4)
+1022  FORMAT(' ',2I6,2F10.3)
+      WRITE(IDEV,1022)LENGTH,MIDDLE,BOT,TOP
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 400 J=1,NLINES
+        K2=MIN((K1+19),LENGTH)
+        WRITE(IDEV,1019)(K,K=K1-MIDDLE,K2-MIDDLE)
+        WRITE(IDEV,1020)(TOT(K),K=K1,K2)
+        DO 390 I=1,IDM-4
+          WRITE(IDEV,1021)CHRSET(I),(INT(SUM(I,K)),K=K1,K2)
+390     CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+400   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE GETCLS(JDEV,KBOUT,ICLASS,TCLASS,KEYNAM)
+      CHARACTER TCLASS,STRING*21,NUMBER*10,STRNG2*10,KEYNAM*(*)
+      SAVE NUMBER
+      DATA NUMBER/'0987654321'/
+      ICLASS = 0
+10    CONTINUE
+      STRING = ' '
+      READ(JDEV,1000,ERR=10,END=70)STRING
+1000  FORMAT(A)
+      KEYNAM = STRING(14:21)
+      DO 20 I = 10,1,-1
+        K = I
+        IF(STRING(I:I).NE.' ')GO TO 21
+20    CONTINUE
+      RETURN
+21    CONTINUE
+      L = 1
+      DO 30 I = 1,K
+        L = I
+        IF(STRING(I:I).NE.' ')GO TO 31
+30    CONTINUE
+31    CONTINUE
+      TCLASS = STRING(L:L)
+      DO 40 I = 1,10
+        IF(TCLASS.EQ.NUMBER(I:I))THEN
+          TCLASS = 'A'
+          GO TO 41
+        END IF
+40    CONTINUE
+      IF((TCLASS.NE.'O').AND.(TCLASS.NE.'N').AND.(TCLASS.NE.'A')
+     +.AND.(TCLASS.NE.'-'))THEN
+        ICLASS = 99999
+        RETURN
+      END IF
+      L = L + 1
+41    CONTINUE
+      J = 11
+      STRNG2 = ' '
+      DO 50 I = K,L,-1
+        J = J-1
+        STRNG2(J:J) = STRING(I:I)
+50    CONTINUE
+        READ(STRNG2,1001,ERR=60)ICLASS
+1001    FORMAT(I10)
+        IF(TCLASS.EQ.'-')ICLASS=-1*ICLASS
+      RETURN
+60    CONTINUE
+      ICLASS = 99999
+      RETURN
+70    CONTINUE
+      END
+      SUBROUTINE GETP(ICLASS,STRING,LENGTH,IDM,
+     +MATRIX,MAT1,WEIGHT)
+C  MAT1 SIMPLE IDENTITY
+C  MATRIX MDM78
+      PARAMETER (
+     +           MAXCHR=26,
+     +           MAXSIG=120)
+      INTEGER MAT1(IDM,IDM),MATRIX(IDM,IDM)
+      INTEGER IWT(MAXCHR,MAXSIG)
+      REAL WEIGHT(IDM,LENGTH),FB(MAXCHR),F(MAXCHR)
+      CHARACTER STRING(LENGTH)
+      COMMON /COMPC/COMP(MAXCHR)
+      DO 10 I = 1,IDM
+        F(I) = COMP(I)
+10    CONTINUE
+      IF(ICLASS.EQ.1)THEN
+        CALL GETP1(STRING,LENGTH,IDM,MAT1,IDM,IWT,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.2)THEN
+        CALL GETP1(STRING,LENGTH,IDM,MAT1,IDM,IWT,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.3)THEN
+        CALL GETP3(STRING,LENGTH,IDM,MATRIX,IWT,IDM,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.4)THEN
+        CALL GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.5)THEN
+        CALL GETP8(MATRIX,IDM,IWT,IDM,LENGTH,F,FB)
+        RETURN
+      END IF
+      IF(ICLASS.EQ.6)THEN
+        CALL GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+        RETURN
+      END IF
+      END
+      SUBROUTINE WTGEN(IWT,MAXCHR,LENGTH,F,MAXS,SMIN,SCALE,JOB)
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,LENGTH)
+      REAL F(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+10    CONTINUE
+      DO 400 I = 1,MAXCHR
+        K = IWT(I,1)
+        POLYA(K) = POLYA(K) + F(I)
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        DO 490 I = 0, MAXPOL
+          POLYB(I) = 0.0
+490     CONTINUE
+        DO 500 I = 1,MAXCHR
+          K = IWT(I,J)
+          POLYB(K) = POLYB(K) + F(I)
+500     CONTINUE
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE SCGEN(IWT,MAXCHR,LENGTH,FA,FB,MAXS,
+     +SCALE,SMIN,JOB)
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,MAXCHR)
+      REAL FA(MAXCHR),FB(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+        POLYB(I) = 0.0
+10    CONTINUE
+      DO 400 I = 1,MAXCHR
+        DO 300 J = 1,MAXCHR
+          K = IWT(I,J)
+          POLYA(K) = POLYA(K) + FA(I) * FB(J)
+          POLYB(K) = POLYA(K)
+300     CONTINUE
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE SCGEN1(IWT,MAXCHR,LENGTH,FA,MAXS,
+     +SCALE,SMIN,JOB,STRING)
+C  AUTHOR RODGER STADEN
+C  PROBABILITY FOR STRINGS
+      CHARACTER STRING(LENGTH)
+      INTEGER DTONUM
+      PARAMETER (MAXPOL = 4000)
+      INTEGER IWT(MAXCHR,MAXCHR)
+      REAL FA(MAXCHR)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      EXTERNAL DTONUM
+      SAVE /POLY/
+      CMIN = SMIN
+      CSCALE = SCALE
+      LENTHC = LENGTH
+      DO 10 I = 0, MAXPOL
+        POLYA(I) = 0.0
+10    CONTINUE
+      K1 = DTONUM(STRING(1))
+      DO 400 I = 1,MAXCHR
+        K = IWT(I,K1)
+        POLYA(K) = POLYA(K) + FA(I)
+400   CONTINUE
+      MAXS1 = MAXS
+      DO 600 J = 2,LENGTH
+        DO 490 I = 0,MAXPOL
+           POLYB(I) = 0.
+490     CONTINUE
+        K1 = DTONUM(STRING(J))
+        DO 500 I = 1,MAXCHR
+          K = IWT(I,K1)
+          POLYB(K) = POLYB(K) + FA(I)
+500     CONTINUE
+        CALL POLMUL(MAXS1,MAXS,MAXSP1)
+        MAXS1 = MAXSP1
+600   CONTINUE
+C  IF JOB = 1 WANT DISTRIBUTION ONLY
+C     JOB = 2 WANT CUMMULATIVE VALUES
+C     JOB = 3 WANT DISTRIBUTION AND UNSCALED SCORES
+C     JOB = 4 WANT CUMMULATIVE VALUES AND UNSCALED SCORES
+      IF((JOB.EQ.2).OR.(JOB.EQ.4))THEN
+        DO 610 I = 1,MAXS1
+          J = MAXS1 - I
+          POLYA(J) = POLYA(J) + POLYA(J+1)
+610     CONTINUE
+      END IF
+      IF((JOB.EQ.3).OR.(JOB.EQ.4))THEN
+        T = SMIN * LENGTH
+        DO 620 I = 0,MAXS1
+          POLYB(I) = (REAL(I)/SCALE) + T
+620     CONTINUE
+      END IF
+      END
+      SUBROUTINE POLMUL(NA,NB,NC)
+      PARAMETER (MAXPOL = 4000)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +CMIN,CSCALE,LENTHC
+      PARAMETER (ZERO=0.0,SMALL=1E-10)
+      SAVE /POLY/
+C  POLYA, POLYB ARE INPUT POLYNOMIAL COEEFICIENTS
+C  POLYC  IS OUTPUT POLYNOMIAL COEEFICIENTS
+C
+      NC = NA + NB
+      DO 210 I = 0,MAXPOL
+        POLYC(I) = ZERO
+210   CONTINUE
+      IF(NC.GT.MAXPOL)RETURN
+      DO 230 I = 0,NA
+        DO 220 J = 0,NB
+          POLYC(I+J) = POLYC(I+J) + POLYA(I) * POLYB(J)
+220     CONTINUE
+230   CONTINUE
+      DO 240 I = 0,NC
+        T = POLYC(I)
+        IF(T.LT.SMALL) T = ZERO
+        POLYA(I) = T
+240   CONTINUE
+      END
+      SUBROUTINE WRTWMT(WEIGHT,IWT,MAXCHR,LENGTH,IDEV,IFLAG)
+      REAL WEIGHT(MAXCHR,LENGTH)
+      INTEGER IWT(MAXCHR,LENGTH)
+      DO 10 I = 1,MAXCHR
+        IF(IFLAG.EQ.0)
+     +  WRITE(IDEV,1000,ERR=100)(WEIGHT(I,J),J=1,LENGTH)
+        IF(IFLAG.EQ.1)
+     +  WRITE(IDEV,1001,ERR=100)(IWT(I,J),J=1,LENGTH)
+10    CONTINUE
+1000  FORMAT(' ',20F5.2)
+1001  FORMAT(' ',20I4)
+100   CONTINUE
+      END
+      SUBROUTINE WTSC(WEIGHT,IWT,MAXCHR,LENGTH,MAXS,SCALE,SMIN)
+      INTEGER IWT(MAXCHR,LENGTH)
+      REAL WEIGHT(MAXCHR,LENGTH)
+      PARAMETER (SMALL=1E-10)
+      PARAMETER (MAXPOL = 4000)
+      SMIN = 9999999.9
+      SMAX = -99999999.9
+      DO 100 J = 1,LENGTH
+        DO 90 I = 1,MAXCHR
+          T = WEIGHT(I,J)
+          IF(T.LT.SMIN)SMIN = T
+          IF(T.GT.SMAX)SMAX = T
+90      CONTINUE
+100   CONTINUE
+      SMAX = SMAX - SMIN
+      IF(SMAX.EQ.0.0)SMAX = SMALL
+C  USE LENGTH+1 TO DIVIDE MAXPOL. THIS IS AN ATTEMPT TO AVOID
+C  THE USE OF NINT GIVING A SET OF SCORES THAT SUM TO TOO
+C  HIGH A VALUE (IE GT MAXPOL)
+      MAXS = MAXPOL/(LENGTH+1)
+      MAXS = MIN(MAXS,40)
+      SCALE = REAL(MAXS)/SMAX
+      DO 200 J = 1,LENGTH
+        DO 190 I = 1,MAXCHR
+          K = NINT((WEIGHT(I,J)-SMIN)*SCALE)
+          IWT(I,J) = K
+190     CONTINUE
+200   CONTINUE
+      END
+      SUBROUTINE GETCMP(SEQ,IDIM,COMP,IDM)
+      CHARACTER SEQ(IDIM)
+      REAL COMP(IDM)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      DO 10 I = 1,IDM
+        COMP(I) = 0.0
+10    CONTINUE
+      T = 0.
+      DO 20 I = 1,IDIM
+        J = DTONUM(SEQ(I))
+        IF(J.LT.23)THEN
+          COMP(J) = COMP(J) + 1.
+          T = T + 1.
+        END IF
+20    CONTINUE
+      IF(T.GT.0.0)THEN
+        DO 30 I = 1,IDM
+          COMP(I) = COMP(I) / T
+30      CONTINUE
+      END IF
+      END
+      SUBROUTINE MATSC(WEIGHT,IDMAT,IWT,MAXCHR,LENGTH,MAXS,
+     +SMIN,SCALE)
+      INTEGER IWT(IDMAT,IDMAT)
+      INTEGER WEIGHT(IDMAT,IDMAT)
+      PARAMETER (SMALL=1E-10)
+      PARAMETER (MAXPOL = 4000)
+      ISMIN = 99999999
+      ISMAX = -99999999
+      DO 100 J = 1,IDMAT
+        DO 90 I = 1,IDMAT
+          K = WEIGHT(I,J)
+          IF(K.LT.ISMIN) ISMIN = K
+          IF(K.GT.ISMAX) ISMAX = K
+90      CONTINUE
+100   CONTINUE
+C      SMAX = ISMAX - ISMIN
+C      IF(SMAX.EQ.0.0)SMAX = SMALL
+C  DO NOTHING HERE - ASSUME VALUES ARE IN RANGE AND JUST COPY
+C  USE LENGTH+1 TO DIVIDE MAXPOL. THIS IS AN ATTEMPT TO AVOID
+C  THE USE OF NINT GIVING A SET OF SCORES THAT SUM TO TOO
+C  HIGH A VALUE (IE GT MAXPOL)
+C      MAXS = MAXPOL/(LENGTH+1)
+C      SCALE = REAL(MAXS)/SMAX
+      DO 200 J = 1,IDMAT
+        DO 190 I = 1,IDMAT
+C          K = NINT((REAL(WEIGHT(I,J))-ISMIN)*SCALE)
+          IWT(I,J) = WEIGHT(I,J) - ISMIN
+190     CONTINUE
+200   CONTINUE
+      SCALE = 1.0
+      SMIN = ISMIN
+      MAXS = ISMAX
+      END
+      REAL FUNCTION PSCORE(SCORE)
+      PARAMETER (MAXPOL = 4000)
+      COMMON /POLY/POLYA(0:MAXPOL),POLYB(0:MAXPOL),POLYC(0:MAXPOL),
+     +SMIN,SCALE,LENGTH
+      SAVE /POLY/
+C WANT PROBABILITY OF SCORING AT LEAST SCORE
+C CUMMULATIVE PROBABILITIES IN CUMP
+C SCALE FACTORS SMIN AND SCALE AND MOTIF LENGTH IS LENGTH
+      I = NINT((SCORE - SMIN * LENGTH) * SCALE)
+      PSCORE = -1.
+      IF((I.GE.0).AND.(I.LE.MAXPOL)) PSCORE = POLYA(I)
+      END
+      SUBROUTINE GETP1(STRING,LENGTH,IDMAT3,MAT2,IDMAT2,IWT,F)
+      INTEGER MAT2(IDMAT2,IDMAT2)
+      INTEGER IWT(IDMAT2,IDMAT2)
+      REAL F(IDMAT3)
+      CHARACTER STRING(LENGTH)
+      JOB = 2
+        CALL MATSC(MAT2,IDMAT2,IWT,IDMAT3,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN1(IWT,IDMAT2,LENGTH,F,MAXS,SCALE,SMIN,JOB,STRING)
+      END
+      SUBROUTINE GETP3(STRING,LENGTH,IDMAT3,MAT3,IWT,IDM,F)
+      INTEGER MAT3(IDMAT3,IDMAT3)
+      INTEGER IWT(IDMAT3,IDMAT3)
+      REAL F(IDMAT3)
+      CHARACTER STRING(LENGTH)
+      JOB = 2
+        CALL MATSC(MAT3,IDMAT3,IWT,IDM,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN1(IWT,IDMAT3,LENGTH,F,MAXS,SCALE,SMIN,JOB,
+     +  STRING)
+      END
+      SUBROUTINE GETP4(WEIGHT,IWT,IDM,LENGTH,F)
+      REAL F(IDM)
+      REAL WEIGHT(IDM,LENGTH)
+      INTEGER IWT(IDM,LENGTH)
+      JOB = 2
+        CALL WTSC(WEIGHT,IWT,IDM,LENGTH,MAXS,SCALE,SMIN)
+        CALL WTGEN(IWT,IDM,LENGTH,F,MAXS,SMIN,SCALE,JOB)
+      END
+      SUBROUTINE GETP8(MAT1,IDMAT1,IWT,IDM,LENGTH,F,FB)
+      REAL F(IDM),FB(IDM)
+      INTEGER MAT1(IDMAT1,IDMAT1)
+      INTEGER IWT(IDM,LENGTH)
+        DO 30 I = 1,IDM
+          FB(I) = F(I)
+30      CONTINUE
+      JOB = 2
+        CALL MATSC(MAT1,IDMAT1,IWT,IDM,LENGTH,MAXS,SMIN,SCALE)
+        CALL SCGEN(IWT,IDMAT1,LENGTH,F,FB,MAXS,SCALE,SMIN,JOB)
+      END
+      SUBROUTINE WRTSCR(SCMIN,PROB,KBOUT)
+      WRITE(KBOUT,1000)SCMIN,PROB
+1000  FORMAT(' Probability of score ',F10.4,' =',E10.3)
+      END
+      SUBROUTINE GIDMAT(MAT,IDM,IDMX)
+      INTEGER MAT(IDM,IDM)
+C  AUTHOR RODGER STADEN
+C  ALLOW ONLY - AS WILD CARD FOR PROTEINS
+      DO 10 I = 1,IDM
+        DO 5 J = 1,IDM
+          MAT(I,J) = 0
+5       CONTINUE
+10    CONTINUE
+      DO 20 I = 1,IDMX+1
+        DO 15 J = 1,IDMX+1
+          IF(I.EQ.J)MAT(I,J) = 1
+15      CONTINUE
+20    CONTINUE
+      DO 30 I = 1,IDMX
+        MAT(I,IDMX+1) = 1
+        MAT(IDMX+1,I) = 1
+30    CONTINUE
+      END
+      SUBROUTINE FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +COMBIN,COMB)
+      INTEGER CLASS(NMOT),LENGTH(NMOT),WTSTR(NMOT)
+      INTEGER STRNGS(NMOT),START(NMOT),IEND(NMOT),IENTRY(NMOT)
+      INTEGER RELMOT(NMOT),RANGES(NMOT),RANGEL(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),START2(NMOT),IEND2(NMOT)
+      INTEGER MATCHQ(NMOT),RELEND(NMOT),MATCHP(NMOT)
+      REAL CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),COMB
+C      WRITE(*,*)'IN FORWAD FOR MOTIF',MOTIF
+C  ROUTINE TO MOVE FORWARDS ONE MOTIF
+C  FIRST IT UPDATES THE RANGES FOR ALL THE MOTIFS THAT DEPEND
+C  ON THE CURRENT MOTIF
+C  IT POINTS TO THE NEXT MOTIF AND GETS ALL THE REQUIRED VARIABLES
+C  WE MUST FIRST LOOK BACK THRU THE LIST UNTIL WE FIND THE FIRST
+C  NON ORED MOTIF. THEN WE MUST SET THE RELATIVE POSITIONS FOR ALL
+C  THOSE MOTIFS THAT DEPEND ON THIS SET OF ORED MOTIFS. THE RELATIVE
+C  POSITIONS MUST BE SET RELATIVE TO THE POSITION OF THE MATCH FOUND
+C  FOR THE CURRENT MOTIF EVEN IF IT IS NOT THE FIRST IN THE LIST OF ORS
+C  THIS IS BECAUSE RELATIVE POSITIONS CAN ONLY BE DEFINED RELATIVE TO
+C  THE FIRST OF A SET OF ORS, BUT IF A MATCH IS FOUND FOR ANY OF THE OTHERS
+C  IN THE SET THEN ITS POSITION DEFINES THE RANGE.
+C
+C
+C
+C  IF THIS IS THE LAST MOTIF RETURN TO DISPLAY THE MATCH
+      IF(MOTIF.EQ.NMOT)THEN
+        IDSPLY = 1
+        RETURN
+      END IF
+C
+C  NOT THE LAST
+C
+      IDSPLY = 0
+C
+C  IF THIS IS A NOTTED MOTIF THEN IT WILL NOT BE ORED OR HAVE ANY
+C  OTHER MOTIFS RELATIVE TO IT, SO DEAL WITH IT HERE
+C
+      IF(COMB.EQ.'N')THEN
+        MOTIF  = MOTIF + 1
+        ICLASS = CLASS(MOTIF)
+        ILEN   = LENGTH(MOTIF)
+        CUT    = CUTOFF(MOTIF)
+        IWT    = WTSTR(MOTIF)
+        ISTRST = STRNGS(MOTIF)
+        COMB   = COMBIN(MOTIF)
+        RETURN
+      END IF
+C
+C
+C   NOW LOOK FOR THE FIRST NON ORRED CLASS FROM HERE BACK
+C
+      NOTIF = MOTIF
+10    CONTINUE
+C
+      IF(COMBIN(NOTIF).EQ.'O')THEN
+        NOTIF = NOTIF - 1
+        GO TO 10
+      END IF
+C
+C  NOTIF IS A NON-ORED MOTIF, OR THE FIRST OF A LIST OF ORS 
+C  SO SET RELATIVE POSITIONS FOR ALL THOSE THAT DEPEND ON IT
+C
+        DO 20 I = MOTIF+1,NMOT
+          JMOT = I
+            IF(RELMOT(JMOT).EQ.NOTIF)THEN
+              START(JMOT) = START(MOTIF) + RANGES(JMOT) - 1
+C             NEXT LINE IS FOR RAH BUG
+              START(JMOT) = MAX(START(JMOT),1)
+              IEND(JMOT) = START(JMOT) + RANGEL(JMOT) - 1
+              IF(IEND(JMOT).GT.IDSEQ)IEND(JMOT)=IDSEQ
+C   SET FLAG TO SAY THAT FOR STEMS WE DO NOT HAVE TO CONTINUE A PREVIOUS
+C   5' STEM START POSITION (IE TRY ALL ITS REMAINING LOOPS)
+              IENTRY(JMOT) = 0
+            END IF
+C
+20      CONTINUE
+C
+C
+C   OK WEVE FOUND THE FIRST NON-ORRED CLASS AND RESET RANGES
+C   RANGES ACCORDINGLY. NOW WE HAVE TO GO FORARDS THRU THE LIST UNTIL
+C   WE FIND THE NEXT NON-ORRED CLASS. IF WE SET THIS TO BE THE 
+C   CURRENT MOTIF WE WILL HAVE MOVED FORWARDS. WE MUST BE CAREFUL THAT
+C   WE DONT FALL OFF THE END OF THE LIST!
+C
+          JMOT = MOTIF
+30        CONTINUE
+          JMOT   = JMOT + 1
+          IF(JMOT.GT.NMOT)THEN
+            IDSPLY = 1
+            RETURN
+          END IF
+          IF(COMBIN(JMOT).EQ.'O')GO TO 30
+          MOTIF  = JMOT
+          ICLASS = CLASS(MOTIF)
+          ILEN   = LENGTH(MOTIF)
+          CUT    = CUTOFF(MOTIF)
+          IWT    = WTSTR(MOTIF)
+          ISTRST = STRNGS(MOTIF)
+          COMB   = COMBIN(MOTIF)
+        END
+C*********************************************************************
+      SUBROUTINE BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +COMBIN,COMB)
+      INTEGER CLASS(NMOT),LENGTH(NMOT),WTSTR(NMOT)
+      INTEGER STRNGS(NMOT),RELMOT(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER MATCHQ(NMOT),RANGES(NMOT),RANGEL(NMOT),RELEND(NMOT)
+      INTEGER MATCHP(NMOT)
+      REAL CUTOFF(NMOT)
+      CHARACTER COMBIN(NMOT),COMB
+C      WRITE(*,*)'IN BAKSID FOR MOTIF',MOTIF
+C   ROUTINE TO MOVE BAKWARDS OR SIDEWAYS
+C   IE WE COME HERE AFTER FAILING TO FIND A MATCH FOR THE PREVIOUS MOTIF
+C   WE CHECK IF WE CAN MOVE SIDEWAYS BY LOOKING TO SEE IF THE NEXT MOTIF
+C   IN THE LIST IS ORRED.
+C   IF IT IS WE LEAVE THE CURRENT MOTIFS POSITION UNCHANGED SO THAT IT
+C   IS READY TO SIGNIFIY TO A SUBSEQUENT MOVE BACKWARDS THAT ALL ITS
+C   POSITIONS HAVE BEEN TRIED FOR THE MATCHES FOUND FURTHER UP THE LIST.
+C
+C   IF THE NEXT MOTIF IN THE LIST IS NOT ORED THEN WE MUST MOVE BACKWARDS.
+C   BEFORE DOING SO WE MUST RESET THE RANGES FOR ALL THE MOTIFS IN THE 
+C   CURRENT SET OF ORS. THEY MUST BE RESET TO THEIR ORIGINAL FULL RANGE
+C   BECAUSE WE ARE TRYING TO FIND ALL POSSIBLE COMBINATIONS OF MATCH AND
+C   WHEN WE GO BACK WE MIGHT NOT GO BACK AS FAR AS THE MOTIF THE CURRENT 
+C   SET DEPEND ON, AND SO THEY WILL NOT BE RESET COMING FORWARDS. SO IF WE 
+C   FIND ANOTHER MATCH TO A MOTIF IN A PREVIOUS SET WE MUST ALLOW THE SAME
+C   MATCHES TO BE REFOUND IN THE CURRENT SET BECAUSE IT GIVES A DIFFERENT 
+C   COMBINATION TO THOSE ALREADY FOUND. TO FIND THE ONES TO RESET WE MUST
+C   MOVE BACK THRU THE LIST UNTIL A NON-ORRED CLASS IS FOUND: THIS 
+C   IS THE LAST ONE TO RESET.
+C   HAVING DONE THIS, TO MOVE BACK WE MUST LOOK BACK FROM THE LAST ONE 
+C   RESET, UNTIL WE FIND THE FIRST NON-ORRED CLASS. THIS WILL
+C   BE AN ANDED OR NOTTED MOTIF OR THE FIRST IN A LIST OF ORS. 
+C   WE MUST ALWAYS GO BACK
+C   TO THE BEGINNING OF A LIST OF ORS. IF THEY HAVE NO FURTHER POSITIONS TO
+C   TRY WE CAN THEN GO SIDEWAYS UNTIL EITHER ONE WITH SOME PLACES TO TRY
+C   IS FOUND, OR WE HAVE TO GIVE UP AND GO BACKWARDS.
+C   MOTIF 1 AND ANY MOTIFS ORED WITH IT ARE SPECIAL CASES FOR THE ROUTINES
+C   BECAUSE THEIR RANGES MUST NOT BE RESET. THEIR SPECIALNESS IS SIGNIFIED
+C   BY THEIR HAVING A ZERO RELATIVE MOTIF NUMBER. ALSO IF WE TRY TO GO BACK
+C   FOR MOTIF 1, WE MUST HAVE FINISHED TRYING ALL POSSIBLE COMBINATIONS,
+C   SO WE ESCAPE.
+C   
+C
+C
+C   RESET ITS MATCH POSITION TO ZERO AS A FLAG TO THE DISPLAY ROUTINES
+C
+C
+      MATCHP(MOTIF) = 0
+C
+C
+      IF(MOTIF.LT.NMOT)THEN
+        IF(COMBIN(MOTIF+1).EQ.'O')THEN
+C
+C  MOVE SIDEWAYS
+C
+          MOTIF = MOTIF + 1
+          ICLASS = CLASS(MOTIF)
+          ILEN   = LENGTH(MOTIF)
+          CUT    = CUTOFF(MOTIF)
+          IWT    = WTSTR(MOTIF)
+          ISTRST = STRNGS(MOTIF)
+          MATCHP(MOTIF) = 0
+          COMB   = COMBIN(MOTIF)
+          RETURN
+        END IF
+      END IF
+C
+C
+C
+C   WE MUST MOVE BACKWARDS.   RESET RANGE FOR CURRENT MOTIF IF IT IS NOT 
+C   NOTTED
+C
+C
+      IF(COMB.NE.'N')THEN
+C
+C
+      IREL = RELMOT(MOTIF)
+      IF(IREL.NE.0)THEN
+        START(MOTIF) = START(IREL) + RANGES(MOTIF)
+C       NEXT LINE FOR RAH BUG
+        START(MOTIF) = MAX(START(MOTIF),1)
+        IEND(MOTIF)  = START(MOTIF) + RANGEL(MOTIF) - 1 
+      END IF
+C
+C
+C
+10    CONTINUE
+C
+C
+      IF(MOTIF.EQ.1)THEN
+        IRET = 1
+        RETURN
+      END IF
+      IF(COMBIN(MOTIF).EQ.'O')THEN
+        MOTIF = MOTIF - 1
+        MATCHP(MOTIF) = 0
+        IREL = RELMOT(MOTIF)
+        IF(IREL.NE.0)THEN
+          START(MOTIF) = START(IREL) + RANGES(MOTIF)
+C         NEXT LINE FOR RAH BUG
+          START(MOTIF) = MAX(START(MOTIF),1)
+          IEND(MOTIF)  = START(MOTIF) + RANGEL(MOTIF) - 1 
+        END IF
+        GO TO 10
+      END IF
+C
+      END IF
+C
+C   IF WE GET HERE WE HAVE FOUND A NON-ORRED CLASS AND IF THE MOTIF WE CAME 
+C   IN WITH WAS NOT NOTTED
+C   WE HAVE RESET ITS RANGE AND
+C   ALL THOSE UP THE LIST THAT ARE ORRED, AND THE CURRENT ONE
+C   NOW WE MUST FIND THE TOP OF THE NEXT LIST SIGNIFIED BY AN ANDED
+C   CLASS
+C
+20    CONTINUE
+      MOTIF = MOTIF - 1
+      IF(COMBIN(MOTIF).NE.'A')GO TO 20
+C      IF(COMBIN(MOTIF).EQ.'O')GO TO 20
+      ICLASS = CLASS(MOTIF)
+      ILEN   = LENGTH(MOTIF)
+      CUT    = CUTOFF(MOTIF)
+      IWT    = WTSTR(MOTIF)
+      ISTRST = STRNGS(MOTIF)
+      COMB   = COMBIN(MOTIF)
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF1(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,
+     +MATCHP,MATCHS,IFOUND,CUT,JOB)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS
+      IFOUND = 0
+      ISTEP = 1
+      IF(JOB.EQ.1)ISTEP = CUT
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.IDIM2)RETURN
+      CALL FIND8(SEQ(ISTART),L1,STRING,IDIM2,ISTEP,IFOUND)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = ISTART+IFOUND-1
+      MATCHS = IDIM2
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF2(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND)
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFIT4(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      END
+C*********************************************************************
+      SUBROUTINE SQFIT4(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      MINSCR = MINSC
+C
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+C   TRY ALL POSSIBLE POSITIONS FOR MATCHING AND SCORE FOR EACH
+C   POINT TO ARRAY ELEMENT CORRESPONDING TO FIRST BASE
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          IF(CTONUM(SEQ(IP)).EQ.CTONUM(STRING(J)))THEN
+            NTOT = NTOT + 1
+          ELSE
+            IF(STRING(J).EQ.'-')NTOT = NTOT + 1
+          END IF
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      END
+      SUBROUTINE EMBOUT(KEYNAM,FROM,TO,STRAND,DESCRP,IDEV)
+      CHARACTER KEYNAM*(*),STRAND,DESCRP*(*)
+      INTEGER FROM,TO
+C     note keynam*8, strand*1 descrp*38
+      WRITE(IDEV,1000,ERR=100)KEYNAM,FROM,TO,STRAND,DESCRP
+1000  FORMAT('FT','   ',A8,' ',I6,' ',I6,'  ',A,'    ',A)
+      RETURN
+100   CONTINUE
+      END
diff --git a/src/staden/pip.f b/src/staden/pip.f
new file mode 100644
index 0000000..8fd4b0b
--- /dev/null
+++ b/src/staden/pip.f
@@ -0,0 +1,469 @@
+C   PIP (protein interpretation program)
+C
+C   author: Rodger Staden, Medical Research Council Centre,
+C           Laboratory of Molecular Biology, Hills Road,
+C           Cambridge, England
+C    16-3-90 Removed escape after bpause
+C     5-4-90 Changed getdev to redir and all that implies (see redir)
+C    27-6-90 added hopp and woods hydrophilicity table. Not used yet!
+C    but could be sent to compc
+C     6-7-90 added showfi
+C     9-7-90 Renamed menu routine
+C     5-11-90 Very many changes for addition of file of file names
+C             search for patterns - affects patternp,patternpc,pipl,
+C             pip,nip,nipl,patternn,patternnc,analps89 and subs89 (rdwmt)
+C    4-12-90  CHANGED NAME OF ROUTINE COMPC TO COMPCP because the sun 
+C             linker cannot cope with subroutines and common blocks 
+C              having the same name
+C    11-12-90 Changed handling of sequence libraries. New logical name
+C             set here and passed to rdseq
+C     13-04-91 removed initial CLEARV
+C   The maximum sequence length is defined by the parameter MAXSEQ
+C   and MAXWIN.
+C
+C   Either Staden (ie. no heading and effectively free format with
+C   lines of any length <81 and allowing gaps that will be removed)
+C   or EMBL or GENBANK format files can be read.
+C
+C 7-6-91 New sequence library routines for cdrom format. PIR stuff junked
+C 3-7-91 set namlen = 60
+C 18-7-91 added titles to pattern files. Fixed graphics output for patterns
+C 25-2-92 changed call to rdseq
+      SUBROUTINE FMAIN()
+      INTEGER BOTOPT,TOPOPT
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILMAR,FILNAM,FILEP,FILEA,FILEAA
+      CHARACTER*(NAMLEN) FILEIN,FILE12,FOFNAM
+      CHARACTER*(NAMLEN) LIBLF
+      PARAMETER (BOTOPT=0,TOPOPT=28,
+     +           MAXSEQ=100000,
+     +           MXSPAN=603,
+     +           MAXWIN=MAXSEQ+MXSPAN,
+     +           MAXSD2=MAXSEQ/2,
+     +           MAXSD3=MAXSEQ/3,
+     +           MAXWIR=100000,
+     +           MAXD36=MAXWIR/36,
+     +           MAXD2=MAXWIR/2,
+     +           MAXD3=MAXWIR/3,
+     +           MAXMEN=-5,
+     +           MAXOPT=28,
+     +           MAXDEV=9,
+     +           MXWTLN = 120,
+     +           IDM = 26)
+      PARAMETER (
+     +             HELPF='PIPHELP',
+     +             POINTF='PIPHPNT',
+     +             FILMAR='PIPMARG',
+     +             FILEA='PROTGRP',
+     +             FILEAA='PROTALL',
+     +             FILEP='PROTMAT',
+     +             FILE12='ROBSON',
+     +             LIBLF='SEQUENCELIBRARIES')
+      PARAMETER ( MAXMOT = 50,MAXWTS = 4000)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT),DEVNOS(MAXDEV)
+      INTEGER WORKI(MAXWIR),OPT,MARGB(MAXOPT),MARGT(MAXOPT)
+      INTEGER MATRIX(IDM,IDM),MAT1(IDM,IDM)
+      REAL WORKR(MAXWIR)
+      CHARACTER*(NAMLEN) NAMSAV(MAXMOT)
+      CHARACTER*8 KEYNS(MAXMOT)
+      CHARACTER SEQ(MAXWIN),SEQW(MAXSEQ),CHRSET(IDM)
+      REAL HYDRO(IDM),ISOP(IDM),HYDRE(IDM),HYDHW(IDM)
+      CHARACTER*3 CODONS(26)
+      EQUIVALENCE (WORKI,WORKR)
+C 1,2,3,4,5,6,7,8,9,0,1,2,3,4,5,6,7,8,9,0,1,2,3,4,5,6
+C c,s,t,p,a,g,n,d,e,q,?,?,h,r,k,m,i,l,v,f,y,w
+      DATA HYDHW/-1.0,0.3,-0.4,0.0,-0.5,0.0,0.2,3.0,3.0,0.2,0.0,0.0,
+     +-0.5,3.0,3.0,-1.3,-1.8,-1.8,-1.5,-2.5,-2.3,-3.4,0.0,0.0,0.0,0.0/
+      DATA HYDRO/70.,36.,38.,29.,63.,41.,10.,10.,10.,10.,10.,10.,
+     +13.,0.,6.,64.,90.,82.,87.,72.,32.,36.,41.,41.,41.,41./
+      DATA HYDRE/0.29,-0.18,-0.05,0.12,0.62,0.48,-0.78,-0.9,-0.74,
+     +-0.85,-0.74,-0.85,-0.4,-2.53,-1.5,0.64,1.38,1.06,1.08,1.19,
+     +0.26,0.81,0.0,0.0,0.0,0.0/
+C      DATA ISOP/5.02,6.68,6.53,6.10,6.02,5.97,5.41,2.87,3.22,
+C     +5.65,5.00,5.00,7.58,10.76,9.74,5.75,6.02,5.98,5.97,5.98,
+C     +5.65,5.88,5.00,5.00,5.00,5.00/
+      DATA ISOP/0.0,0.0,0.0,0.0,0.0,0.0,0.0,-1.0,-1.0,
+     +0.0,0.0,0.0,0.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,
+     +0.0,0.0,0.0,0.0,0.0,0.0/
+      DATA CODONS/'TGY','WSN','ACN','CCN',
+     +'GCN','GGN','AAY','GAY','GAR','CAR',
+     +'RAY','SAR','CAY','MGN','AAR','ATG',
+     +'ATH','YTN','GTN','TTY','TAY','TGG',
+     +'---','---','---','---'/
+C   Initialise help
+      CALL INTHLP('pip', TOPOPT)
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(3))
+      CALL GIDMAT(MAT1,IDM,22)
+      IGORT = 0
+      IFORNO = 0
+      LIBNO = 2
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' PIP (Protein interpretation program) V5.0 Feb 1992',/,
+     +' Author: Rodger Staden'/)
+C   READ IN THE POINTERS TO THE HELP FILE
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(4),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+      CALL INITLU(IDM)
+      IOK=0
+C      IF(MACTYP.EQ.MACSHT)CALL OPENB(DEVNOS(5),FILBUF,LREC,NREC,IOK)
+C      IF(IOK.NE.0)STOP
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +MAXOPT,DEVNOS(1),FILMAR)
+      IIIIX=0
+      IIIIY=0
+      IDEV=KBOUT
+      MOPT=0
+      IDIMT = 0
+C   READ IN THE SCORE MATRIX (USUALLY MDM78)
+      CALL GETMAT(DEVNOS(1),FILEP,MATRIX,IDM,CHRSET,KBOUT,IOK)
+      IF(IOK.NE.0) GO TO 9999
+      CALL SETPAR(IOK)
+      IF(IOK.NE.0) GO TO 9999
+2     CONTINUE
+      IDEVLL = DEVNOS(5)
+      IDEVEN = DEVNOS(6)
+      IDEVAN = DEVNOS(7)
+      IDEVDL = DEVNOS(8)
+      IDEVLF = DEVNOS(9)
+      CALL RDSEQ(
+     +SEQ(2+MXSPAN/2),MAXSEQ,IDIMT,J1,J2,ISTART,IEND,IDIM1,IDIMB,
+     +DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +HELPS(3),HELPE(3),HELPF,DEVNOS(4),IDEV,IFORNO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBNO,LIBLF,WORKI,MAXWIR,IOK)
+      IF(IOK.NE.0)GO TO 1
+      FILEIN=FILNAM
+      CALL SHOWFI(KBOUT,FILEIN)
+C   GIVE COMPOSITION AS A CHECK
+       IF(IDIMB.GT.0)CALL MWCALC(SEQ(2+MXSPAN/2),IDIMB,
+     +J1-ISTART+1,J2-ISTART+1,KBOUT,KBOUT)
+C   set pointers to sequence
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMT  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTART = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQ(1)
+C  J1     = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  J2     = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIM1  = J2-J1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IEND   = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMB  = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+********
+      IDIM1P=IDIMB+MXSPAN
+1     CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      IF(IOK.NE.0) GO TO 9999
+C   give menu, get option
+C
+      CALL MENU(OPT,KOPT,MOPT,MAXOPT,MAXMEN,KBIN,KBOUT,
+     +        HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+C   change region
+      IF((OPT.EQ.4).AND.(IDIMT.GT.0))THEN
+C        CALL REDEFA
+C     +  (SEQ(2+MXSPAN/2),IDIMT,J1,J2,MAXSEQ,IDIM1,ISTART,IEND,IDIMB,
+C     +  DEVNOS(5),KBIN,KBOUT,IOK,SEQW,LREC,NREC,HELPS(OPT),
+C     +  HELPE(OPT),HELPF,DEVNOS(4),MACTYP,MACSHT,MACLNG)
+        CALL GTREG(KBIN,KBOUT,ISTART,IEND,J1,J2,
+     +  'Define active region',
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IDIM1 = J2 - J1 + 1
+        GO TO 1
+      END IF
+C   STOP
+      IF(OPT.EQ.2)GO TO 9999
+C
+C   LIST
+C
+      IF((OPT.EQ.5).AND.(IDIMB.GT.0))THEN
+        CALL LSTSEP(SEQ(2+MXSPAN/2),IDIMB,IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   WRITE OUT ACTIVE SEQUENCE
+C
+      IF((OPT.EQ.8).AND.(IDIM1.GT.0))THEN
+        CALL WRTACT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  SEQ(J1+1-ISTART+1+MXSPAN/2),IDIM1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C  ruler
+C
+      IF((OPT.EQ.12).AND.(IDIM1.GT.0))THEN
+        CALL RULER(J1,J2,MARGL,MARGR,
+     +  MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   composition
+C
+      IF((OPT.EQ.22).AND.(IDIM1.GT.0))THEN
+        CALL COMPH(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,KBIN,KBOUT,HYDRO,IDM,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C   clear all
+C
+      IF(OPT.EQ.10)THEN
+        CALL CLEARG
+        GO TO 1
+      END IF
+C
+C   xhairs
+C
+      IF((OPT.EQ.13).AND.(IDIM1.GT.0))THEN
+        XMAX=J2
+        XMIN=J1
+        YMAX=ISYMAX
+        YMIN=0.
+        CALL CLEARV
+        CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,IIIIX,IIIIY,N,KBOUT,
+     +  SEQ(2+MXSPAN/2),ISTART,IDIMB,
+     +  SEQ(2+MXSPAN/2),ISTART,IDIMB,1)
+        GO TO 1
+      END IF
+C
+C   clear vt100
+C
+      IF(OPT.EQ.11)THEN
+        CALL CLEARV
+        GO TO 1
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(OPT.EQ.14)THEN
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(OPT),HELPE(OPT),MAXOPT,HELPF,DEVNOS(4),KBIN,KBOUT)
+        GO TO 1
+      END IF
+C
+C   plot map
+C
+      IF((OPT.EQ.16).AND.(IDIM1.GT.0))THEN
+         CALL PLTMAP(DEVNOS(1),FILNAM,IDIM1,MARGL,MARGR,MARGB(OPT),
+     +  MARGT(OPT),ISXMAX,ISYMAX,J1,J2,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   new file
+C
+      IF(OPT.EQ.3)GO TO 2
+C
+C  type text
+C
+      IF(OPT.EQ.6)THEN
+        CALL TTEXT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C   help
+C
+      IF(OPT.EQ.1)THEN
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,HELPF,DEVNOS(4),
+     +  KBIN,KBOUT)
+        GO TO 1
+      END IF
+C
+C  search
+C
+      IF((OPT.EQ.17).AND.(IDIM1.GT.0))THEN
+        KOPT = 1
+        CALL SERCHP(SEQ(2+MXSPAN/2),IDIMB,J1,J2,ISTART,SEQW,MAXSD3,
+     +  SEQW(1+MAXSD3),MAXSD3,
+     +  SEQW(1+2*MAXSD3),MAXSD3,
+     +  WORKI,WORKI(1+10*MAXD36),WORKI(1+20*MAXD36),10*MAXD36,
+     +  WORKI(1+30*MAXD36),WORKI(1+31*MAXD36),WORKI(1+32*MAXD36),
+     +  WORKI(1+33*MAXD36),
+     +  WORKI(1+34*MAXD36),WORKI(1+35*MAXD36),MAXD36,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  IDEV,DEVNOS(1),FILNAM,FILEA,FILEAA,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   direct output to disk
+C
+      IF(OPT.EQ.7)THEN
+        CALL REDIR(IDEV,DEVNOS(2),DEVNOS(3),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   signal searches from named plot files
+C
+      IF((OPT.EQ.20).AND.(IDIM1.GT.0))THEN
+        CALL SIGNLP(SEQ(2+MXSPAN/2),IDIMB,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,J1-ISTART+1,J2-ISTART+1,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,
+     +  WORKI,WORKR(1+IDM*MXWTLN),WORKI(1+2*IDM*MXWTLN),IDM,
+     +  MXWTLN,SEQW,CHRSET,KOPT)
+        GO TO 1
+      END IF
+C
+C  SEQFIT
+C
+      IF((OPT.EQ.18).AND.(IDIM1.GT.0))THEN
+        CALL SQPFIT(SEQ(2+MXSPAN/2),IDIMB,SEQW,MAXSEQ,
+     +  WORKI,WORKI(1+MAXD2),MAXD2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1,J2,ISTART,IDEV,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  calc mol wt
+C
+      IF((OPT.EQ.21).AND.(IDIM1.GT.0))THEN
+        CALL MWCALC(SEQ(2+MXSPAN/2),IDIMB,
+     +  J1-ISTART+1,J2-ISTART+1,KBOUT,IDEV)
+        GO TO 1
+      END IF
+C
+C  plot charge
+C
+      IF((OPT.EQ.23).AND.(IDIM1.GT.0))THEN
+        CALL COMPCP(SEQ,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  J1-ISTART+1,J2-ISTART+1,KBIN,KBOUT,ISOP,IDM,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  sepfit
+C
+      IF((OPT.EQ.19).AND.(IDIM1.GT.0))THEN
+        CALL SEPFIT(SEQ(2+MXSPAN/2),IDIMB,SEQW,MAXSEQ,
+     +  WORKI,WORKI(1+MAXD2),MAXD2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1,J2,
+     +  ISTART,IDEV,
+     +  DEVNOS(1),FILNAM,KBIN,KBOUT,MATRIX,IDM,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C  write labels
+C
+      IF(OPT.EQ.15)THEN
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 1
+      END IF
+C
+C  show settings and edit
+C
+      IF(OPT.EQ.9)THEN
+        WRITE(KBOUT,4000)FILEIN,J1,J2,IDIM1
+4000    FORMAT(' Sequence=',A,/,' Start=',I7,' End=',I7,' Length=',I7)
+C       THIS NEXT LINE IS BECAUSE SOMETIMES THE FILE IS NOT READY TO EDIT!
+        CLOSE(UNIT=DEVNOS(2))
+        CALL SEQEDT(SEQ(2+MXSPAN/2),MAXSEQ,IDIMB,KBIN,
+     +  KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),DEVNOS(1),
+     +  FILNAM,FILEIN,IDM,IOK)
+        IF(IOK.NE.0) GO TO 1
+C        CALL EDITSQ(SEQ(2+MXSPAN/2),MAXSEQ,SEQW,MAXSEQ,IDIMB,KBIN,
+C     +  KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),1,DEVNOS(1),
+C     +  FILNAM,PAA,IDM)
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMT  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTART = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQ(1)
+C  J1     = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  J2     = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIM1  = J2-J1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IEND   = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMB  = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+C  THIS IS A FUDGE AND ONLY WORKS FOR UNBUFFERED SEQUENCES!!!!!!!!!!!
+        J1 = 1
+        J2 = IDIMB
+        IDIMT = IDIMB
+        IEND = IDIMB
+        IDIM1 = IDIMB
+        WRITE(KBOUT,4000)FILEIN,J1,J2,IDIM1
+        GO TO 1
+      END IF
+C
+C  robson
+C
+      IF((OPT.EQ.24).AND.(IDIM1.GT.0))THEN
+        CALL ROBSON(SEQ(2+MXSPAN/2),IDIMB,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1-ISTART+1,
+     +  J2-ISTART+1,KBIN,KBOUT,
+     +  DEVNOS(1),FILE12,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDEV,KOPT)
+        GO TO 1
+      END IF
+C
+C  hydrophobic moment
+C
+      IF((OPT.EQ.25).AND.(IDIM1.GT.0))THEN
+        CALL HYDMOM(SEQ(J1+1-ISTART+1+MXSPAN/2),IDIM1,HYDRE,IDM,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),
+     +  ISXMAX,ISYMAX,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   helix wheel
+C
+      IF((OPT.EQ.26).AND.(IDIM1.GT.0))THEN
+        CALL HELIXW(SEQ(2+MXSPAN/2),IDIMB,ISTART,J1,J2,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,KBIN,KBOUT,
+     +  HYDRE,IDM,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   back translate
+C
+      IF((OPT.EQ.27).AND.(IDIM1.GT.0))THEN
+        CALL BACK(SEQ(J1+1-ISTART+1+MXSPAN/2),IDIM1,
+     +  CODONS,KBIN,KBOUT,FILNAM,
+     +  DEVNOS(1),SEQW,MAXSEQ,WORKR,IDIM1P,MXSPAN,
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 1
+      END IF
+C
+C   pattern search
+C
+      IF((OPT.EQ.28).AND.(IDIM1.GT.0))THEN
+        CALL PATTEN(SEQ(J1-ISTART+2+MXSPAN/2),IDIM1,SEQW,MAXSEQ,
+     +  WORKI(1),WORKI(MAXMOT+1),WORKI(2*MAXMOT+1),WORKI(3*MAXMOT+1),
+     +  WORKI(4*MAXMOT+1),WORKI(5*MAXMOT+1),WORKI(6*MAXMOT+1),
+     +  WORKI(7*MAXMOT+1),WORKI(8*MAXMOT+1),WORKI(9*MAXMOT+1),
+     +  WORKI(10*MAXMOT+1),WORKI(11*MAXMOT+1),WORKI(12*MAXMOT+1),
+     +  WORKI(13*MAXMOT+1),WORKI(14*MAXMOT+1),WORKI(15*MAXMOT+1),
+     +  WORKI(16*MAXMOT+1),WORKI(17*MAXMOT+1),WORKI(18*MAXMOT+1),
+     +  WORKI(19*MAXMOT+1),WORKI(20*MAXMOT+1),
+     +  WORKR(22*MAXMOT+1),WORKR(23*MAXMOT+1),
+     +  FILNAM,MAXMOT,MAXWTS,MATRIX,IDEV,DEVNOS(6),DEVNOS(7),
+     +  MARGL,MARGR,MARGB(OPT),MARGT(OPT),ISXMAX,ISYMAX,J1,
+     +  KBIN,KBOUT,DEVNOS(8),IDM,SEQ(1),MAT1,NAMSAV,KEYNS,CHRSET,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),FOFNAM,DEVNOS(1))
+C  NB IVE SENT THE START OF SEQ TO BE USED IN PATTEN. THIS IS OK
+C  AS LONG AS MAXMOT<MXSPAN/2, AND NO MOTIF REQUIRES TO START TO
+C  THE LEFT OF THE FIRST REAL SEQUENCE ELEMENT
+        GO TO 1
+      END IF
+      GO TO 1
+9999  CONTINUE
+      CALL SHUTD
+      END
diff --git a/src/staden/pipl.f b/src/staden/pipl.f
new file mode 100644
index 0000000..2eb3d63
--- /dev/null
+++ b/src/staden/pipl.f
@@ -0,0 +1,1015 @@
+C   PIPL (Protein interpretation program (library))
+C   9-7-92 added fasta format capability
+C   11-4-90 Changed title to be string rather than array
+C           corresponding change made to rdpira
+C   18-4-90 Changed to allow use of library index when appropriate
+C           so using opir1 instead of opir, and rdpirb as well as
+C           rdpira
+C   8-11-90 Changed call to getmf for compatibility with pip
+C           and replaced all calls to radio by radion
+C   11-12-90 Changed library opening, added filnll and paramter
+C  18-7-91 Added titles to pattern files
+C  16-12-91 Added access to pir library in codata format and cdrom indices
+C  2-3-92 set filnam = ' ' for calls to openf1
+C
+C   author: Rodger Staden, Medical Research Council Centre,
+C           Laboratory of Molecular Biology, Hills Road,
+C           Cambridge, England
+C
+      SUBROUTINE FMAIN()
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) FILE1,FILNAM,HELPF,LIBLF
+      INTEGER BOTOPT,TOPOPT
+      PARAMETER (
+     +           MAXSEQ=180000,
+     +           MXSPAN=603,
+     +           MAXWIN=MAXSEQ+MXSPAN,
+     +           MAXWIR=180000,
+     +           MAXD36=MAXWIR/36,
+     +           MAXD2=MAXWIR/2,
+     +           MAXD3=MAXWIR/3,
+     +           MAXSD2=MAXSEQ/2,
+     +           MAXSD3=MAXSEQ/3,
+     +           MAXDEV=10,
+     +           LIBLF = 'SEQUENCELIBRARIES',
+     +           LENNAM = 10)
+      PARAMETER (FILE1='PROTMAT')
+      PARAMETER (MAXMOT =   100,
+     +           MAXWTS = 5000,
+     +           IDM    = 26)
+      INTEGER BESTP(MAXMOT),BESTQ(MAXMOT)
+      REAL BESTS(MAXMOT)
+      CHARACTER*(LENNAM) ENAMEL,ENAME,NAMSAV(MAXMOT)
+      PARAMETER (MAXDIV = 15)
+      INTEGER DIVDEV(MAXDIV),RSIZEN
+      INTEGER DEVNOS(MAXDEV)
+      REAL WORKR(MAXWIR)
+      INTEGER WORKI(MAXWIR)
+      INTEGER MAT1(IDM,IDM),MATRIX(IDM,IDM)
+      CHARACTER SEQ(MAXWIN),SEQW(MAXSEQ),CHRSET(IDM)
+C  COMPATIBILITY WITH ANALYSEQ HELP STUFF
+      PARAMETER (BOTOPT=0,TOPOPT=1)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      CHARACTER*8 KEYNS(MAXMOT)
+      CHARACTER LTYPE
+      EQUIVALENCE (WORKR,WORKI)
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL GIDMAT(MAT1,IDM,22)
+      WRITE(KBOUT,1000)
+1000  FORMAT(/,
+     +' PIPL (Protein interpretation program (library))',
+     +' V4.1 Jul 1991',/,
+     +' Author: Rodger Staden',/,
+     +' Searches protein libraries for patterns of motifs',/)
+      CALL INITLU(IDM)
+      CALL GETMAT(DEVNOS(1),FILE1,MATRIX,IDM,CHRSET,KBOUT,IOK)
+      IF(IOK.NE.0)STOP
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for results file',
+     +HELPS(1),HELPE(1),HELPF,DEVNOS(2))
+      IF(IOK.NE.0)STOP
+C  OPEN LIBRARY
+      LIBIN = 2
+      IDEVNL = DEVNOS(7)
+      IDEVLL = DEVNOS(8)
+      IDEVEN = DEVNOS(9)
+      IDEVD = DEVNOS(MAXDEV)
+      CALL RDLIBL(FILNAM,KBIN,KBOUT,
+     +HELPS(1),HELPE(1),HELPF,DEVNOS(2),IDEVLL,IDEVEN,IDEVNL,
+     +LIBLF,LIBIN,DIVDEV,MAXDIV,IDEVD,
+     +LIST,ENAMEL,LIBTYP,LTYPE,NDIV,RSIZEN,NRECEN,IOK)
+      IF(IOK.NE.0)STOP
+      IDEVOT=DEVNOS(1)
+      J1 = 1
+      OPT = 1
+      CALL PATTEO(SEQ(MXSPAN+1),MAXSEQ,SEQW,MAXSEQ,
+     +WORKI(1),WORKI(MAXMOT+1),WORKI(2*MAXMOT+1),WORKI(3*MAXMOT+1),
+     +WORKI(4*MAXMOT+1),WORKI(5*MAXMOT+1),WORKI(6*MAXMOT+1),
+     +WORKI(7*MAXMOT+1),WORKI(8*MAXMOT+1),WORKI(9*MAXMOT+1),
+     +WORKI(10*MAXMOT+1),WORKI(11*MAXMOT+1),WORKI(12*MAXMOT+1),
+     +WORKI(13*MAXMOT+1),WORKI(14*MAXMOT+1),WORKI(15*MAXMOT+1),
+     +WORKI(16*MAXMOT+1),WORKI(17*MAXMOT+1),WORKI(18*MAXMOT+1),
+     +WORKI(19*MAXMOT+1),WORKI(20*MAXMOT+1),
+     +WORKR(22*MAXMOT+1),WORKR(23*MAXMOT+1),
+     +FILNAM,MAXMOT,MAXWTS,IDEVOT,DEVNOS(3),DEVNOS(4),J1,
+     +KBIN,KBOUT,MATRIX,DEVNOS(5),IDM,SEQ(1),
+     +WORKI(23*MAXMOT+1+MAXWTS),
+     +WORKI(23*MAXMOT+MAXWTS+3001),LIST,MAT1,
+     +NAMSAV,KEYNS,CHRSET,
+     +  HELPS,HELPE,HELPF,DEVNOS(2),DEVNOS(6),ENAMEL,ENAME,LIBTYP,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,BESTP,BESTQ,BESTS,
+     +  LTYPE)
+C NB HAVE SET MAXIMUM COMBINED STRING LENGTH TO 3000 ON PREVIOUS LINE
+C AND MAX INTEGER VERSION OF SEQUENCE IS WHATEVER IS LEFT AND THIS IS NOT
+C SENT OR CHECKED BY THE CODE !!!!!!!!!!!!!!!!!!!!!!!!!!
+C
+900   CONTINUE
+      END
+      SUBROUTINE PATTEO(SEQ,MAXSEQ,STRING,MAXSTR,
+     +LENGTH,CLASS,RELMOT,RANGES,RANGEL,RANGET,RANGEM,IENTRY,
+     +START2,IEND2,WTSTR,START,IEND,MATCHQ,RELEND,MATCHP,
+     +STRNGS,LAST5,LAST3S,LAST3E,MATCHS,CUTOFF,WEIGHT,FILNAM,
+     +MAXMOT,MAXWTS,
+     +IDEV1,IDEV2,IDEV3,KSTART,KBIN,KBOUT,
+     +MATRIX,IDEV4,IDM,COMBIN,STRNGI,SEQI,LIST,MAT1,
+     +NAMSAV,KEYNS,CHRSET,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVN,ENAMEL,NAMIN,LIBTYP,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,BESTP,BESTQ,BESTS,
+     +  LTYPE)
+      INTEGER DIVDEV(NDIV),RSIZEN
+      INTEGER LENGTH(MAXMOT),CLASS(MAXMOT),RELMOT(MAXMOT)
+      INTEGER RANGES(MAXMOT),RANGEL(MAXMOT)
+      INTEGER RANGET(MAXMOT),RANGEM(MAXMOT),IENTRY(MAXMOT)
+      INTEGER START2(MAXMOT),IEND2(MAXMOT),MATRIX(IDM,IDM)
+      CHARACTER SEQ(MAXSEQ),STRING(MAXSTR)
+      INTEGER WTSTR(MAXMOT),START(MAXMOT),IEND(MAXMOT)
+      INTEGER MATCHQ(MAXMOT),RELEND(MAXMOT)
+      INTEGER MATCHP(MAXMOT),STRNGS(MAXMOT)
+      INTEGER LAST5(MAXMOT),LAST3S(MAXMOT),LAST3E(MAXMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(MAXMOT),MATCHS(MAXMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER FILNAM*(*),HELPF*(*),NAMIN*(*),ENAMEL*(*),LTYPE
+      CHARACTER COMBIN(MAXMOT),CHRSET(IDM)
+      INTEGER BESTP(MAXMOT),BESTQ(MAXMOT),ENTRYN
+      REAL BESTS(MAXMOT)
+C NB PROBLEM ABOUT USING MAXSEQ AS DIMENSION!!!!!!!!!!!!
+      INTEGER STRNGI(MAXSTR),SEQI(MAXSEQ),MAT1(IDM,IDM)
+      CHARACTER TITLE*60,TITLEP*80
+      REAL EXPECC(26)
+      CHARACTER*(*) NAMSAV(MAXMOT),KEYNS(MAXMOT)
+      PARAMETER (MAXCLS = 6)
+      PARAMETER (MAXPRM = 25)
+      CHARACTER PROMPT(5)*(MAXPRM)
+      SAVE EXPECC
+      DATA EXPECC/2.9,7.0,6.1,5.2,8.6,8.4,4.3,5.5,6.0,3.9,0.0,0.0,
+     +2.0,4.9,6.6,1.7,4.5,7.4,6.6,3.6,3.4,1.3,0.0,0.0,0.0,0.0/
+C
+C  ZERO ARRAYS
+C
+        IDSEQ = 1000
+        CALL FILLI(LENGTH,MAXMOT,0)
+        CALL FILLI(CLASS,MAXMOT,0) 
+        CALL FILLI(RELMOT,MAXMOT,0)
+        CALL FILLI(RANGES,MAXMOT,0)
+        CALL FILLI(RANGEL,MAXMOT,0)
+        CALL FILLI(RANGET,MAXMOT,0)
+        CALL FILLI(RANGEM,MAXMOT,0)
+        CALL FILLI(IENTRY,MAXMOT,0)
+        CALL FILLI(START2,MAXMOT,0)
+        CALL FILLI(IEND2,MAXMOT,0) 
+        CALL FILLI(WTSTR,MAXMOT,0) 
+        CALL FILLI(START,MAXMOT,0) 
+        CALL FILLI(IEND,MAXMOT,0)  
+        CALL FILLI(MATCHQ,MAXMOT,0)
+        CALL FILLI(RELEND,MAXMOT,0)
+        CALL FILLI(MATCHP,MAXMOT,0)
+        CALL FILLI(STRNGS,MAXMOT,0)
+        CALL FILLI(LAST5,MAXMOT,0) 
+        CALL FILLI(LAST3S,MAXMOT,0)
+        CALL FILLI(LAST3E,MAXMOT,0)
+        CALL FILLR(CUTOFF,MAXMOT,0.0)
+        CALL FILLR(MATCHS,MAXMOT,0.0)
+        CALL FILLR(WEIGHT,MAXWTS,0.0)
+        CALL FILLC(COMBIN,MAXMOT,'A')
+        CALL FILLI(BESTP,MAXMOT,0)
+        CALL FILLC(BESTQ,MAXMOT,0)
+        CALL FILLC(BESTS,MAXMOT,0.)
+        ITOTAL = 0
+        CALL SETCMP(EXPECC,IDM)
+      PROMPT(1) = 'Motif by motif'
+      PROMPT(2) = 'Inclusive'
+      PROMPT(3) = 'Scores only'
+      PROMPT(4) = 'Complete padded sequences'
+      PROMPT(5) = 'Padded sections'
+      IOPT = 1
+      CALL RADION('Select results display mode',PROMPT,5,IOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) RETURN
+       JOPT = 0
+       CALL YESNO(JOPT,'Report all matches',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+       IF(JOPT.LT.0) RETURN
+      IF(IOPT.EQ.4)THEN
+        MININ = 1
+        MAXIN = 9999
+        JSTART = 1
+        WRITE(KBOUT,1020)
+1020    FORMAT(
+     +' For output option 4, we need to position the first motifs',/,
+     +' in a pattern so that they are aligned with one another')
+      CALL GETINT(MININ,MAXIN,JSTART,'Position of first motif',
+     +IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      JSTART = IVAL
+      END IF
+        FILNAM = ' '
+        CALL OPENF1(IDEV3,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Pattern definition file',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        JDEV = IDEV3
+C
+C  GET MOTIF DEFINITIONS
+C
+C  RETURN STRING LENGTH FOR COMPATIBILITY WITH LIB SEARCH
+      NSTRNG = MAXSTR
+        CALL GETMF(KBIN,KBOUT,STRING,NSTRNG,ISTRNG,
+     +  LENGTH,MAXMOT,CLASS,RELMOT,RANGES,RANGEL,
+     +  RANGET,RANGEM,
+     +  STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,IDEV2,
+     +  WTSTR,JDEV,IOK,RELEND,IDSEQ,IDEV4,IDM,COMBIN,
+     +  MAXCLS,MATRIX,MAT1,
+     +  PMINT,PMAXT,PROBT,EXPTT,CHRSET,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KEYNS,NAMSAV,FILNAM,0,TITLEP)
+      IF(IOK.NE.0)RETURN
+      IF(NMOT.LT.1)RETURN
+C
+C
+C  DISPLAY THE SIGNAL DESCRIPTION
+C
+      RANGES(1) = 1
+      CALL DESSIG(
+     +            KBOUT,STRING,MAXSTR,
+     +            LENGTH,CLASS,RELMOT,RANGES,RANGEL,
+     +            RANGET,RANGEM,
+     +            STRNGS,NMOT,WEIGHT,MAXWTS,CUTOFF,
+     +            WTSTR,RELEND,COMBIN,KEYNS,TITLEP)
+C
+13    CONTINUE
+      WRITE(KBOUT,2003)PROBT
+2003  FORMAT(' Probability of finding pattern = ',E10.4)
+      WRITE(KBOUT,2004)EXPTT
+2004  FORMAT(' Expected number of matches per 1000 residues = ',E10.4)
+      CALL GETRL(0.,1.,1.,'Maximum pattern probability',
+     +XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      PMINC = XP
+      CALL GETRL(-9999.,9999.,-9999.,'Minimum pattern score',
+     +XP,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CUTSCR = XP
+      MINSCR = 9999999.
+      MAXSCR = -9999999.
+      IF(NSTRNG.GT.0)CALL CONNUM(STRING,STRNGI,NSTRNG)
+C
+C
+      ICREC = 0
+      IFINEX = 0
+      NENTRY = 0
+      ENTRYN = 0
+20    CONTINUE
+      IDSEQ = MAXSEQ
+      IF(LIBTYP.EQ.1) THEN
+        CALL CDROML(LIST,NAMIN,ENAMEL,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,SEQ,IDSEQ,
+     +  DIVDEV,NDIV,ICREC,IFINEX,TITLE,KBOUT,LTYPE,IOK)
+        IF(IOK.NE.0) GO TO 900
+        FILNAM = NAMIN
+      ELSE IF(LIBTYP.EQ.2) THEN
+        CALL RDPIRA(SEQ,IDSEQ,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL)
+      ELSE IF(LIBTYP.EQ.3) THEN
+        CALL RDFASA(SEQ,IDSEQ,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL,ENTRYN)
+      ELSE
+        RETURN
+      END IF
+      IF(IDSEQ.LT.1)GO TO 900
+C  CONVERT TO INTEGER
+      IF(IDSEQ.GT.0)CALL CONNUM(SEQ,SEQI,IDSEQ)
+C
+      NENTRY = NENTRY + 1
+      RANGES(1) = 1
+      RANGEL(1) = IDSEQ
+C
+C  DO THE SEARCH
+C
+      CALL SRCSIH(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATRIX,MATCHQ,RELEND,
+     +IDEV1,LAST5,LAST3S,LAST3E,
+     +IOPT,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,SEQI,STRNGI,TITLE,FILNAM,
+     +MAT1,PMINT,PMAXT,PROBT,PMINC,IDEVN,JSTART,KEYNS,
+     +BESTP,BESTQ,BESTS,JOPT)
+C
+C
+      GO TO 20
+C
+C
+900   CONTINUE
+      WRITE(KBOUT,1002)ITOTAL
+1002  FORMAT(' Total matches found',I7)
+      WRITE(KBOUT,1006)MINSCR,MAXSCR
+1006  FORMAT(' Minimum and maximum observed scores=',2F12.2)
+      WRITE(KBOUT,1009)NENTRY
+1009  FORMAT(' ',I7,' Entries processed')
+      END
+C***************************************************************
+      SUBROUTINE SRCSIH(KBIN,KBOUT,WTSTR,LENGTH,CLASS,
+     +RANGES,RANGEL,START,IEND,RELMOT,MATCHP,STRNGS,WEIGHT,
+     +MAXWTS,CUTOFF,MATCHS,NMOT,STRING,MAXSTR,SEQ,IDSEQ,
+     +RANGET,RANGEM,IENTRY,START2,IEND2,MATRIX,MATCHQ,RELEND,
+     +IDEVOT,LAST5,LAST3S,LAST3E,
+     +IOPT,ITOTAL,KSTART,IDM,COMBIN,
+     +CUTSCR,MINSCR,MAXSCR,SEQI,STRNGI,TITLE,FILNAM,
+     +MAT1,PMINT,PMAXT,PROBT,PMINC,IDEVN,JSTART,KEYNS,
+     +BESTP,BESTQ,BESTS,JOPT)
+C  ROUTINE TO SEARCH FOR SIGNALS COMPOSED OF MOTIFS
+C  WEIGHT  = WEIGHTS FOR MATRICES
+C  CUTOFF  = CUTOFF SCORES
+C  LENGTH  = MOTIF LENGTHS
+C  CLASS   = MOTIF CLASS
+C  COMBIN  = LOGICAL COMBINATION A, O, N
+C  WTSTR   = POINTER TO WEIGHT STARTS
+C  RANGES  = RANGES START
+C  RANGEL  = RANGE LENGTH (A DISTANCE MEASURED FROM RANGES)
+C  RELMOT  = MOTIF NUMBER THAT A RANGE IS RELATIVE TO IE THE
+C            FIRST MOTIF'S RANGE IS RELATIVE TO MOTIF 0, BUT
+C            ANY OTHER MOTIF MAY HAVE TO BE DEFINED RELATIVE
+C            TO ANY OTHER. THE MOST COMMON WOULD BE THE FIRST
+C            MOTIF OR THE LAST ONE SEARCHED FOR.
+C  RELEND  = IS A SPECIAL CASE FLAG FOR STEMS. IT ALLOWS OTHER
+C            MOTIFS TO HAVE THEIR POSITIONS RELATIVE TO THE 3' SIDE
+C            OF A STEM. IT IS 5 FOR THE 5 PRIME SIDE, 3 FOR 3' SIDE
+C  START   = RANGE START DURING SEARCH (SOME POSITIONS MAY HAVE
+C            BEEN TRIED)
+C  IEND    = RANGE END POSITION FOR CURRENT INITIAL START (WHEN IT
+C            IS RESET DEPENDS ON WHETHER IT IS DEFINED RELATIVE TO 
+C            THE FIRST OR THE PREVIOUS MOTIF. IF IT IS DEFINED RELATIVE
+C            TO THE FIRST MOTIF IT IS RESET WHEN WE FIND A MATCH FOR THE
+C            FIRST MOTIF. IF IT IS DEFINED RELATIVE TO THE PREVIOUS MOTIF
+C            WE MUST RESET WHEN WE MOVE FORWARD ONE MOTIF. I THINK THIS
+C            CAN BE TAKEN CARE OF BY UPDATING ALL THOSE MOTIFS THAT ARE
+C            DEFINED TO THE CURRENT MOTIF EVERY TIME WE MOVE FORWARD 
+C            ONE MOTIF (IE IT INCLUDES THE FIRST MOTIF SO IT IS NOT A
+C            SPECIAL CASE). OTHERWISE WE UPDATE POSITIONS WHEN WE FIND
+C            A MATCH FOR THEM (WE SET TO THE MATCH POSITION PLUS 1)
+C  MATCHP  = LIST OF CURRENT MATCH POSITIONS FOR EACH MOTIF
+C  MATCHS  = LIST OF CURRENT MATCH SCORES FOR EACH MOTIF
+C  IFOUND  = A FLAG TO INDICATE SUCCESS OR FAILURE OF A SEARCH ROUTINE
+C            1 = SUCCESS, 0 = FAIL
+C  STRNGS  = POINTER TO STRING STARTS IN CHARACTER ARRAY STRING
+C  TEMPORARY VALUES ARE:
+C  MOTIF   = ACTUAL MOTIF NUMBER
+C  ICLASS  = CLASS
+C  ILEN    = LENGTH OF MOTIF
+C  CUT     = CUTOFF
+C  WT      = START OF WEIGHTS FOR THIS MOTIF
+C  ISTRST  = START OF STRING
+C  RANGET  = START OF 3' RANGE FOR STEM SEARCHES
+C  RANGEM  = END OF 3' RANGE FOR STEM SEARCHES
+C  IENTRY  = FLAG TO SIGNIFY MORE 3' STEM POSITIONS FOR LAST 5' START
+C            0 = NONE, ON RETURN FROM MOTIF6 IT CONTAINS THE 3' MATCH 
+C            POSITION
+C  MATCHQ  = MATCH POSITION FOR STEM SEARCH
+C  COMB    = LOGICAL COMBINATION A, O, N
+      INTEGER WTSTR(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT),START(NMOT),IEND(NMOT)
+      INTEGER RELMOT(NMOT),MATCHP(NMOT),STRNGS(NMOT)
+      INTEGER RANGET(NMOT),RANGEM(NMOT),IENTRY(NMOT),RELEND(NMOT)
+      INTEGER START2(NMOT),IEND2(NMOT),MATRIX(IDM,IDM),MATCHQ(NMOT)
+      INTEGER LAST5(NMOT),LAST3S(NMOT),LAST3E(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT),MATCHS(NMOT)
+      REAL MINSCR,MAXSCR
+      CHARACTER SEQ(IDSEQ),STRING(MAXSTR)
+      CHARACTER COMBIN(NMOT),COMB
+      INTEGER STRNGI(MAXSTR),SEQI(IDSEQ),MAT1(IDM,IDM)
+      CHARACTER TITLE*(*),FILNAM*(*)
+      CHARACTER*(*) KEYNS(NMOT)
+      INTEGER BESTP(NMOT),BESTQ(NMOT)
+      REAL BESTS(NMOT),MAXSS,MINMSS
+      PARAMETER (MINMSS = -999999.)
+C
+C
+C  INITIALIZE
+      JMOT = 0
+      IRET = 0
+5     CONTINUE
+      JMOT = JMOT + 1
+      IF(JMOT.LE.NMOT)THEN
+        IF(RELMOT(JMOT).EQ.0)THEN
+          START(JMOT) = RANGES(1)
+          IEND(JMOT) = RANGES(1) + RANGEL(1) -1
+          GO TO 5
+        END IF
+      END IF
+      MOTIF  = 1
+      ICLASS = CLASS(1)
+      ILEN   = LENGTH(1)
+      CUT    = CUTOFF(1)
+      IWT    = WTSTR(1)
+      ISTRST = STRNGS(1)
+      IENTRY(1) = 0
+      COMB   = COMBIN(1)
+      DO 10 I = 1,NMOT
+        MATCHP(I) = 0
+10    CONTINUE
+      MAXSS = MINMSS
+C
+C
+C
+C
+C
+100   CONTINUE
+C
+C
+C  THIS A CLASS CLASS MOTIF, PERFORM THE APPROPRIATE SEARCH IF THE START 
+C  POSITION IS >0. (IF IT IS NOT THE CURRENT MOTIF IS A NOT THAT HAS
+C  ALREADY BEEN SEARCHED FOR
+C
+      IFOUND = 0
+      IF(START(MOTIF).GT.0)THEN
+C
+C
+      IF(ICLASS.EQ.1)THEN
+        CALL MOTIF1(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,
+     +  CUTOFF(MOTIF),0)
+      ELSE IF(ICLASS.EQ.2)THEN
+        CALL MOTIF2(SEQ,IDSEQ,STRING(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND)
+      ELSE IF(ICLASS.EQ.3)THEN
+        CALL MOTFI3(SEQI,IDSEQ,STRNGI(ISTRST),ILEN,START(MOTIF),
+     +  IEND(MOTIF),CUT,MATCHP(MOTIF),MATCHS(MOTIF),IFOUND,MATRIX,IDM)
+      ELSE IF(ICLASS.EQ.4)THEN
+        CALL MOTFI4(SEQI,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE IF(ICLASS.EQ.5)THEN
+      CALL MOTIF8(SEQI,IDSEQ,MATRIX,LENGTH(MOTIF),START(MOTIF),
+     +            IEND(MOTIF),RANGET(MOTIF),RANGEM(MOTIF),
+     +            CUTOFF(MOTIF),MATCHP(MOTIF),MATCHS(MOTIF),
+     +            IENTRY(MOTIF),IFOUND,MATCHQ(MOTIF),
+     +            LAST5(MOTIF),LAST3S(MOTIF),LAST3E(MOTIF),IDM)
+      ELSE IF(ICLASS.EQ.6)THEN
+        CALL MOTFI4(SEQI,IDSEQ,ILEN,START(MOTIF),
+     +  IEND(MOTIF),WEIGHT(IWT),CUT,MATCHP(MOTIF),MATCHS(MOTIF),
+     +  IFOUND,IDM)
+      ELSE
+        WRITE(KBOUT,*)'UNKNOWN CLASS!!'
+      END IF
+C
+C
+      END IF
+C
+C
+C   MATCH FOUND WHEN MATCH WANTED ?
+C
+C
+C
+      IF(((IFOUND.EQ.0).AND.(COMB.NE.'N')).OR.
+     +   ((IFOUND.GT.0).AND.(COMB.EQ.'N')))THEN
+C
+C   NO SO GO BACK OR SIDEWAYS ONE MOTIF
+C
+C
+        CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +  RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +  COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+        IF(IRET.NE.0) THEN
+          IF(MAXSS.GT.MINMSS) THEN
+            CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +      CLASS,MATCHQ,IOPT,
+     +      KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +      TITLE,FILNAM,
+     +      MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +      WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,IDEVN,
+     +      JSTART,KEYNS,BESTP,BESTQ,BESTS,2,MAXSS)
+            ITOTAL = ITOTAL + 1
+          END IF
+          RETURN
+        END IF
+C
+C
+      ELSE
+C
+C
+C   MATCH FOUND. 
+C
+C
+C   PREPARE FOR NEXT SEARCH THIS MOTIF BY INCREMENTING POINTER
+C   TO SEARCH RANGE (NOT FOR CLASS 6 WHICH IS HANDLED BY MOTIF6)
+C
+C
+       IF(COMB.EQ.'N')THEN
+         START(MOTIF) = -9
+       ELSE
+         IF(ICLASS.NE.5) START(MOTIF) = MATCHP(MOTIF) + 1
+       END IF
+C
+C   TRY GOING FORWARD ONE MOTIF
+C
+C
+      CALL FORWAD(CLASS,LENGTH,CUTOFF,NMOT,
+     +  MOTIF,ICLASS,ILEN,CUT,IWT,RELMOT,START,IEND,
+     +  RANGES,RANGEL,STRNGS,ISTRST,WTSTR,IDSEQ,IENTRY,
+     +  RANGET,RANGEM,START2,IEND2,MATCHQ,RELEND,MATCHP,IDSPLY,
+     +  COMBIN,COMB)
+C
+C
+C
+C   IS THIS THE LAST MOTIF? IF SO DISPLAY THE MATCH
+C
+C
+        IF(IDSPLY.EQ.1)THEN
+          IF(JOPT.NE.1) ITOTAL = ITOTAL + 1
+          CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +    CLASS,MATCHQ,IOPT,
+     +    KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +    TITLE,FILNAM,
+     +    MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +    WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,IDEVN,
+     +    JSTART,KEYNS,BESTP,BESTQ,BESTS,JOPT,MAXSS)
+C
+C
+C  HORRIBLE SPECIAL CASE - IF LAST MOTIF IS NOTTED WE MUST MOVE BACK AFTER
+C  DISPLAY
+          IF(COMB.EQ.'N')THEN
+            CALL BAKSID(CLASS,LENGTH,CUTOFF,STRNGS,NMOT,
+     +      MOTIF,ICLASS,ILEN,CUT,IWT,ISTRST,WTSTR,
+     +      RELMOT,START,IEND,MATCHQ,RANGES,RANGEL,RELEND,IRET,MATCHP,
+     +      COMBIN,COMB)
+C
+C
+C   IF CANT GO BACK ANY FURTHER QUIT
+            IF(IRET.NE.0) THEN
+              IF(MAXSS.GT.MINMSS) THEN
+                CALL DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEVOT,
+     +          CLASS,MATCHQ,IOPT,
+     +          KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +          TITLE,FILNAM,
+     +          MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +          WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,IDEVN,
+     +          JSTART,KEYNS,BESTP,BESTQ,BESTS,2,MAXSS)
+                ITOTAL = ITOTAL + 1
+              END IF
+              RETURN
+            END IF
+          END IF
+        END IF
+C
+C
+      END IF
+C
+C
+C   GO BACK FOR NEXT SEARCH
+C
+C
+      GO TO 100
+      END
+C*********************************************************************
+      SUBROUTINE MOTFI3(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,
+     +MATCHP,MATCHS,IFOUND,MATRIC,IDM)
+      INTEGER SEQ(IDIM1),STRING(IDIM2)
+      INTEGER MATRIC(IDM,IDM)
+      REAL MATCHS
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      IF(ISTART.GT.IDIM1)RETURN
+      CALL SQFTI5(SEQ,IDIM1,STRING,IDIM2,ISTART,IEND,CUTOFF,MATCHS,
+     +IFOUND,MATRIC,IDM)
+      IF(IFOUND.EQ.0)RETURN
+C  SAVE MATCH POSITION
+      MATCHP = IFOUND
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE SQFTI5(SEQ,IDIM1,STRING,IDIM2,
+     1IS,IE,MINSC,MATCHS,IFOUND,MATRIC,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIM1),STRING(IDIM2)
+      REAL MATCHS,MINSC
+      INTEGER MATRIC(IDM,IDM)
+      MINSCR = MINSC
+C
+      IDIF=(IE-IS+2)-IDIM2
+C   IDIF IS THE NUMBER OF POSNS TO TRY
+C   IPSTR GOES FROM 1 TO IDIM2 IDIF TIMES
+C   TRY ALL POSSIBLE POSITIONS FOR MATCHING AND SCORE FOR EACH
+C   POINT TO ARRAY ELEMENT CORRESPONDING TO FIRST BASE
+      IPSEQ=IS
+      DO 200 I=1,IDIF
+        NTOT=0
+        IP=IPSEQ
+        DO 100 J=1,IDIM2
+          NTOT = NTOT + MATRIC(SEQ(IP),STRING(J))
+          IP=IP+1
+100     CONTINUE
+C   END OF COUNTING FOR THIS POSITION.IS TOTAL HIGH ENOUGH?
+        IF(NTOT.GE.MINSCR)THEN
+          MATCHS = NTOT
+          IFOUND = IP-IDIM2
+          RETURN
+        END IF
+        IPSEQ=IPSEQ+1
+200   CONTINUE
+      IFOUND = 0
+      RETURN
+      END
+C*********************************************************************
+      SUBROUTINE MOTFI4(SEQ,IDIM1,LENGTH,ISTART,IEND,
+     +WEIGHT,CUTOFF,MATCHP,MATCHS,IFOUND,IDM)
+      REAL WEIGHT(IDM,LENGTH)
+      REAL MATCHS
+      INTEGER SEQ(IDIM1)
+      IFOUND = 0
+      IF(ISTART.LT.1)ISTART=1
+      L1 = IEND-ISTART+1
+      IF(ISTART.GT.IDIM1)RETURN
+      IF(L1.LT.LENGTH)RETURN
+      DO 10 I=ISTART,ISTART+L1-LENGTH
+        SUM = 0.
+        K = 0
+        DO 5 J=I,I+LENGTH-1
+          K = K + 1
+          SUM = SUM + WEIGHT(SEQ(J),K)
+ 5        CONTINUE
+          IF(SUM.GE.CUTOFF) THEN
+            MATCHP = I
+            MATCHS = SUM
+            IFOUND = I
+            RETURN
+          END IF
+ 10       CONTINUE
+      IFOUND = 0
+      END
+C*********************************************************************
+      SUBROUTINE FMOTI4(SEQ,IDIM,WT,LENGTH,CUTOFF,SUM,IFOUND,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIM)
+      REAL WT(IDM,LENGTH)
+      DO 10 I=1,IDIM-LENGTH+1
+        SUM=0.
+        K=0
+        DO 5 J=I,I+LENGTH-1
+          K=K+1
+          SUM=SUM+WT(SEQ(J),K)
+5       CONTINUE
+        IF(SUM.GE.CUTOFF)THEN
+        IFOUND = I
+        RETURN
+        END IF
+10    CONTINUE
+      IFOUND = 0
+      END
+C*********************************************************************
+      SUBROUTINE MOTIF8(SEQ,IDSEQ,MATRIX,LENGTH,I5STAR,I5END,
+     +                  I3STAR,I3END,CUTOFF,MATCHP,MATCHS,
+     +                  IENTRY,IFOUND,MATCHQ,
+     +                  LAST5,LAST3S,LAST3E,IDM)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM),REPEET
+      REAL MATCHS
+      EXTERNAL REPEET
+C  WE HAVE A START POSITION FOR THE 5' END OF THE 5' END OF
+C  A POTENTIAL REPEAT I5STAR AND AN END DEFINED BY A RANGE I5END
+C  WE HAVE A REPEAT LENGTH LENGTH
+C  WE HAVE A RANGE OF POSITIONS FOR THE 3' STEM TO START
+C  I3STAR TO I3END
+C  TRY THE TIGHTEST LOOPS FIRST
+C  BUT FIRST WE MAY HAVE TO FINISH A PREVIOUS SEARCH
+C  THIS IS DENOTED BY IENTRY NE 0.
+C  NOTE IENTRY IS ALSO USED TO RETURN THE 3' MATCH POSITION
+      ICUT = CUTOFF
+C      WRITE(*,*)'ICUT',ICUT
+      IFOUND = 0
+      JENTRY = IENTRY
+      IENTRY = 0
+      IF(I5STAR.LT.1)I5STAR=1
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      IF(JENTRY.NE.0)THEN
+        I1 = LAST5
+C        WRITE(*,*)'I1,LAST3S,LAST3E',I1,LAST3S,LAST3E
+        DO 50 J=LAST3S+1,LAST3E
+          J1 = J
+          ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+          IF(ISUM.GE.ICUT)THEN
+            MATCHP = I1
+            IENTRY = J1
+            MATCHQ = J1
+            MATCHS = ISUM
+            IFOUND = MATCHP
+            LAST3S = J1
+            RETURN
+          END IF
+50      CONTINUE
+C  NOW MOVE 5' STEM START POSITION (WE HAVE JUST FINISHED THE LAST)
+C  TO THE LAST MATCH + 1
+        I5STAR = MATCHP + 1
+      END IF
+C  SET ENTRY FLAG TO ZERO TO SIGNIFY LAST SEARCH NOW COMPLETED
+      IENTRY = 0
+      ISUM = 0
+      LOOPI1 = I5STAR
+      IF((I5STAR+I3STAR+LENGTH-2).GT.IDSEQ)RETURN
+      LOOPI2 = MIN(IDSEQ-2*LENGTH+1,I5END)
+C      WRITE(*,*)'IDSEQ,LENGTH,I5STAR,I5END',
+C     +IDSEQ,LENGTH,I5STAR,I5END
+C
+C  TRY ALL STEM STARTS FROM 5' START TO 5' END
+C
+C
+      DO 200 I = LOOPI1,LOOPI2
+C
+C
+        I1 = I
+C
+C   TRY ALL LOOPS FROM 3' START TO 3' END
+C
+        LOOPJ1 = I + I3STAR -1
+        IF((LOOPJ1+LENGTH-1).GT.IDSEQ)RETURN
+        LOOPJ2 = MIN(IDSEQ-LENGTH+1,I+I3END-1)
+        
+C      WRITE(*,*)'I3STAR,I3END',I3STAR,I3END
+C
+C
+C
+        DO 100 J = LOOPJ1,LOOPJ2
+C
+C
+C
+         J1 = J
+C  IN REPEAT NOTE THAT
+C  THE 5' END POINTER I1 GOES FORWARDS
+C  THE 3' END POINTER J1 GOES FORWARDS
+C
+         ISUM = REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I1,J1,IDM)
+C  RETURN IF GOOD ENOUGH
+C         WRITE(*,*)ISUM
+         IF(ISUM.GE.ICUT)THEN
+           MATCHP = I1
+           IENTRY = J1
+           MATCHQ = J1
+           MATCHS = ISUM
+           IFOUND = MATCHP
+C  SAVE CURRENT POSITION FOR LATER ENTRIES
+           LAST5  = I1
+           LAST3S = J1
+           LAST3E = LOOPJ2
+           RETURN
+         END IF
+100    CONTINUE
+200   CONTINUE
+      END
+C*********************************************************************
+      INTEGER FUNCTION REPEET(SEQ,IDSEQ,MATRIX,LENGTH,I5P,I3P,IDM)
+      INTEGER SEQ(IDSEQ)
+      INTEGER MATRIX(IDM,IDM)
+C  THE 5' END POINTER GOES FORWARDS
+C  THE 3' END POINTER GOES FORWARDS
+      L=0
+      I5=I5P-1
+      I3=I3P-1
+      DO 100 I=1,LENGTH
+       I5 = I5 + 1
+       I3 = I3 + 1
+C      WRITE(*,*)'I5,I3',I5,I3
+       L5 = SEQ(I5)
+       L3 = SEQ(I3)
+       L = L + MATRIX(L5,L3)
+100   CONTINUE
+      REPEET = L
+      END
+      SUBROUTINE DSPLAZ(MATCHP,LENGTH,NMOT,SEQ,IDSEQ,IDEV,
+     +CLASS,MATCHQ,IOPT,KSTART,MATCHS,CUTSCR,MINSCR,MAXSCR,
+     +TITLE,FILNAM,
+     +    MATRIX,MAT1,IDM,PMINT,PMAXT,PROBT,
+     +    WEIGHT,MAXWTS,WTSTR,CUTOFF,PMINC,RANGES,RANGEL,
+     +    IDEV1,JSTART,KEYNS,BESTP,BESTQ,BESTS,JOPT,MAXSS)
+      INTEGER MATCHP(NMOT),LENGTH(NMOT),CLASS(NMOT)
+      INTEGER MATCHQ(NMOT)
+      CHARACTER SEQ(IDSEQ),TITLE*(*),FILNAM*(*)
+      REAL MATCHS(NMOT),MINSCR,MAXSCR
+      INTEGER MATRIX(IDM,IDM),MAT1(IDM,IDM),WTSTR(NMOT)
+      REAL WEIGHT(MAXWTS),CUTOFF(NMOT)
+      INTEGER RANGES(NMOT),RANGEL(NMOT)
+      CHARACTER DASH
+      CHARACTER*(*) KEYNS(NMOT)
+C stuff for best
+      INTEGER BESTP(NMOT),BESTQ(NMOT)
+      REAL BESTS(NMOT),MAXSS
+      EXTERNAL PSCORE
+      SAVE DASH
+      DATA DASH/'-'/
+C jopt 1 get best match for any individual sequence, then
+C display it. So check each match for being best (>maxss), if it is 
+C save its coords in bestp, bestq. Keep a note that a score
+C has been recorded for this sequence (actually noted by bestp(1) ne.0)
+C when we finish a sequence (denoted by jopt = 2) put all the
+C saved values into matchp, matchq and process as normal.
+      IF(JOPT.NE.0) THEN
+        IF(JOPT.EQ.2) THEN
+          DO 5 I=1,NMOT
+            MATCHP(I) = BESTP(I)
+            MATCHQ(I) = BESTQ(I)
+            MATCHS(I) = BESTS(I)
+ 5          CONTINUE
+        ELSE IF(JOPT.EQ.1) THEN
+C  Add scores
+          T = 0.
+          DO 6 I = 1,NMOT
+            IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+ 6        CONTINUE
+          IF(T.GT.MAXSS) THEN
+            DO 7 I=1,NMOT
+              BESTP(I) = MATCHP(I)
+              BESTQ(I) = MATCHQ(I)
+              BESTS(I) = MATCHS(I)
+ 7            CONTINUE
+            MAXSS = T
+          END IF
+          RETURN
+        END IF
+      END IF
+C  Add scores
+      T = 0.
+      DO 10 I = 1,NMOT
+        IF(MATCHP(I).NE.0) T = T + MATCHS(I)
+10    CONTINUE
+      POBS = 1.0
+      IF(PMINC.LT.1.0)THEN
+C  Calc prob
+        DO 20 I = 1,NMOT
+          IF(MATCHP(I).NE.0)THEN
+            CALL GETP(CLASS(I),SEQ(MATCHP(I)+KSTART-1),LENGTH(I),
+     +      IDM,MATRIX,MAT1,WEIGHT(MAX(1,WTSTR(I))))
+            PROB = PSCORE(MATCHS(I))
+            POBS = POBS * PROB
+          END IF
+20      CONTINUE
+        IF((PMINC.LT.1.0).AND.(POBS.GT.PMINC))RETURN
+      END IF
+C
+      IF(T.GT.MAXSCR) MAXSCR = T
+      IF(T.LT.MINSCR) MINSCR = T
+      IF(T.LT.CUTSCR) RETURN
+C
+C Motif by motif
+C
+      IF(IOPT.EQ.1)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+1001    FORMAT(' >',A,' ',F10.3,' ',A)
+        DO 100 I=1,NMOT
+          J = I
+C
+C  Check for no match (needed for ored motifs)
+C
+          IF(MATCHP(J).NE.0)THEN
+            WRITE(IDEV,1000)MATCHP(J)+KSTART-1,MATCHS(J),KEYNS(I)
+            WRITE(IDEV,1002)
+     +      (SEQ(K),K=MATCHP(J),MATCHP(J)+LENGTH(J)-1)
+C  Repeat ?
+            IF(CLASS(J).EQ.5)THEN
+              WRITE(IDEV,1002)
+     +        (SEQ(K),K=MATCHQ(J),MATCHQ(J)+LENGTH(J)-1)
+              WRITE(IDEV,1000)MATCHQ(J)+KSTART-1
+            END IF
+          END IF
+100     CONTINUE
+1000    FORMAT(' ',I7,' ',F10.3,' ',A8)
+1002    FORMAT('       ',60A1)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+1004    FORMAT(' Probability =',E10.4)
+        RETURN
+      END IF
+C
+C Title,score only
+C
+      IF(IOPT.EQ.3)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+1003    FORMAT(' ',F12.5)
+      END IF
+C
+C Inclusive
+C
+      IF(IOPT.EQ.2)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        MINP = 999999
+        MAXP = -999999
+        DO 300 I = 1, NMOT
+          K = MATCHP(I)
+          IF(K.NE.0)THEN
+            IF(K.LT.MINP)MINP = K
+            K = K + LENGTH(I) - 1
+C  Repeat ?
+            IF(CLASS(I).EQ.5) K = MATCHQ(I) + LENGTH(I) - 1
+            IF(K.GT.MAXP)MAXP = K
+          END IF
+300     CONTINUE
+        WRITE(IDEV,1000)MINP+KSTART-1
+        WRITE(IDEV,1002)
+     +  ((SEQ(K1),K1=K2,MIN(K2+59,MAXP)),K2=MINP,MAXP,60)
+        IF(PMINC.LT.1.0)WRITE(IDEV,1004)POBS
+        RETURN
+      END IF
+C
+C write file of whole seq
+C
+      IF(IOPT.EQ.4)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        CALL OPENRS(IDEV1,FILNAM,IOK,LLL,1)
+        IF(IOK.NE.0)THEN
+          WRITE(IDEV,*)' Error opening sequence file'
+          RETURN
+        END IF
+C  Want first character from sequence to start at jstart
+C  So how many dashes required?
+        IDASH = JSTART - MATCHP(1)
+        IF(IDASH.GT.0)CALL PADOUT(IDEV1,DASH,60,IDASH)
+C  Write up to and including first motif
+        J1 = MATCHP(1)+LENGTH(1)-1
+        WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=1,J1)
+        DO 400 I=2,NMOT
+          J = I
+C  Put dashes for next gap in now. what is max gap?
+          IMAXG = RANGES(J) - LENGTH(J-1) + RANGEL(J) - LENGTH(J)
+C  What is actual gap?
+          IG = MATCHP(J) - MATCHP(J-1) - LENGTH(J-1)
+C  Want to put in difference number of dashes
+          IDASH = IMAXG - IG
+          IF(IDASH.GT.0)CALL PADOUT(IDEV1,DASH,60,IDASH)
+C  Write up to and including next motif
+          J2 = MATCHP(J) + LENGTH(J) - 1
+          WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=J1+1,J2)
+          J1 = J2
+400     CONTINUE
+C  Write to end of sequence
+        WRITE(IDEV1,1005,ERR=401)(SEQ(K),K=J1+1,IDSEQ)
+401     CONTINUE
+        CLOSE(UNIT=IDEV1)
+1005    FORMAT(' ',60A1)
+      END IF
+C
+C write file inclusive
+C
+      IF(IOPT.EQ.5)THEN
+        WRITE(IDEV,1001)FILNAM(1:10),T,TITLE
+        CALL OPENRS(IDEV1,FILNAM,IOK,LLL,1)
+        IF(IOK.NE.0)THEN
+          WRITE(IDEV,*)' Error opening sequence file'
+          RETURN
+        END IF
+C  Write first motif
+        J1 = MATCHP(1)+LENGTH(1)-1
+        WRITE(IDEV1,1005,ERR=501)(SEQ(K),K=MATCHP(1),J1)
+        DO 500 I=2,NMOT
+          J = I
+C  Put dashes for next gap in now. what is max gap?
+          IMAXG = RANGES(J) - LENGTH(J-1) + RANGEL(J) - LENGTH(J)
+C  What is actual gap?
+          IG = MATCHP(J) - MATCHP(J-1) - LENGTH(J-1)
+C  Want to put in difference number of dashes
+          IDASH = IMAXG - IG
+          IF(IDASH.GT.0)CALL PADOUT(IDEV1,DASH,60,IDASH)
+C  Write up to and including next motif
+          J2 = MATCHP(J) + LENGTH(J) - 1
+          WRITE(IDEV1,1005,ERR=501)(SEQ(K),K=J1+1,J2)
+          J1 = J2
+500     CONTINUE
+501     CONTINUE
+        CLOSE(UNIT=IDEV1)
+      END IF
+      END
+      SUBROUTINE PADOUT(IDEV,CHAR,LINLEN,NCHAR)
+      CHARACTER CHAR
+C  HOW MANY LINES?
+      NLINE = 1 + (NCHAR-1)/LINLEN
+      K1 = 1
+      K2 = MIN(NCHAR,LINLEN)
+      DO 10 I = 1,NLINE
+        WRITE(IDEV,1000,ERR=20)(CHAR,K=K1,K2)
+        K1 = K2 + 1
+        K2 = K1 + LINLEN - 1
+        K2 = MIN(K2,NCHAR)
+10    CONTINUE
+20    CONTINUE
+1000  FORMAT(' ',60A1)
+      END
+      SUBROUTINE SETCMP(COMPIN,IDM)
+      PARAMETER (MAXCHR = 26)
+      REAL COMPIN(IDM)
+      COMMON /COMPC/COMP(MAXCHR)
+      SAVE /COMPC/
+      DO 10 I = 1,MAXCHR
+        COMP(I) = 0.0
+10    CONTINUE
+      T = 0.
+      DO 20 I = 1,IDM
+        COMP(I) = COMPIN(I)
+        T = T + COMPIN(I)
+20    CONTINUE
+      DO 30 I = 1,IDM
+        COMP(I) = COMP(I) / T
+30    CONTINUE
+      END
diff --git a/src/staden/pipmen.f b/src/staden/pipmen.f
new file mode 100644
index 0000000..06b7fec
--- /dev/null
+++ b/src/staden/pipmen.f
@@ -0,0 +1,88 @@
+      SUBROUTINE MENU(OPT,KOPT,MOPT,MAXOPT,MINMEN,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER OPT
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+      CHARACTER HELPF*(*)
+1     CONTINUE
+      CALL CLEARV
+      IF(MOPT.EQ.0)THEN
+      WRITE(KBOUT,5000)
+5000  FORMAT(' Menus and their numbers are '/
+     +' m0 = This menu'/
+     1' m1 = General'/
+     2' m2 = Screen control'/
+     5' m3 = Statistical analysis of content'/
+     1' m4 = Structure'/
+     3' m5 = Search'/
+     6'  ? = Help'/
+     1'  ! = Quit')
+      ELSE IF(MOPT.EQ.-1)THEN
+              WRITE(KBOUT,5001)
+5001            FORMAT(' General menu'/
+     +          '  0 = List of menus'/
+     1          '  3 = Read new sequence'/
+     1          '  4 = Redefine active region'/
+     +          '  5 = List the sequence'/
+     +          '  6 = List a text file'/
+     +          '  7 = Direct output to disk'/
+     +          '  8 = Write active sequence to disk'/
+     +          '  9 = Edit the sequence'/
+     +          ' 17 = Short sequence search'/
+     +          ' 18 = Compare a sequence'/
+     +          ' 19 = Compare a sequence using a score matrix'/
+     +          ' 27 = Back translate to dna')
+      ELSE IF(MOPT.EQ.-2)THEN
+                WRITE(KBOUT,5002)
+5002            FORMAT(' Screen control menu'/
+     +          '  0 = List of menus'/
+     +          ' 10 = Clear graphics'/
+     +          ' 11 = Clear text'/
+     +          ' 12 = Draw a ruler'/
+     +          ' 13 = Use cross hair'/
+     +          ' 14 = Reset margins'/
+     +          ' 15 = Label diagram'/
+     +          ' 16 = Display a map')
+      ELSE IF(MOPT.EQ.-3)THEN
+                WRITE(KBOUT,5005)
+5005            FORMAT(' Statistical analysis of content'/
+     +          '  0 = List of menus'/
+     +          ' 21 = Count amino acid composition'/
+     +          ' 22 = Plot hydrophobicity'/
+     +          ' 23 = Plot charge',/
+     +          ' 25 = Plot hydrophobic moment')
+      ELSE IF(MOPT.EQ.-4)THEN
+                WRITE(KBOUT,5007)
+5007            FORMAT(' Structure menu'/
+     +          '  0 = List of menus'/
+     +          ' 22 = Plot hydrophobicity'/
+     +          ' 23 = Plot charge'/,
+     +          ' 24 = Plot robson prediction',/
+     +          ' 25 = Plot hydrophobic moment',/
+     +          ' 26 = Draw helix wheel')
+      ELSE IF(MOPT.EQ.-5)THEN
+                WRITE(KBOUT,5004)
+5004            FORMAT(' Search menu'/
+     +          '  0 = List of menus'/
+     1          ' 17 = Search for short sequences'/
+     2          ' 18 = Compare a sequence'/
+     2          ' 19 = Compare a sequence using a score matrix'/
+     2          ' 20 = Search for a sequence using a weight matrix'/
+     2          ' 28 = Search for patterns of motifs')
+      END IF
+      WRITE(KBOUT,1001)
+1001  FORMAT(' ? Menu or option number=',$)
+      CALL GETOPT(KBIN,KOPT,OPT)
+      IF((OPT.LT.MINMEN).OR.(OPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(OPT.GE.0)) THEN
+        CALL HELP2(IHELPS(OPT),IHELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(OPT.LE.0) THEN
+        MOPT = OPT
+        GO TO 1
+      END IF
+      END
diff --git a/src/staden/pl4010.f b/src/staden/pl4010.f
new file mode 100644
index 0000000..ac1a4cc
--- /dev/null
+++ b/src/staden/pl4010.f
@@ -0,0 +1,554 @@
+C   PL4010
+C   AUTHOR: RODGER STADEN 
+C   5-4-90 Changed to allow redirection of graphics to a file
+C          and included ndc which was in plot89
+C   15-10-91 Moved IANDRS, IORRS and WRITEB to seeme.f
+C   SET OF FORTRAN77 GRAPHICS ROUTINES FOR TEKTRONIX 4010
+C   AS IMPLEMENTED ON THE VT640 AND CIFER T4
+C   OPENGR OPENS GRAPHICS OUTPUT ON DEVICE IDEVGR
+C   IANDRS PERFORMS LOGICAL .AND. ON INTEGERS
+C   IORRS PERFORMS LOGICAL .OR. ON INTEGERS
+C   WRITEI CONVERTS INTEGER VARIABLES TO CHARACTER STRINGS
+C   WRITEB WRITES OUT CHARACTERS STORED IN A CHARACTER STRING
+C   WRITEB IS THE ONLY ROUTINE THAT WRITES TO THE GRAPHICS DEVICE
+C          AND NEEDS TO SUPPRESS CARRIAGE RETURN, LINE FEED.
+C          THIS IS DONE ON THE VAX BY SENDING A NUL CHARACTER, 
+C          OR ON THE SUN USING PUTC, OR ON THE IBM USING TRANSPARENT
+C          I/O. NOTE ON SOME MACHINES E.G.THE SUN BYTE SWITCHING MAY BE
+C          REQUIRED.
+C   XY4010 RETURNS THE CODES FOR 4010 X,Y COORDS
+C   ALPHAM PUTS THE TERMINAL IN ALPHA MODE
+C   VECTOM PUTS THE TERMINAL IN VECTOR MODE
+C   VT100M PUTS THE TERMINAL IN VT100 OR TRANSPARENT MODE
+C   BUFFGR RECEIVES AN INTEGER VALUE AND BUFFERS IT, IF
+C          THE BUFFER IS FULL OR A FLAG (FLSH) IS SET
+C          THE BUFFER IF FLUSHED
+C   CLEARG CLEARS THE GRAPHICS SCREEN (BOTH ON VT640)
+C   MOVEG MOVES TO X,Y
+C   DRAWG DRAWS TO X,Y
+C   POINTG DRAWS POINT AT X,Y USING MOVEG AND DRAWG
+C   XHAIR  PUTS UP X HAIR AND RETURNS X,Y
+C   DELAY  DELAYS PROCESS BY DOING NOTHING (REQUIRED FOR MODE SWITCHING)
+C   CLEARV CLEARS VT100 SCREEN WITHOUT BLANKING GRAPHICS
+C   WRITEG WRITES TEXT SENT AT GIVEN POSITION IX,IY
+C   INITGR INITIALISES MODE SWITCHING, WHICH IS DIFFERENT ON DIFFERENT
+C          MACHINES. SO FAR ONLY CATERS FOR VERSATERM PRO BEING DIFFERENT
+C          FROM ALL OTHERS WHEN GOING INTO VT100 MODE. ALSO DOES NOT CLEAR
+C          TEXT PLANE WHEN REQUESTED TO DO SO FOR VERSATERM.
+C   FLUSHG FLUSHES GRAPHICS BUFFER
+C   IN TEKTRONIX 4010 THERE ARE 4 MODES:
+C          ALPHA        FOR POSITIONING TEXT
+C          GRAPHIC PLOT FOR PLOTTING
+C          GRAPHIC INPUT FOR READING CURSOR POSITIONS
+*   THESE ROUTINES ARE FOR THE VT640 TERMINAL AND THE CIFER T4
+*   BOTH OF WHICH HAVE TWO PLANES: ONE A TEKTRONIX PLANE, THE OTHER
+*   A SCROLLING TEXT PLANE THAT CAN BE TREATED AS A VT100.
+*   THEY BOTH OFFER ENHANCEMENTS ON THE 4010 INCLUDING EXTRA MODES
+*   BUT THEY RESTRICT CHANGES BETWEEN MODES. SO FAR I HAVE ALWAYS
+*   GONE THROUGH WHAT IVE CALLED VECTOR MODE (EQUIVALENT TO GRAPHIC
+*   PLOT MODE)
+*   THE T4 NEEDS TO HAVE A HIGHER LEVEL OF MODES SETTING:
+*   TO RESPOND TO TEKTRONIX COMMANDS IT REQUIRES A PREVIOUS ESC P t THAT
+*   HAS NOT BEEN FOLLOWED BY ESC \ WHICH IS THE COMMAND TO GET OUT OF
+*   THIS HIGHER LEVEL MODE OR STATUS. THIS REQUIRES ALL ENTRIES TO VECTOR
+*   MODE TO BE PRECEDED BY ESC P T AND ENTRY TO VT100 MODE TO BE CHANGED.
+*   THESE EXTRA COMMANDS ARE INCLUDED IN THE CODE BUT ARE COMMENTED OUT
+*   IF YOU HAVE CIFER T4 TERMINALS REMOVE THE C'S FROM THE APPROPRIATE 
+*   LINES IN VT100M AND VECTOM
+*   STANDARD COMMANDS:
+*                             TEKTRONIX       VT640         T4
+*
+*  INTO GRAPHIC MODE          GS              GS            GS
+*                                                           (MUST BE IN
+*                                                            TEKTRONIX MODE)
+*  INTO ALPHA MODE            US
+*  PUT UP XHAIR               ESC SUB         ESC SUB
+*                                             (MUST BE IN VECTOR MODE)
+*  ERASE SCREEN,
+*  ENTER ALPHA MODE,
+*  HOME                       ESC FF          ESC FF
+*                                             (MUST BE IN VECTOR MODE)
+*  HARD COPY                  ESC ETB                     ESC ETB
+*                                                         (MUST BE IN ALPHA
+*                                                          MODE)
+*  ENTER VT100 MODE                           CAN         ESC \
+*                                                         (THIS TAKES US
+*                                                          RIGHT OUT OF
+*                                                          TEKTRONIX MODE)
+*  CLEAR VT100                                ESC [ 2 J   ESC [ 2 J
+*
+      SUBROUTINE XHAIR(IX,IY,TERM)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER ESC,SUB,FLSH,CHARS(5)
+      CHARACTER TERM
+      EXTERNAL IANDRS
+      SAVE ESC,SUB,FLSH,NO
+      DATA ESC/27/,SUB/26/,FLSH/1/,NO/0/
+C   MUST START IN  VECTOR MODES
+      IF(IGORF.NE.0) RETURN
+      IF(MODE.NE.VECTOR)CALL VECTOM
+      CALL BUFFGR(ESC,NO)
+      CALL BUFFGR(SUB,FLSH)
+C   GET COORDS FROM TERMINAL
+      CALL XHM(CHARS,TERM,IDEVGR)
+C   DECODE THEM
+      IX=IANDRS(CHARS(2),31)
+      IX=IX*32
+      IX=IX+IANDRS(CHARS(3),31)
+      IY=IANDRS(CHARS(4),31)
+      IY=IY*32
+      IY=IY+IANDRS(CHARS(5),31)
+C
+C convert to drawing board units
+C
+      IX = NINT(REAL(IX)/DBTDUX)
+      IY = NINT(REAL(IY)/DBTDUY)
+C  THIS EXITS TO ALPHA MODE
+      MODE=ALPHA
+      END
+      SUBROUTINE XY4010(IX,IY,LSX,MSX,LSY,MSY)
+C   AUTHOR: RODGER STADEN 
+      EXTERNAL IANDRS,IORRS
+C   ENCODES IX AND IY INTO
+C   THEIR LEAST AND MOST SIGNIFICANT PARTS FOR 4010 CODES
+C      MSX=((IX.AND.992)/32).OR.32
+C      LSX=(IX.AND.31).OR.64
+C      MSY=((IY.AND.992)/32).OR.32
+C      LSY=(IY.AND.31).OR.96
+       MSX=IANDRS(IX,992)
+       MSX=MSX/32
+       MSX=IORRS(MSX,32)
+       LSX=IANDRS(IX,31)
+       LSX=IORRS(LSX,64)
+       MSY=IANDRS(IY,992)
+       MSY=MSY/32
+       MSY=IORRS(MSY,32)
+       LSY=IANDRS(IY,31)
+       LSY=IORRS(LSY,96)
+      END
+       SUBROUTINE OPENGR(IDEVG)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+       CHARACTER*40 FILNAM
+C   SET DEVICE NAME
+       IDEVGR=IDEVG
+C   SET BUFFER POINTER TO ZERO
+       NBUFF=0
+C   SET MAXIMUM BUFFER SIZE
+       MAXBUF=80
+C   SET ARBITRARY VALUES TO MODES
+       VT100=0
+       VECTOR=1
+       ALPHA=2
+       MODE=0
+       CLOSE(UNIT=IDEVGR)
+       CALL OPENRS(IDEVG,FILNAM,IOK,LRECL,9)
+       IF(IOK.NE.0)WRITE(*,*)' ERROR OPENING GRAPHICS DEVICE'
+C   INITIALISE LAST VALUES
+      LMSY=0
+      LLSY=0
+      LMSX=0
+      LLSX=0
+      IGORF = 0
+      RETURN
+      END
+       SUBROUTINE OPENGF(IDEVG)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+       IDEVGR = IDEVG
+       IGORF = 1
+      END
+      SUBROUTINE ALPHAM
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+C   PUTS TERMINAL IN ALPHA MODE
+      INTEGER US,NO
+      SAVE US,NO
+      DATA US/31/,NO/0/
+      IF(MODE.EQ.ALPHA)RETURN
+      CALL DELAY(100000)
+      IF((ITERM.EQ.2).AND.(MODE.EQ.VT100)) THEN
+C   SEND ESC [ ? 3 8 h
+        CALL BUFFGR(27,NO)
+        CALL BUFFGR(91,NO)
+        CALL BUFFGR(63,NO)
+        CALL BUFFGR(51,NO)
+        CALL BUFFGR(56,NO)
+        CALL BUFFGR(104,NO)
+      END IF
+C   SEND TO OUTPUT ROUTINE
+      CALL BUFFGR(US,NO)
+C   SET MODE TO ALHPA
+      MODE=ALPHA
+      END
+      SUBROUTINE VECTOM
+C   PUTS TERMINAL IN VECTOR MODE
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+      INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER GS,NO,UPPERP,LOWERT,ESC
+      SAVE GS,NO,UPPERP,LOWERT,ESC
+      DATA GS/29/,NO/0/,UPPERP/80/,LOWERT/116/,ESC/27/
+      IF(MODE.EQ.VECTOR)RETURN
+      CALL DELAY(100000)
+C   SEND TO OUTPUT ROUTINE
+C   NEXT THREE LINE REQUIRED FOR CIFER T4
+C      CALL BUFFGR(ESC,NO)
+C      CALL BUFFGR(UPPERP,NO)
+C      CALL BUFFGR(LOWERT,NO)
+      IF((ITERM.EQ.2).AND.(MODE.EQ.VT100)) THEN
+C   SEND ESC [ ? 3 8 h
+        CALL BUFFGR(27,NO)
+        CALL BUFFGR(91,NO)
+        CALL BUFFGR(63,NO)
+        CALL BUFFGR(51,NO)
+        CALL BUFFGR(56,NO)
+        CALL BUFFGR(104,NO)
+      ELSE IF((ITERM.EQ.3).AND.(MODE.EQ.VT100)) THEN
+C   SEND ESC [ ? 3 8 h
+        CALL BUFFGR(27,NO)
+        CALL BUFFGR(91,NO)
+        CALL BUFFGR(63,NO)
+        CALL BUFFGR(51,NO)
+        CALL BUFFGR(56,NO)
+        CALL BUFFGR(104,NO)
+      END IF
+      CALL BUFFGR(GS,NO)
+C   SET MODE TO VECTOR
+      MODE=VECTOR
+      END
+      SUBROUTINE VT100M
+C   PUTS TERMINAL IN VT100 MODE
+C   NOTE FOR CIFER T4 SEND ESC\
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER ESC,FLSH,BSLASH,NO
+      SAVE ESC,FLSH,BSLASH,NO
+      DATA ESC/27/,FLSH/1/,BSLASH/92/,NO/0/
+      IF(MODE.EQ.VT100)RETURN
+C      CALL DELAY(100000)
+C   SEND TO OUTPUT ROUTINE
+C   NEXT TWO LINES RQUIRED FOR CIFER T4
+C      CALL BUFFGR(ESC,NO)
+C      CALL BUFFGR(BSLASH,NO)
+      IF(ITERM.EQ.0) THEN
+C  VT640 CAN NUL
+        CALL BUFFGR(24,NO)
+        CALL BUFFGR(0,FLSH)
+      ELSE IF(ITERM.EQ.1) THEN
+C  VERSATERM ESC 2
+        CALL BUFFGR(ESC,NO)
+        CALL BUFFGR(50,FLSH)
+      ELSE IF(ITERM.EQ.2) THEN
+C  THE XTERM BUFFER WILL ONLY BE FLUSHED IF WE CHANGE FROM
+C  FROM GRAPHICS TO ALPHA MODE
+        CALL BUFFGR(31,NO)
+C  XTERM ESC ETX
+        CALL BUFFGR(ESC,NO)
+        CALL BUFFGR(3,FLSH)
+      ELSE IF(ITERM.EQ.3) THEN
+C   SEND ESC [ ? 3 8 l
+        CALL BUFFGR(27,NO)
+        CALL BUFFGR(91,NO)
+        CALL BUFFGR(63,NO)
+        CALL BUFFGR(51,NO)
+        CALL BUFFGR(56,NO)
+        CALL BUFFGR(108,FLSH)
+      END IF
+C   SET MODE TO VT100
+      MODE=VT100
+      CALL DELAY(100000)
+      END
+      SUBROUTINE BLANKG
+C   CLEARS SCREEN
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER ESC,FF,NO
+      SAVE ESC,FF,NO
+      DATA ESC/27/,FF/12/,NO/0/
+      CALL DELAY(100000)
+C   FIRST PUT IN VECTOR MODE
+      IF(MODE.NE.VECTOR)CALL VECTOM
+      CALL BUFFGR(ESC,NO)
+      IF(ITERM.EQ.2) THEN
+        CALL BUFFGR(12,NO)
+      ELSE
+        CALL BUFFGR(FF,NO)
+      END IF
+C   SETS MODE TO ALHPA SO PUT TO VT100
+      CALL VT100M
+      END
+      SUBROUTINE BUFFGR(VALUE,FLSH)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER VALUE,FLSH,ARRAY(1)
+C   FLSH=0 MEANS ONLY FLUSH BUFFER IF IT IS FULL
+C   FLSH=1 MEANS FLUSH BUFFER AFTER ADDING VALUE
+C   FLSH=2 MEANS FLUSH BUFFER WITHOUT ADDING VALUE
+      IF(FLSH.EQ.0)THEN
+        NBUFF=NBUFF+1
+C   BUFFER FULL?
+        IF(NBUFF.GT.MAXBUF)THEN
+          CALL WRITEI(BUFFER,MAXBUF,IDEVGR)
+          CALL FLUSHS(IDEVGR)
+C   RESET BUFFER POINTER TO START FOR NEW VALUE
+          NBUFF=1
+        END IF
+        BUFFER(NBUFF)=VALUE
+      ELSE IF(FLSH.EQ.1)THEN
+        NBUFF=NBUFF+1
+        IF(NBUFF.GT.MAXBUF)THEN
+          CALL WRITEI(BUFFER,MAXBUF,IDEVGR)
+C   NOW WRITE VALUE
+C   NOTE SOME COMPILERS OBJECT IF A SINGLE VARIABLE IS SENT 
+C   WHEN A DUMMY ARRAY OF DIMENSION 1 IS EXPECTED. SO USE ARRAY(1)
+          ARRAY(1)=VALUE
+          CALL WRITEI(ARRAY,1,IDEVGR)
+          CALL FLUSHS(IDEVGR)
+          NBUFF=0
+        ELSE
+C               PUT VALUE IN BUFFER
+          BUFFER(NBUFF)=VALUE
+          CALL WRITEI(BUFFER,NBUFF,IDEVGR)
+          CALL FLUSHS(IDEVGR)
+          NBUFF=0
+        END IF
+      ELSE IF((FLSH.EQ.2).AND.(NBUFF.GT.0)) THEN
+        CALL WRITEI(BUFFER,NBUFF,IDEVGR)
+        CALL FLUSHS(IDEVGR)
+        NBUFF=0
+      END IF
+      END
+      SUBROUTINE MOVEXY(IX,IY)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER NO,GS
+      SAVE NO,GS
+      DATA NO/0/,GS/29/
+C   SEND GS TO OUTPUT ROUTINE TO PUT INTO VECTOR MODE
+      CALL BUFFGR(GS,NO)
+C
+C convert from drawing board units
+C
+      IXT = NINT(REAL(IX)*DBTDUX)
+      IYT = NINT(REAL(IY)*DBTDUY)
+      CALL XY4010(IXT,IYT,LSX,MSX,LSY,MSY)
+C   REMEMBER ORDER IMPORTANT
+      CALL BUFFGR(GS,NO)
+      CALL BUFFGR(MSY,NO)
+      CALL BUFFGR(LSY,NO)
+      CALL BUFFGR(MSX,NO)
+      CALL BUFFGR(LSX,NO)
+C   UPDATE LAST VALUES
+      LMSY=MSY
+      LLSY=LSY
+      LMSX=MSX
+      LLSX=LSX
+C   LEAVES IN VECTOR MODE
+      MODE=VECTOR
+      END
+      SUBROUTINE DRAWXY(IX,IY)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER NO
+      SAVE NO
+      DATA NO/0/
+      IF(MODE.NE.VECTOR)CALL VECTOM
+C
+C convert from drawing board units
+C
+      IXT = NINT(REAL(IX)*DBTDUX)
+      IYT = NINT(REAL(IY)*DBTDUY)
+      CALL XY4010(IXT,IYT,LSX,MSX,LSY,MSY)
+C   IF BYTES DO NOT CHANGE THEY DONT NEED TO BE SENT
+C   EXCEPT LSX WHICH ALWAYS REQUIRED
+      IF(MSY.NE.LMSY)CALL BUFFGR(MSY,NO)
+      IF(LSY.NE.LLSY)CALL BUFFGR(LSY,NO)
+      IF(MSX.NE.LMSX)THEN
+C   MUST SEND LSY IF MSX CHANGED (BUT NOT IF ALREADY SENT)
+      IF(LSY.EQ.LLSY)CALL BUFFGR(LSY,NO)
+      CALL BUFFGR(MSX,NO)
+      END IF
+C   ALWAYS SEND LSX
+      CALL BUFFGR(LSX,NO)
+C   UPDATE LAST VALUES
+      LMSY=MSY
+      LLSY=LSY
+      LMSX=MSX
+      LLSX=LSX
+      END
+      SUBROUTINE CLEARV
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+      INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      INTEGER ESC,LB,TWO,J,FLSH,NO
+      SAVE ESC,LB,TWO,J,FLSH,NO
+      DATA ESC/27/,LB/91/,TWO/50/,J/74/,FLSH/1/,NO/0/
+******  THIS IS A VT100 COMMAND ESC[2J
+      IF(MODE.NE.VT100)CALL VT100M
+      IF(IGORF.NE.0) RETURN
+C DONT CLEAR FOR VERSATERM OR XTERM AS IT ISNT REQUIRED
+      IF((ITERM.EQ.1).OR.(ITERM.EQ.2)) RETURN
+      CALL BUFFGR(ESC,NO)
+      CALL BUFFGR(LB,NO)
+      CALL BUFFGR(TWO,NO)
+      CALL BUFFGR(J,FLSH)
+      END
+      SUBROUTINE WRITXY(IX,IY,TEXT,NCHAR)
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      CHARACTER TEXT*(*)
+C   ROUTINE TO OUTPUT TEXT AT POSITION IX,IY
+      CALL MOVEXY(IX,IY)
+      CALL ALPHAM
+C   FLUSH BUFFER
+      CALL BUFFGR(IVAL,2)
+      CALL WRITEB(TEXT,NCHAR,IDEVGR)
+      END
+      SUBROUTINE INITGR(KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH)
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+      CHARACTER HELPF*(*)
+      PARAMETER (MAXPRM = 12)
+      CHARACTER PROMPT(4)*(MAXPRM)
+      PARAMETER (DEVX   = 1023.0,
+     +           DEVY   = 779.0,
+     +           DBUNIT = 10000)
+      DBTDUX = DEVX/DBUNIT
+      DBTDUY = DEVY/DBUNIT
+C  SET TO VT640
+      ITERM = 0
+      IN = 3
+      PROMPT(1) = 'VT640'
+      PROMPT(2) = 'VersatermPro'
+      PROMPT(3) = 'Xterm'
+      PROMPT(4) = 'MS-Kermit'
+      CALL RADION('Select graphics mode',PROMPT,4,IN,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IN = IN - 1
+      ITERM = MAX(0,IN)
+      END
+      SUBROUTINE FLUSHG
+C   AUTHOR: RODGER STADEN 
+       COMMON /GRAPHS/BUFFER,NBUFF,MODE,VECTOR,ALPHA,VT100,IDEVGR,
+     +                LMSY,LLSY,LMSX,LLSX,MAXBUF,IGORF,ITERM,
+     +                DBTDUX,DBTDUY
+       SAVE /GRAPHS/
+       INTEGER BUFFER(80),VECTOR,ALPHA,VT100
+C   ROUTINE TO FLUSH GRAPHICS BUFFER
+C   FLUSH BUFFER
+      CALL BUFFGR(IVAL,2)
+      END
+      SUBROUTINE NDC(ISXMAX,ISYMAX,XNDC,YNDC)
+C  AUTHOR RODGER STADEN
+C  CONVERTS FROM DRAWING BOARD SCALES ISXMAX,ISYMAX TO NORMALIZED DEVICE
+C  COORDINATES DEFINED BY A PARAMETER STATEMENT
+      PARAMETER (XMAXDV=1023.,YMAXDV=779.)
+      XNDC=XMAXDV/REAL(ISXMAX)
+      YNDC=YMAXDV/REAL(ISYMAX)
+      END
+C  SUN/UNIX PL4010 MACHINE SPECIFIC
+      SUBROUTINE XHM(CHARS,TERM,IDEVGR)
+      INTEGER CHARS(5)
+      CHARACTER CCHARS*5,TERM,CHAR
+      INTRINSIC ICHAR,CHAR
+10    CONTINUE
+      READ(IDEVGR,1000,ERR=10,END=10)CCHARS
+1000  FORMAT(A)
+C     IF THE USER HITS CR WITHOUT A PRECEDING CHARACTER WE GET
+C     A BLANK RECORD --- WE DETECT THIS BY LOOKING FOR SPACE
+C     CHARACTERS, THOUGH THIS COULD BE A VALID POSITION! THIS
+C     ALSO DEPENDS ON THE STRIPPING THAT GOES ON. NOTE:
+C       (A) WE GET SOME GARBAGE ON THE SCREEN (TOUGH LUCK)
+C       (B) WE HAVE TO STICK THE TERMINATOR CHAR (CR) IN BY HAND
+      IF(CCHARS.EQ.'     ') THEN
+        READ(IDEVGR,1000,ERR=10,END=10)CCHARS(2:5)
+        CCHARS(1:1)=CHAR(13)
+      END IF
+C warning following only works for ascii !!!!!!!!!
+C      WRITE(*,*)'CHARS 1',CCHARS(1:1)
+      DO 42 I=1,5
+        CHARS(I)= ICHAR(CCHARS(I:I))
+C      WRITE(*,*)'CHARS1',I,CCHARS(1:1)
+42    CONTINUE
+C     XTERM CAN GET TIED UP IN ALPHA MODE UNLESS SOMETHING IS PRINTED
+      CALL BUFFGR(10,0)
+      TERM=CCHARS(1:1)
+      END
+      SUBROUTINE WRITEI(BUFFER,IBUFF,IDEVGR)
+C   AUTHOR: RODGER STADEN 
+       INTEGER BUFFER(IBUFF)
+       BYTE BA(80)
+       CHARACTER STRING*80
+       EQUIVALENCE (BA,STRING)
+       DO 10 I=1,IBUFF
+         BA(I) = BUFFER(I)
+10     CONTINUE
+       CALL WRITEB(STRING,IBUFF,IDEVGR)
+       RETURN
+       END
+      SUBROUTINE DELAY(TIME)
+C   AUTHOR: RODGER STADEN 
+      INTEGER TIME
+      DO 1 I=1,TIME
+1     CONTINUE
+      RETURN
+      END
+      SUBROUTINE FLUSHS(IDEVGR)
+      CALL FLUSH(IDEVGR)
+      END
+      SUBROUTINE DOTXY(IX,IY)
+C   AUTHOR: RODGER STADEN 
+      CALL MOVEXY(IX,IY)
+      CALL DRAWXY(IX,IY)
+      END
diff --git a/src/staden/plX.c b/src/staden/plX.c
new file mode 100644
index 0000000..ddf573d
--- /dev/null
+++ b/src/staden/plX.c
@@ -0,0 +1,513 @@
+/*
+    Title: 	 plX
+
+    File: 	 plX.c
+    Purpose:	 A pseudo-device for graphics under X
+    Last update: Tue May 22 1990
+*/
+
+
+/*
+    This module exists to be syntactically compatible with the
+    pl4010 pseudo-device for Tektronics terminals. However, it
+    only implements a single graphics plane.
+
+    This ``device'' operates in a space XMINDEV - XMAXDEV and
+    YMINDEV - YMAXDEV. Values in an external scale can be converted
+    to an appropriate scale using ndc_x().
+
+    There is a notional pen which has a ``current position'' in
+    the graphics space.
+
+    This is implemented using a Graph widget as a window in which
+    to plot. This gives us back suitable resize and expose callbacks.
+    The state of the plot is stored, and recalled when replotting
+    is necessary, from a plotLog.
+
+    Plotting is actually carried out in the window coordinate system
+    and this can change dynamically: its current size is given by
+    (currWidth,currHeight) which are static variables updated when
+    necessary. There are routines to convert between the two coordinate
+    systems.
+
+    For the XHAIR call, both key and button events are collected, but
+    they are all turned into key events.
+*/
+
+
+
+
+#include <stdio.h>
+
+/* ---- Includes ---- */
+
+#include "fort.h"	   /* IMPORT: int_f definition */
+#include "plX.h"
+#include "Graph.h"         /* IMPORT: graphWidgetClass */
+#include "main.h"          /* IMPORT: dispatchEventsUntil */
+#include "plotLog.h"       /* IMPORT: all */
+#include "textOutput.h"    /* IMPORT: UpdateTextOutput */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/cursorfont.h> /* IMPORT: XC_crosshair */
+
+
+
+/* ---- Constants ---- */
+
+#define DEVXMIN 0
+#define DEVXMAX 10000
+#define DEVWIDTH (DEVXMAX-DEVXMIN)
+#define DEVYMIN 0
+#define DEVYMAX 10000
+#define DEVHEIGHT (DEVYMAX-DEVYMIN)
+
+
+
+
+/* ---- Types ---- */
+
+typedef struct _AppResources {
+    Pixel fg;
+} AppResources;
+
+
+/* ---- Static variables ---- */
+
+/* Unchanging state */
+static Display *display;
+static Widget  goWid;
+static GC      goGC;
+
+/* Current window size */
+static Dimension currWidth, currHeight;
+
+/* Current pen position */
+static int penX = 0;
+static int penY = 0;
+
+/* Graphics input cursor */
+static Cursor ginCursor;
+
+/* Keypress information for GIN mode */
+Boolean keyPressed = False;
+char    keyVal;
+int     keyX;
+int     keyY;
+
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+static int scaleX(int x)
+/* Convert (fixed) device coordinate to (variable) window coordinate */
+{   return((x*currWidth)/DEVWIDTH);
+}
+
+
+static int scaleY(int y)
+/* Convert (fixed) device coordinate to (variable) window coordinate */
+{   return(currHeight - ((y*currHeight)/DEVHEIGHT));
+}
+
+
+static int invScaleX(int x)
+/* Convert (variable) window coordinate to (fixed) device coordinate */
+{   return((x*DEVWIDTH)/currWidth);
+}
+
+
+static int invScaleY(int y)
+/* Convert (variable) window coordinate to (fixed) device coordinate */
+{   return(DEVHEIGHT - ((y*DEVHEIGHT)/currHeight));
+}
+
+
+static Boolean KeyPressed(void)
+{   return(keyPressed);
+}
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void destroyCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   (void) DestroyPL();
+    XFreeCursor(display, ginCursor);
+}
+
+
+static void exposeCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   PLItem i;
+    PLResult res;
+    char textBuf[256];
+
+
+    /* Replot, using logged commands */
+    if (ResetReadPL() != OK)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+    i.u.t.textp = textBuf;
+    while ((res = ReadPL(&i)) == OK)
+    {   switch (i.op)
+	{   case DrawLineOp:
+                XDrawLine(display, XtWindow(goWid), goGC,
+			  scaleX(i.x1),      scaleY(i.y1),
+			  scaleX(i.u.p2.x2), scaleY(i.u.p2.y2));
+		break;
+
+	    case DrawPointOp:
+		XDrawPoint(display, XtWindow(goWid), goGC,
+			   scaleX(i.x1), scaleY(i.y1));
+		break;
+
+	    case DrawStringOp:
+		XDrawString(display, XtWindow(goWid), goGC,
+			    scaleX(i.x1), scaleY(i.y1),
+			    i.u.t.textp, (int)i.u.t.textl);
+		break;
+        }/*endswitch*/;
+	i.u.t.textp = textBuf;
+    }/*endwhile*/;
+
+    if (res != EndOfPL)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+}
+
+
+static void inputCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   XEvent *eventP = (XEvent *) call_data;
+    char buffer[1];
+    KeySym key;
+    int charCount;
+    XComposeStatus compose;
+
+    keyPressed = True;
+
+    /* Get the key value and the pointer location */
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 1,
+				      &key, &compose);
+            keyVal = buffer[0];
+            keyX = eventP->xkey.x;
+            keyY = eventP->xkey.y;
+            break;
+        case ButtonPress:
+            /* We turn a button event into a key event sending an 'S' */
+            keyVal = 'S';
+            keyX = eventP->xbutton.x;
+            keyY = eventP->xbutton.y;
+            break;
+    }
+}
+
+static Arg arglist_1[] =     {
+    {XtNwidth,  (XtArgVal) &currWidth},
+    {XtNheight, (XtArgVal) &currHeight},
+};
+
+
+static void resizeCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{
+    XtGetValues(goWid, arglist_1, XtNumber(arglist_1));
+    XClearWindow(display, XtWindow(goWid));
+}
+
+
+
+
+/* ---- Exported routines ---- */
+
+
+/* ---- Initialisation routines ---- */
+
+
+static Arg getSizeArglist_2[] =
+{   {XtNwidth,  (XtArgVal) &currWidth},
+    {XtNheight, (XtArgVal) &currHeight},
+    };
+
+static XtResource resources_2[] =
+{   {XtNforeground, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+         XtOffset(AppResources *, fg), XtRString, "XtdefaultForeground"},
+    };
+
+
+Widget CreateGraphicsOutput(Widget parentWid)
+/*
+    Create and return a Graph widget as a child of `parentWid'.
+    Must be called before opengr_ and initgr_.
+
+    This routine initializes the statics `display', `goGC' and `goWid'.
+*/
+{   AppResources app_resources;
+
+    static Arg createArglist[2];
+
+    /* Find the application foreground colour to use.
+    */
+    XtGetApplicationResources(parentWid, (XtPointer) &app_resources,
+			      resources_2, XtNumber(resources_2), NULL, 0);
+
+
+    /* Create the graph widget and add its callbacks.
+    */
+    XtSetArg(createArglist[0], XtNgraphColour1, app_resources.fg);
+    goWid = XtCreateManagedWidget("graph", graphWidgetClass, parentWid,
+				  createArglist, 1);
+    XtAddCallback(goWid, XtNcallback,        inputCallback,   NULL);
+    XtAddCallback(goWid, XtNdestroyCallback, destroyCallback, NULL);
+    XtAddCallback(goWid, XtNexposeCallback,  exposeCallback,  NULL);
+    XtAddCallback(goWid, XtNresizeCallback,  resizeCallback,  NULL);
+
+
+    /* Get some permanent state.
+    */
+    goGC    = GraphGC1(goWid);
+    display = XtDisplay(goWid);
+    ginCursor = XCreateFontCursor(display, XC_crosshair);
+    XtGetValues(goWid, getSizeArglist_2, XtNumber(getSizeArglist_2));
+
+
+    if (CreatePL() != OK)
+    {   fprintf(stderr, "\n*** Can't create plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+
+
+    return(goWid);
+}
+
+
+
+
+void opengr_x(int_f *IDEVG_p)
+/*
+    Open graphics output. This routine must be called before
+    initgr_x. The argument is ignored.
+*/
+{
+}
+
+
+void initgr_x(int_f *KBIN_p, int_f *KBOUT_p,
+	      int_f *IHELPS_p, int_f *IHELPE_p, char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_fl HELPF_l)
+/*
+    This routine must be called before any of the subsequent
+    routines are used. All arguments are ignored.
+*/
+{
+}
+
+
+void opengf_x(int_f *IDEVG_p)
+/*
+    This routine does nothing.
+*/
+{
+}
+
+
+
+
+/* ---- Mode routines ---- */
+
+
+void alpham_x(void)
+/*
+    This function has no effect.
+*/
+{
+}
+
+
+void vectom_x(void)
+/*
+    This function has no effect.
+*/
+{
+}
+
+
+void vt100m_x(void)
+/*
+    This function has no effect.
+*/
+{
+}
+
+
+/* Clearing routines
+*/
+
+
+void clearv_x(void)
+/*
+    This function does nothing.
+*/
+{
+}
+
+
+void blankg_x(void)
+/*
+    This function clears the graphics screen.
+*/
+{   XClearWindow(display, XtWindow(goWid));
+    if (ResetWritePL() != OK)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+}
+
+
+
+
+/* ---- Moving and plotting routines ---- */
+
+
+void movexy_x(int_f *IX_p, int_f *IY_p)
+/*
+   Move the graphics pen to the indicated position.
+*/
+{   penX = *IX_p;
+    penY = *IY_p;
+}
+
+
+void drawxy_x(int_f *IX_p, int_f *IY_p)
+/*
+   Draw a line from the current position of the graphics
+   pen to the indicated position. Leave the pen there.
+*/
+{   PLItem i;
+
+    XDrawLine(display, XtWindow(goWid), goGC,
+              scaleX(penX), scaleY(penY),
+              scaleX(*IX_p), scaleY(*IY_p));
+
+    i.op = DrawLineOp;
+    i.x1 = penX;
+    i.y1 = penY;
+    i.u.p2.x2 = *IX_p;
+    i.u.p2.y2 = *IY_p;
+    if (WritePL(i) != OK)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+
+    penX = *IX_p;
+    penY = *IY_p;
+}
+
+
+void dotxy_x(int_f *IX_p, int_f *IY_p)
+/*
+   Plot a single point at (IX,IY). Leave the pen there.
+*/
+{   PLItem i;
+
+    XDrawPoint(display, XtWindow(goWid), goGC,
+               scaleX(*IX_p), scaleY(*IY_p));
+
+    i.op = DrawPointOp;
+    i.x1 = *IX_p;
+    i.y1 = *IY_p;
+    if (WritePL(i) != OK)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+
+    penX = *IX_p;
+    penY = *IY_p;
+}
+
+
+void writxy_x(int_f *IX_p, int_f *IY_p,
+	      char *TEXT_p, int_f *NCHAR_p,
+	      int_fl TEXT_l)
+/*
+    NCHAR characters from TEXT are written at position (IX,IY).
+    The pen position is left undefined.
+*/
+{   PLItem i;
+
+    XDrawString(display, XtWindow(goWid), goGC,
+                scaleX(*IX_p), scaleY(*IY_p), TEXT_p, *NCHAR_p);
+
+    i.op        = DrawStringOp;
+    i.x1        = *IX_p;
+    i.y1        = *IY_p;
+    i.u.t.textp = TEXT_p;
+    i.u.t.textl = *NCHAR_p;
+    if (WritePL(i) != OK)
+    {   fprintf(stderr, "\n*** Trouble with plot temporary file. Exiting.\n");
+        finalx(1);
+    }
+}
+
+
+void flushg_x(void)
+/*
+    Try very hard to ensure all graphics are visible.
+*/
+{   XFlush(display);
+}
+
+
+
+
+/* ---- Input routines ---- */
+
+
+void xhair_x(int_f *IX_p, int_f *IY_p, char *TERM_p, int_fl TERM_l)
+/*
+   The cursor in the graphics window is converted to a crosshair
+   and the function waits for a key or button to be pressed. When
+   this occurs the function returns with the cursor location (in
+   device space) and the key hit. Pressing a mouse button is
+   equivalent to hitting the 'S' key. The location returned is
+   not defined if the cursor was out of the graphics window.
+
+   The text output widget is refreshed with all output since this
+   was last called.
+   X events are cycled until a key event has occurred. The location
+   of this is converted to device space and returned along with the
+   terminating character.
+*/
+{   Arg oldCursorArg[1];
+    Arg newCursorArg[1];
+    Cursor oldCursor;
+
+    
+    /* Get the old cursor */
+    XtSetArg(oldCursorArg[0], XtNcursor, &oldCursor);
+    XtGetValues(goWid, oldCursorArg, 1);
+
+    /* Set the new cursor */
+    XtSetArg(newCursorArg[0], XtNcursor, ginCursor);
+    XtSetValues(goWid, newCursorArg, 1); 
+
+    UpdateTextOutput();
+    keyPressed = False;
+    dispatchEventsUntil(KeyPressed);
+    *IX_p  = (int_f) invScaleX(keyX);
+    *IY_p  = (int_f) invScaleY(keyY);
+    *TERM_p = keyVal;
+
+    /* Reset the old cursor */
+    XtSetArg(oldCursorArg[0], XtNcursor, oldCursor);
+    XtSetValues(goWid, oldCursorArg, 1);
+}
diff --git a/src/staden/plX.h b/src/staden/plX.h
new file mode 100644
index 0000000..b37cf43
--- /dev/null
+++ b/src/staden/plX.h
@@ -0,0 +1,196 @@
+/*
+    Title:       plX
+
+    File: 	 plX.h
+    Purpose:	 A pseudo-device for graphics under X
+    Last update: Tue May 15 1990
+*/
+
+
+/*
+    This module exists to be syntactically compatible with the
+    pl4010 pseudo-device for Tektronics terminals. However, it
+    only implements a single graphics plane.
+
+    This ``device'' operates in a space XMINDEV - XMAXDEV and
+    YMINDEV - YMAXDEV. Values in an external scale can be converted
+    to an appropriate scale using ndc_x().
+
+    There is a notional pen which has a ``current position'' in
+    the graphics space.
+*/
+
+
+
+
+/*
+    C (internal) - SUN FORTRAN (external) name conversions
+    This table is for the compiler specifically mentioned above, so
+    as well as being hostage to its horrors we can make use of its
+    advantages, such as long identifiers.
+*/
+
+#define opengr_x opengr_
+#define initgr_x initgr_
+#define opengf_x opengf_
+#define alpham_x alpham_
+#define vectom_x vectom_
+#define vt100m_x vt100m_
+#define clearv_x clearv_
+#define blankg_x blankg_
+#define movexy_x movexy_
+#define drawxy_x drawxy_
+#define dotxy_x dotxy_
+#define writxy_x writxy_
+#define flushg_x flushg_
+#define xhair_x  xhair_
+#define ndc_x    ndc_
+
+
+
+
+/* ---- Includes ---- */
+
+#include "fort.h"
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Initialisation routines ---- */
+
+
+extern Widget CreateGraphicsOutput(Widget parentWid);
+/*
+    Create and return a Graph widget as a child of `parentWid'.
+    Must be called before opengr_ and initgr_.
+*/
+
+
+extern void opengr_x(int_f *IDEV_p);
+/*
+    Open graphics output. This routine must be called before
+    initgr_x. The argument is ignored.
+*/
+
+
+extern void initgr_x(int_f *KBIN_p, int_f *KBOUT_p,
+		     int_f *IHELPS_p, int_f *IHELPE_p, char *HELPF_p,
+		     int_f *IDEVH_p,
+		     int_fl HELPF_l);
+/*
+    This routine must be called before any of the subsequent
+    routines are used. All arguments are ignored.
+*/
+
+
+extern void opengf_x(int_f *IDEVG_p);
+/*
+    This routine does nothing.
+*/
+
+
+
+
+/* ---- Mode routines ---- */
+
+
+extern void alpham_x(void);
+/*
+    This function has no effect.
+*/
+
+
+extern void vectom_x(void);
+/*
+    This function has no effect.
+*/
+
+
+extern void vt100m_x(void);
+/*
+    This function has no effect.
+*/
+
+
+/* Clearing routines.
+*/
+
+extern void clearv_x(void);
+/*
+    This function has no effect.
+*/
+
+
+extern void clearg_x(void);
+/*
+    This function clears the graphics screen.
+*/
+
+
+
+
+/* ---- Moving and plotting routines ---- */
+
+
+extern void movegr_x(int_f *IX_p, int_f *IY_p);
+/*
+   Move the graphics pen to the indicated position.
+*/
+
+
+extern void drawgr_x(int_f *IX_p, int_f *IY_p);
+/*
+   Draw a line from the current position of the graphics
+   pen to the indicated position. Leave the pen there.
+*/
+
+
+extern void pointg_x(int_f *IX_p, int_f *IY_p);
+/*
+   Plot a single point at (IX,IY). Leave the pen there.
+*/
+
+
+extern void writet_x(int_f *IX_p, int_f *IY_p,
+		     char *TEXT_p, int_f *NCHAR_p,
+		     int_fl  TEXT_l);
+/*
+    NCHAR characters from TEXT are written at position (IX,IY).
+    The pen position is left undefined.
+*/
+
+
+extern void flushg_x(void);
+/* 
+    Try very hard to ensure all graphics are visible.
+*/
+
+
+
+
+/* ---- Input routines ---- */
+
+
+extern void xhair_x(int_f *IX_p, int_f *IY_p, char *TERM_p, int_fl TERM_l);
+/*
+   The cursor in the graphics window is converted to a crosshair
+   and the function waits for a key or button to be pressed. When
+   this occurs the function returns with the cursor location (in
+   device space) and the key hit. Pressing a mouse button is
+   equivalent to hitting the 'S' key. The location returned is
+   not defined if the cursor was out of the graphics window.
+*/
+
+
+
+
+/* ---- Enquiry routines ---- */
+
+
+extern void ndc_x(int_f  *ISXMAX_p, int_f  *ISYMAX_p, /* in:  external scale */
+		  float *XNDC_p,   float *YNDC_p);  /* out: scaling factor */
+/*
+   Return a scaling factor between the external and internal scales.
+*/
+
diff --git a/src/staden/plot92.f b/src/staden/plot92.f
new file mode 100644
index 0000000..1b9147d
--- /dev/null
+++ b/src/staden/plot92.f
@@ -0,0 +1,699 @@
+C   PLOT89
+C   AUTHOR: RODGER STADEN
+C
+C   13-8-92 new organisation: let pl4010 do device scaling and hardwire
+C           it for 1023x768
+C
+C
+C   25-1-90  XHAIRN changed order of arguments to diabox and fmt4ln
+C            so that horizontal and vertical sequences were consistent
+C            with rest of sip
+C    5-4-90  Removed ndc and put it in pl4010. added redir
+C    4-5-90  Added the new routines for sap
+C    14-11-90 Replaced radio by radion
+C    2-3-92 set filnam = ' ' for calls to openf1
+C   ROUTINES IN THIS LIBRARY
+C
+C   LINE
+C   POINT
+C   FRAME
+C   TEXT
+C   RULER
+C   SCALES
+C   BLIP
+C   XHAIRS
+C   XHAIRN
+C   INQGT
+C   REDIR
+C    line
+      SUBROUTINE LINE(XF,XT,YF,YT,XMAX,XMIN,YMAX,YMIN,
+     +IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+C   AUTHOR: RODGER STADEN
+C   DRAWS A LINE FROM XF,YF TO XT,YT. COORDINATE SYSTEMS ARE
+C   1) WORLD SYSTEM: XMIN TO XMAX, YMIN TO YMAX
+C   2) DRAWING BOARD 0 TO ISXMAX, 0 TO ISYMAX
+C   3) DEVICE COORDINATES 0 TO MAXDVX, 0 TO MAXDVY
+C   THE POSITION OF THE PLOT IS DEFINED IN THE DRAWING BOARD SYSTEM BY A
+C   BOX STARTING AT IX0,IY0 WITH SIDES IXLEN, IYLEN (IN DRAWING BOARD UNITS)
+C  RANGE IN WORLD COORDS
+      RANGWX=XMAX-XMIN
+      IF(RANGWX.EQ.0.0)RETURN
+      RANGWY=YMAX-YMIN
+      IF(RANGWY.EQ.0.0)RETURN
+C  CLIP
+      CALL CLIP(XF,XMAX,XMIN,YF,YMAX,YMIN,XFF,YFF)
+      CALL CLIP(XT,XMAX,XMIN,YT,YMAX,YMIN,XTT,YTT)
+C  PROPORTIONAL X POSITIONS
+      XWF=(XFF-XMIN)/RANGWX
+      XWT=(XTT-XMIN)/RANGWX
+C  PROPORTIONAL Y POSITIONS
+      YWF=(YFF-YMIN)/RANGWY
+      YWT=(YTT-YMIN)/RANGWY
+C  POSITIONS IN DRAWING BOARD UNITS
+      XDF=IX0+XWF*IXLEN
+      XDT=IX0+XWT*IXLEN
+      YDF=IY0+YWF*IYLEN
+      YDT=IY0+YWT*IYLEN
+C  POSITIONS IN DEVICE UNITS
+      IXF=NINT(XDF)
+      IXT=NINT(XDT)
+      IYF=NINT(YDF)
+      IYT=NINT(YDT)
+C   DRAW IT
+      CALL MOVEGR(IXF,IYF)
+      CALL DRAWGR(IXT,IYT)
+      END
+C    POINT
+      SUBROUTINE POINT(XF,YF,XMAX,XMIN,YMAX,YMIN,
+     +IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+C   AUTHOR: RODGER STADEN
+C   DRAWS A POINT AT XF,YF. COORDINATE SYSTEMS ARE
+C   1) WORLD SYSTEM: XMIN TO XMAX, YMIN TO YMAX
+C   2) DRAWING BOARD 0 TO ISXMAX, 0 TO ISYMAX
+C   3) DEVICE COORDINATES 0 TO MAXDVX, 0 TO MAXDVY
+C   THE POSITION OF THE PLOT IS DEFINED IN THE DRAWING BOARD SYSTEM BY A
+C   BOX STARTING AT IX0,IY0 WITH SIDES IXLEN, IYLEN (IN DRAWING BOARD UNITS)
+C  RANGE IN WORLD COORDS
+      RANGWX=XMAX-XMIN
+      IF(RANGWX.EQ.0.0)RETURN
+      RANGWY=YMAX-YMIN
+      IF(RANGWY.EQ.0.0)RETURN
+C  CLIP
+      CALL CLIP(XF,XMAX,XMIN,YF,YMAX,YMIN,XFF,YFF)
+C  PROPORTIONAL X POSITIONS
+      XWF=(XFF-XMIN)/RANGWX
+C  PROPORTIONAL Y POSITIONS
+      YWF=(YFF-YMIN)/RANGWY
+C  POSITIONS IN DRAWING BOARD UNITS
+      XDF=IX0+XWF*IXLEN
+      YDF=IY0+YWF*IYLEN
+      IXF=NINT(XDF)
+      IYF=NINT(YDF)
+C   PLOT IT
+      CALL POINTG(IXF,IYF)
+      END
+      SUBROUTINE FRAME(IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+C   AUTHOR: RODGER STADEN
+C   DRAWS A BOX FROM IX0,IY0 OF SIDES IXLEN,IYLEN COORDINATE SYSTEMS ARE
+C   2) DRAWING BOARD 0 TO ISXMAX, 0 TO ISYMAX
+C   3) DEVICE COORDINATES 0 TO MAXDVX, 0 TO MAXDVY 
+      IXF=IX0
+      IXT=IX0+IXLEN
+      IYF=IY0
+      IYT=IY0+IYLEN
+      CALL MOVEGR(IXF,IYF)
+      CALL DRAWGR(IXF,IYT)
+      CALL DRAWGR(IXT,IYT)
+      CALL DRAWGR(IXT,IYF)
+      CALL DRAWGR(IXF,IYF)
+      END
+C    TEXT
+      SUBROUTINE TEXT(XF,YF,CHARS,NCHARS,ISIZE,XMAX,XMIN,YMAX,YMIN,
+     +IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX)
+C   AUTHOR: RODGER STADEN
+C      CHARACTER CHARS(NCHARS)
+      CHARACTER CHARS*(*)
+C   WRITE TEXT AT XF,YF. COORDINATE SYSTEMS ARE
+C   1) WORLD SYSTEM: XMIN TO XMAX, YMIN TO YMAX
+C   2) DRAWING BOARD 0 TO ISXMAX, 0 TO ISYMAX
+C   3) DEVICE COORDINATES 0 TO MAXDVX, 0 TO MAXDVY
+C   THE POSITION OF THE PLOT IS DEFINED IN THE DRAWING BOARD SYSTEM BY A
+C   BOX STARTING AT IX0,IY0 WITH SIDES IXLEN, IYLEN (IN DRAWING BOARD UNITS)
+C  RANGE IN WORLD COORDS
+      RANGWX=XMAX-XMIN
+      IF(RANGWX.EQ.0.0)RETURN
+      RANGWY=YMAX-YMIN
+      IF(RANGWY.EQ.0.0)RETURN
+C  CLIP
+C   IF OFF SCREEN DONT WRITE
+      IF(XF.GT.XMAX)RETURN
+      IF(XF.LT.XMIN)RETURN
+      IF(YF.GT.YMAX)RETURN
+      IF(YF.LT.YMIN)RETURN
+C  PROPORTIONAL X POSITIONS
+      XWF=(XF-XMIN)/RANGWX
+C  PROPORTIONAL Y POSITIONS
+      YWF=(YF-YMIN)/RANGWY
+C  POSITIONS IN DRAWING BOARD UNITS
+      XDF=IX0+XWF*IXLEN
+      YDF=IY0+YWF*IYLEN
+      IXF=NINT(XDF)
+      IYF=NINT(YDF)
+C   PLOT IT
+      CALL WRITET(IXF,IYF,CHARS,NCHARS)
+      END
+      SUBROUTINE SCALES(XMAX,XMIN,YMAX,YMIN,
+     +IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX,
+     +BIT,IBLIPH,FIRST,ISCALE)
+C   AUTHOR: RODGER STADEN
+C   ROUTINE TO PUT A SCALE ALONG AN AXIS EITHER X OR Y
+C   DEPENDING ON ISCALE (1=X,2=Y AXES)
+C   BLIPS OF HEIGHT IBLIPH ARE PLACED ALONG THE AXIS EVERY
+C   BIT OF LENGTH STARTING FROM FIRST. FIRST CAN BE USED
+C   TO MAKE SURE THAT A BLIP OCCURS AT A PARTICULAR POSITION
+C   FOR EXAMPLE AT A ZERO POINT, OTHERWISE FIRST IS SET TO
+C   XMIN OR YMIN
+C   1) WORLD SYSTEM: XMIN TO XMAX, YMIN TO YMAX
+C   2) DRAWING BOARD 0 TO ISXMAX, 0 TO ISYMAX
+C   3) DEVICE COORDINATES 0 TO MAXDVX, 0 TO MAXDVY
+C   THE POSITION OF THE PLOT IS DEFINED IN THE DRAWING BOARD SYSTEM BY A
+C   BOX STARTING AT IX0,IY0 WITH SIDES IXLEN, IYLEN (IN DRAWING BOARD UNITS)
+      IF(ISCALE.EQ.1)THEN
+C  NUMBER OF POINTS
+        NP=1+(XMAX-FIRST)/BIT
+C  RANGE IN WORLD COORDS
+        RANGWX=XMAX-XMIN
+        IF(RANGWX.EQ.0.0)RETURN
+C  GET END POINTS IN Y
+        IYF=IY0
+        IYT=IY0+IBLIPH
+C  CALC AND PLOT POINTS
+        DO 10 I=1,NP
+C  PROPORTIONAL X POSITIONS
+          XFF=FIRST+(I-1)*BIT
+          XWF=(XFF-XMIN)/RANGWX
+C  POSITION IN DRAWING BOARD UNITS
+          XDF=IX0+XWF*IXLEN
+          IXF=NINT(XDF)
+          CALL MOVEGR(IXF,IYF)
+          CALL DRAWGR(IXF,IYT)
+10      CONTINUE
+        END IF
+      IF(ISCALE.EQ.2)THEN
+C  NUMBER OF POINTS
+        NP=1+(YMAX-FIRST)/BIT
+C  RANGE IN WORLD COORDS
+        RANGWY=YMAX-YMIN
+        IF(RANGWY.EQ.0.0)RETURN
+C  GET END POINTS IN X
+        IXF=IX0
+        IXT=IX0+IBLIPH
+C  CALC AND PLOT POINTS
+        DO 20 I=1,NP
+C  PROPORTIONAL Y POSITIONS
+          YFF=FIRST+(I-1)*BIT
+          YWF=(YFF-YMIN)/RANGWY
+C  POSITION IN DRAWING BOARD UNITS
+          YDF=IY0+YWF*IYLEN
+          IYF=NINT(YDF)
+          CALL MOVEGR(IXF,IYF)
+          CALL DRAWGR(IXT,IYF)
+20      CONTINUE
+        END IF
+      END
+      INTEGER FUNCTION IPSEP(XMAX,XMIN,NMAX)
+C   FIND SENSIBLE DEFAULT BLIP SEPARATION (SAY NMAX BLIPS PER PLOT)
+      D=XMAX-XMIN
+      IPSEP=D
+      IF(NMAX.LT.1)RETURN
+      NSEP=0
+1     CONTINUE
+      NSEP=NSEP+1
+      D=D/NMAX
+      IF(D.GT.1.0)GO TO 1
+      IPSEP=NMAX**(NSEP-1)
+      END
+      SUBROUTINE XHAIRS(XMAX,XMIN,YMAX,YMIN,IX0,IXLEN,IY0,
+     +IYLEN,ISXMAX,ISYMAX,IHX,IHY,NCHAR,KBOUT)
+C   AUTHOR: RODGER STADEN
+C      WRITE(KBOUT,*)' CROSS HAIRS'
+      CHARACTER TERM,COMMA
+      PARAMETER (COMMA=',')
+C  RANGE IN WORLD COORDS
+      RANGWX=XMAX-XMIN
+      IF(RANGWX.EQ.0.0)RETURN
+      RANGWY=YMAX-YMIN
+      IF(RANGWY.EQ.0.0)RETURN
+      XFF=IHX
+      YFF=IHY
+C  PROPORTIONAL X POSITIONS
+      XWF=(XFF-XMIN)/RANGWX
+C  PROPORTIONAL Y POSITIONS
+      YWF=(YFF-YMIN)/RANGWY
+C  POSITIONS IN DRAWING BOARD UNITS
+      XDF=IX0+XWF*IXLEN
+      YDF=IY0+YWF*IYLEN
+      IXF=NINT(XDF)
+      IYF=NINT(YDF)
+10    CONTINUE
+      CALL XHAIR(IXF,IYF,TERM)
+C  RETURNS POSITION IN DB UNITS. CONVERT TO DRAWING BOARD UNITS
+C  RELATIVE POSITIONS
+      XFF=IXF-IX0
+      YFF=IYF-IY0
+      XFF=XFF/IXLEN
+      YFF=YFF/IYLEN
+C  CONVERT TO WORLD UNITS
+      XFW=XMIN+XFF*RANGWX
+      YFW=YMIN+YFF*RANGWY
+      IHX=NINT(XFW)
+      IHY=NINT(YFW)
+C   BACK TO VT100 MODE
+      CALL VT100M
+      WRITE(KBOUT,1014)IHX,IHY
+1014  FORMAT(' Cross hair at',2I7)
+      IF(TERM.EQ.COMMA)GO TO 10
+      END
+      SUBROUTINE LABLER(KBIN,KBOUT,
+     +ISXMAX,ISYMAX,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER LABEL*80,HELPF*(*)
+      CHARACTER LEFT,RIGHT,TERM,TUPPER
+      INTEGER CSIZE
+      EXTERNAL NOTIRL,TUPPER
+      PARAMETER (CSIZE=80)
+      PARAMETER (LEFT='L',RIGHT='R')
+C  FIDDLE TO CHECK ON GOING OFF SCREEN. ASSUME 80 CHARS PER SCREEN WIDTH
+C  NEED TO FIND WIDTH OF SCREEN IN DEVICE UNITS
+      CSCALE=ISXMAX/CSIZE
+      CALL CLEARV
+      WRITE(KBOUT,1004)
+1004  FORMAT(
+     +' Type label then drive cross hair to left or right end',/,
+     +' of label position then hit  "L"  to  write label left',/,
+     +' justified or  "R"  to  write label right justified or',/,
+     +' the space bar to quit')
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      IF(IOK.NE.0) RETURN
+10    CONTINUE
+      CALL VT100M
+      CALL CLEARV
+      LENGTH = 0
+      CALL GTSTR('Label',' ',LABEL,LENGTH,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LENGTH.EQ.0) RETURN
+20    CONTINUE
+      CALL XHAIR(IHX,IHY,TERM)
+      TERM = TUPPER(TERM)
+      IF((TERM.EQ.RIGHT).OR.(TERM.EQ.LEFT))THEN
+        IF(TERM.EQ.RIGHT)IHX=NINT(REAL(IHX)-LENGTH*CSCALE)
+        IF((IHX.LT.1).OR.((IHX+LENGTH*CSCALE).GT.ISXMAX))GO TO 10
+        CALL WRITET(IHX,IHY,LABEL,LENGTH)
+        GO TO 20
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE CLIP(X1,XMAX,XMIN,Y1,YMAX,YMIN,X2,Y2)
+C  AUTHOR RODGER STADEN
+C  CLIPS COORDINATES IN USER UNITS
+      X2=MAX(X1,XMIN)
+      X2=MIN(X2,XMAX)
+      Y2=MAX(Y1,YMIN)
+      Y2=MIN(Y2,YMAX)
+      RETURN
+      END
+      SUBROUTINE XHAIRN(XMAX,XMIN,YMAX,YMIN,IX0,IXLEN,IY0,
+     +IYLEN,ISXMAX,ISYMAX,IHX,IHY,NCHAR,KBOUT,
+     +SEQ1,ISTAR1,IDIM1,SEQ2,ISTAR2,IDIM2,NSEQ)
+C   AUTHOR: RODGER STADEN
+C      WRITE(KBOUT,*)' CROSS HAIRS'
+      CHARACTER TERM,COMMA
+      PARAMETER (COMMA=',',ISPAN = 19)
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),MATCH(ISPAN)
+C  RANGE IN WORLD COORDS
+      RANGWX=XMAX-XMIN
+      IF(RANGWX.EQ.0.0)RETURN
+      RANGWY=YMAX-YMIN
+      IF(RANGWY.EQ.0.0)RETURN
+      XFF=IHX
+      YFF=IHY
+C  PROPORTIONAL X POSITIONS
+      XWF=(XFF-XMIN)/RANGWX
+C  PROPORTIONAL Y POSITIONS
+      YWF=(YFF-YMIN)/RANGWY
+C  POSITIONS IN DRAWING BOARD UNITS
+      XDF=IX0+XWF*IXLEN
+      YDF=IY0+YWF*IYLEN
+      IXF=NINT(XDF)
+      IYF=NINT(YDF)
+10    CONTINUE
+      CALL XHAIR(IXF,IYF,TERM)
+C  RETURNS POSITION IN Db UNITS. CONVERT TO DRAWING BOARD UNITS
+C  RELATIVE POSITIONS
+      XFF=IXF-IX0
+      YFF=IYF-IY0
+      XFF=XFF/IXLEN
+      YFF=YFF/IYLEN
+C  CONVERT TO WORLD UNITS
+      XFW=XMIN+XFF*RANGWX
+      YFW=YMIN+YFF*RANGWY
+      IHX=NINT(XFW)
+      IHY=NINT(YFW)
+C   BACK TO VT100 MODE
+      CALL CLEARV
+      IF((TERM.EQ.'S').OR.(TERM.EQ.'s'))THEN
+        K1 = MAX(1,IHX) - ISTAR1 + 1
+        K2 = MIN(IHX+ISPAN,IDIM1) - ISTAR1 + 1
+        IF(NSEQ.NE.2)THEN
+        WRITE(KBOUT,1015)K1+ISTAR1-1,(SEQ1(K),K=K1,K2)
+1015    FORMAT(' ',I7,' ',21A1)
+        ELSE IF(NSEQ.EQ.2)THEN
+          KK1 = MAX(1,IHY) - ISTAR2 + 1
+          KK2 = MIN(IHY+ISPAN,IDIM2) - ISTAR2 + 1
+          L = MIN(K2-K1,KK2-KK1) + 1
+          IF(L.GT.0) THEN
+            CALL SQMTCH(SEQ1(K1),SEQ2(KK1),MATCH,L)
+            CALL FMT4LN(SEQ2(KK1),SEQ1(K1),MATCH,L,
+     +      KK1+ISTAR2-1,K1+ISTAR1-1,KBOUT)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((TERM.EQ.'M').OR.(TERM.EQ.'m').AND.(NSEQ.EQ.2))THEN
+        K1 = MAX(1,IHX) - ISTAR1 + 1
+        K2 = MIN(IHX+ISPAN,IDIM1) - ISTAR1 + 1
+        KK1 = MAX(1,IHY) - ISTAR2 + 1
+        KK2 = MIN(IHY+ISPAN,IDIM2) - ISTAR2 + 1
+        L = MIN(K2-K1,KK2-KK1) + 1
+        IF(L.GT.0) THEN
+          CALL DIABOX(SEQ2,IDIM2,SEQ1,IDIM1,KK1,K1,L,MATCH,KBOUT)
+        END IF
+        GO TO 10
+      END IF
+      WRITE(KBOUT,1014)IHX,IHY
+1014  FORMAT(' Cross hair at',2I7)
+      IF(TERM.EQ.COMMA)GO TO 10
+      RETURN
+      END
+      SUBROUTINE DIABOX(SEQ1,IDIM1,SEQ2,IDIM2,I11,I21,
+     +LENGTH,LINE,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM1),SEQ2(IDIM2),LINE(LENGTH)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      WRITE(KBOUT,*)
+      I12 = I11 + LENGTH - 1
+      I22 = I21 + LENGTH - 1
+      WRITE(KBOUT,1000)(SEQ2(K),K=I21,I22)
+      DO 100 I=I12,I11,-1
+        CALL FILLC(LINE,LENGTH,'.')
+        L=0
+        DO 50 J=I21,I22
+          L=L+1
+          IF(CTONUM(SEQ1(I)).EQ.CTONUM(SEQ2(J))) 
+     +    LINE(L) = SEQ1(I)
+50      CONTINUE
+        WRITE(KBOUT,1001)SEQ1(I),(LINE(K),K=1,L),SEQ1(I)
+100   CONTINUE
+      WRITE(KBOUT,1000)(SEQ2(K),K=I21,I22)
+1000  FORMAT(4X,36A1)
+1001  FORMAT(3X,A1,36A1,A1)
+      END
+      SUBROUTINE RULER(J1,J2,IX0,IXLEN,IY0,IYLEN,
+     +ISXMAX,ISYMAX,KBIN,KBOUT,ISCALE,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      EXTERNAL IPSEP
+C  RULER DRAWS A SCALE ON THE X OR Y AXIS. THE CURRENT ACTIVE REGION IS 
+C  J1,J2. THE USER CAN GIVE THE COORDINATE OF THE FIRST CHARACTER POSIITON
+C  TO BE MARKED (THE DEFAULT IS J1). THE SEPARATION AND HEIGHT IS ALSO 
+C  DEFINED BY THE USER
+      IBH1=ISYMAX/100
+      IF(ISCALE.EQ.2)IBH1=ISXMAX/100
+      XMAX=J2
+      XMIN=J1
+      YMAX=J2
+      YMIN=J1
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Draw a scale')
+      MMN = J1
+      MMX = J2
+      ISTART = J1
+      CALL GETINT(MMN,MMX,ISTART,'Put first tick at',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF((IOK.NE.0).OR.(IVAL.EQ.0)) THEN
+        CALL CLEARV
+        RETURN
+      END IF
+      ISTART = IVAL
+      FIRST=J1
+C   SET DEFAULT NUMBER OF BLIPS TO 10
+      NMAX=10
+C   CHECK ON SEPARATION (NO CLOSER THAT 1 EVERY 5 PIXELS)
+      UMIN=ABS(XMIN-XMAX)
+      SMIN=IXLEN
+      ISEP=IPSEP(XMAX,XMIN,NMAX)
+      IF(ISCALE.EQ.2)THEN
+        UMIN=ABS(YMAX-YMIN)
+        SMIN=IYLEN
+        ISEP=IPSEP(YMAX,YMIN,NMAX)
+      END IF
+      IF(SMIN.EQ.0.0)THEN
+        WRITE(KBOUT,*)' ERROR IN SCREEN DEFINITION'
+        RETURN
+      END IF
+C   FIVE PIXELS IS
+      ONEPIX=UMIN/SMIN
+      IFIVEP=ONEPIX*5.0
+100   CONTINUE
+      MMN = IFIVEP
+      MMX = J2 - J1
+      INC = ISEP
+      CALL GETINT(MMN,MMX,INC,'Tick spacing',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF((IOK.NE.0).OR.(IVAL.EQ.0)) THEN
+        CALL CLEARV
+        RETURN
+      END IF
+      INC = IVAL
+      BIT=INC
+      MMN = 0
+      IF(ISCALE.EQ.1) MMX = IYLEN
+      IF(ISCALE.EQ.2) MMX = IXLEN
+      IBH = IBH1
+      CALL GETINT(MMN,MMX,IBH,'Tick height',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF((IOK.NE.0).OR.(IVAL.EQ.0)) THEN
+        CALL CLEARV
+        RETURN
+      END IF
+      IBH = IVAL
+      IF(ISTART.GT.0)FIRST=ISTART
+      CALL CLEARV
+      CALL VECTOM
+      CALL SCALES(XMAX,XMIN,YMAX,YMIN,
+     +IX0,IXLEN,IY0,IYLEN,ISXMAX,ISYMAX,
+     +BIT,IBH,FIRST,ISCALE)
+      CALL VT100M
+      GO TO 100
+      END
+      SUBROUTINE REDIR(IDEV,DEVOUT,IDEVGR,GORT,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KOPT)
+      CHARACTER HELPF*(*)
+C   AUTHOR: RODGER STADEN
+C   routine to allow the user to select disk output
+C   if GORT = 0 redirection off
+C   if GORT = 1 redirection of text
+C   if GORT = 2 redirection of graphics
+C
+C   notice the redirection is handled differently for text and graphics:
+C   text is done by changing idev to devout and opening idev
+C   graphics is done by closing idevgr and opening devout and changing
+C   the value of idevgr in the graphics common block. Note also that
+C   IGORF is used by the graphics to suppress clearv and xhair
+C
+C   Postscript output: file opened here, flag is posto: 0 off, 1 open
+C   All output controlled from c but implemented by modifying move
+C   and draw subroutines in pl4010
+C
+C   Can do one job at a time: if opening only one, if closing only one
+C   ie 2 visits to close both postcript and redirection
+C   if either is open and we come in to open the other we will first
+C   be asked if we want to close the open one
+C
+      CHARACTER FILNAM*(*)
+      INTEGER DEVOUT,GORT,POSTO
+      PARAMETER (MAXPRM = 19)
+      CHARACTER PROMPT(3)*(MAXPRM)
+      CHARACTER FNDFIL*1024, FIL2*1024
+      INTEGER OPENP
+      EXTERNAL FNDFIL, OPENP
+      SAVE POSTO
+      DATA POSTO/0/
+      IF(GORT.NE.0) THEN
+        CALL YESNO(IN,'Close redirection file',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IN.EQ.0) THEN
+          IF(GORT.EQ.1) THEN
+            CLOSE(UNIT=DEVOUT)
+            IDEV = KBOUT
+          ELSE IF(GORT.EQ.2) THEN
+            CLOSE(UNIT=DEVOUT)
+            CALL OPENGR(IDEVGR)
+          END IF
+          GORT = 0
+          RETURN
+        END IF
+      END IF
+      IF(POSTO.NE.0) THEN
+        CALL YESNO(IN,'Close postscript file',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(IN.EQ.0) THEN
+          IJUNK=OPENP(' ',0,0,0,0,0,0)
+          POSTO = 0
+          RETURN
+        END IF
+      END IF
+      IN = 1
+      PROMPT(1) = 'Redirect text'
+      PROMPT(2) = 'Redirect graphics'
+      PROMPT(3) = 'Open postscipt file'
+      CALL RADION('Select redirection',PROMPT,3,IN,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IN.EQ.1) THEN
+        IF (GORT.NE.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Output already redirected: please close file')
+          RETURN
+        END IF
+        FILNAM = ' '
+        CALL OPENF1(DEVOUT,FILNAM,1,IOK,KBIN,KBOUT,
+     +    'Text file name',IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        IDEV=DEVOUT
+        GORT = 1
+      ELSE IF(IN.EQ.2) THEN
+        IF (GORT.NE.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Output already redirected: please close file')
+          RETURN
+        END IF
+        CLOSE(UNIT = IDEVGR)
+        FILNAM = ' '
+        CALL OPENF1(DEVOUT,FILNAM,1,IOK,KBIN,KBOUT,
+     +    'Graphics file name',IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) THEN
+          CALL OPENGR(IDEVGR)
+          GORT = 0
+          RETURN
+        END IF
+        CALL OPENGF(DEVOUT)
+        GORT = 2
+      ELSE IF(IN.EQ.3) THEN
+        IF (POSTO.NE.0) THEN
+          CALL ERROM(KBOUT,
+     +    'Postscript already selected: please close file')
+          RETURN
+        END IF
+ 10     CONTINUE
+        FILNAM = ' '
+        LIN = 0
+        CALL GTSTR('Postcript file name',' ',FILNAM,LIN,
+     +  KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 10
+        END IF
+        POSTO = 0
+        IF ((INFLAG.EQ.2).OR.(LIN.EQ.0)) THEN
+          RETURN
+        END IF
+CALL CROUTINE
+        IF (KOPT.EQ.1) THEN
+           CALL YESNO(LNDSCP, 'Print in portrait mode',
+     +          IHELPS, IHELPE, HELPF, IDEVH, KBIN, KBOUT)
+           IF (LNDSCP.EQ.-1) THEN
+              RETURN
+           ENDIF
+           CALL GETINT(0, 9999, 0, 'Bottom left X co-ord of region',
+     +          IX1, KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+           IF (IOK.EQ.1) THEN
+              RETURN
+           ENDIF
+           CALL GETINT(0, 9999, 0, 'Bottom left Y co-ord of region',
+     +          IY1, KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+           IF (IOK.EQ.1) THEN
+              RETURN
+           ENDIF
+           CALL GETINT(IX1,9999,9999,'Top right X co-ord of region',
+     +          IX2, KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+           IF (IOK.EQ.1) THEN
+              RETURN
+           ENDIF
+           CALL GETINT(IY1,9999,9999,'Top right Y co-ord of region',
+     +          IY2, KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+           IF (IOK.EQ.1) THEN
+              RETURN
+           ENDIF
+           CALL GETINT(1,20,5,'Line thickness',
+     +          IWIDTH, KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+           IF (IOK.EQ.1) THEN
+              RETURN
+           ENDIF
+        ELSE
+           LNDSCP = 0
+           IX1 = 0
+           IX2 = 9999
+           IY1 = 0
+           IY2 = 9999
+           IWIDTH = 5
+        END IF
+        FIL2 = FNDFIL(FILNAM)
+        POSTO = OPENP(FIL2, IX1, IY1, IX2, IY2, IWIDTH, LNDSCP)
+      END IF
+      END
+      SUBROUTINE SHUTD
+C
+C routine to contain all necessary shutdown procedures
+C
+      IJUNK=OPENP(' ',0,0,0,0,0,0)
+      END
+      SUBROUTINE XHAIRR(ISXMAX,ISYMAX,IX,IY,TERM,DBTDUX,DBTDUY)
+      CHARACTER TERM
+      CALL XHAIR(IX,IY,TERM)
+      END
+      REAL FUNCTION CWORLD(I,I0,ILEN,RMIN,RMAX)
+      CWORLD = RMIN + (REAL(I-I0)/ILEN) * (RMAX-RMIN)
+      END
+      INTEGER FUNCTION NOPWIN(IY,IY0,IYLEN,MAXOPT)
+      INTEGER IY0(MAXOPT),IYLEN(MAXOPT)
+      DO 20 I = 29,MAXOPT
+        IF(IY.GT.IY0(I)) THEN
+          IF(IY.LT.(IY0(I)+IYLEN(I))) THEN
+            NOPWIN = I
+            RETURN
+          END IF
+        END IF
+20    CONTINUE
+      NOPWIN = 0
+      END
+      SUBROUTINE MOVEGR(IX,IY)
+C
+C
+C
+CALL CROUTINE
+      CALL MOVEP(IX,IY)
+      CALL MOVEXY(IX,IY)
+      END
+      SUBROUTINE DRAWGR(IX,IY)
+C
+C
+C
+CALL CROUTINE
+      CALL DRAWP(IX,IY)
+      CALL DRAWXY(IX,IY)
+      END
+      SUBROUTINE WRITET(IX,IY,CHARS,NCHARS)
+      CHARACTER CHARS*(*)
+C
+C
+C
+CALL CROUTINE
+      CALL WRITEP(IX,IY,CHARS,NCHARS)
+      CALL WRITXY(IX,IY,CHARS,NCHARS)
+      END
+      SUBROUTINE POINTG(IX,IY)
+CALL C ROUTINE
+      CALL POINTP(IX,IY)
+      CALL DOTXY(IX,IY)
+      END
+      SUBROUTINE CLEARG
+CALL C ROUTINE HERE
+      CALL CLEARP
+      CALL BLANKG
+      END
diff --git a/src/staden/plotLog.c b/src/staden/plotLog.c
new file mode 100644
index 0000000..260487a
--- /dev/null
+++ b/src/staden/plotLog.c
@@ -0,0 +1,176 @@
+/*
+    Title: 	 plotLog
+
+    File: 	 plotLog.c
+    Purpose:	 Manage a log of plotting commands
+    Last update: Thursday May 9 1991
+*/
+
+
+/*
+    The log is implemented using two files. The main (binary) one
+    simply stores the PLItem's whilst the auxilliary one stores
+    the text for text plotting commands. This split is essential
+    so that the items in the main file are all of the same size
+    and can thus be processed in bulk. This decision relies on the
+    fact that text plot commands are a tiny minority.
+
+    As far as the abstraction of this module goes, the write pointer
+    is simply given by the end of file, the read pointer is held in
+    the file stream when `mode = reading'.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "plotLog.h"
+#include "fort.h"
+
+#include <stdio.h> /* IMPORT: fclose, remove, fseek, fwrite,
+		              fputc, fread, fgetc,
+			      FILE, tmpnam, L_tmpnam, EOF, NULL */
+
+#include <X11/Intrinsic.h> /* IMPORT: Dimension */
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef enum {reading, writing} Mode;
+
+
+
+
+/* ---- Static variables ---- */
+#ifndef L_tmpnam
+#define L_tmpnam        25              /* (sizeof(P_tmpdir) + 15) */
+#endif
+
+static char PLFileName[L_tmpnam];
+static char TFileName[L_tmpnam];
+static FILE *PLFileP;
+static FILE *TFileP;
+static Mode mode = writing;
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+PLResult CreatePL(void)
+/* Create the log, which is initially empty.
+   Read and write pointers are set to the start.
+*/
+{   /* Open the log files.
+       PlotLog - Update, truncated to empty, binary
+       Text    - Update, truncated to empty, text
+    */
+    PLFileP = fopen(tmpnam(PLFileName), "w+b");
+    TFileP  = fopen(tmpnam(TFileName),  "w+");
+
+    if ((PLFileP==NULL) || (TFileP==NULL))
+        return(CantCreate);
+    else
+        return(OK);
+}
+
+
+PLResult DestroyPL(void)
+/* Delete the log.
+   Free up resources.
+*/
+{   (void) fclose(PLFileP);
+    (void) remove(PLFileName);
+    (void) fclose(TFileP);
+    (void) remove(TFileName);
+
+    return(OK);
+}
+
+
+PLResult ResetWritePL(void)
+/* Reset the write pointer to the start of the log.
+   All previously written items are discarded.
+   The read pointer is left undefined.
+*/
+{   mode = writing;
+    PLFileP = freopen(PLFileName, "w+b", PLFileP);
+    TFileP  = freopen(TFileName,  "w+",  TFileP);
+
+    if ((PLFileP==NULL) || (TFileP==NULL))
+        return(Failed);
+    else
+        return(OK);
+}
+
+
+PLResult WritePL(PLItem i)
+/* Append item `i' to the log.
+   The write pointer is incremented.
+   The read pointer is left undefined.
+*/
+{   int_f x;
+
+    if (mode == reading)
+    {   mode = writing;
+        if (fseek(PLFileP, (off_t)0, 2) == EOF) return(Failed);
+        if (fseek(TFileP,  (off_t)0, 2) == EOF) return(Failed);
+    }
+
+    if (fwrite((char *) &i, sizeof(PLItem), 1, PLFileP) != 1) return(Failed);
+
+    if (i.op == DrawStringOp)
+    {  x = 0;
+       while (x < i.u.t.textl)
+       {   if (fputc(i.u.t.textp[x], TFileP) == EOF) return(Failed);
+           x++;
+       }
+    }
+
+    return(OK);
+}
+
+
+PLResult ResetReadPL()
+/* Reset the read pointer to the start of the log.
+*/
+{   mode = reading;
+    if (fseek(PLFileP, (off_t)0, 0) == EOF) return(Failed);
+    if (fseek(TFileP,  (off_t)0, 0) == EOF) return(Failed);
+
+    return(OK);
+}
+
+
+PLResult ReadPL(PLItem *i)
+/* Read an item from the log. and return in `i'.
+   The field `i.u.t.textp' must point to a character array into which
+   any `DrawStringOp' parameter is copied.
+   Increment the read pointer.
+   If the read pointer was past the last item in the log,
+   then return `EndOfPL'.
+
+   We must preserve the text pointer, because this will be overwritten
+   when we read the main record in.
+*/
+{   char *textp=i->u.t.textp;
+
+    if (fread((char *) i, sizeof(PLItem), 1, PLFileP) == 1)
+    {   if (i->op == DrawStringOp)
+	{  int_f x = 0;
+	   i->u.t.textp = textp;
+           while (x < i->u.t.textl)
+           {   if ((i->u.t.textp[x] =  fgetc(TFileP)) == EOF) return(Failed);
+               x++;
+           }
+	 }
+        return(OK);
+    }
+    else
+        return(EndOfPL);
+}
diff --git a/src/staden/plotLog.h b/src/staden/plotLog.h
new file mode 100644
index 0000000..59c7a05
--- /dev/null
+++ b/src/staden/plotLog.h
@@ -0,0 +1,100 @@
+/* Title: 	plotLog
+
+   File: 	plotLog.h
+   Purpose:	Manage a log of plotting commands
+   Last update: Mon Mar 19th 1990
+*/
+
+
+/*
+    This module manages a single log of plotting commands.
+
+    The structure of such commands is given by the PLItem type;
+    it is intended that the `PLOp op' item should indicate the
+    kind of operation, and the (x1,y1) position should indicate
+    where it starts. Further parameters may be required, depending
+    upon the operation; these are indicated by (x2,y2) or by `text'.
+
+    createPL() must be called before any operation, and deletePL()
+    should be called at the end to release all resources used.
+
+    The log maintains two pointers, a read pointer and a write pointer.
+    Items can be appended at the write pointer position with writePL()
+    and read from the read pointer position with readPL(). The
+    read and write pointers can be reset to the start of the log 
+    using resetWritePL() and resetReadPL() respectively.
+
+    The log can only be read sequentially, in a single sweep, because
+    the read pointer becomes undefined when anything is written.
+*/
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Dimsension */
+#include "fort.h"
+
+
+typedef enum
+{   DrawLineOp,
+    DrawPointOp,
+    DrawStringOp
+} PLOp;
+
+
+
+typedef struct
+{   PLOp op;
+    Position x1, y1;
+    union {struct {Position x2, y2;} p2;/* For DrawLIneOp */
+           struct {char *textp;		/* For DrawStringOp */
+                   int_f  textl;} t;
+          } u;
+} PLItem;
+
+
+typedef enum
+{   OK,         /* Normal result */
+    EndOfPL,    /* End of plot log reached. PLItem is undefined */
+    CantCreate, /* The log could not be created */
+    Failed      /* Non-specific, fatal, error */
+} PLResult;
+
+
+extern PLResult CreatePL(void);
+/* Create the log, which is initally empty.
+   Read and write pointers are set to the start.
+*/
+
+
+extern PLResult DestroyPL(void);
+/* Delete the log.
+   Free up resources.
+*/
+
+
+extern PLResult ResetWritePL(void);
+/* Reset the write pointer to the start of the log.
+   All previously written items are discarded.
+   The read pointer is left undefined.
+*/
+
+
+extern PLResult WritePL(PLItem i);
+/* Append item `i'to the log.
+   The write pointer is incremented.
+   The read pointer is left undefined.
+*/
+
+
+extern PLResult ResetReadPL(void);
+/* Reset the read pointer to the start of the log.
+*/
+
+
+extern PLResult ReadPL(PLItem *i);
+/* Read an item from the log. and return in `i'.
+   The field `i.u.t.textp' must point to a character array into which
+   any `DrawStringOp' parameter is copied.
+   Increment the read pointer.
+   If the read pointer was past the last item in the log,
+   then return `EndOfPL'.
+*/
diff --git a/src/staden/postscript.c b/src/staden/postscript.c
new file mode 100644
index 0000000..48a6a13
--- /dev/null
+++ b/src/staden/postscript.c
@@ -0,0 +1,209 @@
+#include <stdio.h>
+#include <ctype.h>
+#include <sys/types.h>
+#include <sys/stat.h>
+#include "FtoC.h" /* IMPORT: Fstr2Cstr */
+#include "postscript.h"
+#include "fort.h"
+
+static FILE *psfp = NULL;	/* pointer to ps output file */
+static int pspages;		/* num of pages in current file */
+static int psupdated = 0;	/* any updates since last 'showpage'? */
+
+/* --- externable functions --- */
+
+/* returns 1 for open, 0 for closed */
+int_f openp_x(char *FILE_p,
+	      int_f *x1_p, int_f *y1_p, int_f *x2_p, int_f *y2_p,
+	      int_f *width_p, int_f *lndscp_p, int_fl FILE_l) {
+    char fname[256];
+    struct stat buf;
+
+    Fstr2Cstr(FILE_p, FILE_l, fname, 256);
+    /*
+     * If given null filename then do nothing - gives us a chance to 
+     * close the file without opening another!
+     */
+    if (fname[0] == '\0') {
+	/*fprintf(stdout, "Closing PostScript output file\n");*/
+	closep_x();
+	return 0;
+    }
+
+    /*
+     * Otherwise if we've already a file open then close it too before
+     * opening a new one.
+     */
+    if (psfp != NULL) {
+	closep_x();
+	psfp = NULL;
+    }
+
+    /* now open our file */
+    if (stat(fname, &buf) != -1) {
+	fprintf(stdout, "*** Error - file already exists\n");
+	return 0;
+    }
+    if ((psfp = fopen(fname,"w")) == NULL) {
+	fprintf(stdout,
+		"*** Error - could not open postscript output file '%s'\n",
+		fname);
+	return 0;
+    } /*else
+	fprintf(stdout, "Sucessfully opened PostScript file '%s'.\n", fname);
+	*/
+    /* and write out the standard PostScript header */
+    fputs("%!PS-Adobe-1.0\n\
+%%DocumentFonts: Times-Roman\n\
+%%For: Staden software package\n\
+%%Title: graphics_output.ps\n\
+%%Creator: Staden software package\n\
+%%CreationDate: (of header) 10/08/92\n\
+%%Pages: (atend)\n\
+%%EndComments\n\
+\n\
+% User tunable variables\n\
+", psfp);
+    fprintf(psfp, "/landscape %d		def	%% portrait vs landscape mode\n\
+/linewidth %2d		def	%% thickness of lines\n\
+\n\
+%% (x1,y1) must be bottom left and (x2,y2) must be top right\n\
+/x1	%4d		def	%% size and\n\
+/y1 	%4d		def	%%   location of the\n\
+/x2     %4d		def	%%   box on graphics\n\
+/y2     %4d		def	%%   window to draw\n\
+", *lndscp_p, *width_p, *x1_p, *y1_p, *x2_p, *y2_p);
+    fputs("\n\
+% location on postscript page\n\
+/inch {72 mul} def\n\
+/leftborder {.5 inch}	def	% left/right margin in inches\n\
+/botborder  {.5 inch}	def	% top/bottom margin in inches\n\
+\n\
+% size of an A4 page\n\
+/pagew {11.75 inch} def\n\
+/pageh { 8.25 inch} def\n\
+\n\
+botborder leftborder\n\
+landscape 1 eq {\n\
+  90 rotate\n\
+  /width  {pagew botborder 2 mul sub} def\n\
+  /height {pageh leftborder 2 mul sub} def\n\
+  0 -612 translate\n\
+  exch\n\
+} {\n\
+  /width  {pageh botborder 2 mul sub} def\n\
+  /height {pagew leftborder 2 mul sub} def\n\
+} ifelse translate\n\
+/conv {width 10000 div mul} def\n\
+\n\
+x2 x1 sub conv width  div\n\
+y2 y1 sub conv height div\n\
+dup 3 2 roll dup 4 1 roll exch\n\
+\n\
+% centre drawing\n\
+dup 3 2 roll dup 4 1 roll exch\n\
+gt {width mul height exch sub 2 div 0 exch 3 -1 roll pop}\n\
+   {pop width mul 2 div 0} ifelse pop pop 0 0 translate\n\
+\n\
+% scale drawing to large as possible, but still maintaining the aspect ratio\n\
+dup 3 2 roll dup 4 1 roll exch\n\
+gt {pop 1 exch div}\n\
+   {1 exch div exch pop} ifelse dup scale\n\
+\n\
+x1 conv neg y1 conv neg translate\n\
+\n\
+% abbreiviations to shrink the drawing data\n\
+/s {stroke} def\n\
+/l {2 copy lineto s moveto} def\n\
+/m {moveto} def\n\
+/r {currentlinewidth 0 rlineto s} def\n\
+/p {showpage} def\n\
+/t {show} def\n\
+/n {newpath} def\n\
+/i {\n\
+  /Times-Roman findfont 290 scalefont setfont\n\
+  2 setlinecap % square caps => equiv to square pixels\n\
+  linewidth setlinewidth\n\
+  1 conv dup scale\n\
+} def\n\
+\n\
+%%EndProlog\n\
+", psfp);
+    pspages = 0;
+    psupdated = 0;
+
+    return 1;
+}
+
+void closep_x() {
+    if (psfp != NULL) {
+	/* do showpage etc if we've written any more since last showpage */
+	if (psupdated)
+	    clearp_x();
+	fputs("%%Trailer\n", psfp);
+	fprintf(psfp, "%%%%Pages: %d\n", pspages);
+	fclose(psfp);
+    }
+}
+
+void clearp_x() {
+    if (psfp != NULL) {
+	fputs("s\n", psfp);
+	fputs("p\n", psfp);
+	psupdated = 0;
+    }
+}
+
+/*
+ * starts a new page - called when we do the first piece of PostScript
+ * after clearp_x().
+ */
+static void psnext() {
+    if (psupdated == 0) {
+	psupdated = 1;
+	pspages++;
+	fprintf(psfp, "%%%%Page: %d %d\n", pspages, pspages);
+	fputs("i n\n", psfp);
+    }
+}
+
+void movep_x(int_f *IX_p, int_f *IY_p) {
+    if (psfp) {
+	psnext();
+	fprintf(psfp, "%d %d m\n", (int)*IX_p, (int)*IY_p);
+    }
+}
+
+void drawp_x(int_f *IX_p, int_f *IY_p) {
+    if (psfp) {
+	psnext();
+	fprintf(psfp, "%d %d l\n", (int)*IX_p, (int)*IY_p);
+    }
+}
+
+void pointp_x(int_f *IX_p, int_f *IY_p) {
+    if (psfp) {
+	psnext();
+	fprintf(psfp, "%d %d m\nr\n", (int)*IX_p, (int)*IY_p);
+    }
+}
+
+void writep_x(int_f *IX_p, int_f *IY_p, char *TEXT_p, int_f *NCHAR_p,
+	      int_fl TEXT_l) {
+    int_f len = *NCHAR_p, i;
+    char c;
+
+    if (psfp) {
+	psnext();
+	fprintf(psfp, "%d %d m\n(", (int)*IX_p, (int)*IY_p);
+	/* need to check for '\', '(' and ')' */
+	for (i=0; i<len; i++) {
+	    if (isprint(c = TEXT_p[i])) {
+		if (c == '(' || c == ')' || c == '\\')
+		    fputc('\\', psfp);
+		fputc(c, psfp);
+	    }
+	}
+	fputs(") t\n", psfp);
+    }
+}
diff --git a/src/staden/postscript.h b/src/staden/postscript.h
new file mode 100644
index 0000000..ba8b530
--- /dev/null
+++ b/src/staden/postscript.h
@@ -0,0 +1,26 @@
+#define clearg_x clearg_
+#define movegr_x movegr_
+#define drawgr_x drawgr_
+#define pointg_x pointg_
+#define writet_x writet_
+#define openp_x openp_
+#define closep_x closep_
+#define clearp_x clearp_
+#define movep_x movep_
+#define drawp_x drawp_
+#define pointp_x pointp_
+#define writep_x writep_
+
+#include "fort.h"
+
+/* --- function prototypes --- */
+int_f openp_x(char *FILE_p,
+	      int_f *x1_p, int_f *y1_p, int_f *x2_p, int_f *y2_p,
+	      int_f *width_p, int_f *lndscp_p, int_fl FILE_l);
+void closep_x();
+void clearp_x();
+void movep_x(int_f *IX_p, int_f *IY_p);
+void drawp_x(int_f *IX_p, int_f *IY_p);
+void pointp_x(int_f *IX_p, int_f *IY_p);
+void writep_x(int_f *IX_p, int_f *IY_p, char *TEXT_p, int_f *NCHAR_p,
+	      int_fl TEXT_l);
diff --git a/src/staden/progSpec.h b/src/staden/progSpec.h
new file mode 100644
index 0000000..7ed963b
--- /dev/null
+++ b/src/staden/progSpec.h
@@ -0,0 +1,53 @@
+/*
+    Title:       progSpec
+
+    File: 	 progSpec.h
+    Purpose:	 Generic header file for program specific data
+    Last update: Tue Jun 19 1990
+*/
+
+
+/*
+    This module defines the shape of those entities which are specific
+    to each program. The actual definitions are held in files which
+    go by the name <progname>Spec.c.
+
+    Menu creation and help facilities are included.
+*/
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget, XtCallbackProc */
+
+
+
+
+extern void CreateProgMenus(Widget parentWid,
+			    XtCallbackProc cbp, XtPointer client_data);
+/*
+    Install the menus for this program into `parentWid'.
+    When pressed, each item will call `cbp' providing `client_data'
+    and the number of the item as `call_data'.
+*/
+
+
+
+
+extern const int botHelpOpt;
+extern const int topHelpOpt;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+extern const char helpTextFN[];
+extern const char helpPtrsFN[];
+/*
+    File names for the help text and pointer files.
+*/
+
+
+extern const char *helpTopics[];
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/rep.f b/src/staden/rep.f
new file mode 100644
index 0000000..89440ec
--- /dev/null
+++ b/src/staden/rep.f
@@ -0,0 +1,954 @@
+      SUBROUTINE FMAIN()
+C
+C rep compares a set of sequence readings against a library of alu
+C sequences in both orientations. It creates 3 output files:
+C a list of files that pass
+C a list of files that fail
+C a log file which contains the highest score found for each reading
+C WARNING: rep also modifies all files in which it finds a match
+C by writing a record that is translated by the assembly program into
+C a tag to mark alu sequences. (First it copies the original file, say
+C called fred, to fred.ALU then it modifies the original).
+C The list of matching sequences includes:
+C the file name, the top score, the top score for the other end of the read (0
+C if not above the cutoff), and the number of bases that does not appear to 
+C contain ALU.
+C One assembly strategy would assemble the file of passes, and then sort the
+C fail file on the last column so that the reads with the most nonALU sequence
+C are at the top of the list, then assemble those.
+C The log file is simply for checking the sensitivity of the program.
+C
+      PARAMETER (MAXSEQ = 50000,
+     +           MAXRD = 1000,
+     +           MAXWLN = 4,
+     +           LCONST = 4*MAXWLN,
+     +           MAXWRD = 4**MAXWLN,
+     +           MAXFIL = 150,
+     +           MAXRSQ = 400,
+     +           MAXCF  = 4000,
+     +           MAXDEV = 7,
+     +           NAMLEN = 80)
+C
+C MAXFIL MAX NUMBER REPEAT FILES
+C MAXRSQ MAX LENGTH OF EACH REPEAT FILE
+C MAXCF  MAX COMPARISON LENGTH
+C
+      CHARACTER SEQV(MAXSEQ),SEQG(MAXRD)
+      INTEGER WORDP(MAXWRD,MAXFIL),POSN(MAXRSQ,MAXFIL)
+      REAL HIST(-MAXCF:MAXCF)
+      CHARACTER*(NAMLEN) NAMES(MAXFIL)
+      INTEGER STARTS(MAXFIL),MATRIX(5,5),RC,BESTRS(MAXRD),BESTRC(MAXRD)
+      REAL BESTSC(MAXRD)
+      CHARACTER BESTD(MAXRD)
+      INTEGER CONSTS(0:LCONST),DEVNOS(MAXDEV)
+      INTEGER SEQVI(MAXSEQ),SEQGI(MAXRD),SEQR(MAXRD)
+      CHARACTER*(NAMLEN) FILNAM,HELPF,NAME,TNAME,VNAME
+      INTEGER GNFFOF
+      EXTERNAL GNFFOF
+      PARAMETER (VNAME='.ALU')
+      ICG = 0
+      ICB = 0
+      ICB2 = 0
+      IDM = 5
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL MAKEID(MATRIX,IDM)
+      IDEVNI = DEVNOS(1)
+      IDEVV = DEVNOS(2)
+      IDEVNO = DEVNOS(3)
+      IDEVNF = DEVNOS(5)
+      IDEVNR = DEVNOS(6)
+      IDEVNL = DEVNOS(7)
+      WRITE(KBOUT,*)'rep v1.1: repeat examination program. April 93'
+      WRITE(KBOUT,*)'Author: Rodger Staden'
+      WRITE(KBOUT,*)'Copyright: Medical Research Council, UK'
+      FILNAM = ' '
+      CALL OPENF1(IDEVNI,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Input file of gel reading file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNR,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Input file of repeat file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNO,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Output file of passed file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNF,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Output file of failed file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNL,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Log file name',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      MN = 2
+      MX = MAXWLN
+      LENGTH = 4
+C      CALL GETINT(MN,MX,LENGTH,
+C     +'Word length',
+C     +IVAL,KBIN,KBOUT,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) STOP
+C      LENGTH = IVAL
+      MN = 1
+      MX = 11
+      LW = 7
+C      CALL GETINT(MN,MX,LW,
+C     +'Number of diagonals to combine',
+C     +IVAL,KBIN,KBOUT,
+C     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C      IF(IOK.NE.0) STOP
+C      LW = IVAL
+      XMN = 0.0
+      XMX = LW*1.0
+      XMX = 1.
+      CUT = 0.6
+      CALL GETRL(XMN,XMX,CUT,
+     +'Cutoff score',
+     +VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      CUT = VAL
+C
+C set initial values so we hash the first vector
+C
+      ICSITT = 0
+      IPSITT = 0
+      ICSITE = ICSITT
+      IPSITE = IPSITT
+      IDE = (IDM-1)**LENGTH
+      CALL SETCN(CONSTS,LENGTH,IDM,LCONST)
+      CALL GETSQS(IDEVNR,IDEVV,SEQV,MAXSEQ,STARTS,NAMES,
+     +FILNAM,NFILE,MAXFIL,MAXNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IDIMV = STARTS(NFILE+1)-1
+      NFILES = NFILE + 1
+      CALL CONNUM(SEQV,SEQVI,IDIMV)
+      DO 3 NSEQ = 1,NFILE
+        IS = STARTS(NSEQ)
+        IE = STARTS(NSEQ+1) - 1
+        IDIMV = IE - IS + 1
+C        WRITE(*,*)(SEQVI(K),K=IS,IE)
+        CALL ENCONC(SEQVI(IS),IDIMV,POSN(1,NSEQ),WORDP(1,NSEQ),
+     +  IDE,IDM,CONSTS,LENGTH,
+     +  LCONST)
+ 3    CONTINUE
+      CALL BLATA(WORDP,IDE,MAXFIL,LENGTH,NSEQ,CONSTS,LCONST)
+C      CALL LISTN(SEQV,MAXSEQ,STARTS,NFILE,60,1,200,IDEV,KBOUT,
+C     +NAMES,MAXFIL)
+      IDEVE = 0
+      IDEVG = IDEVV
+      MINR = 20
+      JGEL = 0
+ 10   CONTINUE
+C Get next experiment file name
+      IOK = GNFFOF(IDEVNI,NAME)
+      IF(IOK.EQ.1) THEN
+        WRITE(KBOUT,*)
+     + 'Finished after processing',JGEL,' files and finding'
+        WRITE(KBOUT,*)ICG,' free of the repeat'
+        WRITE(KBOUT,*)ICB,' contain the repeat'
+        WRITE(KBOUT,*)ICB2,' contain the repeat at both ends'
+        STOP
+      ELSE IF(IOK.EQ.2) THEN
+        CALL ERROM(KBOUT,'Empty line in file of file names')
+        GO TO 10
+      ELSE IF(IOK.EQ.3) THEN
+        CALL ERROM(KBOUT,'Error reading file of file names')
+        GO TO 10
+      END IF
+      IDIMG = MAXRD
+      CALL OPENRS(IDEVG,NAME,IOK,LRECL,2)
+      CALL ARRFIM(IDEVG,SEQG,IDIMG,KBOUT)
+      IF (IDIMG.GT.MINR) THEN
+        JGEL = JGEL + 1
+        CALL CONNUM(SEQG,SEQGI,IDIMG)
+        CALL VCUTT(SEQR,
+     +    POSN,WORDP,IDE,MAXRSQ,MAXFIL,SEQGI,IDIMG,CONSTS,
+     +    LENGTH,IDM,LCONST,HIST,MAXCF,KBOUT,CUT,LC,RC,
+     +    STARTS,NFILES,LW,
+     +    IDEVNF,IDEVNO,IDEVNL,
+     +    NAME,NAMES,MATRIX,SEQVI,MAXSEQ,SCORE,ICG,ICB,
+     +    ICB2,
+     +    BESTRS,BESTRC,BESTSC,BESTD,MAXRD,KCUT,LCUT,EMAX)
+C        write(*,*)lc,rc
+        IF(LC.EQ.0) THEN
+            CLOSE(UNIT=IDEVG)
+        ELSE
+          CALL VPOUTR(IDEVG,IDEVNR,NAME,LC,RC,
+     +    SCORE,VNAME,TNAME,KCUT,LCUT,EMAX,IOK)
+          IF(IOK.NE.0) STOP
+        END IF
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE SQCOMI(SEQ,ID)
+      INTEGER SEQ(ID),COMP(5)
+      SAVE COMP
+      DATA COMP/3,4,1,2,5/
+      DO 10 I=1,ID
+       SEQ(I) = COMP(SEQ(I))
+ 10   CONTINUE
+      END
+      SUBROUTINE SQREVI(SEQ,ID)
+      INTEGER SEQ(ID)
+      IEND = ID/2
+      DO 100 I=1,IEND
+       J = SEQ(I)
+       SEQ(I) = SEQ(ID+1-I)
+       SEQ(ID+1-I) = J
+ 100  CONTINUE
+      END
+      SUBROUTINE VCUTT(SEQR,
+     +POSN,WORDP,IDE,MAXRSQ,MAXFIL,SEQH,IDIMH,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,ICUT,JCUT,
+     +STARTS,NFILES,LW,
+     +IDEVNF,IDEVNO,IDEVNL,
+     +NAME,NAMES,MATRIX,SEQV,MAXVSQ,DMAX,ICG,ICB,ICB2,
+     +BESTRS,BESTRC,BESTSC,BESTD,MAXRD,KCUT,LCUT,EMAX)
+      INTEGER SEQH(IDIMH),STARTS(NFILES),SEQR(IDIMH),RC,MATRIX(IDM,IDM)
+      INTEGER POSN(MAXRSQ,MAXFIL),WORDP(IDE,MAXFIL),CONSTS(0:LCONST)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER BESTRS(MAXRD),BESTRC(MAXRD),SEQV(MAXVSQ)
+      REAL BESTSC(MAXRD)
+      CHARACTER BESTD(MAXRD)
+      CHARACTER NAME*(*),ORIEN*9
+      CHARACTER*(*) NAMES(NFILES)
+C      WRITE(*,*)IDIMH
+      CALL COPYI(SEQH,SEQR,IDIMH)
+      CALL SQREVI(SEQR,IDIMH)
+      CALL SQCOMI(SEQR,IDIMH)
+      CALL FILLI(BESTRS,IDIMH,0)
+      CALL FILLI(BESTRC,IDIMH,0)
+      CALL FILLR(BESTSC,IDIMH,0)
+      CALL FILLC(BESTD,IDIMH,'F')
+      DMAX = -1.
+C      WRITE(*,*)NFILES
+      IBLANK = INDEX(NAME,' ')
+      DO 30 NSEQ = 1,NFILES-1
+        IS = STARTS(NSEQ)
+        IE = STARTS(NSEQ+1) - 1
+        IDIMV = IE - IS + 1
+        IDIMHT = IDIMH
+          IBLANK = INDEX(NAME,' ')
+          JBLANK = INDEX(NAMES(NSEQ),' ')
+        CALL VCUT(IDIMV,POSN(1,NSEQ),WORDP(1,NSEQ),MAXRSQ,
+     +  IDE,SEQH,IDIMHT,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LC,RC,LW,DSCORE,
+     +  SDS,MATRIX,SEQV(STARTS(NSEQ)))
+C        WRITE(*,*)CUT,DMAX,DSCORE
+        IF(DSCORE.GT.CUT) THEN
+          IF(DSCORE.GT.BESTSC(LC)) THEN
+            BESTSC(LC) = DSCORE
+            BESTRC(LC) = RC
+            BESTRS(LC) = NSEQ
+          END IF
+        END IF
+        IF(DSCORE.GT.DMAX) THEN
+          DMAX = DSCORE
+          ORIEN = 'Left'
+          NBSEQ = NSEQ
+          ICUT = LC
+          JCUT = RC
+        END IF
+        CALL VCUT(IDIMV,POSN(1,NSEQ),WORDP(1,NSEQ),MAXRSQ,
+     +  IDE,SEQR,IDIMHT,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LC,RC,LW,DSCORE,
+     +  SDS,MATRIX,SEQV(STARTS(NSEQ)))
+C        WRITE(*,*)DMAX,DSCORE
+        IF(DSCORE.GT.CUT) THEN
+          JCUT = IDIMH - LC + 1
+          ICUT = IDIMH - RC + 1
+          LC = ICUT
+          RC = JCUT
+          IF(DSCORE.GT.BESTSC(LC)) THEN
+            BESTSC(LC) = DSCORE
+            BESTRC(LC) = RC
+            BESTRS(LC) = NSEQ
+            BESTD(LC) = 'R'
+          END IF
+        END IF
+        IF(DSCORE.GT.DMAX) THEN
+          DMAX = DSCORE
+          ORIEN = 'Rev'
+          NBSEQ = NSEQ
+          JCUT = IDIMH - LC + 1
+          ICUT = IDIMH - RC + 1
+        END IF
+ 30   CONTINUE
+      IF(DMAX.LT.CUT) THEN
+        EMAX = 0.
+        JBLANK = INDEX(NAMES(NBSEQ),' ')
+        WRITE(IDEVNO,1001)NAME(1:IBLANK),DMAX,EMAX,IDIMH
+ 1001   FORMAT(A,2F5.2,I5)
+        WRITE(IDEVNL,1000)NAME(1:IBLANK),NAMES(NBSEQ)(1:JBLANK),
+     +  ORIEN,ICUT,JCUT,IDIMH,DMAX
+        ICG = ICG + 1
+        ICUT = 0
+        RETURN
+      END IF
+C
+C find max value
+C
+      DMAX = CUT
+      DO 40 I=1,IDIMH
+        IF(BESTSC(I).GT.DMAX) THEN
+          DMAX = BESTSC(I)
+          ICUT = I
+          JCUT = BESTRC(I)
+          NBSEQ = BESTRS(I)
+          IF(BESTD(I).EQ.'F') THEN
+            ORIEN = 'Left'
+          ELSE
+            ORIEN = 'Rev'
+          END IF
+        END IF
+ 40   CONTINUE
+      JBLANK = INDEX(NAMES(NBSEQ),' ')
+        WRITE(IDEVNL,1000)NAME(1:IBLANK),NAMES(NBSEQ)(1:JBLANK),
+     +  ORIEN,ICUT,JCUT,IDIMH,DMAX
+ 1000   FORMAT(A,' ',A,' ',A,3I6,F7.3)
+        ICB = ICB + 1
+C
+C now find best value in biggest bit thats left
+C
+      EMAX = CUT
+      IF(ICUT.GT.IDIMH-JCUT) THEN
+        LEFT = ICUT - 1
+        DO 50 I=1,ICUT-1
+          IF((BESTSC(I).GT.EMAX).AND.(BESTRC(I).LT.ICUT)) THEN
+            EMAX = BESTSC(I)
+            KCUT = I
+            LCUT = BESTRC(I)
+            MBSEQ = BESTRS(I)
+            LEFT = KCUT - 1
+            IF(BESTD(I).EQ.'F') THEN
+              ORIEN = 'Left'
+            ELSE
+              ORIEN = 'Rev'
+            END IF
+          END IF
+ 50     CONTINUE
+      ELSE
+        LEFT = IDIMH - JCUT
+        DO 60 I=JCUT,IDIMH
+          IF(BESTSC(I).GT.EMAX) THEN
+            EMAX = BESTSC(I)
+            KCUT = I
+            LCUT = BESTRC(I)
+            MBSEQ = BESTRS(I)
+            LEFT = IDIMH - LCUT
+            IF(BESTD(I).EQ.'F') THEN
+              ORIEN = 'Left'
+            ELSE
+              ORIEN = 'Rev'
+            END IF
+          END IF
+ 60     CONTINUE
+      END IF
+      IF(EMAX.GT.CUT) THEN
+        JBLANK = INDEX(NAMES(MBSEQ),' ')
+        WRITE(IDEVNL,1000)NAME(1:IBLANK),NAMES(MBSEQ)(1:JBLANK),
+     +  ORIEN,KCUT,LCUT,IDIMH,EMAX
+        WRITE(IDEVNF,1001)NAME(1:IBLANK),DMAX,EMAX,LEFT
+        ICB2 = ICB2 + 1
+      ELSE
+        EMAX = 0.
+        WRITE(IDEVNF,1001)NAME(1:IBLANK),DMAX,EMAX,LEFT
+        KCUT = 0
+      END IF
+      END
+      SUBROUTINE VCUT(IDIMV,
+     +POSN,WORDP,MAXRSQ,IDE,SEQH,IDIMH,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LC,RC,LW,DSCORE,
+     +SDS,MATRIX,SEQV)
+      INTEGER SEQH(IDIMH),RC,MATRIX(IDM,IDM),SEQV(IDIMV)
+      INTEGER POSN(MAXRSQ),WORDP(IDE),CONSTS(0:LCONST)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      EXTERNAL NCODEA
+C      WRITE(*,*)'VCUT'
+C        WRITE(*,*)(SEQH(KKK),KKK=1,IDIMH)
+C        WRITE(*,*)(SEQV(KKK),KKK=1,IDIMV)
+C
+C when we get here idimh is the length we want to screen
+C
+        CALL FILLR(HIST(LENGTH-IDIMV),IDIMH+IDIMV+1,0.)
+        DO 20 I = 1,IDIMH-LENGTH+1
+          J = NCODEA(SEQH(I),LENGTH,CONSTS,IDM,LCONST)
+          IF(J.NE.0)THEN
+            J1 = WORDP(J)
+C            write(*,*)i,seqh(i),j,j1
+            IF(J1.NE.0)THEN
+              K = I - J1
+              HIST(K) = HIST(K) + 1.
+10            CONTINUE
+              J2 = J1
+              J1 = POSN(J2)
+              IF(J1.NE.0)THEN
+                K = I - J1
+                HIST(K) = HIST(K) + 1.
+                GO TO 10
+              END IF
+            END IF
+          END IF
+20      CONTINUE
+C        I1 = LENGTH - IDIMV
+C        I2 = IDIMH - LENGTH
+C        DO 55 IJK=I1,I2
+C          WRITE(33,*)IJK,HIST(IJK)
+C 55       CONTINUE
+        CALL PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+C        DO 56 IJK=I1,I2
+C          WRITE(33,*)IJK,HIST(IJK)
+C 56    CONTINUE
+        CALL FCUT(HIST,IDIMV,IDIMH,SEQV,SEQH,LENGTH,
+     +  MAXSEQ,CUT,LC,RC,LW,DSCORE,SDS,MATRIX,IDM)
+      END
+      SUBROUTINE PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+C
+C modified 29-3-93 to correct for diagonal length being shorter by length -1
+C
+      IF(IDIMV.GE.IDIMH) THEN
+        D  = 1
+        DO 10 I=LENGTH-IDIMV,IDIMH-IDIMV-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 10       CONTINUE
+        D  = IDIMH - LENGTH + 1
+        DO 20 I=IDIMH-IDIMV,0
+          HIST(I) = HIST(I)/D
+ 20       CONTINUE  
+        D  = IDIMH - 1 - LENGTH + 1
+        DO 30 I=1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 30       CONTINUE
+      ELSE
+        D  = 1
+        DO 40 I=LENGTH-IDIMV,-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 40       CONTINUE
+        D  = IDIMV - LENGTH + 1
+        DO 50 I=0,IDIMH-IDIMV
+          HIST(I) = HIST(I)/D
+ 50       CONTINUE
+        D  = IDIMV - 1 - LENGTH + 1
+        DO 60 I=IDIMH-IDIMV+1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 60       CONTINUE
+      END IF
+      END
+      SUBROUTINE FCUT(HIST,IDIMV,IDIMH,SEQV,SEQH,LENGTH,
+     + MAXSEQ,CUT,LC,RC,LW,D,SDS,MATRIX,IDM)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER RC,SEQH(IDIMH),MATRIX(IDM,IDM),SEQV(IDIMV)
+      PARAMETER (MINDIA = 25)
+      REAL DSCAN
+      EXTERNAL DSCAN
+C
+C dont look within mindia of edges
+C
+C and rotate over a window of length lw
+C
+      LB = 1 + LW/2
+      LF = LW/2
+      I1 = LENGTH - IDIMV + MINDIA
+      I2 = IDIMH - LENGTH - MINDIA
+C first we do the first window length
+      DT = 0.
+      DO 1 I=I1-LB,I1+LF-1
+        DT = DT + HIST(I)
+ 1    CONTINUE
+      LC = 0
+      RC = 0
+      DMAX = 0.
+      ID = 0
+C      WRITE(*,*)'LOOKING AT ',I1,I2
+C      RMSQ = 0.
+C      RM = 0.
+C      write(*,*)i1,i2
+      DO 10 I=I1,I2
+        DT = DT - HIST(I-LB) + HIST(I+LF)
+C        DT = HIST(I)
+C        WRITE(*,*)I,DT
+C        RM = RM + DT
+C        RMSQ = RMSQ + DT*DT
+        IF(DT.GT.DMAX) THEN
+          DMAX = DT
+          ID = I
+C          write(*,*)'i,dt',i,dt
+        END IF
+ 10     CONTINUE
+C      N = I2 - I1
+C      RM = RM / N
+C      RMSQ = RMSQ / N
+C      RM2 = RM * RM
+C      SD = 0.
+C      T = RMSQ - RM2
+C      IF (T.GT.0.) SD = SQRT(T)
+C      SDS = (DMAX - RM ) / SD
+C      D = DMAX
+C      WRITE(*,*)'Best diagonal, score and local sum',ID,DMAX,D
+C      WRITE(*,1000)ID,DMAX,D
+C 1000 FORMAT(I6,2F10.3)
+C      IF(D.LT.CUT) RETURN
+      IF(IDIMV.GE.IDIMH) THEN
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = IDIMH
+        ELSE
+          LC = 1
+          RC = MIN(IDIMH,IDIMV + ID)
+        END IF
+      ELSE
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = MIN(ID+IDIMV,IDIMH)
+        ELSE
+          LC = 1
+          RC = ID + IDIMV
+        END IF
+      END IF
+C      WRITE(*,*)' Discard ',LC, ' to ',RC
+      LS = 5
+      MS = 4
+      CALL DIAGO(ID,IX,IY)
+C      write(*,*)'ix,iy',ix,iy
+      D = DSCAN(SEQH,IDIMH,IX,SEQV,IDIMV,IY,LW,LS,MATRIX,IDM,MS)
+C      WRITE(*,*)IX,IY,D
+      END
+      INTEGER FUNCTION GNFFOF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      EXTERNAL NOTLR
+C
+C routine to read a file of file names and return a name
+C deals with leading spaces and trims names at first space
+C after name: eg '  fred is a bum' is returned as 'fred'
+C needed because file names can contain spaces (not our file names!)
+C and the open statement expects the names to match precisely
+C
+C return 0 = ok, 2 = empty line in file, 3 = error in read, 1 = end of file
+C
+      READ(IDEV,1000,ERR=100,END=200)NAME
+ 1000 FORMAT(A)
+C
+C get first non space position
+C
+      LENGTH = LEN(NAME)
+      I = NOTLR(NAME,LENGTH,' ')
+C empty line ?
+      IF(I.EQ.0) THEN
+        GNFFOF = 2
+        RETURN
+      END IF
+C now want first space after I
+      J = INDEX(NAME(I+1:),' ')
+      IF(J.EQ.0) THEN
+        J = LENGTH
+      ELSE
+        J = J + I - 1
+      END IF
+      CALL SHFTLS(NAME,I,1,J)
+      NAME(J-I+2:) = ' '
+      GNFFOF = 0
+      RETURN
+ 100  CONTINUE
+      GNFFOF = 3
+      RETURN
+ 200  CONTINUE
+      GNFFOF = 1
+      END
+      SUBROUTINE SHFLCA(STRING,MAXAR,FROMS,TO,FROME)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROMS,FROME,TO
+C
+C  shift left from from to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J) = STRING(I)
+        J = J + 1
+ 10   CONTINUE
+      END
+      SUBROUTINE AERROR(IDEVS,IDEVF,NAME,IERR)
+      CHARACTER NAME*(*)
+C
+C handle errors for assembly
+C
+C errors are:
+C 0 file not found
+C 1 read too short
+C 2 failed to align and not entered
+C 3 failed on entry
+C 4 failed to align but entered
+      WRITE(IDEVF,1000)NAME(1:INDEX(NAME,' ')),IERR
+ 1000 FORMAT(A,I2)
+      CALL ERROM(IDEVS,'Failed reading written to error file')
+      END
+      SUBROUTINE GETSQS(IDEV1,IDEV2,SEQ,MAXSEQ,STARTS,NAMES,
+     +FILNAM,NFILE,MAXFIL,MAXNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER SEQ(MAXSEQ),HELPF*(*)
+      CHARACTER*(*) FILNAM,NAMES(MAXFIL)
+      INTEGER STARTS(MAXFIL)
+C
+C concatenate sequences into seq
+C number of them is nfile
+C sequence i starts at starts(i) and has file name names(i)
+C and starts(nfile+1) is the position of the first unused element
+      NFILE = 0
+      IEND = 0
+      FILNAM = ' '
+C      CALL OPENF1(IDEV1,FILNAM,0,IOK,KBIN,KBOUT,
+C     +'File of file names',
+C     +IHELPS,IHELPE,HELPF,IDEVH)
+2     CONTINUE
+      IF (NFILE.LT.MAXFIL) THEN
+        READ(IDEV1,1000,END=3)FILNAM
+ 1000   FORMAT(A)
+C        WRITE(*,*)NFILE,IEND,FILNAM
+        NFILE = NFILE + 1
+        STARTS(NFILE) = IEND + 1
+        NAMES(NFILE) = FILNAM
+        IDIM2 = MAXSEQ - IEND
+        IF (IDIM2.GT.0) THEN
+          CALL OPENRS(IDEV2,FILNAM,IOK,LRECL,2)
+          CALL ARRFIL(IDEV2,SEQ(STARTS(NFILE)),IDIM2,KBOUT)
+          CLOSE(UNIT=IDEV2)
+          IEND = IEND + IDIM2
+          GO TO 2
+        END IF
+C
+C if we get here weve run out of array space
+C
+        NFILE = NFILE - 1
+        WRITE(KBOUT,*)'Only ',NFILE,' files read: no more memory'
+        CLOSE(UNIT=IDEV1)
+        RETURN
+      ELSE
+C
+C if we get here weve got too many files
+C
+        NFILE = NFILE - 1
+        CLOSE(UNIT=IDEV1)
+        WRITE(KBOUT,*)'Maximum files ',NFILE,' read'
+        RETURN
+      END IF
+ 3    CONTINUE
+      STARTS(NFILE+1) = IEND + 1
+      CLOSE(UNIT=IDEV1)
+      END
+      SUBROUTINE LISTN(SEQ,MAXCHR,LENSEQ,NFILE,LINLEN,I1,I2,
+     +IDEV,KBOUT,NAMES,MAXFIL)
+      CHARACTER*(*) NAMES(MAXFIL)
+      PARAMETER (MAXLIN = 120)
+      CHARACTER SEQ(MAXCHR),LINE*(MAXLIN)
+      INTEGER LENSEQ(MAXFIL)
+      EXTERNAL NOTIRL
+C     SET WIDTH FOR LAST PAGE
+      LPAGE=MOD(I2-I1+1,LINLEN)
+C     HOW MANY PAGE WIDTHS?
+      NPAGE=1+(I2-I1+1)/LINLEN
+      IF(MOD(I2-I1+1,LINLEN).EQ.0)THEN
+        NPAGE=NPAGE-1
+        LPAGE=LINLEN
+      END IF
+      ISTART=I1-LINLEN
+      DO 50 I=1,NPAGE
+        ISTART=ISTART+LINLEN
+        IF(I.EQ.NPAGE)LINLEN=LPAGE
+        WRITE(IDEV,1006)(K,K=ISTART+9,ISTART+LINLEN-1,10)
+        DO 40 J=1,NFILE
+          KF = ISTART + LENSEQ(J) - 1
+          KT = MIN(KF+LINLEN,LENSEQ(J+1)) - 1
+          LINE(1:) = ' '
+          WRITE(LINE,1003,ERR=60)(SEQ(K),K=KF,KT)
+1003      FORMAT(' ',60A1)
+          LINE(KT-KF+4:) = NAMES(J)(1:15)
+          WRITE(IDEV,1004,ERR=60)LINE(1:NOTIRL(LINE,MAXLIN,' '))
+ 1004     FORMAT(A)
+1006      FORMAT(' ',10I10)
+40      CONTINUE
+        WRITE(IDEV,1008)
+1008    FORMAT( )
+50    CONTINUE
+      RETURN
+60    CONTINUE
+      WRITE(KBOUT,*)' Error writing file'
+      END
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+      SUBROUTINE BLATA(WORDP,IDE,MAXFIL,LENGTH,NSEQ,CONSTS,LCONST)
+      INTEGER WORDP(IDE,MAXFIL)
+      INTEGER SEQA(4),CONSTS(0:LCONST)
+      EXTERNAL NCODEA
+C
+C fudge: set xyxy to 0
+C
+      DO 10 I = 1,4
+      DO 10 J = 1,4
+        SEQA(1) = I
+        SEQA(3) = I
+        SEQA(2) = J
+        SEQA(4) = J
+        JA = NCODEA(SEQA,LENGTH,CONSTS,5,LCONST)
+        DO 5 K = 1,NSEQ
+          WORDP(JA,K) = 0
+ 5      CONTINUE
+ 10   CONTINUE
+      END
+      SUBROUTINE MAKEID(MATRIX,IDM)
+      INTEGER MATRIX(IDM,IDM)
+C
+C set main diagonal to 1 except idm,idm
+C
+      DO 10 I=1,IDM
+        DO 10 J=1,IDM
+          IF(I.EQ.J) THEN
+            MATRIX(I,J) = 1
+          ELSE
+            MATRIX(I,J) = 0
+          END IF
+ 10   CONTINUE
+      MATRIX(IDM,IDM) = 0
+      END
+      SUBROUTINE DIAGO(D,I,J)
+      INTEGER D
+      IF(D.GE.0) THEN
+        J = 1
+        I = D + 1
+      ELSE
+        I = 1 
+        J = ABS(D) + 1
+      END IF
+      END
+      REAL FUNCTION DSCAN(SX,LX,PXIN,SY,LY,PYIN,NDD,
+     +LW,SCORES,MAXC,MS)
+C seqs sx,sy length lx,ly positions px,py
+C number of diagonals nd
+C length of windw lw
+C min score ms
+      PARAMETER (MAXD = 21)
+      INTEGER SX(LX),PXIN,PYIN,SCORES(MAXC,MAXC),SY(LY)
+      INTEGER PX,PY,CS(-MAXD:MAXD),CX(-MAXD:MAXD),CY(-MAXD:MAXD)
+      PX = PXIN
+      PY = PYIN
+C      WRITE(*,*)'PXIN,PYIN,ND,LW,MS,MAXC,MAXW',
+C     +PXIN,PYIN,ND,LW,MS,MAXC
+C
+C new scheme for choosing number of diagonals and reducing chnace
+C of finding matches close to edge. Let the number of diagonals nd
+C depend on the length of the diagonal we are going to scan along.
+C <29 nd = 3, >269 = 15
+C
+      L = MIN(LX-PX,LY-PY)
+      IF(L.LT.31) THEN
+        ND = 3
+      ELSE
+        ND = MIN(MAXD,3+6*((L-30)/40))
+      END IF
+C      WRITE(*,*)L,ND
+      ND2 = ND/2
+C
+C get start safely positioned by shifting the original p's
+C
+      I = 0
+      IF(PX.LT.ND2+1) I = ND2 - PX + 1
+      PX = PX + I
+      PY = PY + I
+C      WRITE(*,*)PX,PY
+C
+C set current x
+C
+      DO 10 I=-ND2,ND2
+        CX(I) = PX + I
+        CY(I) = PY
+ 10   CONTINUE
+C      WRITE(*,*)(CX(K),CY(K),K=-ND2,ND2)
+      DO 33 I=-MAXD,MAXD
+        CS(I) = 0
+ 33     CONTINUE
+C      CALL FILLI(CS(-ND2),ND,0)
+C
+C do the first window length
+C
+C      WRITE(*,*)(SY(K),K=1,10)
+      DO 30 J=0,LW-1
+        DO 20 I=-ND2,ND2
+C       WRITE(*,*)'J,I,I+J,CX,CY,SX,SY',J,I,I+J,CX(I)+J,CY(I)+J,
+C     + SX(CX(I)+J),SY(CY(I)+J)
+C          WRITE(*,*)CX(I)+J,CY(I)+J
+C          WRITE(*,*)SX(CX(I)+J),SY(CY(I)+J)
+          CS(I) = CS(I) + SCORES(SX(CX(I)+J),SY(CY(I)+J))
+ 20     CONTINUE
+ 30   CONTINUE
+C      WRITE(*,*)CS
+      IDSCAN = 0
+      IS = IMAXA(CS(-ND2),ND)
+      IF(IS.GE.MS) DSCAN = DSCAN + 1
+      NSTEPS = MIN(LX - (PX+LW-1+ND2),LY - (PY+LW-1+ND2))
+C      NSTEPS = MIN(LX - (PX+LW-1),LY - (PY+LW-1+ND2))
+C      WRITE(*,*)'NSTEPS',NSTEPS
+      DO 50 J=1,NSTEPS
+        DO 40 I=-ND2,ND2
+          ICX = CX(I)
+          ICY = CY(I)
+          CS(I) = CS(I) - SCORES(SX(ICX),SY(ICY))
+     +                  + SCORES(SX(ICX+LW),SY(ICY+LW))
+C          CS(I) = CS(I) - SCORES(SX(CX(I)),CTONUM(SY(CY(I))))
+C     +                  + SCORES(SX(CX(I)+LW),CTONUM(SY(CY(I)+LW)))
+C          WRITE(*,*)'I,CX(I),CY(I)',I,CX(I),CY(I)
+          CX(I) = CX(I) + 1
+          CY(I) = CY(I) + 1
+ 40     CONTINUE
+C      WRITE(*,*)J,CX(0),CY(0),(CS(K),K=-ND2,ND2)
+      IS = IMAXA(CS(-ND2),ND)
+      IF(IS.GE.MS) IDSCAN = IDSCAN + 1
+C      WRITE(*,*)IS,IDSCAN
+50    CONTINUE
+      DSCAN = REAL(IDSCAN)/NSTEPS
+      END
+      INTEGER FUNCTION IMAXA(I,N)
+      INTEGER I(N)
+      IMAXA = I(1)
+      DO 10 J=2,N
+        IMAXA = MAX(IMAXA,I(J))
+ 10     CONTINUE
+      END
+      SUBROUTINE VPOUTR(IDEVO,IDEVN,NAME,LC,RC,
+     +SCORE,VNAME,TNAME,KCUT,LCUT,EMAX,IOK)
+      CHARACTER LINE*80,NAME*(*),VNAME*(*),TNAME*(*)
+      INTEGER RC
+      EXTERNAL NOTRL
+C
+C IDEVO original file
+C IDEVN new reading file
+C IDEVNG fofn for good data
+C
+ 1005   FORMAT(A)
+C
+C Some ALU, so rename original file, and use old name for new file
+C
+      TNAME = NAME
+      CLOSE(IDEVO)
+      K = INDEX(NAME,' ')
+      NAME(K:) = VNAME
+      IOK = NCFRS(TNAME,NAME)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error renaming',TNAME
+        RETURN
+      END IF
+C
+C Reopen the file with its new name
+C
+      CALL OPENRS(IDEVO,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+C
+C Open the new file with the original name
+C
+      CALL OPENRS(IDEVN,TNAME,IOK,LRECL,1)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+ 5    CONTINUE
+      READ(IDEVO,1005,ERR=6,END=7)LINE
+      IF(LINE(1:1).EQ.';') THEN
+        WRITE(IDEVN,1005)LINE(1:NOTRL(LINE,80,' '))
+        GO TO 5
+      END IF
+ 7    CONTINUE
+      WRITE(IDEVN,1000)LC,RC-LC+1,SCORE
+      IF(KCUT.GT.0) THEN
+        WRITE(IDEVN,1000)KCUT,LCUT-KCUT+1,EMAX
+      END IF
+ 1000 FORMAT(';;ALUS',2I7,F10.3)
+      WRITE(IDEVN,1005,ERR=6)LINE(1:NOTRL(LINE,80,' '))
+ 8    CONTINUE
+      READ(IDEVO,1005,ERR=6,END=9)LINE
+      WRITE(IDEVN,1005,ERR=6)LINE(1:NOTRL(LINE,80,' '))
+      GO TO 8
+ 9    CONTINUE
+      CLOSE(IDEVO)
+      CLOSE(IDEVN)
+      IOK = 0
+      RETURN
+ 6    CONTINUE
+      WRITE(*,*)'Error in VPOUTR'
+      IOK = 1
+      END
+      INTEGER FUNCTION NCFRS(FROM,TO)
+      CHARACTER FROM*(*),TO*(*)
+      INTEGER RENAME
+      EXTERNAL RENAME
+C
+C rename file using system call to RENAME
+C
+      IFD = 0
+      ITD = 0
+      IF(INDEX(FROM,'/').EQ.0) THEN
+        FROM = './'//FROM
+        IFD = 1
+      END IF
+      IF(INDEX(TO,'/').EQ.0) THEN
+        TO = './'//TO
+        ITD = 1
+      END IF
+      NCFRS = RENAME(FROM,TO)
+      IF(IFD.EQ.1) FROM = FROM(3:)
+      IF(ITD.EQ.1) TO = TO(3:)
+      END
diff --git a/src/staden/sap.f b/src/staden/sap.f
new file mode 100644
index 0000000..32fd7d8
--- /dev/null
+++ b/src/staden/sap.f
@@ -0,0 +1,622 @@
+C     SAP (Sequence assembly program)
+C     AUTHOR RODGER STADEN
+C     9-1-90  sent idev instead of kbout to join and enter
+C     16-3-90 removed escape after bpause
+C     2-5-90 Many changes related to introducing graphics
+C     4-5-90 Allowed escape from getreg
+C     9-5-90 Introduced a default gel reading
+C    23-8-90 Added calls to SHOWFU
+C    28-11-90 See dbsys89
+C    22-1-91 Changed use of showfi to include version number
+C    27-1-91 Changed maxsav from 1000 to maxglm
+C     13-04-91 removed initial CLEARV
+C    28-7-91 added extra parameter for quality calc: mxgood, only the parts
+C    of reads upto this length are used
+C    21-8-91 Added function to find internal joins
+C    27-8-91 Added buffer flushing for database files
+      SUBROUTINE FMAIN()
+      INTEGER CHRSIZ,ECHRSZ
+      PARAMETER (
+     +           NAMLEN = 40,
+     +           MAXDEV = 10,
+     +           MAXSEQ=50000,
+     +           MAXGLM=4096,
+     +           MAXGL2=MAXGLM*2,
+     +           MINMEN=-3,
+     +           CHRSIZ=4,
+     +           ECHRSZ = 6,
+     +           IDT1=ECHRSZ*MAXGL2,
+     +           IDT2=2*CHRSIZ*MAXGL2,
+     +           IDT3=2*MAXGL2,
+     +           IDT4=2*ECHRSZ*MAXGL2,
+     +           MAXDB=1000)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      INTEGER RELPG(MAXDB),LREG,RREG,ANS
+      INTEGER TEMP1(IDT1),TEMP2(IDT2),TEMP3(IDT4)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILNAM,NAMPRO,FILE,NAMARC,FILMAR
+      CHARACTER*(NAMLEN) SHONAM
+      CHARACTER GEL(MAXGLM)
+      CHARACTER GEL2(MAXGLM),GEL3(MAXGLM),GEL4(MAXGLM)
+      PARAMETER (BOTOPT=0,TOPOPT=35)
+      PARAMETER (HELPF='SAPHELP')
+      PARAMETER (POINTF='SAPHPNT')
+      PARAMETER (FILMAR='SAPMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARGB(TOPOPT),MARGT(TOPOPT)
+      CHARACTER SEQ1(MAXSEQ)
+C     DBAUTO TEMP3A = TEMP1, SEQ3A = GEL3, SEQ2A = GEL2
+C     MATCHA = GEL, SEQ4A = GEL4
+      PARAMETER (LENGTH = 6,
+     +           MAXCON = 500,
+     +           MAXSAV = MAXGLM,
+     +           LPOWRC = CHRSIZ**LENGTH)
+      INTEGER CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ5(MAXGLM),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      INTEGER WINDOW
+C     DBAUTO
+      EQUIVALENCE (TEMP2,TEMP1)
+      EQUIVALENCE (TEMP1,TEMP3)
+      IDM = 5
+      CALL INITS
+      CALL INITLU(IDM)
+      CALL INTHLP('sap', TOPOPT)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(10))
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' SAP (Sequence assembly program) V5.0 Aug 1991',/,
+     +' Author: Rodger Staden')
+      IGORT = 0
+      IDEV = KBOUT
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +TOPOPT,DEVNOS(5),FILMAR)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(5),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+      IOPEN=1
+      LINLEN=50
+      PERCD=0.75
+      WINDOW = 25
+      MXGOOD = MAXGLM
+      IGWIND = 1000
+      FILE = ' '
+      MAXGEL = MAXGLM
+      SHONAM = ' '
+      CALL SHOWFI(KBOUT,SHONAM)
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+C        CALL SHOWFU(KBOUT,'Start new database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          LLINO = 0
+          IF(IERR.EQ.0) THEN
+            IOPEN=0
+            SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+            CALL SHOWFI(KBOUT,SHONAM)
+          END IF
+          CALL INITLU(IDM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C        CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      MENUN = 0
+10    CONTINUE
+C
+C Flush the database buffers, just in case!
+C
+      CALL FLUSHL(DEVNOS(1))
+      CALL FLUSHL(DEVNOS(2))
+      CALL FLUSHL(DEVNOS(3))
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      CALL SHOWFU(KBOUT,' ')
+C      IF(IOK.NE.0) GO TO 9999
+C      CALL DBMENT(MENU,NOPT,TOPOPT,HELPS(0),HELPE(0),HELPF,DEVNOS(5),
+C     +KBIN,KBOUT)
+      CALL MENU(NOPT, KOPT, MENUN, TOPOPT, MINMEN, KBIN, KBOUT,
+     +     HELPS(0), HELPE(0), HELPF, DEVNOS(5))
+      IF(NOPT.EQ.3)THEN
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+C          CALL SHOWFU(KBOUT,'Start a database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          IF(IERR.EQ.0)IOPEN=0
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+C          CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      END IF
+C  STOP
+      IF(NOPT.EQ.2)THEN
+        IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBOUT)
+        GO TO 9999
+      END IF
+C   clear all
+C
+      IF(NOPT.EQ.10)THEN
+C        CALL SHOWFU(KBOUT,'Clear graphics')
+        CALL CLEARG
+        GO TO 10
+      END IF
+C   clear VT100
+C
+      IF(NOPT.EQ.11)THEN
+C        CALL SHOWFU(KBOUT,'Clear text')
+        CALL CLEARV
+        GO TO 10
+      END IF
+C
+C   xhairs
+C
+      IF(NOPT.EQ.13) THEN
+C          CALL SHOWFU(KBOUT,'Use xhair')
+          CALL XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,NCONTS,LLINO,LINCON,LREG,RREG,
+     +    WINDOW,IGWIND,LENCON,IDEPTH,JDEPTH,
+     +    MARGL,MARGR,MARGB,MARGT,TOPOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +    KBOUT,GEL,GEL2,DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM,
+     +    SEQ1,MAXSEQ,NGELS,TEMP3,ECHRSZ,MAXGL2,GEL3,GEL4,33,34,29,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MXGOOD)
+        GO TO 10
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(NOPT.EQ.14)THEN
+C        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),TOPOPT,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  ruler
+C
+      IF(NOPT.EQ.12) THEN
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+C          CALL SHOWFU(KBOUT,'Show ruler')
+          CALL RULER(LREG,RREG,MARGL,MARGR,
+     +    MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          GO TO 10
+        END IF
+      END IF
+C  labler
+      IF(NOPT.EQ.15)THEN
+C        CALL SHOWFU(KBOUT,'Label diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.16) THEN
+C        CALL SHOWFU(KBOUT,'Plot map')
+C  map
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL PLTMAP(DEVNOS(4),FILNAM,LENCON,MARGL,MARGR,MARGB(NOPT),
+     +    MARGT(NOPT),ISXMAX,ISYMAX,LREG,RREG,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.21).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Enter new gel reading')
+C       ENTER
+        CALL ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  DEVNOS(3),DEVNOS(4),LINLEN,PERCD,FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +  MAXGEL,IDM,LLINO)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.25).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Show relationships')
+C       PRINT DB
+        CALL DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IDEV,
+     +  KBIN,KBOUT,DEVNOS(3),LLINO,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.5).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Display contig')
+C       DISPLAY
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+          IF(IERR.EQ.0)THEN
+          CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +    GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +    DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+200   CONTINUE
+      IF((NOPT.EQ.22).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Join contigs')
+C       JOIN
+        WRITE(KBOUT,1006)
+1006    FORMAT(' Which contigs do you want to join ?')
+        WRITE(KBOUT,1017)
+1017    FORMAT( ' Left contig')
+        LLINOL = 0
+        LLINOR = 0
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONL,LLINOL,
+     +  IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          WRITE(KBOUT,1018)
+1018      FORMAT( ' Right contig')
+          CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LNCONR,LLINOR,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(IERR.EQ.0)THEN
+C           CHECK TO SEE IF MAKING A CIRCLE!
+            IF(LLINOL.EQ.LLINOR)THEN
+              WRITE(KBOUT,*)' Making circles is not allowed!'
+              GO TO 200
+            END IF
+            CALL JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +      LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+     +      IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +      DEVNOS(3),LINLEN,PERCD,
+     +      HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +      MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.23).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Complement contig')
+C       COMPLEMENT
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,LLINO,GEL,IDBSIZ,KBOUT,DEVNOS(1),DEVNOS(2),
+     +    MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  EDIT
+      IF((NOPT.EQ.4).AND.(IOPEN.EQ.0))THEN
+C       EDIT DB
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    GEL,LINCON,0,0,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  PLOT SINGLE CONTIG
+      IF((NOPT.EQ.33).AND.(IOPEN.EQ.0))THEN
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0)THEN
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.26).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Alter relationships')
+C       FIX
+        CALL DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),
+     +  HELPS(4),HELPE(4),HELPF,DEVNOS(5),MAXGEL)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.24).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Copy database')
+C       COPY
+        CALL DBCOPY(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +  NAMPRO,DEVNOS(4),IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),
+     +  DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,MAXDB,IDM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.19).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Check database for consistency')
+C       CHECK
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP1,IERR,KBOUT)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.29).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Examine quality')
+C       SCAN
+        IDIM1=MAXSEQ
+        CALL DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),IDEV,LINLEN,PERCD,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,GEL3,GEL4,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEVNOS(3),
+     +  LLINO,LINCON,LREG,RREG,MXGOOD)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.8).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Make consensus')
+C       CONSENSUS
+        IDIM1=MAXSEQ
+        CALL CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),DEVNOS(4),FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,IDM,PERCD,
+     +  DEVNOS(3),LLINO)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+C        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(10),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),KOPT)
+        GO TO 10
+      END IF
+C  HELP
+      IF(NOPT.EQ.1)THEN
+C        CALL SHOWFU(KBOUT,'Help')
+C       HELP
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,
+     +  HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.27)THEN
+C        CALL SHOWFU(KBOUT,'Set parameters')
+C  SET DISPLAY PARAMETERS
+        MN = 10
+        MX = 100
+        CALL GETINT(MN,MX,LINLEN,'Display line length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) LINLEN = IVAL
+        IPCMIN = 51
+        IPCMAX = 100
+        IPCD = INT(PERCD*100.)
+        CALL GETINT(IPCMIN,IPCMAX,IPCD,
+     +  'Percentage score for consensus',
+     +  IPVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) THEN
+          PERCD = REAL(IPVAL)/100.
+        END IF
+        MN = 1
+        MX = MAXGLM
+        CALL GETINT(MN,MX,MXGOOD,
+     +  'Maximum read length for quality analysis',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) MXGOOD = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,WINDOW,'Text window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) WINDOW = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,IGWIND,'Graphics window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) IGWIND = IVAL
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+C        CALL SHOWFU(KBOUT,'Display text file')
+       CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     + HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.28) THEN
+C        CALL SHOWFU(KBOUT,'Highlight disagreements')
+        CALL HIGHLT(SEQ1,SEQ1(6001),SEQ1(10001),KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(7),
+     +  DEVNOS(8),FILE,IOK)
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.34).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'Plot contigs')
+C  PLOT ALL CONTIGS
+        CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,0,0,
+     +  NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+C  AUTO EDIT
+      IF((NOPT.EQ.30).AND.(IOPEN.EQ.0))THEN
+C        CALL SHOWFU(KBOUT,'AutoEdit a contig')
+        WRITE(KBOUT,*)'Make a copy first!'
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0) THEN
+        IDIM1=MAXSEQ
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,IDIM1,
+     +  GEL,LREG,RREG,LLINO,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  DEVNOS(2),MAXGEL,IDM,PERCD)
+        CALL AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,LINCON,
+     +  GEL,MAXGEL,SEQ1,IDIM1,DEVNOS(2),DEVNOS(1),LREG,RREG,
+     +  KBOUT)
+      END IF
+      END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.9).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Screen Edit a contig')
+        CALL CEDT(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),LINLEN,FILNAM,LLINO,IOK)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17) THEN
+C        CALL SHOWFU(KBOUT,'Screen for restriction sites')
+        CALL SCRENR(GEL2,MAXGEL,GEL3,NAMARC,FILE,
+     +  DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.18) THEN
+C        CALL SHOWFU(KBOUT,'Screen against vector')
+        CALL SCRENV(MAXGEL,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +  SEQ1,MAXSEQ,GEL2,GEL3,GEL4,LENGTH,
+     +  SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +  KBIN,KBOUT,DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,FILE,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.20).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Auto assemble sequences')
+      CALL DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.35).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+C        CALL SHOWFU(KBOUT,'Find internal joins')
+      CALL AUTOJ(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.32).AND.(IOPEN.EQ.0)) THEN
+C        CALL SHOWFU(KBOUT,'Extract gel readings')
+        CALL GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +  NCONTS,GEL,MAXGEL,DEVNOS(2),DEVNOS(3),DEVNOS(4),DEVNOS(7),
+     +  KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),FILE)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.31) THEN
+C        CALL SHOWFU(KBOUT,'Type in gel readings')
+        CALL BATIN(SEQ1,MAXSEQ,KBIN,KBOUT,
+     +  DEVNOS(4),DEVNOS(7),IDEV,
+     +  FILNAM,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.GT.3).AND.(IOPEN.NE.0))THEN
+        WRITE(KBOUT,1012)
+1012    FORMAT(' You have not opened a database!')
+        GO TO 10
+      END IF
+      GO TO 10
+9999  CONTINUE
+      CALL SHUTD
+      END
diff --git a/src/staden/sapf.f b/src/staden/sapf.f
new file mode 100644
index 0000000..1f2885d
--- /dev/null
+++ b/src/staden/sapf.f
@@ -0,0 +1,775 @@
+C     SAPF (Sequence assembly program)
+C     AUTHOR RODGER STADEN
+C     9-1-90  sent idev instead of kbout to join and enter
+C     16-3-90 removed escape after bpause
+C     2-5-90 Many changes related to introducing graphics
+C     4-5-90 Allowed escape from getreg
+C     9-5-90 Introduced a default gel reading
+C    23-8-90 Added calls to SHOWFU
+C    28-11-90 See dbsys89
+C    22-1-91 Changed use of showfi to include version number
+      SUBROUTINE FMAIN()
+      INTEGER CHRSIZ,ECHRSZ
+      PARAMETER (
+     +           NAMLEN = 40,
+     +           MAXDEV = 10,
+     +           MAXSEQ=50000,
+     +           MAXGLM=4096,
+     +           MAXGL2=MAXGLM*2,
+     +           CHRSIZ=4,
+     +           ECHRSZ = 6,
+     +           IDT1=ECHRSZ*MAXGL2,
+     +           IDT2=2*CHRSIZ*MAXGL2,
+     +           IDT3=2*MAXGL2,
+     +           IDT4=2*ECHRSZ*MAXGL2,
+     +           MAXDB=1000)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      INTEGER RELPG(MAXDB),LREG,RREG,ANS
+      INTEGER TEMP1(IDT1),TEMP2(IDT2),TEMP3(IDT4)
+      INTEGER LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILNAM,NAMPRO,FILE,NAMARC,FILMAR
+      CHARACTER*(NAMLEN) SHONAM
+      CHARACTER GEL(MAXGLM)
+      CHARACTER GEL2(MAXGLM),GEL3(MAXGLM),GEL4(MAXGLM)
+      PARAMETER (BOTOPT=0,TOPOPT=34)
+      PARAMETER (HELPF='SAPHELP')
+      PARAMETER (POINTF='SAPHPNT')
+      PARAMETER (FILMAR='SAPMARG')
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARGB(TOPOPT),MARGT(TOPOPT)
+      CHARACTER SEQ1(MAXSEQ)
+C     DBAUTO TEMP3A = TEMP1, SEQ3A = GEL3, SEQ2A = GEL2
+C     MATCHA = GEL, SEQ4A = GEL4
+      PARAMETER (LENGTH = 6,
+     +           MAXCON = 500,
+     +           MAXSAV = 1000,
+     +           LPOWRC = CHRSIZ**LENGTH)
+      INTEGER CONST(LENGTH)
+      INTEGER POSNS(MAXSEQ),WORDP(LPOWRC),WORDN(LPOWRC),GELN(MAXGLM)
+      INTEGER CENDS(MAXCON),NENDS(MAXCON)
+      CHARACTER SEQC2(MAXGLM,2),SEQG2(MAXGLM,2)
+      CHARACTER SEQ5(MAXGLM),SEQG3(MAXGLM),SEQC3(MAXGLM)
+      INTEGER SAV1(MAXSAV),SAV2(MAXSAV),SAV3(MAXSAV)
+      INTEGER WINDOW
+C     DBAUTO
+      EQUIVALENCE (TEMP2,TEMP1)
+      EQUIVALENCE (TEMP1,TEMP3)
+      IDM = 5
+      CALL INITS
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      CALL OPENGR(DEVNOS(10))
+      CALL CLEARV
+      WRITE(KBOUT,1000)
+1000  FORMAT(/,
+     +' SAPF (Sequence assembly program (flat files) V3.1 Jan 1991',/,
+     +' Author: Rodger Staden')
+      IGORT = 0
+      IDEV = KBOUT
+C   GET SCREEN AND MARGIN SIZES
+      CALL GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +TOPOPT,DEVNOS(5),FILMAR)
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(5),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,IDEVH)
+      IOPEN=1
+      LINLEN=50
+      PERCD=0.75
+      WINDOW = 25
+      IGWIND = 1000
+      FILE = ' '
+      MAXGEL = MAXGLM
+      SHONAM = ' '
+      CALL SHOWFI(KBOUT,SHONAM)
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+        CALL SHOWFU(KBOUT,'Start new database from flat file')
+          CALL DBSTAF(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+        CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      MENU = 0
+10    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      CALL SHOWFU(KBOUT,' ')
+C      IF(IOK.NE.0) GO TO 9999
+      CALL DBMENT(MENU,NOPT,TOPOPT,HELPS(0),HELPE(0),HELPF,DEVNOS(5),
+     +KBIN,KBOUT)
+      IF(NOPT.EQ.3)THEN
+        ANS = 0
+        CALL YESONO(ANS,'Open existing database','Start new database',
+     +  HELPS(3),HELPE(3),HELPF,DEVNOS(5),KBIN,KBOUT)
+        IF(ANS.LT.0)GO TO 10
+        IF(ANS.GT.1)GO TO 10
+        MAXGEL = MAXGLM
+        IF(IOPEN.EQ.0)THEN
+          CLOSE(UNIT=DEVNOS(1))
+          CLOSE(UNIT=DEVNOS(2))
+          CLOSE(UNIT=DEVNOS(3))
+          IOPEN=1
+        END IF
+        IDBSIZ=MAXDB
+        IDBSIS=IDBSIZ
+        IF(ANS.EQ.1)THEN
+          CALL SHOWFU(KBOUT,'Start a database')
+          CALL DBSTAR(NAMPRO,GEL,IDBSIS,IDBSIZ,KBIN,KBOUT,DEVNOS(1),
+     +    DEVNOS(2),DEVNOS(3),IERR,HELPS(NOPT),HELPE(NOPT),
+     +    DEVNOS(5),HELPF,MAXGEL,MAXGLM,IDM)
+          NGELS=0
+          NCONTS=0
+          IF(IERR.EQ.0)IOPEN=0
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+        IF(ANS.EQ.0)THEN
+          CALL SHOWFU(KBOUT,'Open database')
+          NGELS=0
+          NCONTS=0
+          CALL DBOPEN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +    IDBSIS,IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +    MAXGEL,MAXGLM,LLINO,IDM,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(JERR.EQ.0)IOPEN=0
+          IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,
+     +    NCONTS,IDBSIZ,KBOUT)
+          CALL INITLU(IDM)
+          SHONAM = NAMPRO(1:MAX(1,INDEX(NAMPRO,'.'))+1)
+          CALL SHOWFI(KBOUT,SHONAM)
+          GO TO 10
+        END IF
+      END IF
+C  STOP
+      IF(NOPT.EQ.2)THEN
+        IF(NGELS.GT.0)CALL DBSTAT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  IDBSIZ,KBOUT)
+        GO TO 9999
+      END IF
+C   clear all
+C
+      IF(NOPT.EQ.10)THEN
+        CALL SHOWFU(KBOUT,'Clear graphics')
+        CALL CLEARG
+        GO TO 10
+      END IF
+C   clear VT100
+C
+      IF(NOPT.EQ.11)THEN
+        CALL SHOWFU(KBOUT,'Clear text')
+        CALL CLEARV
+        GO TO 10
+      END IF
+C
+C   xhairs
+C
+      IF(NOPT.EQ.13) THEN
+          CALL SHOWFU(KBOUT,'Use xhair')
+          CALL XHSAP(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,NCONTS,LLINO,LINCON,LREG,RREG,
+     +    WINDOW,IGWIND,LENCON,IDEPTH,JDEPTH,
+     +    MARGL,MARGR,MARGB,MARGT,TOPOPT,ISXMAX,ISYMAX,KBIN,IDEV,
+     +    KBOUT,GEL,GEL2,DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM,
+     +    SEQ1,MAXSEQ,NGELS,TEMP3,ECHRSZ,MAXGL2,GEL3,GEL4,33,34,29,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+C
+C   CHANGE MARGINS
+C
+      IF(NOPT.EQ.14)THEN
+        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(NOPT),HELPE(NOPT),TOPOPT,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+C  ruler
+C
+      IF(NOPT.EQ.12) THEN
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL SHOWFU(KBOUT,'Show ruler')
+          CALL RULER(LREG,RREG,MARGL,MARGR,
+     +    MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          GO TO 10
+        END IF
+      END IF
+C  labler
+      IF(NOPT.EQ.15)THEN
+        CALL SHOWFU(KBOUT,'Label diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.16) THEN
+        CALL SHOWFU(KBOUT,'Plot map')
+C  map
+        LENCON = RREG - LREG + 1
+        IF(LENCON.GT.0) THEN
+          CALL PLTMAP(DEVNOS(4),FILNAM,LENCON,MARGL,MARGR,MARGB(NOPT),
+     +    MARGT(NOPT),ISXMAX,ISYMAX,LREG,RREG,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.21).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Enter new gel reading')
+C       ENTER
+        CALL ENTER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  DEVNOS(3),DEVNOS(4),LINLEN,PERCD,FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +  MAXGEL,IDM,LLINO)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.25).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Show relationships')
+C       PRINT DB
+        CALL DBPRNT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,IDEV,
+     +  KBIN,KBOUT,DEVNOS(3),LLINO,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.5).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Display contig')
+C       DISPLAY
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+          IF(IERR.EQ.0)THEN
+          CALL DSPLAY(RELPG,LNGTHG,LNBR,RNBR,
+     +    GEL,LLINO,LINCON,LREG,RREG,GEL2,I1,I2,0,I,IDBSIZ,IDEV,KBOUT,
+     +    DEVNOS(2),DEVNOS(3),LINLEN,PERCD,MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+200   CONTINUE
+      IF((NOPT.EQ.22).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Join contigs')
+C       JOIN
+        WRITE(KBOUT,1006)
+1006    FORMAT(' Which contigs do you want to join ?')
+        WRITE(KBOUT,1017)
+1017    FORMAT( ' Left contig')
+        LLINOL = 0
+        LLINOR = 0
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONL,LLINOL,
+     +  IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          WRITE(KBOUT,1018)
+1018      FORMAT( ' Right contig')
+          CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LNCONR,LLINOR,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+          IF(IERR.EQ.0)THEN
+C           CHECK TO SEE IF MAKING A CIRCLE!
+            IF(LLINOL.EQ.LLINOR)THEN
+              WRITE(KBOUT,*)' Making circles is not allowed!'
+              GO TO 200
+            END IF
+            CALL JOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +      LNCONL,LLINOL,LNCONR,LLINOR,GEL,GEL2,
+     +      IDBSIZ,IDEV,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +      DEVNOS(3),LINLEN,PERCD,
+     +      HELPS(NOPT),HELPE(NOPT),HELPS(4),HELPE(4),HELPF,DEVNOS(5),
+     +      MAXGEL,IDM)
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.23).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Complement contig')
+C       COMPLEMENT
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  LINCON,LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL CMPLMT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    LINCON,LLINO,GEL,IDBSIZ,KBOUT,DEVNOS(1),DEVNOS(2),
+     +    MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  EDIT
+      IF((NOPT.EQ.4).AND.(IOPEN.EQ.0))THEN
+C       EDIT DB
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL DBEDIT(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    GEL,LINCON,0,0,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL)
+        END IF
+        GO TO 10
+      END IF
+C  PLOT SINGLE CONTIG
+      IF((NOPT.EQ.33).AND.(IOPEN.EQ.0))THEN
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +    HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0)THEN
+          LENCON = RREG - LREG + 1
+          CALL FDEPTH(RELPG,LNGTHG,LNBR,RNBR,
+     +    IDBSIZ,LLINO,LREG,RREG,LENCON,
+     +    MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        END IF
+        END IF
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.26).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Alter relationships')
+C       FIX
+        CALL DBFIX(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +  GEL,GEL2,IDBSIZ,KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),
+     +  HELPS(4),HELPE(4),HELPF,DEVNOS(5),MAXGEL)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.24).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Copy database to flat format')
+C       COPY
+        CALL DBCOPF(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +  NAMPRO,DEVNOS(4),IDBSIZ,JERR,KBIN,KBOUT,DEVNOS(1),
+     +  DEVNOS(2),DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,MAXDB,IDM)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.19).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Check database for consistency')
+C       CHECK
+        CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +  TEMP1,IERR,KBOUT)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.29).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Examine quality')
+C       SCAN
+        IDIM1=MAXSEQ
+        CALL DBSCAN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP3,2,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),IDEV,LINLEN,PERCD,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,GEL3,GEL4,
+     +  MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX,DEVNOS(3),
+     +  LLINO,LINCON,LREG,RREG)
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.8).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Make consensus')
+C       CONSENSUS
+        IDIM1=MAXSEQ
+        CALL CONSEN(RELPG,LNGTHG,LNBR,RNBR,NAMPRO,NGELS,NCONTS,SEQ1,
+     +  IDIM1,GEL,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  KBIN,KBOUT,DEVNOS(2),DEVNOS(4),FILNAM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),MAXGEL,IDM,PERCD,
+     +  DEVNOS(3),LLINO)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.7)THEN
+        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(6),DEVNOS(10),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),KOPT)
+        GO TO 10
+      END IF
+C  HELP
+      IF(NOPT.EQ.1)THEN
+        CALL SHOWFU(KBOUT,'Help')
+C       HELP
+        CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,
+     +  HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.27)THEN
+        CALL SHOWFU(KBOUT,'Set parameters')
+C  SET DISPLAY PARAMETERS
+        MN = 10
+        MX = 100
+        LINLEN = 50
+        CALL GETINT(MN,MX,LINLEN,'Display line length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) LINLEN = IVAL
+        IPCMIN = 51
+        IPCMAX = 100
+        IPCD = 75
+        CALL GETINT(IPCMIN,IPCMAX,IPCD,
+     +  'Percentage score for consensus',
+     +  IPVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) THEN
+          PERCD = REAL(IPVAL)/100.
+        END IF
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,WINDOW,'Text window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) WINDOW = IVAL
+        MN = 1
+        MX = MAXSEQ
+        CALL GETINT(MN,MX,IGWIND,'Graphics window length',
+     +  IVAL,KBIN,KBOUT,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IOK)
+        IF(IOK.EQ.0) IGWIND = IVAL
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.6)THEN
+        CALL SHOWFU(KBOUT,'Display text file')
+       CALL TTEXT(DEVNOS(4),FILNAM,KBIN,KBOUT,
+     + HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+       GO TO 10
+      END IF
+      IF(NOPT.EQ.28) THEN
+        CALL SHOWFU(KBOUT,'Highlight disagreements')
+        CALL HIGHLT(SEQ1,SEQ1(6001),SEQ1(10001),KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(7),
+     +  DEVNOS(8),FILE,IOK)
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.34).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'Plot contigs')
+C  PLOT ALL CONTIGS
+        CALL PLC(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,0,0,
+     +  NCONTS,MARGL,MARGR,MARGB(NOPT),MARGT(NOPT),ISXMAX,ISYMAX)
+        GO TO 10
+      END IF
+C  AUTO EDIT
+      IF((NOPT.EQ.30).AND.(IOPEN.EQ.0))THEN
+        CALL SHOWFU(KBOUT,'AutoEdit a contig')
+        WRITE(KBOUT,*)'Make a copy first!'
+        CALL GETLN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +  LLINO,IERR,IDBSIZ,KBIN,KBOUT,DEVNOS(3),
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        IF(IERR.EQ.0)THEN
+          CALL GETREG(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +    1,RELPG(LINCON),LREG,RREG,LINCON,LLINO,IDBSIZ,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),IERR)
+        IF(IERR.EQ.0) THEN
+        IDIM1=MAXSEQ
+        CALL SUMMER(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,SEQ1,IDIM1,
+     +  GEL,LREG,RREG,LLINO,IDBSIZ,TEMP1,ECHRSZ,MAXGL2,
+     +  DEVNOS(2),MAXGEL,IDM,PERCD)
+        CALL AEDIT(RELPG,LNGTHG,LNBR,RNBR,IDBSIZ,LLINO,LINCON,
+     +  GEL,MAXGEL,SEQ1,IDIM1,DEVNOS(2),DEVNOS(1),LREG,RREG,
+     +  KBOUT)
+      END IF
+      END IF
+      GO TO 10
+      END IF
+      IF((NOPT.EQ.9).AND.(IOPEN.EQ.0)) THEN
+        CALL SHOWFU(KBOUT,'Screen Edit a contig')
+        CALL CEDT(KBIN,KBOUT,
+     +  POSNS(1),POSNS(1001),POSNS(2001),
+     +  MAXDB,POSNS(3001),POSNS(3101),50,
+     +  RELPG,LNGTHG,LNBR,RNBR,
+     +  IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),DEVNOS(4),
+     +  DEVNOS(1),DEVNOS(2),DEVNOS(3),LINLEN,FILNAM,LLINO,IOK)
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.17) THEN
+        CALL SHOWFU(KBOUT,'Screen for restriction sites')
+        CALL SCRENR(GEL2,MAXGEL,GEL3,NAMARC,FILE,
+     +  DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.18) THEN
+        CALL SHOWFU(KBOUT,'Screen against vector')
+        CALL SCRENV(MAXGEL,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +  SEQ1,MAXSEQ,GEL2,GEL3,GEL4,LENGTH,
+     +  SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +  KBIN,KBOUT,DEVNOS(4),DEVNOS(7),DEVNOS(8),DEVNOS(9),IDEV,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,FILE,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        CLOSE(UNIT=DEVNOS(9))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.20).AND.(IOPEN.EQ.0).AND.(IDM.EQ.5)) THEN
+        CALL SHOWFU(KBOUT,'Auto assemble sequences')
+      CALL DBAUTO(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,
+     +NGELS,NCONTS,MAXGEL,
+     +TEMP1,WORDP,WORDN,LPOWRC,POSNS,GELN,
+     +SEQ1,MAXSEQ,GEL2,GEL3,GEL4,SEQ5,SEQC2,SEQG2,GEL,
+     +MAXGLM,MAXGL2,CHRSIZ,ECHRSZ,LENGTH,
+     +SAV1,SAV2,SAV3,MAXSAV,CENDS,NENDS,MAXCON,CONST,
+     +KBIN,KBOUT,DEVNOS(1),DEVNOS(2),DEVNOS(3),
+     +DEVNOS(4),DEVNOS(7),DEVNOS(8),IDEV,
+     +HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),NAMARC,NAMPRO,FILE,
+     +PERCD,IOPEN,IDM,SEQG3,SEQC3,IOK)
+        CLOSE(UNIT=DEVNOS(4))
+        CLOSE(UNIT=DEVNOS(7))
+        CLOSE(UNIT=DEVNOS(8))
+        GO TO 10
+      END IF
+      IF((NOPT.EQ.32).AND.(IOPEN.EQ.0)) THEN
+        CALL SHOWFU(KBOUT,'Extract gel readings')
+        CALL GELOUT(RELPG,LNGTHG,LNBR,RNBR,MAXDB,IDBSIZ,NGELS,
+     +  NCONTS,GEL,MAXGEL,DEVNOS(2),DEVNOS(3),DEVNOS(4),DEVNOS(7),
+     +  KBIN,KBOUT,
+     +  HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5),FILE)
+        CLOSE(UNIT=DEVNOS(4))
+        GO TO 10
+      END IF
+      IF(NOPT.EQ.31) THEN
+        CALL SHOWFU(KBOUT,'Type in gel readings')
+        CALL BATIN(SEQ1,MAXSEQ,KBIN,KBOUT,
+     +  DEVNOS(4),DEVNOS(7),IDEV,
+     +  FILNAM,HELPS(NOPT),HELPE(NOPT),HELPF,DEVNOS(5))
+        GO TO 10
+      END IF
+      IF((NOPT.GT.3).AND.(IOPEN.NE.0))THEN
+        WRITE(KBOUT,1012)
+1012    FORMAT(' You have not opened a database!')
+        GO TO 10
+      END IF
+      GO TO 10
+9999  CONTINUE
+      CALL SHUTD
+      END
+      SUBROUTINE DBCOPF(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,GEL,
+     +NAMPRO,IDEV,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,
+     +IHELPS,IHELPE,FILEH,IDEVH,MAXGEL,MAXDB,IDM)
+      CHARACTER FILEH*(*)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ)
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER NAMPRO*(*),NAMARC*10,GEL(MAXGEL)
+C      CHARACTER NAMFIL*12,MTYPE*4
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+      IERR=1
+1     CONTINUE
+      L = 1
+      CALL BUSY(KBOUT)
+      NEWSIZ = IVAL
+      LLL = INDEX(NAMPRO,'.') + 1
+      NAMPRO(LLL:)='FLAT'
+       CALL OPENRS(IDEV,NAMPRO,IOK,LRECL,1)
+      IF(IOK.NE.0)GO TO 100
+      IRAW = 0
+      IF(IDEVRD.LT.0) IRAW = 1
+      WRITE(IDEV,1000,ERR=999)IDBSIZ,MAXGEL,IDM,NGELS,NCONTS,IRAW
+ 1000 FORMAT(6I6)
+      DO 10 I=1,NGELS
+        WRITE(IDEV,1000,ERR=999)RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+10    CONTINUE
+      N=IDBSIZ-NCONTS
+      DO 15 I=N,IDBSIZ-1
+        WRITE(IDEV,1000,ERR=999)RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+15    CONTINUE
+C
+      DO 20 I=1,NGELS
+        CALL READN(IDEVN,I,NAMARC)
+        CALL READW(IDEVW,I,GEL,MAXGEL)
+        WRITE(IDEV,1001,ERR=999)NAMARC
+        WRITE(IDEV,1002,ERR=999)(GEL(K),K=1,ABS(LNGTHG(I)))
+        WRITE(IDEV,1004)
+20    CONTINUE
+ 1001 FORMAT(A)
+ 1002 FORMAT(64A1)
+ 1004 FORMAT('/')
+C      IF(IDEVRD.GT.0) THEN
+C        DO 30 I=1,NGELS
+C        CALL READRD(IDEVRD,I,LENR,LCUT,LENW,MTYPE,NAMFIL)
+C        WRITE(IDEV,1003,ERR=999)LENR,LCUT,LENW,MTYPE,NAMFIL
+C 30     CONTINUE
+C 1003   FORMAT(3I6,' ',A,' ',A)
+C      END IF
+      IERR=0
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9998)
+ 9998 FORMAT(' Error opening file, copy aborted')
+      RETURN
+ 999  CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error writing data, copy aborted')
+      RETURN
+      END
+      SUBROUTINE DBSTAF(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,NAMPRO,GEL,
+     +IDBSIS,IDBSIZ,IERR,KBIN,KBOUT,IDEVR,IDEVW,IDEVN,MAXGEL,
+     +MAXGLM,LLINO,
+     +IDM,IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER GEL(MAXGLM)
+      INTEGER RELPG(IDBSIS)
+      INTEGER LNGTHG(IDBSIS),LNBR(IDBSIS),RNBR(IDBSIS)
+      CHARACTER NAMPRO*(*),MTYPE*4,NAMFIL*12
+      INTEGER IWORD
+      PARAMETER (IWORD=4)
+C  SETTING DEVICE NO HERE !!!!!!!!!!!!!
+      IDEV = 33
+      NAMPRO(1:)=' '
+      IERR=1
+1     CONTINUE
+      L = 0
+      CALL GTSTR('Project name of flat database',
+     +' ',NAMPRO,L,KBOUT,KBIN,INFLAG)
+      IF(L.LT.1) RETURN
+      LL = L
+      CALL CCASE(NAMPRO,1)
+      IF(INFLAG.EQ.2) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      LL = INDEX(NAMPRO,' ')
+      NAMPRO(LL:)='.FLAT'
+      CALL OPENRS(IDEV,NAMPRO,IOK,LRECL,2)
+      IF(IOK.NE.0) RETURN
+      LL = LL - 1
+      NAMPRO(LL+1:LL+5)='.RL0 '
+      CALL OPENRS(IDEVR,NAMPRO,IOK,4,3)
+      IF(IOK.NE.0)GO TO 100
+      READ(IDEV,1000,ERR=999)IDBSIZ,MAXGEL,IDM,NGELS,NCONTS,IRAW
+ 1000 FORMAT(6I6)
+      DO 10 I=1,NGELS
+        READ(IDEV,1000,ERR=999)RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+ 10     CONTINUE
+      MXT = 0
+      DO 20 I=IDBSIZ-NCONTS,IDBSIZ-1
+        READ(IDEV,1000,ERR=999)RELPG(I),LNGTHG(I),LNBR(I),RNBR(I)
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+        IF(RELPG(I).GT.MXT) THEN
+          MXT = RELPG(I)
+          LLINO = LNBR(I)
+        END IF
+ 20     CONTINUE
+      IR = 0
+      IL = 0
+      DO 30 I=NGELS+1,IDBSIZ-NCONTS-1
+        RELPG(I) = 0
+        LNGTHG(I) = 0
+        LNBR(I) = 0
+        RNBR(I) = 0
+        CALL WRITER(IDEVR,I,RELPG(I),LNGTHG(I),LNBR(I),RNBR(I))
+ 30     CONTINUE
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,IR,IL)
+5     CONTINUE
+      MN = 512
+      MX = MAXGLM
+      CALL GETINT(MN,MX,MAXGEL,
+     +'Maximum gel reading length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MAXGEL = IVAL
+      IF(MOD(IVAL,MN).NE.0) THEN
+        MAXGEL = 512 + (IVAL/512)*512
+        WRITE(KBOUT,*)'Maximum set to',MAXGEL
+      END IF
+      IREC=MAXGEL/IWORD
+      IF(MOD(MAXGEL,IWORD).NE.0)IREC=IREC+1
+      NAMPRO(LL+2:LL+3) = 'SQ'
+      CALL OPENRS(IDEVW,NAMPRO,IOK,IREC,3)
+      IF(IOK.NE.0)GO TO 100
+      CALL FILLC(GEL,MAXGEL,' ')
+      CALL WRITEW(IDEVW,IDBSIZ,GEL,MAXGEL)
+      NAMPRO(LL+2:LL+3)='AR'
+      CALL OPENRS(IDEVN,NAMPRO,IOK,3,3)
+      IF(IOK.NE.0)GO TO 100
+C  NOTE HERE IS A WRITE TO NAMES FILE NOT USING A SUBROUTINE CALL
+      WRITE(IDEVN,REC=IDBSIS)IDBSIZ,MAXGEL,IDM
+      DO 40 I = 1,NGELS
+        READ(IDEV,1001,ERR=999)NAMFIL
+        CALL WRITEN(IDEVN,I,NAMFIL)
+        K1 = -63
+        CALL FILLC(GEL,MAXGEL,' ')
+ 35     CONTINUE
+        K1 = K1 + 64
+        K1 = MIN(K1,MAXGEL)
+        K2 = K1 + 63
+        K2 = MIN(K2,MAXGEL)
+        READ(IDEV,1002,ERR=999)(GEL(K),K=K1,K2)
+        IF(GEL(K1).NE.'/') GO TO 35
+        CALL WRITEW(IDEVW,I,GEL,MAXGEL)
+ 40     CONTINUE
+ 1001   FORMAT(A)
+ 1002   FORMAT(64A1)
+      IRAW = 1
+      IF(IRAW.EQ.0) THEN
+        NAMPRO(LL+2:LL+3)='RD'
+        CALL OPENRS(IDEVRD,NAMPRO,IOK,7,3)
+      ELSE
+        IDEVRD = -1
+      END IF
+      IF(IDEVRD.GT.0) THEN
+        DO 50 I=1,NGELS
+        READ(IDEV,1003,ERR=999)II,J,K,MTYPE,NAMFIL
+C        CALL WRITRD(IDEVRD,I,II,J,K,MTYPE,NAMFIL)
+ 50     CONTINUE
+ 1003   FORMAT(3I6,1X,A,1X,A)
+      END IF
+      WRITE(KBOUT,1004)NAMPRO(1:LL),IDBSIZ
+ 1004 FORMAT(' Database ',A,' version 0, size',I5,
+     +' successfully started')
+      IERR=0
+      NAMPRO(LL+2:LL+2) = '0'
+      CLOSE(UNIT=IDEV)
+      RETURN
+ 999  CONTINUE
+      WRITE(KBOUT,*)'Error reading flat file'
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' Error when trying to open database,',
+     +' database not started')
+      CLOSE(UNIT=IDEV)
+      RETURN
+      END
diff --git a/src/staden/sapmen.f b/src/staden/sapmen.f
new file mode 100644
index 0000000..a05c586
--- /dev/null
+++ b/src/staden/sapmen.f
@@ -0,0 +1,188 @@
+      SUBROUTINE DBMENT(MENU,NOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+1     CONTINUE
+      IF(MENU.EQ.0) THEN
+        WRITE(KBOUT,5000)
+5000    FORMAT(' Menus and their numbers are',/,
+     +  ' m0 = This menu',/,
+     +  ' m1 = General',/,
+     +  ' m2 = Screen control',/,
+     +  ' m3 = Modification',/,
+     +  '  ? = Help',/,
+     +  '  ! = Quit')
+      ELSE IF(MENU.EQ.-1) THEN
+      WRITE(KBOUT,1001)
+1001  FORMAT(' General menu',/,
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Open a database',/,
+     +'  5 = Display a contig',/,
+     +'  6 = List a text file',/,
+     +'  7 = Redirect output',/,
+     +'  8 = Calculate a consensus',/,
+     +' 17 = Screen against restriction enzymes',/,
+     +' 18 = Screen against vector',/,
+     +' 19 = Check database',/,
+     +' 24 = Copy database',/,
+     +' 25 = Show relationships',/,
+     +' 27 = Set parameters',/,
+     +' 28 = Highlight disagreements',/,
+     +' 29 = Examine quality',/,
+     +' 35 = Find internal joins')
+      ELSE IF(MENU.EQ.-2) THEN
+      WRITE(KBOUT,1002)
+1002  FORMAT(' Screen control menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +' 10 = Clear graphics',/,
+     +' 11 = Clear text',/,
+     +' 12 = Draw ruler',/,
+     +' 13 = Use cross hair',/,
+     +' 14 = Change margins',/,
+     +' 15 = Label diagram',/,
+     +' 16 = Plot map',/,
+     +' 33 = Plot single contig',/,
+     +' 34 = Plot all contigs')
+      ELSE IF(MENU.EQ.-3) THEN
+      WRITE(KBOUT,1003)
+1003  FORMAT(' Modification menu',/, 
+     +'  0 = List of menus',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  4 = Edit contig',/,
+     +'  9 = Screen edit',/,
+     +' 20 = Auto assemble',/,
+     +' 21 = Enter new gel reading',/,
+     +' 22 = Join contigs',/,
+     +' 23 = Complement a contig',/,
+     +' 26 = Alter relationships',/,
+     +' 30 = Auto edit a contig',/,
+     +' 31 = Type in gel readings',/,
+     +' 32 = Extract gel readings')
+      END IF
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      IF((NOPT.LT.-3).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS(NOPT),IHELPE(NOPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(NOPT.EQ.0) THEN
+        MENU = 0
+        GO TO 1
+      END IF
+      IF(NOPT.LT.0) THEN
+        MENU = NOPT
+        GO TO 1
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      RETURN
+      END
+      SUBROUTINE DBMENU(MENU,NOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+1     CONTINUE
+      IF(MENU.EQ.2)THEN
+20    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1002)
+1002  FORMAT( ' Enter menu options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete entry',/,
+     +'  4 = Edit contig',/,
+     +'  5 = Display',/,
+     +'  6 = Edit new gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 6
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 20
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 20
+      END IF
+      ELSE IF(MENU.EQ.3)THEN
+30    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1003)
+1003  FORMAT( ' Join menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Give up',/,
+     +'  3 = Complete join',/,
+     +'  4 = Edit left contig',/,
+     +'  5 = Display join',/,
+     +'  6 = Edit right contig',/,
+     +'  7 = Move join')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 7
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 30
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 30
+      END IF
+      ELSE IF(MENU.EQ.4)THEN
+40    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1004)
+1004  FORMAT(' Alter relationships menu options are:'/
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Line change',/,
+     +'  4 = Edit gel reading',/,
+     +'  5 = Delete contig line',/,
+     +'  6 = Shift',/,
+     +'  7 = Move gel reading',/,
+     +'  8 = Rename gel reading',/,
+     +'  9 = Break a contig',/,
+     +' 10 = Remove a gel reading')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 10
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 40
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 40
+      END IF
+      ELSE IF(MENU.EQ.5)THEN
+50    CONTINUE
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+      WRITE(KBOUT,1005)
+1005  FORMAT(
+     +' Edit options are:',/,
+     +'  ? = Help',/,
+     +'  ! = Quit',/,
+     +'  3 = Insert',/,
+     +'  4 = Delete',/,
+     +'  5 = Change')
+      WRITE(KBOUT,1010)
+      CALL GETOPT(KBIN,KOPT,NOPT)
+      MAXOPT = 5
+      IF((NOPT.LT.0).OR.(NOPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        GO TO 50
+      END IF
+      IF((KOPT.EQ.2).AND.(NOPT.GE.0))THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 50
+      END IF
+      END IF
+1010  FORMAT(
+     +' ? Option number=',$)
+      IF(NOPT.LE.0) GO TO 1
+      RETURN
+      END
diff --git a/src/staden/searchUtils.c b/src/staden/searchUtils.c
new file mode 100644
index 0000000..216518d
--- /dev/null
+++ b/src/staden/searchUtils.c
@@ -0,0 +1,1254 @@
+/*
+** Search functions for the contig editor
+**
+** This file is split into two parts, probably implying that it should be
+** divided into two files. The first half consists of all the search
+** engines, the second half the user interface code.
+**
+** Observations:
+**   Only one search window can be up at one time.
+**
+** Changes:
+**
+**  7-Nov-1991 SD Select the tag when searching on tag.
+** 20-Feb-1992 SD Renamed "OK" button to "Quit"
+** 29-Apr-1992 SD Changes relevant to general speedup in edUtils.c
+** 11-May-1992 SD Search by name now dependant on cursor position
+** 15-May-1992 SD now NOSTRSTR and NOSTRDUP
+**
+*/
+
+
+#define REDISPLAY(X) setDisplayPos((X),positionInContig((X),(X)->cursorSeq,(X)->cursorPos))
+
+/*
+** The first half
+*/
+#include <stdio.h>
+#include <stdlib.h>
+#include "edUtils.h"
+#include "tagUtils.h"
+#include "searchUtils.h"
+#include "select.h"
+#include "string.h"
+#include "ctype.h"
+#include "misc.h"
+
+/*
+** Prototypes
+*/
+extern char *re_comp(char *s);
+extern int re_exec(char *s);
+
+#define in_interval(A,B,C) ( ((A)<(B))?((A)<=(C) && (C)<=(B)):((B)<=(C) && (C)<=(A)) )
+
+/*
+** We don't want to have all the textwidget translations
+** Here are the list of the ones we want.
+** This is taken from Xaw/TextTr.c source
+*/
+static char *defaultTranslations = "\
+<Key>Right:	forward-character() \n\
+<Key>Left:	backward-character() \n\
+<Key>Delete:	delete-previous-character() \n\
+<Key>BackSpace:	delete-previous-character() \n\
+<Key>:		insert-char() \n\
+<FocusIn>:	focus-in() \n\
+<FocusOut>:	focus-out() \n\
+<Btn1Down>:	select-start() \n\
+<Btn1Motion>:	extend-adjust() \n\
+<Btn1Up>:	extend-end(PRIMARY, CUT_BUFFER0) \n\
+<Btn2Down>:	insert-selection(PRIMARY, CUT_BUFFER0) \n\
+<Btn3Down>:	extend-start() \n\
+<Btn3Motion>:	extend-adjust() \n\
+<Btn3Up>:	extend-end(PRIMARY, CUT_BUFFER0) \
+";
+
+static void my_translations(Widget w)
+/*
+** Use my default translations
+*/
+{
+    XtTranslations parsedTranslations;
+
+    parsedTranslations = XtParseTranslationTable(defaultTranslations);
+
+    XtUninstallTranslations(w);
+    XtAugmentTranslations(w,parsedTranslations);
+}
+
+static int findGelByNumber (EdStruct *xx, char *s)
+/*
+** Position cursor on left end of gel sequence matched by string s.
+** If s starts with a slash '/' assume a gel name is specified.
+** Otherwise assume a gel number is specified.
+*/
+{
+    int i;
+
+    int gel;
+    gel = atoi(s);
+    for (i=1; i <= xx->DB_gelCount; i++) {
+	if (DBgetNumber(xx,i) == gel) {
+	    xx->cursorSeq = i;
+	    xx->cursorPos = 1;
+	    REDISPLAY(xx);
+	    return 1;
+	}
+    }
+
+    return 0;
+}
+
+static int findNextByName(EdStruct *xx, char *s)
+/*
+** Search forwards from the cursor position until the sequence specified
+** is found. The cursor is positioned at the left end of the sequence,
+** if found.
+**
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int i;
+    int n;
+
+    if (!*s) return 0;
+    n = strlen(s);
+
+    if ((i = posToIndex(xx,spos))==0) return 0;
+
+    for (; i <= xx->DB_gelCount; i++) {
+	/* search through tag list for sequence */
+
+	if (strncmp(DBgetGelName(xx,xx->DBorder[i]),s,n)==0) {
+	    xx->cursorSeq = xx->DBorder[i];
+	    xx->cursorPos = 1;
+	    REDISPLAY(xx);
+	    return 1;
+	}
+    }
+
+
+    return 0;
+}
+
+
+static int findPrevByName(EdStruct *xx, char *s)
+/*
+** Search forwards from the cursor position until the sequence specified
+** is found. The cursor is positioned at the left end of the sequence,
+** if found.
+**
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)-1;
+    int i;
+    int n;
+
+    if (!*s) return 0;
+    n = strlen(s);
+
+    if ((i = posToIndex(xx,spos))==0) return 0;
+
+    for (i--; i >= 0 ; i--) {
+	/* search through tag list for sequence */
+
+	if (strncmp(DBgetGelName(xx,xx->DBorder[i]),s,n)==0) {
+	    xx->cursorSeq = xx->DBorder[i];
+	    xx->cursorPos = 1;
+	    REDISPLAY(xx);
+	    return 1;
+	}
+    }
+
+
+    return 0;
+}
+
+static int findNextGelByName (EdStruct *xx, char *s)
+/*
+** Position cursor on left end of gel sequence matched by string s.
+** If s starts with a slash '/' assume a gel name is specified.
+** Otherwise assume a gel number is specified.
+*/
+{
+    if (*s) {
+	if (*s == '/') {
+	    s++;
+	    return findNextByName(xx,s);
+	} else
+	    return findGelByNumber(xx,s);
+    }
+    return 0;
+}
+
+
+static int findPrevGelByName (EdStruct *xx, char *s)
+/*
+** Position cursor on left end of gel sequence matched by string s.
+** If s starts with a slash '/' assume a gel name is specified.
+** Otherwise assume a gel number is specified.
+*/
+{
+    if (*s) {
+	if (*s == '/') {
+	    s++;
+	    return findPrevByName(xx,s);
+	} else
+	    return findGelByNumber(xx,s);
+    }
+    return 0;
+}
+
+
+
+
+
+static int findNextTagByType (EdStruct *xx, char *type)
+/*
+** Search forwards from the cursor position until a tag of a specified
+** type is encountered. The cursor is positioned at the left end of the
+** tag, if found.
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int epos = DBgetLength(xx,0);
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    tagStruct *found_tag;
+
+    seqList = sequencesInRegion(xx,spos, epos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos + 1;
+
+    for (i=0; seqList[i] && DBgetRelPos(xx,seqList[i]) < fpos ; i++) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	tagStruct *t;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	while (t != NULL) {
+	    int normpos;
+	    int tagpos;
+	    normpos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	    tagpos=positionInContig(xx,seq,normpos);
+	    if (!(t->flags & TAG_DELETED) &&
+		in_interval(spos,fpos,tagpos) &&
+		strncmp(t->tagrec.type.c,type,4)==0) {
+		fseq = seq;
+		fseqpos = normpos;
+		fpos = tagpos;
+		found_tag = t;
+		if (DBgetComp(xx,seq) == COMPLEMENTED) 
+		    /* Keep looking */
+		    t = t->next;
+		else
+		    /* Stop now */
+		    t = NULL;
+	    } else
+		t = t->next;
+	}
+  
+    }
+
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+	_select_tag(xx,fseq,found_tag);
+        REDISPLAY(xx);
+    }
+
+    return fseq;
+}
+
+
+
+
+static int findPrevTagByType (EdStruct *xx, char *type)
+/*
+** Search backwards from the current cursor position until a tag of the
+** specified type is encountered. The cursor is positioned at the left
+** end of the tag, if found.
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)-1;
+    int epos = 1;
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    tagStruct *found_tag;
+
+    seqList = sequencesInRegion(xx,epos, spos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos - 1;
+
+    for (i=0; seqList[i]; i++) ;
+    for (i--; i>=0 && DBgetRelPos(xx,seqList[i])+DBgetLength(xx,seqList[i]) > fpos ; i--) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	tagStruct *t;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	while (t != NULL) {
+	    int normpos;
+	    int tagpos;
+	    normpos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	    tagpos = positionInContig(xx,seq,normpos);
+	    if (!(t->flags & TAG_DELETED) &&
+		in_interval(spos,fpos,tagpos) &&
+		strncmp(t->tagrec.type.c,type,4)==0) {
+		fseq = seq;
+		fseqpos = normpos;
+		fpos = tagpos;
+		found_tag = t;
+		if (DBgetComp(xx,seq) != COMPLEMENTED) 
+		    /* Keep looking */
+		    t = t->next;
+		else
+		    /* Stop now */
+		    t = NULL;
+	    } else
+		t = t->next;
+	}
+  
+    }
+
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+	_select_tag(xx,fseq,found_tag);
+        REDISPLAY(xx);
+    }
+
+    return fseq;
+}
+
+
+
+
+
+static int findNextSequence(EdStruct *xx, char *s)
+/*
+** Search forwards from the cursor position until the sequence specified
+** is found. The cursor is positioned at the left end of the sequence,
+** if found.
+**
+** Observations:
+**   The search is done on a reading by reading basis, rather than from
+**   the consensus. 
+**   The search is not case sensitive.
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int epos = DBgetLength(xx,0);
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    char *reading;
+    int maxlen;
+
+    seqList = sequencesInRegion(xx,spos, epos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos + 1;
+
+
+    for (maxlen = 0, i=0; seqList[i] ; i++)
+	maxlen = max(maxlen, DBgetLength(xx,seqList[i]));
+
+    reading = malloc(maxlen+1);
+
+    for (i=0; seqList[i] && DBgetRelPos(xx,seqList[i]) < fpos ; i++) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	char *str;
+	char *ind;
+
+	str = DBgetSeq(xx,seq);
+	strncpy(reading,str,DBgetLength(xx,seq));
+	reading[DBgetLength(xx,seq)] = '\0';
+
+	if (in_interval(1,DBgetLength(xx,seq),spos - DBgetRelPos(xx,seq)+1)) {
+	    int offset = spos - DBgetRelPos(xx,seq);
+	    ind = strstr(reading + offset,s);
+	} else
+	    ind = strstr(reading,s);
+
+	if (ind != NULL) {
+	    int pos;
+	    pos = positionInContig(xx,seq,(int) (ind - reading) + 1);
+	    if (in_interval(spos,fpos,pos)) {
+		fseqpos = (int) (ind - reading) + 1;
+		fpos = pos;
+		fseq = seq;
+	    }
+	   
+	}
+
+    }
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+        REDISPLAY(xx);
+    }
+
+    free(reading);
+    return fseq;
+}
+
+
+
+
+static void reverse_str(char *s, int len)
+/*
+** Verse the character sequence of a string
+*/
+{
+    int i;
+    char temp;
+
+    for (i=0; i < len/2; i++) {
+	temp = s[i];
+	s[i] = s[len-i-1];
+	s[len-i-1] = temp;
+    }
+
+}
+
+
+
+
+static int findPrevSequence(EdStruct *xx, char *inseq)
+/*
+** Search backwards from the cursor position until the sequence specified
+** is found. The cursor is positioned at the left end of the sequence,
+** if found.
+**
+** Observations:
+**   The search is done on a reading by reading basis, rather than from
+**   the consensus.
+**   The search is not case sensitive.
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+strlen(inseq)-2;
+    int epos = 1;
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    char *reading;
+    char *s;
+    int maxlen;
+
+    s = strdup(inseq);
+    reverse_str(s,strlen(s));
+
+    seqList = sequencesInRegion(xx,epos, spos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos - 1;
+
+
+    for (maxlen = 0, i=0; seqList[i] ; i++)
+	maxlen = max(maxlen, DBgetLength(xx,seqList[i]));
+
+    reading = malloc(maxlen+1);
+
+    for (i=0; seqList[i]; i++) ;
+    for (i--; i>=0 && DBgetRelPos(xx,seqList[i])+DBgetLength(xx,seqList[i]) > fpos ; i--) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	char *str;
+	char *ind;
+
+	str = DBgetSeq(xx,seq);
+	strncpy(reading,str,DBgetLength(xx,seq));
+	reading[DBgetLength(xx,seq)] = '\0';
+	reverse_str(reading,DBgetLength(xx,seq));
+
+	if (in_interval(1,DBgetLength(xx,seq),spos - DBgetRelPos(xx,seq)+1)) {
+	    int offset = DBgetLength(xx,seq) - (spos - DBgetRelPos(xx,seq)+1);
+	    ind = strstr(reading + offset,s);
+	} else
+	    ind = strstr(reading,s);
+
+	if (ind != NULL) {
+	    int pos;
+	    pos = positionInContig(xx,
+				   seq,
+				   DBgetLength(xx,seq) - (int)(ind - reading) - strlen(s) + 1);
+	    if (in_interval(spos,fpos,pos)) {
+		fseqpos = pos - DBgetRelPos(xx,seq) + 1;
+		fpos = pos;
+		fseq = seq;
+	    }
+	   
+	}
+
+    }
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+        REDISPLAY(xx);
+    }
+
+    free(reading);
+    return fseq;
+}
+
+
+
+
+
+static int findNextAnno(EdStruct *xx, char *anno)
+/*
+** Search forwards from the cursor position until a tag containing the
+** specified annotation is found. The cursor is positioned at the left
+** end of the tag, if found.
+**
+** Observations:
+**   A regular expression search is found, giving unpredictable results
+**   to people unfamiliar with such searches
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int epos = DBgetLength(xx,0);
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    char *error;
+    char *find_all = "$";
+    tagStruct *found_tag;
+
+    if (! *anno) anno = find_all;
+
+    error = re_comp(anno);
+
+    if (error != NULL) {
+	fprintf(stderr,"findNextAnno: %s\n",error);
+	return 0;
+    }
+
+
+    seqList = sequencesInRegion(xx,spos, epos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos + 1;
+
+    for (i=0; seqList[i] && DBgetRelPos(xx,seqList[i]) < fpos ; i++) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	tagStruct *t;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	while (t != NULL) {
+	    int normpos;
+	    int tagpos;
+	    int match;
+	    /* Code isn't very efficient here! */
+	    normpos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	    tagpos=positionInContig(xx,seq,normpos);
+	    /*
+	    ** Get annotation
+	    */
+	    force_comment(t);
+	    match = ( re_exec(t->newcomment) == 1);
+	    if (!(t->flags & TAG_DELETED) &&
+		in_interval(spos,fpos,tagpos) &&
+		t->tagrec.type.c[0] != '*' &&  /* avoid special tags */
+		t->tagrec.position != 0 &&
+		match) {
+		fseq = seq;
+		fseqpos = normpos;
+		fpos = tagpos;
+		found_tag = t;
+		if (DBgetComp(xx,seq) == COMPLEMENTED) 
+		    /* Keep looking */
+		    t = t->next;
+		else
+		    /* Stop now */
+		    t = NULL;
+	    } else
+		t = t->next;
+	}
+  
+    }
+
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+	_select_tag(xx,fseq,found_tag);
+        REDISPLAY(xx);
+    }
+
+    return fseq;
+}
+
+static int findPrevAnno(EdStruct *xx, char *anno)
+/*
+** Search backwards from the cursor position until a tag containing the
+** specified annotation is found. The cursor is positioned at the left
+** end of the tag, if found.
+**
+** Observations:
+**   A regular expression search is found, giving unpredictable results
+**   to people unfamiliar with such searches
+*/
+{
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)-1;
+    int epos = 1;
+    int fpos,fseq,i;
+    int fseqpos;
+    int *seqList;
+    char *error;
+    char *find_all = "$";
+    tagStruct *found_tag;
+
+    if (! *anno) anno = find_all;
+
+    error = re_comp(anno);
+
+    if (error != NULL) {
+	fprintf(stderr,"findPrevAnno: %s\n",error);
+	return 0;
+    }
+
+
+    seqList = sequencesInRegion(xx,epos, spos);
+    fseq = 0;
+    fseqpos = 0;
+    fpos = epos - 1;
+
+    for (i=0; seqList[i]; i++) ;
+    for (i--; i>=0 && DBgetRelPos(xx,seqList[i])+DBgetLength(xx,seqList[i]) > fpos ; i--) {
+	/* search through tag list for sequence */
+	int seq = seqList[i];
+	tagStruct *t;
+
+	t = (tagStruct *) DBgetTags(xx,seq);
+	while (t != NULL) {
+	    int normpos;
+	    int tagpos;
+	    int match;
+	    /* Code isn't very efficient here! */
+	    normpos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	    tagpos=positionInContig(xx,seq,normpos);
+	    /*
+	    ** Get annotation
+	    */
+	    force_comment(t);
+	    match = ( re_exec(t->newcomment) == 1);
+	    if (!(t->flags & TAG_DELETED) &&
+		in_interval(spos,fpos,tagpos) &&
+		t->tagrec.type.c[0] != '*' &&  /* avoid special tags */
+		t->tagrec.position != 0 &&
+		match) {
+		fseq = seq;
+		fseqpos = normpos;
+		fpos = tagpos;
+		found_tag = t;
+		if (DBgetComp(xx,seq) != COMPLEMENTED) 
+		    /* Keep looking */
+		    t = t->next;
+		else
+		    /* Stop now */
+		    t = NULL;
+	    } else
+		t = t->next;
+	}
+  
+    }
+
+
+    if (fseq) {
+	xx->cursorSeq = fseq;
+	xx->cursorPos = fseqpos;
+	_select_tag(xx,fseq,found_tag);
+        REDISPLAY(xx);
+    }
+
+    return fseq;
+}
+
+
+
+static int findPosition(EdStruct *xx, char *text_pos)
+/*
+** Position the cursor at the position specified.
+** There are three modes:
+**   1. By position in contig.       eg 30717
+**   2. By position in a gel.        eg @100
+**   3. By a relative offset.        eg +1000 eg -1000
+**
+** Observations:
+**   The cursor is positioned in the same gel if possible.
+**   If it possible to specify negative or large numbers when 
+**   specifying position in gel.
+*/
+{
+
+    int pos;
+    int cseq = xx->cursorSeq;
+    int cpos = xx->cursorPos;
+
+    for(; *text_pos && isspace(*text_pos) ; text_pos++) ;
+
+    switch (*text_pos) {
+    case '\0':
+	return 0;
+    case '+':
+    case '-':
+	pos = positionInContig(xx,cseq,cpos) + atoi(text_pos);
+	break;
+    case '@':
+	pos = DBgetRelPos(xx,cseq) + atoi(++text_pos) - 1;
+	break;
+    default:
+	pos = atoi(text_pos);
+	break;
+    }
+
+    if (pos > 0 && pos <= DBgetLength(xx,0)) {
+	if ( in_interval(DBgetRelPos(xx,cseq),DBgetRelPos(xx,cseq)+DBgetLength(xx,cseq)-1,pos ) ) {
+	    xx->cursorPos = pos - DBgetRelPos(xx,cseq) + 1;
+	} else {
+	    xx->cursorSeq = 0;
+	    xx->cursorPos = pos;
+	}
+        REDISPLAY(xx);
+	return 1;
+    }
+    return 0;
+    
+}
+
+
+
+int findNextProblem (EdStruct *xx)
+/*
+** Search forward from the cursor position until a the consensus is not
+** A, C, G or T. The cursor is positioned on the problem base, if found.
+*/
+{
+#define SEARCH_CHUNKS MAX_DISPLAY_WIDTH
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int epos = DBgetLength(xx,0);
+    char buffer[SEARCH_CHUNKS+1];
+    int i,width;
+
+    for (i=spos; i<= epos; i+=SEARCH_CHUNKS) {
+	width = min(epos-i+1,SEARCH_CHUNKS);
+	DBcalcConsensus (xx,i,width, buffer,BOTH_STRANDS);
+	{
+	    int check = strcspn(buffer,"-*");
+	    if (check != width) { /* we have problem! */
+		xx->cursorSeq = 0;
+		xx->cursorPos = i+check;
+		REDISPLAY(xx);
+		return 1;
+	    }
+	}
+    }
+    return 0;
+}
+
+
+
+static int findPrevProblem (EdStruct *xx)
+/*
+** Search forward from the cursor position until a the consensus is not
+** A, C, G or T. The cursor is positioned on the problem base, if found.
+*/
+{
+#define SEARCH_CHUNKS MAX_DISPLAY_WIDTH
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)-1;
+    int epos = 1;
+    char buffer[SEARCH_CHUNKS+1];
+    int i,width;
+    int j;
+
+    for (i=spos; i>= epos; i-=SEARCH_CHUNKS) {
+	width = min(i-epos+1,SEARCH_CHUNKS);
+	DBcalcConsensus (xx,i-width+1,width, buffer, BOTH_STRANDS);
+	for (j=width-1;j>=0;j--) {
+	    char *check = strchr("-*",buffer[j]);
+	    if (check != NULL) { /* we have problem! */
+		xx->cursorSeq = 0;
+		xx->cursorPos = i-width+1+j;
+		REDISPLAY(xx);
+		return 1;
+	    }
+	}
+    }
+    return 0;
+}
+
+
+static int qual_prob(char a, char b)
+/*
+** Returns true if character 'a' is out of sorts with character 'b'
+*/
+{
+    switch (a) {
+    case ' ':
+	return 0;
+    case '*':
+    case '-':
+	return 1;
+    default:
+	switch (b) {
+	case ' ':
+	    return 0;
+	case '*':
+	case '-':
+	    return 1;
+	default:
+	    return (a != b);
+	}
+    }
+}
+
+
+
+
+static int findNextQualProb (EdStruct *xx)
+/*
+** Search forwards from the cursor position until a problem relating to
+** quality is found. The cursor is positioned on the problematic base, if found.
+**
+** Observations:
+**   Large stretches of sequence on one strand only could cause frustration
+*/
+{
+#define SEARCH_CHUNKS MAX_DISPLAY_WIDTH
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)+1;
+    int epos = DBgetLength(xx,0);
+    char buffer1[SEARCH_CHUNKS+1];
+    char buffer2[SEARCH_CHUNKS+1];
+    int i,width;
+    int j;
+
+    for (i=spos; i<= epos; i+=SEARCH_CHUNKS) {
+	width = min(epos-i+1,SEARCH_CHUNKS);
+	DBcalcConsensus (xx,i,width,buffer1,COMPLEMENTED);
+	DBcalcConsensus (xx,i,width,buffer2,UNCOMPLEMENTED);
+	for (j=0;j<width;j++)
+	{
+	    if (qual_prob(buffer1[j],buffer2[j])) { /* we have problem! */
+		xx->cursorSeq = 0;
+		xx->cursorPos = i+j;
+		REDISPLAY(xx);
+		return 1;
+	    }
+	}
+    }
+    return 0;
+}
+
+
+
+static int findPrevQualProb (EdStruct *xx)
+/*
+** Search backwards from the cursor position until a problem relating to
+** quality is found. The cursor is positioned on the problematic base, if found.
+**
+** Observations:
+**   Large stretches of sequence on one strand only could cause frustration
+*/
+{
+#define SEARCH_CHUNKS MAX_DISPLAY_WIDTH
+    int spos = positionInContig(xx,xx->cursorSeq,xx->cursorPos)-1;
+    int epos = 1;
+    char buffer1[SEARCH_CHUNKS+1];
+    char buffer2[SEARCH_CHUNKS+1];
+    int i,width;
+    int j;
+
+    for (i=spos; i>= epos; i-=SEARCH_CHUNKS) {
+	width = min(i-epos+1,SEARCH_CHUNKS);
+	DBcalcConsensus (xx,i-width+1,width,buffer1,COMPLEMENTED);
+	DBcalcConsensus (xx,i-width+1,width,buffer2,UNCOMPLEMENTED);
+	for (j=width-1;j>=0;j--) {
+	    if (qual_prob(buffer1[j],buffer2[j])) { /* we have problem! */
+		xx->cursorSeq = 0;
+		xx->cursorPos = i-width+1+j;
+		REDISPLAY(xx);
+		return 1;
+	    }
+	}
+    }
+    return 0;
+}
+
+
+
+
+
+/*
+** The second half
+*/
+
+/*
+** Widgets and things
+*/
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Text.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/Command.h>
+#include <ctype.h>
+
+#include "edMenu.h"
+#include "main.h"
+
+#define FORWARDS 1
+#define BACKWARDS 2
+
+static EdStruct *XX;
+static char tag_type[4];
+static int up = 0;
+static Widget oldFogieWid;
+
+static Widget searchWid = NULL;
+static Widget form;
+static Widget label;
+static Widget dir_box,    backwards, forwards;
+static Widget opt_box,    by_pos, by_name, by_tag_type, by_tag_val, by_seq, by_prob, by_qual;
+static Widget val_box,    text, type;
+static Widget act_box,    search,quit;
+
+
+
+
+static void searchCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Interrogate buttons to determine search direct and mode, and call
+** the appropriate routine.
+*/
+{
+    char *text_str;
+    Arg args[10];
+    Cardinal nargs;
+    int found;
+
+    int option, isforwards;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstring, &text_str); nargs++;
+    XtGetValues(text,args,nargs);
+
+    option = (int) XawToggleGetCurrent(by_pos);
+    isforwards = ( FORWARDS == (int) XawToggleGetCurrent(forwards) );
+
+    if (isforwards) {
+
+	switch(option){
+	case OptByGelName:
+	    found = findNextGelByName(XX,text_str);
+	    break;
+	case OptByAnnotation:
+	    found = findNextAnno(XX,text_str);
+	    break;
+	case OptBySequence:
+	    found = findNextSequence(XX,text_str);
+	    break;
+	case OptByTagType:
+	    found = findNextTagByType (XX, tag_type);
+	    break;
+	case OptByPosition:
+	    found = findPosition(XX,text_str);
+	    break;
+	case OptByProblem:
+	    found = findNextProblem(XX);
+	    break;
+	case OptByQuality:
+	    found = findNextQualProb(XX);
+	    break;
+	}
+
+    } else {
+
+	switch(option){
+	case OptByGelName:
+	    found = findPrevGelByName(XX,text_str);
+	    break;
+	case OptByAnnotation:
+	    found = findPrevAnno(XX,text_str);
+	    break;
+	case OptBySequence:
+	    found = findPrevSequence(XX,text_str);
+	    break;
+	case OptByTagType:
+	    found = findPrevTagByType (XX, tag_type);
+	    break;
+	case OptByPosition:
+	    found = findPosition(XX,text_str);
+	    break;
+	case OptByProblem:
+	    found = findPrevProblem(XX);
+	    break;
+	case OptByQuality:
+	    found = findPrevQualProb(XX);
+	    break;
+	}
+
+    }
+
+    if (!found) XBell(GetDisplay(),100);
+
+}
+
+
+
+
+static void quitCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** Leave the search window gracefully
+*/
+{
+    up = 0;
+    XtPopdown(searchWid);
+}
+
+
+#include "tagdb.h"
+static void tagMenuCallback(Widget w, XtPointer client_data, XtPointer call_data)
+/*
+** The changing of the tag type - don't miss it!
+*/
+{
+    tag_db_struct *t = (tag_db_struct *) client_data;
+    setButtonName(type,t->type);
+    strncpy(tag_type,t->id,4);
+
+}
+
+
+
+
+static void create_search_wid(Widget parentWid)
+/*
+** Create the widgets for the search window.
+*/
+{
+    /*
+    **      Label
+    **      Search: [Backwards] [Forwards]
+    **      Operation:
+    **                 [Position] [Gel Name]
+    **                 [Annotation Type] [Annotation Content]
+    **                 [Sequence] [Problems]
+    **	    Value: [         ] [Type]
+    **	    [Search] [Quit]
+    */
+
+
+
+    Cardinal nargs;
+    Arg args[10];
+    Position	x, y; 	 /* top-left hand corner of new widget */
+    Dimension	height;  /* height of parent widget */
+
+    /*
+    ** Determine the position on the screen for this widget
+    */
+#define fromVertWid (XX->edWid)
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(fromVertWid, args, nargs);
+    XtTranslateCoords(fromVertWid, (Position) 0, (Position) height, &x, &y);
+
+    /*
+    ** Create popup shell
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    searchWid = XtCreatePopupShell("search", transientShellWidgetClass, parentWid, args, nargs);
+
+    /*
+    ** Create main form
+    */
+    nargs = 0;
+    form = XtCreateManagedWidget("form", formWidgetClass, searchWid, args, nargs);
+
+    /*
+    ** Create title for form
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Search Window"); nargs++;
+    label = XtCreateManagedWidget("label", labelWidgetClass, form, args, nargs);
+
+    /*
+    ** Create buttons for search direction
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, label); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+#define XtOrientHorizontal "horizontal"
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    dir_box = XtCreateManagedWidget("dir_box", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Direction:"); nargs++;
+    (void) XtCreateManagedWidget("label", labelWidgetClass, dir_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioData,BACKWARDS); nargs++;
+    backwards = XtCreateManagedWidget("backwards", toggleWidgetClass, dir_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, backwards); nargs++;
+    XtSetArg(args[nargs],XtNstate, True); nargs++;
+    XtSetArg(args[nargs],XtNradioData,FORWARDS); nargs++;
+    forwards = XtCreateManagedWidget("forwards", toggleWidgetClass, dir_box, args, nargs);
+
+    /*
+    ** Create buttons for operation selection
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, dir_box); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+#define XtOrientHorizontal "horizontal"
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    opt_box = XtCreateManagedWidget("opt_box", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Search by:"); nargs++;
+    (void) XtCreateManagedWidget("label", labelWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNstate, True); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Position"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByPosition); nargs++;
+    by_pos = XtCreateManagedWidget("by_pos", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Reading Name"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByGelName); nargs++;
+    by_name = XtCreateManagedWidget("by_name", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Tag Type"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByTagType); nargs++;
+    by_tag_type = XtCreateManagedWidget("by_tag_type", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Annotation"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByAnnotation); nargs++;
+    by_tag_val = XtCreateManagedWidget("by_tag_val", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Sequence"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptBySequence); nargs++;
+    by_seq = XtCreateManagedWidget("by_seq", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Problem"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByProblem); nargs++;
+    by_prob = XtCreateManagedWidget("by_prob", toggleWidgetClass, opt_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs],XtNradioGroup, by_pos); nargs++;
+    XtSetArg(args[nargs],XtNlabel,"Quality"); nargs++;
+    XtSetArg(args[nargs],XtNradioData,OptByQuality); nargs++;
+    by_qual = XtCreateManagedWidget("by_qual", toggleWidgetClass, opt_box, args, nargs);
+    
+
+    /*
+    ** Create selection criteria box
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, opt_box); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    val_box = XtCreateManagedWidget("val_box", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "    Value:"); nargs++;
+    (void) XtCreateManagedWidget("label", labelWidgetClass, val_box, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNresizable, True);              nargs++;
+    XtSetArg(args[nargs], XtNresize, XawtextResizeBoth);    nargs++;
+    XtSetArg(args[nargs], XtNeditType, XawtextEdit);        nargs++;
+    text = XtCreateManagedWidget("text",asciiTextWidgetClass, val_box, args, nargs);
+    my_translations(text);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNmenuName, "tagMenu"); nargs++;
+    type = XtCreateManagedWidget("type",menuButtonWidgetClass, val_box, args, nargs);
+    createTagTypeMenu(type,tagMenuCallback);
+    /* Set default type */
+    setButtonName(type, tag_db[0].type);
+    strncpy(tag_type,tag_db[0].id,4);
+
+    /*
+    ** Create Exit Action Buttons
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, val_box); nargs++;
+    XtSetArg(args[nargs], XtNborderWidth, 0); nargs++;
+    XtSetArg(args[nargs], XtNorientation, XtOrientHorizontal); nargs++;
+    act_box = XtCreateManagedWidget("act_box", boxWidgetClass, form, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Search"); nargs++;
+    search = XtCreateManagedWidget("search", commandWidgetClass, act_box, args, nargs);
+    XtAddCallback(search, XtNcallback, searchCallback, (XtPointer) NULL);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, "Quit"); nargs++;
+    quit = XtCreateManagedWidget("quit", commandWidgetClass, act_box, args, nargs);
+    XtAddCallback(quit, XtNcallback, quitCallback, (XtPointer) NULL);
+
+}
+
+
+
+int invokeSearchGeneric(EdStruct *xx)
+/*
+** Pop up the search window, if it isn't already up
+*/
+{
+    if (up) return 0;
+    XX = xx;
+
+    if (searchWid == NULL) {
+	create_search_wid(oldFogieWid);
+    }
+
+    XtPopup(searchWid,   XtGrabNone);
+
+    up = 1;
+
+    return 0;
+}
+
+
+
+int destroySearchWindow()
+/*
+** Shut this baby down
+** Called by the routine that takes the contig editor down.
+*/
+{
+    if (up)
+	XtCallCallbacks(quit, XtNcallback, (XtPointer) NULL);
+
+    return 0;
+}
+
+void createSearchWidget(Widget parentWid)
+/*
+** Prtend to create it now
+*/
+{
+    oldFogieWid = parentWid;
+}
diff --git a/src/staden/searchUtils.h b/src/staden/searchUtils.h
new file mode 100644
index 0000000..5cd52dd
--- /dev/null
+++ b/src/staden/searchUtils.h
@@ -0,0 +1,24 @@
+#ifndef _searchUtils_h
+#define _searchUtils_h
+
+#include <X11/Intrinsic.h>
+#include "edUtils.h"
+
+
+enum searchOptions {
+  OptByGelName=1,
+  OptByTagType,
+  OptByAnnotation,
+  OptBySequence,
+  OptByPosition,
+  OptByProblem,
+  OptByQuality
+};
+
+extern void createSearchWidget(Widget parentWid);
+
+extern int invokeSearchGeneric(EdStruct *xx);
+
+extern int destroySearchWindow();
+
+#endif /* _searchUtils_h */
diff --git a/src/staden/seeme-aix.f b/src/staden/seeme-aix.f
new file mode 100644
index 0000000..17c8c0b
--- /dev/null
+++ b/src/staden/seeme-aix.f
@@ -0,0 +1,684 @@
+C  SEEME: MACHINE SPECIFIC ROUTINES
+C 14-8-91 Added check in openrs for empty file names
+C 27-8-91 Started files xspec.f and nxspec.f and split out the nonx
+C         and X specific bits from here.
+C 27-8-91 Added FLUSHL to flush unit idev
+C 15-10-91 Moved IANDRS, IORRS and WRITEB from pl4010.f
+C 29-10-91 Replaced lnblnk ( a sun routine) by notrl
+C 15-11-91 Added routines to call inquire to see if a file exists
+C          and another to delete a file
+C 25-2-92 added more bit handling routines for use by sequence library
+C         searching routines
+C 16-6-92 KH: added CALL NULLSB() in FNDFIL and SUBROUTINE NULLSB() for AIX;
+C   AUTHOR: RODGER STADEN
+C   25-JAN-1990 INILU: SAVE DUP,PUP,DLOW,PLOW
+C   UNITNO
+C   OPENRS
+C THE FOLLOWING ARE PROBABLY COMPLETELY PORTABLE
+C BUT ARE INCLUDED ANY WAY
+C   INITLU
+C   IFROMP
+C   CTONUM
+C   DTONUM
+C   IUBM
+C   IUBM1
+C   IUBM2
+C   bsw4
+C   bsw2
+      SUBROUTINE UNITNO(KBIN,KBOUT,DEVNOS,NDEVS)
+C   AUTHOR: RODGER STADEN
+      INTEGER DEVNOS(NDEVS)
+C   SETS CONSOLE I/O UNITS TO 5,6. OTHERS 10 ONWARDS
+      KBIN=5
+      KBOUT=6
+      ISTART=10
+      DO 1 I=1,NDEVS
+      DEVNOS(I)=ISTART+I-1
+1     CONTINUE
+      CALL INITRS()
+      END
+      SUBROUTINE OPENRS(IDEV,FN,IOK,LRECL,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER     FN*(*)
+      CHARACTER*1024 FILNAM,FNDFIL
+      EXTERNAL FNDFIL,NOTRL,INQF
+C 14-8-91 Added check for blank file names
+C 14-11-91 Added routines to check for file existence
+C       ROUTINE TO OPEN FILES
+C   JOB=1 SEQUENTIAL FORMATTED NEW, DEFAULT RECORD LENGTH
+C      =2 SEQUENTIAL FORMATTED OLD, DEFAULT RECORD LENGTH
+C      =3 DIRECT ACCESS, UNFORMATTED NEW
+C      =4 DIRECT ACCESS, UNFORMATTED OLD
+C      =5 DIRECT ACCESS, UNFORMATTED OLD
+C      =6 UNFORMATTED NEW
+C      =7 UNFORMATTED OLD
+C      =8 STATUS='UNKNOWN'
+C      =9 TERMINAL FOR GRAPHICS
+C      =10 TERMINAL FOR VT100
+C      =11 direct access, single byte record length, old
+C   RECORD LENGTHS SENT IN WORDS
+C   READONLY IS VAX SPECIFIC*********************
+C   NOTE THAT ONLY FOR DIRECT ACCESS FILES ARE RECORD
+C   LENGTHS SPECIFIED. ON THE VAX UNFORMATTED DIRECT ACCESS
+C   RECORDS ARE DEFINED IN WORDS; ON OTHER MACHINES E.G. SOME UNIX
+C   MACHINES THEY ARE SPECIFIED IN BYTES. THE VARIABLE IWORDL IS
+C   USED TO MULTIPLY RECORD LENGTHS SPECIFIED IN WORDS TO GIVE
+C   THE CORRECT VALUE FOR THE MACHINE. ON A VAX IWORDL=1; ON
+C   THE UNIX MACHINE IWORDL=4
+       PARAMETER (IWORDL=4)
+      IF(NOTRL(FN,LEN(FN),' ').EQ.0) GO TO 100
+      IF (JOB.LT.9 .OR. JOB.GT.10) THEN
+        FILNAM = FNDFIL(FN)
+        INQJ = INQF(FILNAM)
+      END IF
+      IF(JOB.EQ.1) THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.2)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.3)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.4)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.5)THEN
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.6)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               FORM='UNFORMATTED',ERR=100)
+      ELSE IF(JOB.EQ.7)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               FORM='UNFORMATTED',ERR=100)
+        ELSE IF(JOB.EQ.8)THEN
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.9)THEN
+C               DEVICE FOR GRAPHICS OUTPUT IS 'TT' IF USING A
+C               VT640 ON A VAX. IF USING A SEPARATE TERMINAL FOR
+C               GRAPHICS OUTPUT THE TERMINAL PROTECTION MUST BE
+C               SET W:RW AND THE DEVICE WILL HAVE A DIFFERENT NAME
+C               FOR EXAMPLE 'GRAPHICS' WHERE THE LOGICAL NAME 'GRAPHICS'
+C               IS ASSIGNED TO THE APPROPRIATE DEVICE. A VT240 SHOULD
+C               BE COMPATIBLE WITH THE TEKTRONIX PLOT COMMANDS.
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.10)THEN
+C               DEVICE FOR VT100 COMPATIBLE TERMINAL WHEN USED
+C               FOR GELIN WHICH REQUIRES COMPLETE CONTROL OF
+C               CURSOR POSITIONING. ON A VAX IT IS CALLED 'TT'
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.11)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=1
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      END IF
+      IOK=0
+      RETURN
+100   CONTINUE
+      IOK=1
+      RETURN
+      END
+      CHARACTER*1024 FUNCTION FNDFIL(NAME)
+      CHARACTER NAME*(*),TMPNAM*1024,PREFIX*1024
+      EXTERNAL NOTRL
+C  LOOKS IN ENVIRONMENT FOR FILE PATHS
+C  ? CASES DEALT WITH:
+C  1) IF NAME STARTS WITH / IT IS ABSOLUTE AND SO UNCHANGED
+C  2) IF NAME STARTS WITH %/ REPLACE WITH HOME DIRECTORY NAME
+C  3) IF NAME INCLUDES ANY OTHER / LOOK IN ENVIRONMENT FOR
+C     NAME UPTO /, AND ADD PATH TO REST IF FOUND
+C     IF NOT FOUND ADD PATH TO CURRENT WORKING DIRECTORY
+C  4) ELSE LOOK IN ENVIRONMENT FOR WHOLE NAME
+C     IF NOT FOUND ADD PATH TO CURRENT DIRECTORY
+C     IF FOUND MAKE IT FILE NAME
+C
+      call NULLSB(NAME)
+      IF(NAME(1:1).EQ.'/') THEN
+        FNDFIL = NAME
+      ELSE IF (NAME(1:2).EQ.'%/') THEN
+        CALL GETENV('HOME',TMPNAM)
+        CALL NULLSB(TMPNAM)
+        FNDFIL = TMPNAM(:NOTRL(TMPNAM,LEN(TMPNAM),' ')) //
+     +  NAME(2:NOTRL(NAME,LEN(NAME),' '))
+      ELSE IF (INDEX(NAME,'/').NE.0) THEN
+        TMPNAM = NAME(1:INDEX(NAME,'/')-1)
+        CALL GETENV(TMPNAM,PREFIX)
+        CALL NULLSB(PREFIX)
+       I = NOTRL(PREFIX,LEN(PREFIX),' ')
+        IF(I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          call NULLSB(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    NAME(INDEX(NAME,'/'):)
+        END IF
+      ELSE
+        CALL GETENV(NAME,TMPNAM)
+        CALL NULLSB(TMPNAM)
+        I = NOTRL(TMPNAM,LEN(TMPNAM),' ')
+        IF (I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          CALL NULLSB(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = TMPNAM
+        END IF
+      END IF
+      END
+      SUBROUTINE NULLSB(STRING)
+C
+C     KH: simply convert all char(0) to blanks in a given string
+C
+      CHARACTER STRING*(*), Z*1
+      Z=CHAR(0)
+      DO K=1, LEN(STRING)
+         IF (STRING(K:K) .EQ. Z) STRING(K:K)=' '
+      END DO
+      RETURN
+      END
+      INTEGER FUNCTION INQF(NAME)
+      CHARACTER NAME*(*)
+      LOGICAL EX
+      INQUIRE(FILE=NAME,EXIST=EX)
+C
+C if the file exists return 1, else 0
+C
+      IF (EX) THEN
+        INQF = 1
+      ELSE
+        INQF = 0
+      END IF
+      END
+      INTEGER FUNCTION DELF(FN,IDEV,JRECL,JOB)
+      CHARACTER FN*(*)
+      CHARACTER FILNAM*1024,FNDFIL*1024
+      EXTERNAL FNDFIL
+C
+C routine to delete files in the most horrible way i could find
+C deal with job 2: old sequential formatted
+C deal with job 4: old direct unformatted
+C deal with job 7: old sequential unformatted
+C if the file exists and is not one of these error =-1
+C if the file exists and cannot be deleted error =-2
+C if the file cannot be opened error =-3
+C
+C on the sun this technique will delete any file if the directory
+C priviliges are set.
+C I dont know if i need to worry about the file types: could
+C i just open them all the same way? Although its a mess leave it!
+      FILNAM = FNDFIL(FN)
+      IF(JOB.EQ.2) THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='SEQUENTIAL',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.4)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='DIRECT',RECL=JRECL,ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.7)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  FORM='UNFORMATTED',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE
+        DELF = -1
+        RETURN
+      END IF
+      RETURN
+ 100  CONTINUE
+      DELF = -3
+      RETURN
+ 200  CONTINUE
+      DELF = -2
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP
+C  FOR BOTH DNA AND PROTEIN SEQUENCES
+C  THE INITIALISING ROUTINES ARE SENT THE CHARACTERSET SIZE IDM
+C  WHICH DETERMINES WHICH CHARACTERSET IS USED
+      SUBROUTINE INITLU(IDM)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255),POINT2(0:255)
+      CHARACTER DUP*16,DLOW*16,PUP*26,PLOW*26
+      COMMON /IASCI1/POINT1
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI1/
+      SAVE /IASCI2/
+      SAVE DUP,PUP,DLOW,PLOW
+      DATA DUP/'TCAG-RYWSMKHBVDN'/
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      DATA DLOW/'tcag-rywsmkhbvdn'/
+      DATA PLOW/'cstpagndeqbzhrkmilvfyw-x? '/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-5 FOR ACGT OR 1-26 FOR AMINO ACIDS BY USING ICHAR.
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT
+C  ELEMENTS CONTAIN VALUES 1 - 5, OR 1 - 26
+C  WORKS ON UPPER AND LOWER CASE - REMOVE DLOW,PLOW AND LOOPS 41 AND 51
+C  IF LOWERCASE NOT ALLOWED
+C
+      IF(IDM.EQ.5)THEN
+        DO 30 I = 0,255
+          POINT1(I) = IDM
+          POINT2(I) = 17
+30      CONTINUE
+        DO 35 I = 1,5
+          J = ICHAR(DUP(I:I))
+          POINT1(J) = I
+35      CONTINUE
+        DO 36 I = 1,5
+          J = ICHAR(DLOW(I:I))
+          POINT1(J) = I
+36      CONTINUE
+        DO 40 I = 1,16
+          J = ICHAR(DUP(I:I))
+          POINT2(J) = I
+40      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('U')
+          POINT1(J) = 1
+          POINT2(J) = 1
+        DO 41 I = 1,16
+          J = ICHAR(DLOW(I:I))
+          POINT2(J) = I
+41      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('u')
+          POINT1(J) = 1
+          POINT2(J) = 1
+        ELSE IF(IDM.EQ.26)THEN
+          DO 45 I = 0,255
+            POINT1(I) = IDM
+45        CONTINUE
+C
+        DO 50 I = 1,26
+          J = ICHAR(PUP(I:I))
+          POINT1(J) = I
+50      CONTINUE
+        DO 51 I = 1,26
+          J = ICHAR(PLOW(I:I))
+          POINT1(J) = I
+51      CONTINUE
+        DO 60 I = 0,255
+          POINT2(I) = POINT1(I)
+60      CONTINUE
+      ELSE
+        WRITE(*,*)'ERROR INITIALISING CHARACTER LOOKUP POINTERS'
+      END IF
+      END
+      INTEGER FUNCTION IFROMP(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      IFROMP = POINT1(ICOL)
+      END
+      INTEGER FUNCTION CTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      CTONUM = POINT1(ICOL)
+      END
+      INTEGER FUNCTION DTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT2(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI2/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      DTONUM = POINT2(ICOL)
+      END
+      INTEGER FUNCTION IUBM(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, 2 FOR POSSIBLE, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,2,0,2,2,0,0,2,2,2,0,2,2,0,
+     +0,1,0,0,2,0,2,0,2,2,0,2,2,2,0,2,0,
+     +0,0,1,0,2,2,0,2,0,2,0,2,0,2,2,2,0,
+     +0,0,0,1,2,2,0,0,2,0,2,0,2,2,2,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,2,1,0,2,2,2,2,2,2,2,2,2,0,
+     +1,1,0,0,2,0,1,2,2,2,2,2,2,2,2,2,0,
+     +1,0,1,0,2,2,2,1,0,2,2,2,2,2,2,2,0,
+     +0,1,0,1,2,2,2,0,1,2,2,2,2,2,2,2,0,
+     +0,1,1,0,2,2,2,2,2,1,0,2,2,2,2,2,0,
+     +1,0,0,1,2,2,2,2,2,2,1,2,2,2,2,2,0,
+     +1,1,1,0,2,2,2,1,2,1,2,1,2,2,2,2,0,
+     +1,1,0,1,2,2,1,2,1,2,1,2,1,2,2,2,0,
+     +0,1,1,1,2,1,2,2,1,1,2,2,2,1,2,2,0,
+     +1,0,1,1,2,1,2,1,2,2,1,2,2,2,1,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM2(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A POSSIBLE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,1,0,1,1,0,0,1,1,1,0,1,1,0,
+     +0,1,0,0,1,0,1,0,1,1,0,1,1,1,0,1,0,
+     +0,0,1,0,1,1,0,1,0,1,0,1,0,1,1,1,0,
+     +0,0,0,1,1,1,0,0,1,0,1,0,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,1,1,0,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,0,1,0,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,0,
+     +0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,1,0,
+     +0,1,1,0,1,1,1,1,1,1,0,1,1,1,1,1,0,
+     +1,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM2 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM1(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM1 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+C
+C
+      SUBROUTINE FLUSHL(IDEV)
+      CALL FLUSH(IDEV)
+C       This implementation uses the Sun Fortran FLUSH statement.
+      END
+      SUBROUTINE FLUSHO
+C       The (single) unit number is currently hard-wired, but must
+C       be the same as KBOUT in UNITNO.
+      CALL FLUSHL(6)
+      END
+C
+      SUBROUTINE CEDIT(FILNAM)
+      CHARACTER COMAND*256,EDITOR*256,FILNAM*(*)
+      INTEGER SYSTEM,I
+      EXTERNAL NOTRL
+      CALL GETENV('SEQEDT',EDITOR)
+      I=NOTRL(EDITOR,LEN(EDITOR),' ')
+      IF (I.EQ.0) THEN
+        EDITOR='vi'
+        I=2
+      ENDIF
+      COMAND = EDITOR(1:I)//' '//FILNAM
+      ISTAT = SYSTEM(COMAND)
+      END
+      SUBROUTINE BSW2(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER*2 IN,OUT
+C Sun
+        OUT(1:1) = IN(2:2)
+        OUT(2:2) = IN(1:2)
+C Alliant
+C        OUT = IN
+      END
+      SUBROUTINE BSW4(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER IN*4,OUT*4
+C Sun
+      DO 10 I = 1,2
+        OUT(I:I) = IN(5-I:5-I)
+        OUT(5-I:5-I) = IN(I:I)
+ 10     CONTINUE
+C Alliant
+C      OUT = IN
+      END
+      INTEGER FUNCTION IANDRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IAND
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.AND.LJIN
+C Sun
+        IANDRS=AND(IIN,JIN)
+C Alliant
+C        IANDRS=IAND(IIN,JIN)
+        END
+      INTEGER FUNCTION IORRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IOR
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.OR.LJIN
+C Sun
+        IORRS=OR(IIN,JIN)
+C Alliant
+C        IORRS=IOR(IIN,JIN)
+        END
+       SUBROUTINE WRITEB(CHARS,NCHAR,IDEVGR)
+C   AUTHOR: RODGER STADEN
+       CHARACTER CHARS*(*)
+       SAVE NUL
+       DATA NUL/0/
+C   NOTE NULS STOP CARRIAGE RETURN, LINE FEED
+C Sun
+       WRITE(IDEVGR,1000)CHARS(1:NCHAR)
+1000   FORMAT(A,$)
+C Alliant
+C      WRITE(IDEVGR,1000)NUL,CHARS(1:NCHAR)
+C1000   FORMAT(A1,A)
+       END
+      SUBROUTINE SABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BITNUM
+C
+C set bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BITNUM = MOD(I,32)
+C      WRITE(*,*)'ELEMENT, BIT',ELEMNT,BITNUM
+      CALL SBITRS(ARRAY(ELEMNT),BITNUM)
+      END
+      SUBROUTINE AAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IANDRS
+      EXTERNAL IANDRS
+C
+C and array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IANDRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      SUBROUTINE OAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IORRS
+      EXTERNAL IORRS
+C
+C OR array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IORRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      LOGICAL FUNCTION TABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BIT
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C test bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BIT = MOD(I,32)
+C      WRITE(*,*)'TEST ELEMENT, BIT',ELEMNT,BIT
+      TABIT = TBITRS(ARRAY(ELEMNT),BIT)
+      END
+      SUBROUTINE CABIT(ARRAY,MAXAR)
+      INTEGER ARRAY(0:MAXAR),POSN
+C
+C clear all bits in array (1 at a time!!!!!!!!)
+C
+      DO 10 POSN=0,MAXAR
+        DO 5 J = 0,31
+          CALL CBITRS(ARRAY(POSN),J)
+ 5      CONTINUE
+ 10     CONTINUE
+      END
+      SUBROUTINE TBITA(ARRAY,MAXAR,IDEV)
+      INTEGER ARRAY(0:MAXAR)
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C  test all bits in array to see if they are set
+C
+      DO 10 I=0,MAXAR
+        DO 5 J=0,31
+          IF(TBITRS(ARRAY(I),J)) THEN
+            K = I*32 + 1 + MOD(J,32)
+            WRITE(IDEV,*)' BIT SET FOR POSN',K
+          END IF
+ 5        CONTINUE
+ 10       CONTINUE
+      END
+C   routines to handle bits (machine specific) for use when
+C   searching through keyword indexes
+C
+C   positions for marking are numbered from 1 onwards
+C   but array elements and bits are numbered from 0 onwards
+C
+      SUBROUTINE SBITRS(WORD,BITNUM)
+C
+C set bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant & AIX
+       INTEGER IBSET
+       INTRINSIC IBSET
+       WORD = IBSET(WORD,BITNUM)
+C sun
+C     CALL BIS(BITNUM,WORD)
+      END
+      SUBROUTINE CBITRS(WORD,BITNUM)
+C
+C clear bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant & AIX
+       INTEGER IBCLR
+       INTRINSIC IBCLR
+       WORD = IBCLR(WORD,BITNUM)
+C sun
+C     CALL BIC(BITNUM,WORD)
+      END
+      LOGICAL FUNCTION TBITRS(WORD,BITNUM)
+C
+C test bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant & AIX
+       LOGICAL BTEST
+       INTRINSIC BTEST
+       TBITRS = BTEST(WORD,BITNUM)
+C sun
+C     LOGICAL BIT
+C     EXTERNAL BIT
+C     TBITRS = BIT(BITNUM,WORD)
+      END
diff --git a/src/staden/seeme-al.f b/src/staden/seeme-al.f
new file mode 100644
index 0000000..c940ee6
--- /dev/null
+++ b/src/staden/seeme-al.f
@@ -0,0 +1,715 @@
+C  SEEME: MACHINE SPECIFIC ROUTINES
+C 11-12-92 fseek fix for al new bug
+C 14-8-91 Added check in openrs for empty file names
+C 27-8-91 Started files xspec.f and nxspec.f and split out the nonx
+C         and X specific bits from here.
+C 27-8-91 Added FLUSHL to flush unit idev
+C 15-10-91 Moved IANDRS, IORRS and WRITEB from pl4010.f
+C 29-10-91 Replaced lnblnk ( a sun routine) by notrl
+C 15-11-91 Added routines to call inquire to see if a file exists
+C          and another to delete a file
+C 25-2-92 added more bit handling routines for use by sequence library
+C         searching routines
+C 20-5-92 moved getrs, getrsl and seekrs to here from seqlibsubs.f
+C   AUTHOR: RODGER STADEN
+C   25-JAN-1990 INILU: SAVE DUP,PUP,DLOW,PLOW
+C   UNITNO
+C   OPENRS
+C THE FOLLOWING ARE PROBABLY COMPLETELY PORTABLE
+C BUT ARE INCLUDED ANY WAY
+C   INITLU
+C   IFROMP
+C   CTONUM
+C   DTONUM
+C   IUBM
+C   IUBM1
+C   IUBM2
+C   bsw4
+C   bsw2
+      SUBROUTINE UNITNO(KBIN,KBOUT,DEVNOS,NDEVS)
+C   AUTHOR: RODGER STADEN
+      INTEGER DEVNOS(NDEVS)
+C   SETS CONSOLE I/O UNITS TO 5,6. OTHERS 10 ONWARDS
+      KBIN=5
+      KBOUT=6
+      ISTART=10
+      DO 1 I=1,NDEVS
+      DEVNOS(I)=ISTART+I-1
+1     CONTINUE
+      CALL INITRS()
+      END
+      SUBROUTINE OPENRS(IDEV,FN,IOK,LRECL,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER     FN*(*)
+      CHARACTER*1024 FILNAM,FNDFIL
+      EXTERNAL FNDFIL,NOTRL,INQF
+C 14-8-91 Added check for blank file names
+C 14-11-91 Added routines to check for file existence
+C       ROUTINE TO OPEN FILES
+C   JOB=1 SEQUENTIAL FORMATTED NEW, DEFAULT RECORD LENGTH
+C      =2 SEQUENTIAL FORMATTED OLD, DEFAULT RECORD LENGTH
+C      =3 DIRECT ACCESS, UNFORMATTED NEW
+C      =4 DIRECT ACCESS, UNFORMATTED OLD
+C      =5 DIRECT ACCESS, UNFORMATTED OLD
+C      =6 UNFORMATTED NEW
+C      =7 UNFORMATTED OLD
+C      =8 STATUS='UNKNOWN'
+C      =9 TERMINAL FOR GRAPHICS
+C      =10 TERMINAL FOR VT100
+C      =11 direct access, single byte record length, old
+C   RECORD LENGTHS SENT IN WORDS
+C   READONLY IS VAX SPECIFIC*********************
+C   NOTE THAT ONLY FOR DIRECT ACCESS FILES ARE RECORD
+C   LENGTHS SPECIFIED. ON THE VAX UNFORMATTED DIRECT ACCESS
+C   RECORDS ARE DEFINED IN WORDS; ON OTHER MACHINES E.G. SOME UNIX
+C   MACHINES THEY ARE SPECIFIED IN BYTES. THE VARIABLE IWORDL IS
+C   USED TO MULTIPLY RECORD LENGTHS SPECIFIED IN WORDS TO GIVE
+C   THE CORRECT VALUE FOR THE MACHINE. ON A VAX IWORDL=1; ON
+C   THE UNIX MACHINE IWORDL=4
+       PARAMETER (IWORDL=4)
+      IF(NOTRL(FN,LEN(FN),' ').EQ.0) GO TO 100
+      IF (JOB.LT.9 .OR. JOB.GT.10) THEN
+        FILNAM = FNDFIL(FN)
+        INQJ = INQF(FILNAM)
+      END IF
+      IF(JOB.EQ.1) THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.2)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.3)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.4)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.5)THEN
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.6)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               FORM='UNFORMATTED',ERR=100)
+      ELSE IF(JOB.EQ.7)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               FORM='UNFORMATTED',ERR=100)
+        ELSE IF(JOB.EQ.8)THEN
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.9)THEN
+C               DEVICE FOR GRAPHICS OUTPUT IS 'TT' IF USING A
+C               VT640 ON A VAX. IF USING A SEPARATE TERMINAL FOR
+C               GRAPHICS OUTPUT THE TERMINAL PROTECTION MUST BE
+C               SET W:RW AND THE DEVICE WILL HAVE A DIFFERENT NAME
+C               FOR EXAMPLE 'GRAPHICS' WHERE THE LOGICAL NAME 'GRAPHICS'
+C               IS ASSIGNED TO THE APPROPRIATE DEVICE. A VT240 SHOULD
+C               BE COMPATIBLE WITH THE TEKTRONIX PLOT COMMANDS.
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.10)THEN
+C               DEVICE FOR VT100 COMPATIBLE TERMINAL WHEN USED
+C               FOR GELIN WHICH REQUIRES COMPLETE CONTROL OF
+C               CURSOR POSITIONING. ON A VAX IT IS CALLED 'TT'
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.11)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=1
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      END IF
+      IOK=0
+      RETURN
+100   CONTINUE
+      IOK=1
+      RETURN
+      END
+      CHARACTER*1024 FUNCTION FNDFIL(NAME)
+      CHARACTER NAME*(*),TMPNAM*1024,PREFIX*1024
+      EXTERNAL NOTRL
+C  LOOKS IN ENVIRONMENT FOR FILE PATHS
+C  ? CASES DEALT WITH:
+C  1) IF NAME STARTS WITH / IT IS ABSOLUTE AND SO UNCHANGED
+C  2) IF NAME STARTS WITH ~/ REPLACE WITH HOME DIRECTORY NAME
+C  3) IF NAME INCLUDES ANY OTHER / LOOK IN ENVIRONMENT FOR
+C     NAME UPTO /, AND ADD PATH TO REST IF FOUND
+C     IF NOT FOUND ADD PATH TO CURRENT WORKING DIRECTORY
+C  4) ELSE LOOK IN ENVIRONMENT FOR WHOLE NAME
+C     IF NOT FOUND ADD PATH TO CURRENT DIRECTORY
+C     IF FOUND MAKE IT FILE NAME
+C
+      IF(NAME(1:1).EQ.'/') THEN
+        FNDFIL = NAME
+      ELSE IF (NAME(1:2).EQ.'~/') THEN
+        CALL GETENV('HOME',TMPNAM)
+        FNDFIL = TMPNAM(:NOTRL(TMPNAM,LEN(TMPNAM),' ')) //
+     +  NAME(2:NOTRL(NAME,LEN(NAME),' '))
+      ELSE IF (INDEX(NAME,'/').NE.0) THEN
+        TMPNAM = NAME(1:INDEX(NAME,'/')-1)
+        CALL GETENV(TMPNAM,PREFIX)
+        I = NOTRL(PREFIX,LEN(PREFIX),' ')
+        IF(I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    NAME(INDEX(NAME,'/'):)
+        END IF
+      ELSE
+        CALL GETENV(NAME,TMPNAM)
+        I = NOTRL(TMPNAM,LEN(TMPNAM),' ')
+        IF (I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = TMPNAM
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION INQF(NAME)
+      CHARACTER NAME*(*)
+      LOGICAL EX
+      INQUIRE(FILE=NAME,EXIST=EX)
+C
+C if the file exists return 1, else 0
+C
+      IF (EX) THEN
+        INQF = 1
+      ELSE
+        INQF = 0
+      END IF
+      END
+      INTEGER FUNCTION DELF(FN,IDEV,JRECL,JOB)
+      CHARACTER FN*(*)
+      CHARACTER FILNAM*1024,FNDFIL*1024
+      EXTERNAL FNDFIL
+C
+C routine to delete files in the most horrible way i could find
+C deal with job 2: old sequential formatted
+C deal with job 4: old direct unformatted
+C deal with job 7: old sequential unformatted
+C if the file exists and is not one of these error =-1
+C if the file exists and cannot be deleted error =-2
+C if the file cannot be opened error =-3
+C
+C on the sun this technique will delete any file if the directory
+C priviliges are set.
+C I dont know if i need to worry about the file types: could
+C i just open them all the same way? Although its a mess leave it!
+      FILNAM = FNDFIL(FN)
+      IF(JOB.EQ.2) THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='SEQUENTIAL',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.4)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='DIRECT',RECL=JRECL,ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.7)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  FORM='UNFORMATTED',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE
+        DELF = -1
+        RETURN
+      END IF
+      RETURN
+ 100  CONTINUE
+      DELF = -3
+      RETURN
+ 200  CONTINUE
+      DELF = -2
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP
+C  FOR BOTH DNA AND PROTEIN SEQUENCES
+C  THE INITIALISING ROUTINES ARE SENT THE CHARACTERSET SIZE IDM
+C  WHICH DETERMINES WHICH CHARACTERSET IS USED
+      SUBROUTINE INITLU(IDM)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255),POINT2(0:255)
+      CHARACTER DUP*16,DLOW*16,PUP*26,PLOW*26
+      COMMON /IASCI1/POINT1
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI1/
+      SAVE /IASCI2/
+      SAVE DUP,PUP,DLOW,PLOW
+      DATA DUP/'TCAG-RYWSMKHBVDN'/
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      DATA DLOW/'tcag-rywsmkhbvdn'/
+      DATA PLOW/'cstpagndeqbzhrkmilvfyw-x? '/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-5 FOR ACGT OR 1-26 FOR AMINO ACIDS BY USING ICHAR. 
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 5, OR 1 - 26
+C  WORKS ON UPPER AND LOWER CASE - REMOVE DLOW,PLOW AND LOOPS 41 AND 51
+C  IF LOWERCASE NOT ALLOWED
+C
+      IF(IDM.EQ.5)THEN
+        DO 30 I = 0,255
+          POINT1(I) = IDM
+          POINT2(I) = 17
+30      CONTINUE
+        DO 35 I = 1,5
+          J = ICHAR(DUP(I:I))
+          POINT1(J) = I
+35      CONTINUE
+        DO 36 I = 1,5
+          J = ICHAR(DLOW(I:I))
+          POINT1(J) = I
+36      CONTINUE
+        DO 40 I = 1,16
+          J = ICHAR(DUP(I:I))
+          POINT2(J) = I
+40      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('U')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        DO 41 I = 1,16
+          J = ICHAR(DLOW(I:I))
+          POINT2(J) = I
+41      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('u')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        ELSE IF(IDM.EQ.26)THEN
+          DO 45 I = 0,255
+            POINT1(I) = IDM
+45        CONTINUE
+C
+        DO 50 I = 1,26
+          J = ICHAR(PUP(I:I))
+          POINT1(J) = I
+50      CONTINUE
+        DO 51 I = 1,26
+          J = ICHAR(PLOW(I:I))
+          POINT1(J) = I
+51      CONTINUE
+        DO 60 I = 0,255
+          POINT2(I) = POINT1(I)
+60      CONTINUE
+      ELSE
+        WRITE(*,*)'ERROR INITIALISING CHARACTER LOOKUP POINTERS'
+      END IF
+      END
+      INTEGER FUNCTION IFROMP(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      IFROMP = POINT1(ICOL)
+      END
+      INTEGER FUNCTION CTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      CTONUM = POINT1(ICOL)
+      END
+      INTEGER FUNCTION DTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT2(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI2/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      DTONUM = POINT2(ICOL)
+      END
+      INTEGER FUNCTION IUBM(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, 2 FOR POSSIBLE, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,2,0,2,2,0,0,2,2,2,0,2,2,0,
+     +0,1,0,0,2,0,2,0,2,2,0,2,2,2,0,2,0,
+     +0,0,1,0,2,2,0,2,0,2,0,2,0,2,2,2,0,
+     +0,0,0,1,2,2,0,0,2,0,2,0,2,2,2,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,2,1,0,2,2,2,2,2,2,2,2,2,0,
+     +1,1,0,0,2,0,1,2,2,2,2,2,2,2,2,2,0,
+     +1,0,1,0,2,2,2,1,0,2,2,2,2,2,2,2,0,
+     +0,1,0,1,2,2,2,0,1,2,2,2,2,2,2,2,0,
+     +0,1,1,0,2,2,2,2,2,1,0,2,2,2,2,2,0,
+     +1,0,0,1,2,2,2,2,2,2,1,2,2,2,2,2,0,
+     +1,1,1,0,2,2,2,1,2,1,2,1,2,2,2,2,0,
+     +1,1,0,1,2,2,1,2,1,2,1,2,1,2,2,2,0,
+     +0,1,1,1,2,1,2,2,1,1,2,2,2,1,2,2,0,
+     +1,0,1,1,2,1,2,1,2,2,1,2,2,2,1,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM2(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A POSSIBLE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,1,0,1,1,0,0,1,1,1,0,1,1,0,
+     +0,1,0,0,1,0,1,0,1,1,0,1,1,1,0,1,0,
+     +0,0,1,0,1,1,0,1,0,1,0,1,0,1,1,1,0,
+     +0,0,0,1,1,1,0,0,1,0,1,0,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,1,1,0,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,0,1,0,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,0,
+     +0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,1,0,
+     +0,1,1,0,1,1,1,1,1,1,0,1,1,1,1,1,0,
+     +1,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM2 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM1(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM1 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+C
+C
+      SUBROUTINE FLUSHL(IDEV)
+      CALL FLUSH(IDEV)
+C       This implementation uses the Sun Fortran FLUSH statement.
+      END
+      SUBROUTINE FLUSHO
+C       The (single) unit number is currently hard-wired, but must
+C       be the same as KBOUT in UNITNO.
+      CALL FLUSHL(6)
+      END
+C
+      SUBROUTINE CEDIT(FILNAM)
+      CHARACTER COMAND*256,EDITOR*256,FILNAM*(*)
+      INTEGER SYSTEM,I
+      EXTERNAL NOTRL
+      CALL GETENV('SEQEDT',EDITOR)
+      I=NOTRL(EDITOR,LEN(EDITOR),' ')
+      IF (I.EQ.0) THEN
+        EDITOR='vi'
+        I=2
+      ENDIF
+      COMAND = EDITOR(1:I)//' '//FILNAM
+      ISTAT = SYSTEM(COMAND)
+      END
+      SUBROUTINE BSW2(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER*2 IN,OUT
+C Sun
+C        OUT(1:1) = IN(2:2)
+C        OUT(2:2) = IN(1:2)
+C Alliant
+        OUT = IN
+      END
+      SUBROUTINE BSW4(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER IN*4,OUT*4
+C Sun
+C      DO 10 I = 1,2
+C        OUT(I:I) = IN(5-I:5-I)
+C        OUT(5-I:5-I) = IN(I:I)
+C 10     CONTINUE
+C Alliant
+      OUT = IN
+      END
+      INTEGER FUNCTION IANDRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IAND
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.AND.LJIN
+C Sun
+C        IANDRS=AND(IIN,JIN)
+C Alliant
+        IANDRS=IAND(IIN,JIN)
+        RETURN
+        END
+      INTEGER FUNCTION IORRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IOR
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.OR.LJIN
+C Sun
+C        IORRS=OR(IIN,JIN)
+C Alliant
+        IORRS=IOR(IIN,JIN)
+        RETURN
+        END
+       SUBROUTINE WRITEB(CHARS,NCHAR,IDEVGR)
+C   AUTHOR: RODGER STADEN 
+       CHARACTER CHARS*(*)
+       SAVE NUL
+       DATA NUL/0/
+C   NOTE NULS STOP CARRIAGE RETURN, LINE FEED
+C Sun
+C       WRITE(IDEVGR,1000)CHARS(1:NCHAR)
+C1000   FORMAT(A,$)
+C Alliant
+      WRITE(IDEVGR,1000)NUL,CHARS(1:NCHAR)
+1000   FORMAT(A1,A)
+       RETURN
+       END
+      SUBROUTINE SABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BITNUM
+C
+C set bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BITNUM = MOD(I,32)
+C      WRITE(*,*)'ELEMENT, BIT',ELEMNT,BITNUM
+      CALL SBITRS(ARRAY(ELEMNT),BITNUM)
+      END
+      SUBROUTINE AAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IANDRS
+      EXTERNAL IANDRS
+C
+C and array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IANDRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      SUBROUTINE OAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IORRS
+      EXTERNAL IORRS
+C
+C OR array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IORRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      LOGICAL FUNCTION TABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BIT
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C test bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BIT = MOD(I,32)
+C      WRITE(*,*)'TEST ELEMENT, BIT',ELEMNT,BIT
+      TABIT = TBITRS(ARRAY(ELEMNT),BIT)
+      END
+      SUBROUTINE CABIT(ARRAY,MAXAR)
+      INTEGER ARRAY(0:MAXAR),POSN
+C
+C clear all bits in array (1 at a time!!!!!!!!)
+C
+      DO 10 POSN=0,MAXAR
+        DO 5 J = 0,31
+          CALL CBITRS(ARRAY(POSN),J)
+ 5      CONTINUE
+ 10     CONTINUE
+      END
+      SUBROUTINE TBITA(ARRAY,MAXAR,IDEV)
+      INTEGER ARRAY(0:MAXAR)
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C  test all bits in array to see if they are set
+C
+      DO 10 I=0,MAXAR
+        DO 5 J=0,31
+          IF(TBITRS(ARRAY(I),J)) THEN
+            K = I*32 + 1 + MOD(J,32)
+            WRITE(IDEV,*)' BIT SET FOR POSN',K
+          END IF
+ 5        CONTINUE
+ 10       CONTINUE
+      END
+C   routines to handle bits (machine specific) for use when
+C   searching through keyword indexes
+C
+C   positions for marking are numbered from 1 onwards
+C   but array elements and bits are numbered from 0 onwards
+C
+      SUBROUTINE SBITRS(WORD,BITNUM)
+C
+C set bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+      INTEGER IBSET
+      INTRINSIC IBSET
+      WORD = IBSET(WORD,BITNUM)
+C sun
+C      CALL BIS(BITNUM,WORD)
+      END
+      SUBROUTINE CBITRS(WORD,BITNUM)
+C
+C clear bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+      INTEGER IBCLR
+      INTRINSIC IBCLR
+      WORD = IBCLR(WORD,BITNUM)
+C sun
+C      CALL BIC(BITNUM,WORD)
+      END
+      LOGICAL FUNCTION TBITRS(WORD,BITNUM)
+C
+C test bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+      LOGICAL BTEST
+      INTRINSIC BTEST
+      TBITRS = BTEST(WORD,BITNUM)
+C sun
+C      LOGICAL BIT
+C      EXTERNAL BIT
+C      TBITRS = BIT(BITNUM,WORD)
+      END
+      INTEGER FUNCTION GETRS(IDEV,STRING,NSTRNG,IBYTE)
+      CHARACTER STRING*(*)
+      IREC = IBYTE
+      DO 10 I=1,NSTRNG
+        READ(IDEV,REC=IREC,ERR=20)STRING(I:I)
+        IREC = IREC + 1
+ 10     CONTINUE
+      GETRS = 0
+      RETURN
+ 20   CONTINUE
+      GETRS = -1
+      END
+      INTEGER FUNCTION GETRSL(IDEV,STRING,NSTRNG)
+      CHARACTER STRING*(*)
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      READ(IDEV,1000,ERR=100,END=200)STRING(1:NSTRNG)
+ 1000 FORMAT(A)
+      DO 10 I=1,NSTRNG
+        IF(STRING(I:I).EQ.CR) THEN
+          GETRSL = I-1
+          RETURN
+        END IF
+ 10     CONTINUE
+      GETRSL = NSTRNG
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR READING IN GETRSL'
+      GETRSL = -2
+      RETURN
+ 200  CONTINUE
+      GETRSL = -1
+      WRITE(*,*)'END IN GETRSL'
+      END
+      INTEGER FUNCTION SEEKRS(IDEV,IREC)
+      INTEGER FSEEK
+C OLD VERSION THIS LINE ONLY      SEEKRS = FSEEK(IDEV,IREC,0)
+C following 4 lines apparently fix the bug!
+      COMMON /ALBUGS/I,J,K
+      K = 0
+      J = IREC
+      I = FSEEK(IDEV,J,K)
+      SEEKRS = I
+      END
diff --git a/src/staden/seeme-dec.f b/src/staden/seeme-dec.f
new file mode 100644
index 0000000..0196c51
--- /dev/null
+++ b/src/staden/seeme-dec.f
@@ -0,0 +1,674 @@
+C  SEEME: MACHINE SPECIFIC ROUTINES for dec !!!!!!!!!
+C         DEC ULTRIX cahnges: iwordl = 1, seqlibs open 'stream'
+C 14-8-91 Added check in openrs for empty file names
+C 27-8-91 Started files xspec.f and nxspec.f and split out the nonx
+C         and X specific bits from here.
+C 27-8-91 Added FLUSHL to flush unit idev
+C 15-10-91 Moved IANDRS, IORRS and WRITEB from pl4010.f
+C 29-10-91 Replaced lnblnk ( a sun routine) by notrl
+C 15-11-91 Added routines to call inquire to see if a file exists
+C          and another to delete a file
+C 25-2-92 added more bit handling routines for use by sequence library
+C         searching routines
+C 20-5-92 Added readonly where required for dec!
+C         moved getrs, getrsl and seekrs from seqlibsubs.f to here
+C   AUTHOR: RODGER STADEN
+C   25-JAN-1990 INILU: SAVE DUP,PUP,DLOW,PLOW
+C   UNITNO
+C   OPENRS
+C THE FOLLOWING ARE PROBABLY COMPLETELY PORTABLE
+C BUT ARE INCLUDED ANY WAY
+C   INITLU
+C   IFROMP
+C   CTONUM
+C   DTONUM
+C   IUBM
+C   IUBM1
+C   IUBM2
+C   bsw4
+C   bsw2
+      SUBROUTINE UNITNO(KBIN,KBOUT,DEVNOS,NDEVS)
+C   AUTHOR: RODGER STADEN
+      INTEGER DEVNOS(NDEVS)
+C   SETS CONSOLE I/O UNITS TO 5,6. OTHERS 10 ONWARDS
+      KBIN=5
+      KBOUT=6
+      ISTART=10
+      DO 1 I=1,NDEVS
+      DEVNOS(I)=ISTART+I-1
+1     CONTINUE
+      CALL INITRS()
+      END
+      SUBROUTINE OPENRS(IDEV,FN,IOK,LRECL,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER     FN*(*)
+      CHARACTER*1024 FILNAM,FNDFIL
+      EXTERNAL FNDFIL,NOTRL,INQF
+C 14-8-91 Added check for blank file names
+C 14-11-91 Added routines to check for file existence
+C       ROUTINE TO OPEN FILES
+C   JOB=1 SEQUENTIAL FORMATTED NEW, DEFAULT RECORD LENGTH
+C      =2 SEQUENTIAL FORMATTED OLD, DEFAULT RECORD LENGTH
+C      =3 DIRECT ACCESS, UNFORMATTED NEW
+C      =4 DIRECT ACCESS, UNFORMATTED OLD
+C      =5 DIRECT ACCESS, UNFORMATTED OLD
+C      =6 UNFORMATTED NEW
+C      =7 UNFORMATTED OLD
+C      =8 STATUS='UNKNOWN'
+C      =9 TERMINAL FOR GRAPHICS
+C      =10 TERMINAL FOR VT100
+C      =11 direct access, single byte record length, old
+C   RECORD LENGTHS SENT IN WORDS
+C   READONLY IS VAX SPECIFIC*********************
+C   NOTE THAT ONLY FOR DIRECT ACCESS FILES ARE RECORD
+C   LENGTHS SPECIFIED. ON THE VAX UNFORMATTED DIRECT ACCESS
+C   RECORDS ARE DEFINED IN WORDS; ON OTHER MACHINES E.G. SOME UNIX
+C   MACHINES THEY ARE SPECIFIED IN BYTES. THE VARIABLE IWORDL IS
+C   USED TO MULTIPLY RECORD LENGTHS SPECIFIED IN WORDS TO GIVE
+C   THE CORRECT VALUE FOR THE MACHINE. ON A VAX IWORDL=1; ON
+C   THE UNIX MACHINE IWORDL=4
+       PARAMETER (IWORDL=1)
+      IF(NOTRL(FN,LEN(FN),' ').EQ.0) GO TO 100
+      IF (JOB.LT.9 .OR. JOB.GT.10) THEN
+        FILNAM = FNDFIL(FN)
+        INQJ = INQF(FILNAM)
+      END IF
+      IF(JOB.EQ.1) THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.2)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.3)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.4)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.5)THEN
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.6)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               FORM='UNFORMATTED',ERR=100)
+      ELSE IF(JOB.EQ.7)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               FORM='UNFORMATTED',ERR=100)
+        ELSE IF(JOB.EQ.8)THEN
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.9)THEN
+C               DEVICE FOR GRAPHICS OUTPUT IS 'TT' IF USING A
+C               VT640 ON A VAX. IF USING A SEPARATE TERMINAL FOR
+C               GRAPHICS OUTPUT THE TERMINAL PROTECTION MUST BE
+C               SET W:RW AND THE DEVICE WILL HAVE A DIFFERENT NAME
+C               FOR EXAMPLE 'GRAPHICS' WHERE THE LOGICAL NAME 'GRAPHICS'
+C               IS ASSIGNED TO THE APPROPRIATE DEVICE. A VT240 SHOULD
+C               BE COMPATIBLE WITH THE TEKTRONIX PLOT COMMANDS.
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.10)THEN
+C               DEVICE FOR VT100 COMPATIBLE TERMINAL WHEN USED
+C               FOR GELIN WHICH REQUIRES COMPLETE CONTROL OF
+C               CURSOR POSITIONING. ON A VAX IT IS CALLED 'TT'
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.11)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=1
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               RECORDTYPE='STREAM',ERR=100)
+      END IF
+      IOK=0
+      RETURN
+100   CONTINUE
+      IOK=1
+      RETURN
+      END
+      CHARACTER*1024 FUNCTION FNDFIL(NAME)
+      CHARACTER NAME*(*),TMPNAM*1024,PREFIX*1024
+      EXTERNAL NOTRL
+C  LOOKS IN ENVIRONMENT FOR FILE PATHS
+C  ? CASES DEALT WITH:
+C  1) IF NAME STARTS WITH / IT IS ABSOLUTE AND SO UNCHANGED
+C  2) IF NAME STARTS WITH ~/ REPLACE WITH HOME DIRECTORY NAME
+C  3) IF NAME INCLUDES ANY OTHER / LOOK IN ENVIRONMENT FOR
+C     NAME UPTO /, AND ADD PATH TO REST IF FOUND
+C     IF NOT FOUND ADD PATH TO CURRENT WORKING DIRECTORY
+C  4) ELSE LOOK IN ENVIRONMENT FOR WHOLE NAME
+C     IF NOT FOUND ADD PATH TO CURRENT DIRECTORY
+C     IF FOUND MAKE IT FILE NAME
+C
+      IF(NAME(1:1).EQ.'/') THEN
+        FNDFIL = NAME
+      ELSE IF (NAME(1:2).EQ.'~/') THEN
+        CALL GETENV('HOME',TMPNAM)
+        FNDFIL = TMPNAM(:NOTRL(TMPNAM,LEN(TMPNAM),' ')) //
+     +  NAME(2:NOTRL(NAME,LEN(NAME),' '))
+      ELSE IF (INDEX(NAME,'/').NE.0) THEN
+        TMPNAM = NAME(1:INDEX(NAME,'/')-1)
+        CALL GETENV(TMPNAM,PREFIX)
+        I = NOTRL(PREFIX,LEN(PREFIX),' ')
+        IF(I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    NAME(INDEX(NAME,'/'):)
+        END IF
+      ELSE
+        CALL GETENV(NAME,TMPNAM)
+        I = NOTRL(TMPNAM,LEN(TMPNAM),' ')
+        IF (I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = TMPNAM
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION INQF(NAME)
+      CHARACTER NAME*(*)
+      LOGICAL EX
+      INQUIRE(FILE=NAME,EXIST=EX)
+C
+C if the file exists return 1, else 0
+C
+      IF (EX) THEN
+        INQF = 1
+      ELSE
+        INQF = 0
+      END IF
+      END
+      INTEGER FUNCTION DELF(FN,IDEV,JRECL,JOB)
+      CHARACTER FN*(*)
+      CHARACTER FILNAM*1024,FNDFIL*1024
+      EXTERNAL FNDFIL
+C
+C routine to delete files in the most horrible way i could find
+C deal with job 2: old sequential formatted
+C deal with job 4: old direct unformatted
+C deal with job 7: old sequential unformatted
+C if the file exists and is not one of these error =-1
+C if the file exists and cannot be deleted error =-2
+C if the file cannot be opened error =-3
+C
+C on the sun this technique will delete any file if the directory
+C priviliges are set.
+C I dont know if i need to worry about the file types: could
+C i just open them all the same way? Although its a mess leave it!
+      FILNAM = FNDFIL(FN)
+      IF(JOB.EQ.2) THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='SEQUENTIAL',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.4)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='DIRECT',RECL=JRECL,ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.7)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  FORM='UNFORMATTED',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE
+        DELF = -1
+        RETURN
+      END IF
+      RETURN
+ 100  CONTINUE
+      DELF = -3
+      RETURN
+ 200  CONTINUE
+      DELF = -2
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP
+C  FOR BOTH DNA AND PROTEIN SEQUENCES
+C  THE INITIALISING ROUTINES ARE SENT THE CHARACTERSET SIZE IDM
+C  WHICH DETERMINES WHICH CHARACTERSET IS USED
+      SUBROUTINE INITLU(IDM)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255),POINT2(0:255)
+      CHARACTER DUP*16,DLOW*16,PUP*26,PLOW*26
+      COMMON /IASCI1/POINT1
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI1/
+      SAVE /IASCI2/
+      SAVE DUP,PUP,DLOW,PLOW
+      DATA DUP/'TCAG-RYWSMKHBVDN'/
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      DATA DLOW/'tcag-rywsmkhbvdn'/
+      DATA PLOW/'cstpagndeqbzhrkmilvfyw-x? '/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-5 FOR ACGT OR 1-26 FOR AMINO ACIDS BY USING ICHAR. 
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 5, OR 1 - 26
+C  WORKS ON UPPER AND LOWER CASE - REMOVE DLOW,PLOW AND LOOPS 41 AND 51
+C  IF LOWERCASE NOT ALLOWED
+C
+      IF(IDM.EQ.5)THEN
+        DO 30 I = 0,255
+          POINT1(I) = IDM
+          POINT2(I) = 17
+30      CONTINUE
+        DO 35 I = 1,5
+          J = ICHAR(DUP(I:I))
+          POINT1(J) = I
+35      CONTINUE
+        DO 36 I = 1,5
+          J = ICHAR(DLOW(I:I))
+          POINT1(J) = I
+36      CONTINUE
+        DO 40 I = 1,16
+          J = ICHAR(DUP(I:I))
+          POINT2(J) = I
+40      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('U')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        DO 41 I = 1,16
+          J = ICHAR(DLOW(I:I))
+          POINT2(J) = I
+41      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('u')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        ELSE IF(IDM.EQ.26)THEN
+          DO 45 I = 0,255
+            POINT1(I) = IDM
+45        CONTINUE
+C
+        DO 50 I = 1,26
+          J = ICHAR(PUP(I:I))
+          POINT1(J) = I
+50      CONTINUE
+        DO 51 I = 1,26
+          J = ICHAR(PLOW(I:I))
+          POINT1(J) = I
+51      CONTINUE
+        DO 60 I = 0,255
+          POINT2(I) = POINT1(I)
+60      CONTINUE
+      ELSE
+        WRITE(*,*)'ERROR INITIALISING CHARACTER LOOKUP POINTERS'
+      END IF
+      END
+      INTEGER FUNCTION IFROMP(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      IFROMP = POINT1(ICOL)
+      END
+      INTEGER FUNCTION CTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      CTONUM = POINT1(ICOL)
+      END
+      INTEGER FUNCTION DTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT2(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI2/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      DTONUM = POINT2(ICOL)
+      END
+      INTEGER FUNCTION IUBM(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, 2 FOR POSSIBLE, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,2,0,2,2,0,0,2,2,2,0,2,2,0,
+     +0,1,0,0,2,0,2,0,2,2,0,2,2,2,0,2,0,
+     +0,0,1,0,2,2,0,2,0,2,0,2,0,2,2,2,0,
+     +0,0,0,1,2,2,0,0,2,0,2,0,2,2,2,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,2,1,0,2,2,2,2,2,2,2,2,2,0,
+     +1,1,0,0,2,0,1,2,2,2,2,2,2,2,2,2,0,
+     +1,0,1,0,2,2,2,1,0,2,2,2,2,2,2,2,0,
+     +0,1,0,1,2,2,2,0,1,2,2,2,2,2,2,2,0,
+     +0,1,1,0,2,2,2,2,2,1,0,2,2,2,2,2,0,
+     +1,0,0,1,2,2,2,2,2,2,1,2,2,2,2,2,0,
+     +1,1,1,0,2,2,2,1,2,1,2,1,2,2,2,2,0,
+     +1,1,0,1,2,2,1,2,1,2,1,2,1,2,2,2,0,
+     +0,1,1,1,2,1,2,2,1,1,2,2,2,1,2,2,0,
+     +1,0,1,1,2,1,2,1,2,2,1,2,2,2,1,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM2(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A POSSIBLE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,1,0,1,1,0,0,1,1,1,0,1,1,0,
+     +0,1,0,0,1,0,1,0,1,1,0,1,1,1,0,1,0,
+     +0,0,1,0,1,1,0,1,0,1,0,1,0,1,1,1,0,
+     +0,0,0,1,1,1,0,0,1,0,1,0,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,1,1,0,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,0,1,0,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,0,
+     +0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,1,0,
+     +0,1,1,0,1,1,1,1,1,1,0,1,1,1,1,1,0,
+     +1,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM2 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM1(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM1 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+C
+C
+      SUBROUTINE FLUSHL(IDEV)
+      CALL FLUSH(IDEV)
+C       This implementation uses the Sun Fortran FLUSH statement.
+      END
+      SUBROUTINE FLUSHO
+C       The (single) unit number is currently hard-wired, but must
+C       be the same as KBOUT in UNITNO.
+      CALL FLUSHL(6)
+      END
+C
+      SUBROUTINE CEDIT(FILNAM)
+      CHARACTER COMAND*256,EDITOR*256,FILNAM*(*)
+      INTEGER SYSTEM,I
+      EXTERNAL NOTRL
+      CALL GETENV('SEQEDT',EDITOR)
+      I=NOTRL(EDITOR,LEN(EDITOR),' ')
+      IF (I.EQ.0) THEN
+        EDITOR='vi'
+        I=2
+      ENDIF
+      COMAND = EDITOR(1:I)//' '//FILNAM
+      ISTAT = SYSTEM(COMAND)
+      END
+      SUBROUTINE BSW2(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER*2 IN,OUT
+C Sun
+C        OUT(1:1) = IN(2:2)
+C        OUT(2:2) = IN(1:2)
+C Alliant
+        OUT = IN
+      END
+      SUBROUTINE BSW4(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER IN*4,OUT*4
+C Sun
+C      DO 10 I = 1,2
+C        OUT(I:I) = IN(5-I:5-I)
+C        OUT(5-I:5-I) = IN(I:I)
+C 10     CONTINUE
+C Alliant
+      OUT = IN
+      END
+      INTEGER FUNCTION IANDRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IAND
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.AND.LJIN
+C dec
+        IANDRS=IAND(IIN,JIN)
+C Alliant
+C        IANDRS=IAND(IIN,JIN)
+        END
+      INTEGER FUNCTION IORRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IOR
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.OR.LJIN
+C dec
+        IORRS=IOR(IIN,JIN)
+C Alliant
+C        IORRS=IOR(IIN,JIN)
+        END
+       SUBROUTINE WRITEB(CHARS,NCHAR,IDEVGR)
+C   AUTHOR: RODGER STADEN 
+       CHARACTER CHARS*(*)
+       SAVE NUL
+       DATA NUL/0/
+C   NOTE NULS STOP CARRIAGE RETURN, LINE FEED
+C Sun
+       WRITE(IDEVGR,1000)CHARS(1:NCHAR)
+1000   FORMAT(A,$)
+C Alliant
+C      WRITE(IDEVGR,1000)NUL,CHARS(1:NCHAR)
+C1000   FORMAT(A1,A)
+       END
+      SUBROUTINE SABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BITNUM
+C
+C set bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BITNUM = MOD(I,32)
+C      WRITE(*,*)'ELEMENT, BIT',ELEMNT,BITNUM
+      CALL SBITRS(ARRAY(ELEMNT),BITNUM)
+      END
+      SUBROUTINE AAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IANDRS
+      EXTERNAL IANDRS
+C
+C and array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IANDRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      SUBROUTINE OAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IORRS
+      EXTERNAL IORRS
+C
+C OR array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IORRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      LOGICAL FUNCTION TABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BIT
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C test bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BIT = MOD(I,32)
+C      WRITE(*,*)'TEST ELEMENT, BIT',ELEMNT,BIT
+      TABIT = .FALSE.
+      IF (ARRAY(ELEMNT).NE.0) THEN
+        TABIT = TBITRS(ARRAY(ELEMNT),BIT)
+      END IF 
+      END
+      SUBROUTINE CABIT(ARRAY,MAXAR)
+      INTEGER ARRAY(0:MAXAR),POSN
+C
+C clear all bits in array (1 word at a time)
+C
+      DO 10 POSN=0,MAXAR
+          ARRAY(POSN) = 0
+ 10     CONTINUE
+      END
+      SUBROUTINE SBITRS(WORD,BITNUM)
+      INTEGER MASKS(0:31),WORD,BITNUM
+      EXTERNAL IORRS
+      SAVE MASKS
+      DATA MASKS/1,2,4,8,16,32,64,128,256,512,1024,2048,4096,
+     +8192,16384,32768,65536,131072,262144,524288,1048576,
+     +2097152,4194304,8388608,16777216,33554432,67108864,
+     +134217728,268435456,536870912,1073741824,-2147483648/
+      WORD = IORRS(WORD,MASKS(BITNUM))
+      END
+      LOGICAL FUNCTION TBITRS(WORD,BITNUM)
+      INTEGER MASKS(0:31),WORD,BITNUM
+      EXTERNAL IANDRS
+      SAVE MASKS
+      DATA MASKS/1,2,4,8,16,32,64,128,256,512,1024,2048,4096,
+     +8192,16384,32768,65536,131072,262144,524288,1048576,
+     +2097152,4194304,8388608,16777216,33554432,67108864,
+     +134217728,268435456,536870912,1073741824,-2147483648/
+      I = MASKS(BITNUM)
+      TBITRS = (IANDRS(WORD,I).EQ.I)
+      END
+      INTEGER FUNCTION GETRS(IDEV,STRING,NSTRNG,IBYTE)
+      CHARACTER STRING*(*)
+      INTEGER SEEKRS
+      EXTERNAL SEEKRS
+      GETRS = SEEKRS(IDEV,IBYTE-1)
+      IF(GETRS.NE.0) RETURN
+      READ(IDEV,1000,ERR=20,END=20)STRING(1:NSTRNG)
+ 1000 FORMAT(A)
+      GETRS = 0
+      RETURN
+ 20   CONTINUE
+      GETRS = -1
+      END
+      INTEGER FUNCTION GETRSL(IDEV,STRING,NSTRNG)
+      CHARACTER STRING*(*)
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      READ(IDEV,1000,ERR=100,END=200)STRING(1:NSTRNG)
+ 1000 FORMAT(A)
+      DO 10 I=1,NSTRNG
+        IF(STRING(I:I).EQ.CR) THEN
+          GETRSL = I-1
+          RETURN
+        END IF
+ 10     CONTINUE
+      GETRSL = NSTRNG
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR READING IN GETRSL'
+      GETRSL = -2
+      RETURN
+ 200  CONTINUE
+      GETRSL = -1
+      WRITE(*,*)'END IN GETRSL'
+      END
+      INTEGER FUNCTION SEEKRS(IDEV,IREC)
+      INTEGER FSEEK
+      SEEKRS = FSEEK(IDEV,IREC,0)
+      END
diff --git a/src/staden/seeme-sg.f b/src/staden/seeme-sg.f
new file mode 100644
index 0000000..28bed53
--- /dev/null
+++ b/src/staden/seeme-sg.f
@@ -0,0 +1,680 @@
+C  SEEME: MACHINE SPECIFIC ROUTINES for silicon graphics
+C 14-8-91 Added check in openrs for empty file names
+C 27-8-91 Started files xspec.f and nxspec.f and split out the nonx
+C         and X specific bits from here.
+C 27-8-91 Added FLUSHL to flush unit idev
+C 15-10-91 Moved IANDRS, IORRS and WRITEB from pl4010.f
+C 29-10-91 Replaced lnblnk ( a sun routine) by notrl
+C 15-11-91 Added routines to call inquire to see if a file exists
+C          and another to delete a file
+C 25-2-92 added more bit handling routines for use by sequence library
+C         searching routines
+C 20-5-92 Added readonly where required for dec!
+C         moved getrs, getrsl and seekrs from seqlibsubs.f to here
+C   AUTHOR: RODGER STADEN
+C   25-JAN-1990 INILU: SAVE DUP,PUP,DLOW,PLOW
+C   UNITNO
+C   OPENRS
+C THE FOLLOWING ARE PROBABLY COMPLETELY PORTABLE
+C BUT ARE INCLUDED ANY WAY
+C   INITLU
+C   IFROMP
+C   CTONUM
+C   DTONUM
+C   IUBM
+C   IUBM1
+C   IUBM2
+C   bsw4
+C   bsw2
+      SUBROUTINE UNITNO(KBIN,KBOUT,DEVNOS,NDEVS)
+C   AUTHOR: RODGER STADEN
+      INTEGER DEVNOS(NDEVS)
+C   SETS CONSOLE I/O UNITS TO 5,6. OTHERS 10 ONWARDS
+      KBIN=5
+      KBOUT=6
+      ISTART=10
+      DO 1 I=1,NDEVS
+      DEVNOS(I)=ISTART+I-1
+1     CONTINUE
+      CALL INITRS()
+      END
+      SUBROUTINE OPENRS(IDEV,FN,IOK,LRECL,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER     FN*(*)
+      CHARACTER*1024 FILNAM,FNDFIL
+      EXTERNAL FNDFIL,NOTRL,INQF
+C 14-8-91 Added check for blank file names
+C 14-11-91 Added routines to check for file existence
+C       ROUTINE TO OPEN FILES
+C   JOB=1 SEQUENTIAL FORMATTED NEW, DEFAULT RECORD LENGTH
+C      =2 SEQUENTIAL FORMATTED OLD, DEFAULT RECORD LENGTH
+C      =3 DIRECT ACCESS, UNFORMATTED NEW
+C      =4 DIRECT ACCESS, UNFORMATTED OLD
+C      =5 DIRECT ACCESS, UNFORMATTED OLD
+C      =6 UNFORMATTED NEW
+C      =7 UNFORMATTED OLD
+C      =8 STATUS='UNKNOWN'
+C      =9 TERMINAL FOR GRAPHICS
+C      =10 TERMINAL FOR VT100
+C      =11 direct access, single byte record length, old
+C   RECORD LENGTHS SENT IN WORDS
+C   READONLY IS VAX SPECIFIC*********************
+C   NOTE THAT ONLY FOR DIRECT ACCESS FILES ARE RECORD
+C   LENGTHS SPECIFIED. ON THE VAX UNFORMATTED DIRECT ACCESS
+C   RECORDS ARE DEFINED IN WORDS; ON OTHER MACHINES E.G. SOME UNIX
+C   MACHINES THEY ARE SPECIFIED IN BYTES. THE VARIABLE IWORDL IS
+C   USED TO MULTIPLY RECORD LENGTHS SPECIFIED IN WORDS TO GIVE
+C   THE CORRECT VALUE FOR THE MACHINE. ON A VAX IWORDL=1; ON
+C   THE UNIX MACHINE IWORDL=4
+       PARAMETER (IWORDL=4)
+      IF(NOTRL(FN,LEN(FN),' ').EQ.0) GO TO 100
+      IF (JOB.LT.9 .OR. JOB.GT.10) THEN
+        FILNAM = FNDFIL(FN)
+        INQJ = INQF(FILNAM)
+      END IF
+      IF(JOB.EQ.1) THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.2)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.3)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.4)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.5)THEN
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.6)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               FORM='UNFORMATTED',ERR=100)
+      ELSE IF(JOB.EQ.7)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,
+     +               FORM='UNFORMATTED',ERR=100)
+        ELSE IF(JOB.EQ.8)THEN
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.9)THEN
+C               DEVICE FOR GRAPHICS OUTPUT IS 'TT' IF USING A
+C               VT640 ON A VAX. IF USING A SEPARATE TERMINAL FOR
+C               GRAPHICS OUTPUT THE TERMINAL PROTECTION MUST BE
+C               SET W:RW AND THE DEVICE WILL HAVE A DIFFERENT NAME
+C               FOR EXAMPLE 'GRAPHICS' WHERE THE LOGICAL NAME 'GRAPHICS'
+C               IS ASSIGNED TO THE APPROPRIATE DEVICE. A VT240 SHOULD
+C               BE COMPATIBLE WITH THE TEKTRONIX PLOT COMMANDS.
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.10)THEN
+C               DEVICE FOR VT100 COMPATIBLE TERMINAL WHEN USED
+C               FOR GELIN WHICH REQUIRES COMPLETE CONTROL OF
+C               CURSOR POSITIONING. ON A VAX IT IS CALLED 'TT'
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.11)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=1
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               READONLY,ACCESS='DIRECT',RECL=1,ERR=100)
+      END IF
+      IOK=0
+      RETURN
+100   CONTINUE
+      IOK=1
+      RETURN
+      END
+      CHARACTER*1024 FUNCTION FNDFIL(NAME)
+      CHARACTER NAME*(*),TMPNAM*1024,PREFIX*1024
+      EXTERNAL NOTRL
+C  LOOKS IN ENVIRONMENT FOR FILE PATHS
+C  ? CASES DEALT WITH:
+C  1) IF NAME STARTS WITH / IT IS ABSOLUTE AND SO UNCHANGED
+C  2) IF NAME STARTS WITH ~/ REPLACE WITH HOME DIRECTORY NAME
+C  3) IF NAME INCLUDES ANY OTHER / LOOK IN ENVIRONMENT FOR
+C     NAME UPTO /, AND ADD PATH TO REST IF FOUND
+C     IF NOT FOUND ADD PATH TO CURRENT WORKING DIRECTORY
+C  4) ELSE LOOK IN ENVIRONMENT FOR WHOLE NAME
+C     IF NOT FOUND ADD PATH TO CURRENT DIRECTORY
+C     IF FOUND MAKE IT FILE NAME
+C
+      IF(NAME(1:1).EQ.'/') THEN
+        FNDFIL = NAME
+      ELSE IF (NAME(1:2).EQ.'~/') THEN
+        CALL GETENV('HOME',TMPNAM)
+        FNDFIL = TMPNAM(:NOTRL(TMPNAM,LEN(TMPNAM),' ')) //
+     +  NAME(2:NOTRL(NAME,LEN(NAME),' '))
+      ELSE IF (INDEX(NAME,'/').NE.0) THEN
+        TMPNAM = NAME(1:INDEX(NAME,'/')-1)
+        CALL GETENV(TMPNAM,PREFIX)
+        I = NOTRL(PREFIX,LEN(PREFIX),' ')
+        IF(I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    NAME(INDEX(NAME,'/'):)
+        END IF
+      ELSE
+        CALL GETENV(NAME,TMPNAM)
+        I = NOTRL(TMPNAM,LEN(TMPNAM),' ')
+        IF (I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = TMPNAM
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION INQF(NAME)
+      CHARACTER NAME*(*)
+      LOGICAL EX
+      INQUIRE(FILE=NAME,EXIST=EX)
+C
+C if the file exists return 1, else 0
+C
+      IF (EX) THEN
+        INQF = 1
+      ELSE
+        INQF = 0
+      END IF
+      END
+      INTEGER FUNCTION DELF(FN,IDEV,JRECL,JOB)
+      CHARACTER FN*(*)
+      CHARACTER FILNAM*1024,FNDFIL*1024
+      EXTERNAL FNDFIL
+C
+C routine to delete files in the most horrible way i could find
+C deal with job 2: old sequential formatted
+C deal with job 4: old direct unformatted
+C deal with job 7: old sequential unformatted
+C if the file exists and is not one of these error =-1
+C if the file exists and cannot be deleted error =-2
+C if the file cannot be opened error =-3
+C
+C on the sun this technique will delete any file if the directory
+C priviliges are set.
+C I dont know if i need to worry about the file types: could
+C i just open them all the same way? Although its a mess leave it!
+      FILNAM = FNDFIL(FN)
+      IF(JOB.EQ.2) THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='SEQUENTIAL',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.4)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='DIRECT',RECL=JRECL,ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.7)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  FORM='UNFORMATTED',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE
+        DELF = -1
+        RETURN
+      END IF
+      RETURN
+ 100  CONTINUE
+      DELF = -3
+      RETURN
+ 200  CONTINUE
+      DELF = -2
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP
+C  FOR BOTH DNA AND PROTEIN SEQUENCES
+C  THE INITIALISING ROUTINES ARE SENT THE CHARACTERSET SIZE IDM
+C  WHICH DETERMINES WHICH CHARACTERSET IS USED
+      SUBROUTINE INITLU(IDM)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255),POINT2(0:255)
+      CHARACTER DUP*16,DLOW*16,PUP*26,PLOW*26
+      COMMON /IASCI1/POINT1
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI1/
+      SAVE /IASCI2/
+      SAVE DUP,PUP,DLOW,PLOW
+      DATA DUP/'TCAG-RYWSMKHBVDN'/
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      DATA DLOW/'tcag-rywsmkhbvdn'/
+      DATA PLOW/'cstpagndeqbzhrkmilvfyw-x? '/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-5 FOR ACGT OR 1-26 FOR AMINO ACIDS BY USING ICHAR. 
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 5, OR 1 - 26
+C  WORKS ON UPPER AND LOWER CASE - REMOVE DLOW,PLOW AND LOOPS 41 AND 51
+C  IF LOWERCASE NOT ALLOWED
+C
+      IF(IDM.EQ.5)THEN
+        DO 30 I = 0,255
+          POINT1(I) = IDM
+          POINT2(I) = 17
+30      CONTINUE
+        DO 35 I = 1,5
+          J = ICHAR(DUP(I:I))
+          POINT1(J) = I
+35      CONTINUE
+        DO 36 I = 1,5
+          J = ICHAR(DLOW(I:I))
+          POINT1(J) = I
+36      CONTINUE
+        DO 40 I = 1,16
+          J = ICHAR(DUP(I:I))
+          POINT2(J) = I
+40      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('U')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        DO 41 I = 1,16
+          J = ICHAR(DLOW(I:I))
+          POINT2(J) = I
+41      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('u')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        ELSE IF(IDM.EQ.26)THEN
+          DO 45 I = 0,255
+            POINT1(I) = IDM
+45        CONTINUE
+C
+        DO 50 I = 1,26
+          J = ICHAR(PUP(I:I))
+          POINT1(J) = I
+50      CONTINUE
+        DO 51 I = 1,26
+          J = ICHAR(PLOW(I:I))
+          POINT1(J) = I
+51      CONTINUE
+        DO 60 I = 0,255
+          POINT2(I) = POINT1(I)
+60      CONTINUE
+      ELSE
+        WRITE(*,*)'ERROR INITIALISING CHARACTER LOOKUP POINTERS'
+      END IF
+      END
+      INTEGER FUNCTION IFROMP(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      IFROMP = POINT1(ICOL)
+      END
+      INTEGER FUNCTION CTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      CTONUM = POINT1(ICOL)
+      END
+      INTEGER FUNCTION DTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT2(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI2/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      DTONUM = POINT2(ICOL)
+      END
+      INTEGER FUNCTION IUBM(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, 2 FOR POSSIBLE, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,2,0,2,2,0,0,2,2,2,0,2,2,0,
+     +0,1,0,0,2,0,2,0,2,2,0,2,2,2,0,2,0,
+     +0,0,1,0,2,2,0,2,0,2,0,2,0,2,2,2,0,
+     +0,0,0,1,2,2,0,0,2,0,2,0,2,2,2,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,2,1,0,2,2,2,2,2,2,2,2,2,0,
+     +1,1,0,0,2,0,1,2,2,2,2,2,2,2,2,2,0,
+     +1,0,1,0,2,2,2,1,0,2,2,2,2,2,2,2,0,
+     +0,1,0,1,2,2,2,0,1,2,2,2,2,2,2,2,0,
+     +0,1,1,0,2,2,2,2,2,1,0,2,2,2,2,2,0,
+     +1,0,0,1,2,2,2,2,2,2,1,2,2,2,2,2,0,
+     +1,1,1,0,2,2,2,1,2,1,2,1,2,2,2,2,0,
+     +1,1,0,1,2,2,1,2,1,2,1,2,1,2,2,2,0,
+     +0,1,1,1,2,1,2,2,1,1,2,2,2,1,2,2,0,
+     +1,0,1,1,2,1,2,1,2,2,1,2,2,2,1,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM2(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A POSSIBLE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,1,0,1,1,0,0,1,1,1,0,1,1,0,
+     +0,1,0,0,1,0,1,0,1,1,0,1,1,1,0,1,0,
+     +0,0,1,0,1,1,0,1,0,1,0,1,0,1,1,1,0,
+     +0,0,0,1,1,1,0,0,1,0,1,0,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,1,1,0,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,0,1,0,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,0,
+     +0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,1,0,
+     +0,1,1,0,1,1,1,1,1,1,0,1,1,1,1,1,0,
+     +1,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM2 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM1(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM1 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+C
+C
+      SUBROUTINE FLUSHL(IDEV)
+      CALL FLUSH(IDEV)
+C       This implementation uses the Sun Fortran FLUSH statement.
+      END
+      SUBROUTINE FLUSHO
+C       The (single) unit number is currently hard-wired, but must
+C       be the same as KBOUT in UNITNO.
+      CALL FLUSHL(6)
+      END
+C
+      SUBROUTINE CEDIT(FILNAM)
+      CHARACTER COMAND*256,EDITOR*256,FILNAM*(*)
+      INTEGER SYSTEM,I
+      EXTERNAL NOTRL
+      CALL GETENV('SEQEDT',EDITOR)
+      I=NOTRL(EDITOR,LEN(EDITOR),' ')
+      IF (I.EQ.0) THEN
+        EDITOR='vi'
+        I=2
+      ENDIF
+      COMAND = EDITOR(1:I)//' '//FILNAM
+      ISTAT = SYSTEM(COMAND)
+      END
+      SUBROUTINE BSW2(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER*2 IN,OUT
+C Sun
+        OUT(1:1) = IN(2:2)
+        OUT(2:2) = IN(1:2)
+C Alliant
+C        OUT = IN
+      END
+      SUBROUTINE BSW4(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER IN*4,OUT*4
+C Sun
+      DO 10 I = 1,2
+        OUT(I:I) = IN(5-I:5-I)
+        OUT(5-I:5-I) = IN(I:I)
+ 10     CONTINUE
+C Alliant
+C      OUT = IN
+      END
+      INTEGER FUNCTION IANDRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IAND
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.AND.LJIN
+C dec
+        IANDRS=IAND(IIN,JIN)
+C Alliant
+C        IANDRS=IAND(IIN,JIN)
+        END
+      INTEGER FUNCTION IORRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IOR
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.OR.LJIN
+C dec
+        IORRS=IOR(IIN,JIN)
+C Alliant
+C        IORRS=IOR(IIN,JIN)
+        END
+       SUBROUTINE WRITEB(CHARS,NCHAR,IDEVGR)
+C   AUTHOR: RODGER STADEN 
+       CHARACTER CHARS*(*)
+       SAVE NUL
+       DATA NUL/0/
+C   NOTE NULS STOP CARRIAGE RETURN, LINE FEED
+C Sun
+       WRITE(IDEVGR,1000)CHARS(1:NCHAR)
+1000   FORMAT(A,$)
+C Alliant
+C      WRITE(IDEVGR,1000)NUL,CHARS(1:NCHAR)
+C1000   FORMAT(A1,A)
+       END
+      SUBROUTINE SABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BITNUM
+C
+C set bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BITNUM = MOD(I,32)
+C      WRITE(*,*)'ELEMENT, BIT',ELEMNT,BITNUM
+      CALL SBITRS(ARRAY(ELEMNT),BITNUM)
+      END
+      SUBROUTINE AAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IANDRS
+      EXTERNAL IANDRS
+C
+C and array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IANDRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      SUBROUTINE OAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IORRS
+      EXTERNAL IORRS
+C
+C OR array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IORRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      LOGICAL FUNCTION TABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BIT
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C test bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BIT = MOD(I,32)
+C      WRITE(*,*)'TEST ELEMENT, BIT',ELEMNT,BIT
+      TABIT = .FALSE.
+      IF (ARRAY(ELEMNT).NE.0) THEN
+        TABIT = TBITRS(ARRAY(ELEMNT),BIT)
+      END IF 
+      END
+      SUBROUTINE CABIT(ARRAY,MAXAR)
+      INTEGER ARRAY(0:MAXAR),POSN
+C
+C clear all bits in array (1 word at a time)
+C
+      DO 10 POSN=0,MAXAR
+          ARRAY(POSN) = 0
+ 10     CONTINUE
+      END
+      SUBROUTINE SBITRS(WORD,BITNUM)
+      INTEGER MASKS(0:31),WORD,BITNUM
+      EXTERNAL IORRS
+      SAVE MASKS
+      DATA MASKS/1,2,4,8,16,32,64,128,256,512,1024,2048,4096,
+     +8192,16384,32768,65536,131072,262144,524288,1048576,
+     +2097152,4194304,8388608,16777216,33554432,67108864,
+     +134217728,268435456,536870912,1073741824,-2147483648/
+      WORD = IORRS(WORD,MASKS(BITNUM))
+      END
+      LOGICAL FUNCTION TBITRS(WORD,BITNUM)
+      INTEGER MASKS(0:31),WORD,BITNUM
+      EXTERNAL IANDRS
+      SAVE MASKS
+      DATA MASKS/1,2,4,8,16,32,64,128,256,512,1024,2048,4096,
+     +8192,16384,32768,65536,131072,262144,524288,1048576,
+     +2097152,4194304,8388608,16777216,33554432,67108864,
+     +134217728,268435456,536870912,1073741824,-2147483648/
+      I = MASKS(BITNUM)
+      TBITRS = (IANDRS(WORD,I).EQ.I)
+      END
+      INTEGER FUNCTION GETRS(IDEV,STRING,NSTRNG,IBYTE)
+      CHARACTER STRING*(*)
+CRS_YUK
+C      CHARACTER ST
+      INTEGER FTELL
+      EXTERNAL FTELL
+      IREC = IBYTE
+      DO 10 I = 1,NSTRNG
+        READ(IDEV,REC=IREC,ERR=20)STRING(I:I)
+CRS_YUK-START
+C        READ(IDEV,REC=IREC)st
+C        STRING(I:I) = st
+C        WRITE(*,*)IREC,STRING(I:I),FTELL(IDEV)
+CRS_YUK-END
+        IREC = IREC + 1
+ 10   CONTINUE
+      GETRS = 0
+      RETURN
+ 20   CONTINUE
+      GETRS = -1
+      END
+      INTEGER FUNCTION GETRSL(IDEV,STRING,NSTRNG)
+      CHARACTER STRING*(*)
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      READ(IDEV,1000,ERR=100,END=200)STRING(1:NSTRNG)
+ 1000 FORMAT(A)
+      DO 10 I=1,NSTRNG
+        IF(STRING(I:I).EQ.CR) THEN
+          GETRSL = I-1
+          RETURN
+        END IF
+ 10     CONTINUE
+      GETRSL = NSTRNG
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR READING IN GETRSL'
+      GETRSL = -2
+      RETURN
+ 200  CONTINUE
+      GETRSL = -1
+      WRITE(*,*)'END IN GETRSL'
+      END
+      INTEGER FUNCTION SEEKRS(IDEV,IREC)
+      INTEGER FSEEK
+      SEEKRS = FSEEK(IDEV,IREC,0)
+      END
diff --git a/src/staden/seeme.f b/src/staden/seeme.f
new file mode 100644
index 0000000..9e35d7a
--- /dev/null
+++ b/src/staden/seeme.f
@@ -0,0 +1,705 @@
+C  SEEME: MACHINE SPECIFIC ROUTINES
+C 14-8-91 Added check in openrs for empty file names
+C 27-8-91 Started files xspec.f and nxspec.f and split out the nonx
+C         and X specific bits from here.
+C 27-8-91 Added FLUSHL to flush unit idev
+C 15-10-91 Moved IANDRS, IORRS and WRITEB from pl4010.f
+C 29-10-91 Replaced lnblnk ( a sun routine) by notrl
+C 15-11-91 Added routines to call inquire to see if a file exists
+C          and another to delete a file
+C 25-2-92 added more bit handling routines for use by sequence library
+C         searching routines
+C moved getrs, getrsl and fseekrs here from seqlibsubs.f
+C   AUTHOR: RODGER STADEN
+C   25-JAN-1990 INILU: SAVE DUP,PUP,DLOW,PLOW
+C   UNITNO
+C   OPENRS
+C THE FOLLOWING ARE PROBABLY COMPLETELY PORTABLE
+C BUT ARE INCLUDED ANY WAY
+C   INITLU
+C   IFROMP
+C   CTONUM
+C   DTONUM
+C   IUBM
+C   IUBM1
+C   IUBM2
+C   bsw4
+C   bsw2
+      SUBROUTINE UNITNO(KBIN,KBOUT,DEVNOS,NDEVS)
+C   AUTHOR: RODGER STADEN
+      INTEGER DEVNOS(NDEVS)
+C   SETS CONSOLE I/O UNITS TO 5,6. OTHERS 10 ONWARDS
+      KBIN=5
+      KBOUT=6
+      ISTART=10
+      DO 1 I=1,NDEVS
+      DEVNOS(I)=ISTART+I-1
+1     CONTINUE
+      CALL INITRS()
+      END
+      SUBROUTINE OPENRS(IDEV,FN,IOK,LRECL,JOB)
+C   AUTHOR: RODGER STADEN
+      CHARACTER     FN*(*)
+      CHARACTER*1024 FILNAM,FNDFIL
+      EXTERNAL FNDFIL,NOTRL,INQF
+C 14-8-91 Added check for blank file names
+C 14-11-91 Added routines to check for file existence
+C       ROUTINE TO OPEN FILES
+C   JOB=1 SEQUENTIAL FORMATTED NEW, DEFAULT RECORD LENGTH
+C      =2 SEQUENTIAL FORMATTED OLD, DEFAULT RECORD LENGTH
+C      =3 DIRECT ACCESS, UNFORMATTED NEW
+C      =4 DIRECT ACCESS, UNFORMATTED OLD
+C      =5 DIRECT ACCESS, UNFORMATTED OLD
+C      =6 UNFORMATTED NEW
+C      =7 UNFORMATTED OLD
+C      =8 STATUS='UNKNOWN'
+C      =9 TERMINAL FOR GRAPHICS
+C      =10 TERMINAL FOR VT100
+C      =11 direct access, single byte record length, old
+C   RECORD LENGTHS SENT IN WORDS
+C   READONLY IS VAX SPECIFIC*********************
+C   NOTE THAT ONLY FOR DIRECT ACCESS FILES ARE RECORD
+C   LENGTHS SPECIFIED. ON THE VAX UNFORMATTED DIRECT ACCESS
+C   RECORDS ARE DEFINED IN WORDS; ON OTHER MACHINES E.G. SOME UNIX
+C   MACHINES THEY ARE SPECIFIED IN BYTES. THE VARIABLE IWORDL IS
+C   USED TO MULTIPLY RECORD LENGTHS SPECIFIED IN WORDS TO GIVE
+C   THE CORRECT VALUE FOR THE MACHINE. ON A VAX IWORDL=1; ON
+C   THE UNIX MACHINE IWORDL=4
+       PARAMETER (IWORDL=4)
+      IF(NOTRL(FN,LEN(FN),' ').EQ.0) GO TO 100
+      IF (JOB.LT.9 .OR. JOB.GT.10) THEN
+        FILNAM = FNDFIL(FN)
+        INQJ = INQF(FILNAM)
+      END IF
+      IF(JOB.EQ.1) THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.2)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='SEQUENTIAL',ERR=100)
+      ELSE IF(JOB.EQ.3)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.4)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.5)THEN
+                JRECL=LRECL*IWORDL
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      ELSE IF(JOB.EQ.6)THEN
+        IF(INQJ.NE.0) THEN
+C  file exists so tell the user
+          IOK = 2
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='NEW',
+     +               FORM='UNFORMATTED',ERR=100)
+      ELSE IF(JOB.EQ.7)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               FORM='UNFORMATTED',ERR=100)
+        ELSE IF(JOB.EQ.8)THEN
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.9)THEN
+C               DEVICE FOR GRAPHICS OUTPUT IS 'TT' IF USING A
+C               VT640 ON A VAX. IF USING A SEPARATE TERMINAL FOR
+C               GRAPHICS OUTPUT THE TERMINAL PROTECTION MUST BE
+C               SET W:RW AND THE DEVICE WILL HAVE A DIFFERENT NAME
+C               FOR EXAMPLE 'GRAPHICS' WHERE THE LOGICAL NAME 'GRAPHICS'
+C               IS ASSIGNED TO THE APPROPRIATE DEVICE. A VT240 SHOULD
+C               BE COMPATIBLE WITH THE TEKTRONIX PLOT COMMANDS.
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.10)THEN
+C               DEVICE FOR VT100 COMPATIBLE TERMINAL WHEN USED
+C               FOR GELIN WHICH REQUIRES COMPLETE CONTROL OF
+C               CURSOR POSITIONING. ON A VAX IT IS CALLED 'TT'
+                OPEN(UNIT=IDEV,FILE='/dev/tty',STATUS='UNKNOWN',
+     +               ERR=100)
+        ELSE IF(JOB.EQ.11)THEN
+        IF(INQJ.NE.1) THEN
+C  file inquire shows file does not exist
+C  so tell the user
+          IOK = 3
+          RETURN
+        END IF
+                JRECL=1
+                OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +               ACCESS='DIRECT',RECL=JRECL,ERR=100)
+      END IF
+      IOK=0
+      RETURN
+100   CONTINUE
+      IOK=1
+      RETURN
+      END
+      CHARACTER*1024 FUNCTION FNDFIL(NAME)
+      CHARACTER NAME*(*),TMPNAM*1024,PREFIX*1024
+      EXTERNAL NOTRL
+C  LOOKS IN ENVIRONMENT FOR FILE PATHS
+C  ? CASES DEALT WITH:
+C  1) IF NAME STARTS WITH / IT IS ABSOLUTE AND SO UNCHANGED
+C  2) IF NAME STARTS WITH ~/ REPLACE WITH HOME DIRECTORY NAME
+C  3) IF NAME INCLUDES ANY OTHER / LOOK IN ENVIRONMENT FOR
+C     NAME UPTO /, AND ADD PATH TO REST IF FOUND
+C     IF NOT FOUND ADD PATH TO CURRENT WORKING DIRECTORY
+C  4) ELSE LOOK IN ENVIRONMENT FOR WHOLE NAME
+C     IF NOT FOUND ADD PATH TO CURRENT DIRECTORY
+C     IF FOUND MAKE IT FILE NAME
+C
+      IF(NAME(1:1).EQ.'/') THEN
+        FNDFIL = NAME
+      ELSE IF (NAME(1:2).EQ.'~/') THEN
+        CALL GETENV('HOME',TMPNAM)
+        FNDFIL = TMPNAM(:NOTRL(TMPNAM,LEN(TMPNAM),' ')) //
+     +  NAME(2:NOTRL(NAME,LEN(NAME),' '))
+      ELSE IF (INDEX(NAME,'/').NE.0) THEN
+        TMPNAM = NAME(1:INDEX(NAME,'/')-1)
+        CALL GETENV(TMPNAM,PREFIX)
+        I = NOTRL(PREFIX,LEN(PREFIX),' ')
+        IF(I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    NAME(INDEX(NAME,'/'):)
+        END IF
+      ELSE
+        CALL GETENV(NAME,TMPNAM)
+        I = NOTRL(TMPNAM,LEN(TMPNAM),' ')
+        IF (I.EQ.0) THEN
+          CALL GETCWD(PREFIX)
+          FNDFIL = PREFIX(:NOTRL(PREFIX,LEN(PREFIX),' ')) //
+     +    '/' // NAME(:NOTRL(NAME,LEN(NAME),' '))
+        ELSE
+          FNDFIL = TMPNAM
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION INQF(NAME)
+      CHARACTER NAME*(*)
+      LOGICAL EX
+      INQUIRE(FILE=NAME,EXIST=EX)
+C
+C if the file exists return 1, else 0
+C
+      IF (EX) THEN
+        INQF = 1
+      ELSE
+        INQF = 0
+      END IF
+      END
+      INTEGER FUNCTION DELF(FN,IDEV,JRECL,JOB)
+      CHARACTER FN*(*)
+      CHARACTER FILNAM*1024,FNDFIL*1024
+      EXTERNAL FNDFIL
+C
+C routine to delete files in the most horrible way i could find
+C deal with job 2: old sequential formatted
+C deal with job 4: old direct unformatted
+C deal with job 7: old sequential unformatted
+C if the file exists and is not one of these error =-1
+C if the file exists and cannot be deleted error =-2
+C if the file cannot be opened error =-3
+C
+C on the sun this technique will delete any file if the directory
+C priviliges are set.
+C I dont know if i need to worry about the file types: could
+C i just open them all the same way? Although its a mess leave it!
+      FILNAM = FNDFIL(FN)
+      IF(JOB.EQ.2) THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='SEQUENTIAL',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.4)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  ACCESS='DIRECT',RECL=JRECL,ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE IF(JOB.EQ.7)THEN
+        OPEN(UNIT=IDEV,FILE=FILNAM,STATUS='OLD',
+     +  FORM='UNFORMATTED',ERR=100)
+        CLOSE(UNIT=IDEV,STATUS='DELETE',ERR=200)
+        DELF = 0
+      ELSE
+        DELF = -1
+        RETURN
+      END IF
+      RETURN
+ 100  CONTINUE
+      DELF = -3
+      RETURN
+ 200  CONTINUE
+      DELF = -2
+      END
+C  ROUTINES TO CONTROL CHARACTER LOOKUP
+C  FOR BOTH DNA AND PROTEIN SEQUENCES
+C  THE INITIALISING ROUTINES ARE SENT THE CHARACTERSET SIZE IDM
+C  WHICH DETERMINES WHICH CHARACTERSET IS USED
+      SUBROUTINE INITLU(IDM)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255),POINT2(0:255)
+      CHARACTER DUP*16,DLOW*16,PUP*26,PLOW*26
+      COMMON /IASCI1/POINT1
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI1/
+      SAVE /IASCI2/
+      SAVE DUP,PUP,DLOW,PLOW
+      DATA DUP/'TCAG-RYWSMKHBVDN'/
+      DATA PUP/'CSTPAGNDEQBZHRKMILVFYW-X? '/
+      DATA DLOW/'tcag-rywsmkhbvdn'/
+      DATA PLOW/'cstpagndeqbzhrkmilvfyw-x? '/
+C  ICHAR RETURNS THE COLLATING SEQUENCE NUMBER
+C  I WANT 1-5 FOR ACGT OR 1-26 FOR AMINO ACIDS BY USING ICHAR. 
+C  THE ACTUAL VALUE RETURNED BY ICHAR IS NOT PORTABLE 
+C  SO I NEED TO INITIALIZE POINTR SO THAT THE CORRECT 
+C  ELEMENTS CONTAIN VALUES 1 - 5, OR 1 - 26
+C  WORKS ON UPPER AND LOWER CASE - REMOVE DLOW,PLOW AND LOOPS 41 AND 51
+C  IF LOWERCASE NOT ALLOWED
+C
+      IF(IDM.EQ.5)THEN
+        DO 30 I = 0,255
+          POINT1(I) = IDM
+          POINT2(I) = 17
+30      CONTINUE
+        DO 35 I = 1,5
+          J = ICHAR(DUP(I:I))
+          POINT1(J) = I
+35      CONTINUE
+        DO 36 I = 1,5
+          J = ICHAR(DLOW(I:I))
+          POINT1(J) = I
+36      CONTINUE
+        DO 40 I = 1,16
+          J = ICHAR(DUP(I:I))
+          POINT2(J) = I
+40      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('U')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        DO 41 I = 1,16
+          J = ICHAR(DLOW(I:I))
+          POINT2(J) = I
+41      CONTINUE
+C  DEAL WITH U
+          J = ICHAR('u')
+          POINT1(J) = 1  
+          POINT2(J) = 1  
+        ELSE IF(IDM.EQ.26)THEN
+          DO 45 I = 0,255
+            POINT1(I) = IDM
+45        CONTINUE
+C
+        DO 50 I = 1,26
+          J = ICHAR(PUP(I:I))
+          POINT1(J) = I
+50      CONTINUE
+        DO 51 I = 1,26
+          J = ICHAR(PLOW(I:I))
+          POINT1(J) = I
+51      CONTINUE
+        DO 60 I = 0,255
+          POINT2(I) = POINT1(I)
+60      CONTINUE
+      ELSE
+        WRITE(*,*)'ERROR INITIALISING CHARACTER LOOKUP POINTERS'
+      END IF
+      END
+      INTEGER FUNCTION IFROMP(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      IFROMP = POINT1(ICOL)
+      END
+      INTEGER FUNCTION CTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT1(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI1/POINT1
+      SAVE /IASCI1/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      CTONUM = POINT1(ICOL)
+      END
+      INTEGER FUNCTION DTONUM(CHAR)
+C  AUTHOR RODGER STADEN
+      INTEGER POINT2(0:255)
+      CHARACTER CHAR
+      COMMON /IASCI2/POINT2
+      SAVE /IASCI2/
+C
+C  GET COLLATING SEQUENCE VALUE
+      ICOL = ICHAR(CHAR)
+C  THIS POINTS TO A VALUE IN POINTR
+      DTONUM = POINT2(ICOL)
+      END
+      INTEGER FUNCTION IUBM(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, 2 FOR POSSIBLE, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,2,0,2,2,0,0,2,2,2,0,2,2,0,
+     +0,1,0,0,2,0,2,0,2,2,0,2,2,2,0,2,0,
+     +0,0,1,0,2,2,0,2,0,2,0,2,0,2,2,2,0,
+     +0,0,0,1,2,2,0,0,2,0,2,0,2,2,2,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,2,1,0,2,2,2,2,2,2,2,2,2,0,
+     +1,1,0,0,2,0,1,2,2,2,2,2,2,2,2,2,0,
+     +1,0,1,0,2,2,2,1,0,2,2,2,2,2,2,2,0,
+     +0,1,0,1,2,2,2,0,1,2,2,2,2,2,2,2,0,
+     +0,1,1,0,2,2,2,2,2,1,0,2,2,2,2,2,0,
+     +1,0,0,1,2,2,2,2,2,2,1,2,2,2,2,2,0,
+     +1,1,1,0,2,2,2,1,2,1,2,1,2,2,2,2,0,
+     +1,1,0,1,2,2,1,2,1,2,1,2,1,2,2,2,0,
+     +0,1,1,1,2,1,2,2,1,1,2,2,2,1,2,2,0,
+     +1,0,1,1,2,1,2,1,2,2,1,2,2,2,1,2,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM2(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A POSSIBLE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,1,0,1,1,0,0,1,1,1,0,1,1,0,
+     +0,1,0,0,1,0,1,0,1,1,0,1,1,1,0,1,0,
+     +0,0,1,0,1,1,0,1,0,1,0,1,0,1,1,1,0,
+     +0,0,0,1,1,1,0,0,1,0,1,0,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,1,1,0,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,0,1,0,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,0,
+     +0,1,0,1,1,1,1,0,1,1,1,1,1,1,1,1,0,
+     +0,1,1,0,1,1,1,1,1,1,0,1,1,1,1,1,0,
+     +1,0,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,0,1,1,1,1,1,1,1,1,1,1,1,1,1,1,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM2 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+      INTEGER FUNCTION IUBM1(SEQ,ENZ)
+C  AUTHOR: RODGER STADEN
+C  RETURNS 1 FOR A DEFINITE MATCH, ELSE 0
+      CHARACTER SEQ,ENZ
+      INTEGER TABLE(17,17),DTONUM
+      EXTERNAL DTONUM
+      DATA TABLE/
+     +1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,1,1,0,1,0,0,0,0,0,0,0,0,0,0,0,
+     +1,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,
+     +1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,0,
+     +0,1,0,1,0,0,0,0,1,0,0,0,0,0,0,0,0,
+     +0,1,1,0,0,0,0,0,0,1,0,0,0,0,0,0,0,
+     +1,0,0,1,0,0,0,0,0,0,1,0,0,0,0,0,0,
+     +1,1,1,0,0,0,0,1,0,1,0,1,0,0,0,0,0,
+     +1,1,0,1,0,0,1,0,1,0,1,0,1,0,0,0,0,
+     +0,1,1,1,0,1,0,0,1,1,0,0,0,1,0,0,0,
+     +1,0,1,1,0,1,0,1,0,0,1,0,0,0,1,0,0,
+     +1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,1,
+     +0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0/
+C
+      IUBM1 = TABLE(DTONUM(ENZ),DTONUM(SEQ))
+      END
+C
+C
+      SUBROUTINE FLUSHL(IDEV)
+      CALL FLUSH(IDEV)
+C       This implementation uses the Sun Fortran FLUSH statement.
+      END
+      SUBROUTINE FLUSHO
+C       The (single) unit number is currently hard-wired, but must
+C       be the same as KBOUT in UNITNO.
+      CALL FLUSHL(6)
+      END
+C
+      SUBROUTINE CEDIT(FILNAM)
+      CHARACTER COMAND*256,EDITOR*256,FILNAM*(*)
+      INTEGER SYSTEM,I
+      EXTERNAL NOTRL
+      CALL GETENV('SEQEDT',EDITOR)
+      I=NOTRL(EDITOR,LEN(EDITOR),' ')
+      IF (I.EQ.0) THEN
+        EDITOR='vi'
+        I=2
+      ENDIF
+      COMAND = EDITOR(1:I)//' '//FILNAM
+      ISTAT = SYSTEM(COMAND)
+      END
+      SUBROUTINE BSW2(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER*2 IN,OUT
+C Sun
+        OUT(1:1) = IN(2:2)
+        OUT(2:2) = IN(1:2)
+C Alliant
+C        OUT = IN
+      END
+      SUBROUTINE BSW4(IN,OUT)
+C byte order on cd is least significant first
+C on some machines this routine should be changed to do nothing
+      CHARACTER IN*4,OUT*4
+C Sun
+      DO 10 I = 1,2
+        OUT(I:I) = IN(5-I:5-I)
+        OUT(5-I:5-I) = IN(I:I)
+ 10     CONTINUE
+C Alliant
+C      OUT = IN
+      END
+      INTEGER FUNCTION IANDRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IAND
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.AND.LJIN
+C Sun
+        IANDRS=AND(IIN,JIN)
+C Alliant
+C        IANDRS=IAND(IIN,JIN)
+        END
+      INTEGER FUNCTION IORRS(IIN,JIN)
+C   AUTHOR: RODGER STADEN 
+C        LOGICAL LIIN,LJIN
+C        INTEGER IN,JN
+C        EQUIVALENCE (LIIN,IN),(LJIN,JN)
+*******    PERFORMS LOGICAL OPERATIONS ON INTEGER VARIABLES
+*******    SO COULD BE FLAGGED AS ILLEGAL OR GIVE INCORRECT RESULTS
+*******    ON MANY MACHINES CAN BE REPLACED BY LOCAL INTRINSIC IOR
+C        IN=IIN
+C        JN=JIN
+C        LIIN=LIIN.OR.LJIN
+C Sun
+        IORRS=OR(IIN,JIN)
+C Alliant
+C        IORRS=IOR(IIN,JIN)
+        END
+       SUBROUTINE WRITEB(CHARS,NCHAR,IDEVGR)
+C   AUTHOR: RODGER STADEN 
+       CHARACTER CHARS*(*)
+       SAVE NUL
+       DATA NUL/0/
+C   NOTE NULS STOP CARRIAGE RETURN, LINE FEED
+C Sun
+       WRITE(IDEVGR,1000)CHARS(1:NCHAR)
+1000   FORMAT(A,$)
+C Alliant
+C      WRITE(IDEVGR,1000)NUL,CHARS(1:NCHAR)
+C1000   FORMAT(A1,A)
+       END
+      SUBROUTINE SABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BITNUM
+C
+C set bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BITNUM = MOD(I,32)
+C      WRITE(*,*)'ELEMENT, BIT',ELEMNT,BITNUM
+      CALL SBITRS(ARRAY(ELEMNT),BITNUM)
+      END
+      SUBROUTINE AAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IANDRS
+      EXTERNAL IANDRS
+C
+C and array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IANDRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      SUBROUTINE OAWORD(ARRAY1,ARRAY2,MAXAR)
+      INTEGER ARRAY1(0:MAXAR),ARRAY2(0:MAXAR)
+      INTEGER IORRS
+      EXTERNAL IORRS
+C
+C OR array1 and array2 and put result in array1
+C
+      DO 10 I=0,MAXAR
+        ARRAY1(I) = IORRS(ARRAY1(I),ARRAY2(I))
+ 10     CONTINUE
+      END
+      LOGICAL FUNCTION TABIT(ARRAY,MAXAR,POSN)
+      INTEGER ARRAY(0:MAXAR),POSN,ELEMNT,BIT
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C test bit corresponding to posn in array (bits 0-31 per word)
+C
+      I = POSN - 1
+      ELEMNT = I / 32
+      BIT = MOD(I,32)
+C      WRITE(*,*)'TEST ELEMENT, BIT',ELEMNT,BIT
+      TABIT = TBITRS(ARRAY(ELEMNT),BIT)
+      END
+      SUBROUTINE CABIT(ARRAY,MAXAR)
+      INTEGER ARRAY(0:MAXAR),POSN
+C
+C clear all bits in array (1 at a time!!!!!!!!)
+C
+      DO 10 POSN=0,MAXAR
+        DO 5 J = 0,31
+          CALL CBITRS(ARRAY(POSN),J)
+ 5      CONTINUE
+ 10     CONTINUE
+      END
+      SUBROUTINE TBITA(ARRAY,MAXAR,IDEV)
+      INTEGER ARRAY(0:MAXAR)
+      LOGICAL TBITRS
+      EXTERNAL TBITRS
+C
+C  test all bits in array to see if they are set
+C
+      DO 10 I=0,MAXAR
+        DO 5 J=0,31
+          IF(TBITRS(ARRAY(I),J)) THEN
+            K = I*32 + 1 + MOD(J,32)
+            WRITE(IDEV,*)' BIT SET FOR POSN',K
+          END IF
+ 5        CONTINUE
+ 10       CONTINUE
+      END
+C   routines to handle bits (machine specific) for use when
+C   searching through keyword indexes
+C
+C   positions for marking are numbered from 1 onwards
+C   but array elements and bits are numbered from 0 onwards
+C
+      SUBROUTINE SBITRS(WORD,BITNUM)
+C
+C set bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+C      INTEGER IBSET
+C      INTRINSIC IBSET
+C      WORD = IBSET(WORD,BITNUM)
+C sun
+      CALL BIS(BITNUM,WORD)
+      END
+      SUBROUTINE CBITRS(WORD,BITNUM)
+C
+C clear bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+C      INTEGER IBCLR
+C      INTRINSIC IBCLR
+C      WORD = IBCLR(WORD,BITNUM)
+C sun
+      CALL BIC(BITNUM,WORD)
+      END
+      LOGICAL FUNCTION TBITRS(WORD,BITNUM)
+C
+C test bit bitnum in word
+C
+      INTEGER WORD,BITNUM
+C alliant
+C      LOGICAL BTEST
+C      INTRINSIC BTEST
+C      TBITRS = BTEST(WORD,BITNUM)
+C sun
+      LOGICAL BIT
+      EXTERNAL BIT
+      TBITRS = BIT(BITNUM,WORD)
+      END
+      INTEGER FUNCTION GETRS(IDEV,STRING,NSTRNG,IBYTE)
+      CHARACTER STRING*(*)
+      IREC = IBYTE
+      DO 10 I=1,NSTRNG
+        READ(IDEV,REC=IREC,ERR=20)STRING(I:I)
+        IREC = IREC + 1
+ 10     CONTINUE
+      GETRS = 0
+      RETURN
+ 20   CONTINUE
+      GETRS = -1
+      END
+      INTEGER FUNCTION GETRSL(IDEV,STRING,NSTRNG)
+      CHARACTER STRING*(*)
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      READ(IDEV,1000,ERR=100,END=200)STRING(1:NSTRNG)
+ 1000 FORMAT(A)
+      DO 10 I=1,NSTRNG
+        IF(STRING(I:I).EQ.CR) THEN
+          GETRSL = I-1
+          RETURN
+        END IF
+ 10     CONTINUE
+      GETRSL = NSTRNG
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR READING IN GETRSL'
+      GETRSL = -2
+      RETURN
+ 200  CONTINUE
+      GETRSL = -1
+      WRITE(*,*)'END IN GETRSL'
+      END
+      INTEGER FUNCTION SEEKRS(IDEV,IREC)
+      INTEGER FSEEK
+      SEEKRS = FSEEK(IDEV,IREC,0)
+      END
diff --git a/src/staden/select.c b/src/staden/select.c
new file mode 100644
index 0000000..0effe5e
--- /dev/null
+++ b/src/staden/select.c
@@ -0,0 +1,318 @@
+#include "edUtils.h"
+#include "select.h"
+#include "Sheet.h"
+#include "tagUtils.h"
+#include <X11/Xatom.h>
+#include <stdio.h>
+
+extern enum States {StateDown=0,StateUp} editorState;
+
+/* -----  private subroutines ----- */
+#define left_margin 4
+#define top_margin 4
+
+static void draw_select(EdStruct *xx, int seq, int from_pos, int to_pos)
+{
+    int *seqList;
+    int s_from,s_to;
+    int temp;
+    int screenRow;
+
+    /* sort from_pos, to_pos */
+    if (from_pos > to_pos) { temp = from_pos; from_pos = to_pos; to_pos = temp; }
+
+    /* clip to screen */
+    s_from = positionInContig(xx,seq,from_pos) - xx->displayPos;
+    if (s_from>=xx->displayWidth) return;
+    if (s_from<0) s_from=0;
+    s_to = positionInContig(xx,seq,to_pos) - xx->displayPos;
+    if (s_to<0) return;
+    if (s_to>=xx->displayWidth) s_to = xx->displayWidth-1;
+
+    seqList = sequencesOnScreen(xx,xx->displayPos, xx->displayWidth);
+    for(screenRow=0;
+	screenRow<xx->displayHeight && seqList[screenRow] != seq;
+	screenRow++);
+    if (screenRow==xx->displayHeight) return;
+	XawSheetOpHilightText(xx->sequencesWid,s_from,screenRow+xx->rulerDisplayed,s_to-s_from+1,sh_select,HOP_TOG);
+
+}
+
+static void undoSelection(EdStruct *xx)
+{
+    redisplaySelection(xx);
+    xx->select_made = 0;
+}
+
+static void lose_ownership_proc(Widget w, Atom *sel)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+    undoSelection(xx);
+}
+
+static Boolean convert_proc(
+	Widget w,
+	Atom *selection,
+	Atom *target,
+	Atom *type_return,
+	XtPointer *value_return,
+	unsigned long *length_return,
+	int *format_return)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+    int temp;
+
+    if (*target == XA_STRING ) {
+	int start_pos,end_pos;
+	start_pos = xx->select_start_pos;
+	end_pos = xx->select_end_pos;
+	if (start_pos > end_pos) {
+	    temp = start_pos;
+	    start_pos = end_pos;
+	    end_pos = temp;
+	}
+	*length_return = end_pos - start_pos;
+	*value_return = XtMalloc(*length_return+1);
+	if (xx->select_seq == 0)
+	    DBcalcConsensus(xx,start_pos,*length_return,*value_return,BOTH_STRANDS);
+	else
+	    DBgetSequence(xx,xx->select_seq,start_pos-1,*length_return,*value_return);
+	*type_return = XA_STRING;
+	*format_return = 8;
+	return (TRUE);
+    }
+    return (FALSE);
+}
+
+/* -----  exported subroutines ----- */
+void start_highlight (Widget w, XButtonEvent *event, String *params,
+		      Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+    int x,y;
+    int *seqList;
+
+    if (editorState == StateDown) return;
+
+    if (xx->select_made) undoSelection(xx);
+    xx->select_made = 1;
+
+    x = (event->x-left_margin) / xx->fontWidth;
+    y = (event->y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+    if (y<0) y=0;
+    if (y>=xx->displayHeight) y=xx->displayHeight-1;
+    if (x<0) x=0;
+    if (x>=xx->displayWidth) x=xx->displayWidth-1;
+
+    seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+    xx->select_seq= seqList[y];
+    xx->select_start_pos = xx->displayPos - DBgetRelPos(xx,xx->select_seq) + x + 1;
+    if (xx->select_start_pos<1)
+	xx->select_start_pos = 1;
+    else
+	if (xx->select_start_pos > DBgetLength(xx,xx->select_seq)+1)
+	    xx->select_start_pos = DBgetLength(xx,xx->select_seq)+1;
+    xx->select_end_pos = xx->select_start_pos;
+    xx->select_tag = NULL;
+
+}
+
+
+
+void make_selection (Widget w, XButtonEvent *event, String *params,
+		     Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+
+    if (XtOwnSelection(w,XA_PRIMARY,event->time,convert_proc,
+	lose_ownership_proc, NULL) == FALSE) {
+	XtWarning("Editor: failed to become selection owner; make a new selection.\n");
+	undoSelection(xx);
+    }
+
+}
+
+void disown_selection(EdStruct *xx)
+{
+    if (xx->select_made)
+	XtDisownSelection(xx->sequencesWid,XA_PRIMARY,CurrentTime);
+    xx->select_made = 0;
+}
+
+void extend_highlight (Widget w, XButtonEvent *event, String *params,
+		       Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+
+    int pos;
+    int x;
+
+    if ( ! xx->select_made) return;
+    x = (event->x-left_margin) / xx->fontWidth;
+    if (x<0) x=0;
+    if (x>=xx->displayWidth) x=xx->displayWidth-1;
+
+    pos = xx->displayPos - DBgetRelPos(xx,xx->select_seq) + x + 1;
+    if (pos<1)
+	pos = 1;
+    else
+	if (pos > DBgetLength(xx,xx->select_seq)+1)
+	    pos = DBgetLength(xx,xx->select_seq)+1;
+
+    if (pos == xx->select_end_pos) return;
+
+    if (xx->select_start_pos < xx->select_end_pos) {
+	if (xx->select_end_pos < pos)
+	    draw_select(xx, xx->select_seq, xx->select_end_pos,pos-1);
+	else
+	    draw_select(xx, xx->select_seq, pos,xx->select_end_pos-1);
+    } else {
+	if (xx->select_end_pos > pos)
+	    draw_select(xx, xx->select_seq, pos,xx->select_end_pos-1);
+	else
+	    draw_select(xx, xx->select_seq, xx->select_end_pos,pos-1);
+    }
+
+    xx->select_end_pos = pos;
+}
+
+
+
+void redisplaySelection(EdStruct *xx)
+{
+    if (!xx->select_made) return;
+    if (xx->select_start_pos == xx->select_end_pos) ;
+    else if (xx->select_start_pos < xx->select_end_pos) {
+	draw_select(xx, xx->select_seq, xx->select_start_pos,xx->select_end_pos-1);
+    } else {
+	draw_select(xx, xx->select_seq, xx->select_end_pos,xx->select_start_pos-1);
+    }
+}
+
+
+int getSelection(EdStruct *xx, int *seq, int *start, int *length, tagStruct **t)
+{
+    if (! xx->select_made) return 0;
+    if (xx->select_start_pos <= xx->select_end_pos) {
+	*seq = xx->select_seq;
+	*start = xx->select_start_pos;
+	*length = xx->select_end_pos - xx->select_start_pos;
+	*t = xx->select_tag;
+    } else {
+	*seq = xx->select_seq;
+	*start = xx->select_end_pos;
+	*length = xx->select_start_pos - xx->select_end_pos;
+	*t = xx->select_tag;
+    }
+    return 1;
+}
+
+void _select_tag(EdStruct *xx, int seq, tagStruct *t)
+{
+    XButtonEvent event;
+    if (t==NULL) return;
+
+    if (xx->select_made) undoSelection(xx);
+    xx->select_made = 1;
+    xx->select_seq = seq;
+    xx->select_start_pos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+    xx->select_end_pos = xx->select_start_pos + t->tagrec.length;
+    xx->select_tag = t;
+
+    redisplaySelection(xx);
+    event.time = CurrentTime;
+    make_selection (xx->sequencesWid, &event, NULL, 0);
+
+}
+
+void select_tag(Widget w, XButtonEvent *event, String *params,
+		Cardinal *num_params)
+{
+    EdStruct *xx = widgetToEdStruct(XtParent(w));
+    int x,y;
+    int *seqList;
+    int seq,pos;
+    tagStruct *t;
+
+    x = (event->x-left_margin) / xx->fontWidth;
+    y = (event->y-top_margin) / xx->fontHeight - xx->rulerDisplayed;
+    if (y<0) y=0;
+    if (y>=xx->displayHeight) y=xx->displayHeight-1;
+    if (x<0) x=0;
+    if (x>=xx->displayWidth) x=xx->displayWidth-1;
+
+    seqList = sequencesOnScreen(xx,xx->displayPos,xx->displayWidth);
+    seq = seqList[y];
+    pos = xx->displayPos - DBgetRelPos(xx,seq) + x + 1;
+
+    if ( (t=findTag(xx,seq,pos)) != NULL) {
+	if (xx->select_made) undoSelection(xx);
+	xx->select_made = 1;
+	xx->select_seq = seq;
+	xx->select_start_pos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	xx->select_end_pos = xx->select_start_pos + t->tagrec.length;
+	xx->select_tag = t;
+
+	redisplaySelection(xx);
+	make_selection (w, event, NULL, 0);
+    }
+
+}
+
+void selectInsertBase(EdStruct *xx, int seq, int pos)
+/*
+** Adjust selection on insertion
+*/
+{
+    if (xx->select_made && xx->select_seq==seq) {
+	int inverted=(xx->select_end_pos < xx->select_start_pos);
+	int start,end;
+
+	if (inverted) {
+	    start=xx->select_end_pos;
+	    end  =xx->select_start_pos;
+	} else {
+	    end  =xx->select_end_pos;
+	    start=xx->select_start_pos;
+	}
+
+	if (pos <= start) {
+	    xx->select_start_pos++;
+	    xx->select_end_pos++;
+	} else if (pos < end) {
+	    if (inverted)
+		xx->select_start_pos++;
+	    else
+		xx->select_end_pos++;
+	}
+    }
+}
+
+void selectDeleteBase(EdStruct *xx, int seq, int pos)
+/*
+** Adjust selection on deletion
+*/
+{
+    if (xx->select_made && xx->select_seq==seq) {
+	int inverted=(xx->select_end_pos < xx->select_start_pos);
+	int start,end;
+
+	if (inverted) {
+	    start=xx->select_end_pos;
+	    end  =xx->select_start_pos;
+	} else {
+	    end  =xx->select_end_pos;
+	    start=xx->select_start_pos;
+	}
+
+	if (pos < start) {
+	    xx->select_start_pos--;
+	    xx->select_end_pos--;
+	} else if (pos < end) {
+	    if (inverted)
+		xx->select_start_pos--;
+	    else
+		xx->select_end_pos--;
+	}
+    }
+}
diff --git a/src/staden/select.h b/src/staden/select.h
new file mode 100644
index 0000000..728f029
--- /dev/null
+++ b/src/staden/select.h
@@ -0,0 +1,16 @@
+#ifndef _select_h
+#define _select_h
+
+#include "tagUtils.h"
+
+extern void start_highlight (Widget, XButtonEvent *, String *, Cardinal *);
+extern void make_selection  (Widget, XButtonEvent *, String *, Cardinal *);
+extern void extend_highlight(Widget, XButtonEvent *, String *, Cardinal *);
+extern void select_tag(Widget, XButtonEvent *, String *, Cardinal *);
+extern void _select_tag(EdStruct *xx, int seq, tagStruct *t);
+extern void redisplaySelection(EdStruct *xx);
+extern void disown_selection(EdStruct *xx);
+extern int getSelection(EdStruct *xx, int *seq, int *start, int *length, tagStruct **t);
+extern void selectInsertBase(EdStruct *xx, int seq, int pos);
+extern void selectDeleteBase(EdStruct *xx, int seq, int pos);
+#endif /* _select_h */
diff --git a/src/staden/seqlibsubs.f b/src/staden/seqlibsubs.f
new file mode 100644
index 0000000..7206bd1
--- /dev/null
+++ b/src/staden/seqlibsubs.f
@@ -0,0 +1,3290 @@
+C  seqlibsubs
+C 4-5-93 Added lip routine RDLIBB and lots of diagnostics
+C  17-2-93 Stopped keyword and author searches giving "error reading index"
+C          by changes to ikwrd and ianum
+C  28-9-92 Stopped rdlibl from giving "error reading index"
+C          for end of library - sipl,pipl,nipl
+C  17-6-92 Added routines to deal with fasta format (Not
+C          done very carefully!). Requires a change to library
+C          searching programs (sipl, pipl, nipl) to denote the
+C          first entry.
+C  30-5-91 changed major library format to embl cd
+C  1-10-91 removed references to cdseqs
+C  21-10-91 added keyword index search for what it is currently worth
+C           and changed to new format for brief.idx files
+C           In the future I hope the keyword index will be a full
+C           text index, then i will activate more of the code ive
+C           just added for the keywords. Also I ought to list out the
+C           contents of the brief.idx entry for any matches found.
+C 16-12-91  Added routines to deal with codata format and modified
+C           others accordingly. Externally only calls to cdroml are
+C           affected. At the moment am only dealing with an entryname
+C           accession number, and brief directory indexes for codata and,
+C           untidily offering more to the user
+C           which wil reslut in error messages (eg keyword searches).
+C           Have called codata format 'ltype b'
+C 25-2-92   Added routines to deal with genbank format (as for codata)
+C           Added full text index search for embl and swissprot. This
+C           needs two integer arrays of size nrecen/32 which 
+C           are passed down from main. Also removed filename strings from
+C           main.
+C           Have replaced the keyword search by the full text search.
+C 26-2-92   Fixed bug: ftype is now cleared in rdlb1
+C 2-3-92 set filnam = ' ' for calls to openf1
+C 3-3-92    correction to keyword access - use (irec-1)*recordsize
+C           NOT irec*recordsize
+C 11-3-92   Added more error messages, allowed file names of 80 characters
+C 12-5-92 added author searches (replaced brief.idx search)
+C 14-5-92 minimised the number of array elements used by the bit arrays
+C 20-5-92 moved getrs, getrsl seekrs to seeme
+      SUBROUTINE RDSEQ(SEQ,MAXSEQ,IDIMT,J1,J2,ISTART,IEND,
+     +IDIM1,IDIMB,IDEV,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,
+     +IDEVOT,IFORNO,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBIN,LIBLF,WORKI,MAXWOR,IOK)
+      CHARACTER SEQ(MAXSEQ)
+      CHARACTER FILNAM*(*),LIBLF*(*)
+      CHARACTER HELPF*(*)
+      INTEGER ANSF,STYPE,WORKI(MAXWOR)
+      PARAMETER (MAXPRM = 16)
+      CHARACTER PROMPT(6)*(MAXPRM)
+      IDIMIN=IDIMT
+1     CONTINUE
+      PROMPT(1) = 'Personal file'
+      PROMPT(2) = 'Sequence library'
+      STYPE = IFORNO + 1
+      CALL RADION('Select sequence source',PROMPT,2,STYPE,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(STYPE.LT.1) GO TO 10
+      IF(STYPE.EQ.1) THEN
+        PROMPT(1) = 'Staden'
+        PROMPT(2) = 'EMBL'
+        PROMPT(3) = 'GenBank'
+        PROMPT(4) = 'PIR'
+        PROMPT(5) = 'GCG'
+        PROMPT(6) = 'FASTA'
+        ANSF = 1
+        CALL RADION('Select sequence file format',PROMPT,6,ANSF,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(ANSF.LT.1)GO TO 10
+        FILNAM = ' '
+        CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Sequence file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)GO TO 10
+        IDIMT=MAXSEQ
+        IF(ANSF.EQ.2)THEN
+C   READ FROM EMBL FILE
+          CALL REMBL(IDEV,SEQ(1),IDIMT,KBOUT)
+        ELSE IF(ANSF.EQ.5)THEN
+          CALL  RGCG(IDEV,SEQ(1),IDIMT,KBOUT)
+        ELSE IF(ANSF.EQ.3)THEN
+C   READ FROM GENBANK FILE
+          CALL RGEN(IDEV,SEQ(1),IDIMT,KBOUT)
+        ELSE IF(ANSF.EQ.1)THEN
+C   READ FROM STADEN FILE
+          CALL ARRFIL(IDEV,SEQ,IDIMT,KBOUT)
+C   REMOVE CONTIG NAME IF PRESENT
+          IF(SEQ(20).EQ.'>')THEN
+            CALL MOVEC(
+     +      SEQ(1),IDIMT,IDIMT,1,-20)
+            IDIMT=IDIMT-20
+            WRITE(KBOUT,*)' Contig title removed'
+          END IF
+        ELSE IF(ANSF.EQ.4)THEN
+C   READ FROM A PIR FILE
+          IDIMT=MAXSEQ
+          CALL RDPIRP(SEQ(1),IDIMT,
+     +    FILNAM,IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        ELSE IF(ANSF.EQ.6)THEN
+C   READ FROM A FASTA FILE
+          IDIMT=MAXSEQ
+          CALL RDFASP(SEQ(1),IDIMT,
+     +    FILNAM,IDEV,KBIN,KBOUT,
+     +    IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        END IF
+      ELSE IF(STYPE.EQ.2)THEN
+C   READ FROM A LIBRARY FILE
+        IDIMT=MAXSEQ
+        CALL RDLIB(SEQ,IDIMT,FILNAM,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +  IDEVLF,LIBIN,LIBLF,WORKI,MAXWOR,IOK)
+      END IF
+      CLOSE (UNIT=IDEV)
+10    CONTINUE
+      IF(IDIMT.EQ.0)IDIMT = IDIMIN
+      WRITE(KBOUT,1001)IDIMT
+1001  FORMAT(' Sequence length ',I6)
+      J1=1
+      J2=MIN(MAXSEQ,IDIMT)
+      ISTART=J1
+      IEND=J2
+      IDIMB=IEND-ISTART+1
+      IDIM1=J2-J1+1
+      END
+      SUBROUTINE REMBL(IDEV,SEQ,IDIM,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),CODE*2
+10    CONTINUE
+      READ(IDEV,1000,END=50)CODE
+1000  FORMAT(A)
+C   sequence?
+      IF(CODE.NE.'SQ')GO TO 10
+      IP1=1
+      IP2=60
+20    CONTINUE
+      READ(IDEV,1002,END=30)CODE,(SEQ(K),K=IP1,IP2)
+1002  FORMAT(A,3X,6(10A1,1X))
+C   end of data?
+      IF(CODE(1:1).NE.'/')THEN
+C   sequence
+        IP1=IP1+60
+        IP2=IP2+60
+C  check for overflow
+        IF(IP2.GT.IDIM)IP2=IDIM
+        IF(IP1.LE.IP2)GO TO 20
+        WRITE(KBOUT,1005)IDIM
+1005    FORMAT(' Maximum sequence length',I6,
+     +  ' exceeded, no more read')
+        RETURN
+      END IF
+30    CONTINUE
+C   find end of data
+      IP3=IP2+1
+40    IP3=IP3-1
+      IF(IP3.GT.0)THEN
+        IF(SEQ(IP3).EQ.' ')GO TO 40
+      END IF
+C  end found
+      IDIM=IP3
+      RETURN
+50    CONTINUE
+C   NO DATA FOUND
+      IDIM=0
+      END
+      SUBROUTINE RGEN(IDEV,SEQ,IDIM,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER LINE*80,LINE2*60,SEQ(IDIM)
+C   THIS ROUTINE READS A GENBANK FILE.
+C       IT ASSUMES THE WORD ORIGIN APPEARS ON THE LINE
+C       IMMEDIATELY PRECEDING THE SEQUENCE, AND THAT
+C       THE LAST LINE CONTAINS //
+      IDIMIN=IDIM
+      IDIM=0
+10    READ(IDEV,1000,END=40)LINE
+1000  FORMAT(A)
+      IF(LINE(1:6).NE.'ORIGIN')GO TO 10
+20    READ(IDEV,1000,END=40)LINE
+      IF(LINE(1:2).NE.'//')THEN
+        LINE2(1:60)=
+     +  LINE(11:20)//LINE(22:31)//LINE(33:42)//LINE(44:53)//
+     +  LINE(55:64)//LINE(66:75)
+        DO 30 I=1,60
+          IF(LINE2(I:I).EQ.' ')GO TO 40
+          IF(IDIM.LT.IDIMIN)THEN
+            IDIM=IDIM+1
+            SEQ(IDIM)=LINE2(I:I)
+          ELSE
+C   OVERFLOW!
+            WRITE(KBOUT,1001)IDIMIN
+1001  FORMAT(' Maximum sequence length=',I6,'. Only this much read')
+            GO TO 40
+          END IF
+30      CONTINUE
+        GO TO 20
+      END IF
+40    CONTINUE
+100   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE RDPIRP(SEQ,IDIM,FILNAM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR RODGER STADEN
+      CHARACTER FILNAM*(*)
+      CHARACTER SEQ(IDIM),TEMP(80),NAME*20,NAMIN*20,NEWNAM*20
+      CHARACTER HELPF*(*)
+      EXTERNAL NOTRL,NMMTCH
+      IDIMIN = IDIM
+      IDIM = 0
+      IOK = 1
+          CALL YESNO(LIST,'Skip listing of entry names',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(LIST.LT.0) RETURN
+4         CONTINUE 
+          LENGTH = NOTRL(NAMIN,20,' ')
+          CALL GTSTR('Entry name',NAMIN,NEWNAM,
+     +    LENGTH,KBOUT,KBIN,INFLAG)
+          IF(INFLAG.EQ.2) RETURN
+          IF(INFLAG.EQ.1) THEN
+            CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+            GO TO 4
+          END IF
+          IF (LENGTH.GT.0) NAMIN = NEWNAM
+          CALL CCASE(NAMIN,1)
+1005      FORMAT(A)
+10        CONTINUE
+          READ(IDEV,1005,ERR=200,END=300)NAME
+          IF(NAME(1:1).NE.'>')GO TO 10
+          IF(LIST.EQ.1)WRITE(KBOUT,1003)NAME
+1003      FORMAT(' ',A)
+C         IS THIS THE ENTRY WE WANT?
+C          IF(NAME(5:).NE.NAMIN(1:12))GO TO 10
+          IF(NMMTCH(NAME(5:),NAMIN).NE.0) GO TO 10
+          FILNAM(1:16)=NAME(5:20)
+          CALL RPIR(SEQ,IDIMIN,IDEV,KBOUT,TEMP,IOK)
+          IDIM = IDIMIN
+          RETURN
+200   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading file')
+      RETURN
+300   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Unexpected end of file')
+      END
+      SUBROUTINE RDPIRA(SEQ,IDIM,
+     + IDEV,KBOUT,TITLE,FILNAM,LIST,NAMIN,IDEVL)
+C   AUTHOR RODGER STADEN
+      CHARACTER TITLE*(*),FILNAM*(*)
+      PARAMETER (NAMLEN = 10)
+      CHARACTER SEQ(IDIM),TEMP(80),NAMIN*(*)
+      EXTERNAL NMMTCH
+      IF(LIST.EQ.1) THEN
+        READ(IDEVL,1005,ERR=999,END=950)NAMIN
+      END IF
+10    CONTINUE
+        READ(IDEV,1005,ERR=200,END=300)TITLE
+1005    FORMAT(A)
+        IF(TITLE(1:1).NE.'>')GO TO 10
+        IF(LIST.EQ.1)THEN
+          IF(NMMTCH(TITLE(5:),NAMIN).NE.0) GO TO 10
+        END IF
+        IF(LIST.EQ.-1)THEN
+          IF(NMMTCH(TITLE(5:),NAMIN).EQ.0) THEN
+            READ(IDEVL,1005,ERR=999,END=900)NAMIN
+            GO TO 10
+          END IF
+        END IF
+      L = INDEX(TITLE(5:),' ')
+      IF(L.NE.0) THEN
+        L = 4 + L - 1
+      ELSE
+        L = NAMLEN
+      END IF
+      FILNAM = TITLE(5:L)
+C   GOT WANTED SEQUENCE SO READ IT. FIRST READ 1 LINE TITLE
+      CALL RDPIRS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      IF(IOK.EQ.1) GO TO 200
+      IF(IOK.EQ.2) GO TO 300
+      RETURN
+200   CONTINUE
+      CALL ERROM(KBOUT,'Error reading library file')
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      CALL ERROM(KBOUT,'End of library file reached')
+      IDIM = -9
+      IOK = 2
+      RETURN
+900   CONTINUE
+      NAMIN = ' RODGER'
+      GO TO 10
+999   CONTINUE
+      CALL ERROM(KBOUT,'Error reading entry names file')
+      IDIM = -9
+      RETURN
+ 950  CONTINUE
+      IOK = 3
+      IDIM = -9
+      END
+      SUBROUTINE RDPIRD(SEQ,IDIM,
+     + IDEV,KBOUT,TITLE,NAMIN)
+C   AUTHOR RODGER STADEN
+C 20-12-90 Removed lines setting idim to 0. Could it have ever worked!
+      CHARACTER TITLE*(*)
+      CHARACTER SEQ(IDIM),TEMP(80),NAMIN*(*)
+      EXTERNAL NMMTCH
+10    CONTINUE
+        READ(IDEV,1001,ERR=200,END=300)TITLE
+        IF(TITLE(1:1).NE.'>')GO TO 10
+        IF(NMMTCH(TITLE(5:),NAMIN).NE.0) GO TO 10
+1001  FORMAT(A)
+      CALL RDPIRS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      IF(IOK.EQ.1) GO TO 200
+      IF(IOK.EQ.2) GO TO 300
+      RETURN
+200   CONTINUE
+      CALL ERROM(KBOUT,'Error reading library file')
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      CALL ERROM(KBOUT,'End of library file reached')
+      IOK = 2
+      IDIM = -9
+      END
+      SUBROUTINE RDPIRS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      CHARACTER SEQ(IDIM),TEMP(80),TITLE*(*)
+      IDIMIN = IDIM
+      IDIM = 0
+      ISEQ = 0
+      IOK = 0
+      READ(IDEV,1005,ERR=200,END=300)TITLE
+1005  FORMAT(A)
+      NCHRS = 80
+20    CONTINUE
+      READ(IDEV,1000,ERR=200,END=300)TEMP
+1000  FORMAT(80A1)
+      DO 40 J=1,NCHRS
+        IF(TEMP(J).NE.' ')THEN
+C         IS THIS THE END OF THE ENTRY SHOWN BY A * ?
+          IF(TEMP(J).EQ.'*')THEN
+            IDIM = ISEQ
+            RETURN
+          END IF
+          ISEQ = ISEQ + 1
+          IF(ISEQ.GT.IDIMIN)THEN
+            WRITE(KBOUT,1010)IDIMIN
+1010        FORMAT(' Maximum sequence length (',I7,') reached',
+     +      ' no more read')
+            IDIM = IDIMIN
+            RETURN
+          END IF
+          SEQ(ISEQ) = TEMP(J)
+        END IF
+40    CONTINUE
+      GO TO 20
+200   CONTINUE
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      IDIM = -9
+      IOK = 2
+      END
+      SUBROUTINE RPIR(SEQ,IDIM,IDEV,KBOUT,TEMP,IOK)
+      CHARACTER SEQ(IDIM),TEMP(80)
+      IDIMIN = IDIM
+      ISEQ = 0
+      IOK = 0
+1000  FORMAT(80A1)
+      READ(IDEV,1000,ERR=200,END=200)TEMP
+      WRITE(KBOUT,1014)(TEMP(K),K=1,60)
+1014  FORMAT(' ',60A)
+20    CONTINUE
+      READ(IDEV,1000,ERR=200,END=300)TEMP
+      DO 40 J=1,80
+        IF(TEMP(J).NE.' ')THEN
+          IF(TEMP(J).EQ.'*')GO TO 100
+          ISEQ=ISEQ+1
+          IF(ISEQ.GT.IDIMIN)THEN
+            WRITE(KBOUT,1010)IDIMIN
+1010        FORMAT(' Maximum sequence length (',I6,') reached.',
+     +      ' No more read')
+            GO TO 400
+          END IF
+          SEQ(ISEQ)=TEMP(J)
+        END IF
+40    CONTINUE
+      GO TO 20
+100   CONTINUE
+      IDIM=ISEQ
+      RETURN
+200   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading file')
+      RETURN
+300   CONTINUE
+      WRITE(KBOUT,*)' Warning: No * at end of entry'
+      IDIM = ISEQ
+      RETURN
+400   CONTINUE
+      IDIM=IDIMIN
+      END
+      SUBROUTINE RDFASP(SEQ,IDIM,FILNAM,IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR RODGER STADEN
+      CHARACTER FILNAM*(*)
+      CHARACTER SEQ(IDIM),TEMP(80),NAME*80,NAMIN*20,NEWNAM*20
+      CHARACTER HELPF*(*)
+      EXTERNAL NOTRL,NMMTCH
+      SAVE NAMIN
+      DATA NAMIN/' '/
+      IF (NAMIN(1:1).EQ.' ')CALL GFASNM(IDEV,NAME,NAMIN)
+      IDIMIN = IDIM
+      IDIM = 0
+      IOK = 1
+          CALL YESNO(LIST,'Skip listing of entry names',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(LIST.LT.0) RETURN
+4         CONTINUE 
+          LENGTH = NOTRL(NAMIN,20,' ')
+          CALL GTSTR('Entry name',NAMIN,NEWNAM,
+     +    LENGTH,KBOUT,KBIN,INFLAG)
+          IF(INFLAG.EQ.2) RETURN
+          IF(INFLAG.EQ.1) THEN
+            CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+            GO TO 4
+          END IF
+          IF (LENGTH.GT.0) NAMIN = NEWNAM
+          CALL CCASE(NAMIN,1)
+1005      FORMAT(A)
+10        CONTINUE
+          READ(IDEV,1005,ERR=200,END=300)NAME
+          IF(NAME(1:1).NE.'>')GO TO 10
+          IF(LIST.EQ.1)WRITE(KBOUT,1003)NAME
+1003      FORMAT(' ',A)
+C         IS THIS THE ENTRY WE WANT?
+C          IF(NAME(5:).NE.NAMIN(1:12))GO TO 10
+          IF(NMMTCH(NAME(2:),NAMIN).NE.0) GO TO 10
+          FILNAM = NAME(2:INDEX(NAME,' '))
+          CALL RFAS(SEQ,IDIMIN,IDEV,KBOUT,TEMP,IOK)
+          IDIM = IDIMIN
+          RETURN
+200   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading file')
+      RETURN
+300   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Unexpected end of file')
+      END
+      SUBROUTINE GFASNM(IDEV,LINE,NAMIN)
+      CHARACTER LINE*(*),NAMIN*(*)
+      EXTERNAL NOTRL
+ 10   CONTINUE
+      READ(IDEV,1000,END=30,ERR=30)LINE
+      IF (LINE(1:1).NE.'>') GO TO 10
+      I = MIN(INDEX(LINE,' ')-1,LEN(NAMIN))
+      NAMIN  = LINE(2:I)
+ 30   CONTINUE
+      REWIND(UNIT=IDEV)
+ 1000 FORMAT(A)
+      END
+      SUBROUTINE RDFASA(SEQ,IDIM,
+     + IDEV,KBOUT,TITLE,FILNAM,LIST,NAMIN,IDEVL,ENTRYN)
+C   AUTHOR RODGER STADEN
+      CHARACTER TITLE*(*),FILNAM*(*)
+      PARAMETER (NAMLEN = 10)
+      CHARACTER SEQ(IDIM),TEMP*80,NAMIN*(*)
+      INTEGER ENTRYN
+      EXTERNAL NMMTCH
+      SAVE TEMP
+C
+C for first entry we must find the entry name, but on subsequent visits
+C we assume we have an entryname in title. This is because we have no
+C terminator and have to keep reading until we meet >
+C
+      IF(LIST.EQ.1) THEN
+        READ(IDEVL,1005,ERR=999,END=950)NAMIN
+      END IF
+      IF (ENTRYN.EQ.0) THEN
+ 5      CONTINUE
+        READ(IDEV,1005,ERR=200,END=300)TITLE
+1005    FORMAT(A)
+        IF(TITLE(1:1).NE.'>')GO TO 5
+      ELSE
+        TITLE = TEMP
+      END IF
+      ENTRYN = 1
+10    CONTINUE
+C      write(*,*)namin
+C      write(*,*)title
+      IF(LIST.EQ.1)THEN
+        IF(NMMTCH(TITLE(2:),NAMIN).NE.0) THEN
+20        CONTINUE
+          READ(IDEV,1005,ERR=200,END=300)TITLE
+          IF(TITLE(1:1).NE.'>')GO TO 20
+          IF(NMMTCH(TITLE(2:),NAMIN).NE.0) GO TO 20
+        END IF
+      END IF
+      IF(LIST.EQ.-1)THEN
+        IF(NMMTCH(TITLE(2:),NAMIN).EQ.0) THEN
+30        CONTINUE
+          READ(IDEVL,1005,ERR=999,END=900)NAMIN
+C          write(*,*)'newnam',namin
+40        CONTINUE
+          READ(IDEV,1005,ERR=200,END=300)TITLE
+          IF(TITLE(1:1).NE.'>')GO TO 40
+C          write(*,*)'newtit',title
+          IF(NMMTCH(TITLE(2:),NAMIN).EQ.0) GO TO 30
+        END IF
+      END IF
+      L = INDEX(TITLE,' ')
+      IF(L.NE.0) THEN
+        L = L - 1
+      ELSE
+        L = NAMLEN
+      END IF
+      FILNAM = TITLE(2:L)
+C   GOT WANTED SEQUENCE SO READ IT. FIRST READ 1 LINE TITLE
+      CALL RDFASS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      IF(IOK.EQ.1) GO TO 200
+      IF(IOK.EQ.2) GO TO 300
+C      write(*,*)'process',filnam
+      RETURN
+200   CONTINUE
+      CALL ERROM(KBOUT,'Error reading library file')
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      CALL ERROM(KBOUT,'End of library file reached')
+      IDIM = -9
+      IOK = 2
+      RETURN
+900   CONTINUE
+C
+C end of names reached for list of excludes so set silly name
+C and read to next entry (allows last entry to be excluded)
+C
+50    CONTINUE
+      READ(IDEV,1005,ERR=200,END=300)TITLE
+      IF(TITLE(1:1).NE.'>')GO TO 50
+      NAMIN = ' RODGER'
+      GO TO 10
+999   CONTINUE
+      CALL ERROM(KBOUT,'Error reading entry names file')
+      IDIM = -9
+      RETURN
+ 950  CONTINUE
+      IOK = 3
+      IDIM = -9
+      END
+      SUBROUTINE RDFASD(SEQ,IDIM,
+     + IDEV,KBOUT,TITLE,NAMIN)
+C   AUTHOR RODGER STADEN
+      CHARACTER TITLE*(*)
+      CHARACTER SEQ(IDIM),TEMP*80,NAMIN*(*)
+      EXTERNAL NMMTCH
+10    CONTINUE
+        READ(IDEV,1001,ERR=200,END=300)TITLE
+        IF(TITLE(1:1).NE.'>')GO TO 10
+        IF(NMMTCH(TITLE(2:),NAMIN).NE.0) GO TO 10
+1001  FORMAT(A)
+      CALL RDFASS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      IF(IOK.EQ.1) GO TO 200
+      IF(IOK.EQ.2) GO TO 300
+      RETURN
+200   CONTINUE
+      CALL ERROM(KBOUT,'Error reading library file')
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      CALL ERROM(KBOUT,'End of library file reached')
+      IOK = 2
+      IDIM = -9
+      END
+      SUBROUTINE RDFASS(SEQ,IDIM,TEMP,TITLE,IDEV,KBOUT,IOK)
+      CHARACTER SEQ(IDIM),TEMP*(*),TITLE*(*)
+      IDIMIN = IDIM
+      IDIM = 0
+      ISEQ = 0
+      IOK = 0
+      NCHRS = 80
+20    CONTINUE
+      READ(IDEV,1000,ERR=200,END=300)TEMP
+1000  FORMAT(A)
+C      write(*,1000)temp
+      IF(TEMP(1:1).EQ.'>') THEN
+        IDIM = ISEQ
+C        write(*,*)idim
+        RETURN
+      END IF
+      DO 40 J=1,NCHRS
+        IF(TEMP(J:J).NE.' ')THEN
+          ISEQ = ISEQ + 1
+          IF(ISEQ.GT.IDIMIN)THEN
+            WRITE(KBOUT,1010)IDIMIN
+1010        FORMAT(' Maximum sequence length (',I7,') reached',
+     +      ' no more read')
+            IDIM = IDIMIN
+            RETURN
+          END IF
+          SEQ(ISEQ) = TEMP(J:J)
+        END IF
+40    CONTINUE
+      GO TO 20
+200   CONTINUE
+      IOK = 1
+      IDIM = -9
+      RETURN
+300   CONTINUE
+      IDIM = ISEQ
+      END
+      SUBROUTINE RFAS(SEQ,IDIM,IDEV,KBOUT,TEMP,IOK)
+      CHARACTER SEQ(IDIM),TEMP(80)
+      IDIMIN = IDIM
+      ISEQ = 0
+      IOK = 0
+1000  FORMAT(80A1)
+20    CONTINUE
+      READ(IDEV,1000,ERR=200,END=100)TEMP
+      IF(TEMP(1).EQ.'>')GO TO 100
+      DO 40 J=1,80
+        IF(TEMP(J).NE.' ')THEN
+          ISEQ=ISEQ+1
+          IF(ISEQ.GT.IDIMIN)THEN
+            WRITE(KBOUT,1010)IDIMIN
+1010        FORMAT(' Maximum sequence length (',I6,') reached.',
+     +      ' No more read')
+            GO TO 400
+          END IF
+          SEQ(ISEQ)=TEMP(J)
+        END IF
+40    CONTINUE
+      GO TO 20
+100   CONTINUE
+      IDIM=ISEQ
+      RETURN
+200   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading file')
+      RETURN
+400   CONTINUE
+      IDIM=IDIMIN
+      END
+      SUBROUTINE RGCG(IDEV,SEQ,MAXSEQ,KBOUT)
+      CHARACTER LINE*133,SEQ(MAXSEQ)
+      INTEGER GCGDOT
+      EXTERNAL GCGDOT
+      IOK = GCGDOT(IDEV,LINE)
+      IF(IOK.NE.0) THEN
+        WRITE(KBOUT,*)'No .. line found'
+        MAXSEQ = 0
+        CLOSE(UNIT=IDEV)
+        RETURN
+      END IF
+      CALL RGCGIN(SEQ,MAXSEQ,LINE,IDEV,KBOUT)
+      CLOSE(UNIT=IDEV)
+      END
+      INTEGER FUNCTION GCGDOT(IDEV,LINE)
+      CHARACTER LINE*(*)
+      GCGDOT = 1
+10    CONTINUE
+      READ(IDEV,1000,ERR=100,END=100)LINE
+1000  FORMAT(A)
+      I = INDEX(LINE,'..')
+      IF(I.EQ.0) GO TO 10
+      GCGDOT = 0
+      RETURN
+100   CONTINUE
+      END     
+      SUBROUTINE RGCGIN(SEQ,MAXSEQ,LINE,IDEV,KBOUT)
+      CHARACTER SEQ(MAXSEQ)
+      PARAMETER (NBAD=11)
+      CHARACTER LINE*(*),CHARB*(NBAD)
+      PARAMETER (CHARB='0123456789 ')
+      INTEGER COKBAD
+      EXTERNAL COKBAD
+      ISEQ = 0
+10    CONTINUE
+      READ(IDEV,1000,ERR=100,END=200)LINE
+1000  FORMAT(A)
+        DO 5 I = 1,LEN(LINE)
+          IF(COKBAD(LINE(I:I),CHARB,NBAD).EQ.0) THEN
+            ISEQ = ISEQ + 1
+            IF(ISEQ.GT.MAXSEQ) THEN
+              WRITE(KBOUT,1001)MAXSEQ
+1001          FORMAT(' Maximum sequence length',I6,' exceeded')
+              RETURN
+            END IF
+            SEQ(ISEQ) = LINE(I:I)
+          END IF
+5       CONTINUE
+        GO TO 10
+100   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+200   CONTINUE
+      MAXSEQ = ISEQ
+      END
+      INTEGER FUNCTION COKBAD(CHAR,BADC,NBAD)
+      CHARACTER CHAR,BADC*(*)
+      COKBAD = 1
+      DO 10 I = 1,NBAD
+        IF(CHAR.EQ.BADC(I:I)) RETURN
+10    CONTINUE
+      COKBAD = 0
+      END
+C routines for handling embl cdrom format files
+C
+C RDLIB  opens cdrom format library and selects options
+C CDROMS performs a number of jobs on a cdrom format library:
+C        get a sequence, get annotations, get entryname from accession no
+C OCDLBS gets the file names and types of a cdrom format lib for use by CDROMS
+C CDROML performs a number of jobs on a cdrom format library relating
+C        to its use when library searching: read the next entry off the
+C        entryname file, and get the seq; get the next named seq; get
+C        the next seq not on a list of excluded seqs; get a named seq
+C OCDLBL opens a library for use by CDROML
+C RDLB0  reads a list of libraries, gets the names of their descriptor files
+C        and the prompt to appear on the screen
+C RDLB1  gets a list of the file names and file types for a particular lib
+      SUBROUTINE RDLB0(LTYPE,LOGNAM,PROMPT,MAXMEN,NAMLEN,MAXPRM,
+     +IDEVLS,FILNLL,LINE,MAXLIN,KBOUT,ITEM,IOK)
+      CHARACTER PROMPT(MAXMEN)*(*),LOGNAM(MAXMEN)*(*),LTYPE*(*)
+      CHARACTER LINE*(*),FILNLL*(*)
+C  Read file until the desired line is found
+       CALL OPENRS(IDEVLS,FILNLL,IOK,LRECL,2)
+       IF(IOK.NE.0) THEN
+         CALL ERROM(KBOUT,'Unable to open file of library names')
+         CALL ERROM(KBOUT,FILNLL)
+         RETURN
+      END IF
+      LINENO = 0
+      ITEM = 1
+C  Read and parse until end of data
+      CALL RDLB2(LTYPE,LOGNAM,PROMPT,MAXMEN,NAMLEN,MAXPRM,IDEVLS,
+     +LINE,MAXLIN,LINENO,ITEM,IOK)
+      IF(IOK.NE.0) 
+     +WRITE(KBOUT,*)'Error in library menu file on line',LINENO
+      ITEM = ITEM - 1
+      CLOSE(UNIT=IDEVLS)
+      END
+      SUBROUTINE RDLB2(LTYPE,LOGNAM,PROMPT,MAXMEN,NAMLEN,MAXPRM,
+     +IDEVM,LINE,MAXLIN,LINENO,ITEM,IOK)
+      CHARACTER PROMPT(MAXMEN)*(*),LOGNAM(MAXMEN)*(*),LTYPE*(*)
+      CHARACTER LINE*(*)
+      IOK = 0
+C  Read and parse until end of data
+C  
+10    CONTINUE
+      LINENO = LINENO + 1
+      READ(IDEVM,1000,ERR=100,END=200)LINE
+1000  FORMAT(A)
+        CALL RDLB3(LTYPE(ITEM:ITEM),LINE,MAXLIN,LOGNAM(ITEM),
+     +  PROMPT(ITEM),
+     +  NAMLEN,MAXPRM,IOK)
+        IF(IOK.EQ.0) THEN
+          ITEM = ITEM + 1
+        END IF
+      GO TO 10
+100   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading list of library files')
+      RETURN
+200   CONTINUE
+      IOK = 0
+      END
+      SUBROUTINE RDLB3(LTYPE,LINE,MAXLIN,LOGNAM,PROMPT,NAMLEN,
+     +MAXPRM,IOK)
+      CHARACTER LTYPE,LINE*(*),LOGNAM*(*),PROMPT*(*),COMMNT
+      PARAMETER (COMMNT = '!')
+      EXTERNAL NOTILR,NOTIRL
+C Extract LOGNAM, PROMPT
+      IOK = 1
+      MAXLN = INDEX(LINE,COMMNT) - 1
+      IF(MAXLN.EQ.-1) MAXLN = MAXLIN
+      IF(MAXLN.LT.6) RETURN
+      LOGS = NOTILR(LINE(2:),MAXLN-1,' ') + 1
+      IF(LOGS.EQ.MAXLN) RETURN
+      LOGE = INDEX(LINE(LOGS:),' ')
+      IF(LOGE.EQ.0) RETURN
+      LOGE = LOGE - 1
+      LOGEE = LOGE
+      LOGE = MIN(NAMLEN,LOGE)
+      LOGE = LOGE + LOGS - 1
+      LOGNAM = ' '
+      LOGNAM = LINE(LOGS:LOGE)
+      LOGEE = LOGEE + LOGS - 1
+      LOGS = NOTILR(LINE(LOGEE+1:),MAXLN,' ') + LOGEE
+      IF(LOGS.EQ.MAXLN) RETURN
+      LOGE = NOTIRL(LINE,MAXLN,' ')
+      IF(LOGE.EQ.0) RETURN
+      LOGE = MIN(MAXPRM,LOGE-LOGS+1) + LOGS - 1
+      PROMPT = ' '
+      PROMPT = LINE(LOGS:LOGE)
+      LTYPE = LINE(1:1)
+      IOK = 0
+      END
+      SUBROUTINE RDLB1(LIBNAM,MAXMEN,NAMLEN,IDEVLS,
+     +FILNAM,
+     +LINE,MAXLIN,FTYPE,LINENO,KBOUT,ITEM,IOK)
+      CHARACTER LIBNAM(MAXMEN)*(*)
+      CHARACTER LINE*(*),FTYPE*(*),FILNAM*(*)
+      IOK = 0
+C  ftype(i:i) stores the file type for libnam(i)
+C  and libnam(i) stores the name of a library file
+C  Read file until the desired line is found
+      FTYPE = ' '
+       CALL OPENRS(IDEVLS,FILNAM,IOK,LRECL,2)
+       IF(IOK.NE.0) THEN
+         CALL ERROM(KBOUT,'Unable to open file of library file names')
+         CALL ERROM(KBOUT,FILNAM)
+         RETURN
+      END IF
+      LINENO = 0
+      ITEM = 1
+C  Read and parse until end of data
+C  
+10    CONTINUE
+      LINENO = LINENO + 1
+      READ(IDEVLS,1000,ERR=100,END=200)LINE
+1000  FORMAT(A)
+        CALL RDLB4(LINE,MAXLIN,FTYPE(ITEM:ITEM),LIBNAM(ITEM),
+     +  NAMLEN,IOK)
+        IF(IOK.EQ.0) THEN
+          ITEM = ITEM + 1
+        END IF
+      GO TO 10
+100   CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error reading names of library files')
+      RETURN
+200   CONTINUE
+      ITEM = ITEM - 1
+      IOK = 0
+      END
+      SUBROUTINE RDLB4(LINE,MAXLIN,LTYPE,LIBNAM,NAMLEN,IOK)
+      CHARACTER LINE*(*),LTYPE,LIBNAM*(*),COMMNT
+      PARAMETER (COMMNT = '!')
+      EXTERNAL NOTILR
+C Extract LTYPE, LIBNAM
+      IOK = 1
+      MAXLN = INDEX(LINE,COMMNT) - 1
+      IF(MAXLN.EQ.-1) MAXLN = MAXLIN
+      IF(MAXLN.LT.6) RETURN
+      LOGS = NOTILR(LINE(2:),MAXLN-1,' ') + 1
+      IF(LOGS.EQ.MAXLN) RETURN
+      LOGE = INDEX(LINE(LOGS:),' ')
+      IF(LOGE.EQ.0) RETURN
+      LOGE = LOGE - 1
+      LOGEE = LOGE
+      LOGE = MIN(NAMLEN,LOGE)
+      LOGE = LOGE + LOGS - 1
+      LIBNAM = ' '
+      LIBNAM = LINE(LOGS:LOGE)
+      LTYPE = LINE(1:1)
+      IOK = 0
+      END
+      SUBROUTINE RDLIB(SEQ,IDIM,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBIN,LIBLF,WORKI,MAXWOR,IOK)
+C   AUTHOR RODGER STADEN
+C IDEVLL = unit for library list LIBLF
+C IDEVOT = unit for output of results 
+C IDEVEN =          entry name file ENAMEF
+C IDEVAN = unit for accession number files ANUMTF ANUMHF and keywords
+C IDEVDL =          division lookup file DIVLUF
+C IDEVLF =          actual library data files DATAF
+C note to me: there is really no need to pass all these character strings
+C down from main: you have to declare the size here anyway. So sort it out!
+      PARAMETER (MAXNAM = 80,LENNAM = 10, MAXKWD = 5)
+      PARAMETER (MAXPRM = 50,MAXMEN=20,MAXLIN=80,MAXLBF=9)
+      CHARACTER*(MAXNAM) ENAMEF,DIVLUF,DATAF,ANUMTF,ANUMHF
+      CHARACTER*(MAXNAM) AUTHHF,AUTHTF
+      CHARACTER FILNAM*(*),LIBLF*(*)
+      CHARACTER SEQ(IDIM),NAMIN*10,NEWNAM*10
+      CHARACTER HELPF*(*),ACNUM*10
+      CHARACTER PROMPT(MAXMEN)*(MAXPRM),LOGNAM(MAXMEN)*(MAXNAM)
+      CHARACTER FTYPE*(MAXLBF),LINE*(MAXLIN),LIBNAM(MAXLBF)*(MAXNAM)
+      CHARACTER LTYPE*(MAXMEN)
+      CHARACTER*(MAXNAM) BRIEFF,KWRDTF,KWRDHF
+      CHARACTER KEYWDS*80,TITLE*80
+      INTEGER WORKI(MAXWOR)
+      INTEGER KWS(MAXKWD),KWE(MAXKWD)
+      INTEGER DIVCOD,ANNOFF,SEQOFF
+      EXTERNAL NOTRL,NMMTCH
+      IDIMIN = IDIM
+      IDIM = 0
+      NAMIN = ' '
+      ACNUM = ' '
+      NAMIN = ' '
+      LIB = LIBIN
+1     CONTINUE
+      CALL RDLB0(LTYPE,LOGNAM,PROMPT,MAXMEN,MAXNAM,MAXPRM,
+     +IDEVLL,LIBLF,LINE,MAXLIN,KBOUT,ITEM,IOK)
+      IF(IOK.NE.0) RETURN
+      IOK = 1
+      CALL RADION('Select a library',PROMPT,ITEM,LIB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(LIB.LT.1) RETURN
+      FILNAM = LOGNAM(LIB)
+      CALL RDLB1(LIBNAM,MAXLBF,MAXNAM,IDEVLL,FILNAM,
+     +LINE,MAXLIN,FTYPE,LINENO,KBOUT,ITEM,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(LTYPE(LIB:LIB).EQ.'A') THEN
+        WRITE(KBOUT,*)'Library is in EMBL format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE IF(LTYPE(LIB:LIB).EQ.'B') THEN
+        WRITE(KBOUT,*)'Library is in CODATA format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE IF(LTYPE(LIB:LIB).EQ.'C') THEN
+        WRITE(KBOUT,*)'Library is in GenBank format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE
+        WRITE(KBOUT,*)'Unknown library type'
+        RETURN
+      END IF
+2     CONTINUE
+      IOPT = 1
+      PROMPT(1) = 'Get a sequence'
+      PROMPT(2) = 'Get annotations'
+      PROMPT(3) = 'Get entry names from accession numbers'
+      PROMPT(4) = 'Search author index'
+      PROMPT(5) = 'Search text index for keywords'
+      CALL RADION('Select a task',PROMPT,5,IOPT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) GO TO 1
+3     CONTINUE
+      IF((IOPT.EQ.1).OR.(IOPT.EQ.2)) THEN
+C  get seq or annot
+        LENGTH = NOTRL(NAMIN,LENNAM,' ')
+        CALL GTSTR('Entry name',NAMIN,NEWNAM,
+     +  LENGTH,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) GO TO 2
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+        IF(LENGTH.GT.0) NAMIN = NEWNAM
+        CALL CCASE(NAMIN,1)
+        CALL CDROMS(IOPT,NAMIN,ACNUM,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,DATAF,BRIEFF,KWRDTF,KWRDHF,
+     +  IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +  KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDIMIN,IDEVOT,
+     +  KEYWDS,KWS,KWE,MAXKEY,LTYPE(LIB:LIB),WORKI,MAXWOR,TITLE,IOK)
+        IF((IOPT.EQ.1).AND.(IOK.EQ.0)) IDIM = IDIMIN
+        FILNAM = NAMIN
+        CLOSE(UNIT=IDEVEN)
+        CLOSE(UNIT=IDEVAN)
+        CLOSE(UNIT=IDEVLF)
+        CLOSE(UNIT=IDEVDL)
+      ELSE IF(IOPT.EQ.3) THEN
+ 4      CONTINUE
+        LENGTH = NOTRL(ACNUM,LENNAM,' ')
+        CALL GTSTR('Accession number',ACNUM,NEWNAM,
+     +  LENGTH,KBOUT,KBIN,INFLAG)
+        IF(INFLAG.EQ.2) GO TO 2
+        IF(INFLAG.EQ.1) THEN
+          CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          GO TO 3
+        END IF
+        IF(LENGTH.GT.0) ACNUM = NEWNAM
+        CALL CCASE(ACNUM,1)
+        CALL CDROMS(IOPT,NAMIN,ACNUM,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,DATAF,BRIEFF,KWRDTF,KWRDHF,
+     +  IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +  KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDIMIN,IDEVOT,
+     +  KEYWDS,KWS,KWE,MAXKEY,LTYPE(LIB:LIB),WORKI,MAXWOR,TITLE,IOK)
+        CLOSE(UNIT=IDEVEN)
+        CLOSE(UNIT=IDEVAN)
+        CLOSE(UNIT=IDEVLF)
+        CLOSE(UNIT=IDEVDL)
+      ELSE IF(IOPT.EQ.4) THEN
+        NKEYS = MAXKWD
+        CALL SRCKEZ(KBIN,KBOUT,KEYWDS,KWS,KWE,NKEYS,'Authors',IOK)
+        IF(IOK.NE.0) GO TO 2
+        IF(NKEYS.LT.1) GO TO 2
+        CALL CDROMS(IOPT,NAMIN,ACNUM,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,DATAF,BRIEFF,KWRDTF,KWRDHF,
+     +  IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +  KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDIMIN,IDEVOT,
+     +  KEYWDS,KWS,KWE,NKEYS,LTYPE(LIB:LIB),WORKI,MAXWOR,TITLE,IOK)
+        CLOSE(UNIT=IDEVEN)
+        GO TO 2
+      ELSE IF(IOPT.EQ.5) THEN
+        NKEYS = MAXKWD
+        CALL SRCKEZ(KBIN,KBOUT,KEYWDS,KWS,KWE,NKEYS,'Keywords',IOK)
+        IF(IOK.NE.0) GO TO 2
+        IF(NKEYS.LT.1) GO TO 2
+        CALL CDROMS(IOPT,NAMIN,ACNUM,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,DATAF,BRIEFF,KWRDTF,KWRDHF,
+     +  IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +  KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDIMIN,IDEVOT,
+     +  KEYWDS,KWS,KWE,NKEYS,LTYPE(LIB:LIB),WORKI,MAXWOR,TITLE,IOK)
+        CLOSE(UNIT=IDEVEN)
+        GO TO 2
+      END IF
+      IF(IOPT.EQ.1) RETURN
+      GO TO 2
+      END
+      SUBROUTINE RDLIBB(SEQ,IDIM,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVOT,IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBIN,LIBLF,WORKI,MAXWOR,IOK)
+C   AUTHOR RODGER STADEN
+C routine for lip
+C IDEVLL = unit for library list LIBLF
+C IDEVOT = unit for output of results 
+C IDEVEN =          entry name file ENAMEF
+C IDEVAN = unit for accession number files ANUMTF ANUMHF and keywords
+C IDEVDL =          division lookup file DIVLUF
+C IDEVLF =          actual library data files DATAF
+C note to me: there is really no need to pass all these character strings
+C down from main: you have to declare the size here anyway. So sort it out!
+      PARAMETER (MAXNAM = 80,LENNAM = 10, MAXKWD = 5)
+      PARAMETER (MAXPRM = 50,MAXMEN=20,MAXLIN=80,MAXLBF=9)
+      CHARACTER*(MAXNAM) ENAMEF,DIVLUF,DATAF,ANUMTF,ANUMHF
+      CHARACTER*(MAXNAM) AUTHHF,AUTHTF
+      CHARACTER FILNAM*(*),LIBLF*(*)
+      CHARACTER SEQ(IDIM),NAMIN*10,NEWNAM*14
+      CHARACTER HELPF*(*),ACNUM*10
+      CHARACTER PROMPT(MAXMEN)*(MAXPRM),LOGNAM(MAXMEN)*(MAXNAM)
+      CHARACTER FTYPE*(MAXLBF),LINE*(MAXLIN),LIBNAM(MAXLBF)*(MAXNAM)
+      CHARACTER LTYPE*(MAXMEN)
+      CHARACTER*(MAXNAM) BRIEFF,KWRDTF,KWRDHF
+      CHARACTER KEYWDS*80,TITLE*80,EXT*4
+      INTEGER WORKI(MAXWOR)
+      INTEGER KWS(MAXKWD),KWE(MAXKWD)
+      INTEGER DIVCOD,ANNOFF,SEQOFF,GNFFOF
+      EXTERNAL NOTRL,NMMTCH,GNFFOF
+      NAMIN = ' '
+      ACNUM = ' '
+      NAMIN = ' '
+      LIB = LIBIN
+1     CONTINUE
+      CALL RDLB0(LTYPE,LOGNAM,PROMPT,MAXMEN,MAXNAM,MAXPRM,
+     +IDEVLL,LIBLF,LINE,MAXLIN,KBOUT,ITEM,IOK)
+      IF(IOK.NE.0) RETURN
+      IOK = 1
+      CALL RADION('Select a library',PROMPT,ITEM,LIB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(LIB.LT.1) RETURN
+      FILNAM = LOGNAM(LIB)
+      CALL RDLB1(LIBNAM,MAXLBF,MAXNAM,IDEVLL,FILNAM,
+     +LINE,MAXLIN,FTYPE,LINENO,KBOUT,ITEM,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(LTYPE(LIB:LIB).EQ.'A') THEN
+        WRITE(KBOUT,*)'Library is in EMBL format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE IF(LTYPE(LIB:LIB).EQ.'B') THEN
+        WRITE(KBOUT,*)'Library is in CODATA format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE IF(LTYPE(LIB:LIB).EQ.'C') THEN
+        WRITE(KBOUT,*)'Library is in GenBank format with indexes'
+        CALL OCDLBS(LIBNAM,MAXNAM,ITEM,FTYPE,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +  AUTHHF,AUTHTF,IOK)
+      ELSE
+        WRITE(KBOUT,*)'Unknown library type'
+        RETURN
+      END IF
+2     CONTINUE
+      PROMPT(1) = 'Sequence only in FASTA format'
+      PROMPT(2) = 'Sequence only in STADEN format'
+      PROMPT(3) = 'Annotation only'
+      PROMPT(4) = 'Complete entries'
+      JOPT = 1
+      CALL RADION('Select a format',PROMPT,4,JOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(JOPT.LT.1) RETURN
+C      IOPT = 2
+        FILNAM = ' '
+        CALL OPENF1(IDEVLL,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of entry names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        IF (JOPT.EQ.1) THEN
+          EXT = '.seq'
+          IOPT = 1
+        ELSE IF (JOPT.EQ.2) THEN
+          EXT = '.SEQ'
+          IOPT = 1
+        ELSE IF (JOPT.EQ.3) THEN
+          EXT = '.txt'
+          IOPT = 2
+        ELSE IF (JOPT.EQ.4) THEN
+          EXT = '.all'
+          IOPT = 6
+        END IF
+3     CONTINUE
+        IOK = GNFFOF(IDEVLL,NAMIN)
+        IF (IOK.EQ.1) GO TO 1
+        IF (IOK.NE.0) GO TO 3
+        NEWNAM = NAMIN
+        K = INDEX(NEWNAM,' ')
+        NEWNAM(K:K+3) = EXT
+        CALL OPENRS(IDEVOT,NEWNAM,IOK,LRECL,1)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Unable to open file')
+          CALL ERROM(KBOUT,NEWNAM)
+        ELSE
+        CALL CCASE(NAMIN,1)
+        IDIMIN = IDIM
+        CALL CDROMS(IOPT,NAMIN,ACNUM,
+     +  ENAMEF,DIVLUF,ANUMTF,ANUMHF,DATAF,BRIEFF,KWRDTF,KWRDHF,
+     +  IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +  KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDIMIN,IDEVOT,
+     +  KEYWDS,KWS,KWE,MAXKEY,LTYPE(LIB:LIB),WORKI,MAXWOR,TITLE,IOK)
+        CLOSE(UNIT=IDEVEN)
+        CLOSE(UNIT=IDEVAN)
+        CLOSE(UNIT=IDEVLF)
+        CLOSE(UNIT=IDEVDL)
+        IF (JOPT.EQ.1) THEN
+          CALL WRITFF(IDEVOT,SEQ,IDIMIN,NAMIN,TITLE)
+        ELSE IF (JOPT.EQ.2) THEN
+          WRITE(IDEVOT,1001)NAMIN
+          WRITE(IDEVOT,1001)TITLE
+ 1001     FORMAT(';',A)
+          CALL FMTDKN(IDEVOT,SEQ,IDIMIN)
+        END IF
+        CLOSE(UNIT=IDEVOT)
+      END IF
+      GO TO 3
+      END
+      SUBROUTINE OCDLBS(LIBNAM,NAMLEN,ITEMS,FTYPE,
+     +ENAMEF,DIVLUF,ANUMTF,ANUMHF,BRIEFF,KWRDTF,KWRDHF,
+     +AUTHHF,AUTHTF,IOK)
+C Assigns libnam names to file name strings
+      CHARACTER LIBNAM(ITEMS)*(*),AUTHHF*(*),AUTHTF*(*)
+      CHARACTER ENAMEF*(*),DIVLUF*(*),BRIEFF*(*),KWRDTF*(*)
+      CHARACTER ANUMTF*(*),ANUMHF*(*),FTYPE*(*),KWRDHF*(*)
+      ENAMEF = ' '
+      DIVLUF = ' '
+      ANUMTF = ' '
+      ANUMHF = ' '
+      BRIEFF = ' '
+      KWRDTF = ' '
+      KWRDHF = ' '
+      AUTHTF = ' '
+      AUTHHF = ' '
+      I = INDEX(FTYPE,'A')
+      IF(I.NE.0) DIVLUF = LIBNAM(I)
+      I = INDEX(FTYPE,'B')
+      IF(I.NE.0) ENAMEF = LIBNAM(I)
+      I = INDEX(FTYPE,'C')
+      IF(I.NE.0) ANUMTF = LIBNAM(I)
+      I = INDEX(FTYPE,'D')
+      IF(I.NE.0) ANUMHF = LIBNAM(I)
+      I = INDEX(FTYPE,'E')
+      IF(I.NE.0) BRIEFF = LIBNAM(I)
+      I = INDEX(FTYPE,'F')
+      IF(I.NE.0) KWRDTF = LIBNAM(I)
+      I = INDEX(FTYPE,'G')
+      IF(I.NE.0) KWRDHF = LIBNAM(I)
+      I = INDEX(FTYPE,'H')
+      IF(I.NE.0) AUTHTF = LIBNAM(I)
+      I = INDEX(FTYPE,'I')
+      IF(I.NE.0) AUTHHF = LIBNAM(I)
+      END
+      SUBROUTINE CDROMS(JOB,ENAME,ACNUM,
+     +ENAMEF,DIVLUF,ANUMTF,ANUMHF,LIBF,BRIEFF,KWRDTF,KWRDHF,
+     +IDEVEN,IDEVAN,IDEVDL,IDEVLF,KBIN,AUTHHF,AUTHTF,
+     +KBOUT,DIVCOD,SEQOFF,ANNOFF,SEQ,IDSEQ,IDEVOT,
+     +KEYS,SS,SE,NKEYS,LTYPE,WORKI,MAXWOR,TITLE,IOK)
+      CHARACTER ENAMEF*(*),DIVLUF*(*),LIBF*(*),KWRDTF*(*)
+      CHARACTER ANUMTF*(*),ANUMHF*(*),BRIEFF*(*),KWRDHF*(*)
+      CHARACTER ENAME*(*),ACNUM*(*),BARRAY*80,TERMA*5,TIT*5
+      CHARACTER SEQ(IDSEQ)
+      INTEGER DIVCOD,ANNOFF,SEQOFF,ENTFN,RDANUM,RSIZEN,OPENFU
+      INTEGER CDSEQ,CDANN,CDANNT,OPENFF,SEEKRS,RDKWRD,CODSEQ
+      CHARACTER KEYS*(*),LTYPE*1,AUTHHF*(*),AUTHTF*(*),TITLE*(*)
+      INTEGER SS(NKEYS),SE(NKEYS),GENSEQ,WORKI(MAXWOR)
+      EXTERNAL OPENFF,SEEKRS,IENAME,RDKWRD,CODSEQ,GENSEQ
+      EXTERNAL ENTFN,RDANUM,IHEAD,OPENFU,CDSEQ,CDANN,CDANNT
+C
+C read lib data from cdrom format
+C all files are opened and used as required
+C job = 1 get a sequence ENAME and display its title
+C job = 2 get annotation ENAME and display it
+C job = 3 find entry names from accession number ACNUM
+C job = 4 search author index for keywords
+C job = 5 search keyword index for keywords
+C job = 6 get complete entry ENAME
+C
+C
+C  LTYPE = A means embl, B means codata, C means GenBank
+C
+      IF(LTYPE.EQ.'A') THEN
+        TERMA = 'SQ   '
+        TIT = 'DE   '
+      ELSE IF(LTYPE.EQ.'B') THEN
+        TERMA = 'SEQUE'
+        TIT = 'TITLE'
+      ELSE IF(LTYPE.EQ.'C') THEN
+        TERMA = 'ORIGI'
+        TIT = 'DEFIN'
+      END IF
+C
+C open entryname file
+C
+      IOK = OPENFU(IDEVEN,ENAMEF)
+      IF(IOK.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening entry names index')
+        CALL ERROM(KBOUT,ENAMEF)
+        RETURN
+      END IF
+      IOK = IHEAD(IDEVEN,BARRAY,NRECEN,RSIZEN)
+      IF(IOK.NE.0) THEN
+        CALL ERROM(KBOUT,'Error reading entry name header')
+        CALL ERROM(KBOUT,ENAMEF)
+        RETURN
+      END IF
+      IF(JOB.EQ.1) THEN
+C
+C get a sequence
+C
+C
+C open division lookup file
+C
+        IOK = OPENFF(IDEVDL,DIVLUF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C get offsets and divcode
+C
+        IOK = IENAME(IDEVEN,NRECEN,RSIZEN,ENAME,ANNOFF,SEQOFF,
+     +  DIVCOD,BARRAY)
+        IF(IOK.NE.0) THEN
+          WRITE(KBOUT,*)ENAME,' not found'
+          RETURN
+        END IF
+C
+C get division file name
+C
+        IOK = ENTFN(IDEVDL,DIVCOD,LIBF,BARRAY)
+        CLOSE(UNIT=IDEVDL)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C open division
+C
+        IOK = OPENFF(IDEVLF,LIBF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division file')
+          CALL ERROM(KBOUT,LIBF)
+          RETURN
+        END IF
+C
+C read seq, preceded by title
+C
+        IOK = CDANNT(IDEVLF,ANNOFF,BARRAY,TIT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error locating title')
+          RETURN
+        END IF
+        WRITE(KBOUT,*)BARRAY
+        TITLE = BARRAY
+        IOK = 99
+        IF(LTYPE.EQ.'A') THEN
+          IOK = CDSEQ(IDEVLF,SEQOFF,SEQ,IDSEQ,LTYPE)
+        ELSE IF(LTYPE.EQ.'B') THEN
+          IOK = CODSEQ(IDEVLF,SEQOFF,SEQ,IDSEQ,LTYPE)
+        ELSE IF(LTYPE.EQ.'C') THEN
+          IOK = GENSEQ(IDEVLF,SEQOFF,SEQ,IDSEQ,LTYPE)
+        END IF
+        IF(IOK.EQ.0) RETURN
+        IF(IOK.EQ.2) THEN
+          WRITE(KBOUT,*)
+     +    'Only first ',IDSEQ,' characters of sequence read'
+          IOK = 0
+        END IF
+        RETURN
+      ELSE IF(JOB.EQ.2) THEN
+C
+C open division lookup file
+C
+        IOK = OPENFF(IDEVDL,DIVLUF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C get offsets and divcode
+C
+        IOK = IENAME(IDEVEN,NRECEN,RSIZEN,ENAME,ANNOFF,SEQOFF,
+     +  DIVCOD,BARRAY)
+        IF(IOK.NE.0) THEN
+          WRITE(KBOUT,*)ENAME,' not found'
+          RETURN
+        END IF
+C
+C get division file name
+C
+        IOK = ENTFN(IDEVDL,DIVCOD,LIBF,BARRAY)
+        CLOSE(UNIT=IDEVDL)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C open division
+C
+        IOK = OPENFF(IDEVLF,LIBF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division file')
+          CALL ERROM(KBOUT,LIBF)
+          RETURN
+        END IF
+C
+C read ann
+C
+        IOK = CDANN(IDEVLF,ANNOFF,BARRAY,IDEVOT,KBIN,KBOUT,TERMA)
+        IF(IOK.EQ.2) IOK = 0
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading annotation file')
+        END IF
+        RETURN
+C
+C deal with accession number start point
+C
+      ELSE IF(JOB.EQ.3) THEN
+C
+C read accession number
+C
+        IOK = RDANUM(IDEVAN,ANUMTF,ANUMHF,IDEVEN,NRECEN,RSIZEN,
+     +  ACNUM,BARRAY,ENAME,IDEVOT,KBOUT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading accession number files')
+          RETURN
+        END IF
+C
+C Do author search
+C
+      ELSE IF(JOB.EQ.4) THEN
+C
+C 
+C
+        MAXW2 = 1+(NRECEN-1)/32
+        IF(MAXWOR/2.LT.MAXW2) THEN
+          CALL ERROM(KBOUT,'Too many entries for bit files')
+          RETURN
+        END IF
+        IOK = RDKWRD(IDEVAN,AUTHTF,AUTHHF,IDEVEN,NRECEN,RSIZEN,
+     +  KEYS,SS,SE,NKEYS,BARRAY,ENAME,IDEVOT,KBIN,KBOUT,BRIEFF,
+     +  WORKI,WORKI(MAXW2+1),MAXW2)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error searching author index for keywords')
+          RETURN
+        END IF
+C
+C Do text search
+C
+      ELSE IF(JOB.EQ.5) THEN
+C
+C 
+C
+        MAXW2 = 1+(NRECEN-1)/32
+        IF(MAXWOR/2.LT.MAXW2) THEN
+          CALL ERROM(KBOUT,'Too many entries for bit files')
+          RETURN
+        END IF
+        IOK = RDKWRD(IDEVAN,KWRDTF,KWRDHF,IDEVEN,NRECEN,RSIZEN,
+     +  KEYS,SS,SE,NKEYS,BARRAY,ENAME,IDEVOT,KBIN,KBOUT,BRIEFF,
+     +  WORKI,WORKI(MAXW2+1),MAXW2)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error searching text index for keywords')
+          RETURN
+        END IF
+      ELSE IF(JOB.EQ.6) THEN
+C
+C open division lookup file
+C
+        IOK = OPENFF(IDEVDL,DIVLUF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C get offsets and divcode
+C
+        IOK = IENAME(IDEVEN,NRECEN,RSIZEN,ENAME,ANNOFF,SEQOFF,
+     +  DIVCOD,BARRAY)
+        IF(IOK.NE.0) THEN
+          WRITE(KBOUT,*)ENAME,' not found'
+          RETURN
+        END IF
+C
+C get division file name
+C
+        IOK = ENTFN(IDEVDL,DIVCOD,LIBF,BARRAY)
+        CLOSE(UNIT=IDEVDL)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading division lookup file')
+          CALL ERROM(KBOUT,DIVLUF)
+          RETURN
+        END IF
+C
+C open division
+C
+        IOK = OPENFF(IDEVLF,LIBF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division file')
+          CALL ERROM(KBOUT,LIBF)
+          RETURN
+        END IF
+C
+C read ann
+C
+        IOK = CDANN(IDEVLF,ANNOFF,BARRAY,IDEVOT,KBIN,KBOUT,'//')
+        IF(IOK.EQ.2) IOK = 0
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading annotation file')
+        END IF
+        RETURN
+      END IF
+      END
+      SUBROUTINE SRCKEY(IDEVIN,IDEVOT,KBIN,KBOUT,IOK)
+      CHARACTER STRING*80,NEW*80
+      INTEGER J1(5),J2(5)
+      INTEGER NOTLR
+      EXTERNAL NOTLR
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Search for keywords')
+10    CONTINUE
+      LENGTH = 0
+      STRING = ' '
+      NEW = ' '
+      CALL GTSTR('Keywords',STRING,NEW,LENGTH,KBOUT,KBIN,INFLAG)
+      LENGTH = 80
+      IF(INFLAG.NE.0) RETURN
+      STRING = NEW
+C FIND SEPARATE WORDS
+      I = 0
+      I2 = 0
+20    CONTINUE
+        I = I + 1
+        IF(I.LT.6)THEN
+          I1 = I2 + 1
+          LEFT = LENGTH - I1 + 1
+          IF(LEFT.GT.0)THEN
+            IT = NOTLR(STRING(I1:LENGTH),LEFT,' ')
+            IF(IT.NE.0)THEN
+              I1 = I1 + IT - 1
+              IT = INDEX(STRING(I1:LENGTH),' ')
+              I2 = I1 + IT - 2
+              J1(I) = I1
+              J2(I) = I2
+              I2 = J2(I)
+              IF(J2(I).LT.LENGTH) GO TO 20
+            END IF
+            GO TO 15
+          END IF
+          GO TO 15
+        END IF
+15    CONTINUE
+      I = I - 1
+      IF(I.GT.0) THEN
+        CALL SRCTTL(IDEVIN,IDEVOT,STRING,J1,J2,I,KBIN,KBOUT,IOK)
+      END IF
+      END
+      SUBROUTINE SRCKEZ(KBIN,KBOUT,STRING,J1,J2,I,PROMPT,IOK)
+      CHARACTER STRING*80,NEW*80,UNDER,SPACE,PROMPT*(*)
+      INTEGER J1(5),J2(5)
+      INTEGER NOTLR
+      EXTERNAL NOTLR
+      SAVE UNDER,SPACE
+      DATA UNDER/'_'/,SPACE/' '/
+      WRITE(KBOUT,1000)PROMPT
+1000  FORMAT(' Search for ',A)
+10    CONTINUE
+      LENGTH = 0
+      STRING = ' '
+      NEW = ' '
+      CALL GTSTR(PROMPT,STRING,NEW,LENGTH,KBOUT,KBIN,INFLAG)
+      LENGTH = 80
+      IF(INFLAG.NE.0) RETURN
+      STRING = NEW
+C FIND SEPARATE WORDS
+      I = 0
+      I2 = 0
+20    CONTINUE
+        I = I + 1
+        IF(I.LT.6)THEN
+          I1 = I2 + 1
+          LEFT = LENGTH - I1 + 1
+          IF(LEFT.GT.0)THEN
+            IT = NOTLR(STRING(I1:LENGTH),LEFT,' ')
+            IF(IT.NE.0)THEN
+              I1 = I1 + IT - 1
+              IT = INDEX(STRING(I1:LENGTH),' ')
+              I2 = I1 + IT - 2
+              J1(I) = I1
+              J2(I) = I2
+              I2 = J2(I)
+              IF(J2(I).LT.LENGTH) GO TO 20
+            END IF
+            GO TO 15
+          END IF
+          GO TO 15
+        END IF
+15    CONTINUE
+C
+C  keywords are in upper case
+C
+      CALL CCASE(STRING,1)
+C
+C  change uinderscore to space
+C
+      CALL EXCHNG(STRING,UNDER,SPACE)
+      I = I - 1
+C      IF(I.GT.0) I = 1
+      IOK = 0
+      END
+      SUBROUTINE EXCHNG(STRING,FROM,TO)
+      CHARACTER STRING*(*),FROM,TO
+      DO 10 I=1,LEN(STRING)
+        IF (STRING(I:I).EQ.FROM) STRING(I:I) = TO
+ 10     CONTINUE
+      END
+      SUBROUTINE SRCTTL(IDEVIN,IDEVOT,STRING,J1,J2,NSTRNG,
+     +KBIN,KBOUT,IOK)
+      CHARACTER LINE*80, STRING*(*),NAME*10,SLEN*4,LLINE*104
+      INTEGER J1(NSTRNG),J2(NSTRNG),GETRS
+      EXTERNAL GETRS
+      EQUIVALENCE (LLINE(1:1),NAME),(LLINE(25:104),LINE)
+      EQUIVALENCE(LLINE(21:24),SLEN)
+      IOK = 0
+      JPAGE = 0
+      IFOUND = 0
+      WRITE(KBOUT,1004)STRING(1:J2(NSTRNG))
+1004  FORMAT(' Searching for ',A)
+      CALL CCASE(STRING,2)
+1001  FORMAT(A,' ',I7,' ',A)
+ 1002 FORMAT(' ',A,' ',I7,' ',A)
+      IBYTE = 301
+10    CONTINUE
+      IOK = GETRS(IDEVIN,LLINE,104,IBYTE)
+      IF(IOK.NE.0) GO TO 30
+      IBYTE = IBYTE + 104
+        CALL CCASE(LINE,2)
+        DO 5 I = 1,NSTRNG
+          IF(INDEX(LINE,STRING(J1(I):J2(I))).EQ.0)GO TO 10
+5       CONTINUE
+        IFOUND = IFOUND + 1
+        CALL BSW4(SLEN,LENS)
+        IF(IDEVOT.EQ.KBOUT)THEN
+          CALL PAGER(KBIN,KBOUT,JPAGE,IOK)
+          IF (IOK.NE.0) RETURN
+        END IF
+        IF(IDEVOT.EQ.KBOUT) THEN
+          WRITE(IDEVOT,1002,ERR=20)NAME,LENS,LINE
+        ELSE
+          WRITE(IDEVOT,1001,ERR=20)NAME,LENS,LINE
+        END IF
+        GO TO 10
+20    CONTINUE
+      IOK = 1
+      RETURN
+30    CONTINUE
+      WRITE(KBOUT,1003)IFOUND
+1003  FORMAT(' ',I7,' entries found')
+      CALL BPAUSE(KBIN,KBOUT,IEX)
+      END
+      SUBROUTINE PAGER(KBIN,KBOUT,JPAGE,IOK)
+      PARAMETER (IPAGE = 22)
+      JPAGE = JPAGE + 1
+      IF(JPAGE.EQ.IPAGE)THEN
+        CALL BPAUSE(KBIN,KBOUT,IOK)
+        IF(IOK.NE.0)RETURN
+        JPAGE = 0
+      END IF
+      IOK = 0
+      END
+      SUBROUTINE RDLIBL(FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVLL,IDEVEN,IDEVNL,
+     +LIBLF,LIBIN,DIVDEV,MAXDIV,IDEVD,
+     +LIST,ENAMEL,LIBTYP,LTYPEP,NDIV,RSIZEN,NRECEN,IOK)
+C   AUTHOR RODGER STADEN
+C IDEVLL = unit for library list LIBLF
+C IDEVEN =          entry name file ENAMEF
+C IDEVD    unit for division lookup file, and start for divdev
+      CHARACTER FILNAM*(*),LIBLF*(*)
+      CHARACTER HELPF*(*),LTYPEP
+      PARAMETER (MAXNAM = 80,LENNAM = 10)
+      PARAMETER (MAXPRM = 50,MAXMEN=20,MAXLIN=80)
+      CHARACTER PROMPT(MAXMEN)*(MAXPRM),LOGNAM(MAXMEN)*(MAXNAM)
+      CHARACTER FTYPE*(MAXMEN),LINE*(MAXLIN)
+      CHARACTER LTYPE*(MAXMEN),ENAMEL*(LENNAM)
+      INTEGER RSIZEN
+      INTEGER DIVDEV(MAXDIV)
+      IDIMIN = IDIM
+      IDIM = 0
+      LIB = LIBIN
+1     CONTINUE
+      CALL RDLB0(LTYPE,LOGNAM,PROMPT,MAXMEN,MAXNAM,MAXPRM,
+     +IDEVLL,LIBLF,LINE,MAXLIN,KBOUT,ITEM,IOK)
+      IF(IOK.NE.0) RETURN
+      IOK = 1
+      IF(ITEM.LT.MAXMEN) THEN
+        ITEM = ITEM + 1
+        PROMPT(ITEM) = 'Personal file in PIR format'
+      END IF
+      IF(ITEM.LT.MAXMEN) THEN
+        ITEM = ITEM + 1
+        PROMPT(ITEM) = 'Personal file in FASTA format'
+      END IF
+      CALL RADION('Select a library',PROMPT,ITEM,LIB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(LIB.LT.1) RETURN
+      IF(LIB.EQ.ITEM-1) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVEN,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Personal library in PIR format',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        LIBTYP = 2
+      ELSE IF(LIB.EQ.ITEM) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVEN,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'Personal library in FASTA format',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        LIBTYP = 3
+      ELSE
+        FILNAM = LOGNAM(LIB)
+        CALL RDLB1(LOGNAM,MAXMEN,MAXNAM,IDEVLL,FILNAM,
+     +  LINE,MAXLIN,FTYPE,LINENO,KBOUT,ITEM,IOK)
+        IF(IOK.NE.0) RETURN
+        LTYPEP = LTYPE(LIB:LIB)
+        IF(LTYPEP.EQ.'A') THEN
+          LIBTYP = 1
+          WRITE(KBOUT,*)'Library is in EMBL format with indexes'
+          CALL OCDLBL(LOGNAM,MAXNAM,FTYPE,FILNAM,DIVDEV,
+     +    MAXDIV,NDIV,IDEVEN,RSIZEN,NRECEN,IDEVD,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+        ELSE IF(LTYPEP.EQ.'B') THEN
+          LIBTYP = 1
+          WRITE(KBOUT,*)'Library is in CODATA format with indexes'
+          CALL OCDLBL(LOGNAM,MAXNAM,FTYPE,FILNAM,DIVDEV,
+     +    MAXDIV,NDIV,IDEVEN,RSIZEN,NRECEN,IDEVD,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+        ELSE IF(LTYPE(LIB:LIB).EQ.'C') THEN
+          LIBTYP = 1
+          WRITE(KBOUT,*)'Library is in GenBank format with indexes'
+          CALL OCDLBL(LOGNAM,MAXNAM,FTYPE,FILNAM,DIVDEV,
+     +    MAXDIV,NDIV,IDEVEN,RSIZEN,NRECEN,IDEVD,KBOUT,IOK)
+          IF(IOK.NE.0) RETURN
+        ELSE
+          WRITE(KBOUT,*)'Unknown library type'
+          IOK = 1
+        RETURN
+        END IF
+      END IF
+2     CONTINUE
+      IOPT = 1
+      PROMPT(1) = 'Search whole library'
+      PROMPT(2) = 'Search only a list of entries'
+      PROMPT(3) = 'Search all but a list of entries'
+      CALL RADION('Select a task',PROMPT,3,IOPT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IOPT.LT.1) GO TO 1
+      IF((IOPT.EQ.2).OR.(IOPT.EQ.3)) THEN
+        FILNAM = ' '
+        CALL OPENF1(IDEVNL,FILNAM,0,IOK,KBIN,KBOUT,
+     +  'File of entry names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0)RETURN
+        IF(IOPT.EQ.3) THEN
+C need to read in the first name for exclusion
+C the rest handled by cdroml
+          READ(IDEVNL,1000,ERR=100,END=200)ENAMEL
+ 1000     FORMAT(A)
+        END IF
+      END IF
+      IFINEX = 0
+      LIST = 0
+      IF(IOPT.EQ.2) LIST = 1
+      IF(IOPT.EQ.3) LIST = -1
+      IOK = 0
+      RETURN
+ 100  CONTINUE
+      IOK = 1
+      CALL ERROM(KBOUT,'Error in names file')
+      RETURN
+ 200  CONTINUE
+      IOK = 2
+      CALL ERROM(KBOUT,'Empty names file')
+      END
+      SUBROUTINE OCDLBL(LIBNAM,MAXLBF,FTYPE,TEMPF,DIVDEV,
+     +MAXDIV,NDIV,IDEVEN,RSIZEN,NRECEN,IDEVD,KBOUT,IOK)
+C Assigns libnam names to file name strings, opens division lookup file,
+C reads names of each division and opens the files
+C seeks to first record in entry names file
+      CHARACTER LIBNAM(MAXLBF)*(*)
+      CHARACTER TEMPF*(*),BARRAY*10
+      CHARACTER FTYPE*(*)
+      INTEGER DIVDEV(MAXDIV),RSIZEN
+      INTEGER OPENFU,IHEAD,OPENFF
+      EXTERNAL OPENFU,IHEAD,OPENFF
+      TEMPF = ' '
+      TEMPF = LIBNAM(INDEX(FTYPE,'A'))
+      IOK = OPENFF(IDEVD,TEMPF)
+      IF(IOK.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening division lookup file')
+        CALL ERROM(KBOUT,TEMPF)
+        RETURN
+      END IF
+      IOK = 2
+      NDIV = 0
+C use division as additions to idevd
+ 10   READ(IDEVD,1000,ERR=200,END=100)IDIV,TEMPF
+ 1000 FORMAT(I6,1X,A)
+      IF((IDIV.GT.0).AND.(NDIV.LT.MAXDIV)) THEN
+        NDIV = NDIV + 1
+        DIVDEV(NDIV) = IDEVD + IDIV
+        IF(OPENFF(DIVDEV(NDIV),TEMPF).NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening division file')
+          CALL ERROM(KBOUT,TEMPF)
+          RETURN
+        END IF
+        GO TO 10
+      END IF
+      IOK = 5
+      RETURN
+ 100  CONTINUE
+      CLOSE(UNIT=IDEVD)
+      IOK = 3
+      TEMPF = LIBNAM(INDEX(FTYPE,'B'))
+      IF(OPENFU(IDEVEN,TEMPF).NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening entryname index')
+        CALL ERROM(KBOUT,TEMPF)
+        RETURN
+      END IF
+      IOK = 5
+      IF(IHEAD(IDEVEN,BARRAY,NRECEN,RSIZEN).NE.0) THEN
+        CALL ERROM(KBOUT,'Error reading entryname index header')
+        CALL ERROM(KBOUT,TEMPF)
+        RETURN
+      END IF
+      IOK = 0
+      RETURN
+ 200  CONTINUE
+      CALL ERROM(KBOUT,'Error reading division file')
+      IOK = 4
+      END
+      SUBROUTINE CDROML(JOB,ENAME,ENAMEL,
+     +IDEVEN,RSIZEN,NRECEN,IDEVNL,SEQ,IDSEQ,
+     +DIVDEV,NDIV,ICREC,IFIN,TITLE,KBOUT,LTYPE,IOK)
+      CHARACTER ENAME*(*),ENAMEL*(*),BARRAY*80,TITLE*(*)
+      CHARACTER SEQ(IDSEQ),LTYPE,TIT*5
+      INTEGER DIVCOD,ANNOFF,SEQOFF,RSIZEN,DIVDEV(NDIV)
+      INTEGER CDSEQ,CDANNT,GNEXTN,CODSEQ,GENSEQ
+      EXTERNAL IENAME
+      EXTERNAL CDSEQ,CDANNT,GNEXTN,CODSEQ,GENSEQ
+C
+C read lib data from cdrom format
+C
+C all files are already open
+C the entry names files is poised on the first entry, when we come in
+C first time. Records are numbered 0 to nrecen.
+C
+C job = 0 process whole library, entry name by entryname
+C       1 process a list of entry names
+C       -1 process all but a list of entry names
+C       2 get a single entry name
+C
+C error returns
+C iok = 0 ok
+C       -1 not found
+C       -2 error
+C       -3 end
+C
+C  LTYPE = A means embl, B means codata, C means GenBank
+C
+      IF (ICREC.EQ.NRECEN) THEN
+        IOK = -3
+        CALL ERROM(KBOUT,'End of library reached')
+        RETURN
+      END IF
+      IF(LTYPE.EQ.'A') THEN
+        TIT = 'DE   '
+      ELSE IF(LTYPE.EQ.'B') THEN
+        TIT = 'TITLE'
+      ELSE IF(LTYPE.EQ.'C') THEN
+        TIT = 'DEFIN'
+      END IF
+      IF(JOB.EQ.0) THEN
+C
+C get next sequence in entry name index
+C
+C get offsets and divcode (error as for getrs)
+C
+        IOK = GNEXTN(IDEVEN,RSIZEN,ENAME,ANNOFF,SEQOFF,DIVCOD,
+     +  BARRAY,ICREC)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading entryname index')
+          RETURN
+        END IF
+C
+C read seq, preceded by title
+C
+        IOK = CDANNT(DIVDEV(DIVCOD),ANNOFF,BARRAY,TIT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading title')
+          RETURN
+        END IF
+        TITLE = BARRAY
+        IOK = 99
+        IF(LTYPE.EQ.'A') THEN
+          IOK = CDSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'B') THEN
+          IOK = CODSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'C') THEN
+          IOK = GENSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        END IF
+        IF(IOK.EQ.0) RETURN
+        IF(IOK.EQ.2) THEN
+          WRITE(KBOUT,*)
+     +    'Only first ',IDSEQ,' characters of sequence read'
+          IOK = 0
+          RETURN
+        END IF
+        CALL ERROM(KBOUT,'Error reading sequence')
+        RETURN
+      ELSE IF(JOB.EQ.2) THEN
+C
+C get entry name
+C
+C
+C get offsets and divcode
+C
+        IOK = IENAME(IDEVEN,NRECEN,RSIZEN,ENAME,ANNOFF,SEQOFF,
+     +  DIVCOD,BARRAY)
+        IF(IOK.NE.0) THEN
+          WRITE(KBOUT,*)ENAME,' not found'
+          RETURN
+        END IF
+C
+C read seq, preceded by title
+C
+        IOK = CDANNT(DIVDEV(DIVCOD),ANNOFF,BARRAY,TIT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading title')
+          RETURN
+        END IF
+        TITLE = BARRAY
+        IOK = 99
+        IF(LTYPE.EQ.'A') THEN
+          IOK = CDSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'B') THEN
+          IOK = CODSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'C') THEN
+          IOK = GENSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        END IF
+        IF(IOK.EQ.0) RETURN
+        IF(IOK.EQ.2) THEN
+          WRITE(KBOUT,*)
+     +    'Only first ',IDSEQ,' characters of sequence read'
+          IOK = 0
+          RETURN
+        END IF
+        CALL ERROM(KBOUT,'Error reading sequence')
+        RETURN
+      ELSE IF(JOB.EQ.1) THEN
+C
+C get next sequence in entry name list
+C
+ 90      CONTINUE
+         READ(IDEVNL,1000,ERR=100,END=200)ENAME
+C
+C get offsets and divcode
+C
+        IOK = IENAME(IDEVEN,NRECEN,RSIZEN,ENAME,ANNOFF,SEQOFF,
+     +  DIVCOD,BARRAY)
+        IF(IOK.NE.0) THEN
+          WRITE(KBOUT,*)ENAME,' not found'
+          GO TO 90
+        END IF
+C
+C read seq, preceded by title
+C
+        IOK = CDANNT(DIVDEV(DIVCOD),ANNOFF,BARRAY,TIT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading title')
+          RETURN
+        END IF
+        TITLE = BARRAY
+        IOK = 99
+        IF(LTYPE.EQ.'A') THEN
+          IOK = CDSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'B') THEN
+          IOK = CODSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'C') THEN
+          IOK = GENSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        END IF
+        IF(IOK.EQ.0) RETURN
+        IF(IOK.EQ.2) THEN
+          WRITE(KBOUT,*)
+     +    'Only first ',IDSEQ,' characters of sequence read'
+          IOK = 0
+          RETURN
+        END IF
+        CALL ERROM(KBOUT,'Error reading sequence')
+        RETURN
+ 100    CONTINUE
+        IOK = -3
+        CALL ERROM(KBOUT,'Error reading entrynames list')
+        RETURN
+ 200    CONTINUE
+        IOK = -1
+        RETURN
+      ELSE IF(JOB.EQ.-1) THEN
+C
+C get offsets and divcode
+C
+ 250    CONTINUE
+        IOK = GNEXTN(IDEVEN,RSIZEN,ENAME,ANNOFF,SEQOFF,DIVCOD,
+     +  BARRAY,ICREC)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading entryname index')
+          RETURN
+        END IF
+C
+C is it the same as the one on the list?
+C
+        IF(ENAMEL.EQ.ENAME) THEN
+          IF(IFIN.EQ.0) THEN
+            READ(IDEVNL,1000,ERR=300,END=400)ENAMEL
+ 1000       FORMAT(A)
+          END IF
+          GO TO 250
+        END IF
+C
+C read seq, preceded by title
+C
+        IOK = CDANNT(DIVDEV(DIVCOD),ANNOFF,BARRAY,TIT)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error reading title')
+          RETURN
+        END IF
+        TITLE = BARRAY
+        IOK = 99
+        IF(LTYPE.EQ.'A') THEN
+          IOK = CDSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'B') THEN
+          IOK = CODSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        ELSE IF(LTYPE.EQ.'C') THEN
+          IOK = GENSEQ(DIVDEV(DIVCOD),SEQOFF,SEQ,IDSEQ)
+        END IF
+        IF(IOK.EQ.0) RETURN
+        IF(IOK.EQ.2) THEN
+          WRITE(KBOUT,*)
+     +    'Only first ',IDSEQ,' characters of sequence read'
+          IOK = 0
+          RETURN
+        END IF
+        CALL ERROM(KBOUT,'Error reading sequence')
+        RETURN
+ 300    CONTINUE
+        IOK = -3
+        CALL ERROM(KBOUT,'Error reading entrynames list')
+        RETURN
+ 400    CONTINUE
+C
+C end of exclude list reached, so include the rest
+C
+        IFIN = 1
+        ENAMEL = ' '
+        GO TO 250
+      END IF
+      END
+      INTEGER FUNCTION IENAME(IDEVEN,NREC,RSIZE,
+     +ENAME,ANNOFF,SEQOFF,DIVCOD,BARRAY)
+      INTEGER ANNOFF,SEQOFF,DIVCOD,GETRS,NREC,RSIZE,SEEKRS
+      INTEGER UB,B1,BEND
+      INTEGER*2 STAR2
+      CHARACTER BARRAY*(*),ENAME*(*)
+      EXTERNAL GETRS,SEEKRS
+      B1 = 0
+      BEND = NREC
+      LB = B1
+      UB = BEND
+ 20   CONTINUE
+      IF(UB.LT.LB) THEN
+        IENAME = -1
+        RETURN
+      END IF
+      IREC = (LB+UB)/2
+      IBYTE = 301 + RSIZE*IREC
+      IENAME = GETRS(IDEVEN,BARRAY,20,IBYTE)
+      IF(IENAME.NE.0) RETURN
+      IF(ENAME.LT.BARRAY(1:10)) THEN
+        UB = IREC - 1
+      ELSE IF(ENAME.GT.BARRAY(1:10)) THEN
+        LB = IREC + 1
+      ELSE
+        CALL BSW4(BARRAY(11:),ANNOFF)
+        CALL BSW4(BARRAY(15:),SEQOFF)
+        CALL BSW2(BARRAY(19:),STAR2)
+        DIVCOD = STAR2
+        RETURN
+      END IF
+      GO TO 20
+      END
+      INTEGER FUNCTION CDSEQ(IDEV,SEQOFF,SEQ,IDSEQ)
+C seek to seq and return it
+      CHARACTER SEQ(IDSEQ),LT
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      INTEGER SEQOFF
+      INTEGER SEEKRS
+      EXTERNAL SEEKRS
+      I = SEQOFF - 5
+      CDSEQ = SEEKRS(IDEV,I)
+      IF(CDSEQ.NE.0) RETURN
+      NSEQ = 0
+ 10   CONTINUE
+      LSEQ = MIN(NSEQ+60,IDSEQ)
+      READ(IDEV,1000,ERR=100,END=200)LT,(SEQ(K),K=NSEQ+1,LSEQ)
+ 1000 FORMAT(A,4X,6(10A1,1X))
+      IF(LT.EQ.'/') THEN
+        CDSEQ = 0
+C
+C seq must have ended on previous line
+C so look for carriage return
+C
+        DO 20 I=NSEQ-59,NSEQ
+          IF(SEQ(I).EQ.CR) THEN
+            J = I-1
+            GO TO 21
+          END IF
+ 20     CONTINUE
+        J = NSEQ
+ 21     CONTINUE
+C
+C now look for a space
+C
+        DO 30 I=NSEQ-59,J
+          IF(SEQ(I).EQ.' ') THEN
+            IDSEQ = I-1
+            RETURN
+          END IF
+ 30     CONTINUE
+        IDSEQ = J
+        RETURN
+      END IF
+      NSEQ = LSEQ
+      IF(NSEQ.LT.IDSEQ) GO TO 10
+      CDSEQ = 2
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR IN CDSEQ'
+      CDSEQ = -2
+      RETURN
+ 200  CONTINUE
+      WRITE(*,*)'END IN CDSEQ'
+      CDSEQ = -1
+      END
+      INTEGER FUNCTION GENSEQ(IDEV,SEQOFF,SEQ,IDSEQ)
+C seek to seq and return it
+      CHARACTER SEQ(IDSEQ),LT
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      INTEGER SEQOFF
+      INTEGER SEEKRS
+      EXTERNAL SEEKRS
+C
+C seek to beginning of line containing first seq data
+C
+      I = SEQOFF - 10
+      GENSEQ = SEEKRS(IDEV,I)
+      IF(GENSEQ.NE.0) RETURN
+      NSEQ = 0
+ 10   CONTINUE
+      LSEQ = MIN(NSEQ+60,IDSEQ)
+      READ(IDEV,1000,ERR=100,END=200)LT,(SEQ(K),K=NSEQ+1,LSEQ)
+ 1000 FORMAT(A,9X,6(10A1,1X))
+      IF(LT.EQ.'/') THEN
+        GENSEQ = 0
+C
+C seq must have ended on previous line
+C so look for carriage return
+C
+        DO 20 I=NSEQ-59,NSEQ
+          IF(SEQ(I).EQ.CR) THEN
+            J = I-1
+            GO TO 21
+          END IF
+ 20     CONTINUE
+        J = NSEQ
+ 21     CONTINUE
+C
+C now look for a space
+C
+        DO 30 I=NSEQ-59,J
+          IF(SEQ(I).EQ.' ') THEN
+            IDSEQ = I-1
+            RETURN
+          END IF
+ 30     CONTINUE
+        IDSEQ = J
+        RETURN
+      END IF
+      NSEQ = LSEQ
+      IF(NSEQ.LT.IDSEQ) GO TO 10
+      GENSEQ = 2
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR IN GENSEQ'
+      GENSEQ = -2
+      RETURN
+ 200  CONTINUE
+      WRITE(*,*)'END IN GENSEQ'
+      GENSEQ = -1
+      END
+      INTEGER FUNCTION CODSEQ(IDEV,SEQOFF,SEQ,IDSEQ)
+C seek to seq and return it
+      CHARACTER SEQ(IDSEQ),LT
+      CHARACTER CR
+      PARAMETER (CR=CHAR(13))
+      INTEGER SEQOFF
+      INTEGER SEEKRS
+      EXTERNAL SEEKRS
+C
+C seek to beginning of line containing first seq data
+C
+      I = SEQOFF - 8
+      CODSEQ = SEEKRS(IDEV,I)
+      IF(CODSEQ.NE.0) RETURN
+      NSEQ = 0
+ 10   CONTINUE
+      LSEQ = MIN(NSEQ+30,IDSEQ)
+      READ(IDEV,1000,ERR=100,END=200)LT,(SEQ(K),K=NSEQ+1,LSEQ)
+ 1000 FORMAT(A,7X,30(A1,1X))
+      IF(LT.EQ.'/') THEN
+        CODSEQ = 0
+C
+C seq must have ended on previous line
+C so look for carriage return
+C
+        DO 20 I=NSEQ-29,NSEQ
+          IF(SEQ(I).EQ.CR) THEN
+            J = I-1
+            GO TO 21
+          END IF
+ 20     CONTINUE
+        J = NSEQ
+ 21     CONTINUE
+C
+C now look for a space
+C
+        DO 30 I=NSEQ-29,J
+          IF(SEQ(I).EQ.' ') THEN
+            IDSEQ = I-1
+            RETURN
+          END IF
+ 30     CONTINUE
+        IDSEQ = J
+        RETURN
+      END IF
+      NSEQ = LSEQ
+      IF(NSEQ.LT.IDSEQ) GO TO 10
+      CODSEQ = 2
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'ERROR IN CODSEQ'
+      CODSEQ = -2
+      RETURN
+ 200  CONTINUE
+      WRITE(*,*)'END IN CODSEQ'
+      CODSEQ = -1
+      END
+      INTEGER FUNCTION CDANN(IDEV,ANNOFF,BARRAY,IDEVOT,KBIN,KBOUT,TERM)
+      CHARACTER BARRAY*(*),TERM*(*)
+      INTEGER ANNOFF
+      INTEGER SEEKRS,GETRSL
+      EXTERNAL SEEKRS,GETRSL
+      CDANN = SEEKRS(IDEV,ANNOFF)
+      IF(CDANN.NE.0) RETURN
+      JPAGE = 0
+ 10   CONTINUE
+      IOK = GETRSL(IDEV,BARRAY,80)
+      IF(IOK.LT.1) THEN
+        CDANN = IOK
+        RETURN
+      END IF
+      IF(IDEVOT.EQ.KBOUT) THEN
+        CALL PAGER(KBIN,KBOUT,JPAGE,JOK)
+        IF (JOK.NE.0) THEN
+          CDANN = 2
+          RETURN
+        END IF
+      END IF
+      IF (IDEVOT.EQ.KBOUT) THEN
+        WRITE(IDEVOT,1000)BARRAY(1:IOK)
+      ELSE
+        WRITE(IDEVOT,1001)BARRAY(1:IOK)
+      END IF
+      IF(BARRAY(1:LEN(TERM)).NE.TERM) GO TO 10
+      CDANN = 0
+ 1000 FORMAT(' ',A)
+ 1001 FORMAT(A)
+      END
+      INTEGER FUNCTION CDANNT(IDEV,ANNOFF,BARRAY,TIT)
+      CHARACTER BARRAY*(*),TIT*(*)
+      INTEGER ANNOFF
+      INTEGER SEEKRS,GETRSL
+      EXTERNAL SEEKRS,GETRSL
+      CDANNT = SEEKRS(IDEV,ANNOFF)
+      IF(CDANNT.NE.0) RETURN
+ 10   CONTINUE
+      IOK = GETRSL(IDEV,BARRAY,80)
+      IF(IOK.LT.1) THEN
+        CDANNT = IOK
+        RETURN
+      END IF
+      IF(BARRAY(1:LEN(TIT)).NE.TIT) GO TO 10
+      CDANNT = 0
+      IF(IOK.LT.80) BARRAY(IOK+1:80) = ' '
+      END
+      INTEGER FUNCTION RDANUM(IDEVAN,ANUMTF,ANUMHF,IDEVEN,
+     +NRECEN,RSIZEN,ACNUM,BARRAY,ENAME,IDEVOT,KBOUT)
+      CHARACTER ANUMTF*(*),ANUMHF*(*),ACNUM*(*),BARRAY*(*)
+      CHARACTER ENAME*(*)
+      INTEGER OPENFU,ACNUMP,GETRS,ACNUMQ
+      INTEGER ENAMEP,RSIZAN,RSIZEN
+      EXTERNAL IHEAD,OPENFU,GETRS,IANUM
+C start with accession number
+C sent an accesion number acnum, return list of entry names
+C
+C open acnum.trg, 
+C read until acnum found, get accession number pointer ACNUMP, number of hits
+C NHITS
+C close acnum.trg
+C open acnum.hit, seek to ACNUMP
+C read a record, write to screen, for each of NHITS
+C close acnum.hit
+C
+C open accession number target file
+C
+C      WRITE(*,*)'OPENING'
+C      WRITE(*,*)ANUMTF
+      RDANUM = OPENFU(IDEVAN,ANUMTF)
+      IF(RDANUM.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening target file')
+        CALL ERROM(KBOUT,ANUMTF)
+        RETURN
+       END IF
+C      WRITE(*,*)'OPEN'
+C
+C read its header
+C
+C      WRITE(*,*)'READING'
+      RDANUM = IHEAD(IDEVAN,BARRAY,NRECAN,RSIZAN)
+      IF(RDANUM.NE.0) RETURN
+C      WRITE(*,*)'NRECAN,RSIZAN',NRECAN,RSIZAN
+C      WRITE(*,*)'READ'
+C
+C get the number of hits and the record number of the first hit
+C
+      RDANUM = IANUM(IDEVAN,ACNUM,NRECAN,RSIZAN,
+     +       NHITS,ACNUMP,BARRAY)
+      CLOSE(UNIT=IDEVAN)
+C      WRITE(*,*)'NHITS,acnump',NHITS,ACNUMP
+      IF(NHITS.LT.1) THEN
+        WRITE(IDEVOT,*)ACNUM,' not found'
+        RETURN
+      END IF
+      IF(RDANUM.NE.0) RETURN
+C
+C open the accession number hit file
+C
+C      WRITE(*,*)'READING'
+      RDANUM = OPENFU(IDEVAN,ANUMHF)
+      IF(RDANUM.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening hit file')
+        CALL ERROM(KBOUT,ANUMHF)
+        RETURN
+      END IF
+C      WRITE(*,*)'READ'
+C
+C the records are 4 bytes and follow a 300 byte header
+C
+      DO 10 I = 1,NHITS
+        ACNUMQ = 301 + (ACNUMP-1)*4
+        RDANUM = GETRS(IDEVAN,BARRAY,4,ACNUMQ)
+        IF(RDANUM.NE.0) RETURN
+        CALL BSW4(BARRAY(1:),ENAMEP)
+C        WRITE(*,*)'ENAMEP',ENAMEP
+C
+C the entry name file has records of size rsizen and the usual 300 byte header
+C
+        ENAMEP = 301 + (ENAMEP-1)*RSIZEN
+        RDANUM = GETRS(IDEVEN,BARRAY,RSIZEN,ENAMEP)
+        IF(RDANUM.NE.0) RETURN
+        ENAME = BARRAY(1:10)
+        WRITE(IDEVOT,*)'Entry name ',ENAME
+C bug fix 18-10-91: added next line
+        ACNUMP = ACNUMP + 1
+ 10   CONTINUE
+      CLOSE(UNIT=IDEVAN)
+      END
+      INTEGER FUNCTION IANUM(IDEV,ACNUM,NREC,RSIZE,
+     +NHITS,ACNUMP,BARRAY)
+      INTEGER GETRS,NREC,RSIZE,ACNUMP
+      INTEGER UB,B1,BEND
+      CHARACTER BARRAY*(*),ACNUM*(*)
+      EXTERNAL GETRS
+C given an accession number acnum, does binary search.
+C returns the number of hits NHITS and the record number of the first hit ACNUMP
+      NHITS = 0
+      B1 = 0
+      BEND = NREC
+ 10   CONTINUE
+      LB = B1
+      UB = BEND
+ 20   CONTINUE
+      IF(UB.LT.LB) THEN
+        IANUM = 0
+        RETURN
+      END IF
+      IREC = (LB+UB)/2
+      IBYTE = 301 + RSIZE*IREC
+C      WRITE(*,*)IBYTE
+      IANUM = GETRS(IDEV,BARRAY,RSIZE,IBYTE)
+C      WRITE(*,*)BARRAY(9:18)
+      IF(IANUM.NE.0) RETURN
+      IF(ACNUM.LT.BARRAY(9:18)) THEN
+        UB = IREC - 1
+      ELSE IF(ACNUM.GT.BARRAY(9:18)) THEN
+        LB = IREC + 1
+      ELSE
+        CALL BSW4(BARRAY(1:),NHITS)
+        CALL BSW4(BARRAY(5:),ACNUMP)
+        RETURN
+      END IF
+      GO TO 20
+      END
+      INTEGER FUNCTION RDKWRD(IDEVKW,KWRDTF,KWRDHF,IDEVEN,
+     +NRECEN,RSIZEN,KEYS,SS,SE,NKEYS,BARRAY,ENAME,IDEVOT,
+     +KBIN,KBOUT,BRIEFF,BITAR0,BITAR1,MAXWRD)
+      CHARACTER KWRDTF*(*),KWRDHF*(*),KEYS*(*),BARRAY*(*)
+      CHARACTER ENAME*(*),BRIEFF*(*)
+      INTEGER OPENFU,GETRS
+      INTEGER ENAMEP,RSIZKW,RSIZEN
+      INTEGER BITAR0(0:MAXWRD),BITAR1(0:MAXWRD)
+      INTEGER NHITS(5),KWRDP(5),SS(NKEYS),SE(NKEYS)
+      EXTERNAL IHEAD,OPENFU,GETRS,IKWRD
+      RDKWRD = 1
+C      MAXWRD = (NRECEN-1)/32
+C start with keywords in KEYS
+C sent nkeys keywords, return list of entry names
+C
+C open keyword.trg, 
+C read until strings found, get keyword record pointer KWRDP,
+C  number of hits NHITS
+C close KEYWORD.trg
+C open KEYWORD.hit, seek to KWRDP
+C read a record, write to screen, for each of NHITS
+C close KEYWORD.hit
+      IFOUND = 0
+C
+C open KEYWORD target file
+C
+      RDKWRD = OPENFU(IDEVKW,KWRDTF)
+      IF(RDKWRD.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening target file')
+        CALL ERROM(KBOUT,KWRDTF)
+        RETURN
+      END IF
+C
+C read its header
+C
+      RDKWRD = IHEAD(IDEVKW,BARRAY,NRECKW,RSIZKW)
+      IF(RDKWRD.NE.0) THEN
+        CLOSE(IDEVKW)
+        RETURN
+      END IF
+C
+C get start record number and nhits for each string
+C
+      DO 10 I=1,NKEYS
+C
+C get the number of hits and the record number of the first hit
+C
+        LS = SE(I) - SS(I) + 1
+        RDKWRD = IKWRD(IDEVKW,KEYS(SS(I):),LS,NRECKW,RSIZKW,
+     +       NHITS(I),KWRDP(I),BARRAY)
+        IF(NHITS(I).LT.1) THEN
+          WRITE(IDEVOT,*)KEYS(SS(I):SE(I)),' not found'
+          CLOSE(UNIT=IDEVKW)
+          RETURN
+        END IF
+        IF(RDKWRD.NE.0) THEN
+          CLOSE(UNIT=IDEVKW)
+          RETURN
+        END IF
+        WRITE(IDEVOT,*)KEYS(SS(I):SE(I)),' hits',NHITS(I)
+ 10     CONTINUE
+        CLOSE(UNIT=IDEVKW)
+        CALL BPAUSE(KBIN,KBOUT,IEX)
+        IF(IEX.NE.0)RETURN
+C
+C open the KEYWORD hit file
+C
+      RDKWRD = OPENFU(IDEVKW,KWRDHF)
+      IF(RDKWRD.NE.0) THEN
+        CALL ERROM(KBOUT,'Error opening hit file')
+        CALL ERROM(KBOUT,KWRDHF)
+        RETURN
+      END IF
+C
+C the records are 4 bytes and follow a 300 byte header
+C
+C For each hit for keyword 1 find its entryname record pointer and
+C then find the entryname record pointers for each hit for each of
+C the other keywords.
+C
+C kwrdp(1) is the number of the record in the hit file that contains
+C the first hit for the first keyword
+C      WRITE(*,*)'NHITS(1)',NHITS(1)
+C
+C  new stuff: clear bit arrays, set bits in bitar0 for first word
+C  set bits in bitar1 for other words, then and/or them with array0
+      CALL CABIT(BITAR0,MAXWRD)
+      KWPR = KWRDP(1)
+      DO 40 I = 1,NHITS(1)
+        KWP = 301 + (KWPR-1)*4
+C add to header and then read the entrynum of the first matching entry
+        RDKWRD = GETRS(IDEVKW,BARRAY,4,KWP)
+        IF(RDKWRD.NE.0) THEN
+          CLOSE(UNIT=IDEVKW)
+          RETURN
+        END IF
+        CALL BSW4(BARRAY(1:),ENAMEP)
+C enamep is the record number of the first matching entry
+C so set the corresponding bit
+        CALL SABIT(BITAR0,MAXWRD,ENAMEP)
+        KWPR = KWPR + 1
+ 40     CONTINUE
+C
+C now do the other keywords
+C
+        DO 25 J  = 2,NKEYS
+          CALL CABIT(BITAR1,MAXWRD)
+          KWPR = KWRDP(J)
+          DO 30 I=1,NHITS(J)
+            KWP = 301 + (KWPR-1)*4
+            RDKWRD = GETRS(IDEVKW,BARRAY,4,KWP)
+            IF(RDKWRD.NE.0) THEN
+              CLOSE(UNIT=IDEVKW)
+              RETURN
+            END IF
+            CALL BSW4(BARRAY(1:),ENAMEP)
+C enamep is the record number of the first matching entry
+C so set the corresponding bit
+            CALL SABIT(BITAR1,MAXWRD,ENAMEP)
+            KWPR = KWPR + 1
+ 30     CONTINUE
+C
+C and the bits with bitar0
+C
+          CALL AAWORD(BITAR0,BITAR1,MAXWRD)   
+ 25     CONTINUE
+        CLOSE(UNIT=IDEVKW)
+C
+C open brief directory file
+C
+        IOK = OPENFU(IDEVKW,BRIEFF)
+        IF(IOK.NE.0) THEN
+          CALL ERROM(KBOUT,'Error opening brief directory file')
+          CALL ERROM(KBOUT,BRIEFF)
+          RETURN
+        END IF
+        CALL TBITAL(BITAR0,MAXWRD,IDEVKW,IDEVOT,KBIN,KBOUT,ENAME)
+        CLOSE(UNIT=IDEVKW)
+      END
+      SUBROUTINE TBITAO(BITAR,MAXENT,IDEVEN,BARRAY,RSIZEN,IDEVOT,
+     +ENAME)
+C old routine
+      CHARACTER BARRAY*(*)
+      CHARACTER ENAME*(*)
+      LOGICAL TBITRS
+      INTEGER GETRS,BITAR(0:MAXENT)
+      INTEGER ENAMEP,RSIZEN
+      EXTERNAL GETRS,TBITRS
+C
+C the entry name file has records of size rsizen and the usual 300 byte header
+C
+      IFOUND = 0
+      DO 10 I =0,MAXENT
+        DO 5 J =0,31
+          IF(TBITRS(BITAR(I),J)) THEN
+            ENAMEP = I*32 + 1 + MOD(J,32)
+            ENAMEP = 301 + (ENAMEP-1)*RSIZEN
+            IOK = GETRS(IDEVEN,BARRAY,RSIZEN,ENAMEP)
+            IF(IOK.NE.0) THEN
+              RETURN
+            END IF
+            ENAME = BARRAY(1:10)
+            IFOUND = IFOUND + 1
+            WRITE(IDEVOT,*)ENAME,IFOUND
+          END IF
+ 5      CONTINUE
+ 10   CONTINUE
+      END
+      SUBROUTINE TBITAL(BITAR,MAXENT,IDEVIN,IDEVOT,KBIN,KBOUT,ENAME)
+      LOGICAL TBITRS
+      INTEGER GETRS,BITAR(0:MAXENT)
+      EXTERNAL GETRS,TBITRS,NOTIRL
+      CHARACTER LINE*80,NAME*10,ACNUM*10,SLEN*4,LLINE*104,ENAME*(*)
+      EQUIVALENCE (LLINE(1:1),NAME),(LLINE(25:104),LINE)
+      EQUIVALENCE(LLINE(21:24),SLEN),(LLINE(11:20),ACNUM)
+C
+C the entry name file has records of size 104 and the usual 300 byte header
+C
+      IFOUND = 0
+      DO 4 I =0,MAXENT
+        IF (BITAR(I).NE.0) THEN
+          DO 3 J =0,31
+            IF(TBITRS(BITAR(I),J)) IFOUND = IFOUND + 1
+ 3        CONTINUE
+        END IF
+ 4    CONTINUE
+      WRITE(KBOUT,1004)IFOUND
+ 1004 FORMAT(' Different entries ',I7)
+      IFOUND = 0
+      JPAGE = 0
+      DO 10 I =0,MAXENT
+        IF (BITAR(I).NE.0) THEN
+          DO 5 J =0,31
+            IF(TBITRS(BITAR(I),J)) THEN
+              IBYTE = I*32 + 1 + MOD(J,32)
+              IBYTE = 301 + (IBYTE-1)*104
+              IOK = GETRS(IDEVIN,LLINE,104,IBYTE)
+              IF(IOK.NE.0) THEN
+                RETURN
+              END IF
+              CALL BSW4(SLEN,LENS)
+              IF(IDEVOT.EQ.KBOUT)THEN
+                CALL PAGER(KBIN,KBOUT,JPAGE,IOK)
+                IF (IOK.NE.0) RETURN
+              END IF
+              ENAME = NAME
+              LINEND = NOTIRL(LINE,80,' ')
+              IF(IDEVOT.EQ.KBOUT) THEN
+                WRITE(IDEVOT,1002,ERR=20)
+     +          NAME,ACNUM,LENS,LINE(1:LINEND)
+              ELSE
+                WRITE(IDEVOT,1001,ERR=20)
+     +          NAME,ACNUM,LENS,LINE(1:LINEND)
+ 1001           FORMAT(A,' ',A,' ',I7,' ',A)
+ 1002         FORMAT(' ',A,' ',A,' ',I7,' ',A)
+              END IF
+              IFOUND = IFOUND + 1
+            END IF
+ 5        CONTINUE
+        END IF
+ 10   CONTINUE
+ 20   CONTINUE
+      WRITE(KBOUT,1003)IFOUND
+1003  FORMAT(' ',I7,' different entries found')
+      CALL BPAUSE(KBIN,KBOUT,IEX)
+      END
+      INTEGER FUNCTION IKWRD(IDEV,KEYW,LS,NREC,RSIZE,
+     +NHITS,KWRDP,BARRAY)
+      INTEGER GETRS,NREC,RSIZE
+      INTEGER UB,B1,BEND
+      CHARACTER BARRAY*(*),KEYW*(*)
+      EXTERNAL GETRS,IKWRDB
+C given a KEYWORD KEYW OF LENGTH LSS, does binary search.
+C returns the number of hits NHITS and the record number of the first hit KWRDP
+C Note i make the assumption that the hit file will be ordered
+C so that say all words beginning with sugar follow one another
+C without interuption and in order. If not use the following:
+C set string length to lss + 1 so we know the string found
+C ends with a space: we wont find sugars if we search for sugar
+C      LS = LSS + 1
+      NHITS = 0
+      B1 = 0
+      BEND = NREC
+ 10   CONTINUE
+      LB = B1
+      UB = BEND
+ 20   CONTINUE
+      IF(UB.LT.LB) THEN
+        IKWRD = 0
+        RETURN
+      END IF
+      IREC = (LB+UB)/2
+      IBYTE = 301 + RSIZE*(IREC-1)
+C      WRITE(*,*)'IBYTE',IBYTE
+      IKWRD = GETRS(IDEV,BARRAY,RSIZE,IBYTE)
+      IF(IKWRD.NE.0) RETURN
+      IF(KEYW(1:LS).LT.BARRAY(9:8+LS)) THEN
+        UB = IREC - 1
+      ELSE IF(KEYW(1:LS).GT.BARRAY(9:8+LS)) THEN
+        LB = IREC + 1
+      ELSE
+C a hit but it may not be the first! so look back until not a hit
+C then go forward accumulating a count of hits until a nonmatch is found
+C next 2 lines instead of call to ikwrdb if assumption mentioned above
+C does not hold
+C        CALL BSW4(BARRAY(5:),KWRDP)
+C        CALL BSW4(BARRAY(1:),NHITS)
+        IKWRD =  IKWRDB(IDEV,KEYW,LS,NREC,RSIZE,
+     +  NHITS,KWRDP,BARRAY,IREC)
+C        WRITE(*,*)'NHITS,KWRDP,IKWRD',NHITS,KWRDP,IKWRD
+        RETURN
+      END IF
+      GO TO 20
+      END
+      INTEGER FUNCTION IKWRDB(IDEV,KEYW,LS,NREC,RSIZE,
+     +NHITS,KWRDP,BARRAY,IREC)
+      INTEGER GETRS,NREC,RSIZE
+      CHARACTER BARRAY*(*),KEYW*(*)
+      EXTERNAL GETRS
+C given a KEYWORD KEYW OF LENGTH LS,  and a start record with
+C a hit but it may not be the first! so look back until not a hit
+C then go forward accumulating a count of hits until a nonmatch is found
+      NHITS = 0
+ 10   CONTINUE
+C      WRITE(*,*)'IREC IN IKWRDB',IREC
+C SHOULD THIS BE 0 OR 1?
+      IREC = IREC - 1
+      IF(IREC.LT.0) THEN
+        IKWRDB = 0
+        RETURN
+      END IF
+      IBYTE = 301 + RSIZE*(IREC-1)
+      IKWRDB = GETRS(IDEV,BARRAY,RSIZE,IBYTE)
+      IF(IKWRDB.NE.0) RETURN
+      IF(KEYW(1:LS).EQ.BARRAY(9:8+LS)) GO TO 10
+C found first occurrence
+      IREC = IREC + 1
+      IBYTE = 301 + RSIZE*(IREC-1)
+      IKWRDB = GETRS(IDEV,BARRAY,RSIZE,IBYTE)
+      IF(IKWRDB.NE.0) RETURN
+      CALL BSW4(BARRAY(5:),KWRDP)
+ 20   CONTINUE
+      CALL BSW4(BARRAY(1:),JHITS)
+C      WRITE(*,*)'JHITS,NHITS',JHITS,NHITS
+      NHITS = NHITS + JHITS
+      IREC = IREC + 1
+C      WRITE(*,*)'IREC IN IKWRDB 2',IREC
+      IF(IREC.GT.NREC) THEN
+        IKWRDB = 0
+        RETURN
+      END IF
+      IBYTE = 301 + RSIZE*(IREC-1)
+C      WRITE(*,*)'IBYTE IN IKWRDB',IBYTE
+      IKWRDB = GETRS(IDEV,BARRAY,RSIZE,IBYTE)
+      IF(IKWRDB.NE.0) RETURN
+      IF(KEYW(1:LS).EQ.BARRAY(9:8+LS)) GO TO 20
+C      WRITE(*,*)'OFF END IN KWRDB'
+      END
+      INTEGER FUNCTION GNEXTN(IDEVEN,RSIZE,
+     +ENAME,ANNOFF,SEQOFF,DIVCOD,BARRAY,ICREC)
+C get next entry name from entrynam.idx leaving pointer at next name
+      INTEGER ANNOFF,SEQOFF,DIVCOD,GETRS,RSIZE
+      INTEGER*2 STAR2
+      CHARACTER BARRAY*(*),ENAME*(*)
+      EXTERNAL GETRS
+C this routine numbers records 0 to nrec-1 (unlike all the others!!!!)
+C sipl,nipl,pipl would need irec = 1 if we make this consistent and use irec-1
+      IBYTE = 301 + RSIZE * ICREC
+      GNEXTN = GETRS(IDEVEN,BARRAY,RSIZE,IBYTE)
+      IF(GNEXTN.NE.0) RETURN
+      CALL BSW4(BARRAY(11:),ANNOFF)
+      CALL BSW4(BARRAY(15:),SEQOFF)
+      CALL BSW2(BARRAY(19:),STAR2)
+      DIVCOD = STAR2
+      ENAME = BARRAY(1:10)
+      ICREC = ICREC + 1
+      END
+      INTEGER FUNCTION ENTFN(IDEV,DIVCOD,LIBF,BARRAY)
+      CHARACTER LIBF*(*),BARRAY*(*)
+      INTEGER DIVCOD
+      ENFTN = 1
+      REWIND IDEV
+ 10   CONTINUE
+      READ(IDEV,1000,ERR=100,END=200)I,BARRAY
+ 1000 FORMAT(I6,1X,A)
+      IF(I.NE.DIVCOD) GO TO 10
+      LIBF = BARRAY
+      ENTFN = 0
+      RETURN
+ 100  CONTINUE
+      ENTFN = -2
+      RETURN
+ 200  CONTINUE
+      ENTFN = -1
+      END
+      INTEGER FUNCTION OPENFU(IDEV,NAME)
+      CHARACTER NAME*(*)
+      CLOSE(UNIT=IDEV)
+      LRECL = 1
+      CALL OPENRS(IDEV,NAME,IOK,LRECL,11)
+      OPENFU = 0
+      IF(IOK.EQ.0) RETURN
+      OPENFU = 1
+      END
+      INTEGER FUNCTION OPENFF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      CLOSE(UNIT=IDEV)
+      CALL OPENRS(IDEV,NAME,IOK,LRECL,2)
+      OPENFF = 0
+      IF(IOK.EQ.0) RETURN
+      OPENFF = 1
+      END
+      INTEGER FUNCTION IHEAD(IDEVEN,BARRAY,NREC,RSIZE)
+      INTEGER NREC,RSIZE,GETRS
+      INTEGER*2 IRSIZE
+      CHARACTER BARRAY*(*)
+      EXTERNAL GETRS
+      IHEAD = GETRS(IDEVEN,BARRAY,10,1)
+      IF(IHEAD.NE.0) RETURN
+C      CALL BSW4(BARRAY(1:),FSIZE)
+      CALL BSW4(BARRAY(5:),NREC)
+      CALL BSW2(BARRAY(9:),IRSIZE)
+      RSIZE = IRSIZE
+      END
+C below are routines for handling the embl and genbank feature tables
+C Not all cases are dealt with, but those for automatic translation to
+C protein are done for all the straightforward cases I can think of.
+C Their robustness has yet to be tested!
+      INTEGER FUNCTION EMBLFT(IDEV,KBOUT,KEYWRD,OPRATR,ISTRAN,
+     +POSNS,MAXPOS,NPOS,NOBJ)
+C interpret ft looking for keyword, operator and strand
+C return list of end points, number of positions and
+C number of objects
+      CHARACTER LINEIN*80,KEYWRD*(*),OPRATR*(*)
+      INTEGER POSNS(MAXPOS)
+      INTEGER FTIN1,FTIN4,FTIN5,FTIN6,FTIN7,FTIN8
+      INTEGER FTIN9,FTIN10,FTIN11,RDFTLN
+      LOGICAL NUMBER,STRNGM,STRNGN,WSTRND
+      EXTERNAL FTIN1,FTIN4,FTIN5,FTIN6,FTIN7,FTIN8
+      EXTERNAL FTIN9,FTIN10,FTIN11,STRNGM,STRNGN,WSTRND,RDFTLN
+      NPOS = 0
+      NPOS1 = 0
+C Length of opratr
+      LENOP = INDEX(OPRATR,' ') - 1
+C Find start
+10    CONTINUE
+      EMBLFT = RDFTLN(IDEV,LINEIN)
+      IF(EMBLFT.LT.-1) RETURN
+      IF(EMBLFT.NE.-1) GO TO 10
+C For embl allow extra FH line
+      IF(LINEIN(1:2).EQ.'FH') EMBLFT = RDFTLN(IDEV,LINEIN)
+      IF(EMBLFT.NE.-1) RETURN
+C Into ft
+20    CONTINUE
+      IERR = RDFTLN(IDEV,LINEIN)
+C End of ft ?
+      IF(IERR.EQ.-2) GO TO 400
+C Error ?
+      IF(IERR.NE.0) RETURN
+25    CONTINUE
+      IF(.NOT.STRNGM(LINEIN(6:),KEYWRD)) GO TO 20
+      IF(LENOP.GT.0) THEN
+        CALL CCASE(LINEIN,1)
+        J = INDEX(LINEIN,OPRATR(1:LENOP))
+        IF(J.EQ.0) GO TO 20
+      END IF
+C      IF(.NOT.STRNGN(LINEIN(22:),OPRATR)) GO TO 20
+      IF(.NOT.WSTRND(LINEIN,ISTRAN)) GO TO 20
+C
+C Only get here if keyword, operator and strand are correct
+C
+      IF(NUMBER(LINEIN(22:22))) THEN
+        EMBLFT = FTIN1(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(LINEIN(22:22).EQ.'<') THEN
+        EMBLFT = FTIN1(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(LINEIN(22:22).EQ.'>') THEN
+        EMBLFT = FTIN1(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(LINEIN(22:22).EQ.'"') THEN
+        WRITE(KBOUT,*)'Not handled'
+        EMBLFT = FTIN4(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+C
+C note order of elses important
+C
+      ELSE IF(STRNGM(LINEIN(22:37),'JOIN(COMPLEMENT(')) THEN
+        EMBLFT = FTIN8(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(STRNGM(LINEIN(22:26),'JOIN(')) THEN
+        EMBLFT = FTIN5(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(STRNGM(LINEIN(22:38),'ORDER(COMPLEMENT(')) THEN
+        EMBLFT = FTIN10(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(STRNGM(LINEIN(22:27),'ORDER(')) THEN
+        EMBLFT = FTIN9(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(STRNGM(LINEIN(22:38),'COMPLEMENT(ORDER(')) THEN
+        EMBLFT = FTIN11(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(STRNGM(LINEIN(22:32),'COMPLEMENT(')) THEN
+        EMBLFT = FTIN6(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE IF(LINEIN(22:22).EQ.'/') THEN
+        WRITE(KBOUT,*)'Not handled'
+        EMBLFT = FTIN7(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      ELSE 
+         WRITE(KBOUT,*)'line of unknown type'
+         EMBLFT = 9
+      END IF
+      IF(EMBLFT.NE.0) THEN
+C
+C -1 = start of ft
+C -2 = end of entry or file
+C -3 = error reading file
+C -4 = error getting numbers from a line
+C >0 = error in ftinN where error=N
+C -9 = overflow of storage space
+C
+        WRITE(KBOUT,*)'Error no',EMBLFT
+        WRITE(KBOUT,*)LINEIN
+        RETURN
+      END IF
+      IF(NPOS.GT.NPOS1) THEN
+        IF(NPOS+1.GT.MAXPOS) THEN
+          WRITE(KBOUT,*)'Number of endpoints and objects exceeds',MAXPOS
+          EMBLFT = -9
+          RETURN
+        END IF
+        POSNS(NPOS+1) = NPOS - NPOS1
+        NPOS = NPOS + 1
+        NPOS1 = NPOS
+        NOBJ = NOBJ + 1
+      END IF
+      GO TO 25
+400   CONTINUE
+      EMBLFT = 0
+      WRITE(KBOUT,*)'End of entry'
+      END
+      LOGICAL FUNCTION WSTRND(LINEIN,WANTED)
+C Return true if line is for wanted strand
+C Assume complementary strands will contain string "complement"
+C and that other strand wont
+C Plus strand wanted = 0, minus strand wanted = 1
+C (also works if wanted =2, meaning either strand)
+      CHARACTER LINEIN*(*)
+      INTEGER WANTED
+      WSTRND = .FALSE.
+      CALL CCASE(LINEIN,1)
+      I = INDEX(LINEIN,'COMPLEMENT')
+      IF((I.EQ.0).AND.(WANTED.EQ.1)) RETURN
+      IF((I.NE.0).AND.(WANTED.EQ.0)) RETURN
+      WSTRND = .TRUE.
+      END
+      LOGICAL FUNCTION STRNGN(S1,S2)
+C return true if the first L chars of s1 and s2 match
+C L is the length of the shortest string or the position of
+C the first space char -1. Note if one is only spaces they will match.
+      CHARACTER S1*(*),S2*(*),TUPPER
+      EXTERNAL TUPPER
+      STRNGN = .FALSE.
+      L = MIN(LEN(S1),LEN(S2))
+      I = MIN(INDEX(S1,' '),INDEX(S2,' '))
+      L = MIN(L,MAX(0,I-1))
+      DO 10 I = 1,L
+        IF(TUPPER(S1(I:I)).NE.(TUPPER(S2(I:I)))) RETURN
+ 10     CONTINUE
+      STRNGN = .TRUE.
+      END
+      INTEGER FUNCTION RDFTLN(IDEV,LINEIN)
+      CHARACTER LINEIN*(*)
+C
+C -1 = start of ft
+C -2 = end of entry or file
+C -3 = error reading file
+C
+C read ft lines
+      READ(IDEV,1000,END=200,ERR=300)LINEIN
+C check for start of data
+      IF((LINEIN(1:2).EQ.'FH').OR.(LINEIN(1:8).EQ.'FEATURES')) THEN
+        RDFTLN = -1
+        RETURN
+      END IF
+C check for end of data
+      IF((LINEIN(1:2).EQ.'SQ').OR.(LINEIN(1:2).EQ.'//')) GO TO 200
+C      WRITE(*,*)LINEIN
+      RDFTLN = 0
+      RETURN
+1000  FORMAT(A)
+200   CONTINUE
+      RDFTLN = -2
+      RETURN
+300   CONTINUE
+      RDFTLN = -3
+      END
+      INTEGER FUNCTION FTIN1(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      EXTERNAL FTPAIR,RDFTLN
+C  467
+C  340..565
+C  102.110
+C  123^124
+C  145^177
+C  find start n1, end n2 of first number
+C  find start n3, end n3 of second number
+C  decode into posns
+      FTIN1 = 1
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      N1 = 22
+      FTIN1 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN1.NE.0) RETURN
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+      FTIN1 = RDFTLN(IDEV,LINEIN)
+      END
+      INTEGER FUNCTION FTIN6(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER
+      EXTERNAL FTPAIR,NUMBER,RDFTLN
+C  only deal with simplest case:
+C  complement(123..345)
+      FTIN6 = 6
+      N1 = 33
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN6 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN6.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+        RETURN
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be  ')'
+      IF(LINEIN(N1+1:N1+1).EQ.')') THEN
+        FTIN6 = RDFTLN(IDEV,LINEIN)
+        RETURN
+      ELSE
+        IERR = RDFTLN(IDEV,LINEIN)
+        FTIN6 = 6
+      END IF
+      FTIN6 = 6
+      END
+      INTEGER FUNCTION FTIN5(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER
+      EXTERNAL FTPAIR,NUMBER,RDFTLN
+C  only deal with simplest cases:
+C  join(123..345,456..666)
+C  join(123..345,456..666,
+C  789..899)
+C  assume continues lines end with ", "
+C  always terminate at first closing bracket
+      FTIN5 = 5
+      N1 = 27
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN5 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN5.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+        RETURN
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be ',' or ')'
+      IF(LINEIN(N1+1:N1+1).EQ.')') THEN
+        FTIN5 = RDFTLN(IDEV,LINEIN)
+        RETURN
+      ELSE IF(LINEIN(N1+1:N1+1).EQ.',') THEN
+        IF(NUMBER(LINEIN(N1+2:N1+2))) THEN
+          N1 = N1 + 2
+C if ,number
+          GO TO 10
+        ELSE
+          IF(LINEIN(N1+2:N1+2).NE.' ') THEN
+            IERR = RDFTLN(IDEV,LINEIN)
+C if ,notspace
+            FTIN5 = 5
+            RETURN
+          END IF
+        END IF
+      END IF
+      IERR = RDFTLN(IDEV,LINEIN)
+      IF(IERR.NE.0) THEN
+        FTIN5  = IERR
+        RETURN
+      END IF
+      IF(LINEIN(6:8).EQ.'  ') THEN
+        N1 = 22
+        GO TO 10
+      END IF
+      FTIN5 = 5
+      END
+      INTEGER FUNCTION FTIN8(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER,STRNGM
+      EXTERNAL FTPAIR,NUMBER,STRNGM,RDFTLN
+C  only deal with simplest cases:
+C  join(complement(123..345),complement(59..67))
+C  join(complement(123..345),complement(59..67),
+C  complement(44..55))
+C  assume continues lines end with ", "
+C  always terminate at first closing bracket
+C  remember start
+      NPOS1 = NPOS
+      FTIN8 = 8
+      N1 = 38
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN8 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN8.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be '),COMPLEMENT' or ')) '
+      IF(LINEIN(N1+1:N1+2).EQ.'))') THEN
+        FTIN8 = RDFTLN(IDEV,LINEIN)
+        CALL REVFTP(POSNS(NPOS1+1),NPOS-NPOS1)
+        RETURN
+      ELSE IF(STRNGM(LINEIN(N1+1:N1+12),'),COMPLEMENT')) THEN
+          N1 = N1 + 14
+C if ,number
+          GO TO 10
+        ELSE
+          IF(LINEIN(N1+1:N1+3).NE.'), ') THEN
+            IERR = RDFTLN(IDEV,LINEIN)
+C if ,notspace
+            FTIN8 = 8
+            RETURN
+C          END IF
+        END IF
+      END IF
+      IERR = RDFTLN(IDEV,LINEIN)
+      IF(IERR.NE.0) THEN
+        FTIN8 = IERR
+        RETURN
+      END IF
+      IF(LINEIN(6:8).EQ.'  ') THEN
+        N1 = 33
+        GO TO 10
+      END IF
+      FTIN8 = 8
+      END
+      SUBROUTINE REVFTP(POSNS,NPOS)
+C Reverses posns for join(complement()) and order(complement())
+      INTEGER POSNS(NPOS)
+      DO 10 I = 1,NPOS-1,2
+        J = POSNS(I)
+        POSNS(I) = POSNS(I+1)
+        POSNS(I+1) = J
+ 10     CONTINUE
+      DO 20 I = 1,NPOS/2
+        J = POSNS(I)
+        POSNS(I) = POSNS(NPOS-I+1)
+        POSNS(NPOS-I+1) = J
+ 20     CONTINUE
+      END
+      INTEGER FUNCTION FTIN9(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER
+      EXTERNAL FTPAIR,NUMBER,RDFTLN
+C  only deal with simplest cases:
+C  order(123..345,456..666)
+C  order(123..345,456..666,
+C  789..899)
+C  assume continues lines end with ", "
+C  always terminate at first closing bracket
+      FTIN9 = 9
+      N1 = 28
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN9 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN9.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+        RETURN
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be ',' or ')'
+      IF(LINEIN(N1+1:N1+1).EQ.')') THEN
+        FTIN9 = RDFTLN(IDEV,LINEIN)
+        RETURN
+      ELSE IF(LINEIN(N1+1:N1+1).EQ.',') THEN
+        IF(NUMBER(LINEIN(N1+2:N1+2))) THEN
+          N1 = N1 + 2
+C if ,number
+          GO TO 10
+        ELSE
+          IF(LINEIN(N1+2:N1+2).NE.' ') THEN
+            IERR = RDFTLN(IDEV,LINEIN)
+C if ,notspace
+            FTIN9 = 9
+            RETURN
+          END IF
+        END IF
+      END IF
+      IERR = RDFTLN(IDEV,LINEIN)
+      IF(IERR.NE.0) THEN
+        FTIN9 = IERR
+        RETURN
+      END IF
+      IF(LINEIN(6:8).EQ.'  ') THEN
+        N1 = 22
+        GO TO 10
+      END IF
+      FTIN9 = 9
+      END
+      INTEGER FUNCTION FTIN10(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER,STRNGM
+      EXTERNAL FTPAIR,NUMBER,STRNGM,RDFTLN
+C  only deal with simplest cases:
+C  order(complement(123..345),complement(59..67))
+C  order(complement(123..345),complement(59..67),
+C  complement(44..55))
+C  assume continues lines end with ", "
+C  always terminate at first closing bracket
+C  remember start
+      NPOS1 = NPOS
+      FTIN10 = 10
+      N1 = 39
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN10 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN10.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+        RETURN
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be '),COMPLEMENT' or ')) '
+      IF(LINEIN(N1+1:N1+2).EQ.'))') THEN
+        FTIN10 = RDFTLN(IDEV,LINEIN)
+        CALL REVFTP(POSNS(NPOS1+1),NPOS-NPOS1)
+        RETURN
+      ELSE IF(STRNGM(LINEIN(N1+1:N1+12),'),COMPLEMENT')) THEN
+          N1 = N1 + 13
+C if ,number
+          GO TO 10
+        ELSE
+          IF(LINEIN(N1+1:N1+3).NE.'), ') THEN
+            IERR = RDFTLN(IDEV,LINEIN)
+C if ,notspace
+            FTIN10 = 10
+            RETURN
+C          END IF
+        END IF
+      END IF
+      IERR = RDFTLN(IDEV,LINEIN)
+      IF(IERR.NE.0) THEN
+        FTIN10 = IERR
+        RETURN
+      END IF
+      IF(LINEIN(6:8).EQ.'  ') THEN
+        N1 = 33
+        GO TO 10
+      END IF
+      FTIN10 = 10
+      END
+      INTEGER FUNCTION FTIN11(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS)
+      CHARACTER LINEIN*(*)
+      INTEGER FTPAIR,RDFTLN
+      LOGICAL NUMBER,STRNGM
+      EXTERNAL FTPAIR,NUMBER,STRNGM,RDFTLN
+C  only deal with simplest cases:
+C  complement(order(123..345,456..666))
+C  complement(order(123..345,456..666,
+C  789..899))
+C  assume continues lines end with ", "
+C  always terminate at first closing bracket
+      FTIN11 = 11
+      N1 = 39
+10    CONTINUE
+      IF(NPOS+2.GT.MAXPOS) RETURN
+      FTIN11 = FTPAIR(LINEIN,N1,J1,J2)
+      IF(FTIN11.NE.0) THEN
+        IERR = RDFTLN(IDEV,LINEIN)
+        RETURN
+      END IF
+      POSNS(NPOS+1) = J1
+      POSNS(NPOS+2) = J2
+      NPOS = NPOS + 2
+C  allow next position to be ',' or ')) '
+      IF(LINEIN(N1+1:N1+2).EQ.'))') THEN
+        FTIN11 = RDFTLN(IDEV,LINEIN)
+        RETURN
+      ELSE IF(LINEIN(N1+1:N1+1).EQ.',') THEN
+        IF(NUMBER(LINEIN(N1+2:N1+2))) THEN
+          N1 = N1 + 2
+C if ,number
+          GO TO 10
+        ELSE
+          IF(LINEIN(N1+2:N1+2).NE.' ') THEN
+            IERR = RDFTLN(IDEV,LINEIN)
+C if ,notspace
+            FTIN11 = 11
+            RETURN
+          END IF
+        END IF
+      END IF
+      IERR = RDFTLN(IDEV,LINEIN)
+      IF(IERR.NE.0) THEN
+        FTIN11 = IERR
+        RETURN
+      END IF
+      IF(LINEIN(6:8).EQ.'  ') THEN
+        N1 = 22
+        GO TO 10
+      END IF
+      FTIN11 = 11
+      END
+      INTEGER FUNCTION FTPAIR(LINEIN,N1,J1,J2)
+      CHARACTER LINEIN*(*)
+      INTEGER FIRSTN
+      EXTERNAL LASTN,FIRSTN
+      FTPAIR = -4
+C  find first and last digits in each of two numbers (n1,n2 and n3,n4)
+C  then encode them
+      J = INDEX(LINEIN(N1:),'.')
+C  Could be cases 1,4,5 but for now error
+      IF(J.EQ.0) RETURN
+      J = FIRSTN(LINEIN(N1:),1)
+      IF(J.EQ.0) RETURN
+      N1 = N1 + J - 1
+      J = INDEX(LINEIN(N1:),'.')
+      N2 = N1 + J - 2
+      K = N2 + 2
+      J = INDEX(LINEIN(K:),'.')
+      IF(J.EQ.0) RETURN
+      IF(J.NE.1) RETURN
+      N3 = J + K
+      J = FIRSTN(LINEIN(N3:),1)
+      IF(J.EQ.0) RETURN
+      N3 = N3 + J - 1
+      K = N3
+      J = LASTN(LINEIN,K)
+      IF(J.EQ.0) RETURN
+      N4 = J
+      READ(LINEIN(N1:N2),1000,ERR=100)J1
+1000  FORMAT(I7)
+      READ(LINEIN(N3:N4),1000,ERR=100)J2
+      FTPAIR = 0
+      N1 = N4
+      RETURN
+100   CONTINUE
+C      WRITE(*,*)'Scream: ftpair'
+      END
+      INTEGER FUNCTION FTIN7(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS),RDFTLN
+      CHARACTER LINEIN*(*)
+      EXTERNAL RDFTLN
+      FTIN7 = RDFTLN(IDEV,LINEIN)
+      END
+      INTEGER FUNCTION FTIN4(IDEV,POSNS,MAXPOS,NPOS,LINEIN)
+      INTEGER POSNS(MAXPOS),RDFTLN
+      CHARACTER LINEIN*(*)
+      EXTERNAL RDFTLN
+      FTIN4 = RDFTLN(IDEV,LINEIN)
+      END
diff --git a/src/staden/sethelp.c b/src/staden/sethelp.c
new file mode 100644
index 0000000..d1a9482
--- /dev/null
+++ b/src/staden/sethelp.c
@@ -0,0 +1,88 @@
+/*
+ * sethelp: Generates the help index and menu data file.
+ */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <errno.h>
+#include <sys/param.h>
+
+void format_file(FILE *hp, FILE *mp);
+
+int main(int argc, char *argv[]) {
+    char helpfile[MAXPATHLEN], menufile[MAXPATHLEN];
+    FILE *hp, *mp;
+    int i, err = 0;
+
+    if (argc < 2) {
+	fprintf(stderr, "Usage:%s program ...\n", *argv);
+	return 1;
+    }
+
+    for (i=1; i<argc; i++) {
+	sprintf(helpfile,"%s_help", argv[i]);
+	sprintf(menufile,"%s_menu", argv[i]);
+
+	if ((hp = fopen(helpfile,"r")) == NULL) {
+	    perror(helpfile);
+	    err = 1;
+	    continue;
+	}
+	if ((mp = fopen(menufile,"w")) == NULL) {
+	    perror(menufile);
+	    err = 1;
+	    continue;
+	}
+	
+	format_file(hp,mp);
+	
+	fclose(hp);
+	fclose(mp);
+    }
+
+    return err;
+}
+
+void format_file(FILE *hp, FILE *mp) {
+    char buf[256], optname[256], progmode[10], *bufptr;
+    int optnum, currentpos, menunum[10], mindex, mindext;
+    long offset;
+    int first = 1, lines, m, l;
+    
+    while (fgets(buf, sizeof(buf), hp) != NULL) {
+	if (buf[1] == '@') {
+	    /* print out info remembered from last block */
+	    if (first)
+		first = 0;
+	    else {
+		l = strlen(progmode);
+		for (m = 0; m<l; m++) {
+		    mindext = mindex;
+		    for (;mindext;) {
+			fprintf(mp, "%d %ld %d %d %c %s\n", optnum,
+				menunum[--mindext], offset, lines, progmode[m],
+				optname);
+		    }
+		}
+	    }
+	    
+	    lines = 0;
+	    /* read option number and 'program char' */
+	    bufptr = buf;
+	    sscanf(bufptr, " @%d. %s %n", 
+		   &optnum, progmode, &currentpos);
+
+	    /* read the list of menus for this option */
+	    mindex = 0;
+	    do bufptr += currentpos;
+	    while (sscanf(bufptr,"%d%n", &menunum[mindex++], &currentpos));
+	    mindex--;
+	    
+	    /* and the name of the option and offset */
+	    sscanf(bufptr, " @ %[^\n]", optname);
+	    offset = ftell(hp);
+	}
+	lines++;
+    }
+}
diff --git a/src/staden/sethelp.f b/src/staden/sethelp.f
new file mode 100644
index 0000000..c38d51f
--- /dev/null
+++ b/src/staden/sethelp.f
@@ -0,0 +1,98 @@
+C   SETHELP
+C   AUTHOR RODGER STADEN
+C   CREATES A HELP FILE FROM A FILE OF DOCUMENTATION THAT HAS THE
+C   FOLLOWING CHARACTERISTICS
+C   1) EACH OPTION HAS A NUMBER BETWEEN BOTOPT AND TOPOPT
+C   2) THE OPTION DESCRIPTIONS ARE HEADED BY THEIR OPTION NUMBER
+C   3) OPTION HEADERS ARE FORMATTED IN THE FOLLOWING WAY
+C       A) ^@IJ. WHERE ^=SPACE, IJ=A 1 OR 2 DIGIT NUMBER, .=THE RIGHT
+C          DELIMITER OF THE NUMBER. THE NUMBER STARTS IN COLUMN 3.
+C   4) EACH LINE <81 CHARACTERS IN LENGTH
+C   5) THE BEST WAY TO FORMAT THE ORIGINAL TEXT FILE IS BY USING RUNOFF
+C   UNITS:         KEYBOARD INPUT       KBIN=5,
+C                  KEYBOARD OUTPUT      KBOUT=6,
+C                  TEXT FILE INPUT      DEVNOS(1)=1,
+C                  HELP FILE OUTPUT     DEVNOS(2)=2,
+C                  HELP POINTERS        DEVNOS(3)=3
+      SUBROUTINE FMAIN()
+      INTEGER BOTOPT,TOPOPT,BOT,TOP,DEVNOS(3)
+      PARAMETER (BOTOPT=-100,TOPOPT=300)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      CHARACTER LINE*80
+      CHARACTER FILNAM*40
+        EXTERNAL IFROMC
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,3)
+      WRITE(KBOUT,*)'  SETHELP V2.1  AUTHOR: RODGER STADEN'
+        WRITE(KBOUT,2000)BOTOPT,TOPOPT
+2000    FORMAT(' MINIMUM AND MAXIMUM ALLOWED OPTION NUMBERS ARE',2I6)
+        WRITE(KBOUT,1000)
+1000  FORMAT(' ORIGINAL TEXT FILE NAME=',$)
+      CALL OPENF(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT)
+      IF(IOK.NE.0)STOP
+      WRITE(KBOUT,1001)
+1001  FORMAT(' RANDOM ACCESS TEXT FILE NAME=',$)
+      READ(KBIN,1002)FILNAM
+1002  FORMAT(A)
+      FILNAM(30:30)=' '
+      CALL OPENRS(DEVNOS(2),FILNAM,IOK,20,3)
+      IF(IOK.NE.0)GO TO 100
+      WRITE(KBOUT,1003)
+1003  FORMAT(' RECORD POINTER FILE=',$)
+      CALL OPENF(DEVNOS(3),FILNAM,1,IOK,KBIN,KBOUT)
+      IF(IOK.NE.0)STOP
+C   SET SILLY RANGES
+      BOT=99999
+      TOP=-9999
+      DO 10 I=BOTOPT,TOPOPT
+      HELPS(I)=1
+      HELPE(I)=1
+10    CONTINUE
+C   SET NOPT, THE CURRENT OPTION NUMBER TO A FLAG VALUE TO MARK FIRST OPTION
+      NOPT=-9999
+C   IREC1 IS THE CURRENT LINE NUMBER
+C   IREC2 IS THE LINE NUMBER THAT THE LAST OPTION STARTS ON
+      IREC1=0
+      IREC2=1
+30    CONTINUE
+      IREC1=IREC1+1
+      READ(DEVNOS(1),1002,END=40)LINE
+      WRITE(DEVNOS(2),REC=IREC1)LINE
+C   IS THIS THE FIRST LINE OF A NEW OPTION?
+      IF(LINE(2:2).EQ.'@')THEN
+      NCHAR=2
+      IF(LINE(4:4).EQ.'.')NCHAR=1
+C   LAST OPTION (NOPT) STARTS ON LINE IREC2 AND FINISHIES ON IREC1-1
+C   IF FIRST OPTION SKIP
+      IF(NOPT.EQ.-9999)GO TO 35
+      HELPS(NOPT)=IREC2
+      HELPE(NOPT)=IREC1-1
+35    CONTINUE
+C   POINT TO START OF NEXT OPTION
+      IREC2=IREC1
+C  GET NEXT OPTION NUMBER
+C       NOPT=IFROMC(LINE(3:3),NCHAR,KBOUT)+ABS(BOT)
+      NOPT=IFROMC(LINE(3:3),NCHAR,KBOUT)
+      BOT=MIN(BOT,NOPT)
+      TOP=MAX(TOP,NOPT)
+        WRITE(KBOUT,2001)NOPT
+2001    FORMAT(' OPTION NUMBER=',I6)
+      END IF
+      GO TO 30
+40    CONTINUE
+C   DO LAST OPTION
+      HELPS(NOPT)=IREC2
+      HELPE(NOPT)=IREC1-1
+      DO 50 I=BOT,TOP
+        WRITE(KBOUT,2002)I,HELPS(I),HELPE(I)
+2002    FORMAT(' OPTION',I6,' STARTS AT RECORD',I6,' AND ENDS AT',I6)
+50    WRITE(DEVNOS(3),1004)I,HELPS(I),HELPE(I)
+1004  FORMAT(3I6)
+      CLOSE(UNIT=DEVNOS(1))
+      CLOSE(UNIT=DEVNOS(3))
+      CLOSE(UNIT=DEVNOS(2))
+      STOP
+100   CONTINUE
+      WRITE(KBOUT,9999)
+9999  FORMAT(' ERROR WHEN TRYING TO OPEN DIRECT ACCESS FILE')
+      STOP
+      END
diff --git a/src/staden/sip.f b/src/staden/sip.f
new file mode 100644
index 0000000..aa1bcb6
--- /dev/null
+++ b/src/staden/sip.f
@@ -0,0 +1,963 @@
+C   SIP (similarity investigation program)
+C   copyright Rodger Staden 
+C  23-7-92 Added facility to list exact matches
+C  16-7-92 changed equivalencing of arrays and upped maxseq to 100000 and
+C          maxcom to 50000. Also allowed quickscan  and identities plot to 
+C          ignore main diagonal.
+C   Jan 25th 1990 changes to allow switches between protein and dna sequences
+C   8-2-90 Changed to give xhair access when alignm chosen with D
+C   16-3-90 removed escape after bpause
+C    5-4-90 Changed getdev to redir and all that implies (see redir)
+C   12-06-90 Changed all occurrences of lh to lf
+C   21-06-90 Added calls to showfu
+C    6-07-90 If no seq read forced retry (used to exit)
+C    9-7-90 renamed menu routine
+C   11-12-90 Changed handling of pir libraries. New string and paramter
+C   25-4-91  Fixed bug in cfsq 
+C            statment
+C     13-04-91 removed initial CLEARV
+C  3-7-91 set namlen = 60
+C  25-2-92 changed call to rdseq
+C         AUTHOR RODGER STADEN MRC LMB, CAMBRIDGE
+C   DIMENSION VARIABLES USED:
+C             MXSPAN= MAXIMUM SPAN
+C             MAXSEQ= MAXIMUM SEQUENCE LENGTH
+C             MAXCOM= MAXIMUM LENGTH OF SEQUENCE THAT CAN BE COMPARED IN ONE
+C             MXCOMP= MAXIMUM SPAN + MAXCOM
+C             MXSPSP= MAXIMUM SPAN + MAXIMUM SEQUENCE - 1
+C             MSPO2= MAXIMUM SPAN / 2
+C             MSPO21= MAXIMUM SPAN / 2 +1
+C             IDIMV = ACTUAL SEQUENCE LENGTH (VERTICAL)
+C             IDIMVA= ACTIVE SEQUENCE
+C             IDIMVP= ACTIVE SEQUENCE LENGTH + SPAN
+C             IDPOLY= DIMENSION OF POLYNOMIAL, PERC, SCORE ARRAYS
+C             ISXMAX= MAXIMUM SCREEN DIMENSION IN X
+C   POINTERS
+C             ISV1, ISV2, IDIMVA = START, END, LENGTH OF ACTIVE REGION VERTICAL
+C             ISH1, ISH2, IDIMHA = START, END, LENGTH OF ACTIVE REGION HORIZ
+C             IDIMVP=IDIMVA+SPAN
+C             IDIMHP=IDIMHA+SPAN
+C             IW1, IW2, IDIML1 = START, END, LENGTH FOR WRITING BOTH SEQ (VERT)
+C             IW3, IW4, IDIML  = START, END, LENGTH FOR WRITING BOTH SEQ (HORIZ)
+C             IHX,IHY XHAIR POSITION IN SEQ UNITS
+C             IWX3, IWX4, = IHX+-SPAN/2 FOR SAVING XHAIR POSITION
+C             IWX1, IWX2, = IHY+-SPAN/2 FOR SAVING XHAIR POSITION
+C
+C
+      SUBROUTINE FMAIN()
+      PARAMETER (MAXDEV=9)
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) HELPF,POINTF,FILOUT,FILEH,FILEV,FILEM,FILEP
+      CHARACTER*(NAMLEN) FILNAM
+      CHARACTER*(NAMLEN) LIBLF
+      CHARACTER*(NAMLEN*2) CURFIL
+      PARAMETER (MAXPRM = 4)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      INTEGER BOTOPT,TOPOPT,DEVNOS(MAXDEV)
+      PARAMETER (MXSPAN=401,
+     +           MAXSEQ=100000,
+     +           MXSPSP=MXSPAN-1+MAXSEQ,
+     +           MAXCOM=50000,
+     +           MXCOMP=MAXCOM+MXSPAN-1,
+     +           MSPO2=MXSPAN/2,
+     +           MSPO21=MSPO2+1,
+     +           IDPOLY=4000,
+     +           BOTOPT=0,
+     +           TOPOPT=34,
+     +           MAXOPT=34,
+     +           MINMEN=-4,
+     +           HELPF='SIPHELP',
+     +           POINTF='SIPHPNT',
+     +           FILEM='SIPMARG',
+     +           FILEP='PROTMAT',
+     +           LIBLF='SEQUENCELIBRARIES')
+C   WARNING TO RS: MAX CONTINUATION LINES USED HERE!
+      PARAMETER (LCONSM = 75)
+C  LCONSM SET FOR MAX PROTEIN WORD LENGTH OF 3 (I.E. 3*(IDM-1))
+      PARAMETER (IDMM = 26)
+      PARAMETER (MAXWRD = 16384)
+      INTEGER SEQV(MXCOMP),SEQH(MXCOMP),OPT
+      CHARACTER SEQVC(MAXSEQ),SEQHC(MAXSEQ)
+      CHARACTER NOROO,NOROE,VORH
+      CHARACTER CHRSET(IDMM),CHRDNA(5)
+      INTEGER LINEB(MXCOMP),LINEC(MXCOMP),LINEE(MXCOMP)
+      INTEGER MATRIX(IDMM,IDMM),MATDNA(25)
+      REAL PERC(IDPOLY),POLYA(IDPOLY),POLYB(IDPOLY)
+      INTEGER SCORE(IDPOLY)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      INTEGER MARG1(1),MARG2(1)
+C   NOTE THE MARG ARRAYS ARE FOR COMPATIBILITY WITH GETMRG
+      INTEGER WORDP(MAXWRD),CONSTS(0:LCONSM)
+C   ALIGNMENT PARAMETERS AND VARIABLES
+       INTEGER STKREC
+       PARAMETER (MAXSA = 5000,
+     +            STKREC = 11,
+     +            MAXSTK = MAXSA*3*STKREC)
+C   THE STACK SIZE ASSUMES A MAXIMUM SEQUENCE SIZE OF MAXSA
+C   AND THAT THE ESTIMATE OF 2.5 * SEQUENCE LENGTH FOR THE STACK
+C   IS CORRECT. 
+      INTEGER STACK(0:MAXSTK)
+      INTEGER WTS(0:IDMM,0:IDMM)
+      CHARACTER SEQ1A(MAXSA),SEQ2A(MAXSA)
+      INTEGER CC(0:MAXSA+1),DD(0:MAXSA+1)
+      INTEGER RR(0:MAXSA+1),SS(0:MAXSA+1)
+      INTEGER SOP(0:2*MAXSA)
+      INTEGER MAIND(IDMM),SEQS(MAXCOM),HIST(-MAXCOM:MAXCOM)
+      INTEGER POSN(MAXCOM)
+      PARAMETER (LENEQ = MAXCOM*4 + MXCOMP*3 + MAXWRD + 1)
+      INTEGER EQ1(LENEQ)
+      EQUIVALENCE (EQ1,HIST)
+      EQUIVALENCE (EQ1(2+2*MAXCOM),SEQS)
+      EQUIVALENCE (EQ1(2+3*MAXCOM),POSN)
+      EQUIVALENCE (EQ1(2+4*MAXCOM),LINEB)
+      EQUIVALENCE (EQ1(2+4*MAXCOM+MXCOMP),LINEC)
+      EQUIVALENCE (EQ1(2+4*MAXCOM+2*MXCOMP),LINEE)
+      EQUIVALENCE (EQ1(2+4*MAXCOM+3*MXCOMP),WORDP)
+      EQUIVALENCE (EQ1,STACK)
+      EQUIVALENCE (CHRSET,CHRDNA)
+C      EQUIVALENCE (STACK,HIST)
+C      EQUIVALENCE (STACK(1+2*MAXCOM),SEQS)
+C      EQUIVALENCE (STACK(1+3*MAXCOM),POSN)
+C      EQUIVALENCE (STACK(1+4*MAXCOM),LINEB)
+C      EQUIVALENCE (STACK(1+4*MAXCOM+MXCOMP),LINEC)
+C      EQUIVALENCE (STACK(1+4*MAXCOM+2*MXCOMP),LINEE)
+C      EQUIVALENCE (STACK(1+4*MAXCOM+3*MXCOMP),WORDP)
+C  NOTE 1+4*MAXCOM+3*MXCOMP MUST BE < MAXSA*STKREC*3
+      EQUIVALENCE (PERC,CC)
+      EQUIVALENCE (POLYA,DD)
+      EQUIVALENCE (POLYB,RR)
+      EQUIVALENCE (SCORE,SS)
+C  NOTE MAXSA SHOULD BE SIMILAR TO IDPOLY
+      EXTERNAL NORP
+      DATA MATDNA/
+     +1,0,0,0,0,
+     +0,1,0,0,0,
+     +0,0,1,0,0,
+     +0,0,0,1,0,
+     +0,0,0,0,0/
+      DATA CHRDNA/'T','C','A','G','-'/
+C   Initialise help
+      CALL INTHLP('sip', TOPOPT)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+C      I = 1+4*MAXCOM+3*MXCOMP
+C      J = MAXSA*STKREC*3
+C      IF(I.GT.J) THEN
+C        WRITE(KBOUT,*)'Equivalencing of arrays suspect'
+C        GO TO 9999
+C      END IF
+      IFORNO = 0
+      LIBNO = 2
+      IGORT = 0
+C   OPEN UNIT DEVNOS(3) FOR PLOTTING
+      CALL OPENGR(DEVNOS(3))
+      WRITE(KBOUT,1000)
+1000  FORMAT(
+     +' SIP (Similarity investigation program) V5.0 July 1992',/,
+     +' Author: Rodger Staden')
+      FILEV = ' '
+      FILEH = ' '
+C   GET HELP POINTERS
+      CALL SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,DEVNOS(4),KBOUT)
+      CALL INITGR(KBIN,KBOUT,HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+      CALL GETMRG(
+     +ISXMAX,ISYMAX,MARGL,MARGR,MARG1,MARG2,1,DEVNOS(1),FILEM)
+      LCONST = LCONSM
+      JSAME = 0
+      MARKI = 0
+      RNSD = 3.
+      IGAPH = 10
+      IGAPG = 10
+      MARGB=MARG1(1)
+      MARGT=MARG2(1)
+      IHX=0
+      IHY=0
+      IOK=0
+      IDMIN = 0
+      LENGTH = 11
+      LB=(LENGTH+1)/2
+      LF=LENGTH/2
+      IDEV=KBOUT
+      MOPT=0
+      IDIMH = 0
+      IDIMV = 0
+20    CONTINUE
+C 
+C      CALL SHOWFU(KBOUT,'Select horizontal sequence')
+      FILEH = ' '
+      IDEVLL = DEVNOS(5)
+      IDEVEN = DEVNOS(6)
+      IDEVAN = DEVNOS(7)
+      IDEVDL = DEVNOS(8)
+      IDEVLF = DEVNOS(9)
+      CALL RDSEQ(
+     +SEQHC,MAXSEQ,IDIMH,ISH1,ISH2,ISTARH,IENDH,IDIMHA,IDIMBH,
+     +DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +HELPS(3),HELPE(3),HELPF,DEVNOS(4),IDEV,IFORNO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBNO,LIBLF,HIST(-MAXCOM),MAXCOM,IOK)
+      FILEH = FILNAM
+      CURFIL = FILEV(1:1+INDEX(FILEV,' '))//FILEH
+      CALL SHOWFI(KBOUT,CURFIL)
+      IF((IOK.NE.0).OR.(IDIMH.LT.LENGTH)) GO TO 20
+      CALL REDEFD(IDIMH,ISH1,ISH2,MAXCOM,IDIMHA,ISTARH,IENDH,IDIMBH)
+      IDM = NORP(SEQHC,IDIMBH)
+      IF(IDMIN.NE.IDM)THEN
+        IF(IDM.EQ.26) THEN
+C   READ IN THE SCORE MATRIX (USUALLY MDM78)
+          CALL GETMAT(DEVNOS(1),FILEP,MATRIX,IDM,CHRSET,KBOUT,IOK)
+          IF(IOK.NE.0) GO TO 9999
+          L17 = 3
+          MINPER = 2
+        ELSE
+          L17 = 7
+          MINPER = 8
+          CALL FILLI(MATRIX,676,0)
+          CALL COPYI(MATDNA,MATRIX,25)
+        END IF
+        CALL INITLU(IDM)
+C   NEED TO KNOW LARGEST SCORE IN MATRIX FOR STATS
+        MATMAX=0
+        DO 7 I=1,IDM
+        DO 7 J=1,IDM
+          IF(MATMAX.LT.MATRIX(I,J))MATMAX=MATRIX(I,J)
+7       CONTINUE
+        MATMAX=MATMAX+1
+C   SET AVERAGE SCORE FOR DEFAULTS
+        AVSCOR=0.75
+        IF(IDM.EQ.26)AVSCOR=AVSCOR*16.
+C   GET INITIAL SCORE
+        MINPRO = INT(AVSCOR*LENGTH)
+      END IF
+      IDMIN = IDM
+C   GIVE COMPOSITION AS A CHECK
+      IF(IDIMBH.GT.0)THEN
+        IF(IDM.EQ.5)CALL BCOMP(SEQHC,IDIMBH,
+     +ISH1,ISH2,ISTARH,KBOUT)
+        IF(IDM.EQ.26)CALL MWCALC(SEQHC,
+     +IDIMBH,ISH1-ISTARH+1,ISH2-ISTARH+1,KBOUT,KBOUT)
+      END IF
+C   set pointers to sequence
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMH  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTARH = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQHC(1)
+C  ISH1   = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  ISH2   = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIMHA = ISH2-ISH11+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IENDH  = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMBH = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+********
+      IF((IOK.NE.0).OR.(IDIMHA.LT.LENGTH)) GO TO 9999
+      IDIMHP=IDIMHA+LENGTH
+      CALL CONNUM(SEQHC(ISH1-ISTARH+1),SEQH(MSPO21),IDIMHA)
+      CALL FILEDG(SEQHC,IDIMBH,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+     +CHRSET,IDM,MSPO2,ISTARH,IENDH)
+      NOROO='N'
+      NOROE='N'
+      IH1 = ISH1
+      I = MIN(MAXSA,IDIMHA) - 1
+      IH2 = IH1 + I
+      IF(OPT.EQ.3)GO TO 30
+C
+21    CONTINUE
+C      CALL SHOWFU(KBOUT,'Select vertical sequence')
+      FILEV = ' '
+      CALL RDSEQ(
+     +SEQVC,MAXSEQ,IDIMV,ISV1,ISV2,ISTARV,IENDV,IDIMVA,IDIMBV,
+     +DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +HELPS(3),HELPE(3),HELPF,DEVNOS(4),IDEV,IFORNO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBNO,LIBLF,HIST(-MAXCOM),MAXCOM,IOK)
+      FILEV = FILNAM
+      CURFIL = FILEV(1:1+INDEX(FILEV,' '))//FILEH
+      CALL SHOWFI(KBOUT,CURFIL)
+      IF((IOK.NE.0).OR.(IDIMV.LT.LENGTH)) GO TO 21
+      CALL REDEFD(IDIMV,ISV1,ISV2,MAXCOM,IDIMVA,ISTARV,IENDV,IDIMBV)
+C   GIVE COMPOSITION AS A CHECK
+      IDM = NORP(SEQVC,IDIMBV)
+      IF(IDM.NE.IDMIN) CALL INITLU(IDM)
+      IF(IDIMBV.GT.0)THEN
+        IF(IDM.EQ.5)CALL BCOMP(SEQVC,IDIMBV,
+     +ISV1,ISV2,ISTARV,KBOUT)
+        IF(IDM.EQ.26)CALL MWCALC(SEQVC,
+     +IDIMBV,ISV1-ISTARV+1,ISV2-ISTARV+1,KBOUT,KBOUT)
+      END IF
+C   set pointers to sequence
+********
+C  MAXSEQ = THE DIMENSION OF THE RAM BUFFER SEQ
+C  IDIMV  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTARV = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQHC(1)
+C  ISV1   = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  ISV2   = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIMVA = ISV2-ISV1+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IENDV  = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMBV = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+********
+C   change region
+C      IF(IDIMV.GT.0)THEN
+C      CALL REDEFA
+C     +(SEQVC,IDIMV,ISV1,ISV2,MAXCOM,IDIMVA,ISTARV,IENDV,IDIMBV,
+C     +DEVNOS(6),KBIN,KBOUT,
+C     +IOK,SEQW,LREC,NREC,
+C     +HELPS(4),HELPE(4),HELPF,DEVNOS(4),MACTYP,MACSHT,MACLNG)
+C      END IF
+*****************
+      IF((IOK.NE.0).OR.(IDIMVA.LT.LENGTH)) GO TO 9999
+**********************
+      IDIMVP=IDIMVA+LENGTH
+C   DECODE CHARACTERS INTO INTEGERS IN ACTIVE REGION
+      CALL CONNUM(SEQVC(ISV1-ISTARV+1),SEQV(MSPO21),IDIMVA)
+      CALL FILEDG(SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+     +CHRSET,IDM,MSPO2,ISTARV,IENDV)
+      NOROO='N'
+      NOROE='N'
+      IV1 = ISV1
+      I = MIN(MAXSA,IDIMVA) - 1
+      IV2 = IV1 + I
+      IF(OPT.EQ.3)GO TO 30
+1008  FORMAT(I6)
+      WRITE(KBOUT,*)' '
+      CALL DSTAT(FILEH,ISH1,ISH2,FILEV,ISV1,ISV2,
+     +  LENGTH,MINPRO,MINPER,KBOUT,JSAME,MARKI)
+30    CONTINUE
+C
+C  MENU AND OPTION SELECTION  <<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<
+C
+C
+      CALL BPAUSE(KBIN,KBOUT,IOK)
+C      IF(IOK.EQ.1) GO TO 9999
+C      CALL SHOWFU(KBOUT,' ')
+      CALL MENU(OPT,KOPT,MOPT,MAXOPT,MINMEN,KBIN,KBOUT,
+     +HELPS(0),HELPE(0),HELPF,DEVNOS(4))
+1010  FORMAT(I6)
+      IF(OPT.EQ.18) THEN
+C        CALL SHOWFU(KBOUT,'Plot proportional matches')
+        IF(KOPT.EQ.1) THEN
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        ISAME = 0
+        IF((JSAME.EQ.1).AND.(ISH1.EQ.ISV1)) ISAME = 1
+        IF(MARKI.EQ.1)THEN
+          CALL DIAPRI(SEQV(MSPO21-LB),SEQV(LF+MSPO21),
+     +    IDIMVP,SEQH(MSPO21-LB),SEQH(LF+MSPO21),
+     +    IDIMHP,MATRIX,IDM,LINEB,LINEC,LINEE,LENGTH,MINPRO,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KBOUT,ISAME)
+        ELSE
+          CALL DIAPRO(SEQV(MSPO21-LB),SEQV(LF+MSPO21),
+     +    IDIMVP,SEQH(MSPO21-LB),SEQH(LF+MSPO21),
+     +    IDIMHP,MATRIX,IDM,LINEB,LINEC,LINEE,LENGTH,MINPRO,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KBOUT,ISAME)
+        END IF
+        CALL VT100M
+        GO TO 30
+      END IF
+      IF(OPT.EQ.17) THEN
+C        CALL SHOWFU(KBOUT,'Plot identical matches')
+        IDO = 1
+        IF(KOPT.EQ.1) THEN
+          CALL DP22(IDM,MINPER,
+     +    KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+          IF(IOK.NE.0) GO TO 30
+          PROMPT(1) = 'Plot'
+          PROMPT(2) = 'List'
+          CALL RADION('Select output mode',PROMPT,2,IDO,
+     +    HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KBIN,KBOUT)
+          IF (IDO.LT.1) GO TO 30
+        END IF
+        L17A = MIN(MINPER,L17)
+        IDE = (IDM-1)**L17A
+        ISAME = 0
+        IF((JSAME.EQ.1).AND.(ISH1.EQ.ISV1)) ISAME = 1
+        IF (IDO.EQ.1) THEN
+          CALL CLEARV
+          CALL VECTOM
+          CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+          CALL CFSQ(SEQV(MSPO21),IDIMVA,SEQH(MSPO21),
+     +    IDIMHA,POSN,WORDP,IDE,IDM,CONSTS,LCONST,L17A,MINPER,
+     +    MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,KBOUT,ISAME)
+          CALL VT100M
+        ELSE
+          CALL CFSQL(SEQV(MSPO21),IDIMVA,SEQH(MSPO21),
+     +    IDIMHA,POSN,WORDP,IDE,IDM,CONSTS,LCONST,L17A,MINPER,
+     +    IDEV,KBOUT,ISAME,SEQVC(ISV1),HIST,MAXCOM)
+        END IF
+        GO TO 30
+      END IF
+C
+C   RESET PROPORTIONAL SCORE
+C
+      IF(OPT.EQ.21)THEN
+C        CALL SHOWFU(KBOUT,'Reset proportional score')
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        GO TO 30
+      END IF
+C
+C   RESET IDENTITIES SCORE
+C
+      IF(OPT.EQ.22)THEN
+C        CALL SHOWFU(KBOUT,'Reset identities score')
+        CALL DP22(IDM,MINPER,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        GO TO 30
+      END IF
+C
+C   RESET SPAN (LENGTH)
+C
+      IF(OPT.EQ.20)THEN
+C        CALL SHOWFU(KBOUT,'Reset span length')
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        GO TO 30
+      END IF
+C
+C   RESET HORIZONTAL REGION
+C
+      IF(OPT.EQ.4)THEN
+C        CALL SHOWFU(KBOUT,'Reset active region')
+        CALL QVORH(VORH,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),
+     +  KBIN,KBOUT,FILEH,FILEV)
+        IF(VORH.EQ.'H')THEN
+          CALL GTREG(KBIN,KBOUT,ISTARH,IENDH,ISH1,ISH2,
+     +    'Define active region',
+     +    HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+       CALL REDEFD(IDIMH,ISH1,ISH2,MAXCOM,IDIMHA,ISTARH,IENDH,IDIMBH)
+C      CALL REDEFA
+C     +(SEQHC,IDIMH,ISH1,ISH2,MAXCOM,IDIMHA,ISTARH,IENDH,IDIMBH,
+C     +DEVNOS(5),KBIN,KBOUT,
+C     +IOK,SEQW,LREC,NREC,
+C     +HELPS(4),HELPE(4),HELPF,DEVNOS(4),MACTYP,MACSHT,MACLNG)
+          IDIMHP=IDIMHA+LENGTH
+          CALL CONNUM(SEQHC(ISH1-ISTARH+1),SEQH(MSPO21),IDIMHA)
+          CALL FILEDG(SEQHC,IDIMBH,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+     +    CHRSET,IDM,MSPO2,ISTARH,IENDH)
+          NOROO='N'
+          NOROE='N'
+          IH1 = ISH1
+          I = MIN(MAXSA,IDIMHA) - 1
+          IH2 = IH1 + I
+          GO TO 30
+        END IF
+        IF(VORH.EQ.'V')THEN
+C
+C  RESET VERTICAL REGION
+C
+          CALL GTREG(KBIN,KBOUT,ISTARV,IENDV,ISV1,ISV2,
+     +    'Define active region',
+     +    HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+          CALL REDEFD(IDIMV,ISV1,ISV2,MAXCOM,IDIMVA,
+     +    ISTARV,IENDV,IDIMBV)
+C      CALL REDEFA
+C     +(SEQVC,IDIMV,ISV1,ISV2,MAXCOM,IDIMVA,ISTARV,IENDV,IDIMBV,
+C     +DEVNOS(6),KBIN,KBOUT,
+C     +IOK,SEQW,LREC,NREC,
+C     +HELPS(4),HELPE(4),HELPF,DEVNOS(4),MACTYP,MACSHT,MACLNG)
+          IDIMVP=IDIMVA+LENGTH
+C   DECODE CHARACTERS INTO INTEGERS IN ACTIVE REGION
+          CALL CONNUM(SEQVC(ISV1-ISTARV+1),SEQV(MSPO21),IDIMVA)
+          CALL FILEDG(SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+     +    CHRSET,IDM,MSPO2,ISTARV,IENDV)
+          NOROO='N'
+          NOROE='N'
+          I = MIN(MAXSA,IDIMVA) - 1
+          IV1 = ISV1
+          IV2 = IV1 + I
+          GO TO 30
+        END IF
+      END IF
+C
+C   WRITE ALIGNED SEQUENCES
+C
+      IF(OPT.EQ.5)THEN
+C        CALL SHOWFU(KBOUT,'List sequences')
+        CALL WRITAL(SEQHC,IDIMBH,ISTARH,IENDH,
+     +  SEQVC,IDIMBV,ISTARV,IENDV,IDEV,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 30
+      END IF
+C
+C   XHAIRS
+C
+      IF(OPT.EQ.13)THEN
+C        CALL SHOWFU(KBOUT,'Use crosshair')
+        XMIN=ISH1
+        XMAX=ISH2
+        YMIN=ISV1
+        YMAX=ISV2
+        CALL CLEARV
+        CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,
+     +  ISXMAX,ISYMAX,IHX,IHY,N,KBOUT,
+     +  SEQHC,ISTARH,IDIMBH,SEQVC,ISTARV,IDIMBV,2)
+        GO TO 30
+      END IF
+C
+C   RULER
+C
+      IF(OPT.EQ.12)THEN
+C        CALL SHOWFU(KBOUT,'Display ruler')
+        WRITE(KBOUT,*)' Horizontal scale'
+        CALL RULER(ISH1,ISH2,MARGL,MARGR,MARGB,MARGT,
+     +  ISXMAX,ISYMAX,KBIN,KBOUT,1,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        WRITE(KBOUT,*)' Vertical scale'
+        CALL RULER(ISV1,ISV2,MARGL,MARGR,MARGB,MARGT,
+     +  ISXMAX,ISYMAX,KBIN,KBOUT,2,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 30
+      END IF
+C
+C   CLEAR GRAPHICS
+C
+      IF(OPT.EQ.10)THEN
+        CALL CLEARG
+        GO TO 30
+      END IF
+C
+C   KLEAR TEXT
+C
+      IF(OPT.EQ.11)THEN
+        CALL CLEARV
+        GO TO 30
+      END IF
+C
+C   CALCULATE EXPECTED SCORES
+C
+      IF(OPT.EQ.23)THEN
+C        CALL SHOWFU(KBOUT,'Calculate expected proportional scores')
+        IF(KOPT.EQ.1) THEN
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+C   ARRAYS LARGE ENOUGH?
+        ITEMP=MATMAX*LENGTH
+        IF(ITEMP.LE.IDPOLY)THEN
+          CALL DIAEXP(SEQVC,IDIMBV,SEQHC,IDIMBH,
+     +    MATRIX,IDM,LENGTH,MATMAX,POLYA,POLYB,IDPOLY,KBIN,KBOUT,
+     +    IDEV,CHRSET,NOROE,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+          GO TO 30
+        END IF
+C   OVER RANGE
+        CALL ERROM(KBOUT,
+     +'Polynomial out of range with this span and matrix')
+        GO TO 30
+      END IF
+C
+C   CALCULATE OBSERVED SCORES (NO PLOTING)
+C
+      IF(OPT.EQ.24)THEN
+C        CALL SHOWFU(KBOUT,'Calculate observed proportional scores')
+        IF(KOPT.EQ.1) THEN
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+        ISAME = 0
+        IF((JSAME.EQ.1).AND.(ISH1.EQ.ISV1)) ISAME = 1
+        CALL DIAOBS(SEQV(MSPO21-LB),SEQV(LF+MSPO21),IDIMVP,
+     +  SEQH(MSPO21-LB),SEQH(LF+MSPO21),IDIMHP,MATRIX,IDM,
+     +  LINEB,LINEC,LINEE,LENGTH,NOROO,PERC,SCORE,IDPOLY,KBIN,KBOUT,
+     +  IDEV,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),ISAME)
+        GO TO 30
+      END IF
+C
+C   SHOW CURRENT PARAMETERS
+C
+      IF(OPT.EQ.25)THEN
+        CALL CLEARV
+C        CALL SHOWFU(KBOUT,'Show settings')
+        CALL DSTAT(FILEH,ISH1,ISH2,FILEV,ISV1,ISV2,
+     +  LENGTH,MINPRO,MINPER,KBOUT,JSAME,MARKI)
+        GO TO 30
+      END IF
+C
+C   QUICK SCAN
+C
+      IF(OPT.EQ.26)THEN
+C        CALL SHOWFU(KBOUT,'Quick scan')
+        IF(KOPT.EQ.1) THEN
+        CALL DP22(IDM,MINPER,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP33(RNSD,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+        CALL CLEARV
+        CALL VECTOM
+        CALL FRAME(MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX)
+        L17A = MIN(MINPER,L17)
+        IDE = (IDM-1)**L17A
+        CALL VT100M
+        ISAME = 0
+        IF((JSAME.EQ.1).AND.(ISH1.EQ.ISV1)) ISAME = 1
+        CALL QICKS(SEQV(MSPO21),IDIMVA,POSN,WORDP,IDE,SEQH(MSPO21),
+     +  IDIMHA,CONSTS,L17A,IDM,LCONST,HIST,
+     +  MAXCOM,MATRIX,SEQS,
+     +  MAIND,MARGL,MARGR,MARGB,MARGT,ISXMAX,ISYMAX,
+     +  LENGTH,MINPRO,KBOUT,RNSD,ISAME)
+        CALL VT100M
+C        CALL SHOBOX(SEQVC,IDIMBV,SEQHC,IDIMBH,IWX1,IWX2,IWX3,IWX4,
+C     +  HELPS(OPT),HELPE(OPT),
+C     +  HELPF,DEVNOS(4),KBIN,KBOUT,ISTARH,ISTARV,IENDH,IENDV)
+        GO TO 30
+      END IF
+C
+C  EDIT
+C
+      IF(OPT.EQ.9)THEN
+C        CALL SHOWFU(KBOUT,'Edit sequences')
+        CALL ALIGN(SEQVC,IDIMV,SEQHC,IDIMH,MAXSEQ,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),ISTARV,IENDV,
+     +  ISTARH,IENDH,FILEH,FILEV)
+        WRITE(KBOUT,1003)
+1003  FORMAT(
+     +' Warning: if you have just edited either sequence you must',/,
+     +' redefine the active region before comparing the sequences')
+C   MAY HAVE CHANGED SEQUENCE SO REDO INTEGER VERSION
+C   DECODE CHARACTERS INTO INTEGERS IN ACTIVE REGION
+C      CALL CONNUM(SEQVC(ISV1-ISTARV+1),SEQV(MSPO21),IDIMVA,CHRSET,IDM)
+C      CALL FILEDG(SEQVC,IDIMVA,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+C     +CHRSET,IDM,MSPO2,ISTARV,IENDV)
+C   DECODE CHARACTERS INTO INTEGERS IN ACTIVE REGION
+C      CALL CONNUM(SEQHC(ISH1-ISTARH+1),SEQH(MSPO21),IDIMHA,CHRSET,IDM)
+C      CALL FILEDG(SEQHC,IDIMHA,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+C     +CHRSET,IDM,MSPO2,ISTARH,IENDH)
+C      NOROO='N'
+C      NOROE='N'
+      GO TO 30
+      END IF
+C
+C   WRITE HORIZONTAL SEQUENCE TO DISK
+C
+      IF(OPT.EQ.8)THEN
+C        CALL SHOWFU(KBOUT,'Save active sequence')
+        CALL QVORH(VORH,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),
+     +  KBIN,KBOUT,FILEH,FILEV)
+        IF(VORH.EQ.'H')THEN
+          CALL WRTACT(DEVNOS(1),FILOUT,KBIN,KBOUT,
+     +    SEQHC(ISH1),IDIMHA,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+          GO TO 30
+        END IF
+C
+C  WRITE VERTICAL SEQUENCE TO DISK
+C
+        IF(VORH.EQ.'V')THEN
+          CALL WRTACT(DEVNOS(1),FILOUT,KBIN,KBOUT,
+     +    SEQVC(ISV1),IDIMVA,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+          GO TO 30
+         END IF
+      END IF
+C
+C   STOP
+C
+      IF(OPT.EQ.2) GO TO 9999
+C
+C  DRAW A /
+C
+      IF(OPT.EQ.27)THEN
+C        CALL SHOWFU(KBOUT,'Draw diagonal line')
+        CALL VECTOM
+        CALL DIALIN(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT)
+        GO TO 30
+      END IF
+C
+C   GIVE HELP
+C
+      IF(OPT.EQ.1)THEN
+C        CALL SHOWFU(KBOUT,'Help')
+      CALL HELP(HELPS,HELPE,BOTOPT,TOPOPT,HELPF,DEVNOS(4),KBIN,KBOUT)
+      GO TO 30
+      END IF
+C
+C   READ NEW SEQUENCE
+C
+      IF(OPT.EQ.3)THEN
+C        CALL SHOWFU(KBOUT,'Read new sequence')
+        CALL QVORH(VORH,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),
+     +  KBIN,KBOUT,FILEH,FILEV)
+        IF(VORH.EQ.'H')GO TO 20
+        IF(VORH.EQ.'V')GO TO 21
+      END IF
+C
+C  CHANGE MARGINS
+C
+      IF(OPT.EQ.14)THEN
+C        CALL SHOWFU(KBOUT,'Change margins')
+        CALL MARGC1(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KBIN,KBOUT)
+        GO TO 30
+      END IF
+C
+C  LIST MATCHING SPANS
+C
+      IF(OPT.EQ.19)THEN
+C        CALL SHOWFU(KBOUT,'List matching spans')
+        IF(KOPT.EQ.1) THEN
+        CALL DP20(LENGTH,MXSPAN,IDIMVA,IDIMHA,IDIMVP,IDIMHP,
+     +  NOROO,NOROE,MAXSEQ,
+     +  SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV,
+     +  SEQHC,IDIMBH,ISH1,ISH2,SEQH,
+     +  ISTARH,IENDH,LB,LF,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+        ISAME = 0
+        IF((JSAME.EQ.1).AND.(ISH1.EQ.ISV1)) ISAME = 1
+        CALL DIAPRW(SEQV(MSPO21-LB),SEQV(LF+MSPO21),
+     +  IDIMVP,SEQH(MSPO21-LB),SEQH(LF+MSPO21),
+     +  IDIMHP,MATRIX,IDM,LINEB,LINEC,LINEE,
+     +  LENGTH,MINPRO,SEQVC,IDIMBV,SEQHC,IDIMBH,
+     +  KBOUT,IDEV,ISV1,ISH1,ISTARH,ISTARV,ISAME)
+        GO TO 30
+      END IF
+C
+C  LABEL
+C
+      IF(OPT.EQ.15)THEN
+C        CALL SHOWFU(KBOUT,'Label a diagram')
+        CALL LABLER(KBIN,KBOUT,ISXMAX,ISYMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+      END IF
+C
+C  type text
+C
+      IF(OPT.EQ.6)THEN
+C        CALL SHOWFU(KBOUT,'Page through a text file')
+         CALL TTEXT(DEVNOS(1),FILNAM,KBIN,KBOUT,
+     +   HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+      END IF
+C
+C  DIRECT OUTPUT TO DISK
+C
+      IF(OPT.EQ.7)THEN
+C        CALL SHOWFU(KBOUT,'Redirect output')
+        CALL REDIR(IDEV,DEVNOS(2),DEVNOS(3),IGORT,FILNAM,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KOPT)
+        GO TO 30
+      END IF
+C
+C  ALIGN
+C
+      IF(OPT.EQ.28)THEN
+C        CALL SHOWFU(KBOUT,'Align the sequences')
+        IF(KOPT.EQ.1) THEN
+          WRITE(KBOUT,1019)
+          CALL BPAUSE(KBIN,KBOUT,IOK)
+          IF(IOK.NE.0) GO TO 30
+          XMIN=ISH1
+          XMAX=ISH2
+          YMIN=ISV1
+          YMAX=ISV2
+          CALL CLEARV
+          CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,
+     +    ISXMAX,ISYMAX,IHX,IHY,N,KBOUT,
+     +    SEQHC,ISTARH,IDIMBH,SEQVC,ISTARV,IDIMBV,2)
+          IH1 = MAX(1,IHX)
+          IV1 = MAX(1,IHY)
+          CALL XHAIRN(XMAX,XMIN,YMAX,YMIN,MARGL,MARGR,MARGB,MARGT,
+     +    ISXMAX,ISYMAX,IHX,IHY,N,KBOUT,
+     +    SEQHC,ISTARH,IDIMBH,SEQVC,ISTARV,IDIMBV,2)
+          IH2 = MIN(IHX,IDIMH)
+          IV2 = MIN(IHY,IDIMV)
+        END IF
+        LENV = IV2 - IV1 + 1
+        LENH = IH2 - IH1 + 1
+        IF((LENV.LT.10).OR.(LENH.LT.10)
+     +  .OR.(LENV.GT.MAXSA).OR.(LENH.GT.MAXSA)) THEN
+          WRITE(KBOUT,1019)
+1019      FORMAT(/,
+     +' Define the region to align using the cross-hair.',/,
+     +' First identify the bottom left position and exit',/,
+     +' the cross-hair routine. Then the top right.')
+          GO TO 30
+        END IF
+        IF(KOPT.EQ.1) THEN
+        CALL GETGAP(KBIN,KBOUT,IGAPG,IGAPH,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDM,IOK)
+        IF(IOK.NE.0) GO TO 30
+        END IF
+        WRITE(KBOUT,*)'Aligning region',IV1,' to',IV2
+        WRITE(KBOUT,*)'    with region',IH1,' to',IH2
+        CALL PAMDIS(MATRIX,WTS,IDM)
+        CALL ALIGNM(SEQV(MSPO21+IV1-ISV1),SEQH(MSPO21+IH1-ISH1),
+     +  LENV,LENH,CC,DD,RR,SS,SOP,MAXSA,IV1,IH1,IDIMBV,IDIMBH,
+     +  MAXSEQ,
+     +  WTS,IDM,ISCORE,IGAPG,IGAPH,KBIN,KBOUT,IDEV,SEQVC,
+     +  SEQHC,SEQ1A,SEQ2A,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),KEEP,STACK,MAXSTK,STKREC,
+     +  IOK)
+        NOROE = 'N'
+        NOROO = 'N'
+        IF(IOK.NE.0)GO TO 30
+C  IDIMH  = THE ACTUAL SEQUENCE LENGTH (AND THEREFORE THE NUMBER OF ELEMENTS 
+C           IN THE DISK BUFFER)
+C  ISTARH = THE SEQUENCE NUMBER OF THE CHARACTER OCCUPYING SEQHC(1)
+C  ISH1   = THE SEQUENCE NUMBER OF THE FIRST CHARACTER IN THE ACTIVE REGION
+C  ISH2   = THE SEQUENCE NUMBER OF THE LAST CHARACTER IN THE ACTIVE REGION
+C  IDIMHA = ISH2-ISH11+1 I.E. THE NUMBER OF ELEMENTS IN THE ACTIVE REGION
+C  IENDH  = THE SEQUENCE NUMBER OF THE LAST ELEMENT OF SEQ
+C  IDIMBH = IEND-ISTART+1 I.E. THE NUMBER OF ELEMENTS IN THE RAM BUFFER
+      IF(KEEP.EQ.0) THEN
+        WRITE(KBOUT,1004)
+1004    FORMAT(' Warning: active region extended rightwards')
+C  NEXT FOUR LINES FOR NONDISKBUFFERED VERSION !!!!!!!!!!!
+        IDIMV = MAX(IDIMV,IDIMBV)
+        IDIMBV = IDIMV
+        IDIMH = MAX(IDIMH,IDIMBH)
+        IDIMBH = IDIMH
+        IENDV = IDIMV
+        IENDH = IDIMH
+        ISV2  = MIN(IENDV,ISV1+MAXCOM-1)
+        ISH2  = MIN(IENDH,ISH1+MAXCOM-1)
+        IDIMVA = ISV2 - ISV1 + 1
+        IENDBV = IENDV - ISTARV + 1
+        IDIMHA = ISH2 - ISH1 + 1
+        IENDBH = IENDH - ISTARH + 1
+        IDIMHP=IDIMHA+LENGTH
+        CALL CONNUM(SEQHC(ISH1-ISTARH+1),SEQH(MSPO21),IDIMHA)
+        CALL FILEDG(SEQHC,IDIMBH,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+     +  CHRSET,IDM,MSPO2,ISTARH,IENDH)
+        IDIMVP=IDIMVA+LENGTH
+        CALL CONNUM(SEQVC(ISV1-ISTARV+1),SEQV(MSPO21),IDIMVA)
+        CALL FILEDG(SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+     +  CHRSET,IDM,MSPO2,ISTARV,IENDV)
+        NOROO='N'
+        NOROE='N'
+      END IF
+      GO TO 30
+      END IF
+********
+C
+C   COMPLEMENT SEQUENCE
+C
+      IF(OPT.EQ.29)THEN
+C        CALL SHOWFU(KBOUT,'Complement sequence')
+        IF(IDM.EQ.26)THEN
+          WRITE(KBOUT,*)'Not done for proteins!'
+          GO TO 30
+        ELSE
+          CALL QVORH(VORH,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),
+     +    KBIN,KBOUT,FILEH,FILEV)
+          IF(VORH.EQ.'H')THEN
+            CALL SQREV(SEQHC,IDIMBH)
+            CALL SQCOM(SEQHC,IDIMBH)
+            CALL CONNUM(SEQHC(ISH1-ISTARH+1),SEQH(MSPO21),IDIMHA)
+            CALL FILEDG(SEQHC,IDIMBH,ISH1,ISH2,SEQH,MXCOMP,LENGTH,
+     +      CHRSET,IDM,MSPO2,ISTARH,IENDH)
+            NOROO='N'
+            NOROE='N'
+            GO TO 30
+          END IF
+          IF(VORH.EQ.'V')THEN
+            CALL SQREV(SEQVC,IDIMBV)
+            CALL SQCOM(SEQVC,IDIMBV)
+C   DECODE CHARACTERS INTO INTEGERS IN ACTIVE REGION
+            CALL CONNUM(SEQVC(ISV1-ISTARV+1),SEQV(MSPO21),IDIMVA)
+            CALL FILEDG(SEQVC,IDIMBV,ISV1,ISV2,SEQV,MXCOMP,LENGTH,
+     +      CHRSET,IDM,MSPO2,ISTARV,IENDV)
+            NOROO='N'
+            NOROE='N'
+            GO TO 30
+          END IF
+        END IF
+        GO TO 30
+      END IF
+C
+C  SWITCH MAIN DIAGONAL
+C
+      IF(OPT.EQ.30)THEN
+C        CALL SHOWFU(KBOUT,'Toggle main diagonal switch')
+        CALL SWTCHD(JSAME,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        NOROO='N'
+        GO TO 30
+      END IF
+      IF(OPT.EQ.31)THEN
+C        CALL SHOWFU(KBOUT,'Toggle identities switch')
+        CALL SWTCHI(MARKI,KBIN,KBOUT,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+        GO TO 30
+      END IF
+C
+C  CHANGE SCORE MATRIX
+C
+      IF(OPT.EQ.32)THEN
+C        CALL SHOWFU(KBOUT,'Select score matrix')
+        CALL MATTIN(MATRIX,IDM,FILNAM,FILEP,CHRSET,
+     +  KBIN,KBOUT,DEVNOS(1),MATMAX,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4))
+C   SET AVERAGE SCORE FOR DEFAULTS
+        AVSCOR=0.75
+        IF(MATMAX.GT.3)AVSCOR=AVSCOR*16.
+        NOROO='N'
+        NOROE='N'
+        GO TO 30
+      END IF
+      IF(OPT.EQ.33) THEN
+C        CALL SHOWFU(KBOUT,'Reset number of standard deviations')
+        CALL DP33(RNSD,
+     +  KBIN,KBOUT,HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IOK)
+        GO TO 30
+      END IF
+      IF(OPT.EQ.34) THEN
+C        CALL SHOWFU(KBOUT,'Reset gap penalties')
+        CALL GETGAP(KBIN,KBOUT,IGAPG,IGAPH,
+     +  HELPS(OPT),HELPE(OPT),HELPF,DEVNOS(4),IDM,IOK)
+        GO TO 30
+      END IF
+      GO TO 30
+9999  CONTINUE
+      CALL SHUTD
+      END
diff --git a/src/staden/sipl.f b/src/staden/sipl.f
new file mode 100644
index 0000000..f883f30
--- /dev/null
+++ b/src/staden/sipl.f
@@ -0,0 +1,585 @@
+C  SIPL
+C  AUTHOR RODGER STADEN
+C  14-11-90 Replaced radio by radion
+C   11-12-90 Changed library handling: added filnll and paramter
+C  12-12-90 Changed max word length for dna from 4 to 6 and max 
+C           word length for protein to 3
+C  25-2-92 changed call to rdseq (uses the stack array for temp
+C          storage so call to begina is also changed)
+C  2-3-92 set filnam = ' ' for calls to openf1
+      SUBROUTINE FMAIN()
+      PARAMETER (NAMLEN = 60)
+      CHARACTER*(NAMLEN) FILNAM,FILEP,HELPF
+      CHARACTER*(NAMLEN) LIBLF
+      INTEGER STKREC
+      PARAMETER (
+     +           MAXEN = 10000,
+     +           MAXSEQ  = 20000,
+     +           NMAX  = 20000,
+     +           STKREC = 11,
+     +           MAXSTK = NMAX*3*STKREC,
+     +           MAXDEV=10,
+     +           MAXK = 3,
+     +           IDMM = 26,
+     +           LCONST = MAXK*(IDMM-1),
+     +           FILEP='PROTMAT',
+     +           LIBLF = 'SEQUENCELIBRARIES')
+      INTEGER MATRIX(IDMM,IDMM),DEVNOS(MAXDEV)
+      INTEGER WTS(0:IDMM,0:IDMM)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXSEQ)
+      CHARACTER SEQ1A(NMAX),SEQ2A(NMAX)
+      INTEGER SEQ1N(NMAX),SEQ2N(NMAX),SEQS(MAXSEQ)
+      INTEGER STACK(0:MAXSTK)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1)
+      INTEGER RR(0:NMAX+1),SS(0:NMAX+1)
+      INTEGER SOP(0:2*NMAX)
+      INTEGER MATDNA(5,5),SPAN,SPANCT
+      INTEGER SCORE(MAXEN),PENTRY,PSCORE(MAXEN)
+      INTEGER HIST(-MAXSEQ:MAXSEQ),WORDP(IDMM**MAXK)
+      INTEGER POSN(MAXSEQ),CONSTS(0:LCONST),MAIND(IDMM)
+      CHARACTER CHRSET(IDMM)
+      CHARACTER TITLE*60,NAMIN*10,ENAMEL*10,LTYPE
+      CHARACTER ENAME(MAXEN)*10,TITLES(MAXEN)*60
+      PARAMETER (MAXDIV = 15)
+      INTEGER DIVDEV(MAXDIV),RSIZEN
+      EQUIVALENCE (MATRIX,MATDNA)
+      DATA MATDNA/
+     +1,0,0,0,0,
+     +0,1,0,0,0,
+     +0,0,1,0,0,
+     +0,0,0,1,0,
+     +0,0,0,0,0/
+C   GET DEVICE NUMBERS
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      IDEV1 = DEVNOS(1)
+      IDEVH = DEVNOS(2)
+      IDEVR = DEVNOS(3)
+      IDM = IDMM
+      NASD = 0
+      NDIAGT = 0
+      RNSD = 3.
+      IDEVNL = DEVNOS(4)
+      IDEVLL = DEVNOS(5)
+      IDEVEN = DEVNOS(6)
+      IDEVAN = DEVNOS(7)
+      IDEVDL = DEVNOS(8)
+      IDEVLF = DEVNOS(9)
+      IDEVD = DEVNOS(MAXDEV)
+      CALL BEGINA(KBIN,KBOUT,MAXSEQ,SEQ1,SEQ1N,
+     +IDEV1,IDEVR,IDEVH,
+     +FILNAM,HELPF,IALIGN,MINLEN,MINSCR,NSHOW,MAXEN,
+     +FILEP,MATRIX,WTS,IDM,CHRSET,IGAPG,IGAPH,IDIM1,KTUP,
+     +MAIND,RNSD,SPAN,SPANCT,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBLF,LIBNO,
+     +STACK,MAXSTK,IOK)
+      IF(IOK.NE.0) STOP
+C  OPEN LIBRARY
+      LIBIN = 1
+      IF(IDM.EQ.26) LIBIN = 2
+      CALL RDLIBL(FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEVLL,IDEVEN,IDEVNL,
+     +LIBLF,LIBIN,DIVDEV,MAXDIV,IDEVD,
+     +LIST,ENAMEL,LIBTYP,LTYPE,NDIV,RSIZEN,NRECEN,IOK)
+      IF(IOK.NE.0)STOP
+      IDE = IDM**KTUP
+      CALL SETCN(CONSTS,KTUP,IDM,LCONST)
+      CALL ENCONC(SEQ1N,IDIM1,POSN,WORDP,IDE,IDM,CONSTS,KTUP,LCONST)
+      CALL WDSCR(SEQ1N,SEQS,IDIM1,KTUP,MAIND,IDM)
+      NENTRY = 0
+      PENTRY = 0
+       CALL ACALC1(SEQ1N,IDIM1,SEQ2N,MAXSEQ,SCORE,PENTRY,IDM,
+     + SEQ2,TITLE,ENAME,TITLES,MAXEN,MINLEN,
+     + NENTRY,LIST,FILNAM,IDEVR,
+     + DIVDEV,NDIV,ENAMEL,NAMIN,IDEVEN,RSIZEN,NRECEN,IDEVNL,LIBTYP,
+     + LTYPE,MINSCR,KBOUT,WORDP,IDE,POSN,CONSTS,LCONST,HIST,KTUP,
+     + MATRIX,SEQS,RNSD,SPAN,SPANCT,NASD,NDIAGT)
+900   CONTINUE
+      WRITE(IDEVR,1009)NENTRY,PENTRY
+1009  FORMAT(
+     +' ',I7,' entries processed,',I6,' above cutoff, sorting now')
+      IF(PENTRY.LT.1)GO TO 921
+      WRITE(IDEVR,*)'Entries exceeding sd cutoff=',NASD
+      WRITE(IDEVR,*)
+     +'Mean number of diagonals above span cutoff',REAL(NDIAGT)/NASD
+      N = MIN(MAXEN,PENTRY)
+      INCDEC = -1
+      CALL SHLSRT(SCORE,N,PSCORE,INCDEC)
+      J = MIN(PENTRY,NSHOW)
+      WRITE(IDEVR,*)'List in score order'
+      DO 910 I = 1,J
+         WRITE(IDEVR,1015)
+     +   SCORE(PSCORE(I)),ENAME(PSCORE(I)),TITLES(PSCORE(I))
+1015     FORMAT(' ',I6,' ',A,' ',A)
+910   CONTINUE
+C      IF(1.EQ.1)STOP
+      IF(IALIGN.EQ.0) THEN
+        WRITE(KBOUT,*)'Aligning'
+        CALL ACALC2(SEQ1,IDIM1,SEQ2,MAXSEQ,SEQ1A,SEQ2A,
+     +  SEQ1N,SEQ2N,SCORE,PENTRY,IDM,STACK,MAXSTK,STKREC,
+     +  CC,DD,RR,SS,SOP,NMAX,WTS,
+     +  TITLE,ENAME,TITLES,MAXEN,
+     +  NENTRY,IGAPG,IGAPH,IDEVR,
+     +  NAMIN,ENAMEL,IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,
+     +  ICREC,IFINEX,LIBTYP,LTYPE,
+     +  PSCORE,NSHOW,KBOUT)
+      END IF
+921   CONTINUE
+      CLOSE(UNIT=IDEVR)
+      GO TO 923
+999   CONTINUE
+      WRITE(KBOUT,1010)NENTRY
+1010  FORMAT(' Error reading names file',I7,' entries processed')
+      CLOSE(UNIT=IDEVR)
+923   CONTINUE
+      END
+      SUBROUTINE BEGINA(KBIN,KBOUT,MAXSEQ,SEQ,SEQN,
+     +IDEV1,IDEVR,IDEVH,
+     +FILNAM,HELPF,IALIGN,MINLEN,MINSCR,NSHOW,MAXEN,
+     +FILEP,MATRIX,WTS,IDM,CHRSET,IGAPG,IGAPH,IDIM1,MINPER,
+     +MAIND,RNSD,LENGTH,MINPRO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBLF,LIBNO,
+     +WORKI,MAXWIR,IOK)
+      INTEGER MATRIX(IDM,IDM),WTS(0:IDM,0:IDM),SEQN(MAXSEQ),MAIND(IDM)
+      INTEGER WORKI(MAXWIR)
+      CHARACTER SEQ(MAXSEQ),CHRSET(IDM)
+      CHARACTER FILNAM*(*),FILEP*(*),HELPF*(*)
+      CHARACTER*(*) LIBLF
+      EXTERNAL NORP
+C      IOK = 1
+      WRITE(KBOUT,1000)
+1000  FORMAT(/,
+     +' SIPL (Similarity investigation program (Library))',
+     +' V4.0 Feb 1992',/,
+     +' Author: Rodger Staden',/,
+     +' Compares a probe protein or nucleic acid',/,
+     +' sequence against a library of sequences',/)
+      IDEV=KBOUT
+      IDIMT = 0
+      IDIM1 = MAXSEQ
+      IFORNO = 0
+      LIBNO = 2
+2     CONTINUE
+      WRITE(KBOUT,*)'Select probe sequence'
+      CALL RDSEQ(
+     +SEQ,MAXSEQ,IDIMT,J1,J2,ISTART,IEND,IDIM1,IDIMB,
+     +IDEV1,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IFORNO,
+     +IDEVLL,IDEVEN,IDEVAN,IDEVDL,
+     +IDEVLF,LIBNO,LIBLF,WORKI,MAXWIR,IOK)
+      IF(IOK.NE.0) RETURN
+      IDM = NORP(SEQ,IDIMT)
+      CALL INITLU(IDM)
+C   GIVE COMPOSITION AS A CHECK
+      IF(IDM.EQ.26)THEN
+       IF(IDIM1.GT.0)CALL MWCALC(SEQ,IDIM1,
+     +J1-ISTART+1,J2-ISTART+1,KBOUT,KBOUT)
+      ELSE
+        IF(IDIM1.GT.0)
+     +    CALL BCOMP(SEQ,IDIM1,J1-ISTART+1,J2-ISTART+1,ISTART,KBOUT)
+      END IF
+      CALL CONNUM(SEQ,SEQN,IDIM1)
+      IF(IDM.EQ.26)THEN
+C   READ IN THE SCORE MATRIX (USUALLY MDM78)
+        CALL GETMAT(IDEV1,FILEP,MATRIX,IDM,CHRSET,KBOUT,IOK)
+        IF(IOK.NE.0)RETURN
+      END IF
+C   NEED TO KNOW LARGEST SCORE IN MATRIX FOR STATS
+      MATMAX=0
+      DO 20 I=1,IDM
+        DO 20 J=1,IDM
+          IF(MATMAX.LT.MATRIX(I,J))MATMAX=MATRIX(I,J)
+20    CONTINUE
+      MATMAX=MATMAX+1
+C   SET AVERAGE SCORE FOR DEFAULTS
+        AVSCOR=0.75
+        IF(IDM.EQ.26)AVSCOR=AVSCOR*16.
+      FILNAM = ' '
+      CALL OPENF1(IDEVR,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Results file',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0)RETURN
+      CALL YESNO(IALIGN,'Display alignments',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(IALIGN.LT.0) THEN
+       IOK = 1
+       RETURN
+      END IF
+      MN = 10
+      MX = MAXSEQ
+      MINLEN = INT(IDIM1*0.8)
+      CALL GETINT(MN,MX,MINLEN,
+     +'Minimum library sequence length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINLEN = IVAL
+      MN = 1
+      MX = MAXEN
+      NSHOW = 20
+      CALL GETINT(MN,MX,NSHOW,
+     +'Maximum number of scores to list',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      NSHOW = IVAL
+      CALL DP22L(IDM,MINPER,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL DP33(RNSD,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MN = 1
+      MX = 31
+      LENGTH = 11
+      CALL GETINT(MN,MX,LENGTH,'Odd span length',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(MOD(IVAL,2).NE.1) IVAL = LENGTH
+      LENGTH = IVAL
+      MINPRO = INT(AVSCOR*LENGTH)
+      CALL DP21(MATMAX,AVSCOR,LENGTH,MINPRO,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINPRO = MINPRO - 1
+      MX = IDIM1 * AVSCOR * LENGTH * 2
+      MN = 1
+      MINSCR = MX / 40
+      CALL GETINT(MN,MX,MINSCR,'Minimum global score',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MINSCR = IVAL
+      CALL GETGAP(KBIN,KBOUT,IGAPG,IGAPH,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDM,IOK)
+        IF(IOK.NE.0)RETURN
+      CALL PAMDIS(MATRIX,WTS,IDM)
+      DO 30 I = 1,IDM
+        MAIND(I) = MATRIX(I,I)
+30    CONTINUE
+      END
+      SUBROUTINE DP22L(IDM,MINPER,
+     +KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*)
+      MN = 1
+      IF(IDM.EQ.5) THEN
+        MINPER = 6
+        MX = 6
+      ELSE
+        MINPER = 1
+        MX = 3
+      END IF
+      CALL GETINT(MN,MX,MINPER,'Identity score',
+     +  IVAL,KBIN,KBOUT,IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.EQ.0) MINPER = IVAL
+      END
+      SUBROUTINE ACALC1(SEQ1N,IDIM1,SEQ2N,MAXSEQ,SCORE,PENTRY,IDM,
+     +SEQ2,TITLE,ENAME,TITLES,MAXEN,MINLEN,
+     +NENTRY,LIST,FILNAM,IDEVR,
+     +DIVDEV,NDIV,ENAMEL,NAMIN,IDEVEN,RSIZEN,NRECEN,IDEVNL,LIBTYP,
+     +LTYPE,
+     +MINSCR,KBOUT,WORDP,IDE,POSN,CONSTS,LCONST,HIST,KTUP,MATRIX,
+     +SEQS,RNSD,SPAN,SPANCT,NASD,NDIAGT)
+      INTEGER WORDP(IDE),POSN(IDIM1),CONSTS(0:LCONST)
+      INTEGER HIST(-MAXSEQ:MAXSEQ),SPAN,SPANCT
+      INTEGER SCORE(MAXEN),PENTRY,SEQ1N(IDIM1),SEQ2N(MAXSEQ)
+      INTEGER MATRIX(IDM,IDM),SEQS(IDIM1)
+      CHARACTER SEQ2(MAXSEQ),FILNAM*(*),LTYPE
+      CHARACTER TITLE*60,NAMIN*10,ENAMEL*10
+      CHARACTER ENAME(MAXEN)*10,TITLES(MAXEN)*60
+      INTEGER RSIZEN,DIVDEV(NDIV),ENTRYN
+      IFINEX = 0
+      ICREC = 0
+      ENTRYN = 0
+ 20   CONTINUE
+      IDIM2 = MAXSEQ
+      IF(LIBTYP.EQ.1) THEN
+        CALL CDROML(LIST,NAMIN,ENAMEL,
+     +  IDEVEN,RSIZEN,NRECEN,IDEVNL,SEQ2,IDIM2,
+     +  DIVDEV,NDIV,ICREC,IFINEX,TITLE,KBOUT,LTYPE,IOK)
+        IF(IOK.NE.0) RETURN
+        FILNAM = NAMIN
+      ELSE IF(LIBTYP.EQ.2) THEN
+        CALL RDPIRA(SEQ2,IDIM2,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL)
+      ELSE IF(LIBTYP.EQ.3) THEN
+        CALL RDFASA(SEQ2,IDIM2,
+     +  IDEVEN,KBOUT,TITLE,FILNAM,LIST,ENAMEL,IDEVNL,ENTRYN)
+      ELSE
+        RETURN
+      END IF
+      IF(IDIM2.LT.1)GO TO 900
+      NENTRY = NENTRY + 1
+      IF(IDIM2.LT.MINLEN) GO TO 20
+C  CONVERT TO INTEGER
+      IF(IDIM2.GT.0)CALL CONNUM(SEQ2,SEQ2N,IDIM2)
+C
+C  DO THE SEARCH
+        CALL QICKSL(SEQ1N,IDIM1,POSN,WORDP,IDE,SEQ2N,IDIM2,CONSTS,
+     +  KTUP,IDM,LCONST,HIST,MAXSEQ,MATRIX,SEQS,
+     +  SPAN,SPANCT,RNSD,MSCORE,NASD,NDIAGT)
+      IF(MSCORE.GE.MINSCR)THEN
+        IF(PENTRY.LT.MAXEN)THEN
+          PENTRY = PENTRY + 1
+          SCORE(PENTRY) = MSCORE
+          ENAME(PENTRY) = FILNAM(1:10)
+          TITLES(PENTRY) = TITLE
+        ELSE
+        WRITE(KBOUT,*)'Maximum number of entries (',MAXEN,') exceeded'
+          RETURN
+        END IF
+      END IF
+      GO TO 20
+C
+900   CONTINUE
+      END
+      SUBROUTINE ACALC2(SEQ1,IDIM1,SEQ2,MAXSEQ,SEQ1A,SEQ2A,
+     +SEQ1N,SEQ2N,SCORE,PENTRY,IDM,STACK,MAXSTK,STKREC,
+     +CC,DD,RR,SS,SOP,NMAX,WTS,
+     +TITLE,ENAME,TITLES,MAXEN,
+     +NENTRY,IGAPG,IGAPH,IDEVR,
+     +NAMIN,ENAMEL,IDEVEN,RSIZEN,NRECEN,IDEVNL,DIVDEV,NDIV,
+     +ICREC,IFINEX,LIBTYP,LTYPE,
+     +PSCORE,NSHOW,KBOUT)
+      INTEGER STACK(0:MAXSTK),STKREC
+      INTEGER WTS(0:IDM,0:IDM)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXSEQ)
+      CHARACTER SEQ1A(NMAX),SEQ2A(NMAX)
+      INTEGER SEQ1N(NMAX),SEQ2N(NMAX)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1)
+      INTEGER RR(0:NMAX+1),SS(0:NMAX+1)
+      INTEGER SOP(0:2*NMAX)
+      INTEGER SCORE(MAXEN),PENTRY,PSCORE(MAXEN)
+      CHARACTER TITLE*60,NAMIN*10,ENAMEL*10,LTYPE
+      CHARACTER ENAME(MAXEN)*10,TITLES(MAXEN)*60
+      INTEGER DIVDEV(NDIV)
+      LIST = 2
+      DO 20 I = 1,MIN(NSHOW,PENTRY)
+        J = PSCORE(I)
+        NAMIN = ENAME(J)
+        IDIM2 = MAXSEQ
+        IF(LIBTYP.EQ.1) THEN
+          CALL CDROML(LIST,NAMIN,ENAMEL,
+     +    IDEVEN,RSIZEN,NRECEN,IDEVNL,SEQ2,IDIM2,
+     +    DIVDEV,NDIV,ICREC,IFINEX,TITLE,KBOUT,LTYPE,IOK)
+          IF(IOK.NE.0) RETURN
+        ELSE IF(LIBTYP.EQ.2) THEN
+          REWIND(IDEVEN)
+          CALL RDPIRD(SEQ2,IDIM2,
+     +    IDEVEN,KBOUT,TITLE,NAMIN)
+        ELSE IF(LIBTYP.EQ.3) THEN
+          REWIND(IDEVEN)
+          CALL RDFASD(SEQ2,IDIM2,
+     +    IDEVEN,KBOUT,TITLE,NAMIN)
+        ELSE
+          RETURN
+        END IF
+        IF(IDIM2.LT.1) GO TO 20
+        CALL CONNUM(SEQ2,SEQ2N,IDIM2)
+        WRITE(IDEVR,1000)NAMIN,TITLE
+1000    FORMAT(' ',A,' ',A)
+        CALL ALIGNL(SEQ1N,SEQ2N,IDIM1,IDIM2,CC,DD,RR,SS,SOP,
+     +NMAX,MAXSEQ,WTS,IDM,ISCORE,IGAPG,IGAPH,
+     +IDEVR,SEQ1,SEQ2,SEQ1A,SEQ2A,
+     +STACK,MAXSTK,STKREC,IOK)
+      IF(IOK.NE.0) RETURN
+20    CONTINUE
+      END
+      SUBROUTINE DIFFD(SA,SB,M,N,CC,DD,NMAX,
+     +W,IDM,SCORE,G,H)
+C  RETURNS ALIGNMENT COST (SCORE) USING MYERS AND MILLERS
+C  VERSION OF GOTOH ALGORITHM
+      IMPLICIT INTEGER(A-Z)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1)
+      INTEGER W(0:IDM,0:IDM)
+      INTEGER SA(NMAX),SB(NMAX)
+      CC(0) = 0
+      T = G
+      DO 200 J=1,N
+        T = T + H
+        CC(J) = T
+        DD(J) = T + G
+200   CONTINUE
+      T = G
+      DO 400 I = 1,M
+        S = CC(0)
+        T = T + H
+        C = T
+        CC(0) = C
+        E = T + G
+        DO 300 J = 1,N
+          E = MIN(E,C+G) + H
+          DD(J) = MIN(DD(J),(CC(J)+G)) + H
+          C = MIN(DD(J),E,S+W(SA(I),SB(J)))
+          S = CC(J)
+          CC(J) = C
+300     CONTINUE
+400   CONTINUE
+      SCORE = CC(N)
+      END
+      SUBROUTINE ALIGNL(SEQ1N,SEQ2N,IDIM1,IDIM2,CC,DD,RR,SS,SOP,
+     +NMAX,MAXSEQ,WTS,IDM,ISCORE,IG,IH,
+     +KBOUT,SEQ1,SEQ2,SEQ1A,SEQ2A,
+     +STACK,MAXSTK,STKREC,IOK)
+      INTEGER CC(0:NMAX+1),DD(0:NMAX+1),RR(0:NMAX+1),SS(0:NMAX+1)
+      INTEGER WTS(0:IDM,0:IDM),SOP(0:2*NMAX),R1,R2
+      INTEGER SEQ1N(NMAX),SEQ2N(NMAX),STKREC,STACK(0:MAXSTK)
+      CHARACTER SEQ1(MAXSEQ),SEQ2(MAXSEQ),SEQ1A(NMAX),SEQ2A(NMAX)
+      CHARACTER PAD
+      SAVE PAD
+      DATA PAD/','/
+      LW1 = MIN(IDIM1,NMAX)
+      LW2 = MIN(IDIM2,NMAX)
+      N = MAX(LW1,LW2)
+      CALL DIFF(SEQ1N,SEQ2N,LW1,LW2,
+     +CC,DD,RR,SS,SOP,N,
+     +WTS,IDM,ISCORE,IG,IH,KBOUT,STACK,MAXSTK,STKREC,IOK)
+      IF(IOK.NE.0) RETURN
+      CALL ALIGND(SEQ1,SEQ2,SOP,
+     +SEQ1A,SEQ2A,N,MAXSEQ,R1,R2,PAD,NP1,NP2,IDIM1,IDIM2)
+      KPOUT = MAX(R1,R2)
+      PC = PCON(SEQ1A,SEQ2A,KPOUT,PAD)
+      CALL FMT2(KBOUT,SEQ1A,SEQ2A,KPOUT,1,1)
+      WRITE(KBOUT,1001)PC
+1001  FORMAT(' Conservation ',F5.1,'%')
+      WRITE(KBOUT,1002)NP1,NP2
+1002  FORMAT(' Number of padding characters inserted',I6,' and',I6)
+      END
+      SUBROUTINE QICKSL(SEQ1,IDIM1,POSN,WORDP,IDE,SEQ2,IDIM2,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,MATRIX,SEQS,
+     +SPAN,MINPRO,RNSD,MSCORE,NASD,NDIAGT)
+      INTEGER SEQ1(IDIM1),SEQ2(IDIM2),SEQS(IDIM1),SPAN
+      INTEGER POSN(IDIM1),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER ESCORE
+      INTEGER HIST(-MAXSEQ:MAXSEQ)
+      PARAMETER (MAXDIA = 20)
+      INTEGER TOPD(MAXDIA),TOPI(MAXDIA),TOPJ(MAXDIA),MATRIX(IDM,IDM)
+      EXTERNAL NCODEA,ESCORE
+      CALL FILLI(HIST(-IDIM1),IDIM2+IDIM1+1,0)
+      DO 20 I = 1,IDIM2-LENGTH+1
+        J = NCODEA(SEQ2(I),LENGTH,CONSTS,IDM,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            K = I - J1
+            HIST(K) = HIST(K) + SEQS(J1)
+10          CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              K = I - J1
+              HIST(K) = HIST(K) + SEQS(J1)
+              GO TO 10
+            END IF
+          END IF        
+        END IF
+20    CONTINUE
+      NDIAG = MAXDIA
+      CALL MHIST(HIST,IDIM1,IDIM2,TOPD,TOPI,TOPJ,
+     +NDIAG,MAXSEQ,RNSD)
+      IF(NDIAG.GT.0) THEN
+        NASD = NASD + 1
+        NDIAGT = NDIAGT + NDIAG
+      END IF
+      MSCORE = 0
+      DO 40 I = 1,NDIAG
+        IF(TOPD(I).NE.0) THEN
+          L = MIN(IDIM1-TOPJ(I),IDIM2-TOPI(I)) + 1
+          M = ESCORE(SEQ1(TOPJ(I)),SEQ2(TOPI(I)),L,MATRIX,IDM,
+     +    SPAN,MINPRO)
+          MSCORE = MSCORE + M
+        END IF
+40    CONTINUE
+C      WRITE(*,*)'SCORE',MSCORE
+      END
+      SUBROUTINE SHLSRT(KEY,N,IPOINT,INCDEC)
+C     SHELL SORT
+C     REFERENCES:  D.L. SHELL, CACM 2, 32 (JULY 1959)
+C                  D.E. KNUTH, TAOCP III, SECT. 5.2.1
+C     CALLING SEQUENCE:
+C
+C     KEY    IS AN ARRAY OF KEYS ON WHICH TO SORT
+C     N      IS THE NUMBER OF ITEMS
+C     IPOINT IS THE ARRAY OF POINTERS
+C            (ONLY THE POINTERS WILL MOVE)
+C     INCDEC .GE. 0 FOR SORTING INTO INCREASING ORDER;
+C            .LT. 0 FOR SORTING INTO DECREASING ORDER
+C
+      INTEGER KEY(N),IPOINT(N),K
+      INTEGER H,S,T
+      IPOINT(1) = 1
+      IF(N.LT.2)RETURN
+C     INITIALIZE POINTER ARRAY
+      DO 2 I = 2,N
+        IPOINT(I) = I
+    2 CONTINUE
+C
+C     CHOICE OF SEQUENCE OF INCREMENTS SUGGESTED
+C     BY KNUTH III, EQ. 8, P. 95.   HIS FORMULA
+C     IS EQUIVALENT TO:
+C
+C            H(S) = (3**S - 1)/2
+C            INITIAL VALUE OF S IS MINIMAL INTEGER
+C              SUCH THAT H(S+2) .GE. N
+C
+C
+C     SMAX = (ALOG(2N + 1)/ALOG(3)) - 2 + 1
+      S = INT( (ALOG(FLOAT(2*N+1))/1.09861229) - 0.95 )
+      S = MAX(S,1)
+      H = (3**S - 1)/2
+      DO 7 T = 1,S
+        JMIN = H + 1
+        DO 6 J = JMIN,N
+          I = J - H
+          JJ = IPOINT(J)
+          K = KEY(JJ)
+          IPT = IPOINT(J)
+   3      CONTINUE
+          II = IPOINT(I)
+          IF((K-KEY(II)).LE.0)THEN
+            IPLUSH = I + H
+            IPOINT(IPLUSH) = IPOINT(I)
+            I = I - H
+            IF(I.GT.0) GO TO 3
+          END IF
+          IPLUSH = I + H
+          IPOINT(IPLUSH) = IPT
+    6   CONTINUE
+C     CHANGE INCREMENT
+        IF(H.LT.2) GO TO 8
+        H = (H-1)/3
+7     CONTINUE
+C      CHECK INCDEC: IF NEGATIVE, SWITCH POINTER ARRAY
+8     CONTINUE
+      IF(INCDEC.LT.0)THEN
+        M = N/2
+        NP1MI = N
+        DO 10 I = 1,M
+          NTEMP = IPOINT(I)
+          IPOINT(I) = IPOINT(NP1MI)
+          IPOINT(NP1MI) = NTEMP
+          NP1MI = NP1MI - 1
+  10    CONTINUE
+      END IF
+      END
+      INTEGER FUNCTION ESCORE(SEQ1,SEQ2,L,MATRIX,IDM,SPAN,MINSCR)
+      INTEGER SEQ1(L),SEQ2(L),MATRIX(IDM,IDM),SPAN,FRONT,BACK
+C 8-6-91 fixed bug which allowed span to be > L
+      M = 0
+      MTOT = 0
+      FRONT = SPAN
+      BACK = 0
+      DO 10 I = 1,MIN(SPAN,L)
+        M = M + MATRIX(SEQ1(I),SEQ2(I))
+10    CONTINUE
+      IF(M.GT.MINSCR) MTOT = M
+      DO 20 I = 2,L-SPAN+1
+        FRONT = FRONT + 1
+        BACK = BACK + 1
+        MM = MATRIX(SEQ1(BACK),SEQ2(BACK))
+        MP =  MATRIX(SEQ1(FRONT),SEQ2(FRONT))
+        M = M - MM + MP
+        IF(M.GT.MINSCR) MTOT = MTOT + M
+20    CONTINUE
+      ESCORE = MTOT
+      END
diff --git a/src/staden/sipmen.f b/src/staden/sipmen.f
new file mode 100644
index 0000000..913f090
--- /dev/null
+++ b/src/staden/sipmen.f
@@ -0,0 +1,84 @@
+      SUBROUTINE MENU(OPT,KOPT,MOPT,MAXOPT,MINMEN,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      INTEGER OPT
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+      CHARACTER HELPF*(*)
+1     CONTINUE
+      CALL CLEARV
+      IF(MOPT.EQ.0)THEN
+      WRITE(KBOUT,5000)
+5000  FORMAT(' Menus and their numbers are'/
+     +' m0 = This menu'/
+     1' m1 = General'/
+     2' m2 = Screen control'/
+     5' m3 = Set parameters'/
+     +' m4 = Comparison'/
+     6'  ? = Help'/
+     1'  ! = Quit')
+1001  FORMAT(' ? Menu or option number=',$)
+      ELSE IF(MOPT.EQ.-1)THEN
+              WRITE(KBOUT,5001)
+5001            FORMAT(' General menu'/
+     +          '  0 = List of menus'/
+     1          '  3 = Read new sequence'/
+     1          '  4 = Redefine active region'/
+     +          '  5 = List the sequences'/
+     +          '  6 = List a text file'/
+     +          '  7 = Direct output to disk'/
+     +          '  8 = Write active sequence to disk'/
+     +          '  9 = Edit the sequences'/
+     +          ' 29 = Complement sequences')
+      ELSE IF(MOPT.EQ.-2)THEN
+                WRITE(KBOUT,5002)
+5002            FORMAT(' Screen control menu'/
+     +          '  0 = List of menus'/
+     +          ' 10 = Clear graphics'/
+     +          ' 11 = Clear text'/
+     +          ' 12 = Draw a ruler'/
+     +          ' 13 = Use cross hair'/
+     +          ' 14 = Reposition plots'/
+     +          ' 15 = Label diagram'/
+     +          ' 16 = Display a map'/
+     +          ' 27 = Draw a /')
+      ELSE IF(MOPT.EQ.-3)THEN
+                WRITE(KBOUT,5005)
+5005            FORMAT(' Set parameters menu'/
+     +          '  0 = List of menus'/
+     +          ' 20 = Set span length'/
+     +          ' 21 = Set proportional score'/
+     +          ' 22 = Set identites score'/
+     +          ' 23 = Calculate expected scores'/
+     +          ' 24 = Calculate observed scores'/
+     +          ' 25 = Show current parameter settings'/
+     +          ' 30 = Switch main diagonal'/
+     +          ' 31 = Switch identities'/
+     +          ' 32 = Change score matrix'/
+     +          ' 33 = Set number of sd for Quickscan'/
+     +          ' 34 = Set gap penalties')
+      ELSE IF(MOPT.EQ.-4)THEN
+                WRITE(KBOUT,5006)
+5006            FORMAT(' Comparison menu'/
+     +          '  0 = List of menus'/
+     +          ' 17 = Apply identities algorithm'/
+     +          ' 18 = Apply proportional algorithm'/
+     +          ' 19 = List matching spans'/
+     +          ' 26 = Quick scan'/
+     +          ' 28 = Align sequences')
+      END IF
+3     CONTINUE
+      WRITE(KBOUT,1001)
+      CALL GETOPT(KBIN,KOPT,OPT)
+      IF((OPT.LT.MINMEN).OR.(OPT.GT.MAXOPT).OR.(KOPT.GT.2)) THEN
+        MOPT = 0
+        GO TO 1
+      END IF
+      IF((KOPT.EQ.2).AND.(OPT.GE.0)) THEN
+        CALL HELP2(IHELPS(OPT),IHELPE(OPT),HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 1
+      END IF
+      IF(OPT.LE.0) THEN
+        MOPT = OPT
+        GO TO 1
+      END IF
+      END
diff --git a/src/staden/splitp1.f b/src/staden/splitp1.f
new file mode 100644
index 0000000..3f7efe3
--- /dev/null
+++ b/src/staden/splitp1.f
@@ -0,0 +1,58 @@
+C  ROUTINE TO TRANSLATE PROSITE DATABASE
+C 2-3-92 set filnam = ' ' 
+      SUBROUTINE FMAIN()
+      PARAMETER (MAXSTR = 80, MAXDEV = 4)
+      INTEGER DEVNOS(MAXDEV)
+      CHARACTER*(MAXSTR) AMOS,IDLINE,TITLE
+      CHARACTER*40 PATNAM,HELPF,FILNAM
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      WRITE(KBOUT,*)'SPLITP1 splits prosite.dat into one file per entry'
+      WRITE(KBOUT,*)'        and creates and index'
+      IDEV1 = DEVNOS(1)
+      IDEV2 = DEVNOS(2)
+      IDEV4 = DEVNOS(4)
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT,
+     +'Prosite library file',
+     +IHELPS,IHELPE,HELPF,DEVNOS(3))
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(4),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Index file',
+     +IHELPS,IHELPE,HELPF,DEVNOS(3))
+      IF(IOK.NE.0) STOP
+      IPAT = 0
+5     CONTINUE
+      READ(IDEV1,1000,ERR=200,END=100)AMOS
+1000  FORMAT(A)
+      IF(AMOS(1:2).EQ.'ID') THEN
+        IDLINE = AMOS
+        GO TO 5
+      END IF
+      IF(AMOS(1:2).EQ.'AC') THEN
+        IPAT = IPAT + 1
+        NAMLEN = INDEX(AMOS(1:),';') - 1
+        PATNAM = AMOS(6:NAMLEN)//'.DAT'
+1005    FORMAT(' ',A)
+      CALL OPENRS(IDEV2,PATNAM,IOK,LRECL,1)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM, FILE OPENING'
+1009    FORMAT(' ',A)
+        WRITE(IDEV2,1009,ERR=200)IDLINE
+        WRITE(IDEV2,1009,ERR=200)AMOS
+      END IF
+6       CONTINUE
+        READ(IDEV1,1000,ERR=200,END=100)AMOS
+        WRITE(IDEV2,1009,ERR=200)AMOS
+        IF(AMOS(1:2).EQ.'DE') TITLE = AMOS(6:)
+        IF(AMOS(1:2).EQ.'DO') THEN
+          TITLE(70:80) = PATNAM(3:7)//','//AMOS(10:14)
+          WRITE(IDEV4,1009,ERR=200)TITLE
+        END IF
+        IF(AMOS(1:2).NE.'//') GO TO 6
+        GO TO 5
+100     CONTINUE
+      WRITE(KBOUT,*)IPAT,' files created. Normal termination'
+      STOP
+200   CONTINUE
+      WRITE(KBOUT,*)IPAT,' files created. Abnormal termination'
+      END
diff --git a/src/staden/splitp2.f b/src/staden/splitp2.f
new file mode 100644
index 0000000..46c0e0d
--- /dev/null
+++ b/src/staden/splitp2.f
@@ -0,0 +1,40 @@
+C  ROUTINE TO TRANSLATE PROSITE DATABASE
+      SUBROUTINE FMAIN()
+      PARAMETER (MAXSTR = 80, MAXDEV = 3)
+      INTEGER DEVNOS(MAXDEV)
+      CHARACTER AMOS*(MAXSTR)
+      CHARACTER*40 PATNAM,HELPF,FILNAM
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      WRITE(KBOUT,*)'SPLITP2 splits prosite.doc into one file per entry'
+      IDEV1 = DEVNOS(1)
+      IDEV2 = DEVNOS(2)
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT,
+     +'Prosite library file',
+     +IHELPS,IHELPE,HELPF,DEVNOS(3))
+      IF(IOK.NE.0) STOP
+      IPAT = 0
+5     CONTINUE
+      READ(IDEV1,1000,ERR=200,END=100)AMOS
+1000  FORMAT(A)
+      IF(AMOS(1:5).EQ.'{PDOC') THEN
+        IPAT = IPAT + 1
+        NAMLEN = INDEX(AMOS(1:),'}') - 1
+        PATNAM = AMOS(2:NAMLEN)//'.DOC'
+1005    FORMAT(' ',A)
+      CALL OPENRS(IDEV2,PATNAM,IOK,LRECL,1)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM, FILE OPENING'
+1009    FORMAT(' ',A)
+        WRITE(IDEV2,1009,ERR=200)AMOS
+      END IF
+6       CONTINUE
+        READ(IDEV1,1000,ERR=200,END=100)AMOS
+        WRITE(IDEV2,1009,ERR=200)AMOS
+        IF(AMOS(1:4).NE.'{END') GO TO 6
+        GO TO 5
+100     CONTINUE
+      WRITE(KBOUT,*)IPAT,' files created. Normal termination'
+      STOP
+200   CONTINUE
+      WRITE(KBOUT,*)IPAT,' files created. Abnormal termination'
+      END
diff --git a/src/staden/splitp3.f b/src/staden/splitp3.f
new file mode 100644
index 0000000..616428f
--- /dev/null
+++ b/src/staden/splitp3.f
@@ -0,0 +1,392 @@
+C  Routine to translate prosite motif library into patterns
+C  assumes only exact match, membership of set and NOT
+C  assumes no errors in library
+C  assumes gaps up to 40
+C  assumes no variation in length of motif: only variation in
+C  their separation ie [fred](2,3) is not dealt with (but why should it exist)
+C  but x(2,3) causes a new motif to be started starting 2 away with 1 extra
+C  position.
+C  the translation is crude: all go to membership of set
+C  nots go to membership of set for the rest of the character set
+C  fixed gaps are included in "weight matrices" and so will slow searches
+C  18-7-91 Added titles to pattern files
+C  3-2-92 Changed program so it writes pattern and weight files
+C         to current directory. All file names in the fofn and
+C         pattern files are full path names.
+      SUBROUTINE FMAIN()
+      INTEGER AP,RP,SCORE
+      PARAMETER (MAXSTR = 255,MAXLEN = 120, IDM = 26, MAXDEV = 5)
+      INTEGER WTSMAT(IDM,MAXLEN)
+      INTEGER DEVNOS(MAXDEV)
+      CHARACTER AMOS*(MAXSTR),RS*(MAXSTR),CACIDS*20,COMMAS*40
+      CHARACTER AAMOS*(MAXSTR)
+      CHARACTER*60 PATNAM,WTSNAM,HELPF,FILNAM
+      CHARACTER CSET*26,TITLE*80
+      LOGICAL ACID,NUMBER
+      EQUIVALENCE (AAMOS(6:),AMOS)
+      EXTERNAL ACID,NUMBER,INTFC
+      DATA COMMAS/',,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,'/
+      DATA CSET/'ABCDEFGHIJKLMNOPQRSTUVWXYZ'/
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      WRITE(KBOUT,*)
+     +' SPLITP3 v3.0 Feb 1992, author Rodger Staden'
+      WRITE(KBOUT,*)
+     +' Splits Prosite motif library into Staden pattern files'
+      WRITE(KBOUT,*)
+     +' and creates a file of file names. The pattern files and'
+      WRITE(KBOUT,*)
+     +' weight matrices are written to the current directory but'
+      WRITE(KBOUT,*)
+     +' full path names are included in the file of file names'
+      IDEV1 = DEVNOS(1)
+      IDEV2 = DEVNOS(2)
+      IDEV3 = DEVNOS(3)
+      IDEV4 = DEVNOS(4)
+      NVAR = 0
+      NMOTIF = 0
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT,
+     +'Prosite library file',
+     +IHELPS,IHELPE,HELPF,DEVNOS(5))
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(2),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Name for file of pattern file names',
+     +IHELPS,IHELPE,HELPF,DEVNOS(5))
+      IF(IOK.NE.0) STOP
+3     CONTINUE 
+      LIN = 0
+      CALL GTSTR('Path name of motif directory',' ',PATNAM,LIN,
+     +KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,DEVNOS(5),KBIN,KBOUT)
+        GO TO 3
+      END IF
+      IF(INFLAG.EQ.2) STOP
+      IDIR = MAX(1,INDEX(PATNAM,' '))
+C
+C add final slash if not present
+      IF(PATNAM(IDIR-1:IDIR-1).NE.'/') THEN
+        PATNAM(IDIR:IDIR) = '/'
+        IDIR = IDIR + 1
+      END IF
+5     CONTINUE
+      READ(IDEV1,1000,ERR=100,END=200)AAMOS
+1000  FORMAT(A)
+      IF(AAMOS(1:2).NE.'AC') GO TO 5
+        NAMLEN = INDEX(AMOS(1:),';') - 1
+        PATNAM(IDIR:) = AMOS(1:NAMLEN)
+        WTSNAM = PATNAM
+        PATNAM = PATNAM(1:NAMLEN+IDIR-1)//'.PAT'
+      CALL OPENRS(IDEV3,PATNAM(IDIR:),IOK,LRECL,1)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM, FILE OPENING'
+6       CONTINUE
+        READ(IDEV1,1000,ERR=100,END=200)AAMOS
+        IF(AAMOS(1:2).NE.'DE') GO TO 6
+        TITLE = AAMOS(6:)
+        WRITE(IDEV3,1002)TITLE
+9       CONTINUE
+        READ(IDEV1,1000,ERR=100,END=200)AAMOS
+C Check for empty definition ie no pa line
+        IF(AAMOS(1:2).EQ.'//') THEN
+          WRITE(KBOUT,*)TITLE
+          WRITE(KBOUT,*)'No PA line!'
+          CLOSE(UNIT=IDEV3)
+          GO TO 5
+        END IF
+        IF(AAMOS(1:2).NE.'PA') GO TO 9
+      IF(INDEX(AMOS,'.').EQ.0) THEN
+        J = INDEX(AMOS,' ')
+        READ(IDEV1,1001,ERR=100,END=200)AMOS(J:)
+1001    FORMAT(5X,A)
+      END IF
+      LAMOS = INDEX(AMOS,'.')
+C      WRITE(KBOUT,*)AMOS(1:LAMOS)
+      NMOTIF = NMOTIF + 1
+      AP = 0
+      RP = 0
+      SCORE = 0
+      MOTIF = 1
+      WRITE(IDEV3,1002)'A6           '//
+     +PATNAM(IDIR+2:IDIR+6)//'          Class'
+1002  FORMAT(A)
+10    CONTINUE
+      AP = AP + 1
+      IF(AP.LT.LAMOS) THEN
+        IF(AMOS(AP:AP).EQ.'-') AP = AP + 1
+        IF(ACID(AMOS(AP:))) THEN
+          RP = RP + 1
+          RS(RP:RP) = AMOS(AP:AP)
+          LASTS = RP
+          RP = RP + 1
+          RS(RP:RP) = ','
+          SCORE = SCORE + 1
+        ELSE IF(AMOS(AP:AP).EQ.'[') THEN
+          AP = AP + 1
+          LSET = INDEX(AMOS(AP:),']') - 1
+          RP = RP + 1
+          RS(RP:RP+LSET-1) = AMOS(AP:AP+LSET-1)
+          LASTS = RP
+          RP = RP + LSET
+          AP = AP + LSET
+          RS(RP:RP) = ','
+          AP = AP + 1
+          SCORE = SCORE + 1
+          IF(AMOS(AP:AP).EQ.'(') THEN
+            AP = AP + 1
+            LSET = INDEX(AMOS(AP:),')') - 1
+            ICOMMA = INDEX(AMOS(AP:),',')
+            IF((ICOMMA.GT.0).AND.(ICOMMA.LT.LSET)) THEN
+              WRITE(KBOUT,*)'Variable gap in []'
+              NVAR = NVAR + 1
+              GO TO 5
+            END IF
+            NREP = INTFC(AMOS(AP:),LSET)
+            LREP = RP - LASTS
+            RP = RP + 1
+            CALL REPST(RS(LASTS:),RS(RP:),LREP,NREP)
+            RP = RP + (NREP - 1) * (LREP + 1) - 1
+            AP = AP + LSET
+            SCORE = SCORE + NREP - 1
+          END IF
+        ELSE IF(AMOS(AP:AP).EQ.'{') THEN
+          AP = AP + 1
+          LSET = INDEX(AMOS(AP:),'}') - 1
+          RP = RP + 1
+          LSETS = LSET
+          CALL CACID(AMOS(AP:),CACIDS,LSET)
+          RS(RP:RP+LSET-1) = CACIDS(1:LSET)
+          LASTS = RP
+          RP = RP + LSET
+          AP = AP + LSETS
+          RS(RP:RP) = ','
+          AP = AP + 1
+          SCORE = SCORE + 1
+          IF(AMOS(AP:AP).EQ.'(') THEN
+            AP = AP + 1
+            LSET = INDEX(AMOS(AP:),')') - 1
+            ICOMMA = INDEX(AMOS(AP:),',')
+            IF((ICOMMA.GT.0).AND.(ICOMMA.LT.LSET)) THEN
+              WRITE(*,*)'Variable gap in {}'
+              NVAR = NVAR + 1
+              GO TO 5
+            END IF
+            NREP = INTFC(AMOS(AP:),LSET)
+            LREP = RP - LASTS
+            RP = RP + 1
+            CALL REPST(RS(LASTS:),RS(RP:),LREP,NREP)
+            RP = RP + (NREP - 1) * (LREP + 1) - 1
+            AP = AP + LSET
+            SCORE = SCORE + NREP - 1
+          END IF
+        ELSE IF(AMOS(AP:AP).EQ.'x') THEN
+          AP = AP + 1
+          IF(AMOS(AP:AP).EQ.'(') THEN
+            AP = AP + 1
+            LSET = INDEX(AMOS(AP:),')') - 1
+            ICOMMA = INDEX(AMOS(AP:),',')
+            IF((ICOMMA.GT.0).AND.(ICOMMA.LT.LSET)) THEN
+              IF(RS(RP:RP).EQ.',') RP = RP - 1
+      WTSNAM = WTSNAM(1:NAMLEN+IDIR-1)//'.WTS'//CSET(MOTIF:MOTIF)
+      WRITE(IDEV3,1002)WTSNAM(1:NAMLEN+IDIR+4)
+      CALL INTRP7(RS,RP,SCORE,WTSMAT,IDM,MAXLEN,
+     +CUTOFF,IOK)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM'
+      CALL OPENRS(IDEV4,WTSNAM(IDIR:),IOK,LRECL,1)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM, FILE OPENING'
+      TOP = SCORE
+      CALL WRTSCP(TITLE,SCORE,0,CUTOFF,TOP,IDM,WTSMAT,IDEV4)
+              NCOMMA = INTFC(AMOS(AP:),ICOMMA-1)
+              JCOMMA = INTFC(AMOS(AP+ICOMMA:),LSET-ICOMMA)
+      WRITE(IDEV3,1002)'A6           '//
+     +PATNAM(IDIR+2:IDIR+6)//'          Class'
+              WRITE(IDEV3,1008)MOTIF
+1008          FORMAT(I7,'      Relative motif')
+              WRITE(IDEV3,1003)SCORE+NCOMMA+1
+1003              FORMAT(I7,'      Relative start position')
+              WRITE(IDEV3,1004)JCOMMA-NCOMMA
+1004          FORMAT(I7,'      Number of extra positions')
+              AP = AP + LSET
+              RP = 0
+              SCORE = 0
+              MOTIF = MOTIF + 1
+            ELSE
+              NCOMMA = INTFC(AMOS(AP:),LSET)
+              RP = RP + 1
+              RS(RP:RP+NCOMMA-1) = COMMAS(1:NCOMMA)
+              RP = RP + NCOMMA - 1
+              AP = AP + LSET
+              SCORE = SCORE + NCOMMA
+            END IF
+          ELSE
+            RP = RP + 1
+            RS(RP:RP+1) = ','
+            SCORE = SCORE + 1
+          END IF
+        END IF
+        GO TO 10
+      END IF
+      RP = RP - 1
+      IF(RS(RP:RP).EQ.',') RP = RP - 1
+      WTSNAM = WTSNAM(1:NAMLEN+IDIR-1)//'.WTS'//CSET(MOTIF:MOTIF)
+      WRITE(IDEV3,1002)WTSNAM(1:NAMLEN+IDIR+4)
+      CALL INTRP7(RS,RP,SCORE,WTSMAT,IDM,MAXLEN,
+     +CUTOFF,IOK)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM'
+      CALL OPENRS(IDEV4,WTSNAM(IDIR:),IOK,LRECL,1)
+      IF(IOK.NE.0) WRITE(*,*)'SCREAM, FILE OPENING'
+      TOP = SCORE
+      CALL WRTSCP(TITLE,SCORE,0,CUTOFF,TOP,IDM,WTSMAT,IDEV4)
+      WRITE(IDEV2,1002)PATNAM
+      GO TO 5
+100   CONTINUE
+      WRITE(*,*)'READ ERROR'
+      STOP
+200   CONTINUE
+      WRITE(KBOUT,*)'Number of patterns',NMOTIF
+      WRITE(KBOUT,*)'Number of variable gaps',NVAR
+      END
+      SUBROUTINE REPST(S1,S2,L,N)
+      CHARACTER S1*(*),S2*(*)
+      J = 1
+      DO 10 I = 1,N-1
+        S2(J:J+L-1) = S1(1:L)
+        S2(J+L:J+L) = ','
+        J = J + L + 1
+10    CONTINUE
+      END
+      INTEGER FUNCTION INTFC(STRING,LS)
+      CHARACTER TEMP*10,STRING*(*)
+      TEMP = STRING(1:LS)
+      CALL RJST(TEMP)
+      READ(TEMP,1000,ERR=10)INTFC
+1000  FORMAT(I10)
+      RETURN
+10    CONTINUE
+      INTFC = 0
+      WRITE(*,*)'Error in encode'
+      END
+      SUBROUTINE CACID(ACID,REST,LACID)
+      CHARACTER ACID*(*),REST*20,ACIDS*20
+      SAVE ACIDS
+      DATA ACIDS/'QWERTYIPASDFGHKLCVNM'/
+      J = 0
+      DO 10 I = 1,20
+        DO 5 K = 1,LACID
+          IF(ACID(K:K).EQ.ACIDS(I:I)) GO TO 9
+5       CONTINUE
+        J = J + 1
+        REST(J:J) = ACIDS(I:I)
+9       CONTINUE
+10    CONTINUE
+      LACID = J
+      END
+      LOGICAL FUNCTION ACID(CHAR)
+      LOGICAL ONEOF
+      CHARACTER ACIDS*(20),CHAR
+      SAVE ACIDS
+      EXTERNAL ONEOF
+      DATA ACIDS/'QWERTYIPASDFGHKLCVNM'/
+      ACID = ONEOF(ACIDS,CHAR)
+      END
+      LOGICAL FUNCTION NUMBER(CHAR)
+      LOGICAL ONEOF
+      CHARACTER DIGITS*(10),CHAR
+      SAVE DIGITS
+      EXTERNAL ONEOF
+      DATA DIGITS/'1234567890'/
+      NUMBER = ONEOF(DIGITS,CHAR)
+      END
+      LOGICAL FUNCTION ONEOF(CHARS,CHAR)
+      CHARACTER CHARS*(*),CHAR
+      ONEOF = .FALSE.
+      IF(INDEX(CHARS,CHAR).NE.0) ONEOF = .TRUE.
+      END
+      SUBROUTINE INTRP7(STRING,ISEND,LENGTH,WT,MAXCHR,MAXLEN,
+     +CUTOFF,IOK)
+      CHARACTER STRING*(*),TERM
+      INTEGER WT(MAXCHR,MAXLEN)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      PARAMETER (TERM = ',')
+      IOK = 0
+C  POINT TO STRING
+      IS = 1
+      ICOL = 1
+C  COUNT FILLED COLUMNS
+      CUTOFF = 0.
+10    CONTINUE
+      CALL FILLI(WT(1,ICOL),MAXCHR,0)
+      CUTOFF = CUTOFF + 1.
+20    CONTINUE
+      IF(IS.LE.ISEND)THEN
+        IF(STRING(IS:IS).NE.TERM)THEN
+          IROW = CTONUM(STRING(IS:IS))
+          WT(IROW,ICOL) = 1
+          IS = IS + 1
+          GO TO 20
+        END IF
+        NC = 1
+30      CONTINUE
+        IF(IS.LE.ISEND)THEN
+          IF(STRING(IS:IS).EQ.TERM)THEN
+            ICOL = ICOL + 1
+            CALL FILLI(WT(1,ICOL),MAXCHR,0)
+            NC = NC + 1
+            IS = IS + 1
+            GO TO 30
+          END IF
+C
+C  END OF TERMINATORS
+C
+C          IF(ICOL.GT.1) GO TO 10
+C  ERROR TERMINATOR BEFORE ANY GOOD COLUMNS
+C          IOK = 1
+C          RETURN
+          GO TO 10
+        END IF
+C  STRING ENDED WITH TERMINATOR
+        LENGTH = ICOL - NC + 1
+        RETURN
+      END IF
+C
+C  END REACHED WITH NO TERMINATOR (NORMAL)
+      IF(ICOL.GT.0)THEN
+        LENGTH = ICOL
+        IOK = 0
+        RETURN
+      END IF
+      IOK = 1
+      END
+      SUBROUTINE WRTSCP(TITLE,LENGTH,MIDDLE,BOT,TOP,IDM,
+     +SUM,IDEV)
+      INTEGER TOT(120)
+      INTEGER SUM(IDM,LENGTH)
+      CHARACTER CHRSET*22,TITLE*(*)
+      SAVE CHRSET
+      DATA CHRSET/'CSTPAGNDEQBZHRKMILVFYW'/
+C   PROTEIN MATRICES DONT WRITE ROWS FOR -X? AND SPACE SO SET DIMENSION
+C   TO IDM-4
+      CALL FILLI(TOT,120,0)
+      WRITE(IDEV,1018)TITLE
+1018  FORMAT(' ',A)
+1019  FORMAT(' P',20I4)
+1020  FORMAT(' N',20I4)
+1021  FORMAT(' ',A,20I4)
+1022  FORMAT(' ',2I6,2F10.3)
+      WRITE(IDEV,1022)LENGTH,MIDDLE,BOT,TOP
+      NLINES=1+(LENGTH-1)/20
+      K1=1
+      DO 400 J=1,NLINES
+        K2=MIN((K1+19),LENGTH)
+        WRITE(IDEV,1019)(K,K=K1-MIDDLE,K2-MIDDLE)
+        WRITE(IDEV,1020)(TOT(K),K=K1,K2)
+        DO 390 I=1,IDM-4
+          WRITE(IDEV,1021)CHRSET(I:I),(SUM(I,K),K=K1,K2)
+390     CONTINUE
+        K1=K1+20
+        IF(K1.GT.LENGTH)K1=LENGTH
+400   CONTINUE
+      CLOSE(UNIT=IDEV)
+      END
diff --git a/src/staden/subs89.f b/src/staden/subs89.f
new file mode 100644
index 0000000..393f055
--- /dev/null
+++ b/src/staden/subs89.f
@@ -0,0 +1,1659 @@
+C     general subroutines
+C     author rodger staden
+C     14-2-90 changed titout to use gtstr (was using getstr)
+C             and returned inflag to wrtact
+C     16-3-90 replaced read in bpause by call to sin
+C     18-4-90 added nmmtch for comparing strings (used by pir reading
+C             routines to compare entry names)
+C     4-7-90  added routines shofu, showfi,errom,busy
+C     4-7-90  removed all routines dealing with sequence libraries
+C             and put them into seqlibsubs.for
+C     4-7-90  added radion, checkb
+C     6-7-90  removed all routines related to userinterface
+C    23-7-90  moved sethlp, help, help2 to userface
+C             put hqn back for sap
+C    10-12-90 Modified 2 lines in nmmtch to check for empty strings
+C    24-4-91 Added new routines
+C    18-11-91 moved openf1 from userface. deleted in userface and dialogues
+C             also changed it to allow file deletion using delf
+C    2-3-92 Altered ttext
+C 10-4-92 made enco more efficient
+      INTEGER FUNCTION HQN(STRING)
+      CHARACTER STRING*(*),EM,QM
+      PARAMETER (EM='!',QM='?')
+      EXTERNAL NOTLR
+C  AUTHOR: RODGER STADEN
+C  RETURNS:
+C          1 FOR HELP
+C          2 FOR QUIT
+C          3 FOR ALL BLANKS
+C          0 OTHERWISE
+      IF(STRING(1:1).EQ.QM) THEN
+        HQN = 1
+        RETURN
+      END IF
+      IF(STRING(1:1).EQ.EM) THEN
+        HQN = 2
+        RETURN
+      END IF
+      I = LEN(STRING)
+      IN = NOTLR(STRING,I,' ')
+      IF(IN.EQ.0) THEN
+        HQN = 3
+        RETURN
+      END IF
+      HQN = 0
+      END
+      SUBROUTINE RJST(STRING)
+      CHARACTER STRING*(*)
+      EXTERNAL NOTIRL
+      L = LEN(STRING)
+      I = NOTIRL(STRING,L,' ')
+      IF(I.NE.L) THEN
+        J = L
+        DO 10 K = I,1,-1
+          STRING(J:J) = STRING(K:K)
+          J = J - 1
+10      CONTINUE
+C       MOVED DISTANCE L - I CHARS, SO FILL WITH SPACES
+        I = L - I
+        STRING(1:I) = ' '
+      END IF
+      END
+      SUBROUTINE MARGC1(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,
+     +HELPS,HELPE,HELPF,IDEVH,KBIN,KBOUT)
+C   AUTHOR RODGER STADEN
+      INTEGER HELPS,HELPE
+      CHARACTER HELPF*(*)
+      WRITE(KBOUT,8006)ISYMAX,MARGB,MARGT
+8006  FORMAT(' y coordinates first:',/,
+     +' screen height=',I6,' start position=',I5,
+     +       ' height=',I5)
+      IBOT = MARGB
+      MNM = 1
+      MXM = ISYMAX
+      CALL GETINT(MNM,MXM,IBOT,'start height',
+     +IVAL,KBIN,KBOUT,HELPS,HELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MARGB = IVAL
+      ITOP = MIN(ISYMAX-MARGB,MARGT)
+      MNM = MIN(ISYMAX/100,ITOP)
+      MXM = ISYMAX-MARGB
+      CALL GETINT(MNM,MXM,ITOP,'height',
+     +IVAL,KBIN,KBOUT,HELPS,HELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MARGT = IVAL
+      WRITE(KBOUT,8009)ISXMAX,MARGL,MARGR
+8009  FORMAT(' x coordinates:',/,
+     +' screen width=',I6,' start position=',I5,
+     +       ' width=',I5)
+      IBOT = MARGL
+      MNM = 1
+      MXM = ISXMAX
+      CALL GETINT(MNM,MXM,IBOT,'start position',
+     +IVAL,KBIN,KBOUT,HELPS,HELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MARGL = IVAL
+      ITOP = MIN(ISXMAX-MARGL,MARGR)
+      MNM = MIN(ISXMAX/100,ITOP)
+      MXM = ISXMAX-MARGL
+      CALL GETINT(MNM,MXM,ITOP,'width',
+     +IVAL,KBIN,KBOUT,HELPS,HELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      MARGR = IVAL
+      END
+      INTEGER FUNCTION MC(SEQ1,SEQ2,LENGTH)
+      CHARACTER SEQ1(LENGTH),SEQ2(LENGTH)
+      EXTERNAL IUBM1
+      MC = 1
+      DO 10 I = 1,LENGTH
+        IF(IUBM1(SEQ2(I),SEQ1(I)).EQ.0) RETURN
+10    CONTINUE
+      MC = 0
+      END
+      INTEGER FUNCTION GENRCI(STRING,LENS,LENI,C,LC,NENTRY)
+      PARAMETER (LENM = 6)
+      CHARACTER STRING(LENS)
+      INTEGER REDS(17),REDE(17),RED(37),KS(LENM)
+      INTEGER LS(LENM),II(LENM),C(0:LC)
+      EXTERNAL NCODEA
+      SAVE
+C                 T,C,A,G,-       R   Y   W   S   M   K   H    B      V     D  
+C
+C              1,2,3,4,5,6,7,8,9,0,1,2,3,4,5,6,7,8,9,0,1,2,3,4,
+      DATA RED/1,2,3,4,1,2,3,4,3,4,1,2,1,3,2,4,2,3,1,4,1,2,3,1,
+C              5,6,7,8,9,0,1,2,3,4,5,6,7
+     +         2,4,2,3,4,1,3,4,1,2,3,4,5/
+C                       -  R Y   W  S  M  K  H  B  V  D  N  ?
+      DATA REDS/1,2,3,4,5, 9,11,13,15,17,19,21,24,27,30,33,37/
+      DATA REDE/1,2,3,4,8,10,12,14,16,18,20,23,26,29,32,36,37/
+      IENTRY = NENTRY
+      NENTRY = 1
+      IF(IENTRY.NE.0) GO TO 2
+      CALL FILLI(KS,LENI,16)
+      CALL CON17(STRING,KS,MIN(LENS,LENI))
+      NEXT = 1
+      II(1) = REDS(KS(1)) - 1
+1     CONTINUE
+      II(NEXT) = II(NEXT) + 1
+      LS(NEXT) = RED(II(NEXT))
+      IF(NEXT.LT.LENI) THEN
+        NEXT = NEXT + 1
+        II(NEXT) = REDS(KS(NEXT)) - 1
+        GO TO 1
+      END IF
+C      WRITE(*,*)LS
+      GENRCI = NCODEA(LS,LENI,C,5,LC)
+      RETURN
+2     CONTINUE
+      DO 3 I = LENI,1,-1
+        IF(II(I).LT.REDE(KS(I))) THEN
+          NEXT = I
+          GO TO 1
+        END IF
+3     CONTINUE
+      GENRCI = 0
+      END
+      SUBROUTINE SETCN(C,LENGTH,IDM,LC)
+      INTEGER C(0:LC)
+      C(0) = 0
+      K = IDM - 1
+      N = 0
+      DO 10 I = 1,LENGTH
+        M = K**(I-1)
+        C(0) = C(0) - C(N)
+        DO 5 J = 1,K
+          N = N + 1
+          C(N) = J*M
+5       CONTINUE
+10    CONTINUE
+      END
+      INTEGER FUNCTION NCODEA(SEQ,LENGTH,C,IDM,LC)
+      INTEGER SEQ(LENGTH),C(0:LC)
+      NCODEA = 0
+      N = C(0)
+      J = 0
+C  NEXT LINE NEW 4-7-89
+      K = IDM - 1
+      DO 10 I = 1,LENGTH
+        L = SEQ(I)
+        IF(L.EQ.IDM) RETURN
+        N = N + C(J+L)
+C  NEXT LINE NEW 4-7-89 (WAS J = J + 4)
+        J = J + K
+10    CONTINUE
+      NCODEA = N
+      END
+      SUBROUTINE ENCOND(SEQ,IDIM,WORDP,IDE,IDCHAR,CONSTS,LENGTH,
+     +LCONST)
+C  AUTHOR RODGER STADEN
+      INTEGER SEQ(IDIM),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER NCODEA
+      EXTERNAL NCODEA
+C  ENCODES A SEQUENCE OF LENGTH IDIM AND CHARACTERSET SIZE IDCHAR
+C  INTO TWO ARRAYS: WORDP(I) CONTAINS THE POSITION OF THE FIRST OCCURRENCE
+C  OF WORD(I), SEQ(I) CONTAINS A LINKED LIST OF SECOND, THIRD,... OCCURENCES
+C  OF WORD
+      CALL FILLI(WORDP,IDE,0)
+      DO 20 I = 1, IDIM-LENGTH+1
+        J = NCODEA(SEQ(I),LENGTH,CONSTS,IDCHAR,LCONST)
+        SEQ(I) = 0
+        IF(J.NE.0) THEN
+          J1 = WORDP(J)
+          IF(J1.EQ.0) THEN
+            WORDP(J) = I
+          ELSE
+10          CONTINUE
+              J2 = J1
+              J1 = SEQ(J2)
+              IF(J1.NE.0) GO TO 10
+              SEQ(J2) = I
+          END IF
+        END IF
+20    CONTINUE
+      END
+      SUBROUTINE CON17(S,N,L)
+      CHARACTER S(L)
+      INTEGER N(L),DTONUM
+      EXTERNAL DTONUM
+      DO 1 I = 1,L
+        N(I) = DTONUM(S(I))
+1     CONTINUE
+      END
+      SUBROUTINE SEQEDT(SEQ,MAXSEQ,ISEQ,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,FILNAM,FILEIN,IDM,IOK)
+      CHARACTER LINE*133,FILNAM*(*),FILEIN*(*),SEQ(MAXSEQ),HELPF*(*)
+      CHARACTER CHARG1*14,CHARG2*48
+      PARAMETER (CHARG1='TUCAG N-tucagn')
+      PARAMETER 
+     +(CHARG2='ABCDEFGHIKLMNPQRSTVWXYZabcdefghiklmnpqrstvwxyz- ')
+      PARAMETER (NGOOD1 = 14, NGOOD2 = 48)
+      EXTERNAL NOTRL
+      IOK = 1
+10    CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Name of file to edit.')
+      LIN = LEN(FILNAM)
+      LIN = NOTRL(FILNAM,LIN,' ')
+      CALL GTSTR('File name',FILNAM,FILEIN,LIN,KBOUT,KBIN,INFLAG)
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+      IF(LIN.EQ.0) FILEIN = FILNAM
+      CALL CEDIT(FILEIN)
+      CALL YESNO(IWANT,'Make edited sequence active',
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IOK = 1
+      IF(IWANT.NE.0) RETURN
+      CALL OPENRS(IDEV,FILEIN,IOK,LRECL,2)
+      IF(IOK.NE.0) RETURN
+      IF(IDM.EQ.5) THEN
+        CALL FILTF(SEQ,MAXSEQ,ISEQ,LINE,IDEV,KBOUT,CHARG1,NGOOD1,IOK)
+      ELSE IF(IDM.EQ.26) THEN
+        CALL FILTF(SEQ,MAXSEQ,ISEQ,LINE,IDEV,KBOUT,CHARG2,NGOOD2,IOK)
+      END IF
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE FILTF(SEQ,MAXSEQ,ISEQ,LINE,IDEV,KBOUT,CHARG,NGOOD,IOK)
+      CHARACTER SEQ(MAXSEQ)
+      CHARACTER LINE*(*),CHARG*(*)
+      EXTERNAL LOK
+      ISEQ = 0
+      LNO = 0
+      IOK = 1
+10    CONTINUE
+      LNO = LNO + 1
+      READ(IDEV,1000,ERR=100,END=200)LINE
+1000  FORMAT(A)
+      IOK = LOK(LINE,CHARG,NGOOD)
+      IF(IOK.EQ.0) THEN
+        DO 5 I = 1,LEN(LINE)
+          IF(LINE(I:I).NE.' ') THEN
+            ISEQ = ISEQ + 1
+            IF(ISEQ.GT.MAXSEQ) THEN
+              WRITE(KBOUT,1001)MAXSEQ
+1001          FORMAT(' Maximum sequence length',I6,' exceeded')
+              RETURN
+            END IF
+            SEQ(ISEQ) = LINE(I:I)
+          END IF
+5       CONTINUE
+      END IF
+      GO TO 10
+100   CONTINUE
+      WRITE(KBOUT,1002)LNO
+1002  FORMAT(' Error reading line',I6,' of file')
+      RETURN
+200   CONTINUE
+      WRITE(KBOUT,1003)ISEQ
+1003  FORMAT(' Sequence length is now',I6)
+      IOK = 0
+      END
+      INTEGER FUNCTION LOK(LINE,OKC,NOKC)
+      CHARACTER LINE*(*),OKC*(*)
+      INTEGER COK
+      EXTERNAL COK
+      LOK = 1
+      DO 10 I = 1,LEN(LINE)
+        IF(COK(LINE(I:I),OKC,NOKC).NE.0) RETURN
+10    CONTINUE
+      LOK = 0
+      END
+      INTEGER FUNCTION COK(CHAR,OKC,NOKC)
+      CHARACTER CHAR,OKC*(*)
+      COK = 0
+      DO 10 I = 1,NOKC
+        IF(CHAR.EQ.OKC(I:I)) RETURN
+10    CONTINUE
+      COK = 1
+      END
+      INTEGER FUNCTION NOTILR(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS LEFT TO RIGHT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ZERO IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT*(*),WORD
+      NOTILR = 1
+      DO 1 I=1,ITEXT
+        NOTILR = I
+        IF(TEXT(I:I).NE.WORD)RETURN
+1     CONTINUE
+      END
+      INTEGER FUNCTION NOTIRL(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS RIGHT TO LEFT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ZERO IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT*(*),WORD
+      NOTIRL = ITEXT
+      DO 1 I=ITEXT,1,-1
+        NOTIRL = I
+        IF(TEXT(I:I).NE.WORD)RETURN
+1     CONTINUE
+      END
+      INTEGER FUNCTION NOTRLA(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS RIGHT TO LEFT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ZERO IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT(ITEXT),WORD
+      DO 1 I=ITEXT,1,-1
+        NOTRLA = I
+        IF(TEXT(I).NE.WORD)RETURN
+1     CONTINUE
+      NOTRLA = 0
+      END
+      INTEGER FUNCTION NOTLRA(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS LEFT TO RIGHT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ITEXT + 1 IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT(ITEXT),WORD
+      DO 1 I=1,ITEXT
+        NOTLRA = I
+        IF(TEXT(I).NE.WORD)RETURN
+1     CONTINUE
+      NOTLRA = ITEXT + 1
+      END
+      SUBROUTINE WRTACT(IDEV,FILNAM,KBIN,KBOUT,
+     +SEQ,IDIM1,IHELPS,IHELPE,HELPF,IDEVH)
+      CHARACTER SEQ(IDIM1),FILNAM*(*),HELPF*(*)
+      CALL OPENF1(IDEV,FILNAM,1,IOK,KBIN,KBOUT,
+     +'File name for active sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.EQ.0)THEN
+        CALL TITOUT(IDEV,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.2)CALL FMTDK(IDEV,SEQ,IDIM1)
+        CLOSE(UNIT=IDEV)
+      END IF
+      END
+      SUBROUTINE GSPIN(MINSP,MAXSP,SPAN,MINIW,MAXIW,IWRIT,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      CHARACTER HELPF*(*)
+      INTEGER SPAN,VALUE
+      IOK = 1
+20    CONTINUE
+      CALL GETINT(MINSP,MAXSP,SPAN,'odd span length',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      IF(MOD(VALUE,2).EQ.0)GO TO 20
+      SPAN = VALUE
+      CALL GETINT(MINIW,MAXIW,IWRIT,'plot interval',
+     +VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0)RETURN
+      IWRIT = VALUE
+      END
+      SUBROUTINE GTSCR(STRING,IDIM,MATRIX,IDM,SMIN,SMAX)
+C   AUTHOR RODGER STADEN
+      INTEGER MATRIX(IDM,IDM),SMIN,SMAX
+      CHARACTER STRING(IDIM)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      SMIN = 0
+      SMAX = 0
+      DO 10 I = 1,IDIM
+        K1 = 99999
+        K2 = -99999
+        L = DTONUM(STRING(I))
+        DO 5 J = 1,IDM
+          M = MATRIX(L,J)
+          K1 = MIN(K1,M)
+          K2 = MAX(K2,M)
+5       CONTINUE
+        SMIN = SMIN + K1
+        SMAX = SMAX + K2
+10    CONTINUE
+      END
+      SUBROUTINE ADDR(FROM,TO,ID)
+C   AUTHOR: RODGER STADEN
+      REAL FROM(ID),TO(ID)
+      DO 10 I = 1,ID
+        TO(I) = TO(I) + FROM(I)
+10    CONTINUE
+      END
+      SUBROUTINE ADDI(FROM,TO,ID)
+C   AUTHOR: RODGER STADEN
+      INTEGER FROM(ID),TO(ID)
+      DO 10 I = 1,ID
+        TO(I) = TO(I) + FROM(I)
+10    CONTINUE
+      END
+      SUBROUTINE COPYR(FROM,TO,ID)
+C   AUTHOR: RODGER STADEN
+      REAL FROM(ID),TO(ID)
+      DO 10 I = 1,ID
+        TO(I) = FROM(I)
+10    CONTINUE
+      END
+      SUBROUTINE GTREG(KBIN,KBOUT,J1,J2,I1,I2,P,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      CHARACTER HELPF*(*),P*(*)
+C   AUTHOR: RODGER STADEN
+      IOK = 1
+      MININ = J1
+      MAXIN = J2
+      WRITE(KBOUT,1000)P
+1000  FORMAT(' ',A)
+      CALL GETINT(MININ,MAXIN,I1,'start',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      IF(IVAL.NE.I1) THEN
+        I2 = J2
+      END IF
+      I1 = IVAL
+      IF(I1.EQ.0) RETURN
+      MININ = I1 + 1
+      MAXIN = J2
+      CALL GETINT(MININ,MAXIN,I2,'end',IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) RETURN
+      I2 = IVAL
+      IOK = 0
+      END
+      SUBROUTINE CCASEA(STRING,L,IFLAG)
+      CHARACTER STRING(L),TUPPER,TLOWER
+      EXTERNAL TUPPER,TLOWER
+C   AUTHOR RODGER STADEN
+      IF(IFLAG.EQ.1)THEN
+        DO 10 I = 1,L
+          STRING(I) = TUPPER(STRING(I))
+10      CONTINUE
+      ELSE IF(IFLAG.EQ.2)THEN
+        DO 20 I = 1,L
+          STRING(I) = TLOWER(STRING(I))
+20      CONTINUE
+      END IF
+      END
+      INTEGER FUNCTION INDEXA(STRING,ID,CHAR)
+      CHARACTER STRING(ID),CHAR
+C  FUNCTION TO FIND FIRST OCCURRENCE OF CHAR IN STRING
+      DO 10 I = 1,ID
+        IF(STRING(I).EQ.CHAR)THEN
+          INDEXA = I
+          RETURN
+        END IF
+10    CONTINUE
+      INDEXA = 0
+      END
+      SUBROUTINE TITOUT(IDEV,KBIN,KBOUT,IHELPS,IHELPE,HELPF,
+     +IDEVH,INFLAG)
+C   AUTHOR: RODGER STADEN
+      CHARACTER LINE*60,HELPF*(*)
+10    CONTINUE
+      WRITE(KBOUT,1000)
+1000  FORMAT(' You may give your sequence a one line title.')
+      LIN = 0
+      CALL GTSTR('Title',' ',LINE,LIN,KBOUT,KBIN,INFLAG)
+      IF(LIN.LT.1) RETURN
+      IF(INFLAG.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(INFLAG.EQ.2) RETURN
+        WRITE(IDEV,1004,ERR=20)LINE(1:LIN)
+1004    FORMAT(';',A)
+20    CONTINUE
+      END     
+      SUBROUTINE TTEXT(IDEV,FILNAM,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*)
+      CALL OPENF1(IDEV,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Name of file to read',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL TTEXT1(IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+      END
+      SUBROUTINE TTEXT1(IDEV,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IQUIT)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (IPAGE=22)
+      CHARACTER LINE*80,HELPF*(*),SPACE
+      INTEGER NOTIRL
+      EXTERNAL NOTIRL
+      SAVE SPACE
+      DATA SPACE/' '/
+1002  FORMAT(' ',A)
+C   COUNT LINES OUTPUT
+      LINDON=0
+      IDONE=0
+10    CONTINUE
+        READ(IDEV,1003,ERR=110,END=30)LINE
+1003    FORMAT(A)
+        WRITE(KBOUT,1002)LINE(1:MAX(1,NOTIRL(LINE,79,SPACE)))
+        IDONE=IDONE+1
+        LINDON=LINDON+1
+        IF(IDONE.GE.IPAGE)THEN
+          CALL BPAUSE(KBIN,KBOUT,IQUIT)
+          IF(IQUIT.NE.0) GO TO 50
+          IDONE=0
+        END IF
+      GO TO 10
+30    CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' End of file')
+      CALL BPAUSE(KBIN,KBOUT,IQUIT)
+50    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+110   CONTINUE
+      WRITE(KBOUT,1010)
+1010  FORMAT(' Error reading file')
+      CLOSE(UNIT=IDEV)
+      END
+      SUBROUTINE CCASE(STRING,IFLAG)
+      CHARACTER STRING*(*),TUPPER,TLOWER
+      EXTERNAL TUPPER,TLOWER
+C   AUTHOR RODGER STADEN
+      L = LEN(STRING)
+      IF(IFLAG.EQ.1)THEN
+        DO 10 I = 1,L
+          STRING(I:I) = TUPPER(STRING(I:I))
+10      CONTINUE
+      ELSE IF(IFLAG.EQ.2)THEN
+        DO 20 I = 1,L
+          STRING(I:I) = TLOWER(STRING(I:I))
+20      CONTINUE
+      END IF
+      END
+      CHARACTER*1 FUNCTION TUPPER(CHAR1)
+C   AUTHOR RODGER STADEN
+      CHARACTER UP*26,LOW*26,CHAR1
+      SAVE LOW,UP
+      DATA UP/ 'ABCDEFGHIJKLMNOPQRSTUVWXYZ' /
+      DATA LOW/ 'abcdefghijklmnopqrstuvwxyz' /
+      I = INDEX(LOW,CHAR1)
+      TUPPER = CHAR1
+      IF (I.NE.0) TUPPER = UP(I:I)
+      END
+      CHARACTER*1 FUNCTION TLOWER(CHAR1)
+C   AUTHOR RODGER STADEN
+      CHARACTER UP*26,LOW*26,CHAR1
+      SAVE LOW,UP
+      DATA UP/ 'ABCDEFGHIJKLMNOPQRSTUVWXYZ' /
+      DATA LOW/ 'abcdefghijklmnopqrstuvwxyz' /
+      I = INDEX(UP,CHAR1)
+      TLOWER = CHAR1
+      IF (I.NE.0) TLOWER = LOW(I:I)
+      END
+      INTEGER FUNCTION NOTLR(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS LEFT TO RIGHT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ZERO IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT*(*),WORD
+      DO 1 I=1,ITEXT
+        IF(TEXT(I:I).NE.WORD)THEN
+          NOTLR = I
+          RETURN
+        END IF
+1     CONTINUE
+      NOTLR = 0
+      END
+      INTEGER FUNCTION NOTRL(TEXT,ITEXT,WORD)
+C   AUTHOR: RODGER STADEN
+C   LOOKS RIGHT TO LEFT THRU TEXT FOR FIRST ELEMENT THAT IS NOT WORD
+C   RETURNS ELEMENT NUMBER OR ZERO IF ALL ELEMENTS ARE WORD
+      CHARACTER TEXT*(*),WORD
+      DO 1 I=ITEXT,1,-1
+        IF(TEXT(I:I).NE.WORD)THEN
+          NOTRL=I
+          RETURN
+        END IF
+1     CONTINUE
+      NOTRL = 0 
+      END
+      CHARACTER*3 FUNCTION TRANF3(CODON,PAA,CODE)
+C   AUTHOR RODGER STADEN
+      CHARACTER CODON(3),PAA(5,5,5),THREEL*3,TRANF,THREE*3
+      INTEGER CODE
+      EXTERNAL THREEL,TRANF
+      THREE(1:3)='   '
+      IF(CODE.EQ.3)THEN
+        THREE(1:3)=THREEL(TRANF(CODON,PAA))
+      ELSE
+        THREE(2:2)=TRANF(CODON,PAA)
+      END IF
+      TRANF3=THREE
+      RETURN
+      END
+C   TRANB3
+      CHARACTER*3 FUNCTION TRANB3(CODON,PAA,CODE)
+C   AUTHOR RODGER STADEN
+      CHARACTER CODON(3),PAA(5,5,5),THREEL*3,TRANB,THREE*3
+      INTEGER CODE
+      EXTERNAL THREEL,TRANB
+      THREE='   '
+      IF(CODE.EQ.3)THEN
+        THREE(1:3)=THREEL(TRANB(CODON,PAA))
+      ELSE
+        THREE(2:2)=TRANB(CODON,PAA)
+      END IF
+      TRANB3=THREE(1:3)
+      RETURN
+      END
+C   TRANF
+      CHARACTER*1 FUNCTION TRANF(CODON,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER CODON(3),PAA(5,5,5)
+      INTEGER IC(3),CTONUM
+      EXTERNAL CTONUM
+C
+      DO 10 I=1,3
+      IC(I)=CTONUM(CODON(I))
+10    CONTINUE
+      TRANF=PAA(IC(3),IC(2),IC(1))
+      RETURN
+      END
+C   TRANB
+      CHARACTER*1 FUNCTION TRANB(CODON,PAA)
+C   AUTHOR: RODGER STADEN
+      CHARACTER*1 CODON(3),PAA(5,5,5)
+      INTEGER IC(3),CTONUM,ICOMP
+      EXTERNAL CTONUM,ICOMP
+C
+      DO 10 I=1,3
+      IC(I)=CTONUM(CODON(I))
+10    CONTINUE
+      TRANB=PAA(ICOMP(IC(1)),ICOMP(IC(2)),ICOMP(IC(3)))
+      RETURN
+      END
+C   THREEL
+      CHARACTER*3 FUNCTION THREEL(ONEL)
+      CHARACTER ONEL,ALLONE*21
+      CHARACTER*3 THREES(21)
+      SAVE ALLONE,THREES
+      DATA ALLONE/'ACDEFGHIKLMNPQRSTVWY*'/
+      DATA THREES/
+     +'Ala','Cys','Asp','Glu','Phe','Gly','His','Ile','Lys',
+     +'Leu','Met','Asn','Pro','Gln','Arg','Ser','Thr','Val',
+     +'Trp','Tyr','***'/
+      THREEL='   '
+      DO 10 I=1,21
+        IF(ONEL.EQ.ALLONE(I:I))THEN
+          THREEL=THREES(I)(1:3)
+          RETURN
+        END IF
+10    CONTINUE
+      RETURN
+      END
+      SUBROUTINE MWCALC(SEQ,IDIM,J1,J2,KBOUT,IDEV)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),CHRSET(26)
+      REAL AAWTS(26),AWTSUM(26),ASUM(26)
+      INTEGER CTONUM
+      EXTERNAL CTONUM 
+      SAVE CHRSET,AAWTS
+      DATA CHRSET/
+     +'C','S','T','P','A','G','N',
+     +'D','E','Q','B','Z','H','R',
+     +'K','M','I','L','V','F','Y',
+     +'W','-','X','?',' '/
+C VALUES CHANGED 26-2-91
+      DATA AAWTS/103.1388,87.0782,101.1051,97.1167,71.0788,57.0519,
+     +           114.1038,
+     +           115.0886,129.1155,128.1307,0.0,0.0,137.1411,156.1875,
+     +           128.1714,131.1926,113.1594,113.1594,99.1326,147.1766,
+     +           163.1760,
+     +           186.2132,0.0,0.0,0.0,0.0/
+C      DATA AAWTS/103.15,87.09,101.12,97.13,71.09,57.07,114.12,
+C     +           115.10,129.13,128.15,0.0,0.0,137.16,156.21,
+C     +           128.19,131.22,113.18,113.18,99.15,147.19,163.19,
+C     +           186.22,0.0,0.0,0.0,0.0/
+      WRITE(KBOUT,*)' Sequence composition'
+      REALN = J2 - J1 + 1
+      REALN = 100.0/REALN
+      SUMWT=0.
+C      SUMPOL=0.
+      DO 20 I=1,26
+         ASUM(I)=0.
+         AWTSUM(I)=0.
+20    CONTINUE
+      DO 30 I=J1,J2
+         IACID=CTONUM(SEQ(I))
+         ASUM(IACID)=ASUM(IACID)+1.
+         AWTSUM(IACID)=AWTSUM(IACID)+AAWTS(IACID)
+C         AWTSUM(IACID)=AWTSUM(IACID)+AAWTS(IACID)-0.015
+C         SUMPOL=SUMPOL+POLAR(IACID)
+C         SUMWT=SUMWT+AAWTS(IACID)-0.015
+         SUMWT=SUMWT+AAWTS(IACID)
+30    CONTINUE
+C   ADD ON 1 WATER MOLECULE
+      SUMWT=SUMWT+18.0152
+      WRITE(IDEV,1000)(CHRSET(K),K=1,13)
+      WRITE(IDEV,1001)(ASUM(K),K=1,13)
+      WRITE(IDEV,1004)(REALN*ASUM(K),K=1,13)
+      WRITE(IDEV,1002)(AWTSUM(K),K=1,13)
+      WRITE(IDEV,1000)(CHRSET(K),K=14,26)
+      WRITE(IDEV,1001)(ASUM(K),K=14,26)
+      WRITE(IDEV,1004)(REALN*ASUM(K),K=14,26)
+      WRITE(IDEV,1002)(AWTSUM(K),K=14,26)
+1000  FORMAT(/' A',13(3X,1A,2X))
+1001  FORMAT(' N',13(1X,F4.0,1X))
+1002  FORMAT(' W',13F6.0)
+1004  FORMAT(' %',13F6.1)
+      WRITE(IDEV,1003)SUMWT
+1003  FORMAT(' Total molecular weight=',F12.3)
+      RETURN
+      END
+C
+      SUBROUTINE BCOMP(SEQ,IDIM,J1,J2,KSTART,IDEV)
+C   AUTHOR RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      REAL TOT(5)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      WRITE(IDEV,*)' Sequence composition'
+      DO 10 I=1,5
+       TOT(I)=0.0
+10    CONTINUE
+      DO 20 I=J1-KSTART+1,J2-KSTART+1
+       J=CTONUM(SEQ(I))
+       TOT(J)=TOT(J)+1.
+20    CONTINUE
+      WRITE(IDEV,1000)
+1000  FORMAT(' ',10X,'T',10X,'C',10X,'A',10X,'G',10X,'-')
+      WRITE(IDEV,1001)TOT
+1001  FORMAT(' ',1X,5(F10.0,1X))
+      T=100.0/(J2-J1+1)
+      WRITE(IDEV,1002)(TOT(K)*T,K=1,5)
+1002  FORMAT(' ',2X,5(F10.1,'%'))
+      RETURN
+      END
+C   SUBROUTINE TO READ CHARACTER DATA FROM IDEV, REMOVE SPACES, FILL
+C   ARRAY AND RETURN NUMBER OF ELEMENTS USED. ANY LINES STARTING WITH
+C   A ; ARE TREATED AS COMMENTS AND WRITTEN TO DEVICE KBOUT
+      SUBROUTINE ARRFIL(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+C     BUB2AS
+C   SUBROUTINE TO SORT INTEGER ARRAY (LIST) INTO ASCENDING  ORDER
+      SUBROUTINE BUB2AS(LIST,LIST2,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LIST2(IDIM)
+      I=0
+      J=0
+10    CONTINUE
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+      I=I+1
+      IF(I.EQ.IDIM)RETURN
+20    CONTINUE
+      IF(LIST(I).LE.LIST(I+1))GO TO 10
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+      ITEMP=LIST2(I)
+      LIST2(I)=LIST2(I+1)
+      LIST2(I+1)=ITEMP
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+      GO TO 20
+      END
+      SUBROUTINE BUB3AS(LIST,LIST2,LIST3,IDIM)
+C   AUTHOR: RODGER STADEN
+      INTEGER LIST(IDIM),LIST2(IDIM),LIST3(IDIM)
+      I=0
+      J=0
+10    CONTINUE
+C   SET I=J IF WE HAVE JUST CORRECTLY POSITIONED AN ELEMENT
+      IF(J.GT.I)I=J
+      I=I+1
+      IF(I.EQ.IDIM)RETURN
+20    CONTINUE
+      IF(LIST(I).LE.LIST(I+1))GO TO 10
+C   FIRST MOVE THIS ELEMENT? IF SO SET POINTER TO ITS INITIAL POSITION
+      IF(J.LT.I)J=I
+      ITEMP=LIST(I)
+      LIST(I)=LIST(I+1)
+      LIST(I+1)=ITEMP
+      ITEMP=LIST2(I)
+      LIST2(I)=LIST2(I+1)
+      LIST2(I+1)=ITEMP
+      ITEMP=LIST3(I)
+      LIST3(I)=LIST3(I+1)
+      LIST3(I+1)=ITEMP
+C   DECREMENT BACK THRU LIST WITH THIS ELEMENT
+      IF(I.GT.1)I=I-1
+      GO TO 20
+      END
+C     CHARCT
+C   SUBROUTINE TO COUNT NUMBRE OF CHARS BETWSEN ENCLOSING /S
+      SUBROUTINE CHARCT(CHARS,IDIM,PCHAR,NCHAR)
+C   AUTHOR: RODGER STADEN
+      CHARACTER CHARS(IDIM),SLASH
+      INTEGER PCHAR
+      SAVE SLASH
+      DATA SLASH/'/'/
+      NCHAR=-1
+C
+10    CONTINUE
+      NCHAR=NCHAR+1
+      I=NCHAR+PCHAR
+      IF(I.EQ.IDIM)RETURN
+      IF(CHARS(I).EQ.SLASH)RETURN
+      GO TO 10
+      END
+C     encona
+C   routine to store positions of words in posns and first occurences
+C   in wordp and number of occurences in wordn
+C   each number is a value representing one of the le4 possible
+C   words of length length made up of 4 characters
+C   words in posns are numbers from 1 to 4**length
+      SUBROUTINE ENCONA(POSNS,IDIM,WORDP,WORDN,LE4,LENGTH)
+C   AUTHOR: RODGER STADEN
+      INTEGER WORDP(LE4),POSNS(IDIM)
+      INTEGER WORDN(LE4)
+C   number of words of length length
+      IDIM1 = IDIM - (LENGTH-1)
+      DO 10 I=1,LE4
+        WORDN(I) = 0
+10    CONTINUE
+C   loop for each word
+      DO 100 I=1,IDIM1
+        N = POSNS(I)
+        IF(N.EQ.0)GO TO 100
+C   is their already an entry for this word?
+        IF(WORDN(N).NE.0)GO TO 60
+C  first entry, put in wordp
+        WORDP(N)=I
+        GO TO 80
+60    CONTINUE
+C   need to chain along posn until find correct posn for i
+C   first posn is in wordp(n)
+      IP=WORDP(N)
+      DO 70 J=2,WORDN(N)
+70    IP=POSNS(IP)
+C   so put this i at posns(ip)
+      POSNS(IP)=I
+80    CONTINUE
+C   now increment number of occurences of n
+      WORDN(N)=WORDN(N)+1
+100   CONTINUE
+      RETURN
+      END
+C   routine to turn a sequence of 1,2,3,4,0 to numbers
+C   each number is a value representing one of the 4**length
+C   possible words of length length made up of 4 characters
+      SUBROUTINE ENCO(SEQ,IDIM,POSNS,CONST,LENGTH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM)
+      INTEGER POSNS(IDIM),CONST(LENGTH)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      ISTART = 1
+      DO 1 I=1,LENGTH
+        CONST(I) = 4**(I-1)
+        ISTART = ISTART - CONST(I)
+ 1    CONTINUE
+C   how many words of length length
+      IDIM1 = IDIM - (LENGTH-1)
+      DO 3 I=1,IDIM
+        POSNS(I) = 0
+ 3    CONTINUE
+C   loop for each word
+      DO 100 I=1,IDIM1
+        IP = I
+        N = ISTART
+        DO 30 J=1,LENGTH
+          IC = CTONUM(SEQ(IP))
+          IF (IC.EQ.5) GO TO 100
+          N = N + CONST(J) * IC
+          IP = IP + 1
+30      CONTINUE
+        POSNS(I) = N
+100   CONTINUE
+      END
+      INTEGER FUNCTION IFROMC(CHARS,LENGTH,KBOUT)
+C   AUTHOR: RODGER STADEN
+C   INTEGER FUNCTION TO CONVERT CHARACTER STRINGS OF
+C   NUMERALS TO BINARY FORM
+      CHARACTER NUMBER*10,CHARS(LENGTH)
+C   LENGTH OF STRING NUMBER
+      LENS=10
+      NUMBER=' '
+      CALL RJSTFY(CHARS,NUMBER,LENS,LENGTH)
+      READ(NUMBER,1002,ERR=100)LIST
+1002  FORMAT(I10)
+      IFROMC=LIST
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,1006)
+1006  FORMAT(' Error in internal read, value set to zero')
+      IFROMC=0
+      RETURN
+      END
+      SUBROUTINE RJSTFY(ARRAY,STRING,LENS,LENGTH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER STRING*(*),ARRAY(LENGTH)
+      STRING=' '
+C   LOOK FOR FIRST NON SPACE CHAR
+      K=LENGTH+1
+      DO 1 I=1,LENGTH
+      K=K-1
+1     IF(ARRAY(K).NE.' ')GO TO 2
+C   ALL SPACES!
+      RETURN
+2     CONTINUE
+      K1=K
+C  POINT TO RIGHT END OF STRING
+      K3=LENS+1
+      DO 3 I=1,K1
+      K3=K3-1
+      STRING(K3:K3)=ARRAY(K)
+3     K=K-1
+      RETURN
+      END
+      SUBROUTINE OPENF(IDEV,FILNAM,IWRITE,IOK,KBIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*)
+      IOK=1
+10    CONTINUE
+      READ(KBIN,1000,ERR=300,END=300)FILNAM
+1000  FORMAT(A)
+C   WANT TO WRITE?
+      IF(IWRITE.NE.0)GO TO 200
+C   READONLY
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+      RETURN
+C   WRITE
+200   CONTINUE
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,1)
+300   CONTINUE
+      RETURN
+      END
+C     SQCOM
+      SUBROUTINE SQCOM(SEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (MAXLST = 38)
+      CHARACTER SEQ(IDIM),LIST1(MAXLST),LIST2(MAXLST),TEMP
+      SAVE LIST1,LIST2
+      DATA LIST1/
+     +'C','T','A','G',
+     +'c','t','a','g',
+     +'D','V','B','H',
+     +'d','v','b','h',
+     +'K','L','M','N',
+     +'k','l','m','n',
+     +'R','Y','U',
+     +'r','y','u',
+     +'1','2','3','4',
+     +'5','6','7','8'/
+      DATA LIST2/
+     +'G','A','T','C',
+     +'g','a','t','c',
+     +'H','B','V','D',
+     +'h','b','v','d',
+     +'N','M','L','K',
+     +'n','m','l','k',
+     +'Y','R','A',
+     +'y','r','a',
+     +'4','3','2','1',
+     +'6','5','7','8'/
+      DO 100 I=1,IDIM
+        TEMP = SEQ(I)
+        DO 50 J=1,MAXLST
+          IF(TEMP.EQ.LIST1(J))THEN
+            SEQ(I)=LIST2(J)
+            GO TO 99
+          END IF
+50      CONTINUE
+99      CONTINUE
+100   CONTINUE
+      END
+C     SQCOPY
+C   SEQUENCE COPYING PROGRAM
+      SUBROUTINE SQCOPY(SEQNCE,COMSEQ,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM),COMSEQ(IDIM)
+      DO 100 I=1,IDIM
+        COMSEQ(I)=SEQNCE(I)
+100   CONTINUE
+      RETURN
+      END
+C   SQREV
+      SUBROUTINE SQREV(SEQNCE,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM),TEMP
+C   REVERSE THE SEQUENCE
+      IEND=IDIM/2
+      DO 100 I=1,IEND
+      TEMP=SEQNCE(I)
+        SEQNCE(I)=SEQNCE(IDIM+1-I)
+        SEQNCE(IDIM+1-I)=TEMP
+100   CONTINUE
+      RETURN
+      END
+C   DNARNA OR RNADNA
+      SUBROUTINE DNARNA(SEQ,IDSEQ)
+C   AUTHOR RODGER STADEN
+      CHARACTER SEQ(IDSEQ),TEMP,TCHAR,UCHAR,LTCHAR,LUCHAR
+      SAVE TCHAR,UCHAR,LTCHAR,LUCHAR
+      DATA TCHAR/'T'/,UCHAR/'U'/,LTCHAR/'t'/,LUCHAR/'u'/
+      DO 10 I=1,IDSEQ
+        TEMP = SEQ(I)
+        IF(TEMP.EQ.TCHAR)THEN
+          SEQ(I)=UCHAR
+        ELSE IF(TEMP.EQ.LTCHAR)THEN
+          SEQ(I)=LUCHAR
+        ELSE IF(TEMP.EQ.UCHAR)THEN
+          SEQ(I)=TCHAR
+        ELSE IF(TEMP.EQ.LUCHAR)THEN
+          SEQ(I)=LTCHAR
+        END IF
+10    CONTINUE
+      END
+      SUBROUTINE FMTDKN(IDEV,SEQNCE,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM)
+C   SET POINTERS TO FIRST AND LAST ELEMENTS ONE WRITE
+      JS=1
+      JE=60
+10    CONTINUE
+C   SET JE TO LAST ELEMENT IF NECESSARY
+      IF(JE.GT.IDIM)JE=IDIM
+      WRITE(IDEV,1002)(SEQNCE(I),I=JS,JE)
+1002  FORMAT(' ',60A1)
+C   TEST FOR END
+      IF(JE.NE.IDIM) THEN
+        JS=JE+1
+        JE=JE+60
+        GO TO 10
+      END IF
+      END
+      SUBROUTINE FMTDK(IDEV,SEQNCE,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQNCE(IDIM)
+C   SET POINTERS TO FIRST AND LAST ELEMENTS ONE WRITE
+      JS=1
+      JE=60
+10    CONTINUE
+C   SET JE TO LAST ELEMENT IF NECESSARY
+      IF(JE.GT.IDIM)JE=IDIM
+      WRITE(IDEV,1002)(SEQNCE(I),I=JS,JE)
+1002  FORMAT(' ',60A1)
+C   TEST FOR END
+      IF(JE.EQ.IDIM)GO TO 20
+C   INCREMENT FIRST AND LAST POINTERS
+      JS=JE+1
+      JE=JE+60
+      GO TO 10
+20    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+      END
+C
+C     LWRAP
+      INTEGER FUNCTION LWRAP(IDIM,I)
+C   AUTHOR: RODGER STADEN
+C   TEST FOR END OF ARRAY,IFSO WRAP AROUND
+      LWRAP=I
+      IF(LWRAP.GT.IDIM)LWRAP=LWRAP-IDIM
+      RETURN
+      END
+      SUBROUTINE FMT4LN(SEQ1,SEQ2,MATCH,IDIM,ISW,ISX,IDEV)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(IDIM)
+      INTEGER KL(6)
+      ISXX=ISX
+      ISWW=ISW
+      IE=0
+10    CONTINUE
+      IS=IE+1
+      IE=IE+60
+      IF(IE.GT.IDIM)IE=IDIM
+      N=IE-IS+1
+      N=1+(N-1)/10
+C   SET UP DECIMAL COUNTERS
+      DO 50 J=1,N
+        KL(J)=ISWW
+        ISWW=ISWW+10
+50    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+      WRITE(IDEV,1002)(SEQ1(K),K=IS,IE)
+      WRITE(IDEV,1002)(MATCH(K),K=IS,IE)
+      WRITE(IDEV,1002)(SEQ2(K),K=IS,IE)
+1002  FORMAT( 10X,6(10A1,1X))
+C   SET UP DECIMAL COUNTERS
+      DO 60 J=1,N
+        KL(J)=ISXX
+        ISXX=ISXX+10
+60    CONTINUE
+      WRITE(IDEV,1001)(KL(K),K=1,N)
+1001  FORMAT( 5X,6(I6,5X))
+      IF(IE.EQ.IDIM)RETURN
+      GO TO 10
+      END
+      SUBROUTINE SQMTCH(SEQ1,SEQ2,MATCH,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ1(IDIM),SEQ2(IDIM),MATCH(IDIM),BLANK,STAR
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      SAVE BLANK,STAR
+      DATA BLANK/' '/,STAR/'*'/
+      CALL FILLC(MATCH,IDIM,BLANK)
+C   LOOK FOR MATCHES
+      DO 20 I=1,IDIM
+        IF(CTONUM(SEQ1(I)).EQ.CTONUM(SEQ2(I)))MATCH(I)=STAR
+20    CONTINUE
+      RETURN
+      END
+      SUBROUTINE GETMRG(ISXMAX,ISYMAX,MARGL,MARGR,MARGB,MARGT,IMARG,
+     +IDEVM,FILNAM)
+C   AUTHOR: RODGER STADEN
+      INTEGER MARGB(IMARG),MARGT(IMARG),OPTION
+      CHARACTER FILNAM*(*)
+      DO 5 I=1,IMARG
+      MARGB(I)=0
+      MARGT(I)=0
+5     CONTINUE
+C      OPEN(UNIT=IDEVM,FILE=FILNAM,STATUS='OLD',READONLY,ERR=4)
+      CALL OPENRS(IDEVM,FILNAM,IOK,LRECL,2)
+      IF(IOK.NE.0)GO TO 4
+C  READ TITLE
+      READ(IDEVM,1002)
+1002  FORMAT()
+        READ(IDEVM,1000,ERR=3)ISXMAX,ISYMAX
+        READ(IDEVM,1000,ERR=3)MARGL,MARGR
+1000  FORMAT(2I6)
+1     CONTINUE
+      READ(IDEVM,1001,ERR=3,END=2)OPTION,M1,M2
+1001  FORMAT(3I6)
+      IF(OPTION.LE.IMARG)THEN
+        MARGB(OPTION)=M1
+        MARGT(OPTION)=M2
+      END IF
+      GO TO 1
+2     CONTINUE
+      CLOSE(UNIT=IDEVM)
+      RETURN
+3     CONTINUE
+      WRITE(*,*)' Error in margin file'
+      CLOSE(UNIT=IDEVM)
+      RETURN
+4     CONTINUE
+      WRITE(*,*)' Error opening margin file'
+      CLOSE(UNIT=IDEVM)
+      RETURN
+      END
+      SUBROUTINE MOVEC(SEQ,IDIMX,IDIM,IPOS,NCHAR1)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIMX)
+      INTEGER TO,FROM
+      NCHAR=ABS(NCHAR1)
+C   LEFT OR RIGHT?
+      IF(NCHAR1.LT.0)GO TO 20
+C   RIGHT
+      FROM=IDIM
+      TO=IDIM+NCHAR
+C   NUMBER TO MOVE?
+      NUM=IDIM-IPOS+1
+      DO 10 I=1,NUM
+      SEQ(TO)=SEQ(FROM)
+      TO=TO-1
+      FROM=FROM-1
+10    CONTINUE
+      RETURN
+20    CONTINUE
+C   LEFT
+      FROM=IPOS+NCHAR
+      TO=IPOS
+C   NUMBER TO MOVE?
+      NUM=IDIM-FROM+1
+      DO 30 I=1,NUM
+      SEQ(TO)=SEQ(FROM)
+      TO=TO+1
+      FROM=FROM+1
+30    CONTINUE
+      RETURN
+      END
+      CHARACTER*(*) FUNCTION ATOS(ARRAY,IDIM)
+C   AUTHOR RODGER STADEN
+      CHARACTER ARRAY(IDIM)
+C   FUNCTION TO CONVERT A CHARACTER ARRAY INTO A CHARACTER STRING
+      ATOS=' '
+      DO 10 I=1,IDIM
+        ATOS(I:I)=ARRAY(I)
+10    CONTINUE
+      RETURN
+      END
+      SUBROUTINE FILLC(SEQ,IDIM,CH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDIM),CH
+      DO 10 I=1,IDIM
+        SEQ(I) = CH
+10    CONTINUE
+      RETURN
+      END
+C   FILLI
+      SUBROUTINE FILLI(SEQ,IDIM,CH)
+C   AUTHOR: RODGER STADEN
+      INTEGER SEQ(IDIM),CH
+      DO 10 I=1,IDIM
+        SEQ(I) = CH
+10    CONTINUE
+      RETURN
+      END
+      SUBROUTINE FILLR(ARRAY,IDIM,CH)
+C   AUTHOR RODGER STADEN
+      REAL ARRAY(IDIM),CH
+C  ROUTINE TO FILL REAL ARRAY WITH VALUE CHAR
+      DO 10 I = 1 , IDIM
+        ARRAY(I) = CH
+10    CONTINUE
+      END
+      SUBROUTINE GETMAT(IDEV,FILNAM,MATRIX,IDM,CHRSET,KBOUT,IOK)
+C   AUTHOR RODGER STADEN
+      INTEGER MATRIX(IDM,IDM)
+      CHARACTER FILNAM*(*),CHRSET(IDM)
+      CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+      IF(IOK.NE.0)THEN
+        WRITE(KBOUT,*)'Error opening score matrix file'
+        RETURN
+      END IF
+      READ(IDEV,1000)
+      DO 6 I=1,IDM
+        READ(IDEV,1000,ERR=100)CHRSET(I),(MATRIX(I,K),K=1,IDM)
+1000    FORMAT(A1,26I3)
+6     CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,*)'Error reading score matrix file'
+      CLOSE(UNIT=IDEV)
+      IOK = 1
+      END
+C
+      SUBROUTINE REDCOD(SUM,IDEV)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4)
+      READ(IDEV,1000)
+1000  FORMAT( )
+      DO 10 I=1,4
+      DO 20 K=1,4
+20    READ(IDEV,1001,ERR=30,END=30)(SUM(I,J,K),J=1,4)
+10    READ(IDEV,1000,ERR=30,END=30)
+1001  FORMAT(5X,4(6X,F5.0))
+      RETURN
+30    CONTINUE
+      WRITE(*,*)' Error reading codon table file'
+      END
+C
+      SUBROUTINE WRTCOD(SUM,IDEV,PAA)
+C   AUTHOR: RODGER STADEN
+      REAL SUM(4,4,4)
+      CHARACTER BASE(4),PAA(5,5,5)
+      SAVE BASE
+      DATA BASE/'T','C','A','G'/
+C
+      WRITE(IDEV,1001)
+1001  FORMAT(6X,'===========================================')
+      DO 10 I=1,4
+      DO 20 K=1,4
+      WRITE(IDEV,1000)(PAA(K,J,I),
+     +BASE(I),BASE(J),BASE(K),SUM(I,J,K),J=1,4)
+20    CONTINUE
+10    WRITE(IDEV,1001)
+1000  FORMAT(5X,4(1X,A1,1X,3A1,F5.0))
+      END
+      INTEGER FUNCTION NORP(SEQ,IDSEQ)
+C   AUTHOR RODGER STADEN
+      CHARACTER SEQ(IDSEQ),CHAR1,DNAC*5
+      CHARACTER TUPPER
+      SAVE DNAC
+      EXTERNAL TUPPER
+      DATA DNAC/'ATCGU'/
+C   RETURNS NORP = 5 IF >85% A,C,G,T,U  ELSE NORP = 26
+      DNA = 0.
+      DO 10 I = 1,IDSEQ
+        CHAR1 = TUPPER(SEQ(I))
+        DO 5 J=1,5
+          IF(CHAR1.EQ.DNAC(J:J)) THEN
+            DNA = DNA + 1.
+            GO TO 10
+          END IF
+ 5        CONTINUE
+10    CONTINUE
+      X = DNA/REAL(IDSEQ)
+      N = 26
+      IF(X.GT.0.85) N = 5
+      NORP = N
+      END
+      SUBROUTINE COPYI(IN,OUT,ID)
+C   AUTHOR RODGER STADEN
+      INTEGER IN(ID),OUT(ID)
+C   COPY INTEGER ARRAY IN TO ARRAY OUT
+      DO 10 I = 1, ID
+        OUT(I) = IN(I)
+10    CONTINUE
+      END
+      SUBROUTINE CONNUM(SEQIN,SEQOUT,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQIN(IDIM)
+      INTEGER SEQOUT(IDIM)
+      INTEGER CTONUM
+      EXTERNAL CTONUM
+      DO 10 I=1,IDIM
+        SEQOUT(I)=CTONUM(SEQIN(I))
+10    CONTINUE
+      RETURN
+      END
+      SUBROUTINE CONNUN(SEQIN,SEQOUT,IDIM)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQIN(IDIM)
+      INTEGER SEQOUT(IDIM)
+      INTEGER DTONUM
+      EXTERNAL DTONUM
+      DO 10 I=1,IDIM
+        SEQOUT(I) = DTONUM(SEQIN(I))
+10    CONTINUE
+      END
+      INTEGER FUNCTION NMMTCH(S1,S2)
+C   AUTHOR RODGER STADEN
+C   RETURNS 0 = MATCH, 1 = NO MATCH
+C   Blank names do not match ie all spaces
+      CHARACTER S1*(*),S2*(*)
+      NMMTCH = 1
+      L1 = LEN(S1)
+      L2 = LEN(S2)
+      L1S = INDEX(S1,' ')
+      L2S = INDEX(S2,' ')
+      IF(L1S.GT.1) L1 = L1S - 1
+      IF(L2S.GT.1) L2 = L2S - 1
+      IF(L1.NE.L2) RETURN
+      CALL CCASE(S1,2)
+      CALL CCASE(S2,2)
+      IF(S1(1:L1).NE.S2(1:L1)) RETURN
+      NMMTCH = 0
+      END
+      INTEGER FUNCTION ITOSL(LINE,N)
+C Sent int N return its character string left justified
+C and the length of the string. Length 0 means error.
+      CHARACTER STRING*10,LINE*(*)
+      ITOSL = 0
+      WRITE(STRING,1000,ERR=100)N
+1000  FORMAT(I10)
+      CALL LJST(STRING)
+      I = INDEX(STRING,' ') - 1
+      LINE(1:I) = STRING(1:I)
+      ITOSL = I
+100   CONTINUE
+      END
+      SUBROUTINE LJST(STRING)
+C Left justify a string, fill rest with spaces
+      CHARACTER STRING*(*)
+      EXTERNAL NOTILR
+      L = LEN(STRING)
+      I = NOTILR(STRING,L,' ')
+      IF(I.NE.1) THEN
+        J = 1
+        DO 10 K = I,L
+          STRING(J:J) = STRING(K:K)
+          J = J + 1
+10      CONTINUE
+        STRING(J:L) = ' '
+      END IF
+      END
+      SUBROUTINE DIVR(S,N,D)
+C Divide array S of N elements by D
+      REAL S(N)
+      IF(D.NE.0.) THEN
+        DO 10 I=1,N
+          S(I) = S(I)/D
+10      CONTINUE
+      END IF
+      END
+      REAL FUNCTION SUMR(R,N)
+C Sum the N elements of array R
+      REAL R(N)
+      S = 0.
+      DO 10 I = 1,N
+        S = S + R(I)
+10    CONTINUE
+      SUMR = S
+      END
+      LOGICAL FUNCTION NUMBER(CHR)
+C Return true if CHR is a digit
+      CHARACTER CHR,NUMS*10
+      SAVE NUMS
+      DATA NUMS/'0123456789'/
+      NUMBER = .FALSE.
+      IF(INDEX(NUMS,CHR).NE.0) NUMBER = .TRUE.
+      END
+      INTEGER FUNCTION LASTN(LINEIN,K)
+C Return position of last digit (moving left to right) in LINEIN
+      CHARACTER LINEIN*(*)
+      LOGICAL NUMBER
+      EXTERNAL NUMBER
+      LASTN = 0
+      DO 10 I = K,LEN(LINEIN)
+C       CALL OUTPT(LINEIN)(I:I)
+        IF(.NOT.(NUMBER(LINEIN(I:I)))) RETURN
+        LASTN = I
+10    CONTINUE
+      END
+      INTEGER FUNCTION FIRSTN(LINEIN,K)
+C Return position of first digit (moving left to right) in LINEIN
+      CHARACTER LINEIN*(*)
+      LOGICAL NUMBER
+      EXTERNAL NUMBER
+      DO 10 I = K,LEN(LINEIN)
+C       CALL OUTPT(LINEIN)(I:I)
+        FIRSTN = I
+        IF(NUMBER(LINEIN(I:I))) RETURN
+10    CONTINUE
+      FIRSTN = 0
+      END
+      LOGICAL FUNCTION ONEOF(CHRS,CHR)
+C Return true if CHR is in set of chars stored in CHRS
+      CHARACTER CHR,CHRS*(*)
+      ONEOF = .FALSE.
+      IF(INDEX(CHRS,CHR).NE.0) ONEOF = .TRUE.
+      END
+      LOGICAL FUNCTION STRNGM(S1,S2)
+C Return true if the first L chars of S1 and S2 match
+C L is the length of the shortest string. Independent of case.
+      CHARACTER S1*(*),S2*(*),TUPPER
+      EXTERNAL TUPPER
+      L = MIN(LEN(S1),LEN(S2))
+      STRNGM = .FALSE.
+      DO 10 I = 1,L
+       IF(TUPPER(S1(I:I)).NE.TUPPER(S2(I:I))) RETURN
+10    CONTINUE
+      STRNGM = .TRUE.
+      END
+C   OPENF1
+C   IWRITE NE 0 MEANS WRITE NEW FILE, ELSE MEANS READONLY
+      SUBROUTINE OPENF1(IDEV,FILNAM,IWRITE,IOK,KBIN,KBOUT,
+     +PROMPT,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+C   AUTHOR: RODGER STADEN
+      CHARACTER FILNAM*(*),HELPF*(*),PROMPT*(*)
+      PARAMETER (MAXPRM = 16, MAXNAM = 60)
+      CHARACTER PERR(2)*(MAXPRM),NEWFN*(MAXNAM)
+      INTEGER ANS,DELF
+      EXTERNAL DELF,NOTRL
+      IOK=1
+      ICOUNT = 0
+C
+      LENGTH = NOTRL(FILNAM,LEN(FILNAM),' ')
+      IF(IWRITE.EQ.1) LENGTH = 0
+      LIN = LENGTH
+10    CONTINUE
+      IF(ICOUNT.EQ.5) THEN
+        IOK = 1
+        RETURN
+      END IF
+      ICOUNT = ICOUNT + 1
+      CALL GTSTR(PROMPT,FILNAM,NEWFN,LENGTH,KBOUT,KBIN,K)
+      IF(K.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        ICOUNT = 0
+        GO TO 10
+      END IF
+      IF(K.EQ.2) THEN
+        IOK = 2
+        RETURN
+      END IF
+C if no default and blank line try again
+      IF((K.EQ.3).AND.(LIN.EQ.0)) GO TO 10
+      IF(K.EQ.4) GO TO 10
+      IF(LENGTH.GT.0) FILNAM = NEWFN
+      IF(IWRITE.EQ.0) THEN
+C       OLD FILE (I.E. READ ONLY)
+        CALL OPENRS(IDEV,FILNAM,IOK,LRECL,2)
+        IF(IOK.EQ.0) RETURN
+C problem opening file
+        IF(IOK.EQ.3) CALL ERROM(KBOUT,'File not found')
+        CALL YESNO(ANS,'Retype file name',
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(ANS.EQ.0) GO TO 10
+        RETURN
+      ELSE IF(IWRITE.EQ.1) THEN
+C       NEW FILE (I.E. WRITE)
+ 20     CONTINUE
+        CALL OPENRS(IDEV,FILNAM,IOK,LRECL,1)
+        IF(IOK.EQ.0) RETURN
+C problem opening file
+        IF(IOK.EQ.2) THEN
+          CALL ERROM(KBOUT,'File already exists')
+          PERR(1) = 'Retype file name'
+          PERR(2) = 'Replace file'
+          IDO = 1
+          CALL RADION('Select action',PERR,2,IDO,
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(IDO.LT.1) RETURN
+          IF(IDO.EQ.1) GO TO 10
+          IF(IDO.EQ.2) THEN
+            IOK = DELF(FILNAM,IDEV,JRECL,2)
+            IF(IOK.EQ.0) GO TO 20
+            CALL ERROM(KBOUT,'File delete failed')
+            RETURN
+          END IF
+        ELSE
+          CALL ERROM(KBOUT,'File opening failed')
+          CALL YESNO(ANS,'Retype file name',
+     +    IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+          IF(ANS.EQ.0) GO TO 10
+          RETURN
+        END IF
+      END IF
+      END
+      INTEGER FUNCTION GNFFOF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      EXTERNAL NOTLR
+C
+C routine to read a file of file names and return a name
+C deals with leading spaces and trims names at first space
+C after name: eg '  fred is a bum' is returned as 'fred'
+C needed because file names can contain spaces (not our file names!)
+C and the open statement expects the names to match precisely
+C
+C return 0 = ok, 2 = empty line in file, 3 = error in read, 1 = end of file
+C
+      READ(IDEV,1000,ERR=100,END=200)NAME
+ 1000 FORMAT(A)
+C
+C get first non space position
+C
+      LENGTH = LEN(NAME)
+      I = NOTLR(NAME,LENGTH,' ')
+C empty line ?
+      IF(I.EQ.0) THEN
+        GNFFOF = 2
+        RETURN
+      END IF
+C now want first space after I
+      J = INDEX(NAME(I+1:),' ')
+      IF(J.EQ.0) THEN
+        J = LENGTH
+      ELSE
+        J = J + I - 1
+      END IF
+      CALL SHFTLS(NAME,I,1,J)
+      NAME(J-I+2:) = ' '
+      GNFFOF = 0
+      RETURN
+ 100  CONTINUE
+      GNFFOF = 3
+      RETURN
+ 200  CONTINUE
+      GNFFOF = 1
+      END
+      SUBROUTINE WRITFF(IDEV,SEQ,IDSEQ,ENAME,TITLE)
+C   AUTHOR: RODGER STADEN
+      CHARACTER SEQ(IDSEQ),NL,ENAME*(*),TITLE*(*)
+      NL = CHAR(10)
+      CALL WRITEB('>',1,IDEV)
+      CALL WRITEB(ENAME,LEN(ENAME),IDEV)
+      CALL WRITEB(' ',1,IDEV)
+      CALL WRITEB(TITLE,LEN(TITLE),IDEV)
+      CALL WRITEB(NL,1,IDEV)
+      JS=1
+      JE=60
+10    CONTINUE
+      IF (JE.GT.IDSEQ) JE = IDSEQ
+      DO 5 I=JS,JE
+        CALL WRITEB(SEQ(I),1,IDEV)
+ 5    CONTINUE
+      CALL WRITEB(NL,1,IDEV)
+      IF(JE.NE.IDSEQ) THEN
+        JS=JE+1
+        JE=JE+60
+        GO TO 10
+      END IF
+      END
+      SUBROUTINE SHFTLS(STRING,FROMS,TO,FROME)
+      CHARACTER STRING*(*)
+      INTEGER FROMS,TO,FROME
+C
+C shift a string left from froms to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J:J) = STRING(I:I)
+        J = J + 1
+ 10   CONTINUE
+      END
diff --git a/src/staden/summar.f b/src/staden/summar.f
new file mode 100644
index 0000000..94c78b0
--- /dev/null
+++ b/src/staden/summar.f
@@ -0,0 +1,132 @@
+      SUBROUTINE SUMMAR(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +SEQ1,IDIM1,GEL,LREG,RREG,IGELC,PERCD,IDBSIZ,CHARS,
+     +ID1,CHRSIZ,MAXGL2,IDEVW,MAXGEL,LINOU1,LINOU2,MXGOOD)
+C   AUTHOR: RODGER STADEN
+      INTEGER RELPG(IDBSIZ),CHRSIZ
+      INTEGER LNGTHG(IDBSIZ),LNBR(IDBSIZ),RNBR(IDBSIZ)
+      CHARACTER SEQ1(IDIM1)
+      CHARACTER GEL(MAXGEL)
+      INTEGER LREG,RREG,LSEQNO,POSN,Y,POSN1
+      INTEGER GELC
+      CHARACTER LINOU1(MAXGEL),LINOU2(MAXGEL),GTCONC
+      INTEGER CHARS(CHRSIZ,ID1,MAXGL2)
+      EXTERNAL INDEXS,LWRAPS,GTCONC
+C 28-7-91 added extra parameter: mxgood is the maximum reading
+C length for which we have confidence, so only the parts of
+C reads 1 to mxgood will be included in the calculation
+C   SET INITIAL VALUES
+C hard to understand this very old code! rewrite it.
+C we have a summing array of twice the length of the longest sequence
+C posn is posn in contig of next to write, lseqno is current posn in contig
+C we write when lseqno-posn ge the length of the longest seq
+      POSN=LREG
+      GELC=IGELC
+      LINLEN=MAXGEL
+      LSEQNO=RELPG(GELC)
+      IEND=0
+      DO 40 I=1,MAXGL2
+      DO 40 J=1,ID1
+      DO 40 K=1,CHRSIZ
+        CHARS(K,J,I)=0
+40    CONTINUE
+50    CONTINUE
+      ISS=1
+      IF(LNGTHG(GELC).LT.0)ISS=2
+      CALL READW(IDEVW,GELC,GEL,MAXGEL)
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read
+C
+      IF(ISS.EQ.1) THEN
+        N = MIN(MXGOOD,ABS(LNGTHG(GELC)))
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+        IS = LSEQNO-RELPG(GELC)+1
+      ELSE
+C     LOOP FOR RELEVANT ELEMENTS THIS GEL
+C only use mxgood characters from start of read (right end for these)
+C
+        N  = ABS(LNGTHG(GELC))
+        IS = MAX(1,(ABS(LNGTHG(GELC)) - MXGOOD + 1))
+        LSEQNO = RELPG(GELC) + MAX(0,ABS(LNGTHG(GELC)) - MXGOOD)
+        IF(LSEQNO.LT.LREG)LSEQNO=LREG
+      END IF
+      DO 70 I=IS,N
+        JJ = INDEXS(GEL(I),JSCORE)
+        JJJ = LWRAPS(LSEQNO,MAXGL2)
+        CHARS(JJ,ISS,JJJ) = CHARS(JJ,ISS,JJJ) + JSCORE
+        LSEQNO = LSEQNO + 1
+70    CONTINUE
+      IF(RNBR(GELC).EQ.0)GO TO 200
+      GELC=RNBR(GELC)
+      LSEQNO=RELPG(GELC)
+      IF(LSEQNO.GT.RREG)GO TO 200
+C     ENOUGH TO OUTPUT?
+      Y=LSEQNO-POSN
+      IF(Y.GE.MAXGEL)GO TO 210
+      GO TO 50
+200   CONTINUE
+C     SET FLAG TO SHOW END REACHED
+      IEND=1
+      LINLEN=MAXGEL
+      Y=RREG-POSN
+      IF(Y.LT.MAXGEL)LINLEN=Y+1
+210   CONTINUE
+C   SET POINTER TO SEQ1
+      POSN1=POSN-1
+C   PREPARE NEXT SECTION OF CHARS FOR OUTPUT
+      DO 230 I=1,LINLEN
+        JJJ = LWRAPS(POSN,MAXGL2)
+        LINOU1(I) = GTCONC(CHARS(1,1,JJJ),CHRSIZ,PERCD)
+        LINOU2(I) = GTCONC(CHARS(1,2,JJJ),CHRSIZ,PERCD)
+        DO 250 J=1,CHRSIZ
+          CHARS(J,1,JJJ)=0
+          CHARS(J,2,JJJ)=0
+250     CONTINUE
+        POSN=POSN+1
+230   CONTINUE
+C
+C   COMPARE STRANDS
+C
+      DO 500 I=1,LINLEN
+C        WRITE(*,*)I,LINOU1(I),LINOU2(I)
+        POSN1=POSN1+1
+        IF(LINOU1(I).EQ.LINOU2(I)) THEN
+          IF(LINOU1(I).EQ.'-') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF(LINOU1(I).EQ.'*') THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          SEQ1(POSN1) = '0'
+        ELSE
+          IF((LINOU1(I).EQ.'*').AND.(LINOU2(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU2(I).EQ.'*').AND.(LINOU1(I).EQ.'-')) THEN
+            SEQ1(POSN1) = '3'
+            GO TO 500
+          END IF
+          IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*')) THEN
+            SEQ1(POSN1) = '1'
+            IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) 
+     +      SEQ1(POSN1) = '4'
+              GO TO 500
+          END IF
+          IF((LINOU2(I).NE.'-').AND.(LINOU2(I).NE.'*')) THEN
+            SEQ1(POSN1) = '2'
+            IF((LINOU1(I).NE.'-').AND.(LINOU1(I).NE.'*'))
+     +      SEQ1(POSN1) = '4'
+            GO TO 500
+          END IF
+        END IF
+500   CONTINUE
+      IF(POSN.GT.RREG)RETURN
+      IF((IEND.EQ.1).AND.(POSN.LE.RREG))GO TO 200
+C   ANY MORE MAXGEL CHAR  LENGTHS TO OUTPUT
+      Y=LSEQNO-POSN
+      IF(Y.LT.MAXGEL)GO TO 50
+C   FINISHED COMPLETELY?
+      GO TO 210
+      END
diff --git a/src/staden/tagEditor.c b/src/staden/tagEditor.c
new file mode 100644
index 0000000..812d4e3
--- /dev/null
+++ b/src/staden/tagEditor.c
@@ -0,0 +1,312 @@
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/SmeBSB.h>
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/stat.h>
+
+#include "contigEditor.h"
+#include "tagUtils.h"
+#include "tagdb.h"
+
+#define TEXT_START ((XawTextPosition)0)
+#define TEXT_END   ((XawTextPosition)32767)
+
+/* ---- Private Variables ---- */
+static Widget tagEditorShellWid;
+static Widget tag_type_wid;
+static Widget tag_text_wid;
+static int source_changed;
+static int use_default;
+static int cur_tag_index;
+static int tagEditorIsUp; /* popup state of editor */
+static int tagEditorAbort; /* abort status when exiting */
+
+/* ---- Private Procedures ---- */
+
+static void quitCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{
+    XtPopdown(tagEditorShellWid);
+    tagEditorIsUp = 0;
+    tagEditorAbort = 1;
+
+}
+static void leaveCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{
+    XtPopdown(tagEditorShellWid);
+    tagEditorIsUp = 0;
+    tagEditorAbort = 0;
+}
+
+static void SourceChanged(Widget w, XtPointer junk, XtPointer garbage)
+{
+    XtRemoveCallback(w, XtNcallback, SourceChanged, NULL);
+    source_changed = TRUE;
+}
+
+void ResetSourceChanged(Widget widget)
+{
+    XtAddCallback(XawTextGetSource(widget), XtNcallback, SourceChanged, NULL);
+    source_changed = FALSE;
+}
+
+
+void setButtonName(Widget w, char *c)
+{
+    Arg args[10];
+    int nargs;
+    char buttonName[255];
+
+    /* set default tag type */
+    sprintf(buttonName,"Type: %s",c);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, buttonName); nargs++;
+    XtSetValues(w, args, nargs);
+}
+
+static void setDefaultText(char *s)
+{
+    XawTextBlock block;
+
+    block.firstPos = 0;
+    block.length = (s==NULL)?0:(int)strlen(s);
+    block.ptr = s;
+    block.format = FMT8BIT;
+
+    XawTextReplace(tag_text_wid,TEXT_START,TEXT_END,&block);
+    XawTextSetInsertionPoint(tag_text_wid,TEXT_END);
+
+}
+
+int idToIndex(char *id)
+{
+    int i;
+    if (id==NULL) return 0;
+    for (i=0; i<tag_db_count; i++) {
+	if (strncmp(id,tag_db[i].id,4)==0)
+	    return i;
+    }
+    return 0;
+}
+
+static void tagMenuCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{
+    tag_db_struct *t = (tag_db_struct *) client_data;
+
+    setButtonName(tag_type_wid, t->type);
+    cur_tag_index = idToIndex(t->id);
+
+    if (!source_changed && use_default) {
+	setDefaultText(t->default_text);
+	ResetSourceChanged(tag_text_wid);
+    }
+}
+
+static void readInTagDB()
+{
+#define TAGDB "TAGDB"
+    char *filename;
+    /* struct stat statBuff; */
+
+    /* check trace file exists */
+    if ((filename = (char *) getenv(TAGDB))!=NULL)
+	parse(filename);
+    else
+	parse(TAGDB);
+
+    /*
+    if ( stat(TAGDB,&statBuff) < 0 )
+	parse(TAGDB);
+    else {
+	filename = getenv(TAGDB);
+	parse(filename);
+    }
+    */
+}
+
+static Pixel ColourNameToPixel(Widget w, String c)
+{
+    XColor rgb_db_def, hardware_def;
+    Colormap cmap;
+    Status s;
+
+    cmap = DefaultColormap(XtDisplay(w),DefaultScreen(XtDisplay(w)));
+    s = XAllocNamedColor(XtDisplay(w), cmap, c, &rgb_db_def, &hardware_def);
+
+    return hardware_def.pixel;
+}
+
+
+static void setUpColourMap(Widget w)
+{
+    int i;
+
+    for (i=0;i<tag_db_count;i++) {
+	tag_db[i].fg_pixel =  (tag_db[i].fg_colour == NULL) ?
+	  1 : ColourNameToPixel(w,tag_db[i].fg_colour);
+        tag_db[i].bg_pixel =  (tag_db[i].bg_colour == NULL) ?
+	  0 : ColourNameToPixel(w,tag_db[i].bg_colour);
+    }
+
+    /* set up standard colours */
+    defColours.lightGrey = ColourNameToPixel(w,"lightGray");
+}
+
+
+
+static char* sourceToString()
+{
+    Widget textSrc = XawTextGetSource(tag_text_wid);
+    XawTextBlock block;
+    int len;
+    char *out;
+    int ici;
+
+    XawTextSetInsertionPoint(tag_text_wid,TEXT_END);
+    len =  (int) XawTextGetInsertionPoint(tag_text_wid);
+    if (!source_changed && !use_default) return NULL;
+    out = (char *) TAG_MALLOC(len+1);
+
+    ici=0;
+    block.format=FMT8BIT;
+    while (ici < len) {
+	/* read a chunk */
+	(void) XawTextSourceRead(textSrc,ici,&block,len-ici);
+	strncpy(&out[ici],block.ptr,block.length);
+	ici += block.length;
+    }
+    out[len]='\0';
+    return out;
+}
+
+
+
+
+
+/* ---- Exported Procedures ---- */
+
+
+void createTagTypeMenu(Widget parent, void (*call_back)() )
+{
+    /* create tag type menu */
+    Arg args[10];
+    Cardinal nargs;
+    Widget menuWid, menuItem;
+    int i;
+
+    nargs = 0;
+    menuWid = XtCreatePopupShell("tagMenu", simpleMenuWidgetClass,
+                                  parent,
+                                  args, nargs);
+    for (i=0;i<tag_db_count;i++) {
+	nargs = 0;
+	XtSetArg(args[nargs], XtNlabel, tag_db[i].type); nargs++;
+        
+	menuItem = XtCreateManagedWidget("tagMenuItem",  smeBSBObjectClass, menuWid, args, nargs);
+	XtAddCallback(menuItem, XtNcallback, call_back, (XtPointer) &tag_db[i]);
+    }
+
+}
+
+
+
+
+void createTagEditor(Widget parentWid)
+{
+    Arg args[10];
+    int nargs;
+
+    Widget mainFormWid;
+    Widget buttonBox,button;
+
+    tagEditorShellWid = XtCreatePopupShell ("Tag",
+	topLevelShellWidgetClass,
+	parentWid,
+	NULL, (Cardinal) 0);
+
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+	tagEditorShellWid,
+	NULL, (Cardinal) 0);
+
+    nargs = 0;
+    buttonBox = XtCreateManagedWidget("buttons", boxWidgetClass,
+	mainFormWid, args, nargs);
+
+    nargs = 0;
+    button = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+                                  buttonBox, args, nargs);
+    XtAddCallback(button, XtNcallback, quitCallback, NULL);
+
+    nargs = 0;
+    button = XtCreateManagedWidget("Leave",  commandWidgetClass,
+                                  buttonBox, args, nargs);
+    XtAddCallback(button, XtNcallback, leaveCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNmenuName, "tagMenu"); nargs++;
+    tag_type_wid = XtCreateManagedWidget("Type",  menuButtonWidgetClass,
+                                  buttonBox, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, buttonBox); nargs++;
+    XtSetArg(args[nargs], XtNeditType, XawtextEdit); nargs++;
+    XtSetArg(args[nargs], XtNscrollVertical, XawtextScrollAlways); nargs++;
+    XtSetArg(args[nargs], XtNwrap, XawtextWrapLine); nargs++;
+    tag_text_wid = XtCreateManagedWidget("tagText", asciiTextWidgetClass,
+	mainFormWid,
+	args, (Cardinal) nargs);
+
+    readInTagDB();
+    setUpColourMap(tagEditorShellWid /*any widget*/);
+
+    createTagTypeMenu(tagEditorShellWid,tagMenuCallback);
+}
+
+
+
+int invokeTagEditor(char *type_id, char *newType, char *tagComment, char **newComment)
+{
+    tagEditorIsUp = 1;
+    tagEditorAbort = 0;
+
+    /* set default tag type */
+    cur_tag_index = idToIndex(type_id);
+    setButtonName(tag_type_wid, tag_db[cur_tag_index].type);
+
+    /* set default comment */
+    if (tagComment==NULL || strlen(tagComment)==0) {
+	use_default = TRUE;
+	setDefaultText(tag_db[cur_tag_index].default_text);
+    } else {
+	use_default = FALSE;
+	setDefaultText(tagComment);
+    }
+
+    ResetSourceChanged(tag_text_wid);
+    XtPopup(tagEditorShellWid,   XtGrabExclusive);
+
+    while (tagEditorIsUp)
+    {
+        XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(tagEditorShellWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    if (! tagEditorAbort) {
+	*newComment = sourceToString();
+	strncpy(newType,tag_db[cur_tag_index].id,4);
+    }
+
+    return tagEditorAbort;
+
+}
diff --git a/src/staden/tagU1.c b/src/staden/tagU1.c
new file mode 100644
index 0000000..d362af5
--- /dev/null
+++ b/src/staden/tagU1.c
@@ -0,0 +1,1004 @@
+#include "contigEditor.h"
+#include "edUtils.h"
+#include "tagUtils.h"
+#include "Sheet.h"
+#include "tagdb.h"
+#include "fort.h"
+#include <stdio.h>
+static tagStruct *tagFreeList = NULL;
+/* tagUtils.c */
+/* define external routines */
+
+
+/* low level */
+
+static void replace_comment(comment_id cp, char *c)
+{
+    delete_comment(cp);
+    (void) put_comment(c);
+    /* a bit dodgy here. Should work ok with this data structure though */
+}
+
+void force_comment(tagStruct *t)
+/*
+** Force comment to be in memory
+*/
+{
+    if (!(t->flags & TAG_COMMENT_IN_MEMORY)) {
+	/*
+	** Read in from database
+	*/
+	if (t->tagrec.comment) {
+	    t->newcomment = get_comment(t->tagrec.comment);
+	    t->newcommentlen = (int)strlen(t->newcomment);
+	} else {
+	    t->newcomment = (char *) TAG_MALLOC(1);
+	    t->newcomment[0] = '\0';
+	    t->newcommentlen = 0;
+	}
+	t->flags |= TAG_COMMENT_IN_MEMORY;
+    }
+}
+
+
+/*
+** Tag internal memory management routines
+*/
+tagStruct *newTag()
+{
+    tagStruct *t;
+    if (tagFreeList == NULL) {
+        t = (tagStruct *) TAG_MALLOC(sizeof(tagStruct));
+    } else {
+	t = tagFreeList;
+	tagFreeList = t->next;
+    }
+
+    /*
+    ** Null all the fields
+    */
+    t->tagrec.position = 0;
+    t->tagrec.length = 0;
+    t->tagrec.comment = 0;
+    t->tagrec.type.i = 0x20202020;
+    t->tagrec.next = 0;
+    t->original_tag_id = 0;
+    t->newcomment = NULL;
+    t->newcommentlen = 0;
+    t->flags = TAG_UNCHANGED;
+    t->next = NULL;
+    t->time = 0;
+
+    return t;
+}
+
+static void freeTag(tagStruct* t)
+{
+    TAG_FREE(t->newcomment);
+    t->newcommentlen = 0;
+    t->next = tagFreeList;
+    tagFreeList = t;
+}
+
+tagStruct *readTagList(int seq)
+{
+    tagStruct *s,*t,*u;
+    s=t=newTag();
+    (void) read_tag(seq, &t->tagrec);
+    t->original_tag_id = seq;
+    while (t->tagrec.next) {
+        u=newTag();
+	read_tag(t->tagrec.next,&u->tagrec);
+	u->original_tag_id = t->tagrec.next;
+	t->next = u;
+	t=u;
+    }
+    t->next = NULL;
+
+    return s;
+}
+
+void destroyTagList(tagStruct *s)
+{
+    tagStruct *t,*u;
+
+    t=s;
+    while (t!=NULL) {
+        u=t->next;
+	freeTag(t);
+	t=u;
+    }
+}
+
+/*
+** Tag creation and modification
+*/
+void insertTag(EdStruct *xx, int seq, tagStruct *t)
+/*
+** insert tag, sorting by position
+*/
+{
+    tagStruct *u, *v;
+
+    u = (tagStruct *) DBgetTags(xx,seq);
+    v = NULL;
+    while (u != NULL &&
+	(u->tagrec.position <= t->tagrec.position || u->flags & TAG_DELETED) ) {
+	v = u;
+	u = u->next;
+    }
+    t->next = u;
+    if (v != NULL) {
+	v->next = t;
+    } else {
+	DBsetTags(xx,seq,t);
+    }
+
+}
+
+void createTag(EdStruct *xx)
+{
+    int seq,start,length;
+    char *newcomment;
+    char newtype[4];
+    int aborted;
+    tagStruct *t;
+
+    if (! getSelection(xx, &seq, &start, &length, &t)) {
+	/* default selection is current cursor position */
+	seq = xx->cursorSeq;
+	start = xx->cursorPos;
+	length = 1;
+    }
+    /* don't invoke Tag editor for consensus */
+    if (seq) {
+	aborted = invokeTagEditor("",newtype,"",&newcomment);
+	if (! aborted ) {
+	    /*
+	    ** Create a new tag and insert it with comment into bimbo falix
+	    */
+	    t = newTag();
+	    t->flags = TAG_INSERTED |
+		       TAG_LENGTH_CHANGED |
+		       TAG_POSITION_CHANGED |
+		       TAG_TYPE_CHANGED;
+	    t->tagrec.position = normalisePos(xx,seq,start,length);
+	    t->tagrec.length = length;
+	    strncpy(t->tagrec.type.c,newtype,4);
+	    if (newcomment != NULL)
+		t->flags |= TAG_COMMENT_CHANGED |
+			    TAG_COMMENT_IN_MEMORY;
+	    t->newcomment = newcomment;
+	    t->newcommentlen = (int)strlen(newcomment);
+	    insertTag(xx,seq,t);
+	    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	    xx->select_tag = t;
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+        }
+    }
+}
+
+void editTag(EdStruct *xx)
+{
+    int seq,start,length;
+    char *newcomment;
+    char *oldcomment;
+    tag_types newtype;
+    int aborted;
+    tagStruct *t;
+
+    if (! getSelection(xx, &seq, &start, &length, &t)) {
+	/* default selection is current cursor position */
+	seq = xx->cursorSeq;
+	start = xx->cursorPos;
+        t = findTag(xx,seq,start);
+        _select_tag(xx,seq,t);
+        (void) getSelection(xx, &seq, &start, &length, &t);
+    } else if (t==NULL) {
+        t = findTag(xx,seq,start);
+        _select_tag(xx,seq,t);
+        (void) getSelection(xx, &seq, &start, &length, &t);
+    }
+    if (t==NULL) return;
+
+    /*
+    ** Find current comment
+    */
+    /*
+    if (t->flags & TAG_COMMENT_IN_MEMORY) {
+	oldcomment = t->newcomment;
+    }
+    else {
+	.*
+	** Read in from database
+	*.
+	if (t->tagrec.comment) {
+	    oldcomment = get_comment(t->tagrec.comment);
+	    t->flags |= TAG_COMMENT_IN_MEMORY;
+	} else {
+	    oldcomment = NULL;
+	}
+    }
+    */
+    force_comment(t);
+    oldcomment = t->newcomment;
+
+    /*
+    ** Edit
+    */
+    aborted = invokeTagEditor(t->tagrec.type.c,newtype.c,oldcomment,&newcomment);
+    if (! aborted ) {
+	/*
+	** Update components of tag that have changed
+	*/
+	/*
+	** Don't update position!!!
+	t->tagrec.position = start;
+	t->tagrec.length = length;
+	*/
+	/*
+	** Check for new type
+	*/
+	if (t->tagrec.type.i != newtype.i) {
+	    t->tagrec.type.i = newtype.i;
+	    t->flags |= TAG_TYPE_CHANGED;
+	}
+	/*
+	** Check comment is something new
+	*/
+	if (newcomment != NULL) {
+	    t->flags |= TAG_COMMENT_CHANGED | TAG_COMMENT_IN_MEMORY;
+	    /*
+	    ** throw old comment away
+	    */
+	    TAG_FREE(oldcomment);
+	    t->newcomment = newcomment;
+	    t->newcommentlen = (int)strlen(newcomment);
+	}
+
+	redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+	DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+    }
+
+}
+
+void _delete_tag(EdStruct *xx, int seq, tagStruct *t)
+/*
+** Delete tag t from data structure
+*/
+{
+    /*
+    ** Physical removal by delinking
+    */
+    if (t->flags & TAG_INSERTED) {
+	tagStruct *u, *v;
+	u = (tagStruct *) DBgetTags(xx,seq);
+	v = NULL;
+	while (u != NULL && u != t) {
+	    v = u;
+	    u = u->next;
+	}
+	if (u==NULL) return; /* not found */
+	if (v==NULL)
+	    DBsetTags(xx,seq,u->next);
+	else
+	    v->next = u->next;
+	freeTag(u);
+	u->flags |= TAG_DELETED;
+    } else
+	t->flags |= TAG_DELETED;
+
+    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+
+}
+
+void deleteTag(EdStruct *xx)
+/*
+** A rather brutal delete
+*/
+{
+    int seq,start,length;
+    tagStruct *t;
+
+    if (! getSelection(xx, &seq, &start, &length, &t)) {
+	/* default selection is current cursor position */
+	seq = xx->cursorSeq;
+	start = xx->cursorPos;
+        t = findTag(xx,seq,start);
+        _select_tag(xx,seq,t);
+        (void) getSelection(xx, &seq, &start, &length, &t);
+    } else if (t==NULL) {
+        t = findTag(xx,seq,start);
+        _select_tag(xx,seq,t);
+        (void) getSelection(xx, &seq, &start, &length, &t);
+    }
+    if (t==NULL) return;
+       
+    _delete_tag(xx,seq,t);
+    redisplaySequences (xx,xx->namesWid, xx->sequencesWid, xx->displayPos, xx->displayWidth);
+
+}
+
+void getTagSplodge(EdStruct *xx, int seq, int pos, int width, XawSheetInk *ink)
+/*
+** get the hilighting of a sequence from its `pos' base for `width' bases
+** Bases number from 0?
+*/
+{
+
+    int i;
+    tagStruct *t;
+    int npos,tpos;
+
+
+    if (xx->reveal_cutoffs) {
+	int length = DBgetLength(xx,seq);
+
+        /*blank start*/
+        for (i=0; i<width && i<-pos; i++)
+	    ink[i].sh=sh_light;
+
+        /*copy sequence*/
+        for (; i<width && (pos+i)<length; i++)
+	    ink[i].sh=sh_default;
+
+        /*blank end*/
+        for (;i<width;i++)
+	    ink[i].sh=sh_light;
+
+
+    } else
+	for (i=0;i<width;i++)
+	    ink[i].sh=sh_default;
+
+    pos++;
+    npos = normalisePos(xx,seq,pos,width);
+    t = (tagStruct *) DBgetTags(xx,seq);
+    /* skip over raw data */
+    if (seq && t != NULL) t = t->next;
+
+    while (t != NULL && 
+	(t->tagrec.position < npos+width || t->flags & TAG_DELETED) ) {
+	if (!(t->flags & TAG_DELETED) &&
+	    t->tagrec.position+t->tagrec.length > npos &&
+	    t->tagrec.type.c[0] != '*') {
+	    int l,r;
+	    int db=idToIndex(t->tagrec.type.c);
+	    tpos = normalisePos(xx,seq,t->tagrec.position,t->tagrec.length);
+	    if (tpos < pos)
+		l=0;
+	    else
+		l=tpos-pos;
+	    if (tpos + t->tagrec.length > pos+width)
+		r=width;
+	    else
+		r=tpos-pos + t->tagrec.length;
+	    for (i=l;i<r;i++) {
+		if (tag_db[db].fg_colour!=NULL) {
+		    ink[i].sh|=sh_fg;
+		    ink[i].fg=tag_db[db].fg_pixel;
+		}
+		if (tag_db[db].bg_colour!=NULL) {
+		    ink[i].sh|=sh_bg;
+		    ink[i].bg=tag_db[db].bg_pixel;
+		}
+	    }
+        }
+	t = t->next;
+    }
+
+}
+
+static void tagEditDelete(EdStruct *xx, int seq, int pos, char deletedBase)
+{
+    tagStruct *t;
+
+    t = (tagStruct *) DBgetTags(xx,seq);
+    while (t != NULL &&
+    (t->tagrec.position <= pos || t->flags & TAG_DELETED) ) {
+	if( !(t->flags & TAG_DELETED) &&
+	    t->tagrec.position==pos &&
+	    strncmp(t->tagrec.type.c,TAG_TYPE_INSERT,3)==0){
+	    _delete_tag(xx, seq, t);
+	    return;
+	}
+	t = t->next;
+    }
+
+    /*
+    ** We didn't find a tag so create a delete one
+    */
+    t = newTag();
+    t->flags = TAG_INSERTED |
+	       TAG_LENGTH_CHANGED |
+	       TAG_POSITION_CHANGED |
+	       TAG_TYPE_CHANGED;
+    t->tagrec.position = pos;
+    t->tagrec.length = 0;
+    strncpy(t->tagrec.type.c,TAG_TYPE_DELETE,4);
+
+    /*
+    ** Stash the deleted base somewhere
+    */
+    t->tagrec.type.c[3] = deletedBase;
+
+    insertTag(xx,seq,t);
+
+}
+
+static void tagEditInsert(EdStruct *xx, int seq, int pos, char insertedBase)
+{
+    tagStruct *t;
+    /*
+    ** Create an new insert tag here
+    */
+    t = newTag();
+    t->flags = TAG_INSERTED |
+	       TAG_LENGTH_CHANGED |
+	       TAG_POSITION_CHANGED |
+	       TAG_TYPE_CHANGED;
+    t->tagrec.position = pos;
+    t->tagrec.length = 1;
+    strncpy(t->tagrec.type.c,TAG_TYPE_INSERT,3);
+
+    /*
+    ** Stash the deleted base somewhere
+    */
+    t->tagrec.type.c[3] = insertedBase;
+
+    insertTag(xx,seq,t);
+}
+
+
+char normaliseBase(EdStruct *xx,int seq,char deletedBase)
+{
+
+    if (DBgetComp(xx,seq) == COMPLEMENTED) {
+	char base = deletedBase;
+	int_f i=1;
+	sqcom_(&base,&i,(int_fl)1);
+	return base;
+    } else
+	return deletedBase;
+}
+
+
+void tagInsertBase(EdStruct *xx,int seq,int pos,char *insertedBase)
+/*
+** A character has been inserted at position `pos' in sequence `seq'
+** Adjust tag positions and lengths accordingly
+*/
+{
+    int npos = normalisePos(xx,seq,pos,1);
+    char base = normaliseBase(xx,seq,*insertedBase);
+
+    tagStruct *t,*del_tag;
+    int atpos = 0;
+    t = (tagStruct *) DBgetTags(xx,seq);
+    del_tag = NULL;
+
+    while (t != NULL) {
+	if ( !(t->flags & TAG_DELETED) &&
+	    t->tagrec.position == npos &&
+	    strncmp(t->tagrec.type.c,TAG_TYPE_DELETE,3)==0) {
+	    if (! atpos && t->tagrec.type.c[3] == base) del_tag = t;
+	    atpos++;
+	}
+	if ( !(t->flags & TAG_DELETED) &&
+	    t->tagrec.position >= npos ) {
+	    t->tagrec.position++;
+	    t->flags |= TAG_POSITION_CHANGED;
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	}
+	else if ( !(t->flags & TAG_DELETED) &&
+	    t->tagrec.position + t->tagrec.length > npos ) {
+	    t->tagrec.length++;
+	    t->flags |= TAG_LENGTH_CHANGED;
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	}
+	t = t->next;
+    }
+
+    if (del_tag == NULL)
+	tagEditInsert(xx, seq, npos,base);
+    else
+	_delete_tag(xx, seq, del_tag);
+}
+
+void tagDeleteBase(EdStruct *xx,int seq,int pos, char *deletedBase)
+/*
+** A character has been deleted from position `pos' in sequence `seq'
+** Adjust tag positions and lengths accordingly
+*/
+{
+    int npos = normalisePos(xx,seq,pos,0);
+    char base = normaliseBase(xx,seq,*deletedBase);
+    tagStruct *t;
+
+    tagEditDelete(xx, seq, npos, base);
+
+    t = (tagStruct *) DBgetTags(xx,seq);
+    while (t != NULL) {
+	if ( !(t->flags & TAG_DELETED) &&
+	    t->tagrec.position > npos) {
+	    t->tagrec.position--;
+	    t->flags |= TAG_POSITION_CHANGED;
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	}
+	else if ( !(t->flags & TAG_DELETED ) &&
+	    t->tagrec.position + t->tagrec.length > npos) {
+	    t->tagrec.length--;
+	    t->flags |= TAG_LENGTH_CHANGED;
+	    DBsetFlags(xx,seq,DBgetFlags(xx,seq)|DB_FLAG_TAG_MODIFIED);
+	}
+	t = t->next;
+    }
+
+}
+
+void tagReplaceBase(EdStruct *xx,int seq,int pos, char *deletedBase, char *insertedBase)
+/*
+** A character has been replaced in position `pos' in sequence `seq'
+** Adjest tags accordingly
+*/
+{
+    int tpos = normalisePos(xx,seq,pos,1);
+    char dbase = normaliseBase(xx,seq,*deletedBase);
+    char ibase = normaliseBase(xx,seq,*insertedBase);
+    tagEditDelete(xx, seq, tpos, dbase);
+    tagEditInsert(xx, seq, tpos, ibase);
+}
+
+
+tagStruct *findTag(EdStruct *xx,int seq,int pos)
+/*
+** Find the tag (if any) at position `pos' in sequence `seq'
+*/
+{
+    int npos = normalisePos(xx,seq,pos,1/*character*/);
+
+    tagStruct *t;
+    t = (tagStruct *) DBgetTags(xx,seq);
+    while (t != NULL) {
+	if (!(t->flags & TAG_DELETED) &&
+	   t->tagrec.position <= npos &&
+	   t->tagrec.position + t->tagrec.length > npos &&
+	   t->tagrec.type.c[0] != '*')
+	    return t;
+	t = t->next;
+    }
+    return NULL;
+}
+
+tagStruct *findAllTags(EdStruct *xx,int seq,int pos)
+/*
+** Find the tag (if any) at position `pos' in sequence `seq'
+*/
+{
+    static tagStruct *t;
+    static int npos;
+    if (xx==NULL) {
+	if (t != NULL) t = t->next;
+    } else {
+	npos = normalisePos(xx,seq,pos,1/*character*/);
+	t = (tagStruct *) DBgetTags(xx,seq);
+    }
+
+    while (t != NULL) {
+	if (!(t->flags & TAG_DELETED) &&
+	   t->tagrec.position <= npos &&
+	   t->tagrec.position + t->tagrec.length > npos)
+	    return t;
+	t = t->next;
+    }
+    return NULL;
+}
+
+void writeTagList(EdStruct *xx, int seq)
+{
+    tagRecord anon_tagrec;
+    tagStruct *first_tag,*this_tag;
+    tag_id last_tagrec_id;
+    int flag;
+
+    first_tag = (tagStruct *) DBgetTags(xx,seq);
+
+    /*
+    ** *INS - flag as deleted all *INS flags with length 0
+    */
+    this_tag = first_tag;
+    while (this_tag != NULL) {
+	if ( !(this_tag->flags & TAG_DELETED) &&
+	    this_tag->tagrec.length == 0 &&
+	    strncmp(this_tag->tagrec.type.c,TAG_TYPE_INSERT,3)==0)
+	    this_tag->flags |= TAG_DELETED;
+	this_tag = this_tag->next;
+    }
+
+    /*
+    ** Assumption: first tag in list is always a header and doesn't change
+    */
+    last_tagrec_id = DBgetNumber(xx,seq);
+    this_tag = first_tag->next;
+
+    /*
+    ** Assumption: cutoff information is stored in header's comment
+    */
+    if (first_tag->flags & TAG_COMMENT_CHANGED) {
+	if (first_tag->tagrec.comment)
+	    (void) replace_comment(first_tag->tagrec.comment,first_tag->newcomment);
+	else {
+	    first_tag->tagrec.comment = put_comment(first_tag->newcomment);
+	    (void) write_tag(last_tagrec_id,first_tag->tagrec);
+	}
+    }
+
+    while (this_tag != NULL) {
+	flag = this_tag->flags;
+
+	if (flag == TAG_UNCHANGED || flag == TAG_COMMENT_IN_MEMORY) {
+	    /*
+	    ** get next tag in list
+	    */
+	    last_tagrec_id = this_tag->original_tag_id;
+	    this_tag = this_tag->next;
+	}
+
+	else if (flag & TAG_INSERTED && !(flag & TAG_DELETED) ) {
+	    tag_id this_tagrec_id;
+	    /*
+	    ** create a new tag
+	    */
+	    this_tagrec_id = get_free_tag();
+	    /*
+	    ** update previous tag record
+	    */
+	    (void) read_tag(last_tagrec_id,&anon_tagrec);
+	    this_tag->tagrec.next = anon_tagrec.next;
+	    anon_tagrec.next = this_tagrec_id;
+	    (void) write_tag(last_tagrec_id,anon_tagrec);
+	    /*
+	    ** write new tag record
+	    */
+	    if (this_tag->newcomment!=NULL && this_tag->newcommentlen!=0)
+		this_tag->tagrec.comment = put_comment(this_tag->newcomment);
+	    else
+		this_tag->tagrec.comment = 0;
+	    (void) write_tag(this_tagrec_id,this_tag->tagrec);
+	    /*
+	    ** get next tag in list
+	    */
+	    last_tagrec_id = this_tagrec_id;
+	    this_tag = this_tag->next;
+	}
+
+	else if (!(flag & TAG_INSERTED) && flag & TAG_DELETED) {
+	    tag_id this_tagrec_id;
+	    this_tagrec_id = this_tag->original_tag_id;
+	    /*
+	    ** delink record from file
+	    */
+	    (void) read_tag(last_tagrec_id,&anon_tagrec);
+	    anon_tagrec.next = this_tag->tagrec.next;
+	    (void) write_tag(last_tagrec_id,anon_tagrec);
+	    /*
+	    ** destroy deleted tag
+	    */
+	    (void) delete_tag_rec(this_tagrec_id);
+	    /*
+	    ** get next tag in list
+	    */
+	    this_tag = this_tag->next;
+	}
+
+	else if (!(flag & TAG_DELETED) )
+	{
+	    tag_id this_tagrec_id;
+	    this_tagrec_id = this_tag->original_tag_id;
+	    /*
+	    ** Deal with comment first
+	    */
+	    if (flag & TAG_COMMENT_CHANGED) {
+		if (this_tag->tagrec.comment)
+		    (void) replace_comment(this_tag->tagrec.comment,this_tag->newcomment);
+		else
+		    this_tag->tagrec.comment = put_comment(this_tag->newcomment);
+	    }
+	    (void) write_tag(this_tagrec_id,this_tag->tagrec);
+	    /*
+	    ** get next tag in list
+	    */
+	    last_tagrec_id = this_tagrec_id;
+	    this_tag = this_tag->next;
+	}
+	else {
+	    /*
+	    ** get next tag in list
+	    */
+	    this_tag = this_tag->next;
+	}
+    }
+}
+
+
+
+int origpos(EdStruct *xx, int seq, int pos)
+/*
+** Determine position in original sequence corresponding to pos
+** taking into account all insertions and deletions
+*/
+{
+    int npos = pos;
+
+    tagStruct *t;
+    t = (tagStruct *) DBgetTags(xx,seq);
+    while (t != NULL &&
+	(t->tagrec.position <= pos || t->flags & TAG_DELETED)) {
+	if ( !(t->flags & TAG_DELETED ) &&
+	    t->tagrec.type.c[0] == '*') {
+	    if (strncmp(t->tagrec.type.c, TAG_TYPE_INSERT,3)==0)
+		/*insertion*/
+		npos--;
+	    else if (strncmp(t->tagrec.type.c, TAG_TYPE_DELETE,3)==0)
+		/*deletion*/
+		npos++;
+	}
+	t = t->next;
+    }
+
+    return npos;
+}
+
+
+
+tagStruct *findTagByType(EdStruct *xx, int seq, char *type)
+/*
+** Find the first tag by type
+*/
+{
+    tagStruct *t;
+    int looking,found;
+    
+    /* find tag */
+    t = (tagStruct *) DBgetTags(xx,seq);
+    looking = 1;
+    found = 0;
+    while (looking && !found) {
+	found = ( strncmp( t->tagrec.type.c, type, 4 ) == 0 );
+	if (! found) {
+	    t = t->next;
+	    looking = (t==NULL)?0:( !t->tagrec.position );
+	}
+    }
+    if (found)
+	return t;
+    else
+	return NULL;
+}
+
+
+void getLeftCutOff(EdStruct *xx,int seq, int width, char *str)
+{
+    if (xx->reveal_cutoffs && width >0 ) {
+	tagStruct *t;
+	int c;
+	c = (DBgetComp(xx,seq)==COMPLEMENTED);
+
+	/* find tag */
+	t = findTagByType(xx,seq,(c?TAG_TYPE_RCUT:TAG_TYPE_LCUT));
+
+	if (t != NULL) {
+	    /*
+	    ** Find current comment
+	    */
+	    force_comment(t);
+
+	    if (t->newcomment != NULL) {
+		int l = t->newcommentlen;
+		for (;l<width;width--)*str++=' ';
+		if (c)
+		    for (width--;width>=0;width--) {
+			int_f i=1;
+			*str  = t->newcomment[width];
+			sqcom_(str,&i,(int_fl)1);
+			str++;
+		    }
+		else
+		    strncpy(str,&t->newcomment[l-width],width);
+		return;
+	    }
+	}
+    }
+
+    for(;width>0;width--)*str++=' ';
+}
+
+void getLCut(EdStruct *xx,int seq, int pos, int width, char *str)
+/*
+**
+*/
+{
+    if (xx->reveal_cutoffs && width >0 ) {
+	tagStruct *t;
+	int c;
+	c = (DBgetComp(xx,seq)==COMPLEMENTED);
+
+	/* find tag */
+	t = findTagByType(xx,seq,(c?TAG_TYPE_RCUT:TAG_TYPE_LCUT));
+
+	if (t != NULL) {
+	    /*
+	    ** Find current comment
+	    */
+	    force_comment(t);
+
+	    if (t->newcomment != NULL) {
+		int l = t->newcommentlen;
+		for (;l<pos;pos--,width--)*str++=' ';
+		if (c)
+		    for (pos--,width--;width>=0;width--,pos--) {
+			int_f i=1;
+			*str  = t->newcomment[pos];
+			sqcom_(str,&i,(int_fl)1);
+			str++;
+		    }
+		else
+		    strncpy(str,&t->newcomment[l-pos],width);
+		return;
+	    }
+	}
+    }
+
+    for(;width>0;width--)*str++=' ';
+}
+
+
+void getRightCutOff(EdStruct *xx,int seq, int width, char *str)
+{
+    if (xx->reveal_cutoffs && width >0 ) {
+	tagStruct *t;
+	int c;
+	c = (DBgetComp(xx,seq)==COMPLEMENTED);
+
+	/* find tag */
+	t = findTagByType(xx,seq,(c?TAG_TYPE_LCUT:TAG_TYPE_RCUT));
+
+	if (t != NULL) {
+	    /*
+	    ** Find current comment
+	    */
+	    force_comment(t);
+
+	    if (t->newcomment != NULL) {
+		int l = t->newcommentlen;
+		for (;l<width;width--)str[width-1]=' ';
+		if (c) {
+		    char *p;
+		    for (p=&t->newcomment[l-1];width>0;width--,p--) {
+			int_f i=1;
+			*str  = *p;
+			sqcom_(str,&i,(int_fl)1);
+			str++;
+		    }
+		} else
+		    strncpy(str,t->newcomment,width);
+		return;
+	    }
+	}
+    }
+
+    for(;width>0;width--)*str++=' ';
+}
+
+
+void getRCut(EdStruct *xx,int seq, int pos, int width, char *str)
+{
+    if (xx->reveal_cutoffs && width >0 ) {
+	tagStruct *t;
+	int c;
+	c = (DBgetComp(xx,seq)==COMPLEMENTED);
+
+	/* find tag */
+	t = findTagByType(xx,seq,(c?TAG_TYPE_LCUT:TAG_TYPE_RCUT));
+
+	if (t != NULL) {
+	    /*
+	    ** Find current comment
+	    */
+	    force_comment(t);
+
+	    if (t->newcomment != NULL) {
+		int l = t->newcommentlen;
+		for (;l<pos+width;width--)str[width-1]=' ';
+		if (c) {
+		    char *p;
+		    for (p=&t->newcomment[l-pos-1];width>0;width--,p--) {
+			int_f i=1;
+			*str  = *p;
+			sqcom_(str,&i,(int_fl)1);
+			str++;
+		    }
+		} else
+		    strncpy(str,&t->newcomment[pos],width);
+		return;
+	    }
+	}
+    }
+
+    for(;width>0;width--)*str++=' ';
+}
+
+
+int lenRCut(EdStruct *xx, int seq)
+{
+    tagStruct *t;
+    int c;
+    c = (DBgetComp(xx,seq)==COMPLEMENTED);
+    
+    /* find tag */
+    t = findTagByType(xx,seq,(c?TAG_TYPE_LCUT:TAG_TYPE_RCUT));
+    
+    if (t != NULL) {
+	/*
+	** Find current comment
+        */
+	force_comment(t);
+	
+	if (t->newcomment != NULL)
+	    return t->newcommentlen;
+	else
+	    return 0;
+    } else
+	return 0;
+
+}
+
+
+int lenLCut(EdStruct *xx, int seq)
+{
+    tagStruct *t;
+    int c;
+    c = (DBgetComp(xx,seq)==COMPLEMENTED);
+    
+    /* find tag */
+    t = findTagByType(xx,seq,(c?TAG_TYPE_RCUT:TAG_TYPE_LCUT));
+    
+    if (t != NULL) {
+	/*
+	** Find current comment
+        */
+	force_comment(t);
+	
+	if (t->newcomment != NULL)
+	    return t->newcommentlen;
+	else
+	    return 0;
+    } else
+	return 0;
+
+}
+
+void dump_tags(EdStruct *xx, int seq)
+{
+    tagStruct *t = (tagStruct *) DBgetTags(xx,seq);
+
+    fprintf(stderr,"Tags for %s\n",DBgetName(xx,seq));
+    while (t != NULL) {
+	
+	fprintf(stderr," %5d %3d   %4s %5d %c%c%c%c%c%c%c\n",
+	    t->tagrec.position,
+	    t->tagrec.length,
+	    t->tagrec.type.c,
+	    t->tagrec.comment,
+	    (t->flags & TAG_POSITION_CHANGED) ?'P':'-',
+	    (t->flags & TAG_LENGTH_CHANGED)   ?'L':'-',
+	    (t->flags & TAG_TYPE_CHANGED)     ?'T':'-',
+	    (t->flags & TAG_COMMENT_CHANGED)  ?'C':'-',
+	    (t->flags & TAG_INSERTED)         ?'I':'-',
+	    (t->flags & TAG_DELETED)          ?'D':'-',
+	    (t->flags & TAG_COMMENT_IN_MEMORY)?'M':'-'
+	    );
+
+	t = t->next;
+    }
+}
diff --git a/src/staden/tagU2.c b/src/staden/tagU2.c
new file mode 100644
index 0000000..763fdfc
--- /dev/null
+++ b/src/staden/tagU2.c
@@ -0,0 +1,706 @@
+#include "contigEditor.h"
+#include <stdio.h>
+#include <stdlib.h>
+#include "tagUtils.h"
+#include "fort.h"
+
+#define COMMENT_HEAD_ID (1)
+#define TAG_HEAD_ID (devils_.idbsiz)
+
+int read_tag(tag_id n, tagRecord *t)
+{
+    readtg_(&devils_.idevt,&n,&t->position,&t->length,&t->comment,&t->type.i,&t->next);
+    return 0;
+}
+int write_tag(tag_id n, tagRecord t)
+{
+    writtg_(&devils_.idevt,&n,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    return 0;
+}
+int read_comment(comment_id n, int *cnt, commentStruct *c)
+{
+    readcc_(&devils_.idevc,&n,cnt,&c->next,&c->comment[0],40);
+    return 0;
+}
+int write_comment(comment_id n, int cnt, commentStruct c)
+{
+    writcc_(&devils_.idevc,&n,&cnt,&c.next,&c.comment[0],40);
+    return 0;
+}
+/* comment interface */
+tag_id get_free_tag()
+{
+    tagRecord head;
+    tagRecord freerec;
+    tag_id free_id;
+    (void) read_tag(TAG_HEAD_ID,&head);
+    if (head.next != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.next;
+	(void) read_tag(free_id,&freerec);
+	head.next = freerec.next;
+	(void) write_tag(TAG_HEAD_ID,head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++head.position;
+	(void) write_tag(TAG_HEAD_ID,head);
+	(void) write_tag(free_id,freerec);
+    }
+
+    return free_id;
+}
+
+void delete_comment (comment_id cp)
+{
+    int dummy;
+    commentStruct head;
+    commentStruct freerec;
+    comment_id free_id;
+    if (!cp) return;
+    free_id = cp;
+    (void) read_comment(free_id,&dummy,&freerec);
+    while (freerec.next) {
+	free_id = freerec.next;
+        (void) read_comment(free_id,&dummy,&freerec);
+    }
+    (void) read_comment(COMMENT_HEAD_ID,&dummy,&head);
+    freerec.next = head.next;
+    (void) write_comment(free_id,dummy,freerec);
+    head.next = cp;
+    (void) write_comment(COMMENT_HEAD_ID,dummy,head);
+}
+
+
+
+
+
+void blank_tag_rec(tag_id t)
+/*
+** Blank out fields in tag record t
+*/
+{
+
+    tagRecord r;
+
+    (void) read_tag(t, &r);
+
+    r.position = 0;
+    r.length = 0;
+    r.type.i = 0x20202020;
+    r.comment = 0;
+    r.next = 0;
+    
+    (void) write_tag(t, r);
+
+}
+
+
+
+
+void delete_tag_rec(tag_id t)
+/*
+** remove t from file, discarding comment if necessary
+*/
+{
+    tagRecord head;
+    tagRecord freerec;
+
+    (void) read_tag(t,&freerec);
+
+    if (freerec.comment)
+	delete_comment(freerec.comment);
+
+    (void) read_tag(TAG_HEAD_ID,&head);
+    freerec.next = head.next;
+    (void) write_tag(t,freerec);
+    head.next = t;
+    (void) write_tag(TAG_HEAD_ID,head);
+    
+}
+
+static comment_id get_free_comment()
+{
+    commentStruct head;
+    commentStruct freerec;
+    comment_id free_id;
+    int count;
+    (void) read_comment(COMMENT_HEAD_ID,&count,&head);
+    if (head.next != 0) {
+	/*
+	** if a free slot somewhere, use it
+	*/
+	free_id = head.next;
+	(void) read_comment(free_id,&count,&freerec);
+	head.next = freerec.next;
+	(void) write_comment(COMMENT_HEAD_ID,count,head);
+    } else {
+	/*
+	** extend comment list file
+	*/
+	free_id = ++count;
+	(void) write_comment(COMMENT_HEAD_ID,free_id,head);
+	(void) write_comment(free_id,0,freerec);
+    }
+
+    return free_id;
+}
+
+comment_id put_comment(char *c)
+{
+    commentStruct com;
+    comment_id cur,next,this_comment;
+    size_t clen = strlen(c);
+    size_t piece;
+
+    /* write out first block of COMMENT_LENGTH */
+    this_comment=cur=get_free_comment();
+    if (clen>COMMENT_LENGTH)
+	piece = COMMENT_LENGTH;
+    else
+	piece = clen;
+
+    {int i; for(i=0;i<COMMENT_LENGTH;i++)com.comment[i]=' ';}
+    strncpy(com.comment,c,piece);
+
+    c+= piece;
+    clen -= piece;
+    while (clen > 0) {
+	next = get_free_comment();
+	com.next = next;
+	write_comment(cur,0,com);
+	cur = next;
+	if (clen<COMMENT_LENGTH)
+	    piece = clen;
+
+	{int i; for(i=0;i<COMMENT_LENGTH;i++)com.comment[i]=' ';}
+	strncpy(com.comment,c,piece);
+
+	c+= piece;
+	clen -= piece;
+    }
+    com.next = 0;
+    if (piece!=COMMENT_LENGTH)
+	com.comment[piece]='\0';
+    write_comment(cur,0,com);
+
+    return this_comment;
+}
+
+static void add_RD_comment(tag_id t_id, char *comment)
+{
+    tagRecord t;
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    t.comment = put_comment(comment);
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+static void insert_CUT_tags(tag_id t_id, int pos, int length, char *type, char *comment)
+{
+    tagRecord t,new_t;
+    tag_id next_id,new_id;
+
+    new_t.length = 1;
+    strncpy(new_t.type.c,type,4);
+    new_t.comment = put_comment(comment);
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    next_id = t.next;
+    new_id = get_free_tag();
+    new_t.position = pos;
+    new_t.next = next_id;
+    writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+    next_id = new_id;
+    t.next=next_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+static void insert_INS_tags(tag_id t_id, int pos, int length)
+{
+    tagRecord t,new_t;
+    tag_id next_id,new_id;
+    int i;
+
+    new_t.length = 1;
+    strncpy(new_t.type.c,TAG_TYPE_INSERT,3);
+    new_t.type.c[3]='*';
+    new_t.comment = 0;
+
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+    next_id = t.next;
+    for (i=length-1;i>=0;i--) {
+	new_id = get_free_tag();
+	new_t.position = pos+i;
+	new_t.next = next_id;
+	writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+	next_id = new_id;
+    }
+    t.next=next_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+static void myFstr2Cstr(char *Fstr, int_fl Flen,
+	       char *Cstr, int_fl Clen)
+/*
+    Copy the significant characters of a blank padded Fortran string
+    to a '\0' terminated C string, ignoring excess characters.
+
+    This function works if the strings are distinct or coincident, but
+    not if they overlap in any other way.
+*/
+{   int_fl FsigLen, i;
+
+    /* Find the significant length of Fstr */
+    FsigLen=Flen;
+    while ((FsigLen > 0) && (Fstr[FsigLen-1] == ' '))
+    {   FsigLen--;
+    }
+
+    /* Copy up to (Clen-1) significant characters */
+    i=0;
+    while ((i < FsigLen) && (i < (Clen-1)))
+    {   Cstr[i] = Fstr[i];
+        i++;
+    }
+
+    Cstr[i] = '\0';
+}
+
+/* C version of ENTRD */
+void entrd_(int_f *IDEVG,int_f *IDEVT,int_f *IDEVC,char *NAMARC,int_f *NGEL,
+       int_f *IOK, int_fl l_NAMARC)
+{
+/*
+    char NAMFIL[FILE_NAME_LENGTH], MTYPE[4];
+    char namfil[FILE_NAME_LENGTH+1], mtype[5];
+*/
+    int LENR,LCUT,LENW;
+    int RCUT;
+
+#define fn_len 100
+    char fn[fn_len];
+#define l_line 100
+    char line[l_line];
+    char *result;
+    char *cutbuff,*cb,*l;
+
+    FILE *fp;
+
+    /* blank tag rec */
+    blank_tag_rec((tag_id) *NGEL);
+
+    myFstr2Cstr(NAMARC,l_NAMARC,fn,(int_fl)fn_len);
+
+    *IOK = 0;
+    /* don't need this anymore
+    saveState.idevt  = &devils_.idevt;
+    saveState.idevc  = &devils_.idevc;
+    saveState.idbsiz = &devils_.idbsiz;
+    */
+    /* Enters raw data info into database */
+    if ( (fp = fopen(fn,"r")) != NULL ) {
+	result = fgets(line,l_line,fp);
+	if (result != NULL && line[0] == ';') { 
+	    /* process header */
+	    sscanf(line,";%6d%6d%6d%*s\n",&LENR,&LCUT,&LENW);
+	    RCUT = LENR - LCUT - LENW;
+            add_RD_comment(*NGEL, &line[1]);
+	    result = fgets(line,l_line,fp);
+	}
+	if (result != NULL && strlen(line)>1 && strncmp(line,";<",2)==0) {
+	    /* process left cut off */
+	    /* starting with this line we need to read LCUT ditties */
+	    cutbuff = (char *) TAG_MALLOC(LCUT+1);
+	    cb = cutbuff;
+	    while (result!=NULL && strlen(line)>1 && strncmp(line,";<",2)==0){
+		for(l=&line[2];*l;l++)if(strchr(" \n\t",*l)==NULL)*cb++=*l;
+		result = fgets(line,l_line,fp);
+	    }
+	    *cb='\0';
+            insert_CUT_tags(*NGEL, 0, 0, TAG_TYPE_LCUT, cutbuff);
+	    TAG_FREE(cutbuff);
+	}
+
+	if (result != NULL && strlen(line)>1 && strncmp(line,";>",2)==0) {
+	    /* process right cut off */
+	    /* starting with this line we need to read RCUT ditties */
+	    cutbuff = (char *) TAG_MALLOC(RCUT+1);
+	    cb = cutbuff;
+	    while (result != NULL && strlen(line)>1 && strncmp(line,";>",2)==0) {
+		for(l=&line[2];*l;l++)if(strchr(" \n\t",*l)==NULL)*cb++=*l;
+		result = fgets(line,l_line,fp);
+	    }
+	    *cb='\0';
+            insert_CUT_tags(*NGEL, 0, 0, TAG_TYPE_RCUT, cutbuff);
+	    TAG_FREE(cutbuff);
+	}
+
+	fclose(fp);
+    }
+}
+
+int_f freecc_ (int_f *IDEVC)
+{
+    return get_free_comment();
+}
+
+void padtag_(int_f *llino_, int_f *k_, int_f *nc_, int_f *lngthg_)
+/*
+** Play around with tags when auto assembling
+*/
+{
+    int seq;
+    int pos;
+    int length;
+    int seq_length;
+    tagRecord t;
+    tag_id last,next;
+    int last_pos;
+    if (devils_.idevt>0) {
+	/* fudge */
+	/* don't need this anymore
+	saveState.idevt  = &devils_.idevt;
+	saveState.idbsiz = &devils_.idbsiz;
+	*/
+	seq = *llino_;
+	length = *nc_;
+	if (*lngthg_ < 0) {
+	    seq_length = - *lngthg_;
+	    pos = seq_length - *k_ - *nc_ + 2;
+	} else {
+	    seq_length = *lngthg_;
+	    pos = *k_;
+	}
+
+	next = seq;
+	last = 0;
+	last_pos = 0;
+	while (next) {
+	    readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    /*
+	    ** Move tags accordingly
+	    */
+	    if (t.position >= pos)
+	    {
+		t.position+=length;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+	    else if (t.position + t.length > pos)
+	    {
+		t.length+=length;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+
+	    /*
+	    ** Insert *INS tags
+	    */
+	    if (last_pos < pos && t.position >= pos)
+		/*
+		** Insert after last `length' tags
+		*/
+		insert_INS_tags(last,pos,length);
+	    else if (t.position < pos && t.next==0)
+		/*
+		** Insert after next `length' tags
+		*/
+		insert_INS_tags(next,pos,length);
+
+            last = next;
+	    last_pos = t.position;
+	    next = t.next;
+	}
+
+    }
+}
+
+void taggel_(int_f *ngels_, int_f *lngthg_, char *gel_, int_fl gel_length)
+{
+    int_f length = abs(*lngthg_);
+    int_f one=1;
+    int_f i,j;
+    
+    if (*lngthg_ > 0) {
+	for (i=0,j=1; i<length; i++,j++)
+	    if (gel_[i]=='*')
+		padtag_(ngels_,&j,&one,lngthg_);
+    } else {
+	for (i=length-1,j=length; i>=0; i--,j--)
+	    if (gel_[i]=='*')
+		padtag_(ngels_,&j,&one,lngthg_);
+    }
+
+}
+
+void movtag_ (int_f *from, int_f *to )
+/*
+** Move tag information of gel ``from'' to gel ``to'',
+** and perform garbage collection on old gel ``to''
+*/
+{
+
+    tagRecord freerec;
+    tag_id this,next;
+
+    /* don't do anything if tag files aren't open */
+    if ( devils_.idevc < 0 || devils_.idevt < 0 ) return;
+
+    /* Throw away ``to'' tag records */
+    (void) read_tag((tag_id) *to,&freerec);
+
+    if (freerec.comment)
+	delete_comment(freerec.comment);
+
+    next = freerec.next;
+    while ( next ) {
+        this = next;
+	(void) read_tag( this , &freerec );
+        next = freerec.next;
+        delete_tag_rec (this);
+    }
+
+    /* copy ``from'' record to ``to'' record */
+    (void) read_tag((tag_id) *from,&freerec);
+    (void) write_tag((tag_id) *to,freerec);
+
+    /* blank out old tag rec */
+    blank_tag_rec(*from);
+
+}
+
+char *get_comment(comment_id cp)
+{
+    int dummy;
+    commentStruct c;
+    int count;
+    comment_id nc;
+    char *com,*comptr;
+
+    if (!cp) return NULL;
+    /* determine how long string is */
+    count = 1;
+    nc=cp;
+    read_comment(nc, &dummy, &c);
+    while (c.next != 0) {
+	nc = c.next;
+	count++;
+        read_comment(nc, &dummy, &c);
+    }
+
+    com = comptr = (char *)TAG_MALLOC(count * COMMENT_LENGTH+1);
+    nc=cp;
+    read_comment(nc, &dummy, &c);
+    strncpy(com,c.comment,COMMENT_LENGTH); com+=COMMENT_LENGTH;
+    while (c.next != 0) {
+	nc = c.next;
+	count++;
+        read_comment(nc, &dummy, &c);
+        strncpy(com,c.comment,COMMENT_LENGTH); com+=COMMENT_LENGTH;
+    }
+    *com = '\0';
+
+    return comptr;
+    
+}
+
+void getext_(int_f *gel, char *cutoff, int_f *lcutoff, int_f *ok,
+	     int_fl l_cutoff)
+/*
+** Get right cutoff for lowly Fortran Users
+*/
+{
+
+    tagRecord rec;
+    tag_id next;
+
+    (void) read_tag((tag_id) *gel,&rec);
+
+    next = rec.next;
+
+    while ( next && ! rec.position ) {
+	(void) read_tag( next , &rec );
+        next = rec.next;
+
+	if (strncmp(rec.type.c,TAG_TYPE_RCUT,4) == 0) {
+	    if (rec.comment) {
+	        char *c;
+		int i;
+		/* use this */
+		c = get_comment(rec.comment);
+		/* copy to fortran array */
+		for (i=0; i<*lcutoff && c[i]; i++) cutoff[i] = c[i];
+		/* set length of returned string */
+		*lcutoff = i;
+		for (; i<*lcutoff; i++) cutoff[i] = ' ';
+
+		/* rec temporary string space */
+		TAG_FREE(c);
+		*ok = 0;
+		return;
+	    } else {
+		*ok = 1;
+		return;
+	    }
+
+	}
+    }
+
+    *ok = 1;
+}
+
+
+
+
+
+
+
+
+
+
+
+/*************************************************************/
+/**      Routines for Rodger's pad shuffling                 */
+/*************************************************************/
+
+static void insert_edit_tag(tag_id t_id, int pos, char base, char *type)
+{
+    tagRecord t,new_t;
+    tag_id new_id;
+
+    /* read previous tag */
+    readtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,
+	    &t.type.i,&t.next);
+
+    /* set up new record */
+    strncpy(new_t.type.c,type,3);
+    new_t.type.c[3]=base;
+    new_t.length = 1;
+    new_t.comment = 0;
+    new_t.position = pos;
+    new_t.next = t.next;
+
+    /* write new record */
+    new_id = get_free_tag();
+    writtg_(&devils_.idevt,&new_id,&new_t.position,&new_t.length,&new_t.comment,&new_t.type.i,&new_t.next);
+
+    /* update previous tag */
+    t.next=new_id;
+    writtg_(&devils_.idevt,&t_id,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+}
+
+
+
+void insedt_(int_f *seq_, int_f *pos_, char *base_, int_fl base_l)
+/*
+** Insert a tag for a base at position pos in gel
+*/
+{
+    int seq;
+    int pos;
+    char base;
+
+    tagRecord t;
+    tag_id last,next;
+
+    if (devils_.idevt>0) {
+
+	seq = (int) *seq_;
+	pos = (int) *pos_;
+	base = base_[0];
+
+
+	/* skip over header */
+	last = next = seq;
+	readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	next = t.next;
+	if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	/* adjust lengths of all tags staring before pos */
+	while (next && t.position < pos) {
+	    if (t.position + t.length - 1 >= pos &&
+		t.type.c[0] != '*') {
+		t.length++;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+            last = next;
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	}
+
+	/* insert a tag after last and before next */
+	insert_edit_tag(last,pos,base,TAG_TYPE_INSERT);
+
+	/* shift everything else right */
+	while (next) {
+	    t.position++;
+	    writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	}
+
+    }
+}
+
+
+
+
+
+void deledt_(int_f *seq_, int_f *pos_, char *base_, int_fl base_l)
+/*
+** Delete a base at position pos in gel
+*/
+{
+    int_f seq;
+    int_f pos;
+    char base;
+
+    tagRecord t;
+    tag_id last,next;
+
+    if (devils_.idevt>0) {
+
+	seq = *seq_;
+	pos = *pos_;
+	base = base_[0];
+
+
+	/* skip over header */
+	last = next = (tag_id)seq;
+	readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	next = t.next;
+	if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	/* adjust lengths of all tags staring before pos */
+	while (next && t.position <= pos) {
+	    if (t.position + t.length - 1 >= pos &&
+		t.type.c[0] != '*') {
+		t.length--;
+		writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    }
+            last = next;
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+
+	}
+
+	/* insert a tag after last and before next */
+	insert_edit_tag(last,pos,base,TAG_TYPE_DELETE);
+
+	/* shift everything else right */
+	while (next) {
+	    t.position--;
+	    writtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	    next = t.next;
+	    if (next) readtg_(&devils_.idevt,&next,&t.position,&t.length,&t.comment,&t.type.i,&t.next);
+	}
+
+    }
+}
diff --git a/src/staden/tagUtils.h b/src/staden/tagUtils.h
new file mode 100644
index 0000000..f26153f
--- /dev/null
+++ b/src/staden/tagUtils.h
@@ -0,0 +1,146 @@
+#ifndef _tagUtils_h
+#define _tagUtils_h
+
+#include "fortran.h"
+#include "fort.h"
+/* #include "Sheet.h" */
+/*
+** The following describe two database files:
+**     The tag list file
+**     The comment list file
+**
+** the tag list file consists of at least IDBSIZ records.
+**
+**     1       : header tag for sequence 1
+**     2       : header tag for sequence 2
+**     ...
+**     IDBSIZ-1: header tag for sequence IDBSIZ-1:
+**     IDBSIZ  : descriptor record defining MAX_TAG
+**     IDBSIZ+1: supplimentary tags
+**     ...
+**     MAX_TAG : supplimentary tags
+**
+** the comment list file consists of at least 1 record.
+**
+**     1       : descriptor record defining MAX_COM
+**     2       : supplimentary comment
+**     ...
+**     MAX_COM : supplimentary comment
+**     
+*/
+
+#define COMMENT_LENGTH 40
+/* COMMENT_LENGTH is hard coded in dbsysnew.f */
+/* grep for COMMENT_LENGTH                    */
+#define FILE_NAME_LENGTH 18
+/* FILE_NAME_LENGTH is hard coded in dbsysnew.f */
+/* grep for FILE_NAME_LENGTH                    */
+
+typedef int_f tag_id;
+typedef int_f comment_id;
+
+/*
+** element in tag list
+*/
+typedef union {
+    char c[4];
+    int_f i;
+    } tag_types;
+
+typedef struct _tagRecord{
+    int_f position;        /* position in sequence */
+    int_f length;          /* length of tag */
+    tag_types type;
+    comment_id comment;  /* index to comment */
+    tag_id next;         /* link to next in structure */
+} tagRecord;
+
+/*
+** comment chain
+*/
+typedef struct _commentStruct{
+    char comment[COMMENT_LENGTH];
+    comment_id next;
+} commentStruct;
+
+typedef struct _tagstruct{
+    /*
+    ** Data from the original file
+    */
+    tagRecord tagrec;
+    /*
+    ** Data for database management
+    */
+    tag_id original_tag_id;
+    char *newcomment;
+    int newcommentlen;
+    long flags;
+    int time;
+    struct _tagstruct *next;
+} tagStruct,*tagptr;
+
+#define TAG_UNCHANGED         (0)
+#define TAG_POSITION_CHANGED  (1L<<1)
+#define TAG_LENGTH_CHANGED    (1L<<2)
+#define TAG_TYPE_CHANGED      (1L<<3)
+#define TAG_COMMENT_CHANGED   (1L<<4)
+#define TAG_INSERTED          (1L<<5)
+#define TAG_DELETED           (1L<<6)
+#define TAG_COMMENT_IN_MEMORY (1L<<7)
+
+#define TAG_MALLOC(s) (char *)malloc(s)
+#define TAG_FREE(c)   free(c)
+
+#define TAG_TYPE_INSERT "*INS"
+#define TAG_TYPE_DELETE "*DEL"
+#define TAG_TYPE_LCUT   "*LC*"
+#define TAG_TYPE_RCUT   "*RC*"
+#define TAG_TYPE_RAWDATA "*RAW"
+
+#include "edUtils.h"
+
+/* define external routines */
+extern void force_comment(tagStruct *t);
+
+/* comment interface */
+extern tagStruct *readTagList(int seq);
+extern void writeTagList(EdStruct *xx, int seq);
+extern void destroyTagList(tagStruct *s);
+
+extern void createTag(EdStruct *xx);
+extern void editTag(EdStruct *xx);
+extern void deleteTag(EdStruct *xx);
+/* extern void getTagSplodge(EdStruct *xx, int seq, int pos, int width, XawSheetInk *ink); */
+
+extern void insertTag(EdStruct *xx, int seq, tagStruct *t);
+extern tagStruct *newTag();
+/* this gives me lots of problems...why?
+extern char normaliseBase(EdStruct *xx,int seq,char deletedBase);
+*/
+
+extern void tagInsertBase(EdStruct *xx,int seq,int pos,char *base);
+extern void tagDeleteBase(EdStruct *xx,int seq,int pos,char *base);
+extern void tagReplaceBase(EdStruct *xx,int seq,int pos,char *dbse,char *ibse);
+extern tagStruct *findTag(EdStruct *xx,int seq,int pos);
+extern int origpos(EdStruct *xx, int seq, int pos);
+extern void getLeftCutOff(EdStruct *xx,int seq, int width, char *str);
+extern void getRightCutOff(EdStruct *xx,int seq, int width, char *str);
+extern char *get_comment(comment_id cp);
+extern int lenLCut(EdStruct *xx, int seq);
+extern int lenRCut(EdStruct *xx, int seq);
+extern void getLCut(EdStruct *xx,int seq, int pos, int width, char *str);
+extern void getRCut(EdStruct *xx,int seq, int pos, int width, char *str);
+extern tagStruct *findTagByType(EdStruct *xx, int seq, char *type);
+extern char normaliseBase(EdStruct *xx,int seq,char deletedBase);
+extern void dump_tags(EdStruct *xx, int seq);
+extern void setButtonName(Widget w, char *c);
+extern void createTagTypeMenu(Widget parent, void (*call_back)() );
+extern void delete_comment (comment_id cp);
+extern comment_id put_comment(char *c);
+extern int read_tag(tag_id n, tagRecord *t);
+extern int invokeTagEditor(char *type_id, char *newType, char *tagComment, char **newComment);
+extern int write_tag(tag_id n, tagRecord t);
+extern tag_id get_free_tag();
+extern void delete_tag_rec(tag_id t);
+
+#endif /* _tagUtils_h */
diff --git a/src/staden/tagdb.h b/src/staden/tagdb.h
new file mode 100644
index 0000000..8ecc875
--- /dev/null
+++ b/src/staden/tagdb.h
@@ -0,0 +1,35 @@
+#ifndef _tagdb_h
+#define _tagdb_h
+
+#include <X11/Intrinsic.h>
+
+#define MAX_TAG_TYPES 256
+
+#define TAG_UNKNOWN      0
+#define TAG_SEARCH_ID    1
+#define TAG_FG_COLOUR    2
+#define TAG_BG_COLOUR    3
+#define TAG_DEFAULT_TEXT 4
+
+typedef struct {
+    /*
+    ** values taken from TAGDB file
+    */
+    char *type;
+    char *search_id;
+    char *fg_colour;
+    char *bg_colour;
+    char *default_text;
+    /*
+    ** values derived from above
+    */
+    Pixel fg_pixel;
+    Pixel bg_pixel;
+    char id[4];
+    } tag_db_struct;
+
+extern tag_db_struct tag_db[MAX_TAG_TYPES];
+extern int tag_db_count;
+extern void parse(char *file);
+
+#endif /* _tagdb_h */
diff --git a/src/staden/tagdbparse.c b/src/staden/tagdbparse.c
new file mode 100644
index 0000000..31e97fe
--- /dev/null
+++ b/src/staden/tagdbparse.c
@@ -0,0 +1,327 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <ctype.h>
+#include "tagdb.h"
+    
+/* TOKENS */
+#define NULLTOKEN 0
+#define END 1
+#define ID 2
+#define SEPARATOR 3
+#define NL 4
+#define EQ 5
+    
+static char *infile; /* file being parsed */
+
+static char word[128];
+static int lineno;
+
+/* ---------------- private routines ----------------- */
+static void spring_clean_text (s,t)
+     /* parse out normal escape characters bar hex and octal */
+     char *s;
+     char *t;
+{
+    while (*s != '\0') {
+	if (*s == '\\') 
+	    switch (*++s) {
+	    case 'a':  *t++ = '\a'; s++; break;
+	    case 'b':  *t++ = '\b'; s++; break;
+	    case 'f':  *t++ = '\f'; s++; break;
+	    case 'n':  *t++ = '\n'; s++; break;
+	    case 'r':  *t++ = '\r'; s++; break;
+	    case 't':  *t++ = '\t'; s++; break;
+	    case 'v':  *t++ = '\v'; s++; break;
+	    case '\n':
+		while (isspace(*++s));
+		break;
+		/*
+		  case 'x' : 
+		  if (isxdigit(s[1]) && isxdigit(s[2])) {
+		  #define ASCII_TO_HEX(A) (isdigit(A)?(A-'0'):(tolower(A)-'a'+10))
+		  *t++ = ASCII_TO_HEX(s[1])*16 + ASCII_TO_HEX(s[2]);
+		  s++;s++;s++;
+		  }
+		  break;
+		  */
+	    case '\\':
+	    case '\?':
+	    case '\'':
+	    case '\"':
+	    case '\0': break;
+	    } else *t++ = *s++;
+    }
+    *t = '\0';
+    
+}
+
+
+static int next_word(FILE *fp)
+/*
+ ** lexical analyser
+ ** Get's the next word from the input stream
+ */
+{
+    int a;
+    int token;
+    char *s;
+    
+    token = NULLTOKEN;
+    s = word;
+    
+    while (token == NULLTOKEN) {
+	switch (a = getc(fp)) {
+	case EOF:
+	    token = END;
+	    break;
+	case '\n':
+	    lineno++;
+	    token = NL;
+	    break;
+	case '=':
+	    token = EQ;
+	    break;
+	case ':':
+	    token = SEPARATOR;
+	    break;
+	case '#':
+	    /* comment: skip to end of line */
+	    for(a=getc(fp); a!=EOF && a!='\n'; a=getc(fp));
+	    if (a=='\n') lineno++;
+	    if (a==EOF) token = END;
+	    break;
+	case '\\':
+	    /* back quoted newlines are skipped */
+	    /* back quoted "anything else" is "anything else" */
+	    a = getc(fp);
+	    if (a != EOF && a != '\n') ungetc(a,fp);
+	    break;
+	case '"':
+	    /* quoted string */
+	    for(a=getc(fp);a!=EOF && a!='"';a=getc(fp)) {
+		if (a=='\n') lineno++;
+		*s++ = a;
+	    }
+	    token = ID;
+	    break;
+	default:
+	    if (isalnum(a)) {
+		*s++ = a;
+		for(a=getc(fp);a!=EOF && isalnum(a);a=getc(fp)) *s++ = a;
+		if (a!=EOF) ungetc(a,fp);
+		token = ID;
+	    }
+	    break;
+	}
+    }
+    
+    *s = '\0';
+    spring_clean_text(word,word);
+    return token;
+    
+}
+
+
+static void parse_error(char *s)
+{
+    fprintf( stderr, "%s on line %d of %s\n", s,lineno,infile);
+}
+
+static int snatch(char *s)
+{
+    if (strcmp(s,"id")==0) return TAG_SEARCH_ID;
+    if (strcmp(s,"fg")==0) return TAG_FG_COLOUR;
+    if (strcmp(s,"bg")==0) return TAG_BG_COLOUR;
+    if (strcmp(s,"dt")==0) return TAG_DEFAULT_TEXT;
+    return TAG_UNKNOWN;
+}
+
+static void initTagDBFields(char *type)
+{
+    tag_db[tag_db_count].type = (char *)malloc(strlen(type)+1);
+    strcpy(tag_db[tag_db_count].type,type);
+    tag_db[tag_db_count].search_id = NULL;
+    tag_db[tag_db_count].fg_colour = NULL;
+    tag_db[tag_db_count].bg_colour = NULL;
+    tag_db[tag_db_count].default_text = NULL;
+}
+
+static void tidyUpTagDBFields()
+{
+    int len;
+    
+    if (tag_db[tag_db_count].search_id == NULL) {
+	tag_db[tag_db_count].search_id = tag_db[tag_db_count].type;
+    }
+    len =  strlen(tag_db[tag_db_count].search_id);
+    if (len < 4)
+	strncpy(tag_db[tag_db_count].id,"    ",4);
+    else
+	len = 4;
+    strncpy(tag_db[tag_db_count].id,tag_db[tag_db_count].search_id,len);
+    
+    if (tag_db_count < (MAX_TAG_TYPES-1))
+	tag_db_count++;
+}
+
+
+
+
+
+static int parse_file(FILE *fp)
+     /*
+      ** Parse file and load info into tag_db[]
+      */
+{
+    int at_end_of_file;
+    int at_end_of_entry;
+
+    /* for safe keeping */
+    int field;
+    char *WORD;
+    
+    at_end_of_file = 0;
+    
+    while (!at_end_of_file) {
+	switch (next_word(fp)) {
+	case END:
+	    at_end_of_file = 1;
+	case NL:
+	    break;
+	case ID: 
+	    /* the big time */
+	    initTagDBFields(word);
+	    switch(next_word(fp)) {
+	    case END:
+		at_end_of_file = 1;
+	    case NL:
+		break;
+	    case SEPARATOR:
+		/* arglist */
+		at_end_of_entry = 0;
+		while (!at_end_of_entry) {
+		    switch(next_word(fp)) {
+		    case SEPARATOR:
+			break;
+		    case ID:
+			field = snatch(word);
+			switch(next_word(fp)) {
+			case EQ:
+			    switch(next_word(fp)) {
+			    case ID:
+				WORD = (char *)malloc(strlen(word)+1);
+				strcpy(WORD,word);
+				switch (field) {
+				case TAG_SEARCH_ID :
+				    tag_db[tag_db_count].search_id = WORD;
+				    break;
+				case TAG_FG_COLOUR :
+				    tag_db[tag_db_count].fg_colour = WORD;
+				    break;
+				case TAG_BG_COLOUR :
+				    tag_db[tag_db_count].bg_colour = WORD;
+				    break;
+				case TAG_DEFAULT_TEXT :
+				    tag_db[tag_db_count].default_text = WORD;
+				    break;
+				default:
+				    break;
+				}
+				switch(next_word(fp)) {
+				case END:
+				    at_end_of_file = 1;
+				case NL:
+				    at_end_of_entry = 1;
+				case SEPARATOR:
+				    break;
+				default:
+				    parse_error("Syntax error");
+				    return ( 1 );
+				}
+				break;
+			    default:
+				parse_error("Syntax error");
+				return ( 1 );
+			    }
+			    break;
+			case END:
+			    at_end_of_file = 1;
+			case NL:
+			    at_end_of_entry = 1;
+			case SEPARATOR:
+			    break;
+			default:
+			    parse_error("Syntax error");
+			    return ( 1 );
+			}
+			break;
+		    case END:
+			at_end_of_file = 1;
+		    case NL:
+			at_end_of_entry = 1;
+			break;
+		    default:
+			parse_error("Syntax error");
+			return ( 1 );
+		    }
+		}
+		break;
+	    default:
+		parse_error("Syntax error");
+		return 1;
+	    }
+	    tidyUpTagDBFields();
+	    break;
+	default:
+	    parse_error("Syntax error");
+	    return 1;
+	}
+    }
+    
+    return 0;
+}
+
+
+
+
+
+/* ------------------- exported routines ------------------ */
+
+tag_db_struct tag_db[MAX_TAG_TYPES];
+int tag_db_count;
+
+void parse(char *file)
+{
+    int parse_failed;
+    FILE *fp;
+    
+    char *default_type = "Comment";
+    char *default_search_id = "DEFC";
+    char *default_fg_colour = NULL;
+    char *default_bg_colour = "yellow";
+    char *default_default_text = "Enter your comment here";
+    
+    infile = file;
+    tag_db_count = 0;
+    
+    if(fp = fopen(file,"r")) {
+	lineno = 1;
+	parse_failed = parse_file(fp);
+	if (parse_failed) {
+	    fprintf (stderr,"Error encountered while parsing tag database\nGiving up\n");
+	    tag_db_count = 0;
+	}
+	fclose(fp);
+    } else
+	fprintf (stderr,"Error parsing tag database\nFile `%s' does not exist\nGiving up\n",infile);
+    
+    if (! tag_db_count) {
+	tag_db[tag_db_count].type = default_type;
+	tag_db[tag_db_count].search_id = default_search_id;
+	tag_db[tag_db_count].fg_colour = default_fg_colour;
+	tag_db[tag_db_count].bg_colour = default_bg_colour;
+	tag_db[tag_db_count].default_text = default_default_text;
+	tag_db_count++;
+    }
+    
+}
diff --git a/src/staden/textOutput.c b/src/staden/textOutput.c
new file mode 100644
index 0000000..7671edd
--- /dev/null
+++ b/src/staden/textOutput.c
@@ -0,0 +1,191 @@
+/*
+    Title: 	 textOutput
+
+    File: 	 textOutput.h
+    Purpose:	 Manage a text widget displaying stdout
+    Last update: Thur May 17 1990
+*/
+
+
+/*
+    This module creates a text widget, and arranges for output
+    from stdout to be displayed. The function UpdateTextOutput()
+    must be called periodically to keep the display up to date.
+
+    It works by redirecting stdout to a temporary file. The contents
+    of this are retrieved everytime UpdateTextOutput() is called and
+    they are appended to the (string) text widget. The temporary
+    file is truncated each time, to keep it small, but the text
+    widget may grow arbitrarily large.
+
+    To position the caret at the end of the file we use the value
+    MAXLONG. This is dodgy on two counts:
+    (1) It assumes XawTextPosition is a long.
+    (2) It assumes all routines interpret positions past the end of
+        the file as implying the end of the file.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "textOutput.h"
+#include "main.h"   /* IMPORT: finalx */
+#include "mcspec.h" /* IMPORT: remove */
+
+#include <stdio.h>  /* IMPORT: fopen, fclose, freopen, fprintf, fread, remove, 
+		               BUFSIZ, FILE, tmpnam, L_tmpnam */
+#include "values.h" /* IMPORT: MAXLONG */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+
+
+/* ---- Global variables ---- */
+
+extern Dimension dialogue_width;
+extern Dimension dialogue_height;
+
+/* ---- Static variables ---- */
+#ifndef L_tmpnam
+#define L_tmpnam        25              /* (sizeof(P_tmpdir) + 15) */
+#endif
+
+static Widget textOutputWidget;
+static char  textLogFN[L_tmpnam];
+static FILE *textLogFP;
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void destroyCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   (void) fclose(textLogFP);
+    (void) remove(textLogFN);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+Widget CreateTextOutput(Widget parentWid)
+/*
+    Create the text widget as a child of `parentWid'.
+    `stdout' may be redirected, but is not closed.
+    The display is initially empty.
+*/
+{   static Arg arglist[] =
+    {	{XtNeditType,       (XtArgVal) XawtextEdit},
+	{XtNscrollVertical, (XtArgVal) XawtextScrollAlways},
+	{XtNwrap,           (XtArgVal) XawtextWrapLine},
+    };
+
+    /*
+        Create, or truncate, a temporary file for output
+        redirecting everything from stdout there.
+        Also open it for reading.
+    */
+    if (   (freopen(tmpnam(textLogFN), "w", stdout) == NULL)
+        || ((textLogFP = fopen(textLogFN, "r"))     == NULL)
+       )
+    {   fprintf(stderr,"\n*** Failed to open temporary file. Exiting.\n");
+        finalx(1);
+    }
+
+
+    textOutputWidget = XtCreateManagedWidget("text", asciiTextWidgetClass,
+					     parentWid,
+					     arglist, XtNumber(arglist));
+
+    XawTextSetInsertionPoint(textOutputWidget, (XawTextPosition)MAXLONG);
+    XawTextDisplayCaret(textOutputWidget, False);
+    XtAddCallback(textOutputWidget, XtNdestroyCallback, destroyCallback, NULL);
+
+    XtVaSetValues(textOutputWidget, XtNwidth, dialogue_width,
+		  XtNheight, dialogue_height / 2, NULL);
+
+    return(textOutputWidget);
+}
+
+
+void updout_()
+{
+    UpdateTextOutput();
+}
+
+
+void UpdateTextOutput()
+/*
+    Any text written to `stdout' since the last call of
+    UpdateTextOutput() is added to the display.
+*/
+{   XawTextBlock text;
+    char buf[BUFSIZ];
+    XawTextPosition pos;
+
+
+    /* Flush all output at the FORTRAN & C level*/
+    flusho_();
+    fflush(stdout);
+
+    /* Get the end-of-text position, in a rather dodgy way */
+    XawTextSetInsertionPoint(textOutputWidget, (XawTextPosition)MAXLONG);
+    pos = XawTextGetInsertionPoint(textOutputWidget);
+
+    text.firstPos = 0;
+    text.ptr      = buf;
+    text.format   = FMT8BIT;
+
+    while ((text.length = fread(buf, 1, BUFSIZ, textLogFP)) > 0)
+    {   if (XawTextReplace(textOutputWidget, pos, pos, &text) != XawEditDone)
+        {   fprintf(stderr,"\n*** Problem with text output. Exiting.\n");
+            finalx(1);
+	}
+        pos += text.length;
+    }
+
+    XawTextSetInsertionPoint(textOutputWidget, pos);
+
+
+    /* Truncate the temporary file for output by
+       redirecting everything from stdout there.
+       Also open it again for reading.
+    */
+    if (   (freopen(textLogFN, "w", stdout)    == NULL)
+        || (freopen(textLogFN, "r", textLogFP) == NULL)
+       )
+    {   fprintf(stderr,"\n*** Failed to access temporary file. Exiting.\n");
+        finalx(1);
+    }
+}
+
+
+
+void ClearTextOutput()
+/*
+    The text output is cleared.
+*/
+{   XawTextBlock text;
+    char buf[1];
+    XawTextPosition pos;
+
+    /* Get the end-of-text position, in a rather dodgy way */
+    XawTextSetInsertionPoint(textOutputWidget, MAXLONG);
+    pos = XawTextGetInsertionPoint(textOutputWidget);
+
+    text.firstPos = 0;
+    text.ptr      = buf;
+    text.length   = 0;
+    text.format   = FMT8BIT;
+
+    (void) XawTextReplace(textOutputWidget, 0, pos, &text);
+
+    XawTextSetInsertionPoint(textOutputWidget, 0);
+}
+
diff --git a/src/staden/textOutput.h b/src/staden/textOutput.h
new file mode 100644
index 0000000..c8ce8e2
--- /dev/null
+++ b/src/staden/textOutput.h
@@ -0,0 +1,47 @@
+/*
+    Title: 	 textOutput
+
+    File: 	 textOutput.h
+    Purpose:	 Manage a text widget displaying stdout
+    Last update: Thur May 17 1990
+*/
+
+
+/*
+    This module creates a text widget, and arranges for output
+    from stdout to be displayed. The function UpdateTextOutput()
+    must be called periodically to keep the display up to date.
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget CreateTextOutput(Widget parentWid);
+/*
+    Create the text widget as a child of `parentWid'.
+    `stdout' may be redirected, but is not closed.
+    The display is initially empty.
+*/
+
+
+extern void   UpdateTextOutput();
+/*
+    Any text written to `stdout' since the last call of
+    UpdateTextOutput() is added to the display.
+*/
+
+
+extern void ClearTextOutput();
+/*
+    The text output is cleared.
+*/
diff --git a/src/staden/tman_basesDisplay.c b/src/staden/tman_basesDisplay.c
new file mode 100644
index 0000000..1e9727f
--- /dev/null
+++ b/src/staden/tman_basesDisplay.c
@@ -0,0 +1,373 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    A caret is XOR drawn in the editable graph on top of (i.e. after)
+    all characters have been drawn.    
+
+    Changes to this module by lfw:
+        module displayEdBases and displayBases
+          added int baseNum as a parameter so that
+        the user could specify a baseNum via the
+        command line baseNum parameter or via
+        specifying a string to search for ( if
+        you specify a string to search for
+        ) and have that be where the caret
+        is originally positioned.  Before this
+        modification, the caret was always
+        initially positioned at base 0.
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "tman_basesDisplay.h"
+#include "tman_display.h"
+#include "tman_context.h"
+
+#include "Graph.h"
+#include "seq.h"     /* IMPORT: Seq, NULLSeq, getNPoints */
+
+#include <ctype.h>   /* IMPORT: toupper */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Viewport.h>
+
+
+
+
+/* ---- Constants ---- */
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+
+/* Font information */
+static Dimension charWidth;
+static Dimension halfCharWidth;
+static Dimension charVOffset; /* Of char baseline from top of graph=ascent+1 */
+
+static GC Agc, Cgc, Ggc, Tgc;
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(DisplayContext *dc, int point)
+{   return(((point * dc->plotWidth) / (dc->NPoints-1)) + dc->plotEdgeOffset);
+}
+
+
+int pixelToPoint(DisplayContext *dc, int pixel)
+{   return(((pixel-dc->plotEdgeOffset) * (dc->NPoints-1)) / dc->plotWidth);
+}
+
+
+static int leftCutoffPoint(DisplayContext *dc, int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(dc->seq, OrigBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(dc->seq, OrigBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(DisplayContext *dc, int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NorigBases = getNBases(dc->seq, OrigBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? dc->NPoints-1
+	 : baseNumToPoint(dc->seq, OrigBases, NorigBases-baseNum);
+    pR = (baseNum == 0)
+         ? dc->NPoints-1
+	 : baseNumToPoint(dc->seq, OrigBases, NorigBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+static void writeBase(DisplayContext *dc, Widget wid, int baseNum)
+{
+    char       base  = getBase(dc->seq, OrigBases, baseNum);
+    int        point = baseNumToPoint(dc->seq, OrigBases, baseNum);
+    GC gc;
+
+    /* Get the correct GC */
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+        default:  gc=Ggc; break;
+    }
+
+    XDrawImageString(XtDisplay(wid),
+		     XtWindow(wid),
+		     gc,
+		     pointToPixel(dc,point)-halfCharWidth, charVOffset,
+		     &base, 1);
+}
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN;    /* Affected region: pixels */
+    int p0, pN;    /* Affected region: points */
+    int b, b0, bN; /* Affected region: bases */
+    int leftCutoffP, rightCutoffP;
+    int NBases;
+    int whichBases;
+    DisplayContext *dc = widgetToDisplayContext(XtParent(XtParent(XtParent(wid))));
+
+    if (dc->seq == NULLSeq) return;
+
+    leftCutoffP  = leftCutoffPoint(dc,dc->leftCutoff);
+    rightCutoffP = rightCutoffPoint(dc,dc->rightCutoff);
+    NBases     = dc->seq->NorigBases;
+    whichBases = OrigBases;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+
+
+    /* Convert to affected points after trimming to the plot proper */
+    p0=pixelToPoint(dc,(x0<dc->plotEdgeOffset)          ?dc->plotEdgeOffset          :x0);
+    pN=pixelToPoint(dc,(xN>dc->plotWidth+dc->plotEdgeOffset)?dc->plotWidth+dc->plotEdgeOffset:xN);
+    if (pN<dc->NPoints-1) pN++;
+
+
+    /*
+        Now draw the affected bases.
+    */
+
+    /* The first base is to the left of the exposed region */
+    b0 = pointToBaseNum(dc->seq, whichBases, p0);
+    b0 =   (b0 == NULLBaseNum) ? NBases-1
+         : (b0>0)              ? b0-1
+	 :                       0;
+    /* The last base is to the right of the exposed region */
+    bN = pointToBaseNum(dc->seq, whichBases, pN);
+    bN = (bN == NULLBaseNum) ? NBases-1 : bN;
+
+    if (b0<dc->leftCutoff)
+    /*
+        Some of the bases are in the left cutoff area.
+	Draw them, the edited ones with the dim background.
+    */
+    {
+	for (b=b0; b<=dc->leftCutoff-1; b++) writeBase(dc,wid, b);
+    }
+
+    if (bN>=dc->leftCutoff && b0-dc->seq->NorigBases-1<dc->rightCutoff)
+    /*
+        Some of the exposed region is in the middle area.
+	Draw the bases.
+    */
+    {   int firstBase = (b0<dc->leftCutoff) ? dc->leftCutoff : b0;
+	int lastBase  =   (bN>dc->seq->NorigBases-1-dc->rightCutoff)
+	                 ? dc->seq->NorigBases-1-dc->rightCutoff
+			 : bN;
+
+	for (b=firstBase; b<=lastBase; b++) writeBase(dc, wid, b);
+    }
+
+    if (bN>dc->seq->NorigBases-1-dc->rightCutoff)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {
+       /* Plot the affected bases */
+       for (b=dc->seq->NorigBases-dc->rightCutoff; b<=bN; b++) writeBase(dc, wid, b);
+    }
+
+
+}
+
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Widget createBasesDisplay(DisplayContext *dc, Widget parentWid, Widget fromVertWid,
+				 Dimension width)
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    dc->seq    = NULLSeq;
+    dc->graphWidth = width;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    dc->graphWidth);  nargs++;
+    dc->origSeqWid = XtCreateManagedWidget("origSeq", graphWidgetClass,
+				       parentWid, args, nargs);
+    XtAddCallback(dc->origSeqWid, XtNexposeCallback, exposeCallback, NULL);
+
+        /*
+	    Get font information.
+	    The text graphs are set to be two pixels higher than
+	    (ascent+descent). Characters are then plotted with
+	    a vertical offset one more than the ascent.
+	*/
+        { XFontStruct *fontStruct;
+
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNfont, &fontStruct); nargs++;
+	  XtGetValues(dc->origSeqWid, args, nargs);
+	  charWidth      = fontStruct->max_bounds.width;
+	  halfCharWidth  = fontStruct->max_bounds.width/2;
+	  charVOffset    = fontStruct->ascent+1;
+	  dc->plotEdgeOffset = halfCharWidth+2;
+	  dc->plotWidth      = dc->graphWidth-(2*dc->plotEdgeOffset);
+	  dc->graphHeight    = fontStruct->ascent + fontStruct->descent + 2;
+	  /* Make this graph tall enough to hold it */
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNheight, dc->graphHeight); nargs++;
+	  XtSetValues(dc->origSeqWid, args, nargs);
+	}
+
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(dc->origSeqWid);
+    Cgc = GraphGC2(dc->origSeqWid);
+    Ggc = GraphGC3(dc->origSeqWid);
+    Tgc = GraphGC4(dc->origSeqWid);
+
+    return dc->origSeqWid;
+}
+
+
+
+
+void getBasesFontInfo(DisplayContext *dc,Dimension *offset, Dimension *chWidth)
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+{   *offset  = dc->plotEdgeOffset;
+    *chWidth = charWidth;
+}
+
+
+
+
+void initBaseDisplay(Seq seq, int baseNum)
+/*
+    Initialise the base display of sequence `seq'.
+*/
+{
+    DisplayContext *dc = seqToDisplayContext(seq);
+
+    if (seq == NULLSeq) return;
+    dc->seq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    dc->NPoints     = getNPoints(dc->seq);
+    getCutoffs(dc->seq, &dc->leftCutoff, &dc->rightCutoff);
+
+
+}
+
+
+
+void setBasesWidth(DisplayContext *dc, Dimension width)
+/*
+    Set the width of the current sequence bases display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(dc->origSeqWid, args, nargs);
+
+    dc->graphWidth = width;
+    dc->plotWidth  = dc->graphWidth-(2*dc->plotEdgeOffset);
+
+    /*
+        Clear the graphs.
+    */
+    if (XtIsRealized(dc->origSeqWid))
+        XClearWindow(XtDisplay(dc->origSeqWid), XtWindow(dc->origSeqWid));
+}
+
+
+
+
+
+
diff --git a/src/staden/tman_basesDisplay.h b/src/staden/tman_basesDisplay.h
new file mode 100644
index 0000000..ac542fa
--- /dev/null
+++ b/src/staden/tman_basesDisplay.h
@@ -0,0 +1,65 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+#include "tman_context.h"
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget createBasesDisplay(DisplayContext *dc,Widget parentWid, Widget fromVertWid,
+				 Dimension width);
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+
+
+extern void getBasesFontInfo(DisplayContext *dc, Dimension *offset, Dimension *chWidth);
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+
+
+extern void initBaseDisplay(Seq seq, int baseNum);
+/*
+    Initialise the base display of sequence `seq'.
+*/
+
+
+extern void setBasesWidth(DisplayContext *dc, Dimension width);
+/*
+    Set the width of the current sequence bases display.
+*/
+
+
+extern int pixelToPoint(DisplayContext *dc,int pixel);
+/*
+    Given `pixel' on the bases display, return the corresponding point.
+    Fractional results are truncated - this may mean a point one too
+    low is indicated, but a point too high is never indicated.
+*/
+
diff --git a/src/staden/tman_context.c b/src/staden/tman_context.c
new file mode 100644
index 0000000..83fa88b
--- /dev/null
+++ b/src/staden/tman_context.c
@@ -0,0 +1,156 @@
+/*
+  Change log:
+  14 August 1991
+      add catchloop to destroyContexts, and loosen conditional
+      in destroyDisplayContext from "if (dc != NULL && dc->seq != NULL)"
+      to "if (dc != NULL)"
+
+  20 January 1992
+      New routine nameToDisplayContext added
+
+*/
+
+
+
+#include "seq.h"
+#include "tman_context.h"
+
+#include <stdio.h>
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+static DisplayContext contexts[MAXCONTEXTS];
+static contextList[MAXCONTEXTS];
+static int contextCount;
+
+void initialiseDisplayContexts()
+/*
+** null all contexts
+*/
+{
+    int i;
+
+    contextCount = 0;
+    for (i=0; i<MAXCONTEXTS; i++) {
+	contextList[i] = i;
+    }
+}
+
+
+DisplayContext *widgetToDisplayContext (Widget w)
+/*
+** Given a widget, return the display context containing it
+*/
+{
+    int i;
+
+    for (i=0;
+	 i<contextCount && contexts[contextList[i]].mainFormWid != w ;
+	 i++);
+    return (i==contextCount)?NULL:&contexts[contextList[i]];
+}
+
+DisplayContext *nameToDisplayContext (char *name)
+/*
+** Given a trace name, return the display context containing it
+*/
+{
+    int i;
+
+    for (i=0;
+	 i<contextCount && (strncmp(contexts[contextList[i]].traceName,name,FILE_NAME_LENGTH)!=0) ;
+	 i++);
+
+    return (i==contextCount)?NULL:&contexts[contextList[i]];
+}
+
+
+
+DisplayContext *seqToDisplayContext (Seq s)
+/*
+** Given a widget, return the display context containing it
+*/
+{
+    int i;
+    for (i=0;
+	 i<contextCount && contexts[contextList[i]].seq != s ;
+         i++);
+    return (i==contextCount)?NULL:&contexts[contextList[i]];
+}
+
+DisplayContext *getFreeDisplayContext ()
+/*
+** Find a free display context
+** If none, use oldest
+*/
+{
+    int i;
+
+    if (contextCount==MAXCONTEXTS) {
+        /*
+	** destroy old context
+        */
+	destroyDisplayContext(&contexts[contextList[0]]);
+    }
+    i = contextList[contextCount++];
+
+    return (&contexts[i]);
+}
+
+
+
+void destroyDisplayContext (DisplayContext *dc)
+{
+    int i,j;
+    Arg args[10];
+    int nargs;
+    int saveIndex;
+
+    if (dc != NULL) {
+        /*
+	** destroy context
+        */
+	Widget parent = XtParent(dc->mainFormWid);
+	XawFormDoLayout(parent,False);
+
+	for (i=0; i<contextCount && dc!=&contexts[contextList[i]]; i++);
+	saveIndex = contextList[i];
+
+        if (i < (contextCount-1)) {
+	    Widget bridge = (i==0)?NULL:contexts[contextList[i-1]].mainFormWid;
+            nargs = 0;
+            XtSetArg(args[nargs], XtNfromVert, bridge); nargs++;
+            XtSetValues(contexts[contextList[i+1]].mainFormWid, args, nargs);
+	}
+
+	XtUnmanageChild(dc->mainFormWid);
+	XtDestroyWidget(dc->mainFormWid);
+	freeSeq(dc->seq);
+        dc->seq = NULL;
+        dc->mainFormWid = NULL;
+
+	for (j=i+1; j<contextCount; j++) contextList[j-1] = contextList[j];
+	contextList[--contextCount] = saveIndex;
+
+	XawFormDoLayout(parent,True);
+    }
+}
+
+void destroyContexts ()
+{
+    DisplayContext *dc;
+    int catchloop = MAXCONTEXTS;
+
+    dc = getLastDisplayContext();
+    while (dc != NULL && catchloop--) {
+	destroyDisplayContext(dc);
+        dc = getLastDisplayContext();
+    }
+}
+
+DisplayContext * getLastDisplayContext()
+{
+    return (contextCount)?(&contexts[contextList[contextCount-1]]):NULL;
+}
diff --git a/src/staden/tman_context.h b/src/staden/tman_context.h
new file mode 100644
index 0000000..fc2d5f2
--- /dev/null
+++ b/src/staden/tman_context.h
@@ -0,0 +1,87 @@
+#ifndef _context_h
+#define _context_h
+
+
+/* 
+    Title:       context
+
+    File: 	 context.h
+    Purpose:	 To hold display context information
+    Last update: Monday 20 January 1992
+*/
+#include "seq.h"
+#include "tagUtils.h"         /* IMPORT - FILE_NAME_LENGTH */
+#include <X11/Intrinsic.h>
+
+#define MAXCONTEXTS 4
+typedef struct {
+    Seq seq;
+    Widget viewportWid;
+    Widget vpFormWid;
+    Widget traceWid;
+    Widget origSeqWid;
+    Widget mainFormWid;
+    Widget gadgetsWid;
+    
+    Dimension minGraphWidth;  /* All characters displayed
+				 (depends on width of parent viewport) */
+    Dimension maxGraphWidth;  /* Bases spaced out with `MaxMagCharSpace'
+				 blanks (depends on char size and
+				 NorigBases) */
+    Dimension graphWidth;     /* Current width of the graph */
+    int NPoints;
+    int maxTraceVal;
+    Dimension traceWidth;     /* Current width of the trace graph */
+    Dimension traceHeight;    /* Current height of the trace graph */
+    Dimension plotEdgeOffset;
+    Dimension plotWidth;      /* Current width of the plot
+				 = graphWidth - 2*plotEdgeOffset */
+    int leftCutoff, rightCutoff;
+    Dimension graphHeight;
+    float scaleFactor;
+    char traceName[FILE_NAME_LENGTH]; /* name of the trace file */
+
+} DisplayContext, *DisplayContextPtr;
+
+extern void initialiseDisplayContexts();
+/*
+** clear the slate
+*/
+
+extern DisplayContext *widgetToDisplayContext (Widget w);
+/*
+** Given a widget, return the display context containing it
+*/
+
+extern DisplayContext *nameToDisplayContext (char *name);
+/*
+** Given a trace name, return the display context containing it
+*/
+
+extern DisplayContext *seqToDisplayContext (Seq s);
+/*
+** Given a widget, return the display context containing it
+*/
+
+extern DisplayContext *getFreeDisplayContext ();
+/*
+** Find a free display context
+** If none, use oldest
+*/
+
+extern void destroyDisplayContext (DisplayContext *dc);
+/*
+** Do away with a single context
+*/
+
+extern void destroyContexts ();
+/*
+** Do away with all contexts
+*/
+
+extern DisplayContext * getLastDisplayContext();
+/*
+** return last context in list
+*/
+
+#endif /* _context_h */
diff --git a/src/staden/tman_display.c b/src/staden/tman_display.c
new file mode 100644
index 0000000..c6b42db
--- /dev/null
+++ b/src/staden/tman_display.c
@@ -0,0 +1,426 @@
+/* 
+    Title:       tman_display
+
+    File: 	 tman_display.c
+    Purpose:	 Sequence display for trace manager
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+    The display is created by a call to CreateDisplay. It principally
+    consists of two components: the first, created by a call to
+    createBasesDisplay displays the original and edited bases, the
+    second, created by a call to createTracesDisplay, displays the
+    traces.
+
+    The viewport widget does not itself hand out resize events, so we
+    create a dummy (graph) widget in the same patch, connected in the
+    same way to the walls of the form, to collect resize events and
+    allow us to adjust scalings for the graphs within the viewport.
+    This is slightly dodgy and should be redone some other way.
+
+  Changes to this program by lfw:
+  module displaySeq
+        Added the parameter mag to the parameters
+                passed to the subroutine
+        Changed to default magnification to 30%
+               by changing the call to setGraphWidth
+               to be 30 rather than -1.  And by
+               allowing the call to setGraphWidth
+               to be specified by the user which
+               comes from the main calling program
+               via the mag parameter
+
+    9 July: New trace title parameter in createDisplay 
+    20 January: reposition trace if already managed
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "tman_display.h"
+#include "tman_traceDisplay.h" /* IMPORT: createTraceDisplay, displayTrace,
+			             unDisplayTrace, setTraceWidth */
+#include "tman_basesDisplay.h" /* IMPORT: createBasesDisplay, displayBases,
+			             unDisplayBases, setBasesWidth */
+#include "tman_context.h"      /* IMPORT: DisplayContext getFreeDisplayContext */
+#include "tman_gadgets.h"
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+#include "edUtils.h"
+#include "tagUtils.h"     /* IMPORT: FILE_NAME_LENGTH */
+
+#include <stdio.h>
+#include <ctype.h>        /* IMPORT: isgraph */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Viewport.h>
+#include <X11/Xaw/Scrollbar.h>
+
+
+
+
+/* ---- Constants ---- */
+
+
+#define MaxDimension 32767 /* By definition, 2^16-1 */
+#define MaxMagCharSpace 5 /* At maximum magnification, the distance in
+			     characters between each base */
+#define ViewportFudge 10  /* A guess of the space required within the
+			     viewport for the graph to fit */
+
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void setGraphWidth(DisplayContext *dc,int percent)
+/*
+      Set the widths of the graphs to be `percent' between
+      the minimum and maximum widths.
+      The current ``centre of interest'' is maintained.
+*/
+{   int nargs;
+    Arg args[10];
+    float centre, topOfThumb, shown;
+    Widget hScrollWid = XtNameToWidget(dc->viewportWid, "horizontal");
+
+    /*
+        The ``centre of interest'' is the middle of the displayed
+        graph. Percentagewise, this is given by `topOfThumb+show/2'.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    centre = topOfThumb+shown/2;
+
+    /*
+        Set the displays to the required width
+    */
+    dc->graphWidth = ((dc->maxGraphWidth-dc->minGraphWidth) * percent)/100 + dc->minGraphWidth;
+    XawFormDoLayout(dc->vpFormWid, False);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, dc->graphWidth); nargs++;
+    setBasesWidth(dc,dc->graphWidth);
+    setTraceWidth(dc,dc->graphWidth);
+    XawFormDoLayout(dc->vpFormWid, True);
+
+    /*
+        We have changed the size of the form widget within the viewport
+        widget. The viewport does not define where exactly the form
+        will now sit, so we manually reset the ``centre of interest'',
+        keeping `shown' at its (new) value.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    topOfThumb = centre-shown/2;
+    XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+    XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+
+}
+
+
+
+
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+#ifdef notdef
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Dimension width;
+    int mag;
+    Arg args[10];
+    int nargs;
+
+    DisplayContext *dc = widgetToDisplayContext(XtParent(wid));
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtGetValues(dc->viewportWid, args, nargs);
+    
+    /* Back calculate what the current magnification is */
+    mag = ((dc->graphWidth-dc->minGraphWidth)*100) / (dc->maxGraphWidth-dc->minGraphWidth);
+
+    /* Adjust the minimum magnification */
+    dc->minGraphWidth = width-ViewportFudge;
+    dc->graphWidth    = dc->minGraphWidth;
+
+    /* Set us to the old magnification, but with the new dimensions */
+    setGraphWidth(dc,mag);
+}
+
+#endif
+
+
+
+
+
+/* ---- Exports ---- */
+
+
+DisplayContext *createDisplay(Widget parentWid, Widget fromVertWid, char* traceName, char *traceTitle)
+/*
+    Create the display within the Form widget `parentWid'
+    with the XtNfromVert constraint `fromVertWid'.
+    No sequence is initally displayed.
+*/
+{
+    EdStruct *xx = intToEdStruct(0);
+
+    Widget basesWid;
+    Arg args[10];
+    int nargs;
+    Dimension charWidth;      /* As used by basesDisplay */
+    Dimension plotEdgeOffset;
+    Dimension vpWidth = 0;
+    Dimension vpHeight = 0;
+
+    DisplayContext *dc = getFreeDisplayContext();
+
+    strncpy(dc->traceName,traceName,FILE_NAME_LENGTH);
+
+    if (fromVertWid != NULL) {
+	DisplayContext *ref = widgetToDisplayContext(fromVertWid);
+
+	nargs = 0;
+	XtSetArg(args[nargs], XtNwidth, &vpWidth); nargs++;
+	XtSetArg(args[nargs], XtNheight, &vpHeight); nargs++;
+	XtGetValues(ref->viewportWid , args, nargs);
+    } else {
+	/*
+	** make default width to be the same as the
+	** contig editor sequences widget
+	*/
+	nargs = 0;
+	XtSetArg(args[nargs], XtNwidth, &vpWidth); nargs++;
+	XtGetValues(xx->sequencesWid, args, nargs);
+
+    }
+
+    /* The main display */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    dc->mainFormWid = XtCreateWidget("form", formWidgetClass,
+					parentWid, args, nargs);
+
+
+    /* Other buttons */
+    nargs = 0;
+    dc->gadgetsWid = XtCreateManagedWidget("gadgets", formWidgetClass,
+					dc->mainFormWid, args, nargs);
+
+    createGadgets(dc->gadgetsWid,traceTitle);
+
+
+    /* create graph */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNforceBars, True); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, dc->gadgetsWid); nargs++;
+    if (vpWidth != 0) {
+        XtSetArg(args[nargs], XtNwidth, vpWidth); nargs++;
+    }
+    if (vpHeight != 0) {
+        XtSetArg(args[nargs], XtNheight, vpHeight); nargs++;
+    }
+    dc->viewportWid = XtCreateManagedWidget("viewport", viewportWidgetClass,
+					dc->mainFormWid, args, nargs);
+
+
+    /*
+    The minimum graph width is the viewport width minus a
+    random amount to allow for spacing between the viewport
+    and the graph, borders etc.
+    The initial graph width is this minimum width.
+    */
+
+    {
+	Dimension vpWidth;
+
+	nargs = 0;
+	XtSetArg(args[nargs], XtNwidth, &vpWidth); nargs++;
+	XtGetValues(dc->viewportWid, args, nargs);
+
+	dc->minGraphWidth = vpWidth-ViewportFudge;
+	dc->graphWidth = dc->minGraphWidth;
+    }
+
+
+
+
+    nargs = 0;
+    dc->vpFormWid = XtCreateManagedWidget("vpForm", formWidgetClass,
+				      dc->viewportWid, args, nargs);
+
+
+    basesWid = createBasesDisplay(dc, dc->vpFormWid, NULL, dc->graphWidth);
+
+    getBasesFontInfo(dc,&plotEdgeOffset, &charWidth);
+    createTraceDisplay(dc, dc->vpFormWid,basesWid,dc->graphWidth, plotEdgeOffset);
+
+    XtManageChild(dc->mainFormWid); 
+
+    return dc;
+}
+
+
+DisplayContext *getDisplay(Widget parentWid, Widget fromVertWid, char* traceName, char *traceTitle)
+/*
+  get a display context, create one if necessary
+*/
+{
+
+    DisplayContext *dc;
+
+    dc = nameToDisplayContext(traceName);
+
+    if (dc == NULL)
+	dc = createDisplay(parentWid, fromVertWid, traceName, traceTitle);
+
+    return dc;
+
+}
+
+
+void displaySeq(Seq seq, int baseNum, int leftCutOff, int cutLength, int baseSpacing)
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to NULLBaseNum, display at 30% (or mag)
+    magnification centered on baseNum.
+*/
+{
+
+    DisplayContext *dc = seqToDisplayContext(seq);
+
+    if (seq == NULLSeq) return;
+
+    initBaseDisplay(seq,baseNum);
+    initTraceDisplay(seq);
+
+
+    /*
+      The maximum graph width is such to allow for a `MaxMagCharSpace'
+      spacing between bases.
+
+      There is a limit on Dimension sizes, noting that the graph we
+      generate has to fit into a Viewport.
+    */
+    {
+	long maxWidth;
+        int  NorigBases;
+        Dimension charWidth;      /* As used by basesDisplay */
+        Dimension plotEdgeOffset;
+
+        NorigBases = getNBases(seq, OrigBases);
+        getBasesFontInfo(dc,&plotEdgeOffset, &charWidth);
+/*
+        maxWidth = ((NorigBases-1) * charWidth * MaxMagCharSpace) +
+	                (2 * plotEdgeOffset);
+**
+**
+        maxWidth = (int) ((float) ((NorigBases-1) * baseSpacing) *
+		          (float) dc->NPoints /
+		          (float) baseNumToPoint(seq, OrigBases,NorigBases-1)
+		         ) + (2 * plotEdgeOffset);
+*/
+	if (leftCutOff > NorigBases)
+	    leftCutOff = NorigBases;
+	if (leftCutOff < 0)
+	    leftCutOff = 0;
+	if (leftCutOff+cutLength > NorigBases)
+	    cutLength = NorigBases-leftCutOff;
+
+        maxWidth = (int) ((float) ((cutLength-1) * baseSpacing) *
+		          (float) dc->NPoints / (
+		          (float) baseNumToPoint(seq, OrigBases,leftCutOff+cutLength-1) -
+		          (float) baseNumToPoint(seq, OrigBases,leftCutOff))
+		         ) + (2 * plotEdgeOffset);
+
+        dc->maxGraphWidth =
+	      (maxWidth>MaxDimension-ViewportFudge) ? MaxDimension-ViewportFudge
+	    : (maxWidth<dc->minGraphWidth)          ? dc->minGraphWidth
+	    :                                         maxWidth;
+    }
+
+
+    if (baseNum!=NULLBaseNum)
+    {   /* Centred on baseNum, half magnification */
+        Widget hScrollWid = XtNameToWidget(dc->viewportWid, "horizontal");
+        float topOfThumb;
+
+        /* Set the `centre of interest' on baseNum. Zero `shown'. */
+        topOfThumb = (float) baseNumToPoint(seq, OrigBases,baseNum) / (float) dc->NPoints;
+
+        XawScrollbarSetThumb(hScrollWid, topOfThumb, 0.0);
+
+    }
+
+    setGraphWidth(dc,100/*mag*/);
+}
+
+
+
+void repositionSeq(Seq seq, int baseNum)
+/*
+    Reposition the sequence `seq' at a given base number (centred)
+*/
+{
+
+    DisplayContext *dc = seqToDisplayContext(seq);
+
+    if (seq == NULLSeq) return;
+
+
+    if (baseNum!=NULLBaseNum)
+    {   /* Centred on baseNum, half magnification */
+        Widget hScrollWid = XtNameToWidget(dc->viewportWid, "horizontal");
+        float topOfThumb;
+	float shown;
+	int nargs;
+	Arg args[10];
+
+        /* Set the `centre of interest' on baseNum. */
+	nargs=0;
+	XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+	XtGetValues(hScrollWid, args, nargs);
+        topOfThumb = (float) baseNumToPoint(seq, OrigBases,baseNum) / (float) dc->NPoints;
+	topOfThumb = topOfThumb - shown/2;
+        XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+
+	/* force redraw the only way we know how */
+	XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+    }
+
+}
+
+
+
+
+
+
diff --git a/src/staden/tman_display.h b/src/staden/tman_display.h
new file mode 100644
index 0000000..f4765f9
--- /dev/null
+++ b/src/staden/tman_display.h
@@ -0,0 +1,81 @@
+/* 
+    Title:       tman_display
+
+    File: 	 tman_display.h
+    Purpose:	 Sequence display for trace manager
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+#include "tman_context.h"
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern DisplayContext *createDisplay(Widget parentWid, Widget superiorWid, char *traceName, char *traceTitle);
+/*
+    Create the display within the Form widget `parent',
+    underneath `superiorWid'.
+    No sequence is initially displayed.
+*/
+
+extern DisplayContext *getDisplay(Widget parentWid, Widget superiorWid, char *traceName, char *traceTitle);
+/*
+    Get a display context for traceName, reusing one if one for the
+    same name exists.
+    No sequence is initially displayed.
+*/
+
+
+extern void displaySeq(Seq seq, int baseNum, int leftCutOff, int cutLength, int baseSpacing);
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to -1, display at half magnification
+    centered on baseNum. (unless the user has also set mag)
+*/
+
+extern void repositionSeq(Seq seq, int baseNum);
+/*
+    Reposition the sequence `seq' at a given base number (centred)
+*/
+
+
+extern void unDisplaySeq();
+/*
+    Cease displaying the current sequence (if any).
+*/
+
+
+extern Seq displayedSeq();
+/*
+    Return the currently displayed sequence, or NULLSeq
+    if none is being displayed.
+*/
+
+
+extern void userEvent(XEvent *eventP);
+/*
+    Handle the (key or button) event generated in a subdisplay.
+*/
+
+
+
+
+
+
+
+
diff --git a/src/staden/tman_gadgets.c b/src/staden/tman_gadgets.c
new file mode 100644
index 0000000..bf9d793
--- /dev/null
+++ b/src/staden/tman_gadgets.c
@@ -0,0 +1,132 @@
+/* 
+    Title:       Trace manager graph gadgets
+
+    File: 	 tman_gadgets.c
+    Purpose:	 gadgets for the trace manager
+    Last update: Tuesday 9 July 1991
+*/
+
+
+/*
+9 July 1991 SD  Widened label field from 12 to 18 chars
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "tman_context.h"
+#include "tman_traceDisplay.h"
+#include "Graph.h"
+#include <X11/StringDefs.h>
+#include <X11/Intrinsic.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Form.h>
+
+/* ---- Constants ---- */
+
+
+/* ---- Statics ---- */
+
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void quitCallback(Widget wid,
+                            XtPointer client_data, XtPointer call_data)
+{
+    DisplayContext *dc=widgetToDisplayContext(XtParent(XtParent(wid)));
+
+    destroyDisplayContext(dc);
+}
+
+
+
+static void scaleUpCallback(Widget wid,
+                            XtPointer client_data, XtPointer call_data)
+{
+    DisplayContext *dc=widgetToDisplayContext(XtParent(XtParent(wid)));
+
+    incScaleFactor(dc);
+}
+
+static void scaleDownCallback(Widget wid,
+                            XtPointer client_data, XtPointer call_data)
+{
+    DisplayContext *dc=widgetToDisplayContext(XtParent(XtParent(wid)));
+
+    decScaleFactor(dc);
+}
+
+
+
+
+
+
+
+/* ---- Exports ---- */
+
+
+void createGadgets(Widget parentWid,char *traceName)
+/*
+*/
+{
+    Widget labelWid,scaleUpWid,scaleDownWid,/*lockWid,*/quitWid;
+    Arg args[10];
+    int nargs;
+
+    char traceLabel[18];
+    Cstr2Fstr(traceName,traceLabel,18);
+    traceLabel[17]='\0';
+
+    /*
+    ** A title
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNlabel, traceLabel); nargs++;
+    labelWid = XtCreateManagedWidget("traceName", labelWidgetClass,
+					parentWid, args, nargs);
+
+    /* for vertical scaling */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, labelWid); nargs++;
+    scaleUpWid = XtCreateManagedWidget("scaleUp", commandWidgetClass,
+                                   parentWid, args, nargs);
+    XtAddCallback(scaleUpWid, XtNcallback, scaleUpCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, scaleUpWid); nargs++;
+    scaleDownWid = XtCreateManagedWidget("scaleDown", commandWidgetClass,
+                                   parentWid, args, nargs);
+    XtAddCallback(scaleDownWid, XtNcallback, scaleDownCallback, NULL);
+
+    /* Other buttons */
+    /*
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, scaleDownWid); nargs++;
+    lockWid = XtCreateManagedWidget("lock", toggleWidgetClass,
+					parentWid, args, nargs);
+    */
+
+    nargs = 0;
+    /*
+    XtSetArg(args[nargs], XtNfromHoriz, lockWid); nargs++;
+    */
+    XtSetArg(args[nargs], XtNfromVert, scaleDownWid); nargs++;
+    quitWid = XtCreateManagedWidget("quit", commandWidgetClass,
+                                   parentWid, args, nargs);
+    XtAddCallback(quitWid, XtNcallback, quitCallback, NULL);
+
+}
diff --git a/src/staden/tman_gadgets.h b/src/staden/tman_gadgets.h
new file mode 100644
index 0000000..67e4c12
--- /dev/null
+++ b/src/staden/tman_gadgets.h
@@ -0,0 +1,27 @@
+/* 
+    Title:       graph gadgets
+
+    File: 	 gadgets.h
+    Purpose:	 gadgets for the graph viewport
+    Last update: Thursday Jan 10 1991
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <X11/Intrinsic.h>
+
+
+/* ---- Exports ---- */
+
+
+extern void createGadgets(Widget parentWid, char *traceName);
+/*
+*/
diff --git a/src/staden/tman_main.c b/src/staden/tman_main.c
new file mode 100644
index 0000000..d386e3c
--- /dev/null
+++ b/src/staden/tman_main.c
@@ -0,0 +1,251 @@
+/*
+    Title: 	 tman_main
+
+    File: 	 tman_main.c
+    Purpose:	 Main module for Trace Manager
+    Last update: Tuesday 14 April 1992
+
+    Change log:
+
+	9 July 1991 New trace title parameter in manageTrace, and call to
+		createDisplay
+		Fixed problem with geometry management with a sleep(1)
+		after widget popup. Seems to work...not sure why
+
+	16 Jan 1992 When trace is already present in the trace manager
+	        reposition old trace
+	
+	14 Feb 1992 Support SCF format traces
+	14 Apr 1992 If file if SCF read in regardless of claimed format.
+*/
+
+
+/*
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "mystdlib.h"  /* IMPORT: exit */
+
+#include "tman_main.h"
+#include "tman_display.h"    /* IMPORT: displaySeq, unDisplaySeq */
+#include "tman_traceDisplay.h"    /* IMPORT: setScaleFactor */
+#include "tman_context.h"
+#include "seq.h"        /* IMPORT: Seq, NULLSeq */
+#include "seqIOABI.h"   /* IMPORT: readSeqABI   */
+#include "seqIOALF.h"   /* IMPORT: readSeqALF   */
+#include "seqIOSCF.h"   /* IMPORT: readSeqSCF   */
+#include "seqIOPlain.h" /* IMPORT: readSeqPlain   */
+
+
+#include <stdio.h>
+/*#include <string.h>*/
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Box.h>
+
+/* ---- definitions ---- */
+#define TMAN_STATE_DOWN 0
+#define TMAN_STATE_UP   1
+
+
+/* ---- Static variables ---- */
+static int tman_state = TMAN_STATE_DOWN;
+static Widget traceManagerShellWidget;
+static Widget mainFormWid;
+static Widget buttonWid,quitWid,hideWid;
+static Widget traceForm;
+
+
+static void initialDisplayedSeq(DisplayContext *dc,
+			char *format, char *fn,
+			int baseNum, int leftCutOff, int cutLength,
+			int bottom, int baseSpacing)
+/*
+    This function may be called once, after the application
+    has been realised, to specify a sequence to be displayed
+    initially. If it is not called, no sequence is initially
+    displayed.
+*/
+{ 
+    if (dc->seq == NULLSeq) {
+	/*
+	** if file is in SCF read in as such regardless of `format'
+	*/
+	if (is_SCF(fn)) 
+	    dc->seq = readSeqSCF(fn);
+	else {
+	    if (strcmp(format, "ABI ")   == 0)
+		dc->seq = readSeqABI(fn);
+	    else if (strcmp(format, "ALF ")   == 0)
+		dc->seq = readSeqALF(fn);
+	    else if (strcmp(format, "SCF ")   == 0)
+		dc->seq = readSeqSCF(fn);
+	    else
+		dc->seq = readSeqPlain(fn);
+	}
+	    
+	if (dc->seq != NULLSeq)
+	    {
+		int i;
+		/*
+		 * YUK!
+		 * We need to do the following so that complement_seq() works
+		 */
+		/* All ed bases are the same are orig bases */
+		for (i=0;i<dc->seq->NorigBases;i++) dc->seq->edits[i] = i;
+	
+		/* The table of edits has all its entries free */
+		for (i=0;i<MaxEdits;i++) dc->seq->edBasePos[i] = NULLPoint;
+	
+		oppInitialize();
+		if (bottom) complement_seq(dc->seq);
+		setScaleFactor(dc,1.0);
+
+		/*
+		  Set the `file name' and `number of bases' labels.
+		  */
+		displaySeq(dc->seq,
+			   ((baseNum!=NULLBaseNum)                 &&
+			    (baseNum>=0)                           &&
+			    (baseNum<getNBases(dc->seq,OrigBases))
+			    )
+			   ? baseNum
+			   : NULLBaseNum,
+			   leftCutOff,
+			   cutLength,
+			   baseSpacing
+			   );
+	    }
+	else
+	    XBell(XtDisplay(traceManagerShellWidget), 100);
+    } else {
+	repositionSeq(dc->seq,
+		      ((baseNum!=NULLBaseNum)                 &&
+		       (baseNum>=0)                           &&
+		       (baseNum<getNBases(dc->seq,OrigBases))
+		       )
+		      ? baseNum
+		      : NULLBaseNum
+		      );
+	
+    }
+
+}
+
+
+
+/* ---- Callbacks ---- */
+static void quitCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{   /* hide the trace manager widget */
+    XtPopdown(traceManagerShellWidget);
+    destroyContexts();
+    tman_state = TMAN_STATE_DOWN;
+}
+
+static void hideCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{   /* hide the trace manager widget */
+    tman_state = TMAN_STATE_DOWN;
+    XtPopdown(traceManagerShellWidget);
+}
+
+/* ---- Exported functions ---- */
+
+
+void CreateTraceManager(Widget parentWid)
+{
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        Create the widgets
+    */
+    traceManagerShellWidget = XtCreatePopupShell ("Traces",
+                                      topLevelShellWidgetClass,
+                                      parentWid,
+                                      NULL, (Cardinal) 0);
+
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+					traceManagerShellWidget,
+					NULL, (Cardinal) 0);
+
+    
+    /*
+    ** Box widget for buttons
+    */
+    nargs = 0;
+    buttonWid = XtCreateManagedWidget("buttons",  boxWidgetClass,
+				   mainFormWid, args, nargs);
+
+    nargs = 0;
+    quitWid = XtCreateManagedWidget("quit",  commandWidgetClass,
+				   buttonWid, args, nargs);
+    XtAddCallback(quitWid, XtNcallback, quitCallback,  NULL);
+
+    nargs = 0;
+    hideWid = XtCreateManagedWidget("hide",  commandWidgetClass,
+				   buttonWid, args, nargs);
+    XtAddCallback(hideWid, XtNcallback, hideCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, buttonWid); nargs++;
+    traceForm = XtCreateManagedWidget("traceForm",  formWidgetClass,
+				   mainFormWid, args, nargs);
+
+    initialiseDisplayContexts();
+    tman_state = TMAN_STATE_DOWN;
+
+}
+
+
+void manageTrace(
+	char *format,
+	char *rawDataFile,
+	int baseNum,
+	int leftCutOff,
+	int cutLength,
+	int complimented,
+	int baseSpacing,
+	char *traceTitle
+	)
+{
+    DisplayContext *dc,*lastdc;
+    char *traceName;
+
+
+    if ((traceName=(char *)strrchr(rawDataFile,'/'))==NULL)
+	traceName = rawDataFile;
+    else
+	traceName++;
+
+    lastdc = getLastDisplayContext();
+    dc = getDisplay(traceForm,(lastdc==NULL)?NULL:lastdc->mainFormWid,traceName, traceTitle);
+
+    if (tman_state == TMAN_STATE_DOWN) {
+	XtPopup(traceManagerShellWidget,XtGrabNone);
+	XSync(XtDisplay(traceManagerShellWidget),False);
+	/* We need to sleep here to let X perform its
+	** geometry negotiation correctly.
+	** It's a bit of a kludge...sorry!
+	*/
+	sleep(1);
+	tman_state = TMAN_STATE_UP;
+    }
+
+    initialDisplayedSeq(dc, format, rawDataFile, baseNum,
+	leftCutOff, cutLength, complimented, baseSpacing);
+
+    XawFormDoLayout(traceForm,True);
+
+
+}
diff --git a/src/staden/tman_main.h b/src/staden/tman_main.h
new file mode 100644
index 0000000..f28d0ca
--- /dev/null
+++ b/src/staden/tman_main.h
@@ -0,0 +1,33 @@
+/*
+    Title: 	 tman_main.h
+
+    File: 	 tman_main.h
+    Last update: Tueday 9 July 1991
+
+    Change log:
+
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system.
+*/
+
+#include <X11/Intrinsic.h>
+
+
+
+/* ---- Exports ---- */
+extern void CreateTraceManager(Widget parentWid);
+
+extern void manageTrace(
+	char *format,
+	char *rawDataFile,
+	int baseNum,
+	int leftCutOff,
+	int cutLength,
+	int complimented,
+	int baseSpacing,
+	char *traceTitle
+	);
diff --git a/src/staden/tman_traceDisplay.c b/src/staden/tman_traceDisplay.c
new file mode 100644
index 0000000..432595e
--- /dev/null
+++ b/src/staden/tman_traceDisplay.c
@@ -0,0 +1,439 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.c
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    The cutoffs are actually drawn between the positions of the
+    bases on either side.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "tman_traceDisplay.h"
+#include "tman_display.h"      /* IMPORT: userEvent */
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+#include "values.h"       /* IMPORT: M_SQRT2 */
+#include <stdio.h>
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Statics ---- */
+
+static GC Agc, Cgc, Ggc, Tgc;
+static Pixel normBackground, dimBackground;
+
+
+
+
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(DisplayContext *dc, int point)
+{   return(((point * dc->plotWidth) / (dc->NPoints-1)) + dc->plotEdgeOffset);
+}
+
+
+static int pixelToPoint(DisplayContext *dc, int pixel)
+{   return(((pixel-dc->plotEdgeOffset) * (dc->NPoints-1)) / dc->plotWidth);
+}
+
+
+static int scaleTrace(DisplayContext *dc, int y)
+{
+    return(dc->traceHeight -
+	   (int)(dc->scaleFactor * (float)y * (float)dc->traceHeight / dc->maxTraceVal)
+           );
+}
+
+
+
+static int leftCutoffPoint(DisplayContext *dc, int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(dc->seq, OrigBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(dc->seq, OrigBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(DisplayContext *dc, int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NorigBases = getNBases(dc->seq, OrigBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? dc->NPoints-1
+	 : baseNumToPoint(dc->seq, OrigBases, NorigBases-baseNum);
+    pR = (baseNum == 0)
+         ? dc->NPoints-1
+	 : baseNumToPoint(dc->seq, OrigBases, NorigBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void drawTrace(DisplayContext *dc, Widget wid, char base, int p0, int pN)
+/*
+    Draw the `base' trace between points p0..pN (inclusive)
+*/
+{   GC gc;
+    int pSeg, pSeg0, pSegN;
+#define NSegs 500
+    int traces[NSegs];
+    XPoint xPoint[NSegs];
+
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+    }
+
+
+    /*
+        The range (p0..pN) is drawn as a series of overlapping
+	segments (pSeg0..pSegN).
+    */
+    pSegN = p0;
+    do
+    {   int i;
+
+	/*
+	    This segment carries on from and including the last point
+	    of the last segment.
+	*/
+	pSeg0 = pSegN;
+	pSegN = ((pSeg0+NSegs-1)<=pN) ? pSeg0+NSegs-1 : pN;      
+
+	/* Get the traces for this segment */
+	getTraces(dc->seq, base, pSeg0, pSegN, traces);
+
+	/* Translate to pixel coords and place in an XPoint array */
+	pSeg = pSeg0;
+	i = 0;
+	while (pSeg<=pSegN)
+	{   xPoint[i].x = pointToPixel(dc,pSeg);
+	    xPoint[i].y = scaleTrace(dc,traces[i]);
+	    pSeg++;
+	    i++;
+	}
+
+	/* Draw the segments in the XPoint array */
+	XDrawLines(XtDisplay(wid),
+		   XtWindow(wid),
+		   gc,
+		   xPoint,
+		   pSegN-pSeg0+1,
+		   CoordModeOrigin);
+    } while (pSegN < pN);
+}
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN; /* Affected region: pixels */
+    int p0, pN; /* Affected region: points */
+    int leftCutoffP,  rightCutoffP;
+    DisplayContext *dc = widgetToDisplayContext(XtParent(XtParent(XtParent(wid))));
+    
+
+
+    if ((dc->seq == NULLSeq) || isBasesOnly(dc->seq)) return;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+    /* Trim it so we only consider the plot proper */
+    if ((x0>dc->plotWidth+dc->plotEdgeOffset) || (xN<dc->plotEdgeOffset)) return;
+    x0 = (x0<dc->plotEdgeOffset)           ? dc->plotEdgeOffset           : x0;
+    xN = (xN>dc->plotWidth+dc->plotEdgeOffset) ? dc->plotWidth+dc->plotEdgeOffset : xN;
+
+    /* Convert to affected points */
+    p0 = pixelToPoint(dc,x0);
+    pN = pixelToPoint(dc,xN);
+    if (pN<dc->NPoints-1) pN++;
+
+    leftCutoffP  = leftCutoffPoint(dc,dc->leftCutoff);
+    rightCutoffP = rightCutoffPoint(dc,dc->rightCutoff);
+      
+
+    if (p0<leftCutoffP)
+    /*
+        Some of the exposed region is in the left cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = p0;
+       int lastP  = (pN<leftCutoffP) ? pN : leftCutoffP;
+
+       int pix0 = pointToPixel(dc,firstP);
+       int pixN = pointToPixel(dc,lastP);
+
+       XSetWindowBackground(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+		  pix0, 0,
+		  pixN-pix0, dc->traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+			    normBackground);
+    }
+      
+
+    if (pN>rightCutoffP)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = (p0>rightCutoffP) ? p0 : rightCutoffP;
+       int lastP  = pN;
+
+       int pix0 = pointToPixel(dc,firstP);
+       int pixN = pointToPixel(dc,lastP);
+
+       XSetWindowBackground(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+		  pix0, 0,
+		  pixN-pix0, dc->traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+			    normBackground);
+    }
+
+    /*
+        Now draw the traces.
+	The GC background is unused when drawing lines, so we do
+	not need to change it depending on whether we are in the
+	cutoff region or not.
+    */
+    drawTrace(dc,wid, 'A', p0, pN);
+    drawTrace(dc,wid, 'C', p0, pN);
+    drawTrace(dc,wid, 'G', p0, pN);
+    drawTrace(dc,wid, 'T', p0, pN);
+}
+
+
+
+
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Arg args[10];
+    int nargs;
+    DisplayContext *dc = widgetToDisplayContext(XtParent(XtParent(XtParent(wid))));
+
+    /*
+	This routine can be called in two ways.
+	(a) as a callback when the trace is resized by the user
+	(b) indirectly when setGraphWidth alters the graph width
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &dc->traceHeight); nargs++;
+    XtSetArg(args[nargs], XtNwidth,  &dc->traceWidth);  nargs++;
+    XtGetValues(wid, args, nargs);
+
+    if (XtIsRealized(wid)) XClearWindow(XtDisplay(wid), XtWindow(wid));
+}
+
+
+
+
+/* ---- Exports ---- */
+
+static void redisplayTraces(DisplayContext *dc)
+{
+    /*
+    **  Clear the area and generate an expose.
+    */
+    XClearArea(XtDisplay(dc->traceWid), XtWindow(dc->traceWid),
+		0, 0,
+		0, 0,
+		True);
+}
+
+
+void incScaleFactor(DisplayContext *dc)
+{
+    dc->scaleFactor *= M_SQRT2;
+    redisplayTraces(dc);
+}
+
+void decScaleFactor(DisplayContext *dc)
+{
+    dc->scaleFactor /= M_SQRT2;
+    redisplayTraces(dc);
+}
+
+
+void setScaleFactor(DisplayContext *dc, float sf)
+{
+    dc->scaleFactor = sf;
+    redisplayTraces(dc);
+}
+
+
+
+
+
+void createTraceDisplay(DisplayContext *dc, Widget parentWid, Widget fromVertWid,
+			Dimension width, Dimension offset)
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `offset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    width);       nargs++;
+    dc->traceWid = XtCreateManagedWidget("trace", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(dc->traceWid, XtNexposeCallback, exposeCallback, NULL);
+    XtAddCallback(dc->traceWid, XtNresizeCallback, resizeCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &dc->traceHeight); nargs++;
+    XtGetValues(dc->traceWid, args, nargs);
+    dc->traceWidth     = width;
+    dc->plotWidth      = dc->traceWidth-(2*dc->plotEdgeOffset);
+    dc->plotEdgeOffset = offset;
+
+    dc->seq = NULLSeq;
+    dc->scaleFactor = 1.0;
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(dc->traceWid);
+    Cgc = GraphGC2(dc->traceWid);
+    Ggc = GraphGC3(dc->traceWid);
+    Tgc = GraphGC4(dc->traceWid);
+
+    /*
+        Get the backgrounds
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNbackground,    &normBackground);    nargs++;
+    XtSetArg(args[nargs], XtNdimBackground, &dimBackground); nargs++;
+    XtGetValues(dc->traceWid, args, nargs);
+}
+
+
+
+
+void initTraceDisplay(Seq seq)
+/*
+    Initialise the trace display of sequence `seq'.
+*/
+{
+    DisplayContext *dc = seqToDisplayContext(seq);
+    if (seq == NULLSeq) return;
+    dc->seq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    dc->NPoints     = getNPoints(dc->seq);
+    dc->maxTraceVal = getMaxTraceVal(dc->seq);
+    getCutoffs(dc->seq, &dc->leftCutoff, &dc->rightCutoff);
+}
+
+
+
+
+
+
+void setTraceWidth(DisplayContext *dc, Dimension width)
+/*
+    Set the width of the current sequence trace display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(dc->traceWid,   args, nargs);
+
+    dc->traceWidth = width;
+    dc->plotWidth  =  dc->traceWidth-(2*dc->plotEdgeOffset);
+
+    /*
+        Clear the graph.
+    */
+    if (XtIsRealized(dc->traceWid))
+        XClearWindow(XtDisplay(dc->traceWid), XtWindow(dc->traceWid));
+}
+
+
+
+
+
+
+
+
+
+
diff --git a/src/staden/tman_traceDisplay.h b/src/staden/tman_traceDisplay.h
new file mode 100644
index 0000000..49276c6
--- /dev/null
+++ b/src/staden/tman_traceDisplay.h
@@ -0,0 +1,68 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.h
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+#include "tman_context.h"
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget, Dimension */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void createTraceDisplay(DisplayContext *dc, Widget parentWid, Widget fromVertWid,
+			       Dimension width, Dimension offset);
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `plotEdgeOffset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+
+
+extern void initTraceDisplay(Seq seq);
+/*
+    Initialise the trace display of sequence `seq'.
+*/
+
+
+extern void setTraceWidth(DisplayContext *dc, Dimension width);
+/*
+    Set the width of the current sequence trace display.
+*/
+
+
+extern void incScaleFactor();
+/*
+    Increment vertical scaling of trace
+*/
+
+
+extern void decScaleFactor();
+/*
+    Decrement vertical scaling of trace
+*/
+
+
+extern void setScaleFactor(DisplayContext *dc, float sf);
+/*
+    Set vertical scaling
+*/
+
diff --git a/src/staden/trans.c b/src/staden/trans.c
new file mode 100644
index 0000000..e3257c1
--- /dev/null
+++ b/src/staden/trans.c
@@ -0,0 +1,35 @@
+#include <X11/Intrinsic.h>
+/*
+** We don't want to have all the textwidget translations
+** Here are the list of the ones we want.
+** This is taken from Xaw/TextTr.c source
+*/
+char *defaultTranslations = "\
+<Key>Right:	forward-character() \n\
+<Key>Left:	backward-character() \n\
+<Key>Delete:	delete-previous-character() \n\
+<Key>BackSpace:	delete-previous-character() \n\
+<Key>:		insert-char() \n\
+<FocusIn>:	focus-in() \n\
+<FocusOut>:	focus-out() \n\
+<Btn1Down>:	select-start() \n\
+<Btn1Motion>:	extend-adjust() \n\
+<Btn1Up>:	extend-end(PRIMARY, CUT_BUFFER0) \n\
+<Btn2Down>:	insert-selection(PRIMARY, CUT_BUFFER0) \n\
+<Btn3Down>:	extend-start() \n\
+<Btn3Motion>:	extend-adjust() \n\
+<Btn3Up>:	extend-end(PRIMARY, CUT_BUFFER0) \
+";
+
+void my_translations(Widget w)
+/*
+** Use my default translations
+*/
+{
+    XtTranslations parsedTranslations;
+
+    parsedTranslations = XtParseTranslationTable(defaultTranslations);
+
+    XtUninstallTranslations(w);
+    XtAugmentTranslations(w,parsedTranslations);
+}
diff --git a/src/staden/undo.c b/src/staden/undo.c
new file mode 100644
index 0000000..3275858
--- /dev/null
+++ b/src/staden/undo.c
@@ -0,0 +1,157 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include "undo.h"
+#include "edUtils.h"
+
+UndoBucketPtr bucketStack;
+
+/*
+** undo handling code
+*/
+
+static UndoBucketPtr bucketFreeList = NULL;
+static UndoBucketPtr newBucket()
+/*
+** Create a new undo bucket for the undo stack
+*/
+{
+    UndoBucketPtr p;
+
+    if (bucketFreeList == NULL) {
+	p = (UndoBucketPtr) malloc (sizeof(UndoBucket));
+    } else {
+	p = bucketFreeList;
+	bucketFreeList = bucketFreeList->next;
+    }
+    return p;
+}
+
+static void freeBucket(UndoBucketPtr p)
+/*
+** Free an undo bucket and store it in a free list
+*/
+{
+    p->next = bucketFreeList;
+    bucketFreeList = p;
+}
+
+static UndoBucketPtr popBucket(UndoBucketPtr *stack)
+/*
+** Get the undo bucket that's on top of and undo stack
+*/
+{
+    UndoBucketPtr p;
+
+    if (*stack == NULL) {
+        p = NULL;
+    } else {
+        p = *stack;
+        *stack = (*stack)->next;
+    }
+    return p;
+}
+
+
+static void pushBucket(UndoBucketPtr *stack,UndoBucketPtr p)
+/*
+** Push a bucket onto an undo stack
+*/
+{
+    p->next = *stack;
+    *stack = p;
+}
+ 
+void cleanUpStack(UndoBucketPtr *stack)
+/*
+** free all stack buckets
+*/
+{
+    UndoBucketPtr p;
+    while ( (p = popBucket(stack)) != NULL )
+	freeBucket(p);
+}
+
+void recordEdit (UndoBucketPtr *stack, EdStruct *xx, int undoEvent, int seq, int pos, char ch, int time)
+/*
+** record edits in an undo bucket and stick it on a stack
+*/
+{
+    UndoBucketPtr p;
+
+#define record_multi_edits
+#ifndef record_multi_edits
+    static int state = 0;
+
+    if (state) {
+	if (undoEvent == undoMark) state = 0;
+    } else {
+	cleanUpStack(stack);
+	if (undoEvent == undoMark) state = 1;
+    }
+#endif
+
+    p = newBucket();
+    if (p==NULL) {
+        fprintf (stderr,"error: Undo stack is full... purging\n");
+	cleanUpStack(stack);
+        p = newBucket();
+    }
+    if (p!=NULL) {
+        p->event     = undoEvent;
+        p->sequence  = seq;
+        p->position  = pos;
+        p->character = ch;
+	p->xx        = xx;
+	p->time      = time;
+        pushBucket (stack,p);
+    }
+}
+
+void retrieveEdit (UndoBucketPtr *stack, EdStruct **xx, int *undoEvent, int *seq, int *pos, char *ch, int *time)
+/*
+** Get the last edit made from the undo stack and return its details
+*/
+{
+    UndoBucketPtr p;
+    p = popBucket(stack);
+    if (p==NULL) {
+	*undoEvent = undoNullCommand;
+    } else  {
+        *undoEvent = p->event;
+        *seq       = p->sequence;
+        *pos       = p->position;
+        *ch        = p->character;
+	*xx        = p->xx;
+	*time      = p->time;
+        freeBucket(p);
+    }
+}
+
+void cleanUpAllStacks()
+/*
+** free all stacks
+*/
+{
+    cleanUpStack(&bucketStack);
+}
+
+int editsMade(EdStruct *xx)
+/*
+**    0 - no edits made
+**    1 - edits made
+*/
+{
+    int i;
+    for (i=1; i<=xx->DB_gelCount; i++)
+	if ( DBgetFlags(xx,i)&(DB_FLAG_SEQ_MODIFIED|DB_FLAG_REL_MODIFIED|DB_FLAG_TAG_MODIFIED) )
+	    return 1;
+    return 0;
+}
+
+int get_uniq_id()
+{
+    static int count = 0;
+
+    return ++count;
+
+}
diff --git a/src/staden/undo.h b/src/staden/undo.h
new file mode 100644
index 0000000..1f27d02
--- /dev/null
+++ b/src/staden/undo.h
@@ -0,0 +1,36 @@
+#ifndef _undo_h
+#define _undo_h
+
+#include "edUtils.h"
+
+enum undoEvents {
+	undoDelete=0,
+	undoInsert,
+	undoReplace,
+	undoExtend,
+	undoUnextend,
+	undoNullCommand,
+	undoMark
+     };
+
+typedef struct bnode *UndoBucketPtr;
+
+typedef struct bnode{
+        int event;
+        int sequence;
+        int position;
+        int character;
+        struct bnode *next;
+        EdStructPtr xx;
+	int time;
+        } UndoBucket;
+
+extern UndoBucketPtr bucketStack;
+
+extern void cleanUpStack(UndoBucketPtr *stack);
+extern void recordEdit (UndoBucketPtr *stack, EdStruct *xx, int undoEvent, int seq, int pos, char ch, int time);
+extern void retrieveEdit (UndoBucketPtr *stack, EdStruct **xx, int *undoEvent, int *seq, int *pos, char *ch, int *time);
+extern void cleanUpAllStacks();
+extern int editsMade(EdStruct *xx);
+extern int get_uniq_id();
+#endif /* _undo_h */
diff --git a/src/staden/userface.c b/src/staden/userface.c
new file mode 100644
index 0000000..f9d45cc
--- /dev/null
+++ b/src/staden/userface.c
@@ -0,0 +1,499 @@
+/*-*-c-*-*/
+/*
+ * Routines to deal with the user interface in non X versions of the programs.
+ * C routines provided:
+ *   getint, getfloat, yesno, yesono, gtstr, radion, checkn, getopt, showfi,
+ *   showfu, errom, busy, bpause.
+ * FORTRAN routines provided:
+ *   GETINT, GETRL, GETRLS, YESNO, YESONO, GTSTR, GETSTR, RADION, CHECK4,
+ *   GETOPT, SHOWFI, SHOWFU, ERROM, BUSY, BPAUSE.
+ */
+
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <math.h>
+#include <sys/types.h>
+#include "userface.h"
+#include "helpnmenu.h"
+#include "nxhelpmenu.h"
+
+/*--------------------------------------------------------------------------*\
+|*									    *|
+|* Routines only internally used by userface.c. 			    *|
+|*									    *|
+\*--------------------------------------------------------------------------*/
+
+static char rdbuf[256];
+/*
+ * Reads a string from stdin storing (excluding newline at end) in buf.
+ * Args:
+ *   buf: where to store string
+ *   len: size of buf.
+ * Returns:
+ *   length of string (0 for null str),
+ *   -1 for help '?' (handled by this func.)
+ *   -2 for quit '!', 
+ *   -3 for error (fgets() failed).
+ */
+static size_t rdstr(char *buf, size_t len) {
+    size_t l = 0;
+
+    *buf = '\0';
+
+    if (len == 0)
+	return 0;
+
+#ifdef notdef
+    if (fgets(buf, len+2, stdin) == NULL) {
+	fputs("No input available!", stderr);
+	(void)fflush(stderr);
+	return -3;
+    }
+
+    /* remove trailing newline */
+    l = strlen(buf)-1;
+    /*
+     * A bit silly checking - if it's not got a newline on the end then
+     * something is wrong (we'll get it on the next read!)
+     */
+    if (buf[l] == '\n')
+	buf[l]='\0';
+#endif
+    /*
+     * We use our own getline code as fgets can cause some major hassles.
+     * Firstly if you wish to read 10 characters then you need to tell fgets
+     * to read 11. Then fgets is stubborn and only reads 10 of the 11 and
+     * leaves the newline on the input. It can be fixed by reading 12 of
+     * course - but it's a bit hacky!
+     */
+    for(;;) {
+	int c = getchar();
+
+	if (c == EOF) {
+	    fputs("No input available!", stderr);
+	    (void)fflush(stderr);
+	    return -3;
+	}
+	
+	if (c == '\n')
+	    break;
+	buf[l++] = c;
+    } 
+    buf[l] = '\0';
+
+    if (l > 0)
+	if (*buf == '?') {
+	    /* if already in interactive help mode then 'press 1' */
+	    if (query_opt() == -2) {
+		buf[0] = '1';
+		buf[1] = '\0';
+		return 1;
+	    }
+	    switch(l) {
+	    default:
+		if (buf[1] == '?') {
+		    ihelp();
+		    break;
+		} else if (query_opt() == -1) {
+		    help2(atoi(&buf[1]));
+		    break;
+		}
+	    case 1:
+		help();
+	    }
+	    return -1;
+	} else if (*buf == '!')
+	    return -2;
+    
+    return l;
+}
+
+/*
+ * Reads an integer from stdin. Note that '6G' returns '6'.
+ * Args:
+ *   status: returned status indicating validity of returned integer.
+ *   >1 = success
+ *    0 = null entry
+ *   -1 = re ask question (help called or non number typed in)
+ *   -2 = quit requested
+ *   -3 = read error.
+ * Returns:
+ *   the integer read (if valid) or 0 otherwise.
+ */
+static int rdint(int *status) {
+    int num;
+    char *end;
+
+    *status = rdstr(rdbuf, sizeof(rdbuf));
+
+    if (*status > 0) {
+	num = (int)strtol(rdbuf, &end, 10);
+	while (*end == ' ' || *end == '\t')
+	    end++;
+	return *end ? -1 : num;
+    } else
+	return 0;
+}
+
+/*
+ * Reads an float from stdin. Note that '6G' returns '6'.
+ * Args:
+ *   status: returned status indicating validity of returned float.
+ *   >1 = success
+ *    0 = null entry
+ *   -1 = re ask question (help called or non number typed in)
+ *   -2 = quit requested
+ *   -3 = read error.
+ * Returns:
+ *   the float read (if valid) or 0 otherwise.
+ */
+static float rdfloat(int *status) {
+    float num;
+    char *end;
+
+    *status = rdstr(rdbuf, sizeof(rdbuf));
+
+    if (*status > 0) {
+	num = (float)strtod(rdbuf, &end);
+	while (*end == ' ' || *end == '\t')
+	    end++;
+	return *end ? -1 : num;
+    } else
+	return 0;
+}
+
+/*--------------------------------------------------------------------------*\
+|*									    *|
+|* C interface routines							    *|
+|*									    *|
+\*--------------------------------------------------------------------------*/
+
+/*
+ * Reads an integer in a given range from stdin.
+ * Args:
+ *   minval: lower end of range (inclusive)
+ *   maxval: upper end of range (inclusive)
+ *   defval: default integer
+ *   prompt: question to hassle user with
+ *   status: returned status on validity of returned value
+ *      0 = ok
+ *     -2 = quit
+ *     -3 = error
+ * Returns:
+ *   minval <= int <= maxval
+ */
+int getint(int minval, int maxval, int defval, char *prompt, int *status) {
+    int val;
+
+    do {
+	printf(" ? %s (%d-%d) (%d) =", prompt, minval, maxval, defval);
+	(void)fflush(stdout);
+	val = rdint(status);
+    } while (*status == -1 || (*status > 0 && (val<minval || val>maxval)));
+
+    if (*status == 0)
+	return defval;
+    else if (*status > 0)
+	return *status = 0, val;
+    else
+	return 0;
+}
+
+/*
+ * Reads a float in a given range from stdin.
+ * Args:
+ *   minval: lower end of range (inclusive)
+ *   maxval: upper end of range (inclusive)
+ *   defval: default float
+ *   prompt: question to hassle user with
+ *   status: returned status on validity of returned value
+ *      0 = ok
+ *     -2 = quit
+ *     -3 = error
+ *   precision: how accurate to display the range and default values.
+ * Returns:
+ *   minval <= float <= maxval
+ */
+float getfloat(float minval, float maxval, float defval, char *prompt,
+	      int *status, int precision) {
+    float val;
+
+    do {
+	printf(" ? %s (%.*f-%.*f) (%.*f) =", prompt, precision, minval,
+	       precision, maxval, precision, defval);
+	(void)fflush(stdout);
+	val = rdfloat(status);
+    } while (*status == -1 || (*status > 0 && (val<minval || val>maxval)));
+
+    if (*status == 0)
+	return defval;
+    else if (*status > 0)
+	return *status = 0, val;
+    else
+	return 0;
+}
+
+/*
+ * Prompts the user with a yes/no question (defaults to yes).
+ * Args:
+ *   prompt: what to ask.
+ * Returns:
+ *    0 = yes
+ *    1 = no
+ *   -1 = cancel
+ */
+int yesno(char *prompt) {
+    int answer, val;
+
+    do {
+	printf(" ? %s (y/n) (y) = ", prompt);
+	(void)fflush(stdout);
+	val = rdstr(rdbuf, sizeof(rdbuf));
+	if (val == 0)
+	    answer = 0; /* default yes */
+	else if (*rdbuf == 'y' || *rdbuf == 'Y')
+	    answer = 0;
+	else if (*rdbuf == 'n' || *rdbuf == 'N')
+	    answer = 1;
+	else
+	    answer = -1;
+    } while (val > -2 && answer < 0);
+
+    if (val >= 0)
+	return answer;
+    else /* val == -2 (quit) || val == -3 (err) */
+	return -1;
+}
+
+/*
+ * Reads in a string from stdin.
+ * Args:
+ *   prompt: obvious
+ *   defval: default string (if user types in nothing)
+ *   out   : where to store the actual string read.
+ *   outlen: sizeof(out)
+ * Returns:
+ *   -1 : cancel
+ *    0 : ok
+ *    1 : ok, but used default (blank string if no default)
+ */
+int gtstr(char *prompt, char *defval, char *out, size_t outlen) {
+    size_t l;
+    
+    do {
+	if (defval)
+	    printf(" Default %s=%s\n", prompt, defval);
+	printf(" ? %s=", prompt);
+	(void)fflush(stdout);
+	
+	l = rdstr(out, outlen);
+	switch (l) {
+	case -3:
+	case -2:
+	    *out = '\0';
+	    return -1;
+	case 0:
+	    if (defval) {
+		l = strlen(defval);
+		strncpy(out, defval, (l>outlen)?outlen:l);
+	    } else
+		*out = '\0';
+	    return 1;
+	case -1:
+	    /* help called */
+	    break;
+	default:
+	    return 0;
+	}
+    } while (1);
+}
+
+/*
+ * Displays a menu (analogue of X radio buttons) of options and asks
+ * for a selection. An 'X' is displayed next to the default option.
+ * Args:
+ *   title  : short description of menu
+ *   options: list of options to display
+ *   numopts: how many options to display
+ *   def    : default option to chose.
+ * Returns:
+ *   -1 = cancel
+ *   otherwise the option number selected.
+ */
+int radion(char *title, char **options, int numopts, int def) {
+    int i, status, ret;
+
+    do {
+	printf(" %s\n", title);
+	/* display options and prompt for selection */
+	for (i=0; i<numopts; i++)
+	    printf(" %c%3d %s\n", ((i+1) == def)?'X':' ', i+1, options[i]);
+	printf(" ? Selection (1-%d) (%d) =", numopts, def);
+	(void)fflush(stdout);
+
+	/* get the user response */
+	ret = rdint(&status);
+    } while (status == -1 || (status > 0 && (ret > numopts || ret < 1)));
+
+    if (status >= 0)
+	return (ret==0)?def:ret;
+    else
+	return -1;
+}
+
+/*
+ * Displays a list of 'n' toggle-able items. An 'X' is displayed next to any
+ * currently selected items. User types in a number to toggle each item, or
+ * 0 to quit.
+ * Args:
+ *   num    : how many items to toggle
+ *   prompts: list of names for each item
+ *   bools  : location of list of initial boolean states for items (set or
+ *            unset). Also when returning, the final selected boolean states.
+ * Returns:
+ *   -1 = cancel
+ *    0 = ok
+ */
+int checkn(int num, char **prompts, int **bools) {
+    int i, ret, status;
+
+    do {
+	/* display options to toggle */
+	puts(" checkbox: those set marked X");
+	for (i=0; i<num; i++)
+	    printf(" %c%2d %s\n", (*bools)[i]?'X':' ', i+1, prompts[i]);
+	printf(" ? (0-%d) =", num);
+	(void)fflush(stdout);
+
+	/* read user response */
+	ret = rdint(&status);
+	if (status >= 0 && ret >= 1 && ret <= num) {
+	    /* toggle flag by exclusive or-ing with 1 */
+	    (*bools)[ret-1] ^= 1;
+	}
+    } while (status >= -1 && ret != 0);
+    if (status < 0)
+	return -1;
+    else
+	return 0;
+}
+
+/*
+ * Reads in an 'option number' from stdin.
+ * Takes into account requesting dialogue on an option (d), menu listing (m),
+ * and help (?) on general or specific items.
+ * Args:
+ *   status: contains information about the 'int' value returned.
+ *     -3 = error
+ *     -2 = quit (!) (always returns 2)
+ *     -1 = general help requested.
+ *      0 = normal
+ *      1 = dialogue requested
+ *      2 = help on specific subject.
+ *      3 = menu option
+ * Returns:
+ *   negative value if a menu asked for (menu 'x' returns '-x')
+ *   postive value for option selected.
+ *   0 for no selection.
+ */
+int getcopt(int *status) {
+    int ret;
+
+    ret = rdstr(rdbuf, sizeof(rdbuf));
+    if (ret <= -2) { /* quit/error */
+	*status = ret; 
+	return 2;
+    } else if (ret == -1) { /* help */
+	*status = -1;
+	return 0;
+#if 0
+	if (rdbuf[1] != '\0') {
+	    *status = 2;
+	    return atoi(&rdbuf[1]);
+	} else {
+	    *status = 0;
+	    return 1;
+	}
+#endif
+    } else if (ret > 0) { /* ok - +ve length string */
+	if (*rdbuf == 'd' || *rdbuf == 'D') {
+	    *status = 1;
+	    return atoi(&rdbuf[1]);
+	} else if (*rdbuf == 'm' || *rdbuf == 'M') {
+	    *status = 3;
+	    /*
+	     * Menus are negative option numbers.
+	     * Should really safeguard against people typing 'm-2' to
+	     * quit etc - but well... it's fun!
+	     */
+	    return -atoi(&rdbuf[1]);
+	} else {
+	    /* simple number */
+	    *status = 0;
+	    return atoi(rdbuf);
+	}
+    } else { /* ret == 0 (no info) */
+	*status = 0;
+	return 0;
+    }
+}
+
+/*
+ * Display name of current file - unused in non X version.
+ */
+void showfi(char *func) {
+}
+
+/*
+ * Display name of current function.
+ * Args:
+ *   func: Name of function to display with no newline on the end.
+ */
+void showfu(char *func) {
+    puts(func);
+    (void)fflush(stdout);
+}
+
+void showfunc() {
+    if (helpindex[query_opt()].name) {
+	printf(" %s\n",helpindex[query_opt()].name);
+	(void)fflush(stdout);
+    }
+}
+
+/*
+ * Print an error message.
+ * Args:
+ *   errmsg: error with no trailing newline.
+ */
+void errom(char *errmsg) {
+    fprintf(stdout, "%s\n", errmsg);
+    fflush(stdout);
+}
+
+/*
+ * Hardly worth explaining.
+ */
+void busy() {
+    puts(" Working");
+    fflush(stdout);
+}
+
+/*
+ * Beeps and waits for return to be pressed.
+ * Returns:
+ *    0 for ok
+ *   -1 for cancel
+ */
+int bpause() {
+    register int r;
+
+    /* assume char 7 is bell - ASCII dependant? */
+    putchar(7);
+    fflush(stdout);
+
+    r = rdstr(rdbuf, sizeof(rdbuf));
+    return r<-1 ? -1 : 0;
+}
+
diff --git a/src/staden/userface.f b/src/staden/userface.f
new file mode 100644
index 0000000..be77145
--- /dev/null
+++ b/src/staden/userface.f
@@ -0,0 +1,655 @@
+C  user interface routines: i.e. they deal with user interaction
+C  like string input, dialogue boxes, radio buttons, check boxes
+C  19-11-90 New version of radion to look more like other routines
+C  11-12-90 Set maxbox in radion to 20 (was 14)
+C  8-7-91 removed radio, modified openf1 to deal with blank file names
+C
+C   HELP
+C   HELP OPTION
+C   THE HELP FILE IS HELPF. THE FIRST AND LAST RECORD NUMBERS
+C   DESCRIBING EACH OPTION IN THIS FILE ARE STORED IN HELPS(OPTION)
+C   AND HELPE(OPTION). THIS ROUTINE SIMPLY LISTS THE INTERVENING RECORDS
+      SUBROUTINE HELP(HELPS,HELPE,BOTOPT,TOPOPT,HELPF,IDEV,KBIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (IPAGE=22)
+      INTEGER BOTOPT,TOPOPT
+      CHARACTER HELPF*(*),SPACE
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+        CHARACTER LINE*80
+      INTEGER NOTIRL
+      EXTERNAL NOTIRL
+      SAVE SPACE
+      DATA SPACE/' '/
+1002  FORMAT(' ',A)
+      CALL OPENRS(IDEV,HELPF,IOK,20,5)
+      IF(IOK.NE.0)GO TO 100
+20    CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' For information on any option type its option number.'
+     +,/,
+     +' In addition,',/,
+     +'             0 = Introduction to the program,',/,
+     +'             1 = List of options')
+      NOPT = 1
+      CALL GETINT(BOTOPT,TOPOPT,NOPT,'Option number',IVAL,
+     +KBIN,KBOUT,HELPS(1),HELPE(1),HELPF,IDEV,IOK)
+      IF(IOK.NE.0) GO TO 40
+      NOPT = IVAL
+C   COUNT LINES OUTPUT
+      IDONE=0
+      ISTART=HELPS(NOPT)
+      IEND=HELPE(NOPT)
+      I=ISTART-1
+21    CONTINUE
+      I=I+1
+      READ(IDEV,REC=I,ERR=110)LINE
+      WRITE(KBOUT,1002)LINE(1:MAX(1,NOTIRL(LINE,79,SPACE)))
+      IDONE=IDONE+1
+      IF(IDONE.EQ.IPAGE)THEN
+        CALL BPAUSE(KBIN,KBOUT,IQUIT)
+        IF(IQUIT.NE.0) GO TO 20
+        IDONE=0
+      END IF
+      IF(I.LT.IEND)GO TO 21
+      CALL BPAUSE(KBIN,KBOUT,IQUIT)
+      IF(IQUIT.NE.0) GO TO 40
+      GO TO 20
+40    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+1001  FORMAT(' Error opening help file')
+      WRITE(KBOUT,1001)
+      RETURN
+110   CONTINUE
+      WRITE(KBOUT,1010)
+1010  FORMAT(' Error reading help file')
+      END
+C   HELP2
+C   HELP OPTION FOR SINGLE FUNCTION
+C   THE HELP FILE IS HELPF. THE FIRST AND LAST RECORD NUMBERS
+C   DESCRIBING EACH OPTION IN THIS FILE ARE STORED IN HELPS
+C   AND HELPE. THIS ROUTINE SIMPLY LISTS THE INTERVENING RECORDS
+      SUBROUTINE HELP2(HELPS,HELPE,HELPF,IDEV,KBIN,KBOUT)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (IPAGE=22)
+      CHARACTER HELPF*(*),SPACE
+      INTEGER HELPS,HELPE
+        CHARACTER LINE*80
+      INTEGER NOTIRL
+      EXTERNAL NOTIRL
+      SAVE SPACE
+      DATA SPACE/' '/
+1002  FORMAT(' ',A)
+      CALL OPENRS(IDEV,HELPF,IOK,20,5)
+      IF(IOK.NE.0)GO TO 100
+C   COUNT LINES OUTPUT
+      IDONE=0
+      ISTART=HELPS
+      IEND=HELPE
+      I=ISTART-1
+21    CONTINUE
+      I=I+1
+      READ(IDEV,REC=I,ERR=110)LINE
+      WRITE(KBOUT,1002)LINE(1:MAX(1,NOTIRL(LINE,79,SPACE)))
+      IDONE=IDONE+1
+      IF(IDONE.EQ.IPAGE)THEN
+        CALL BPAUSE(KBIN,KBOUT,IQUIT)
+        IF(IQUIT.NE.0) GO TO 50
+        IDONE=0
+      END IF
+      IF(I.LT.IEND)GO TO 21
+      WRITE(KBOUT,1000)
+1000  FORMAT(' End of file')
+      CALL BPAUSE(KBIN,KBOUT,IQUIT)
+50    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+1001  FORMAT(' Error opening help file')
+      WRITE(KBOUT,1001)
+      RETURN
+110   CONTINUE
+      WRITE(KBOUT,1010)
+1010  FORMAT(' Error reading help file')
+      CLOSE(UNIT=IDEV)
+      END
+C   SETHLP
+C  READS POINTERS TO HELP FILE
+C  THESE POINTERS HAVE BEEN WRITTEN TO A FILE WHOSE NAME IS IN POINTF
+C  BY PROGRAM SETUPHELP.
+C  EACH OPTION (WITH NUMBERS BETWEEN BOTOPT AND TOPOPT)
+C  HAS ITS HELP TEXT POINTERS STORED IN HELPS(OPTION) AND HELPE(OPTION)
+C  WHERE HELPS CONTAINS THE FIRST RECORD NUMBER AND HELPE THE LAST
+C  RECORD NUMBER OF THE TEXT FILE HELPF THAT DESCRIBES THE OPTION
+      SUBROUTINE SETHLP(HELPS,HELPE,BOTOPT,TOPOPT,POINTF,IDEV,KBOUT)
+C   AUTHOR: RODGER STADEN
+      INTEGER BOTOPT,TOPOPT
+      CHARACTER POINTF*(*)
+      INTEGER HELPS(BOTOPT:TOPOPT),HELPE(BOTOPT:TOPOPT)
+      CALL OPENRS(IDEV,POINTF,IOK,LRECL,2)
+      IF(IOK.NE.0)GO TO 100
+1     CONTINUE
+      READ(IDEV,1004,END=11,ERR=110)I,J,K
+      IF((I.GE.BOTOPT).AND.(I.LE.TOPOPT))THEN
+        HELPS(I)=J
+        HELPE(I)=K
+      END IF
+      GO TO 1
+1004  FORMAT(3I6)
+11    CONTINUE
+      CLOSE(UNIT=IDEV)
+      RETURN
+100   CONTINUE
+      WRITE(KBOUT,1001)
+1001  FORMAT(' Error opening help record pointer file')
+      RETURN
+110   CONTINUE
+      WRITE(KBOUT,1005)
+1005  FORMAT(' Error reading help record pointer file')
+      RETURN
+      END
+      SUBROUTINE SHOWFU(KBOUT,STRING)
+      CHARACTER STRING*(*)
+      WRITE(KBOUT,1000)STRING
+1000  FORMAT(' ',A)
+      END
+      SUBROUTINE SHOWFI(KBOUT,STRING)
+      CHARACTER STRING*(*)
+      END
+      SUBROUTINE ERROM(KBOUT,STRING)
+      CHARACTER STRING*(*)
+      WRITE(KBOUT,1000)STRING
+1000  FORMAT(' ',A)
+      END
+      SUBROUTINE BUSY(KBOUT)
+      WRITE(KBOUT,1000)
+1000  FORMAT(' Working')
+      END
+      SUBROUTINE RADION(TITLE,PROMPT,NB,CHOICE,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      PARAMETER (MAXBOX = 20,MAXCHR = 3)
+      CHARACTER HELPF*(*),PROMPT(NB)*(*),CHECK*(MAXBOX)
+      CHARACTER TITLE*(*)
+      INTEGER CHOICE
+      CHARACTER*(MAXCHR) STR2,STR3
+      EXTERNAL NOTILR
+      EXTERNAL IGET
+      IF((CHOICE.LT.1).OR.(CHOICE.GT.NB)) THEN
+        WRITE(KBOUT,*)'Error in call to radion, default set to 1'
+        CHOICE = 1
+      END IF
+      IN = CHOICE
+      CALL FILLC(CHECK,NB,' ')
+      CHECK(IN:IN) = 'X'
+1002  FORMAT(I3)
+      WRITE(STR2,1002,ERR=30)NB
+      WRITE(STR3,1002,ERR=30)CHOICE
+10    CONTINUE
+      WRITE(KBOUT,1001)TITLE
+1001  FORMAT(' ',A)
+      DO 20 I = 1,NB
+        WRITE(KBOUT,1000)CHECK(I:I),I,PROMPT(I)
+1000  FORMAT(' ',A,I3,' ',A)
+20    CONTINUE
+      WRITE(KBOUT,1003)
+     +STR2(NOTILR(STR2,MAXCHR,' '):MAXCHR),
+     +STR3(NOTILR(STR3,MAXCHR,' '):MAXCHR)
+1003  FORMAT(' ? Selection ',' (1','-',A,') (',A,') =',$)
+      I = IGET(J,KBIN)
+      IF(I.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(I.EQ.2) THEN
+        CHOICE = -1
+        RETURN
+      END IF
+      IF(I.EQ.3) RETURN
+      IF(I.GT.0) GO TO 10
+      IF(J.GT.NB) GO TO 10
+      IF(J.LT.1) GO TO 10
+      CHOICE = MAX(1,J)
+      RETURN
+30    CONTINUE
+      WRITE(KBOUT,*)'Error in RADION! Choice set to default'
+      END
+      SUBROUTINE CHECKB(PROMPT,BOXES,NB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      PARAMETER (MAXBOX = 14)
+      CHARACTER HELPF*(*),PROMPT(NB)*(*),CHECK*(MAXBOX),D*35
+      INTEGER BOXES(NB)
+      INTEGER TOGGLE
+      EXTERNAL TOGGLE,IGET
+      SAVE D
+      DATA D/' ? 1,2,3,4,5,6,7,8,9,10,11,12,13,14'/
+      IOK = 1
+10    CONTINUE
+      CALL FILLC(CHECK,NB,' ')
+      DO 15 I = 1,NB
+        IF(BOXES(I).EQ.1) CHECK(I:I) = 'X'
+15    CONTINUE
+      WRITE(KBOUT,1002)
+1002  FORMAT(/,' checkbox: those set are marked X')
+      DO 20 I = 1,NB
+      WRITE(KBOUT,1000)CHECK(I:I),I,PROMPT(I)
+1000  FORMAT(' ',A,I3,' ',A)
+20    CONTINUE
+      WRITE(KBOUT,1001)NB+1
+1001  FORMAT('  ',I3,' ','ALL')
+      IF(NB.LE.9)THEN
+        WRITE(KBOUT,1003)D(1:2+2*NB)
+      ELSE
+        WRITE(KBOUT,1003)D(1:20+3*(NB-9))
+      END IF
+1003  FORMAT(A,' =',$)
+      IN = IGET(J,KBIN)
+      IF(IN.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(IN.EQ.2) RETURN
+      IF(IN.GT.3) GO TO 10
+      IF((IN.EQ.3).OR.(J.EQ.0)) THEN
+        IOK = 0
+        RETURN
+      END IF
+      IF((J.LE.NB).AND.(J.GT.0)) THEN
+        BOXES(J) = TOGGLE(BOXES(J))
+      ELSE IF(J.EQ.NB+1) THEN
+        CALL FILLI(BOXES,NB,1)
+      END IF
+      GO TO 10
+      END
+      INTEGER FUNCTION SIN(KBIN,STRING)
+C  AUTHOR: RODGER STADEN
+C  RETURNS:
+C          1 FOR HELP
+C          2 FOR QUIT
+C          3 FOR ALL BLANKS
+C          0 OTHERWISE
+C          ABOVE FROM HQN
+C          4 FOR READ ERROR
+      CHARACTER STRING*(*)
+      INTEGER HQN
+      EXTERNAL HQN
+      READ(KBIN,1000,ERR=10,END=10)STRING
+1000  FORMAT(A)
+      SIN = HQN(STRING)
+      RETURN
+10    CONTINUE      
+      SIN = 4
+      END
+      INTEGER FUNCTION IGET(IN,KBIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS:
+C          1 FOR HELP
+C          2 FOR QUIT
+C          3 FOR ALL BLANKS
+C          4 FOR READ ERROR
+C          0 OTHERWISE
+      CHARACTER STRING*10
+      INTEGER SIN
+      EXTERNAL SIN
+      IN = 0
+      IGET = SIN(KBIN,STRING)
+      IF(IGET.NE.0) RETURN
+1001  FORMAT(I10)
+      CALL RJST(STRING)
+      READ(STRING,1001,ERR=10,END=10)IN
+      RETURN
+10    CONTINUE
+      IN = 0
+      IGET = 4
+      END
+      INTEGER FUNCTION IGETR(RIN,KBIN)
+C  AUTHOR: RODGER STADEN
+C  RETURNS:
+C          1 FOR HELP
+C          2 FOR QUIT
+C          3 FOR ALL BLANKS
+C          4 FOR READ ERROR
+C          0 OTHERWISE
+      CHARACTER STRING*12
+      INTEGER SIN
+      EXTERNAL SIN
+      RIN = 0.
+      IGETR = SIN(KBIN,STRING)
+      IF(IGETR.NE.0) RETURN
+1001  FORMAT(F12.0)
+      CALL RJST(STRING)
+      READ(STRING,1001,ERR=10,END=10)RIN
+      RETURN
+10    CONTINUE
+      RIN = 0.
+      IGETR = 4
+      END
+      SUBROUTINE GTSTR(P,STRING,NEW,LENGTH,KBOUT,KBIN,INFLAG)
+      CHARACTER STRING*(*),NEW*(*),P*(*)
+      INTEGER SIN
+      EXTERNAL NOTRL,SIN
+10    CONTINUE
+      IF(LENGTH.GT.0)WRITE(KBOUT,1000)P,STRING(1:LENGTH)
+1000  FORMAT(' Default ',A,'=',A)
+      WRITE(KBOUT,1001)P
+1001  FORMAT(' ? ',A,'=',$)
+      INFLAG = SIN(KBIN,NEW)
+      IF(INFLAG.EQ.4) GO TO 10
+      IF(INFLAG.EQ.0) THEN
+        MAXSTR = LEN(NEW)
+        LENGTH = NOTRL(NEW,MAXSTR,' ')
+        RETURN
+      END IF
+      IF(INFLAG.EQ.3) THEN
+        LENGTH = 0
+        RETURN
+      END IF
+      LENGTH = 1
+      END
+      SUBROUTINE GETSTR(P,STRING,NEW,MAXSTR,LENGTH,KBOUT,KBIN,INFLAG)
+      CHARACTER STRING(MAXSTR),NEW(MAXSTR),P*(*),NEWT*80
+      INTEGER SIN
+      EXTERNAL NOTRL,SIN
+      PARAMETER (MAXST = 80)
+10    CONTINUE
+      IF(LENGTH.GT.0)WRITE(KBOUT,1000)P,(STRING(K),K=1,LENGTH)
+1000  FORMAT(' Default ',A,'=',75A1)
+      WRITE(KBOUT,1001)P
+1001  FORMAT(' ? ',A,'=',$)
+      INFLAG = SIN(KBIN,NEWT)
+      IF(INFLAG.EQ.4) GO TO 10
+      IF(INFLAG.EQ.0) THEN
+        LENGTH = MIN(MAXSTR,NOTRL(NEWT,MAXST,' '))
+        CALL STOA(NEWT,NEW,LENGTH)
+        RETURN
+      END IF
+      IF(INFLAG.EQ.3) THEN
+        LENGTH = 0
+        RETURN
+      END IF
+      LENGTH = 1
+      END
+      SUBROUTINE STOA(STRING,ARRAY,LENGTH)
+      CHARACTER STRING*(*),ARRAY(LENGTH)
+      DO 10 I = 1,LENGTH
+        ARRAY(I) = STRING(I:I)
+10    CONTINUE
+      END
+      SUBROUTINE GETOPT(KBIN,KOPT,IOPT)
+      CHARACTER TCLASS,STRING*10,STRNG2*10
+      INTEGER SIN
+      EXTERNAL NOTILR,NOTIRL,SIN
+      IOPT = 0
+      KOPT = 0
+10    CONTINUE
+      INFLAG = SIN(KBIN,STRING)
+      IF(INFLAG.EQ.1) THEN
+        KOPT = 2
+      END IF
+      IF(INFLAG.EQ.2) THEN
+        KOPT = 0
+        IOPT = 2
+        RETURN
+      END IF
+      IF(INFLAG.EQ.4) GO TO 10
+      K = NOTIRL(STRING,10,' ')
+      L = NOTILR(STRING,10,' ')
+      IF(K.LT.L) RETURN
+      TCLASS = STRING(L:L)
+      IF((TCLASS.EQ.'D').OR.(TCLASS.EQ.'d')) THEN
+        KOPT = 1
+      ELSE IF((TCLASS.EQ.'M').OR.(TCLASS.EQ.'m')) THEN
+        KOPT = 3
+      END IF
+      IF(KOPT.NE.0) L = L + 1
+      IF((KOPT.EQ.2).AND.(L.GT.K)) THEN
+        KOPT = 0
+        IOPT = 1
+        RETURN
+      END IF  
+      J = 11
+      STRNG2 = ' '
+      DO 50 I = K,L,-1
+        J = J-1
+        STRNG2(J:J) = STRING(I:I)
+50    CONTINUE
+      READ(STRNG2,1001,ERR=60)IOPT
+1001  FORMAT(I10)
+      IF(KOPT.EQ.3) THEN
+        IOPT = -IOPT
+        KOPT = 0
+      END IF
+      RETURN
+60    CONTINUE
+      KOPT = 4
+      RETURN
+70    CONTINUE
+      KOPT = 4
+      END
+      SUBROUTINE YESNO(ANSWER,PROMPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      INTEGER ANSWER,SIN
+      CHARACTER PROMPT*(*),HELPF*(*),STRING,YESU,YESL,NOU,NOL
+      PARAMETER (YESU='Y',YESL='y',NOU='N',NOL='n')
+      EXTERNAL SIN
+10    CONTINUE
+      WRITE(KBOUT,1000)PROMPT
+1000  FORMAT(' ? ',A,' (y/n) (y) = ',$)
+      I = SIN(KBIN,STRING)
+      IF(I.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(I.EQ.2) THEN
+        ANSWER = -1
+        RETURN
+      END IF
+      IF(I.EQ.3) THEN
+        ANSWER = 0
+        RETURN
+      END IF
+      I = MAX(INDEX(STRING,NOU),INDEX(STRING,NOL))
+      IF(I.NE.0) THEN
+        ANSWER = 1
+        RETURN
+      END IF
+      I = MAX(INDEX(STRING,YESU),INDEX(STRING,YESL))
+      IF(I.NE.0) THEN
+        ANSWER = 0
+        RETURN
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE YESONO(CHOICE,P1,P2,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*),P1*(*),P2*(*)
+      INTEGER CHOICE
+      IF(CHOICE.EQ.0) THEN
+        CALL YESNO(I,P1,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      ELSE IF(CHOICE.EQ.1) THEN
+        CALL YESNO(I,P2,
+     +  IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        IF(I.EQ.1) THEN
+          I = 0
+        ELSE IF(I.EQ.0) THEN
+          I = 1
+        END IF
+      ELSE
+        WRITE(KBOUT,*)'ERROR IN INPUT TO YESONO'
+      END IF
+      CHOICE = I
+      END
+      SUBROUTINE GETINT(MININ,MAXIN,DEF,PROMPT,VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (MAXCHR = 8)
+      CHARACTER HELPF*(*),PROMPT*(*)
+      CHARACTER*8 STR1,STR2,STR3
+      INTEGER DEF,VALUE
+      EXTERNAL NOTILR,IGET
+      IOK = 1
+1000  FORMAT(I8)
+      WRITE(STR1,1000,ERR=20)MININ
+      WRITE(STR2,1000,ERR=20)MAXIN
+      WRITE(STR3,1000,ERR=20)DEF
+10    CONTINUE
+      WRITE(KBOUT,1003)PROMPT,
+     +STR1(NOTILR(STR1,MAXCHR,' '):MAXCHR),
+     +STR2(NOTILR(STR2,MAXCHR,' '):MAXCHR),
+     +STR3(NOTILR(STR3,MAXCHR,' '):MAXCHR)
+1003  FORMAT(' ? ',A,' (',A,'-',A,') (',A,') =',$)
+      IN = IGET(IVAL,KBIN)
+      IF(IN.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(IN.EQ.2)RETURN
+      IF(IN.EQ.3) THEN
+        VALUE = DEF
+        IOK = 0
+        RETURN
+      END IF
+      IF(IN.GT.0) GO TO 10
+      IF(IVAL.GT.MAXIN)GO TO 10
+      IF(IVAL.LT.MININ)GO TO 10
+      VALUE = IVAL
+      IOK = 0
+      RETURN
+20    CONTINUE
+      WRITE(KBOUT,*)'Error in routine getint'
+      END
+      SUBROUTINE GETRL(MININ,MAXIN,DEF,PROMPT,VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (MAXCHR = 8)
+      CHARACTER HELPF*(*),PROMPT*(*)
+      CHARACTER*8 STR1,STR2,STR3
+      REAL MININ,MAXIN,IVAL
+      EXTERNAL NOTILR,IGETR
+      IOK = 1
+1000  FORMAT(F8.2)
+      WRITE(STR1,1000,ERR=20)MININ
+      WRITE(STR2,1000,ERR=20)MAXIN
+      WRITE(STR3,1000,ERR=20)DEF
+10    CONTINUE
+      WRITE(KBOUT,1003)PROMPT,
+     +STR1(NOTILR(STR1,MAXCHR,' '):MAXCHR),
+     +STR2(NOTILR(STR2,MAXCHR,' '):MAXCHR),
+     +STR3(NOTILR(STR3,MAXCHR,' '):MAXCHR)
+1003  FORMAT(' ? ',A,' (',A,'-',A,') (',A,') =',$)
+      IN = IGETR(IVAL,KBIN)
+      IF(IN.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(IN.EQ.2)RETURN
+      IF(IN.EQ.3) THEN
+        VALUE = DEF
+        IOK = 0
+        RETURN
+      END IF
+      IF(IN.GT.0) GO TO 10
+      IF(IVAL.GT.MAXIN)GO TO 10
+      IF(IVAL.LT.MININ)GO TO 10
+      VALUE = IVAL
+      IOK = 0
+      RETURN
+20    CONTINUE
+      WRITE(KBOUT,*)'Error in routine getrl'
+      END
+      SUBROUTINE GETRLS(MININ,MAXIN,DEF,PROMPT,VALUE,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+C   AUTHOR: RODGER STADEN
+      PARAMETER (MAXCHR = 12)
+      CHARACTER HELPF*(*),PROMPT*(*)
+      CHARACTER*12 STR1,STR2,STR3
+      REAL MININ,MAXIN,IVAL
+      EXTERNAL NOTILR,IGETR,NOTIRL
+      IOK = 1
+1000  FORMAT(F12.10)
+      WRITE(STR1,1000,ERR=20)MININ
+      WRITE(STR2,1000,ERR=20)MAXIN
+      WRITE(STR3,1000,ERR=20)DEF
+10    CONTINUE
+      WRITE(KBOUT,1003)PROMPT,
+     +STR1(NOTILR(STR1,MAXCHR,' '):NOTIRL(STR1,MAXCHR,'0')),
+     +STR2(NOTILR(STR2,MAXCHR,' '):NOTIRL(STR2,MAXCHR,'0')),
+     +STR3(NOTILR(STR3,MAXCHR,' '):NOTIRL(STR3,MAXCHR,'0'))
+1003  FORMAT(' ? ',A,' (',A,'-',A,') (',A,') =',$)
+      IN = IGETR(IVAL,KBIN)
+      IF(IN.EQ.1)THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(IN.EQ.2)RETURN
+      IF(IN.EQ.3) THEN
+        VALUE = DEF
+        IOK = 0
+        RETURN
+      END IF
+      IF(IN.GT.0) GO TO 10
+      IF(IVAL.GT.MAXIN)GO TO 10
+      IF(IVAL.LT.MININ)GO TO 10
+      VALUE = IVAL
+      IOK = 0
+      RETURN
+20    CONTINUE
+      WRITE(KBOUT,*)'Error in routine getrls'
+      END
+      SUBROUTINE CHECK4(P1,P2,P3,P4,C1,C2,C3,C4,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT,IOK)
+      CHARACTER HELPF*(*),P1*(*),P2*(*),P3*(*),P4*(*),C*4
+      INTEGER C1,C2,C3,C4,CA(4)
+      INTEGER TOGGLE
+      EXTERNAL TOGGLE,IGET
+      IOK = 1
+      CA(1) = C1
+      CA(2) = C2
+      CA(3) = C3
+      CA(4) = C4
+10    CONTINUE
+      CALL FILLC(C,4,' ')
+      DO 15 I = 1,4
+        IF(CA(I).EQ.1) C(I:I) = 'X'
+15    CONTINUE
+      WRITE(KBOUT,1002)
+1002  FORMAT(/,' checkbox: those set are marked X')
+      WRITE(KBOUT,1000)
+     +C(1:1),P1,
+     +C(2:2),P2,
+     +C(3:3),P3,
+     +C(4:4),P4
+1000  FORMAT(
+     +' ',A,' 1 ',A,/,
+     +' ',A,' 2 ',A,/,
+     +' ',A,' 3 ',A,/,
+     +' ',A,' 4 ',A,/,
+     +' ? 0,1,2,3,4 =',$)
+      IN = IGET(J,KBIN)
+      IF(IN.EQ.1) THEN
+        CALL HELP2(IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+        GO TO 10
+      END IF
+      IF(IN.EQ.2) RETURN
+      IF(IN.GT.3) GO TO 10
+      IF((IN.EQ.3).OR.(J.EQ.0)) THEN
+        C1 = CA(1)
+        C2 = CA(2)
+        C3 = CA(3)
+        C4 = CA(4)
+        IOK = 0
+        RETURN
+      END IF
+      IF((J.LT.5).AND.(J.GT.0)) CA(J) = TOGGLE(CA(J))
+      GO TO 10
+      END
+      INTEGER FUNCTION TOGGLE(I)
+      IN = I
+      IF(IN.EQ.0) THEN
+        IN = 1
+      ELSE 
+        IN = 0
+      END IF
+      TOGGLE = IN
+      END
diff --git a/src/staden/userface.h b/src/staden/userface.h
new file mode 100644
index 0000000..a67c46d
--- /dev/null
+++ b/src/staden/userface.h
@@ -0,0 +1,197 @@
+#include "fort.h"
+
+#define yesno_x yesno_
+#define yesono_x yesono_
+#define gtstr_x gtstr_
+#define getstr_x getstr_
+#define getint_x getint_
+#define getrl_x getrl_
+#define getrls_x getrls_
+#define radion_x radion_
+#define check4_x check4_
+#define getopt_x getopt_
+#define showfi_x showfi_
+#define showfu_x showfu_
+#define errom_x errom_
+#define busy_x busy_
+#define bpause_x bpause_
+#define menu_x menu_
+
+int yesono(int choice, char *prompt1, char *prompt2);
+
+void getint_x(int_f *MININ_p,
+	      int_f *MAXIN_p,
+	      int_f *DEF_p,
+	      char  *PROMPT_p,
+	      int_f *VALUE_p,  /* out: result */
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *IOK_p,    /* out: status */
+	      int_fl PROMPT_l,
+	      int_fl HELPF_l);
+
+void getrl_x(float *MININ_p,
+	     float *MAXIN_p,
+	     float *DEF_p,
+	     char  *PROMPT_p,
+	     float *VALUE_p,  /* out: result */
+	     int_f *KBIN_p,
+	     int_f *KBOUT_p,
+	     int_f *IHELPS_p,
+	     int_f *IHELPE_p,
+	     char  *HELPF_p,
+	     int_f *IDEVH_p,
+	     int_f *IOK_p,
+	     int_fl PROMPT_l,
+	     int_fl HELPF_l);
+
+void getrls_x(float *MININ_p,
+	     float *MAXIN_p,
+	     float *DEF_p,
+	     char  *PROMPT_p,
+	     float *VALUE_p,  /* out: result */
+	     int_f *KBIN_p,
+	     int_f *KBOUT_p,
+	     int_f *IHELPS_p,
+	     int_f *IHELPE_p,
+	     char  *HELPF_p,
+	     int_f *IDEVH_p,
+	     int_f *IOK_p,
+	     int_fl PROMPT_l,
+	     int_fl HELPF_l);
+
+void yesno_x(int_f *ANSWER_p,
+	     char  *PROMPT_p,
+	     int_f *IHELPS_p,
+	     int_f *IHELPE_p,
+	     char  *HELPF_p,
+	     int_f *IDEVH_p,
+	     int_f *KBIN_p,
+	     int_f *KBOUT_p,
+	     int_fl PROMPT_l,
+	     int_fl HELPF_l);
+
+void yesono_x(int_f *CHOICE_p,
+	      char  *P1_p,
+	      char  *P2_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl P1_l,
+	      int_fl P2_l,
+	      int_fl HELPF_l);
+
+void gtstr_x(char  *P_p,      /* prompt */
+	     char  *STRING_p, /* default value (if *LENGTH_p > 0) */
+	     char  *NEW_p,    /* out: dialogue result */
+	     int_f *LENGTH_p, /* in:  significant length of *STRING_p
+				out: significant length of *NEW_p */
+	     int_f *KBOUT_p,
+	     int_f *KBIN_p,
+	     int_f *INFLAG_p, /* out: status */
+	     int_fl P_l,
+	     int_fl STRING_l,
+	     int_fl NEW_l);
+
+void getstr_x(char  *P_p,      /* prompt */
+	      char  *STRING_p, /* default value (if *LENGTH_p > 0) */
+	      char  *NEW_p,    /* out: dialogue result */
+	      int_f *MAXSTR_p, /* array length of *STRING_p and *NEW_p */
+	      int_f *LENGTH_p, /* in:  significant length of *STRING_p
+				 out: significant length of *NEW_p */
+	      int_f *KBOUT_p,
+	      int_f *KBIN_p,
+	      int_f *INFLAG_p, /* out: status */
+	      int_fl P_l);
+
+void radion_x(char  *TITLE_p,
+	      char  *PROMPT_p, /* Actually: *PROMPT_p[*NB_p] */
+	      int_f *NB_p,     /* Number of buttons (1..NB) */
+	      int_f *CHOICE_p, /* in:  default button
+			         out: selected button */
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl TITLE_l,
+	      int_fl PROMPT_l,
+	      int_fl HELPF_l);
+
+void check4_x(char  *P1_p, char  *P2_p, char  *P3_p, char  *P4_p,
+	      int_f *C1_p, int_f *C2_p, int_f *C3_p, int_f *C4_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IOK_p,
+	      int_fl P1_l, int_fl P2_l, int_fl P3_l, int_fl P4_l,
+	      int_fl HELPF_l);
+
+void getopt_x(int_f *KBIN_p,
+	      int_f *KOPT_p,
+	      int_f *IOPT_p); /* out: option number selected */
+
+void showfi_x(int_f *KBOUT_p,
+	      char  *STRING_p,
+	      int_fl STRING_l);
+
+void showfu_x(int_f *KBOUT_p,
+	      char  *STRING_p,
+	      int_fl STRING_l);
+
+void errom_x(int_f *KBOUT_p,
+	     char  *STRING_p,
+	     int_fl STRING_l);
+
+void busy_x(int_f *KBOUT_p);
+
+int gtstr(char *prompt, char *defval, char *out, size_t outlen);
+
+int yesno(char *prompt);
+
+int getint(int minval, int maxval, int defval, char *prompt, int *status);
+
+float getfloat(float minval, float maxval, float defval, char *prompt,
+	       int *status, int precision);
+
+int checkn(int num, char **prompts, int **bools);
+
+int radion(char *title, char **options, int numopts, int def);
+
+int getcopt(int *status);
+
+void showfu(char *fname);
+
+void showfi(char *fname);
+
+void errom(char *errmsg);
+
+void busy();
+
+void menu_x(int_f *OPT_p,
+	    int_f *KOPT_p,
+	    int_f *MOPT_p,
+	    int_f *MAXOPT_p,
+	    int_f *MINMEN_p,
+	    int_f *KBIN_p,
+	    int_f *KBOUT_p,
+	    int_f *IHELPS_p,
+	    int_f *IHELPE_p,
+	    char  *HELPF_p,
+	    int_f *IDEVH_p,
+	    int_fl HELPF_l);
+
+int bpause();
+
+void showfunc();
diff --git a/src/staden/userfacecom.c b/src/staden/userfacecom.c
new file mode 100644
index 0000000..60b79c9
--- /dev/null
+++ b/src/staden/userfacecom.c
@@ -0,0 +1,430 @@
+#include <stdio.h>
+#include <stdlib.h>
+#include <string.h>
+#include <sys/types.h>
+#include "userface.h"
+#include "helpnmenu.h"
+#include "FtoC.h" /* IMPORT: Fstr2Cstr and Cstr2Fstr */
+
+/*
+ * Prompts the user with either one yes/no question or another.
+ * Args:
+ *   choice: which question to ask (0 = prompt1, 1 = prompt2)
+ *   prompt[12]: the questions.
+ * Returns;
+ *    0 = yes
+ *    1 = no
+ *   -1 = cancel
+ */
+int yesono(int choice, char *prompt1, char *prompt2) {
+    int a;
+
+    switch(choice) {
+    case 0:
+	return yesno(prompt1);
+    case 1:
+	a = yesno(prompt2);
+	if (a>=0)
+	    a ^= 1;
+	return a;
+    default:
+	fputs("yesno() called with bad 'choice'\n", stderr);
+	return -1;
+    }
+}
+
+/*--------------------------------------------------------------------------*\
+|*									    *|
+|* FORTRAN interface routines. These are overlays on top of the C routines. *|
+|* There is no actual 'real' code here - that is in either userface.c (for  *|
+|* the text version) or dialogues.c (for the X windows version)		    *|
+|*									    *|
+\*--------------------------------------------------------------------------*/
+
+void getint_x(int_f *MININ_p,
+	      int_f *MAXIN_p,
+	      int_f *DEF_p,
+	      char  *PROMPT_p,
+	      int_f *VALUE_p,  /* out: result */
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *IOK_p,    /* out: status */
+	      int_fl PROMPT_l,
+	      int_fl HELPF_l)
+/*
+ * Prompt for an integer in the range *MININ_p <= i <= *MAXIN_p
+ * IOK=0 => OK
+ * IOK=1 => Cancel
+ */
+{
+    char prompt[256];
+    int status;
+
+    Fstr2Cstr(PROMPT_p, PROMPT_l, prompt, (int_fl)sizeof(prompt));
+
+    *VALUE_p = getint((int)*MININ_p, (int)*MAXIN_p, (int)*DEF_p,
+		      prompt, &status);
+    *IOK_p = (int_f)((status < 0)? 1:0);
+
+    return;
+}
+
+void getrl_x(float *MININ_p,
+	     float *MAXIN_p,
+	     float *DEF_p,
+	     char  *PROMPT_p,
+	     float *VALUE_p,  /* out: result */
+	     int_f *KBIN_p,
+	     int_f *KBOUT_p,
+	     int_f *IHELPS_p,
+	     int_f *IHELPE_p,
+	     char  *HELPF_p,
+	     int_f *IDEVH_p,
+	     int_f *IOK_p,
+	     int_fl PROMPT_l,
+	     int_fl HELPF_l)
+/*
+ * Prompt for a real in the range *MININ_p <= i <= *MAXIN_p
+ * The prompt is printed using format F8.2
+ * IOK=0 => OK
+ * IOK=1 => Cancel
+ */
+{
+    char prompt[256];
+    int status;
+
+    Fstr2Cstr(PROMPT_p, PROMPT_l, prompt, (int_fl)sizeof(prompt));
+
+    *VALUE_p = getfloat(*MININ_p, *MAXIN_p, *DEF_p, prompt, &status, 2);
+    *IOK_p = (int_f)((status < 0)? 1:0);
+
+    return;
+}
+
+void getrls_x(float *MININ_p,
+	      float *MAXIN_p,
+	      float *DEF_p,
+	      char  *PROMPT_p,
+	      float *VALUE_p,  /* out: result */
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *IOK_p,
+	      int_fl PROMPT_l,
+	      int_fl HELPF_l)
+/*
+ * Prompt for a real in the range *MININ_p <= i <= *MAXIN_p
+ * The prompt is printed using format F8.2
+ * IOK=0 => OK
+ * IOK=1 => Cancel
+ */
+{
+    char prompt[256];
+    int status;
+
+    Fstr2Cstr(PROMPT_p, PROMPT_l, prompt, (int_fl)sizeof(prompt));
+
+    *VALUE_p = getfloat(*MININ_p, *MAXIN_p, *DEF_p, prompt, &status, 10);
+    *IOK_p = (int_f)((status < 0)? 1:0);
+
+    return;
+}
+
+void yesno_x(int_f *ANSWER_p,
+	     char  *PROMPT_p,
+	     int_f *IHELPS_p,
+	     int_f *IHELPE_p,
+	     char  *HELPF_p,
+	     int_f *IDEVH_p,
+	     int_f *KBIN_p,
+	     int_f *KBOUT_p,
+	     int_fl PROMPT_l,
+	     int_fl HELPF_l)
+/*
+ * Prompt the user with *PROMPT_l and get a YES or NO response.
+ * ANSWER=-1 => Cancel
+ * ANSWER=0  => YES
+ * ANSWER=1  => NO
+ */
+{
+    char prompt[256];
+
+    Fstr2Cstr(PROMPT_p, PROMPT_l, prompt, (int_fl)sizeof(prompt));
+    *ANSWER_p = (int_f)yesno(prompt);
+}
+
+void yesono_x(int_f *CHOICE_p,
+	      char  *P1_p,
+	      char  *P2_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl P1_l,
+	      int_fl P2_l,
+	      int_fl HELPF_l)
+/*
+ * If CHOICE is 0, present a YESNO dialogue with prompt P1.
+ * If CHOICE is 1, present a YESNO dialogue with prompt P2.
+ */
+{
+    char prompt1[256], prompt2[256];
+
+    Fstr2Cstr(P1_p, P1_l, prompt1, (int_fl)sizeof(prompt1));
+    Fstr2Cstr(P2_p, P2_l, prompt2, (int_fl)sizeof(prompt2));
+
+    *CHOICE_p = (int_f)yesono(*CHOICE_p, prompt1, prompt2);
+}
+
+void gtstr_x(char  *P_p,      /* prompt */
+	     char  *STRING_p, /* default value (if *LENGTH_p > 0) */
+	     char  *NEW_p,    /* out: dialogue result */
+	     int_f *LENGTH_p, /* in:  significant length of *STRING_p
+			 	out: significant length of *NEW_p */
+	     int_f *KBOUT_p,
+	     int_f *KBIN_p,
+	     int_f *INFLAG_p, /* out: status */
+	     int_fl P_l,
+	     int_fl STRING_l,
+	     int_fl NEW_l)
+/*
+ * Prompt with P for a string, default value STRING.
+ * Return new string in NEW.
+ * INFLAG=0 => OK
+ * INFLAG=1 => Help
+ * INFLAG=2 => Cancel, LENGTH=1
+ * INFLAG=3 => LENGTH=0 and NEW full of blanks (ie default)
+ */
+{
+    char question[256], def[256];
+    char *defptr;
+    int ret;
+
+    Fstr2Cstr(P_p, P_l, question, (int_fl)sizeof(question));
+    if (*LENGTH_p > 0) {
+	Fstr2Cstr(STRING_p, STRING_l, def, (int_fl)sizeof(def));
+	defptr = def;
+    } else
+	defptr = 0;
+
+    ret = gtstr(question, defptr, NEW_p, (size_t)((NEW_l==1)?40:NEW_l));
+    switch(ret) {
+    case -1:
+	*INFLAG_p = 2;
+	*LENGTH_p = 1;
+	return;
+    case 0:
+	*INFLAG_p = 0;
+	*LENGTH_p = (int_f)strlen(NEW_p);
+	Cstr2Fstr(NEW_p, NEW_p, NEW_l);
+	return;
+    case 1:
+	*INFLAG_p = 3;
+	*LENGTH_p = 0;
+    }
+}
+
+void getstr_x(char  *P_p,      /* prompt */
+	      char  *STRING_p, /* default value (if *LENGTH_p > 0) */
+	      char  *NEW_p,    /* out: dialogue result */
+	      int_f *MAXSTR_p, /* array length of *STRING_p and *NEW_p */
+	      int_f *LENGTH_p, /* in:  significant length of *STRING_p
+				 out: significant length of *NEW_p */
+	      int_f *KBOUT_p,
+	      int_f *KBIN_p,
+	      int_f *INFLAG_p, /* out: status */
+	      int_fl P_l)
+/*
+ * Present a dialogue requesting a string result with the given prompt
+ * and default.
+ *
+ * In the Fortran, P is a character string, so it passes both *P_p
+ * and P_l. STRING and NEW are character arrays, so we are merely
+ * given *STRING_p and *NEW_p.
+ */
+{
+    gtstr_x(P_p,
+	    STRING_p,
+	    NEW_p,   
+	    LENGTH_p,
+	    KBOUT_p,
+	    KBIN_p,
+	    INFLAG_p,
+	    P_l,
+	    *MAXSTR_p,
+	    *MAXSTR_p);
+}
+
+void radion_x(char  *TITLE_p,
+	      char  *PROMPT_p, /* Actually: *PROMPT_p[*NB_p] */
+	      int_f *NB_p,     /* Number of buttons (1..NB) */
+	      int_f *CHOICE_p, /* in:  default button
+			         out: selected button */
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl TITLE_l,
+	      int_fl PROMPT_l,
+	      int_fl HELPF_l)
+/*
+ * Present a ``radio button'' dialogue allowing the user to select
+ * one of a number of prompted items.
+ * In the text version this is simply a list of options and a prompt.
+ * CHOICE=(1..NB) => That option selected
+ * CHOICE=-1      => Cancel
+ */
+{
+    char **opt, title[256];
+    int i, numopts = (int)*NB_p;
+
+    /* initialise options array */
+    opt = (char **)malloc(numopts * sizeof(char *));
+    opt[0] = (char *)malloc((unsigned int)((PROMPT_l+1) * numopts));
+    for (i=0; i<numopts; i++) {
+	opt[i] = opt[0] + (PROMPT_l+1)*i;
+	Fstr2Cstr(PROMPT_p+(i*PROMPT_l), PROMPT_l, opt[i], PROMPT_l+1);
+    }
+
+    Fstr2Cstr(TITLE_p, TITLE_l, title, (int_fl)sizeof(title));
+
+    *CHOICE_p = (int_f)radion(title, opt, *NB_p, *CHOICE_p);
+    
+    free((void *)opt[0]);
+    free((void *)opt);
+    return;
+}
+
+void check4_x(char  *P1_p, char  *P2_p, char  *P3_p, char  *P4_p,
+	      int_f *C1_p, int_f *C2_p, int_f *C3_p, int_f *C4_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char  *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IOK_p,
+	      int_fl P1_l, int_fl P2_l, int_fl P3_l, int_fl P4_l,
+	      int_fl HELPF_l)
+/*
+ * Present a ``checkbox'' dialogue allowing the user to alter the
+ * Boolean state of four items.
+ * C1 - C4 give the initial states of the items: 0 = unset, 1 = set.
+ * The final states of the items are also returned in these.
+ * IOK=0 => OK
+ * IOK=1 => Cancel
+ */
+{
+    int ret;
+    char *prompts[4];
+    int *bools[4];
+
+    prompts[0] = P1_p; bools[0] = (int *)C1_p;
+    prompts[1] = P2_p; bools[1] = (int *)C2_p;
+    prompts[2] = P3_p; bools[2] = (int *)C3_p;
+    prompts[3] = P4_p; bools[3] = (int *)C4_p;
+
+    ret = checkn(4, prompts, bools);
+
+    if (ret == 0)
+	*IOK_p = 0;
+    else
+	*IOK_p = 1;
+}
+
+void getopt_x(int_f *KBIN_p,
+	      int_f *KOPT_p,
+	      int_f *IOPT_p) /* out: option number selected */
+/*
+ * Select a function to execute
+ * KOPT=0 => OK
+ * KOPT=1 => dialogue wanted
+ * KOPT=3 => menu option requested
+ */
+{
+    *IOPT_p = (int_f)getcopt((int *)KOPT_p);
+    switch(*KOPT_p) {
+    case -3:
+    case -2:
+	*KOPT_p = 0;
+	*IOPT_p = 2;
+	break;
+    case -1:
+	*KOPT_p = 0;
+	*IOPT_p = 1;
+	break;
+    }
+}
+
+void showfi_x(int_f *KBOUT_p,
+	      char  *STRING_p,
+	      int_fl STRING_l)
+/*
+ * Display `STRING', the name of the current file(s).
+ */
+{
+    char func[256];
+
+    Fstr2Cstr(STRING_p, STRING_l, func, (int_fl)sizeof(func));
+    showfi(func);
+}
+
+
+void showfu_x(int_f *KBOUT_p,
+	      char  *STRING_p,
+	      int_fl  STRING_l)
+/*
+ * Display `STRING', the name of the current function.
+ */
+{
+    char func[256];
+
+    Fstr2Cstr(STRING_p, STRING_l, func, (int_fl)sizeof(func));
+    showfu(func);
+}
+
+void errom_x(int_f *KBOUT_p,
+	     char  *STRING_p,
+	     int_fl  STRING_l)
+/*
+ * Display the error message `STRING'.
+ */
+{
+    char err[256];
+
+    Fstr2Cstr(STRING_p, STRING_l, err, (int_fl)sizeof(err));
+    errom(err);
+}
+
+void busy_x(int_f *KBOUT_p)
+/*
+ * Inform the user that there will be a long pause before the
+ * next dialogue function is called.
+ */
+{
+    busy();
+}
+
+/*
+ * Returns:
+ *    0 for ok
+ *   -1 for cancel
+ */
+void bpause_x(int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_f *IOK_p) {
+
+    *IOK_p = (int_f)bpause();
+}
+
diff --git a/src/staden/values.h b/src/staden/values.h
new file mode 100644
index 0000000..792e966
--- /dev/null
+++ b/src/staden/values.h
@@ -0,0 +1,118 @@
+/*	@(#)values.h 1.7 89/10/02 SMI; from S5R2 1.16	*/
+
+#ifndef _values_h
+#define _values_h
+
+/* These values work with any binary representation of integers
+ * where the high-order bit contains the sign. */
+
+/* a number used normally for size of a shift */
+#if gcos
+#define BITSPERBYTE	9
+#else
+#define BITSPERBYTE	8
+#endif
+#define BITS(type)	(BITSPERBYTE * (int)sizeof(type))
+
+/* short, regular and long ints with only the high-order bit turned on */
+#define HIBITS	((short)(1 << BITS(short) - 1))
+#define HIBITI	(1 << BITS(int) - 1)
+#define HIBITL	(1L << BITS(long) - 1)
+
+/* largest short, regular and long int */
+#define MAXSHORT	((short)~HIBITS)
+#define MAXINT	(~HIBITI)
+#define MAXLONG	(~HIBITL)
+
+/* various values that describe the binary floating-point representation
+ * _EXPBASE	- the exponent base
+ * DMAXEXP 	- the maximum exponent of a double (as returned by frexp())
+ * FMAXEXP 	- the maximum exponent of a float  (as returned by frexp())
+ * DMINEXP 	- the minimum exponent of a double (as returned by frexp())
+ * FMINEXP 	- the minimum exponent of a float  (as returned by frexp())
+ * MAXDOUBLE	- the largest double
+			((_EXPBASE ** DMAXEXP) * (1 - (_EXPBASE ** -DSIGNIF)))
+ * MAXFLOAT	- the largest float
+			((_EXPBASE ** FMAXEXP) * (1 - (_EXPBASE ** -FSIGNIF)))
+ * MINDOUBLE	- the smallest double (_EXPBASE ** (DMINEXP - 1))
+ * MINFLOAT	- the smallest float (_EXPBASE ** (FMINEXP - 1))
+ * DSIGNIF	- the number of significant bits in a double
+ * FSIGNIF	- the number of significant bits in a float
+ * DMAXPOWTWO	- the largest power of two exactly representable as a double
+ * FMAXPOWTWO	- the largest power of two exactly representable as a float
+ * _IEEE	- 1 if IEEE standard representation is used
+ * _DEXPLEN	- the number of bits for the exponent of a double
+ * _FEXPLEN	- the number of bits for the exponent of a float
+ * _HIDDENBIT	- 1 if high-significance bit of mantissa is implicit
+ * LN_MAXDOUBLE	- the natural log of the largest double  -- log(MAXDOUBLE)
+ * LN_MINDOUBLE	- the natural log of the smallest double -- log(MINDOUBLE)
+ * LN_MAXFLOAT	- the natural log of the largest float  
+ * LN_MINFLOAT	- the natural log of the smallest float
+ */
+#if u3b || u3b5 || sun
+#define MAXDOUBLE	1.797693134862315708e+308
+#define MAXFLOAT	((float)3.40282346638528860e+38)
+#define MINDOUBLE	4.94065645841246544e-324
+#define MINFLOAT	((float)1.40129846432481707e-45)
+#define	_IEEE		1
+#define _DEXPLEN	11
+#define _HIDDENBIT	1
+#define DMINEXP	(-(DMAXEXP + DSIGNIF - _HIDDENBIT - 3))
+#define FMINEXP	(-(FMAXEXP + FSIGNIF - _HIDDENBIT - 3))
+#define LN_MAXFLOAT	8.8722839052068e+01
+#define LN_MINFLOAT	-1.03278929903432e+02
+#endif
+#if pdp11 || vax
+#define MAXDOUBLE	1.701411834604692293e+38
+#define MAXFLOAT	((float)1.701411733192644299e+38)
+/* The following is kludged because the PDP-11 compilers botch the simple form.
+   The kludge causes the constant to be computed at run-time on the PDP-11,
+   even though it is still "folded" at compile-time on the VAX. */
+#define MINDOUBLE	(0.01 * 2.938735877055718770e-37)
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	1
+#define DMINEXP	(-DMAXEXP)
+#define FMINEXP	(-FMAXEXP)
+#endif
+#if gcos
+#define MAXDOUBLE	1.7014118346046923171e+38
+#define MAXFLOAT	((float)1.7014118219281863150e+38)
+#define MINDOUBLE	2.9387358770557187699e-39
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	0
+#define DMINEXP	(-(DMAXEXP + 1))
+#define FMINEXP	(-(FMAXEXP + 1))
+#endif
+#if u370
+#define _LENBASE	4
+#else
+#define _LENBASE	1
+#endif
+#define _EXPBASE	(1 << _LENBASE)
+#define _FEXPLEN	8
+#define DSIGNIF	(BITS(double) - _DEXPLEN + _HIDDENBIT - 1)
+#define FSIGNIF	(BITS(float)  - _FEXPLEN + _HIDDENBIT - 1)
+#define DMAXPOWTWO	((double)(1L << BITS(long) - 2) * \
+				(1L << DSIGNIF - BITS(long) + 1))
+#define FMAXPOWTWO	((float)(1L << FSIGNIF - 1))
+#define DMAXEXP	((1 << _DEXPLEN - 1) - 1 + _IEEE)
+#define FMAXEXP	((1 << _FEXPLEN - 1) - 1 + _IEEE)
+#define LN_MAXDOUBLE	(M_LN2 * DMAXEXP)
+#define LN_MINDOUBLE	(M_LN2 * (DMINEXP - 1))
+
+#define H_PREC	(DSIGNIF % 2 ? (1L << DSIGNIF/2) * M_SQRT2 : 1L << DSIGNIF/2)
+#define X_EPS	(1.0/H_PREC)
+#define X_PLOSS	((double)(long)(M_PI * H_PREC))
+#define X_TLOSS	(M_PI * DMAXPOWTWO)
+#define M_LN2	0.69314718055994530942
+#define M_PI	3.14159265358979323846
+#define M_SQRT2	1.41421356237309504880
+#define MAXBEXP	DMAXEXP /* for backward compatibility */
+#define MINBEXP	DMINEXP /* for backward compatibility */
+#define MAXPOWTWO	DMAXPOWTWO /* for backward compatibility */
+
+#endif /*!_values_h*/
diff --git a/src/staden/vep.f b/src/staden/vep.f
new file mode 100644
index 0000000..19c9259
--- /dev/null
+++ b/src/staden/vep.f
@@ -0,0 +1,771 @@
+      SUBROUTINE FMAIN()
+      PARAMETER (MAXSEQ = 50000,
+     +           MAXWLN = 6,
+     +           LCONST = 4*MAXWLN,
+     +           MAXWRD = 4**MAXWLN,
+     +           MAXDEV = 5,
+     +           NAMLEN = 60)
+      CHARACTER SEQV(MAXSEQ*2),SEQG(MAXSEQ),SEQC(MAXSEQ)
+      INTEGER WORDP(MAXWRD),POSN(MAXSEQ)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER CONSTS(0:LCONST),DEVNOS(MAXDEV)
+      INTEGER SEQVI(MAXSEQ),SEQGI(MAXSEQ),RC
+      CHARACTER*(NAMLEN) FILNAM,HELPF,NAME,TNAME,VNAME
+      EXTERNAL LASTO
+C
+C 5-2-92 replaced parts of the output routines vpout, vpouts
+C with routines fmtc and fmtg
+C
+C This routine prepares a reading for the assembly program.
+C It compares the sequence against vectors and clips off any found
+C Vector clipping is of 4 types:
+C 1) find 5' cloning site
+C 2) look for 3' cloning site
+C 3) test for insert being all vector
+C 4) look for cosmid "vector"
+C The first 3 can all be performed using one sequence, and are made easier
+C by telling the program exactly where the cloning site is, and which of
+C the 3 it is performing. Types 1,2 and 3 look only in one orientation,
+C whereas 4 should check both strands. Additionally they
+C differ in their outcomes: 1 and 2 write a new file with the clippoints
+C marked, 3 scrubs the reading (does not add it to a file of file names),
+C 4 writes out a new file if the reading contains some non vector sequence.
+C The clip should be marked differently for cosmid vector, just so we know where
+C it is (it can be tagged).
+C Clipping off crap should probably be done first by looking at the traces.
+C What do we need to tell the program?
+C 1) name of vector file
+C 2) position of cloning site
+C 3) position of primer
+C It would be helpful to have a standard orientation for vector sequences
+C for example Cloning site------ ... ------etis gninolC for ? strand
+C
+C producing the output files will be much simpler when we move to expt
+C file format for the sequence and its annotation
+C
+C
+C   Initialise help - currently none
+C      CALL INTHLP('vep', TOPOPT)
+      ICG = 0
+      ICB = 0
+      IPG = 0
+      IDM = 5
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      IDEVNI = DEVNOS(1)
+      IDEVN = DEVNOS(5)
+      IDEVO = DEVNOS(2)
+      IDEVNG = DEVNOS(3)
+      IDEVNB = DEVNOS(4)
+      WRITE(KBOUT,*)'VEP v2.1: vector excising program'
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(1),FILNAM,0,IOK,KBIN,KBOUT,
+     +'Input file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = 'm13mp18.seq'
+      CALL OPENF1(DEVNOS(2),FILNAM,0,IOK,KBIN,KBOUT,
+     +'File name of vector sequence',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+C
+C simplify vector name to last field
+C
+      VNAME = FILNAM(1+LASTO(FILNAM,'/'):)
+      IDIMV = MAXSEQ
+      CALL ARRFIL(DEVNOS(2),SEQV,IDIMV,KBOUT)
+      CLOSE(UNIT=DEVNOS(2))
+C  check for contig header (should not be there)
+      IF(SEQV(20).EQ.'>') THEN
+        CALL SHFLCA(SEQV,MAXSEQ,21,1,IDIMV)
+        IDIMV = IDIMV - 20
+      END IF
+      WRITE(KBOUT,*)'Vector length =',IDIMV
+C cloning site used to distinguish search for cloning site
+C from search for sequencing vector. A value of 0 signifies
+C we are searching for cosmid vector
+      MN = 0
+      MX = IDIMV
+      ICSITE = MIN(IDIMV,6249)
+      CALL GETINT(MN,MX,ICSITE,
+     +'3 prime side of cloning site (0 signifies cosmid vector)',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      ICSITE = IVAL
+      IPSITE = 1
+      IF(ICSITE.NE.0) THEN
+        MN = -IDIMV
+        MX = IDIMV
+        IPSITE = 41
+        CALL GETINT(MN,MX,IPSITE,
+     +  'Relative position of 3 prime end of primer site',
+     +  IVAL,KBIN,KBOUT,
+     +  IHELPS,IHELPE,HELPF,IDEVH,IOK)
+        IF(IOK.NE.0) STOP
+        IPSITE = IVAL
+C make cloning site end of seq, then start of seq is icsite + 1
+        CALL SQCOPY(SEQV(1),SEQV(IDIMV+1),ICSITE)
+C if forward primer then need to complement vector
+        IF(IPSITE.GT.0) THEN
+          CALL SQREV(SEQV(ICSITE+1),IDIMV)
+          CALL SQCOM(SEQV(ICSITE+1),IDIMV)
+        END IF
+      END IF
+      MN = 2
+      MX = MAXWLN
+      LENGTH = 4
+      CALL GETINT(MN,MX,LENGTH,
+     +'Word length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      LENGTH = IVAL
+      MN = 1
+      MX = 11
+      LW = 7
+      CALL GETINT(MN,MX,LW,
+     +'Number of diagonals to combine',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      LW = IVAL
+      XMN = 0.1
+      XMX = 1.0
+      CUT = 0.35
+      CALL GETRL(XMN,XMX,CUT,
+     +'Cutoff score',
+     +VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      CUT = VAL
+      FILNAM = ' '
+      CALL OPENF1(DEVNOS(3),FILNAM,1,IOK,KBIN,KBOUT,
+     +'Output file of passed file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      IF(ICSITE.EQ.0) THEN
+        FILNAM = ' '
+        CALL OPENF1(DEVNOS(4),FILNAM,1,IOK,KBIN,KBOUT,
+     +  'Output file of failed file names',
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.NE.0) STOP
+      END IF
+      IDE = (IDM-1)**LENGTH
+      CALL SETCN(CONSTS,LENGTH,IDM,LCONST)
+      CALL CONNUM(SEQV(ICSITE+1),SEQVI,IDIMV)
+      CALL ENCONC(SEQVI,IDIMV,POSN,WORDP,IDE,IDM,CONSTS,LENGTH,
+     +LCONST)
+ 10   CONTINUE
+      READ(IDEVNI,1002,END=100)NAME
+      CALL OPENRS(IDEVO,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0) THEN
+        WRITE(KBOUT,*)'Error opening file',NAME
+        GO TO 10
+      END IF
+      IDIMGI = MAXSEQ
+      CALL ARRFIM(IDEVO,SEQG,IDIMGI,KBOUT)
+      JGEL = JGEL + 1
+      WRITE(KBOUT,*)'>>>> Read number',JGEL,' length',IDIMGI,' ',NAME
+1002  FORMAT(A)
+C  LONG ENOUGH ?
+      IF(IDIMGI.LT.LENGTH)THEN
+        WRITE(KBOUT,*)' Gel reading too short to compare'
+        GO TO 10
+      END IF
+      IDIMG = IDIMGI
+      IF(ICSITE.GT.0) THEN
+C
+C   Sequencing vector clipping
+C
+        CALL CONNUM(SEQG,SEQGI,IDIMG)
+        CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +  IPSITE)
+C        WRITE(*,*)'LC,RC',LC,RC
+        CALL VPOUTS(IDEVO,NAME,LC,RC,SEQG,IDIMG,IDEVNG,IDEVN,
+     +  VNAME,TNAME,ICB,ICG,IPG,IOK)
+        GO TO 10
+      END IF
+C
+C   Cosmid clipping
+C
+      CALL CONNUM(SEQG,SEQGI,IDIMG)
+      CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +  IPSITE)
+      IF(LC.NE.0) THEN
+C
+C Match found so do output
+C
+        CALL VPOUT(IDEVO,NAME,LC,RC,SEQG,IDIMG,IDEVNG,IDEVNB,IDEVN,
+     +  VNAME,TNAME,ICB,ICG,IPG,IOK)
+        GO TO 10
+      ELSE
+C
+C Try other strand
+C
+        CALL SQCOPY(SEQG,SEQC,IDIMG)
+        CALL SQREV(SEQC,IDIMG)
+        CALL SQCOM(SEQC,IDIMG)
+        CALL CONNUM(SEQC,SEQGI,IDIMG)
+        CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +  IPSITE)
+        IF(LC.NE.0) THEN
+          LC1 = IDIMG - RC + 1
+          RC = IDIMG - LC + 1
+          LC = LC1
+        END IF
+        CALL VPOUT(IDEVO,NAME,LC,RC,SEQG,IDIMG,IDEVNG,IDEVNB,IDEVN,
+     +  VNAME,TNAME,ICB,ICG,IPG,IOK)
+      END IF
+      GO TO 10
+ 100  CONTINUE
+      WRITE(KBOUT,*)
+     +'Finished after processing',JGEL,' files and finding'
+      WRITE(KBOUT,*)ICB,' completely vector'
+      WRITE(KBOUT,*)IPG,' partly vector'
+      WRITE(KBOUT,*)ICG,' free of vector'
+      END
+      INTEGER FUNCTION LASTO(STRING,C)
+      CHARACTER STRING*(*),C
+C find last occurrence of C in STRING
+      J1 = 1
+      J = 0
+      L = LEN(STRING)
+ 10   CONTINUE
+      J1 = J1 + J
+      J = INDEX(STRING(J1:L),C)
+      IF(J.NE.0) GO TO 10
+      LASTO = J1 - 1
+      END
+      SUBROUTINE VPOUT(IDEVO,NAME,LC,RC,SEQ,IDIMG,IDEVNG,IDEVNB,IDEVN,
+     +VNAME,TNAME,ICB,ICG,IPG,IOK)
+      CHARACTER LINE*80,NAME*(*),MTYPE*4,SEQ(IDIMG),VNAME*(*),TNAME*(*)
+      CHARACTER*2 GOODS,LCLIPS,RCLIPS
+      PARAMETER (GOODS='  ',LCLIPS=';<',RCLIPS=';>')
+      INTEGER RC,FMTC
+      EXTERNAL NCFRS,FMTC
+C
+C IDEVO original file
+C IDEVN new reading file
+C IDEVNG fofn for good data
+C IDEVNB fofn for bad data
+C      IDEVN = 6
+C      WRITE(*,*)'LC,RC vpout',LC,RC
+C
+C Only handles one pair of clip points and assumes we discard lc to rc
+C Possibilities and outcomes:
+C 1. no vector: write name to fofn for good files
+C 2. some vector: write name to fofn for good files, rename old file
+C                to oldname.vector name, rewrite sequence file with new
+C                clip points
+C 3. all vector: write name to fofn for bad files
+C
+C
+C If no vector:
+C
+      IF(LC.EQ.0) THEN
+        CLOSE(IDEVO)
+        WRITE(IDEVNG,1005)NAME
+        ICG = ICG + 1
+        RETURN
+      END IF
+C
+C If all vector:
+C
+      IF(RC-LC+1.GE.IDIMG) THEN
+        CLOSE(IDEVO)
+        WRITE(IDEVNB,1005)NAME
+ 1005   FORMAT(A)
+        ICB = ICB + 1
+        RETURN
+      END IF
+C
+C Some vector, so rename original file, and use old name for new file
+C
+      TNAME = NAME
+      CLOSE(IDEVO)
+      K = INDEX(NAME,' ')
+      NAME(K:K) = '.'
+      NAME(K+1:) = VNAME
+      IOK = NCFRS(TNAME,NAME)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error renaming',TNAME
+        RETURN
+      END IF
+C
+C Add the file name to the good file of file names
+C
+      WRITE(IDEVNG,1005)TNAME
+      IPG = IPG + 1
+C
+C Reopen the file with its new name
+C
+      CALL OPENRS(IDEVO,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+C
+C Open the new file with the original name
+C
+      CALL OPENRS(IDEVN,TNAME,IOK,LRECL,1)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+      READ(IDEVO,1000,ERR=100)LOR,LCOR,LLOR,MTYPE,NAME
+ 1000   FORMAT(1X,3I6,A,A)
+      IF(LC.EQ.1) THEN
+        LCOR = LCOR + (RC - LC + 1)
+        LLOR = LLOR - (RC - LC + 1)
+      ELSE
+        LLOR = LC - 1
+      END IF
+      WRITE(IDEVN,1006,ERR=100)LOR,LCOR,LLOR,MTYPE,
+     +NAME(1:INDEX(NAME,' ')-1)
+ 1006 FORMAT(';',3I6,A,A)
+      IF(FMTC(IDEVO,IDEVN,LINE,LCLIPS).NE.0) GO TO 100
+C
+C now add left clip if required (lc=1)
+C
+      IF(LC.EQ.1) THEN
+        K2 = 1
+        CALL FMTV(SEQ,IDIMG,K2,RC,LINE,LCLIPS,IDEVN)
+C
+C now add in the right clip info
+C
+        REWIND(IDEVO)
+        IF(FMTC(IDEVO,IDEVN,LINE,RCLIPS).NE.0) GO TO 100
+C
+C Now do seq between RC and end as good seq
+C
+        IF(RC.LT.IDIMG) THEN
+          K2 = RC + 1
+          CALL FMTV(SEQ,IDIMG,K2,IDIMG,LINE,GOODS,IDEVN)
+        END IF
+      ELSE
+C
+C do seq in between LC and RC  as a right clip (lc/=1)
+C
+        K2 = LC
+        CALL FMTV(SEQ,IDIMG,K2,RC,LINE,RCLIPS,IDEVN)
+C
+C now add in the extant right clip info
+C
+        REWIND(IDEVO)
+        IF(FMTC(IDEVO,IDEVN,LINE,RCLIPS).NE.0) GO TO 100
+C
+C Do data up to lc as good seq
+C
+        K2 = 1
+        CALL FMTV(SEQ,IDIMG,K2,LC-1,LINE,GOODS,IDEVN)
+      END IF
+      CLOSE(IDEVO)
+      CLOSE(IDEVN)
+      IOK = 0
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'Error in VPOUT'
+      IOK = 1
+      END
+      SUBROUTINE FMTV(SEQ,IDIMG,LC,RC,LINE,LT,IDEVN)
+      CHARACTER LINE*(*),SEQ(IDIMG),LT*(*)
+      INTEGER RC
+C      WRITE(*,*)'LC,RC IN FMTV',LC,RC
+C
+C  write out seq from lc to rc with line type lt
+C
+      K2 = LC - 1
+ 10   CONTINUE
+      K1 = K2 + 1
+      IF (K2.NE.RC) THEN
+        K2 = MIN(K1+49,RC)
+        WRITE(IDEVN,1000)LT,(SEQ(K),K=K1,K2)
+        GO TO 10
+      END IF
+ 1000 FORMAT(A,50A1)
+      END
+      INTEGER FUNCTION FMTC(IDEVO,IDEVN,LINE,LT)
+      CHARACTER LINE*(*),LT*(*)
+ 10   CONTINUE
+      READ(IDEVO,1000,ERR=100,END=200)LINE
+      IF(LINE(1:2).EQ.LT) THEN
+        WRITE(IDEVN,1000,ERR=100)LINE
+      END IF
+      GO TO 10
+ 100  CONTINUE
+      FMTC = 1
+      RETURN
+ 200  CONTINUE
+      FMTC = 0
+ 1000 FORMAT(A)
+      END
+      INTEGER FUNCTION NCFRS(FROM,TO)
+      CHARACTER FROM*(*),TO*(*)
+      INTEGER RENAME
+      EXTERNAL RENAME
+C
+C rename file using system call to RENAME
+C
+      IFD = 0
+      ITD = 0
+      IF(INDEX(FROM,'/').EQ.0) THEN
+        FROM = './'//FROM
+        IFD = 1
+      END IF
+      IF(INDEX(TO,'/').EQ.0) THEN
+        TO = './'//TO
+        ITD = 1
+      END IF
+      NCFRS = RENAME(FROM,TO)
+      IF(IFD.EQ.1) FROM = FROM(3:)
+      IF(ITD.EQ.1) TO = TO(3:)
+      END
+      SUBROUTINE VPOUTS(IDEVO,NAME,LC,RC,
+     +SEQ,IDIMG,IDEVNG,IDEVN,
+     +VNAME,TNAME,ICB,ICG,IPG,IOK)
+      CHARACTER LINE*80,NAME*(*),MTYPE*4,SEQ(IDIMG),VNAME*(*),TNAME*(*)
+      CHARACTER*2 GOODS,LCLIPS,RCLIPS
+      PARAMETER (GOODS='  ',LCLIPS=';<',RCLIPS=';>')
+      INTEGER RC,FMTC
+      EXTERNAL NCFRS,FMTC
+C
+C IDEVO original file
+C IDEVN new reading file
+C IDEVNG fofn for good data
+C      IDEVN = 6
+C       WRITE(*,*)'LC,RC',LC,RC
+C
+C handles 2 pairs of clip points
+C Possibilities and outcomes:
+C 1. no vector: write name to fofn for good files
+C 2. some vector: write name to fofn for good files, rename old file
+C                to oldname.vector name, rewrite sequence file with new
+C                clip points
+C 3. all vector: do nothing
+C
+C keep lc+1 to rc-1 ie discard 1 to lc and rc to end
+C if no right cut we come in with rc = idimg+1
+C if no left cut lc = 0
+C
+C If no vector:
+C
+      IF((LC.EQ.0).AND.(RC.EQ.IDIMG+1)) THEN
+        CLOSE(IDEVO)
+        WRITE(IDEVNG,1005)NAME
+        ICG = ICG + 1
+        RETURN
+      END IF
+C
+C If all vector: or < 20 bases left
+C
+      IF(RC-LC.LT.20) THEN
+        CLOSE(IDEVO)
+ 1005   FORMAT(A)
+        ICB = ICB + 1
+        RETURN
+      END IF
+C
+C Some vector, so rename original file, and use old name for new file
+C
+      TNAME = NAME
+      CLOSE(IDEVO)
+      K = INDEX(NAME,' ')
+      NAME(K:K) = '.'
+      NAME(K+1:) = VNAME
+      IOK = NCFRS(TNAME,NAME)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error renaming',TNAME
+        RETURN
+      END IF
+C
+C Add the file name to the good file of file names
+C
+      WRITE(IDEVNG,1005)TNAME
+      IPG = IPG + 1
+C
+C Reopen the file with its new name
+C
+      CALL OPENRS(IDEVO,NAME,IOK,LRECL,2)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+C
+C Open the new file with the original name
+C
+      CALL OPENRS(IDEVN,TNAME,IOK,LRECL,1)
+      IF(IOK.NE.0) THEN
+        WRITE(*,*)'Error opening file',NAME
+        RETURN
+      END IF
+      READ(IDEVO,1000,ERR=5)LOR,LCOR,LLOR,MTYPE,NAME
+ 1000   FORMAT(1X,3I6,A,A)
+      GO TO 6
+ 5    CONTINUE
+C
+C  assume no header present (say scanner data)
+C
+      LOR = IDIMG
+      LCOR = 0
+      LLOR = IDIMG
+      MTYPE = 'PLN '
+      NAME = ' '
+ 6    CONTINUE
+      LCOR = LCOR + LC
+      LLOR = LLOR - (LC + IDIMG-RC+1)
+      WRITE(IDEVN,1006,ERR=100)LOR,LCOR,LLOR,MTYPE,
+     +NAME(1:MAX(1,INDEX(NAME,' ')-1))
+ 1006 FORMAT(';',3I6,A,A)
+      IF(FMTC(IDEVO,IDEVN,LINE,LCLIPS).NE.0) GO TO 100
+C
+C now add left clip if required (lc=1)
+C
+      IF(LC.NE.0) THEN
+        K2 = 1
+        CALL FMTV(SEQ,IDIMG,K2,LC,LINE,LCLIPS,IDEVN)
+      END IF
+C
+C now add new right clip if required
+C
+      IF(RC.NE.IDIMG+1) THEN
+        K2 = RC
+        CALL FMTV(SEQ,IDIMG,K2,IDIMG,LINE,RCLIPS,IDEVN)
+      END IF
+C
+C now add in the extant right clip info
+C
+        REWIND(IDEVO)
+        IF(FMTC(IDEVO,IDEVN,LINE,RCLIPS).NE.0) GO TO 100
+C
+C Now do seq between LC+1 and RC-1 as good seq
+C
+      K2 = LC + 1
+      CALL FMTV(SEQ,IDIMG,K2,RC-1,LINE,GOODS,IDEVN)
+      CLOSE(IDEVO)
+      CLOSE(IDEVN)
+      IOK = 0
+      RETURN
+ 100  CONTINUE
+      WRITE(*,*)'Error in VPOUTS'
+      IOK = 1
+      END
+      SUBROUTINE VCUT(SEQV,IDIMV,POSN,WORDP,IDE,SEQH,IDIMH,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +IPSITE)
+      INTEGER SEQV(IDIMV),SEQH(IDIMH)
+      INTEGER POSN(IDIMV),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER RC
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      EXTERNAL NCODEA
+C      CALL BUSY(KBOUT)
+      CALL FILLR(HIST(LENGTH-IDIMV),IDIMH+IDIMV+1,0.)
+      DO 20 I = 1,IDIMH-LENGTH+1
+        J = NCODEA(SEQH(I),LENGTH,CONSTS,IDM,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            K = I - J1
+            HIST(K) = HIST(K) + 1.
+10          CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              K = I - J1
+              HIST(K) = HIST(K) + 1.
+              GO TO 10
+            END IF
+          END IF
+        END IF
+20    CONTINUE
+      CALL PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+      IF(ICSITE.NE.0) THEN
+C
+C  look for primer region
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LC,RC,IPSITE,1)
+C        WRITE(*,*)'LC',LC
+C        WRITE(*,*)'RC',RC
+        IF (LC.EQ.0) THEN
+C          WRITE(KBOUT,*)' ***** No primer site found ********'
+        ELSE
+          LC = RC
+        END IF
+C
+C  look for running into vector at cloning site
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LCR,IRCR,IPSITE,2)
+C
+C set right cut to 1 past the end of the sequence
+C
+C        WRITE(*,*)'LCR',LCR
+        RC = IDIMH + 1
+        IF(LCR.GT.0) THEN
+          RC = LCR
+        END IF
+      ELSE
+C
+C  look for cosmid vector
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LC,RC,IPSITE,3)
+        IF(LC.GT.0) THEN
+C          WRITE(KBOUT,*)
+C     +'>>>>>>>>>>>>>>>>>>>>>>>>diagonal found'
+         RETURN
+        END IF
+      END IF
+      END
+      SUBROUTINE PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      IF(IDIMV.GE.IDIMH) THEN
+        D  = LENGTH
+        DO 10 I=LENGTH-IDIMV,IDIMH-IDIMV-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 10       CONTINUE
+        D  = IDIMH
+        DO 20 I=IDIMH-IDIMV,0
+          HIST(I) = HIST(I)/D
+ 20       CONTINUE  
+        D  = IDIMH - 1
+        DO 30 I=1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 30       CONTINUE
+      ELSE
+        D  = LENGTH
+        DO 40 I=LENGTH-IDIMV,-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 40       CONTINUE
+        D  = IDIMV
+        DO 50 I=0,IDIMH-IDIMV
+          HIST(I) = HIST(I)/D
+ 50       CONTINUE
+        D  = IDIMV - 1
+        DO 60 I=IDIMH-IDIMV+1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 60       CONTINUE
+      END IF
+      END
+      SUBROUTINE FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     + MAXSEQ,CUT,LW,LC,RC,PSITE,JOB)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER RC,PSITE
+C PSITE is primer site
+      LC = 0
+      RC = 0
+      DMAX = 0.
+C If job = 1 look for cloning site from psite to the end of the vector
+C if job = 2 look for vector in the rest of the sequence
+C If job = 3 look for cosmid vector in whole of sequence
+C We discard lc to rc inclusive, lc=0 means discard nothing
+      IF(JOB.EQ.1) THEN
+        I1 = LENGTH - IDIMV
+        I2 =  -ABS(PSITE)
+      ELSE IF(JOB.EQ.2) THEN
+        I1 = -ABS(PSITE)
+        I2 = IDIMH - LENGTH
+      ELSE IF(JOB.EQ.3) THEN
+        I1 = LENGTH - IDIMV
+        I2 = IDIMH - LENGTH
+      ELSE
+        WRITE(*,*)'Error in FCUT'
+        RETURN
+      END IF
+C      WRITE(*,*)'LOOKING AT ',I1,I2
+      DO 10 I=I1,I2
+        DT = HIST(I)
+        IF(DT.GT.DMAX) THEN
+          DMAX = DT
+          ID = I
+        END IF
+ 10     CONTINUE
+      D = 0.
+      DO 35 I=MAX(ID-LW/2,LENGTH-IDIMV),
+     +        MIN(ID+LW/2,IDIMH-LENGTH)
+        D = D + HIST(I)
+ 35   CONTINUE
+C      WRITE(*,*)'Best diagonal, score and local sum',ID,DMAX,D
+C      WRITE(*,1000)ID,DMAX,D
+ 1000 FORMAT(I6,2F10.3)
+      IF(D.LT.CUT) RETURN
+      IF(IDIMV.GE.IDIMH) THEN
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = IDIMH
+        ELSE
+          LC = 1
+          RC = MIN(IDIMH,IDIMV + ID)
+        END IF
+      ELSE
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = MIN(ID+IDIMV,IDIMH)
+        ELSE
+          LC = 1
+          RC = ID + IDIMV
+        END IF
+      END IF
+      WRITE(*,*)' Discard ',LC, ' to ',RC
+      END
+      SUBROUTINE ARRFIM(IDEV,SEQNCE,J,KBOUT)
+C 14-8-91 Added err= option to read, and set length to 0 if error found
+C   AUTHOR: RODGER STADEN
+      CHARACTER TEMP(80),SEQNCE(J)
+      CHARACTER SPACE,ENDCHR,TITCHR
+      SAVE ENDCHR,SPACE,TITCHR
+      DATA ENDCHR/'@'/
+      DATA SPACE/' '/
+      DATA TITCHR/';'/
+      IDMX=J
+      J=0
+1     CONTINUE
+      READ(IDEV,1001,END=30,ERR=40)TEMP
+1001  FORMAT(80A1)
+      IF(TEMP(1).EQ.TITCHR)THEN
+C        WRITE(KBOUT,1003)(TEMP(K),K=2,80)
+C1003    FORMAT(' ',79A1)
+        GO TO 1
+      END IF
+10    CONTINUE
+      DO 20 I=1,80
+        IF(TEMP(I).NE.SPACE)THEN
+          IF(TEMP(I).EQ.ENDCHR)RETURN
+          IF(J.EQ.IDMX)THEN
+            WRITE(KBOUT,1002)IDMX
+1002        FORMAT(
+     +             ' Too much data. Maximum possible',
+     +             ' =',I6,', input stopped there')
+            RETURN
+          END IF
+          J=J+1
+          SEQNCE(J)=TEMP(I)
+        END IF
+20    CONTINUE
+      GO TO 1
+30    CONTINUE
+      RETURN
+ 40   CONTINUE
+      CALL ERROM(KBOUT,'Error reading file')
+      J = 0
+      END
+      SUBROUTINE SHFLCA(STRING,MAXAR,FROMS,TO,FROME)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROMS,FROME,TO
+C
+C  shift left from from to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J) = STRING(I)
+        J = J + 1
+ 10   CONTINUE
+      END
diff --git a/src/staden/xdapSpec.c b/src/staden/xdapSpec.c
new file mode 100644
index 0000000..cb71d49
--- /dev/null
+++ b/src/staden/xdapSpec.c
@@ -0,0 +1,432 @@
+/*
+    Title:       xdapSpec
+
+    File: 	 xdapSpec.c
+    Purpose:	 Definitions specific to the `xdap' program
+    Last update: Fri Dec 14 1990
+
+    Change log:
+
+        3/11/90 SD  Added extra menu option
+	22 August 1991 SD Added new menu option (Find internal joins)
+	    and associated help structures
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xdap' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "fort.h"
+#include "progSpec.h"
+#include "userface.h" /* IMPORT: menu_x */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+
+static Widget menubarWid = (Widget)NULL; /* Initialised by CreateProgMenus */
+
+    
+static MenuData general_menu_data[] =
+{   {"Open a database",	                   3},
+    {"Display a contig",		   5},
+    {"List a text file",		   6},
+    {"Redirect output",		           7}, 
+    {"Calculate a consensus",	           8}, 
+    {"Screen against restriction enzymes", 17}, 
+    {"Screen against vector",	           18}, 
+    {"Check database",		           19}, 
+    {"Copy database",		           24}, 
+    {"Show relationships",		   25}, 
+    {"Set parameters",		           27}, 
+    {"Highlight disagreements",	           28}, 
+    {"Examine quality",		           29}, 
+    {"Find internal joins",	           35}, 
+};
+
+
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",     10},
+    {"Clear text",	   11},
+    {"Draw ruler",	   12},
+    {"Use cross hair",	   13},
+    {"Change margins",	   14},
+    {"Label diagram",	   15},
+    {"Plot map",	   16},
+    {"Plot single contig", 33},
+    {"Plot all contigs",   34},
+};
+
+
+static MenuData modification_menu_data[] =
+{  {"Edit contig",           4},
+   {"Auto assemble",	     20},
+   {"Join contigs",	     22},
+   {"Complement a contig",   23},
+   {"Alter relationships",   26},
+   {"Extract gel readings",  32},
+};
+
+
+static MenuData enter_menu_data[] =
+{   {"Cancel",               2},
+    {"Complete entry",       3},
+    {"Edit contig",          4},
+    {"Display",              5},
+    {"Edit new gel reading", 6},
+};
+
+
+static MenuData join_menu_data[] =
+{   {"Cancel",            2},
+    {"Complete join",     3},
+    {"Edit left contig",  4},
+    {"Display join",      5},
+    {"Edit right contig", 6},
+    {"Move join",         7},
+};
+
+
+static MenuData alter_menu_data[] =
+{   {"Cancel",             2},
+    {"Line change",        3},
+    {"Edit gel reading",   4},
+    {"Delete contig",      5},
+    {"Shift",              6},
+    {"Move gel reading",   7},
+    {"Rename gel reading", 8},
+    {"Break a contig",     9},
+    {"Remove a gel reading",     10},
+    {"Alter raw data parameters", 11},
+};
+
+
+static MenuData edit_menu_data[] =
+{   {"Cancel",  2},
+    {"Insert",  3},
+    {"Delete",  4},
+    {"Change",  5},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button.
+	The button is initially unmanaged within its parent.
+    */
+    buttonWid = XtCreateWidget(menuButtonName, menuButtonWidgetClass,
+			       parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    They are initially all unmanaged.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+    menubarWid = parentWid;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Modification",
+	       modification_menu_data, XtNumber(modification_menu_data));
+    CreateMenu(parentWid, "Enter",
+	       enter_menu_data, XtNumber(enter_menu_data));
+    CreateMenu(parentWid, "Join",
+	       join_menu_data, XtNumber(join_menu_data));
+    CreateMenu(parentWid, "Alter",
+	       alter_menu_data, XtNumber(alter_menu_data));
+    CreateMenu(parentWid, "Edit",
+	       edit_menu_data, XtNumber(edit_menu_data));
+}
+
+
+
+
+/*
+    Special menu functions for SAP.
+
+    The generic menu functions, found in 'dialogues' are only
+    used indirectly by SAP.
+
+      SUBROUTINE DBMENT(MENU,NOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+
+      SUBROUTINE DBMENU(MENU,NOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+
+    If this code were in 'dialogues', then we wouldn't need to
+    keep the reference to 'menubarWid' which is making a big
+    assumption anyway.
+*/
+
+#define dbment_x dbment_
+#define dbmenu_x dbmenu_
+
+
+void dbment_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *MAXOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f KOPT, MOPT, MINMEM; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the 'top level' menus.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    XtManageChild(XtNameToWidget(menubarWid, "General"));
+    XtManageChild(XtNameToWidget(menubarWid, "Screen"));
+    XtManageChild(XtNameToWidget(menubarWid, "Modification"));
+
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	    MAXOPT_p,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+}
+
+
+void dbmenu_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f KOPT, MOPT, MINMEM, MAXOPT; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the menu specified by 'MENU'.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    switch (*MENU_p)
+    {   case 2: XtManageChild(XtNameToWidget(menubarWid, "Enter"));
+                break;
+
+        case 3: XtManageChild(XtNameToWidget(menubarWid, "Join"));
+                break;
+      
+        case 4: XtManageChild(XtNameToWidget(menubarWid, "Alter"));
+                break;
+      
+        case 5: XtManageChild(XtNameToWidget(menubarWid, "Edit"));
+                break;
+    }
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	   &MAXOPT,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+
+}
+	      
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 35;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "DAPHELP";
+const char helpPtrsFN[] = "DAPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "SAP",
+    "Help",
+    "Quit",
+    "Open a database",
+    "Edit contig",
+    "Display a contig",
+    "List a text file",
+    "Direct output to disk",
+    "Calculate a consensus",
+    "Screen edit",
+    "Clear graphics",
+    "Clear text",
+    "Draw ruler",
+    "Use cross hair",
+    "Change margins",
+    "Plot map",
+    "Label diagram",
+    "Screen against restriction enzymes",
+    "Screen against vector",
+    "Check consistency",
+    "Auto assemble",
+    "Enter new gel reading",
+    "Join contigs",
+    "Complement a contig",
+    "Copy database",
+    "Show relationships",
+    "Alter relationships",
+    "set parameters",
+    "Highlight disagreements",
+    "Examine quality",
+    "Auto edit a contig",
+    "Type in gel readings",
+    "Extract gel readings",
+    "Plot single contig",
+    "Plot all contigs",
+    "Find internal joins",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
+
+
+
+
+
+
+
+
+
+
+
+
diff --git a/src/staden/xdapmain.c b/src/staden/xdapmain.c
new file mode 100644
index 0000000..cbd0def
--- /dev/null
+++ b/src/staden/xdapmain.c
@@ -0,0 +1,549 @@
+/*
+    Title: 	 main
+
+    File: 	 main.c
+    Purpose:	 C language entry point and initialisation functions
+    Last update: Monday 20 January 1992
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system. It provides the main event loop,
+    as far as X is concerned.
+
+    Three top-level shells are created: control, dialogue and help.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <stdio.h>    /* IMPORT: freopen, stdout, stderr, fprintf */
+#include "mystdlib.h" /* IMPORT: exit */
+
+#include "main.h"
+#include "dialogues.h"     /* IMPORT: CreateDialogueShell */
+#include "help.h"          /* IMPORT: CreateHelpShell */
+#include "plX.h"           /* IMPORT: CreateGraphicsOutput */
+#include "textOutput.h"    /* IMPORT: CreateTextOutput
+                                      UpdateTextOutput */
+#include "contigEditor.h"  /* IMPORT: CreateEditorShell */
+#include "locks.h"
+#include "mcspec.h"
+#include "postscript.h"    /* IMPORT: closep_x() */
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Types ---- */
+typedef enum {Alive, Dying, Dead} LifeForce;
+
+
+
+
+/* ---- Static variables ---- */
+
+static unsigned int global_argc;
+static char **global_argv;
+static XtAppContext app_con;
+static Display *display;
+static Widget toplevelWid;
+static Widget dialogueShellWid, helpShellWid, outputShellWid,graphicsShellWid;
+static LifeForce lifeForce = Alive;
+static int exitcode;
+/*
+static XrmOptionDescRec options[] =
+{
+};
+*/
+/* The following are used if Xdap cannot be found */
+static String fallback_resources[] = {
+    /* Dialogue shell */
+    /* -------------- */
+    "Xdap.Dialogue.allowShellResize: False",
+    "Xdap.Dialogue.width:	 550",
+    "Xdap.Dialogue.minWidth:  550",
+    "Xdap.Dialogue.maxWidth:  550",
+    "Xdap.Dialogue.height:    180",
+    "Xdap.Dialogue.minHeight: 180",
+    "Xdap.Dialogue.maxHeight: 180",
+    "Xdap.Dialogue.dialogueForm*translations: #override \\n\
+         <Key>Return: CRAction()",
+    /* The default chaining for all children is Top,Top,Left,Left */
+    "Xdap.Dialogue.dialogueForm*top:    chainTop",
+    "Xdap.Dialogue.dialogueForm*bottom: chainTop",
+    "Xdap.Dialogue.dialogueForm*left:   chainLeft",
+    "Xdap.Dialogue.dialogueForm*right:  chainLeft",
+    /* currFile */
+    "Xdap.Dialogue.dialogueForm.currFile.label: Current file:",
+    "Xdap.Dialogue.dialogueForm.currFile.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.currFile.resizable: True",
+    /* currFunc */
+    "Xdap.Dialogue.dialogueForm.currFunc.label: Current function:",
+    "Xdap.Dialogue.dialogueForm.currFunc.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.currFunc.resizable: True",
+    /* quit */
+    "Xdap.Dialogue.dialogueForm.quit.label: Quit application",
+    "Xdap.Dialogue.dialogueForm.quit.foreground:  red",
+    "Xdap.Dialogue.dialogueForm.quit.borderColor: red",
+
+    /* null */
+    "Xdap.Dialogue.dialogueForm.null.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.null.label: Working ... ",
+
+    /* nChoice */
+    "Xdap.Dialogue.dialogueForm.nChoice.resizable: True",
+    /* nChoice buttons */
+    "Xdap.Dialogue.dialogueForm.nChoice.buttons.resizable: True",
+    /* nChoice OK */
+    "Xdap.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3",
+
+    /* getopt */
+    "Xdap.Dialogue.dialogueForm.getopt.width: 540",
+    /* getopt execlabel */
+    "Xdap.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:",
+    "Xdap.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0",
+    /* getopt execcommand */
+    "Xdap.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue",
+    "Xdap.Dialogue.dialogueForm.getopt.execcommand.resizable: True",
+    /* getopt menubar */
+    "Xdap.Dialogue.dialogueForm.getopt.menubar.width:	530",
+    "Xdap.Dialogue.dialogueForm.getopt.menubar.height:	25",
+
+    /* getstring */
+    "Xdap.Dialogue.dialogueForm.getstring.label:",
+    "Xdap.Dialogue.dialogueForm.getstring.value:",
+    "Xdap.Dialogue.dialogueForm.getstring.resizable: True",
+    /* getstring label */
+    "Xdap.Dialogue.dialogueForm.getstring.label.resizable: True",
+    /* getstring value */
+    "Xdap.Dialogue.dialogueForm.getstring.value.width: 350",
+    /* getstring OK */
+    "Xdap.Dialogue.dialogueForm.getstring.OK.borderWidth: 3",
+
+    /* yesno */
+    "Xdap.Dialogue.dialogueForm.yesno.resizable: True",
+    /* yesno label */
+    "Xdap.Dialogue.dialogueForm.yesno.lab.label:",
+    "Xdap.Dialogue.dialogueForm.yesno.lab.borderWidth: 0",
+    "Xdap.Dialogue.dialogueForm.yesno.lab.resizable: True",
+    "Xdap.Dialogue.dialogueForm.yesno.YES.borderWidth: 3",
+
+    /* Help shell */
+    /* ---------- */
+    "Xdap.Help.width:     550",
+    "Xdap.Help.minWidth:  550",
+    "Xdap.Help.height:    400",
+    "Xdap.Help.minHeight: 150",
+    "Xdap.Help*top:    chainTop",
+    "Xdap.Help*bottom: chainTop",
+    "Xdap.Help*left:   chainLeft",
+    "Xdap.Help*right:  chainLeft",
+
+    /* currDisp */
+    "Xdap.Help.helpForm.currDisp.resizable: True",
+    "Xdap.Help.helpForm.currDisp.borderWidth: 0",
+    /* rmHelp */
+    "Xdap.Help.helpForm.rmHelp.label: Remove help",
+    /* button */
+    "Xdap.Help.helpForm.button.label: List of topics",
+    /* topicsViewport */
+    "Xdap.Help.helpForm.topicsViewport.allowVert: True",
+    "Xdap.Help.helpForm.topicsViewport.bottom: chainBottom",
+    "Xdap.Help.helpForm.topicsViewport.right:  chainRight",
+    "Xdap.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1",
+    "Xdap.Help.helpForm.topicsViewporttopicsList.forceColumns:   2",
+    /* text */
+    "Xdap.Help.helpForm.text.editType: edit",
+    "Xdap.Help.helpForm.text.scrollVertical: always",
+    "Xdap.Help.helpForm.text.bottom: chainBottom",
+    "Xdap.Help.helpForm.text.right:  chainRight",
+
+    /* Output shell */
+    /* ------------ */
+    "Xdap.Graphics.graph.width:  550",
+    "Xdap.Graphics.graph.height: 400",
+    "Xdap.Output.text.width:   550",
+    "Xdap.Output.text.height:  100",
+
+    /* Editor shell */
+    /* ------------ */
+    "Xdap.Editor.allowShellResize: True",
+    "Xdap.Editor.mainForm.resizable: True",
+/*
+    "Xdap.Editor*font:  5x8",
+*/
+    "Xdap.Editor*font:  *Fixed-*-20-200-*",
+    /* The default chaining for all children is Top,Top,Left,Left */
+    "Xdap.Editor.mainForm*top:    chainTop",
+    "Xdap.Editor.mainForm*bottom: chainTop",
+    "Xdap.Editor.mainForm*left:   chainLeft",
+    "Xdap.Editor.mainForm*right:  chainLeft",
+    "Xdap.Editor.mainForm.title.borderWidth: 0",
+    "Xdap.Editor.mainForm.title.label: Contig Editor (V1.1)",
+    "Xdap.Editor.mainForm.buttons.borderWidth: 0",
+    "Xdap.Editor.mainForm.buttons.orientation: horizontal",
+    "Xdap.Editor.mainForm.buttons.resizable:  True",
+    "Xdap.Editor.mainForm.buttons*vertDistance: 0",
+    "Xdap.Editor.mainForm.buttons.insert.label: Insert",
+    "Xdap.Editor.mainForm.buttons.replace.label: Replace",
+    "Xdap.Editor.mainForm.buttons.replace.state: True",
+    "Xdap.Editor.mainForm.buttons.superman.label: Super Edit",
+    "Xdap.Editor.mainForm.buttons.superman.state: False",
+    "Xdap.Editor.mainForm.buttons.reveal.label: Reveal Cutoffs",
+    "Xdap.Editor.mainForm.buttons.reveal.state: False",
+    "Xdap.Editor.mainForm.buttons.undo.label: Undo",
+    "Xdap.Editor.mainForm.buttons.save.label: Save",
+    "Xdap.Editor.mainForm.buttons.exit.label: Leave Editor",
+    "Xdap.Editor.mainForm.buttons.exit.foreground:  red",
+    "Xdap.Editor.mainForm.buttons.exit.borderColor: red",
+    "Xdap.Editor.mainForm.buttons.lock.label: Lock",
+    "Xdap.Editor.mainForm.buttons.join.label: Join",
+    "Xdap.Editor.mainForm.buttons.next.label: Next Problem",
+    "Xdap.Editor.mainForm.edForm.resizable: True",
+    "Xdap.Editor.mainForm.edForm.defaultDistance: 0",
+    "Xdap.Editor.mainForm.edForm.borderWidth: 0",
+    "Xdap.Editor.mainForm.edForm.sequences.resizable: True",
+    "Xdap.Editor.mainForm.edForm.names.resizable: True",
+    "Xdap.Editor.mainForm.edForm.slider.length:  500",
+    "Xdap.Editor.mainForm.edForm.slider.orientation:  horizontal",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.defaultDistance:  0",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.borderWidth:  0",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.left:  ChainLeft",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.right:  ChainLeft",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.resizable:  True",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.resize:  True",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.left:  Rubber",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.*.right:  Rubber",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.leftleft.label:  <<",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.left.label:  <",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.right.label:  >",
+    "Xdap.Editor.mainForm.edForm.scrollButtons.rightright.label:  >>",
+    "Xdap.Editor*warnPrompt.label: Save changes? ",
+    "Xdap.Editor*warnBox.borderWidth: 0",
+    "Xdap.Editor*warnBox.orientation: vertical",
+    "Xdap.Editor*warnPrompt.borderWidth: 0",
+    "Xdap.Editor.mainForm.disagreeForm.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.dummy.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.disagreements.resizable: True",
+    "Xdap.Editor.mainForm.disagreeForm.borderWidth: 0",
+    "Xdap.Editor.mainForm.disagreeForm.defaultDistance: 0",
+
+    /* Trace Manager */
+    /* ------------- */
+    "*Traces.mainForm.buttons.borderWidth: 0",
+    "*Traces.mainForm.buttons.orientation: horizontal",
+    "*Traces.mainForm.buttons*top:    ChainTop",
+    "*Traces.mainForm.buttons*bottom: ChainTop",
+    "*Traces.mainForm.buttons*left:   ChainLeft",
+    "*Traces.mainForm.buttons*right:  ChainLeft",
+
+    "*Traces.allowShellResize: True",
+    "*Traces.mainForm.traceForm.resizable: True",
+    "*Traces.mainForm.traceForm.form.resizable: True",
+    "*Traces.mainForm.defaultDistance: 0",
+    "*Traces.mainForm.traceForm.form.defaultDistance: 1",
+    "*Traces.mainForm.traceForm.borderWidth: 0",
+    "*Traces.mainForm.traceForm.form.borderWidth: 0",
+
+    /* The main display: a viewport containing a form containing graphs */
+    "*Traces.mainForm.traceForm.form.top:    Rubber",
+    "*Traces.mainForm.traceForm.form.bottom: Rubber",
+    "*Traces.mainForm.traceForm.form.left: ChainLeft",
+    "*Traces.mainForm.traceForm.form.right: ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.top:    Rubber",
+    "*Traces.mainForm.traceForm.form.viewport.bottom: Rubber",
+    "*Traces.mainForm.traceForm.form.viewport.left:   ChainLeft",
+    "*Traces.mainForm.traceForm.form.viewport.right:  ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.allowHoriz: True",
+    "*Traces.mainForm.traceForm.form.viewport.forcebars:  True",
+    "*Traces.mainForm.traceForm.form.viewport.width:      700",
+    "*Traces.mainForm.traceForm.form.viewport.height: 150",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.defaultDistance: 0",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.resizable: True",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.top:    ChainTop",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.bottom: ChainTop",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.left:   ChainLeft",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.right:  ChainRight",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.trace.top:    ChainTop",  
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.trace.bottom: ChainBottom",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.font: -misc-*-bold-*15-*",
+    /* The text graph heights are dynamically set to accomodate the font */
+    /* The graphs are dynamically initially set to fill the viewport */
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.dimBackground: grey",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour1:  green3",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour2:  purple",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour3:  black",
+    "*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour4:  red",
+
+
+    /* Gadgets */
+    "*Traces.mainForm.traceForm.form.gadgets.borderWidth:    0",
+    "*Traces.mainForm.traceForm.form.gadgets.traceName.borderWidth:    0",
+    "*Traces.mainForm.traceForm.form.gadgets*top:    ChainTop",
+    "*Traces.mainForm.traceForm.form.gadgets*bottom: ChainTop",
+    "*Traces.mainForm.traceForm.form.gadgets*left:   ChainLeft",
+    "*Traces.mainForm.traceForm.form.gadgets*right:  ChainLeft",
+    "*Traces.mainForm.traceForm.form.gadgets.scaleUp.label:  Scale up",
+    "*Traces.mainForm.traceForm.form.gadgets.scaleDown.label:  Scale down",
+
+
+    /* Tag Editor */
+    /* ---------- */
+    "*Tag.mainForm.  defaultDistance: 0",
+    "*Tag.mainForm.buttons.borderWidth: 0",
+    "*Tag.mainForm.buttons*resize: True",
+    "*Tag.mainForm.buttons.orientation: horizontal",
+    "*Tag.mainForm.buttons*top:    ChainTop",
+    "*Tag.mainForm.buttons*bottom: ChainTop",
+    "*Tag.mainForm.buttons*left:   ChainLeft",
+    "*Tag.mainForm.buttons*right:  ChainLeft",
+    "*Tag.mainForm.tagText.width:  300",
+    "*Tag.mainForm.tagText.height: 100",
+    "*Tag.mainForm.tagText.left:   ChainLeft",
+    "*Tag.mainForm.tagText.right:  ChainRight",
+    "*Tag.mainForm.tagText.top:    ChainTop",
+    "*Tag.mainForm.tagText.bottom: ChainBottom",
+    "*Tag.mainForm.tagText.resize: True",
+
+    NULL,
+};
+
+
+
+
+/* ---- Callback routines ---- */
+
+
+static void destroyCallback(Widget w, XtPointer client_data, XtPointer call_data)
+{   lifeForce = Dead;
+}
+
+
+
+
+/* ---- Other functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    fprintf(stderr, "%s: unknown options:", argv[0]);
+    for (i=1; i <argc; i++)
+	fprintf(stderr, " %s", argv[i]);
+    fprintf(stderr, "\n\n");
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr, 
+"       [-geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]]\n");
+    fprintf(stderr,
+"       [-display [{host}]:[{vs}]]\n");
+    fprintf(stderr,
+"       [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]\n");
+}
+
+
+
+
+static void finalxPhase2(void)
+/*
+    Recover all X resources and exit with `status'.
+    This function does not return.
+*/
+{
+    /* close any opened postscript file. */
+    closep_x();
+
+    XtDestroyApplicationContext(app_con);
+
+    exit(exitcode);
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+#ifdef sgi
+#    define VMS_FLAGS	4
+#    define VMS_CC	0
+#    define OLD_RL	1
+#    define VMS_IN	2
+#    define VMS_EF	3
+extern unsigned short f77vms_flag_[VMS_FLAGS];
+#endif /* sgi */
+
+void main(unsigned int argc, char **argv)
+{
+    activeLock = semaphoreCreate(65535/*a big number for max*/);
+
+#ifdef sgi
+    /* Fix on Iris Indigo to allow fortran to use a C main() */
+    f77vms_flag_[OLD_RL] = 1;
+#endif /* sgi */
+
+    global_argc = argc;
+    global_argv = argv;
+
+    fmain_();
+
+    finalx(0);
+    finalxPhase2();
+}
+
+
+
+
+void initx_(void)
+/*
+    Initialise the X system, creating all necessary fixtures and
+    fittings for the application.
+    `stdout' may be redirected.
+*/
+{
+
+
+    toplevelWid = XtAppInitialize(&app_con, "Xdap",
+				  NULL, (Cardinal) 0,
+				  (int *)&global_argc, global_argv,
+				  fallback_resources,
+				  NULL, (Cardinal) 0);
+
+    XtAddCallback(toplevelWid, XtNdestroyCallback, destroyCallback, NULL);
+    display = XtDisplay(toplevelWid);
+
+    if (global_argc != 1)
+    {   usage(global_argc, global_argv);
+        finalx(1);
+    }
+
+    /*
+        Create the dialogue shell.
+    */
+    dialogueShellWid = CreateDialogueShell(toplevelWid);
+
+    /*
+        Create the graphics shell.
+    */
+    graphicsShellWid = XtCreatePopupShell("Graphics", topLevelShellWidgetClass,
+					 toplevelWid,
+					 NULL, (Cardinal) 0);
+
+    (void) CreateGraphicsOutput(graphicsShellWid);
+
+    /*
+        Create the output shell.
+    */
+    outputShellWid = XtCreatePopupShell("Output", topLevelShellWidgetClass,
+					 toplevelWid,
+					 NULL, (Cardinal) 0);
+
+    (void) CreateTextOutput(outputShellWid);
+
+    /*
+        Create the help shell.
+    */
+    helpShellWid = CreateHelpShell(toplevelWid);
+
+    /*
+    **  Create the editor shell
+    */
+    (void) CreateEditorShell(toplevelWid);
+
+    /*
+    **  Create the trace manager shell
+    */
+    CreateTraceManager(toplevelWid);
+
+    /*
+        The output, graphics and dialogue shells are initially displayed.
+    */
+    XtPopup(dialogueShellWid, XtGrabNone);
+    XtPopup(graphicsShellWid,   XtGrabNone);
+    XtPopup(outputShellWid,   XtGrabNone);
+
+}
+
+
+
+
+void finalx(int status)
+/*
+    Recover all X resources and exit with `status'.
+
+    Death under X is a two phase process. XtDestroyWidget is called on
+    the toplevel, but this may only take effect when we return to the
+    main loop. When it does take effect, the toplevel destroyCallback
+    will be called and set ``lifeForce'' to dead, which indicates that
+    the second phase can be called. This then calls exit.
+*/
+{   lifeForce = Dying;
+    exitcode = status;
+
+    XtDestroyWidget(toplevelWid);
+
+    if (lifeForce == Dead) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchEventsUntil(Boolean (*stopPred)(void))
+/*
+    X events are cycled until the stopping predicate returns true or
+    we are in the process of dying. If this is the case, then the
+    second phase of finalisation is carried out and this function
+    does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && !stopPred())
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+void dispatchCurrentEvents()
+/*
+    All X events in the queue are cycled. If we are in the process of
+    dying, then the second phase of finalisation is carried out and
+    this function does not return.
+*/
+{   XEvent event;
+
+    UpdateTextOutput();
+
+    while ((lifeForce==Alive) && (XtAppPending(app_con) != 0))
+    {   XtAppNextEvent(app_con, &event);
+	XtDispatchEvent(&event);
+    }
+
+    if (lifeForce!=Alive) finalxPhase2(); /* Doesn't return */
+}
+
+
+
+
+
+Display *GetDisplay(void)
+/*
+    Return the X display running.
+    This function can only be called after initx_().
+*/
+{   return(display);
+}
diff --git a/src/staden/xmenu.c b/src/staden/xmenu.c
new file mode 100644
index 0000000..8059934
--- /dev/null
+++ b/src/staden/xmenu.c
@@ -0,0 +1,105 @@
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+#include "userface.h" /* IMPORT: menu_x */
+#include "helpnmenu.h" /* IMPORT: createmenu, menuarr */
+
+#include <stdio.h>
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+/* ---- Static variables ---- */
+
+
+static Widget menubarWid = (Widget)NULL; /* Initialised by CreateProgMenus */
+static int num_menus = 0;
+
+    
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+/* ---- Private functions ---- */
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       menuarr *md, int num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button.
+    */
+    buttonWid = XtCreateManagedWidget(menuButtonName, menuButtonWidgetClass,
+			       parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++) {
+	/* Don't want help/quit/menus in our X menus */
+	Widget entryWid;
+
+	if (md[i].number < 3)
+	    continue;
+	entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+					 menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+/* ---- Exported functions ---- */
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    They are initially all unmanaged.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{
+    int i, menulen;
+    menuarr menu[MAXOPTS];
+
+    externalCallbackProc = cbp;
+    externalClient_data = client_data;
+    menubarWid = parentWid;
+
+    for (i = 1; i<MAXMENUS; i++) {
+	menulen = create_menu(i, menu, MAXOPTS*sizeof(menuarr));
+	if (menulen < 3)
+	    break; 
+	CreateMenu(parentWid, (String)(helpindex-i)->name, menu, menulen);
+    }
+    num_menus = i;
+}
diff --git a/src/staden/xmepSpec.c b/src/staden/xmepSpec.c
new file mode 100644
index 0000000..395126b
--- /dev/null
+++ b/src/staden/xmepSpec.c
@@ -0,0 +1,221 @@
+/*
+    Title: 	 xmepSpec
+
+    File: 	 xmepSpec.c
+    Purpose:     Definition specific to `xmep'
+    Last update: Wed Jun 20 1990
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xmep' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+    
+static MenuData general_menu_data[] =
+{   {"Read new sequence",                       3},
+    {"Redefine active region",                  4},
+    {"List the sequences",                      5},
+    {"List text file",                          6},
+    {"Direct text output to disc",              7},
+    {"Search for strings",                      17},
+    {"Compare a sequence",                      18},
+    {"Find inverted repeats",                   33},
+};
+  
+  
+static MenuData screen_menu_data[] =
+{   {"Clear graphics", 10},
+    {"Clear text",     11},
+    {"Draw a ruler",   12},
+    {"Use cross hair", 13},
+    {"Reset margins",  14},
+    {"Label diagram",  15},
+    {"Draw map",       16},
+};
+
+
+static  MenuData analysis_menu_data[] =
+{   {"Set strand",                       18},
+    {"Set composition",                  19},
+    {"Set word mask",                    20},
+    {"Set number of mismatches",         21},
+    {"Show settings",                    22},
+    {"Make dictionary Dw",               23},
+    {"Make dictionary Ds",               24},
+    {"Make fuzzy dictionary Dm from Dw", 25},
+    {"Make fuzzy dictionary Dm from Ds", 26},
+    {"Make fuzzy dictionary Dh from Dm", 27},
+    {"Examine fuzzy dictionary Dm",      28},
+    {"Examine fuzzy dictionary Dh",      29},
+    {"Examine words in Dm",              30},
+    {"Examine words in Dh",              31},
+    {"Save or restore a dictionary",     32},
+};
+
+
+
+
+/* ---- Callback functions ---- */
+
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button
+    */
+    buttonWid = XtCreateManagedWidget(menuButtonName, menuButtonWidgetClass,
+				      parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Dictionary analysis",
+	       analysis_menu_data, XtNumber(analysis_menu_data));
+}
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 35;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "MEPHELP";
+const char helpPtrsFN[] = "MEPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "NIP",
+    "Help",
+    "Quit",
+    "Read new sequences",
+    "Redefine active region",
+    "List the sequences",
+    "List text file",
+    "Direct output to disk",
+    "Clear graphics",
+    "Clear text",
+    "Draw ruler",
+    "Use cross hair",
+    "Reset margins",
+    "Label diagram",
+    "Draw map",
+    "Search for strings",
+    "Set strand",
+    "Set composition",
+    "Set word length",
+    "Set number of mismatches",
+    "Show settings",
+    "Make dictionary Dw",
+    "Make dictionary Ds",
+    "Make fuzzy dictionary Dm from Dw",
+    "Make fuzzy dictionary Dm from Ds",
+    "Make fuzzy dictionary Dh from Dm",
+    "Examine fuzzy dictionary Dm",
+    "Examine fuzzy dictionary Dh",
+    "Examine words in Dm",
+    "Examine words in Dh",
+    "Save or restore a dictionary",
+    "Find inverted repeats",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/xnipSpec.c b/src/staden/xnipSpec.c
new file mode 100644
index 0000000..65402e5
--- /dev/null
+++ b/src/staden/xnipSpec.c
@@ -0,0 +1,321 @@
+/*
+    Title:       xnipSpec
+
+    File: 	 xnipSpec.c
+    Purpose:	 Definitions specific to `xnip'
+    Last update: Tue May 19 1990
+*/
+
+
+/*
+    This module contains the setup for the menus for the `xnip' program
+    and also information needed for `xnip' help.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String	name;   /* IMPORT: Name of function */
+    int 	number; /* IMPORT: Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+    
+static MenuData general_menu_data[] =
+{   {"Read new sequence",                   3},
+    {"Redefine active region",              4},
+    {"List the sequence",                   5},
+    {"List a text file",                    6},
+    {"Direct text output to disc",          7},
+    {"Write active sequence to disc",       8},
+    {"Edit the sequence",                   9},
+    {"Restriction enzyme search",           17},
+    {"Compare a sequence",                  18},
+    {"Reverse and complement the sequence", 29},
+    {"Interconvert t and u",                66},
+};
+  
+  
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",   10},
+    {"Clear text",       11},
+    {"Draw a ruler",     12},
+    {"Use cross hair",   13},
+    {"Reposition plots", 14},
+    {"Label diagram",    15},
+    {"Display a map",    16},
+};
+
+
+static  MenuData statistics_menu_data[] =
+{   {"Count base composition",                             21},
+    {"Count dinucleotide frequencies",                     22},
+    {"Count codons and amino acids",                       23},
+    {"Plot base composition",                              24},
+    {"Plot local deviations in base composition",          25},
+    {"Plot local deviations in dinucleotide composition",  26},
+    {"Plot local deviations in trinucleotide composition", 27},
+    {"Plot negentropy",                                    59},
+    {"Plot observed-expected word frequencies",            64},
+};
+
+
+static  MenuData structure_menu_data[] =
+{   {"Search for hairpin loops",                 30},
+    {"Search for long range inverted repeats",   31},
+    {"Search for repeats",                       32},
+    {"Examine repeats",                          38},
+    {"Search for z dna(total ry,yr)",            33},
+    {"Search for z dna(runs of ry or yr",        34},
+    {"Search for z dna(best phased value)",      35},
+    {"Find local similarity or complementarity", 36},
+};
+
+
+static  MenuData translation_menu_data[] =
+{   {"Set genetic code",                             37},
+    {"Translate and list in up to six phases",       39},
+    {"Translate and write protein sequence to disc", 40},
+    {"Calculate codon constraint",                   28},
+    {"Count codons and amino acids",                 23},
+    {"Write codon table to disk",                    41},
+    {"Search for open reading frames",               54},
+};
+
+
+static  MenuData gene_menu_data[] =
+{   {"Codon usage",                                     42},
+    {"Positional base preferences",                     43},
+    {"Uneven positional base frequencies",              44},
+    {"Codon improbability (on base composition)",       45},
+    {"Codon improbability (on amino acid composition)", 46},
+    {"Shepherd RNY preference",                         47},
+    {"Fickett testcode",                                48},
+    {"tRNA gene search",                                49},
+};
+
+
+static  MenuData specific_signals_menu_data[] =
+{   {"Plot e. coli promters (general)",                 55},
+    {"Plot e. coli promters (complementary)",           56},
+    {"Plot e. coli promters (-35 and -10)",             57},
+    {"Plot e. coli ribosome binding sites",             58},
+    {"Plot eukaryotic ribosome binding sites",          61},
+    {"Plot splice junctions",                           62},
+    {"Plot polya sites",                                65},
+};
+
+
+static  MenuData general_signals_menu_data[] =
+{   {"Compare a short sequence",                        18},
+    {"Compare a sequence using a score matrix",         19},
+      {"Search using a weight matrix",                  20},
+      {"Search using a weight matrix (complementary)",  63},
+      {"Search using a dinucleotide weight matrix",     60},
+   {"Plot start codons",                               50},
+    {"Plot stop codons",                                51},
+    {"Plot complementary stop codons",                  52},
+    {"Plot stop codons on both strands",                53},
+    {"Search for patterns of motifs",                   67},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button
+    */
+    buttonWid = XtCreateManagedWidget(menuButtonName, menuButtonWidgetClass,
+				      parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Statistics",
+	       statistics_menu_data, XtNumber(statistics_menu_data));
+    CreateMenu(parentWid, "Structure",
+	       structure_menu_data, XtNumber(structure_menu_data));
+    CreateMenu(parentWid, "Translation and codons",
+	       translation_menu_data, XtNumber(translation_menu_data));
+    CreateMenu(parentWid, "Gene search by content",
+	       gene_menu_data, XtNumber(gene_menu_data));
+    CreateMenu(parentWid, "General signals menu",
+	       general_signals_menu_data, XtNumber(general_signals_menu_data));
+    CreateMenu(parentWid, "Specific signals menu",
+	       specific_signals_menu_data, XtNumber(specific_signals_menu_data));
+}
+
+
+
+
+const int botHelpOpt = -10;
+const int topHelpOpt = 70;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "NIPHELP";
+const char helpPtrsFN[] = "NIPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "NIP",
+    "HELP",
+    "Quit",
+    "read a new sequence",
+    "define active region",
+    "list the sequence",
+    "list a text file",
+    "direct output to disk",
+    "write active sequence to disk",
+    "edit the sequence",
+    "clear graphics screen",
+    "clear text screen",
+    "draw a ruler",
+    "use cross hair",
+    "reposition plots",
+    "label diagram",
+    "display a map",
+    "restriction enzyme search",
+    "compare a sequence",
+    "compare a sequence using a score matrix",
+    "search using a weight matrix",
+    "count base frequencies",
+    "count dinucleotide frequencies",
+    "count codons and amino acids",
+    "plot base composition",
+    "plot deviations in base composition",
+    "plot deviations in dinucleotide composition",
+    "plot deviations in trinucleotide composition",
+    "calculate codon constraint",
+    "plot negentropy",
+    "search for hairpin loops",
+    "search for long range inverted repeats",
+    "search for repeats",
+    "search for Z DNA (total RY, YR)",
+    "search for Z DNA (runs of RY or YR)",
+    "search for Z DNA (best phased value)",
+    "find local similarity or complementarity",
+    "set genetic code",
+    "examine repeats",
+    "translate and list in 6 phases",
+    "translate and write protein to disk",
+    "calculate and write codon table to disk",
+    "gene search using codon preference",
+    "gene search using positional base preferences",
+    "gene search using uneven positional base frequencies",
+    "gene search using codon improbability (on base composition)",
+    "gene search using codon improbability (on amino acid composition)",
+    "gene search using RNY preference",
+    "gene search using Ficketts method",
+    "tRNA gene search",
+    "plot start codons",
+    "plot stop codons",
+    "plot complementary stop codons",
+    "plot stop codons on both strands",
+    "find longest open reading frames",
+    "plot E. coli promoters (general)",
+    "plot E. coli promoters (complementary)",
+    "plot E. coli promoters (-5 and -10 separate)",
+    "plot E. coli ribosome binding sites",
+    "reverse and complement the active sequence",
+    "universal signal plot (on dinucleotides)",
+    "plot eukaryotic ribosome binding sites",
+    "plot splice junctions",
+    "search the complementary strand using a weight matrix",
+    "plot observed-expected word frequencies",
+    "plot polyA sites",
+    "interconvert T and U",
+    "search for patterns of motifs",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/xpipSpec.c b/src/staden/xpipSpec.c
new file mode 100644
index 0000000..a561299
--- /dev/null
+++ b/src/staden/xpipSpec.c
@@ -0,0 +1,228 @@
+/*
+    Title: 	 xpipSpec
+
+    File: 	 xpipSpec.c
+    Purpose:	 Definitions specific to `xpip'
+    Last update: Wed Jun 20 1990
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xpip' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+    
+static MenuData general_menu_data[] =
+{   {"Read new sequence",                       3},
+    {"Redefine active region",                  4},
+    {"List the sequence",                       5},
+    {"List a text file",                        6},
+    {"Direct text output to disc",              7},
+    {"Write active sequence to disc",           8},
+    {"Edit the sequence",                       9},
+    {"Short sequence search",                   17},
+    {"Compare a sequence",                      18},
+    {"Compare a sequence using a score matrix", 19},
+    {"Back translate to dna",                   27},
+};
+  
+  
+static MenuData screen_menu_data[] =
+{   {"Clear graphics", 10},
+    {"Clear text",     11},
+    {"Draw a ruler",   12},
+    {"Use cross hair", 13},
+    {"Reset margins",  14},
+    {"Label diagram",  15},
+    {"Display a map",  16},
+};
+
+
+static  MenuData statistics_menu_data[] =
+{   {"Count amino acid composition", 21},
+    {"Plot hydrophobicity",          22},
+    {"Plot charge",                  23},
+    {"Plot hydrophobic moment",      25},
+};
+
+
+static  MenuData structure_menu_data[] =
+{   {"Plot hydrophobicity",          22},
+    {"Plot charge",                  23},
+    {"Plot robson prediction",       24},
+    {"Plot hydrophobic moment",      25},
+    {"Draw helix wheel",             26},
+};
+
+
+static  MenuData search_menu_data[] =
+{   {"Search for short sequences",                  17},
+    {"Compare a sequence",                          18},
+    {"Compare a sequence using a score matrix",     19},
+    {"Search for a sequence using a weight matrix", 20},
+    {"Search for patterns of motifs",               28},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button
+    */
+    buttonWid = XtCreateManagedWidget(menuButtonName, menuButtonWidgetClass,
+				      parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Statistics",
+	       statistics_menu_data, XtNumber(statistics_menu_data));
+    CreateMenu(parentWid, "Structure",
+	       structure_menu_data, XtNumber(structure_menu_data));
+    CreateMenu(parentWid, "Search",
+	       search_menu_data, XtNumber(search_menu_data));
+}
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 28;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "PIPHELP";
+const char helpPtrsFN[] = "PIPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "PIP",
+    "Help",
+    "Quit",
+    "read a new sequence",
+    "define active region",
+    "list the sequence",
+    "list a text file",
+    "direct output to disk",
+    "write active sequence to disk",
+    "edit the sequence",
+    "clear graphics screen",
+    "clear text screen",
+    "draw a ruler",
+    "use cross hair",
+    "reposition plots",
+    "label diagram",
+    "display a map",
+    "search for short sequences",
+    "compare a sequence",
+    "compare a sequence using a score matrix",
+    "search for a sequence using a weight matrix",
+    "calculate amino acid composition",
+    "plot hydrophobicity",
+    "plot charge",
+    "plot Robson prediction",
+    "plot hydrophobic moment",
+    "draw helix wheel",
+    "back translate",
+    "search for patterns of motifs",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/xsapConEdit.f b/src/staden/xsapConEdit.f
new file mode 100644
index 0000000..b1fa0f5
--- /dev/null
+++ b/src/staden/xsapConEdit.f
@@ -0,0 +1,137 @@
+C 1-10-91 SD  Remove prompting from CONEDT and JOINED
+C 1-10-91 SD  Removed LLINOL and LLINOR from DOJOIN
+C 20-Aug-92 SD Added new IOK argument to CXEDIT and JXEDIT call
+C
+      SUBROUTINE CONEDT(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,
+     +IDEVT,IDEVC,LINLEN,FILNAM,LLINO,IOK,IERR,TEMP1)
+C   AUTHOR: SIMON DEAR
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2)
+      CALL SHOWFU(KBOUT,'Contig editor')
+C      WRITE(KBOUT,*)'Identify contig to edit'
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LINCON,
+     +LLINO,IGELNO,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) RETURN
+      CALL CXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +IDBSIZ,LINCON,LLINO,IGELNO,1,PERCD,IDM,0,IOK)
+      CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +TEMP1,IERR,KBOUT)
+      END
+      SUBROUTINE JOINED(KBIN,KBOUT,
+     +GELNOS,GELSTR,GELEND,
+     +MAXDB,GELNO,LINNO,MAXLIN,RELPG,LNGTHG,LNBR,RNBR,
+     +IDBSIZ,NGELS,NCONTS,GEL,GEL2,MAXGEL,LINCON,PERCD,IDM,
+     +IHELPS,IHELPE,HELPF,IDEVH,IDEV,IDEV1,IDEV2,IDEV3,
+     +IDEVT,IDEVC,LINLEN,FILNAM,LLINO,IOK,IERR,TEMP1)
+C   AUTHOR: SIMON DEAR
+      INTEGER RELPG(MAXDB),LNGTHG(MAXDB),LNBR(MAXDB),RNBR(MAXDB)
+      CHARACTER GEL(MAXGEL),GEL2(MAXGEL)
+      CHARACTER FILNAM*(*),HELPF*(*)
+      INTEGER GELNOS(MAXDB),GELSTR(MAXDB),GELEND(MAXDB)
+      INTEGER GELNO(MAXLIN,2),LINNO(MAXLIN,2)
+C   JOIN
+      CALL SHOWFU(KBOUT,'Join editor')
+C     JOIN
+      WRITE(KBOUT,1006)
+1006  FORMAT(' Which contigs do you want to join ?')
+C      WRITE(KBOUT,1017)
+C1017  FORMAT( ' Left contig')
+      LLINOL = 0
+      LLINOR = 0
+      CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONL,
+     +LLINOL,IGELL,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.EQ.0)THEN
+C        WRITE(KBOUT,1018)
+C1018    FORMAT( ' Right contig')
+        CALL GETLN2(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,LNCONR,
+     +  LLINOR,IGELR,IOK,IDBSIZ,KBIN,KBOUT,IDEV3,
+     +  IHELPS,IHELPE,HELPF,IDEVH)
+        IF(IOK.EQ.0)THEN
+C         CHECK TO SEE IF MAKING A CIRCLE!
+          IF(LLINOL.EQ.LLINOR)THEN
+            WRITE(KBOUT,*)' Making circles is not allowed!'
+            GO TO 200
+          END IF
+          CALL JXEDIT(IDEV1,IDEV2,IDEV3,IDEVT,IDEVC,
+     +    RELPG,LNGTHG,LNBR,RNBR,MAXGEL,
+     +    IDBSIZ,LNCONL,LLINOL,LNCONR,LLINOR,
+     +    IGELL,1,IGELR,1,PERCD,NGELS,NCONTS,IDM,0,IOK)
+          CALL DBCHEK(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,IDBSIZ,
+     +    TEMP1,IERR,KBOUT)
+        END IF
+      END IF
+200   CONTINUE
+      END
+      SUBROUTINE DOJOIN(RELPG,LNGTHG,LNBR,RNBR,NGELS,NCONTS,
+     +LNCONL,LNCONR,
+     +IDBSIZ,IDEVR,IDEVW,
+     +RELX)
+C   AUTHOR: RODGER STADEN
+C   TAKEN FROM: JOIN
+      INTEGER LLINOL,LLINOR
+      INTEGER RELPG(IDBSIZ)
+      INTEGER X,RELX
+      INTEGER LNGTHG(IDBSIZ),RNBR(IDBSIZ),LNBR(IDBSIZ)
+C       SET UP LLINOL, LLINOR
+        LLINOL = LNBR(LNCONL)
+        LLINOR = LNBR(LNCONR)
+C       COMPLETE JOIN
+C       ADJUST ALL RELATIVE POSITIONS IN RIGHT CONTIG
+        N=LLINOR
+        RELPG(N)=RELX
+50      CONTINUE
+        IF(RNBR(N).EQ.0)GO TO 60
+        N=RNBR(N)
+        RELPG(N)=RELPG(N)+RELX-1
+        GO TO 50
+60      CONTINUE
+C
+C       FIX UP NEW GEL LINE FOR OLD LEFT OF RIGHT CONTIG
+        LNBR(LLINOR)=RNBR(LNCONL)
+C       FIX UP RIGHT GEL OF LEFT CONTIG
+        N=RNBR(LNCONL)
+        RNBR(N)=LLINOR
+        CALL MERGE(RELPG,LNGTHG,LNBR,RNBR,LNCONL,IDBSIZ)
+C       MERGE DOES NOT WRITE TO DISK
+        N=LNBR(LNCONL)
+65      CONTINUE
+      CALL WRITER(IDEVR,N,RELPG(N),LNGTHG(N),
+     +LNBR(N),RNBR(N))
+        N=RNBR(N)
+        IF(N.NE.0)GO TO 65
+C       CONTIG LINES
+        X=RELPG(LNCONR)+RELX-1
+        IF(X.GT.RELPG(LNCONL))RELPG(LNCONL)=X
+      CALL WRITER(IDEVR,LNCONL,RELPG(LNCONL),LNGTHG(LNCONL),
+     +LNBR(LNCONL),RNBR(LNCONL))
+C       NOW MOVE ALL DATA DOWN TO DELETE OLD RIGHT END
+        N=IDBSIZ-NCONTS
+        M=LNCONR-N
+        IF(M.EQ.0)GO TO 80
+        K=LNCONR
+        J=LNCONR-1
+        DO 70 I=1,M
+          RELPG(K)=RELPG(J)
+          LNGTHG(K)=LNGTHG(J)
+          LNBR(K)=LNBR(J)
+          RNBR(K)=RNBR(J)
+      CALL WRITER(IDEVR,K,RELPG(K),LNGTHG(K),
+     +LNBR(K),RNBR(K))
+          K=K-1
+          J=J-1
+70      CONTINUE
+80      CONTINUE
+        NCONTS=NCONTS-1
+      CALL WRITER(IDEVR,IDBSIZ,NGELS,NCONTS,NGELS,NCONTS)
+        RETURN
+      END
diff --git a/src/staden/xsapSpec.c b/src/staden/xsapSpec.c
new file mode 100644
index 0000000..400272f
--- /dev/null
+++ b/src/staden/xsapSpec.c
@@ -0,0 +1,416 @@
+/*
+    Title:       xsapSpec
+
+    File: 	 xsapSpec.c
+    Purpose:	 Definitions specific to the `xsap' program
+    Last update: Mon Jun 25 1990
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xsap' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "fort.h"
+#include "progSpec.h"
+#include "userface.h" /* IMPORT: menu_x */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+
+static Widget menubarWid = (Widget)NULL; /* Initialised by CreateProgMenus */
+
+    
+static MenuData general_menu_data[] =
+{   {"Open a database",	                   3},
+    {"Display a contig",		   5},
+    {"List a text file",		   6},
+    {"Redirect output",		           7}, 
+    {"Calculate a consensus",	           8}, 
+    {"Screen against restriction enzymes", 17}, 
+    {"Screen against vector",	           18}, 
+    {"Check database",		           19}, 
+    {"Copy database",		           24}, 
+    {"Show relationships",		   25}, 
+    {"Set parameters",		           27}, 
+    {"Highlight disagreements",	           28}, 
+    {"Examine quality",		           29}, 
+    {"Find internal joins",	           35}, 
+};
+
+
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",     10},
+    {"Clear text",	   11},
+    {"Draw ruler",	   12},
+    {"Use cross hair",	   13},
+    {"Change margins",	   14},
+    {"Label diagram",	   15},
+    {"Plot map",	   16},
+    {"Plot single contig", 33},
+    {"Plot all contigs",   34},
+};
+
+
+static MenuData modification_menu_data[] =
+{  {"Edit contig",           4},
+   {"Screen edit",	     9},
+   {"Auto assemble",	     20},
+   {"Enter new gel reading", 21},
+   {"Join contigs",	     22},
+   {"Complement a contig",   23},
+   {"Alter relationships",   26},
+   {"Auto edit a contig",    30},
+   {"Type in gel readings",  31},
+   {"Extract gel readings",  32},
+};
+
+
+static MenuData enter_menu_data[] =
+{   {"Cancel",               2},
+    {"Complete entry",       3},
+    {"Edit contig",          4},
+    {"Display",              5},
+    {"Edit new gel reading", 6},
+};
+
+
+static MenuData join_menu_data[] =
+{   {"Cancel",            2},
+    {"Complete join",     3},
+    {"Edit left contig",  4},
+    {"Display join",      5},
+    {"Edit right contig", 6},
+    {"Move join",         7},
+};
+
+
+static MenuData alter_menu_data[] =
+{   {"Cancel",             2},
+    {"Line change",        3},
+    {"Edit gel reading",   4},
+    {"Delete contig",      5},
+    {"Shift",              6},
+    {"Move gel reading",   7},
+    {"Rename gel reading", 8},
+    {"Break a contig",     9},
+};
+
+
+static MenuData edit_menu_data[] =
+{   {"Cancel",  2},
+    {"Insert",  3},
+    {"Delete",  4},
+    {"Change",  5},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button.
+	The button is initially unmanaged within its parent.
+    */
+    buttonWid = XtCreateWidget(menuButtonName, menuButtonWidgetClass,
+			       parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    They are initially all unmanaged.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+    menubarWid = parentWid;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Modification",
+	       modification_menu_data, XtNumber(modification_menu_data));
+    CreateMenu(parentWid, "Enter",
+	       enter_menu_data, XtNumber(enter_menu_data));
+    CreateMenu(parentWid, "Join",
+	       join_menu_data, XtNumber(join_menu_data));
+    CreateMenu(parentWid, "Alter",
+	       alter_menu_data, XtNumber(alter_menu_data));
+    CreateMenu(parentWid, "Edit",
+	       edit_menu_data, XtNumber(edit_menu_data));
+}
+
+
+
+
+/*
+    Special menu functions for SAP.
+
+    The generic menu functions, found in 'dialogues' are only
+    used indirectly by SAP.
+
+      SUBROUTINE DBMENT(MENU,NOPT,MAXOPT,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS(0:MAXOPT),IHELPE(0:MAXOPT)
+
+      SUBROUTINE DBMENU(MENU,NOPT,IHELPS,IHELPE,HELPF,IDEVH,
+     +KBIN,KBOUT)
+      CHARACTER HELPF*(*)
+      INTEGER IHELPS,IHELPE
+
+    If this code were in 'dialogues', then we wouldn't need to
+    keep the reference to 'menubarWid' which is making a big
+    assumption anyway.
+*/
+
+#define dbment_x dbment_
+#define dbmenu_x dbmenu_
+
+
+void dbment_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *MAXOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f KOPT, MOPT, MINMEM; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the 'top level' menus.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    XtManageChild(XtNameToWidget(menubarWid, "General"));
+    XtManageChild(XtNameToWidget(menubarWid, "Screen"));
+    XtManageChild(XtNameToWidget(menubarWid, "Modification"));
+
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	    MAXOPT_p,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+}
+
+
+void dbmenu_x(int_f *MENU_p,
+	      int_f *NOPT_p,
+	      int_f *IHELPS_p,
+	      int_f *IHELPE_p,
+	      char *HELPF_p,
+	      int_f *IDEVH_p,
+	      int_f *KBIN_p,
+	      int_f *KBOUT_p,
+	      int_fl  HELPF_l)
+{   WidgetList sprogs;
+    int        nSprogs;
+    int_f KOPT, MOPT, MINMEM, MAXOPT; /* Dummy arguments to keep menu_x happy */
+    Arg args[10];
+    int nargs;
+
+
+    /*
+        All the menus are children of 'menubarWid'.
+	Manage just the menu specified by 'MENU'.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNchildren,    &sprogs); nargs++;
+    XtSetArg(args[nargs], XtNnumChildren, &nSprogs); nargs++;
+    XtGetValues(menubarWid, args, nargs);
+    XtUnmanageChildren(sprogs, nSprogs);
+
+    switch (*MENU_p)
+    {   case 2: XtManageChild(XtNameToWidget(menubarWid, "Enter"));
+                break;
+
+        case 3: XtManageChild(XtNameToWidget(menubarWid, "Join"));
+                break;
+      
+        case 4: XtManageChild(XtNameToWidget(menubarWid, "Alter"));
+                break;
+      
+        case 5: XtManageChild(XtNameToWidget(menubarWid, "Edit"));
+                break;
+    }
+
+    /*
+        Run the menu with the usual mechanism.
+    */
+    menu_x( NOPT_p,
+	   &KOPT,
+	   &MOPT,
+	   &MAXOPT,
+	   &MINMEM,
+	    KBIN_p,
+	    KBOUT_p,
+	    IHELPS_p,
+	    IHELPE_p,
+	    HELPF_p,
+	    IDEVH_p,
+	    HELPF_l);
+
+}
+	      
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 35;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "SAPHELP";
+const char helpPtrsFN[] = "SAPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "SAP",
+    "Help",
+    "Quit",
+    "Open a database",
+    "Edit contig",
+    "Display a contig",
+    "List a text file",
+    "Direct output to disk",
+    "Calculate a consensus",
+    "Screen edit",
+    "Clear graphics",
+    "Clear text",
+    "Draw ruler",
+    "Use cross hair",
+    "Change margins",
+    "Plot map",
+    "Label diagram",
+    "Screen against restriction enzymes",
+    "Screen against vector",
+    "Check consistency",
+    "Auto assemble",
+    "Enter new gel reading",
+    "Join contigs",
+    "Complement a contig",
+    "Copy database",
+    "Show relationships",
+    "Alter relationships",
+    "set parameters",
+    "Highlight disagreements",
+    "Examine quality",
+    "Auto edit a contig",
+    "Type in gel readings",
+    "Extract gel readings",
+    "Plot single contig",
+    "Plot all contigs",
+    "Find internal joins",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/xsipSpec.c b/src/staden/xsipSpec.c
new file mode 100644
index 0000000..83693cc
--- /dev/null
+++ b/src/staden/xsipSpec.c
@@ -0,0 +1,226 @@
+/*
+    Title:       xsipSpec
+
+    File: 	 xsipSpec.c
+    Purpose:	 Definitions specific to the `xsip' program
+    Last update: Tue Jun 19 1990
+*/
+
+
+/*
+    This module contains data specific to the menus and help system
+    for the `xmep' program.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "progSpec.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/MenuButton.h>
+#include <X11/Xaw/SimpleMenu.h>
+#include <X11/Xaw/Sme.h>
+#include <X11/Xaw/SmeBSB.h>
+
+
+
+
+/* ---- Types ---- */
+
+
+typedef struct
+{   String name;   /* Name of function */
+    int    number; /* Number of function */
+} MenuData, *MenuDataList;
+
+
+
+
+/* ---- Static variables ---- */
+
+    
+static MenuData general_menu_data[] =
+{   {"Read new sequence",                       3},
+    {"Redefine active region",                  4},
+    {"List the sequences",                      5},
+    {"List a text file",                        6},
+    {"Direct text output to disc",              7},
+    {"Write active sequence to disc",           8},
+    {"Edit the sequences",                      9},
+    {"Complement sequences",                    29},
+};
+  
+  
+static MenuData screen_menu_data[] =
+{   {"Clear graphics",   10},
+    {"Clear text",       11},
+    {"Draw a ruler",     12},
+    {"Use cross hair",   13},
+    {"Reposition plots", 14},
+    {"Label diagram",    15},
+    {"Display a map",    16},
+    {"Draw a /",         27},
+};
+
+
+static  MenuData parameters_menu_data[] =
+{   {"Set span length",                 20},
+    {"Set proportional score",          21},
+    {"Set identities score",            22},
+    {"Calculate expected scores",       23},
+    {"Calculate observed scores",       24},
+    {"Show current parameter settings", 25},
+    {"Switch main diagonal",            30},
+    {"Switch identities",               31},
+    {"Change score matrix",             32},
+    {"Set number of sd for Quickscan",  33},
+    {"Set gap penalties",               34},
+};
+
+
+static  MenuData comparison_menu_data[] =
+{   {"Apply identities algorithm",   17},
+    {"Apply proportional algorithm", 18},
+    {"List matching spans",          19},
+    {"Quick scan",                   26},
+    {"Align sequences",              28},
+};
+
+
+
+
+/* --- Callback functions ---- */
+
+static XtCallbackProc externalCallbackProc;
+static XtPointer      externalClient_data;
+
+static void MenuSelectCallback(Widget w, XtPointer i, XtPointer junk)
+/*
+    Pass the menu item callback back to `externalCallbackProc'
+    which the user supplied to `CreateProgMenus'.
+*/
+{   externalCallbackProc(w, externalClient_data, i);
+}
+
+
+
+
+/* ---- Private functions ---- */
+
+
+static void CreateMenu(Widget parentWid, String menuButtonName,
+		       MenuDataList md, Cardinal num_md)
+{   Widget buttonWid, menuWid;
+    int i;
+
+    /*
+        Create the (empty) menu button
+    */
+    buttonWid = XtCreateManagedWidget(menuButtonName, menuButtonWidgetClass,
+				      parentWid, NULL, 0);
+    menuWid = XtCreatePopupShell("menu", simpleMenuWidgetClass, buttonWid,
+				 NULL, 0);
+
+    /*
+        Put the individual items in.
+	When selected, each entry will generate a callback with
+	its associated number.
+    */
+    for (i = 0; i < (int) num_md ; i++)
+    {	Widget entryWid = XtCreateManagedWidget(md[i].name, smeBSBObjectClass,
+						menuWid, NULL, 0);
+	XtAddCallback(entryWid, XtNcallback, MenuSelectCallback,
+		      (XtPointer) md[i].number);
+    }
+
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void CreateProgMenus(Widget parentWid,
+		     XtCallbackProc cbp, XtPointer client_data)
+/*
+    Install the menus for this program into `parentWid'.
+    When pressed, each item (which is the name of a function) will
+    call `cbp' providing `client_data' and the number of the function
+    as `call_data'.
+*/
+{   externalCallbackProc = cbp;
+    externalClient_data = client_data;
+
+    CreateMenu(parentWid, "General",
+	       general_menu_data, XtNumber(general_menu_data));
+    CreateMenu(parentWid, "Screen",
+	       screen_menu_data, XtNumber(screen_menu_data));
+    CreateMenu(parentWid, "Set parameters",
+	       parameters_menu_data, XtNumber(parameters_menu_data));
+    CreateMenu(parentWid, "Comparison",
+	       comparison_menu_data, XtNumber(comparison_menu_data));
+}
+
+
+
+
+const int botHelpOpt = 0;
+const int topHelpOpt = 34;
+/*
+    The range of option numbers for the help system.
+*/
+
+
+const char helpTextFN[] = "SIPHELP";
+const char helpPtrsFN[] = "SIPHPNT";
+/*
+    File names for the help text and pointer files.
+*/
+
+
+const char *helpTopics[] =
+{   "SIP",
+    "Help",
+    "Quit",
+    "read a new sequence",
+    "define active region",
+    "list the sequence",
+    "list a text file",
+    "direct output to disk",
+    "write active sequence to disk",
+    "edit the sequences",
+    "clear graphics screen",
+    "clear text screen",
+    "draw a ruler",
+    "use cross hair",
+    "reposition plots",
+    "label diagram",
+    "display a map",
+    "apply identities algorithm",
+    "apply proportional algorithm",
+    "list matching spans",
+    "set span length",
+    "set proportional score",
+    "set identities score",
+    "calculate expected scores",
+    "calculate observed scores",
+    "show current parameter settings",
+    "quick scan",
+    "draw a /",
+    "align the sequences",
+    "complement the sequences",
+    "switch main diagonal",
+    "switch identities",
+    "change score matrix",
+    NULL,
+};
+/*
+    Help topics, indexed in C between 0 and topHelpOpt-botHelpOpt
+    but referring to topics botHelpOpt to topHelpOpt
+*/
diff --git a/src/staden/xspec.f b/src/staden/xspec.f
new file mode 100644
index 0000000..d1c0428
--- /dev/null
+++ b/src/staden/xspec.f
@@ -0,0 +1,11 @@
+C  X specific bits
+C   AUTHOR: RODGER STADEN
+      SUBROUTINE INITRS()
+      CALL INITX()
+      END
+      INTEGER FUNCTION XVERSN()
+C
+C set xversn to true for x version
+C
+      XVERSN = 1
+      END
diff --git a/src/ted/CRN b/src/ted/CRN
new file mode 100644
index 0000000..990a9c4
--- /dev/null
+++ b/src/ted/CRN
@@ -0,0 +1,11 @@
+ted - trace editor
+
+Copyright 1991 Timothy Gleeson, LaDeana Hillier and Simon Dear
+
+PLEASE NOTE:
+	373A sequence data file formats are the exclusive property
+	of Applied Biosystems, Inc.
+
+	ALF sequence data file formats are the exclusive property
+	of Pharmacia LKB Biotechnology, Inc.
+
diff --git a/src/ted/Graph.c b/src/ted/Graph.c
new file mode 100644
index 0000000..0e69bc7
--- /dev/null
+++ b/src/ted/Graph.c
@@ -0,0 +1,283 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Graph.c,v 1.2 88/10/25 17:40:25 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#include <X11/IntrinsicP.h>
+#include <X11/StringDefs.h>
+#include "GraphP.h"
+
+
+#include <stdio.h>
+
+/* ---- Forward declarations ---- */
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args);
+static void Destroy(Widget w);
+static void Redisplay(Widget w, XEvent *event, Region region);
+static void Resize(Widget w);
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params);
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args);
+
+
+
+
+/* ---- Private data ---- */
+
+
+static XtResource resources[] = {
+#define  offset(field) XtOffset(GraphWidget, graph.field)
+#define coffset(field) XtOffset(Widget,      core.field)
+    /* {name, class, type, size, offset, default_type, default_addr}, */
+    { XtNcallback, XtCCallback, XtRCallback, (Cardinal)sizeof(XtCallbackList),
+	  offset(input_callback), XtRCallback, NULL },
+    { XtNdimBackground, XtCBackground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(dimBackground), XtRString, "XtDefaultBackground" },
+    { XtNexposeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(expose_callback), XtRCallback, NULL },
+    { XtNfont, XtCFont, XtRFontStruct, (Cardinal)sizeof(XFontStruct*),
+	  offset(font), XtRString, XtDefaultFont },
+    { XtNgraphColour1, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour1), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour2, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour2), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour3, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour3), XtRString, "XtDefaultForeground" },
+    { XtNgraphColour4, XtCForeground, XtRPixel, (Cardinal)sizeof(Pixel),
+	  offset(graphColour4), XtRString, "XtDefaultForeground" },
+    { XtNheight, XtCHeight, XtRDimension, (Cardinal)sizeof(Dimension),
+	  coffset(height), XtRImmediate, (caddr_t)1},
+    { XtNwidth, XtCWidth, XtRDimension,  (Cardinal)sizeof(Dimension),
+	  coffset(width), XtRImmediate, (caddr_t)1},
+    { XtNresizeCallback, XtCCallback, XtRCallback,
+	  (Cardinal)sizeof(XtCallbackList),
+	  offset(resize_callback), XtRCallback, NULL },
+#undef  offset
+#undef coffset
+};
+
+
+static XtActionsRec actions[] =
+{
+  /* {name, procedure}, */
+    {"input",	InputAction},
+};
+
+
+static char translations[] =
+"<Key>:		input()	\n\
+ <BtnDown>:     input() \
+";
+
+
+GraphClassRec graphClassRec = {
+  { /* core fields initial values */
+    /* superclass               */      (WidgetClass) &simpleClassRec,
+    /* class_name		*/	"Graph",
+    /* widget_size		*/	(Cardinal)sizeof(GraphRec),
+    /* class_initialize		*/	NULL,
+    /* class_part_initialize	*/	NULL,
+    /* class_inited		*/	FALSE,
+    /* initialize		*/	Initialize,
+    /* initialize_hook		*/	NULL,
+    /* realize			*/	XtInheritRealize,
+    /* actions			*/	actions,
+    /* num_actions		*/	XtNumber(actions),
+    /* resources		*/	resources,
+    /* num_resources		*/	XtNumber(resources),
+    /* xrm_class		*/	NULLQUARK,
+    /* compress_motion		*/	TRUE,
+    /* compress_exposure	*/	XtExposeCompressMaximal,
+    /* compress_enterleave	*/	TRUE,
+    /* visible_interest		*/	FALSE,
+    /* destroy			*/	Destroy,
+    /* resize			*/	Resize,
+    /* expose			*/	Redisplay,
+    /* set_values		*/	SetValues,
+    /* set_values_hook		*/	NULL,
+    /* set_values_almost	*/	XtInheritSetValuesAlmost,
+    /* get_values_hook		*/	NULL,
+    /* accept_focus		*/	NULL,
+    /* version			*/	XtVersion,
+    /* callback_private		*/	NULL,
+    /* tm_table			*/	translations,
+    /* query_geometry		*/	XtInheritQueryGeometry,
+    /* display_accelerator	*/	XtInheritDisplayAccelerator,
+    /* extension		*/	NULL
+  },
+  { /* Simple class initial values */
+    /* change_sensitive         */      XtInheritChangeSensitive
+  },
+  { /* Graph class initial values */
+    /* empty			*/	0
+  }
+};
+
+WidgetClass graphWidgetClass = (WidgetClass)&graphClassRec;
+
+
+
+
+/* ---- Private procedures ---- */
+
+
+static void Initialize(Widget request, Widget new,
+		       ArgList args, Cardinal *num_args)
+{   GraphWidget    gw = (GraphWidget)new;
+    unsigned long  xGCMask;
+    XGCValues      xGCValues;
+    Display       *display;
+    int            screen;
+
+    display  = XtDisplay(request);
+    screen   = DefaultScreen(display);
+
+    if (DisplayPlanes(display,screen)==1)
+    {   /* We are on a one-plane monochrome display.
+           Use dashes to make lines distinguishable.
+        */
+        xGCMask = GCForeground | GCBackground | GCLineStyle;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineSolid;
+        gw->graph.graphGC1 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 1;
+        gw->graph.graphGC2 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 2;
+        gw->graph.graphGC3 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCMask = GCForeground | GCBackground | GCLineStyle | GCDashList;
+        xGCValues.foreground = BlackPixel(display,screen);
+        xGCValues.background = WhitePixel(display,screen);
+        xGCValues.line_style = LineOnOffDash;
+        xGCValues.dashes     = 4;
+        gw->graph.graphGC4 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+	gw->graph.graphColour1 = BlackPixel(display,screen);
+	gw->graph.graphColour2 = BlackPixel(display,screen);
+	gw->graph.graphColour3 = BlackPixel(display,screen);
+	gw->graph.graphColour4 = BlackPixel(display,screen);
+    }
+    else
+    {   /* Some sort of multi-plane display.
+        */
+        xGCMask = GCForeground | GCBackground;
+        xGCValues.background = gw->core.background_pixel;
+
+        xGCValues.foreground = gw->graph.graphColour1;
+        gw->graph.graphGC1 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour2;
+        gw->graph.graphGC2 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour3;
+        gw->graph.graphGC3 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+
+        xGCValues.foreground = gw->graph.graphColour4;
+        gw->graph.graphGC4 = XtGetGC((Widget) gw, xGCMask, &xGCValues);
+    }
+}
+
+
+static void Destroy (Widget w)
+{    GraphWidget gw = (GraphWidget) w;
+
+     XtDestroyGC(gw->graph.graphGC1);
+     XtDestroyGC(gw->graph.graphGC2);
+     XtDestroyGC(gw->graph.graphGC3);
+     XtDestroyGC(gw->graph.graphGC4);
+}
+
+
+static void Redisplay(Widget w, XEvent *event, Region region)
+{   XtCallCallbacks(w, XtNexposeCallback, (caddr_t)region);
+}
+
+
+static void Resize(Widget w)
+{   XtCallCallbacks(w, XtNresizeCallback, 0);
+}
+
+
+static void InputAction(Widget w, XEvent *event,
+			String *params,Cardinal *num_params)
+{   XtCallCallbacks(w, XtNcallback, (caddr_t)event);
+}
+
+
+static Boolean SetValues(Widget current, Widget request, Widget new,
+			 ArgList args, Cardinal *num_args)
+{   GraphWidget g_old = (GraphWidget) current;
+    GraphWidget g_new = (GraphWidget) new;
+
+    if	(DisplayPlanes(XtDisplay(new), DefaultScreen(XtDisplay(new))) == 1 ||
+	 !XtIsRealized(new))
+    {   /* Silly to change, or can't change.
+           Don't generate an expose.
+        */
+        return(False);
+    }
+    else
+    {   if (g_old->graph.graphColour1 != g_new->graph.graphColour1)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC1,
+			   g_new->graph.graphColour1);
+
+        if (g_old->graph.graphColour2 != g_new->graph.graphColour2)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC2,
+			   g_new->graph.graphColour2);
+
+        if (g_old->graph.graphColour3 != g_new->graph.graphColour3)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC3,
+			   g_new->graph.graphColour3);
+
+        if (g_old->graph.graphColour4 != g_new->graph.graphColour4)
+	    XSetForeground(XtDisplay(new),
+			   g_new->graph.graphGC4,
+			   g_new->graph.graphColour4);
+
+        return(True);
+    }
+}
+
+
+
+
+/* ---- Exported procedures ---- */
+
+
+GC GraphGC1(Widget w)
+{ return((GraphWidget)w)->graph.graphGC1;
+}
+
+
+GC GraphGC2(Widget w)
+{ return((GraphWidget)w)->graph.graphGC2;
+}
+
+
+GC GraphGC3(Widget w)
+{ return((GraphWidget)w)->graph.graphGC3;
+}
+
+
+GC GraphGC4(Widget w)
+{ return((GraphWidget)w)->graph.graphGC4;
+}
diff --git a/src/ted/Graph.h b/src/ted/Graph.h
new file mode 100644
index 0000000..99fe869
--- /dev/null
+++ b/src/ted/Graph.h
@@ -0,0 +1,81 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: Graph.h,v 1.2 88/10/25 17:22:09 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _Graph_h
+#define _Graph_h
+
+/****************************************************************
+ *
+ * Graph widget
+ *
+ ****************************************************************/
+
+#include <X11/Xaw/Simple.h>
+
+/* Resources:
+
+ Name		     Class		RepType		Default Value
+ ----		     -----		-------		-------------
+ background	     Background		Pixel		XtDefaultBackground
+ border		     BorderColor	Pixel		XtDefaultForeground
+ borderWidth	     BorderWidth	Dimension	1
+ callback            Callback           Callback        NULL
+ cursor              Cursor             Cursor          None
+ destroyCallback     Callback		Pointer		NULL
+ dimBackground	     Background		Pixel		XtDefaultBackground
+ exposeCallback      Callback           Callback        NULL
+ font                Font               XFontStruct*    XtDefaultFont
+ graphColour1        Foreground         Pixel           XtDefaultForeground
+ graphColour2        Foreground         Pixel           XtDefaultForeground
+ graphColour3        Foreground         Pixel           XtDefaultForeground
+ graphColour4        Foreground         Pixel           XtDefaultForeground
+ height		     Height		Dimension	1
+ insensitiveBorder   Insensitive	Pixmap		Gray
+ mappedWhenManaged   MappedWhenManaged	Boolean		True
+ resizeCallback      Callback           Callback        NULL
+ sensitive	     Sensitive		Boolean		True
+ width		     Width		Dimension	1
+ x		     Position		Position	0
+ y		     Position		Position	0
+
+*/
+
+/* define any special resource names here that are not in <X11/StringDefs.h> */
+
+#define XtNgraphResource		"graphResource"
+#define XtCGraphResource		"GraphResource"
+
+#define XtNdimBackground                "dimBackground"
+#define XtNexposeCallback               "exposeCallback"
+#define XtNgraphColour1                 "graphColour1"
+#define XtNgraphColour2                 "graphColour2"
+#define XtNgraphColour3                 "graphColour3"
+#define XtNgraphColour4                 "graphColour4"
+#define XtNresizeCallback               "resizeCallback"
+
+
+/* declare specific GraphWidget class and instance datatypes */
+
+typedef struct _GraphClassRec  *GraphWidgetClass;
+typedef struct _GraphRec       *GraphWidget;
+
+
+/* declare the class constant */
+
+extern WidgetClass graphWidgetClass;
+
+
+/* ---- Exported procedures ---- */
+
+extern Pixel GraphColour1(Widget w);
+extern Pixel GraphColour2(Widget w);
+extern Pixel GraphColour3(Widget w);
+extern Pixel GraphColour4(Widget w);
+extern GC    GraphGC1(Widget w);
+extern GC    GraphGC2(Widget w);
+extern GC    GraphGC3(Widget w);
+extern GC    GraphGC4(Widget w);
+
+#endif /* _Graph_h */
diff --git a/src/ted/GraphP.h b/src/ted/GraphP.h
new file mode 100644
index 0000000..07c673f
--- /dev/null
+++ b/src/ted/GraphP.h
@@ -0,0 +1,54 @@
+#include <X11/copyright.h>
+
+/* $XConsortium: GraphP.h,v 1.2 88/10/25 17:37:59 swick Exp $ */
+/* Copyright	Massachusetts Institute of Technology	1987, 1988 */
+
+#ifndef _GraphP_h
+#define _GraphP_h
+
+#include "Graph.h"
+/* include superclass private header file */
+#include <X11/Xaw/SimpleP.h>
+
+
+/* define unique representation types not found in <X11/StringDefs.h> */
+
+#define XtRGraphResource		"GraphResource"
+
+typedef struct {
+    int empty;
+} GraphClassPart;
+
+typedef struct _GraphClassRec {
+    CoreClassPart	core_class;
+    SimpleClassPart     simple_class;
+    GraphClassPart	graph_class;
+} GraphClassRec;
+
+extern GraphClassRec graphClassRec;
+
+typedef struct {
+    /* resources */
+    Pixel dimBackground;
+    XFontStruct* font;
+    Pixel graphColour1;
+    Pixel graphColour2;
+    Pixel graphColour3;
+    Pixel graphColour4;
+    XtCallbackList expose_callback;
+    XtCallbackList resize_callback;
+    XtCallbackList input_callback;
+    /* private state */
+    GC graphGC1;
+    GC graphGC2;
+    GC graphGC3;
+    GC graphGC4;
+} GraphPart;
+
+typedef struct _GraphRec {
+    CorePart	core;
+    SimplePart  simple;
+    GraphPart	graph;
+} GraphRec;
+
+#endif /* _GraphP_h */
diff --git a/src/ted/README b/src/ted/README
new file mode 100644
index 0000000..670da50
--- /dev/null
+++ b/src/ted/README
@@ -0,0 +1,32 @@
+ted - trace editor
+------------------
+This is the source directory for the ted trace editor.
+As well as the sources you should find a makefile,
+a manual page "ted.manpage", documentation  "ted.tex"
+(this preprint is still in review and should be treated
+as confidential until it has been accepted for publication)
+a help file "ted.help" and a default resources file "Xted".
+
+The "ted.help" and "Xted" files should be placed in the
+"/usr/lib/X11/app-defaults" directory. The "ted.manpage"
+file should be placed in "/usr/man/manl/ted.l", or wherever
+local manual pages are stored on your system.
+
+The "ted" application can be constructed by simply typing
+"make ted". The "ted" application can then be placed in
+a generally accessible directory of binaries, perhaps
+"/usr/local/bin".
+
+
+
+sample files
+------------
+
+A sample ABI file is also included in this directory, trace.abi, which
+ can be opened using the following command:
+		ted -ABI trace.abi
+
+A sample ALF file is also included in this directory, trace.alf, which
+ can be opened using the following command:
+		ted -ALF trace.alf
+
diff --git a/src/ted/Xted b/src/ted/Xted
new file mode 100644
index 0000000..d01cbb9
--- /dev/null
+++ b/src/ted/Xted
@@ -0,0 +1,178 @@
+!
+! Resources for the "ted" trace editor (Dec 1990)
+!
+
+! default magnification
+Xted*magnif: 30
+! default enzyme string to search for as a left cutoff
+Xted*enz: CAGGTC
+
+! The application
+Xted.minWidth:  550
+Xted.minHeight: 250
+
+! Labels have no borders and are not resized
+Xted*Label.borderWidth: 0
+Xted*Label.top:         chainTop
+Xted*Label.bottom:      chainTop
+Xted*Label.left:        chainLeft
+Xted*Label.right:       chainLeft
+
+! Buttons are not resized
+Xted*Command.top:         chainTop
+Xted*Command.bottom:      chainTop
+Xted*Command.left:        chainLeft
+Xted*Command.right:       chainLeft
+
+! All the toggles are one-of-many
+Xted*Toggle.Translations: #override \n\
+         <EnterWindow>:        highlight(Always) \n\
+         <LeaveWindow>:        unhighlight()     \n\
+         <Btn1Down>,<Btn1Up>: set() notify()
+
+Xted.mainForm.title.label: Trace editor
+Xted.mainForm.version.label: \
+Dec 1990
+Xted.mainForm.seqNamePrompt.label: Input:
+Xted.mainForm.seqNamePrompt.height: 13
+Xted.mainForm.seqName.label:       NONE
+Xted.mainForm.seqName.resizable:   True
+Xted.mainForm.seqName.height: 13
+Xted.mainForm.search.label: Search
+Xted.mainForm.search.height: 13
+Xted.mainForm.NorigBasesPrompt.label: Bases:
+Xted.mainForm.NorigBases.label:
+Xted.mainForm.NorigBases.resizable:   True
+Xted.mainForm.strand.label: Strand: Top
+Xted.mainForm.strand.resizable: True
+
+! Display forms have no border
+Xted.mainForm.Form.borderWidth:     0
+
+ ! The edit mode toggles
+Xted.mainForm.Toggle.top:    chainTop
+Xted.mainForm.Toggle.bottom: chainTop
+Xted.mainForm.Toggle.left:   chainLeft
+Xted.mainForm.Toggle.right:  chainLeft
+Xted.mainForm.edmodelab.label: Mode:
+Xted.mainForm.adjustL.label:   Adj left cut
+Xted.mainForm.edSeq.label:     Edit seq
+Xted.mainForm.adjustR.label:   Adj right cut
+Xted.mainForm.edSeq.state:     True
+Xted.mainForm.adjustL.height:   13
+Xted.mainForm.edSeq.height:     13
+Xted.mainForm.adjustR.height:   13
+
+
+! The magnification slider
+Xted.mainForm.maglab.top:    chainTop
+Xted.mainForm.maglab.bottom: chainTop
+Xted.mainForm.maglab.left:   chainLeft
+Xted.mainForm.maglab.right:  chainLeft
+Xted.mainForm.maglab.label: Mag:
+Xted.mainForm.maglab.height: 13
+Xted.mainForm.magscr.top:    chainTop
+Xted.mainForm.magscr.bottom: chainTop
+Xted.mainForm.magscr.left:   chainLeft
+Xted.mainForm.magscr.right:  chainLeft
+Xted.mainForm.magscr.orientation: horizontal
+Xted.mainForm.magscr.length: 100
+Xted.mainForm.magscr.height: 13
+
+Xted.mainForm.scaleDown.label: Scale down
+Xted.mainForm.scaleDown.height: 13
+Xted.mainForm.scaleUp.label: Scale up
+Xted.mainForm.scaleUp.height: 13
+Xted.mainForm.switch.label: Switch
+Xted.mainForm.switch.height: 13
+
+! The main display: a viewport containing a form containing graphs
+Xted.mainForm.viewport.top:    chainTop
+Xted.mainForm.viewport.bottom: chainBottom
+Xted.mainForm.viewport.left:   chainLeft
+Xted.mainForm.viewport.right:  chainRight
+Xted.mainForm.viewport.allowHoriz: True
+Xted.mainForm.viewport.forcebars:  True
+Xted.mainForm.viewport.width:      700
+Xted.mainForm.viewport.vpForm.Graph.resizable: True
+Xted.mainForm.viewport.vpForm.Graph.top:    chainTop
+Xted.mainForm.viewport.vpForm.Graph.bottom: chainTop
+Xted.mainForm.viewport.vpForm.Graph.left:   chainLeft
+Xted.mainForm.viewport.vpForm.Graph.right:  chainRight
+Xted.mainForm.viewport.vpForm.trace.top:    chainTop
+Xted.mainForm.viewport.vpForm.trace.bottom: chainBottom
+Xted.mainForm.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+! The text graph heights are dynamically set to accomodate the font
+Xted.mainForm.viewport.vpForm.trace.height: 150
+! The graphs are dynamically initially set to fill the viewport
+Xted.mainForm.viewport.vpForm.Graph.dimBackground: grey
+Xted.mainForm.viewport.vpForm.Graph.graphColour1:  green
+Xted.mainForm.viewport.vpForm.Graph.graphColour2:  purple
+Xted.mainForm.viewport.vpForm.Graph.graphColour3:  black
+Xted.mainForm.viewport.vpForm.Graph.graphColour4:  red
+
+! Dummy widget - we only care about its width
+Xted.mainForm.dummy.top:    chainTop
+Xted.mainForm.dummy.bottom: chainBottom
+Xted.mainForm.dummy.left:   chainLeft
+Xted.mainForm.dummy.right:  chainRight
+Xted.mainForm.dummy.width:  700
+
+
+! Dialogues
+Xted*search.ioForm.ioPrompt.label:  Search string or base number
+Xted*input.ioForm.ioPrompt.label:   Input a file
+Xted*output.ioForm.ioPrompt.label:  Save a file
+Xted*check.ioForm.ioPrompt.label:   \
+The existing sequence has been altered.\n\
+Do you really want to overwrite it?
+Xted*quit.ioForm.ioPrompt.label:    \
+The existing sequence has been altered.\n\
+Do you really want to quit?
+Xted*check.ioForm..translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*quit.ioForm..translations:    #override \n\
+         <Key>Return: CRAction()
+
+Xted*ioForm.namePrompt.label:       File name:
+Xted*ioForm.nameVal.width:          250
+Xted*ioForm.nameVal.translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*ioForm.next.label: 
+Xted*ioForm.next.resizable:         True
+Xted*ioForm.formatPrompt.label:     File format:
+Xted*ioForm.formatBox.borderWidth:  0
+Xted*ioForm.formatBox.orientation:  horizontal
+Xted*ioForm.formatBox.plainFmt.label: Plain
+Xted*ioForm.formatBox.abiFmt.label:   ABI
+Xted*ioForm.formatBox.abiFmt.state:   True
+Xted*ioForm.formatBox.alfFmt.label:   ALF
+Xted*ioForm.formatBox.alfFmt.state:   False
+Xted*ioForm.formatBox.scfFmt.label:   SCF
+Xted*ioForm.formatBox.scfFmt.state:   False
+Xted*ioForm.formatBox.bottomFmt.label: Bottom
+Xted*ioForm.formatBox.topFmt.label:   Top
+Xted*ioForm.formatBox.topFmt.state:   True
+
+! The `default' button for the dialogue has a wider border
+Xted*input.ioForm.OK.borderWidth:     3
+Xted*output.ioForm.OK.borderWidth:    3
+Xted*check.ioForm.Cancel.borderWidth: 3
+Xted*quit.ioForm.Cancel.borderWidth:  3
+
+! Help shell widget
+Xted.Help.width:  525
+Xted.Help.height: 200
+Xted.Help.minWidth: 525
+Xted.Help.maxWidth: 525
+Xted.Help.minHeight: 150
+Xted.Help.title: ted help
+Xted.Help.form.rmButton.label:  Remove help
+Xted.Help.form.rmButton.top:    chainTop
+Xted.Help.form.rmButton.bottom: chainTop
+Xted.Help.form.rmButton.left:   chainLeft
+Xted.Help.form.rmButton.right:  chainLeft
+Xted.Help.form.text.top:    chainTop
+Xted.Help.form.text.bottom: chainBottom
+Xted.Help.form.text.left:   chainLeft
+Xted.Help.form.text.right:  chainRight
diff --git a/src/ted/basesDisplay.c b/src/ted/basesDisplay.c
new file mode 100644
index 0000000..79ada06
--- /dev/null
+++ b/src/ted/basesDisplay.c
@@ -0,0 +1,1014 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Wednesday 8 April 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    A caret is XOR drawn in the editable graph on top of (i.e. after)
+    all characters have been drawn.    
+
+    Changes to this module by lfw:
+        module displayEdBases and displayBases
+          added int baseNum as a parameter so that
+        the user could specify a baseNum via the
+        command line baseNum parameter or via
+        specifying a string to search for ( if
+        you specify a string to search for
+        ) and have that be where the caret
+        is originally positioned.  Before this
+        modification, the caret was always
+        initially positioned at base 0.
+    Changes made by sd:
+	leftCutoffPoint: handle NULLPoint
+    Changes made by SD:
+	Support monochrome displays when marking cutoffs
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "basesDisplay.h"
+#include "display.h"
+
+#include "Graph.h"
+#include "seq.h"     /* IMPORT: Seq, NULLSeq, getNPoints */
+
+#include <ctype.h>   /* IMPORT: toupper */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Viewport.h>
+
+
+
+
+/* ---- Constants ---- */
+
+/* Caret stuff */
+#define caretWidth  ((Dimension) 6)
+#define caretHeight ((Dimension) 3)
+static char caretBits[] = {0x0c, 0x1e, 0x33};
+
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static int NorigBases, NedBases;
+
+
+/* Font information */
+static Dimension charWidth;
+static Dimension halfCharWidth;
+static Dimension charVOffset; /* Of char baseline from top of graph=ascent+1 */
+
+/* The two graphs */
+static Widget origSeqWid;
+static Widget edSeqWid;
+/* the numbers above the two graphs*/
+static Widget baseNumWid;
+
+static Dimension graphWidth;     /* Current width of the graph */
+static Dimension graphHeight;    /* Height of the graph */
+static Dimension plotEdgeOffset;
+static Dimension plotWidth;      /* Current width of the plot */
+
+static int leftCutoff, rightCutoff;
+
+static GC Agc, Cgc, Ggc, Tgc;
+static Pixel normBackground, dimBackground;
+
+/* monochrome stippling */
+static GC greygc;
+static int planes;
+
+/* Caret stuff */
+/*
+    The caret's position is represented by a base number, caretBN, (the
+    point at which the corresponding base occurs is maintained as caretP)
+    and is drawn to the right of that base. To allow insertion at the left
+    end and deletion at the right end, it can range from -1..NBase-1.
+*/
+static GC caretGC;
+static Pixmap caretPixmap = (Pixmap) 0;
+static int caretBN;
+static int caretP;
+
+
+
+
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(int point)
+{   return(((point * plotWidth) / (NPoints-1)) + plotEdgeOffset);
+}
+
+
+int pixelToPoint(int pixel)
+{   return(((pixel-plotEdgeOffset) * (NPoints-1)) / plotWidth);
+}
+
+
+static int leftCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NedBases = getNBases(currSeq, EdBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum);
+    pR = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+static void writeBase(Widget wid, int baseNum, Widget bnwid)
+{   WhichBases which = (wid == origSeqWid) ? OrigBases : EdBases;
+    char       base  = getBase(currSeq, which, baseNum);
+    int        point = baseNumToPoint(currSeq, which, baseNum);
+    GC gc;
+    char lbl[10];
+
+    /* Get the correct GC */
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+        default:  gc=Ggc; break;
+    }
+
+    XDrawImageString(XtDisplay(wid),
+		     XtWindow(wid),
+		     gc,
+		     pointToPixel(point)-halfCharWidth, charVOffset,
+		     &base, 1);
+/*    if (currSeq->bottom)
+      sprintf(lbl,"%d",(currSeq->NorigBases-1)-baseNum);
+    else*/
+      sprintf(lbl,"%d",baseNum);
+    if ((wid == origSeqWid) &&  (baseNum%10 == 0)) {
+      XDrawString(XtDisplay(bnwid),
+		     XtWindow(bnwid),
+		     gc,
+		     pointToPixel(point)-halfCharWidth, charVOffset,
+		     lbl,strlen(lbl));
+    }
+}
+
+
+
+
+static void caretExpose(int *pix0, /* First exposed pixel */
+			int *pixN) /* Last exposed pixel */
+/*
+    Extend the expose region, if necessary, so that the caret
+    is either completely, or not at all exposed.
+*/
+{   int caretLwb, caretUpb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+    caretUpb = caretLwb + caretWidth;
+
+    /* Return if the caret is fully outside the exposed region */
+    if ((caretUpb<*pix0) || (caretLwb>*pixN)) return;
+
+    /* Return if the caret is fully inside the exposed region */
+    if ((caretLwb>=*pix0) && (caretUpb<=*pixN)) return;
+
+    /* Extend the exposed region */
+    *pix0 = caretLwb<*pix0 ? caretLwb : *pix0;
+    *pixN = caretUpb>*pixN ? caretUpb : *pixN;
+    XClearArea(XtDisplay(edSeqWid),
+	       XtWindow(edSeqWid),
+	       caretLwb, 0,
+	       caretWidth, graphHeight,
+	       False);
+}
+
+
+
+
+static void drawCaret()
+{   int caretLwb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+
+    XCopyPlane(XtDisplay(edSeqWid),
+	       caretPixmap, XtWindow(edSeqWid),
+	       caretGC,
+	       0, 0,
+	       (unsigned int) caretWidth, (unsigned int) caretHeight,
+	       caretLwb, charVOffset,
+	       1);
+}
+
+
+
+
+static void drawExposedCaret(int p0, /* First exposed pixel */
+			     int pN) /* Last exposed pixel */
+/*
+    Draw the caret, if it is within the exposed points.
+*/
+{   int caretLwb;
+
+    caretLwb = pointToPixel(caretP)+halfCharWidth;
+
+    if ((caretLwb<=pN) && (caretLwb>=p0))
+        drawCaret();
+    else
+        return;
+}
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN;    /* Affected region: pixels */
+    int p0, pN;    /* Affected region: points */
+    int b, b0, bN; /* Affected region: bases */
+    int leftCutoffP, rightCutoffP;
+    int NBases;
+    int whichBases;
+
+    if (currSeq == NULLSeq) return;
+
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+    leftCutoffP  = leftCutoffPoint(leftCutoff);
+    rightCutoffP = rightCutoffPoint(rightCutoff);
+    NBases     = (wid==origSeqWid) ? NorigBases : NedBases;
+    whichBases = (wid==origSeqWid) ? OrigBases  : EdBases;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+
+    /* For the editable sequence, ensure the caret was not half exposed */
+    if (wid == edSeqWid) caretExpose(&x0, &xN);
+
+
+    /* Convert to affected points after trimming to the plot proper */
+    p0=pixelToPoint((x0<plotEdgeOffset)          ?plotEdgeOffset          :x0);
+    pN=pixelToPoint((xN>plotWidth+plotEdgeOffset)?plotWidth+plotEdgeOffset:xN);
+    if (pN<NPoints-1) pN++;
+
+
+    if (p0<leftCutoffP)
+    /*
+        Some of the exposed region is in the left cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = p0;
+       int lastP  = (pN<leftCutoffP) ? pN : leftCutoffP;
+
+       if (whichBases == EdBases)
+       {   int pix0 = pointToPixel(firstP);
+	   int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, graphHeight);
+       } else {
+           XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				dimBackground);
+           XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		      pix0, 0,
+		      pixN-pix0, graphHeight,
+		      False);
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				normBackground);
+       }
+       }
+    }
+      
+
+    if (pN>rightCutoffP)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = (p0>rightCutoffP) ? p0 : rightCutoffP;
+       int lastP  = pN;
+
+       if (whichBases == EdBases)
+       {   int pix0 = pointToPixel(firstP);
+	   int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, graphHeight);
+       } else {
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				dimBackground);
+	   XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		      pix0, 0,
+		      pixN-pix0, graphHeight,
+		      False);
+	   XSetWindowBackground(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+				normBackground);
+       }
+       }
+    }
+
+
+    /*
+        Now draw the affected bases.
+    */
+
+    /* The first base is to the left of the exposed region */
+    b0 = pointToBaseNum(currSeq, whichBases, p0);
+    b0 =   (b0 == NULLBaseNum) ? NBases-1
+         : (b0>0)              ? b0-1
+	 :                       0;
+    /* The last base is to the right of the exposed region */
+    bN = pointToBaseNum(currSeq, whichBases, pN);
+    bN = (bN == NULLBaseNum) ? NBases-1 : bN;
+
+    if (b0<leftCutoff)
+    /*
+        Some of the bases are in the left cutoff area.
+	Draw them, the edited ones with the dim background.
+    */
+    {   if (whichBases == EdBases)
+	{   /* Set all GCs to have dim backgrounds */
+	    XSetBackground(XtDisplay(wid), Agc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Cgc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Ggc, dimBackground);
+	    XSetBackground(XtDisplay(wid), Tgc, dimBackground);
+	}
+
+	for (b=b0; b<=leftCutoff-1; b++) writeBase(wid, b, baseNumWid);
+	
+	if (whichBases == EdBases)
+        {   XSetBackground(XtDisplay(wid), Agc, normBackground);
+	    XSetBackground(XtDisplay(wid), Cgc, normBackground);
+	    XSetBackground(XtDisplay(wid), Ggc, normBackground);
+	    XSetBackground(XtDisplay(wid), Tgc, normBackground);
+	}
+    }
+
+    if (bN>=leftCutoff && b0-NedBases-1<rightCutoff)
+    /*
+        Some of the exposed region is in the middle area.
+	Draw the bases.
+    */
+    {   int firstBase = (b0<leftCutoff) ? leftCutoff : b0;
+	int lastBase  =   (bN>NedBases-1-rightCutoff)
+	                 ? NedBases-1-rightCutoff
+			 : bN;
+
+	for (b=firstBase; b<=lastBase; b++) writeBase(wid, b, baseNumWid);
+    }
+
+    if (bN>NedBases-1-rightCutoff)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  if (whichBases == EdBases)
+       {   XSetBackground(XtDisplay(wid), Agc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Cgc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Ggc, dimBackground);
+	   XSetBackground(XtDisplay(wid), Tgc, dimBackground);
+       }
+
+       /* Plot the affected bases */
+       for (b=NedBases-rightCutoff; b<=bN; b++) writeBase(wid, b, baseNumWid);
+
+       if (whichBases == EdBases)
+       {   XSetBackground(XtDisplay(wid), Agc, normBackground);
+	   XSetBackground(XtDisplay(wid), Cgc, normBackground);
+	   XSetBackground(XtDisplay(wid), Ggc, normBackground);
+	   XSetBackground(XtDisplay(wid), Tgc, normBackground);
+       }
+    }
+
+
+    /* If this is the editable sequence, display the caret */
+    if (wid == edSeqWid) drawExposedCaret(x0, xN);
+}
+
+
+
+
+static void userCallback(Widget wid,
+			 XtPointer client_data, XtPointer call_data)
+/*
+    Handles all XtNcallbacks, i.e., buttons and keys.
+*/
+{   XEvent *eventP = (XEvent *) call_data;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Callback to the `display' module.
+    */
+    userEvent(eventP);
+}
+
+
+
+
+/* ---- Exports ---- */
+void redisplayBases()
+{
+    /*
+    **  Clear the area and generate an expose.
+    */
+    XClearArea(XtDisplay(baseNumWid), XtWindow(baseNumWid),
+		0, 0,
+		0, 0,
+		True);
+    XClearArea(XtDisplay(origSeqWid), XtWindow(origSeqWid),
+		0, 0,
+		0, 0,
+		True);
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		0, 0,
+		0, 0,
+		True);
+}
+
+
+
+
+Widget createBasesDisplay(Widget parentWid, Widget fromVertWid,
+				 Dimension width)
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    currSeq    = NULLSeq;
+    graphWidth = width;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    graphWidth);  nargs++;
+    baseNumWid = XtCreateManagedWidget("baseNumWid", graphWidgetClass,
+				       parentWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, baseNumWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    graphWidth);  nargs++;
+    origSeqWid = XtCreateManagedWidget("origSeq", graphWidgetClass,
+				       parentWid, args, nargs);
+    XtAddCallback(origSeqWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(origSeqWid, XtNexposeCallback, exposeCallback, NULL);
+
+        /*
+	    Get font information.
+	    The text graphs are set to be two pixels higher than
+	    (ascent+descent). Characters are then plotted with
+	    a vertical offset one more than the ascent.
+	*/
+        { XFontStruct *fontStruct;
+
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNfont, &fontStruct); nargs++;
+	  XtGetValues(origSeqWid, args, nargs);
+	  charWidth      = fontStruct->max_bounds.width;
+	  halfCharWidth  = fontStruct->max_bounds.width/2;
+	  charVOffset    = fontStruct->ascent+1;
+	  plotEdgeOffset = halfCharWidth+2;
+	  plotWidth      = graphWidth-(2*plotEdgeOffset);
+	  graphHeight    = fontStruct->ascent + fontStruct->descent + 2;
+	  /* Make this graph tall enough to hold it */
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNheight, graphHeight); nargs++;
+	  XtSetValues(origSeqWid, args, nargs);
+	  /* Make this graph tall enough to hold it */
+	  nargs = 0;
+	  XtSetArg(args[nargs], XtNheight, graphHeight); nargs++;
+	  XtSetValues(baseNumWid, args, nargs);
+	}
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, origSeqWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,  graphWidth);   nargs++;
+    XtSetArg(args[nargs], XtNheight, graphHeight);  nargs++;
+    edSeqWid = XtCreateManagedWidget("edSeq", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(edSeqWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(edSeqWid, XtNexposeCallback, exposeCallback, NULL);
+
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(origSeqWid);
+    Cgc = GraphGC2(origSeqWid);
+    Ggc = GraphGC3(origSeqWid);
+    Tgc = GraphGC4(origSeqWid);
+
+    /*
+        Get the backgrounds
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNbackground,    &normBackground);    nargs++;
+    XtSetArg(args[nargs], XtNdimBackground, &dimBackground); nargs++;
+    XtGetValues(edSeqWid, args, nargs);
+
+    planes = DisplayPlanes(XtDisplay(edSeqWid),DefaultScreen(XtDisplay(edSeqWid)));
+    if (planes == 1) {
+	/* prepare for monochrome display */
+	XtGCMask valuemask = (GCStipple | GCFillStyle);
+	XGCValues values;
+	Pixmap grey_stipple;
+
+#define grey_width 4
+#define grey_height 4
+	static char grey_bits[] = {
+	   0x0e, 0x0b, 0x0e, 0x0b};
+
+	grey_stipple =
+	    XCreateBitmapFromData(XtDisplay(edSeqWid),
+				  RootWindowOfScreen(XtScreenOfObject(edSeqWid)),
+				  grey_bits,
+				  grey_width,
+				  grey_height);
+
+	values.stipple = grey_stipple;
+	values.fill_style = FillOpaqueStippled;
+	greygc = XCreateGC(XtDisplayOfObject(edSeqWid),
+			     RootWindowOfScreen(XtScreenOfObject(edSeqWid)),
+			     valuemask, &values);
+
+    }
+ 
+    return(edSeqWid);
+}
+
+
+
+
+void getBasesFontInfo(Dimension *offset, Dimension *chWidth)
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+{   *offset  = plotEdgeOffset;
+    *chWidth = charWidth;
+}
+
+
+
+
+void displayBases(Seq seq, int baseNum)
+/*
+    Display the bases of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NPoints     = getNPoints(currSeq);
+    NorigBases  = getNBases(currSeq, OrigBases);
+    NedBases    = getNBases(currSeq, EdBases); /* Should be the same */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+
+
+        /* 
+        Ensure the caret for the editable graph exists.
+	This is *not* the place to do this, but it has to be done
+	when we are sure the graph has been realised.
+	In either case, move it to the start.
+    */
+/*    if (caretPixmap == (Pixmap) 0)*/
+    if (caretPixmap == (Pixmap) 0)
+    {   XGCValues values;
+
+	caretPixmap = XCreateBitmapFromData(XtDisplay(edSeqWid),
+					    XtWindow(edSeqWid),
+					    caretBits,
+					    caretWidth,
+					    caretHeight);
+    
+	values.function = GXxor;
+	caretGC = XCreateGC(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+			    GCFunction, &values);
+	XCopyGC(XtDisplay(edSeqWid),
+		GraphGC3(edSeqWid),
+		(GCForeground | GCBackground | GCLineStyle | GCDashList),
+		caretGC);
+    }
+
+    /*
+        Initially, the caret is placed hard to the left.
+    
+    caretBN = -1;
+    caretP  = 0; */
+
+/* modification by lfw 10/24/90,
+   the following section added to move the caret to the position
+  the user requests for baseNum , either by specifying 
+   -baseNum on the command line or -astring */
+
+    caretBN = baseNum-1;
+/* baseNum-1 because it inserts the caret at the space after baseNum */
+
+    if (baseNum == -1)
+      caretP = 0;
+    else
+      caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+    moveCaretTo(baseNum-1);
+
+}
+
+
+void displayEdBases(Seq seq, int baseNum)
+/*
+    Display the bases of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NedBases    = getNBases(currSeq, EdBases);
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+
+
+    /* 
+        Ensure the caret for the editable graph exists.
+	This is *not* the place to do this, but it has to be done
+	when we are sure the graph has been realised.
+	In either case, move it to the start.
+    */
+/*    if (caretPixmap == (Pixmap) 0)*/
+    if (caretPixmap == (Pixmap) 0)
+    {   XGCValues values;
+
+	caretPixmap = XCreateBitmapFromData(XtDisplay(edSeqWid),
+					    XtWindow(edSeqWid),
+					    caretBits,
+					    caretWidth,
+					    caretHeight);
+    
+	values.function = GXxor;
+	caretGC = XCreateGC(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+			    GCFunction, &values);
+	XCopyGC(XtDisplay(edSeqWid),
+		GraphGC3(edSeqWid),
+		(GCForeground | GCBackground | GCLineStyle | GCDashList),
+		caretGC);
+    }
+
+    /*
+        Initially, the caret is placed hard to the left.
+    
+    caretBN = -1;
+    caretP  = 0; */
+
+/* modification by lfw 10/24/90,
+   the following section added to move the caret to the position
+  the user requests for baseNum , either by specifying 
+ -baseNum on the command line or -astring */
+
+    caretBN = baseNum-1;
+/* baseNum-1 because it inserts the caret at the space after baseNum */
+
+    if (baseNum == -1)
+      caretP = 0;
+    else
+      caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+    moveCaretTo(baseNum-1);
+	
+}
+
+
+
+
+void unDisplayBases()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+}
+
+
+
+
+void setBasesWidth(Dimension width)
+/*
+    Set the width of the current sequence bases display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(origSeqWid, args, nargs);
+    XtSetValues(edSeqWid,   args, nargs);
+    XtSetValues(baseNumWid, args, nargs);
+
+    graphWidth = width;
+    plotWidth  = graphWidth-(2*plotEdgeOffset);
+
+    /*
+        Clear the graphs.
+    */
+    if (XtIsRealized(origSeqWid))
+        XClearWindow(XtDisplay(origSeqWid), XtWindow(origSeqWid));
+    if (XtIsRealized(edSeqWid))
+        XClearWindow(XtDisplay(edSeqWid), XtWindow(edSeqWid));
+    if (XtIsRealized(baseNumWid))
+        XClearWindow(XtDisplay(baseNumWid), XtWindow(baseNumWid));
+}
+
+
+
+
+int getCaret()
+/*
+    Get the current position of the caret.
+*/
+{   return(caretBN);
+}
+
+
+
+
+void moveCaretLeft()
+/*
+    Move the caret left one base.
+*/
+{   drawCaret();
+    
+    caretBN = (caretBN==-1) ? -1 : caretBN-1;
+    caretP  = (caretBN==-1) ?  0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void moveCaretRight()
+/*
+    Move the caret right one base.
+*/
+{   drawCaret();
+
+    caretBN = (caretBN==NedBases-1) ? NedBases-1 : caretBN+1;
+    caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void moveCaretTo(int baseNum)
+/*
+    Move the caret to after base `baseNum'.
+*/
+{   drawCaret();
+
+    caretBN = baseNum;
+    caretP  = (caretBN==-1) ? 0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+}
+
+
+
+
+void baseInserted()
+/*
+    The editable sequence has changed by the insertion of a base
+    to the right of the base indicated by the caret. Update our display.
+*/
+{   int point, pix;
+
+    drawCaret();
+
+    NedBases++;
+    caretBN++;
+    caretP  = baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+
+    /* Expose the new base */
+    point = baseNumToPoint(currSeq,EdBases,caretBN);
+    pix   = pointToPixel(point) - halfCharWidth;
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+	       pix, 0,
+	       charWidth, graphHeight,
+	       True);
+}
+
+
+
+
+void baseDeleted()
+/*
+    The editable sequence has changed by the deletion of a base
+    from the left of the caret. Update our display.
+*/
+{   int basePix = pointToPixel(caretP) - halfCharWidth;
+		    
+    drawCaret();
+
+    NedBases--;
+    caretBN--;;
+    caretP = (caretBN==-1) ?  0 : baseNumToPoint(currSeq, EdBases, caretBN);
+
+    drawCaret();
+
+    /* Clear and expose the deleted base */
+    XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+	       basePix, 0,
+	       charWidth, graphHeight,
+	       True);
+}
+
+
+
+
+void basesCutoffChange()
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+{   int newLeftCutoff, newRightCutoff;
+
+    getCutoffs(currSeq, &newLeftCutoff, &newRightCutoff);
+
+
+    if (newLeftCutoff != leftCutoff)
+    /*
+        The region between newLeftCutoff and leftCutoff has changed.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	*/
+	if (newLeftCutoff<leftCutoff)
+	{   b0 = newLeftCutoff;
+	    bN = leftCutoff;
+	}
+	else
+	{   b0 = leftCutoff;
+	    bN = newLeftCutoff;
+	}
+
+	p0 = leftCutoffPoint(b0);
+	pN = leftCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		   pix0, 0,
+		   pixN-pix0, graphHeight,
+		   True);
+    }
+
+
+    if (newRightCutoff != rightCutoff)
+    /*
+        The region between newRightCutoff and rightCutoff has
+	changed. Expose it.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	    N.B. lower cutoff means the base is further right.
+	*/
+	if (newRightCutoff<rightCutoff)
+	{   b0 = rightCutoff;
+	    bN = newRightCutoff;
+	}
+	else
+	{   b0 = newRightCutoff;
+	    bN = rightCutoff;
+	}
+
+
+	p0 = rightCutoffPoint(b0);
+	pN = rightCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(edSeqWid), XtWindow(edSeqWid),
+		   pix0, 0,
+		   pixN-pix0, graphHeight,
+		   True);
+  
+    }
+
+
+    /*
+        Update static data.
+    */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+
+
+
+
diff --git a/src/ted/basesDisplay.h b/src/ted/basesDisplay.h
new file mode 100644
index 0000000..83eb02e
--- /dev/null
+++ b/src/ted/basesDisplay.h
@@ -0,0 +1,114 @@
+/* 
+    Title:       basesDisplay
+
+    File: 	 basesDisplay.h
+    Purpose:	 Display the bases of a sequence on graphs
+    Last update: Wed Jun 13 1990
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget createBasesDisplay(Widget parentWid, Widget fromVertWid,
+				 Dimension width);
+/*
+    Create the bases display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width'. Return the (lower) widget constructed.
+    No bases are initially displayed.
+*/
+
+
+extern void getBasesFontInfo(Dimension *offset, Dimension *chWidth);
+/*
+    In `offset' return the offset which the bases are drawn
+    from the left and right hand edges. The width of the characters
+    used is returned in `chWidth'.
+*/
+
+
+extern void displayBases(Seq seq, int baseNum);
+/*
+    Display the bases of sequence `seq'.
+*/
+
+
+extern void unDisplayBases();
+/*
+    Cease displaying the bases of the current sequence (if any).
+*/
+
+
+extern void setBasesWidth(Dimension width);
+/*
+    Set the width of the current sequence bases display.
+*/
+
+
+extern int getCaret();
+/*
+    Get the current position of the caret.
+*/
+
+
+extern void moveCaretLeft();
+/*
+    Move the caret left one base.
+*/
+
+
+extern void moveCaretRight();
+/*
+    Move the caret right one base.
+*/
+
+
+extern void moveCaretTo(int baseNum);
+/*
+    Move the caret to after base `baseNum'.
+*/
+
+
+extern void baseInserted();
+/*
+    The editable sequence has changed by the insertion of a base
+    to the right of the base indicated by the caret. Update our display.
+*/
+
+
+extern void baseDeleted();
+/*
+    The editable sequence has changed by the deletion of a base
+    from the left of the caret. Update our display.
+*/
+
+
+extern int pixelToPoint(int pixel);
+/*
+    Given `pixel' on the bases display, return the corresponding point.
+    Fractional results are truncated - this may mean a point one too
+    low is indicated, but a point too high is never indicated.
+*/
+
+
+extern void basesCutoffChange();
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
diff --git a/src/ted/dialogues.c b/src/ted/dialogues.c
new file mode 100644
index 0000000..c8ec834
--- /dev/null
+++ b/src/ted/dialogues.c
@@ -0,0 +1,1307 @@
+/* 
+    Title:       dialogues
+
+    File: 	 dialogues.c
+    Purpose:	 Dialogues
+    Last update: Thurs 31 October 1991
+
+    Change log:
+
+	27.11.90 SD  outputOKCallback() now passes seqName in writeSeqABI() call
+	28.11.90 SD  put undesirables under STLOUIS compilation flag
+	02.01.91 SD  remerged with St Louis version
+	22.07.91 LFW put in -enzyme as a command line option for STLOUIS
+	17.09.91 LFW changed MRC switch to !AUTO_CLIP
+		     changed STLOUIS switch to OUT_SEQ
+	31.10.91 SD  Popup dialogue if file exists on output
+	26.11.91 SD  Added SCF format
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include "values.h"     /* IMPORT: MAXLONG */
+
+#include "dialogues.h"
+#include "display.h"    /* IMPORT: displaySeq, unDisplaySeq */
+#include "info.h"       /* IMPORT: info */
+#include "seq.h"        /* IMPORT: Seq, NULLSeq */
+#include "seqRead.h"    /* IMPORT: readSeq */
+#include "seqIOEdit.h"  /* IMPORT: isDotNum, stripDotNum */
+#include "seqOutput.h"  /* IMPORT: writeSeq */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Shell.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Box.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Dialog.h>
+
+
+/* ---- Globals ---- */
+
+char o_fn[200];
+char r_fn[200];
+
+/* ---- Statics ---- */
+
+static Seq currSeq = NULLSeq;
+static int indices[100]; /* array of starting indices of all of the
+			  matches a search for a string found */
+static int num_matches; /* total number of matches to a given
+			    string found in the currSeq */
+static int next_indices; /* the next indices to search for
+			     if the user wishes to search for
+			     the next occurence of string */
+
+/* ----- get resources -------*/
+/* allows you to get the default enzString to search for
+   from the Xted file */
+
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    NULL
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+
+void set_strand_label(Widget strandWid, Seq seq)
+{
+    Cardinal nargs;
+    Arg args[1];
+
+    nargs = 0;
+    if (seq->bottom) {
+	XtSetArg(args[nargs], XtNlabel, "Strand: Bottom"); nargs++;
+    } else {
+	XtSetArg(args[nargs], XtNlabel, "Strand: Top"); nargs++;
+    }
+    XtSetValues(strandWid, args, nargs);
+}    
+
+
+/* ---- Callbacks ---- */
+
+
+static void inputOKCallback(Widget w,
+			    XtPointer client_data, XtPointer call_data)
+{
+    Widget popupWid, nameValWid, radioGroupWid, radioGroupWid2;
+    Widget seqNameWid, NorigBasesWid,strandWid;
+    char *radioData, *radioData2, *fn;
+    Arg args[10];
+    int nargs;
+    Seq oldSeq = currSeq;
+    int bottom;
+    char enzString[100];
+    AppData app_data;
+    
+    
+    popupWid       = (Widget) client_data;
+    nameValWid     = XtNameToWidget(popupWid, "ioForm.nameVal");
+    radioGroupWid  = XtNameToWidget(popupWid, "ioForm.formatBox.plainFmt");
+    radioGroupWid2 = XtNameToWidget(popupWid, "ioForm.formatBox.bottomFmt");
+    seqNameWid     = XtNameToWidget(XtParent(XtParent(popupWid)), "seqName");
+    NorigBasesWid  = XtNameToWidget(XtParent(XtParent(popupWid)), "NorigBases");
+    strandWid      = XtNameToWidget(XtParent(XtParent(popupWid)), "strand");
+    
+    XtGetApplicationResources(popupWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+    
+#ifndef AUTO_CLIP
+    app_data.enz='\0';
+#endif
+    
+    /* LFW, 05.02.91, for command line option to work */
+    if (strcmp(enzString,"")==0) {
+	if (app_data.enz != NULL)
+	    strcpy(enzString,app_data.enz);
+	else *enzString = '\0';
+    }
+    /*
+      Get the name and type of the new one and read it in.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstring, &fn); nargs++;
+    XtGetValues(nameValWid, args, nargs);
+    
+    radioData = (char *) XawToggleGetCurrent(radioGroupWid);
+    radioData2 = (char *) XawToggleGetCurrent(radioGroupWid2);
+    bottom = (strcmp(radioData2, "bottomFmt") == 0);
+    currSeq = readSeq(fn,bottom,enzString,radioData);
+    
+#ifdef DEF_OUT  /* out_seq means default output file name
+		   of input_filename.seq */
+    /* make a default output filename */
+    sprintf(o_fn,"%s",fn);
+    if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+#endif
+    
+    if (currSeq != NULLSeq)
+	{   char NBasesString[10];
+	    
+	    XtDestroyWidget(popupWid);
+	    
+	    /*
+	      Undisplay and dispose of the old sequence.
+	      */
+	    unDisplaySeq();
+	    freeSeq(oldSeq);
+	    
+	    /*
+	      Set the `file name' and `number of bases' labels.
+	      */
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNlabel, fn); nargs++;
+	    XtSetValues(seqNameWid, args, nargs);
+	    nargs = 0;
+	    sprintf(NBasesString, "%d", getNBases(currSeq, OrigBases));
+	    XtSetArg(args[nargs], XtNlabel, NBasesString); nargs++;
+	    XtSetValues(NorigBasesWid, args, nargs);
+	    /* set strand label */
+	    set_strand_label(strandWid, currSeq);
+	    
+	    /* original line        displaySeq(currSeq, -1, 1.0);, 
+	       new line added by lfw: */
+	    displaySeq(currSeq, 1, 0.3);
+	}
+    else
+	{   XBell(XtDisplay(popupWid), 100);
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNstring, ""); nargs++;
+	    XtSetValues(nameValWid, args, nargs);        
+	}
+}
+
+
+
+
+/********************************Check for overwriting***********************/
+#include <sys/types.h>
+#include <sys/stat.h>
+int checkOKwriteSeq(char *fn)
+/* See if file exists */
+{
+    struct stat buf;
+    return ( stat(fn,&buf) );
+}
+
+static void warningYesCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A yes response to the
+** warning about data not saved
+*/
+{
+    int *response = (int *) client_data;
+
+    *response = 1;
+}
+
+static void warningNoCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+/*
+** A No response to the
+** warning about data not saved
+*/
+{
+    int *response = (int *) client_data;
+
+    *response = 0;
+}
+
+
+static int warning(Widget parentWid, String warn_message)
+/*
+** Warn that changes have been made but contig hasn't been saved.
+*/
+{
+    Widget dialog;
+    Arg	args[10];
+    int nargs;
+    int Response;
+    Widget warningWid, warnFormWid;
+    Position  x, y;
+
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    XtTranslateCoords(parentWid,
+		      (Position) 0, (Position) 0,
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    warningWid = XtCreatePopupShell("warning", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    warnFormWid = XtCreateManagedWidget("warnForm", formWidgetClass,
+				      warningWid, args, nargs);
+
+
+
+    /* Create the form */
+    nargs = 0;
+    XtSetArg(args[nargs],XtNlabel,warn_message); nargs++;
+    dialog = XtCreateManagedWidget("warnwid", dialogWidgetClass,
+				      warnFormWid, args, nargs);
+    XawDialogAddButton(dialog,"Yes",warningYesCallback,(XtPointer)&Response);
+    XawDialogAddButton(dialog,"No",warningNoCallback,(XtPointer)&Response);
+
+    /* This is a modal dialogue */
+    Response = -1;
+    XtPopup(warningWid, XtGrabExclusive);
+
+    while (Response==-1)
+    {
+	XEvent event;
+
+        XtAppNextEvent(XtWidgetToApplicationContext(parentWid), &event);
+        XtDispatchEvent(&event);
+    }
+
+    XtDestroyWidget(warningWid);
+
+    return Response;
+}
+
+
+
+int check_file_is_ok(Widget w, char *fn)
+{
+    if (!checkOKwriteSeq(fn)) {
+	return warning(w,"File already exists\nOK to overwrite?");
+    } else
+	return 1;
+}
+
+/********************************************************************/
+
+
+
+
+ static void inputCancelCallback(Widget w,
+				 XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void outputOKCallback(Widget w,
+			      XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nameValWid, radioGroupWid, seqNameWid;
+     Widget includeHeaderWid;
+     char *fn, *seqName;
+     Boolean rc;
+     Boolean includeHeader;
+     Arg args[10];
+     int nargs;
+
+     popupWid      = (Widget) client_data;
+     nameValWid    = XtNameToWidget(popupWid, "ioForm.nameVal");
+     radioGroupWid = XtNameToWidget(popupWid, "ioForm.formatBox.plainFmt");
+     includeHeaderWid = XtNameToWidget(popupWid, "ioForm.includeHeader");
+     seqNameWid    = XtNameToWidget(XtParent(XtParent(popupWid)), "seqName");
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNlabel, &seqName); nargs++;
+     XtGetValues(seqNameWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstring, &fn); nargs++;
+     XtGetValues(nameValWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstate, &includeHeader); nargs++;
+     XtGetValues(includeHeaderWid, args, nargs);
+
+     if (check_file_is_ok(w,fn)) {
+	 rc=writeSeq(currSeq, fn, (strlen(r_fn)==0)?seqName:r_fn,includeHeader);
+	 
+	 if (rc)
+	     {   XtDestroyWidget(popupWid);
+		 setDirty(currSeq, False);
+	     }
+	 else
+	     {   XBell(XtDisplay(popupWid), 100);
+		 nargs = 0;
+		 XtSetArg(args[nargs], XtNstring, ""); nargs++;
+		 XtSetValues(nameValWid, args, nargs);
+	     }
+     } else
+	 XBell(XtDisplay(popupWid), 100);
+ }
+
+
+
+
+ static void outputCancelCallback(Widget w,
+				  XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void checkOKCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, parentWid;
+
+     popupWid = (Widget) client_data;
+     parentWid = XtParent(popupWid);
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does want to overwrite the existing sequence.
+	 Mark it as clean so `inputSeq' will dispose of it.
+     */
+     setDirty(currSeq, False);
+     inputSeq(parentWid);
+ }
+
+
+
+
+ static void checkCancelCallback(Widget w,
+				 XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+ }
+
+
+
+
+ static void quitOKCallback(Widget w,
+			    XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does not want to save the existing sequence.
+	 Quit the application.
+     */
+     freeSeq(currSeq);
+     XtDestroyApplicationContext(XtWidgetToApplicationContext(w));
+     exit(0);
+ }
+
+
+
+
+ static void quitCancelCallback(Widget w,
+				XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid = (Widget) client_data;
+
+     XtDestroyWidget(popupWid);
+
+     /*
+	 The user does not want to quit the application.
+     */
+ }
+
+  static void searchCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nameValWid, NorigBasesWid, nextWid;
+     char *astring;
+     Arg args[10];
+     int nargs;
+     int num,i,num_bases,baseNum;
+     int mag;
+     char *theSeq;
+
+     popupWid      = (Widget) client_data;
+     nameValWid    = XtNameToWidget(popupWid, "ioForm.nameVal");
+     NorigBasesWid = XtNameToWidget(XtParent(XtParent(popupWid)), "NorigBases");
+     nextWid    = XtNameToWidget(popupWid, "ioForm.next");
+
+     next_indices = 0;
+     num_matches = 0;
+
+     /*
+	 Get the name and type of the new one and read it in.
+     */
+     nargs = 0;
+     XtSetArg(args[nargs], XtNstring, &astring); nargs++;
+     XtGetValues(nameValWid, args, nargs);
+
+
+    if (currSeq != NULLSeq)
+    {    
+	 /* astring != NULL, find the string that corresponds
+	    to the app_resources.astring and center the display on that string,
+	    change by lfw, 10/24/90 from this line to free line */
+
+	 if (astring != NULL)  {
+
+	   num = 1; /* num = 1 if astring is a basenum and 0 if alpha */
+	   for (i = 0; i < strlen(astring); i++)
+	     if (!isdigit(astring[i])) {
+	       num = 0; 
+	       break;
+	     }
+	   if (num==0) {
+	     	   for (i = 0; i < strlen(astring); i++)
+		     if (islower(astring[i])) astring[i]=toupper(astring[i]);
+		 }
+	   /* if any of the entries were alphabetic, the program
+		searches for a string */
+
+	   if (num == 0) {
+
+	   num_bases = getNBases(currSeq,EdBases);
+	   theSeq = (char *)calloc(num_bases,sizeof(char));
+	   for (i = 0; i < num_bases; i++)
+	     theSeq[i] = getBase(currSeq, EdBases, i);
+
+	   num_matches = string_match(astring,strlen(astring),theSeq,num_bases,
+				      0, indices);
+
+	   if (num_matches == 0) baseNum = -5;
+	   else {
+	     baseNum = indices[0];
+	     next_indices++;
+	   }
+
+	   free(theSeq);
+
+	 }
+	   else {
+	     sscanf(astring,"%d",&baseNum) ;
+/*	     if (currSeq->bottom) baseNum = currSeq->NorigBases - 1 - baseNum;*/
+	   }
+
+	   mag = -5; /* by setting magnification to -5, the displaySeq
+			routine will go and back calculate the current
+			magnification */
+
+	 if (baseNum != -5) {
+	 displaySeq(currSeq,
+		    ((baseNum!=NULLBaseNum)                 &&
+		     (baseNum>=0)                           &&
+		     (baseNum<getNBases(currSeq,OrigBases))
+		     )
+		    ? baseNum
+		    : NULLBaseNum,
+		    mag
+		    );
+	 if (num_matches > 1) {
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNlabel, "Next?"); nargs++;
+	     XtSetValues(nextWid, args, nargs);
+	 }
+	 else
+	   XtDestroyWidget(popupWid);
+
+       }
+	   else {
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNstring, "String not found"); nargs++;
+	     XtSetValues(nameValWid, args, nargs);
+	   }
+	 }
+       }
+	 else
+	   XBell(XtDisplay(NorigBasesWid), 100);
+       }
+
+
+
+
+  static void searchNextCallback(Widget w,
+			     XtPointer client_data, XtPointer call_data)
+ {   Widget popupWid, nextWid;
+     Arg args[10];
+     int nargs;
+     int baseNum;
+     int mag;
+
+     popupWid      = (Widget) client_data;
+     nextWid    = XtNameToWidget(popupWid, "ioForm.next");
+
+    if (currSeq != NULLSeq)
+    {    
+	   mag = -5; /* by setting magnification to -5, the displaySeq
+			routine will go and back calculate the current
+			magnification */
+	   if (next_indices < num_matches) {
+	     baseNum = indices[next_indices];
+	     next_indices++;
+
+	     nargs = 0;
+	     XtSetArg(args[nargs], XtNlabel, "Next?"); nargs++;
+	     XtSetValues(nextWid, args, nargs);
+
+	     displaySeq(currSeq,
+		    ((baseNum!=NULLBaseNum)                 &&
+		     (baseNum>=0)                           &&
+		     (baseNum<getNBases(currSeq,OrigBases))
+		     )
+		    ? baseNum
+		    : NULLBaseNum,
+		    mag
+		    );
+	   }
+	   else 
+	     XtDestroyWidget(popupWid);
+	 }
+
+
+   }
+
+
+
+
+ /* ---- Action procedures ---- */
+
+
+ static void CRAction(Widget w, XEvent *event,
+		      String *params, Cardinal *num_params)
+ /*
+     Action procedure to be called when CR is hit.
+
+     This may be called from the `ioform' directly in the case of a
+     `check' dialogue, or from an `ioform.nameVal' widget in the case
+     of an input or ouput dialogue.
+     We must call the `default' button callback procedure. We can
+     detect which is the default by looking at its borderwidth, the
+     default one being of size 3.
+ */
+ {   Widget ioFormWid, okWid, cancelWid;
+     Arg args[10];
+     int nargs;
+     Dimension width;
+
+     ioFormWid = (strcmp("ioform", XtName(w))==0) ? w : XtParent(w);
+     okWid     = XtNameToWidget(ioFormWid, "*OK");
+     cancelWid = XtNameToWidget(ioFormWid, "*Cancel");
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNborderWidth, &width); nargs++;
+     XtGetValues(okWid, args, nargs);
+
+     XtCallCallbacks((width==3) ? okWid : cancelWid,
+		     XtNcallback, NULL);
+ }
+
+
+
+
+ /* ---- Internal functions ---- */
+
+
+ void checkInputSeq(Widget parentWid)
+ {   Widget checkWid, ioFormWid, ioPromptWid;
+     Widget okWid, cancelWid;
+     Arg	args[10];
+     int nargs;
+     Position  x, y;
+     Dimension width, height;
+
+
+     /* Add string to function bindings for our application actions */
+     XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+     XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		     actionTable, XtNumber(actionTable));
+
+
+     /*
+	 Position the upper left hand corner of the popup at the
+	 center of the parent widget.
+     */
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+     XtSetArg(args[nargs], XtNheight, &height); nargs++;
+     XtGetValues(parentWid, args, nargs);
+     XtTranslateCoords(parentWid,
+		       (Position) (width / 2), (Position) (height / 2),
+		       &x, &y);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNx, x); nargs++;
+     XtSetArg(args[nargs], XtNy, y); nargs++;
+     checkWid = XtCreatePopupShell("check", transientShellWidgetClass,
+				   parentWid,
+				   args, nargs);
+
+     nargs = 0;
+     ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				       checkWid, args, nargs);
+
+     nargs = 0;
+     ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					 ioFormWid, args, nargs);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+     okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				   ioFormWid, args, nargs);
+     XtAddCallback(okWid, XtNcallback,
+		   checkOKCallback, (XtPointer) checkWid);
+
+     nargs = 0;
+     XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+     XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+     cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				       ioFormWid, args, nargs);
+     XtAddCallback(cancelWid, XtNcallback,
+		   checkCancelCallback, (XtPointer) checkWid);
+
+
+     /* This is a modal dialogue */
+     XtPopup(checkWid, XtGrabExclusive);
+ }
+
+int string_match(seq1, n1, seq2, n2, nmiss, indices)
+
+/*
+  This function may be called once from initialDisplayedSeq if
+  the user has input a string they wish to search for in the
+  input sequence -- this subroutine returns *indices, the first
+  position at which the query sequence (app_resources.astring)
+  matched the input file sequence (currSeq). */
+
+/* a modified version of match.c 
+  finds alignments between a search sequence, seq1, and a target sequence, 
+  seq2, with no gaps (except possibly at ends) 
+   and at most nmiss mismatches (relative to seq1),
+   n1 is the length of seq1 where seq1 is assumed to start at indices 0,
+   n2 is the length of seq2 where seq2 is assumed to start at indices 0 */
+      char *seq1, *seq2;
+      int n1, n2, nmiss;
+      int *indices;
+{
+    int i, j, d, istart, iend, i_miss, n_match;
+    int mtable[100][5];  /* 100 would be, now the total number
+			    of matches the program may find between
+			    a given input string and the sequence */
+
+
+/* d = j - i is the "offset" between the two sequences */
+    if (n1 - n2 > nmiss) return (0);
+    n_match = 0;
+    for (d = -nmiss; d <= n2 + nmiss - n1; d++){
+	if (d < 0)  istart = i_miss = -d;
+	else istart = i_miss = 0;
+	if (d > n2 - n1) {
+	    iend = n2 - d;
+	    i_miss += n1 + d - n2;
+	}
+	else iend = n1;
+	for (i = istart, j = d + i; i < iend; i++, j++)
+	    if (seq1[i] != seq2[j] && ++i_miss > nmiss) goto nextd; 
+/* VOMIT! I want to PUKE!!!! who put this horrid goto here? */
+        mtable[n_match][0] = d + istart; /* indices (assuming they start at
+					    0) of starting nucleotide in the
+					    searched sequence */
+        mtable[n_match][1] = istart; /* nucleotide position in the query
+					sequence where match starts (assuming
+					query index starts with 0) */
+        mtable[n_match][2] = iend - istart; /* number of nucleotides in the
+					       match */
+	if (mtable[n_match][2]==n1) {
+	  indices[n_match] = mtable[n_match][0];
+	}
+	  
+        mtable[n_match][3] = i_miss; /* number of mismatches */
+	n_match++; /* number of matches */
+	if (n_match >= 100) return(n_match);
+
+    nextd:;
+    }
+
+    return (n_match);
+    /* return(0); *//*return 0 if it got to this point; that means
+		 it must have not found an exact match, so 
+		 you want the baseNum to be 0 in that case*/
+}
+
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+void  inputSeq(Widget parentWid)
+/*
+    Set up a dialogue which will read in and display a sequence.
+*/
+{   Widget inputWid, ioFormWid, ioPromptWid;
+    Widget namePromptWid, nameValWid;
+    Widget formatPromptWid, formatBoxWid, plainFmtWid, abiFmtWid, scfFmtWid;
+    Widget alfFmtWid;
+    Widget okWid, cancelWid;
+    Widget bottomFmtWid, topFmtWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+
+
+    /*
+          Check whether there is an existing sequence which
+          is dirty which needs saving.
+    */
+    if (currSeq!=NULLSeq && isDirty(currSeq))
+    {   checkInputSeq(parentWid);
+        return;
+    }
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    inputWid = XtCreatePopupShell("input", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      inputWid, args, nargs);
+
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+					  ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);  nargs++;
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);    nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    formatPromptWid = XtCreateManagedWidget("formatPrompt", labelWidgetClass,
+					    ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  namePromptWid);   nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, formatPromptWid); nargs++;
+    formatBoxWid = XtCreateManagedWidget("formatBox", boxWidgetClass,
+					 ioFormWid, args, nargs);
+
+    nargs = 0;
+    plainFmtWid = XtCreateManagedWidget("plainFmt", toggleWidgetClass,
+					formatBoxWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, plainFmtWid); nargs++;
+    abiFmtWid = XtCreateManagedWidget("abiFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, abiFmtWid); nargs++;
+    alfFmtWid = XtCreateManagedWidget("alfFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, abiFmtWid); nargs++;
+    scfFmtWid = XtCreateManagedWidget("scfFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    bottomFmtWid = XtCreateManagedWidget("bottomFmt", toggleWidgetClass,
+					formatBoxWid,
+					args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNradioGroup, bottomFmtWid); nargs++;
+    topFmtWid = XtCreateManagedWidget("topFmt", toggleWidgetClass,
+				      formatBoxWid,
+				      args, nargs);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, formatPromptWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, inputOKCallback, (XtPointer) inputWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  formatPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  inputCancelCallback, (XtPointer) inputWid);
+
+
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+
+
+    /* This is a modal dialogue */
+    XtPopup(inputWid, XtGrabExclusive);
+}
+
+
+
+
+void outputSeq(Widget parentWid, String defaultFileName)
+/*
+    Save the current sequence using the deafult file name, if given.
+*/
+{   Widget outputWid, ioFormWid, ioPromptWid;
+    Widget namePromptWid, nameValWid;
+    Widget includeHeaderWid,plainFormatWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+
+
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+
+
+    /* If there is no current sequence, then return */
+    if (currSeq==NULLSeq) return;
+
+
+    /*
+        Position the upper left hand corner of the popup at the
+	center of the parent widget.
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    outputWid = XtCreatePopupShell("output", transientShellWidgetClass,
+				  parentWid,
+				  args, nargs);
+
+    /* Create the form */
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      outputWid, args, nargs);
+
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+					  ioFormWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid);     nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);   nargs++;
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);     nargs++;
+    XtSetArg(args[nargs], XtNstring,    defaultFileName); nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+    /* Set the insertion point after the text - dodgy method used */
+    XawTextSetInsertionPoint(nameValWid, MAXLONG);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, True); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Include Header"); nargs++;
+    includeHeaderWid = XtCreateManagedWidget("includeHeader", toggleWidgetClass,
+					ioFormWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNstate, False); nargs++;
+    XtSetArg(args[nargs], XtNfromVert, namePromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, includeHeaderWid); nargs++;
+    XtSetArg(args[nargs], XtNlabel, "Plain Format"); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, includeHeaderWid); nargs++;
+    plainFormatWid = XtCreateManagedWidget("plainFormat", toggleWidgetClass,
+					ioFormWid,
+					args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, includeHeaderWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, outputOKCallback, (XtPointer) outputWid);
+
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, includeHeaderWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback, 
+		  outputCancelCallback, (XtPointer) outputWid);
+    
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(outputWid, XtGrabExclusive);
+}
+
+
+
+
+void quitApplication(Widget parentWid)
+/*
+  This function must be called to exit the application.
+  It ensures any current sequence is saved.
+  */
+{   Widget ioFormWid, ioPromptWid, quitWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+    
+    
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+    
+    
+    /*
+      Check whether there is an existing sequence which
+      is dirty which needs saving.
+      */
+    if (currSeq==NULLSeq || !isDirty(currSeq))
+	{   free(currSeq);
+	    XtDestroyApplicationContext(XtWidgetToApplicationContext(parentWid));
+	    exit(0);
+	}
+    
+    
+    /*
+      Position the upper left hand corner of the popup at the
+      center of the parent widget.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    quitWid = XtCreatePopupShell("quit", transientShellWidgetClass,
+				 parentWid,
+				 args, nargs);
+    
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      quitWid, args, nargs);
+    
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback,
+		  quitOKCallback, (XtPointer) quitWid);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  quitCancelCallback, (XtPointer) quitWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(quitWid, XtGrabExclusive);
+}
+
+
+
+
+void initialDisplayedSeq(Widget toplevelWid,
+			 char *format, char *fn,
+			 int baseNum, int mag, char *astring, 
+			 char *enzyme, int bottom)
+/*
+  This function may be called once, after the application
+  has been realised, to specify a sequence to be displayed
+  initially. If it is not called, no sequence is initially
+  displayed.
+  */
+{   Arg args[10];
+    int nargs;
+    int i;
+    char NBasesString[10];
+    char *theSeq;
+    AppData app_data;
+    char enzString[100];
+    
+    
+    int num_bases;
+    Widget seqNameWid    = XtNameToWidget(toplevelWid, "mainForm.seqName");
+    Widget NorigBasesWid = XtNameToWidget(toplevelWid, "mainForm.NorigBases");
+    num_matches = 0;
+    next_indices = 0;
+    
+    XtGetApplicationResources(seqNameWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+    
+#ifndef AUTO_CLIP
+    app_data.enz='\0';
+#endif
+    
+    /* enzyme will already equal either the app_resources.enz or
+       app_data.enz, lfw 05.02.91*/
+    if (enzyme!=NULL) 
+	strcpy(enzString,enzyme);
+    /*      if (app_data.enz != NULL)
+	    strcpy(enzString,app_data.enz);*/
+    else *enzString = '\0';
+    
+    
+    
+    /* read a sequence */
+    currSeq = readSeq(fn,bottom,enzString,format);
+    
+    if (currSeq != NULLSeq)
+	{    
+	    /*
+	      Set the `file name' and `number of bases' labels.
+	      */
+	    nargs = 0;
+	    XtSetArg(args[nargs], XtNlabel, fn); nargs++;
+	    XtSetValues(seqNameWid, args, nargs);
+	    nargs = 0;
+	    sprintf(NBasesString, "%d", getNBases(currSeq, OrigBases));
+	    XtSetArg(args[nargs], XtNlabel, NBasesString); nargs++;
+	    XtSetValues(NorigBasesWid, args, nargs);
+	    
+	    /* astring != NULL, find the string that corresponds
+	       to the app_resources.astring and center the display on that string,
+	       change by lfw, 10/24/90 from this line to free line */
+	    
+	    if (astring != NULL)  {
+		num_bases = getNBases(currSeq,EdBases);
+		theSeq = (char *)calloc(num_bases,sizeof(char));
+		for (i = 0; i < num_bases; i++)
+		    theSeq[i] = getBase(currSeq, EdBases, i);
+		
+		num_matches = string_match(astring,strlen(astring),theSeq,num_bases,
+					   0, indices);
+		baseNum = indices[0];
+		if (num_matches == 0) baseNum = 0;
+		
+		free(theSeq);
+	    }
+	    
+	    displaySeq(currSeq,
+		       ((baseNum!=NULLBaseNum)                 &&
+			(baseNum>=0)                           &&
+			(baseNum<getNBases(currSeq,OrigBases))
+			)
+		       ? baseNum
+		       : NULLBaseNum,
+		       mag
+		       );
+	}
+    else
+        XBell(XtDisplay(NorigBasesWid), 100);
+    
+}
+
+
+void  inputSearchString(Widget parentWid)
+/*
+  Set up a dialogue which will read in a search for a baseNum or string
+  and center the display on that baseNum, added by lfw 12/04/90
+  */
+{   Widget searchWid, ioFormWid, ioPromptWid;
+    Widget nameValWid, nextWid;
+    Widget okWid, cancelWid;
+    Arg	args[10];
+    int nargs;
+    Position  x, y;
+    Dimension width, height;
+    
+    
+    /* Add string to function bindings for our application actions */
+    XtActionsRec actionTable[] = { {"CRAction", CRAction} };
+    XtAppAddActions(XtWidgetToApplicationContext(parentWid),
+		    actionTable, XtNumber(actionTable));
+    
+    /*
+      Position the upper left hand corner of the popup at the
+      center of the parent widget.
+      */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtSetArg(args[nargs], XtNheight, &height); nargs++;
+    XtGetValues(parentWid, args, nargs);
+    XtTranslateCoords(parentWid,
+		      (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNx, x); nargs++;
+    XtSetArg(args[nargs], XtNy, y); nargs++;
+    searchWid = XtCreatePopupShell("search", transientShellWidgetClass,
+				   parentWid,
+				   args, nargs);
+    
+    nargs = 0;
+    ioFormWid = XtCreateManagedWidget("ioForm", formWidgetClass,
+				      searchWid, args, nargs);
+    
+    nargs = 0;
+    ioPromptWid = XtCreateManagedWidget("ioPrompt", labelWidgetClass,
+					ioFormWid, args, nargs);
+    
+    /*    nargs = 0;
+	  XtSetArg(args[nargs], XtNfromVert, ioPromptWid); nargs++;
+	  namePromptWid = XtCreateManagedWidget("namePrompt", labelWidgetClass,
+	  ioFormWid, args, nargs);*/
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  ioPromptWid); nargs++;
+    /*    XtSetArg(args[nargs], XtNfromHoriz, namePromptWid);  nargs++;*/
+    XtSetArg(args[nargs], XtNeditType,  XawtextEdit);    nargs++;
+    nameValWid = XtCreateManagedWidget("nameVal", asciiTextWidgetClass,
+				       ioFormWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, nameValWid); nargs++;
+    okWid = XtCreateManagedWidget("OK",  commandWidgetClass,
+				  ioFormWid, args, nargs);
+    XtAddCallback(okWid, XtNcallback, searchCallback, (XtPointer) searchWid);
+    
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  nameValWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, okWid);           nargs++;
+    cancelWid = XtCreateManagedWidget("Cancel",  commandWidgetClass,
+				      ioFormWid, args, nargs);
+    XtAddCallback(cancelWid, XtNcallback,
+		  inputCancelCallback, (XtPointer) searchWid);
+    
+    
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert,  nameValWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, cancelWid);           nargs++;
+    nextWid = XtCreateManagedWidget("next",  commandWidgetClass,
+				    ioFormWid, args, nargs);
+    XtAddCallback(nextWid, XtNcallback,
+		  searchNextCallback, (XtPointer) searchWid);
+    
+    
+    
+    /* nameVal (text) widget gets the keyboard focus */
+    XtSetKeyboardFocus(ioFormWid, nameValWid);
+    
+    
+    /* This is a modal dialogue */
+    XtPopup(searchWid, XtGrabExclusive);
+}
+
+
+
+
+
+void information (Widget w)
+/*
+ ** Display sequence and trace information gleaned from the sequence file
+ */
+{
+    
+    info(w, currSeq);
+    
+}
diff --git a/src/ted/dialogues.h b/src/ted/dialogues.h
new file mode 100644
index 0000000..a5649e3
--- /dev/null
+++ b/src/ted/dialogues.h
@@ -0,0 +1,71 @@
+#ifndef _dialogues_h
+#define _dialogues_h
+
+/* 
+    Title:       dialogues
+
+    File: 	 dialogues.h
+    Purpose:	 Dialogues
+    Last update: Thu Jan 03 1991
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+#include <stdio.h>
+
+/* ---- Global to ted.c and dialogues.c --- */
+extern char o_fn[200]; /* added by lfw, so a default output filename
+			  could be specified */
+extern char r_fn[200]; /* added by sd, so a default raw data filename
+			  could be specified */
+
+/* ---- Exports ---- */
+
+extern void inputSeq(Widget parentWid);
+/*
+    Set up a dialogue which will read in and display a new sequence.
+*/
+
+
+extern void outputSeq(Widget parentWid, String defaultFileName);
+/*
+    Save the current sequence using the default file name, if given.
+*/
+
+extern void inputSearchString(Widget parentWid);
+/*
+    Set up a dialogue which will read in and search for a string.
+*/
+
+extern void quitApplication(Widget parentWid);
+/*
+    This function must be called to exist the application.
+    It ensures any current sequence is saved.
+*/
+
+
+extern void initialDisplayedSeq(Widget toplevelWid,
+				char *format, char *fn,
+				int baseNum, int mag, char *astring, 
+				char *enzyme, int bottom);
+/*
+    This function may be called once, after the application
+    has been realised, to specify a sequence to be displayed
+    initially. If it is not called, no sequence is initially
+    displayed.
+*/
+
+extern void information(Widget w);
+/*
+    Display useful sequence and trace information
+*/
+
+extern int string_match(char *seq1, int n1, char *seq2, int n2,
+			int nmiss, int *indices);
+
+#endif  /*_dialogues_h*/
diff --git a/src/ted/display.c b/src/ted/display.c
new file mode 100644
index 0000000..a0c9537
--- /dev/null
+++ b/src/ted/display.c
@@ -0,0 +1,801 @@
+/* 
+    Title:       display
+
+    File: 	 display.c
+    Purpose:	 Sequence display and editing
+    Last update: Friday 15 May 1992
+*/
+
+
+/*
+    The display is created by a call to CreateDisplay. It principally
+    consists of two components: the first, created by a call to
+    createBasesDisplay displays the original and edited bases, the
+    second, created by a call to createTracesDisplay, displays the
+    traces.
+
+    Editing is done by stream: as the client presses keys and buttons
+    these must be interpreted as editing keystrokes.
+
+    The viewport widget does not itself hand out resize events, so we
+    create a dummy (graph) widget in the same patch, connected in the
+    same way to the walls of the form, to collect resize events and
+    allow us to adjust scalings for the graphs within the viewport.
+    This is slightly dodgy and should be redone some other way.
+
+  Changes to this program by lfw:
+  module displaySeq
+        Added the parameter mag to the parameters
+                passed to the subroutine
+        Changed to default magnification to 30%
+               by changing the call to setGraphWidth
+               to be 30 rather than -1.  And by
+               allowing the call to setGraphWidth
+               to be specified by the user which
+               comes from the main calling program
+               via the mag parameter
+
+  Changes made by sd:
+	Added scaleUp and scaleDown buttons + callbacks
+
+  SD 15-May-1992
+        editSeqEvents(): DEC Xlib function XlookupString didn't always work with
+	a buffer of length of 1, even when only one character returned. Made 
+	buffer larger - now eight characters (doesn't work with <=four)
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "display.h"
+#include "traceDisplay.h" /* IMPORT: createTraceDisplay, displayTrace,
+			             unDisplayTrace, setTraceWidth */
+#include "basesDisplay.h" /* IMPORT: createBasesDisplay, displayBases,
+			             unDisplayBases, setBasesWidth */
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+
+#include <ctype.h>        /* IMPORT: isgraph */
+
+#include <X11/Intrinsic.h>
+#include <X11/keysym.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+#include <X11/Xaw/Scrollbar.h>
+#include <X11/Xaw/Toggle.h>
+#include <X11/Xaw/Viewport.h>
+
+
+
+
+/* ---- Constants ---- */
+
+
+#define MaxDimension 32767 /* By definition, 2^16-1 */
+#define MaxMagCharSpace 5 /* At maximum magnification, the distance in
+			     characters between each base */
+#define ViewportFudge 10  /* A guess of the space required within the
+			     viewport for the graph to fit */
+
+
+/* the following allows us to get the default magnification from the
+   app_defaults file, Xted */
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    NULL
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+
+
+
+
+
+
+/* ---- Statics ---- */
+
+/* Current sequence information */
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static int NedBases;
+
+/* Viewport widget */
+static Widget viewportWid;
+static Widget vpFormWid;
+static Widget dummyWid;
+
+static Dimension charWidth;      /* As used by basesDisplay */
+static Dimension minGraphWidth;  /* All characters displayed
+				    (depends on width of parent viewport) */
+static Dimension maxGraphWidth;  /* Bases spaced out with `MaxMagCharSpace'
+				    blanks (depends on char size and
+				    NorigBases) */
+static Dimension graphWidth;     /* Current width of the graph */
+static Dimension plotEdgeOffset;
+
+
+/* Editing state toggles */
+static Widget togWid;            /* The group of toggles */
+
+
+/* Magnification widget */
+static Widget magWid;
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void setGraphWidth(int percent)
+/*
+      Set the widths of the graphs to be `percent' between
+      the minimum and maximum widths.
+      The current ``centre of interest'' is maintained.
+*/
+{   int nargs;
+    Arg args[10];
+    float centre, topOfThumb, shown;
+    Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+
+    /*
+        The ``centre of interest'' is the middle of the displayed
+	graph. Percentagewise, this is given by `topOfThumb+show/2'.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    centre = topOfThumb+shown/2;
+
+    /*
+        Set the displays to the required width
+    */
+    graphWidth = ((maxGraphWidth-minGraphWidth) * percent)/100 + minGraphWidth;
+    XawFormDoLayout(vpFormWid, False);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, graphWidth); nargs++;
+    setBasesWidth(graphWidth);
+    setTraceWidth(graphWidth);
+    XawFormDoLayout(vpFormWid, True);
+
+    /*
+        We have changed the size of the form widget within the viewport
+	widget. The viewport does not define where exactly the form
+	will now sit, so we manually reset the ``centre of interest'',
+	keeping `shown' at its (new) value.
+    */
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtSetArg(args[nargs], XtNshown,      &shown);      nargs++;
+    XtGetValues(hScrollWid, args, nargs);
+    topOfThumb = centre-shown/2;
+    XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+    XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+}
+
+
+
+
+static void editSeqEvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a sequence
+    editing command.
+*/
+{
+    char buffer[8];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 8,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    moveCaretRight();
+		    break;
+
+                case XK_Left:
+		    moveCaretLeft();
+		    break;
+
+		case XK_Delete:
+		    /* Delete the base to the left of the caret */
+		    if (deleteBase(currSeq, getCaret()))
+		    {   baseDeleted();
+		        NedBases--;
+		    }
+		    else
+		        XBell(XtDisplay(viewportWid), 100);
+	            break;
+
+		default:
+		    if (charCount==1 && isgraph(buffer[0]))
+		    {   /* Insert the base to the right of the caret */
+			if (insertBase(currSeq, buffer[0], getCaret()))
+			{   baseInserted();
+			    NedBases++;
+			}
+			else
+		            XBell(XtDisplay(viewportWid), 100);
+		    }
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {	/* Move the caret to where the pointer is */
+		int point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         moveCaretTo(-1);
+		else if (point>NPoints-1) moveCaretTo(NedBases-1);
+		else
+	        {    int b = pointToBaseNum(currSeq, EdBases, point);
+		     moveCaretTo((b == NULLBaseNum) ? NedBases-1 : b-1);
+		}
+		break;
+	    }
+    }
+}
+
+
+
+
+static void adjustLEvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a command
+    to adjust the left cutoff.
+*/
+{   char buffer[1];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    int leftCutoff, rightCutoff;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 1,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    /* Move the cutoff right one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (leftCutoff<NedBases) leftCutoff++;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+
+                case XK_Left:
+		    /* Move the cutoff left one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (leftCutoff>0) leftCutoff--;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {   int point, baseNum;
+
+		/* Move the cutoff to where the pointer is */
+		point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         baseNum = 0;
+		else if (point>NPoints-1) baseNum = NedBases;
+		else
+		{ baseNum = pointToBaseNum(currSeq, EdBases, point);
+
+		  baseNum = (baseNum == NULLBaseNum)
+		              ? NedBases
+		              : baseNum;
+		}
+		getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		setCutoffs(currSeq, baseNum, rightCutoff);		      
+		basesCutoffChange();
+		traceCutoffChange();
+		break;
+	    }
+    }
+}
+
+
+
+
+static void adjustREvent(XEvent *eventP)
+/*
+    `eventP' (a button or key press) is interpreted as a command
+    to adjust the right cutoff.
+*/
+{   char buffer[1];
+    KeySym keySym;
+    int charCount;
+    XComposeStatus compose;
+    int leftCutoff, rightCutoff;
+
+    switch (eventP->type)
+    {   case KeyPress:
+            charCount = XLookupString((XKeyEvent *)eventP, buffer, 1,
+				      &keySym, &compose);
+	    switch (keySym)
+	    {   case XK_Right:
+		    /* Move the cutoff right one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (rightCutoff>0) rightCutoff--;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+
+                case XK_Left:
+		    /* Move the cutoff left one */
+		    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		    if (rightCutoff<NedBases) rightCutoff++;
+		    (void) setCutoffs(currSeq, leftCutoff, rightCutoff);
+		    basesCutoffChange();
+		    traceCutoffChange();
+		    break;
+	    }
+            break;
+
+        case ButtonPress:
+	    {   int point, baseNum;
+
+		/* Move the cutoff to where the pointer is */
+		point = pixelToPoint(eventP->xbutton.x);
+		if      (point<0)         baseNum = 0;
+		else if (point>NPoints-1) baseNum = NedBases;
+		else
+		{ baseNum = pointToBaseNum(currSeq, EdBases, point);
+
+		  baseNum = (baseNum == NULLBaseNum)
+		              ? NedBases
+		              : baseNum;
+		}
+		getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+		setCutoffs(currSeq, leftCutoff, NedBases-baseNum);
+		basesCutoffChange();
+		traceCutoffChange();
+		break;
+	    }
+    }
+}
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Dimension width;
+    int mag;
+    Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth,  &width);  nargs++;
+    XtGetValues(viewportWid, args, nargs);
+    
+    /* Back calculate what the current magnification is */
+    mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+
+    /* Adjust the minimum magnification */
+    minGraphWidth = width-ViewportFudge;
+    graphWidth    = minGraphWidth;
+
+    /* Set us to the old magnification, but with the new dimensions */
+    setGraphWidth(mag);
+}
+
+
+
+
+static void magJumpCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{   float percent = *((float *) call_data);
+
+    setGraphWidth((int) (percent*100));
+}
+
+
+
+
+static void magScrollCallback(Widget wid,
+			      XtPointer client_data, XtPointer call_data)
+{   int position = (int) call_data;
+    float     topOfThumb;
+    int nargs;
+    Arg args[10];
+
+    nargs=0;
+    XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+    XtGetValues(wid, args, nargs);
+
+    /* Move the thumb by 1% in the appropriate direction */
+    if (position>0)
+    {   topOfThumb += .01;
+	if (topOfThumb > 1.0) topOfThumb = 1.0;
+    }
+    else
+    {   topOfThumb -= .01;
+	if (topOfThumb < 0.0) topOfThumb = 0.0;
+    }
+
+    XawScrollbarSetThumb(wid, topOfThumb, -1.0);
+
+    setGraphWidth((int) (topOfThumb*100));
+}
+
+
+static void scaleUpCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{
+    incScaleFactor();
+}
+
+static void scaleDownCallback(Widget wid,
+			    XtPointer client_data, XtPointer call_data)
+{
+    decScaleFactor();
+}
+
+
+static void complementCallback(Widget wid,
+			       XtPointer client_data, XtPointer call_data)
+{
+    complement_seq(currSeq);
+    /* force redisplay of everything in viewport widget */
+    redisplayTraces();
+    redisplayBases();
+    /* shift caret */
+    moveCaretTo(currSeq->NedBases - getCaret() - 2);
+
+    { /* set viewport position */
+	Cardinal nargs;
+	float topOfThumb, shown;
+	Arg args[2];
+	Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+	nargs=0;
+	XtSetArg(args[nargs], XtNtopOfThumb, &topOfThumb); nargs++;
+	XtSetArg(args[nargs], XtNshown, &shown); nargs++;
+	XtGetValues(hScrollWid, args, nargs);
+	topOfThumb = 1.0 - shown - topOfThumb;
+	XawScrollbarSetThumb(hScrollWid, topOfThumb, shown);
+	XtCallCallbacks(hScrollWid, XtNjumpProc, &topOfThumb);
+    }
+
+    /* set strand widget */
+    {
+	Widget strandWid;
+	strandWid = XtNameToWidget(XtParent(wid), "strand");
+	set_strand_label(strandWid,currSeq);
+
+    }
+
+}
+
+
+
+/* ---- Exports ---- */
+
+
+void createDisplay(Widget parentWid, Widget fromVertWid)
+/*
+    Create the display within the Form widget `parentWid'
+    with the XtNfromVert constraint `fromVertWid'.
+    No sequence is initally displayed.
+*/
+{   Widget labelWid, basesWid;
+    Widget scaleUpWid, scaleDownWid;
+    Widget compWid;
+    Arg args[10];
+    int nargs;
+
+    /* The magnification slider */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    labelWid = XtCreateManagedWidget("maglab", labelWidgetClass,
+				     parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    magWid = XtCreateManagedWidget("magscr", scrollbarWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(magWid, XtNjumpProc,   magJumpCallback, NULL);
+    XtAddCallback(magWid, XtNscrollProc, magScrollCallback, NULL);
+
+
+
+    /* The editing mode toggles */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+   XtSetArg(args[nargs], XtNfromHoriz, magWid); nargs++;
+    labelWid = XtCreateManagedWidget("edmodelab", labelWidgetClass,
+				     parentWid, args, nargs);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, labelWid); nargs++;
+    togWid = XtCreateManagedWidget("adjustL", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, togWid); nargs++;
+    togWid = XtCreateManagedWidget("edSeq", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    XtSetArg(args[nargs], XtNradioGroup, togWid); nargs++;
+    togWid = XtCreateManagedWidget("adjustR", toggleWidgetClass,
+				   parentWid, args, nargs);
+
+
+    /* for vertical scaling */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, togWid); nargs++;
+    scaleDownWid = XtCreateManagedWidget("scaleDown", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(scaleDownWid, XtNcallback, scaleDownCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, scaleDownWid); nargs++;
+    scaleUpWid = XtCreateManagedWidget("scaleUp", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(scaleUpWid, XtNcallback, scaleUpCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNfromHoriz, scaleUpWid); nargs++;
+    compWid = XtCreateManagedWidget("switch", commandWidgetClass,
+				   parentWid, args, nargs);
+    XtAddCallback(compWid, XtNcallback, complementCallback, NULL);
+
+
+    /* The main display */
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, labelWid); nargs++;
+    XtSetArg(args[nargs], XtNforceBars, True); nargs++;
+    viewportWid = XtCreateManagedWidget("viewport", viewportWidgetClass,
+					parentWid, args, nargs);
+
+        /*
+	    The minimum graph width is the viewport width minus a
+	    random amount to allow for spacing between the viewport
+	    and the graph, borders etc.
+	    The initial graph width is this minimum width.
+        */
+        {   Dimension vpWidth;
+
+            nargs = 0;
+            XtSetArg(args[nargs], XtNwidth, &vpWidth); nargs++;
+	    XtGetValues(viewportWid, args, nargs);
+	    minGraphWidth = vpWidth-ViewportFudge;
+	    graphWidth = minGraphWidth;
+	}
+
+    nargs = 0;
+    vpFormWid = XtCreateManagedWidget("vpForm", formWidgetClass,
+				      viewportWid, args, nargs);
+
+    basesWid = createBasesDisplay(vpFormWid, NULL, graphWidth);
+    getBasesFontInfo(&plotEdgeOffset, &charWidth);
+    createTraceDisplay(vpFormWid,basesWid,graphWidth, plotEdgeOffset);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, magWid); nargs++;
+    dummyWid = XtCreateManagedWidget("dummy", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(dummyWid, XtNresizeCallback, resizeCallback, NULL);
+}
+
+
+
+
+void displaySeq(Seq seq, int baseNum, int mag)
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to NULLBaseNum, display at 30% (or mag)
+    magnification centered on baseNum.
+*/
+{   AppData app_data;
+
+
+
+    XtGetApplicationResources(magWid, (XtPointer) &app_data,
+                              resources, XtNumber(resources),
+                              NULL, 0);
+
+
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+    displayBases(seq,baseNum);
+    displayTrace(seq);
+
+
+    /*
+      The maximum graph width is such to allow for a `MaxMagCharSpace'
+      spacing between bases.
+
+      There is a limit on Dimension sizes, noting that the graph we
+      generate has to fit into a Viewport.
+    */
+    {  Dimension maxWidth;
+       int  NorigBases;
+
+       NorigBases = getNBases(currSeq, OrigBases);
+       maxWidth = ((NorigBases-1) * charWidth * MaxMagCharSpace) +
+	                (2 * plotEdgeOffset);
+       maxGraphWidth =
+	     (maxWidth>MaxDimension-ViewportFudge) ? MaxDimension-ViewportFudge
+	   : (maxWidth<minGraphWidth)              ? minGraphWidth
+	   :                                         maxWidth;
+    }
+
+
+    /*
+        Get static information about this sequence
+    */
+    NedBases = getNBases(seq, EdBases);
+    NPoints  = getNPoints(seq);
+
+    
+    if (baseNum==NULLBaseNum)
+    {   /* Starting at LH end, minimum magnification (everything visible) */
+
+        if (mag==-5) /* back calculate the magnification */ 
+	  mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+
+
+	if (mag <= 0 || mag > 100) {
+	  setGraphWidth(0);
+	  XawScrollbarSetThumb(magWid, 0.3, -1.0);
+	}
+	else {
+	  setGraphWidth(mag);
+	  XawScrollbarSetThumb(magWid, (float)mag/100.0, -1.0);
+	}
+    }
+    else
+    {   /* Centred on baseNum, half magnification */
+        Widget hScrollWid = XtNameToWidget(viewportWid, "horizontal");
+        float topOfThumb;
+
+	/* Set the `centre of interest' on baseNum. Zero `shown'. */
+/*
+	topOfThumb = (float)baseNum / (float)(NedBases-1);
+*/
+	topOfThumb = (float) baseNumToPoint(seq, OrigBases,baseNum) / (float) NPoints;
+	XawScrollbarSetThumb(hScrollWid, topOfThumb, 0.0);
+	/* Set the actual magnification */
+
+/* the original program automatically set the magnification to 30% if
+   the user specified a baseNum other than NULL, this section was added
+   by LaDeana Hillier, 9/27/90, to allow the user to set their own
+   magnification when calling up ted. */
+
+        if (mag==-5) /* back calculate it */ 
+	  mag = ((graphWidth-minGraphWidth)*100) / (maxGraphWidth-minGraphWidth);
+	if (app_data.magnif==0) app_data.magnif = 30;
+
+	if (mag <= 0 || mag > 100) {
+	  setGraphWidth(app_data.magnif);
+	/* Set the magnification scrollbar */
+          XawScrollbarSetThumb(magWid, (float)(app_data.magnif)/100.0, -1.0);
+	}
+	else {
+	  setGraphWidth(mag);
+          XawScrollbarSetThumb(magWid, (float)mag/100.0, -1.0);
+	}
+    }
+}
+
+
+
+
+void unDisplaySeq()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+
+
+    /*
+        Reset the magnification to minimum.
+	The thumb is left its default size.
+    */
+    XawScrollbarSetThumb(magWid, 0.0, -1.0);
+
+    unDisplayBases();
+    unDisplayTrace();
+    /*
+        The initial width is the minimum, so everything is visible.
+	This also clears the graphs.
+    */
+    setGraphWidth(0);
+}
+
+
+
+
+Seq displayedSeq()
+/*
+    Return the currently displayed sequence, or NULL
+    if none is being displayed.
+*/
+{   return(currSeq);
+}
+
+
+
+
+void userEvent(XEvent *eventP)
+{   String currTog;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Find which of the toggles is currently set
+    */
+    currTog = (String) XawToggleGetCurrent(togWid);
+
+    if (strcmp(currTog, "adjustL") == 0)
+    {   /* Adjust the left cutoff */
+        adjustLEvent(eventP);
+    }
+    else if (strcmp(currTog, "adjustR") == 0)
+    {  /* Adjust the right cutoff */
+        adjustREvent(eventP);
+    }
+    else
+    {   /* Edit the sequence itself */
+	editSeqEvent(eventP);
+    }
+}
+
+
diff --git a/src/ted/display.h b/src/ted/display.h
new file mode 100644
index 0000000..a93070f
--- /dev/null
+++ b/src/ted/display.h
@@ -0,0 +1,68 @@
+/* 
+    Title:       display
+
+    File: 	 display.h
+    Purpose:	 Sequence display and editing
+    Last update: Tue Jun 5 1990
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void createDisplay(Widget parentWid, Widget superiorWid);
+/*
+    Create the display within the Form widget `parent',
+    underneath `superiorWid'.
+    No sequence is initially displayed.
+*/
+
+
+extern void displaySeq(Seq seq, int baseNum, int mag);
+/*
+    Display the sequence `seq' for editing.
+    If baseNum is not equal to -1, display at half magnification
+    centered on baseNum. (unless the user has also set mag)
+*/
+
+
+extern void unDisplaySeq();
+/*
+    Cease displaying the current sequence (if any).
+*/
+
+
+extern Seq displayedSeq();
+/*
+    Return the currently displayed sequence, or NULLSeq
+    if none is being displayed.
+*/
+
+
+extern void userEvent(XEvent *eventP);
+/*
+    Handle the (key or button) event generated in a subdisplay.
+*/
+
+
+
+
+
+
+
+
diff --git a/src/ted/fpoint.c b/src/ted/fpoint.c
new file mode 100644
index 0000000..cd930c1
--- /dev/null
+++ b/src/ted/fpoint.c
@@ -0,0 +1,54 @@
+/*#include <math.h>*/
+extern double log ( double x ) ;
+extern double exp ( double x ) ;
+#define IEEE
+
+float int_to_float(int in)
+/*
+** interpret the integer in as a
+** floating point number in IEEE format
+*/
+{
+   /*
+  Assume `in' is stored as a float according to the 
+  ANSI IEEE 754-1985 standard. See the tables below:
+
+  s = sign ( 1 bit)
+  e = biased exponent (8 bits)
+  f = fraction (23 bits)
+
+  floating point number =  (-1)^s 2^(e-127) 1.f
+
+     Bits  Name      Content
+      31   Sign      1 iff number is negative
+    23-30  Exponent  Eight-Bit exponent, biased by 127
+     0-22  Fraction  23-bit fraction component of normalised significant.
+		     The "one" bit is "hidden"
+
+  If IEEE floating point format is supported on your machine...
+  ensure there is a #define IEEE somewhere. 
+  */
+
+#ifdef IEEE
+  union {
+    int i;
+    float f;
+  } cvt;
+  cvt.i = in;
+  return cvt.f;
+#else
+  int fraction;
+  int exponent;
+  int sign;
+
+  fraction = in & ( (1<<23)-1 );
+  exponent = (in >> 23) & ( (1<<8)-1 );
+  sign = (in >> 31);
+
+  return
+    (float) (
+      (sign?-1.0:1.0) *
+      exp ( log ( (double) 2.0) * (double) (exponent - 127 - 23) ) *
+      (double) ((1<<23)+fraction)) ;
+#endif
+}
diff --git a/src/ted/fpoint.h b/src/ted/fpoint.h
new file mode 100644
index 0000000..065c5df
--- /dev/null
+++ b/src/ted/fpoint.h
@@ -0,0 +1,6 @@
+#ifndef _fpoint_h
+#define _fpoint_h
+
+extern float int_to_float(int in);
+
+#endif /* _fpoint_h */
diff --git a/src/ted/help.c b/src/ted/help.c
new file mode 100644
index 0000000..c446e79
--- /dev/null
+++ b/src/ted/help.c
@@ -0,0 +1,128 @@
+/* 
+    Title:       help
+
+    File: 	 help.c
+    Purpose:	 Help
+    Last update: Fri Jun 15 1990
+*/
+
+
+/* ---- Includes ---- */
+#include <stdio.h>
+
+#include "help.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Statics ---- */
+
+static Widget helpShellWid;
+static Boolean helpShellMapped=False;
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+static void removeCallback(Widget w,
+			   XtPointer client_data, XtPointer call_data)
+{    XtPopdown(helpShellWid);
+     helpShellMapped=False;
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+Widget createHelpShell(Widget toplevelWid)
+/*
+    Create the help shell as a child of `toplevelWid'.
+    The shell is not initially mapped.
+*/
+{   Widget formWid, removeWid, textWid;
+    String fn;
+    Arg args[10];
+    int nargs;
+
+    char apology[] = "Sorry, I couldn't find the \"ted.help\" file.\n\
+This should either be in \"/usr/X11/app-defaults\" or somewhere\n\
+referred to by the \"XFILESEARCHPATH\" environment variable.";
+
+
+    /*
+        The toplevel shell widget holds a form widget
+    */
+    helpShellWid = XtCreatePopupShell("Help", topLevelShellWidgetClass,
+				      toplevelWid,
+				      NULL, (Cardinal) 0);
+
+    formWid = XtCreateManagedWidget("form", formWidgetClass, helpShellWid,
+				    NULL, (Cardinal) 0);
+
+
+    nargs = 0;
+    removeWid = XtCreateManagedWidget("rmButton", commandWidgetClass,
+				      formWid, args, nargs);
+    XtAddCallback(removeWid, XtNcallback, removeCallback, NULL);
+
+
+    /*
+        Find the help file.
+    */
+    fn = XtResolvePathname(XtDisplay(toplevelWid),
+			   "app-defaults",     /* type */
+			   "ted",              /* file name */
+			   ".help",            /* suffix */
+			   NULL,               /* path */
+			   NULL, (Cardinal)0,  /* substitutions */
+			   NULL                /* file predicate */
+			  );
+
+
+    /*
+        If we found the file display it, else an apology.
+    */
+    nargs = 0;
+    if (fn != NULL)
+    {   XtSetArg(args[nargs], XtNtype, XawAsciiFile); nargs++;
+	XtSetArg(args[nargs], XtNstring, fn); nargs++;
+    }
+    else
+    {   XtSetArg(args[nargs], XtNtype, XawAsciiString); nargs++;
+	XtSetArg(args[nargs], XtNstring, apology); nargs++;
+    }
+    XtSetArg(args[nargs], XtNfromVert, removeWid); nargs++;
+    XtSetArg(args[nargs], XtNscrollVertical, XawtextScrollAlways); nargs++;
+    textWid = XtCreateManagedWidget("text", asciiTextWidgetClass,
+				    formWid, args, nargs);
+
+
+    if (fn != NULL) XtFree(fn);
+
+
+    return(helpShellWid);
+}
+
+
+
+
+void help()
+/*
+    Display help modeless dialogue.
+*/
+{   if (!helpShellMapped)
+    {   XtPopup(helpShellWid, XtGrabNone);
+	helpShellMapped = True;
+    }
+}
diff --git a/src/ted/help.h b/src/ted/help.h
new file mode 100644
index 0000000..5778471
--- /dev/null
+++ b/src/ted/help.h
@@ -0,0 +1,38 @@
+#ifndef _help_h
+#define _help_h
+
+
+/* 
+    Title:       help
+
+    File: 	 help.h
+    Purpose:	 Help
+    Last update: Fri Jun 15 1990
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Widget createHelpShell(Widget toplevelWid);
+/*
+    Create the help shell as a child of `toplevelWid'.
+    The shell is not initially mapped.
+*/
+
+
+extern void help();
+/*
+    Display help modeless dialogue.
+*/
+
+
+#endif  /*_help_h*/
diff --git a/src/ted/info.c b/src/ted/info.c
new file mode 100644
index 0000000..f264a2e
--- /dev/null
+++ b/src/ted/info.c
@@ -0,0 +1,123 @@
+/* 
+    Title:       info
+
+    File: 	 info.c
+    Purpose:	 Info
+    Last update: Tue Nov 10 1992
+*/
+
+
+/* ---- Includes ---- */
+#include <stdio.h>
+
+#include "info.h"
+#include "seq.h"
+
+#include <X11/Intrinsic.h>
+#include <X11/Shell.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/AsciiText.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+
+
+
+
+/* ---- Statics ---- */
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+static void get_info_string(Seq seq, char *seq_info)
+{
+    switch (seq->format) {
+	case ABIFormat:
+	case SCFFormat:
+	    if (seq->info != NULL) {
+		sprintf(seq_info,"%s\n",seq->info);
+		break;
+	    }
+	case ALFFormat:
+        default:
+            strcpy(seq_info,"No information for this sequence\n");
+            break;
+    }
+}
+
+/*ARGSUSED*/
+static void
+DestroyPopupPrompt(Widget widget, XtPointer client_data, XtPointer call_data)
+{
+
+    Widget popup = XtParent( (Widget) client_data);
+    XtDestroyWidget(popup);
+}
+
+
+/* ---- Exports ---- */
+
+
+/* ARGSUSED */
+void info(Widget button, Seq seq)
+{
+    Arg		args[5];
+    Widget	popup;
+    Position	x, y;
+    Dimension	width, height;
+    Cardinal	n;
+    Widget apane,apane1,apane2;
+    char seq_info[500];
+
+    if (seq == NULLSeq) return;
+    /*
+     * This will position the upper left hand corner of the popup at the
+     * center of the widget which invoked this callback, which will also
+     * become the parent of the popup. 
+     */
+
+    get_info_string(seq,seq_info);
+
+    n = 0;
+    XtSetArg(args[0], XtNwidth, &width); n++;
+    XtSetArg(args[1], XtNheight, &height); n++;
+    XtGetValues(button, args, n);
+    XtTranslateCoords(button, (Position) (width / 2), (Position) (height / 2),
+		      &x, &y);
+
+    x -= 350;
+    n = 0;
+    XtSetArg(args[n], XtNx, x);				n++;
+    XtSetArg(args[n], XtNy, y);				n++;
+
+    popup = XtCreatePopupShell("prompt", transientShellWidgetClass, button,
+			       args, n);
+
+    apane = XtCreateManagedWidget("apane", formWidgetClass, popup ,NULL, 0);    
+    apane2 = XtVaCreateManagedWidget("apane2", asciiTextWidgetClass,apane,
+				         XtNeditType, XawtextRead,
+					 XtNwidth, 400,
+				         XtNheight, 80,
+				         XtNstring, seq_info,
+				         XtNscrollVertical, XawtextScrollAlways,
+				      NULL);
+
+    apane1 = XtVaCreateManagedWidget("apane3", commandWidgetClass,apane,
+				         XtNeditType, XawtextEdit,
+					 XtNwidth, 400,
+				         XtNheight, 20,
+				         XtNfromVert, apane2,
+				         XtNlabel, "Cancel",				     
+				      NULL);
+
+    XtAddCallback(apane1, XtNcallback, DestroyPopupPrompt, (XtPointer)apane);
+    XtPopup(popup, XtGrabNone);
+    
+}
+
+
diff --git a/src/ted/info.h b/src/ted/info.h
new file mode 100644
index 0000000..9d4161d
--- /dev/null
+++ b/src/ted/info.h
@@ -0,0 +1,30 @@
+#ifndef _info_h
+#define _info_h
+
+
+/* 
+    Title:       info
+
+    File: 	 info.h
+    Purpose:	 Info
+    Last update: Fri Jun 15 1990
+*/
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+/* ---- Exports ---- */
+
+extern void info();
+/*
+    Display info modeless dialogue.
+*/
+
+
+#endif  /*_info_h*/
diff --git a/src/ted/mach-io.c b/src/ted/mach-io.c
new file mode 100644
index 0000000..9b78205
--- /dev/null
+++ b/src/ted/mach-io.c
@@ -0,0 +1,238 @@
+/*
+** Machine independant io:
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value suggessfully read or written
+*/
+
+#include <stdio.h>
+#include "mach-io.h"
+
+
+
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+int be_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a big-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[0] = (uint_1) (*i2>>8)&255;
+    buf[1] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(*i4)];
+
+
+    buf[0] = (uint_1) (*i4>>24)&255;
+    buf[1] = (uint_1) (*i4>>16)&255;
+    buf[2] = (uint_1) (*i4>>8)&255;
+    buf[3] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a big-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int be_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a big-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[1]) +
+         ((uint_2)buf[0]<<8));
+    return (1);
+}
+
+
+
+
+
+int be_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a big-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[3]) +
+         ((uint_4)buf[2]<<8) +
+         ((uint_4)buf[1]<<16) +
+         ((uint_4)buf[0]<<24));
+    return (1);
+}
+
+
+
+
+
+
+
+
+
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+int le_write_int_1(FILE *fp, uint_1 *i1)
+/*
+** Write a little-endian int_1
+*/
+{
+    if (fwrite(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_2(FILE *fp, uint_2 *i2)
+/*
+** Write a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    buf[1] = (uint_1) (*i2>>8)&255;
+    buf[0] = (uint_1) *i2&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_write_int_4(FILE *fp, uint_4 *i4)
+/*
+** Write a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    buf[3] = (uint_1) (*i4>>24)&255;
+    buf[2] = (uint_1) (*i4>>16)&255;
+    buf[1] = (uint_1) (*i4>>8)&255;
+    buf[0] = (uint_1) *i4&255;
+    if (fwrite(buf, sizeof(buf), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_1(FILE *fp, uint_1 *i1)
+/*
+** Read a little-endian int_1
+*/
+{
+    if (fread(i1, sizeof(uint_1), 1, fp) != 1) return (0);
+    return (1);
+}
+
+
+
+
+
+int le_read_int_2(FILE *fp, uint_2 *i2)
+/*
+** Read a little-endian int_2
+*/
+{
+    uint_1 buf[sizeof(int_2)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i2 =
+        (((uint_2)buf[0]) +
+         ((uint_2)buf[1]<<8));
+    return (1);
+}
+
+
+
+
+
+int le_read_int_4(FILE *fp, uint_4 *i4)
+/*
+** Read a little-endian int_4
+*/
+{
+    uint_1 buf[sizeof(int_4)];
+
+    if (fread(buf, sizeof(buf), 1, fp) != 1) return (0);
+    *i4 =
+        (((uint_4)buf[0]) +
+         ((uint_4)buf[1]<<8) +
+         ((uint_4)buf[2]<<16) +
+         ((uint_4)buf[3]<<24));
+    return (1);
+}
diff --git a/src/ted/mach-io.h b/src/ted/mach-io.h
new file mode 100644
index 0000000..5783df3
--- /dev/null
+++ b/src/ted/mach-io.h
@@ -0,0 +1,115 @@
+#ifndef _mach_io_h
+#define _mach_io_h
+/*
+** Machine independant io
+** For reading and writing to big-endian and little-endian files
+**
+** Routines available:
+**     be_write_int_1()
+**     be_write_int_2()
+**     be_write_int_4()
+**     be_read_int_1()
+**     be_read_int_2()
+**     be_read_int_4()
+**     le_write_int_1()
+**     le_write_int_2()
+**     le_write_int_4()
+**     le_read_int_1()
+**     le_read_int_2()
+**     le_read_int_4()
+**
+** All routine return:
+**    0 - an error has occurred during io operation
+**    1 - value successfully read or written
+*/
+
+#include <stdio.h>
+
+typedef char		int_1;
+typedef short		int_2;
+typedef int		int_4;
+typedef unsigned char	uint_1;
+typedef unsigned short	uint_2;
+typedef unsigned int	uint_4;
+
+/**********************************************************************/
+/* IO for big-endian files                                            */
+/**********************************************************************/
+
+extern int be_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a big-endian int_1
+*/
+
+extern
+int be_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a big-endian int_2
+*/
+
+extern
+int be_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a big-endian int_4
+*/
+
+extern
+int be_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a big-endian int_1
+*/
+
+extern
+int be_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a big-endian int_2
+*/
+
+extern
+int be_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a big-endian int_4
+*/
+
+/**********************************************************************/
+/* IO for little-endian files                                         */
+/**********************************************************************/
+
+extern
+int le_write_int_1(FILE *fp, uint_1 *i1);
+/*
+** Write a little-endian int_1
+*/
+
+extern
+int le_write_int_2(FILE *fp, uint_2 *i2);
+/*
+** Write a little-endian int_2
+*/
+
+extern
+int le_write_int_4(FILE *fp, uint_4 *i4);
+/*
+** Write a little-endian int_4
+*/
+
+extern
+int le_read_int_1(FILE *fp, uint_1 *i1);
+/*
+** Read a little-endian int_1
+*/
+
+extern
+int le_read_int_2(FILE *fp, uint_2 *i2);
+/*
+** Read a little-endian int_2
+*/
+
+extern
+int le_read_int_4(FILE *fp, uint_4 *i4);
+/*
+** Read a little-endian int_4
+*/
+
+
+#endif /* _mach_io_h */
diff --git a/src/ted/makefile-alpha b/src/ted/makefile-alpha
new file mode 100644
index 0000000..e571a12
--- /dev/null
+++ b/src/ted/makefile-alpha
@@ -0,0 +1,89 @@
+#
+# Makefile for ted (trace editor) - Alpha fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+SWITCHES = -DAUTO_CLIP
+
+#
+# Programs made by this makefile
+#
+PROGS = ted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= cc
+INCLUDES = -I$(SRC)
+CFLAGS= -g -std1 -ieee_with_inexact $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11 -ldnet_stub
+#
+LD= cc
+LDFLAGS= -g -non_shared
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/ted/makefile-dec b/src/ted/makefile-dec
new file mode 100644
index 0000000..a3bc627
--- /dev/null
+++ b/src/ted/makefile-dec
@@ -0,0 +1,92 @@
+#
+# Makefile for ted (trace editor) - DEC Ultrix
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+SWITCHES = -DAUTO_CLIP
+
+#
+# Programs made by this makefile
+#
+PROGS = ted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= c89
+INCLUDES = -I$(SRC) -I/usr/include/mit
+CFLAGS= -g -common $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+# If you have gcc-2.x, should link with it
+LD= c89
+LDFLAGS= -g
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/ted/makefile-sgi b/src/ted/makefile-sgi
new file mode 100644
index 0000000..86516ef
--- /dev/null
+++ b/src/ted/makefile-sgi
@@ -0,0 +1,91 @@
+#
+# Makefile for ted (trace editor) - SGI fork
+#
+
+SHELL=/bin/sh
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+SWITCHES = -DAUTO_CLIP
+
+#
+# Programs made by this makefile
+#
+PROGS = ted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= cc
+INCLUDES = -I$(SRC)
+CFLAGS= -g -ansi -prototypes $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+LD= cc
+LDFLAGS= -g
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/ted/makefile-solaris b/src/ted/makefile-solaris
new file mode 100644
index 0000000..76aea9c
--- /dev/null
+++ b/src/ted/makefile-solaris
@@ -0,0 +1,93 @@
+#
+# Makefile for ted (trace editor) - SunOS fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+SWITCHES = -DAUTO_CLIP
+
+#
+# Programs made by this makefile
+#
+PROGS = ted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -Wformat
+INCLUDES = -I$(SRC) -I/usr/openwin/include
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11 -lm
+#
+# If you have gcc-2.x, should link with it
+LD= gcc
+LDFLAGS= -g # -static # causes major dl problems
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/ted/makefile-sun b/src/ted/makefile-sun
new file mode 100644
index 0000000..c751fd5
--- /dev/null
+++ b/src/ted/makefile-sun
@@ -0,0 +1,94 @@
+#
+# Makefile for ted (trace editor) - SunOS fork
+#
+
+#
+#Turning on the AUTO_CLIP switch allows ted to automatically
+#select a left (using the -enzyme option or the .enzyme in the
+#Xted file) and right cutoff on your sequence (using 2 out of
+#5 N's)
+#
+# Turning on the SAVE_EDITS switch allows the user to maintain
+# copies of their edits, i.e. when you open up ted on a file
+# that you have already edited, the old edits show up in the
+# ted edit window.  The user may also call up any of their
+# past editing sessions.  
+#
+#Turning on the DEF_OUT  switch makes
+# trace_name.seq the default output file name
+#
+#SWITCHES = -DAUTO_CLIP  -DDEF_OUT  -DSAVE_EDITS
+SWITCHES = -DAUTO_CLIP
+
+#
+# Programs made by this makefile
+#
+PROGS = ted
+
+#
+# Where the wild things are
+#
+SRC = .
+
+#
+# Compiler - debug on
+#
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+INCLUDES = -I$(SRC)
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(SWITCHES) $(INCLUDES)
+LIBS= -lXaw -lXt -lXmu -lXext -lX11
+#
+# If you have gcc-2.x, should link with it
+LD= gcc
+LDFLAGS= -g
+# cc -Bstatic, gcc2.x -static, otherwise nothing
+LDLIBS= -static $(LIBS)
+
+#
+# Object files
+#
+TEDOBJS=\
+	ted.o\
+	dialogues.o\
+	seq.o\
+	seqIOPlain.o\
+	seqIOABI.o\
+	help.o\
+	display.o\
+	traceDisplay.o\
+	basesDisplay.o\
+	Graph.o\
+	seqIOEdit.o\
+	seqIOALF.o\
+	seqIOSCF.o\
+	seqOutput.o\
+	opp.o\
+	info.o\
+	fpoint.o\
+	mach-io.o\
+	seqRead.o
+
+
+#
+# Linking
+#
+ted: $(TEDOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(TEDOBJS) $(LDLIBS)
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o : $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+#
+# Special dependencies
+#
+clean:
+	rm -f ted $(TEDOBJS) *~ *.aux *.log *.bbl *.blg *.dvi *.ps
+
+all: $(PROGS)
+
diff --git a/src/ted/mystdlib.h b/src/ted/mystdlib.h
new file mode 100644
index 0000000..e92c64b
--- /dev/null
+++ b/src/ted/mystdlib.h
@@ -0,0 +1,33 @@
+/*
+    Title:       stdlib
+
+    File: 	 mystdlib.h
+    Purpose:	 ANSII C stdlib.h replacement
+    Last update: Thu May 24 1990
+*/
+
+
+/*
+    The SUN does not provide the ANSII C stdlib.h header file, so
+    so this file provides some of the definitions which should be
+    found there.
+
+    Note that we cannot override existing type definitions.
+*/
+
+
+
+
+/* String processing functions */
+extern double strtod(char *str, char **ptr);
+extern double atof(char *str);
+extern long   strtol(char *str, char **ptr, int base);
+
+
+/* Control functions */
+extern void exit(int status);
+
+
+/* Storage allocation */
+extern char *malloc(unsigned size);
+extern char *calloc(unsigned elt_count, unsigned elt_size);
diff --git a/src/ted/opp.c b/src/ted/opp.c
new file mode 100644
index 0000000..7fe9809
--- /dev/null
+++ b/src/ted/opp.c
@@ -0,0 +1,128 @@
+/*
+  Title:  opp.c
+
+  File:   opp.c
+  Purpose: code for complementing sequences
+
+  Last update: Tue Jan 15 1991
+
+  15.01.90 SD  Taken from seqIOEdit.c
+*/
+
+
+/* ---- Imports ---- */
+#include "opp.h"
+#include "seq.h"
+
+
+/* ---- Globals ---- */
+char opp[256]; /* complement of any given base */
+
+/* ---- Exports ---- */
+void oppInitialize()
+{ int i;
+
+  for (i = 0; i<256; i++) opp[i]='-';  
+
+	/* RMD 31/12/90 'N' -> '-' above.
+	   removed 'N' and 'n' entries below and added reciprocal
+	   'K' and 'N' entries as for full Staden table */
+
+  opp['A'] = 'T';
+  opp['G'] = 'C';
+  opp['T'] = 'A';
+  opp['C'] = 'G';
+  opp['a'] = 't';
+  opp['g'] = 'c';
+  opp['t'] = 'a';
+  opp['c'] = 'g';
+  opp['D'] = 'H';
+  opp['H'] = 'D';
+  opp['V'] = 'B';
+  opp['B'] = 'V';
+  opp['K'] = 'N';
+  opp['N'] = 'K';
+  opp['L'] = 'M';
+  opp['M'] = 'L';
+  opp['5'] = '6';
+  opp['6'] = '5';
+  opp['R'] = 'Y';
+  opp['Y'] = 'R';
+  opp['7'] = '7';
+  opp['8'] = '8';
+}
+
+
+void complement_seq(Seq seq)
+/*
+** Complement and reverse bases and traces
+*/
+{
+    int2 temp_int2;
+    TRACE *temp_TRACEptr;
+    char temp_char;
+    int temp_int;
+    int i;
+
+    /* swap */
+#define swap(A,B,I) ( (I)=(A), (A)=(B), (B)=(I) )
+
+    /* complement and reverse traces */
+    if (seq->mode == BasesAndTraces) {
+	/* swap traces A<->T and C<->G */
+	swap(seq->traceA,seq->traceT,temp_TRACEptr);
+	swap(seq->traceC,seq->traceG,temp_TRACEptr);
+	
+	/* reverse points in traces */
+	for (i=0;i<seq->NPoints/2;i++) {
+	    swap(seq->traceA[i],seq->traceA[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceC[i],seq->traceC[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceG[i],seq->traceG[seq->NPoints-i-1],temp_int2);
+	    swap(seq->traceT[i],seq->traceT[seq->NPoints-i-1],temp_int2);
+	}
+    }
+    /* complement the sequence */
+    /* handle edited base */
+    for (i=0;i<seq->NedBases;i++) {
+	if (seq->edits[i]<0) {
+	    int k = (-seq->edits[i]);
+	    seq->edBase[k] = opp[seq->edBase[k]];
+	    seq->edBasePos[k] = seq->NPoints - seq->edBasePos[k] - 1;
+	} else {
+	    int k = seq->edits[i];
+	    seq->edits[i] = seq->NorigBases-k-1;
+	    /*
+	     * skip complement and mod of base position until later
+	     * because not every orig base is necessarily represented in the
+	     * edits array
+	     */
+	}
+    }
+
+    /* handle original bases */
+    for (i=0;i<seq->NorigBases;i++) {
+	seq->base[i] = opp[seq->base[i]];
+	seq->basePos[i] = seq->NPoints - seq->basePos[i] - 1;
+    }
+
+    /* reverse sequence */
+    /* edits */
+    for (i=0;i<seq->NedBases/2;i++) {
+	swap(seq->edits[i],seq->edits[seq->NedBases-i-1],temp_int2);
+    }
+    /* orig bases */
+    for (i=0;i<seq->NorigBases/2;i++) {
+	swap(seq->base[i],seq->base[seq->NorigBases-i-1],temp_char);
+	swap(seq->basePos[i],seq->basePos[seq->NorigBases-i-1],temp_int2);
+    }
+
+
+    /* swap cutoffs */
+    swap(seq->leftCutoff,seq->rightCutoff,temp_int);
+
+    /* move caret ??? */
+
+    /* toggle strand */
+    seq->bottom = !seq->bottom;
+}
+
diff --git a/src/ted/opp.h b/src/ted/opp.h
new file mode 100644
index 0000000..0852a95
--- /dev/null
+++ b/src/ted/opp.h
@@ -0,0 +1,37 @@
+#ifndef _opp_h
+#define _opp_h
+
+/*
+  Title:       opp
+
+  File:        opp.h
+  Purpose:     Required for complimenting a sequence
+  Last update: Tue Jan 15 1991
+
+  15.01.90 SD  Taken from seqIOEdit.h
+
+*/
+
+#include "seq.h"
+
+/* ---- Exports ---- */
+
+extern char opp[256]; /* complement of any given base */
+
+extern void oppInitialize();
+
+/* initializes the array which stores the complement 
+   of any of the Staden nucleotides or ambiguity
+   codes */
+
+
+void complement_seq(Seq seq);
+
+/* complement a sequence */
+
+#endif  /*_opp_h*/
+
+
+
+
+
diff --git a/src/ted/scf.h b/src/ted/scf.h
new file mode 100644
index 0000000..adde21d
--- /dev/null
+++ b/src/ted/scf.h
@@ -0,0 +1,89 @@
+/*
+ * File: scf.h
+ * Version:
+ *
+ * Author: Simon Dear
+ *         MRC Laboratory of Molecular Biology
+ *	   Hills Road
+ *	   Cambridge CB2 2QH
+ *	   United Kingdom
+ *
+ * Description: file structure definitions for SCF file
+ *
+ * Created: 19 November 1992
+ * Updated:
+ *
+ */
+
+#ifndef _SCF_H_
+#define _SCF_H_
+
+#include "mach-io.h"
+
+#define SCF_MAGIC (((((uint_4)'.'<<8)+(uint_4)'s'<<8)+(uint_4)'c'<<8)+(uint_4)'f')
+
+#define VERSION 2.00
+
+/*
+ * Type definition for the Header structure
+ */
+typedef struct {
+    uint_4 magic_number;       /* SCF_MAGIC */
+    uint_4 samples;            /* Number of elements in Samples matrix */
+    uint_4 samples_offset;     /* Byte offset from start of file */
+    uint_4 bases;              /* Number of bases in Bases matrix */
+    uint_4 bases_left_clip;    /* Number of bases in left clip (vector)*/
+    uint_4 bases_right_clip;   /* Number of bases in right clip (unreliable) */
+    uint_4 bases_offset;       /* Byte offset from start of file */
+    uint_4 comments_size;      /* Number of bytes in Comment section */
+    uint_4 comments_offset;    /* Byte offset from start of file */
+    char version[4];	     /* "version.revision" */
+    uint_4 sample_size;	     /* precision of samples (in bytes) */
+    uint_4 code_set;	     /* uncertainty codes used */
+    uint_4 spare[20];          /* Unused */
+} Header;
+
+#define CSET_DEFAULT 0  /* {A,C,G,T,-} */
+#define CSET_STADEN  1
+#define CSET_NC_IUB  2
+#define CSET_ALF     3  /* extended NC_IUB */
+#define CSET_ABI     4  /* {A,C,G,T,N} */
+
+/*
+ * Type definition for the Sample data
+ */
+typedef unsigned char byte;
+typedef struct {
+    byte sample_A;           /* Sample for A trace */
+    byte sample_C;           /* Sample for C trace */
+    byte sample_G;           /* Sample for G trace */
+    byte sample_T;           /* Sample for T trace */
+} Samples1;
+typedef struct {
+    unsigned short sample_A;           /* Sample for A trace */
+    unsigned short sample_C;           /* Sample for C trace */
+    unsigned short sample_G;           /* Sample for G trace */
+    unsigned short sample_T;           /* Sample for T trace */
+} Samples2;
+
+/*
+ * Type definition for the sequence data
+ */
+typedef struct {
+    uint_4 peak_index;        /* Index into Samples matrix for base position */
+    byte prob_A;            /* Probability of it being an A */
+    byte prob_C;            /* Probability of it being an C */
+    byte prob_G;            /* Probability of it being an G */
+    byte prob_T;            /* Probability of it being an T */
+    char base;		    /* Base called */
+    byte spare[3];          /* Spare */
+} Bases;
+
+
+/*
+ * Type definition for the comments
+ */
+typedef char Comments;            /* Zero terminated list of \n separated entries */
+
+
+#endif /*_SCF_H_*/
diff --git a/src/ted/seq.c b/src/ted/seq.c
new file mode 100644
index 0000000..87a7a96
--- /dev/null
+++ b/src/ted/seq.c
@@ -0,0 +1,544 @@
+/* 
+    Title:       seq
+
+    File: 	 seq.c
+    Purpose:	 Sequence data type
+    Last update: Tue Nov 10 1992
+*/
+
+
+/*
+    The Seq data type is designed so that it can hold a varying degree
+    of information about sequences, yet have a single set of calls
+    to access the data.
+
+    The edited sequence has a limit of MaxEdits changes relative to
+    the original sequence. If entry M in the edited sequence is the
+    same as entry N in the original sequence then edits[M]=N. If
+    entry M is changed, then its value is negative and is a reference
+    into the newBase and newBasePos arrays (thus position 0 in these
+    arrays is unused). Initially, all the entries in newBasePos are
+    set to NULLPoint. The representation can be summarised thus
+    (where => is logical implication):
+
+        getNBases(seq, EdBases) = seq->NedBases
+
+        getBase(seq, EdBases, n) =
+	    (seq->edits[n] >= 0) => seq->base[seq->edits[n]]
+	   ~(seq->edits[n] >= 0) => seq->edBase[-(seq->edits[n])]
+
+	baseNumToPoint(seq, EdBases, n) =
+	    (seq->edits[n] >= 0) => seq->basePos[seq->edits[n]]
+	   ~(seq->edits[n] >= 0) => seq->edBasePos[-(seq->edits[n])]
+
+    The main problem with this representation is that an insertion
+    (or deletion) requires, on average, copying of half of the edits
+    array up (or down). Given that we have ~1000 bases this probably
+    OK.
+
+    A problem which has not been considered well enough is how we deal
+    with several bases located at the same point.
+
+    There are plenty of assumptions around that both the number of
+    bases and the number of points will fit into an int2, a short.
+
+    The only changes made by lfw to this module is the addition of
+    two subroutines: isBottom, setBottom.  
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"      
+
+#include <stdlib.h>
+/* #include "mystdlib.h" *//* IMPORT: malloc, calloc */
+
+
+
+
+/* ---- Exports ---- */
+    
+
+
+
+/* ---- General sequence manipulation ---- */
+
+
+
+
+Seq newSeq(Mode mode, int2 numPoints, int2 numBases)
+/*
+    Allocate a new sequence, with the given sizes.
+    The sequence is initially clean with no bases cutoff.
+    If `mode' is BasesOnly, `numPoints' is ignored.
+    Returns NULLSeq on failure.
+*/
+{   Seq seq = NULLSeq;
+    size_t int2Size = sizeof(int2);
+
+
+    /* Allocate the body of the sequence */
+    if ((seq = (Seq) malloc(sizeof(SeqData))) == NULL) return(NULLSeq);
+
+    /*   
+        Initialise the body, all pointers are set to NULL so we can
+        happily call `freeSeq'
+    */
+    seq->dirty = False;
+    seq->leftCutoff  = 0;
+    seq->rightCutoff = 0;
+    seq->bottom = False;
+
+    seq->traceC    = NULL;
+    seq->traceA    = NULL;
+    seq->traceG    = NULL;
+    seq->traceT    = NULL;
+
+    seq->base      = NULL;
+    seq->basePos   = NULL;
+
+    seq->edits     = NULL;
+    seq->edBase    = NULL;
+    seq->edBasePos = NULL;
+
+    seq->info = NULL;
+
+    /* Allocate space for the bases */
+    if (((seq->base     =        malloc(numBases))                   == NULL)||
+        ((seq->basePos  =(int2 *)calloc(numBases,int2Size))          == NULL)||
+	
+	((seq->edits    =(int2 *)calloc(numBases+MaxEdits,int2Size)) == NULL)||
+	((seq->edBase   =        malloc(MaxEdits))                   == NULL)||
+        ((seq->edBasePos=(int2 *)calloc(MaxEdits,int2Size))          == NULL))
+    {
+	freeSeq(seq);
+	return(NULLSeq);
+    }
+
+    /* If needed, allocate space for the traces */
+    if (mode == BasesAndTraces)
+	if (((seq->traceC   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceA   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceG   =(TRACE *)calloc(numPoints,int2Size))         == NULL)||
+	    ((seq->traceT   =(TRACE *)calloc(numPoints,int2Size))         == NULL))
+	    {
+		freeSeq(seq);
+		return(NULLSeq);
+	    }
+    
+    return(seq);
+}
+
+
+
+
+void freeSeq(Seq seq)
+/*
+    Free a sequence created by one of the above routines.
+    
+    ANSI C is happy to free a NULL pointer.
+*/
+{   if (seq == NULLSeq) return;
+
+    if (seq->traceC != NULL) free(seq->traceC);
+    if (seq->traceA != NULL) free(seq->traceA);
+    if (seq->traceG != NULL) free(seq->traceG);
+    if (seq->traceT != NULL) free(seq->traceT);
+
+    if (seq->base    != NULL) free(seq->base);
+    if (seq->basePos != NULL) free(seq->basePos);
+
+    if (seq->edits     != NULL) free(seq->edits);
+    if (seq->edBase    != NULL) free(seq->edBase);
+    if (seq->edBasePos != NULL) free(seq->edBasePos);
+
+    if (seq->info != NULL) free(seq->info);
+
+    free(seq);
+}
+
+
+
+
+/* ---- General sequence information ---- */
+
+
+
+
+Boolean isDirty(Seq seq)
+/*
+    Indicate if the sequence has been edited.
+*/
+{   return(seq->dirty);
+}
+
+
+
+
+void setDirty(Seq seq, Boolean b)
+/*
+    Set the sequence to the state of cleanliness indicated by `b'.
+*/
+{   seq->dirty = b;
+}
+
+/* isBottom and setBottom were added by lfw when the new element
+was added to seq. seq->bottom */
+
+
+Boolean isBottom(Seq seq)
+/*
+    Indicate if the sequence we're working with is bottom strand
+*/
+{   return(seq->bottom);
+}
+
+
+
+
+void setBottom(Seq seq, Boolean b)
+/*
+    Set the sequence to the state bottom if b is true and
+    top if b is false 
+*/
+{   seq->bottom = b;
+}
+
+
+
+
+Boolean isBasesOnly(Seq seq)
+/*
+    Indicate if the sequence only contains bases, or if it
+    contains bases and traces.
+*/
+{   return((seq->mode)==BasesOnly);
+}
+
+
+
+
+/* ---- Trace information ---- */
+
+
+
+
+int getNPoints(Seq seq)
+/*
+    Return the number of points in this sequence.
+*/
+{   return(seq->NPoints);
+}
+
+
+
+
+void getTraces(Seq seq, char base, int p0, int pN, int *traces)
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+{   int p;
+    int i;
+    TRACE *t;
+
+    switch (base)
+    {   case 'C': t = seq->traceC; break;
+        case 'A': t = seq->traceA; break;
+        case 'G': t = seq->traceG; break;
+        case 'T': t = seq->traceT; break;
+    }
+
+    p = p0;
+    i = 0;
+    while (p<=pN)
+    {   traces[i] = t[p];
+	p++;
+	i++;
+    }
+}
+
+
+
+
+TRACE getMaxTraceVal(Seq seq)
+/*
+    Return the maximum value from anywhere in any of the traces
+    from the sequence `seq'.
+*/
+{   return(seq->maxTraceVal);
+}
+
+
+
+
+/* ---- Base information ---- */
+
+
+
+
+int getNBases(Seq seq, WhichBases which)
+/*
+    Return the number of bases in the `which' part of `seq'
+*/
+{   return((which == OrigBases) ? seq->NorigBases : seq->NedBases);
+}
+
+
+void getCutoffs(Seq seq, int *leftCutoff, int *rightCutoff)
+/*
+    Return the number of characters to be cut off at either end.
+*/
+{   *leftCutoff  = seq->leftCutoff;
+    *rightCutoff = seq->rightCutoff;
+}
+
+
+
+char getBase(Seq seq, WhichBases which, int baseNum)
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base found there.
+*/
+{   return((which == OrigBases)
+               ? (seq->base[baseNum])
+               : (seq->edits[baseNum] >= 0)
+	           ? seq->base[seq->edits[baseNum]]
+	           : seq->edBase[-(seq->edits[baseNum])]
+	  );
+}
+
+
+int baseNumToPoint(Seq seq, WhichBases which, int baseNum)
+/*
+   Given `baseNum' as the number of a base in `which' part of
+   sequence `seq', return its point location. NULLPoint is
+   returned if `baseNum' was invalid.
+*/
+{    return((which == OrigBases)
+
+                ? /* Original sequence */
+	          (baseNum<0 || baseNum>seq->NorigBases-1)
+                      ? NULLPoint
+                      : seq->basePos[baseNum]
+
+	        : /* Edited sequence */
+	          (baseNum<0 || baseNum>seq->NedBases-1)
+                      ? NULLPoint
+	              : (seq->edits[baseNum] >= 0)
+                            ? seq->basePos[seq->edits[baseNum]]
+	                    : seq->edBasePos[-(seq->edits[baseNum])]
+	   );
+	    
+}
+
+
+
+
+int pointToBaseNum(Seq seq, WhichBases which, int point)
+/*
+    Given `point' as a point in the `which' part of sequence `seq',
+    return the base number of the next base to the right (or on
+    the point), or NULLBaseNum if there isn't one.
+*/
+{   int l,r,N,m;
+
+    if (point<0 || point>seq->NPoints-1) return(NULLBaseNum);
+
+    /*
+         Let's use a groovy binary search.
+	 Roughly, we wish to find an m (0..NBases-1) such that:
+	     baseNumToPoint(m]>=point & baseNumToPoint(m-1]<point
+
+	 Let l and r indicate two sections of the array, LS and RS. We
+	 have the following predicates:
+	 NotInLS   == 0<=j<l: baseNumToPoint(j)<point
+	 MaybeInRS == r<=k<N: baseNumToPoint(k)>=point
+
+	 Maintaining these two, we adjust l and r (using a binary
+	 chop) such that the complete array is spanned by these
+	 sections.
+    */
+
+    /* Start off with the two sections empty */
+    N = (which == OrigBases) ? seq->NorigBases : seq->NedBases;
+    l=0; r=N; /* NotinLS & MaybeInRS */
+
+    while (l<r)
+    {   /* NotinLS & MaybeInRS & l<r */
+        m=(l+r)/2; /* l<=m<r */
+	if (baseNumToPoint(seq,which,m)<point)
+	    l=m+1; /* NotinLS & MaybeInRS & l<=r */
+	else
+	    r=m;   /* NotinLS & MaybeInRS & l<=r */
+    }
+
+    /* 
+        NotinLS & MaybeInRS & l=r
+        ==
+	0<=j<l=r<=k<N: baseNumToPoint(j)<point & baseNumToPoint(k)>=point
+    */
+    if (r==N)
+        return(NULLBaseNum);
+    else
+        return(r);
+}
+
+
+
+
+/* ---- Base editing ---- */
+
+
+
+
+Boolean deleteBase(Seq seq, int baseNum)
+/*
+    Delete the base `baseNum' from the sequence `seq'.
+*/
+{   int i;
+
+    /* Fail if base number bad */
+    if ((baseNum<0) || (baseNum>(seq->NedBases-1))) return(False);
+
+    /* Fail if in a cutoff region */
+    if ((baseNum <= seq->leftCutoff-1) ||
+        (baseNum > seq->NedBases-1-seq->rightCutoff)) return(False);
+
+    seq->dirty = True;
+
+    /* If this was a base we added, then free its slot */
+    if (seq->edits[baseNum]<0) 
+        seq->edBasePos[-(seq->edits[baseNum])] = NULLPoint;
+
+    /* Shuffle all the entries in `edits' down one. */
+    for (i=baseNum+1;i<seq->NedBases;i++)
+        seq->edits[i-1] = seq->edits[i];
+
+    seq->NedBases--;
+    return(True);        
+}
+
+
+
+
+Boolean insertBase(Seq seq, char base, int baseNum)
+/*
+    Insert the base `base' to the right of `baseNum'. Its position
+    is halfway between its neighbours, except when it would be
+    replacing a deleted base from the original sequence, in which
+    case it is placed there.
+
+    If `baseNum' equals -1 `base' is added to the head of the
+    sequence.
+*/
+{   int i;
+    int prevBasePos, nextBasePos;
+    int origBaseNum, origBasePos;
+
+
+    /* Fail if bad base number */
+    if ((baseNum<-1) || (baseNum>(seq->NedBases-1))) return(False);
+
+    /* Fail if in a cutoff region */
+    if ((baseNum < seq->leftCutoff-1) ||
+        (baseNum > seq->NedBases-1-seq->rightCutoff)) return(False);
+
+    seq->dirty = True;
+
+
+    /* 
+        Before we alter anything, find what will become the
+	surrounding positions
+    */
+    prevBasePos = (baseNum == -1)
+                  ? -1
+		  : baseNumToPoint(seq, EdBases, baseNum);
+    nextBasePos = (baseNum == (seq->NedBases-1))
+                  ? seq->NPoints-1
+		  : baseNumToPoint(seq, EdBases, baseNum+1);
+
+
+    /*
+        Shuffle all the entries in `edits' up one
+    */
+    for (i=seq->NedBases-1;i>baseNum;i--)
+        seq->edits[i+1] = seq->edits[i];
+    seq->NedBases++;
+
+
+    /*
+        Look in the original sequence to see if there is a base
+	which lies between prevBasePos and nextBasePos.
+    */
+    origBaseNum = pointToBaseNum(seq, OrigBases, prevBasePos+1);
+    origBasePos = baseNumToPoint(seq, OrigBases, origBaseNum);
+
+    if ((origBaseNum!=NULLBaseNum) &&
+	(prevBasePos<origBasePos)  &&
+	(origBasePos<nextBasePos)
+       )
+    {   char origBase = getBase(seq, OrigBases, origBaseNum);
+	
+	if (origBase == base)
+	{   /* We are reintroducing an existing base */
+	    seq->edits[baseNum+1] = origBaseNum;
+	}
+	else
+	{   int slot;
+
+	    /* We can consider this a replacement, at the same position */
+
+	    /* Find a free slot in the newBasePos (and thus newBase) table */
+	    for (slot=1;
+		 (slot<MaxEdits)&&(seq->edBasePos[slot]!=NULLPoint);
+		 slot++)
+	        if (slot==MaxEdits) /* All entries used */ return(False);
+
+	    /* Fill in the edBasePos and edBase entries */
+	    seq->edBasePos[slot] = origBasePos;
+	    seq->edBase[slot] = base;
+
+	    /* Put in the (negative) reference to this slot */
+	    seq->edits[baseNum+1] = -slot;
+	}
+    }
+    else
+    {   int slot;
+
+	/* Find a free slot in the newBasePos (and thus newBase) table */
+        for (slot=1;(slot<MaxEdits)&&(seq->edBasePos[slot]!=NULLPoint);slot++)
+	if (slot==MaxEdits) /* All entries used */ return(False);
+
+	/* Fill in the edBasePos and edBase entries */
+	seq->edBasePos[slot] = (prevBasePos + nextBasePos) / 2;
+	seq->edBase[slot] = base;
+
+	/* Put in the (negative) reference to this slot */
+	seq->edits[baseNum+1] = -slot;
+    }
+    
+
+    return(True);        
+}
+
+
+
+
+Boolean setCutoffs(Seq seq, int leftCutoff, int rightCutoff)
+/*
+    Set the cutoffs to be the indicated number of characters from
+    either end of the sequence. The result indicates success.
+*/
+{   if ((leftCutoff+rightCutoff) > seq->NedBases)
+        return(False);
+    else
+    {   seq->leftCutoff  = leftCutoff;
+	seq->rightCutoff = rightCutoff;
+	return(True);
+    }
+}
diff --git a/src/ted/seq.h b/src/ted/seq.h
new file mode 100644
index 0000000..9892bba
--- /dev/null
+++ b/src/ted/seq.h
@@ -0,0 +1,282 @@
+#ifndef _seq_h
+#define _seq_h
+
+
+/* 
+    Title:       seq
+
+    File: 	 seq.h
+    Purpose:	 Sequence data type
+    Last update: Tue Nov 10 1992
+*/
+
+
+/*
+    This module encodes the `Seq' sequence data structure.
+
+    A `Seq' contains information about bases and traces which are layed
+    out along a single dimension of points. The number of points in a
+    paricular sequence is given by `getNPoints', and these are numbered
+    0..getNPoints-1. At each point there are four trace readings, one
+    for each base.
+
+    There are two sequences of bases: the `original', and the
+    `editable'. There have `getNorigBases' and `getNedBases' respectively,
+    which are numbered 0..N-1. Bases are represented by `char's. Every
+    base is located at a particular point.
+
+    When written out, the list of bases is trimmed by a left and a right
+    cutoff, which indicate the number of bases to remove from either
+    end. Editing cannot be performed in the cutoff regions.
+
+    The behaviour of these routines is undefined if given NULLSeq or
+    an undefined sequence.
+
+    SD. Added information field to seq data structure
+
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget */
+
+
+
+
+
+/* ---- Private ---- */
+
+#define MaxEdits 200
+
+
+typedef short int2; /* Two byte integers  */
+typedef int   int4; /* Four byte integers */
+typedef unsigned short TRACE; /* for trace heights */
+
+#define MaxInt2 (32767) /* An estimate of the lower bound */
+
+
+typedef enum
+{   BasesOnly,
+    BasesAndTraces
+} Mode;
+
+typedef enum
+{
+    ABIFormat,
+    ALFFormat,
+    PlainFormat,
+    SCFFormat
+} Format;
+
+typedef struct
+{   Mode        mode;
+    Boolean     dirty; /* Have any changes been made? */
+    Boolean     bottom; /* are we looking at the top or
+			   bottom strand? */
+    Format	format;
+
+    int         NPoints;    /* No. of points of data */
+    int         NorigBases; /* No. of bases in the original sequence */
+    int         NedBases;   /* No. of bases in the edited sequence */
+
+    /* Traces */
+    TRACE      *traceC;      /* Array of length `NPoints' */
+    TRACE      *traceA;      /* Array of length `NPoints' */
+    TRACE      *traceG;      /* Array of length `NPoints' */
+    TRACE      *traceT;      /* Array of length `NPoints' */
+    TRACE       maxTraceVal; /* The maximal value in any trace */
+
+    /* Original bases */
+    char       *base;    /* Array of length `NorigBases' */
+    int2       *basePos; /* Array of length `NorigBases' */
+
+    /* Edited bases */
+    int2       *edits;     /* Array of length `NorigBases+MaxEdits' */
+    char       *edBase;    /* Array of length `MaxEdits' */
+    int2       *edBasePos; /* Array of length `MaxEdits' */
+
+    /* Cutoffs */
+    int leftCutoff;  /* Number of unwanted bases */
+    int rightCutoff; /* Number of unwanted bases */
+
+    /* Miscellaneous Sequence Information */
+    char *info; /* misc seq info */
+
+} SeqData, *Seq;
+
+
+extern Seq newSeq(Mode mode, int2 numPoints, int2 numBases);
+/*
+    Allocate a new sequence, with the given sizes.
+    The sequence is initially clean with no bases cutoff.
+    If `mode' is BasesOnly, `numPoints' is ignored.
+    Returns NULLSeq on failure.
+*/
+
+
+
+
+/* ---- Exports ---- */
+
+
+#define NULLSeq     ((Seq)NULL)
+#define NULLBaseNum (-1)
+#define NULLPoint   (-1)
+
+
+typedef enum
+{   OrigBases,
+    EdBases
+} WhichBases;
+    
+
+
+
+/* ---- General sequence manipulation ---- */
+
+
+extern void freeSeq(Seq seq);
+/*
+    Free a sequence.
+    Does nothing if given NULLSeq.
+*/
+
+
+/* ---- General sequence information ---- */
+
+
+extern Boolean isDirty(Seq seq);
+/*
+    Indicate if the sequence has been edited since last set clean.
+*/
+
+
+extern void setDirty(Seq seq, Boolean b);
+/*
+    Set the sequence to the state of cleanliness indicated by `b'.
+*/
+
+extern Boolean isBottom(Seq seq);
+/*
+    Indicate if we're looking at bottom strand 
+*/
+
+
+extern void setBottom(Seq seq, Boolean b);
+/*
+  Set the sequence to b - true if bottom, false if top strand
+*/
+
+
+extern Boolean isBasesOnly(Seq seq);
+/*
+    Indicate if the sequence only contains bases, or if it
+    contains bases and traces.
+*/
+
+
+/* ---- Trace information ---- */
+
+
+extern int getNPoints(Seq seq);
+/*
+    Return the number of points in this sequence.
+*/
+
+
+extern void getTraces(Seq seq, char base, int p0, int pN, int *traces);
+/*
+    For basetype `base' in the sequence `seq', return in consecutive
+    elements of `traces' (an array of int) the values at the points
+    between and including p0 to pN, where p0<=pN.
+*/
+
+
+extern TRACE getMaxTraceVal(Seq seq);
+/*
+    Return the maximum value from anywhere in any of the traces
+    from the sequence `seq'.
+*/
+
+
+/* ---- Base information ---- */
+
+
+extern int getNBases(Seq seq, WhichBases which);
+/*
+    Return the number of bases in the `which' part of `seq'
+*/
+
+
+extern void getCutoffs(Seq seq, int *leftCutoff, int *rightCutoff);
+/*
+    Return the number of characters to be cut off at either end.
+*/
+
+
+extern char getBase(Seq seq, WhichBases which, int baseNum);
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base found there.
+*/
+
+/*extern int getBasePos(Seq seq, WhichBases which, int baseNum);*/
+/*
+    Given `baseNum' as the number of a base in the `which' part of
+    the sequence `seq', return the base position found there.
+*/
+
+
+
+extern int baseNumToPoint(Seq seq, WhichBases which, int baseNum);
+/*
+   Given `baseNum' as the number of a base in `which' part of
+   sequence `seq', return its point location. NULLPoint is
+   returned if `baseNum' was invalid.
+*/
+
+
+extern int pointToBaseNum(Seq seq, WhichBases which, int point);
+/*
+    Given `point' as a point in the `which' part of sequence `seq',
+    return the base number of the next base to the right (or on
+    the point), or NULLBaseNum if there isn't one.
+*/
+
+
+/* ---- Base editing ---- */
+
+
+extern Boolean deleteBase(Seq seq, int baseNum);
+/*
+    Delete the base `baseNum' from the sequence `seq'.
+    The result indicates success.
+*/
+
+
+extern Boolean insertBase(Seq seq, char base, int baseNum);
+/*
+    Insert the base `base' to the rigt of `baseNum'. Its position
+    is halfway between its neighbours, except when it would be
+    replacing a deleted base from the original sequence, in which
+    case it is placed there.
+
+    If `baseNum' equals -1 `base' is added to the head of the
+    sequence.
+
+    The result indicates success.
+*/
+
+
+extern Boolean setCutoffs(Seq seq, int leftCutoff, int rightCutoff);
+/*
+    Set the cutoffs to be the indicated number of characters from
+    either end of the sequence. The result indicates success.
+*/
+
+
+#endif  /*_seq_h*/
diff --git a/src/ted/seqIOABI.c b/src/ted/seqIOABI.c
new file mode 100644
index 0000000..76a3d6e
--- /dev/null
+++ b/src/ted/seqIOABI.c
@@ -0,0 +1,427 @@
+/* 
+  Title:       seqIOABI
+  
+  File: 	 seqIOABI.c
+  Purpose:	 IO of ABI sequences
+  Last update: Tue Nov 10 1992
+  
+  Change log:
+  27/11/90 SD     writeSeqABI() outputs header to sequence file:
+  format: ;{noOfBases}{leftCutOff}{basesWritten}{type}{tra
+  cefile}
+  eg:     ;   867    45    383ABI a09b7.s1RES
+  28.11.90 SD  put undesirables under STLOUIS compilation flag
+  11.12.90 SD  new static function tail to find file name in path name
+  02.01.91 SD  Merged with St.L version
+  15.01.91 SD  New include added (opp.h)
+  30.07.91 SD  Those ole FWO_ field blues
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  25.10.91 SD  Machine independant I/O...removed BIGENDIAN flag
+  10.11.92 SD  FWO_ and S/N% interpretation. Comments for information
+  window.
+  
+  */
+
+
+/*
+  This module should be regarded as part of `seq' since it is
+  privy to the internal structure of `Seq'.
+  
+  At the moment, the `maxTraceVal' of the sequence is hardwired
+  as 1200, but I suspect it is encoded somewhere in the ABI file
+  structure, or if not, we should calculate it as we read the
+  traces in.
+  
+  The ABI results file is controlled by an index found towards
+  the end --- this is pointed to by a longword found at `IndexPO'.
+  The index consists of a number of entries, each of which is
+  four character label followed by 6 long words. The first of these
+  long words holds a simple count (starting at 1) for those cases
+  where there are multiple entries with the same label. Entries should
+  be found by label (and count), rather than their index position,
+  because entries can be ommited or new ones added. This happens when
+  ABI changes the version of their software and also depending
+  on whether the data was analysed or unalaysed. We do, however,
+  make assumptions about the relative order of entries.
+  
+  Ideally we would have a separate module which provides a number
+  of functions to extract the data we are interested in, keeping
+  the ABI format well wrapped up and out of harms way.
+  
+  Note that we are relying on the endian-ness of the machine being
+  appropriate so we can just read long words in as integers. This
+  should be recoded to deal with running on different endians.
+  
+  All references to the seq->bottom, the bottom strand of the
+  sequence, were added by lfw
+  */
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+#include "seqIOABI.h"
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+#include "fpoint.h"    /* IMPORT: int_to_float */
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include <stdlib.h>
+
+#include "mach-io.h" /* IMPORT: machine independant io library */
+#include "dialogues.h"
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+
+/*
+  The index is located towards the end of the ABI trace file.
+  It's location is given by a longword at a fixed place.
+  */
+#define IndexPO ((off_t)26)
+
+#define IndexEntryLength 28
+
+
+/*
+  Here are some labels we will be looking for, four chars packed
+  into a long word.
+  */
+#define DataEntryLabel    ((int_4) ((((('D'<<8)+'A')<<8)+'T')<<8)+'A')
+#define BaseEntryLabel    ((int_4) ((((('P'<<8)+'B')<<8)+'A')<<8)+'S')
+#define BasePosEntryLabel ((int_4) ((((('P'<<8)+'L')<<8)+'O')<<8)+'C')
+#define SpacingEntryLabel ((int_4) ((((('S'<<8)+'P')<<8)+'A')<<8)+'C')
+#define SignalEntryLabel  ((int_4) ((((('S'<<8)+'/')<<8)+'N')<<8)+'%')
+#define FWO_Label         ((int_4) ((((('F'<<8)+'W')<<8)+'O')<<8)+'_')
+#define MCHNLabel         ((int_4) ((((('M'<<8)+'C')<<8)+'H')<<8)+'N')
+#define PDMFLabel         ((int_4) ((((('P'<<8)+'D')<<8)+'M')<<8)+'F')
+#define SMPLLabel         ((int_4) ((((('S'<<8)+'M')<<8)+'P')<<8)+'L')
+#define PPOSLabel         ((int_4) ((((('P'<<8)+'P')<<8)+'O')<<8)+'S')
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+
+static Boolean getIndexEntryLW(FILE *fp, off_t indexO,
+			       uint_4 label, uint_4 count, int lw,
+			       uint_4 *val)
+/*
+  From the ABI results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th long word
+  from the `count'th entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel, entryLw1;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) return(-1);
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	    if (!be_read_int_4(fp, &entryLw1))   return(False);
+	} while (!(entryLabel == label && entryLw1 == count));
+    
+    for(i=2; i<=lw; i++)
+	if (!be_read_int_4(fp, val)) return(-1);
+    
+    return(True);
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqABI(char *fn)
+/*
+  Read the ABI format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.  If bottom 
+  is True, we read the sequence in in reverse and complemented 
+  A NULL result indicates failure.
+  
+  In this module all references to the bottom strand were 
+  inserted by lfw.
+  */
+{   Seq seq = NULLSeq;
+    uint_4 fwo_; /* base -> lane mapping */
+    FILE *fp;
+    uint_4 numPoints, numBases;
+    uint_4 signalO;
+
+    uint_4 indexO;   /* File offset where the index is */
+    uint_4 baseO;    /* File offset where the bases are stored */
+    uint_4 basePosO; /* File offset where the base positions are stored */
+    uint_4 dataCO;   /* File offset where the C trace is stored */
+    uint_4 dataAO;   /* File offset where the A trace is stored */
+    uint_4 dataGO;   /* File offset where the G trace is stored */
+    uint_4 dataTO;   /* File offset where the T trace is stored */
+    uint_4 MCHN_O;   /* File offset where the machine name is stored */
+    uint_4 PDMF_O;   /* File offset where the dye primer guff is stored */
+    /*
+      uint_4 SMPL_O;   /. File offset where the sample name is stored ./
+    */
+    
+    int i;
+    
+    
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    
+    /* Get the index offset */
+    if ((fseek(fp, IndexPO, 0) != 0) || (!be_read_int_4(fp, &indexO)))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Get the number of points */
+    if (!getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,9,3,&numPoints))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Get the number of bases */
+    if (!getIndexEntryLW(fp,(off_t)indexO,BaseEntryLabel,1,3,&numBases))
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, numPoints, numBases)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    seq->mode        = BasesAndTraces;
+    seq->format      = ABIFormat;
+    seq->dirty       = False;
+    seq->maxTraceVal = 1200;
+    seq->NPoints     = numPoints;
+    seq->NorigBases  = numBases;
+    seq->NedBases    = numBases;
+    seq->bottom      = False;
+    
+    
+    /*************************************************************
+     *
+     *************************************************************/
+    
+    /*
+     ** The order of the DATA fields is determined by the field FWO_
+     ** Juggle around with data pointers to get it right
+     */
+    {
+	uint_4 *dataxO[4];
+	
+	dataxO[0] = &dataCO;
+	dataxO[1] = &dataAO;
+	dataxO[2] = &dataGO;
+	dataxO[3] = &dataTO;
+	
+	/* Get the Freak World Out (FWO?) field ... */
+	if (!getIndexEntryLW(fp,(off_t)indexO,FWO_Label,1,5,&fwo_))
+	    {   fclose(fp);
+		return(NULLSeq);
+	    }
+#define baseIndex(B) ((B)=='C'?0:(B)=='A'?1:(B)=='G'?2:3)
+	/*Get the positions of the four traces */
+	if (! ( getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,9 ,5,dataxO[baseIndex((char)(fwo_>>24&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,10,5,dataxO[baseIndex((char)(fwo_>>16&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,11,5,dataxO[baseIndex((char)(fwo_>>8&255))]) &&
+	       getIndexEntryLW(fp,(off_t)indexO,DataEntryLabel,12,5,dataxO[baseIndex((char)(fwo_&255))])
+	       )
+	    )    
+	    {   fclose(fp);
+		freeSeq(seq);
+		return(NULLSeq);
+	    }
+	
+    }
+    
+    
+    /* Read in the C trace */
+    if (fseek(fp, (off_t)dataCO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &(seq->traceC[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    /* Read in the A trace */
+    if (fseek(fp, (off_t)dataAO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceA[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    /* Read in the G trace */
+    if (fseek(fp, (off_t)dataGO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceG[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    
+    /* Read in the T trace */
+    if (fseek(fp, (off_t)dataTO, 0) != 0) {fclose(fp);freeSeq(seq);return(NULLSeq);}
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!be_read_int_2(fp, &seq->traceT[i]))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	}
+    
+    
+    
+    /* Read in the bases */
+    if (! (getIndexEntryLW(fp,(off_t)indexO,BaseEntryLabel,1,5,&baseO) &&
+           (fseek(fp, (off_t)baseO, 0) == 0)
+	   )
+	)
+	{   fclose(fp);
+	    freeSeq(seq);
+	    return(NULLSeq);
+	}
+    for (i=0;i<(seq->NorigBases);i++)
+	{   int ch;
+	    
+	    if ((ch = fgetc(fp)) == EOF)
+		{   fclose(fp);
+		    freeSeq(seq);
+		    return(NULLSeq);
+		}
+	    seq->base[i] = (ch == 'N') ? '-' : ch;
+	}
+    
+    
+    
+    
+    /* Read in the base positions */
+    if (!(getIndexEntryLW(fp,(off_t)indexO,BasePosEntryLabel,1,5,&basePosO) &&
+          (fseek(fp, (off_t)basePosO, 0) == 0)
+	  )
+	)
+	{    fclose(fp);
+	     freeSeq(seq);
+	     return(NULLSeq);
+	 }
+    
+    for (i=0;i<(seq->NorigBases);i++)
+	{   if (!be_read_int_2(fp, (uint_2 *)&seq->basePos[i]))
+		{ fclose(fp); freeSeq(seq); return(NULLSeq); }
+	}
+    
+    
+    /*************************************************************
+     * Gather useful information
+     *************************************************************/
+    {
+	char comment[1024];
+	char line[128];
+	int4 spacing;
+	int4 ppos;
+	
+	*comment = '\0';
+	
+	
+	/* Get Signal Strength Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,SignalEntryLabel,1,5,&signalO)) {
+	    int2 C,A,G,T;
+	    int2 *base[4];
+	    base[0] = &C;
+	    base[1] = &A;
+	    base[2] = &G;
+	    base[3] = &T;
+	    if (fseek(fp, (off_t)signalO, 0) >= 0 &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>24&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>16&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_>>8&255))]) &&
+		be_read_int_2(fp, (uint_2 *)base[baseIndex((char)(fwo_&255))])) {
+		sprintf(line,"avg_signal_strength = C:%d A:%d G:%d T:%d\n",C,A,G,T);
+		strcat(comment,line);
+	    }
+	}
+
+	/* Get the spacing.. it's a float but don't worry yet */
+	if (getIndexEntryLW(fp,(off_t)indexO,SpacingEntryLabel,1,5,(uint_4*)&spacing)) {
+	    sprintf(line,"avg_spacing = %6.2f\n",int_to_float(spacing));
+	    strcat(comment,line);
+	}
+
+	
+	/* Get primer position */
+	if (getIndexEntryLW(fp,(off_t)indexO,PPOSLabel,1,5,(uint_4 *)&ppos)) {
+	    sprintf(line,"primer_position = %d\n",(ppos>>16)); /* ppos stores in MBShort of pointer */
+	    strcat(comment,line);
+	}
+
+	/* Get Machine Name Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,MCHNLabel,1,5,&MCHN_O)) {
+	    if (fseek(fp, (off_t)MCHN_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/* first byte is a length */
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"machine_name = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+	
+	/* Get Dye Primer Offset */
+	if (getIndexEntryLW(fp,(off_t)indexO,PDMFLabel,1,5,&PDMF_O)) {
+	    if (fseek(fp, (off_t)PDMF_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/* first byte is a length */
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"dye_primer = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+
+	/* Get Sample Name Offset */
+	/*
+         * The code here needs to be a bit more clever
+	 * When sample name length is less than (or equal?) four, the
+	 * string is packed into the SMPL_O field
+	if (getIndexEntryLW(fp,(off_t)indexO,SMPLLabel,1,5,&SMPL_O)) {
+	    if (fseek(fp, (off_t)SMPL_O, 0) >= 0) {
+		unsigned char l;
+		char buffer[256];
+		/. first byte is a length ./
+		fread(&l,sizeof(char),1,fp);
+		fread(buffer,l,1,fp);
+		sprintf(line,"sample_name = %.*s\n",l,buffer);
+		strcat(comment,line);
+	    }
+	}
+	*/
+
+	
+	/* dumplicate string and set info */
+	{
+	    char *s = (char *)malloc(strlen(comment)+1);
+	    strcpy(s,comment);
+	    seq->info = s;
+	}
+    }
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+}
diff --git a/src/ted/seqIOABI.h b/src/ted/seqIOABI.h
new file mode 100644
index 0000000..6e9aeca
--- /dev/null
+++ b/src/ted/seqIOABI.h
@@ -0,0 +1,39 @@
+#ifndef _seqIOABI_h
+#define _seqIOABI_h
+
+
+/* 
+    Title:       seqIOABI
+
+    File: 	 seqIOABI.h
+    Purpose:	 IO of ABI sequences
+    Last update: Mon May 28 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqABI(char *fn);
+/*
+    Read the ABI format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSII C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+#endif  /*_seqIOABI_h*/
diff --git a/src/ted/seqIOALF.c b/src/ted/seqIOALF.c
new file mode 100644
index 0000000..61735fc
--- /dev/null
+++ b/src/ted/seqIOALF.c
@@ -0,0 +1,334 @@
+/* 
+  Title:       seqIOALF
+  
+  File: 	 seqIOALF.c
+  Purpose:	 IO of ALF sequences
+  Last update: Tue Nov 10 1992
+  */
+
+/*
+  Change Log :- 
+  14.01.91 SD
+  when complimenting the sequence with an odd number of bases,
+  the middle base position was not adjusted.
+  15.01.91 SD  Put StLouis stuff on compilation flag
+  15.01.91 SD  New include file (opp.h)
+  02.08.91 SD  Changes the mapping of uncertainty codes so that we
+  now only generate A C G T and -
+  Previously... bug in interpreting ALF integer fields.
+  We now treat them as unsigned.
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  25.10.91 SD  Machine independant I/O...removed BIGENDIAN flag
+  25.11.91 SD There was a hard limit (of 1024) for allocation of
+  space for number of bases, yet program would 
+  read in more if there were any, causing nasties to happen.
+  
+  */
+
+/* RMD I made substantial changes to this file 12/28/90 so as to
+   read sequence data more freely (necessary when reading data from
+   multiple trace files).
+   The affected area is indicated by comments starting RMD, like
+   this one.
+   */
+
+/* This file was adapted by LFW from seqIOABI.c.
+   
+   At the moment, the `maxTraceVal' of the sequence is hardwired
+   as 1200.  This fudge worked for the ABI, but is not really good
+   for the ALF.  To keep compatibility we keep 1200 as a max and
+   scale to fit (search for scaleFactor).  This needs work.
+   
+   The ALF results file is a concatenation of many files with an
+   index structure at the beginning, consisting of a 512 byte
+   block that we ignore, followed by 128 byte blocks describing
+   each file.  All files, including the header region, are rounded 
+   up to a multiple of 512 bytes long.  
+   The getIndexEntry routines identify the 128 byte index component
+   of interest by matching 4 chars of its ASCII label, then extract
+   the field of choice from that entry.
+   
+   Note that the SUN and PC are of opposite endian-ness, so that
+   we have to provide special routines to read words and longwords
+   from the results file.  Luckily the floating point numbers are
+   written out in ASCII.
+   
+   All references to the seq->bottom, the bottom strand of the
+   sequence, were added by lfw.
+   */
+
+
+/* ---- Imports ---- */
+
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+/*#include "seqIOABI.h"*/
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include "mach-io.h"
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+#define IndexEntryLength ((off_t)128)
+
+
+/*
+  Here are some labels we will be looking for, four chars packed
+  into a long word.
+  */
+#define EntryLabel        ((uint_4) ((((('A'<<8)+'L')<<8)+'F')<<8)+' ')
+#define BaseEntryLabel    ((uint_4) ((((('S'<<8)+'e')<<8)+'q')<<8)+'u')
+#define DataEntryLabel    ((uint_4) ((((('P'<<8)+'r')<<8)+'o')<<8)+'c')
+
+
+/* ---- Internal functions ---- */
+
+static Boolean getIndexEntryLW(FILE *fp, off_t indexO,
+			       uint_4 label, int lw,
+			       uint_4 *val)
+/*
+  From the ALF results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th long word
+  from the entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) 
+		return(-1);
+	    
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	} while (!(entryLabel == label));
+    
+    for(i=2; i<lw; i++)
+	if (!be_read_int_4(fp, val))  return(-1);
+    
+    
+    /* when i = lw read in the 4 bytes backwards */
+    if (!le_read_int_4(fp,val)) return(-1);
+    
+    return(True);
+}
+
+static Boolean getIndexEntryW(FILE *fp, off_t indexO,
+			      uint_4 label, int lw,
+			      uint_2 *val)
+/*
+  From the ALF results file connected to `fp' whose index starts
+  at byte offset `indexO', return in `val' the `lw'th  word (int2)
+  from the entry labelled `label'.
+  The result indicates success.
+  */
+{   off_t entryNum=-1;
+    int i;
+    uint_4 entryLabel;
+    uint_4 jval;
+    
+    do
+	{   entryNum++;
+	    if (fseek(fp, indexO+(entryNum*IndexEntryLength), 0) != 0) return(-1);
+	    if (!be_read_int_4(fp, &entryLabel)) return(False);
+	} while (!(entryLabel == label));
+    
+    
+    for(i=2; i<lw; i++)
+	if (!be_read_int_4(fp, &jval)) return(-1);
+    if (!le_read_int_2(fp, val)) return(-1);
+    
+    return(True);
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqALF(char *fn)
+/*
+  Read the ABI format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.  If bottom 
+  is True, we read the sequence in in reverse and complemented 
+  A NULL result indicates failure.
+  
+  In this module all references to the bottom strand were 
+  inserted by lfw.
+  */
+{   Seq seq = NULLSeq;
+    int i;
+    FILE *fp;
+    uint_4 numPoints;
+    
+    uint_4 data_size;
+    uint_4 dataO;
+    uint_4 header_size=396; /* size of the header of the processed data
+			     section */
+    uint_2 actBaseDataSize; /* actual number of bytes of data of information
+			     containing the base and basePos information */
+    int num_points; /* keeps track of the actual number of points, rather
+		       than the early guess of numPoints */
+    
+    /* RMD 12/28/90 changed scaleFactor from 7 to 20 -
+       should read this from file, or calculate it */
+    
+    off_t indexO;   /* File offset where the index is */
+    uint_4 baseO;    /* File offset where the bases are stored */
+    
+    
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    
+    /* RMD lots of changes below here until end of data reading section
+       Some are cosmetic - I introduced the label abort: at the end of
+       the routine and sent all error stuff there.  
+       getIndexEntry calls in front of where they were needed, and made
+       There is a substantive change to the inner loop of the sequence
+       reading section.  This now uses fscanf - much less rigid than the
+       previous scheme.  Note that it reads bp as a float.  This is because
+       it is a float in multiple trace data files! (bizarre Pharmacia
+       programming!).
+       */
+    
+    
+    /* find offsets and length of sequence and data segments */
+    
+    indexO = 522;      /* indexO is the offset of the index */
+    /* or I could look for the first label, starting 'ALF'
+       if I used 512 then none of the entries are on long 
+       word boundaries */
+    
+    if (! ( getIndexEntryLW(fp,indexO,BaseEntryLabel,12,&baseO)  ))
+	/* offset in file of first base of sequence */
+	goto abort ;
+    
+    if (! ( getIndexEntryW(fp,indexO,BaseEntryLabel,10,&actBaseDataSize)  ))
+	/* actual size of region containing this data */
+	goto abort ;
+    
+    if (! ( getIndexEntryLW(fp,indexO,DataEntryLabel,12,&dataO)))
+	/* offset in file to start of processed data segment - there 
+	   is then a header of size header_size (currently 396) */
+	goto abort ;
+    
+    if (! ( getIndexEntryLW(fp,indexO,DataEntryLabel,10,&data_size)  ))
+	/* actual size of region containing this data */
+	goto abort ;
+    
+    numPoints = (int)((data_size - header_size)/ 8); 
+    /* because each trace value is stored in a 2 byte
+       integer, thus to store A C G T information
+       it takes 8 bytes.  So subtract off the header and
+       divide by 8 */
+    
+    /* RMD make enough space for 1024 bases - hard limit */
+#define BASELIMIT 1024
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, numPoints, BASELIMIT)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    seq->mode        = BasesAndTraces;
+    seq->format      = ALFFormat;
+    seq->dirty       = False;
+    seq->maxTraceVal = 0;
+    seq->NPoints     = numPoints;
+    seq->bottom	     = False;
+    
+    /* RMD set ->NorigBases and ->NedBases after reading them in */
+    
+    /* read in the sequence */
+    
+    if (!(fseek(fp, (off_t)baseO, 0) == 0))
+	goto abort ;
+    
+    {	/* new locals introduced by LFW and/or RMD for the ALF */
+	int numBases;	/* number of nucleotides read in */
+	float bp ;
+	char ch;
+	
+	
+	for (numBases = 0 ; ftell(fp) < baseO+(unsigned short)actBaseDataSize  && numBases<BASELIMIT; ) {
+	    char line[200];
+	    fgets(line,(int)sizeof(line),fp);
+	    sscanf (line,"%c %*d %f", &ch, &bp) ;
+	    
+	    /* we convert ch to Staden format here */
+	    switch (ch) {
+	    case 'A':
+	    case 'C':
+	    case 'G':
+	    case 'T':
+		break;
+	    default:
+		if (isupper(ch))
+		    ch = '-';
+		else
+		    ch = '\0';
+	    }
+	    
+	    if (ch) {
+		seq->base[numBases]    = ch;
+		seq->basePos[numBases] = bp;
+		++numBases ;
+		
+	    }
+	}
+	
+	seq->NorigBases  = numBases;
+	seq->NedBases    = numBases;
+    }
+    
+    /* read in the traces , stored in 2 byte integers in records in the
+       order A C G T A C G T A C G T ...*/
+    
+    if (fseek(fp, (off_t)(dataO+header_size), 0) != 0) 
+	goto abort ;
+    
+    num_points = 0;
+    
+    for (i=0;i<(seq->NPoints);i++)
+	{   if (!le_read_int_2(fp, &(seq->traceA[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceA[i]) seq->maxTraceVal = seq->traceA[i];
+	    if (!le_read_int_2(fp, &(seq->traceC[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceC[i]) seq->maxTraceVal = seq->traceC[i];
+	    if (!le_read_int_2(fp, &(seq->traceG[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceG[i]) seq->maxTraceVal = seq->traceG[i];
+	    if (!le_read_int_2(fp, &(seq->traceT[i])))
+		{fclose(fp);freeSeq(seq);return(NULLSeq);}
+	    if (seq->maxTraceVal < seq->traceT[i]) seq->maxTraceVal = seq->traceT[i];
+	    
+	    if (seq->traceA[i]==0 && seq->traceT[i]==0 && seq->traceC[i]==0 && seq->traceG[i]==0 && i > (numPoints-64)) break;
+	    
+	    num_points++;
+	}
+    
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+    
+ abort:
+    fclose(fp);
+    freeSeq(seq);
+    return(NULLSeq);
+}
diff --git a/src/ted/seqIOALF.h b/src/ted/seqIOALF.h
new file mode 100644
index 0000000..28b7388
--- /dev/null
+++ b/src/ted/seqIOALF.h
@@ -0,0 +1,40 @@
+#ifndef _seqIOALF_h
+#define _seqIOALF_h
+
+
+/* 
+    Title:       seqIOALF
+
+    File: 	 seqIOALF.h
+    Purpose:	 IO of ABI sequences
+    Last update: Tues Dec 11 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqALF(char *fn);
+/*
+    Read the ABI format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSII C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+
+#endif  /*_seqIOALF_h*/
diff --git a/src/ted/seqIOEdit.c b/src/ted/seqIOEdit.c
new file mode 100644
index 0000000..b248555
--- /dev/null
+++ b/src/ted/seqIOEdit.c
@@ -0,0 +1,770 @@
+/*
+  Title:  seqIOEdit
+  
+  File:   seqIOEdit.c
+  Purpose: IO of the editted portion of plain or edited sequences 
+  
+  Last update: Monday 24 February 1992
+  
+  Change log :-
+  15.01.91 SD  New include file (opp.h)
+  04.12.91 lfw added sample lanes up to 40 instead of 24
+  04.12.91 changed the way the left cutoff is found...now I allow
+  look for first occurrence of the left cutting sequence;
+  looking first for an exact match, then a match with
+  one mismatch, then with two...if nothing is found at
+  that point I assume the left cutoff is not there
+  24.02.92 SD Fixed bug in findRightCutoff() when checking for overlap with
+  leftCutoff. There was some confusion over what the value of
+  rightCutoff actually means.
+  18.11.92 lfw changed the names of the temporary files to .abc* instead
+  of abc*, also used the remove() command rather than system() to get
+  rid of the files and added another remove call so they will be sure to
+  be removed
+  
+  
+  */
+
+
+/* ---- Imports ---- */
+#include <stdlib.h>
+#include "seqIOEdit.h"
+
+#include "seq.h"/* IMPORT: Seq, BasesAndTraces, NULLSeq,
+		   newSeq, freeSeq */
+#include "opp.h"       /* IMPORT: oppInitialise */
+#include "dialogues.h"
+
+
+extern int findPercntAmbig(char *theSeq, int num_bases);
+
+/* ---- Internal Functions ---- */
+
+int findPercntAmbig(char *theSeq, int num_bases);
+
+static void text_to_output(char *vec,int stp,int endp,int dvice,char *outfile)
+
+/*
+ * text_to_output(vec,stp,endp,dvice,outfile)
+ * input: char **vec,*outfile; int stp, endp, dvice;
+ * this program output a specified portion of a genbank file
+ * (from vec[stp] to vec[endp]) to the screen (default), a file (dvice
+ * =1), or lpr (dvice =2).  Outfile is the input filename if you wish to
+ * output the information to a specified file. stp and endp are integers,
+ *  not pointers.
+ */
+
+{
+    int i;
+    FILE *fopen(),*fp,*where;
+    
+    
+    if (dvice == 1) {
+	if ((fp = fopen(outfile,"a"))==NULL) {
+	    printf ("\nERROR: can't open file %s\n",outfile);
+	    return;
+	}
+	else where = fp;
+    }
+    else if (dvice == 2) {
+	if ((fp = fopen("junkfile.","a"))==NULL) {
+	    printf ("\nERROR: can't open file junkfile. to output to the lpr\n");
+	    return;
+	}
+	else where = fp;
+    }
+    else {
+	/*  default : */
+	where = stdout;
+    }
+    
+    for (i = stp; i < endp; i++)
+	putc(vec[i],where);
+    
+    if ((dvice == 1) || (dvice == 2)) fclose(fp);
+    
+}
+
+int checkForExistingEdFile(char *fn)
+/*
+ * check to see if there is an existing .seq.n file;
+ * return the largest n  or 0 if no files existed,
+ * return a -1 if it was a problem with opening files
+ */
+
+{
+    char vec[500];
+    int last_ed_num;
+    FILE *fp;
+    int i;
+    
+    /* make sure there are no files with the names I'm about to use*/
+    remove (".abcxyztmpsh.");
+    remove (".abcxyztmpout.");
+    
+    
+    /* write a little shell to see the last n in your_filename.n
+       in the current directory */
+    if ((fp=fopen(".abcxyztmpsh.", "w")) == NULL) return(-1);
+    fclose(fp);
+    sprintf(vec,"for i in 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 \n");
+    i = strlen(vec);
+    sprintf(vec+i,"  do\n   oldname=%s.$i\n   if test -f $oldname\n   then  echo $i\n",fn);
+    i = strlen(vec);
+    sprintf(vec+i,"   fi\ndone\n");
+    
+    text_to_output(vec,0,strlen(vec),1,".abcxyztmpsh.");
+    
+    /* execute the shell and have it output the last number it found into
+       a file called abcxyztmpout. */
+    system("sh .abcxyztmpsh. | tail -1 > .abcxyztmpout.");
+    
+    /* read that number from that file */
+    
+    if ((fp=fopen(".abcxyztmpout.", "r")) == NULL) return(-1);
+    vec[0]=getc(fp);
+    fclose(fp);
+    /* if that number was EOF return(0), nothing found */
+    if (vec[0]==EOF) {
+      remove (".abcxyztmpsh.");
+      remove (".abcxyztmpout.");
+
+      return(0);
+    }
+    
+    vec[1]='\0';
+    
+    sscanf(vec,"%d",&last_ed_num);
+    remove (".abcxyztmpsh.");
+    remove (".abcxyztmpout.");
+    
+    return(last_ed_num);
+}
+
+Boolean isDotSeq(char *fn)
+/* make sure there is a .seq on  the end of fn, puts one on
+   if there is not */
+{
+    int i;
+    
+    i = strlen(fn)-1;
+    if (fn[i]!='q' || fn[i-1]!='e' || fn[i-2]!='s' || fn[i-3]!='.') 
+	return(False);
+    else return(True);  
+}
+
+void stripDotSeq(char *fn)
+/* there is a .seq on  the end of fn, strip it off */
+
+{ int i;
+  
+  i = strlen(fn)-1;
+  
+  if (fn[i]=='q' && fn[i-1]=='e' && fn[i-2]=='s' && fn[i-3]=='.') 
+      fn[i-3]='\0';
+  
+  return;
+}
+
+void stripDotNum(char *fn)
+/* if there is a .num on  the end of fn, strip it off */
+
+{ int i;
+  
+  i = strlen(fn)-1;
+  
+  if (isdigit(fn[i]) && fn[i-1]=='.') 
+      fn[i-1]='\0';
+  else if (isdigit(fn[i]) && isdigit(fn[i-1]) && fn[i-2]=='.')   
+      fn[i-2]='\0';
+  
+  return;
+}
+
+int  isDotNum(char *fn)
+/*
+ * checks if there is a .1 or .2 or .m on the inputfilename.
+ * If there is, then returns that num.  If not returns -1.
+ */
+{
+    int i,j;
+    int dotnum;
+    char *atemp;
+    
+    atemp = (char *)calloc(20,sizeof(char));
+    
+    i = strlen(fn)-1;
+    j = 0;
+    
+    if (isdigit(fn[i]) && (fn[i-1]=='.')) {
+	atemp[0] = fn[i];
+	atemp[1] = fn[i+1]; 
+	sscanf(atemp,"%d",&dotnum);
+	free(atemp);
+	return(dotnum);
+    }
+    else if (isdigit(fn[i]) && isdigit(fn[i-1]) && fn[i-2]=='.')   {
+	atemp[0] = fn[i-1];
+	atemp[1] = fn[i];
+	atemp[2] = '\0';
+	sscanf(atemp,"%d",&dotnum);
+	free(atemp);
+	return(dotnum);
+    }
+    
+    free(atemp);
+    return(-1);
+}
+
+static void get_compl_seq(char *ac_seq,char *aseq,int stp,int endp, int seq_len, int rev)
+
+/*
+ * uses aseq to find the sequence that
+ * would appear on the opposite strand and places
+ * that sequence in ac_seq; stp is the starting point on
+ * aseq and endp is the ending point on aseq;
+ * seq_len is the length of the input sequence,
+ * if rev==1 reverses as well as complements, if rev==0
+ * only complements
+ */
+{
+    int i;
+    
+    oppInitialize();
+    
+    if (rev == 1) {
+	for (i = stp; i <= endp; i++)
+	    ac_seq[seq_len - i] = opp[aseq[i]];
+    }
+    else {
+	for (i = stp; i <= endp; i++)
+	    ac_seq[i] = opp[aseq[i]];
+	ac_seq[i-1] ='\0';
+    }
+    
+    
+    return;
+}
+
+
+/* ---- Externals ---- */
+
+Boolean writeEdSeq(Seq seq, char *fn)
+
+{
+    FILE *fp;
+    int i,j;
+    char ed_fn[200];
+    int last_ed_num; /* largest n of fn.seq.n in
+			current directory */
+    
+    oppInitialize();
+    
+    /* make sure the filename does not have a .seq on the end */
+    stripDotSeq(fn);
+    
+    /* get the n to put on inputfilename.n for the edited 
+       file to be kept */
+    last_ed_num = checkForExistingEdFile(fn);
+    
+    if (last_ed_num == -1) return(False);
+    else sprintf(ed_fn,"%s.%d",fn,last_ed_num+1);
+    /* that line takes care of 0 too, because
+       it names the file fn.seq.1 */
+    
+    /* Open for writing, text */
+    if ((fp=fopen(ed_fn, "w")) == NULL) return(False);
+    
+    
+    /* write information in the following format:
+       NedBases*edits array*edBase array*edBasePos array*
+       each division ends in an * and within
+       each division entries are separated by spaces */
+    
+    if (seq->bottom) {
+	fprintf(fp," %6d*%3d*%6d*%6d*",seq->NorigBases,seq->NedBases,seq->rightCutoff,seq->leftCutoff);
+    }
+    else {
+	fprintf(fp," %6d*%3d*%6d*%6d*",seq->NorigBases,seq->NedBases,seq->leftCutoff,seq->rightCutoff);
+    }
+    
+    /* print out edits array */
+    
+    if (seq->bottom) {
+	for (i=seq->NorigBases+MaxEdits-1;
+	     i > -1;
+	     i--)
+	    { 
+		j = 0;
+		if (i==seq->NorigBases+MaxEdits-1) 
+		    /* this part is a fudge to stick in 0 0 */
+		    fprintf(fp,"%6d %6d ",j,j);
+		
+		
+		if (((seq->edits[i]!=0)  && ((seq->NedBases -1- i)>=0)) || i==0){
+		    if (seq->edits[i] <0)
+			fprintf(fp,"%6d %6d ",seq->NedBases -1- i,seq->edits[i]);
+		    else
+			fprintf(fp,"%6d %6d ",seq->NedBases -1 -i,seq->NorigBases - 1-(seq->edits[i]));   
+		}
+	    }
+    }
+    else {
+	for (i=0;
+	     i<seq->NorigBases+MaxEdits;
+	     i++) {
+	    
+	    if ((seq->edits[i]!=0)  || (i==0))
+		fprintf(fp,"%6d %6d ",i,seq->edits[i]);
+	}
+    }
+    
+    fprintf(fp,"%6d %6d ",NULLPoint,NULLPoint);
+    
+    fprintf(fp,"*");
+    
+    /* print out non-NULL entries in edBase array */
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{
+	    if (seq->edBase[i] == NULL) {
+		fprintf(fp,"* ");
+		break;
+	    }
+	    else {
+		if (seq->bottom) 
+		    fprintf(fp,"%c ",opp[seq->edBase[i]]);
+		else
+		    fprintf(fp,"%c ",seq->edBase[i]);
+	    }
+	}
+    
+    fprintf(fp,"*");
+    
+    /* print out non-NULL entries in edBasePos array */
+    
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{
+	    
+	    if (seq->edBasePos[i] == NULLPoint)  {
+		fprintf(fp,"%6d ",NULLPoint); /* -1 is NULLPoint*/
+		break;
+	    }
+	    else {
+		int fudge;
+		/*
+		 * when you're plotting the strand in the reverse
+		 * order, you must move the starting position over by
+		 * the width of one character.  Because positions in the
+		 * other file, already take into account the character
+		 * width. Therefore, fudge = ~character width + basePos
+		 * of the first base
+		 */
+		
+		fudge = seq->basePos[0] + 6;
+		
+		if (seq->bottom) {
+		    /*	   fprintf(fp,"%6d ",seq->basePos[seq->NorigBases-1]-seq->edBasePos[i]+fudge);	  */
+		    fprintf(fp,"%6d ",seq->NPoints-seq->edBasePos[i]);
+		}
+		else
+		    fprintf(fp,"%6d ",seq->edBasePos[i]);
+	    }
+	}
+    
+    fprintf(fp,"*");
+    (void) fclose(fp);
+    return(True);
+}
+
+
+
+Boolean readEdSeq(Seq seq, char *fn, int dotnum)
+/*
+ * reads in the most recent fn.seq.n file. This
+ * file should be of the form:
+ * basePos;edBasePos;edBase
+ * returns True if it read in the sequence from
+ * an editted file, and False if there was no
+ * editted file or if there was a problem 
+ * 
+ * dotnum == -1 if the user did not specify a
+ * version number of the sequence to read in,
+ * if they want version m read in.
+ */
+
+{
+    int last_ed_num;
+    char ed_fn[200];
+    FILE *fp;
+    
+    oppInitialize();
+    
+    /* make sure the filename does not have a .seq on the end */
+    stripDotSeq(fn);
+    
+    if (dotnum == -1) {
+	/* get the n to put on fn.seq.n */
+	last_ed_num = checkForExistingEdFile(fn);
+	if (last_ed_num == -1) return(False);
+	else if (last_ed_num == 0) return(False);
+	else sprintf(ed_fn,"%s.%d",fn,last_ed_num);
+    }
+    else
+	sprintf(ed_fn,"%s.%d",fn,dotnum);
+    
+    
+    /* Open for reading, text */
+    if ((fp=fopen(ed_fn, "r")) == NULL) return(False);
+    fclose(fp);
+    
+    if (processEdSeqFile(seq,ed_fn))
+	return(True);
+    else return(False);
+}
+
+Boolean processEdSeqFile(Seq seq, char *fn)
+
+/*
+ * processes a char vector containing the Editted
+ * sequence in the following format:
+ *
+ * NedBases*NorigBases*leftCutoff*rightCutoff*negative
+ * components of edits array in the form (position,
+ * negative number)*non NULL components edBase array*
+ * non NULL components edBasePos array*
+ *
+ * each division ends in an H. The end of the
+ * edits array is signaled by a -1 -1 entry. Within
+ * each division entries are separated by spaces
+ * 
+ * sticks the information it finds into the seq array
+ * 
+ * returns false if there was a problem with the file format
+ */
+{
+    FILE *fp;
+    int nbases;
+    char achar;
+    int i,j,k;
+    char ed_fn[200];
+    
+    strcpy(ed_fn,fn);
+    
+    if ((fp=fopen(ed_fn, "r")) == NULL) return(False);
+    
+    fscanf(fp," %6d",&nbases);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the number of bases)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    fscanf(fp,"%3d",&i);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the number of edited bases)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    seq->NedBases = i;
+    if (nbases != seq->NorigBases) {
+	printf("ERROR: Input editted sequence was of wrong format\n (Number of editted bases has changed)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    fscanf(fp,"%6d",&j);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the left cutoff)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    fscanf(fp,"%6d",&k);   
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(No asterisk was found after the right cutoff)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    if (seq->bottom) {
+	seq->leftCutoff = k;
+	seq->rightCutoff = j;
+    }
+    else {
+	seq->leftCutoff = j;
+	seq->rightCutoff = k;
+    }
+    
+    /* read in the seq->edits array */
+    
+    for (i=1;
+	 i<seq->NorigBases+MaxEdits;
+	 i++)
+	{ 
+	    fscanf(fp,"%6d %6d ",&j,&k);
+	    if (j==NULLPoint) break;
+	    else {
+		if (seq->bottom) {
+		    if (k<0) /* then it is a reference to the edBase array */
+			seq->edits[seq->NedBases-1-j]=k;
+		    else /*it is a base number which needs to be converted
+			   to the base number on the opposite strand */
+			seq->edits[seq->NedBases-1-j]=seq->NorigBases-1-k;
+		}
+		else
+		    seq->edits[j]=k;
+	    }
+	    
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format \n(Error in the edits array)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    /* read in the seq->edBase array */  
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{  
+	    fscanf(fp,"%c ",&achar);
+	    if (achar == '*') break;
+	    else {
+		if (seq->bottom) 
+		    seq->edBase[i]=opp[achar];
+		else
+		    seq->edBase[i]=achar;
+	    }
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("Input editted sequence was of wrong format\n(Extraneous information after edBases and before the *\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    
+    /* read in the seq->edBasePos array */  
+    
+    for (i=1;
+	 i<MaxEdits;
+	 i++)
+	{ 
+	    fscanf(fp,"%6d ",&k);
+	    if (k==NULLPoint) break;
+	    else {
+		int fudge;
+		
+		/*
+		 * when you're plotting the strand in the reverse
+		 * order, you must move the starting position over by
+		 * the width of one character.  Because positions in the
+		 * ABI File,  already take into account the character
+		 * width. Therefore, fudge = ~character width + offset of
+		 * the first peaks position
+		 */
+		
+		fudge = seq->basePos[0] + 6;
+		
+		if (seq->bottom)  {
+		    /*	  seq->edBasePos[i]=seq->basePos[seq->NorigBases-1]-k+fudge;*/
+		    seq->edBasePos[i]=seq->NPoints - k;
+		}
+		else
+		    seq->edBasePos[i]=k;
+	    }
+	    
+	}
+    achar = getc(fp);
+    if (achar != '*') {
+	printf("ERROR: Input editted sequence was of wrong format\n(Error in editted Base Position array)\n");
+	fclose(fp);    
+	return(False);
+    }
+    
+    /*
+     * Don't set the seq to Dirty, otherwise the user won't know
+     * if they have or have not edited their input edited sequence
+     */
+    fclose(fp);
+    return(True);
+}
+
+
+
+
+int findLeftCutoff(Seq seq, char *enzInString)
+/*
+ * looks for left cutoff, if it doesn't find a "enzInString", then
+ * it looks from enzInString less it's last character, etc
+ */
+
+{
+    int maxStartPos=100; /* if the enzyme site wasn't found before this
+			    baseNum, then that's probably not the cloning
+			    site */
+    int i,j,found;
+    /* int jj,kk; */
+    int indices[100];
+    int num_matches;
+    char *theSeq;
+    int num_bases;
+    char enzString[100];
+    /* char enztemp[100]; */
+    
+    found = 0;
+    
+    if (seq->bottom)
+	get_compl_seq(enzString,enzInString,0,strlen(enzInString),strlen(enzInString),0);
+    else
+	strcpy(enzString,enzInString);
+    
+    
+    num_bases = getNBases(seq,EdBases);
+    theSeq = (char *)calloc(num_bases,sizeof(char));
+    
+    j = 0;
+    if (seq->bottom) {
+	for (i = num_bases-1; i >= 0; i--){
+	    theSeq[i] = getBase(seq, EdBases, j);
+	    j++;
+	}
+    }
+    else {
+	for (i = 0; i < num_bases; i++)
+	    theSeq[i] = getBase(seq, EdBases, i);
+    }
+    
+    
+    
+    /* first look for a complete match (one mismatch allowed), then
+       gradually take nucleotides off the 5 prime end and continue
+       looking for a match */
+    /*
+    for (i = 0; i < 3; i++)  {
+	jj=0;
+	for (kk=i; enzString[kk]; kk++) enztemp[jj++]=enzString[kk];
+	enztemp[jj]='\0';
+	num_matches=string_match(enztemp,strlen(enztemp),theSeq,num_bases,2,indices);
+	if (num_matches > 0) 
+	    if (indices[0] < maxStartPos) {
+		found = 1;
+		break;
+	    }
+    }
+    */
+    
+    /* look for first occurrence of enzString; 
+       just look a match with at most i mismatches, starting
+       with 0 mismatches down to two*/
+    for (i=0; i<3; i++) {
+	num_matches=string_match(enzString,strlen(enzString),theSeq,num_bases,i,indices);
+	if (num_matches > 0) 
+	    if (indices[0] < maxStartPos) {
+		found = 1;
+		break;
+	    }
+    }
+    
+    free(theSeq);
+    
+    if (found) return(indices[0] + strlen(enzString));
+    else return(0);
+    
+}
+
+
+int findRightCutoff(Seq seq)
+{
+    /* give the %age cutoff a default but let it be user
+       specifiable on the command line ? */
+    /* 
+     * ways to look for ends of sequence 
+     * 1. runs of nucleotides or dinucleotides
+     * -- but rick says there are lots of runs of
+     * A's and T's in what they're sequencing
+     * 2. percentage of N's
+     * 3. automatically drop down to baseNum 600 to even start
+     * looking for a cutoff
+     */
+    
+    int num_bases;
+    char *theSeq;
+    int i,j;
+    int rightCutoff;
+    
+    num_bases = getNBases(seq,EdBases);
+    theSeq = (char *)calloc(num_bases,sizeof(char));
+    
+    j = 0;
+    if (seq->bottom) {
+	for (i = num_bases-1; i >= 0; i--){
+	    theSeq[i] = getBase(seq, EdBases, j);
+	    j++;
+	}
+    }
+    else {
+	for (i = 0; i < num_bases; i++)
+	    theSeq[i] = getBase(seq, EdBases, i);
+    }
+    
+    rightCutoff = findPercntAmbig(theSeq,num_bases);
+    
+    /* added so that the left and right cutoffs do not overlap */
+    if (rightCutoff > num_bases - seq->leftCutoff) rightCutoff=num_bases - seq->leftCutoff;
+    
+    free(theSeq);
+    return(rightCutoff);
+}
+
+
+int findPercntAmbig(char *theSeq, int num_bases)
+
+{
+    int i,j;
+    int isN[256];
+    int totalN = 0;
+    int numN;
+    int nucWindow;
+    
+    numN = 2;
+    nucWindow = 5;
+    
+    for (i = 0; i <= 256; i++) isN[i]=0;
+    isN['n']=1;
+    isN['N']=1;
+    isN['-']=1;
+    
+    
+    
+    /* 
+     * start at base num 200 and look for numN Ns within
+     * a window of nucWindow nucleotides, once you find that
+     * second N send back the indices of that second
+     * N as the cutoff line
+     */
+    
+    for (i = 200; i < num_bases; i++) {
+	totalN = 0;
+	for (j = 0; j < nucWindow; j++) {
+	    if (isN[theSeq[i+j]]) totalN++; 
+	    if (totalN == numN) 	return(num_bases - (i+j));
+	}
+    }
+    return(0);
+    
+}
+
+
diff --git a/src/ted/seqIOEdit.h b/src/ted/seqIOEdit.h
new file mode 100644
index 0000000..9442d60
--- /dev/null
+++ b/src/ted/seqIOEdit.h
@@ -0,0 +1,115 @@
+#ifndef _seqIOEdit_h
+#define _seqIOEdit_h
+
+/*
+ Title:       seqIOEdit
+
+ File:        seqIOEdit.h
+ Purpose:     IO of edited sequences
+    Last update: Tue Jan 15 1991
+
+ Change log :-
+ 15.01.91 SD 
+    Removed parts required for complementation and put them in opp.h
+*/
+
+/* ---- Imports ---- */
+
+#include "seq.h"   /* IMPORT: Seq */
+#include <stdio.h>      /* IMPORT: fopen, fclose, fseek, ftell, fgetc */
+#include <ctype.h>      /* IMPORT: isprint */
+
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+/* ---- Private ---- */
+
+/* ---- Exports ---- */
+
+extern Boolean writeEdSeq(Seq seq, char *fn);
+/*
+    Write the clipped, edited part of the sequence `seq'
+    into file `fn'. The result indicates success.
+
+    (*==fn in the following description)
+
+    addition by lfw, 10/24/90,
+
+  1. writes *.out as text file without printing bases
+       before left and right cutoff, and without
+       printing base numbers (in this program)
+
+    2. write *.1 as text if seq.1 does not exist
+       write *.2 as text if seq.1 does exist
+       write *.3 as text if seq.2 does exist
+       ...
+       write *.n as text if seq.(n-1) does exist
+       ( in writeEdSeq in seqIOEdit.c )
+
+
+*/
+
+extern Boolean readEdSeq(Seq seq, String fn, int dotnum);
+/*reads in the most recent fn.seq.n file. This
+  file should be of the form:
+    basePos;edBasePos;edBase
+  returns True if it read in the sequence from
+   an editted file, and False if there was no
+   editted file or if there was a problem 
+
+   dotnum == -1 if the user did not specify a
+   version number of the sequence to read in,
+   m if they want version m read in. */
+
+
+
+extern Boolean processEdSeqFile(Seq seq, String fn);
+
+/* processes a char vector containing the Editted
+   sequence in the following format:
+
+       NedBases*NorigBases*leftCutoff*rightCutoff*negative 
+       components of edits array in the form (position,
+       negative number)*non NULL componenets edBase array*
+       non NULL components edBasePos array*
+
+       each division ends in an H. The end of the
+       edits array is signaled by a -1 -1 entry. Within
+       each division entries are separated by spaces
+
+   sticks the information it finds into the seq array
+
+   returns false if there was a problem with the file format */
+
+extern Boolean isDotSeq(char *fn);
+
+/* checks to see if the filename already has the .seq
+   suffix or not */
+
+
+extern int isDotNum(char *fn);
+/* checks if there is a .1 or .2 or .m on the inputfilename.
+If there is, then returns that num.  If not returns -1.
+   */
+
+extern void stripDotNum(char *fn);
+/* if there is a .num (indicating that the user has input
+   an edited version of the sequence they wish to have
+   read in) on  the end of fn, strip it off */
+
+extern int findLeftCutoff(Seq seq, char *enzString);
+/* if the sequence has not previously been edited, this
+subroutine is used for auto-clipping */
+
+
+extern int findRightCutoff(Seq seq);
+/* if the sequence has not previously been edited, this
+subroutine is used for auto-clipping */
+
+
+#endif  /*_seqIOEdit_h*/
+
+
+
+
+
diff --git a/src/ted/seqIOPlain.c b/src/ted/seqIOPlain.c
new file mode 100644
index 0000000..e1b8c12
--- /dev/null
+++ b/src/ted/seqIOPlain.c
@@ -0,0 +1,120 @@
+/* 
+  Title:       seqIOPlain
+  
+  File: 	 seqIOPlain.c
+  Purpose:	 IO of plain sequences
+  Last update: Tuesday Jan 15 1991
+  
+  Change log:
+  
+  28.11.90 SD  put undesirables under STLOUIS compilation flag
+  15.01.91 SD  new include file (opp.h)
+  17.09.91 LFW changed STLOUIS compilation flag to SAVE_EDITS
+  and AUTO_CLIP
+  */
+
+
+/*
+  This module should be regarded as part of `seq' since it is
+  privy to the internal structure of `Seq'.
+  
+  Any references to the writing or reading of edited sequences,
+  or to the bottom strand were added by lfw
+  */
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seqIOPlain.h"
+#include "seq.h"        /* IMPORT: Seq, BasesOnly, NULLSeq,
+			   newSeq, freeSeq */
+#include "seqIOEdit.h" /* IMPORT: readEdSeq, writeEdSeq */
+
+#include <stdio.h>      /* IMPORT: fopen, fclose, fseek, ftell, fgetc */
+#include <ctype.h>      /* IMPORT: isprint */
+
+
+
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+
+
+
+/* ---- Exports ---- */
+
+
+
+
+Seq readSeqPlain(char *fn)
+/*
+  Read the plain format sequence with name `fn' into `seq'.
+  All printing characters (as defined by ANSII C `isprint')
+  are accepted, but `N's are translated to `-'s.
+  A NULLSeq result indicates failure.
+  */
+{   Seq seq = NULLSeq;
+    FILE *fp;
+    off_t fileLen;
+    int  ch;
+    int i;
+    int wait_for_eol=0; /* used for looking for end of lines in the header */
+
+    if ((fp = fopen(fn, "r")) == NULL) return(NULLSeq);
+    
+    /*
+      Find the length of the file.
+      Use this as an overestimate of the length of the sequence.
+      */
+    fseek(fp, (off_t) 0, 2);
+    if ((fileLen = ftell(fp)) > MaxInt2)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    fseek(fp, (off_t) 0, 0);
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesOnly, 0, fileLen)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    seq->mode   = BasesOnly;
+    seq->format = PlainFormat;
+    seq->dirty  = False;
+    seq->bottom = False;
+    
+    /* Read in the bases */
+    
+    seq->NorigBases = 0;
+    while ((ch = fgetc(fp)) != EOF) {
+        if (ch==';' || wait_for_eol) {
+	    /* if you find a semicolon assume that line is a comment
+	       and stop reading sequence until you have reached the
+	       end of line */
+	    wait_for_eol=1;
+	    if (ch=='\n') wait_for_eol=0;
+        }
+        else if (isprint(ch))
+	    {   seq->base[seq->NorigBases] = (ch=='N') ? '-' : ch;
+		seq->NorigBases++;
+	    }
+    }
+    
+    
+    seq->NedBases = seq->NorigBases;
+    (void) fclose(fp);
+    
+    
+    /*
+      Build a dummy basePos table such that the position of the
+      base is equal to its place in the sequence * 10.
+      */
+    seq->NPoints = (seq->NorigBases)*10;
+    for (i=0; i < seq->NorigBases; i++) seq->basePos[i] = i*10;
+
+    return(seq);
+}
diff --git a/src/ted/seqIOPlain.h b/src/ted/seqIOPlain.h
new file mode 100644
index 0000000..feb513e
--- /dev/null
+++ b/src/ted/seqIOPlain.h
@@ -0,0 +1,45 @@
+#ifndef _seqIOPlain_h
+#define _seqIOPlain_h
+
+
+/* 
+    Title:       seqIOPlain
+
+    File: 	 seqIOPlain.h
+    Purpose:	 IO of plain sequences
+    Last update: Mon May 28 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern Seq readSeqPlain(char *fn);
+/*
+    Read the plain format sequence with name `fn' into `seq'.
+    All printing characters (as defined by ANSI C `isprint')
+    are accepted, but `N's are translated to `-'s.
+    A NULLSeq result indicates failure.
+
+    enzString is used to automatically determine the left
+    cutoff for a given sequence -- only determined if this
+    is the first time this sequence has been read in.  If 
+    the sequence has already been edited, the program uses
+    the previous settings for left and right cutoffs
+
+*/
+
+
+#endif  /*_seqIOPlain_h*/
diff --git a/src/ted/seqIOSCF.c b/src/ted/seqIOSCF.c
new file mode 100644
index 0000000..cee7d13
--- /dev/null
+++ b/src/ted/seqIOSCF.c
@@ -0,0 +1,286 @@
+/* 
+  Title:       seqIOSCF
+  
+  File: 	 seqIOSCF.c
+  Purpose:	 IO of Standard Chromatogram Format sequences
+  Last update: Tue Nov 10 1992
+  
+  Change log:
+  4 Feb 1992,  Now draft proposal version 2
+  20 Feb 1992, Grab info from comment lines
+  19 Aug 1992, If SCF file has clip information, don't clip automatically
+  10 Nov 1992  SCF comments now stored in seq data structure
+  
+*/
+
+
+
+
+
+/* ---- Imports ---- */
+
+#include "scf.h"      /* SCF structures */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			 newSeq, freeSeq */
+#include "seqIOEdit.h" /* IMPORT: writeEdSeq, 
+			  readEdSEq */
+
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+			 EOF */
+#include <stdlib.h>
+#include "mach-io.h"
+
+
+/* ---- Internal functions ---- */
+
+
+static int read_scf_header(FILE *fp, Header *h)
+{
+    int i;
+
+    if (be_read_int_4(fp,&h->magic_number)==False) return (False);
+    if (be_read_int_4(fp,&h->samples)==False) return (False);
+    if (be_read_int_4(fp,&h->samples_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->bases)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_left_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_right_clip)==False) return (False);
+    if (be_read_int_4(fp,&h->bases_offset)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_size)==False) return (False);
+    if (be_read_int_4(fp,&h->comments_offset)==False) return (False);
+    if (fread(&h->version[0],sizeof(h->version),1,fp)!=1) return (False);
+    if (be_read_int_4(fp,&h->sample_size)==False) return (False);
+    if (be_read_int_4(fp,&h->code_set)==False) return (False);
+    for (i=0;i<20;i++)
+	if (be_read_int_4(fp,&h->spare[i])==False) return (False);
+    
+    return (True);
+}
+
+
+static int read_scf_sample1(FILE *fp, Samples1 *s)
+{
+    if (be_read_int_1(fp,&s->sample_A)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_C)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_G)==False) return (False);
+    if (be_read_int_1(fp,&s->sample_T)==False) return (False);
+    
+    return (True);
+}
+
+
+static int read_scf_sample2(FILE *fp, Samples2 *s)
+{
+    if (be_read_int_2(fp,&s->sample_A)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_C)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_G)==False) return (False);
+    if (be_read_int_2(fp,&s->sample_T)==False) return (False);
+    
+    return (True);
+}
+
+
+
+
+
+static int read_scf_base(FILE *fp, Bases *b)
+{
+    if (be_read_int_4(fp,&b->peak_index)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_A)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_C)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_G)==False) return (False);
+    if (be_read_int_1(fp,&b->prob_T)==False) return (False);
+    if (be_read_int_1(fp,(byte *)&b->base)==False) return (False);
+    if (be_read_int_1(fp,&b->spare[0])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[1])==False) return (False);
+    if (be_read_int_1(fp,&b->spare[2])==False) return (False);
+    
+    return (True);
+}
+
+
+
+static float parse_version(char version[])
+/*
+ *
+ */
+{
+    char v[5];
+    strncpy(v,version,4);v[4]='\0';
+    if (strspn(v,"0123456789. ")!=4) return 0.0;
+    return atof(v);
+
+}
+
+
+
+/* ---- Exports ---- */
+
+int is_SCF(char *fn)
+/*
+ * Check to see if file with name `fn' is in SCF format
+ * 
+ * Returns:
+ * 1 - is SCF format
+ * 0 - not SCF format
+ */
+{
+    FILE *fp;
+    uint_4 magic;
+    int ok;
+    
+    if ( (fp=fopen(fn,"rb")) == NULL) {
+	ok = 0;
+    } else {
+	if ( be_read_int_4(fp, &magic) != 1 ) {
+	    ok = 0;
+	} else {
+	    ok = (magic==SCF_MAGIC);
+	}
+	fclose(fp);
+    }
+    
+    return ok;
+}
+
+
+
+
+
+
+
+
+Seq readSeqSCF(char *fn)
+/*
+ * Read the SCF format sequence with name `fn' into `seq'.
+ * If bottom is True, we read the sequence in in reverse and
+ * complemented A NULL result indicates failure.
+ */
+{   Seq seq = NULLSeq;
+    FILE *fp;
+    Header header;
+    int i;
+    
+    /* Open for: reading, binary */
+    if ((fp = fopen(fn, "rb")) == NULL) return(NULLSeq);
+    
+    /* Read header */
+    if (fseek(fp, (off_t)0, 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    if (read_scf_header(fp,&header)==False) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    
+    /* Allocate the sequence */
+    if ((seq = newSeq(BasesAndTraces, header.samples, header.bases)) == NULLSeq)
+	{   fclose(fp);
+	    return(NULLSeq);
+	}
+    
+    
+    seq->mode         = BasesAndTraces;
+    seq->format       = SCFFormat;
+    seq->dirty        = False;
+    seq->maxTraceVal  = 0;
+    seq->NPoints      = header.samples;
+    seq->NorigBases   = header.bases;
+    seq->NedBases     = header.bases;
+    seq->leftCutoff   = header.bases_left_clip;
+    seq->rightCutoff  = header.bases_right_clip;
+    seq->bottom       = False;
+    
+    /* set precision if necessary */
+    if (parse_version(header.version)<VERSION) {
+	/* old 8-bit format */
+	header.sample_size = 1;
+    }
+    
+    /* read sample points */
+    if (fseek(fp, (off_t)(header.samples_offset), 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    for(i=0;i<seq->NPoints;i++) {
+	switch ( header.sample_size ) {
+	case 2:
+	    {
+		Samples2 sample;
+		if (read_scf_sample2(fp,&sample)==False) {
+		    fclose(fp);
+		    return(NULLSeq);
+		}
+		seq->traceA[i] = sample.sample_A;
+		seq->traceC[i] = sample.sample_C;
+		seq->traceG[i] = sample.sample_G;
+		seq->traceT[i] = sample.sample_T;
+		break;
+	    }
+	default:
+	    {
+		Samples1 sample;
+		if (read_scf_sample1(fp,&sample)==False) {
+		    fclose(fp);
+		    return(NULLSeq);
+		}
+		seq->traceA[i] = sample.sample_A;
+		seq->traceC[i] = sample.sample_C;
+		seq->traceG[i] = sample.sample_G;
+		seq->traceT[i] = sample.sample_T;
+		break;
+	    }
+	}
+	if (seq->maxTraceVal < seq->traceA[i]) seq->maxTraceVal = seq->traceA[i];
+	if (seq->maxTraceVal < seq->traceC[i]) seq->maxTraceVal = seq->traceC[i];
+	if (seq->maxTraceVal < seq->traceG[i]) seq->maxTraceVal = seq->traceG[i];
+	if (seq->maxTraceVal < seq->traceT[i]) seq->maxTraceVal = seq->traceT[i];
+    }
+    
+    
+    /* Read bases */
+    if (fseek(fp, (off_t)(header.bases_offset), 0) != 0) {
+	fclose(fp);
+	return(NULLSeq);
+    }
+    for(i=0;i<seq->NorigBases;i++) {
+	Bases base;
+	if (read_scf_base(fp,&base)==False) {
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	seq->base[i] = base.base;
+	seq->basePos[i] = base.peak_index;
+    }
+    
+    
+    /* Read selected comment information  */
+    {
+	char *comments;
+	
+	comments = (char *) malloc(header.comments_size+1);
+	
+	if (fseek(fp,(off_t)(header.comments_offset), 0) != 0) {
+	    free(comments);
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	if (fread(comments,header.comments_size,1,fp)==0) {
+	    free(comments);
+	    fclose(fp);
+	    return(NULLSeq);
+	}
+	comments[header.comments_size] = '\0';
+	
+	seq->info = comments;
+	
+    }
+    
+    
+    
+    /* Finished with the file */
+    fclose(fp);
+    
+    return(seq);
+}
diff --git a/src/ted/seqIOSCF.h b/src/ted/seqIOSCF.h
new file mode 100644
index 0000000..88c2e20
--- /dev/null
+++ b/src/ted/seqIOSCF.h
@@ -0,0 +1,50 @@
+#ifndef _seqIOSCF_h
+#define _seqIOSCF_h
+
+
+/* 
+    Title:       seqIOSCF
+
+    File: 	 seqIOSCF.h
+    Purpose:	 IO of ABI sequences
+    Last update: Tues Dec 11 1990
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+
+#include "seq.h"   /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Boolean */
+
+
+
+
+/* ---- Exports ---- */
+
+extern Seq readSeqSCF(char *fn);
+/*
+    Read the SCF format sequence with name `fn' into `seq'.
+    A NULLSeq result indicates failure.
+
+    enzString is the enzyme string that is to be used for 
+    searching for the left cutoff of sequence
+*/
+
+
+
+
+
+extern int is_SCF(char *fn);
+/*
+    Check to see if file with name `fn' is in SCF format
+
+    Returns:
+	1 - is SCF format
+	0 - not SCF format
+*/
+
+#endif  /*_seqIOSCF_h*/
diff --git a/src/ted/seqOutput.c b/src/ted/seqOutput.c
new file mode 100644
index 0000000..931c7b2
--- /dev/null
+++ b/src/ted/seqOutput.c
@@ -0,0 +1,230 @@
+/* 
+    Title:       seqOutput
+
+    File: 	 seqOutput.c
+    Purpose:	 Output of sequences
+    Last update: Monday April 8 1991
+    Change log:
+        27/11/90 SD     writeSeqABI() outputs header to sequence file:
+                        format: ;{noOfBases}{leftCutOff}{basesWritten}{type}{tra
+cefile}
+                        eg:     ;   867    45    383ABI a09b7.s1RES
+        28.11.90 SD  put undesirables under STLOUIS compilation flag
+        11.12.90 SD  new static function tail to find file name in path name
+	15.01.91 SD  new include file (opp.h)
+	08.04.91 SD  header information now written in write_header()
+	17.09.91 LFW changed STLOUIS flag to SAVE_EDITS to more accurately
+	             reflect the flag
+	07.10.91 SD  Removed width of file field in write_header()
+	26.11.91 SD  Added SCF format on output
+
+*/
+
+
+/*
+    This module should be regarded as part of `seq' since it is
+    privy to the internal structure of `Seq'.
+
+*/
+
+
+
+
+/* ---- Imports ---- */
+
+#include "seqIOEdit.h"
+#include "opp.h"      /* IMPORT: oppInitialise */
+#include "seq.h"      /* IMPORT: Seq, BasesAndTraces, NULLSeq,
+			         newSeq, freeSeq */
+#include <ctype.h>
+#include <stdio.h>    /* IMPORT: fopen, fclose, fseek, ftell, fgetc,
+		                 EOF */
+/*#include <string.h>*/
+
+
+
+
+/* ---- Constants ---- */
+
+#define BasesPerLine 50 /* For output formatting */
+
+/* ---- Private Functions ---- */
+static char *tail (char *pathname)
+{
+    char *a;
+    if ((a = (char *) strrchr(pathname,'/'))==NULL)
+	a = pathname;
+    else
+	a++;
+    return a;
+}
+
+static void write_header(Seq seq, char *seqName, FILE *fp)
+/*
+Write out a header with the sequence.
+The header consists of a line describing the cut off sequence, plus the
+actual discarded sequences. The format is:
+
+    ;header description
+    ;<left cutoff sequence (possibly several lines)
+    ;>right cutoff sequence (possibly several lines)
+
+*/
+{
+    int baseNum,lineLen;
+
+
+    if (seq->bottom) {
+	/* this is a complemented sequence */
+	fprintf (fp,";%6d%6d%6d%-4s%s\n",
+		    seq->NedBases,
+		    seq->rightCutoff,
+		    seq->NedBases - seq->rightCutoff - seq->leftCutoff,
+		    (seq->format==ABIFormat)?"ABI":
+		    (seq->format==ALFFormat)?"ALF":
+		    (seq->format==PlainFormat)?"PLN":
+		    (seq->format==SCFFormat)?"SCF":
+		    "   ",
+		    tail(seqName));
+
+	opp['N']='-';
+
+	/* output left cut off */
+	lineLen = 0;
+	for (baseNum=seq->NedBases-1;
+	    baseNum>=seq->NedBases-seq->rightCutoff;
+	    baseNum--) {
+	    if (! lineLen) fprintf(fp, ";<");
+	    fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+	/* output right cut off */
+	lineLen = 0;
+	for (baseNum=seq->leftCutoff-1;
+	    baseNum>=0;
+	    baseNum--) {
+	    if (! lineLen) fprintf(fp, ";>");
+	    fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+
+    } else {
+	/* this is a sequence in its normal orientation */
+	fprintf (fp,";%6d%6d%6d%-4s%s\n",
+		    seq->NedBases,
+		    seq->leftCutoff,
+		    seq->NedBases - seq->rightCutoff - seq->leftCutoff,
+		    (seq->format==ABIFormat)?"ABI":
+		    (seq->format==ALFFormat)?"ALF":
+		    (seq->format==PlainFormat)?"PLN":
+		    (seq->format==SCFFormat)?"SCF":
+		    "   ",
+		    tail(seqName));
+
+	opp['N']='-';
+
+	/* output left cut off */
+	lineLen = 0;
+	for (baseNum=0;
+	    baseNum<seq->leftCutoff;
+	    baseNum++) {
+	    if (! lineLen) fprintf(fp, ";<");
+	    fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+	/* output right cut off */
+	lineLen = 0;
+	for (baseNum=seq->NedBases-seq->rightCutoff;
+	    baseNum<seq->NedBases;
+	    baseNum++) {
+	    if (! lineLen) fprintf(fp, ";>");
+	    fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine) {
+		fputc('\n', fp);
+		lineLen = 0;
+	    }
+	}
+	if (lineLen) fputc('\n', fp);
+
+    }
+
+}
+
+
+
+
+Boolean writeSeq(Seq seq, char *fn, char *seqName, Boolean includeHeader)
+/*
+    Write the clipped, edited part of the ABI format sequence `seq'
+    into file `fn'. The result indicates success.
+    Currently, this just writes the bases out as text.
+*/
+
+{   FILE *fp;
+    int baseNum, lineLen;
+
+    /* initialize the complement array */
+    oppInitialize();
+
+    /* Open for writing, text */
+    if ((fp=fopen(fn, "w")) == NULL) return(False);
+
+    lineLen = 0;
+
+    /* write header */
+    if (includeHeader) write_header(seq,seqName,fp);
+
+    if (seq->bottom) { /* this if seq-> bottom section
+			  added by lfw */
+      opp['N']='-';
+
+      for (baseNum=(seq->NedBases-seq->rightCutoff)-1;
+	   baseNum>=seq->leftCutoff;
+	   baseNum--)
+	{   fputc(opp[getBase(seq, EdBases, baseNum)], fp);
+	    if (++lineLen == BasesPerLine)
+	      {   fputc('\n', fp);
+		  lineLen = 0;
+		}
+	  }
+    }
+    else {
+      for (baseNum=seq->leftCutoff;
+	   baseNum<seq->NedBases-seq->rightCutoff;
+	   baseNum++)
+	{   fputc(getBase(seq, EdBases, baseNum), fp);
+	    if (++lineLen == BasesPerLine)
+	      {   fputc('\n', fp);
+		  lineLen = 0;
+		}
+	  }
+    }
+
+    if (lineLen != 0) fputc('\n', fp);
+
+    (void) fclose(fp);
+
+#ifdef SAVE_EDITS
+    if (writeEdSeq(seq,seqName)) return(True);
+    else return(False);
+#else
+    return(True);
+#endif
+}
+
+
diff --git a/src/ted/seqOutput.h b/src/ted/seqOutput.h
new file mode 100644
index 0000000..79f3d18
--- /dev/null
+++ b/src/ted/seqOutput.h
@@ -0,0 +1,18 @@
+#ifndef _seqOutput_h
+#define _seqOutput_h
+/* 
+    Title:       seqOutput
+
+    File: 	 seqOutput.h
+    Purpose:	 Output of sequences
+
+*/
+
+extern Boolean writeSeq(Seq seq, char *fn, char *seqName,Boolean includeHeader);
+/*
+    Write the clipped, edited part of the ABI format sequence `seq'
+    into file `fn'. `seqName' is for sequence header information.
+    The result indicates success.
+    Currently, this just writes the bases out as text.
+*/
+#endif  /*_seqOutput_h*/
diff --git a/src/ted/seqRead.c b/src/ted/seqRead.c
new file mode 100644
index 0000000..3862da2
--- /dev/null
+++ b/src/ted/seqRead.c
@@ -0,0 +1,131 @@
+#include <stdio.h>
+#include "seq.h"
+#include "opp.h"
+#include "seqIOABI.h"   /* IMPORT: readSeqABI   */
+#include "seqIOALF.h"   /* IMPORT: readSeqALF   */
+#include "seqIOSCF.h"   /* IMPORT: readSeqSCF   */
+#include "seqIOPlain.h"   /* IMPORT: readSeqPlain   */
+#include "traceDisplay.h" /* IMPORT: setScaleFactor */
+
+
+
+Seq readSeq(char *fn, int bottom, char *enzString, char *format)
+{
+    Seq seq;
+#ifdef SAVE_EDITS
+    int dotnum=-1;
+#endif /*SAVE_EDITS*/
+    FILE *fp;
+    /* the number of the edited sequence the
+       user wishes to read in.  It's a -1 if the
+       user wishes to read the most recent sequence */
+    int found_edited_sequence=0; /* indicates if there was a correct .n file to
+				    read when the program went to read 
+				    the edited sequence file */
+
+
+    
+    if ( (fp=fopen(fn,"r")) == NULL) {
+	
+	/*
+	 * 920917 this chunk of code was moved by lfw 
+	 * so that ted first tries to open up the filename
+	 * as given by the user...if that is not a trace
+	 * file then it goes ahead and strips off the dotnum
+	 * and tries to open that
+	 */
+	
+#ifdef SAVE_EDITS
+	/*
+	 * check to see if the file the user asked to read in is	
+	 * a .1 or .2 or .m; if it is they must have wanted to
+	 * read in a specific edited sequence
+	 */
+	
+	dotnum = isDotNum(fn);
+	if (dotnum != -1) stripDotNum(fn);
+#else
+	return(NULLSeq);
+#endif
+	
+    } else {
+	fclose(fp);
+    }
+
+    /*
+     * read in the sequence
+     */
+    if (strcmp(format, "plainFmt") == 0)
+	seq = readSeqPlain(fn);
+    else if (strcmp(format, "abiFmt") == 0)
+	seq = readSeqABI(fn);
+    else if (strcmp(format, "alfFmt") == 0)
+	seq = readSeqALF(fn);
+    else if (strcmp(format, "scfFmt") == 0)
+	seq = readSeqSCF(fn);
+
+    /*
+     * further processing
+     */
+    if (seq != NULLSeq) {
+
+	int i;
+
+	/*  
+	  
+	  Go ahead and Initialise the edited sequence
+	  with orig bases, then go to read the Edited
+	  sequence to override those bases which have been
+	  changed
+	  
+	  */
+	
+	
+	/* All ed bases are the same are orig bases */
+	for (i=0;i<seq->NorigBases;i++) seq->edits[i] = i;
+	
+	/* The table of edits has all its entries free */
+	for (i=0;i<MaxEdits;i++) seq->edBasePos[i] = NULLPoint;
+	
+
+
+
+	found_edited_sequence=0;
+	
+#ifdef SAVE_EDITS
+	found_edited_sequence=readEdSeq(seq,fn,dotnum);
+#endif
+	
+#ifdef AUTO_CLIP
+	if (!found_edited_sequence &&
+	    seq->leftCutoff == 0 &&
+	    seq->rightCutoff == 0) {
+	    
+	    
+	    if (seq->bottom) {
+		seq->rightCutoff = findLeftCutoff(seq,enzString);
+		seq->leftCutoff = findRightCutoff(seq);
+	    }
+	    else {
+		seq->leftCutoff = findLeftCutoff(seq,enzString);
+		seq->rightCutoff = findRightCutoff(seq);
+	    }
+	    
+	}
+#endif
+    }
+
+    /* initialize the complement array */
+    oppInitialize();
+
+    if (bottom) complement_seq(seq);
+
+    /* set default scale factor */
+    setScaleFactor(1.0);
+    return seq;
+
+}
+
+
+
+
diff --git a/src/ted/seqRead.h b/src/ted/seqRead.h
new file mode 100644
index 0000000..ff5d4a0
--- /dev/null
+++ b/src/ted/seqRead.h
@@ -0,0 +1,6 @@
+#ifndef _SEQREAD_H_
+#define _SEQREAD_H_
+
+extern Seq readSeq(char *fn, int bottom, char *enzString, char *format);
+
+#endif /*_SEQREAD_H_*/
diff --git a/src/ted/ted.c b/src/ted/ted.c
new file mode 100644
index 0000000..fcb3f9a
--- /dev/null
+++ b/src/ted/ted.c
@@ -0,0 +1,716 @@
+/*
+    Title: 	 ted
+
+    File: 	 ted.c
+    Purpose:	 Main module
+    Last update: 23 June 1992
+
+    Change log:
+
+        28.11.90 SD  put undesirables under STLOUIS compilation flag
+	22.07.91 LFW put in -enzyme as a command line option for STLOUIS
+	30.07.91 SD fixed bug in MRC switch
+	17.09.91 LFW changed MRC switch to !AUTO_CLIP and STLOUIS flag
+	         to DEF_OUT to more accurately reflect the use of the flags
+	26.11.91 SD  Added SCF format
+	23-Jun-92 SD Some ugly code in DEF_OUT option
+*/
+
+
+/*
+    This module contains the C language entry point `main' and
+    initialisation for the X system.
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "dialogues.h" /* IMPORT: inputSeq, outputSeq, quitApplication */
+#include "display.h"   /* IMPORT: createDisplay */
+#include "help.h"      /* IMPORT: createHelpShell, help */
+#include "seq.h"       /* IMPORT: Seq, NULLBaseNum */
+#include "seqIOEdit.h" /* IMPORT: isDotNum, stripDotNum */
+
+#include <stdio.h>     /* IMPORT: stderr, fprintf */
+#include "mystdlib.h"  /* IMPORT: exit */
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Command.h>
+#include <X11/Xaw/Form.h>
+#include <X11/Xaw/Label.h>
+
+
+/* ---- Static variables ---- */
+
+
+static String fallback_resources[] =
+{   /* The application */
+    "Xted.minWidth:  550",
+    "Xted.minHeight: 250",
+
+    "Xted*magnif: 90",
+
+    /* Labels have no borders and are not resized */
+    "Xted*Label.borderWidth: 0",
+    "Xted*Label.top:         ChainTop",
+    "Xted*Label.bottom:      ChainTop",
+    "Xted*Label.left:        ChainLeft",
+    "Xted*Label.right:       ChainLeft",
+
+    /* Buttons are not resized */
+    "Xted*Command.top:         ChainTop",
+    "Xted*Command.bottom:      ChainTop",
+    "Xted*Command.left:        ChainLeft",
+    "Xted*Command.right:       ChainLeft",
+
+    /* All the toggles are one-of-many */
+    "Xted*Toggle.Translations: #override \\n\
+         <EnterWindow>:        highlight(Always) \\n\
+         <LeaveWindow>:        unhighlight()     \\n\
+         <Btn1Down>,<Btn1Up>: set() notify()",
+
+    "Xted.mainForm.title.label: Trace editor",
+    "Xted.mainForm.version.label: \
+     June 1991",
+    "Xted.mainForm.seqNamePrompt.label: Input:",
+    "Xted.mainForm.seqNamePrompt.height: 13",
+    "Xted.mainForm.seqName.label:       NONE",
+    "Xted.mainForm.seqName.resizable:   True",
+    "Xted.mainForm.seqName.height: 13",
+    "Xted.mainForm.NorigBasesPrompt.label: Bases:",
+    "Xted.mainForm.NorigBasesPrompt.height: 13",
+    "Xted.mainForm.NorigBases.label:       ",
+    "Xted.mainForm.NorigBases.resizable:   True",
+    "Xted.mainForm.strand.label: Strand: Top",
+    "Xted.mainForm.strand.resizable: True",
+
+
+    /* Display forms have no border */
+    "Xted.mainForm.Form.borderWidth:     0",
+
+     /* The edit mode toggles */
+    "Xted.mainForm.Toggle.top:    ChainTop",
+    "Xted.mainForm.Toggle.bottom: ChainTop",
+    "Xted.mainForm.Toggle.left:   ChainLeft",
+    "Xted.mainForm.Toggle.right:  ChainLeft",
+    "Xted.mainForm.edmodelab.label: Mode:",
+    "Xted.mainForm.adjustL.label:   Adj left cut",
+    "Xted.mainForm.edSeq.label:     Edit seq",
+    "Xted.mainForm.adjustR.label:   Adj right cut",
+    "Xted.mainForm.edSeq.state:     True", /* Default toggle */
+    "Xted.mainForm.adjustL.height:   13",
+    "Xted.mainForm.edSeq.height:     13",
+    "Xted.mainForm.adjustR.height:   13",
+
+    /* The magnification slider */
+    "Xted.mainForm.maglab.top:    ChainTop",
+    "Xted.mainForm.maglab.bottom: ChainTop",
+    "Xted.mainForm.maglab.left:   ChainLeft",
+    "Xted.mainForm.maglab.right:  ChainLeft",
+    "Xted.mainForm.maglab.label: Mag:",
+    "Xted.mainForm.maglab.height: 13",
+      
+    "Xted.mainForm.magscr.top:    ChainTop",
+    "Xted.mainForm.magscr.bottom: ChainTop",
+    "Xted.mainForm.magscr.left:   ChainLeft",
+    "Xted.mainForm.magscr.right:  ChainLeft",
+    "Xted.mainForm.magscr.orientation: horizontal",
+    "Xted.mainForm.magscr.length: 100",
+    "Xted.mainForm.magscr.height: 13",
+
+    "Xted.mainForm.scaleDown.label: Scale down",
+    "Xted.mainForm.scaleDown.height: 13",
+    "Xted.mainForm.scaleUp.label: Scale up",
+    "Xted.mainForm.scaleUp.height: 13",
+    "Xted.mainForm.switch.label: Switch",
+    "Xted.mainForm.switch.height: 13",
+
+    /* The main display: a viewport containing a form containing graphs */
+    "Xted.mainForm.viewport.top:    ChainTop",
+    "Xted.mainForm.viewport.bottom: ChainBottom",
+    "Xted.mainForm.viewport.left:   ChainLeft",
+    "Xted.mainForm.viewport.right:  ChainRight",
+    "Xted.mainForm.viewport.allowHoriz: True",
+    "Xted.mainForm.viewport.forcebars:  True",
+    "Xted.mainForm.viewport.width:      700",
+    "Xted.mainForm.viewport.vpForm.Graph.resizable: True",
+    "Xted.mainForm.viewport.vpForm.Graph.top:    ChainTop",  
+    "Xted.mainForm.viewport.vpForm.Graph.bottom: ChainTop",
+    "Xted.mainForm.viewport.vpForm.Graph.left:   ChainLeft",  
+    "Xted.mainForm.viewport.vpForm.Graph.right:  ChainRight",
+    "Xted.mainForm.viewport.vpForm.trace.top:    ChainTop",  
+    "Xted.mainForm.viewport.vpForm.trace.bottom: ChainBottom",
+    "Xted.mainForm.viewport.vpForm.Graph.font: -misc-*-bold-*15-*",
+    /* The text graph heights are dynamically set to accomodate the font */
+    "Xted.mainForm.viewport.vpForm.trace.height: 150",
+    /* The graphs are dynamically initially set to fill the viewport */
+    "Xted.mainForm.viewport.vpForm.Graph.dimBackground: grey",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour1:  green",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour2:  purple",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour3:  black",
+    "Xted.mainForm.viewport.vpForm.Graph.graphColour4:  red",
+
+    /* Dummy widget - we only care about its width */
+    "Xted.mainForm.dummy.top:    ChainTop",
+    "Xted.mainForm.dummy.bottom: ChainBottom",
+    "Xted.mainForm.dummy.left:   ChainLeft",
+    "Xted.mainForm.dummy.right:  ChainRight",
+    "Xted.mainForm.dummy.width:  700",
+
+
+    /* Dialogues */
+    "Xted*input.ioForm.ioPrompt.label:   Input a file",
+    "Xted*search.ioForm.ioPrompt.label:  Search for string or base number",
+    "Xted*output.ioForm.ioPrompt.label:  Save a file",
+    "Xted*check.ioForm.ioPrompt.label:   \
+The existing sequence has been altered.\\n\
+Do you really want to overwrite it?",
+    "Xted*checkwrite.ioForm.ioPrompt.label:   \
+This output file already exists.\\n\
+Do you really want to overwrite it?",
+    "Xted*quit.ioForm.ioPrompt.label:    \
+The existing sequence has been altered.\\n\
+Do you really want to quit?",
+    "Xted*check.ioForm..translations:   #override \\n\
+         <Key>Return: CRAction()",
+    "Xted*quit.ioForm..translations:    #override \\n\
+         <Key>Return: CRAction()",
+
+    "Xted*ioForm.namePrompt.label:       File name:",
+    "Xted*ioForm.nameVal.width:          200",
+    "Xted*ioForm.nameVal.translations:   #override \\n\
+         <Key>Return: CRAction()",
+    "Xted*ioForm.formatPrompt.label:     File format:",
+    "Xted*ioForm.formatBox.borderWidth:  0",
+    "Xted*ioForm.formatBox.orientation:  horizontal",
+    "Xted*ioForm.formatBox.plainFmt.label: Plain",
+    "Xted*ioForm.formatBox.abiFmt.label:   ABI",
+    "Xted*ioForm.formatBox.abiFmt.state:   True", /* Default toggle */
+    "Xted*ioForm.formatBox.alfFmt.label:   ALF",
+    "Xted*ioForm.formatBox.alfFmt.state:   False", 
+    "Xted*ioForm.formatBox.scfFmt.label:   SCF",
+    "Xted*ioForm.formatBox.scfFmt.state:   False", 
+    "Xted*ioForm.formatBox.bottomFmt.label: Bottom",
+    "Xted*ioForm.formatBox.topFmt.label:   Top",
+    "Xted*ioForm.formatBox.topFmt.state:   True", /* Default toggle */
+    /* The `default' button for the dialogue has a wider border */
+    "Xted*input.ioForm.OK.borderWidth:     3",
+    "Xted*output.ioForm.OK.borderWidth:    3",
+    "Xted*check.ioForm.Cancel.borderWidth: 3",
+    "Xted*quit.ioForm.Cancel.borderWidth:  3",
+
+    /* Help shell widget */
+    "Xted.Help.width:  525",
+    "Xted.Help.height: 200",
+    "Xted.Help.minWidth: 525",
+    "Xted.Help.maxWidth: 525",
+    "Xted.Help.minHeight: 150",
+    "Xted.Help.title: ted help",
+    "Xted.Help.form.rmButton.label:  Remove help",
+    "Xted.Help.form.rmButton.top:    ChainTop",
+    "Xted.Help.form.rmButton.bottom: ChainTop",
+    "Xted.Help.form.rmButton.left:   ChainLeft",
+    "Xted.Help.form.rmButton.right:  ChainLeft",
+    "Xted.Help.form.text.top:    ChainTop",
+    "Xted.Help.form.text.bottom: ChainBottom",
+    "Xted.Help.form.text.left:   ChainLeft",
+    "Xted.Help.form.text.right:  ChainRight",
+
+    NULL,
+};
+
+/*
+     The command line arguments and how to parse them into resources.
+*/
+static XrmOptionDescRec options[] =
+{
+    {"-PLN",     "PLN",     XrmoptionSepArg, NULL},
+    {"-ABI",     "ABI",     XrmoptionSepArg, NULL},
+    {"-ALF",     "ALF",     XrmoptionSepArg, NULL},
+    {"-SCF",     "SCF",     XrmoptionSepArg, NULL},
+    {"-output",  "output",  XrmoptionSepArg, NULL},
+    {"-baseNum", "baseNum", XrmoptionSepArg, NULL},
+    {"-mag", "mag", XrmoptionSepArg, NULL},
+    {"-astring", "astring", XrmoptionSepArg, NULL},
+    {"-bottom", "bottom", XrmoptionSepArg, NULL},
+    {"-raw", "raw", XrmoptionSepArg, NULL},
+    {"-enzyme","enzyme",XrmoptionSepArg,NULL},
+      
+};
+
+/*
+    Where to put the command line arguments when we dig them
+    out of the resources.
+*/
+typedef struct
+{   String plain;
+    String ABI;
+    String ALF;
+    String SCF;
+    String output;
+    int baseNum;
+    int mag;
+    String astring;
+    int bottom;
+    String raw;
+    String enzyme;
+} AppResources;
+
+/*
+    Dig the command line arguments out of resources and into
+    the above structure.
+*/
+static XtResource tedResources[] =
+{
+    {"PLN", "Plain", XtRString, sizeof(String),
+     XtOffset(AppResources *, plain), XtRImmediate, NULL},
+    {"ABI", "ABI", XtRString, sizeof(String),
+     XtOffset(AppResources *, ABI), XtRImmediate, NULL},
+    {"ALF", "ALF", XtRString, sizeof(String),
+     XtOffset(AppResources *, ALF), XtRImmediate, NULL},
+    {"SCF", "SCF", XtRString, sizeof(String),
+     XtOffset(AppResources *, SCF), XtRImmediate, NULL},
+    {"output", "output", XtRString, sizeof(String),
+     XtOffset(AppResources *, output), XtRImmediate, NULL},
+    {"baseNum", "baseNum", XtRInt, sizeof(int),
+     XtOffset(AppResources *, baseNum), XtRImmediate, (XtPointer) NULLBaseNum},
+    {"mag", "mag", XtRInt, sizeof(int),
+     XtOffset(AppResources *, mag), XtRImmediate, NULL},
+    {"astring", "astring", XtRString, sizeof(String),
+     XtOffset(AppResources *, astring), XtRImmediate, NULL},
+    {"bottom", "bottom", XtRInt, sizeof(int),
+     XtOffset(AppResources *, bottom), XtRImmediate, NULL},
+    {"raw", "raw", XtRString, sizeof(String),
+     XtOffset(AppResources *, raw), XtRImmediate, NULL},
+    {"enzyme", "enzyme", XtRString, sizeof(String),
+     XtOffset(AppResources *, enzyme), XtRImmediate, NULL},
+};
+
+
+
+typedef struct
+{   int magnif;
+    String enz;
+} AppData, *AppDataPtr;
+
+
+#define XtNmagnif "magnif"
+#define XtCMagnif "Magnif"
+#define XtNenz "enz"
+#define XtCEnz "Enz"
+
+static XtResource resources[] = {
+  { XtNmagnif,
+    XtCMagnif,
+    XtRInt,
+    sizeof(int),
+    XtOffset(AppDataPtr, magnif),
+    XtRImmediate,
+    (caddr_t)30  /* default is 30, although you can override that from
+		    the app-defaults file or on the command line */
+    },
+  { XtNenz,
+    XtCEnz,
+    XtRString,
+    sizeof(String),
+    XtOffset(AppDataPtr, enz),
+    XtRImmediate,
+    NULL
+    }
+    };
+
+
+/* ---- Callbacks ---- */
+
+
+static void inputCallback(Widget widget,
+			  XtPointer client_data, XtPointer call_data)
+{   inputSeq(widget);
+}
+
+
+static void searchCallback(Widget widget,
+			  XtPointer client_data, XtPointer call_data)
+{   inputSearchString(widget);
+}
+
+
+
+static void outputCallback(Widget widget,
+			   XtPointer client_data, XtPointer call_data)
+/*
+   client_data contains the default (NULL) or command line specified
+   output file name to be used.
+*/
+{
+
+/* this next statement was added by lfw 10/16/90, to assure that
+   a default output filename of inputfilename.seq will be assigned.*/
+
+    if (client_data!=NULL) 
+	sprintf(o_fn,"%s",client_data);
+
+#ifdef DEF_OUT
+    if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+
+    if (!isDotSeq(o_fn))
+	strcat(o_fn,".seq");
+    /*
+    ** This is an aweful way of accomplishing something very simple
+      { char junk[strlen(o_fn)+5];
+	strcpy(junk,o_fn);
+	sprintf(o_fn,"%s.seq",junk);
+      }
+    */
+#endif
+
+    outputSeq(widget, (String) o_fn);
+}
+
+
+static void helpCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{
+    help();
+}
+
+static void infoCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{
+    information(widget);
+}
+
+
+static void quitCallback(Widget widget,
+			 XtPointer client_data, XtPointer call_data)
+{   /* Quit the application */
+    quitApplication(widget);
+}
+
+
+
+
+/* ---- Internal functions ---- */
+
+
+static void usage(int argc, char **argv)
+{   int i;
+
+    if (argc != 1)
+    {   fprintf(stderr, "%s: unknown option%s:", argv[0], (argc>2)?"s":"");
+	for (i=1; i<argc; i++)
+	    fprintf(stderr, " %s", argv[i]);
+	fprintf(stderr, "\n\n");
+    }
+
+    fprintf(stderr,
+"usage: %s\n", argv[0]);
+    fprintf(stderr,
+"       [(-PLN | -ABI | -ALF | -SCF) {tracefilename} [-baseNum {number}] [-mag {number, 1 to 100}] [-bottom {1(true) or 0(false)}]\n");
+    fprintf(stderr,
+"       [-output {outputfilename}]\n");
+    fprintf(stderr,
+"       [-astring {sequence}]\n");
+    fprintf(stderr,
+"       [-enzyme {5' cutting sequence}]\n");
+    fprintf(stderr,
+"       [-raw {filename to be placed at head of xdap compatible .seq file}]\n");
+    fprintf(stderr, 
+"       [-geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]]\n");
+    fprintf(stderr,
+"       [-display [{host}]:[{vs}]]\n");
+    fprintf(stderr,
+"       [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]\n");
+}
+
+
+
+
+/* ---- Exported functions ---- */
+
+
+void main(unsigned int argc, char **argv)
+{   XtAppContext app_con;
+    AppResources app_resources;
+    Widget toplevelWid, mainFormWid;
+    /* Widget titleWid, Widget versionWid; */
+    Widget seqNameWid, seqNamePromptWid;
+    Widget NorigBasesWid, NorigBasesPromptWid,strandWid;
+    Widget comWid;
+    Arg args[10];
+    int nargs;
+    AppData app_data;
+
+    /*
+        Setup X and the toplevel widget
+    */
+    toplevelWid = XtAppInitialize(&app_con, "Xted",
+				  options, XtNumber(options),
+				  (int *)&argc, argv,
+				  fallback_resources,
+				  NULL, (Cardinal) 0);
+
+
+    /*
+         Transfer the command line arguments into app_resources.
+    */
+    XtGetApplicationResources(toplevelWid, (XtPointer) &app_resources,
+			      tedResources, XtNumber(tedResources),
+			      NULL, 0);
+
+    XtGetApplicationResources(toplevelWid, (XtPointer) &app_data,
+			      resources, XtNumber(resources),
+			      NULL, 0);
+
+#ifndef AUTO_CLIP
+    app_data.enz = NULL;
+#endif
+
+
+    /*
+        Die if bad options given.
+    */
+    if ((argc != 1)                          ||
+	(app_resources.plain==NULL &&
+	 app_resources.ABI==NULL   &&
+	 app_resources.ALF==NULL   &&
+	 app_resources.SCF==NULL   &&
+	 (app_resources.baseNum!=NULLBaseNum ||
+	 app_resources.astring!=NULL)) ||
+	(app_resources.plain!=NULL && app_resources.ABI!=NULL) ||
+	(app_resources.astring!=NULL && app_resources.baseNum!=NULLBaseNum))
+	
+    {   usage(argc, argv);
+        
+	XtDestroyApplicationContext(app_con);
+	exit(1);
+    }
+
+
+    /*
+        Create the widgets
+    */
+    mainFormWid = XtCreateManagedWidget("mainForm", formWidgetClass,
+					toplevelWid,
+					NULL, (Cardinal) 0);
+    
+    /* Title and version */
+/*    nargs = 0;
+    titleWid = XtCreateManagedWidget("title",  labelWidgetClass,
+				     mainFormWid, args, nargs);
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, titleWid); nargs++;
+    versionWid = XtCreateManagedWidget("version",  labelWidgetClass,
+				       mainFormWid, args, nargs);
+*/    
+    /* Current sequence name */
+    nargs = 0;
+    seqNamePromptWid = XtCreateManagedWidget("seqNamePrompt",
+					     labelWidgetClass, mainFormWid,
+					     args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, seqNamePromptWid); nargs++;
+    seqNameWid = XtCreateManagedWidget("seqName",  labelWidgetClass,
+				       mainFormWid, args, nargs);
+    
+    /* Original number of bases */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, seqNameWid); nargs++;
+    NorigBasesPromptWid = XtCreateManagedWidget("NorigBasesPrompt",
+						labelWidgetClass, mainFormWid,
+						args, nargs);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, NorigBasesPromptWid); nargs++;
+    NorigBasesWid = XtCreateManagedWidget("NorigBases", labelWidgetClass,
+					  mainFormWid, args, nargs);
+
+    /* Buttons */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, NorigBasesWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Input",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, inputCallback,  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Output",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, outputCallback,
+		  (XtPointer) app_resources.output);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("search",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, searchCallback,
+		  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Help",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, helpCallback,  NULL);
+    
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Information",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, infoCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    comWid = XtCreateManagedWidget("Quit",  commandWidgetClass,
+				   mainFormWid, args, nargs);
+    XtAddCallback(comWid, XtNcallback, quitCallback,  NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromHoriz, comWid); nargs++;
+    XtSetArg(args[nargs], XtNheight, 13); nargs++;
+    strandWid = XtCreateManagedWidget("strand",  labelWidgetClass,
+				   mainFormWid, args, nargs);
+
+
+    createDisplay(mainFormWid, seqNameWid);
+
+
+    /* Create the help shell, undisplayed. */
+    (void) createHelpShell(toplevelWid);
+    
+    /*
+        Realize everything.
+    */
+    XtRealizeWidget(toplevelWid);
+
+
+    /*
+        Setup initial sequence, if any.
+	Note that we have already filtered out multiple formats.
+    */
+
+    /*
+    ** set up raw file name for output
+    */
+    if (app_resources.raw!=NULL) {
+        strcpy (r_fn,app_resources.raw);
+    } else {
+        r_fn[0] = '\0';
+    }
+
+    if (app_resources.bottom!=NULL) {
+      if (app_resources.bottom==1) {
+	nargs = 0;
+	XtSetArg(args[nargs], XtNlabel, "Strand: Bottom"); nargs++;
+	XtSetValues(strandWid, args, nargs);
+      }
+    }
+
+
+    if (app_resources.plain!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.plain);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.enzyme == NULL) app_resources.enzyme = app_data.enz;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+
+      initialDisplayedSeq(toplevelWid, "plainFmt",
+			  app_resources.plain, app_resources.baseNum , app_resources.mag, 
+			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+    }
+
+
+
+    if (app_resources.ABI!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.ABI);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.enzyme == NULL) app_resources.enzyme = app_data.enz;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "abiFmt",
+			  app_resources.ABI, app_resources.baseNum , app_resources.mag,
+			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+
+    if (app_resources.ALF!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.ALF);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "alfFmt",
+			  app_resources.ALF, app_resources.baseNum, app_resources.mag,			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+    if (app_resources.SCF!=NULL) {
+#ifdef DEF_OUT
+      /* set up a default output filename */
+      strcpy(o_fn,app_resources.SCF);
+      if (isDotNum(o_fn) != -1) stripDotNum(o_fn);
+      if (!isDotSeq(o_fn))
+	sprintf(o_fn,"%s.seq",o_fn);
+#endif
+
+      /* set the default magnification to 30, the default baseNum to 1 */
+      if (app_resources.bottom == NULL) app_resources.bottom = 0;
+      else app_resources.bottom = 1;
+
+      if (app_resources.mag == NULL) app_resources.mag = app_data.magnif;
+      if (app_resources.baseNum == -1 && app_resources.astring==NULL) app_resources.baseNum = 1;
+      initialDisplayedSeq(toplevelWid, "scfFmt",
+			  app_resources.SCF, app_resources.baseNum, app_resources.mag,			  app_resources.astring,app_resources.enzyme,app_resources.bottom);
+
+    }
+
+
+    /*
+        Set everything running.
+    */
+    XtAppMainLoop(app_con);
+}
+
diff --git a/src/ted/ted.help b/src/ted/ted.help
new file mode 100644
index 0000000..bbea1bf
--- /dev/null
+++ b/src/ted/ted.help
@@ -0,0 +1,168 @@
+Trace Editor Help
+-----------------
+
+The ted trace editor is a prototype to allow the display and editing
+of traces from sequencing machines, and the simple editing of plain
+sequences. It runs under the X window system. It provides simultaneous
+display of traces and bases. The editing allows individual bases to be
+removed and new ones added, and also a range of bases at either end to
+be cutoff. Currently, only ABI result files and plain sequences are
+accepted.
+
+Only one trace can be edited at a time.
+
+
+Invocation
+----------
+
+ted can be run from the command line by simply typing:
+
+    ted
+
+It will come up with no sequence initially displayed. If provided with
+any arguments it does not understand, or invalid combinations of
+arguments, ted will exit with a message indicating its intended usage.
+ted accepts the standard X arguments allowing, for example, background
+colour or geometry to be specified. ted can accept an argument
+specifying an initial file to display. The key for this is the format
+of the file, for example:
+
+    ted -ABI {ABI format filename}
+    ted -plain {plain format filename}
+
+The file is then displayed at 50% magnification, with the caret 
+initially positioned at the first base.
+
+When an initial file is given, a base number of interest and/or a
+magnification can also be given, for example:
+
+    ted -ABI {ABI format file} -baseNum 280 -mag 30
+
+or the bottom strand may be specified:
+ 
+    ted -ABI {ABI format file} -baseNum 280 -mag 30 -bottom 1
+	or
+    ted -ABI {ABI format file} -bottom 1
+
+or a string of nucleotides on which the center the window:
+	
+    ted -ABI {ABI format file} -astring 1
+	or
+    ted -ABI {ABI format file} -astring 1 -mag 30 -bottom 1
+
+Options can be specified in any order.
+
+An output filename can be specified in a similar manner:
+
+    ted -ABI inputfilename -output outputfilename
+
+The default output filename is inputfilename.seq
+
+If you are running the program on a remote machine, you must
+specify a display parameter:
+
+    ted -display machine_name:0.2
+
+You can also specify the size of the opening window or
+other screen parameters by the following:
+
+    ted  -geometry [{width}][x{height}][{+-}{xoff}[{+-}{yoff}]]
+	   [-fg {color}] [-bg {color}] [-bd {color}] [-bw {pixels}]
+
+Displays
+--------
+
+When running, ted displays the name of the file it is currently
+operating on (if any) and the original number of bases.
+
+A so-called viewport presents four different synchronised views of
+part of the trace. The top one indicates the sequence indices - the
+first digit of the number if positioned over the base to which that 
+number corresponds.  Below this is a list of the bases as originally
+found in the file (this is the interpretation of the trace as made by
+the sequencing machine). Below this is the list of bases as edited by
+the user --- initially, if this file has not been edited in the past,
+this is identical to the list of original bases.  However, if in a 
+previous session the user has edited this sequence, the edited
+version of the sequence will appear in the edit window. 
+The final display is of the traces produced by the sequencing
+machine for the four respective bases.
+
+Two controls allow the view presented to be adjusted: both are
+horizontal sliders or scrollbars. The first affects the magnification
+at which the trace is viewed. The minimum magnification is such that
+the whole of the trace is visible within the viewport; when a trace is
+first input, this is the magnification used. The maximum magnification
+is such that bases are spaced out with several characters of space
+between them --- this should allow more than enough room for base
+insertions to be clearly visible. The second scrollbar is immediately
+above the viewport and allows the user to select which part of the
+trace is viewed. Both the sliders work in a similar way: the middle
+mouse button can be used to drag the thumb to any desired position,
+the left and right mouse buttons can be clicked within the scrollbar
+to indicate that paging up or down is desired. In the case of the
+viewport scrollbar, the amount of paging is determined by how far up
+the scrollbar the pointer is.
+
+The whole ted window can be expanded and contracted (to an extent) by
+dragging the "grow-region" provided by whatever window manager is
+running. The viewport takes up all of this change in size.
+
+Controls
+--------
+
+ted has four buttons. "Quit" exits the program after first checking
+whether there is a sequence which has been edited and not saved.
+
+"Help" pops up this window which has a scrollbar on the left allowing
+all the text to be viewed.
+
+"Input" presents a dialogue which asks for the format and name of a
+file to be processed. The bases and (if this is not a plain format
+file) traces are read in and displayed for editing. The only
+conversion performed on bases is from 'N' to '-'.
+
+"Output" presents a dialogue which asks for a filename into which the
+edited and clipped bases can be saved. The default value can be set
+on the command line using the "-output" keyword. No conversion of bases
+is performed on output.
+
+ted operates in one of three editing modes, one of which is selected
+from three "radio buttons". The currently selected mode is
+highlighted.
+
+Editing
+-------
+
+In "Edit sequence" mode, the (lower) list of editable bases can be
+edited in much the same way as a text editor operates. A "caret" which
+is visible in the display of edited bases can be moved left and right
+with the cursor keys (these are sometimes called arrow keys and often
+appear on numeric keypads). It can also be positioned by clicking any
+button while the pointer is pointing into either of the list of bases
+or the traces. The DELETE key deletes the base immediately to the left
+of the caret. Any printing character can be inserted to the right of
+the caret by simply typing it. Inserted characters are placed halfway
+between their neighbours, or if a space is left by the deletion of a
+base originally there, its position is used. A base can thus be
+changed by deleting it and entering the new base.
+
+Note that in the current version of ted the caret is not constrained
+to remain within the viewed part of the display and that editing can
+still continue while it is thus invisible. Such editing would probably
+only occur by accident.
+
+ted provides a facility to define a cutoff at either end of the trace.
+A number of the leftmost bases (corresponding to the vector) and the
+rightmost bases (corresponding to the point where the data become
+unreliable) can be defined by setting the editor into "Adjust left
+cutoff" or "Adjust right cutoff" mode. In either of these modes, the
+pointer and mouse buttons can be used to indicate the cutoff point,
+and the cursor keys can be used to adjust this leftwards or
+rightwards. Initially, the cutoff regions are both empty. The cutoff
+regions are clearly indicated on the list of edited bases display and
+on the traces display by being drawn with a dimmed background.
+
+When the sequence is written out, the list of edited bases, with both
+cutoff regions removed, is written. The output contains newlines
+for convenient formatting and always ends with one.
diff --git a/src/ted/ted.manpage b/src/ted/ted.manpage
new file mode 100644
index 0000000..21201a9
--- /dev/null
+++ b/src/ted/ted.manpage
@@ -0,0 +1,107 @@
+.TH ted 1L "July 1991" "MRC LMB" "LOCAL"
+.SH NAME
+ted \- trace editor
+.SH SYNOPSIS
+.B ted
+[(
+.B -ABI\||\|-ALF\||\|-plain
+)
+.I tracefilename
+[
+.B -baseNum
+.I number
+]
+.B [
+.B -mag
+.I number
+( 1 to 100 )
+] 
+.B [
+.B -bottom
+.I number
+(1(true) or 0(false))
+.B ]
+.B [
+.B -astring
+.I nucleotide-string
+]] 
+.B [
+.B -enzyme
+.I  5' cutting sequence
+] 
+.B [
+.B -raw
+.I  filename
+(to be placed at head of xdap compatible .seq file)
+.B ]
+[
+.B -output
+.I outputfilename
+]
+
+.SH DESCRIPTION
+.B ted
+is a simple prototype editor for traces produced from automatic
+sequencing machines. It allows the traces (from the ABI
+or ALF sequencing machines) produced to be
+displayed along with the machines interpretation of these into
+bases and an initially identical sequence which can be edited
+by the user. A cutoff region can be defined at both ends. The
+edited and clipped list of bases can then be written out.
+.LP
+When initially run,
+.B ted
+displays the trace file
+.I tracefilename
+(if given) of the specified format centered on the base number
+.I baseNum
+(if given). If no file is provided,
+.B ted
+initially displays nothing.
+.LP
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+being represented by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+The information button brings up signal strength and average spacing for
+ABI files.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.  The output filename is specified in a dialogue, 
+but a default value of inputfilename.seq is provided or the default value
+can be given with the
+.I outputfilename
+argument.
+.LP
+A simple help system is provided.
+.SH FILES
+.PD 0
+.TP 20
+.B ted.help
+Text provided in the help window.
+.TP
+.B /usr/lib/X11/app-defaults/Xted
+Default application resources.
+.SH ENVIRONMENT
+.TP 20
+.SB XFILESEARCHPATH
+Specifies the locations where
+.B ted.help
+is sought.
+If this is not defined,
+.B ted.help
+must be in the
+.B /usr/lib/X11/app-defaults
+directory.
+.SH AUTHORS
+Tim Gleeson, LaDeana Hillier, Simon Dear.
diff --git a/src/ted/ted.tex b/src/ted/ted.tex
new file mode 100644
index 0000000..0a0b291
--- /dev/null
+++ b/src/ted/ted.tex
@@ -0,0 +1,213 @@
+\documentstyle[12pt]{article}
+
+\title{A trace display and editing program for data from fluorescence based 
+sequencing machines}
+\author{Timothy Gleeson \and LaDeana Hillier}
+
+\begin{document}
+\maketitle
+\section*{}
+\subsection*{}
+\subsubsection*{ABSTRACT}
+
+``Ted'' ({\em T}race {\em ed}itor) 
+is a graphical editor for sequence and trace data from automated 
+fluorescence sequencing machines.  It provides facilities 
+for viewing sequence and trace data (in top or bottom strand 
+orientation), for editing the base sequence,  for 
+automated or manual trimming of the head (vector) and tail 
+(uncertain data) from the sequence, for vertical and horizontal trace 
+scaling, for keeping a history of sequence editing, and for output of 
+the edited sequence.  Ted has been used extensively in the C. 
+elegans genome sequencing project,
+both as a stand-alone program and integrated into 
+the Staden sequence assembly package, and  has 
+greatly aided in the efficiency 
+and accuracy of sequence editing.  It runs in the X 
+windows environment on Sun workstations and is available from the 
+authors.  Ted currently supports sequence and trace data from the ABI 
+373A and Pharmacia A.L.F. sequencers.
+
+\subsubsection*{INTRODUCTION}
+	Time involved in sequence editing is extensive, and anything easing 
+that burden will improve the efficiency of any major sequencing 
+project.  Having sequence and trace data available online in easily-
+manipulable form is invaluable. Ted (a Trace-EDitor) was developed to 
+fill this role in the C. elegans genome 
+sequencing project [1]. 
+
+\subsubsection*{METHODS}
+
+{\em Computing Design and Implementation.}
+When designing ted, we had a number of specific computing goals 
+in mind including portability and adaptability.  For portability, we 
+chose to write ted in ANSI C using the X windowing system and the 
+Xaw toolkit.  X provides basic capabilities for the creation and use 
+of windows, and the toolkit contains a number of pre-packaged 
+components, such as the ``sliders'' used for scrolling. X also allows 
+site, user and per-run defaults to be set.  Adaptability is also an 
+important goal since we are providing a new function to 
+research groups who are constantly adding new requirements.  
+
+	Stylistically, we have followed an ``Abstract Data Type''
+discipline.  In this discipline, a program is split into a number of 
+modules which provide separate, well-defined functions.  We 
+separate the interface of a module from its implementation.  For 
+example, a unified internal sequence format is used.  This can store 
+a varying amount of information.  However, there is a clear and 
+simple interface by which the rest of the program accesses this 
+module.  Such a style is not well supported by C, but its adoption has 
+been very successful.  The addition of new sequencing machines, and 
+thus new external data formats, may cause some changes in the 
+internal representation of the sequence but should not affect  
+the rest of the program.
+
+	Ted accepts a large number of optional command line arguments,
+many of which can also be specified as system defaults. This
+supports a mode of working whereby ted is invoked not directly by the
+user but instead by a script or another application which supplies
+arguments appropriate to the editing task.
+
+
+{\em Graphical Interface.}
+Ted currently accepts data from two fluorescence based sequencing
+machines, the Pharmacia A.L.F. and the ABI 373A.
+The sequencing machine data consists of 
+four traces of fluorescence levels together with the machine's 
+interpretation, which is a sequence of bases.  
+Ted displays 
+the traces and the machine-generated base list.  
+A second, initially identical, list of bases is provided for correction 
+by the user.
+
+	Ted has an X windows based 
+graphical interface. The trace file
+can either be input from the command line or by 
+clicking on the INPUT button after the program has been invoked.  
+Other parameters which the user may specify on the
+command line include: the output 
+file name; a base position or sequence string on which the trace is 
+to be centered;  a default trace magnification;  a 5' vector sequence 
+for automated elimination of the sequence head (vector); top or 
+bottom strand orientation; or any of the usual X-window parameters (e.g. 
+display, geometry...).
+
+	The graphics display (Figure 1) consists of the control 
+panel, the base position information, the original and edited sequence 
+data, and the graphical representation of the trace.  The user may 
+begin by using the control panel INPUT button to input a new trace 
+file at which time the user selects whether to view the sequence
+and trace in top or bottom strand orientation.
+The trace file is displayed and, if a 5' vector sequence has been 
+specified on the command line, the program attempts to select a 
+cutoff point corresponding to the vector sequence at the ``head'' of the 
+trace file.  The bases beyond the ``cutoff'' point are  
+displayed on a shaded background.  The user may modify the cutoff 
+position by clicking on the ``Adj left cut'' button and clicking on the 
+position of the desired cutoff.  Similarly, the user may adjust the 
+right cutoff of the sequence (chosen by starting at the 5' end of the 
+sequence and looking for the first occurrence when 2 out of 5 bases 
+are 'N') by scrolling along the sequence to that point, clicking on the 
+``Adj right cut'' button, and clicking on the appropriate base.  
+Automation of the ``cutoff'' process is optional; the user may compile 
+the program with that feature turned ``off.'' 
+
+	Clicking on the ``Edit seq'' button allows the user to enter the edit 
+mode.  The ``Search'' button can be used to skip from ``problem'' to 
+``problem'' (i.e., ambiguity to ambiguity) or to look for runs of 
+identical bases (e.g., TTTT) which are often mis-called by
+the machine software.
+
+  Bases can be inserted, deleted, or replaced as with
+any ordinary word-processor.  In difficult-to-read areas,  
+the trace may be vertically or horizontally scaled by dragging or 
+clicking on the magnification scroll bar or by clicking on the 
+vertical scaling buttons (``Scale down'', ``Scale up''), respectively.  
+Finally, the edited sequence is saved to an ascii file using the 
+``Output'' button.  A history of the editing session can also be saved
+along with the sequence. 
+The ``Quit'' button is used 
+to exit the program.  When reinvoking ted on an edited trace file the 
+edited base sequence, rather than the original sequence, is shown in 
+the edited base window.  The user may invoke ted by calling in any one 
+of the previous editing sessions.   
+
+
+\subsubsection*{APPLICATIONS AND CONCLUSIONS}
+
+	In the C. elegans genome sequencing project, data from the ABI or 
+A.L.F. sequencing machines' computers are transferred to Sun 
+workstations.  
+The user invokes a Unix shell script that calls ted systematically 
+on each of the new set of trace files creating a set of sequence files.
+The sequence files that are deemed to be of acceptable quality
+are then entered into the sequence 
+assembly program xdap [2] where the sequences are assembled into 
+contigs.  Portions of the ted trace-editor have been incorporated 
+into the xdap ``trace manager,''  which is used in 
+conjunction with the contig editor to view sets of aligned traces 
+at sites of discrepancies in the aligned sequences.  
+
+	Ted is also used at the stage of choosing oligo primers for the 
+``walking'' stage of the sequencing project.  It can be invoked directly 
+from the oligo selection program, osp [3], to allow examination
+of the trace data in the region of the primers so that  
+integrity of the sequence data can be verified.
+
+	Currently, no other programs are known to be available 
+which support editing of the ABI trace data. 
+Further, the modular design of the program should allow
+support for new types of sequencing machines, with new data 
+formats, to be implemented in a straightforward fashion.  
+
+
+\subsubsection*{AVAILABILITY}
+	Ted is freely available from the authors or from Rodger Staden and
+Simon Dear (MRC Laboratory of Molecular Biology, Hills Road, Cambridge,
+UK, CB2 2QH) for use on Sun workstations running X-windows (or OpenLook).
+
+
+\subsubsection*{ACKNOWLEDGMENTS}
+	The authors would like to thank all members of the C. elegans
+sequencing project with special thanks to the following people:
+John Sulston, Bob Waterston,  
+Phil Green, Rick Wilson, Richard Durbin, Simon Dear, and Rodger Staden 
+for their helpful suggestions for improvements in the ted interface 
+and for their parts in the development of ted.  This work was 
+supported by the Medical Research Council and NIH grant R01-HG00136.
+
+\subsubsection*{REFERENCES}
+
+1. Waterston, R., Sulston, J., et al. (1991), in preparation.
+
+2. Dear, S. and Staden, R. (1991) Nuc. Acids Res.,  in press.
+
+3. Hillier, L. and Green, P. (1991) submitted.
+
+
+{\bf Figure 1 legend.}
+
+Figure 1 shows a ``screen dump'' of the ted graphical interface.  
+The display consists of
+the control panel and the synchronized view of the base position
+information, original and edited sequence data, 
+and graphical representation of the trace (with each nucleotide's trace
+ being represented
+by a different color).  The control
+panel allows the user to read in new trace files (in either
+bottom or top strand orientation)
+as well as to search for a string of nucleotides or a certain base position.
+Scroll bars allow the user to adjust the magnification of or scroll through
+the sequence and trace data.  The user may also choose to change the vertical
+magnification of the trace data.  Further, sequence on the head (vector)
+or tail (uncertain data) of the sequence may be ``cutoff'' 
+using the adjust left and right cutoff buttons. Bases can be inserted, 
+deleted, or replaced as with
+any ordinary word-processor in the sequence data window. Finally, the
+sequence may be written to an ascii file using the output button on
+the control panel.
+
+\end{document}
+
+
+
diff --git a/src/ted/trace.abi b/src/ted/trace.abi
new file mode 100644
index 0000000..5b9b9f4
Binary files /dev/null and b/src/ted/trace.abi differ
diff --git a/src/ted/trace.alf b/src/ted/trace.alf
new file mode 100644
index 0000000..e1bf33c
Binary files /dev/null and b/src/ted/trace.alf differ
diff --git a/src/ted/trace.scf b/src/ted/trace.scf
new file mode 100644
index 0000000..ceff4dd
Binary files /dev/null and b/src/ted/trace.scf differ
diff --git a/src/ted/traceDisplay.c b/src/ted/traceDisplay.c
new file mode 100644
index 0000000..cf28145
--- /dev/null
+++ b/src/ted/traceDisplay.c
@@ -0,0 +1,618 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.c
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Friday 15 May 1992
+*/
+
+
+/*
+    `plotEdgeOffset' indicates the pixel position at which point 0
+    occurs. The characters for bases are printed with their centres
+    corresponding to their locations on the plot.
+
+    The cutoffs are actually drawn between the positions of the
+    bases on either side.
+
+    SD:
+	Added incScaleFactor, decScaleFactor
+	Changed scaleTrace to use scale factor
+    SD:
+	Support monochrome displays when marking cutoffs
+    SD: 15-May-1992
+        Calculation of next value of pSegN in drawTrace() gives a value one too many
+	
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "traceDisplay.h"
+#include "display.h"      /* IMPORT: userEvent */
+#include "Graph.h"
+#include "seq.h"          /* IMPORT: Seq, NULLSeq, getNPoints */
+
+
+#include <X11/Intrinsic.h>
+#include <X11/StringDefs.h>
+#include <X11/Xaw/Form.h>
+
+
+
+/* ---- Statics ---- */
+
+static Seq currSeq = NULLSeq;
+static int NPoints;
+static TRACE maxTraceVal;
+
+static Widget traceWid;
+
+static Dimension traceWidth;     /* Current width of the trace graph */
+static Dimension traceHeight;    /* Current height of the trace graph */
+static Dimension plotEdgeOffset;
+static Dimension plotWidth;      /* Current width of the plot
+				    = graphWidth - 2*plotEdgeOffset */
+
+static int leftCutoff, rightCutoff;
+
+static GC Agc, Cgc, Ggc, Tgc;
+static Pixel normBackground, dimBackground;
+
+/* monochrome stippling */
+static GC greygc;
+static int planes;
+
+static float scaleFactor;
+
+
+
+
+/* ---- Position translation ---- */
+
+/*
+    Convert a value from the `point-in-the-sequence' to the
+    `x-pixel-on-the-graph' scale, and vice-versa, given that
+    `plotWidth' holds the current pixel width of the graph which
+    starts `plotEdgeOffset' from either edge of the graph and
+    `NPoints' gives the current point width.
+    Fractional results are truncated - for pixelToPoint this may
+    mean a point one too low is indicated, but a point too high is
+    never indicated.
+*/
+
+
+static int pointToPixel(int point)
+{   return(((point * plotWidth) / (NPoints-1)) + plotEdgeOffset);
+}
+
+
+static int pixelToPoint(int pixel)
+{   return(((pixel-plotEdgeOffset) * (NPoints-1)) / plotWidth);
+}
+
+
+static int scaleTrace(int y)
+{   return(traceHeight - 
+	   (int)(scaleFactor * (float)y * (float)traceHeight / (float)maxTraceVal)
+	   );
+}
+
+
+static int leftCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a left cutoff at `baseNum'.
+*/
+{   int pL, pR;
+
+    pL = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum-1);
+    pR = (baseNum == 0) ? 0 : baseNumToPoint(currSeq, EdBases, baseNum);
+
+    return((pR==NULLPoint)?pL:(pL+pR)/2);
+}
+
+
+static int rightCutoffPoint(int baseNum)
+/*
+    Return the point corresponding to a right cutoff at `baseNum'.
+*/
+{   int NedBases = getNBases(currSeq, EdBases);
+    int pL, pR;
+
+    pL = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum);
+    pR = (baseNum == 0)
+         ? NPoints-1
+	 : baseNumToPoint(currSeq, EdBases, NedBases-baseNum-1);
+
+    return((pL+pR)/2);
+}
+
+
+
+
+
+/* ---- Internal routines ---- */
+
+
+
+
+static void drawTrace(Widget wid, char base, int p0, int pN)
+/*
+    Draw the `base' trace between points p0..pN (inclusive)
+*/
+{   GC gc;
+    int pSeg, pSeg0, pSegN;
+#define NSegs 500
+    int traces[NSegs];
+    XPoint xPoint[NSegs];
+
+    switch (base) 
+    {   case 'A': gc=Agc; break;
+        case 'C': gc=Cgc; break;
+        case 'G': gc=Ggc; break;
+        case 'T': gc=Tgc; break;
+    }
+
+
+    /*
+        The range (p0..pN) is drawn as a series of overlapping
+	segments (pSeg0..pSegN).
+    */
+    pSegN = p0;
+    do
+    {   int i;
+
+	/*
+	    This segment carries on from and including the last point
+	    of the last segment.
+	*/
+	pSeg0 = pSegN;
+	pSegN = ((pSeg0+NSegs-1)<=pN) ? pSeg0+NSegs-1 : pN;      
+	/* Get the traces for this segment */
+	getTraces(currSeq, base, pSeg0, pSegN, traces);
+
+	/* Translate to pixel coords and place in an XPoint array */
+	pSeg = pSeg0;
+	i = 0;
+	while (pSeg<=pSegN)
+	{   xPoint[i].x = pointToPixel(pSeg);
+	    xPoint[i].y = scaleTrace(traces[i]);
+	    pSeg++;
+	    i++;
+	}
+
+	/* Draw the segments in the XPoint array */
+	XDrawLines(XtDisplay(wid),
+		   XtWindow(wid),
+		   gc,
+		   xPoint,
+		   pSegN-pSeg0+1,
+		   CoordModeOrigin);
+    } while (pSegN < pN);
+}
+
+
+
+
+
+/* ---- Callbacks ---- */
+
+
+
+
+static void exposeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Region region = (Region) call_data;
+    XRectangle rect;
+    int x0, xN; /* Affected region: pixels */
+    int p0, pN; /* Affected region: points */
+    int leftCutoffP,  rightCutoffP;
+    
+
+    if ((currSeq == NULLSeq) || isBasesOnly(currSeq)) return;
+
+
+    /*
+        `region' gives us the exposed graph region.
+	XClipBox will return a bounding rectangle, in pixels.
+    */
+    XClipBox(region, &rect);
+    x0 = rect.x;
+    xN = rect.x+rect.width;
+
+    /* Trim it so we only consider the plot proper */
+    if ((x0>plotWidth+plotEdgeOffset) || (xN<plotEdgeOffset)) return;
+    x0 = (x0<plotEdgeOffset)           ? plotEdgeOffset           : x0;
+    xN = (xN>plotWidth+plotEdgeOffset) ? plotWidth+plotEdgeOffset : xN;
+
+    /* Convert to affected points */
+    p0 = pixelToPoint(x0);
+    pN = pixelToPoint(xN);
+    if (pN<NPoints-1) pN++;
+
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+    leftCutoffP  = leftCutoffPoint(leftCutoff);
+    rightCutoffP = rightCutoffPoint(rightCutoff);
+      
+
+    if (p0<leftCutoffP)
+    /*
+        Some of the exposed region is in the left cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = p0;
+       int lastP  = (pN<leftCutoffP) ? pN : leftCutoffP;
+
+       int pix0 = pointToPixel(firstP);
+       int pixN = pointToPixel(lastP);
+
+	
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(traceWid), XtWindow(traceWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, traceHeight);
+       } else {
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		  pix0, 0,
+		  pixN-pix0, traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    normBackground);
+       }
+    }
+      
+
+    if (pN>rightCutoffP)
+    /*
+        Some of the exposed region is in the right cutoff area.
+	Repaint it with dimmed background.
+    */
+    {  int firstP = (p0>rightCutoffP) ? p0 : rightCutoffP;
+       int lastP  = pN;
+
+       int pix0 = pointToPixel(firstP);
+       int pixN = pointToPixel(lastP);
+
+       if (planes == 1) {
+	   XFillRectangle(XtDisplay(traceWid), XtWindow(traceWid),
+		greygc,
+		  pix0, 0,
+		  pixN-pix0, traceHeight);
+       } else {
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    dimBackground);
+       XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		  pix0, 0,
+		  pixN-pix0, traceHeight,
+		  False);
+       XSetWindowBackground(XtDisplay(traceWid), XtWindow(traceWid),
+			    normBackground);
+	}
+    }
+
+    /*
+        Now draw the traces.
+	The GC background is unused when drawing lines, so we do
+	not need to change it depending on whether we are in the
+	cutoff region or not.
+    */
+    drawTrace(wid, 'A', p0, pN);
+    drawTrace(wid, 'C', p0, pN);
+    drawTrace(wid, 'G', p0, pN);
+    drawTrace(wid, 'T', p0, pN);
+}
+
+
+
+
+static void userCallback(Widget wid,
+			 XtPointer client_data, XtPointer call_data)
+/*
+    Handles all XtNcallbacks, i.e., buttons and keys.
+*/
+{   XEvent *eventP = (XEvent *) call_data;
+
+    if (currSeq == NULLSeq) return;
+
+    /*
+        Callback to the `display' module.
+    */
+    userEvent(eventP);
+}
+
+
+
+
+static void resizeCallback(Widget wid,
+			   XtPointer client_data, XtPointer call_data)
+{   Arg args[10];
+    int nargs;
+
+    /*
+	This routine can be called in two ways.
+	(a) as a callback when the trace is resized by the user
+	(b) indirectly when setGraphWidth alters the graph width
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &traceHeight); nargs++;
+    XtSetArg(args[nargs], XtNwidth,  &traceWidth);  nargs++;
+    XtGetValues(wid, args, nargs);
+
+    if (XtIsRealized(wid)) XClearWindow(XtDisplay(wid), XtWindow(wid));
+}
+
+
+
+
+/* ---- Exports ---- */
+
+
+void redisplayTraces()
+{
+    /*
+    **  Clear the area and generate an expose.
+    */
+    XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		0, 0,
+		0, 0,
+		True);
+}
+
+void incScaleFactor()
+{
+    scaleFactor *= 1.5;
+    redisplayTraces();
+}
+
+void decScaleFactor()
+{
+    scaleFactor /= 1.5;
+    redisplayTraces();
+}
+
+void setScaleFactor(float sf)
+{
+    scaleFactor = sf;
+    redisplayTraces();
+}
+
+
+
+void createTraceDisplay(Widget parentWid, Widget fromVertWid,
+			Dimension width, Dimension offset)
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `offset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNfromVert, fromVertWid); nargs++;
+    XtSetArg(args[nargs], XtNwidth,    width);       nargs++;
+    traceWid = XtCreateManagedWidget("trace", graphWidgetClass,
+				     parentWid, args, nargs);
+    XtAddCallback(traceWid, XtNcallback,       userCallback, NULL);
+    XtAddCallback(traceWid, XtNexposeCallback, exposeCallback, NULL);
+    XtAddCallback(traceWid, XtNresizeCallback, resizeCallback, NULL);
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNheight, &traceHeight); nargs++;
+    XtGetValues(traceWid, args, nargs);
+    traceWidth     = width;
+    plotWidth      = traceWidth-(2*plotEdgeOffset);
+    plotEdgeOffset = offset;
+    scaleFactor = 1.0;
+
+    currSeq = NULLSeq;
+
+    /*
+        Get the GCs
+    */
+    Agc = GraphGC1(traceWid);
+    Cgc = GraphGC2(traceWid);
+    Ggc = GraphGC3(traceWid);
+    Tgc = GraphGC4(traceWid);
+
+    /*
+        Get the backgrounds
+    */
+    nargs = 0;
+    XtSetArg(args[nargs], XtNbackground,    &normBackground);    nargs++;
+    XtSetArg(args[nargs], XtNdimBackground, &dimBackground); nargs++;
+    XtGetValues(traceWid, args, nargs);
+
+    planes = DisplayPlanes(XtDisplay(traceWid),DefaultScreen(XtDisplay(traceWid)));
+    if (planes == 1) {
+	/* prepare for monochrome display */
+	XtGCMask valuemask = (GCStipple | GCFillStyle);
+	XGCValues values;
+	Pixmap grey_stipple;
+
+#define grey_width 4
+#define grey_height 4
+	static char grey_bits[] = {
+	   0x0e, 0x0b, 0x0e, 0x0b};
+
+	grey_stipple =
+	    XCreateBitmapFromData(XtDisplay(traceWid),
+				  RootWindowOfScreen(XtScreenOfObject(traceWid)),
+				  grey_bits,
+				  grey_width,
+				  grey_height);
+
+	values.stipple = grey_stipple;
+	values.fill_style = FillOpaqueStippled;
+	greygc = XCreateGC(XtDisplayOfObject(traceWid),
+			     RootWindowOfScreen(XtScreenOfObject(traceWid)),
+			     valuemask, &values);
+
+    }
+ 
+}
+
+
+
+
+void displayTrace(Seq seq)
+/*
+    Display the trace of sequence `seq'.
+*/
+{
+    if (seq == NULLSeq) return;
+    currSeq = seq;
+
+
+    /*
+        Get static information about the current sequence.
+    */
+    NPoints     = getNPoints(currSeq);
+    maxTraceVal = getMaxTraceVal(currSeq);
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+void unDisplayTrace()
+/*
+    Cease displaying the current sequence (if any).
+*/
+{   currSeq = NULLSeq;
+}
+
+
+
+
+void setTraceWidth(Dimension width)
+/*
+    Set the width of the current sequence trace display.
+*/
+{   Arg args[10];
+    int nargs;
+
+    nargs = 0;
+    XtSetArg(args[nargs], XtNwidth, width); nargs++;
+    XtSetValues(traceWid,   args, nargs);
+
+    traceWidth = width;
+    plotWidth  =  traceWidth-(2*plotEdgeOffset);
+
+    /*
+        Clear the graph.
+    */
+    if (XtIsRealized(traceWid))
+        XClearWindow(XtDisplay(traceWid), XtWindow(traceWid));
+}
+
+
+
+
+void traceCutoffChange()
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+{   int newLeftCutoff, newRightCutoff;
+
+    getCutoffs(currSeq, &newLeftCutoff, &newRightCutoff);
+
+
+    if (newLeftCutoff != leftCutoff)
+    /*
+        The region between newLeftCutoff and leftCutoff has changed.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	*/
+	if (newLeftCutoff<leftCutoff)
+	{   b0 = newLeftCutoff;
+	    bN = leftCutoff;
+	}
+	else
+	{   b0 = leftCutoff;
+	    bN = newLeftCutoff;
+	}
+
+	p0 = leftCutoffPoint(b0);
+	pN = leftCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		   pix0, 0,
+		   pixN-pix0, traceHeight,
+		   True);
+    }
+
+
+    if (newRightCutoff != rightCutoff)
+    /*
+        The region between newRightCutoff and rightCutoff has
+	changed. Expose it.
+    */
+    {   int b0, bN;
+	int p0, pN;
+	int pix0, pixN;
+	
+	/*
+	    b0 = leftmost base number. bN = rightmost base number.
+	    N.B. lower cutoff means the base is further right.
+	*/
+	if (newRightCutoff<rightCutoff)
+	{   b0 = rightCutoff;
+	    bN = newRightCutoff;
+	}
+	else
+	{   b0 = newRightCutoff;
+	    bN = rightCutoff;
+	}
+
+	p0 = rightCutoffPoint(b0);
+	pN = rightCutoffPoint(bN);
+
+	pix0 = pointToPixel(p0);
+	pixN = pointToPixel(pN);
+
+
+	/*
+	    Clear the area and generate an expose.
+	*/
+	XClearArea(XtDisplay(traceWid), XtWindow(traceWid),
+		   pix0, 0,
+		   pixN-pix0, traceHeight,
+		   True);
+  
+    }
+
+
+    /*
+        Update static data.
+    */
+    getCutoffs(currSeq, &leftCutoff, &rightCutoff);
+}
+
+
+
+
+
+
diff --git a/src/ted/traceDisplay.h b/src/ted/traceDisplay.h
new file mode 100644
index 0000000..a1a1956
--- /dev/null
+++ b/src/ted/traceDisplay.h
@@ -0,0 +1,80 @@
+/* 
+    Title:       traceDisplay
+
+    File: 	 traceDisplay.h
+    Purpose:	 Display the trace of a sequence on a graph widget
+    Last update: Wednesday 9 Jan 1991
+*/
+
+
+/*
+*/
+
+
+
+
+/* ---- Includes ---- */
+
+#include "seq.h"           /* IMPORT: Seq */
+
+#include <X11/Intrinsic.h> /* IMPORT: Widget, Dimension */
+
+
+
+
+/* ---- Exports ---- */
+
+
+extern void createTraceDisplay(Widget parentWid, Widget fromVertWid,
+			       Dimension width, Dimension offset);
+/*
+    Create the trace display within the Form widget `parent',
+    with the XtNfromVert constraint `fromVertWid'. The initial
+    width is `width' and the plot is drawn `plotEdgeOffset' from
+    the left and right hand edges.
+    No trace is initially displayed.
+*/
+
+
+extern void displayTrace(Seq seq);
+/*
+    Display the trace of sequence `seq'.
+*/
+
+
+extern void unDisplayTrace();
+/*
+    Cease displaying the trace of the current sequence (if any).
+*/
+
+
+extern void setTraceWidth(Dimension width);
+/*
+    Set the width of the current sequence trace display.
+*/
+
+
+extern void traceCutoffChange();
+/*
+    One or both of the cutoffs have changed.
+    Update our display.
+*/
+
+
+extern void incScaleFactor();
+/*
+    Increment vertical scaling of trace
+*/
+
+
+extern void decScaleFactor();
+/*
+    Decrement vertical scaling of trace
+*/
+
+
+
+extern void setScaleFactor(float sf);
+/*
+    Set vertical scaling of trace
+*/
diff --git a/src/ted/values.h b/src/ted/values.h
new file mode 100644
index 0000000..792e966
--- /dev/null
+++ b/src/ted/values.h
@@ -0,0 +1,118 @@
+/*	@(#)values.h 1.7 89/10/02 SMI; from S5R2 1.16	*/
+
+#ifndef _values_h
+#define _values_h
+
+/* These values work with any binary representation of integers
+ * where the high-order bit contains the sign. */
+
+/* a number used normally for size of a shift */
+#if gcos
+#define BITSPERBYTE	9
+#else
+#define BITSPERBYTE	8
+#endif
+#define BITS(type)	(BITSPERBYTE * (int)sizeof(type))
+
+/* short, regular and long ints with only the high-order bit turned on */
+#define HIBITS	((short)(1 << BITS(short) - 1))
+#define HIBITI	(1 << BITS(int) - 1)
+#define HIBITL	(1L << BITS(long) - 1)
+
+/* largest short, regular and long int */
+#define MAXSHORT	((short)~HIBITS)
+#define MAXINT	(~HIBITI)
+#define MAXLONG	(~HIBITL)
+
+/* various values that describe the binary floating-point representation
+ * _EXPBASE	- the exponent base
+ * DMAXEXP 	- the maximum exponent of a double (as returned by frexp())
+ * FMAXEXP 	- the maximum exponent of a float  (as returned by frexp())
+ * DMINEXP 	- the minimum exponent of a double (as returned by frexp())
+ * FMINEXP 	- the minimum exponent of a float  (as returned by frexp())
+ * MAXDOUBLE	- the largest double
+			((_EXPBASE ** DMAXEXP) * (1 - (_EXPBASE ** -DSIGNIF)))
+ * MAXFLOAT	- the largest float
+			((_EXPBASE ** FMAXEXP) * (1 - (_EXPBASE ** -FSIGNIF)))
+ * MINDOUBLE	- the smallest double (_EXPBASE ** (DMINEXP - 1))
+ * MINFLOAT	- the smallest float (_EXPBASE ** (FMINEXP - 1))
+ * DSIGNIF	- the number of significant bits in a double
+ * FSIGNIF	- the number of significant bits in a float
+ * DMAXPOWTWO	- the largest power of two exactly representable as a double
+ * FMAXPOWTWO	- the largest power of two exactly representable as a float
+ * _IEEE	- 1 if IEEE standard representation is used
+ * _DEXPLEN	- the number of bits for the exponent of a double
+ * _FEXPLEN	- the number of bits for the exponent of a float
+ * _HIDDENBIT	- 1 if high-significance bit of mantissa is implicit
+ * LN_MAXDOUBLE	- the natural log of the largest double  -- log(MAXDOUBLE)
+ * LN_MINDOUBLE	- the natural log of the smallest double -- log(MINDOUBLE)
+ * LN_MAXFLOAT	- the natural log of the largest float  
+ * LN_MINFLOAT	- the natural log of the smallest float
+ */
+#if u3b || u3b5 || sun
+#define MAXDOUBLE	1.797693134862315708e+308
+#define MAXFLOAT	((float)3.40282346638528860e+38)
+#define MINDOUBLE	4.94065645841246544e-324
+#define MINFLOAT	((float)1.40129846432481707e-45)
+#define	_IEEE		1
+#define _DEXPLEN	11
+#define _HIDDENBIT	1
+#define DMINEXP	(-(DMAXEXP + DSIGNIF - _HIDDENBIT - 3))
+#define FMINEXP	(-(FMAXEXP + FSIGNIF - _HIDDENBIT - 3))
+#define LN_MAXFLOAT	8.8722839052068e+01
+#define LN_MINFLOAT	-1.03278929903432e+02
+#endif
+#if pdp11 || vax
+#define MAXDOUBLE	1.701411834604692293e+38
+#define MAXFLOAT	((float)1.701411733192644299e+38)
+/* The following is kludged because the PDP-11 compilers botch the simple form.
+   The kludge causes the constant to be computed at run-time on the PDP-11,
+   even though it is still "folded" at compile-time on the VAX. */
+#define MINDOUBLE	(0.01 * 2.938735877055718770e-37)
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	1
+#define DMINEXP	(-DMAXEXP)
+#define FMINEXP	(-FMAXEXP)
+#endif
+#if gcos
+#define MAXDOUBLE	1.7014118346046923171e+38
+#define MAXFLOAT	((float)1.7014118219281863150e+38)
+#define MINDOUBLE	2.9387358770557187699e-39
+#define MINFLOAT	((float)MINDOUBLE)
+#define _IEEE		0
+#define _DEXPLEN	8
+#define _HIDDENBIT	0
+#define DMINEXP	(-(DMAXEXP + 1))
+#define FMINEXP	(-(FMAXEXP + 1))
+#endif
+#if u370
+#define _LENBASE	4
+#else
+#define _LENBASE	1
+#endif
+#define _EXPBASE	(1 << _LENBASE)
+#define _FEXPLEN	8
+#define DSIGNIF	(BITS(double) - _DEXPLEN + _HIDDENBIT - 1)
+#define FSIGNIF	(BITS(float)  - _FEXPLEN + _HIDDENBIT - 1)
+#define DMAXPOWTWO	((double)(1L << BITS(long) - 2) * \
+				(1L << DSIGNIF - BITS(long) + 1))
+#define FMAXPOWTWO	((float)(1L << FSIGNIF - 1))
+#define DMAXEXP	((1 << _DEXPLEN - 1) - 1 + _IEEE)
+#define FMAXEXP	((1 << _FEXPLEN - 1) - 1 + _IEEE)
+#define LN_MAXDOUBLE	(M_LN2 * DMAXEXP)
+#define LN_MINDOUBLE	(M_LN2 * (DMINEXP - 1))
+
+#define H_PREC	(DSIGNIF % 2 ? (1L << DSIGNIF/2) * M_SQRT2 : 1L << DSIGNIF/2)
+#define X_EPS	(1.0/H_PREC)
+#define X_PLOSS	((double)(long)(M_PI * H_PREC))
+#define X_TLOSS	(M_PI * DMAXPOWTWO)
+#define M_LN2	0.69314718055994530942
+#define M_PI	3.14159265358979323846
+#define M_SQRT2	1.41421356237309504880
+#define MAXBEXP	DMAXEXP /* for backward compatibility */
+#define MINBEXP	DMINEXP /* for backward compatibility */
+#define MAXPOWTWO	DMAXPOWTWO /* for backward compatibility */
+
+#endif /*!_values_h*/
diff --git a/src/update_subclones/README b/src/update_subclones/README
new file mode 100644
index 0000000..e59522b
--- /dev/null
+++ b/src/update_subclones/README
@@ -0,0 +1,25 @@
+Software to Maintain Subclone Library Database
+----------------------------------------------
+
+The subclone library database stores information on sublcone libraries.
+For the syntax of this file (subclones) refer to the file
+subclones.syntax.
+
+update_subclones
+	Programs that prompts the user for information to be added to
+	the subclone library database. Ensure that the owner of
+	update_subclones is the same as that of subclones, and that the
+	setuid bit is set.
+
+update_subclones.c
+	The source code for subclonedb and update_subclones
+
+format.awk
+	An attempt to write an awk script to produce output for humans
+	to read. It produces a tbl format file. To use type:
+
+	    awk -f format.awk subclones | tbl | ptroff -t >! out.ps
+
+subclones
+	The subclone library database
+
diff --git a/src/update_subclones/format.awk b/src/update_subclones/format.awk
new file mode 100644
index 0000000..82f81e5
--- /dev/null
+++ b/src/update_subclones/format.awk
@@ -0,0 +1,57 @@
+BEGIN {
+    entries_per_page = 50;
+    PP = 0;
+}
+{
+    if ( (NR % entries_per_page) == 1 ) {
+	if (PP)	print ".bp";
+	PP++;
+	print "Page - " PP;
+	#
+	print ".sp 3";
+	print ".TS";
+	print "box, tab (%);";
+	print "c s   s   s   s";
+	print " c | c | c | c | c | c";
+	print " c | c | c | c | c | c";
+	print " l | l | l | l | l | l.";
+	print "Cosmid Subclone Libraries";
+	print "=";
+	print "Microtitre%Cosmid%Size Range%Fragmentation%Vector%Comments" ;
+	print "Dish Number%%(bases)%Method%";
+	print "=";
+    }
+
+    if (NF > 5) {
+	line = $0;
+	s = index(line,"(");
+	t = index(line,")");
+	count = 0;
+	while (s!=0 && t!=0) {
+	    comment = substr(line,s+1,t-s-1);
+	    line = substr(line,t+1);
+	    if (count==0) {
+		print $1 "%" $2 "%" $3 "%" $4 "%" $5 "%" comment;
+	    } else {
+		print "%%%%%" comment;
+	    }
+	    count++;
+	    s = index(line,"(");
+	    t = index(line,")");
+	}
+	if (count>1) print "%%%%%_";
+    } else
+	print $1 "%" $2 "%" $3 "%" $4 "%" $5 "%";
+
+    if ( (NR % entries_per_page) == 0 ) {
+	print ".TE";
+    }
+}
+
+END {
+    if ( (NR % entries_per_page) != 0 ) {
+	# fill up page
+	for(i=NR; i % entries_per_page; i++) print "%%%%%";
+	print ".TE";
+    }
+}
diff --git a/src/update_subclones/makefile-alpha b/src/update_subclones/makefile-alpha
new file mode 100644
index 0000000..7f1782d
--- /dev/null
+++ b/src/update_subclones/makefile-alpha
@@ -0,0 +1,42 @@
+#
+# Makefile for subclonedb
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = update_subclones
+
+#
+# Where the wild things are
+#
+SRC = .
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)/alpha-binaries
+INCLUDES = -I$(SRC) -I$(MISCSRC)
+
+#
+# Alpha
+#
+CC = cc
+CFLAGS = -g -std1 -ieee_with_inexact $(INCLUDES)
+LD = cc
+LDFLAGS = -g -non_shared
+
+#
+# Here they are
+#
+
+update_subclones : update_subclones.o $(MISCBIN)/misc.a
+	$(LD) $(LDFLAGS) -o update_subclones update_subclones.o $(MISCBIN)/misc.a
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS)
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/update_subclones/makefile-dec b/src/update_subclones/makefile-dec
new file mode 100644
index 0000000..280e437
--- /dev/null
+++ b/src/update_subclones/makefile-dec
@@ -0,0 +1,42 @@
+#
+# Makefile for subclonedb
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subclonedb update_subclones
+
+#
+# Where the wild things are
+#
+SRC = .
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+INCLUDES = -I$(SRC) -I$(MISCSRC)
+
+#
+# Sun
+#
+CC = c89
+CFLAGS = -common -g $(INCLUDES)
+
+#
+# Here they are
+#
+subclonedb : $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o subclonedb $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+
+update_subclones : $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o update_subclones $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS)
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/update_subclones/makefile-sgi b/src/update_subclones/makefile-sgi
new file mode 100644
index 0000000..1b00648
--- /dev/null
+++ b/src/update_subclones/makefile-sgi
@@ -0,0 +1,44 @@
+#
+# Makefile for subclonedb
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subclonedb update_subclones
+
+SHELL=/bin/sh
+
+#
+# Where the wild things are
+#
+SRC = .
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+INCLUDES = -I$(SRC) -I$(MISCSRC)
+
+#
+# SGI
+#
+CC = cc
+CFLAGS = -g $(INCLUDES)
+
+#
+# Here they are
+#
+subclonedb : $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o subclonedb $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+
+update_subclones : $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o update_subclones $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS)
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/update_subclones/makefile-solaris b/src/update_subclones/makefile-solaris
new file mode 100644
index 0000000..b9d66dc
--- /dev/null
+++ b/src/update_subclones/makefile-solaris
@@ -0,0 +1,42 @@
+#
+# Makefile for subclonedb
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subclonedb update_subclones
+
+#
+# Where the wild things are
+#
+SRC = .
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+INCLUDES = -I$(SRC) -I$(MISCSRC)
+
+#
+# Sun
+#
+CC = gcc
+CFLAGS = -g $(INCLUDES)
+
+#
+# Here they are
+#
+subclonedb : $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o subclonedb $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+
+update_subclones : $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o update_subclones $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS)
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/update_subclones/makefile-sun b/src/update_subclones/makefile-sun
new file mode 100644
index 0000000..b9d66dc
--- /dev/null
+++ b/src/update_subclones/makefile-sun
@@ -0,0 +1,42 @@
+#
+# Makefile for subclonedb
+#
+
+#
+# Programs made by this makefile
+#
+PROGS = subclonedb update_subclones
+
+#
+# Where the wild things are
+#
+SRC = .
+MISCSRC = $(SRC)/../Misc
+MISCBIN = $(MISCSRC)
+INCLUDES = -I$(SRC) -I$(MISCSRC)
+
+#
+# Sun
+#
+CC = gcc
+CFLAGS = -g $(INCLUDES)
+
+#
+# Here they are
+#
+subclonedb : $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o subclonedb $(SRC)/subclonedb.c $(MISCBIN)/misc.a
+
+update_subclones : $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+	$(CC) $(CFLAGS) -o update_subclones $(SRC)/update_subclones.c $(MISCBIN)/misc.a
+
+#
+# Special dependencies
+#
+all: $(PROGS)
+
+clean:
+	/bin/rm -f $(PROGS)
+
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
diff --git a/src/update_subclones/subclonedb.c b/src/update_subclones/subclonedb.c
new file mode 100644
index 0000000..ae9175f
--- /dev/null
+++ b/src/update_subclones/subclonedb.c
@@ -0,0 +1,415 @@
+#include <stdio.h>
+#define USER_DATA
+#ifdef USER_DATA
+#include <pwd.h>
+#endif
+#include "misc.h" /* IMPORT: is_file */
+
+/*
+** This is an incredible hack. The original is based on a csh script
+** Beware of horrid goto's etc
+*/
+typedef char String[200];
+String SUBCLONES; /* initialised in initialise() */
+String methods[] = {
+    "shear",
+    "Sau3A",
+    "Alu-1",
+};
+String vectors[] = {
+    "M13",
+    "pBS",
+    "pEMBL9",
+    "pUC118",
+};
+String sizes[] = {
+    "1-2",
+    "6-9",
+    "6-9",
+    "9-14",
+};
+int MATCH = 3;
+
+/*
+** Defaults
+*/
+String def_mtd;
+String def_cosmid;
+String def_method;
+String def_vector;
+String def_size;
+String def_verify;
+
+/*
+** Answers
+*/
+String ans_mtd;
+String ans_cosmid;
+String ans_method;
+String ans_vector;
+String ans_size;
+#define MAX_COMMENTS 10
+String ans_comments[MAX_COMMENTS];
+int comments;
+String ans_verify;
+
+static void initialise()
+/*
+** Initialise miscellaneous variables
+*/
+{
+    char *subclones;
+
+    /* clonelib = subclones:$SUBCLONES */
+    if (is_file("subclones"))
+	strcpy(SUBCLONES,"subclones");
+    else {
+	subclones = (char *) getenv("SUBCLONES");
+	if (subclones == NULL)
+	    SUBCLONES[0] = '\0';
+	else if (is_file(subclones))
+	    strcpy(SUBCLONES,subclones);
+	else
+	    SUBCLONES[0] = '\0';
+    }
+}
+
+int set_defaults()
+{ 
+    FILE *f;
+    String line;
+    String last_cosmid;
+    String max_mtd;
+    String mtd;
+    String alpha;
+    int number;
+    int i;
+
+    /* Determine SUBCLONES */
+    initialise();
+
+    if ( (f = fopen(SUBCLONES,"r")) == NULL ) return 1;
+
+    max_mtd[0] = '\0';
+    while ( fgets(line,sizeof(line),f) ) {
+	sscanf(line, "%s %s", mtd,last_cosmid);
+	if (strcmp(max_mtd,mtd) < 0) strcpy(max_mtd,mtd);
+    }
+
+    if ( ! max_mtd[0] ) return 1;
+
+    /*
+    ** generate next number
+    */
+    for (i = 0;isalpha(max_mtd[i]);i++) alpha[i] = max_mtd[i];
+    alpha[i] = '\0';
+    number = atoi(&max_mtd[i]) + 1;
+
+    sprintf(def_mtd,"%s%02d",alpha,number);
+    strcpy(def_cosmid, last_cosmid);
+    strcpy(def_method, methods[0]);
+    strcpy(def_vector, vectors[0]);
+    strcpy(def_size, sizes[0]);
+    strcpy(def_verify, "Yes");
+
+    return 0;
+}
+
+
+
+ask_mtd()
+{
+ ask:
+    printf("Microtitre dish [%s] ? ",def_mtd);
+    gets(ans_mtd);
+
+    if (strcmp(ans_mtd,"?")==0) {
+	printf("* Enter the name of the new microtitre dish to be added to the library.\n");
+	printf("  A microtitre dish must be a letter followed by two digits. Eg a87\n");
+	goto ask;
+    } else if (ans_mtd[0]) {
+	if (! islower(ans_mtd[0]) ||
+	    (strlen(ans_mtd)==3 ?
+	     ! isdigit(ans_mtd[1]) :
+	     ! islower(ans_mtd[1])) ||
+	    ! isdigit(ans_mtd[2]) ||
+	    (strlen(ans_mtd)==3 ?
+	     ans_mtd[3] :
+	     ! isdigit(ans_mtd[3]) ||
+	     ans_mtd[4])) {
+	    printf("! Invalid microtitre dish name.\n");
+	    printf("  A microtitre dish must be a letter followed by two digits. Eg a87\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_mtd,def_mtd);
+    }
+
+}
+
+
+
+
+ask_cosmid()
+{
+ ask:
+
+    printf ("Cosmid [%s] ? ",def_cosmid);
+    gets(ans_cosmid);
+
+    if ( strcmp(ans_cosmid,"?") == 0 ) {
+	printf("* Enter the name of the cosmid from which the subclones was made.\n");
+	printf("  NO check is made on the correctness of the value entered.\n");
+	goto ask;
+    } else if ( ans_cosmid[0] ){
+	int i;
+	for (i=0; i<strlen(ans_cosmid); i++) {
+	    if (i) {
+		if (! isupper(ans_cosmid[i]) &&
+		    ! isdigit(ans_cosmid[i])) break;
+	    } else {
+		if (! isupper(ans_cosmid[i])) break;
+	    }
+	}
+	if (i != strlen(ans_cosmid)) {
+	    printf("! Invalid cosmid name.\n");
+	    printf("  A cosmid must be an letter followed by alphanumeric characters only\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_cosmid,def_cosmid);
+    }
+}
+
+#define N(A) ( sizeof(A) / sizeof(A[0]) )
+ask_method()
+{
+ ask:
+    printf("Method of fragmentation [%s] ? ",def_method);
+    gets(ans_method);
+
+    if ( strcmp(ans_method,"?") == 0) {
+	int i;
+	printf("* Enter the fragmentation method used to make the subclones\n");
+	printf("  Enter one of:\n");
+	printf("    ");
+	for (i=0; i< N(methods); i++)
+	    printf("%s ",methods[i]);
+	printf("\n");
+	goto ask;
+    } else if (ans_method[0]) {
+	int i;
+	for (i=0; i<N(methods);i++) {
+	    if ( strncmp(ans_method,methods[i],MATCH) == 0) {
+		strcpy(ans_method,methods[i]);
+		break;
+	    }
+	}
+	if (i == N(methods)) {
+	    printf("! Invalid fragmentation method\n");
+	    printf("  A valid fragmentation method is one of:\n");
+	    printf("    ");
+	    for (i=0; i< N(methods); i++)
+		printf("%s ",methods[i]);
+	    printf("\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_method, def_method);
+    }
+}
+
+
+ask_vector()
+{
+ ask:
+    printf("Vector [%s] ? ",def_vector);
+    gets(ans_vector);
+
+    if ( strcmp(ans_vector,"?") == 0) {
+	int i;
+	printf("* Enter the vector being used for the subclones\n");
+	printf("  Enter one of:\n");
+	printf("    ");
+	for (i=0; i< N(vectors); i++)
+	    printf("%s ",vectors[i]);
+	printf("\n");
+	goto ask;
+    } else if (ans_vector[0] ) {
+	int i;
+	for (i=0; i<N(vectors);i++) {
+	    if ( strncmp(ans_vector,vectors[i],MATCH) == 0) {
+		strcpy(ans_vector,vectors[i]);
+		strcpy(def_size,sizes[i]);
+		break;
+	    }
+	}
+	if (i == N(vectors)) {
+	    printf("! Invalid fragmentation method\n");
+	    printf("  A valid fragmentation method is one of:\n");
+	    printf("    ");
+	    for (i=0; i< N(vectors); i++)
+		printf("%s ",vectors[i]);
+	    printf("\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_vector, def_vector);
+    }
+}
+
+
+
+
+ask_size()
+{
+ ask:
+    printf("Size range [%s] ? ",def_size);
+    gets(ans_size);
+    
+    if (strcmp(ans_size,"?") == 0) {
+	printf("* Enter the size of the insert in the vector\n");
+	printf("  The size should be a range in kilobases\n");
+	printf("  Eg 1-2\n");
+	goto ask;
+    } else if (ans_size[0]) {
+	int i;
+	for (i=0; isdigit(ans_size[i]); i++);
+	if (ans_size[i] == '-') {
+	    for (i++; isdigit(ans_size[i]); i++);
+	}
+	if (i != strlen(ans_size)) {
+	    printf("! Invalid size range specification.\n");
+            printf("  The size range should be a range in kilobases. Eg 1-2\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_size, def_size);
+    }
+}
+
+ask_comments()
+{
+    int i;
+    String ans_comment;
+    for (i=0; i<MAX_COMMENTS; i++) ans_comments[i][0] = '\0';
+    comments = 0;
+
+ ask:
+    printf("Comment #%d [] ? ",comments+1);
+    gets(ans_comment);
+
+    if (strcmp(ans_comment,"?") == 0) {
+	printf("* Enter a one line comment.\n");
+	printf("  Multiple comments can be entered. When you wish to have entered all the\n");
+	printf("  comments you wish, press return to the comment prompt.\n");
+	goto ask;
+    } else if (ans_comment[0]) {
+	strcpy(ans_comments[comments++],ans_comment);
+	if (comments == MAX_COMMENTS) {
+	    printf("! Maximum number if comments reached. Sorry.\n");
+	} else
+	    goto ask;
+    }
+
+}
+
+
+ask_verify()
+{
+    int i;
+
+    printf("\nHere are the values you entered:\n\n");
+    printf("Microtitre dish number: %s\n", ans_mtd);
+    printf("Cosmid:                 %s\n", ans_cosmid);
+    printf("Fragmentation method:   %s\n", ans_method);
+    printf("Vector:                 %s\n", ans_vector);
+    printf("Size range:             %s\n", ans_size);
+    for(i=0; i< comments; i++)
+	printf("Comment: #%d             %s\n", i+1, ans_comments[i]);
+    printf("\n");
+
+ ask:
+    printf("Are these values correct [%s] ? ", def_verify);
+    gets(ans_verify);
+    
+    if (strcmp(ans_verify,"?") == 0 ) {
+	printf("Enter `Yes' if these values are correct. They will then be appended to the\n");
+	printf("subclone libraries file.\n");
+	printf("Enter `No' to exit the program.\n");
+	goto ask;
+    } else if (ans_verify[0]) {
+	if ( strchr("YyNn",ans_verify[0]) == NULL ) {
+	    printf("Please enter `Yes' or `No'.\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_verify, def_verify);
+    }
+}
+
+
+
+update()
+{
+    FILE *f;
+    int i;
+    String output;
+#ifdef USER_DATA
+    struct passwd *pw;
+#endif
+    if ( (f = fopen(SUBCLONES,"a")) == NULL) return 1;
+
+    fprintf(f,"%s %s %s %s %s", ans_mtd, ans_cosmid, ans_size, ans_method, ans_vector);
+    for (i=0; i< comments; i++)
+	fprintf(f," (%s)", ans_comments[i]);
+
+#ifdef USER_DATA
+    pw = getpwuid(getuid());
+    fprintf(f," # %s",pw->pw_name);
+    shell_call("hostname",output, sizeof(output));
+    fprintf(f," %s",output);
+    shell_call("date",output, sizeof(output));
+    fprintf(f," %s",output);
+#endif
+
+    fprintf(f,"\n");
+    fclose(f);
+    return 0;
+}
+
+
+
+
+main ()
+{
+
+
+    printf("Maintain subclone database\n\n");
+
+    if ( set_defaults() ) {
+	fprintf (stderr,"Cannot open file %s\n",SUBCLONES);
+	exit(1);
+    }
+
+    printf("Database=%s\n\n",SUBCLONES);
+
+    ask_mtd();
+    ask_cosmid();
+    ask_method();
+    ask_vector();
+    ask_size();
+    ask_comments();
+
+    ask_verify();
+
+    if ( strchr("Yy",ans_verify[0]) != NULL) {
+	printf("Updating file %s with new values...",SUBCLONES);
+	update();
+	printf("done\n");
+    } else {
+	printf("No update made\n");
+    }
+
+
+}
diff --git a/src/update_subclones/subclones.syntax b/src/update_subclones/subclones.syntax
new file mode 100644
index 0000000..5eba956
--- /dev/null
+++ b/src/update_subclones/subclones.syntax
@@ -0,0 +1,37 @@
+Cosmid Subclone Libraries
+
+The cosmid subclone library is a file of the following format:
+
+<file>                 ::= <line> <file> |
+<line>                 ::= <microtitre-dish-no>
+			   <cosmid>
+			   <size-range>
+			   <fragmentation-method>
+			   <vector>
+			   <comment-list>
+			   <opt-info>
+<microtitre-dish-no>   ::= <string>
+<cosmid>               ::= <string>
+<size-range>           ::= <number>..<number>
+<fragmentation-method> ::= <string>
+<vector>               ::= <string>
+<comment-list>         ::= <comment> <comment-list> |
+<comment>              ::= ( <string> )
+<opt-info>             ::= <info> | 
+<info>                 ::= # <user-id> <host-id> <date>
+
+where:
+
+<string>  ::= any series of characters terminated by white space
+<number>  ::= integer number
+<user-id> ::= user id of person who created entry
+<host-id> ::= from where entry was created
+<date>    ::= date the entry was created
+
+
+Here (at MRC):
+
+<microtitre-dish-no> ::= <string><letter><digit><digit>
+
+
+
diff --git a/src/update_subclones/update_subclones.c b/src/update_subclones/update_subclones.c
new file mode 100644
index 0000000..865ffc3
--- /dev/null
+++ b/src/update_subclones/update_subclones.c
@@ -0,0 +1,427 @@
+#include <stdio.h>
+/*#define USER_DATA*/
+#ifdef USER_DATA
+#include <pwd.h>
+#endif
+#include "misc.h" /* IMPORT: is_file */
+
+/*
+** This is an incredible hack. The original is based on a csh script
+** Beware of horrid goto's etc
+*/
+typedef char String[200];
+String SUBCLONES; /* initialised in initialise() */
+String methods[] = {
+    "shear",
+    "Sau3A",
+    "Alu-1",
+};
+String vectors[] = {
+    "M13mp18",
+    "pBS",
+    "pEMBL9",
+    "pUC118",
+};
+String sizes[] = {
+    "1000..2000",
+    "6000..9000",
+    "6000..9000",
+    "9000..14000",
+};
+int MATCH = 3;
+
+/*
+** Defaults
+*/
+String def_mtd;
+String def_cosmid;
+String def_method;
+String def_vector;
+String def_size;
+String def_verify;
+
+/*
+** Answers
+*/
+String ans_mtd;
+String ans_cosmid;
+String ans_method;
+String ans_vector;
+String ans_size;
+#define MAX_COMMENTS 10
+String ans_comments[MAX_COMMENTS];
+int comments;
+String ans_verify;
+
+static void initialise()
+/*
+** Initialise miscellaneous variables
+*/
+{
+    char *subclones;
+
+    /* clonelib = subclones:$SUBCLONES */
+    if (is_file("subclones"))
+	strcpy(SUBCLONES,"subclones");
+    else {
+	subclones = (char *) getenv("SUBCLONES");
+	if (subclones == NULL)
+	    SUBCLONES[0] = '\0';
+	else if (is_file(subclones))
+	    strcpy(SUBCLONES,subclones);
+	else
+	    SUBCLONES[0] = '\0';
+    }
+}
+
+int set_defaults()
+{ 
+    FILE *f;
+    String line;
+    String last_cosmid;
+    String max_mtd;
+    String mtd;
+    String alpha;
+    int number;
+    int i;
+
+    /* Determine SUBCLONES */
+    initialise();
+	
+    if ( (f = fopen(SUBCLONES,"r")) == NULL ) return 1;
+
+    max_mtd[0] = '\0';
+    while ( fgets(line,sizeof(line),f) ) {
+	sscanf(line, "%s %s", mtd,last_cosmid);
+	if (strcmp(max_mtd,mtd) < 0) strcpy(max_mtd,mtd);
+    }
+
+    if ( ! max_mtd[0] ) return 1;
+
+    /*
+     * generate next number:
+     *
+     * <mtd>    ::= <prefix><letter><number>
+     * <prefix> ::= <prefix><letter> | <prefix><digit> |
+     * <letter> ::= [a-z]
+     * <digit>  ::= [0-9]
+     * <number> ::= <digit><digit>
+     */
+    {
+	int j = -1;
+	for (i = 0;isalnum(max_mtd[i]);i++) {
+	    if (isalpha(max_mtd[i])) j = i;
+	    alpha[i] = max_mtd[i];
+	}
+	alpha[++j] = '\0';
+	number = atoi(&max_mtd[j]) + 1;
+    }
+
+    sprintf(def_mtd,"%s%02d",alpha,number);
+    strcpy(def_cosmid, last_cosmid);
+    strcpy(def_method, methods[0]);
+    strcpy(def_vector, vectors[0]);
+    strcpy(def_size, sizes[0]);
+    strcpy(def_verify, "Yes");
+
+    return 0;
+}
+
+
+
+ask_mtd()
+{
+ ask:
+    printf("Microtitre dish [%s] ? ",def_mtd);
+    gets(ans_mtd);
+
+    if (strcmp(ans_mtd,"?")==0) {
+	printf("* Enter the name of the new microtitre dish to be added to the library.\n");
+	printf("  A microtitre dish must be a letter followed by two digits,\n  with a possible alphanumeric prefix. Eg a87\n");
+	goto ask;
+    } else if (ans_mtd[0]) {
+	int i,d,err;
+	for(err=0,i=0;!err && ans_mtd[i];i++) {
+	    if (isdigit(ans_mtd[i]))
+		d++; /* count trailing digits */
+	    else if (isalpha(ans_mtd[i]))
+		d = 0; /* set trailing digits to 0 */
+	    else
+		err = 1; /* not alpha-numeric */
+	}
+	if (err || i<3 || d!=2) {
+	    printf("! Invalid microtitre dish name.\n");
+	    printf("  A microtitre dish must be a letter followed by two digits,\n  with a possible alphanumeric prefix. Eg a87\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_mtd,def_mtd);
+    }
+
+}
+
+
+
+
+ask_cosmid()
+{
+ ask:
+
+    printf ("Cosmid [%s] ? ",def_cosmid);
+    gets(ans_cosmid);
+
+    if ( strcmp(ans_cosmid,"?") == 0 ) {
+	printf("* Enter the name of the cosmid from which the subclones was made.\n");
+	printf("  NO check is made on the correctness of the value entered.\n");
+	goto ask;
+    } else if ( ans_cosmid[0] ){
+	int i;
+	for (i=0; i<strlen(ans_cosmid); i++) {
+	    if (! isupper(ans_cosmid[i]) &&
+		! isdigit(ans_cosmid[i])) break;
+	}
+	if (i != strlen(ans_cosmid)) {
+	    printf("! Invalid cosmid name.\n");
+	    printf("  A cosmid must be an alphanumeric string\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_cosmid,def_cosmid);
+    }
+}
+
+#define N(A) ( sizeof(A) / sizeof(A[0]) )
+ask_method()
+{
+ ask:
+    printf("Method of fragmentation [%s] ? ",def_method);
+    gets(ans_method);
+
+    if ( strcmp(ans_method,"?") == 0) {
+	int i;
+	printf("* Enter the fragmentation method used to make the subclones\n");
+	printf("  Enter one of:\n");
+	printf("    ");
+	for (i=0; i< N(methods); i++)
+	    printf("%s ",methods[i]);
+	printf("\n");
+	goto ask;
+    } else if (ans_method[0]) {
+	int i;
+	for (i=0; i<N(methods);i++) {
+	    if ( strncmp(ans_method,methods[i],MATCH) == 0) {
+		strcpy(ans_method,methods[i]);
+		break;
+	    }
+	}
+	if (i == N(methods)) {
+	    printf("! Invalid fragmentation method\n");
+	    printf("  A valid fragmentation method is one of:\n");
+	    printf("    ");
+	    for (i=0; i< N(methods); i++)
+		printf("%s ",methods[i]);
+	    printf("\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_method, def_method);
+    }
+}
+
+
+ask_vector()
+{
+ ask:
+    printf("Vector [%s] ? ",def_vector);
+    gets(ans_vector);
+
+    if ( strcmp(ans_vector,"?") == 0) {
+	int i;
+	printf("* Enter the vector being used for the subclones\n");
+	printf("  Enter one of:\n");
+	printf("    ");
+	for (i=0; i< N(vectors); i++)
+	    printf("%s ",vectors[i]);
+	printf("\n");
+	goto ask;
+    } else if (ans_vector[0] ) {
+	int i;
+	for (i=0; i<N(vectors);i++) {
+	    if ( strncmp(ans_vector,vectors[i],MATCH) == 0) {
+		strcpy(ans_vector,vectors[i]);
+		strcpy(def_size,sizes[i]);
+		break;
+	    }
+	}
+	if (i == N(vectors)) {
+	    printf("! Invalid fragmentation method\n");
+	    printf("  A valid fragmentation method is one of:\n");
+	    printf("    ");
+	    for (i=0; i< N(vectors); i++)
+		printf("%s ",vectors[i]);
+	    printf("\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_vector, def_vector);
+    }
+}
+
+
+
+
+ask_size()
+{
+ ask:
+    printf("Size range [%s] ? ",def_size);
+    gets(ans_size);
+    
+    if (strcmp(ans_size,"?") == 0) {
+	printf("* Enter the size of the insert in the vector\n");
+	printf("  The size should be a range in bases\n");
+	printf("  Eg 1000..2000\n");
+	goto ask;
+    } else if (ans_size[0]) {
+	int i;
+	int ok = 0;
+	for (i=0; isdigit(ans_size[i]); i++);
+	if (ans_size[i] == '.') {
+	    if (ans_size[++i] == '.') {
+		for (i++; isdigit(ans_size[i]); i++);
+		ok = (i == strlen(ans_size));
+	    }
+	}
+	if (!ok) {
+	    printf("! Invalid size range specification.\n");
+            printf("  The size range should be a range in kilobases. Eg 1000..2000\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_size, def_size);
+    }
+}
+
+ask_comments()
+{
+    int i;
+    String ans_comment;
+    for (i=0; i<MAX_COMMENTS; i++) ans_comments[i][0] = '\0';
+    comments = 0;
+
+ ask:
+    printf("Comment #%d [] ? ",comments+1);
+    gets(ans_comment);
+
+    if (strcmp(ans_comment,"?") == 0) {
+	printf("* Enter a one line comment.\n");
+	printf("  Multiple comments can be entered. When you wish to have entered all the\n");
+	printf("  comments you wish, press return to the comment prompt.\n");
+	goto ask;
+    } else if (ans_comment[0]) {
+	strcpy(ans_comments[comments++],ans_comment);
+	if (comments == MAX_COMMENTS) {
+	    printf("! Maximum number if comments reached. Sorry.\n");
+	} else
+	    goto ask;
+    }
+
+}
+
+
+ask_verify()
+{
+    int i;
+
+    printf("\nHere are the values you entered:\n\n");
+    printf("Microtitre dish number: %s\n", ans_mtd);
+    printf("Cosmid:                 %s\n", ans_cosmid);
+    printf("Fragmentation method:   %s\n", ans_method);
+    printf("Vector:                 %s\n", ans_vector);
+    printf("Size range:             %s\n", ans_size);
+    for(i=0; i< comments; i++)
+	printf("Comment: #%d             %s\n", i+1, ans_comments[i]);
+    printf("\n");
+
+ ask:
+    printf("Are these values correct [%s] ? ", def_verify);
+    gets(ans_verify);
+    
+    if (strcmp(ans_verify,"?") == 0 ) {
+	printf("Enter `Yes' if these values are correct. They will then be appended to the\n");
+	printf("subclone libraries file.\n");
+	printf("Enter `No' to exit the program.\n");
+	goto ask;
+    } else if (ans_verify[0]) {
+	if ( strchr("YyNn",ans_verify[0]) == NULL ) {
+	    printf("Please enter `Yes' or `No'.\n");
+	    goto ask;
+	}
+    } else {
+	strcpy(ans_verify, def_verify);
+    }
+}
+
+
+update()
+{
+    FILE *f;
+    int i;
+    String output;
+#ifdef USER_DATA
+    struct passwd *pw;
+#endif
+    if ( (f = fopen(SUBCLONES,"a")) == NULL) return 1;
+
+    fprintf(f,"%s %s %s %s %s", ans_mtd, ans_cosmid, ans_size, ans_method, ans_vector);
+    for (i=0; i< comments; i++)
+	fprintf(f," (%s)", ans_comments[i]);
+
+#ifdef USER_DATA
+    pw = getpwuid(getuid());
+    fprintf(f," # %s",pw->pw_name);
+    shell_call("hostname",output, sizeof(output));
+    fprintf(f," %s",output);
+    shell_call("date",output, sizeof(output));
+    fprintf(f," %s",output);
+#endif
+
+    fprintf(f,"\n");
+    fclose(f);
+    return 0;
+}
+
+
+
+
+main ()
+{
+
+
+    printf("Maintain subclone database\n\n");
+
+    if ( set_defaults() ) {
+	fprintf (stderr,"Cannot open file '%s' ($SUBCLONES)\n",SUBCLONES);
+	exit(1);
+    }
+
+    printf("Database=%s\n\n",SUBCLONES);
+
+    ask_mtd();
+    ask_cosmid();
+    ask_method();
+    ask_vector();
+    ask_size();
+    ask_comments();
+
+    ask_verify();
+
+    if ( strchr("Yy",ans_verify[0]) != NULL) {
+	printf("Updating file %s with new values...",SUBCLONES);
+	update();
+	printf("done\n");
+    } else {
+	printf("No update made\n");
+    }
+
+
+}
diff --git a/src/vepe/eflt.h b/src/vepe/eflt.h
new file mode 100644
index 0000000..40234ec
--- /dev/null
+++ b/src/vepe/eflt.h
@@ -0,0 +1,36 @@
+/*
+** Line types for experiment file
+*/
+static char feature_ids[MAXIMUM_EFLTS][MAXIMUM_EFLT_LENGTH+1] = {
+    "CF", /*  0 cloning vector sequence file */
+    "CN", /*  1 clone name */
+    "CS", /*  2 cloning vector sequence present in sequence */
+    "CV", /*  3 cloning vector type */
+    "DR", /*  4 direction of read */
+    "DT", /*  5 date of experiment */
+    "EN", /*  6 experiment name */
+    "EX", /*  7 experimental notes */
+    "FM", /*  8 sequencing vector fragmentation method */
+    "LN", /*  9 local format trace file name */
+    "LT", /* 10 local format trace file type */
+    "MC", /* 11 machine on which experiment ran */
+    "MN", /* 12 machine generated trace file name */
+    "MT", /* 13 machine generated trace file type */
+    "OP", /* 14 operator */
+    "PN", /* 15 primer name */
+    "QR", /* 16 poor quality sequence present at right (3') end */
+    "SC", /* 17 sequencing vector cloning site */
+    "SF", /* 18 sequencing vector sequence file */
+    "SI", /* 19 sequencing vector insertion length */
+    "SL", /* 20 sequencing vector present at left (5') end */
+    "SP", /* 21 sequencing vector primer site (relative to cloning site) */
+    "SQ", /* 22 sequence */
+    "SR", /* 23 sequencing vector present at right (3') end */
+    "ST", /* 24 strands */
+    "SV", /* 25 sequencing vector type */
+    "TN", /* 26 template name */
+    "QL", /* 27 poor quality sequence present at left (5') end */
+    "PS", /* 28 processing status */
+    "CC", /* 29 comments */
+    "SS", /* 30 sequence to screen against */
+};
diff --git a/src/vepe/expFileIO.c b/src/vepe/expFileIO.c
new file mode 100644
index 0000000..b4e59a3
--- /dev/null
+++ b/src/vepe/expFileIO.c
@@ -0,0 +1,552 @@
+/*
+ * expFileIO.c
+ *
+ * Routines for reading and writing to experiment files.
+ *
+ * 1. Opening experiment files
+ * 2. Reading information from an experiment file
+ * 3. Appending to experiment files
+ * 4. Closing an opened experiment file
+ *
+ */
+
+#include <stdio.h>
+#include "expFileIO.h"
+#include "eflt.h"
+
+
+/* The only feature id that matters */
+#define SEQUENCE 22
+
+
+
+
+
+static int exp_get_feature_index(char *e)
+{
+    int i;
+    
+    for (i = 0; i < MAXIMUM_EFLTS; i++)
+	if(strcmp(feature_ids[i],e)==0) return i;
+    
+    return -1;
+}
+
+
+static Exp_info *exp_create_info()
+/*
+ * Allocate space for new experiment file information
+ */
+{
+    Exp_info *new;
+    int i;
+    
+    new = (Exp_info *) malloc(sizeof(Exp_info));
+    if (new != NULL) {
+	for(i=0; i< MAXIMUM_EFLTS ; i++) new->entry[i] = NULL;
+    }
+    new->fp = NULL;
+    
+    return new;
+}
+
+
+void exp_destroy_info(Exp_info *e)
+/*
+ * Destroy experiment file information
+ */
+{
+    int i;
+    if (e != NULL_Exp_info) {
+	for (i = 0; i < MAXIMUM_EFLTS; i++)
+	    if (e->entry[i] != NULL) free(e->entry[i]);
+	if (e->fp != NULL) fclose(e->fp);
+	free(e);
+    }
+}
+
+
+
+
+
+
+static char *exp_read_sequence(FILE *fp)
+/*
+ * Read from file a sequence, discarding all white space til a // is encountered
+ */
+{
+    char *seq;
+    int seql;
+    char line[EXP_FILE_LINE_LENGTH+1];
+    char *l;
+    
+    seql = 0;
+    seq = (char *)malloc(seql+1);
+    seq[0] = '\0';
+    
+    l = fgets(line,EXP_FILE_LINE_LENGTH,fp);
+    while (l!= NULL && strncmp(l,"//",2)) {
+	char *a, *b;
+	for(a=b=line;*a;a++)
+	    if (! isspace(*a)) *b++=*a;
+	*b = '\0';
+	seql = seql + b-line;
+	seq = (char *)realloc(seq,seql+1);
+	strcat(seq,line);
+	l = fgets(line,EXP_FILE_LINE_LENGTH,fp);
+    }
+    
+    return seq;
+}
+
+
+
+
+
+
+Exp_info *exp_read_info(char *file)
+/*
+ * Read in an experiment file and return handle
+ */
+{
+    Exp_info *e;
+    char line[EXP_FILE_LINE_LENGTH+1];
+    
+    e = exp_create_info();
+    
+    /*
+     * open for read
+     */
+    if ((e->fp = fopen(file,"r"))==NULL) {
+	exp_destroy_info(e);
+	return NULL_Exp_info;
+    }
+    
+    
+    if (e != NULL_Exp_info) {
+	while (fgets(line,EXP_FILE_LINE_LENGTH,e->fp) != NULL) {
+	    char *c;
+	    int entry;
+	    /*
+	     * zero terminate first argument
+	     * set c to point to second argument
+	     */
+	    for (c=line;*c && !isspace(*c); c++) ;
+	    if (*c) {
+		*c++ = '\0';
+	        for (;*c && isspace(*c); c++) ;
+	    }
+
+	    
+	    entry = exp_get_feature_index(line);
+	    if (entry >= 0) {
+		/* junk previous entry for same line type is there was one*/
+		if (e->entry[entry]) free(e->entry[entry]);
+		
+		if (entry == SEQUENCE) {
+		    e->entry[entry] = exp_read_sequence(e->fp);
+		} else {
+		    int l;
+		    l = strlen(c);
+		    if (!l) l = 1; /* handle zero length lines elegantly */
+		    e->entry[entry] = (char *)malloc(l);
+		    strncpy(e->entry[entry],c,l-1);
+		    e->entry[entry][l-1] = '\0';
+		}
+	    }
+	}
+    }
+    
+    fclose(e->fp);
+    /*
+     * reopen for appending
+     */
+    e->fp = fopen(file,"a");
+    
+    return e;
+    
+}
+
+
+static int exp_check_eid_read(Exp_info *e,int id)
+/*
+ * Check these are a valid combination and that
+ * an entry exists for read
+ */
+{
+    return (e == NULL || id < 0 || id >= MAXIMUM_EFLTS || e->entry[id] == NULL || feature_ids[id][0]=='\0');
+}
+
+static int exp_check_eid_write(Exp_info *e,int id)
+/*
+ * Check these are a valid combination and that
+ * an entry exists for write
+ */
+{
+    return (e == NULL || id < 0 || id >= MAXIMUM_EFLTS || e->fp == NULL || feature_ids[id][0]=='\0');
+}
+
+
+
+
+
+
+int exp_get_int(Exp_info *e, int id, int *val)
+/*
+ * Get the integer for entry id
+ * returns:
+ *    0 - success
+ *    1 - no entry
+ */
+{
+    if ( exp_check_eid_read(e,id) ) return 1;
+    *val = atoi(e->entry[id]);
+    return 0;
+}
+
+
+int exp_get_rng(Exp_info *e, int id, int *from, int *to)
+/*
+ * Get the integer pair for entry id
+ * returns:
+ *    0 - success
+ *    1 - no entry
+ */
+{
+    if ( exp_check_eid_read(e,id) ) return 1;
+    sscanf(e->entry[id],"%d..%d",from,to);
+    return 0;
+}
+
+
+
+int exp_get_str(Exp_info *e, int id, char *s, int_fl s_l)
+/*
+ * Get the string for entry id
+ * returns:
+ *    0 - success
+ *    1 - no entry
+ */
+{
+    if ( exp_check_eid_read(e,id) ) return 1;
+    strncpy(s,e->entry[id],s_l);
+    
+    return 0;
+}
+
+
+static int exp_append_str(Exp_info *e, int id, char *s)
+/*
+ * Append the string to experiment file for entry id
+ * returns:
+ *    0 - success
+ *    1 - no update
+ */
+{
+    char *copy;
+    if ( (copy = (char *)malloc(strlen(s)+1))==NULL ) return 1;
+    if (fprintf(e->fp,"%s   %s\n",feature_ids[id],s)<0) return 1;
+    /* copied and written successfully... now switch */
+    strcpy(copy,s);
+    free(e->entry[id]);
+    e->entry[id] = copy;
+    return 0;
+}
+
+
+int exp_put_int(Exp_info *e, int id, int *val)
+/*
+ * Append the integer for entry id to the experiment file
+ * returns:
+ *    0 - success
+ *    1 - no update
+ */
+{
+    char buf[EXP_FILE_LINE_LENGTH];
+    if ( exp_check_eid_write(e,id) ) return 1;
+    sprintf(buf,"%d",*val);
+    return exp_append_str(e,id,buf);
+}
+
+
+int exp_put_rng(Exp_info *e, int id, int *from, int *to)
+/*
+ * Append the integer pair for entry id to the experiment file
+ * returns:
+ *    0 - success
+ *    1 - no update
+ */
+{
+    char buf[EXP_FILE_LINE_LENGTH];
+    if ( exp_check_eid_write(e,id) ) return 1;
+    sprintf(buf,"%d..%d",*from,*to);
+    return exp_append_str(e,id,buf);
+}
+
+
+
+int exp_put_str(Exp_info *e, int id, char *s, int_fl s_l)
+/*
+ * Append the string for entry id to the experiment file
+ * returns:
+ *    0 - success
+ *    1 - no update
+ */
+{
+    if ( exp_check_eid_write(e,id) ) return 1;
+    /* don't allow multi-line entries to be written */
+    if ( id == SEQUENCE ) return 1;
+    return exp_append_str(e,id,s);
+}
+
+
+/*
+ * FORTRAN INTERFACE
+ */
+
+static void c2fstr(char *c, int max_c, char *f, int max_f)
+{
+#define min(A,B) ( (A) < (B) ? (A) : (B) )
+    int i;
+    i = strlen(c);
+    i = min(i,max_f);
+    i = min(i,max_c);
+    strncpy(f,c,i);
+    for( ; i<max_f; i++) f[i]=' ';
+    
+}
+
+static int fstrlen(char *f, int max_f)
+{
+    for (; max_f > 0 && (isspace(f[max_f-1]) || f[max_f-1]=='\0'); max_f--);
+    return max_f;
+}
+
+static void f2cstr(char *f, int max_f, char *c, int max_c)
+{
+    int i;
+    
+    i = min(fstrlen(f,max_f),max_c);
+    strncpy(c,f,i);
+    c[i]='\0';
+}
+
+
+
+
+
+/*************************************************************
+ * FORTRAN INTERFACE
+ *************************************************************/
+
+
+
+static int init_done = 0;
+static int NHandles = 0;
+static Exp_info **Handles = NULL;
+
+static int initialise()
+{
+    int i;
+    
+    if (init_done) return 0;
+    init_done++;
+    
+    NHandles = getdtablesize();
+    if (!NHandles) return 1;
+    
+    if ( (Handles = (Exp_info **)malloc(sizeof(Exp_info *) * NHandles)) == NULL) {
+	NHandles = 0;
+	return 1;
+    }
+    
+    for (i=0; i<NHandles; i++) Handles[i] = NULL;
+    
+    return 0;
+}
+
+
+static int get_free_handle()
+/*
+ * find a free entry in the Exp array
+ * returns -1 if there is none
+ */
+{
+    int i;
+    
+    (void) initialise();
+    
+    if (!NHandles) return -1; /* no slots! */
+    for (i=0; i<NHandles && Handles[i]!=NULL; i++) ;
+    return (i==NHandles)?-1:i;
+}
+
+
+static int check_handle(int_f *handle)
+{
+    return (handle == NULL ||
+	    (int) (*handle) <= 0 ||
+	    (int) (*handle) > NHandles);
+}
+
+
+
+int_f expopn_(char *fn, int_fl fn_l)
+/*
+ * FORTRAN interface to exp_open_file()
+ */
+{
+    char cfn[1025];
+    int handle;
+    
+    if ( (handle = get_free_handle()) >= 0 ) {
+	f2cstr(fn,fn_l,cfn,1024);
+	Handles[handle] = exp_read_info(cfn);
+    }
+    
+    return (int_f) (handle+1);
+}
+
+
+
+int_f expkil_(int_f *handle)
+/*
+ * FORTRAN interface to exp_destroy_info
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 0;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    exp_destroy_info(e);
+    
+    Handles[(int)(*handle)-1] = NULL;
+    *handle = 0;
+    
+    return 0;
+}
+
+int_f expri_(int_f *handle, int_f *id, int_f *val)
+/*
+ * FORTRAN interface to exp_get_int
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    return exp_get_int(e, (int)*id, (int *)val);
+}
+
+
+int_f exprr_(int_f *handle, int_f *id, int_f *from, int_f *to)
+/*
+ * FORTRAN interface to exp_get_rng
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    return exp_get_rng(e,(int)*id,(int *)from,(int *)to);
+    
+}
+
+int_f exprsa_(int_f *handle, int_f *id, char *s, int_f *max_len, int_fl s_l)
+/*
+ * FORTRAN interface to exp_get_str workalike
+ * NOTE: for use with FORTRAN CHARACTER arrays instead CHARACTER strings
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    if ( exp_check_eid_read(e,*id) ) return 1;
+    c2fstr(e->entry[*id],(int)*max_len,s,(int)*max_len);
+    return 0;
+}
+
+
+int_f exprs_(int_f *handle, int_f *id, char *s, int_fl s_l)
+/*
+ * FORTRAN interface to exp_get_str workalike
+ * NOTE: for use with FORTRAN CHARACTER strings instead CHARACTER arrays
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    if ( exp_check_eid_read(e,*id) ) return 1;
+    c2fstr(e->entry[*id],s_l,s,s_l);
+    return 0;
+}
+
+
+int_f expwi_(int_f *handle, int_f *id, int_f *val)
+/*
+ * FORTRAN interface to exp_put_int
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    return exp_put_int(e, (int)*id, (int *)val);
+}
+
+
+int_f expwr_(int_f *handle, int_f *id, int_f *from, int_f *to)
+/*
+ * FORTRAN interface to exp_put_rng
+ */
+{
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    return exp_put_rng(e, (int)*id, (int *)from, (int *)to);
+}
+
+
+int_f expwsa_(int_f *handle, int_f *id, char *s, int_f *max_len, int_fl s_l)
+/*
+ * FORTRAN interface to exp_put_str workalike
+ * NOTE: for use with FORTRAN CHARACTER arrays instead CHARACTER strings
+ */
+{
+    Exp_info *e;
+    char buf[EXP_FILE_LINE_LENGTH];
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    
+    if ( exp_check_eid_write(e,*id) ) return 1;
+    /* don't allow multi-line entries to be written */
+    if (*id == SEQUENCE ) return 1;
+    f2cstr(s,(int)*max_len,buf,sizeof(buf));
+    return exp_append_str(e,*id,buf);
+    
+}
+
+int_f expws_(int_f *handle, int_f *id, char *s, int_fl s_l)
+/*
+ * FORTRAN interface to exp_put_str workalike
+ * NOTE: for use with FORTRAN CHARACTER strings instead CHARACTER arrays
+ */
+{
+    char buf[EXP_FILE_LINE_LENGTH];
+    Exp_info *e;
+    if ( check_handle(handle) ) return 1;
+    e = (Exp_info *) Handles[(int)(*handle)-1];
+    
+    
+    if ( exp_check_eid_write(e,*id) ) return 1;
+    /* don't allow multi-line entries to be written */
+    if (*id == SEQUENCE ) return 1;
+    f2cstr(s,s_l,buf,sizeof(buf));
+    return exp_append_str(e,*id,buf);
+}
+
+
diff --git a/src/vepe/expFileIO.h b/src/vepe/expFileIO.h
new file mode 100644
index 0000000..a438968
--- /dev/null
+++ b/src/vepe/expFileIO.h
@@ -0,0 +1,173 @@
+#ifndef _read_exp_h
+#define _read_exp_h
+
+#include "fort.h"
+
+/*
+** Definitions
+*/
+#define MAXIMUM_EFLT_LENGTH     4
+#define MAXIMUM_EFLTS          32
+#define EFLT_FILE_LINE_LENGTH 128
+#define EXP_FILE_LINE_LENGTH  128
+
+typedef struct {
+    char *entry[MAXIMUM_EFLTS];
+    FILE *fp;
+} Exp_info;
+
+#define NULL_Exp_info ( (Exp_info *) NULL )
+
+
+
+
+
+ /*************************************************************************************/
+
+
+extern void exp_destroy_info(Exp_info *e);
+/*
+** Destroy experiment file information
+*/
+
+
+
+
+
+
+
+
+
+
+extern Exp_info *exp_read_info(char *file);
+/*
+** Read in an experiment file and return handle
+*/
+
+
+
+
+extern int exp_get_int(Exp_info *e, int id, int *val);
+/*
+** Get the integer for entry id
+** returns:
+**    0 - success
+**    1 - no entry
+*/
+
+
+extern int exp_get_rng(Exp_info *e, int id, int *from, int *to);
+/*
+** Get the integer pair for entry id
+** returns:
+**    0 - success
+**    1 - no entry
+*/
+
+
+extern int exp_get_str(Exp_info *e, int id, char *s, int_fl s_l);
+/*
+** Get the string for entry id
+** returns:
+**    0 - success
+**    1 - no entry
+*/
+
+
+extern int exp_put_int(Exp_info *e, int id, int *val);
+/*
+** Append the integer for entry id to the experiment file
+** returns:
+**    0 - success
+**    1 - no update
+*/
+
+
+extern int exp_put_rng(Exp_info *e, int id, int *from, int *to);
+/*
+** Append the integer pair for entry id to the experiment file
+** returns:
+**    0 - success
+**    1 - no update
+*/
+
+
+
+extern int exp_put_str(Exp_info *e, int id, char *s, int_fl s_l);
+/*
+** Append the string for entry id to the experiment file
+** returns:
+**    0 - success
+**    1 - no update
+*/
+
+
+/*
+** FORTRAN INTERFACE
+*/
+
+
+
+extern int_f expopn_(char *fn, int_fl fn_l);
+/*
+** FORTRAN interface to exp_open_file()
+*/
+
+extern int_f expkil_(int_f *le);
+/*
+** FORTRAN interface to exp_destroy_info
+*/
+
+extern int_f expri_(int_f *le, int_f *id, int_f *val);
+/*
+** FORTRAN interface to exp_get_int
+*/
+
+
+extern int_f exprr_(int_f *le, int_f *id, int_f *from, int_f *to);
+/*
+** FORTRAN interface to exp_get_rng
+*/
+
+
+extern int_f exprsa_(int_f *le, int_f *id, char *s, int_f *max_len, int_fl s_l);
+/*
+** FORTRAN interface to exp_get_str workalike
+** NOTE: for use with FORTRAN CHARACTER arrays instead CHARACTER strings
+*/
+
+extern int_f exprs_(int_f *le, int_f *id, char *s, int_fl s_l);
+/*
+** FORTRAN interface to exp_get_str workalike
+** NOTE: for use with FORTRAN CHARACTER strings instead CHARACTER arrays
+*/
+
+extern int_f expwi_(int_f *le, int_f *id, int_f *val);
+/*
+** FORTRAN interface to exp_put_int
+*/
+
+
+extern int_f expwr_(int_f *le, int_f *id, int_f *from, int_f *to);
+/*
+** FORTRAN interface to exp_put_rng
+*/
+
+
+
+extern int_f expwsa_(int_f *le, int_f *id, char *s, int_f *max_len, int_fl s_l);
+/*
+** FORTRAN interface to exp_put_str workalike
+** NOTE: for use with FORTRAN CHARACTER arrays instead CHARACTER strings
+*/
+
+
+
+extern int_f expws_(int_f *le, int_f *id, char *s, int_fl s_l);
+/*
+** FORTRAN interface to exp_put_str workalike
+** NOTE: for use with FORTRAN CHARACTER strings instead CHARACTER arrays
+*/
+
+
+#endif /* _read_exp_h */
diff --git a/src/vepe/expio.f b/src/vepe/expio.f
new file mode 100644
index 0000000..409b6bf
--- /dev/null
+++ b/src/vepe/expio.f
@@ -0,0 +1,135 @@
+C routines to read and write data to the experiment files
+C rexgel read gel
+C rexcvn read cloning vector name
+C rexsvn read sequencing vector name
+C rexsvc read cloning site
+C rexsvp read primer site
+C rexpdp read poor data positions
+C rexsvp read sequencing vector positions in reading
+C wexsvp write sequencing vector positions
+C wexcvp write cloning vector positions
+C
+C
+C
+      INTEGER FUNCTION REXGEL(IDEV,GEL,LENGTH)
+      CHARACTER GEL(LENGTH)
+      INTEGER EXPRSA
+      EXTERNAL EXPRSA
+C rexgel read gel
+      JOB = 22
+      REXGEL = EXPRSA(IDEV,JOB,GEL,LENGTH)
+      JOB = LENGTH
+      DO 10 I=1,JOB
+        IF(GEL(I).EQ.' ') THEN
+          LENGTH = I-1
+          RETURN
+        END IF
+ 10     CONTINUE
+      END
+      INTEGER FUNCTION REXCVN(IDEV,NAME)
+      CHARACTER NAME*(*)
+      INTEGER EXPRS
+      EXTERNAL EXPRS
+C rexcvn read cloning vector name
+      JOB = 3
+      REXCVN = EXPRS(IDEV,JOB,NAME)
+      END
+      INTEGER FUNCTION REXCVF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      INTEGER EXPRS
+      EXTERNAL EXPRS
+C rexcvf read cloning vector file name
+      JOB = 0
+      REXCVF = EXPRS(IDEV,JOB,NAME)
+      END
+      INTEGER FUNCTION REXSVF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      INTEGER EXPRS
+      EXTERNAL EXPRS
+C rexsvf read sequencing vector file name
+      JOB = 18
+      REXSVF = EXPRS(IDEV,JOB,NAME)
+      END
+      INTEGER FUNCTION REXSVN(IDEV,NAME)
+      CHARACTER NAME*(*)
+      INTEGER EXPRS
+      EXTERNAL EXPRS
+C rexsvn read sequencing vector name
+      JOB = 25
+      REXSVN = EXPRS(IDEV,JOB,NAME)
+      END
+      INTEGER FUNCTION REXSVC(IDEV,ICSITE)
+      INTEGER EXPRI
+      EXTERNAL EXPRI
+C rexsvc read cloning site
+      JOB = 17
+      REXSVC = EXPRI(IDEV,JOB,ICSITE)
+      IF(REXSVC.NE.0) ICSITE = 0
+      END
+      INTEGER FUNCTION REXSVQ(IDEV,IPSITE)
+      INTEGER EXPRI
+      EXTERNAL EXPRI
+C rexsvq read primer site
+      JOB = 21
+      REXSVQ = EXPRI(IDEV,JOB,IPSITE)
+      IF(REXSVQ.NE.0) IPSITE = 0
+      END
+      INTEGER FUNCTION REXPDP(IDEV,LEFT,IRIGHT)
+      INTEGER EXPRI
+      EXTERNAL EXPRI
+C rexpdp read poor data positions
+      LEFT = 0
+      JOB = 16
+      REXPDP = EXPRI(IDEV,JOB,IRIGHT)
+      IF(REXPDP.NE.0) THEN
+        IRIGHT = 0
+      END IF
+      JOB = 27
+      REXPDP = EXPRI(IDEV,JOB,LEFT)
+      IF(REXPDP.NE.0) THEN
+        LEFT = 0
+      END IF
+      END
+      INTEGER FUNCTION REXSVP(IDEV,LEFT,IRIGHT)
+      INTEGER EXPRI
+      EXTERNAL EXPRI
+C rexsvp read sequencing vector positions
+      JOB = 20
+      REXSVP = EXPRI(IDEV,JOB,LEFT)
+      IF(REXSVP.NE.0) THEN
+        LEFT = 0
+      END IF
+      JOB = 23
+      REXSVP = EXPRI(IDEV,JOB,IRIGHT)
+      IF(REXSVP.NE.0) THEN
+        IRIGHT = 0
+      END IF
+      END
+      INTEGER FUNCTION REXCVP(IDEV,LEFT,IRIGHT)
+      INTEGER EXPRR
+      EXTERNAL EXPRR
+C rexcvp read cloning vector positions
+      JOB = 2
+      REXCVP = EXPRR(IDEV,JOB,LEFT,IRIGHT)
+      IF(REXCVP.NE.0) THEN
+        LEFT = 0
+        IRIGHT = 0
+      END IF
+      END
+      INTEGER FUNCTION WEXCVP(IDEV,LEFT,IRIGHT)
+      INTEGER EXPWR
+      EXTERNAL EXPWR
+C rexpst write cloning vector positions
+      JOB = 2
+      WEXCVP = EXPWR(IDEV,JOB,LEFT,IRIGHT)
+      END
+      INTEGER FUNCTION WEXSVP(IDEV,LEFT,IRIGHT)
+      INTEGER EXPWI
+      EXTERNAL EXPWI
+C rexpst write sequencing vector positions
+      JOB = 20
+      WEXSVP = EXPWI(IDEV,JOB,LEFT)
+      IF(WEXSVP.NE.0) RETURN
+      JOB = 23
+      WEXSVP = EXPWI(IDEV,JOB,IRIGHT)
+      END
diff --git a/src/vepe/makefile-alpha b/src/vepe/makefile-alpha
new file mode 100644
index 0000000..ff9fb35
--- /dev/null
+++ b/src/vepe/makefile-alpha
@@ -0,0 +1,136 @@
+# Makefile for vepe - Alpha version
+# -------------------------------
+
+PROGS= vepe
+
+all: $(PROGS)
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC = .
+#STADENSRC=/nfs/rutland/home3/pubseq/inhouse/share/src/staden
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)/alpha-binaries
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(STADENSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= cc
+CFLAGS= -g -std1 -ieee_with_inexact $(INCLUDES)
+F77= f77
+FFLAGS= -g
+LD= f77
+LDFLAGS= -g -non_shared
+
+FLIBS= -lfor -lutil -lUfor -lots -lFutil -lm -lfor -lutil -lUfor -lots -lFutil
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+#LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS)
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a\
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+
+#
+# Vector excision program
+#
+
+VEP=\
+	vep.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+VEPE=\
+	vepe.o\
+	$(STADENBIN)/dias89.a\
+	expio.o\
+	expFileIO.o
+
+
+VEPEOBJS=\
+	$(VEPE)\
+	$(COMMONOBJS)
+
+
+vepe: $(VEPEOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPEOBJS)  $(LFLAGS)
+
+COBJS =  expFileIO.o testc.o
+FOBJS =  expFileIO.o testf.o
+
+
+testc: $(COBJS)
+	$(CC) $(CFLAGS) -o $@ $(COBJS)
+
+testf:  $(FOBJS)
+	$(F77) $(FFLAGS) -o $@ $(FOBJS)
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
+
+
+install: clean all
diff --git a/src/vepe/makefile-dec b/src/vepe/makefile-dec
new file mode 100644
index 0000000..a55a27f
--- /dev/null
+++ b/src/vepe/makefile-dec
@@ -0,0 +1,139 @@
+# Makefile for vepe - dec ultrix version
+# --------------------------------------
+
+PROGS= vepe
+
+all: $(PROGS)
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+#STADENSRC=/nfs/rutland/home3/pubseq/inhouse/share/src/staden
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+MISCSRC=$(SRC)/../Misc
+MISCBIN=$(MISCSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(STADENSRC) -I$(MISCSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= c89
+CFLAGS= -g -common $(INCLUDES)
+F77= f77
+FFLAGS= -g
+LD= f77
+LDFLAGS= -g
+
+FLIBS=
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+#LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS)
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	$(STADENBIN)/seeme-dec.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a\
+	$(MISCBIN)/misc.a
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+
+#
+# Vector excision program
+#
+
+VEP=\
+	vep.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+VEPE=\
+	vepe.o\
+	$(STADENBIN)/dias89.a\
+	expio.o\
+	expFileIO.o
+
+
+VEPEOBJS=\
+	$(VEPE)\
+	$(COMMONOBJS)
+
+
+vepe: $(VEPEOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPEOBJS)  $(LFLAGS)
+
+COBJS =  expFileIO.o testc.o
+FOBJS =  expFileIO.o testf.o
+
+
+testc: $(COBJS)
+	$(CC) $(CFLAGS) -o $@ $(COBJS)
+
+testf:  $(FOBJS)
+	$(F77) $(FFLAGS) -o $@ $(FOBJS)
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
+
+
+install: clean all
diff --git a/src/vepe/makefile-sgi b/src/vepe/makefile-sgi
new file mode 100644
index 0000000..63ae8b6
--- /dev/null
+++ b/src/vepe/makefile-sgi
@@ -0,0 +1,146 @@
+# Makefile for vepe - SGI version
+# -------------------------------
+
+PROGS= vepe
+
+SHELL=/bin/sh
+
+
+
+
+all: $(PROGS)
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(STADENSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= cc
+CFLAGS= -g -ansi -prototypes $(INCLUDES)
+F77= f77
+FFLAGS= -g
+LD= cc
+LDFLAGS= -g
+
+FLIBS= -lF77 -lU77 -lm -lI77 -lisam
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS)
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	$(STADENBIN)/seeme-sg.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+
+#
+# Vector excision program
+#
+
+VEP=\
+	vep.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+VEPE=\
+	vepe.o\
+	$(STADENBIN)/dias89.a\
+	expio.o\
+	expFileIO.o
+
+
+VEPEOBJS=\
+	$(VEPE)\
+	$(COMMONOBJS)
+
+
+vepe: $(VEPEOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPEOBJS)  $(LFLAGS)
+
+COBJS =  expFileIO.o testc.o
+FOBJS =  expFileIO.o testf.o
+
+
+testc: $(COBJS)
+	$(CC) $(CFLAGS) -o $@ $(COBJS)
+
+testf:  $(FOBJS)
+	$(F77) $(FFLAGS) -o $@ $(FOBJS)
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+
+
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
+
+
+install: clean all
diff --git a/src/vepe/makefile-solaris b/src/vepe/makefile-solaris
new file mode 100644
index 0000000..65fe955
--- /dev/null
+++ b/src/vepe/makefile-solaris
@@ -0,0 +1,154 @@
+# Makefile for vepe - Solaris (sparc) version
+# -------------------------------------------
+
+PROGS= vepe
+
+
+
+
+all: $(PROGS)
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+#STADENSRC=/nfs/rutland/home3/pubseq/inhouse/share/src/staden
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(STADENSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(INCLUDES)
+# KFLAGS - for compiling non ANSI C modules
+KFLAGS= -g $(INCLUDES)
+F77= f77
+FFLAGS= -ansi -g -C -Nx500
+
+LD	= f77
+SPRO	= /opt/SUNWspro/SC2.0.1
+LDFLAGS	= -Bstatic -g -L$(SPRO) -L/usr/ccs/lib -L/usr/ucblib -L/usr/openwin/lib
+LIBS	= -lm -lucb -lelf -Bdynamic -ldl -Bstatic -lsocket -lnsl -lintl
+FLIBS	= -lV77 -lF77 -lM77
+XLIBS	= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS	= $(FLIBS) $(LIBS)
+XLFLAGS	= $(FLIBS) $(XLIBS) $(LIBS)
+
+# For stdargs on the Sun
+#GNULIB = /usr/local/lib/gcc-gnulib
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+
+#
+# Vector excision program
+#
+
+VEP=\
+	vep.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+VEPE=\
+	vepe.o\
+	$(STADENBIN)/dias89.a\
+	expio.o\
+	expFileIO.o
+
+
+VEPEOBJS=\
+	$(VEPE)\
+	$(COMMONOBJS)
+
+
+vepe: $(VEPEOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPEOBJS)  $(LFLAGS)
+
+COBJS =  expFileIO.o testc.o
+FOBJS =  expFileIO.o testf.o
+
+
+testc: $(COBJS)
+	$(CC) $(CFLAGS) -o $@ $(COBJS)
+
+testf:  $(FOBJS)
+	$(F77) $(FFLAGS) -o $@ $(FOBJS)
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+
+
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
+
+
+install: clean all
diff --git a/src/vepe/makefile-sun b/src/vepe/makefile-sun
new file mode 100644
index 0000000..4ea0a53
--- /dev/null
+++ b/src/vepe/makefile-sun
@@ -0,0 +1,153 @@
+# Makefile for vepe - SunOS (sparc) version
+# ----------------------------------------
+
+PROGS= vepe
+
+
+
+
+all: $(PROGS)
+
+
+
+
+
+#
+# ``Interesting'' suffices that make needs to know about
+#
+.SUFFIXES: .a $(SUFFIXES)
+
+
+#
+# Directories
+#
+SRC=.
+#STADENSRC=/nfs/rutland/home3/pubseq/inhouse/share/src/staden
+STADENSRC=$(SRC)/../staden
+STADENBIN=$(STADENSRC)
+# Use the ted sources compiled for staden executables
+INCLUDES= -I$(STADENSRC)
+
+
+#
+# Compilers and flags
+#
+# debug on
+CC= gcc
+GCCWARNINGS = -Wreturn-type -Wunused -Wswitch -Wcomment -W -Wformat \
+	      -Wuninitialized
+CFLAGS= -g -ansi -pedantic $(GCCWARNINGS) $(INCLUDES)
+# KFLAGS - for compiling non ANSI C modules
+KFLAGS= -g $(INCLUDES)
+F77= f77
+#FFLAGS= -ansi -g -C -Nx500
+FFLAGS= -ansi -O -Nx500
+LD= gcc
+LDFLAGS= -g
+
+FLIBS= -lF77 -lI77 -lU77 -lm
+XLIBS= -lXaw -lXt -lXmu -lXext -lX11
+LFLAGS= $(FLIBS)
+XLFLAGS= $(FLIBS) $(XLIBS)
+
+# For stdargs on the Sun
+#GNULIB = /usr/local/lib/gcc-gnulib
+
+#
+# Avoid compilation of objects in remote directories
+#
+$(STADENBIN)/%.o : $(STADENSRC)/%.f
+$(STADENBIN)/%.o : $(STADENSRC)/%.c
+
+#
+# Common objects, needed by every program
+#
+COMMONOBJS=\
+	$(STADENBIN)/seeme.o\
+	$(STADENBIN)/nxspec.o\
+	$(STADENBIN)/userface.o\
+	$(STADENBIN)/userfacecom.o\
+	$(STADENBIN)/helpnmenu.o\
+	$(STADENBIN)/nxhelpmenu.o\
+	$(STADENBIN)/FtoC.o\
+	$(STADENBIN)/subs89.a
+
+
+#
+# Building the programs
+# This should be just a linking phase because all of the object
+# files and library files are generated using implicit rules.
+# We use the fortran compiler to do linking.
+#
+
+#
+# Vector excision program
+#
+
+VEP=\
+	vep.o\
+	$(STADENBIN)/dias89.a\
+	$(STADENBIN)/subs89.a
+
+
+VEPOBJS=\
+	$(VEP)\
+	$(COMMONOBJS)
+
+
+vep: $(VEPOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPOBJS)  $(LFLAGS)
+
+
+
+VEPE=\
+	vepe.o\
+	$(STADENBIN)/dias89.a\
+	expio.o\
+	expFileIO.o
+
+
+VEPEOBJS=\
+	$(VEPE)\
+	$(COMMONOBJS)
+
+
+vepe: $(VEPEOBJS)
+	$(LD) $(LDFLAGS) -o $@ $(VEPEOBJS)  $(LFLAGS)
+
+COBJS =  expFileIO.o testc.o
+FOBJS =  expFileIO.o testf.o
+
+
+testc: $(COBJS)
+	$(CC) $(CFLAGS) -o $@ $(COBJS)
+
+testf:  $(FOBJS)
+	$(F77) $(FFLAGS) -o $@ $(FOBJS)
+
+#
+# Files requiring simple FORTRAN compilation (thing.f -> thing.o)
+# 
+%.o: $(SRC)/%.f
+	$(F77) $(FFLAGS) -c $<
+
+
+#
+# Files requiring simple C compilation (thing.c -> thing.o)
+# 
+%.o: %.c
+	$(CC) $(CFLAGS) -c $<
+%.o: $(SRC)/%.c
+	$(CC) $(CFLAGS) -c $<
+
+
+
+
+
+
+
+clean:
+	rm -f $(PROGS) *.o *.a *~
+
+
+install: clean all
diff --git a/src/vepe/vepe.f b/src/vepe/vepe.f
new file mode 100644
index 0000000..3524365
--- /dev/null
+++ b/src/vepe/vepe.f
@@ -0,0 +1,761 @@
+C 18 Feb 1993 forced cloning vector clipped regions to be >MINVEC long
+C 20 October 1992
+C   IDEVE was being set to 0 before EXPKIL() and so the C files
+C   were never being closed.
+C
+      PARAMETER (MAXSEQ = 50000,
+     +           MAXWLN = 6,
+     +           LCONST = 4*MAXWLN,
+     +           MAXWRD = 4**MAXWLN,
+     +           MAXDEV = 5,
+     +           NAMLEN = 80)
+      CHARACTER SEQV(MAXSEQ*2),SEQG(MAXSEQ),SEQC(MAXSEQ)
+      INTEGER WORDP(MAXWRD),POSN(MAXSEQ)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER CONSTS(0:LCONST),DEVNOS(MAXDEV)
+      INTEGER SEQVI(MAXSEQ),SEQGI(MAXSEQ),RC,POORR,POORL
+      CHARACTER*(NAMLEN) FILNAM,HELPF,NAME,FILNMV
+      PARAMETER (MAXPRM = 22)
+      CHARACTER PROMPT(2)*(MAXPRM)
+      INTEGER GNFFOF,REXGEL,REXSVF,REXSVC,REXSVP,REXPDP,REXCVF
+      INTEGER WEXSVP,WEXCVP,SVLCLP,SVRCLP,EXPOPN,REXSVQ
+      EXTERNAL GNFFOF,REXGEL,REXSVF,REXSVC,REXSVP,REXPDP,REXCVF
+      EXTERNAL WEXSVP,WEXCVP,EXPOPN,REXSVQ
+C
+C  new experiment file version of vep (vepe)
+C
+C
+C
+C This routine prepares a reading for the assembly program.
+C It compares the sequence against vectors and marks any found
+C Vector clipping is of 4 types:
+C 1) find 5' cloning site
+C 2) look for 3' cloning site
+C 3) test for insert being all vector
+C 4) look for cosmid "vector"
+C The first 3 can all be performed using one sequence, and are made easier
+C by telling the program exactly where the cloning site is, and which of
+C the 3 it is performing. Types 1,2 and 3 look only in one orientation,
+C whereas 4 should check both strands. Additionally they
+C differ in their outcomes: 1 and 2 write a new file with the clippoints
+C marked, 3 scrubs the reading (does not add it to a file of file names),
+C 4 writes out a new file if the reading contains some non vector sequence.
+C The clip should be marked differently for cosmid vector, just so we know where
+C it is (it can be tagged).
+C Clipping off crap should probably be done first by looking at the traces.
+C What do we need to tell the program?
+C 1) name of vector file
+C 2) position of cloning site
+C 3) position of primer
+C It would be helpful to have a standard orientation for vector sequences
+C for example Cloning site------ ... ------etis gninolC for ? strand
+C NOTE that vector file should not have <---name.0001----> at the start
+C
+C error values written to file of failed names
+C
+C 1 couldnt open expt file
+C 2 couldnt get reading from expt file
+C 3 reading too short
+C 4 couldnt find vector filename in expt file
+C 5 couldnt find cloning site in expt file
+C 6 couldnt find primer site in expt file
+C 7 couldnt open vector file
+C 8 failed to write to expt file
+C 9 completely vector
+C
+      ICG = 0
+      ICB = 0
+      IPG = 0
+      IDM = 5
+      CALL INITLU(IDM)
+      CALL UNITNO(KBIN,KBOUT,DEVNOS,MAXDEV)
+      IDEVNI = DEVNOS(1)
+      IDEVV = DEVNOS(2)
+      IDEVNO = DEVNOS(3)
+      IDEVNF = DEVNOS(5)
+      WRITE(KBOUT,*)'vepe v2.0: vector excising program. June 92'
+      PROMPT(1) = 'Mark sequencing vector'
+      PROMPT(2) = 'Mark cloning vector'
+      JOB = 1
+      CALL RADION('Select task',PROMPT,2,JOB,
+     +IHELPS,IHELPE,HELPF,IDEVH,KBIN,KBOUT)
+      IF(JOB.LT.1) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNI,FILNAM,0,IOK,KBIN,KBOUT,
+     +'Input file of file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNO,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Output file of passed file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      FILNAM = ' '
+      CALL OPENF1(IDEVNF,FILNAM,1,IOK,KBIN,KBOUT,
+     +'Output file of failed file names',
+     +IHELPS,IHELPE,HELPF,IDEVH)
+      IF(IOK.NE.0) STOP
+      MN = 2
+      MX = MAXWLN
+      LENGTH = 4
+      CALL GETINT(MN,MX,LENGTH,
+     +'Word length',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      LENGTH = IVAL
+      MN = 1
+      MX = 11
+      LW = 7
+      CALL GETINT(MN,MX,LW,
+     +'Number of diagonals to combine',
+     +IVAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      LW = IVAL
+      XMN = 0.1
+      XMX = 1.0
+      CUT = 0.35
+      CALL GETRL(XMN,XMX,CUT,
+     +'Cutoff score',
+     +VAL,KBIN,KBOUT,
+     +IHELPS,IHELPE,HELPF,IDEVH,IOK)
+      IF(IOK.NE.0) STOP
+      CUT = VAL
+C
+C set initial values so we hash the first vector
+C
+      FILNMV = ' '
+      ICSITT = 0
+      IPSITT = 0
+      FILNAM = 'UNLIKELY FILE NAME'
+      ICSITE = ICSITT
+      IPSITE = IPSITT
+      IDE = (IDM-1)**LENGTH
+      CALL SETCN(CONSTS,LENGTH,IDM,LCONST)
+C
+      IDEVE = 0
+ 10   CONTINUE
+C
+C main loop: get the info we need from the experiment file
+C            for sequencing vector (job=1) we need:
+C            the reading
+C            its poor data pointers
+C            the vector
+C            the cloning site
+C            the primer position
+C if poor data pointers are zero assume all ok
+C if vector, cloning site, primer site not found skip
+C
+C            for cloning vector (job=2) we need:
+C            the reading
+C            its sequencing vector pointers
+C            its poor data pointers
+C            the vector
+C
+C if sequencing vector pointers are zero use poor data pointers
+C if poor data pointers are zero assume all ok
+C if vector not found skip
+C
+C For both jobs shuffle sequence so we only process the unmarked segment
+C then add on marker values at end
+C Only hash vector if its different from the last one.
+C
+C Output: markers relative to the left end of the reading and name to file of
+C         passed file names. Plus a summary at the end.
+C
+C Make sure the experiment file is closed
+      CALL EXPKIL(IDEVE)
+C Get next experiment file name
+      IOK = GNFFOF(IDEVNI,NAME)
+      IF(IOK.EQ.1) THEN
+        WRITE(KBOUT,*)
+     + 'Finished after processing',JGEL,' files and finding'
+        WRITE(KBOUT,*)ICB,' completely vector'
+        WRITE(KBOUT,*)IPG,' partly vector'
+        WRITE(KBOUT,*)ICG,' free of vector'
+        STOP
+      ELSE IF(IOK.EQ.2) THEN
+        CALL ERROM(KBOUT,'Empty line in file of file names')
+        GO TO 10
+      ELSE IF(IOK.EQ.3) THEN
+        CALL ERROM(KBOUT,'Error reading file of file names')
+        GO TO 10
+      END IF
+      IDEVE = EXPOPN(NAME)
+      IF(IDEVE.EQ.0) THEN
+        CALL AERROR(KBOUT,IDEVNF,NAME,1)
+        CALL ERROM(KBOUT,'Error opening experiment file')
+        GO TO 10
+      END IF
+      IDIMGI = MAXSEQ
+      IOK = REXGEL(IDEVE,SEQG,IDIMGI)
+      IF(IOK.NE.0) THEN
+        CALL AERROR(KBOUT,IDEVNF,NAME,2)
+        CALL ERROM(KBOUT,'Error getting gel reading')
+        GO TO 10
+      END IF
+      JGEL = JGEL + 1
+      WRITE(KBOUT,*)'>>>> Read number',JGEL,' length',IDIMGI,' ',NAME
+C  LONG ENOUGH ?
+      IF(IDIMGI.LT.LENGTH)THEN
+        CALL AERROR(KBOUT,IDEVNF,NAME,3)
+        CALL ERROM(KBOUT,'Gel reading too short to compare')
+        GO TO 10
+      END IF
+      IDIMG = IDIMGI
+C
+C
+      IF(JOB.EQ.1) THEN
+C
+C   Sequencing vector clipping
+C
+C   get vector name, csite (cloning site), ipsite (primer site),
+C   poorl (poor data left mark), poorr
+C
+       IOK = REXSVF(IDEVE,FILNMV)
+       IF(IOK.NE.0) THEN
+         CALL AERROR(KBOUT,IDEVNF,NAME,4)
+         CALL ERROM(KBOUT,'Error reading vector file name')
+         CALL ERROM(KBOUT,FILNMV)
+         GO TO 10
+       END IF
+       IOK = REXSVC(IDEVE,ICSITT)
+       IF(IOK.NE.0) THEN
+         CALL AERROR(KBOUT,IDEVNF,NAME,5)
+         CALL ERROM(KBOUT,'Error reading cloning site')
+         GO TO 10
+       END IF
+       IOK = REXSVQ(IDEVE,IPSITT)
+       IF(IOK.NE.0) THEN
+         CALL AERROR(KBOUT,IDEVNF,NAME,6)
+         CALL ERROM(KBOUT,'Error reading primer site')
+         GO TO 10
+       END IF
+       IOK = REXPDP(IDEVE,POORL,POORR)
+       IF(IOK.NE.0) THEN
+C         CALL ERROM(KBOUT,
+C     +   'Error reading poor data positions, zero assumed')
+       END IF
+C
+C   Decided to screen for primer in the poor data at the left end even
+C   though the program expects to ignore it: set poorl=0
+      POORL = 0
+C
+C  IF ANY OF THESE ARE MISSING DO SOMETHING SENSIBLE !!!!!!!!!!!!!
+C
+C   if filnam != current or icsitt != icsite or ipsitt != icsite
+C      then reorganise vector
+C
+      IF((FILNMV.NE.FILNAM).OR.
+     +   (ICSITT.NE.ICSITE).OR.
+     +   (IPSITT.NE.IPSITE))    THEN
+        CALL OPENRS(IDEVV,FILNMV,IOK,LRECL,2)
+        IF(IOK.NE.0) THEN
+          CALL AERROR(KBOUT,IDEVNF,NAME,7)
+          CALL ERROM(KBOUT,'Error opening vector file')
+          CALL ERROM(KBOUT,FILNMV)
+          GO TO 10
+        END IF
+        FILNAM = FILNMV
+        ICSITE = ICSITT
+        IPSITE = IPSITT
+        IDIMV  = MAXSEQ
+        CALL ARRFIL(IDEVV,SEQV,IDIMV,KBOUT)
+        CLOSE(UNIT=IDEVV)
+C  check for contig header (should not be there)
+          IF(SEQV(20).EQ.'>') THEN
+            CALL SHFLCA(SEQV,MAXSEQ,21,1,IDIMV)
+            IDIMV = IDIMV - 20
+          END IF
+        WRITE(KBOUT,*)'Vector length =',IDIMV
+C make cloning site end of seq, then start of seq is icsite + 1
+        CALL SQCOPY(SEQV(1),SEQV(IDIMV+1),ICSITE)
+C if forward primer then need to complement vector
+        IF(IPSITE.GT.0) THEN
+          CALL SQREV(SEQV(ICSITE+1),IDIMV)
+          CALL SQCOM(SEQV(ICSITE+1),IDIMV)
+        END IF
+        CALL CONNUM(SEQV(ICSITE+1),SEQVI,IDIMV)
+        CALL ENCONC(SEQVI,IDIMV,POSN,WORDP,IDE,IDM,CONSTS,LENGTH,
+     +  LCONST)
+      END IF
+C
+C now hash the reading
+C
+C clip reading so only the good data is processed
+C
+C        WRITE(*,*)'POORL,POORR',POORL,POORR
+C        WRITE(*,*)(SEQG(K),K=1,IDIMG)
+C        WRITE(*,*)'IDIMG',IDIMG
+        IF(POORR.EQ.0) POORR = IDIMG + 1
+        IF(POORL.GT.0) THEN
+          CALL SHFLCA(SEQG,MAXSEQ,POORL+1,1,POORR-1)
+        END IF
+        IDIMG = POORR - POORL - 1
+C        WRITE(*,*)'IDIMG',IDIMG
+C        WRITE(*,*)(SEQG(K),K=1,IDIMG)
+        CALL CONNUM(SEQG,SEQGI,IDIMG)
+        CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +  LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +  IPSITE)
+C
+C 1 nothing found either end   lc = 0, rc = idimg+1
+C 2 primer only found          lc = right end of primer, rc = idimg+1
+C 3 run into vector only       lc = 0  rc = start of vector
+C 4 primer found and vector run into lc = right end of primer, 
+C                                    rc = start of vector
+CC
+C if rc = idimg+1 theres no vector at the right end of the reading
+C
+C        WRITE(*,*)'LC,RC',LC,RC
+        IF(RC-LC+1.LT.20) THEN
+          CALL AERROR(KBOUT,IDEVNF,NAME,9)
+          ICB = ICB + 1
+          GO TO 10
+        END IF
+C
+C  note below we need to add on what weve cutoff
+C
+        IF((LC.EQ.0).AND.(RC.EQ.IDIMG+1)) THEN
+C
+C No sequencing vector found
+C
+          LCO = 0
+          IRCO = IDIMGI + 1
+          IOK = WEXSVP(IDEVE,LCO,IRCO)
+          IF(IOK.NE.0) THEN
+            CALL AERROR(KBOUT,IDEVNF,NAME,8)
+            GO TO 10
+          END IF
+          ICG = ICG + 1
+        ELSE IF((LC.NE.0).AND.(RC.NE.IDIMG+1)) THEN
+C
+C Primer found and run into sequencing vector
+C
+          LCO = LC + POORL
+          IRCO = RC + POORL
+          IOK = WEXSVP(IDEVE,LCO,IRCO)
+          IF(IOK.NE.0) THEN
+            CALL AERROR(KBOUT,IDEVNF,NAME,8)
+            GO TO 10
+          END IF
+          IPG = IPG + 1
+        ELSE IF(LC.EQ.0) THEN
+C
+C Run into vector only
+C
+          LCO = 0
+          IRCO = RC + POORL
+          IOK = WEXSVP(IDEVE,LCO,IRCO)
+          IF(IOK.NE.0) THEN
+            CALL AERROR(KBOUT,IDEVNF,NAME,8)
+            GO TO 10
+          END IF
+          IPG = IPG + 1
+        ELSE IF(RC.EQ.IDIMG+1) THEN
+C
+C Primer only found
+C
+          LCO = LC + POORL
+          IRCO = IDIMGI + 1
+          IOK = WEXSVP(IDEVE,LCO,IRCO)
+          IF(IOK.NE.0) THEN
+            CALL AERROR(KBOUT,IDEVNF,NAME,8)
+            GO TO 10
+          END IF
+          IPG = IPG + 1
+        END IF
+        WRITE(IDEVNO,1005)NAME
+ 1005   FORMAT(A)
+C
+C
+      ELSE IF (JOB.EQ.2) THEN
+C
+C   Cosmid clipping
+C
+C   get vector file name, poorl (poor data left mark), poorr,
+C   svlclp (sequencing vector left clip position), svrclp
+C
+       IOK = REXCVF(IDEVE,FILNMV)
+       IF(IOK.NE.0) THEN
+         CALL AERROR(KBOUT,IDEVNF,NAME,4)
+         CALL ERROM(KBOUT,'Error reading vector file name')
+         CALL ERROM(KBOUT,FILNMV)
+         GO TO 10
+       END IF
+       IOK = REXPDP(IDEVE,POORL,POORR)
+       IF(IOK.NE.0) THEN
+C         CALL ERROM(KBOUT,
+C     +   'Error reading poor data positions, zero assumed')
+       END IF
+       IOK = REXSVP(IDEVE,SVLCLP,SVRCLP)
+       IF(IOK.NE.0) THEN
+C         CALL ERROM(KBOUT,
+C     +   'Error reading sequencing vector clip points, zero assumed')
+       END IF
+C
+C   if filnam != current
+C      then reorganise vector
+C
+        IF(FILNMV.NE.FILNAM) THEN
+          CALL OPENRS(IDEVV,FILNMV,IOK,LRECL,2)
+          IF(IOK.NE.0) THEN
+            CALL AERROR(KBOUT,IDEVNF,NAME,7)
+            CALL ERROM(KBOUT,'Error opening vector file')
+            CALL ERROM(KBOUT,FILNMV)
+            GO TO 10
+          END IF
+          FILNAM = FILNMV
+          IDIMV  = MAXSEQ
+          CALL ARRFIL(IDEVV,SEQV,IDIMV,KBOUT)
+          CLOSE(UNIT=IDEVV)
+C  check for contig header (should not be there)
+          IF(SEQV(20).EQ.'>') THEN
+            CALL SHFLCA(SEQV,MAXSEQ,21,1,IDIMV)
+            IDIMV = IDIMV - 20
+          END IF
+          WRITE(KBOUT,*)'Vector length =',IDIMV
+          CALL CONNUM(SEQV,SEQVI,IDIMV)
+          CALL ENCONC(SEQVI,IDIMV,POSN,WORDP,IDE,IDM,CONSTS,LENGTH,
+     +    LCONST)
+        END IF
+C
+C now hash the reading
+C
+C clip reading so only the good data is processed
+C and set poorl and poorr to endpoints
+          POORL = MAX(POORL,SVLCLP)
+          I = IDIMG + 1
+          IF(SVRCLP.GT.0) I = SVRCLP
+          IF(POORR.GT.0) I = MIN(I,POORR)
+          POORR = I
+C
+C        WRITE(*,*)'POORL,POORR',POORL,POORR
+C        WRITE(*,*)(SEQG(K),K=1,IDIMG)
+C        WRITE(*,*)'IDIMG',IDIMG
+          IF(POORL.GT.0) THEN
+            CALL SHFLCA(SEQG,MAXSEQ,POORL+1,1,POORR-1)
+          END IF
+          IDIMG = POORR - POORL - 1
+C        WRITE(*,*)'IDIMG',IDIMG
+C        WRITE(*,*)(SEQG(K),K=1,IDIMG)
+          CALL CONNUM(SEQG,SEQGI,IDIMG)
+          CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +    LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,0,0)
+          IF(LC.NE.0) THEN
+            IF(RC-LC+1.GE.IDIMG) THEN
+              ICB = ICB + 1
+              CALL AERROR(KBOUT,IDEVNF,NAME,9)
+            ELSE
+              IPG = IPG + 1
+              WRITE(IDEVNO,1005)NAME
+            END IF
+            LCO = LC + POORL
+            IRCO = RC + POORL
+            IOK = WEXCVP(IDEVE,LCO,IRCO)
+            IF(IOK.NE.0) THEN
+              CALL AERROR(KBOUT,IDEVNF,NAME,8)
+C              CALL ERROM(KBOUT,
+C     +        'Error writing cosmid vector positions')
+              GO TO 10
+            END IF
+            GO TO 10
+          END IF
+C
+C Try other strand
+C
+          CALL SQCOPY(SEQG,SEQC,IDIMG)
+          CALL SQREV(SEQC,IDIMG)
+          CALL SQCOM(SEQC,IDIMG)
+          CALL CONNUM(SEQC,SEQGI,IDIMG)
+          CALL VCUT(SEQVI,IDIMV,POSN,WORDP,IDE,SEQGI,IDIMG,CONSTS,
+     +    LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,0,0)
+          IF(LC.NE.0) THEN
+            LC1 = IDIMG - RC + 1
+            RC = IDIMG - LC + 1
+            LC = LC1
+            IF(RC-LC+1.GE.IDIMG) THEN
+              ICB = ICB + 1
+              CALL AERROR(KBOUT,IDEVNF,NAME,9)
+            ELSE
+              WRITE(IDEVNO,1005)NAME
+              IPG = IPG + 1
+            END IF
+            LCO = LC + POORL
+            IRCO = RC + POORL
+            IOK = WEXCVP(IDEVE,LCO,IRCO)
+            IF(IOK.NE.0) THEN
+              CALL AERROR(KBOUT,IDEVNF,NAME,8)
+C              CALL ERROM(KBOUT,
+C     +          'Error writing cosmid vector positions')
+              GO TO 10
+            END IF
+            GO TO 10
+          END IF
+          ICG = ICG + 1
+          WRITE(IDEVNO,1005)NAME
+          GO TO 10
+        ELSE
+C
+C   unknown job !!!!!!!!
+C
+        WRITE(*,*)'COCKUP'
+      END IF
+      GO TO 10
+      END
+      SUBROUTINE VCUT(SEQV,IDIMV,POSN,WORDP,IDE,SEQH,IDIMH,CONSTS,
+     +LENGTH,IDM,LCONST,HIST,MAXSEQ,KBOUT,CUT,LW,LC,RC,ICSITE,
+     +IPSITE)
+      INTEGER SEQV(IDIMV),SEQH(IDIMH)
+      INTEGER POSN(IDIMV),WORDP(IDE),CONSTS(0:LCONST)
+      INTEGER RC
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      EXTERNAL NCODEA
+C      CALL BUSY(KBOUT)
+      CALL FILLR(HIST(LENGTH-IDIMV),IDIMH+IDIMV+1,0.)
+      DO 20 I = 1,IDIMH-LENGTH+1
+        J = NCODEA(SEQH(I),LENGTH,CONSTS,IDM,LCONST)
+        IF(J.NE.0)THEN
+          J1 = WORDP(J)
+          IF(J1.NE.0)THEN
+            K = I - J1
+            HIST(K) = HIST(K) + 1.
+10          CONTINUE
+            J2 = J1
+            J1 = POSN(J2)
+            IF(J1.NE.0)THEN
+              K = I - J1
+              HIST(K) = HIST(K) + 1.
+              GO TO 10
+            END IF
+          END IF
+        END IF
+20    CONTINUE
+      CALL PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+      IF(ICSITE.NE.0) THEN
+C
+C  look for primer region
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LC,RC,IPSITE,1)
+        IF (LC.EQ.0) THEN
+C          WRITE(KBOUT,*)' ***** No primer site found ********'
+        ELSE
+C
+C  if primer found we want to know where it ends so set lc=rc
+C
+          LC = RC
+        END IF
+C
+C  look for running into vector at cloning site
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LCR,IRCR,IPSITE,2)
+C
+C set right cut to 1 past the end of the sequence
+C if some vector found we set rc to where it starts
+C 
+        RC = IDIMH + 1
+        IF(LCR.GT.0) THEN
+          RC = LCR
+        END IF
+C
+C so the outcomes are:
+C 1 nothing found either end   lc = 0, rc = idimg+1
+C 2 primer only found          lc = right end of primer, rc = idimg+1
+C 3 run into vector only       lc = 0  rc = start of vector
+C 4 primer found and vector run into lc = right end of primer, 
+C                                    rc = start of vector
+C
+C
+      ELSE
+C
+C  look for cosmid vector
+C
+        CALL FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     +  MAXSEQ,CUT,LW,LC,RC,IPSITE,3)
+        IF(LC.GT.0) THEN
+C          WRITE(KBOUT,*)
+C     +'>>>>>>>>>>>>>>>>>>>>>>>>diagonal found'
+         RETURN
+        END IF
+      END IF
+      END
+      SUBROUTINE PHIST(HIST,IDIMV,IDIMH,LENGTH,MAXSEQ)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      IF(IDIMV.GE.IDIMH) THEN
+        D  = LENGTH
+        DO 10 I=LENGTH-IDIMV,IDIMH-IDIMV-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 10       CONTINUE
+        D  = IDIMH
+        DO 20 I=IDIMH-IDIMV,0
+          HIST(I) = HIST(I)/D
+ 20       CONTINUE  
+        D  = IDIMH - 1
+        DO 30 I=1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 30       CONTINUE
+      ELSE
+        D  = LENGTH
+        DO 40 I=LENGTH-IDIMV,-1
+          HIST(I) = HIST(I)/D
+          D = D + 1
+ 40       CONTINUE
+        D  = IDIMV
+        DO 50 I=0,IDIMH-IDIMV
+          HIST(I) = HIST(I)/D
+ 50       CONTINUE
+        D  = IDIMV - 1
+        DO 60 I=IDIMH-IDIMV+1,IDIMH-LENGTH
+          HIST(I) = HIST(I)/D
+          D = D - 1
+ 60       CONTINUE
+      END IF
+      END
+      SUBROUTINE FCUT(HIST,IDIMV,IDIMH,LENGTH,
+     + MAXSEQ,CUT,LW,LC,RC,PSITE,JOB)
+      REAL HIST(-MAXSEQ:MAXSEQ)
+      INTEGER RC,PSITE
+      PARAMETER (MINVEC = 6)
+C MINVEC is minimum length of cloning vector to be reported
+C PSITE is primer site
+      LC = 0
+      RC = 0
+      DMAX = 0.
+C If job = 1 look for cloning site from psite to the end of the vector
+C if job = 2 look for vector in the rest of the sequence
+C If job = 3 look for cosmid vector in whole of sequence
+C We discard lc to rc inclusive, lc=0 means discard nothing
+      IF(JOB.EQ.1) THEN
+        I1 = LENGTH - IDIMV
+        I2 =  -ABS(PSITE)
+      ELSE IF(JOB.EQ.2) THEN
+        I1 = -ABS(PSITE)
+        I2 = IDIMH - LENGTH
+      ELSE IF(JOB.EQ.3) THEN
+        I1 = LENGTH - IDIMV + MINVEC
+        I2 = IDIMH - LENGTH - MINVEC
+      ELSE
+        WRITE(*,*)'Error in FCUT'
+        RETURN
+      END IF
+C      WRITE(*,*)'LOOKING AT ',I1,I2
+      DO 10 I=I1,I2
+        DT = HIST(I)
+        IF(DT.GT.DMAX) THEN
+          DMAX = DT
+          ID = I
+        END IF
+ 10     CONTINUE
+      D = 0.
+      DO 35 I=MAX(ID-LW/2,LENGTH-IDIMV),
+     +        MIN(ID+LW/2,IDIMH-LENGTH)
+        D = D + HIST(I)
+ 35   CONTINUE
+C      WRITE(*,*)'Best diagonal, score and local sum',ID,DMAX,D
+C      WRITE(*,1000)ID,DMAX,D
+ 1000 FORMAT(I6,2F10.3)
+      IF(D.LT.CUT) RETURN
+      IF(IDIMV.GE.IDIMH) THEN
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = IDIMH
+        ELSE
+          LC = 1
+          RC = MIN(IDIMH,IDIMV + ID)
+        END IF
+      ELSE
+        IF(ID.GE.0) THEN
+          LC = ID + 1
+          RC = MIN(ID+IDIMV,IDIMH)
+        ELSE
+          LC = 1
+          RC = ID + IDIMV
+        END IF
+      END IF
+      WRITE(*,*)' Discard ',LC, ' to ',RC
+      END
+      INTEGER FUNCTION GNFFOF(IDEV,NAME)
+      CHARACTER NAME*(*)
+      EXTERNAL NOTLR
+C
+C routine to read a file of file names and return a name
+C deals with leading spaces and trims names at first space
+C after name: eg '  fred is a bum' is returned as 'fred'
+C needed because file names can contain spaces (not our file names!)
+C and the open statement expects the names to match precisely
+C
+C return 0 = ok, 2 = empty line in file, 3 = error in read, 1 = end of file
+C
+      READ(IDEV,1000,ERR=100,END=200)NAME
+ 1000 FORMAT(A)
+C
+C get first non space position
+C
+      LENGTH = LEN(NAME)
+      I = NOTLR(NAME,LENGTH,' ')
+C empty line ?
+      IF(I.EQ.0) THEN
+        GNFFOF = 2
+        RETURN
+      END IF
+C now want first space after I
+      J = INDEX(NAME(I+1:),' ')
+      IF(J.EQ.0) THEN
+        J = LENGTH
+      ELSE
+        J = J + I - 1
+      END IF
+      CALL SHFTLS(NAME,I,1,J)
+      NAME(J-I+2:) = ' '
+      GNFFOF = 0
+      RETURN
+ 100  CONTINUE
+      GNFFOF = 3
+      RETURN
+ 200  CONTINUE
+      GNFFOF = 1
+      END
+      SUBROUTINE SHFTLS(STRING,FROMS,TO,FROME)
+      CHARACTER STRING*(*)
+      INTEGER FROMS,TO,FROME
+C
+C shift a string left from froms to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J:J) = STRING(I:I)
+        J = J + 1
+ 10   CONTINUE
+      END
+      SUBROUTINE SHFLCA(STRING,MAXAR,FROMS,TO,FROME)
+      CHARACTER STRING(MAXAR)
+      INTEGER FROMS,FROME,TO
+C
+C  shift left from from to to
+C
+      J = TO
+      DO 10 I=FROMS,FROME
+        STRING(J) = STRING(I)
+        J = J + 1
+ 10   CONTINUE
+      END
+      SUBROUTINE AERROR(IDEVS,IDEVF,NAME,IERR)
+      CHARACTER NAME*(*)
+C
+C handle errors for assembly
+C
+C errors are:
+C 0 file not found
+C 1 read too short
+C 2 failed to align and not entered
+C 3 failed on entry
+C 4 failed to align but entered
+      WRITE(IDEVF,1000)NAME(1:INDEX(NAME,' ')),IERR
+ 1000 FORMAT(A,I2)
+      CALL ERROM(IDEVS,'Failed reading written to error file')
+      END
diff --git a/staden.login b/staden.login
new file mode 100644
index 0000000..2e43183
--- /dev/null
+++ b/staden.login
@@ -0,0 +1,158 @@
+#
+# Setup file for Staden software running on a Sun
+# This setup for c-shell (csh) users
+#
+# This file should be source'd from your .login
+# assuming the environmental variable STADENROOT has been set up
+# to point to the root directory for the staden software
+#
+# e.g.
+# setenv STADENROOT /home/BioSW/staden
+# source $STADENROOT/staden.login
+#
+#
+#echo 'Setting up the Staden software environment...'
+
+set path = ($path $STADENROOT/bin)
+
+setenv STADTABL	$STADENROOT/tables
+setenv STADUSER	$STADENROOT/userdata
+setenv STADHELP	$STADENROOT/help
+
+# The default is 'vi' but this causes the X versions to hang
+setenv SEQEDT 'xterm -e vi'
+
+#
+# files for nip
+#
+setenv ECRIBS	$STADTABL/PERCPTRON.WTS
+setenv EUKRIBS	$STADTABL/RIBSEUKAR.WTS
+setenv INTRONS	$STADTABL/INTRON.WTS
+setenv ECPROMS	$STADTABL/PROKPROMS.WTS
+setenv RENZYM4	$STADTABL/RENZYM.4
+setenv RENZYM6	$STADTABL/RENZYM.6
+setenv RENZYMAL	$STADTABL/RENZYM.ALL
+setenv HELPNIP	$STADHELP/nip_help
+setenv NIPHELP	$STADHELP/NIP.HLP
+setenv NIPHPNT	$STADHELP/NIP.PNT         
+setenv NIPMARG	$STADTABL/NIP.MRG
+
+
+# files for nipf
+
+setenv NIPFMARG	$STADTABL/NIPF.MRG
+
+
+#
+# files for sip
+#
+setenv HELPSIP	$STADHELP/sip_help
+setenv SIPHELP	$STADHELP/SIP.HLP         
+setenv SIPHPNT	$STADHELP/SIP.PNT         
+setenv SIPMARG	$STADTABL/SIP.MRG       
+setenv PROTMAT  $STADTABL/PROTMAT.DAT   
+
+#
+# files for sap
+#
+setenv HELPSAP	$STADHELP/sap_help
+setenv SAPHELP	$STADHELP/SAP.HLP         
+setenv SAPHPNT	$STADHELP/SAP.PNT
+setenv SAPMARG	$STADTABL/SAP.MRG
+
+#
+# files for bap
+#
+setenv HELPBAP	$STADHELP/bap_help
+setenv BAPHELP	$STADHELP/BAP.HLP         
+setenv BAPHPNT	$STADHELP/BAP.PNT
+setenv BAPMARG	$STADTABL/BAP.MRG
+
+#
+# files for dap
+#
+setenv HELPDAP	$STADHELP/dap_help
+setenv DAPHELP	$STADHELP/DAP.HLP         
+setenv DAPHPNT	$STADHELP/DAP.PNT
+setenv DAPMARG	$STADTABL/DAP.MRG
+setenv TAGDB    $STADTABL/TAGDB
+
+#
+# files for pip
+#
+setenv PIPMARG	$STADTABL/PIP.MRG       
+setenv PROTGRP	$STADTABL/PROTEIN.GRP   
+setenv PROTALL	$STADTABL/PROTEIN.ALL   
+setenv ROBSON	$STADTABL/ROBSON.WTS    
+setenv HELPPIP	$STADHELP/pip_help
+setenv PIPHELP	$STADHELP/PIP.HLP         
+setenv PIPHPNT	$STADHELP/PIP.PNT
+
+#
+# files for mep
+#
+setenv MEPMARG $STADTABL/MEP.MRG
+setenv HELPMEP $STADHELP/mep_help
+setenv MEPHELP $STADHELP/MEP.HLP
+setenv MEPHPNT $STADHELP/MEP.PNT
+
+#
+# files for rep
+#
+setenv ALUNAMES $STADTABL/alus/files
+
+
+
+#
+# miscellaneous files
+#
+setenv GIPMEM		$STADHELP/gip_help
+setenv HELPSTADEN	$STADHELP/staden_help
+#
+# Typing "staden" will list all the program names
+#      
+alias staden 'cat $STADHELP/stadenp_help'
+#
+# vector sequences
+#
+setenv M13MP18_VECTOR   $STADUSER/m13mp18.vec
+setenv PUC18_VECTOR     $STADUSER/puc18.vec
+setenv BLUE_VECTOR      $STADUSER/blue.vec
+
+setenv PROSITE           /pubseq/pubseq/seqlibs/prosite
+setenv PROSITENAMES      /pubseq/pubseq/seqlibs/prosite/prosite.nam
+
+
+
+
+#
+# Help X11 find where things are
+#
+if ( $?XFILESEARCHPATH ) then
+    setenv XFILESEARCHPATH ${STADHELP}/%N%S:${STADTABL}/%N%S:${XFILESEARCHPATH}
+else
+    setenv XFILESEARCHPATH ${STADHELP}/%N%S:${STADTABL}/%N%S:/usr/lib/X11/%T/%N%S
+endif
+
+#
+# Find manual pages
+#
+if ( $?MANPATH ) then
+    setenv MANPATH ${MANPATH}:${STADENROOT}
+else
+    setenv MANPATH /usr/man:${STADENROOT}
+endif
+
+#
+# For gip, specify default digitizer port
+#
+setenv DIGITIZER /dev/ttyb
+
+
+#
+# Sequence databases
+#
+
+source $STADTABL/libraries.config.csh
+
+
diff --git a/staden.profile b/staden.profile
new file mode 100644
index 0000000..5800e35
--- /dev/null
+++ b/staden.profile
@@ -0,0 +1,144 @@
+#
+# Setup file for Staden software running on a Sun
+# This setup for Bourne shell (sh) users
+#
+# This file should be source'd from your .profile
+# assuming the environmental variable STADENROOT has been set up
+# to point to the root directory for the staden software
+#
+# e.g.
+# STADENROOT=/home/BioSW/staden; export STADENROOT
+# . $STADENROOT/staden.setup
+#
+#
+#echo 'Setting up the Staden software environment...'
+
+PATH=$PATH:$STADENROOT/bin;	export PATH
+
+STADTABL=$STADENROOT/tables;	export STADTABL
+STADUSER=$STADENROOT/userdata;	export STADUSER
+STADHELP=$STADENROOT/help;	export STADHELP
+
+
+#
+# files for nip
+#
+ECRIBS=$STADTABL/PERCPTRON.WTS;	export ECRIBS
+EUKRIBS=$STADTABL/RIBSEUKAR.WTS;export EUKRIBS
+INTRONS=$STADTABL/INTRON.WTS;	export INTRONS
+ECPROMS=$STADTABL/PROKPROMS.WTS;export ECPROMS
+RENZYM4=$STADTABL/RENZYM.4;	export RENZYM4
+RENZYM6=$STADTABL/RENZYM.6;	export RENZYM6
+RENZYMAL=$STADTABL/RENZYM.ALL;	export RENZYMAL
+HELPNIP=$STADHELP/nip_help;	export HELPNIP
+NIPHELP=$STADHELP/NIP.HLP;	export NIPHELP
+NIPHPNT=$STADHELP/NIP.PNT;	export NIPHPNT
+NIPMARG=$STADTABL/NIP.MRG;	export NIPMARG
+
+#
+# files for sip
+#
+HELPSIP=$STADHELP/sip_help;	export HELPSIP
+SIPHELP=$STADHELP/SIP.HLP;	export SIPHELP
+SIPHPNT=$STADHELP/SIP.PNT;	export SIPHPNT
+SIPMARG=$STADTABL/SIP.MRG;	export SIPMARG
+PROTMAT=$STADTABL/PROTMAT.DAT;	export PROTMAT
+
+#
+# files for sap
+#
+HELPSAP=$STADHELP/sap_help;	export HELPSAP
+SAPHELP=$STADHELP/SAP.HLP;	export SAPHELP
+SAPHPNT=$STADHELP/SAP.PNT;	export SAPHPNT
+SAPMARG=$STADTABL/SAP.MRG;	export SAPMARG
+
+#
+# files for bap
+#
+HELPBAP=$STADHELP/bap_help;	export HELPBAP
+BAPHELP=$STADHELP/BAP.HLP;	export BAPHELP
+BAPHPNT=$STADHELP/BAP.PNT;	export BAPHPNT
+BAPMARG=$STADTABL/BAP.MRG;	export BAPMARG
+
+#
+# files for dap
+#
+HELPDAP=$STADHELP/dap_help;	export HELPDAP
+DAPHELP=$STADHELP/DAP.HLP;	export DAPHELP
+DAPHPNT=$STADHELP/DAP.PNT;	export DAPHPNT
+DAPMARG=$STADTABL/DAP.MRG;	export DAPMARG
+TAGDB=$STADTABL/TAGDB;          export TAGDB
+
+#
+# files for pip
+#
+PIPMARG=$STADTABL/PIP.MRG;	export PIPMARG
+PROTGRP=$STADTABL/PROTEIN.GRP;	export PROTGRP
+PROTALL=$STADTABL/PROTEIN.ALL;	export PROTALL
+ROBSON=$STADTABL/ROBSON.WTS;	export ROBSON
+HELPPIP=$STADHELP/pip_help;	export HELPPIP
+PIPHELP=$STADHELP/PIP.HLP;	export PIPHELP
+PIPHPNT=$STADHELP/PIP.PNT;	export PIPHPNT
+
+#
+# files for mep
+#
+MEPMARG=$STADTABL/MEP.MRG;	export MEPMARG
+HELPMEP=$STADHELP/mep_help;	export HELPMEP
+MEPHELP=$STADHELP/MEP.HLP;	export MEPHELP
+MEPHPNT=$STADHELP/MEP.PNT;	export MEPHPNT
+
+#
+# files for rep
+#
+ALUNAMES=$STADTABL/alus/files;	export ALUNAMES
+
+
+#
+# miscellaneous files
+#
+GIPMEM=$STADHELP/gip_help;	export GIPMEM
+HELPSTADEN=$STADHELP/staden_help;export HELPSTADEN
+
+#
+# vector sequences
+#
+M13MP18_VECTOR=$STADUSER/m13mp18.vec; export M13MP18_VECTOR
+PUC18_VECTOR=$STADUSER/puc18.vec; export PUC18_VECTOR
+BLUE_VECTOR=$STADUSER/blue.vec; export BLUE_VECTOR
+
+
+
+#
+# Sequence databases (currently embl cdrom format only)
+#
+
+# The file SEQUENCELIBRARIES lists the available libraries
+# and the *DESCRP files contain the names of each libraries files
+# 
+SEQUENCELIBRARIES=$STADTABL/SEQUENCELIBRARIES; export SEQUENCELIBRARIES
+EMBLLIBDESCRP=$STADTABL/EMBLLIBDESCRP; export EMBLLIBDESCRP
+SWISSLIBDESCRP=$STADTABL/SWISSLIBDESCRP; export SWISSLIBDESCRP
+
+
+#
+# Help X11 find where things are
+#
+XFILESEARCHPATH=$STADHELP/%N%S:$STADTABL/%N%S:${XFILESEARCHPATH-/usr/lib/X11/%T/%N%S}
+export XFILESEARCHPATH
+
+#
+# Find manual pages
+#
+MANPATH=${MANPATH-/usr/man}:$STADENROOT; export MANPATH
+
+#
+# For gip, specify default digitizer port
+#
+DIGITIZER=/dev/ttyb; export DIGITIZER
+
+#
+# Sequence databases
+#
+
+. $STADTABL/libraries.config.sh
diff --git a/tables/ATPASE.DAT b/tables/ATPASE.DAT
new file mode 100644
index 0000000..f3bc0c3
--- /dev/null
+++ b/tables/ATPASE.DAT
@@ -0,0 +1,176 @@
+ID   ATPase
+XX
+DT   10-NOV-1982  FIRST ENTRY
+XX
+DE   
+XX
+KW   
+XX
+FH   KEY        FROM     TO       DESCRIPTION
+FH
+FT   CDS         576    968       1   0 391
+FT   CDS         977   1792       a   1 814
+FT   CDS        1839   2081       c  6-12 242
+FT   CDS        2140   2610       b   2 439
+FT   CDS        2625   3158       DELTA   1 532
+FT   CDS        3171   4712       ALPHA   3 1540
+FT   CDS        4763   5626       GAMMA   1 864
+FT   CDS        5653   7035       BETA    3 1381
+FT   CDS        7056   7400       EPSILON 1 343
+FT   CDS3        576    968       1   0 391
+FT   CDS2        977   1792       a   1 814
+FT   CDS3       1839   2081       c  6-12 242
+FT   CDS1       2140   2610       b   2 439
+FT   CDS3       2625   3158       DELTA   1 532
+FT   CDS3       3171   4712       ALPHA   3 1540
+FT   CDS2       4763   5626       GAMMA   1 864
+FT   CDS1       5653   7035       BETA    3 1381
+FT   CDS3       7056   7400       EPSILON 1 343
+FT   STOK        576    968       1   0 391
+FT   STOK        977   1792       1   1 814
+FT   STOK       1839   2081       6  6-12 242
+FT   STOK       2140   2610       2   2 439
+FT   STOK       2625   3158       1ELTA   1 532
+FT   STOK       3171   4712       3LPHA   3 1540
+FT   STOK       4763   5626       1AMMA   1 864
+FT   STOK       5653   7035       3ETA    3 1381
+FT   STOK       7056   7400       1PSILON 1 343
+XX
+SQ
+     CCGGTTAGAC TGTTAACAAC AACCAGGTTT TCTACTGATA TAACTGGTTA CATTTAACGC
+     CACGTTCACT CTTTTGCATC AACAAGATAA CGTGGCTTTT TTTGGTAAGC AGAAAATAAG
+     TCATTAGTGA AAATATCAGT CTGCTAAAAA TCGGCGCTAA GAACCATCAT TGGCTGTTAA
+     AACAGTTATT AAAAATGTCA ATGGGTGGTT TTTGTTGTGT AAATGTCATT TATTAAACAG
+     GTATCTGTTT TTAGACTGAA ATATCATAAA CTTGCAAGGC ATCATTTGCC AAGTAAATAA
+     ATATGCTGTG CGCGAACATG CGCAATATGT GATCTGAAGC ACGCTTTATC ACCAGTGTTT
+     ACGCGTTATT TACAGTTTTT CATGATCGAA CAGGGTTAGC AGAAAAGTCG CAATTGTATG
+     CACTGGAAAA ATATTTAAAC ATTTATTCAC CTTTTGGCTA CTTATTGTTT GAAATCACGG
+     GGGCGCACCG TATAATTTGA CCGCTTTTTG ATGCTTGACT CTAAGCCTTA AAGAAAGTTT
+     TATACGACAC GCGGCATACC TCGAAGGGAG CAGGAGTGAA AAACGTGATG TCTGTGTCGC
+     TCGTGAGTCG AAACGTTGCT CGGAAGCTTC TGCTCGTTCA GTTACTGGTG GTGATAGCAA
+     GTGGATTGCT GTTCAGCCTC AAAGACCCCT TCTGGGGCGT CTCTGCAATA AGCGGGGGCC
+     TGGCAGTCTT TCTGCCTAAC GTTTTGTTTA TGATATTTGC CTGGCGTCAC CAGGCGCATA
+     CACCAGCGAA AGGCCGGGTG GCCTGGACAT TCGCATTTGG CGAAGCTTTC AAAGTTCTGG
+     CGATGTTGGT GTTACTGGTG GTGGCGTTGG CGGTTTTAAA GGCGGTATTC TTGCCGCTGA
+     TCGTTACGTG GGTTTTGGTG CTGGTGGTTC AGATACTGGC ACCGGCTGTA ATTAACAACA
+     AAGGGTAAAA GGCATCATGG CTTCAGAAAA TATGACGCCG CAGGATTACA TAGGACACCA
+     CCTGAATAAC CTTCAGCTGG ACCTGCGTAC ATTCTCGCTG GTGGATCCAC AAAACCCCCC
+     AGCCACCTTC TGGACAATCA ATATTGACTC CATGTTCTTC TCGGTGGTGC TGGGTCTGTT
+     GTTCCTGGTT TTATTCCGTA GCGTAGCCAA AAAGGCGACC AGCGGTGTGC CAGGTAAGTT
+     TCAGACCGCG ATTGAGCTGG TGATCGGCTT TGTTAATGGT AGCGTGAAAG ACATGTACCA
+     TGGCAAAAGC AAGCTGATTG CTCCGCTGGC CCTGACGATC TTCGTCTGGG TATTCCTGAT
+     GAACCTGATG GATTTACTGC CTATCGACCT GCTGCCGTAC ATTGCTGAAC ATGTACTGGG
+     TCTGCCTGCA CTGCGTGTGG TTCCGTCTGC GGACGTGAAC GTAACGCTGT CTATGGCACT
+     GGGCGTATTT ATCCTGATTC TGTTCTACAG CATCAAAATG AAAGGCATCG GCGGCTTCAC
+     GAAAGAGTTG ACGCTGCAGC CGTTCAATCA CTGGGCGTTC ATTCCTGTCA ACTTAATCCT
+     TGAAGGGGTA AGCCTGCTGT CCAAACCAGT TTCACTCGGT TTGCGACTGT TCGGTAACAT
+     GTATGCCGGT GAGCTGATTT TCATTCTGAT TGCTGGTCTG TTGCCGTGGT GGTCACAGTG
+     GATCCTGAAT GTGCCGTGGG CCATTTTCCA CATCCTGATC ATTACGCTGC AAGCCTTCAT
+     CTTCATGGTT CTGACGATCG TCTATCTGTC GATGGCGTCT GAAGAACATT AATTTACCAA
+     CACTACTACG TTTTAACTGA AACAAACTGG AGACTGTCAT GGAAAACCTG AATATGGATC
+     TGCTGTACAT GGCTGCCGCT GTGATGATGG GTCTGGCGGC AATCGGTGCT GCGATCGGTA
+     TCGGCATCCT CGGGGGTAAA TTCCTGGAAG GCGCAGCGCG TCAACCTGAT CTGATTCCTC
+     TGCTGCGTAC TCAGTTCTTT ATCGTTATGG GTCTGGTGGA TGCTATCCCG ATGATCGCTG
+     TAGGTCTGGG TCTGTACGTG ATGTTCGCTG TCGCGTAGTA AGCGTTGCTT TTATTTAAAG
+     AGCAATATCA GAACGTTAAC TAAATAGAGG CATTGTGCTG TGAATCTTAA CGCAACAATC
+     CTCGGCCAGG CCATCGCGTT TGTCCTGTTC GTTCTGTTCT GCATGAAGTA CGTATGGCCG
+     CCATTAATGG CAGCCATCGA AAAACGTCAA AAAGAAATTG CTGACGGCCT TGCTTCCGCA
+     GAACGAGCAC ATAAGGACCT TGACCTTGCA AAGGCCAGCG CGACCGACCA GCTGAAAAAA
+     GCGAAAGCGG AAGCCCAGGT AATCATCGAG CAGGCGAACA AACGCCGCTC GCAGATTCTG
+     GACGAAGCGA AAGCTGAGGC AGAACAGGAA CGTACTAAAA TCGTGGCCCA GGCGCAGGCG
+     GAAATTGAAG CCGAGCGTAA ACGTGCCCGT GAAGAGCTGC GTAAGCAAGT TGCTATCCTG
+     GCTGTTGCTG GCGCCGAGAA GATCATCGAA CGTTCCGTGG ATGAAGCTGC TAACAGCGAC
+     ATCGTGGATA AACTTGTCGC TGAACTGTAA GGAGGGAGGG GCTGATGTCT GAATTTATTA
+     CGGTAGCTCG CCCCTACGCC AAAGCAGCTT TTGACTTTGC CGTCGAACAC CAAAGTGTAG
+     AACGCTGGCA GGACATGCTG GCGTTTGCCG CCGAGGTAAC CAAAAACGAA CAAATGGCAG
+     AGCTTCTCTC TGGCGCGCTT GCGCCAGAAA CGCTCGCCGA GTCGTTTATC GCAGTTTGTG
+     GTGAGCAACT GGACGAAAAC GGTCAGAACC TGATTCGGGT TATGGCTGAA AATGGTCGTC
+     TTAACGCGCT CCCGGATGTT CTGGAGCAGT TTATTCACCT GCGTGCCGTG AGTGAGGCTA
+     CCGCTGAGGT AGACGTCATT TCCGCTGCCG CACTGAGTGA ACAACAGCTC GCGAAAATTT
+     CTGCTGCGAT GGAAAAACGT CTGTCACGCA AAGTTAAGCT GAATTGCAAA ATCGATAAGT
+     CTGTAATGGC AGGCGTTATC ATCCGAGCGG GTGATATGGT CATTGATGGC AGCGTACGCG
+     GTCGTCTTGA GCGCCTTGCA GACGTCTTGC AGTCTTAAGG GGACTGGAGC ATGCAACTGA
+     ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT GTTGTGAGTG
+     AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC ATTCACGGCC
+     TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC GCTATCGCAC
+     TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT GACCTTGCCG
+     AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC CGTGGCCTGC
+     TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG CTGGATCACG
+     ACGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG TCCGTAGATC
+     AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT CGTGGTCAGC
+     GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC GATGCCATCA
+     TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG AAAGCGTCCA
+     CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC ACCATCGTTG
+     TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACGTATG CCGGTTGCGC
+     TAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC GATGACCTGT
+     CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG CCAGGACGTG
+     AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT GCTGCACGTG
+     TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA ACCGGTTCTC
+     TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC GTTCCGACCA
+     ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC AACGCCGGTA
+     TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA GCACAGACCA
+     AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT CGTGAACTGG
+     CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG CTTGACCACG
+     GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC GTTGCGCAGC
+     AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA CTGTCGAAAA
+     TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT CCGTTGATGC
+     AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAGCTG AAAGGCATCC
+     TCGATTCCTT CAAAGCAACC CAATCCTGGT AACGTCTGGC GGCTTGCCTT AGGGCAGGCC
+     GCAAGGCATT GAGGAGAAGC TCATGGCCGG CGCAAAAGAG ATACGTAGTA AGATCGCAAG
+     CGTCCAGAAC ACGCAAAAGA TCACTAAAGC GATGGAGATG GTCGCCGCTT CCAAAATGCG
+     TAAATCGCAG GATCGCATGG CGGCCAGCCG TCCTTATGCA GAAACCATGC GCAAAGTGAT
+     TGGTCACCTT GCACACGGTA ATCTGGAATA TAAGCACCCT TACCTGGAAG ACCGCGACGT
+     TAAACGCGTG GGCTACCTGG TGGTGTCGAC CGACCGTGGT TTGTGCGGTG GTTTGAACAT
+     TAACCTGTTC AAAAAACTGC TGGCGGAAAT GAAGACCTGG ACCGACAAAG GCGTTCAATG
+     CGACCTCGCA ATGATCGGCT CGAAAGGCGT GTCGTTCTTC AACTCCGTGG GCGGCAATGT
+     TGTTGCCCAG GTCACCGGCA TGGGGGATAA CCCTTCCCTG TCCGAACTGA TCGGTCCGGT
+     AAAAGTGATG TTGCAGGCCT ACGACGAAGG CCGTCTGGAC AAGCTTTACA TTGTCAGCAA
+     CAAATTTATT AACACCATGT CTCAGGTTCC GACCATCAGC CAGCTGCTGC CGTTACCGGC
+     ATCAGATGAT GATGATCTGA AACATAAATC CTGGGATTAC CTGTACGAAC CCGATCCGAA
+     GGCGTTGCTG GATACCCTGC TGCGTCGTTA TGTCGAATCT CAGGTTTATC AGGGCGTGGT
+     TGAAAACCTG GCCAGCGAGC AGGCCGCCCG TATGGTGGCG ATGAAAGCCG CGACCGACAA
+     TGGCGGCAGC CTGATTAAAG AGCTGCAGTT GGTATACAAC AAAGCTCGTC AGGCCAGCAT
+     TACTCAGGAA CTCACCGAGA TCGTCTCGGG GGCCGCCGCG GTTTAAACAG GTTATTTCGT
+     AGAGGATTTA AGATGGCTAC TGGAAAGATT GTCCAGGTAA TCGGCGCCGT AGTTGACGTC
+     GAATTCCCTC AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTGCA AAATGGTAAT
+     GAGCGTCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCA
+     ATGGGTTCCT CCGACGGTCT GCGTCGCGGT CTGGATGTAA AAGACCTCGA ACACCCGATT
+     GAAGTCCCGG TAGGTAAAGC GACTCTGGGC CGTATCATGA ACGTACTGGG TGAACCGGTC
+     GACATGAAAG GCGAGATCGG TGAAGAAGAG CGTTGGGCGA TTCACCGCGC AGCACCTTCC
+     TACGAAGAGC TGTCAAACTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG
+     ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA
+     ACCGTAAACA TGATGGAGCT CATTCGTAAC ATCGCGATCG AGCACTCCGG TTACTCTGTG
+     TTTGCGGGCG TAGGTGAACG TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC
+     TCCAACGTTA TCGACAAAGT ATCCCTGGTG TATGGCCAGA TGAACGAGCC GCCGGGAAAC
+     CGTCTGCGCG TTGCTCTGAC CGGTCTGACC ATGGCTGAGA AATTCCGTGA CGAAGGTCGT
+     GACGTTCTGC TGTTCGTTGA CAACATCTAT CGTTACACCC TGGCCGGTAC GGAAGTATCC
+     GCACTGCTGG GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG
+     GGCGTTCTGC AGGAACGTAT CACCTCCACC AAAACTGGTT CTATCACCTC CGTACAGGCA
+     GTATACGTAC CTGCGGATGA CTTGACTGAC CCGTCTCCGG CAACCACCTT TGCGCACCTT
+     GACGCAACCG TGGTACTGAG CCGTCAGATC GCGTCTCTGG GTATCTACCC GGCCGTTGAC
+     CCGCTGGACT CCACCAGCCG TCAGCTGGAC CCGCTGGTGG TTGGTCAGGA ACACTACGAC
+     ACCGCGCGTG GCGTTCAGTC CATCCTGCAA CGTTATCAGG AACTGAAAGA CATCATCGCC
+     ATCCTGGGTA TGGATGAACT GTCTGAAGAA GACAAACTGG TGGTAGCGCG TGCTCGTAAG
+     ATCCAGCGCT TCCTGTCCCA GCCGTTCTTC GTGGCAGAAG TATTCACCGG TTCTCCGGGT
+     AAATACGTCT CCCTGAAAGA CACCATCCGT GGCTTTAAAG GCATCATGGA AGGCGAATAC
+     GATCACCTGC CGGAGCAGGC GTTCTACATG GTCGGTTCCA TCGAAGAAGC TGTGGAAAAA
+     GCCAAAAAAC TTTAACGCCT TAATCGGAGG GTGATATGGC AATGACTTAC CACCTGGACG
+     TCGTCAGCGC AGAGCAACAA ATGTTCTCTG GTCTGGTCGA GAAAATCCAG GTAACGGGTA
+     GCGAAGGTGA ACTGGGGATC TACCCTGGCC ACGCACCGCT GCTCACCGCC ATTAAGCCTG
+     GTATGATTCG CATCGTGAAA CAGCACGGTC ACGAAGAGTT TATCTATCTG TCTGGCGGCA
+     TTCTTGAAGT GCAGCCTGGC AACGTGACCG TTCTGGCCGA CACCGCAATT CGCGGCCAGG
+     ATCTCGACGA AGCGCGAGCC ATGGAAGCGA AACGTAAGGC TGAAGAGCAC ATTAGCAGCT
+     CTCACGGCGA CGTAGATTAC GCTCAGGCGT CTGCGGAACT GGCCAAAGCG ATCGCGCAGC
+     TGCGGTTATC GAGTTGACCA AAAAAGCGAT GTAACACCGG CTTGAAAAGC ACAAAAGCCA
+     GTCTGGAAAC AGGCTGGCTT TTTTTTGCGC GTGTGACCCG TCCTGAATAG CGTTCACATA
+     GATCCTGCTG ATATAAAACC CCCCTGTTTT CCTGTTTATT CATTGATCGA AATAAGAGCA
+     AAAACATCCA CCTGACGCTT AAATTAAGGT ACTGCCTTAA TTTTCTGCAG ACAAAAGGCG
+     TGACGATGGT CGAAAATGGC GCTTTCGTCA GCGGGGATAA TCCGTTATTG AACAATTTAT
+     CCTCTGTCCA TTTCACGATG AAAAAAATGT AGTTTTTTCA AGGTGAAGCG GTTTAAATTC
+     GTTCTCAAAT TACAGTCAGG ACGCGTATGT TGAATAATGC TATGAGCGTA GTGATCCTTG
+     CCGCACGAAG GCACGCGCAT GTATTCCGAT CTTCCGAAAG TGCTGCATAC CCTGCCGGGA
+     AAGCGATGGT TCAGCATGTC ATTGATGCTG CGAATGAATT AGGCGCACGG CACGTTCACC
+     TGGTGTACGG TCACGGCGGC GATCTGCTAA AACAGGCGCT GAAAGACGAC AACCTTAACT
+     GGGTGCTTCA GGCAGAGCAG CTGGGTACGG GTCATGCAAT GCAGCAGGCC GCACCTTTCT
+     TTGCCGATGA TGAAGACATT TTAATGCTCT ACGC
+//
diff --git a/tables/BAP.MRG b/tables/BAP.MRG
new file mode 100644
index 0000000..aa2ec1a
--- /dev/null
+++ b/tables/BAP.MRG
@@ -0,0 +1,11 @@
+PLOT POSITIONS FOR SAP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12  5000  5000
+    13     1   100
+    15     1 10000
+    16    20    20
+    29  4000  1000
+    33  5000  4000
+    34  9000  1000
+    39  2000  2000
diff --git a/tables/BAP.MRG~ b/tables/BAP.MRG~
new file mode 100644
index 0000000..b9e90fd
--- /dev/null
+++ b/tables/BAP.MRG~
@@ -0,0 +1,10 @@
+PLOT POSITIONS FOR SAP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12  5000  5000
+    13     1   100
+    15     1 10000
+    16    20    20
+    29  4000  1000
+    33  5000  4000
+    34  9000  1000
diff --git a/tables/DAP.MRG b/tables/DAP.MRG
new file mode 100644
index 0000000..b9e90fd
--- /dev/null
+++ b/tables/DAP.MRG
@@ -0,0 +1,10 @@
+PLOT POSITIONS FOR SAP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12  5000  5000
+    13     1   100
+    15     1 10000
+    16    20    20
+    29  4000  1000
+    33  5000  4000
+    34  9000  1000
diff --git a/tables/INTRON.WTS b/tables/INTRON.WTS
new file mode 100644
index 0000000..0c72d66
--- /dev/null
+++ b/tables/INTRON.WTS
@@ -0,0 +1,16 @@
+ Mount donors redone 16-4-91                                 
+     12     3   -16.085    -7.500
+ P  -2  -1   0   1   2   3   4   5   6   7   8   9
+ N 136 136 136 136 136 136 136 136 136 136 136 136
+ T  28   8  15  17   0 136   9  16   7  84  30  36
+ C  41  60  16   7   0   0   3  13   3  17  28  39
+ A  40  56  89  12   0   0  83  91  12  23  53  33
+ G  27  12  16 100 136   0  41  16 114  12  25  28
+ Mount acceptors redone 16-4-91                              
+     18    15   -26.142   -14.400
+ P -14 -13 -12 -11 -10  -9  -8  -7  -6  -5  -4  -3  -2  -1   0   1   2   3
+ N 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113 113
+ T  58  50  57  59  67  56  58  49  47  66  64  31  34   0   0  11  41  31
+ C  21  28  34  25  29  33  35  32  42  40  33  25  74   0   0  23  28  41
+ A  17  11  11  18   7  17  12  23  15   3  10  29   5 113   0  24  21  21
+ G  17  24  11  11  10   7   8   9   9   4   6  28   0   0 113  55  23  20
diff --git a/tables/MEP.MRG b/tables/MEP.MRG
new file mode 100644
index 0000000..98956a7
--- /dev/null
+++ b/tables/MEP.MRG
@@ -0,0 +1,9 @@
+PLOT POSITIONS FOR MEP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12    20  9480
+    13     0 10000
+    15    20  9480
+    16    20  9480
+    17   500  9400
+    33   500  9400
diff --git a/tables/NIP.MRG b/tables/NIP.MRG
new file mode 100644
index 0000000..1ae0315
--- /dev/null
+++ b/tables/NIP.MRG
@@ -0,0 +1,48 @@
+PLOT POSITIONS FOR ANALYSEQ. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12    20  9480
+    13     0 10000
+    15    20  9480
+    16    20  9480
+    17   500  9400
+    18   500   500
+    19  1000   500
+    20  1500   500
+    24  1500   500
+    25  2000   500
+    26  2500   500
+    27  3000   500
+    28  3500   500
+    29  4000   500
+    30   500   500
+    31  1000  2000
+    32  2000  2000
+    33  2000   500
+    34  2500   500
+    35  3000   500
+    36  3500   500
+    42  4000  5500
+    43  4000  5500
+    44  3000  1000
+    45  4000  5500
+    46  4000  5500
+    47  4000  5500
+    48  2000  1000
+    49  1000   500
+    50  4000  5500
+    51  4000  5500
+    52  4000  5500
+    53  4000  5500
+    55  1000  1500
+    56  6000   750
+    57  4500  1500
+    58  4000  5500
+    59  1000   500
+    60  1000   500
+    61  4000  5500
+    62  2000  1000
+    63  2000  1000
+    64  2000  1000
+    65   500   500
+    67  1000  1000
diff --git a/tables/NIPF.MRG b/tables/NIPF.MRG
new file mode 100644
index 0000000..c968441
--- /dev/null
+++ b/tables/NIPF.MRG
@@ -0,0 +1,22 @@
+PLOT POSITIONS FOR nipf. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12   500  9500
+    13     0 10000
+    15    20  9480
+    16    20  9480
+    17   500  9400
+    18   500   500
+    19  1000   500
+    20  1500   500
+    24  1500   500
+    25  2000  1000
+    26  3000  1000
+    27  4000  1000
+    28  3500   500
+    29  4000   500
+    30   500   500
+    31  1000  1000
+    32  2000  2000
+    33  2000   500
+    34  2000   500
diff --git a/tables/NIPMARG b/tables/NIPMARG
new file mode 100644
index 0000000..1ae0315
--- /dev/null
+++ b/tables/NIPMARG
@@ -0,0 +1,48 @@
+PLOT POSITIONS FOR ANALYSEQ. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12    20  9480
+    13     0 10000
+    15    20  9480
+    16    20  9480
+    17   500  9400
+    18   500   500
+    19  1000   500
+    20  1500   500
+    24  1500   500
+    25  2000   500
+    26  2500   500
+    27  3000   500
+    28  3500   500
+    29  4000   500
+    30   500   500
+    31  1000  2000
+    32  2000  2000
+    33  2000   500
+    34  2500   500
+    35  3000   500
+    36  3500   500
+    42  4000  5500
+    43  4000  5500
+    44  3000  1000
+    45  4000  5500
+    46  4000  5500
+    47  4000  5500
+    48  2000  1000
+    49  1000   500
+    50  4000  5500
+    51  4000  5500
+    52  4000  5500
+    53  4000  5500
+    55  1000  1500
+    56  6000   750
+    57  4500  1500
+    58  4000  5500
+    59  1000   500
+    60  1000   500
+    61  4000  5500
+    62  2000  1000
+    63  2000  1000
+    64  2000  1000
+    65   500   500
+    67  1000  1000
diff --git a/tables/PERCPTRON.WTS b/tables/PERCPTRON.WTS
new file mode 100644
index 0000000..fd0d556
--- /dev/null
+++ b/tables/PERCPTRON.WTS
@@ -0,0 +1,56 @@
+
+ P-60-59-58-57-56-55-54-53-52-51-50-49-48-47-46-45-44-43-42-41-40-39-38-37-36
+ T  5  1 -3  9-14  7 15 -5  3-16-17  4 18  5 -3 -1  2  4  5 -5  7  8 -5-15  6
+ C-21 -6-11-21  0  8 -7-12 -1  1  0-19 12 -3 -1 10  2 -8 -5-11  8  1 23  6 -5
+ A  7 -2 13 -2 -8-13-18  5  0 -5 13  8-15  9 -4 -7  9  0 -8-11-10 -6 -7 -5 -6
+ G -6 -9 -7  0  8-16 -4 -2-16  1 -4  8-14  5 11-13-24  3  7 22-11 -9-15 10 -4
+
+ P-35-34-33-32-31-30-29-28-27-26-25-24-23-22-21-20-19-18-17-16-15-14-13-12-11
+ T  3  4 16 -4  7 11 -4 -1 12  8 10 -1  1  8  2-10-16 11  1 -3 16 -3-36 -8-27
+ C  2-14 -3 -8-10-21  2  0 -2 -1-11 -3 -1  5-11 -4  7  0-14  6 -8-20 -7-36-44
+ A-12 -1-27 -3 -6  0-12 -3 -4 -7 14 -2 -4 -6  0 12  5 -9  0-11-11 10  8  2  8
+ G  4 -5 -6 -3 -1 -4 -1 -4-15  0-14  3 10-19 -3-10 -7 -7  7  1 -8 -6 15 21 42
+
+ P-10 -9 -8 -7 -6 -5 -4 -3 -2 -1  0  1  2  3  4  5  6  7  8  9 10 11 12 13 14
+ T-53-27-26-23  2 -7-14-40-28  0-53 75-62-20-40-10-35 -5-12 -1  4 14-23  7 -2
+ C-15-50-43-35-38-29-29  1 -9  1-87-55-64-45 11-22-14-20-15-15-10-22 -5  2  6
+ A  0 -3 -5  4-20-11  5  6 -2-15 66-69-52 -5 -4  6  8-24 -7-10 -7 13 14 -9-18
+ G 35 22 16 -6 -5-15-25-33-28-53-36-50107 -5-37-44-27-15-23-16-29-47-17-29-15
+
+ P 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39
+ T-26  1  4 -7  3 -4  0-10  8-18  7-22-21  8  4 -3 -6  7 -8  1 -5-16-16  7 -6
+ C  6 -8 19 -7  9 -3 17 -2  3 -9  5 22 22  8 -1  1 18  6 11-10 -8  7 10  0  7
+ A 14-12-42  1 -5 -4-32 12-10 20 -6 -1  3 -4  4-10 -1 -2-14 11 14 -3  2-13  5  
+ G-23 -7 -1 -6-17 -4  0-15-14 -4-17-10 -5-13 -8 10-13-13  9 -4 -3 10  2  4 -8
+
+ P 40
+ T  0
+ C 14
+ A  5
+ G-21
+
+These come from w101 of Stormo, Schneider, Gold and Ehrenfeucht nar 10 2997-
+3011, 1982. They report that this matrix gives a score of at least 2 for all
+gene starts in their library whereas all other sequences score 1 or less. The
+weightings were derived using the perceptron algorithm in the following way:
+want to find a weighting function w such that w.s > t if and only if s is a
+ribosome binding site; divide the sequences into s+ (gene starts) and s- (other
+sequences in the libarary); perform the following procedure:
+
+    test: choose a sequence s from s+ or s-
+          if s is in s+ and w.s >= t go to test
+          if s is in s+ and w.s <= t go to add
+          if s is in s- and w.s <  t go to test
+          if s is in s- and w.s >= t go to subtract
+     add: replace w by w + s
+	  go to test
+subtract: replace w by w - s
+          go to test
+
+this is repeated until differences between w in successive rounds are very
+small.
+Shorter lengths for w were found to be less good although protection 
+experiments show that the ribosome interacts with at most 35-40 bases.
+Note AUG, GUG and UUG can act as initiation codons; AUA will initiate at
+low rates in vivo and essentially any RNA polymer can be made to initiate
+translation under appropriate ionic conditions in vitro.
diff --git a/tables/PIP.MRG b/tables/PIP.MRG
new file mode 100644
index 0000000..681c3b4
--- /dev/null
+++ b/tables/PIP.MRG
@@ -0,0 +1,18 @@
+PLOT POSITIONS FOR ANALYSEP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12    20  9480
+    13     0 10000
+    15    20  9480
+    16    20  9480
+    17    20  9480
+    18   500   500
+    19  1000   500
+    20  1500   500
+    22  1000  1000
+    23  2000  1000
+    24  3000  6500
+    25  1000  4000
+    26   100  9800
+    27  1000  1000
+    28  1000  1000
diff --git a/tables/PROKPROMS.WTS b/tables/PROKPROMS.WTS
new file mode 100644
index 0000000..5f939f1
--- /dev/null
+++ b/tables/PROKPROMS.WTS
@@ -0,0 +1,52 @@
+prokaryotic wts for promoters 
+-35 region:
+P -50-49-48-47-46-45-44-43-42-41-40-39-38-37-36-35-34-33-32-31-30-29-28-27-26
+  107109109110110110110110110111111110111112112112112112112112112112112112112
+T  41 33 32 25 34 22 35 35 42 27 32 42 47 14 92 94 11 19 15 37 46 34 38 48 34
+C  22 27 18 29 20 14 20 12 22 23 16 25 10 43  7  6 11 18 60  8 25 23 23 17 20
+A  28 38 30 37 35 56 42 42 37 42 39 18 25 26  2  6  2 72 26 50 26 34 25 26 31
+G  16 11 29 19 21 18 13 21  9 19 24 26 29 29 11  6 88  3 11 17 15 21 26 21 27
+-10 region:
+P -23-22-21-20-19-18-17-16-15-14-13-12-11-10 -9 -8 -7 -6 -5
+  112112112112112112112112112112112112112112112112112112112
+T  35 28 28 27 39 51 34 43 26 31 89  3 49 15 19108 31 29 21
+C  34 21 24 27 12 25 20 25 20 27 10  2 16 14 22  3 13 16 30
+A  20 39 33 33 39 23 29 16 23 19  2106 29 66 57  1 35 23 31
+G  23 24 27 25 22 13 29 28 43 35 11  1 18 17 14  0 33 24 30
++ region:
+P -2 -1  1  2  3  4  5  6  7  8  9 10
+  86 88 88 88 88 88 88 88 88 88 88 88
+T 16 22  2 42 27 23 20 25 27 15 16 29
+C 29 49  4 25 25 13 18 22 17 17 16 17
+A 20  9 45 16 24 25 28 24 24 32 35 26
+G 21  8 37  5 12 27 22 17 20 24 21 16
+Notes:
+D. K. and McClure, R., nar 11 2237-2255 (1983)
+
+E. coli promoters have been shown to contain 2 regions of conserved sequence
+located about 10 and 35 bases upstream of the transcription startsite. These
+are TATAAT and TTGACA with an allowed spacing of 15 to 21 bases between. The
+spacing with maximum efficiency was 17 bases and all but 12 of the 112 
+sequences could be aligned with a separation of 17 +or-1 bases. The standard
+promoter has spacing 7 and 17 bases between the startsite and the -10 region,
+and the -10 and -35 regions, respectively. The spacing between the -10 region
+and the startsite is usually 6 or 7 bases but varies between 4 and 8 bases.
+There is an AT rich region of 8 to 10 bases upstream of the -35 region.
+Iniation with a purine is highly prefered with G being used if A is not
+present.
+Gap penalties:
+	15 0.02   (only exists as mutant)
+	16 0.2
+	17 1.0
+	18 0.2
+	19 0.05   (guess)
+	20 0.02   (guess)
+	21 0.01   (guess)
+Two processes in volved: 1) recognition
+			 2) melting (needs recognition for initiation
+				     and then AT richness)
+
+factor of about 1 order of magnitude difference for all AT against all GC
+in terms of activity.
+Andrew Travers says there are upstream regions that resemble -35 and -10
+regions (on both strands).
diff --git a/tables/PROTEIN.ALL b/tables/PROTEIN.ALL
new file mode 100644
index 0000000..7bd2cf3
--- /dev/null
+++ b/tables/PROTEIN.ALL
@@ -0,0 +1,21 @@
+A/A//
+C/C//
+D/D//
+E/E//
+F/F//
+G/G//
+H/H//
+I/I//
+K/K//
+L/L//
+M/M//
+N/N//
+P/P//
+Q/Q//
+R/R//
+S/S//
+T/T//
+V/V//
+W/W//
+Y/Y//
+
diff --git a/tables/PROTEIN.GRP b/tables/PROTEIN.GRP
new file mode 100644
index 0000000..ef2b834
--- /dev/null
+++ b/tables/PROTEIN.GRP
@@ -0,0 +1,10 @@
+ACID/D/E//
+BASIC/R/K/H//
+HYDRO/F/L/I/V/Y//
+GLYCO/N-S/N-T//
++/R/K/H//
+-/D/E//
+DIBASIC/RR/KK/RK/KR//
+TURN/N/D/G/P/S//
+BLOCK/A/Q/E/I/L/M/F/W/V//
+INDIF/R/C/H/K/T/Y//
diff --git a/tables/PROTMAT.DAT b/tables/PROTMAT.DAT
new file mode 100644
index 0000000..138f617
--- /dev/null
+++ b/tables/PROTMAT.DAT
@@ -0,0 +1,29 @@
+   C  S  T  P  A  G  N  D  E  Q  B  Z  H  R  K  M  I  L  V  F  Y  W  -  X  ?  
+C 22 10  8  7  8  7  6  5  5  5  5  5  7  6  5  5  8  4  8  6 10  2 10 10 10 10
+S 10 12 11 11 11 11 11 10 10  9 10 10  9 10 10  8  9  7  9  7  7  8 10 10 10 10
+T  8 11 13 10 11 10 10 10 10  9 10 10  9  9 10  9 10  8 10  7  7  5 10 10 10 10
+P  7 11 10 16 11  9  9  9  9 10  9 10 10 10  9  8  8  7  9  5  5  4 10 10 10 10
+A  8 11 11 11 12 11 10 10 10 10 10 10  9  8  9  9  9  8 10  6  7  4 10 10 10 10
+G  7 11 10  9 11 15 10 11 10  9 10 10  8  7  8  7  7  6  9  5  5  3 10 10 10 10
+N  6 11 10  9 10 10 12 12 11 11 12 11 12 10 11  8  8  7  8  6  8  6 10 10 10 10
+D  5 10 10  9 10 11 12 14 13 12 13 12 11  9 10  7  8  6  8  4  6  3 10 10 10 10
+E  5 10 10  9 10 10 11 13 14 12 12 13 11  9 10  8  8  7  8  5  6  3 10 10 10 10
+Q  5  9  9 10 10  9 11 12 12 14 11 13 13 11 11  9  8  8  8  5  6  5 10 10 10 10
+B  5 10 10  9 10 10 12 13 12 11 13 11 11 10 10  8  8  6  8  5  7  4 10 10 10 10
+Z  5 10 10 10 10 10 11 12 13 13 11 14 12 10 10  8  8  8  8  5  6  4 10 10 10 10
+H  7  9  9 10  9  8 12 11 11 13 11 12 16 12 10  8  8  8  8  8 10  7 10 10 10 10
+R  6 10  9 10  8  7 10  9  9 11 10 10 12 16 13 10  8  7  8  6  6 12 10 10 10 10
+K  5 10 10  9  9  8 11 10 10 11 10 10 10 13 15 10  8  7  8  5  6  7 10 10 10 10
+M  5  8  9  8  9  7  8  7  8  9  8  8  8 10 10 16 12 14 12 10  8  6 10 10 10 10
+I  8  9 10  8  9  7  8  8  8  8  8  8  8  8  8 12 15 12 14 11  9  5 10 10 10 10
+L  4  7  8  7  8  6  7  6  7  8  6  8  8  7  7 14 12 16 12 12  9  8 10 10 10 10
+V  8  9 10  9 10  9  8  8  8  8  8  8  8  8  8 12 14 12 14  9  8  4 10 10 10 10
+F  6  7  7  5  6  5  6  4  5  5  5  5  8  6  5 10 11 12  9 19 17 10 10 10 10 10
+Y 10  7  7  5  7  5  8  6  6  6  7  6 10  6  6  8  9  9  8 17 20 10 10 10 10 10
+W  2  8  5  4  4  3  6  3  3  5  4  4  7 12  7  6  5  8  4 10 10 27 10 10 10 10
+- 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+X 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+? 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+  10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10
+   
+   
diff --git a/tables/README b/tables/README
new file mode 100644
index 0000000..0da80c0
--- /dev/null
+++ b/tables/README
@@ -0,0 +1,50 @@
+Sequence Library setup files
+
+
+Default files (What we currently use)
+-------------------------------------
+EMBLLIBDESCRP
+EMBLdiv.lkp
+GENBLIBDESCRP
+GENBdiv.lkp
+NRL3DLIBDESCRP
+NRL3Ddiv.lkp
+PIRLIBDESCRP
+PIRdiv.lkp
+SEQUENCELIBRARIES
+SWISSLIBDESCRP
+SWISSdiv.lkp
+
+
+Sun mounted EMBL CDROM
+----------------------
+EMBLLIBDESCRP-SUN
+EMBLdiv.lkp-sun
+SEQUENCELIBRARIES-SUN
+SWISSLIBDESCRP-SUN
+SWISSdiv.lkp-sun
+
+
+IBM AIX, DEC Ultrix mounted EMBL CDROM
+--------------------------------------
+EMBLLIBDESCRP-DEC
+EMBLdiv.lkp-dec
+SEQUENCELIBRARIES-DEC
+SWISSLIBDESCRP-DEC
+SWISSdiv.lkp-dec
+
+Generic files (suitable for templates)
+--------------------------------------
+EMBLLIBDESCRP-GENERIC
+EMBLdiv.lkp-generic
+GENBLIBDESCRP-GENERIC
+GENBdiv.lkp-generic
+NRL3DLIBDESCRP-GENERIC
+NRL3Ddiv.lkp-generic
+PIRLIBDESCRP-GENERIC
+PIRdiv.lkp-generic
+SEQUENCELIBRARIES-GENERIC
+SWISSLIBDESCRP-GENERIC
+SWISSdiv.lkp-generic
+
+
diff --git a/tables/RENZYM.4 b/tables/RENZYM.4
new file mode 100644
index 0000000..27570cc
--- /dev/null
+++ b/tables/RENZYM.4
@@ -0,0 +1,22 @@
+ALUI/AG'CT//
+CFOI/GCG'C//
+DDEI/C'TNAG//
+FNU4HI/GC'NGC//
+FNUDII/CG'CG//
+HAEIII/GG'CC//
+HINFI/G'ANTC//
+HINPI/G'CGC//
+HPAII/C'CGG//
+MAEI/C'TAG//
+MAEII/A'CGT//
+MAEIII/N'GTNAC//
+MBOI/'GATC//
+MNLI/CCTCNNNNNNN'/'NNNNNNNGAGG//
+NLAIII/CATG'//
+NLAIV/GGN'NCC//
+RSAI/GT'AC//
+SAU96I/G'GNCC//
+SCRFI/CC'NGG//
+SECI/C'CNNGG//
+SFII/GGCCNNNN'NGGCC//
+TAQI/T'CGA//
diff --git a/tables/RENZYM.6 b/tables/RENZYM.6
new file mode 100644
index 0000000..a54f191
--- /dev/null
+++ b/tables/RENZYM.6
@@ -0,0 +1,85 @@
+AATII/GACGT'C//
+ACCI/GT'MKAC//
+AFLII/C'TTAAG//
+AFLIII/A'CRYGT//
+AHAII/GR'CGYC//
+APAI/GGGCC'C//
+APALI/G'TGCAC//
+ASUII/TT'CGAA//
+AVAI/C'YCGRG//
+AVAII/G'GWCC//
+AVRII/C'CTAGG//
+BALI/TGG'CCA//
+BAMHI/G'GATCC//
+BANI/G'GYRCC//
+BANII/GRGCY'C//
+BBVI/GCAGCNNNNNNNN'/'NNNNNNNNNNNNGCTGC//
+BCLI/T'GATCA//
+BGLI/GCCNNNN'NGGC//
+BGLII/A'GATCT//
+BINI/GGATCNNNN'/'NNNNNGATCC//
+BSMI/GAATGCN'/NG'CATTC//
+BSP1286/GDGCH'C//
+BSPMI/ACCTGCNNNN'/'NNNNNNNNGCAGGT//
+BSPMII/TCCGG'A//
+BSSHII/G'CGCGC//
+BSTEII/G'GTNACC//
+BSTXI/CCANNNNN'NTGG//
+BSTNI/CC'WGG//
+CLAI/AT'CGAT//
+DRAI/TTT'AAA//
+DRAII/RG'GNCCY//
+DRAIII/CACNNN'GTG//
+EAEI/Y'GGCCR//
+ECOB/TGANNNNNNNNTGCT/AGCANNNNNNNNTCA//
+ECOK/AACNNNNNNGTGC/GCACNNNNNNGTT//
+ECORI/G'AATTC//
+ECORV/GAT'ATC//
+ESPI/GC'TNAGC//
+FOKI/GGATGNNNNNNNNN'/'NNNNNNNNNNNNNCATCC//
+FSPI/TGC'GCA//
+HAEII/RGCGC'Y//
+HGAI/GACGCNNNNN'/'NNNNNNNNNNGCGTC//
+HGIAI/GWGCW'C//
+HINCII/GTY'RAC//
+HINDIII/A'AGCTT//
+HPAI/GTT'AAC//
+HPHI/GGTGANNNNNNNN'/'NNNNNNNTCACC//
+KPNI/GGTAC'C//
+MBOII/GAAGANNNNNNNN'/'NNNNNNNTCTTC//
+MLUI/A'CGCGT//
+MSTII/CC'TNAGG//
+NAEI/GCC'GGC//
+NARI/GG'CGCC//
+NCII/CC'SGG//
+NCOI/C'CATGG//
+NDEI/CA'TATG//
+NHEI/G'CTAGC//
+NOTI/GC'GGCCGC//
+NRUI/TCG'CGA//
+NSII/ATGCA'T//
+NSPBII/CMG'CKG//
+PFIMI/CCANNNN'NTGG//
+PPUMI/RG'GWCCY//
+PSTI/CTGCA'G//
+PVUI/CGAT'CG//
+PVUII/CAG'CTG//
+RSRII/CG'GWCCG//
+SACI/GAGCT'C//
+SACII/CCGC'GG//
+SALI/G'TCGAC//
+SCAI/AGT'ACT//
+SFANI/GATGCNNNNN'/'NNNNNNNNNGCATC//
+SMAI/CCC'GGG//
+SNABI/TAC'GTA//
+SPEI/A'CTAGT//
+SPHI/GCATG'C//
+SSPI/AAT'ATT//
+STUI/AGG'CCT//
+STYI/C'CWWGG//
+TTHIIII/GACN'NNGTC//
+XBAI/T'CTAGA//
+XHOI/C'TCGAG//
+XHOII/R'GATCY//
+XMAIII/C'GGCCG//
+XMNI/GAANN'NNTTC//
diff --git a/tables/RENZYM.ALL b/tables/RENZYM.ALL
new file mode 100644
index 0000000..bf78e7d
--- /dev/null
+++ b/tables/RENZYM.ALL
@@ -0,0 +1,316 @@
+; REBASE 9210 from Rich Roberts
+AatI/AGG'CCT//
+AatII/GACGT'C//
+Acc65I/G'GTACC//
+AccI/GT'MKAC//
+AccII/CG'CG//
+AccIII/T'CCGGA//
+AciI/C'CGC/G'CGG//
+AcyI/GR'CGYC//
+AfaI/GT'AC//
+AflII/C'TTAAG//
+AflIII/A'CRYGT//
+AgeI/A'CCGGT//
+AluI/AG'CT//
+Alw21I/GWGCW'C//
+Alw26I/GTCTCN'NNNN/'NNNNNGAGAC//
+Alw44I/G'TGCAC//
+AlwI/GGATCNNNN'N/'NNNNNGATCC//
+AlwNI/CAGNNN'CTG//
+Aor51HI/AGC'GCT//
+ApaI/GGGCC'C//
+ApaLI/G'TGCAC//
+ApoI/R'AATTY//
+ApyI/CC'WGG//
+AscI/GG'CGCGCC//
+AseI/AT'TAAT//
+AsnI/AT'TAAT//
+Asp700I/GAANN'NNTTC//
+Asp718I/G'GTACC//
+AspEI/GACNNN'NNGTC/GACNN'NNNGTC//
+AspHI/GWGCW'C//
+AspI/GACN'NNGTC//
+AsuI/G'GNCC//
+AvaI/C'YCGRG//
+AvaII/G'GWCC//
+AviII/TGC'GCA//
+AvrII/C'CTAGG//
+AxyI/CC'TNAGG//
+BalI/TGG'CCA//
+BamHI/G'GATCC//
+BanI/G'GYRCC//
+BanII/GRGCY'C//
+BanIII/AT'CGAT//
+BbeI/GGCGC'C//
+BbiII/GR'CGYC//
+BbrPI/CAC'GTG//
+BbsI/GAAGACNN'NNNN/'NNNNNNGTCTTC//
+BbuI/GCATG'C//
+BbvI/GCAGCNNNNNNNN'NNNN/'NNNNNNNNNNNNGCTGC//
+BcgI/GCANNNNNNTCGNNNNNNNNNNNN'/NN'NNNNNNNNNNCGANNNNNNTGC//
+BclI/T'GATCA//
+BcnI/CC'SGG//
+BcoI/C'YCGRG//
+BfaI/C'TAG//
+BfrI/C'TTAAG//
+BglI/GCCNNNN'NGGC//
+BglII/A'GATCT//
+BlnI/C'CTAGG//
+BmyI/GDGCH'C//
+BpmI/CTGGAGNNNNNNNNNNNNNNNN'/NN'NNNNNNNNNNNNNNCTCCAG//
+Bpu1102I/GC'TNAGC//
+BsaAI/YAC'GTR//
+BsaBI/GATNN'NNATC//
+BsaHI/GR'CGYC//
+BsaI/GGTCTCN'NNNN/'NNNNNGAGACC//
+BsaJI/C'CNNGG//
+BsaWI/W'CCGGW//
+BscBI/GGN'NCC//
+BscCI/GAATGCN'/NG'CATTC//
+BscI/AT'CGAT//
+BseAI/T'CCGGA//
+BseNI/ACTGGN'/NC'CAGT//
+BsgI/GTGCAGNNNNNNNNNNNNNNNN'/NN'NNNNNNNNNNNNNNCTGCAC//
+Bsh1236I/CG'CG//
+Bsh1285I/CGRY'CG//
+BshI/GG'CC//
+BsiBI/GATNN'NNATC//
+BsiCI/TT'CGAA//
+BsiEI/CGRY'CG//
+BsiHKAI/GWGCW'C//
+BsiLI/CC'WGG//
+BsiMI/T'CCGGA//
+BsiQI/T'GATCA//
+BsiWI/C'GTACG//
+BsiXI/AT'CGAT//
+BsiYI/CCNNNNN'NNGG//
+BsiZI/G'GNCC//
+BslI/CCNNNNN'NNGG//
+BsmAI/GTCTCN'NNNN/'NNNNNGAGAC//
+BsmI/GAATGCN'/NG'CATTC//
+Bsp106I/AT'CGAT//
+Bsp119I/TT'CGAA//
+Bsp120I/G'GGCCC//
+Bsp1286I/GDGCH'C//
+Bsp1407I/T'GTACA//
+Bsp143I/'GATC//
+Bsp143II/RGCGC'Y//
+Bsp50I/CG'CG//
+Bsp68I/TCG'CGA//
+BspCI/CGAT'CG//
+BspDI/AT'CGAT//
+BspEI/T'CCGGA//
+BspHI/T'CATGA//
+BspMI/ACCTGCNNNN'NNNN/'NNNNNNNNGCAGGT//
+BspWI/GCNNNNN'NNGC//
+BspXI/AT'CGAT//
+BsrBI/GAG'CGG/CCG'CTC//
+BsrFI/R'CCGGY//
+BsrI/ACTGGN'/NC'CAGT//
+BssHII/G'CGCGC//
+Bst1107I/GTA'TAC//
+Bst71I/GCAGCNNNNNNNN'NNNN/'NNNNNNNNNNNNGCTGC//
+BstBI/TT'CGAA//
+BstEII/G'GTNACC//
+BstI/G'GATCC//
+BstNI/CC'WGG//
+BstOI/CC'WGG//
+BstPI/G'GTNACC//
+BstUI/CG'CG//
+BstXI/CCANNNNN'NTGG//
+BstYI/R'GATCY//
+BstZI/C'GGCCG//
+Bsu15I/AT'CGAT//
+Bsu36I/CC'TNAGG//
+BsuRI/GG'CC//
+CcrI/C'TCGAG//
+CelII/GC'TNAGC//
+CfoI/GCG'C//
+Cfr10I/R'CCGGY//
+Cfr13I/G'GNCC//
+Cfr42I/CCGC'GG//
+Cfr9I/C'CCGGG//
+ClaI/AT'CGAT//
+CpoI/CG'GWCCG//
+Csp45I/TT'CGAA//
+Csp6I/G'TAC//
+CspI/CG'GWCCG//
+CvnI/CC'TNAGG//
+DdeI/C'TNAG//
+DpnI/GA'TC//
+DpnII/'GATC//
+DraI/TTT'AAA//
+DraII/RG'GNCCY//
+DraIII/CACNNN'GTG//
+DrdI/GACNNNN'NNGTC//
+DsaI/C'CRYGG//
+DsaV/'CCNGG//
+EaeI/Y'GGCCR//
+EagI/C'GGCCG//
+Eam1104I/CTCTTCN'NNN/'NNNNGAAGAG//
+Eam1105I/GACNNN'NNGTC//
+EarI/CTCTTCN'NNN/'NNNNGAAGAG//
+Ecl136II/GAG'CTC//
+EclXI/C'GGCCG//
+Eco105I/TAC'GTA//
+Eco130I/C'CWWGG//
+Eco147I/AGG'CCT//
+Eco24I/GRGCY'C//
+Eco31I/GGTCTCN'NNNN/'NNNNNGAGACC//
+Eco32I/GAT'ATC//
+Eco47I/G'GWCC//
+Eco47III/AGC'GCT//
+Eco52I/C'GGCCG//
+Eco57I/CTGAAGNNNNNNNNNNNNNNNN'/NN'NNNNNNNNNNNNNNCTTCAG//
+Eco64I/G'GYRCC//
+Eco72I/CAC'GTG//
+Eco81I/CC'TNAGG//
+Eco88I/C'YCGRG//
+Eco91I/G'GTNACC//
+EcoNI/CCTNN'NNNAGG//
+EcoO109I/RG'GNCCY//
+EcoO65I/G'GTNACC//
+EcoRI/G'AATTC//
+EcoRII/'CCWGG//
+EcoRV/GAT'ATC//
+EcoT14I/C'CWWGG//
+EcoT22I/ATGCA'T//
+EheI/GGC'GCC//
+Esp3I/CGTCTCN'NNNN/'NNNNNGAGACG//
+EspI/GC'TNAGC//
+FbaI/T'GATCA//
+FdiII/TGC'GCA//
+Fnu4HI/GC'NGC//
+FokI/GGATGNNNNNNNNN'NNNN/'NNNNNNNNNNNNNCATCC//
+FspI/TGC'GCA//
+GsuI/CTGGAGNNNNNNNNNNNNNNNN'/NN'NNNNNNNNNNNNNNCTCCAG//
+HaeII/RGCGC'Y//
+HaeIII/GG'CC//
+HapII/C'CGG//
+HgaI/GACGCNNNNN'NNNNN/'NNNNNNNNNNGCGTC//
+HgiAI/GWGCW'C//
+HhaI/GCG'C//
+Hin1I/GR'CGYC//
+Hin6I/G'CGC//
+HincII/GTY'RAC//
+HindII/GTY'RAC//
+HindIII/A'AGCTT//
+HinfI/G'ANTC//
+HinP1I/G'CGC//
+HpaI/GTT'AAC//
+HpaII/C'CGG//
+HphI/GGTGANNNNNNNN'/N'NNNNNNNTCACC//
+KasI/G'GCGCC//
+Kpn2I/T'CCGGA//
+KpnI/GGTAC'C//
+Ksp632I/CTCTTCN'NNN/'NNNNGAAGAG//
+KspI/CCGC'GG//
+LspI/TT'CGAA//
+MaeI/C'TAG//
+MaeII/A'CGT//
+MaeIII/'GTNAC//
+MamI/GATNN'NNATC//
+MboI/'GATC//
+MboII/GAAGANNNNNNNN'/N'NNNNNNNTCTTC//
+McrI/CGRY'CG//
+MflI/R'GATCY//
+MluI/A'CGCGT//
+MnlI/CCTCNNNNNNN'/N'NNNNNNGAGG//
+Mph1103I/ATGCA'T//
+MroI/T'CCGGA//
+MscI/TGG'CCA//
+MseI/T'TAA//
+MspI/C'CGG//
+MstI/TGC'GCA//
+MstII/CC'TNAGG//
+MunI/C'AATTG//
+MvaI/CC'WGG//
+MvnI/CG'CG//
+NaeI/GCC'GGC//
+NarI/GG'CGCC//
+NciI/CC'SGG//
+NcoI/C'CATGG//
+NdeI/CA'TATG//
+NdeII/'GATC//
+NgoMI/G'CCGGC//
+NheI/G'CTAGC//
+NlaIII/CATG'//
+NlaIV/GGN'NCC//
+NotI/GC'GGCCGC//
+NruI/TCG'CGA//
+NsiI/ATGCA'T//
+NspBII/CMG'CKG//
+NspI/RCATG'Y//
+NspIII/C'YCGRG//
+NspV/TT'CGAA//
+PacI/TTAAT'TAA//
+PaeI/GCATG'C//
+PaeR7I/C'TCGAG//
+PalI/GG'CC//
+Pfl23II/C'GTACG//
+PflMI/CCANNNN'NTGG//
+PleI/GAGTCNNNN'N/'NNNNNGACTC//
+PmaCI/CAC'GTG//
+PmeI/GTTT'AAAC//
+PmlI/CAC'GTG//
+Ppu10I/A'TGCAT//
+PpuMI/RG'GWCCY//
+PshAI/GACNN'NNGTC//
+Psp1406I/AA'CGTT//
+Psp5II/RG'GWCCY//
+PspAI/C'CCGGG//
+PssI/RGGNC'CY//
+PstI/CTGCA'G//
+PvuI/CGAT'CG//
+PvuII/CAG'CTG//
+RmaI/C'TAG//
+RsaI/GT'AC//
+RsrII/CG'GWCCG//
+SacI/GAGCT'C//
+SacII/CCGC'GG//
+SalI/G'TCGAC//
+SapI/GCTCTTCN'NNN/'NNNNGAAGAGC//
+Sau3AI/'GATC//
+Sau96I/G'GNCC//
+SauI/CC'TNAGG//
+ScaI/AGT'ACT//
+ScrFI/CC'NGG//
+SduI/GDGCH'C//
+SfaNI/GCATCNNNNN'NNNN/'NNNNNNNNNGATGC//
+SfcI/C'TRYAG//
+SfiI/GGCCNNNN'NGGCC//
+SfuI/TT'CGAA//
+SgrAI/CR'CCGGYG//
+SinI/G'GWCC//
+SmaI/CCC'GGG//
+SnaBI/TAC'GTA//
+SnoI/G'TGCAC//
+SpeI/A'CTAGT//
+SphI/GCATG'C//
+SplI/C'GTACG//
+SpoI/TCG'CGA//
+SrfI/GCCC'GGGC//
+Sse8387I/CCTGCA'GG//
+SspI/AAT'ATT//
+SstI/GAGCT'C//
+SstII/CCGC'GG//
+StuI/AGG'CCT//
+StyI/C'CWWGG//
+SunI/C'GTACG//
+SwaI/ATTT'AAAT//
+TaqI/T'CGA//
+TfiI/G'AWTC//
+ThaI/CG'CG//
+Tth111I/GACN'NNGTC//
+TthHB8I/T'CGA//
+Van91I/CCANNNN'NTGG//
+VspI/AT'TAAT//
+XbaI/T'CTAGA//
+XcmI/CCANNNNN'NNNNTGG//
+XhoI/C'TCGAG//
+XhoII/R'GATCY//
+XmaI/C'CCGGG//
+XmaIII/C'GGCCG//
+XmnI/GAANN'NNTTC//
+XorII/CGAT'CG//
+
diff --git a/tables/RIBSEUKAR.WTS b/tables/RIBSEUKAR.WTS
new file mode 100644
index 0000000..71ee71f
--- /dev/null
+++ b/tables/RIBSEUKAR.WTS
@@ -0,0 +1,22 @@
+mRNA WTS FOR EUKARYOTES SARGAN,GREGORY,BUTTERWORTH FEBS LET 147 133-136 1982
+P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+  102102102102102102102102102102
+T  19 24 31 12  0 18  5  0102  0
+C  20 15 32 65  5 42 52  0  0  0
+A  50 27 27 19 86 36 34102  0  0
+G   6 29 12  6 11  6 11  0  0102
+VIRAL ONLY
+P  -7 -6 -5 -4 -3 -2 -1  1  2  3
+   41 41 41 41 41 41 41 41 41 41
+T  14 12 16  4  2 13  9  0 41  0
+C   7  3 13 17  7  9 14  0  0  0
+A  15 10  6 10 27 15  9 41  0  0
+G   5 16  6 10  5  4  9  0  0 41
+This paper puts forward the hypothesis that there is an interaction between
+some mRNA leader sequences and a highly conserved structure in the 18S rRNA
+of eukaryotic ribosomes. The attempt to substantiate the hypothesis includes
+a table of base frequencies for sequences immediately 5' to start codons.
+They examined 102 sequences and I have used the base frequencies they found
+as a weight matrix for searching for eukaryotic gene starts. I don't yet 
+know how good this method is. The viral sequences were found to be slightly
+different but the separate table shown here is not used in the program.
diff --git a/tables/ROBSON.WTS b/tables/ROBSON.WTS
new file mode 100644
index 0000000..ed14311
--- /dev/null
+++ b/tables/ROBSON.WTS
@@ -0,0 +1,112 @@
+     DIRECTIONAL INFORMATION MEASURE FOR THE ALPHA-HELICAL CONFORMATION
+    J-8 J-7 J-6 J-5 J-4 J-3 J-2 J-1   J J+1 J+2 J+3 J+4 J+5 J+6 J+7 J+8
+ C    0   0   0   0   0   0  -5 -10 -13 -10  -5   0   0   0   0   0   0
+ S    0  -5 -10 -15 -20 -25 -30 -35 -39 -35 -30 -25 -20 -15 -10  -5   0
+ T    0   0   0  -5 -10 -15 -20 -25 -26 -25 -20 -15 -10  -5   0   0   0
+ P  -10 -20 -40 -60 -80-100-120-140 -77 -60 -30 -20 -10   0   0   0   0
+ A    5  10  15  20  30  40  50  60  65  60  50  40  30  20  15  10   5
+ G   -5 -10 -15 -20 -30 -40 -50 -60 -86 -60 -50 -40 -30 -20 -15 -10  -5
+ N    0   0   0   0 -10 -20 -30 -40 -51 -40 -30 -20 -10   0   0   0   0
+ D    0  -5 -10 -15 -20 -15 -10   0   5  10  15  20  20  20  15  10   5
+ E    0   0   0   0  10  20  60  70  78  78  78  78  78  70  60  40  20
+ Q    0   0   0   0   5  10  20  20  10 -10 -20 -20 -10  -5   0   0   0
+ B    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ Z    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ H   10  20  30  40  50  50  50  30  12 -20 -10   0   0   0   0   0   0
+ R    0   0   0   0   0   0   0   0  -9 -15 -20 -30 -40 -50 -50 -30 -10
+ K   20  40  50  55  60  60  50  30  23  10   5   0   0   0   0   0   0
+ M   10  20  25  30  35  40  45  50  53  50  45  40  35  30  25  20  10
+ I    5  10  15  20  25  20  15  10   6   0 -10 -15 -20 -25 -20 -10  -5
+ L    0   5  10  15  20  25  28  30  32  30  28  25  20  15  10   5   0
+ V    0   0   0   0   0   0   5  10  14  10   5   0   0   0   0   0   0
+ F    0   0   0   0   0   5  10  15  16  15  10   5   0   0   0   0   0
+ Y   -5 -10 -15 -20 -25 -30 -35 -40 -45 -40 -35 -30 -25 -20 -15 -10  -5
+ W  -10 -20 -40 -50 -50 -10   0  10  12  10   0 -10 -50 -50 -40 -20 -10
+ -    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ X    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ ?    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+      0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+     DIRECTIONAL INFORMATION MEASURE FOR THE EXTENDED CONFORMATION
+    J-8 J-7 J-6 J-5 J-4 J-3 J-2 J-1   J J+1 J+2 J+3 J+4 J+5 J+6 J+7 J+8
+ C    0   0   0   0   0  10  20  30  44  30  20  10   0   0   0   0   0
+ S    0  10  20  10   0  -5 -10 -15 -17 -15 -10  -5   0  10  20  10   0
+ T    5  10  15  20  15  15  10  10  13  10  10  15  15  20  15  10   5
+ P   10  20  30  30  20  10   0 -10 -18 -20 -10  10  30  40  30  20  10
+ A    0   0   0   0  -5 -10 -15 -20 -23 -20 -15 -10  -5   0   0   0   0
+ G   10  20  30  40  40  20   0 -20 -42 -20   0  20  40  40  30  20 -10
+ N   10  30  50  30  20   0 -15 -30 -41 -30 -15   0  20  30  50  30  10
+ D    0   5  10  15  20   0 -20 -30 -44 -30 -20   0   0   0   0   0   0
+ E  -10 -15 -20 -25 -30 -35 -40 -45 -50 -55 -60 -60 -50 -40 -30 -20 -10
+ Q    0   0   0   0   0  -5 -10   0  12  20  30  40  50  50  40  30  15
+ B    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ Z    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ H  -10 -20 -40 -20 -10   0 -10 -20 -25 -35 -30 -25 -20 -15 -10  -5   0
+ R    0   0   0   0   0   0   0   0   4   0   0   0   0   0   0   0   0
+ K   -5 -10 -15 -20 -30 -40 -50 -40 -33 -20 -10   0  10  10   0   0   0
+ M  -10 -20 -30 -40 -40 -30   0  10  23  10   0 -30 -40 -40 -30 -20 -10
+ I    0 -10 -20 -10   0  20  40  60  67  60  40  20   0 -10 -20 -10   0
+ L    0   0   0   0   0   5  10  20  23  20  10   5   0   0   0   0   0
+ V    0   0 -10 -20   0  20  40  60  68  60  40  20   0 -20 -10   0   0
+ F    0   0   0   0   0   5  10  20  26  10 -10 -30 -60 -65 -60 -40 -20
+ Y    0   5  10  15  20  25  30  35  40  35  30  25  20  15  10   5   0
+ W    0   0   0   0   0 -10 -10 -10 -10 -10 -10 -15 -20 -25 -30 -20 -10
+ -    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ X    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ ?    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+      0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+     DIRECTIONAL INFORMATION MEASURE FOR TURNS
+    J-8 J-7 J-6 J-5 J-4 J-3 J-2 J-1   J J+1 J+2 J+3 J+4 J+5 J+6 J+7 J+8
+ C   20  40  50  60  60  55  50  45  44  40  35  30  25  20  15  10   5
+ S    0 -10 -20 -20  10  15  20  25  26  25  20  15  10   0   0   0   0
+ T    0  10  20  20  20  15  18   5   3   5  10  15  20  20  20  10   0
+ P   10  20  30  40  50  70  10 -90  36  90  10   0   0   0   0   0   0
+ A    0   0   0 -10 -20 -30 -40 -50 -50 -40 -30 -20 -10   0   0   0   0
+ G    0   0   0   0  10  30  55  55  57  40   0   0   0   0   0   0   0
+ N    0   0   0  10  20  30  35  40  42  40  35  30  20  10   5   0   0
+ D    0   0   0   0   0   0   5  10  31  10   5   0   0   0   0   0   0
+ E    0  -5 -10 -15 -20 -30 -40 -45 -47 -20   0  10   5   0   0   0   0
+ Q   10  20  30  25  20  15  10   5   4  20  30  40  50  60  50  40  20
+ B    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ Z    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ H    0   0   0   0   0   0   0   0  -3   0  10  20  30  20  10   0   0
+ R    0   0   0   0   0   0   0  10  21  30  40  30  20  10   0   0   0
+ K  -10 -20 -30 -40 -25 -10   0  10  10  10   0 -20 -30 -20 -10  -5   0
+ M   -5 -15 -20 -25 -30 -35 -40 -45 -48 -45 -40 -35 -30 -25 -20 -15  -5
+ I    0   0   0   0   0 -10 -20 -30 -46 -40 -10   0   0  20  30  20  10
+ L    0   0   0 -10 -20 -30 -40 -50 -56 -20 -10   0   0   0   0   0   0
+ V    0   0   0   0 -10 -20 -30 -40 -60 -40 -30 -20 -10   0   0   0   0
+ F    0   0   0   0   0  -5 -10 -15 -18 -15   0  15  30  25  20  10   0
+ Y    0   0   0   5  15  15  20  25  29  25  20  15  15   5   0   0   0
+ W    0   0   0  10  20  30  40  80  36 -30  30  40  50  60  70  40  20
+ -    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ X    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ ?    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+      0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+     DIRECTIONAL INFORMATION MEASURE FOR COIL
+    J-8 J-7 J-6 J-5 J-4 J-3 J-2 J-1   J J+1 J+2 J+3 J+4 J+5 J+6 J+7 J+8
+ C    0   0   0   0   0   0 -10 -30 -47 -30 -10   0   0   0   0   0   0
+ S    0 -10 -20 -20  10  15  20  25  50  25  20  15  10   0   0   0   0
+ T    0  10  20  30  20  15  10  15  17  15  10  15  20  30  20  10   0
+ P    0   0  10  20  30  40  50  55  58  50  10   0   0   0   0   0   0
+ A    0   0   0   0  -5 -10 -20 -25 -25 -25 -20 -15 -10  -5   0   0   0
+ G    0   0   0   0  10  30  40  45  49  45  40  30  10   0   0   0   0
+ N    0   0   0  10  20  30  35  40  46  40  35  30  20  10   0   0   0
+ D    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ E    0   0  10  20  40  20   0 -10 -44 -40 -20 -10   0   0   0   0   0
+ Q   10  20  30  25  20  15  10   0  -5  20  30  40  50  60  50  40  20
+ B    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ Z    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ H    0   0   0   0   0   0   0  10  16  15  10  10  10  10   5   0   0
+ R    0   0   0   0   0   0   0   0 -12   0  20  30  20  10   0   0   0
+ K  -10 -20 -30 -40 -25 -20 -10  -8  -8   0   0 -20 -30 -20 -10  -5   0
+ M    0  -5 -10 -15 -20 -25 -30 -40 -41 -40 -30 -25 -20 -15 -10  -5   0
+ I    0   0   0   0   0 -10 -20 -30 -33 -30 -10   0  10  20  30  20   0
+ L    0   0   0 -10 -20 -30 -40 -30 -20 -20 -10   0   0   0   0   0   0
+ V    0   0   0   0 -10 -20 -25 -30 -35 -30 -25 -20 -10   0   0   0   0
+ F    0   0   0   0   0  -5 -10 -20 -41 -20   0  15  30  25  20  10   0
+ Y    0   0   0   0   0   0   0   0  -6   0   0   0   0   0   0   0   0
+ W    0   0   0  10  20  30  40  20  12  20  30  40  50  60  70  40  20
+ -    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ X    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+ ?    0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
+      0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0   0
diff --git a/tables/SAP.MRG b/tables/SAP.MRG
new file mode 100644
index 0000000..b9e90fd
--- /dev/null
+++ b/tables/SAP.MRG
@@ -0,0 +1,10 @@
+PLOT POSITIONS FOR SAP. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  1100  8500
+    12  5000  5000
+    13     1   100
+    15     1 10000
+    16    20    20
+    29  4000  1000
+    33  5000  4000
+    34  9000  1000
diff --git a/tables/SIP.MRG b/tables/SIP.MRG
new file mode 100644
index 0000000..6b30f89
--- /dev/null
+++ b/tables/SIP.MRG
@@ -0,0 +1,4 @@
+PLOT POSITIONS FOR DIAGON. DRAWING BOARD SIZE, X0 XLENGTH, OPT Y0 YLEN
+ 10000 10000
+  2400  7000  7000
+     1    10  9980
diff --git a/tables/TAGDB b/tables/TAGDB
new file mode 100644
index 0000000..f96b0a6
--- /dev/null
+++ b/tables/TAGDB
@@ -0,0 +1,60 @@
+#
+# This is a tag database for xdap
+#
+# The syntax in BNF is:
+#
+#    <tag_db>      ::= <tag> <tag_db> | <empty>
+#    <tag>         ::= <identifier> : <option_list> \n | <identifier> \n
+#    <option_list> ::= <option> : <option_list> | <option_list>
+#    <option>      ::= <option_name> = <string>
+#
+# Lines starting with a # are comments and are ignored.
+# Newlines can be escaped over by placing a back slash at the end of the line
+#
+# options        description                  default value
+#   id="string"    identifier for searching     1st 4 characters of identifier
+#   fg=colour      foreground colour            fg for window
+#   bg=colour      background colour            bf for window
+#   dt="string"    default text for tag         none
+#
+# examples:
+#   comment:fg=blue:dt="?"
+#
+
+# For general comments
+comment: \
+	id="COMM": \
+	fg=blue: \
+	dt="?"
+
+# For marking oligos
+oligo: \
+	id="OLIG": \
+	bg=yellow: \
+
+# For marking compressions
+compression: \
+	id="COMP": \
+	bg=red: \
+
+# For marking stops
+stop: \
+	id="STOP": \
+	fg=white: \
+	bg=blue: \
+
+# For marking repeats
+repeat: \
+	id="REPT": \
+	bg=green2: \
+
+# For marking cosmid vector
+"cosmid vector" : \
+	id="CVEC" : \
+	bg=cyan
+
+# For marking ALU sequences
+"Alu segment" : \
+	id="ALUS" : \
+	bg=green
+
diff --git a/tables/Xdap b/tables/Xdap
new file mode 100644
index 0000000..88c8a6d
--- /dev/null
+++ b/tables/Xdap
@@ -0,0 +1,231 @@
+!  Default resources for staden applications 
+!  ----------------------------------------- 
+!  Dialogue shell 
+!  -------------- 
+Xdap*font: 7x13bold
+Xdap.Dialogue.allowShellResize: False
+Xdap.Dialogue.width:	 563
+Xdap.Dialogue.height:    190
+Xdap.Dialogue.dialogueForm*translations: #override \n\
+         <Key>Return: CRAction()
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Dialogue.dialogueForm*top:    chainTop
+Xdap.Dialogue.dialogueForm*bottom: chainTop
+Xdap.Dialogue.dialogueForm*left:   chainLeft
+Xdap.Dialogue.dialogueForm*right:  chainLeft
+!  currFile 
+Xdap.Dialogue.dialogueForm.currFile.label: Current file:
+Xdap.Dialogue.dialogueForm.currFile.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFile.resizable: True
+!  currFunc 
+Xdap.Dialogue.dialogueForm.currFunc.label: Current function:
+Xdap.Dialogue.dialogueForm.currFunc.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFunc.resizable: True
+!  quit 
+Xdap.Dialogue.dialogueForm.quit.label: Quit application
+Xdap.Dialogue.dialogueForm.quit.foreground:  red
+Xdap.Dialogue.dialogueForm.quit.borderColor: red
+!  null 
+Xdap.Dialogue.dialogueForm.null.borderWidth: 0
+Xdap.Dialogue.dialogueForm.null.label: Working ... 
+!  nChoice 
+Xdap.Dialogue.dialogueForm.nChoice.resizable: True
+!  nChoice buttons 
+Xdap.Dialogue.dialogueForm.nChoice.buttons.resizable: True
+!Xdap.Dialogue.dialogueForm.nChoice.buttons.orientation: horizontal
+!  nChoice OK 
+Xdap.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3
+!  getopt 
+Xdap.Dialogue.dialogueForm.getopt.width: 540
+!  getopt execlabel 
+Xdap.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:
+Xdap.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0
+!  getopt execcommand 
+Xdap.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue
+Xdap.Dialogue.dialogueForm.getopt.execcommand.resizable: True
+!  getopt menubar 
+Xdap.Dialogue.dialogueForm.getopt.menubar.width:	530
+Xdap.Dialogue.dialogueForm.getopt.menubar.height:	25
+!  getstring 
+Xdap.Dialogue.dialogueForm.getstring.resizable: True
+Xdap.Dialogue.dialogueForm.getstring.label:
+Xdap.Dialogue.dialogueForm.getstring.value:
+!  getstring label 
+Xdap.Dialogue.dialogueForm.getstring.label.resizable: True
+!  getstring value 
+Xdap.Dialogue.dialogueForm.getstring.value.width: 350
+!  getstring OK 
+Xdap.Dialogue.dialogueForm.getstring.OK.borderWidth: 3
+!  yesno 
+Xdap.Dialogue.dialogueForm.yesno.resizable: True
+!  yesno label 
+Xdap.Dialogue.dialogueForm.yesno.lab.label:
+Xdap.Dialogue.dialogueForm.yesno.lab.borderWidth: 0
+Xdap.Dialogue.dialogueForm.yesno.lab.resizable: True
+Xdap.Dialogue.dialogueForm.yesno.YES.borderWidth: 3
+!  Help shell 
+!  ---------- 
+Xdap.Help.width:     563
+Xdap.Help.minWidth:  550
+Xdap.Help.height:    400
+Xdap.Help.minHeight: 150
+Xdap.Help*top:    chainTop
+Xdap.Help*bottom: chainTop
+Xdap.Help*left:   chainLeft
+Xdap.Help*right:  chainLeft
+!  currDisp 
+Xdap.Help.helpForm.currDisp.resizable: True
+Xdap.Help.helpForm.currDisp.borderWidth: 0
+!  rmHelp 
+Xdap.Help.helpForm.rmHelp.label: Remove help
+!  button 
+Xdap.Help.helpForm.button.label: List of topics
+!  topicsViewport 
+Xdap.Help.helpForm.topicsViewport.allowVert: True
+Xdap.Help.helpForm.topicsViewport.bottom: chainBottom
+Xdap.Help.helpForm.topicsViewport.right:  chainRight
+Xdap.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1
+! Xdap.Help.helpForm.topicsViewporttopicsList.forceColumns:   2
+!  text 
+Xdap.Help.helpForm.text.editType: edit
+Xdap.Help.helpForm.text.scrollVertical: always
+Xdap.Help.helpForm.text.bottom: chainBottom
+Xdap.Help.helpForm.text.right:  chainRight
+!  Output shell 
+!  ------------ 
+Xdap.Graphics.graph.width:  563
+Xdap.Graphics.graph.height: 400
+Xdap.Output.text.width:   563
+Xdap.Output.text.height:  100
+!  Editor shell 
+!  ------------ 
+Xdap.Editor.allowShellResize: True
+Xdap.Editor.mainForm.resizable: True
+! 
+    "Xdap.Editor*font:  5x8",
+
+Xdap.Editor*font:  *Fixed-*-20-200-*
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Editor.mainForm*top:    chainTop
+Xdap.Editor.mainForm*bottom: chainTop
+Xdap.Editor.mainForm*left:   chainLeft
+Xdap.Editor.mainForm*right:  chainLeft
+Xdap.Editor.mainForm.title.borderWidth: 0
+Xdap.Editor.mainForm.title.label: Contig Editor (V1.1)
+Xdap.Editor.mainForm.buttons.borderWidth: 0
+Xdap.Editor.mainForm.buttons.orientation: horizontal
+Xdap.Editor.mainForm.buttons.resizable:  True
+Xdap.Editor.mainForm.buttons*vertDistance: 0
+Xdap.Editor.mainForm.buttons.insert.label: Insert
+Xdap.Editor.mainForm.buttons.replace.label: Replace
+Xdap.Editor.mainForm.buttons.replace.state: True
+Xdap.Editor.mainForm.buttons.superman.label: Super Edit
+Xdap.Editor.mainForm.buttons.superman.state: False
+Xdap.Editor.mainForm.buttons.reveal.label: Reveal Cutoffs
+Xdap.Editor.mainForm.buttons.reveal.state: False
+Xdap.Editor.mainForm.buttons.undo.label: Undo
+Xdap.Editor.mainForm.buttons.save.label: Save
+Xdap.Editor.mainForm.buttons.exit.label: Leave Editor
+Xdap.Editor.mainForm.buttons.exit.foreground:  red
+Xdap.Editor.mainForm.buttons.exit.borderColor: red
+Xdap.Editor.mainForm.buttons.lock.label: Lock
+Xdap.Editor.mainForm.buttons.join.label: Join
+Xdap.Editor.mainForm.buttons.next.label: Next Problem
+Xdap.Editor.mainForm.edForm.resizable: True
+Xdap.Editor.mainForm.edForm.defaultDistance: 0
+Xdap.Editor.mainForm.edForm.borderWidth: 0
+Xdap.Editor.mainForm.edForm.sequences.resizable: True
+Xdap.Editor.mainForm.edForm.names.resizable: True
+Xdap.Editor.mainForm.edForm.slider.length:  500
+Xdap.Editor.mainForm.edForm.slider.orientation:  horizontal
+Xdap.Editor.mainForm.edForm.scrollButtons.defaultDistance:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.borderWidth:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.left:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.right:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resizable:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resize:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.left:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.*.right:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.leftleft.label:  <<
+Xdap.Editor.mainForm.edForm.scrollButtons.left.label:  <
+Xdap.Editor.mainForm.edForm.scrollButtons.right.label:  >
+Xdap.Editor.mainForm.edForm.scrollButtons.rightright.label:  >>
+Xdap.Editor*warnPrompt.label: Save changes? 
+Xdap.Editor*warnBox.borderWidth: 0
+Xdap.Editor*warnBox.orientation: vertical
+Xdap.Editor*warnPrompt.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.resizable: True
+Xdap.Editor.mainForm.disagreeForm.dummy.resizable: True
+Xdap.Editor.mainForm.disagreeForm.disagreements.resizable: True
+Xdap.Editor.mainForm.disagreeForm.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.defaultDistance: 0
+!  Trace Manager 
+!  ------------- 
+*Traces.mainForm.buttons.borderWidth: 0
+*Traces.mainForm.buttons.orientation: horizontal
+*Traces.mainForm.buttons*top:    ChainTop
+*Traces.mainForm.buttons*bottom: ChainTop
+*Traces.mainForm.buttons*left:   ChainLeft
+*Traces.mainForm.buttons*right:  ChainLeft
+*Traces.allowShellResize: True
+*Traces.mainForm.traceForm.resizable: True
+*Traces.mainForm.traceForm.form.resizable: True
+*Traces.mainForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.defaultDistance: 1
+*Traces.mainForm.traceForm.borderWidth: 0
+*Traces.mainForm.traceForm.form.borderWidth: 0
+!  The main display: a viewport containing a form containing graphs 
+*Traces.mainForm.traceForm.form.top:    Rubber
+*Traces.mainForm.traceForm.form.bottom: Rubber
+*Traces.mainForm.traceForm.form.left: ChainLeft
+*Traces.mainForm.traceForm.form.right: ChainRight
+*Traces.mainForm.traceForm.form.viewport.top:    Rubber
+*Traces.mainForm.traceForm.form.viewport.bottom: Rubber
+*Traces.mainForm.traceForm.form.viewport.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.allowHoriz: True
+*Traces.mainForm.traceForm.form.viewport.forcebars:  True
+*Traces.mainForm.traceForm.form.viewport.width:      700
+*Traces.mainForm.traceForm.form.viewport.height: 150
+*Traces.mainForm.traceForm.form.viewport.vpForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.resizable: True
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.bottom: ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.bottom: ChainBottom
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+!  The text graph heights are dynamically set to accomodate the font 
+!  The graphs are dynamically initially set to fill the viewport 
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.dimBackground: grey
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour1:  green3
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour2:  purple
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour3:  black
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour4:  red
+!  Gadgets 
+*Traces.mainForm.traceForm.form.gadgets.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets.traceName.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets*top:    ChainTop
+*Traces.mainForm.traceForm.form.gadgets*bottom: ChainTop
+*Traces.mainForm.traceForm.form.gadgets*left:   ChainLeft
+*Traces.mainForm.traceForm.form.gadgets*right:  ChainLeft
+*Traces.mainForm.traceForm.form.gadgets.scaleUp.label:  Scale up
+*Traces.mainForm.traceForm.form.gadgets.scaleDown.label:  Scale down
+!  Tag Editor 
+!  ---------- 
+*Tag.mainForm.  defaultDistance: 0
+*Tag.mainForm.buttons.borderWidth: 0
+*Tag.mainForm.buttons*resize: True
+*Tag.mainForm.buttons.orientation: horizontal
+*Tag.mainForm.buttons*top:    ChainTop
+*Tag.mainForm.buttons*bottom: ChainTop
+*Tag.mainForm.buttons*left:   ChainLeft
+*Tag.mainForm.buttons*right:  ChainLeft
+*Tag.mainForm.tagText.width:  300
+*Tag.mainForm.tagText.height: 100
+*Tag.mainForm.tagText.left:   ChainLeft
+*Tag.mainForm.tagText.right:  ChainRight
+*Tag.mainForm.tagText.top:    ChainTop
+*Tag.mainForm.tagText.bottom: ChainBottom
+*Tag.mainForm.tagText.resize: True
diff --git a/tables/Xdap.orig b/tables/Xdap.orig
new file mode 100644
index 0000000..2ef4c01
--- /dev/null
+++ b/tables/Xdap.orig
@@ -0,0 +1,234 @@
+!  Default resources for staden applications 
+!  ----------------------------------------- 
+!  Dialogue shell 
+!  -------------- 
+Xdap*font: 7x13bold
+Xdap.Dialogue.allowShellResize: False
+Xdap.Dialogue.width:	 550
+Xdap.Dialogue.minWidth:  550
+Xdap.Dialogue.maxWidth:  550
+Xdap.Dialogue.height:    180
+Xdap.Dialogue.minHeight: 180
+Xdap.Dialogue.maxHeight: 180
+Xdap.Dialogue.dialogueForm*translations: #override \n\
+         <Key>Return: CRAction()
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Dialogue.dialogueForm*top:    chainTop
+Xdap.Dialogue.dialogueForm*bottom: chainTop
+Xdap.Dialogue.dialogueForm*left:   chainLeft
+Xdap.Dialogue.dialogueForm*right:  chainLeft
+!  currFile 
+Xdap.Dialogue.dialogueForm.currFile.label: Current file:
+Xdap.Dialogue.dialogueForm.currFile.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFile.resizable: True
+!  currFunc 
+Xdap.Dialogue.dialogueForm.currFunc.label: Current function:
+Xdap.Dialogue.dialogueForm.currFunc.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFunc.resizable: True
+!  quit 
+Xdap.Dialogue.dialogueForm.quit.label: Quit application
+Xdap.Dialogue.dialogueForm.quit.foreground:  red
+Xdap.Dialogue.dialogueForm.quit.borderColor: red
+!  null 
+Xdap.Dialogue.dialogueForm.null.borderWidth: 0
+Xdap.Dialogue.dialogueForm.null.label: Working ... 
+!  nChoice 
+Xdap.Dialogue.dialogueForm.nChoice.resizable: True
+!  nChoice buttons 
+Xdap.Dialogue.dialogueForm.nChoice.buttons.resizable: True
+!  nChoice OK 
+Xdap.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3
+!  getopt 
+Xdap.Dialogue.dialogueForm.getopt.width: 540
+!  getopt execlabel 
+Xdap.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:
+Xdap.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0
+!  getopt execcommand 
+Xdap.Dialogue.dialogueForm.getopt.execcommand.label: Execute
+Xdap.Dialogue.dialogueForm.getopt.execcommand.resizable: True
+!  getopt menubar 
+Xdap.Dialogue.dialogueForm.getopt.menubar.width:	530
+Xdap.Dialogue.dialogueForm.getopt.menubar.height:	25
+!  getstring 
+Xdap.Dialogue.dialogueForm.getstring.label:
+Xdap.Dialogue.dialogueForm.getstring.value:
+!  getstring label 
+Xdap.Dialogue.dialogueForm.getstring.label.resizable: True
+!  getstring value 
+Xdap.Dialogue.dialogueForm.getstring.value.width: 350
+!  getstring OK 
+Xdap.Dialogue.dialogueForm.getstring.OK.borderWidth: 3
+!  yesno 
+Xdap.Dialogue.dialogueForm.yesno.resizable: True
+!  yesno label 
+Xdap.Dialogue.dialogueForm.yesno.lab.label:
+Xdap.Dialogue.dialogueForm.yesno.lab.borderWidth: 0
+Xdap.Dialogue.dialogueForm.yesno.lab.resizable: True
+Xdap.Dialogue.dialogueForm.yesno.YES.borderWidth: 3
+!  Help shell 
+!  ---------- 
+Xdap.Help.width:     550
+Xdap.Help.minWidth:  550
+Xdap.Help.maxWidth:  550
+Xdap.Help.height:    200
+Xdap.Help.minHeight: 150
+Xdap.Help*top:    chainTop
+Xdap.Help*bottom: chainTop
+Xdap.Help*left:   chainLeft
+Xdap.Help*right:  chainLeft
+!  currDisp 
+Xdap.Help.helpForm.currDisp.resizable: True
+Xdap.Help.helpForm.currDisp.borderWidth: 0
+!  rmHelp 
+Xdap.Help.helpForm.rmHelp.label: Remove help
+!  button 
+Xdap.Help.helpForm.button.label: List of topics
+!  topicsViewport 
+Xdap.Help.helpForm.topicsViewport.allowVert: True
+Xdap.Help.helpForm.topicsViewport.bottom: chainBottom
+Xdap.Help.helpForm.topicsViewport.right:  chainRight
+Xdap.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1
+Xdap.Help.helpForm.topicsViewporttopicsList.forceColumns:   2
+!  text 
+Xdap.Help.helpForm.text.editType: edit
+Xdap.Help.helpForm.text.scrollVertical: always
+Xdap.Help.helpForm.text.bottom: chainBottom
+Xdap.Help.helpForm.text.right:  chainRight
+!  Output shell 
+!  ------------ 
+Xdap.Graphics.graph.width:  550
+Xdap.Graphics.graph.height: 400
+Xdap.Output.text.width:   550
+Xdap.Output.text.height:  100
+!  Editor shell 
+!  ------------ 
+Xdap.Editor.allowShellResize: True
+Xdap.Editor.mainForm.resizable: True
+! 
+    "Xdap.Editor*font:  5x8",
+
+Xdap.Editor*font:  *Fixed-*-20-200-*
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Editor.mainForm*top:    chainTop
+Xdap.Editor.mainForm*bottom: chainTop
+Xdap.Editor.mainForm*left:   chainLeft
+Xdap.Editor.mainForm*right:  chainLeft
+Xdap.Editor.mainForm.title.borderWidth: 0
+Xdap.Editor.mainForm.title.label: Contig Editor (V1.1)
+Xdap.Editor.mainForm.buttons.borderWidth: 0
+Xdap.Editor.mainForm.buttons.orientation: horizontal
+Xdap.Editor.mainForm.buttons.resizable:  True
+Xdap.Editor.mainForm.buttons*vertDistance: 0
+Xdap.Editor.mainForm.buttons.insert.label: Insert
+Xdap.Editor.mainForm.buttons.replace.label: Replace
+Xdap.Editor.mainForm.buttons.replace.state: True
+Xdap.Editor.mainForm.buttons.superman.label: Super Edit
+Xdap.Editor.mainForm.buttons.superman.state: False
+Xdap.Editor.mainForm.buttons.reveal.label: Reveal Cutoffs
+Xdap.Editor.mainForm.buttons.reveal.state: False
+Xdap.Editor.mainForm.buttons.undo.label: Undo
+Xdap.Editor.mainForm.buttons.save.label: Save
+Xdap.Editor.mainForm.buttons.exit.label: Leave Editor
+Xdap.Editor.mainForm.buttons.exit.foreground:  red
+Xdap.Editor.mainForm.buttons.exit.borderColor: red
+Xdap.Editor.mainForm.buttons.lock.label: Lock
+Xdap.Editor.mainForm.buttons.join.label: Join
+Xdap.Editor.mainForm.buttons.next.label: Next Problem
+Xdap.Editor.mainForm.edForm.resizable: True
+Xdap.Editor.mainForm.edForm.defaultDistance: 0
+Xdap.Editor.mainForm.edForm.borderWidth: 0
+Xdap.Editor.mainForm.edForm.sequences.resizable: True
+Xdap.Editor.mainForm.edForm.names.resizable: True
+Xdap.Editor.mainForm.edForm.slider.length:  500
+Xdap.Editor.mainForm.edForm.slider.orientation:  horizontal
+Xdap.Editor.mainForm.edForm.scrollButtons.defaultDistance:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.borderWidth:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.left:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.right:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resizable:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resize:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.left:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.*.right:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.leftleft.label:  <<
+Xdap.Editor.mainForm.edForm.scrollButtons.left.label:  <
+Xdap.Editor.mainForm.edForm.scrollButtons.right.label:  >
+Xdap.Editor.mainForm.edForm.scrollButtons.rightright.label:  >>
+Xdap.Editor*warnPrompt.label: Save changes? 
+Xdap.Editor*warnBox.borderWidth: 0
+Xdap.Editor*warnBox.orientation: vertical
+Xdap.Editor*warnPrompt.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.resizable: True
+Xdap.Editor.mainForm.disagreeForm.dummy.resizable: True
+Xdap.Editor.mainForm.disagreeForm.disagreements.resizable: True
+Xdap.Editor.mainForm.disagreeForm.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.defaultDistance: 0
+!  Trace Manager 
+!  ------------- 
+*Traces.mainForm.buttons.borderWidth: 0
+*Traces.mainForm.buttons.orientation: horizontal
+*Traces.mainForm.buttons*top:    ChainTop
+*Traces.mainForm.buttons*bottom: ChainTop
+*Traces.mainForm.buttons*left:   ChainLeft
+*Traces.mainForm.buttons*right:  ChainLeft
+*Traces.allowShellResize: True
+*Traces.mainForm.traceForm.resizable: True
+*Traces.mainForm.traceForm.form.resizable: True
+*Traces.mainForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.defaultDistance: 1
+*Traces.mainForm.traceForm.borderWidth: 0
+*Traces.mainForm.traceForm.form.borderWidth: 0
+!  The main display: a viewport containing a form containing graphs 
+*Traces.mainForm.traceForm.form.top:    Rubber
+*Traces.mainForm.traceForm.form.bottom: Rubber
+*Traces.mainForm.traceForm.form.left: ChainLeft
+*Traces.mainForm.traceForm.form.right: ChainRight
+*Traces.mainForm.traceForm.form.viewport.top:    Rubber
+*Traces.mainForm.traceForm.form.viewport.bottom: Rubber
+*Traces.mainForm.traceForm.form.viewport.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.allowHoriz: True
+*Traces.mainForm.traceForm.form.viewport.forcebars:  True
+*Traces.mainForm.traceForm.form.viewport.width:      700
+*Traces.mainForm.traceForm.form.viewport.height: 150
+*Traces.mainForm.traceForm.form.viewport.vpForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.resizable: True
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.bottom: ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.bottom: ChainBottom
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+!  The text graph heights are dynamically set to accomodate the font 
+!  The graphs are dynamically initially set to fill the viewport 
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.dimBackground: grey
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour1:  green3
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour2:  purple
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour3:  black
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour4:  red
+!  Gadgets 
+*Traces.mainForm.traceForm.form.gadgets.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets.traceName.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets*top:    ChainTop
+*Traces.mainForm.traceForm.form.gadgets*bottom: ChainTop
+*Traces.mainForm.traceForm.form.gadgets*left:   ChainLeft
+*Traces.mainForm.traceForm.form.gadgets*right:  ChainLeft
+*Traces.mainForm.traceForm.form.gadgets.scaleUp.label:  Scale up
+*Traces.mainForm.traceForm.form.gadgets.scaleDown.label:  Scale down
+!  Tag Editor 
+!  ---------- 
+*Tag.mainForm.  defaultDistance: 0
+*Tag.mainForm.buttons.borderWidth: 0
+*Tag.mainForm.buttons*resize: True
+*Tag.mainForm.buttons.orientation: horizontal
+*Tag.mainForm.buttons*top:    ChainTop
+*Tag.mainForm.buttons*bottom: ChainTop
+*Tag.mainForm.buttons*left:   ChainLeft
+*Tag.mainForm.buttons*right:  ChainLeft
+*Tag.mainForm.tagText.width:  300
+*Tag.mainForm.tagText.height: 100
+*Tag.mainForm.tagText.left:   ChainLeft
+*Tag.mainForm.tagText.right:  ChainRight
+*Tag.mainForm.tagText.top:    ChainTop
+*Tag.mainForm.tagText.bottom: ChainBottom
+*Tag.mainForm.tagText.resize: True
diff --git a/tables/Xdap~ b/tables/Xdap~
new file mode 100644
index 0000000..5ea3b2f
--- /dev/null
+++ b/tables/Xdap~
@@ -0,0 +1,231 @@
+!  Default resources for staden applications 
+!  ----------------------------------------- 
+!  Dialogue shell 
+!  -------------- 
+Xdap*font: 7x13bold
+Xdap.Dialogue.allowShellResize: False
+Xdap.Dialogue.width:	 563
+Xdap.Dialogue.height:    190
+Xdap.Dialogue.dialogueForm*translations: #override \n\
+         <Key>Return: CRAction()
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Dialogue.dialogueForm*top:    chainTop
+Xdap.Dialogue.dialogueForm*bottom: chainTop
+Xdap.Dialogue.dialogueForm*left:   chainLeft
+Xdap.Dialogue.dialogueForm*right:  chainLeft
+!  currFile 
+Xdap.Dialogue.dialogueForm.currFile.label: Current file:
+Xdap.Dialogue.dialogueForm.currFile.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFile.resizable: True
+!  currFunc 
+Xdap.Dialogue.dialogueForm.currFunc.label: Current function:
+Xdap.Dialogue.dialogueForm.currFunc.borderWidth: 0
+Xdap.Dialogue.dialogueForm.currFunc.resizable: True
+!  quit 
+Xdap.Dialogue.dialogueForm.quit.label: Quit application
+Xdap.Dialogue.dialogueForm.quit.foreground:  red
+Xdap.Dialogue.dialogueForm.quit.borderColor: red
+!  null 
+Xdap.Dialogue.dialogueForm.null.borderWidth: 0
+Xdap.Dialogue.dialogueForm.null.label: Working ... 
+!  nChoice 
+Xdap.Dialogue.dialogueForm.nChoice.resizable: True
+!  nChoice buttons 
+Xdap.Dialogue.dialogueForm.nChoice.buttons.resizable: True
+Xdap.Dialogue.dialogueForm.nChoice.buttons.orientation: horizontal
+!  nChoice OK 
+Xdap.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3
+!  getopt 
+Xdap.Dialogue.dialogueForm.getopt.width: 540
+!  getopt execlabel 
+Xdap.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:
+Xdap.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0
+!  getopt execcommand 
+Xdap.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue
+Xdap.Dialogue.dialogueForm.getopt.execcommand.resizable: True
+!  getopt menubar 
+Xdap.Dialogue.dialogueForm.getopt.menubar.width:	530
+Xdap.Dialogue.dialogueForm.getopt.menubar.height:	25
+!  getstring 
+Xdap.Dialogue.dialogueForm.getstring.resizable: True
+Xdap.Dialogue.dialogueForm.getstring.label:
+Xdap.Dialogue.dialogueForm.getstring.value:
+!  getstring label 
+Xdap.Dialogue.dialogueForm.getstring.label.resizable: True
+!  getstring value 
+Xdap.Dialogue.dialogueForm.getstring.value.width: 350
+!  getstring OK 
+Xdap.Dialogue.dialogueForm.getstring.OK.borderWidth: 3
+!  yesno 
+Xdap.Dialogue.dialogueForm.yesno.resizable: True
+!  yesno label 
+Xdap.Dialogue.dialogueForm.yesno.lab.label:
+Xdap.Dialogue.dialogueForm.yesno.lab.borderWidth: 0
+Xdap.Dialogue.dialogueForm.yesno.lab.resizable: True
+Xdap.Dialogue.dialogueForm.yesno.YES.borderWidth: 3
+!  Help shell 
+!  ---------- 
+Xdap.Help.width:     563
+Xdap.Help.minWidth:  550
+Xdap.Help.height:    400
+Xdap.Help.minHeight: 150
+Xdap.Help*top:    chainTop
+Xdap.Help*bottom: chainTop
+Xdap.Help*left:   chainLeft
+Xdap.Help*right:  chainLeft
+!  currDisp 
+Xdap.Help.helpForm.currDisp.resizable: True
+Xdap.Help.helpForm.currDisp.borderWidth: 0
+!  rmHelp 
+Xdap.Help.helpForm.rmHelp.label: Remove help
+!  button 
+Xdap.Help.helpForm.button.label: List of topics
+!  topicsViewport 
+Xdap.Help.helpForm.topicsViewport.allowVert: True
+Xdap.Help.helpForm.topicsViewport.bottom: chainBottom
+Xdap.Help.helpForm.topicsViewport.right:  chainRight
+Xdap.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1
+! Xdap.Help.helpForm.topicsViewporttopicsList.forceColumns:   2
+!  text 
+Xdap.Help.helpForm.text.editType: edit
+Xdap.Help.helpForm.text.scrollVertical: always
+Xdap.Help.helpForm.text.bottom: chainBottom
+Xdap.Help.helpForm.text.right:  chainRight
+!  Output shell 
+!  ------------ 
+Xdap.Graphics.graph.width:  563
+Xdap.Graphics.graph.height: 400
+Xdap.Output.text.width:   563
+Xdap.Output.text.height:  100
+!  Editor shell 
+!  ------------ 
+Xdap.Editor.allowShellResize: True
+Xdap.Editor.mainForm.resizable: True
+! 
+    "Xdap.Editor*font:  5x8",
+
+Xdap.Editor*font:  *Fixed-*-20-200-*
+!  The default chaining for all children is Top,Top,Left,Left 
+Xdap.Editor.mainForm*top:    chainTop
+Xdap.Editor.mainForm*bottom: chainTop
+Xdap.Editor.mainForm*left:   chainLeft
+Xdap.Editor.mainForm*right:  chainLeft
+Xdap.Editor.mainForm.title.borderWidth: 0
+Xdap.Editor.mainForm.title.label: Contig Editor (V1.1)
+Xdap.Editor.mainForm.buttons.borderWidth: 0
+Xdap.Editor.mainForm.buttons.orientation: horizontal
+Xdap.Editor.mainForm.buttons.resizable:  True
+Xdap.Editor.mainForm.buttons*vertDistance: 0
+Xdap.Editor.mainForm.buttons.insert.label: Insert
+Xdap.Editor.mainForm.buttons.replace.label: Replace
+Xdap.Editor.mainForm.buttons.replace.state: True
+Xdap.Editor.mainForm.buttons.superman.label: Super Edit
+Xdap.Editor.mainForm.buttons.superman.state: False
+Xdap.Editor.mainForm.buttons.reveal.label: Reveal Cutoffs
+Xdap.Editor.mainForm.buttons.reveal.state: False
+Xdap.Editor.mainForm.buttons.undo.label: Undo
+Xdap.Editor.mainForm.buttons.save.label: Save
+Xdap.Editor.mainForm.buttons.exit.label: Leave Editor
+Xdap.Editor.mainForm.buttons.exit.foreground:  red
+Xdap.Editor.mainForm.buttons.exit.borderColor: red
+Xdap.Editor.mainForm.buttons.lock.label: Lock
+Xdap.Editor.mainForm.buttons.join.label: Join
+Xdap.Editor.mainForm.buttons.next.label: Next Problem
+Xdap.Editor.mainForm.edForm.resizable: True
+Xdap.Editor.mainForm.edForm.defaultDistance: 0
+Xdap.Editor.mainForm.edForm.borderWidth: 0
+Xdap.Editor.mainForm.edForm.sequences.resizable: True
+Xdap.Editor.mainForm.edForm.names.resizable: True
+Xdap.Editor.mainForm.edForm.slider.length:  500
+Xdap.Editor.mainForm.edForm.slider.orientation:  horizontal
+Xdap.Editor.mainForm.edForm.scrollButtons.defaultDistance:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.borderWidth:  0
+Xdap.Editor.mainForm.edForm.scrollButtons.left:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.right:  ChainLeft
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resizable:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.resize:  True
+Xdap.Editor.mainForm.edForm.scrollButtons.*.left:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.*.right:  Rubber
+Xdap.Editor.mainForm.edForm.scrollButtons.leftleft.label:  <<
+Xdap.Editor.mainForm.edForm.scrollButtons.left.label:  <
+Xdap.Editor.mainForm.edForm.scrollButtons.right.label:  >
+Xdap.Editor.mainForm.edForm.scrollButtons.rightright.label:  >>
+Xdap.Editor*warnPrompt.label: Save changes? 
+Xdap.Editor*warnBox.borderWidth: 0
+Xdap.Editor*warnBox.orientation: vertical
+Xdap.Editor*warnPrompt.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.resizable: True
+Xdap.Editor.mainForm.disagreeForm.dummy.resizable: True
+Xdap.Editor.mainForm.disagreeForm.disagreements.resizable: True
+Xdap.Editor.mainForm.disagreeForm.borderWidth: 0
+Xdap.Editor.mainForm.disagreeForm.defaultDistance: 0
+!  Trace Manager 
+!  ------------- 
+*Traces.mainForm.buttons.borderWidth: 0
+*Traces.mainForm.buttons.orientation: horizontal
+*Traces.mainForm.buttons*top:    ChainTop
+*Traces.mainForm.buttons*bottom: ChainTop
+*Traces.mainForm.buttons*left:   ChainLeft
+*Traces.mainForm.buttons*right:  ChainLeft
+*Traces.allowShellResize: True
+*Traces.mainForm.traceForm.resizable: True
+*Traces.mainForm.traceForm.form.resizable: True
+*Traces.mainForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.defaultDistance: 1
+*Traces.mainForm.traceForm.borderWidth: 0
+*Traces.mainForm.traceForm.form.borderWidth: 0
+!  The main display: a viewport containing a form containing graphs 
+*Traces.mainForm.traceForm.form.top:    Rubber
+*Traces.mainForm.traceForm.form.bottom: Rubber
+*Traces.mainForm.traceForm.form.left: ChainLeft
+*Traces.mainForm.traceForm.form.right: ChainRight
+*Traces.mainForm.traceForm.form.viewport.top:    Rubber
+*Traces.mainForm.traceForm.form.viewport.bottom: Rubber
+*Traces.mainForm.traceForm.form.viewport.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.allowHoriz: True
+*Traces.mainForm.traceForm.form.viewport.forcebars:  True
+*Traces.mainForm.traceForm.form.viewport.width:      700
+*Traces.mainForm.traceForm.form.viewport.height: 150
+*Traces.mainForm.traceForm.form.viewport.vpForm.defaultDistance: 0
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.resizable: True
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.bottom: ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.left:   ChainLeft
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.right:  ChainRight
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.top:    ChainTop
+*Traces.mainForm.traceForm.form.viewport.vpForm.trace.bottom: ChainBottom
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+!  The text graph heights are dynamically set to accomodate the font 
+!  The graphs are dynamically initially set to fill the viewport 
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.dimBackground: grey
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour1:  green3
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour2:  purple
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour3:  black
+*Traces.mainForm.traceForm.form.viewport.vpForm.Graph.graphColour4:  red
+!  Gadgets 
+*Traces.mainForm.traceForm.form.gadgets.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets.traceName.borderWidth:    0
+*Traces.mainForm.traceForm.form.gadgets*top:    ChainTop
+*Traces.mainForm.traceForm.form.gadgets*bottom: ChainTop
+*Traces.mainForm.traceForm.form.gadgets*left:   ChainLeft
+*Traces.mainForm.traceForm.form.gadgets*right:  ChainLeft
+*Traces.mainForm.traceForm.form.gadgets.scaleUp.label:  Scale up
+*Traces.mainForm.traceForm.form.gadgets.scaleDown.label:  Scale down
+!  Tag Editor 
+!  ---------- 
+*Tag.mainForm.  defaultDistance: 0
+*Tag.mainForm.buttons.borderWidth: 0
+*Tag.mainForm.buttons*resize: True
+*Tag.mainForm.buttons.orientation: horizontal
+*Tag.mainForm.buttons*top:    ChainTop
+*Tag.mainForm.buttons*bottom: ChainTop
+*Tag.mainForm.buttons*left:   ChainLeft
+*Tag.mainForm.buttons*right:  ChainLeft
+*Tag.mainForm.tagText.width:  300
+*Tag.mainForm.tagText.height: 100
+*Tag.mainForm.tagText.left:   ChainLeft
+*Tag.mainForm.tagText.right:  ChainRight
+*Tag.mainForm.tagText.top:    ChainTop
+*Tag.mainForm.tagText.bottom: ChainBottom
+*Tag.mainForm.tagText.resize: True
diff --git a/tables/Xstaden b/tables/Xstaden
new file mode 100644
index 0000000..68e4d1b
--- /dev/null
+++ b/tables/Xstaden
@@ -0,0 +1,111 @@
+! Default resources for staden applications
+! -----------------------------------------
+
+
+! Dialogue shell
+! --------------
+Xstaden*font: 7x13bold
+Xstaden.Dialgue.allowShellResize: False
+Xstaden.Dialogue.width:	    563
+Xstaden.Dialogue.height:    190
+Xstaden.Dialogue.dialogueForm*translations: #override \n\
+	<Key>Return: CRAction()
+! The default chaining for all children is Top,Top,Left,Left
+Xstaden.Dialogue.dialogueForm*top:    chainTop
+Xstaden.Dialogue.dialogueForm*bottom: chainTop
+Xstaden.Dialogue.dialogueForm*left:   chainLeft
+Xstaden.Dialogue.dialogueForm*right:  chainLeft
+! currFile
+Xstaden.Dialogue.dialogueForm.currFile.label: Current file:
+Xstaden.Dialogue.dialogueForm.currFile.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.currFile.resizable: True
+! currFunc
+Xstaden.Dialogue.dialogueForm.currFunc.label: Current function:
+Xstaden.Dialogue.dialogueForm.currFunc.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.currFunc.resizable: True
+! quit
+Xstaden.Dialogue.dialogueForm.quit.label: Quit application
+Xstaden.Dialogue.dialogueForm.quit.foreground:  red
+Xstaden.Dialogue.dialogueForm.quit.borderColor: red
+
+! null
+Xstaden.Dialogue.dialogueForm.null.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.null.label: Working ... 
+
+! nChoice
+Xstaden.Dialogue.dialogueForm.nChoice.resizable: True
+! nChoice buttons
+Xstaden.Dialogue.dialogueForm.nChoice.buttons.resizable: True
+!Xstaden.Dialogue.dialogueForm.nChoice.buttons.orientation: horizontal
+! nChoice OK
+Xstaden.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3
+
+! getopt
+Xstaden.Dialogue.dialogueForm.getopt.width: 540
+! getopt execlabel
+Xstaden.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:
+Xstaden.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0
+! getopt execcommand
+Xstaden.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue
+Xstaden.Dialogue.dialogueForm.getopt.execcommand.resizable: True
+! getopt menubar
+Xstaden.Dialogue.dialogueForm.getopt.menubar.width:	530
+Xstaden.Dialogue.dialogueForm.getopt.menubar.height:	25
+
+! getstring
+Xstaden.Dialogue.dialogueForm.getstring.resizable: True
+Xstaden.Dialogue.dialogueForm.getstring.label:
+Xstaden.Dialogue.dialogueForm.getstring.value:
+! getstring label
+Xstaden.Dialogue.dialogueForm.getstring.label.resizable: True
+! getstring value
+Xstaden.Dialogue.dialogueForm.getstring.value.width: 350
+! getstring OK
+Xstaden.Dialogue.dialogueForm.getstring.OK.borderWidth: 3
+
+! yesno
+Xstaden.Dialogue.dialogueForm.yesno.resizable: True
+! yesno label
+Xstaden.Dialogue.dialogueForm.yesno.lab.label:
+Xstaden.Dialogue.dialogueForm.yesno.lab.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.yesno.lab.resizable: True
+Xstaden.Dialogue.dialogueForm.yesno.YES.borderWidth: 3
+
+
+! Help shell
+! ----------
+Xstaden.Help.width:     563
+Xstaden.Help.minWidth:  550
+Xstaden.Help.height:    400
+Xstaden.Help.minHeight: 150
+Xstaden.Help*top:    chainTop
+Xstaden.Help*bottom: chainTop
+Xstaden.Help*left:   chainLeft
+Xstaden.Help*right:  chainLeft
+! currDisp
+Xstaden.Help.helpForm.currDisp.resizable: True
+Xstaden.Help.helpForm.currDisp.borderWidth: 0
+! rmHelp
+Xstaden.Help.helpForm.rmHelp.label: Remove help
+! button
+Xstaden.Help.helpForm.button.label: List of topics
+! topicsViewport
+Xstaden.Help.helpForm.topicsViewport.allowVert: True
+Xstaden.Help.helpForm.topicsViewport.bottom: chainBottom
+Xstaden.Help.helpForm.topicsViewport.right:  chainRight
+Xstaden.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1
+!Xstaden.Help.helpForm.topicsViewporttopicsList.forceColumns:   2
+! text
+Xstaden.Help.helpForm.text.editType: edit
+Xstaden.Help.helpForm.text.scrollVertical: always
+Xstaden.Help.helpForm.text.bottom: chainBottom
+Xstaden.Help.helpForm.text.right:  chainRight
+
+
+! Output shell
+! ------------
+Xstaden.Graphics.graph.width:  563
+Xstaden.Graphics.graph.height: 400
+Xstaden.Output.text.width:     563
+Xstaden.Output.text.height:    100
+
diff --git a/tables/Xstaden~ b/tables/Xstaden~
new file mode 100644
index 0000000..08243b9
--- /dev/null
+++ b/tables/Xstaden~
@@ -0,0 +1,111 @@
+! Default resources for staden applications
+! -----------------------------------------
+
+
+! Dialogue shell
+! --------------
+Xstaden*font: 7x13bold
+Xstaden.Dialgue.allowShellResize: False
+Xstaden.Dialogue.width:	    563
+Xstaden.Dialogue.height:    190
+Xstaden.Dialogue.dialogueForm*translations: #override \n\
+	<Key>Return: CRAction()
+! The default chaining for all children is Top,Top,Left,Left
+Xstaden.Dialogue.dialogueForm*top:    chainTop
+Xstaden.Dialogue.dialogueForm*bottom: chainTop
+Xstaden.Dialogue.dialogueForm*left:   chainLeft
+Xstaden.Dialogue.dialogueForm*right:  chainLeft
+! currFile
+Xstaden.Dialogue.dialogueForm.currFile.label: Current file:
+Xstaden.Dialogue.dialogueForm.currFile.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.currFile.resizable: True
+! currFunc
+Xstaden.Dialogue.dialogueForm.currFunc.label: Current function:
+Xstaden.Dialogue.dialogueForm.currFunc.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.currFunc.resizable: True
+! quit
+Xstaden.Dialogue.dialogueForm.quit.label: Quit application
+Xstaden.Dialogue.dialogueForm.quit.foreground:  red
+Xstaden.Dialogue.dialogueForm.quit.borderColor: red
+
+! null
+Xstaden.Dialogue.dialogueForm.null.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.null.label: Working ... 
+
+! nChoice
+Xstaden.Dialogue.dialogueForm.nChoice.resizable: True
+! nChoice buttons
+Xstaden.Dialogue.dialogueForm.nChoice.buttons.resizable: True
+Xstaden.Dialogue.dialogueForm.nChoice.buttons.orientation: horizontal
+! nChoice OK
+Xstaden.Dialogue.dialogueForm.nChoice.OK.borderWidth: 3
+
+! getopt
+Xstaden.Dialogue.dialogueForm.getopt.width: 540
+! getopt execlabel
+Xstaden.Dialogue.dialogueForm.getopt.execlabel.label: Select a function to:
+Xstaden.Dialogue.dialogueForm.getopt.execlabel.borderWidth: 0
+! getopt execcommand
+Xstaden.Dialogue.dialogueForm.getopt.execcommand.label: Execute with dialogue
+Xstaden.Dialogue.dialogueForm.getopt.execcommand.resizable: True
+! getopt menubar
+Xstaden.Dialogue.dialogueForm.getopt.menubar.width:	530
+Xstaden.Dialogue.dialogueForm.getopt.menubar.height:	25
+
+! getstring
+Xstaden.Dialogue.dialogueForm.getstring.resizable: True
+Xstaden.Dialogue.dialogueForm.getstring.label:
+Xstaden.Dialogue.dialogueForm.getstring.value:
+! getstring label
+Xstaden.Dialogue.dialogueForm.getstring.label.resizable: True
+! getstring value
+Xstaden.Dialogue.dialogueForm.getstring.value.width: 350
+! getstring OK
+Xstaden.Dialogue.dialogueForm.getstring.OK.borderWidth: 3
+
+! yesno
+Xstaden.Dialogue.dialogueForm.yesno.resizable: True
+! yesno label
+Xstaden.Dialogue.dialogueForm.yesno.lab.label:
+Xstaden.Dialogue.dialogueForm.yesno.lab.borderWidth: 0
+Xstaden.Dialogue.dialogueForm.yesno.lab.resizable: True
+Xstaden.Dialogue.dialogueForm.yesno.YES.borderWidth: 3
+
+
+! Help shell
+! ----------
+Xstaden.Help.width:     563
+Xstaden.Help.minWidth:  550
+Xstaden.Help.height:    400
+Xstaden.Help.minHeight: 150
+Xstaden.Help*top:    chainTop
+Xstaden.Help*bottom: chainTop
+Xstaden.Help*left:   chainLeft
+Xstaden.Help*right:  chainLeft
+! currDisp
+Xstaden.Help.helpForm.currDisp.resizable: True
+Xstaden.Help.helpForm.currDisp.borderWidth: 0
+! rmHelp
+Xstaden.Help.helpForm.rmHelp.label: Remove help
+! button
+Xstaden.Help.helpForm.button.label: List of topics
+! topicsViewport
+Xstaden.Help.helpForm.topicsViewport.allowVert: True
+Xstaden.Help.helpForm.topicsViewport.bottom: chainBottom
+Xstaden.Help.helpForm.topicsViewport.right:  chainRight
+Xstaden.Help.helpForm.topicsViewport.topicsList.defaultColumns: 1
+!Xstaden.Help.helpForm.topicsViewporttopicsList.forceColumns:   2
+! text
+Xstaden.Help.helpForm.text.editType: edit
+Xstaden.Help.helpForm.text.scrollVertical: always
+Xstaden.Help.helpForm.text.bottom: chainBottom
+Xstaden.Help.helpForm.text.right:  chainRight
+
+
+! Output shell
+! ------------
+Xstaden.Graphics.graph.width:  563
+Xstaden.Graphics.graph.height: 400
+Xstaden.Output.text.width:     563
+Xstaden.Output.text.height:    100
+
diff --git a/tables/Xted b/tables/Xted
new file mode 100644
index 0000000..7d916cc
--- /dev/null
+++ b/tables/Xted
@@ -0,0 +1,182 @@
+!
+! Resources for the "ted" trace editor (Dec 1990)
+!
+
+! A good default font
+Xted*font: 7x13bold
+
+! default magnification
+Xted*magnif: 30
+! default enzyme string to search for as a left cutoff
+Xted*enz: CAGGTC
+
+! The application
+Xted.minWidth:  550
+Xted.minHeight: 250
+
+! Labels have no borders and are not resized
+Xted*Label.borderWidth: 0
+Xted*Label.top:         chainTop
+Xted*Label.bottom:      chainTop
+Xted*Label.left:        chainLeft
+Xted*Label.right:       chainLeft
+
+! Buttons are not resized
+Xted*Command.top:         chainTop
+Xted*Command.bottom:      chainTop
+Xted*Command.left:        chainLeft
+Xted*Command.right:       chainLeft
+
+! All the toggles are one-of-many
+Xted*Toggle.Translations: #override \n\
+         <EnterWindow>:        highlight(Always) \n\
+         <LeaveWindow>:        unhighlight()     \n\
+         <Btn1Down>,<Btn1Up>: set() notify()
+
+Xted.mainForm.title.label: Trace editor
+Xted.mainForm.version.label: \
+Dec 1990
+Xted.mainForm.seqNamePrompt.label: Input:
+Xted.mainForm.seqNamePrompt.height: 13
+Xted.mainForm.seqName.label:       NONE
+Xted.mainForm.seqName.resizable:   True
+Xted.mainForm.seqName.height: 13
+Xted.mainForm.search.label: Search
+Xted.mainForm.search.height: 13
+Xted.mainForm.NorigBasesPrompt.label: Bases:
+Xted.mainForm.NorigBases.label:
+Xted.mainForm.NorigBases.resizable:   True
+Xted.mainForm.strand.label: Strand: Top
+Xted.mainForm.strand.resizable: True
+
+! Display forms have no border
+Xted.mainForm.Form.borderWidth:     0
+
+ ! The edit mode toggles
+Xted.mainForm.Toggle.top:    chainTop
+Xted.mainForm.Toggle.bottom: chainTop
+Xted.mainForm.Toggle.left:   chainLeft
+Xted.mainForm.Toggle.right:  chainLeft
+Xted.mainForm.edmodelab.label: Mode:
+Xted.mainForm.adjustL.label:   Adj left cut
+Xted.mainForm.edSeq.label:     Edit seq
+Xted.mainForm.adjustR.label:   Adj right cut
+Xted.mainForm.edSeq.state:     True
+Xted.mainForm.adjustL.height:   13
+Xted.mainForm.edSeq.height:     13
+Xted.mainForm.adjustR.height:   13
+
+
+! The magnification slider
+Xted.mainForm.maglab.top:    chainTop
+Xted.mainForm.maglab.bottom: chainTop
+Xted.mainForm.maglab.left:   chainLeft
+Xted.mainForm.maglab.right:  chainLeft
+Xted.mainForm.maglab.label: Mag:
+Xted.mainForm.maglab.height: 13
+Xted.mainForm.magscr.top:    chainTop
+Xted.mainForm.magscr.bottom: chainTop
+Xted.mainForm.magscr.left:   chainLeft
+Xted.mainForm.magscr.right:  chainLeft
+Xted.mainForm.magscr.orientation: horizontal
+Xted.mainForm.magscr.length: 100
+Xted.mainForm.magscr.height: 13
+
+Xted.mainForm.scaleDown.label: Scale down
+Xted.mainForm.scaleDown.height: 13
+Xted.mainForm.scaleUp.label: Scale up
+Xted.mainForm.scaleUp.height: 13
+Xted.mainForm.switch.label: Switch
+Xted.mainForm.switch.height: 13
+
+
+! The main display: a viewport containing a form containing graphs
+Xted.mainForm.viewport.top:    chainTop
+Xted.mainForm.viewport.bottom: chainBottom
+Xted.mainForm.viewport.left:   chainLeft
+Xted.mainForm.viewport.right:  chainRight
+Xted.mainForm.viewport.allowHoriz: True
+Xted.mainForm.viewport.forcebars:  True
+Xted.mainForm.viewport.width:      700
+Xted.mainForm.viewport.vpForm.Graph.resizable: True
+Xted.mainForm.viewport.vpForm.Graph.top:    chainTop
+Xted.mainForm.viewport.vpForm.Graph.bottom: chainTop
+Xted.mainForm.viewport.vpForm.Graph.left:   chainLeft
+Xted.mainForm.viewport.vpForm.Graph.right:  chainRight
+Xted.mainForm.viewport.vpForm.trace.top:    chainTop
+Xted.mainForm.viewport.vpForm.trace.bottom: chainBottom
+Xted.mainForm.viewport.vpForm.Graph.font: -misc-*-bold-*15-*
+! The text graph heights are dynamically set to accomodate the font
+Xted.mainForm.viewport.vpForm.trace.height: 150
+! The graphs are dynamically initially set to fill the viewport
+Xted.mainForm.viewport.vpForm.Graph.dimBackground: grey
+Xted.mainForm.viewport.vpForm.Graph.graphColour1:  green
+Xted.mainForm.viewport.vpForm.Graph.graphColour2:  purple
+Xted.mainForm.viewport.vpForm.Graph.graphColour3:  black
+Xted.mainForm.viewport.vpForm.Graph.graphColour4:  red
+
+! Dummy widget - we only care about its width
+Xted.mainForm.dummy.top:    chainTop
+Xted.mainForm.dummy.bottom: chainBottom
+Xted.mainForm.dummy.left:   chainLeft
+Xted.mainForm.dummy.right:  chainRight
+Xted.mainForm.dummy.width:  700
+
+
+! Dialogues
+Xted*search.ioForm.ioPrompt.label:  Search string or base number
+Xted*input.ioForm.ioPrompt.label:   Input a file
+Xted*output.ioForm.ioPrompt.label:  Save a file
+Xted*check.ioForm.ioPrompt.label:   \
+The existing sequence has been altered.\n\
+Do you really want to overwrite it?
+Xted*quit.ioForm.ioPrompt.label:    \
+The existing sequence has been altered.\n\
+Do you really want to quit?
+Xted*check.ioForm..translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*quit.ioForm..translations:    #override \n\
+         <Key>Return: CRAction()
+
+Xted*ioForm.namePrompt.label:       File name:
+Xted*ioForm.nameVal.width:          250
+Xted*ioForm.nameVal.translations:   #override \n\
+         <Key>Return: CRAction()
+Xted*ioForm.next.label: 
+Xted*ioForm.next.resizable:         True
+Xted*ioForm.formatPrompt.label:     File format:
+Xted*ioForm.formatBox.borderWidth:  0
+Xted*ioForm.formatBox.orientation:  horizontal
+Xted*ioForm.formatBox.plainFmt.label: Plain
+Xted*ioForm.formatBox.abiFmt.label:   ABI
+Xted*ioForm.formatBox.abiFmt.state:   True
+Xted*ioForm.formatBox.alfFmt.label:   ALF
+Xted*ioForm.formatBox.alfFmt.state:   False
+Xted*ioForm.formatBox.scfFmt.label:   SCF
+Xted*ioForm.formatBox.scfFmt.state:   False
+Xted*ioForm.formatBox.bottomFmt.label: Bottom
+Xted*ioForm.formatBox.topFmt.label:   Top
+Xted*ioForm.formatBox.topFmt.state:   True
+
+! The `default' button for the dialogue has a wider border
+Xted*input.ioForm.OK.borderWidth:     3
+Xted*output.ioForm.OK.borderWidth:    3
+Xted*check.ioForm.Cancel.borderWidth: 3
+Xted*quit.ioForm.Cancel.borderWidth:  3
+
+! Help shell widget
+Xted.Help.width:  525
+Xted.Help.height: 200
+Xted.Help.minWidth: 525
+Xted.Help.maxWidth: 525
+Xted.Help.minHeight: 150
+Xted.Help.title: ted help
+Xted.Help.form.rmButton.label:  Remove help
+Xted.Help.form.rmButton.top:    chainTop
+Xted.Help.form.rmButton.bottom: chainTop
+Xted.Help.form.rmButton.left:   chainLeft
+Xted.Help.form.rmButton.right:  chainLeft
+Xted.Help.form.text.top:    chainTop
+Xted.Help.form.text.bottom: chainBottom
+Xted.Help.form.text.left:   chainLeft
+Xted.Help.form.text.right:  chainRight
diff --git a/tables/alus/ALUA b/tables/alus/ALUA
new file mode 100644
index 0000000..5e036d1
--- /dev/null
+++ b/tables/alus/ALUA
@@ -0,0 +1,5 @@
+GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACCTGAGG
+TCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCG
+GGCGTGGTGGCGCGCGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGG
+AGGCGGAGGTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCGTC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMA1ATP b/tables/alus/HUMA1ATP
new file mode 100644
index 0000000..4d5d69c
--- /dev/null
+++ b/tables/alus/HUMA1ATP
@@ -0,0 +1,5 @@
+GGCTGGGTGCAGGGGCTCTCGCCTGTAACCCCAGCACTTTGGGAGGCCAAGGCAGGCGGATCAAGAGGTC
+AGGAGTTCAAGACCTGCCTGGCCAACACGGTGAAACCTCAGCTCTACTAAAAATACAAAAATTAGCCAGG
+CGTGGTGGCACACGCCTGTAATCCCAGCTATTTGGGAAGCTGAGACAGAAGAATTTCTTGAACCCGGGAG
+GTGGAGGTTTCAGTGAGCCGAGATCACGCCACTGCACTCCACCCTGGCGGATAAAGCGAGACTCTGTCTC
+AAAAAAAA1
diff --git a/tables/alus/HUMACHRA7 b/tables/alus/HUMACHRA7
new file mode 100644
index 0000000..5fa1b6f
--- /dev/null
+++ b/tables/alus/HUMACHRA7
@@ -0,0 +1,5 @@
+GGGCAGTGGCTCACAGCTGTAATCCCAGCACTTCGGGAGGCCAAGATGGGTGGATCACTTGAGCCCAGGA
+GTTCAAGACCAGCCTGGGCAACGTGGCAAAACCCTGTCTCTGCAAAAAATACAAAAGTTAGTTGGGTGTG
+GTGACATGCACCTGTAGTCTCAGCTACCCCGGGAGGCTGAGGTAGAAGAATCACCTGAGTCTGGCAAGTG
+GAGGCTGCAGTGAGACGAGATCGGGCCACTGCACTCTAGCCTGGGTGATAAGAGTAAGACCCTGTCTCAA
+AAAACA1
diff --git a/tables/alus/HUMADAG b/tables/alus/HUMADAG
new file mode 100644
index 0000000..39d5ac5
--- /dev/null
+++ b/tables/alus/HUMADAG
@@ -0,0 +1,5 @@
+GGCCAGGCGTGGTGGTTCATGCCTGTGATCCCAGCACTTTGGGAGGCGAGGCAGGAGGATTGCTTGAGCC
+TAGGAGTTTGAGACCAGCCAGGGCAACATAGTGAGATCCATCTCTGCAAATACAAAAAAAAGTAGTCAGG
+CATGGTGGTGCATGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGAGAATTGCTTGAGTCCAGGAA
+GTCAAAGCTGCAGTGAGCTGTGATAATGCGACTGCACTCCAGCCTGGGTGACAGAGGGAGACCCTGTCTC
+AAAAAAAA1
diff --git a/tables/alus/HUMADAG-10 b/tables/alus/HUMADAG-10
new file mode 100644
index 0000000..30c6f4a
--- /dev/null
+++ b/tables/alus/HUMADAG-10
@@ -0,0 +1,5 @@
+GGTCGGGCGTGGTGGTTCATGCCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGGTGGATCACTTGAGG
+TTAGGAGTTCAACACCAGCCTGGCCAACATGGTGAAATCCCCTCTCTATTAAAAATACAAAAATCAGCCA
+GGCATGGTGGCATGCATCTGTAGTCCCAGCTACTCAGGAGACTGAGGTAGGAGAATCATTTGAACCTGGG
+AGGCGGAGGTTGCAGTGAGCCGAGATTGAGCCACTAGCACTCCAGCCTGGGCAACAGAGCAATACTCCCT
+CTCAAAAAAAA1
diff --git a/tables/alus/HUMADAG-11 b/tables/alus/HUMADAG-11
new file mode 100644
index 0000000..d9343da
--- /dev/null
+++ b/tables/alus/HUMADAG-11
@@ -0,0 +1,5 @@
+CGTGCATGGTGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACCTGAGGTCAG
+GAGTTCAAGACCAGCGTGGCCAACATGGGAAACCCCGTGTCTACTAAACATACAAAAAATTAGCCAGGCA
+TGGTGGTGGGCATCTATAATCCCAGCTACTTGAGAGGCTGAGGCAGGAGAATCACTTGAACCAGGGAGGT
+GAAGGTTGCAGTGAGTCGAGGTCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAGACTCCGTCTCA
+AAAAAA1
diff --git a/tables/alus/HUMADAG-12 b/tables/alus/HUMADAG-12
new file mode 100644
index 0000000..17127ca
--- /dev/null
+++ b/tables/alus/HUMADAG-12
@@ -0,0 +1,5 @@
+GGCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCATTTTGGGAGGCCGAGGCGGGCAGATCACCTGAGG
+TCAGGAATTCGAGACCAGCCTGGCAAACATAGTAAAACCCCATCTCTACTAAAAATACAAAACCATTAGT
+TGGGTGTGATGGTGGGTGCCTGTGATCCCAGCTACTTGGGAGGCTAAAGCAGGAGAATCCCTTAAACTGG
+GAGGTGGAGGTTGCAGTGAGCTGAGATCGCACCACTGCACTTCAGCCTGGGCAACAGAGTGAGACTCTGT
+CTCAATAAATA1
diff --git a/tables/alus/HUMADAG-13 b/tables/alus/HUMADAG-13
new file mode 100644
index 0000000..e0a3281
--- /dev/null
+++ b/tables/alus/HUMADAG-13
@@ -0,0 +1,5 @@
+GGCTGGGCATGGGGGCTCACCCCTGTAATCCAGCCACTTTGGGAGGCCGAGGTGGGTGGATCACCTGAGG
+CCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATGCAAAAAATTAGCC
+AGGCGTGGTGGCGCACACCTGTAATCCCAGCTACTTGGGAGGCTGAGACAGGAGAATCACCTGAACCTGG
+GAGGCAGAGGCTGCAGTGAGCCGAGATTACGCCACTGCACTCCAGCCTGGGCGACAAGAGCGAAACTGCA
+TCTCAAAAATAA1
diff --git a/tables/alus/HUMADAG-14 b/tables/alus/HUMADAG-14
new file mode 100644
index 0000000..4fe1faa
--- /dev/null
+++ b/tables/alus/HUMADAG-14
@@ -0,0 +1,5 @@
+GCCGGGCACAGTGGCTCACACCTGTAATCCCAGCGCTTTGGGAAGCCAAGGTGGGAGGATCACCTGAGAT
+CAGGAGTTCGAGACCAGCTGGCTAACGTGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGGG
+CGTGGTGGTGGACGCCTGTAATCCCAGCTACTCCGGAGGCTGAGGCAGAAGAATGGCTTGAACTCGGGAG
+GTGGAGGTTGTAGTGAGCCAGGATCACACCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTGCGGCAC
+A1
diff --git a/tables/alus/HUMADAG-15 b/tables/alus/HUMADAG-15
new file mode 100644
index 0000000..5f4822c
--- /dev/null
+++ b/tables/alus/HUMADAG-15
@@ -0,0 +1,5 @@
+GCCTGGGCGCAGTGGTTCATGCCTGTAATCCCAGGACTTTGGGAGGCCAAGGCAGGAGGATCACCTGAGG
+TCGGGAGTTTGAGACCAGCCTGACCAACATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAAAAAATAC
+AAAATTAGCCAGGCGTGGTGGTGTATGACTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCGC
+TTGAACCTGGGAGGTGGAGGTTGTGGTGAGCCAAGATTACACCATTGCATTCCAGCCTGGGCAACAAGAG
+CGAAACTCCATCTCATGAAAAAAA1
diff --git a/tables/alus/HUMADAG-16 b/tables/alus/HUMADAG-16
new file mode 100644
index 0000000..1b72308
--- /dev/null
+++ b/tables/alus/HUMADAG-16
@@ -0,0 +1,5 @@
+GGCTGGGCATGGTGGCTCACGCCTGTAATCCCAGCACTTTGGCCTCCCAGGCAAGGTCGGAGGATCATGA
+GGTCAGGAGATTGAGACTATCCTGGCCAACATGGTGAAACCCCGTCTCTACCAAAAATACAAAAATTAGT
+TGGGTGTGGTGGTGTGTGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCAG
+GGAGTCAGAGGCTGTGGTGAGCCGAGATCATGCTACTGCACTCCAGCCTGGTGACAGAGCAAGACTCTGT
+ATCACAAAAAA1
diff --git a/tables/alus/HUMADAG-17 b/tables/alus/HUMADAG-17
new file mode 100644
index 0000000..f57bf4d
--- /dev/null
+++ b/tables/alus/HUMADAG-17
@@ -0,0 +1,5 @@
+GGCCAGGTACAGTGGCTCACGCCTGTATCCCCAGCACTTTGGGAGGCTGAGGTGGGCGGATCATGAGGTC
+AGGAGATCGAGACCATCCTGGCTAACACAGCGAAACGCTGTCTCTACTAAAAATACAAAAAATTAGCCGG
+GCGTGGTGGCATGCGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGACGAATCACTTGAACCCAGGA
+GGCAGAGGCTGCAGTGAGCTGAGATGGCGCCACTGCACTCCAGCCTGGGCGACAGAGCAAGATTCTGTCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMADAG-18 b/tables/alus/HUMADAG-18
new file mode 100644
index 0000000..e9e3dde
--- /dev/null
+++ b/tables/alus/HUMADAG-18
@@ -0,0 +1,5 @@
+GGCGGGGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCAAAGGTGGGTGGATCATGAGGCC
+AGGAGTTCGAGACCAGCCTGGCCAACATGGCAAAACCCCATTTCTACTAAAAATACAAAAAATTAGCTGG
+GCATGGTGCGGGCGCCTGTAGTCCCACGTACGCAGAAGGCTGAGGCAGGAGAATGGCATAAACCCGGGAG
+GTGGAGCTTGCAGTGAGCCGAGATCTCGCCATTGCACTCCAGCCTGGGTGACAGAGTGAGACTCTGTCTC
+AAAAAAAA1
diff --git a/tables/alus/HUMADAG-2 b/tables/alus/HUMADAG-2
new file mode 100644
index 0000000..e3a4cd3
--- /dev/null
+++ b/tables/alus/HUMADAG-2
@@ -0,0 +1,5 @@
+GCCCAGGAGGTTGGCTCACGCCTGTAATCCCAGCACTTTGTGAGGCTGAGGCGGGAGGATCATTTGAGTC
+CAGGAGTTTGAGACTAGCCTGGACAACAAAACTAGACCCCGTCTCTCCAAAAAATAAAAAAAGTTAGGGG
+AAGTGTGTGTGGTGGTGCACTCCCGTAGTCCCAGCTACTCAGGAGGCTGAGGCGGGAGGATCGCTTGAGC
+CCAGGAGGTTGAGGCTGCAGTGAGGTGTGATGGTGCCACTGACCTTCAGCCTGGGAGACAGAGCGAGACC
+CTGTCTCAAAAAAAA1
diff --git a/tables/alus/HUMADAG-3 b/tables/alus/HUMADAG-3
new file mode 100644
index 0000000..057f5f9
--- /dev/null
+++ b/tables/alus/HUMADAG-3
@@ -0,0 +1,4 @@
+GGCCGGGTGCAGTGGCTCATACCTGTAATCACAGTACTTTGGGAAGCCAAGGCTGGCGGATCACTTAAGC
+CCAGGAGTTTGAAACCAGCCTGGGCAACACAGTGAAACCCCTTCAAAACAAAAATTCACCAGGCGTGGTG
+GCGTGCACCTGTAGTCCCAGCTACTTGGGAGGCTAAGGTGGGAGGACCACCTGAGCCTGGAAGTCAAGGC
+TACTGCGGGCCAAGATTGCACCACTGCACTCCAGCTTGGGTGACAGAGCAAGACCCTGTCTCAAAAAAA1
diff --git a/tables/alus/HUMADAG-4 b/tables/alus/HUMADAG-4
new file mode 100644
index 0000000..2a306c9
--- /dev/null
+++ b/tables/alus/HUMADAG-4
@@ -0,0 +1,5 @@
+GCCAGGCACAGTGGCTCATGCCTGTAATCCCAGCACTGTGGGAGGCCAAGCTGGGTGGATCACTTGAGGT
+CAGGAGTTCGAGACCAGCCTGGCCAACATGTGAAACCCCATCTCCGCTAAAAATACAAAAATTAGCTGGA
+TGTGGTGGCATGTGCTTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAG
+GTGGAGGTTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGCGACAGAGCAAGACTCTATCTC
+AAAAGAAA1
diff --git a/tables/alus/HUMADAG-5 b/tables/alus/HUMADAG-5
new file mode 100644
index 0000000..7ac99df
--- /dev/null
+++ b/tables/alus/HUMADAG-5
@@ -0,0 +1,5 @@
+GGTCGGTTGCGGTGGCACATACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGGACACCTGAGGT
+CAGGAGTTCGAGACTAGCCTGAGCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACTGG
+GTGTGGTGGCACCTGCCTATAGTCCCAGCTACTCCGGAGGCTGAGGCACAAGAATTGCTTGAACCCAGGA
+GGTGGAGGTTGCAGTGAGAGGAGATCACGTCACTGCACTCCAGCCTGGGAGACAGAGCGAGACTCCATCC
+GTCTCAAAAAAAA1
diff --git a/tables/alus/HUMADAG-6 b/tables/alus/HUMADAG-6
new file mode 100644
index 0000000..a99fe14
--- /dev/null
+++ b/tables/alus/HUMADAG-6
@@ -0,0 +1,5 @@
+GGCCGGGTGCGGTGGCTCATGCCTGTAATCCCGGCACTTTGGGAGGCTGAGGTGGGCGGATCACCTGAAG
+TCGGGAGTTCGAGACCAGCCTGGCCAACAAGGAGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCC
+GGGCATGGTGGTGCATGCCTGTAATTCCAGCTACTTGGGAGGCTGAGGCAGAAGAATCACTTGAACCTGG
+GAGGCGAAGGTTGCAGTGAGCCGAGGTCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCTG
+TCTAAAAAAAGA1
diff --git a/tables/alus/HUMADAG-7 b/tables/alus/HUMADAG-7
new file mode 100644
index 0000000..a9cf66e
--- /dev/null
+++ b/tables/alus/HUMADAG-7
@@ -0,0 +1,5 @@
+GGCCAGGCGTGGTGGCTCATGCCTGTAATCACGCCTTGGGAAGCCGAGGTGGGTGGATCATGTGAGGTCA
+GGTGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTACAAATCCAAAACTTAGCAGGGC
+ACGGTGCCACGCGCCTGTAATCCCAGCTACTCGGGACGCTGAGGCCGAGAACTGCTTAAAATCCAGGAGG
+TGGAGGTTGCAGTGAGCCGAGATTTCGCCACTGCACTCCAGCCTGGGCGACAGAGCAAGAGTCCATCTCA
+AAAAAA1
diff --git a/tables/alus/HUMADAG-8 b/tables/alus/HUMADAG-8
new file mode 100644
index 0000000..d391818
--- /dev/null
+++ b/tables/alus/HUMADAG-8
@@ -0,0 +1,5 @@
+GGCTGGGCACGATGGTTCACACCTGTAACCCCAGCACTTTGGGAGGCCGAGGTGGGCAGATCACTTGAGG
+TCAGGAGTTCAAGACCAGCCTGGCCAATATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTG
+GGCATGGTGGTGGGTGCCTGTAGTCCCAACTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGG
+AGGTGGAGGTTTCCGTGAGCTGAGCTGGAGCCACTGCACTCCAGCCTGGGCAACAGAGTGAAACTCCGTC
+TTAAAAAAA1
diff --git a/tables/alus/HUMADAG-9 b/tables/alus/HUMADAG-9
new file mode 100644
index 0000000..1d244df
--- /dev/null
+++ b/tables/alus/HUMADAG-9
@@ -0,0 +1,5 @@
+GGCAGGGTAAGGCAGCTCACGCCTGTAATCATAACACTTTGGGAGGCCTAGGTGGGTGGATCACCTGAGG
+TCAGGAGTTCAAAACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAGTACAAAAATTAGCCG
+GGCATGGTGGTGGGCGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCACGAGAATCCCTTGAACCTGGG
+AGGCAGAGGCTGCAATGAGCTGAGATCTTGCCACTGCACTCCAGCCTGGGCAACAGAGCCAGACTCCATC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMAGG b/tables/alus/HUMAGG
new file mode 100644
index 0000000..3055351
--- /dev/null
+++ b/tables/alus/HUMAGG
@@ -0,0 +1,5 @@
+GGCCGGGCGCGGTCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCGGATCACGAGGTCAG
+GAGATCGAGACCATCCTGGCTACCACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCGGGC
+GTAGTGCGGGCGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGGAGGC
+GGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTTGGCGACAGAGCGAGACTCCGTCTCGG
+AAAAAA1
diff --git a/tables/alus/HUMALBG b/tables/alus/HUMALBG
new file mode 100644
index 0000000..0dafb3d
--- /dev/null
+++ b/tables/alus/HUMALBG
@@ -0,0 +1,5 @@
+GGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGATGCCAAAGTGGGCATATCATCTGAGG
+TCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACTCCGTCTTTATGAAAAATACAAAAATTAGCCA
+GGCATGGTGGCAGGCGCCTGTAATCCCAGCTAATTGGGAGGCTGAGGCTGGAGAATCCCTTGAATCTAGG
+AGGCAGAGGTTGCAGTGAGCTGAGATCGTGCCATTGCATCCAGCCTGGGTGACAAGAGTAAAACTCTGTC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMALBG-2 b/tables/alus/HUMALBG-2
new file mode 100644
index 0000000..0a227c1
--- /dev/null
+++ b/tables/alus/HUMALBG-2
@@ -0,0 +1,5 @@
+GCCGGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGACAGGTGAATCACCTGAGGT
+CTGGAGTTCAGACCAGCCTGGCCAACGTGGTGAAAACCTGTCTTTACAAAAAATACAAAAATTAGCCTGG
+TGTGGTGGCGCACACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAG
+GCGGAGGTTGCAGTGAGCTGAGATTGCACCACTGCATTCCAGCCTGGGTGACAGGGCAAAACTCCATCTC
+AAAAAAAA1
diff --git a/tables/alus/HUMANFA b/tables/alus/HUMANFA
new file mode 100644
index 0000000..f64e410
--- /dev/null
+++ b/tables/alus/HUMANFA
@@ -0,0 +1,5 @@
+GGCTGGGCACGGTCGTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAGGCAGCGGATCATGAGGTCAAG
+AGATCAAGACTATCCTGGCCAACATGGTGAAACCCCAGCTCTACTAAAAATACAAAAATTAGCTGGGTGT
+GTGGCGGGCACCTGTACTCTCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGAGGCAG
+AGGTTGCAGTGAGCAGAGATCACGCCACTGCAGTCCAGCCTAGGTGATAGAGCGAGACTGTCTCAAAAAA
+AA1
diff --git a/tables/alus/HUMANFA-2 b/tables/alus/HUMANFA-2
new file mode 100644
index 0000000..e98d714
--- /dev/null
+++ b/tables/alus/HUMANFA-2
@@ -0,0 +1,4 @@
+GGCCAGGCGCGGTGGCTCACGCCTGTAATCCCAGCGCTTTGGGAGGCCAAGGCGGGTGGATCACGAGGTC
+AGGAGATGGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCAG
+GCGTGGTGGCAGGCGCCTGTAAGTCCTAGCTACTCCGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGG
+AGGCGGAGCTTGCAGTGAGCAGAGAT1
diff --git a/tables/alus/HUMAPOAI1 b/tables/alus/HUMAPOAI1
new file mode 100644
index 0000000..f1fa969
--- /dev/null
+++ b/tables/alus/HUMAPOAI1
@@ -0,0 +1,5 @@
+GGCCAGGCGCAGTGGCTCGTGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGAGGATCCCCTGAGG
+TCAGGAATTCAAGACCAACCTGGCCAACATGGTGAAACCCCATCTCTGCTAAAAATACAAAAATTAGCTG
+GGTGTGGTGGCGCGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGGTTGAACCCGGG
+AGACGGAGGTTGCAGTGAGCCGAGATGGCACCACTGCACTCCAGCCTGGGTGACAGAGGGAGACTCCATT
+AAAAAAAAA1
diff --git a/tables/alus/HUMAPOAI1-2 b/tables/alus/HUMAPOAI1-2
new file mode 100644
index 0000000..d5eb96c
--- /dev/null
+++ b/tables/alus/HUMAPOAI1-2
@@ -0,0 +1,5 @@
+GGTCGGGTGCGGTGGCTCAAGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGTATCACTTCAGG
+GCCAGGAGTTTGAGACCAGCCTGGCCAATATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAAAAT
+CAGCTGGCATGGTGGCTTGCACCCGTAATCCCAGCTACTAGGAAGGCTGAGGCAGGAGAATCGCTTGAAC
+CTGGGAGGCAGAGGTCACAGTGAGCCGAAATCATGCCACTGCACTCCAGCCTGGGCGACGGAGCAAGACT
+CCAGCTAAAAAAAAA1
diff --git a/tables/alus/HUMAPOAII b/tables/alus/HUMAPOAII
new file mode 100644
index 0000000..299dbed
--- /dev/null
+++ b/tables/alus/HUMAPOAII
@@ -0,0 +1,5 @@
+GGTGGGGTGCAGTGGCTCACACCTGTAATCCCTGCACTTTGGGAGGCCAAGGTAGGTGGATCACTTGAGA
+TCAGGAGTTCCAGACCAGCCTAGCCAACATGGTAAAACCCCGTCTCCTACTAAAAATACAAAAATTAGCC
+AGGTGTTGTGGCACGTGCCTGTAATCCCAGCTACTCGGGAGACTGAGGCATGAGAATCTTTTGAACCGGG
+GAGGCGGAGGTTGCAGTGAGCTGACATCGTGCCACTGCACTCCAGCCTAGGTGACAGAGCAGACTCCGTC
+ACAAAAAAAA1
diff --git a/tables/alus/HUMAPOAII-2 b/tables/alus/HUMAPOAII-2
new file mode 100644
index 0000000..57a75b7
--- /dev/null
+++ b/tables/alus/HUMAPOAII-2
@@ -0,0 +1,5 @@
+GGTGGGGCACAGTGGCTCAAGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGTGGGCGGATCACGAGGTC
+AGAAGTTCGAGACCAGCCTGGCCAGCATGGTGAAACCCATCTCTACTAAAAATACAAAATATTAGCCGGG
+CATGGTGGCAGGTGCTTGTGATTCCAGCTCCTTGGGAGGCTGAGGCAGAAGAATTGCTAGAACCCTGGAG
+GCAGTGGGTGGCATTGAGCCCAGAATACGCCACTGCACTCCAGCCTGGGTGACAGAGCCAAACTCCATCT
+CAAAAAATA1
diff --git a/tables/alus/HUMAPOCII b/tables/alus/HUMAPOCII
new file mode 100644
index 0000000..32d199d
--- /dev/null
+++ b/tables/alus/HUMAPOCII
@@ -0,0 +1,4 @@
+GGGGTCAGTGGCTCACACCTGTACTCCCAGGACCCTGGGGAGGCCGAGATGGGAGGATCGCTTGAGCCCC
+AGGAGTTCCAGACCAGCCTGGGCAACAAAGCAAGTCTCCCATCTCTACAAAAGAATAAAAATTAGCAGCT
+GGGCATGGTGGCTCACGCCTGTAATTCCAGCACTTTTGGAGGCCAAGGCAGGCAAATCACTTGAGGTTAG
+GAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTA1
diff --git a/tables/alus/HUMAPOE4 b/tables/alus/HUMAPOE4
new file mode 100644
index 0000000..2229366
--- /dev/null
+++ b/tables/alus/HUMAPOE4
@@ -0,0 +1,5 @@
+GCTGGGCCGGTGGCTCACCCCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCC
+AGGAGTTCAACACCAGCCTGGGCAACATAGTGAGACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGG
+CATGGTGCCACACACCTGTGCTCTCAGCTACTCAGGAGGCTGAGGCAGGAGGATCGCTTGAGCCCAGAAG
+GTCAAGGTTGCAGTGAACCATGTTCAGGCCGCTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTTTA
+TAAATACA1
diff --git a/tables/alus/HUMAPOE4-2 b/tables/alus/HUMAPOE4-2
new file mode 100644
index 0000000..f425bbe
--- /dev/null
+++ b/tables/alus/HUMAPOE4-2
@@ -0,0 +1,4 @@
+GGCCGGGCATGGTGGCTCAGGCCTGTAATCTCAGCACTTTGGGAGGCCGGCGGGTGGATCACTTGGTCAG
+GAGTTTGAGACCTGCCTGGCCAACATGGTGAAAGCTCGTCTCTACTAAAAATACAAAAATTAGCCGGGTG
+TGGTGGTGTGAGCCTGTAATCCCAGCTACTGAGGCAGCAGAATCGCTTGAACCCAAGAGGCAGAGGTTGC
+AGTGAGCCAAGATCGTGCCACTGCACTCTAGCCTGGGTGACAGAGCCAGACTCCGTCTAAAAAAAAA1
diff --git a/tables/alus/HUMAPOE4-3 b/tables/alus/HUMAPOE4-3
new file mode 100644
index 0000000..ab46592
--- /dev/null
+++ b/tables/alus/HUMAPOE4-3
@@ -0,0 +1,5 @@
+GGCTGGGGGCGGTAGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCACAGTGGGCGAATCACTTAAGG
+TCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAGTTAGCCA
+GGCGTGGTGGCGGGCGCCTGTAATCCTAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGG
+AGGCGGAGGTTACAGTGAGCCGAGATCTCGCCACTGCACTCCAGCCTCAGCAAGAGGGAGACTGTCTCAA
+AAAAA1
diff --git a/tables/alus/HUMAPOE4-4 b/tables/alus/HUMAPOE4-4
new file mode 100644
index 0000000..e1d05bc
--- /dev/null
+++ b/tables/alus/HUMAPOE4-4
@@ -0,0 +1,5 @@
+GCCAGGTGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGATCGAAACGGGCAGATCACCTGAGGT
+CAGGAGTTCCAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTGAAAATACAAAAAAGTTAGTC
+GGGTGTGGTGGCATGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGGACCTGG
+GAGGCGGAGCTTGCAGTGAGCCGAGATCCCGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTTCAT
+CTCAAAAAAA1
diff --git a/tables/alus/HUMBLYM1 b/tables/alus/HUMBLYM1
new file mode 100644
index 0000000..f500f95
--- /dev/null
+++ b/tables/alus/HUMBLYM1
@@ -0,0 +1,5 @@
+GCTGGGTGGTGGCTCACGCTGTAATCCTAGCACTGTGGGAGGCTGAGGCAGGAGGATTGCTTGAACCAAG
+GTATTCAAGAAGAGCATGGGCAACATGATGAGACCCTCGTCTCTACCAAAAATTTAAAAATTAGCTGGGC
+AGGGCATGATGATGCACATCCCTAATCCCAGCTACTTGAGGGCTTGAGGTGGCAGGATCACTTGAACCTA
+GGAACATTGAGGCTGCAGTGAGCTATGATCTTGCCACTGCACTCCATACTGCATGACAGAGCAAGACCCT
+GTCTCTTAGAAAAAA1
diff --git a/tables/alus/HUMC1A21 b/tables/alus/HUMC1A21
new file mode 100644
index 0000000..8b5b179
--- /dev/null
+++ b/tables/alus/HUMC1A21
@@ -0,0 +1,5 @@
+GGCTGGGTGGGGTGGCTCACATGTGTAATCCCAGCACTTTAGGAGGTCAAGGTGGTGGATCACCTGAGGT
+CAGGAGATCGAGACCAGCCTGACCAATATGGTGAAACCCCGTCTTTACTGAAAATACGAAAATTAGCTAG
+GCATGATGGTGTGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGACAAGATAATTACTTGAACTGGGGA
+GGCGGAAGTCGCAGTGAGCTGAGATCGTGCCACTGCACTCCAGCATGGGCACAGAGGGGAACTCCATCCC
+AAAACAAA1
diff --git a/tables/alus/HUMC1A23 b/tables/alus/HUMC1A23
new file mode 100644
index 0000000..76d7ad6
--- /dev/null
+++ b/tables/alus/HUMC1A23
@@ -0,0 +1,5 @@
+GCCGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGAAGGGCGGATCACCTGAGGT
+CAGGAGTTCGAGACCAGCCTGACCAACGTGGCACAACCTGTCTCTACTAAAAATATCAGATTAGCTGGGC
+ATGGTGGTGCATGCCTGTAATCCCAGCTACTAGGGAGGATGAGGTAGTAGAGTAGGTTGAACCTGGGAGT
+TCAAGCTGTAGTGATCCAAGATCGCACCACTGCACTCCAGCCTGTGCAACAAGAGTATAACTCATCAAAA
+AAAAAA1
diff --git a/tables/alus/HUMC1AIN1 b/tables/alus/HUMC1AIN1
new file mode 100644
index 0000000..ea78340
--- /dev/null
+++ b/tables/alus/HUMC1AIN1
@@ -0,0 +1,5 @@
+GGCGGGGCACGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCGGATCACCATGAG
+GTCAGGAGTTTGAGACCAGTCGGGCCAACATAGTGAAACCACGTCTCTACTAAAAATACAAAAGATTAGC
+CTGGGTGTGGTGGTGTGCACCCTGTAACCCCAGCTAGTCAGGAGGCTGAGGCAGGAGAATTGCATGAACC
+CAGGAGGTGGAGGTTGCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGGGCAACAGTGCGAGACTC
+CATCTCAAAAAAAA1
diff --git a/tables/alus/HUMERPA b/tables/alus/HUMERPA
new file mode 100644
index 0000000..b2bbd15
--- /dev/null
+++ b/tables/alus/HUMERPA
@@ -0,0 +1,5 @@
+GCCTGGGCGCAGAGGCTCACGTCTATAATCCCAGGCTGAGATGGCCGAGATGGGAGAATTGCTTGAGCCC
+TGGAGTTTCAGACCAACCTAGGCAGCATAGTGAGATCCCCCATCTCTACAAACATTTAAAAAAATTAGTC
+AGGTGAAGTGGTGCATGGTGGTAGTCCCAGATATTTGGAAGGCTGAGGCGGGAGGATCGCTTGAGCCCAG
+GAATTTGAGGCTGCAGTGAGCTGTGATCACACCACTGCACTCCAGCCTCAGTGACAGAGTGAGGCCCTGT
+CTCAAAAAAGA1
diff --git a/tables/alus/HUMFIXG b/tables/alus/HUMFIXG
new file mode 100644
index 0000000..98cf551
--- /dev/null
+++ b/tables/alus/HUMFIXG
@@ -0,0 +1,5 @@
+GGCCAGGTGCAGTGGCTCATGCCAATAATCCCAGCACTTTGGGAGACTGAGACGGGAGGATTGCTTAAAC
+CCAGGAGTTTGAGACCAGCCTGGGCAACACGGCGAAACCCCACCTCTAATTAAAAAAAATACAAAAATTA
+GCTAGGTGTGATGGCTCCCACCTGTGCTCCCAGCTATTCAGGAGGCTGAGGTGGGAGAATCACCTGAGCC
+TGGAAAGTCGAGGCTGCAGTGAATTGTGATCACACCACTGCACTTCAGCCTGAGTGACAGAGTAAGACCC
+TATCTCAAAAAACA1
diff --git a/tables/alus/HUMFIXG-2 b/tables/alus/HUMFIXG-2
new file mode 100644
index 0000000..55376e3
--- /dev/null
+++ b/tables/alus/HUMFIXG-2
@@ -0,0 +1,5 @@
+GGCTGGGCCCAGTGGCTCACGCCTATAATCCCAGCACTTCTGGAGGCCAAGGTGGGCGGATCACCTGAGG
+TTAGGAGTTTCAGGCCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATAGAACAATTAGCTG
+GGCTTGGTAATGTGCACCTATAATCCCAGCTACTGGGGAGGCTGAGGCAGGAGAATCACTTGAGCCTGGG
+GAGCAGGGGGTGCGGGAGGTTGCAGTGAGACAAGATCGCACCAGTGCACTCCCCATCCTGGGTGACAGAG
+TGAGACTCTGTCTCAAAGAAAA1
diff --git a/tables/alus/HUMFIXG-3 b/tables/alus/HUMFIXG-3
new file mode 100644
index 0000000..d8626b0
--- /dev/null
+++ b/tables/alus/HUMFIXG-3
@@ -0,0 +1,5 @@
+GGCCTGGCATGGTGGCTCACACCTATAATCCCAGCACTTTCAGAGGCCCAGGCAGGCAGATCACTTGAGG
+TCAGGAGTTCGACACCAGCCTGGCCAACAAGGTGAAATGGTGAAACCCCGGCTTTACTAAAAATACAAAA
+ATTAGCTGGGCATGGTGGCGGGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATAGCTTG
+AACCTGGGAGATGGAGGTTGCAGTGAGCTGAGATCGCACCACTGCACTCCAGCCTGGGCGACAGAGCAAG
+ACTCTGTCAAAAAAAAAA1
diff --git a/tables/alus/HUMFIXG-4 b/tables/alus/HUMFIXG-4
new file mode 100644
index 0000000..380e182
--- /dev/null
+++ b/tables/alus/HUMFIXG-4
@@ -0,0 +1,4 @@
+GGCCGGGTGCAGTCGTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCGGATCACGAGGTCG
+AGAGATCGAGACCATCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAACTGGGC
+ATGGTGGCATGCGCCTGTAGTCCCAGGAGAATTGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCAAG
+ATCTCACCACTGCTCTCCAGCCTGGTGACAGGGCAAGACTCCGTCAAAAAAAAAA1
diff --git a/tables/alus/HUMFOL5 b/tables/alus/HUMFOL5
new file mode 100644
index 0000000..d5766fa
--- /dev/null
+++ b/tables/alus/HUMFOL5
@@ -0,0 +1,4 @@
+GCCAGGCACAGTGGCTCACACCTGTAATCCCAGCGCTTTGGGAGGCTAAGGCAGGCAGATCACTTGAGAC
+CAGCTTGGGCAACATGGCAAAGCCCCATCTCTACAAAAAACACAAAAATTAGCTGGGCATTGTGGCGCAC
+ACCTGTATTCCCATCTAGTCAGGAAGCTGAGATGGAAGAATTAATTGAGCCCACGAGTTCAAGGCTGCAG
+TGAGTCGTGATTGTGCCACTGCACTCCAGCCGGGGTGACAGAAGAGACCTTGTCTCGAAAA1
diff --git a/tables/alus/HUMFOL5-2 b/tables/alus/HUMFOL5-2
new file mode 100644
index 0000000..555fc82
--- /dev/null
+++ b/tables/alus/HUMFOL5-2
@@ -0,0 +1,5 @@
+GCCGGGTGTGGTGGCTCAAGCCGGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGTGAATCGCTTGCGGT
+CACGAGTTCGAGACTAGCCTGACCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAATACAAAAAT
+TAGCCGGGCGTGGTGGCGCGCACCTGGTCCTAGCTACTTGGGAGCCTGAGGCAGGAGAATCGCTTGAATC
+TGGGAAGGCGGAGGTTGCAGTAAGCCGAGGTCACACCATTGCATGCTAGCCTGGGCAACAAGAGCGAAAC
+GCTGTCTAAAAAAAAA1
diff --git a/tables/alus/HUMGAST2 b/tables/alus/HUMGAST2
new file mode 100644
index 0000000..5fbc8e7
--- /dev/null
+++ b/tables/alus/HUMGAST2
@@ -0,0 +1,5 @@
+CCGGGTGCGGTGGCTCATGCCTGTAATCCCAGCATTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTCA
+GGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATTACAAAAATTAGCCAGG
+CATGGTGGCACGTGCCTATAGTCCCAGATATTCTGGAGGCTGAGGCAGGAGAATCACTTGAACCCGGGGG
+AGGCGGAGGTTATAGTGAGCCGAGATCCCACCACTGCACTCCAGCCTGGGCAACAAGAGTGAAACTCTGT
+CTAAAAAAAAA1
diff --git a/tables/alus/HUMGHV b/tables/alus/HUMGHV
new file mode 100644
index 0000000..9a3b0a0
--- /dev/null
+++ b/tables/alus/HUMGHV
@@ -0,0 +1,4 @@
+GGAAGGGCCCAGTGGCTCATACCTGTAATCCCAGCAATTTGGGAGGCCGAGGCGGGCGGATTACCTGAGG
+TCAGGAGATGGAGACCAGACTGGCCAATATGGTGAAACCCCGTCTCTACCAAAAATACAAAAATTAGCTG
+AGCCTGGTCTTGCATGCCTGGAATCGCAACTATTCGGGAGACTGAGGCAGGAGAATCGCTTGAACCCAGG
+AGGCGGAGATTGCAGCGAGCCAAGACTGCCACTGCACTCCAGCCTGGTT1
diff --git a/tables/alus/HUMHBA4 b/tables/alus/HUMHBA4
new file mode 100644
index 0000000..7ba3d00
--- /dev/null
+++ b/tables/alus/HUMHBA4
@@ -0,0 +1,5 @@
+GGCTGGGTGTGGTGGCTCACGCCTGTCATCCCAGCACTTTGGGAGACCAGGAGGGCAGATCACTTGAGGT
+CAGGAGTTTGAGACCAGCCTGATCAACATGGTGAAACTGTGTCTCTATTAAAAATACATAAATTAGCCAG
+GAATGGCGGCCCATGCCTGTAATCCCAGCTACACGGGAGACAGAGGCAGGAGAATCGCTTGAACCCAGGA
+GGTTGAGGCTGCAGTGAGCCAAAACTTGCCACTGCACTCCAGCCGGGGTGTCAGAGCAAGGGCCTATCTC
+AAAAACAA1
diff --git a/tables/alus/HUMHBA4-2 b/tables/alus/HUMHBA4-2
new file mode 100644
index 0000000..d58a321
--- /dev/null
+++ b/tables/alus/HUMHBA4-2
@@ -0,0 +1,5 @@
+GACAGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGAGGATCACCTGAGGT
+CGGGAGTTTGAGACCACCCTGATCAACATGTAGAAACCCCATCTATACTAAAAATACAAAATCAGCCGGG
+CATGGTGGCCCATGCCTGTAAACCCACCTACTCCGGAGGCTGAGGCAGGAGAATCATTTTAACCAAGGAG
+GCAGAGGTTGCAGTGAGCTAAGATCACACCATTGCACTCCAGCCTGGAAAACAACAGCGAAACTCCGCCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMHBA4-3 b/tables/alus/HUMHBA4-3
new file mode 100644
index 0000000..3731d17
--- /dev/null
+++ b/tables/alus/HUMHBA4-3
@@ -0,0 +1,5 @@
+GACAGGCACGGTGGCTCACACCTGTAATCCCAGTACTTTGGGAGGCTGAGGCGAGAGGATCACCTGAGGT
+CGGGAGTTTGAGACCAGCCTGACCAATATGGAGAAACCCCAGTTATACTAAAAATACAAAATTAGCTGGG
+TGTGGTGGCGCATGCCTGTAATCCTAGCTACTAGGAAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAG
+GTGGAGGTTGAGGTGAGCCGAGATCACGCCATTGCACTCCAGCCTGGGCAACAAGAGCAAAACTCCGTCT
+CAAAAAATA1
diff --git a/tables/alus/HUMHBBRT b/tables/alus/HUMHBBRT
new file mode 100644
index 0000000..03e17b1
--- /dev/null
+++ b/tables/alus/HUMHBBRT
@@ -0,0 +1,5 @@
+GGCTGGATGCGGTGGCTCAGGCTTGTAAACCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCACTTGAGG
+TCAGGAGTTCAAGACCAGCCTGACCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAATCAGCCGG
+GCGTGTGGTGCATGCCTGCAGTCCCAGCTATTCAGGTGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAG
+GCAGAGGTTGCGGTGAGCCTAGATTGCACCATTGCACTCTAGCTTGGGCAATAGGGATGAAACTCCATCT
+CAGAAGAGA1
diff --git a/tables/alus/HUMHBBRT-2 b/tables/alus/HUMHBBRT-2
new file mode 100644
index 0000000..752f84a
--- /dev/null
+++ b/tables/alus/HUMHBBRT-2
@@ -0,0 +1,5 @@
+GGCTGGGTGTGGTGGCTCACGCCTGTGATCCCAGCACTTTCAGAGGCCGAGGAGGGTGGATCACCTGATG
+TTAGGAGTTCGAGATCAGCCTGACCAACATGGTGAAACCCCCTATCTACTAAAAATAGAAAATTAGCCAG
+GCATGGTGGTGGCACATGCCTGTAGTGCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTG
+GGAGGCAGATGTTGCAGTGAGCCTGGATCATGCCATTGCACTCCAGCCTGGGCAACCATAGCAATGCTCC
+ATCTCAGAAAAAA1
diff --git a/tables/alus/HUMIFNB3 b/tables/alus/HUMIFNB3
new file mode 100644
index 0000000..9692d7a
--- /dev/null
+++ b/tables/alus/HUMIFNB3
@@ -0,0 +1,4 @@
+GCTGGGCACGCTTCACGCCCATAATCCCAGCACTTTGAAAGGCCAAGGCAGAGGATCACTTGAGTGGGAG
+GAGTTTGAGACCAGCCTGGGCAATGTGGTGAAATGTCATTTCTACAAAAAAAAACTATAAAAAAAACTAC
+AAAACAACAACAACAACAACAAAAAACTAGCCCAGGTATGATGGTGTGCGCCTGTAGTCCCAGCTACTTG
+CAAGGTTCAGGTGGGAGGATTCGTTGCGCCCAGGAGGTTGTGGCTGCAG1
diff --git a/tables/alus/HUMIFNB3-2 b/tables/alus/HUMIFNB3-2
new file mode 100644
index 0000000..1af6eb0
--- /dev/null
+++ b/tables/alus/HUMIFNB3-2
@@ -0,0 +1,4 @@
+GCCTAGGTGCAGTGGCTCACACCTATAATCCAGCACTTTGGGAGGCTGAGGTGAGAGGATCACTTGAGGC
+CAAGAGGTTGAGACTAGCCGGAAACATGGAGAGACCTAGTCTCTACAAAACAAACAAACGAAAACGAAAC
+ATTAGCCAGCATGTGCTGCACCTGTAGTCCTAGTAGCTACTCGAGCTAGCGGAGACTGTTTGAGCCAAGA
+GGTTGCAGTGAGCTATGATTGCGCTACTCATTCCAGCCTGGGCAACAGAGCAAGACCTTGTCACCAA1
diff --git a/tables/alus/HUMIFNB3-3 b/tables/alus/HUMIFNB3-3
new file mode 100644
index 0000000..5cbb989
--- /dev/null
+++ b/tables/alus/HUMIFNB3-3
@@ -0,0 +1,5 @@
+GGCGGGGTGCAGTGACTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGATCACTTGAGG
+TCAGGAGTTCGAGGCCAGCCTGGCCAACATGGTGAAACCCCATCTCAACTAAAAATACAAAAAATTAGCT
+GAGCATGGTGGCACACAAGTGTAGTCCGAGCTACTCCAGAGGCTGAGGCAGGAGAATCGCTTGAACCTAG
+GAGGCAGAGGTTGCAGTGAGCGGAGATCGTGCCACTGCACTCCAGCCTGGGCGACAGAGGGAGACTCATC
+TCAAGA1
diff --git a/tables/alus/HUMIFNB3-4 b/tables/alus/HUMIFNB3-4
new file mode 100644
index 0000000..c39b536
--- /dev/null
+++ b/tables/alus/HUMIFNB3-4
@@ -0,0 +1,5 @@
+GGCCTGGCCAGGTGGTGGCTCATGCCTGTAATCTCAGCACTTTGGGAGGCCAAGGCAGGCGGATCACTAG
+AGGTCAGGAGTTCCAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTCAAAAATACAAAAATTC
+AGCCGGGCTTTGGTGGCGCACACCTGTAGTCTGAGTTACTCGGGAGGCTGCGGCAGGAGAATCGCTTGAA
+CCCAGGAGGCAGAGGTTGCAGTGAGCCAAGATCCTCTGCCTCAGCCTCCCAAGCTAGCTGGGACTACAGG
+C1
diff --git a/tables/alus/HUMIFNB3-5 b/tables/alus/HUMIFNB3-5
new file mode 100644
index 0000000..0bd24b7
--- /dev/null
+++ b/tables/alus/HUMIFNB3-5
@@ -0,0 +1,5 @@
+GGCCGGCACGGTGGCTCACGCCTGTTATCTCAGGACTTTGGGAGGCCGAGGAGGGTGGATCACGAGGTCA
+AGAGATTGAGACCAACCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCGGGC
+ATGGGGGCGGGCACCTGTAGTCCCAGCTAGCTCAGGAGGCTGAGGCAAGAGAATAGCTTGAACGCAGAAG
+GCGGAGGTTGCAGTGAGCTGAGATCACGCCATTGCACTCCAGCCTGGGTGACAGAGCGAGTCTCCATCTC
+AAAAACAA1
diff --git a/tables/alus/HUMIL2R8 b/tables/alus/HUMIL2R8
new file mode 100644
index 0000000..1bbb59d
--- /dev/null
+++ b/tables/alus/HUMIL2R8
@@ -0,0 +1,4 @@
+GGCCTGGCACGGTGGCTCAGGCCTATAACCCCAGCTCCCTAGGAGACCAAGGCGGGAGCATCTCTTGAGG
+CCAGGAGTTTGAGACCAGCCTGGGCAGCATAGCAAGACACATCCCTACAAAAAATTAGAAATTGGCTGGA
+TGTGGTGGCATACGCCTGTAGTCCTAGCCACTCAGGAGGCTGAGGCAGGAGGAGTTGCTTGAGCCCAGGA
+GTTCGAGGCTGCAGTCAGTCATGATGACACCACTGCACTCCAGCCTGGGCAACACAGCAAGATCCCCG1
diff --git a/tables/alus/HUMINS2 b/tables/alus/HUMINS2
new file mode 100644
index 0000000..31406f1
--- /dev/null
+++ b/tables/alus/HUMINS2
@@ -0,0 +1,5 @@
+GGCTGGGTGCGGTGGCTCATGCCTATAATCCCAGCACTTTAGGAGGCTGAGGCGGGCAGATCACCTGAGG
+TCGGGAGTTCAAGACCAGCCTGACCAACAGGGAGAAACCCCATCTTTACTAAAACTACAAAATTAGCTGG
+GTGTGGTGGCACATGCCTGTAATCCCAGATATTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCTGGGA
+AGCAGAGGTTGCGCTGAGCCGAGATGGCACCATTGCACTCCAGCCTGGGCAACGAGAGCGAAACTCCGTC
+TCAAAAAAA1
diff --git a/tables/alus/HUMLDLIVS b/tables/alus/HUMLDLIVS
new file mode 100644
index 0000000..577896c
--- /dev/null
+++ b/tables/alus/HUMLDLIVS
@@ -0,0 +1,5 @@
+CTGGGCGCGGTGGCTCACACCTGTAATCCCAACACTTTGGGAGGCTAAGGCGGGCAGATCACCTAAGGTC
+AGGAGTTTGAGACCAGCATGATCAACATAGTGAAACCCTGTCTCTACTAAAAATACAAATTAGCTGGCCG
+TGTGGCATGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGGAGGAGAATCACTTGAACCCGGGAAATG
+GAGTTGCAGTGAGTCAAGATCGCGCCATTGCACTCCAGCCTGGGCAACAGAGTGAGACTCCCTCTCAAAA
+AAAA1
diff --git a/tables/alus/HUMLDLR b/tables/alus/HUMLDLR
new file mode 100644
index 0000000..6bdff32
--- /dev/null
+++ b/tables/alus/HUMLDLR
@@ -0,0 +1,5 @@
+GGCCAGACACAGGTGCCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGCTGGATCACTTGAGTTCA
+GGAGTTGGAGACCAGGCCTGAGCAACAAAGCGAGATCCCATCTCTACAAAAACCAAAAAGTTAAAAATCA
+GCTGGGTATGGTGGCACGTGCCTGTGATCCCAGCTACTTGGGAGGCTGAGGCAGGAGGATCGCCTGAGCC
+CAGGAGGTGGAGGTTGCAGTGAGCCATGATCGAGCCACTGCACTCCAGCCTGGGCAACAGATGAAGACCC
+TATTTCAGAAATA1
diff --git a/tables/alus/HUMLDLR-2 b/tables/alus/HUMLDLR-2
new file mode 100644
index 0000000..2071314
--- /dev/null
+++ b/tables/alus/HUMLDLR-2
@@ -0,0 +1,5 @@
+GGCCGGGCGCAGTGGCCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCATGAGGT
+CAGGAGATCGAGACCATCCTGGCTAACAAGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCG
+GGCGCGGTGGTGGGCACCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGTGTGAACCCGGG
+AAGCGGAGCTTGCAGTGAGCCGAGATTGCGCCACTGCAGTCCGCAGTCTGGCCTGGGCGACAGAGCGAGA
+CTCCGTCTCAAAAAAAA1
diff --git a/tables/alus/HUMMHDC3B b/tables/alus/HUMMHDC3B
new file mode 100644
index 0000000..1285c54
--- /dev/null
+++ b/tables/alus/HUMMHDC3B
@@ -0,0 +1,5 @@
+GGCCGGGCACTGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTC
+AGGAGATCCAGACCATCCTGGCTAACACGGTGAAACCCCGCCTCTACTAAAAATACAAAAAAATTAGCCG
+GGCGTGGTGGCACGCGCCTGTAATCCCACCTACTCAGGAGGCTGAGGCAGGAGAATGGCGTGAACCCGGG
+AGGCGGAGCTTGCAGTGAGCCGAGATGCGCCACTGCACTCCGGCCTGGGCGACAGAGTGAGAATCCGTCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMMHDRB3 b/tables/alus/HUMMHDRB3
new file mode 100644
index 0000000..084fe63
--- /dev/null
+++ b/tables/alus/HUMMHDRB3
@@ -0,0 +1,5 @@
+GCCAGGCACTGTGGCTCATGCCTGTAATCCCAGCATTTTGGGAGGCCGAGGCGGGCAGATCACGAGGTCA
+ATAGATCAATACCATCCTGGCCAATATGGTGAAACCCCGTCTCTACTAAAAATATAAAAATTAGTTGGCC
+GTGGTGGCATGCACCTGTAGTCCANGCTACTTGGAGGCTGAGGCAGGAGAATCACTTGAACCTAGGAGGT
+GGAGGTTACAGTGAGCCGAGATCGTGCCACTGCACTCCACTCTGGTGAGAGTGAGAGTCTGTCTCAAAAA
+AAA1
diff --git a/tables/alus/HUMMHDRB3-2 b/tables/alus/HUMMHDRB3-2
new file mode 100644
index 0000000..4e3627e
--- /dev/null
+++ b/tables/alus/HUMMHDRB3-2
@@ -0,0 +1,5 @@
+GGCCAGGCATGGTGCTCACGCCTGTAATCCTAACACTTTGGGAGGCCANGGCGGGTGGATCATCAGGTCA
+GGAGATCGAGACCATCCTGGCTAACACAGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGC
+ATGGTGGTGGGCGCCTGCAGTCCCAGCTACTCGTGAGGCTGAGGCAGGAGAATGGCATGAACCCGGGAGG
+CAGAGCTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCGACAGAGTGACATTCCATCTCA
+AAAAAAA1
diff --git a/tables/alus/HUMMYCRT b/tables/alus/HUMMYCRT
new file mode 100644
index 0000000..4c84315
--- /dev/null
+++ b/tables/alus/HUMMYCRT
@@ -0,0 +1,4 @@
+GCCTGGTGCCCTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCGTGAGGTCA
+GGAGAGACCATCCTGGCCAACACGATGAAACCCCATCTCTACTAAAATTACAAAAATTAGCCTGGCATGG
+TGGTGTGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCTCTTGAACCAGGGGGTTGGA
+GGTTGATTGCGCCACTGCACTCCAGCCTAGTGACAGAGTGAGACTCCGTCTTAAAAAA1
diff --git a/tables/alus/HUMNGFB b/tables/alus/HUMNGFB
new file mode 100644
index 0000000..0c1ecc9
--- /dev/null
+++ b/tables/alus/HUMNGFB
@@ -0,0 +1,5 @@
+GGCAGGGCGTGGTGGCTTACGCCTATAATCCCAACACTTTGGGAGGCCGAGGGGGCGGATCACGAGATCA
+GAGATCAAGACCATCCTGGCCAACATGGTGAAACCCCGCCTCTGCAAAAAATAGAAAAATTAGCCAGGCG
+TGGCGGCGTGTGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGACTTGTTTGAACCCAGGAGGC
+AAAGGCTGCAGTGAGCCGAGATCAAGCCACTGCACTTCAGCCTGGGTGACAGAGCGAGACTCCATCTCAA
+AAAAAA1
diff --git a/tables/alus/HUMPOMC b/tables/alus/HUMPOMC
new file mode 100644
index 0000000..e3632cd
--- /dev/null
+++ b/tables/alus/HUMPOMC
@@ -0,0 +1,5 @@
+GGCCGGGGGCGGTGGCTCGCGCCTGTAATGCCAGAACTTTGGGGGGCCAAGGCGGGCGGATCACCTGTTG
+TCGGGAGTTTGAGACCAGCCTGACCAACATGGAGAAACTCCGTCACTACTAAAAATACAAAAATTAGCCG
+GGCGTGGTGGCGCATGCCTGTAATCCCAGCTACTCGGGAGGCTGAAGCAGGAGAATCGCTTGAACCCGGG
+AGGCGAAGGTTGCGGTGAGCAGAGATCACCCCTTTGCTCTCCAGCATGGGCAACAAGAGCGAAACTCCGT
+CTCAAAAAAAA1
diff --git a/tables/alus/HUMPOMC-2 b/tables/alus/HUMPOMC-2
new file mode 100644
index 0000000..719f930
--- /dev/null
+++ b/tables/alus/HUMPOMC-2
@@ -0,0 +1,5 @@
+GGCCGGGCATGGTGGCTCACACCTATAATCCCAGCACTTTGGGAGGCCAAGGCAGGCAGATCACATGAGG
+TCAGGAGTTCAAGACCAGCCTGGCCAACTATGGTGAAATACCATCTCTAGTATAAATAAATAAATAAGCT
+GGGTGTGGTGGCGGGCGCCTGTAATCCCAGCTACTCTGGAGCCTGAGGCAGGAGAACCGCTTGAACCTGG
+GAGGTAGAGGTTGCAGTGAGCCAAGATCACACCACTGCACTCCACACTCCAGCCAGGGCAACAGAGCAAG
+ACTCCGTCTCAAAAAAA1
diff --git a/tables/alus/HUMPOMC1 b/tables/alus/HUMPOMC1
new file mode 100644
index 0000000..0044b51
--- /dev/null
+++ b/tables/alus/HUMPOMC1
@@ -0,0 +1,5 @@
+GGCGGGGCACAGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCTGAGTTGGGCAGATCACGAGGTC
+AAGAGATGGAGACATTCCTGGCCAAGATGGCGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGG
+CGTGGTGGCGCACACCTGTAGTCCCAGCTACTCGGGTGGCTGAGGCAGGCGAATCGCTTGAACCTGGGAG
+GCGGAGGTTGCAGTGAGCCAAGATCGCGCCACTGCACTCCAGCCTGGCGACACAGCAAGACTCCATCTAA
+AAAAAAA1
diff --git a/tables/alus/HUMPOMC2 b/tables/alus/HUMPOMC2
new file mode 100644
index 0000000..4d6a903
--- /dev/null
+++ b/tables/alus/HUMPOMC2
@@ -0,0 +1,5 @@
+GGCTGGGTGTGGTGGCTAACACCTGTAATCCCAGCACTTCAGGATACTGATGTGGATTACTTGAGCCCAG
+GAGCTTGAGACCAGCCTAGGCAACATAGTGAAACCCTGTCTCTACCAAAAACAAAAACAAAAACAAAACA
+AAAAACCAAAACCCAAAATTAGCTGGGCATGGTGGCATGCATCTGTGGTTCCAGGTACTCAGGAGGCTGA
+AGCATGAGGATCACTTGAGCCCAGGAGGTGGAGGCTGCAGTGAGCAGTGTTCACACTGCTGCATTCCAGC
+CTGGGCAACAGAGTGAGACCCTAATTCAAAAACAA1
diff --git a/tables/alus/HUMPOMC6 b/tables/alus/HUMPOMC6
new file mode 100644
index 0000000..f89f53a
--- /dev/null
+++ b/tables/alus/HUMPOMC6
@@ -0,0 +1,4 @@
+GACCAGGCGAGATGGCTCATGGCTGTAATCCCAGCACTTTTAGATGCTAAGGTGGGAGGATTGCTTGAGC
+CCAGGGATTCAAGACAAGCCTGGGCAACATAGTGAGACCTCCACTTCTACAAAAAATAAAAAAGTTAGCC
+AGATGTGGTGGCATGTGCCTGTAGTCCTAGCTACTTGGGAGGTTCACTTGAGGCCAGGAGTCTGACGACA
+CAGTAAGCTATGATCACACCATTGCACTCCAGTCTGGGTAACAGAATGAGACCTTGTCTCAAAACAAA1
diff --git a/tables/alus/HUMRSA1 b/tables/alus/HUMRSA1
new file mode 100644
index 0000000..280a9b9
--- /dev/null
+++ b/tables/alus/HUMRSA1
@@ -0,0 +1,5 @@
+TGGCTCAGGCCTGTAATCCCAGCAAATTTGGAGGCCAAGGCAAGGGGATCACAAGGTGAAGAGATCAAGA
+CCATCCTGGCCAATACAGTGAAACGCTGTCACTACTAAAAATCCAAAAAATTAGYTAGGCATGGYNGCAG
+CTGGTGYAGTYCCAGTGTGGTGTAGTCCCAGCTACCTGGGAGGCTGAGGGAGGACAATTGCTTGGACCCT
+GTGAGGCAGAAGTTGCAGTGAGCAAAGAYGGCGCCAYYGTACCCCACCCYGTGCAACAGGACAAYGAGAT
+TCTGTCTAAAAAAAAA1
diff --git a/tables/alus/HUMRSA16 b/tables/alus/HUMRSA16
new file mode 100644
index 0000000..ec0e1c9
--- /dev/null
+++ b/tables/alus/HUMRSA16
@@ -0,0 +1,5 @@
+GGCTGGGTGCAGTGGCTCACACCTGTAATCTCAGCACTTTGGGAGGCTGAGGCAGGAGGATTACGAGGTC
+AGGAGATTGAGACCATCCTGGCTAACACAGTGAAACCCTGTCTCTATAAAAATACAAAAAAGTAGCCGGG
+CATGGTGGCAGACGCTGTAGTCCCAGCTACTCAGGCTGAAGCAGGAGAATGGCGTGAACCCGGGAGGCAG
+AGCTTGCAGTGAGCCGAGGTCACACCATTGCATTCCAGCCTGGGCGAAAGAGTGAGACTCTGTCTCAAAA
+AAAA1
diff --git a/tables/alus/HUMRSA27 b/tables/alus/HUMRSA27
new file mode 100644
index 0000000..f6552f3
--- /dev/null
+++ b/tables/alus/HUMRSA27
@@ -0,0 +1,4 @@
+GCTGGGTACGGTGGCTCATTCCTGTAATCCCAGCACTTTGGGAGCCGAGGCAGGTGNTCACTTGAGGTCA
+GGAGTTCGAGACCAGCCTGACCAACAGGTGAAACCCCGTCTCTACTAAAAATTGTAATCTCAGCTACTTG
+GGAGGCTGAGGCAGGAGAATCACTTGACNTGGGAGCCAGAGGTTGAAGTGAGCTGAGATCAGCTACTGCA
+CTCCGCNTGGCCAACAAAGTTAGACTCCGCCTAAAAAAAAA1
diff --git a/tables/alus/HUMRSAB11 b/tables/alus/HUMRSAB11
new file mode 100644
index 0000000..79895b3
--- /dev/null
+++ b/tables/alus/HUMRSAB11
@@ -0,0 +1,4 @@
+TGTAATCCCAGCACTTTTGGGAGGCTGAGGAGGGATGGATCACCTGAGGTCAGGAGTTCAAGACCAGCCT
+GGCCAACATGGTGAAACCCCGTTTCTACTAAAAATACAAAAATTAGCTGGGCATGGTGGTGGGCACCTGT
+AATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATTGCTTGAAACCAGGAGGCAGAGGTTGCAGTGAGCT
+GAGATTGCGCCACTGTACTTCAGGCTGTGTGACAGAGTGAGACTCCATCTCAAAAAAAA1
diff --git a/tables/alus/HUMRSAB13 b/tables/alus/HUMRSAB13
new file mode 100644
index 0000000..50363fe
--- /dev/null
+++ b/tables/alus/HUMRSAB13
@@ -0,0 +1,5 @@
+GGCTGGGCATGGTGGCTCACGTCCGTAATCCCAGCAGTTTGGAAGGCTGAGGCAAGTGGATCACTTTAAG
+TCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACCCCAACTCTACTAAAAACACAAAATTAGCCGG
+GCGTGGTGGCGCATGCCTGTAGCCCCAGCTACTCCTGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGA
+GGCAGATGTTGCAGTGAGCCGAGATCACACCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTC
+TCAAA1
diff --git a/tables/alus/HUMRSAB19 b/tables/alus/HUMRSAB19
new file mode 100644
index 0000000..7fdbdb2
--- /dev/null
+++ b/tables/alus/HUMRSAB19
@@ -0,0 +1,4 @@
+CCCAACACTTTGGGAGGCCGAGGTAGATGGATCACCTAAGGTCAGGACTTCAAAACCCAACATGGCAAAA
+CACCATCACTGCTTAAAAAAAGTAATAAAAAATTAGCCCAGTGTAATGACACACACCTGTAGTCTCAGCT
+CTCCTGGAAGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGTGGAGGTTACAGTGAGCCGAGATAGCGC
+CACTGCACTCCAGCCTGAGCAACAGAGGAAG1
diff --git a/tables/alus/HUMRSAB2 b/tables/alus/HUMRSAB2
new file mode 100644
index 0000000..03020b8
--- /dev/null
+++ b/tables/alus/HUMRSAB2
@@ -0,0 +1,5 @@
+GCGGGGCGTGGTAGCTCACACCTGTAATCCCAATACTTTCGGAGGCTGAGGTGGGTGGATAACTTGACGT
+CAGGAGTTCAAGACCAGCTTGACCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGGCAG
+GGCTGGTGGCACGCACCTGTAACCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCTAGA
+GGCAGAGGTTGCAGTGAGCCGAGATCATGCTACTGTACCCAGCCTGGGCAACAGAGTGAGATTCCATCTC
+AAAAAAAA1
diff --git a/tables/alus/HUMRSAB6 b/tables/alus/HUMRSAB6
new file mode 100644
index 0000000..b658315
--- /dev/null
+++ b/tables/alus/HUMRSAB6
@@ -0,0 +1,4 @@
+CGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGTGGGCAGATCACCTGAGCTCAGGAGTT
+TGAGACCAGCCTTGCCAACATGGCAAAACCCCGTCTCTATTAAAAATACAAAAATTAGCCGGGCAAGGAG
+GTGGGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAAACACTTGAACCTGGGAGAGCCGAG
+ATAGTGCCACTGTACTCCAGCCTGGGCAACAGAGTGAGACTCTGTC1
diff --git a/tables/alus/HUMRSAB8 b/tables/alus/HUMRSAB8
new file mode 100644
index 0000000..9b58758
--- /dev/null
+++ b/tables/alus/HUMRSAB8
@@ -0,0 +1,4 @@
+TGTAATCCGAGCACTTTGGGAGGCCAAGGAGGGCAGATCACCTGAAGTCAGGAGTTTGAGACCAGCCTGG
+CCAACATGGTGAAACTCCATCTCTACTGAAAATACAAAAATTAGCCAGGCATGGTGATGCGTGCCTGGAA
+TCCCAGCTACTTAGGAGGCTGAGACAGAAGAATCCCTTAAACCAAGAGGTGGAGGTTGCAGTGAGCCGAG
+ATCGCACGGCTGCACTCCAGCCTGGTGACAGAGCGAGACTCCATCTCAAAAAAAA1
diff --git a/tables/alus/HUMRSAOLD b/tables/alus/HUMRSAOLD
new file mode 100644
index 0000000..4a8ca03
--- /dev/null
+++ b/tables/alus/HUMRSAOLD
@@ -0,0 +1,5 @@
+GGCCAGACATGGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAAGCAGGAGGATCATTTGAGC
+CTGGGAGTTTGAGACCAGCCTGGGCAACATAGCAGACCTCATCTCTACCAAAAAATAAAATAAAAATTAG
+CCAGGTGTGGTGNNGCATGCNNGTGGTGCCAGCTACTCAGAAGGCTGCAGTGGGAGGAGCACTTGAGTCN
+AGGAGGTGGAAGCTGCAGTGAGCCATGATGGCACCACTACACTCCAGCCAGGGCAACAGAGAGAGACTCT
+GTCTCAAAAAGAA1
diff --git a/tables/alus/HUMRSAP3 b/tables/alus/HUMRSAP3
new file mode 100644
index 0000000..ba44a0d
--- /dev/null
+++ b/tables/alus/HUMRSAP3
@@ -0,0 +1,5 @@
+GGCTAGGCGCGGGTTCACGCCTGTAATCCCAGCATTTTGGGAGGCTGAGACGGGTGGATCATGAGGTCAG
+GAGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAATACAAACAACCAGCCAGCC
+GAGTGTGGTGGTGGGCACCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATGACTTGAACCTGG
+AGGTGGAGCTTGCAGTGAGCCAACGATCGCGCCACTGTCATCATCATGGGTGACAGAGAGAGACTCCGTC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMRSKA1 b/tables/alus/HUMRSKA1
new file mode 100644
index 0000000..733ba37
--- /dev/null
+++ b/tables/alus/HUMRSKA1
@@ -0,0 +1,5 @@
+GGCTGGGAGCGGTGGCTCACACCTGCAATTGCAGCACTTTGGGAGCCTAGACAGGCGGATCACGACGTCA
+GGAGTTCAAGATCAGCCTAGCCAACATAGTGAAACCTCACACGCTGGAGGTTCAAAACCAGCCTGCCAAC
+ATGTAACCTCATCGCTAGCTAAAAATAAAAACTTAGCCTACGCTGGTGGCAGGCATGTGTATTCCCAGCA
+ATTTGGGAGGCTGAGGCAGGAGAATCGCTTGATCTGGGAGGCAGAGGTTGCAGTGAGCCAAGATTGTGCC
+ACTGCATTCCAGCCAGGTGACAGCATGAGACTCCGTCACAAAAAAAA1
diff --git a/tables/alus/HUMRSKPA1 b/tables/alus/HUMRSKPA1
new file mode 100644
index 0000000..ea6a0d2
--- /dev/null
+++ b/tables/alus/HUMRSKPA1
@@ -0,0 +1,4 @@
+GGCTCATGGTGGCTCATGCCTATAATCCCAACACTTTGGAGGCTGAGGCAGGAGGATCACTTGAGCCGAG
+GAGTTCAAGACCAGCCTGGGCACATAATGAGATGCTGCCTCTACAGAAAATTTAAAAATTAGCTAGGCAT
+GCTGGAATGTGCCTATAGTCCCAGCTACCCAAGAGACTGATGTGGGAGGATTGCTGGAGCCAGGTGGTAG
+AGGCTGCAGTGAGCCATGACTGGTGCCTGGCAAGAGAGCAAGACAATCTCAAAAAGAA1
diff --git a/tables/alus/HUMSLJT1 b/tables/alus/HUMSLJT1
new file mode 100644
index 0000000..7971887
--- /dev/null
+++ b/tables/alus/HUMSLJT1
@@ -0,0 +1,5 @@
+GGCCGGGCGAAGTGGCTCATGTCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAGGTAGATCAACTGAGG
+CCAGGAGTTCAAGACCAGCTTGGCCAACATGGCAAAACCCCGTCTCTACTAAAAATAAAAAATTTAGCTG
+GGCATGTTGGTGCATGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGG
+AGGCGGAGGCTGCAGTGAGCCGAGATCGTGCCACTGCACTCCAGCCTGGGTGACAGAGTGAGCCCCATCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMTBB5 b/tables/alus/HUMTBB5
new file mode 100644
index 0000000..70571e9
--- /dev/null
+++ b/tables/alus/HUMTBB5
@@ -0,0 +1,4 @@
+TGGCTGCACGCCTGTAATCCCAGCACTTTAGGAGCCATGGCAGGTGTATGCTCCAGCTCAGGAGTTCAAG
+AACAGCCTGGGCAACATGTCGAAACCCCACCTCTACAAAAAAAAGATACAAAAATTAGCCAGATCTGGTA
+GCATAAGCCTGTAGTCCCAGCTACTCAAAAGGCTGGGGCAGGAGGATCACTTGACCTGGGAAGTCAAGGC
+TCAGTGAGCCATGATTGCACCTCTGTACTCCAGCCTGGGCAACATACCAAGACCCTGCCTGAAAA1
diff --git a/tables/alus/HUMTBB5-2 b/tables/alus/HUMTBB5-2
new file mode 100644
index 0000000..5a9ed54
--- /dev/null
+++ b/tables/alus/HUMTBB5-2
@@ -0,0 +1,5 @@
+CTGGGCATGGTGGTTCATGCCTATAAACCCAGGACTTTGGGAGGCTGAGTCAGGAAGAGAGCTTGAGGTC
+AGGAGTTCGAGACCAGCCTGGGCACCATGGTGAGACCCTGTCTCTACAAAAAGAAAAAAATTTTAATTAG
+CTGAGTGAGGTGGTGTGCACCTGTAGTCACAGCTACTCGTGAGGCTAAGGTAGGATAATCACTTGAGCCT
+GGGAGGTTGAGGCTGCCATGAGCTATGATGGTGCCCATGCCATCCAGCCTGAGCGATGGTGTGAGACCCA
+TCTCTAAAAAA1
diff --git a/tables/alus/HUMTBB5-3 b/tables/alus/HUMTBB5-3
new file mode 100644
index 0000000..7e09526
--- /dev/null
+++ b/tables/alus/HUMTBB5-3
@@ -0,0 +1,5 @@
+GGCCGGGCAAGGTGGCTCATGCTTGTAATCCAGCACTTTGGGAGCTGAGTTGGGTGGATCGTTTGAGTCC
+CGGAGTTTGAGACCAGCATGGGCAACATAGAGAGACCCACCTCTCTACAAAGAAAACTAAAAAATTAGCT
+GGGTGTGGTGGTGCATGCCTGTAGGTCCAGCTACTTGGGAGACTGAGGCAAGAGGATCACGTGTGCCCAG
+AGGTGGAGGTGACAGTGAGCCGTGATCATGCCACTGCACTCTAGCCTGGGTGACAGCAGGACAGACAGAC
+CCTGTCTGAAAAAAAA1
diff --git a/tables/alus/HUMTBB5-4 b/tables/alus/HUMTBB5-4
new file mode 100644
index 0000000..7d00fce
--- /dev/null
+++ b/tables/alus/HUMTBB5-4
@@ -0,0 +1,5 @@
+GGGTGGGCACGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACCTGAGG
+TCGGGAGTTTGAGACCAGCCTGACTAACATGGAGAAACCCCATCTCTACTAAAAATAAAAAATTAGCCCA
+GGCATGGTGGCACATGCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAAGAGAATTGCTTGAACGGCGAG
+AGGTTGTGGTGAGCCAAGATCATGACAGTGCACCCCAGCGTGGGCAACAAGAGTGAAACTTCGTCTCCAA
+AAAAA1
diff --git a/tables/alus/HUMTBB5-5 b/tables/alus/HUMTBB5-5
new file mode 100644
index 0000000..2b79bfe
--- /dev/null
+++ b/tables/alus/HUMTBB5-5
@@ -0,0 +1,4 @@
+GACCAGGCGCGTTGCTCATGCCTGTAATCCCAGCACTTTGGGAAGCAGAGGCAGGCAGAGTCACCTGAGA
+GGTCAGGAATAGGAGACCAGCCTGGCCAACATGGCGAAACCCCGTCACTATTAAAAATACAAAAAATTAG
+CTAGGTGTGGTGATGCATGCCTGTAGTCCCAGCTACTCGTGAGGCAGGAGAATCGCTTGAATCTGGGAGG
+TGGAGGTTGCGTCACTGCACTCCAGCCTGGGCAACGGAGACTCCGTCTCAAAAAAAA1
diff --git a/tables/alus/HUMTBB5-6 b/tables/alus/HUMTBB5-6
new file mode 100644
index 0000000..b0cae9e
--- /dev/null
+++ b/tables/alus/HUMTBB5-6
@@ -0,0 +1,5 @@
+GGCCAGGCGTGGTGGCTCACGCTTGTAAACCCAGCACTTTGGGAGGCCGAGGCGGGCGGATAGCGAGGTC
+AGGAGATCAAGACTATCCCGGCTAACACGGTGAGGCCCCGTCTCTACTAAAAATAGAAAAAATTAGCTGG
+ATGTGGTGGTGGGTGCCTGTAGTCTCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGAACCCAGGA
+GGTGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTTTGTCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMTBB5-7 b/tables/alus/HUMTBB5-7
new file mode 100644
index 0000000..cc096e7
--- /dev/null
+++ b/tables/alus/HUMTBB5-7
@@ -0,0 +1,5 @@
+GGTGCGGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCAGGAG
+ATCGAGACCAACCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAATTACAAAAAAAATTAGCCGGGAG
+TGGTGCGGGCGCCTGTAGTCCTAGCTACTCGGGAGGCTAGGCAGGAGAATGGCGTGAACCCAGGAGGCGG
+AAGCTTGGCAGTGAGCCGACATTGTGCCACTGCACTGCAGCCTGGGCAACAGAGCGAGCTCCGTCTCAAA
+ATAA1
diff --git a/tables/alus/HUMTBBM40 b/tables/alus/HUMTBBM40
new file mode 100644
index 0000000..686e880
--- /dev/null
+++ b/tables/alus/HUMTBBM40
@@ -0,0 +1,5 @@
+GGCTGGGCGTGGTGGCTCACCCTTGTAATCCAGCACTTTGGGAGGCCAAGGTGGGCGGATCACGAGGTCA
+AGAGATCTAGACCATCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAATAACAAAAAGTTAGCCTG
+GGCGTGTTGTGCGCGCCTATTGCCCAGCTACTCCGGGGGCCTGAGGCAGGAGAATCGCTGACCCGGGAAG
+CAGAGGTTGCAGTGAGCCGAGATAGCGCCACTGCACTCCAGCCTGGTGACAGAGCGAGACTCCGTCTCAA
+AAAAAA1
diff --git a/tables/alus/HUMTHBNB b/tables/alus/HUMTHBNB
new file mode 100644
index 0000000..d849c15
--- /dev/null
+++ b/tables/alus/HUMTHBNB
@@ -0,0 +1,5 @@
+GGTCAGGTGCAGTGGCTCACCCCTGTAATCCCAGCACTTTGGGAGGCCAAGACAGGAAGATCACTTGAGG
+GCAGGAGTTCGAGGCCAGCTTGGGCAAAATGGTAAAACCCCGTCTCTACTAAAAATGCAAAAATTAGCCA
+GGTGTAGCAGCATGTCCCTGTAGTCCCAGCTACTAAGGAGGCTGAGGCGGGAGGATCGCCTGAGCCCAAG
+AGGCTGAGGCTTCAGTAAGCTGTGACTGTACCATTGCACTCCAGCCTGGGTGACAAGAGTGAGACCCTGT
+CTCAAAAATAA1
diff --git a/tables/alus/HUMTHBNB-2 b/tables/alus/HUMTHBNB-2
new file mode 100644
index 0000000..38a004a
--- /dev/null
+++ b/tables/alus/HUMTHBNB-2
@@ -0,0 +1,5 @@
+GGCTGGGCGCAGTGCCTCACCCCTGTAATCTCAGCAGTTTGGGAGGCCGAGGCGGGAGGATCACAAGGTC
+AGGAGTTTGAGATCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAACATAAAAAAAAATTAGCCAG
+GAGTGGTGGCACACGCCTGTAATCCTAGCTACTCAGGAGGCTGAAGCAGGGGAATCGCTTGAACCCAGGA
+GGCGGAGGTTGCAGTGAGCTGAGATAGGGCCACTGCACTCCAACAGCCTGGGCAAGGGAGTGAGACTCCT
+TCTTAGAAAGAA1
diff --git a/tables/alus/HUMTHBNB-3 b/tables/alus/HUMTHBNB-3
new file mode 100644
index 0000000..8fef218
--- /dev/null
+++ b/tables/alus/HUMTHBNB-3
@@ -0,0 +1,5 @@
+GGCCGGGTGTGGTGGCTCATGCCTGTAATCCCGGCACTTTGGGAGGCTGAGGCAGGTGGATCACTTGAGG
+TCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTTAAAATACAAAAATTAGCCG
+GGTGTGGTGGCGCATGTCTGTAAATTCAGCTACTCAGGAGGCTGAGGCAAGAGAATCGTTTGAACCCGGA
+AGGCAGAAGTTGCAGTGAGCTGAGATCGAGGTAGTGAGCCGAGATTGCAGCACTGCACTCCAGCCTAGGT
+GACAGAGCAAGACTCCATCTAAGAAACAA1
diff --git a/tables/alus/HUMTPA b/tables/alus/HUMTPA
new file mode 100644
index 0000000..727a30f
--- /dev/null
+++ b/tables/alus/HUMTPA
@@ -0,0 +1,5 @@
+GACAGGGCACAGTGACTCACACCTGTAATCTCAGCACTTTGGGAGGCCGAGGTGGGAGGATCGCTTGAGC
+CCAGGAGTTGGAGACCAGTCTGGGCAATATAGTGAGATGCTGTCTCTACAAAAAATTTAAAAATTAGCCG
+GGTGTACTAGTATGCACCTGTGGTCCCAGCTACTCAGGAGGCTGAGGCGGGAGGATCGCTTAAGTTCAGG
+AGGTTGGGACTTCAGTGAGATATGATTACGCCAATGCACTCCAGCCTGGGTGACAAACTGAGATCCTGTC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMTPA-10 b/tables/alus/HUMTPA-10
new file mode 100644
index 0000000..9bc873d
--- /dev/null
+++ b/tables/alus/HUMTPA-10
@@ -0,0 +1,5 @@
+GGCTGGGCACGGTGGCTCATGCCTGTAACCCCAGCACTTTGGGAGACCGAGGTGGGTGGATCACTTGAGT
+CAGGAGTTCGAGACCAGCCTGTTCAATATGGTGAAACCCCGTCTCTACCAAAAATACAAAAATTAGCCGG
+GCATGGTATCGGGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCCAGGA
+GGTGGAGGTTGCAGTGAGCCGAGATTGTGCCACTTCACTCCAGCCTGGGCGACAGAGTGAGACTTTGTTT
+CAAAAATAA1
diff --git a/tables/alus/HUMTPA-11 b/tables/alus/HUMTPA-11
new file mode 100644
index 0000000..25bcb85
--- /dev/null
+++ b/tables/alus/HUMTPA-11
@@ -0,0 +1,4 @@
+GGCCGGGCGTGTTGGCTCACACCTGTACTCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGATG
+TTAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCAACTAAAAATACAAAATTAGCGTA
+GCATGGTGGCGCATGCCTGTAACCCCACCTACTCGGGAGGCTGAGGTAGAAGAATCGCTTGAACCCAGGA
+GGTGGAGATTGCAGTGAGCCAAGATCGCGCATTGCACTCCAGCCTGGGCAACAAAAGGGGA1
diff --git a/tables/alus/HUMTPA-12 b/tables/alus/HUMTPA-12
new file mode 100644
index 0000000..3fe1a9a
--- /dev/null
+++ b/tables/alus/HUMTPA-12
@@ -0,0 +1,5 @@
+GGCCAGGCGTGTTGGCTCACGCCTGTAATCCTAGCACTTTGGGAGGCCAAGGTGGACAGATCACCTGAGG
+TTGGGAGTTCGAGACCAGCCTGGCCAGCATGCCGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCA
+GGCATGGTGGCACACACTTGTAATCCGAGCTACTCGGGAGGCTGAAGAAGGAGAATCGCTTAAACCCAGG
+AGGCGGAGGTTGCAGTGAGCTGAGATTGCACCGTTGCAATCCAGCCTGGGCAACAGAGTGAAACTCCATC
+TCAGAAAAAA1
diff --git a/tables/alus/HUMTPA-13 b/tables/alus/HUMTPA-13
new file mode 100644
index 0000000..661ec75
--- /dev/null
+++ b/tables/alus/HUMTPA-13
@@ -0,0 +1,5 @@
+GGCTGGGCGTGGTGGCTCACGCGTAATCCCAGCACTTTGGGAAGCCAAGGCAGGTGGATCACCTGAGGTC
+AGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAGG
+CATGGTGGCAGGCACCTGTAATCCCAGCTACTCGGGAAGCTGAGGCTGAAGAATTGCCCAAATCCAGGAA
+ACGGAGGTTGCAGTGAGAGGAGATGCGCCGCTGCACTCCAGCCTGGGATAAACAGCGAGACTCTGTCAAA
+AAAAAAA1
diff --git a/tables/alus/HUMTPA-14 b/tables/alus/HUMTPA-14
new file mode 100644
index 0000000..6e647a0
--- /dev/null
+++ b/tables/alus/HUMTPA-14
@@ -0,0 +1,5 @@
+GCCTGGTGTGGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGAGGCTCACCTGAGGT
+CAGGAGTTCGAGACCAGCCTGACCAACATGGAGAAACCCCATCTCTACTAAAAATACAAAATTAGCCTGG
+CATGGTGGCACATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAG
+GCAGAGGTTGCAGTGAGCCGAGATCGCGCCATCGCGCTTCAGCCTGGGCGACAAGAGCGGAACTCGATCT
+CAAGAAAAA1
diff --git a/tables/alus/HUMTPA-15 b/tables/alus/HUMTPA-15
new file mode 100644
index 0000000..31eb09d
--- /dev/null
+++ b/tables/alus/HUMTPA-15
@@ -0,0 +1,5 @@
+GGCTGGGCGCGGTGGCTCACACCTATAATCCCAGCACTTTGGGAGGCTGAGGCAGGTGGATCACGAGGTC
+GGGGGTTTGAGACCAGCCTGACCAACATGGTGAAACCCCGTCTCTACTAAAATACAAAAAATTAGCTGGG
+CGTGGTGGCGGGCACCTGTAATCTCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGTGG
+AGGTTGCAGTGAGCCGAGATCACACCACTGCACTCTAGCCTGGGCGACAGAGCAAGACTCTGTCTCAAAA
+AAAA1
diff --git a/tables/alus/HUMTPA-16 b/tables/alus/HUMTPA-16
new file mode 100644
index 0000000..70d00b0
--- /dev/null
+++ b/tables/alus/HUMTPA-16
@@ -0,0 +1,5 @@
+GACCAGGCACAGTGGCTTATGCCTATAAACCCAGCACTCTGGGAGGCTGAGGCGGGCGGCTCACCTGAGG
+TCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCG
+GATACGGTGGCAGGCACCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGG
+AGATGGAGGTTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGTGATAGAACAAAAAACTGTT
+TCAAAAAAAA1
diff --git a/tables/alus/HUMTPA-17 b/tables/alus/HUMTPA-17
new file mode 100644
index 0000000..b19d821
--- /dev/null
+++ b/tables/alus/HUMTPA-17
@@ -0,0 +1,4 @@
+GGTCAGGTGCGGGGCTCATGCCTGTAATCCCAGCACTTTGAGAGGCTGAGGAAGGCAGATCACCTGAGGT
+CAGGAGTTCGAGACCAGCCTGGCCACCATGGTGAAACCCTGCCTCTACTAAAAACACAAAAATTAGCCAG
+GTGTGGTGGTGCGTGCCCATAAGCACGAGAATTGCTTGAACCCCGGAGGTGGAGGGTGCAGTGAGCTGAG
+ATTGCGCCACTGCACTCCAGCCTGGGTGGGTGACAGAGTGAGACTCTGTCTCAAAACAAA1
diff --git a/tables/alus/HUMTPA-18 b/tables/alus/HUMTPA-18
new file mode 100644
index 0000000..ee5f956
--- /dev/null
+++ b/tables/alus/HUMTPA-18
@@ -0,0 +1,5 @@
+GGCCAGGCATGTTGGCTCATGCCTGTAATCCCAGCACTTCGGGGGGCCGAGGTGGGTGGATCACCTGAGG
+TCACGAGTTCAAGACCAGTCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACCA
+GGTGTGGTGGCAGGCGCCTGTAATCCCAGTTACTCAGAAGGCTGAGGCAGGAGAATCACTTGAACCTGGG
+AGGTGGAGGTTGCAGTAGCCGGGATCATGCCATTGCACTCTAGCCTGGGCAAAAAGAGTGAAACTCCGTC
+TCAAAAAAAA1
diff --git a/tables/alus/HUMTPA-19 b/tables/alus/HUMTPA-19
new file mode 100644
index 0000000..3010aaa
--- /dev/null
+++ b/tables/alus/HUMTPA-19
@@ -0,0 +1,5 @@
+GGCCAGGCACAGTAGTTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGGCTGGCAGATCTCTTGAGG
+TCAGGAGTTCAAAACAAGCCTGGCCAACATAGTGAAACCCCATCTCTACTAAAAATACAAAAAAATTAGC
+TGGGCATGGTGGCGGGCGCCTATAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATCGCTTGAACCCG
+GGAGGCGGAAGTTGCAGTGAGCCAAGATTGCGCTACTGCAGCCTGGGCGACAGAGCAGGACTTCGTCTCA
+AAAAAGA1
diff --git a/tables/alus/HUMTPA-2 b/tables/alus/HUMTPA-2
new file mode 100644
index 0000000..861ff7f
--- /dev/null
+++ b/tables/alus/HUMTPA-2
@@ -0,0 +1,5 @@
+GGCCAGGTGCAGTGGCTCGTGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGAGAACTGCTTGAGC
+CCAGGAGTTTGAGACCAGCCTGGGCAACAAAGTGAGACTCCATCTCTACAAAAAATACAAAAATTAGTCA
+GGTGTGGTGGCTTGTGCCTATGGTCCCAGCTGCTTGGGAGGCTCAGGTGGGAGGATCGCTTGAGCCCAGG
+AGGTTGAGGCCACAGTGAGCAATGATTGTGCCACTGCACTCCAGCCTGGGTGACAGTGAGATCCTGCCTC
+AAAATAAA1
diff --git a/tables/alus/HUMTPA-20 b/tables/alus/HUMTPA-20
new file mode 100644
index 0000000..0e21b7e
--- /dev/null
+++ b/tables/alus/HUMTPA-20
@@ -0,0 +1,4 @@
+GTCTGGGCATGGTGGCTCACGCCTGTAATCCCAATACTTTGCGAGGCCAAGGCGGGCAGATCACTTGAGG
+TCAGGTGTTCAAGACCAGCCTGGCCAACATGGTGAAAACTTAGCTGGGTGTGGTGGCGGGCGCCTGTAAT
+CCAAGCTATTTGGGAGGCTGAGGCAGGAAAATCGCTTGAACCTGGGAGGTGGAGGTTGCAGTGAGCCAAG
+ATTGTGCCACTGCACCCCAGCCTGTGAAAAAGCGAGACTCCATCTTAAAAAAAA1
diff --git a/tables/alus/HUMTPA-21 b/tables/alus/HUMTPA-21
new file mode 100644
index 0000000..4b16d7d
--- /dev/null
+++ b/tables/alus/HUMTPA-21
@@ -0,0 +1,5 @@
+GCGGGGCGTGGTGGCTCACGCCTGTAATCCCACCACTTTGGGAGGCCAAGGCGAGCGGATCACGAGGTCA
+GGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGC
+GTGGTGGCAGGCGCCTGTAGTCCCAGCTACTCAGGAGGCTGAGACATGAGAATGGCATGAACCCGGGAGG
+CGGAGCTGTAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGCGACAGAGCGAGACTCCATCTTAA
+AAGAAA1
diff --git a/tables/alus/HUMTPA-22 b/tables/alus/HUMTPA-22
new file mode 100644
index 0000000..1e9a693
--- /dev/null
+++ b/tables/alus/HUMTPA-22
@@ -0,0 +1,6 @@
+GGCCGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTC
+AGGAGATCGAGACCATCCCGGCTAAAACGGTGAAACCCCGTCTCTACTAAAACTACAAAAAATAGCCGGG
+CGTAGTGGCGGGCGCCTGTAGTCCTGGCTACTTGGGAGGCTGAGGCAGGAGAATGGCATGAACCCGGGAG
+GCGGAGCTTGCAGTGAGCCGAGATCCCGCCACTGCACTCCAGCCTGGGCAACAGAGCGAGACTCCGTCTC
+AAAAAAAA1
+
diff --git a/tables/alus/HUMTPA-3 b/tables/alus/HUMTPA-3
new file mode 100644
index 0000000..62e01bd
--- /dev/null
+++ b/tables/alus/HUMTPA-3
@@ -0,0 +1,5 @@
+GCAAGGTGTGGTGGCTCACATCTGTAATCCCAACACTTTGGGAGGCCAAGGAGGGTGGGTGGCTTGATCC
+TTGGAGTTTGAGACCTCCCTGGGTAACATGGCAAAACCCCATCTCAACAAAAAAATACAAAAAATTATTT
+GGGCATGGTGGTACATGCCTGTAGTCCCAGCTACTTGGGAGGCTGAGGTGGGAGGATGGCCTGAGCCTGG
+GAGTTTGAGGCTGCAGTGAGCTGTGATCATACCACTGCACTCCAGCTCGAGACAGAGCAAGATCCTGTCT
+CAAAAAAAA1
diff --git a/tables/alus/HUMTPA-4 b/tables/alus/HUMTPA-4
new file mode 100644
index 0000000..7cb1b87
--- /dev/null
+++ b/tables/alus/HUMTPA-4
@@ -0,0 +1,5 @@
+GGCCGGGCACACAGCTCCTGCCTGTAATCCCAGCACTTTGGGAGCCCGAGGTGGGCGGGTTGCTTGAGCC
+AAGGAGTTTGAAACCAGCCCGGGTCTTGAACATAGCGAAGACTCTGTCTCTACAAAAAAATGAAAAAAAA
+AAAAAAATTAGCCAGACATGGTGGCACGCACCTGTAGTCCCAGCTACTTGAGAAGCTGAGGTGAGAGGAT
+CACTTGAGCCAGGGAGGTTGAAACTGCAGTGAGCTGTGATCACGCCACTGCACTCCAGTCTGGGTGACTG
+GGCGAGACCCTGTCTCAAAACAAA1
diff --git a/tables/alus/HUMTPA-5 b/tables/alus/HUMTPA-5
new file mode 100644
index 0000000..71791f7
--- /dev/null
+++ b/tables/alus/HUMTPA-5
@@ -0,0 +1,5 @@
+GACTGGGCACAGTGGCTCTTGCCTGTAATCCCAGCATTTTGTGAGGCCATGGCAAGAGGGTTGCTTGAGG
+CCAGGAGTTTGAGACTAGCCTGGGCAACAAAGCAAAACTCTGCTTCTACAAAAAAATTTTATAATTAGCT
+GGATATGGCAGCATGCCCCTGTAGTCCTAGCTACTTGGGAGGCTGAGATGGAGGATCACTTGAGCCCAGG
+AGTTTGAAGGTTCAGTGACCTACGATCACACTGCTGCACTCCAGCCTGGGTGATGGAGTCAGACCTTGTC
+TCTAAA1
diff --git a/tables/alus/HUMTPA-6 b/tables/alus/HUMTPA-6
new file mode 100644
index 0000000..0577ccc
--- /dev/null
+++ b/tables/alus/HUMTPA-6
@@ -0,0 +1,4 @@
+GGTGGTGCATGCCTGTGATCCCAGCACTTTAGGAGGCCAAGGTGGGAGGGTCGCTGGAGCCCGGGAGTTC
+AAGACCAATCTGGGCAAACATAGCAAGTCCCCTGTCTCTACAAAAAATAAAAAAATTAGCCAGACCTGGT
+ATGTAGTCCCAACTACTTGGGAGGTTGAGGCAGAAGGATCACTTGAGCCCAGGAGTTGGAGGCTGCAGTA
+ATCTACGATTATGCCACTGCATTTCAACCTCAGTGACAGGGCAAGCCCTCACCTCTAAAACAAAA1
diff --git a/tables/alus/HUMTPA-7 b/tables/alus/HUMTPA-7
new file mode 100644
index 0000000..1a170bd
--- /dev/null
+++ b/tables/alus/HUMTPA-7
@@ -0,0 +1,5 @@
+GCTGGGCGCGGTGGCTCACACCTTGATCCCAGCACTTTTGGGAGGCTGAGGAAGGAGGATCATTTGTGCC
+CAGGAGTTCGAGACTAGCCTGGACAACATAGAGAAACCTTGTCTCTACACACACACACACACACACACAC
+ACACACACTCTCTCTCTCTCTCTCTCTCAGCCAGGCACAGTGGCACATGCCTGAAGTCCCAGCTCTGGGA
+AGCTGAGGCAGGAGGATCTCTTGAGCCTGGTGGGTCAAGGCTGCAGTGAACCATGTTCATGCCACTGCAC
+TCCAGTCTGGATGACAGAGCGAGACCTAGTCTCAAAAAAAA1
diff --git a/tables/alus/HUMTPA-8 b/tables/alus/HUMTPA-8
new file mode 100644
index 0000000..520256d
--- /dev/null
+++ b/tables/alus/HUMTPA-8
@@ -0,0 +1,5 @@
+GTCCAGGTGCAATGGCTCACATCTGTAACTCCAGCACTTTAGGAGGCGGATCCAGGAGGATGGCTTTAGC
+CTAGGAGTTCAAGACCAGCATGGGCAACATAGTGAGACACCTCTGCCTCCCCCACCCCACCCCGCCTCTA
+CAAAAAAATTTAAAAATTAGCTGGGTGTGGTGGTGTGCACCTGTAGTCCCAGCTACTTGGGAGGTTGACG
+CAGGAGGACTGCTTGAGCCTGGGAGGCAGAGGTTGCAGTGAGCCAAGACAGCACCATTGGACTCCAGCCT
+GGGTAACAGAGACCCTCGAAAAAAAAA1
diff --git a/tables/alus/HUMTPA-9 b/tables/alus/HUMTPA-9
new file mode 100644
index 0000000..e3aed32
--- /dev/null
+++ b/tables/alus/HUMTPA-9
@@ -0,0 +1,5 @@
+GGCCAAACACGGTGACTGATGCTGGTAATCCCAGCACTTTGGGTGGCCGAGGTGGGTGAATCACTTGAGC
+TCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCTATCTCTACTAAAAATACAAAAATGAGCCA
+GGCATGGTGGCAGGTGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGTATTGCTTGAACCCAGG
+AAGTGGGGGTTGCAGTGAGCCGAGATTGTACCACTGCACTCCAGCCTGGGCGACAGAGCAAGACTCCATC
+TCAAAATAAA1
diff --git a/tables/alus/HUMUG2PD b/tables/alus/HUMUG2PD
new file mode 100644
index 0000000..dfc01ce
--- /dev/null
+++ b/tables/alus/HUMUG2PD
@@ -0,0 +1,5 @@
+GCTGGGCGTAGTGGCTCACACCTGTAATCCCAGCAATCTGGGAGGCCCAGGCAGGTGGATCACCTGAGGT
+CAGGAGTTCGAGACCAGCCTGGCCAACATGGCGAAACCCCGTTTCTACTAAAAATACAAAAATTAGCTGG
+GCATGGTGGCGGGTGCCTGTAATCCCAGATACTCGGGAGGCCGAGGCAGGAGCATTGCCCGAACTCGGGA
+GGCAGAGGTTGCAGTGAGCCGAGATGGTGCCGTTGCACTCCAGCCTGGGTACCAAGAGCTAAACTCCATC
+TCAAAAACAA1
diff --git a/tables/alus/HUMUG2PD-2 b/tables/alus/HUMUG2PD-2
new file mode 100644
index 0000000..8941a29
--- /dev/null
+++ b/tables/alus/HUMUG2PD-2
@@ -0,0 +1,5 @@
+GCCGGGCGCGGTGACTCACGCCTGTAATCCCAGCACTTTAGGAGGCCGAGGCAGGTGGATCACGAGGTCA
+GGAGATCGAGACCATTCTGGATAACACGGAGAAACCCCGTCTCTATTAAAAATACAAAAATTAGCCGGGC
+GTGGTGGCACGTGCCTGTAGTCCCAGCTACTTGGGAGGCTGATGCAGGAGAATCTCTTGAACCTGGGAGT
+CCGAGGTTGCAGTGAGCCAAGATTGCGCTACTACACTCCAGCCTGGGCGACAGAGTGAGACTCCATCTAA
+AAAAAAA1
diff --git a/tables/alus/M11591 b/tables/alus/M11591
new file mode 100644
index 0000000..f099570
--- /dev/null
+++ b/tables/alus/M11591
@@ -0,0 +1,5 @@
+GCCGGGCGCTGCCGCTCACCACCTGTAATCCCGGCACTTTGGGAGGCCGAGGCGGGCGGATTACAAGGTC
+AGGAGATTGAGACCATCCTGGCTAACACGATGAAACCCCGTCTCTACTAAAAATACAAAACAATTAGCCA
+GGCGTGGTGGCGGGCGCCGTAGTGCCAGCTACTAGGAAGGCTGAGGCAGGAGAATGGCATAAACCCGGAA
+GGCAGAGCCGCAGTGAACCGAGATCGCGTCACTGCACTCCAGCCTGGGTGACAGAGAGTGAGACTCCGTC
+GCAACAAAAA1
diff --git a/tables/alus/M12036 b/tables/alus/M12036
new file mode 100644
index 0000000..34fd97e
--- /dev/null
+++ b/tables/alus/M12036
@@ -0,0 +1,4 @@
+GGCCGGGCACGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCA
+GGAGTTTGACACCAACCTGGCCAACATGGTGAAATCCCATCTCTACTAAAAATACAAAAATTAGCCGGGC
+GTGGTGGCTCGCGCCTGTAATCCCAGCTACTCAGGAGACTGAGGGAGAAGAATCGCTTTAACCTGGGAGG
+CTGAGCTGAGACCGCACCATTTCACTCTAGCCTGGGTGACAGAGCGAGACTCTGTCTCAAAAAAAA1
diff --git a/tables/alus/M12929 b/tables/alus/M12929
new file mode 100644
index 0000000..acee3b7
--- /dev/null
+++ b/tables/alus/M12929
@@ -0,0 +1,5 @@
+GGCTGGGCACGGTGGCTCATGCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACTGAGGTC
+AGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCCCCACTAAAAATACAAAAAAATTAGCCA
+GGTGTGGTGGCGGGCACCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAAAATCGCTTGAACCTAGG
+AGGCAGAGGTTGCAGTGAGCTGAGGTCTCATTATTGCACTCCAGCCTGGGAGACAAGAGCGAAACTCCAT
+CTCAAAAATAA1
diff --git a/tables/alus/M13121 b/tables/alus/M13121
new file mode 100644
index 0000000..7a8a1d2
--- /dev/null
+++ b/tables/alus/M13121
@@ -0,0 +1,5 @@
+GCTGGACGTGGTGGCTCACAATTGTAATCCCAGCACTCTGGGAGGCTGAGACAGGTGGATCGCCTGAGCC
+CAGGAGTTTGAGACCAGCCTGGGCAACACTGTGAGACCCCATCTCTACAAAACATGCAAAAGTTGGCTGG
+CCATGGTGGCATGAACCTGTGGTCCCAGCTACTCCGGAGGCTGAGGCAGGACTGCTCGAGCCGGGGAGGC
+AAAGGCTGCAGTAAGCCAAGATCACGCCACTCCACTCCAGCCTGGGCAACAAAGCGAGACCCAGTCTCAA
+AGAAAA1
diff --git a/tables/alus/README b/tables/alus/README
new file mode 100644
index 0000000..6c04bd5
--- /dev/null
+++ b/tables/alus/README
@@ -0,0 +1,7 @@
+directory contains 126 alu sequences unaligned
+files is a list of file names with a path using $STADTABL
+
+The files are from Jerzy Jurka.
+
+They can be used with the ALU search program rep.
+
diff --git a/tables/alus/README~ b/tables/alus/README~
new file mode 100644
index 0000000..0fc0236
--- /dev/null
+++ b/tables/alus/README~
@@ -0,0 +1,3 @@
+directory contains 126 alu sequences unaligned
+filesp is a list of file names with a path
+files if same without path
diff --git a/tables/alus/consensi b/tables/alus/consensi
new file mode 100644
index 0000000..236876c
--- /dev/null
+++ b/tables/alus/consensi
@@ -0,0 +1,8 @@
+Families have following members
+
+ALUa
+HUMAPOE4
+HUMTPA
+HUMPOMC1
+HUMRSA1
+
diff --git a/tables/alus/fasta-split b/tables/alus/fasta-split
new file mode 100644
index 0000000..99481f1
--- /dev/null
+++ b/tables/alus/fasta-split
@@ -0,0 +1,17 @@
+#! /bin/sh
+AWK=gawk
+for x in $*
+  do
+    echo splitting ${x}
+$AWK '
+BEGIN { f = ""; }
+/^>/ { if ( f != "" ) close(f); f = substr($1,2);
+"if [ -r " f " ] ; then echo yes ; else echo no ; fi" | getline result;
+if (result == "yes") { print "File " f " already exists"; f = "";} else
+print f;
+next;} 
+{if (f != "") print $0 >> f ;}
+' $x
+  done
+
+
diff --git a/tables/alus/files b/tables/alus/files
new file mode 100644
index 0000000..2ebe7bc
--- /dev/null
+++ b/tables/alus/files
@@ -0,0 +1,126 @@
+STADTABL/alus/ALUA
+STADTABL/alus/HUMA1ATP
+STADTABL/alus/HUMACHRA7
+STADTABL/alus/HUMADAG
+STADTABL/alus/HUMADAG-10
+STADTABL/alus/HUMADAG-11
+STADTABL/alus/HUMADAG-12
+STADTABL/alus/HUMADAG-13
+STADTABL/alus/HUMADAG-14
+STADTABL/alus/HUMADAG-15
+STADTABL/alus/HUMADAG-16
+STADTABL/alus/HUMADAG-17
+STADTABL/alus/HUMADAG-18
+STADTABL/alus/HUMADAG-2
+STADTABL/alus/HUMADAG-3
+STADTABL/alus/HUMADAG-4
+STADTABL/alus/HUMADAG-5
+STADTABL/alus/HUMADAG-6
+STADTABL/alus/HUMADAG-7
+STADTABL/alus/HUMADAG-8
+STADTABL/alus/HUMADAG-9
+STADTABL/alus/HUMAGG
+STADTABL/alus/HUMALBG
+STADTABL/alus/HUMALBG-2
+STADTABL/alus/HUMANFA
+STADTABL/alus/HUMANFA-2
+STADTABL/alus/HUMAPOAI1
+STADTABL/alus/HUMAPOAI1-2
+STADTABL/alus/HUMAPOAII
+STADTABL/alus/HUMAPOAII-2
+STADTABL/alus/HUMAPOCII
+STADTABL/alus/HUMAPOE4
+STADTABL/alus/HUMAPOE4-2
+STADTABL/alus/HUMAPOE4-3
+STADTABL/alus/HUMAPOE4-4
+STADTABL/alus/HUMBLYM1
+STADTABL/alus/HUMC1A21
+STADTABL/alus/HUMC1A23
+STADTABL/alus/HUMC1AIN1
+STADTABL/alus/HUMERPA
+STADTABL/alus/HUMFIXG
+STADTABL/alus/HUMFIXG-2
+STADTABL/alus/HUMFIXG-3
+STADTABL/alus/HUMFIXG-4
+STADTABL/alus/HUMFOL5
+STADTABL/alus/HUMFOL5-2
+STADTABL/alus/HUMGAST2
+STADTABL/alus/HUMGHV
+STADTABL/alus/HUMHBA4
+STADTABL/alus/HUMHBA4-2
+STADTABL/alus/HUMHBA4-3
+STADTABL/alus/HUMHBBRT
+STADTABL/alus/HUMHBBRT-2
+STADTABL/alus/HUMIFNB3
+STADTABL/alus/HUMIFNB3-2
+STADTABL/alus/HUMIFNB3-3
+STADTABL/alus/HUMIFNB3-4
+STADTABL/alus/HUMIFNB3-5
+STADTABL/alus/HUMIL2R8
+STADTABL/alus/HUMINS2
+STADTABL/alus/HUMLDLIVS
+STADTABL/alus/HUMLDLR
+STADTABL/alus/HUMLDLR-2
+STADTABL/alus/HUMMHDC3B
+STADTABL/alus/HUMMHDRB3
+STADTABL/alus/HUMMHDRB3-2
+STADTABL/alus/HUMMYCRT
+STADTABL/alus/HUMNGFB
+STADTABL/alus/HUMPOMC
+STADTABL/alus/HUMPOMC-2
+STADTABL/alus/HUMPOMC1
+STADTABL/alus/HUMPOMC2
+STADTABL/alus/HUMPOMC6
+STADTABL/alus/HUMRSA1
+STADTABL/alus/HUMRSA16
+STADTABL/alus/HUMRSA27
+STADTABL/alus/HUMRSAB11
+STADTABL/alus/HUMRSAB13
+STADTABL/alus/HUMRSAB19
+STADTABL/alus/HUMRSAB2
+STADTABL/alus/HUMRSAB6
+STADTABL/alus/HUMRSAB8
+STADTABL/alus/HUMRSAOLD
+STADTABL/alus/HUMRSAP3
+STADTABL/alus/HUMRSKA1
+STADTABL/alus/HUMRSKPA1
+STADTABL/alus/HUMSLJT1
+STADTABL/alus/HUMTBB5
+STADTABL/alus/HUMTBB5-2
+STADTABL/alus/HUMTBB5-3
+STADTABL/alus/HUMTBB5-4
+STADTABL/alus/HUMTBB5-5
+STADTABL/alus/HUMTBB5-6
+STADTABL/alus/HUMTBB5-7
+STADTABL/alus/HUMTBBM40
+STADTABL/alus/HUMTHBNB
+STADTABL/alus/HUMTHBNB-2
+STADTABL/alus/HUMTHBNB-3
+STADTABL/alus/HUMTPA
+STADTABL/alus/HUMTPA-10
+STADTABL/alus/HUMTPA-11
+STADTABL/alus/HUMTPA-12
+STADTABL/alus/HUMTPA-13
+STADTABL/alus/HUMTPA-14
+STADTABL/alus/HUMTPA-15
+STADTABL/alus/HUMTPA-16
+STADTABL/alus/HUMTPA-17
+STADTABL/alus/HUMTPA-18
+STADTABL/alus/HUMTPA-19
+STADTABL/alus/HUMTPA-2
+STADTABL/alus/HUMTPA-20
+STADTABL/alus/HUMTPA-21
+STADTABL/alus/HUMTPA-22
+STADTABL/alus/HUMTPA-3
+STADTABL/alus/HUMTPA-4
+STADTABL/alus/HUMTPA-5
+STADTABL/alus/HUMTPA-6
+STADTABL/alus/HUMTPA-7
+STADTABL/alus/HUMTPA-8
+STADTABL/alus/HUMTPA-9
+STADTABL/alus/HUMUG2PD
+STADTABL/alus/HUMUG2PD-2
+STADTABL/alus/M11591
+STADTABL/alus/M12036
+STADTABL/alus/M12929
+STADTABL/alus/M13121
diff --git a/tables/alus/filesp b/tables/alus/filesp
new file mode 100644
index 0000000..76c6bfd
--- /dev/null
+++ b/tables/alus/filesp
@@ -0,0 +1,126 @@
+/home4/rs/work/alualualu/alus/unaligned/ALUA
+/home4/rs/work/alualualu/alus/unaligned/HUMA1ATP
+/home4/rs/work/alualualu/alus/unaligned/HUMACHRA7
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-10
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-11
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-12
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-13
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-14
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-15
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-16
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-17
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-18
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-3
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-4
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-5
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-6
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-7
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-8
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-9
+/home4/rs/work/alualualu/alus/unaligned/HUMAGG
+/home4/rs/work/alualualu/alus/unaligned/HUMALBG
+/home4/rs/work/alualualu/alus/unaligned/HUMALBG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMANFA
+/home4/rs/work/alualualu/alus/unaligned/HUMANFA-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAI1
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAI1-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAII
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAII-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOCII
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-3
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-4
+/home4/rs/work/alualualu/alus/unaligned/HUMBLYM1
+/home4/rs/work/alualualu/alus/unaligned/HUMC1A21
+/home4/rs/work/alualualu/alus/unaligned/HUMC1A23
+/home4/rs/work/alualualu/alus/unaligned/HUMC1AIN1
+/home4/rs/work/alualualu/alus/unaligned/HUMERPA
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-3
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-4
+/home4/rs/work/alualualu/alus/unaligned/HUMFOL5
+/home4/rs/work/alualualu/alus/unaligned/HUMFOL5-2
+/home4/rs/work/alualualu/alus/unaligned/HUMGAST2
+/home4/rs/work/alualualu/alus/unaligned/HUMGHV
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4-2
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4-3
+/home4/rs/work/alualualu/alus/unaligned/HUMHBBRT
+/home4/rs/work/alualualu/alus/unaligned/HUMHBBRT-2
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-2
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-3
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-4
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-5
+/home4/rs/work/alualualu/alus/unaligned/HUMIL2R8
+/home4/rs/work/alualualu/alus/unaligned/HUMINS2
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLIVS
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLR
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLR-2
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDC3B
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDRB3
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDRB3-2
+/home4/rs/work/alualualu/alus/unaligned/HUMMYCRT
+/home4/rs/work/alualualu/alus/unaligned/HUMNGFB
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC-2
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC1
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC2
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC6
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA1
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA16
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA27
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB11
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB13
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB19
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB2
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB6
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB8
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAOLD
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAP3
+/home4/rs/work/alualualu/alus/unaligned/HUMRSKA1
+/home4/rs/work/alualualu/alus/unaligned/HUMRSKPA1
+/home4/rs/work/alualualu/alus/unaligned/HUMSLJT1
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-4
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-5
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-6
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-7
+/home4/rs/work/alualualu/alus/unaligned/HUMTBBM40
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-10
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-11
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-12
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-13
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-14
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-15
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-16
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-17
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-18
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-19
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-20
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-21
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-22
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-4
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-5
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-6
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-7
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-8
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-9
+/home4/rs/work/alualualu/alus/unaligned/HUMUG2PD
+/home4/rs/work/alualualu/alus/unaligned/HUMUG2PD-2
+/home4/rs/work/alualualu/alus/unaligned/M11591
+/home4/rs/work/alualualu/alus/unaligned/M12036
+/home4/rs/work/alualualu/alus/unaligned/M12929
+/home4/rs/work/alualualu/alus/unaligned/M13121
diff --git a/tables/alus/filesp~ b/tables/alus/filesp~
new file mode 100644
index 0000000..cb3ac6d
--- /dev/null
+++ b/tables/alus/filesp~
@@ -0,0 +1,132 @@
+/home4/rs/work/alualualu/alus/unaligned/ALUA
+/home4/rs/work/alualualu/alus/unaligned/HUMA1ATP
+/home4/rs/work/alualualu/alus/unaligned/HUMACHRA7
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-10
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-11
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-12
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-13
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-14
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-15
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-16
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-17
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-18
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-3
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-4
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-5
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-6
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-7
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-8
+/home4/rs/work/alualualu/alus/unaligned/HUMADAG-9
+/home4/rs/work/alualualu/alus/unaligned/HUMAGG
+/home4/rs/work/alualualu/alus/unaligned/HUMALBG
+/home4/rs/work/alualualu/alus/unaligned/HUMALBG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMANFA
+/home4/rs/work/alualualu/alus/unaligned/HUMANFA-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAI1
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAI1-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAII
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOAII-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOCII
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-2
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-3
+/home4/rs/work/alualualu/alus/unaligned/HUMAPOE4-4
+/home4/rs/work/alualualu/alus/unaligned/HUMBLYM1
+/home4/rs/work/alualualu/alus/unaligned/HUMC1A21
+/home4/rs/work/alualualu/alus/unaligned/HUMC1A23
+/home4/rs/work/alualualu/alus/unaligned/HUMC1AIN1
+/home4/rs/work/alualualu/alus/unaligned/HUMERPA
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-2
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-3
+/home4/rs/work/alualualu/alus/unaligned/HUMFIXG-4
+/home4/rs/work/alualualu/alus/unaligned/HUMFOL5
+/home4/rs/work/alualualu/alus/unaligned/HUMFOL5-2
+/home4/rs/work/alualualu/alus/unaligned/HUMGAST2
+/home4/rs/work/alualualu/alus/unaligned/HUMGHV
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4-2
+/home4/rs/work/alualualu/alus/unaligned/HUMHBA4-3
+/home4/rs/work/alualualu/alus/unaligned/HUMHBBRT
+/home4/rs/work/alualualu/alus/unaligned/HUMHBBRT-2
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-2
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-3
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-4
+/home4/rs/work/alualualu/alus/unaligned/HUMIFNB3-5
+/home4/rs/work/alualualu/alus/unaligned/HUMIL2R8
+/home4/rs/work/alualualu/alus/unaligned/HUMINS2
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLIVS
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLR
+/home4/rs/work/alualualu/alus/unaligned/HUMLDLR-2
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDC3B
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDRB3
+/home4/rs/work/alualualu/alus/unaligned/HUMMHDRB3-2
+/home4/rs/work/alualualu/alus/unaligned/HUMMYCRT
+/home4/rs/work/alualualu/alus/unaligned/HUMNGFB
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC-2
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC1
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC2
+/home4/rs/work/alualualu/alus/unaligned/HUMPOMC6
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA1
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA16
+/home4/rs/work/alualualu/alus/unaligned/HUMRSA27
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB11
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB13
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB19
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB2
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB6
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAB8
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAOLD
+/home4/rs/work/alualualu/alus/unaligned/HUMRSAP3
+/home4/rs/work/alualualu/alus/unaligned/HUMRSKA1
+/home4/rs/work/alualualu/alus/unaligned/HUMRSKPA1
+/home4/rs/work/alualualu/alus/unaligned/HUMSLJT1
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-4
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-5
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-6
+/home4/rs/work/alualualu/alus/unaligned/HUMTBB5-7
+/home4/rs/work/alualualu/alus/unaligned/HUMTBBM40
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTHBNB-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-10
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-11
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-12
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-13
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-14
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-15
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-16
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-17
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-18
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-19
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-2
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-20
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-21
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-22
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-3
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-4
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-5
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-6
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-7
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-8
+/home4/rs/work/alualualu/alus/unaligned/HUMTPA-9
+/home4/rs/work/alualualu/alus/unaligned/HUMUG2PD
+/home4/rs/work/alualualu/alus/unaligned/HUMUG2PD-2
+/home4/rs/work/alualualu/alus/unaligned/M11591
+/home4/rs/work/alualualu/alus/unaligned/M12036
+/home4/rs/work/alualualu/alus/unaligned/M12929
+/home4/rs/work/alualualu/alus/unaligned/M13121
+/home4/rs/work/alualualu/alus/unaligned/consensi
+/home4/rs/work/alualualu/alus/unaligned/core
+/home4/rs/work/alualualu/alus/unaligned/fasta-split*
+/home4/rs/work/alualualu/alus/unaligned/files
+/home4/rs/work/alualualu/alus/unaligned/filesp
+/home4/rs/work/alualualu/alus/unaligned/files~
diff --git a/tables/alus/files~ b/tables/alus/files~
new file mode 100644
index 0000000..4e45b47
--- /dev/null
+++ b/tables/alus/files~
@@ -0,0 +1,129 @@
+ALUA
+HUMA1ATP
+HUMACHRA7
+HUMADAG
+HUMADAG-10
+HUMADAG-11
+HUMADAG-12
+HUMADAG-13
+HUMADAG-14
+HUMADAG-15
+HUMADAG-16
+HUMADAG-17
+HUMADAG-18
+HUMADAG-2
+HUMADAG-3
+HUMADAG-4
+HUMADAG-5
+HUMADAG-6
+HUMADAG-7
+HUMADAG-8
+HUMADAG-9
+HUMAGG
+HUMALBG
+HUMALBG-2
+HUMANFA
+HUMANFA-2
+HUMAPOAI1
+HUMAPOAI1-2
+HUMAPOAII
+HUMAPOAII-2
+HUMAPOCII
+HUMAPOE4
+HUMAPOE4-2
+HUMAPOE4-3
+HUMAPOE4-4
+HUMBLYM1
+HUMC1A21
+HUMC1A23
+HUMC1AIN1
+HUMERPA
+HUMFIXG
+HUMFIXG-2
+HUMFIXG-3
+HUMFIXG-4
+HUMFOL5
+HUMFOL5-2
+HUMGAST2
+HUMGHV
+HUMHBA4
+HUMHBA4-2
+HUMHBA4-3
+HUMHBBRT
+HUMHBBRT-2
+HUMIFNB3
+HUMIFNB3-2
+HUMIFNB3-3
+HUMIFNB3-4
+HUMIFNB3-5
+HUMIL2R8
+HUMINS2
+HUMLDLIVS
+HUMLDLR
+HUMLDLR-2
+HUMMHDC3B
+HUMMHDRB3
+HUMMHDRB3-2
+HUMMYCRT
+HUMNGFB
+HUMPOMC
+HUMPOMC-2
+HUMPOMC1
+HUMPOMC2
+HUMPOMC6
+HUMRSA1
+HUMRSA16
+HUMRSA27
+HUMRSAB11
+HUMRSAB13
+HUMRSAB19
+HUMRSAB2
+HUMRSAB6
+HUMRSAB8
+HUMRSAOLD
+HUMRSAP3
+HUMRSKA1
+HUMRSKPA1
+HUMSLJT1
+HUMTBB5
+HUMTBB5-2
+HUMTBB5-3
+HUMTBB5-4
+HUMTBB5-5
+HUMTBB5-6
+HUMTBB5-7
+HUMTBBM40
+HUMTHBNB
+HUMTHBNB-2
+HUMTHBNB-3
+HUMTPA
+HUMTPA-10
+HUMTPA-11
+HUMTPA-12
+HUMTPA-13
+HUMTPA-14
+HUMTPA-15
+HUMTPA-16
+HUMTPA-17
+HUMTPA-18
+HUMTPA-19
+HUMTPA-2
+HUMTPA-20
+HUMTPA-21
+HUMTPA-22
+HUMTPA-3
+HUMTPA-4
+HUMTPA-5
+HUMTPA-6
+HUMTPA-7
+HUMTPA-8
+HUMTPA-9
+HUMUG2PD
+HUMUG2PD-2
+M11591
+M12036
+M12929
+M13121
+consensi
+fasta-split*
+files
diff --git a/tables/embl.files b/tables/embl.files
new file mode 100644
index 0000000..b7fe2f1
--- /dev/null
+++ b/tables/embl.files
@@ -0,0 +1,9 @@
+A EMBLDIVPATH/embl_div.lkp
+B EMBLINDPATH/entrynam.idx
+C EMBLINDPATH/acnum.trg
+D EMBLINDPATH/acnum.hit
+E EMBLINDPATH/brief.idx
+F EMBLINDPATH/freetext.trg
+G EMBLINDPATH/freetext.hit
+H EMBLINDPATH/author.trg
+I EMBLINDPATH/author.hit
diff --git a/tables/embl_div.lkp b/tables/embl_div.lkp
new file mode 100644
index 0000000..fd30c6f
--- /dev/null
+++ b/tables/embl_div.lkp
@@ -0,0 +1,15 @@
+     1 EMBLPATH/bb.dat
+     2 EMBLPATH/fun.dat
+     3 EMBLPATH/inv.dat
+     4 EMBLPATH/mam.dat
+     5 EMBLPATH/org.dat
+     6 EMBLPATH/patent.dat
+     7 EMBLPATH/phg.dat
+     8 EMBLPATH/pln.dat
+     9 EMBLPATH/pri.dat
+    10 EMBLPATH/pro.dat
+    11 EMBLPATH/rod.dat
+    12 EMBLPATH/syn.dat
+    13 EMBLPATH/una.dat
+    14 EMBLPATH/vrl.dat
+    15 EMBLPATH/vrt.dat
diff --git a/tables/emblu.files b/tables/emblu.files
new file mode 100644
index 0000000..dba93cb
--- /dev/null
+++ b/tables/emblu.files
@@ -0,0 +1,9 @@
+A EMBLUDIVPATH/emblu_div.lkp
+B EMBLUINDPATH/entrynam.idx
+C EMBLUINDPATH/acnum.trg
+D EMBLUINDPATH/acnum.hit
+E EMBLUINDPATH/brief.idx
+F EMBLUINDPATH/freetext.trg
+G EMBLUINDPATH/freetext.hit
+H EMBLUINDPATH/author.trg
+I EMBLUINDPATH/author.hit
diff --git a/tables/emblu_div.lkp b/tables/emblu_div.lkp
new file mode 100644
index 0000000..1f6cb9c
--- /dev/null
+++ b/tables/emblu_div.lkp
@@ -0,0 +1 @@
+     1 EMBLUPATH/emseq.all
diff --git a/tables/genbank.files b/tables/genbank.files
new file mode 100644
index 0000000..447325d
--- /dev/null
+++ b/tables/genbank.files
@@ -0,0 +1,9 @@
+A GENBDIVPATH/genbank_div.lkp
+B GENBINDPATH/entrynam.idx
+C GENBINDPATH/acnum.trg
+D GENBINDPATH/acnum.hit
+E GENBINDPATH/brief.idx
+F GENBINDPATH/freetext.trg
+G GENBINDPATH/freetext.hit
+H GENBINDPATH/author.trg
+I GENBINDPATH/author.hit
diff --git a/tables/genbank_div.lkp b/tables/genbank_div.lkp
new file mode 100644
index 0000000..ebfc1a1
--- /dev/null
+++ b/tables/genbank_div.lkp
@@ -0,0 +1,14 @@
+     1 GENBPATH/gbbct.seq
+     2 GENBPATH/gbinv.seq
+     3 GENBPATH/gbmam.seq
+     4 GENBPATH/gbnew.seq
+     5 GENBPATH/gborg.seq
+     6 GENBPATH/gbphg.seq
+     7 GENBPATH/gbpln.seq
+     8 GENBPATH/gbpri.seq
+     9 GENBPATH/gbrna.seq
+    10 GENBPATH/gbrod.seq
+    11 GENBPATH/gbsyn.seq
+    12 GENBPATH/gbuna.seq
+    13 GENBPATH/gbvrl.seq
+    14 GENBPATH/gbvrt.seq
diff --git a/tables/libraries.config.csh b/tables/libraries.config.csh
new file mode 100644
index 0000000..27cae0c
--- /dev/null
+++ b/tables/libraries.config.csh
@@ -0,0 +1,154 @@
+#!/bin/csh -f
+
+# This file must be sourced after, or at the end of, staden.login
+
+# It sets up the sequence libraries for use by the software
+
+# Edit this file to set environment variables for the paths to the homes of 
+# your sequence libraries and their associated files
+#
+# First choose a home for the library data - ie the division files, and the 
+# index files. Then set up the paths in this file.
+#
+#####################################################################################
+#
+#         Introduction
+#         ------------
+
+
+# The sequence libraries are made available to the programs by several levels
+# of files. At the top of the tree is a file that says which libraries are
+# available locally and defines environment variables to point to the next
+# level of files. It also tells the programs (with its library type flag) the
+# format of the library (embl, embl-updates and swissprot are of library type A; 
+# PIR, NRL3D are of type B and Genbank type C).
+# The file is called sequence.libs and is distributed in $STADTABL and is given
+# the environment variable SEQUENCELIBRARIES which is defined in this file.
+
+# setenv SEQUENCELIBRARIES $STADTABL/sequence.libs
+
+# Contents of file sequence.libs
+
+#A EMBLFILES EMBL 30 nucleotide library Dec 91! in cdrom format
+#A EMBLUFILES EMBL 30 updates library Dec 91! in cdrom format
+#A SWISSFILES SWISSPROT 20 protein library Nov 91! in cdrom format
+#B PIRFILES PIR 31 protein library Dec 91!
+#B NRL3DFILES NRL3D 58 From Brookhaven protein library Dec 91!
+#C GENBFILES GenBank example!
+#! format is: library type, space, name of library description file, prompt
+#! anything after ! is ignored
+
+# end of file
+
+# For example the first line defines a library of type A, and environment variable
+# (see below) EMBLFILES, and the text "EMBL 30 nucleotide library Dec 91" would
+# appear on the users screen.
+
+
+# Four environment variables are required to define each library so that it can
+# be used by the software:
+
+# 1. The first to a file that contains a list of the files (such as indexes) 
+#    used by the programs.
+# 2. A path to the directory that contains the division_lookup file.
+# 3. A path to the directory that contains the index files.
+# 4. A path to the directory that contains the division (or data) files.
+
+# For example, for EMBL
+
+# setenv EMBLFILES $STADTABL/embl.files
+# setenv EMBLDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+# setenv EMBLINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+# setenv EMBLPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+
+# (Of course the division_lookup, indexes and data files could be in the same place)
+
+#   Making a new library available to the programs
+#   ----------------------------------------------
+# 
+# If making (say) embl available to the programs for the first time you would need
+# to add a line to the file $STADTABL/sequence.libs (embl, embl-updates and
+# swissprot are of library type A; PIR, NRL3D are of type B and Genbank type C),
+# and you would need to define EMBLPATH and EMBLINDPATH. No other changes should
+# be required.
+
+#   Moving the location of indexes or data files
+#   --------------------------------------------
+
+# If you move the data files for embl you need to redefine EMBLPATH. 
+# If you move the index files for embl you would need to redefine EMBLINDPATH.
+
+
+# The other files (on the distribution tape stored in $STADTABL) will only
+# require changes if the libraries are reorganised - eg when embl added two
+# extra divisions in spring 1993 the division lookup file needed changing.
+#
+
+#     End of introdiction
+#     -------------------
+#####################################################################################
+#  
+#
+# First define the file containing the names and environment variables
+# for the locally available libraries.
+#
+
+setenv SEQUENCELIBRARIES $STADTABL/sequence.libs
+
+
+# Now do the individual libraries
+
+#
+#   EMBL library
+#
+
+setenv EMBLFILES $STADTABL/embl.files
+setenv EMBLDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv EMBLPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+setenv EMBLINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+
+#
+#   EMBL updates library
+#
+
+setenv EMBLUFILES $STADTABL/emblu.files
+setenv EMBLUDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv EMBLUPATH /nfs/al/pubseq/pubseq/seqlibs/embl-updates
+setenv EMBLUINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl-updates
+
+#
+#   SWISSPROT library
+#
+
+setenv SWISSFILES $STADTABL/swiss.files
+setenv SWISSDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv SWISSPATH /nfs/al/pubseq/pubseq/seqlibs/swiss
+setenv SWISSINDPATH /nfs/al/pubseq/pubseq/seqlibs/swiss
+
+#
+#   PIR library
+#
+
+setenv PIRFILES $STADTABL/pir.files
+setenv PIRDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv PIRPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+setenv PIRINDPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+
+#
+#   GenBank library
+#
+
+setenv GENBFILES $STADTABL/genbank.files
+setenv GENBDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv GENBPATH /nfs/al/pubseq/pubseq/seqlibs/genbank
+setenv GENBINDPATH /nfs/al/pubseq/pubseq/seqlibs/genbank
+
+#
+#   NRL3D library
+#
+
+setenv NRL3DFILES $STADTABL/nrl3d.files
+setenv NRL3DDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv NRL3DPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+setenv NRL3DINDPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+
diff --git a/tables/libraries.config.sh b/tables/libraries.config.sh
new file mode 100644
index 0000000..6abb278
--- /dev/null
+++ b/tables/libraries.config.sh
@@ -0,0 +1,154 @@
+#!/bin/csh -f
+
+# This file must be sourced after, or at the end of, staden.login
+
+# It sets up the sequence libraries for use by the software
+
+# Edit this file to set environment variables for the paths to the homes of 
+# your sequence libraries and their associated files
+#
+# First choose a home for the library data - ie the division files, and the 
+# index files. Then set up the paths in this file.
+#
+#####################################################################################
+#
+#         Introduction
+#         ------------
+
+
+# The sequence libraries are made available to the programs by several levels
+# of files. At the top of the tree is a file that says which libraries are
+# available locally and defines environment variables to point to the next
+# level of files. It also tells the programs (with its library type flag) the
+# format of the library (embl, embl-updates and swissprot are of library type A; 
+# PIR, NRL3D are of type B and Genbank type C).
+# The file is called sequence.libs and is distributed in $STADTABL and is given
+# the environment variable SEQUENCELIBRARIES which is defined in this file.
+
+# SEQUENCELIBRARIES=$STADTABL/sequence.libs; export SEQUENCELIBRARIES
+
+# Contents of file sequence.libs
+
+#A EMBLFILES EMBL 30 nucleotide library Dec 91! in cdrom format
+#A EMBLUFILES EMBL 30 updates library Dec 91! in cdrom format
+#A SWISSFILES SWISSPROT 20 protein library Nov 91! in cdrom format
+#B PIRFILES PIR 31 protein library Dec 91!
+#B NRL3DFILES NRL3D 58 From Brookhaven protein library Dec 91!
+#C GENBFILES GenBank example!
+#! format is: library type, space, name of library description file, prompt
+#! anything after ! is ignored
+
+# end of file
+
+# For example the first line defines a library of type A, and environment variable
+# (see below) EMBLFILES, and the text "EMBL 30 nucleotide library Dec 91" would
+# appear on the users screen.
+
+
+# Four environment variables are required to define each library so that it can
+# be used by the software:
+
+# 1. The first to a file that contains a list of the files (such as indexes) 
+#    used by the programs.
+# 2. A path to the directory that contains the division_lookup file.
+# 3. A path to the directory that contains the index files.
+# 4. A path to the directory that contains the division (or data) files.
+
+# For example, for EMBL
+
+# EMBLFILES=$STADTABL/embl.files; export EMBLFILES
+# EMBLDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export EMBLDIVPATH
+# EMBLINDPATH=/nfs/al/pubseq/pubseq/seqlibs/embl; export EMBLINDPATH
+# EMBLPATH=/nfs/al/pubseq/pubseq/seqlibs/embl; export EMBLPATH
+
+# (Of course the division_lookup, indexes and data files could be in the same place)
+
+#   Making a new library available to the programs
+#   ----------------------------------------------
+# 
+# If making (say) embl available to the programs for the first time you would need
+# to add a line to the file $STADTABL/sequence.libs (embl, embl-updates and
+# swissprot are of library type A; PIR, NRL3D are of type B and Genbank type C),
+# and you would need to define EMBLPATH and EMBLINDPATH. No other changes should
+# be required.
+
+#   Moving the location of indexes or data files
+#   --------------------------------------------
+
+# If you move the data files for embl you need to redefine EMBLPATH. 
+# If you move the index files for embl you would need to redefine EMBLINDPATH.
+
+
+# The other files (on the distribution tape stored in $STADTABL) will only
+# require changes if the libraries are reorganised - eg when embl added two
+# extra divisions in spring 1993 the division lookup file needed changing.
+#
+
+#     End of introdiction
+#     -------------------
+#####################################################################################
+#  
+#
+# First define the file containing the names and environment variables
+# for the locally available libraries.
+#
+
+SEQUENCELIBRARIES=$STADTABL/sequence.libs; export SEQUENCELIBRARIES
+
+
+# Now do the individual libraries
+
+#
+#   EMBL library
+#
+
+EMBLFILES=$STADTABL/embl.files; export EMBLFILES
+EMBLDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export EMBLDIVPATH
+EMBLPATH=/nfs/al/pubseq/pubseq/seqlibs/embl; export EMBLPATH
+EMBLINDPATH=/nfs/al/pubseq/pubseq/seqlibs/embl; export EMBLINDPATH
+
+#
+#   EMBL updates library
+#
+
+EMBLUFILES=$STADTABL/emblu.files; export EMBLUFILES
+EMBLUDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export EMBLUDIVPATH
+EMBLUPATH=/nfs/al/pubseq/pubseq/seqlibs/embl-updates; export EMBLUPATH
+EMBLUINDPATH=/nfs/al/pubseq/pubseq/seqlibs/embl-updates; export EMBLUINDPATH
+
+#
+#   SWISSPROT library
+#
+
+SWISSFILES=$STADTABL/swiss.files; export SWISSFILES
+SWISSDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export SWISSDIVPATH
+SWISSPATH=/nfs/al/pubseq/pubseq/seqlibs/swiss; export SWISSPATH
+SWISSINDPATH=/nfs/al/pubseq/pubseq/seqlibs/swiss; export SWISSINDPATH
+
+#
+#   PIR library
+#
+
+PIRFILES=$STADTABL/pir.files; export PIRFILES
+PIRDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export PIRDIVPATH
+PIRPATH=/nfs/al/pubseq/pubseq/seqlibs/pir; export PIRPATH
+PIRINDPATH=/nfs/al/pubseq/pubseq/seqlibs/pir; export PIRINDPATH
+
+#
+#   GenBank library
+#
+
+GENBFILES=$STADTABL/genbank.files; export GENBFILES
+GENBDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export GENBDIVPATH
+GENBPATH=/nfs/al/pubseq/pubseq/seqlibs/genbank; export GENBPATH
+GENBINDPATH=/nfs/al/pubseq/pubseq/seqlibs/genbank; export GENBINDPATH
+
+#
+#   NRL3D library
+#
+
+NRL3DFILES=$STADTABL/nrl3d.files; export NRL3DFILES
+NRL3DDIVPATH=/nfs/al/pubseq/pubseq/inhouse/al-generic/tables; export NRL3DDIVPATH
+NRL3DPATH=/nfs/al/pubseq/pubseq/seqlibs/pir; export NRL3DPATH
+NRL3DINDPATH=/nfs/al/pubseq/pubseq/seqlibs/pir; export NRL3DINDPATH
+
diff --git a/tables/libraries.config.sh~ b/tables/libraries.config.sh~
new file mode 100644
index 0000000..27cae0c
--- /dev/null
+++ b/tables/libraries.config.sh~
@@ -0,0 +1,154 @@
+#!/bin/csh -f
+
+# This file must be sourced after, or at the end of, staden.login
+
+# It sets up the sequence libraries for use by the software
+
+# Edit this file to set environment variables for the paths to the homes of 
+# your sequence libraries and their associated files
+#
+# First choose a home for the library data - ie the division files, and the 
+# index files. Then set up the paths in this file.
+#
+#####################################################################################
+#
+#         Introduction
+#         ------------
+
+
+# The sequence libraries are made available to the programs by several levels
+# of files. At the top of the tree is a file that says which libraries are
+# available locally and defines environment variables to point to the next
+# level of files. It also tells the programs (with its library type flag) the
+# format of the library (embl, embl-updates and swissprot are of library type A; 
+# PIR, NRL3D are of type B and Genbank type C).
+# The file is called sequence.libs and is distributed in $STADTABL and is given
+# the environment variable SEQUENCELIBRARIES which is defined in this file.
+
+# setenv SEQUENCELIBRARIES $STADTABL/sequence.libs
+
+# Contents of file sequence.libs
+
+#A EMBLFILES EMBL 30 nucleotide library Dec 91! in cdrom format
+#A EMBLUFILES EMBL 30 updates library Dec 91! in cdrom format
+#A SWISSFILES SWISSPROT 20 protein library Nov 91! in cdrom format
+#B PIRFILES PIR 31 protein library Dec 91!
+#B NRL3DFILES NRL3D 58 From Brookhaven protein library Dec 91!
+#C GENBFILES GenBank example!
+#! format is: library type, space, name of library description file, prompt
+#! anything after ! is ignored
+
+# end of file
+
+# For example the first line defines a library of type A, and environment variable
+# (see below) EMBLFILES, and the text "EMBL 30 nucleotide library Dec 91" would
+# appear on the users screen.
+
+
+# Four environment variables are required to define each library so that it can
+# be used by the software:
+
+# 1. The first to a file that contains a list of the files (such as indexes) 
+#    used by the programs.
+# 2. A path to the directory that contains the division_lookup file.
+# 3. A path to the directory that contains the index files.
+# 4. A path to the directory that contains the division (or data) files.
+
+# For example, for EMBL
+
+# setenv EMBLFILES $STADTABL/embl.files
+# setenv EMBLDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+# setenv EMBLINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+# setenv EMBLPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+
+# (Of course the division_lookup, indexes and data files could be in the same place)
+
+#   Making a new library available to the programs
+#   ----------------------------------------------
+# 
+# If making (say) embl available to the programs for the first time you would need
+# to add a line to the file $STADTABL/sequence.libs (embl, embl-updates and
+# swissprot are of library type A; PIR, NRL3D are of type B and Genbank type C),
+# and you would need to define EMBLPATH and EMBLINDPATH. No other changes should
+# be required.
+
+#   Moving the location of indexes or data files
+#   --------------------------------------------
+
+# If you move the data files for embl you need to redefine EMBLPATH. 
+# If you move the index files for embl you would need to redefine EMBLINDPATH.
+
+
+# The other files (on the distribution tape stored in $STADTABL) will only
+# require changes if the libraries are reorganised - eg when embl added two
+# extra divisions in spring 1993 the division lookup file needed changing.
+#
+
+#     End of introdiction
+#     -------------------
+#####################################################################################
+#  
+#
+# First define the file containing the names and environment variables
+# for the locally available libraries.
+#
+
+setenv SEQUENCELIBRARIES $STADTABL/sequence.libs
+
+
+# Now do the individual libraries
+
+#
+#   EMBL library
+#
+
+setenv EMBLFILES $STADTABL/embl.files
+setenv EMBLDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv EMBLPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+setenv EMBLINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl
+
+#
+#   EMBL updates library
+#
+
+setenv EMBLUFILES $STADTABL/emblu.files
+setenv EMBLUDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv EMBLUPATH /nfs/al/pubseq/pubseq/seqlibs/embl-updates
+setenv EMBLUINDPATH /nfs/al/pubseq/pubseq/seqlibs/embl-updates
+
+#
+#   SWISSPROT library
+#
+
+setenv SWISSFILES $STADTABL/swiss.files
+setenv SWISSDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv SWISSPATH /nfs/al/pubseq/pubseq/seqlibs/swiss
+setenv SWISSINDPATH /nfs/al/pubseq/pubseq/seqlibs/swiss
+
+#
+#   PIR library
+#
+
+setenv PIRFILES $STADTABL/pir.files
+setenv PIRDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv PIRPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+setenv PIRINDPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+
+#
+#   GenBank library
+#
+
+setenv GENBFILES $STADTABL/genbank.files
+setenv GENBDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv GENBPATH /nfs/al/pubseq/pubseq/seqlibs/genbank
+setenv GENBINDPATH /nfs/al/pubseq/pubseq/seqlibs/genbank
+
+#
+#   NRL3D library
+#
+
+setenv NRL3DFILES $STADTABL/nrl3d.files
+setenv NRL3DDIVPATH /nfs/al/pubseq/pubseq/inhouse/al-generic/tables
+setenv NRL3DPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+setenv NRL3DINDPATH /nfs/al/pubseq/pubseq/seqlibs/pir
+
diff --git a/tables/nrl3d.files b/tables/nrl3d.files
new file mode 100644
index 0000000..44c39c1
--- /dev/null
+++ b/tables/nrl3d.files
@@ -0,0 +1,3 @@
+A NRL3DDIVPATH/nrl3d_div.lkp
+B NRL3DINDPATH/nrl3d.idx
+E NRL3DINDPATH/nrl3dbrief.idx
diff --git a/tables/nrl3d_div.lkp b/tables/nrl3d_div.lkp
new file mode 100644
index 0000000..572a7cd
--- /dev/null
+++ b/tables/nrl3d_div.lkp
@@ -0,0 +1 @@
+     1 NRL3DPATH/nrl3d.dat
diff --git a/tables/pir.files b/tables/pir.files
new file mode 100644
index 0000000..ece2a65
--- /dev/null
+++ b/tables/pir.files
@@ -0,0 +1,9 @@
+A PIRDIVPATH/pir_div.lkp
+B PIRINDPATH/entrynam.idx
+C PIRINDPATH/acnum.trg
+D PIRINDPATH/acnum.hit
+E PIRINDPATH/brief.idx
+F PIRINDPATH/freetext.trg
+G PIRINDPATH/freetext.hit
+H PIRINDPATH/author.trg
+I PIRINDPATH/author.hit
diff --git a/tables/pir_div.lkp b/tables/pir_div.lkp
new file mode 100644
index 0000000..11e2b6c
--- /dev/null
+++ b/tables/pir_div.lkp
@@ -0,0 +1,3 @@
+     1 PIRPATH/pir1.dat
+     2 PIRPATH/pir2.dat
+     3 PIRPATH/pir3.dat
diff --git a/tables/sequence.libs b/tables/sequence.libs
new file mode 100644
index 0000000..813e8e9
--- /dev/null
+++ b/tables/sequence.libs
@@ -0,0 +1,8 @@
+A EMBLFILES EMBL 34 nucleotide library March 93! in cdrom format
+A EMBLUFILES EMBL 34 updates library March 93! in cdrom format
+A SWISSFILES SWISSPROT 24 protein library March 93! in cdrom format
+B PIRFILES PIR 36 protein library March 93!
+B NRL3DFILES NRL3D 59 From Brookhaven protein library March 92!
+C GENBFILES GenBank 72!
+! format is library type, space, name of library description file, prompt
+! any text following a ! symbol is treated as comment
diff --git a/tables/swiss.files b/tables/swiss.files
new file mode 100644
index 0000000..3d5194d
--- /dev/null
+++ b/tables/swiss.files
@@ -0,0 +1,9 @@
+A SWISSDIVPATH/swiss_div.lkp
+B SWISSINDPATH/entrynam.idx
+C SWISSINDPATH/acnum.trg
+D SWISSINDPATH/acnum.hit
+E SWISSINDPATH/brief.idx
+F SWISSINDPATH/freetext.trg
+G SWISSINDPATH/freetext.hit
+H SWISSINDPATH/author.trg
+I SWISSINDPATH/author.hit
diff --git a/tables/swiss_div.lkp b/tables/swiss_div.lkp
new file mode 100644
index 0000000..b6ff954
--- /dev/null
+++ b/tables/swiss_div.lkp
@@ -0,0 +1 @@
+     1 SWISSPATH/seq.dat
diff --git a/userdata/DEMO.AR0 b/userdata/DEMO.AR0
new file mode 100644
index 0000000..9d20716
Binary files /dev/null and b/userdata/DEMO.AR0 differ
diff --git a/userdata/DEMO.NAM b/userdata/DEMO.NAM
new file mode 100644
index 0000000..39410c1
--- /dev/null
+++ b/userdata/DEMO.NAM
@@ -0,0 +1,8 @@
+HINW.004
+HINW.009
+HINW.010
+HINW.016
+HINW.017
+HINW.025
+HINW.028
+HINW.031
diff --git a/userdata/DEMO.RL0 b/userdata/DEMO.RL0
new file mode 100644
index 0000000..f9080a0
Binary files /dev/null and b/userdata/DEMO.RL0 differ
diff --git a/userdata/DEMO.RL1F b/userdata/DEMO.RL1F
new file mode 100644
index 0000000..5e0fe7f
--- /dev/null
+++ b/userdata/DEMO.RL1F
@@ -0,0 +1,11 @@
+    50   512     5     8     2
+     1   284     0     8
+   407   274     7     6
+   251   296     4     7
+   230  -263     8     3
+     1   244     0     0
+   457   265     2     0
+   275  -270     3     2
+    59   241     1     4
+   244     0     5     5
+   721     0     1     6
diff --git a/userdata/DEMO.RL2F b/userdata/DEMO.RL2F
new file mode 100644
index 0000000..5e0fe7f
--- /dev/null
+++ b/userdata/DEMO.RL2F
@@ -0,0 +1,11 @@
+    50   512     5     8     2
+     1   284     0     8
+   407   274     7     6
+   251   296     4     7
+   230  -263     8     3
+     1   244     0     0
+   457   265     2     0
+   275  -270     3     2
+    59   241     1     4
+   244     0     5     5
+   721     0     1     6
diff --git a/userdata/DEMO.SQ0 b/userdata/DEMO.SQ0
new file mode 100644
index 0000000..c5eac64
Binary files /dev/null and b/userdata/DEMO.SQ0 differ
diff --git a/userdata/DEMO.SQ1F b/userdata/DEMO.SQ1F
new file mode 100644
index 0000000..3659c25
Binary files /dev/null and b/userdata/DEMO.SQ1F differ
diff --git a/userdata/DEMO.SQ2F b/userdata/DEMO.SQ2F
new file mode 100644
index 0000000..3414101
--- /dev/null
+++ b/userdata/DEMO.SQ2F
@@ -0,0 +1,48 @@
+//HINW.004  
+TTTTCCAGCGTGCGTCTGACGCTGTCTTGCTTAATGATCTCCATCGTGTGCCTAGGTCTG
+TTGCGTTGGGCCGAGCCCAACTTTCCCAAAAACGTATGGATCTTACTGACGTACACGTTG
+CTTACCAGCGTGGCTGTCACGGCGTCAGGCTTCCACTTTAGTCATCGTTCAGTCATTTAT
+GCCATGGTGGCCACAGTGACGCTATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCC
+CGCGACGTGGAATTACAGCGTTcCDTATTGACGGGCGCATCCAC
+//HINW.010  
+TGCACGACATCGAGTATGAGAGTTATATCCCGGGCGCGCTCTGCTTGTACATGGACCTCA
+TGTACCTCTTTGTCTCCGTGCTCTACTTCATGCCCTCCGAGCCCGGCAGCGCCCACACTG
+CTC*AGACGACGGTCGCTGCCACAGCAGCGGCGTCGCCCCAGTTCGTCTCCTAATCGACG
+TGTCCGCGGCGTGACCACGTCACCGTGCCCCACTCGCTCGCTTGTGTATAAGAGAAGGGT
+AGGTGCGCCGCAGCGGCTTTGTGCCGAGACCGTC
+//HINW.009  
+AATTACAGCGTTCCCVATTGACGGGCGCATCCACGCTGATTCTCttVCTGTTCGCCGTAT
+TTTCTCTTTTTCCCGAGGCGGTAAGTGAGATCCTGGTGATGATAGCCGGTTTGGCCGTCA
+TCGTGACCTCGGTGGTGTGCGATACGCAGGACATCCTGCACGACATCGAGTATGAGAGTT
+ATATCCCGGGCGCGCTCTGCTTGTACATGGACCTCATGTACCTCTTTGTCTCCGTGCTCT
+ACTTCATGCCCTCCGAGCCCG*CA*CGCCCACACTGCTC*AGACGACGGTCGCTGC
+//HINW.017  
+ATCTGTTTGCCCGCGACGTGGAATTACAGCGTTCCCTATTGACGGGCGCATCCACGCTGA
+TTCTCTTACTGTTCGCCGTATTTTCTCTTTTTCCCGAGGCGGTAAGTGAGATCCTGGTGA
+TGATAGCCGGTTTGGCCGTCATCGTGACCTCGGTGGTGTGCGATACGCAGGACATCCTGC
+ACGACATCGAGTATGAGAGTTATATCCCGGGCGCGCTCTGCTTGTACATGGACCTCATGT
+ACCTCTTTGTCTCCGTGCTCTAC
+//HINW.016  
+TTCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGC
+AACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGCTTTCGCT
+GGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCTC
+AAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATTATCGCCG
+GCAT
+//HINW.025  
+ATGGACCTcATGTACCTCTTTGTCTCCGTGCTCTACTTCATGCCCTCCGAGCCCGGC3GC
+GCCCACACTGCTC*AGACGACGGTCGCTGCCACAGCAGCGGCGTCGCCCCAGTTCGTCTC
+CTAATCGACGTGTCCGCGGCGTGACCA*GTCACCGTGCCCCACTCGCTCGCTTGTGTATA
+AGAGAAGGGTAGGTGCGCCGCAGCGGCTTTGTGCCGAGACCGTCGCCACCATGCAGGCGA
+GGAGGCTAACGCGCTGCTGCTCTCC
+//HINW.028  
+GCGCATCCACGCTGATTCTCTTACTGTTCGCCGTATTTTCTCTTTTTCCCGAGGCGGTAA
+GTGAGATCCTGGTGATGATAGCCGGTTTGGCCGTCATCGTGACCTCGGTGGTGTGCGATA
+CGCAGGACATCCTGCACGACATCGAGTATGAGAGTTATATCCCGGGCGCGCTCTGCTTGT
+ACATGGACCTCATGTACCTCTTTGTCTCCGTGCTCTACTTCATGCCCTCCGAGCCCGCcG
+*CGCCCACACTGCTCGAGACGACGGTCGCT
+//HINW.031  
+TGTTGCGTTGGGCCGAGCCCAAcTTTCCCAAAAACGTATGGATCTTACTGACGTACA*GT
+TGCTTACCAGCGTGGCTGTCACGGCGTCAGGCTTCCACTTTAGTCATCGTTCAGTCATTT
+ATGCCATGGTGGCCACAGTGacG*TATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTG
+CCCGCGCAGTGGAATTACAGCGTTCCCTATTGacGGGCGcAtCCAcGCTGATTCTCTTCc
+T
diff --git a/userdata/HINW.004 b/userdata/HINW.004
new file mode 100644
index 0000000..f2d3ab2
--- /dev/null
+++ b/userdata/HINW.004
@@ -0,0 +1,5 @@
+ TTTTCCAGCGTGCGTCTGACGCTGTCTTGCTTAATGATCTCCATCGTGTGCCTAGGTCTG
+ TTGCGTTGGGCCGAGCCCAACTTTCCCAAAAACGTATGGATCTTACTGACGTACACGTTG
+ CTTACCAGCGTGGCTGTCACGGCGTCAGGCTTCCACTTTAGTCATCGTTCAGTCATTTAT
+ GCCATGGTGGCCACAGTGACGCTATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCC
+ CGCGACGTGGAATTACAGCGTTCDTATTGACGGGCGCATCCAC
diff --git a/userdata/HINW.009 b/userdata/HINW.009
new file mode 100644
index 0000000..4eaf915
--- /dev/null
+++ b/userdata/HINW.009
@@ -0,0 +1,5 @@
+ AATTACAGCGTTCCCVATTGACGGGCGCATCCACGCTGATTCTCAVCTGTTCGCCGTATT
+ TTCTCTTTTTCCCGAGGCGGTAAGTGAGATCCTGGTGATGATAGCCGGTTTGGCCGTCAT
+ CGTGACCTCGGTGGTGTGCGATACGCAGGACATCCTGCACGACATCGAGTATGAGAGTTA
+ TATCCCGGGCGCGCTCTGCTTGTACATGGACCTCATGTACCTCTTTGTCTCCGTGCTCTA
+ CTTCATGCCCTCCGAGCCCGCACGCCCACACTGCTCAGACGACGGTCGCTGC
diff --git a/userdata/HINW.010 b/userdata/HINW.010
new file mode 100644
index 0000000..392488f
--- /dev/null
+++ b/userdata/HINW.010
@@ -0,0 +1,5 @@
+ TGCACGACATCGAGTATGAGAGTTATATCCCGGGCGCGCTCTGCTTGTACATGGACCTCA
+ TGTACCTCTTTGTCTCCGTGCTCTACTTCATGCCCTCCGAGCCCGGCAGCGCCCACACTG
+ CTCAGACGACGGTCGCTGCCACAGCAGCGGCGTCGCCCCAGTTCGTCTCCTAATCGACGT
+ GTCCGCGGCGTGACCACGTCACCGTGCCCCACTCGCTCGCTTGTGTATAAGAGAAGGGTA
+ GGTGCGCCGCAGCGGCTTTGTGCCGAGACCGTC
diff --git a/userdata/HINW.016 b/userdata/HINW.016
new file mode 100644
index 0000000..17edbc6
--- /dev/null
+++ b/userdata/HINW.016
@@ -0,0 +1,5 @@
+ TTCGGTGGGCGCGGGGCATGACTATCGTCGCCGCACTTATGACTGTCTTCTTTATCATGC
+ AACTCGTAGGACAGGTGCCGGCAGCGCTCTGGGTCATTTTCGGCGAGGACCGCTTTCGCT
+ GGAGCGCGACGATGATCGGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCTC
+ AAGCCTTCGTCACTGGTCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATTATCGCCG
+ GCAT
diff --git a/userdata/HINW.017 b/userdata/HINW.017
new file mode 100644
index 0000000..aafc50e
--- /dev/null
+++ b/userdata/HINW.017
@@ -0,0 +1,5 @@
+ GTAGAGCACGGAGACAAAGAGGTACATGAGGTCCATGTACAAGCAGAGCGCGCCCGGGAT
+ ATAACTCTCATACTCGATGTCGTGCAGGATGTCCTGCGTATCGCACACCACCGAGGTCAC
+ GATGACGGCCAAACCGGCTATCATCACCAGGATCTCACTTACCGCCTCGGGAAAAAGAGA
+ AAATACGGCGAACAGTAAGAGAATCAGCGTGGATGCGCCCGTCAATAGGGAACGCTGTAA
+ TTCCACGTCGCGGGCAAACAGAT
diff --git a/userdata/HINW.025 b/userdata/HINW.025
new file mode 100644
index 0000000..f760eb7
--- /dev/null
+++ b/userdata/HINW.025
@@ -0,0 +1,5 @@
+ ATGGACCTATGTACCTCTTTGTCTCCGTGCTCTACTTCATGCCCTCCGAGCCCGGC3GCG
+ CCCACACTGCTCAGACGACGGTCGCTGCCACAGCAGCGGCGTCGCCCCAGTTCGTCTCCT
+ AATCGACGTGTCCGCGGCGTGACCAGTCACCGTGCCCCACTCGCTCGCTTGTGTATAAGA
+ GAAGGGTAGGTGCGCCGCAGCGGCTTTGTGCCGAGACCGTCGCCACCATGCAGGCGAGGA
+ GGCTAACGCGCTGCTGCTCTCC
diff --git a/userdata/HINW.028 b/userdata/HINW.028
new file mode 100644
index 0000000..bca867c
--- /dev/null
+++ b/userdata/HINW.028
@@ -0,0 +1,5 @@
+ AGCGACCGTCGTCTCGAGCAGTGTGGGCGCTGCGGGCTCGGAGGGCATGAAGTAGAGCAC
+ GGAGACAAAGAGGTACATGAGGTCCATGTACAAGCAGAGCGCGCCCGGGATATAACTCTC
+ ATACTCGATGTCGTGCAGGATGTCCTGCGTATCGCACACCACCGAGGTCACGATGACGGC
+ CAAACCGGCTATCATCACCAGGATCTCACTTACCGCCTCGGGAAAAAGAGAAAATACGGC
+ GAACAGTAAGAGAATCAGCGTGGATGCGC
diff --git a/userdata/HINW.031 b/userdata/HINW.031
new file mode 100644
index 0000000..9a37530
--- /dev/null
+++ b/userdata/HINW.031
@@ -0,0 +1,4 @@
+ TGTTGCGTTGGGCCGAGCCCAA8TTTCCCAAAAACGTATGGATCTTACTGACGTACAGTT
+ GCTTACCAGCGTGGCTGTCACGGCGTCAGGCTTCCACTTTAGTCATCGTTCAGTCATTTA
+ TGCCATGGTGGCCACAGTG77GTATTTTGTTTCCTCACGCTCGCTACGTATCTGTTTGCC
+ CGCGCAGTGGAATTACAGCGTTCCCTATTGCAGGGCGTACCAGCTGATTCTCTTCAT
diff --git a/userdata/README b/userdata/README
new file mode 100644
index 0000000..5406240
--- /dev/null
+++ b/userdata/README
@@ -0,0 +1,8 @@
+   This directory contains sample user data for the staden programs
+   ----------------------------------------------------------------
+
+   m13mp7.seq, lambda.seq are sequences in staden format
+   atpase.dat is a sequence in EMBL format
+   HINW.* are gel readings taht can be assembled by sap
+   DEMO.NAM is a file of file names for the HINW gel readings
+   DEMO.RL0, DEMO.SQ0, DEMO.AR0 is a sap database.
\ No newline at end of file
diff --git a/userdata/atpase.dat b/userdata/atpase.dat
new file mode 100644
index 0000000..f3bc0c3
--- /dev/null
+++ b/userdata/atpase.dat
@@ -0,0 +1,176 @@
+ID   ATPase
+XX
+DT   10-NOV-1982  FIRST ENTRY
+XX
+DE   
+XX
+KW   
+XX
+FH   KEY        FROM     TO       DESCRIPTION
+FH
+FT   CDS         576    968       1   0 391
+FT   CDS         977   1792       a   1 814
+FT   CDS        1839   2081       c  6-12 242
+FT   CDS        2140   2610       b   2 439
+FT   CDS        2625   3158       DELTA   1 532
+FT   CDS        3171   4712       ALPHA   3 1540
+FT   CDS        4763   5626       GAMMA   1 864
+FT   CDS        5653   7035       BETA    3 1381
+FT   CDS        7056   7400       EPSILON 1 343
+FT   CDS3        576    968       1   0 391
+FT   CDS2        977   1792       a   1 814
+FT   CDS3       1839   2081       c  6-12 242
+FT   CDS1       2140   2610       b   2 439
+FT   CDS3       2625   3158       DELTA   1 532
+FT   CDS3       3171   4712       ALPHA   3 1540
+FT   CDS2       4763   5626       GAMMA   1 864
+FT   CDS1       5653   7035       BETA    3 1381
+FT   CDS3       7056   7400       EPSILON 1 343
+FT   STOK        576    968       1   0 391
+FT   STOK        977   1792       1   1 814
+FT   STOK       1839   2081       6  6-12 242
+FT   STOK       2140   2610       2   2 439
+FT   STOK       2625   3158       1ELTA   1 532
+FT   STOK       3171   4712       3LPHA   3 1540
+FT   STOK       4763   5626       1AMMA   1 864
+FT   STOK       5653   7035       3ETA    3 1381
+FT   STOK       7056   7400       1PSILON 1 343
+XX
+SQ
+     CCGGTTAGAC TGTTAACAAC AACCAGGTTT TCTACTGATA TAACTGGTTA CATTTAACGC
+     CACGTTCACT CTTTTGCATC AACAAGATAA CGTGGCTTTT TTTGGTAAGC AGAAAATAAG
+     TCATTAGTGA AAATATCAGT CTGCTAAAAA TCGGCGCTAA GAACCATCAT TGGCTGTTAA
+     AACAGTTATT AAAAATGTCA ATGGGTGGTT TTTGTTGTGT AAATGTCATT TATTAAACAG
+     GTATCTGTTT TTAGACTGAA ATATCATAAA CTTGCAAGGC ATCATTTGCC AAGTAAATAA
+     ATATGCTGTG CGCGAACATG CGCAATATGT GATCTGAAGC ACGCTTTATC ACCAGTGTTT
+     ACGCGTTATT TACAGTTTTT CATGATCGAA CAGGGTTAGC AGAAAAGTCG CAATTGTATG
+     CACTGGAAAA ATATTTAAAC ATTTATTCAC CTTTTGGCTA CTTATTGTTT GAAATCACGG
+     GGGCGCACCG TATAATTTGA CCGCTTTTTG ATGCTTGACT CTAAGCCTTA AAGAAAGTTT
+     TATACGACAC GCGGCATACC TCGAAGGGAG CAGGAGTGAA AAACGTGATG TCTGTGTCGC
+     TCGTGAGTCG AAACGTTGCT CGGAAGCTTC TGCTCGTTCA GTTACTGGTG GTGATAGCAA
+     GTGGATTGCT GTTCAGCCTC AAAGACCCCT TCTGGGGCGT CTCTGCAATA AGCGGGGGCC
+     TGGCAGTCTT TCTGCCTAAC GTTTTGTTTA TGATATTTGC CTGGCGTCAC CAGGCGCATA
+     CACCAGCGAA AGGCCGGGTG GCCTGGACAT TCGCATTTGG CGAAGCTTTC AAAGTTCTGG
+     CGATGTTGGT GTTACTGGTG GTGGCGTTGG CGGTTTTAAA GGCGGTATTC TTGCCGCTGA
+     TCGTTACGTG GGTTTTGGTG CTGGTGGTTC AGATACTGGC ACCGGCTGTA ATTAACAACA
+     AAGGGTAAAA GGCATCATGG CTTCAGAAAA TATGACGCCG CAGGATTACA TAGGACACCA
+     CCTGAATAAC CTTCAGCTGG ACCTGCGTAC ATTCTCGCTG GTGGATCCAC AAAACCCCCC
+     AGCCACCTTC TGGACAATCA ATATTGACTC CATGTTCTTC TCGGTGGTGC TGGGTCTGTT
+     GTTCCTGGTT TTATTCCGTA GCGTAGCCAA AAAGGCGACC AGCGGTGTGC CAGGTAAGTT
+     TCAGACCGCG ATTGAGCTGG TGATCGGCTT TGTTAATGGT AGCGTGAAAG ACATGTACCA
+     TGGCAAAAGC AAGCTGATTG CTCCGCTGGC CCTGACGATC TTCGTCTGGG TATTCCTGAT
+     GAACCTGATG GATTTACTGC CTATCGACCT GCTGCCGTAC ATTGCTGAAC ATGTACTGGG
+     TCTGCCTGCA CTGCGTGTGG TTCCGTCTGC GGACGTGAAC GTAACGCTGT CTATGGCACT
+     GGGCGTATTT ATCCTGATTC TGTTCTACAG CATCAAAATG AAAGGCATCG GCGGCTTCAC
+     GAAAGAGTTG ACGCTGCAGC CGTTCAATCA CTGGGCGTTC ATTCCTGTCA ACTTAATCCT
+     TGAAGGGGTA AGCCTGCTGT CCAAACCAGT TTCACTCGGT TTGCGACTGT TCGGTAACAT
+     GTATGCCGGT GAGCTGATTT TCATTCTGAT TGCTGGTCTG TTGCCGTGGT GGTCACAGTG
+     GATCCTGAAT GTGCCGTGGG CCATTTTCCA CATCCTGATC ATTACGCTGC AAGCCTTCAT
+     CTTCATGGTT CTGACGATCG TCTATCTGTC GATGGCGTCT GAAGAACATT AATTTACCAA
+     CACTACTACG TTTTAACTGA AACAAACTGG AGACTGTCAT GGAAAACCTG AATATGGATC
+     TGCTGTACAT GGCTGCCGCT GTGATGATGG GTCTGGCGGC AATCGGTGCT GCGATCGGTA
+     TCGGCATCCT CGGGGGTAAA TTCCTGGAAG GCGCAGCGCG TCAACCTGAT CTGATTCCTC
+     TGCTGCGTAC TCAGTTCTTT ATCGTTATGG GTCTGGTGGA TGCTATCCCG ATGATCGCTG
+     TAGGTCTGGG TCTGTACGTG ATGTTCGCTG TCGCGTAGTA AGCGTTGCTT TTATTTAAAG
+     AGCAATATCA GAACGTTAAC TAAATAGAGG CATTGTGCTG TGAATCTTAA CGCAACAATC
+     CTCGGCCAGG CCATCGCGTT TGTCCTGTTC GTTCTGTTCT GCATGAAGTA CGTATGGCCG
+     CCATTAATGG CAGCCATCGA AAAACGTCAA AAAGAAATTG CTGACGGCCT TGCTTCCGCA
+     GAACGAGCAC ATAAGGACCT TGACCTTGCA AAGGCCAGCG CGACCGACCA GCTGAAAAAA
+     GCGAAAGCGG AAGCCCAGGT AATCATCGAG CAGGCGAACA AACGCCGCTC GCAGATTCTG
+     GACGAAGCGA AAGCTGAGGC AGAACAGGAA CGTACTAAAA TCGTGGCCCA GGCGCAGGCG
+     GAAATTGAAG CCGAGCGTAA ACGTGCCCGT GAAGAGCTGC GTAAGCAAGT TGCTATCCTG
+     GCTGTTGCTG GCGCCGAGAA GATCATCGAA CGTTCCGTGG ATGAAGCTGC TAACAGCGAC
+     ATCGTGGATA AACTTGTCGC TGAACTGTAA GGAGGGAGGG GCTGATGTCT GAATTTATTA
+     CGGTAGCTCG CCCCTACGCC AAAGCAGCTT TTGACTTTGC CGTCGAACAC CAAAGTGTAG
+     AACGCTGGCA GGACATGCTG GCGTTTGCCG CCGAGGTAAC CAAAAACGAA CAAATGGCAG
+     AGCTTCTCTC TGGCGCGCTT GCGCCAGAAA CGCTCGCCGA GTCGTTTATC GCAGTTTGTG
+     GTGAGCAACT GGACGAAAAC GGTCAGAACC TGATTCGGGT TATGGCTGAA AATGGTCGTC
+     TTAACGCGCT CCCGGATGTT CTGGAGCAGT TTATTCACCT GCGTGCCGTG AGTGAGGCTA
+     CCGCTGAGGT AGACGTCATT TCCGCTGCCG CACTGAGTGA ACAACAGCTC GCGAAAATTT
+     CTGCTGCGAT GGAAAAACGT CTGTCACGCA AAGTTAAGCT GAATTGCAAA ATCGATAAGT
+     CTGTAATGGC AGGCGTTATC ATCCGAGCGG GTGATATGGT CATTGATGGC AGCGTACGCG
+     GTCGTCTTGA GCGCCTTGCA GACGTCTTGC AGTCTTAAGG GGACTGGAGC ATGCAACTGA
+     ATTCCACCGA AATCAGCGAA CTGATCAAGC AGCGCATTGC TCAGTTCAAT GTTGTGAGTG
+     AAGCTCACAA CGAAGGTACT ATTGTTTCTG TAAGTGACGG TGTTATCCGC ATTCACGGCC
+     TGGCCGATTG TATGCAGGGT GAAATGATCT CCCTGCCGGG TAACCGTTAC GCTATCGCAC
+     TGAACCTCGA GCGCGACTCT GTAGGTGCGG TTGTTATGGG TCCGTACGCT GACCTTGCCG
+     AAGGCATGAA AGTTAAGTGT ACTGGCCGTA TCCTGGAAGT TCCGGTTGGC CGTGGCCTGC
+     TGGGCCGTGT GGTTAACACT CTGGGTGCAC CAATCGACGG TAAAGGTCCG CTGGATCACG
+     ACGGCTTCTC TGCTGTAGAA GCAATCGCTC CGGGCGTTAT CGAACGTCAG TCCGTAGATC
+     AGCCGGTACA GACCGGTTAT AAAGCCGTTG ACTCCATGAT CCCAATCGGT CGTGGTCAGC
+     GTGAATTGAT CATCGGTGAC CGTCAGACAG GTAAAACCGC ACTGGCTATC GATGCCATCA
+     TCAACCAGCG CGATTCCGGT ATCAAATGTA TCTATGTCGC TATCGGCCAG AAAGCGTCCA
+     CCATTTCTAA CGTGGTACGT AAACTGGAAG AGCACGGCGC ACTGGCTAAC ACCATCGTTG
+     TGGTAGCAAC CGCGTCTGAA TCCGCTGCAC TGCAATACCT GGCACGTATG CCGGTTGCGC
+     TAATGGGCGA ATACTTCCGT GACCGCGGTG AAGATGCGCT GATCATTTAC GATGACCTGT
+     CTAAACAGGC TGTTGCTTAC CGTCAGATCT CCCTGCTGCT CCGTCGTCCG CCAGGACGTG
+     AAGCATTCCC GGGCGACGTT TTCTACCTCC ACTCTCGTCT GCTGGAGCGT GCTGCACGTG
+     TTAACGCCGA ATACGTTGAA GCCTTCACCA AAGGTGAAGT GAAAGGGAAA ACCGGTTCTC
+     TGACCGCACT GCCGATTATC GAAACTCAGG CGGGTGACGT TTCTGCGTTC GTTCCGACCA
+     ACGTAATCTC CATTACCGAT GGTCAGATCT TCCTGGAAAC CAACCTGTTC AACGCCGGTA
+     TTCGTCCTGC GGTTAACCCG GGTATTTCCG TATCCCGTGT TGGTGGTGCA GCACAGACCA
+     AGATCATGAA AAAACTGTCC GGTGGTATCC GTACCGCTCT GGCACAGTAT CGTGAACTGG
+     CAGCGTTCTC TCAGTTTGCA TCCGACCTTG ACGATGCAAC ACGTAAGCAG CTTGACCACG
+     GTCAGAAAGT GACCGAACTG CTGAAACAGA AACAGTATGC GCCGATGTCC GTTGCGCAGC
+     AGTCTCTGGT TCTGTTCGCA GCAGAACGTG GTTACCTGGC GGATGTTGAA CTGTCGAAAA
+     TTGGCAGCTT CGAAGCCGCT CTGCTGGCTT ACGTCGACCG TGATCACGCT CCGTTGATGC
+     AAGAGATCAA CCAGACCGGT GGCTACAACG ACGAAATCGA AGGCAAGCTG AAAGGCATCC
+     TCGATTCCTT CAAAGCAACC CAATCCTGGT AACGTCTGGC GGCTTGCCTT AGGGCAGGCC
+     GCAAGGCATT GAGGAGAAGC TCATGGCCGG CGCAAAAGAG ATACGTAGTA AGATCGCAAG
+     CGTCCAGAAC ACGCAAAAGA TCACTAAAGC GATGGAGATG GTCGCCGCTT CCAAAATGCG
+     TAAATCGCAG GATCGCATGG CGGCCAGCCG TCCTTATGCA GAAACCATGC GCAAAGTGAT
+     TGGTCACCTT GCACACGGTA ATCTGGAATA TAAGCACCCT TACCTGGAAG ACCGCGACGT
+     TAAACGCGTG GGCTACCTGG TGGTGTCGAC CGACCGTGGT TTGTGCGGTG GTTTGAACAT
+     TAACCTGTTC AAAAAACTGC TGGCGGAAAT GAAGACCTGG ACCGACAAAG GCGTTCAATG
+     CGACCTCGCA ATGATCGGCT CGAAAGGCGT GTCGTTCTTC AACTCCGTGG GCGGCAATGT
+     TGTTGCCCAG GTCACCGGCA TGGGGGATAA CCCTTCCCTG TCCGAACTGA TCGGTCCGGT
+     AAAAGTGATG TTGCAGGCCT ACGACGAAGG CCGTCTGGAC AAGCTTTACA TTGTCAGCAA
+     CAAATTTATT AACACCATGT CTCAGGTTCC GACCATCAGC CAGCTGCTGC CGTTACCGGC
+     ATCAGATGAT GATGATCTGA AACATAAATC CTGGGATTAC CTGTACGAAC CCGATCCGAA
+     GGCGTTGCTG GATACCCTGC TGCGTCGTTA TGTCGAATCT CAGGTTTATC AGGGCGTGGT
+     TGAAAACCTG GCCAGCGAGC AGGCCGCCCG TATGGTGGCG ATGAAAGCCG CGACCGACAA
+     TGGCGGCAGC CTGATTAAAG AGCTGCAGTT GGTATACAAC AAAGCTCGTC AGGCCAGCAT
+     TACTCAGGAA CTCACCGAGA TCGTCTCGGG GGCCGCCGCG GTTTAAACAG GTTATTTCGT
+     AGAGGATTTA AGATGGCTAC TGGAAAGATT GTCCAGGTAA TCGGCGCCGT AGTTGACGTC
+     GAATTCCCTC AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTGCA AAATGGTAAT
+     GAGCGTCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCA
+     ATGGGTTCCT CCGACGGTCT GCGTCGCGGT CTGGATGTAA AAGACCTCGA ACACCCGATT
+     GAAGTCCCGG TAGGTAAAGC GACTCTGGGC CGTATCATGA ACGTACTGGG TGAACCGGTC
+     GACATGAAAG GCGAGATCGG TGAAGAAGAG CGTTGGGCGA TTCACCGCGC AGCACCTTCC
+     TACGAAGAGC TGTCAAACTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG
+     ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA
+     ACCGTAAACA TGATGGAGCT CATTCGTAAC ATCGCGATCG AGCACTCCGG TTACTCTGTG
+     TTTGCGGGCG TAGGTGAACG TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC
+     TCCAACGTTA TCGACAAAGT ATCCCTGGTG TATGGCCAGA TGAACGAGCC GCCGGGAAAC
+     CGTCTGCGCG TTGCTCTGAC CGGTCTGACC ATGGCTGAGA AATTCCGTGA CGAAGGTCGT
+     GACGTTCTGC TGTTCGTTGA CAACATCTAT CGTTACACCC TGGCCGGTAC GGAAGTATCC
+     GCACTGCTGG GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG
+     GGCGTTCTGC AGGAACGTAT CACCTCCACC AAAACTGGTT CTATCACCTC CGTACAGGCA
+     GTATACGTAC CTGCGGATGA CTTGACTGAC CCGTCTCCGG CAACCACCTT TGCGCACCTT
+     GACGCAACCG TGGTACTGAG CCGTCAGATC GCGTCTCTGG GTATCTACCC GGCCGTTGAC
+     CCGCTGGACT CCACCAGCCG TCAGCTGGAC CCGCTGGTGG TTGGTCAGGA ACACTACGAC
+     ACCGCGCGTG GCGTTCAGTC CATCCTGCAA CGTTATCAGG AACTGAAAGA CATCATCGCC
+     ATCCTGGGTA TGGATGAACT GTCTGAAGAA GACAAACTGG TGGTAGCGCG TGCTCGTAAG
+     ATCCAGCGCT TCCTGTCCCA GCCGTTCTTC GTGGCAGAAG TATTCACCGG TTCTCCGGGT
+     AAATACGTCT CCCTGAAAGA CACCATCCGT GGCTTTAAAG GCATCATGGA AGGCGAATAC
+     GATCACCTGC CGGAGCAGGC GTTCTACATG GTCGGTTCCA TCGAAGAAGC TGTGGAAAAA
+     GCCAAAAAAC TTTAACGCCT TAATCGGAGG GTGATATGGC AATGACTTAC CACCTGGACG
+     TCGTCAGCGC AGAGCAACAA ATGTTCTCTG GTCTGGTCGA GAAAATCCAG GTAACGGGTA
+     GCGAAGGTGA ACTGGGGATC TACCCTGGCC ACGCACCGCT GCTCACCGCC ATTAAGCCTG
+     GTATGATTCG CATCGTGAAA CAGCACGGTC ACGAAGAGTT TATCTATCTG TCTGGCGGCA
+     TTCTTGAAGT GCAGCCTGGC AACGTGACCG TTCTGGCCGA CACCGCAATT CGCGGCCAGG
+     ATCTCGACGA AGCGCGAGCC ATGGAAGCGA AACGTAAGGC TGAAGAGCAC ATTAGCAGCT
+     CTCACGGCGA CGTAGATTAC GCTCAGGCGT CTGCGGAACT GGCCAAAGCG ATCGCGCAGC
+     TGCGGTTATC GAGTTGACCA AAAAAGCGAT GTAACACCGG CTTGAAAAGC ACAAAAGCCA
+     GTCTGGAAAC AGGCTGGCTT TTTTTTGCGC GTGTGACCCG TCCTGAATAG CGTTCACATA
+     GATCCTGCTG ATATAAAACC CCCCTGTTTT CCTGTTTATT CATTGATCGA AATAAGAGCA
+     AAAACATCCA CCTGACGCTT AAATTAAGGT ACTGCCTTAA TTTTCTGCAG ACAAAAGGCG
+     TGACGATGGT CGAAAATGGC GCTTTCGTCA GCGGGGATAA TCCGTTATTG AACAATTTAT
+     CCTCTGTCCA TTTCACGATG AAAAAAATGT AGTTTTTTCA AGGTGAAGCG GTTTAAATTC
+     GTTCTCAAAT TACAGTCAGG ACGCGTATGT TGAATAATGC TATGAGCGTA GTGATCCTTG
+     CCGCACGAAG GCACGCGCAT GTATTCCGAT CTTCCGAAAG TGCTGCATAC CCTGCCGGGA
+     AAGCGATGGT TCAGCATGTC ATTGATGCTG CGAATGAATT AGGCGCACGG CACGTTCACC
+     TGGTGTACGG TCACGGCGGC GATCTGCTAA AACAGGCGCT GAAAGACGAC AACCTTAACT
+     GGGTGCTTCA GGCAGAGCAG CTGGGTACGG GTCATGCAAT GCAGCAGGCC GCACCTTTCT
+     TTGCCGATGA TGAAGACATT TTAATGCTCT ACGC
+//
diff --git a/userdata/blue.vec b/userdata/blue.vec
new file mode 100644
index 0000000..c999243
--- /dev/null
+++ b/userdata/blue.vec
@@ -0,0 +1,48 @@
+<---blue.001------->
+;BlueScribe cloning vector from entry blue in vecbase
+ TCGCGCGTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCA
+ CAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTG
+ TTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGC
+ ACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCC
+ ATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCTAT
+ TACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGT
+ TTTCCCAGTCACGACGTTGTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGG
+ GCGAATTCGAGCTCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTT
+ TGTTCCCTTTAGTGAGGGTTAATTCCGAGCTTGGCGTAATCATGGTCATAGCTGTTTCCT
+ GTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGT
+ AAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCC
+ GCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGGG
+ AGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCG
+ GTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACA
+ GAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAAC
+ CGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCAC
+ AAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCG
+ TTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATAC
+ CTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTAT
+ CTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAG
+ CCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGAC
+ TTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGT
+ GCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGT
+ ATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGC
+ AAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGA
+ AAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAAC
+ GAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATC
+ CTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCT
+ GACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCA
+ TCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCT
+ GGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCA
+ ATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCC
+ ATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTG
+ CGCAACGTTGTTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCT
+ TCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAA
+ AAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTA
+ TCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGC
+ TTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCG
+ AGTTGCTCTTGCCCGGCGTCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAA
+ GTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTG
+ AGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTC
+ ACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGG
+ GCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTAT
+ CAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATA
+ GGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATC
+ ATGACATTAACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTC
diff --git a/userdata/lambda.seq b/userdata/lambda.seq
new file mode 100644
index 0000000..c49bf21
--- /dev/null
+++ b/userdata/lambda.seq
@@ -0,0 +1,810 @@
+<---LAMBDA.001----->
+GGGCGGCGACCTCGCGGGTTTTCGCTATTTATGAAAATTTTCCGGTTTAAGGCGTTTCCG
+TTCTTCTTCGTCATAACTTAATGTTTTTATTTAAAATACCCTCTGAAAAGAAAGGAAACG
+ACAGGTGCTGAAAGCGAGGCTTTTTGGCCTCTGTCGTTTCCTTTCTCTGTTTTTGTCCGT
+GGAATGAACAATGGAAGTCAACAAAAAGCAGCTGGCTGACATTTTCGGTGCGAGTATCCG
+TACCATTCAGAACTGGCAGGAACAGGGAATGCCCGTTCTGCGAGGCGGTGGCAAGGGTAA
+TGAGGTGCTTTATGACTCTGCCGCCGTCATAAAATGGTATGCCGAAAGGGATGCTGAAAT
+TGAGAACGAAAAGCTGCGCCGGGAGGTTGAAGAACTGCGGCAGGCCAGCGAGGCAGATCT
+CCAGCCAGGAACTATTGAGTACGAACGCCATCGACTTACGCGTGCGCAGGCCGACGCACA
+GGAACTGAAGAATGCCAGAGACTCCGCTGAAGTGGTGGAAACCGCATTCTGTACTTTCGT
+GCTGTCGCGGATCGCAGGTGAAATTGCCAGTATTCTCGACGGGCTCCCCCTGTCGGTGCA
+GCGGCGTTTTCCGGAACTGGAAAACCGACATGTTGATTTCCTGAAACGGGATATCATCAA
+AGCCATGAACAAAGCAGCCGCGCTGGATGAACTGATACCGGGGTTGCTGAGTGAATATAT
+CGAACAGTCAGGTTAACAGGCTGCGGCATTTTGTCCGCGCCGGGCTTCGCTCACTGTTCA
+GGCCGGAGCCACAGACCGCCGTTGAATGGGCGGATGCTAATTACTATCTCCCGAAAGAAT
+CCGCATACCAGGAAGGGCGCTGGGAAACACTGCCCTTTCAGCGGGCCATCATGAATGCGA
+TGGGCAGCGACTACATCCGTGAGGTGAATGTGGTGAAGTCTGCCCGTGTCGGTTATTCCA
+AAATGCTGCTGGGTGTTTATGCCTACTTTATAGAGCATAAGCAGCGCAACACCCTTATCT
+GGTTGCCGACGGATGGTGATGCCGAGAACTTTATGAAAACCCACGTTGAGCCGACTATTC
+GTGATATTCCGTCGCTGCTGGCGCTGGCCCCGTGGTATGGCAAAAAGCACCGGGATAACA
+CGCTCACCATGAAGCGTTTCACTAATGGGCGTGGCTTCTGGTGCCTGGGCGGTAAAGCGG
+CAAAAAACTACCGTGAAAAGTCGGTGGATGTGGCGGGTTATGATGAACTTGCTGCTTTTG
+ATGATGATATTGAACAGGAAGGCTCTCCGACGTTCCTGGGTGACAAGCGTATTGAAGGCT
+CGGTCTGGCCAAAGTCCATCCGTGGCTCCACGCCAAAAGTGAGAGGCACCTGTCAGATTG
+AGCGTGCAGCCAGTGAATCCCCGCATTTTATGCGTTTTCATGTTGCCTGCCCGCATTGCG
+GGGAGGAGCAGTATCTTAAATTTGGCGACAAAGAGACGCCGTTTGGCCTCAAATGGACGC
+CGGATGACCCCTCCAGCGTGTTTTATCTCTGCGAGCATAATGCCTGCGTCATCCGCCAGC
+AGGAGCTGGACTTTACTGATGCCCGTTATATCTGCGAAAAGACCGGGATCTGGACCCGTG
+ATGGCATTCTCTGGTTTTCGTCATCCGGTGAAGAGATTGAGCCACCTGACAGTGTGACCT
+TTCACATCTGGACAGCGTACAGCCCGTTCACCACCTGGGTGCAGATTGTCAAAGACTGGA
+TGAAAACGAAAGGGGATACGGGAAAACGTAAAACCTTCGTAAACACCACGCTCGGTGAGA
+CGTGGGAGGCGAAAATTGGCGAACGTCCGGATGCTGAAGTGATGGCAGAGCGGAAAGAGC
+ATTATTCAGCGCCCGTTCCTGACCGTGTGGCTTACCTGACCGCCGGTATCGACTCCCAGC
+TGGACCGCTACGAAATGCGCGTATGGGGATGGGGGCCGGGTGAGGAAAGCTGGCTGATTG
+ACCGGCAGATTATTATGGGCCGCCACGACGATGAACAGACGCTGCTGCGTGTGGATGAGG
+CCATCAATAAAACCTATACCCGCCGGAATGGTGCAGAAATGTCGATATCCCGTATCTGCT
+GGGATACTGGCGGGATTGACCCGACCATTGTGTATGAACGCTCGAAAAAACATGGGCTGT
+TCCGGGTGATCCCCATTAAAGGGGCATCCGTCTACGGAAAGCCGGTGGCCAGCATGCCAC
+GTAAGCGAAACAAAAACGGGGTTTACCTTACCGAAATCGGTACGGATACCGCGAAAGAGC
+AGATTTATAACCGCTTCACACTGACGCCGGAAGGGGATGAACCGCTTCCCGGTGCCGTTC
+ACTTCCCGAATAACCCGGATATTTTTGATCTGACCGAAGCGCAGCAGCTGACTGCTGAAG
+AGCAGGTCGAAAAATGGGTGGATGGCAGGAAAAAAATACTGTGGGACAGCAAAAAGCGAC
+GCAATGAGGCACTCGACTGCTTCGTTTATGCGCTGGCGGCGCTGCGCATCAGTATTTCCC
+GCTGGCAGCTGGATCTCAGTGCGCTGCTGGCGAGCCTGCAGGAAGAGGATGGTGCAGCAA
+CCAACAAGAAAACACTGGCAGATTACGCCCGTGCCTTATCCGGAGAGGATGAATGACGCG
+ACAGGAAGAACTTGCCGCTGCCCGTGCGGCACTGCATGACCTGATGACAGGTAAACGGGT
+GGCAACAGTACAGAAAGACGGACGAAGGGTGGAGTTTACGGCCACTTCCGTGTCTGACCT
+GAAAAAATATATTGCAGAGCTGGAAGTGCAGACCGGCATGACACAGCGACGCAGGGGACC
+TGCAGGATTTTATGTATGAAAACGCCCACCATTCCCACCCTTCTGGGGCCGGACGGCATG
+ACATCGCTGCGCGAATATGCCGGTTATCACGGCGGTGGCAGCGGATTTGGAGGGCAGTTG
+CGGTCGTGGAACCCACCGAGTGAAAGTGTGGATGCAGCCCTGTTGCCCAACTTTACCCGT
+GGCAATGCCCGCGCAGACGATCTGGTACGCAATAACGGCTATGCCGCCAACGCCATCCAG
+CTGCATCAGGATCATATCGTCGGGTCTTTTTTCCGGCTCAGTCATCGCCCAAGCTGGCGC
+TATCTGGGCATCGGGGAGGAAGAAGCCCGTGCCTTTTCCCGCGAGGTTGAAGCGGCATGG
+AAAGAGTTTGCCGAGGATGACTGCTGCTGCATTGACGTTGAGCGAAAACGCACGTTTACC
+ATGATGATTCGGGAAGGTGTGGCCATGCACGCCTTTAACGGTGAACTGTTCGTTCAGGCC
+ACCTGGGATACCAGTTCGTCGCGGCTTTTCCGGACACAGTTCCGGATGGTCAGCCCGAAG
+CGCATCAGCAACCCGAACAATACCGGCGACAGCCGGAACTGCCGTGCCGGTGTGCAGATT
+AATGACAGCGGTGCGGCGCTGGGATATTACGTCAGCGAGGACGGGTATCCTGGCTGGATG
+CCGCAGAAATGGACATGGATACCCCGTGAGTTACCCGGCGGGCGCGCCTCGTTCATTCAC
+GTTTTTGAACCCGTGGAGGACGGGCAGACTCGCGGTGCAAATGTGTTTTACAGCGTGATG
+GAGCAGATGAAGATGCTCGACACGCTGCAGAACACGCAGCTGCAGAGCGCCATTGTGAAG
+GCGATGTATGCCGCCACCATTGAGAGTGAGCTGGATACGCAGTCAGCGATGGATTTTATT
+CTGGGCGCGAACAGTCAGGAGCAGCGGGAAAGGCTGACCGGCTGGATTGGTGAAATTGCC
+GCGTATTACGCCGCAGCGCCGGTCCGGCTGGGAGGCGCAAAAGTACCGCACCTGATGCCG
+GGTGACTCACTGAACCTGCAGACGGCTCAGGATACGGATAACGGCTACTCCGTGTTTGAG
+CAGTCACTGCTGCGGTATATCGCTGCCGGGCTGGGTGTCTCGTATGAGCAGCTTTCCCGG
+AATTACGCCCAGATGAGCTACTCCACGGCACGGGCCAGTGCGAACGAGTCGTGGGCGTAC
+TTTATGGGGCGGCGAAAATTCGTCGCATCCCGTCAGGCGAGCCAGATGTTTCTGTGCTGG
+CTGGAAGAGGCCATCGTTCGCCGCGTGGTGACGTTACCTTCAAAAGCGCGCTTCAGTTTT
+CAGGAAGCCCGCAGTGCCTGGGGGAACTGCGACTGGATAGGCTCCGGTCGTATGGCCATC
+GATGGTCTGAAAGAAGTTCAGGAAGCGGTGATGCTGATAGAAGCCGGACTGAGTACCTAC
+GAGAAAGAGTGCGCAAAACGCGGTGACGACTATCAGGAAATTTTTGCCCAGCAGGTCCGT
+GAAACGATGGAGCGCCGTGCAGCCGGTCTTAAACCGCCCGCCTGGGCGGCTGCAGCATTT
+GAATCCGGGCTGCGACAATCAACAGAGGAGGAGAAGAGTGACAGCAGAGCTGCGTAATCT
+CCCGCATATTGCCAGCATGGCCTTTAATGAGCCGCTGATGCTTGAACCCGCCTATGCGCG
+GGTTTTCTTTTGTGCGCTTGCAGGCCAGCTTGGGATCAGCAGCCTGACGGATGCGGTGTC
+CGGCGACAGCCTGACTGCCCAGGAGGCACTCGCGACGCTGGCATTATCCGGTGATGATGA
+CGGACCACGACAGGCCCGCAGTTATCAGGTCATGAACGGCATCGCCGTGCTGCCGGTGTC
+CGGCACGCTGGTCAGCCGGACGCGGGCGCTGCAGCCGTACTCGGGGATGACCGGTTACAA
+CGGCATTATCGCCCGTCTGCAACAGGCTGCCAGCGATCCGATGGTGGACGGCATTCTGCT
+CGATATGGACACGCCCGGCGGGATGGTGGCGGGGGCATTTGACTGCGCTGACATCATCGC
+CCGTGTGCGTGACATAAAACCGGTATGGGCGCTTGCCAACGACATGAACTGCAGTGCAGG
+TCAGTTGCTTGCCAGTGCCGCCTCCCGGCGTCTGGTCACGCAGACCGCCCGGACAGGCTC
+CATCGGCGTCATGATGGCTCACAGTAATTACGGTGCTGCGCTGGAGAAACAGGGTGTGGA
+AATCACGCTGATTTACAGCGGCAGCCATAAGGTGGATGGCAACCCCTACAGCCATCTTCC
+GGATGACGTCCGGGAGACACTGCAGTCCCGGATGGACGCAACCCGCCAGATGTTTGCGCA
+GAAGGTGTCGGCATATACCGGCCTGTCCGTGCAGGTTGTGCTGGATACCGAGGCTGCAGT
+GTACAGCGGTCAGGAGGCCATTGATGCCGGACTGGCTGATGAACTTGTTAACAGCACCGA
+TGCGATCACCGTCATGCGTGATGCACTGGATGCACGTAAATCCCGTCTCTCAGGAGGGCG
+AATGACCAAAGAGACTCAATCAACAACTGTTTCAGCCACTGCTTCGCAGGCTGACGTTAC
+TGACGTGGTGCCAGCGACGGAGGGCGAGAACGCCAGCGCGGCGCAGCCGGACGTGAACGC
+GCAGATCACCGCAGCGGTTGCGGCAGAAAACAGCCGCATTATGGGGATCCTCAACTGTGA
+GGAGGCTCACGGACGCGAAGAACAGGCACGCGTGCTGGCAGAAACCCCCGGTATGACCGT
+GAAAACGGCCCGCCGCATTCTGGCCGCAGCACCACAGAGTGCACAGGCGCGCAGTGACAC
+TGCGCTGGATCGTCTGATGCAGGGGGCACCGGCACCGCTGGCTGCAGGTAACCCGGCATC
+TGATGCCGTTAACGATTTGCTGAACACACCAGTGTAAGGGATGTTTATGACGAGCAAAGA
+AACCTTTACCCATTACCAGCCGCAGGGCAACAGTGACCCGGCTCATACCGCAACCGCGCC
+CGGCGGATTGAGTGCGAAAGCGCCTGCAATGACCCCGCTGATGCTGGACACCTCCAGCCG
+TAAGCTGGTTGCGTGGGATGGCACCACCGACGGTGCTGCCGTTGGCATTCTTGCGGTTGC
+TGCTGACCAGACCAGCACCACGCTGACGTTCTACAAGTCCGGCACGTTCCGTTATGAGGA
+TGTGCTCTGGCCGGAGGCTGCCAGCGACGAGACGAAAAAACGGACCGCGTTTGCCGGAAC
+GGCAATCAGCATCGTTTAACTTTACCCTTCATCACTAAAGGCCGCCTGTGCGGCTTTTTT
+TACGGGATTTTTTTATGTCGATGTACACAACCGCCCAACTGCTGGCGGCAAATGAGCAGA
+AATTTAAGTTTGATCCGCTGTTTCTGCGTCTCTTTTTCCGTGAGAGCTATCCCTTCACCA
+CGGAGAAAGTCTATCTCTCACAAATTCCGGGACTGGTAAACATGGCGCTGTACGTTTCGC
+CGATTGTTTCCGGTGAGGTTATCCGTTCCCGTGGCGGCTCCACCTCTGAATTTACGCCGG
+GATATGTCAAGCCGAAGCATGAAGTGAATCCGCAGATGACCCTGCGTCGCCTGCCGGATG
+AAGATCCGCAGAATCTGGCGGACCCGGCTTACCGCCGCCGTCGCATCATCATGCAGAACA
+TGCGTGACGAAGAGCTGGCCATTGCTCAGGTCGAAGAGATGCAGGCAGTTTCTGCCGTGC
+TTAAGGGCAAATACACCATGACCGGTGAAGCCTTCGATCCGGTTGAGGTGGATATGGGCC
+GCAGTGAGGAGAATAACATCACGCAGTCCGGCGGCACGGAGTGGAGCAAGCGTGACAAGT
+CCACGTATGACCCGACCGACGATATCGAAGCCTACGCGCTGAACGCCAGCGGTGTGGTGA
+ATATCATCGTGTTCGATCCGAAAGGCTGGGCGCTGTTCCGTTCCTTCAAAGCCGTCAAGG
+AGAAGCTGGATACCCGTCGTGGCTCTAATTCCGAGCTGGAGACAGCGGTGAAAGACCTGG
+GCAAAGCGGTGTCCTATAAGGGGATGTATGGCGATGTGGCCATCGTCGTGTATTCCGGAC
+AGTACGTGGAAAACGGCGTCAAAAAGAACTTCCTGCCGGACAACACGATGGTGCTGGGGA
+ACACTCAGGCACGCGGTCTGCGCACCTATGGCTGCATTCAGGATGCGGACGCACAGCGCG
+AAGGCATTAACGCCTCTGCCCGTTACCCGAAAAACTGGGTGACCACCGGCGATCCGGCGC
+GTGAGTTCACCATGATTCAGTCAGCACCGCTGATGCTGCTGGCTGACCCTGATGAGTTCG
+TGTCCGTACAACTGGCGTAATCATGGCCCTTCGGGGCCATTGTTTCTCTGTGGAGGAGTC
+CATGACGAAAGATGAACTGATTGCCCGTCTCCGCTCGCTGGGTGAACAACTGAACCGTGA
+TGTCAGCCTGACGGGGACGAAAGAAGAACTGGCGCTCCGTGTGGCAGAGCTGAAAGAGGA
+GCTTGATGACACGGATGAAACTGCCGGTCAGGACACCCCTCTCAGCCGGGAAAATGTGCT
+GACCGGACATGAAAATGAGGTGGGATCAGCGCAGCCGGATACCGTGATTCTGGATACGTC
+TGAACTGGTCACGGTCGTGGCACTGGTGAAGCTGCATACTGATGCACTTCACGCCACGCG
+GGATGAACCTGTGGCATTTGTGCTGCCGGGAACGGCGTTTCGTGTCTCTGCCGGTGTGGC
+AGCCGAAATGACAGAGCGCGGCCTGGCCAGAATGCAATAACGGGAGGCGCTGTGGCTGAT
+TTCGATAACCTGTTCGATGCTGCCATTGCCCGCGCCGATGAAACGATACGCGGGTACATG
+GGAACGTCAGCCACCATTACATCCGGTGAGCAGTCAGGTGCGGTGATACGTGGTGTTTTT
+GATGACCCTGAAAATATCAGCTATGCCGGACAGGGCGTGCGCGTTGAAGGCTCCAGCCCG
+TCCCTGTTTGTCCGGACTGATGAGGTGCGGCAGCTGCGGCGTGGAGACACGCTGACCATC
+GGTGAGGAAAATTTCTGGGTAGATCGGGTTTCGCCGGATGATGGCGGAAGTTGTCATCTC
+TGGCTTGGACGGGGCGTACCGCCTGCCGTTAACCGTCGCCGCTGAAAGGGGGATGTATGG
+CCATAAAAGGTCTTGAGCAGGCCGTTGAAAACCTCAGCCGTATCAGCAAAACGGCGGTGC
+CTGGTGCCGCCGCAATGGCCATTAACCGCGTTGCTTCATCCGCGATATCGCAGTCGGCGT
+CACAGGTTGCCCGTGAGACAAAGGTACGCCGGAAACTGGTAAAGGAAAGGGCCAGGCTGA
+AAAGGGCCACGGTCAAAAATCCGCAGGCCAGAATCAAAGTTAACCGGGGGGATTTGCCCG
+TAATCAAGCTGGGTAATGCGCGGGTTGTCCTTTCGCGCCGCAGGCGTCGTAAAAAGGGGC
+AGCGTTCATCCCTGAAAGGTGGCGGCAGCGTGCTTGTGGTGGGTAACCGTCGTATTCCCG
+GCGCGTTTATTCAGCAACTGAAAAATGGCCGGTGGCATGTCATGCAGCGTGTGGCTGGGA
+AAAACCGTTACCCCATTGATGTGGTGAAAATCCCGATGGCGGTGCCGCTGACCACGGCGT
+TTAAACAAAATATTGAGCGGATACGGCGTGAACGTCTTCCGAAAGAGCTGGGCTATGCGC
+TGCAGCATCAACTGAGGATGGTAATAAAGCGATGAAACATACTGAACTCCGTGCAGCCGT
+ACTGGATGCACTGGAGAAGCATGACACCGGGGCGACGTTTTTTGATGGTCGCCCCGCTGT
+TTTTGATGAGGCGGATTTTCCGGCAGTTGCCGTTTATCTCACCGGCGCTGAATACACGGG
+CGAAGAGCTGGACAGCGATACCTGGCAGGCGGAGCTGCATATCGAAGTTTTCCTGCCTGC
+TCAGGTGCCGGATTCAGAGCTGGATGCGTGGATGGAGTCCCGGATTTATCCGGTGATGAG
+CGATATCCCGGCACTGTCAGATTTGATCACCAGTATGGTGGCCAGCGGCTATGACTACCG
+GCGCGACGATGATGCGGGCTTGTGGAGTTCAGCCGATCTGACTTATGTCATTACCTATGA
+AATGTGAGGACGCTATGCCTGTACCAAATCCTACAATGCCGGTGAAAGGTGCCGGGACCA
+CCCTGTGGGTTTATAAGGGGAGCGGTGACCCTTACGCGAATCCGCTTTCAGACGTTGACT
+GGTCGCGTCTGGCAAAAGTTAAAGACCTGACGCCCGGCGAACTGACCGCTGAGTCCTATG
+ACGACAGCTATCTCGATGATGAAGATGCAGACTGGACTGCGACCGGGCAGGGGCAGAAAT
+CTGCCGGAGATACCAGCTTCACGCTGGCGTGGATGCCCGGAGAGCAGGGGCAGCAGGCGC
+TGCTGGCGTGGTTTAATGAAGGCGATACCCGTGCCTATAAAATCCGCTTCCCGAACGGCA
+CGGTCGATGTGTTCCGTGGCTGGGTCAGCAGTATCGGTAAGGCGGTGACGGCGAAGGAAG
+TGATCACCCGCACGGTGAAAGTCACCAATGTGGGACGTCCGTCGATGGCAGAAGATCGCA
+GCACGGTAACAGCGGCAACCGGCATGACCGTGACGCCTGCCAGCACCTCGGTGGTGAAAG
+GGCAGAGCACCACGCTGACCGTGGCCTTCCAGCCGGAGGGCGTAACCGACAAGAGCTTTC
+GTGCGGTGTCTGCGGATAAAACAAAAGCCACCGTGTCGGTCAGTGGTATGACCATCACCG
+TGAACGGCGTTGCTGCAGGCAAGGTCAACATTCCGGTTGTATCCGGTAATGGTGAGTTTG
+CTGCGGTTGCAGAAATTACCGTCACCGCCAGTTAATCCGGAGAGTCAGCGATGTTCCTGA
+AAACCGAATCATTTGAACATAACGGTGTGACCGTCACGCTTTCTGAACTGTCAGCCCTGC
+AGCGCATTGAGCATCTCGCCCTGATGAAACGGCAGGCAGAACAGGCGGAGTCAGACAGCA
+ACCGGAAGTTTACTGTGGAAGACGCCATCAGAACCGGCGCGTTTCTGGTGGCGATGTCCC
+TGTGGCATAACCATCCGCAGAAGACGCAGATGCCGTCCATGAATGAAGCCGTTAAACAGA
+TTGAGCAGGAAGTGCTTACCACCTGGCCCACGGAGGCAATTTCTCATGCTGAAAACGTGG
+TGTACCGGCTGTCTGGTATGTATGAGTTTGTGGTGAATAATGCCCCTGAACAGACAGAGG
+ACGCCGGGCCCGCAGAGCCTGTTTCTGCGGGAAAGTGTTCGACGGTGAGCTGAGTTTTGC
+CCTGAAACTGGCGCGTGAGATGGGGCGACCCGACTGGCGTGCCATGCTTGCCGGGATGTC
+ATCCACGGAGTATGCCGACTGGCACCGCTTTTACAGTACCCATTATTTTCATGATGTTCT
+GCTGGATATGCACTTTTCCGGGCTGACGTACACCGTGCTCAGCCTGTTTTTCAGCGATCC
+GGATATGCATCCGCTGGATTTCAGTCTGCTGAACCGGCGCGAGGCTGACGAAGAGCCTGA
+AGATGATGTGCTGATGCAGAAAGCGGCAGGGCTTGCCGGAGGTGTCCGCTTTGGCCCGGA
+CGGGAATGAAGTTATCCCCGCTTCCCCGGATGTGGCGGACATGACGGAGGATGACGTAAT
+GCTGATGACAGTATCAGAAGGGATCGCAGGAGGAGTCCGGTATGGCTGAACCGGTAGGCG
+ATCTGGTCGTTGATTTGAGTCTGGATGCGGCCAGATTTGACGAGCAGATGGCCAGAGTCA
+GGCGTCATTTTTCTGGTACGGAAAGTGATGCGAAAAAAACAGCGGCAGTCGTTGAACAGT
+CGCTGAGCCGACAGGCGCTGGCTGCACAGAAAGCGGGGATTTCCGTCGGGCAGTATAAAG
+CCGCCATGCGTATGCTGCCTGCACAGTTCACCGACGTGGCCACGCAGCTTGCAGGCGGGC
+AAAGTCCGTGGCTGATCCTGCTGCAACAGGGGGGGCAGGTGAAGGACTCCTTCGGCGGGA
+TGATCCCCATGTTCAGGGGGCTTGCCGGTGCGATCACCCTGCCGATGGTGGGGGCCACCT
+CGCTGGCGGTGGCGACCGGTGCGCTGGCGTATGCCTGGTATCAGGGCAACTCAACCCTGT
+CCGATTTCAACAAAACGCTGGTCCTTTCCGGCAATCAGGCGGGACTGACGGCAGATCGTA
+TGCTGGTCCTGTCCAGAGCCGGGCAGGCGGCAGGGCTGACGTTTAACCAGACCAGCGAGT
+CACTCAGCGCACTGGTTAAGGCGGGGGTAAGCGGTGAGGCTCAGATTGCGTCCATCAGCC
+AGAGTGTGGCGCGTTTCTCCTCTGCATCCGGCGTGGAGGTGGACAAGGTCGCTGAAGCCT
+TCGGGAAGCTGACCACAGACCCGACGTCGGGGCTGACGGCGATGGCTCGCCAGTTCCATA
+ACGTGTCGGCGGAGCAGATTGCGTATGTTGCTCAGTTGCAGCGTTCCGGCGATGAAGCCG
+GGGCATTGCAGGCGGCGAACGAGGCCGCAACGAAAGGGTTTGATGACCAGACCCGCCGCC
+TGAAAGAGAACATGGGCACGCTGGAGACCTGGGCAGACAGGACTGCGCGGGCATTCAAAT
+CCATGTGGGATGCGGTGCTGGATATTGGTCGTCCTGATACCGCGCAGGAGATGCTGATTA
+AGGCAGAGGCTGCGTATAAGAAAGCAGACGACATCTGGAATCTGCGCAAGGATGATTATT
+TTGTTAACGATGAAGCGCGGGCGCGTTACTGGGATGATCGTGAAAAGGCCCGTCTTGCGC
+TTGAAGCCGCCCGAAAGAAGGCTGAGCAGCAGACTCAACAGGACAAAAATGCGCAGCAGC
+AGAGCGATACCGAAGCGTCACGGCTGAAATATACCGAAGAGGCGCAGAAGGCTTACGAAC
+GGCTGCAGACGCCGCTGGAGAAATATACCGCCCGTCAGGAAGAACTGAACAAGGCACTGA
+AAGACGGGAAAATCCTGCAGGCGGATTACAACACGCTGATGGCGGCGGCGAAAAAGGATT
+ATGAAGCGACGCTGAAAAAGCCGAAACAGTCCAGCGTGAAGGTGTCTGCGGGCGATCGTC
+AGGAAGACAGTGCTCATGCTGCCCTGCTGACGCTTCAGGCAGAACTCCGGACGCTGGAGA
+AGCATGCCGGAGCAAATGAGAAAATCAGCCAGCAGCGCCGGGATTTGTGGAAGGCGGAGA
+GTCAGTTCGCGGTACTGGAGGAGGCGGCGCAACGTCGCCAGCTGTCTGCACAGGAGAAAT
+CCCTGCTGGCGCATAAAGATGAGACGCTGGAGTACAAACGCCAGCTGGCTGCACTTGGCG
+ACAAGGTTACGTATCAGGAGCGCCTGAACGCGCTGGCGCAGCAGGCGGATAAATTCGCAC
+AGCAGCAACGGGCAAAACGGGCCGCCATTGATGCGAAAAGCCGGGGGCTGACTGACCGGC
+AGGCAGAACGGGAAGCCACGGAACAGCGCCTGAAGGAACAGTATGGCGATAATCCGCTGG
+CGCTGAATAACGTCATGTCAGAGCAGAAAAAGACCTGGGCGGCTGAAGACCAGCTTCGCG
+GGAACTGGATGGCAGGCCTGAAGTCCGGCTGGAGTGAGTGGGAAGAGAGCGCCACGGACA
+GTATGTCGCAGGTAAAAAGTGCAGCCACGCAGACCTTTGATGGTATTGCACAGAATATGG
+CGGCGATGCTGACCGGCAGTGAGCAGAACTGGCGCAGCTTCACCCGTTCCGTGCTGTCCA
+TGATGACAGAAATTCTGCTTAAGCAGGCAATGGTGGGGATTGTCGGGAGTATCGGCAGCG
+CCATTGGCGGGGCTGTTGGTGGCGGCGCATCCGCGTCAGGCGGTACAGCCATTCAGGCCG
+CTGCGGCGAAATTCCATTTTGCAACCGGAGGATTTACGGGAACCGGCGGCAAATATGAGC
+CAGCGGGGATTGTTCACCGTGGTGAGTTTGTCTTCACGAAGGAGGCAACCAGCCGGATTG
+GCGTGGGGAATCTTTACCGGCTGATGCGCGGCTATGCCACCGGCGGTTATGTCGGTACAC
+CGGGCAGCATGGCAGACAGCCGGTCGCAGGCGTCCGGGACGTTTGAGCAGAATAACCATG
+TGGTGATTAACAACGACGGCACGAACGGGCAGATAGGTCCGGCTGCTCTGAAGGCGGTGT
+ATGACATGGCCCGCAAGGGTGCCCGTGATGAAATTCAGACACAGATGCGTGATGGTGGCC
+TGTTCTCCGGAGGTGGACGATGAAGACCTTCCGCTGGAAAGTGAAACCCGGTATGGATGT
+GGCTTCGGTCCCTTCTGTAAGAAAGGTGCGCTTTGGTGATGGCTATTCTCAGCGAGCGCC
+TGCCGGGCTGAATGCCAACCTGAAAACGTACAGCGTGACGCTTTCTGTCCCCCGTGAGGA
+GGCCACGGTACTGGAGTCGTTTCTGGAAGAGCACGGGGGCTGGAAATCCTTTCTGTGGAC
+GCCGCCTTATGAGTGGCGGCAGATAAAGGTGACCTGCGCAAAATGGTCGTCGCGGGTCAG
+TATGCTGCGTGTTGAGTTCAGCGCAGAGTTTGAACAGGTGGTGAACTGATGCAGGATATC
+CGGCAGGAAACACTGAATGAATGCACCCGTGCGGAGCAGTCGGCCAGCGTGGTGCTCTGG
+GAAATCGACCTGACAGAGGTCGGTGGAGAACGTTATTTTTTCTGTAATGAGCAGAACGAA
+AAAGGTGAGCCGGTCACCTGGCAGGGGCGACAGTATCAGCCGTATCCCATTCAGGGGAGC
+GGTTTTGAACTGAATGGCAAAGGCACCAGTACGCGCCCCACGCTGACGGTTTCTAACCTG
+TACGGTATGGTCACCGGGATGGCGGAAGATATGCAGAGTCTGGTCGGCGGAACGGTGGTC
+CGGCGTAAGGTTTACGCCCGTTTTCTGGATGCGGTGAACTTCGTCAACGGAAACAGTTAC
+GCCGATCCGGAGCAGGAGGTGATCAGCCGCTGGCGCATTGAGCAGTGCAGCGAACTGAGC
+GCGGTGAGTGCCTCCTTTGTACTGTCCACGCCGACGGAAACGGATGGCGCTGTTTTTCCG
+GGACGTATCATGCTGGCCAACACCTGCACCTGGACCTATCGCGGTGACGAGTGCGGTTAT
+AGCGGTCCGGCTGTCGCGGATGAATATGACCAGCCAACGTCCGATATCACGAAGGATAAA
+TGCAGCAAATGCCTGAGCGGTTGTAAGTTCCGCAATAACGTCGGCAACTTTGGCGGCTTC
+CTTTCCATTAACAAACTTTCGCAGTAAATCCCATGACACAGACAGAATCAGCGATTCTGG
+CGCACGCCCGGCGATGTGCGCCAGCGGAGTCGTGCGGCTTCGTGGTAAGCACGCCGGAGG
+GGGAAAGATATTTCCCCTGCGTGAATATCTCCGGTGAGCCGGAGGCTATTTCCGTATGTC
+GCCGGAAGACTGGCTGCAGGCAGAAATGCAGGGTGAGATTGTGGCGCTGGTCCACAGCCA
+CCCCGGTGGTCTGCCCTGGCTGAGTGAGGCCGACCGGCGGCTGCAGGTGCAGAGTGATTT
+GCCGTGGTGGCTGGTCTGCCGGGGGACGATTCATAAGTTCCGCTGTGTGCCGCATCTCAC
+CGGGCGGCGCTTTGAGCACGGTGTGACGGACTGTTACACACTGTTCCGGGATGCTTATCA
+TCTGGCGGGGATTGAGATGCCGGACTTTCATCGTGAGGATGACTGGTGGCGTAACGGCCA
+GAATCTCTATCTGGATAATCTGGAGGCGACGGGGCTGTATCAGGTGCCGTTGTCAGCGGC
+ACAGCCGGGCGATGTGCTGCTGTGCTGTTTTGGTTCATCAGTGCCGAATCACGCCGCAAT
+TTACTGCGGCGACGGCGAGCTGCTGCACCATATTCCTGAACAACTGAGCAAACGAGAGAG
+GTACACCGACAAATGGCAGCGACGCACACACTCCCTCTGGCGTCACCGGGCATGGCGCGC
+ATCTGCCTTTACGGGGATTTACAACGATTTGGTCGCCGCATCGACCTTCGTGTGAAAACG
+GGGGCTGAAGCCATCCGGGCACTGGCCACACAGCTCCCGGCGTTTCGTCAGAAACTGAGC
+GACGGCTGGTATCAGGTACGGATTGCCGGGCGGGACGTCAGCACGTCCGGGTTAACGGCG
+CAGTTACATGAGACTCTGCCTGATGGCGCTGTAATTCATATTGTTCCCAGAGTCGCCGGG
+GCCAAGTCAGGTGGCGTATTCCAGATTGTCCTGGGGGCTGCCGCCATTGCCGGATCATTC
+TTTACCGCCGGAGCCACCCTTGCAGCATGGGGGGCAGCCATTGGGGCCGGTGGTATGACC
+GGCATCCTGTTTTCTCTCGGTGCCAGTATGGTGCTCGGTGGTGTGGCGCAGATGCTGGCA
+CCGAAAGCCAGAACTCCCCGTATACAGACAACGGATAACGGTAAGCAGAACACCTATTTC
+TCCTCACTGGATAACATGGTTGCCCAGGGCAATGTTCTGCCTGTTCTGTACGGGGAAATG
+CGCGTGGGGTCACGCGTGGTTTCTCAGGAGATCAGCACGGCAGACGAAGGGGACGGTGGT
+CAGGTTGTGGTGATTGGTCGCTGATGCAAAATGTTTTATGTGAAACCGCCTGCGGGCGGT
+TTTGTCATTTATGGAGCGTGAGGAATGGGTAAAGGAAGCAGTAAGGGGCATACCCCGCGC
+GAAGCGAAGGACAACCTGAAGTCCACGCAGTTGCTGAGTGTGATCGATGCCATCAGCGAA
+GGGCCGATTGAAGGTCCGGTGGATGGCTTAAAAAGCGTGCTGCTGAACAGTACGCCGGTG
+CTGGACACTGAGGGGAATACCAACATATCCGGTGTCACGGTGGTGTTCCGGGCTGGTGAG
+CAGGAGCAGACTCCGCCGGAGGGATTTGAATCCTCCGGCTCCGAGACGGTGCTGGGTACG
+GAAGTGAAATATGACACGCCGATCACCCGCACCATTACGTCTGCAAACATCGACCGTCTG
+CGCTTTACCTTCGGTGTACAGGCACTGGTGGAAACCACCTCAAAGGGTGACAGGAATCCG
+TCGGAAGTCCGCCTGCTGGTTCAGATACAACGTAACGGTGGCTGGGTGACGGAAAAAGAC
+ATCACCATTAAGGGCAAAACCACCTCGCAGTATCTGGCCTCGGTGGTGATGGGTAACCTG
+CCGCCGCGCCCGTTTAATATCCGGATGCGCAGGATGACGCCGGACAGCACCACAGACCAG
+CTGCAGAACAAAACGCTCTGGTCGTCATACACTGAAATCATCGATGTGAAACAGTGCTAC
+CCGAACACGGCACTGGTCGGCGTGCAGGTGGACTCGGAGCAGTTCGGCAGCCAGCAGGTG
+AGCCGTAATTATCATCTGCGCGGGCGTATTCTGCAGGTGCCGTCGAACTATAACCCGCAG
+ACGCGGCAATACAGCGGTATCTGGGACGGAACGTTTAAACCGGCATACAGCAACAACATG
+GCCTGGTGTCTGTGGGATATGCTGACCCATCCGCGCTACGGCATGGGGAAACGTCTTGGT
+GCGGCGGATGTGGATAAATGGGCGCTGTATGTCATCGGCCAGTACTGCGACCAGTCAGTG
+CCGGACGGCTTTGGCGGCACGGAGCCGCGCATCACCTGTAATGCGTACCTGACCACACAG
+CGTAAGGCGTGGGATGTGCTCAGCGATTTCTGCTCGGCGATGCGCTGTATGCCGGTATGG
+AACGGGCAGACGCTGACGTTCGTGCAGGACCGACCGTCGGATAAGACGTGGACCTATAAC
+CGCAGTAATGTGGTGATGCCGGATGATGGCGCGCCGTTCCGCTACAGCTTCAGCGCCCTG
+AAGGACCGCCATAATGCCGTTGAGGTGAACTGGATTGACCCGAACAACGGCTGGGAGACG
+GCGACAGAGCTTGTTGAAGATACGCAGGCCATTGCCCGTTACGGTCGTAATGTTACGAAG
+ATGGATGCCTTTGGCTGTACCAGCCGGGGGCAGGCACACCGCGCCGGGCTGTGGCTGATT
+AAAACAGAACTGCTGGAAACGCAGACCGTGGATTTCAGCGTCGGCGCAGAAGGGCTTCGC
+CATGTACCGGGCGATGTTATTGAAATCTGCGATGATGACTATGCCGGTATCAGCACCGGT
+GGTCGTGTGCTGGCGGTGAACAGCCAGACCCGGACGCTGACGCTCGACCGTGAAATCACG
+CTGCCATCCTCCGGTACCGCGCTGATAAGCCTGGTTGACGGAAGTGGCAATCCGGTCAGC
+GTGGAGGTTCAGTCCGTCACCGACGGCGTGAAGGTAAAAGTGAGCCGTGTTCCTGACGGT
+GTTGCTGAATACAGCGTATGGGAGCTGAAGCTGCCGACGCTGCGCCAGCGACTGTTCCGC
+TGCGTGAGTATCCGTGAGAACGACGACGGCACGTATGCCATCACCGCCGTGCAGCATGTG
+CCGGAAAAAGAGGCCATCGTGGATAACGGGGCGCACTTTGACGGCGAACAGAGTGGCACG
+GTGAATGGTGTCACGCCGCCAGCGGTGCAGCACCTGACCGCAGAAGTCACTGCAGACAGC
+GGGGAATATCAGGTGCTGGCGCGATGGGACACACCGAAGGTGGTGAAGGGCGTGAGTTTC
+CTGCTCCGTCTGACCGTAACAGCGGACGACGGCAGTGAGCGGCTGGTCAGCACGGCCCGG
+ACGACGGAAACCACATACCGCTTCACGCAACTGGCGCTGGGGAACTACAGGCTGACAGTC
+CGGGCGGTAAATGCGTGGGGGCAGCAGGGCGATCCGGCGTCGGTATCGTTCCGGATTGCC
+GCACCGGCAGCACCGTCGAGGATTGAGCTGACGCCGGGCTATTTTCAGATAACCGCCACG
+CCGCATCTTGCCGTTTATGACCCGACGGTACAGTTTGAGTTCTGGTTCTCGGAAAAGCAG
+ATTGCGGATATCAGACAGGTTGAAACCAGCACGCGTTATCTTGGTACGGCGCTGTACTGG
+ATAGCCGCCAGTATCAATATCAAACCGGGCCATGATTATTACTTTTATATCCGCAGTGTG
+AACACCGTTGGCAAATCGGCATTCGTGGAGGCCGTCGGTCGGGCGAGCGATGATGCGGAA
+GGTTACCTGGATTTTTTCAAAGGCAAGATAACCGAATCCCATCTCGGCAAGGAGCTGCTG
+GAAAAAGTCGAGCTGACGGAGGATAACGCCAGCAGACTGGAGGAGTTTTCGAAAGAGTGG
+AAGGATGCCAGTGATAAGTGGAATGCCATGTGGGCTGTCAAAATTGAGCAGACCAAAGAC
+GGCAAACATTATGTCGCGGGTATTGGCCTCAGCATGGAGGACACGGAGGAAGGCAAACTG
+AGCCAGTTTCTGGTTGCCGCCAATCGTATCGCATTTATTGACCCGGCAAACGGGAATGAA
+ACGCCGATGTTTGTGGCGCAGGGCAACCAGATATTCATGAACGACGTGTTCCTGAAGCGC
+CTGACGGCCCCCACCATTACCAGCGGCGGCAATCCTCCGGCCTTTTCCCTGACACCGGAC
+GGAAAGCTGACCGCTAAAAATGCGGATATCAGTGGCAGTGTGAATGCGAACTCCGGGACG
+CTCAGTAATGTGACGATAGCTGAAAACTGTACGATAAACGGTACGCTGAGGGCGGAAAAA
+ATCGTCGGGGACATTGTAAAGGCGGCGAGCGCGGCTTTTCCGCGCCAGCGTGAAAGCAGT
+GTGGACTGGCCGTCAGGTACCCGTACTGTCACCGTGACCGATGACCATCCTTTTGATCGC
+CAGATAGTGGTGCTTCCGCTGACGTTTCGCGGAAGTAAGCGTACTGTCAGCGGCAGGACA
+ACGTATTCGATGTGTTATCTGAAAGTACTGATGAACGGTGCGGTGATTTATGATGGCGCG
+GCGAACGAGGCGGTACAGGTGTTCTCCCGTATTGTTGACATGCCAGCGGGTCGGGGAAAC
+GTGATCCTGACGTTCACGCTTACGTCCACACGGCATTCGGCAGATATTCCGCCGTATACG
+TTTGCCAGCGATGTGCAGGTTATGGTGATTAAGAAACAGGCGCTGGGCATCAGCGTGGTC
+TGAGTGTGTTACAGAGGTTCGTCCGGGAACGGGCGTTTTATTATAAAACAGTGAGAGGTG
+AACGATGCGTAATGTGTGTATTGCCGTTGCTGTCTTTGCCGCACTTGCGGTGACAGTCAC
+TCCGGCCCGTGCGGAAGGTGGACATGGTACGTTTACGGTGGGCTATTTTCAAGTGAAACC
+GGGTACATTGCCGTCGTTGTCGGGCGGGGATACCGGTGTGAGTCATCTGAAAGGGATTAA
+CGTGAAGTACCGTTATGAGCTGACGGACAGTGTGGGGGTGATGGCTTCCCTGGGGTTCGC
+CGCGTCGAAAAAGAGCAGCACAGTGATGACCGGGGAGGATACGTTTCACTATGAGAGCCT
+GCGTGGACGTTATGTGAGCGTGATGGCCGGACCGGTTTTACAAATCAGTAAGCAGGTCAG
+TGCGTACGCCATGGCCGGAGTGGCTCACAGTCGGTGGTCCGGCAGTACAATGGATTACCG
+TAAGACGGAAATCACTCCCGGGTATATGAAAGAGACGACCACTGCCAGGGACGAAAGTGC
+AATGCGGCATACCTCAGTGGCGTGGAGTGCAGGTATACAGATTAATCCGGCAGCGTCCGT
+CGTTGTTGATATTGCTTATGAAGGCTCCGGCAGTGGCGACTGGCGTACTGACGGATTCAT
+CGTTGGGGTCGGTTATAAATTCTGATTAGCCAGGTAACACAGTGTTATGACAGCCCGCCG
+GAACCGGTGGGCTTTTTTGTGGGGTGAATATGGCAGTAAAGATTTCAGGAGTCCTGAAAG
+ACGGCACAGGAAAACCGGTACAGAACTGCACCATTCAGCTGAAAGCCAGACGTAACAGCA
+CCACGGTGGTGGTGAACACGGTGGGCTCAGAGAATCCGGATGAAGCCGGGCGTTACAGCA
+TGGATGTGGAGTACGGTCAGTACAGTGTCATCCTGCAGGTTGACGGTTTTCCACCATCGC
+ACGCCGGGACCATCACCGTGTATGAAGATTCACAACCGGGGACGCTGAATGATTTTCTCT
+GTGCCATGACGGAGGATGATGCCCGGCCGGAGGTGCTGCGTCGTCTTGAACTGATGGTGG
+AAGAGGTGGCGCGTAACGCGTCCGTGGTGGCACAGAGTACGGCAGACGCGAAGAAATCAG
+CCGGCGATGCCAGTGCATCAGCTGCTCAGGTCGCGGCCCTTGTGACTGATGCAACTGACT
+CAGCACGCGCCGCCAGCACGTCCGCCGGACAGGCTGCATCGTCAGCTCAGGAAGCGTCCT
+CCGGCGCAGAAGCGGCATCAGCAAAGGCCACTGAAGCGGAAAAAAGTGCCGCAGCCGCAG
+AGTCCTCAAAAAACGCGGCGGCCACCAGTGCCGGTGCGGCGAAAACGTCAGAAACGAATG
+CTGCAGCGTCACAACAATCAGCCGCCACGTCTGCCTCCACCGCGGCCACGAAAGCGTCAG
+AGGCCGCCACTTCAGCACGAGATGCGGTGGCCTCAAAAGAGGCAGCAAAATCATCAGAAA
+CGAACGCATCATCAAGTGCCGGTCGTGCAGCTTCCTCGGCAACGGCGGCAGAAAATTCTG
+CCAGGGCGGCAAAAACGTCCGAGACGAATGCCAGGTCATCTGAAACAGCAGCGGAACGGA
+GCGCCTCTGCCGCGGCAGACGCAAAAACAGCGGCGGCGGGGAGTGCGTCAACGGCATCCA
+CGAAGGCGACAGAGGCTGCGGGAAGTGCGGTATCAGCATCGCAGAGCAAAAGTGCGGCAG
+AAGCGGCGGCAATACGTGCAAAAAATTCGGCAAAACGTGCAGAAGATATAGCTTCAGCTG
+TCGCGCTTGAGGATGCGGACACAACGAGAAAGGGGATAGTGCAGCTCAGCAGTGCAACCA
+ACAGCACGTCTGAAACGCTTGCTGCAACGCCAAAGGCGGTTAAGGTGGTAATGGATGAAA
+CGAACAGAAAAGCCCACTGGACAGTCCGGCACTGACCGGAACGCCAACAGCACCAACCGC
+GCTCAGGGGAACAAACAATACCCAGATTGCGAACACCGCTTTTGTACTGGCCGCGATTGC
+AGATGTTATCGACGCGTCACCTGACGCACTGAATACGCTGAATGAACTGGCCGCAGCGCT
+CGGGAATGATCCAGATTTTGCTACCACCATGACTAACGCGCTTGCGGGTAAACAACCGAA
+GAATGCGACACTGACGGCGCTGGCAGGGCTTTCCACGGCGAAAAATAAATTACCGTATTT
+TGCGGAAAATGATGCCGCCAGCCTGACTGAACTGACTCAGGTTGGCAGGGATATTCTGGC
+AAAAAATTCCGTTGCAGATGTTCTTGAATACCTTGGGGCCGGTGAGAATTCGGCCTTTCC
+GGCAGGTGCGCCGATCCCGTGGCCATCAGATATCGTTCCGTCTGGCTACGTCCTGATGCA
+GGGGCAGGCGTTTGACAAATCAGCCTACCCAAAACTTGCTGTCGCGTATCCATCGGGTGT
+GCTTCCTGATATGCGAGGCTGGACAATCAAGGGGAAACCCGCCAGCGGTCGTGCTGTATT
+GTCTCAGGAACAGGATGGAATTAAGTCGCACACCCACAGTGCCAGTGCATCCGGTACGGA
+TTTGGGGACGAAAACCACATCGTCGTTTGATTACGGGACGAAAACAACAGGCAGTTTCGA
+TTACGGCACCAAATCGACGAATAACACGGGGGCTCATGCTCACAGTCTGAGCGGTTCAAC
+AGGGGCCGCGGGTGCTCATGCCCACACAAGTGGTTTAAGGATGAACAGTTCTGGCTGGAG
+TCAGTATGGAACAGCAACCATTACAGGAAGTTTATCCACAGTTAAAGGAACCAGCACACA
+GGGTATTGCTTATTTATCGAAAACGGACAGTCAGGGCAGCCACAGTCACTCATTGTCCGG
+TACAGCCGTGAGTGCCGGTGCACATGCGCATACAGTTGGTATTGGTGCGCACCAGCATCC
+GGTTGTTATCGGTGCTCATGCCCATTCTTTCAGTATTGGTTCACACGGACACACCATCAC
+CGTTAACGCTGCGGGTAACGCGGAAAACACCGTCAAAAACATTGCATTTAACTATATTGT
+GAGGCTTGCATAATGGCATTCAGAATGAGTGAACAACCACGGACCATAAAAATTTATAAT
+CTGCTGGCCGGAACTAATGAATTTATTGGTGAAGGTGACGCATATATTCCGCCTCATACC
+GGTCTGCCTGCAAACAGTACCGATATTGCACCGCCAGATATTCCGGCTGGCTTTGTGGCT
+GTTTTCAACAGTGATGAGGCATCGTGGCATCTCGTTGAAGACCATCGGGGTAAAACCGTC
+TATGACGTGGCTTCCGGCGACGCGTTATTTATTTCTGAACTCGGTCCGTTACCGGAAAAT
+TTTACCTGGTTATCGCCGGGAGGGGAATATCAGAAGTGGAACGGCACAGCCTGGGTGAAG
+GATACGGAAGCAGAAAAACTGTTCCGGATCCGGGAGGCGGAAGAAACAAAAAAAAGCCTG
+ATGCAGGTAGCCAGTGAGCATATTGCGCCGCTTCAGGATGCTGCAGATCTGGAAATTGCA
+ACGAAGGAAGAAACCTCGTTGCTGGAAGCCTGGAAGAAGTATCGGGTGTTGCTGAACCGT
+GTTGATACATCAACTGCACCTGATATTGAGTGGCCTGCTGTCCCTGTTATGGAGTAATCG
+TTTTGTGATATGCCGCAGAAACGTTGTATGAAATAACGTTCTGCGGTTAGTTAGTATATT
+GTAAAGCTGAGTATTGGTTTATTTGGCGATTATTATCTTCAGGAGAATAATGGAAGTTCT
+ATGACTCAATTGTTCATAGTGTTTACATCACCGCCAATTGCTTTTAAGACTGAACGCATG
+AAATATGGTTTTTCGTCATGTTTTGAGTCTGCTGTTGATATTTCTAAAGTCGGTTTTTTT
+TCTTCGTTTTCTCTAACTATTTTCCATGAAATACATTTTTGATTATTATTTGAATCAATT
+CCAATTACCTGAAGTCTTTCATCTATAATTGGCATTGTATGTATTGGTTTATTGGAGTAG
+ATGCTTGCTTTTCTGAGCCATAGCTCTGATATCCAAATGAAGCCATAGGCATTTGTTATT
+TTGGCTCTGTCAGCTGCATAACGCCAAAAAATATATTTATCTGCTTGATCTTCAAATGTT
+GTATTGATTAAATCAATTGGATGGAATTGTTTATCATAAAAAATTAATGTTTGAATGTGA
+TAACCGTCCTTTAAAAAAGTCGTTTCTGCAAGCTTGGCTGTATAGTCAACTAACTCTTCT
+GTCGAAGTGATATTTTTAGGCTTATCTACCAGTTTTAGACGCTCTTTAATATCTTCAGGA
+ATTATTTTATTGTCATATTGTATCATGCTAAATGACAATTTGCTTATGGAGTAATCTTTT
+AATTTTAAATAAGTTATTCTCCTGGCTTCATCAAATAAAGAGTCGAATGATGTTGGCGAA
+ATCACATCGTCACCCATTGGATTGTTTATTTGTATGCCAAGAGAGTTACAGCAGTTATAC
+ATTCTGCCATAGATTATAGCTAAGGCATGTAATAATTCGTAATCTTTTAGCGTATTAGCG
+ACCCATCGTCTTTCTGATTTAATAATAGATGATTCAGTTAAATATGAAGGTAATTTCTTT
+TGTGCAAGTCTGACTAACTTTTTTATACCAATGTTTAACATACTTTCATTTGTAATAAAC
+TCAATGTCATTTTCTTCAATGTAAGATGAAATAAGAGTAGCCTTTGCCTCGCTATACATT
+TCTAAATCGCCTTGTTTTTCTATCGTATTGCGAGAATTTTTAGCCCAAGCCATTAATGGA
+TCATTTTTCCATTTTTCAATAACATTATTGTTATACCAAATGTCATATCCTATAATCTGG
+TTTTTGTTTTTTTGAATAATAAATGTTACTGTTCTTGCGGTTTGGAGGAATTGATTCAAA
+TTCAAGCGAAATAATTCAGGGTCAAAATATGTATCAATGCAGCATTTGAGCAAGTGCGAT
+AAATCTTTAAGTCTTCTTTCCCATGGTTTTTTAGTCATAAAACTCTCCATTTTGATAGGT
+TGCATGCTAGATGCTGATATATTTTAGAGGTGATAAAATTAACTGCTTAACTGTCAATGT
+AATACAAGTTGTTTGATCTTTGCAATGATTCTTATCAGAAACCATATAGTAAATTAGTTA
+CACAGGAAATTTTTAATATTATTATTATCATTCATTATGTATTAAAATTAGAGTTGTGGC
+TTGGCTCTGCTAACACGTTGCTCATAGGAGATATGGTAGAGCCGCAGACACGTCGTATGC
+AGGAACGTGCTGCGGCTGGCTGGTGAACTTCCGATAGTGCGGGTGTTGAATGATTTCCAG
+TTGCTACCGATTTTACATATTTTTTGCATGAGAGAATTTGTACCACCTCCCACCGACCAT
+CTATGACTGTACGCCACTGTCCCTAGGACTGCTATGTGCCGGAGCGGACATTACAAACGT
+CCTTCTCGGTGCATGCCACTGTTGCCAATGACCTGCCTAGGAATTGGTTAGCAAGTTACT
+ACCGGATTTTGTAAAAACAGCCCTCCTCATATAAAAAGTATTCGTTCACTTCCGATAAGC
+GTCGTAATTTTCTATCTTTCATCATATTCTAGATCCCTCTGAAAAAATCTTCCGAGTTTG
+CTAGGCACTGATACATAACTCTTTTCCAATAATTGGGGAAGTCATTCAAATCTATAATAG
+GTTTCAGATTTGCTTCAATAAATTCTGACTGTAGCTGCTGAAACGTTGCGGTTGAACTAT
+ATTTCCTTATAACTTTTACGAAAGAGTTTCTTTGAGTAATCACTTCACTCAAGTGCTTCC
+CTGCCTCCAAACGATACCTGTTAGCAATATTTAATAGCTTGAAATGATGAAGAGCTCTGT
+GTTTGTCTTCCTGCCTCCAGTTCGCCGGGCATTCAACATAAAAACTGATAGCACCCGGAG
+TTCCGGAAACGAAATTTGCATATACCCATTGCTCACGAAAAAAAATGTCCTTGTCGATAT
+AGGGATGAATCGCTTGGTGTACCTCATCTACTGCGAAAACTTGACCTTTCTCTCCCATAT
+TGCAGTCGCGGCACGATGGAACTAAATTAATAGGCATCACCGAAAATTCAGGATAATGTG
+CAATAGGAAGAAAATGATCTATATTTTTTGTCTGTCCTATATCACCACAAAATGGACATT
+TTTCACCTGATGAAACAAGCATGTCATCGTAATATGTTCTAGCGGGTTTGTTTTTATCTC
+GGAGATTATTTTCATAAAGCTTTTCTAATTTAACCTTTGTCAGGTTACCAACTACTAAGG
+TTGTAGGCTCAAGAGGGTGTGTCCTGTCGTAGGTAAATAACTGACCTGTCGAGCTTAATA
+TTCTATATTGTTGTTCTTTCTGCAAAAAAGTGGGGAAGTGAGTAATGAAATTATTTCTAA
+CATTTATCTGCATCATACCTTCCGAGCATTTATTAAGCATTTCGCTATAAGTTCTCGCTG
+GAAGAGGTAGTTTTTTCATTGTACTTTACCTTCATCTCTGTTCATTATCATCGCTTTTAA
+AACGGTTCGACCTTCTAATCCTATCTGACCATTATAATTTTTTAGAATGGTTTCATAAGA
+AAGCTCTGAATCAACGGACTGCGATAATAAGTGGTGGTATCCAGAATTTGTCACTTCAAG
+TAAAAACACCTCACGAGTTAAAACACCTAAGTTCTCACCGAATGTCTCAATATCCGGACG
+GATAATATTTATTGCTTCTCTTGACCGTAGGACTTTCCACATGCAGGATTTTGGAACCTC
+TTGCAGTACTACTGGGGAATGAGTTGCAATTATTGCTACACCATTGCGTGCATCGAGTAA
+GTCGCTTAATGTTCGTAAAAAAGCAGAGAGCAAAGGTGGATGCAGATGAACCTCTGGTTC
+ATCGAATAAAACTAATGACTTTTCGCCAACGACATCTACTAATCTTGTGATAGTAAATAA
+AACAATTGCATGTCCAGAGCTCATTCGAAGCAGATATTTCTGGATATTGTCATAAAACAA
+TTTAGTGAATTTATCATCGTCCACTTGAATCTGTGGTTCATTACGTCTTAACTCTTCATA
+TTTAGAAATGAGGCTGATGAGTTCCATATTTGAAAAGTTTTCATCACTACTTAGTTTTTT
+GATAGCTTCAAGCCAGAGTTGTCTTTTTCTATCTACTCTCATACAACCAATAAATGCTGA
+AATGAATTCTAAGCGGAGATCGCCTAGTGATTTTAAACTATTGCTGGCAGCATTCTTGAG
+TCCAATATAAAAGTATTGTGTACCTTTTGCTGGGTCAGGTTGTTCTTTAGGAGGAGTAAA
+AGGATCAAATGCACTAAACGAAACTGAAACAAGCGATCGAAAATATCCCTTTGGGATTCT
+TGACTCGATAAGTCTATTATTTTCAGAGAAAAAATATTCATTGTTTTCTGGGTTGGTGAT
+TGCACCAATCATTCCATTCAAAATTGTTGTTTTACCACACCCATTCCGCCCGATAAAAGC
+ATGAATGTTCGTGCTGGGCATAGAATTAACCGTCACCTCAAAAGGTATAGTTAAATCACT
+GAATCCGGGAGCACTTTTTCTATTAAATGAAAAGTGGAAATCTGACAATTCTGGCAAACC
+ATTTAACACACGTGCGAACTGTCCATGAATTTCTGAAAGAGTTACCCCTCTAAGTAATGA
+GGTGTTAAGGACGCTTTCATTTTCAATGTCGGCTAATCGATTTGGCCATACTACTAAATC
+CTGAATAGCTTTAAGAAGGTTATGTTTAAAACCATCGCTTAATTTGCTGAGATTAACATA
+GTAGTCAATGCTTTCACCTAAGGAAAAAAACATTTCAGGGAGTTGACTGAATTTTTTATC
+TATTAATGAATAAGTGCTTACTTCTTCTTTTTGACCTACAAAACCAATTTTAACATTTCC
+GATATCGCATTTTTCACCATGCTCATCAAAGACAGTAAGATAAAACATTGTAACAAAGGA
+ATAGTCATTCCAACCATCTGCTCGTAGGAATGCCTTATTTTTTTCTACTGCAGGAATATA
+CCCGCCTCTTTCAATAACACTAAACTCCAACATATAGTAACCCTTAATTTTATTAAAATA
+ACCGCAATTTATTTGGCGGCAACACAGGATCTCTCTTTTAAGTTACTCTCTATTACATAC
+GTTTTCCATCTAAAAATTAGTAGTATTGAACTTAACGGGGCATCGTATTGTAGTTTTCCA
+TATTTAGCTTTCTGCTTCCTTTTGGATAACCCACTGTTATTCATGTTGCATGGTGCACTG
+TTTATACCAACGATATAGTCTATTAATGCATATATAGTATCGCCGAACGATTAGCTCTTC
+AGGCTTCTGAAGAAGCGTTTCAAGTACTAATAAGCCGATAGATAGCCACGGACTTCGTAG
+CCATTTTTCATAAGTGTTAACTTCCGCTCCTCGCTCATAACAGACATTCACTACAGTTAT
+GGCGGAAAGGTATGCATGCTGGGTGTGGGGAAGTCGTGAAAGAAAAGAAGTCAGCTGCGT
+CGTTTGACATCACTGCTATCTTCTTACTGGTTATGCAGGTCGTAGTGGGTGGCACACAAA
+GCTTTGCACTGGATTGCGAGGCTTTGTGCTTCTCTGGAGTGCGACAGGTTTGATGACAAA
+AAATTAGCGCAAGAAGACAAAAATCACCTTGCGCTAATGCTCTGTTACAGGTCACTAATA
+CCATCTAAGTAGTTGATTCATAGTGACTGCATATGTTGTGTTTTACAGTATTATGTAGTC
+TGTTTTTTATGCAAAATCTAATTTAATATATTGATATTTATATCATTTTACGTTTCTCGT
+TCAGCTTTTTTATACTAAGTTGGCATTATAAAAAAGCATTGCTTATCAATTTGTTGCAAC
+GAACAGGTCACTATCAGTCAAAATAAAATCATTATTTGATTTCAATTTTGTCCCACTCCC
+TGCCTCTGTCATCACGATACTGTGATGCCATGGTGTCCGACTTATGCCCGAGAAGATGTT
+GAGCAAACTTATCGCTTATCTGCTTCTCATAGAGTCTTGCAGACAAACTGCGCAACTCGT
+GAAAGGTAGGCGGATCCCCTTCGAAGGAAAGACCTGATGCTTTTCGTGCGCGCATAAAAT
+ACCTTGATACTGTGCCGGATGAAAGCGGTTCGCGACGAGTAGATGCAATTATGGTTTCTC
+CGCCAAGAATCTCTTTGCATTTATCAAGTGTTTCCTTCATTGATATTCCGAGAGCATCAA
+TATGCAATGCTGTTGGGATGGCAATTTTTACGCCTGTTTTGCTTTGCTCGACATAAAGAT
+ATCCATCTACGATATCAGACCACTTCATTTCGCATAAATCACCAACTCGTTGCCCGGTAA
+CAACAGCCAGTTCCATTGCAAGTCTGAGCCAACATGGTGATGATTCTGCTGCTTGATAAA
+TTTTCAGGTATTCGTCAGCCGTAAGTCTTGATCTCCTTACCTCTGATTTTGCTGCGCGAG
+TGGCAGCGACATGGTTTGTTGTTATATGGCCTTCAGCTATTGCCTCTCGGAATGCATCGC
+TCAGTGTTGATCTGATTAACTTGGCTGACGCCGCCTTGCCCTCGTCTATGTATCCATTGA
+GCATTGCCGCAATTTCTTTTGTGGTGATGTCTTCAAGTGGAGCATCAGGCAGACCCCTCC
+TTATTGCTTTAATTTTGCTCATGTAATTTATGAGTGTCTTCTGCTTGATTCCTCTGCTGG
+CCAGGATTTTTTCGTAGCGATCAAGCCATGAATGTAACGTAACGGAATTATCACTGTTGA
+TTCTCGCTGTCAGAGGCTTGTGTTTGTGTCCTGAAAATAACTCAATGTTGGCCTGTATAG
+CTTCAGTGATTGCGATTCGCCTGTCTCTGCCTAATCCAAACTCTTTACCCGTCCTTGGGT
+CCCTGTAGCAGTAATATCCATTGTTTCTTATATAAAGGTTAGGGGGTAAATCCCGGCGCT
+CATGACTTCGCCTTCTTCCCATTTCTGATCCTCTTCAAAAGGCCACCTGTTACTGGTCGA
+TTTAAGTCAACCTTTACCGCTGATTCGTGGAACAGATACTCTCTTCCATCCTTAACCGGA
+GGTGGGAATATCCTGCATTCCCGAACCCATCGACGAACTGTTTCAAGGCTTCTTGGACGT
+CGCTGGCGTGCGTTCCACTCCTGAAGTGTCAAGTACATCGCAAAGTCTCCGCAATTACAC
+GCAAGAAAAAACCGCCATCAGGCGGCTTGGTGTTCTTTCAGTTCTTCAATTCGAATATTG
+GTTACGTCTGCATGTGCTATCTGCGCCCATATCATCCAGTGGTCGTAGCAGTCGTTGATG
+TTCTCCGCTTCGATAACTCTGTTGAATGGCTCTCCATTCCATTCTCCTGTGACTCGGAAG
+TGCATTTATCATCTCCATAAAACAAAACCCGCCGTAGCGAGTTCAGATAAAATAAATCCC
+CGCGAGTGCGAGGATTGTTATGTAATATTGGGTTTAATCATCTATATGTTTTGTACAGAG
+AGGGCAAGTATCGTTTCCACCGTACTCGTGATAATAATTTTGCACGGTATCAGTCATTTC
+TCGCACATTGCAGAATGGGGATTTGTCTTCATTAGACTTATAAACCTTCATGGAATATTT
+GTATGCCGACTCTATATCTATACCTTCATCTACATAAACACCTTCGTGATGTCTGCATGG
+AGACAAGACACCGGATCTGCACAACATTGATAACGCCCAATCTTTTTGCTCAGACTCTAA
+CTCATTGATACTCATTTATAAACTCCTTGCAATGTATGTCGTTTCAGCTAAACGGTATCA
+GCAATGTTTATGTAAAGAAACAGTAAGATAATACTCAACCCGATGTTTGAGTACGGTCAT
+CATCTGACACTACAGACTCTGGCATCGCTGTGAAGACGACGCGAAATTCAGCATTTTCAC
+AAGCGTTATCTTTTACAAAACCGATCTCACTCTCCTTTGATGCGAATGCCAGCGTCAGAC
+ATCATATGCAGATACTCACCTGCATCCTGAACCCATTGACCTCCAACCCCGTAATAGCGA
+TGCGTAATGATGTCGATAGTTACTAACGGGTCTTGTTCGATTAACTGCCGCAGAAACTCT
+TCCAGGTCACCAGTGCAGTGCTTGATAACAGGAGTCTTCCCAGGATGGCGAACAACAAGA
+AACTGGTTTCCGTCTTCACGGACTTCGTTGCTTTCCAGTTTAGCAATACGCTTACTCCCA
+TCCGAGATAACACCTTCGTAATACTCACGCTGCTCGTTGAGTTTTGATTTTGCTGTTTCA
+AGCTCAACACGCAGTTTCCCTACTGTTAGCGCAATATCCTCGTTCTCCTGGTCGCGGCGT
+TTGATGTATTGCTGGTTTCTTTCCCGTTCATCCAGCAGTTCCAGCACAATCGATGGTGTT
+ACCAATTCATGGAAAAGGTCTGCGTCAAATCCCCAGTCGTCATGCATTGCCTGCTCTGCC
+GCTTCACGCAGTGCCTGAGAGTTAATTTCGCTCACTTCGAACCTCTCTGTTTACTGATAA
+GTTCCAGATCCTCCTGGCAACTTGCACAAGTCCGACAACCCTGAACGACCAGGCGTCTTC
+GTTCATCTATCGGATCGCCACACTCACAACAATGAGTGGCAGATATAGCCTGGTGGTTCA
+GGCGGCGCATTTTTATTGCTGTGTTGCGCTGTAATTCTTCTATTTCTGATGCTGAATCAA
+TGATGTCTGCCATCTTTCATTAATCCCTGAACTGTTGGTTAATACGCTTGAGGGTGAATG
+CGAATAATAAAAAAGGAGCCTGTAGCTCCCTGATGATTTTGCTTTTCATGTTCATCGTTC
+CTTAAAGACGCCGTTTAACATGCCGATTGCCAGGCTTAAATGAGTCGGTGTGAATCCCAT
+CAGCGTTACCGTTTCGCGGTGCTTCTTCAGTACGCTACGGCAAATGTCATCGACGTTTTT
+ATCCGGAAACTGCTGTCTGGCTTTTTTTGATTTCAGAATTAGCCTGACGGGCAATGCTGC
+GAAGGGCGTTTTCCTGCTGAGGTGTCATTGAACAAGTCCCATGTCGGCAAGCATAAGCAC
+ACAGAATATGAAGCCCGCTGCCAGAAAAATGCATTCCGTGGTTGTCATACCTGGTTTCTC
+TCATCTGCTTCTGCTTTCGCCACCATCATTTCCAGCTTTTGTGAAAGGGATGCGGCTAAC
+GTATGAAATTCTTCGTCTGTTTCTACTGGTATTGGCACAAACCTGATTCCAATTTGAGCA
+AGGCTATGTGCCATCTCGATACTCGTTCTTAACTCAACAGAAGATGCTTTGTGCATACAG
+CCCCTCGTTTATTATTTATCTCCTCAGCCAGCCGCTGTGCTTTCAGTGGATTTCGGATAA
+CAGAAAGGCCGGGAAATACCCAGCCTCGCTTTGTAACGGAGTAGACGAAAGTGATTGCGC
+CTACCCGGATATTATCGTGAGGATGCGTCATCGCCATTGCTCCCCAAATACAAAACCAAT
+TTCAGCCAGTGCCTCGTCCATTTTTTCGATGAACTCCGGCACGATCTCGTCAAAACTCGC
+CATGTACTTTTCATCCCGCTCAATCACGACATAATGCAGGCCTTCACGCTTCATACGCGG
+GTCATAGTTGGCAAAGTACCAGGCATTTTTTCGCGTCACCCACATGCTGTACTGCACCTG
+GGCCATGTAAGCTGACTTTATGGCCTCGAAACCACCGAGCCGGAACTTCATGAAATCCCG
+GGAGGTAAACGGGCATTTCAGTTCAAGGCCGTTGCCGTCACTGCATAAACCATCGGGAGA
+GCAGGCGGTACGCATACTTTCGTCGCGATAGATGATCGGGGATTCAGTAACATTCACGCC
+GGAAGTGAATTCAAACAGGGTTCTGGCGTCGTTCTCGTACTGTTTTCCCCAGGCCAGTGC
+TTTAGCGTTAACTTCCGGAGCCACACCGGTGCAAACCTCAGCAAGCAGGGTGTGGAAGTA
+GGACATTTTCATGTCAGGCCACTTCTTTCCGGAGCGGGGTTTTGCTATCACGTTGTGAAC
+TTCTGAAGCGGTGATGACGCCGAGCCGTAATTTGTGCCACGCATCATCCCCCTGTTCGAC
+AGCTCTCACATCGATCCCGGTACGCTGCAGGATAATGTCCGGTGTCATGCTGCCACCTTC
+TGCTCTGCGGCTTTCTGTTTCAGGAATCCAAGAGCTTTTACTGCTTCGGCCTGTGTCAGT
+TCTGACGATGCACGAATGTCGCGGCGAAATATCTGGGAACAGAGCGGCAATAAGTCGTCA
+TCCCATGTTTTATCCAGGGCGATCAGCAGAGTGTTAATCTCCTGCATGGTTTCATCGTTA
+ACCGGAGTGATGTCGCGTTCCGGCTGACGTTCTGCAGTGTATGCAGTATTTTCGACAATG
+CGCTCGGCTTCATCCTTGTCATAGATACCAGCAAATCCGAAGGCCAGACGGGCACACTGA
+ATCATGGCTTTATGACGTAACATCCGTTTGGGATGCGACTGCCACGGCCCCGTGATTTCT
+CTGCCTTCGCGAGTTTTGAATGGTTCGCGGCGGCATTCATCCATCCATTCGGTAACGCAG
+ATCGGATGATTACGGTCCTTGCGGTAAATCCGGCATGTACAGGATTCATTGTCCTGCTCA
+AAGTCCATGCCATCAAACTGCTGGTTTTCATTGATGATGCGGGACCAGCCATCAACGCCC
+ACCACCGGAACGATGCCATTCTGCTTATCAGGAAAGGCGTAAATTTCTTTCGTCCACGGA
+TTAAGGCCGTACTGGTTGGCAACGATCAGTAATGCGATGAACTGCGCATCGCTGGCATCA
+CCTTTAAATGCCGTCTGGCGAAGAGTGGTGATCAGTTCCTGTGGGTCGACAGAATCCATG
+CCGACACGTTCAGCCAGCTTCCCAGCCAGCGTTGCGAGTGCAGTACTCATTCGTTTTATA
+CCTCTGAATCAATATCAACCTGGTGGTGAGCAATGGTTTCAACCATGTACCGGATGTGTT
+CTGCCATGCGCTCCTGAAACTCAACATCGTCATCAAACGCACGGGTAATGGATTTTTTGC
+TGGCCCCGTGGCGTTGCAAATGATCGATGCATAGCGATTCAAACAGGTGCTGGGGCAGGC
+CTTTTTCCATGTCGTCTGCCAGTTCTGCCTCTTTCTCTTCACGGGCGAGCTGCTGGTAGT
+GACGCGCCCAGCTCTGAGCCTCAAGACGATCCTGAATGTAATAAGCGTTCATGGCTGAAC
+TCCTGAAATAGCTGTGAAAATATCGCCCGCGAAATGCCGGGCTGATTAGGAAAACAGGAA
+AGGGGGTTAGTGAATGCTTTTGCTTGATCTCAGTTTCAGTATTAATATCCATTTTTTATA
+AGCGTCGACGGCTTCACGAAACATCTTTTCATCGCCAATAAAAGTGGCGATAGTGAATTT
+AGTCTGGATAGCCATAAGTGTTTGATCCATTCTTTGGGACTCCTGGCTGATTAAGTATGT
+CGATAAGGCGTTTCCATCCGTCACGTAATTTACGGGTGATTCGTTCAAGTAAAGATTCGG
+AAGGGCAGCCAGCAACAGGCCACCCTGCAATGGCATATTGCATGGTGTGCTCCTTATTTA
+TACATAACGAAAAACGCCTCGAGTGAAGCGTTATTGGTATGCGGTAAAACCGCACTCAGG
+CGGCCTTGATAGTCATATCATCTGAATCAAATATTCCTGATGTATCGATATCGGTAATTC
+TTATTCCTTCGCTACCATCCATTGGAGGCCATCCTTCCTGACCATTTCCATCATTCCAGT
+CGAACTCACACACAACACCATATGCATTTAAGTCGCTTGAAATTGCTATAAGCAGAGCAT
+GTTGCGCCAGCATGATTAATACAGCATTTAATACAGAGCCGTGTTTATTGAGTCGGTATT
+CAGAGTCTGACCAGAAATTATTAATCTGGTGAAGTTTTTCCTCTGTCATTACGTCATGGT
+CGATTTCAATTTCTATTGATGCTTTCCAGTCGTAATCAATGATGTATTTTTTGATGTTTG
+ACATCTGTTCATATCCTCACAGATAAAAAATCGCCCTCACACTGGAGGGCAAAGAAGATT
+TCCAATAATCAGAACAAGTCGGCTCCTGTTTAGTTACGAGCGACATTGCTCCGTGTATTC
+ACTCGTTGGAATGAATACACAGTGCAGTGTTTATTCTGTTATTTATGCCAAAAATAAAGG
+CCACTATCAGGCAGCTTTGTTGTTCTGTTTACCAAGTTCTCTGGCAATCATTGCCGTCGT
+TCGTATTGCCCATTTATCGACATATTTCCCATCTTCCATTACAGGAAACATTTCTTCAGG
+CTTAACCATGCATTCCGATTGCAGCTTGCATCCATTGCATCGCTTGAATTGTCCACACCA
+TTGATTTTTATCAATAGTCGTAGTCATACGGATAGTCCTGGTATTGTTCCATCACATCCT
+GAGGATGCTCTTCGAACTCTTCAAATTCTTCTTCCATATATCACCTTAAATAGTGGATTG
+CGGTAGTAAAGATTGTGCCTGTCTTTTAACCACATCAGGCTCGGTGGTTCTCGTGTACCC
+CTACAGCGAGAAATCGGATAAACTATTACAACCCCTACAGTTTGATGAGTATAGAAATGG
+ATCCACTCGTTATTCTCGGACGAGTGTTCAGTAATGAACCTCTGGAGAGAACCATGTATA
+TGATCGTTATCTGGGTTGGACTTCTGCTTTTAAGCCCAGATAACTGGCCTGAATATGTTA
+ATGAGAGAATCGGTATTCCTCATGTGTGGCATGTTTTCGTCTTTGCTCTTGCATTTTCGC
+TAGCAATTAATGTGCATCGATTATCAGCTATTGCCAGCGCCAGATATAAGCGATTTAAGC
+TAAGAAAACGCATTAAGATGCAAAACGATAAAGTGCGATCAGTAATTCAAAACCTTACAG
+AAGAGCAATCTATGGTTTTGTGCGCAGCCCTTAATGAAGGCAGGAAGTATGTGGTTACAT
+CAAAACAATTCCCATACATTAGTGAGTTGATTGAGCTTGGTGTGTTGAACAAAACTTTTT
+CCCGATGGAATGGAAAGCATATATTATTCCCTATTGAGGATATTTACTGGACTGAATTAG
+TTGCCAGCTATGATCCATATAATATTGAGATAAAGCCAAGGCCAATATCTAAGTAACTAG
+ATAAGAGGAATCGATTTTCCCTTAATTTTCTGGCGTCCACTGCATGTTATGCCGCGTTCG
+CCAGGCTTGCTGTACCATGTGCGCTGATTCTTGCGCTCAATACGTTGCAGGTTGCTTTCA
+ATCTGTTTGTGGTATTCAGCCAGCACTGTAAGGTCTATCGGATTTAGTGCGCTTTCTACT
+CGTGATTTCGGTTTGCGATTCAGCGAGAGAATAGGGCGGTTAACTGGTTTTGCGCTTACC
+CCAACCAACAGGGGATTTGCTGCTTTCCATTGAGCCTGTTTCTCTGCGCGACGTTCGCGG
+CGGCGTGTTTGTGCATCCATCTGGATTCTCCTGTCAGTTAGCTTTGGTGGTGTGTGGCAG
+TTGTAGTCCTGAACGAAAACCCCCCGCGATTGGCACATTGGCAGCTAATCCGGAATCGCA
+CTTACGGCCAATGCTTCGTTTCGTATCACACACCCCAAAGCCTTCTGCTTTGAATGCTGC
+CCTTCTTCAGGGCTTAATTTTTAAGAGCGTCACCTTCATGGTGGTCAGTGCGTCCTGCTG
+ATGTGCTCAGTATCACCGCCAGTGGTATTTATGTCAACACCGCCAGAGATAATTTATCAC
+CGCAGATGGTTATCTGTATGTTTTTTATATGAATTTATTTTTTGCAGGGGGGCATTGTTT
+GGTAGGTGAGAGATCTGAATTGCTATGTTTAGTGAGTTGTATCTATTTATTTTTCAATAA
+ATACAATTGGTTATGTGTTTTGGGGGCGATCGTGAGGCAAAGAAAACCCGGCGCTGAGGC
+CGGGTTATTCTTGTTCTCTGGTCAAATTATATAGTTGGAAAACAAGGATGCATATATGAA
+TGAACGATGCAGAGGCAATGCCGATGGCGATAGTGGGTATCATGTAGCCGCTTATGCTGG
+AAAGAAGCAATAACCCGCAGAAAAACAAAGCTCCAAGCTCAACAAAACTAAGGGCATAGA
+CAATAACTACCGATGTCATATACCCATACTCTCTAATCTTGGCCAGTCGGCGCGTTCTGC
+TTCCGATTAGAAACGTCAAGGCAGCAATCAGGATTGCAATCATGGTTCCTGCATATGATG
+ACAATGTCGCCCCAAGACCATCTCTATGAGCTGAAAAAGAAACACCAGGAATGTAGTGGC
+GGAAAAGGAGATAGCAAATGCTTACGATAACGTAAGGAATTATTACTATGTAAACACCAG
+GCATGATTCTGTTCCGCATAATTACTCCTGATAATTAATCCTTAACTTTGCCCACCTGCC
+TTTTAAAACATTCCAGTATATCACTTTTCATTCTTGCGTAGCAATATGCCATCTCTTCAG
+CTATCTCAGCATTGGTGACCTTGTTCAGAGGCGCTGAGAGATGGCCTTTTTCTGATAGAT
+AATGTTCTGTTAAAATATCTCCGGCCTCATCTTTTGCCCGCAGGCTAATGTCTGAAAATT
+GAGGTGACGGGTTAAAAATAATATCCTTGGCAACCTTTTTTATATCCCTTTTAAATTTTG
+GCTTAATGACTATATCCAATGAGTCAAAAAGCTCCCCTTCAATATCTGTTGCCCCTAAGA
+CCTTTAATATATCGCCAAATACAGGTAGCTTGGCTTCTACCTTCACCGTTGTTCGGCCGA
+TGAAATGCATATGCATAACATCGTCTTTGGTGGTTCCCCTCATCAGTGGCTCTATCTGAA
+CGCGCTCTCCACTGCTTAATGACATTCCTTTCCCGATTAAAAAATCTGTCAGATCGGATG
+TGGTCGGCCCGAAAACAGTTCTGGCAAAACCAATGGTGTCGCCTTCAACAAACAAAAAAG
+ATGGGAATCCCAATGATTCGTCATCTGCGAGGCTGTTCTTAATATCTTCAACTGAAGCTT
+TAGAGCGATTTATCTTCTGAACCAGACTCTTGTCATTTGTTTTGGTAAAGAGAAAAGTTT
+TTCCATCGATTTTATGAATATACAAATAATTGGAGCCAACCTGCAGGTGATGATTATCAG
+CCAGCAGAGAATTAAGGAAAACAGACAGGTTTATTGAGCGCTTATCTTTCCCTTTATTTT
+TGCTGCGGTAAGTCGCATAAAAACCATTCTTCATAATTCAATCCATTTACTATGTTATGT
+TCTGAGGGGAGTGAAAATTCCCCTAATTCGATGAAGATTCTTGCTCAATTGTTATCAGCT
+ATGCGCCGACCAGAACACCTTGCCGATCAGCCAAACGTCTCTTCAGGCCACTGACTAGCG
+ATAACTTTCCCCACAACGGAACAACTCTCATTGCATGGGATCATTGGGTACTGTGGGTTT
+AGTGGTTGTAAAAACACCTGACCGCTATCCCTGATCAGTTTCTTGAAGGTAAACTCATCA
+CCCCCAAGTCTGGCTATGCAGAAATCACCTGGCTCAACAGCCTGCTCAGGGTCAACGAGA
+ATTAACATTCCGTCAGGAAAGCTTGGCTTGGAGCCTGTTGGTGCGGTCATGGAATTACCT
+TCAACCTCAAGCCAGAATGCAGAATCACTGGCTTTTTTGGTTGTGCTTACCCATCTCTCC
+GCATCACCTTTGGTAAAGGTTCTAAGCTTAGGTGAGAACATCCCTGCCTGAACATGAGAA
+AAAACAGGGTACTCATACTCACTTCTAAGTGACGGCTGCATACTAACCGCTTCATACATC
+TCGTAGATTTCTCTGGCGATTGAAGGGCTAAATTCTTCAACGCTAACTTTGAGAATTTTT
+GTAAGCAATGCGGCGTTATAAGCATTTAATGCATTGATGCCATTAAATAAAGCACCAACG
+CCTGACTGCCCCATCCCCATCTTGTCTGCGACAGATTCCTGGGATAAGCCAAGTTCATTT
+TTCTTTTTTTCATAAATTGCTTTAAGGCGACGTGCGTCCTCAAGCTGCTCTTGTGTTAAT
+GGTTTCTTTTTTGTGCTCATACGTTAAATCTATCACCGCAAGGGATAAATATCTAACACC
+GTGCGTGTTGACTATTTTACCTCTGGCGGTGATAATGGTTGCATGTACTAAGGAGGTTGT
+ATGGAACAACGCATAACCCTGAAAGATTATGCAATGCGCTTTGGGCAAACCAAGACAGCT
+AAAGATCTCGGCGTATATCAAAGCGCGATCAACAAGGCCATTCATGCAGGCCGAAAGATT
+TTTTTAACTATAAACGCTGATGGAAGCGTTTATGCGGAAGAGGTAAAGCCCTTCCCGAGT
+AACAAAAAAACAACAGCATAAATAACCCCGCTCTTACACATTCCAGCCCTGAAAAAGGGC
+ATCAAATTAAACCACACCTATGGTGTATGCATTTATTTGCATACATTCAATCAATTGTTA
+TCTAAGGAAATACTTACATATGGTTCGTGCAAACAAACGCAACGAGGCTCTACGAATCGA
+GAGTGCGTTGCTTAACAAAATCGCAATGCTTGGAACTGAGAAGACAGCGGAAGCTGTGGG
+CGTTGATAAGTCGCAGATCAGCAGGTGGAAGAGGGACTGGATTCCAAAGTTCTCAATGCT
+GCTTGCTGTTCTTGAATGGGGGGTCGTTGACGACGACATGGCTCGATTGGCGCGACAAGT
+TGCTGCGATTCTCACCAATAAAAAACGCCCGGCGGCAACCGAGCGTTCTGAACAAATCCA
+GATGGAGTTCTGAGGTCATTACTGGATCTATCAACAGGAGTCATTATGACAAATACAGCA
+AAAATACTCAACTTCGGCAGAGGTAACTTTGCCGGACAGGAGCGTAATGTGGCAGATCTC
+GATGATGGTTACGCCAGACTATCAAATATGCTGCTTGAGGCTTATTCGGGCGCAGATCTG
+ACCAAGCGACAGTTTAAAGTGCTGCTTGCCATTCTGCGTAAAACCTATGGGTGGAATAAA
+CCAATGGACAGAATCACCGATTCTCAACTTAGCGAGATTACAAAGTTACCTGTCAAACGG
+TGCAATGAAGCCAAGTTAGAACTCGTCAGAATGAATATTATCAAGCAGCAAGGCGGCATG
+TTTGGACCAAATAAAAACATCTCAGAATGGTGCATCCCTCAAAACGAGGGAAAATCCCCT
+AAAACGAGGGATAAAACATCCCTCAAATTGGGGGATTGCTATCCCTCAAAACAGGGGGAC
+ACAAAAGACACTATTACAAAAGAAAAAAGAAAAGATTATTCGTCAGAGAATTCTGGCGAA
+TCCTCTGACCAGCCAGAAAACGACCTTTCTGTGGTGAAACCGGATGCTGCAATTCAGAGC
+GGCAGCAAGTGGGGGACAGCAGAAGACCTGACCGCCGCAGAGTGGATGTTTGACATGGTG
+AAGACTATCGCACCATCAGCCAGAAAACCGAATTTTGCTGGGTGGGCTAACGATATCCGC
+CTGATGCGTGAACGTGACGGACGTAACCACCGCGACATGTGTGTGCTGTTCCGCTGGGCA
+TGCCAGGACAACTTCTGGTCCGGTAACGTGCTGAGCCCGGCCAAACTCCGCGATAAGTGG
+ACCCAACTCGAAATCAACCGTAACAAGCAACAGGCAGGCGTGACAGCCAGCAAACCAAAA
+CTCGACCTGACAAACACAGACTGGATTTACGGGGTGGATCTATGAAAAACATCGCCGCAC
+AGATGGTTAACTTTGACCGTGAGCAGATGCGTCGGATCGCCAACAACATGCCGGAACAGT
+ACGACGAAAAGCCGCAGGTACAGCAGGTAGCGCAGATCATCAACGGTGTGTTCAGCCAGT
+TACTGGCAACTTTCCCGGCGAGCCTGGCTAACCGTGACCAGAACGAAGTGAACGAAATCC
+GTCGCCAGTGGGTTCTGGCTTTTCGGGAAAACGGGATCACCACGATGGAACAGGTTAACG
+CAGGAATGCGCGTAGCCCGTCGGCAGAATCGACCATTTCTGCCATCACCCGGGCAGTTTG
+TTGCATGGTGCCGGGAAGAAGCATCCGTTACCGCCGGACTGCCAAACGTCAGCGAGCTGG
+TTGATATGGTTTACGAGTATTGCCGGAAGCGAGGCCTGTATCCGGATGCGGAGTCTTATC
+CGTGGAAATCAAACGCGCACTACTGGCTGGTTACCAACCTGTATCAGAACATGCGGGCCA
+ATGCGCTTACTGATGCGGAATTACGCCGTAAGGCCGCAGATGAGCTTGTCCATATGACTG
+CGAGAATTAACCGTGGTGAGGCGATCCCTGAACCAGTAAAACAACTTCCTGTCATGGGCG
+GTAGACCTCTAAATCGTGCACAGGCTCTGGCGAAGATCGCAGAAATCAAAGCTAAGTTCG
+GACTGAAAGGAGCAAGTGTATGACGGGCAAAGAGGCAATTATTCATTACCTGGGGACGCA
+TAATAGCTTCTGTGCGCCGGACGTTGCCGCGCTAACAGGCGCAACAGTAACCAGCATAAA
+TCAGGCCGCGGCTAAAATGGCACGGGCAGGTCTTCTGGTTATCGAAGGTAAGGTCTGGCG
+AACGGTGTATTACCGGTTTGCTACCAGGGAAGAACGGGAAGGAAAGATGAGCACGAACCT
+GGTTTTTAAGGAGTGTCGCCAGAGTGCCGCGATGAAACGGGTATTGGCGGTATATGGAGT
+TAAAAGATGACCATCTACATTACTGAGCTAATAACAGGCCTGCTGGTAATCGCAGGCCTT
+TTTATTTGGGGGAGAGGGAAGTCATGAAAAAACTAACCTTTGAAATTCGATCTCCAGCAC
+ATCAGCAAAACGCTATTCACGCAGTACAGCAAATCCTTCCAGACCCAACCAAACCAATCG
+TAGTAACCATTCAGGAACGCAACCGCAGCTTAGACCAAAACAGGAAGCTATGGGCCTGCT
+TAGGTGACGTCTCTCGTCAGGTTGAATGGCATGGTCGCTGGCTGGATGCAGAAAGCTGGA
+AGTGTGTGTTTACCGCAGCATTAAAGCAGCAGGATGTTGTTCCTAACCTTGCCGGGAATG
+GCTTTGTGGTAATAGGCCAGTCAACCAGCAGGATGCGTGTAGGCGAATTTGCGGAGCTAT
+TAGAGCTTATACAGGCATTCGGTACAGAGCGTGGCGTTAAGTGGTCAGACGAAGCGAGAC
+TGGCTCTGGAGTGGAAAGCGAGATGGGGAGACAGGGCTGCATGATAAATGTCGTTAGTTT
+CTCCGGTGGCAGGACGTCAGCATATTTGCTCTGGCTAATGGAGCAAAAGCGACGGGCAGG
+TAAAGACGTGCATTACGTTTTCATGGATACAGGTTGTGAACATCCAATGACATATCGGTT
+TGTCAGGGAAGTTGTGAAGTTCTGGGATATACCGCTCACCGTATTGCAGGTTGATATCAA
+CCCGGAGCTTGGACAGCCAAATGGTTATACGGTATGGGAACCAAAGGATATTCAGACGCG
+AATGCCTGTTCTGAAGCCATTTATCGATATGGTAAAGAAATATGGCACTCCATACGTCGG
+CGGCGCGTTCTGCACTGACAGATTAAAACTCGTTCCCTTCACCAAATACTGTGATGACCA
+TTTCGGGCGAGGGAATTACACCACGTGGATTGGCATCAGAGCTGATGAACCGAAGCGGCT
+AAAGCCAAAGCCTGGAATCAGATATCTTGCTGAACTGTCAGACTTTGAGAAGGAAGATAT
+CCTCGCATGGTGGAAGCAACAACCATTCGATTTGCAAATACCGGAACATCTCGGTAACTG
+CATATTCTGCATTAAAAAATCAACGCAAAAAATCGGACTTGCCTGCAAAGATGAGGAGGG
+ATTGCAGCGTGTTTTTAATGAGGTCATCACGGGATCCCATGTGCGTGACGGACATCGGGA
+AACGCCAAAGGAGATTATGTACCGAGGAAGAATGTCGCTGGACGGTATCGCGAAAATGTA
+TTCAGAAAATGATTATCAAGCCCTGTATCAGGACATGGTACGAGCTAAAAGATTCGATAC
+CGGCTCTTGTTCTGAGTCATGCGAAATATTTGGAGGGCAGCTTGATTTCGACTTCGGGAG
+GGAAGCTGCATGATGCGATGTTATCGGTGCGGTGAATGCAAAGAAGATAACCGCTTCCGA
+CCAAATCAACCTTACTGGAATCGATGGTGTCTCCGGTGTGAAAGAACACCAACAGGGGTG
+TTACCACTACCGCAGGAAAAGGAGGACGTGTGGCGAGACAGCGACGAAGTATCACCGACA
+TAATCTGCGAAAACTGCAAATACCTTCCAACGAAACGCACCAGAAATAAACCCAAGCCAA
+TCCCAAAAGAATCTGACGTAAAAACCTTCAACTACACGGCTCACCTGTGGGATATCCGGT
+GGCTAAGACGTCGTGCGAGGAAAACAAGGTGATTGACCAAAATCGAAGTTACGAACAAGA
+AAGCGTCGAGCGAGCTTTAACGTGCGCTAACTGCGGTCAGAAGCTGCATGTGCTGGAAGT
+TCACGTGTGTGAGCACTGCTGCGCAGAACTGATGAGCGATCCGAATAGCTCGATGCACGA
+GGAAGAAGATGATGGCTAAACCAGCGCGAAGACGATGTAAAAACGATGAATGCCGGGAAT
+GGTTTCACCCTGCATTCGCTAATCAGTGGTGGTGCTCTCCAGAGTGTGGAACCAAGATAG
+CACTCGAACGACGAAGTAAAGAACGCGAAAAAGCGGAAAAAGCAGCAGAGAAGAAACGAC
+GACGAGAGGAGCAGAAACAGAAAGATAAACTTAAGATTCGAAAACTCGCCTTAAAGCCCC
+GCAGTTACTGGATTAAACAAGCCCAACAAGCCGTAAACGCCTTCATCAGAGAAAGAGACC
+GCGACTTACCATGTATCTCGTGCGGAACGCTCACGTCTGCTCAGTGGGATGCCGGACATT
+ACCGGACAACTGCTGCGGCACCTCAACTCCGATTTAATGAACGCAATATTCACAAGCAAT
+GCGTGGTGTGCAACCAGCACAAAAGCGGAAATCTCGTTCCGTATCGCGTCGAACTGATTA
+GCCGCATCGGGCAGGAAGCAGTAGACGAAATCGAATCAAACCATAACCGCCATCGCTGGA
+CTATCGAAGAGTGCAAGGCGATCAAGGCAGAGTACCAACAGAAACTCAAAGACCTGCGAA
+ATAGCAGAAGTGAGGCCGCATGACGTTCTCAGTAAAAACCATTCCAGACATGCTCGTTGA
+AACATACGGAAATCAGACAGAAGTAGCACGCAGACTGAAATGTAGTCGCGGTACGGTCAG
+AAAATACGTTGATGATAAAGACGGGAAAATGCACGCCATCGTCAACGACGTTCTCATGGT
+TCATCGCGGATGGAGTGAAAGAGATGCGCTATTACGAAAAAATTGATGGCAGCAAATACC
+GAAATATTTGGGTAGTTGGCGATCTGCACGGATGCTACACGAACCTGATGAACAAACTGG
+ATACGATTGGATTCGACAACAAAAAAGACCTGCTTATCTCGGTGGGCGATTTGGTTGATC
+GTGGTGCAGAGAACGTTGAATGCCTGGAATTAATCACATTCCCCTGGTTCAGAGCTGTAC
+GTGGAAACCATGAGCAAATGATGATTGATGGCTTATCAGAGCGTGGAAACGTTAATCACT
+GGCTGCTTAATGGCGGTGGCTGGTTCTTTAATCTCGATTACGACAAAGAAATTCTGGCTA
+AAGCTCTTGCCCATAAAGCAGATGAACTTCCGTTAATCATCGAACTGGTGAGCAAAGATA
+AAAAATATGTTATCTGCCACGCCGATTATCCCTTTGACGAATACGAGTTTGGAAAGCCAG
+TTGATCATCAGCAGGTAATCTGGAACCGCGAACGAATCAGCAACTCACAAAACGGGATCG
+TGAAAGAAATCAAAGGCGCGGACACGTTCATCTTTGGTCATACGCCAGCAGTGAAACCAC
+TCAAGTTTGCCAACCAAATGTATATCGATACCGGCGCAGTGTTCTGCGGAAACCTAACAT
+TGATTCAGGTACAGGGAGAAGGCGCATGAGACTCGAAAGCGTAGCTAAATTTCATTCGCC
+AAAAAGCCCGATGATGAGCGACTCACCACGGGCCACGGCTTCTGACTCTCTTTCCGGTAC
+TGATGTGATGGCTGCTATGGGGATGGCGCAATCACAAGCCGGATTCGGTATGGCTGCATT
+CTGCGGTAAGCACGAACTCAGCCAGAACGACAAACAAAAGGCTATCAACTATCTGATGCA
+ATTTGCACACAAGGTATCGGGGAAATACCGTGGTGTGGCAAAGCTTGAAGGAAATACTAA
+GGCAAAGGTACTGCAAGTGCTCGCAACATTCGCTTATGCGGATTATTGCCGTAGTGCCGC
+GACGCCGGGGGCAAGATGCAGAGATTGCCATGGTACAGGCCGTGCGGTTGATATTGCCAA
+AACAGAGCTGTGGGGGAGAGTTGTCGAGAAAGAGTGCGGAAGATGCAAAGGCGTCGGCTA
+TTCAAGGATGCCAGCAAGCGCAGCATATCGCGCCGTGACGATGCTAATCCCAAACCTTAC
+CCAACCCACCTGGTCACGCACTGTTAAGCCGCTGTATGACGCTCTGGTGGTGCAATGCCA
+CAAAGAAGAGTCAATCGCAGACAACATTTTGAATGCGGTCACACGTTAGCAGCATGATTG
+CCACGGATGGCAACATATTAACGGCATGATATTGACTTATTGAATAAAATTGGGTAAATT
+TGACTCAACGATGGGTTAATTCGCTCGTTGTGGTAGTGAGATGAAAAGAGGCGGCGCTTA
+CTACCGATTCCGCCTAGTTGGTCACTTCGACGTATCGTCTGGAACTCCAACCATCGCAGG
+CAGAGAGGTCTGCAAAATGCAATCCCGAAACAGTTCGCAGGTAATAGTTAGAGCCTGCAT
+AACGGTTTCGGGATTTTTTATATCTGCACAACAGGTAAGAGCATTGAGTCGATAATCGTG
+AAGAGTCGGCGAGCCTGGTTAGCCAGTGCTCTTTCCGTTGTGCTGAATTAAGCGAATACC
+GGAAGCAGAACCGGATCACCAAATGCGTACAGGCGTCATCGCCGCCCAGCAACAGCACAA
+CCCAAACTGAGCCGTAGCCACTGTCTGTCCTGAATTCATTAGTAATAGTTACGCTGCGGC
+CTTTTACACATGACCTTCGTGAAAGCGGGTGGCAGGAGGTCGCGCTAACAACCTCCTGCC
+GTTTTGCCCGTGCATATCGGTCACGAACAAATCTGATTACTAAACACAGTAGCCTGGATT
+TGTTCTATCAGTAATCGACCTTATTCCTAATTAAATAGAGCAAATCCCCTTATTGGGGGT
+AAGACATGAAGATGCCAGAAAAACATGACCTGTTGGCCGCCATTCTCGCGGCAAAGGAAC
+AAGGCATCGGGGCAATCCTTGCGTTTGCAATGGCGTACCTTCGCGGCAGATATAATGGCG
+GTGCGTTTACAAAAACAGTAATCGACGCAACGATGTGCGCCATTATCGCCTAGTTCATTC
+GTGACCTTCTCGACTTCGCCGGACTAAGTAGCAATCTCGCTTATATAACGAGCGTGTTTA
+TCGGCTACATCGGTACTGACTCGATTGGTTCGCTTATCAAACGCTTCGCTGCTAAAAAAG
+CCGGAGTAGAAGATGGTAGAAATCAATAATCAACGTAAGGCGTTCCTCGATATGCTGGCG
+TGGTCGGAGGGAACTGATAACGGACGTCAGAAAACCAGAAATCATGGTTATGACGTCATT
+GTAGGCGGAGAGCTATTTACTGATTACTCCGATCACCCTCGCAAACTTGTCACGCTAAAC
+CCAAAACTCAAATCAACAGGCGCCGGACGCTACCAGCTTCTTTCCCGTTGGTGGGATGCC
+TACCGCAAGCAGCTTGGCCTGAAAGACTTCTCTCCGAAAAGTCAGGACGCTGTGGCATTG
+CAGCAGATTAAGGAGCGTGGCGCTTTACCTATGATTGATCGTGGTGATATCCGTCAGGCA
+ATCGACCGTTGCAGCAATATCTGGGCTTCACTGCCGGGCGCTGGTTATGGTCAGTTCGAG
+CATAAGGCTGACAGCCTGATTGCAAAATTCAAAGAAGCGGGCGGAACGGTCAGAGAGATT
+GATGTATGAGCAGAGTCACCGCGATTATCTCCGCTCTGGTTATCTGCATCATCGTCTGCC
+TGTCATGGGCTGTTAATCATTACCGTGATAACGCCATTACCTACAAAGCCCAGCGCGACA
+AAAATGCCAGAGAACTGAAGCTGGCGAACGCGGCAATTACTGACATGCAGATGCGTCAGC
+GTGATGTTGCTGCGCTCGATGCAAAATACACGAAGGAGTTAGCTGATGCTAAAGCTGAAA
+ATGATGCTCTGCGTGATGATGTTGCCGCTGGTCGTCGTCGGTTGCACATCAAAGCAGTCT
+GTCAGTCAGTGCGTGAAGCCACCACCGCCTCCGGCGTGGATAATGCAGCCTCCCCCCGAC
+TGGCAGACACCGCTGAACGGGATTATTTCACCCTCAGAGAGAGGCTGATCACTATGCAAA
+AACAACTGGAAGGAACCCAGAAGTATATTAATGAGCAGTGCAGATAGAGTTGCCCATATC
+GATGGGCAACTCATGCAATTATTGTGAGCAATACACACGCGCTTCCAGCGGAGTATAAAT
+GCCTAAAGTAATAAAACCGAGCAATCCATTTACGAATGTTTGCTGGGTTTCTGTTTTAAC
+AACATTTTCTGCGCCGCCACAAATTTTGGCTGCATCGACAGTTTTCTTCTGCCCAATTCC
+AGAAACGAAGAAATGATGGGTGATGGTTTCCTTTGGTGCTACTGCTGCCGGTTTGTTTTG
+AACAGTAAACGTCTGTTGAGCACATCCTGTAATAAGCAGGGCCAGCGCAGTAGCGAGTAG
+CATTTTTTTCATGGTGTTATTCCCGATGCTTTTTGAAGTTCGCAGAATCGTATGTGTAGA
+AAATTAAACAAACCCTAAACAATGAGTTGAAATTTCATATTGTTAATATTTATTAATGTA
+TGTCAGGTGCGATGAATCGTCATTGTATTCCCGGATTAACTATGTCCACAGCCCTGACGG
+GGAACTTCTCTGCGGGAGTGTCCGGGAATAATTAAAACGATGCACACAGGGTTTAGCGCG
+TACACGTATTGCATTATGCCAACGCCCCGGTGCTGACACGGAAGAAACCGGACGTTATGA
+TTTAGCGTGGAAAGATTTGTGTAGTGTTCTGAATGCTCTCAGTAAATAGTAATGAATTAT
+CAAAGGTATAGTAATATCTTTTATGTTCATGGATATTTGTAACCCATCGGAAAACTCCTG
+CTTTAGCAAGATTTTCCCTGTATTGCTGAAATGTGATTTCTCTTGATTTCAACCTATCAT
+AGGACGTTTCTATAAGATGCGTGTTTCTTGAGAATTTAACATTTACAACCTTTTTAAGTC
+CTTTTATTAACACGGTGTTATCGTTTTCTAACACGATGTGAATATTATCTGTGGCTAGAT
+AGTAAATATAATGTGAGACGTTGTGACGTTTTAGTTCAGAATAAAACAATTCACAGTCTA
+AATCTTTTCGCACTTGATCGAATATTTCTTTAAAAATGGCAACCTGAGCCATTGGTAAAA
+CCTTCCATGTGATACGAGGGCGCGTAGTTTGCATTATCGTTTTTATCGTTTCAATCTGGT
+CTGACCTCCTTGTGTTTTGTTGATGATTTATGTCAAATATTAGGAATGTTTTCACTTAAT
+AGTATTGGTTGCGTAACAAAGTGCGGTCCTGCTGGCATTCTGGAGGGAAATACAACCGAC
+AGATGTATGTAAGGCCAACGTGCTCAAATCTTCATACAGAAAGATTTGAAGTAATATTTT
+AACCGCTAGATGAAGAGCAAGCGCATGGAGCGACAAAATGAATAAAGAACAATCTGCTGA
+TGATCCCTCCGTGGATCTGATTCGTGTAAAAAATATGCTTAATAGCACCATTTCTATGAG
+TTACCCTGATGTTGTAATTGCATGTATAGAACATAAGGTGTCTCTGGAAGCATTCAGAGC
+AATTGAGGCAGCGTTGGTGAAGCACGATAATAATATGAAGGATTATTCCCTGGTGGTTGA
+CTGATCACCATAACTGCTAATCATTCAAACTATTTAGTCTGTGACAGAGCCAACACGCAG
+TCTGTCACTGTCAGGAAAGTGGTAAAACTGCAACTCAATTACTGCAATGCCCTCGTAATT
+AAGTGAATTTACAATATCGTCCTGTTCGGAGGGAAGAACGCGGGATGTTCATTCTTCATC
+ACTTTTAATTGATGTATATGCTCTCTTTTCTGACGTTAGTCTCCGACGGCAGGCTTCAAT
+GACCCAGGCTGAGAAATTCCCGGACCCTTTTTGCTCAAGAGCGATGTTAATTTGTTCAAT
+CATTTGGTTAGGAAAGCGGATGTTGCGGGTTGTTGTTCTGCGGGTTCTGTTCTTCGTTGA
+CATGAGGTTGCCCCGTATTCAGTGTCGCTGATTTGTATTGTCTGAAGTTGTTTTTACGTT
+AAGTTGATGCAGATCAATTAATACGATACCTGCGTCATAATTGATTATTTGACGTGGTTT
+GATGGCCTCCACGCACGTTGTGATATGTAGATGATAATCATTATCACTTTACGGGTCCTT
+TCCGGTGATCCGACAGGTTACG
diff --git a/userdata/lorist6.vep b/userdata/lorist6.vep
new file mode 100644
index 0000000..96aa490
--- /dev/null
+++ b/userdata/lorist6.vep
@@ -0,0 +1,86 @@
+ GATCCTAGTATTCTATAGTGTCACCTAAATCGTATGTGTA
+ TGAT-CAT--AGGTTATGTATTAATTGTAGCCGCGTTCTAACGACAATATGTAC*AAGCC
+ TAATTGTGTAGCATCTGGCTTACT*GAAGCAGACCCT*ATCATCTCTCTCG*TAAACTGC
+ C-G-CAGAGTCGGTTTGGTTGGACGAACCTTCTG*AGTTTCTGGTAACGCCGTCCCGCAC
+ CCGGAAATGGTCAGCGAACCAATCAGCAGGGTCATCGCTAGAAATCA*-CCTTAGCGAAA
+ -CTAAGGA*-T-TTT-TTTATCT*GAATTCTAGCCAGATCCCCGCTGA*TTTATGCTGGT
+ *--------TGCGCCTGTTAGCG-GGCAACGTCCGGCGCACAGA-GCTATTATGCGTCCC
+ CA*GGTA*ATGAA*TAATTGCCTCTTTGCCC*GTCATACAC*TTGCTCCTTTCAGTCCGA
+ ACTTAGCTTTGATTTCTGCGATCTTCGCCAGAGCCTGTGCACGATTTAGAGGTCTACCGC
+ CCATGACAGGAAGTTGTTTTACTGGTTCAGGGATCGCCTCACCACGGTTAATTCTCGCAG
+ TCATATGGACAAGCTCATCTGC-G-C-TTACGGCGTAATTCCGCATCAGTAAGCGCATTG
+ GCCCGCATGTTCTGATACAGGTTGGTAACCAGCCAGTAGTGCGCGTTTGATTTCCACGGA
+ TAAGACTCCGCATCC-GGATACAGGCCTCGCTTCCGGCAATACTCGTAAACCATATCAAC
+ CAGCTCGCTGACGTTTGGCAGTCCGGCGGTAACGGATGCTTCTTCCCGGCACCATGCAAC
+ AAACTGCCCGGGTGATGGCAGAAATGGTCGATTCTGCCGACGGG--ACG-GCATTCCTGC
+ GTTAACCTG-TTCCATCGTGGTGATCCCGTTTT*CCCGAAAAGCCAGAACCCACTGGCGA
+ CGGATTTCGTTCA-TTCGTTCTGGTCACGGTTAGCCAGGCTCGCCG-GAAAGTTGCCAGT
+ AACTGGCTGAACACACCGTTGATGATCTGCGCTACCTGCTGTAC-TGCGGCTTTTCGTCG
+ TACTGTTCCG*-CATGTTGTTGGCGAT-CCGACGCATCTGCTCACGGTCAAAGTTAACCA
+ TCTGTGCGGCGATGTTTTTCATAGATCCACCCCGTAAATCCAGTCTGTGTTTGTCAGGTC
+ GAGTTTTGGTTTGCTGGCTGTCACGCCTGCCTGTTGCTTGTTACGGTTG-TTTCGAGTTG
+ GGTCCACTTATCGCGGAGTTTGGCCGGGCTCAGCACGTTACCGGACCAGAAGTTGTCCTG
+ GCATGCCCAGCGGAACAGCACACACATGTCGCG-TGGTTACGTCCGTCACGTTC*ACGCA
+ TCAGGCG-ATATCGTTAGCCCACCCAGCAAAATTCG----TTCTGGCTGATGGTGCGATA
+ GTCTTCACCATGTCAAACATCCACTCTGCGGCGGTCAGGTCTTCTGCTGT--CCCAC**T
+ T-CTGC*CGCTC*TGAA*TTGCAGCATCC*GGTTTCACCACAGAAAGGTCGTTTTCTGGC
+ TGGTCAGAGGATTCGCCAGAATTCTCTGACGAATAATCTTTTCTTTTTTCTTTTGTAATA
+ GTGTCTTTTGTGTCCCCCTGTTTTGAGGGATAGCAATCCCCCAATTTGAGGGATGTTTTA
+ TCCCTCGTTTTAGGGGATTTTCCCTCGTTTTGAGGGATGCACCATTCTGAGATGTTTTTA
+ TTTGGTCCAAACATGCCGCCTTGCTGCTTGATAATATTCATTCTGACGAGTTCTAACTTG
+ GCTTCATTGCACCGTTTGACAGGTAACTTTG-TAATCTCGCTAAGTTGAGAATCGGTGAT
+ TCTGTCCATTGGTTTATTCCACCCATAGGTTTTACGCAGAATGGCAAGCAGCAC*TTTAA
+ ACT-GTCGCTTGGTCAGATCTGCGCCCGAATAAGCC-CAAGCAG-ATATT-*GATAG-CT
+ GGCG-AACCATCATCGAGATCTGCCACATTACGCTCCTGTCCGGCAAAGTTACCTCTGCC
+ GAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTTGATAGATCCAGTAATGA
+ CCTCAGAACTCCATCTGGATTTGTTCAGAACGCTCGGTTGCCGCCGGGCGTTTTTTATTG
+ GTGAGAATCGCAGCAACTTGTCGCGCCAATCGAGCCATGTCGTCGTCAACGACCCCCCAT
+ TCAAGAACAGCAAGCAGCATTGAGAACTTTGGAATCCAGTCCCTCTTCCACCTGCTGATC
+ TGCGACTTATCAACGCCCACAGCTTCCGCTGTCTTCTCAGTTCCAAGCATTGCGATTTTG
+ TTAAGCAACGCACTCTCGATTCGTAGAGCCTCGTTGCGTTTGTTTGCACGAACCATATGT
+ AAGTATTTCCTTAGATAACAATTGATTGAATGTATGCAAATAAATGCATACACCATAGGT
+ GTGGTTTAATTTGATGCCCTTTTTCAGGGCTGGAATGTGTAAGAGCGGGGTTATTTATGC
+ TGTTGTTTTTTTGTTACTCGGGAAGGGCTTTACCTCTTCCGCATAAACGCTTCCATCAGC
+ GTTTATAGTTAAAAAAATCTTTCGGCCTGCATGAATGGCCTTGTTGATCGCGCTTTGATA
+ TACGCCGAGATCTTTAGCTGTCTTGGTTTGCCCAAAGCGCATTGCATAATCTTTCAGGGT
+ TATGCGTTGTTCCATACAACCTCCTTAGTACATGCAACCATTATCACCGCCAGAGGTAAA
+ ATAGTCAACACGCACGGTGTTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAG
+ CACAAAAAAGAAACCATTAACACAAGAGCAGCTTGAGGACGCACGTCGCCTTAAAGCAAT
+ TTATGAAAAAAAGAAAAATGAACTTGGCTTATCCCAGGAATCTGTCGCAGACAAGATGGG
+ GATGGGGCAGTCAGGCGTTGGTGCTTTATTTAATGGCATCAATGCATTAAATGCTTATAA
+ CGCCGCATTGCTTACAAAAATTCTCAAAGTTAGCGTTGAAGAATTTAGCCCTTCAATCGC
+ CAGAGAAATCTACGAGATGTATGAAGCGGTTAGTATGCAGCCGTCACTTAGAAGTGAGTA
+ TGAGTACCCTGTTTTTTCTCATGTTCAGGCAGGGATGTTCTCACCTAAGCTAGCGAATTC
+ TCATGTTTGACAGCTTATCATCGATATGCTTTAATGCGGTAGTGATCAAGAGACAGGATG
+ AGGATCGTTTCGCATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGT
+ GGAGAGGCTATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATGCCGCCGT
+ GTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCTTTTTGTCAAGACCGACCTGTCCGGTGC
+ CCTGAATGAACTGCAGGACGAGGCAGCGCGGCTATCGTGGCTGGCCACGACGGGCGTTCC
+ TTGCGCAGCTGTGCTCGACGTTGTCACTGAAGCGGGAAGGGACTGGCTGCTATTGGGCGA
+ AGTG-CGGGGCAGGATCTCCTGTCATCTCACCTTGCTCCTGC-GAGAAAGTATCCATCAT
+ GGCTGATGCAATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCGACCACCA
+ AGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTTGTCGATCAGGA
+ TGATCTGGACGAAGAGCATCAGGGGCTCGCGCCAGCCGAACTGTTCGCCAGGCTCAAGGC
+ GCGCATGCCCGACGGCGAGGATCTCGTCGTGACCCATGGCGATGCCTGCTTGCCGAATAT
+ CATGGTGGAAAATGGCCGCTTTTCTGGATTCATCGACTGTGGCCGGCTGGGTGTGGCGGA
+ CCGCTATCAGGACATAGCGTTGGCTACCCGTGATATTGCTGAAGAGCTTGGCGGCGAATG
+ GGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGCGCATCGCCTT
+ CTATCGCCTTCTTGACGAGTTCTTCTGAGCGGGACTCTGGGGTTCGATGTCGACAGCCCG
+ CCTAATGAGCGGGCTTTTTTTTCCTGAGGCTGGACGACCTCGCGGAGTTCTACCGGCAGT
+ GCAAATCCGTCGGCATCCAGGAAACCAGCAGCGGCTATCCGCGCATCCATGCCCCCGAAC
+ TGCAGGAGTGGGGAGGCACGATGGCCGCTTTGGTCCCGGATCAATTCGCGCGACCGGATC
+ GATCCGGGCAACGTTGTTGCCATTGCTGCAGGGGGGGGGGGGGGGGGTTCCATTGTTCAT
+ TCCACGGACAAAAACAGAGAAAGGAAACGACAGAGGCCAAAAAGCTCGCTTTCAGCACCT
+ GTCGTTTCCTTTCTTTTCAGAGGGTATTTTAAATAAAAACATTAAGTTATGACGAAGAAG
+ AACGGAAACGCCTTAAACCGGAAAATTTTCATAAATAGCGAAAACCCGCGAGGTCGCCGC
+ CCCGTAACCTGTCGGATCACCGGAAAGGACCCGTAAAGTGATAATGATTATCATCTACAT
+ ATCACAACGTGCGTGGAGGCCATCAAACCACGTCAAATAATCAATTATGACGCAGGTATC
+ GTATTAATTGATCTGCATCAACTTAACGTAAAAACAACTTCAGACAATACAAATCAGCGA
+ CACTGAATACGGGGCAACCTCATGTCCCCCCCCCCCCCC-CTGCAGGCGCAGAACTGGTA
+ GGTATGGAAGATCTCTAGAATTCAAAAAAAG-CCGCTCATTA-GCG-G-TA-TTCGAGCT
+ CGCCCGGGGATCGATCCTCTAGAGTC-TGATGCGGTATTTTCTCCTTACGCATCTGTG-G
+ GTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCGCTACGTGACT
+ GGGT*CATG---GCGCCC-GACACCCGCCAAC*-ACCCG-TGACG----CTGACGGGCTT
+ GTCTGCTCCCGG-ATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCT*GCATGTGT
+ --GAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCCCAGCTGGCTTATCGAAATTAAT
+ ACGACTCACTATAGGGAGACCCAAGCTTAGGATCGCGGCCGCAGTACTGGATCA
diff --git a/userdata/m13mp18.vec b/userdata/m13mp18.vec
new file mode 100644
index 0000000..25341c9
--- /dev/null
+++ b/userdata/m13mp18.vec
@@ -0,0 +1,123 @@
+<---m13mp.001------>
+;M13MP18 cloning vector from vecbase
+ AATGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT
+ ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT
+ CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA
+ GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA
+ TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG
+ TTGGAGTTTGCTTCCGGTCTGGTTCGCTTTGAAGCTCGAATTAAAACGCGATATTTGAAG
+ TCTTTCGGGCTTCCTCTTAATCTTTTTGATGCAATCCGCTTTGCTTCTGACTATAATAGT
+ CAGGGTAAAGACCTGATTTTTGATTTATGGTCATTCTCGTTTTCTGAACTGTTTAAAGCA
+ TTTGAGGGGGATTCAATGAATATTTATGACGATTCCGCAGTATTGGACGCTATCCAGTCT
+ AAACATTTTACTATTACCCCCTCTGGCAAAACTTCTTTTGCAAAAGCCTCTCGCTATTTT
+ GGTTTTTATCGTCGTCTGGTAAACGAGGGTTATGATAGTGTTGCTCTTACTATGCCTCGT
+ AATTCCTTTTGGCGTTATGTATCTGCATTAGTTGAATGTGGTATTCCTAAATCTCAACTG
+ ATGAATCTTTCTACCTGTAATAATGTTGTTCCGTTAGTTCGTTTTATTAACGTAGATTTT
+ TCTTCCCAACGTCCTGACTGGTATAATGAGCCAGTTCTTAAAATCGCATAAGGTAATTCA
+ CAATGATTAAAGTTGAAATTAAACCATCTCAAGCCCAATTTACTACTCGTTCTGGTGTTT
+ CTCGTCAGGGCAAGCCTTATTCACTGAATGAGCAGCTTTGTTACGTTGATTTGGGTAATG
+ AATATCCGGTTCTTGTCAAGATTACTCTTGATGAAGGTCAGCCAGCCTATGCGCCTGGTC
+ TGTACACCGTTCATCTGTCCTCTTTCAAAGTTGGTCAGTTCGGTTCCCTTATGATTGACC
+ GTCTGCGCCTCGTTCCGGCTAAGTAACATGGAGCAGGTCGCGGATTTCGACACAATTTAT
+ CAGGCGATGATACAAATCTCCGTTGTACTTTGTTTCGCGCTTGGTATAATCGCTGGGGGT
+ CAAAGATGAGTGTTTTAGTGTATTCTTTCGCCTCTTTCGTTTTAGGTTGGTGCCTTCGTA
+ GTGGCATTACGTATTTTACCCGTTTAATGGAAACTTCCTCATGAAAAAGTCTTTAGTCCT
+ CAAAGCCTCTGTAGCCGTTGCTACCCTCGTTCCGATGCTGTCTTTCGCTGCTGAGGGTGA
+ CGATCCCGCAAAAGCGGCCTTTAACTCCCTGCAAGCCTCAGCGACCGAATATATCGGTTA
+ TGCGTGGGCGATGGTTGTTGTCATTGTCGGCGCAACTATCGGTATCAAGCTGTTTAAGAA
+ ATTCACCTCGAAAGCAAGCTGATAAACCGATACAATTAAAGGCTCCTTTTGGAGCCTTTT
+ TTTTTGGAGATTTTCAACGTGAAAAAATTATTATTCGCAATTCCTTTAGTTGTTCCTTTC
+ TATTCTCACTCCGCTGAAACTGTTGAAAGTTGTTTAGCAAAACCCCATACAGAAAATTCA
+ TTTACTAACGTCTGGAAAGACGACAAAACTTTAGATCGTTACGCTAACTATGAGGGTTGT
+ CTGTGGAATGCTACAGGCGTTGTAGTTTGTACTGGTGACGAAACTCAGTGTTACGGTACA
+ TGGGTTCCTATTGGGCTTGCTATCCCTGAAAATGAGGGTGGTGGCTCTGAGGGTGGCGGT
+ TCTGAGGGTGGCGGTTCTGAGGGTGGCGGTACTAAACCTCCTGAGTACGGTGATACACCT
+ ATTCCGGGCTATACTTATATCAACCCTCTCGACGGCACTTATCCGCCTGGTACTGAGCAA
+ AACCCCGCTAATCCTAATCCTTCTCTTGAGGAGTCTCAGCCTCTTAATACTTTCATGTTT
+ CAGAATAATAGGTTCCGAAATAGGCAGGGGGCATTAACTGTTTATACGGGCACTGTTACT
+ CAAGGCACTGACCCCGTTAAAACTTATTACCAGTACACTCCTGTATCATCAAAAGCCATG
+ TATGACGCTTACTGGAACGGTAAATTCAGAGACTGCGCTTTCCATTCTGGCTTTAATGAA
+ GATCCATTCGTTTGTGAATATCAAGGCCAATCGTCTGACCTGCCTCAACCTCCTGTCAAT
+ GCTGGCGGCGGCTCTGGTGGTGGTTCTGGTGGCGGCTCTGAGGGTGGTGGCTCTGAGGGT
+ GGCGGTTCTGAGGGTGGCGGCTCTGAGGGAGGCGGTTCCGGTGGTGGCTCTGGTTCCGGT
+ GATTTTGATTATGAAAAGATGGCAAACGCTAATAAGGGGGCTATGACCGAAAATGCCGAT
+ GAAAACGCGCTACAGTCTGACGCTAAAGGCAAACTTGATTCTGTCGCTACTGATTACGGT
+ GCTGCTATCGATGGTTTCATTGGTGACGTTTCCGGCCTTGCTAATGGTAATGGTGCTACT
+ GGTGATTTTGCTGGCTCTAATTCCCAAATGGCTCAAGTCGGTGACGGTGATAATTCACCT
+ TTAATGAATAATTTCCGTCAATATTTACCTTCCCTCCCTCAATCGGTTGAATGTCGCCCT
+ TTTGTCTTTAGCGCTGGTAAACCATATGAATTTTCTATTGATTGTGACAAAATAAACTTA
+ TTCCGTGGTGTCTTTGCGTTTCTTTTATATGTTGCCACCTTTATGTATGTATTTTCTACG
+ TTTGCTAACATACTGCGTAATAAGGAGTCTTAATCATGCCAGTTCTTTTGGGTATTCCGT
+ TATTATTGCGTTTCCTCGGTTTCCTTCTGGTAACTTTGTTCGGCTATCTGCTTACTTTTC
+ TTAAAAAGGGCTTCGGTAAGATAGCTATTGCTATTTCATTGTTTCTTGCTCTTATTATTG
+ GGCTTAACTCAATTCTTGTGGGTTATCTCTCTGATATTAGCGCTCAATTACCCTCTGACT
+ TTGTTCAGGGTGTTCAGTTAATTCTCCCGTCTAATGCGCTTCCCTGTTTTTATGTTATTC
+ TCTCTGTAAAGGCTGCTATTTTCATTTTTGACGTTAAACAAAAAATCGTTTCTTATTTGG
+ ATTGGGATAAATAATATGGCTGTTTATTTTGTAACTGGCAAATTAGGCTCTGGAAAGACG
+ CTCGTTAGCGTTGGTAAGATTCAGGATAAAATTGTAGCTGGGTGCAAAATAGCAACTAAT
+ CTTGATTTAAGGCTTCAAAACCTCCCGCAAGTCGGGAGGTTCGCTAAAACGCCTCGCGTT
+ CTTAGAATACCGGATAAGCCTTCTATATCTGATTTGCTTGCTATTGGGCGCGGTAATGAT
+ TCCTACGATGAAAATAAAAACGGCTTGCTTGTTCTCGATGAGTGCGGTACTTGGTTTAAT
+ ACCCGTTCTTGGAATGATAAGGAAAGACAGCCGATTATTGATTGGTTTCTACATGCTCGT
+ AAATTAGGATGGGATATTATTTTTCTTGTTCAGGACTTATCTATTGTTGATAAACAGGCG
+ CGTTCTGCATTAGCTGAACATGTTGTTTATTGTCGTCGTCTGGACAGAATTACTTTACCT
+ TTTGTCGGTACTTTATATTCTCTTATTACTGGCTCGAAAATGCCTCTGCCTAAATTACAT
+ GTTGGCGTTGTTAAATATGGCGATTCTCAATTAAGCCCTACTGTTGAGCGTTGGCTTTAT
+ ACTGGTAAGAATTTGTATAACGCATATGATACTAAACAGGCTTTTTCTAGTAATTATGAT
+ TCCGGTGTTTATTCTTATTTAACGCCTTATTTATCACACGGTCGGTATTTCAAACCATTA
+ AATTTAGGTCAGAAGATGAAATTAACTAAAATATATTTGAAAAAGTTTTCTCGCGTTCTT
+ TGTCTTGCGATTGGATTTGCATCAGCATTTACATATAGTTATATAACCCAACCTAAGCCG
+ GAGGTTAAAAAGGTAGTCTCTCAGACCTATGATTTTGATAAATTCACTATTGACTCTTCT
+ CAGCGTCTTAATCTAAGCTATCGCTATGTTTTCAAGGATTCTAAGGGAAAATTAATTAAT
+ AGCGACGATTTACAGAAGCAAGGTTATTCACTCACATATATTGATTTATGTACTGTTTCC
+ ATTAAAAAAGGTAATTCAAATGAAATTGTTAAATGTAATTAATTTTGTTTTCTTGATGTT
+ TGTTTCATCATCTTCTTTTGCTCAGGTAATTGAAATGAATAATTCGCCTCTGCGCGATTT
+ TGTAACTTGGTATTCAAAGCAATCAGGCGAATCCGTTATTGTTTCTCCCGATGTAAAAGG
+ TACTGTTACTGTATATTCATCTGACGTTAAACCTGAAAATCTACGCAATTTCTTTATTTC
+ TGTTTTACGTGCTAATAATTTTGATATGGTTGGTTCAATTCCTTCCATAATTCAGAAGTA
+ TAATCCAAACAATCAGGATTATATTGATGAATTGCCATCATCTGATAATCAGGAATATGA
+ TGATAATTCCGCTCCTTCTGGTGGTTTCTTTGTTCCGCAAAATGATAATGTTACTCAAAC
+ TTTTAAAATTAATAACGTTCGGGCAAAGGATTTAATACGAGTTGTCGAATTGTTTGTAAA
+ GTCTAATACTTCTAAATCCTCAAATGTATTATCTATTGACGGCTCTAATCTATTAGTTGT
+ TAGTGCACCTAAAGATATTTTAGATAACCTTCCTCAATTCCTTTCTACTGTTGATTTGCC
+ AACTGACCAGATATTGATTGAGGGTTTGATATTTGAGGTTCAGCAAGGTGATGCTTTAGA
+ TTTTTCATTTGCTGCTGGCTCTCAGCGTGGCACTGTTGCAGGCGGTGTTAATACTGACCG
+ CCTCACCTCTGTTTTATCTTCTGCTGGTGGTTCGTTCGGTATTTTTAATGGCGATGTTTT
+ AGGGCTATCAGTTCGCGCATTAAAGACTAATAGCCATTCAAAAATATTGTCTGTGCCACG
+ TATTCTTACGCTTTCAGGTCAGAAGGGTTCTATCTCTGTTGGCCAGAATGTCCCTTTTAT
+ TACTGGTCGTGTGACTGGTGAATCTGCCAATGTAAATAATCCATTTCAGACGATTGAGCG
+ TCAAAATGTAGGTATTTCCATGAGCGTTTTTCCTGTTGCAATGGCTGGCGGTAATATTGT
+ TCTGGATATTACCAGCAAGGCCGATAGTTTGAGTTCTTCTACTCAGGCAAGTGATGTTAT
+ TACTAATCAAAGAAGTATTGCTACAACGGTTAATTTGCGTGATGGACAGACTCTTTTACT
+ CGGTGGCCTCACTGATTATAAAAACACTTCTCAAGATTCTGGCGTACCGTTCCTGTCTAA
+ AATCCCTTTAATCGGCCTCCTGTTTAGCTCCCGCTCTGATTCCAACGAGGAAAGCACGTT
+ ATACGTGCTCGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGG
+ GTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTT
+ TCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATC
+ GGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTG
+ ATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGA
+ CGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACC
+ CTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGAACCACCATCAA
+ ACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGG
+ CCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCGCTGGTGAAAAGAAAAACCACCCT
+ GGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGC
+ ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGC
+ TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA
+ TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCGAGC
+ TCGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTGGCACTGGCCGTC
+ GTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCA
+ CATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAA
+ CAGTTGCGCAGCCTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTG
+ CCGGAAAGCTGGCTGGAGTGCGATCTTCCTGAGGCCGATACGGTCGTCGTCCCCTCAAAC
+ TGGCAGATGCACGGTTACGATGCGCCCATCTACACCAACGTAACCTATCCCATTACGGTC
+ AATCCGCCGTTTGTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTT
+ GATGAAAGCTGGCTACAGGAAGGCCAGACGCGAATTATTTTTGATGGCGTTCCTATTGGT
+ TAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTA
+ CAATTTAAATATTTGCTTATACAATCTTCCTGTTTTTGGGGCTTTTCTGATTATCAACCG
+ GGGTACATATGATTGACATGCTAGTTTTACGATTACCGTTCATCGATTCTCTTGTTTGCT
+ CCAGACTCTCAGGCAATGACCTGATAGCCTTTGTAGATCTCTCAAAAATAGCTACCCTCT
+ CCGGCATTAATTTATCAGCTAGAACGGTTGAATATCATATTGATGGTGATTTGACTGTCT
+ CCGGCCTTTCTCACCCTTTTGAATCTTTACCTACACATTACTCAGGCATTGCATTTAAAA
+ TATATGAGGGTTCTAAAAATTTTTATCCTTGCGTTGAAATAAAGGCTTCTCCCGCAAAAG
+ TATTACAGGGTCATAATGTTTTTGGTACAACCGATTTAGCTTTATGCTCTGAGGCTTTAT
+ TGCTTAATTTTGCTAATTCTTTGCCTTGCCTGTATGATTTATTGGATGTT
diff --git a/userdata/m13mp7.seq b/userdata/m13mp7.seq
new file mode 100644
index 0000000..75b10d8
--- /dev/null
+++ b/userdata/m13mp7.seq
@@ -0,0 +1,126 @@
+<---M13MP7.001----->
+AACGCTACTACTATTAGTAGAATTGATGCCACCTTTTCAGCTCGCGCCCCAAATGAAAAT                    
+ATAGCTAAACAGGTTATTGACCATTTGCGAAATGTATCTAATGGTCAAACTAAATCTACT                    
+; by starting a record with ; comments can be inserted anywhere
+; but the ; must appear in column 1.
+; comment lines will be displayed when the file is read
+CGTTCGCAGAATTGGGAATCAACTGTTACATGGAATGAAACTTCCAGACACCGTACTTTA                    
+GTTGCATATTTAAAACATGTTGAGCTACAGCACCAGATTCAGCAATTAAGCTCTAAGCCA                    
+;  here is another comment
+TCCGCAAAAATGACCTCTTATCAAAAGGAGCAATTAAAGGTACTCTCTAATCCTGACCTG                    
+TTGGAGTTTGCTTCCGGTCTGGTTCGCTTTGAAGCTCGAATTAAAACGCGATATTTGAAG                    
+TCTTTCGGGCTTCCTCTTAATCTTTTTGATGCAATCCGCTTTGCTTCTGACTATAATAGT                    
+CAGGGTAAAGACCTGATTTTTGATTTATGGTCATTCTCGTTTTCTGAACTGTTTAAAGCA                    
+TTTGAGGGGGATTCAATGAATATTTATGACGATTCCGCAGTATTGGACGCTATCCAGTCT                    
+AAACATTTTACTATTACCCCCTCTGGCAAAACTTCTTTTGCAAAAGCCTCTCGCTATTTT                    
+GGTTTTTATCGTCGTCTGGTAAACGAGGGTTATGATAGTGTTGCTCTTACTATGCCTCGT                    
+AATTCCTTTTGGCGTTATGTATCTGCATTAGTTGAATGTGGTATTCCTAAATCTCAACTG                    
+ATGAATCTTTCTACCTGTAATAATGTTGTTCCGTTAGTTCGTTTTATTAACGTAGATTTT                    
+TCTTCCCAACGTCCTGACTGGTATAATGAGCCAGTTCTTAAAATCGCATAAGGTAATTCA                    
+CAATGATTAAAGTTGAAATTAAACCATCTCAAGCCCAATTTACTACTCGTTCTGGTGGTT                    
+CTCGTCAGGGCAAGCCTTATTCACTGAATGAGCAGCTTTGTTACGTTGATTTGGGTAATG                    
+AATATCCGGTTCTTGTCAAGATTACTCTTGATGAAGGTCAGCCAGCCTATGCGCCTGGTC                    
+TGTACACCGTTCATCTGTCCTCTTTCAAAGTTGGTCAGTTCGGTTCCCTTATGATTGACC                    
+GTCTGCGCCTCGTTCCGGCTAAGTAACATGGAGCAGGTCGCGGATTTCGACACAATTTAT                    
+CAGGCGATGATACAAATCTCCGTTGTACTTTGTTTCGCGCTTGGTATAATCGCTGGGGGT                    
+CAAAGATGAGTGTTTTAGTGTATTCTTTCGCCTCTTTCGTTTTAGGTTGGTGCCTTCGTA                    
+GTGGCATTACGTATTTTACCCGTTTAATGGAAACTTCCTCATGAAAAAGTCTTTAGTCCT                    
+CAAAGCCTCTGTAGCCGTTGCTACCCTCGTTCCGATGCTGTCTTTCGCTGCTGAGGGTGA                    
+CGATCCCGCAAAAGCGGCCTTTAACTCCCTGCAAGCCTCAGCGACCGAATATATCGGTTA                    
+TGCGTGGGCGATGGTTGTTGTCATTGTCGGCGCAACTATCGGTATCAAGCTGTTTAAGAA                    
+ATTCACCTCGAAAGCAAGCTGATAAACCGATACAATTAAAGGCTCCTTTTGGAGCCTTTT                    
+TTTTTGGAGATTTTCAACGTGAAAAAATTATTATTCGCAATTCCTTTAGTTGTTCCTTTC                    
+TATTCTCACTCCGCTGAAACTGTTGAAAGTTGTTTAGCAAAACCCCATACAGAAAATTCA                    
+TTTACTAACGTCTGGAAAGACGACAAAACTTTAGATCGTTACGCTAACTATGAGGGTTGT                    
+CTGTGGAATGCTACAGGCGTTGTAGTTTGTACTGGTGACGAAACTCAGTGTTACGGTACA                    
+TGGGTTCCTATTGGGCTTGCTATCCCTGAAAATGAGGGTGGTGGCTCTGAGGGTGGCGGT                    
+TCTGAGGGTGGCGGTTCTGAGGGTGGCGGTACTAAACCTCCTGAGTACGGTGATACACCT                    
+ATTCCGGGCTATACTTATATCAACCCTCTCGACGGCACTTATCCGCCTGGTACTGAGCAA                    
+AACCCCGCTAATCCTAATCCTTCTCTTGAGGAGTCTCAGCCTCTTAATACTTTCATGTTT                    
+CAGAATAATAGGTTCCGAAATAGGCAGGGGGCATTAACTGTTTATACGGGCACTGTTACT                    
+CAAGGCACTGACCCCGTTAAAACTTATTACCAGTACACTCCTGTATCATCAAAAGCCATG                    
+TATGACGCTTACTGGAACGGTAAATTCAGAGACTGCGCTTTCCATTCTGGCTTTAATGAA                    
+GATCCATTCGTTTGTGAATATCAAGGCCAATCGTCTGACCTGCCTCAACCTCCTGTCAAT                    
+GCTGGCGGCGGCTCTGGTGGTGGTTCTGGTGGCGGCTCTGAGGGTGGTGGCTCTGAGGGT                    
+GGCGGTTCTGAGGGTGGCGGCTCTGAGGGAGGCGGTTCCGGTGGTGGCTCTGGTTCCGGT                    
+GATTTTGATTATGAAAAGATGGCAAACGCTAATAAGGGGGCTATGACCGAAAATGCCGAT                    
+GAAAACGCGCTACAGTCTGACGCTAAAGGCAAACTTGATTCTGTCGCTACTGATTACGGT                    
+GCTGCTATCGATGGTTTCATTGGTGACGTTTCCGGCCTTGCTAATGGTAATGGTGCTACT                    
+GGTGATTTTGCTGGCTCTAATTCCCAAATGGCTCAAGTCGGTGACGGTGATAATTCACCT                    
+TTAATGAATAATTTCCGTCAATATTTACCTTCCCTCCCTCAATCGGTTGAATGTCGCCCT                    
+TTTGTCTTTAGCGCTGGTAAACCATATGAATTTTCTATTGATTGTGACAAAATAAACTTA                    
+TTCCGTGGTGTCTTTGCGTTTCTTTTATATGTTGCCACCTTTATGTATGTATTTTCTACG                    
+TTTGCTAACATACTGCGTAATAAGGAGTCTTAATCATGCCAGTTCTTTTGGGTATTCCGT                    
+TATTATTGCGTTTCCTCGGTTTCCTTCTGGTAACTTTGTTCGGCTATCTGCTTACTTTTC                    
+TTAAAAAGGGCTTCGGTAAGATAGCTATTGCTATTTCATTGTTTCTTGCTCTTATTATTG                    
+GGCTTAACTCAATTCTTGTGGGTTATCTCTCTGATATTAGCGCTCAATTACCCTCTGACT                    
+TTGTTCAGGGTGTTCAGTTAATTCTCCCGTCTAATGCGCTTCCCTGTTTTTATGTTATTC                    
+TCTCTGTAAAGGCTGCTATTTTCATTTTTGACGTTAAACAAAAAATCGTTTCTTATTTGG                    
+ATTGGGATAAATAATATGGCTGTTTATTTTGTAACTGGCAAATTAGGCTCTGGAAAGACG                    
+CTCGTTAGCGTTGGTAAGATTCAGGATAAAATTGTAGCTGGGTGCAAAATAGCAACTAAT                    
+CTTGATTTAAGGCTTCAAAACCTCCCGCAAGTCGGGAGGTTCGCTAAAACGCCTCGCGTT                    
+CTTAGAATACCGGATAAGCCTTCTATATCTGATTTGCTTGCTATTGGGCGCGGTAATGAT                    
+TCCTACGATGAAAATAAAAACGGCTTGCTTGTTCTCGATGAGTGCGGTACTTGGTTTAAT                    
+ACCCGTTCTTGGAATGATAAGGAAAGACAGCCGATTATTGATTGGTTTCTACATGCTCGT                    
+AAATTAGGATGGGATATTATTTTTCTTGTTCAGGACTTATCTATTGTTGATAAACAGGCG                    
+CGTTCTGCATTAGCTGAACATGTTGTTTATTGTCGTCGTCTGGACAGAATTACTTTACCT                    
+TTTGTCGGTACTTTATATTCTCTTATTACTGGCTCGAAAATGCCTCTGCCTAAATTACAT                    
+GTTGGCGTTGTTAAATATGGCGATTCTCAATTAAGCCCTACTGTTGAGCGTTGGCTTTAT                    
+ACTGGTAAGAATTTGTATAACGCATATGATACTAAACAGGCTTTTTCTAGTAATTATGAT                    
+TCCGGTGTTTATTCTTATTTAACGCCTTATTTATCACACGGTCGGTATTTCAAACCATTA                    
+AATTTAGGTCAGAAGATGAAATTAACTAAAATATATTTGAAAAAGTTTTCTCGCGTTCTT                    
+TGTCTTGCGATTGGATTTGCATCAGCATTTACATATAGTTATATAACCCAACCTAAGCCG                    
+GAGGTTAAAAAGGTAGTCTCTCAGACCTATGATTTTGATAAATTCACTATTGACTCTTCT                    
+CAGCGTCTTAATCTAAGCTATCGCTATGTTTTCAAGGATTCTAAGGGAAAATTAATTAAT                    
+AGCGACGATTTACAGAAGCAAGGTTATTCACTCACATATATTGATTTATGTACTGTTTCC                    
+ATTAAAAAAGGTAATTCAAATGAAATTGTTAAATGTAATTAATTTTGTTTTCTTGATGTT                    
+TGTTTCATCATCTTCTTTTGCTCAGGTAATTGAAATGAATAATTCGCCTCTGCGCGATTT                    
+TGTAACTTGGTATTCAAAGCAATCAGGCGAATCCGTTATTGTTTCTCCCGATGTAAAAGG                    
+TACTGTTACTGTATATTCATCTGACGTTAAACCTGAAAATCTACGCAATTTCTTTATTTC                    
+TGTTTTACGTGCTAATAATTTTGATATGGTTGGTTCAATTCCTTCCATAATTCAGAAGTA                    
+TAATCCAAACAATCAGGATTATATTGATGAATTGCCATCATCTGATAATCAGGAATATGA                    
+TGATAATTCCGCTCCTTCTGGTGGTTTCTTTGTTCCGCAAAATGATAATGTTACTCAAAC                    
+TTTTAAAATTAATAACGTTCGGGCAAAGGATTTAATACGAGTTGTCGAATTGTTTGTAAA                    
+GTCTAATACTTCTAAATCCTCAAATGTATTATCTATTGACGGCTCTAATCTATTAGTTGT                    
+TAGTGCACCTAAAGATATTTTAGATAACCTTCCTCAATTCCTTTCTACTGTTGATTTGCC                    
+AACTGACCAGATATTGATTGAGGGTTTGATATTTGAGGTTCAGCAAGGTGATGCTTTAGA                    
+TTTTTCATTTGCTGCTGGCTCTCAGCGTGGCACTGTTGCAGGCGGTGTTAATACTGACCG                    
+CCTCACCTCTGTTTTATCTTCTGCTGGTGGTTCGTTCGGTATTTTTAATGGCGATGTTTT                    
+AGGGCTATCAGTTCGCGCATTAAAGACTAATAGCCATTCAAAAATATTGTCTGTGCCACG                    
+TATTCTTACGCTTTCAGGTCAGAAGGGTTCTATCTCTGTTGGCCAGAATGTCCCTTTTAT                    
+TACTGGTCGTGTGACTGGTGAATCTGCCAATGTAAATAATCCATTTCAGACGATTGAGCG                    
+TCAAAATGTAGGTATTTCCATGAGCGTTTTTCCTGTTGCAATGGCTGGCGGTAATATTGT                    
+TCTGGATATTACCAGCAAGGCCGATAGTTTGAGTTCTTCTACTCAGGCAAGTGATGTTAT                    
+TACTAATCAAAGAAGTATTGCTACAACGGTTAATTTGCGTGATGGACAGACTCTTTTACT                    
+CGGTGGCCTCACTGATTATAAAAACACTTCTCAAGATTCTGGCGTACCGTTCCTGTCTAA                    
+AATCCCTTTAATCGGCCTCCTGTTTAGCTCCCGCTCTGATTCCAACGAGGAAAGCACGTT                    
+ATACGTGCTCGTCAAAGCAACCATAGTACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGG                    
+GTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTT                    
+TCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATC                    
+GGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTG                    
+ATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGA                    
+CGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTTGAACAACACTCAACC                    
+CTATCTCGGGCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGAACCACCATCAA                    
+ACAGGATTTTCGCCTGCTGGGGCAAACCAGCGTGGACCGCTTGCTGCAACTCTCTCAGGG                    
+CCAGGCGGTGAAGGGCAATCAGCTGTTGCCCGTCTCGCTGGTGAAAAGAAAAACCACCCT                    
+GGCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATCCAGCTGGC                    
+ACGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTACC                    
+TCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAA                    
+TTGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCCCCG                    
+GATCCGTCGACCTGCAGGTCGACGGATCCGGGGAATTCACTGGCCGTCGTTTTACAACGT                    
+CGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATCCCCCCTTC                    
+GCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGTAGC                    
+CTGAATGGCGAATGGCGCTTTGCCTGGTTTCCGGCACCAGAAGCGGTGCCGGAAAGCTGG                    
+CTGGAGTGCGATCTTCCTGAGGCCGA-AC-GTCGTCGTCCCCTCAAACTGGCAGATGCAC                    
+GGTTACGATGCGCCCATCTACACCAACGTAACCTATCCCATTACGGTCAATCCGCCGTTT                    
+GTTCCCACGGAGAATCCGACGGGTTGTTACTCGCTCACATTTAATGTTGATGAAAGCTGG                    
+CTACAGGAAGGCCAGACGCGAATTATTTTTGATGGCGTTCCTATTGGTTAAAAAATGAGC                    
+TGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGTTTACAATTTAAATAT                    
+TTGCTTATACAATCTTCCTGTTTTTGGGGCTTTTCTGATTATCAACCGGGGTACATATGA                    
+TTGACATGCTAGTTTTACGATTACCGTTCATCGATTCTCTTGTTTGCTCCAGACTCTCAG                    
+GCAATGACCTGATAGCCTTTGTAGATCTCTCAAAAATAGCTACCCTCTCCGGCATGAATT                    
+TATCAGCTAGAACGGTTGAATATCATATTGATGGTGATTTGACTGTCTCCGGCCTTTCTC                    
+ACCCTTTTGAATCTTTACCTACACATTACTCAGGCATTGCATTTAAAATATATGAGGGTT                    
+CTAAAAATTTTTATCCTTGCGTTGAAATAAAGGCTTCTCCCGCAAAAGTATTACAGGGTC                    
+ATAATGTTTTTGGTACAACCGATTTAGCTTTATGCTCTGAGGCTTTATTGCTTAATTTTG                    
+CTAATTCTTTGCCTTGCCTGTATGATTTATTGGATGTT                                          
diff --git a/userdata/newtrna.pat b/userdata/newtrna.pat
new file mode 100644
index 0000000..a936764
--- /dev/null
+++ b/userdata/newtrna.pat
@@ -0,0 +1,68 @@
+ A6          aac      Class 
+      7      Length
+     56      Minimum loop
+     80      Maximum loop
+  12.00000   Cutoff
+ A1          t        Class 
+      1      Relative motif
+      5      Relative end
+      8      Relative start position
+      0      Number of extra positions
+ T
+ @ End of string
+ A6          d        Class 
+      1      Relative motif
+      5      Relative end
+     10      Relative start position
+      0      Number of extra positions
+      4      Length
+      4      Minimum loop
+     13      Maximum loop
+   5.00000   Cutoff
+ A1          ar       Class 
+      1      Relative motif
+      5      Relative end
+     14      Relative start position
+      0      Number of extra positions
+ AR
+ @ End of string
+ A6          ac       Class 
+      3      Relative motif
+      3      Relative end
+      3      Relative start position
+      0      Number of extra positions
+      5      Length
+      7      Minimum loop
+      7      Maximum loop
+   8.00000   Cutoff
+ A1          yt       Class 
+      5      Relative motif
+      5      Relative end
+      6      Relative start position
+      0      Number of extra positions
+ YT
+ @ End of string
+A1           V        Class
+      5               Relative motif
+      5               Relative end
+     11               Range start
+      0               Range length
+V
+@ End of string       
+ A6          tu       Class 
+      5      Relative motif
+      3      Relative end
+      5      Relative start position
+     15      Number of extra positions
+      5      Length
+      6      Minimum loop
+      7      Maximum loop
+   8.00000   Cutoff
+ A2          gtt      Class 
+      8      Relative motif
+      5      Relative end
+      5      Relative start position
+      0      Number of extra positions
+ GTTCRANY
+ @ End of string
+   8.00000   Cutoff
diff --git a/userdata/pjb8.vep b/userdata/pjb8.vep
new file mode 100644
index 0000000..ddcad83
--- /dev/null
+++ b/userdata/pjb8.vep
@@ -0,0 +1,93 @@
+; pjb8 sequence from karens r107 database 4-10-91
+ GAATTCTCATGTTTGACAGCTTATCATCGATAAGCTTTAA
+ TGCGGTAGTTTATCACAGTTAAATTGCTAACGCAGTCAGGCACCGTGTATGAAATCTAAC
+ AATGCGCTCATCGTCATCCTCGGCACCGTCACCCT-GGATGCTGTAGGCATAGGCTTGGT
+ TATGCCGGTACTGCCGGGCCTCTTGCGGGATATCGTCCATTCCGACAGCATCGCCAGTCA
+ CTATGGCGTGCTGCTAGCGCTATATGCGTTGATGCAATTTCTATGCGCACCCGTTCTCGG
+ AGCACTGTCCGACCGCTTTGGCCGCCGCC-AGTCCTGCTCGC-TTCGCTACTTGGAGCCA
+ CTATCGACTACGCGATCATGGCGACCACACCCGTCCTGTGGATCTGCCTCGCTGGCCTGC
+ CGCAGTTCTTCAACCTCCCGGCGCA-GTTTTTCGTTTCTCAATTTCAGCATCC-TTTCGG
+ CATACCATTTTATGACGGCGGCAGAGTCATAAAGCACCTCATTACCCTTGCCACCGCCTC
+ GCAGAACGGGCATTCCCTGTTCCTGCCAGTTCTGAATGGTAC*GGAT-ACTCGCACCGAA
+ AATGTCAGCC-AGCTGCTTTTTGTTGACTTCCATTGTTCATTCCACGGACAAAAACAGAG
+ AAAGGA-ACGACAGAGGCCAAAAAGCTC--TTTCAGCACC-TGTCGTTTCCTTTTC--TT
+ TTCAGAGGGTATTTTAAATAAAAACATTAAGT*TATGACGAAGAAGAACGGAAACGCCTT
+ AA-ACCGGAAAATTTTCATAAATAGCGAAA-CCCGCGAGTCGCCGCCCCGTAACAAGGCG
+ GATCC-*GAGAAGGACCCGCAAATGATAATAATTATCAATTGCATACTATCGACGGCACT
+ GCTGCCAGATAACACCACCGGGGAAACATTCCATCATGATGGCCGTGCGGACATAGGAAG
+ CCAGTTCATCCATCGCTTTCTTGTCT-CTGCCATTTGCTTTGTGACATCCAGCGCCGCAC
+ ATTCAGCAGCGTTTTTCAGCGCGTTTTCGATCAACGTTTCAA*TGTTGGTATCAACA-CC
+ AGG-TTTAACTTT--*-----TTATCGGCACTGACGGTTACCTTGTTCTGCG**CTGGCT
+ CATCACGCTGGATACCAAGGCTGATG-TT*GTA*GATATTGGTCAC*CGGCTGAGGTGT*
+ TTCGATTGCCGCTGCGTGGATAGCACCATTTGCGATAG----CGTCCTTGATGAATGACA
+ CTCCATTGCGAATAAGTTCGAAGGAGACGGTGTCACGAATGCGCTGGTCCAGCTCGTCGA
+ TTGCCTTTTGTGCAGCAGAGGTATCAATCT-CAAC-CCAAGC-TCATCGAAGCGCAATAT
+ TGCTGCTCACCAAAACGCGTATTGACCAGGTGTTCAACGGCAAATTTCTGCCCTTCTGAT
+ GTCAGAAAGGTAAAGTGATTTTCTTTCTGGTATTCAGTTGCTGT-GTGTCTGGTTTCAGC
+ AAA*--CCAAG-TCGCGCAATTCGGCTGTGCAGTATTTAGAAGGCAGATCACCAGACAGC
+ AACGGC-CACGGAAAAACAGCGCATACAGAACATCCGTCGCCGCGCCGGACAACGTGATA
+ ATTTTATGACCCATGATTTATTTCCTTTTAGACGTGAGCCTGTCGCACAGCAAAGCCGCC
+ GAAAGTTAACGGTTTGCCCAGGCTCA-CAACTGAAAGACTTTCT-ACGGT-GTGCGCGTG
+ CGATGCG--TAGAAGACTGATTTATCAACCTGTCTTTATATCAGGATTCATTACCTGACT
+ ATTTGTGGGTAAAGTTCGTAGTGCGCTGATCGTGCAAAATGATTTTAGTTGGGAACAGTT
+ CGCAACTCTGTCCCATAAAAATCAGCATATTCCCATCTATCCCATATCCAGCGCATTGAC
+ CATCG*GGATACTGAAGGGAGATTCCATCATCTCTTAGAAAGATCACCATCTCTTTTGTT
+ TCAATTTGCATATAGCTACCTGGAGGATTTATGAATACAAGGATTTTCATG-ACTATTAC
+ CATGAGATT-GATTTTCCATCTTTATTCGCGAGA-CAGT-GGAAAGCGATGACGATG-TG
+ GGTACTACATTGCGCATTCACCTACT-TGTGAGCGCATG*GTCGAAGCATGG-ATATG--
+ CATGCTGTGACTGCCAAGATCCTCTAC-CCGGACGCATCGTGGCCA-TTCACCGGGCCAC
+ AGGTGCGGTTGCTGGCGCCTATATCGCCGACATCACCGATGGGGAAGATCGGGCTCGCCA
+ CTTCGGGCTCATGAGCGCTTGTTTCGGCGTGGGTATGGTGGCAGGCCCCGTGGCG-GGG-
+ ACTGTTGGGCGCCATCTCCTTGCATGCACCATTCCTTGCGGCGGCGGGTGC-TCAACGGC
+ CTCAACCTACTACTGG-TGCTTCCTAATGCAGGAGTCGCATAAGGGAGAGCGTCGACCGA
+ TGCCCTTGAGAGCCTTCAACCCAGTCAGCCTTCT-TCCGGTGGGGCGCGGGGG-CAATGA
+ CTATCGTCGCCGCACTTATGACTGTC--TCTTTATCATGCAACTCGTAGGACA-G---CC
+ GGCAGCGCTCTGGGTCATTTTCGGCGAG----*****CC--T-T-CTGGAGCGCGACGAT
+ GATC-GGCCTGTCGCTTGCGGTATTCGGAATCTTGCACGCCCTCGCTCAAGCCTTCGT-A
+ CTG-TCCCGCCACCAAACGTTTCGGCGAGAAGCAGGCCATTA-TCGCGGCATGG*CGGCC
+ GACGCGCT-GGGCTACGTCTTGCTG-CGTTC-CGA--CGAGGC-TGGATGGCCTTCCCCA
+ TTATGATTCTTCT-CGCTTC*-----GGCATCGGGAT---CCGCGTTGCAGGCCATGCTG
+ TCCAGGCAGGTAGATGACGACCATCAGGGACAGCTTCAAGGA*TCGCTCGCGGCTCTTAC
+ CAGCCTAACTTCGATCATTGGACCGCTGATCGTCACGGCGATTT*ATGCC-CC*TC-G--
+ -GCA-CATGGAACGGGTTGGCATGGATTGTAG-CGCCG-CCTATACCTTGTCTGCCT---
+ CCCG--TTGC-TCGCGGTGCATGGAGCCGGGCCACCTCGACCTGAATGGA---CGGC--C
+ ACCTCGCTAACGGATTCACCACTCCAAGAATTGGAGCCAATCAATTCTTGCGGAGAACTG
+ TGAATGCGCAAACCAACCCTTGGCAGAACATATCCATC-CGTCCGCCATCTC---AGCA-
+ -CGCACGCG-------*CTC*GGGCAGCGTTGG*GTCCTGGCCACGGGTGCGCATGATCG
+ TGCTCCT*GTCGTTGAGGACCCGGCTAGGCTGGCGGGGTTGCCTTACTGGTTAGCAGAAT
+ GAATCACCGA-TACGCGAGCGAACGTGAAGCGACT-CTGCTGCAA-AAC-CTGCGACCTG
+ AGCAACAACATGAATGGTCTTCGGTTTCCGTGTTTCGTAAAGTCTGGAAACGCGGAAGTC
+ AGCGCTCTTCCGCTTCCTCGCTCA-ATG-ACCTGCGCTCGGTCGTTCGGCTGCGGCGAGC
+ GGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGG
+ AAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGGCCGTTGCTG
+ GCGTTTTTCCACTAGCCGCCCCCCT-ACGAGCATCACAAAAATCGACGCTCAAGTCAGAG
+ GT-GCGAAACCCGACAGGACTATAAAGATAC-AGGCGT-TC-CCCTGAAGCTCCCTCGTG
+ CGCTCTCCTGTTCCGA-CTCCTGCCGCTTACCGGATACCTGTCCGCCTTTTTCCTTCGGG
+ AAGCGTGGCGCTTTTCTTCAA-TACGTGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTC
+ GCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCTTGCGCCTTATCCGG
+ TAACTATCGTCTTGAGTTCCAACCCGGTAAGACACGACTTCATGCGCCA*CCTGGCAG-C
+ AGCCACTGGTA-ACAGGATTAGCAGAGCGAGGTATGTAGGC-GGTGCTACAGAGTTCTTG
+ AAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCCTCTGCTGA
+ AGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAA-AC-A-AACCGCTG
+ GTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAG
+ AAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAG
+ GGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAAT
+ GAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCT
+ TAATCAGTGAGGCACCTATCTCAGCG-TCTGTCTATTTCGTTCATCCATAGTTGCCTGAC
+ TCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAA
+ TGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCG
+ GAAGGCCTGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATT
+ GTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTT-CC-
+ ATTGCT-CAGGCATC-GTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGG
+ TTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTA-TCC
+ TTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATG
+ GCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGT
+ GAGTACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCG
+ GCGTCAACACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGA
+ AAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGAT*CCAGTTCGAT
+ GTAACCCACTCGTGCACCCAACTG-ATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTG
+ GGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAAT
+ GTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTC
+ TCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCA
+ CATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTAACCT
+ ATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTTCAAGAATTCAA
diff --git a/userdata/prokprom.seq b/userdata/prokprom.seq
new file mode 100644
index 0000000..77681d9
--- /dev/null
+++ b/userdata/prokprom.seq
@@ -0,0 +1,88 @@
+ TTAGCGGATCCTACCTGACGCTTTTTATCGCAACTCTCTACTGTTTCTCCATACCCGTTTTT
+ GCAAATAATCAATGTGGACTTTTCTGCCGTGATTATAGACACTTTTGTTACGCGTTTTTGT
+ CTAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTATTTCATACCATAAG
+ CACTAATTTATTCCATGTCACACTTTTCGCATCTTTGTTATGCTATGGTTATTTCATACC
+ TAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAATTGTGAGC
+ TTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCG
+ GACACCATCGAATGGCGCAAAACCTTTCGCGGTATGGCATGATAGCGCCCGGAAGAGAGT
+ AGGGGCAAGGAGGATGGAAAGAGGTTGCCGTATAAAGAAACTAGAGTCCGTTCAGGTGT
+ CAGGGGGTGGAGGATTTAAGCCATCTCCTGATGACGCATAGTCAGCCCATCATGAATG
+ AAAAACGTCATCGCTTGCATTAGAAAGGTTTCTGGCCGACCTTATAACCATTAATTACG
+ AAACAATTTCAGAATAGACAAAAACTCTGAGTGTAATAATGTAGCCTCGTGTCTTGCG
+ CAGAAACGTTTTATTCGAACATCGATCTCGTCTTGTGTTAGAATTCTAACATACGGTTGC
+ AATTGTGATGTGTATCGAAGTGTGTTGCGGAGTAGATGTTAGAATACTAACAAACTCGCAA
+ TCTGAAATGAGCTGTTGACAATTAATCATCGAACTAGTTAACTAGTACGCAAGTTCACGT
+ TGGGGACGTCGTTACTGATCCGCACGTTTATGATATGCTATCGTACTCTTTAGCGAGTACA
+ GTACTAGAGAACTAGTGCATTAGCTTATTTTTTTGTTATCATGCTAACCACCCGGCGAG
+ ACCGGAAGAAAACCGTGACATTTTAACACGTTTGTTACAAGGTAAAGGCGACGCCGCCC
+ ATATAAAAAAGTTCTTGCTTTCTAACGTGAAAGTGGTTTAGGTTAAAAGACATCAGTTGAA
+ GATCTACAAACTAATTAATAAATAGTTAATTAACGCTCATCATTGTACAATGAACTGTACAAA
+ GTTGACATCCGTTTTTGTATCCAGTAACTCTAAAAGCATATCGCATT
+ GGCCAAAAAATATCTTGTACTATTTACAAAACCTATGGTAACTCTTTAGGCATTCCTTCGA
+ TTTGTTTTTCATTGTTGACACACCTCTGGTCATGATAGTATCAATATTCATGCAGTATT
+ AAATTAAAATTTTATTGACTTAGGTCACTAAATACTTTAACCAATATAGGCATAGCGCACA
+ GCCTTCTTCAAAACGTGTTTTTTGTTGTTAATTCGGTGTAGACTTGTAAACCTAAATCT
+ TTGTCATAATCGACTTGTAAACCAAATTGAAAAGATTTAGGTTTACAAGTCTACACCGAAT
+ CATCCTCGCACCAGTCGACGACGGTTTACGCTTTACGTATAGTGGCGACAATTTTTTTT
+ TCCAGTATAATTTGTTGGCATAATTAAGTACGACGAGTAAAATTACATACCTGCCCGC
+ TCAGAAATATTATGGTGATGAACTGTTTTTTTATCCAGTATAATTTGTTGGCATAATTAA
+ ACAGTTACCCACTATTCCTGTGGATAACCATGTGTATTAGAGTTAGAAAACACGAGGCA
+ TTTCTACAAAACACTTGATACTGTATGAGCATACAGTATAATTGCTTCAACAGAACAT
+ TGTGCAGTTTATGGTTCCAAAATCGCCTTTTGCTGTATATACTCAGAGCATAACTGTAT
+ TGCTATCCTGACAGTTGTCACGCTGATTGGTGTCGTTACAATCTAACGCATCGCCAATG
+ CCATCAAAAAAATATTCTCAACATAAAAAACTTTGTGTAATACTTGTAACGCTACATGGA
+ CAAGGTAGAATGCTTTGCCTTGTCGGCCTGATTAATGGCACGATAGTCGCATCGGATCTG
+ GATCGCACGATCTGTATACTTATTTGAGTAAATTAACCCACGATCCCAGCCATTCTTCTGC
+ CTGTTGTTCAGTTTTTGAGTTGTGTATAACCCCTCATTCTGATCCCAGCTTATACGGT
+ ATTACAAAAAGTGCTTTCTGAACTGAACAAAAAAGAGTAAAGTTAGTCGCGTAGGGTACA
+ ATGCGCAACGCGGGGTGACAAGGGCGCGCAAACCCTCTATACTGCGCGCCGAAGCTGACC
+ AACGCATACGGTATTTTACCTTCCCAGTCAAGAAAACTTATCATATTCCCACTTTTCAGT
+ CTACGGCGAGGCTATCGATCTCAGCCAGCCTGATGTAATTTATCAGTCTATAAATGACC
+ TAAAAAACTAACAGTTGTCAGCCTGTCCCGCTTATAAGATCATACGCCGTTATACGTT
+ ATGCAATTTTTTAGTTGCATGAACTCGCATGTCTCCATAGAATGCGCGCTACTTGATGCC
+ TCTCAACGTAACACTTTACAGCGGGGCGTCATTTGATATGATGCGCCCCGCTTCCCGAT
+ TCGATAATTAACTATTGACGAAAAGCTGAAAACCACTAGAATGCGCCTCCGTGGTAGCAA
+ CCTTGAAAAAGAGGTTGACGCTGCAAGGCTCTATACGCATAATGCGCCCCGCAACGCCGA
+ TTTTAAATTTCCTCTTGTCAGGCCGGAATAACTCCCTATAATGCGCCACCACTGACACGG
+ TTTATATTTTTCGCTTGTCAGGCCGGAATAACTCCCTATAATGCGCCACCACTGACACGG
+ GATCAAAAAAATACTTGTGCAAAAAATTGGGATCCCTATAATGCGCCTCCGTTGAGACGA
+ CTGCAATTTTTCTATTGCGGCCTGCGGAGAACTCCCTATAATGCGCCTCCATCGACACGG
+ ATGCATTTTTCCGCTTGTCTTCCTGAGCCGACTCCCTATAATGCGCCTCCATCGACACGG
+ GCAAAAATAAATGCTTGACTCTGTAGCGGGAAGGCGTATTATGCACACCCCGCGCCGC
+ AAGCAAAGAAATGCTTGACTCTGTAGCGGGAAGGCGTATTATGCACACCGCCGCGCCG
+ CCTGAAATTCAGGGTTGACTCTGAAAGAGGAAAGCGTAATATACGCCACCTCGCGACAG
+ TCGTTGTATATTTCTTGACACCTTTTCGGCATCGCCCTAAAATTCGGCGTCCTCATAT
+ CCGTTTATTTTTTCTACCCATATCCTTGAAGCGGTGTTATAATGCCGCGCCCTCGATA
+ TACTAGCAATACGCTTGCGTTCGGTGGTTAAGTATGTATAATGCGCGGGCTTGTCGT
+ TTCGCATATTTTTCTTGCAAAGTTGGGTTGAGCTGGCTAGATTAGCCAGCCAATCTTT
+ TGTAAACTAATGCCTTTACGTGGGCGGTGATTTTGTCTACAATCTTACCCCCACGTATA
+ CGACTTAATATACTGCGACAGGACGTCCGTTCTGTGTAAATCGCAATGAAATGGTAAAA
+ TATCAAAAAGAGTATTGACTTAAAGTCTAACCTATAGGATACTTACAGCCATCGAGAGGG
+ ACGAAAAACAGGTATTGACAACATGAAGTAACATGCAGTAAGATACAAATCGCTAGGTAAC
+ GTGAAACAAAACGGTTGACAACATGAAGTAAACACGGTACGATGTACCACATGAAACGAC
+ CATTGATAAGCAACTTGACGCAATGTTAATGGGCTGATAGTCTTATCTTACAGGTCATC
+ CTTTAAGATAGGCGTTGACTTGATGGGTCTTTAGGTGTAGGCTTTAGGTGTTGGCTTTA
+ TAACACCGTGCGTGTTGACTATTTTACCTCTGGCGGTGATAATGGTTGCATGTACTAAG
+ AACACGCACGGTGTTAGATATTTATCCCTTGCGGTGATAGATTTAACGTATGAGCACAA
+ TATCTCTGGCGGTGTTGACATAAATACCACTGGCGGTGATACTGAGCACATCAGCAGGA
+ TACCTCTGCCGAAGTTGAGTATTTTTGCTGTATTTGTCATAATGACTCCTGTTGATAGAT
+ TTAACGGCATGATATTGACTTATTGAATAAAATTGGGTAAATTTGACTCAACGATGGGTT
+ TAGAGCCTCGTTGCGTTTGTTTGCACGAACCATATGTAAGTATTTCCTTAGATAACAAT
+ CGGTTTTTTCTTGCGTGTAATTGCGGAGACTTTGCGATGTACTTGACACTTCAGGAGTG
+ AAGAAAAACTGTATTTGACAAACAAGATACATTGTATCAAAATACAAGAAAGTTTGTTGAT
+ ACAATGTATCTTGTTTGTCAAATACAGTTTTTCTTGTGAAGATTGGGGGTAAATAACAGA
+ CATCTTAAATAAACTTGACTAAAGATTCCTTTAGTAGATAATTTAAGTGTTCTTTAAT
+ AAATTATCTACTAAAGGAATCTTTAGTCAAGTTTATTTAAGATGACTTAACTATCAAT
+ TCCAAGTTAGTGTATTGACATGATAGAAGCACTCTACTATATTCTCAATAGGTCCACGG
+ CCACCGTGGACCTATTGAGAATATAGTAGAGTGCTTCTATCATGTCAATACACTAACTT
+ AATAACCGTCAGGATTGACACCCTCCCAATTGTATGTTTTCATGCCTCCAAATCTTGGA
+ TAGAGATTCTCTTGTTGACATTTTAAAAGAGCGTGGATTACTATCTGAGTCCGATGCTGTTC
+ GCCAGTTAAATAGCTTGCAAAATACGTGGCCTTATGGTTACAGTATGCCCATCGCAGTT
+ GATACAAATCTCCGTTGTACTTTGTTTCGCGCTTGGTATAATCGCTGGGGGTCAAAG
+ TCCTCTTAATCTTTTTGATGCAATTCGCTTTGCTTCTGACTATAATAGACAGGGTAAAGACCT
+ TTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCT
+ AAGAATTCTCATGTTTGACAGCTTATCATCGATAAGCTTTAATGCGGTAGTTTATCACA
+ TTCATACACGGTGCCTGACTGCGTTAGCAATTTAACTGTGATAAACTACCGCATTAAAGCTTA
+ GTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTG
+ ATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAA
+ CATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCAT
diff --git a/userdata/puc18.vec b/userdata/puc18.vec
new file mode 100644
index 0000000..d7f9a6c
--- /dev/null
+++ b/userdata/puc18.vec
@@ -0,0 +1,47 @@
+<--puc.001--------->
+;puc 18 vector sequence from vecbase library
+ GCGCCCAATACGCAAACCGCCTCTCCCCGCGCGTTGGCCGATTCATTAATGCAGCTGGCA
+ CGACAGGTTTCCCGACTGGAAAGCGGGCAGTGAGCGCAACGCAATTAATGTGAGTTAGCT
+ CACTCATTAGGCACCCCAGGCTTTACACTTTATGCTTCCGGCTCGTATGTTGTGTGGAAT
+ TGTGAGCGGATAACAATTTCACACAGGAAACAGCTATGACCATGATTACGAATTCGAGCT
+ CGGTACCCGGGGATCCTCTAGAGTCGACCTGCAGGCATGCAAGCTTGGCACTGGCCGTCG
+ TTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCAC
+ ATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAAC
+ AGTTGCGCAGCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGT
+ GCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGT
+ TAAGCCAGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACGGGCTTGTCTGCTCC
+ CGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAGGTTTT
+ CACCGTCATCACCGAAACGCGCGAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGG
+ TTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGC
+ GCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGAC
+ AATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATT
+ TCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAG
+ AAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCG
+ AACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAA
+ TGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGC
+ AAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAG
+ TCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAA
+ CCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGC
+ TAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGG
+ AGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAA
+ CAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAA
+ TAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTG
+ GCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAG
+ CACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGG
+ CAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATT
+ GGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTT
+ AATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAAC
+ GTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAG
+ ATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGG
+ TGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCA
+ GAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGA
+ ACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCA
+ GTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGC
+ AGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACA
+ CCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAA
+ AGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTC
+ CAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGC
+ GTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGG
+ CCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTAT
+ CCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCA
+ GCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGA