The SEQIO package is a set of C functions which can read and write biological sequence files formatted using various file formats and which can be used to perform database searches on biological databases. All of the code is packaged together into a single file, making it easy to incorporate into your programs. Here are the files included in the SEQIO package distribution.
At this point in time, the SEQIO package has been tested using gcc on Unix systems running SunOS, Solaris, Ultrix, IRIX and Windows NT, and using g++ on Ultrix. The code has been written to the ANSI C standard, so you will need an ANSI C/C++ compiler in order to compile the package. One suggestion I have is that you turn on optimization when compiling the SEQIO package. It will significantly improve the package's efficiency. Also, compiling the package may take several minutes, as the code is around 20,000 lines (this will get shorter in a later version (of course, I keep saying that every version)).
If you plan to use this package and wish to receive notices about
updates and bug fixes, please send mail to knight@cs.ucdavis.edu. In
that mail, specify whether you just want a notice about a new version
of the package, or you want the patch file or complete release
automatically sent to you.
(NOTE: If you see ANYTHING you think is either wrong, or should be
changed, please let me know. If it is wrong, I'll fix it. If I think
it isn't, I'll tell you why, and also tell you how you can get what
you want.)
Any use of the SEQIO package should be accompanied with acknowledgements and copyright notices in the documentation of any software developed using the package or derived from the package. Something along the lines of:
This software uses the SEQIO package for reading and writing sequences. Copyright (c) 1996 by James Knight at Univ. of California, Davis.Any papers describing software using the SEQIO package, or whose results were significantly aided by the use of the SEQIO package (except when the use was internal to a larger program), should include an acknowledgement and citation. The citation should be something like:
Knight, James "SEQIO: A C Package for Reading and Writing Sequences," distributed by the author.(As soon as I get a paper out about the package, this will become a reference to the paper.)
James KnightSend any bug reports, new database/file-format information, comments, complaints or extension requests to knight@cs.ucdavis.edu.
Dept. of Computer Science
Univ. of California, Davis
Davis, CA 95616
E-mail: knight@cs.ucdavis.edu
WWW-Site: http://wwwcsif.cs.ucdavis.edu/~knight
This work was supported foremost by Dan Gusfield at UCDavis, by grant DE-FG03-90ER60999 from the Department of Energy and by the Aspen Center for Physics.
My thanks to Don Gilbert for collecting descriptions of the various formats and including them with his "readseq" program. I never used his code, but the `Formats' file was quite useful in writing the package, and I did look through his code when writing "fmtseq". Thanks also to Russell Malmberg who stuck with all of my attempts to port the package to Windows NT/95 until it finally compiled and ran. Thanks to Kay Hofmann for describing the MSF format in a detailed enough form for implementation.
Please note however that in a future version, some programs added to the release may have a more restrictive copyright (those programs will be restricted to non-commercial use because of the original sources used to derive the programs). However, the SEQIO package, fmtseq, idxseq and the example programs will always be freely available for commercial or non-commercial use, now and into the future.
The copyright for the SEQIO package, its documentation and the fmtseq and idxseq programs:
Copyright (c) 1996 by James Knight at Univ. of California, Davis Permission to use, copy, modify, distribute and sell this software and its documentation is hereby granted, subject to the following restrictions and understandings: 1) Any copy of this software or any copy of software derived from it must include this copyright notice in full. 2) All materials or software developed as a consequence of the use of this software or software derived from it must duly acknowledge such use, in accordance with the usual standards of acknowledging credit in academic research. 3) The software may be used freely by anyone for any purpose, commercial or non-commercial. That includes, but is not limited to, its incorporation into software sold for a profit or the development of commercial software derived from it. 4) This software is provided AS IS with no warranties of any kind. The author shall have no liability with respect to the infringement of copyrights, trade secrets or any patents by this software or any part thereof. In no event will the author be liable for any lost revenue or profits or other special, indirect and consequential damages.