General Cleanup of Drug Target Files

I have created a new script, DrugClean, which prepares a drug target output file for NMPDR. The script is invoked as follows:

DrugClean -macFile fileName1 fileName2 ... fileNameN

The -macFile switch is needed only when the input files are in Macintosh format; it applies to every file named on the command line.

The script removes duplicate entries, as well as entries for PEGs that are not in the current version of the Sprout database. It also converts the file to Unix format. I have changed targets.cgi to expect Unix files, so this script must be run on any new drug target files we receive; otherwise they won't work with targets.cgi.
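
As a rough illustration of the cleanup described above, here is a minimal Python sketch. It is not the DrugClean source. It assumes that each record is one tab-delimited line with the PEG ID in the first column, that "Macintosh format" means carriage-return line endings, and that the set of PEG IDs in the current Sprout database has already been fetched. The names clean_drug_targets, known_pegs and mac_format are hypothetical.

```python
def clean_drug_targets(text, known_pegs, mac_format=False):
    """Return cleaned drug target records as Unix-format text.

    Assumptions (not taken from the real DrugClean script):
      - one record per line, tab-delimited, PEG ID in the first column
      - Macintosh format means lines end in a bare carriage return
      - known_pegs is a set of PEG IDs present in the current database
    """
    if mac_format:
        # Turn classic Mac line endings (CR) into Unix line endings (LF).
        text = text.replace("\r", "\n")

    seen = set()   # records already written, used to drop duplicates
    kept = []
    for line in text.split("\n"):
        if not line:
            continue                      # skip empty lines
        peg_id = line.split("\t")[0]
        if peg_id not in known_pegs:
            continue                      # PEG not in the current database
        if line in seen:
            continue                      # exact duplicate record
        seen.add(line)
        kept.append(line)

    # Every record is terminated by a Unix newline.
    return "".join(record + "\n" for record in kept)
```

Called with mac_format=True on a Macintosh-format file's contents, the function returns text in which each surviving record appears once and ends in a Unix newline. Treating only exact duplicate lines as duplicates is a simplification; the real script may compare entries differently.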

I also fixed a performance problem with the organism files, and they now load in around 10 seconds instead of 50.

About

This page contains a single entry from the blog posted on December 6, 2006 5:36 PM.
