Home | Download | Purchase | knowledge

 
 


The TSCAINV database -- TSCA Inventory

A brief news item about a terrorist threat in  Britain mentioned Osmium tetroxide.  We had to look this substance up, and the process provided a useful reminder about procedures for looking up a particular chemical in a system like CIS.

Having just a name, we searched for OSMIUM TETROXIDE. The search "worked" -- that is, it retrieved 63 records -- but we bothered to look only at the very first citation in our results list.  This citation-form display was from the SANSS database, and it indicated that the CAS Registry Number for Osmium tetroxide was 20816-12-0.

Having found a CAS RN, we returned to the search screen without even bothering to look at the rest of the results from our first search.  Instead, we re-did the search, this time using the CAS RN instead of the chemical name. The result was much better -- 106 records retrieved by the CAS RN instead of the 63 records retrieved by the chemical name.

MORAL:  When you are searching in CIS and have only a chemical name to go by, look to SANSS to find an appropriate CAS RN for the substance. SANSS contains over 5 million chemical names and synonyms, and your odds of getting a hit in that database for a given chemical name are much greater than in any other CIS database -- many of which contain few or no synonyms. However, it is the CAS Registry Number that you can count on when searching in CIS, since the CAS Registry Number works generally in ALL the substance- oriented databases of CIS.  We have deliberately structured CIS results displays so that SANSS records come up first in any search that hits a  SANSS record -- and one reason for this is to enable you to get the CAS Registry Number for a new search when you only had a chemical name to start with.  If you restrict your searching to chemical names, you're missing a great deal of what the system has to offer.

DETAILED DESCRIPTION OF TSCAINV

The TSCAINV database -- TSCA Inventory -- was  recently updated.  This database is ordinarily  updated twice a year, and it's such a matter of  routine that we make little fuss about it.  However, we had occasion to assess the database  statistically this time around, and we thought we  would summarize some of the results for our  users.

For those unfamiliar with it, the TSCAINV  database lists substances included on the public  portion of the "TSCA Inventory," the list  maintained by EPA of chemical substances in industry and commerce in the US.  The appearance of a substance on the list means that it is  approved for use in the United States without any pre-manufacture notice (PMN) to EPA.   A substance  NOT appearing on the list MAY be listed on the  "confidential" portion of the inventory  maintained by EPA -- requiring an inquiry to EPA  to check -- or it may require submission of a  pre-manufacture notice (PMN) to EPA before the  substance can be legally imported or produced.  The version of the database now on CIS -- the first issuance of 2004 -- contains 65,068 substances, and it increases over time:  The first issuance of 2001 had only 64,412 entries, the first issuance of 2002 had 64,645, and the first issuance of 2003 had 64,872.

Each record in the database includes a CAS  Registry Number, and this is the best way to search the database since the records generally contain only a limited number of names/synonyms (see below).  Use SANSS as described above to find a CAS RN if you have only a name.

The limited chemical nomenclature included in each record consists of a formal chemical name -- as defined by the Chemical Abstracts Service in accordance with 9th CI nomenclature rules -- and (possibly) one or more "submitter names" or synonyms -- the name or names used by submitters of substances for inclusion on the Inventory.  If you do search for a chemical name in the TSCAINV database on CIS, a record may be retrieved if it contains your search entry as EITHER a formal chemical name or as a synonym/submitter name. However, only 36,839 of the current 65,068 records contain these alternative names, and most records that have alternative names contain only one or two.  These limitations explain why it's better to retrieve a CAS Registry Number from SANSS by the vast array of synonyms that SANSS contains than to search for a given name in TSCAINV.

A limited number of TSCAINV records -- currently, 2,184 -- may also contain a chemical definition. The chemical definition defines the substance to which the record pertains when the formal chemical name is to some extent ambiguous -- for example, when the name refers broadly to a class of derivatives from a natural source.

A chemical formula field appears in each TSCAINV record, although the field may merely stipulate that all or some portion of the formula is "unspecified." This frequently occurs when the substance in question is of unknown or variable composition, a complex reaction product, or a biological material.  Of the 65,068 molecular formula entries currently in the database, 15,874 include the notation UVCB, denoting "unknown, variable, complex, biological."

TSCAINV records also sometimes include codes indicating the status of the substance with regard to certain regulatory requirements. Currently, 24,942 records (out of 65,068 total) include one or more codes.  Here is a summary of the codes that can appear, with the counts of records in which they appear.