On-line Data Bases for Chemical Patent Searches*

Dr. Maximilian Herz, Scientific Information and Documentation Department, CIBA-GEIGY Ltd., Basle

Summary

The most important of the data bases available for chemical on-line searches are presented, and their suitability for various types of patent searches is shown with the aid of examples.

Available on-line patent data bases

The most frequently occurring types of on-line patent searches are:
- searches concerning the technical content of patents;
- searches concerning the bibliographical data;
- searches for members of a patent family.

The most important on-line data bases suitable for the above-mentioned types of search are listed in this chapter. The data bases can be divided into three groups, namely:
- on-line data bases for all fields of chemical technology;
- on-line data bases for special fields;
- on-line data bases for equivalence searches.

Tables 1-3 show which on-line patent data bases are searchable, and for what time period, by means of the two on-line systems Lockheed Information Systems (LIS) and System Development Corporation (SDC).

a) On-line data bases for all fields of chemical technology

For chemical searches, the most important data bases are those which cover the entire field of chemistry or even the whole of technology. These data bases each contain more than 500 000 patent references. In particular, three large information services deserve attention: DERWENT Service, CHEMICAL ABSTRACTS Service and IFI/PLENUM Services.

Table 1. On-line data bases containing > 500 000 patent abstracts

The table shows, from left to right, the on-line system, the data base supplier, the supplier's various data bases, and the periods they cover.

SDC  LIS  SUPPLIER       DATA BASE        PERIOD COVERED
x         DERWENT        FARMDOC/AGDOC    1963 - 1969
x                        PLASDOC          1966 - 1969
x                        CPI              1970 - 1973
x                        WPI              from 1974
x    x    CHEM. ABSTR.   CA-SEARCH        1967 - 1971
x    x                   CA-SEARCH        1972 - 1976
x    x                   CA-SEARCH        from 1977
     x    IFI/PLENUM     CLAIMS/CHEM.     1950 - 1970
     x                   CLAIMS           1971 - 1977
     x                   CLAIMS/ABSTR.    from 1978
     x                   CLAIMS/WEEKLY    from 1978
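The coverage periods and host systems described in this chapter lend themselves to a simple lookup. The following Python sketch is only an illustration: the start years and host systems are those stated in this chapter, while the record layout, the names `Coverage` and `covering`, and the treatment of FARMDOC, AGDOC and PLASDOC as ending in 1969 are assumptions made for the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Coverage:
    """One data base on one on-line system (hypothetical record layout)."""
    name: str
    system: str                       # "SDC" or "LIS"
    first_year: int
    last_year: Optional[int] = None   # None means "to the present"


# Start years and systems as stated in the text. The 1969 end year of the
# pre-1970 Derwent files is an assumption derived from "prior to 1970".
COVERAGE = [
    Coverage("FARMDOC", "SDC", 1963, 1969),
    Coverage("AGDOC", "SDC", 1965, 1969),
    Coverage("PLASDOC", "SDC", 1966, 1969),
    Coverage("CPI", "SDC", 1970),
    Coverage("WPI", "SDC", 1974),
    Coverage("CA-SEARCH", "LIS", 1967),
    Coverage("CA-SEARCH", "SDC", 1970),
    Coverage("CLAIMS", "LIS", 1950),
]


def covering(year: int, system: Optional[str] = None) -> List[str]:
    """Return the names of data bases covering `year`, optionally per system."""
    names = {
        c.name
        for c in COVERAGE
        if c.first_year <= year
        and (c.last_year is None or year <= c.last_year)
        and (system is None or c.system == system)
    }
    return sorted(names)
```

Calling `covering(1968, "SDC")` would, under these assumptions, return only the three early Derwent files, whereas `covering(1968, "LIS")` returns CA-SEARCH and CLAIMS.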

Whilst all DERWENT data bases are searchable on-line only via SDC, all IFI/PLENUM data bases are searchable exclusively via LIS.

Prior to 1970, the DERWENT data bases cover only the areas of pharmaceuticals (FARMDOC, from 1963), agrochemicals (AGDOC, from 1965) and plastics (PLASDOC, from 1966). From 1970 onwards, the Central Patents Index (CPI) covers all chemical patents. From 1974 onwards, the World Patents Index (WPI) documentation service published by Derwent covers all patents of 26 countries.

The data bases of Chemical Abstracts Service can be searched via LIS from 1967 to the present and via SDC from 1970 to the present.

The CLAIMS data base of IFI/PLENUM Data Company (IFI/PLENUM) covers U.S. patents exclusively. For the period 1950-1970 it covers only chemical U.S. patents. From 1971 on, it contains all U.S. patents.

Some important differences between the three services are:

CHEMICAL ABSTRACTS
- patents and other literature;
- mainly chemical substance oriented;
- only chemical substances from the examples, if physical data are given. Hence, chemical substances included in the patent claims are only partially covered.

DERWENT
- prior to 1974 only chemical patents, from 1974 on also patents of non-chemical IPC classes;
- patent claims and examples are covered.

IFI/PLENUM
- only U.S. patents;
- 1950-1970 only chemical patents, thereafter all classes of patents;
- prior to 1978, patent title, supplemented by keywords from the patent claims; from 1978 on, also the abstracts as they appear in the Official Gazette.

* Revised version of a lecture held at Basle on 21.6.1979 for the Basel on-line Experience Exchange Group. World Patent Information 2 (1980) No. 3.

b) On-line data bases for special fields

For a number of technical fields, special data bases for on-line patent searches are available in addition to the comprehensive data bases referred to in the preceding chapter. In contrast to CHEMICAL ABSTRACTS and DERWENT, these special data bases above all stress the technological aspect. These on-line data bases each cover more than 10 000 patent references; they are listed in Table 2.

Table 2. On-line data bases containing > 10 000 patent abstracts

SUPPLIER                            DATA BASE                       PERIOD COVERED
INTERNAT. FOOD INFORMAT. SERVICE    FSTA                            from 1969
INST. OF ELECTRICAL ENGINEERS       INSPEC                          1969 - 1977
INST. OF PAPER CHEMISTRY            PAPERCHEM                       from 1969
INST. TEXTILE DE FRANCE             TITUS                           from 1967
SHIRLEY INST.                       WORLD TEXTILES                  from 1970
THE AMERICAN SOCIETY FOR METALS     WORLD ALUMINIUM ABSTR. (WAA)    from 1968

[The SDC and LIS availability marks of Table 2 are not recoverable from the source.]

All 6 data bases contain other literature in addition to patents; they cover patents from 1970 at the latest, but in some cases even from 1967 (TITUS). For patents, the period of at least 10 years which is searchable on-line by means of these data bases constitutes an additional factor of certainty.

Table 3 shows a comprehensive list of the most important on-line data bases which contain patents. This list contains, in addition to the data bases already referred to in Tables 1 and 2, small special data bases, namely:

APTIC          Air Pollution Technical Information Center
CRDS           DERWENT Chemical Reactions Documentation Service
FOOD ADLIBRA   K & M Publications Inc., U.S.A.
RAPRA          Rubber and Plastics Research Association of Great Britain

Table 3. On-line patent files

[Flattened table giving, for each data base, the number of patents, the period covered and the availability on SDC and LIS. Data bases listed: APIPAT, APTIC, CA-SEARCH/CACON, CLAIMS, CRDS, FARMDOC/AGDOC, FOOD ADLIBRA, FSTA, INSPEC, PAPERCHEM, PLASDOC, RAPRA, TITUS, WAA, WORLD TEXTILES, WPI/CPI. Legible entries: CA-SEARCH/CACON, 600 000 patents, from 1967; CLAIMS, 500 000 patents, from 1950.]

c) On-line data bases for equivalence searches

Patent documentation requires an additional type of search which does not apply to other literature documentation, namely the equivalence search. Its aim is to find all patents belonging to a patent family. Table 4 lists the data bases suitable for on-line equivalence searches in the field of chemistry.

Table 4. Patent concordance on-line data bases

[The entries of Table 4 are not recoverable from the source.]

d) Subject areas covered by the various data bases

The major sectors of chemistry include:
- chemical substances; reactions
- biological subjects
- technical subjects
- engineering

Tables 5-8 indicate, for each major sector (e.g. biological subjects), the most important subject matter categories (e.g. organic compounds) of that sector, in correlation with the corresponding on-line data bases. In each case, the on-line system in which the relevant data bases can be searched is also indicated.

All 4 tables show, for the particular sector, the breadth of the spectrum of subject matter categories covered in the most important data bases. Table 5, concerning chemical substances, shows that, for example, organic compounds are contained both in the CA-SEARCH of Chemical Abstracts and in the CPI/WPI of DERWENT.

Table 5. Chemical substances
Table 6. Biological subjects
Table 7. Technical subjects
Table 8. Engineering

[Tables 5-8 are matrices relating subject matter categories to data bases, with the on-line system (SDC or LIS) marked for each data base; the individual entries are not recoverable from the source.]

Subject searches with examples

In the following, the most important search tools for subject searches in patents are presented, with examples.

a) Survey of search tools for subject searches

Table 9 gives a survey of the search tools which are important for subject searches in patents. It relates 8 types of search tools to the 9 most important data bases. In the following, the characteristic properties of the most efficient search tools are dealt with.

Table 9. Search tools

[Matrix relating the types of search tools to the data bases; rows legible in the source: CA-SEARCH, CLAIMS, CPI/WPI, FSTA, INSPEC, PAPERCHEM, TITUS, WORLD ALUMINIUM ABSTR. The individual entries are not recoverable.]

Multipunch codes

This search, which is only possible in the Derwent data bases and employs three-digit numbers, allows the use of a uniform search strategy. A chemical structure is broken down into characteristic fragments, which are represented by a series of three-digit multipunch position numbers, to be found in the comprehensive Chemical Coding Rules for CPI/WPI.

Registry Nos

If a defined chemical compound is concerned, its Registry No. is looked up in the CA molecular formula register. (Chemical Abstracts allots a Registry No. to each individual compound newly included in its system.) Virtually every defined individual compound, for example urea, can be searched unambiguously and without ballast in CA SEARCH by means of the Registry No. In CPI/WPI, on the other hand, 16-18 punch positions and 2 search strategies (1970-71 and 1972-79) are needed for the search for H2N-CONH2, without, however, giving a ballast-free result. Conversely, Markush formulae are mostly easier to search via the fragment code, since the plurality of Registry Nos. makes a search by the system just described extremely troublesome.

Patent classification

The International Patent Classification units (IPC units) of every patent can be searched both in CPI/WPI and CA SEARCH. On the other hand, the U.S. class of a patent is only recorded in CA SEARCH and in CLAIMS. CLAIMS furthermore contains all U.S. classes shown on a patent.

Index terms

Non-structural concepts, such as, for example, a known application, and properties, can be searched in most data bases by means of index terms. In the case of the following data bases, these terms are selected from a Thesaurus:
- CPI/WPI
- FSTA
- INSPEC
- PAPERCHEM
- TITUS
- WORLD TEXTILES

To permit searches for novel technical concepts not contained in the Thesaurus, most files additionally provide the possibility of a free term search.

b) Search examples

We will give 4 typical examples to show how structures (possibly combined with properties) and reactions can be searched.

Multipunch code

An on-line search in the WPI for chemical structures by means of the multipunch code is shown in Table 10. The sought Markush formula is characterised by 20 multipunch code positions.
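The on-line procedure of Table 10 declares "." as a synonym for link and ">" as a synonym for not. As a rough illustration of how such a statement selects records, the Python sketch below parses a statement into required and excluded items and tests it against a set of punch positions. It is not the SDC implementation: link is simplified here to a plain AND, the function names `parse_statement` and `matches` are hypothetical, the sample record is invented, and references to earlier search statements (such as the leading "1" in "1.058.059") are not resolved.

```python
from typing import FrozenSet, Set, Tuple


def parse_statement(statement: str) -> Tuple[FrozenSet[str], FrozenSet[str]]:
    """Split a statement such as '192.49-.>010' into (required, excluded).

    Assumption: '.' separates linked items and a leading '>' marks an item
    that must be absent, mirroring the synonyms declared in Table 10.
    """
    required, excluded = set(), set()
    for item in statement.split("."):
        item = item.strip()
        if not item:
            continue
        if item.startswith(">"):
            excluded.add(item[1:])
        else:
            required.add(item)
    return frozenset(required), frozenset(excluded)


def matches(record_positions: Set[str], statement: str) -> bool:
    """True if the record has every required item and no excluded item."""
    required, excluded = parse_statement(statement)
    return required <= record_positions and not (excluded & record_positions)


# Invented record carrying a few punch positions, for illustration only.
record = {"192", "49-", "525", "058"}
print(matches(record, "192.49-.525"))    # True
print(matches(record, "192.525.>058"))   # False
```

Keeping the parsing separate from the matching means the same parsed statement could be applied to many records without re-reading the string.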

Table 10. Multipunch code

Markush formula [drawing not recoverable]: substituents (HO)m, m = 1-2, and Brn, n = 1-3.

On-line procedure:

    file wpi; subs e3
    SS 1/C?
    USER: synonym . for link; synonym > for not
    SS 1/C?
    USER: 192.49-.525.59-.60-.62-.620.713
    SS 1 PSTG (285)
    SS 2/C?
    USER: 1.058.059
    SS 2 PSTG (182)
    SS 3/C?
    USER: 2.>01&.>01-.>010.>011.>012.>014.>015.>016.>017.>019
    SS 3 PSTG (49)
    SS 4/C?
    USER: prt, an, ti 49
    AN - 81398X/44
    TI - 5,7-Dibromo-8-hydroxyquinoline prepn - from 8-hydroxy-quinoline and bromine in aq hydrobromic acid
    DT2515476

Registry Nos

Table 11 shows a combined structure and properties search. In the SDC CA-SEARCH data base, the properties and/or use corresponding to each structure can be searched and printed out. A search under Registry No. 62468-15-9 produces all abstracts concerning α-(2-chlorophenyl)-[1H-benzimidazol-2-yl-methanol] monohydrochloride. The result can subsequently be reduced, by restriction to "antioxidants" and the data type "patent" (p/dt), to those patents which contain information on the antioxidant action of this compound.

Table 11. Chem. Abstr. Registry No.

1H-Benzimidazole-2-methanol, α-(2-chlorophenyl)-, monohydrochloride (62468-15-9)

On-line procedure:

    file cas77
    SS 1/C?
    USER: 62468-15-9/rn
    PROG: SS 1 PSTG (2)
    SS 2/C?
    USER: 1 and p/dt and all antioxidant:
    PROG: SS 2 PSTG (1)
    prt full
    AN - CA86-189934(25)
    TI - Antioxidant benzimidazole derivative
    IT - 62468-15-9: (prepn. and antioxidant activity of,

Reactions

Using the example of "catalysts for the Beckmann rearrangement", Table 12 shows how an aimed search for these catalysts can be carried out with text terms and the data type "patent" (p/dt).

Table 12. Beckmann rearrangement

On-line procedure:

    file cas77
    SS 1/C?
    USER: beckmann and all rearrangement:
    PROG: SS 1 PSTG (117)
    SS 2/C?
    USER: 1 and p/dt
    PROG: SS 2 PSTG (25)
    SS 3/C?
    USER: 2 and all cataly:
    PROG: SS 3 PSTG (9)
    prt full
    AN - CA90-104596(14)
    TI - .epsilon.-Caprolactam
    IT - Beckmann rearrangement catalyst: (boric anhydride and alumina, for cyclohexanone oxime)

IPC classification

In order to find the IPC units for unsubstituted quinoxalines, it is necessary to scrutinise the class of heterocyclic compounds in the 2nd edition of the International Patent Classification. C07D-241/40 covers "heterocyclic compounds containing 1,4-diazine rings condensed with carbocyclic rings, with only hydrogen or carbon atoms directly attached to the ring nitrogen atoms, which are benzopyrazines" and hence also unsubstituted quinoxalines, which can thus be searched for with this IPC unit, for example in the WPI data base.

Table 13. IPC classification

    C 07 D    HETEROCYCLIC COMPOUNDS
    241/00    Heterocyclic compounds containing 1,4-diazine or hydrogenated 1,4-diazine rings
    241/36    condensed with carbocyclic rings or ring systems
    241/38    . . with only hydrogen or carbon atoms directly attached to the ring nitrogen atoms
    241/40    Benzopyrazines
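An IPC unit such as C07D-241/40 is composed of a section letter, a two-digit class, a subclass letter, a main group and a subgroup. The Python sketch below splits the notation used in the on-line files into these parts. The regular expression and the names `IpcUnit`, `parse_ipc` and `same_main_group` are assumptions for illustration, and only spellings of the form C07D-241/40 or C 07 D 241/40 are accepted.

```python
import re
from typing import NamedTuple


class IpcUnit(NamedTuple):
    section: str      # e.g. "C"
    ipc_class: str    # e.g. "07"
    subclass: str     # e.g. "D"
    main_group: str   # e.g. "241"
    subgroup: str     # e.g. "40"


# Section A-H, two-digit class, subclass letter, optional blank or hyphen,
# main group, slash, subgroup (hypothetical pattern for this sketch).
_IPC = re.compile(
    r"^\s*([A-H])\s*(\d{2})\s*([A-Z])[\s-]*(\d{1,4})\s*/\s*(\d{2,6})\s*$",
    re.IGNORECASE,
)


def parse_ipc(symbol: str) -> IpcUnit:
    """Split an IPC unit into its parts, or raise ValueError if malformed."""
    match = _IPC.match(symbol)
    if match is None:
        raise ValueError(f"not an IPC unit: {symbol!r}")
    section, cls, subclass, group, subgroup = match.groups()
    return IpcUnit(section.upper(), cls, subclass.upper(), group, subgroup)


def same_main_group(a: str, b: str) -> bool:
    """True if both units share section, class, subclass and main group."""
    return parse_ipc(a)[:4] == parse_ipc(b)[:4]
```

With such a helper, the lower-case search term `c07d-241/40` and the printed form `C 07 D 241/40` resolve to the same unit, and units such as 241/36 and 241/40 can be recognised as belonging to the same main group.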

Quinoxaline

On-line procedure:

    file wpi
    SS 1/C?
    USER: c07d-241/40
    PROG: SS 1 PSTG (8)
    prt full
    AN - 51565A/28
    CC - MERCK & CO. INC (MERI)
    TI - 6-(1-Piperazinyl)quinoxaline - used as anorectic, antidepressant, analgesic and hypnotic agent
    US4091101
    PI - 23.05.78 15.06.77-US-806898
    A61K-31/49 C07D-241/40 C07D-403/04

In conclusion, it should be mentioned that bibliographical searches, which are important alongside subject matter searches, are beyond the scope of the present paper.

Acknowledgement

I would like to express my cordial thanks to Dr. S. Goldstein, Scientific Documentation Department of CIBA-GEIGY AG, who has greatly contributed to this publication through valuable suggestions.