Scholarly article on CYS-GLY 19246-18-5 from Journal of Biological Chemistry p. 385,387

DOI: 10.1093/bioinformatics/18.3.488

Source and publish data:

Journal of Biological Chemistry p. 385,387 (1935)

Update date:2022-08-17

Topics:

Authors:

Loring Loring

du Vigneaud

Read Full Text PDF DownLoad Join now for total 90,000,000 free articles

Article abstract of DOI:10.1093/bioinformatics/18.3.488

Full text of DOI:10.1093/bioinformatics/18.3.488

Vol. 18 no. 3 2002

Pages 488–489

BIOINFORMATICS APPLICATIONS NOTE

ProbeMatchDB—a web database for ﬁnding

equivalent probes across microarray platforms

and species

Pinglang Wang, Fei Ding, Hsienyuan Chiang,

Robert C. Thompson, Stanley J. Watson and Fan Meng

Department of Psychiatry and Mental Health Research Institute, University of

Michigan Medical School, 205 Zina Pitcher Place, Ann Arbor, MI 48109, USA

Received on July 21, 2001; revised on October 25, 2001; accepted on October 29, 2001

ABSTRACT

public domain batch probe match database available.

Summary: ProbeMatchDB is a web-based database

designed to facilitate the search of EST/cDNA sequences

or STS markers that can be used to represent the same

gene across different microarray platforms and species.

It can be used for ﬁnding equivalent EST clones in the

Research Genetics sequence veriﬁed clone set based

on results from Affymetirx GeneChip s. It will also help

to identify probes representing orthologous genes across

human, mouse and rat on different microarray platforms.

Availability: The database is accessible at http:

In order to provide a user-friendly tool for ﬁnding

equivalent probes across different platforms and species,

we constructed the ProbeMatchDB by integrating NCBI

UniGene (Schuler, 1997), HomoloGene (Zhang et al.,

2000) and UniSTS (NCBI, 2001) databases as well as the

probe/clone information provided by Affymetrix (2000);

Research Genetics (2000) and Operon (2001).

ꢀ

Cross platform probe match

Currently, there are two major platforms for expres-

sion microarray, the oligonucleotide-based Affymetrix

//brainarray.mhri.med.umich.edu/MARRAY/BC ASP/

ꢀ

brainarray.htm by clicking the ‘Query ProbeMatchDB’ link.

Contact: mengf@umich.edu

GeneChip s and the EST-based spotted arrays. Although

both platforms use EST/cDNA sequences to represent

unique genes, sequences selected by Affymetrix for

ꢀ

Microarray-based expression assay provides an efﬁcient

way to determine the expression level of thousands or

even tens of thousands of genes in parallel. The fast

growth in microarray usage creates the need for comparing

microarray data across different array platforms and

different species. While there are many issues involved

in cross-platform and cross-species comparisons, the

foremost problem is to identify probes that represent

the same gene across different platforms and species.

Furthermore, due to the relatively high variability inherent

in microarray data, it is usually necessary to independently

conﬁrm the results with non-array methods such as

GeneChip s are usually different from those included

in the popular sequence-veriﬁed Research Genetics EST

ꢀ

clone sets used for spotted arrays. Since GeneChip s

are relatively easy to use and usually offer higher density

than spotted arrays, they are often used in ﬁrst round

microarray experiments. Interesting genes identiﬁed by

ꢀ

GeneChip s are then used for additional studies with

other methods. At this stage, it is usually advantageous to

ﬁnd EST clones that represent the differentially expressed

ꢀ

genes detected on GeneChip s, since sequence-veriﬁed

clone sets offer a convenient library for selecting speciﬁc

probes for many different applications, including the

making of custom arrays.

ꢀ

Northern blot, in situ hybridization, RT-PCR, Taqman ,

etc. Beyond such gene discovery studies, it is often

desirable to verify the function of the identiﬁed genes

through experiments in other model species, such as

creating gene knockouts in mice. All such follow up

studies require the identiﬁcation of equivalent gene probes

across platform or across species.

Such a probe matching problem is very simple concep-

ꢀ

tually. For example, GeneChip s utilize gene-speciﬁc

nucleotide sequences for their oligonucleotide probe

design. Although the exact sequence information for

those oligos are not available, the accession numbers used

for oligo design are readily available from Affymetrix

(2000), which can be used to identify the UniGene clus-

Since different platforms may use different sequences

to represent the same gene, ﬁnding equivalent probes

across platforms or species is usually very tedious and

time-consuming for a list of genes, since there is no

ꢀ

ters represented by oligonucleotides on GeneChip s by

searching the UniGene database for a particular species.

Since UniGene databases already contain detailed cluster

488

ꢀc Oxford University Press 2002

ProbeMatchDB

member information (i.e. EST/cDNA sequences included

in a cluster), one can then use a UniGene cluster ID to

ﬁnd all the accession number(s) belonging to that cluster.

Such a cluster-speciﬁc accession number list can then be

compared with the accession number list for the Research

Genetics sequence-veriﬁed clone set from the same

species. Whenever there is a unique accession number

match, it means there is an EST clone in the Research

Genetics clone set that represents the same cluster as the

corresponding oligonucleotide probes on a GeneChip .

Although this process is conceptually simple, it is

very tedious to implement, particularly for a large probe

list. Our approach is to establish an integrated Oracle

database system called ProbeMatchDB that provide a

one-step solution for this problem. ProbeMatchDB stores

the Affymetrix probe accession number list, the clone

information provided by Research Genetics as well as

the periodically updated UniGene clustering information

generated by NCBI. We also built a web-based interface

that allows batch accession number queries, which is

essential for microarray experiments.

cluster ID, LocusLink Locus ID, gene name and/or key-

word in queries. Nonetheless, although accession numbers

representing sequences that are used for gene similarity

calculations by HomoloGene may be used in searches, the

HomoloGene database itself does not contain sequence

cluster member information. Consequently, the accession

numbers for EST or cDNA sequences that are not included

in gene homology calculations cannot be used to query

the HomoloGene database. This is a serious problem for

microarray applications since most of the sequences used

in different microarray platforms, particularly EST clone

and STS sequences, are not used in the gene similarity

calculations by the HomoloGene database.

To solve this problem, the HomoloGene data set is

internally linked to UniGene databases for human, mouse

and rat in ProbeMatchDB. As a result, any accession

number included in UniGene databases can be used for

cross-species searches. Furthermore, since probe infor-

mation from various platforms is already integrated in

our database, the ProbeMatchDB allows the cross-species

searches for every possible combination of platforms,

such as Affymetrix-human versus EST-rat or EST-rat

versus EST mouse, etc.

ꢀ

Most recently, we incorporated identity information for

the 70mer oligonucleotide probe set generated by Operon

for spotted microarrays (Operon, 2001) as well as the

UniSTS database. The ability to use Clone ID to search

for probes is also added. As a result, ProbeMatchDB

enables cross platform searches for equivalent probes

In summary, ProbeMatchDB provides a one-step solu-

tion for cross-species and cross-platform probe matching.

It should be helpful for the design and validation of mi-

croarray experiments. The interface for ProbeMatchDB is

intuitive although our website also provides more detailed

information about sample input/output screens as well as

dataﬂow in ProbeMatchDB.

ꢀ

among Affymetrix GeneChip s, EST/cDNA arrays, STS

arrays, and Operon oligonucleotide arrays using accession

number, clone ID or STS names.

Cross species probe match

ACKNOWLEDGEMENTS

There are many situations where results obtained in

one species need to be veriﬁed or studied in greater

detail in another species. For example, interesting genes

identiﬁed by microarray experiments in human disease

tissue samples are usually only the starting point of a

comprehensive study. These genes are commonly studied

in rat or mouse disease models, as rat and mouse are

usually more amenable to a variety of physiological,

pharmacological or genetic manipulations. Frequently, the

expression levels of these genes require monitoring by

equivalent rat or mouse nucleic acid probes. Similarly,

there are also situations where candidate genes revealed

by rat or mouse models need to be investigated in human.

These experiments demand the ability to readily identify

equivalent probes across different species as well as across

different array platforms.

We want to thank Dr Huda Akil for her critical comments

on this manuscript. This work was supported by the

University of Michigan Microarray Network funding,

the Nancy Pritzker Depression Research Network and

NIMH program project grant L99 MH60398 to S.J.W. The

Department of Psychiatry pilot study grant to F.M. and

the National Institute on Drug Abuse R21 DA13754-01

to F.M.

REFERENCES

Affymetrix (2000) The GeneChip expression analysis sequence

information database. http://www.affymetrix.com.

NCBI (2001) UniSTS. http://www.ncbi.nlm.nih.gov/genome/sts/.

Operon (2001) Human genome oligo set. http://www.operon.com/

arrays/arraysets.php.

Research Genetics (2000) Research genetics sequence veriﬁed EST

clones. http://www.resgen.com/include/menus/cdna menu.php3.

Schuler,G.D. (1997) Pieces of the puzzle: expressed sequence tags

and the catalog of human genes. J. Mol. Med., 75, 694–698.

Zhang,Z., Schwartz,S., Wagner,L. and Miller,W. (2000) A greedy

algorithm for aligning DNA sequences. J. Comput. Biol., 7, 203–

214.

In order to implement the cross species probe search

function, we incorporated data from the HomoloGene

database into ProbeMatchDB. The HomoloGene database

is generated by NCBI for cross-referencing similar genes

across several species. It is a very useful database for ﬁnd-

ing homologous genes in several species using UniGene

489

Products guided by the article

Product name:CYS-GLY

Cas No:19246-18-5

R&D Labs maybe for 19246-18-5

Shanghai Mio Chemical Co., Ltd

Contact:0086 21- 64401188-622

Address:16 Floor NO.2 Jiefang Building, No. 4855 Dushi Road, 201100 Shanghai, P.R.China
Rely Chemicals Ltd.

Contact:+86-518-81061113

Address:No. 8 Lingzhou Road, Lianyungang, Jiangsu, China
Jiangsu Zhenfang Chemical CO.,LTD.(Suzhou Zhenfang Chemical Factory)

Contact:+86-512-69598882

Address:Room1201, Jiayuan Road No.1018, Xiangcheng District, Suzhou, China
shanghai payne pharm co.,ltd.

Contact:+86-021-58123769

Address:No.780 of Cailun Road,Zhangjiang Hi-tech Park,Pudong,Shanghai
Shanghai chemrole co.,ltd

Contact:021-50278900

Address:No.6,Room 201 ,Lane 299,bisheng road ,shanghai ,china

Relevant to this article

Determination of saikosaponin derivatives in Radix bupleuri and in pharmaceuticals of the Chinese multiherb remedy Xiaochaihu-tang using liquid chromatographic tandem mass spectrometry

Doi:10.1021/ac0499423
(2004)
Doi:10.1021/ja01311a036
(1935)
Efficient catalytic enantioselective Mannich-type reactions using a zirconium-bis(binaphthol)methane complex

Doi:10.1016/S0040-4039(99)00138-0
(1999)
Bioinspired design for the assembly of Glypromate neuropeptide conjugates with active pharmaceutical ingredients

Doi:10.1039/d0nj04851h
(2020)
Applications of Consecutive Radical Addition-Elimination Reactions in Synthesis

Doi:10.1039/c39850000682
(1985)
One-Pot Synthesis of N-(α-Peroxy)Indole/Carbazole via Chemoselective Three-Component Condensation Reaction in Open Atmosphere

Doi:10.1021/acs.orglett.5b02881
(2015)

Article Doi

DOI: 10.1093/bioinformatics/18.3.488

Source and publish data:

Authors:

Article abstract of DOI:10.1093/bioinformatics/18.3.488

Full text of DOI:10.1093/bioinformatics/18.3.488

Products guided by the article

R&D Labs maybe for 19246-18-5

Relevant to this article

Determination of saikosaponin derivatives in Radix bupleuri and in pharmaceuticals of the Chinese multiherb remedy Xiaochaihu-tang using liquid chromatographic tandem mass spectrometry

Efficient catalytic enantioselective Mannich-type reactions using a zirconium-bis(binaphthol)methane complex

Bioinspired design for the assembly of Glypromate neuropeptide conjugates with active pharmaceutical ingredients

Applications of Consecutive Radical Addition-Elimination Reactions in Synthesis

One-Pot Synthesis of N-(α-Peroxy)Indole/Carbazole via Chemoselective Three-Component Condensation Reaction in Open Atmosphere