Bioinformatics databases and data resources MCQs With Answer

Bioinformatics databases and data resources MCQs With Answer provide B.Pharm students a practical way to learn how biological and chemical information is stored, accessed and applied in drug discovery, pharmacology and genomics. Topics include sequence databases (GenBank, EMBL), protein resources (UniProt, PDB), cheminformatics hubs (PubChem, DrugBank, ChEMBL), functional annotation (GO, KEGG), and search tools (BLAST, FASTA). Understanding accession numbers, data formats (FASTA, PDB), metadata, curation and database cross-references helps pharmacists interpret literature, predict targets, and evaluate ADMET or pharmacogenomic data. This concise study set emphasizes real-world database usage, retrieval strategies and interpretation skills. Now let’s test your knowledge with 30 MCQs on this topic.

Q1. What is the primary purpose of a bioinformatics database?

To store, organize and enable retrieval of biological or chemical data
To perform laboratory experiments automatically
To manufacture pharmaceuticals at scale
To sequence DNA samples in the clinic

Correct Answer: To store, organize and enable retrieval of biological or chemical data

Q2. Which database is a primary sequence repository for nucleotide sequences maintained by NCBI?

UniProt
GenBank
PDB
DrugBank

Correct Answer: GenBank

Q3. UniProt is best described as a database for:

Small molecule drug structures
Protein sequence and functional information
Clinical trial records
Pathway visualization only

Correct Answer: Protein sequence and functional information

Q4. What does PDB (Protein Data Bank) primarily archive?

Gene expression microarray data
Protein three-dimensional structures determined experimentally
Pharmacokinetic models
Clinical case reports

Correct Answer: Protein three-dimensional structures determined experimentally

Q5. Which file format is commonly used for representing raw polymer sequences in databases?

PDB format
FASTA format
SMILES notation
CSV spreadsheet

Correct Answer: FASTA format

Q6. An accession number in a sequence database is used to:

Encrypt sequence data
Uniquely identify a record for retrieval and citation
Predict protein structure
Measure expression levels

Correct Answer: Uniquely identify a record for retrieval and citation

Q7. BLAST is a tool used for:

Visualizing 3D molecular interactions
Comparing an input sequence against a sequence database to find similarities
Simulating drug metabolism only
Creating synthetic molecules

Correct Answer: Comparing an input sequence against a sequence database to find similarities

Q8. Which database focuses on chemical structures and bioactivity of small molecules?

KEGG
PubChem
UniProt
RefSeq

Correct Answer: PubChem

Q9. DrugBank is most useful to a B.Pharm student for finding:

Protein 3D coordinates
Detailed drug chemical, pharmacological and target information
Raw genomic reads
Clinical imaging files

Correct Answer: Detailed drug chemical, pharmacological and target information

Q10. The Gene Ontology (GO) resource provides:

Tools for molecular dynamics simulations
A controlled vocabulary to describe gene and protein functions across species
Clinical trial outcome metrics
Sequences for small RNAs only

Correct Answer: A controlled vocabulary to describe gene and protein functions across species

Q11. Which resource integrates genomic, chemical and pathway information useful in drug discovery?

KEGG
PDB only
EMBL-EBI mirror without annotations
RefSeq

Correct Answer: KEGG

Q12. A low BLAST E-value indicates:

A less significant alignment
A more significant alignment with low probability of random match
Higher chance of sequencing error
No homology between sequences

Correct Answer: A more significant alignment with low probability of random match

Q13. RefSeq differs from primary sequence archives like GenBank because RefSeq:

Contains raw sequencing reads only
Provides curated, non-redundant reference sequences
Is focused on small molecules
Has no annotation

Correct Answer: Provides curated, non-redundant reference sequences

Q14. Which database would you consult for known adverse drug reaction information?

SIDER
UniProt
GenBank
EMBL

Correct Answer: SIDER

Q15. Cross-references in database entries are important because they:

Prevent any data updates
Connect related information across databases to provide context and validation
Encrypt the accession numbers
Only exist in proprietary databases

Correct Answer: Connect related information across databases to provide context and validation

Q16. Which of the following is a primary international collaboration that shares nucleotide sequence data alongside GenBank?

Swiss-Prot only
EMBL-ENA and DDBJ
DrugBank consortium
PubChem

Correct Answer: EMBL-ENA and DDBJ

Q17. Which identifier refers specifically to a protein entry in UniProt?

PDB ID
UniProt accession (e.g., P01234)
PubChem CID
GenBank GI number

Correct Answer: UniProt accession (e.g., P01234)

Q18. ChEMBL is a database primarily curated for:

Clinical imaging
Bioactive drug-like small molecules and their bioactivity data
Plant taxonomy
Protein folding pathways

Correct Answer: Bioactive drug-like small molecules and their bioactivity data

Q19. Which search strategy returns sequences similar to a query by using local alignment?

Global alignment (Needleman-Wunsch)
Local alignment (Smith-Waterman) or BLAST local aligner
Only structural superposition
Mass spectrometry matching

Correct Answer: Local alignment (Smith-Waterman) or BLAST local aligner

Q20. What type of information does the PDB file contain besides atom coordinates?

Only sequence reads
Metadata such as experimental method, resolution, chain IDs and ligand descriptions
Clinical trial data
SMILES strings for lipids only

Correct Answer: Metadata such as experimental method, resolution, chain IDs and ligand descriptions

Q21. Which database is specifically tailored for pharmacogenomics information linking genes to drug response?

PharmGKB
EMBL
RefSeq
Protein Data Bank

Correct Answer: PharmGKB

Q22. The SMILES notation is used to represent:

Protein secondary structure
Chemical structure of small molecules as a text string
Nucleotide quality scores
Three-dimensional coordinates

Correct Answer: Chemical structure of small molecules as a text string

Q23. Which term best describes databases that aggregate and standardize information from multiple primary sources?

Primary databases
Secondary or derived databases
Raw read repositories
Sequencing pipelines

Correct Answer: Secondary or derived databases

Q24. When retrieving data programmatically, which access method is commonly provided by major bioinformatics resources?

Hand-delivered USB drives only
APIs (RESTful services) and FTP/HTTP downloads
Telephone requests only
Encrypted snail mail

Correct Answer: APIs (RESTful services) and FTP/HTTP downloads

Q25. What does annotation in a sequence database typically include?

Only the raw nucleotide characters without any labels
Gene features, coding regions, functional notes and cross-references
Physical sample storage location only
Vendor pricing data

Correct Answer: Gene features, coding regions, functional notes and cross-references

Q26. Which of the following best explains a “non-redundant” database?

A database with no metadata
A database that removes duplicate or identical records to represent unique entries
A database limited to one species
A database that only stores chemical formulas

Correct Answer: A database that removes duplicate or identical records to represent unique entries

Q27. Which resource would you use to find metabolic pathways and enzyme functions related to a drug target?

KEGG
PubMed Central only
GenBank raw reads
SIDER only

Correct Answer: KEGG

Q28. In BLAST output, which metric helps judge the alignment quality normalized by length?

Sequence accession
Bit score
Publication year
File size

Correct Answer: Bit score

Q29. PubMed is most appropriate for retrieving:

Experimental 3D structures
Biomedical literature and abstracts related to drugs and biology
Raw sequencing reads in FASTQ
Chemical reaction mechanisms only

Correct Answer: Biomedical literature and abstracts related to drugs and biology

Q30. Why is data curation important in bioinformatics databases used in pharmacy?

It automatically sequences new genomes
It ensures accuracy, consistent annotation, error correction and reliable cross-references for research and clinical decisions
It increases the file size for storage costs
It prevents users from accessing data

Correct Answer: It ensures accuracy, consistent annotation, error correction and reliable cross-references for research and clinical decisions

Download

G S Sachin

I am a Registered Pharmacist under the Pharmacy Act, 1948, and the founder of PharmacyFreak.com. I hold a Bachelor of Pharmacy degree from Rungta College of Pharmaceutical Science and Research. With a strong academic foundation and practical knowledge, I am committed to providing accurate, easy-to-understand content to support pharmacy students and professionals. My aim is to make complex pharmaceutical concepts accessible and useful for real-world application.

Mail- Sachin@pharmacyfreak.com

Leave a Comment Cancel reply