Bioinformatics databases and data resources MCQs With Answer

Bioinformatics databases and data resources MCQs With Answer provide B.Pharm students a practical way to learn how biological and chemical information is stored, accessed and applied in drug discovery, pharmacology and genomics. Topics include sequence databases (GenBank, EMBL), protein resources (UniProt, PDB), cheminformatics hubs (PubChem, DrugBank, ChEMBL), functional annotation (GO, KEGG), and search tools (BLAST, FASTA). Understanding accession numbers, data formats (FASTA, PDB), metadata, curation and database cross-references helps pharmacists interpret literature, predict targets, and evaluate ADMET or pharmacogenomic data. This concise study set emphasizes real-world database usage, retrieval strategies and interpretation skills. Now let’s test your knowledge with 30 MCQs on this topic.

Q1. What is the primary purpose of a bioinformatics database?

  • To store, organize and enable retrieval of biological or chemical data
  • To perform laboratory experiments automatically
  • To manufacture pharmaceuticals at scale
  • To sequence DNA samples in the clinic

Correct Answer: To store, organize and enable retrieval of biological or chemical data

Q2. Which database is a primary sequence repository for nucleotide sequences maintained by NCBI?

  • UniProt
  • GenBank
  • PDB
  • DrugBank

Correct Answer: GenBank

Q3. UniProt is best described as a database for:

  • Small molecule drug structures
  • Protein sequence and functional information
  • Clinical trial records
  • Pathway visualization only

Correct Answer: Protein sequence and functional information

Q4. What does PDB (Protein Data Bank) primarily archive?

  • Gene expression microarray data
  • Protein three-dimensional structures determined experimentally
  • Pharmacokinetic models
  • Clinical case reports

Correct Answer: Protein three-dimensional structures determined experimentally

Q5. Which file format is commonly used for representing raw polymer sequences in databases?

  • PDB format
  • FASTA format
  • SMILES notation
  • CSV spreadsheet

Correct Answer: FASTA format

Q6. An accession number in a sequence database is used to:

  • Encrypt sequence data
  • Uniquely identify a record for retrieval and citation
  • Predict protein structure
  • Measure expression levels

Correct Answer: Uniquely identify a record for retrieval and citation

Q7. BLAST is a tool used for:

  • Visualizing 3D molecular interactions
  • Comparing an input sequence against a sequence database to find similarities
  • Simulating drug metabolism only
  • Creating synthetic molecules

Correct Answer: Comparing an input sequence against a sequence database to find similarities

Q8. Which database focuses on chemical structures and bioactivity of small molecules?

  • KEGG
  • PubChem
  • UniProt
  • RefSeq

Correct Answer: PubChem

Q9. DrugBank is most useful to a B.Pharm student for finding:

  • Protein 3D coordinates
  • Detailed drug chemical, pharmacological and target information
  • Raw genomic reads
  • Clinical imaging files

Correct Answer: Detailed drug chemical, pharmacological and target information

Q10. The Gene Ontology (GO) resource provides:

  • Tools for molecular dynamics simulations
  • A controlled vocabulary to describe gene and protein functions across species
  • Clinical trial outcome metrics
  • Sequences for small RNAs only

Correct Answer: A controlled vocabulary to describe gene and protein functions across species

Q11. Which resource integrates genomic, chemical and pathway information useful in drug discovery?

  • KEGG
  • PDB only
  • EMBL-EBI mirror without annotations
  • RefSeq

Correct Answer: KEGG

Q12. A low BLAST E-value indicates:

  • A less significant alignment
  • A more significant alignment with low probability of random match
  • Higher chance of sequencing error
  • No homology between sequences

Correct Answer: A more significant alignment with low probability of random match

Q13. RefSeq differs from primary sequence archives like GenBank because RefSeq:

  • Contains raw sequencing reads only
  • Provides curated, non-redundant reference sequences
  • Is focused on small molecules
  • Has no annotation

Correct Answer: Provides curated, non-redundant reference sequences

Q14. Which database would you consult for known adverse drug reaction information?

  • SIDER
  • UniProt
  • GenBank
  • EMBL

Correct Answer: SIDER

Q15. Cross-references in database entries are important because they:

  • Prevent any data updates
  • Connect related information across databases to provide context and validation
  • Encrypt the accession numbers
  • Only exist in proprietary databases

Correct Answer: Connect related information across databases to provide context and validation

Q16. Which of the following is a primary international collaboration that shares nucleotide sequence data alongside GenBank?

  • Swiss-Prot only
  • EMBL-ENA and DDBJ
  • DrugBank consortium
  • PubChem

Correct Answer: EMBL-ENA and DDBJ

Q17. Which identifier refers specifically to a protein entry in UniProt?

  • PDB ID
  • UniProt accession (e.g., P01234)
  • PubChem CID
  • GenBank GI number

Correct Answer: UniProt accession (e.g., P01234)

Q18. ChEMBL is a database primarily curated for:

  • Clinical imaging
  • Bioactive drug-like small molecules and their bioactivity data
  • Plant taxonomy
  • Protein folding pathways

Correct Answer: Bioactive drug-like small molecules and their bioactivity data

Q19. Which search strategy returns sequences similar to a query by using local alignment?

  • Global alignment (Needleman-Wunsch)
  • Local alignment (Smith-Waterman) or BLAST local aligner
  • Only structural superposition
  • Mass spectrometry matching

Correct Answer: Local alignment (Smith-Waterman) or BLAST local aligner

Q20. What type of information does the PDB file contain besides atom coordinates?

  • Only sequence reads
  • Metadata such as experimental method, resolution, chain IDs and ligand descriptions
  • Clinical trial data
  • SMILES strings for lipids only

Correct Answer: Metadata such as experimental method, resolution, chain IDs and ligand descriptions

Q21. Which database is specifically tailored for pharmacogenomics information linking genes to drug response?

  • PharmGKB
  • EMBL
  • RefSeq
  • Protein Data Bank

Correct Answer: PharmGKB

Q22. The SMILES notation is used to represent:

  • Protein secondary structure
  • Chemical structure of small molecules as a text string
  • Nucleotide quality scores
  • Three-dimensional coordinates

Correct Answer: Chemical structure of small molecules as a text string

Q23. Which term best describes databases that aggregate and standardize information from multiple primary sources?

  • Primary databases
  • Secondary or derived databases
  • Raw read repositories
  • Sequencing pipelines

Correct Answer: Secondary or derived databases

Q24. When retrieving data programmatically, which access method is commonly provided by major bioinformatics resources?

  • Hand-delivered USB drives only
  • APIs (RESTful services) and FTP/HTTP downloads
  • Telephone requests only
  • Encrypted snail mail

Correct Answer: APIs (RESTful services) and FTP/HTTP downloads

Q25. What does annotation in a sequence database typically include?

  • Only the raw nucleotide characters without any labels
  • Gene features, coding regions, functional notes and cross-references
  • Physical sample storage location only
  • Vendor pricing data

Correct Answer: Gene features, coding regions, functional notes and cross-references

Q26. Which of the following best explains a “non-redundant” database?

  • A database with no metadata
  • A database that removes duplicate or identical records to represent unique entries
  • A database limited to one species
  • A database that only stores chemical formulas

Correct Answer: A database that removes duplicate or identical records to represent unique entries

Q27. Which resource would you use to find metabolic pathways and enzyme functions related to a drug target?

  • KEGG
  • PubMed Central only
  • GenBank raw reads
  • SIDER only

Correct Answer: KEGG

Q28. In BLAST output, which metric helps judge the alignment quality normalized by length?

  • Sequence accession
  • Bit score
  • Publication year
  • File size

Correct Answer: Bit score

Q29. PubMed is most appropriate for retrieving:

  • Experimental 3D structures
  • Biomedical literature and abstracts related to drugs and biology
  • Raw sequencing reads in FASTQ
  • Chemical reaction mechanisms only

Correct Answer: Biomedical literature and abstracts related to drugs and biology

Q30. Why is data curation important in bioinformatics databases used in pharmacy?

  • It automatically sequences new genomes
  • It ensures accuracy, consistent annotation, error correction and reliable cross-references for research and clinical decisions
  • It increases the file size for storage costs
  • It prevents users from accessing data

Correct Answer: It ensures accuracy, consistent annotation, error correction and reliable cross-references for research and clinical decisions

Leave a Comment