Introduction: This quiz collection on Sequence Alignment Fundamentals is tailored for M.Pharm students to reinforce essential bioinformatics concepts applied in drug discovery and molecular analysis. It covers pairwise and multiple sequence alignment principles, dynamic programming algorithms (Needleman–Wunsch and Smith–Waterman), scoring systems including substitution matrices (PAM/BLOSUM), gap penalties, and practical tools such as BLAST, PSI-BLAST, and profile HMMs. Each question emphasizes conceptual understanding and practical interpretation so you can evaluate homology, functional conservation, and evolutionary relationships relevant to pharmacology, target identification, and comparative genomics. Use these MCQs to test and strengthen your analytical skills in sequence comparison and interpretation.
Q1. What is the primary purpose of sequence alignment in bioinformatics?
- To arrange sequences randomly for data storage
- To determine the three-dimensional structure directly from sequence
- To identify regions of similarity that may indicate functional, structural, or evolutionary relationships
- To convert nucleotide sequences into protein sequences without translation
Correct Answer: To identify regions of similarity that may indicate functional, structural, or evolutionary relationships
Q2. How does global alignment differ from local alignment?
- Global alignment ignores end gaps while local alignment penalizes them heavily
- Global aligns entire sequences end-to-end; local finds the highest-scoring subsequences
- Local alignment always uses a substitution matrix but global alignment does not
- Local alignment is used only for nucleotide sequences and global for proteins
Correct Answer: Global aligns entire sequences end-to-end; local finds the highest-scoring subsequences
Q3. Which algorithm is the classic method for computing optimal global pairwise alignments?
- Smith–Waterman algorithm
- Needleman–Wunsch algorithm
- BLAST heuristic
- Clustal progressive alignment
Correct Answer: Needleman–Wunsch algorithm
Q4. Which algorithm is specifically designed for optimal local alignment between two sequences?
- Needleman–Wunsch algorithm
- Progressive alignment
- Smith–Waterman algorithm
- Hidden Markov Model (HMM)
Correct Answer: Smith–Waterman algorithm
Q5. What is the affine gap penalty formula commonly used in alignment scoring?
- Gap cost = gap length × substitution matrix score
- Gap cost = gap opening penalty + (gap extension penalty × gap length)
- Gap cost = constant penalty per gap regardless of length
- Gap cost = gap opening penalty × gap extension penalty
Correct Answer: Gap cost = gap opening penalty + (gap extension penalty × gap length)
Q6. How do PAM and BLOSUM substitution matrices fundamentally differ?
- PAM is derived from conserved blocks of unrelated proteins; BLOSUM uses global alignments
- PAM is used only for nucleotides while BLOSUM is for proteins
- PAM is derived from global alignments of closely related sequences; BLOSUM is derived from conserved blocks of more diverse sequences
- There is no difference; they are identical in derivation and use
Correct Answer: PAM is derived from global alignments of closely related sequences; BLOSUM is derived from conserved blocks of more diverse sequences
Q7. What does a higher BLOSUM number (e.g., BLOSUM80) indicate about its intended use?
- It is optimized for aligning very divergent sequences
- It is optimized for aligning closely related sequences
- It should only be used for nucleotide alignments
- It ignores conservative substitutions entirely
Correct Answer: It is optimized for aligning closely related sequences
Q8. What does the E-value reported by BLAST represent?
- The exact number of homologs present in the database
- The expect value estimating how many hits one can expect to find by chance; lower is more significant
- The evolutionary distance in PAM units
- The number of mismatches in the best alignment
Correct Answer: The expect value estimating how many hits one can expect to find by chance; lower is more significant
Q9. What is the purpose of the bit score in BLAST output?
- To provide a raw unnormalized score dependent on database size
- To give a normalized score representing alignment quality independent of database size
- To count the number of gaps in the alignment
- To measure sequence complexity
Correct Answer: To give a normalized score representing alignment quality independent of database size
Q10. How does BLAST speed up database searches compared to exact dynamic programming?
- By performing global alignment instead of local alignment
- By using short exact ‘word’ matches as seeds and extending them heuristically
- By converting sequences to binary and using bitwise operations for exact alignment
- By aligning only the first 100 residues of each sequence
Correct Answer: By using short exact ‘word’ matches as seeds and extending them heuristically
Q11. What is the time and space complexity of Needleman–Wunsch dynamic programming for two sequences of lengths m and n?
- O(m + n) time and O(1) space
- O(mn) time and O(mn) space
- O(m^2 + n^2) time and O(m + n) space
- Exponential time and linear space
Correct Answer: O(mn) time and O(mn) space
Q12. How does sequence identity differ from sequence similarity?
- Identity counts identical residues; similarity includes conservative substitutions weighted by a scoring matrix
- Similarity counts identical residues; identity counts conservative substitutions
- They are interchangeable terms with no difference
- Identity measures structural similarity while similarity measures sequence length
Correct Answer: Identity counts identical residues; similarity includes conservative substitutions weighted by a scoring matrix
Q13. What is the purpose of backtracking in dynamic programming alignment algorithms?
- To fill the scoring matrix using substitution values
- To compute the E-value of the alignment
- To trace and reconstruct the optimal alignment path after filling the scoring matrix
- To convert protein alignments to nucleotide alignments
Correct Answer: To trace and reconstruct the optimal alignment path after filling the scoring matrix
Q14. What is a Position-Specific Scoring Matrix (PSSM) used for in PSI-BLAST?
- A matrix that lists only gap penalties for a sequence
- A matrix with position-specific substitution scores derived from a multiple sequence alignment used to detect distant homologs
- A simple count of nucleotide frequencies across a genome
- An alternative name for the BLOSUM matrix
Correct Answer: A matrix with position-specific substitution scores derived from a multiple sequence alignment used to detect distant homologs
Q15. What is the main principle behind progressive multiple sequence alignment (e.g., Clustal)?
- Align all sequences simultaneously using exhaustive dynamic programming
- Build the multiple alignment by aligning the most similar sequences first according to a guide tree
- Use random alignments and select the best by scoring
- Only align pairwise and report the best pair
Correct Answer: Build the multiple alignment by aligning the most similar sequences first according to a guide tree
Q16. Which multiple sequence alignment approach uses consistency information from pairwise alignments to improve accuracy?
- Progressive alignment using UPGMA only
- T-Coffee
- Needleman–Wunsch pairwise alignment
- Basic BLAST search
Correct Answer: T-Coffee
Q17. What is the rationale for performing codon-aware alignment when comparing coding DNA sequences?
- To maximize the number of gaps in the alignment
- To preserve reading frames by aligning nucleotides in triplets, often guided by translated protein alignments
- To ignore synonymous substitutions entirely
- To replace nucleotides with amino acids before alignment
Correct Answer: To preserve reading frames by aligning nucleotides in triplets, often guided by translated protein alignments
Q18. Where are gaps (indels) biologically more likely to occur in protein sequences?
- In highly conserved catalytic residues
- Randomly and uniformly across all residues
- In less conserved regions such as surface loops and inter-domain linkers
- Only at the N-terminus of proteins
Correct Answer: In less conserved regions such as surface loops and inter-domain linkers
Q19. What is a commonly used E-value cutoff in BLAST to consider hits as potentially biologically significant?
- E-value < 1 is always considered significant for homology
- E-value < 1e-5 is commonly used cutoff to consider hits significant
- E-value > 10 indicates high significance
- No cutoff is ever used; all hits are treated equally
Correct Answer: E-value < 1e-5 is commonly used cutoff to consider hits significant
Q20. What advantage do profile Hidden Markov Models (HMMs) offer for sequence alignment and database searching?
- They treat all alignment positions as equally conserved and ignore gaps
- They model position-specific residue frequencies and indel probabilities, providing sensitive detection of distant homologs
- They are equivalent to simple pairwise BLAST searches with no additional sensitivity
- They can only model nucleotide sequences, not proteins
Correct Answer: They model position-specific residue frequencies and indel probabilities, providing sensitive detection of distant homologs

I am a Registered Pharmacist under the Pharmacy Act, 1948, and the founder of PharmacyFreak.com. I hold a Bachelor of Pharmacy degree from Rungta College of Pharmaceutical Science and Research. With a strong academic foundation and practical knowledge, I am committed to providing accurate, easy-to-understand content to support pharmacy students and professionals. My aim is to make complex pharmaceutical concepts accessible and useful for real-world application.
Mail- Sachin@pharmacyfreak.com

