CRF and database design, data entry and cleaning processes MCQs With Answer

Introduction

CRF and database design, data entry and cleaning processes MCQs With Answer is a focused quiz resource tailored for M.Pharm students studying Clinical Research (MPP 104T). This collection emphasizes practical and regulatory aspects of Case Report Form (CRF) design, electronic CRF (eCRF) systems, database architecture, validation, edit checks, query management, and systematic data cleaning. Each question probes deeper topics such as metadata, audit trail requirements, SDV, 21 CFR Part 11, CDISC standards, and database lock procedures. The goal is to reinforce conceptual understanding and application skills required for accurate, compliant clinical data capture and downstream analysis in pharmaceutical research.

Q1. What is the primary purpose of a Case Report Form (CRF) in a clinical trial?

To randomize subjects to treatment groups
To collect protocol-required data for each enrolled subject
To serve as the final statistical analysis dataset
To document marketing approvals

Correct Answer: To collect protocol-required data for each enrolled subject

Q2. Which principle is most important when designing a CRF to ensure high-quality data capture?

Include as many open-text fields as possible to allow flexibility
Design forms with logical clinical flow and minimize free-text entries
Separate related data across many small forms to reduce page length
Use complex branching that relies on manual skip logic only

Correct Answer: Design forms with logical clinical flow and minimize free-text entries

Q3. What is a major advantage of an electronic CRF (eCRF) compared with paper CRF?

Eliminates the need for any monitoring or source verification
Provides real-time edit checks and programmed data validation
Makes audit trails optional
Automatically produces regulatory submission datasets without configuration

Correct Answer: Provides real-time edit checks and programmed data validation

Q4. Which of the following best describes metadata in a clinical database context?

Patient consent forms stored as scanned documents
Variable name, field type, and permissible values used to describe each data item
Printed CRF pages filed in the investigator site binder
Final statistical outputs such as p-values and tables

Correct Answer: Variable name, field type, and permissible values used to describe each data item

Q5. What is an edit check in the context of a clinical trial database?

A written SOP describing how to collect laboratory data
A programmed validation that flags missing, inconsistent, or out-of-range entries
A paper CRF version number printed on the form
An audit finding reported during an inspection

Correct Answer: A programmed validation that flags missing, inconsistent, or out-of-range entries

Q6. Why is double data entry used in some data entry workflows?

To speed up database lock by entering data twice as fast
To detect and reduce transcription errors by reconciling two independent entries
To meet CDISC SDTM formatting requirements automatically
To avoid the need for source data verification

Correct Answer: To detect and reduce transcription errors by reconciling two independent entries

Q7. 21 CFR Part 11 primarily regulates which aspect of clinical trial systems?

Clinical trial protocol development
Electronic records and electronic signatures to ensure trustworthiness
Drug manufacturing facility inspections
Statistical analysis plan content

Correct Answer: Electronic records and electronic signatures to ensure trustworthiness

Q8. What information does an audit trail in an EDC system typically record?

Only the final value entered for each field
Who made the change, when it was made, and what the previous value was
The list of users with access to the system but not their actions
Only queries raised but not their resolutions

Correct Answer: Who made the change, when it was made, and what the previous value was

Q9. What does SDV (Source Data Verification) refer to?

Standardized data values in the submission datasets
Comparing CRF/eCRF entries against the original source documents to confirm accuracy
Software development validation for EDC systems
Statistical data validation performed after database lock

Correct Answer: Comparing CRF/eCRF entries against the original source documents to confirm accuracy

Q10. Which statement best defines database lock?

A temporary freeze while data entry continues without queries
The point at which the database is declared final and no further changes are allowed except documented releases
A nightly backup of the database
A report showing outstanding queries only

Correct Answer: The point at which the database is declared final and no further changes are allowed except documented releases

Q11. What is the primary purpose of query management in clinical data handling?

To generate analytical datasets for modeling
To raise, track, and resolve discrepancies between entered data and protocol/source documentation
To perform statistical hypothesis testing
To encrypt data for transmission

Correct Answer: To raise, track, and resolve discrepancies between entered data and protocol/source documentation

Q12. Which sequence best represents the iterative data cleaning process?

Generate SDTM → Lock database → Run edit checks
Run programmed edit checks → Raise queries → Resolve and document corrections → Re-run checks
Lock database → Perform source verification → Create audit trail
Collect CRFs → Publish results → Archive database

Correct Answer: Run programmed edit checks → Raise queries → Resolve and document corrections → Re-run checks

Q13. Why is CRF version control important?

It allows investigators to change protocol requirements without approval
It tracks changes to CRF content so data collected under different versions can be identified and reconciled
It replaces the need for a data management plan
It is only required for paper CRFs, not eCRFs

Correct Answer: It tracks changes to CRF content so data collected under different versions can be identified and reconciled

Q14. Which best differentiates a CRF from source documents?

CRF is the original laboratory instrument readout; source documents are copies
CRF is designed to capture protocol-required data in a structured way; source documents are the original clinical records and notes
CRF includes audit trails; source documents never change
CRF holds patient identifiers only while source documents contain study results

Correct Answer: CRF is designed to capture protocol-required data in a structured way; source documents are the original clinical records and notes

Q15. What type of check is a “range check” in data validation?

Checks that a field was completed by the site monitor
Validates that a numeric entry falls within predefined lower and upper bounds
Verifies that the user has appropriate role-based access
Ensures that database backups are performed daily

Correct Answer: Validates that a numeric entry falls within predefined lower and upper bounds

Q16. Which activities are typically included in EDC system validation for clinical trials?

Writing the clinical study report and regulatory submissions
Developing test scripts, executing test cases (including UAT), and documenting results and traceability
Training site staff only after database lock
Automatically converting paper CRFs to SDTM without testing

Correct Answer: Developing test scripts, executing test cases (including UAT), and documenting results and traceability

Q17. What is the primary reason for implementing role-based access controls in an EDC?

To allow all users to edit any subject’s data
To ensure segregation of duties and limit system functions according to user responsibilities
To comply with CDISC submission formatting
To automatically anonymize data for submission

Correct Answer: To ensure segregation of duties and limit system functions according to user responsibilities

Q18. Which action is a standard method for de-identifying clinical trial subject data before analysis?

Replacing direct identifiers with a subject code and removing unique identifiers where possible
Keeping full names and dates to preserve context
Publishing subject contact details with the dataset
Converting all numeric values into free-text fields

Correct Answer: Replacing direct identifiers with a subject code and removing unique identifiers where possible

Q19. How does CDISC SDTM differ from ADaM?

SDTM defines analysis-ready datasets, while ADaM defines raw source tables
SDTM provides standardized submission datasets representing collected data; ADaM provides analysis datasets designed to support statistical analyses
SDTM is used only for nonclinical studies, ADaM only for clinical studies
They are interchangeable terms for the same dataset structure

Correct Answer: SDTM provides standardized submission datasets representing collected data; ADaM provides analysis datasets designed to support statistical analyses

Q20. What is the purpose of define.xml in regulatory submissions?

It contains the audit trail for the EDC system
It documents metadata and structure of submitted datasets to allow reviewers to understand variable definitions and derivations
It stores scanned informed consent forms
It is used only for archiving paper CRFs

Correct Answer: It documents metadata and structure of submitted datasets to allow reviewers to understand variable definitions and derivations

Download

G S Sachin

I am a Registered Pharmacist under the Pharmacy Act, 1948, and the founder of PharmacyFreak.com. I hold a Bachelor of Pharmacy degree from Rungta College of Pharmaceutical Science and Research. With a strong academic foundation and practical knowledge, I am committed to providing accurate, easy-to-understand content to support pharmacy students and professionals. My aim is to make complex pharmaceutical concepts accessible and useful for real-world application.

Mail- Sachin@pharmacyfreak.com

Leave a Comment Cancel reply