Data collection involves systematic gathering of information from various sources using appropriate methods to ensure reliability and validity. Primary data is collected firsthand through surveys, questionnaires, interviews, or observations, offering fresh insights but requiring significant resources. Secondary data, obtained from pre-existing sources like government publications, research reports, or databases, provides cost-effective information but may present relevance or accuracy concerns. The data collection process requires careful planning, including defining research objectives, determining appropriate sampling techniques (random, stratified, cluster, or systematic), designing effective collection instruments, and addressing potential biases or errors that might compromise data quality.
Chapter 12: Collection of Data
Data collection is the process of gathering information on variables of interest in a systematic way. The quality of data significantly affects the reliability and validity of statistical analysis, making data collection a critical step in the statistical investigation process.
Types of Data:
- Primary vs. Secondary Data:
- Primary data: Collected directly by the investigator for specific purpose
- Secondary data: Collected by someone else and used by the investigator
- Quantitative vs. Qualitative Data:
- Quantitative data: Numerical information (height, income, temperature)
- Qualitative data: Non-numerical attributes (gender, color, occupation)
- Cross-sectional vs. Time Series Data:
- Cross-sectional: Data collected at a single point in time across different units
- Time series: Data collected for same unit(s) over different time periods
- Discrete vs. Continuous Data:
- Discrete: Can take only specific values (number of children, vote count)
- Continuous: Can take any value within a range (height, weight, temperature)
Sources of Secondary Data:
- Official Sources:
- Government ministries and departments
- Reserve Bank of India (RBI)
- National Sample Survey Office (NSSO)
- Central Statistical Office (CSO)
- Census of India
- Semi-Official Sources:
- State government publications
- District gazetteers
- Municipal records
- Reports of committees and commissions
- Non-Official Sources:
- Research institutions and universities
- International organizations (World Bank, IMF, UN)
- Trade associations and chambers of commerce
- Newspapers and periodicals
- Private research organizations
- Digital Sources:
- Online databases
- Data portals (data.gov.in)
- Websites of official and non-official organizations
- Digital archives
Methods of Collecting Primary Data:
- Direct Personal Investigation:
- Data collected by investigator directly from respondents
- Advantages: Accuracy, flexibility, high response rate
- Limitations: Time-consuming, expensive, limited coverage
- Indirect Oral Investigation:
- Information collected from witnesses or third parties
- Used when direct contact with subjects is not possible
- Concerns about reliability and bias
- Questionnaire Method:
- Set of questions sent to respondents to be filled
- Mailed questionnaire or online survey
- Advantages: Wide coverage, low cost, anonymity
- Limitations: Low response rate, incomplete responses
- Schedule Method:
- Questionnaire filled by enumerators through personal interviews
- Advantages: High response rate, clarity, completeness
- Limitations: Time-consuming, interviewer bias, expensive
- Observation Method:
- Direct observation of phenomena without questioning
- Participants may or may not be aware of observation
- Suitable for behavioral studies and physical measurements
Sample Survey Methods:
- Census vs. Sample Survey:
- Census: Complete enumeration of all units in population
- Sample: Study of representative portion of population
- Advantages of sampling: Cost-effective, time-saving, detailed study possible
- Sampling Techniques:
- Probability Sampling:
- Simple random sampling
- Stratified random sampling
- Systematic sampling
- Cluster sampling
- Multistage sampling
- Non-Probability Sampling:
- Convenience sampling
- Judgment sampling
- Quota sampling
- Snowball sampling
- Probability Sampling:
- Sampling and Non-Sampling Errors:
- Sampling errors: Due to studying only a part of population
- Non-sampling errors: Processing errors, response errors, non-response errors
Designing a Questionnaire/Schedule:
- General Principles:
- Keep it brief and simple
- Logical sequence of questions
- Clear and unambiguous wording
- Avoid leading or loaded questions
- Consider respondent’s ability to answer
- Types of Questions:
- Open-ended vs. closed-ended
- Dichotomous questions (yes/no)
- Multiple choice questions
- Rating scales (Likert scale)
- Ranking questions
- Pre-testing:
- Pilot study to identify problems with questionnaire
- Revision based on feedback
- Final version for full-scale survey
Evaluation of Secondary Data:
- Criteria for Evaluation:
- Suitability for the purpose of study
- Reliability of the source
- Adequacy and coverage
- Time period and relevance
- Units of measurement and comparability
- Methodology used in data collection
- Limitations of Secondary Data:
- May not exactly fit research requirements
- Possibility of outdated information
- Unknown biases in original data collection
- Lack of control over data quality
- Differences in definitions and classifications
Ethical Considerations in Data Collection:
- Informed consent from respondents
- Protecting privacy and confidentiality
- Voluntary participation
- Avoiding deception
- Minimizing harm to respondents
- Accurate reporting of methodology
Data collection is a critical foundation for statistical analysis in economics. The choice between primary and secondary data, as well as specific collection methods, depends on the nature of the research problem, available resources, time constraints, and desired accuracy. Regardless of the method used, ensuring data quality through proper design, execution, and documentation of the collection process is essential for valid and reliable statistical inference.
Complete Chapter-wise Hsslive Plus One Economics Notes
Our HSSLive Plus One Economics Notes cover all chapters with key focus areas to help you organize your study effectively:
Economics: Indian Economic Development
- Chapter 1 Indian Economy on the Eve of Independence
- Chapter 2 Indian Economy 1950-1990
- Chapter 3 Liberalisation, Privatisation and Globalisation -An Appraisal
- Chapter 4 Poverty
- Chapter 5 Human Capital Formation in India
- Chapter 6 Rural Development
- Chapter 7 Employment-Growth, Informalisation and Related Issues
- Chapter 8 Infrastructure
- Chapter 9 Environment Sustainable Development
- Chapter 10 Comparative Development Experience of India with its Neighbours
Economics: Statistics for Economics
- Chapter 11 Introduction
- Chapter 12 Collection of Data
- Chapter 13 Organisation of Data
- Chapter 14 Presentation of Data
- Chapter 15 Measures of Central Tendency
- Chapter 16 Measures of Dispersion
- Chapter 17 Correlation
- Chapter 18 Index Numbers
- Chapter 19 Uses of Statistical Methods