Organisation of Data
Key Concepts
- Classification of Data: Arranging raw data into groups for easier analysis.
- Raw Data: Unclassified and disorganized data that is cumbersome to handle.
- Frequency Distribution: A table that shows how different values of a variable are distributed across various classes.
Types of Variables
- Continuous Variables: Can take any numerical value (e.g., height, weight).
- Discrete Variables: Can only take specific values (e.g., number of students).
Classification Methods
- Chronological Classification: Data classified by time (e.g., years, months).
- Spatial Classification: Data classified by geographical locations (e.g., countries, states).
Frequency Distribution Table Example
| Marks Range | Frequency |
|---|---|
| 0-10 | 1 |
| 10-20 | 8 |
| 20-30 | 6 |
| 30-40 | 7 |
| 40-50 | 21 |
| 50-60 | 23 |
| 60-70 | 19 |
| 70-80 | 6 |
| 80-90 | 5 |
| 90-100 | 4 |
| Total | 100 |
Important Definitions
- Class Limits: The lowest and highest values in a class.
- Class Mark: The midpoint of a class, calculated as (Upper Class Limit + Lower Class Limit) / 2.
- Relative Frequency: Frequency expressed as a proportion of the total frequency.
Loss of Information
- Classification simplifies data but can lead to loss of individual observation details.
Conclusion
- Proper classification of data is essential for effective statistical analysis.