Gini Index Calculation for Salary Distribution

Problem Context and Distribution Data

  • The provided text presents a statistical problem regarding the distribution of salaries in a company and asks for the calculation of the Gini Index (IGI_G).
  • The company's salary structure is divided into four intervals with associated frequencies representing the number of employees in each bracket.
  • Salary Distribution Table:
    • Interval 1: [1000,2000)[1000, 2000), Frequency (n1n_1): 3030
    • Interval 2: [2000,3000)[2000, 3000), Frequency (n2n_2): 4545
    • Interval 3: [3000,4000)[3000, 4000), Frequency (n3n_3): 2020
    • Interval 4: [4000,5000)[4000, 5000), Frequency (n4n_4): 55

Theoretical Foundations of the Gini Index

  • Definition: The Gini Index is a measure of statistical dispersion intended to represent the income or wealth inequality within a nation or any group of people. In this context, it measures the inequality of salary distribution among the company's employees.
  • Range: The index ranges from 00 to 11.
    • A value of 00 represents "Perfect Equality" (everyone earns the same salary).
    • A value of 11 (or 100%100\%) represents "Perfect Inequality" (one person earns everything, and others earn nothing).
  • Geometry: It is based on the Lorenz Curve, which plots the cumulative proportion of total income on the y-axis against the cumulative proportion of the population on the x-axis. The Gini Index is the ratio of the area between the line of perfect equality and the Lorenz curve to the total area under the line of perfect equality.
  • Calculation Formula: For discrete data organized into intervals, the Gini Index can be calculated using the formula:   IG=i=1k1(piqi)i=1k1piI_G = \frac{\sum_{i=1}^{k-1} (p_i - q_i)}{\sum_{i=1}^{k-1} p_i}   Where:
    • pip_i: Cumulative relative frequency of the population (percentage of employees).
    • qiq_i: Cumulative relative frequency of the total salary (percentage of total wealth).
    • kk: Number of intervals.

Step-by-Step Calculation Procedure

Step 1: Determination of Midpoints (xix_i)

Since the data is presented in intervals, we must use the midpoint of each class as the representative salary value for the calculation.

  • x1=1000+20002=1500x_1 = \frac{1000 + 2000}{2} = 1500
  • x2=2000+30002=2500x_2 = \frac{2000 + 3000}{2} = 2500
  • x3=3000+40002=3500x_3 = \frac{3000 + 4000}{2} = 3500
  • x4=4000+50002=4500x_4 = \frac{4000 + 5000}{2} = 4500
Step 2: Population Analysis (nin_i, NiN_i, and pip_i)
  • Total Population (NN):   N=30+45+20+5=100N = 30 + 45 + 20 + 5 = 100
  • Cumulative Frequencies (NiN_i):
    • N1=30N_1 = 30
    • N2=30+45=75N_2 = 30 + 45 = 75
    • N3=75+20=95N_3 = 75 + 20 = 95
    • N4=95+5=100N_4 = 95 + 5 = 100
  • Cumulative Relative Frequencies (pi=NiNp_i = \frac{N_i}{N}):
    • p1=30100=0.30p_1 = \frac{30}{100} = 0.30
    • p2=75100=0.75p_2 = \frac{75}{100} = 0.75
    • p3=95100=0.95p_3 = \frac{95}{100} = 0.95
    • p4=100100=1.00p_4 = \frac{100}{100} = 1.00
Step 3: Salary/Wealth Analysis (uiu_i, UiU_i, and qiq_i)
  • Salary per Category (ui=xi×niu_i = x_i \times n_i):
    • u1=1500×30=45000u_1 = 1500 \times 30 = 45000
    • u2=2500×45=112500u_2 = 2500 \times 45 = 112500
    • u3=3500×20=70000u_3 = 3500 \times 20 = 70000
    • u4=4500×5=22500u_4 = 4500 \times 5 = 22500
  • Total Payroll (UU):   U=45000+112500+70000+22500=250000U = 45000 + 112500 + 70000 + 22500 = 250000
  • Cumulative Salaries (UiU_i):
    • U1=45000U_1 = 45000
    • U2=45000+112500=157500U_2 = 45000 + 112500 = 157500
    • U3=157500+70000=227500U_3 = 157500 + 70000 = 227500
    • U4=227500+22500=250000U_4 = 227500 + 22500 = 250000
  • Cumulative Relative Salary (qi=UiUq_i = \frac{U_i}{U}):
    • q1=45000250000=0.18q_1 = \frac{45000}{250000} = 0.18
    • q2=157500250000=0.63q_2 = \frac{157500}{250000} = 0.63
    • q3=227500250000=0.91q_3 = \frac{227500}{250000} = 0.91
    • q4=250000250000=1.00q_4 = \frac{250000}{250000} = 1.00

Summarized Calculation Table

Intervalnin_ixix_ixi×nix_i \times n_ipip_iqiq_ipiqip_i - q_i
[1000-2000)301500450000.300.180.12
[2000-3000)4525001125000.750.630.12
[3000-4000)203500700000.950.910.04
[4000-5000)54500225001.001.00-
Total100-2500002.00 (Sum of p1p_1 to p3p_3)-0.28 (Sum of DiffDiff)

Final Evaluation and Solution

  • To find the value of the Gini Index, we sum the differences between pip_i and qiq_i for all intervals excerpt the last one, and divide by the sum of the pip_i values for those same intervals.

  • Numerator Calculation:   i=13(piqi)=(0.300.18)+(0.750.63)+(0.950.91)\sum_{i=1}^{3} (p_i - q_i) = (0.30 - 0.18) + (0.75 - 0.63) + (0.95 - 0.91)i=13(piqi)=0.12+0.12+0.04=0.28\sum_{i=1}^{3} (p_i - q_i) = 0.12 + 0.12 + 0.04 = 0.28

  • Denominator Calculation:   i=13pi=0.30+0.75+0.95=2.00\sum_{i=1}^{3} p_i = 0.30 + 0.75 + 0.95 = 2.00

  • Final Result:   IG=0.282.00I_G = \frac{0.28}{2.00}IG=0.140I_G = 0.140

  • Conclusion: The computed value of the Gini Index is 0.1400.140. Comparing this result with the provided multiple-choice options:

    • a. 0,280
    • b. 0,140
    • c. 0,093
    • d. Ninguna de las respuestas anteriores es correcta
  • The correct option is b. 0,140.