Análisis de muestreo por conglomerados

Tú y un grupo de psicólogos estáis interesados en analizar la salud mental de empleados. Vuestra investigación incluye una encuesta que busca medir las actitudes hacia la salud mental en el entorno tecnológico y examinar la frecuencia de trastornos de salud mental entre trabajadores del sector tech.

El conjunto de datos, mh_survey, incluye el gender de la persona encuestada, el estado de EE. UU. en el que vive, US_state_live, y si ha buscado tratamiento para su salud mental a través de su empresa, sought_treatment.

Vas a crear un gráfico de sectores para analizar la probabilidad de que una persona del sector tech en EE. UU. busque tratamiento relacionado con su salud mental, sought_treatment. Se ha cargado para ti una lista aleatoria de 10 conglomerados de estados, random_cluster.

Pandas y numpy se han importado como pd y np.

Este ejercicio forma parte del curso

Análisis de datos de encuestas en Python

Instrucciones del ejercicio

Haz un subconjunto del conjunto de datos para incluir solo los estados en random_clusters.
Crea un gráfico de sectores de la columna sought_treatment.

ejercicio interactivo práctico

Prueba este ejercicio completando este código de ejemplo.

# Subset dataset to inlude only states in cluster_sample
cluster_sample = ____[mh_survey.US_state_live.____(____)]

# Create a pie chart of the sought_treament column
treatment_pie = cluster_sample.____.____(normalize=True)
treatment_pie.____.____()
plt.show()

Editar y ejecutar código

Este ejercicio forma parte del curso

Análisis de datos de encuestas en Python

IntermedioNivel de habilidad

4.7+

Empieza el curso gratis

What is survey data, and how do we determine which statistical test to use to analyze the data? To answer this, you’ll be able to define all sorts of survey data types, encounter important concepts like descriptive and inferential statistics, and visualize survey data to determine the appropriate statistical modeling technique needed. In doing so, you will know how to best qualitatively and quantitatively define the trends and insights you come across in surveys.

Exercise 1: Introducing Survey Data Analysis Exercise 2: Looking at levels of measurements Exercise 3: Crosstabulation Exercise 4: Descriptive and Inferential Statistics Exercise 5: Descriptive statistics Exercise 6: Inferential statistics Exercise 7: Statistical Modeling Techniques Exercise 8: Scatter plot inspection Exercise 9: Choose a statistical method Exercise 10: Sampling technique match

In this chapter, you’ll learn the different ways of creating sample survey data out of population survey data by analyzing the parameters by which the survey data was taken.

Exercise 1: Muestreo aleatorio Exercise 2: Muestra aleatoria de empleados Exercise 3: Muestreo aleatorio reproducible Exercise 4: Muestreo aleatorio estratificado Exercise 5: Distribución de «sí» y «no»Exercise 6: Muestreo estratificado Exercise 7: Muestreo ponderado Exercise 8: Encuesta del blog Exercise 9: Muestreo ponderado sobre lateralidad Exercise 10: Muestreo por conglomerados Exercise 11: Agrupar por conglomerados Exercise 12: Elegir conglomerados Exercise 13: Análisis de muestreo por conglomerados

Ejercicio actual

Now it’s time to understand the difference between descriptive and inferential statistics concerning survey data analysis with some real-life examples. Through hands-on exercises, you’ll further interpret the meaning of different variables, key measures such as central tendency and zscore, and interpret results for actionable steps.

Exercise 1: Descriptive statistics in survey analysis Exercise 2: Frequency distribution Exercise 3: Measures of variability Exercise 4: Measures of central tendency Exercise 5: Inferential statistics in survey analysis Exercise 6: Visualize data: histogram Exercise 7: Find the z-score Exercise 8: Correlations Exercise 9: Analyze variables with .corr()Exercise 10: Are employees happy?Exercise 11: Fair and square

Last but not least, it’s time to apply statistical modeling to survey data analysis with regression analysis, the two-sample t-test, chi-square test, and interpret the assumptions associated with these tests.

Exercise 1: Regression analysis Exercise 2: Fitting a linear regression model Exercise 3: Visualizing survey data Exercise 4: Safety precautions needed?Exercise 5: Two sample t-test Exercise 6: Are women more extroverted?Exercise 7: Two sample t-test on extraversion Exercise 8: Chi-square test Exercise 9: To chi-square or not to chi-square?Exercise 10: Mental health in tech survey Exercise 11: Mental health vs. remote work Exercise 12: Congratulations